Next Article in Journal
Historical Geomagnetic Declination in Mainland Spain Derived from Topographic Fieldwork Records (1871–1940)
Previous Article in Journal
Elemental Geochemical Analysis for the Gold–Antimony Segregation in the Gutaishan Deposit: Insights from Stibnite and Pyrite
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Computational Framework for Reproducible Generation of Synthetic Grain-Size Distributions for Granular and Geoscientific Applications

by
Seweryn Lipiński
Faculty of Technical Sciences, University of Warmia and Mazury in Olsztyn, 10-036 Olsztyn, Poland
Geosciences 2025, 15(12), 464; https://doi.org/10.3390/geosciences15120464
Submission received: 23 October 2025 / Revised: 21 November 2025 / Accepted: 2 December 2025 / Published: 4 December 2025
(This article belongs to the Section Geomechanics)

Abstract

Particle size distribution (PSD), also referred to as grain-size distribution (GSD), is a fundamental characteristic of granular materials, influencing packing density, porosity, permeability, and mechanical behavior across soils, sediments, and industrial powders. Accurate and reproducible representation of PSD is essential for computational modeling, digital twin development (i.e., virtual replicas of physical systems), and machine learning applications in geosciences and engineering. Despite the widespread use of classical distributions (log-normal, Weibull, Gamma), there remains a lack of systematic frameworks for generating synthetic datasets with controlled statistical properties and reproducibility. This paper introduces a unified computational framework for generating virtual PSDs/GSDs with predefined statistical characteristics and a specified number of grain-size fractions. The approach integrates parametric modeling with two histogram-based allocation strategies: the equal-width method, maintaining uniform bin spacing, and the equal-probability method, distributing grains according to quantiles of the target distribution. Both methods ensure statistical representativeness, reproducibility, and scalability across material classes. The framework is demonstrated on representative cases of soils (Weibull), sedimentary and industrial materials (Gamma), and food powders (log-normal), showing its generality and adaptability. The generated datasets can support sensitivity analyses, experimental validation, and integration with discrete element modeling, computational fluid dynamics, or geostatistical simulations.

Graphical Abstract

1. Introduction

Granular materials are ubiquitous in both natural and engineered systems, encompassing soils, sands, sediments, industrial powders, and bulk solids [1,2,3]. A fundamental aspect of their characterization is the particle size distribution (PSD), also referred to as the grain-size distribution (GSD), which describes the statistical spread of particle diameters within a granular bed [4,5,6,7]. Whether naturally occurring or artificially produced, PSD plays a decisive role in determining the structural, physical, and mechanical behavior of granular systems, as it influences key properties such as packing density, porosity, permeability, mechanical stability, and flowability, making it critical for applications in geotechnics, civil engineering, agriculture, pharmaceuticals, and environmental science [8,9,10,11,12,13,14]. In materials science and geosciences alike, PSD informs the interpretation of sediment transport, soil formation, and diagenetic processes, as well as the design of composites, catalysts, and functional materials, where particle morphology directly affects macroscopic performance [6,13,15,16,17].
The complexity of PSD analysis arises from the diversity of granular materials and the variability in research objectives. Depending on the context, different statistical models may be more appropriate, and the choice of model can significantly impact the interpretation of experimental data and the accuracy of numerical simulations [18,19,20,21]. Classical distributions such as log-normal, Weibull, Rosin–Rammler, and gamma are commonly employed in the analysis of both natural sediments and engineered powders [22,23,24].
The growing reliance on computational modeling and digital twin concepts in geosciences and materials engineering has amplified the need for robust and reproducible input data, including synthetic PSDs [25,26,27,28]. With the rise in simulation-based approaches to soil mechanics, sediment transport, and multiphase flow, generating statistically controlled granular systems has become essential. Despite the availability of PSD models, there is no integrated framework that combines statistical flexibility with reproducibility and scalability across different granular systems. This gap limits the comparability of simulation results and hinders the development of standardized benchmarking protocols.
Beyond empirical measurements, the ability to generate synthetic PSD datasets with predefined statistical characteristics has become increasingly important. Virtual distributions enable controlled simulations, sensitivity analyses, and validation of experimental setups without extensive physical sampling [28,29,30,31]. They are particularly valuable when material availability is limited, experiments are costly, or reproducibility across laboratories is required. Moreover, synthetic datasets allow systematic exploration of hypothetical or extreme scenarios, facilitating investigations into how variations in PSD affect macroscopic behaviors such as compaction, permeability, or rheological response [32,33,34]. However, despite these advantages, existing approaches lack a unified, reproducible framework for generating synthetic PSDs across diverse granular systems, i.e., a gap this study aims to fill.
To address these limitations, this study introduces a unified framework for systematic generation of virtual PSDs with predefined statistical properties and a specified number of particle size fractions. The framework integrates parametric modeling constrained to strictly positive domains, reflecting the physical reality that particle diameters cannot be negative, with two complementary histogram binning procedures. The equal-width approach ensures uniform bin spacing across the particle size range, while the equal-probability approach allocates particles according to quantiles of the target distribution. Together, these methods balance resolution control with statistical representativeness, ensuring reproducibility and scalability across different granular systems. This directly addresses the identified gap by providing a standardized, scalable methodology for virtual PSD generation.
It should be emphasized that the proposed method does not introduce a new probabilistic distribution. Rather, it generates a synthetic grain-size distribution by numerically reproducing the analytical form of one selected model from the predefined set of twelve classical GSDs.
The applicability of the framework is demonstrated through case studies using literature data for soils (Weibull distributions) [35,36], industrial granular materials (Gamma distributions) [37], and food powders (log-normal distributions) [38]. These examples highlight how the proposed methodology differs from earlier model-specific or case-dependent approaches by offering a generalizable and integrative perspective. By combining theoretical insights with practical computational tools, the framework bridges the gap between empirical data collection and virtual material simulation. It provides reproducible and application-ready datasets for discrete element modeling (DEM), computational fluid dynamics (CFD), and geostatistical modeling, paving the way for standardized protocols, virtual replicas of physical systems (i.e., so-called “digital twins”), and open science initiatives in granular and geoscientific research.

2. Review of Mathematical Models for Particle Size Distribution in Granular Systems

The mathematical description of particle size distribution has been the subject of intensive research due to its fundamental role in governing the structural and functional properties of granular materials. Various models have been proposed, ranging from purely empirical equations to probability-based statistical distributions [18,19,20,22,39,40,41]. Each model offers distinct advantages in terms of interpretability, computational efficiency, and ability to reproduce experimental data across a wide range of particle sizes. The choice of an appropriate model is therefore not only a methodological consideration but also a determinant of the predictive reliability of granular system simulations.
Among the most applied approaches are empirical models, such as the Rosin–Rammler and Gaudin–Schuhmann distributions, historically used in mining, powder technology, and soil mechanics. These models provide relatively simple closed-form expressions that capture the general shape of PSDs but lack flexibility when applied to highly heterogeneous or multimodal samples [24,42,43,44].
In contrast, analytical and statistical models derived from probability theory offer greater adaptability. Distributions such as normal, log-normal, Weibull, gamma, and beta are widely used in materials science and geotechnical engineering because they are mathematically well-defined for non-negative values and can accommodate skewness and kurtosis typical of granular data [11,37,38,45]. For example, the log-normal distribution is particularly suitable for naturally fragmented or weathered materials, while the Weibull distribution provides a useful framework for systems governed by fracture and breakage processes.
The selection of a PSD model is context dependent. Factors such as the origin of the material, the measurement technique (e.g., sieve analysis, laser diffraction, or image-based methods), and the intended application strongly influence model suitability. Geotechnical analyses often favor simple parametric models for rapid classification, whereas pharmaceutical applications may require high-resolution statistical modeling to ensure reproducibility and regulatory compliance.
Table 1 summarizes the most frequently applied models for describing PSDs in granular systems. It emphasizes practical aspects, including typical applications, advantages, and limitations, providing guidance for model selection depending on material type, research objectives, and data availability.
In addition to widely applied log-normal, Rosin–Rammler, or Weibull functions, a number of less common empirical and process-based distributions have been used in specialized domains. These include the Fredlund function in geotechnical engineering [46,47], the Gompertz and Johnson families for asymmetric or skewed PSDs [48,49,50,51], and process-derived functions such as the Lifshitz–Slyozov–Wagner (LSW) model for Ostwald ripening or the Nukiyama–Tanasawa law in spray atomization research [52,53,54,55]. While these models often provide superior fit within their original domains, their broader applicability is constrained by mathematical complexity or domain-specific assumptions. Nevertheless, they highlight the diversity of approaches to PSD modeling, bridging purely statistical fitting with physically grounded process descriptions.

3. Mathematical Foundations of Particle Size Distribution Generation

This section outlines the mathematical foundations underlying the generation of virtual particle size distributions. A key requirement for constructing synthetic granular systems is the availability of a well-defined cumulative distribution function (CDF), preferably in closed analytical form. The CDF directly relates particle size to its probability of occurrence, enabling reproducible sampling of particle diameters and controlled generation of synthetic granular systems. Table 2 summarizes the availability and form of CDFs across commonly applied PSD models, distinguishing between those with closed-form solutions, empirically defined functions, and models requiring numerical integration.
Based on the availability of clearly defined, closed-form cumulative distribution functions, only distributions suitable for reproducible numerical generation were selected for further consideration. The selected models, presented in Table 3, include the Normal, Log-normal, Weibull (with Rosin–Rammler as a special case), Gaudin–Schuhmann, Gamma, Beta, Gompertz, and Johnson families (SU, SB, SL, SN). In the following sections, these distributions provide the mathematical foundation for constructing virtual particle size distributions.

4. Procedures for Generating Virtual Particle Size Distributions

Generating a particle size distribution (PSD) involves creating a histogram that represents the number of particles within defined diameter ranges. Each bin corresponds to a particle size fraction, and the count in each bin reflects the number of particles within that range. For virtual granular systems used in simulations, larger numbers of bins and total particles are typically employed to ensure statistical robustness and high resolution. In contrast, physically assembled granular beds are constrained by material availability and labor, limiting both the range of particle diameters and the total number of particles.
Two procedures are proposed:
  • The equal-width method, which ensures uniform bin spacing across the size range;
  • The equal-probability method, which partitions the distribution into bins of equal probability mass based on the target CDF.
Each method has its own advantages and limitations, making them suitable for different applications [56,57].

4.1. Equal-Width Bin Method

Objective—to create a histogram where all bins have the same width in terms of particle size (i.e., mm), and the bar heights represent the probability density function (PDF).
Step 1: Define the number of histogram bins and range
Choose the number of bins N (this value determines the histogram resolution and can be chosen according to the application or desired granularity) and determine the particle size range d m i n , d m a x , then compute the bin width:
d = d m a x d m i n N
Step 2: Compute bin boundaries
x i = d m i n + i · d ,   i = 0 , . . . , N .
Step 3: Generate samples using inverse transform sampling
Generate M uniform random values u j ~ U ( 0,1 ) . To avoid numerical instabilities (i.e., division by zero or undefined quantile values) at the boundaries (i.e., u = 0 or u = 1 ), uniform values should be restricted to a safe range such as e.g., u j 0.0001 ,   0.9999 , then map them to particle sizes using the inverse CDF:
The expected number of samples n i in each bin x i 1 , x i is calculated as:
x j = F 1 u j ,
where F 1 is the quantile function of the target distribution.
Step 4: Assign samples to bins and compute PDF
Count the number of samples n i in each bin, and compute the PDF for each bin:
P D F = n i M · d .
The resulting histogram approximates the probability density function of the target distribution.

4.2. Equal-Probability Bin Method

Objective—to create a histogram where each bin represents the same probability mass, resulting in variable bin widths.
Step 1: Define the number of histogram bins
Let N be the desired number of bins in the histogram.
Step 2: Determine the bins boundaries using the inverse CDF
To partition the probability space uniformly, the bin boundaries x 0 , x 1 , . . . , x N are computed using the inverse CDF (quantile function):
x i = F 1 i N ,   i = 0 , . . . , N .
This ensures each bin represents an equal probability mass of 1 N , resulting in variable bin widths depending on the distribution shape.
Step 3: Generate samples using inverse transform sampling
Same as in the equal-width method, i.e., using Equation (3).
Step 4: Assign samples to bins and compute PDF
Given a total number of samples M , the expected number of samples n i in each bin x i 1 , x i is calculated as:
P D F = n i M · ( x i x i 1 ) .
This directly relates the probability mass within each bin to the expected sample count. The resulting histogram approximates the probability density function of the target distribution.

4.3. Comparison of Methods

The proposed procedures provide a robust and reproducible framework for generating virtual PSDs from any known cumulative distribution function.
In practice, the equal-width method is recommended when comparing PSDs across multiple samples or aligning with experimental sieve data, as it provides a consistent scale and facilitates direct interpretation of particle size ranges.
The equal-probability method, on the other hand, is particularly useful for visualizing quantiles and ensuring that all bins contain data, which is advantageous for analyzing distribution tails or highly skewed particle size distributions.
A detailed comparison of both methods is presented in Table 4.
The choice of binning procedure has practical implications for subsequent modeling and simulation workflows. Equal-width binning provides direct comparability with laboratory sieve analysis, making it suitable for validation studies, calibration of empirical models, and routine PSD characterization. In contrast, equal-probability binning better captures the statistical extremes of a distribution, which may dominate processes such as filtration, transport, or percolation in porous media. It is also particularly useful in machine learning pipelines, where balanced representation of quantiles can improve feature extraction and reduce bias toward dominant particle sizes. In numerical simulations, such as DEM-based modeling of granular flows, the choice of binning strategy directly affects input dataset resolution and the accuracy of predicted macroscopic behavior. Thus, the selection of a binning method should be aligned with the intended application, whether it emphasizes physical comparability with experimental data or statistical representativeness of distribution tails [56,57].
Figure 1 illustrates the comparison of equal-width and equal-probability binning applied to an exemplary synthetic log-normal particle size distribution ( μ = 0, σ = 0.5). The example was generated with N = 15 bins and M = 100,000 samples.
The equal-width method results in a histogram with constant bin size across the entire diameter range, which allows straightforward interpretation and direct comparability with sieve-based laboratory data. However, this approach produces sparse bins in the tails of the log-normal distribution, underrepresenting rare but potentially important particle fractions. The equal-probability method, in contrast, partitions the probability space uniformly, producing variable bin widths that ensure balanced representation of both the central peak and the distribution tails. This highlights quantile structure and rare events, though the variable bar heights may be less intuitive. The superimposed theoretical log-normal probability density function demonstrates that both approaches adequately approximate the target distribution, albeit with different emphases.

5. Implementation and Case Studies of Synthetic Particle Size Distributions Generation

To demonstrate the practical applicability of the methodology described in Section 3, several case studies were conducted. Model parameters were sourced from literature for various materials, including soils and powders. These parameters were then used to generate virtual particle size distributions (PSDs) with a predefined number of bins and a fixed total number of particles.

5.1. Computational Environment

The framework was implemented in Python 3.11, utilizing NumPy 2.1.3 for numerical computations and matplotlib 3.10.0 for visualization, all within the Spyder 6.0.7 environment. All statistical analyses were performed using the same setup.

5.2. Generation of Virtual Particle Size Distributions for Soil Analysis Using the Weibull Distribution

The first case study focuses on the Weibull distribution, a widely used model for soils and other granular materials. Model parameters, i.e., scale ( λ ) and shape ( k ), were obtained from a literature dataset comprising 27 natural soils from Austria and Nepal, covering cobbles (Co), gravel (Gr), sand (Sa), silt (Si), and clay (Cl) [35,36]. This diversity ensures representation of granulometries ranging from coarse gravels to fine silts and clays.
Virtual PSDs were generated using 50 equal-width bins on a logarithmic scale, with 100,000 particles per distribution. Figure 2 shows the resulting probability density functions (PDFs) while Figure 3 shows the corresponding cumulative distribution functions, with markers indicating d10, d50, and d90 percentiles.
The variability in curve shapes shown in both figures reflects the influence of soil texture and provenance: some distributions are coarse-dominated (gravels and sands), others fine-dominated (silts and clays).
Grouping by dominant fraction (coarse, medium, fine) illustrates systematic differences in size accumulation patterns. Fine-dominated soils exhibit steep CDF slopes at small diameters, indicating rapid accumulation of mass in the fine fraction. Medium-textured soils show gradual transitions, reflecting balanced contributions from sand and silt, while coarse-dominated soils display flatter initial slopes and delayed inflection points, consistent with gravel and cobble prevalence.
Hierarchical clustering, visualized in the dendrogram (Figure 4), reveals three primary clusters among the 27 soil PSDs. The clustering was performed using Ward’s linkage and Euclidean distance based on d10, d50, and d90. Branch heights indicate increasing dissimilarity, with the highest merges separating coarse-, medium-, and fine-textured groups.
The analysis of the dendrogram reveals a clear division of soil samples into three main groups, each corresponding to dominant grain size fractions.
The first group (left branch) consists of fine-grained soils with small particle sizes and a relatively narrow distribution. This group includes twelve samples that exhibit short branch lengths, indicating a high degree of internal similarity.
The second group (middle branch) represents medium-textured soils, suggesting a balanced granulometric composition. These samples differ significantly from the fine-grained group, reflecting distinct differences in grain structure.
The third group (right branch) comprises coarse-grained soils with larger particle sizes. This is the most heterogeneous group, further subdivided into smaller clusters that reflect the variability among the analyzed samples.
This classification demonstrates that Weibull parameters capture essential granulometric characteristics and, when combined with clustering, can effectively group soils by similarity, supporting predictive modeling and classification tasks.
The heatmap of percentiles and grain size spread (Figure 5) provides an additional perspective. The color gradient, ranging from light yellow (low values) to dark blue (high values), facilitates rapid visual assessment. Coarse soils exhibit the highest d90 and spread values, reflecting large particles and wide ranges. Fine soils display low percentiles and minimal spread, while medium soils occupy an intermediate position.
These findings confirm that Weibull parameters can be linked to soil texture, enabling the generation of synthetic PSDs that realistically represent natural variability. Representative parameter ranges for different soil types are summarized in Table 5.
Overall, this case study demonstrates that Weibull parameters systematically capture soil texture variability. Synthetic PSDs generated from these parameters reproduced characteristic percentiles (d10, d50, d90), enabling classification, clustering, and predictive analyses. This confirms the framework’s applicability to geotechnical and hydrological contexts, where percentiles govern permeability, compressibility, and shear strength.

5.3. Generation of Virtual Particle Size Distributions for Granular Materials Using the Gamma Distribution

The second case study demonstrates the framework’s versatility for industrial and organic granular materials. The Gamma distribution, commonly used for positively skewed particle sizes, was applied to Tin, Gelatin, Iron, and Sodium Chloride. Parameters of the distributions, i.e., α (shape) and β (scale) were obtained from literature [37].
Virtual PSDs were generated using 100 logarithmically spaced bins across a particle size range of 1–1000 mm, with 100,000 particles per material. Figure 6 shows the resulting PDFs, each reflecting the influence of α and β : Tin and Sodium Chloride exhibited moderately skewed, narrow distributions, whereas Gelatin and Iron had broader, right-skewed distributions.
The corresponding CDFs are shown in Figure 7. They highlight cumulative size patterns, with higher α materials showing steeper slopes (more uniform particle sizes) and lower α materials exhibiting gradual accumulation (greater variability).
Key statistics, i.e., mean particle size, standard deviation, and skewness, are presented in Table 6. Gelatin and Iron displayed higher skewness, reflecting extended tails toward larger particles, whereas Tin and Sodium Chloride were more symmetric.
The differences in mean and variance reflect the inherent size characteristics of the materials, while skewness captures the asymmetry of particle size distribution. Gelatin and Iron show higher skewness, indicating a longer tail toward larger particle sizes, whereas Tin and Sodium Chloride distributions are more symmetric.
This case study illustrates several important aspects:
  • Controlled reproduction of material-specific PSDs—adjusting α and β allows precise tailoring of particle size distributions without physical measurements;
  • Quantitative characterization: PDF/CDF visualization combined with statistical descriptors ensures reproducible and complete description of particle systems;
  • Efficiency and scalability—large datasets (100,000 particles) were generated rapidly, enabling simulations, probabilistic analyses, and machine learning applications;
  • Universality—the framework is applicable beyond soils, covering industrial, chemical, and organic materials;
  • Interpretation of distribution shape—high α yields narrow, uniform distributions, low α produces broad, skewed distributions, demonstrating flexibility in modeling particle heterogeneity.
Overall, the Gamma-based case study confirms that the framework enables statistically representative, reproducible, and scalable generation of virtual granular datasets, supporting simulation and analysis of diverse engineering and industrial applications.

5.4. Generation of Virtual Particle Size Distributions for Food Powders Using the Log-Normal Distribution

To illustrate the applicability of the framework to fine granular materials in the food industry, a third case study was conducted using log-normal distributions. Food powders such as sugar, corn meal, and instant milk were selected based on available literature data [38].
Virtual PSDs were generated with 100 logarithmically spaced bins over a particle size range of 10–10,000 μm, and 100,000 particles per material. Figure 8 shows the PDFs: sugar and corn meal have narrow, nearly symmetric distributions, while instant milk exhibits a broader, right-skewed distribution due to higher σ .
Corresponding CDFs (Figure 9) reflect cumulative particle size patterns, with sugar and corn meal showing steep slopes (homogeneous particles) and instant milk accumulating more gradually (higher variability).
Understanding these distributions is important for optimizing mixing, transport, and dissolution processes. For instance, narrow distributions ensure consistent flow and packing, while broad distributions require careful handling to prevent segregation and maintain product uniformity.
From a practical standpoint, these results can directly inform process design and quality control in the food industry. For powders like sugar and corn meal with narrow, uniform distributions, standard handling and mixing procedures are sufficient to maintain consistent product quality. In contrast, powders like instant milk, with broad and skewed distributions, require tailored strategies, such as controlled feeding, careful blending, or sieving, to prevent segregation and ensure homogeneity in final products. The ability to rapidly generate virtual PSDs allows to simulate different scenarios, optimize equipment settings, and predict potential challenges in handling and processing granular materials.
To further illustrate the framework’s capabilities, heterogeneous mixtures of sugar and instant milk were simulated at seven ratios: 100:0, 85:15, 70:30, 50:50, 30:70, 15:85, and 0:100. Each mixture contained 100,000 particles sampled according to the specified proportions.
Figure 10 shows the resulting PDFs. Increasing the fraction of instant milk progressively broadened and skewed the distributions. The 100:0 mixture (pure sugar) remained narrow and symmetric, while the 0:100 mixture (pure instant milk) was broad and highly skewed. Intermediate mixtures reflected gradual transitions in distribution shape.
This simulation demonstrates that mixture composition strongly influences PSD shape, variability, and cumulative patterns, which in turn affect flowability, mixing efficiency, and dissolution behavior. By systematically generating mixtures, the framework enables predictive simulations for process design and quality control.
Overall, this case study confirms that the methodology accommodates fine powders with different skewness and spread and can simulate realistic heterogeneous mixtures. Importantly, the framework can generate mixtures of multiple components, effectively producing multimodal particle size distributions that reflect complex material granulometries. The approach is fully reproducible, scalable, and allows rapid generation of virtual PSDs, reinforcing the generality and practical applicability of the framework across diverse granular systems, from single-component powders to heterogeneous, multimodal mixtures.

5.5. Computational Performance of Histogram Generation Methods

Benchmarking the two histogram-generation procedures revealed clear differences in execution time. For five repeated runs using a log-normal distribution (15 bins, sample sizes from M = 103 to 106, the equal-width method showed consistently low and nearly constant execution times, typically between 5 × 10−5 and 5 × 10−4 s, indicating negligible scaling effect.
In contrast, the equal-probability method was systematically slower, with computation times increasing from approximately 3 × 10−4 s at M = 103 to 5 × 10−2–8×10−2 s at M = 106. The resulting performance ratio ranged from 1.2 to 36, with typical values between 10 and 20 for M ≥ 105.
Additional benchmarks with 100 bins confirmed this trend. The equal-width method remained consistently faster, while speed ratios increased from about 6–7 for small sample sizes (M = 103–104) to approximately 15–20 for large samples (M ≥ 106). These results indicate that the relative performance advantage of the equal-width method is robust across different histogram resolutions.
In summary, both methods produce statistically consistent PSDs, but the equal-width approach is markedly more efficient and therefore preferable for large-scale or iterative simulation workflows. The equal-probability method, while computationally more demanding, may be reserved for applications where bin adaptivity and quantile-based resolution outweigh performance considerations.

5.6. Discussion of Case Studies, Limitations and Future Work

The three case studies demonstrate that the proposed methodology yields reproducible and statistically controlled virtual particle size distributions. The generated synthetic PSDs are suitable for numerical simulations, sensitivity analyses, benchmarking, and virtual dataset creation, thereby bridging the gap between empirical measurements and computational modeling. By reproducing the analytical form of the selected distribution models, the framework ensures consistency across different granular materials and enhances comparability between simulation-based studies.
A key aspect of the presented examples is that they rely on literature-derived parameters rather than newly collected experimental data. This choice reflects the methodological objective of the study, which was to evaluate the generality, reproducibility, and model-agnostic nature of the proposed framework. Nevertheless, the method can be directly applied to laboratory measurements, enabling users to generate reproducible synthetic PSDs corresponding to empirical samples. In future work, the framework will be systematically validated using newly acquired experimental datasets to assess accuracy, quantify deviations, and demonstrate practical applicability in laboratory settings.
Despite these advantages, several limitations must be acknowledged.
First, the present study focuses on unimodal grain-size distributions to maintain clarity and computational tractability. In future work, the framework can be extended to multimodal distributions by combining multiple parametric models into mixture formulations. This enhancement would enable accurate representation of materials with complex granulometric signatures, such as bimodal soils, blended industrial powders, and composite particulate systems, thereby broadening the applicability of the approach.
Second, the methodology depends on accurate estimates of distribution parameters, which in practice may be affected by measurement noise, instrument resolution, or fitting procedures. Incorporating parameter uncertainty or Bayesian estimation presents a promising direction for future research.
Third, the method generates only particle sizes, without accounting for morphological characteristics such as particle shape, angularity, roundness, or surface texture. These properties are known to significantly influence mechanical and hydraulic behavior in granular systems. Although beyond the scope of the present work, the framework may be extended in the future by coupling PSD generation with morphological descriptors obtained from imaging-based analyses or empirical correlations.
Fourth, although the equal-probability approach offers excellent statistical representativeness, it incurs a higher computational cost due to repeated quantile function evaluations. In the revised manuscript, representative generation times and scaling behavior have been provided to guide users working with large datasets.
Finally, the current validation was performed on literature-based PSD models. While sufficient for demonstrating numerical correctness and generality, comprehensive experimental benchmarking will be essential to fully establish robustness, applicability, and transferability across different material classes.
By outlining these limitations and potential extensions, this section highlights both the flexibility and the current boundaries of the proposed framework. Its ability to generate reproducible, application-ready PSD datasets provides a foundation for standardized benchmarking protocols, improved DEM and CFD simulations, and broader adoption of open-science practices in granular and geoscientific research.

6. Conclusions

This paper introduced a systematic and reproducible framework for generating synthetic particle size distributions for virtual modeling and simulation of granular materials. By reviewing commonly used PSD models, the study identified those with well-defined cumulative distribution functions as particularly suitable for reproducible dataset generation through inverse transform sampling.
Two complementary procedures for histogram construction were proposed: the equal-width method, which aligns with conventional PSD representation and facilitates direct comparison with sieve analysis, and the equal-probability method, which ensures uniform representation of the probability space and highlights distribution tails. Together, these strategies balance interpretability with statistical rigor and reproducibility.
The framework was implemented in Python and demonstrated through case studies for soils (Weibull), sedimentary and industrial materials (Gamma), and food powders (log-normal), showing its versatility and applicability across diverse material classes. The generated datasets were statistically controlled, reproducible, and directly usable for model initialization, sensitivity analysis, and benchmarking in DEM, CFD, or geostatistical workflows. They also enable exploration of hypothetical or extreme conditions where experimental data are limited or inconsistent.
While the present work focuses on unimodal systems, the methodology can be naturally extended to multimodal, anisotropic, or spatially correlated particle systems by combining multiple distributions or introducing additional descriptors. Future developments may include automated parameter estimation from experimental datasets and standardized protocols for synthetic PSD benchmarking, further strengthening reproducibility and comparability across studies.
Overall, the proposed framework provides a flexible and transparent computational tool for bridging empirical grain-size data and virtual simulations. It supports data-driven research on soils, sediments, and particulate media, contributing to reproducible modeling practices and enhanced understanding of granular processes in geoscientific applications.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CDFCumulative distribution function
PDFProbability density function
PSDParticle size distribution

References

  1. Duran, J. Sands, Powders, and Grains: An Introduction to the Physics of Granular Materials; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  2. Guyon, E.; Delenne, J.-Y.; Radjai, F. Built on Sand: The Science of Granular Materials; MIT Press: Cambridge, MA, USA, 2020. [Google Scholar]
  3. Jaeger, H.M.; Nagel, S.R.; Behringer, R.P. The Physics of Granular Materials. Phys. Today 1996, 49, 32–38. [Google Scholar] [CrossRef]
  4. Du, J.; Liu, C.; Yin, S.; Rehman, A.; Ding, Y.; Wang, L. Particle Size Distribution in a Granular Bed Filter. Particuology 2021, 58, 108–117. [Google Scholar] [CrossRef]
  5. Ferreira, R.L.; Medeiros, M.; Pereira, J.E.; Henriques, G.F.; Tavares, J.C.; Marvila, M.T.; de Azevedo, A.R. Effects of Particle Size Distribution of Standard Sands on the Physical-Mechanical Properties of Mortars. Materials 2023, 16, 844. [Google Scholar] [CrossRef] [PubMed]
  6. Islam, S.F.; Hawkins, S.M.; Meyer, J.L.; Sharman, A.R. Evaluation of Different Particle Size Distribution and Morphology Characterization Techniques. Addit. Manuf. Lett. 2022, 3, 100077. [Google Scholar] [CrossRef]
  7. Merkus, H.G. Particle Size, Size Distributions and Shape. In Particle Size Measurements; Particle Technology Series; Springer: Dordrecht, The Netherlands, 2009; Volume 17, pp. 13–42. [Google Scholar]
  8. Guo, W.; Guo, K.; Xing, Y.; Gui, X. A Comprehensive Review on Evolution Behavior of Particle Size Distribution During Fine Grinding Process for Optimized Separation Purposes. Miner. Process. Extr. Metall. Rev. 2024, 1–20. [Google Scholar] [CrossRef]
  9. Jiang, M.; Barreto, D.; Ding, Z.; Yang, K. Discrete Element Study of Particle Size Distribution Shape Governing Critical State Behavior of Granular Material. Fractal Fract. 2025, 9, 26. [Google Scholar] [CrossRef]
  10. Liu, X.; Qu, S.; Huang, J. Relationship between Physical Properties and Particle-Size Distribution of Geomaterials. Constr. Build. Mater. 2019, 222, 312–318. [Google Scholar] [CrossRef]
  11. Ribeiro, A.; Montes, F.; Sousa, J.; Pais, A. Comminution Technologies in the Pharmaceutical Industry: A Comprehensive Review with Recent Advances. Rev. Chem. Eng. 2025, 41, 69–100. [Google Scholar] [CrossRef]
  12. Sobieski, W.; Lipiński, S. The Influence of Particle Size Distribution on Parameters Characterizing the Spatial Structure of Porous Beds. Granul. Matter 2019, 21, 14. [Google Scholar] [CrossRef]
  13. Ulusoy, U. A Review of Particle Shape Effects on Material Properties for Various Engineering Applications: From Macro to Nanoscale. Minerals 2023, 13, 91. [Google Scholar] [CrossRef]
  14. Wu, J.; Feng, M.; Mao, X.; Xu, J.; Zhang, W.; Ni, X.; Han, G. Particle Size Distribution of Aggregate Effects on Mechanical and Structural Properties of Cemented Rockfill: Experiments and Modeling. Constr. Build. Mater. 2018, 193, 295–311. [Google Scholar] [CrossRef]
  15. Li, X.-A.; Sun, J.; Ren, H.; Lu, T.; Ren, Y.; Pang, T. The Effect of Particle Size Distribution and Shape on the Microscopic Behaviour of Loess via the DEM. Environ. Earth Sci. 2022, 81, 290. [Google Scholar] [CrossRef]
  16. Pujala, U.; Subramanian, V.; Anand, S.; Venkatraman, B. Effect of Morphological Properties on the Particle Size Distribution Measurements and Modelling of Porous and Nonspherical Aerosol Behaviour. Prog. Nucl. Energy 2023, 163, 104811. [Google Scholar] [CrossRef]
  17. Zhang, X.; Zhou, L.; Cai, X.; Zheng, X.; Liu, W.; Wang, G.; Tan, D.; Luo, X.; Dong, M. Heterogeneous Structures and Morphological Transitions of Composite Materials and Its Applications. Adv. Compos. Hybrid Mater. 2024, 7, 251. [Google Scholar] [CrossRef]
  18. Esmaeelnejad, L.; Siavashi, F.; Seyedmohammadi, J.; Shabanpour, M. The Best Mathematical Models Describing Particle Size Distribution of Soils. Model. Earth Syst. Environ. 2016, 2, 1–11. [Google Scholar] [CrossRef]
  19. Rastgou, M.; Bayat, H.; Mansoorizadeh, M. Fitting Soil Particle-Size Distribution (PSD) Models by PSD Curve Fitting Software. Pol. J. Soil Sci. 2019, 52, 211. [Google Scholar] [CrossRef]
  20. Zhang, H.; Wang, C.; Chen, Z.; Kang, Q.; Xu, X.; Gao, T. Performance Comparison of Different Particle Size Distribution Models in the Prediction of Soil Particle Size Characteristics. Land 2022, 11, 2068. [Google Scholar] [CrossRef]
  21. Zhou, J.; Li, C.; Arslan, C.A.; Hasanipanah, M.; Bakhshandeh Amnieh, H. Performance Evaluation of Hybrid FFA-ANFIS and GA-ANFIS Models to Predict Particle Size Distribution of a Muck-Pile after Blasting. Eng. Comput. 2021, 37, 265–274. [Google Scholar] [CrossRef]
  22. Bayat, H.; Rastgo, M.; Zadeh, M.M.; Vereecken, H. Particle Size Distribution Models, Their Characteristics and Fitting Capability. J. Hydrol. 2015, 529, 872–889. [Google Scholar] [CrossRef]
  23. Ord, A.; Blenkinsop, T.; Hobbs, B. Fragment Size Distributions in Brittle Deformed Rocks. J. Struct. Geol. 2022, 154, 104496. [Google Scholar] [CrossRef]
  24. Zhang, G.; Fan, Y.; Yang, R.; Li, S. Application of the Rosin-Rammler Function to Describe Quartz Sandstone Particle Size Distribution Produced by High-Pressure Gas Rapid Unloading at Different Infiltration Pressure. Powder Technol. 2022, 412, 117982. [Google Scholar] [CrossRef]
  25. Chávez-Angel, E.; Eriksen, M.B.; Castro-Alvarez, A.; Garcia, J.H.; Botifoll, M.; Avalos-Ovando, O.; Arbiol, J.; Mugarza, A. Applied Artificial Intelligence in Materials Science and Material Design. Adv. Intell. Syst. 2025, 7, 2400986. [Google Scholar] [CrossRef]
  26. Kokott, S.; Marek, A.; Merz, F.; Karpov, P.; Carbogno, C.; Rossi, M.; Rampp, M.; Blum, V.; Scheffler, M. Towards Efficient and Accurate Input for Data-Driven Materials Science from Large-Scale All-Electron Density Functional Theory (DFT) Simulations. Model. Simul. Mater. Sci. Eng. 2024, 32, 28–31. [Google Scholar]
  27. Kantaros, A.; Ganetsos, T.; Pallis, E.; Papoutsidakis, M. From Mathematical Modeling and Simulation to Digital Twins: Bridging Theory and Digital Realities in Industry and Emerging Technologies. Appl. Sci. 2025, 15, 9213. [Google Scholar] [CrossRef]
  28. Servin, M.; Vesterlund, F.; Wallin, E. Digital Twins with Distributed Particle Simulation for Mine-to-Mill Material Tracking. Minerals 2021, 11, 524. [Google Scholar] [CrossRef]
  29. Kamrin, K.; Hill, K.M.; Goldman, D.I.; Andrade, J.E. Advances in Modeling Dense Granular Media. Annu. Rev. Fluid Mech. 2024, 56, 215–240. [Google Scholar] [CrossRef]
  30. Richter, C.; Will, F. Introducing Metamodel-Based Global Calibration of Material-Specific Simulation Parameters for Discrete Element Method. Minerals 2021, 11, 848. [Google Scholar] [CrossRef]
  31. Sobieski, W.; Matyka, M.; Gołembiewski, J.; Lipiński, S. The Path Tracking Method as an Alternative for Tortuosity Determination in Granular Beds. Granul. Matter 2018, 20, 72. [Google Scholar] [CrossRef]
  32. Aicha, M.B.; Al Asri, Y.; Zaher, M.; Alaoui, A.H.; Burtschell, Y. Prediction of Rheological Behavior of Self-Compacting Concrete by Multi-Variable Regression and Artificial Neural Networks. Powder Technol. 2022, 401, 117345. [Google Scholar] [CrossRef]
  33. Herrera, N.; Mollehuara, R.; Gonzalez, M.S.; Okkonen, J. Predictive Insight into Tailings Flowability at Their Disposal Using Operating Data-Driven Artificial. Neural Network (ANN) Technique. Minerals 2024, 14, 737. [Google Scholar] [CrossRef]
  34. Liu, Y.; Mohammed, Z.M.; Ma, J.; Xia, R.; Fan, D.; Tang, J.; Yuan, Q. Machine Learning Driven Fluidity and Rheological Properties Prediction of Fresh Cement-Based Materials. Materials 2024, 17, 5400. [Google Scholar] [CrossRef]
  35. Soranzo, E. Dataset of Close-Range Soil Images and Corresponding Particle Size Distributions. Data Brief 2025, 60, 111631. [Google Scholar] [CrossRef] [PubMed]
  36. Soranzo, E.; Guardiani, C.; Wu, W. Convolutional Neural Network Prediction of the Particle Size Distribution of Soil from Close-Range Images. Soils Found. 2025, 65, 101575. [Google Scholar] [CrossRef]
  37. Es-Saheb, M.H.; Cogun, C.; Al-Matery, S. Statistical Characterization of Particulate Properties of Some Industrial Powders. J. King Saud Univ. Eng. Sci. 2001, 13, 117–133. [Google Scholar] [CrossRef]
  38. Yan, H.; Barbosa-Cánovas, G.V. Size Characterization of Selected Food Powders by Five Particle Size Distribution Functions Caracterización Del Tamaño de Partícula de Alimentos En Polvo Mediante Cinco Funciones de Distribuciones de Tamaño. Food Sci. Technol. Int. 1997, 3, 361–369. [Google Scholar] [CrossRef]
  39. Lawler, D.F. Particle Size Distributions in Treatment Processes: Theory and Practice. Water Sci. Technol. 1997, 36, 15–23. [Google Scholar] [CrossRef]
  40. Bayat, H.; Rastgou, M.; Nemes, A.; Mansourizadeh, M.; Zamani, P. Mathematical Models for Soil Particle-size Distribution and Their Overall and Fraction-wise Fitting to Measurements. Eur. J. Soil Sci. 2017, 68, 345–364. [Google Scholar] [CrossRef]
  41. Zisselmar, R.; Kellerwessel, H. Approximate Mathematical Description of Particle-Size Distributions—Possibilities and Limitations as to the Assessment of Comminution and Classification Processes. Part. Part. Syst. Charact. 1985, 2, 49–55. [Google Scholar] [CrossRef]
  42. Alderliesten, M. Mean Particle Diameters. Part VII. The Rosin-Rammler Size Distribution: Physical and Mathematical Properties and Relationships to Moment-Ratio Defined Mean Particle Diameters. Part. Part. Syst. Charact. 2013, 30, 244–257. [Google Scholar] [CrossRef]
  43. Colorado-Arango, L.; Menéndez-Aguado, J.M.; Osorio-Correa, A. Particle Size Distribution Models for Metallurgical Coke Grinding Products. Metals 2021, 11, 1288. [Google Scholar] [CrossRef]
  44. Macıas-Garcıa, A.; Cuerda-Correa, E.M.; Dıaz-Dıez, M.A. Application of the Rosin–Rammler and Gates–Gaudin–Schuhmann Models to the Particle Size Distribution Analysis of Agglomerated Cork. Mater. Charact. 2004, 52, 159–164. [Google Scholar] [CrossRef]
  45. Smith, J.E.; Jordan, M.L. Mathematical and Graphical Interpretation of the Log-Normal Law for Particle Size Distribution Analysis. J. Colloid Sci. 1964, 19, 549–559. [Google Scholar] [CrossRef]
  46. Fredlund, M.D.; Fredlund, D.G.; Wilson, G.W. An Equation to Represent Grain-Size Distribution. Can. Geotech. J. 2000, 37, 817–827. [Google Scholar] [CrossRef]
  47. Fredlund, M.D.; Wilson, G.W.; Fredlund, D.G. Use of Grain-Size Functions in Unsaturated Soil Mechanics. In Proceedings of the Advances in Unsaturated Geotechnics, Denver, CO, USA, 5–8 August 2000; pp. 69–83. [Google Scholar]
  48. Bantan, R.A.R.; Jamal, F.; Chesneau, C.; Elgarhy, M. Theory and Applications of the Unit Gamma/Gompertz Distribution. Mathematics 2021, 9, 1850. [Google Scholar] [CrossRef]
  49. El-Gohary, A.; Alshamrani, A.; Al-Otaibi, A.N. The Generalized Gompertz Distribution. Appl. Math. Model. 2013, 37, 13–24. [Google Scholar] [CrossRef]
  50. Cosenza, D.N.; Soares, P.; Guerra-Hernández, J.; Pereira, L.; González-Ferreiro, E.; Castedo-Dorado, F.; Tomé, M. Comparing Johnson’s SB and Weibull Functions to Model the Diameter Distribution of Forest Plantations through ALS Data. Remote Sens. 2019, 11, 2792. [Google Scholar] [CrossRef]
  51. Johnson, N.L. Bivariate Distributions Based on Simple Translation Systems. Biometrika 1949, 36, 297–304. [Google Scholar] [CrossRef] [PubMed]
  52. Niethammer, B.; Pego, R.L. On the Initial-Value Problem in the Lifshitz-Slyozov-Wagner Theory of Ostwald Ripening. SIAM J. Math. Anal. 2000, 31, 467–485. [Google Scholar] [CrossRef]
  53. Privitera, S.; Manetto, G.; Pascuzzi, S.; Pessina, D.; Cerruto, E. Drop Size Measurement Techniques for Agricultural Sprays: A State-of-the-Art Review. Agronomy 2023, 13, 678. [Google Scholar] [CrossRef]
  54. Tu, K.-N.; Gusak, A.M.; Sun, Q.; Yao, Y. A Comparison Between Ripening Under a Constant Volume and Ripening Under a Constant Surface Area. Nanomaterials 2025, 15, 316. [Google Scholar] [CrossRef]
  55. Wei, F.; Liang, Y.; Wang, H.; Hu, M.; Wang, L.; Wang, D.; Tang, M. Construction and Investigation of a Filtration Efficiency Test System for High-Efficiency Filter Materials Based on Mass Concentration. Processes 2024, 12, 981. [Google Scholar] [CrossRef]
  56. Gupta, H.V.; Ehsani, M.R.; Roy, T.; Sans-Fuentes, M.A.; Ehret, U.; Behrangi, A. Computing Accurate Probabilistic Estimates of One-D Entropy from Equiprobable Random Samples. Entropy 2021, 23, 740. [Google Scholar] [CrossRef] [PubMed]
  57. Roederer, M.; Treister, A.; Moore, W.; Herzenberg, L.A. Probability Binning Comparison: A Metric for Quantitating Univariate Distribution Differences. Cytometry 2001, 45, 37–46. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Comparison of particle size distribution histograms generated using the equal-width method (blue) and the equal-probability method (orange) for a log-normal distribution. Parameters: number of bins = 15, total number of samples M = 100,000. The theoretical probability density function is shown as a red line.
Figure 1. Comparison of particle size distribution histograms generated using the equal-width method (blue) and the equal-probability method (orange) for a log-normal distribution. Parameters: number of bins = 15, total number of samples M = 100,000. The theoretical probability density function is shown as a red line.
Geosciences 15 00464 g001
Figure 2. Family of probability density functions generated from Weibull distributions fitted to 27 natural soils.
Figure 2. Family of probability density functions generated from Weibull distributions fitted to 27 natural soils.
Geosciences 15 00464 g002
Figure 3. Family of cumulative distribution functions grouped by the dominant fraction (Fine, Medium, Coarse), markers on CDFs show d10 (circle), d50 (square), and d90 (triangle) percentiles.
Figure 3. Family of cumulative distribution functions grouped by the dominant fraction (Fine, Medium, Coarse), markers on CDFs show d10 (circle), d50 (square), and d90 (triangle) percentiles.
Geosciences 15 00464 g003
Figure 4. Hierarchical clustering dendrogram of 27 soil PSDs based on percentiles d10, d50, and d90. Three primary clusters are identified, corresponding to coarse-, medium-, and fine-grained soils.
Figure 4. Hierarchical clustering dendrogram of 27 soil PSDs based on percentiles d10, d50, and d90. Three primary clusters are identified, corresponding to coarse-, medium-, and fine-grained soils.
Geosciences 15 00464 g004
Figure 5. Heatmap of percentiles (d10, d50, d90) and grain size spread for all soils.
Figure 5. Heatmap of percentiles (d10, d50, d90) and grain size spread for all soils.
Geosciences 15 00464 g005
Figure 6. Probability density functions for Tin, Gelatin, Iron, and Sodium Chloride, generated using Gamma distributions with literature-derived parameters.
Figure 6. Probability density functions for Tin, Gelatin, Iron, and Sodium Chloride, generated using Gamma distributions with literature-derived parameters.
Geosciences 15 00464 g006
Figure 7. Cumulative distribution functions for the analyzed materials.
Figure 7. Cumulative distribution functions for the analyzed materials.
Geosciences 15 00464 g007
Figure 8. Probability density functions for sugar, corn meal, and instant milk, generated using log-normal distributions.
Figure 8. Probability density functions for sugar, corn meal, and instant milk, generated using log-normal distributions.
Geosciences 15 00464 g008
Figure 9. Cumulative distribution functions for the three analyzed food powders.
Figure 9. Cumulative distribution functions for the three analyzed food powders.
Geosciences 15 00464 g009
Figure 10. Probability density functions of sugar + instant milk mixtures at seven different ratios.
Figure 10. Probability density functions of sugar + instant milk mixtures at seven different ratios.
Geosciences 15 00464 g010
Table 1. Applications, advantages, and limitations of empirical and statistical models for describing particle size distributions in granular systems.
Table 1. Applications, advantages, and limitations of empirical and statistical models for describing particle size distributions in granular systems.
Distribution ModelTypical ApplicationsAdvantagesLimitations
Normal (Gaussian)basic modeling,
unimodal PSDs
simple,
intuitive
allows negative diameters,
poor fit for skewed PSDs
Rosin–Rammlermining, comminution,
powder technology
simple, widely used,
good for crushed materials
limited flexibility for
multimodal or skewed PSDs
Gaudin–Schuhmannsoil mechanics,
mineral processing
easy to apply,
intuitive
oversimplified, cannot represent fine fractions accurately
Log-normalsediments, soils,
natural materials
flexible, suitable for naturally fragmented systemspoor fit for strongly
asymmetric or bimodal PSDs
Weibullfracture mechanics,
powders, reliability
versatile, models
breakage-dominated processes
parameter estimation
sensitive to data quality
Gammacivil engineering,
porous media
flexible,
effectively models skewness
requires iterative fitting,
rarely used
Betanormalized fractions,
fine powders
highly flexible, models
symmetric and asymmetric shapes
requires normalization, limited
intuitive interpretation
Fredlundsoil mechanics,
gradation curves
captures soil gradationdomain-specific
Gompertzbiological
fragmentation
suitable for skewed,
bounded distributions
weak physical basis
Jakygeotechnics,
soil PSDs curves
simple, soil-specificvery limited applicability
Johnson (SU, SB, SL, SN)flexible fitting of
skewed PSDs
very versatile,
mimics many distributions
complex parameter fitting
Lifshitz–Slyozov–WagnerOstwald ripening,
metallurgy
process-based,
time-dependent description
valid only for
diffusion-controlled growth
Nukiyama–Tanasawasprays, atomization,
combustion
classical spray droplet lawlimited to atomization processes
Skaggssoil physics,
hydrology
links PSD with
hydraulic properties
soil-specific, limited general use
Table 2. Availability and form of cumulative distribution functions for particle size distribution models.
Table 2. Availability and form of cumulative distribution functions for particle size distribution models.
Distribution ModelCDF Availability
Normal (Gaussian)closed form (via error function)
Log-normalclosed form (via normal CDF of log(x))
Weibullclosed form
Rosin–Rammlerclosed form (special case of Weibull model)
Gaudin–Schuhmannclosed form (power law)
Gammaclosed form (via incomplete gamma function)
Betaclosed form (via incomplete beta function)
Gompertzclosed form
Johnson (SU, SB, SL, SN)closed forms (transformations of normal CDF)
Fredlunddefined empirically
Jakydefined empirically
Skaggsdefined empirically
Lifshitz–Slyozov–Wagnerno closed form—requires numerical integration
Nukiyama–Tanasawano closed form—requires numerical integration
Table 3. Cumulative distribution functions of the considered PSD models, together with the interpretation of their parameters.
Table 3. Cumulative distribution functions of the considered PSD models, together with the interpretation of their parameters.
Distribution ModelCDFModel Parameter(s)
Normal (Gaussian) 1 2 1 + erf x μ σ 2 μ —mean particle diameter,
σ —standard deviation (spread)
Log-normal 1 2 1 + erf ln x μ σ 2 ,   x > 0 μ —mean of ln x ,
σ —spread of ln x
Weibull 1 exp x λ k ,   x 0 λ —scale parameter,
k —shape parameter
Rosin–Rammler 1 exp x x 0 n ,   x   0 x 0 —characteristic particle size,
n —spread parameter
Gaudin–Schuhmann x x m a x n n —shape parameter
Gamma 1 Γ α 0 x / β t α 1 e t d t ,   x 0 α —shape parameter,
β —scale parameter
Beta I x α , β ,   x   0,1 α , β —shape parameters
Gompertz 1 exp b exp c x 1 ,   x 0 b —scale parameter,
c —growth/shape
Johnson SU Φ γ + δ sinh 1 x ξ λ γ , δ —shape parameters,
ξ —location, λ —scale parameter
Johnson SB Φ γ + δ ln x ξ ξ + λ x , x ξ , ξ + λ γ , δ —shape parameters,
ξ —location, λ —scale parameter
Johnson SL Φ γ + δ ln x ξ ,   x > ξ γ —shape parameter,
ξ —location, δ —scale parameter
Johnson SN Φ γ + δ x ξ x ξ 2 + λ 2 γ , δ –shape parameters,
ξ —location, λ —scale parameter
Table 4. Comparison of two histogram generation methods for synthetic PSDs—equal-width bins vs. equal-probability bins.
Table 4. Comparison of two histogram generation methods for synthetic PSDs—equal-width bins vs. equal-probability bins.
Equal-Width BinsEqual-Probability Bins
Bin widthconstant across the rangevariable (narrow in dense regions,
wide in tails)
Interpretationdirectly comparable heights,
standard in PSD analysis
each bin represents the same probability mass
Advantagesintuitive, easy comparison with theoretical PDFs,
aligns with sieve analysis
highlights tails,
ensures all bins are populated
Disadvantagessparse bins in tails,
may hide rare events
bar heights less intuitive, variable widths complicate interpretation
Computational costlow computational overhead,
only requires linear binning
higher, requires repeated evaluation of the quantile function
Best usecomparing PSDs,
matching lab data, computing statistics
visualizing quantiles,
emphasizing distribution tails, balanced data for ML workflows
Table 5. Exemplary Weibull parameters (scale λ , shape k) for representative soil textures.
Table 5. Exemplary Weibull parameters (scale λ , shape k) for representative soil textures.
Soil
Type
Scale Parameter λ (mm)Shape Parameter k Distribution
Characteristics
high-gravel soils5–100.4–0.8broad distributions with
dominant coarse fractions
uniform
sands
0.2–0.3>2.5steep curves, narrow spread,
well-sorted particle sizes
fine silty–clayey soils<0.050.5–1.0steep fine-dominated curves with extended tails in small sizes
Table 6. Statistical parameters of Gamma-based particle size distributions for selected materials.
Table 6. Statistical parameters of Gamma-based particle size distributions for selected materials.
MaterialMean (mm)Std Dev (mm)Skewness
Tin580.691.20.31
Gelatin22.68.10.90
Iron22.98.60.95
Sodium Chloride155.429.50.34
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lipiński, S. A Computational Framework for Reproducible Generation of Synthetic Grain-Size Distributions for Granular and Geoscientific Applications. Geosciences 2025, 15, 464. https://doi.org/10.3390/geosciences15120464

AMA Style

Lipiński S. A Computational Framework for Reproducible Generation of Synthetic Grain-Size Distributions for Granular and Geoscientific Applications. Geosciences. 2025; 15(12):464. https://doi.org/10.3390/geosciences15120464

Chicago/Turabian Style

Lipiński, Seweryn. 2025. "A Computational Framework for Reproducible Generation of Synthetic Grain-Size Distributions for Granular and Geoscientific Applications" Geosciences 15, no. 12: 464. https://doi.org/10.3390/geosciences15120464

APA Style

Lipiński, S. (2025). A Computational Framework for Reproducible Generation of Synthetic Grain-Size Distributions for Granular and Geoscientific Applications. Geosciences, 15(12), 464. https://doi.org/10.3390/geosciences15120464

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop