Next Article in Journal
A Novel Ge-Doping Approach for Grain Growth and Recombination Suppression in Buffer-Free CIGSe Solar Cells
Previous Article in Journal
Compressive Failure Mechanisms of NCF Laminates with Double-Hole Defects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Porosity/Cement Index and Machine Learning Models for Predicting Tensile and Compressive Strength of Cemented Silt in Varying Compaction Conditions

by
Jair Arrieta Baldovino
1,*,
Oscar E. Coronado-Hernández
2 and
Yamid E. Nuñez de la Rosa
3,*
1
Department of Civil Engineering, Universidad de Cartagena, Cartagena de Indias 130015, Colombia
2
Instituto de Hidráulica y Saneamiento Ambiental, Universidad de Cartagena, Cartagena de Indias 130001, Colombia
3
Faculty of Engineering and Basic Sciences, Fundación Universitaria Los Libertadores, Bogotá 110231, Colombia
*
Authors to whom correspondence should be addressed.
Materials 2026, 19(3), 498; https://doi.org/10.3390/ma19030498
Submission received: 21 November 2025 / Revised: 12 December 2025 / Accepted: 24 December 2025 / Published: 27 January 2026

Abstract

This study investigates the mechanical response of cemented silt subjected to 28 days of curing by integrating two predictive methodologies: porosity–cement index (η/Civ) and machine learning (ML) models. The soil was compacted over a wide range of molding water contents and dry densities, including optimum and off-optimum states, and stabilized with varying cement contents. Unconfined compressive strength ( q u ) and splitting tensile strength ( q t ) were evaluated as functions of cement dosage, curing time, porosity, water content, and the specific gravities of the soil and cement. The η/Civ index demonstrated a strong predictive capability for both q u and q t , with determination coefficients exceeding 0.980, and exhibited the expected power-law decay with increasing η/Civ. ML algorithms—particularly Gaussian Process Regression with a Matern 5/2 kernel—outperformed the empirical model, achieving R2 values of 0.963 (validation) and 0.997 (testing) for q u prediction. The q t model similarly reached R2 = 0.984–0.988, demonstrating high generalization and stability across curing and compaction conditions. Experimental results revealed substantial strength gains with decreasing η/Civ, with q u increasing from 100 kPa at η/Civ = 46 to 2900 kPa at η/Civ = 19, while q t rose from 10–15 kPa to 300 kPa across the same range.

1. Introduction

Since 2007, Consoli et al. [1] have used different approach to estimate the strength, durability, and stiffness of various stabilized soils, including those treated with lime and cement, as well as those incorporating different waste materials and alternative binders. The porosity-cement ratio was introduced initially by Larnach [2] in 2019, but it was later named the void/cement ratio when considering this index to estimate the unconfined compressive strength of cemented sand. The porosity–cement ratio exhibits a close correlation with both the intrinsic properties of the soil and the characteristics of the binders employed [1]. This approach has been applied to evaluate the geotechnical properties of clean sands, silts, and clays. The empirical estimation of unconfined compressive strength ( q u ) and ( q t ) is often expressed in the following form (Equation (1)):
q u   o r   q t = A η C i v x B
A, x, and B values depend on the properties of the soil and cement, as well as their interaction. Constant A is expressed in kPa. The index η/Civ expresses, in a single parameter, the combined influence of both porosity and binder content on the mechanical strength of a material [3,4]. Thus, the relative importance of each factor can be adjusted through the exponent x ; that is, when the effect of porosity is more significant, x assumes a value lower than 1.0. For several studies, the coefficient of determination (R2) values obtained for estimating q u , q t , stiffness ( G o ), and accumulated loss of mass ( A L M ) as functions of the η/Civ ratio are generally high (e.g., [4,5,6]).
Recent research has demonstrated that porosity–binder indices serve as a powerful and generalizable framework for predicting strength, stiffness, and durability across a wide range of stabilized soils and geomaterials. Nierwinski et al. [7] and Baldovino et al. [8] further extended the applicability of this approach by demonstrating that the porosity/binder index yields highly predictive models for q u and ultrasonic stiffness in clay treated with mineral binders, industrial waste additives, and biopolymers, although gel-based systems such as xanthan gum required significantly lower exponents due to their distinct microstructural bonding mechanisms. Alibeigibeni et al. [9] emphasize that recycled concrete aggregates (RCAs) represent a sustainable alternative to natural aggregates, although their higher porosity and adhered mortar typically reduce mechanical strength and durability. Nonetheless, pretreatment strategies, optimized mix designs, and the incorporation of supplementary cementitious materials substantially improve RCA performance. In addition, recent studies have highlighted the increasing applicability of machine learning (ML) techniques for predicting the mechanical performance of stabilized soils, particularly when sustainable and waste-based binders are incorporated. Sridhar et al. [10] developed a robust ML framework to predict the unconfined compressive strength and California Bearing Ratio (CBR) of lateritic soils stabilized with industrial wastes such as red mud, copper slag, and iron ore tailings, demonstrating that Random Forest and multilayer perceptron models achieved high predictive accuracy (R2 > 0.90), with SHAP analysis identifying curing time, maximum dry density, and waste dosage as dominant factors. Similarly, Daimary et al. [11] employed artificial neural network (ANN) models to estimate q u , maximum dry density, and optimum moisture content of lateritic soils treated with agro-industrial wastes (cement kiln dust and rice husk ash), achieving R2 values up to 0.98 and confirming the capability of ANN models to capture complex nonlinear soil–binder interactions. Extending ML applications to solid waste–cement systems, Tu et al. [12] proposed an optimized hybrid extreme gradient boosting model for CBR prediction, which outperformed conventional ML approaches and identified cement content, fine-grained fraction, and calcium oxide content as the most influential variables.
Similarly, Hanafi et al. [13] confirmed that an adjusted porosity index (exponent 0.32) unifies the prediction of q u initial shear modulus ( G o ), and A L M in alluvial clay blended with cement and marble dust, underscoring the index’s capacity to accommodate sustainable binder replacements. Muñoz et al. [14] also demonstrated that both η/Civ accurately correlate q u , q t , and A L M in soil–cement–glass polishing waste mixtures, reducing reliance on labor-intensive trial-and-error procedures and supporting the development of optimized geomaterials. In contrast, Ribeiro et al. [15] revealed that although η/Civ effectively captures early-age strength in dredged sediments treated with lime and cement, long-term degradation due to ettringite formation can override mechanically favorable porosity–binder states, emphasizing chemical durability as a critical complement to index-based design. Finally, Haider et al. [16] showed that mixtures incorporating PET shreds and reduced cement content still follow a porosity–binder trend for CBR (California Bearing Ratio), A L M , and shear modulus, demonstrating that the index remains valid even when the binder phase is mechanically augmented with polymeric reinforcement.
In parallel with the widespread use of porosity–binder indices, there has been a rapid expansion of machine learning (ML) methods for predicting the strength and related properties of stabilized soils, often with high accuracy and increasing levels of interpretability. Focusing on nano-silica-stabilized fine-grained soils, Thapa et al. [17] proposed deep learning models (CNN, LSTM, and RNN) to predict q u from an extensive database of CI, MI, and CL–ML soils, showing that a CNN-based model, embedded in a graphical user interface, can reliably capture strength gains of several hundred percent associated with nano-silica dosage and curing time. Kumar et al. [18] compared Gradient Boosting Machine, Random Decision Forest, non-parametric regression, and decision trees for nano-doped fly-ash-treated clayey soils and identified GBM as the most accurate and interpretable model, using sensitivity, monotonicity, and SHAP analyses to highlight the dominant roles of curing days, maximum dry density, and sodium hexametaphosphate content in q u development.
Moving beyond purely data-driven approaches, Yang et al. [19] introduced physics-informed ML frameworks for cement-stabilized soils—residual-learning hybrid corrector models and physics-informed neural networks—embedding a modified water–cement ratio model into training; the best-performing PINN-based multi-scale DNN achieved high R2 and, through SHAP, quantified the relative importance of cement content, water content, and clay fraction. A second contribution by Thapa et al. [20] addressed na–silica-stabilized infinite slopes, utilizing an Optuna-optimized hybrid RNN–CNN–LSTM classifier to achieve 99% accuracy. The study also employed XAI–SHAP techniques to demonstrate that cohesion, nano-silica content, and slope angle are the primary factors governing slope stability. At the material-design scale, Yao et al. [21] developed a zebra-optimized XGBoost model to predict compressive strength in cohesive soils stabilized with industrial by-product geopolymers, demonstrating that calcium oxide, silicon dioxide, and curing age dominate strength evolution, enabling practical mix-design windows based on chemical composition.
Similarly, Luo et al. [22] built a data-driven framework for q u prediction in stabilized soils and identified a deep forest model as the most accurate; SHAP analysis and response-surface interpretations revealed that NaOH and GGBS contents are key drivers of strength in alkali-activated systems. Wang [23] adopted an automated ML strategy based on decision trees enhanced with metaheuristic optimizers (DTSH, DTTS, DT), showing that the DTSH hybrid can achieve extremely high R2 values across diverse stabilized-soil datasets. Hu et al. [24] combined gradient boosting decision trees with a genetic algorithm to predict and simultaneously optimize the q u of GGBS–fly-ash geopolymer-stabilized soils, demonstrating the predominance of binder content, curing age, and GGBS/FA ratio in strength development. Teodoru et al. [25] integrated laboratory testing with Bayesian-optimized ML models for cement-treated clayey silt, identifying Random Forest as the optimal predictor and coupling it with uncertainty quantification and SHAP-based interpretation to produce a web-based q u prediction tool. Along similar lines, Linganagoudar et al. [26] used decision trees, Random Forest, and multilayer perceptrons to predict q u in lateritic soils treated with cement and FGD gypsum, showing that the nonlinear interactions between cement–gypsum dosage and curing duration can be captured with R2 ≈ 0.98, with curing time emerging as the most influential factor. Finally, Mustafa et al. [27] explored SVR and decision tree models for both stabilized and unstabilized soils, demonstrating that grain-size distribution and moisture content dominate q u in natural soils. In contrast, the type and content of stabilizers, along with density and moisture state, become the primary predictors in stabilized mixtures.
Given the growing evidence that both porosity–binder indices and machine learning frameworks provide powerful tools for interpreting and predicting the behavior of stabilized soils, this study aims to integrate these two approaches to evaluate the mechanical response of cemented silt compacted at different moisture and density conditions. Specifically, the objective is to assess the extent to which the porosity–cement index and a set of supervised machine learning models can accurately estimate the unconfined compressive strength ( q u ) and splitting tensile strength ( q t ) of mixtures cured for 28 days. To achieve this, the analysis incorporates key mixture-design variables—including cement content, curing time, molding density, water content, specific gravities, and porosity—to identify the relative predictive capacity of each method and determine whether data-driven learning can complement or outperform the classical porosity–cement framework. Although recent studies have combined porosity–binder indices with machine learning techniques for predicting the behavior of stabilized soils, the present research introduces a distinct contribution by explicitly integrating the porosity–cement index (η/Civ) as a physically based parameter within a comprehensive ML framework. Unlike purely data-driven approaches, this study evaluates both optimum and non-optimum compaction conditions and directly compares classical η/Civ formulations with advanced ML models.

2. Experimental Program

The experimental plan is primarily based on comparing the application of the porosity-to-cement ratio (η/Civ) and machine learning methods to estimate the unconfined compressive strength ( q u ) and the splitting tensile strength ( q t ) compacted in strategic molding points variating the water content and dry unit weight, also compatible with optimum compaction conditions of optimum water content (OWC) and maximum dry unit weights (MDD) of silty soil improved with Portland cement. The data analysis of the porosity-to-cement index in the optimal compaction conditions was reported by Baldovino et al. [28], and the report of the influence of the porosity/cement index for estimating q u and q t in non-optimal compaction conditions was published by Baldovino et al. [29].
A total of n = 408 tests were performed for the determination of q u and q t at non-optimal conditions (i.e., fixing dry unit weight and water content) and n = 216 tests for estimating q u and q t at optimal compaction conditions (i.e., OMC and MDD).
Although this study focuses on a single silty soil stabilized with Portland cement, the porosity–cement index adopted herein is a generalized framework previously validated for different soil types. Nevertheless, the specific empirical parameters obtained in this research are material-dependent and require recalibration when applied to other soils or binders.

2.1. Materials

The materials used in this study included silty soil and high-early-strength Portland cement, along with distilled water used in the preparation of test specimens.
In concordance with Baldovino et al. [28,29], the soil used in this study corresponds to a high-plastic silt (Unified Soil Classification System: MH), characterized by a liquid limit of 50.82% and a plastic limit of 35.96%, resulting in a plasticity index of 14.86% (ASTM D4318 [30]). Its specific gravity is 2.62 (ASTM D854 [31]). The particle-size distribution (Figure 1) indicates a predominantly fine-grained material, composed of 60% silt-sized particles and only 5% clay, while the sand fraction (5% coarse, 12% medium, and 18% fine) represents a minor portion of the matrix (NBR 6502 [32]). The effective size (D10) of 0.003 mm and mean particle diameter (D50) of 0.038 mm, together with a uniformity coefficient (Cu = 12.88) and a coefficient of curvature (Cc = 0.88), reflect a well-graded non-plastic silty structure with limited coarse particles. In its natural state, the soil exhibits an unconfined compressive strength ( q u ) of 104.58 kPa and a splitting tensile strength ( q t ) of 16.62 kPa, yielding a q t / q u ratio of 0.16, which is typical of weakly bonded fine-grained soils. Direct shear testing (ASTM D3080 [33]) yields a friction angle of 26° and a cohesion value of 23 kPa, parameters consistent with those of a moderately cohesive silt. Visually, the soil presents a characteristic yellow coloration.
The Portland cement used in this study exhibits a chemical composition typical of high-calcium binders, characterized by a predominance of CaO (60.76%), followed by significant contents of SiO2 (18.96%) and Al2O3 (4.30%), which are essential for the formation of calcium silicate hydrates (C–S–H) and calcium aluminate phases responsible for strength development. Minor oxides, including Fe2O3 (2.95%), MgO (3.26%), and SO3 (3.18%), fall within expected ranges for commercial cements, ensuring adequate sulfate balance and volumetric stability. The cement contains a low insoluble residue (0.77%), indicative of high clinker purity. Compressive strengths of 44.7 MPa at 7 days and 54.2 MPa at 28 days demonstrate mechanical performance. The fineness of 0.04% contributes to increased reactivity by providing a larger surface area for hydration reactions. Additionally, the specific gravity of 3.11 (NBR 16605 [34]) aligns with values typical of Portland cements. The Supplementary Materials contain the information used in this study.

2.2. Specimen Molding and Preparation

All specimens, both for unconfined compressive and splitting tensile tests, were compacted by static pressing in three layers within steel cylindrical molds. The specimens prepared for compression and tensile strength tests had a height of 100 mm and a diameter of 50 mm. After compaction, all specimens were sealed and cured in a humid chamber for 7, 14, or 28 days (depending on the compaction conditions) prior to conducting the compression and tensile tests. The details of the curing conditions and mixture compositions are summarized in Table 1 and Table 2.
Figure 2 shows the compaction curves of the soil under standard, intermediate, and modified energies, along with the 100% saturation lines and the molding points of the soil–cement specimens. The molding points are divided into two groups: points prepared at a constant water content with varying dry unit weights, and points corresponding to optimum compaction conditions (i.e., MDD and OWC).

2.3. Unconfined Compressive and Splitting Tensile Protocols

The unconfined compression tests were conducted in accordance with ASTM D2166/D2166M-16 [35], while the indirect tensile (Brazilian) tests followed the procedures outlined in ABNT NBR 7222 [36]. All mechanical tests ( q u and q t ) were carried out using a Geotechnik testing machine with a loading rate of 1.14 mm/min and a maximum capacity of 20 kN. Data acquisition was performed with a force sensitivity of 0.25 N, ensuring precise measurement of load–deformation behavior throughout the tests.

3. Machine Learning Methodology

In this study, each category of machine learning model was employed to predict the values of q u and q t . Table 3 summarizes the selected algorithm types. With respect to interpretability, linear models, decision trees, and Support Vector Machines (SVM), efficiently trained linear models are relatively easy to interpret, meaning that the influence of the predictors can be assessed rapidly and predictions can be generated efficiently [37]. In contrast, SVM models with nonlinear kernels offer limited interpretability. The remaining machine learning models—Gaussian Process Regression, kernel-based models, ensembles of trees, and neural networks—are generally more challenging to interpret [37]. Nevertheless, both Gaussian Process Regression and neural networks provide strong predictive performance for q u and q t . Table 1 and Table 2 present the information regarding the dataset employed for implementing the ML models.
Table 4 presents the formulas associated with the machine learning algorithms employed in this study. These formulas are organized into eight categories; however, depending on the model type, several algorithmic variants may be derived based on their internal parameters. In total, twenty-eight ML models were evaluated. The hyperparameter optimization framework provided in MATLAB R2024b was adopted to obtain the best statistical performance for all models. A summary of the underlying formulas is provided, taking into account their relative complexity. Accordingly, several clarifying observations are included to facilitate the interpretation of the machine learning results.
As a crucial aspect of the methodology, it is essential to select appropriate predictors together with their corresponding responses when applying machine learning models. Table 5 presents the predictor–response pairs used in this study. In total, five predictors were identified, as shown in the experimental program of this research (see Section 2). These predictors are utilized for computing the two responses of the ML algorithms ( q u and q t ).
During the development of the ML models, a 5-fold cross-validation scheme was employed (80% of the dataset), with 20% of the dataset reserved for testing purposes to minimize the risk of overfitting.

4. Results and Discussions

4.1. Effects of Porosity-to-Cement Index on Unconfined Compressive and Splitting Tensile Strength Considering the Optimum Compaction Conditions

Figure 3 correlates the porosity/cement index and q u and q t for 7, 14, and 28 days, when the soil was compacted at MDD and OWC, as presented in detail in Table 6. The experimental results of q u confirm a strong inverse relationship between the porosity–cement index (η/Civ) and q u of the cemented silt, consistent with the classical porosity–cement framework. For specimens cured for 28 days under standard compaction energy, η/Civ decreases from approximately 37 for mixtures with 3% cement to about 22, 16, and 13 for cement contents of 5%, 7%, and 9%, respectively. In parallel, q u increases from roughly 435 kPa at 3% cement to around 775 kPa at 5%, 1080 kPa at 7%, and 1450 kPa at 9%. A similar trend is observed for the intermediate and modified compaction energies: for 28-day curing and 5% cement, η/Civ decreases from approximately 22.6 (standard) to 17.1 (intermediate) and 14.0 (modified), while q u increases from 770 kPa to 1780 kPa and then to 1900 kPa. These results indicate that both higher cement content and higher compaction energy primarily act by reducing η/Civ, densifying the soil skeleton, and increasing the volumetric fraction of cement, which collectively enhance qu.
As expected, lower values of η/Civ—resulting from reduced porosity and/or higher volumetric cement contents—produce consistently higher q t values. For specimens compacted under Standard energy at 28 days, η/Civ decreased from approximately 37.1 to 12.8, yielding a corresponding increase in q t from 50 kPa (3% cement) to 236 kPa (9% cement). Under Intermediate energy, the same curing age showed an even more pronounced response: η/Civ ranged from 28.0 to 9.9, while q t increased from 124 kPa to 386 kPa, demonstrating the dual effect of higher density and improved cement bonding. The trend was strongest for the Modified compaction energy, in which η/Civ dropped to values as low as 9, resulting in q t levels that reached 525 kPa for 9% cement at 28 days. A similar pattern was observed for shorter curing times. At 14 days, Standard-energy specimens exhibited q t values ranging from 38 to 213 kPa as η/Civ decreased from 36.5 to 13.1, while Intermediate and Modified energies again produced enhanced strength, with q t reaching 327 kPa and 436 kPa, respectively, for η/Civ values near 8–10. After only 7 days of curing, the relationship remained consistent: q t increased from 36 kPa to 133–170 kPa under Standard energy (η/Civ from 36.5 to 13.0), and up to 365 kPa under Modified energy (η/Civ = 8.1–10.3).
Table 6 presents the equations governing the unconfined compressive and splitting tensile strengths of compacted soil–cement blends.

4.2. Effects of Porosity-to-Cement Index on Unconfined Compressive and Splitting Tensile Strength Considering the Non-Optimum Compaction Conditions

Figure 4 presents the results of the q u of soil–cement blends considering the porosity-to-cement index (adjusted to 0.50) and 28 days of curing for the blends compacted at the molding dry unit weight and with variable water content, as presented in Table 1. The results reveal a clear and systematic influence of the η / C i v 0.50 index on the 28-day q u of the cemented silt, demonstrating that reductions in η / C i v 0.50 are consistently associated with substantial strength gains across all moisture conditions. For specimens compacted near 10% water content, q u values increased from approximately 60–72 kPa at η / C i v 0.50   = 45.9 to more than 600 kPa when η / C i v 0.50 decreased to 27.5 and further exceeded 900 kPa when η / C i v 0.50 reached 23.2. At slightly higher moisture levels (14%), the same trend persisted: q u increased from 75–82 kPa at η / C i v 0.50 = 45.7 to about 550–620 kPa when η / C i v 0.50 decreased to 30.7 and then rose sharply to 770–810 kPa at η / C i v 0.50 = 27.4; further reductions to ~23.0 resulted in strengths between 1.39 and 1.43 MPa. At moisture contents around 19%, high strengths were also obtained, with q u rising from 115 kPa at η / C i v 0.50 = 46.1 to 475 kPa at η / C i v 0.50 = 35.8 and reaching 925 kPa when η / C i v 0.50 approached 27.4. Subsequently, strengths greater than 2.0 MPa were recorded at n  η / C i v 0.50 = 19.3. This behavior was consistent for higher molding moisture (24–29%), where initial q u values remained below 300 kPa at η / C i v 0.50 = 46.0 but increased to 800 kPa when η / C i v 0.50 fell to ~36.0 and exceeded 1.1–1.4 MPa at η / C i v 0.50 = 25.8; ultimately, values between 2.0 and 3.0 MPa were achieved at η / C i v 0.50 = 19.3.
Figure 5 presents the results of q t of soil–cement blends considering the η / C i v 0.50 index and 28 days of curing for the blends compacted at the molding dry unit weight and variable water content, as presented in Table 1. For all curing conditions, q t increases systematically as η / C i v 0.50 decreases, reflecting the dominant influence of reduced porosity and higher volumetric cement content on bond formation and tensile resistance. At low water contents (9.8–10.2%), where the mixtures exhibit both lower porosities (39–45%) and reduced η / C i v 0.50 values (19–31), q t reaches its highest levels, typically ranging between 100 kPa and 260 kPa, with peak values exceeding 250 kPa for η / C i v 0.50 = 19.3. As moisture increases to intermediate levels (14–19%), η / C i v 0.50 rises to the range of 22–46, and q t correspondingly decreases, producing tensile strengths mostly between 20 kPa and 120 kPa. At high molding water contents (24–34%), the index remains elevated (26–46) due to increased porosity and reduced dry density, and q t drops significantly, generally falling within 12–60 kPa, even with comparable cement contents. The results show a clear inverse and nonlinear relationship: lower η / C i v 0.50 values (indicating denser structures and higher cement effectiveness) consistently yield higher q t , whereas increases in moisture content promote higher porosity and weaker interparticle bonding, reducing tensile strength.
Table 7 describes the equations governing the unconfined compressive and splitting tensile strengths of compacted soil–cement blends.

4.3. Normalization Equations for Estimating the Unconfined Compressive and Splitting Tensile Strength

The normalization process begins by dividing the porosity–cement ratio by the corresponding water content values. These results are then further normalized with respect to the curing time (tc). In this way, a single equation is developed to estimate both the unconfined compressive strength and the splitting tensile strength results. Thus, Figure 6 presents the normalization of q u and q t results in a unique equation. The equation form is as follows:
q u   a n d   q t = w   f u n c t i o n η C i v x B
where w is a function of the water content. For soil–cement blends compacted in different water content and dry unit weights, the general equations for estimating the strength of compacted blends are as follows for q u and q t , respectively:
q u   = 0.0022 ω 4 0.214 ω 3 + 6.83 ω 2 72.8 ω + 409 × 10 4 η C i v 0.50 2.27
q t   = 0.0003 ω 4 0.035 ω 3 + 1.29 ω 2 17.35 ω + 94.02 × 10 4 η C i v 0.50 2.27
For validating the equations, experimental values of water content, porosity, and Civ for each specimen were replaced in Equations (9) and (10) with compressive and tensile strength (when corresponding). Thus, Figure 7 provides the R2 adjusted values of the estimating equations. Both q u and q t general equations yielded high values of R2 (above 0.700), specifically R2 = 0.853 (compressive) and R2 = 0.738 (splitting tensile).
For soil–cement blends compacted in OMC and MDD, the general equations for estimating the strength of compacted blends are as follows for q u and q t , respectively:
q u   = 76.38 t c 0.30 × 10 4 η C i v 0.50 2.27
q t   = 9.56 t c 0.39 × 10 4 η C i v 0.50 2.27
Equations (11) and (12) are a function of curing time. To validate the equations, experimental values for curing time, porosity, and Civ for each specimen were substituted into Equations (11) and (12) for compressive and tensile strength, respectively. Thus, Figure 8 provides the R2 adjusted values for the estimating equations. Both q u and q t general equations yielded high values of R2, specifically R2 = 0.951 (compressive) and R2 = 0.957 (splitting tensile).

4.4. Machine Learning Results

The eight algorithm categories were evaluated together with their internal variants, resulting in a total of twenty-eight machine learning presets. The root-mean-square error (RMSE) and the coefficient of determination ( R 2 ) were computed for each preset during both the validation and testing stages for the two response variables, q u and q t , as shown in Table 8. The cells highlighted in gray correspond to the best-performing model for each response. Overall, the results indicate that the Matern 5/2 Gaussian Process Regression (GPR) model provides the most accurate predictions for both responses, yielding the lowest RMSE values and R 2 values approaching unity. Accordingly, RMSE values of 61.8 and 10.5 were obtained for q u during the validation and testing stages, respectively, whereas RMSE values of 12.2 and 12.6 were obtained for q t in these stages. Similarly, the R 2 values were close to 1. During the validation stage, a value of 0.963 was obtained for q u , while the testing stage reached a value of 0.997. For q t , the corresponding R 2 values were 0.984 and 0.988 for the validation and testing stages, respectively. The robust linear method was the only approach to yield negative R2 values during the training phase for q u . No negative values were observed for q t at this stage.
In general, the different GPR variants exhibited excellent performance, demonstrating high predictive capability and robust generalization. It is also worth noting that several neural network architectures, particularly the wide and multilayer configurations, achieved competitive accuracy, with RMSE and R 2 values comparable to those of the GPR models. Linear models presented poor generalization to unseen data. Tree-based models exhibited moderate performance; fine and boosted trees performed reasonably well, whereas medium and coarse trees showed reduced accuracy. Support Vector Machines offered mixed results for q u and q t .
For the selected GPR model, the selected hyperparameters are summarized as follows: constant basis function; Matern 5/2 kernel function; use of an isotropic kernel with automatic length-scale determination; automatic estimation of the noise variance (sigma) and signal standard deviation; and activation of data standardization and numerical hyperparameter optimization options.
Figure 9 illustrates the predictive capability of the GPR model for q u and q t . The trained dataset consists of 240 samples collected during the experimental program, of which 80% were used for validation and the remaining samples for testing. Figure 9a,b compares the true values (experimental samples) with the predicted values obtained using the Matern 5/2 GPR model for q u . The black line represents the perfect-fit condition, whereas the magenta dashed line shows the linear trend of the dataset. Both lines exhibit nearly identical behavior, demonstrating an excellent level of agreement. For the validation stage, these results correspond to the following expression:
q u , P   = 0.9902 q u , T + 13.8
where T corresponds to the true values, and P represents the values predicted by the Matern 5/2 GPR model.
A similar interpretation applies to the results for q t , as the Matern 5/2 GPR preset also provides highly accurate predictions for this response. Once again, the black and magenta dashed lines are practically parallel, reflecting the excellent agreement between the true and predicted values during both the validation and testing stages, as shown in Figure 9c,d.
Figure 10 shows that the residuals are predominantly concentrated around zero over the entire range of true response values, indicating essentially unbiased predictive behavior of the model. Most of the residuals fall within the interval −100 to 100 kPa, confirming that the prediction errors remain limited for the vast majority of the samples and that no systematic overestimation or underestimation is evident across the response domain.
In addition, Shapley values were computed to identify the most influential predictors for estimating q u and q t , as shown in Figure 11. Based on the average impact on the model output (mean absolute Shapley values), the most relevant predictors, in descending order, are C c , γ d , W c , and T c for both responses. The remaining predictors ( G s , c and G s , s ) exhibited very low influence, indicating a weak dependency of the model on these variables.
Considering that the cement content is the most sensitive predictor according to the Shapley values, the boxplots analyzed in Figure 12 show a strong level of consistency between the true and predicted responses. For both q u and q t , the medians of the predicted values closely follow those of the experimental data across the full cement-content range (approximately 300–1600 kPa for q u and 20–180 kPa for q t ). The interquartile ranges also exhibit comparable widths, indicating that the model adequately captures the variability of the responses. The substantial overlap between the true and predicted distributions confirms the robustness and reliability of the Matern 5/2 GPR model predictions across the full parameter range.
A comparison was made between the porosity-to-cement model and the Matern 5/2 GPR model to examine the order of magnitude of their predictions, as shown in Table 9. The comparative analysis between the porosity–cement index and the Matern 5/2 Gaussian Process Regression (GPR) model demonstrates the clear superiority of ML-based approaches for predicting the mechanical behavior of cemented silt. While the porosity–cement formulations yielded moderate to strong correlations (R2 = 0.738–0.957), the Matern 5/2 GPR achieved substantially higher accuracy, reaching R2 values up to 0.997 for qn and 0.988 for qt in testing. This improvement reflects the model’s ability to capture nonlinear interactions between porosity, cement content, molding water content, and curing processes—relationships that the traditional index cannot fully represent. The robustness of the GPR model across validation and testing scenarios (R2 > 0.96) confirms its high generalization potential, offering reliable predictions even under varying compaction and chemical conditions. These findings highlight the growing importance of machine learning in soil stabilization, where ML can complement classical indices by providing more precise, data-driven insight into strength evolution, enabling optimized mix design and reducing experimental cost and uncertainty in geotechnical engineering.
It of utmost importance to highlight that ML models must be used considering the range of applications where these models were trained, validated, and tested, which corresponds to the information presented in Table 1 and Table 2. Considering the dataset used in this study, the cross-fold validation method was employed to mitigate the risk of overfitting. In addition, not only did the validation stage yield satisfactory results but the testing stage also confirmed that the selected machine learning model is robust and not affected by overfitting (see the Matern 5/2 GPR ML method in Table 8).

5. Conclusions

-
The porosity–cement index (η/Civ) proved to be a robust and unifying parameter for predicting the mechanical behavior of the cemented silt, exhibiting strong correlations for both q u and q t . For 28-day curing, the best-fit exponent converged to x = 0.50, producing high determination coefficients (R2 = 0.98 for q u and R2 = 0.97 for q t ), confirming the validity of a power-law relationship for materials compacted under Standard, Intermediate, and Modified energies.
-
Mechanical strength increased markedly with decreasing η/Civ, demonstrating the dominant influence of porosity reduction over cement volume fraction. For q u , mixtures with η/Civ = 45–46 exhibited very low strengths (60–80 kPa), whereas reducing η/Civ to 19–21 yielded q u between 1500 and 3000 kPa, representing 15–25-fold increases. For q t , the same trend was observed: at η/Civ = 46, q t remained below 15 kPa, while values of 22–23 produced q t between 130 and 330 kPa, and η/Civ = 19 yielded peak strengths above 400 kPa, confirming a consistent strengthening mechanism for both tensile and compressive responses.
-
Compaction water content played a critical role in defining the porosity–cement state and the corresponding strength envelope. Specimens molded at w = 10.0–10.2% achieved the lowest porosities (39%) and the highest q u values, reaching 1500–3000 kPa depending on Civ. In contrast, increasing the water content to 19–24% raised the porosity to 50–51%, resulting in a strength below 120 kPa for the duplicate cement content. This demonstrates the strong coupling between molding water content, packing structure, and cementation efficiency.
-
Machine learning models (Gaussian Process Regression, Matern 5/2 kernel) outperformed the empirical porosity–cement model in prediction accuracy, achieving R2 = 0.963 (validation) and R2 = 0.997 (testing) for q u and R2 = 0.984–0.988 for q t . The ML models captured nonlinear interactions among moisture, density, curing age, and binder content that are not explicitly represented in the η/Civ formulation. However, when used together, both approaches provide complementary insights: η/Civ explains the mechanics, while ML enhances predictive precision.
-
The combined framework of porosity–cement index + machine learning offers a robust dual methodology for the design of cemented silt geomaterials, enabling both mechanistic understanding and high-accuracy prediction. This study demonstrates that η/Civ efficiently generalizes physical behavior across compaction energies and moisture states, while ML provides superior prediction for engineering applications. This integrated approach significantly reduces experimental effort and enables the optimization of mix designs for sustainable ground improvement.
-
While ML algorithms provide superior predictive accuracy, η/Civ offers a mechanistic explanation of strength development across varying compaction states. The combined framework demonstrates that physically based indices and data-driven models are complementary rather than redundant, providing a practical and robust methodology for the design and optimization of cement-stabilized soils.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ma19030498/s1.

Author Contributions

Conceptualization, J.A.B. and O.E.C.-H.; methodology, J.A.B. and O.E.C.-H.; validation, J.A.B. and Y.E.N.d.l.R.; formal analysis, J.A.B. and O.E.C.-H.; investigation, J.A.B.; resources, Y.E.N.d.l.R. and J.A.B.; writing—original draft preparation, J.A.B. and O.E.C.-H.; writing—review and editing, J.A.B. and O.E.C.-H.; visualization, Y.E.N.d.l.R.; supervision, J.A.B.; funding acquisition, Y.E.N.d.l.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fundación Universitaria Los Libertadores—Colombia Project ING-13-25 covering the APC.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Acknowledgments

The authors thank the Universidad de Cartagena and Fundación Universitaria Los Libertadores (Bogotá, Colombia).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Consoli, N.C.; Foppa, D.; Festugato, L.; Heineck, K.S. Key Parameters for Strength Control of Artificially Cemented Soils. J. Geotech. Geoenviron. Eng. 2007, 133, 197–205. [Google Scholar] [CrossRef]
  2. Larnach, W.J. Relationship between Dry Density, Voids/Cement Ratio and the Strength of Soil-Cement Mixtures. Civ. Eng. Public Work. Rev. UK 1960, 60, 903–905. [Google Scholar]
  3. Consoli, N.C.; Dalla Rosa, A.; Corte, M.B.; Lopes, L.D.S.; Consoli, B.S. Porosity-Cement Ratio Controlling Strength of Artificially Cemented Clays. J. Mater. Civ. Eng. 2011, 23, 1249–1254. [Google Scholar] [CrossRef]
  4. Scheuermann Filho, H.C.; Dias Miguel, G.; Cesar Consoli, N. Porosity/Cement Index over a Wide Range of Porosities and Cement Contents. J. Mater. Civ. Eng. 2022, 34, 06021011. [Google Scholar] [CrossRef]
  5. Román Martínez, C.; Nuñez de la Rosa, Y.E.; Estrada Luna, D.; Baldovino, J.A.; Jordi Bruschi, G. Strength, Stiffness, and Microstructure of Stabilized Marine Clay-Crushed Limestone Waste Blends: Insight on Characterization through Porosity-to-Cement Index. Materials 2023, 16, 4983. [Google Scholar] [CrossRef]
  6. Consoli, N.C.; da Silva, A.; Barcelos, A.M.; Festugato, L.; Favretto, F. Porosity/Cement Index Controlling Flexural Tensile Strength of Artificially Cemented Soils in Brazil. Geotech. Geol. Eng. 2020, 38, 713–722. [Google Scholar] [CrossRef]
  7. Nierwinski, H.P.; Sosnoski, J.; Heidemann, M. Evaluation of Strength and Durability of Compacted Bauxite Tailings Treated with Cement. Clean Technol. Environ. Policy 2025, 27, 5141–5150. [Google Scholar] [CrossRef]
  8. Baldovino, J.D.J.A.; Ortega, R.T.; Nuñez de la Rosa, Y.E. Experimental Stabilization of Clay Soils in Cartagena de Indias Colombia: Influence of Porosity/Binder Index. Appl. Sci. 2025, 15, 5895. [Google Scholar] [CrossRef]
  9. Alibeigibeni, A.; Stochino, F.; Zucca, M.; Gayarre, F.L. Enhancing Concrete Sustainability: A Critical Review of the Performance of Recycled Concrete Aggregates (RCAs) in Structural Concrete. Buildings 2025, 15, 1361. [Google Scholar] [CrossRef]
  10. Sridhar, H.N.; Shiva Kumar, G.; Ramaraju, H.K.; Ujwal, M.S.; Vinay, A.; Pandit, P. Machine Learning Framework for Predicting and Improving the Unconfined Compressive Strength and California Bearing Ratio of Lateritic Soil Stabilized with Industrial Wastes. Discov. Sustain. 2025. [Google Scholar] [CrossRef]
  11. Daimary, N.; Sarmah, D.; Bhattacharjee, A.; Barman, U.; Saikia, M.J. Geotechnical Performance of Lateritic Soil Subgrades Stabilized with Agro-Industrial Waste: An Experimental Assessment and ANN-Based Predictive Modelling. Geotechnics 2025, 5, 65. [Google Scholar] [CrossRef]
  12. Tu, Y.; Yao, Q.; Gu, S.; Yang, J. Efficient Prediction of California Bearing Ratio in Solid Waste-Cement-Stabilized Soil Using Improved Hybrid Extreme Gradient Boosting Model. Mater. Today Commun. 2025, 43, 111627. [Google Scholar] [CrossRef]
  13. Hanafi, M.; Javed, I.; Ekinci, A. Evaluating the Strength, Durability and Porosity Characteristics of Alluvial Clay Stabilized with Marble Dust as a Sustainable Binder. Results Eng. 2025, 25, 103978. [Google Scholar] [CrossRef]
  14. Muñoz, Y.O.; Villota-Mora, A.J.E.; Brandão, D.L.; Orioli, M.A.; Britto, T.S.S.; Baldovino, J.A.; dos Santos Izzo, R.L. Enhancing Soil–Cement Properties Using Glass Polishing Waste: Impact of Porosity and Binder Indices. Geotech. Geol. Eng. 2025, 43, 100. [Google Scholar] [CrossRef]
  15. Ribeiro, I.S.; Fagundes, D.D.F.; Nierwinski, H.P. Effects of Ettringite Formation on the Stability of Cement-Treated Sediments. Resources 2025, 14, 73. [Google Scholar] [CrossRef]
  16. Haider, A.B.; Iravanian, A.; Selman, M.H.; Ekinci, A. Using Waste PET Shreds for Soil Stabilization: Efficiency and Durability Assessment. Int. J. Geosynth. Ground Eng. 2023, 9, 48. [Google Scholar] [CrossRef]
  17. Thapa, I.; Ghani, S. Nano-Silica and Machine Learning-Based Soil Stabilization: Advancing Sustainable and Clean Technologies for Resilient Infrastructure. Prog. Eng. Sci. 2025, 2, 100131. [Google Scholar] [CrossRef]
  18. Kumar, A.; Sinha, S.; Pradhananga, R. Assessment of Unconfined Compressive Strength of Nano-Doped Fly Ash-Treated Clayey Soil Using Machine Learning Tools. Sci. Rep. 2025, 15, 33847. [Google Scholar] [CrossRef]
  19. Yang, H.; Hu, G.; Liu, L.; Li, Y.; Deng, Y.; Wu, J. A Physics-Informed and SHAP-Enhanced Modeling Framework for Predicting Strength of Cement-Stabilized Soil. Case Stud. Constr. Mater. 2025, 23, e05280. [Google Scholar] [CrossRef]
  20. Thapa, I.; Ghani, S.; Paramasivam, P.; Tufa, M.A. Development of an Optimized Deep Learning Model for Predicting Slope Stability in Nano Silica Stabilized Soils. Sci. Rep. 2025, 15, 26213. [Google Scholar] [CrossRef]
  21. Yao, Q.; Tu, Y.; Yang, J. Indirect Estimation of Compressive Strength of Industrial Byproduct-Geopolymer Stabilized Cohesive Soils: A Novel Hybrid Extreme Gradient Boosting Model. Bull. Eng. Geol. Environ. 2025, 84, 465. [Google Scholar] [CrossRef]
  22. Luo, Z.; Xue, X.; Xiong, C.; Huang, R.; Du, J.; Li, J.; Liu, K. Data-Driven Prediction of Unconfined Compressive Strength in Stabilized Soils Using Machine Learning. Discov. Appl. Sci. 2025, 7, 1122. [Google Scholar] [CrossRef]
  23. Wang, L. Automated Machine Learning Techniques for Estimating the Unconfined Compressive Strength of Soil Stabilization. Int. J. Interact. Des. Manuf. (IJIDeM) 2025, 19, 6347–6359. [Google Scholar] [CrossRef]
  24. Hu, J.; Li, Z.; Qin, X.; Dong, S.; Zhao, T.; Li, H. Machine Learning-Based Optimization of Geopolymer-Stabilized Soil Formulation Using Blast Furnace Slag and Fly Ash. Mater. Today Commun. 2025, 48, 113500. [Google Scholar] [CrossRef]
  25. Teodoru, I.-B.; Owusu-Yeboah, Z.; Aniculăesi, M.; Dascălu, A.V.; Hörtkorn, F.; Amelio, A.; Lungu, I. Prediction of Unconfined Compressive Strength in Cement-Treated Soils: A Machine Learning Approach. Appl. Sci. 2025, 15, 7022. [Google Scholar] [CrossRef]
  26. Linganagoudar, C.M.; Kumar, G.S.; Ujwal, M.S.; Rohith, G.; Vinay, A.; Pandit, P. Modeling the Unconfined Compressive Strength of Lateritic Soil Treated with FGD Gypsum as a Partial Cement Replacement. Mater. Res. Express 2025, 12, 065501. [Google Scholar] [CrossRef]
  27. Mustafa, Y.M.H.; Wudil, Y.S.; Zami, M.S.; Al-Osta, M.A. Machine Learning Approach for Assessment of Compressive Strength of Soil for Use as Construction Materials. Eng 2025, 6, 84. [Google Scholar] [CrossRef]
  28. Baldovino, J.D.J.A.; Izzo, R.L.D.S.; Pereira, M.D.; Rocha, E.V.D.G.; Rose, J.L.; Bordignon, V.R. Equations Controlling Tensile and Compressive Strength Ratio of Sedimentary Soil–Cement Mixtures under Optimal Compaction Conditions. J. Mater. Civ. Eng. 2020, 32, 04019320. [Google Scholar] [CrossRef]
  29. Baldovino, J.D.J.A.; Izzo, R.L.D.S.; Feltrim, F.; da Silva, É.R. Experimental Study on Guabirotuba’s Soil Stabilization Using Extreme Molding Conditions. Geotech. Geol. Eng. 2020, 38, 2591–2607. [Google Scholar] [CrossRef]
  30. ASTM D4318-10; Stardard Test Methods for Liquid Limit, Plastic Limit and Plasticity Index of Soils. ASTM International: West Conshohocken, PA, USA, 2010.
  31. ASTM D854; Standard Test Methods for Specific Gravity of Soil Solids by Water Pycnometer. ASTM International: West Conshohocken, PA, USA, 2014.
  32. ABNT NBR 6502; Rochas e Solos. Associação Brasileira de Normas Técnicas: São Paulo, Brazil, 1995.
  33. ASTM D3080-11; Standard Test Method for Direct Shear Test of Soils under Consolidated Drained Conditions. ASTM International: West Conshohocken, PA, USA, 2011.
  34. ABNT NBR 16605-17; Cimento Portland e Outros Materiais Em Pó—Determinação Da Massa Específica. ABNT: São Paulo, Brazil, 2017.
  35. ASTM D 2166-03; Standard Test Method for Unconfined Compressive Strength of Cohesive Soil 1. ASTM International: West Conshohocken, PA, USA, 2003.
  36. ABNT NBR 7222; Informação e Documentação—Artigo em Publicação Periódica Científica Impressa—Apresentação. ABNT: Rio de Janeiro, Brazil, 2011.
  37. Mathworks Statistics and Machine Learning ToolboxTM User’s Guide R2020a; MATLAB Mathworks Inc.: Natick, MA, USA, 2020.
Figure 1. The granulometric curve of the silty soil sample.
Figure 1. The granulometric curve of the silty soil sample.
Materials 19 00498 g001
Figure 2. The compaction curve of soil (at standard, intermediate, and modified energies), saturation line of 100%, molding points, and optimum points.
Figure 2. The compaction curve of soil (at standard, intermediate, and modified energies), saturation line of 100%, molding points, and optimum points.
Materials 19 00498 g002
Figure 3. Results of unconfined compressive strength and splitting tensile strength of soil–cement blends considering the porosity-to-cement ratio index (adjusted to 0.50) and 7, 24, and 28 days of curing. Blends were compacted at MDD and OMC (Table 2).
Figure 3. Results of unconfined compressive strength and splitting tensile strength of soil–cement blends considering the porosity-to-cement ratio index (adjusted to 0.50) and 7, 24, and 28 days of curing. Blends were compacted at MDD and OMC (Table 2).
Materials 19 00498 g003
Figure 4. Results of unconfined compressive strength of soil–cement blends considering the porosity-to-cement index (adjusted to 0.50) and 28 days of curing. Blends were compacted at molding dry unit weight and variable water content (Table 1).
Figure 4. Results of unconfined compressive strength of soil–cement blends considering the porosity-to-cement index (adjusted to 0.50) and 28 days of curing. Blends were compacted at molding dry unit weight and variable water content (Table 1).
Materials 19 00498 g004
Figure 5. Results of splitting tensile strength of soil–cement blends considering the porosity-to-cement index (adjusted to 0.50) and 28 days of curing. Blends were compacted at molding dry unit weight and variable water content (Table 1).
Figure 5. Results of splitting tensile strength of soil–cement blends considering the porosity-to-cement index (adjusted to 0.50) and 28 days of curing. Blends were compacted at molding dry unit weight and variable water content (Table 1).
Materials 19 00498 g005
Figure 6. Normalization equations to estimate the splitting tensile strength and compressive strength of soil–cement blends, considering the curing times and water contents.
Figure 6. Normalization equations to estimate the splitting tensile strength and compressive strength of soil–cement blends, considering the curing times and water contents.
Materials 19 00498 g006
Figure 7. Results considering the normalized equations when mixes were compacted at conditions described in Table 1 of estimated: (a) compressive strength; and (b) splitting tensile strength.
Figure 7. Results considering the normalized equations when mixes were compacted at conditions described in Table 1 of estimated: (a) compressive strength; and (b) splitting tensile strength.
Materials 19 00498 g007
Figure 8. Results considering the normalized equations when mixes were compacted at OMC and MDD of estimated: (a) compressive strength; and (b) splitting tensile strength.
Figure 8. Results considering the normalized equations when mixes were compacted at OMC and MDD of estimated: (a) compressive strength; and (b) splitting tensile strength.
Materials 19 00498 g008
Figure 9. Comparison between true and predicted values for (a) q u during the validation stage; (b) q u during the testing stage; (c) q t during the validation stage; and (d) q t during the testing stage.
Figure 9. Comparison between true and predicted values for (a) q u during the validation stage; (b) q u during the testing stage; (c) q t during the validation stage; and (d) q t during the testing stage.
Materials 19 00498 g009
Figure 10. Comparison between residuals and true responses for the Matern 5/2 GPR model for the validation stage: (a) q u and (b) q t .
Figure 10. Comparison between residuals and true responses for the Matern 5/2 GPR model for the validation stage: (a) q u and (b) q t .
Materials 19 00498 g010
Figure 11. Computation of Shapley values for (a) q u and (b) q t .
Figure 11. Computation of Shapley values for (a) q u and (b) q t .
Materials 19 00498 g011
Figure 12. Boxplot analysis for cement content: (a) q u and (b) q t .
Figure 12. Boxplot analysis for cement content: (a) q u and (b) q t .
Materials 19 00498 g012
Table 1. Mixed proportion design for soil–cement compacted blends cured under 28 days at non-optimal compaction conditions (i.e., varying γd and w).
Table 1. Mixed proportion design for soil–cement compacted blends cured under 28 days at non-optimal compaction conditions (i.e., varying γd and w).
Molding
γd (kN/m3)
Soil (%)C (%)Molding w (%)tc (Days)SpecimensTest
13.0100310, 14.67, 19.33, 24, 28.67, and 33.342836 q u   and   q t
100510, 14.67, 19.33, 24, 28.67, and 33.342836 q u   and   q t
100710, 14.67, 19.33, 24, 28.67, and 33.342836 q u   and   q t
100910, 14.67, 19.33, 24, 28.67, and 33.342836 q u   and   q t
13.75100328.67 and 33.342812 q u   and   q t
100528.67 and 33.342812 q u   and   q t
100728.67 and 33.342812 q u   and   q t
100928.67 and 33.342812 q u   and   q t
14.5100310, 14.67, 19.33, 24, and 28.672830 q u   and   q t
100510, 14.67, 19.33, 24, and 28.672830 q u   and   q t
100710, 14.67, 19.33, 24, and 28.672830 q u   and   q t
100910, 14.67, 19.33, 24, and 28.672830 q u   and   q t
16.0100310, 14.67, 19.33, and 242824 q u   and   q t
100510, 14.67, 19.33, and 242824 q u   and   q t
100710, 14.67, 19.33, and 242824 q u   and   q t
100910, 14.67, 19.33, and 242824 q u   and   q t
Table 2. Mixed proportion design for soil–cement blends cured under 7, 14, and 28 days at optimal compaction conditions (i.e., OWC and MDD).
Table 2. Mixed proportion design for soil–cement blends cured under 7, 14, and 28 days at optimal compaction conditions (i.e., OWC and MDD).
EnergySoil (%)C (%)OWC (%)MDD (kN/m3)tc (Days)SpecimensTest
Standard10032613.857, 14, and 2818 q u   and   q t
100526.513.807, 14, and 2818 q u   and   q t
100726147, 14, and 2818 q u   and   q t
100925.5147, 14, and 2818 q u   and   q t
Intermediate10031815.657, 14, and 2818 q u   and   q t
10051815.557, 14, and 2818 q u   and   q t
100718.515.557, 14, and 2818 q u   and   q t
10091815.557, 14, and 2818 q u   and   q t
Modified10031516.857, 14, and 2818 q u   and   q t
10051517.057, 14, and 2818 q u   and   q t
100714.516.957, 14, and 2818 q u   and   q t
10091516.957, 14, and 2818 q u   and   q t
Table 3. Interpretability of categories of machine learning models.
Table 3. Interpretability of categories of machine learning models.
Model Type (Presets)Interpretability
EasyHard
Linear (Linear, Interactions, Robust, and Stepwise)
Decision Trees (Fine, Medium, and Coarse)
Support Vector Machine—SVM (Linear, Quadratic, Cubic, Fine Gaussian, Medium Coarse, and Coarse Gaussian) (linear SVM) (other kernels)
Efficiently Trained Linear (Least Squares and Linear SVM)
Gaussian Process Regression (Squared Exponential, Matern 5/2, Exponential, and Rational Quadratic)
Kernel models (SVM and Least Squares)
Ensembles of Trees (Boosted and Bagged)
Neural Networks (Narrow, Medium, Wide, Bilayered, and Trilayered)
Table 4. Formulas of machine learning models.
Table 4. Formulas of machine learning models.
Model TypeFormulaEquation No.ObservationNotation
Linear y = β 0 + β 1 x + ε (2) y = response, x = predictor, β 0 = intersection term, β 1 = slope term, and ε = error term.
TreesNot presented-Prediction begins at the root node and proceeds down to a leaf node. None.
Support Vector Machine (SVM) f x = n = 1 N ( α n α n * ) G ( x n , x ) + b (3)It is used for nonlinear SVM regression. G ( x n , x ) = kernel function, f x = function for computing new values, N = observations, and b = bias term.
Efficiently Trained LinearNot presented-Corresponds to linear least squares models and linear SVM.None.
Gaussian Process Regression P ( y | f ,   X ) ~ N ( y | H β + f , σ 2 I ) (4)None. β = coefficient that depends on data, y = response, x = predictors, N = Gaussian distribution, σ 2 = variance, and H = vector of basis functions.
Kernel Models f x = i = 1 N α i K ( x i , x ) (5)None. f x = function for computing new values, K ( x i , x ) = kernel function, and α i = coefficients.
Ensembles of Trees f x = t = 1 T α t h t ( x ) (6)Applicable for the boosting algorithm. f x = function for computing new values, α t = weights of the weak hypothesis, and h t ( x ) = prediction of learner with index t .
Neural Networks a 3 = f 3 ( L W 3 , 2 f 2 L W 2 , 1 f 1 I W 1 , 1 p + b 1 + b 2 + b 3 ) (7)It corresponds to the trilayered neural network. a = output vector, p = input vector, W = refers to weight, I = input matrix, and L = layer matrix. The numbers refer to the layers.
Table 5. Identification of predictors and responses for applying ML models.
Table 5. Identification of predictors and responses for applying ML models.
VariablesIdentificationSource
Predictors (inputs obtained from experimental program) W c = water content.
γ d = unit dry weight.
C c = cement content.
G s , c = specific gravity content.
G s , s = specific gravity of soil.
T c = curing time.
Experimental Program (Section 2).
Responses (outputs computed by ML models) q u = unconfined compressive strength.
q t = splitting tensile strength.
Machine learning models (Section 3).
Table 6. Results of equations for estimating unconfined compressive and splitting tensile strength of soil–cement compacted blends, derivation of porosity-to-cement ratio index, and considering the OMC and OWC (Figure 3).
Table 6. Results of equations for estimating unconfined compressive and splitting tensile strength of soil–cement compacted blends, derivation of porosity-to-cement ratio index, and considering the OMC and OWC (Figure 3).
TestCuring TimeA Value (×104)x ExponentB ExponentR2
q u 7137.610.50−2.270.915
14167.860.50−2.270.891
28208.830.50−2.270.956
q t 720.45 0.50−2.270.944
1426.75 0.50−2.270.953
2835.14 0.50−2.270.959
Table 7. Results of equations for estimating unconfined compressive and splitting tensile strength of soil–cement compacted blends, derivation of porosity-to-cement ratio index, and considering the variation in water content and dry unit weight (Figure 4 and Figure 5).
Table 7. Results of equations for estimating unconfined compressive and splitting tensile strength of soil–cement compacted blends, derivation of porosity-to-cement ratio index, and considering the variation in water content and dry unit weight (Figure 4 and Figure 5).
TestWater ContentA Value (×104)x ExponentB ExponentR2
q u 10.00117.220.50−2.270.920
14.67160.230.50−2.270.914
19.33203.290.50−2.270.942
24.00241.920.50−2.270.975
28.67220.010.50−2.270.971
33.34183.320.50−2.270.965
q t 10.0017.190.50−2.270.905
14.6721.440.50−2.270.863
19.3327.510.50−2.270.893
24.0038.720.50−2.270.920
28.6734.630.50−2.270.933
33.3425.410.50−2.270.914
Table 8. Comparison of statistical performance measures for the machine learning models.
Table 8. Comparison of statistical performance measures for the machine learning models.
Preset q u q t
RMSE
(V)
R2
(V)
RMSE
(T)
R2
(T)
RMSE
(V)
R2
(V)
RMSE
(T)
R2
(T)
Linear305.10.103242.2−0.46243.20.80356.20.765
Interactions Linear258.30.357223.3−0.24226.10.92833.90.914
Robust Linear326.4−0.026201.0−0.00746.60.77263.60.698
Stepwise Linear256.20.368226.6−0.27926.70.92533.70.915
Fine Tree154.10.77160.50.90941.70.81738.70.889
Medium Tree221.50.527138.60.52251.60.71957.10.757
Coarse Tree295.30.160200.6−0.00268.00.51382.00.499
Linear SVM321.90.002206.8−0.06645.10.78660.80.725
Quadratic SVM223.30.520190.40.09623.70.94130.70.930
Cubic SVM213.00.563118.30.65198.3−0.01661.30.721
Fine Gaussian SVM265.50.321145.20.47533.90.87946.50.839
Medium Gaussian SVM277.70.257175.30.23423.70.94126.50.948
Coarse Gaussian SVM320.50.010205.8−0.05551.00.72666.70.669
Efficient Linear Least Squares304.90.104240.3−0.43948.10.75661.10.722
Efficient Linear SVM320.30.012205.3−0.05073.50.43188.70.414
Boosted Trees109.90.88451.00.93526.80.92533.90.914
Bagged Trees177.60.696105.60.72238.60.84348.00.828
Squared Exponential GPR58.00.96811.20.99713.90.98012.80.988
Matern 5/2 GPR61.80.96310.50.99712.20.98412.60.988
Exponential GPR55.70.97012.80.99613.90.98010.90.991
Rational Quadratic GPR61.70.96310.30.99712.30.98412.40.989
Narrow NN154.30.771118.40.65121.90.94925.30.952
Medium NN60.10.96525.20.98415.30.97514.60.984
Wide NN35.40.98815.00.99414.60.97811.90.990
Bilayered NN106.00.89297.20.76519.00.96213.80.986
Trilayered NN73.00.94925.40.98417.20.96914.40.984
SVM Kernel321.60.004204.6−0.04392.40.101120.2−0.076
Least Squares Regression Kernel166.20.73482.90.82932.80.88763.60.699
Note: V = validation, T = Test, NN = neural network, and GPR = Gaussian Process Regression.
Table 9. Comparison of R2 values between porosity-to-cement and Matern 5/2 models.
Table 9. Comparison of R2 values between porosity-to-cement and Matern 5/2 models.
ResponsePorosity-to-Cement ModelMatern 5/2 GPR Model
EquationR2R2
ValidationTesting
q u Equation (9)0.9510.9630.997
q t Equation (10)0.9570.9840.988
q u Equation (11)0.8530.9630.997
q t Equation (12)0.7380.9840.988
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Baldovino, J.A.; Coronado-Hernández, O.E.; Nuñez de la Rosa, Y.E. Porosity/Cement Index and Machine Learning Models for Predicting Tensile and Compressive Strength of Cemented Silt in Varying Compaction Conditions. Materials 2026, 19, 498. https://doi.org/10.3390/ma19030498

AMA Style

Baldovino JA, Coronado-Hernández OE, Nuñez de la Rosa YE. Porosity/Cement Index and Machine Learning Models for Predicting Tensile and Compressive Strength of Cemented Silt in Varying Compaction Conditions. Materials. 2026; 19(3):498. https://doi.org/10.3390/ma19030498

Chicago/Turabian Style

Baldovino, Jair Arrieta, Oscar E. Coronado-Hernández, and Yamid E. Nuñez de la Rosa. 2026. "Porosity/Cement Index and Machine Learning Models for Predicting Tensile and Compressive Strength of Cemented Silt in Varying Compaction Conditions" Materials 19, no. 3: 498. https://doi.org/10.3390/ma19030498

APA Style

Baldovino, J. A., Coronado-Hernández, O. E., & Nuñez de la Rosa, Y. E. (2026). Porosity/Cement Index and Machine Learning Models for Predicting Tensile and Compressive Strength of Cemented Silt in Varying Compaction Conditions. Materials, 19(3), 498. https://doi.org/10.3390/ma19030498

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop