Machine Learning Estimation of the Unit Weight of Organic Soils
Abstract
1. Introduction
2. Description of Geotechnical Research
2.1. The Local Subsoil
2.2. The Geotechnical Research Methods
2.2.1. In Situ Testing with CPT Static Probe (fs, qc)
2.2.2. The Laboratory Test (w, LOIT, γt)
3. Description of Statistical and Numerical Analysis Tools
3.1. Statistical Analysis Tools
- Calculation of basic statistical parameters;
- Checking of the normality of variable distributions;
- Calculation of coefficients of linear correlation matrices;
- Development of linear multiple regression models.
3.2. Numerical Analysis
4. Results and Discussion
4.1. Statistical Analysis
4.2. Machine Learning Analysis
4.2.1. Example One γt (fs, qc)
4.2.2. Example Two γt (LOIT, w)
5. Conclusions
- the use of machine learning to build regression models allowed for a more accurate estimation of the unit weight of organic soils for both analyzed examples;
- the use of machine learning tools can be an important element of a geotechnical engineer’s workshop;
- reaching for advanced regression tools should be preceded by statistical data analysis, on the basis of which comparative models can be obtained.
- The presented results may be supplemented by a comparison with classical nonlinear regression models.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
ML | Machine Learning |
CPT | Cone Penetration Test |
MAPE | Mean Absolute Percentage Error |
SVM | Support Vector Machine |
GPR | Gaussian Process Regression |
ANN | Artificial Neural Network |
References
- Gilbert, P.A. Effect of Sampling Disturbance on Laboratory-Measured Soil Properties; U.S. Army Engineer Waterways Experiment Station: Vicksburg, MS, USA, 1992. [Google Scholar]
- Mayne, P.W.; Peuchen, J.; Bouwmeester, D. Soil unit weight estimation from CPTs. In Proceedings of the 2rd International Symposium on Cone Penetration Testing, Huntington Beach, CA, USA, 9–11 May 2010. [Google Scholar]
- Robertson, P.; Cabal, K. Estimating soil unit weight from CPT. In Proceedings of the 2nd International Symposium on Cone Penetration Testing, Huntington Beach, CA, USA, 9–11 May 2010. [Google Scholar]
- Ozer, A.T.; Bartlett, S.F.; Lawton, E.C. CPTU and DMT for estimating soil unit weight of Lake Bonneville Clay. Geotech. Geophys. Site Charact. 2012, 4, 291–296. [Google Scholar]
- Ghanekar, R.K. Unit weight estimation from CPT for Indian offshore soft calcareous clay. In CPTU and DMT in Soft Clays and Organic Soils; Młynarek, Z., Wierzbicki, J., Eds.; Exlemplum Press: Poznań, Poland, 2014; pp. 31–44. [Google Scholar]
- Kovacevic, M.S.; Gavin, K.G.; Reale, C.; Libric, L. The use of neural networks to develop CPT correlations for soils in northern Croatia. In Proceedings of the 4th International Symposium on Cone Penetration Testing (CPT’18), Delft, The Netherlands, 21–22 June 2018; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar] [CrossRef]
- Lengkeek, H.J.; de Greef, J.; Joosten, S. CPT based unit weight estimation extended to soft organic soils and peat. In Proceedings of the 4th International Symposium on Cone Penetration Testing (CPT’18), Delft, The Netherlands, 21–22 June 2018; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar] [CrossRef]
- Lengkeek, H.J.; Brinkgreve, R.B.J. CPT-based unit weight estimation extended to soft organic clays and peat: An update. In Cone Penetration Testing 2022; CRC Press: Boca Raton, FL, USA, 2022. [Google Scholar] [CrossRef]
- Bishop, C.M.; Bishop, H.; Cham, S. Deep Learning—Foundations and Concepts; Springer Nature: London, UK, 2023. [Google Scholar] [CrossRef]
- Deka, P.C. A Primer on Machine Learning Applications in Civil Engineering; CRC Press: Boca Raton, FL, USA, 2019. [Google Scholar] [CrossRef]
- Sargiotis, D. Transforming Civil Engineering with AI and Machine Learning: Innovations, Applications, and Future Directions. Int. J. Res. Publ. Rev. 2025, 6, 3780–3805. [Google Scholar] [CrossRef]
- Mobasheri, F.; Hosseinpoor, M.; Yahia, A.; Pourkamali-Anaraki, F. Machine Learning as an Innovative Engineering Tool for Controlling Concrete Performance: A Comprehensive Review. Arch. Comput. Methods Eng. 2025, 1–45. [Google Scholar] [CrossRef]
- Ebid, A.M. 35 Years of (AI) in Geotechnical Engineering: State of the Art. Geotech. Geol. Eng. 2021, 39, 637–690. [Google Scholar] [CrossRef]
- Padarian, J.; Minasny, B.; McBratney, A.B. Machine learning and soil sciences: A review aided by machine learning tools. Soil 2020, 6, 35–52. [Google Scholar] [CrossRef]
- Liu, H.; Su, H.; Dias do Costa, D. State-of-the-art review on the use of AI-enhanced computational mechanics in geotechnical engineering. Artif. Intell. Rev. 2024, 57, 196. [Google Scholar] [CrossRef]
- Rauter, S.; Tschuchnigg, F. CPT Data Interpretation Employing Different Machine Learning Techniques. Geosciences 2021, 11, 265. [Google Scholar] [CrossRef]
- Abu-Farsakh, M.Y.; Shoaib, M.M. Machine Learning Models to Evaluate the Load-Settlement Behavior of Piles from Cone Penetration Test Data. Geotech. Geol. Eng. 2024, 42, 3433–3449. [Google Scholar] [CrossRef]
- Nierwinski, H.P.; Pfitscher, R.J.; Barra, B.S.; Menegaz, T.; Odebrecht, E. A practical approach for soil unit weight estimation using artificial neural networks. J. South Am. Earth Sci. 2023, 131, 104648. [Google Scholar] [CrossRef]
- Roy, S.; Abir, A.R.; Ansary, M.A. Use of CPT and other parameters for estimating soil unit weight using optimized machine learning models. Earth Sci. Inform. 2025, 18, 221. [Google Scholar] [CrossRef]
- Lechowicz, Z.; Sulewska, M.J. Assessment of the Undrained Shear Strength and Settlement of Organic Soils under Embankment Loading Using Artificial Neural Networks. Materials 2023, 16, 125. [Google Scholar] [CrossRef]
- Yousefpour, N.; Medina-Cetina, Z.; Hernandez-Martinez, F.G.; Al-Tabbaa, A. Stiffness and Strength of Stabilized Organic Soils—Part II/II: Parametric Analysis and Modeling with Machine Learning. Geosciences 2021, 11, 169. [Google Scholar] [CrossRef]
- Ulloa, H.O.; Ramirez, A.; Jafari, N.H.; Kameshwar, S.; Harrouch, I. Machine Learning–Based Organic Soil Classification Using Cone Penetrometer Tests. J. Geotech. Geoenvironmental Eng. 2024, 150, 9. [Google Scholar] [CrossRef]
- Czudec, G.; Bulanda, J. Geological and Engineering Geological Conditions for Recognition—Engineering for the Construction of Multi-Storey Building at Witolda Street in Rzeszów; Geotech, Ltd.: Aurora, ON, Canada; Department of Geological Services Design and Construction and the Environment: Rzeszow, Poland, 2010. [Google Scholar]
- Straż, G. Identification of local organic soils based on cone penetration test results. Constr. Archit. 2014, 13, 49–56. [Google Scholar] [CrossRef]
- PN-B-04452:2002; Geotechnics. Field Studies. Polish Committee for Standardization: Warsaw, Poland, 2002.
- EN 1997-1: 2008; Eurocode 7: Geotechnical Design—Part 1: General Rules. European Committee for Standardization: Brussels, Belgium, 2008.
- EN 1997-2: 2009/10; Eurocode 7: Geotechnical Design—Part 2: Ground Investigation and Testing. European Committee for Standardization: Brussels, Belgium, 2009.
- PN-EN ISO 22476-12:2009; Geotechnical Investigation and Testing—Field Testing—Part 12: Mechanical Cone Penetration Test. Polish Committee for Standardization: Warsaw, Poland, 2009.
- PN-EN ISO 22476-1:2023-06; Geotechnical investigation and Testing—Field Testing—Part 1: Electrical Cone and Piezocone Penetration Test. Polish Committee for Standardization: Warsaw, Poland, 2023.
- PN-B-04481: 1988; Building Soils. Laboratory Tests. Polish Committee for Standardization: Warsaw, Poland, 1988.
- PN-B-02480: 1986; Building Soils. Terms, Symbols, Division and Description of Soil. Polish Committee for Standardization: Warsaw, Poland, 1986.
- PN-EN ISO 14688-2: 2018-05; Geotechnical Investigation and Testing. Identification and Classification of Soil. Part 1: Identification and Description. European Committee for Standardization: Brussels, Belgium, 2018.
- PN-EN ISO 14688-1: 2018-05; Geotechnical Investigation and Testing. Identification and Classification of Soil. Part 2: Principles for a Classification. European Committee for Standardization: Brussels, Belgium, 2018.
- PN-EN ISO 17892-1: 2015; Geotechnical Investigation and Testing—Laboratory Testing of Soil—Part 1: Determination of Water Content. European Committee for Standardization: Brussels, Belgium, 2015.
- Marut, M.; Straż, G. Verification of standard guidelines for organic matter content determination in organic soils by the loss on ignition method. Geol. Rev. 2016, 64, 918–924. [Google Scholar]
- Heiri, O.; Lotter, A.F.; Lemcke, G. Loss on ignition as a method for estimating organic and carbonate content in sediments: Reproducibility and comparability of results. J. Paleolimnol. 2001, 25, 101–110. [Google Scholar] [CrossRef]
- PN-EN ISO 17892-2:2014; Geotechnical Investigation and Testing—Laboratory Testing of Soil—Part 2: Determination of Bulk Density. European Committee for Standardization: Brussels, Belgium, 2014.
- Straż, G.; Borowiec, A. Evaluation of the unit weight of organic soils from a CPTM using an Artificial Neural Networks. Arch. Civ. Eng. 2021, 67, 259–281. [Google Scholar] [CrossRef]
- Straż, G.; Borowiec, A. Estimating the unit weight of local organic soils from laboratory tests using artificial neural networks. Appl. Sci. 2020, 10, 2261. [Google Scholar] [CrossRef]
- Stanisz, A. An Accessible Course in Statistics Using STATISTICA PL with Examples from Medicine. In Basic Statistics; StatSoft: Kraków, Poland, 2006; Volume 1: Basic Statistics. [Google Scholar]
- Stanisz, A. An Accessible Course in Statistics Using STATISTICA PL with Examples from Medicine. In Basic Statistics; StatSoft: Kraków, Poland, 2007; Volume 2: Linear and nonlinear models. [Google Scholar]
- TIBCO Software Inc. Statistica (Data Analysis Software System), v. 13. 2017. Available online: https://docs.tibco.com/pub/stat/tibco-statistica-13-3-0_documentation.zip (accessed on 6 August 2025).
- MathWorks: MATLAB Documentation (R2024b). Available online: https://ww2.mathworks.cn/products/new_products/release2024b.html (accessed on 13 December 2024).
- Snoek, J.; Larochelle, H.; Adams, R.P. Practical Bayesian Optimization of Machine Learning Algorithms. Adv. Neural Inf. Process. Syst. 2012, 25, 2960–2968. [Google Scholar] [CrossRef]
- Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; CRC Press: Boca Raton, FL, USA, 1984. [Google Scholar]
- Breiman, L. Bagging Predictors. Mach. Learn. 1996, 26, 123–140. [Google Scholar] [CrossRef]
- Freund, Y.; Schapire, R.E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
- Vapnik, N.V. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995. [Google Scholar] [CrossRef]
- Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
Nr | Model Type | Preset | Interpretability | Flexibility |
---|---|---|---|---|
M1 | Linear Regression | Linear | Easy | Very low |
M2 | Linear Regression | Interactions Linear | Easy | Medium |
M3 | Linear Regression | Robust Linear | Easy | Very low |
M4 | Linear Regression | Stepwise Linear | Easy | Medium |
M5 | Decision Tree | Fine Tree | Easy | High |
M6 | Decision Tree | Medium Tree | Easy | Medium |
M7 | Decision Tree | Coarse Tree | Easy | Low |
M8 | SVM | Linear SVM | Easy | Low |
M9 | SVM | Quadratic SVM | Hard | Medium |
M10 | SVM | Cubic SVM | Hard | Medium |
M11 | SVM | Fine Gaussian SVM | Hard | High |
M12 | SVM | Medium Gaussian SVM | Hard | Medium |
M13 | SVM | Coarse Gaussian SVM | Hard | Low |
M14 | Efficiently Trained Linear | Least Squares | Easy | Medium |
M15 | Efficiently Trained Linear | SVM | Easy | Medium |
M16 | Ensemble of Trees | Boosted Trees | Easy | Medium |
M17 | Ensemble of Trees | Bagged Trees | Easy | Medium |
M18 | Gaussian Process | Squared Exponential GPR | Hard | Automatic |
M19 | Gaussian Process | Matern 5/2 GPR | Hard | Automatic |
M20 | Gaussian Process | Exponential GPR | Hard | Automatic |
M21 | Gaussian Process | Rational Quadratic GPR | Hard | Automatic |
M22 | Neural Network | Narrow Neural Network | Hard | Medium |
M23 | Neural Network | Medium Neural Network | Hard | Medium |
M24 | Neural Network | Wide Neural Network | Hard | Medium |
M25 | Neural Network | Bilayered Neural Network | Hard | High |
M26 | Neural Network | Trilayered Neural Network | Hard | High |
M27 | Kernel | SVM Kernel | Hard | Medium |
M28 | Kernel | Least Squares Kernel | Hard | Medium |
Variable | Count | Mean | Min | Max | Std Dev |
---|---|---|---|---|---|
X1: fs [kPa] | 135 | 47.59 | 9.80 | 209.03 | 42.90 |
X2: qc [kPa] | 135 | 314.64 | 140.00 | 644.00 | 106.88 |
X3: LOIT [%] | 135 | 20.72 | 5.02 | 84.93 | 20.73 |
X4: w [%] | 135 | 103.34 | 23.52 | 417.91 | 106.22 |
Y: γt [kN/m3] | 135 | 15.25 | 10.27 | 19.86 | 2.60 |
Nr | Model Type | Preset | R2 (Validation) | R2 (Test) | MAPE% (Validation) | MAPE% (Test) |
---|---|---|---|---|---|---|
M1 | Reference model | Linear | 0.211 | 0.253 | 13.87 | 13.57 |
M2 | Linear Regression | Interactions Linear | 0.208 | 0.249 | 13.84 | 13.55 |
M6 | Decision Tree | Medium Tree | 0.483 | 0.285 | 9.59 | 11.06 |
M11 | SVM | Fine Gaussian | 0.373 | 0.633 | 10.19 | 8.52 |
M17 | Ensemble | Bagged Trees | 0.498 | 0.363 | 9.40 | 10.62 |
M21 | Gaussian Process | Rational Quadratic | 0.437 | 0.585 | 9.29 | 9.57 |
M26 | Neural Network | Trilayered | 0.317 | 0.663 | 9.93 | 7.37 |
Nr | Model Type | Preset | R2 (Validation) | R2 (Test) | MAPE% (Validation) | MAPE% (Test) |
---|---|---|---|---|---|---|
M1 | Reference model | Linear | 0.787 | 0.797 | 6.10 | 7.87 |
M2 | Linear Regression | Interactions Linear | 0.927 | 0.934 | 3.40 | 3.90 |
M5 | Decision Tree | Fine Tree | 0.967 | 0.968 | 1.97 | 2.95 |
M12 | SVM | Fine Gaussian | 0.963 | 0.968 | 2.07 | 2.38 |
M17 | Ensemble | Bagged Trees | 0.973 | 0.973 | 1.80 | 2.63 |
M20 | Gaussian Process | Exponential | 0.980 | 0.992 | 1.47 | 1.25 |
M23 | Neural Network | Medium | 0.971 | 0.989 | 1.75 | 2.48 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Borowiec, A.; Straż, G.; Sulewska, M.J. Machine Learning Estimation of the Unit Weight of Organic Soils. Appl. Sci. 2025, 15, 9079. https://doi.org/10.3390/app15169079
Borowiec A, Straż G, Sulewska MJ. Machine Learning Estimation of the Unit Weight of Organic Soils. Applied Sciences. 2025; 15(16):9079. https://doi.org/10.3390/app15169079
Chicago/Turabian StyleBorowiec, Artur, Grzegorz Straż, and Maria Jolanta Sulewska. 2025. "Machine Learning Estimation of the Unit Weight of Organic Soils" Applied Sciences 15, no. 16: 9079. https://doi.org/10.3390/app15169079
APA StyleBorowiec, A., Straż, G., & Sulewska, M. J. (2025). Machine Learning Estimation of the Unit Weight of Organic Soils. Applied Sciences, 15(16), 9079. https://doi.org/10.3390/app15169079