Parametric Modeling of the Unsaturated Soil Hydraulic Conductivity Function Using Tree-Based and Ensemble Machine Learning Algorithms: A Comparative Analysis of Cubist, Random Forest, and LightGBM
Abstract
1. Introduction
2. Materials and Methods
2.1. Fitting Soil Moisture Data and Estimating Hydraulic Conductivity Function
2.2. Development of PTFs
2.3. Data Preprocessing
2.4. Machine Learning Algorithms
2.4.1. RF Algorithm
| Algorithm 1: Random forest (RF) |
| Parameters: Number of trees = 500; |
| Procedure |
| 1 For t = 1 to T: |
| 2 - Draw a bootstrap sample from the data |
| 3 - Grow a decision tree ft using a random feature subset |
| 4 - Store the decision tree ft |
| 5 End For |
| 6 To predict: |
| 7 - Return average of all tree predictions |
2.4.2. Cubist Algorithm
| Algorithm 2: Cubist |
| Parameters: Number of committees = 10; |
| Neighbors = 5; |
| Procedure |
| 1 Train a rule-based tree: |
| 2 - Split nodes to minimize prediction error |
| 3 - Fit linear models at leaves using least squares |
| 4 Generate rules |
| 5 - Convert tree paths into interpretable if-then rules |
| 6 To predict: |
| 7 - Identify matching rule/path |
| 8 - Apply associated linear model |
2.4.3. LightGBM Algorithm
| Algorithm 3: Light gradient-boosting machine (LightGBM) |
| Parameters: Learning rate (η) = 0.01; |
| Number of leaves = 31; |
| Number of rounds (T) = 1000; |
| Procedure |
| 1 Initialize model with a constant value |
| 2 For t = 1 to T: |
| 3 - Compute the gradients of the loss function |
| 4 - Fit regression tree ft to the computed gradients |
| 5 - Update prediction using the output of ft |
| 6 End For |
| 7 Return the final prediction |
2.5. Model Training and Implementation
2.6. Shapley Additive Explanation Analysis for Variable Importance
2.7. Evaluation Statistics and Taylor Diagram for Performance Comparison
3. Results and Discussion
3.1. Descriptive Statistics and Data Distribution Analysis
3.2. Performance Comparison of Cubist, RF, and LightGBM Across PTFs
3.3. Decoding Predictive Relationships: SHAP-Based Interpretation of SHCF Parameters
3.4. Contextualization of Cubist Performance Within the Broader PTF Literature
3.5. Limitations and Future Work
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Scarfone, R.; Wheeler, S.J.; Lloret-Cabot, M. A hysteretic hydraulic constitutive model for unsaturated soils and application to capillary barrier systems. Geomech. Energy Environ. 2022, 30, 100224. [Google Scholar] [CrossRef]
- Fuentes, C.; Chávez, C.; Brambila, F. Relating Hydraulic Conductivity Curve to Soil-Water Retention Curve Using a Fractal Model. Mathematics 2020, 8, 2201. [Google Scholar] [CrossRef]
- Ontman, R.; Groffman, P.M.; Driscoll, C.T.; Cheng, Z. Surprising relationships between soil pH and microbial biomass and activity in a northern hardwood forest. Biogeochemistry 2023, 163, 265–277. [Google Scholar] [CrossRef]
- Dahunsi, J.; Pathirana, S.; Cheema, M.; Krishnapillai, M.; Galagedara, L. Estimating soil hydraulic conductivity from time-lapse ground-penetrating radar data in podzolic soils using the green-ampt model. J. Hydrol. 2025, 657, 133059. [Google Scholar] [CrossRef]
- Rastgou, M.; He, Y.; Jiang, Q. Implementation and efficient evaluation of backpropagation network training algorithms in parametric simulations of soil hydraulic conductivity curve. J. Hydrol. 2024, 636, 131302. [Google Scholar] [CrossRef]
- Cui, L.-X.; Cheng, Q.; So, P.S.; Tang, C.-S.; Tian, B.-G.; Li, C.-Y. Relationship between root characteristics and saturated hydraulic conductivity in a grassed clayey soil. J. Hydrol. 2024, 645, 132231. [Google Scholar] [CrossRef]
- Yuliana, Y.; Apriyono, A.; Kamchoom, V.; Boldrin, D.; Cheng, Q.; Tang, C.-S. Seasonal dynamics of root growth and desiccation cracks and their effects on soil hydraulic conductivity. Eng. Geol. 2025, 349, 107973. [Google Scholar] [CrossRef]
- Wang, Y.; Ma, R.; Zhu, G. Improved Prediction of Hydraulic Conductivity with a Soil Water Retention Curve That Accounts for Both Capillary and Adsorption Forces. Water Resour. Res. 2022, 58, e2021WR031297. [Google Scholar] [CrossRef]
- Albalasmeh, A.; Mohawesh, O.; Gharaibeh, M.; Deb, S.; Slaughter, L.; El Hanandeh, A. Artificial neural network optimization to predict saturated hydraulic conductivity in arid and semi-arid regions. CATENA 2022, 217, 106459. [Google Scholar] [CrossRef]
- Tong, Y.; Wang, Y.; Zhou, J.; Guo, X.; Wang, T.; Xu, Y.; Sun, H.; Zhang, P.; Li, Z.; Lauerwald, R. Expanding scales: Achieving prediction of van Genuchten model hydraulic parameters in deep profiles by incorporating broad in situ soil information in pedotransfer functions. J. Hydrol. 2025, 656, 132912. [Google Scholar] [CrossRef]
- Tartakovsky, A.M.; Marrero, C.O.; Perdikaris, P.; Tartakovsky, G.D.; Barajas-Solano, D. Physics-Informed Deep Neural Networks for Learning Parameters and Constitutive Relationships in Subsurface Flow Problems. Water Resour. Res. 2020, 56, e2019WR026731. [Google Scholar] [CrossRef]
- Arbor, A.; Schmidt, M.; Zhang, J.; Bulmer, C.; Filatow, D.; Kasraei, B.; Smukler, S.; Heung, B. Filling the gaps in soil data: A multi-model framework for addressing data gaps using pedotransfer functions and machine-learning with uncertainty estimates to estimate bulk density. CATENA 2024, 245, 108310. [Google Scholar] [CrossRef]
- Bayat, H.; Sedaghat, A.; Sinegani, A.A.S.; Gregory, A.S. Investigating the relationship between unsaturated hydraulic conductivity curve and confined compression curve. J. Hydrol. 2015, 522, 353–368. [Google Scholar] [CrossRef]
- Doussan, C.; Ruy, S. Prediction of unsaturated soil hydraulic conductivity with electrical conductivity. Water Resour. Res. 2009, 45, W10408. [Google Scholar] [CrossRef]
- Sedaghat, A.; Bayat, H.; Sinegani, A.S. Estimation of soil saturated hydraulic conductivity by artificial neural networks ensemble in smectitic soils. Eurasian Soil Sci. 2016, 49, 347–357. [Google Scholar] [CrossRef]
- Vereecken, H.; Weynants, M.; Javaux, M.; Pachepsky, Y.; Schaap, M.; Genuchten, M.T. Using pedotransfer functions to estimate the van Genuchten–Mualem soil hydraulic properties: A review. Vadose Zone J. 2010, 9, 795–820. [Google Scholar] [CrossRef]
- Rastgou, M.; Bayat, H.; Mansoorizadeh, M.; Gregory, A.S. Estimating the soil water retention curve: Comparison of multiple nonlinear regression approach and random forest data mining technique. Comput. Electron. Agric. 2020, 174, 105502. [Google Scholar] [CrossRef]
- Lee, J.; Cho, J.; Justice, D.; Kim, S. Towards Explainability of Classical Neural Network via Quantum Computing. In 2024 IEEE International Conference on Quantum Computing and Engineering (QCE); IEEE: New York, NY, USA, 2024; pp. 398–399. [Google Scholar]
- Farasati, M.; Seyedian, M.; Fathaabadi, A. Predicting soil hydraulic conductivity using random forest, SVM, and LSSVM models. Nat. Resour. Model. 2024, 37, e12407. [Google Scholar] [CrossRef]
- Sihag, P.; Mohsenzadeh Karimi, S.; Angelaki, A. Random forest, M5P and regression analysis to estimate the field unsaturated hydraulic conductivity. Appl. Water Sci. 2019, 9, 129. [Google Scholar] [CrossRef]
- Veloso, M.F.; Rodrigues, L.N.; Filho, E.I.F. Evaluation of machine learning algorithms in the prediction of hydraulic conductivity and soil moisture at the Brazilian Savannah. Geoderma Reg. 2022, 30, e00569. [Google Scholar] [CrossRef]
- Mouaddine, A.; Barakat, A.; Hajaj, S.; Mosaid, H.; Bouzekraoui, H.; Bni, Z.; Hilali, A. Predicting and mapping soil saturated hydraulic conductivity in the Beni Moussa irrigated perimeter (Tadla Plain, Morocco) using Random Forest machine learning model. Model. Earth Syst. Environ. 2025, 11, 82. [Google Scholar] [CrossRef]
- Nemes, A.; Schaap, M.; Leij, F.; Wösten, J. Description of the unsaturated soil hydraulic database UNSODA version 2.0. J. Hydrol. 2001, 251, 151–162. [Google Scholar] [CrossRef]
- van Genuchten, M.T. A closed-form equation for predicting the hydraulic conductivity of unsaturated soils. Soil Sci. Soc. Am. J. 1980, 44, 892–898. [Google Scholar] [CrossRef]
- Mualem, Y. A new model for predicting the hydraulic conductivity of unsaturated porous media. Water Resour. Res. 1976, 12, 513–522. [Google Scholar] [CrossRef]
- Schaap, M.G.; Van Genuchten, M.T. A modified Mualem-van Genuchten formulation for improved description of the hydraulic conductivity near saturation. Vadose Zone J. 2006, 5, 27–34. [Google Scholar] [CrossRef]
- Wösten, J.; Pachepsky, Y.A.; Rawls, W. Pedotransfer functions: Bridging the gap between available basic soil data and missing soil hydraulic characteristics. J. Hydrol. 2001, 251, 123–150. [Google Scholar] [CrossRef]
- Zhang, Y.; Schaap, M.G. Weighted recalibration of the Rosetta pedotransfer model with improved estimates of hydraulic parameter distributions and summary statistics (Rosetta3). J. Hydrol. 2017, 547, 39–53. [Google Scholar] [CrossRef]
- Mazziotta, M.; Pareto, A. Normalization methods for spatio-temporal analysis of environmental performance: Revisiting the Min–Max method. Environmetrics 2022, 33, e2730. [Google Scholar] [CrossRef]
- Breiman, L.; Adele, C.; Andy, L.; Wiener, M. randomForest: Breiman and Cutlers Random Forests for Classification and Regression, Version 4.7-1.2; CRAN: Vienna, Austria, 2024. [CrossRef]
- Yang, K.; Wang, J.; Zhao, G.; Wang, X.; Cong, W.; Yuan, M.; Luo, J.; Dong, X.; Wang, J.; Tao, J. NIDS-CNNRF integrating CNN and random forest for efficient network intrusion detection model. Internet Things 2025, 32, 101607. [Google Scholar] [CrossRef]
- Zhou, R.; Chen, J.; Cui, S.; Li, L.; Qian, J.; Zhao, H.; Huang, G. A data-driven framework to identify influencing factors for soil heavy metal contaminations using random forest and bivariate local Moran’s I: A case study. J. Environ. Manag. 2025, 375, 124172. [Google Scholar] [CrossRef]
- Probst, P.; Boulesteix, A.-L. To tune or not to tune the number of trees in random forest. J. Mach. Learn. Res. 2018, 18, 6673–6690. [Google Scholar]
- Butler, B.M. Mineral sources of aqua regia extractable base cations in Scottish soils interpreted from Cubist models trained with quantitative mineralogy data. Chem. Geol. 2020, 551, 119773. [Google Scholar] [CrossRef]
- Pouladi, N.; Møller, A.B.; Tabatabai, S.; Greve, M.H. Mapping soil organic matter contents at field level with Cubist, Random Forest and kriging. Geoderma 2019, 342, 85–92. [Google Scholar] [CrossRef]
- Thai, H.-T. Machine learning for structural engineering: A state-of-the-art review. In Structures; Elsevier: Amsterdam, The Netherlands, 2022; pp. 448–491. [Google Scholar]
- Yin, Z.; Jiao, J.; Xie, P.; Luo, H.; Wei, L. Multi-objective optimization control for shield cutter wear and cutting performance using LightGBM and enhanced NSGA-II. Autom. Constr. 2025, 171, 105957. [Google Scholar] [CrossRef]
- Marcot, B.G.; Hanea, A.M. What is an optimal value of k in k-fold cross-validation in discrete Bayesian network analysis? Comput. Stat. 2021, 36, 2009–2031. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Kuhn, M.; Steve, W.; Chris, K.; Coulter, N.; Quinlan, R.; Rulequest Research Pty Ltd. Cubist: Rule- and Instance-Based Regression Modeling, Version 0.6.0; CRAN: Vienna, Austria, 2025. [CrossRef]
- Shi, Y.; Guolin, K.; Damien, S.; James, L.; Qi, M.; Thomas, F.; Taifeng, W.; Wei, C.; Weidong, M.; Qiwei, Y.; et al. Lightgbm: Light Gradient Boosting Machine, Version 4.6.0; CRAN: Vienna, Austria, 2025. Available online: https://github.com/Microsoft/LightGBM (accessed on 1 June 2026).
- Han, J.; Guzman, J.A.; Chu, M.L. Prediction of gully erosion susceptibility through the lens of the SHapley Additive exPlanations (SHAP) method using a stacking ensemble model. J. Environ. Manag. 2025, 383, 125478. [Google Scholar] [CrossRef]
- Ungaro, F.; Calzolari, C.; Busoni, E. Development of pedotransfer functions using a group method of data handling for the soil of the Pianura Padano–Veneta region of North Italy: Water retention properties. Geoderma 2005, 124, 293–317. [Google Scholar] [CrossRef]
- Wadoux, A.M.J.C.; Walvoort, D.J.J.; Brus, D.J. An integrated approach for the evaluation of quantitative soil maps through Taylor and solar diagrams. Geoderma 2022, 405, 115332. [Google Scholar] [CrossRef]
- Pachepsky, Y.A.; Rawls, W.; Lin, H. Hydropedology and pedotransfer functions. Geoderma 2006, 131, 308–316. [Google Scholar] [CrossRef]
- Parhi, S.K.; Patro, S.K. Compressive strength prediction of PET fiber-reinforced concrete using Dolphin echolocation optimized decision tree-based machine learning algorithms. Asian J. Civ. Eng. 2024, 25, 977–996. [Google Scholar] [CrossRef]
- Probst, P.; Wright, M.N.; Boulesteix, A.L. Hyperparameters and tuning strategies for random forest. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2019, 9, e1301. [Google Scholar] [CrossRef]
- Barreñada, L.; Dhiman, P.; Timmerman, D.; Boulesteix, A.-L.; Van Calster, B. Understanding overfitting in random forest for probability estimation: A visualization and simulation study. Diagn. Progn. Res. 2024, 8, 14. [Google Scholar] [CrossRef]
- Zhou, Z.-H. Ensemble Learning. In Machine Learning; Zhou, Z.-H., Ed.; Springer: Singapore, 2021; pp. 181–210. [Google Scholar]
- Chen, J.; He, Y.; Liang, Y.; Wang, W.; Duan, X. Estimation of gross calorific value of coal based on the cubist regression model. Sci. Rep. 2024, 14, 23176. [Google Scholar] [CrossRef] [PubMed]
- Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: New York, NY, USA, 2013; Volume 26. [Google Scholar]
- Quinlan, J.R. Combining instance-based and model-based learning. In Proceedings of the Tenth International Conference on Machine Learning; Elsevier: Amsterdam, The Netherlands, 1993; pp. 236–243. [Google Scholar]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. In Proceedings of the Advances in Neural Information Processing Systems 30, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Rastgou, M.; Bayat, H.; Mansoorizadeh, M.; Gregory, A.S. Estimating Soil Water Retention Curve by Extreme Learning Machine, Radial Basis Function, M5 Tree and Modified Group Method of Data Handling Approaches. Water Resour. Res. 2022, 58, e2021WR031059. [Google Scholar] [CrossRef]
- Schaap, M.G.; Leij, F.J.; van Genuchten, M.T. Rosetta: A computer program for estimating soil hydraulic parameters with hierarchical pedotransfer functions. J. Hydrol. 2001, 251, 163–176. [Google Scholar] [CrossRef]
- Touil, S.; Degre, A.; Chabaca, M.N. Sensitivity analysis of point and parametric pedotransfer functions for estimating water retention of soils in Algeria. Soil 2016, 2, 647. [Google Scholar] [CrossRef]
- Horn, R.; Smucker, A. Structure formation and its consequences for gas and water transport in unsaturated arable and forest soils. Soil Tillage Res. 2005, 82, 5–14. [Google Scholar] [CrossRef]
- Schaap, M.G.; Leij, F.J. Improved prediction of unsaturated hydraulic conductivity with the Mualem-van Genuchten model. Soil Sci. Soc. Am. J. 2000, 64, 843–851. [Google Scholar] [CrossRef]
- Merdun, H.; Çınar, Ö.; Meral, R.; Apan, M. Comparison of artificial neural network and regression pedotransfer functions for prediction of soil water retention and saturated hydraulic conductivity. Soil Tillage Res. 2006, 90, 108–116. [Google Scholar] [CrossRef]
- Weynants, M.; Vereecken, H.; Javaux, M. Revisiting Vereecken pedotransfer functions: Introducing a closed-form hydraulic model. Vadose Zone J. 2009, 8, 86–95. [Google Scholar] [CrossRef]
- Kashani, M.H.; Ghorbani, M.A.; Shahabi, M.; Naganna, S.R.; Diop, L. Multiple AI model integration strategy—Application to saturated hydraulic conductivity prediction from easily available soil properties. Soil Tillage Res. 2020, 196, 104449. [Google Scholar] [CrossRef]
- Azadmard, B.; Mosaddeghi, M.R.; Ayoubi, S.; Chavoshi, E.; Raoof, M. Estimation of near-saturated soil hydraulic properties using hybrid genetic algorithm-artificial neural network. Ecohydrol. Hydrobiol. 2020, 20, 437–449. [Google Scholar] [CrossRef]
- Williams, C.G.; Ojuri, O.O. Predictive modelling of soils’ hydraulic conductivity using artificial neural network and multiple linear regression. SN Appl. Sci. 2021, 3, 152. [Google Scholar] [CrossRef]
- Granata, F.; Di Nunno, F.; Modoni, G. Hybrid Machine Learning Models for Soil Saturated Conductivity Prediction. Water 2022, 14, 1729. [Google Scholar] [CrossRef]
- Adjuik, T.A.; Nokes, S.E.; Montross, M.; Sama, M.P.; Wendroth, O. Predictor Selection and Machine Learning Regression Methods to Predict Saturated Hydraulic Conductivity from a Large Public Soil Database. J. ASABE 2023, 66, 285–296. [Google Scholar] [CrossRef]
- Moosavi, A.A.; Nematollahi, M.A.; Omidifard, M. Comparing machine learning approaches for estimating soil saturated hydraulic conductivity. PLoS ONE 2024, 19, e0310622. [Google Scholar] [CrossRef]
- Demir, S.; Sahin, E.K. The effectiveness of data pre-processing methods on the performance of machine learning techniques using RF, SVR, Cubist and SGB: A study on undrained shear strength prediction. Stoch. Environ. Res. Risk Assess. 2024, 38, 3273–3290. [Google Scholar] [CrossRef]
- Nguyen, H.; Bui, X.-N.; Tran, Q.-H.; Mai, N.-L. A new soft computing model for estimating and controlling blast-produced ground vibration based on Hierarchical K-means clustering and Cubist algorithms. Appl. Soft Comput. 2019, 77, 376–386. [Google Scholar] [CrossRef]
- Schaap, M.G.; Leij, F.J.; van Genuchten, M.T. Neural network analysis for hierarchical prediction of soil hydraulic properties. Soil Sci. Soc. Am. J. 1998, 62, 847–855. [Google Scholar] [CrossRef]












| PTF | Algorithm | Training | Testing | ||||
|---|---|---|---|---|---|---|---|
| RMSD ± SD | R2 ± SD | Per-Sample RMSD ± SD | R2 ± SD | Per-Sample MAE ± SD | 10-Fold CV RMSD ± SD | ||
| PTF1 | CU | 6.02 ± 11.17 | 0.96 ± 0.08 | 8.00 ± 18.13 | 0.95 ± 0.10 | 1.75 ± 3.78 | 7.89 ± 3.84 |
| RF | 4.84 ± 11.73 | 0.98 ± 0.05 | 8.63 ± 19.55 | 0.95 ± 0.10 | 2.59 ± 5.39 | 8.45 ± 3.58 | |
| LGBM | 6.65 ± 14.72 | 0.97 ± 0.07 | 8.62 ± 19.78 | 0.95 ± 0.10 | 1.27 ± 4.37 | 8.44 ± 4.38 | |
| PTF2 | CU | 6.72 ± 16.18 | 0.96 ± 0.10 | 6.92 ± 15.75 | 0.96 ± 0.09 | 1.56 ± 3.35 | 6.90 ± 3.30 |
| RF | 4.74 ± 11.62 | 0.98 ± 0.04 | 7.91 ± 18.48 | 0.95 ± 0.10 | 2.29 ± 5.29 | 7.80 ± 3.84 | |
| LGBM | 6.28 ± 13.73 | 0.97 ± 0.07 | 8.59 ± 18.50 | 0.95 ± 0.10 | 1.60 ± 4.18 | 8.50 ± 4.55 | |
| PTF3 | CU | 6.05 ± 15.18 | 0.97 ± 0.08 | 7.25 ± 17.69 | 0.96 ± 0.01 | 1.22 ± 2.81 | 7.22 ± 4.56 |
| RF | 4.06 ± 10.58 | 0.99 ± 0.04 | 6.91 ± 16.73 | 0.96 ± 0.09 | 2.32 ± 5.91 | 6.87 ± 2.95 | |
| LGBM | 4.93 ± 11.63 | 0.98 ± 0.06 | 7.66 ± 17.56 | 0.97 ± 0.09 | 1.40 ± 4.45 | 7.58 ± 3.63 | |
| PTF4 | CU | 5.57 ± 13.14 | 0.97 ± 0.05 | 6.49 ± 12.53 | 0.96 ± 0.08 | 1.06 ± 2.50 | 6.43 ± 3.55 |
| RF | 3.89 ± 9.89 | 0.99 ± 0.03 | 6.96 ± 16.64 | 0.96 ± 0.09 | 2.22 ± 5.43 | 6.76 ± 3.82 | |
| LGBM | 4.66 ± 10.62 | 0.98 ± 0.04 | 7.41 ± 15.97 | 0.97 ± 0.09 | 1.38 ± 3.32 | 7.33 ± 3.53 | |
| PTF | Cubist | RF | LightGBM |
|---|---|---|---|
| PTF1 | −0.54 ± 4.13 | 0.12 ± 5.98 | 0.13 ± 4.55 |
| PTF2 | −0.19 ± 3.69 | 0.60 ± 5.73 | −0.21 ± 4.47 |
| PTF3 | 0.16 ± 3.05 | 1.07 ± 6.26 | 0.54 ± 4.63 |
| PTF4 | −0.01 ± 2.63 | 0.94 ±5.76 | 0.24 ± 3.59 |
| Study | Database-Location | Number of Samples | Input Variables | Algorithm | R2 |
|---|---|---|---|---|---|
| [58] | UNSODA | 235 | θr, θs, α, n, Ks | ANN | 0.64 |
| [59] | Turkey | 276 | PSD, BD, and DPS | MLR | 0.64 |
| [60] | Belgium | 166 | ST, BD, and OC | MLR | 0.91 |
| [20] | India | 20 | ST, BD, and MC | RF | 0.82 |
| [61] | Iran | 245 | ST, BD, OM, EC, and pH | ANN | 0.91 |
| [62] | Iran | 212 | PSD, OM, EC, and BD | ANN | 0.80 |
| [63] | - | 144 | PSD, PI, and MDD, MC | ANN | 0.95 |
| [21] | Brazil | 188 | PSD, BD, PD, P, and PWP | RF | 0.41 |
| [64] | SWIG | 640 | ST, OM, BD, dg, σg, and θs | ANN, RF and SVR | 0.83 |
| [65] | FSCD | 4686 | ST, and BD | RF | 0.71 |
| [19] | Iran | - | ST, EC, pH, P, BD, and SAR | RF | 0.89 |
| [66] | Iran | 25 | θs, θi, BD, EC, pH, CCE, MWD, and GMD | ANN | 0.92 |
| [22] | Morocco | 72 | ST, MC, BD, P, OM, OC, and CC | RF | 0.94 |
| This study | UNSODA | 196 | ST, BD, FC, and PWP | Cubist | 0.96 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, P.; Rastgou, M.; Qi, Z.; Jiang, Q.; He, Y. Parametric Modeling of the Unsaturated Soil Hydraulic Conductivity Function Using Tree-Based and Ensemble Machine Learning Algorithms: A Comparative Analysis of Cubist, Random Forest, and LightGBM. Agronomy 2026, 16, 1116. https://doi.org/10.3390/agronomy16111116
Wang P, Rastgou M, Qi Z, Jiang Q, He Y. Parametric Modeling of the Unsaturated Soil Hydraulic Conductivity Function Using Tree-Based and Ensemble Machine Learning Algorithms: A Comparative Analysis of Cubist, Random Forest, and LightGBM. Agronomy. 2026; 16(11):1116. https://doi.org/10.3390/agronomy16111116
Chicago/Turabian StyleWang, Peng, Mostafa Rastgou, Zhiming Qi, Qianjing Jiang, and Yong He. 2026. "Parametric Modeling of the Unsaturated Soil Hydraulic Conductivity Function Using Tree-Based and Ensemble Machine Learning Algorithms: A Comparative Analysis of Cubist, Random Forest, and LightGBM" Agronomy 16, no. 11: 1116. https://doi.org/10.3390/agronomy16111116
APA StyleWang, P., Rastgou, M., Qi, Z., Jiang, Q., & He, Y. (2026). Parametric Modeling of the Unsaturated Soil Hydraulic Conductivity Function Using Tree-Based and Ensemble Machine Learning Algorithms: A Comparative Analysis of Cubist, Random Forest, and LightGBM. Agronomy, 16(11), 1116. https://doi.org/10.3390/agronomy16111116

