Machine Learning-Based Prediction of Heavy Metal Contamination and Ecological Risk in Karst Agricultural Soils
Abstract
1. Introduction
2. Methods and Materials
2.1. Study Area
2.2. Sample Collection and Analysis
2.3. Data Collection
2.4. Machine Learning Patterns
2.5. Potential Ecological Risk Models
2.6. Spatial Bivariate Correlation Analysis
2.7. Network Analysis
2.8. Data Processing
3. Results and Discussion
3.1. Statistical Evaluation of Heavy Metal Concentrations and Chemical Properties
3.2. Performance Comparison of Predictive Models
3.3. Feature Importance
3.4. Ecological Risk Assessment
3.5. Spatial Associations Between Ecological Risk Levels and Environmental Drivers
3.6. Network Pattern
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Timothy, N.A.; Williams, E.T. Environmental pollution by heavy metal: An overview. Int. J. Environ. Chem. 2019, 3, 72–82. [Google Scholar] [CrossRef]
- Wan, X.; Yang, J.; Song, W. Pollution status of agricultural land in China: Impact of land use and geographical position. Soil Water Res. 2018, 13, 234–242. [Google Scholar] [CrossRef]
- Zerga, B. Karst topography: Formation, processes, characteristics, landforms, degradation and restoration: A systematic review. Watershed Ecol. Environ. 2024, 6, 252–269. [Google Scholar] [CrossRef]
- De Waele, J.; Gutiérrez, F.; Parise, M.; Plan, L. Geomorphology and natural hazards in karst areas: A review. Geomorphology 2011, 134, 1–8. [Google Scholar] [CrossRef]
- Chen, J.; Yu, J.; Bai, X.; Zeng, Y.; Wang, J. Fragility of karst ecosystem and environment: Long-term evidence from lake sediments. Agric. Ecosyst. Environ. 2020, 294, 106862. [Google Scholar] [CrossRef]
- Wang, Y.F.; Liang, L.S.; Jing, J.L.; Luo, F.L.; Wang, A.N. Characteristic of drought and flood in the Dian-Qian-Gui karst areas based on TRMM-Z Index. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2020, 42, 917–924. [Google Scholar] [CrossRef]
- Micó, C.; Recatalá, L.; Peris, M.; Sánchez, J. Assessing heavy metal sources in agricultural soils of an European Mediterranean area by multivariate analysis. Chemosphere 2006, 65, 863–872. [Google Scholar] [CrossRef] [PubMed]
- Li, J.; He, M.; Han, W.; Gu, Y. Analysis and assessment on heavy metal sources in the coastal soils developed from alluvial deposits using multivariate statistical methods. J. Hazard. Mater. 2009, 164, 976–981. [Google Scholar] [CrossRef] [PubMed]
- Fu, S.; Wei, C.Y. Multivariate and spatial analysis of heavy metal sources and variations in a large old antimony mine, China. J. Soils Sediments 2013, 13, 106–116. [Google Scholar] [CrossRef]
- Wang, Y.; Li, Y.; Yang, S.; Liu, J.; Zheng, W.; Xu, J.; Liu, X. Source apportionment of soil heavy metals: A new quantitative framework coupling receptor model and stable isotopic ratios. Environ. Pollut. 2022, 314, 120291. [Google Scholar] [CrossRef]
- Ha, H.; Olson, J.R.; Bian, L.; Rogerson, P.A. Analysis of heavy metal sources in soil using kriging interpolation on principal components. Environ. Sci. Technol. 2014, 48, 4999–5007. [Google Scholar] [CrossRef]
- Huang, Y.; Deng, M.; Wu, S.; Japenga, J.; Li, T.; Yang, X.; He, Z. A modified receptor model for source apportionment of heavy metal pollution in soil. J. Hazard. Mater. 2018, 354, 161–169. [Google Scholar] [CrossRef]
- Bonten, L.T.; Groenenberg, J.E.; Weng, L.; van Riemsdijk, W.H. Use of speciation and complexation models to estimate heavy metal sorption in soils. Geoderma 2008, 146, 303–310. [Google Scholar] [CrossRef]
- Badenko, V.; Kurtener, D.; Krueger, E. Utilization of fuzzy set theory for interpretation of data of investigations of soil contamination by heavy metals. Eur. Agrophysical J. 2014, 1, 25–41. [Google Scholar] [CrossRef]
- Yang, S.; Taylor, D.; Yang, D.; He, M.; Liu, X.; Xu, J. A synthesis framework using machine learning and spatial bivariate analysis to identify drivers and hotspots of heavy metal pollution of agricultural soils. Environ. Pollut. 2021, 287, 117611. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Zhao, Y.; Xu, S. Application of VNIR and machine learning technologies to predict heavy metals in soil and pollution indices in mining areas. J. Soils Sediments 2022, 22, 2777–2791. [Google Scholar] [CrossRef]
- Zhang, H.; Yin, A.; Yang, X.; Fan, M.; Shao, S.; Wu, J.; Gao, C. Use of machine-learning and receptor models for prediction and source apportionment of heavy metals in coastal reclaimed soils. Ecol. Indic. 2021, 122, 107233. [Google Scholar] [CrossRef]
- Wang, H.; Yilihamu, Q.; Yuan, M.; Bai, H.; Xu, H.; Wu, J. Prediction models of soil heavy metal(loid)s concentration for agricultural land in Dongli: A comparison of regression and random forest. Ecol. Indic. 2020, 119, 106801. [Google Scholar] [CrossRef]
- Zheng, J.; Wang, P.; Shi, H.; Zhuang, C.; Deng, Y.; Yang, X.; Xiao, R. Quantitative source apportionment and driver identification of soil heavy metals using advanced machine learning techniques. Sci. Total Environ. 2023, 873, 162371. [Google Scholar] [CrossRef]
- Li, X.; Gu, H.; Tang, R.; Zou, B.; Liu, X.; Ou, H.; Wen, B. A Fusion XGBoost Approach for Large-Scale Monitoring of Soil Heavy Metal in Farmland Using Hyperspectral Imagery. Agronomy 2025, 15, 676. [Google Scholar] [CrossRef]
- Zhang, Q.; Wang, C. Natural and human factors affect the distribution of soil heavy metal pollution: A review. Water Air Soil Pollut. 2020, 231, 350. [Google Scholar] [CrossRef]
- Cui, X.; Zhang, J.; Wang, X.; Pan, M.; Lin, Q.; Khan, K.Y.; Chen, G. A review on the thermal treatment of heavy metal hyperaccumulator: Fates of heavy metals and generation of products. J. Hazard. Mater. 2021, 405, 123832. [Google Scholar] [CrossRef]
- Liu, P.; Wu, Q.; Hu, W.; Tian, K.; Huang, B.; Zhao, Y. Effects of atmospheric deposition on heavy metals accumulation in agricultural soils: Evidence from field monitoring and Pb isotope analysis. Environ. Pollut. 2023, 330, 121740. [Google Scholar] [CrossRef]
- Zheng, F.; Guo, X.; Tang, M.; Zhu, D.; Wang, H.; Yang, X.; Chen, B. Variation in pollution status, sources, and risks of soil heavy metals in regions with different levels of urbanization. Sci. Total Environ. 2023, 866, 161355. [Google Scholar] [CrossRef]
- Dragović, S.; Mihailović, N.; Gajić, B. Heavy metals in soils: Distribution, relationship with soil characteristics and radionuclides and multivariate assessment of contamination sources. Chemosphere 2008, 72, 491–495. [Google Scholar] [CrossRef]
- Lu, A.X.; Wang, J.H.; Pan, L.G.; Han, P.; Han, Y. Determination of Cr, Cu, Zn, Pb and As in soil by field portable X-ray fluorescence spectrometry. Spectrosc. Spectr. Anal. 2010, 30, 2848–2852. [Google Scholar]
- Hakanson, L. An ecological risk index for aquatic pollution control.a sedimentological approach. Water Res. 1980, 14, 975–1001. [Google Scholar] [CrossRef]
- Zhang, H.; Cai, A.; Wang, X.; Wang, L.; Wang, Q.; Wu, X.; Ma, Y. Risk assessment and source apportionment of heavy metals in soils from Handan City. Appl. Sci. 2021, 11, 9615. [Google Scholar] [CrossRef]
- China Environmental Monitoring Centre. Values of Soil Elements in China; China Environmental Science Press: Beijing, China, 1990. [Google Scholar]
- Huang, G.; Wang, X.; Chen, D.; Wang, Y.; Zhu, S.; Zhang, T.; Liao, L.; Tian, Z.; Wei, N. A hybrid data-driven framework for diagnosing contributing factors for soil heavy metal contaminations using machine learning and spatial clustering analysis. J. Hazard. Mater. 2022, 437, 129324. [Google Scholar] [CrossRef] [PubMed]
- Hu, B.; Shao, S.; Fu, Z.; Li, Y.; Ni, H.; Chen, S.; Zhou, Y.; Jin, B.; Shi, Z. Identifying heavy metal pollution hot spots in soil-rice systems: A case study in South of Yangtze River Delta, China. Sci. Total Environ. 2019, 658, 614–625. [Google Scholar] [CrossRef] [PubMed]
- Zhang, X.Y.; Sui, Y.Y.; Zhang, X.D.; Meng, K.; Herbert, S.J. Spatial variability of nutrient properties in black soil of Northeast China. Pedosphere 2007, 17, 19–29. [Google Scholar] [CrossRef]
- Niazkar, M.; Menapace, A.; Brentan, B.; Piraei, R.; Jimenez, D.; Dhawan, P.; Righetti, M. Applications of XGBoost in water resources engineering: A systematic literature review (Dec 2018–May 2023). Environ. Model. Softw. 2024, 174, 105971. [Google Scholar] [CrossRef]
- Ye, M.; Zhu, L.; Li, X.; Ke, Y.; Huang, Y.; Chen, B.; Feng, H. Estimation of the soil arsenic concentration using a geographically weighted XGBoost model based on hyperspectral data. Sci. Total Environ. 2023, 858, 159798. [Google Scholar] [CrossRef]
- Li, S.; Zhang, Y. An integrated model of log-normal ordinary Kriging interpolation-based source-specific human health risk assessment (LSR) for soil heavy metal pollution: Insights from an abandoned industrial area in China. Environ. Monit. Assess. 2025, 197, 1204. [Google Scholar] [CrossRef]
- Jia, X.; Fu, T.; Hu, B.; Shi, Z.; Zhou, L.; Zhu, Y. Identification of the potential risk areas for soil heavy metal pollution based on the source-sink theory. J. Hazard. Mater. 2020, 393, 122424. [Google Scholar] [CrossRef]
- Shi, T.; Zhang, J.; Shen, W.; Wang, J.; Li, X. Machine learning can identify the sources of heavy metals in agricultural soil: A case study in northern Guangdong Province, China. Ecotoxicol. Environ. Saf. 2022, 245, 114107. [Google Scholar] [CrossRef] [PubMed]
- Fest, E.P.M.J.; Temminghoff, E.J.M.; Comans, R.N.J.; van Riemsdijk, W.H. Partitioning of organic matter and heavy metals in a sandy soil: Effects of extracting solution, solid to liquid ratio and pH. Geoderma 2008, 146, 66–74. [Google Scholar] [CrossRef]
- Ding, Q.; Cheng, G.; Wang, Y.; Zhuang, D. Effects of natural factors on the spatial distribution of heavy metals in soils surrounding mining regions. Sci. Total Environ. 2017, 578, 577–585. [Google Scholar] [CrossRef] [PubMed]
- He, Z.L.; Yang, X.E.; Stoffella, P.J. Trace elements in agroecosystems and impacts on the environment. J. Trace Elem. Med. Biol. 2005, 19, 125–140. [Google Scholar] [CrossRef]
- Kwiatkowska-Malina, J. Functions of organic matter in polluted soils: The effect of organic amendments on phytoavailability of heavy metals. Appl. Soil Ecol. 2018, 123, 542–545. [Google Scholar] [CrossRef]
- Zhai, M.; Kampunzu, H.A.B.; Modisi, M.P.; Totolo, O. Distribution of heavy metals in Gaborone urban soils (Botswana) and its relationship to soil pollution and bedrock composition. Environ. Geol. 2003, 45, 171–180. [Google Scholar] [CrossRef]
- Andra, S.S.; Makris, K.C.; Charisiadis, P.; Costa, C.N. Co-occurrence profiles of trace elements in potable water systems: A case study. Environ. Monit. Assess. 2014, 186, 7307–7320. [Google Scholar] [CrossRef] [PubMed]
- Shah, S.A.; Shao, Y.; Zhang, Y.; Zhao, H.; Zhao, L. Texture and trace element geochemistry of quartz: A review. Minerals 2022, 12, 1042. [Google Scholar] [CrossRef]








| Min | Mean | Median | Max | SD | CV | Skewness | Kurtosis | |
|---|---|---|---|---|---|---|---|---|
| Cd (μg kg−1) | 26 | 2491 | 1603 | 45,407 | 2860 | 1.15 | 3.39 | 20.77 |
| Hg (μg kg−1) | 46 | 556 | 479 | 73,997 | 1016 | 1.83 | 51.28 | 3374 |
| As (mg kg−1) | 1.1 | 38.6 | 36.1 | 367 | 27.02 | 0.7 | 2.26 | 12.66 |
| Cr (mg kg−1) | 20 | 254 | 220 | 1247 | 149 | 0.59 | 0.86 | 0.8 |
| Cu (mg kg−1) | 4.3 | 48.25 | 45.6 | 191 | 19.69 | 0.41 | 1 | 2.24 |
| Ni (mg kg−1) | 6 | 69.8 | 62.8 | 667 | 39.3 | 0.56 | 2.59 | 19.51 |
| Pb (mg kg−1) | 5.2 | 51.4 | 48.5 | 216 | 23.36 | 0.45 | 0.42 | −0.43 |
| Zn (mg kg−1) | 23.9 | 222 | 195 | 1323 | 124 | 0.56 | 1.24 | 2.39 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, Z.; Wu, J.; Li, J.; Zheng, G.; Qin, J.; Gu, W.; Li, J. Machine Learning-Based Prediction of Heavy Metal Contamination and Ecological Risk in Karst Agricultural Soils. Land 2026, 15, 304. https://doi.org/10.3390/land15020304
Liu Z, Wu J, Li J, Zheng G, Qin J, Gu W, Li J. Machine Learning-Based Prediction of Heavy Metal Contamination and Ecological Risk in Karst Agricultural Soils. Land. 2026; 15(2):304. https://doi.org/10.3390/land15020304
Chicago/Turabian StyleLiu, Zhe, Juan Wu, Jie Li, Guodong Zheng, Jianxun Qin, Wenbo Gu, and Jiacai Li. 2026. "Machine Learning-Based Prediction of Heavy Metal Contamination and Ecological Risk in Karst Agricultural Soils" Land 15, no. 2: 304. https://doi.org/10.3390/land15020304
APA StyleLiu, Z., Wu, J., Li, J., Zheng, G., Qin, J., Gu, W., & Li, J. (2026). Machine Learning-Based Prediction of Heavy Metal Contamination and Ecological Risk in Karst Agricultural Soils. Land, 15(2), 304. https://doi.org/10.3390/land15020304
