Machine Learning-Based Analysis of Arsenic Migration from Soil to Highland Barley in High Geological Background Areas
Abstract
1. Introduction
2. Materials and Methods
2.1. Overview of the Study Area and Sample Collection
2.2. Sample Pretreatment and Chemical Analysis
2.2.1. Sample Pretreatment
2.2.2. Determination of pH, Eh, SOM and Total Metal Content
2.2.3. Measurement of Chemically Extractable and Bioavailable Arsenic in Soil
2.3. Construction of HB-As Prediction Model
2.3.1. Model Selection
2.3.2. Model Development
2.3.3. Model Evaluation
2.3.4. Model Explanation
3. Results
3.1. Soil Properties and as Pollution Status
3.2. Correlation Analysis Between HB-As and Related Influencing Factors
3.3. Performance of Machine Learning Prediction Models
4. Discussion
4.1. The Paradox of High Soil as but Low Grain Accumulation: Immobilization Mechanisms
4.2. Superiority of Dynamic Bioavailability Assessment (DGT) over Chemical Extraction
4.3. Deciphering Factor Influence: The Power of ML Interpretation (SHAP and PDP)
4.4. Limitations and Future Perspectives
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- de Conti, A.; Madia, F.; Schubauer-Berigan, M.K.; Benbrahim-Tallaa, L. Carcinogenicity of some metals evaluated by the A synopsis of the evaluations of arsenic, cadmium, cobalt, and antimony. Toxicol. Appl. Pharmacol. 2025, 504, 117506. [Google Scholar] [CrossRef]
- Zhang, R.; Yang, Y.; Deji, Y.; Li, H.; Li, Y.; Nima, C.; Zhao, S.; Gong, H. Factors influencing the spatial distribution and individual variation in urinary fluoride levels in Tibet, China. Chemosphere 2023, 326, 138493. [Google Scholar] [CrossRef]
- Shi, W.; Qiao, F.W.; Zhou, L. Identification of Ecological Risk Zoning on Qinghai-Tibet Plateau from the Perspective of Ecosystem Service Supply and Demand. Sustainability 2021, 13, 5366. [Google Scholar] [CrossRef]
- Yu, C.R.; Sun, Y.F.; Zhong, X.Y.; Yu, Z.B.; Li, X.Y.; Yi, P.; Jin, H.J.; Luo, D.L. Arsenic in permafrost-affected rivers and lakes of Tibetan Plateau, China. Environ. Pollut. Bioavailab. 2019, 31, 226–232. [Google Scholar] [CrossRef]
- Xiao, F.J.; Cui, X.M.; Zhao, Y.Z.; Fu, J.J.; Yu, T.; Bu, D.; Zhang, Q.Y. Concentration, spatial distribution, and source apportionment of heavy metals in agricultural soils from the Yarlung Zangbo River Basin, Tibetan Plateau. Environ. Earth Sci. 2023, 82, 577. [Google Scholar] [CrossRef]
- Zhang, X.P.P.; Deng, W.; Yang, X.M.M. The background concentrations of 13 soil trace elements and their relationships to parent materials and vegetation in Xizang (Tibet), China. J. Asian Earth Sci. 2002, 21, 167–174. [Google Scholar] [CrossRef]
- Yin, Y.Y.; Wang, J.A.; Leng, G.Y.; Zhao, J.T.; Wang, L.; Ma, W.D. Future potential distribution and expansion trends of highland barley under climate change in the Qinghai-Tibet plateau (QTP). Ecol. Indic. 2022, 136, 108702. [Google Scholar] [CrossRef]
- Ma, W.D.; Jia, W.; Zhou, Y.T.; Liu, F.G.; Wang, J.A. Prediction of Suitable Future Natural Areas for Highland Barley on the Qinghai-Tibet Plateau under Representative Concentration Pathways (RCPs). Sustainability 2022, 14, 6617. [Google Scholar] [CrossRef]
- Zeng, F.R.; Ali, S.; Zhang, H.T.; Ouyang, Y.B.; Qiu, B.Y.; Wu, F.B.; Zhang, G.P. The influence of pH and organic matter content in paddy soil on heavy metal availability and their uptake by rice plants. Environ. Pollut. 2011, 159, 84–91. [Google Scholar] [CrossRef]
- Antoniadis, V.; Robinson, J.S.; Alloway, B.J. Effects of short-term pH fluctuations on cadmium, nickel, lead, and zinc availability to ryegrass in a sewage sludge-amended field. Chemosphere 2008, 71, 759–764. [Google Scholar] [CrossRef]
- Huang, R.Q.; Gao, S.F.; Wang, W.L.; Staunton, S.; Wang, G. Soil arsenic availability and the transfer of soil arsenic to crops in suburban areas in Fujian Province, southeast China. Sci. Total Environ. 2006, 368, 531–541. [Google Scholar] [CrossRef]
- Dai, Y.C.; Lv, J.L.; Liu, K.; Zhao, X.Y.; Cao, Y.F. Major controlling factors and prediction models for arsenic uptake from soil to wheat plants. Ecotoxicol. Environ. Saf. 2016, 130, 256–262. [Google Scholar] [CrossRef]
- Majumder, S.; Banik, P. Insights into the comparison of machine learning models on rice grain arsenic prediction: Interplay of rice cultivation systems and soil environmental factors. Environ. Pollut. 2025, 381, 126646. [Google Scholar] [CrossRef] [PubMed]
- Hu, B.F.; Xue, J.; Zhou, Y.; Shao, S.; Fu, Z.Y.; Li, Y.; Chen, S.C.; Qi, L.; Shi, Z. Modelling bioaccumulation of heavy metals in soil-crop ecosystems and identifying its controlling factors using machine learning. Environ. Pollut. 2020, 262, 114308. [Google Scholar] [CrossRef]
- Lu, X.S.; Sun, L.; Zhang, Y.; Du, J.Y.; Wang, G.Q.; Huang, X.H.; Li, X.Z.; Wang, X.Z. Predicting Cd accumulation in crops and identifying nonlinear effects of multiple environmental factors based on machine learning models. Sci. Total Environ. 2024, 951, 175787. [Google Scholar] [CrossRef] [PubMed]
- Palansooriya, K.N.; Li, J.; Dissanayake, P.D.; Suvarna, M.; Li, L.; Yuan, X.; Sarkar, B.; Tsang, D.C.W.; Rinklebe, J.; Wang, X.; et al. Prediction of Soil Heavy Metal Immobilization by Biochar Using Machine Learning. Environ. Sci. Technol. 2022, 56, 4187–4198. [Google Scholar] [CrossRef]
- Yang, H.; Huang, K.; Zhang, K.; Weng, Q.; Zhang, H.; Wang, F. Predicting Heavy Metal Adsorption on Soil with Machine Learning and Mapping Global Distribution of Soil Adsorption Capacities. Environ. Sci. Technol. 2021, 55, 14316–14328. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, C.C.; Lv, D.L.; Zhang, Q.; Yin, P.; Guo, R.Z.; Chen, Q.S.; Cai, Y.M.; Zhao, Y.J. Diffusive gradient in thin films combined with machine learning to discern the accumulation characteristics and driving factors of Cd and Cu in soil-rice systems. J. Hazard. Mater. 2025, 495, 138924. [Google Scholar] [CrossRef]
- Yang, Y.P.; Wang, P.; Yan, H.J.; Zhang, H.M.; Cheng, W.D.; Duan, G.L.; Zhu, Y.G. NH4H2PO4-extractable arsenic provides a reliable predictor for arsenic accumulation and speciation in pepper fruits (Capsicum annum L.). Environ. Pollut. 2019, 251, 651–658. [Google Scholar] [CrossRef]
- Gao, G.; Zhang, C.; Zuo, J.; Sun, J.; Liu, W.; Wang, Z.; Zhao, Y. Development and application of a layered double hydroxides binding layer based DGT technique for simultaneous sampling low molecular weight organic acids and arsenic in soil. Talanta 2025, 300, 129255. [Google Scholar] [CrossRef] [PubMed]
- Ke, B.; Nguyen, H.; Bui, X.N.; Bui, H.B.; Choi, Y.; Zhou, J.; Moayedi, H.; Costache, R.; Nguyen-Trang, T. Predicting the sorption efficiency of heavy metal based on the biochar characteristics, metal sources, and environmental conditions using various novel hybrid machine learning models. Chemosphere 2021, 276, 130204. [Google Scholar] [CrossRef]
- Jia, X.L.; Hu, B.F.; Marchant, B.; Zhou, L.Q.; Shi, Z.; Zhu, Y.W. A methodological framework for identifying potential sources of soil heavy metal pollution based on machine learning: A case study in the Yangtze Delta, China. Environ. Pollut. 2019, 250, 601–609. [Google Scholar] [CrossRef]
- Guo, G.M.; Lin, L.Y.; Jin, F.M.; Masek, O.; Huang, Q. Application of heavy metal immobilization in soil by biochar using machine learning. Environ. Res. 2023, 231, 116098. [Google Scholar] [CrossRef] [PubMed]
- Li, H.M.; Wang, J.H.; Wang, Q.G.; Tian, C.H.; Qian, X.; Leng, X.Z. Magnetic Properties as a Proxy for Predicting Fine-Particle-Bound Heavy Metals in a Support Vector Machine Approach. Environ. Sci. Technol. 2017, 51, 6927–6935. [Google Scholar] [CrossRef] [PubMed]
- Zou, Z.Y.; Wang, Q.L.; Wu, Q.S.; Li, M.H.; Zhen, J.B.; Yuan, D.Y.; Zhou, M.; Xu, C.; Wang, Y.C.; Zhao, Y.P.; et al. Inversion of heavy metal content in soil using hyperspectral characteristic bands-based machine learning method. J. Environ. Manag. 2024, 355, 120503. [Google Scholar] [CrossRef]
- Peng, Y.Z.; Yu, G.I. Model multifactor analysis of soil heavy metal pollution on plant germination in Southeast Chengdu, China: Based on redundancy analysis, factor detector, and XGBoost-SHAP. Sci. Total Environ. 2024, 954, 176605. [Google Scholar] [CrossRef]
- Ling, S.X.; Li, X.N.; Zhang, S.M.; Wei, W.; Liu, Z.X.; Feng, J.J.; Wang, J.W.; Sun, C.W.; Zhang, G. Machine and deep learning-driven geospatial prediction modelling of soil cadmium and nickel with pollution risk mapping underlaid by black shale in Qianjiang County, China. J. Hydrol. Reg. Stud. 2025, 62, 102857. [Google Scholar] [CrossRef]
- Qiu, Y.F.; Zhou, S.L.; Zhang, C.C.; Qin, W.D.; Lv, C.X.; Zou, M.M. Identification of potentially contaminated areas of soil microplastic based on machine learning: A case study in Taihu Lake region, China. Sci. Total Environ. 2023, 877, 162891. [Google Scholar] [CrossRef]
- Li, J.; Pan, L.J.; Suvarna, M.; Tong, Y.W.; Wang, X.N. Fuel properties of hydrochar and pyrochar: Prediction and exploration with machine learning. Appl. Energy 2020, 269, 115166. [Google Scholar] [CrossRef]
- Zhao, B.; Zhu, W.X.; Hao, S.F.; Hua, M.; Liao, Q.L.; Jing, Y.; Liu, L.; Gu, X.Y. Prediction heavy metals accumulation risk in rice using machine learning and mapping pollution risk. J. Hazard. Mater. 2023, 448, 130879. [Google Scholar] [CrossRef]
- Zhang, Z.C.; Xu, B.; Xu, W.M.; Wang, F.; Gao, J.; Li, Y.; Li, M.; Feng, Y.C.; Shi, G.L. Machine learning combined with the PMF model reveal the synergistic effects of sources and meteorological factors on PM pollution. Environ. Res. 2022, 212, 113322. [Google Scholar] [CrossRef]
- Li, W.; Huang, G.; Tang, N.; Lu, P.; Jiang, L.; Lv, J.; Qin, Y.; Lin, Y.; Xu, F.; Lei, D. Effects of heavy metal exposure on hypertension: A machine learning modeling approach. Chemosphere 2023, 337, 139435. [Google Scholar] [CrossRef] [PubMed]
- Li, R.X.; Zhang, R.; Yang, Y.; Li, Y.H. Accumulation characteristics, driving factors, and model prediction of cadmium in soil-highland barley system on the Tibetan Plateau. J. Hazard. Mater. 2023, 453, 131407. [Google Scholar] [CrossRef]
- Burachevskaya, M.; Minkina, T.; Fedorenko, A.; Fedorenko, G.; Chernikova, N.; Rajput, V.D.; Mandzhieva, S.; Bauer, T. Accumulation, translocation, and toxicity of arsenic in barley grown in contaminated soil. Plant Soil 2021, 467, 91–106. [Google Scholar] [CrossRef]
- Xue, Q.; Ran, Y.; Tan, Y.Z.; Peacock, C.L.; Du, H.H. Arsenite and arsenate binding to ferrihydrite organo-mineral coprecipitate: Implications for arsenic mobility and fate in natural environments. Chemosphere 2019, 224, 103–110. [Google Scholar] [CrossRef] [PubMed]
- Dai, Y.C.; Nasir, M.; Zhang, Y.L.; Gao, J.K.; Lv, Y.M.; Lv, J.L. Comparison of DGT with traditional extraction methods for assessing arsenic bioavailability to in different soils. Chemosphere 2018, 191, 183–189. [Google Scholar] [CrossRef] [PubMed]
- Ye, T.C.; Liu, T.C.; Yi, H.L.; Du, J.J.; Wang, Y.; Xiao, T.F.; Cui, J.L. arsenic immobilization by natural iron (oxyhydr)oxide precipitates in As-contaminated groundwater irrigation canals. J. Environ. Sci. 2025, 153, 143–157. [Google Scholar] [CrossRef]
- Hartley, W.; Dickinson, N.M.; Riby, P.; Lepp, N.W. Arsenic mobility in brownfield soils amended with green waste compost or biochar and planted with. Environ. Pollut. 2009, 157, 2654–2662. [Google Scholar] [CrossRef]
- Yang, W.C.; Li, J.X.; Nie, K.; Zhao, P.W.; Xia, H.; Li, Q.; Liao, Q.; Li, Q.Z.; Dong, C.H.; Yang, Z.H.; et al. Machine learning-based identification of critical factors for cadmium accumulation in rice grains. Environ. Geochem. Health 2025, 47, 2. [Google Scholar] [CrossRef]
- Li, Y.; Yu, Y.L.; Ding, S.Y.; Dai, W.J.; Shi, R.G.; Cui, G.Y.; Li, X.D. Application of machine learning in soil heavy metals pollution assessment in the southeastern Tibetan plateau. Sci. Rep. 2025, 15, 13579. [Google Scholar] [CrossRef]
- Pan, Q.X.; Liu, C.; Zhang, C.; Zheng, H.L.; Ding, W.; Li, H. Mn/Fe bimetallic MOF-driven sulfite activation for synergistic non-radical oxidation and chemisorption of arsenite: Involvement of high-valent Fe (IV)/Mn(V) species. J. Hazard. Mater. 2025, 496, 139234. [Google Scholar] [CrossRef]
- Zhang, L.Q.; Hu, J.; Li, C.; Chen, Y.Y.; Zheng, L.G.; Ding, D.; Shan, S.F. Synergistic mechanism of iron manganese supported biochar for arsenic remediation and enzyme activity in contaminated soil. J. Environ. Manag. 2023, 347, 119127. [Google Scholar] [CrossRef]
- Sun, J.; Li, B.; Liu, Y.; Wu, Z.Q.; Shi, L.; Zhou, X.; Wu, P.C.; Yao, K.S. Detection of composite heavy metal content in rape leaf using feature clustering and hyperspectral imaging technology. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2025, 340, 126387. [Google Scholar] [CrossRef]
- Li, X.; Chen, B.; Chen, W.S.; Yin, Y.L.; Huang, L.X.; Wei, L.; Awad, M.; Liu, Z.Z. Predictive Machine Learning Model to Assess the Adsorption Efficiency of Biochar-Heavy Metals for Effective Remediation of Soil-Plant Environment. Toxics 2024, 12, 575. [Google Scholar] [CrossRef] [PubMed]
- Mishra, D.; Chakrabortty, R.; Sen, K.; Pal, S.C.; Mondal, N.K. Groundwater vulnerability assessment of elevated arsenic in Gangetic plain of West Bengal, India; Using primary information, lithological transport, state-of-the-art approaches. J. Contam. Hydrol. 2023, 256, 104195. [Google Scholar] [CrossRef] [PubMed]
- Honma, T.; Ohba, H.; Kaneko-Kadokura, A.; Makino, T.; Nakamura, K.; Katou, H. Optimal Soil Eh, pH, and Water Management for Simultaneously Minimizing Arsenic and Cadmium Concentrations in Rice Grains. Environ. Sci. Technol. 2016, 50, 4178–4185. [Google Scholar] [CrossRef] [PubMed]
- Norton, G.J.; Adomako, E.E.; Deacon, C.M.; Carey, A.M.; Price, A.H.; Meharg, A.A. Effect of organic matter amendment, arsenic amendment and water management regime on rice grain arsenic species. Environ. Pollut. 2013, 177, 38–47. [Google Scholar] [CrossRef]
- Fischel, M.H.H.; Clarke, C.E.; Sparks, D.L. Arsenic sorption and oxidation by natural manganese-oxide-enriched soils: Reaction kinetics respond to varying environmental conditions. Geoderma 2024, 441, 116715. [Google Scholar] [CrossRef]
- Wang, J.J.; Bai, L.Y.; Zeng, X.B.; Su, S.M.; Wang, Y.N.; Wu, C.X. Assessment of arsenic availability in soils using the diffusive gradients in thin films (DGT) technique-a comparison study of DGT and classic extraction methods. Environ. Sci. Process. Impacts 2014, 16, 2355–2361. [Google Scholar] [CrossRef]
- Chen, R.; Cheng, N.U.; Ding, G.Y.; Ren, F.M.; Lv, J.G.; Shi, R.G. Predictive model for cadmium uptake by maize and rice grains on the basis of bioconcentration factor and the diffusive gradients in thin-films technique. Environ. Pollut. 2021, 289, 117841. [Google Scholar] [CrossRef]
- Faria, M.C.S.; Rosemberg, R.S.; Bomfeti, C.A.; Monteiro, D.S.; Barbosa, F.; Oliveira, L.C.A.; Rodriguez, M.; Pereira, M.C.; Rodrigues, J.L. Arsenic removal from contaminated water by ultrafine δ-FeOOH adsorbents. Chem. Eng. J. 2014, 237, 47–54. [Google Scholar] [CrossRef]








| Model | Dataset | MAE | RMSE | R2 | |
|---|---|---|---|---|---|
| S1 | MLR | Training | 0.0564 | 0.1242 | 0.254 |
| Test | 0.0578 | 0.1218 | 0.168 | ||
| SVR | Training | 0.0346 | 0.0997 | 0.587 | |
| Test | 0.0381 | 0.1049 | 0.541 | ||
| RF | Training | 0.0233 | 0.0650 | 0.699 | |
| Test | 0.0267 | 0.0753 | 0.594 | ||
| S2 | MLR | Training | 0.0588 | 0.1212 | 0.385 |
| Test | 0.0607 | 0.1211 | 0.309 | ||
| SVR | Training | 0.0255 | 0.0724 | 0.737 | |
| Test | 0.0306 | 0.0788 | 0.684 | ||
| RF | Training | 0.0278 | 0.0687 | 0.736 | |
| Test | 0.0327 | 0.0811 | 0.623 | ||
| S3 | MLR | Training | 0.0556 | 0.1169 | 0.344 |
| Test | 0.0576 | 0.1171 | 0.223 | ||
| SVR | Training | 0.0250 | 0.0726 | 0.723 | |
| Test | 0.0300 | 0.0800 | 0.660 | ||
| RF | Training | 0.0261 | 0.0648 | 0.743 | |
| Test | 0.0323 | 0.0790 | 0.611 | ||
| S4 | MLR | Training | 0.0596 | 0.1204 | 0.364 |
| Test | 0.0621 | 0.1217 | 0.238 | ||
| SVR | Training | 0.0216 | 0.0680 | 0.718 | |
| Test | 0.0264 | 0.0700 | 0.683 | ||
| RF | Training | 0.0274 | 0.0640 | 0.756 | |
| Test | 0.0325 | 0.0760 | 0.651 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zuo, J.; Zhang, C.; Liang, X.; Cai, Y.; Li, Y.; Hu, Y.; Zhao, Y. Machine Learning-Based Analysis of Arsenic Migration from Soil to Highland Barley in High Geological Background Areas. Sustainability 2026, 18, 1782. https://doi.org/10.3390/su18041782
Zuo J, Zhang C, Liang X, Cai Y, Li Y, Hu Y, Zhao Y. Machine Learning-Based Analysis of Arsenic Migration from Soil to Highland Barley in High Geological Background Areas. Sustainability. 2026; 18(4):1782. https://doi.org/10.3390/su18041782
Chicago/Turabian StyleZuo, Jiahui, Chuangchuang Zhang, Xuefeng Liang, Yanming Cai, Ye Li, Yandi Hu, and Yujie Zhao. 2026. "Machine Learning-Based Analysis of Arsenic Migration from Soil to Highland Barley in High Geological Background Areas" Sustainability 18, no. 4: 1782. https://doi.org/10.3390/su18041782
APA StyleZuo, J., Zhang, C., Liang, X., Cai, Y., Li, Y., Hu, Y., & Zhao, Y. (2026). Machine Learning-Based Analysis of Arsenic Migration from Soil to Highland Barley in High Geological Background Areas. Sustainability, 18(4), 1782. https://doi.org/10.3390/su18041782
