Previous Article in Journal
A Structured Literature Review of the Application of Local Climate Zones (LCZ) in Urban Climate Modelling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Multilevel Machine Learning Framework for Mapping and Predicting Diffuse and Point-Source Heavy Metal Contamination in Surface Soils

by
Maria Silvia Binetti
1,2,
Carmine Massarelli
2,* and
Emanuele Barca
2
1
Department of Earth and Geoenvironmental Sciences, University of Bari Aldo Moro, 70125 Bari, Italy
2
Environment and Territory Research Unit, Construction Technologies Institute, Italian National Research Council (ITC-CNR), 70124 Bari, Italy
*
Author to whom correspondence should be addressed.
Submission received: 5 November 2025 / Revised: 28 December 2025 / Accepted: 30 December 2025 / Published: 31 December 2025
(This article belongs to the Section AI and Big Data in Earth Science)

Abstract

This study addresses the global challenge of superficial soil contamination by heavy metals, focusing on differentiating natural geogenic sources from anthropogenic contributions in complex industrial–urban environments. We develop an integrated geostatistical and multivariate framework combining soil metal concentration analysis with AERMOD atmospheric dispersion modeling using a comparative multi-model machine learning approach (including Extreme Gradient Boosting, Random Forest, and Ridge Regression). Applied to the industrialized area of Taranto, Southern Italy, this approach incorporates spatial autocorrelation and multiple environmental predictors to identify contamination patterns and sources. The results reveal variable predictive accuracy across metals, with RF generally outperforming the other algorithms. The model achieved its highest performance for copper (R2 = 0.58, RMSE = 25.82), Tin (R2 = 0.53, RMSE = 5.95), and chromium, while showing instability for others. These disparities highlight the differential influence of remote sensing data on contamination mapping. The framework advances the quantitative assessment of soil pollution by linking atmospheric deposition and spatial processes with causal interpretability.
Keywords: AERMOD; advanced spatial analysis; XGBoost; Random Forest; Ridge Regression; metal digital mapping AERMOD; advanced spatial analysis; XGBoost; Random Forest; Ridge Regression; metal digital mapping

Share and Cite

MDPI and ACS Style

Binetti, M.S.; Massarelli, C.; Barca, E. A Multilevel Machine Learning Framework for Mapping and Predicting Diffuse and Point-Source Heavy Metal Contamination in Surface Soils. Earth 2026, 7, 4. https://doi.org/10.3390/earth7010004

AMA Style

Binetti MS, Massarelli C, Barca E. A Multilevel Machine Learning Framework for Mapping and Predicting Diffuse and Point-Source Heavy Metal Contamination in Surface Soils. Earth. 2026; 7(1):4. https://doi.org/10.3390/earth7010004

Chicago/Turabian Style

Binetti, Maria Silvia, Carmine Massarelli, and Emanuele Barca. 2026. "A Multilevel Machine Learning Framework for Mapping and Predicting Diffuse and Point-Source Heavy Metal Contamination in Surface Soils" Earth 7, no. 1: 4. https://doi.org/10.3390/earth7010004

APA Style

Binetti, M. S., Massarelli, C., & Barca, E. (2026). A Multilevel Machine Learning Framework for Mapping and Predicting Diffuse and Point-Source Heavy Metal Contamination in Surface Soils. Earth, 7(1), 4. https://doi.org/10.3390/earth7010004

Article Metrics

Back to TopTop