Previous Article in Journal
Advancing Tree Species Classification with Multi-Temporal UAV Imagery, GEOBIA, and Machine Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modelling the Spatial Distribution of Soil Organic Carbon Using Machine Learning and Remote Sensing in Nevado de Toluca, Mexico

by
Carmine Fusaro
1,
Yohanna Sarria-Guzmán
2,
Francisco Erik González-Jiménez
3,
Manuel Saba
4,*,
Oscar E. Coronado-Hernández
5 and
Carlos Castrillón-Ortíz
4
1
Faculty of Engineering, Universidad de San Buenaventura, Calle Real de Ternera # 30-966, Cartagena 130010, Colombia
2
Food Engineering Program, Universidad de Cartagena, Calle 30 # 48-152, Cartagena 130001, Colombia
3
Faculty of Chemical Sciences, Universidad Veracruzana, Prolongación Oriente 6, # 1009, Rafael Alvarado, Orizaba 94340, Veracruz, Mexico
4
Civil Engineering Program, Universidad de Cartagena, Calle 30 # 48-152, Cartagena 130001, Colombia
5
Institute of Hydraulics and Environmental Sanitation, Universidad de Cartagena, Calle 30 # 48-152, Cartagena 130001, Colombia
*
Author to whom correspondence should be addressed.
Geomatics 2025, 5(3), 43; https://doi.org/10.3390/geomatics5030043 (registering DOI)
Submission received: 11 July 2025 / Revised: 28 August 2025 / Accepted: 1 September 2025 / Published: 8 September 2025

Abstract

Highlights

What are the main findings?
  • The Quadratic Support Vector Machine using NDVI, elevation, and land use achieved high accuracy (R2 = 0.84) for predicting soil organic carbon (SOC) in the volcanic landscape of Nevado de Toluca.
  • Models using NDVI and BSI consistently outperformed those using MSAVI2 for SOC estimation, highlighting differences among vegetation indices in heterogeneous mountain terrains.
What is the implication of the main finding?
  • Integrating multispectral remote sensing indices with machine learning enables accurate and cost-effective SOC mapping in ecologically complex areas.
  • The approach supports scalable carbon monitoring and informs sustainable land management and conservation strategies in mountainous ecosystems.

Abstract

Accurate soil organic carbon (SOC) estimation is critical for assessing ecosystem services, carbon budgets, and informing sustainable land management, particularly in ecologically sensitive mountainous regions. This study focuses on modelling the spatial distribution of SOC within the heterogeneous volcanic landscape of the Nevado de Toluca (NdT), central Mexico, an area spanning 535.9 km2 and characterised by diverse land uses, altitudinal gradients, and climatic regimes. Using 29 machine learning algorithms, we evaluated the predictive capacity of three key variables: land use, elevation, and the Normalised Difference Vegetation Index (NDVI) derived from satellite imagery. Complementary analyses were performed using the Bare Soil Index (BSI) and the Modified Soil-Adjusted Vegetation Index 2 (MSAVI2) to assess their relative performance. Among the tested models, the Quadratic Support Vector Machine (SVM) using NDVI, elevation, and land use emerged as the top-performing model, achieving a coefficient of determination (R2) of 0.84, indicating excellent predictive accuracy. Notably, 14 models surpassed the R2 threshold of 0.80 when using NDVI and BSI as predictor variables, whereas MSAVI2-based models consistently underperformed (R2 < 0.78). Validation plots demonstrated strong agreement between observed and predicted SOC values, confirming the robustness of the best-performing models. This research highlights the effectiveness of integrating multispectral remote sensing indices with advanced machine learning frameworks for SOC estimation in mountainous volcanic ecosystems

1. Introduction

Terrestrial ecosystems are crucial in the global carbon (C) cycle [1,2,3]; the natural environments can absorb or release consistent amounts of greenhouse gases such as carbon dioxide (CO2) and methane (CH4) [4,5]. Also, they can store a considerable quantity of C in biomass plants, litter, and soil pools [6,7]. In particular, the C stored in the earth’s soil, which ranges from 1500 to 2400 Pg [8], exceeds the combined total of C hoarded in the atmosphere and the biosphere [9]. The global pool of soil organic carbon (SOC) has been estimated at three times greater than the atmospheric C pool and about five times greater than the biotic C pool [10,11,12,13].
Storage of SOC is the result of different ecological dynamics, i.e., photosynthesis, organic matter decomposition, and soil respiration [14]. The SOC is an essential parameter of healthy soil; it contributes to multiple ecosystem services (i.e., nutrient cycles, water cycle, and food production) and multiple soil properties (i.e., aggregate stability, soil texture, porosity, and water-holding capacity) [15,16].
Several factors, such as geographic location, climate, soil moisture, soil texture, soil pH, leaching and land use, could determine or modify SOC in natural ecosystems [17,18,19]. The clay fraction in the soil accumulates more SOC than any other fraction (silt and sand) [20]. Acidic soil pH may depress the decomposition of freshly added organic material in humid forest soils [21,22]. The SOC in forest soils is generally negatively impacted by reduction/abatement in density, abundance, and biodiversity of vegetation (primary and shrubby), which involves a drop in litter input and changes in the distribution of plant roots [23,24].
Human activities such as agriculture, livestock husbandry, tourism, and urbanisation seriously compromise ecological services [25,26]. The SOC concentrations in the mountain soils show considerable spatial variability and, in many cases, can be profoundly altered by human activities and land use changes [24,27,28,29]. The conversion of virgin forests to croplands can reduce SOC stocks by an average of 40% [30]; furthermore, total C loss can reach 50% in the topsoil in agroforestry ecosystems due to land use change and constant agriculture [31,32].
Accurate SOC thematic maps based on a limited number of sample points and a few other data points about landscapes have always been challenging for researchers [33,34]. Readily accessible environmental variables, derived from digital elevation models (DEM) (altitude, slope), remote sensing systems (spectral indices) or thematic maps (land use, climate), are valuable inputs to improve the performance of GIS algorithms and the accuracy of SOC maps [35]. The remote sensing data could be acquired in different ways, i.e., unmanned aerial vehicle [36], airborne [37], and satellites [38,39].
Additionally, machine learning techniques, with remote sensing inputs (spectral indices) such as Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), Bare Soil Index (BSI) and Modified Soil-Adjusted Vegetation Index 2 (MSAVI2), are widely used for SOC geostatic analysis in different types of ecosystems and other geographic scales [21,40,41]. Multivariate analysis and machine learning methods for SOC mapping in natural ecosystems include generalised linear model (GLM), linear additive model (LAM), cubist, random forest (RF), and support vector machine (SVM) [42,43,44,45].
The “Área de Protección de Flora y Fauna del Nevado de Toluca” (NdT) [46] of 535.90 km2 is located in central Mexico. As part of the Trans-Mexican Volcanic Belt, it reaches 4680 masl and is the fourth highest mountain of the country [47]. The NdT constitutes a nationally relevant biogeographic area that includes different types of landscapes, i.e., forest of pine, forest of fir, forest of oak, mixed forests of pine–oak and fir–pine, alpine pasture on the top of the mountain and agriculture on the lower slopes [47,48]. Illegal extraction of wood, land use change (conversion of forests into croplands), and forest fires are the leading causes of degradation and loss of SOC reserves in this region [47].
This research aimed to predict soil organic carbon (SOC) in the topsoil of the heterogeneous mountainous ecosystem of Nevado de Toluca (NdT) by applying twenty-nine machine learning (ML) techniques and linking the results to land use. Extensive sampling was conducted to ensure homogeneous coverage of the diverse landscapes within NdT, while model inputs included land use, elevation, and three widely used spectral indices—NDVI, BSI, and MSAVI2—derived from Sentinel-2 multispectral imagery. The novelty of this study lies in systematically benchmarking a large set of ML algorithms under complex volcanic mountain conditions, where steep altitudinal gradients, diverse land uses, and microclimatic variability pose significant challenges for SOC prediction. By integrating spectral indices with environmental variables, the proposed framework improves SOC prediction accuracy and demonstrates its applicability as a cost-effective and scalable monitoring tool. Beyond its local relevance, this approach can serve as a reference model for SOC analysis across the Trans-Mexican Volcanic Belt and, more broadly, in heterogeneous mountain ecosystems worldwide.

Study Area

The Área de Protección de Flora y Fauna del Nevado de Toluca, also known as Xinantécatl (Figure 1), is an eroded stratovolcano located in the central part of Mexico (18°51′31″–19°19′03″ N, 99°38′54″–100°09′58″ W). This heterogeneous high mountain ecosystem, part of the Trans-Mexican Volcanic Belt, stretches over about 535.90 km2 and rises to 4680 masl. The region is characterised by diverse landscapes, including pine, fir, oak and mixed forests (≃257.23 km2), alpine grasslands at higher elevations (≃69.67 km2), and agricultural lands on lower slopes (≃209.00 km2). The soils in the NdT are classified into four main types, i.e., andosols, regosols, fluvisols, and cambisols. The region experiences three dominant climates, i.e., semi-cold, sub-humid, and cold, with an average annual precipitation of 1050 mm and temperatures ranging from −4 °C to 12 °C. The NdT is particularly sensitive to human impacts; in particular, the exploitation of natural resources (wood logging) and land use change (deforestation for agricultural expansion) have contributed to significant environmental degradation and loss of SOC reserves [49].

2. Materials and Methods

Integrating empirical field sampling with computational modelling, a hybrid approach was implemented to construct a predictive model for Soil Organic Carbon (SOC) (Figure 2). The Nevado de Toluca (NdT) region served as the study area, where SOC concentrations were quantified via established laboratory protocols utilising 281 georeferenced soil samples. Subsequently, machine learning methodologies were employed to model SOC content, with a comparative analysis of diverse regression algorithms to ascertain the optimal predictive paradigm. Rigorous statistical evaluation was conducted to validate model performance, yielding a quantitative framework for enhanced SOC estimation within ecological investigations.

2.1. Sampling and Laboratory Analysis

The topographic maps (scale 1:50,000) from the Istituto Nacional de Estadística y Geografía of Mexico (INEGI) were used as the primary cartographic reference, while the Continuo de Elevaciones Mexicano 3.0 (CEM 3.0), with a 15 m × 15 m resolution, was consulted for the altitudinal gradient [50].
Two hundred eighty-one (281) sampling locations, including 168 samples in forests, 48 samples in agriculture, and 65 samples in natural grasslands/moorlands, were selected to cover the area evenly and ensure a spatially representative sampling design (Figure 1).
At each sampling site, five individual soil samples (~1 kg each) were collected from the 0–15 cm top soil horizon following a rhomboidal sampling scheme (Figure 3). The central point coordinates were recorded using a portable global positioning system (GPS).
The five samples from each location were analysed separately, and their SOC values were averaged, obtaining a representative estimate for the rhomboidal site. For laboratory analysis, soil samples were air-dried and sieved through a 2 mm mesh [51]. The SOC concentrations were quantified using a Total Organic Carbon Analyser (TOC-VCSN) from Shimadzu, Canby, OR, USA.

2.2. Remote Sensing

This study examines the correlation between Soil Organic Carbon (SOC) content and vegetation indices in the Nevado de Toluca region. To ensure high spatial precision, sentinel-2 multispectral imagery was selected as the primary data source. While the MODIS sensor was initially considered for its extensive spectral coverage, its coarse spatial resolution (250–1000 m per pixel) limited its applicability for the fine-scale analysis required in this study.
The choice of Sentinel-2 was primarily driven by its higher spatial resolution (10 m for visible and near-infrared [NIR] bands), which is well-suited for computing vegetation indices in the heterogeneous landscapes of Nevado de Toluca. Although Landsat 9 provides similar multispectral capabilities, its coarser spatial resolution (30 m for multispectral data) was less aligned with the study’s precision requirements. Given that the analysis focused on vegetation indices derived from visible and NIR bands, Sentinel-2 was deemed the most appropriate dataset. The multispectral images were acquired from the Copernicus Sentinel-2 mission, operated by the European Space Agency (ESA), and downloaded via the Copernicus Browser platform. The Multi-Spectral Instrument (MSI) onboard Sentinel-2 features 13 spectral bands spanning the visible, near-infrared (VNIR), and shortwave infrared (SWIR) regions, with spatial resolutions of 10, 20, and 60 m and a swath width of 290 km [52]. This selection ensures an optimal balance between spectral richness and spatial detail, facilitating accurate assessment of SOC-vegetation relationships in the study area.
To ensure high-quality remote sensing data for analysing SOC and vegetation indices, a 1% cloud cover threshold was applied. This filter allowed for the selection of cloud-free Sentinel-2 images, minimising atmospheric interference in the study area. The Copernicus Browser platform provides two types of Sentinel-2 products: L1C (Level-1C): Radiometrically and geometrically corrected but retains the original sensor geometry, without atmospheric correction; and L2A (Level-2A): Includes atmospheric correction, producing surface reflectance data in a geometrically and radiometrically corrected format. For greater accuracy and control over the atmospheric correction process, L1C data were selected. Atmospheric correction was subsequently applied using ACOLITE software [53], developed by the Royal Belgian Institute of Natural Sciences (RBINS). Initially designed for coastal and inland water applications, ACOLITE has been successfully adapted for terrestrial environments, offering robust atmospheric correction and image processing capabilities.
After applying the search filters, a set of 10 images captured between 1 January 2024, and 22 November 2024, was obtained. These images were visually inspected to ensure the absence of residual cloud cover, making the 1 March 2024, image the most suitable for analysis. To optimise data processing, the study area was extracted using ACOLITE’s region-of-interest (ROI) clipping function, defined by the following geographic coordinates: North; 19.31310°; South: 19.00000°; West; −99.95396° and East: −99.64265°. This cropping process significantly reduced the file size, enhancing computational efficiency by focusing solely on the relevant portion of the scene.
The Sentinel Application Platform (SNAP) software was used to manage the information generated after the atmospheric correction performed with ACOLITE. The resulting ACOLITE product is in Network Common Data Form (NetCDF) format, which facilitates its handling in SNAP due to its compatibility and flexibility for multispectral data processing. The atmospherically corrected image was loaded into SNAP, where the Band Math function was used to apply the Normalised Difference Vegetation Index (NDVI) (1), Bare Soil Index (BSI) (2), and Modified Soil-Adjusted Vegetation Index 2 (MSAVI2) (3) spectral index algorithms. These indices were selected due to their widespread validation and robustness in differentiating vegetation, bare soil, and mixed land covers in heterogeneous landscapes. While more recent SOC-specific indices have been proposed in the literature [54,55,56], their application remains less standardized in operational remote sensing workflows. The use of NDVI, BSI, and MSAVI2 therefore ensures methodological comparability and reproducibility, while establishing a baseline for SOC prediction in mountainous environments.
N D V I = B 8 B 4 B 8 + B 4
B S I = B 11 + B 4 ( B 8 + B 2 ) B 11 + B 4 + ( B 8 + B 2 )
M S A V 12 = 1 2 2 B 8 + 1 2 ( B 8 ) + 1 2 8 B 8 B 4
The calculated spectral indices provided critical insights into the study area’s terrain conditions and land cover characteristics. These indices optimised subsequent analytical processes by enhancing the differentiation of vegetation, bare soil, and mixed land covers. Following the computation of the raster datasets for these indices, the spectral values corresponding to the precise georeferenced locations of the field measurements were extracted. This step enabled a direct correlation between remotely sensed data and in situ observations, facilitating a more comprehensive and accurate assessment of the environmental conditions within the study area.

2.3. Machine Learning

This study employed a comprehensive suite of 29 Machine Learning (ML) regression models, encompassing nine distinct algorithmic families and their respective variants, to predict the target variable. These models included Linear Regression (LR) [57], Stepwise Linear Regression (SLR) [58], computationally Efficient Linear regression [59], Kernel-based Regression (KR) [60], Support Vector Machines (SVM) [61], Tree-based regression (TR) [62], Ensemble Methods (EM) [63], Gaussian Process Regression (GPR) [64], and Neural Networks (NN) [65]. Acknowledging the established body of literature detailing the theoretical underpinnings of machine learning algorithms, Table 1 summarises the critical hyperparameters for each model. All statistical analyses and model implementations were performed using MATLAB R2025 (The MathWorks Inc., Natick, MA, USA), utilising the Statistics and Machine Learning Toolbox. Statistical inferences were conducted at a 95% confidence level (α = 0.05) (Figure 4).
To rigorously evaluate model performance, the following metrics were computed: coefficient of determination (R2) [66,67], root mean squared error (RMSE) [68,69], mean squared error (MSE) [70], mean absolute error (MAE) [71], mean absolute percentage error (MAPE) [72,73], and training time. These metrics comprehensively assessed each model’s predictive accuracy and computational efficiency. To prevent overfitting, all models were validated using a k-fold cross-validation scheme (75% of data for training and 25% for validation), which partitioned the dataset into stratified folds to preserve sample representativeness. Model hyperparameters were optimized within the training folds exclusively, ensuring that evaluation metrics were computed on independent validation data. This procedure was repeated four times to minimize variability introduced by random partitioning. Furthermore, algorithm-specific mechanisms such as regularization (in LR and SVM), pruning (in decision trees), ensemble averaging (in bagged/boosted models), and Bayesian optimization (in Gaussian process regression) inherently reduced the risk of model overfitting. The final performance assessment was based on the average validation results rather than training accuracy, providing a robust estimate of each model’s generalization capacity. It is important to note that although the k-fold cross-validation strategy used here ensured statistical robustness, it did not explicitly account for spatial autocorrelation. Consequently, training and validation folds may contain spatially close samples, potentially leading to optimistic performance estimates. The authors acknowledge this limitation and recommend that future research incorporate spatial cross-validation schemes, such as spatial blocking or leave-location-out approaches, to further mitigate spatial dependence effects.

3. Results

Figure 5 shows that all three indices consistently highlighted the volcanic center as an area of sparse vegetation and exposed surfaces. However, beyond this shared feature, their spatial distributions diverged. NDVI strongly emphasized vegetation cover, BSI accentuated bare surfaces, and MSAVI2 provided a smoother representation of vegetation density. These complementary patterns reflect the distinct sensitivities of each index while reinforcing the central volcanic zone as a common feature of interest. Table 2 presents the descriptive statistics for the vegetation, elevation, and soil organic carbon (SOC) indices, including mean and standard deviation. Table 3 reports the quantitative error metrics (e.g., R2, RMSE, MAE) for all evaluated machine learning models.
Of the 29 tested models, 14 achieved a coefficient of determination (R2) greater than 0.80 when using NDVI, elevation, and land use as predictor variables, demonstrating strong predictive performance for SOC estimation. The model with the best performance was the Quadratic Support Vector Machine (SVM), which achieved an R2 of 0.84, indicating excellent predictive accuracy (Figure 6). Similarly, 14 models exceeded an R2 of 0.80 when using the Bare Soil Index (BSI) as input. In contrast, none of the models using MSAVI2 surpassed an R2 value of 0.78, suggesting a lower predictive capacity of this index for SOC estimation.
Figure 7 also includes the validation plots comparing predicted versus actual SOC values for the top-performing models: (a) Quadratic SVM using NDVI, (b) Matern 5/2 Gaussian Process Regression (GPR) using MSAVI2, and (c) Rational Quadratic GPR using BSI. These plots highlight the strong agreement between observed and predicted values for the most accurate models under each index scenario.
The soil organic carbon prediction model results correspond well with the field-measured data (Figure 8). Both maps reveal similar spatial patterns, especially in the highest concentration range (21.01–35.00 g C kg−1), which is predominantly distributed in the central and southern areas of the study region. Table 4 shows the Mean Absolute Error (MAE) values, revealing distinct performances of the vegetation indices across SOC ranges. NDVI consistently showed the lowest errors in both low (0–12 g C kg−1; MAE = 1.9) and high (21.01–35 g C kg−1; MAE = 3.3) SOC ranges, indicating its robustness in capturing extreme SOC conditions. In contrast, BSI performed best at intermediate SOC levels (12.01–21 g C kg−1; MAE = 2.7), where it outperformed both NDVI and MSAVI2. MSAVI2, however, exhibited the highest MAE values across all SOC ranges, with errors exceeding 7.0, suggesting limited suitability for SOC prediction in this context. NDVI and BSI emerged as the most reliable indices, with their performance depending on the SOC interval considered.
NDVI performs better at low and high SOC ranges because it is highly sensitive to vegetation vigor and canopy density, which are strongly influenced by organic matter content. Areas with very low SOC often support sparse or stressed vegetation, making NDVI effective in detecting these conditions. Similarly, areas with high SOC generally sustain healthier vegetation, which again enhances NDVI’s response. However, NDVI tends to saturate in intermediate SOC levels where vegetation density is moderate, reducing its sensitivity.
BSI, on the other hand, is designed to capture bare soil conditions and responds better in intermediate SOC ranges, where the soil surface is partially exposed and vegetation does not dominate the spectral signal. This may explain why BSI produced the lowest MAE in the 12.01–21.00 g C kg−1 range. MSAVI2 was expected to reduce vegetation saturation effects compared to NDVI, but in this study, it may have been affected by background soil reflectance and mixed pixels, leading to higher errors across all SOC ranges.

4. Discussion

Soil carbon sequestration, which consists of transferring carbon dioxide CO2 from the atmosphere into the soil in the form of organic carbon, is influenced by various factors such as land use (forests, croplands, grasslands, moorlands) [74,75,76], land use change (conversion of forests to croplands or livestock pastures) [77,78], land management (mineral fertilisation) [79,80], climate (rainfall, temperature, humidity) [81,82], geography (altitude and slope soil) [83], physicochemical and microbiological parameters (soil depth, soil texture, microbiota) [84,85].
The SOC in the top soil in the NdT was highly varied and ranged from 0.39 g C kg−1 (high moor zone) to 60.54 g C kg−1 (forest). Our results indicate that SOC were significantly higher in the forest soil (mean: 25.66 g C kg−1, ±8.82 g C kg−1) than in the high moorland (mean: 12.12 g C kg−1, ±6.50 g C kg−1) and arable soil (mean: 7.42 g C kg−1, ±3.08 g C kg−1), which suggests that land use plays a key role in its spatial distribution. Other authors have reached the same conclusions as us.
Land use change, i.e., forest conversion to cropland in the NdT, has been linked with reduced SOC concentrations. Ref. [85] indicated that SOC, in the heterogeneous ecosystem of Coka watershed (Southern Ethiopia), decreased in the following order: bushland and forestland, grassland, cropland and bare land [86]. Those who focused their interest on the heterogeneous geographical area of Northeastern India reported that SOC was significantly higher in the topsoil of the forest than in the adjacent arable land and natural grassland. Ref. [87] indicated a mean SOC content of 44.70 g C kg−1 in the top forest soil and only 14.70 g C kg−1 in cropland top soil. Finally, ref. [51] stated that land use is the primary driver of SOC spatial distribution in a high mountain ecosystem of National Park La Malinche, an eroded stratovolcano, located approximately 250 km east of NdT.
The SOC concentrations registered in maise monoculture sites, in the lower altitude areas of NdT, varied from 0.85 to 18.85 g C kg−1 and are comparable to those indicated by [88]. In the Song-Nen Plain maise croplands (Northeast China). The conventional agricultural practices, such as monoculture, tillage, and removal of crop residue, which are common among farmers in the NdT, most likely contributed to reducing SOC in the topsoil; furthermore, the legume-based rotation systems, where high SOC concentrations are generally recorded, have been largely abandoned in recent decades. According to [89], organic farming, which is characterised by periodically rotating crop fields in grass or legumes, has the potential to increase SOC concentrations and accumulation rates.
The SOC concentrations in the top forest soil were highly variable and ranged from 6.99 to 60.54 g C kg−1; these results are comparable with those indicated by [51,90]. The organic matter (plant litter, plant roots, soil microorganisms and animal residues) is the primary source of SOC. The C soil inputs depend on the vegetation type and structure [91]; the forest sites where NDVI values ranged between 0.5 and 0.7, which are compatible with dense canopy cover and significant leaf litter accumulation, presented higher SOC concentrations [92]. According to [93], SOC showed a significantly high positive correlation with NDVI in the forest context, even refs. [94,95] indicated that NDVI was an effective SOC predictor in India.
At the top of the volcano (over 4000 masl) the vegetation was limited exclusively to grasses and high moor; the SOC varied between 0.39 and 29.31 g C kg−1. It is likely that the soil erosion, caused by heavy rains and strong winds, and steep geography of the summit had impacted the SOC concentrations. Many authors pointed out soil erosion as the primary cause of the loss of SOC in alpine grassland landscapes [96,97,98].
A wide range of machine learning algorithms have been used to study SOC spatial distribution and its geoclimatic associated patterns [99,100,101,102]. The readily accessible environmental variables are often calculated through digital elevation models (DEMs) and satellite images (spectral indices). The twenty-nine (29) machine learning models used to predict the SOC in the NdT leveraged three key explanatory variables, i.e., land use, altitude, and satellite-derived vegetation indices (NDVI, BSI, and MSAVI2). Our results underscore the reliability of remote sensing data and topographic variables when integrated into advanced machine learning frameworks for predicting the SOC in this heterogeneous ecosystem. Fourteen (14) models—Linear, Interactions linear, Robust linear, Stepwise linear, Linear SVM, Quadratic SVM, Cubic SVM, Medium gaussian SVM, Coarse gaussian SVM, Squared exponential GPR, Matern 5/2 GPR, Exponential GPR, Rational quadratic GPR, and Narrow neural network—achieved determination coefficients (R2) greater than 0.80 when NDVI, elevation, and land use were used as complementary inputs; this indicates the robustness of these variables in estimating SOC at regional scales in a heterogeneous and multi-fragmented ecosystem. The Quadratic SVM emerged as the top-performing model, reaching an R2 of 0.84 (MSE: 14.34 (g C kg−1) and RMSE: 3.79 g C kg−1). According to [100], the NDVI and precipitation were the most significant features/predictors for machine learning techniques explaining SOC variability in a heterogeneous area (Northern Iran). The NDVI is commonly used as an input for machine learning techniques in many studies, which indicate that SOC is highly dependent on vegetation cover [103,104,105,106].
Even when BSI inputs replaced NDVI data, the performances of fourteen (14) models—Linear, Interactions linear, Robust linear, Stepwise linear, Quadratic SVM, Cubic SVM, Medium gaussian SVM, Coarse gaussian SVM, Squared exponential GPR, Matern 5/2 GPR, Exponential GPR, Rational quadratic GPR,
The narrow neural network was satisfactory, with an R2 threshold of 0.80. The Rational Quadratic GPR was the top-performing model, reaching an R2 of 0.83 in predicting SOC (MSE: 3.72 (g C kg−1) and RMSE: 13.88 g C kg−1). Previous studies have demonstrated that BSI is a significant predictor of SOC variability in different types of ecosystems: agricultural land [107,108,109], floodplains [110], alpine wetlands [111], and forests [112,113].
Finally, when MSAVI2, elevation, and land use were chosen as inputs, the machine learning models exhibited weaker performance, with none exceeding an R2 of 0.78.
The weaker performance of MSAVI2-based models in this study may be explained by the ecological and spectral characteristics of the Nevado de Toluca landscape. MSAVI2 was originally developed to minimize soil background effects in areas with low vegetation cover, improving over NDVI in sparse canopies. However, in the present study, the volcanic soils and heterogeneous vegetation cover (ranging from dense forest to open grassland and agricultural land) likely reduced its effectiveness. Specifically, in densely vegetated areas, MSAVI2 saturates similarly to NDVI and provides no clear advantage in sparsely vegetated volcanic terrains. On the other hand, where soil reflectance is highly variable due to lithological differences, the soil-adjustment term in MSAVI2 may amplify noise rather than reduce it.
In contrast, NDVI and BSI appear to better capture the vegetation and soil conditions that correlate with SOC across this heterogeneous volcanic environment. NDVI, despite its known saturation issue, performed more robustly because much of the study area is covered by moderately to highly vegetated land uses. BSI provided complementary information by enhancing the contrast between bare soil and vegetated surfaces, which was particularly relevant for sparsely covered agricultural or degraded lands.
Therefore, the underperformance of MSAVI2 does not necessarily reflect a weakness of the index itself, but rather its limited suitability to the specific biophysical conditions of Nevado de Toluca, where NDVI and BSI better captured SOC-related variability.
Validation plots further confirmed the consistency between predicted and observed SOC values. The Quadratic SVM models measured and predicted SOC values using NDVI and consistently displayed no significant difference across the study area.
Comparing the performances of machine learning techniques in predicting SOC across multiple ecosystems is challenging due to each study area’s local peculiarities (climate, geography, extension, heterogeneity, human activities). However, integrating freely available remote sensing products with machine learning techniques offers a cost-effective and scalable approach for SOC monitoring in a wide range of ecosystems.

5. Conclusions

This study aimed to model and predict soil organic carbon (SOC) across the complex mountainous landscape of the Nevado de Toluca (NdT) using a suite of 29 machine learning algorithms. The models leveraged three key explanatory variables—land use, elevation, and satellite-derived vegetation indices (NDVI, BSI, and MSAVI2)—to assess their effectiveness in capturing SOC spatial variability within this heterogeneous volcanic ecosystem.
The results underscore the predictive power of remote sensing and topographic variables when integrated into advanced machine learning frameworks. Of the 29 evaluated models, nearly half (14 models) achieved a coefficient of determination (R2) above 0.80 when NDVI, elevation, and land use were used as predictors. This highlights the robustness of these variables in estimating SOC at regional scales. The Quadratic Support Vector Machine (SVM) emerged as the top-performing model, reaching an R2 of 0.84, demonstrating excellent accuracy in capturing SOC distribution patterns.
Comparable results were observed when using the Bare Soil Index (BSI) as a predictor, with 14 models surpassing the R2 threshold of 0.80. However, MSAVI2-based models exhibited weaker performance, with none exceeding an R2 of 0.78, suggesting limitations in this index for SOC prediction in sparsely vegetated volcanic terrains.
Validation plots further confirmed the consistency between predicted and observed SOC values, particularly for the best models associated with each index: Quadratic SVM for NDVI, Matern 5/2 Gaussian Process Regression (GPR) for MSAVI2, and Rational Quadratic GPR for BSI.
This work contributes significantly to the growing knowledge of SOC modelling in high-altitude and ecologically diverse regions. Integrating freely available remote sensing products with machine learning techniques offers a cost-effective and scalable approach for SOC monitoring, which is critical for carbon budgeting, ecosystem services assessment, and sustainable land management. Future studies may benefit from including hyperspectral data or higher-resolution imagery and temporal analysis to evaluate SOC dynamics over time.

Author Contributions

Conceptualization, C.F. and Y.S.-G.; methodology, F.E.G.-J.; software, M.S.; validation, Y.S.-G. and O.E.C.-H.; formal analysis, F.E.G.-J.; investigation, M.S.; resources, O.E.C.-H.; data curation, C.F.; writing—original draft preparation, Y.S.-G.; writing—review and editing, C.C.-O.; visualization, C.C.-O.; supervision, C.F.; project administration, M.S.; funding acquisition, Y.S.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Midgley, G.F.; Bond, W.J.; Kapos, V.; Ravilious, C.; Scharlemann, J.P.; Woodward, F.I. Terrestrial carbon stocks and biodiversity: Key knowledge gaps and some policy implications. Curr. Opin. Environ. Sustain. 2010, 2, 264–270. [Google Scholar] [CrossRef]
  2. Navarrete-Segueda, A.; Martínez-Ramos, M.; Ibarra-Manríquez, G.; Vázquez-Selem, L.; Siebe, C. Variation of main terrestrial carbon stocks at the landscape-scale are shaped by soil in a tropical rainforest. Geoderma 2018, 313, 57–68. [Google Scholar] [CrossRef]
  3. Schimel, D.S. Terrestrial biogeochemical cycles: Global estimates with remote sensing. Remote Sens. Environ. 1995, 51, 49–56. [Google Scholar] [CrossRef]
  4. Filonchyk, M.; Peterson, M.P.; Zhang, L.; Hurynovich, V.; He, Y. Greenhouse gases emissions and global climate change: Examining the influence of CO2, CH4, and N2O. Sci. Total Environ. 2024, 935, 173359. [Google Scholar] [CrossRef] [PubMed]
  5. Tagesson, T.; Schurgers, G.; Horion, S.; Ciais, P.; Tian, F.; Brandt, M.; Ahlström, A.; Wigneron, J.-P.; Ardö, J.; Olin, S.; et al. Recent divergence in the contributions of tropical and boreal forests to the terrestrial carbon sink. Nat. Ecol. Evol. 2020, 4, 202–209. [Google Scholar] [CrossRef]
  6. Romero-Sanchez, M.E.; Gonzalez-Hernandez, A.; Velasco-Bautista, E.; Correa-Diaz, A.; Ortiz-Reyes, A.D.; Perez-Miranda, R. Can Combining Machine Learning Techniques and Remote Sensing Data Improve the Accuracy of Aboveground Biomass Estimations in Temperate Forests of Central Mexico? Geomatics 2025, 5, 30. [Google Scholar] [CrossRef]
  7. Ray, G.; Singh, M. Tropical Forest Carbon Accounting Through Deep Learning-Based Species Mapping and Tree Crown Delineation. Geomatics 2025, 5, 15. [Google Scholar] [CrossRef]
  8. Patton, N.R.; Lohse, K.A.; Seyfried, M.S.; Godsey, S.E.; Parsons, S.B. Topographic controls of soil organic carbon on soil-mantled landscapes. Sci. Rep. 2019, 9, 6390. [Google Scholar] [CrossRef]
  9. Scharlemann, J.P.; Tanner, E.V.; Hiederer, R.; Kapos, V. Global soil carbon: Understanding and managing the largest terrestrial carbon pool. Carbon Manag. 2014, 5, 81–91. [Google Scholar] [CrossRef]
  10. Georgiou, K.; Jackson, R.B.; Vindušková, O.; Abramoff, R.Z.; Ahlström, A.; Feng, W.; Harden, J.W.; Pellegrini, A.F.A.; Polley, H.W.; Soong, J.L.; et al. Global stocks and capacity of mineral-associated soil organic carbon. Nat. Commun. 2022, 13, 3797. [Google Scholar] [CrossRef]
  11. Lal, R. Soil Carbon Sequestration Impacts on Global Climate Change and Food Security. Science 2004, 304, 1623–1627. [Google Scholar] [CrossRef]
  12. Lal, R. Managing Soils and Ecosystems for Mitigating Anthropogenic Carbon Emissions and Advancing Global Food Security. Bioscience 2010, 60, 708–721. [Google Scholar] [CrossRef]
  13. Ross, C.W.; Grunwald, S.; Myers, D.B. Spatiotemporal modeling of soil organic carbon stocks across a subtropical region. Sci. Total Environ. 2013, 461–462, 149–157. [Google Scholar] [CrossRef] [PubMed]
  14. Bréchet, L.M.; Lopez-Sangil, L.; George, C.; Birkett, A.J.; Baxendale, C.; Trujillo, B.C.; Sayer, E.J. Distinct responses of soil respiration to experimental litter manipulation in temperate woodland and tropical forest. Ecol. Evol. 2018, 8, 3787–3796. [Google Scholar] [CrossRef] [PubMed]
  15. Eze, S.; Magilton, M.; Magnone, D.; Varga, S.; Gould, I.; Mercer, T.G.; Goddard, M.R. Meta-analysis of global soil data identifies robust indicators for short-term changes in soil organic carbon stock following land use change. Sci. Total Environ. 2023, 860, 160484. [Google Scholar] [CrossRef] [PubMed]
  16. Rieke, E.L.; Bagnall, D.K.; Morgan, C.L.; Flynn, K.D.; Howe, J.A.; Greub, K.L.; Mac Bean, G.; Cappellazzi, S.B.; Cope, M.; Liptzin, D.; et al. Evaluation of aggregate stability methods for soil health. Geoderma 2022, 428, 116156. [Google Scholar] [CrossRef]
  17. Abegaz, A.; Winowiecki, L.A.; Vågen, T.-G.; Langan, S.; Smith, J.U. Spatial and temporal dynamics of soil organic carbon in landscapes of the upper Blue Nile Basin of the Ethiopian Highlands. Agric. Ecosyst. Environ. 2016, 218, 190–208. [Google Scholar] [CrossRef]
  18. Liptzin, D.; Norris, C.E.; Cappellazzi, S.B.; Mac Bean, G.; Cope, M.; Greub, K.L.; Rieke, E.L.; Tracy, P.W.; Aberle, E.; Ashworth, A.; et al. An evaluation of carbon indicators of soil health in long-term agricultural experiments. Soil Biol. Biochem. 2022, 172, 108708. [Google Scholar] [CrossRef]
  19. Rodrigues, C.I.D.; Brito, L.M.; Nunes, L.J.R. Soil Carbon Sequestration in the Context of Climate Change Mitigation: A Review. Soil Syst. 2023, 7, 64. [Google Scholar] [CrossRef]
  20. Jagadamma, S.; Lal, R. Distribution of organic carbon in physical fractions of soils as affected by agricultural management. Biol. Fertil. Soils 2010, 46, 543–554. [Google Scholar] [CrossRef]
  21. Li, L.; Wang, B.; Feng, P.; Liu, D.L.; He, Q.; Zhang, Y.; Wang, Y.; Li, S.; Lu, X.; Yue, C.; et al. Developing machine learning models with multi-source environmental data to predict wheat yield in China. Comput. Electron. Agric. 2022, 194, 106790. [Google Scholar] [CrossRef]
  22. Sellan, G.; Thompson, J.; Majalap, N.; Robert, R.; Brearley, F.Q. Impact of soil nitrogen availability and pH on tropical heath forest organic matter decomposition and decomposer activity. Pedobiologia 2020, 80, 150645. [Google Scholar] [CrossRef]
  23. Golchin, A.; Misaghi, M. Investigating the effects of climate change and anthropogenic activities on SOC storage and cumulative CO2 emissions in forest soils across altitudinal gradients using the century model. Sci. Total Environ. 2024, 943, 173758. [Google Scholar] [CrossRef]
  24. Roberts, N.J.; Harihar, A.; Zhou, X.; She, W.; Jiang, G. Global Tiger Density Linked with Forest Carbon Stock, Top-Down and Bottom-Up. Glob. Change Biol. 2025, 31, e70191. [Google Scholar] [CrossRef] [PubMed]
  25. Houghton, R.A.; Nassikas, A.A. Negative emissions from stopping deforestation and forest degradation, globally. Glob. Change Biol. 2018, 24, 350–359. [Google Scholar] [CrossRef] [PubMed]
  26. Zhai, T.; Wang, J.; Fang, Y.; Qin, Y.; Huang, L.; Chen, Y. Assessing ecological risks caused by human activities in rapid urbanization coastal areas: Towards an integrated approach to determining key areas of terrestrial-oceanic ecosystems preservation and restoration. Sci. Total Environ. 2020, 708, 135153. [Google Scholar] [CrossRef] [PubMed]
  27. Parras-Alcántara, L.; Lozano-García, B.; Galán-Espejo, A. Soil organic carbon along an altitudinal gradient in the Despeñaperros Natural Park, southern Spain. Solid Earth 2015, 6, 125–134. [Google Scholar] [CrossRef]
  28. Seifu, W.; Elias, E.; Gebresamuel, G.; Khanal, S. Impact of land use type and altitudinal gradient on topsoil organic carbon and nitrogen stocks in the semi-arid watershed of northern Ethiopia. Heliyon 2021, 7, e06770. [Google Scholar] [CrossRef]
  29. Zhang, H.; Zhuang, S.; Qian, H.; Wang, F.; Ji, H. Spatial Variability of the Topsoil Organic Carbon in the Moso Bamboo Forests of Southern China in Association with Soil Properties. PLoS ONE 2015, 10, e0119175. [Google Scholar] [CrossRef]
  30. Guo, L.B.; Gifford, R.M. Soil carbon stocks and land use change: A meta analysis. Glob. Change Biol. 2002, 8, 345–360. [Google Scholar] [CrossRef]
  31. Jat, M.L.; Chakraborty, D.; Ladha, J.K.; Parihar, C.M.; Datta, A.; Mandal, B.; Nayak, H.S.; Maity, P.; Rana, D.S.; Chaudhari, S.K.; et al. Carbon sequestration potential, challenges, and strategies towards climate action in smallholder agricultural systems of South Asia. Crop Environ. 2022, 1, 86–101. [Google Scholar] [CrossRef]
  32. Luo, Z.; Wang, E.; Sun, O.J. Soil carbon change and its responses to agricultural practices in Australian agro-ecosystems: A review and synthesis. Geoderma 2010, 155, 211–223. [Google Scholar] [CrossRef]
  33. Goodman, J.M.; Owens, P.R.; Libohova, Z. Predicting soil organic carbon using mixed conceptual and geostatistical models. In Digital Soil Assessments and Beyond; CRC Press: Boca Raton, FL, USA, 2012; pp. 155–161. [Google Scholar] [CrossRef]
  34. Winzeler, H.E.; Owens, P.R.; Waltman, S.W.; Libohova, Z.; Waltman, W.J. Soil climate indicators from the Geographically Explicit Newhall Simulation Model (GEN) as potential environmental covariates in digital soil mapping applications. In Digital Soil Assessments and Beyond; CRC Press: Boca Raton, FL, USA, 2012; pp. 167–171. Available online: https://www.taylorfrancis.com/chapters/edit/10.1201/b12728-36/soil-climate-indicators-geographically-explicit-newhall-simulation-model-gen-potential-environmental-covariates-digital-soil-mapping-applications-winzeler-owens-waltman-libohova-waltman?context=ubx&refId=eb5016f9-afcb-4265-9b05-d8ee1c9ef9b2 (accessed on 31 May 2025).
  35. Mirzaee, S.; Ghorbani-Dashtaki, S.; Mohammadi, J.; Asadi, H.; Asadzadeh, F. Spatial variability of soil organic matter using remote sensing data. Catena 2016, 145, 118–127. [Google Scholar] [CrossRef]
  36. Wehrhan, M.; Sommer, M. A Parsimonious Approach to Estimate Soil Organic Carbon Applying Unmanned Aerial System (UAS) Multispectral Imagery and the Topographic Position Index in a Heterogeneous Soil Landscape. Remote Sens. 2021, 13, 3557. [Google Scholar] [CrossRef]
  37. Dvorakova, K.; Shi, P.; Limbourg, Q.; van Wesemael, B. Soil Organic Carbon Mapping from Remote Sensing: The Effect of Crop Residues. Remote. Sens. 2020, 12, 1913. [Google Scholar] [CrossRef]
  38. Garosi, Y.; Ayoubi, S.; Nussbaum, M.; Sheklabadi, M.; Nael, M.; Kimiaee, I. Use of the time series and multi-temporal features of Sentinel-1/2 satellite imagery to predict soil inorganic and organic carbon in a low-relief area with a semi-arid environment. Int. J. Remote. Sens. 2022, 43, 6856–6880. [Google Scholar] [CrossRef]
  39. Suleymanov, A.; Tuktarova, I.; Belan, L.; Suleymanov, R.; Gabbasova, I.; Araslanova, L. Spatial prediction of soil properties using random forest, k-nearest neighbors and cubist approaches in the foothills of the Ural Mountains, Russia. Model. Earth Syst. Environ. 2023, 9, 3461–3471. [Google Scholar] [CrossRef]
  40. Muñoz, A.S.M.; Alvis, Á.I.G.; Martínez, I.F.B. A random forest model to predict soil organic carbon storage in mangroves from Southern Colombian Pacific coast. Estuar. Coast. Shelf Sci. 2024, 299, 108674. [Google Scholar] [CrossRef]
  41. Zhou, Y.; Zhao, X.; Guo, X.; Li, Y. Mapping of soil organic carbon using machine learning models: Combination of optical and radar remote sensing data. Soil Sci. Soc. Am. J. 2022, 86, 293–310. [Google Scholar] [CrossRef]
  42. Gomes, L.C.; Faria, R.M.; de Souza, E.; Veloso, G.V.; Schaefer, C.E.G.R.; Filho, E.I.F. Modelling and mapping soil organic carbon stocks in Brazil. Geoderma 2019, 340, 337–350. [Google Scholar] [CrossRef]
  43. Ismail, M.; Roy, S.K.; Dey, T.; Basak, J.K.; Chowdhury, M.A.A.; Ullah, F.; Ahmed, S. Carbon sequestration potential of Sonneratia apetala plantation forests in the Chakaria Sundarbans: Effects of stand age and structure. Reg. Stud. Mar. Sci. 2025, 84, 104104. [Google Scholar] [CrossRef]
  44. Parvizi, Y.; Fatehi, S. Geospatial digital mapping of soil organic carbon using machine learning and geostatistical methods in different land uses. Sci. Rep. 2025, 15, 4449. [Google Scholar] [CrossRef] [PubMed]
  45. Veronesi, F.; Schillaci, C. Comparison between geostatistical and machine learning models as predictors of topsoil organic carbon with a focus on local uncertainty estimation. Ecol. Indic. 2019, 101, 1032–1044. [Google Scholar] [CrossRef]
  46. Secretaría de Medio Ambiente y Recursos Naturales (SEMARNAT). Decreto que Reforma, Deroga y Adiciona Diversas Disposiciones del Diverso Publicado el 25 de Enero de 1936, por el que se Declaró Parque Nacional la Montaña Denominada “Nevado de Toluca” que fue Modificado por el Diverso Publicado el 19 de Febrero de 1937. 2013. Available online: https://www.gob.mx/cms/uploads/attachment/file/116130/01_10_13_DECRETO_que_reforma__deroga_y_adiciona_diversas_disposiciones_del_PN_Nevado_de_Toluca.pdf (accessed on 31 May 2025).
  47. Secretaría de Medio Ambiente y Recursos Naturales (SEMARNAT); Comisión Nacional de Áreas Naturales Protegidas (CONANP). Programa de Manejo Área de Protección de Flora y Fauna Nevado de Toluca. 2016. Available online: https://simec.conanp.gob.mx/pdf_libro_pm/104_libro_pm.pdf (accessed on 3 September 2025).
  48. Inostroza, C.C.; Reyes, P.M. Estado de México: Instituciones, Políticas Públicas y Actividad Productiva; El Colegio Mexiquense: Zinacantepec, Mexico, 2022. [Google Scholar]
  49. Agramont, A.R.E.; Maass, S.F.; Bernal, G.N.; Hernández, J.I.V.; Fredericksen, T.S. Effect of human disturbance on the structure and regeneration of forests in the Nevado de Toluca National Park, Mexico. J. For. Res. 2012, 23, 39–44. [Google Scholar] [CrossRef]
  50. de Estadística y Geografía, I.N. Continuo de Elevaciones Mexicano y Modelos Digitales de Elevación. Available online: https://www.inegi.org.mx/app/geo2/elevacionesmex/ (accessed on 31 May 2025).
  51. Fusaro, C.; Sarria-Guzmán, Y.; Chávez-Romero, Y.A.; Luna-Guido, M.; Muñoz-Arenas, L.C.; Dendooven, L.; Estrada-Torres, A.; Navarro-Noya, Y.E. Land use is the main driver of soil organic carbon spatial distribution in a high mountain ecosystem. PeerJ 2019, 7, e7897. [Google Scholar] [CrossRef]
  52. EOS Data Analytics Inc. Sentinel-2 Imagery: View and Download Satellite Data. Available online: https://eos.com/find-satellite/sentinel-2/ (accessed on 4 April 2025).
  53. REMSEM—OD Nature Remote Sensing and Ecosystem Modelling Team, ACOLITE. Available online: https://odnature.naturalsciences.be/remsem/software-and-data/acolite (accessed on 4 April 2025).
  54. Liu, Q.; He, L.; Guo, L.; Wang, M.; Deng, D.; Lv, P.; Wang, R.; Jia, Z.; Hu, Z.; Wu, G.; et al. Digital mapping of soil organic carbon density using newly developed bare soil spectral indices and deep neural network. Catena 2022, 219, 106603. [Google Scholar] [CrossRef]
  55. Salas, E.A.L.; Kumaran, S.S. Perimeter-Area Soil Carbon Index (PASCI): Modeling and estimating soil organic carbon using relevant explicatory waveband variables in machine learning environment. Geo-Spat. Inf. Sci. 2024, 27, 1739–1746. [Google Scholar] [CrossRef]
  56. Bartholomeus, H.M.; Schaepman, M.E.; Kooistra, L.; Stevens, A.; Hoogmoed, W.B.; Spaargaren, O.S.P. Spectral reflectance based indices for soil organic carbon quantification. Geoderma 2008, 145, 28–36. [Google Scholar] [CrossRef]
  57. The MathWorks Inc. Linear Regression. Available online: https://la.mathworks.com/help/stats/linear-regression-model-workflow.html (accessed on 27 March 2025).
  58. The MathWorks Inc. Stepwise Regression. Available online: https://la.mathworks.com/help/stats/stepwise-regression.html (accessed on 27 March 2025).
  59. The MathWorks Inc. Linear Least Squares. Available online: https://ch.mathworks.com/help/optim/linear-least-squares.html (accessed on 30 March 2025).
  60. The MathWorks Inc. Regression Kernel. Available online: https://la.mathworks.com/help/stats/regressionkernel.html (accessed on 27 March 2025).
  61. The MathWorks Inc. Support Vector Machine Regression. Available online: https://ch.mathworks.com/help/stats/support-vector-machine-regression.html (accessed on 30 March 2025).
  62. The MathWorks Inc. Regression Trees. Available online: https://ch.mathworks.com/help/stats/regression-trees.html (accessed on 27 March 2025).
  63. The MathWorks Inc. Ensemble Algorithms. Available online: https://la.mathworks.com/help/stats/ensemble-algorithms.html (accessed on 27 March 2025).
  64. The MathWorks Inc. Gaussian Process Regression Models. Available online: https://la.mathworks.com/help/stats/gaussian-process-regression-models.html (accessed on 27 March 2025).
  65. The MathWorks Inc. RegressionNeuralNetwork. Available online: https://ch.mathworks.com/help/stats/regressionneuralnetwork.html (accessed on 30 March 2025).
  66. The MathWorks Inc. Coefficient of Determination (R-Squared). Available online: https://ch.mathworks.com/help/stats/coefficient-of-determination-r-squared.html (accessed on 27 March 2025).
  67. Dodge, A. Coefficient of Determination. In The Concise Encyclopedia of Statistics; Springer: New York, NY, USA, 2008; pp. 88–91. [Google Scholar] [CrossRef]
  68. The MathWorks Inc. Root Mean Squared Error. Available online: https://ch.mathworks.com/help/matlab/ref/rmse.html (accessed on 27 March 2025).
  69. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model. Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef]
  70. The MathWorks Inc. Mean Squared Error. Available online: https://ch.mathworks.com/help/deeplearning/ref/dlarray.mse.html (accessed on 27 March 2025).
  71. The MathWorks Inc. Mean Squared Error. Available online: https://ch.mathworks.com/help/deeplearning/ref/mae.html (accessed on 27 March 2025).
  72. Kim, S.; Kim, H. A new metric of absolute percentage error for intermittent demand forecasts. Int. J. Forecast. 2016, 32, 669–679. [Google Scholar] [CrossRef]
  73. The MathWorks Inc. Mape. Available online: https://ch.mathworks.com/help/matlab/ref/mape.html (accessed on 30 March 2025).
  74. Merino, A.; Omil, B.; Piñeiro, V.; Barros, N.; Souza-Alonso, P.; Campo, J. Soil C dynamics after deforestation and subsequent conversion of arable cropland to grassland in humid temperate areas. Sci. Total Environ. 2023, 901, 165793. [Google Scholar] [CrossRef] [PubMed]
  75. Niu, X.; Liu, C.; Jia, X.; Zhu, J. Changing soil organic carbon with land use and management practices in a thousand-year cultivation region. Agric. Ecosyst. Environ. 2021, 322, 107639. [Google Scholar] [CrossRef]
  76. Smith, P. Land use change and soil organic carbon dynamics. Nutr. Cycl. Agroecosyst. 2008, 81, 169–178. [Google Scholar] [CrossRef]
  77. Beillouin, D.; Corbeels, M.; Demenois, J.; Berre, D.; Boyer, A.; Fallot, A.; Feder, F.; Cardinael, R. A global meta-analysis of soil organic carbon in the Anthropocene. Nat. Commun. 2023, 14, 3700. [Google Scholar] [CrossRef]
  78. Ferreira, A.C.C.; Leite, L.F.C.; de Araújo, A.S.F.; Eisenhauer, N. Land-Use Type Effects on Soil Organic Carbon and Microbial Properties in a Semi-arid Region of Northeast Brazil. Land Degrad. Dev. 2016, 27, 171–178. [Google Scholar] [CrossRef]
  79. Just, C.; Armbruster, M.; Barkusky, D.; Baumecker, M.; Diepolder, M.; Döring, T.F.; Heigl, L.; Honermeier, B.; Jate, M.; Merbach, I.; et al. Soil organic carbon sequestration in agricultural long-term field experiments as derived from particulate and mineral-associated organic matter. Geoderma 2023, 434, 116472. [Google Scholar] [CrossRef]
  80. Keesstra, S.; Rodrigo-Comino, J.; Novara, A.; Giménez-Morera, A.; Pulido, M.; Di Prima, S.; Cerdà, A. Straw mulch as a sustainable solution to decrease runoff and erosion in glyphosate-treated clementine plantations in Eastern Spain. An assessment using rainfall simulation experiments. Catena 2019, 174, 95–103. [Google Scholar] [CrossRef]
  81. Chen, X.; Deng, Q.; Lin, G.; Lin, M.; Wei, H. Changing rainfall frequency affects soil organic carbon concentrations by altering non-labile soil organic carbon concentrations in a tropical monsoon forest. Sci. Total Environ. 2018, 644, 762–769. [Google Scholar] [CrossRef]
  82. Zhang, L.; Zheng, Q.; Liu, Y.; Liu, S.; Yu, D.; Shi, X.; Xing, S.; Chen, H.; Fan, X. Combined effects of temperature and precipitation on soil organic carbon changes in the uplands of eastern China. Geoderma 2019, 337, 1105–1115. [Google Scholar] [CrossRef]
  83. Jakšić, S.; Ninkov, J.; Milić, S.; Vasin, J.; Živanov, M.; Jakšić, D.; Komlen, V. Influence of Slope Gradient and Aspect on Soil Organic Carbon Content in the Region of Niš, Serbia. Sustainability 2021, 13, 8332. [Google Scholar] [CrossRef]
  84. Bhattacharyya, S.S.; Ros, G.H.; Furtak, K.; Iqbal, H.M.N.; Parra-Saldívar, R. Soil carbon sequestration—An interplay between soil microbial community and soil organic matter dynamics. Sci. Total Environ. 2022, 815, 152928. [Google Scholar] [CrossRef]
  85. Buraka, T.; Elias, E.; Lelago, A. Soil organic carbon and its’ stock potential in different land-use types along slope position in Coka watershed, Southern Ethiopia. Heliyon 2022, 8, e10261. [Google Scholar] [CrossRef] [PubMed]
  86. Saha, R.; Chaudhary, R.S.; Somasundaram, J. Soil Health Management under Hill Agroecosystem of North East India. Appl. Environ. Soil Sci. 2012, 2012, 696174. [Google Scholar] [CrossRef]
  87. Kocyigit, R.; Demirci, S. Long-term changes of aggregate-associated and labile soil organic carbon and nitrogen after conversion from forest to grassland and cropland in northern Turkey. Land. Degrad. Dev. 2012, 23, 475–482. [Google Scholar] [CrossRef]
  88. Wang, Z.-M.; Zhang, B.; Song, K.-S.; Liu, D.-W.; Ren, C.-Y. Spatial Variability of Soil Organic Carbon Under Maize Monoculture in the Song-Nen Plain, Northeast China. Pedosphere 2010, 20, 80–89. [Google Scholar] [CrossRef]
  89. Gattinger, A.; Muller, A.; Haeni, M.; Skinner, C.; Fliessbach, A.; Buchmann, N.; Mäder, P.; Stolze, M.; Smith, P.; Scialabba, N.E.-H.; et al. Enhanced top soil carbon stocks under organic farming. Proc. Natl. Acad. Sci. USA 2012, 109, 18226–18231. [Google Scholar] [CrossRef]
  90. Martens, D.A.; Reedy, T.E.; Lewis, D.T. Soil organic carbon content and composition of 130-year crop, pasture and forest land-use managements. Glob. Change Biol. 2004, 10, 65–78. [Google Scholar] [CrossRef]
  91. Castellano, M.J.; Mueller, K.E.; Olk, D.C.; Sawyer, J.E.; Six, J. Integrating plant litter quality, soil organic matter stabilization, and the carbon saturation concept. Glob. Change Biol. 2015, 21, 3200–3209. [Google Scholar] [CrossRef]
  92. Mondal, A.; Khare, D.; Kundu, S.; Mondal, S.; Mukherjee, S.; Mukhopadhyay, A. Spatial soil organic carbon (SOC) prediction by regression kriging using remote sensing data. Egypt. J. Remote Sens. Space Sci. 2017, 20, 61–70. [Google Scholar] [CrossRef]
  93. Banday, M.; Bhardwaj, D.R.; Pala, N.A. Influence of forest type, altitude and NDVI on soil properties in forests of North Western Himalaya, India. Acta Ecol. Sin. 2019, 39, 50–55. [Google Scholar] [CrossRef]
  94. Kumar, P.; Sajjad, H.; Tripathy, B.R.; Ahmed, R.; Mandal, V.P. Prediction of spatial soil organic carbon distribution using Sentinel-2A and field inventory data in Sariska Tiger Reserve. Nat. Hazards 2018, 90, 693–704. [Google Scholar] [CrossRef]
  95. Mirchooli, F.; Kiani-Harchegani, M.; Darvishan, A.K.; Falahatkar, S.; Sadeghi, S.H. Spatial distribution dependency of soil organic carbon content to important environmental variables. Ecol. Indic. 2020, 116, 106473. [Google Scholar] [CrossRef]
  96. Lai, C.; Li, C.; Peng, F.; Xue, X.; You, Q.; Zhang, W.; Ma, S. Plant community change mediated heterotrophic respiration increase explains soil organic carbon loss before moderate degradation of alpine meadow. Land Degrad. Dev. 2021, 32, 5322–5333. [Google Scholar] [CrossRef]
  97. Nie, X.J.; Zhao, T.Q.; Qiao, X.N. Impacts of soil erosion on organic carbon and nutrient dynamics in an alpine grassland soil. Soil Sci. Plant Nutr. 2013, 59, 660–668. [Google Scholar] [CrossRef]
  98. Zhang, Z.; Zhan, T.; Li, Y.; Wang, Y.; Yu, T.; Sun, J. Soil organic carbon stock responded more sensitively to degradation in alpine meadows than in alpine steppes on the Qinghai-Tibetan Plateau. Land Degrad. Dev. 2023, 34, 353–361. [Google Scholar] [CrossRef]
  99. Agaba, S.; Ferré, C.; Musetti, M.; Comolli, R. Mapping Soil Organic Carbon Stock and Uncertainties in an Alpine Valley (Northern Italy) Using Machine Learning Models. Land 2024, 13, 78. [Google Scholar] [CrossRef]
  100. Emadi, M.; Taghizadeh-Mehrjardi, R.; Cherati, A.; Danesh, M.; Mosavi, A.; Scholten, T. Predicting and Mapping of Soil Organic Carbon Using Machine Learning Algorithms in Northern Iran. Remote Sens. 2020, 12, 2234. [Google Scholar] [CrossRef]
  101. John, K.; Isong, I.A.; Kebonye, N.M.; Ayito, E.O.; Agyeman, P.C.; Afu, S.M. Using Machine Learning Algorithms to Estimate Soil Organic Carbon Variability with Environmental Variables and Soil Nutrient Indicators in an Alluvial Soil. Land 2020, 9, 487. [Google Scholar] [CrossRef]
  102. Suleymanov, A.R.; Suleymanov, R.R.; Belan, L.N.; Asylbaev, I.G.; Tuktarova, I.O.; Shagaliev, R.D.; Bogdan, E.A.; Fairuzov, I.I.; Mirsayapov, R.R.; Davydychev, A.N. Drivers of Soil Organic Carbon Spatial Distribution in the Southern Ural Mountains: A Machine Learning Approach. Eurasian Soil Sci. 2024, 57, 1942–1949. [Google Scholar] [CrossRef]
  103. Abbaszad, P.; Asadzadeh, F.; Rezapour, S.; Aqdam, K.K.; Shabani, F. Evaluation of Landsat 8 and Sentinel-2 vegetation indices to predict soil organic carbon using machine learning models. Model. Earth Syst. Environ. 2024, 10, 2581–2592. [Google Scholar] [CrossRef]
  104. Fathizad, H.; Taghizadeh-Mehrjardi, R.; Ardakani, M.A.H.; Zeraatpisheh, M.; Heung, B.; Scholten, T. Spatiotemporal Assessment of Soil Organic Carbon Change Using Machine-Learning in Arid Regions. Agronomy 2022, 12, 628. [Google Scholar] [CrossRef]
  105. Nabiollahi, K.; Eskandari, S.; Taghizadeh-Mehrjardi, R.; Kerry, R.; Triantafilis, J. Assessing soil organic carbon stocks under land-use change scenarios using random forest models. Carbon Manag. 2019, 10, 63–77. [Google Scholar] [CrossRef]
  106. Triantakonstantis, D.; Karakostas, A. Soil Organic Carbon Monitoring and Modelling via Machine Learning Methods Using Soil and Remote Sensing Data. Agriculture 2025, 15, 910. [Google Scholar] [CrossRef]
  107. Nwaogu, C.; Diagi, B.E.; Ekweogu, C.V.; Ajeyomi, A.S.; Ejiogu, C.C.; Emereibeole, E.I.; Eneche, P.S.U.; Okeke, O.J.; Edokpa, D.O.; Chike, E.; et al. Soil organic carbon stocks as driven by land use in Mato Grosso State: The Brazilian Cerrado agricultural frontier. Discov. Sustain. 2024, 5, 382. [Google Scholar] [CrossRef]
  108. Zayani, H.; Fouad, Y.; Michot, D.; Kassouk, Z.; Baghdadi, N.; Vaudour, E.; Lili-Chabaane, Z.; Walter, C. Using Machine-Learning Algorithms to Predict Soil Organic Carbon Content from Combined Remote Sensing Imagery and Laboratory Vis-NIR Spectral Datasets. Remote Sens. 2023, 15, 4264. [Google Scholar] [CrossRef]
  109. Zhang, X.; Xue, J.; Chen, S.; Wang, N.; Xie, T.; Xiao, Y.; Chen, X.; Shi, Z.; Huang, Y.; Zhuo, Z. Fine Resolution Mapping of Soil Organic Carbon in Croplands with Feature Selection and Machine Learning in Northeast Plain China. Remote Sens. 2023, 15, 5033. [Google Scholar] [CrossRef]
  110. Mallik, S.; Bhowmik, T.; Mishra, U.; Paul, N. Mapping and prediction of soil organic carbon by an advanced geostatistical technique using remote sensing and terrain data. Geocarto Int. 2022, 37, 2198–2214. [Google Scholar] [CrossRef]
  111. Castelo-Cabay, M.; Piedra-Fernandez, J.A.; Ayala, R. Deep learning for land use and land cover classification from the Ecuadorian Paramo. Int. J. Digit. Earth 2022, 15, 1001–1017. [Google Scholar] [CrossRef]
  112. Gogoi, N.; Choudhury, M.; Hasan, M.S.U.; Changmai, B.; Baruah, D.; Samanta, P. Soil organic carbon pool in diverse land utilization patterns in North-East India: An implication for carbon sequestration. Environ. Dev. Sustain. 2024. [Google Scholar] [CrossRef]
  113. Sajjad, H.; Kumar, P.; Srivastava, P.K.; Pathak, S.O.; Ahmed, M.; Kumar, V.; Dobriyal, M.; Kumari, P.; Pandey, P.C. Assessing soil organic carbon and its relation with biophysical and ecological parameters in tropical forest ecosystem India. Geocarto Int. 2024, 40, 2441388. [Google Scholar] [CrossRef]
Figure 1. Spatial distribution of in situ samples. The boundaries of the study area are marked in yellow and sample points in red.
Figure 1. Spatial distribution of in situ samples. The boundaries of the study area are marked in yellow and sample points in red.
Geomatics 05 00043 g001
Figure 2. Methodology Workflow.
Figure 2. Methodology Workflow.
Geomatics 05 00043 g002
Figure 3. Field sample collection scheme.
Figure 3. Field sample collection scheme.
Geomatics 05 00043 g003
Figure 4. Machine Learning methodology.
Figure 4. Machine Learning methodology.
Geomatics 05 00043 g004
Figure 5. Spatial distribution of the indices (a) NDVI, (b) BSI, (c) MSAVI2 in the study area.
Figure 5. Spatial distribution of the indices (a) NDVI, (b) BSI, (c) MSAVI2 in the study area.
Geomatics 05 00043 g005aGeomatics 05 00043 g005b
Figure 6. Box & whisker R2 range for each Vegetation Index.
Figure 6. Box & whisker R2 range for each Vegetation Index.
Geomatics 05 00043 g006
Figure 7. Validation Predicted vs. Actual Plot for prediction. Better models: (a) Quadratic SVM for NDVI, R2 = 0.84; (b) Matern 5/2 GPR for MSAV12, R2 = 0.78; (c) Rational Quadratic GPR for BSI, R2 = 0.83.
Figure 7. Validation Predicted vs. Actual Plot for prediction. Better models: (a) Quadratic SVM for NDVI, R2 = 0.84; (b) Matern 5/2 GPR for MSAV12, R2 = 0.78; (c) Rational Quadratic GPR for BSI, R2 = 0.83.
Geomatics 05 00043 g007
Figure 8. Comparison of SOC distribution maps: (a) Measured field data; (b) Predicted values output of the model.
Figure 8. Comparison of SOC distribution maps: (a) Measured field data; (b) Predicted values output of the model.
Geomatics 05 00043 g008
Table 1. Machine learning model architecture.
Table 1. Machine learning model architecture.
Model TypePreset
TreeFine Tree
Linear RegressionLinear
Linear RegressionInteractions Linear
Linear RegressionRobust Linear
Stepwise Linear RegressionStepwise Linear
TreeFine Tree
TreeMedium Tree
TreeCoarse Tree
SVMLinear SVM
SVMQuadratic SVM
SVMCubic SVM
SVMFine Gaussian SVM
SVMMedium Gaussian SVM
SVMCoarse Gaussian SVM
Efficient LinearEfficient Linear Least Squares
Efficient LinearEfficient Linear SVM
EnsembleBoosted Trees
EnsembleBagged Trees
Gaussian Process RegressionSquared Exponential GPR
Gaussian Process RegressionMatern 5/2 GPR
Gaussian Process RegressionExponential GPR
Gaussian Process RegressionRational Quadratic GPR
Neural NetworkNarrow Neural Network
Neural NetworkMedium Neural Network
Neural NetworkWide Neural Network
Neural NetworkBilayered Neural Network
Neural NetworkTrilayered Neural Network
KernelSVM Kernel
KernelLeast Squares Regression Kernel
Table 2. Vegetation, elevation and SOC indices with their respective standard deviation.
Table 2. Vegetation, elevation and SOC indices with their respective standard deviation.
Land UseNDVIMSAV12BSIElevationSOC
Agriculture0.529 ± 0.2410.722 ± 0.1010.016 ± 0.2013507.76 ± 282.67318.85 ± 3.080
Forest0.611 ± 0.2490.746 ± 0.091−0.053 ± 0.2153349.449 ± 317.00225.66 ± 8.820
Grasslands/Moorlands0.615 ± 0.1980.752 ± 0.084−0.040 ± 0.1783459.552 ± 292.93912.12 ± 6.500
Table 3. Quantitative error metrics for each evaluated model. Columns labeled NDVI, MSAVI2, and BSI correspond to models where the respective index was combined with elevation and land use as predictors for SOC. In Gray the best R2 models for each index.
Table 3. Quantitative error metrics for each evaluated model. Columns labeled NDVI, MSAVI2, and BSI correspond to models where the respective index was combined with elevation and land use as predictors for SOC. In Gray the best R2 models for each index.
PresetRMSEMSER2MAEMAPE %
NDVIMSAV12BSINDVIMSAV12BSINDVIMSAV12BSINDVIMSAV12BSINDVIMSAV12BSI
Fine Tree4.715.294.7122.1727.9822.170.730.670.733.573.993.5722.8627.6822.86
Linear3.784.963.7814.2624.6314.260.830.710.832.963.932.9619.8728.7719.87
Interactions Linear3.745.033.7414.0225.3014.020.830.700.832.923.972.9220.5529.8820.55
Robust Linear3.784.983.7814.2624.8014.260.830.710.832.963.922.9619.7928.1219.79
Stepwise Linear3.784.973.7814.2724.7014.270.830.710.832.963.912.9620.2628.5720.26
Fine Tree4.715.294.7122.1727.9822.170.730.670.733.573.993.5722.8627.6822.86
Medium Tree4.375.054.3719.0725.5119.070.770.700.773.373.873.3721.6728.7121.67
Coarse Tree4.685.484.6821.9030.0621.900.730.650.733.834.393.8333.0536.5733.05
Linear SVM3.794.983.7914.3424.7814.340.830.710.832.973.912.9719.8627.4519.86
Quadratic SVM3.684.773.6813.5122.7413.510.840.730.842.873.792.8718.7728.3418.77
Cubic SVM3.815.313.8114.5028.1514.500.820.670.822.983.922.9821.7028.1521.70
Fine Gaussian SVM4.595.684.5921.1032.2121.100.740.620.743.494.233.4926.3834.2426.38
Medium Gaussian SVM3.744.373.7413.9619.1313.960.830.780.832.893.302.8919.3623.0119.36
Coarse Gaussian SVM3.764.893.7614.1723.9114.170.830.720.832.963.862.9620.3028.3520.30
Efficient Linear Least Squares9.059.249.0581.8685.4781.860.000.000.007.637.877.6379.8778.5079.87
Efficient Linear SVM9.059.249.0581.9285.3881.920.000.000.007.637.887.6381.2477.6381.24
Boosted Trees4.384.864.3819.1523.6319.150.770.720.773.333.673.3321.1925.0621.19
Bagged Trees4.174.724.1717.3622.2717.360.790.740.793.283.583.2822.0426.6022.04
Squared Exponential GPR3.724.393.7213.8819.2613.880.830.770.832.893.442.8919.6325.1819.63
Matern 5/2 GPR3.714.373.7113.7919.1213.790.830.780.832.883.412.8819.4724.8319.47
Exponential GPR3.804.483.8014.4420.0714.440.820.760.822.903.432.9019.6925.3219.69
Rational Quadratic GPR3.724.373.7213.8819.1313.880.830.780.832.893.422.8919.6324.8719.63
Narrow Neural Network3.754.913.7514.0524.1114.050.830.720.832.923.862.9220.1827.8520.18
Medium Neural Network4.205.004.2017.6125.0117.610.790.710.793.233.703.2322.0627.4022.06
Wide Neural Network6.967.416.9648.4054.8748.400.410.360.414.855.584.8534.0041.3834.00
Bilayered Neural Network4.726.034.7222.3036.3322.300.730.570.733.374.363.3723.3331.3023.33
Trilayered Neural Network4.315.124.3118.6226.1818.620.770.690.773.303.723.3021.3425.7921.34
SVM Kernel5.195.945.1926.9835.2326.980.670.590.674.054.634.0538.7643.6138.76
Least Squares Regression Kernel4.115.074.1116.9225.6716.920.790.700.793.213.963.2124.7933.6324.79
Table 4. MAE values for SOC ranges.
Table 4. MAE values for SOC ranges.
RangeSOC (g C kg−1)MAE
NDVIMSAV12BSI
10.00–12.001.911.84.9
212.01–21.003.07.32.7
321.01–35.003.312.85.2
All0.00–35.002.911.14.4
MAE (Mean Absolute Error)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fusaro, C.; Sarria-Guzmán, Y.; González-Jiménez, F.E.; Saba, M.; Coronado-Hernández, O.E.; Castrillón-Ortíz, C. Modelling the Spatial Distribution of Soil Organic Carbon Using Machine Learning and Remote Sensing in Nevado de Toluca, Mexico. Geomatics 2025, 5, 43. https://doi.org/10.3390/geomatics5030043

AMA Style

Fusaro C, Sarria-Guzmán Y, González-Jiménez FE, Saba M, Coronado-Hernández OE, Castrillón-Ortíz C. Modelling the Spatial Distribution of Soil Organic Carbon Using Machine Learning and Remote Sensing in Nevado de Toluca, Mexico. Geomatics. 2025; 5(3):43. https://doi.org/10.3390/geomatics5030043

Chicago/Turabian Style

Fusaro, Carmine, Yohanna Sarria-Guzmán, Francisco Erik González-Jiménez, Manuel Saba, Oscar E. Coronado-Hernández, and Carlos Castrillón-Ortíz. 2025. "Modelling the Spatial Distribution of Soil Organic Carbon Using Machine Learning and Remote Sensing in Nevado de Toluca, Mexico" Geomatics 5, no. 3: 43. https://doi.org/10.3390/geomatics5030043

APA Style

Fusaro, C., Sarria-Guzmán, Y., González-Jiménez, F. E., Saba, M., Coronado-Hernández, O. E., & Castrillón-Ortíz, C. (2025). Modelling the Spatial Distribution of Soil Organic Carbon Using Machine Learning and Remote Sensing in Nevado de Toluca, Mexico. Geomatics, 5(3), 43. https://doi.org/10.3390/geomatics5030043

Article Metrics

Back to TopTop