Machine Learning-Based Spatial Prediction of Soil Erosion Susceptibility Using Geo-Environmental Variables in Karst Landscapes of Southwest China
Abstract
1. Introduction
2. Study Area
3. Methodology
3.1. Data Collection
3.1.1. Experimental Data
3.1.2. Geo-Environmental Factors
- (1)
- Lithologic: The lithological setting serves as a primary factor shaping regional landforms, soils, and vegetation patterns. It significantly affects an area’s vulnerability to soil erosion, given that erosion mechanisms are strongly tied to the physicochemical characteristics of lithological materials exposed at the surface [38,39,40]. The lithological data used in this study were extracted from geological maps with a scale of 1:50,000. The identified lithologies consist of dolomite, dolomitic horizons, successions where dolomite alternates with clastic materials, lithologies outside the carbonate group, massive limestone, limestone containing thin interlayers, alternating strata of limestone and dolomite, limestone interbedded with clastic sediments, mixed carbonate–clastic rocks, and other carbonate–clastic interbedded units.
- (2)
- Topography: Topographic factors, particularly slope characteristics, exert a significant influence on the initiation and progression of soil erosion. Slope gradient is a key determinant in the formation of gullies, while slope aspect affects a range of environmental processes, including weathering intensity, soil moisture distribution, erosion rates, vegetation patterns, and overall geomorphological dynamics. The digital elevation model (DEM) applied in this research was constructed with a grid size of 30 m, and the dataset was sourced from the Geospatial Data Cloud platform (http://www.gscloud.cn/) (accessed on 1 November 2024). Using the DEM as a base dataset, key physiographic and geomorphological parameters—including elevation, slope, and aspect—were derived through spatial analysis in ArcGIS 10.8.
- (3)
- Soil texture: It is widely recognized that soil texture is an important control mechanism for runoff formation and infiltration, and therefore an important factor in the formation of depressions [41]. According to the soil census data for the study area, there are seven soil textures in the region, including silty clay loam, silt loam, loamy clay, rocks, and stony soil. The soil texture raster distribution map was generated by applying the data conversion tools within the ArcGIS software environment.
- (4)
- Water-related factors: Annual precipitation, proximity to stream channels, and stream density were incorporated into the analysis as representative hydrological factors to evaluate their impact on the initiation and subsequent development of soil erosion [42]. In this study, the stream network was delineated through hydrological analysis in ArcGIS using a 30 m resolution dataset. Based on this network, stream proximity and density layers were produced using the Euclidean Distance and Line Density tools available in ArcGIS 10.8.
- (5)
- Anthropogenic and Environmental Features:
- (a)
- Anthropogenic Factors: The anthropogenic activities significantly impact soil erosion initiation and development, for instance, roadways, where concentrated runoff often induces gully formation. During road construction, the natural drainage patterns are altered, and exposed soil becomes highly susceptible to erosion due to runoff accumulating on impermeable surfaces [2]. Therefore, A map depicting the distance from sites to nearby roads and road density were obtained with the ArcGIS10.8 distance function and Line density tools.
- (b)
- Hydrological and geomorphological processes are significantly influenced by land use, as these factors control the generation of surface runoff and the movement of sediments. The spatial pattern of land use in the study region was extracted from Landsat 7 ETM+ (30 m spatial resolution) satellite images through a supervised classification approach, and the interpretation was subsequently verified through field surveys. The land use was classified into nine primary categories: paddy field, arid land, garden area, forest, grassland, building land, bare ground, bare rock, and water. Environmental Factor: Vegetation Cover: Vegetation cover acts as an essential environmental factor controlling soil erosion, particularly in karst landscapes where vegetation stability directly affects slope erosion and sediment yield [43,44,45]. Vegetation coverage was calculated using the vegetation index NDVI, and the formula is as follows:
3.2. Soil Erosion Susceptibility Estimation Empirical Model
3.2.1. Conceptual Framework and Estimation of Soil Erosion Susceptibility Index
3.2.2. Impact Factors
- (1)
- Rainfall erosivity factor (R): The rainfall erosivity factor represents the potential of precipitation, either over a specific time period or from a single storm event, to cause soil erosion. Using the experimental rainfall data collected at 5 min intervals, the peak 30 min rainfall rates (mm/h) and the corresponding rainfall erosivity (R, MJ·mm/(ha·h)) were calculated using the following equation [45,51]:
- (2)
- Soil erodibility factor (K): Soil erodibility (K) is the property of a soil that makes it susceptible to damage by erosion. The K factor characterizes the influence of soil properties on erosion processes. The larger the K, the less vulnerable it becomes to soil erosion, and the smaller the K, the more susceptible the soil is. The K value is related to the type of soil, the texture of the soil, and the content of soil organic matter. The K value can be estimated based on organic carbon content and particle size distribution data derived from the results of soil survey [52], and the K value distribution map can be obtained by spatial interpolation.where K—soil erodibility factor; Sa—percentage of sand particles (0.05–2.0 mm); Sᵢ—percentage of silt particles (0.002–0.05 mm); Cₗ—percentage of clay particles (<0.002 mm); C—organic carbon content (%); Sn—a normalized sand fraction defined as Sa/100; and exp—exponential function.
- (3)
- Slope length–steepness factor (LS). The LS factor incorporates the impacts exerted by both slope length (L) and slope gradient (S). ArcGIS10.8 spatial analyst function was taken into consideration to estimate the LS factor. Considering the equation of Moore and Burch [53], the L factor was estimated:where L denotes the slope length factor and M represents the slope length index, and an value of 0.5 was adopted in the text, as it is suitable for estimating slope length under steeper terrain conditions.
- (4)
- Vegetation management factor (C). The C factor refers to the ratio of soil loss from cultivated land under specific farming practices to that from bare soil under continuous fallow conditions [7]. In soil research, the C factor not only mitigates soil erosion but also contributes to soil and water conservation, and its values generally range between 0 and 1. The larger the value of C, the greater the amount of soil erosion. For the assessment of vegetation coverage (fv), this study employed China’s 30 m resolution annual maximum NDVI dataset (1986–2024), which is based on U.S. Landsat remote sensing imagery. The NDVI data for 2023 were used in this study. The method described by Karydas et al. and Tian et al. was then employed to determine the C value [55,56], which was obtained by calculating the vegetation coverage (fv) to establish a linear regression equation [57].where C denotes the cover management factor, whereas fv is the fractional vegetation coverage ratio. When the vegetation coverage is over 78.3%, the amount of surface erosion is extremely weak, and the value of C is recorded as 0. When the vegetation coverage equals to 0, the value of C is 1.
- (5)
- The conservation practice factor (P). The P factor quantifies the impact of soil and water conservation measures. These interventions are designed to lower runoff quantity and flow rate, which consequently diminishes the potential for soil erosion [55,61]. The P factor generally varies between 0 and 1. A p value of 0 signifies complete prevention of soil erosion in the area, while a value of 1 indicates the absence of any erosion control measures. In this study, P-factor values were determined according to land use types extracted from 2020 Landsat 8 OLI images (30 m resolution). The images were atmospherically corrected and verified using field observations. Based on visual interpretation, the study area was classified into sloping cropland, terraced fields, and naturally vegetated land. Sloping cropland was further divided into plots with or without soil conservation measures, such as contour tillage and grass strips. Accordingly, a p value of 1.0 was assigned to sloping cropland lacking conservation measures and to naturally vegetated areas, 0.01 to terraced land, and 0.2–0.7 to sloping cropland adopting contour farming or other mechanical or vegetative practices [62,63]. The P-factor map was produced by reclassifying the land-use raster based on these criteria to maintain consistency with the RUSLE computation grid.
3.3. Machine Learning Algorithms
- (1)
- Back-propagation artificial neural network (BPANN): BPANN is a multilayer feedforward network trained using the error back-propagation algorithm, and it remains one of the most widely adopted learning methods for multilayer ANN architectures [64]. Through modifications to the learning rate and the quantity of hidden nodes, the BP algorithm calculates the connection weights from the input layer, through the hidden layer, and finally to the output layer. The training of the neural network is conducted by iteratively updating these weights so as to reduce the error between the model’s predicted values and the actual outputs [65,66].
- (2)
- Boosted regression trees (BRTs): BRTs are fitted statistical models that combine the two methods of regression trees and boosting [67]. Boosting is a powerful ensemble method that improves predictive performance through an iterative process of training new trees to correct errors made by prior models. It can be regarded as an additive regression approach, where each simple tree is incorporated step by step in a forward manner [68]. The BRT model constructs an ensemble of multiple trees, which not only overcomes the inherent shortcomings of single-tree methods but also integrates their strengths to achieve reduced model variance and enhanced predictive performance [69].
- (3)
- Random forest (RF): The RF is a nonparametric multivariate approach that can be employed for performing regression as well as classification, interaction detection, clustering, and variable selection [70,71]. This algorithm generates thousands of trees so as to develop a ‘forest’. The growth of each tree is achieved through the regression tree method, based on bootstrap samples of the data. Additionally, at each node position, a random subset of variables is selected. Based on the majority vote among all decision trees, the final model is constructed [72]. The model offers several advantages, including independence from data distributional assumptions, low computational demands, reduced risk of overfitting, suitability for high-dimensional datasets, and superior predictive accuracy [73].
- (4)
- Support Vector Regression (SVR): SVR is a regression technique developed within the framework of Support Vector Machines (SVM). SVM is an important machine learning model, with the advantages of self-learning ability, global minimum, insensitivity to changes in input data, nonlinear mapping, etc., and it also has stronger advantages in establishing the correlation prediction model than the traditional prediction model [74]. There are two core classifications of Support Vector Machine (SVM): one is Support Vector Classification (SVC) for classification tasks, and the other is Support Vector Regression (SVR) for regression tasks. SVR has a distinct characteristic: rather than minimizing the training error of observed values, it attempts to reduce the generalized error bound to the lowest possible level, thereby achieving generalized performance [75]. Studies on SVR indicate that this method serves as an effective alternative for approximating intricate engineering computations, delivering superior accuracy and enhanced robustness in function fitting [76].
3.4. Evaluation Criteria of Model Performance
4. Results
4.1. Soil Erosion Susceptibility Assessment Based on RUSLE Model
4.2. Spatial Mapping of Predictive Factors and Machine Learning Sample Datasets
4.3. Soil Erosion Susceptibility Machine Learning Models and Validation
4.4. Geo-Environmental Variables Importance Analysis
4.5. Soil Erosion Susceptibility Mapping
5. Discussion
5.1. Soil Erosion Susceptibility Estimation Models
5.2. Variable Importance in the Susceptibility Models
5.3. Soil Erosion Susceptibility Characteristics and Spatial Differentiation
6. Conclusions and Perspectives
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Koch, A.; McBratney, A.; Adams, M.; Field, D.; Hill, R.; Crawford, J.; Minasny, B.; Lal, R.; Abbott, L.; O’Donnell, A.; et al. Soil Security: Solving the Global Soil Crisis. Glob. Policy 2013, 4, 434–441. [Google Scholar] [CrossRef]
- Zhan, C.S.; Jiang, S.S.; Sun, F.B.; Jia, Y.W.; Yue, W.F.; Niu, C.W. Quantitative contribution of climate change and human activities to runoff changes in the Wei river basin, China. Hydrol. Earth Syst. Sci. 2014, 11, 2149–2175. [Google Scholar]
- Amundson, R.; Berhe, A.A.; Hopmans, J.W.; Olson, C.; Sztein, A.E.; Sparks, D.L. Soil and human security in the 21st century. Science 2015, 348, 1261071. [Google Scholar] [CrossRef]
- Lal, R.; Pimentel, D. Soil erosion: A carbon sink or source? Science 2008, 319, 1040–1042. [Google Scholar] [CrossRef]
- Park, S.; Oh, C.; Jeon, S.; Jung, H.; Choi, C. Soil erosion risk in Korean watersheds, assessed using the revised universal soil loss equation. J. Hydrol. 2011, 399, 263–273. [Google Scholar] [CrossRef]
- Xu, H.; Zhu, X.; Borrelli, P.; Cao, L.; Shao, M. Current status and medium- and long-term variation of soil erosion by water in China. Geosustainability 2025, 6, 100372. [Google Scholar]
- Wischmeier, W.H.; Smith, D.D. Predicting Rainfall Erosion Losses; USDA: Washington, DC, USA, 1978; Agricultural Handbook No. 537.
- Liu, B.Y.; Xie, Y.; Zhang, K.L. Soil Erosion Prediction Model; China Science and Technology Press: Beijing, China, 2001. [Google Scholar]
- Yan, J.; Wang, S.; Feng, J.; He, H.; Wang, L.; Sun, Z.; Zheng, C. New 30-m resolution dataset reveals declining soil erosion with regional increases across Chinese mainland (1990–2022). Remote Sens. Environ. 2025, 323, 114681. [Google Scholar]
- Yin, C.; Bai, C.; Zhu, Y.; Shao, M.; Han, X.; Qiao, J. Future soil erosion risk in China: Differences in erosion driven by general and extreme precipitation under climate change. Earth Future 2025, 13, e2024EF005390. [Google Scholar] [CrossRef]
- Zheng, M.; Cai, Q.; Cheng, Q. Modelling the runoff-sediment yield relationship using a proportional function in hilly areas of the Loess Plateau, North China. Geomorphology 2008, 93, 288–301. [Google Scholar] [CrossRef]
- Beasley, D.B.; Huggins, L.F.; Monke, A. ANSWERS: A model for watershed planning. Trans. ASAE 1980, 23, 938–0944. [Google Scholar] [CrossRef]
- Laflen, J.M.; Lwonard, J.L.; Foster, G.R. WEPP a new generation of erosion prediction technology. J. Soil Water Conserv. 1991, 46, 34–38. [Google Scholar] [CrossRef]
- Xie, S.; Zhang, R.; Wang, M. The study on sediment yield model of rainfall in loess hilly gully region of the middle reaches of the Yellow River. In Water and Sediment of the Yellow River Foundation; Papers on Changes in Water and Sediment of the Yellow River; Yellow River Water Conservancy Press: Zhengzhou, China, 1993; Volume 5, pp. 238–274. [Google Scholar]
- Tang, L. The study on the basin sediment model. Adv. Water Sci. 1996, 7, 47–53. [Google Scholar] [CrossRef]
- Cai, Q.; Wang, G.; Chen, Y. Process and Simulation of Soil Erosion and Sediment Yield in Small Watershed on the Loess Plateau; Science Press: Beijing, China, 1998. [Google Scholar]
- Kinnell, P.I.A.; Wang, J.; Zheng, F. Comparison of the abilities of WEPP and the USLE-M to predict event soil loss on steep loessal slopes in China. Catena 2018, 171, 99–106. [Google Scholar] [CrossRef]
- Srivastava, A.; Brooks, E.S.; Dobre, M.; Elliot, W.J.; Wu, J.Q.; Flanagan, D.C.; Gravelle, J.A.; Link, T.E. Modeling forest management effects on water and sediment yield from nested, paired watersheds in the interior Pacific Northwest, USA using WEPP. Sci. Total Environ. 2020, 701, 134877. [Google Scholar] [CrossRef]
- Morgan, R.; Quinton, J.; Smith, R.; Govers, G.; Poesen, J.; Auerswald, K.; Chisci, G.; Torri, D.; Styczen, M. The European Soil Erosion Model (EUROSEM): A dynamic approach for predicting sediment transport from fields and small catchments. Earth Surf. Process. Landf. 1998, 23, 527–544. [Google Scholar] [CrossRef]
- Wang, H.; Cai, Q.; Zhu, Y. Evaluation of the EUROSEM Model for predicting water erosion on steep slope land in the Three Gorges Reservoir Area, China. Geogr. Res. 2003, 22, 579–589. [Google Scholar]
- Arnold, J.G.; Srinivasan, R.; Ramanarayanan, T.S.; DiLuzio, M. Water Resources of the Texas Gulf Basin. Water Sci. Technol. 1999, 39, 121–133. [Google Scholar] [CrossRef]
- Abdelwahab, O.M.M.; Ricci, G.F.; De Girolamo, A.M.; Gentile, F. Modelling soil erosion in a Mediterranean watershed: Comparison between SWAT and AnnAGNPS models. Environ. Res. 2018, 166, 363–376. [Google Scholar] [CrossRef]
- Panagopoulos, Y.; Dimitriou, E.; Skoulikidis, N. Vulnerability of a northeast Mediterranean island to soil loss. Can grazing management mitigate erosion? Water 2019, 11, 1491. [Google Scholar] [CrossRef]
- Ouyang, Z.-Y.; Wang, X.-K.; Miao, H. China’s eco-environmental sensitivity and its spatial heterogeneity. Acta Ecol. Sin. 2000, 20, 9–12. (In Chinese) [Google Scholar]
- Liu, K.; Kang, Y.; Cao, M.; Tang, G.; Sun, G. GIS-Based Assessment on Sensitivity to Soil and Water Loss in Shaanxi Province. J. Soil Water Conserv. 2004, 18, 168–170. (In Chinese) [Google Scholar]
- Abedini, M.; Ghasemian, B.; Shirzadi, A.; Shahabi, H.; Chapi, K.; Pham, B.T.; Bin Ahmad, B.; Tien Bui, D. A novel hybrid approach of Bayesian Logistic Regression and its ensembles for landslide susceptibility assessment. Geocarto Int. 2018, 34, 1427–1457. [Google Scholar] [CrossRef]
- Conoscenti, C.; Agnesi, V.; Angileri, S.; Cappadonia, C.; Rotigliano, E.; Märker, M. A GIS-based approach for gully erosion susceptibility modelling: A test in Sicily, Italy. Environ. Earth Sci. 2013, 70, 1179–1195. [Google Scholar] [CrossRef]
- Vu, D.T.; Tran, X.L.; Cao, M.T.; Tran, T.C.; Hoang, N.D. Machine learning based soil erosion susceptibility prediction using social spider algorithm optimized multivariate adaptive regression spline. Measurement 2020, 164, 108066. [Google Scholar] [CrossRef]
- Wang, X.; Zhong, X.; Fan, J. Assessment and spatial distribution of sensitivity of soil erosion in Tibet. Acta Ecol. Sin. 2004, 59, 183–188. [Google Scholar] [CrossRef]
- Chen, P.; Hu, L.; Li, Y.; Deng, O. Sensitivity assessment and spatial distribution of soil erosion in Longmen Mountains region. Bull. Soil Water Conserv. 2017, 37, 237–241. [Google Scholar]
- Asmamaw, L.B.; Mohammed, A.A. Identification of soil erosion hotspot areas for sustainable land management in the Gerado catchment, North-eastern Ethiopia. Remote Sens. Appl. Soc. Environ. 2019, 13, 306–317. [Google Scholar] [CrossRef]
- Zhang, E.; Peng, S.; Feng, H. Sensitivity Assessment of Soil Erosion and Its Spatial Pattern Evolution in Dianchi Lake Basin Based on GIS and RUSLE. J. Soil Water Conserv. 2020, 34, 115–122. [Google Scholar]
- Rahmati, O.; Tahmasebipour, N.; Haghizadeh, A.; Pourghasemi, H.R.; Feizizadeh, B. Evaluation of different machine learning models for predicting and mapping the susceptibility of gully erosion. Geomorphology 2017, 298, 118–137. [Google Scholar] [CrossRef]
- Nguyen, K.A.; Chen, W.; Lin, B.S.; Seeboonruang, U. Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements. ISPRS Int. J. Geo Inf. 2021, 10, 42. [Google Scholar] [CrossRef]
- Zhao, Y. Distribution of Fragile Ecological Environment Types and Their Comprehensive Management in China; China Environmental Science Press: Beijing, China, 1994; pp. 1–6. (In Chinese) [Google Scholar]
- Li, Y.Q.; Deng, O.; Yang, G.B.; Fang, Q.B. Distribution Characteristics of Rainfall erosivity R Value in Yellow Soil Area of Central Guizhou Karst Mountainous Region. Bull. Soil Water Conserv. 2021, 41, 31–49. (In Chinese) [Google Scholar]
- Deng, O.; Li, M.; Yang, B.; Yang, G.; Li, Y. Erosive Rainfall Thresholds Identification Using Statistical Approaches in a Karst Yellow Soil Mountain Erosion-Prone Region in Southwest China. Agriculture 2024, 14, 1421. [Google Scholar] [CrossRef]
- Conforti, M.; Aucelli, P.P.C.; Robustelli, G.; Scarciglia, F. Geomorphology and GIS analysis for mapping gully erosion susceptibility in the Turbolo stream catchment (Northern Calabria, Italy). Nat. Hazards 2010, 56, 881–898. [Google Scholar] [CrossRef]
- Luo, X.L.; Bai, X.Y.; Tan, Q.; Chen, H.; Ran, C.; Xi, H.P. Effect of lithology background on the correlation between soil erosion and rock desertification. Acta Ecol. Sin. 2018, 38, 8717–8725. [Google Scholar] [CrossRef]
- Xie, B.; Yang, G.B.; Yang, Q.Q.; Li, Y.Q.; Fang, Q.B. Study on spatial differentiation of lithology and soil erosion in central Guizhou. J. Nat. Sci. Hunan Norm. Univ. 2023, 46, 10–16. [Google Scholar]
- Chen, L.; Zhang, Y.; Wang, Q.; Yang, Y. Identification and analysis of soil erosion characteristics and main influencing factors in the karst mountainous are-as of Hubei and Chongqing. J. Hubei Univ. Nat. Sci. 2020, 42, 172–178, 184. [Google Scholar]
- Ren, Q.; Yan, Y.; Gan, Y.; Fu, W.; Dai, Q.; Gao, R.; Lan, X. Effects of Short-Duration High-Intensity Rainfall on Erosion and Sediment Yield of Typical Karst Slope Farmland. J. Soil Water Conserv. 2019, 33, 105–112. [Google Scholar]
- Zhang, F.; Xiong, K.; Chen, H.; Yang, Z.; Fan, Y. Soil Erosion Factors and Ecobenefit Evaluation in the Karst Plateau-Mountain Region—With a Special Reference to Shiqiao Catchment of Bijie in Guizhou. Res. Soil Water Conserv. 2009, 16, 88–92, 97. (In Chinese) [Google Scholar]
- Yang, Q.; Yang, G.B.; Zhao, Q.; Dai, L. Characteristics of runoff sediment yield on slopes under different rainfall and vegetation cover in karst areas. Bull. Soil Water Conserv. 2020, 40, 9–16. (In Chinese) [Google Scholar]
- Peng, R.; Deng, H.; Li, R.; Li, Y.; Yang, G.; Deng, O. Plot-Scale Runoff Generation and Sediment Loss on Different Forest and Other Land Floors at a Karst Yellow Soil Region in Southwest China. Sustainability 2023, 15, 57. [Google Scholar] [CrossRef]
- Lin, N.; Sun, P.; Tang, J.; Li, Z.; Wang, C. Quantitative Study of Water and Soil Conservation Value on Songnen Plain. J. Soil Water Conserv. 2006, 20, 155–159. [Google Scholar]
- Zeng, C. Soil Erosion Evolution and Spatial Correlation Analysis in a Typical Karst Geomorphology Using RUSLE with GIS; Guizhou Normal University: Guiyang, China, 2018. [Google Scholar]
- Gao, J.B.; Zuo, L.Y.; Wang, H. The spatial trade-offs and differentiation characteristics of ecosystem services in karst peak-cluster depression. Acta Ecol. Sin. 2019, 39, 7829–7839. [Google Scholar] [CrossRef]
- Zhou, Y.; Jiang, Q.; Liu, L.; He, C.; Zhou, Y.; Kassandro, M. Soil and water loss and ecological risk assessment in southeastern Tibet of southwestern China. J. Beijing For. Univ. 2023, 45, 88–98. [Google Scholar]
- Mo, J.; Chen, Y.; Mo, W. Comparison of key indicators and evaluation models of soil and water loss sensitivity in karst ecosystem. Res. Soil Water Conserv. 2021, 28, 257–266. [Google Scholar]
- Wischmeir, W.H. Estimating the Loss Equation’s Cover and Management Factor for Undisturbed Areas. In Proceedings of Sediment Yield Workshop; U.S. Department of Agriculture Sedimentation Laboratory: Oxford, MS, USA, 1972. [Google Scholar]
- Williams, J.R.; Renard, K.G.; Dyke, P.T. EPIC: A New Method for Assessing Erosion’s Effect on Soil Productivity. J. Soil Water Conserv. 1983, 38, 381–383. [Google Scholar] [CrossRef]
- Moore, I.; Burch, G. Physical Basis of the Length Slope Factor in the Universal Soil Loss Equation. Soil Sci. Soc. Am. J. 1986, 50, 1294–1298. [Google Scholar] [CrossRef]
- Liu, B.Y.; Nearing, M.A.; Risse, L.M. Slope gradient effects on soil loss for steep slopes. Trans. ASAE 1994, 37, 1835–1840. [Google Scholar] [CrossRef]
- Karydas, C.G.; Sekuloska, T.; Silleos, G.N. Quantification and site-specification of the support practice factor when mapping soil erosion risk associated with olive plantations in the Mediterranean island of Crete. Environ. Monit. Assess. 2009, 149, 19–28. [Google Scholar] [CrossRef]
- Tian, Y.C.; Zhou, Y.M.; Wu, B.F.; Zhou, W.F. Risk assessment of water soil erosion in upper basin of Miyun Reservoir, Beijing, China. Environ. Geol. 2009, 57, 937–942. [Google Scholar] [CrossRef]
- Cai, C.; Ding, S.; Shi, Z.; Huang, L.; Zhang, G. Study of Applying USLE and Geographical Information System IDRISI to Predict Soil Erosion in Small Watershed. J. Soil Water Conserv. 2000, 14, 19–24. (In Chinese) [Google Scholar]
- Schönbrodt, S.; Behrens, T.; Seeber, C.; Scholten, T. Assessing the USLE crop and management factor C for soil erosion modeling in a large mountainous watershed in Central China. J. Earth Sci. 2010, 21, 835–845. [Google Scholar] [CrossRef]
- Wang, S.; Zhang, K.; Fan, J. Evaluation of RUSLE C-factor estimation based on vegetation coverage in typical karst catchments of southwest China. Catena 2020, 188, 104459. [Google Scholar]
- Zhang, X.; Li, P.; Liu, Y. Quantitative assessment of vegetation effects on soil erosion in karst mountainous regions using RUSLE and remote sensing data. Ecol. Indic. 2021, 121, 107140. [Google Scholar]
- Renard, K.G.; Foster, G.R.; Weesies, G.A.; McCool, D.K.; Yoder, D.C. Predicting Soil Erosion by Water: A Guide to Conservation Planning with the Revised Universal Soil Loss Equation (RUSLE); U.S. Department of Agriculture: Washington, DC, USA, 1997; Agriculture Handbook No. 703; pp. 1–251.
- Bu, Z.; Sun, J.; Zhou, F. A study on quantitative remote sensing method of soil erosion and its application. ACTA Pedol. Sin. 1997, 34, 235–245. [Google Scholar]
- You, S.; Li, W. Estimation of soil erosion supported by GIS—A case study in Guanji township, Tai he, Jiangxi. J. Nat. Resour. 1999, 14, 62–68. [Google Scholar]
- Rumelhart, D.; Hinton, G.; Williams, R. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Leung, H.; Haykin, S. The complex backpropagation algorithm. IEEE Trans. Signal Process. 1991, 39, 2101–2104. [Google Scholar] [CrossRef]
- Ding, S.; Su, C.; Yu, J. An optimizing BP neural network algorithm based on genetic algorithm. Artif. Intell. Rev. 2011, 36, 153–162. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Aertsen, W.; Kint, V.; Van Orshoven, J.; Özkan, K.; Muys, B. Comparison and ranking of different modelling techniques for prediction of site index in Mediterranean mountain forests. Ecol. Model. 2010, 221, 1119–1130. [Google Scholar] [CrossRef]
- Sahour, H.; Gholami, V.; Vazifedan, M.; Saeedi, S. Machine learning ap-plications for water-induced soil erosion modeling and mapping. Soil Tillage Res. 2021, 211, 105032. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Liaw, A.; Wiener, M. Classification and regression by random Forest. R News 2002, 2, 18–22. [Google Scholar]
- Micheletti, N.; Foresti, L.; Robert, S.; Leuenberger, M.; Pedrazzini, A.; Jaboyedoff, M.; Kanevski, M. Machine learning feature selection methods for landslide susceptibility mapping. Math. Geosci. 2014, 46, 33–57. [Google Scholar] [CrossRef]
- Caruana, R.; Niculescu-Mizil, A. An empirical comparison of supervised learning algorithms. In Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, PA, USA, 25–29 June 2006; pp. 161–168. [Google Scholar]
- Vapnik, V. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 1995. [Google Scholar]
- Ma, J.; Theiler, J.; Perkins, S. Accurate On-line Support Vector Regression. Neural Comput. 2003, 15, 2683–2703. [Google Scholar] [CrossRef]
- Gao, J.B.; Gunn, S.R.; Harris, C.J. Mean Field Method for the Support Vector Machine Regression. Neurocomputing 2003, 50, 391–405. [Google Scholar] [CrossRef]
- Li, Y.; Zhu, J. An assessment on regional Soil Erosion Sensitivity based on GIS: Taking Nujiang Prefecture as an area of study. J. Yunnan Univ. 2017, 39, 98–106. (In Chinese) [Google Scholar]
- Yang, G.; Li, Y.; An, Y. Pixel-based assessment and spatial distribution of sensitivity of soil erosion in Guizhou. Carsologica Sin. 2006, 25, 73–78. [Google Scholar]








| Factors | Very Low | Low | Moderate | High | Very High |
|---|---|---|---|---|---|
| Rank Value | 1 | 2 | 3 | 4 | 5 |
| R | ≤2000 | 2000–3000 | 3000–4000 | 4000–5000 | >5000 |
| K | 0 | 0–0.20 | 0.20–0.40 | 0.40–0.60 | >0.60 |
| LS | ≤10 | 10–20 | 20–30 | 20–30 | >30 |
| C | 0 | 0–0.15 | 0.15–0.30 | 0.30–0.60 | >0.60 |
| P | 0 | 0–0.35 | 0.35–0.60 | 0.60–0.80 | >0.80 |
| Data Set | Index | BPANN | BRTs | RF | SVR | ||||
|---|---|---|---|---|---|---|---|---|---|
| Train | Test | Train | Test | Train | Test | Train | Test | ||
| DS2 | RMSE | 0.680 | 1.230 | 0.081 | 0.119 | 0.187 | 0.234 | 0.790 | 0.871 |
| MAE | 0.170 | 0.263 | 0.036 | 0.085 | 0.101 | 0.186 | 0.114 | 0.206 | |
| R2 | 0.515 | 0.463 | 0.996 | 0.816 | 0.843 | 0.673 | 0.445 | 0.464 | |
| DS2 | RMSE | 0.493 | 0.867 | 0.368 | 0.733 | 0.374 | 0.816 | 0.798 | 1.062 |
| MAE | 0.076 | 0.115 | 0.032 | 0.107 | 0.035 | 0.163 | 0.263 | 0.669 | |
| R2 | 0.545 | 0.422 | 0.858 | 0.616 | 0.815 | 0.642 | 0.431 | 0.417 | |
| DS3 | RMSE | 0.574 | 0.816 | 0.033 | 0.065 | 0.374 | 0.463 | 0.433 | 0.865 |
| MAE | 0.236 | 0.307 | 0.101 | 0.107 | 0.132 | 0.194 | 0.201 | 0.494 | |
| R2 | 0.654 | 0.543 | 0.929 | 0.736 | 0.854 | 0.603 | 0.499 | 0.416 | |
| Average | RMSE | 0.582 | 0.971 | 0.161 | 0.306 | 0.312 | 0.504 | 0.674 | 0.933 |
| MAE | 0.161 | 0.228 | 0.056 | 0.100 | 0.090 | 0.181 | 0.193 | 0.456 | |
| R2 | 0.571 | 0.476 | 0.928 | 0.723 | 0.837 | 0.639 | 0.458 | 0.432 | |
| Susceptibility Class | BPANN | BRTs | RF | SVR | ||||
|---|---|---|---|---|---|---|---|---|
| Km2 | % | Km2 | % | Km2 | % | Km2 | % | |
| Very low | 544.08 | 8.22 | 300.50 | 4.54 | 347.50 | 5.25 | 499.07 | 7.54 |
| Low | 835.32 | 12.62 | 662.56 | 10.01 | 743.31 | 11.23 | 993.51 | 15.01 |
| Moderate | 1749.40 | 26.43 | 1406.54 | 21.25 | 1531.64 | 23.14 | 1867.88 | 28.22 |
| High | 2302.75 | 34.79 | 2853.45 | 43.11 | 2662.16 | 40.22 | 2193.54 | 33.14 |
| Very high | 1187.45 | 17.94 | 1395.95 | 21.09 | 1334.39 | 20.16 | 1065.00 | 16.09 |
| Total | 6619.00 | 100.00 | 6619.00 | 100.00 | 6619.00 | 100.00 | 6619.00 | 100.00 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, B.; Li, Y.; Li, M.; Deng, O.; Yang, G.; Lei, X. Machine Learning-Based Spatial Prediction of Soil Erosion Susceptibility Using Geo-Environmental Variables in Karst Landscapes of Southwest China. Land 2025, 14, 2277. https://doi.org/10.3390/land14112277
Yang B, Li Y, Li M, Deng O, Yang G, Lei X. Machine Learning-Based Spatial Prediction of Soil Erosion Susceptibility Using Geo-Environmental Variables in Karst Landscapes of Southwest China. Land. 2025; 14(11):2277. https://doi.org/10.3390/land14112277
Chicago/Turabian StyleYang, Binglan, Yiqiu Li, Man Li, Ou Deng, Guangbin Yang, and Xinyong Lei. 2025. "Machine Learning-Based Spatial Prediction of Soil Erosion Susceptibility Using Geo-Environmental Variables in Karst Landscapes of Southwest China" Land 14, no. 11: 2277. https://doi.org/10.3390/land14112277
APA StyleYang, B., Li, Y., Li, M., Deng, O., Yang, G., & Lei, X. (2025). Machine Learning-Based Spatial Prediction of Soil Erosion Susceptibility Using Geo-Environmental Variables in Karst Landscapes of Southwest China. Land, 14(11), 2277. https://doi.org/10.3390/land14112277

