Next Article in Journal
Coupled and Coordinated Development of the Tourism Industry and Urbanization in Marginal and Less Developed Regions—Taking the Mountainous Border Areas of Western Yunnan as a Case Study
Previous Article in Journal
Effects of Main Land-Use Types on Plant and Microbial Diversity and Ecosystem Multifunctionality in Degraded Alpine Grasslands
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Study on Soil Erosion Driving Forces by Using (R)USLE Framework and Machine Learning: A Case Study in Southwest China

1
School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454150, China
2
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
3
Shenzhen Engineering Laboratory of Ocean Environmental Big Data Analysis and Application, Shenzhen 518055, China
*
Author to whom correspondence should be addressed.
Land 2023, 12(3), 639; https://doi.org/10.3390/land12030639
Submission received: 31 January 2023 / Revised: 3 March 2023 / Accepted: 5 March 2023 / Published: 8 March 2023

Abstract

:
Soil erosion often leads to land degradation, agricultural production reduction, and environmental deterioration, which seriously restricts the sustainable development of regions. Clarifying the driving factors of soil erosion is the premise of preventing soil erosion. Given the lack of current research on the driving factors/force changes of soil erosion in different regions or under different erosion intensity grades, this paper pioneered to use machine learning methods to address this problem. Firstly, the widely used (Revised) Universal Soil Loss Equation ((R)USLE) framework was applied to simulate the spatial distribution of soil erosion. Then, the K-fold algorithm was used to evaluate the accuracy and stability of five machine learning algorithms for fitting soil erosion. The random forest (RF) method performed best, with average accuracy reaching 86.35%. Then, the Permutation Importance (PI) and the Partial Dependence Plot (PDP) methods based on RF were introduced to quantitatively analyze the main driving factors under different geological conditions and the driving force changes of each factor under different erosion intensity grades, respectively. Results showed that the main drivers of soil erosion in Chongqing and Guizhou were cover management factors (PI: 0.4672, 0.4788), while that in Sichuan was slope length and slope factor (PI: 0.6165). Under different erosion intensity grades, the driving force of each factor shows nonlinear and complex inhibitory or promoting effects with factor value changing. These findings can provide scientific guidance for the refined management of soil erosion, which is significant for halting or reversing land degradation and achieving sustainable use of land resources.

1. Introduction

Soil erosion usually leads to the loss of soil nutrients, land degradation, agricultural production reduction, increase in runoff, and aggravation of geological disasters such as landslides and debris flows [1,2]. According to statistics, the global average soil loss is 12~15 tons per hectare per year, severely threatening the ecological environment and human life safety [3,4]. The Sustainable Development agenda put forward by the United Nations in 2015 clearly points out that halting or reversing land degradation is one of the goals that urgently need to be achieved [5]. Topography, vegetation coverage, precipitation, soil texture, and other factors jointly affect soil erosion intensity. For example, the slope affects the soil erosion rate, and the vegetation canopy can reduce the direct scour of the soil surface caused by heavy rain. Studying the driving factors of soil erosion and quantifying their influence on soil erosion plays an important role in developing appropriate measures for soil erosion control and ecological restoration [6].
In the quantitative study of soil erosion, the (Revised) Universal Soil Loss Equation ((R)USLE) families are the most widely used and effective methods [7,8]. Borrelli et al. [9] constructed a Global Applications of Soil Erosion Modeling Tracker (GASEMT) database including 435 soil-erosion models, in which the models belonging to the (R)USLE families are by far the most widely used soil erosion models worldwide, covering 41% of the total applications in the database. In addition, they also found a strong rising trend of the (R)USLE-type applications across all continents. So the (R)USLE framework is favored by scholars because of its full consideration of influence factors, simple model, and easily accessed data [10,11,12,13,14]. It has been applied successfully to the soil erosion study in many areas of China, including the mountainous regions in southern Yunnan, the Loess Plateau, the middle and lower reaches of the Yangtze River, and the Karst areas achieved good results [15,16,17,18].
In recent years, based on the quantitative evaluation of soil erosion, scholars have further focused on its driving factors [19]. Zhang et al. [20], Chen et al. [21], Yang et al. [22], graded or classified the soil erosion factors such as slope, elevation, and land cover type, and analyzed the soil erosion amount of each factor under different erosion intensity or classifications. However, this method does not consider the effect of multi-factor synergism on soil erosion. Wang et al. [4], Zhao et al. [23], Matomela et al. [24], and other scholars have quantitatively analyzed the driving factors of soil erosion using the geographic detector method, which can realize the identification of driving forces based on the synergistic effect of two factors. As we know, soil erosion results from multi-factor coupling, and this method is still insufficient to explain the complex multi-factor process of soil erosion. Machine learning has the ability to learn relevant rules from data automatically. It can effectively explain multivariable correlation and has been applied to research on driving forces in many fields [25], such as land cover change [26], soil moisture content [27], forest fire [28], power demand [29], and locust plague outbreak [30], and has shown good application potential. However, few scholars have applied machine learning algorithms to studies on soil erosion driving forces.
In the past decade, global land use change has increased the total amount of soil erosion by 2.5%, and the soil erosion rate is one to two orders of magnitude higher than the soil formation rate [31,32]. China has the most severe soil erosion, with broad soil erosion areas and high erosion rates. According to the Ministry of Ecology and Environment statistics, the area of soil loss in China was 271.08 × 104 km2 in 2019, and the soil loss in southwest China is the most serious [4]. Most of the research on soil erosion driving forces carried out by domestic scholars in southwest China is mainly in a single region, lacking comparisons among soil erosion driving factors in different geological environments. In fact, in the same area, the main driving factor of soil erosion may change under different erosion tensity grades. Still, relevant studies are rarely reported, which is significant for implementing refined soil erosion management.
In summary, the current research on soil erosion driving forces still has some problems, such as few considerations of the synergy effects of multiple factors and unclear change rules of factor driving force under different erosion intensity grades. This situation makes it difficult to carry out refined soil erosion management and restricts the sustainable use of land resources to a certain extent. Based on the estimation of soil erosion modulus with multi-source data, this paper takes the lead in applying machine learning methods to study the driving factors and force changes of soil erosion in Southwest China. The main contribution of this paper is to evaluate the fitting ability of various machine learning models to soil erosion, explore the main driving factors of soil erosion under the synergistic action of multiple factors, and quantify the complex changes of driving forces of each factor under different erosion intensity grades. Specifically, this paper aims: (1) to determine an optimal machine learning algorithm that has good performance on soil erosion fitting; (2) to quantify the complex change relationship of erosion factors on soil erosion under different erosion intensity grades; and (3) to provide suggestions for refined management of soil erosion to support the sustainable development goal of halting land degradation.

2. Materials and Methods

2.1. Study Area and Data Sources

2.1.1. Study Area

Southwest China, as the main distribution area of hydraulic erosion in China, has complex topography and geomorphology, numerous rivers, and rich natural resources (Figure 1a). Sichuan, Guizhou, and Chongqing suffer from severe soil erosion every year. Sichuan Province is located in the upper reaches of the Yangtze River, the terrain is high in the west and low in the east, and it crosses the first and second steps of China. The mountains and hills are widely distributed, the topographic gradient is large, and hydraulic erosion is severe in Sichuan. According to statistics, about 24.9% of the areas in Sichuan suffer from soil erosion [33]. Guizhou Province is a Karst plateau with the main uplift in Sichuan Basin, northern Guangxi, and western Hunan. The landform is mainly mountainous, and the terrain is broken, with an average elevation of more than 1100 m. Because of the unique geological structure and the destruction of forest vegetation, Guizhou has become one of the high-risk areas for soil loss in China. Since 1981, an area of 47,247.37 km2 of soil erosion in Guizhou has been governed [34]. Due to the large number of mountains, large slopes, uneven seasonal distribution of precipitation, and spatial differences in soil types in Chongqing, regional soil erosion is relatively strong [35]. According to the statistics of the Chongqing Conservancy Bureau, soil loss occurred in 30.52% of the city in 2020. To sum up, the topography, geological environment, vegetation cover conditions, and soil erosion intensity grades in the above three regions are different (Figure 1b). Therefore, this study selects Sichuan, Chongqing, and Guizhou as the study area.

2.1.2. Data Sources

The data used in this study mainly include soil, climate, land use, topography, vegetation cover, and other data sources. The rainfall erosivity factor (R) is calculated from the monthly surface rainfall data set of 0.5° × 0.5° in China obtained by the National Meteorological Center. The data set is of good quality after cross-verification and error analysis. The soil erodibility factor (K) is calculated from the Chinese soil data set based on the Harmonized World Soil Database (HSWD) provided by the National Qinghai–Tibet Plateau Data Center. The slope length and steepness factors (LS) are calculated from the DEM data supplied by NASA JPL, with a spatial resolution of 30 m. The cover-management factor (C) is calculated from MODIS Surface Reflectance products (MYD09GA). This product has a resolution of 500 m and is used to calculate NDVI. The support practice factor (p) is assigned by the land cover product MCD12Q1 (IGBP). It is derived using supervised classifications of MODIS Terra and Aqua reflectance data. The study area includes 16 land cover types (Figure 2). Due to the unavailability of some soil erosion factor data in recent years, the relatively new year of 2020 was selected for research. Details of each data set are shown in Table 1.

2.2. Methods

The research method of this paper includes three parts: (1) Estimation of soil erosion based on the (R)USLE framework, including the calculation of erosion factors and the classification of soil erosion intensity grades. (2) Soil erosion simulation of five machine learning models based on the K-fold algorithm. (3) Explanation of machine learning on driving force of soil erosion based on Permutation Importance (PI) and Partial Dependence (PDP) methods. The technical flowchart is shown in Figure 3.

2.2.1. Estimation of Soil Erosion Based on the (R)USLE Framework

The RUSLE framework is an improved USLE model put forward by the Agricultural Research Institute of the United States Department of Agriculture (USDA) in 1992 and implemented in 1997 [36], which is suitable for a broader range of empirical models [37]. Its mathematical formula is as follows:
A = R · K · L S · C · P
where A is the average annual soil erosion modulus (t·km−2·a−1); R is the rainfall erosivity factor; K is the soil erodibility factor; LS is the slope length and steepness factors; C is the cover-management factor; and P is the support practice factor.
In order to better adapt to different landforms of China, this paper comprehensively referred to the application cases of (R)USLE framework in China and selected proper calculation methods for factors LS, C, and p to construct a new (R)USLE model suitable for the study area base on the original USLE model. The calculation methods for each factor are as follows:
(1)
Calculation of rainfall erosivity factor (R)
Rainfall is the main driving factor of soil erosion, and raindrop splash and runoff erosion are the main forms of soil erosion [38]. In this study, the rainfall erosivity model based on monthly rainfall established by Wischmeier and Smith [39] is used to calculate factor R. The relationship between rainfall and erosivity is an exponential function. The annual rainfall is calculated through the accumulation of monthly rainfall, and the rainfall erosivity factor is calculated. The mathematical formula is shown in Equation (2):
R = i = 1 12 1.735 × 10 [ 1.5 lg ( p i 2 p ) 0.8188 ]
where R is the rainfall erosivity factor; pi is monthly rainfall (in mm); p is annual rainfall (in mm).
(2)
Calculation of soil erodibility factor (K)
Soil erodibility factor K is calculated by the soil erosion and productivity impact estimation model (EPIC) [3]. The soil organic matter and the particle composition were used to calculate the K value. The mathematical formula is shown in Equation (3):
K = 0.1317 × { 0.2 + 0.3 exp ( 0.0256 S A N ( 1 S I L 100 ) ] } × [ S I L C A L + S I L ] 0.3 × { 1.0 0.25 C [ C + exp ( 3.72 2.95 C ) ] } × { 1.0 0.7 S N 1 [ S N 1 + exp ( 5.51 + 22.9 S N 1 ) ] }
where SAN is the subsoil sand fraction; SIL is the subsoil silt fraction; CAL is the subsoil clay fraction (in %); C is the topsoil carbon content (in %). SN1 = 1 − SAN/100.
(3)
Calculation of slope length and steepness factors (LS)
Topography is the basic geographical factor affecting soil erosion. Estimates of slope length and steepness over large areas are usually calculated from DEM [38]. The slope length factor L adopts the empirical formula proposed by Fu et al. [40]. Its mathematical formulas are shown in Equations (4) and (5):
L = ( λ 22.13 ) m
m = { 0.2       θ 0.5 ° 0.3       θ 1.5 ° 0.4       θ 3 °       0.5       θ > 3 °      
where λ denotes the slope length (in m); m is the slope length index; θ is the slope angle (in °).
The slope steepness factor S adopts the slope factor calculation formula put forward by Zhang et al. [41]. The mathematical formula is shown in Equation (6):
S = { 10.8 s i n θ + 0.03           θ < 5 ° 16.8 s i n θ 0.5               θ 5 ° 21.9 s i n θ 0.96           θ 10 °
where θ is the slope angle.
Then the LS is calculated by L multiply S.
(4)
Calculation of cover-management factor (C)
Factor C adopts the calculation method proposed by Cai et al. [42] and Li and Zhen [43]. The mathematical formula is shown in Equations (7) and (8):
N D V I C = N D V I i N D V I m i n N D V I m a x N D V I m i n
C = exp ( a × N D V I C b N D V I C ]
where NDVIC is the value of factor C of the calculated pixel, NDVImax and NDVImin are the maximum and minimum values of the whole study area; a and b are parameters determining the shape of the NDVIC curve, and their values are 2 and 1, respectively.
(5)
Calculation of support practice factor (P)
The conservation practice factor P reflects the influence of specific conservation measures on soil erosion. It ranges from 0 to 1, with 0 indicating where soil erosion will not occur and 1 indicating the area where no soil and water conservation measures have been taken. The value of factor P was assigned based on the global vegetation classification product of the International Geosphere-Biosphere Programme (IGBP) in 2020 and the assignment rules of references You and Li [44] and Cha, Deng, and Gu [45]. It is generally considered that soil and water conservation measures are not taken in the herbaceous cover, natural vegetation, broad-leaved trees, shrubs, mosaic herbs, and grasslands, so factor P in these areas is assigned 1. Usually, soil erosion does not occur in rivers and impervious surface distribution areas. So factor P in these areas is given 0. The p value of barren land (at least 60% of area is non-vegetated barren (sand, rock, soil) areas with less than 10% vegetation) is given 0.4, and that of cropland is 0.7.
All factors were resampled to 30 m resolution and unified into the same projection coordinate system of WGS_1984_Albers. At last, the soil erosion intensity was graded into six grades according to the Standards for classification and gradation of soil erosion issued by the Ministry of Water Resources of the People’s Republic of China [46] (Table 2).

2.2.2. Soil Erosion Simulation Based on Machine Learning and K-Fold Algorithm

Five traditional machine learning models were selected in this study to verify the fitting accuracy of various machine learning methods for soil erosion, including the tree-based model (Decision tree, DT), the linear model (Logistics Regression, LR), the kernel function-based model (Support Vector Machine, SVM), and the integrated algorithm (Random Forest, RF, and Gradient Boosting Regression Tree, GBRT). The principles of the above machine learning algorithms can be found in references [47,48,49,50,51].
We obtained training data by randomly spreading points by taking the soil erosion intensity result of 2020 as labels and the factors R, K, LS, C, and P as features. The points in the three regions were obtained by randomly scattering 2,000,000 points in the whole study area and ensuring the distance of the adjacent points was no less than 100 m. We finally obtained 1,658,265 points due to the distance constraints and counted the number of points falling into the three regions, respectively. A total of 1,048,575 data points were obtained in Sichuan, 97,211 data points in Chongqing, and 512,479 data points in Guizhou. Each factor was normalized from 0 to 1 by using the membership ambiguity function. Five algorithms were applied to fit the soil erosion modulus, including DT, LR, SVM, RF, and GBRT. The K-fold algorithm was used to evaluate the accuracy and stability of the models, and then the best model was selected to study the driving forces of soil erosion.
The K-fold algorithm divides the data into K disjoint subsets called packets. For each packet, the remaining data are used to train a model, and then the data of this packet are used in the trained model to generate predictions. When the K packets are processed in a loop, the predicted values of each packet are summarized and compared with the actual target to evaluate the model’s accuracy [50,52]. The accuracy can be calculated by Equation (9):
A c c u r a c y = T P + T N T P + T N + F P + F N
where TP stands for True Positive; TN stands for True Negative; FP stands for False Positive; and FN stands for False Negative.

2.2.3. Driving Forces Explanation of Soil Erosion Based on Machine Learning

(1)
PI method for exploring the main driving factor of soil erosion
The Permutation Importance (PI) is a heuristic method proposed by Altmann et al. [53] to normalize the measure of feature importance, which can correct the deviation of feature importance. It can identify potential feature interactions in the model to understand the importance of features to the results. The PI method randomly disrupts the single column of the verification data and makes the target and all other data columns stay where they are. It uses the resulting data set to predict and calculate the loss function through prediction and actual target values and judge the importance of variables by comparing the accuracy of the disrupted data. In this study, PI was used to analyze the main driving factors of soil erosion in different regions. The flow of the algorithm is shown in Algorithm 1.
Algorithm 1. PI implementation process.
Input: Fitting pre-training model (m), Dataset (D)
Calculate the model accuracy of dataset(D) about S
For columns j (columns D):
  Several repetitions 1, …, k:
  Randomly scramble the column j, Generate corrupted data Dk,j
  Calculate (Dk,j) the accuracy Sk,j
  Importance calculation formula is shown in Equation (10):
i j = s 1 K k = 1 K S k , j
Note: Sk,j: Accuracy; K: Number of repetitions; ij: Importance of one factor (j: randomly scramble column).
(2)
PDP method for exploring driving force changes of each factor
Although the PI method can simulate the driving effect of multi-factor coupling by disturbing the order of features, it cannot reflect the impact intensity of each factor change on soil erosion. Partial Dependence Plot [54] (PDP) can show the marginal effect of one or more features on the prediction results to explain whether the influence of features on the results is linear, monotonous, or more complex (Equation (11)). PDP assumes that the feature XC is not related to other feature XS, and marginalizes the resulting output to the feature distribution in set C. By marginalizing other features, a function that only depends on the features in S is obtained to explore the relationship between the features S and the predicted results. PI shows which feature impacts the results most, while PDP shows how features affect the model results. In this study, the PDP was used to analyze the driving force variation of each erosion factor under different erosion intensity grades.
f s ( X s ) = 1 N i = 1 N f ( X S , X i C )
where fS is the prediction model; S is the independent variable; and C is the other independent variables set.

3. Results

3.1. Soil Erosion Intensity Distribution in the Study Area

Figure 4 shows the distribution maps of the five soil erosion driving factors. The higher the value of these factors, the easier it is to promote soil erosion. Although Figure 4 shows that these five factors have no strong relation in their spatial distribution, they jointly determine the spatial distribution of soil erosion in the study area. These maps can help us understand how the driving factors work when the soil erosion tensity changes.
The study area’s average annual soil erosion modulus in 2020 is estimated based on (R)USLE. In order to further analyze the spatial distribution characteristics of soil erosion intensity, the soil erosion results were graded according to Algorithm 1. The spatial distribution maps of soil erosion intensity in Sichuan, Guizhou, and Chongqing are shown in Figure 5. The soil erosion intensity is graded into six grades: very light, light, moderate, strong, very strong, and extremely strong.
It can be seen from Figure 5 that the overall degree of soil erosion in Sichuan Province is relatively high, and the areas with strong, very strong, and extremely strong soil erosion intensity grades are mainly concentrated in the northwest of Sichuan. The erosion intensity grades in Chongqing and Guizhou are mostly very light or light.
The area and proportion of each soil erosion intensity grade and average annual soil erosion modulus in each region are shown in Table 3. The results showed that about 32% of the areas had apparent soil erosion in 2020, in which the strong to extremely strong grades accounted for 18.97% of the total area, and the average erosion modulus was 50.22 t·km−2·a−1. The overall risk of soil erosion in Sichuan is rather high. The results are relatively consistent with the 14.71% proportion of strong and more intense erosion areas in Sichuan reported in the 2020 Soil and Water Conservation Bulletin. The soil erosion intensity in Guizhou and Chongqing is generally in very light and light grades, and the risk of soil erosion is relatively low. However, there are still areas above moderate erosion intensity grade in these two regions, which is consistent with the results of Wang et al. [4] of southwest China.
Figure 5a shows that the spatial distribution of soil erosion in Sichuan has apparent erosion-graded performance, which is more severe than that in Chongqing and Guizhou. The topography and land cover in Sichuan are more complex than those in Chongqing and Guizhou. Therefore, Sichuan Province will be taken as an example in the subsequent analysis of the driving forces of various factors under different erosion intensity grades.

3.2. Soil Erosion Simulation Based on Machine Learning

Five machine learning models, including DT, LR, SVM, RF, and GBRT, were used to fit the soil erosion in the three regions, and the accuracy of each model was evaluated based on the K-fold algorithm. Each model has been cross-validated ten times, and the accuracy evaluation results are shown in Table 4 and Figure 6. The results show that the fitting accuracy of RF is the highest in the three regions with an average accuracy of 86.35%. At the same time, from the fluctuation degree of fitting accuracy in the three regions, it can be seen that RF has the highest stability.
It can be found from Figure 6 that the accuracies of SVM and LR are always at a low level in 10 cross-validations, which is because the idea of SVM and LR is to make the best partition through a line (or a higher-dimensional equivalence class) [48,49]. The dividing line of this kind of algorithm is linear, so these two algorithms do not perform very well in multi-classification scenarios. RF is an integrated algorithm based on DT. Evaluating the importance of input features is an important function of RF. A random process is introduced into the model to calculate the accuracy of the model by disturbing the out-of-bag data that does not participate in DT training. Hence, RF has higher accuracy than DT. Although GBRT is also an integrated algorithm based on DT, the prediction accuracy of the model is not stable without recalibration, and the time complexity of GBRT is much higher than that of RF [55,56]. Therefore, RF was selected to quantitatively attribute the driving force of soil erosion in this study.

3.3. Driving Factors and Driving Force Variation of Soil Erosion

3.3.1. Main Driving Factors of Soil Erosion in Different Regions

Based on RF and PI methods, the importance of soil erosion factors in the three regions in 2020 was ranked (Table 5 and Figure 7). It can be seen that the main driving factors of soil erosion are different in the three regions. The most important driving factor in Chongqing and Guizhou is factor C, while that in Sichuan is factor LS.
The landforms of Sichuan vary significantly from east to west, with a complex topography and great height difference. The eastern part of Sichuan is one of the largest basins in China, the northwest region is the northwest plateau, and the southwest is the northern section of the Hengduan Mountains. Therefore, the LS factor (PI = 0.6165 ± 0.0029) is the main driving factor of soil erosion in Sichuan. On the other hand, the overall vegetation coverage in Sichuan is 69%, which is lower than that in Chongqing and Guizhou, making it another reason why the LS factor becomes the main driving factor. Despite the LS factor, the C factor also shows its second important position in Sichuan, which means that the C factor is a significant factor in soil erosion.
The land use in Guizhou is characterized by prominent mountain features, various land-use types, a high land reclamation rate, a large proportion of sloping farmland, and large forest and grassland areas. Building land and unused land in Guizhou only account for 13.87%, making the C factor (PI = 0.4788 ± 0.0012) the main driver of soil erosion in Guizhou. At the same time, due to the diversity of land use types in Guizhou, the importance of the P factor for soil erosion is higher than that of the K factor compared with Sichuan and Chongqing.
Chongqing is located in the upper reaches of the Yangtze River and belongs to the subtropical monsoon humid climate, making evergreen broad-leaved forest the typical vegetation. The vegetation coverage density is high in Chongqing, with a vegetation cover area of 75,989.2 km2, accounting for about 92.25% of the total area of the city. Under the dual influence of climate and vegetation, the C factor has become the main driving factor of the whole region (PI = 0.4672 ± 0.0042). At the same time, the landforms of Chongqing are mainly hills and mountains, of which mountains account for 76%, making the LS factor the second important factor of soil erosion (PI = 0.4259 ± 0.0075).
As we all know, heavy rainfall will cause soil loss, and a certain amount of water current scour is necessary for soil loss [57]. The rainfall in the three regions is mainly light rain, and the number of light rain days accounts for about 75% of the total precipitation days [58]. Therefore, the importance of factor R in all three regions shows the ranking third, indicating that although factor R is not ranked high, it also has a certain importance.

3.3.2. Variation of the Driving Forces under Different Erosion Intensity Grades

The above research shows that the soil erosion intensity grades in Guizhou and Chongqing are mainly very light and light. The soil erosion intensity in Sichuan is relatively high, and about 32% of the areas have apparent soil erosion. Therefore, this study takes Sichuan as an example to analyze the driving force changes of factors under different erosion intensity grades to put forward scientific and reasonable soil and water control measures for soil erosion. Figure 8 shows the PDP diagram of each driving factor for soil erosion in Sichuan, which can characterize the changing trend of driving forces of various factors under different erosion intensity grades.
Vegetation can reduce the erosion of precipitation on the ground and use roots to stabilize the soil. It can be found from Figure 8a that there is a negative correlation between the cover-management factor (C) and soil erosion under the grade of very light. Under the erosion intensity grades of light, moderate, and strong, the driving force of the C factor on soil erosion first increases and then weakens. However, under the very strong erosion intensity grade, the C factor changed from a negative correlation (very light) to a strong positive correlation to soil erosion. Under the extremely strong grade, the increase of the C factor shows the phenomenon of restraining soil erosion. So, we could know that maintaining greening can effectively prevent and control soil erosion under low erosion intensity grades, while strengthening greening can reduce the impact of soil erosion under very light, moderate, and strong erosion intensity grades. Under the very strong and extremely strong grades, the inhibitory effect of the C factor on soil erosion gradually weakened along with the increase of the C factor. It is because soil erosion is the result of multi-factor coupling. Most of the very strong and extremely strong grades are distributed in areas with great height differences and complex topography, and the enhancement of vegetation coverage cannot restrain the growth rate of soil erosion.
Different soil types have different textures, and the abilities of soil erosion resistance are also different. The soil erodibility factor (K) reflects the reducing effect after implementing soil and water conservation measures, and the value is between 0 and 1. Figure 8b show that the K factor will inhibit soil erosion at the moderate erosion intensity grade. At the erosion intensity grades of strong and above, the K factor positively correlates with soil erosion in general.
It is found that there is an obvious linear relationship between the Support Practice Factor (P) and soil erosion, which is because when we assign values to land cover types, most of the values are concentrated in 0 and 1. It is difficult to capture the influence of small amount of data on the results in the process of calculation, so selecting higher resolution land cover data to explain the changes of soil erosion under different grades is the content that needs further discussion.
The slope length and steepness factors (LS) affect the soil erosion intensity by acting on the discharge and velocity of the slope runoff. Figure 8d shows that the LS factor’s effect on soil erosion is very complex and related to Sichuan’s physical geography. The LS factor is negatively correlated with soil erosion at the strong grade and below. At the extremely strong grade, soil erosion will increase dramatically with the increase of slope length and steepness. Therefore, at the extremely strong erosion intensity grade, proper construction of a gentle slope and reduction of height difference can effectively prevent and control soil erosion.
The rainfall factor (R) is the unit rainfall acting on the surface, producing the energy of soil and water flow. Figure 8e shows a significant positive correlation between R factors and soil erosion under extremely strong and very strong grades. Therefore, strengthening waterproofing under high erosion intensity grades can effectively prevent and control soil erosion.
Based on the above analysis, PDP shows that different measures can be taken to prevent and control soil erosion under different grades. For example, strengthening greening can effectively prevent and control soil erosion at the middle and low erosion intensity grades. In contrast, building gentle slopes, reducing height differences, and enhancing waterproofing can effectively prevent soil erosion at a high erosion intensity grade.

4. Discussion

4.1. (R)USLE Validation

Due to the significant terrain undulation of the study area and the higher time consumption for field measurement of soil loss, it is hard to obtain actual field data. So, we used previous studies and widely approved soil loss products to verify the reliability of the (R)USLE results from both qualitative and quantitative aspects. Qualitatively, com-pared with the results of Chen, Xiong, and Lan [34] and Wang et al. [4] in southwest China, the spatial distribution results of soil erosion intensity in this study are basically consistent with the previous results. Quantitatively, we used the soil erosion monitoring system, considered could be used in scientific research, proposed by Ouellette [59] in 2021 to calculate the soil erosion modulus of Chongqing in 2020 to compare with the result of this study. Products provided by this system are created using sentinel-2 data to monitor the NDVI of the whole year and the change of time series of bare soil to calculate soil erosion modulus. The results show that the average annual soil erosion modulus calculated by sentinel-2 in Chongqing is 0.63 t·km−2·a−1 (Figure 9), which is only lower 5.97% than the result of 0.67 t·km−2·a−1 of this study. Although, due to the limitation of the computing power of the system platform, the annual soil erosion modulus of Sichuan and Guizhou were not obtained, it also proved that our results estimated by the (R)USLE framework are reliable to some extent.

4.2. Capability of Machine Learning to Explain the Driving Force of Soil Erosion

When the parameters are not re-calibrated, the RF is the most accurate and stable model in fitting soil erosion, with an average accuracy of 86.35%, which means that the machine learning method can be used to estimate soil erosion precisely. Compared with the traditional single-factor driving force analysis, and two-factor interaction based on the geographic detector, the PI method adopted in this paper can realize the importance rank of all driving factors by fully considering the synergistic effect of multiple factors and then identifying the main soil erosion factors in different regions. The PI results proved differences in the main driving factor in different regions. Furthermore, the PDP results can provide more information than the PI method by showing whether the influence of factors on soil erosion is linear, positive, negative, or more complex. It shows that the main driving factors may change when the soil erosion intensity grades vary. In addition, the PDP can also explain the changing trend of driving forces of various factors under different erosion intensity grades by analyzing continuous variables other than discrete variables as the two-factor interaction method does.
In conclusion, the machine learning method can not only simulate soil erosion with high precision and obtain the main driving factors of different regions, but also provide more information about the complex changing process of the driving forces along with factors changing at different erosion intensity grades. So, we conclude that the machine learning method has an excellent capability to explain the driving forces of soil erosion.

4.3. Analysis of Driving Force of Soil Erosion

Clarifying the driving factors and their driving force changing low over complex geographical conditions is the key to preventing soil erosion effectively. Our result shows that the main driving factor of soil erosion in Sichuan is the LS factor, which is consistent with the study that carried out in the Tuojiang River Basin in Sichuan [60]. Liu et al. [61] analyzed the driving factor of soil erosion in the Sancha River Basin in Guizhou, and concluded that vegetation factors dominated soil erosion, which was also consistent with C being the main driving factor of soil erosion in Guizhou in this work. As for the main driving factor of soil erosion in Chongqing, though many research have studied the influencing factors of soil erosion, no one had pointed out which factor was dominated the soil erosion (which study that analyze only some areas of Chongqing) [62,63]. This paper finds that the cover-management factor is the main driving factor of soil erosion in Chongqing. All these findings can provide guidance for making general governance measures of soil erosion.
As we know, under different environmental conditions, the main driving factor of soil erosion may be different, especially in large areas with significant terrain undulation like Sichuan. So, more specific information for how driving force changing when conditions are different is need to make refined and adapted-to-local management measures. From the PDP diagram, we found that each factor’s driving force generally shows a trend from inhibition to promotion along with the soil erosion tensity changing from very light to extremely strong. The variation trend of each factor has its own characteristics. Taking the LS factor as an example, in the very light grade, it always showed an inhibitory effect; in the light and moderate grades, it presented a trend of promoting first and then inhibiting; while in the strong and very strong grades, it presented an opposite trend, and finally tended to a gentle trend. In the extremely strong grade, the LS factor showed a nonlinear trend of increased promoting influence. The above information indicates that the driving force of each factor on soil erosion is constantly changing under different erosion intensity grades. By comparing the changing trend of each factor under the same erosion tensity grade, we found that the C and LS factors were the two main driving factors of soil erosion. With the erosion tensity getting strong, the influence of the LS factor was gradually stronger than that of the C factor. These findings are different from previous studies, and they can provide more detailed information for refined management of soil erosion. Therefore, compared with identifying the main driving factor, digging the relationship between factors’ driving force change trend and different erosion rates is more important.

4.4. Suggestions for Soil Erosion Prevention and Control Measures

Soil erosion is considered a major environmental threat in terrestrial ecosystems, which seriously impacts the ecological environment and productivity [64]. The formation of soil erosion patterns is caused by many factors, such as K, LS, P, R, and C. Therefore, it is necessary to analyze the factors affecting erosion intensity patterns from many aspects. In this paper, the erosion intensity grades of strong, very strong, and extremely strong in Sichuan account for 18.91% of the total area, and the erosion intensity is high in the west and low in the southeast. The causes of soil erosion are complex and changeable. So, the quantitative description of the driving force changes can effectively explain the action form of factors under different erosion intensity grades and then provide scientific guidance for the formulation of corresponding soil erosion prevention and control measures in different erosion areas.
Combined with the soil erosion tensity and driving force analysis results in Sichuan, erosion hot areas could be located. Then corresponding protective measures in different areas could be taken, which could provide scientific decision making for regional sustainable development and effectively reduce the investment of funds. Most areas with strong and extremely strong erosion intensity grades in Sichuan have steep topography and low vegetation cover. In addition, due to limited cultivated land resources, Sichuan has a lot of unreasonable land use, especially in western Sichuan, where the natural conditions are bad. The main geomorphological types are high mountains and extremely high mountains with a significant disparity in topography, making soil erosion especially severe. The main land-use patterns in the areas with moderate erosion intensity grade and below are shrubland, sparse forest, and cultivated land. The regional vegetation coverage is relatively high, and soil erosion is relatively slight. Based on the above understanding, this paper puts forward the following suggestions for the prevention and control of soil erosion in Sichuan: (1) to strengthen greening and returning farmland to the areas of very light, light, moderate, and strong erosion intensity grades; and (2) to build gentle slope, reduce height difference and improve the erosion resistibility in the areas of extremely strong and very strong erosion intensity grades.

4.5. Research Limitations and Prospects

(1)
Due to the significant terrain undulation of the study area and the continuous long-term time requirement of field soil loss data obtaining, which may take a whole year, we did not use the actual field data to validate the results of (R)USLE. Instead, we verified the reliability of (R)USLE results qualitatively and quantitatively by comparing the distribution of soil erosion and the erosion rate of this paper with previous studies or existing widely approved soil loss products, which is a limitation of this study. In the future, with the accumulation of field monitoring data on soil erosion by local supervision departments of water and soil conservation, this issue may be addressed.
(2)
According to the main driving factor and the driving force change trend, this paper provided several targeted suggestions for soil erosion management of Sichuan Province. Although each driving factor was finally sampled to a unified spatial resolution of 30 m, the original resolution of the factor R is 0.5° (~55 km), making the spatial information not detailed enough. When the driving factor data obtained are fine enough, we can extend the application of machine learning on soil erosion driving force to a smaller scale, such as river watersheds that are prone to soil erosion, to support the managers to carry out more targeted and accurate prevention and control measures.
(3)
In this study, we conducted a detailed analysis of soil erosion and its driving force in Sichuan Province in 2020. In the future, we can develop long-term monitoring of soil erosion and its driving force changes on this basis. Then, combined with the governance measures taken during the study period and the driving factors changes, evaluate the effectiveness of different governance measures, and explore better treatment schemes for different erosion intensity grades.

5. Conclusions

Based on the (R)USLE framework, this paper reveals the distribution pattern of soil erosion in Sichuan, Guizhou, and Chongqing. Based on machine learning algorithms, the soil erosion of southwest China was simulated, the main driving factors of soil erosion in different regions were analyzed, and the driving force changes of soil erosion factors under different erosion intensity grades were explained. The results show that the machine learning method has an excellent capability for fitting the soil erosion and exploring the driving force of soil erosion. The RF model performed better than models DT, LR, SVM and GBRT in fitting soil erosion with an average accuracy of 86.35%. PI method can be used to rank the importance of all driving factors. The PDP method can explain well whether the influence of factors on soil erosion is linear, positive, negative, or more complex. These results can provide scientific guidance for the refined management of soil erosion, then contribute to halting or reversing land degradation and achieving sustainable use of land resources.

Author Contributions

Conceptualization: Y.G. and L.Z.; methodology: Y.G. and L.Z.; software, Y.G.; writing—original draft preparation, Y.G. and L.Z.; writing—review and editing, J.C., X.L., H.L., Y.R. and Z.W.; supervision, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China [42171323, 42001286] and the Sustainable Development Project of Science, Technology and Innovation Commission of Shenzhen Municipality [KCXFZ202002011006298].

Data Availability Statement

The data that support the findings of this study are openly available in Google Earth Engine at https://developers.google.com/earth-engine/datasets/ (accessed on 4 March 2022), in the National Meteorological Center of China at http://data.cma.cn (accessed on 4 March 2022)and in the National Qinghai-Tibet Plateau Data Center at http://data.tpdc.ac.cn (accessed on 4 March 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ganasri, B.; Ramesh, H. Assessment of Soil Erosion by RUSLE Model Using Remote Sensing and GIS-A Case Study of Nethravathi Basin. Geosci. Front. 2016, 7, 953–961. [Google Scholar] [CrossRef] [Green Version]
  2. Mirzaee, S.; Ghorbani-Dashtaki, S.; Mohammadi, J.; Asadi, H.; Asadzadeh, F. Spatial Variability of Soil Organic Matter Using Remote Sensing Data. Catena 2016, 145, 118–127. [Google Scholar] [CrossRef]
  3. Tanyaş, H.; Kolat, Ç.; Süzen, M.L. A New Approach to Estimate Cover-Management Factor of RUSLE and Validation of RUSLE Model in the Watershed of Kartalkaya Dam. J. Hydrol. 2015, 528, 584–598. [Google Scholar] [CrossRef]
  4. Wang, M.; Wang, H.S.; Jiang, C.; Sun, J.X. Spatial Soil Erosion Patterns and Quantitative Attribution Analysis in Southwestern China Based on RUSLE and Geo-Detector Model. J. Basic. Sci. Eng. 2021, 29, 1386–1402. [Google Scholar] [CrossRef]
  5. United Nations (UN). Transforming Our World: The 2030 Agenda for Sustainable Development; A/RES/70/1; United Nations: New York, NY, USA, 2015; Available online: https://undocs.org/en/A/RES/70/1 (accessed on 4 March 2022).
  6. Wang, H.Q. The Spatiotemporal Variations and Drivers Analysis of Soil Erosion in the Orthern North China’s Region. Beijing For. Univ. 2020, 4, 100082. (In Chinese) [Google Scholar]
  7. Niu, L.N.; Shao, Q.Q.; Liu, G.B.; Tang, Y.Z. Analysis on Spatiotemporal Characteristics and Factors of Soil Erosion in Liupanshui City. J. Geo-Inf. Sci. 2019, 21, 13. (In Chinese) [Google Scholar]
  8. Rao, E.; Xiao, Y.; Ouyang, Z.; Yu, X. National Assessment of Soil Erosion and Its Spatial Patterns in China. Ecosyst. Healthsust. 2015, 1, 1–10. [Google Scholar] [CrossRef]
  9. Borrelli, P.; Alewell, C.; Alvarez, P.; Anache, J.A.A.; Baartman, J.; Ballabio, C.; Bezak, N.; Biddoccu, M.; Cerdà, A.; Chalise, D.; et al. Soil Erosion Modelling: A Global Review and Statistical Analysis. Sci. Total Environ. 2021, 780, 146494. [Google Scholar] [CrossRef]
  10. Dissanayake, D.; Morimoto, T.; Ranagalage, M. Accessing the Soil Erosion Rate Based on RUSLE Model for Sustainable Land Use Management: A Case Study of the Kotmale Watershed, Sri Lanka. Model. Earth. Syst. Environ. 2019, 5, 291–306. [Google Scholar] [CrossRef]
  11. Koirala, P.; Thakuri, S.; Joshi, S.; Chauhan, R. Estimation of Soil Erosion in Nepal Using a RUSLE Modeling and Geospatial Tool. Geosciences 2019, 9, 147. [Google Scholar] [CrossRef] [Green Version]
  12. Thapa, P. Spatial Estimation of Soil Erosion Using RUSLE Modeling: A Case Study of Dolakha District, Nepal. Environ. Syst. Res. 2020, 9, 1–10. [Google Scholar] [CrossRef]
  13. Kebede, Y.S.; Endalamaw, N.T.; Sinshaw, B.G.; Atinkut, H.B. Modeling Soil Erosion Using RUSLE and GIS at Watershed Level in the Upper Beles, Ethiopia. Environ. Chall. 2021, 2, 100009. [Google Scholar] [CrossRef]
  14. Gong, W.; Liu, T.; Duan, X.; Sun, Y.; Zhang, Y.; Tong, X.; Qiu, Z. Estimating the Soil Erosion Response to Land-Use Land-Cover Change Using GIS-Based RUSLE and Remote Sensing: A Case Study of Miyun Reservoir, North China. Water 2022, 14, 742. [Google Scholar] [CrossRef]
  15. Chen, F.; Li, H.B. Spatial-Temporal Variations of Soil Erosion in Southern Yunnan Mountainous Area Using GIS and RUSLE: A Case Study in Yunyang County, Yunnan Province, China. Chin. J. Appl. Ecol. 2021, 32, 9. [Google Scholar] [CrossRef]
  16. Mu, X.M.; Li, P.F.; Liu, B.T.; Zhao, G.J.; Gao, P.; Sun, W.Y. Spatial-Temporal Development and Driving Mechanisms of Erosion on the Chinese Loess Plateau Between 1901 and 2016. Yellow River 2022, 44, 36–45. (In Chinese) [Google Scholar]
  17. Gao, J.; Wang, H. Temporal Analysis on Quantitative Attribution of Karst Soil Erosion: A Case Study of a Peak-Cluster Depression Basin in Southwest China. Catena 2019, 172, 369–377. [Google Scholar] [CrossRef]
  18. Fang, G.; Yuan, T.; Zhang, Y.; Wen, X.; Lin, R. Integrated Study on Soil Erosion Using RUSLE and GIS in Yangtze River Basin of Jiangsu Province (China). Arab. J. Geosci. 2019, 12, 1–13. [Google Scholar] [CrossRef]
  19. Yao, K.; Zhou, B.; He, L.; Liu, B.; Luo, H.; Liu, G.L.; Li, Y.X. Change of Soil Erosion in the Upper Reaches of the Minjiang River Based on Geo-Detector. Res. Soil Water Conserv. 2022, 29, 85–91. (In Chinese) [Google Scholar]
  20. Zhang, S.; Xiong, D.H.; Wu, H.; Yuan, Y.; Li, W.X.; Zhang, W.D. Research on Spatial Variation of Soil Erosion in Sunshui River Basin Based on RUSLE Model. J. Soil Water Conserv. 2021, 35, 7. (In Chinese) [Google Scholar]
  21. Chen, H.; Jiang, X.C.; Ren, L.; Tang, X.Q.; Xiao, X. Quantitative Evaluation of Soil Erosion in the Huaihe River Basin Based on RUSLE Model. Chin. J. Soil Sci. 2021, 52, 165–176. (In Chinese) [Google Scholar]
  22. Yang, M.H.; Yao, Z.H.; Cao, L.H.; Huang, J.; Wang, S.X.; Yan, A. Spatial-Temporal Variation Characteristics of Soil Erosion in Ji River Demonstration Zone Based on RUSLE. China Rural. Water Hydropower 2019, 8, 178–185. (In Chinese) [Google Scholar]
  23. Zhao, Y.; Liu, L.; Kang, S.; Ao, Y.; Han, L.; Ma, C. Quantitative Analysis of Factors Influencing Spatial Distribution of Soil Erosion Based on Geo-Detector Model under Diverse Geomorphological Types. Land 2021, 10, 604. [Google Scholar] [CrossRef]
  24. Matomela, N.; Li, T.; Ikhumhen, H.O.; Raimundo Lopes, N.D.; Meng, L. Soil Erosion Spatio-Temporal Exploration and Geodetection of Driving Factors Using InVEST-Sediment Delivery Ratio and Geodetector Models in Dongsheng, China. Geocarto. Int. 2022, 37, 13039–13056. [Google Scholar] [CrossRef]
  25. Li, J.N.; Xiong, R.B.; Lan, Y.Y.; Pong, L.; Guo, J.F.; Chen, X.Q. Overview of The Frontier Progress of Causal Machine Learning. J. Comput. Res. Dev. 2023, 60, 59–84. [Google Scholar]
  26. Meng, Y.R. Quantitative Assessment of the Importance of Bio-Physical Drivers of Land Cover Change Based on a Random Forest Method; Northwest A&F University: Xianyang, China, 2021. (In Chinese) [Google Scholar]
  27. Du, M.C.; Zhang, J.Y.; Wang, Z.L.; Liu, C.S.; Wang, G.Q. Soil Moisture Content Change Characteristics and Driving Factors of Maize Farmland in the Yanglou Watershed, Northern Anhui Province. J. Water. Res. Water. Eng. 2022, 33, 206–214. [Google Scholar]
  28. Zhu, Z.; Zhao, G.; Wang, Q.H.; Gao, Z.L.; Deng, X.F.; Huang, P.J. Driving Factors of Forest Fire and Fire Risk Zoning in Kunming City. J. Zhejiang A F Univ. 2022, 39, 380–387. (In Chinese) [Google Scholar]
  29. He, Y.; Guang, F.; Chen, R. Prediction of Electricity Demand of China Based on the Analysis of Decoupling and Driving Force. Iet. Gener. Transm. Dis. 2018, 12, 3375–3382. [Google Scholar] [CrossRef]
  30. Sun, R.; Huang, W.; Dong, Y.; Zhao, L.; Zhang, B.; Ma, H.; Geng, Y.; Ruan, C.; Xing, N.; Chen, X.; et al. Dynamic Forecast of Desert Locust Presence Using Machine Learning with a Multivariate Time Lag Sliding Window Technique. Remote Sens. 2022, 14, 747. [Google Scholar] [CrossRef]
  31. Amundson, R.; Berhe, A.A.; Hopmans, J.W.; Olson, C.; Sztein, A.E.; Sparks, D. Soil and human security in the 21st century. Science 2015, 348, 1261071. [Google Scholar] [CrossRef] [Green Version]
  32. Borrelli, P.; Robinson, D.A.; Fleischer, L.R.; Lugato, E.; Ballabio, C.; Alewell, C.; Meusburger, K.; Modugno, S.; Schütt, B.; Ferro, V. An assessment of the global impact of 21st century land use change on soil erosion. Nat. Commun. 2017, 8, 2013. [Google Scholar] [CrossRef] [Green Version]
  33. Yao, K.; Zhou, B.; He, L.; Li, Y.X. Temporal and Spatial Dynamic Changes of Soil Erosion in Mountain Area of Southwest Sichuan Province During 2000-2018. Bull. Soil Water Conserv. 2021, 41, 224–230+251. (In Chinese) [Google Scholar]
  34. Chen, Q.W.; Xiong, K.N.; Lan, A.J. Assessment of Erosion Risk in Guizhou Province Based on Geological Information System (GIS) Technology. J. Yangtze River Sci. Res. Inst. 2020, 37, 47–52+66. (In Chinese) [Google Scholar]
  35. Xiao, Y.; Ou, Y.Z.Y.; Xu, W.H.; Xiao, K.; Xiao, Q. Gis-Based Spatial Analysis of Soil Erosion and Soil Conservation in Chongqing, China. Acta Ecol. Sin. 2013, 35, 7130–7138. [Google Scholar] [CrossRef]
  36. Renard, K.G. Predicting Soil Erosion by Water: A Guide to Conservation Planning with the Revised Universal Soil Loss Equation (RUSLE); United States Government Printing: Washington, DC, USA, 1997; ISBN 0-16-048938-5.
  37. Chen, Y.B.; Liang, J.M.; Yu, W.Y. The Characteristics of Spatial-Temporal Differentiation of Soil Erosion Based on Rusle Model: A Case Study of Chaoyang City, Liaoning Province. Geogr. Sci. 2015, 35, 365–372. (In Chinese) [Google Scholar]
  38. Chen, S.X.; Yang, X.H.; Xiao, L.L.; Cai, H.Y. Study of Soil Erosion in the Southern Hilly Area of China Based on RUSLE Model. Resour. Sci. 2014, 36, 10. (In Chinese) [Google Scholar]
  39. Wischmeier, W.H.; Smith, D.D. Predicting Rainfall Erosion Losses: A Guide to Conservation Planning; Department of Agriculture; United States Department of Agriculture: Washington, DC, USA, 1978; pp. 1–69.
  40. FU, S.H.; Liu, B.Y.; Zhou, G.Y.; Sun, Z.X.; Zhu, X.L. Calculation tool of topographic factors. Sci. Soil. Water. Conserv. 2015, 13, 105–110. [Google Scholar] [CrossRef]
  41. Zhang, Y.; Liu, B.Y.; Shi, P.J.; Jiang, Z.S. Crop Cover Factor Estimating for Soil Loss Prediction. Acta Ecol. Sin. 2001, 21, 7. (In Chinese) [Google Scholar]
  42. Cai, C.F.; Ding, S.W.; Shi, Z.H.; Huang, L.; Zhang, G.Y. Study of Applying USLE and Geographical Information System IDRISI to Predict Soil Erosion in Small Watershed. J. Soil Water Conserv. 2000, 14, 6. (In Chinese) [Google Scholar]
  43. Li, T.H.; Zhen, L.N. Soil Erosion Changes in the Yanhe Watershed From 2001 to 2010 Based on RUSLE Model. J. Nat. Res. 2012, 27, 12. [Google Scholar] [CrossRef]
  44. You, S.C.; Li, W.N. Estimation of Soil Erosion Supported by GIS–A Case Study of Guanxi Township, Taihe County, Jiangxi Province. J. Nat. Resour. 1999, 14, 63–69. (In Chinese) [Google Scholar]
  45. Cha, L.S.; Deng, G.H.; Gu, J.C. Dynamic Changes of Soil Erosion in the Chaohu Watershed from 1992 to 2013. Acta Geogr. Sin. 2015, 70, 1708–1719. (In Chinese) [Google Scholar]
  46. Ministry of Water Resources of the People’s Republic of China. Standards for Classification and Gradation of Soil Erosion; Ministry of Water Resources of the People’s Republic of China: Beijing, China, 2008. (In Chinese)
  47. Myles, A.J.; Feudale, R.N.; Liu, Y.; Woody, N.A.; Brown, S.D. An Introduction to Decision Tree Modeling. J. Chemometr. 2004, 18, 275–285. [Google Scholar] [CrossRef]
  48. Hilbe, J.M. Logistic Regression Models; Chapman and Hall/CRC: Boca Raton, FL, USA, 2009. [Google Scholar] [CrossRef]
  49. Suthaharan, S. Support Vector Machine. In Machine Learning Models and Algorithms for Big Data Classification; Springer: Berlin/Heidelberg, Germany, 2016; pp. 207–235. Available online: https://link.springer.com/chapter/10.1007/978-1-4899-7641-3_9 (accessed on 4 March 2022).
  50. Rigatti, S.J. Random Forest. J. Insur. Med. 2017, 47, 31–39. [Google Scholar] [CrossRef] [Green Version]
  51. Huang, Y.; Liu, Y.; Li, C.; Wang, C. GBRTVis: Online Analysis of Gradient Boosting Regression Tree. J. Vis. Jpn. 2019, 22, 125–140. [Google Scholar] [CrossRef]
  52. Brink, H.; Richards, J.; Fetherolf, M. Real-World Machine Learning; Simon and Schuster: New York, NY, USA, 2016. [Google Scholar]
  53. Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation Importance: A Corrected Feature Importance Measure. Bioinformatics 2010, 26, 1340–1347. [Google Scholar] [CrossRef] [Green Version]
  54. Greenwell, B.M. Pdp: An R Package for Constructing Partial Dependence Plots. R J. 2017, 9, 421. [Google Scholar] [CrossRef] [Green Version]
  55. Wang, Y.; Zhou, Z.H.; Zhou, A.Y. Machine Learning and Its Application; Tsinghua University Publishing House Co., Ltd.: Beijing, China, 2006. [Google Scholar]
  56. Bernard, S.; Heutte, L.; Adam, S. Influence of Hyperparameters on Random Forest Accuracy. In Proceedings of the International Workshop on Multiple Classifier Systems, Reykjavik, Iceland, 10–12 June 2009; Springer: Berlin/Heidelberg, Germany, 2009; pp. 171–180. Available online: https://link.springer.com/chapter/10.1007/978-3-642-02326-2_18 (accessed on 4 March 2022). [CrossRef] [Green Version]
  57. Nearing, M.; Pruski, F.; O’neal, M. Expected Climate Change Impacts on Soil Erosion Rates: A Review. J. Soil. Water. Conserv. 2004, 59, 43–50. Available online: https://www.jswconline.org/content/59/1/43 (accessed on 4 March 2022).
  58. Zhang, Q.; Li, Y.Q. Climate Variation of Rainfall and Rain Day in Southwest China for Last 48 Years. Plateau Meteorol. 2014, 33, 372–383. (In Chinese) [Google Scholar]
  59. Ouellette, W. Soil Erosion Watch—A Bootstrapped Approach to Identify the World’s Degrading Soils. 2021. Available online: https://medium.com/soilwatch/soil-erosion-watch-a-bootstrapped-approach-to-identify-the-worlds-degrading-soils-21016f72f68d (accessed on 4 March 2022).
  60. Zhong, X.Z.; Zhang, S.; Wu, R.J.; Jing, Y.B.; Meng, L.L.; Zhou, T. Analysis on Dynamic Changes and Driving Forces of Soil Erosion in Tuojiang River Basin. Res. Soil Water Conserv. 2022, 29, 43–49+56. (In Chinese) [Google Scholar]
  61. Liu, L.; Liang, H.; Jiao, S.L.; Dai, A.N. The Study on Soil Erosion Sensitivity Based on Karst Watershed for GIS—Taking the Three Cha He Basion in Guizhou Province as an Example. J. Guizhou Norm. Univ. (Nat. Sci.) 2015, 33, 6. (In Chinese) [Google Scholar]
  62. MU, F.Y.; Yang, M.; Yu, Q.; Liu, Z.T. Risk Assessment of Natural Disasters on Highway Caused by Soil Erosion Based on RF-RUSLE Model: A Case Study of Banan District in Chongqing. J. Chongqing Jiaotong Univ. (Nat. Sci.) 2020, 39, 114. (In Chinese) [Google Scholar]
  63. Xue, J.P.; Lyu, D.W.; Wang, D.Y.; Wang, Y.M.; Yin, D.L.; Zhao, Z.; Mu, Z.J. Assessment of soil erosion dynamics using the GIS-Based RUSLE Model: A case study of wangjiagou watershed from the Three Gorges Reservoir Region, Southwestern China. Water 2018, 10, 1817. [Google Scholar] [CrossRef] [Green Version]
  64. Xiong, M.; Sun, R.; Chen, L. A Global Comparison of Soil Erosion Associated with Land Use and Climate Type. Geoderma 2019, 343, 31–39. [Google Scholar] [CrossRef]
Figure 1. (a) Location of the study area in China; (b) Elevation distribution of the study area.
Figure 1. (a) Location of the study area in China; (b) Elevation distribution of the study area.
Land 12 00639 g001
Figure 2. Land cover map of the study area.
Figure 2. Land cover map of the study area.
Land 12 00639 g002
Figure 3. Technical flowchart. (a) Soil erosion estimation of southeast China based on (R)USLE framework; (b) Soil erosion simulation using ML models; (c) Driving force analysis of soil erosion by using RF model. R: Rainfall Erosivity Factor; K: Soil Erodibility Factor; LS: Slope Length and Steepness Factors; C: Cover-Management Factor; P: Support Practice Factor; ML: Machine Learning; DT: Decision Tree; LR: Logistics Regression; SVM: Support Vector Machine; RF: Random Forest; GBRT: Gradient Boosting Regression Tree.
Figure 3. Technical flowchart. (a) Soil erosion estimation of southeast China based on (R)USLE framework; (b) Soil erosion simulation using ML models; (c) Driving force analysis of soil erosion by using RF model. R: Rainfall Erosivity Factor; K: Soil Erodibility Factor; LS: Slope Length and Steepness Factors; C: Cover-Management Factor; P: Support Practice Factor; ML: Machine Learning; DT: Decision Tree; LR: Logistics Regression; SVM: Support Vector Machine; RF: Random Forest; GBRT: Gradient Boosting Regression Tree.
Land 12 00639 g003
Figure 4. Spatial distribution characteristics of five soil erosion factors. (a) R: rainfall erosivity factor; (b) K: soil erodibility factor; (c) LS: slope length and steepness factors; (d) C: cover-management factor; (e) P: support practice factor.
Figure 4. Spatial distribution characteristics of five soil erosion factors. (a) R: rainfall erosivity factor; (b) K: soil erodibility factor; (c) LS: slope length and steepness factors; (d) C: cover-management factor; (e) P: support practice factor.
Land 12 00639 g004
Figure 5. Distribution of soil erosion intensity in the study area. (a) Sichuan; (b) Guizhou; (c) Chongqing.
Figure 5. Distribution of soil erosion intensity in the study area. (a) Sichuan; (b) Guizhou; (c) Chongqing.
Land 12 00639 g005
Figure 6. Accuracy evaluation of five machine learning models. DT, Decision Tree; LR, Logistics Regression; SVM, Support vector machine; RF, Random Forest; GBRT, Gradient Boosting Regression Tree.
Figure 6. Accuracy evaluation of five machine learning models. DT, Decision Tree; LR, Logistics Regression; SVM, Support vector machine; RF, Random Forest; GBRT, Gradient Boosting Regression Tree.
Land 12 00639 g006
Figure 7. The importance of different factors in three regions. (a) Sichuan; (b) Guizhou; (c) Chongqing.
Figure 7. The importance of different factors in three regions. (a) Sichuan; (b) Guizhou; (c) Chongqing.
Land 12 00639 g007
Figure 8. The PDP diagrams of each factor under different erosion intensity grades; (a) Cover-Management Factor; (b) Soil Erodibility Factor; (c) Support Practice Factor; (d) Slope Length and Steepness Factors; (e) Rainfall Erosivity Factor; 1 to 6 represents the soil erosion intensity grade is very light, light, moderate, strong, very strong, and extremely strong, respectively. The y-axis represents the influence of different factors on soil erosion; positive values represent promotion, and negative values represent inhibition). The X-axis represents the range of factors. The blue shaded area indicates the level of confidence. The points on the lines represent the concentrated range of different factors.
Figure 8. The PDP diagrams of each factor under different erosion intensity grades; (a) Cover-Management Factor; (b) Soil Erodibility Factor; (c) Support Practice Factor; (d) Slope Length and Steepness Factors; (e) Rainfall Erosivity Factor; 1 to 6 represents the soil erosion intensity grade is very light, light, moderate, strong, very strong, and extremely strong, respectively. The y-axis represents the influence of different factors on soil erosion; positive values represent promotion, and negative values represent inhibition). The X-axis represents the range of factors. The blue shaded area indicates the level of confidence. The points on the lines represent the concentrated range of different factors.
Land 12 00639 g008
Figure 9. Soil Erosion Hazard (Log A) Distribution of Chongqing City in 2020.
Figure 9. Soil Erosion Hazard (Log A) Distribution of Chongqing City in 2020.
Land 12 00639 g009
Table 1. Data sources.
Table 1. Data sources.
DataTypeSpatial ResolutionYearData Sources
Vector boundary of the study areashp2020http://www.resdc.cn (accessed on 4 March 2022)
Rainfall erosivity factor (R)txt2020http://data.cma.cn (accessed on 4 March 2022)
Soil erodibility factor (K)tif1000 m2009http://data.tpdc.ac.cn (accessed on 4 March 2022)
Slope length and steepness factors (LS)tif30 m2000NASA SRTM DEM
Cover-management factor (C)tif500 m2020MOD09GA
Support practice factor (P)tif500 m2020MCD12Q1
Table 2. Soil erosion grading standard [46].
Table 2. Soil erosion grading standard [46].
Soil Erosion Intensity GradesSoil Erosion Modulus/(102·t·km−2·a−1)
Very light<5
Light5–20
Moderate20–50
Strong50–80
Very strong80–150
Extremely strong>150
Table 3. Area and proportion of each erosion intensity grade and erosion modulus in three regions.
Table 3. Area and proportion of each erosion intensity grade and erosion modulus in three regions.
Erosion Intensity GradeSichuanGuizhouChongqing
Area
(104 km2)
Proportion
(%)
Area
(104 km2)
Proportion
(%)
Area
(104 km2)
Proportion
(%)
Very light32.69667.6815.38188.067.81595.44
Light4.2998.901.7409.960.3193.89
Moderate2.1524.450.2491.420.0350.42
Strong1.7503.620.0630.360.0110.13
Very strong2.5215.220.0270.160.0060.08
Extremely strong4.88910.130.0060.040.0030.04
Average annual soil erosion modulus (t·km−2·a−1)50.221.990.67
Table 4. Accuracy evaluation of five machine learning models in the three regions.
Table 4. Accuracy evaluation of five machine learning models in the three regions.
RegionDTLRSVMRFGBRT
Chongqing80.55%64.46%64.48%86.69%82.98%
Sichuan76.04%46.86%43.71%85.16%75.19%
Guizhou79.65%59.09%58.29%87.19%82.11%
Average78.75%56.80%55.49%86.35%80.09%
Table 5. Ranking of factor importance.
Table 5. Ranking of factor importance.
SichuanGuizhouChongqing
FactorPIFactorPIFactorPI
LS0.6165 ± 0.0029C0.4788 ± 0.0012C0.4672 ± 0.0042
C0.5757 ± 0.0013LS0.4455 ± 0.0023LS0.4259 ± 0.0075
R0.2422 ± 0.0016R0.1533 ± 0.0017R0.0876 ± 0.0023
K0.0404 ± 0.0005P0.0589 ± 0.0016K0.0268 ± 0.0010
P0.0028 ± 0.0001K0.0434 ± 0.0013P0.0183 ± 0.0012
Note: R: Rainfall Erosivity Factor; K: Soil Erodibility Factor; LS: slope Length and Steepness Factors; C: Cover-Management Factor; P: Support Practice Factor.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ge, Y.; Zhao, L.; Chen, J.; Li, X.; Li, H.; Wang, Z.; Ren, Y. Study on Soil Erosion Driving Forces by Using (R)USLE Framework and Machine Learning: A Case Study in Southwest China. Land 2023, 12, 639. https://doi.org/10.3390/land12030639

AMA Style

Ge Y, Zhao L, Chen J, Li X, Li H, Wang Z, Ren Y. Study on Soil Erosion Driving Forces by Using (R)USLE Framework and Machine Learning: A Case Study in Southwest China. Land. 2023; 12(3):639. https://doi.org/10.3390/land12030639

Chicago/Turabian Style

Ge, Yuankai, Longlong Zhao, Jinsong Chen, Xiaoli Li, Hongzhong Li, Zhengxin Wang, and Yanni Ren. 2023. "Study on Soil Erosion Driving Forces by Using (R)USLE Framework and Machine Learning: A Case Study in Southwest China" Land 12, no. 3: 639. https://doi.org/10.3390/land12030639

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop