1. Introduction
Deserts are defined by the China Scientific and Technical Nomenclature Review Committee as geographic landscapes with sparse vegetation formed under arid climatic conditions [
1]. In addition, one kind of special desert type, called “cold deserts”, is widely distributed in upper alpine or high-latitude subpolar zones due to physiological drought caused by alpine and low temperatures. Desert is defined as a kind of geographic landscape with sparse or bare surface vegetation formed under arid or alpine climatic conditions [
2]. Desert areas are characterized by scarce precipitation, arid climates, poor soil, and sparse vegetation [
3]. The unique structure and function of desert areas differ from those of other ecosystems, providing important ecological service values in terms of wind and sand control, soil conservation, carbon sequestration and oxygen release, cultural tourism, hydrological regulation, and maintenance of biodiversity and providing a material basis for the survival and development of people living in desert regions [
4,
5,
6]. However, the ecological environment of desert areas is fragile and vulnerable to environmental degradation and desertification due to external disturbances [
7,
8]. Moreover, different types of deserts have different causes and processes of formation, as well as different geomorphological conditions, vegetation conditions, soil characteristics, and exploitation and utilization, resulting in different environmental problems [
9,
10]. At present, desertification has affected one-fourth of the global land area and the livelihoods of nearly one billion people, causing severe environmental degradation and enormous economic losses and threatening the survival and development of human beings [
11,
12]. China is one of the countries most seriously affected by desertification in the world, with a desertified land area of approximately 2.6 million km
2, accounting for 27.20% of the country’s land area, and a population of nearly 400 million people affected by desertification, resulting in a direct economic loss of approximately RMB 54 billion [
13]. The cold and dry climate in most parts of the Qinghai–Tibetan Plateau (QTP) has created favorable conditions for the development of desertification [
14]. Desertification has become a serious problem that hinders the socioeconomic development of the plateau and threatens the ecological security of the region and even Asia [
15]. As one of the most serious environmental and ecological problems in the world, the United Nations Conference on Environment and Development (UNCED) included desertification in Agenda 21 as an important issue affecting the sustainable development of human society [
16]. Therefore, a comprehensive understanding of the distribution and change characteristics of different desert types is highly important for maintaining the function of deserts and preventing the further development of desertification.
The Qinghai–Tibetan Plateau is affected by aridity and low temperatures, and the deserts in the region are widely distributed and of various types [
17]. The selection of appropriate classification features for different deserts plays an important role in accurately extracting desert types in the study area. However, owing to the similarity of the spectral response in desert areas, accurately distinguishing desert types via only optical remote sensing data is difficult [
18]. In their study on desertification information extraction in the Aral Sea region, Song et al. noted that microwave backscattering is very sensitive to changes in the surface soil particle size, and this response is more obvious in arid areas with sparse vegetation [
19]. Zhang et al. also noted in their discussion of the desert classification system in alpine regions that terrain is an important factor affecting the distribution of water and heat, which is closely related to the distribution of deserts [
20]. In addition, some studies have shown that combining texture data with remote sensing data can improve the accuracy of image classification and object recognition [
21]. However, the efficiency of preprocessing large amounts of geographic data and remote sensing data via conventional methods is low. Google Earth Engine (GEE), an online processing platform that provides rich remote sensing data and machine learning algorithms internally, can realize the acquisition, processing, analysis, and application of data in a single unit, which can greatly improve the efficiency of image processing and has been widely used in the field of remote sensing classification [
22,
23]. However, the current application of machine learning classification algorithms to identify desert types and distributions via the GEE platform is still relatively rare. This study uses the GEE data processing platform to extract the QTP desert types, which is of great practical significance for quickly understanding the current status and changing characteristics of large desert areas.
Although machine learning algorithms are known for the advantages of strong generalization ability and high classification accuracy, they can also automate the processing of large amounts of data and have been shown to produce good classification results even when processing high-dimensional and complex data [
24,
25]. However, machine learning models are black-box models, and the relationship between the input features and model predictions is difficult to understand [
26]. To understand the decision-making process within a machine learning model and to explain the impact of input variables on the prediction results, an in-depth and systematic analysis of input features is necessary [
27]. Traditional feature evaluation methods, such as the mean decrease in impurity (MDI) and permutation-based feature importance measures, tend to assess the global importance of features and cannot analyze each prediction individually [
28,
29]. Decision tree-based machine learning classifiers (e.g., RF and GTB) internally provide an MDI-based feature importance metric, and most of the current research on classification feature importance is based on this approach [
28,
30]. However, this approach only evaluates the contribution of features to the overall model performance, and the unique contribution to different categories is not considered. SHAP is a game theory-based approach with the main goal of explaining the output of a black-box model by attributing the prediction results to different input features [
31]. SHAP provides both local and global explanations in terms of the model’s influence process and output results and allows not only the overall importance of features to be assessed but also the degree of contribution (magnitude and direction) of each feature to the model results. Therefore, the SHAP model effectively overcomes the limitations of traditional feature importance analysis methods [
32]. Owing to its strong theoretical foundation and rich visualization tools, it has been widely used in the field of natural and social sciences [
33]. The SHAP model has been applied to classify wetland plant communities in northeastern China, soil textures, igneous rocks, and urban land use [
34,
35,
36,
37]. These findings suggest that SHAP provides a more nuanced interpretation of the predicted results, increases the transparency of machine learning classification models, and helps to improve understanding and trust in the models. This study applies the SHAP model in combination with machine learning algorithms in QTP desert type extraction, aiming to clarify the contribution and importance of different classification features in the overall desert extraction and the identification of different desert types from both global and local perspectives.
The QTP is known as the third pole of the Earth and is more sensitive to climate change and human activity disturbances than other regions [
38]. Exploring the drivers of changes in desert dynamics is highly important for the prevention and control of land desertification and the protection and improvement of the ecological environment in the QTP. Many studies have shown that the dynamic change in deserts is a complex process that is influenced by both changes in natural factors and human activities [
39,
40]. Because of this complexity, previous studies have focused on analyzing the influencing factors of desertification from a single aspect of climate or human activity [
41]. Some studies suggested that climate change affects soil quality and vegetation cover, leading to land degradation and the development of desertification [
42,
43]. Other studies argued that anthropogenic factors such as grazing and wood-cutting lead to the expansion of desertification [
44,
45]. However, these studies did not consider both the impact of natural factors and human activities on desertification changes. Therefore, there has been a gradual increase in studies using correlation analysis, principal component analysis, and residual trend analysis methods to analyze natural and anthropogenic factors affecting desert change [
46,
47,
48]. However, these research methods cannot fully reflect the nonlinearity and complexity of the influencing factors and ignore the interaction between the factors [
49,
50]. The geodetector model is a spatial statistical method for detecting the spatial dissimilarity of geographic phenomena, which not only quantifies the importance of different factors in a geographic phenomenon but also explains the influence of the interaction between any two factors [
51]. Using the geodetector to analyze the driving factors of QTP desert change, quantify the importance of each factor on the dynamic changes in deserts, and reveal the influence of the interactions between the factors on the dynamic changes in deserts.
The main objectives of this study are to (1) compare the performance of different machine learning classification algorithms in QTP desert typing via the GEE platform; (2) implement the SHAP method to analyze the importance of different classification features from both global and local perspectives on the basis of the optimal classification model; and (3) quantify the drivers of changes in QTP desert dynamics from 2000 to 2020 via geodetector. The research results can provide support for QTP desertification control measures and ecological environment management.
4. Discussion
4.1. Machine Learning Algorithm Classification Performance
In this study, we have used the five different machine learning classification algorithms for the extraction of different desert types of QTP to select the optimal algorithm. The RF classifier reliably classifies the target by the prediction of the decision tree ensemble and has been widely used by virtue of its low sensitivity to missing values and unbalanced training data, as well as its good anti-noise and anti-overfitting abilities [
64]. By comparing the classification accuracy of each classification algorithm as well as the local detail portrayal, our study found that the RF has higher accuracy compared to other classification algorithms and also has a better effect on the portrayal of each desert type. This finding is in line with the results of several other studies. Pizarro et al. compared the performance of six machine learning methods, RF, CART, GTB, SVM, minimum distance, and naive Bayes, when applied to Andean ecosystem land cover classification via the GEE platform, and the results revealed that RF was the better method [
80]. Yang et al., in their study on the classification of land cover in the Qilian Mountains, noted that the classification accuracy of RF was higher than that of CART and SVM [
81]. Meanwhile, it has been reported that compared to other classifiers, the RF is less sensitive to parameter settings and better able to deal with a large number of features [
82], which may be one of the reasons why the RF displays a higher classification performance. Our study result was found when the overall classification results and local detail descriptions of different machine learning algorithms were compared: RF and GTB provided greater consistency in the extraction of desert types. This may be attributed to the similarity of the model structure between the RF and GTB classifiers, both of which are integrated algorithms based on decision trees. The CART classifier, although it is also a decision tree-based classification model, as a binary decision tree classifier, has a simpler structure than the RF and GTB; thus, its classification performance is relatively poor. In addition, our study shows that KNN and SVM display poorer performance in classifying desert types, which may be related to their classification principles and sample selection. KNN is a proximity algorithm, and its classification prediction is strongly influenced by neighboring samples [
83]. SVM classification is based on the idea that only training samples located on class boundaries can effectively discriminate between classes [
69]. In selecting the samples, we followed the principle of uniformity, which makes KNN and SVM susceptible to the influence of different samples, thus resulting in misclassification, and this effect is more obvious in regions with complex desert types (
Figure 3 region c and region d).
4.2. Impact of Classification Features on Desert Type Identification
The importance of classification features based on the SHAP model revealed that the terrain features of elevation and slope had the highest overall importance in the extraction of desert types. This is because terrain is an important factor affecting water and heat distributions and has an important impact on the spatial distribution of deserts through direct or indirect effects [
84]. Moreover, SAR data can reflect the dielectric properties of features, soil roughness, particle size, and other information, which is highly suitable for the research and monitoring of desert areas [
85]. Zhu et al. reported that due to the difference in the surface roughness of rocks between sandy plains and paleochannels, the inversion of SAR signals in desert areas also significantly differed [
86]. We found that VV and VH play important roles in identifying sandy deserts through the importance of each classification feature in the extraction of different desert types, which further confirms the applicability of radar data in the study of desert areas. GLCM is a commonly used method of statistical analysis of texture, which can reflect the uniformity of the gray distribution of the image, the depth of the texture grooves, as well as the degree of brightness and darkness of the image and other information [
87]. Based on this information, texture features can improve the distinguishing ability of “same object different spectrum” and “same spectrum foreign body” phenomena in images, thus improving the classification accuracy [
88]. In our study, due to the differences in the composition and terrain, the various desert types with similar spectra show different texture information. Thus, texture features play an important role in the classification feature. In addition, because shortwave infrared radiation is more sensitive to moisture changes [
89], the importance of shortwave infrared signals is greater than that of the near-infrared and visible light bands in the classification process, especially in the identification of saline deserts, which is second only to the importance of elevation and slope. This is the case because the saline deserts in the QTP are distributed mainly in the Qaidam Basin, where the climate is arid and evaporation is high, and salts accumulate on the surface because of the continuous evaporation of soil and surface water [
90]. Spectral indices based on combinations of spectral bands can further increase the spectral variability among different features and are now widely used to classify various types of features [
74]. However, in our study, we found that the commonly used vegetation indices were less helpful in classifying desert types, while the TGSI showed greater feature importance than the other indices. This is because all desert types have low vegetation cover, and the vegetation index is less sensitive to areas with low vegetation cover and therefore has some limitations when applied in arid desert areas [
91]. Additionally, since there is a significant difference in grain size between sandy deserts and gravelly deserts, the TGSI has relatively high importance in distinguishing between these two types of deserts.
4.3. Impact Factors on Desert Change
Desert area ecosystems are very fragile and sensitive to environmental changes and are susceptible to changes due to external disturbances. In this study, we analyzed the effects of 12 different types of factors on QTP desert change. The results of factor detection revealed that the terrain factor plays an important role in QTP desert change, and Slo has the highest q statistic value. This is because terrain factors (Ele, Slo) can directly influence surface runoff, soil erosion, and local microclimate conditions through the redistribution of hydrothermal conditions, thus exerting an important influence on desert development and change [
92,
93]. In addition, Chen et al. noted that the QTP, as the third pole of the earth, is the most sensitive region to climate change and is the most significant region of climate wetting in China [
53]. Precipitation directly affects changes in vegetation, which in turn has an important influence on the distribution of and changes in deserts. Our results also revealed that among climate factors, precipitation has the greatest influence on desert change. This finding is consistent with the results of the study of QTP desertification influencing factors by Cuo et al. [
94]. Vegetation is the result of the combined influence of terrain, climate, soil, and other factors, and the response of different vegetation types to climate change is spatially heterogeneous [
95]. Our study revealed that among environmental factors, vegetation type also has a relatively important influence on desert change.
In addition, animal husbandry is the traditional land use in the QTP area, and overgrazing leads to serious grassland degradation, land desertification, and other problems [
96,
97]. To restore and protect the environment, the Chinese government has implemented a series of ecological restoration projects, such as the development and protection of grasslands through the construction of fences, grazing bans, and rotational grazing, as well as soil and water erosion control and desertification control [
98,
99,
100,
101]. Ma et al. noted that after analyzing the ecological benefits generated by the management and protection measures in different areas of the QTP, the implementation of a series of ecological restoration projects in the QTP effectively curbed the trend of ecological degradation [
102]. Zhao et al. noted that a series of ecological restoration projects effectively promoted an increase in vegetation cover on the QTP [
103]. The results of geodetector-based research revealed that the actual livestock load in most areas of the QTP has been effectively controlled, and the detection of HF and ALCC factors has also had an important influence on changes in the QTP deserts. Moreover, the development of and changes in deserts are not only the result of a single factor but are affected by multiple factors, such as terrain, climate, the environment, and human activities [
49]. The results of the interaction analysis based on geodetector show that the interaction between any two factors is enhanced by bidirectionals or nonlinearly enhanced, which further confirms the above viewpoint.
4.4. Advantages and Limitations of This Study
In this study, we compared the performance of five different machine learning algorithms in desert type extraction, from which we selected the optimal classification method, which is highly important for effectively and quickly understanding the spatial distribution of QTP deserts. Subsequently, the importance of different features in the classification and their positive and negative impacts on different desert types were analyzed via the SHAP model from both global and local perspectives, which effectively compensated for the shortcomings of traditional importance feature assessment. In addition, the factors affecting desert change were quantified via geodetector, and the effects of interactions among the factors were analyzed, which is conducive to obtaining a comprehensive understanding of the dynamic process of desert change and has important reference value for QTP environmental protection and ecological restoration.
However, there are still some shortcomings in this study, such as the time range of Sentinel-1 data and the “salt and pepper effect” that exists in the pixel-based classification process, all of which exert a certain impact on the recognition of changes in deserts. In this study, although we fused patches smaller than 6 × 6 pixel size to the surrounding patches to minimize the influence of the “salt and pepper effect” on the QTP desert classification and change analysis, it may still have some influences for the results. Second, in order to maintain the temporal consistency of the images in each period as much as possible, we all use the July and August images for the median synthesis, but due to the large extent of the Qinghai–Tibetan Plateau, there will still be some temporal differences in the image information of different locations, which may have some impact on the results of the classification. In addition, due to the similarity in spectral response between salt crust and mild saline deserts, they are treated as one type in the classification process. However, there are large differences in the degree of salinity, and we will try to use more information and methods to differentiate them in subsequent studies. Moreover, owing to the limited distribution of meteorological stations in the QTP region, the interpolated data used for each meteorological element cannot fully reflect the actual meteorological conditions on site, which may affect the results of the geodetector. In addition, although we have optimized the discretization of continuous variables in the geodetector model, grid density still has an impact on the detection results. In the future, we will consider using deep learning algorithms and object-oriented classification methods to conduct studies about classification and related problems.
Despite these limitations, this study used SHAP and geodetector to deepen the understanding of desert distribution and change, which provided data support for desertification control on the QTP. In the prevention and control of desertification, it is important to pay attention to adjusting the land-use structure, reasonably allocating the structure of agricultural, animal husbandry, and industrial production, and preventing soil erosion and land desertification caused by human activities (e.g., returning sloping farmland to forests or grasslands, prohibiting and rotating grazing, and ecological protection and restoration of the mining areas). In addition, a series of ecological restoration projects (e.g., protection and restoration of vegetation, establishment of sand barriers, etc.) can be carried out in accordance with local conditions to improve soil quality and reduce the migration capacity of wind and sand, with the purpose of controlling the expansion of desertification.
5. Conclusions
This study compared the performance of five different machine learning classification algorithms (RF, CTB, CART, KNN, and SVM) for QTP desert type extraction to select the optimal algorithm. The results showed that the RF classifier achieved the best performance, with an overall accuracy of 87.11% and a kappa coefficient of 0.83. Meanwhile, by comparing the PA and UA of different desert types when applying different machine learning algorithms for classification and combining the results with the local details comparison, it is found that the RF has a better classification performance in recognizing most of the desert types (SD, GD, LD, RD, AC). Afterward, the SHAP method was used to analyze the importance of different classification features in the RF model classification process from both global and local perspectives. From the perspective of overall desert classification, the most prominent contributions are elevation and slope, followed by VV, VH, and GLCM. From the perspective of the impact of various classification features on the extraction of different types of deserts, the radar backscatter coefficient VV helps distinguish sandy deserts from other types of deserts; VH helps distinguish RD, AC, SD, and LD deserts; slope helps distinguish RD, AC, and other types of deserts; elevation plays a more prominent role in distinguishing alpine cold deserts from other types of deserts; and the short-wave infrared band SR_B7 has an important role in the identification of salt crusts and saline deserts. In addition, an analysis of the changes in the QTP desert from 2000 to 2020 revealed that the QTP desert area exhibited an overall reversal trend during the study period, with the proportion of desert area to the QTP decreasing from 28.62% to 26.20%. The area of the deserts decreased by 19,643.96 km2 from 2000 to 2010, and decreased by 47,311.51 km2 from 2010 to 2020. And the rate of QTP desert reversal accelerated significantly after 2010. The reversal of deserts mainly comes from the transformation of GD and LD to non-deserts, which are mainly distributed in the northwestern part of the Qiangtang Plateau, the northeastern part of the Qaidam Basin, and those places around the salt lakes in the basin. Research results based on geodetector reveal that the key factors affecting desert change include Slo, Pre, Ele, VT, and HF. The interaction results among various factors indicate that changes in the QTP desert are influenced by multiple factors, such as terrain, climate, environment, and human activities. The interactions among various factors play important roles in changes in the QTP desert through bidirectional enhancement or nonlinear enhancement.