Next Article in Journal
Heavy Metal Spatial Variation Mechanism and Ecological Health Risk Assessment in Volcanic Island Soils: A Case Study of Weizhou Island, China
Previous Article in Journal
Source and Ecological Risk Assessment of Potentially Toxic Metals in Urban Riverine Sediments Using Multivariate Analytical and Statistical Tools
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Enhanced Landslide Susceptibility Assessment in Western Sichuan Utilizing DCGAN-Generated Samples

1
Chongqing Jinfo Mountain Karst Ecosystem National Observation and Research Station, School of Geographical Sciences, Southwest University, Chongqing 400715, China
2
School of Geographical Sciences, Southwest University, Chongqing 400715, China
3
Chongqing Engineering Research Center for Remote Sensing Big Data Application, Southwest University, Chongqing 400715, China
4
Key Laboratory of Optoelectronic Technology and Systems of the Education Ministry of China, Chongqing University, Chongqing 400044, China
*
Author to whom correspondence should be addressed.
Land 2025, 14(1), 34; https://doi.org/10.3390/land14010034
Submission received: 15 November 2024 / Revised: 12 December 2024 / Accepted: 26 December 2024 / Published: 27 December 2024

Abstract

:
The scarcity of landslide samples poses a critical challenge, impeding the broad application of machine learning techniques in landslide susceptibility assessment (LSA). To address this issue, this study introduces a novel approach leveraging a deep convolutional generative adversarial network (DCGAN) for data augmentation aimed at enhancing the efficacy of various machine learning methods in LSA, including support vector machines (SVMs), convolutional neural networks (CNNs), and residual neural networks (ResNets). Experimental results present substantial enhancements across all three models, with accuracy improved by 2.18%, 2.57%, and 5.28%, respectively. In-depth validation based on large landslide image data demonstrates the superiority of the DCGAN-ResNet, achieving a remarkable landslide prediction accuracy of 91.31%. Consequently, the generation of supplementary samples via the DCGAN is an effective strategy for enhancing the performance of machine learning models in LSA, underscoring the promise of this methodology in advancing early landslide warning systems in western Sichuan.

1. Introduction

Landslides represent a prevalent form of geological hazard, characterized by their extensive destructiveness and widespread occurrence. They constitute a significant threat to public safety, inflicting substantial socioeconomic losses [1]. Western Sichuan is affected by the uplift and compression of the Qinghai–Tibet Plateau, as well as the erosion of water systems. The geological structure is complex and active, leading to frequent earthquakes. Landslides, as one of the secondary geological hazards triggered by strong earthquakes, constitute the predominant geological hazard type in western Sichuan [2,3]. Landslide susceptibility assessment (LSA) involves analyzing the spatial distribution of historical landslides along with environmental factors, such as topography, geomorphology, geological formations, and hydrometeorology, to predict the risk of landslides in the study area [4]. Validating landslide occurrence mechanisms and conducting precise landslide susceptibility predictions serve as a crucial reference basis for disaster prevention and engineering mitigation.
Numerous scholars have conducted research on regional landslide susceptibility, with methodologies evolving from qualitative analysis to quantitative and intelligent assessment. The quantitative methods can be classified into deterministic and non-deterministic approaches. Deterministic methods employ geomechanical models alongside spatial geological information to predict the probability of landslide occurrences in specific regions, such as the Shallow Landslide Stability Model (SHALSTAB) proposed by Dietrich et al. [5] and the Stability Index Map (SINMAP) proposed by Pack et al. [6]. Non-deterministic methods are divided into knowledge-driven methods based on expert experience and data-driven methods based on data analysis. Their critical disparity lies in the determination of factor weights [7].
Since the 1990s, the field of artificial intelligence has undergone significant development, witnessing the emergence of various machine learning models such as random forests (RFs), support vector machines (SVMs), artificial neural networks (ANNs), and convolutional neural networks (CNNs). These models have showcased robust data mining capabilities, adept at handling large volumes of heterogeneous data from diverse sources. Their wide applications span across fields such as geological hazard research, medical research, economics, etc. In recent years, various machine learning methods have been applied to LSA; Abdelaziz and Bahare, respectively, conducted literature reviews and studies based on landslides in Iran, comparing various algorithm models, and both achieved favorable results [8,9]. Compared to traditional LSA methods, deep neural networks extract features from various influencing factors in landslide areas more efficiently, thus exhibiting superior prediction accuracy [10,11]. Nevertheless, they also require a larger amount of data samples for training [12]. Hussin et al. found that when limiting the amount of training data below a certain threshold, the model showed significantly poor performance [13]. Numerous studies demonstrated that methods such as geometric transformations, color space transformations, and adding noise can partially address the issue of sample scarcity [14,15,16]. However, research by Shorten and Khoshgoftaar suggests that these methods pose risks of losing crucial information and contaminating the dataset, consequently leading to a decline in model performance [17]. The emergence of generative adversarial networks (GANs) has provided a new way to address these challenges [18]. Gu et al. introduced an enhanced SinGAN method for generating remote-sensing images [19]. Their study verified that the generated images exhibited high structural similarity and logical consistency with real images. These images effectively serve as training data for the model, yielding satisfactory results. Husam and Biswajeet generated additional landslide samples for training through GAN and demonstrated that additional samples can effectively improve the performance of LSA models constructed using ANN and RF [20].
GANs show great potential in addressing the issue of insufficient training samples. However, GANs suffer from training difficulty and instability [19,21]. Therefore, this study proposes a data augmentation technique built upon a DCGAN, leveraging its convolutional structures for enhanced stability and expedited convergence during training. Subsequently, the augmented samples are used to improve the effectiveness of machine learning methods in landslide sensitivity assessment [21]. The remaining sections of this article are organized as follows: Section 2 introduces the overview of the study area and the sources of data. Section 3 describes the experimental process, highlighting the machine learning models, LSA modeling methodologies, and the evaluation metrics of these models. Then, Section 4 presents the results of each step in the experimental process. Moreover, Section 4 verifies the feasibility of enhancing the performance of LSA models through data augmentation using a DCGAN. Section 5 discusses the environmental factors and additional sample numbers. Finally, Section 6 summarizes this article.

2. Materials

2.1. Study Area

The western Sichuan region is shown in Figure 1a, and it is situated in the transitional zone between the eastern edge of the Qinghai–Tibet Plateau and the Chengdu Plain. In terms of administrative divisions, the study area includes Ya’an City, Panzhihua City, Aba Prefecture, Garze Prefecture, and Liangshan Prefecture. The terrain of the study area is rugged, with elevations ranging from 300 to 7500 m (shown as Figure 1b). The topography exhibits significant variations, mainly consisting of mountainous and hilly areas, transitioning gradually from plain areas in the east to hilly regions and high mountain valleys in the west. Under the influence of topography, the study area is characterized by vertical climatic zones. In the valley areas, the climate is mainly subtropical, warm–temperate, and temperate, while in the mountainous plateau regions, it is primarily sub-frigid and frigid. Annual rainfall is unevenly distributed, primarily concentrating between May and October, with precipitation ranging from 400 to 1400 mm [22]. Its spatial distribution shows a decreasing trend from east to west. The geological structure of the western Sichuan region is shown in Figure 1c, with stratigraphic fault zones distributed both vertically and horizontally; the predominant rock types in the study area include clastic rock, metamorphic rock, continental deposit, carbonatite, and basalt [22]. Among them, sandstone, mudstone, and siltstone within the clastic rocks collectively form the easily erodible and prone-to-landslide strata [23].
Affected by natural and anthropogenic factors, geological disasters such as collapses, landslides, and debris flows frequently occur in western Sichuan. These disasters are characterized by their widespread occurrence, extensive impact, and rapid onset, causing severe losses to the safety of people’s lives and property. Therefore, western Sichuan is an ideal region for validating LSA methods [24].

2.2. Inventory of Landslides

The historical landslide point data catalog was established based on survey data of historical geological disasters in Sichuan and combined with remote sensing images. A total of 3515 historical landslide points was identified in western Sichuan (shown as Figure 1b). After filtering out repeated occurrences of landslides at the same location and consolidating multiple landslides within small areas, 2000 historical landslide points were selected as the base data for the study. Among them, small-to-medium-sized landslides prevail, accounting for 95% of the total number of landslides. The patterns of landslides include sliding–cracking, creeping–shearing, and tilting–fracturing [22]. In addition, in LSA, the selection of negative samples significantly influences the prediction results. Negative samples are typically selected from areas outside of the landslide region randomly, making it difficult to ensure their reliability and consequently leading to a decrease in model performance [25]. Therefore, efforts should be made to select sample points from areas with lower landslide risk. This study utilizes the information value (IV) model to select non-landslides by analyzing the spatial distribution of landslides and their relationship with influencing factors, thereby quantifying the contribution of each factor to landslide occurrence [26]. The methodology involves reclassifying the factors, calculating the proportion of landslide and non-landslide pixels within each category, and determining the IV for each category. Subsequently, the IVs of all factors are aggregated and reclassified to produce a preliminary landslide risk map for the study area. Finally, a landslide dataset was constructed by randomly selecting non-landslide samples from areas with low landslide risk at a 1:1 ratio, as shown in Figure 2.
The validation dataset was constructed based on the aerial image interpretation dataset of landslides in Sichuan and surrounding areas published by Zeng et al., as well as high-definition historical imagery from Google Earth [27]. Thirteen large landslide events that occurred in western Sichuan from 2017 to 2022 were identified based on information gathered from the relevant literature and news sources.

2.3. Landslide Factors

Selecting landslide factors is crucial for constructing LSA models, typically encompassing terrain, land cover, hydrometeorological conditions, engineering geology, and human activities. Their selection is often based on the relationship with historical geological hazards.
In the context of landslide triggers in western Sichuan, a total of 18 factors were chosen as inputs for the LSA models. The data sources are shown in Table 1 [28,29,30]. All data were resampled to a spatial resolution of 30 m, and continuous variables such as elevation, slope, and the NDVI were reclassified using the Jenks natural breaks classification method (shown in Figure 3) [31].
Topographic factors: Topographic factors include elevation, aspect, plan curvature, profile curvature, slope, relief amplitude, the topographic wetness index (TWI), and the landform. Except for the landform, which is obtained from the national 1:200,000 digital geological map, the remaining factors are calculated from 30 m resolution elevation data. These factors play an essential role in the development of landslides. For example, elevation and aspect have important control functions. Elevation differences can lead to variations in vegetation and potential energy within a region. Moreover, different aspects are affected by different sunlight levels, resulting in spatial differences in soil moisture, thereby influencing the development of landslides. The frequency ratio (FR) analysis of partial typical factors indicates that when the elevation is below 2700 m, the landslide frequency consistently exceeds 1 (Table 2), suggesting a higher likelihood of landslide occurrences within this elevation range [32].
Land cover and hydrometeorological factors: Land cover and hydrometeorological factors have important influences on landslide occurrence, including the stream power index (SPI), sediment transport index (STI) [33], distance to rivers, precipitation, land use, and the normalized difference vegetation index (NDVI). For example, the NDVI reflects the land cover types and vegetation growth status in the study area. Regions with dense vegetation tend to have more stable soil, resulting in lower landslide probability. Conversely, factors such as precipitation and watercourses increase soil moisture and slope erosion, leading to decreased shear strength of the soil and slope stability. In the study area, regions with NDVI values ranging from 0.5 to 0.7 have frequency ratios over 1. Similarly, areas with precipitation exceeding 750 mm and located within 3 km of rivers also have higher frequency ratios than others. This indicates that landslides are more likely to occur in these environments.
Engineering geology and human activities: The frequency and magnitude of landslides vary depending on the geological formations present. Historical records of geological hazards in western Sichuan indicate that landslides and collapses are particularly prevalent in debris rock layers primarily composed of sandstone and mudstone. Moreover, the construction of roads can exacerbate land degradation and soil displacement, consequently increasing slope instability. Notably, within a 1.5 km radius of roads in the study area, the frequency of landslides is significantly higher compared to other regions.

3. Methods

In this study, additional samples were generated using a DCGAN to augment the dataset, facilitating LSA in western Sichuan through an SVM, CNN, and ResNet. The experimental procedure is outlined as follows (refer to Figure 4): (1) compilation of the historical landslide inventory in the study area and acquisition of 18 fundamental environmental factors. (2) Conducting multicollinearity analysis and employing GeoDetector to eliminate irrelevant factors associated with landslides. Principal component analysis (PCA) was utilized to mitigate redundant information and streamline model complexity. (3) Expansion of the landslide dataset via a DCGAN, followed by the training of LSA models (SVM, CNN, ResNet) using both the original and augmented datasets. (4) Generation of landslide susceptibility prediction maps for western Sichuan using the trained models, followed by validation using large-scale landslide data.

3.1. VIF Analysis and GeoDetector

This study quantitatively evaluates the contributions of 18 landslide factors to landslides by calculating the variance inflation factor (VIF) and GeoDetector (GeoDetector_2015_example in Excel). The VIF value quantifies the degree to which the variance of a regression coefficient is inflated due to correlations among independent variables. A higher VIF value and a lower tolerance (TOL) value indicate higher multicollinearity for the factor. The GeoDetector, proposed by Wang et al. [34], is a statistical method based on spatial autocorrelation theory. Its core idea is that if a factor has a significant impact on a geographical event, then there will be a similarity in the spatial distribution of the factor and the geographical event. In this study, the differentiation and factor detection models of the geographic detector are applied to quantitatively analyze the strength of factors in explaining spatial differentiation of events [35,36]. Factor detection is measured using the q statistic, which is in the range of [0, 1]. A higher q-value indicates a stronger explanation of the factor for landslides.

3.2. Data Augmentation Based on DCGAN

A DCGAN, proposed by Radford et al., represents a generative adversarial network incorporating deep convolutional structures [37]. It is employed for unsupervised learning and the generation of similar images. A DCGAN amalgamates CNN architecture into traditional GANs with specific modifications (refer to Figure 5). Notably, the generator and discriminator dispense with the pooling layers of the CNN, opting instead for batch normalization (BN) layers following each convolution (transpose convolution) operation. The model is described by the following mathematical equation [21]:
min   G max D V D , G = E x ~ P data x log   D x + E z ~ P z z log 1     D ( G ( z ) )
where min   G max D V D , G represents the convergence direction of the generator and discriminator, x denotes real samples, and G ( z ) represents samples generated from random vectors. P data and P z denote the real data distribution and the generated data distribution identified by the discriminator. The ultimate objective of the model is to find min G that produces synthesized data resembling real data.

3.3. LSA Machine Learning Models

3.3.1. Support Vector Machine (SVM)

An SVM is a machine learning algorithm rooted in statistical theory, designed to minimize structural risk [38]. Within the field of landslide susceptibility research, SVMs find extensive application, particularly with the radial basis function (RBF) kernel, favored for its ability to explore complex, nonlinear mappings [39]. In this study, RBF-SVM is employed as one of the LSA models. The performance of RBF-SVM is influenced by parameters such as the kernel width parameter γ and the regularization parameter C. We utilize the original dataset as input and identify the optimal parameters through grid search [40,41], in which γ = 0.01, C = 1.

3.3.2. Convolutional Neural Network (CNN)

A CNN is a widely used feed-forward neural network. In this study, a CNN is selected to construct an LSA model, exploring the relationship between landslides and environmental factors. The model maps the probability of landslide occurrence onto the interval [0, 1], treating the task as a regression problem to achieve accurate landslide prediction [42]. The CNN architecture includes three convolutional layers and BN layers for training. A maximum pooling layer is inserted between the convolutional layers to filter redundant information. Finally, a fully connected layer is employed to propagate features and adjust them to the appropriate size. The output layer employs 2 neuron outputs to represent landslide (output of 1) and non-landslide (output of 0) categories [43,44].

3.3.3. Residual Neural Network (ResNet)

A ResNet is an improved convolutional neural network proposed by He et al. [45] that addresses the problem of gradient disappearance by introducing the concept of residual learning. The residual mapping layer is superimposed onto the shallow network to create the residual block through skip connections. This innovation significantly improves accuracy and mitigates gradient loss [46]. The output of the function mapping layer can be expressed as [23]
y = H ( x , W i ) + x
where x and y refer to the input and output of the residual block, respectively. H ( x , W i ) is the residual mapping parameter. In this study, the ResNet18 model with 4 stages and a total of 8 base residual blocks is employed.

3.4. Evaluation Indicators

To evaluate model performance, five metrics are calculated based on the confusion matrix, including Accuracy (ACC), Precision (PRE), True Positive Rate (TPR), F1-score, and the Matthews Correlation Coefficient (MCC). Among them, ACC, PRE, and TPR fall within the range [0, 1], with values closer to 1 indicating superior performance. These metrics can be calculated as follows:
ACC = TP + TN TP + TN + FP + FN
PRE = TP TP + FP
TPR = TP TP + FN
F 1 score = 2   ×   PRE   ×   TPR PRE + TPR
MCC = TP   ×   TN FP   ×   FN TP + FP TP + FN TN + FP TN + FN
Furthermore, the performance of classification models can be evaluated using receiver operating characteristic (ROC) curves, where the horizontal axis represents the false positive rate (proportion of correctly predicted non-landslide samples) and the vertical axis represents the true positive rate (proportion of correctly predicted landslide samples). The ROC curve reflects the classification effectiveness of the model, quantified by the area under the curve (AUC). A higher AUC value indicates a higher classification accuracy of the model.

4. Results

4.1. Landslide Factor Screening

The multicollinearity among initially selected landslide factors can distort the LSA model during training, leading to convergence difficulties or random predictions. Additionally, an excess of landslide factors inevitably introduces redundant information, thereby affecting the accuracy of predictive results. The VIF results of landslide factors are presented in Table 3, where VIF values closer to 10 indicate more severe collinearity. Specifically, the VIF values of the STI and slope factors are 9.733 and 11.081, respectively, indicating collinearity among these factors. Consequently, the STI and slope factors are excluded from subsequent analyses. The q-value results are shown in Figure 6. Factors such as the SPI, profile curvature, the TWI, plan curvature, and aspect have q-values less than 0.01, indicating a weak explanation for landslide events and thus are considered redundant information. Consequently, a total of 11 factors are selected for subsequent LSA model construction.

4.2. Quality Analysis of Additional Landslide Samples

This study augmented the landslide dataset to enhance the performance of three landslide susceptibility models, namely an SVM, CNN, and ResNet18. The model hyperparameters are shown in Table 4. When employing generated samples for LSA, an excessive number of additional samples may lead to issues such as indistinct features or uniform sample characteristics, potentially resulting in decreased model performance. Therefore, the optimal number of samples needs to be determined. In this study, six augmented datasets were generated by the DCGAN, with the number of additional samples increasing incrementally from 100 to 600 (100, 200, 300, 400, 500, 600). Subsequently, predictions were made using LSA models trained on these datasets. Training accuracy, testing accuracy, and the AUC were utilized as evaluation metrics to assess the accuracy and classification capabilities of the models, aiming to determine the optimal number of additional samples. The experimental results are as follows (Figure 7):
(1)
When the number of additional samples is 100, 200, and 300, the testing accuracies of the SVM are improved by 2.56%, 2.66%, and 2.3%, respectively. However, those of the CNN and ResNet show no significant changes, and there is a difference of approximately 5% to 10% between the accuracy of the training and testing datasets, indicating the presence of overfitting.
(2)
With 400 and 500 additional samples, the testing accuracies of the SVM are improved by 1.88% and 2.18%, respectively. The testing accuracies of the CNN are improved by 1.54% and 2.57%, respectively. Moreover, those of the ResNet are improved by 3.37% (400) and 5.28%, respectively.
(3)
When there are 600 additional samples, compared to 200 additional samples, the testing accuracy of the SVM is decreased by 2.31%. However, it is improved by 0.35% compared to the original dataset. Additionally, compared to 500 additional samples, the testing accuracies of the CNN and ResNet were decreased by 2.11% and 4.3%, respectively, although compared to the original dataset, they were improved by 0.46% and 0.98%, respectively.
The simple structure of the SVM determines that it does not require a large amount of data during training, and it is less sensitive to the number of additional samples. Consequently, the improvement in prediction accuracy compared to the original dataset is always limited. On the other hand, deep neural network models like the CNN and ResNet can leverage larger datasets more effectively during training, enabling them to better explore the underlying relationships within the data. Therefore, as the number of additional samples increases, the performance of these models also improves. However, the utilization of pseudo samples generated from real samples may lead to the dataset becoming less diverse, thus reducing its quality. According to the test result, both the CNN and ResNet show significant improvements when training with 400 and 500 additional samples. Specifically, in the case of 400 additional samples, the CNN exhibits the best landslide classification performance (AUC = 92.42%), while the ResNet performs best with 500 additional samples (AUC = 93.61%). Taking all factors into account, when there are 500 additional samples, the SVM, CNN, and ResNet show the greatest improvement. Therefore, the final LSA models are trained on datasets with 500 additional samples in this study.

4.3. LSA Results

4.3.1. Effectiveness of LSA Machine Learning Models

The quantitative evaluation results of the LSA models trained with the original dataset and the DCGAN-augmented dataset (500 additional samples) are shown in Table 5. On the original dataset, the SVM performs poorly with an accuracy of 76.08%, while both the CNN and ResNet achieve similar accuracies of over 86%, demonstrating excellent performance in landslide prediction. Based on comprehensive evaluation metrics, the SVM shows the lowest performance. The CNN has a lower PRE value than the ResNet but higher TPR, F1-score, and MCC values than the ResNet. These results suggest that the CNN achieves superior overall prediction accuracy, albeit with a tendency to misclassify some non-landslides as landslides. From a landslide prevention perspective, the prediction pattern of the CNN can effectively reduce the risk of landslides in densely populated and built-up areas, while the prediction pattern of the ResNet is more suitable for sparsely populated and high-altitude mountainous areas.
Training with the DCGAN-augmented dataset substantially enhances the performance of all LSA models, with the ResNet demonstrating optimal performance. Compared to training on the original dataset, the ACC of the CNN is increased by 2.57%, while other metrics are increased by 0.03 to 0.05, indicating higher overall landslide prediction accuracy and greater sensitivity in identifying landslides and non-landslides. The improvement of the ResNet is more significant, with an ACC reaching 91.78%, marking an impressive increase of 5.28%. Moreover, metrics such as PRE, TPR, and F1-score all surpass 0.9, and the MCC is increased by approximately 0.1. The results suggest that training the ResNet model with the DCGAN-augmented dataset leads to enhanced overall prediction accuracy and reduced misclassification of landslides and non-landslides.
The ROC curve reflects the model’s prediction accuracy and error rate (Figure 8). In the training of the original dataset, the classification performance of the CNN (AUC = 80.23%) is lower than the SVM (AUC = 82.17%) and ResNet (AUC = 81.09%), despite the CNN having the highest accuracy. However, the SVM exhibits a more stable landslide classification. With an additional 500-sample dataset, both the CNN (AUC = 91.01%) and ResNet (AUC = 93.61%) show significant improvements in classification performance. Compared to training on the original dataset, the CNN’s AUC increases by 10.78%, while the ResNet demonstrates the best classification performance, with an AUC increase of 12.52%.

4.3.2. Analysis of Landslide Susceptibility Mapping (LSM)

The CNN and ResNet, which perform better, were selected as classifiers for LSM. Figure 9 shows the LSM results in western Sichuan based on the models trained with the original dataset and DCGAN-augmented dataset, as well as detailed maps. For the original LSM results, the distributions of landslide-prone areas classified by the two LSA models are generally similar, with high-risk areas concentrated in the eastern and southern parts of the study area, and the probability of landslides decreasing as the altitude increases towards the west. However, an in-depth analysis of regions (a) and (b), where landslides are more frequent, reveals that the CNN-predicted map shows a larger area of very high landslide probability compared to the ResNet. The CNN shows a tendency to over-predict the landslides, indicating a greater inclination to classify some moderate and high-risk areas as very high-risk areas. Region (a) is characterized by high-altitude mountains, canyons, and grasslands with sparse populations, resulting in limited economic losses from landslides. In landslide prevention and control in this area, prediction from the ResNet is more feasible. In contrast, region (b) encompasses several cities characterized by low altitudes and dense populations. As mentioned earlier, the “over-prediction” of the CNN is more in line with the practical need for landslide prevention in these areas. According to the LSM results using the models trained with the DCGAN-augmented dataset, the distribution of risk areas still follows the trend of high-risk areas being concentrated in the eastern and southern parts of the study area, which are characterized by low altitudes and high human activity. As the altitude increases towards the west, the probability of landslides decreases. Compared to the original LSM results, the main difference in predictions lies in the very low-risk areas. Combining the percentage of landslide sensitivity zones depicted in Figure 10, it is observed that the proportion of very low-risk areas predicted by the DCGAN-ResNet is significantly higher. Additionally, compared to the ResNet model trained on the original dataset, the predicted moderate-risk areas of the DCGAN-ResNet model are decreased noticeably.

4.3.3. Validation Based on Large-Scale Landslide Images

This study validated the LSM results by visually interpreting 13 large landslide areas that occurred in western Sichuan from 2017 to 2022 with Google’s historical high-resolution imagery and high-resolution aerial orthophotos. Table 6 presents the prediction results of these 13 landslides based on area statistics, with six major landslides shown in Figure 11. It is evident that in the validation of the original LSA results, the CNN is significantly superior to the ResNet. When the model predicts landslide areas as high and very high-risk areas, it is considered a successful prediction. The success rate of the CNN prediction reached 87.76%, which is 7.38% higher than the ResNet. Particularly, in the case of the Jiuzhaigou landslide group, while the CNN successfully predicted the majority of landslide areas, the ResNet classified most of them as moderate-risk areas. The validation of the DCGAN-LSA results led to the following conclusions: (1) The DCGAN-CNN shows a decrease of 4.13% in the prediction success rate. However, there is a 1.9% increase in its ability to predict landslide areas as very high-risk areas. (2) The DCGAN-ResNet exhibits a significant improvement compared to the ResNet, with a prediction success rate of 91.31%, representing an increase of 10.93%. Additionally, the DCGAN-ResNet shows greater sensitivity in landslide recognition, successfully predicting 66.52% of the areas with occurred landslides as very high-risk areas, making it the best performer among the four models.

5. Discussion

In landslide susceptibility modeling, eliminating irrelevant factors helps reduce model complexity and improve training efficiency. After factor selection, it was found that factors such as slope and the STI have high multicollinearity, while factors like aspect, curvature, the SPI, and the TWI have low contributions for landslide events. Therefore, both types of factors should be excluded in the subsequent modeling process. Historical landslides occur more frequently in low-altitude areas with high precipitation and frequent human activities. The prediction results of the CNN and ResNet are also similar to the distribution of historical landslides, with landslide risk significantly higher in urban areas with dense populations at low altitudes compared to other areas.
The evaluation of original LSA models and DCGAN-LSA models indicates that the DCGAN can effectively enhance the performance of LSA models. When trained with the augmented dataset, the SVM, CNN, and ResNet all exhibit improved accuracy. Furthermore, deep neural networks are sensitive to the quantity of additional samples, with a shortage leading to overfitting and an excess resulting in underfitting. Therefore, before incorporating additional samples, it is essential to consider the number of samples as a model parameter and conduct repeated testing to determine the optimal outcome.
In the validation process of LSA mapping results, traditional validation methods typically involve cataloging a portion of landslide points separately when constructing the dataset. These points are excluded from model training and testing, serving solely for the final validation. However, this paper argues that the primary application of LSA is to predict and mitigate future landslide risks in the region. Therefore, in the final validation phase, accuracy is evaluated by conducting a zonal area analysis using recent landslide surface data from the past few years. This approach aims to accurately assess the practical performance of the LSA model in real-world applications.
This study demonstrates the viability of supplementing training samples with a DCGAN to enhance the performance of LSA machine learning models, addressing the issue of insufficient samples. However, there are still limitations in this research. (1) Obtaining landslide data for certain areas may be challenging, potentially leading to omissions in the analysis of landslide factors. (2) While generating synthetic samples with the DCGAN, some unrealistic samples were observed. Although manual screening was performed to remove them during the experiments, there is still potential for improvement in the application of DCGANs for generating landslide samples. The focus should primarily be on improving the trainability and stability of GAN models when learning multi-factor landslide data, with the goal of achieving high-quality and efficient landslide sample generation to better address the issue of insufficient data in LSA.

6. Conclusions

In the traditional LSA model construction process, when the data are insufficient, it is common to either manually select input data to make the features more distinct or use improved algorithm models to accommodate small sample data in order to enhance the model’s prediction accuracy [47,48]. This study introduces a DCGAN-based data augmentation method to enhance the efficacy of three machine learning methods for LSA in western Sichuan. Experimental results indicate that (1) the SVM, CNN, and ResNet demonstrate the ability to extract the intrinsic connections among various landslide factors, achieving satisfactory classification performance. (2) The DCGAN-based data augmentation method is effective in enhancing the performance of various machine learning models for LSA. It not only increases sample diversity but also helps the model better capture the potential distribution of landslides, thus improving the generalization capability of the LSA models. Notably, the CNN and ResNet are sensitive to the quantity of additional samples. (3) Training with the augmented dataset, the SVM and CNN show relatively modest improvements. The success rate of CNN predictions is decreased during validation. However, the ResNet exhibits significant enhancement.

Author Contributions

All authors contributed to the study conception and design. Y.T. is mainly responsible for the methodology, formal analysis, resources, and original draft writing; H.L. is mainly responsible for project administration and reviewing and editing the paper; Z.Q. is mainly responsible for software and resources; H.X. is mainly responsible for validation and data curation; X.Z. is mainly responsible for software. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Postdoctoral Innovation Talents Support Program of Chongqing under grant no. CQBX202322, the Postdoctoral Fellowship Program of CPSF under grant no. GZC20232172, and the National Natural Science Foundation of China under grant 42301102.

Data Availability Statement

The datasets analyzed during the current study are available in the Geovis Earth Open Platform [https://datacloud.geovisearth.com/common-products] (accessed on 14 April 2023).

Acknowledgments

F. A. thanks Songwei Gu, Rui Zhang and Yinghong Jing, who provided valuable revisions of the study to improve the article.

Conflicts of Interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

References

  1. Fan, X.M.; Scaringi, G.; Korup, O.; West, A.J.; van Westen, C.J.; Tanyas, H.; Hovius, N.; Hales, T.C.; Jibson, R.W.; Allstadt, K.E.; et al. Earthquake-Induced Chains of Geologic Hazards: Patterns, Mechanisms, and Impacts. Rev. Geophys. 2019, 57, 421–503. [Google Scholar] [CrossRef]
  2. Liu, S.G.; Yang, Y.; Deng, B.; Zhong, Y.; Wen, L.; Sun, W.; Li, Z.W.; Jansa, L.; Li, J.X.; Song, J.M.; et al. Tectonic evolution of the Sichuan Basin, Southwest China. Earth-Sci. Rev. 2021, 213, 103470. [Google Scholar] [CrossRef]
  3. Zhou, S.Y. Seismicity simulation in Western Sichuan of China based on the fault interactions and its implication on the estimation of the regional earthquake risk. Chin. J. Geophys.-Chin. Ed. 2008, 51, 165–174. [Google Scholar] [CrossRef]
  4. Chae, B.G.; Park, H.J.; Catani, F.; Simoni, A.; Berti, M. Landslide prediction, monitoring and early warning: A concise review of state-of-the-art. Geosci. J. 2017, 21, 1033–1070. [Google Scholar] [CrossRef]
  5. Dietrich, W.E., Jr.; Dunne, T. Sediment Budget for a Small Catchment in Mountainous Terrain; Routledge: London, UK, 1978. [Google Scholar]
  6. Tarboton, D.; Goodwin, C. The SINMAP approach to terrain stability mapping. In Proceedings of the 8th Congress of the International Association of Engineering Geology, Vancouver, BC, Canada, 21–25 September 1998; pp. 21–25. [Google Scholar]
  7. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  8. Abdelaziz, M.; Ali, P.Y.; Jie, D.; Jim, W.; Binh, T.; Dieu, T.B.; Ram, A.; Boumezbeur, A. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
  9. Bahareh, K.; Biswajeet, P.; Seyed, A.N.; Alireza, M.; Shattri, M. Assessment of the effects of training data selection on the landslide susceptibility mapping: A comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomat. Nat. Hazards Risk 2018, 9, 49–69. [Google Scholar] [CrossRef]
  10. Hakim, W.L.; Rezaie, F.; Nur, A.S.; Panahi, M.; Khosravi, K.; Lee, C.W.; Lee, S. Convolutional neural network (CNN) with metaheuristic optimization algorithms for landslide susceptibility mapping in Icheon, South Korea. J. Environ. Manag. 2022, 305, 114367. [Google Scholar] [CrossRef]
  11. Mohan, A.; Singh, A.K.; Kumar, B.; Dwivedi, R. Review on remote sensing methods for landslide detection using machine and deep learning. Trans. Emerg. Telecommun. Technol. 2021, 32, e3998. [Google Scholar] [CrossRef]
  12. Yu, C.X.; Huo, J.Y.; Li, C.J.; Zhang, Y.N. Landslide Displacement Prediction Based on a Two-Stage Combined Deep Learning Model under Small Sample Condition. Remote Sens. 2022, 14, 3732. [Google Scholar] [CrossRef]
  13. Hussin, H.Y.; Zumpano, V.; Reichenbach, P.; Sterlacchini, S.; Micu, M.; van Westen, C.; Balteanu, D. Different landslide sampling strategies in a grid-based bi-variate statistical susceptibility model. Geomorphology 2016, 253, 508–523. [Google Scholar] [CrossRef]
  14. Cubuk, E.D.; Zoph, B.; Mané, D.; Vasudevan, V.; Le, Q.V. AutoAugment: Learning Augmentation Policies from Data. arXiv 2018, arXiv:1805.09501. [Google Scholar] [CrossRef]
  15. Taylor, L.; Nitschke, G.S. Improving Deep Learning using Generic Data Augmentation. arXiv 2017, arXiv:1708.06020. [Google Scholar] [CrossRef]
  16. Moreno-Barea, F.J.; Strazzera, F.; Jerez, J.M.; Urda, D.; Franco, L. Forward Noise Adjustment Scheme for Data Augmentation. In Proceedings of the 2018 IEEE Symposium Series on Computational Intelligence (SSCI), Bangalore, India, 18–21 November 2018; pp. 728–734. [Google Scholar] [CrossRef]
  17. Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
  18. Yi, X.; Walia, E.; Babyn, P. Generative adversarial network in medical imaging: A review. Med. Image Anal. 2019, 58, 101552. [Google Scholar] [CrossRef]
  19. Gu, S.W.; Zhang, R.; Luo, H.X.; Li, M.Y.; Feng, H.M.; Tang, X.G. Improved SinGAN Integrated with an Attentional Mechanism for Remote Sensing Image Classification. Remote Sens. 2021, 13, 1713. [Google Scholar] [CrossRef]
  20. Husam, A.H.A.; Biswajeet, P. Spatial landslide susceptibility assessment using machine learning techniques assisted by additional data created with generative adversarial networks. Geosci. Front. 2021, 12, 625–637. [Google Scholar] [CrossRef]
  21. Fang, W.; Zhang, F.H.; Sheng, V.S.; Ding, Y.W. A Method for Improving CNN-Based Image Recognition Using DCGAN. Cmc-Comput. Mat. Contin. 2018, 57, 167–178. [Google Scholar] [CrossRef]
  22. Yongjian, B.; Yongbo, T.; Mingjie, M.; Xiaohui, X.; Yanchao, G.; Hua, G.; Renji, B.; Wei, X. Characteristics and temporal-spatial distribution of geohazards in western Sichuan. Sediment. Geol. Tethyan Geol. 2022, 42, 666–674. [Google Scholar] [CrossRef]
  23. Liu, T.; Chen, T.; Niu, R.Q.; Plaza, A. Landslide Detection Mapping Employing CNN, ResNet, and DenseNet in the Three Gorges Reservoir, China. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2021, 14, 11417–11428. [Google Scholar] [CrossRef]
  24. Liu, G.X.; Dai, E.F.; Xu, X.C.; Wu, W.X.; Xiang, A.C. Quantitative Assessment of Regional Debris-Flow Risk: A Case Study in Southwest China. Sustainability 2018, 10, 2223. [Google Scholar] [CrossRef]
  25. Chang, Z.L.; Huang, J.S.; Huang, F.M.; Bhuyan, K.; Meena, S.R.; Catani, F. Uncertainty analysis of non-landslide sample selection in landslide susceptibility prediction using slope unit-based machine learning models. Gondwana Res. 2023, 117, 307–320. [Google Scholar] [CrossRef]
  26. Wubalem, A.; Meten, M. Landslide susceptibility mapping using information value and logistic regression models in Goncha Siso Eneses area, northwestern Ethiopia. SN Appl. Sci. 2020, 2, 807. [Google Scholar] [CrossRef]
  27. Zeng, C.; Cao, Z.; Su, F.; Zeng, Z.; Yu, C. A dataset of high-precision aerial imagery and interpretation of landslide and debris flow disaster in Sichuan and surrounding areas between 2008 and 2020. China Sci. Data 2022, 7. [Google Scholar] [CrossRef]
  28. Fan, X.; Scaringi, G.; Xu, Q.; Zhan, W.; Dai, L.; Li, Y.; Pei, X.; Yang, Q.; Huang, R. Coseismic landslides triggered by the 8th August 2017 Ms 7.0 Jiuzhaigou earthquake (Sichuan, China): Factors controlling their spatial distribution and implications for the seismogenic blind fault identification. Landslides 2018, 15, 967–983. [Google Scholar] [CrossRef]
  29. Xie, W.; Li, X.; Jian, W.; Yang, Y.; Liu, H.; Robledo, L.F.; Nie, W. A Novel Hybrid Method for Landslide Susceptibility Mapping-Based GeoDetector and Machine Learning Cluster: A Case of Xiaojin County, China. ISPRS Int. J. Geo-Inf. 2021, 10, 93. [Google Scholar] [CrossRef]
  30. Yi, Y.; Zhang, Z.; Zhang, W.; Jia, H.; Zhang, J. Landslide susceptibility mapping using multiscale sampling strategy and convolutional neural network: A case study in Jiuzhaigou region. Catena 2020, 195, 104851. [Google Scholar] [CrossRef]
  31. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  32. Li, L.P.; Lan, H.X.; Guo, C.B.; Zhang, Y.S.; Li, Q.W.; Wu, Y.M. A modified frequency ratio method for landslide susceptibility assessment. Landslides 2017, 14, 727–741. [Google Scholar] [CrossRef]
  33. Castronova, A.M.; Goodall, J.L. A hierarchical network-based algorithm for multi-scale watershed delineation. Comput. Geosci. 2014, 72, 156–166. [Google Scholar] [CrossRef]
  34. Wang, J.F.; Zhang, T.L.; Fu, B.J. A measure of spatial stratified heterogeneity. Ecol. Indic. 2016, 67, 250–256. [Google Scholar] [CrossRef]
  35. Yang, J.T.; Song, C.; Yang, Y.; Xu, C.D.; Guo, F.; Xie, L. New method for landslide susceptibility mapping supported by spatial logistic regression and GeoDetector: A case study of Duwen Highway Basin, Sichuan Province, China. Geomorphology 2019, 324, 62–71. [Google Scholar] [CrossRef]
  36. Zhou, X.Z.; Wen, H.J.; Zhang, Y.L.; Xu, J.H.; Zhang, W.G. Landslide susceptibility mapping using hybrid random forest with GeoDetector and RFE for factor optimization. Geosci. Front. 2021, 12, 101211. [Google Scholar] [CrossRef]
  37. Radford, A.; Metz, L.; Chintala, S. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. arXiv 2016. [Google Scholar] [CrossRef]
  38. Yao, X.; Tham, L.G.; Dai, F.C. Landslide susceptibility mapping based on support vector machine: A case study on natural slopes of Hong Kong, China. Geomorphology 2008, 101, 572–582. [Google Scholar] [CrossRef]
  39. Bui, D.T.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  40. Huang, Y.; Zhao, L. Review on landslide susceptibility mapping using support vector machines. Catena 2018, 165, 520–529. [Google Scholar] [CrossRef]
  41. Pradhan, B. A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 2013, 51, 350–365. [Google Scholar] [CrossRef]
  42. Shorten, C.; Khoshgoftaar, T.M. Spatial Correlation and Value Prediction in Convolutional Neural Networks. IEEE Comput. Archit. Lett. 2019, 18, 10–13. [Google Scholar] [CrossRef]
  43. Wang, Y.; Fang, Z.C.; Hong, H.Y. Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China. Sci. Total Environ. 2019, 666, 975–993. [Google Scholar] [CrossRef]
  44. Qin, Z.L.; Zhou, X.Y.; Li, M.Y.; Tong, Y.X.; Luo, H.X. Landslide Susceptibility Mapping Based on Resampling Method and FR-CNN: A Case Study of Changdu. Land 2023, 12, 1213. [Google Scholar] [CrossRef]
  45. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (Cvpr), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
  46. Ullo, S.L.; Mohan, A.; Sebastianelli, A.; Ahamed, S.E.; Kumar, B.; Dwivedi, R.; Sinha, G. A New Mask R-CNN-Based Method for Improved Landslide Detection. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2021, 14, 3799–3810. [Google Scholar] [CrossRef]
  47. Wang, Y.T.; Seijmonsbergen, A.C.; Bouten, W.; Chen, Q.T. Using statistical learning algorithms in regional landslide susceptibility zonation with limited landslide field data. J. Mt. Sci. 2015, 12, 268–288. [Google Scholar] [CrossRef]
  48. Demoulin, A.; Chung, C. Mapping landslide susceptibility from small datasets: A case study in the Pays de Herve (E Belgium). Geomorphology 2007, 89, 391–404. [Google Scholar] [CrossRef]
Figure 1. Overview of the study area. (a) Location of the study area; (b) elevation and historical landslide location; (c) geological structure.
Figure 1. Overview of the study area. (a) Location of the study area; (b) elevation and historical landslide location; (c) geological structure.
Land 14 00034 g001
Figure 2. Distribution of landslides and non-landslides.
Figure 2. Distribution of landslides and non-landslides.
Land 14 00034 g002
Figure 3. Environmental factor maps of the landslide events. (a) Elevation; (b) aspect; (c) plan curvature; (d) profile curvature; (e) slope; (f) SPI; (g) STI; (h) TWI; (i) relief amplitude; (j) distance to faults; (k) distance to road; (l) distance to river; (m) lithology; (n) landform; (o) land use; (p) soil; (q) precipitation; (r) NDVI.
Figure 3. Environmental factor maps of the landslide events. (a) Elevation; (b) aspect; (c) plan curvature; (d) profile curvature; (e) slope; (f) SPI; (g) STI; (h) TWI; (i) relief amplitude; (j) distance to faults; (k) distance to road; (l) distance to river; (m) lithology; (n) landform; (o) land use; (p) soil; (q) precipitation; (r) NDVI.
Land 14 00034 g003
Figure 4. Technological route.
Figure 4. Technological route.
Land 14 00034 g004
Figure 5. Deep convolutional generative adversarial model architecture.
Figure 5. Deep convolutional generative adversarial model architecture.
Land 14 00034 g005
Figure 6. Results of GeoDetector analysis.
Figure 6. Results of GeoDetector analysis.
Land 14 00034 g006
Figure 7. Accuracy and AUC of landslide susceptibility assessment models trained with additional samples.
Figure 7. Accuracy and AUC of landslide susceptibility assessment models trained with additional samples.
Land 14 00034 g007
Figure 8. ROC curve.
Figure 8. ROC curve.
Land 14 00034 g008
Figure 9. Landslide susceptibility maps using CNN and ResNet in western Sichuan. (a) Aba prefecture in Sichuan, (b) Panzhihua, Liangshan, and Ya’an.
Figure 9. Landslide susceptibility maps using CNN and ResNet in western Sichuan. (a) Aba prefecture in Sichuan, (b) Panzhihua, Liangshan, and Ya’an.
Land 14 00034 g009
Figure 10. Percentage of landslide sensitivity zones.
Figure 10. Percentage of landslide sensitivity zones.
Land 14 00034 g010
Figure 11. Validation of landslide susceptibility mapping results based on large landslide data. (a) Jiuzhaigou landslide group, (b) Maoxian Diexi mountain landslide, (c) Longxi mountain landslide in Wenchuan, (d) Jinchuan Danzhamu mountain landslide, (e) Han Yuan mountain landslide in Ya’an, (f) Jiulong County mountain landslide in Garze.
Figure 11. Validation of landslide susceptibility mapping results based on large landslide data. (a) Jiuzhaigou landslide group, (b) Maoxian Diexi mountain landslide, (c) Longxi mountain landslide in Wenchuan, (d) Jinchuan Danzhamu mountain landslide, (e) Han Yuan mountain landslide in Ya’an, (f) Jiulong County mountain landslide in Garze.
Land 14 00034 g011
Table 1. Data sources of landslide factors.
Table 1. Data sources of landslide factors.
Data NameSourcesTypeResolution
ElevationNASA SRTM DEMraster30 (m)
Geological faultsActive tectonic map of China (1:4,000,000)vector1:4,000,000
RoadNational Basic Geographic Databasevector1:250,000
River systemNational Basic Geographic Databasevector1:250,000
LithologyNational 1:200,000 digital geological mapvector1:200,000
LandformNational 1:200,000 digital geological mapvector1:200,000
Soil1:1 million soil map of the People’s Republic of Chinavector1:1,000,000
PrecipitationChina Meteorological Administration (CMA)station-
Land useGLOBELAND30raster30 (m)
NDVIGoogle Earth Engineraster30 (m)
Table 2. Frequency ratios of some typical landslide factors.
Table 2. Frequency ratios of some typical landslide factors.
Landslide FactorsValueLandslidesPixel NumberFrequency Ratios
Elevation (m)<150076212,521,10610.701
1500~210056622,803,7224.364
2100~270037832,203,6392.064
2300~320015231,160,1600.858
3200~36006643,476,0580.267
3600~40005159,416,0500.151
4000~43001454,487,4580.045
4300~4600858,221,4260.024
>4600337,378,4500.014
NDVI<0.12611,539,4890.396
0.1~0.314736,268,4910.713
0.3~0.5788159,049,0400.871
0.5~0.7886112,372,1501.386
>0.715332,438,8520.829
Precipitation (mm)<55015581,219,4970.336
550~65025693,794,7110.480
650~750490106,941,3150.806
750~100079865,695,3622.136
>10003014,017,18413.175
Distance to a river (km)<1519292320803.122
1~358556,067,2151.835
3~6.340285,449,9230.827
6.3~9.622070,887,6070.546
9.6~1314352,398,1390.480
13~178034,924,9280.403
>175122,708,2180.395
LithologyA (Clastic rock)724107,987,2221.179
B (Granite)346,525,3980.916
C (Metamorphic rock)269133,115,3270.355
D (Continental deposit)31749,941,7421.116
E (Urban)2399,216,8524.560
F (Lake)3111,4104.735
G (Carbonatite)25131,535,6691.400
H (Outcrop)0660,4140
I (Basalt)16312,401,4492.311
Distance to road (km)<0.596432,899,1675.152
0.5~1.545847,514,0331.695
1.5~3.531471,330,0260.774
3.5~613661,810,4010.387
6~106358,396,2570.190
10~153136,685,4270.149
>153443,032,7990.139
Table 3. Multi-collinearity analysis of landslide factors.
Table 3. Multi-collinearity analysis of landslide factors.
Landslide FactorsVIFTOL
Aspect1.006760.99329
Lithology1.097880.91085
Profile curvature1.10110.90818
Plan curvature1.229130.81358
Distance to fault1.285920.77765
Landform1.354680.73818
NDVI1.416160.70613
Land use1.451530.68892
Distance to a river1.459960.68495
Soil1.85640.53868
Relief amplitude1.993120.50173
Precipitation2.344960.42645
Distance to road2.641310.37860
SPI4.157780.24051
TWI4.187070.23883
Elevation5.884570.16994
STI9.733190.10274
Slope11.08130.09024
Table 4. Model hyperparameters.
Table 4. Model hyperparameters.
HyperparameterDCGANCNNResNet18
Kernel4 × 43 × 33 × 3
Pooling-2 × 23 × 3
Activation FunctionD: LeakyReLU
G: Tanh,ReLu
TanhReLU
OptimizerAdamAdamAdam
Loss FunctionBCELossCELossCELoss
Learning RateD: 0.0002
G: 0.001
5 × 10−45 × 10−4
Epoch6000150150
Table 5. Evaluation of landslide susceptibility assessment models.
Table 5. Evaluation of landslide susceptibility assessment models.
ModelACC%PRETPRF1-SocerMCC
SVM77.080.76270.76070.76040.5236
CNN86.690.86310.87310.86810.7339
ResNet86.500.87180.84010.85570.7294
DCGAN-SVM79.260.79270.79060.78260.5853
DCGAN-CNN89.260.90740.87410.88990.7856
DCGAN-ResNet91.780.91490.90940.91210.8257
Table 6. Validation of landslide susceptibility mapping.
Table 6. Validation of landslide susceptibility mapping.
ModelVery LowLowModerateHighVery High
CNN0012.24%28.68%59.08%
ResNet0019.62%60.42%19.96%
DCGAN-CNN0016.37%22.65%60.98%
DCGAN-ResNet04.15%5.79%24.79%66.5%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tong, Y.; Luo, H.; Qin, Z.; Xia, H.; Zhou, X. Enhanced Landslide Susceptibility Assessment in Western Sichuan Utilizing DCGAN-Generated Samples. Land 2025, 14, 34. https://doi.org/10.3390/land14010034

AMA Style

Tong Y, Luo H, Qin Z, Xia H, Zhou X. Enhanced Landslide Susceptibility Assessment in Western Sichuan Utilizing DCGAN-Generated Samples. Land. 2025; 14(1):34. https://doi.org/10.3390/land14010034

Chicago/Turabian Style

Tong, Yuanxin, Hongxia Luo, Zili Qin, Hua Xia, and Xinyao Zhou. 2025. "Enhanced Landslide Susceptibility Assessment in Western Sichuan Utilizing DCGAN-Generated Samples" Land 14, no. 1: 34. https://doi.org/10.3390/land14010034

APA Style

Tong, Y., Luo, H., Qin, Z., Xia, H., & Zhou, X. (2025). Enhanced Landslide Susceptibility Assessment in Western Sichuan Utilizing DCGAN-Generated Samples. Land, 14(1), 34. https://doi.org/10.3390/land14010034

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop