Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet

Li, Jiatian; Wang, Ruirui; Shi, Wei; Yang, Le; Wei, Jiahao; Liu, Fei; Xiong, Kaiwei

doi:10.3390/rs17152678

Open AccessArticle

Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet

by

Jiatian Li

^1,2

,

Ruirui Wang

^1,2,*,

Wei Shi

³,

Le Yang

^4,5,

Jiahao Wei

^1,2

,

Fei Liu

^1,2 and

Kaiwei Xiong

^1,2

¹

College of Forestry, Beijing Forestry University, Beijing 100083, China

²

Beijing Key Laboratory of Precision Forestry, Beijing Forestry University, Beijing 100083, China

³

Beijing Ocean Forestry Technology Co., Ltd., Beijing 100083, China

⁴

Key Laboratory of Biological Resources and Biosafety, Institute of Plateau Biology Research of Xizang Autonomous Region, Lhasa 850000, China

⁵

School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2025, 17(15), 2678; https://doi.org/10.3390/rs17152678

Submission received: 21 May 2025 / Revised: 15 July 2025 / Accepted: 31 July 2025 / Published: 2 August 2025

Download

Browse Figures

Versions Notes

Abstract

Landslides are destructive geological hazards, making accurate landslide susceptibility assessment essential for disaster prevention and mitigation. However, existing studies often lack scientific rigor in negative sample construction and have unclear model applicability. This study focuses on Ya’an City, Sichuan Province, China, and proposes an innovative approach to negative sample construction using Geographically Weighted Regression (GWR), which is then integrated with Tabular Network (TabNet), a deep learning architecture tailored to structured tabular data, to assess landslide susceptibility. The performance of TabNet is compared against Random Forest, Light Gradient Boosting Machine, deep neural networks, and Residual Networks. The experimental results indicate that (1) the GWR-based sampling strategy substantially improves model performance across all tested models; (2) TabNet trained using the GWR-based negative samples achieves superior performance over all other evaluated models, with an average AUC of 0.9828, exhibiting both high accuracy and interpretability; and (3) elevation, land cover, and annual Normalized Difference Vegetation Index are identified as dominant predictors through TabNet’s feature importance analysis. The results demonstrate that combining GWR and TabNet substantially enhances landslide susceptibility modeling by improving both accuracy and interpretability, establishing a more scientifically grounded approach to negative sample construction, and providing an interpretable, high-performing modeling framework for geological hazard risk assessment.

Keywords:

landslide susceptibility; GWR; TabNet; negative sample selection; structured data modeling; machine learning

1. Introduction

Landslides are spatial displacement processes involving the downslope movement of rocks, soil, and loose debris, either as a coherent mass or in scattered form, driven by tectonic activity, hydrometeorological factors, and gravitational forces [1]. In China, landslides primarily occur in hilly and mountainous areas and are characterized by wide distribution, delayed onset, and significant destructiveness. Accurate landslide susceptibility assessment is vital for early warning and disaster mitigation efforts. Ya’an City, situated in the transitional zone between the Sichuan Basin and the Tibetan Plateau in southwestern China, features complex topography, rugged terrain, and active tectonic activity, resulting in a high frequency of landslide occurrences. Assessing landslide susceptibility in Ya’an offers both strong regional representativeness and substantial practical relevance.

Current landslide susceptibility assessment methods can be broadly categorized into two groups: qualitative approaches based on expert judgment and quantitative approaches grounded in geostatistics and machine learning [2,3]. Qualitative methods such as hierarchical analysis [4] and fuzzy comprehensive evaluation [5] rely heavily on expert experience and are often unsuitable for large-scale or objective assessments. Statistical models, including the information value model [6], the weight of evidence model [7], and the frequency ratio model [8], use mathematical formulas to approximate variable relationships. However, these methods often struggle to capture nonlinear and complex interactions among environmental factors. In contrast, machine learning approaches, such as Boosted Regression Trees (BRTs) [9], Support Vector Machines (SVMs) [10,11], Random Forests (RFs) [12], and Light Gradient Boosting Machine (LightGBMs) [13], have been proven to be able to capture the complex interactions and demonstrate good accuracy. However, their performance may deteriorate on large-scale datasets or in the presence of complex feature interactions due to limitations in representational capacity and generalization ability [14].

The emergence of deep learning has introduced models such as deep neural networks (DNNs) [15], Convolutional Neural Networks (CNNs) [16], Recurrent Neural Networks (RNNs) [17], and Long Short-Term Memory (LSTM) [18] networks into landslide susceptibility modeling. More advanced architectures such as Residual Networks (ResNet) [19] and U-Net [20] further enhance the ability to capture complex spatial patterns and extract high-level features. However, due to differences in architecture and learning mechanisms, these models exhibit varying strengths and suitability for different tasks: DNNs are suited for structured tabular data, CNNs excel at extracting spatial features from images, RNNs and LSTMs are designed for temporal sequence modeling, and ResNet/U-Net architectures are effective in learning deep representations and spatial hierarchies. In landslide susceptibility assessment, which is primarily driven by multi-factor structured datasets, not a single deep learning model is universally optimal. Deep learning approaches face limitations in training efficiency, interpretability, and their adaptability to tabular data formats.

To overcome these limitations, this study introduces Tabular Network (TabNet)—a deep learning architecture specifically developed for structured data. Proposed by Google Cloud in 2019 [21], TabNet has gained popularity in domains such as financial risk assessment [22] and medical diagnosis [23], and has also been applied in environmental [24] and remote sensing research [25]. However, its application to landslide susceptibility analysis remains largely unexplored. By integrating the interpretability and sparse feature selection mechanisms of tree-based models with the end-to-end learning capabilities of neural networks, TabNet offers a promising framework for spatial risk modeling. To assess its performance, we compare TabNet against traditional machine learning models (RF, LightGBM) and deep learning models (DNN, ResNet).

One of the fundamental principles of machine learning is that training datasets must include both positive samples and negative samples, whose quality plays a critical role in determining model performance. In landslide susceptibility modeling, positive samples refer to locations where landslides have already occurred, typically derived from historical landslide inventories. Negative samples refer to locations where no landslide has been recorded or where the terrain is currently considered stable without visible signs of slope failure, and are usually selected from areas where landslides have not occurred. However, randomly selected non-landslide areas may contain areas that may be in the early stages of instability, thereby introducing label noise and adversely affecting the predictive performance of the model [26]. Most existing studies rely on random sampling, which does not guarantee the reliability of negative samples and may result in the mislabeling of high-risk areas as negatives, thereby undermining predictive accuracy.

Geographically Weighted Regression (GWR), a spatial regression technique that accounts for spatial nonstationarity, produces local coefficients that capture the spatial variability in factor contributions. Coefficients with low absolute values indicate weak explanatory power and limited linear correlation with landslide occurrence [27]. GWR has been widely applied in spatial partitioning and hazard sensitivity assessment, demonstrating strong interpretability in spatial risk evaluation [28]. Although some studies have attempted to incorporate GWR into negative sample selection [29], such applications remain relatively uncommon and lack a standardized or systematic framework. In this study, we compute the GWR coefficients for each landslide susceptibility indicator, apply absolute value transformation and normalization, and subsequently aggregate the coefficients to derive a composite explanatory index. Negative samples are selected from regions with low composite scores to reduce the likelihood of mislabeling potentially high-risk areas, thus improving the distinguishability and reliability of training data.

In summary, this study focuses on Ya’an City in Sichuan Province, China, and integrates GWR with TabNet to assess landslide susceptibility. We conduct a comparative analysis of GWR-based and traditional random negative sampling strategies, and evaluate four modeling approaches (RF, LightGBM, DNN, ResNet) alongside TabNet to examine how different negative sampling strategies and modeling methods influence the uncertainty of landslide susceptibility predictions. Through a novel sample construction strategy and tailored model architecture, the proposed approach significantly enhances predictive accuracy and spatial adaptability, offering both technical support and methodological guidance for landslide risk management.

2. Study Area and Data Sources

2.1. Study Area

Ya’an City is located in the central-western part of Sichuan Province, China, spanning from 28°51′ to 30°56′N latitude and 101°56′ to 103°23′E longitude, as shown in Figure 1. Ya’an is located in the transitional zone between the Tibetan Plateau and the Sichuan Basin, characterized by significant elevation differences and complex terrain which define its distinctive topography and geomorphology. Situated along the boundary between the Indian and Eurasian plates, the region features dense fault systems and diverse lithological units, creating favorable conditions for landslides, a result of its geological structure. Its subtropical humid monsoon climate brings abundant, concentrated rainfall, especially during the rainy season, often triggering slope failures through surface runoff accumulation. In addition, human activities such as road construction and urban development also disturb natural slopes, further increasing the risk of landslides. Therefore, selecting Ya’an as the study area is both representative and provides valuable insights for landslide susceptibility assessment in regions with similar geomorphological and climatic characteristics.

2.2. Data Sources

The landslide dataset was constructed by integrating multiple sources, including results from geological hazard investigations and landslide point data released by local governments or relevant agencies, resulting in the identification of 1481 historical landslide points in Ya’an City. All landslide points were standardized to the same coordinate reference system and checked for alignment with environmental raster layers. Spatial outliers and duplicate entries were removed through visual inspection to ensure spatial consistency and reliability.

Landslides are triggered by a combination of natural and anthropogenic factors, involving topographic, geological, hydrometeorological, and human activity-related elements. To systematically capture these potential influencing factors, this study draws extensively on previous research and incorporates the geological characteristics and landslide development context of the study area. A total of 17 indicators were initially selected for the landslide susceptibility assessment, including the following: elevation, slope, aspect, Slope Length and Steepness Factor (LS) [30], Terrain Ruggedness Index (TRI) [31], plan curvature [32], profile curvature [33], Topographic Wetness Index (TWI) [34], Stream Power Index (SPI) [35], lithology, distance to faults, earthquake density, annual precipitation, annual Normalized Difference Vegetation Index (NDVI) [36], distance to rivers, land cover, and distance to roads.

The data sources corresponding to these indicators are summarized in Table 1. For indicators without explicit citations, the data were obtained from existing datasets and processed using GIS-based techniques. According to existing research, utilizing a grid unit with a resolution of 30 m or finer in landslide susceptibility analysis enhances the model’s predictive capability [37,38]. Therefore, all indicators were standardized to the same spatial resolution of 30 m and temporal scale, and were uniformly preprocessed. The final processed datasets are presented in Figure 2.

2.3. Division of Evaluation Units and Selection of Susceptibility Indicators

This study uses slope units as the fundamental spatial unit for landslide susceptibility assessment. Slope units are delineated based on valley boundaries and are widely recognized for their effectiveness in capturing local terrain and geomorphological features [39]. They also exhibit strong spatial control over the development of geological hazards such as landslides, debris flows, and rockfalls, making them a commonly used unit in hazard susceptibility evaluations [40]. In this study, slope units were generated using hydrological analysis tools in the GIS platform. The main processing steps included sink filling, flow direction extraction, flow accumulation calculation, stream network generation, and watershed segmentation [41]. A total of 16,491 slope units were delineated across the study area.

As a preliminary step, this study uses the Geodetector method to filter out those susceptibility indicators that have low relevance to landslide occurrence [42]. Geodetector is a statistical tool based on the principle of spatial stratified heterogeneity [43]. Its core metric, the q-value, quantifies the extent to which a given factor explains the spatial distribution of a target variable. A q-value closer to 1 indicates stronger explanatory power. As shown in Figure 3a, the q-value results were calculated, and a threshold of 0.05 was applied in this study [44,45]. Seven indicators with weak spatial explanatory power were excluded, and the remaining ten were retained for subsequent modeling.

To reduce potential interference from multicollinearity among indicators, this study performed a correlation analysis. Pearson correlation coefficients [46] were calculated between all pairs of susceptibility indicators to quantitatively assess the degree of correlation. As shown in Figure 3b, there is a strong correlation between TRI and slope, as well as between annual precipitation and elevation, with coefficients exceeding 0.7 [47]. Due to this strong correlation, TRI and annual precipitation were excluded, and the remaining eight indicators were retained as input variables for the modeling phase.

3. Method

The overall workflow of this study is illustrated in Figure 4 and consists of four main stages:

Data preparation and preprocessing: This step includes data collection, the division of evaluation units, and the selection of susceptibility indicators (described in Section 2.2 and Section 2.3).

Sample data construction: Historical landslide points serve as positive landslide samples, while negative landslide samples are generated using the GWR-based negative sample construction method. A random sampling-based negative sample construction is also included for comparison.

Model training phase: Given the generated samples from the previous step, TabNet is trained and compared with several widely used methods, including RF, LightGBM, DNN, and ResNet. Model performance is assessed using several commonly used metrics.

Prediction phase: The best-performing TabNet model is used for landslide susceptibility prediction, followed by feature importance analysis, offering support for disaster prevention, risk management, and spatial planning.

3.1. Negative Sample Construction Based on GWR

This study introduces the GWR model to guide the selection of negative samples. The workflow of GWR-based negative sample construction process is illustrated in Figure 5a.

3.1.1. GWR Modeling Principle

GWR is a local regression technique that accounts for spatial nonstationarity by fitting a separate regression model at each spatial location [27], which enables a more precise representation of spatial heterogeneity. The local regression coefficients produced by GWR represent the spatial variation in the explanatory power of individual factors. In this study, GWR is applied using the eight selected susceptibility indicators from Section 2.3 as independent variables, and the number of historical landslide occurrences within each slope unit as the dependent variable. The general form of the GWR model is expressed as follows:

y_{i} = β_{0} (u_{i}, v_{i}) + \sum_{k} β_{k} (u_{i}, v_{i}) x_{i k} + ε_{i} .

In this equation:

y_{i}

is the number of landslide occurrences in the

i^{t h}

slope unit;

x_{i k}

is the value of the

k^{t h}

susceptibility indicators for the

i^{t h}

slope unit;

(u_{i}, v_{i})

denotes the geographic coordinates of the

i^{t h}

unit;

β_{k} (u_{i}, v_{i})

is the regression coefficient of the

k^{t h}

indicators at location

(u_{i}, v_{i})

; and

ε_{i}

is the random error term.

3.1.2. Identification of Low-Explanatory Areas and Selection of Negative Samples

The GWR model reveals spatial nonstationarity in the relationships between independent and dependent variables, with its regression coefficients reflecting spatial variability in the explanatory capacity of each factor with respect to landslide occurrence. Note that the purpose of this process is not to directly assess landslide susceptibility, but rather to identify areas where the model exhibits weak explanatory performance. When the absolute values of regression coefficients in a given area approach zero, it indicates that the corresponding factors have limited or unstable contributions to landslide occurrence in that region. These areas can be regarded as low-explanatory zones, where the model struggles to distinguish between landslide-prone and stable units, making them suitable candidates for negative sample selection.

To construct reliable negative samples, this study computed the absolute values of the GWR coefficients for the eight selected susceptibility indicators. These values were then normalized and aggregated to produce a composite explanatory index, where lower index values indicate weaker overall explanatory capacity for landslide occurrence. The natural breaks classification method [48]—a data clustering technique that minimizes within-class variance and maximizes between-class variance for optimal classification—was applied to divide the index into five levels, as it is widely used in geoscientific research, particularly in spatial risk zoning [49,50]. And the two lowest levels, which represent regions with the weakest explanatory power, were selected as low-explanatory zones. Within these zones, negative samples equal in number to the positive samples were randomly selected. This approach helps avoid the mislabeling of potentially high-risk areas as negative samples, thereby improving the reliability of the training data and enhancing the generalization performance of the model.

To evaluate the effectiveness of the GWR-based negative sample construction method, a baseline approach based on conventional random sampling was also implemented. Specifically, a 100 m buffer [49,51] was generated around each historical landslide point, and an equal number of negative samples were randomly selected from areas outside these buffer zones.

3.2. Landslide Susceptibility Modeling Based on TabNet

This study employs TabNet to model landslide susceptibility. To comprehensively evaluate modeling performance, several widely used methods are selected for comparison, including traditional machine learning models (RF and LightGBM), as well as deep learning models (DNN and ResNet). The overall modeling workflow is illustrated in Figure 5b.

3.2.1. Overview of TabNet

The architecture of TabNet is illustrated in Figure 6. TabNet preserves the end-to-end and representation learning characteristics of deep neural networks, while also incorporating the interpretability and sparse feature selection of tree-based models. This hybrid design enables TabNet to deliver strong predictive performance on tabular data tasks, often matching or exceeding that of mainstream tree-based algorithms, while also providing enhanced model transparency. Given that landslide susceptibility assessment is a task dominated by structured and multi-factor data, TabNet demonstrates strong adaptability to this setting.

TabNet consists of multiple decision steps, each employing feature and attentive transformers for dynamic feature selection and nonlinear transformation. The input features are first processed through Batch Normalization. At each decision step, the Attentive Transformer generates a sparse feature mask that dynamically selects the most relevant subset of features. This mask is applied to the input before passing it to the Feature Transformer, which consists of multiple fully connected layers with ReLU activations to enhance the model’s nonlinear representation capacity. The transformed output is then divided by the split module into two branches: one is aggregated across all steps via the aggregation module, while the other is propagated to the next decision step for further feature selection and transformation. Finally, the accumulated output from all decision steps is fed into a fully connected layer to generate the final prediction.

During model training, the TabNet model is optimized using the Adam optimizer, with binary cross-entropy as the loss function. All input features are standardized before training. The core hyperparameters of TabNet are configured as follows: both the feature transformer and decision step dimensions are set to 64; the number of decision steps is 3; the relaxation factor is 1.2; and the sparsity regularization coefficient is set to 0.01. All other parameters are kept at their default values. The model is trained for a maximum of 100 epochs with a batch size of 256.

To assess the applicability and effectiveness of the TabNet model, this study introduces four widely used comparative models for training and evaluation. These models include RF [52], which constructs multiple decision trees by bootstrapping the training data and aggregates their predictions through majority voting; LightGBM [53], an efficient implementation of the Gradient Boosting Decision Tree framework that employs a leaf-wise growth strategy and histogram-based algorithm to accelerate training and improve predictive performance; DNN [54], a feedforward architecture composed of multiple fully connected layers capable of capturing complex nonlinear relationships; and ResNet [55], which extends the DNN structure by incorporating residual connections to mitigate the vanishing gradient problem and enhance training stability.

3.2.2. Cross-Validation

To enhance the stability and generalization of model evaluation, this study adopts a cross-validation approach for training and testing each model. The fundamental idea is to partition the dataset into several subsets, using one subset as the test set and the remaining subsets for training in each iteration [56,57]. This training–validation cycle is repeated multiple times, and the evaluation results are averaged. This method effectively mitigates performance fluctuations caused by the randomness of data partitioning, thereby improving the reliability and representativeness of the evaluation outcomes. Cross-validation has been widely adopted in landslide susceptibility modeling and has been proven to provide a more objective assessment of model performance [58,59].

In this study, a 10-fold cross-validation strategy is employed, in which the dataset is evenly divided into 10 subsets. In each round, one fold is used as the test set and the remaining nine are used as the training set. The average performance across the 10 folds is reported as the final evaluation metric of the model.

3.2.3. Model Performance Evaluation Metrics

This study employs accuracy (ACC), recall (RE), and Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) to evaluate the performance of different models in landslide susceptibility assessment. In addition, ROC curves are used to visualize the models’ discriminative capabilities. To comprehensively assess the agreement between predicted and actual outcomes, as well as the reliability of probabilistic outputs, this study further introduces Cohen’s Kappa coefficient (Kappa) [60] and the Brier score [61] as supplementary evaluation metrics. The definitions and corresponding mathematical formulations of these metrics are summarized in Table 2.

4. Results

4.1. Comparison of Results Under Different Negative Sample Construction Strategies

In this study, the eight selected susceptibility indicators are used as independent variables, and the number of landslide occurrences within each slope unit is taken as the dependent variable for GWR fitting. Spatial maps of the regression coefficients for each indicator are shown in Figure 7a–h, illustrating the spatial heterogeneity in their explanatory influence with respect to landslide occurrence. A larger absolute value of a coefficient indicates stronger explanatory influence in that region, while values close to zero suggest a lack of statistically significant association. Positive coefficients denote a positive correlation between the factor and landslide occurrence, whereas negative coefficients indicate an inverse relationship.

Subsequently, the absolute values of the GWR coefficients for all susceptibility indicators were calculated, normalized, and aggregated to a composite explanatory index for each slope unit. This index was then classified into five levels using the natural breaks method, as illustrated in Figure 7i.

A clear spatial correspondence is observed between higher values of the composite explanatory index and the concentration of historical landslide points, validating the spatial rationality of the GWR fitting results. The two lowest index levels, which represent areas with weak overall explanatory power, were selected as candidate zones for negative sample construction. The landslide points in these areas are scattered and very sparse, accounting for only 5% of the total landslides in the study area. Within these zones, an equal number of negative samples were randomly selected to match the number of positive samples. The spatial distribution of the GWR-based negative samples is shown in Figure 8a. For comparison, a conventional random sampling strategy was implemented as a baseline, and the distribution of these randomly selected negative samples is shown in Figure 8b.

Both sets of negative samples were incorporated into the same landslide susceptibility modeling approach, and the resulting AUC values are presented in Table 3. The results demonstrate that the GWR-based negative sampling method substantially enhances predictive performance across all models.

4.2. Comparison of Landslide Susceptibility Modeling Results Across Different Models

The performance metrics for each model are summarized in Table 4 and Figure 9. The results demonstrate that TabNet consistently outperforms all other models across most metrics, achieving the highest AUC (0.9828), ACC (0.9298), and the lowest Brier Score (0.0521), indicating both superior discriminative ability and reliable probabilistic prediction. Moreover, TabNet also records the highest Kappa value (0.8589), reflecting a high level of agreement between predicted and actual classifications after adjusting for chance.

Among the traditional machine learning models, RF shows strong performance, achieving the highest recall (0.9489), slightly surpassing TabNet in identifying landslide occurrences. LightGBM also performs competitively, achieving the second-highest AUC (0.9722), though with slightly lower recall and Kappa.

In contrast, the deep learning models DNN and ResNet exhibit comparatively weaker performance. DNN achieves a lower ACC (0.9046) and recall (0.8974), along with a higher Brier Score (0.0720), indicating poorer probability calibration. Despite the use of residual connections, ResNet records a lower ACC (0.8992), recall (0.8903), and Kappa (0.7977), along with the highest Brier Score (0.0735), underscoring its limitations in modeling small to medium-sized structured datasets.

In summary, all evaluated models exhibit distinct levels of predictive capability in landslide susceptibility modeling. Traditional machine learning methods offer stable and interpretable results, making them well-suited for practical applications that require fast and reliable deployment. Deep neural networks demonstrate potential in capturing complex feature dependencies but remain sensitive to data scale and training strategies. TabNet, by contrast, achieves an effective balance among predictive accuracy, interpretability, and adaptability.

4.3. Landslide Susceptibility Prediction and Feature Importance Analysis Based on TabNet

4.3.1. Landslide Susceptibility Prediction

Based on its superior predictive performance, we use TabNet to generate landslide susceptibility predictions across the study area. A continuous susceptibility index ranging from 0 to 1 is produced and subsequently classified into five levels using the natural breaks method. The statistical distribution of these susceptibility classes is summarized in Table 5 and Figure 10.

The results indicate that both the proportion of landslide occurrences and the density of landslide points increase with higher susceptibility levels. The very-high-susceptibility zone, which accounts for only 23.50% of the total study area, contains 73.40% of all recorded landslide events, with a density of 0.3100 events per km², substantially exceeding that of other classes.

The high- and moderate-susceptibility zones contain 10.53% and 9.18% of the total landslide occurrences, with corresponding densities of 0.1441 events per km² and 0.0918 events per km², indicating a moderate degree of risk concentration in these zones.

The low- and very-low-susceptibility zones exhibit substantially lower landslide densities, at 0.0374 and 0.0033 events per km². These zones cover 14.32% and 45.00% of the study area and are considered relatively safe regions with limited landslide activity. The clear spatial delineation of risk levels in the TabNet-generated susceptibility map further validates the model’s reliability and underscores its practical utility in landslide risk management and regional planning.

As shown in Figure 11, the spatial distribution map reveals pronounced spatial heterogeneity in landslide susceptibility across the study area. Very-high-susceptibility zones are primarily concentrated in the eastern, northeastern, and southeastern regions of Ya’an City, where rugged terrain, steep slopes, highly fractured rock masses, and dense fault structures converge. These areas constitute key zones for landslide disaster prevention and control.

High- and moderate-susceptibility zones form patchy and banded patterns surrounding the very high susceptibility regions. They are typically located in areas with abrupt terrain transitions or sharp geological structural changes, reflecting both strong spatial clustering and transitional characteristics of landslide occurrence.

In contrast, low- and very-low-susceptibility zones are predominantly distributed in the northern, northwestern, central, and southern parts of the study area. These areas are characterized by relatively flat terrain, gentle slopes, high vegetation cover, and the presence of concentrated agricultural or built-up land, indicating relatively low landslide risk.

To examine the spatial clustering of landslide occurrences across different landslide susceptibility zones, we performed a hotspot analysis using the Getis-Ord Gi* [63] statistic on both the landslide points and susceptibility classification results, followed by a quantitative interpretation of the outputs. The results are presented in Table 6 and Table 7 and Figure 12. In this context, a hotspot refers to a location where high landslide susceptibility zoning values exhibit statistically significant spatial clustering. Conversely, a cold spot indicates a spatial cluster of low-susceptibility zoning values.

A total of 812 landslide points were identified as hotspots (Gi Bi ≥ 1, p-value < 0.1), representing 54.83% of all landslide points, with 23.57% classified as highly significant hotspots (p-value < 0.01), indicating statistically significant spatial clustering and confirming the non-random nature of landslide distribution. Among these hotspot points, 791 fell within extremely-high-susceptibility zones, accounting for 97.41% of all hotspots and 72.77% of the landslide points in that zone. This demonstrates a pronounced spatial aggregation of landslides in high-susceptibility areas, providing indirect validation for the rationality of the susceptibility zoning scheme. In contrast, cold spots (Gi Bi ≤ −1, p-value < 0.1) were mainly distributed in medium-, low-, and very low-susceptibility zones, comprising less than 20% of all landslide points.

4.3.2. Feature Importance Analysis

To further examine the influence of individual factors on landslide susceptibility, the feature importance scores produced by TabNet model are used to quantitatively evaluate the spatial contribution of each variable, as illustrated in Figure 13.

The results indicate that elevation has the highest importance score at 0.3583, far exceeding all other factors. This suggests that topographic elevation is the dominant driver of landslide occurrence. High-elevation areas are often associated with complex terrain and steep and unstable slopes, and are frequently identified as core zones for landslide activity.

Land cover ranks second in feature importance, with a score of 0.1692, highlighting the significant influence of human activities on slope stability. Construction and agricultural land use frequently involve excavation, terrain modification, and vegetation clearance, all of which may increase the risk of slope disturbance and landslides.

The annual NDVI contributes an importance score of 0.1479, underscoring the protective role of vegetation cover. Areas with high NDVI values typically exhibit dense vegetation, which enhances slope stability by reinforcing soil structure and mitigating erosion caused by rainfall and surface runoff.

Slope (0.0821) and distance to roads (0.0792) demonstrate moderate influence. The former reflects the gravitational component of landslide triggering, while the latter suggests the destabilizing effects of road cuts, drainage alterations, and vibrations from traffic.

Distance to rivers (0.0631) and lithology (0.0562) exhibit lower but non-negligible contributions. River proximity may promote toe erosion and saturation, while lithological properties affect soil strength and water permeability.

Finally, the LS exhibits the lowest importance score at 0.044, indicating a relatively limited role in landslide prediction within the context of this study.

5. Discussion

5.1. Rationale and Limitations of GWR-Based Negative Sample Construction

The results of this study demonstrate that the negative sample construction method based on GWR outperforms the traditional random sampling method across all models. This finding aligns with previous research, suggesting that incorporating spatial distribution characteristics into the negative sample construction process significantly enhances the model’s discriminative ability and generalization performance in landslide susceptibility modeling [65]. This is because traditional random sampling typically selects negative samples from areas without recorded landslides, which may inadvertently include locations that have not experienced landslides but possess high susceptibility. Such incorrect labeling of negative samples can mislead the model and compromise its predictive performance.

Several studies have looked into enhancing negative sample construction methods in landslide susceptibility evaluation, including random selection in areas with slopes of less than 5° [66], random selection outside the landslide buffer zone [67], and selection based on information theory methods [68]. The novelty of our GWR-based negative sample construction method lies in its incorporation of spatial regression information between landslides and influencing factors. This allows the selected negative samples to carry a clearer “low-risk” semantic, thereby enhancing their scientific validity and representativeness. It enables the model to learn more robust discriminative features, providing more accurate and reliable methodological support for landslide susceptibility assessment.

However, the GWR-based negative sample construction method also has certain limitations, particularly its relatively complex computational process. Specifically, its application to large-scale datasets may result in high computational costs due to the need to compute pairwise spatial distances and fit a separate weighted least squares regression at each spatial location. Therefore, improving the computational efficiency of the GWR-based negative sample construction method to meet the demands of large-scale and real-time landslide prediction is a key area for future research. Additionally, considering issues such as data imbalance, spatial heterogeneity, and the complex interactions of influencing factors in landslide susceptibility modeling, future studies should explore diversified negative sample selection methods. These methods should be tailored to the specific characteristics of different datasets and model requirements to further optimize the quality and selection strategy of negative samples.

5.2. Evaluating TabNet’s Robustness and Applicability in Susceptibility Modeling

In terms of model comparison, the TabNet model, a deep neural network specifically designed for structured data, demonstrates exceptional performance, achieving an AUC of 0.9828 and significantly outperforming the other comparison models. This success can be attributed to the unique structural advantages of TabNet. It retains the interpretability and sparse feature selection mechanisms of tree-based models, while also incorporating the end-to-end learning capabilities of neural networks. These combined strengths allow TabNet to more accurately uncover nonlinear relationships and complex features in the data, thereby enhancing both predictive accuracy and model interpretability.

To further evaluate the model’s robustness, we conducted a sensitivity analysis for the TabNet model to evaluate its robustness to key hyperparameters. We systematically varied several important parameters within recommended ranges, as shown in Table 8. The results showed that TabNet consistently achieved high AUC values across different hyperparameter combinations, with an average AUC of 0.9759 and a standard deviation of only 0.0033. This indicates that TabNet’s performance remains stable and robust under a wide range of configurations, and that its parameter settings offer strong representativeness for structured geospatial data.

Although TabNet achieved excellent performance in this study, its generalizability remains to be further validated. Future research should focus on evaluating the model across larger, more complex datasets and diverse geographic regions to assess its robustness under varying environmental and geological conditions. In addition, due to its relatively high computational cost during training, future work could explore model optimization strategies to improve efficiency and enhance applicability to large-scale or real-time scenarios.

5.3. Relevance and Limitations of Susceptibility Indicators

Through feature importance analysis of the TabNet model, it was found that elevation, land cover, and annual NDVI play a dominant role in the model. This aligns with theoretical knowledge in the field of geology and supports findings from previous research [69]. However, some studies have reached different conclusions. For instance, some studies emphasize the significant role of rainfall in landslide susceptibility, suggesting that precipitation is a key factor in determining landslide occurrence [70]. These differences may be attributed to the adoption of different modeling approaches, which vary in how they interpret and assign importance to input features. In addition, regional differences in topography, geological conditions, and climate can further influence the relevance of specific factors. Therefore, future landslide susceptibility modeling requires the selection of an appropriate model and careful consideration of the study area’s environmental characteristics. Given the high importance of land cover revealed in this study, future research could further explore human–environment interactions by integrating indicators of anthropogenic activity. Examples include night-time light intensity, population density, or infrastructure distribution, which may help better capture the impact of human disturbance on slope stability and support the development of more comprehensive, coupled modeling frameworks.

Another important aspect not considered in this study is the role of extreme weather events. Due to the focus of this study on long-term landslide susceptibility rather than event-based hazard assessment, it does not account for extreme weather events, such as short-term intense rainfall or typhoons. However, these events are widely recognized as critical triggers of landslides. Future research could address this limitation by incorporating high-resolution meteorological data or historical records of extreme events to enhance the model’s temporal prediction capability.

In addition, this study does not explicitly account for the spatial autocorrelation between landslide occurrences and explanatory variables, which may influence model predictions and statistical inference. In spatial phenomena such as landslides, nearby locations often share similar environmental characteristics, leading to the spatial clustering of both the dependent and independent variables. Ignoring this spatial dependence may result in biased estimation of feature importance or overestimation of model performance due to the violation of the independence assumption. Future research could incorporate spatial statistical methods or geospatial machine learning frameworks such as spatial lag models to better capture the influence of spatial dependence and improve the robustness of landslide susceptibility assessments.

6. Conclusions

This study constructs a series of landslide susceptibility prediction models based on slope unit partitioning and historical landslide data in Ya’an City. Comparative experiments are conducted from two key perspectives: negative sample construction strategies and modeling architectures, leading to the following main conclusions:

(1): The GWR-based negative sampling method consistently outperforms traditional random sampling across all evaluated models, significantly improving both discriminative accuracy and generalization performance. By selecting negative samples from regions with low composite explanatory power, the GWR approach enhances the representativeness of low-risk areas and improves the distinction between positive and negative instances.
(2): As a deep neural network specifically designed for structured data, TabNet achieves outstanding performance in landslide susceptibility modeling, attaining an average AUC of 0.9828, which is substantially higher than those of RF, LightGBM, DNN, and ResNet. TabNet exhibits strong adaptability to structured inputs, high predictive accuracy, and robust interpretability, confirming its effectiveness and suitability for modeling complex, multi-factor geological hazards.
(3): Feature importance analysis identifies elevation, land cover, and annual NDVI as the most highly influential factors, highlighting the combined effects of topographic conditions, anthropogenic disturbances, and vegetation cover on landslide susceptibility.

Author Contributions

Conceptualization, J.L. and R.W.; methodology, J.L.; software, J.L., W.S. and L.Y.; validation, J.L. and F.L.; writing—original draft preparation, J.L.; writing—review and editing, J.L., F.L., J.W. and K.X.; visualization, J.L., F.L., J.W. and K.X.; supervision, J.L. and R.W.; project administration, R.W., W.S. and L.Y.; funding acquisition, R.W. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by An Impact Assessment and Biodiversity Conservation Study on Habitat Quality of Typical Wildlife Taxa under the “Liangjiang-Sihe” Afforestation and Greening Project (XZ202501YD0016) and the National Natural Science Foundation of China (41971376).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Wei Shi was employed by the company Beijing Ocean Forestry Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Cruden, D. A simple definition of a landslide. Bull. Eng. Geol. Environ. 1991, 43, 27–29. [Google Scholar] [CrossRef]
Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
Pourghasemi, H.R.; Teimoori Yansari, Z.; Panagos, P.; Pradhan, B. Analysis and evaluation of landslide susceptibility: A review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016). Arab. J. Geosci. 2018, 11, 193. [Google Scholar] [CrossRef]
Das, S.; Sarkar, S.; Kanungo, D.P. GIS-based landslide susceptibility zonation mapping using the analytic hierarchy process (AHP) method in parts of Kalimpong Region of Darjeeling Himalaya. Environ. Monit. Assess. 2022, 194, 234. [Google Scholar] [CrossRef]
Bahrami, Y.; Hassani, H.; Maghsoudi, A. Landslide susceptibility mapping using AHP and fuzzy methods in the Gilan province, Iran. GeoJournal 2021, 86, 1797–1816. [Google Scholar] [CrossRef]
Farooq, S.; Akram, M.S. Landslide susceptibility mapping using information value method in Jhelum Valley of the Himalayas. Arab. J. Geosci. 2021, 14, 824. [Google Scholar] [CrossRef]
Chen, L.; Guo, H.; Gong, P.; Yang, Y.; Zuo, Z.; Gu, M. Landslide susceptibility assessment using weights-of-evidence model and cluster analysis along the highways in the Hubei section of the Three Gorges Reservoir Area. Comput. Geosci. 2021, 156, 104899. [Google Scholar] [CrossRef]
Shano, L.; Raghuvanshi, T.K.; Meten, M. Landslide susceptibility mapping using frequency ratio model: The case of Gamo highland, South Ethiopia. Arab. J. Geosci. 2021, 14, 632. [Google Scholar] [CrossRef]
Pourghasemi, H.R.; Sadhasivam, N.; Amiri, M.; Eskandari, S.; Santosh, M. Landslide susceptibility assessment and mapping using state-of-the art machine learning techniques. Nat. Hazards 2021, 108, 1291–1316. [Google Scholar] [CrossRef]
Marjanović, M.; Kovačević, M.; Bajat, B.; Voženílek, V. Landslide susceptibility assessment using SVM machine learning algorithm. Eng. Geol. 2011, 123, 225–234. [Google Scholar] [CrossRef]
Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
Trigila, A.; Iadanza, C.; Esposito, C.; Scarascia-Mugnozza, G. Comparison of Logistic Regression and Random Forests techniques for shallow landslide susceptibility assessment in Giampilieri (NE Sicily, Italy). Geomorphology 2015, 249, 119–136. [Google Scholar] [CrossRef]
Sun, D.; Wu, X.; Wen, H.; Gu, Q. A LightGBM-based landslide susceptibility model considering the uncertainty of non-landslide samples. Geomat. Nat. Hazards Risk 2023, 14, 2213807. [Google Scholar] [CrossRef]
L’heureux, A.; Grolinger, K.; Elyamany, H.F.; Capretz, M.A. Machine learning with big data: Challenges and approaches. IEEE Access 2017, 5, 7776–7797. [Google Scholar] [CrossRef]
Azarafza, M.; Azarafza, M.; Akgün, H.; Atkinson, P.M.; Derakhshani, R. Deep learning-based landslide susceptibility mapping. Sci. Rep. 2021, 11, 24112. [Google Scholar] [CrossRef]
Wang, Y.; Fang, Z.; Hong, H. Comparison of convolutional neural networks for landslide susceptibility mapping in Yanshan County, China. Sci. Total Environ. 2019, 666, 975–993. [Google Scholar] [CrossRef]
Isazade, V.; Qasimi, A.b.; Namivandi, M.S.; Amin, M.S.; Ahlem, G. Landslide Susceptibility Assessment Using Recurrent Neural Network (RNN)—A Case of Chabahar and Konarak in Iran. Indian Geotech. J. 2025, 55, 1–21. [Google Scholar] [CrossRef]
Xiao, L.; Zhang, Y.; Peng, G. Landslide susceptibility assessment using integrated deep learning algorithm along the China-Nepal highway. Sensors 2018, 18, 4436. [Google Scholar] [CrossRef] [PubMed]
Liu, T.; Chen, T.; Niu, R.; Plaza, A. Landslide detection mapping employing CNN, ResNet, and DenseNet in the three gorges reservoir, China. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 11417–11428. [Google Scholar] [CrossRef]
Ghorbanzadeh, O.; Crivellari, A.; Ghamisi, P.; Shahabi, H.; Blaschke, T. A comprehensive transferability evaluation of U-Net and ResU-Net for landslide detection from Sentinel-2 data (case study areas from Taiwan, China, and Japan). Sci. Rep. 2021, 11, 14629. [Google Scholar] [CrossRef]
Arik, S.Ö.; Pfister, T. Tabnet: Attentive interpretable tabular learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; pp. 6679–6687. [Google Scholar]
Yu, C.; Jin, Y.; Xing, Q.; Zhang, Y.; Guo, S.; Meng, S. Advanced user credit risk prediction model using lightgbm, xgboost and tabnet with smoteenn. In Proceedings of the 2024 IEEE 6th International Conference on Power, Intelligent Computing and Systems (ICPICS), Shenyang, China, 26–28 July 2024; pp. 876–883. [Google Scholar]
Joseph, L.P.; Joseph, E.A.; Prasad, R. Explainable diabetes classification using hybrid Bayesian-optimized TabNet architecture. Comput. Biol. Med. 2022, 151, 106178. [Google Scholar] [CrossRef] [PubMed]
Yan, J.; Xu, T.; Yu, Y.; Xu, H. Rainfall forecast model based on the tabnet model. Water 2021, 13, 1272. [Google Scholar] [CrossRef]
Shah, C.; Du, Q.; Xu, Y. Enhanced TabNet: Attentive interpretable tabular learning for hyperspectral image classification. Remote Sens. 2022, 14, 716. [Google Scholar] [CrossRef]
Zhu, Y.; Liu, S.; Yin, K.; Zeng, T.; Guo, Z.; Liu, Z.; Yang, H. Impact of negative sampling strategies on landslide susceptibility assessment. Adv. Space Res. 2025, 76, 592–613. [Google Scholar] [CrossRef]
Fotheringham, A.S.; Brunsdon, C.; Charlton, M. Geographically weighted regression. Sage Handb. Spat. Anal. 2009, 1, 243–254. [Google Scholar]
Li, Y.; Liu, X.; Han, Z.; Dou, J. Spatial proximity-based geographically weighted regression model for landslide susceptibility assessment: A case study of Qingchuan area, China. Appl. Sci. 2020, 10, 1107. [Google Scholar] [CrossRef]
Xueqiang, G.; Chuanjie, X.; Xiewen, H.; Yayun, H.; Yonghao, Z.; Yu, Z. Landslide susceptibility assessment and zonation using negative sampling strategy: A case study of Bazhong area, Sichuan Province. Chin. J. Geol. Hazard Control 2025, 36, 146. [Google Scholar]
McCool, D.; Foster, G.; Weesies, G. Slope length and steepness factors (LS). In Predicting Soil Erosion by Water: A Guide to Conservation Planning with the Revised Universal Soil Loss Equation (RUSLE); US Department of Agriculture: Washington, DC, USA, 1997; Volume 703, pp. 101–141. [Google Scholar]
Riley, S.J.; DeGloria, S.D.; Elliot, R. Index that quantifies topographic heterogeneity. Intermt. J. Sci. 1999, 5, 23–27. [Google Scholar]
Wilson, J.P.; Gallant, J.C. Primary topographic attributes. In Terrain Analysis-Principles and Application; John Wiley & Sons: Hoboken, NJ, USA, 2000; pp. 51–86. [Google Scholar]
Zevenbergen, L.W.; Thorne, C.R. Quantitative analysis of land surface topography. Earth Surf. Process. Landf. 1987, 12, 47–56. [Google Scholar] [CrossRef]
Beven, K.J.; Kirkby, M.J. A physically based, variable contributing area model of basin hydrology/Un modèle à base physique de zone d’appel variable de l’hydrologie du bassin versant. Hydrol. Sci. J. 1979, 24, 43–69. [Google Scholar] [CrossRef]
Moore, I.D.; Grayson, R.; Ladson, A. Digital terrain modelling: A review of hydrological, geomorphological, and biological applications. Hydrol. Process. 1991, 5, 3–30. [Google Scholar] [CrossRef]
Rouse, J.; Haas, R.; Schell, J.; Deering, D. Monitoring vegetation systems in the great plains with ERTS proceeding. In Proceedings of the Third Earth Reserves Technology Satellite Symposium, Washington, DC, USA, 1 January 1974; p. 317. [Google Scholar]
Yu, X.; Chen, H. Research on the influence of different sampling resolution and spatial resolution in sampling strategy on landslide susceptibility mapping results. Sci. Rep. 2024, 14, 1549. [Google Scholar] [CrossRef] [PubMed]
Meena, S.R.; Gudiyangada Nachappa, T. Impact of spatial resolution of digital elevation model on landslide susceptibility mapping: A case study in Kullu Valley, Himalayas. Geosciences 2019, 9, 360. [Google Scholar] [CrossRef]
Jacobs, L.; Kervyn, M.; Reichenbach, P.; Rossi, M.; Marchesini, I.; Alvioli, M.; Dewitte, O. Regional susceptibility assessments with heterogeneous landslide information: Slope unit-vs. pixel-based approach. Geomorphology 2020, 356, 107084. [Google Scholar] [CrossRef]
Tian, S.; Zhang, S.; Tang, Q.; Fan, X.; Han, P. Comparative study of landslide susceptibility assessment based on different evaluation units. J. Nat. Disasters 2019, 2019, 137–145. [Google Scholar]
Shang, H.; Ni, W.; Cheng, H. Application of slope unit division to risk zoning of geological hazards of Pengyang County. Soil Water Conserv. China 2011, 3, 48–50. [Google Scholar]
Liu, Y.; Zhang, W.; Zhang, Z.; Xu, Q.; Li, W. Risk factor detection and landslide susceptibility mapping using Geo-Detector and Random Forest Models: The 2018 Hokkaido eastern Iburi earthquake. Remote Sens. 2021, 13, 1157. [Google Scholar] [CrossRef]
Wang, J.-F.; Zhang, T.-L.; Fu, B.-J. A measure of spatial stratified heterogeneity. Ecol. Indic. 2016, 67, 250–256. [Google Scholar] [CrossRef]
Gu, T.; Li, J.; Wang, M.; Duan, P.; Zhang, Y.; Cheng, L. Study on landslide susceptibility mapping with different factor screening methods and random forest models. PLoS ONE 2023, 18, e0292897. [Google Scholar] [CrossRef]
Zhang, Q.; He, Y.; Chen, X.; Gao, B.; Zhang, L.; Zhang, Z.; Lu, J.; Zhang, Y. Landslide susceptibility assessment in Shenzhen based on multi-scale convolutional neural networks model. Chin. J. Geol. Hazard Control 2024, 35, 146–162. [Google Scholar]
Zou, K.H.; Tuncali, K.; Silverman, S.G. Correlation and simple linear regression. Radiology 2003, 227, 617–628. [Google Scholar] [CrossRef]
Krehbiel, T.C. Correlation Coefficient Rule of Thumb. Decis. Sci. J. Innov. Educ. 2004, 2, 97. [Google Scholar] [CrossRef]
Jenks, G.F. The data model concept in statistical mapping. Int. Yearb. Cartogr. 1967, 7, 186–190. [Google Scholar]
An, B.; Zhang, Z.; Xiong, S.; Zhang, W.; Yi, Y.; Liu, Z.; Liu, C. Landslide Susceptibility Mapping Based on Ensemble Learning in the Jiuzhaigou Region, Sichuan, China. Remote Sens. 2024, 16, 4218. [Google Scholar] [CrossRef]
Arabameri, A.; Saha, S.; Roy, J.; Chen, W.; Blaschke, T.; Tien Bui, D. Landslide susceptibility evaluation and management using different machine learning methods in the Gallicash River Watershed, Iran. Remote Sens. 2020, 12, 475. [Google Scholar] [CrossRef]
Meier, C.; Jaboyedoff, M.; Derron, M.-H.; Gerber, C. A method to assess the probability of thickness and volume estimates of small and shallow initial landslide ruptures based on surface area. Landslides 2020, 17, 975–982. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 2017, 30, 52. [Google Scholar]
Hinton, G.E.; Osindero, S.; Teh, Y.-W. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Refaeilzadeh, P.; Tang, L.; Liu, H. Cross-validation. In Encyclopedia of Database Systems; Springer: New York, NY, USA, 2009; pp. 532–538. [Google Scholar]
Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer: New York, NY, USA, 2013; Volume 26. [Google Scholar]
Dai, X.; Zhu, Y.; Sun, K.; Zou, Q.; Zhao, S.; Li, W.; Hu, L.; Wang, S. Examining the spatially varying relationships between landslide susceptibility and conditioning factors using a geographical random forest approach: A case study in Liangshan, China. Remote Sens. 2023, 15, 1513. [Google Scholar] [CrossRef]
Brenning, A. Spatial prediction models for landslide hazards: Review, comparison and evaluation. Nat. Hazards Earth Syst. Sci. 2005, 5, 853–862. [Google Scholar] [CrossRef]
Cohen, J. A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
Glenn, W.B. Verification of forecasts expressed in terms of probability. Mon. Weather. Rev. 1950, 78, 1–3. [Google Scholar]
Landis, J.R.; Koch, G.G. The measurement of observer agreement for categorical data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed]
Getis, A.; Ord, J.K. The analysis of spatial association by use of distance statistics. Geogr. Anal. 1992, 24, 189–206. [Google Scholar] [CrossRef]
Manap, N.; Borhan, M.N.; Yazid, M.R.M.; Hambali, M.K.A.; Rohan, A. Determining spatial patterns of road accidents at expressway by applying Getis-Ord Gi* spatial statistic. Int. J. Recent Technol. Eng. 2019, 8, 345–350. [Google Scholar] [CrossRef]
Guo, Z.; Tian, B.; Zhu, Y.; He, J.; Zhang, T. How do the landslide and non-landslide sampling strategies impact landslide susceptibility assessment?—A catchment-scale case study from China. J. Rock Mech. Geotech. Eng. 2024, 16, 877–894. [Google Scholar]
Kavzoglu, T.; Sahin, E.K.; Colkesen, I. Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression. Landslides 2014, 11, 425–439. [Google Scholar] [CrossRef]
Dou, H.; He, J.; Huang, S.; Jian, W.; Guo, C. Influences of non-landslide sample selection strategies on landslide susceptibility mapping by machine learning. Geomat. Nat. Hazards Risk 2023, 14, 2285719. [Google Scholar] [CrossRef]
Xiaoting, Z.; Faming, H.; Weicheng, W.; Chuangbing, Z.; Shiyi, Z.; Lihan, P. Regional Landslide Susceptibility Prediction Based on Negative Sample Selected by Coupling Information Value Method. Adv. Eng. Sci./Gongcheng Kexue Yu Jishu 2022, 54, 25. [Google Scholar]
Wang, Y.; Feng, L.; Li, S.; Ren, F.; Du, Q. A hybrid model considering spatial heterogeneity for landslide susceptibility mapping in Zhejiang Province, China. Catena 2020, 188, 104425. [Google Scholar] [CrossRef]
Huang, F.; Chen, J.; Liu, W.; Huang, J.; Hong, H.; Chen, W. Regional rainfall-induced landslide hazard warning based on landslide susceptibility mapping and a critical rainfall threshold. Geomorphology 2022, 408, 108236. [Google Scholar] [CrossRef]

Figure 1. Geographical location of the study area.

Figure 2. Landslide susceptibility assessment indicators: (a) elevation; (b) slope; (c) aspect; (d) LS; (e) TRI; (f) plan curvature; (g) profile curvature; (h) TWI; (i) SPI; (j) lithology; (k) distance to faults; (l) earthquake density; (m) annual precipitation; (n) annual NDVI; (o) distance to rivers; (p) land cover; (q) distance to roads.

Figure 3. (a) Results of q-value calculation; (b) correlation analysis results.

Figure 4. Workflow of landslide susceptibility assessment.

Figure 5. (a) Workflow of GWR-based negative sample construction; (b) workflow of landslide susceptibility modeling.

Figure 6. The architecture of TabNet.

Figure 7. Results of GWR: (a) elevation; (b) slope; (c) LS; (d) lithology; (e) annual NDVI; (f) distance to rivers; (g) land cover; (h) distance to roads; (i) composite explanatory index.

Figure 8. Spatial distribution of negative samples: (a) negative samples constructed based on GWR; (b) negative samples constructed based on random sampling.

Figure 9. ROC curves of different models: (a) RF; (b) LightGBM; (c) DNN; (d) ResNet; (e) TabNet.

Figure 10. Landslide susceptibility zonation statistics based on TabNet.

Figure 11. TabNet landslide susceptibility prediction.

Figure 12. Hotspot analysis results.

Figure 13. Feature importance scores produced by TabNet (unitless and normalized to sum to 1).

Table 1. Data sources.

Data Type	Data Name	Resolution	Data Source
Historical Landslide Point Data	Spatial Distribution Data of Geological Hazard Points in China	-	Resource and Environmental Science Data Platform
Elevation Data	ASTER GDEM Data of Ya’an City	30 m	Geospatial Data Cloud site
LS Data	Slope Length and Steepness Factor Dataset for the 20 Countries of the Pan-Third Pole Region	7.5 arc-s	National Tibetan Plateau Data Center
Lithology Data	Spatial Lithology Distribution Data of China	7.5 arc-s	Resource and Environmental Science Data Platform
Fault Vector Data	Vector Data of Active Faults in China	-	Seismic Active Fault Survey Data Center
Earthquake Epicenter Data	Global Earthquake Data (2000–2022)	-	United States Geological Survey
Annual Precipitation Data	Annual Precipitation Data of China (2000–2022)	1 km	National Earth System Science Data Center
River and Road Vector Data	Fundamental Geographic Information Data of China	-	National Geomatics Center of China
Annual NDVI Data	Annual Average NDVI Data of China (2000–2022)	1 km	Resource and Environmental Science Data Platform
Land Cover Data	Land Cover Data of China	30 m	National Geomatics Center of China

Table 2. Formulas and definitions of evaluation metrics.

Metric	Formula *	Definition
ACC	$A C C = \frac{T P + T N}{T P + T N + F P + F N}$	Proportion of correctly identified landslide and non-landslide units.
RE	$R E = \frac{T P}{T P + F N}$	Proportion of actual landslide units that are correctly classified as landslides.
ROC	Plot of TPR vs. FPR	Reflects the model’s ability to distinguish landslide from non-landslide areas; the closer the curve is to the top-left corner, the better the performance.
AUC	Area under the ROC curve	Quantifies the model’s discrimination ability; values closer to 1 indicate stronger performance.
Kappa	$K a p p a = \frac{p_{0} - p_{e}}{1 - p_{e}}$	Measures agreement between the true classes and the classifications; values closer to 1 indicate stronger consistency, while values near 0 or negative suggest performance equivalent to random guessing, or indicate systematic bias.
Brier Score	$B r i e r S c o r e = \frac{1}{N} \sum_{i = 1}^{N} {(f_{i} - o_{i})}^{2}$	Measures the mean squared difference between predicted probabilities and actual outcomes; lower scores reflect more reliable probabilistic predictions [62].

* TP (True Positive) and TN (True Negative) represent correctly predicted landslides and non-landslides, respectively, while FP (False Positive) and FN (False Negative) denote misclassified non-landslides and landslides. In the Kappa coefficient,

p_{0}

denotes the observed agreement between predicted and actual labels, and

p_{e}

denotes the expected agreement by chance. For the Brier Score,

N

is the total number of samples,

f_{i}

is the predicted probability of landslide occurrence for sample

i

, and

o_{i}

is the actual observed outcome.

Table 3. AUC values * of models based on two negative sample selection methods.

	Negative Samples Selected Based on Random Sampling	Negative Samples Selected Based on GWR
RF	0.9300	0.9661
LightGBM	0.9333	0.9722
DNN	0.9022	0.9632
ResNet	0.9029	0.9618
TabNet	0.9456	0.9828

* Results are presented as the average of 10-fold cross-validation.

Table 4. Performance metrics * of different models.

	ACC	RE	AUC	Kappa	Brier Score
RF	0.9281	0.9489	0.9661	0.8558	0.0601
LightGBM	0.9207	0.9195	0.9722	0.8409	0.0614
DNN	0.9046	0.8974	0.9632	0.8083	0.0720
ResNet	0.8992	0.8903	0.9618	0.7977	0.0735
TabNet	0.9298	0.9196	0.9828	0.8589	0.0521

* Results are presented as the average of 10-fold cross-validation.

Table 5. Statistical results of landslide susceptibility zonation using TabNet.

Susceptibility Level	Landslide Count	Landslide Proportion (%)	Area (km²)	Area Proportion (%)	Landslide Density (Events/km²)
Very Low	22	1.49	6716.32	45.00	0.0033
Low	80	5.40	2137.48	14.32	0.0374
Moderate	136	9.18	1481.74	9.93	0.0918
High	156	10.53	1082.48	7.25	0.1441
Very High	1087	73.40	3506.67	23.50	0.3100

Table 6. Statistics of hot- and cold spots based on Gi Bin confidence levels.

Gi Bin Value *	p-Value	Meaning	Count	Proportion (%)
3	p < 0.01	99% Confidence Hotspot	349	23.57
2	0.01 ≤ p < 0.05	95% Confidence Hotspot	376	25.39
1	0.05 ≤ p < 0.10	90% Confidence Hotspot	87	5.87
0	p ≥ 0.10	Not Significant	428	28.9
−1	0.05 ≤ p < 0.10	90% Confidence Cold Spot	12	0.81
−2	0.01 ≤ p < 0.05	95% Confidence Cold Spot	41	2.77
−3	p < 0.01	99% Confidence Cold Spot	188	12.69

* The Gi Bin value is a categorical representation of the statistical significance of spatial clustering derived from the Getis-Ord Gi* statistic [64].

Table 7. Distribution of hot- and cold spots across landslide susceptibility levels.

Susceptibility Level	Landslide Count	Hotspot Count	Hotspot Proportion (%)	Cold Spot Count	Cold Spot Proportion (%)
Very Low	22	0	0	20	90.91
Low	80	0	0	75	93.75
Moderate	136	5	3.68	102	75
High	156	16	10.26	26	16.67
Very High	1087	791	72.77	18	1.66

Table 8. TabNet hyperparameter sensitivity analysis and optimal settings.

Hyperparameter	Description	Tested Range	Optimal Value
n_d	Dimension of the decision layer	[16, 32, 64]	64
n_a	Dimension of the attentive transformer layer	[16, 32, 64]	64
n_steps	Number of decision steps	[3, 5, 7]	3
gamma	Feature reuse penalty coefficient	1.0, 1.2, 1.5	1.2
lambda_sparse	Weight of sparsity regularization	1 × 10⁻⁴, 1 × 10⁻³, 1 × 10⁻²	1 × 10⁻²

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, J.; Wang, R.; Shi, W.; Yang, L.; Wei, J.; Liu, F.; Xiong, K. Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet. Remote Sens. 2025, 17, 2678. https://doi.org/10.3390/rs17152678

AMA Style

Li J, Wang R, Shi W, Yang L, Wei J, Liu F, Xiong K. Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet. Remote Sensing. 2025; 17(15):2678. https://doi.org/10.3390/rs17152678

Chicago/Turabian Style

Li, Jiatian, Ruirui Wang, Wei Shi, Le Yang, Jiahao Wei, Fei Liu, and Kaiwei Xiong. 2025. "Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet" Remote Sensing 17, no. 15: 2678. https://doi.org/10.3390/rs17152678

APA Style

Li, J., Wang, R., Shi, W., Yang, L., Wei, J., Liu, F., & Xiong, K. (2025). Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet. Remote Sensing, 17(15), 2678. https://doi.org/10.3390/rs17152678

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Landslide Susceptibility Assessment in Ya’an Based on Coupling of GWR and TabNet

Abstract

1. Introduction

2. Study Area and Data Sources

2.1. Study Area

2.2. Data Sources

2.3. Division of Evaluation Units and Selection of Susceptibility Indicators

3. Method

3.1. Negative Sample Construction Based on GWR

3.1.1. GWR Modeling Principle

3.1.2. Identification of Low-Explanatory Areas and Selection of Negative Samples

3.2. Landslide Susceptibility Modeling Based on TabNet

3.2.1. Overview of TabNet

3.2.2. Cross-Validation

3.2.3. Model Performance Evaluation Metrics

4. Results

4.1. Comparison of Results Under Different Negative Sample Construction Strategies

4.2. Comparison of Landslide Susceptibility Modeling Results Across Different Models

4.3. Landslide Susceptibility Prediction and Feature Importance Analysis Based on TabNet

4.3.1. Landslide Susceptibility Prediction

4.3.2. Feature Importance Analysis

5. Discussion

5.1. Rationale and Limitations of GWR-Based Negative Sample Construction

5.2. Evaluating TabNet’s Robustness and Applicability in Susceptibility Modeling

5.3. Relevance and Limitations of Susceptibility Indicators

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI