Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment

Yan, Xiao; Zhang, Dongshui; Han, Yongshun; Li, Tongsheng; Zhong, Pin; Ning, Zhe; Tan, Shirou

doi:10.3390/ijgi14070277

Open AccessArticle

Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment

by

Xiao Yan

¹

,

Dongshui Zhang

^1,*,

Yongshun Han

¹,

Tongsheng Li

²,

Pin Zhong

¹,

Zhe Ning

¹ and

Shirou Tan

¹

School of Earth Sciences and Spatial Information Engineering, Hunan University of Science and Technology, Xiangtan 411201, China

²

Hunan Institute of Geological Disaster Investigation and Monitoring, Changsha 410004, China

^*

Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2025, 14(7), 277; https://doi.org/10.3390/ijgi14070277

Submission received: 17 April 2025 / Revised: 10 June 2025 / Accepted: 26 June 2025 / Published: 16 July 2025

(This article belongs to the Special Issue Advances in Remote Sensing and GIS for Natural Hazards Monitoring and Management)

Download

Browse Figures

Versions Notes

Abstract

Landslide is one of the most damaging natural hazards, causing extensive damage to the infrastructure and threatening human life. Although advances have been made in landslide susceptibility assessment by objective explainable machine learning, the interpretability robustness of traditional single landslide susceptibility model is still low. The proposed interpretable hybrid model in this study overcomes these challenges and aims to enhance the stability of landslide susceptibility interpretability. The model integrates three base machine learning models—LightGBM, XGBoost, and Random Forest—using a heterogeneous category strategy, thereby enhancing the robustness of model interpretability. The hybrid model is interpreted using SHAP (Shapley Additive Explanations) values, which quantify feature contributions. A 10-fold cross-validation with the coefficient of variation (CV) metric reveals that the hybrid model outperforms individual base models in terms of interpretive robustness, yielding a lower CV value of 0.175 compared to 0.208 for LightGBM, 0.240 for XGBoost, and 0.207 for the Random Forest model. Although predictive accuracy remains comparable to the baseline models, the hybrid model provides more stable and reliable interpretability results for landslide susceptibility. It identifies the slope, elevation, and LS factor as the three most important factors for landslide susceptibility in Xi’an city. Furthermore, the quantitative nonlinear relationships between these predisposing factors and susceptibility were identified, providing empowering knowledge for the landslides risk prevention and urban planning in the regions vulnerable to landslides.

Keywords:

landslide susceptibility assessment; machine learning models; Shapley Additive Explanations; robustness of model interpretability

1. Introduction

Landslides, which involve the downward movement of rock, soil, and organic material due to gravity, are considered one of the most destructive geohazards [1]. They are among the most dangerous natural disasters, often causing significant damage to infrastructure such as homes, roads, and bridges, and posing severe risks to human life [2]. According to the United Nations Development Program, landslides rank as the second-most frequent geological hazard worldwide, leading to substantial economic losses each year [3]. The World Health Organization (WHO) reports that between 1998 and 2017, landslides affected over 4.8 million people and resulted in more than 18,000 fatalities [2]. Unfortunately, global warming is expected to increase the frequency and intensity of extreme rainfall events, thereby raising the number of people exposed to landslide hazards [4]. The prevailing view is that the most effective approach to minimizing landslide risk lies in the reliable monitoring, assessment, and identification of landslide-prone areas [3]. Landslide susceptibility refers to the likelihood of a landslide occurring under specific topographical and environmental conditions. Understanding this susceptibility is crucial for identifying areas at risk of landslides and is an effective non-engineering method for mitigating landslide-related disasters [5,6].

At present, the models for predicting landslide susceptibility mainly include three types: physically based models, statistical models, and machine learning models [7]. The physically based models is a mechanism-oriented technology, such as the infinite slope stability model [5]. Although physical models can provide reasonably accurate results, the materially intensive nature of their input parameters has limited the applicability of physically based models to smaller spatial extents [8,9]. Landslide susceptibility refers to the likelihood of a landslide occurring of a certain magnitude in a region and is traditionally estimated based on statistical methods that quantify the empirical relationships between environmental conditioning factors and the historical location of landslides [10]. Using these relationships, we can infer patterns of susceptibility in other areas with similar characteristics. Statistical models can be more economical compared to physically based models, thus allowing for wider regional deployment [11]. However, these statistical methods, such as frequency ratio (FR) and the Weights of Evidence (WoE), are generally assumption-driven and could not appropriately describe the nonlinear relationships between the factors with susceptibility, and often requiring intricate combinations of input parameters [12,13]. These machine learning algorithms allow models to describe the nonlinear relationships between predictive variables and landslides, as mentioned above, resulting in better predictive performance [14]. The popular machine learning methods used in landslide susceptibility models include the support vector machine [15], artificial neural networks [16], the Random Forests (RF) [17]. Moreover, several ensemble learning models and deep learning techniques, including Gradient Boosting Decision Trees (GBDT) [18], AdaBoost [19], and Convolutional Neural Networks [20], have also been leveraged for susceptibility prediction.

Predicting landslides is necessary for disaster prevention. Machine learning model processes are expected to be optimized to allow for effective early warning and forecasting of disasters. Nevertheless, models based on machine learning are usually considered to be a “black box” and cannot explain how terms affect susceptibility [21]. To overcome this, there is an increasing research focus on the fundamental techniques of machine learning interpretability [22], also known as Explainable Artificial Intelligence (XAI). The most popular method regarding this case is Shapley Additive Explanations (SHAP), which is based on game theory; it can assign each feature a quantifiable influence and this method can therefore offer detailed explanation in local and global level about a model prediction [23]. Although SHAP has been successfully used to quantify relationships between factors and outcomes prediction of landslide susceptibility as shown by Zhang et al. and Pradhan et al. [24,25], little focus has been placed on the robustness of these explanations. However, the interpretation yielded by the SHAP method is not stable under parameterization and input deviation in the model [26]. Consequently, there is growing concern in the field to make model explanations more robust. Recently, Bommer et al. introduced a framework for evaluating XAI methods to assist the selection of appropriate XAI techniques to specific research objectives [27]. However, they did not specifically argue that those choices explain better than the explanatory model they were inserted into the context but simply that there are other choices. Recent studies have shown that there are ensemble explanatory outcomes when different machine learning models are used together. Thus, it results in more robust and reliable model interpretations compared to interpretations based on a single model [28]. Motivated by this, developing an interpretable hybrid model can be a promising direction to improve the robustness of landslide susceptibility interpretations.

This study addresses the issue of poor robustness in landslide susceptibility explanations using traditional single models by proposing an interpretable hybrid model to bridge this gap. We introduce an effective and manageable heterogeneous category strategy to integrate three base machine learning models: LightGBM, XGBoost, and the Random Forest (RF) model. The objectives of this study are (1) to conduct a comparative robustness analysis of different landslide susceptibility models based on SHAP, (2) to develop a hybrid model using the heterogeneous category strategy to enhance the robustness of model explanations, and (3) to quantify the nonlinear relationships between landslide susceptibility and influencing factors in Xi’an using the developed hybrid model.

2. Materials

2.1. Study Area

Xi’an is the capital of Shaanxi Province. The city lies between 108°50′ E to 109°10′ E longitude and 33°30′ N to 34°30′ N latitude (Figure 1). Xi’an experiences a continental monsoon climate, with four distinct seasons. The city is intersected by eight rivers, including the Jingjiang, Changjiang, Juehe, and Weihe rivers, the latter being the primary source of drinking water [29]. The altitude within the city varies from 336 m to 3748 m, with the southern region being mountainous and the northern region predominantly flat. The annual precipitation ranges from 500 to 700 mm [30], with the average annual temperature recorded at 14.08 °C. January temperatures hover near the freezing point, while July sees an average of 27.0 °C. The lithology is primarily composed of silty clay, loess, sand, and gravel. These materials have a complex origin, predominantly derived from alluvial, diluvial, and aeolian depositional processes [31]. Most rainfall occurs between July and October, a period during which the southern mountainous areas are prone to geological hazards [32]. Despite these challenges, Xi’an is home to a wealth of historical and cultural heritage, with numerous ancient relics, tombs, and traditional structures, including the Terracotta Army, the Han Dynasty Chang’an City, and the Tang Dynasty Daming Palace. [33]. The threat of landslide hazards to both historical sites and urban development underscores the need for comprehensive landslide susceptibility assessments and analysis of driving factors within the region.

2.2. Data

2.2.1. Landslide Inventory

A landslide inventory serves as the foundation for assessing landslide susceptibility and plays a crucial role in the accurate evaluation and efficient management of landslide risks [22]. For this study, the landslide inventory data were acquired from the Resource and Environmental Science Data Platform (https://www.resdc.cn/, accessed on 3 February 2024), and the accuracy of the landslide information has been verified [34]. This dataset records the spatial distribution of landslide geohazards across China from 1949 to 2011 [35]. In the study area, 347 landslide points were detected. An equal number of points, which were not associated with landslides, were randomly selected within the study area. In the machine learning model, for landslide points (i.e., positive samples), the value was set to 1, while for non-landslide points (i.e., negative samples), the value was set to 1. Finally, we correctly combined the 70% positive and negative samples into a training dataset (243 landslide points and 243 non-landslide points) and the remaining 30% as the validation dataset (104 landslide points and 104 non-landslide points).

2.2.2. Landslide Conditioning Factors

Landslides are influenced by factors such as topography, geology, hydrology, meteorological conditions, vegetation cover, and human activities. Fifteen conditioning factors were selected for use in landslide susceptibility modeling, as presented in Figure 2 and listed in Table 1. These factors include the following: meteorological conditions represented by average annual rainfall (AR); topography represented by elevation, LS factor (LS), convergence index (CI), plan curvature (PLC), relative slope position (RSP), profile curvature (PRC), and slope; geological categories represented by lithology; human activities represented by land use; hydrological conditions represented by the Topographic Wetness Index (TWI) and distance to river (DR); and vegetation cover represented by the Normalized Difference Vegetation Index (NDVI) and soil. These factors are frequently employed in landslide susceptibility predictions with in-depth information provided in the studies of Guo and Sharma et al. [36,37]. Since the original spatial resolution of the DEM data was 90 m × 90 m, each factor was converted to the same resolution to retain original data accuracy as much as possible [38].

3. Methodology

Figure 3 illustrates the framework developed to improve robustness of the interpretability of landslide susceptibility modeling. Firstly, 15 conditioning factors were chosen based on domain expertise and their spatial relevance. The feature selection followed using Pearson correlation analysis and the Information Gain Ratio (IGR) to mitigate multicollinearity and preserve the most informative variables. Three machine learning models—LightGBM, XGBoost, and Random Forest—were trained independently and then combined using a heterogeneous ensemble approach, weighted by each model’s Area Under the Curve (AUC), to create a hybrid model. To facilitate the interpretable analysis, Shapley Additive Explanations (SHAP) were applied to quantify the contribution of each feature. Lastly, a 10-fold cross-validation scheme was applied, and the coefficient of variation (CV) was used to evaluate the consistency of feature importance rankings across different folds.

3.1. Feature Selection Methods

3.1.1. Pearson Correlation Analysis

Usually, a Pearson correlation analysis should be performed before building a machine learning-based landslide susceptibility model to reduce multicollinearity [39] by removing the strongly correlated conditioning factors. Its linear correlation scale is based on the Pearson correlation coefficient among two variables (X, Y) in the statistics [40]. The Pearson correlation coefficient ranges from +1 to −1, with +1 representing a perfect positive linear correlation, 0 indicating no linear correlation, and −1 signifying a perfect negative linear correlation. To minimize multicollinearity and redundancy, we ensured that the pairwise correlation coefficients among conditioning factors did not exceed 0.8. The Pearson correlation coefficient formula is as follows:

r = \frac{1}{n - 1} \sum_{i = 1}^{n} (\frac{X_{i} - \bar{X}}{σ_{x}}) (\frac{Y_{i} - \bar{Y}}{σ_{y}}),

(1)

where t, n denotes the number of samples, X_i and Y_i refer to the individual data points indexed by i, and

X

and

\bar{Y}

represent the means of the samples. σ_x and σ_y correspond to the standard deviations of the respective samples.

3.1.2. Information Gain Rate

Given the different types of landslides and their varying responses to factors, it is crucial to identify and eliminate factors that exhibit low or no predictive value during the modeling process [41]. The Information Gain Ratio (IGR) is widely employed for selecting significant landslide factors. In the IGR approach, landslide driving factors with a large information gain value indicate that they have strong predictive ability [42]. Higher IGR for a factor can also be calculated on the previous study by Quinlan et al. [43].

3.2. Baseline Machine Learning Models

3.2.1. LightGBM Model

The LightGBM (Light Gradient Boosting Machine) algorithm, introduced by Microsoft, is an enhanced Gradient Boosting Decision Tree (GBDT) model [44]. Because of its superior accuracy, shorter execution time, and less memory exploit, it has been extensively applied in forecasting landslide susceptibility [45]. This algorithm minimizes the errors of continuous, factor-based decision trees to the largest extent. The specific features that greatly enhance learning efficiency include leaf-wise growth strategy and histogram-based node splitting [46]. LightGBM’s basic learner is a decision tree, which can be denoted as [47]

y = f (x) = \sum_{i = 1}^{T} α_{i} \cdot h_{i} (x)

(2)

where y represents the predicted value, x denotes the input features, T is the total number of trees,

α

_i is the weight of the ith tree, and h_i(x) refers to the prediction made by the ith tree.

3.2.2. XGBoost Model

XGBoost, introduced by Chen and Guestrin et al. [48], offers a scalable and efficient framework that integrates multiple decision trees to generate robust predictions, even in the presence of noisy and complex data [49]. Additionally, it employs a more regularized model formulation to mitigate overfitting, leading to improved performance [50]. This makes XGBoost particularly promising for applications in landslide hazard mapping and assessment. The general formulation for XGBoost prediction can be represented as

Y = \sum_{k = 1}^{k} f_{k} (X)

(3)

where Y is the predicted output, X represents the input features, K is the number of trees, and f_k(X) is the prediction from the kth tree.

3.2.3. Random Forest Model

Random Forest (RF) is a robust ensemble learning method that can be used for classification, regression tasks [51]. It can also be seen as a collection of random Decision Trees (DT). Typically, ensemble models outperform individual models, which is why multiple independently trained Decision Trees are combined to form the Random Forest [52]. RF constructs a large number of decision trees during the modeling process. The final classifier is derived by aggregating the outputs of these trees through majority voting [53]. This method has been successfully applied to landslide susceptibility modeling, demonstrating strong performance.

The optimization of parameter of al the models was performed by the Baysean optimization algorithm [54].

3.3. Construction of Interpretable Hybrid Model

3.3.1. Heterogeneous Category Strategy

To integrate the performance predictions of multiple models, ensemble modeling methods are widely adopted, and the resulting hybrid models typically offer better classification than individual models [55]. One such ensemble method, known as heterogeneous category, is used to assign performance weights to models, overcoming the drawbacks of simple averaging [56]. This method improves the weighted averaging approach based on the AUC (area under the curve) values and has been applied in disaster susceptibility modeling [12]. The ensemble method is created using the following formula:

E M = \frac{\sum_{i = 1}^{n} (A U C_{i} \cdot M_{i})}{\sum_{i = 1}^{n} A U C_{i}}

(4)

where EM represents the resulting ensemble model and AUC_i is the AUC value of the ith single model (M_i).

3.3.2. Shapley Additive Explanations

The practical advantage of applying machine learning in decision-making processes lies in the ability of machine learning models to enhance the accuracy of landslide predictions. Additionally, these models can be explained through interpretability techniques [57]. Recently, Shapley Additive Explanations (SHAP) have emerged as an effective method for interpreting the modeling processes of machine learning and deep learning models [58]. Additionally, SHAP facilitates the evaluation of factor interactions through the computation of boosted Shapley values, which provides global performance insights while preserving local accuracy [59]. The Shapley value is mathematically defined as

ϕ_{j} (ν) = \sum_{S \subseteq \{1, \dots, p\} \{j\}} \frac{|S|! (p - |S| - 1)!}{p!} (ν_{x} (S \cup \{j\}) - ν_{x} (S)),

(5)

where S represents a subset of the p features used by the model, x refers to the feature value vector of the instance under analysis, and v_x(S) denotes the prediction generated for the feature values in the subset S. Importantly, the interpretation of the hybrid model is grounded in a heterogeneous category strategy, which facilitates the creation of an interpretable hybrid model. The final SHAP values are obtained by aggregating the SHAP values of the three models, weighted in accordance with the heterogeneous category strategy.

3.4. The Evaluation for Model Performance and Interpretive Robustness

Model performance is evaluated using various statistical metrics, including the area under the receiver operating characteristic curve (AUC), accuracy, F1 score, recall, and precision. The AUC measures the model’s ability to distinguish between landslide and non-landslide areas. Accuracy indicates the overall correctness of predictions. Precision reflects the proportion of correctly predicted landslide instances among all predicted positives, while recall measures the proportion of actual landslides correctly identified. The F1 score, as the harmonic mean of precision and recall, provides a balanced measure of model performance in imbalanced datasets [60]. These allow the evaluation of the result of the model in relation to the test samples and the predicted value of true positive and false positive [61]. In particular, they are determined as

A U C = \frac{\sum T P + \sum T N}{P + N}

(6)

A c c u r a c y = (T P + T N) / N

(7)

Re c a l l = T P / (T P + F N)

(8)

F 1 = (2 \times \frac{T P}{T P + F P} \times \frac{T P}{T P + F N}) / (\frac{T P}{T P + F P} + \frac{T P}{T P + F N})

(9)

\Pr e c i s o n = T P / (T P + F P)

(10)

where P denotes the locations where landslides occur, while N refers to areas without landslides. True positives (TP) and true negatives (TN) represent correctly predicted landslide and no-landslide locations, respectively. False negatives (FN) and false positives (FP) refer to incorrectly predicted landslide and no-landslide locations. N represents the total number of samples.

To evaluate the robustness of model interpretation, this study employs the coefficient of variation (CV) to quantify the variability in factor rankings across 10-fold cross-validation runs. The CV is a statistical metric used to evaluate the level of variation in each indicator within a system. A higher CV indicates greater variability, suggesting lower interpretive robustness of the model. The CV is calculated as follows:

σ = \frac{1}{k} \sum_{i = 1}^{k} r_{i}

(11)

μ = \sqrt{\frac{1}{k} {\sum_{i = 1}^{k} (r_{i} - μ)}^{2}}

(12)

C V = \frac{σ}{μ}

(13)

where CV is the coefficient of variation,

σ

is the mean value,

μ

is the standard deviation of the rankings of a given factor r_i, and k is the number of folds.

Furthermore, McNemar’s chi-squared tests was employed to examine the statistical significance of differences between model outputs [62].

4. Results

4.1. Feature Selection

To reduce potential redundancy among predictive variables and improve both model efficiency and performance, we employed the information gain ratio (IGR) and Pearson correlation coefficients for optimal feature selection. Figure 4a shows the contributions of each factor according to their IGR values, with 12 out of 15 factors exhibiting IGR values greater than zero. Aspect, soil, and DR showed IGR values of zero, indicating a negligible influence on landslide occurrence in the study area. Consequently, these three factors were excluded from further analysis. Furthermore, as shown in Figure 4b, the pairwise correlation coefficients among the remaining 12 landslide conditioning factors were all below 0.8, with a maximum value of 0.79, indicating low multicollinearity and minimal redundant information. Accordingly, 12 factors were retained for the development of subsequent machine learning models.

4.2. Interpretive Robustness and Model Performance

The hybrid model is developed to enhance interpretability robustness while ensuring that predictive accuracy remains competitive. To this end, the coefficient of variation (CV) was used to quantify the robustness of feature rankings based on SHAP values under 10-fold cross-validation. A lower CV indicates greater consistency across folds. As shown in Figure 5, the proposed hybrid model demonstrates the highest robustness, with the lowest CV value of 0.175 (Figure 5d), whereas the three baseline models exhibit CVs exceeding 0.2, suggesting weaker consistency in their explanations. The boxplots further support this observation: only a single outlier is observed for the hybrid model, in contrast to multiple fluctuations in the baseline models, indicating that the hybrid model provides more reliable feature rankings. In terms of model accuracy (Table 2), five commonly used evaluation metrics were employed. While the hybrid model does not significantly outperform the baseline models, it maintains a comparable and stable level of accuracy. These findings suggest that the improved interpretability of the hybrid model is achieved without compromising predictive performance, supporting its practical applicability. Table 3 presents the results of McNemar’s chi-squared tests [62] applied to the susceptibility maps, organized in a symmetric matrix format. The analysis reveals statistically significant differences among the susceptibility outputs of different models, with all p-values below 0.05. This indicates that, despite similar overall model accuracies, the spatial patterns of susceptibility differ considerably, underscoring the necessity of ensemble modeling.

4.3. Landslide Susceptibility Mapping

Landslide susceptibility mapping is the direct output of the models and serves as a critical component of susceptibility assessment. It is therefore essential to validate the spatial patterns of susceptibility generated by the proposed hybrid model. Figure 6 presents the spatial distributions of landslide susceptibility in Xi’an, as predicted by the three baseline models and the hybrid model, while Table 4 summarizes the proportional areas assigned to each susceptibility class. It is worth noting that the natural breaks (Jenks) method was used to classify landslide susceptibility into discrete levels. As shown in Figure 6, the hybrid model produces a spatial pattern that is consistent with those of the baseline models and aligns well with the observed distribution of historical landslides, demonstrating its reliable predictive capability. Further insights from Table 3 reveal that, although the area proportions of susceptibility classes predicted by the hybrid model are generally comparable to those of the baseline models, some differences remain. Specifically, the hybrid model’s estimates for each susceptibility level tend to fall between those of the three baseline models. This intermediate positioning suggests improved generalization and reduced uncertainty of the hybrid model, relative to individual models.

4.4. Interpretation of Landslide Susceptibility

4.4.1. Global Interpretation

Determining the operational direction of each driving factor (Figure 7) and assessing their contributions (Figure 8) are crucial for understanding the landslide formation mechanisms in the study area. However, the explanatory outcomes may differ across models, making it necessary to explore these discrepancies and demonstrate the advantages of the hybrid model. As shown in Figure 7, the operational direction of each factor is similar across the three baseline models. For instance, slope and susceptibility exhibit a generally positive relationship, while elevation and susceptibility show an inverse correlation. However, Figure 8 reveals notable differences in factor rankings across the three baseline models. Specifically, although the LightGBM and XGBoost models agree on the three most important factors, discrepancies emerge from the fourth position onwards. This divergence is more pronounced in comparison to the RF model. For instance, both LightGBM and XGBoost identify slope as the most significant factor, while RF ranks elevation as the most influential. This discrepancy can even change further with different training iterations (Figure 5). Therefore, a more stable hybrid model is valuable, as it mitigates the uncertainty inherent in the interpretation of individual models. As shown in Figure 8d, the hybrid model combines results from all three base models, and the contribution of each factor (indicated by bar length) lies between the contributions of the baseline models. Additionally, the operational direction of each factor in the hybrid model remains consistent with that in the original baseline models. These findings demonstrate that the hybrid model reduces uncertainty in factor interpretation and provides more reliable explanatory results.

4.4.2. Marginal Effects of Driving Factors

Using the hybrid model, we applied SHAP values to interpret how the six most significant factors influence landslide susceptibility in the study area, as shown in Figure 9. The factors, including slope, elevation, LS, RSP, NDVI, and TWI, were found to have the greatest impact on landslide susceptibility. Notably, the relationships between these factors and susceptibility are nonlinear, with clear threshold effects that highlight the marginal contributions of each factor. Specifically, for slope, the threshold was identified at 5.9° and 39.6°. Susceptibility increases when slope lies between these two values, while it decreases when the slope is either below 5.9° or above 39.6°. Similarly, elevation promotes susceptibility between 490 m and 1375 m, but inhibits it above 1375 m or below 490 m. For LS, susceptibility is enhanced when the value is between 5.0 and 29.6, with a suppression occurring outside this range. RSP also exhibits a similar pattern: susceptibility increases between 0.012 and 0.185 but decreases outside this range. NDVI shows susceptibility enhancement between 0.53 and 0.64, with inhibition outside these values. Finally, TWI enhances susceptibility between 5.92 and 8.56 but reduces it outside this range. These threshold effects clearly demonstrate the complex, marginal contributions of each factor to landslide susceptibility. By using the hybrid model, we are able to quantitatively describe these nonlinear relationships and better understand the marginal effects of each factor.

5. Discussion

5.1. The Advances of the Hybrid Model

This study addresses the challenge of improving the interpretability and robustness of landslide susceptibility assessments, which remains a critical issue in geohazard prediction (e.g., [28,63]). Recent studies have highlighted the strengths and limitations of commonly used machine learning models, such as LightGBM, XGBoost, and Random Forest (RF), in providing accurate susceptibility predictions [64]. However, these models often exhibit inconsistencies in feature importance rankings and variation in model explanations, which can undermine their reliability in real-world applications [65,66]. This study presents a hybrid model that enhances the robustness of interpretability for landslide susceptibility predictions by integrating multiple models, providing a more robust explanation of landslide susceptibility. The hybrid model offers notable advantages over traditional models. In terms of interpretability, it demonstrates greater robustness in feature rankings across folds, as evidenced by the lower coefficient of variation (CV) values (Figure 5). This robustness is essential for ensuring reliable, consistent outputs in susceptibility assessments, where fluctuations in factor importance can lead to different predictive results. Previous studies have emphasized the importance of stable, interpretable models in geohazard prediction, yet single models often fail to provide such consistency [67]. The hybrid model alleviates this limitation effectively, offering a more dependable solution for understanding and explaining landslide susceptibility. Regarding predictive performance, although the hybrid model does not significantly outperform the individual models (Table 2), it maintains comparable accuracy while offering superior robustness. This aligns with the original intention of the EBM model proposed by CA et al., which integrates gradient boosting techniques with decision trees to provide both direct interpretability and competitive accuracy [68]. In contrast to this study, the hybrid model we propose employs a more accessible integration strategy that combines multiple base models and their explanatory results, offering greater scalability. In summary, the hybrid model retains high accuracy while reducing variability in the interpretation of results, which is a key advantage over standalone models.

5.2. Interpretability of Driving Factors

The interpretation of driving factors influencing landslide susceptibility is a critical aspect of hazard assessment, as understanding these factors can lead to more accurate risk predictions and informed decision-making [69]. This study makes an important contribution by identifying and quantifying the nonlinear relationships between key factors like slope, elevation, and topographic wetness index (TWI) with landslide susceptibility. Consistent with the study by Wang et al., elevation and slope emerged as the most influential factors associated with landslide occurrence [70]. While the correlation between elevation and landslides may seem ambiguous, it could be attributed to the concentration of logging roads within certain elevation bands, which alters terrain stability and land use patterns. In addition, the study finds that slope increases susceptibility within a range of 5.9° to 39.6°, beyond which it begins to decrease. These results align with those of Liu et al., who identified a high incidence of landslides within the slope range of 2.95° to 68.19° [71]. Similarly, elevation shows a peak susceptibility between 490 m and 1375 m, with susceptibility dropping outside of this range. These findings push forward the field’s understanding by demonstrating that the relationship between landslide susceptibility and driving factors is far from simple linearity. In the context of the existing literature, this work moves beyond the traditional emphasis on identifying which factors are important (e.g., slope, elevation, rainfall) and begins to delve deeper into how these factors interact with each other in a nonlinear fashion. Previous studies have often highlighted slope as a key factor in landslide susceptibility assessments (e.g., [72,73,74]), but without clearly defining the specific range of slope values that lead to increased risk. By incorporating these threshold effects generated by interpretable machine learning models into the analysis, this study adds valuable detail to the ongoing research into landslide susceptibility.

5.3. Implication and Limitation

This study’s identification of threshold effects and nonlinear relationships in landslide susceptibility has significant implications for Xi’an, a city with rich cultural heritage and growing infrastructure [75]. By identifying critical thresholds for factors like slope (5.9° to 39.6°) and elevation (490 m to 1375 m), urban planners can make informed decisions about land use in landslide-prone areas, minimizing risks to both modern infrastructure and ancient landmarks such as the city walls and Terracotta Army. The study’s focus on multi-factor interactions is crucial for Xi’an’s long-term development, balancing urbanization with the preservation of cultural sites. These insights can help guide sustainable development and enhance the city’s resilience to landslides, ensuring both growth and heritage preservation. The proposed hybrid model demonstrates strong potential for application across diverse geographic environments. Its flexible design enables the incorporation of region-specific conditioning factors, allowing it to adapt to varying environmental, climatic, and urban conditions. Moreover, the model can be implemented entirely using Python 3.9, enhancing its accessibility and facilitating broader adoption in both academic and practical settings.

Several limitations are inevitably present in this study. The interpretable hybrid model proposed is based on three popular base machine learning models. However, the integration of deep learning models such as CNN, LSTM, and transformer, among others, warrants further development. The strategy used in this study, heterogeneous category, effectively integrates the models in a simple manner, but a comparison with other strategies, such as stacking and bagging, in terms of model interpretability deserves further exploration. In addition, differentiating landslide types in future driver analyses would significantly enhance the robustness and interpretability of the results. Despite these limitations, we emphasize the contribution of the proposed interpretable hybrid model in improving the robustness of model explanations.

6. Conclusions

This study proposes an interpretable hybrid model to improve the robustness of model interpretability for landslide susceptibility assessments. The model adopts a heterogeneous category strategy, integrating three machine learning models (LightGBM, XGBoost, and Random Forest) along with their SHAP, achieving a comprehensive interpretation of multiple models. The conclusions are as follows:

(1): The hybrid model demonstrates superior robustness, with a coefficient variation (CV) value of 0.175, significantly lower than the CV values exceeding 0.2 for the baseline models. This indicates more reliable feature rankings across folds.
(2): Although the hybrid model does not drastically outperform the individual models, it maintains competitive predictive accuracy, with an AUC of 0.87, accuracy of 0.80, precision of 0.79, recall of 0.87, and F1 score of 0.83. This highlights its effectiveness in providing stable and consistent results for landslide susceptibility mapping.
(3): The study identifies critical threshold values for factors like slope (5.9° to 39.6°) and elevation (490 m to 1375 m), which demonstrate nonlinear relationships with landslide susceptibility. These insights contribute to a more nuanced understanding of the factors influencing landslide occurrence.

By integrating multiple models, the hybrid approach minimizes uncertainties in factor interpretation, offering more stable and dependable results compared to individual models, particularly in terms of understanding factor interactions. Despite these strengths, the integration of deep learning models, such as CNN, LSTM, and transformer, remains a promising direction for future research to further improve the robustness and generalization capabilities of the model.

Author Contributions

Conceptualization, Xiao Yan, Dongshui Zhang and Yongshun Han; methodology, Xiao Yan, Dongshui Zhang and Yongshun Han; validation, Dongshui Zhang; investigation, Xiao Yan, Dongshui Zhang, Yongshun Han and Tongsheng Li; writing—original draft preparation, Xiao Yan and Dongshui Zhang; writing—review and editing, Xiao Yan, Dongshui Zhang, Yongshun Han, Tongsheng Li, Pin Zhong, Zhe Ning and Shirou Tan; supervision, Xiao Yan, Dongshui Zhang and Yongshun Hang; project administration, Dongshui Zhang; funding acquisition, Dongshui Zhang. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Scientific Research Project of the Hunan Geological Institute (grant number HNGSTP202303), the Hunan Provincial Natural Science Foundation Program (grant number 2024JJ5147), the Key Projects of Hunan Provincial Department of Education (grant number 24A0342), Open Fund (NO. hndzgczx2024011) of Hunan Provincial Geological Disaster Monitoring Early Warning and Emergency Rescue Engineering Technology Research Center, the Natural Resources Research (Standards) Post-subsidy Project of the Hunan Provincial Department of Natural Resources (grant number HBZ20240164), and the Hunan Innovation and Entrepreneurship Training Program for College Students (grant number S2024105340105).

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Ado, M.; Amitab, K.; Maji, A.K.; Jasińska, E.; Gono, R.; Leonowicz, Z.; Jasiński, M. Landslide Susceptibility Mapping Using Machine Learning: A Literature Survey. Remote Sens. 2022, 14, 3029. [Google Scholar] [CrossRef]
Liu, S.; Wang, L.; Zhang, W.; He, Y.; Pijush, S. A comprehensive review of machine learning-based methods in landslide susceptibility mapping. Geol. J. 2023, 58, 2283–2301. [Google Scholar] [CrossRef]
Azarafza, M.; Azarafza, M.; Akgün, H.; Atkinson, P.M.; Derakhshani, R. Deep learning-based landslide susceptibility mapping. Sci. Rep. 2021, 11, 24112. [Google Scholar] [CrossRef] [PubMed]
Gariano, S.L.; Guzzetti, F. Landslides in a changing climate. Earth-Sci. Rev. 2016, 162, 227–252. [Google Scholar] [CrossRef]
Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
Wang, H.; Zhang, L.; Luo, H.; He, J.; Cheung, R.W.M. AI-powered landslide susceptibility assessment in Hong Kong. Eng. Geol. 2021, 288, 106103. [Google Scholar] [CrossRef]
Achu, A.L.; Aju, C.D.; Di Napoli, M.; Prakash, P.; Gopinath, G.; Shaji, E.; Chandra, V. Machine-learning based landslide susceptibility modelling with emphasis on uncertainty analysis. Geosci. Front. 2023, 14, 101657. [Google Scholar] [CrossRef]
Zeng, T.; Wu, L.; Peduto, D.; Glade, T.; Hayakawa, Y.S.; Yin, K. Ensemble learning framework for landslide susceptibility mapping: Different basic classifier and ensemble strategy. Geosci. Front. 2023, 14, 101645. [Google Scholar] [CrossRef]
Pourghasemi, H.R.; Kornejady, A.; Kerle, N.; Shabani, F. Investigating the effects of different landslide positioning techniques, landslide partitioning approaches, and presence-absence balances on landslide susceptibility mapping. CATENA 2020, 187, 104364. [Google Scholar] [CrossRef]
Lin, Q.; Lima, P.; Steger, S.; Glade, T.; Jiang, T.; Zhang, J.; Liu, T.; Wang, Y. National-scale data-driven rainfall induced landslide susceptibility mapping for China by accounting for incomplete landslide data. Geosci. Front. 2021, 12, 101248. [Google Scholar] [CrossRef]
Huang, F.; Ye, Z.; Jiang, S.-H.; Huang, J.; Chang, Z.; Chen, J. Uncertainty study of landslide susceptibility prediction considering the different attribute interval numbers of environmental factors and different data-based models. CATENA 2021, 202, 105250. [Google Scholar] [CrossRef]
Liu, J.; Jiyan, W.; Junnan, X.; Weiming, C.; Yi, L.; Yifan, C.; Yufeng, H.; Yu, D.; Wen, H.; Yang, G. Assessment of flood susceptibility mapping using support vector machine, logistic regression and their ensemble techniques in the Belt and Road region. Geocarto Int. 2022, 37, 9817–9846. [Google Scholar] [CrossRef]
Marzini, L.; D’Addario, E.; Papasidero, M.P.; Chianucci, F.; Disperati, L. Influence of Root Reinforcement on Shallow Landslide Distribution: A Case Study in Garfagnana (Northern Tuscany, Italy). Geosciences 2023, 13, 326. [Google Scholar] [CrossRef]
Zhu, A.-X.; Miao, Y.; Liu, J.; Bai, S.; Zeng, C.; Ma, T.; Hong, H.J.C. A similarity-based approach to sampling absence data for landslide susceptibility mapping using data-driven methods. Catena 2019, 183, 104188. [Google Scholar] [CrossRef]
Huang, Y.; Zhao, L. Review on landslide susceptibility mapping using support vector machines. CATENA 2018, 165, 520–529. [Google Scholar] [CrossRef]
Abbaszadeh Shahri, A.; Spross, J.; Johansson, F.; Larsson, S. Landslide susceptibility hazard map in southwest Sweden using artificial neural network. CATENA 2019, 183, 104225. [Google Scholar] [CrossRef]
Sun, D.; Wen, H.; Wang, D.; Xu, J. A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm. Geomorphology 2020, 362, 107201. [Google Scholar] [CrossRef]
Song, Y.; Niu, R.; Xu, S.; Ye, R.; Peng, L.; Guo, T.; Li, S.; Chen, T. Landslide Susceptibility Mapping Based on Weighted Gradient Boosting Decision Tree in Wanzhou Section of the Three Gorges Reservoir Area (China). ISPRS Int. J. Geo-Inf. 2019, 8, 4. [Google Scholar] [CrossRef]
Wu, Y.; Ke, Y.; Chen, Z.; Liang, S.; Zhao, H.; Hong, H. Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping. CATENA 2020, 187, 104396. [Google Scholar] [CrossRef]
Liu, R.; Yang, X.; Xu, C.; Wei, L.; Zeng, X. Comparative Study of Convolutional Neural Network and Conventional Machine Learning Methods for Landslide Susceptibility Mapping. Remote Sens. 2022, 14, 321. [Google Scholar] [CrossRef]
Wang, N.; Zhang, H.; Dahal, A.; Cheng, W.; Zhao, M.; Lombardo, L. On the use of explainable AI for susceptibility modeling: Examining the spatial pattern of SHAP values. Geosci. Front. 2024, 15, 101800. [Google Scholar] [CrossRef]
Lv, J.; Zhang, R.; Shama, A.; Hong, R.; He, X.; Wu, R.; Bao, X.; Liu, G. Exploring the spatial patterns of landslide susceptibility assessment using interpretable Shapley method: Mechanisms of landslide formation in the Sichuan-Tibet region. J. Environ. Manag. 2024, 366, 121921. [Google Scholar] [CrossRef]
Padarian, J.; McBratney, A.B.; Minasny, B. Game theory interpretation of digital soil mapping convolutional neural networks. SOIL 2020, 6, 389–397. [Google Scholar] [CrossRef]
Zhang, J.; Ma, X.; Zhang, J.; Sun, D.; Zhou, X.; Mi, C.; Wen, H. Insights into geospatial heterogeneity of landslide susceptibility based on the SHAP-XGBoost model. J. Environ. Manag. 2023, 332, 117357. [Google Scholar] [CrossRef]
Pradhan, B.; Dikshit, A.; Lee, S.; Kim, H. An explainable AI (XAI) model for landslide susceptibility modeling. Appl. Soft Comput. 2023, 142, 110324. [Google Scholar] [CrossRef]
Jiang, S.; Sweet, L.-b.; Blougouras, G.; Brenning, A.; Li, W.; Reichstein, M.; Denzler, J.; Shangguan, W.; Yu, G.; Huang, F.; et al. How Interpretable Machine Learning Can Benefit Process Understanding in the Geosciences. Earth’s Future 2024, 12, e2024EF004540. [Google Scholar] [CrossRef]
Bommer, P.L.; Kretschmer, M.; Hedström, A.; Bareeva, D.; Höhne, M.M.-C. Finding the Right XAI Method—A Guide for the Evaluation and Ranking of Explainable AI Methods in Climate Science. Artif. Intell. Earth Syst. 2024, 3, e230074. [Google Scholar] [CrossRef]
Panigrahi, B.; Razavi, S.; Doig, L.E.; Cordell, B.; Gupta, H.V.; Liber, K. On Robustness of the Explanatory Power of Machine Learning Models: Insights From a New Explainable AI Approach Using Sensitivity Analysis. Water Resour. Res. 2025, 61, e2024WR037398. [Google Scholar] [CrossRef]
Ullah, M.; Li, J.; Wadood, B. Analysis of Urban Expansion and its Impacts on Land Surface Temperature and Vegetation Using RS and GIS, A Case Study in Xi’an City, China. Earth Syst. Environ. 2020, 4, 583–597. [Google Scholar] [CrossRef]
Yang, Z.; Song, J.; Cheng, D.; Xia, J.; Li, Q.; Ahamad, M.I. Comprehensive evaluation and scenario simulation for the water resources carrying capacity in Xi’an city, China. J. Environ. Manag. 2019, 230, 221–233. [Google Scholar] [CrossRef]
Liu, X.; Shao, S.; Shao, S. Landslide susceptibility zonation using the analytical hierarchy process (AHP) in the Great Xi’an Region, China. Sci. Rep. 2024, 14, 2941. [Google Scholar] [CrossRef]
Zhuang, J.; Peng, J.; Iqbal, J.; Liu, T.; Liu, N.; Li, Y.; Ma, P. Identification of landslide spatial distribution and susceptibility assessment in relation to topography in the Xi’an Region, Shaanxi Province, China. Front. Earth Sci. 2015, 9, 449–462. [Google Scholar] [CrossRef]
Li, D.; Wang, J.; Shi, K. Research on the Investigation and Value Evaluation of Historic Building Resources in Xi’an City. Buildings 2023, 13, 2244. [Google Scholar] [CrossRef]
Wu, W.; Zhang, Q.; Singh, V.P.; Wang, G.; Zhao, J.; Shen, Z.; Sun, S. A Data-Driven Model on Google Earth Engine for Landslide Susceptibility Assessment in the Hengduan Mountains, the Qinghai–Tibetan Plateau. Remote Sens. 2022, 14, 4662. [Google Scholar] [CrossRef]
Li, W.-Y.; Liu, C.; Hong, Y.; Zhang, X.-H.; Wan, Z.-M.; Saharia, M.; Sun, W.-W.; Yao, D.-J.; Chen, W.; Chen, S.; et al. A public Cloud-based China’s Landslide Inventory Database (CsLID): Development, zone, and spatiotemporal analysis for significant historical events, 1949–2011. J. Mt. Sci. 2016, 13, 1275–1285. [Google Scholar] [CrossRef]
Guo, Z.; Tian, B.; Zhu, Y.; He, J.; Zhang, T. How do the landslide and non-landslide sampling strategies impact landslide susceptibility assessment?—A catchment-scale case study from China. J. Rock Mech. Geotech. Eng. 2024, 16, 877–894. [Google Scholar] [CrossRef]
Sharma, N.; Saharia, M.; Ramana, G.V. High resolution landslide susceptibility mapping using ensemble machine learning and geospatial big data. CATENA 2024, 235, 107653. [Google Scholar] [CrossRef]
Yu, X.; Chen, H. Research on the influence of different sampling resolution and spatial resolution in sampling strategy on landslide susceptibility mapping results. Sci. Rep. 2024, 14, 1549. [Google Scholar] [CrossRef]
Selamat, S.N.; Majid, N.A.; Taha, M.R. Multicollinearity and spatial correlation analysis of landslide conditioning factors in Langat River Basin, Selangor. Nat. Hazards 2025, 121, 2665–2684. [Google Scholar] [CrossRef]
Hong, H.; Liu, J.; Zhu, A.X. Modeling landslide susceptibility using LogitBoost alternating decision trees and forest by penalizing attributes with the bagging ensemble. Sci. Total Environ. 2020, 718, 137231. [Google Scholar] [CrossRef]
Liu, Q.; Huang, D.; Tang, A.; Han, X. Model performance analysis for landslide susceptibility in cold regions using accuracy rate and fluctuation characteristics. Nat. Hazards 2021, 108, 1047–1067. [Google Scholar] [CrossRef]
Yu, L.; Cao, Y.; Zhou, C.; Wang, Y.; Huo, Z. Landslide Susceptibility Mapping Combining Information Gain Ratio and Support Vector Machines: A Case Study from Wushan Segment in the Three Gorges Reservoir Area, China. Appl. Sci. 2019, 9, 4756. [Google Scholar] [CrossRef]
Quinlan, J.R. Induction of decision trees. Mach. Learn. 1986, 1, 81–106. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Sun, D.; Wu, X.; Wen, H.; Shi, S.; Gu, Q. Improving generalization performance of landslide susceptibility model considering spatial heterogeneity by using the geomorphic label-based LightGBM. Bull. Eng. Geol. Environ. 2024, 83, 361. [Google Scholar] [CrossRef]
Wang, Y.; Ling, Y.; Chan, T.O.; Awange, J. High-resolution earthquake-induced landslide hazard assessment in Southwest China through frequency ratio analysis and LightGBM. Int. J. Appl. Earth Obs. Geo-Inf. 2024, 131, 103947. [Google Scholar] [CrossRef]
Sun, D.; Chen, D.; Zhang, J.; Mi, C.; Gu, Q.; Wen, H. Landslide Susceptibility Mapping Based on Interpretable Machine Learning from the Perspective of Geomorphological Differentiation. ISPRS Int. J. Geo-Inf. 2023, 12, 1018. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016. [Google Scholar]
Loksa, D.; Ko, A.J. The role of self-regulation in programming problem solving process and success. In Proceedings of the ACM Conferences on International Computing Education Research, Melbourne, VIC, Australia, 8–12 September 2016. [Google Scholar]
Zhang, Y.; Deng, L.; Han, Y.; Sun, Y.; Zang, Y.; Zhou, M. Landslide Hazard Assessment in Highway Areas of Guangxi Using Remote Sensing Data and a Pre-Trained XGBoost Model. Remote Sens. 2023, 15, 3350. [Google Scholar] [CrossRef]
Pradhan, A.M.S.; Kim, Y.-T. Rainfall-Induced Shallow Landslide Susceptibility Mapping at Two Adjacent Catchments Using Advanced Machine Learning Algorithms. ISPRS Int. J. Geo-Inf. 2020, 9, 569. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Akinci, H.; Kilicoglu, C.; Dogan, S. Random Forest-Based Landslide Susceptibility Mapping in Coastal Regions of Artvin, Turkey. ISPRS Int. J. Geo-Inf. 2020, 9, 553. [Google Scholar] [CrossRef]
Sun, D.; Xu, J.; Wen, H.; Wang, D. Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: A comparison between logistic regression and random forest. Eng. Geol. 2021, 281, 105972. [Google Scholar] [CrossRef]
Zhou, S.; Zhang, D.; Wang, M.; Liu, Z.; Gan, W.; Zhao, Z.; Xue, S.; Müller, B.; Zhou, M.; Ni, X.; et al. Risk-driven composition decoupling analysis for urban flooding prediction in high-density urban areas using Bayesian-Optimized LightGBM. J. Clean. Prod. 2024, 457, 142286. [Google Scholar] [CrossRef]
Pourghasemi, H.R.; Yousefi, S.; Kornejady, A.; Cerdà, A. Performance assessment of individual and ensemble data-mining techniques for gully erosion modeling. Sci. Total Environ. 2017, 609, 764–775. [Google Scholar] [CrossRef] [PubMed]
Liu, D.; Cao, C.; Dubovyk, O.; Tian, R.; Chen, W.; Zhuang, Q.; Zhao, Y.; Menz, G. Using fuzzy analytic hierarchy process for spatio-temporal analysis of eco-environmental vulnerability change during 1990–2010 in Sanjiangyuan region, China. Ecol. Indic. 2017, 73, 612–625. [Google Scholar] [CrossRef]
Sun, D.; Ding, Y.; Wen, H.; Zhang, F.; Zhang, J.; Gu, Q.; Zhang, J. SHAP-PDP hybrid interpretation of decision-making mechanism of machine learning-based landslide susceptibility mapping: A case study at Wushan District, China. Egypt. J. Remote Sens. Space Sci. 2024, 27, 508–523. [Google Scholar] [CrossRef]
Bacanin, N.; Perisic, M.; Jovanovic, G.; Damaševičius, R.; Stanisic, S.; Simic, V.; Zivkovic, M.; Stojic, A. The explainable potential of coupling hybridized metaheuristics, XGBoost, and SHAP in revealing toluene behavior in the atmosphere. Sci. Total Environ. 2024, 929, 172195. [Google Scholar] [CrossRef]
He, Y.; Ding, M.; Duan, Y.; Zheng, H.; He, W.; Liu, J. Debris flows dynamic risk assessment and interpretable Shapley method-based driving mechanisms exploring—A case study of the upper reach of the Min River. Ecol. Indic. 2025, 173, 113400. [Google Scholar] [CrossRef]
Xiong, J.; Pei, T.; Qiu, T. A Novel Framework for Spatiotemporal Susceptibility Prediction of Rainfall-Induced Landslides: A Case Study in Western Pennsylvania. Remote Sens. 2024, 16, 3526. [Google Scholar] [CrossRef]
Liu, J.; Zhao, X.; Chen, Y.; Sun, H.; Gu, Y.; Xu, S. Uncertainty pattern and an integration strategy in flood susceptibility modeling: Limited sample size. J. Hydrol. 2025, 658, 133184. [Google Scholar] [CrossRef]
Kavzoglu, T.; Kutlug Sahin, E.; Colkesen, I. An assessment of multivariate and bivariate approaches in landslide susceptibility mapping: A case study of Duzkoy district. Nat. Hazards 2015, 76, 471–496. [Google Scholar] [CrossRef]
Le, X.H.; Choi, C.; Eu, S.; Yeon, M.; Lee, G. Quantitative evaluation of uncertainty and interpretability in machine learning-based landslide susceptibility mapping through feature selection and explainable AI. Front. Environ. Sci. 2024, 12, 1424988. [Google Scholar] [CrossRef]
Hong, H.; Miao, Y.; Liu, J.; Zhu, A.X. Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping. CATENA 2019, 176, 45–64. [Google Scholar] [CrossRef]
Rudin, C.; Chen, C.; Chen, Z.; Huang, H.; Semenova, L.; Zhong, C. Interpretable machine learning: Fundamental principles and 10 grand challenges. Stat. Surv. 2022, 16, 1–85. [Google Scholar] [CrossRef]
Wang, Y.; Liu, Y.; Cao, Z.; Zhang, D. Prediction of contraction channel scour depth: Based on interpretability analysis and PCA-enhanced SVR. J. HydroInform. 2024, 26, 3287–3305. [Google Scholar] [CrossRef]
Chen, C.; Fan, L. Interpretability of Statistical, Machine Learning, and Deep Learning Models for Landslide Susceptibility Mapping in Three Gorges Reservoir Area; Cornell University: Ithaca, NY, USA, 2024. [Google Scholar]
Caleca, F.; Confuorto, P.; Raspini, F.; Segoni, S.; Tofani, V.; Casagli, N.; Moretti, S. Shifting from traditional landslide occurrence modeling to scenario estimation with a “glass-box” machine learning. Sci. Total Environ. 2024, 950, 175277. [Google Scholar] [CrossRef]
Huang, F.; Mao, D.; Jiang, S.-H.; Zhou, C.; Fan, X.; Zeng, Z.; Catani, F.; Yu, C.; Chang, Z.; Huang, J.; et al. Uncertainties in landslide susceptibility prediction modeling: A review on the incompleteness of landslide inventory and its influence rules. Geosci. Front. 2024, 15, 101886. [Google Scholar] [CrossRef]
Lineback Gritzner, M.; Marcus, W.A.; Aspinall, R.; Custer, S.G. Assessing landslide potential using GIS, soil wetness modeling and topographic attributes, Payette River, Idaho. Geomorphology 2001, 37, 149–165. [Google Scholar] [CrossRef]
Moazzam, M.F.U.; Vansarochana, A.; Boonyanuphap, J.; Choosumrong, S.; Rahman, G.; Djueyep, G.P. Spatio-statistical comparative approaches for landslide susceptibility modeling: Case of Mae Phun, Uttaradit Province, Thailand. SN Appl. Sci. 2020, 2, 384. [Google Scholar] [CrossRef]
Hua, Y.; Wang, X.; Li, Y.; Xu, P.; Xia, W. Dynamic development of landslide susceptibility based on slope unit and deep neural networks. Landslides 2021, 18, 281–302. [Google Scholar] [CrossRef]
Conforti, M.; Ietto, F. Modeling Shallow Landslide Susceptibility and Assessment of the Relative Importance of Predisposing Factors, through a GIS-Based Statistical Analysis. Geosciences 2021, 11, 333. [Google Scholar] [CrossRef]
Wang, F.; Xu, P.; Wang, C.; Wang, N.; Jiang, N. Application of a GIS-Based Slope Unit Method for Landslide Susceptibility Mapping along the Longzi River, Southeastern Tibetan Plateau, China. ISPRS Int. J. Geo-Inf. 2017, 6, 172. [Google Scholar] [CrossRef]

Figure 1. The study area of Xi’an city in China: (a) elevation and landslides of Xi’an city, (b) Shanxi province in China, and (c) Xi’an city in Shanxi province.

Figure 2. The landslide conditioning factors used in this study. (a) Rainfall (AR); (b) Convergence Index (CI); (c) Elevation; (d) Land use; (e) Lithology; (f) LS factor; (g) NDVI; (h) Plan Curvature (PLC); (i) Profile curvature (PRC); (j) Relative Slope Position (RSP); (k) Slope; (l) Topographic Wetness Index (TWI); (m) Aspect; (n) Soil; (o) Distance to River (DR).

Figure 3. The workflow of this study.

Figure 4. The Information Gain Rate (a) values and the Pearson correlation coefficients (b) of the landslide conditioning factors.

Figure 5. Statistical comparison of factor rankings across four different models under 10-fold cross-validation: (a) LightGBM model, (b) XGBoost model, (c) Random Forest model, (d) hybrid model.

Figure 6. The spatial pattern of landslide susceptibility predicted by four models: (a) LightGBM model, (b) XGBoost model, (c) Random Forest model, (d) hybrid model.

Figure 7. The SHAP summary plot of landslide conditioning factors interpretated by different models: (a) LightGBM model, (b) XGBoost model, (c) Random Forest model, (d) hybrid model.

Figure 8. The factors contribution ranking based on SHAP values interpretated by different models: (a) LightGBM model, (b) XGBoost model, (c) Random Forest model, (d) hybrid model.

Figure 9. Marginal effects of landslide conditioning factors on landslide susceptibility interpretated by the hybrid model.

Table 1. The data source used in this study.

Factors	Data	Source of Data	Time	Resolution
AR	Annual spatially interpolated dataset of meteorological elements in China	Resource and Environmental Science Data Platform (https://www.resdc.cn/) (accessed on 3 February 2024)	1960–2020	1 km
Elevation	Digital Elevation Model (DEM)	Shuttle Radar Topography Mission (SRTM, https://www.earthdata.nasa.gov/sensors/srtm) (accessed on 13 February 2024)	2000	90 m
CI
LS
PLC
PRC
Slope
RSP
TWI
Aspect
Lithology	Lithology	China Geological Survey (https://www.cgs.gov.cn/) (accessed on 13 February 2024)	/	1:10,000
NDVI	MOD13A1	(https://lpdaac.usgs.gov/products/mod13a1v006/) (accessed on 15 February 2024)	2020	500 m
Land use	CLCD	https://zenodo.org/records/5816591#.ZAWM3BVBy5c (accessed on 21 February 2024)	2020	30 m
DR	River	HydroSHEDS (https://www.hydrosheds.org/) (accessed on 18 February 2024)	2013	/
Soil	Soil	National Earth System Science Data Center (accessed on 25 February 2024)	/	1:1,000,000

Average annual rainfall (AR), Convergence Index (CI), profile curvature (PRC), LS factor (LS), Normalized Difference Vegetation Index (NDVI), plan curvature (PLC), relative slope position (RSP), distance to river (DR), Topographic Wetness Index (TWI).

Table 2. Performance metrics of the four models.

Models	AUC	Accuracy	Precision	Recalls	F1 scores
LightGBM	0.86	0.78	0.77	0.88	0.82
XGBoost	0.87	0.79	0.79	0.83	0.81
Random Forest	0.86	0.78	0.77	0.86	0.81
Hybrid	0.87	0.8	0.79	0.87	0.83

Table 3. McNemar’s test results comparing landslide susceptibility outputs across different models.

Models	LightGBM	XGBoost	Random Forest	Hybrid
LightGBM	0	**	**	**
XGBoost	8573.9	0	**	**
Random Forest	12,033.8	736.6	0	**
Hybrid	7757.3	1283.1	3921.1	0

Notes: ** represents the statistical significance p < 0.05.

Table 4. The area percentage of landslide susceptibility classes, and the frequency ratio (FR) predicted by different models (unit: %).

Susceptibility	LightGBM		XGBoost		Random Forest		Hybrid
	P/%	FR	P/%	FR	P/%	FR	P/%	FR
Very low	49.38	0.01	51.08	0.01	35.17	0.00	42.72	0.01
Low	17.13	0.17	13.38	0.15	23.46	0.02	20.44	0.07
Medium	10.19	0.42	11.23	0.49	14.65	0.57	11.79	0.47
High	11.24	1.92	11.98	1.56	14.99	1.67	12.72	1.61
Very high	12.06	5.85	12.33	5.84	11.73	5.63	12.33	5.96

Frequency ratio was derived as the ratio between the proportion of landslide occurrences and the proportion of the corresponding area.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yan, X.; Zhang, D.; Han, Y.; Li, T.; Zhong, P.; Ning, Z.; Tan, S. Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment. ISPRS Int. J. Geo-Inf. 2025, 14, 277. https://doi.org/10.3390/ijgi14070277

AMA Style

Yan X, Zhang D, Han Y, Li T, Zhong P, Ning Z, Tan S. Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment. ISPRS International Journal of Geo-Information. 2025; 14(7):277. https://doi.org/10.3390/ijgi14070277

Chicago/Turabian Style

Yan, Xiao, Dongshui Zhang, Yongshun Han, Tongsheng Li, Pin Zhong, Zhe Ning, and Shirou Tan. 2025. "Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment" ISPRS International Journal of Geo-Information 14, no. 7: 277. https://doi.org/10.3390/ijgi14070277

APA Style

Yan, X., Zhang, D., Han, Y., Li, T., Zhong, P., Ning, Z., & Tan, S. (2025). Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment. ISPRS International Journal of Geo-Information, 14(7), 277. https://doi.org/10.3390/ijgi14070277

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Developing a Hybrid Model to Enhance the Robustness of Interpretability for Landslide Susceptibility Assessment

Abstract

1. Introduction

2. Materials

2.1. Study Area

2.2. Data

2.2.1. Landslide Inventory

2.2.2. Landslide Conditioning Factors

3. Methodology

3.1. Feature Selection Methods

3.1.1. Pearson Correlation Analysis

3.1.2. Information Gain Rate

3.2. Baseline Machine Learning Models

3.2.1. LightGBM Model

3.2.2. XGBoost Model

3.2.3. Random Forest Model

3.3. Construction of Interpretable Hybrid Model

3.3.1. Heterogeneous Category Strategy

3.3.2. Shapley Additive Explanations

3.4. The Evaluation for Model Performance and Interpretive Robustness

4. Results

4.1. Feature Selection

4.2. Interpretive Robustness and Model Performance

4.3. Landslide Susceptibility Mapping

4.4. Interpretation of Landslide Susceptibility

4.4.1. Global Interpretation

4.4.2. Marginal Effects of Driving Factors

5. Discussion

5.1. The Advances of the Hybrid Model

5.2. Interpretability of Driving Factors

5.3. Implication and Limitation

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI