Comparative Performance of Machine Learning Models for Landslide Susceptibility Assessment: Impact of Sampling Strategies in Highway Buffer Zone
Abstract
1. Introduction
2. Study Area
3. Materials and Methods
3.1. Landslide Inventory
3.2. Contributing Factors
3.3. Landslide Susceptibility Assessment
3.3.1. Sampling the Training and Validation Dataset
3.3.2. Modeling Techniques
- Naïve Bayes (NB)
- Support Vector Machine (SVM)
- SVM–RF Hybrid Modeling Framework (SVM–RF)
- Extreme Gradient Boosting (XGBoost)
3.3.3. Assessing Predictive Performance
- (True Positives): the number of correctly predicted landslide instances;
- (False Positives): the number of non-landslide instances incorrectly predicted as landslides;
- (True Negatives): the number of correctly predicted non-landslide instances;
- (False Negatives): the number of landslide instances incorrectly predicted as non-landslides.
3.3.4. Assessing Factors’ Importance in Models
4. Results and Discussion
4.1. Multicollinearity Assessment of the Contributing Indicators
4.2. Landslide Susceptibility Maps
4.3. The Predictive Performance
- When using hazard-informed sampling methods such as GLHM, which reduce label noise and improve data quality, multiple models, including SVM-RF, XGBoost, SVM, and NB, demonstrate comparable and robust performance. In this scenario, model choice can be guided by other factors such as computational cost, interpretability, and ease of deployment, since performance differences are minimal, and models exhibit consistent robustness.
- When data are sampled randomly without hazard-level constraints, resulting in noisier and less reliable labels, more complex models like SVM-RF and XGBoost should be preferred. These models show significantly better predictive performance and greater robustness compared to simpler models, which are more sensitive to label noise and exhibit lower accuracy and stability.
4.4. The Importance Ranking of Contributing Factors
4.5. Possible Use
5. Conclusions
- XGBoost consistently achieved the best performance under both sampling strategies. Under GLHM sampling, XGBoost obtained the highest AUROC (94.61%, IQR: 3.58) and accuracy (84.30%, IQR: 2.81), outperforming the other models. Pairwise comparisons further confirmed the statistical superiority of XGBoost, with significant differences observed especially when compared to NB and SVM models (adjusted p-values < 0.05 or <0.001 in multiple comparisons). These results highlight the strong generalization capability of XGBoost in complex landslide susceptibility assessments, benefiting from its ability to model nonlinear interactions and handle high-dimensional feature spaces efficiently.
- The GLHM sampling method demonstrated a clear advantage over random sampling across all models. Both AUROC and accuracy were consistently improved under GLHM. For AUROC, the improvements for NB, SVM, SVM-RF, and XGBoost under GLHM reached +8.44 (p < 0.001), +7.11 (p < 0.001), +3.45 (p = 0.023), and +3.04 (p = 0.029), respectively. A similar trend was observed for accuracy, with increases of +11.30% (p < 0.001) for NB, +8.33% (p < 0.001) for SVM, +7.40% (p = 0.002) for SVM-RF, and +8.31% (p < 0.001) for XGBoost. These results indicate that GLHM effectively enhances model performance, likely by improving the representativeness of the training data and better capturing the underlying distribution of landslide and non-landslide units.
- Interpretability analysis using SHAP values further demonstrated that the choice of sampling method not only affects model performance but also the attribution of contributing factors. Under GLHM, top-ranked features were consistent across models, with STI (e.g., 0.2936 in SVM, 0.2947 in SVM-RF), NDVI (e.g., 0.3202 in SVM-RF, 0.2358 in SVM), and slope (e.g., 0.1374 in NB) appearing most frequently among the top three features. In contrast, under random sampling, feature rankings varied more widely, and models exhibited greater reliance on features such as elevation (e.g., 0.1260 in SVM) and lithology, which may reflect artifacts introduced by sampling from spatially mixed hazard contexts.
6. Limitations and Future Work
- Simplified Landslide Representation: This study used the centroids of historical landslide deposits as representative points. While practical, this may not fully capture landslide morphology. Future work could explore alternative point selections, such as the headscarp, or adopt polygon-based landslide datasets to better reflect their spatial extent and improve model fidelity.
- Sampling Strategy Constraints: The GLHM-based method for selecting non-landslide samples helped reduce bias, but its reliance on coarse, global-scale hazard data limits regional applicability. Future research could explore alternative non-landslide sampling strategies to improve the spatial representativeness and robustness of the dataset.
- Static Input Variables: The current models use static environmental factors, ignoring temporal triggers like rainfall or land use change. Future studies should integrate dynamic variables and time-series data to improve the predictive capability and adaptability of susceptibility models.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Huang, Y.; Xu, C.; Zhang, X.; Li, L.; Xu, X. Research in the field of natural hazards based on bibliometric analysis. Nat. Hazards Rev. 2023, 24, 04023012. [Google Scholar] [CrossRef]
- Chenchen, X.; Chong, X.; Yuandong, H.; Jielin, L.; Xiaoyi, S.; Xiwei, X.; Huiran, G.; Junxue, M.; Zikang, X. Advances in the study of natural disasters induced by the 23.7 extreme rainfall event in North China. Nat. Hazards Res. 2025, 5, 1–13. [Google Scholar] [CrossRef]
- Ministry of Natural Resources, P.R.C. China Natural Resources Statistics Bulletin 2024. 2025. Available online: https://gi.mnr.gov.cn/202503/t20250314_2881937.html (accessed on 14 March 2025).
- Aleotti, P.; Chowdhury, R. Landslide hazard assessment: Summary review and new perspectives. Bull. Eng. Geol. Environ. 1999, 58, 21–44. [Google Scholar] [CrossRef]
- Carrara, A.; Guzzetti, F.; Cardinali, M.; Reichenbach, P. Current limitations in modeling landslide hazard. In Proceedings of IAMG; De Frede Editori: Napoli, Italy, 1998; pp. 195–203. [Google Scholar]
- Gaidzik, K.; Ramírez-Herrera, M.T. The importance of input data on landslide susceptibility mapping. Sci. Rep. 2021, 11, 19334. [Google Scholar] [CrossRef] [PubMed]
- Md Sharafat, C. A review on landslide susceptibility mapping research in Bangladesh. Heliyon 2023, 9, e17972. [Google Scholar] [CrossRef] [PubMed]
- Lokesh, P.; Madhesh, C.; Aneesh, M.; Padala Raja, S. Machine learning and deep learning-based landslide susceptibility mapping using geospatial techniques in Wayanad, Kerala state, India. HydroResearch 2025, 8, 113–126. [Google Scholar] [CrossRef]
- Faming, H.; Zuokui, T.; Zizheng, G.; Filippo, C.; Jinsong, H. Uncertainties of landslide susceptibility prediction: Influences of different spatial resolutions, machine learning models and proportions of training and testing dataset. Rock Mech. Bull. 2023, 2, 100028. [Google Scholar] [CrossRef]
- Su, Y.; Chen, Y.; Lai, X.; Huang, S.; Lin, C.; Xie, X. Feature adaptation for landslide susceptibility assessment in “no sample” areas. Gondwana Res. 2024, 131, 1–17. [Google Scholar] [CrossRef]
- Guo, Z.; Tian, B.; Zhu, Y.; He, J.; Zhang, T. How do the landslide and non-landslide sampling strategies impact landslide susceptibility assessment?—A catchment-scale case study from China. J. Rock Mech. Geotech. Eng. 2024, 16, 877–894. [Google Scholar] [CrossRef]
- Lu, J.; He, Y.; Zhang, L.; Zhang, Q.; Gao, B.; Chen, H.; Fang, Y. Ensemble learning landslide susceptibility assessment with optimized non-landslide samples selection. Geomat. Nat. Hazards Risk 2024, 15, 2378176. [Google Scholar] [CrossRef]
- Meng, S.; Shi, Z.; Li, G.; Peng, M.; Liu, L.; Zheng, H.; Zhou, C. A novel deep learning framework for landslide susceptibility assessment using improved deep belief networks with the intelligent optimization algorithm. Comput. Geotech. 2024, 167, 106106. [Google Scholar] [CrossRef]
- Goetz, J.N.; Brenning, A.; Petschko, H.; Leopold, P. Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput. Geosci. 2015, 81, 1–11. [Google Scholar] [CrossRef]
- Nurwatik, N.; Ummah, M.H.; Cahyono, A.B.; Darminto, M.R.; Hong, J.-H. A Comparison Study of Landslide Susceptibility Spatial Modeling Using Machine Learning. ISPRS Int. J. Geo-Inf. 2022, 11, 602. [Google Scholar] [CrossRef]
- Hong, H.; Wang, D.; Zhu, A.; Wang, Y. Landslide susceptibility mapping based on the reliability of landslide and non-landslide sample. Expert Syst. Appl. 2024, 243, 122933. [Google Scholar] [CrossRef]
- Lin, Q.; Wang, Y.; Cheng, Q.; Huang, J.; Tian, H.; Liu, G.; He, K. The Alasu rock avalanche in the Tianshan Mountains, China: Fragmentation, landforms, and kinematics. Landslides 2024, 21, 439–459. [Google Scholar] [CrossRef]
- Abuduxun, N.; Xiao, W.; Windley, B.F.; Chen, Y.; Huang, P.; Sang, M.; Li, L.; Liu, X. Terminal suturing between the Tarim Craton and the Yili-Central Tianshan arc: Insights from mélange-ocean plate stratigraphy, detrital zircon ages, and provenance of the South Tianshan accretionary complex. Tectonics 2021, 40, e2021TC006705. [Google Scholar] [CrossRef]
- Gao, J.; Klemd, R. Formation of HP–LT rocks and their tectonic implications in the western Tianshan Orogen, NW China: Geochemical and age constraints. Lithos 2003, 66, 1–22. [Google Scholar] [CrossRef]
- Mohamed, R.; Lamees, M.; Ahmed, H.; Mohamed, A.S.Y.; Mohamed Elsadek, M.S.; Adel Kamel, M. Landslide susceptibility assessment along the Red Sea Coast in Egypt, based on multi-criteria spatial analysis and GIS techniques. Sci. Afr. 2024, 23, e02116. [Google Scholar] [CrossRef]
- Chen, W.; Peng, J.; Hong, H.; Shahabi, H.; Pradhan, B.; Liu, J.; Zhu, A.-X.; Pei, X.; Duan, Z. Landslide susceptibility modelling using GIS-based machine learning techniques for Chongren County, Jiangxi Province, China. Sci. Total Environ. 2018, 626, 1121–1135. [Google Scholar] [CrossRef] [PubMed]
- Santangelo, M.; Marchesini, I.; Bucci, F.; Cardinali, M.; Fiorucci, F.; Guzzetti, F. An approach to reduce mapping errors in the production of landslide inventory maps. Nat. Hazards Earth Syst. Sci. 2015, 15, 2111–2126. [Google Scholar] [CrossRef]
- Paul, G.P.; Alejandra, H.R. Landslide susceptibility index based on the integration of logistic regression and weights of evidence: A case study in Popayan, Colombia. Eng. Geol. 2021, 280, 105958. [Google Scholar] [CrossRef]
- Dai, F.; Lee, C.; Zhang, X. GIS-based geo-environmental evaluation for urban land-use planning: A case study. Eng. Geol. 2001, 61, 257–271. [Google Scholar] [CrossRef]
- Piyoosh, R.; Ramesh Chandra, L. Landslide risk analysis between Giri and Tons Rivers in Himachal Himalaya (India). Int. J. Appl. Earth Obs. Geoinf. 2000, 2, 153–160. [Google Scholar] [CrossRef]
- Pham, B.T.; Tien Bui, D.; Dholakia, M.; Prakash, I.; Pham, H.V. A comparative study of least square support vector machines and multiclass alternating decision trees for spatial prediction of rainfall-induced landslides in a tropical cyclones area. Geotech. Geol. Eng. 2016, 34, 1807–1824. [Google Scholar] [CrossRef]
- Hobbs, W.H. Lineaments of the Atlantic border region. Bull. Geol. Soc. Am. 1904, 15, 483–506. [Google Scholar] [CrossRef]
- Miloš, M.; Miloš, K.; Branislav, B.; Vít, V. Landslide susceptibility assessment using SVM machine learning algorithm. Eng. Geol. 2011, 123, 225–234. [Google Scholar] [CrossRef]
- Park, N.-W. Using maximum entropy modeling for landslide susceptibility mapping with multiple geoenvironmental data sets. Environ. Earth Sci. 2015, 73, 937–949. [Google Scholar] [CrossRef]
- Pham, B.T.; Bui, D.T.; Prakash, I.; Dholakia, M.B. Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. Catena 2017, 149, 52–63. [Google Scholar] [CrossRef]
- Meijun, Z.; Mengzhen, Y.; Guoxiang, Y.; Gang, M. Risk analysis of road networks under the influence of landslides by considering landslide susceptibility and road vulnerability: A case study. Nat. Hazards Res. 2023, 4, 387–400. [Google Scholar] [CrossRef]
- Xian, Y.; Wei, X.; Zhou, H.; Chen, N.; Liu, Y.; Liu, F.; Sun, H. Snowmelt-triggered reactivation of a loess landslide in Yili, Xinjiang, China: Mode and mechanism. Landslides 2022, 19, 1843–1860. [Google Scholar] [CrossRef]
- Zhuang, M.; Gao, W.; Zhao, T.; Hu, R.; Wei, Y.; Shao, H.; Zhu, S. Mechanistic Investigation of Typical Loess Landslide Disasters in Ili Basin, Xinjiang, China. Sustainability 2021, 13, 635. [Google Scholar] [CrossRef]
- Wu, X.; Xu, X.; Yu, G.; Ren, J.; Yang, X.; Chen, G.; Xu, C.; Du, K.; Huang, X.; Yang, H. The China Active Faults Database (CAFD) and its web system. Earth Syst. Sci. Data 2024, 16, 3391–3417. [Google Scholar] [CrossRef]
- Yong, Y.; Rensheng, C.; Guohua, L.; Zhangwen, L.; Xiqiang, W. Trends and variability in snowmelt in China under climate change. Hydrol. Earth Syst. Sci. 2022, 26, 305–329. [Google Scholar] [CrossRef]
- Cressie, N. The origins of kriging. Math. Geol. 1990, 22, 239–252. [Google Scholar] [CrossRef]
- Heckmann, T.; Gegg, K.; Gegg, A.; Becht, M. Sample size matters: Investigating the effect of sample size on a logistic regression susceptibility model for debris flows. Nat. Hazards Earth Syst. Sci. 2014, 14, 259–278. [Google Scholar] [CrossRef]
- Webb, G.I. Naïve Bayes. In Encyclopedia of Machine Learning; Sammut, C., Webb, G.I., Eds.; Springer US: Boston, MA, USA, 2010; pp. 713–714. [Google Scholar]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Brenning, A.; Long, S.; Fieguth, P. Detecting rock glacier flow structures using Gabor filters and IKONOS imagery. Remote Sens. Environ. 2012, 125, 227–237. [Google Scholar] [CrossRef]
- Ruß, G.; Brenning, A. Data mining in precision agriculture: Management of spatial information. In Proceedings of the Computational Intelligence for Knowledge-Based Systems Design: 13th International Conference on Information Processing and Management of Uncertainty, IPMU 2010, Dortmund, Germany, 28 June–2 July 2010; Proceedings 13. pp. 350–359. [Google Scholar] [CrossRef]
- MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics, 1967; University of California Press: Oakland, CA, USA, 1967; pp. 281–298. [Google Scholar]
- Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
- Rao, A.R.; Wang, H.; Gupta, C. Predictive Analysis for Optimizing Port Operations. Appl. Sci. 2025, 15, 2877. [Google Scholar] [CrossRef]
- Basist, A.; Bell, G.D.; Meentemeyer, V. Statistical relationships between topography and precipitation patterns. J. Clim. 1994, 7, 1305–1315. [Google Scholar] [CrossRef]
- Bingqi, Z.; Jingjie, Y.; Xiaoguang, Q.; Patrick, R.; Heigang, X. Climatic and geological factors contributing to the natural water chemistry in an arid environment from watersheds in northern Xinjiang, China. Geomorphology 2012, 153–154, 102–114. [Google Scholar] [CrossRef]
- Zheng, B.; Chen, K.; Li, B.; Li, Y.; Shi, L.; Fan, H. Climate change impacts on precipitation and water resources in Northwestern China. Front. Environ. Sci. 2024, 12, 1377286. [Google Scholar] [CrossRef]
- Heping, S.; Shi, Q.; Xingrong, L.; Xianxian, S.; Xingkun, W.; Dongyuan, S.; Sangjie, Y.; Jiale, H. Relationship between continuous or discontinuous of controlling factors and landslide susceptibility in the high-cold mountainous areas, China. Ecol. Indic. 2025, 172, 113313. [Google Scholar] [CrossRef]
- Jenks, G.F. The data model concept in statistical mapping. Int. Yearb. Cartogr. 1967, 7, 186–190. [Google Scholar]
No. | Indicator System | Number of Indicators | Source |
---|---|---|---|
1 | Elevation (m), Slope (°), Aspect, Slope length (m), Topographic wetness index (TWI), Plan curvature, Profile curvature, Distance from stream (m), Lithology, Distance from fault (m), Distance from geo-boundary (m), NDVI | 12 | Miloš et al. [28] |
2 | Slope (°), Elevation (m), Plan curvature, Profile curvature, Catchment area, Catchment height, Convergence index, Topographic wetness index (TWI), Aspect, Surface roughness | 10 | Goetz, Brenning, Petschko and Leopold [14] |
3 | Elevation (m), Slope angle (°), Distance from lineaments (m), Forest type, Lithology, Soli drainage | 6 | Park [29] |
4 | Slope angle (°), Aspect, Elevation (m), Curvature, Plan curvature, Profile curvature, Soil type, Land cover, Rainfall (mm), Distance to roads (m), Road density (km/km2), Distance to rivers (m), River density (km/km2), Distance to lineaments (m), Lineament density (km/km2), | 14 | Binh Thai et al. [30] |
5 | Slope (°), Aspect, Elevation (m), Plan curvature, STI, TWI, Distance to rivers (m), Distance to roads (m), Distance to faults (m), NDVI, Land use, Lithology, Rainfall (mm) | 13 | Chen, Peng, Hong, Shahabi, Pradhan, Liu, Zhu, Pei and Duan [21] |
6 | Slope (°), Elevation (m), Lithology, Distance to rivers (m), Relief amplitude (m), Rainfall (mm) | 6 | Meijun et al. [31] |
Model | Zoning | Area (km2) | Area Proportion | Number of Landslide Points | Landslide Point Proportion | Landslide Point Density (Point/km2) |
---|---|---|---|---|---|---|
NB-GLHM | Very low | 195.74 | 8.84% | 2 | 1.96% | 0.0102 |
Low | 457.69 | 20.66% | 23 | 22.55% | 0.0503 | |
Medium | 918.34 | 41.46% | 23 | 22.55% | 0.0250 | |
High | 347.35 | 15.68% | 25 | 24.51% | 0.0720 | |
Severe | 295.87 | 13.36% | 29 | 28.43% | 0.0980 | |
NB-Random | Very low | 360.16 | 16.26% | 1 | 0.98% | 0.0028 |
Low | 618.49 | 27.92% | 9 | 8.82% | 0.0146 | |
Medium | 745.53 | 33.66% | 21 | 20.59% | 0.0282 | |
High | 397.30 | 17.94% | 51 | 50.00% | 0.1284 | |
Severe | 93.51 | 4.22% | 20 | 19.61% | 0.2139 | |
SVM-GLHM | Very low | 771.69 | 34.84% | 1 | 0.98% | 0.0013 |
Low | 396.21 | 17.89% | 3 | 2.94% | 0.0076 | |
Medium | 252.14 | 11.38% | 2 | 1.96% | 0.0079 | |
High | 252.83 | 11.41% | 5 | 4.90% | 0.0198 | |
Severe | 542.13 | 24.48% | 91 | 89.22% | 0.1679 | |
SVM-Random | Very low | 1170.48 | 52.84% | 2 | 1.96% | 0.0017 |
Low | 263.89 | 11.91% | 2 | 1.96% | 0.0076 | |
Medium | 181.34 | 8.19% | 4 | 3.92% | 0.0221 | |
High | 191.23 | 8.63% | 12 | 11.76% | 0.0628 | |
Severe | 408.05 | 18.42% | 82 | 80.39% | 0.2010 | |
SVM-RF-GLHM | Very low | 836.32 | 37.76% | 1 | 0.98% | 0.0012 |
Low | 408.70 | 18.45% | 6 | 5.88% | 0.0147 | |
Medium | 329.52 | 14.88% | 17 | 16.67% | 0.0516 | |
High | 350.06 | 15.80% | 26 | 25.49% | 0.0743 | |
Severe | 290.40 | 13.11% | 52 | 50.98% | 0.1791 | |
SVM-RF-Random | Very low | 934.45 | 42.19% | 7 | 6.86% | 0.0075 |
Low | 589.61 | 26.62% | 18 | 17.65% | 0.0305 | |
Medium | 358.73 | 16.20% | 40 | 39.22% | 0.1115 | |
High | 181.34 | 8.19% | 14 | 13.73% | 0.0772 | |
Severe | 150.86 | 6.81% | 23 | 22.55% | 0.1525 | |
XGBoost-GLHM | Very low | 428.50 | 19.35% | 0 | 0.00% | 0.0000 |
Low | 613.06 | 27.68% | 3 | 2.94% | 0.0049 | |
Medium | 786.30 | 35.50% | 44 | 43.14% | 0.0560 | |
High | 230.71 | 10.42% | 22 | 21.57% | 0.0954 | |
Severe | 156.43 | 7.06% | 33 | 32.35% | 0.2110 | |
XGBoost-Random | Very low | 407.70 | 18.41% | 5 | 4.90% | 0.0123 |
Low | 534.19 | 24.12% | 6 | 5.88% | 0.0112 | |
Medium | 873.28 | 39.43% | 33 | 32.35% | 0.0378 | |
High | 251.73 | 11.36% | 32 | 31.37% | 0.1271 | |
Severe | 148.11 | 6.69% | 26 | 25.49% | 0.1755 |
Model | GLHM Median (IQR) | Random Median (IQR) | Δ | p-Values |
---|---|---|---|---|
AUROC(%) | ||||
NB | 90.49 (3.44) | 82.05 (4.29) | +8.44 | <0.001 *** |
SVM | 90.82 (2.93) | 83.71 (5.75) | +7.11 | <0.001 *** |
SVM-RF | 91.67 (2.83) | 88.22 (3.43) | +3.45 | 0.023 ∙ |
XGBoost | 94.61 (3.58) | 91.56 (4.51) | +3.04 | 0.029 ∙ |
Accuracy(%) | ||||
NB | 82.62 (0.76) | 71.31 (2.68) | +11.30 | <0.001 *** |
SVM | 82.84 (1.84) | 74.51 (2.48) | +8.33 | <0.001 *** |
SVM-RF | 81.59 (2.13) | 74.20 (4.70) | +7.40 | 0.002 ** |
XGBoost | 84.30 (2.81) | 75.99 (2.85) | +8.31 | <0.001 *** |
Model Pair | AUROC (%) | Model Pair | Accuracy (%) | ||
---|---|---|---|---|---|
Significance | Adj. Significance (Bonferroni) | Significance | Adj. Significance (Bonferroni) | ||
GLHM | |||||
NB-SVM | 1.000 | 1.000 | SVM-RF-NB | 0.007 | 0.044 |
NB-SVM-RF | 0.074 | 0.442 | SVM-RF-SVM | 0.074 | 0.442 |
NB-XGBoost | 0.007 | 0.044 * | SVM-RF-XGBoost | 0.007 | 0.044 * |
SVM-SVM-RF | 0.371 | 1.000 | NB-SVM | 0.371 | 1.000 |
SVM-XGBoost | 0.074 | 0.442 | NB-XGBoost | 0.007 | 0.044 * |
Random | |||||
NB-SVM | 0.074 | 0.442 | NB-SVM-RF | 0.371 | 1.000 |
NB-SVM-RF | 0.007 | 0.044 * | NB-SVM | <0.001 | <0.001 *** |
NB-XGBoost | <0.001 | <0.001 *** | NB-XGBOOST | 0.007 | 0.044 * |
SVM-SVM-RF | 0.007 | 0.044 * | SVM-RF-SVM | 1.000 | 1.000 |
SVM-XGBoost | <0.001 | <0.001 *** | SVM-RF-XGBOOST | 0.371 | 1.000 |
SVM-RF-XGBoost | 0.371 | 1.000 | SVM-XGBOOST | 0.371 | 1.000 |
Contributing Factors | Models | |||||||
---|---|---|---|---|---|---|---|---|
NB (GLHM) | SVM (GLHM) | SVM-RF (GLHM) | XGBoost (GLHM) | NB (Random) | SVM (Random) | SVM-RF (Random) | XGBoost (Random) | |
NDVI | 0.0544 | 0.2358 | 0.3202 | 0.1715 | 0.0366 | 0.2268 | 0.2542 | 0.0822 |
STI | 0.0649 | 0.2936 | 0.2947 | 0.1003 | 0.0573 | 0.3924 | 0.3447 | 0.1597 |
TWI | 0.0541 | 0.2029 | 0.1646 | 0.0471 | 0.0471 | 0.3015 | 0.2455 | 0.1070 |
Slope | 0.1374 | 0.0174 | 0.0962 | 0.0478 | 0.0832 | 0.1001 | 0.0363 | 0.1069 |
Profile curvature | 0.0693 | 0.1517 | 0.1582 | 0.0922 | 0.0587 | 0.0005 | 0.0588 | 0.0671 |
Distance to rivers | 0.0426 | 0.0704 | 0.0187 | 0.0911 | 0.0634 | 0.1008 | 0.0838 | 0.0643 |
Lithology | 0.0674 | 0.0381 | 0.0283 | 0.0827 | 0.0902 | 0.0442 | 0.0605 | 0.0606 |
Elevation | 0.0311 | 0.0383 | 0.0402 | 0.0448 | 0.0397 | 0.1260 | 0.0987 | 0.0880 |
Aspect | 0.0103 | 0.0911 | 0.0714 | 0.0561 | 0.0376 | 0.0803 | 0.0610 | 0.0382 |
Land cover | 0.0229 | 0.0613 | 0.1122 | 0.0366 | 0.0430 | 0.0212 | 0.0223 | 0.0436 |
Distance to roads | 0.0140 | 0.0783 | 0.1078 | 0.0184 | 0.0021 | 0.0290 | 0.0596 | 0.0703 |
Distance to faults | 0.0065 | 0.0413 | 0.0628 | 0.0343 | 0.0320 | 0.0527 | 0.0532 | 0.0428 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tang, Z.; Qiu, S.; Xia, H.; Lin, D.; Bai, M. Comparative Performance of Machine Learning Models for Landslide Susceptibility Assessment: Impact of Sampling Strategies in Highway Buffer Zone. Appl. Sci. 2025, 15, 8416. https://doi.org/10.3390/app15158416
Tang Z, Qiu S, Xia H, Lin D, Bai M. Comparative Performance of Machine Learning Models for Landslide Susceptibility Assessment: Impact of Sampling Strategies in Highway Buffer Zone. Applied Sciences. 2025; 15(15):8416. https://doi.org/10.3390/app15158416
Chicago/Turabian StyleTang, Zhenyu, Shumao Qiu, Haoying Xia, Daming Lin, and Mingzhou Bai. 2025. "Comparative Performance of Machine Learning Models for Landslide Susceptibility Assessment: Impact of Sampling Strategies in Highway Buffer Zone" Applied Sciences 15, no. 15: 8416. https://doi.org/10.3390/app15158416
APA StyleTang, Z., Qiu, S., Xia, H., Lin, D., & Bai, M. (2025). Comparative Performance of Machine Learning Models for Landslide Susceptibility Assessment: Impact of Sampling Strategies in Highway Buffer Zone. Applied Sciences, 15(15), 8416. https://doi.org/10.3390/app15158416