Binary Logistic Regression Outperforms Decision Tree Modeling for Event-Based Landslide Prediction: Application to Dynamic Hazard and Threshold Mapping in Central Italy
Abstract
1. Introduction
2. Methods
2.1. Study Area
2.2. Logistic Regression
2.3. Decision Tree Quest
2.4. Dataset and Analysis
- Land use, derived from Corine Land Cover at the first level, classified into 5 categories [29]: agricultural surfaces, artificial surfaces, semi-natural wooded territories, wetlands, and water bodies.
- Slope gradient, classified into 4 classes [6]: 0–16°, 16–25°, 25–35°, and >35°.
- Lithology, classified into 6 categories [6]: clays and marly clays with intercalations of sandstone, sands and conglomerates; deposits; limestones, flinty limestones, subordinately marls and clay marls; marls, clay marls and marly limestones; sandstones, marly clays, subordinately conglomerates; shales and marls encompassing calcareous, marly limestones and arenaceous bodies.
- Slope aspect, classified into 9 categories: flat, north, north-east, east, south-east, south, south-west, west, and north-west.
- Topographic Wetness Index (TWI), classified into 7 classes [30]: 0–2, 2–4, 4–6, 6–8, 8–10, 10–12, and 12–23.
- Antecedent soil saturation, expressed as a percentage for the day preceding the extreme precipitation event.
- Maximum daily precipitation during the event, expressed in mm.
- Cumulative precipitation over the duration of the extreme event, expressed in mm.
2.5. Precipitation Data Processing
2.6. Saturation Data Processing
3. Results
3.1. Statistical Analysis of BLR
3.2. Statistical Analysis of Decision Trees QUEST (Quick, Unbiased, Efficient Statistical Tree)
3.3. Comparison QUEST-BLR
3.4. Interpretation and Physical Meaning of Parameters Based on Statistical Results
3.5. Susceptibility, Hazard, and Activation Thresholds for the Study Area
4. Discussion
Limitations
5. Conclusions
- A Dynamic Susceptibility Map: Conditioning the model on average annual maximum rainfall provides a realistic baseline of landslide probability.
- A Scenario-Based Hazard Map: Using the 100-year return period rainfall transforms the susceptibility model into a temporal hazard assessment for extreme events.
- A Spatially Distributed Threshold Map: This novel product inverts the model to pinpoint the specific rainfall intensity required to exceed a critical probability threshold (p > 0.7) at any location. This effectively spatializes empirical thresholds, moving from a regional “if-then” curve to a localized “how much rain, and where” guidance system.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wu, X.; Chen, X.; Zhan, F.B.; Hong, S. Global research trends in landslides during 1991–2014: A bibliometric analysis. Landslides 2015, 12, 1215–1226. [Google Scholar] [CrossRef]
- Trigila, A.; Iadanza, C.; Guerrieri, L.; Hervás, J. The IFFI project (Italian landslide inventory): Methodology and results. Guidel. Mapp. Areas Risk Landslides Eur. 2007, 23, 15. [Google Scholar]
- Trigila, A.; Iadanza, C.; Bussettini, M.; Mariani, S.; D’Ascola, F.; Salmeri, A.; Cassese, M.L.; Pesarino, V.; Di Paola, G.; Romeo, S.; et al. Dissesto idrogeologico in Italia: Pericolosità e indicatori di rischio: Edizione 2024. Rapporto 2025, 415, 2025. [Google Scholar]
- Handwerger, A.L.; Huang, M.H.; Fielding, E.J.; Booth, A.M.; Bürgmann, R. A shift from drought to extreme rainfall drives a stable landslide to catastrophic failure. Sci. Rep. 2019, 9, 1569. [Google Scholar] [CrossRef]
- Gentilucci, M.; Materazzi, M.; Pambianchi, G. Statistical analysis of landslide susceptibility, Macerata province (Central Italy). Hydrology 2021, 8, 5. [Google Scholar] [CrossRef]
- Gentilucci, M.; Pelagagge, N.; Rossi, A.; Domenico, A.; Pambianchi, G. Landslide susceptibility using climatic–environmental factors using the weight-of-evidence method—A study area in Central Italy. Appl. Sci. 2023, 13, 8617. [Google Scholar] [CrossRef]
- Segoni, S.; Tofani, V.; Rosi, A.; Catani, F.; Casagli, N. Combination of rainfall thresholds and susceptibility maps for dynamic landslide hazard assessment at regional scale. Front. Earth Sci. 2018, 6, 85. [Google Scholar] [CrossRef]
- Liu, Q.; Zhao, Q.; Lan, Q.; Huang, C.; Yang, X.; Tang, Z.; Deng, M. Regional dynamic hazard assessment of rainfall–induced landslide guided by geographic similarity. Bull. Eng. Geol. Environ. 2024, 83, 501. [Google Scholar] [CrossRef]
- Bhandary, N.P.; Dahal, R.K.; Timilsina, M.; Yatabe, R. Rainfall event-based landslide susceptibility zonation mapping. Nat. Hazards 2013, 69, 365–388. [Google Scholar] [CrossRef]
- Antonetti, G.; Gentilucci, M.; Aringoli, D.; Pambianchi, G. Analysis of landslide susceptibility and tree felling due to an extreme event at mid-latitudes: Case study of Storm Vaia, Italy. Land 2022, 11, 1808. [Google Scholar] [CrossRef]
- Shahabi, H.; Rahimzad, M.; Tavakkoli Piralilou, S.; Ghorbanzadeh, O.; Homayouni, S.; Blaschke, T.; Lim, S.; Ghamisi, P. Unsupervised deep learning for landslide detection from multispectral sentinel-2 imagery. Remote Sens. 2021, 13, 4698. [Google Scholar] [CrossRef]
- Zhang, X.; Pun, M.O.; Liu, M. Semi-supervised multi-temporal deep representation fusion network for landslide mapping from aerial orthophotos. Remote Sens. 2021, 13, 548. [Google Scholar] [CrossRef]
- Chrysafi, A.A.; Tsangaratos, P.; Ilia, I.; Chen, W. Rapid Landslide Detection Following an Extreme Rainfall Event Using Remote Sensing Indices, Synthetic Aperture Radar Imagery, and Probabilistic Methods. Land 2024, 14, 21. [Google Scholar] [CrossRef]
- Ayalew, L.; Yamagishi, H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology 2005, 65, 15–31. [Google Scholar] [CrossRef]
- Alkhasawneh, M.S.; Ngah, U.K.; Tay, L.T.; Mat Isa, N.A.; Al-Batah, M.S. Modeling and testing landslide hazard using decision tree. J. Appl. Math. 2014, 2014, 929768. [Google Scholar] [CrossRef]
- Sun, D.; Wen, H.; Wang, D.; Xu, J. A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm. Geomorphology 2020, 362, 107201. [Google Scholar] [CrossRef]
- Yao, J.; Yao, X.; Zhao, Z.; Liu, X. Performance comparison of landslide susceptibility mapping under multiple machine-learning based models considering InSAR deformation: A case study of the upper Jinsha River. Geomat. Nat. Hazards Risk 2023, 14, 2212833. [Google Scholar] [CrossRef]
- Nhu, V.H.; Mohammadi, A.; Shahabi, H.; Ahmad, B.B.; Al-Ansari, N.; Shirzadi, A.; Geertsema, M.; Kress, V.R.; Karimzadeh, S.; Valizadeh Kamran, K.; et al. Landslide detection and susceptibility modeling on cameron highlands (Malaysia): A comparison between random forest, logistic regression and logistic model tree algorithms. Forests 2020, 11, 830. [Google Scholar] [CrossRef]
- Perlich, C.; Provost, F.; Simonoff, J.S. Tree induction vs. logistic regression: A learning-curve analysis. J. Mach. Learn. Res. 2003, 4, 211–255. [Google Scholar]
- Kirasich, K.; Smith, T.; Sadler, B. Random forest vs logistic regression: Binary classification for heterogeneous datasets. SMU Data Sci. Rev. 2018, 1, 9. [Google Scholar]
- Couronné, R.; Probst, P.; Boulesteix, A.L. Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinform. 2018, 19, 270. [Google Scholar] [CrossRef] [PubMed]
- Forkuor, G.; Hounkpatin, O.K.; Welp, G.; Thiel, M. High resolution mapping of soil properties using remote sensing variables in south-western Burkina Faso: A comparison of machine learning and multiple linear regression models. PLoS ONE 2017, 12, e0170478. [Google Scholar] [CrossRef] [PubMed]
- Halder, K.; Srivastava, A.K.; Ghosh, A.; Das, S.; Banerjee, S.; Pal, S.C.; Chatterjee, U.; Bisai, D.; Ewert, F.; Gaiser, T. Improving landslide susceptibility prediction through ensemble recursive feature elimination and meta-learning framework. Sci. Rep. 2025, 15, 5170. [Google Scholar] [CrossRef] [PubMed]
- Evangelia, C.; Jie, M.; Collins, G.; Steyerberg, E.; Verbakel, J.; Van Calster, B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 2019, 110, 12–22. [Google Scholar]
- Gentilucci, M.; Barbieri, M.; Burt, P. Climate and territorial suitability for the Vineyards developed using GIS techniques. In Conference of the Arabian Journal of Geosciences; Springer International Publishing: Cham, Switzerland, 2018; pp. 11–13. [Google Scholar]
- Hosmer, D.W.; Lemeshow, S. Applied Logistic Regression; Wiley: New York, NY, USA, 2000. [Google Scholar]
- Gentilucci, M.; Pambianchi, G. Prediction of snowmelt days using binary logistic regression in the Umbria-Marche apennines (Central Italy). Water 2022, 14, 1495. [Google Scholar] [CrossRef]
- Loh, W.Y.; Shih, Y.S. Split selection methods for classification trees. Stat. Sin. 1997, 7, 815–840. [Google Scholar]
- Gentilucci, M.; Rossi, A.; Pelagagge, N.; Aringoli, D.; Barbieri, M.; Pambianchi, G. GEV analysis of extreme rainfall: Comparing different time intervals to analyse model response in terms of return levels in the study area of central Italy. Sustainability 2023, 15, 11656. [Google Scholar] [CrossRef]
- Gentilucci, M.; Barbieri, M.; Younes, H.; Rihab, H.; Pambianchi, G. Analysis of Wildfire Susceptibility by Weight of Evidence, Using Geomorphological and Environmental Factors in the Marche Region, Central Italy. Geosciences 2024, 14, 112. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2025; Available online: https://www.R-project.org/ (accessed on 12 December 2025).
- Gentilucci, M.; Djouohou, S.I.; Barbieri, M.; Hamed, Y.; Pambianchi, G. Trend analysis of streamflows in relation to precipitation: A case study in Central Italy. Water 2023, 15, 1586. [Google Scholar] [CrossRef]
- European Soil Data Centre (ESDAC). European Soil Data Centre: Soil Data and Information Systems; Joint Research Centre, European Commission: Brussels, Belgium, 2023; Available online: https://esdac.jrc.ec.europa.eu/ (accessed on 25 January 2025).
- Cosby, B.J.; Hornberger, G.M.; Clapp, R.B.; Ginn, T.R. A statistical exploration of the relationships of soil moisture characteristics to the physical properties of soils. Water Resour. Res. 1984, 20, 682–690. [Google Scholar] [CrossRef]
- Rawls, W.J.; Brakensiek, D.L.; Saxton, K.E. Estimation of soil water properties. Trans. ASAE 1982, 25, 1316–1320. [Google Scholar] [CrossRef]
- Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration–Guidelines for Computing Crop Water Requirements (FAO Irrigation and Drainage Paper No. 56); Food and Agriculture Organization of the United Nations: Rome, Italy, 1998. [Google Scholar]
- Fu, Z.; Ma, H.; Wang, F.; Dou, J.; Zhang, B.; Fang, Z. An Integrated Framework of Positive-unlabeled and Imbalanced learning for Landslide Susceptibility Mapping. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 15596–15611. [Google Scholar] [CrossRef]
- Guzzetti, F.; Peruccacci, S.; Rossi, M.; Stark, C.P. The rainfall intensity–duration control of shallow landslides and debris flows: An update. Landslides 2008, 5, 3–17. [Google Scholar] [CrossRef]
- Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
- Pradhan, B.; Seeni, M.I.; Kalantar, B. Performance evaluation and sensitivity analysis of expert-based, statistical, machine learning, and hybrid models for producing landslide susceptibility maps. In Laser Scanning Applications in Landslide Assessment; Springer International Publishing: Cham, Switzerland, 2017; pp. 193–232. [Google Scholar]
- Sahin, E.K.; Colkesen, I.; Kavzoglu, T. A comparative assessment of canonical correlation forest, random forest, rotation forest and logistic regression methods for landslide susceptibility mapping. Geocarto Int. 2020, 35, 341–363. [Google Scholar] [CrossRef]
- Lombardo, L.; Cama, M.; Conoscenti, C.; Märker, M.; Rotigliano, E.J.N.H. Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: Application to the 2009 storm event in Messina (Sicily, southern Italy). Nat. Hazards 2015, 79, 1621–1648. [Google Scholar] [CrossRef]
- Nhu, V.H.; Shirzadi, A.; Shahabi, H.; Singh, S.K.; Al-Ansari, N.; Clague, J.J.; Jaafari, A.; Chen, W.; Miraki, S.; Dou, J.; et al. Shallow landslide susceptibility mapping: A comparison between logistic model tree, logistic regression, naïve bayes tree, artificial neural network, and support vector machine algorithms. Int. J. Environ. Res. Public Health 2020, 17, 2749. [Google Scholar] [CrossRef]
- Chen, W.; Yang, Z. Landslide susceptibility modeling using bivariate statistical-based logistic regression, naïve Bayes, and alternating decision tree models. Bull. Eng. Geol. Environ. 2023, 82, 190. [Google Scholar] [CrossRef]
- Rahman, H.A.A.; Wah, Y.B.; He, H.; Bulgiba, A. Comparisons of ADABOOST, KNN, SVM and logistic regression in classification of imbalanced dataset. In International Conference on Soft Computing in Data Science; Springer: Singapore, 2015; pp. 54–64. [Google Scholar]
- Sifa, S.F.; Mahmud, T.; Tarin, M.A.; Haque, D.M.E. Event-based landslide susceptibility mapping using weights of evidence (WoE) and modified frequency ratio (MFR) model: A case study of Rangamati district in Bangladesh. Geol. Ecol. Landsc. 2020, 4, 222–235. [Google Scholar] [CrossRef]
- Ng, C.W.W.; Yang, B.; Liu, Z.Q.; Kwan, J.S.H.; Chen, L. Spatiotemporal modelling of rainfall-induced landslides using machine learning. Landslides 2021, 18, 2499–2514. [Google Scholar] [CrossRef]
- Lee, M.J.; Park, I.; Won, J.S.; Lee, S. Landslide hazard mapping considering rainfall probability in Inje, Korea. Geomat. Nat. Hazards Risk 2016, 7, 424–446. [Google Scholar] [CrossRef]
- Alcântara, E.; Baião, C.F.; Guimarães, Y.C.; Mantovani, J.R.; Marengo, J.A. Machine learning reveals lithology and soil as critical parameters in landslide susceptibility for Petrópolis (Rio de Janeiro State, Brazil). Nat. Hazards Res. 2025, 5, 539–553. [Google Scholar] [CrossRef]








| Statistic | Intercept Only Model (Null Model) | Model with Predictors |
|---|---|---|
| Observations | 2097 | 2097 |
| Sum of weights | 2097.0000 | 2097.0000 |
| Df | 2096 | 2077 |
| −2 Log-Likelihood | 1227.9975 | 680.2939 |
| Mc Fadden’s R2 | 0.0000 | 0.4460 |
| Cox and Snell R2 | 0.0000 | 0.2299 |
| Nagelkerke R2 | 0.0000 | 0.5186 |
| Akaike Information Criterion (AIC) | 1229.9975 | 720.2939 |
| Schwarz Bayesian Criterion (SBC)/Bayesian Information Criterion (BIC) | 1235.6458 | 833.2592 |
| Test/Source | Chi-Square | Pr > Chi2 | Additional Notes |
|---|---|---|---|
| −2 Log-Likelihood Ratio | 547.7 | <0.0001 | The model is highly significant overall |
| Score Test | 412.4 | <0.0001 | Confirms global significance of the model |
| Wald Test | 242.5 | <0.0001 | Predictors are jointly significant |
| Max daily prec. | 29.3 (Wald), 117.3 (LR) | 0.0012; <0.0001 | Strongly significant predictor |
| Land use (CLC) | 6.2 (Wald), 97.8 (LR) | <0.0001 | Strongly significant predictor |
| Slope | 99.4 (Wald), 254.6 (LR) | <0.0001 | Strongly significant predictor |
| Lithology | 76.4 (Wald), 175.2(LR) | <0.0001 | Strongly significant predictor |
| Aspect | 62.0 (Wald), 176.9 | 0.0686; <0.0001 | Marginally significant (Wald); Significant (LR) |
| Hosmer–Lemeshow test | 7.0 | 0.54 | Model fit is adequate; H0 not rejected |
| From\To | 0 | 1 | Total | % Correct |
|---|---|---|---|---|
| 0 | 1576 | 341 | 1917 | 82.21% |
| 1 | 34 | 146 | 180 | 81.11% |
| Total | 1610 | 487 | 2097 | 82.12% |
| From\To | 0 | 1 | Total | % Correct |
|---|---|---|---|---|
| 0 | 665 | 164 | 829 | 80.22% |
| 1 | 13 | 58 | 71 | 81.69% |
| Total | 678 | 222 | 900 | 80.33% |
| Predictor | Unified Model | Interpretation of Effect |
|---|---|---|
| Rainfall Intensity | β = +0.026 (p < 0.0001) OR = 1.03 | A significant trigger. Each 1 mm increase in intensity increases the odds of failure by 2.6%. |
| Slope (16–25°) | β = −2.872 (p < 0.0001) OR = 0.06 | The most consistent stabilizing factor. This class has 94.3% lower odds of failure compared to the 0–16° reference class. |
| Lithology (Clays) | β = +1.796 (p < 0.0001) OR = 6.03 | The paramount risk factor. Presence of this lithology increases the odds of failure by over 6 times compared to deposits. |
| Aspect (Southeast) | β = −2.498 (p = 0.0002) OR = 0.08 | Southeast-facing slopes are associated with a drastic (91.8%) reduction in risk compared to South-facing slopes. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Gentilucci, M.; Younes, H.; Hadji, R.; Pambianchi, G. Binary Logistic Regression Outperforms Decision Tree Modeling for Event-Based Landslide Prediction: Application to Dynamic Hazard and Threshold Mapping in Central Italy. Earth 2026, 7, 56. https://doi.org/10.3390/earth7020056
Gentilucci M, Younes H, Hadji R, Pambianchi G. Binary Logistic Regression Outperforms Decision Tree Modeling for Event-Based Landslide Prediction: Application to Dynamic Hazard and Threshold Mapping in Central Italy. Earth. 2026; 7(2):56. https://doi.org/10.3390/earth7020056
Chicago/Turabian StyleGentilucci, Matteo, Hamed Younes, Rihab Hadji, and Gilberto Pambianchi. 2026. "Binary Logistic Regression Outperforms Decision Tree Modeling for Event-Based Landslide Prediction: Application to Dynamic Hazard and Threshold Mapping in Central Italy" Earth 7, no. 2: 56. https://doi.org/10.3390/earth7020056
APA StyleGentilucci, M., Younes, H., Hadji, R., & Pambianchi, G. (2026). Binary Logistic Regression Outperforms Decision Tree Modeling for Event-Based Landslide Prediction: Application to Dynamic Hazard and Threshold Mapping in Central Italy. Earth, 7(2), 56. https://doi.org/10.3390/earth7020056
