Next Article in Journal
Enhancing Wildfire Monitoring with SDGSAT-1: A Performance Analysis
Previous Article in Journal
Unveiling Forest Density Dynamics in Saihanba Forest Farm by Integrating Airborne LiDAR and Landsat Satellites
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Role of Collecting Data on Various Site Conditions Through Satellite Remote Sensing Technology and Field Surveys in Predicting the Landslide Travel Distance: A Case Study of the 2022 Petrópolis Disaster in Brazil

by
Thiago Dutra dos Santos
1 and
Taro Uchida
2,*
1
Graduate School of Science and Technology, University of Tsukuba, Tsukuba 305-8572, Japan
2
Institute of Life and Environmental Sciences, University of Tsukuba, Tsukuba 305-8577, Japan
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(19), 3337; https://doi.org/10.3390/rs17193337
Submission received: 8 August 2025 / Revised: 12 September 2025 / Accepted: 21 September 2025 / Published: 29 September 2025
(This article belongs to the Section Remote Sensing in Geology, Geomorphology and Hydrology)

Abstract

Highlights

What are the main findings?
  • To predict the landslide travel distance, it is necessary to consider not only the landslide scale itself, but also the site conditions of the initiation, runout, and deposition zones.
  • A substantial proportion of the site condition data was obtained through satellite remote sensing, while a smaller proportion was acquired through field surveys.
What is the implication of the main finding?
  • Even in regions with limited data, satellite remote sensing technologies can significantly improve the prediction of the distance reached by landslides.
  • The approach provides a scalable, cost-effective, and rapid framework, making it especially valuable for disaster risk reduction in resource-constrained regions.

Abstract

Landslide runout distance is governed not only by collapsed magnitude but also by site-specific geoenvironmental conditions. While remote sensing techniques has advanced landslide susceptibility mapping, its application to runout modeling remains limited. This study examined the role of collecting data on various site conditions through remote sensing and field surveys datasets in predicting the landslide travel distance from the 2022 disaster in Petrópolis, Rio de Janeiro. A total of 218 multivariate linear regression models were developed using morphometric, remote sensing, and field survey variables collected across collapse, transport, and deposition zones. Results show that predictive accuracy was limited when based solely on landslide scale (R2 = 0.06–0.10) but improved substantially with the inclusion of site condition data across collapse, transport, and deposition zones (R2 = 0.49–0.51). Additionally, model performance was strongly influenced by runout path typology, with channelized flows producing the most stable and accurate predictions (R2 = 0.73–0.90), while obstructed and open-slope paths performed worse (R2 = 0.39–0.61). These findings demonstrate that empirical models integrating multizonal site-condition data and runout path typology offer a scalable, reproducible framework for landslide hazard mapping in data-scarce, complex mountainous urban environments.

1. Introduction

Landslides are formally classified as geohazards in international frameworks and represent one of the most pervasive and destructive hazards worldwide, being strongly associated with hydrometeorological triggers, particularly intense rainfall, as well as seismic activity and anthropogenic disturbances [1,2]. To mitigate potential landslide impacts, hazard mapping is a fundamental activity. Hazard maps provide essential information for land use planning, early-warning, and evacuation systems, thereby supporting strategies to reduce potential losses and fatalities [3,4,5]. So, numerous studies have been conducted to create more accurate landslide disaster hazard maps [6,7,8,9,10,11,12].
Hazard maps of landslide disasters commonly consist of two parts: initiation of the landslide and its runout and deposition. Several studies have been conducted to assess the spatial patterns of landslide susceptibility, with a focus on landslide initiation [13,14,15]. While, landslide runout, defined as the total horizontal distance traveled by sediments, has been also studied in the last several decades [16,17,18,19,20,21,22]. Within the scope of landslide hazard assessment, predicting the maximum runout distance (or travel distance) is essential for delineating impact zones, informing urban planning, and designing early warning systems [2,16,21]. These methods are broadly categorized into empirical-statistical models and physically-based (or dynamic) simulations.
There are two complementary approaches that can be considered to improve the accuracy of landslide extent prediction. The first approach involves refining the model structure and modelling techniques, while the second approach involves improving the accuracy and quality of the input data used in the model. Recent advancements in remote sensing technology have made it possible to obtain more accurate and diverse spatial geoenvironmental data. Specifically, accurately selecting and collecting the variables that influence landslide extent can improve prediction accuracy. However, many regions still lack sufficient data on site conditions and landslides. In this study, we focused on the second approach. We would therefore like to discuss data collection methods, particularly those that can be used effectively in areas where data is currently insufficient.
Various empirical models focused on correlations between landslide magnitude (i.e., area and volume) and mobility coefficient (i.e., friction coefficient (H/L) or the angle of reach [21,22,23,24,25,26,27,28,29,30,31]. Previous studies showed that runout is a highly non-linear and multifactorial phenomenon, influenced not only by the volume and geometry of the initial failure but also by a complex interplay of terrain attributes, soil and lithological properties, hydrological conditions, and land use [29,32,33]. That is, despite similar landslide magnitudes, the mobility coefficients were widely scattered, suggesting that other factors, such as the landslide materials, water content, and the topography of the runout area, also impacted the landslide runout [7,16,34]. In physically based models, these additional factors are incorporated and considered to control landslide runout [35,36,37,38,39]. There are physically based models that commonly reproduce the actual travel of landslides [40,41,42]. However, it is essential to note that collecting sufficient input data, such as topography, the location of artificial objects, soil properties, and soil water content, for many sites with high landslide susceptibility often requires considerable costs and manpower [20,28].
In view of the aforementioned points, it is considered necessary to gather information on the various site conditions, since the runout distance of a landslide is influenced not only by the scale of the landslide itself, but also by various characteristics of the landslide materials and the site conditions. Advances in remote sensing technology have recently facilitated the acquisition of various spatial data, including topography, vegetation, and artificial objects [33,43,44,45,46]. Furthermore, recent studies have confirmed that even in areas where comprehensive landslide information is conventionally lacking, satellite imagery will provide important information for understanding the characteristics of landslides [47,48,49,50,51,52]. However, despite the existence of numerous studies that have utilized satellite imagery to assess spatial patterns of landslide susceptibility, i.e., initiation zone, there is a paucity of research that has focused on evaluating the runout of landslides [11,53]. Therefore, this study aims to advance landslide runout modeling by developing and evaluating a set of statistical-empirical models based on geoenvironmental site conditions derived from both remote sensing and field survey data. Therefore, the primary objective of this study was to evaluate the effectiveness of remote sensing-derived site-condition variables in predicting landslide runout distances in tropical densely urbanized mountainous regions. Specifically, we aimed to address the following research questions: (i) to what extent can the accuracy of landslide runout distance predictions be improved by incorporating the spatial distribution of site-condition variables obtained from satellite remote sensing data? (ii) in which landslide process zones—initiation, transport, or deposition—is the use of site-condition information most effective in enhancing runout prediction? To explore these questions, we applied a multivariate linear regression framework using the simplest statistical approach as a first step toward assessing the predictive utility of remote sensing variables. The study focused on the slopes surrounding the base of landslides in Petrópolis, Brazil, an area that has experienced severe damage from landslides runout in recent years.

2. Study Area

2.1. Petrópolis Municipality

Petrópolis is a historical and touristic imperial city in the mountainous Rio de Janeiro state region in southeastern Brazil (Figure 1). It covers a total area of 791,144 km2 and has a population of 278,881 [54]. Initially dominated by the Atlantic Rainforest biome, the landscape has undergone significant anthropogenic changes, creating a mosaic of urban areas, croplands, and pastures [55,56]. These alterations have increased the city’s vulnerability to natural hazards, with over 15,000 households in high or very high-risk zones [57].
Petrópolis is primarily located within the geomorphological context of the Serra do Mar Escarpment, with elevations ranging from 40 to 2500 m.a.s.l. and an average elevation of 845.5 m. The city features high plateaus, mountainous and high montane regions, dissected and gentle hills, steep mountainous escarpments, and areas with colluvial-talus and alluvial-colluvial deposits [58]. The regional geological setting refers to the Ribeira Mobile Belt, predominantly composed of highly metamorphic rocks such as gneisses with well-developed foliation and syn-tectonic granitoid rocks [59]. Locally, the city is underlain by the Neoproterozoic Rio Negro Complex, characterized mainly by migmatites and granitoids [60]. The soils predominantly consist of cambisol, latosol, and neosol [61]. These soils are primarily characterized by low-fertility clays, good drainage, and high aluminum saturation [62,63]. The steep slopes in the region inhibit the formation of deep soil layers, resulting in relatively shallow soils overlying the bedrock [56], evidenced by the frequent presence of rocky outcrops [64]. The climate is tropical highland type, marked by mild yearly temperatures and well-distributed rainfall. The precipitation is approximately 2200 mm [56]. The stormy season extends from October to March, with monthly rainfall often exceeding 230 mm/month [65].

2.2. Catastrophic Sediment Disasters in Petrópolis

The earliest recorded sediment disasters in Petrópolis occurred in January 1895, resulting in one fatality [66]. Notable events include the March 1966 disaster, where landslides, debris flows, and floods due to rainfall (320 mm over 18 h; and 675 mm over 3 days) caused 80 deaths and displaced 400 people [67]. In March 1979, another event resulted in 87 fatalities and left 300 homeless due to landslides [66,68]. On 5 February 1988, a substantial quantity of cumulative rainfall (776 mm over 24 days) led to a severe disaster that partially destroyed the city, resulting in over 134 fatalities and leaving more than 1000 people homeless [69]. In December 2001, an intense rainfall (300 mm over 14 h) triggered another episode, causing 50 fatalities and displacing 469 people [66]. The January 2011 disaster is considered the most significant in Brazilian history and the eighth worst globally [70,71]. Heavy rainfall (573.6 mm over 10 days; 260 mm over 24 h) triggered 3600 landslides, resulting in 947 deaths, over 400 missing persons, and more than 50,000 homeless. These numbers represent the cumulative losses across the seven severely affected municipalities, including Petrópolis [72]. In March 2013, intense rainfall (390 mm over 24 h) resulted in 34 fatalities, and in November 2016, another episode (155 mm over 24 h) caused two deaths due to rockfalls [66]. In 2022, the city experienced two torrential rainfall episodes, 15 February (258 mm/3 h) and 20 March (534 mm over 24 h), triggering several landslides (Figure 1), leading to 231 fatalities [73], intense damage to urban infrastructure, commercial and industrial areas. In March 2024, heavy rainfall (300 mm over 24 h) triggered 580 landslides and flooding, resulting in nine deaths and leaving 600 people homeless [74].

3. Materials and Methods

The February 2022 disaster was selected for detailed analysis because it represents the most tragic sediment-related event ever recorded in the municipality of Petrópolis, remains underexplored in the scientific literature, and was triggered by an extreme rainfall intensity of 258 mm in just 3 h.

3.1. Data Collection and Production

3.1.1. Landslide Inventory

The landslide inventory used in this study, comprising 171 mapped landslides, was derived from a pixel-based change detection approach using satellite imagery from the Landsat 8–9 Operational Land Imager (OLI), with a spatial resolution of 30 m. Pre- and post-event images were acquired from the United States Geological Survey (USGS) Earth Explorer platform [75], corresponding to dates before the landslide event on 20 March 2022, on 31 January 2022 (summer), and another after the event on 13 April 2022 (autumn). Landslide-affected areas were identified through a pixel-based change detection approach applied to spectral variations and subsequently converted into vector polygons for spatial analysis. To address geometric distortions inherent to pixel-based outputs, all polygons were manually validated and refined using satellite imagery in Google Earth Pro, ensuring alignment with actual geomorphic features. The landslide affected area were subdivided into two, collapse (C) and transport-deposition (TD) zones (Figure 2) [76,77]. Landslide area was calculated in a Geographic Information System (GIS) by delineating polygons and computing their surface area based on projected geometry. Landslide volume were estimated using geometric approximations. In the absence of pre- and post-event digital elevation models, it was not possible to calculate volumes through differential topography [2,20]. Therefore, we employed an empirical approach in which volume was approximated by multiplying the affected area by an assumed average thickness [29,78]. Following common practice in the literature and specific analyses of the 2022 disaster [64,72,73,79,80], an initial fixed thickness of 1 m was adopted. To evaluate the uncertainty associated with this assumption, a sensitivity analysis was performed by recalculating volumes with alternative thickness values of 0.5 m and 1.5 m. The results indicated that statistical performance remained unchanged across different thickness values.
To assess landslide mobility, we employed morphometric indices based on the relationship between the elevation drop and the runout distance of the landslide mass (Figure 2). For clarity, notations referring to totals are denoted with the subscript t—Ht (total height), Lt (total length), Vt (total volume); collapse-specific metrics with the subscript c—Hc, Lc, Wc, Ac, Vc; and transport–deposit metrics with the subscript td—Htd, Wtd. The first parameter used was the mobility index, commonly expressed as the ratio between the total fall height (H) and the horizontal travel distance (L), denoted as H/L. This ratio serves as a proxy for the apparent coefficient of friction, reflecting the degree of energy dissipation during movement [22,23]. Additionally, we calculated the equivalent of friction angle (α) or angle of reach, which is derived from the inverse tangent of the H/L ratio: tan(α) = H/L. This metric provides a physically interpretable measure that approximates the average slope of the landslide trajectory and has been widely adopted as a standardized mobility index in landslide studies [21,29]. The angle of reach enables the comparison of mobility across different landslide types and volumes, assuming that more mobile landslides travel longer distances relative to their fall height, thereby yielding lower α values. The authors demonstrated that while smaller landslides tend to exhibit higher angles of reach due to greater influence of local terrain and frictional resistance, large-volume landslides typically show a flattening trend in α values, attributed to momentum-driven dynamics and reduced relative friction. Therefore, the H/L ratio and angle of reach (α) serve as effective, scale-sensitive indicators of landslide mobility, particularly when rheological data or dynamic simulations are unavailable. To provide a detailed analysis of the influence of on-site condition data on landslide runout behavior and reduce the variability, the landslide inventory was categorized into four mobility classes based on mobility ratio (H/L): low mobility (1.00 ≤ H/L ≤ 3.00), moderate mobility (0.70 ≤ H/L < 1.00), high mobility (0.30 ≤ H/L < 0.70), and very high mobility (0.07 ≤ H/L < 0.30) [21].

3.1.2. Site-Condition Datasets

-
Dataset Source and Resolution
The selection of relevant landslide conditioning factors is a critical step in accurately modeling landslide behavior, especially in densely populated regions [1]. Guided by the understanding of landslide processes and the availability of spatial data [19,81], this study selected a total of 23 conditioning factors, which were systematically organized into four geoenvironmental thematic categories: (i) quantitative remote sensing—RS_Qt, (ii) qualitative remote sensing—RS_QL, (iii) quantitative field survey-based data—FS_QT, and (iv) qualitative field survey-based data—FS_QL (Table 1). The remote sensing dataset included a digital elevation model (DEM) derived from the Shuttle Radar Topography Mission (SRTM) (30 × 30 m resolution) [82]. Additionally, orthophotos at a scale of 1:30,000 [83] were employed as high-resolution visual references to support the interpretation of topographic features, infrastructure, and geomorphological context. Satellite imagery from Landsat 8 (Section 3.1) was also utilized to generate spectral indices relevant to terrain and surface condition analysis. In this study, we investigated the availability of Sentinel-2 imagery with high spatial resolution. However, the available scenes from the days immediately following the disaster in 2022 showed extensive cloud cover in the region of interest. As emergency and reconstruction activities in the damaged area significantly altered the site conditions after 10–15 days, Landsat OLI imagery was prioritized as it provided the best available coverage during the critical time window. Land use and land cover (LULC) data from the MapBiomas Collection 8 (1985–2022) were integrated into the analysis to characterize the anthropogenic and environmental settings that influence landslide behavior. This dataset, derived from Sentinel-2 imagery at a 10 × 10 m resolution [55], enables the detailed classification of urban areas, forest formations, pasturelands, and mosaic land use patterns, providing essential context for interpreting human-induced impacts on landslide extent. For this study, we employed a subset of classes restricted to those present within the landslide-affected urban area of Petrópolis, ensuring that the analysis captured only the local environmental conditions relevant to the event. To ensure consistency, the LULC layer (10 m) was resampled to a common 30 m grid prior to incorporation into the landslide inventory dataset. This harmonization minimized spatial misalignment and ensured comparability across datasets. Additional spatial layers included geomorphological units (1:80,000 scale), derived from the interpretation of orthophotos at 1:30,000 resolution [58], and road network (1:25,000) data obtained from OpenStreetMap [84]. Field survey thematic maps were incorporated to complement the environmental characterization of the study area. These included geological maps [60,85] and a soil type map [61], both at a 1:100,000 scale, as well as a rainfall isoet map representing average annual precipitation at a 1:50,000 scale [86].
-
Data Production
The geospatial data were processed and analyzed in QGIS and SAGA GIS [87]. The DEM was used to calculate widely adopted topographic parameters for landslide modeling. All terrain derivatives were computed for each 30 × 30 m grid cell, corresponding to the native resolution of the original SRTM dataset (Table 1; Figure 3). Topographic metrics included altitude (ALT), slope angle (SLP), aspect (ASP), plan curvature (PLCV), profile curvature (PRCV), topographic position index (TPI), topographic roughness index (TRI), and the length-slope factor (LSF) [78,81,88,89,90]. We also calculated topographic characteristics which have been used as a proxy of hydrological and sediment transport processes, such as the topographic wetness index (TWI), stream power index (SPI), and sediment transport index (STI) [91].
Spectral images were employed to estimate normalized indices that served as proxies for on-site environmental conditions (Table 1; Figure 3). The Normalized Difference Vegetation Index (NDVI) reflects vegetation health and density. NDVI was calculated based on ((NIR − RED))/((NIR + RED)), for Landsat 8, NIR corresponds to near infrared band 5 and Red to band 4 [92]. Normalized Difference Water Index (NDWI) provides a proxy for surface moisture conditions, with higher values indicating wetter soils or standing water. NDWI was calculated based on ((NIR − SWIR1))/((NIR + SWIR1)), for Landsat 8, NIR corresponds to band 5 and SWIR1 to short-weave infrared band 6 [93,94]. NDBI was calculated based on ((SWIR1 − NIR))/((SWIR1 + NIR)), for Landsat 8, SWIR1 corresponds to band 6 and NIR to band 5 [44]. The normalization spectral indices range is between −1 and +1.
To quantify structural, hydrological, and anthropogenic linear features influencing landslide behavior, four density-based indices—Drainage Density Index (DDI), Lineament–Fault Density Index (LFDI), Structural Density Index (SDI), and Road Density Index (RDI)—were computed using line density analysis (Table 1; Figure 3). Each index was calculated by applying a line density function over vector line datasets within a defined radius (kernel), producing a continuous raster surface that expresses the total length of linear features per unit area (km/km2). The DDI was derived from river and stream networks and reflects terrain dissection and potential soil saturation near channels. The LFDI was calculated from mapped geological faults and lineaments, representing zones of structural weakness [6,95,96]. The SDI combines lithological boundaries and discontinuities to capture broader structural complexity. These three indices were based on geological layer maps [60,85]. The RDI, computed from the OpenStreetMap [84] road network, serves as a proxy for anthropogenic disturbance and infrastructure exposure [95,97]. Density indices dataset was generated with a uniform spatial resolution of 30 m × 30 m. This standardization ensured comparability among indices (RDI, LFDI, SDI, and DDI) and avoided scale-related biases in the subsequent analyses.
Climatic influence was incorporated through long-term precipitation data (Table 1; Figure 3), represented by a 30-year (1977–2006) average annual rainfall [86]. The original map was available as a vector layer containing point data from meteorological stations across the region. To integrate this information into the spatial modeling framework, the point layer was interpolated into a continuous raster surface using the Inverse Distance Weighting (IDW).
-
Data Extraction
To characterize the on-site conditions influencing landslide behavior, we extracted geoenvironmental variables at three reference points for each mapped landslide body: the initiation point of collapse (C), the beginning of the transport phase (T), and the end of deposition (D) (Figure 2). These points were manually delineated based on visual interpretation of orthophotos and satellite imagery. At each point, values from all continuous and categorical raster layers were extracted using the native tools available in QGIS. Specifically, the “Sample Raster Values” tool was used to assign numerical values (e.g., slope, TRI, NDVI) to each point based on the underlying raster cell. In contrast, categorical values (e.g., land use, soil type, lithology) were assigned to each point by spatially intersecting point locations with vector-based polygon layers, using the “Join attributes by location” tool. Table 2 provides a summarized description of the qualitative site-condition classification variables considered in this study. This point-based sampling strategy ensured that each landslide was represented by a standardized set of site-specific parameters across its three process zones (C, T, D), enabling consistent and robust statistical comparison and modeling.

3.2. Statistical Analyses

This study conducted the statistical analyses using Jamovi version 2.5.3, an open-source statistical software.

3.2.1. Factor Analyses (FA)

Spearman’s rank correlation coefficient (SRC) test (ρ) was used for nonparametric data distribution to measure the strength and direction of the association between the transport-deposited (TD) length and the site-condition quantitative parameters. The strength of the correlation is interpreted based on: 0–0.2 (very weak), 0.2–0.4 (weak), 0.4–0.6 (moderate), 0.6–0.8 (strong), and 0.8–1 (very strong) [98]. Analysis of Variance (One-way ANOVA) was also used to investigate the effect of qualitative factors, such as soil type, geomorphological unit, and lithological types, on the landslide’s TD length [99]. Due to the non-normal distribution of the data, a non-parametric approach, the Kruskal–Wallis (KW) test, was employed [100,101]. This method assesses whether significant differences exist between the medians of multiple groups across different categorical variables [102]. The KW test provides a chi-square statistic (χ2) and a p-value indicating whether significant differences exist between the groups’ medians. Also, epsilon-squared (ε2) was calculated to estimate the effect size, classifying it as small effect (ε2 < 0.01), moderate effect (0.01 ≤ ε2 < 0.08), and significant effect (ε2 ≥ 0.08) [103].

3.2.2. Linear Regression Models

Multivariate Linear Regression (MLR) is a widely used statistical technique designed to model the relationship between a single dependent variable and multiple independent variables Equation (1) [33,104,105]. model performance was evaluated using the coefficient of determination (R2) and Root Mean Squared Error (RMSE).
Y i = β 0 + K = 1 K β K X i K + i ,
where Yi represents the dependent variable for the ith observation, β0 is the intercept term, and K = 1 K β K X i K corresponds to the summation of the product of coefficients βK and their corresponding independent variables (XiK) across all predictors (K). The term i is the error term (residual), accounting for the deviation of observed values from those predicted by the model. The coefficients (β) were estimated using the Ordinary Least Squares (OLS) method [106,107].

3.3. Research Design

To systematically investigate the controlling factors of site condition parameters on landslide runout distance, we adopted a sequential modeling framework (Figure 4). The research design was structured into three progressive phases, each addressing specific analytical objectives: (i) development of standalone models to evaluate the individual effects of distinct variable sets; (ii) construction of combined models to explore the effects of integrating multiple data sources; and (iii) development of multi-stage models to explore complex interactions across landslide process zones. A total of 218 MLR models were developed based on the complete landslide inventory (n = 171), ensuring a robust empirical basis for performance evaluation. The Model 1 comprised univariate models using post-failure morphometric (PFM) parameters from the collapsed zone, such as height (Hc), length (Lc), width (Wc), area (Ac), volume (Vc), and logarithmic values (LogAc and LogVc) (Figure 4). These variables are widely cited in runout prediction literature [2,19,21,29,104]. Model 2 accessed the individual predictive power of geoenvironmental site-condition variables, considering quantitative and qualitative inputs (Table 2). These were organized by source into remote sensing-based (2A) and field survey-based (2B) data acquisition approaches (Figure 4). Model 2 to 6 were independently developed for the collapse, transport, and deposition points. This approach allowed us to assess the specific influence of site-condition variables at each stage of landslide propagation, recognizing that different factors may control runout [18,32,39].
In the second modeling phase, we assessed whether integrating distinct data sources could enhance predictive accuracy [16]. Model 3 were developed by combining variables selected in factor analysis (Section 3.2.1), identified in four groups: remote sensing-based, quantitative (3Aqt), qualitative (3Aql), and field survey-based, quantitative (3Bqt), qualitative (3Bql). To control for redundancy, multicollinearity diagnostics were performed using the Variance Inflation Factor (VIF) and Tolerance (TOL) statistics. VIF values were interpreted using standard thresholds: VIF ≤ 5 indicating low collinearity, 5 < VIF ≤ 10 acceptable, and VIF > 10 as high [12,108]. Tolerance (1-VIF) was used as a complementary indicator, with values above 0.2 generally considered acceptable. Predictors exhibiting high collinearity were excluded to improve model robustness. Model’s predictor significance was further evaluated using the F-statistic, which tests the null hypothesis that all regression coefficients are equal to zero. A significant F-value indicates that the model explains a substantial portion of the variance in the dependent variable [12,109]. In Model 4, we systematically assessed the combined effect of grouped remote sensing, field survey, quantitative and qualitative geoenvironmental site-condition variables developed in model 3. In Model 5, post-failure morphometric parameters were combined with different geoenvironmental datasets (from Model 4). Model 6 further explored the complete interaction between PFM parameters and the integral geoenvironmental models for each landslide stage, offering a more general perspective of landslide behavior. Subsequently, in the third modeling stage, we developed Model 7, which combined PFM predictors with Model 6 variables from multiple landslide zones (i.e., collapse–transport—CD), collapse–deposit—CD, and transport–deposit—TD). Finally, Model 8 extended this approach by simultaneously integrating each morphometric descriptor and geoenvironmental variables from the three landslide stages into a single, thoroughly combined model (CTD-based). This configuration aimed to maximize predictive capacity by accounting for spatial continuity, environmental heterogeneity, and failure geometry in a unified framework. Through this stepwise modeling design, we established a systematic comparison of morphometric, remote sensing, and field-based predictors, identifying the influence of site conditioning factors for estimating landslide runout behavior, particularly in data-scarce and urbanized environments. The statistical outputs of the CTD-based models (type 8)—including multicollinearity diagnostics (Variance Inflation Factor—VIF and Tolerance values), standardized regression coefficients, normality tests, heteroscedasticity tests, and residual analyses—are provided in the Appendix B.
To assess the predictive uncertainty of each model, we calculated upper and lower bounds based on a proportional deviation from the observed values [110]. Specifically, we applied a scaled RMSE, using the expression ±K × RMSE, where K is a user-defined adjustment factor that controls the confidence envelope around the 1:1 reference line. In this study, we adopted a margin of K = ±0.30 [107]. The RMSE used for this calculation was obtained by computing the average RMSE across all models within each modeling group (e.g., Model 1) separately for each landslide stage (e.g., collapse). This approach ensured that the uncertainty bounds were tailored to the predictive behavior of each specific modeling context. These bounds were plotted around the 1:1 line in the predicted versus observed plots to visually delineate regions of overestimation (above the upper bound) and underestimation (below the lower bound).
To further assess model robustness and generalization capacity, we implemented a two-step validation procedure. First, a bootstrap resampling strategy was applied to the full dataset of 171 landslides. In each iteration, a bootstrap sample of size 171 was drawn with replacement for model calibration. Because of this procedure, some cases were sampled multiple times while others were excluded; on average, about 63% of the landslides appeared at least once in the training set, while the remaining ~37% (66 cases) formed the out-of-bag (OOB) set, which served as an independent test set. Second, 10-fold cross-validation was applied within each bootstrap training set to further estimate generalization capacity and reduce overfitting. This procedure was restricted to the final CTD-based models (type 8), which represent the culmination of the modeling framework. Generalization performance was evaluated using the R2, RMSE, and mean absolute error (MAE).

4. Results

4.1. General Characteristics of Landslides in the 2022 Petrópolis Event

The analysis of initiation zones revealed that the predominant lithological units associated with the landslides were diorites, tonalites, gabbros, and gneisses, collectively representing 75% of the events and belonging to the same geological formation. Secondary occurrences included migmatites and gneisses (16%) and granites (9%). In terms of geomorphological context, landslides were most frequently observed within the high hills domain (44%), followed by mountainous escarpments (26%) and the broader mountainous domain (22%). Regarding soil types, the majority of landslides occurred in urbanized areas with anthropogenic soil disturbance (35%), with natural soil types such as cambisols (29%) and latosols (22%) also showing notable concentrations. Additionally, approximately 14% of the events were located on steep slopes characterized by shallow soils overlaying exposed bedrock, indicating conditions of limited soil depth and high susceptibility to failure. The distribution analysis of landslide areas revealed that moderate-sized landslides are the most common (60%), followed by small landslides (37%) and medium–large-sized occurrences (4%). The volume distribution exhibited a broad spectrum ranging from very small to extremely large events. Most landslides fall within the moderate (33%), small (32%), and medium-large (23%) classes, spanning from 103 to 106 m3. Larger landslides, exceeding 106 m3, are less frequent (9%). Based on satellite image interpretation, the TD runout zones were classified into four morphological types: channelized (CHAN), channelized–obstructed (CHOB), obstructed (OBS), and unobstructed (UNOB), according to their flow trajectory and physical confinement. OBS paths were the most common (50%), followed by UNOB (22%), CHOB (20%), and CHAN (8%). Natural obstructions included barriers such as vegetation corridors, terraces, and overlapping landslide deposits, while anthropogenic obstructions were mainly related to roads, buildings, and cut-and-fill platforms. The depositional slope angles corresponding to these runout types ranged from 5° to 49°, with an overall average of 18°. In addition, CHAN landslides exhibited the lowest mean slope angles, while OBS and UNOB runouts were typically associated with steeper depositional zones. This suggests that greater slope inclination may favor more open flow trajectories, while flatter terrain promotes lateral spreading and obstruction by surrounding features.

4.2. Factor Analysis

4.2.1. Correlation Between Post-Failure Morphometric Parameters and Landslide Runout

The TD length was strongly associated with total length (0.89) and volume (0.61) but showed only a moderate correlation with total height (0.32). In the collapsed zone, TD length exhibited weak associations with height (0.24), length (0.22), width (0.19), altitude (0.13), and slope angle (0.12). For the transport-deposited zone, TD length exhibited a moderate link with height (0.32) and a weak correlation with width (0.26), transported altitude (0.05), slope angle (0.12), and deposition slope angle (0.02). A weak negative correlation was observed with deposition altitude (−0.05) (Table 3).

4.2.2. Correlation Between Site-Condition Features and TD Runout Length

Variable selection for each landslide process stage was primarily based on Spearman’s rank correlation coefficients, with a threshold of ρ ≥ 0.10 (Table 4). Several variables demonstrated consistent correlations across all stages, including ASP (0.15 ≤ ρ ≤ 0.28), TPI (0.12 ≤ ρ ≤ 0.23), SDI (−0.18 ≤ ρ ≤ −0.19), LFDI (−0.14 ≤ ρ ≤ −0.18), and PLCV (0.11 ≤ ρ ≤ 0.18), highlighting their pervasive influence on landslide dynamics from initiation through deposition. However, due to stage-specific, variable selection was tailored separately for each zone. For the collapsed and transport zones, selected variables included ALT, SLP, ASP, PLCV, PRCV, TPI, TRI, SPI, TWI, NDWI, LFDI, SDI, NDVI, NDBI, and RNF. In contrast, the deposition zone was characterized by a subset comprising ASP, PLCV, TPI, TRI, TWI, LFDI, SDI, NDVI, NDBI, and RNF.

4.2.3. Correlation Between Qualitative Parameters and TD Length

In the collapse zone, the one-way ANOVA showed that soil type (χ2 = 9.84; ε2 = 0.058) had the strongest effect, followed by LULC (χ2 = 3.73; ε2 = 0.022), on runout distance (Figure 5). Geomorphological units (χ2 = 2.56; ε2 = 0.015) and lithology (χ2 = 0.30; ε2 = 0.002) showed a weak correlation. In the transport phase, soil type (χ2 = 12.009; ε2 = 0.071) and lithology (χ2 = 9.94; ε2 = 0.058) showed a substantial impact. Geomorphological unit (χ2 = 2.44; ε2 = 0.014) and LULC (χ2 = 2.18; ε2 = 0.013) were weakly related with TD length. In the deposition stage, lithology (χ2 = 19.03; ε2 = 0.112), soil type (χ2 = 10.92; ε2 = 0.064) and geomorphological units demonstrated significant influence on the TD length (χ2 = 10.62; ε2 = 0.062). For LULC, results showed a lower influence (χ2 = 4.74; ε2 = 0.028).

4.3. Linear Regression Models

4.3.1. Univariate Regression Models (1 to 2)

Detailed outputs of the intermediate models (Models 2–6) are provided in Appendix A. The univariate models based on post-failure morphometric parameters from the collapse zone (Model 1) yielded coefficients of determination (R2) ranging from 0.06 to 0.10 (Figure 6). The best-performing variables were the length and LogAc (R2 = 0.10). Overall, the results indicate that individual morphometric parameters have limited predictive performance in explaining runout distance. Similarly, models utilizing a solitary site-condition variable (Model 2) exhibited very low predictive power (R2 < 0.10).
In summary, the analyses of intermediate models (Models 3 to 6) showed a pattern of progressive improvement in the R2 value as a number of parameters used in models increased from one to many (Appendix A). The results from Model 3 showed a slight improvement in performance by using multiple parameters relating to site conditions belonging to a single data group (quantitative or qualitative data from remote sensing or field surveys). In the collapse zone and transport zone, the highest performance was achieved when using a group of quantitative data from remote sensing (R2 = 0.14 and 0.11). The model that used data from multiple data groups (Model 4) demonstrated improved accuracy. The highest accuracy was achieved when using data from all field conditions, obtained through remote sensing and field surveys, with an R2 value of 0.21.
The addition of landslide morphometric parameters (Model 5) marked a moderate improvement compared with Model 3. The highest accuracy was achieved using collapse length (Lc) and environmental descriptors in the collapse and transport zones (R2 = 0.20). In terms of data source, the quantitative variables obtained from remote sensing yielded the strongest performance in the collapse and transport zones (R2 = 0.16–0.20). The model that used data from multiple data groups for site condition variables and landslide morphometric parameter (Model 6) reinforced and the highest R2 was 0.28 (Appendix A).

4.3.2. Multi-Stage Linear Regression Models (7 to 8)

The multistage integration models (Model 7), combine variables from two landslide stages. In the CT models, the highest R2 was achieved by combining collapse length with environmental variables (R2 = 0.35; Figure 7), followed by log-transformed variables and height (R2 = 0.32). In the CD models, predictive power increased substantially, with collapse length yielding the highest outcome (R2 = 0.44; Figure 8), while all other variables ranged between R2 = 0.42 and 0.43. A similar pattern was observed in the TD models, where length again achieved the best performance (R2 = 0.44; Figure 9). These results confirm that integrating environmental descriptors from multiple process zones enhances model accuracy, with collapse length (Lc) consistently emerging as the strongest post-failure predictor. Among the multi-stage configurations, CD and TD models exhibited equally high and stable predictive performance, with all R2 values concentrated between 0.42 and 0.44.
The final multistage morphometric models (Model 8) integrate collapse, transport, and deposit zones (CTD-based) variables, achieving the highest predictive accuracy among all tested configurations (Figure 10). The combination of collapse length (Lc_CTD) with geoenvironmental descriptors yielded the best performance (R2 = 0.51), followed closely by logarithmic transformations of volume and area (R2 = 0.50). Other metrics such as height, volume, and area also showed consistent results (R2 = 0.49), reinforcing the general utility of collapse morphometry across multiple stages. Despite slight differences among variables, all morphometric predictors produced stable and convergent performance in the full multistage context, with narrow variation in R2 (0.49–0.51). To verify the stability of these predictors, multicollinearity was evaluated VIF and TOL values. All predictors in the final models presented VIF < 4 and Tolerance > 0.25, indicating no critical multicollinearity. The complete statistical analysis is provided in the Appendix B.

4.3.3. Model’s Type 8 Cross-Validation

The performance of the CTD-based models (type 8) under 10-fold cross-validation is presented in Table 5. Across all specifications, predictive capacity declined sharply compared to training evaluation, with R2 values dropping to 0.10–0.15 and error metrics increasing to RMSE ≈ 70–73 m and MAE ≈ 47–49 m. Among the tested models, LC_CTD yielded the best performance (R2 = 0.15, RMSE = 70.33 m, MAE = 47.58 m).

5. Discussion

5.1. Can the Landslide Travel Distance Predictions Be Improved by Using Information Obtained from Satellite Remote Senszing?

The results demonstrate that predictive accuracy varied substantially depending on the type and integration of input data (Figure 11a). Models incorporating post-failure morphometric (PFM) parameters achieved lower performance (median R2 ≈ 0.08). Also, models based exclusively on remote sensing (RS) or field survey (FS) variables presented consistently low median R2 values (below 0.03–0.04). These revealed their limited individual capacity to capture the complexity of landslide runout dynamics in the study area. This relatively modest predictive value of PFM parameters aligns with prior empirical findings, which emphasize that PFM alone cannot reliably predict runout due to high dispersion in mobility coefficients across events of similar magnitude [26,27]. Improvements in model performance were evident when RS variables were combined with PFM datasets (median R2 = 0.14). This reflects the additive value of RS-based descriptors. Although RS data are frequently used in susceptibility modeling [111,112], their application in runout prediction remains relatively uncommon, particularly when decoupled from physically-based modeling frameworks [11]. The present results show that such variables can support empirical modeling if used in conjunction with site-specific parameters. The highest performance was obtained when RS, FS, and PFM variables were integrated in a unified model structure (median R2 > 0.35). The integrated configuration outperformed all others by a substantial margin, indicating that no single data type is sufficient to explain runout behavior under the topographic and anthropogenic complexity observed in urban settings.

5.2. Influence of Landslide Stage on Predictive Performance

Model performance increased progressively with the integration of multiple landslide stages, underscoring the cumulative nature of runout dynamics (Figure 11b). Substantial improvements were observed when two zones were combined. Models integrating collapse and transport reached a median R2 of 0.32, whereas combinations involving collapse and deposit or transport and deposit achieved higher accuracy (median R2 = 0.43). These findings align with conceptual models of landslide propagation, which emphasize that runout results from a sequence of interdependent processes—including initial failure, flow propagation, and terminal deposition—each governed by distinct physical controls [2,20,32]. The enhanced performance of models incorporating depositional components highlights the critical role of terminal-phase conditions. This is consistent with prior findings that emphasize the influence of terrain resistance and stratigraphic contrasts during the final stage of landslide mobility [10,39,78,113]. The highest model performance was observed when all three process stages were integrated (CTD), with a median R2 of 0.50. This supports the hypothesis that multistage modeling provides a more comprehensive representation of the physical controls governing landslide mobility, particularly in heterogeneous or anthropogenically modified terrain [7,114].

5.3. Influence of Landslide Runout Path

Hungr (2007) emphasized that confined channels promote compression of the debris mass, increasing basal pore pressure and reducing internal energy dissipation [115]. While McDougall & Hungr (2005) demonstrated that in unconfined settings, debris flows are more susceptible to lateral spreading, velocity decay, and irregular deposition, especially on convex or fragmented surfaces [18]. Studies by Scheevel (2017) and Guo et al. (2020) demonstrated that even minor obstructions can cause redirection, deceleration, or bifurcation of debris flows, depending on their relative position and physical configuration [90,114]. Frodella et al. (2022) documented that built environments often act as physical filters, interrupting the spatial continuity of flows and forcing premature deposition [10]. These previous studies suggested that the flow path geometry and the obstacles in flow path may give large impacts on travel distance of landslide.
Figure 12 shows the best advanced multivariate regression models (Model 6–8), which incorporate collapse length (Lc) along with geoenvironmental variables, stratified by typological TD runout classification (Section 4.1), as channelized (14 landslides), channelized-obstructed (34 landslides), obstructed (86 landslides), and unobstructed (38 landslides), offering a process-based framework to assess how topographic confinement and physical barriers condition the mobility of landslides in complex urban-mountainous settings. It is unfortunate that the sample size is limited, and the results are thought to be subject to considerable uncertainty. However, a brief examination of the findings is warranted. Among all classes, channelized flows exhibited the most consistent and high predictive performance (R2 = 0.73–0.90).
The channelized-obstructed class, although still associated with favorable predictive performance, revealed a moderate reduction in model accuracy compared to purely channelized flows (R2 = 0.47–0.63). In contrast, obstructed flows exhibiting no channelized flow path—displayed significantly reduced explanatory power (R2 = 0.39–0.70).
Finally, unobstructed flows, despite their apparent topographic open-slope configuration, presented the lowest model performance (R2 = 0.31–0.61). Taken together, these findings suggested that runout path morphology may be a control on the predictability of landslide mobility. These results reinforce the argument, supported by Legros (2002) and McDougall (2017), that integrating information on flow confinement and physical obstructions may improve prediction ability of landslide behavior, particularly in urbanized mountainous landscapes such as Petrópolis [20,29].

5.4. Limitations and Future Directions

The results of cross-validation (Table 5) indicate that even the final model has low generalization performance, as evidenced by the low R2 value and high error metrics. This trend likely reflects the problem of overfitting, which is often seen in empirically derived landslide prediction models [1,6,12,81]. As we mentioned in the introduction, we consider that the accuracy of landslide runout distance prediction is influenced by two factors: the modeling method and the input data. Based on this, there are two possible reasons for the low performance: first, several effects cannot be described as linear relationships; second, several variables are missing, even though these variables have a significant impact on landslide travel distances. Previous studies have shown that although morphometric indicators capture trends within datasets, their predictive transferability is constrained by the inherently non-linear and site-specific nature of runout processes, particularly in dense urbanized settings [21,29,30,115]. These findings underscore the limitations of multivariate linear regression for runout prediction in complex environments. To improve model performance, we have to test non-linear statistical methods and/or physical models in the future.
In terms of data collection, this study faced four significant limitations. Firstly, although this study prioritized the use of 30 m spatial resolution data for landslide research, focusing on the availability of data immediately after disasters and the ease with which historical data could be accessed across various regions, it is clear that a 30-m resolution may be too coarse to accurately represent the topography and detailed land use in mountainous areas [104,116]. Further consideration of an appropriate spatial resolution is therefore essential. Secondly, detailed data on landslide volume is lacking. Many previous studies have confirmed that landslide runout distance depends heavily on landslide volume e.g., [64,72,73,79,80]. However, while this study obtained data on average landslide depth, it did not obtain depth data for each individual landslide area. This presents a significant challenge given that collecting detailed landslide depth data is often difficult in many areas [2]. Furthermore, although remote sensing has provided more surface information, data on subsurface conditions remains limited. This may also have affected the accuracy of the predictions. Fourth, it is possible that an insufficient number of data may have had an impact of the model performance. It is considered that further data accumulation is necessary. In conclusion, higher prediction accuracy and stability can be achieved by collecting more data on site conditions, landslide morphology and landslide sediment characteristics, and combining this with non-linear statistical methods and/or physical models.

5.5. Practical Applications

The consistent identification of collapse length as the most robust predictor of runout (Section 5.1) suggests its use as a rapid screening metric for preliminary hazard assessments. model performance improved substantially when geoenvironmental data from multiple landslide stages were integrated, underscoring the need to consider collapse, transport, and deposition conditions jointly (Section 5.2). The results also showed that in highly urbanized and geomorphologically complex contexts, remote sensing and field survey data exerted greater influence on predictive performance. Additionally, stratification by runout path typology (Section 5.3) further revealed that channelized flows are the most predictable, whereas obstructed and open-slope flows showed lower predictability. Taken together, these findings demonstrate the practical potential of the models for supporting disaster risk reduction. Despite remaining challenges (Section 5.4), the results provide actionable insights for disaster risk management in resource-constrained contexts based on the developed data collection methods. First, it is essential to recognize that the impact area of a landslide depends on two main factors: magnitude and topographical conditions. When accurate magnitude estimates are not feasible, municipalities can develop multiple scenarios of varying magnitudes to delineate potential hazard zones. Integrating collapse length with susceptibility maps provides a practical approach to identifying slopes with higher potential reach and prioritizing them for detailed surveys or preventive interventions [11]. Second, in terms of site conditions, it is crucial to collect data not only from the initiation area but also from the flow path and deposition zone. Our results confirmed that incorporating deposition-zone parameters yielded the most stable predictive performance. Moreover, utilizing freely available remote sensing data provides an effective means of gathering site-condition information where field surveys are limited. The systematic accumulation of historical data on landslide shape, extent, and geoenvironmental conditions remains crucial for decision-making. This approach is particularly relevant in Brazil, where federal risk assessment methods remain largely heuristic and national inventories continue to serve as the primary basis for producing local hazard maps [1,3,11,20]. For instance, differentiating runout path highlights confined valleys and gullies as priority locations for infrastructure protection, while obstructed and open-slope flows require complementary field validation before supporting land-use decisions [2,3,16,21]. Third, prediction accuracy is not always guaranteed, even with sufficient data on magnitude and site conditions. It is therefore essential to account for uncertainty explicitly when creating hazard maps. One practical strategy is to delineate impact areas broadly enough to encompass the majority of past events, reducing the likelihood of unforeseen disasters in unexpected areas. Within this context, the proposed workflow offers clear benefits by enabling more detailed, evidence-based, and context-specific assessments, while remaining feasible for municipal civil defense agencies.

6. Conclusions

Predicting landslide runout is particularly challenging in tropical mountainous regions, where urban expansion, complex terrain, and scarce data hinder model development. While satellite remote sensing has advanced susceptibility mapping, its use in runout prediction remains limited, especially in Brazil’s urbanized slopes where physically based models are often unfeasible. This study applied a statistical-empirical approach integrating morphometric descriptors with geoenvironmental variables from remote sensing and field surveys across all landslide stages—collapse, transport, and deposition—to evaluate their predictive value and improve understanding of mobility patterns in complex urban tropical settings.
The results confirm that runout distance is not governed solely by the magnitude of the landslide, but is strongly conditioned by a range of geoenvironmental factors that vary across different landslide zones. A key finding of this study is that landslide runout predictions improve markedly when site-condition variables derived from satellite remote sensing and field surveys are incorporated. Models relying solely on post-failure morphometric parameters showed limited explanatory power (R2 = 0.06–0.10), underscoring the inadequacy of single-source predictors to represent the multifactorial nature of landslide mobility. By contrast, models integrating remote sensing and field-based variables—especially when combined with morphometric descriptors such as collapse length (Lc)—achieved significantly higher performance (R2 = 0.49–0.51). Furthermore, the inclusion of data from all three process zones (collapse, transport, deposition) substantially enhanced prediction accuracy. While single-stage models underperformed (R2 < 0.30), multistage combinations (CT, CD, TD) yielded stronger results (R2 ≈ 0.44), with full integration across all stages (CTD) producing the best outcomes (R2 = 0.49–0.51). These findings reinforce that landslide runout is governed by collective, stage-dependent mechanisms that require comprehensive representation. Furthermore, stratifying models by runout path morphology revealed that channelized flows enabled the most accurate and stable predictions (R2 = 0.73–0.90), due to directional confinement and kinetic efficiency. In contrast, obstructed and unobstructed (open-slope) flows—more susceptible to energy dissipation and topographic disruptions—exhibited lower and more dispersed predictive power (R2 = 0.39–0.61), emphasizing the importance of accounting for terrain variability in empirical modeling.
Overall, these findings demonstrated that robust landslide runout prediction requires an integrative strategy combining geoenvironmental data (landslide magnitude, remote sensing–derived data, field survey observations), multistage zoning of landslide processes, and consideration of runout path characteristics. This approach not only improves model performance but also ensures practical applicability in hazard assessment frameworks, particularly in data-scarce settings. Although this study advances empirical runout prediction, future research should explore non-linear modeling approaches to more effectively capture landslide behavior, incorporating dynamic variables such as rainfall thresholds and land cover change to strengthen early warning systems and territorial planning in landslide-prone regions.

Author Contributions

Conceptualization, T.D.d.S. and T.U.; methodology, T.D.d.S. and T.U.; software, T.D.d.S.; investigation, T.D.d.S.; data curation, T.D.d.S.; writing—original draft preparation, T.D.d.S.; writing—review and editing, T.D.d.S. and T.U.; validation, T.U.; supervision, T.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the SDGs Global Leader (Latin America) Program, Japan International Cooperation Agency (JICA), and Sabo Frontier Foundation.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request. The main dataset derived from public domain resources are available in SGB RiGeo at https://rigeo.sgb.gov.br/handle/doc/15692 (accessed on 23 March 2024).

Acknowledgments

We express our gratitude to the Japan International Co-operation Agency (JICA) for supporting the PhD scholar grant through the Knowledge Co-Creation Program; we also express our gratitude for financial support from Sabo Frontier Foundation. The authors would like to express appreciation for the valuable comments of the associated editors and anonymous reviewers.

Conflicts of Interest

The authors affirm that they have no competing interests, ensuring the integrity and impartiality of the research presented in this manuscript.

Abbreviations

The following abbreviations are used in this manuscript:
AcronymMeaning
AcCollapse area
ALTAltitude
ALTcCollapse altitude
ALTdDeposit altitude
ALTtTransport altitude
ANOVAAnalysis of Variance
ASPAspect
CCollapse zone
DDIDrainage Density Index
DEMDigital Elevation Model
DDeposition zone
ε2Epsilon-squared (effect size)
FAFactor Analysis
FSField Survey
GISGeographic Information System
HTotal fall height
H/LHeight over length ratio (mobility coefficient)
HcCollapse height
HtTotal height
HtdTransport–Deposit height
IDWInverse Distance Weighting
KWKruskal–Wallis test
LLength
LcCollapse length
LFDILineament–Fault Density Index
LogAcLog-transformed collapse area
LogVcLog-transformed collapse volume
LSFLength–Slope Factor
LULCLand Use and Land Cover
MLRMultivariate Linear Regression
NDVINormalized Difference Vegetation Index
NDWINormalized Difference Water Index
NDBINormalized Difference Built-up Index
OLIOperational Land Imager
OLSOrdinary Least Squares
PFMPost-Failure Morphometry
PLCVPlan Curvature
PRCVProfile Curvature
QtQuantitative variable
QlQualitative variable
R2Coefficient of Determination
RDIRoad Density Index
RMSERoot Mean Squared Error
RNFAverage Rainfall
RSRemote Sensing
SAGA GISSystem for Automated Geoscientific Analyses
SDIStructural Density Index
SLPSlope
SLPcCollapse slope angle
SLPdDeposit slope angle
SLPtTransport slope angle
SPIStream Power Index
SRCSpearman Rank Correlation
STISediment Transport Index
SRTMShuttle Radar Topography Mission
TTransport zone
TDTransport–Deposition zone
TOLTolerance (collinearity diagnostic)
TPITopographic Position Index
TRITerrain Ruggedness Index
TWITopographic Wetness Index
USGSUnited States Geological Survey
VcCollapse volume
VIFVariance Inflation Factor
VtTotal volume
WcCollapse width
WtdTransport–Deposited width

Appendix A. Intermediate Linear Regression Models

Appendix A.1. Univariate Regression Models (Type 2)

The univariate models based on geoenvironmental site-condition variables (Model 2) exhibited low predictive performance across all landslide stages, with R2 values below 0.10 in all cases (Figure A1, Figure A2 and Figure A3). For remote sensing quantitative variables, the best results were obtained for aspect and plan curvature in the collapse zone (R2 = 0.06; Figure A1a), aspect in the transport zone (R2 = 0.08; Figure A2a), and TPI in the deposition zone (R2 = 0.02; Figure A3a). For remote sensing qualitative variables, the highest R2 values were observed in the deposition zone for GeoUnits (R2 = 0.05) and LULC (R2 = 0.04; Figure A3b), with negligible performance in the other stages (Figure A1b and Figure A2b). Among the field survey quantitative variables, LFDI yielded the best result in the deposition zone (R2 = 0.03; Figure A3c), while all other predictors showed R2 ≤ 0.02 across stages (Figure A1c and Figure A2c). For field survey qualitative variables, soil type presented the highest R2 values in the collapse and transport zones (R2 = 0.06; Figure A1d and Figure A2d), and lithology reached its maximum in the deposition zone (R2 = 0.06; Figure A3d).
Figure A1. Standalone univariate linear regression models for collapse points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A1. Standalone univariate linear regression models for collapse points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a1
Figure A2. Standalone univariate linear regression models for transport points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments.
Figure A2. Standalone univariate linear regression models for transport points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments.
Remotesensing 17 03337 g0a2
Figure A3. Standalone univariate linear regression models for deposit points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A3. Standalone univariate linear regression models for deposit points. In (a) models based on quantitative remote sensing variables; in (b) models using qualitative remote sensing indicators; in (c) displays results from quantitative field survey parameters; and in (d) illustrates models derived from qualitative field survey-based assessments. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a3

Appendix A.2. Multivariate Linear Regression Models (Types 3 to 6)

The factor analysis-based models (Model 3) showed improved predictive performance compared to individual variables, particularly for the collapse and transport zones. In the collapse zone, the highest R2 was achieved by the remote sensing quantitative model (3Aqt; R2 = 0.14; Figure A4a), followed by field-based qualitative variables (3Bql; R2 = 0.06), field-based quantitative variables (3Bqt; R2 = 0.04), and remote sensing qualitative inputs (3Aql; R2 = 0.02). In the transport zone, the best results were obtained for the remote sensing quantitative group (3Aqt; R2 = 0.11) and the field-based qualitative model (3Bql; R2 = 0.09; Figure A4b), while other models remained below R2 = 0.05. In the deposition zone, the best-performing group was the remote sensing qualitative model (3Aql; R2 = 0.09), followed by the field-based qualitative model (3Bql; R2 = 0.08), and both 3Aqt and 3Bqt models (R2 = 0.05; Figure A4c).
Figure A4. Multivariate linear regression Model 3 (factor analysis-based) for each landslide stage. In (a) models for collapse points, in (b) models for transport points, and in (c) models for deposit points. Each panel displays four model types: remote sensing—quantitative (3Aqt), remote sensing—qualitative (3Aql), field survey—quantitative (3Bqt), and field survey—qualitative (3Bql). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A4. Multivariate linear regression Model 3 (factor analysis-based) for each landslide stage. In (a) models for collapse points, in (b) models for transport points, and in (c) models for deposit points. Each panel displays four model types: remote sensing—quantitative (3Aqt), remote sensing—qualitative (3Aql), field survey—quantitative (3Bqt), and field survey—qualitative (3Bql). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a4
The combined-source multivariate models (Model 4) demonstrated moderate improvements in predictive performance across all landslide zones. In the collapse zone, the highest R2 was obtained by the fully integrated model (4_comb_Col.; R2 = 0.21), followed by three configurations that combined remote sensing quantitative variables with either field-based quantitative (Aqt_Bqt_C), field-based qualitative (Aqt_Bql_C), or remote sensing qualitative variables (A_qt_ql_C), all yielding R2 = 0.17 (Figure A5a). In the transport zone, the best result was again achieved by the fully integrated model (4B_comb_Transp.; R2 = 0.21), followed by the combination of remote sensing quantitative and field survey qualitative variables (Aqt_Bql_T; R2 = 0.17; Figure A5b). In the deposition zone, the fully integrated model (4B_comb_Dep.; R2 = 0.21) outperformed all other configurations, with several combinations showing similar intermediate values (R2 = 0.13–0.14; Figure A5c).
Figure A5. Multivariate linear regression Model 4 (combined-source models) for each landslide stage. In (a) models for collapse points; in (b) models for transport points; and in (c) models for deposit points. Each panel displays model combinations integrating quantitative and/or qualitative geoenvironmental variables from remote sensing and field survey sources. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A5. Multivariate linear regression Model 4 (combined-source models) for each landslide stage. In (a) models for collapse points; in (b) models for transport points; and in (c) models for deposit points. Each panel displays model combinations integrating quantitative and/or qualitative geoenvironmental variables from remote sensing and field survey sources. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a5
The post-failure and factor analysis-based multivariate models (Model 5) yielded varied predictive performance depending on the type of geoenvironmental input and landslide stage. Using quantitative remote sensing variables, the best results were observed in combination with length in the collapse and transport zones (R2 = 0.20), followed by volume (R2 = 0.19 and 0.18, respectively). In the deposition zone, the length-based model also performed best (R2 = 0.14), confirming the strong contribution of morphometric dimensions when integrated with remote sensing indices (Figure A6a, Figure A7a and Figure A8a). Applying qualitative remote sensing variables, the overall performance was lower. The highest R2 values were obtained for length and volume in the transport zone (R2 = 0.10), with a modest improvement for length in the deposition zone (R2 = 0.11) and the collapse zone (R2 = 0.12; Figure A6b, Figure A7b and Figure A8b). In quantitative field survey models, the best performance was recorded in the deposition zone, where the combination of length and volume achieved an R2 of 0.17. Predictive power was lower in the collapse (R2 ≤ 0.11) and transport zones (R2 ≤ 0.12) (Figure A6c, Figure A7c and Figure A8c). By combining qualitative field survey variables, results were more balanced across all zones. In the transport zone, the length and volume variables reached R2 = 0.14, while in the deposition zone, these variables reached R2 = 0.16 and 0.14, respectively. In the collapse zone, the highest R2 for this group was length (R2 = 0.12; Figure A6d, Figure A7d and Figure A8d). In general, the most consistent and best-performing post-failure parameter was length (Lc), which achieved the highest R2 in all data types and landslide stages. In terms of data source, remote sensing quantitative models yielded the strongest predictive power. However, field survey qualitative models outperformed qualitative remote sensing models in the collapse and transport stages. In contrast, in the deposition zone, qualitative remote sensing models slightly surpass field survey-based models.
Figure A6. Multivariate linear regression models of type 5 (post-failure and factor-based models) for collapse points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A6. Multivariate linear regression models of type 5 (post-failure and factor-based models) for collapse points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a6
Figure A7. Multivariate linear regression models of type 5 (post-failure and factor-based models) for transport points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A7. Multivariate linear regression models of type 5 (post-failure and factor-based models) for transport points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables. K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a7
Figure A8. Multivariate linear regression Model 5 (post-failure and factor-based models) for deposit points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables.
Figure A8. Multivariate linear regression Model 5 (post-failure and factor-based models) for deposit points. In (a) models combining post-failure parameters with quantitative remote sensing variables; in (b) with qualitative remote sensing variables; in (c) with quantitative field survey variables; and in (d) with qualitative field survey variables.
Remotesensing 17 03337 g0a8
The post-failure and landslide stage-based models (Model 6) combined individual morphometric parameters with zone-specific geoenvironmental factors derived from all available data sources—remote sensing and field survey, quantitative and qualitative—after controlling for multicollinearity. These models were constructed separately for each landslide process point. In the collapse zone, the highest predictive performance was achieved by combining length with the full set of collapse-stage geoenvironmental variables (R2 = 0.28; Figure A9). In the transport zone, a similar pattern was observed: the best-performing model was length (R2 = 0.27; Figure A10), followed by volume (R2 = 0.26) and height (R2 = 0.25). In the deposition zone, the highest R2 was again obtained for length (R2 = 0.27; Figure A11), followed closely by volume (R2 = 0.26). Across all zones, length (Lc) consistently showed the strongest predictive capacity, reinforcing its importance when integrated with targeted environmental descriptors from each landslide stage.
Figure A9. Multivariate linear regression Model 6 (post-failure and complete landslide collapse-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the collapse zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A9. Multivariate linear regression Model 6 (post-failure and complete landslide collapse-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the collapse zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a9
Figure A10. Multivariate linear regression Model 6 (post-failure and complete landslide transport-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the transport zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A10. Multivariate linear regression Model 6 (post-failure and complete landslide transport-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the transport zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a10
Figure A11. Multivariate linear regression Model 6 (post-failure and complete landslide deposit-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the deposit zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure A11. Multivariate linear regression Model 6 (post-failure and complete landslide deposit-based models). Each panel displays combinations of post-failure parameters with geoenvironmental variables specific to the deposit zone, integrating all data sources (remote sensing and field survey, both quantitative and qualitative). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g0a11

Appendix B. Complete Statistical Analysis for CTD-Based Multivariate Linear Regression Models (Model 8)

Appendix B.1. CTD-Based Model

Table A1. Overall fit statistics and significance test.
Table A1. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
CTD-based 0.4810.18255.31.61621080.015
Table A2. Normality test results (Shapiro–Wilk).
Table A2. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro-Wilk0.962<0.001
Table A3. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A3. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan67.80.288
Table A4. Durbin–Watson test for residual autocorrelation.
Table A4. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1511.690.034
Table A5. Collinearity diagnostics (Variance Inflation Factor—VIF and Tolerance) for predictors.
Table A5. Collinearity diagnostics (Variance Inflation Factor—VIF and Tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.330.752
D.SOIL2.790.358
D.GEOU1.770.564
D.Litho1.740.576
D.ASP1.670.599
D.PLCV2.130.47
D.RNF2.770.361
D.TPI1.70.589
D.TWI2.960.338
D.TRI2.270.441
D.SDI2.340.428
D.NDVI1.70.587
D.LFDI3.640.275
T.PLCV2.630.38
T.TPI2.680.373
T.Litho2.050.488
T.SOIL1.610.623
T.LULC1.60.625
C.ASP1.770.564
C.PLCV2.190.456
C.RNF1.270.784
C.TPI2.70.371
C.TWI2.610.383
C.TRI2.530.396
C.LFDI3.460.289
C.NDVI2.340.427
C.NDBI2.220.451
C.Litho1.470.68
C.GEOU2.410.415
C.SOIL3.060.327
C.LULC1.490.67
Table A6. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A6. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a−0.026620.979
D.LULC:
   Mosaic of Uses—Forest Formation0.794990.4280.29114−0.43481.017
   Pasture—Forest Formation−0.553710.581−0.23268−1.06560.6003
   Rocky Outcrop—Forest Formation−1.486530.14−1.55343−3.62480.518
   Urban Area—Forest Formation−1.118210.266−0.24297−0.67370.1877
D.SOIL:
   CXbd4—AR30.908770.3651.14162−1.34843.6317
   CXbd6—AR33.100820.0021.818420.6562.9808
   LVAd24—AR32.132830.0351.341120.09472.5875
   Urban area—AR33.32460.0011.745510.70482.7862
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes0.077520.9380.04354−1.06971.1568
   Dissected hills and low hills—Alluvial-colluvial slopes−0.380930.704−0.23338−1.44780.981
   Floodplains—Alluvial-colluvial slopes0.391660.6960.27552−1.11891.6699
   High hills domain—Alluvial-colluvial slopes−0.482150.631−0.24528−1.25370.7631
   Mountainous Domain—Alluvial-colluvial slopes0.103140.9180.06488−1.18191.3116
   Mountainous escarpments—Alluvial-colluvial slopes0.071940.9430.05175−1.37421.4777
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.531190.129−0.5537−1.27050.1631
   NP3_gamma_3so—NP23_gamma_1rn−1.176460.242−0.72108−1.9360.4938
   NP3_gamma_3sor—NP23_gamma_1rn0.270730.7870.30479−1.92682.5363
   O1_gamma_6nfan—NP23_gamma_1rn−1.139990.257−1.7855−4.89011.3191
D.ASP−0.014780.988−0.00171−0.23130.2279
D.PLCV0.152190.8790.02245−0.270.3149
D.RNF−0.241980.809−0.04643−0.42680.3339
D.TPI1.382120.170.16273−0.07070.3961
D.TWI0.003620.9977.44 × 10−4−0.40630.4078
D.TRI−1.109010.27−0.17436−0.4860.1373
D.SDI0.545810.5860.08849−0.23290.4099
D.NDVI0.372830.710.04401−0.190.278
D.LFDI−2.072090.041−0.52276−1.0228−0.0227
T.PLCV−0.03040.976−0.00555−0.36750.3564
T.TPI−0.392790.695−0.07294−0.4410.2951
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.640480.5230.24076−0.50430.9859
   NP3_gamma_3so—NP23_gamma_1rn1.091140.2780.59516−0.4861.6763
T.SOIL:
   CXbd4—AR3−0.924490.357−0.42433−1.33410.4855
   CXbd6—AR3−0.224820.823−0.10161−0.99750.7943
   LVAd24—AR3−1.721670.088−0.74263−1.59760.1124
   Urban area—AR3−0.530310.597−0.20147−0.95450.5516
T.LULC:
   Mosaic of Uses—Forest Formation−1.188170.237−0.34351−0.91660.2296
   Pasture—Forest Formation0.254060.80.11943−0.81241.0512
   Rocky Outcrop—Forest Formation−1.678840.096−1.59302−3.47390.2878
   Urban Area—Forest Formation0.378880.7060.11997−0.50770.7476
C.ASP0.387950.6990.04771−0.19610.2915
C.PLCV0.799710.4260.12162−0.17980.4231
C.RNF0.925010.3570.08178−0.09350.257
C.TPI1.70180.0920.31813−0.05240.6887
C.TWI0.190750.8490.03451−0.32410.3931
C.TRI−0.522380.602−0.09154−0.43890.2558
C.LFDI1.932910.0560.46319−0.01180.9382
C.NDVI0.807540.4210.13109−0.19070.4529
C.NDBI1.66610.0990.2564−0.04860.5614
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.037470.970.00933−0.48440.5031
   NP3_gamma_3so—NP23_gamma_1rn−0.263050.793−0.10685−0.9120.6983
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.085020.932−0.11678−2.83962.6061
   High mountainous domain—Colluvial-Talus slopes−0.180980.857−0.21352−2.5522.125
   Mountainous domain—Colluvial-Talus slopes−0.129870.897−0.17054−2.77362.4325
   Mountainous escarpments—Colluvial-Talus slopes−0.233450.816−0.25682−2.43741.9238
C.SOIL:
   CXbd4—AR3−0.516180.607−0.63197−3.05881.7949
   CXbd6—AR3−2.489420.014−1.35166−2.4279−0.2754
   LVAd24—AR3−1.941570.055−1.25958−2.54550.0263
   Urban area—AR3−2.802290.006−1.53099−2.6139−0.4481
C.LULC:
   Mosaic of Uses—Forest Formation−1.149970.253−0.30729−0.83690.2224
   Pasture—Forest Formation−1.620910.108−0.64037−1.42350.1427
   Rocky Outcrop—Forest Formation−1.337430.184−1.23602−3.06790.5959
   Urban Area—Forest Formation−1.008680.315−0.31439−0.93220.3034
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A12. CTD-based model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A12. CTD-based model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a12

Appendix B.2. Hc-CTD Model

Table A7. Overall fit statistics and significance test.
Table A7. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Hc_CTD0.4910.19154.71.64631070.012
Table A8. Normality test results (Shapiro–Wilk).
Table A8. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro–Wilk0.96<0.001
Table A9. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A9. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan68.50.296
Table A10. Durbin–Watson test for residual autocorrelation.
Table A10. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1511.690.034
Table A11. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A11. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.747
D.SOIL2.820.355
D.GEOU1.780.561
D.Litho1.740.575
D.ASP1.680.594
D.PLCV2.140.466
D.RNF2.770.361
D.TPI1.70.589
D.TWI3.010.332
D.TRI2.270.441
D.SDI2.340.428
D.NDVI1.720.583
D.LFDI3.750.267
T.PLCV2.650.378
T.TPI2.770.361
T.Litho2.050.488
T.SOIL1.610.621
T.LULC1.60.623
C.ASP1.790.558
C.PLCV2.190.456
C.RNF1.280.78
C.TPI2.930.342
C.TWI2.610.383
C.TRI2.550.392
C.LFDI3.520.284
C.NDVI2.360.423
C.NDBI2.220.451
C.Litho1.480.678
C.GEOU2.420.413
C.SOIL3.110.322
C.LULC1.50.666
Height.C1.850.542
Table A12. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A12. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a−0.011310.991
D.LULC:
   Mosaic of Uses—Forest Formation1.034940.3030.38234−0.351.1147
   Pasture—Forest Formation−0.676280.5−0.2836−1.11490.5477
   Rocky Outcrop—Forest Formation−1.487190.14−1.54573−3.60610.5147
   Urban Area—Forest Formation−0.920890.359−0.20075−0.63290.2314
D.SOIL:
   CXbd4—AR30.765660.4460.9612−1.52753.4499
   CXbd6—AR32.804750.0061.662490.48742.8375
   LVAd24—AR31.847660.0671.17423−0.08562.4341
   Urban area—AR32.893160.0051.555820.48982.6219
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes0.031380.9750.01754−1.09031.1254
   Dissected hills and low hills—Alluvial-colluvial slopes−0.527820.599−0.32322−1.53720.8907
   Floodplains—Alluvial-colluvial slopes0.434630.6650.3042−1.08331.6917
   High hills domain—Alluvial-colluvial slopes−0.609810.543−0.30969−1.31650.6971
   Mountainous Domain—Alluvial-colluvial slopes0.003880.9970.00243−1.24051.2454
   Mountainous escarpments—Alluvial-colluvial slopes0.14420.8860.1033−1.31681.5234
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.422210.158−0.51301−1.22810.2021
   NP3_gamma_3so—NP23_gamma_1rn−1.232070.221−0.7515−1.96070.4577
   NP3_gamma_3sor—NP23_gamma_1rn0.219180.8270.24557−1.97552.4667
   O1_gamma_6nfan—NP23_gamma_1rn−1.090370.278−1.69972−4.791.3905
D.ASP−0.192370.848−0.02232−0.25240.2077
D.PLCV−0.029180.977−0.00431−0.29740.2888
D.RNF−0.19860.843−0.03792−0.41640.3406
D.TPI1.36360.1760.15971−0.07250.3919
D.TWI−0.258780.796−0.0537−0.46510.3577
D.TRI−1.097440.275−0.17162−0.48160.1384
D.SDI0.590640.5560.09528−0.22450.4151
D.NDVI0.552390.5820.06534−0.16920.2998
D.LFDI−1.662430.099−0.43001−0.94280.0828
T.PLCV0.110630.9120.02019−0.34150.3819
T.TPI2.31 × 10−414.42 × 10−5−0.3790.3791
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.676960.50.25316−0.48820.9945
   NP3_gamma_3so—NP23_gamma_1rn0.98810.3250.53743−0.54081.6156
T.SOIL:
   CXbd4—AR3−1.136080.258−0.5243−1.43920.3906
   CXbd6—AR3−0.325660.745−0.14673−1.03990.7465
   LVAd24—AR3−1.85630.066−0.79958−1.65350.0543
   Urban area—AR3−0.669170.505−0.25396−1.00630.4984
T.LULC:
   Mosaic of Uses—Forest Formation−1.068710.288−0.30835−0.88030.2636
   Pasture—Forest Formation0.486660.6270.23046−0.70831.1692
   Rocky Outcrop—Forest Formation−1.554740.123−1.47274−3.35060.4051
   Urban Area—Forest Formation0.422520.6730.13311−0.49140.7577
C.ASP0.594270.5540.07342−0.17150.3183
C.PLCV0.767080.4450.11606−0.18390.416
C.RNF0.763890.4470.06757−0.10780.2429
C.TPI0.999050.320.20175−0.19860.6021
C.TWI0.257520.7970.04638−0.31070.4034
C.TRI−0.710450.479−0.12485−0.47320.2235
C.LFDI1.635740.1050.39675−0.08410.8776
C.NDVI1.006360.3170.16403−0.15910.4871
C.NDBI1.692220.0940.25902−0.04440.5625
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−0.035840.971−0.00889−0.50060.4828
   NP3_gamma_3so—NP23_gamma_1rn−0.110830.912−0.04502−0.85020.7602
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.066330.947−0.09063−2.79922.618
   High mountainous domain—Colluvial-Talus slopes−0.206760.837−0.24264−2.56912.0838
   Mountainous domain—Colluvial-Talus slopes−0.028320.977−0.03708−2.63252.5583
   Mountainous escarpments—Colluvial-Talus slopes−0.284650.776−0.31163−2.48191.8586
C.SOIL:
   CXbd4—AR3−0.264380.792−0.32655−2.77512.122
   CXbd6—AR3−2.221290.028−1.21662−2.3024−0.1309
   LVAd24—AR3−1.546130.125−1.02692−2.34360.2898
   Urban area—AR3−2.162840.033−1.24679−2.3896−0.104
C.LULC:
   Mosaic of Uses—Forest Formation−1.316640.191−0.3522−0.88250.1781
   Pasture—Forest Formation−1.814120.072−0.7194−1.50550.0667
   Rocky Outcrop—Forest Formation−1.505250.135−1.39278−3.2270.4415
   Urban Area—Forest Formation−0.935170.352−0.2903−0.90570.3251
Height.C1.47640.1430.18792−0.06440.4402
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A13. Hc-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A13. Hc-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a13

Appendix B.3. Lc-CTD Model

Table A13. Overall fit statistics and significance test.
Table A13. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Lc_CTD0.5070.21753.91.75631070.006
Table A14. Normality test results (Shapiro–Wilk).
Table A14. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro-Wilk0.965<0.001
Table A15. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A15. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan70.20.248
Table A16. Durbin–Watson test for residual autocorrelation.
Table A16. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1611.680.022
Table A17. Collinearity diagnostics (Variance Inflation Factor—VIF and Tolerance) for predictors.
Table A17. Collinearity diagnostics (Variance Inflation Factor—VIF and Tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.747
D.SOIL2.810.356
D.GEOU1.780.561
D.Litho1.740.575
D.ASP1.680.596
D.PLCV2.140.467
D.RNF2.780.359
D.TPI1.70.589
D.TWI2.980.335
D.TRI2.280.439
D.SDI2.350.425
D.NDVI1.710.584
D.LFDI3.710.269
T.PLCV2.670.374
T.TPI2.710.369
T.Litho2.050.488
T.SOIL1.610.621
T.LULC1.610.622
C.ASP1.780.563
C.PLCV2.20.455
C.RNF1.280.784
C.TPI2.810.356
C.TWI2.610.383
C.TRI2.530.395
C.LFDI3.520.284
C.NDVI2.360.424
C.NDBI2.220.45
C.Litho1.470.679
C.GEOU2.410.414
C.SOIL3.120.321
C.LULC1.510.661
Length.C1.840.544
Table A18. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A18. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.20680.837
D.LULC:
   Mosaic of Uses—Forest Formation1.18460.2390.43012−0.28971.1499
   Pasture—Forest Formation−0.66080.51−0.27201−1.0880.544
   Rocky Outcrop—Forest Formation−1.49390.138−1.52807−3.55580.4996
   Urban Area—Forest Formation−0.74610.457−0.16072−0.58780.2663
D.SOIL:
   CXbd4—AR30.91160.3641.12092−1.31653.5584
   CXbd6—AR32.67930.0091.563920.40682.7211
   LVAd24—AR32.00490.0471.236960.01392.46
   Urban area—AR32.94710.0041.536180.50292.5695
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.10040.92−0.05535−1.14811.0374
   Dissected hills and low hills—Alluvial-colluvial slopes−0.6420.522−0.38718−1.58270.8083
   Floodplains—Alluvial-colluvial slopes0.1520.8790.10519−1.26691.4773
   High hills domain—Alluvial-colluvial slopes−0.84070.402−0.42323−1.42120.5748
   Mountainous Domain—Alluvial-colluvial slopes0.17720.860.10915−1.11181.3301
   Mountainous escarpments—Alluvial-colluvial slopes0.1030.9180.07253−1.32341.4685
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.57030.119−0.55578−1.25740.1458
   NP3_gamma_3so—NP23_gamma_1rn−1.08640.28−0.65245−1.8430.5381
   NP3_gamma_3sor—NP23_gamma_1rn0.34430.7310.3795−1.80572.5647
   O1_gamma_6nfan—NP23_gamma_1rn−1.18650.238−1.819−4.8581.22
D.ASP−0.2270.821−0.02584−0.25150.1998
D.PLCV−0.12860.898−0.0187−0.3070.2696
D.RNF−0.49490.622−0.09345−0.46780.2809
D.TPI1.44210.1520.1662−0.06230.3947
D.TWI−0.29670.767−0.0601−0.46170.3415
D.TRI−0.91620.362−0.14154−0.44780.1647
D.SDI0.80450.4230.12837−0.18790.4447
D.NDVI0.65610.5130.07632−0.15430.3069
D.LFDI−1.59120.115−0.40113−0.90090.0986
T.PLCV0.39010.6970.07083−0.28910.4307
T.TPI−0.04640.963−0.00852−0.37270.3557
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.79830.4260.29423−0.43651.0249
   NP3_gamma_3so—NP23_gamma_1rn1.1740.2430.62691−0.43171.6855
T.SOIL:
   CXbd4—AR3−1.13930.257−0.51356−1.40720.3801
   CXbd6—AR3−0.55190.582−0.2464−1.13150.6387
   LVAd24—AR3−1.9460.054−0.82422−1.66390.0154
   Urban area—AR3−0.71480.476−0.26649−1.00560.4726
T.LULC:
   Mosaic of Uses—Forest Formation−1.05840.292−0.30011−0.86220.262
   Pasture—Forest Formation0.73330.4650.34432−0.58661.2752
   Rocky Outcrop—Forest Formation−1.74440.084−1.62021−3.46140.221
   Urban Area—Forest Formation0.42150.6740.13063−0.48380.745
C.ASP0.51340.6090.06187−0.1770.3008
C.PLCV0.62880.5310.09388−0.20210.3898
C.RNF0.84860.3980.07349−0.09820.2452
C.TPI1.00430.3180.19128−0.18630.5689
C.TWI0.30330.7620.05375−0.29760.4051
C.TRI−0.6730.502−0.11563−0.45620.225
C.LFDI1.49580.1380.35703−0.11610.8302
C.NDVI1.11280.2680.17815−0.13920.4955
C.NDBI1.83370.0690.27663−0.02240.5757
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.22160.8250.05419−0.43050.5389
   NP3_gamma_3so—NP23_gamma_1rn−0.21540.83−0.08567−0.8740.7027
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.07080.944−0.09517−2.76052.5702
   High mountainous domain—Colluvial-Talus slopes−0.22050.826−0.25468−2.5442.0346
   Mountainous domain—Colluvial-Talus slopes−0.11780.906−0.15141−2.69952.3966
   Mountainous escarpments—Colluvial-Talus slopes−0.33950.735−0.36588−2.50231.7705
C.SOIL:
   CXbd4—AR3−0.27550.783−0.33192−2.72042.0565
   CXbd6—AR3−1.86050.066−1.02142−2.10980.0669
   LVAd24—AR3−1.48180.141−0.95904−2.24210.324
   Urban area—AR3−1.87760.063−1.06762−2.19480.0596
C.LULC:
   Mosaic of Uses—Forest Formation−1.69530.093−0.45563−0.98840.0772
   Pasture—Forest Formation−2.20720.029−0.88199−1.6741−0.0898
   Rocky Outcrop—Forest Formation−1.69590.093−1.55005−3.36190.2618
   Urban Area—Forest Formation−1.02930.306−0.31399−0.91870.2907
Length.C2.39620.0180.298930.05160.5462
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A14. Lc-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A14. Lc-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a14

Appendix B.4. Ac-CTD Model

Table A19. Overall fit statistics and significance test.
Table A19. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Ac_CTD0.4910.19154.81.64631070.013
Table A20. Normality test results (Shapiro–Wilk).
Table A20. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro–Wilk0.959<0.001
Table A21. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A21. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan65.30.397
Table A22. Durbin–Watson test for residual autocorrelation.
Table A22. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1421.710.038
Table A23. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A23. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.330.75
D.SOIL2.80.357
D.GEOU1.790.559
D.Litho1.740.575
D.ASP1.680.596
D.PLCV2.140.467
D.RNF2.780.36
D.TPI1.70.589
D.TWI2.990.335
D.TRI2.310.433
D.SDI2.340.427
D.NDVI1.730.577
D.LFDI3.660.273
T.PLCV2.680.372
T.TPI2.690.372
T.Litho2.050.487
T.SOIL1.610.622
T.LULC1.60.624
C.ASP1.780.561
C.PLCV2.230.449
C.RNF1.280.783
C.TPI2.760.362
C.TWI2.610.383
C.TRI2.570.39
C.LFDI3.480.288
C.NDVI2.370.422
C.NDBI2.220.45
C.Litho1.470.679
C.GEOU2.410.414
C.SOIL3.090.323
C.LULC1.510.664
Area.C1.670.598
Table A24. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A24. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.12330.902
D.LULC:
   Mosaic of Uses—Forest Formation0.94790.3450.34734−0.37911.0738
   Pasture—Forest Formation−0.57510.566−0.24047−1.06940.5884
   Rocky Outcrop—Forest Formation−1.47330.144−1.53207−3.59350.5293
   Urban Area—Forest Formation−1.04710.297−0.22668−0.65580.2025
D.SOIL:
   CXbd4—AR31.03970.3011.30479−1.18313.7927
   CXbd6—AR32.98630.0041.748430.58782.9091
   LVAd24—AR32.15740.0331.349810.10952.5901
   Urban area—AR33.29960.0011.724370.68842.7604
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.05230.958−0.02937−1.14161.0829
   Dissected hills and low hills—Alluvial-colluvial slopes−0.52350.602−0.32065−1.5350.8937
   Floodplains—Alluvial-colluvial slopes0.31310.7550.21949−1.17021.6091
   High hills domain—Alluvial-colluvial slopes−0.74180.46−0.38199−1.40280.6388
   Mountainous Domain—Alluvial-colluvial slopes0.17170.8640.10754−1.13451.3496
   Mountainous escarpments—Alluvial-colluvial slopes0.01380.9890.00989−1.41031.43
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.53080.129−0.55078−1.2640.1625
   NP3_gamma_3so—NP23_gamma_1rn−1.2330.22−0.75242−1.96210.4573
   NP3_gamma_3sor—NP23_gamma_1rn0.29080.7720.32577−1.8952.5465
   O1_gamma_6nfan—NP23_gamma_1rn−1.15060.252−1.79314−4.88251.2962
D.ASP−0.14130.888−0.01635−0.24570.213
D.PLCV−0.01650.987−0.00244−0.29540.2906
D.RNF−0.37840.706−0.07256−0.45270.3076
D.TPI1.36650.1750.16011−0.07220.3924
D.TWI−0.19150.848−0.03949−0.44820.3693
D.TRI−0.8220.413−0.13093−0.44670.1848
D.SDI0.61010.5430.09852−0.22160.4186
D.NDVI0.64530.520.07722−0.160.3145
D.LFDI−1.89770.06−0.47971−0.98080.0214
T.PLCV0.25170.8020.04663−0.32060.4139
T.TPI−0.27840.781−0.0516−0.4190.3158
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.69830.4860.26138−0.48061.0034
   NP3_gamma_3so—NP23_gamma_1rn1.21560.2270.66209−0.41771.7418
T.SOIL:
   CXbd4—AR3−0.93430.352−0.4267−1.3320.4786
   CXbd6—AR3−0.3020.763−0.13599−1.02870.7567
   LVAd24—AR3−1.69450.093−0.72744−1.57850.1236
   Urban area—AR3−0.48510.629−0.18348−0.93320.5663
T.LULC:
   Mosaic of Uses—Forest Formation−1.07720.284−0.31081−0.88280.2612
   Pasture—Forest Formation0.41080.6820.19329−0.73941.126
   Rocky Outcrop—Forest Formation−1.69090.094−1.5964−3.4680.2752
   Urban Area—Forest Formation0.45140.6530.1424−0.48290.7677
C.ASP0.53850.5910.06626−0.17760.3101
C.PLCV0.54260.5890.08335−0.22120.3879
C.RNF0.84080.4020.0741−0.10060.2488
C.TPI1.350.180.25741−0.12060.6354
C.TWI0.17220.8640.031−0.32590.3879
C.TRI−0.76780.444−0.13592−0.48690.215
C.LFDI1.76930.080.42451−0.05110.9001
C.NDVI1.0260.3070.16775−0.15640.4919
C.NDBI1.63160.1060.24993−0.05370.5536
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.01960.9840.00485−0.48650.4962
   NP3_gamma_3so—NP23_gamma_1rn−0.3690.713−0.14954−0.95290.6538
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.13630.892−0.18638−2.89752.5248
   High mountainous domain—Colluvial-Talus slopes−0.23480.815−0.27582−2.60442.0528
   Mountainous domain—Colluvial-Talus slopes−0.14570.884−0.19044−2.78082.4
   Mountainous escarpments—Colluvial-Talus slopes−0.30620.76−0.33562−2.50821.837
C.SOIL:
   CXbd4—AR3−0.53880.591−0.65642−3.07161.7587
   CXbd6—AR3−2.3070.023−1.2557−2.3347−0.1767
   LVAd24—AR3−1.79490.075−1.16457−2.45080.1216
   Urban area—AR3−2.40680.018−1.34458−2.4521−0.2371
C.LULC:
   Mosaic of Uses—Forest Formation−1.35280.179−0.3635−0.89620.1692
   Pasture—Forest Formation−1.86540.065−0.74576−1.53830.0468
   Rocky Outcrop—Forest Formation−1.50440.135−1.39298−3.22850.4425
   Urban Area—Forest Formation−0.93510.352−0.29039−0.9060.3253
Area.C1.44650.1510.16688−0.06180.3956
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A15. Ac-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A15. Ac-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a15

Appendix B.5. Vc0.5-CTD Model

Table A25. Overall fit statistics and significance test.
Table A25. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Vc_CTD_0.50.490.18954.81.63631070.013
Table A26. Normality test results (Shapiro–Wilk).
Table A26. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro–Wilk0.957<0.001
Table A27. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A27. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch–Pagan65.40.392
Table A28. Durbin–Watson test for residual autocorrelation.
Table A28. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1441.710.06
Table A29. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A29. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.749
D.SOIL2.820.354
D.GEOU1.790.558
D.Litho1.740.575
D.ASP1.690.59
D.PLCV2.150.466
D.RNF2.770.361
D.TPI1.70.589
D.TWI2.990.334
D.TRI2.290.436
D.SDI2.340.428
D.NDVI1.720.58
D.LFDI3.720.269
T.PLCV2.650.377
T.TPI2.710.37
T.Litho2.050.488
T.SOIL1.610.62
T.LULC1.60.624
C.ASP1.810.551
C.PLCV2.190.456
C.RNF1.280.783
C.TPI2.820.354
C.TWI2.610.383
C.TRI2.540.394
C.LFDI3.520.284
C.NDVI2.360.423
C.NDBI2.220.449
C.Litho1.470.68
C.GEOU2.410.414
C.SOIL3.120.321
C.LULC1.490.669
Vol.C_0.51.580.634
Table A30. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A30. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.085330.932
D.LULC:
   Mosaic of Uses—Forest Formation0.938210.350.34408−0.38291.0711
   Pasture—Forest Formation−0.697540.487−0.29354−1.12780.5407
   Rocky Outcrop—Forest Formation−1.511570.134−1.57327−3.63660.49
   Urban Area—Forest Formation−1.074370.285−0.23263−0.66190.1966
D.SOIL:
   CXbd4—AR30.973120.3331.21867−1.26393.7013
   CXbd6—AR32.693950.0081.620450.4282.8129
   LVAd24—AR32.055070.0421.289260.04562.5329
   Urban area—AR33.109070.0021.642320.59522.6895
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.078360.938−0.04412−1.16011.0719
   Dissected hills and low hills—Alluvial-colluvial slopes−0.580560.563−0.35812−1.58090.8647
   Floodplains—Alluvial-colluvial slopes0.331150.7410.23223−1.1581.6224
   High hills domain—Alluvial-colluvial slopes−0.743390.459−0.38403−1.40810.6401
   Mountainous Domain—Alluvial-colluvial slopes0.186730.8520.11719−1.12691.3613
   Mountainous escarpments—Alluvial-colluvial slopes0.005840.9950.00419−1.41781.4261
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.484740.141−0.53508−1.24950.1793
   NP3_gamma_3so—NP23_gamma_1rn−1.134970.259−0.69318−1.90390.5176
   NP3_gamma_3sor—NP23_gamma_1rn0.243690.8080.27328−1.94982.4964
   O1_gamma_6nfan—NP23_gamma_1rn−1.124410.263−1.75407−4.84661.3384
D.ASP−0.24740.805−0.02896−0.2610.2031
D.PLCV−0.024530.98−0.00363−0.29730.2901
D.RNF−0.336440.737−0.06444−0.44420.3153
D.TPI1.335130.1850.15667−0.07590.3893
D.TWI−0.18980.85−0.0392−0.44870.3703
D.TRI−0.902530.369−0.14282−0.45650.1709
D.SDI0.535550.5930.08648−0.23360.4066
D.NDVI0.588930.5570.07013−0.16590.3062
D.LFDI−1.752320.083−0.44995−0.9590.0591
T.PLCV0.122910.9020.02249−0.34030.3853
T.TPI−0.191390.849−0.03577−0.40630.3347
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.629360.530.23562−0.50650.9778
   NP3_gamma_3so—NP23_gamma_1rn1.18380.2390.64445−0.43471.7236
T.SOIL:
   CXbd4—AR3−1.034720.303−0.47449−1.38350.4346
   CXbd6—AR3−0.420220.675−0.19111−1.09270.7105
   LVAd24—AR3−1.812130.073−0.78−1.63330.0733
   Urban area—AR3−0.558660.578−0.21141−0.96160.5388
T.LULC:
   Mosaic of Uses—Forest Formation−1.227810.222−0.35363−0.92460.2173
   Pasture—Forest Formation0.327610.7440.15359−0.77581.083
   Rocky Outcrop—Forest Formation−1.560810.122−1.48048−3.36080.3999
   Urban Area—Forest Formation0.425730.6710.13432−0.49110.7598
C.ASP0.67260.5030.0843−0.16420.3327
C.PLCV0.768780.4440.11647−0.18390.4168
C.RNF0.859730.3920.07579−0.0990.2505
C.TPI1.222950.2240.23845−0.14810.625
C.TWI0.194290.8460.035−0.32220.3922
C.TRI−0.658510.512−0.11549−0.46320.2322
C.LFDI1.643140.1030.39947−0.08250.8814
C.NDVI0.991390.3240.1618−0.16170.4853
C.NDBI1.570650.1190.24133−0.06330.5459
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.080770.9360.02005−0.4720.512
   NP3_gamma_3so—NP23_gamma_1rn−0.340070.734−0.13779−0.9410.6654
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.130440.896−0.17854−2.8922.5349
   High mountainous domain—Colluvial-Talus slopes−0.239570.811−0.28173−2.6132.0495
   Mountainous domain—Colluvial-Talus slopes−0.203210.839−0.26614−2.86252.3302
   Mountainous escarpments—Colluvial-Talus slopes−0.297150.767−0.3259−2.50011.8483
C.SOIL:
   CXbd4—AR3−0.507910.613−0.61932−3.03651.7979
   CXbd6—AR3−1.953230.053−1.11039−2.23740.0166
   LVAd24—AR3−1.692460.093−1.10905−2.40810.19
   Urban area—AR3−2.306880.023−1.30916−2.4342−0.1842
C.LULC:
   Mosaic of Uses—Forest Formation−1.209350.229−0.32209−0.85010.2059
   Pasture—Forest Formation−1.728530.087−0.68211−1.46440.1002
   Rocky Outcrop—Forest Formation−1.378440.171−1.26913−3.09430.5561
   Urban Area—Forest Formation−0.962610.338−0.29899−0.91470.3167
Vol.C.0.51.375110.1720.14975−0.06610.3656
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A16. Vc0.5-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A16. Vc0.5-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a16

Appendix B.6. Vc1.0-CTD Model

Table A31. Overall fit statistics and significance test.
Table A31. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Vc_CTD_1.00.490.18954.81.63631070.013
Table A32. Normality test results (Shapiro–Wilk).
Table A32. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro-Wilk0.957<0.001
Table A33. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A33. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan65.40.392
Table A34. Durbin–Watson test for residual autocorrelation.
Table A34. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1441.710.058
Table A35. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A35. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.749
D.SOIL2.820.354
D.GEOU1.790.558
D.Litho1.740.575
D.ASP1.690.59
D.PLCV2.150.466
D.RNF2.770.361
D.TPI1.70.589
D.TWI2.990.334
D.TRI2.290.436
D.SDI2.340.428
D.NDVI1.720.58
D.LFDI3.720.269
T.PLCV2.650.377
T.TPI2.710.37
T.Litho2.050.488
T.SOIL1.610.62
T.LULC1.60.624
C.ASP1.810.551
C.PLCV2.190.456
C.RNF1.280.783
C.TPI2.820.354
C.TWI2.610.383
C.TRI2.540.394
C.LFDI3.520.284
C.NDVI2.360.423
C.NDBI2.220.449
C.Litho1.470.68
C.GEOU2.410.414
C.SOIL3.120.321
C.LULC1.490.669
Volume collapse (1.0)1.580.634
Table A36. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A36. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.085330.932
D.LULC:
   Mosaic of Uses—Forest Formation0.938210.350.34408−0.38291.0711
   Pasture—Forest Formation−0.697540.487−0.29354−1.12780.5407
   Rocky Outcrop—Forest Formation−1.511570.134−1.57327−3.63660.49
   Urban Area—Forest Formation−1.074370.285−0.23263−0.66190.1966
D.SOIL:
   CXbd4—AR30.973120.3331.21867−1.26393.7013
   CXbd6—AR32.693950.0081.620450.4282.8129
   LVAd24—AR32.055070.0421.289260.04562.5329
   Urban area—AR33.109070.0021.642320.59522.6895
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.078360.938−0.04412−1.16011.0719
   Dissected hills and low hills—Alluvial-colluvial slopes−0.580560.563−0.35812−1.58090.8647
   Floodplains—Alluvial-colluvial slopes0.331150.7410.23223−1.1581.6224
   High hills domain—Alluvial-colluvial slopes−0.743390.459−0.38403−1.40810.6401
   Mountainous Domain—Alluvial-colluvial slopes0.186730.8520.11719−1.12691.3613
   Mountainous escarpments—Alluvial-colluvial slopes0.005840.9950.00419−1.41781.4261
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.484740.141−0.53508−1.24950.1793
   NP3_gamma_3so—NP23_gamma_1rn−1.134970.259−0.69318−1.90390.5176
   NP3_gamma_3sor—NP23_gamma_1rn0.243690.8080.27328−1.94982.4964
   O1_gamma_6nfan—NP23_gamma_1rn−1.124410.263−1.75407−4.84661.3384
D.ASP−0.24740.805−0.02896−0.2610.2031
D.PLCV−0.024530.98−0.00363−0.29730.2901
D.RNF−0.336440.737−0.06444−0.44420.3153
D.TPI1.335130.1850.15667−0.07590.3893
D.TWI−0.18980.85−0.0392−0.44870.3703
D.TRI−0.902530.369−0.14282−0.45650.1709
D.SDI0.535550.5930.08648−0.23360.4066
D.NDVI0.588930.5570.07013−0.16590.3062
D.LFDI−1.752320.083−0.44995−0.9590.0591
T.PLCV0.122910.9020.02249−0.34030.3853
T.TPI−0.191390.849−0.03577−0.40630.3347
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.629360.530.23562−0.50650.9778
   NP3_gamma_3so—NP23_gamma_1rn1.18380.2390.64445−0.43471.7236
T.SOIL:
   CXbd4—AR3−1.034720.303−0.47449−1.38350.4346
   CXbd6—AR3−0.420220.675−0.19111−1.09270.7105
   LVAd24—AR3−1.812130.073−0.78−1.63330.0733
   Urban area—AR3−0.558660.578−0.21141−0.96160.5388
T.LULC:
   Mosaic of Uses—Forest Formation−1.227810.222−0.35363−0.92460.2173
   Pasture—Forest Formation0.327610.7440.15359−0.77581.083
   Rocky Outcrop—Forest Formation−1.560810.122−1.48048−3.36080.3999
   Urban Area—Forest Formation0.425730.6710.13432−0.49110.7598
C.ASP0.67260.5030.0843−0.16420.3327
C.PLCV0.768780.4440.11647−0.18390.4168
C.RNF0.859730.3920.07579−0.0990.2505
C.TPI1.222950.2240.23845−0.14810.625
C.TWI0.194290.8460.035−0.32220.3922
C.TRI−0.658510.512−0.11549−0.46320.2322
C.LFDI1.643140.1030.39947−0.08250.8814
C.NDVI0.991390.3240.1618−0.16170.4853
C.NDBI1.570650.1190.24133−0.06330.5459
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.080770.9360.02005−0.4720.512
   NP3_gamma_3so—NP23_gamma_1rn−0.340070.734−0.13779−0.9410.6654
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.130440.896−0.17854−2.8922.5349
   High mountainous domain—Colluvial-Talus slopes−0.239570.811−0.28173−2.6132.0495
   Mountainous domain—Colluvial-Talus slopes−0.203210.839−0.26614−2.86252.3302
   Mountainous escarpments—Colluvial-Talus slopes−0.297150.767−0.3259−2.50011.8483
C.SOIL:
   CXbd4—AR3−0.507910.613−0.61932−3.03651.7979
   CXbd6—AR3−1.953230.053−1.11039−2.23740.0166
   LVAd24—AR3−1.692460.093−1.10905−2.40810.19
   Urban area—AR3−2.306880.023−1.30916−2.4342−0.1842
C.LULC:
   Mosaic of Uses—Forest Formation−1.209350.229−0.32209−0.85010.2059
   Pasture—Forest Formation−1.728530.087−0.68211−1.46440.1002
   Rocky Outcrop—Forest Formation−1.378440.171−1.26913−3.09430.5561
   Urban Area—Forest Formation−0.962610.338−0.29899−0.91470.3167
Volume collapse (1.0)1.375110.1720.14975−0.06610.3656
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A17. Vc1.0-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A17. Vc1.0-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a17

Appendix B.7. Vc1.5-CTD Model

Table A37. Overall fit statistics and significance test.
Table A37. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
Vc_CTD_1.50.490.18954.81.63631070.013
Table A38. Normality test results (Shapiro–Wilk).
Table A38. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro–Wilk0.957<0.001
Table A39. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A39. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan65.40.392
Table A40. Durbin–Watson test for residual autocorrelation.
Table A40. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1441.710.044
Table A41. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A41. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.749
D.SOIL2.820.354
D.GEOU1.790.558
D.Litho1.740.575
D.ASP1.690.59
D.PLCV2.150.466
D.RNF2.770.361
D.TPI1.70.589
D.TWI2.990.334
D.TRI2.290.436
D.SDI2.340.428
D.NDVI1.720.58
D.LFDI3.720.269
T.PLCV2.650.377
T.TPI2.710.37
T.Litho2.050.488
T.SOIL1.610.62
T.LULC1.60.624
C.ASP1.810.551
C.PLCV2.190.456
C.RNF1.280.783
C.TPI2.820.354
C.TWI2.610.383
C.TRI2.540.394
C.LFDI3.520.284
C.NDVI2.360.423
C.NDBI2.220.449
C.Litho1.470.68
C.GEOU2.410.414
C.SOIL3.120.321
C.LULC1.490.669
Vol.C_1.51.580.634
Table A42. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A42. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.085330.932
D.LULC:
   Mosaic of Uses—Forest Formation0.938210.350.34408−0.38291.0711
   Pasture—Forest Formation−0.697540.487−0.29354−1.12780.5407
   Rocky Outcrop—Forest Formation−1.511570.134−1.57327−3.63660.49
   Urban Area—Forest Formation−1.074370.285−0.23263−0.66190.1966
D.SOIL:
   CXbd4—AR30.973120.3331.21867−1.26393.7013
   CXbd6—AR32.693950.0081.620450.4282.8129
   LVAd24—AR32.055070.0421.289260.04562.5329
   Urban area—AR33.109070.0021.642320.59522.6895
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.078360.938−0.04412−1.16011.0719
   Dissected hills and low hills—Alluvial-colluvial slopes−0.580560.563−0.35812−1.58090.8647
   Floodplains—Alluvial-colluvial slopes0.331150.7410.23223−1.1581.6224
   High hills domain—Alluvial-colluvial slopes−0.743390.459−0.38403−1.40810.6401
   Mountainous Domain—Alluvial-colluvial slopes0.186730.8520.11719−1.12691.3613
   Mountainous escarpments—Alluvial-colluvial slopes0.005840.9950.00419−1.41781.4261
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.484740.141−0.53508−1.24950.1793
   NP3_gamma_3so—NP23_gamma_1rn−1.134970.259−0.69318−1.90390.5176
   NP3_gamma_3sor—NP23_gamma_1rn0.243690.8080.27328−1.94982.4964
   O1_gamma_6nfan—NP23_gamma_1rn−1.124410.263−1.75407−4.84661.3384
D.ASP−0.24740.805−0.02896−0.2610.2031
D.PLCV−0.024530.98−0.00363−0.29730.2901
D.RNF−0.336440.737−0.06444−0.44420.3153
D.TPI1.335130.1850.15667−0.07590.3893
D.TWI−0.18980.85−0.0392−0.44870.3703
D.TRI−0.902530.369−0.14282−0.45650.1709
D.SDI0.535550.5930.08648−0.23360.4066
D.NDVI0.588930.5570.07013−0.16590.3062
D.LFDI−1.752320.083−0.44995−0.9590.0591
T.PLCV0.122910.9020.02249−0.34030.3853
T.TPI−0.191390.849−0.03577−0.40630.3347
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.629360.530.23562−0.50650.9778
   NP3_gamma_3so—NP23_gamma_1rn1.18380.2390.64445−0.43471.7236
T.SOIL:
   CXbd4—AR3−1.034720.303−0.47449−1.38350.4346
   CXbd6—AR3−0.420220.675−0.19111−1.09270.7105
   LVAd24—AR3−1.812130.073−0.78−1.63330.0733
   Urban area—AR3−0.558660.578−0.21141−0.96160.5388
T.LULC:
   Mosaic of Uses—Forest Formation−1.227810.222−0.35363−0.92460.2173
   Pasture—Forest Formation0.327610.7440.15359−0.77581.083
   Rocky Outcrop—Forest Formation−1.560810.122−1.48048−3.36080.3999
   Urban Area—Forest Formation0.425730.6710.13432−0.49110.7598
C.ASP0.67260.5030.0843−0.16420.3327
C.PLCV0.768780.4440.11647−0.18390.4168
C.RNF0.859730.3920.07579−0.0990.2505
C.TPI1.222950.2240.23845−0.14810.625
C.TWI0.194290.8460.035−0.32220.3922
C.TRI−0.658510.512−0.11549−0.46320.2322
C.LFDI1.643140.1030.39947−0.08250.8814
C.NDVI0.991390.3240.1618−0.16170.4853
C.NDBI1.570650.1190.24133−0.06330.5459
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.080770.9360.02005−0.4720.512
   NP3_gamma_3so—NP23_gamma_1rn−0.340070.734−0.13779−0.9410.6654
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.130440.896−0.17854−2.8922.5349
   High mountainous domain—Colluvial-Talus slopes−0.239570.811−0.28173−2.6132.0495
   Mountainous domain—Colluvial-Talus slopes−0.203210.839−0.26614−2.86252.3302
   Mountainous escarpments—Colluvial-Talus slopes−0.297150.767−0.3259−2.50011.8483
C.SOIL:
   CXbd4—AR3−0.507910.613−0.61932−3.03651.7979
   CXbd6—AR3−1.953230.053−1.11039−2.23740.0166
   LVAd24—AR3−1.692460.093−1.10905−2.40810.19
   Urban area—AR3−2.306880.023−1.30916−2.4342−0.1842
C.LULC:
   Mosaic of Uses—Forest Formation−1.209350.229−0.32209−0.85010.2059
   Pasture—Forest Formation−1.728530.087−0.68211−1.46440.1002
   Rocky Outcrop—Forest Formation−1.378440.171−1.26913−3.09430.5561
   Urban Area—Forest Formation−0.962610.338−0.29899−0.91470.3167
Vol.C_1.51.375110.1720.14975−0.06610.3656
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A18. Vc1.5-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A18. Vc1.5-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a18

Appendix B.8. Log10Ac-CTD Model

Table A43. Overall fit statistics and significance test.
Table A43. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
LogAc_CTD0.4950.19754.51.66631070.01
Table A44. Normality test results (Shapiro–Wilk).
Table A44. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro-Wilk0.963<0.001
Table A45. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A45. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch–Pagan68.30.303
Table A46. Durbin–Watson test for residual autocorrelation.
Table A46. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1531.680.03
Table A47. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A47. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.340.749
D.SOIL2.80.357
D.GEOU1.780.561
D.Litho1.740.575
D.ASP1.680.595
D.PLCV2.140.468
D.RNF2.780.359
D.TPI1.70.589
D.TWI2.990.335
D.TRI2.270.44
D.SDI2.370.422
D.NDVI1.720.583
D.LFDI3.660.273
T.PLCV2.670.374
T.TPI2.680.373
T.Litho2.050.487
T.SOIL1.610.62
T.LULC1.610.622
C.ASP1.780.563
C.PLCV2.20.455
C.RNF1.270.784
C.TPI2.760.362
C.TWI2.610.383
C.TRI2.550.393
C.LFDI3.470.289
C.NDVI2.390.418
C.NDBI2.230.448
C.Litho1.470.68
C.GEOU2.420.413
C.SOIL3.070.325
C.LULC1.510.662
Log10Ac1.770.565
Table A48. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A48. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.08420.933
D.LULC:
   Mosaic of Uses—Forest Formation1.00030.3190.36558−0.35891.0901
   Pasture—Forest Formation−0.54520.587−0.22703−1.05260.5985
   Rocky Outcrop—Forest Formation−1.38040.17−1.43276−3.49030.6248
   Urban Area—Forest Formation−0.87390.384−0.19008−0.62120.2411
D.SOIL:
   CXbd4—AR30.88060.381.09655−1.37183.5649
   CXbd6—AR33.02130.0031.758870.60482.9129
   LVAd24—AR31.9620.0521.22917−0.01282.4711
   Urban area—AR33.10870.0021.630620.59082.6704
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes−0.01080.991−0.006−1.11081.0988
   Dissected hills and low hills—Alluvial-colluvial slopes−0.32430.746−0.197−1.40131.0073
   Floodplains—Alluvial-colluvial slopes0.26120.7940.18263−1.20341.5687
   High hills domain—Alluvial-colluvial slopes−0.66660.506−0.33795−1.3430.6671
   Mountainous Domain—Alluvial-colluvial slopes0.12240.9030.07632−1.15941.312
   Mountainous escarpments—Alluvial-colluvial slopes0.13180.8950.09403−1.32011.5081
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.69110.094−0.60837−1.32150.1048
   NP3_gamma_3so—NP23_gamma_1rn−1.17210.244−0.71195−1.91610.4922
   NP3_gamma_3sor—NP23_gamma_1rn0.31410.7540.3505−1.86182.5628
   O1_gamma_6nfan—NP23_gamma_1rn−1.15110.252−1.7867−4.86361.2902
D.ASP−0.20160.841−0.02327−0.25220.2056
D.PLCV−0.0130.99−0.00191−0.29310.2893
D.RNF−0.43510.664−0.08325−0.46260.2961
D.TPI1.37620.1720.16059−0.07070.3919
D.TWI−0.23420.815−0.04812−0.45540.3591
D.TRI−0.9760.331−0.15256−0.46240.1573
D.SDI0.83020.4080.13528−0.18770.4583
D.NDVI0.5910.5560.06969−0.16410.3035
D.LFDI−1.87310.064−0.47158−0.97070.0275
T.PLCV0.25670.7980.04712−0.31670.4109
T.TPI−0.29550.768−0.05447−0.41990.3109
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.79260.430.29636−0.44491.0376
   NP3_gamma_3so—NP23_gamma_1rn1.05250.2950.56911−0.50281.6411
T.SOIL:
   CXbd4—AR3−1.11280.268−0.50911−1.4160.3978
   CXbd6—AR3−0.41870.676−0.18874−1.08220.7048
   LVAd24—AR3−1.77210.079−0.75762−1.60520.0899
   Urban area—AR3−0.69070.491−0.26113−1.01060.4883
T.LULC:
   Mosaic of Uses—Forest Formation−0.96240.338−0.27813−0.8510.2948
   Pasture—Forest Formation0.60750.5450.28926−0.65461.2331
   Rocky Outcrop—Forest Formation−1.69830.092−1.59696−3.4610.2671
   Urban Area—Forest Formation0.61330.5410.19424−0.43360.8221
C.ASP0.31540.7530.03848−0.20330.2803
C.PLCV0.70040.4850.10575−0.19360.4051
C.RNF0.92580.3570.08111−0.09260.2548
C.TPI1.30310.1950.24725−0.12890.6234
C.TWI0.19950.8420.03576−0.31960.3911
C.TRI−0.73090.466−0.12785−0.47460.2189
C.LFDI1.81270.0730.43174−0.04040.9039
C.NDVI1.15220.2520.18939−0.13650.5153
C.NDBI1.86430.0650.28611−0.01810.5904
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.0240.9810.00592−0.48340.4953
   NP3_gamma_3so—NP23_gamma_1rn−0.17170.864−0.0692−0.86840.73
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.28640.775−0.39254−3.10962.3245
   High mountainous domain—Colluvial-Talus slopes−0.37460.709−0.44072−2.7731.8916
   Mountainous domain—Colluvial-Talus slopes−0.26010.795−0.33942−2.92652.2477
   Mountainous escarpments—Colluvial-Talus slopes−0.51470.608−0.56877−2.75941.6219
C.SOIL:
   CXbd4—AR3−0.37040.712−0.45107−2.86521.9631
   CXbd6—AR3−2.39140.019−1.28956−2.3586−0.2205
   LVAd24—AR3−1.79160.076−1.15671−2.43660.1232
   Urban area—AR3−2.48410.015−1.36576−2.4557−0.2758
C.LULC:
   Mosaic of Uses—Forest Formation−1.46940.145−0.39645−0.93130.1384
   Pasture—Forest Formation−1.87630.063−0.74302−1.5280.042
   Rocky Outcrop—Forest Formation−1.68780.094−1.58247−3.44110.2762
   Urban Area—Forest Formation−0.95480.342−0.29511−0.90780.3176
Log10Ac1.72540.0870.21002−0.03130.4513
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A19. Log10Ac-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A19. Log10Ac-CTD model residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a19

Appendix B.9. Log10Vc1.0-CTD Model

Table A49. Overall fit statistics and significance test.
Table A49. Overall fit statistics and significance test.
Model Fit MeasuresOverall Model Test
ModelR2Adjusted R2RMSEFdf1df2p
LogVc1.0_CTD0.4960.19954.51.67631070.01
Table A50. Normality test results (Shapiro–Wilk).
Table A50. Normality test results (Shapiro–Wilk).
Normality Tests
TestsStatisticp
Shapiro–Wilk0.964<0.001
Table A51. Heteroskedasticity test results (Breusch–Pagan) residuals.
Table A51. Heteroskedasticity test results (Breusch–Pagan) residuals.
Heteroskedasticity Tests
TestsStatisticp
Breusch-Pagan70.60.239
Table A52. Durbin–Watson test for residual autocorrelation.
Table A52. Durbin–Watson test for residual autocorrelation.
Durbin–Watson Test for Autocorrelation
AutocorrelationDW Statisticp
0.1441.70.04
Table A53. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Table A53. Collinearity diagnostics (variance inflation factor—VIF and tolerance) for predictors.
Collinearity Statistics
PredictorVIFTolerance
D.LULC1.330.749
D.SOIL2.80.357
D.GEOU1.780.562
D.Litho1.740.575
D.ASP1.680.597
D.PLCV2.140.467
D.RNF2.780.359
D.TPI1.70.588
D.TWI3.020.331
D.TRI2.270.441
D.SDI2.370.422
D.NDVI1.720.583
D.LFDI3.690.271
T.PLCV2.670.375
T.TPI2.680.373
T.Litho2.060.486
T.SOIL1.610.621
T.LULC1.610.621
C.ASP1.770.564
C.PLCV2.20.455
C.RNF1.280.784
C.TPI2.760.362
C.TWI2.610.383
C.TRI2.570.389
C.LFDI3.470.288
C.NDVI2.370.422
C.NDBI2.220.45
C.Litho1.480.677
C.GEOU2.420.413
C.SOIL3.080.324
C.LULC1.510.662
LogVc1.01.890.53
Table A54. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
Table A54. Standardized regression coefficients, significance tests, and confidence intervals for predictors.
PredictortpStand. Estimate95% Confidence Interval
LowerUpper
Intercept a0.17510.861
D.LULC:
   Mosaic of Uses—Forest Formation0.96940.3350.35304−0.36891.075
   Pasture—Forest Formation−0.57510.566−0.23925−1.06390.5854
   Rocky Outcrop—Forest Formation−1.4310.155−1.48145−3.53380.5709
   Urban Area—Forest Formation−0.87570.383−0.19014−0.62050.2403
D.SOIL:
   CXbd4—AR30.80620.4221.00451−1.46553.4745
   CXbd6—AR32.92140.0041.705890.54832.8635
   LVAd24—AR31.91290.0581.20021−0.04362.444
   Urban area—AR32.9950.0031.580860.53452.6272
D.GEOU:
   Colluvial-talus slopes—Alluvial-colluvial slopes0.03530.9720.01963−1.08291.1221
   Dissected hills and low hills—Alluvial-colluvial slopes−0.37560.708−0.22777−1.43010.9745
   Floodplains—Alluvial-colluvial slopes0.35840.7210.24962−1.13121.6304
   High hills domain—Alluvial-colluvial slopes−0.62170.535−0.31403−1.31530.6872
   Mountainous Domain—Alluvial-colluvial slopes0.08370.9330.05212−1.18231.2866
   Mountainous escarpments—Alluvial-colluvial slopes0.18260.8550.13028−1.28421.5448
D.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−1.59790.113−0.57227−1.28220.1377
   NP3_gamma_3so—NP23_gamma_1rn−1.11930.266−0.67966−1.88340.524
   NP3_gamma_3sor—NP23_gamma_1rn0.28470.7760.31734−1.89212.5267
   O1_gamma_6nfan—NP23_gamma_1rn−1.13720.258−1.76336−4.83721.3104
D.ASP−0.16090.872−0.01851−0.24660.2096
D.PLCV−0.0570.955−0.00839−0.29990.2832
D.RNF−0.43020.668−0.08217−0.46080.2965
D.TPI1.48280.1410.17304−0.05830.4044
D.TWI−0.34590.73−0.0717−0.48260.3392
D.TRI−1.07580.284−0.16748−0.47610.1412
D.SDI0.8290.4090.13477−0.18750.457
D.NDVI0.60230.5480.07096−0.16260.3045
D.LFDI−1.77740.078−0.44979−0.95140.0519
T.PLCV0.24720.8050.04523−0.31750.408
T.TPI−0.32390.747−0.05959−0.42430.3051
T.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn0.74820.4560.27889−0.461.0178
   NP3_gamma_3so—NP23_gamma_1rn0.91690.3610.49763−0.57821.5735
T.SOIL:
   CXbd4—AR3−1.19180.236−0.54774−1.45880.3633
   CXbd6—AR3−0.38050.704−0.1709−1.06120.7194
   LVAd24—AR3−1.85890.066−0.79565−1.64420.0529
   Urban area—AR3−0.7070.481−0.26714−1.01620.4819
T.LULC:
   Mosaic of Uses—Forest Formation−0.85060.397−0.24767−0.82490.3295
   Pasture—Forest Formation0.6290.5310.29949−0.64441.2433
   Rocky Outcrop—Forest Formation−1.67680.097−1.57516−3.43740.2871
   Urban Area—Forest Formation0.6680.5060.21219−0.41750.8419
C.ASP0.32970.7420.04016−0.20130.2816
C.PLCV0.71940.4730.10844−0.19040.4073
C.RNF0.85810.3930.07517−0.09850.2488
C.TPI1.2910.1990.2447−0.13110.6205
C.TWI0.27520.7840.04933−0.30610.4047
C.TRI−0.84730.399−0.14953−0.49940.2003
C.LFDI1.77540.0790.42305−0.04930.8954
C.NDVI1.08670.280.17683−0.14570.4994
C.NDBI1.78410.0770.27225−0.03030.5748
C.Litho:
   NP3_gamma_1rnb—NP23_gamma_1rn−0.05310.958−0.01312−0.50260.4763
   NP3_gamma_3so—NP23_gamma_1rn−0.03060.976−0.0124−0.81640.7916
C.GEOU:
   Dissected hills and Low hills—Colluvial-Talus slopes−0.26990.788−0.36903−3.07922.3412
   High mountainous domain—Colluvial-Talus slopes−0.36140.719−0.42418−2.75121.9028
   Mountainous domain—Colluvial-Talus slopes−0.23730.813−0.30904−2.89082.2727
   Mountainous escarpments—Colluvial-Talus slopes−0.48470.629−0.53319−2.71371.6473
C.SOIL:
   CXbd4—AR3−0.26780.789−0.3278−2.7542.0984
   CXbd6—AR3−2.31480.023−1.25102−2.3224−0.1797
   LVAd24—AR3−1.75390.082−1.1332−2.4140.1476
   Urban area—AR3−2.36150.02−1.31005−2.4098−0.2103
C.LULC:
   Mosaic of Uses—Forest Formation−1.5120.133−0.40911−0.94550.1273
   Pasture—Forest Formation−1.89550.061−0.75045−1.53530.0344
   Rocky Outcrop—Forest Formation−1.69290.093−1.58321−3.43720.2707
   Urban Area—Forest Formation−0.98380.327−0.3036−0.91540.3082
LogVc1.01.790.0760.23201−0.02490.489
Note: Intercept a corresponds to the reference levels of categorical variables. Reference categories are indicated after the dash, and all coefficients represent deviations relative to these baselines.
Figure A20. Log10Vc1.0 residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Figure A20. Log10Vc1.0 residual diagnosis: In (a) quantile–quantile (Q–Q) plot comparing standardized residuals (dots) against the theoretical quantiles of a normal distribution. The straight line indicates the expected 1:1 fit under normality; and in (b) scatter plot of residuals (dots) versus runout distance (Length.TD), used to assess heteroscedasticity and potential model bias.
Remotesensing 17 03337 g0a20

References

  1. Guzzetti, F.; Carrara, A.; Cardinali, M.; Reichenbach, P. Landslide hazard evaluation: A review of current techniques and their application in a multi-scale study, Central Italy. Geomorphology 1999, 31, 181–216. [Google Scholar] [CrossRef]
  2. Hungr, O.; Corominas, J.; Eberhardt, E. Estimating landslide motion mechanism, travel distance, and velocity. In Landslide Risk Management; Hungr, O., Fell, R., Couture, R., Eberhardt, E., Eds.; Balkema: Balkema, Rotterdam, 2005; pp. 99–128. [Google Scholar] [CrossRef]
  3. Fell, R.; Corominas, J.; Bonnard, C.; Cascini, L.; Leroi, E.; Savage, W.Z. Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng. Geol. 2008, 102, 85–98. [Google Scholar] [CrossRef]
  4. Uchida, T.; Nishimoto, H.; Osanai, N.; Shimizu, T. Countermeasures for Sediment-related Disasters in Japan using Hazard Maps. Int. J. Eros. Control Eng. 2009, 2, 46–53. [Google Scholar] [CrossRef]
  5. Corominas, J.; Guzzetti, F.; Lan, H.; Macciotta, R.; Marunteranu, C.; McDougall, S.; Strom, A. Revisiting landslide risk terms: IAEG commission C-37 working group on landslide risk nomenclature. Bull. Eng. Geol. Environ. 2023, 82, 450. [Google Scholar] [CrossRef]
  6. Guzzetti, F.; Reichenbach, P.; Cardinali, M.; Galli, M.; Ardizzone, F. Probabilistic landslide hazard assessment at the basin scale. Geomorphology 2005, 72, 272–299. [Google Scholar] [CrossRef]
  7. Zou, Z.; Xiong, C.; Tang, H.; Criss, R.E.; Su, A.; Liu, X. Prediction of landslide runout based on influencing factor analysis. Environ. Earth Sci. 2017, 76, 723. [Google Scholar] [CrossRef]
  8. Pimentel, J.; Dutra, T.; Ribeiro, R.S.; Pfaltzgraff, P.A.D.S.; Brenny, M.E.R.; Peixoto, D.; Silva, D.R.D.; Iwanami, H.; Nishimura, T. Risk Assessment and Hazard Mapping Technique in the Project for Strengthening National Strategy of Integrated Natural Disaster Risk Management. Int. J. Eros. Control Eng. 2020, 13, 35–47. [Google Scholar] [CrossRef]
  9. Xu, Y.; George, D.L.; Kim, J.; Lu, Z.; Riley, M.; Griffin, T.; De La Fuente, J. Landslide monitoring and runout hazard assessment by integrating multi-source remote sensing and numerical models: An application to the Gold Basin landslide complex, northern Washington. Landslides 2021, 18, 1131–1141. [Google Scholar] [CrossRef]
  10. Frodella, W.; Rosi, A.; Spizzichino, D.; Nocentini, M.; Lombardi, L.; Ciampalini, A.; Vannocci, P.; Ramboason, N.; Margottini, C.; Tofani, V.; et al. Integrated approach for landslide hazard assessment in the High City of Antananarivo, Madagascar (UNESCO tentative site). Landslides 2022, 19, 2685–2709. [Google Scholar] [CrossRef]
  11. Keck, J.; Istanbulluoglu, E.; Campforts, B.; Tucker, G.; Horner-Devine, A. A landslide runout model for sediment transport, landscape evolution, and hazard assessment applications. Earth Surf. Dyn. 2024, 12, 1165–1191. [Google Scholar] [CrossRef]
  12. Park, J.-Y.; Yoon, S.; Lee, D.-H.; Lee, S.-R.; Lim, H.-H. Determination of average growth rate based on statistical relationships using geomorphological and geotechnical variables in predictive debris flow simulations. Geomorphology 2024, 444, 108955. [Google Scholar] [CrossRef]
  13. Guzzetti, F.; Reichenbach, P.; Ardizzone, F.; Cardinali, M.; Galli, M. Estimating the quality of landslide susceptibility models. Geomorphology 2006, 81, 166–184. [Google Scholar] [CrossRef]
  14. Liang, W.-L.; Uchida, T. Performance and topographic preferences of dynamic and steady models for shallow landslide prediction in a small catchment. Landslides 2022, 19, 51–66. [Google Scholar] [CrossRef]
  15. Huang, F.; Xiong, H.; Jiang, S.-H.; Yao, C.; Fan, X.; Catani, F.; Chang, Z.; Zhou, X.; Huang, J.; Liu, K. Modelling landslide susceptibility prediction: A review and construction of semi-supervised imbalanced theory. Earth Sci. Rev. 2024, 250, 104700. [Google Scholar] [CrossRef]
  16. Rickenmann, D. Empirical relationships for debris flows. Nat. Hazards 1999, 19, 47–77. [Google Scholar] [CrossRef]
  17. McDougall, S.; Hungr, O. A model for the analysis of rapid landslide motion across three-dimensional terrain. Can. Geotech. J. 2004, 41, 1084–1097. [Google Scholar] [CrossRef]
  18. McDougall, S.; Hungr, O. Dynamic modelling of entrainment in rapid landslides. Can. Geotech. J. 2005, 42, 1437–1448. [Google Scholar] [CrossRef]
  19. Rickenmann, D. Runout prediction methods. In Debris-Flow Hazards and Related Phenomena; Jakob, M., Hungr, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 305–324. [Google Scholar] [CrossRef]
  20. McDougall, S. 2014 Canadian Geotechnical Colloquium: Landslide runout analysis—Current practice and challenges. Can. Geotech. J. 2017, 54, 605–620. [Google Scholar] [CrossRef]
  21. Corominas, J. The angle of reach as a mobility index for small and large landslides. Can. Geotech. J. 1996, 33, 260–271. [Google Scholar] [CrossRef]
  22. Heim, A. Bergsturz und Menschenleben: Zurich Fretz und Wasmuth; BiTech Publishers Ltd.: Vancouver, BC, Canada, 1932; 195p, (English Translation by Skermer, N.A., Landslides and Human Lives; 1989; 218p.). [Google Scholar]
  23. Scheidegger, A.E. On the prediction of the reach and velocity of catastrophic landslides. Rock Mech. Felsmech. Mec. Des Roches 1973, 5, 231–236. [Google Scholar] [CrossRef]
  24. Hsü, K.J. Catastrophic Debris Streams (Sturzstroms) Generated by Rockfalls; Geological Society of America Bulletin: Boulder, CO, USA, 1975; Volume 86, pp. 129–140. [Google Scholar]
  25. Hutchinson, J.N. General report: Morphological and geotechnical parameters of landslides in relation to geology and hydrogeology. Landslides 1988, 1, 3–35. [Google Scholar] [CrossRef]
  26. Nicoletti, P.G.; Sorriso-Valvo, M. Geomorphic controls of the shape and mobility of rock avalanches. GSA Bull. 1991, 103, 1365–1373. [Google Scholar] [CrossRef]
  27. Finlay, P.J.; Mostyn, G.R.; Fell, R. Landslide risk assessment: Prediction of travel distance. Can. Geotech. J. 1999, 36, 556–562. [Google Scholar] [CrossRef]
  28. Dai, F.C.; Lee, C.F.; Ngai, Y.Y. Landslide risk assessment and management: An overview. Eng. Geol. 2002, 64, 65–87. [Google Scholar] [CrossRef]
  29. Legros, F. The mobility of long-runout landslides. Eng. Geol. 2002, 63, 301–331. [Google Scholar] [CrossRef]
  30. Hunter, G.; Fell, R. Travel distance angle for “rapid” landslides in constructed and natural soil slopes. Can. Geotech. J. 2003, 40, 1123–1141. [Google Scholar] [CrossRef]
  31. Guzzetti, F.; Ardizzone, F.; Cardinali, M.; Rossi, M.; Valigi, D. Landslide volumes and landslide mobilization rates in Umbria, central Italy. Earth Planet. Sci. Lett. 2009, 279, 222–229. [Google Scholar] [CrossRef]
  32. Iverson, R.M. Landslide triggering by rain infiltration. Water Resour. Res. 2000, 36, 1897–1910. [Google Scholar] [CrossRef]
  33. Zhan, W.; Fan, X.; Huang, R.; Pei, X.; Xu, Q.; Li, W. Empirical prediction for travel distance of channelized rock avalanches in the Wenchuan earthquake area. Nat. Hazards Earth Syst. Sci. 2017, 17, 833–844. [Google Scholar] [CrossRef]
  34. Okura, Y.; Kitahara, H.; Kawanami, A.; Kurokawa, U. Topography and volume effects on travel distance of surface failure. Eng. Geol. 2003, 67, 243–254. [Google Scholar] [CrossRef]
  35. Iverson, R.M. Regulation of landslide motion by dilatancy and pore pressure feedback. J. Geophys. Res. Earth Surf. 2005, 110, 2004JF000268. [Google Scholar] [CrossRef]
  36. Iverson, N.R.; Mann, J.E.; Iverson, R.M. Effects of soil aggregates on debris-flow mobilization: Results from ring-shear experiments. Eng. Geol. 2010, 114, 84–92. [Google Scholar] [CrossRef]
  37. Iverson, R.M.; George, D.L.; Allstadt, K.; Reid, M.E.; Collins, B.D.; Vallance, J.W.; Schilling, S.P.; Godt, J.W.; Cannon, C.M.; Magirl, C.S.; et al. Landslide mobility and hazards: Implications of the 2014 Oso disaster. Earth Planet. Sci. Lett. 2015, 412, 197–208. [Google Scholar] [CrossRef]
  38. Wang, G.; Sassa, K. Factors affecting rainfall-induced flowslides in laboratory flume tests. Géotechnique 2001, 51, 587–599. [Google Scholar] [CrossRef]
  39. Wang, G.; Sassa, K. Pore-pressure generation and movement of rainfall-induced landslides: Effects of grain size and fine-particle content. Eng. Geol. 2003, 69, 109–125. [Google Scholar] [CrossRef]
  40. Crosta, G.B.; Imposimato, S.; Roddeman, D.G. Numerical modelling of large landslides stability and runout. Nat. Hazards Earth Syst. Sci. 2003, 3, 523–538. [Google Scholar] [CrossRef]
  41. Li, Y.; Lin, F.; Lou, L.; Li, J.; Xu, Z.; Zhu, S.; Luo, X.; Huo, G.; Zhao, Q. Performance analysis of landslide susceptibility assessment under different factor-filtering models. Arab. J. Geosci. 2021, 14, 1160. [Google Scholar] [CrossRef]
  42. Nishiguchi, Y.; Uchida, T. Long-Runout-Landslide-Induced Debris Flow: The Role of Fine Sediment Deposition Processes in Debris Flow Propagation. J. Geophys. Res. Earth Surf. 2022, 127, e2021JF006452. [Google Scholar] [CrossRef]
  43. Zhao, Y.; Meng, X.; Qi, T.; Chen, G.; Li, Y.; Yue, D.; Qing, F. Modeling the Spatial Distribution of Debris Flows and Analysis of the Controlling Factors: A Machine Learning Approach. Remote Sens. 2021, 13, 4813. [Google Scholar] [CrossRef]
  44. Yasin, M.Y.; Abdullah, J.; Noor, N.M.; Yusoff, M.M.; Noor, N.M. Landsat observation of urban growth and land use change using NDVI and NDBI analysis. IOP Conf. Ser. Earth Environ. Sci. 2022, 1067, 012037. [Google Scholar] [CrossRef]
  45. Yu, L.; Zhou, C.; Wang, Y.; Cao, Y.; Peres, D.J. Coupling Data- and Knowledge-Driven Methods for Landslide Susceptibility Mapping in Human-Modified Environments: A Case Study from Wanzhou County, Three Gorges Reservoir Area, China. Remote Sens. 2022, 14, 774. [Google Scholar] [CrossRef]
  46. Tang, X.; Li, H.; Qin, G.; Huang, Y.; Qi, Y. Evaluation of Satellite-Based Precipitation Products over Complex Topography in Mountainous Southwestern China. Remote Sens. 2023, 15, 473. [Google Scholar] [CrossRef]
  47. Marcelino, E.V.; Formaggio, A.R.; Maeda, E.E. Landslide inventory using image fusion techniques in Brazil. Int. J. Appl. Earth Obs. Geoinf. 2009, 11, 181–191. [Google Scholar] [CrossRef]
  48. Martha, T.R.; Kerle, N.; Jetten, V.; Van Westen, C.J.; Kumar, K.V. Characterising spectral, spatial and morphometric properties of landslides for semi-automatic detection using object-oriented methods. Geomorphology 2010, 116, 24–36. [Google Scholar] [CrossRef]
  49. Neranjan, S.; Uchida, T.; Yamakawa, Y.; Hiraoka, M.; Kawakami, A. Geometrical Variation Analysis of Landslides in Different Geological Settings Using Satellite Images: Case Studies in Japan and Sri Lanka. Remote Sens. 2024, 16, 1757. [Google Scholar] [CrossRef]
  50. Novellino, A.; Pennington, C.; Leeming, K.; Taylor, S.; Alvarez, I.G.; McAllister, E.; Arnhardt, C.; Winson, A. Mapping landslides from space: A review. Landslides 2024, 21, 1041–1052. [Google Scholar] [CrossRef]
  51. Brien, D.L.; Reid, M.E.; Cronkite-Ratcliff, C.; Perkins, J.P. Topographic controls on landslide mobility: Modeling hurricane-induced landslide runout and debris-flow inundation in Puerto Rico. Nat. Hazards Earth Syst. Sci. 2025, 25, 1229–1253. [Google Scholar] [CrossRef]
  52. Laghari, M.M.; Jagirani, S.S.; Cheng, N.; Irfan, M.; Zardari, I.A.; David, U.; Akbar, Z.; Zewen, Y.; Abu Bakar, M. Advances in Landslide Detection through Google Earth Engine: A Comprehensive Review. Nat. Hazards Rev. 2025, 26, 05025007. [Google Scholar] [CrossRef]
  53. Liu, J.; Wu, Y.; Gao, X.; Zhang, X. A Simple Method of Mapping Landslides Runout Zones Considering Kinematic Uncertainties. Remote Sens. 2022, 14, 668. [Google Scholar] [CrossRef]
  54. IBGE—Instituto Brasileiro de Geografia e Estatística. 2022 Demographic Census Database. 2022. Available online: https://censo2022.ibge.gov.br/panorama/mapas.html?localidade=BR (accessed on 18 January 2024).
  55. MapBiomas Project. Collection 8 of the Annual Land Cover and Land Use Maps of Brazil (1985–2022). MapBiomas Data 2023, 1. [Google Scholar] [CrossRef]
  56. Guerra, A.J.T.; Gonçalves, L.F.H.; Lopes, P.B.M. Evolução Histórico-Geográfica Da Ocupação Desordenada E Movimentos De Massa No Município De Petrópolis, Nas Últimas Décadas. Rev. Bras. Geomorfol. 2007, 8, 35–43. [Google Scholar] [CrossRef]
  57. PMRR—Petrópolis Municipal Risk Reduction Plan [Governmental]. Prefeitura Municipal de Petrópolis, Secretaria de Proteção e Defesa Civil. Available online: https://www.petropolis.rj.gov.br/pmp/index.php/defesa-civil/plano-municipal-de-reducao-de-risco (accessed on 1 August 2023).
  58. Shinzato, E.; Dantas, M.E.; Renk, J.F.C.; Garcia, M.L.T.; Costa, L. Geomorphological Map: Municipality of Petrópolis, RJ. [Rio de Janeiro]: CPRM. Scale 1:80,000. 2017. Available online: https://rigeo.sgb.gov.br/handle/doc/18182 (accessed on 15 December 2023).
  59. Heilbron, M.; Ribeiro, A.; Valeriano, C.M.; Paciullo, F.V.; Almeida, J.C.H.; Trouw, R.J.A.; Tupinambá, M.; Eirado Silva, L.G. The Ribeira Belt; Heilbron, M., Cordani, U.G., Alkmim, F.F., Eds.; Springer International Publishing: São Francisco Craton, Brazil, 2017; pp. 277–302. [Google Scholar] [CrossRef]
  60. Tupinambá, M.; Heilbron, M.; Duarte, B.P.; de Almeida, J.C.H.; Valladares, C.S.; Pacheco, B.T.; Salomão, M.S.; Conceição, F.R.; da Silva, L.G.E.; de Almeida, C.G.; et al. Geologia e Recursos Minerais da Folha de Nova Friburgo SF-23-Z-B-II, Estado do Rio de Janeiro, Escala 1:100.000; CPRM—Serviço Geológico do Brasil: Santa Teresa, Brazil, 2012.
  61. Adamy, A.; Holanda, J.L.R. Map of Susceptibility to Mass Gravitational Movements and Flooding: Municipality of Petrópolis—RJ. [Map]. Rio de Janeiro: SGB—Geological Survey of Brazil. 2016. Available online: https://rigeo.sgb.gov.br/handle/doc/15692 (accessed on 12 March 2023).
  62. Carvalho-Filho, A.; de Lumbreras, J.F.; Dos Santos, R.D. Os Solos do Estado do Rio de Janeiro; CPRM: Brasília, DF, Brasil, 2000. [Google Scholar]
  63. Vieira, B.C.; Fernandes, N.F. Landslides in Rio de Janeiro: The role played by variations in soil hydraulic conductivity. Hydrol. Process. 2004, 18, 791–805. [Google Scholar] [CrossRef]
  64. Avelar, A.S.; Netto, A.L.C.; Lacerda, W.A.; Becker, L.B.; Mendonça, M.B. Mechanisms of the Recent Catastrophic Landslides in the Mountainous Range of Rio de Janeiro, Brazil. In Landslide Science and Practice; Margottini, C., Canuti, P., Sassa, K., Eds.; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar] [CrossRef]
  65. CCKP—Climate Change Knowledge Portal. Trends and Variability—Historical. 2023. Available online: https://climateknowledgeportal.worldbank.org/country/brazil/trends-variability-historical (accessed on 28 September 2024).
  66. Memorial Petrópolis. Movimento Contra Amnésia do Céu Azul. Linha do Tempo. Memorial Petrópolis. 2024. Available online: https://www.memorialpetropolis.app/linhadotempo (accessed on 23 June 2024).
  67. Schuster, R.L.; Salcedo, D.A.; Valenzuela, L. Overview of catastrophic landslides of South America in the twentieth century. In Catastrophic Landslides; Evans, S.G., Degraff, J.V., Eds.; Geological Society of America: Boulder, CO, USA, 2002. [Google Scholar] [CrossRef]
  68. Guerra, A. Catastrophic events in Petrópolis city (Rio de Janeiro state), between 1940 and 1990. GeoJournal 1995, 37, 349–354. [Google Scholar] [CrossRef]
  69. Gerscovich, D.M.S.; Vargas, E.A.; De Campos, T.M.P. On the evaluation of unsaturated flow in a natural slope in Rio de Janeiro, Brazil. Eng. Geol. 2006, 88, 23–40. [Google Scholar] [CrossRef]
  70. Netto, A.L.C.; Sato, A.M.; de Souza Avelar, A.; Vianna, L.G.G.; Araújo, I.S.; Ferreira, D.L.; Silva, R.P. January 2011: The Extreme Landslide Disaster in Brazil. In Landslide Science and Practice; Margottini, C., Canuti, P., Sassa, K., Eds.; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar] [CrossRef]
  71. Rosi, A.; Canavesi, V.; Segoni, S.; Dias Nery, T.; Catani, F.; Casagli, N. Landslides in the Mountain Region of Rio de Janeiro: A Proposal for the Semi-Automated Definition of Multiple Rainfall Thresholds. Geosciences 2019, 9, 203. [Google Scholar] [CrossRef]
  72. Dourado, F.; Coutinho Arraes, T.; Fernandes Silva, M. O Megadesastre da Região Serrana do Rio de Janeiro—As Causas do Evento, os Mecanismos dos Movimentos de Massa e a Distribuição Espacial dos Investimentos de Reconstrução no Pós-Desastre, Anuário do Instituto de Geociências. 2012. Available online: https://www.researchgate.net/publication/273563964_O_Megadesastre_da_Regiao_Serrana_do_Rio_de_Janeiro_-_as_Causas_do_Evento_os_Mecanismos_dos_Movimentos_de_Massa_e_a_Distribuicao_Espacial_dos_Investimentos_de_Reconstrucao_no_Pos-Desastre (accessed on 24 January 2024).
  73. Alcântara, E.; Marengo, J.A.; Mantovani, J.; Londe, L.R.; San, R.L.Y.; Park, E.; Lin, Y.N.; Wang, J.; Mendes, T.; Cunha, A.P.; et al. Deadly disasters in southeastern South America: Flash floods and landslides of February 2022 in Petrópolis, Rio de Janeiro. Nat. Hazards Earth Syst. Sci. 2023, 23, 1157–1175. [Google Scholar] [CrossRef]
  74. Globo (G1); Rangel, A.B. Petrópolis Registrou 580 Deslizamentos por Conta da Chuva em Quatro Dias; Tempo Continua Instável Nesta Segunda [G1]. 2024. Available online: https://g1.globo.com/rj/regiao-serrana/noticia/2024/03/25/petropolis-registrou-580-deslizamentos-por-conta-da-chuva-em-quatro-dias-tempo-continua-instavel-nesta-segunda.ghtml (accessed on 25 March 2024).
  75. USGS—United States Geological Survey; Earth Resources Observation and Science (EROS) Center. Collection-2 Landsat 8–9 OLI (Operational Land Imager) and TIRS (Thermal Infrared Sensor) Level-2 Science Products [Dataset]; U.S. Geological Survey: Sioux Falls, SD, USA, 2013. [Google Scholar] [CrossRef]
  76. IAEG—The International Geotechnical Societies’ UNESCO Working Party on World Landslide Inventory. A suggested method for reporting a landslide. Bull. Int. Assoc. Eng. Geol. 1990, 41, 5–12. [Google Scholar] [CrossRef]
  77. Cruden, D.M.; Varnes, D.J. Landslide Types and Processes. In Landslides: Investigation and Mitigation; Turner, A.K., Schuster, R.L., Eds.; Transportation Research Board: Washington, DC, USA; National Research Council: Ottawa, ON, Canada, Special Report; 1996; Volume 247, pp. 36–75. [Google Scholar]
  78. Gao, L.; Zhang, L.M.; Chen, H.X.; Fei, K.; Hong, Y. Topography and geology effects on travel distances of natural terrain landslides: Evidence from a large multi-temporal landslide inventory in Hong Kong. Eng. Geol. 2021, 292, 106266. [Google Scholar] [CrossRef]
  79. Fernandes, N.F.; Guimarães, R.F.; Gomes, R.A.T.; Vieira, B.C.; Montgomery, D.R.; Greenberg, H. Topographic controls of landslides in Rio de Janeiro: Field evidence and modeling. CATENA 2004, 55, 163–181. [Google Scholar] [CrossRef]
  80. Ochi, H.; Shimoda, Y.; Nishio, Y.; Machado, R.P. Sediment-related Disasters in Petrópolis, Brazil, on February 15 and March 20, 2022. Int. J. Eros. Control. Eng. 2022, 15, 31–38. [Google Scholar] [CrossRef]
  81. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  82. INPE—Instituto Nacional de Pesquisas Espaciais. Digital Elevation Model extracted from the SRTM Originating from the TOPODATA Project. 2011. Available online: https://rigeo.sgb.gov.br/handle/doc/15692 (accessed on 12 March 2023).
  83. IBGE (2010). Orthophotos, at a scale of 1:30,000 (Medium Flight, at a Scale of 1:30,000, Vertical Accuracy Greater than 5 m, and grid of 10 m × 10 m). Rio de Janeiro Project, Rio de Janeiro. 2010. Available online: https://rigeo.sgb.gov.br/handle/doc/15692 (accessed on 12 March 2023).
  84. OSM—OpenStreetMap Contributors. OpenStreetMap®. 2024. Available online: https://www.openstreetmap.org (accessed on 1 March 2024).
  85. Valladares, C.; Pascutti, A.G.; da Silva, T.M.; Heilbron, M.; de Almeida, J.C.H.; Duarte, B.P.; Tupinambá, M.; Nogueira, J.R.; Valeriano, C.M.; da Silva, L.G.E.; et al. Geologia e recursos minerais da folha de Três Rios—SF.23-Z-B-I, Estado do Rio de Janeiro, Escala 1:100.000. Available online: https://rigeo.sgb.gov.br/handle/doc/11523 (accessed on 26 March 2023).
  86. De Pinto, E.J.A.; de Azambuja, A.M.S.; Farias, J.A.M.; Pickbrenner, K.; de Salgueiro, J.H.P.B. (Coords). Pluviometric Atlas of Brazil: Monthly Isohyets, Quarterly Isohyets, Annual Isohyets, Driest Months, Wettest Months, Driest Quarters, Wettest Quarters. Brasília: CPRM; Brazil Geology Program; Geodiversity Survey. Scale 1:5, 000,000. 2011. Available online: https://rigeo.sgb.gov.br/handle/doc/11558 (accessed on 12 March 2023).
  87. Conrad, O.; Bechtel, B.; Bock, M.; Dietrich, H.; Fischer, E.; Gerlitz, L.; Wehberg, J.; Wichmann, V.; Böhner, J. System for Automated Geoscientific Analyses (SAGA) v. 2.1.4. Geosci. Model Dev. 2015, 8, 1991–2007. [Google Scholar] [CrossRef]
  88. Catani, F.; Lagomarsino, D.; Segoni, S.; Tofani, V. Landslide susceptibility estimation by random forests technique: Sensitivity and scaling issues. Nat. Hazards Earth Syst. Sci. 2013, 13, 2815–2831. [Google Scholar] [CrossRef]
  89. Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Bui, D.T.; Duan, Z.; Ma, J. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. CATENA 2017, 151, 147–160. [Google Scholar] [CrossRef]
  90. Guo, J.; Yi, S.; Yin, Y.; Cui, Y.; Qin, M.; Li, T.; Wang, C. The effect of topography on landslide kinematics: A case study of the Jichang town landslide in Guizhou, China. Landslides 2020, 17, 959–973. [Google Scholar] [CrossRef]
  91. Sajadi, P.; Sang, Y.-F.; Gholamnia, M.; Bonafoni, S.; Mukherjee, S. Evaluation of the landslide susceptibility and its spatial difference in the whole Qinghai-Tibetan Plateau region by five learning algorithms. Geosci. Lett. 2022, 9, 9. [Google Scholar] [CrossRef]
  92. Youssef, A.M.; Pourghasemi, H.R. Landslide susceptibility mapping using machine learning algorithms and comparison of their performance at Abha Basin, Asir Region, Saudi Arabia. Geosci. Front. 2021, 12, 639–655. [Google Scholar] [CrossRef]
  93. Gao, B. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
  94. Chiu, C.-Y.; Hsu, S.-C.; Huang, Y.; Zhang, N.; Peng, C. Predicting landslide extent of satellite images based on image classification extraction. J. Phys. Conf. Ser. 2024, 2870, 012010. [Google Scholar] [CrossRef]
  95. Ayalew, L.; Yamagishi, H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology 2005, 65, 15–31. [Google Scholar] [CrossRef]
  96. Dahal, A.; Lombardo, L. Explainable artificial intelligence in geoscience: A glimpse into the future of landslide susceptibility modeling. Comput. Geosci. 2023, 176, 105364. [Google Scholar] [CrossRef]
  97. Corominas, J.; van Westen, C.J.; Frattini, P.; Cascini, L.; Malet, J.P.; Fotopoulou, S.; Catani, F.; van den Eeckhaut, M.; Mavrouli, O. Recommendations for the quantitative analysis of landslide risk. Bull. Eng. Geol. Environ. 2014, 73, 209–263. [Google Scholar] [CrossRef]
  98. Rezaei, M.; Fathizadeh, O.; Darand, M.; Asghari, S. The relationship between hydrological extreme events and climate change. Hydrol. Process. 2018, 32, 1451–1464. [Google Scholar] [CrossRef]
  99. Chen, C.-Y.; Chang, J.-M. Landslide dam formation susceptibility analysis based on geomorphic features. Landslides 2016, 13, 1019–1033. [Google Scholar] [CrossRef]
  100. Kruskal, W.H.; Wallis, W.A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 1952, 47, 583–621. [Google Scholar] [CrossRef]
  101. Lee, J.-U.; Cho, Y.-C.; Kim, M.; Jang, S.-J.; Lee, J.; Kim, S. The Effects of Different Geological Conditions on Landslide-Triggering Rainfall Conditions in South Korea. Water 2022, 14, 2051. [Google Scholar] [CrossRef]
  102. Dupire, S.; Bourrier, F.; Monnet, J.-M.; Bigot, S.; Borgniet, L.; Berger, F.; Curt, T. The protective effect of forests against rockfalls across the French Alps: Influence of forest diversity. For. Ecol. Manag. 2016, 382, 269–279. [Google Scholar] [CrossRef]
  103. Navarro, D.J. Learning Statistics with R: A Tutorial for Psychology Students and Other Beginners, 6th ed.; University of New South Wales: Sydney, NSW, Australia, 2018; Available online: https://learningstatisticswithr.com (accessed on 4 November 2024).
  104. Brideau, M.-A.; De Vilder, S.; Massey, C.; Mitchell, A.; McDougall, S.; Aaron, J. Empirical Relationships to Estimate the Probability of Runout Exceedance for Various Landslide Types. In Understanding and Reducing Landslide Disaster Risk; Guzzetti, F., Mihalić Arbanas, S., Reichenbach, P., Sassa, K., Bobrowsky, P.T., Takara, K., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 321–327. [Google Scholar] [CrossRef]
  105. Tian, L.; Liu, B.; Wang, J. Application of multivariate regression models in hydrology. J. Hydrol. 2022, 614, 128542. [Google Scholar] [CrossRef]
  106. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
  107. Montgomery, D.C.; Peck, E.A.; Vining, G.G. Introduction to Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2021. [Google Scholar]
  108. Pradhan, A.M.S.; Lee, J.-M.; Kim, Y.-T. Semi-quantitative method to identify the vulnerable areas in terms of building aggregation for probable landslide runout at the regional scale: A case study from Soacha Province, Colombia. Bull. Eng. Geol. Environ. 2019, 78, 5745–5762. [Google Scholar] [CrossRef]
  109. Baeza, C.; Corominas, J. Assessment of shallow landslide susceptibility by means of multivariate statistical techniques. Earth Surf. Process. Landf. 2001, 26, 1251–1263. [Google Scholar] [CrossRef]
  110. Tabachnick, B.G.; Fidell, L.S. Using Multivariate Statistics, 4th ed.; Allyn and Bacon: Boston, MA, USA, 2001. [Google Scholar]
  111. Merghadi, A.; Yunus, A.P.; Dou, J.; Whiteley, J.; ThaiPham, B.; Bui, D.T.; Avtar, R.; Abderrahmane, B. Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth Sci. Rev. 2020, 207, 103225. [Google Scholar] [CrossRef]
  112. Liu, W.; Yang, Z.; He, S. Modeling the landslide-generated debris flow from formation to propagation and run-out by considering the effect of vegetation. Landslides 2021, 18, 43–58. [Google Scholar] [CrossRef]
  113. Iverson, R.M.; George, D.L. Modelling landslide liquefaction, mobility bifurcation and the dynamics of the 2014 Oso disaster. Géotechnique 2016, 66, 175–187. [Google Scholar] [CrossRef]
  114. Scheevel, C.R. Predicting Landslide Stability, Runout, and Failure Velocity at Cook Lake Landslide, Wyoming [M.S., Colorado School of Mines]. In ProQuest Dissertations and Theses (1927886322). ProQuest Dissertations & Theses Global. 2017. Available online: https://www.proquest.com/dissertations-theses/predicting-landslide-stability-runout-failure/docview/1927886322/se-2?accountid=25225 (accessed on 5 April 2023).
  115. Hungr, O. Dynamics of Rapid Landslides. In Progress in Landslide Science; Sassa, K., Fukuoka, H., Wang, F., Wang, G., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 47–57. [Google Scholar] [CrossRef]
  116. Fannin, R.J.; Wise, M.P. An empirical-statistical model for debris flow travel distance. Can. Geotech. J. 2001, 38, 982–994. [Google Scholar] [CrossRef]
Figure 1. Spatial distribution of singular landslide catastrophic events on urbanized slopes in Petrópolis, Rio de Janeiro, Brazil.
Figure 1. Spatial distribution of singular landslide catastrophic events on urbanized slopes in Petrópolis, Rio de Janeiro, Brazil.
Remotesensing 17 03337 g001
Figure 2. Diagram illustrating the main morphometric parameters used to characterize landslide geometry across collapse, transport, and deposition zones.
Figure 2. Diagram illustrating the main morphometric parameters used to characterize landslide geometry across collapse, transport, and deposition zones.
Remotesensing 17 03337 g002
Figure 3. Site-condition factors used as input data for multivariate linear regression modeling.
Figure 3. Site-condition factors used as input data for multivariate linear regression modeling.
Remotesensing 17 03337 g003
Figure 4. Model design flowchart for landslide runout assessing controlling factors.
Figure 4. Model design flowchart for landslide runout assessing controlling factors.
Remotesensing 17 03337 g004
Figure 5. Comparative analysis of categorical predictors across landslide phases. (a) Chi-square (χ2) statistics from Kruskal–Wallis tests, indicating the strength of association between categorical variables and each landslide zone (collapse, transport, deposition); (b) corresponding effect size (ε2), representing the proportion of variability explained by each categorical variable across zones.
Figure 5. Comparative analysis of categorical predictors across landslide phases. (a) Chi-square (χ2) statistics from Kruskal–Wallis tests, indicating the strength of association between categorical variables and each landslide zone (collapse, transport, deposition); (b) corresponding effect size (ε2), representing the proportion of variability explained by each categorical variable across zones.
Remotesensing 17 03337 g005
Figure 6. Univariate linear regression models for collapse post-failure morphometric parameters. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively. The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Figure 6. Univariate linear regression models for collapse post-failure morphometric parameters. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively. The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Remotesensing 17 03337 g006
Figure 7. Model 7 type multivariate linear regression models combining post-failure and Collapse–Transport (CT) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Figure 7. Model 7 type multivariate linear regression models combining post-failure and Collapse–Transport (CT) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Remotesensing 17 03337 g007
Figure 8. Model 7 type multivariate linear regression models combining post-failure and Collapse–Deposit (CD) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Figure 8. Model 7 type multivariate linear regression models combining post-failure and Collapse–Deposit (CD) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Remotesensing 17 03337 g008
Figure 9. Model 7 type multivariate linear regression models combining post-failure and Transport–Deposit (TD) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Figure 9. Model 7 type multivariate linear regression models combining post-failure and Transport–Deposit (TD) geoenvironmental variables. Upper and lower (K × RMSE) bounds represent overestimation and underestimation thresholds, respectively (K = 0.30). The TD Length line indicates the 1:1 reference, where predicted values equal observed values under ideal model performance.
Remotesensing 17 03337 g009
Figure 10. Model 8 type multivariate linear regression models combining post-failure morphometric predictors with geoenvironmental variables from all three landslide stages (CTD). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Figure 10. Model 8 type multivariate linear regression models combining post-failure morphometric predictors with geoenvironmental variables from all three landslide stages (CTD). K × RMSE (K = 0.30) bounds indicate over- and underestimation thresholds; the TD Length line represents the ideal 1:1 predicted–observed relationship.
Remotesensing 17 03337 g010
Figure 11. Boxplots summarizing the predictive performance (R2) of the linear regression models across different data groupings and modeling strategies. In (a), the effect of data source combinations, including post-failure morphometries (PFM), remote sensing (RS), and field survey (FS) data. In (b), landslide stages and their combinations: individual zones (Collapse, Transport, Deposit), paired stages (CT, CD, TD), and all combined stages (CTD). Boxes indicate the interquartile range (IQR), whiskers extend to 1.5 × IQR, black × represent group medians, and black dots ● denote outliers.
Figure 11. Boxplots summarizing the predictive performance (R2) of the linear regression models across different data groupings and modeling strategies. In (a), the effect of data source combinations, including post-failure morphometries (PFM), remote sensing (RS), and field survey (FS) data. In (b), landslide stages and their combinations: individual zones (Collapse, Transport, Deposit), paired stages (CT, CD, TD), and all combined stages (CTD). Boxes indicate the interquartile range (IQR), whiskers extend to 1.5 × IQR, black × represent group medians, and black dots ● denote outliers.
Remotesensing 17 03337 g011
Figure 12. Multivariate linear regression models (Models 6 and 8) stratified by runout path type. Panels (ac) represent single-stage models using the collapse, transport, and deposition zones, respectively. Panels (df) show multistage combinations: collapse–transport (CT), collapse–deposition (CD), and transport–deposition (TD), respectively. Panel (g) displays collapse–transport–deposition (CTD).
Figure 12. Multivariate linear regression models (Models 6 and 8) stratified by runout path type. Panels (ac) represent single-stage models using the collapse, transport, and deposition zones, respectively. Panels (df) show multistage combinations: collapse–transport (CT), collapse–deposition (CD), and transport–deposition (TD), respectively. Panel (g) displays collapse–transport–deposition (CTD).
Remotesensing 17 03337 g012
Table 1. Characteristics of site-condition thematic datasets.
Table 1. Characteristics of site-condition thematic datasets.
CategoryTypeVariableSource
Remote Sensing (RS)Quantitative (Qt)Altitude (ALT)DEM SRTM (30 × 30 m)
Slope angle (SLP)
Aspect (ASP)
Plan curvature (PLCV)
Profile curvature (PRCV)
Topographical Position Index (TPI)
Topographic roughness index (TRI)
Length-Slope Factor (LSF)
Stream Power Index (SPI)
Sediment Transport Index (STI)
Topographic Wetness Index (TWI)
Drainage density index (DDI)
Normalized Difference Water Index (NDWI)Landsat 8 OLI (30 × 30 m)
Normalized Difference Vegetation Index (NDVI)
Normalized Difference Building Index (NDBI)
Road density index (RDI) [84]—1:25,000
Qualitative (Ql)Geomorphological units (GeoU)[58]
Land Use Land Cover (LULC)[55]
Field Survey
(FS)
Quantitative (Qt)Lineament—Fault density index (LFDI) Geological maps
Structural density index (SDI) 1:100,000
Rainfall (RNF)1:50,000
Qualitative (Ql)Lithology typeGeological map (1:100,000)
Soil typePedological map (1:100,000)
Table 2. Description of qualitative site-condition thematic datasets.
Table 2. Description of qualitative site-condition thematic datasets.
QualitativeCodeDescription
LithologyNP23_gamma_1rnNeoproterozoic diorites, tonalites, gabbros, and gneisses.
NP3_gamma_1rnbNeoproterozoic gneisses and migmatites.
NP3_gamma_1soNeoproterozoic granites.
O1_gamma_6nfanPhanerozoic granites.
NP2_gamma_3sorNeoproterozoic orthogneisses.
SoilAR3Rock outcrop, associated with secondary cambisols and tertiary litholic neosols.
LVAd24Dystrophic Red-Yellow latosol, clayey or loamy.
CXbd4Haplic cambisol Tb, dystrophic, medium or argillic; associated with a talic B horizon.
CXbd6Haplic cambisol Tb, dystrophic, clayey or medium; associated with a talic B horizon.
Urban areaUrban area
Geomorphological unitAlluvial-colluvial slopesGently inclined depositional surfaces (5–10°) composed of poorly sorted sandy-clayey to clayey-sandy colluvial materials, interdigitated with flat fluvial plain deposits.
Colluvial-talus slopesDepositional slopes (5–10°) composed of very poorly sorted colluvial materials with sandy-clayey to clayey-sandy matrix and abundant blocks, interdigitated with gently inclined alluvial-colluvial ramp deposits. Typically located at the base of steep slopes and escarpments.
Dissected hills and low hillsConvex-concave slopes and rounded hilltops with relief amplitude between 50 and 120 m and slopes of 5–20°. Moderate drainage density.
Flood plainsSub-horizontal surfaces (0–3°) located in valley bottoms, composed of well-sorted sandy, sandy-clayey, or clayey deposits.
High hill domainsConvex-concave geometry with well-dissected terrain, rounded to pointed summits, relief amplitude of 80–250 m, slopes of 10–35°, and moderate to high drainage density.
Mountainous domainsStraight to concave slopes with aligned ridge tops, pointed or slightly rounded, relief amplitude >300 m. High drainage density, with steep slopes (20–45°) and occasional sub-vertical rocky walls.
Mountainous escarpmentsHighly dissected, steep slopes (>300 m relief amplitude, 30–45° slope), with straight to concave profiles, rocky cliffs, and sharp or aligned ridge tops. High drainage density.
Land Use and Land CoverForest formationVegetation types with a predominance of tree species with high-density continuous canopy.
Mosaic of usesFarming areas where it was not possible to distinguish between pasture and
agriculture. In Petropolis is often associated with the transition between informal settlements and areas that have been deforested.
PastureNatural or planted, related to farming activity.
Rock outcropNaturally exposed rocks in the terrestrial surface without soil cover, often with partial presence of rock vegetation and high slope.
Urban areaUrban areas with a predominance of non-vegetated surfaces, including roads, highways and constructions.
Table 3. Spearman correlation coefficients (ρ) between key post-failure morphometric landslide attributes and TD length.
Table 3. Spearman correlation coefficients (ρ) between key post-failure morphometric landslide attributes and TD length.
Post-Failure Morphometric VariablesTransport-Deposit Length
Collapse height (Hc)0.24
Collapse width (Wc)0.19
Collapse length (Lc)0.22
Collapse altitude (ALTc)0.13
Collapse slope angle (SLPc)0.12
Transport-Deposit height (Htd)0.32
Transport-Deposited width (Wtd)0.26
Transport altitude (ALTt)0.05
Transport slope angle (SLPt)0.12
Deposit altitude (ALTd)−0.05
Deposit slope angle (SLPd)0.02
Total height (Ht)0.32
Total length (Lt)0.89
Total volume (Vt)0.62
Note: Cells with p values < 0.05 are highlighted in gray to indicate statistical significance.
Table 4. Spearman correlation coefficients (ρ) between quantitative on-site conditions parameters and TD length.
Table 4. Spearman correlation coefficients (ρ) between quantitative on-site conditions parameters and TD length.
VariablesCollapsedTransportedDeposited
ALT0.130.06−0.05
SLP0.120.120.02
ASP0.260.280.15
PLCV0.180.130.11
PRCV0.130.130.01
TPI0.230.190.12
TRI0.130.130.07
SPI0.100.110.06
LSF0.020.04−0.01
STI−0.020.03−0.01
TWI−0.180.13−0.08
DDI0.010.010.05
NDWI0.150.050.02
LFDI−0.14−0.14−0.18
SDI−0.19−0.18−0.18
NDVI−0.26−0.11−0.07
RNF0.050.070.09
RDI0.000.020.05
NDBI0.230.040.05
Note: Cells with p values < 0.05 are highlighted in gray to indicate statistical significance.
Table 5. Generalization performance metrics of CTD-based models (type 8) under 10-fold cross-validation.
Table 5. Generalization performance metrics of CTD-based models (type 8) under 10-fold cross-validation.
CTD_Based10-Fold Cross-Validation
ModelsR2RMSEMAE
CTD-based0.1271.6648.18
Hc_CTD0.1271.6348.19
LC_CTD0.1570.3347.58
Ac_CTD0.1072.5648.48
Vc1.0_CTD0.1072.5848.79
LogAc_CTD0.1171.8248.06
LogVc1.0_CTD0.1271.5647.76
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

dos Santos, T.D.; Uchida, T. The Role of Collecting Data on Various Site Conditions Through Satellite Remote Sensing Technology and Field Surveys in Predicting the Landslide Travel Distance: A Case Study of the 2022 Petrópolis Disaster in Brazil. Remote Sens. 2025, 17, 3337. https://doi.org/10.3390/rs17193337

AMA Style

dos Santos TD, Uchida T. The Role of Collecting Data on Various Site Conditions Through Satellite Remote Sensing Technology and Field Surveys in Predicting the Landslide Travel Distance: A Case Study of the 2022 Petrópolis Disaster in Brazil. Remote Sensing. 2025; 17(19):3337. https://doi.org/10.3390/rs17193337

Chicago/Turabian Style

dos Santos, Thiago Dutra, and Taro Uchida. 2025. "The Role of Collecting Data on Various Site Conditions Through Satellite Remote Sensing Technology and Field Surveys in Predicting the Landslide Travel Distance: A Case Study of the 2022 Petrópolis Disaster in Brazil" Remote Sensing 17, no. 19: 3337. https://doi.org/10.3390/rs17193337

APA Style

dos Santos, T. D., & Uchida, T. (2025). The Role of Collecting Data on Various Site Conditions Through Satellite Remote Sensing Technology and Field Surveys in Predicting the Landslide Travel Distance: A Case Study of the 2022 Petrópolis Disaster in Brazil. Remote Sensing, 17(19), 3337. https://doi.org/10.3390/rs17193337

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop