Improving GIS-Based Landslide Susceptibility Assessments with Multi-temporal Remote Sensing and Machine Learning

This study developed a systematic approach with machine learning (ML) to apply the satellite remote sensing images, geographic information system (GIS) datasets, and spatial analysis for multi-temporal and event-based landslide susceptibility assessments at a regional scale. Random forests (RF) algorithm, one of the ML-based methods, was selected to construct the landslide susceptibility models. Different ratios of landslide and non-landslide samples were considered in the experiments. This study also employed a cost-sensitive analysis to adjust the decision boundary of the developed RF models with unbalanced sample ratios to improve the prediction results. Two strategies were investigated for model verification, namely space- and time-robustness. The space-robustness verification was designed for separating samples into training and examining data based on a single event or the same dataset. The time-robustness verification was designed for predicting subsequent landslide events by constructing a landslide susceptibility model based on a specific event or period. A total of 14 GIS-based landslide-related factors were used and derived from the spatial analyses. The developed landslide susceptibility models were tested in a watershed region in northern Taiwan with a landslide inventory of changes detected through multi-temporal satellite images and verified through field investigation. To further examine the developed models, the landslide susceptibility distributions of true occurrence samples and the generated landslide susceptibility maps were compared. The experiments demonstrated that the proposed method can provide more reasonable results, and the accuracies were found to be higher than 93% and 75% in most cases for space- and time-robustness verifications, respectively. In addition, the mapping results revealed that the multi-temporal models did not seem to be affected by the sample ratios included in the analyses.


Introduction
Landslide risk assessment and management are critical and extensive tasks that provide systematic and rigorous processes for slope engineering practice and management [1], making them important tools for addressing landslide hazards [2]. The framework of landslide risk assessment and management consists of several subtasks, such as susceptibility, hazard, and risk assessments. In addition, van Westen et al. [3,4] proposed a landslide risk assessment procedure that was mostly consistent with Dai et al. [2]. The term landslide susceptibility represents the likelihood (0 to 1) or degree (e.g., low, moderate, or high) of landslide occurrences in an area with given local terrain attributes [5]. Although landslide susceptibility assessment is a fundamental and essential task within the landslide risk assessment and management framework, there is no general consensus at present as to which is

Study Site and Data Preprocessing
The Shimen reservoir watershed (Figure 1), which is located in northern Taiwan, was selected as the study site because of numerous typical shallow landslide cases in this area. The total area of the study site covered approximately 764 km 2 of the mountainous terrain. The average annual precipitation in the watershed was approximately 2500 mm/year. The elevation measured from the digital elevation model (DEM) ranged from 250 to 3500 m above sea level. More than 60% of the study area had a slope degree greater than 16 • , and approximately 29% of the above area had a slope degree between 16 • and 28 • . The used DEM was generated by the Ministry of the Interior in Taiwan during 2002 to 2004. According to the geological and soil maps published by the Central Geological Survey of Taiwan, the watershed comprised nine geological categories (e.g., argillite, shale, quartzitic, and etc) and six primary soil types (e.g., clay, sand, loam and mixed soils). The collected soil map had some null values in the northern region, which were ignored in the landslide susceptibility assessments in this study. The primary land-cover in the watershed was forest but had limited agricultural activities near a few tribal areas. Detailed information regarding geology and soil as well as the spatial distribution of river, road, and fault within the study site has been mentioned in Tsai et al. [10]. The landslide inventory used in this study consisted of shallow landslide as identified with multi-temporal satellite remote sensing images and pixel-based change detection analysis from 2004 to 2008 [48]. Most of the detected landslides were also verified through field investigation. A summary of landslides detected after typhoons is listed in Table 1. They were rasterized into 10 m ×10 m cells to be compatible with other datasets for modeling. The deposition regions in the landslide samples were already eliminated using an empirical criterion [49], that is, the slope of landslide samples must be larger than 10 • , otherwise they were considered the deposition area (non-landslide region). The spatial distribution of the landslide inventory can be also found in Tsai et al. [10]. A total of 14 landslide factors (Table 2) were collected or derived from the vector and raster GIS datasets. In addition, all factor data sets were resampled to a 10 m × 10 m raster format regardless of their original scale or spatial resolution to be compatible with the modeling system. For example, aspect, curvature, and slope factors were derived from the DEM. Furthermore, the vegetative factor normalized difference vegetation index (NDVI) was derived from the multispectral SPOT-4 and SPOT-5 satellite images, which were acquitted before the typhoon events (second column in Table 1) during 2004 to 2008, and provided by the Center for Space and Remote Sensing Research in National Central University of Taiwan. The NDVI values acquired at different times can be used to quantitatively represent the multi-temporal vegetation characteristics of the study site. To address the NDVI variations of the same surface due to difference in topographic, radiometric, and atmospheric conditions, this study adopted pseudo invariant feature (PIF) normalization [50,51] and Minnaert topographic correction [52] to minimize temporal and spatial differences. To obtain the rainfall distributions, this study applied the commonly used algorithms of inverse distance weighting (IDW) and Kriging interpolation. These methods are used to generate event-by-event accumulative and maximum hourly rainfall maps based on the rainfall-gauge data. The non-landslide samples were randomly extracted, and the numbers of samples were equal to 1, 4, 7, and 10 times the landslide samples in this study. Then these landslide and non-landslide samples can be used to extract the corresponding landslide factors to produce datasets. The data preprocessing and sampling steps show in Figure 2 in detail.

Developed Machine Learning Based Model
A three-phase study was developed for development of multi-temporal and event-based landslide susceptibility models. In the first phase, RF-based landslide susceptibility models were constructed as shown in Modeling of Figure 2. In the second stage, the cost-sensitive analysis was used to avoid the over-fitting predictions as also shown in Modeling of Figure 2. Finally, the developed models were verified using space-and time-robustness strategies to generate the landslide susceptibility map as shown in Assessment of Figure 2 and Section 2.3. This study employed the RF algorithm [53], which is one of the ML-supervised methods, to construct the multi-temporal and event-based landslide susceptibility models. The concept of RF was similar to the DT algorithm, a classical and popular approach in the related domains. Both methods apply the information gain (IG) measure or Gini index to evaluate the degree of impurity of landslide factors or variables. A landslide-related variable, which has a larger IG or Gini value, must be selected because it has a higher priority for construction of a conditional node, and further, this factor should be ignored in the next computation. After several iterations, a sequence of rules was constructed that can be used to classify other instances. Unlike DT, the RF algorithm randomly separates training data into many subsets using the bootstrap algorithm to build many trees for optimization. Therefore, more favorable results can be obtained using the RF classifier than using the DT algorithm. Discrete (or nominal) and continuous (or numeric) formats are commonly used in the geospatial field. For nominal data, entropy theory is used to compute IG [10], as described in Equations (1)- (3), where E(A) indicates the entropy of all training data; p and n represent the numbers of positive (landslide) and negative (non-landslide) labels, respectively; E'(a) and v indicate the entropy and subset amounts of a specific landslide factor, respectively; E(a j ) is the entropy of the subset in a specific landslide factor computed by Equation (1); and IG(a) means the IG of a specific landslide factor. For numeric data, the Gini index was applied to evaluate the IG [10] as described in Equations (4) and (5), where C indicates a segmented point for a specific landslide factor used to divide numeric data into two parts and N 1 and N 2 represent the numbers of a ≤ C and a > C, respectively.
According to cost matrix, the cost-sensitive analysis or learning can be used to adjust the decision boundary of a developed model, which reclassifies training samples to reduce and to balance the omission or commission error of certain classes [54][55][56]. The size of the cost matrix is equal to that of the confusion matrix. A confusion matrix consists of true positive (TP), false negative (FN), false positive (FP), and true negative (TN). In the cost matrix, diagonal values (i.e., TN and TP) represent the cost of correct classification, and remainder values (i.e., FN and FP) describe the misclassification costs between various classes. In general, the costs of diagonal and other elements are set to 0 and 1, respectively. The decision boundary can be changed by adjusting the cost of incorrect elements to include or exclude more samples for balancing the classification result of a certain class. However, a positive or negative effect on the results of other classes might occur. To balance the false alarm and missing rates and to maintain the reliability and veracity of the overall results, different costs were tested for the target class in this study. More precisely, the optimal prediction (R) for an instance x in class i can be expressed as in Equation (6), where P(j|x) is the probability for estimation of classification of an instance into class j and C represents the cost. The optimal prediction is class 1 (landslide) if and only if the expected cost of this prediction is less than or equal to the expected cost of predicting class 0 (non-landslide), as presented in Equation (7). Given p = P(1|x) and C 00 = C 11 = 0, Equation (7) is equivalent to Equation (8), and a threshold p* can be defined as Equation (10) based on Equation (9), which classifies an instance x as positive if the P(1|x) is larger than or equal to the threshold [57].

Verification and Mapping
To verify the constructed landslide susceptibility models, the quantitative results of space-robustness and time-robustness verifications, which compare the output and reference labels (e.g., landslide or occurrence and non-landslide or non-occurrence) of the check or prediction datasets as identified by a threshold, are derived from the confusion matrix. In most cases, a sample is assigned into the non-landslide class when the landslide likelihood is smaller than 0.5, otherwise, the sample is not classified the non-occurrence label (landslide). The threshold was equal to p* after cost-sensitive analysis was conducted. Further, a landslide susceptibility map can be generated instance by instance based on the landslide likelihoods or output categories (e.g., low, medium and high susceptibilities) of the constructed model, if the verified results are acceptable.
This study adopted four commonly used quantitative indices for verification, namely overall accuracy (OA), precision (also called as the user's accuracy, UA), recall (also called as the producer's accuracy, PA), and area under the receiver operating characteristic (ROC) curve. The PA and UA as described in Equations (11) and (12) can reflect the errors class by class [10], where M is a component in a confusion matrix, Nc represents the number of classes, and i and j indicate the indices for columns and rows in a confusion matrix, respectively. Moreover, 1-recall and 1-precision can reflect the omission (missing) and commission (false alarm) errors, respectively. The OA presents the percentage of correctly classified samples [10], as described in Equation (13). The area under the curve (AUC) for ROC was also commonly used for evaluation of landslide susceptibility models [12,15,[58][59][60], and generally, the value ranged from 0.5 to 1. A larger AUC represents higher reliability.

Results
The modeling process and accuracy assessments in this study were performed using the WEKA platform (http://www.cs.waikato.ac.nz/ml/weka/), which is a free and open-source tool. The main parameter for RF-based computation was the number of trees (Ntree). Du et al. [26] mentioned that 10-200 Ntree did not have any influence on the results, but an increase in Ntree increased the computation loading. Therefore, in this study, 100 trees were used, which was suggested by WEKA, for executing RF-based landslide susceptibility assessments. The developed models were compared with the commonly used methods, namely DT, Bayes network, and logistic regression.

Multitemporal Landslide Susceptibility Assessments
The multi-temporal dataset used in the study included all landslide samples extracted after typhoon events between 2004 and 2008. The non-landslide samples of the same period were randomly extracted, and the numbers of samples were equal to 1, 4, 7, and 10 times the landslide samples in the test cases. These samples can be used to extract the corresponding landslide factors to produce multi-temporal datasets. For modeling, two-thirds of the samples from 2004 to 2007 (referred to as training data) were randomly selected from the multi-temporal dataset to construct the models, and the remainder (referred to as check data) were used for space-robustness verification. The constructed models were also used to predict the time-robustness verification of samples from 2008. Two prediction datasets were used in this study, namely multiple-and single-event samples. The multiple-event dataset indicated that the samples included all instances of 2008. On the contrary, the single-event dataset indicated that the collected records were extracted from a single typhoon event.

Space-robustness Verification
The quantitative evaluations of space-robustness verification are presented in Figure 3. The OA may not fully represent the overall results of the unbalanced sample proportion cases, especially when the large sample class is easily detected, leading to an optimistic evaluation. Thus, the OA measure was only applied in equal sample proportion cases. Three important findings are illustrated in Figure 3. First, the space-robustness verification results revealed that the constructed models can provide flexible or acceptable results in all cases. In other words, the patterns of landslide occurrence from 2004 to 2007 can be described effectively. Second, in cases of unequally proportioned samples, the deterioration was observed in the landslide detection results obtained using the Bayes network and logistic regression algorithms, but the RF-based results were stable. More precisely, the Bayes network-and logistic regression-based models had lower UA and PA accuracies because of the increase in the unbalanced sample proportions. Third, in terms of the modeling performances, the RF-based results outperformed others in all cases.

Time-Robustness Verification with Multiple-Event Samples
The prediction results of quantitative evaluation of the multiple-event samples of 2008 dataset are presented in Figure 4. The accuracies were significantly lower than those for the space-robustness results. Comparing Figure 4 with Figure 3, two points should be emphasized. First, although the landslide's PA was slightly lower in the equal sample proportion case, the prediction capability based on the logistic regression algorithm yielded acceptable results. The RF-based model had higher commission and omission errors in terms of non-landslide and landslide detection, but the PA and UA accuracies were the highest for detection of the non-occurrence and occurrence classes in this case. The performance of the Bayes network algorithm was the lowest. Second, in the unequal sample proportion cases, the non-landslide detection results were improved because of the increase in the number of samples. However, although the logistic regression algorithm provided acceptable UA accuracies in all cases, a smaller number of landslide samples in the modeling process can cause a decrease in occurrence detection. To address this issue, cost-sensitive analysis was employed to adjust the decision boundary during the classification-based modeling. The confusion matrices and quantitative accuracies for predicting the 2008 dataset were calculated using the RF algorithm after adjusting for the decision boundary. The results of the cost-sensitive analysis for different sample proportions are illustrated in Figure 5. The constructed model without cost-sensitive analysis (i.e., cost is equal to 1) had a high FN and low FP, resulting in unbalanced predictions and high missing and low false alarms. To address this issue and to reduce the problem of FN misclassification, the FN cost was increased for re-prediction of the 2008 dataset. A decrease in the TN and FN assignments was observed in the equal sample proportion case (Figure 5a). The trends of FP and TP were obtained from the low cost. In addition, a cross over between TN and TP as well as FN and FP occurred when the cost was 50. Figure 5b illustrates the optimal quantitative result, which was obtained at a cost of 50, where all indices were close to 0.8. For the unbalanced sample cases, an increase in TN values was observed with an increase in the number of non-landslide samples (Figure 5c,e,g). A decreasing trend for TN and FN and an increasing trend for FP and TP were observed with lower costs. A trade-off between landslide omission (missing) and commission (false alarm) errors in the unbalanced sample proportion cases is presented in Figure 5d,f,h. In the cases of unequal sample proportions, the results obtained based on costs of 100, 500, 1000, and 3000 (Figure 5d,f,h) exhibited higher performances, but produced unbalanced predictions between landslide's omission and commission errors.  Same procedures of adjusting decision boundary were applied for the DT, Bayes network, and logistic regression algorithms. Further, Figure 6 compares the representative results (i.e., arrows in Figure 5) of all tests. Increasing PAs in unequal sample ratio cases was more critical than improving UAs in this study because landslide susceptibility assessment was the preceding part of the landslide risk assessment and management framework. Reducing commission errors (false alarm) can be expected through further tasks (e.g., hazard and risk assessments). Therefore, for the unequal sample ratio cases, the arrows in Figure 5 with lower omission errors (missing) were selected as the representative results for the comparison. On the basis of Figure 6, it appeared that the RF method with cost-sensitive analysis outperformed the other methods. In the 1:1 case, the RF model seemed to perform the best, considering that all the accuracy indices were larger than 0.79. The trade-off between the landslide's UA and PA or preserving of the PA was also observed in the unbalanced sample proportion cases. Overall, RF models with cost-sensitive analysis can predict the unbalanced samples most effectively, although a certain degree of false alarms exists. To understand the interaction between the occurrence samples and the models, susceptibility distributions of true occurrence samples were illustrated with histograms in this study. Five susceptibility intervals were considered, namely very high (>80%), high (60%-80%), medium to high (40%-60%), medium to low (20%-40%), and low (<20%), where the higher likelihoods represented higher landslide occurrence potential. Ideally, all susceptibilities of the true landslide occurrence samples should be close to 100%, indicating true occurrence. However, the landslide susceptibility distributions obtained for the 2008 dataset on the basis of the model using the original RF algorithm were much lower than 100% (Table 3). These results also indicated that performing cost-sensitive analysis resulted in more reasonable landslide susceptibility results.

Time-robustness Verification with Single-event Samples
The scenario discussed in this section is similar to that discussed in Section 3.1.2, but each prediction dataset consists of a single typhoon event. This type of verification was designed to explore the event-based prediction capability of multi-temporal landslide susceptibility models. The models were employed to predict landslides associated with Typhoon Fung-wong, Sinlaku, and Jangmi, which occurred in 2008, with different numbers of non-landslide samples. Similar procedures were applied to datasets from 2004 to 2007 for time-robustness verification of the models based on the abovementioned single-event samples. Representative results for prediction for Typhoon Fung-wong, Sinlaku, and Jangmi obtained using different algorithms are presented in Figures 7-9. Examination of these figures revealed that the results produced by the RF method with cost-sensitive analysis were more favourable in most cases. In the 1:1 case, all indices obtained with the RF model for prediction of Typhoon Fung-wong, Sinlaku, and Jangmi were larger than 0.74, 0.73, and 0.82, respectively. A trade-off between the landslide UA and PA or keeping of the landslide PA was also observed in the unbalanced sample proportion cases.  This study also examined the landslide susceptibility distributions of the true occurrence samples for Typhoon Fung-wong, Sinlaku, and Jangmi based on the RF algorithm with cost-sensitive analysis ( Table 4). Table 4 reveals that the developed models produced reasonable landslide susceptibility values for the true occurrence samples.  Abbreviations: L, landslide class; N, non-landslide class.

Susceptibility Mapping
The multi-temporal landslide susceptibility maps generated from the developed models are presented in Figure 10. The RF models constructed without cost-sensitive analysis were too conservative (close to training data of occurrence area) and led to large omission errors. By contrast, the RF models with cost-sensitive analysis produced more stable and reasonable landslide susceptibility maps.

Event-Based Landslide Susceptibility Assessments
To ensure that there were sufficient samples to predict other events, the Typhoon Aere, Matsa, and Sinlaku samples, with the largest numbers of samples (Table 1), were selected to construct the landslide susceptibility models for space-and time-robustness verifications. Two-thirds of samples of each typhoon event were randomly extracted to develop the models, and the remainder samples were the check data. For time-robustness verification, one of the constructed models was employed to predict the other two events. The RF, DT, Bayes network algorithm, and logistic regression algorithm were also integrated with cost-sensitive analysis during the modeling process. In addition, sample ratios of 1:1, 1:4, 1:7, and 1:10 were considered for landslide and non-landslide classes.

Space-Robustness Verification
Procedures similar to those discussed in Section 3.1.1 were applied to the selected typhoon events. The evaluation results are presented in Figure 11. Three important points were observed. First, the results in the equal sample proportion cases presented accuracies greater than 0.85. Second, there was an increase in the number of landslide commission (Typhoon Aere) and omission (Typhoon Aere and Sinlaku) errors in the Bayes network and logistic regression results for some typhoons in the unbalanced sample proportion cases. Third, the results from the RF-and DT-based models remained stable in all cases, especially those of the RF algorithm-based model.

Time-robustness Verification
A procedure similar to that detailed in Section 3.1.3 was applied for Typhoon Matsa, Sinlaku, and Aere predictions. An overall comparison of the best prediction results for Typhoon Matsa, Sinlaku, and Aere is presented in Table 5. On the basis of this table, the accuracies were higher than 0.7 in the 1:1 cases, 0.63 in the 1:4 cases, 0.63 in the 1:7 cases, and 0.6 in the 1:10 cases, except for the landslide UAs of the unequal sample proportion cases. The results also indicated that the RF algorithm with cost-sensitive analysis performed better than the other algorithms. The evaluated landslide susceptibility results of the true occurrence samples for RF models with cost-sensitive analysis are presented in Table 6. Most of the true landslide samples were correctly labeled with high susceptibility. Table 5. Best results of the event-based landslide susceptibility assessments.

Susceptibility Mapping
The event-based landslide susceptibility maps generated based on the constructed models and Typhoon Sinlaku sample, as an example, are presented in Figure 12. The figures reveal that the equal sample ratio produced reasonable spatial patterns for event-based landslide susceptibility modeling, whereas the extremely unbalanced ratios seemed to cause over-fitting of the models because these patterns were close to the training data of occurrence area.

Discussion
The purpose of constructing multi-temporal and event-based landslide susceptibility models was to explore the capability of developed procedures for spatial pattern description and temporal prediction. Experimental results based on space-and time-robustness verifications as well as mapping results demonstrated the feasibility and stability of the developed algorithm (i.e., RF with cost-sensitive analysis). The reason was probably that the bootstrap and optimization steps in the RF algorithms and decision boundary adjustments in the cost-sensitive analysis improved the performance because they can help reduce the problem of over-fitting. However, the extremely unbalanced sample ratio was not suggested for event-based modeling due to the serious over-fitting issue.
In terms of the landslide and non-landslide samples, it was assumed that for most algorithms, misclassification errors have an equal cost. However, in many real-world applications, this assumption is not true. In this study, the results of time-robustness verification of unequal sample proportion cases and quantitative evaluation without cost-sensitive analysis revealed that the FN errors (i.e., missing or omission rates) were more serious than FP errors (i.e., false alarm or commission rates). This result was consistent with a previous study by Desai and Jadav [54]. From a statistical point of view, the low accuracies suggested a significant disagreement between the constructed models and prediction datasets. Two scenarios may have caused prediction errors. First, the predicted landslide samples revealed that similar conditions occurred in the past, but these locations in the prediction data were presently stable. Second, some landslide areas may have been classified into the non-landslide class because the models did not have similar occurrence situations derived from previous experience (training data). These may be the reasons for accuracies lower than those in the space-robustness verification.
The effectiveness of cost-sensitive analysis was explored to deal with landslide susceptibility assessments with balanced and imbalanced class distributions. Reducing omission (missing) errors in unequal sample ratio cases was more important than decreasing commission (false alarm) errors in this study because landslide susceptibility assessment was a preceding part of landslide risk assessment and management framework. Reducing commission errors was expected through further tasks. The experiments demonstrated that the accuracies of the developed models were better than 93% and 75% in most cases for space-and time-robustness verifications, respectively. It has been demonstrated that elaborate back analysis of past landslides gives insight of critical factors affecting their triggering [61,62]. Thus, back analysis on a number of observed past landslides of the region considered may give insight of critical factors affecting landslides and thus verify or adjust the machine learning based models and the factors used. On the basis of generated maps, Figures 10b,d,f,h, and 12a are suggested to be used for landslide risk assessment because the susceptibility distributions are reliable as listed in Tables 3, 4 and 6. Future studies can further include other information to achieve more accurate results of landslide risk assessments and management and to assist in land planning and policy making.

Conclusions
The development, which integrated the RF algorithm and cost-sensitive analysis with the GIS datasets and remotely sensed images, is presented to evaluate multi-temporal and event-based landslide susceptibilities for the Shimen reservoir watershed and is compared with three commonly used algorithms in the related field (i.e., DT, Bayes network, and logistical regression). The experimental results of space-robustness verification indicated that the RF algorithm outperformed the others, and all RF accuracies for the multi-temporal and event-based landslide susceptibility models were higher than 0.93 (Figures 3 and 11). However, using such models to predict other events (time-robustness verification) may not produce plausible results because of the over-fitting issue. It is similar to the results of Rossi et al. [6], Tsai et al. [10] and Chang et al. [46].
To address the over-fitting issue and to improve the prediction capability, cost-sensitive analysis was conducted to adjust decision boundaries. The constructed multi-temporal and event-based models were evaluated and verified. The results in most cases indicated that the developed models produced better and more stable accuracies (Figures 6-9 and Table 5). Regarding the unequal sample ratios in this study, reducing missing error was critical and the commission error was expected to decrease through further tasks in landslide risk assessment and management framework. The accuracies of the constructed models were greater than 0.75 in most cases for time-robustness verifications. To further examine the developed models, the landslide susceptibility distributions of true occurrence samples (Tables 3, 4 and 6) and the generated landslide susceptibility maps (Figures 10 and 12) were compared. They demonstrate that using cost-sensitive analysis can provide more reasonable results than the original algorithms. Furthermore, landslide susceptibility map generated through the results of the developed method revealed that the multi-temporal models were unaffected by the sample ratios, but use of extremely unbalanced sample ratios for event-based modeling was not suggested.
The developed landslide susceptibility modeling framework also contributes to decision and policy making for better land planning and watershed management, and for better understanding of the eco-environmental impacts in the affected areas. In practice, collecting detailed field data is usually costly and time-consuming at a regional scale and may further reduce the effectiveness of knowledge-driven approaches. In such cases using geo-spatial technologies for data acquisition, processing and analysis may provide a more economic and efficient alternative for landslide assessments. On the other hand, in the knowledge-driven approaches, the stability can be determined accurately for specific slope sites, and these results are valuable for other cases. The outcomes of data-driven approaches are usually suitable for areas with long-term observation and data collection. Therefore, combining data-and knowledge-driven approaches for multi-scale landslide analysis and mapping can help support land planning and policy marking. Finally, the effect of the quality of landslide susceptibility models on the consequent tasks could be also examined in the future.

Conflicts of Interest:
The authors declare that they have no conflict of interest.