Landslide Susceptibility Mapping and Interpretation in the Upper Minjiang River Basin

: To enable the accurate assessment of landslide susceptibility in the upper reaches of the Minjiang River Basin, this research intends to spatially compare landslide susceptibility maps obtained from unclassiﬁed landslides directly and the spatial superposition of different types of landslide susceptibility map, and explore interpretability using cartographic principles of the two methods of map-making. This research using the catalogs of rainfall and seismic landslides selected nine background factors those affect the occurrence of landslides through correlation analysis ﬁnally, including lithology, NDVI, elevation, slope, aspect, proﬁle curve, curvature, land use, and distance to faults, to assess rainfall and seismic landslide susceptibility, respectively, by using a WOE-RF coupling model. Then, an evaluation of landslide susceptibility was conducted by merging rainfall and seismic landslides into a dataset that does not distinguish types of landslides; a comparison was also made between the landslide susceptibility maps obtained through the superposition of rainfall and seismic landslide susceptibility maps and unclassiﬁed landslides. Finally, confusion matrix and ROC curve were used to verify the accuracy of the model. It was found that the accuracy of the training set, testing set, and the entire data set based on the WOE-RF model for predicting rainfall landslides were 0.9248, 0.8317, and 0.9347, and the AUC area were 1, 0.949, and 0.955; the accuracy of the training set, testing set, and the entire data set for seismic landslides prediction were 0.9498, 0.9067, and 0.8329, and the AUC area were 1, 0.981, and 0.921; the accuracy of the training set, testing set, and the entire data set for unclassiﬁed landslides prediction were 0.9446, 0.9080, and 0.8352, and the AUC area were 0.9997, 0.9822, and 0.9207. Both of the confusion matrix and the ROC curve indicated that the accuracy of the coupling model is high. The southeast of the line from Mount Xuebaoding to Lixian County is a high landslide prone area, and through the maps, it was found that the extremely high susceptibility area of seismic landslides is located at a higher elevation than rainfall landslides by extracting the extremely high susceptibility zones of both. It was also found that the results of the two methods of evaluating landslide susceptibility were signiﬁcantly different. As for a same background factor, the distribution of the areas occupied by the same landslide occurrence class was not the same according to the two methods, which indicates the necessity of conducting relevant research on distinguishing landslide types.


Introduction
China is one of the countries where geological hazards such as landslides, avalanches, and mudslides occur frequently; geological hazards generally cause serious human and socio-economic losses.The exploration of the spatial relationship between geological hazards and occurrence factors can reduce the risk of geological hazards to a certain extent.Landslide hazard susceptibility is the answer to the question of where landslides are likely to occur in space, and it aims to establish the relationship between landslides and the factors that affect landslide occurrence and evaluate the regional landslide hazard susceptibility based on this relationship [1][2][3].
Canadian geologist Agterberg proposed the weight of evidence model based on the GIS platform [4][5][6]; Dahal et al., 2008 [7] explored the weight of evidence approach in the evaluation of the landslide susceptibility of small watersheds using two small watersheds in Shikoku, Japan as the study area; Sadisun et al., 2021 [8] used Sigi Biromaru as the study area to undertake a landslide susceptibility evaluation based on the weight of evidence model; and Yang et al., 2020 [9] and Hu et al., 2020 [10] evaluated the landslide susceptibility of Jiuzhaigou and Badong County based on the weight of evidence method, respectively, and the studies all showed that the accuracy of the model was high, and then many scholars improved and applied it based on the common weight of evidence method [7,8,11,12].With the development of spatial-temporal data mining, machine learning has also been gradually applied to the evaluation of geological hazards by scholars.Bai et al., 2011 [13] studied the landslide susceptibility of Lianyungang city in China using a rare event logistic regression model in 2011; Mao et al., 2015 [14] found that, compared with plain Bayesian, uncertaintybased Bayesian classification in evaluating the landslide hazard in Baota district of Yan'an city could better reflect the landslide hazard development characteristics; Wang et al., 2021 [15] carried out a landslide susceptibility evaluation in Jingzhou County, Hunan Province using an SVM algorithm.The random forest model is a popular model for landslide susceptibility evaluation that is less sensitive to noise and outliers, less prone to overfitting, more inclusive of data imbalance, and more stable in prediction [16,17].Wu et al., 2017 [18] evaluated the landslide hazard in the Dongjiang Basin based on the random forest model; Liu et al., 2018 [19] evaluated the landslide susceptibility in Sha and Xi township-Xeitan township in the Three Gorges reservoir area using a random forest model.Gradually, deep learning has also been applied to predict the susceptibility of hazards such as landslides owing to its superiority.Mandal et al., 2021 used a CNN convolutional neural network to predict the landslide susceptibility of Rorachu river basin [20]; and Wang et al., 2023 used a deep learning algorithm to predict the landslide susceptibility of the Jiuzhaigou area [21].But deep learning algorithms are often implicit in the expression of knowledge structures, so they have the disadvantage of difficult interpretability.
The coupling of multiple models for landslide susceptibility evaluation is currently receiving more and more attention.Arabameri et al., 2019 evaluated landslide susceptibility in the Gorganroud watershed of northern Iran using the LNRF-LMR coupled model [22]; Pourghasemi et al., 2019 evaluated the susceptibility of flood and landslide hazards in the Lorestan province of Iran using a SWARA-ANFIS-Gray wolf coupled model, respectively [23]; Guo et al., 2019 [24] evaluated the landslide susceptibility of the Wanzhou district in the Three Gorges reservoir area based on a coupled model of weight of evidence and a BP neural network; Li et al., 2021 [25] coupled an information quantity model with a logistic regression model to predict landslide susceptibility in Chongyi county, Jiangxi province; Ma et al., 2022 [26] used a coupled random forest-frequency ratio model to evaluate the landslide susceptibility in Lueyang County, which improved the accuracy by 10.7% and 4.9%, respectively, compared with the two single models; Bai et al., 2022 [27] evaluated the landslide susceptibility in the northeast of Yu based on the entropy index and a random forest coupled model, and the studies all showed that the coupled model has a higher prediction accuracy than the single prediction model.
The multi-hazard direction in hazard science research is one of the hotspots at home and abroad [28][29][30][31][32][33], but, at present, the whole is still dominated by single hazard research.In terms of the susceptibility mapping of hazards, a large number of studies have also mapped the susceptibility of single hazards, such as landslides, floods, forest fires, etc., due to the differences in the background and triggering factors of each hazard [34][35][36] and, among the methods used to span the spatial susceptibility from a single hazard to a multi-hazard, the spatial superposition of single hazards has mainly been used [23].From the big concept of hazards to the small concept of landslides, the same background factor or triggering factor has different impacts on the development of different types of landslides, so when mapping the landslide susceptibility of a certain area, different landslide catalogs should also be used as input.Currently, some scholars have also carried out studies to distinguish landslide types with different trigger factors: Wang et al., 2012 analyzed the causal mechanism of landslide hazards triggered by rainfall and earthquakes through physical simulation experiments [37]; Ding 2013 used suitable models to study the mechanisms of landslides under earthquakes and rainfall [38]; Bai et al., 2013 used a logistic regression model to separately assess rainfall and seismic landslide susceptibility; and the necessity of separating rainfall and seismic landslides was also proposed through related studies [39].In this research, we propose to evaluate the landslide susceptibility of the upper reaches of the Minjiang River Basin by using different mapping principles and using both unclassified and classified types of landslides, respectively.
At present, there has been relatively little research on the classification of rainfall and seismic landslides in the upper reaches of the Minjiang River Basin conducted by predecessors.Therefore, the susceptibility mapping of the two types of landslides is lacking, making it difficult to provide sufficient support for the prevention and control of landslide geological disasters.This research intends to take the upper reaches of the Minjiang River Basin as an example to obtain evaluation maps of rainfall, seismic, and unclassified landslide susceptibility, respectively.The development patterns of different types of landslides based on background factors will be explored at the same time.Meanwhile, this research proposes to refer to the spatial superposition of multiple hazards [40], obtaining spatial comparisons of landslide susceptibility maps obtained from unclassified landslides and the final landslide susceptibility map obtained by overlaying rainfall and seismic landslide susceptibility maps, and it also attempts to explain the variability in spatial distribution in terms of cartographic principles.In addition, this research combines the weight of evidence with the random forest algorithm.The reason for coupling the weight of evidence with the random forest model is that this research mainly investigates the effect of different landslide catalogs on the prediction results of compound landslide susceptibility (i.e., spatial superposition of rainfall and seismic landslide susceptibility maps to obtain the compound landslide susceptibility map, and susceptibility map obtained by using unclassified landslides), and attempts to explain the two results in terms of the mapping principle in order to help elucidate the need to use different landslide catalogs for different problems.The contribution of different cataloged landslides to each background factor is different since the two methods use different landslide catalogs, which makes the spatial relationship between landslides and background factors not the same, so it is necessary to firstly get the spatial relationship to derive the contribution of each background factor to the landslide prediction, as opposed to directly inputting the values of the background factors into the model for prediction.The coupling of the two models can make the prediction results more realistic.

Study Area
The upper Minjiang River Basin is located in the eastern margin of the Tibetan Plateau, covering an area of about 22,000 km 2 [41], and its elevation changes are between 694 and 5840 m.The geological structure is complex, mainly composed of NE-and NWoriented faults, with frequent seismic activity [42].The stratigraphic lithology is dominated by sandstones and siltstones interspersed with micaceous rocks; shales, micaceous, and siltstones; limestones and sandstones; and granitic rocks.It is also influenced by the southwest monsoon, with rainfall concentrated in summer, and it experiences serious effects of geological hazards such as landslides.The region is similar to other regions located on the Tibetan Plateau, with the same active environmental conditions and more intense climate change [43].The upper reaches of the Minjiang River Basin studied in this research mainly include Songpan, Maoxian, Lixian, Wenchuanxian, and Dujiangyan where several major cities are located, and 3343 landslide sites were collected here, including 407 rainfall landslides and 2936 seismic landslides.The catalogued data of landslides used in this research and the overview of the study area are shown in Figure 1.
Remote Sens. 2023, 15, x FOR PEER REVIEW 4 include Songpan, Maoxian, Lixian, Wenchuanxian, and Dujiangyan where several m cities are located, and 3343 landslide sites were collected here, including 407 rainfall l slides and 2936 seismic landslides.The catalogued data of landslides used in this rese and the overview of the study area are shown in Figure 1.

Data
This study used the inventory data of rainfall and seismic landslide point sites i upper reaches of the Minjiang River Basin, as well as background factors related to occurrence of two types of landslides.The specific data and sources are shown in Tab

Data
This study used the inventory data of rainfall and seismic landslide point sites in the upper reaches of the Minjiang River Basin, as well as background factors related to the occurrence of two types of landslides.The specific data and sources are shown in    Factors related to landslide occurrence DTM image with 90 m spatial resolution [41] NDVI Resource and environmental science data registration and publishing system [44] Among the data we used in this research, the rainfall landslides were obtained from the 1:100,000 landslide hazard survey and mapping of China from 1999 to 2008 by the Ministry of Land and Resources of China, and there were 407 rainfall landslide point sites; the seismic landslides were obtained from the Ministry of Land and Resources of China.According to the investigation of landslide disasters caused by the 2008 Wenchuan earthquake, there were a total of 2936 landslide point sites.From Figure 1, it can be seen that landslide point sites were roughly distributed along the main stream of the Minjiang River and its tributaries.The geologic background factors related to the occurrence of landslides (such as slope, slope direction, etc.) were extracted from DTM images with 90 m spatial resolution.The NDVI index data related to the occurrence of landslides were obtained from the year-by-year NDVI maximum dataset of China with 30 m spatial resolution released by Xu Xinliang in the Resource and Environmental Science Data Registration and Publication System.

Weight of Evidence Model
Taking landslide as an example, weight of evidence method is a data-driven model based on the uncertainty of probability and Bayes' law to find the posterior probability of landslide occurrence by spatially superimposing the evidence factors assigned weights based on exploring the spatial correlation between landslides and the evidence factors affecting landslide occurrence.
The basic principle is as follows [45,46]: Suppose there are m landslide point sites in the study area.Firstly, the study area is spatially divided into a grid according to a certain scale, and there is only one landslide point site in each grid, then, the priori probability of landslide occurrence in the study area is (m/a), and n 1 , n 2 , n 3 , etc. are n geological factors related to landslide occurrence.Choose the jth geological factor, make n j that a geological factor exists and ¬n j that a geological factor does not exist, then overlay the landslide layer with the geological factor n j layer to obtain 4 cases of n j ∩ m, ¬n j ∩ m, n j ∩ ¬m, and ¬n j ∩ ¬m, and the following 4 conditional probabilities can be defined based on these: P m|¬n j = ¬n j ∩ m ¬n j P ¬m|¬n j = ¬n j ∩ ¬m ¬n j (4) Based on the above four conditional probability formulas, the Bayes' law yields the following: P m|¬n j = P ¬n j |m P(m) P(¬n j (5) The jth evidence layer has positive weight W + j and negative weight W − j , respectively, and is taken as W + j when n j exists and W − j when n j does not exist: W − j = ln P(¬n j |m) This leads to a relationship between the evidence power of the n j layer and the likelihood ratio and conditional ratio of the occurrence of landslides, which is expressed as the evidence power in the form of In this research, we obtain the total weight W of the evidence factors by finding each evidence factor W + j and W − j based on the SDM toolbox developed on Arcgis platform.

Random Forest Model
Breiman [47] proposed random forest as an integrated method in 2001.The sampling method used in random forest is bootstrap resampling, and its randomness lies in the random sampling of the feature factors and the number of samples.Then, decision trees are used to model each bootstrap sample based on the extracted samples.When modeling the random forest, the number of decision trees ultimately used to construct the random forest can be determined by the trend of the model's error variation.The decision trees are independent of each other, and each one of them yields a classification result; the final classification result is obtained by the result of the plurality of all decision trees.
The construction of a decision tree often goes through 3 processes: feature selection, decision tree generation, and decision tree pruning.The selection of features is based on the information gain.The information gain of feature factor n j on dataset d is denoted as g d, n j , which is the difference between the empirical entropy H (d) of dataset d and the empirical conditional entropy of d under the given condition of feature factor n j [48].
Suppose X is a discrete random variable taking finite values with probability distribution P(X = x i )= p i , I = 1, 2, . . . n, then the entropy of the random variable X is as follows: H ( X )= − ∑ n i=1 p i logp i (10) Suppose the random variable (X, Y) with joint probability distribution P(X = x i , Y = y i )= p ij , i = 1, 2, . . . n; j = 1, 2, . . . m, then the conditional entropy is as follows: where p i = P ( X = x i ), i = 1, 2, . . . n.Then, the information gain of feature factor n j on dataset D is noted as follows: The criterion for decision tree generation is to select the feature factors with the greatest information gain in order from the root node for the construction of root and leaf nodes.To improve the generalization ability of the model, the generated decision tree also needs to be pruned.

WOE-RF Model
The weight value calculated in the WOE model is an expression of the spatial relationship between the factors affecting the occurrence of landslides and landslides.By calculating the weight of factors on landslides, the secondary factors with similar effects on landslides can be grouped into one category based on the weight value, which can reduce the redundancy of the input data of machine learning models and improve the accuracy of machine learning to a certain extent.Figure 2 shows the schematic diagram of the application of the combination of the weight of evidence and the random forest model.
tionship between the factors affecting the occurrence of landslides and landslides.By calculating the weight of factors on landslides, the secondary factors with similar effects on landslides can be grouped into one category based on the weight value, which can reduce the redundancy of the input data of machine learning models and improve the accuracy of machine learning to a certain extent.Figure 2 shows the schematic diagram of the application of the combination of the weight of evidence and the random forest model.

Accuracy Validation of WOE-RF Model
In this research, the confusion matrix [49] and the ROC curve [50] are used to validate the accuracy of the WOF-RF model.
Through the confusion matrix, we can obtain four cases: actual landslide and predicted landslide (TP), actual not landslide and predicted not landslide (TN), actual landslide but predicted not landslide (FN), and actual not landslide but predicted landslide (FP), from which we can obtain the model prediction accuracy and recall based on the formula: The accuracy of the model was also evaluated by AUC area under the ROC curve, with the horizontal axis of the ROC curve representing false positives and the vertical axis representing true positives.Generally, the larger the AUC area under the ROC curve, the higher the accuracy is indicated.The above two ways of evaluating the prediction accuracy of the model can roughly assess the applicability of the model.

Landslide Susceptibility Classification
The obtained landslide susceptibility maps of rainfall, seismic, and unclassified landslide in the upper Minjiang River Basin were classified based on natural intermittent points [10,51], and the natural intermittent point method can be able to make the smallest differences within the classified classes and the largest differences between classes.Since the landslide and non-landslide threshold in this research was set to 0.9, the value of class 5 will be changed to [0.9, 1] on the basis of the natural interruption point classification.

Landslide Background Factors Pretreatment
Considering the area of the upper reaches of the Minjiang River Basin, the distribution of landslides, and the accuracy of the original topography, this research chose to establish a 200 × 200 m grid for analysis, and initially selected lithology, NDVI, elevation, slope, aspect, relief amplitude, surface cutting depth, profile curve, plan curve, curvature, land use, distance to faults, and distance to rivers for analysis, which are the 13 most relevant background factors to the landslides in the upper reaches of Minjiang River Basin.Among them, two linear elements of faults as well as rivers were buffered at 200 m intervals throughout the whole study area, and correlation analysis was performed for the 13 selected factors, then correlation coefficients that were less than 0.3 were considered uncorrelated between the factors [52]; finally, there were nine background factors involved in the model building through correlation analysis.The nine landslide susceptibility modeling background factors included the following: lithology, NDVI, elevation, slope, aspect, profile curve, curvature, land use, and distance to faults.Among them, the two discrete variables of lithology and land use by category were renumbered; the aspects were divided into nine categories according to the values, where -1 was divided into plane, (0, 22.5) and (337.5, 360) for north, (22.5, 67.5) for northeast, (67.5, 112.5) for east, (112.5, 157.5) for southeast, (157.5, 202.5) for south, (202.5, 247.5) for southwest, (247.5, 292.5) for west, and (292.5, 337.5) for northwest; the remaining continuous type factors were divided into equal intervals according to the distribution of the data.The specific factor grades are shown in Figure 3.

Acquisition of Weights for Landslide Background Factors
Based on the nine landslide background factors of lithology, NDVI, elevation, slope, aspect, profile curve, curvature, land use, and distance to faults, the factor weights of rainfall, seismic, and unclassified landslides were obtained after grading; the weights of the nine factors are as shown in Tables 2-4, respectively:

Acquisition of Weights for Landslide Background Factors
Based on the nine landslide background factors of lithology, NDVI, elevation, slope, aspect, profile curve, curvature, land use, and distance to faults, the factor weights of rainfall, seismic, and unclassified landslides were obtained after grading; the weights of the nine factors are as shown in Tables 2-4, respectively:   The 407 rainfall and 2936 seismic landslides collected were used as positive samples for the respective models, then a 5 km buffer zone was established with the positive sample points of rainfall and seismic landslides as the center, and the negative samples of the two types of landslides were selected from outside the buffer zone according to the ratio of positive samples:negative samples of 1:1 to form the data set; the data sets of rainfall and seismic landslides were randomly selected according to 3:1 to form the training set and testing set, and input into the random forest model respectively.
When the number of decision trees are 3000 and 2000, respectively, the errors of rainfall and seismic landslide random forest models tend to stabilize.Based on the average reduction Gini coefficients obtained from the constructed models, the importance ranking of the two influence factors was obtained.The importance of the background factors of rainfall landslides from high to low are as follows: DEM, land use, distance to faults, lithology, profile curve, NDVI, aspect, slope, curvature, and the importance of the background factors of seismic landslides from high to low are as follows: DEM, lithology, distance to faults, slope, land use, NDVI, aspect, curvature, and profile curve.
The rainfall and seismic landslide susceptibility maps obtained based on the WOE-RF model are shown in Figure 4a,b, respectively, and the susceptibility maps of the two types of landslides are divided into five zones, namely, extremely high, high, medium, low, and extremely low based on natural interruption points.The statistical results of the distribution of rainfall and seismic landslides on each zone are shown in Table 5. Figure 4 and Table 5 all show that both rainfall and seismic landslides generally show the trend that the lower the landslide susceptibility level is, the less the known landslides are distributed on it and the larger the proportion of its area of the whole study area.Figure 5a,b show extremely high susceptibility areas of rainfall and seismic landslides.Through comparison, it can be found that the similarities between the two are that both extremely high areas of rainfall and seismic landslides are distributed along the main parts and tributaries of rivers; both rainfall and seismic landslides are prone in Maoxian, Lixian, Wenchuanxian, and Dujiangyan, whereas Songpanxian is less prone to landslides compared with the other four counties and cities, especially rainfall landslides; along the line from Xueboding mountain to Lixian, there are relatively few rainfall and seismic landslides in the northwest, while they are mainly concentrated in the southeast area of the upper Minjiang River Basin.
The distribution of the three geological conditions of elevation, land use, and lithology, which are the most important for the evaluation of the susceptibility of rainfall and seismic landslides in extremely high susceptibility areas, were analyzed separately, and the results are shown in Figure 6:   Figure 6a,b show that the extremely high susceptibility areas of rainfall and seismic landslides are concentrated at the altitudes of 1000-2500 m and 1500-3000 m, respectively, which indicates that the development of seismic landslides is higher than that of rainfall landslides.Figure 6c,d, 1-7 represent Garden plot, Woodland, Land for water bodies and water conservancy facilities, Grassland, Commercial area, Land for industrial and mining warehousing, and Other land, respectively, and Figure 6c,d show that both rainfall and seismic landslides occur easily in Garden plot, Woodland, and Grassland.Figure 6e,f, 1-15 represent Sandstone and siltstone interbedded with phyllite; Shale, phyllite, and siltstone; Granitic rocks; Syenite; Diorite; Limestone and sandstone; Unconsolidated deposits; Sandstone, siltstone, and shale; Limestone intercalated with shale; Limestone, sandstone, and shale; Limestone and dolomite intercalated with phyllite; Dolomite, silicalite, phyllite, sandstone, and siltstone; Amphibolite; Sandstone and siltstone intercalated with slate; and Sandstone and siltstone interbedded with shale.Figure 6e,f show that both rainfall and seismic landslides are likely to occur in Sandstone and siltstone interbedded with phyllite; Granitic rocks; Limestone and sandstone; Limestone and dolomite intercalated with phyllite; Dolomite, silicalite, phyllite, sandstone, and siltstone; and Amphibolite.The distribution of the three geological conditions of elevation, land use, and lithology, which are the most important for the evaluation of the susceptibility of rainfall and seismic landslides in extremely high susceptibility areas, were analyzed separately, and the results are shown in Figure 6:

Comparison of Landslide Susceptibility Areas from Two Methods
Due to the different ranges of background factors that favor the occurrence of rainfall and seismic landslides, many studies currently do not distinguish the types of landslides (rainfall-type, seismic-type, or others) when evaluating landslide susceptibility, which may lead to inaccurate results.Therefore, this research attempts to spatially overlay different types of landslide susceptibility maps after distinguishing the types of landslides, and then spatially compare with the landslide susceptibility map without distinguishing the types of landslides in order to see the differences in the distribution of the same class of landslide prone areas between the two.
The collected 3343 landslides of unclassified landslides (after removing landslides distributed in the same grid, 3286 landslides remained) are used as positive samples for the model, a 5 km buffer zone is established with the positive sample points as the center, and the negative samples of landslides are selected from outside the buffer zone in the ratio of positive samples: negative samples of 1:1 to form the data set; then, the unclassified landslide data set is randomly generated in the ratio of 3:1 to form the training set and testing set, and input to the random forest model.When the number of decision trees is 3000, the error of the random forest model tends to be stable.Based on the average reduction Gini coefficient obtained from the constructed model, the importance ranking of the influencing factors obtained from the highest to the lowest are as follows: DEM, lithology, distance to faults, NDVI, slope, land use, aspect, profile curve, and curvature.
The landslide susceptibility map of unclassified landslides obtained based on the WOE-RF model is shown in Figure 7.The landslide susceptibility maps are divided into five classes based on natural interruption points: extremely high, high, medium, low, and extremely low, and the statistical results of landslide distribution on each interval are shown in Table 6. Figure 7a shows the landslide susceptibility map of the upper Minjiang River Basin calculated based on the WOE-RF model regardless of landslide type.Figure 7b shows the spatial superposition of the rainfall and seismic landslide susceptibility maps, and classifies (extremely high, extremely high), (extremely high, high), (extremely high, medium), (extremely high, low), (extremely high, extremely low) as extremely high landslide susceptibility zones; (high, high), (high, medium), (high, low), (high, very low) as high landslide susceptibility zones; (medium, medium), (medium, low), (medium, extremely low) as medium landslide susceptibility zones; (low, low), (low, extremely low) as low landslide susceptibility zones; and (extremely low, extremely low) as extremely low landslide susceptibility zones [32].
The statistical information on the landslide susceptibility zones in the upper reaches of the Minjiang River Basin obtained by the two methods is shown in Table 6, both of which show that the higher the grade of the landslide susceptibility zone, the more the number of known landslides falling into the zone.The difference in the proportion of known landslides in the corresponding susceptibility zones to all known landslides is 1.66%, 1.3%, 0.33%, 0.46%, and 0.23%, respectively, all within 2%; the difference in the ratio of each landslide susceptibility zone to all known landslides is 0.5%, 1.42%, 4.44%, 10.22%, 16.58%, respectively; the overlapping areas of the corresponding susceptibility intervals were superimposed and found to be 84.91%,48.20%, 39.39%, 44.84%, and 65.44% for extremely high, high, medium, low, and extremely low, respectively.The results of the landslide susceptibility assessment obtained using spatial superposition and unclassified landslides are compared by using proportional statistics, and the results are shown in Figure 8.The solid line shows the distribution of different grades of landslide susceptibility obtained by using the unclassified landslides, and the dashed line shows the distribution of different grades of landslide susceptibility obtained using the spatial superposition of rainfall and seismic landslide susceptibility maps.The results of the landslide susceptibility assessment obtained using spatial superposition and unclassified landslides are compared by using proportional statistics, and the results are shown in Figure 8.The solid line shows the distribution of different grades of landslide susceptibility obtained by using the unclassified landslides, and the dashed line shows the distribution of different grades of landslide susceptibility obtained using the spatial superposition of rainfall and seismic landslide susceptibility maps.

WOF-RF Model Accuracy Evaluation (1) Confusion matrix
Based on the confusion matrix and calculated from Table 7, the accuracy, recall of rainfall, and seismic and unclassified landslides can be calculated separately, as shown in Table 8: As can be seen from Table 8, the accuracy of rainfall, seismic and unclassified landslide dataset are above 80%; meanwhile, except for the lower recall rate of the rainfall landslide test set, all the others are above 80%, although the recall rate of rainfall landslide testing set is lower, and the recall rate of the whole dataset of the rainfall landslide is above 90%, so both the accuracy and recall rate show that the accuracy of the model is high.There are the same results between the two resources of landslide susceptibility: for the elevation factor, the predicted landslide areas for extremely low, low, medium, high, and extremely high grades are concentrated in (3500, 4500), (3000, 4000), (2500, 4000), (2500, 3500), and (1500, 3000) for both.For the lithology factor, extremely low, low, medium, high, and extremely high grades are concentrated in Sandstone and siltstone interbedded with phyllite, Granitic rocks; Sandstone and siltstone interbedded with phyllite; Sandstone and siltstone interbedded with phyllite, limestone, and sandstone; Sandstone, and siltstone interbedded with phyllite, limestone and sandstone, and limestone and dolomite intercalated with phyllite; Sandstone and siltstone interbedded with phyllite, limestone and sandstone, limestone and dolomite intercalated with phyllite, and Amphibolite; they are all concentrated in Sandstone and siltstone interbedded with phyllite.As for the fault factor, extremely low-, low-, medium-and high-grade landslide prediction areas are mainly concentrated in (0, 150), which means (0, 30,000) m, and extremely high-grade areas are concentrated in (0, 100), which means (0, 20,000) m.For the NDVI factor, extremely low-, low-, medium-, high-, and extremely high-grade landslide prediction areas are mainly distributed in (6000, 9000).For slope factor, extremely low-and low-grade landslide prediction areas are mainly distributed in (10,40), and medium, high, and extremely high grade are mainly distributed in (10,50).For the land use factor, extremely low-, low-, medium-, high-, and extremely high-grade landslide prediction areas are mainly distributed in Garden plot and Woodland.For the aspect factor, except for the plane direction, the distributions of extremely low, low, medium, high and extremely high landslide prediction areas are not much different in each slope direction, and the distribution is relatively uniform.For the profile curve and curvature factor, extremely low, low, medium, high, and extremely high landslide prediction areas are all concentrated in (-10, 10).

WOF-RF Model Accuracy Evaluation (1) Confusion matrix
Based on the confusion matrix and calculated from Table 7, the accuracy, recall of rainfall, and seismic and unclassified landslides can be calculated separately, as shown in Table 8: As can be seen from Table 8, the accuracy of rainfall, seismic and unclassified landslide dataset are above 80%; meanwhile, except for the lower recall rate of the rainfall landslide test set, all the others are above 80%, although the recall rate of rainfall landslide testing set is lower, and the recall rate of the whole dataset of the rainfall landslide is above 90%, so both the accuracy and recall rate show that the accuracy of the model is high.
(2) ROC curve Figure 9a,b, and c show the ROC curves of rainfall, seismic, and unclassified landslides, respectively.Figure 9a shows that the AUC areas of the rainfall landslide training set, testing set, and the whole data set are 0.9997, 0.9485, and 0.9547, respectively.Figure 9b shows that the AUC areas of the seismic landslide training set, testing set, and the whole data set are 0.9996, 0.9809, and 0.9211, respectively.Figure 9c shows that the AUC areas of the unclassified landslide training set, testing set, and the whole data set are 0.9997, 0.9822, and 0.9207, respectively.It can be seen that the WOE-RF model has a high evaluation accuracy for all three landslide datasets.The importance ranking of the factors for evaluating the susceptibility of rai

Comparison of Rainfall and Seismic Landslide Susceptibility in the Upper Reaches of Minjiang River Basin
The importance ranking of the factors for evaluating the susceptibility of rainfall landslides in the upper reaches of Minjiang River Basin is as follows: DEM, land use, distance to faults, lithology, profile curve, NDVI, aspect, slope, and curvature.The importance ranking of the factors for evaluating the susceptibility of seismic landslides in the upper reaches of Minjiang River Basin is as follows: DEM, lithology, distance to faults, slope, land use, NDVI, aspect, curvature, and profile curve.In terms of the ranking of the importance of factors, DEM has the greatest influence on the development of rainfall and seismic landslides, indicating that unstable slope movement develops at a specific elevation location; in addition, land use and lithology are the second most important factors affecting rainfall and seismic landslides, respectively; the distance to faults is the third most important factor affecting rainfall and seismic landslides.Seismic landslides are commonly found in places close to the faults, and the reason why faults have a greater influence on rainfall landslides in this study may be that unstable activities at faults provide a potential source of material for rainfall, and therefore shallow rainfall landslides are likely to occur when rainfall occurs.However, the specific causes need to be analyzed in depth in the context of the complex geology of the upper Minjiang River Basin.By comparing the two susceptibility maps, it was found that the areas with high susceptibility levels are roughly distributed along the rivers in terms of spatial distribution, and the southeast direction of the line from Xuebaoding mountain to Lixian is the high landslide occurrence area for both.The extremely high susceptibility areas of the two were extracted separately and their distributions in DEM, land use, and lithology factors were analyzed.It was found that the differences between the distributions of the two in terms of land use and lithology are small, and seismic landslides are more likely to occur at a height of 1500-3000 m, while rainfall landslides are more likely to occur at a height of 1000-2500 m.This may be related to the triggering principles of the two, with most seismic landslides being bedrock landslides and most rainfall landslides being shallow landslides; meanwhile, this is also consistent with the conclusion that "rainfall landslides are more likely to occur at lower slopes, while earthquake landslides are more likely to occur at steeper slopes" in the upper reaches of Minjiang River Basin, obtained using a statistical method of landslide proportion by Bai et al. [41].Therefore, from the above results, it is clear that we cannot simply regard different types of landslides as the same when conducting detailed studies.

Comparison of Two Mapping Methods for Landslide Susceptibility in the Upper Minjiang River Basin
The two methods of obtaining landslide susceptibility in the upper reaches of the Minjiang River Basin do not differ greatly in terms of the distribution of known landslides on each susceptibility interval and the proportion of area occupied by each susceptibility interval when directly performing machine learning on unclassified landslides, as well as spatial stacking on the susceptibility of rainfall and seismic landslides.However, when the spatial superposition of the two methods was carried out, it was found that extremely high and extremely low susceptibility zones had 84.91% and 65.44% overlap, respectively, while the other three susceptibility zones did not overlap by more than 50%, indicating that the prediction results of the two methods are different.At the same time, statistical analysis was done on the distribution of both corresponding landslide prediction grading areas in terms of background factors.The differences in the distribution of predicted results in terms of lithology, NDVI, slope, aspect, profile curve, and curvature are within 10%, but the differences in the distribution of both low and medium landslide prediction grading areas in terms of elevation are larger; the differences in the distribution of medium-graded landslide prediction areas in terms of distance to faults are larger; the differences in the distribution of low-graded landslide prediction areas in terms of land use are larger.
The reasons for the differences are speculated as follows: for unclassified landslides composed of rainfall and seismic landslides, the spatial relationship between the two types of landslides and the background factors affecting the landslide development is considered at the same time.Whether from the calculation of weight or the machine learning algorithm of random forest, it is equivalent to taking the union of the results that are conducive to both rainfall landslides and seismic landslides.For the landslide susceptibility map generated by spatial stacking of rainfall and seismic landslide susceptibility maps, the initial consideration was given to the demand for background factors for different types of landslides, as the development of different types of landslides requires different requirements for background factors.At the same time, the final landslide susceptibility level determined by stacking is based on the maximum susceptibility interval of different types of landslides before stacking.From this perspective, in fact, there have been differences since the initial calculation of weights that express the spatial relationship between landslides and factors.Considering the types of landslides, the spatial relationships between rainfall, seismic landslides, and background factors were obtained, and the same background factor was given different weights, respectively, when machine learning was conducted.The relationship between the two types of landslides and background factors was also considered separately.When not considering the type of landslides, the weight of the obtained factors took into account rainfall and seismic landslides at the same time, and the same factor only had one weight that expresses the relationship between all landslides and factors, which also affected the subsequent calculation of machine learning.When conducting machine learning algorithms simultaneously, the situation where two types of landslides coexist was also considered.Based on the above analysis, it can be determined that there may be significant differences between the landslide susceptibility obtained using the spatial stacking of rainfall and seismic landslide susceptibility maps and the susceptibility results obtained directly using unclassified landslides.
It can be seen that not every scenario can be considered without the type of landslide.For the assessment of landslide susceptibility, it is necessary to separate different types of landslides when the needs for landslide background factors are not exactly the same for both types of landslides.All in all, it is essential to judge whether the different types of landslides need to be viewed separately for different problem scenarios and to solve the detailed problems.

Limitations and Prospects
The factors affecting the occurrence of landslides can be divided into background factors and triggering factors; this research is based on using the background factors affecting the occurrence of landslides to evaluate the spatial susceptibility of rainfall and seismic landslides in the upper reaches of the Minjiang River Basin.However, the role of trigger factors cannot be ignored.Based on this research, rainfall factors should be considered for inclusion in the study of rainfall landslides, and peak ground acceleration factor should be considered for inclusion in the study of seismic landslides.
In addition, in the susceptibility mapping of landslides in the upper reaches of the Minjiang River Basin, it was found that the landslide susceptibility maps obtained using the spatial superposition of rainfall and seismic landslide susceptibility maps differed significantly from those obtained directly from unclassified landslides, which on the one hand illustrates the necessity of classifying landslides for specific problems.At the same time, the respective weights of the two landslide susceptibility layers were not considered when the spatial overlay was conducted, and the distribution of the two different types of landslides in terms of each geological factor shows that they have different needs for background factors.In addition, the changes in weights due to the frequency of landslide occurrences of both can be considered in the spatial overlay of the two in the future.

Conclusions
This research takes the upper reaches of the Minjiang River Basin as the study area and evaluates the susceptibility of rainfall, seismic, and unclassified landslides based on WOE-RF.Then, a spatial comparison of landslide susceptibility maps in the upper reaches of the Minjiang River Basin, obtained by overlaying landslide susceptibility maps from rainfall and earthquake landslides and directly from unclassified landslides, was conducted: (1) In terms of model construction, the event impact factors entered into the machine learning model in previous studies are often assigned weights by the expert scoring method, AHP, and other biased subjective methods.In this research, we used a purely data-driven weight of evidence method without human intervention to assign corresponding weights to each factor to participate in the calculation of the model, and the factor weights obtained from weight of evidence are the expression of the spatial relationship between landslides and the factors influencing the occurrence of the landslides, which can reduce the redundancy of the data input to the machine learning model to a certain extent.(2) In terms of spatial location distribution, rainfall and seismic landslides have the following points in common: they are prone to occur along rivers; landslides are more likely to occur in Maoxian, Lixian, Wenchuanxian, and Dujiangyancity, while landslides are less likely to occur in Songpanxian; and landslides are more likely to occur in the southeast of the line from Xuebaoding to Lixian.In terms of the distribution of geological factors, seismic landslides are distributed at a slightly higher elevation than rainfall landslides, whilst land use and lithological conditions in both susceptible areas are similar.(3) The differences between the landslide susceptibility maps obtained by superimposing rainfall and seismic landslide susceptibility maps and the result obtained by directly using unclassified landslides are large, which is mainly caused by the difference in the principles of the two mapping methods and shows that it is important to see whether it is necessary to differentiate the types of landslides for solving the problems in different contexts.(4) The accuracy of the rainfall, seismic, and unclassified landslide models calculated from the confusion matrix are all above 80%, and the AUC area is greater than 0.9, both of which indicate the high accuracy of the WOE-RF model.

Figure 1 .
Figure 1.Overview of the study area.

Figure 1 .
Figure 1.Overview of the study area.

Figure 2 .
Figure 2. The Flow Chart of WOE-RF Coupling Model Application.

Figure 2 .
Figure 2. The Flow Chart of WOE-RF Coupling Model Application.

29 Figure 3 .
Figure 3. Classification of landslide background factors in the upper reaches of Minjiang River Basin.

Figure 3 .
Figure 3. Classification of landslide background factors in the upper reaches of Minjiang River Basin.

Figure 4 .
Figure 4. (a) Spatial distribution of rainfall landslides in the upper reaches of Minjiang River Basin; (b) Spatial distribution of seismic landslides in the upper reaches of Minjiang River Basin.

Figure
Figure5a,b show extremely high susceptibility areas of rainfall and seismic landslides.Through comparison, it can be found that the similarities between the two are that both extremely high areas of rainfall and seismic landslides are distributed along the main parts and tributaries of rivers; both rainfall and seismic landslides are prone in Maoxian, Lixian, Wenchuanxian, and Dujiangyan, whereas Songpanxian is less prone to landslides compared with the other four counties and cities, especially rainfall landslides; along the line from Xueboding mountain to Lixian, there are relatively few rainfall and seismic landslides in the northwest, while they are mainly concentrated in the southeast area of the upper Minjiang River Basin.

Figure 4 .
Figure 4. (a) Spatial distribution of rainfall landslides in the upper reaches of Minjiang River Basin; (b) Spatial distribution of seismic landslides in the upper reaches of Minjiang River Basin.

29 Figure 5 .
Figure 5. (a) Extremely high prone areas of rainfall landslides in the upper reaches of Minjiang River Basin; (b) Extremely high prone areas of seismic landslides in the upper reaches of Minjiang River Basin.

Figure 5 .
Figure 5. (a) Extremely high prone areas of rainfall landslides in the upper reaches of Minjiang River Basin; (b) Extremely high prone areas of seismic landslides in the upper reaches of Minjiang River Basin.

Figure 6 .
Figure 6.Comparison of distribution of rainfall and seismic landslides that are highly prone to occur in elevation, land use, and lithology ((a) Distribution of highly prone areas of rainfall landslide in elevation; (b) Distribution of highly prone areas of seismic landslide in elevation; (c) Distribution of highly prone areas of rainfall landslide in land use; (d) Distribution of highly prone areas of seismic landslide in land use; (e) Distribution of highly prone areas of rainfall landslide in lithology; (f) Distribution of highly prone areas of seismic landslide in lithology).

Figure
Figure6a,b show that the extremely high susceptibility areas of rainfall and seismic landslides are concentrated at the altitudes of 1000-2500 m and 1500-3000 m, respectively, which indicates that the development of seismic landslides is higher than that of rainfall landslides.Figure6c,d, 1-7 represent Garden plot, Woodland, Land for water bodies and water conservancy facilities, Grassland, Commercial area, Land for industrial and mining warehousing, and Other land, respectively, and Figure 6c,d show that both rainfall and

Figure 6 .
Figure 6.Comparison of distribution of rainfall and seismic landslides that are highly prone to occur in elevation, land use, and lithology ((a) Distribution of highly prone areas of rainfall landslide in elevation; (b) Distribution of highly prone areas of seismic landslide in elevation; (c) Distribution of highly prone areas of rainfall landslide in land use; (d) Distribution of highly prone areas of seismic landslide in land use; (e) Distribution of highly prone areas of rainfall landslide in lithology; (f) Distribution of highly prone areas of seismic landslide in lithology).

29 Figure 7 .
Figure 7. Zoning map of unclassified landslide susceptibility in the upper reaches of Minjiang River Basin ((a) Unclassified landslides susceptibility map; (b) Spatial overlay landslides susceptibility map).

Figure 7 .
Figure 7. Zoning map of unclassified landslide susceptibility in the upper reaches of Minjiang River Basin ((a) Unclassified landslides susceptibility map; (b) Spatial overlay landslides susceptibility map).

29 Figure 8 .
Figure 8. Predicted distribution results of different landslide susceptibility classes on background factors ((a) Distribution on elevation; (b) Distribution on lithology; (c) Distribution on distance to faults; (d) Distribution on NDVI; (e) Distribution on slope; (f) Distribution on land use; (g) Distribution on aspect; (h) Distribution on profile curve; (i) Distribution on curvature).

Figure 8 .
Figure 8. Predicted distribution results of different landslide susceptibility classes on background factors ((a) Distribution on elevation; (b) Distribution on lithology; (c) Distribution on distance to faults; (d) Distribution on NDVI; (e) Distribution on slope; (f) Distribution on land use; (g) Distribution on aspect; (h) Distribution on profile curve; (i) Distribution on curvature).

Figure
Figure 8a-i show the distribution of both in terms of elevation, lithology, distance to faults, NDVI, slope, land use, aspect, profile curve, and curvature, respectively.There are the same results between the two resources of landslide susceptibility: for the elevation factor, the predicted landslide areas for extremely low, low, medium, high, and extremely high grades are concentrated in (3500, 4500), (3000, 4000), (2500, 4000), (2500, 3500), and (1500, 3000) for both.For the lithology factor, extremely low, low, medium, high, and extremely high grades are concentrated in Sandstone and siltstone interbedded with phyllite, Granitic rocks; Sandstone and siltstone interbedded with phyllite; Sandstone and siltstone interbedded with phyllite, limestone, and sandstone; Sandstone, and siltstone interbedded with phyllite, limestone and sandstone, and limestone and dolomite intercalated with phyllite; Sandstone and siltstone interbedded with phyllite, limestone and sandstone, limestone and dolomite intercalated with phyllite, and Amphibolite; they are all concentrated in Sandstone and siltstone interbedded with phyllite.As for the fault factor, extremely low-, low-, medium-and high-grade landslide prediction areas are mainly

Figure 9 . 1 .
Figure 9. (a) ROC curve of rainfall landslide in the upper reaches of Minjiang River Basin; (b) curve of seismic landslide in the upper reaches of Minjiang River Basin; (c) ROC curve of unc fied landslide in the upper reaches of Minjiang River Basin. 5. Discussion 5.1.Comparison of Rainfall and Seismic Landslide Susceptibility in the Upper Reaches of Minjiang River Basin

Figure 9 .
Figure 9. (a) ROC curve of rainfall landslide in the upper reaches of Minjiang River Basin; (b) ROC curve of seismic landslide in the upper reaches of Minjiang River Basin; (c) ROC curve of unclassified landslide in the upper reaches of Minjiang River Basin.

Table 1 .
Data sources for spatial susceptibility assessment of landslides in the upper reaches of jiang River Basin.
[44]mic landslides Ministry of Land and Resources of China: vestigation of Landslide Hazard Caused 2008 Wenchuan Earthquake in China [4 Factors related to landslide occurrence DTM image with 90 m spatial resolution [NDVIResource and environmental science data r tration and publishing system[44]

Table 1 .
Data sources for spatial susceptibility assessment of landslides in the upper reaches of Minjiang River Basin.

Table 2 .
Weight value of rainfall landslides background factors.

Table 3 .
Weight value of seismic landslides background factors.

Table 4 .
Weight value of unclassified landslides background factors.

Table 5 .
Susceptibility classification and distribution of rainfall and seismic landslides in the upper reaches of Minjiang River Basin.

Table 6 .
Susceptibility classification and distribution of unclassified landslides in the upper reaches of Minjiang River Basin.

Table 7 .
Confusion matrix of rainfall, seismic, unclassified landslides' training set, testing set, and the whole data set.

Table 8 .
Accuracy, recall of rainfall, seismic, unclassified landslides' training set, testing set, and the whole data set.