A Landslide Susceptibility Assessment Method Based on GIS Technology and an AHP-Weighted Information Content Method: A Case Study of Southern Anhui, China

Based on geographic information system (GIS) technology in conjunction with two methods for assessing landslide susceptibility (LS)—namely, a method using experts’ knowledge and experience, and a mathematical/statistical method—the LS of southern Anhui, China is assessed using an analytic hierarchy process (AHP) via an AHP-weighted information content method. Landslide-affecting factors are categorized into three main types and 10 subtypes. The values of spatial characteristics of the landslide-affecting factors are obtained using GIS technology. The AHP method is then employed to compare the importance and weights of landslide-affecting factors. The information content method is used to convert the measured values of the landslide-affecting factors in the study area to data reflecting regional stability. The closeness of the relationships between the classification levels of each landslide-affecting factor and landslide occurrence are calculated. The LS of the study area is assessed using the proposed method. The LS assessment shows that high LS, relatively high LS, moderate LS, relatively low LS and low LS regions account for 21.3%, 20.6%, 20.1%, 11.7% and 26.3% of the study area, respectively. Finally, the accuracy of the LS assessment results is analyzed using two methods: the assessment, including an analysis of random landslide sites for the validating models; and the area below a receiver operating characteristic (ROC) curve of area under curve (AUC) value. The results show that the proportion of landslide sites in the regions of each LS level determined using the AHP-weighted information content method increases as the LS level increases, and that the accuracies of the AHP-weighted information content method were 8.1% and 5.7% higher than those of the AHP method and information content method, respectively.


Introduction
In geomorphology, a "landslide" is the movement of a mass of rock, debris or earth down a slope under the influence of gravity [1][2][3].To study landslides using geographic information system (GIS), it is first necessary to identify landslides based on their characteristics as deep-seated, including main scarps, debris, mounds and hollows, or shallow landslides.Shallow landslides are composed of scar, flow and deposit parts [4][5][6].Landslides, one of the main types of natural disasters, gravely threaten and harm people's lives, property and environments.Data show that from January 2004 to December 2016, a total of 55,997 people were killed in 4862 separate non-seismic landslide events around the world.The spatial distribution of landslides varies, with Asia representing the dominant geographical area [7].Landslide assessment [8][9][10] is of great importance for the prediction [11] and targeted prevention and control of landslides.Landslide assessments include susceptibility, hazard and risk assessments.Susceptibility assessments are the basis [12][13][14] for hazard and risk assessments.Susceptibility refers to the spatial possibility of occurrence of landslides of a specific type and volume.Hazard assessments determine the possibility of occurrence of a specific landslide in a given region within a certain period of time.A risk assessment adds relevant information from the relevant disaster-affected bodies, including their temporal and spatial probabilities, vulnerability and numbers (casualties and economic losses).Therefore, a landslide susceptibility (LS) assessment of a given area is the basis for a landslide assessment of the area.
Currently, there are two main types of LS assessment methods: assessments based on expert knowledge and experience [15,16], and mathematical and statistical methods [17,18].Methods based on expert knowledge and experience use qualitatively determined LS based on experts' knowledge of landslides, and such methods include the geological and geomorphological analysis method [19], the factor analysis method [20], the fuzzy comprehensive evaluation method [21][22][23] and the analytic hierarchy process (AHP) method [24][25][26].The geological and geomorphological analysis method comprehensively determines and qualitatively assesses the LS of the study area based on a combination of geological and geomorphological parameters of landslide formation obtained from field surveys.The factor analysis method selects landslide-affecting factors based on experts' knowledge and experience, and assigns weights to these factors based on their importance in determining an LS assessment index.The fuzzy comprehensive evaluation and AHP methods are both based on the expert rating method.Using the AHP method, multiple hazards (landslide, flood, seismic hazards, etc.) must account for the uncertainty in the weighting coefficients [27][28][29].The ratings are established based on the classification of assessment indices.Weights are assigned to landslide-affecting factors by means of membership, and on this basis the LS is assessed.Methods based on expert knowledge and experience methods are, to a certain extent, subjective, and all methods require a rating based on experts' knowledge and experience.Ratings by different experts often lead to different assessment results.
Mathematical and statistical analysis methods assume that there is no correlation between affecting factors.These methods perform statistical analysis primarily based on the available geological data for landslides, quantitatively assess various classifications of each landslide-affecting factor based on data and establish a correlation between the landslide-affecting factors of each classification and landslide formation [30,31].The main mathematical and statistical analysis methods include the information content method [32][33][34], the weight of evidence method [35,36], the logistic regression method [37][38][39] and the multivariate linear regression method [40][41][42].The information content method calculates the spatial characteristics of each landslide-affecting factor based on the information content representing the LS level, and uses these characteristics to measure the closeness of the relationship between a factor and landslide occurrence.The weight of evidence method calculates the posteriori probability based on available landslide records and, on that basis, determines which regions of the study area are susceptible to landslides, assessing LS using overlay analysis of relevant topographic information regarding landslides.The logistic regression and multivariate linear regression methods determine the probability of landslide occurrence by determining quantitative correlations between landslide-affecting factors and constructing a regression model.Other models, such as machine learning algorithms [43]-including the random forest (RF), boosted regression tree (BRT), classification and regression tree (CART) and general linear (GLM) models-have been applied in earth sciences.RF is a model-building strategy that provides estimators of the Bayes classifier [44].Using RFs, the influence of sensitivity and scaling issues in LS mapping can be studied, while indicating the unit (scale) and training process, which strongly influence classification accuracy and the prediction process [45].This study presents an LS assessment method based on geographic information system (GIS) technology combined with an AHP-weighted information content method.This method combines the advantages of expert knowledge and experience with mathematical and statistical analysis.

Information for the Study Area
The study area, southern Anhui, is prone to frequent landslides.Situated in eastern mainland China (29 • 31 -31 • N, 116 • 31 -119 • 45 E), the study area is under the jurisdiction of Anhui Province and encompasses a total area of 36,536 km 2 .Records show that 707 landslides have occurred in the study area in recent years.Landslide development in the study area is primarily controlled by such basic factors as geological structure, terrain and landforms and hydrometeorological conditions.Located within the Yangtze stratigraphic region, the study area has a complex geological structure with residual Neoarchean and Paleoproterozoic formations composing the deep crust, and primarily Mesoproterozoic metamorphic rock sequences as basement.Faults within the study area are mostly northeast-trending and nearly east-west-trending, which has a significant controlling effect on geomorphic and geological conditions.With complex terrain and landform conditions, the study area is home to high mountains, deep valleys and steep slopes, as well as a wide distribution of phyllite, sandy shale and granite bedrock that has undergone severe weathering.Topographically, the study area gradually increases in elevation from north to south.There are plains in the northern part of southern Anhui along the Yangtze River.However, mountains and hills account for more than 70% of the total area of southern Anhui.There are three mountain ranges within the study area that extend from southwest to northeast, including the Jiuhua, Huangshan and Tianmu ranges.A large number of basins and valleys, mostly approximately 200 m and a few below 100 m in elevation, are distributed between these three parallel mountain ranges.Due to its hydrometeorological conditions, the study area is prone to landslides.Additionally, the study area has a mild subtropical monsoon climate with a wide annual range of temperatures, moderate precipitation, abundant sunshine, a long frost-free season and prevailing easterlies.The water systems in southern Anhui are part of the Yangtze River Basin, which form a dense, water-rich network.As a result of the geological structure, terrain and landform and hydrological conditions, landslides in the study area are zonally distributed, spatially concentrated and uneven.There is a low landslide distribution density in alluvial valley plains and hilly regions below 200 m in elevation.Landslides in regions above 400 m but below 800 m in elevation in the Jiuhua, Huangshan and Tianmu Ranges are zonally distributed along concentrated fault zones.Landslides are concentrated and occur in groups in regions characterized by relatively soft rocks (e.g., phyllite and sandy shale).
LS data were extracted from the List of Potential Small-Scale Geological Disaster Sites in Anhui Province, which is published by the public Geological Survey and Environmental Monitoring Center of Anhui Province.Shallow landslides composed of scars, flows and deposits were used in this paper, as shown in Figure 1.
There are a total of 707 landslide sites in southern Anhui, as shown in Figure 2. Of these sites, 495 were used in training modelling, and 212 were used to validate the accuracy of the model.The information for the factors was obtained using the following GIS spatial analysis-based methods.Geological structure data (i.e., formation lithology and fault data) were obtained by vectorizing a 1:250,000 geological map.Additionally, 30 m digital elevation model (DEM) data from the United States Geological Survey (USGS) were used as terrain and landform data.Topographic elevation, slope, aspect and curvature data were extracted directly from DEM data.Vector data for first-and second-order streams extracted from a 1:50,000 topographic map were used as hydrological data, as shown in Table 1.In this study, by overlaying each landslide-affecting factor layer and the landslide site layer, the attribute values of the landslide sites in the factor layer were determined.On this basis, the landslide-affecting factors were classified (i.e., reclassified).The information content of each classification factor was calculated and correlated to various levels of the corresponding thematic map.The results generated by each thematic map were subjected to an overlay analysis based on the weight of each factor in the AHP model.The AHP-weighted information content was calculated by summing the product of the correlation weight of each landslide-affecting factor and its classification information content.By classifying the calculation results, an LS distribution map based on the weighted information content model was obtained.Finally, 212 landslide sites were used to examine the accuracy of the model.The landslide locations were then randomly divided into a ratio of 70/30 for the training and validation models.In this study, by overlaying each landslide-affecting factor layer and the landslide site layer, the attribute values of the landslide sites in the factor layer were determined.On this basis, the landslideaffecting factors were classified (i.e., reclassified).The information content of each classification factor was calculated and correlated to various levels of the corresponding thematic map.The results generated by each thematic map were subjected to an overlay analysis based on the weight of each factor in the AHP model.The AHP-weighted information content was calculated by summing the product of the correlation weight of each landslide-affecting factor and its classification information content.By classifying the calculation results, an LS distribution map based on the weighted information content model was obtained.Finally, 212 landslide sites were used to examine the accuracy of the model.The landslide locations were then randomly divided into a ratio of 70/30 for the training and validation models.

AHP-Weighted Information Content Method
In an LS assessment, the study area is divided into several types of homogenous regions, which are then ranked based on LS.The key to an LS assessment lies in the possible landslide-affecting factors [46][47][48].Therefore, the selection of landslide-affecting factors is particularly important.The occurrence of a landslide is the result of the combined action of multiple types of factors.Overall, these factors can be categorized into two types: internal factors (e.g., geologic structural factors,

AHP-Weighted Information Content Method
In an LS assessment, the study area is divided into several types of homogenous regions, which are then ranked based on LS.The key to an LS assessment lies in the possible landslide-affecting factors [46][47][48].Therefore, the selection of landslide-affecting factors is particularly important.The occurrence of a landslide is the result of the combined action of multiple types of factors.Overall, these factors can be categorized into two types: internal factors (e.g., geologic structural factors, terrain and landform factors and hydrological factors) and external factors (e.g., inducing factors such as rainfall, earthquakes and roads).This study considers LS to be caused by internal factors and anthropogenic factors.Specifically, the following four types (including 10 subtypes) of factors are considered: geology and structure (both lithology and distance from faults), terrain and landforms (including normalized difference vegetation index (NDVI), elevation, slope, aspect and profile curvature), hydrometeorological conditions (both distance from rivers and annual rainfall) and anthropogenic factors (distance from roads).
This study proposes a method for calculating AHP-weighted information content.In this method, the information content model is improved using the AHP method.The AHP-weighted information content is calculated by summing the product of the correlation weight of each landslide-affecting factor and its classification information content.The equation for this method is as follows: In Equation ( 1), β1-β10 are the respective weights of the 10 landslide-affecting factors (lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads) determined using the AHP model method; G1-G10 are the raster layers obtained using the GIS reclassification method, which is used to reclassify the spatial levels of susceptibility caused by the landslide-affecting factors that are determined using the information content model method; and L is the LS level of the region.Figure 3 shows the classification of LS obtained by the spatial overlay of the GIS raster data.
In Equation ( 1), β1-β10 are the respective weights of the 10 landslide-affecting factors (lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads) determined using the AHP model method; G1-G10 are the raster layers obtained using the GIS reclassification method, which is used to reclassify the spatial levels of susceptibility caused by the landslide-affecting factors that are determined using the information content model method; and L is the LS level of the region.Figure 3 shows the classification of LS obtained by the spatial overlay of the GIS raster data.
In this study, the importance of the landslide-affecting factors is determined by comparing any two factors using the AHP method, which is based on certain criteria, and then quantitatively calculated based on a predefined scale.A judgement matrix is formed with the results.On this basis, the weight of each landslide-affecting factor is calculated.Based on the actual conditions in the regions where landslides have occurred, the measured values (i.e., the parameters of the landslideaffecting factors) are converted to information contents that reflect regional stability.The closeness of the relationships between the classification level of each landslide-affecting factor and landslide occurrence is assessed by calculating the information content provided by the factor to the landslide event.When assessing LS, the information content method defines the landslide as the study object.Some landslide-affecting factors such as faults, lithology, elevation, slope and aspect are the assessment indices of the model.Relevant regional LS is predicted and classified based on the information contents contributed by these landslide-affecting factors and their comprehensive levels.
Whether or not a landslide will occur is related to the amount and quality of information on disaster-causing factors and is determined based on information contents.The theoretical model for calculating information contents is as follows: In Equation ( 2), I(Y, x x x ⋯ x ) is the information content provided by the combination of the disaster-causing factors x , x , x , ⋯ , x ; P(Y, x x x ⋯ x ) is the probability of landslide occurrence under the combined action of the disaster-causing factors x , x , x , ⋯ , x ; and P(Y) is the probability of landslide occurrence.In this study, the importance of the landslide-affecting factors is determined by comparing any two factors using the AHP method, which is based on certain criteria, and then quantitatively calculated based on a predefined scale.A judgement matrix is formed with the results.On this basis, the weight of each landslide-affecting factor is calculated.Based on the actual conditions in the regions where landslides have occurred, the measured values (i.e., the parameters of the landslide-affecting factors) are converted to information contents that reflect regional stability.The closeness of the relationships between the classification level of each landslide-affecting factor and landslide occurrence is assessed by calculating the information content provided by the factor to the landslide event.When assessing LS, the information content method defines the landslide as the study object.Some landslide-affecting factors such as faults, lithology, elevation, slope and aspect are the assessment indices of the model.Relevant regional LS is predicted and classified based on the information contents contributed by these landslide-affecting factors and their comprehensive levels.
Whether or not a landslide will occur is related to the amount and quality of information on disaster-causing factors and is determined based on information contents.The theoretical model for calculating information contents is as follows: In Equation ( 2), I(Y, x 1 x 2 x 3 • • • x n ) is the information content provided by the combination of the disaster-causing factors x 1 , x 2 , x 3 , • • • , x n ; P(Y, x 1 x 2 x 3 • • • x n ) is the probability of landslide occurrence under the combined action of the disaster-causing factors x 1 , x 2 , x 3 , • • • , x n ; and P(Y) is the probability of landslide occurrence.
In this study, feasible calculation equations supported by the disaster-causing factor samples are used as substitutes in the calculation.Based on the landslide conditions in the study area, Equations ( 4)-( 6) are used to calculate and comprehensively analyze the information contents of the landslide-affecting factors.
The information content provided by each factor to the landslide event K is calculated as follows: where P(x i /H) is the probability of occurrence of x i under the landslide distribution conditions and P(x i ) is the probability of occurrence of x i in the study area.
In practice, frequency can be used to estimate probability as follows: where S is the total area of the study area; N is the total area of landslides in the study area; S i is the number of disaster-causing factors x i present in the study area; and N i is the number of landslides distributed within a certain disaster-causing factor x i .The goal of the model is to calculate the relative distribution density N i /N S i /S of each type of landslide in each disaster-causing factor layer, following i=1 where the total information content Ii is used as the comprehensive index that measures the effects of the factor on landslide occurrence.The higher Ii, the more easily landslides can occur.G i is the reclassification of rasters of the landslide-affecting factors.Based on Ii, five LS levels are ultimately determined for each landslide-affecting factor, including high, relatively high, moderate, relatively low and low LS levels.A map showing the LS levels associated with each landslide-affecting factor is ultimately obtained using the GIS raster data reclassification method.
In this study, the effects of 10 factors (i.e., lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads) on landslide formation in the study area are analyzed using a GIS-based spatial analysis method.The study area is ultimately divided into five types of regions based on weighted information contents using the proposed calculation equations.These include high LS, relatively high LS, moderate LS, relatively low LS and low LS regions, thereby achieving an assessment of LS and division of the study area based on LS level.

AHP Method
The occurrence of landslides is comprehensively affected by geological structure, terrain and landform, as well as hydrometeorological conditions.Thus, the LS of the study area was assessed using 10 affecting factors (i.e., lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads).The weight of each assessment factor was determined using the AHP method after establishing an assessment index system.Based on the actual conditions in the study area combined with expert experience, a judgement matrix was constructed for the landslide-affecting factors in the study area, as shown in Table 2, which summarizes the normalized weights.The matrix (λmax = 0.703; consistency index (CI) = 0.089; random index (RI) = 1.48; consistency ratio (CR) = 0.57 < 1) passed the consistency check.

Information Content Method
In an LS assessment, the measured values of the factors reflecting regional stability are often converted to information contents, which are then used as quantitative indices for LS-based division.The involved calculation principle is as follows.The information prediction viewpoint states that the amount and quality of information obtained in the geological disaster prediction process is correlated with the actual conditions of the occurrence of the geological disaster; that is, information content can be used to assess and predict geological disasters.Based on the distribution characteristics and formation conditions of landslides in southern Anhui, as well as relatively extensively used assessment indices, the 10 landslide-affecting factors are reclassified.Figure 4 shows the classification of the landslide-affecting factors.

Information Content Method
In an LS assessment, the measured values of the factors reflecting regional stability are often converted to information contents, which are then used as quantitative indices for LS-based division.The involved calculation principle is as follows.The information prediction viewpoint states that the amount and quality of information obtained in the geological disaster prediction process is correlated with the actual conditions of the occurrence of the geological disaster; that is, information content can be used to assess and predict geological disasters.Based on the distribution characteristics and formation conditions of landslides in southern Anhui, as well as relatively extensively used assessment indices, the 10 landslide-affecting factors are reclassified.Figure 4 shows the classification of the landslide-affecting factors.(j) Distance from roads.Four types of buffer zones are created at distances of <50, 50-150, 150-300 and >300 m from the road.

AHP-Weighted Information Content Method
Landslides are primarily controlled by the four main factors (i.e., geological structure, terrain and landforms, hydrometeorological conditions and anthropogenic factors).The 10 landslide-affecting factors (i.e., lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads) are classified into 46 levels using the GIS reclassification function.Table 3 summarizes the number of landslides distributed within a certain disaster-causing factor (Ni) in the study area, the number of landslides distributed within a certain disaster-causing factor to total ratio (Ni/N), the area of landslides distributed within a certain disaster-causing factor to total ratio (Si/S), information content (I), weight, weighted information content for each level of each landslide-affecting factor and Rank.
The calculated information contents show that, in the information contents for the 46 assessment factors calculated in this study, the maximum and minimum AHP-weighted information contents are 0.3901 and −0.5661, respectively.Landslide development is concentrated in the relatively soft and hard rock regions of southern Anhui.There is a relatively high probability of landslide occurrence within 2 km of faults.There is a relatively high probability of landslide occurrence when NDVI is high.Elevation and slope have a notable controlling effect on landslide formation.Landslides mainly occur in regions with an elevation of 150-450 m and a slope of 5 • -20 • .There is a relatively high probability of landslide occurrence on concave slopes.The probability of landslide occurrence is within 1 km of a body of water.Precipitation also has a relatively notable controlling effect on landslide formation.Landslides mainly occur in regions with annual rainfall greater than 1900 mm.The probability of landslide occurrence is highest within 150 m of a body of water.

Results
The weighted information contents of the 10 landslide-affecting factors (i.e., lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers, annual rainfall and distance from roads) are the assessment indices for the LS-based division.The LS assessment of southern Anhui was obtained based on the information contents and weights in Table 3 using the following equation: where L is the weighted information content of the landslide-affecting factors for southern Anhui; β i is the AHP weight of the ith landslide-affecting factor; and G ij is the information content of the classification attribute j of the ith landslide-affecting factor (where I = 1, 2, 3, 4, 5, 6, 7, 8, 9 and 10, respectively correspond to lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers and annual rainfall).
With the weighted information content-based LS assessment model for southern Anhui, the information content of each single factor layer was multiplied by its weight (obtained using the AHP analysis method), then the single factor layers with the weighted information contents were overlain using the ArcGIS raster calculator tool, thereby generating an overlain weighted information content map for landslides, as shown in Figure 5.
southern Anhui was obtained based on the information contents and weights in Table 3 using the following equation: where L is the weighted information content of the landslide-affecting factors for southern Anhui; β is the AHP weight of the ith landslide-affecting factor; and G is the information content of the classification attribute j of the ith landslide-affecting factor (where I = 1, 2, 3, 4, 5, 6, 7, 8, 9 and 10, respectively correspond to lithology, distance from faults, NDVI, elevation, slope, aspect, profile curvature, distance from rivers and annual rainfall).
With the weighted information content-based LS assessment model for southern Anhui, the information content of each single factor layer was multiplied by its weight (obtained using the AHP analysis method), then the single factor layers with the weighted information contents were overlain using the ArcGIS raster calculator tool, thereby generating an overlain weighted information content map for landslides, as shown in Figure 5.  Here, the natural breaks method, which is commonly used in mathematical statistics, is employed to reclassify the weighted information content map for landslides in southern Anhui.The LS of the study area is categorized into five levels: low LS (−0.635, −0.376); relatively low LS (−0.376, −0.131); moderate LS (−0.131, 0.059); relatively high LS (0.059, 0.237); and high LS (0.237, 0.476).
Based on the landslide susceptibility assessment map for southern Anhui (Figure 4), the proportions of regions of southern Anhui with various LS levels (low LS; relatively low LS; moderate LS; relatively high LS; and high LS) can be determined using the GIS-based spatial statistical function, the results of which are as follows: high LS regions account for 21.3%; relatively high LS regions account for 20.6%; moderate LS regions account for 20.1%; relatively low LS regions account for 11.7%; and low LS regions account for 26.3%.
Following Table 2 and in conjunction with the list of weighted information contents for landslide sites in southern Anhui (Table 3), the characteristics of the relatively high LS and high LS regions of southern Anhui are determined by the following: (a) Lithology: Formation lithology is the material basis for landslide formation and activity.
Landslides within a certain region occur within a certain formation.Rock type and hardness affect slope stability and difficulty of surface erosion, and are two of the most important landslide-affecting factors and internal conditions for landslide formation.Landslides vary significantly between formations that differ in geologic age and lithology.Landslides in southern Anhui mostly occur in regions characterized by relatively soft rock.(b) Distance from faults: Structural fault zones, which develop on weak structural planes, contain fragmented rocks and are central locations for landslide occurrences.Small faults often become a controlling factor for landslide boundaries.In terms of the overall trend, landslides in southern Anhui are controlled by faults.The farther a region is from a fault, the lower its landslide density.Landslide numbers and area densities are significantly higher in regions within 2 km of a fault than in other regions.(c) NDVI: In a region with high vegetation coverage, soil erosion does not occur easily, terrain erosion is slow and the damage to slopes is relatively insignificant.Generally, NDVI is an important index that reflects vegetation cover.The higher the NDVI, the higher the vegetation growth.
The probability of landslide occurrence in southern Anhui is relatively high in regions with a high NDVI (0.80-0.86).(d) Elevation: Elevation reflects the terrain in a small region.However, landslides are significantly affected by elevation.Clearly, landslides in southern Anhui are mainly concentrated within an elevation range of 150-450 m.(e) Slope: Slope is an important factor that affects landslide development and directly affects slope stability.The landslides occur depending on the type of landslide and the materials of the slopes, as well as on triggers such as rainfall.As a slope increases, the shear stress on the slope increases, and the probability of landslide occurrence also increases.In southern Anhui, landslides are concentrated in regions with slopes of 5 • -25 • , and the rate of landslide occurrence in regions with slopes of 5 • -15 • is the highest, suggesting that regions with slopes of 5 • -15 • are favorable for landslide occurrence in southern Anhui.(f) Aspect: Slopes with different aspects differ in solar radiation intensity, which affects a multitude of factors, including temperature difference, evaporation capacity, vegetation cover and slope surface, as well as slope and landslide stability.Aspects that favor landslide occurrence in the study area are 157.5 • -202.5 • and 247.5 • -292.5 • .(g) Profile curvature: Profile curvature is categorized into three types-convex slopes (profile curvature > 0.5), concave slopes (profile curvature < −0.5) and linear slopes (−0.5 < profile curvature < 0.5).In contrast, precipitation can easily accumulate on a concave slope, which will soften the soil.Concave slopes in southern Anhui are the most prone to landslides.(h) Distance from rivers: Rivers stress the conditions of landslide masses and alter their previous stable state, which may allow landslides to occur more easily.Landslides are active in regions where scouring intensity is enhanced as a result of hydrostatic action or human disturbance.In southern Anhui, there is a high probability of landslide occurrence in regions within 0-50 m of a river.(i) Annual rainfall: Precipitation is the most important landslide-inducing factor in southern Anhui.
Heavy-precipitation storms and persistent rainfall can both induce landslides.In regions with loose soil, rainwater infiltrates the slopes, thereby inducing landslides.In southern Anhui, there is a relatively high probability of landslide occurrence in regions with annual rainfall greater than 1900 mm.(j) Distance from roads: It is obvious that the artificial and natural parts of slopes around roads are more sensitive to landslide manifestation [29].In southern Anhui, there is a high probability of landsliding in regions within 0-150 m of a road.

Discussion
The accuracy of the LS assessment results for the study area was comparatively analyzed using two methods: the 30% random landslide site for the validation models and the area under a receiver operating characteristic (ROC) curve (AUC).

Random Landslide Site for the Validating Models Assessment and Analysis
Based on the landslide susceptibility assessment results, 212 random landslide sites were introduced as validation data, and the proportions of random landslide sites in regions of each LS level were calculated using the AHP-weighted information content method, the AHP method and the information content method, respectively (as shown in Figure 6).
ndsliding in regions within 0-150 m of a road.cussion he accuracy of the LS assessment results for the study area was comparatively analyzed using ethods: the 30% random landslide site for the validation models and the area under a receiver ing characteristic (ROC) curve (AUC).dom landslide site for the validating models assessment and analysis ased on the landslide susceptibility assessment results, 212 random landslide sites were uced as validation data, and the proportions of random landslide sites in regions of each LS ere calculated using the AHP-weighted information content method, the AHP method and ormation content method, respectively (as shown in Figure 6).A comparison of the LS assessment results and the distribution of random landslide sites in study area shows that all of the LS assessment results correspond well to the distribution of lands sites (as showed in Table 4).However, as demonstrated in Figure 7, the proportion of landslide in the regions of each LS level determined using the AHP-weighted information content met increases as the LS level increases, showing a notable linear correlation.This suggests that th levels determined by the AHP-weighted information content method agree with the ac occurrence of landslides and that the assessment results are relatively reliable.A comparison of the LS assessment results and the distribution of random landslide sites in the study area shows that all of the LS assessment results correspond well to the distribution of landslide sites (as showed in Table 4).However, as demonstrated in Figure 7, the proportion of landslide sites in the regions of each LS level determined using the AHP-weighted information content method increases as the LS level increases, showing a notable linear correlation.This suggests that the LS levels determined by the AHP-weighted information content method agree with the actual occurrence of landslides and that the assessment results are relatively reliable.study area shows that all of the LS assessment results correspond well to the distribution of landslide sites (as showed in Table 4).However, as demonstrated in Figure 7, the proportion of landslide sites in the regions of each LS level determined using the AHP-weighted information content method increases as the LS level increases, showing a notable linear correlation.This suggests that the LS levels determined by the AHP-weighted information content method agree with the actual occurrence of landslides that the assessment results are relatively reliable.

AUC-Based Comparative Analysis
The performances of landslide assessments according to the three evaluation methods are shown in Table 5.The AHP-weighted information content method indicates the highest performance for classification of landslide areas (sensitivity = 75.94%),followed by the information content method (sensitivity = 75.49%)and the AHP method (sensitivity = 74.75%).For the classification of non-landslide areas, the highest performance is from the AHP-weighted information content method (specificity = 75.94%),followed by the information content method (specificity = 73.64%)and the AHP method (specificity = 71.68%).In general, all of the methods performed well in the classification of both landslide and non-landslide areas.The general performance of landslide methods using the ROC curve method [50,51] is shown in Figure 8.All landslide models performed well in the LS assessment (AUC > 0.8).Of the methods, the AHP-weighted information content method showed the highest performance (AUC = 0.893), followed by the information content method (AUC = 0.842) and the AHP method (AUC = 0.821), respectively.The three methods all produced relatively good assessment results.The accuracy of the AHP-weighted information content method was 8.1% and 5.7% higher than those of the AHP and information content methods, respectively, indicating that the proposed AHP-weighted information content method can produce better LS assessment results for the study area compared to the conventional AHP and information content methods.
The general performance of landslide methods using the ROC curve method [50,51] is shown in Figure 8.All landslide models performed well in the LS assessment (AUC > 0.8).Of the methods, the AHP-weighted information content method showed the highest performance (AUC = 0.893), followed by the information content method (AUC = 0.842) and the AHP method (AUC = 0.821), respectively.The three methods all produced relatively good assessment results.The accuracy of the AHP-weighted information content method was 8.1% and 5.7% higher than those of the AHP and information content methods, respectively, indicating that the proposed AHP-weighted information content method can produce better LS assessment results for the study area compared to the conventional AHP and information content methods.Hence, methods based on expert knowledge and experience, as well as mathematical and statistical analysis, both have advantages and disadvantages in assessing LS [50][51][52].Methods based on expert knowledge and experience can determine the correlations between landslide-affecting factors and the relationship between each landslide-affecting factor and landslide occurrence based on the knowledge of geological experts and their experience with landslides.However, expert judgements are predominantly non-qualitative and empirical, thereby affecting the objectivity of the assessment results and rendering the assessment results unverifiable.
Mathematical and statistical analysis methods quantitatively calculate the relationship between each single landslide-affecting factor and landslide occurrence based on geological and environmental data for landslides, thereby avoiding a subjective assignment of weights to the assessment indices.However, these methods overlook the complexity of geological causes of regional landslides and fail to satisfactorily describe the correlations between a number of landslide-affecting factors.The key to landslide susceptibility assessment lies in the correlations and relationships between landslide-affecting factors and LS level.Expert knowledge and experience can be used to assign a relatively satisfactory weight to each factor.However, due to a lack of data and numerical calculations, models constructed using methods based on expert knowledge and experience have low objectivity and universality.Mathematical and statistical analysis can reflect the values of the indices of each single factor at each classification level, but fail to consider the importance of each factor and the relationships between factors.As a result, there is a lack of understanding of the relationships between landslide-affecting factors in geological knowledge.
To overcome the deficiencies in the methods based on expert knowledge and experience and mathematical and statistical analysis methods used in current landslide research, some researchers have attempted to combine these two types of methods [53][54][55][56][57] by assessing LS using mathematical and statistical methods coupled with data calculation methods based on expert experience with landslides.However, LS assessment is a complex process and requires a comprehensive consideration of various factors.Combining methods based on expert knowledge and experience and mathematical and statistical analysis methods still faces some challenges, including (1) how to obtain the spatial characteristics of each landslide-affecting factor and calculate the difference between spatial characteristics at each classification level via GIS [58-60]; (2) how to determine the correlations between landslide-affecting factors and assign a weight to each landslide-affecting factor from the formation mechanism of landslides based on expert knowledge and experience; and (3) how to calculate the correlations between the numerical indices of each level of each landslide-affecting factor with the LS level from the specific conditions surrounding the occurrence of a landslide using mathematical and statistical methods [31].In view of these problems, this study presents an LS assessment method based on GIS technology combined with an AHP-weighted information content method.First, the spatial characteristics of each classification level of each landslide-affecting factor are calculated using the GIS spatial analysis method.Second, the correlation weight between landslide formation and each landslide-affecting factor is qualitatively calculated using the AHP method based on expert knowledge and experience.Third, the information content method is employed to quantitatively correlate the LS level with the numerical indices of each factor at each classification level from mathematical and statistical perspectives.Finally, by constructing an AHP-weighted information content method, the weight of each AHP factor is combined, relatively satisfactorily, with the numerical indices of each information content factor at each classification level.This method combines the advantages of expert knowledge and experience with mathematical and statistical analysis.Additionally, the LS of southern Anhui was also assessed, and the results were used to examine the applicability of the proposed method based on the AHP-weighted information content method to an assessment of LS.

Conclusions
The LS assessment results obtained using the AHP-weighted information content method agree well with actual landslide observations.The actual proportion of landslides in a region increases linearly with the LS level (low LS; relatively low LS; moderate LS; relatively high LS; and high LS) of the region and, theoretically, conforms to the classification principle.
To establish an assessment index system, a weighted information content method is used to determine each classification parameter of the model.The AHP method, which is used to obtain weighting coefficients, is combined with the information content method.The resultant method combines qualitative analysis and quantitative calculations.Based on the AUC values, the accuracy of the AHP-weighted information content method is 8.1% and 5.7% higher than those of the AHP and information content methods, respectively.As a result, the AHP-weighted information content method produces more reliable LS assessment results.
The GIS spatial analysis method is used to process and calculate 46 levels of spatial data for the 10 landslide-affecting factors in southern Anhui and their weighted information contents.Because of its advantages regarding data analysis, GIS spatial analysis can rapidly and accurately store and display spatial data and mathematical and statistical results using the proposed LS calculation equation.The proportion of the area of regions of each LS level to the total area of southern Anhui is determined using the GIS-based spatial statistical function and calculated as follows: high LS regions account for 21.3%; relatively high LS regions account for 20.6%; moderate LS regions account for 20.1%; relatively low LS regions account for 11.7%; and low LS regions account for 26.3%, as shown in Figure 9. linearly with the LS level (low LS; relatively low LS; moderate LS; relatively high LS; and high LS) of the region and, theoretically, conforms to the classification principle.
To establish an assessment index system, a weighted information content method is used to determine each classification parameter of the model.The AHP method, which is used to obtain weighting coefficients, is combined with the information content method.The resultant method combines qualitative analysis and quantitative calculations.Based on the AUC values, the accuracy of the AHP-weighted information content method is 8.1% and 5.7% higher than those of the AHP and information content methods, respectively.As a result, the AHP-weighted information content method produces more reliable LS assessment results.
The GIS spatial analysis method is used to process and calculate 46 levels of spatial data for the 10 landslide-affecting factors in southern Anhui and their weighted information contents.Because of its advantages regarding data analysis, GIS spatial analysis can rapidly and accurately store and display spatial data and mathematical and statistical results using the proposed LS calculation equation.The proportion of the area of regions of each LS level to the total area of southern Anhui is determined using the GIS-based spatial statistical function and calculated as follows: high LS regions account for 21.3%; relatively high LS regions account for 20.6%; moderate LS regions account for 20.1%; relatively low LS regions account for 11.7%; and low LS regions account for 26.3%, as shown in Figure 9.

Figure 1 .
Figure 1.Examples of landslide types in southern Anhui, China: (a) landslide in Xuanzhou county, (b) landslide in Yuexi county.

Figure 2 .
Figure 2. Distribution of landslide sites in southern Anhui, China: (a) elevation map based on a 30 m digital elevation model (DEM) from shuttle radar topography mission (STRM), (b) geologic map of the distribution of rocks of different ages.

Figure 2 .
Figure 2. Distribution of landslide sites in southern Anhui, China: (a) elevation map based on a 30 m digital elevation model (DEM) from shuttle radar topography mission (STRM), (b) geologic map of the distribution of rocks of different ages.

Figure 3 .
Figure 3. Landslide susceptibility (LS) assessment based on geographic information system (GIS)and the analytic hierarchy process (AHP)-weighted information content method.

Figure 3 .
Figure 3. Landslide susceptibility (LS) assessment based on geographic information system (GIS) and the analytic hierarchy process (AHP)-weighted information content method.

Figure 5 .
Figure 5. LS assessment of the study area based on GIS spatial analysis and the AHP-weighted information content method.

Figure 5 .
Figure 5. LS assessment of the study area based on GIS spatial analysis and the AHP-weighted information content method.

Figure 6 .
Figure 6.Validation of LS assessment of 212 randomly selected landslide sites for the validation models.(a) AHP-weighted information content method, (b) AHP method, (c) information content method.

Figure 6 .
Figure 6.Validation of LS assessment of 212 randomly selected landslide sites for the validation models.(a) AHP-weighted information content method, (b) AHP method, (c) information content method.

Figure 7 .
Figure 7. Analysis of the proportion of landslide sites in regions of different LS levels determined using the three methods.

Figure 8 .
Figure 8. Receiver operating characteristic (ROC)-based comparison of the LS assessment results and analysis of area under curve (AUC) data.

Figure 8 .
Figure 8. Receiver operating characteristic (ROC)-based comparison of the LS assessment results and analysis of area under curve (AUC) data.

Table 1 .
Research data introduction.

Table 2 .
AHP judgement matrix of the landslide-affecting factors.

Table 3 .
Calculation of the AHP-weighted information contents of the landslide-affecting factors.

Table 4 .
Comparison of the proportions of landslide sites determined using the three methods.

Table 4 .
Comparison of the proportions of landslide sites determined using the three methods.

Table 4 .
Comparison of the proportions of landslide sites determined using the three methods.

Table 5 .
Model performance of the three methods.