Next Article in Journal
Priority Branches for Ship Detection in Optical Remote Sensing Images
Previous Article in Journal
An Improved Boundary-Aware Perceptual Loss for Building Extraction from VHR Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Shallow Landslide Susceptibility Models Based on Artificial Neural Networks Considering the Factor Selection Method and Various Non-Linear Activation Functions

1
Department of Civil and Environmental Engineering, KAIST, Daejeon 34141, Korea
2
Department of Ocean Engineering, Pukyong National University, Pusan 48513, Korea
*
Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(7), 1194; https://doi.org/10.3390/rs12071194
Submission received: 19 February 2020 / Revised: 4 April 2020 / Accepted: 7 April 2020 / Published: 8 April 2020
(This article belongs to the Section Remote Sensing in Geology, Geomorphology and Hydrology)

Abstract

:
Landslide susceptibility mapping is well recognized as an essential element in supporting decision-making activities for preventing and mitigating landslide hazards as it provides information regarding locations where landslides are most likely to occur. The main purpose of this study is to produce a landslide susceptibility map of Mt. Umyeon in Korea using an artificial neural network (ANN) involving the factor selection method and various non-linear activation functions. A total of 151 historical landslide events and 20 predisposing factors consisting of Geographic Information System (GIS)-based morphological, hydrological, geological, and land cover datasets were constructed with a resolution of 5 x 5 m. The collected datasets were applied to information gain ratio analysis to confirm the predictive power and multicollinearity diagnosis to ensure the correlation of independence among the landslide predisposing factors. The best 11 predisposing factors that were selected in this study were randomly divided into a 70:30 ratio for training and validation datasets, which were used to produce ANN-based landslide susceptibility models. The ANN model used in this study had a multi-layer perceptron (MLP) structure consisting of an input layer, one hidden layer, and an output layer. In the output layer, the logistic sigmoid function was used to represent the result value within the range of 0 to 1, and six non-linear activation functions were used for the hidden layer. The performance of the landslide susceptibility models was evaluated using the receiver operating characteristic curve, Kappa index, and five statistical indices (sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV)) with the training dataset. In addition, the landslide susceptibility models were validated using the aforementioned measures with the validation dataset and were compared using the Friedman test to check the significant differences among the six developed models. The optimal number of neurons was determined based on the aforementioned performance evaluation and validation results. Overall, the model with the best performance was the MLP model with the logistic sigmoid activation function in the output layer and the hyperbolic tangent sigmoid activation function with five neurons in the hidden layer. The validation results of the best model showed a sensitivity of 82.61%, specificity of 78.26%, accuracy of 80.43%, PPV of 79.17%, NPV of 81.82%, a Kappa index of 0.609, and AUC of 0.879. The results of this study highlight the effectiveness of selecting an optimal MLP model structure for shallow landslide susceptibility mapping using an appropriate predisposing factor section method.

1. Introduction

Shallow landslides are one of the most common and frequent geo-disasters that occur in mountainous regions [1]. In most areas of Korea, where approximately 63% of the territory consists of mountainous regions, soil layers are generally less than 2–3 m in thickness with underlying bedrock [2]. In addition, the annual rainfall in the central region of Korea is approximately 1200–1500 mm, and more than half of the annual precipitation is concentrated during the months from July to September due to the influence of the Monsoon season. Due to these topographical and climate conditions, Korean mountains are regarded as regions that are susceptible to shallow landslides [3,4]. According to the statistics of the Korea Forest Service from 1976 to 2018, an average of 34 casualties and 395 ha of landslides occur annually. Considering such figures, there is a growing national interest in the development of proactive technologies for the prevention and mitigation of landslide hazards.
Landslide susceptibility mapping is well recognized as an essential element in supporting decision-making activities for disaster prevention and mitigation, as it provides information regarding landslide-prone areas. However, reliable spatial prediction of landslides remains a challenging task due to its complexity as it is affected by various internal factors (e.g., hydro-geotechnical properties, lithology, forestry, geological structure, topographic conditions) and external factors (e.g., rainfall, the melting of snow, earthquakes, volcanic eruptions) [5,6]. To resolve these problems, many studies have been conducted over several decades with the goal of developing high-performance-based landslide susceptibility models through various approaches, which can be divided into two categories: physically based methods and data-driven methods.
Physically based methods of landslide prediction [7,8] are generally expressed as safety factors of slope stability, which refers to the ratio of soil shear strength to the shear stress of potential sliding surfaces in the slope. Such methods do not require a historical inventory of landslides when developing susceptibility maps but require detailed geotechnical properties and geometric conditions. As such, physically based models are more practical for site-specific areas with homogeneous conditions [9], as it is expensive and time-consuming to build up a database for applications in large-scale areas [10,11].
Data-driven methods of landslide prediction [10,11,12] estimate potential landslides by analyzing and interpreting the relationship between historical landslide data and various predisposing factors through the means of statistical or machine learning techniques without physical processes. Therefore, historical landslide data and various factors related to landslide occurrence should be collected as the first step for the landslide susceptibility mapping [13]. Recent advances in data mining and soft computing have made it possible to easily link with Geographic Information System (GIS) platforms, enabling landslide susceptibility assessment over wide areas [14].
According to the literature review, artificial neural network (ANN) models have been reported as a suitable machine learning method for predicting non-linear and complex phenomena [15,16]. Such models have been widely applied for landslide susceptibility modeling [17,18,19,20]. In a study of applying an ANN-based susceptibility model, Vasu et al. [21] improved the predictive ability of the ANN by integrating a hybrid feature selection and an extreme learning machine. Tien Bui et al. [22] compared two training algorithms (Levenberg–Marquardt and Bayesian regularization network) and found that the latter algorithm was more robust and efficient. Lee et al. [23] showed that an ANN model performed better with the weights of each factor being determined compared to without determining the weighting. Ermini et al. [24] compared two architectures of ANN models (Multi-Layer Perceptron and Probabilistic Neural Network) and obtained slightly better results with the former architecture. Despite these efforts, there is still a multitude of considerations that should be accounted for when developing an optimal ANN model capable of high levels of performance [25,26], such as factor selection, the number of neurons and layers, and activation functions.
In this study, landslide susceptibility maps of Mount Umyeon were produced using ANN models with consideration of various model architectures. The main objective of this study is to determine the optimal structure of the ANN model considering the factor selection method and various activation functions for high-performance-based landslide susceptibility mapping. In the factor selection stage, information gain ratio and multicollinearity analysis were applied for the evaluation of predictive power and mutual exclusivity of the landslide predisposing factors. Once evaluated, the optimal architecture of the ANN model was selected with consideration of the number of neurons and various activation functions by evaluating model performance using receiver operating characteristics (ROCs), Kappa index, and various statistical evaluation measures. Finally, a non-parametric test (Friedman test) was conducted to compare the developed susceptibility models to confirm significant differences.

2. Study Area and Spatial Database

2.1. Description of the Study Area

The study area is Mount Umyeon, which is located in the southern part of Seoul Special City, Korea, between latitudes 37°27’00"N and 37°28’55"N and longitudes 126°59’02"E and 127°01’41"E, as shown in Figure 1. This area covers a surface area of approximately 5.1 km2, with the highest elevation being 293 m above sea level. The geological setting of this area is mainly composed of Pre-Cambrian banded biotite gneiss and granitic gneiss. The annual average precipitation of this area is 1450 mm. Extreme heavy rainfall from 26 July to 27 July in 2011 (two days of cumulative rainfall exceeding 365 mm, as shown in Figure 2) triggered approximately 151 shallow landslide events. Most of the landslides transformed into debris-flows, which flowed along the channel in the mountain, reaching cars, roads, and infrastructure. Sixteen casualties were reported, and more than 10 buildings were damaged by the debris, resulting in an economic loss of over 15 million USD. For additional detailed information on the study area, refer to the following papers: Yune et al. [27] and Jeong et al. [28].

2.2. Landslide Inventory

Collecting an accurate landslide inventory is the most important step in the development of reliable and efficient landslide susceptibility models. In this study, 51-cm-resolution digital orthographic images (Figure 3) provided by the National Geographic Information Institute (NGII) were used to identify locations of landslide initiation. One hundred and fifty-one shallow landslide locations were determined by comparing these images before and after the 2011 landslide events (Figure 3). A landslide inventory map was produced as feature points using ArcMap version 10.6.1 (Figure 1).

2.3. Landslide Predisposing Factors

Landslides occur due to complex interactions between various geo-environmental factors. A total of 20 landslide predisposing variables were selected based on abundant literature review and were categorized into four types (morphological, hydrological, geological, and land covers types) [12] as shown in Table 1 and Figure 4. The Digital Elevation Model (DEM), which was provided with a 1:5000 scale by NGII, is fundamental data that is converted into morphological and hydrological variables. Geological and land cover variables were obtained from a 1:25,000 scale forest soil map produced by the Korea Forest Service (KFS). All landslide predisposing candidates were constructed with a 5 × 5 m resolution using ArcMap version 10.6.1 (Esri, Redlands, CA, USA) and consisted of 239,280 grid cells. Frequency ratio (FR) analysis was performed to evaluate the relationships between landslide occurrence and the predisposing factors, as shown in Table A1.

2.3.1. Morphological Types

Many studies have shown that the occurrence of landslides is affected by morphological factors, such as elevation, slope, curvature, topographic ruggedness index (TRI), surface relief ratio (SRR), aspect, and site exposure index (SEI). In this study, elevation is in the range of 23.4–293 m. Approximately half of the shallow landslides occurred at the middle elevation of the mountain between 104–203 m, whereas fewer shallow landslides developed at lower elevations. Slope is well known as the most critical cause of landslide occurrence. The slope distribution ranges from 0° to 59°, and the number of landslide events increased with increasing slope angle, with no landslides occurring in the case of slope angles less than 13°. This is due to the shear stress of the soil being directly affected by the slope angle. Curvature is the rate of angular change that indicates the bending degree of a line or surface and affects the deceleration and acceleration of the flowing surface water. The values are continuous data: 0 for planar, negative for concave, and positive for convex. TRI describes the difference in elevation values between a center cell ( c 0 ) and others surrounding it [30]. It can be calculated using the following equation:
TRI = ( c i c 0 ) 2 ,
where c i is the elevation of each neighbor cell to c 0 . The results of the FR analysis showed that the occurrence of landslides increased as the TRI value increased (Table A1). SRR indicates rugosity considering the maximum, minimum, and mean elevation of each grid [31]. It can be calculated as follows:
SRR = z m e a n z min z max z min ,
where z max , z min , and z m e a n are the maximum, minimum, and mean elevations, respectively. Aspect is the main direction of the slope to which the pixel of the DEM data belongs, expressed in angular units (e.g., 0° in the north direction and 180° in the south direction). The angle is measured from the north to the slope in the clockwise direction. SEI is the rescaling aspect to a north/south axis by multiplying the slope. It is described as the relative degree of sun exposure from coolest to warmest locations according to the following equation:
SEI = s l o p e · cos ( π · a s p e c t 180 180 )
Aspect and SEI are important in determining soil water content and factors affecting vegetation in relation to sun exposure [32].

2.3.2. Hydrological Types

In this study, we considered topographic wetness index (TWI), sediment transport index (STI), stream power index (SPI), and distance from stream as hydrological factors for evaluating landslide susceptibility. TWI is frequently used to quantify soil moisture, which greatly influences landslide occurrence. This is due to the potential decrease in soil strength caused by increased pore water pressure, which is the main cause of landslide initiation [33]. TWI is given by the equation
TWI = ln ( α tan β ) ,
where α is the local upslope draining area that indicates the amount of water flowing through a certain point, and tan β is the local slope [34]. STI is a measure of the sedimentation transport capacity that represents the possibility of potential landslides. STI can be calculated by the equation
STI = ( A s 22.13 ) 0.6 ( sin β 0.0896 ) 1.3 ,
where A s is the specific catchment area, and β is the local slope angle in degrees [35]. SPI is an indicator of the erosive power of flowing water and increases with surge flowing caused by large upslope draining areas and steep slopes. SPI can be calculated using the equation
SPI = A s tan β ,
where A s is the specific catchment area, and β is the local slope angle in degrees [36]. The results of FR analysis showed that the trend of landslide occurrence increased as the values of STI and SPI increased within the range of 0 to 100 (Table A1). Distance from stream was used to assess landslide susceptibility as it may influence rainfall drainage and runoff processes. It was measured according to the Euclidean distance method in ArcGIS 10.6.1, although the results indicated that no landslides occurred more than 120 m away from the stream.

2.3.3. Geological Types

Geological features play an important role in landslide susceptibility as such factors can involve a variety of soil and rock properties such as strength, structure, fracture, and composition. In this study, lithology and weathering level were selected as geological features, which are seldom used in the development of landslide susceptibility maps [37]. The lithology of this area consists of metamorphic (75%) and sedimentary (24%) rocks. The weathering level in this area is high (46%) or moderate (53%). The results of FR analysis indicate that landslides occurred predominantly in metamorphic areas and areas with a high level of weathering (Table A1).

2.3.4. Land Cover Types

Land cover factors have also been recognized as significant causes that affect slope instability in landslide-prone areas [38,39]. In this study, effective soil depth, soil texture, soil type, soil density, forest type, forest density, and distance from roads were chosen as seven candidate predisposing factors of landslide susceptibility. The effective soil depth ranged from 1 cm to 69 cm, and shallower soil depths induced higher frequencies of landslide occurrence (Table A1). This is due to the fact that, assuming the same permeability, thinner soil layers require a shorter time to become saturated soil layers: then, the saturated soil causes slope instability due to the reduced shear stress. The soil texture of the study area consists of silty loam (38%) and sandy loam (61%), and the soil type is composed of dry brown forest soil (49%), slight dry brown soil (45%), and moderately mist brown forest soil (5%). The results of the FR analysis showed that landslides occurred mainly in sandy loam and dry brown forest soil (Table A1). The soil density of the study area was mainly loose density (79%), but medium dense soil (11%) showed higher FR values (=1.35) compared to loose dense areas (FR=0.97). Forest type and density are important factors as trees can affect slope stability due to root strength, water adsorption, and tree weight [40]. The forest type in this area consists of coniferous (2%), broadleaf (90%), and mixed forest (6%) with mostly dense forest areas. The distance from road was considered a landslide predisposing candidate to evaluate landslide susceptibility as man-made roads in mountains may be a potential cause for slope instability.

3. Methodology

A landslide susceptibility analysis was performed through seven main processes as follows: (1) collection of historical landslide data; (2) construction of landslide predisposing factors; (3) preparation of training and validation datasets; (4) application of a filter method to select suitable database subsets; (5) development of landslide susceptibility models; (6) validation and comparison of landslide susceptibility models; (7) selection of the best performing model. Steps (1) and (2) were described earlier in Section 2, and the remaining steps are described below.

3.1. Preparation of Training and Validation Datasets

Supervised learning, including the ANN model, requires the preparation and preprocessing of input-target pairs. Targets in landslide susceptibility analysis methods are generally classified into two classes: landslide occurrence (assigned as “1”) and non-landslide occurrence (assigned as “0”). A total of 20 landslide predisposing factors were considered as input variables in this study (Figure 4). The continuous variables were rescaled in the range of 0.01 to 0.99 using the min-max normalization formula [22] as follows:
z = x min ( x ) max ( x ) min ( x ) ( U L ) + L ,
where z is the normalized value, x is the original value, and U and L are the upper and lower normalization bounds, respectively. The nominal variables were calculated as frequency ratios (Table A1) and were also normalized using the same method as mentioned earlier.
Preprocessed input-target data for landslide susceptibility modeling should be divided into training and validation datasets. The training dataset is used for model generation, whereas the validation dataset (not the data used for training) is used to validate the developed models and confirm the predictive ability and accuracy of each model. Although there are no exact standards for dividing the two data subsets, this study divided the training and validation subsets according to a 70:30 ratio. Of the total 151 landslide points, 106 and 45 landslide points were randomly split between the training and validation subsets, respectively (Figure 1). The same number and ratio of non-landslide points were also randomly selected from areas safe from landslides. Finally, the values of the 20 landslide predisposing factors were extracted to build the training and validation datasets from the landslide and non-landslide points.

3.2. Landslide Predisposing Factor Analysis

The landslide predisposing factor analysis was conducted to select suitable factors for producing landslide susceptibility maps, which is known to be useful in constructing and simplifying machine learning models [41]. Among the various factor selection techniques, we used the filter method, which is an approach to evaluate the relationship between input variables through mathematical and statistical measures. In this study, information gain ratio analysis and multicollinearity analysis were performed among the filter-based factor selection methods.

3.2.1. Information Gain Ratio Analysis

Information gain ratio (IGR) is widely used as a factor selection technique of landslide predisposing factors when quantifying importance based on information theory [13,42,43]. Landslide predisposing factors with high IGR values indicate high predictive power for landslide susceptibility modeling. On the contrary, landslide predisposing factors with low IGR values exhibit low predictive power, which may adversely affect the performance of the susceptibility model. Therefore, it is necessary to select appropriate factors for high performance-based susceptibility model generation.
The basic principle of IGR is as follows. Let a training dataset S be a set consisting of n input variables, and n( C i , S) is the number of variables in S belonging to the class C i (landslide, non-landslide). The important quantity (referred to as information or entropy) of S is given as follows:
I n f o ( S ) = i = 1 2 n ( C i , S ) | S | log 2 n ( C i , S ) | S |
The amount of information based on the division of S into subsets ( S 1 ,   S 2 ,…, S m ) regarding the landslide predisposing factor L is calculated as follows:
I n f o ( S , L ) = j = 1 m S j | S | I n f o ( S )
Then, the IGR for landslide predisposing factor L is estimated as follows:
I G R ( S , L ) = I n f o ( S ) I n f o ( S , L ) S p l i t I n f o ( S , L ) ,
where SplitInfo represents the entropy generated by splitting the training data S into m subsets. SplitInfo is defined as
S p l i t I n f o ( S , L ) = j = 1 m | S j | | S | log 2 | S j | | S |

3.2.2. Multicollinearity Analysis

Multicollinearity refers to a phenomenon in which certain predisposing factors have a strong correlation with other factors and thus have a negative effect on model accuracy and quality. To resolve this problem, it is necessary to analyze the correlation between predisposing factors through multicollinearity diagnosis when developing statistical or machine learning models. There are several methods of detecting multicollinearity such as the variance inflation factor (VIF) and the tolerance analysis [44], the Farrar–Glauber test [45], the condition number test [46], and the bivariate correlation analysis. In this study, we used Pearson’s correlation, VIF, and the tolerance analysis, which are commonly considered for multicollinearity diagnosis in landslide studies [47,48].
Pearson’s correlation method was used in this study to confirm the correlation between individual landslide predisposing factors. Pearson’s correlation coefficient (r) is defined as the covariance of two factors divided by the product of the standard deviations, as follows:
r X Y = i = 1 n ( X i X ¯ ) ( Y i Y ¯ ) i = 1 n ( X i X ¯ ) 2 i = 1 n ( Y i Y ¯ ) 2 ,
where X and Y are the landslide predisposing factors, and X ¯ and Y ¯ are the means of X and Y, respectively. An r value higher than 0.7 indicates a high correlation between X and Y, whereas an r value lower than 0.3 indicates a low correlation.
VIF measures the variation of standard deviation and is increased due to collinearity between the landslide predisposing factors. VIF is calculated as follow:
VIF = 1 / tolerance = 1 1 R 2 ,
where R 2 is the coefficient of determination. The magnitude of multicollinearity can be analyzed through the size of VIF and tolerance. The cutoff values of VIF and the tolerance in this study were 10 and 0.1, respectively.

3.3. Landslide Susceptibility Analysis

3.3.1. Artificial Neural Networks

Biological brains store and learn information by sending and receiving signals through synapses that connect neurons to nerve cells. An artificial neural network (ANN) is an algorithm created to mimic how information processing is performed by the human brain, which performs complex computations by connecting multiple neurons. In this study, the multi-layer perceptron (MLP) model, which is the most widely used ANN model in landslide studies [13,18,22,49], was used to estimate the non-linear relationships between shallow landslides and the predisposing factors. The MLP model generally consists of an input layer, one or more hidden layers, an output layer, and the connection of neurons, as illustrated in Figure 5.
The input layer of the network, or the first layer, provides information from the outside to the network. The number of neurons in the input layer equals to the number of landslide predisposing factors. The neurons pass the input data on to new neurons in the hidden layer. In the hidden and output layers, the net is calculated as the sum of the products of each weight and bias, and then the output value is calculated by inputting the net value to the activation functions, as described in Equations (14) and (15):
y j = f ( n e t j ) = f ( i = 1 n w j i x i + b i ) ,
z o u t = f ( n e t o u t ) = f ( j = 1 m w j y j + b j ) ,
where x i is the input value,   y i and z o u t are the output values of the hidden and output layers, respectively, w j i and w j are the synaptic weights, b i and b j are the biases, n and m are the number of neurons in the input and hidden layers, respectively, and f is an activation function, such as linear and non-linear functions. The complexity of the model depends on the number of neurons in the hidden layer, which is determined by the training and testing results of the MLP model.
The learning procedure of MLP is divided into two main processes: i) feed-forward and ii) back-propagation. For the feed-forward phase, the input value is propagated to the output layer and all weights in the network are randomly assigned, resulting in predictive value. In the subsequent back-propagation phase, the weights are updated to minimize the difference between the predicted value and actual value of the network through the gradient descent method. This process is repeated until a mean square error (MSE) value reaches a certain threshold (less than 0.01). In this study, the ANN model was trained using Bayesian regularization [50,51], which is commonly used for error back-propagation algorithms [22,49]. Initial training parameters were set to the default values of MATLAB version R2017a.

3.3.2. Activation Functions

The role of the activation function is to add non-linearity to the network by deciding whether a neuron should be activated or not, which makes ANN models capable of learning and performing complex phenomena. Therefore, selecting the appropriate activation function is an essential task involved in predicting complex phenomena. As shown in Figure 5, in the MLP model, the activation function is used in the hidden and output layers. In this study, we used one hidden layer structure for a simple comparison between activation functions, so that a total of two activation functions are applied to each of the hidden and output layers.
There are several types of activation functions, each of which has its own various advantages and limitations. So far, no activation function stands out as having the best performance overall other functions, and thus, activation functions need to be selected based on how well it matches the characteristics of the implemented model. Activation functions can be categorized into two types: i) linear and ii) non-linear. A linear activation function is a function in which the output is the product of an input value multiplied by a certain constant value, described in Equation (16),
f ( x ) = c x ,
where x is the input value and c is a constant.
In contrast, there are several types of non-linear activation functions. This study utilizes functions that are commonly used, as follows: i) the symmetric hard limit (Hard-lims) function, ii) the symmetric saturating linear (Sat-lins) function, iii) the radial basis (Rad-bas) function, iv) the logistic sigmoid (Log-sig) function, v) the rectified linear unit (ReLU) function, and vi) the hyperbolic tangent sigmoid (Tan-sig) activation function.
The Hard-lims function is a binary function that sends a signal of 1 for positive and –1 for negative values, which can be expressed as follows:
f ( x ) = { 1 , x 0 1 , x < 0
The Sat-lins function has identical outputs for inputs in the range of −1 to 1, an output value of 1 if the input is greater than 1, and an output value of −1 if the input is less than −1, which can be expressed as follows:
f ( x ) = { 1 , x > 1 x , 1 x 1 1 , x < 0
The Rad-bas function is expressed in Equation (19). A Gaussian function is generally used for the Radbas function. It has an output value that increases or decreases monotonically with distance from the center point.
f ( x ) = exp ( ( x c ) 2 r 2 ) ,
where c is the center and r is the radius. The ReLU function has an identity for all positive values, and zero for all negative values, which can be expressed as follows:
f ( x ) = { x , x 0 0 , x < 0
The Log-sigmoid function is given Equation (21), and outputs values ranging between 0 and 1. Larger inputs converge to 1, and smaller inputs converge to 0, hence the output is not zero-centered.
f ( x ) = 1 1 + e x
The tan-sigmoid function is given by Equation (22). It has an output range from −1 to 1. The shape of the tan-sigmoid function is loosely similar to log-sigmoid as an S-shape, but the tan-sigmoid function is centered on zero.
f ( x ) = e x e x e x + e x
Users of MLP models must decide on which activation functions to use for the hidden and output layers. In this study, the activation functions of the hidden layer were the six non-linear types of functions (Hard-lims, Sat-lins, Rad-bas, ReLU, Log-sig, and Tan-sig). For the output layer, the Log-sig activation function, which is frequently used for binary classification, was used to express the result between 0 and 1. In this case, the closer the result is to 1, the higher the probability of a landslide. In contrast, the closer the result is to 0, the lower the probability of a landslide. In the case of the linear activation function, as it is mainly used in the output layer to develop regression models, it was excluded from this study.

3.4. Assessment of Model Performance

3.4.1. Statistical Evaluation Measures

The performance of landslide susceptibility models can be evaluated using various statistical measures. In this study, we used sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV), and Kappa index to evaluate the performance of the models. Sensitivity measures the proportion of landslide pixels that are correctly identified as landslide occurrences. Specificity measures the proportion of non-landslide pixels that are correctly identified as non-landslide occurrences. Accuracy measures the proportion of landslide and non-landslide pixels that are correctly identified. Positive predictive value is the probability that predicted landslide pixels have actual landslide occurrences. Negative predictive value is the probability that predictive non-landslide pixels have actual non-landslide occurrences. The Kappa index, which is generally regarded as a reliability measure, is used to measure the quality of classification models [52]. In other words, it can be used to evaluate how effectively a landslide susceptibility model classifies landslide pixels [53]. According to Landis and Koch [54], the degree to which the observed and predicted values are in agreement in terms of the Kappa index are summarized as follows: 0.81–1.00 indicates near-perfect agreement, 0.61–0.80 indicates substantial agreement, 0.41–0.60 indicates moderate agreement, 0.21–0.40 indicates fair agreement, 0.00–0.20 indicates slight agreement, and values lower than 0 indicate poor agreement. These statistical measures can be calculated from a confusion matrix and are expressed as follows:
Sensitivity = T P T P + F N ,
Specificity = T N T N + F P ,
Accuracy = T P + T N T P + N T + F P + F N
P P V = T P F P + T P
N P V = T N F N + T N
Kappa   index = p o b s e r v e d p e x p e c t e d 1 p e x p e c t e d
where TP (true positive) and TN (true negative) are the numbers of correctly identified pixels, whereas FP (false positive) and FN (false negative) are the numbers of pixels that were incorrectly identified. pobserved is identical to accuracy, and pexpected, which is expressed by Equation (29), is the expected proportion of landslide and non-landslide pixels that are in agreement.
p e x p e c t e d = ( T P + F P A l l ) ( T P + F N A l l ) + ( F N + T N A l l ) ( F P + T N A l l ) ,
where All is the summation of the number of correctly identified pixels and incorrectly identified pixels, which is calculated as TP+TN+FN+FP.

3.4.2. Receiver Operating Characteristic Curve

The receiver operating characteristic (ROC) curve is widely used to confirm the performance of landslide susceptibility models. The curve is a plot of the true positive rate (=Sensitivity) against the false positive rate (=1–Specificity) with various cut-off settings. The area under the ROC curve (AUC) can be used for quantitative comparisons of model performance. The closer the AUC value is to 1, the higher the capability of the model to distinguish between landslides and non-landslides, whereas when AUC is 0.5, the model is said to have no capability in separating landslides and non-landslides.

3.4.3. Non-Parametric Statistical Test

A non-parametric test is a statistical method of confirming statistical significance based on given data without assuming the probability distribution of the parameters. Such tests were used to confirm statistically significant differences among the developed landslide susceptibility models. In this study, the Friedman test was considered to compare the performance of the landslide susceptibility models. This non-parametric test is based on the null hypothesis, in which no significant differences exist between the performances of the landslide susceptibility models. If the significant probability (p-value), which is the probability of obtaining test results with significant differences, is greater than a certain value (5%, 0.05), the null hypothesis is accepted. Conversely, if the p-value is lower than 0.05, then the null hypothesis is rejected.

4. Results

4.1. Landslide Predisposing Factor Analysis

4.1.1. Predictive Ability Analysis

In this study, IGR analysis of the landslide predisposing factors, which is the first factor-selecting step, was conducted to check the predictive ability of each factor. The results show that slope is the most significant landslide predisposing factor with an average merit value of 0.38, followed by STI (0.35), TRI (0.34), SPI (0.30), SRR (0.30), aspect (0.25), TWI (0.23), soil depth (0.23), soil type (0.20), curvature (0.20), lithology (0.15), SEI (0.11), and elevation (0.10), as shown in Figure 6. The remaining seven predisposing factors (forest density, forest type, soil texture, soil density, weathering, distance from stream, and distance from road) were analyzed to be infinitesimal landslide predisposing factors with an average merit value of 0. Therefore, in this study, 13 landslide predisposing factors with an average merit value greater than 0 were selected among the 20 factors, and the remaining seven factors were eliminated.

4.1.2. Multicollinearity Diagnostic Analysis

Selected 13 landslide predisposing factors were performed for multicollinearity using Pearson’s correlation, VIF and tolerance analysis. In the first step, Pearson’s correlation analysis was conducted between pairs of landslide predisposing factors. The results show that Elevation with TRI (0.76), Slope with TRI (0.80), and STI with SPI (0.83) pairs have a high correlation value, as shown in Table 2. If these values exceed 0.7, it can be suspected that there is multicollinearity. The TRI represents the ruggedness of the terrain through the altitude difference of neighboring cells, so it can be explained that it correlates with the elevation and the slope. STI and SPI are also calculated based on the upslope contributing area and the slope value, which explains the high correlation between them. In these cases, TRI and SPI variables with relatively lower average merit values were removed from the landslide predisposing factor candidates.
The next step for selecting landslide predisposing factors was to analyze VIF and tolerance. The result shows that the highest value of VIF is 5.426 and the lowest value of tolerance 0.184, as shown in Table 3. These values satisfied critical thresholds, which are VIF > 10 or tolerance < 0.1, which represent no multicollinearity among the 11 landslide predisposing factors. Finally, the best 11 landslide predisposing factors were selected through factor selection based on IGR and multicollinearity analysis.

4.2. Landslide Susceptibility Modeling, Validation, and Comparison

4.2.1. Selecting the Best Number of the Neurons in the Hidden Layer

Using the 11 selected predisposing factors, MLP models consisting of Hard-lims, Sat-lins, Rad-bas, Log-sig, and Tan-sig activation functions for the hidden layer with the Log-sig activation function for the output layer were produced using the training dataset. To determine the optimal number of neurons in the hidden layer, performance evaluation was carried out using classification accuracy with the training and validation datasets. In addition, the Kappa index was determined with the validation dataset. The results of the Hard-lims model showed that as the number of neurons increased, the overall classification accuracy and Kappa index increased, but converged or decreased to a certain value beyond a certain number of neurons. In the case of the Sat-lins and ReLU models, the accuracy and Kappa index tended to rise and fall slightly without significant change as the number of neurons increased. The results of the Rad-bas, Log-sig, and Tan-sig models showed that, as the number of neurons increased, the accuracy with the training dataset increased. On the other hand, the accuracy and Kappa index with the validation dataset decreased or fluctuated beyond a certain number of neurons. The results for the optimal number of neurons in the hidden layer (Figure 7) were as follows: eight neurons for the Hard-lims model (Figure 7a), six neurons for the Sat-lins model (Figure 7b), three neurons for the Rad-bas model (Figure 7c), six neurons for the ReLU model (Figure 7d), four neurons for the Log-sig model (Figure 7e), and five neurons for the Tan-sig model (Figure 7f).

4.2.2. Evaluation of the Model Performance

Performance evaluation of the landslide susceptibility models was performed based on the training data and the optimal number of neurons determined in the previous section. Based on the results of the model evaluation (Table 4), the Tan-sig model exhibited the highest results for overall statistical evaluation indices, whereas the Hard-lims model exhibited the lowest. The Tan-sig model had a sensitivity of 92.38%, which indicates that 92.38% of the landslide pixels were correctly identified as landslide occurrences, followed by the Log-sig model (90.48%), the ReLU model (89.52%), the Rad-bas model (88.57%), the Sat-lins model (86.67%), and the Hard-lims model (68.57%). In addition, the Tangent-sig model exhibited the highest specificity (87.62%), indicating that 87.62% of the non-landslide pixels were correctly identified as non-landslide occurrences, followed by the Log-sig model (84.76%), the Rad-bas model (79.05%), the ReLU model (74.29%), the Hard-lims model (72.38%), and Sat-lins model (68.57%). The highest accuracy value was 90.00% with the Tan-sig model, which indicates that 90.00% of the landslide and non-landslide pixels were correctly identified.
The highest positive predictive value was 88.18% with the Tan-sig model, which indicates an 88.18% chance of predictive landslide pixels undergoing actual landslide occurrences. This result is followed by a value of 85.59% with the Log-sig model, 80.87% with the Rad-bas model, 77.69% with the ReLU model, 73.39% with the Sat-lins model, and 71.29% with the Hard-lims model. The model with the highest negative predictive value is the Tan-sig with a 92.00% chance of predictive landslide pixels having actual non-landslide occurrences, followed by the Log-sig model with 89.90%, the ReLU model with 87.64%, the Rad-bas model with 87.37%, the Sat-lins model with 83.72% and the Hard-lims model with 69.72%.
The Kappa index values ranged from 0.410 to 0.800 for the six models, indicating that the strength of agreement between the observed and the predicted values of the model was moderate for the Hard-lims model (0.410) and the Sat-lins model (0.552), and substantial for the ReLU model (0.638), the Rad-bas model (0.676), the Log-sig model (0.752), and the Tan-sig model (0.800). All six models had high AUC values, which represents the capability of distinguishing between landslide and non-landslide occurrences; among them, the Tan-sig model had the highest AUC value of 0.968.

4.2.3. Validation of the Model Performance

The results of landslide susceptibility model performance were validated using the statistical evaluation measures based on the validation dataset, as shown in Table 5. The results showed that the Tan-sig model had the highest performance in terms of the overall evaluation indices. The Tan-sig model exhibited the highest values of sensitivity (82.61%), specificity (78.26%), and accuracy (80.43%), which means 82.61% of the landslide pixels were correctly identified as landslide occurrences, 78.26% of the non-landslide pixels were correctly identified as non-landslide occurrences, and 80.43% of the landslide and non-landslide pixels were correctly identified. In addition, the Tan-sig model had the highest positive predictive value and negative predictive value of 79.19% and 81.82%, respectively, indicating a 79.19% chance of predictive landslide pixels having actual landslide occurrences and an 81.82% chance of predictive landslide pixels having actual non-landslide occurrences. The Hard-lims model had the lowest values of all statistical measures due to the relatively low values of true positive and true negative and relatively high values of a false positive and false negative.
The Kappa index values ranged from 0.304 to 0.609 for the six models, indicating that the strength of agreement between the observed and predicted values of the models was fair for the Hard-lims model (0.304), moderate for the Sat-lins model (0.478), the Rad-bas model (0.478), the ReLU model (0.522), and the Log-sig model (0.565), and substantial for the Tan-sig model (0.609). The highest value of AUC was 0.879 with the Tan-sig model, which indicates that the model was the most capable of distinguishing between landslide and non-landslide occurrences, followed by the Log-sig model (0.877), the ReLU model (0.843), the Rad-bas model (0.821), the Sat-lins model (0.815), and the Hard-lims model (0.781). The statistical indices evaluated using the training dataset were higher than the indices evaluated using the validation dataset.

4.2.4. Comparison of the Model Performance

The non-parametric Friedman test with a p-value threshold of 5% was performed to compare the performances of the landslide susceptibility models. The results indicated that the p-values were lower than 0.05; therefore, the null hypothesis is rejected, which indicates the existence of statistically significant differences between the performances of the landslide susceptibility models. The results of the six landslide susceptibility models from the Friedman test are shown in Table 6.

4.2.5. Production of Landslide Susceptibility Maps

All six models were used to create landslide susceptibility maps for the study area. Landslide susceptibility indices (LSI) were divided into five levels as follows: Very High, High, Moderate, Low, and Very Low. The threshold for differentiating the occurrence of landslides and non-landslides was set as an LSI value of 0.5. The intervals of each class were then set as follows: Very High (LSI 0.95), High (0.95 > LSI 0.85), Moderate (0.85 > LSI 0.70), Low (0.79 > LSI 0.50), or Very Low (LSI < 0.50).
Landslide susceptibility maps were produced based on various activation functions (Figure 8). Table 7 shows the calculated percentages of historical landslides according to susceptibility class. The results showed that the Tan-sig model had the following percentages: 35.8% for Very High, 22.5% for High, 31.1% for Moderate, 4.6% for Low, and 6% for Very Low. In addition, the Tan-sig model showed that 89.4% of all historical landslides ranged from the Very High to Moderate classes.
Landslide density [55], which is defined as a ratio between the percentage of historical landslide pixels (PL) and the percentage of all areas (Pall) on the map for a given susceptibility class, is calculated by the following equation:
Landslide   Density = P L P a l l
The highest value of landslide density was 4.38 for the Very High susceptible class of the Tan-sig model. All landslide density results are summarized in Table 8.

5. Discussion

Landslide susceptibility mapping is an essential task in the determination of landslide-prone areas and is well recognized as an important step in the prevention and mitigation of landslide hazards. Many researchers have utilized ANN models to develop landslide susceptibility models [15,16,17,18,19,20,21,22]. Despite such attempts, there is still a multitude of considerations involved in determining the optimal structure of a high performance-based ANN model, such as landslide predisposing factor selection, the number of neurons in the hidden layer, and the activation function.
The first important step in developing a landslide susceptibility map involves building a reliable database of input–output pairs as it can control the performance of the susceptibility model. In this study, a landslide inventory was constructed in the form of a feature point with 5 m resolution at the center of the source area. The inventory is able to represent the overall morphological, hydrological, geological, and land cover characteristics of the study area as the landslides that occurred in the study area were shallow and translational or slightly rotational types. For other types of landslides, such as deep failures, it would be more suitable to construct an inventory in the form of feature polygons.
A total of 20 landslide predisposing factors (elevation, slope, aspect, curvature, TRI, SRR, SEI, soil density, forest type, forest density, and distance from road) were established through an abundant literature review of existing landslide susceptibility studies. The established predisposing factors were normalized to a comparable range of 0.01–0.99 for further data analysis and ANN modeling. This process can guarantee stable convergence of weight and biases in ANN modeling [56]. Future studies are recommended to use geotechnical databases such as internal friction angle, cohesion, and permeability coefficient, although such databases may require significant amounts of money and time to build. As such factors directly influence slope stability, reliable research results may be obtained.
Factor selection for assessing landslide susceptibility is an essential task that influences the quality of ANN models, as not all factors affect landslide occurrence. In this study, information gain ratio (IGR), Pearson correlation, VIF, and tolerance analyses were subsequently performed to check the predictive power of each predisposing factor and conduct multicollinearity diagnosis. Although there is no universal agreement regarding factor selection methods, a high-performance ANN model was successfully developed through the method applied in this study.
In the IGR analysis phase, slope showed the highest value of average merit among the predisposing factors, which is judged to be due to its significant contribution to the factor of safety. In contrast, six factors (forest density, soil texture, forest type, soil density, weathering, distance from stream, and distance from road) were determined to possess no predictive ability and were excluded from this study. Nonetheless, the six excluded factors should be further studied in other regions, as these factors may possess predictive power if additional databases are accumulated from different regions. In the multicollinearity diagnosis phase, slope and elevation showed high correlation with TRI as well as STI and SPI. Although high correlation does not necessarily indicate multicollinearity, the calculation formulas of TRI, STI, and SPI indicate high correlation between each variable. Thus, data regarding TRI and SPI were eliminated at low-predictive ability orders. Finally, VIF and tolerance analyses determined that there was no multicollinearity between the 11 selected factors.
The 11 selected predisposing factors were randomly split into a 70:30 ratio for training and validation. Although there are no specific guidelines for dividing datasets, this process may prevent the overfitting or underfitting problem and enable reliable model verifications compared to models that do not divide datasets. The importance of dividing datasets for training and validation was also mentioned and discussed by Chung and Fabbri [57], Tien Bui et al. [22], and several other researchers.
In this study, six models, each with a different non-linear activation function in the hidden layer, were evaluated and validated using the Kappa index, AUC, and five statistical measures. As a result, the best performing MLP model was the model that used the hyperbolic tangent sigmoid (Tan-sig) function with five neurons in the hidden layer (Figure 7, Table 4 and Table 5). The models developed with the six activation functions were identified as comparable models by the non-parametric Friedman test, which showed the models as having significant differences with each other (Table 6). Finally, the Tan-sig model showed that 89.4% of all historical landslides ranged from the Very High to Moderate classes and produced a landslide density result of 4.38 for the Very High susceptible class, which is the highest value among the six models (Table 7 and Table 8). In other words, a Tan-sig function in the hidden layer best represents the complex and non-linear relationship between the predisposing factors and landslide occurrence in the study area.
The susceptibility model developed in this study is based on a single-event inventory with one extreme rainfall pattern. Slope failures are caused by the weakening of soil unsaturated shear strength as the soil becomes saturated due to rainfall infiltration. The destabilizing force exerted on the soil layer is related to the layer thickness and geotechnical properties as these factors affect normal stress and shear strength, respectively. Rainfall patterns and soil permeability dictate the rate of water infiltration into the soil; hence, both affect the saturation of the soil layer at a particular time and location. For example, if a soil has a shallow depth and a large permeability coefficient, it will be more sensitive to rainfall patterns with intensive rainfall over a short period. In contrast, if the soil layer is deep and the permeability is relatively small, rainfall patterns with low rainfall intensity over long periods of time will have greater effects on landslide occurrence. Therefore, in order to enhance the performance of the susceptibility model, a future follow-up study should be conducted using an updated multi-temporal landslide inventory generated with consideration of other rainfall conditions.

6. Conclusions

This study demonstrates the systematic procedure of determining the optimal structure of an ANN-based landslide susceptibility model for identifying landslide-prone areas in Mount Umyoen, Korea. The main objective of this study was to design the optimal structure of the proposed MLP model, taking into account the factor selection method and various non-linear activation functions. The seven main procedures to achieve this purpose were as follows: (1) collecting historical landslide data, (2) constructing landslide predisposing factors, (3) preparing training and validation datasets, (4) applying a factor selection to select suitable database subsets, (5) developing landslide susceptibility models, (6) validating and comparing landslide susceptibility models, and (7) selecting the best performing model.
The best model was the MLP model consisting of an 11 × 5 × 1 structure with the hyperbolic tangent sigmoid function in the hidden layer and the logistic sigmoid function in the output layer. The validation process confirmed that the best model (11 × 5 for the tan-sig function × 1 for the log-sig function) had a sensitivity of 82.61%, specificity of 78.26%, accuracy of 80.43%, positive predictive value of 79.17%, negative predictive value of 81.82%, and an AUC value of 0.879. In addition, the Kappa index was 0.609, indicating substantial agreement between the observed and predicted values. As a final conclusion, the results of this study may be useful for preemptive response in landslide-risk areas.

Author Contributions

Conceptualization, D.-H.L.; Methodology, D.-H.L.; Software, D.-H.L.; Validation, D.-H.L.; Formal Analysis, D.-H.L.; Investigation, D.-H.L.; Resources, D.-H.L.; Data Curation, D.-H.L.; Writing-Original Draft Preparation, D.-H.L.; Writing-Review & Editing, Y.-T.K. and S.-R.L.; Visualization, D.-H.L.; Supervision, S.-R.L.; Project Administration, Y.-T.K. and S.-R.L.; Funding Acquisition, S.-R.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Basic Research Laboratory Program through the National Research Foundation of Korea funded by the Ministry of Science and ICT (NRF-2018R1A4A1025765) and the Korea Ministry of Land, Infrastructure and Transport (MOLIT) as Innovative Talent Education Program for Smart City.

Acknowledgments

We would like to thank the editors of remote sensing and three anonymous reviewers who gave us valuable comments that would be of great help in improving the quality of our paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Frequency ratio between landslides and predisposing factors.
Table A1. Frequency ratio between landslides and predisposing factors.
FactorClassNo. of Pixels in Domain% of Pixels in DomainNo. of Landslide% of LandslideFrequency Ratio 1
Elevation23.4–68.340,7150.17140.090.54
(m)68.3–11388,8560.37360.240.64
113–15857,1200.24510.341.41
158–20330,3070.13240.161.25
203–24816,3330.07260.172.52
248–29359490.0200.000.00
Slope0–1236,9350.1500.000.00
(degree)12–24129,9450.54380.250.46
24–3664,6840.27980.652.40
36–4774790.03150.103.18
47–592370.0000.000.00
AspectFlat37130.0200.000.00
N29,0170.12110.070.60
NE28,5880.12190.131.05
E31,4560.13240.161.21
SE32,2580.13140.090.69
S32,6510.14240.161.16
SW29,8540.12210.141.11
W23,1610.10210.141.44
NW28,5820.12170.110.94
CurvatureConcave112,2640.47980.651.38
Planar10,8930.0500.000.00
Convex116,1230.49530.350.72
TRI0–1267510.0300.000.00
12–2454,2620.2350.030.15
24–3263,7160.27220.150.55
32–4259,8850.25450.301.19
42–5948,6900.20670.442.18
59–9559760.02120.083.18
SRR0–0.2352300.0200.000.00
0.23–0.4643,2240.18260.170.95
0.46–0.53130,4870.55920.611.12
0.53–0.6752,4510.22330.221.00
0.67–0.8978880.0300.000.00
SEI−58–−369790.0000.000.00
−36–−1446,3870.19410.271.40
–14–7.5104,6770.44440.290.67
7.5–2982,1890.34540.361.04
29–5150480.02120.083.77
TWI1.2–2.369,4650.29440.291.00
2.3–2.885,4220.36620.411.15
2.8–3.559,6510.25370.250.98
3.5–4.618,9010.0880.050.67
4.6–9.958410.0200.000.00
STI0–528,7430.1200.000.00
5–25129,8870.54550.360.67
25–5055,7270.23640.421.82
50–10017,8170.07230.152.05
>10071060.0390.062.01
SPI0–513,1420.0500.000.00
5–50103,8200.43340.230.52
50–10050,6760.21430.281.34
100–50058,6280.25630.421.70
>50013,0140.05110.071.34
Distance0–2179,7050.33460.300.91
from21–4373,8010.31450.300.97
stream43–6757,8080.24440.291.21
(m)67–12026,8750.11160.110.94
120–25010910.0000.000.00
LithologyMetamorphic179,6070.751420.941.25
Sedimentary57,3440.2490.060.25
No data23290.0100.000.00
WeatheringHigh109,1360.461050.701.52
Moderate127,8150.53460.300.57
No data23290.0100.000.00
Effective1–3390,7740.391000.661.70
soil depth33–4956,4260.24310.210.87
(cm)49–5668,3070.29200.130.46
56–6921,4440.0900.000.00
No data2,3290.0100.000.00
Soil textureSilty loam90,7120.38410.270.72
Sandy loam146,2390.611100.731.19
No data23290.0100.000.00
Soil type 2 B 1 117,4810.491090.721.47
B 2 107,1420.45420.280.62
B 3 12,3280.0500.000.00
No data23290.0100.000.00
Soil densityMedium dense27,0190.11230.151.35
Loose188,7020.791160.770.97
Very loose21,2300.09120.080.90
No data23290.0100.000.00
Forest typeConiferous57960.0210.010.27
Broadleaf216,2850.901400.931.03
Mixed14,9140.0690.060.96
No forest22850.0110.010.69
Forest Dense226,6440.951460.971.02
DensityModerate10,3510.0440.030.61
No forest22850.0110.010.69
Distance0–130100,9380.42590.390.93
from road130–30059,6320.25430.281.14
(m)300–55043,3860.18230.150.84
550–90017,7340.07130.091.16
900–150017,5900.07130.091.17
1 Frequency ration is calculated by % of pixels in domain / % of landslide. 2 B 1 is dry brown forest soil, B 2 is slight dry brown forest soil, and B 3 is moderately moist brown forest soil.

References

  1. Cho, S.E. Prediction of shallow landslide by surficial stability analysis considering rainfall infiltration. Eng. Geol. 2017, 231, 126–138. [Google Scholar] [CrossRef]
  2. Pradhan, A.M.S.; Lee, S.R.; Kim, Y.T. A shallow slide prediction model combining rainfall threshold warnings and shallow slide susceptibility in Busan, Korea. Landslides 2019, 16, 647–659. [Google Scholar] [CrossRef]
  3. Park, D.W.; Lee, S.R.; Vasu, N.N.; Kang, S.H.; Park, J.Y. Coupled model for simulation of landslides and debris flows at local scale. Nat. Hazards 2016, 81, 1653–1682. [Google Scholar] [CrossRef]
  4. Park, J.Y.; Lee, S.R.; Lee, D.H.; Kim, Y.T.; Lee, J.S. A regional-scale landslide early warning methodology applying statistical and physically based approaches in sequence. Eng. Geol. 2019, 260, 1–14. [Google Scholar] [CrossRef]
  5. Jeong, S.; Lee, K.; Kim, J.; Kim, Y. Analysis of rainfall-induced landslide on unsaturated soil slopes. Sustainability 2017, 9, 1280. [Google Scholar] [CrossRef] [Green Version]
  6. Park, H.J.; Lee, J.H.; Woo, I. Assessment of rainfall-induced shallow landslide susceptibility using a GIS-based probabilistic approach. Eng. Geol. 2013, 161, 1–15. [Google Scholar] [CrossRef]
  7. Kim, J.; Lee, K.; Jeong, S.; Kim, G. GIS-based prediction method of landslide susceptibility using a rainfall infiltration-groundwater flow model. Eng. Geol. 2014, 182, 63–78. [Google Scholar] [CrossRef]
  8. Formetta, G.; Capparelli, G.; Versace, P. Evaluating performance of simplified physically based models for shallow landslide susceptibility. Hydrol. Earth Syst. Sci. 2016, 20, 4585–4603. [Google Scholar] [CrossRef] [Green Version]
  9. Van Westen, C.J.; Terlien, M.T.J. An approach towards deterministic landslide hazard analysis in GIS. A case study from Manizales (Colombia). Earth Surf. Process. Landforms 1996, 21, 853–868. [Google Scholar] [CrossRef]
  10. Tofani, V.; Bicocchi, G.; Rossi, G.; Segoni, S.; D’Ambrosio, M.; Casagli, N.; Catani, F. Soil characterization for shallow landslides modeling: A case study in the Northern Apennines (Central Italy). Landslides 2017, 14, 755–770. [Google Scholar] [CrossRef] [Green Version]
  11. Canli, E.; Mergili, M.; Thiebes, B.; Glade, T. Probabilistic landslide ensemble prediction systems: Lessons to be learned from hydrology. Nat. Hazards Earth Syst. Sci. 2018, 18, 2183–2202. [Google Scholar] [CrossRef] [Green Version]
  12. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  13. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  14. Tien Bui, D.; Tuan, T.A.; Hoang, N.D.; Thanh, N.Q.; Nguyen, D.B.; Van Liem, N.; Pradhan, B. Spatial prediction of rainfall-induced landslides for the Lao Cai area (Vietnam) using a hybrid intelligent approach of least squares support vector machines inference model and artificial bee colony optimization. Landslides 2017, 14, 447–458. [Google Scholar] [CrossRef]
  15. Shahin, M.A.; Jaksa, M.B.; Maier, H.R. State of the art of artificial neural networks in geotechnical engineering. Electron. J. Geotech. Eng. 2008, 13, 1–25. [Google Scholar]
  16. Lee, S.J.; Lee, S.R.; Kim, Y.S. An approach to estimate unsaturated shear strength using artificial neural network and hyperbolic formulation. Comput. Geotech. 2003, 30, 489–503. [Google Scholar] [CrossRef]
  17. Lee, S. Landslide susceptibility mapping using an artificial neural network in the Gangneung are, Korea. Int. J. Remote Sens. 2007, 28, 4763–4783. [Google Scholar] [CrossRef]
  18. Pradhan, B.; Lee, S. Landslide susceptibility assessment and factor effect analysis: Backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ. Model. Softw. 2010, 25, 747–759. [Google Scholar] [CrossRef]
  19. Ercanoglu, M. Landslide susceptibility assessment of SE Bartin (West Black Sea region, Turkey) by artificial neural networks. Nat. Hazards Earth Syst. Sci. 2005, 5, 979–992. [Google Scholar] [CrossRef] [Green Version]
  20. Arnone, E.; Francipane, A.; Noto, L.V.; Scarbaci, A.; La Loggia, G. Strategies investigation in using artificial neural network for landslide susceptibility mapping: Application to a Sicilian catchment. J. Hydroinformatics 2014, 16, 502–515. [Google Scholar] [CrossRef]
  21. Vasu, N.N.; Lee, S.R. A hybrid feature selection algorithm integrating an extreme learning machine for landslide susceptibility modeling of Mt. Woomyeon, South Korea. Geomorphology 2016, 263, 50–70. [Google Scholar] [CrossRef]
  22. Tien Bui, D.; Pradhan, B.; Lofman, O.; Revhaug, I.; Dick, O.B. Landslide susceptibility assessment in the Hoa Binh province of Vietnam: A comparison of the Levenberg-Marquardt and Bayesian regularized neural networks. Geomorphology 2012, 171–172, 12–29. [Google Scholar] [CrossRef]
  23. Lee, S.; Ryu, J.H.; Won, J.S.; Park, H.J. Determination and application of the weights for landslide susceptibility mapping using an artificial neural network. Eng. Geol. 2004, 71, 289–302. [Google Scholar] [CrossRef]
  24. Ermini, L.; Catani, F.; Casagli, N. Artificial Neural Networks applied to landslide susceptibility assessment. Geomorphology 2005, 66, 327–343. [Google Scholar] [CrossRef]
  25. Madić, M.J.; Radovanović, M.R. Optimal selection of ANN training and architectural parameters using taguchi method: A case study. FME Trans. 2011, 39, 79–86. [Google Scholar]
  26. Kavzoglu, T.; Mather, P.M. The role of feature selection in artificial neural network applications. Int. J. Remote Sens. 2002, 23, 2919–2937. [Google Scholar] [CrossRef]
  27. Yune, C.Y.; Jeong, S.; Kim, M.M. Susceptibility assessment of rainfall induced landslides: A case study of the debris flow on July 27, 2011 at Umyeonsan (Mt.). In Proceedings of the 19th International Conference on Soil Mechanics and Geotechnical Engineering, Seoul 2017, Seoul, Korea, 17–22 September 2017; pp. 265–278. [Google Scholar]
  28. Jeong, S.; Kim, Y.; Lee, J.K.; Kim, J. The 27 July 2011 debris flows at Umyeonsan, Seoul, Korea. Landslides 2015, 12, 799–813. [Google Scholar] [CrossRef]
  29. Park, D.W.; Nikhil, N.V.; Lee, S.R. Landslide and debris flow susceptibility zonation using TRIGRS for the 2011 Seoul landslide event. Nat. Hazards Earth Syst. Sci. 2013, 13, 2833–2849. [Google Scholar] [CrossRef] [Green Version]
  30. Riley, S.J.; DeGloria, S.D.; Elliot, R. A Terrain Ruggedness Index that Quantifies Topographic Heterogeneity. Intermt. J. Sci. 1999, 5, 23–27. [Google Scholar]
  31. Pike, R.J.; Wilson, S.E. Elevation-Relief Ratio, Hypsometric Integral, and Geomorphic Area-Altitude Analysis. Geol. Soc. Am. Bull. 1971, 82, 1079–1084. [Google Scholar] [CrossRef]
  32. Davies, K.W.; Petersen, S.L.; Johnson, D.D.; Davis, D.B.; Madsen, M.D.; Zvirzdin, D.L.; Bates, J.D. Estimating juniper cover from national agriculture imagery program (NAIP) imagery and evaluating relationships between potential cover and environmental variables. Rangel. Ecol. Manag. 2010, 63, 630–637. [Google Scholar] [CrossRef]
  33. Ray, R.L.; Jacobs, J.M. Relationships among remotely sensed soil moisture, precipitation and landslide events. Nat. Hazards 2007, 43, 211–222. [Google Scholar] [CrossRef]
  34. Beven, K.J.; Kirkby, M.J.; Schofield, N.; Tagg, A.F. Testing a physically-based flood forecasting model (TOPMODEL) for three U.K. catchments. J. Hydrol. 1984, 69, 119–143. [Google Scholar] [CrossRef]
  35. Moore, I.D.; Burch, G.J. Physical basis of the length-slope factor in the universal soil loss equation. Soil Sci. Soc. Am. J. 1986, 50, 1294–1298. [Google Scholar] [CrossRef]
  36. Moore, I.D.; Grayson, R.B.; Ladson, A.R. Digital terrain modelling: A review of hydrological, geomorphological and biological applications. Hydrol. Process. 1984, 5, 3–30. [Google Scholar]
  37. Segoni, S.; Pappafico, G.; Luti, T.; Catani, F. Landslide susceptibility assessment in complex geological settings: Sensitivity to geological information and insights on its parameterization. Landslides 2020. [Google Scholar] [CrossRef] [Green Version]
  38. Carrara, A.; Cardinali, M.; Detti, R.; Guzzetti, F.; Pasqui, V.; Reichenbach, P. GIS techniques and statistical models in evaluating landslide hazard. Earth Surf. Process. Landf. 1991, 16, 427–445. [Google Scholar] [CrossRef]
  39. Conforti, M.; Pascale, S.; Robustelli, G.; Sdao, F. Evaluation of prediction capability of the artificial neural networks for mapping landslide susceptibility in the Turbolo River catchment. Catena 2014, 113, 236–250. [Google Scholar] [CrossRef]
  40. Dietrich, W.E.; Reiss, R.; Hsu, M.; Montgomery, D.R. A process-based model for colluvial soil depth and shallow landsliding using digital elevation data. Hydrol. Process. 1995, 9, 383–400. [Google Scholar] [CrossRef]
  41. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: Berlin/Heidelberg, Germany, 2013; pp. 1–170. [Google Scholar]
  42. Shirzadi, A.; Soliamani, K.; Habibnejhad, M.; Kavian, A.; Chapi, K.; Shahabi, H.; Chen, W.; Khosravi, K.; Pham, B.T.; Pradhan, B.; et al. Novel GIS based machine learning algorithms for shallow landslide susceptibility mapping. Sensors (Switzerland) 2018, 18, 3777. [Google Scholar] [CrossRef]
  43. Zhou, C.; Yin, K.; Cao, Y.; Ahmed, B.; Li, Y.; Catani, F.; Pourghasemi, H.R. Landslide susceptibility modeling applying machine learning methods: A case study from Longju in the Three Gorges Reservoir area, China. Comput. Geosci. 2018, 112, 23–37. [Google Scholar] [CrossRef] [Green Version]
  44. O’Brien, R.M. A caution regarding rules of thumb for variance inflation factors. Qual. Quant. 2007, 41, 673–690. [Google Scholar] [CrossRef]
  45. Donald, F.; Glauber, R. Multicollinearity in Regression Analysis: The Problem Revisited. Rev. Econ. Stat. 1967, 49, 92–107. [Google Scholar]
  46. Belsley, D.A. A Guide to using the collinearity diagnostics. Comput. Sci. Econ. Manag. 1991, 4, 33–50. [Google Scholar]
  47. Bui, D.T.; Lofman, O.; Revhaug, I.; Dick, O. Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression. Nat. Hazards 2011, 59, 1413–1444. [Google Scholar] [CrossRef]
  48. Biswajeet, P.; Saied, P. Comparison between prediction capabilities of neural network and fuzzy logic techniques for L and slide susceptibility mapping. Disaster Adv. 2010, 3, 26–34. [Google Scholar]
  49. Kayri, M. Predictive abilities of Bayesian regularization and Levenberg-Marquardt algorithms in artificial neural networks: A comparative empirical study on social data. Math. Comput. Appl. 2016, 21, 20. [Google Scholar] [CrossRef]
  50. MacKay, D.J.C. Bayesian Interpolation. Neural Comput. 1992, 4, 415–447. [Google Scholar] [CrossRef]
  51. Foresee, F.D.; Hagan, M.T. Gauss-Newton approximation to Bayesian learning. In Proceedings of the International Joint Conference on Neural Networks, Houston, TX, USA, 12 June 1997; pp. 1930–1935. [Google Scholar]
  52. Cohen, J. A Coefficient of Agreement for Nominal Scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
  53. Guzzetti, F.; Reichenbach, P.; Ardizzone, F.; Cardinali, M.; Galli, M. Estimating the quality of landslide susceptibility models. Geomorphology 2006, 81, 166–184. [Google Scholar] [CrossRef]
  54. Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Pham, B.T.; Pradhan, B.; Tien Bui, D.; Prakash, I.; Dholakia, M.B. A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India). Environ. Model. Softw. 2016, 84, 240–250. [Google Scholar] [CrossRef]
  56. Sola, J.; Sevilla, J. Importance of input data normalization for the application of neural networks to complex industrial problems. IEEE Trans. Nucl. Sci. 1997, 44, 1464–1468. [Google Scholar] [CrossRef]
  57. Chung, C.J.; Fabbri, A.G. Predicting landslides for risk analysis—Spatial models tested by a cross-validation technique. Geomorphology 2008, 94, 438–452. [Google Scholar] [CrossRef]
Figure 1. Landslide locations in the study area.
Figure 1. Landslide locations in the study area.
Remotesensing 12 01194 g001
Figure 2. Hourly and total rainfall distribution on 25–27 July 2011 at Namhyun and Seocho rainfall station [29].
Figure 2. Hourly and total rainfall distribution on 25–27 July 2011 at Namhyun and Seocho rainfall station [29].
Remotesensing 12 01194 g002
Figure 3. Digital orthoimages of the study area: (a) before landslides in 2011; (b) after landslides in 2011.
Figure 3. Digital orthoimages of the study area: (a) before landslides in 2011; (b) after landslides in 2011.
Remotesensing 12 01194 g003
Figure 4. Landslide predisposing factors: (a) elevation; (b) slope; (c) aspect; (d) curvature; (e) topographic ruggedness index (TRI); (f) surface relief ratio (SRR); (g) site exposure index (SEI); (h) topographic wetness index (TWI); (i) sediment transport index (STI); (j) stream power index (SPI); (k) distance from stream; (l) lithology; (m) weathering; (n) effective soil depth; (o) soil texture; (p) soil type; (q) soil density; (r) forest type; (s) forest density; (t) distance from road.
Figure 4. Landslide predisposing factors: (a) elevation; (b) slope; (c) aspect; (d) curvature; (e) topographic ruggedness index (TRI); (f) surface relief ratio (SRR); (g) site exposure index (SEI); (h) topographic wetness index (TWI); (i) sediment transport index (STI); (j) stream power index (SPI); (k) distance from stream; (l) lithology; (m) weathering; (n) effective soil depth; (o) soil texture; (p) soil type; (q) soil density; (r) forest type; (s) forest density; (t) distance from road.
Remotesensing 12 01194 g004aRemotesensing 12 01194 g004bRemotesensing 12 01194 g004c
Figure 5. Architecture of the multi-layer perceptron (MLP) model.
Figure 5. Architecture of the multi-layer perceptron (MLP) model.
Remotesensing 12 01194 g005
Figure 6. Average merit of the landslide predisposing factors.
Figure 6. Average merit of the landslide predisposing factors.
Remotesensing 12 01194 g006
Figure 7. Performance of MLP models according to the activation function and the number of neurons: (a) Hard-lims, (b) Sat-lins, (c) Rad-bas, (d) ReLU, (e) Log-sig, and (f) Tan-sig functions.
Figure 7. Performance of MLP models according to the activation function and the number of neurons: (a) Hard-lims, (b) Sat-lins, (c) Rad-bas, (d) ReLU, (e) Log-sig, and (f) Tan-sig functions.
Remotesensing 12 01194 g007aRemotesensing 12 01194 g007b
Figure 8. Landslide susceptibility maps produced using the (a) Hard-lims, (b) Sat-lins, (c) Rad-bas, (d) ReLU, (e) Log-sig, and (f) Tan-sig functions.
Figure 8. Landslide susceptibility maps produced using the (a) Hard-lims, (b) Sat-lins, (c) Rad-bas, (d) ReLU, (e) Log-sig, and (f) Tan-sig functions.
Remotesensing 12 01194 g008
Table 1. Spatial database for the landslide susceptibility map.
Table 1. Spatial database for the landslide susceptibility map.
TypeFactorSourceScale
(Resolution)
Organization
Landslide InventoryDigital Ortho Images51 × 51 cmNGII
MorphologicalElevationDEM1:5,000NGII
Slope (5 × 5 m)
Aspect
Curvature
TRI
SRR
SEI
HydrologicalTWIDEM1:5,000NGII
STI (5 × 5 m)
SPI
Distance from stream
GeologicalLithologyForest Soil Map1:25,000KFS
Weathering (5 × 5 m)
Land CoverSoil effective depthForest Soil Map1:25,000KFS
Soil type (5 × 5 m)
Soil texture
Soil density
Forest type
Forest density
Distance from road
Table 2. Pearson’s correlation between pairs of landslide predisposing factors.
Table 2. Pearson’s correlation between pairs of landslide predisposing factors.
EL 1SL 2AS 3CU 4TRISRRSEITWISTISPILI 5SD 6
EL 11
SL 20.311
AS 30.120.101
CU 40.160.210.161
TRI0.760.800.130.101
SRR0.060.070.070.180.111
SEI0.040.060.060.330.000.011
TWI−0.07−0.420.010.06−0.38−0.240.091
STI0.110.230.07−0.070.220.000.030.471
SPI−0.01−0.050.02−0.06−0.030.000.020.400.831
LI 5−0.10−0.08−0.030.05−0.12−0.020.210.02−0.07−0.031
SD 6−0.27−0.110.020.06−0.23−0.050.180.240.120.110.641
ST 70.340.110.30.150.250.020.09−0.15−0.12−0.090.29−0.22
1 Elevation, 2 Slope, 3 Aspect, 4 Curvature, 5 Lithology, 6 Soil depth, and 7 Soil type.
Table 3. Multicollinearity analysis for the landslide predisposing factors.
Table 3. Multicollinearity analysis for the landslide predisposing factors.
NumberLandslide Predisposing FactorVIFTolerance
1Elevation1.3940.717
2Slope4.5700.219
3Aspect1.2560.796
4Curvature1.2040.830
5SRR2.3690.422
6SEI1.0710.933
7TWI5.4260.184
8STI2.3960.417
9Lithology3.7630.266
10Soil depth4.4620.224
11Soil type2.8780.347
Table 4. Results of model performance evaluation.
Table 4. Results of model performance evaluation.
MeasuresHard-limsSat-linsRad-basReLULog-sigTan-sig
True positive729193949597
True negative767283788992
False positive293322271613
False negative33141211108
Sensitivity (%)68.5786.6788.5789.5290.4892.38
Specificity (%)72.3868.5779.0574.2984.7687.62
Accuracy (%)70.4877.6283.8181.9087.6290.00
PPV (%)71.2973.3980.8777.6985.5988.18
NPV (%)69.7283.7287.3787.6489.9092.00
Kappa index0.4100.5520.6760.6380.7520.800
AUC0.8370.8630.9360.9300.9640.968
Table 5. Results of model performance validation.
Table 5. Results of model performance validation.
MeasuresHard-limsSat-linsRad-basReLULog-sigTan-sig
True positive323737363738
True negative282931343536
False positive181715121110
False negative14991098
Sensitivity (%)69.5780.4380.4378.2680.4382.61
Specificity (%)60.8763.0467.3973.9176.0978.26
Accuracy (%)65.2171.7473.9176.0978.2680.43
PPV (%)64.0068.5271.1575.0077.0879.17
NPV (%)30.5576.3277.5077.2779.5581.82
Kappa index0.3040.4350.4780.5220.5650.609
AUC0.7810.8150.8210.8430.8770.879
Table 6. Comparison of the six landslide susceptibility models using the Friedman test.
Table 6. Comparison of the six landslide susceptibility models using the Friedman test.
Rankingp-value
(α = 0.05)
χ2
(Chi-square)
Hard-limsSat-linsRad-basReLULog-sigTan-sig
3.613.733.553.423.063.631.78 × 10−424.445
Table 7. Percentages of historical landslides according to susceptibility class (unit: %).
Table 7. Percentages of historical landslides according to susceptibility class (unit: %).
Hard-limsSat-linsRad-basReLULog-sigTan-sig
Very High0016.65.327.235.8
High 0017.220.521.922.5
Moderate36.450.335.137.826.231.1
Low32.537.117.222.515.24.6
Very Low31.112.613.913.912.66.0
Total100100100100100100
Table 8. Landslide density according to susceptibility class.
Table 8. Landslide density according to susceptibility class.
ClassesHard-limsSat-linsRad-basReLULog-sigTan-sig
Very High-0.002.764.273.194.38
High -0.001.242.081.461.43
Moderate2.863.611.531.651.201.40
Low0.901.060.540.640.530.16
Very Low0.610.260.550.450.490.24

Share and Cite

MDPI and ACS Style

Lee, D.-H.; Kim, Y.-T.; Lee, S.-R. Shallow Landslide Susceptibility Models Based on Artificial Neural Networks Considering the Factor Selection Method and Various Non-Linear Activation Functions. Remote Sens. 2020, 12, 1194. https://doi.org/10.3390/rs12071194

AMA Style

Lee D-H, Kim Y-T, Lee S-R. Shallow Landslide Susceptibility Models Based on Artificial Neural Networks Considering the Factor Selection Method and Various Non-Linear Activation Functions. Remote Sensing. 2020; 12(7):1194. https://doi.org/10.3390/rs12071194

Chicago/Turabian Style

Lee, Deuk-Hwan, Yun-Tae Kim, and Seung-Rae Lee. 2020. "Shallow Landslide Susceptibility Models Based on Artificial Neural Networks Considering the Factor Selection Method and Various Non-Linear Activation Functions" Remote Sensing 12, no. 7: 1194. https://doi.org/10.3390/rs12071194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop