Evaluation of Recent Advanced Soft Computing Techniques for Gully Erosion Susceptibility Mapping: A Comparative Study

Gully erosion is a problem; therefore, it must be predicted using highly accurate predictive models to avoid losses caused by gully development and to guarantee sustainable development. This research investigates the predictive performance of seven multiple-criteria decision-making (MCDM), statistical, and machine learning (ML)-based models and their ensembles for gully erosion susceptibility mapping (GESM). A case study of the Dasjard River watershed, Iran uses a database of 306 gully head cuts and 15 conditioning factors. The database was divided 70:30 to train and verify the models. Their performance was assessed with the area under prediction rate curve (AUPRC), the area under success rate curve (AUSRC), accuracy, and kappa. Results show that slope is key to gully formation. The maximum entropy (ME) ML model has the best performance (AUSRC = 0.947, AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699). The second best is the random forest (RF) model (AUSRC = 0.965, AUPRC = 0.932, accuracy = 0.812 and kappa = 0.624). By contrast, the TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) model was the least effective (AUSRC = 0.871, AUPRC = 0.867, accuracy = 0.758 and kappa = 0.516). RF increased the performance of statistical index (SI) and frequency ratio (FR) statistical models. Furthermore, the combination of a generalized linear model (GLM), and functional data analysis (FDA) improved their performances. The results demonstrate that a combination of geographic information systems (GIS) with remote sensing (RS)-based ML models can successfully map gully erosion susceptibility, particularly in low-income and developing regions. This method can aid the analyses and decisions of natural resources managers and local planners to reduce damages by focusing attention and resources on areas prone to the worst and most damaging gully erosion.


Introduction
By relying on the principles of systemic view, geomorphology can help understand the mechanisms governing the natural environment. This knowledge enables humans to act in such a way that their activities will not damage the natural environment and instead complement natural processes. Nature-based solutions have been highlighted as a superior approach to land management based on engineered structures, which are still preferred by many landowners and managers [1]. To work with the landscape, geomorphologists, using their knowledge of natural morpho-dynamic factors, can predict environmental responses to prospective remedies to ensure that ecosystem services will be preserved or even restored. Before designing solutions to stop degradation and enable restoration, it is wise to understand the current state of the land.
Given the destructive effects of gully erosion (GE), solutions for managing this phenomenon to achieve sustainable development are essential [2]. Gully erosion-susceptibility mapping (GESM) is one basic method [3] to understand the mechanisms behind gully erosion. To predict the patterns of GE, a gully-erosion inventory and methods to identify and measure pertinent gully-erosion conditioning factors (GECFs) are needed [4]. Technically speaking, drainage networks, soil characteristics, rainfall, land use, topography, and lithology have been identified as the relevant GEFs controlling gully erosion and development [5]. A geographic information system (GIS), remote sensing (RS), and statistical data analyses are indispensable tools for examination of multidimensional outcomes like GE. Several factors are potential influences [3]. A variety of GIS-based approaches for GESM have been proposed and they can be classified into three types: multicriteria decision-making (MCDM), statistical modeling, and machine learning (ML) models. MCDM models are based on the knowledge of decision makers to identify, select, and weight conditioning factors [6][7][8][9][10]. These factors are combined to develop a GE model. Although recent developments in mathematical science, computational science, and computer technology have yielded more than 20 new MCDM models, the ranking of factors remains subjective. Statistical models provide a general advantage of working with diverse types of independent variables, like continues, binary, and categorical data [5]. The most successful models may be: information value (IV) [5], conditional probability [11], frequency ratio (FR) [12], evidential belief function [3], index of entropy (IoE) [13], certainty factor [14], weights of evidence (WOE) [15], and logistic regression [16]. The performance of statistical models is low, however. ML has proven to be efficient for GE modeling due to its ability to handle small training sets and factors with complex relationships. The most successful ML models for GE consist of multivariate adaptive regression spline [17], maximum entropy (ME) [18], boosted regression tree [19], artificial neural network (ANN) [20], random forest (RF) [21], linear discriminant analysis [22], bagging best-first decision tree [23], support vector machine [24], classification and regression trees [20], and flexible discriminant analysis [14], generalized linear model (GLM) [25], functional data analysis (FDA) [26], and the technique for order preference by similarity to the ideal solution (TOPSIS) [27].
Many ML models have been used by researchers, each has its disadvantages and advantages [22]. The selection of a suitable method is critical and requires careful consideration [24]. A comprehensive comparative assessment of MCDM, statistical, and ML models for GESM for use in arid and semi-arid regions of the world has not been conducted. This research attempts to begin to fill this gap by comparing seven models-MCDM-based TOPSIS, the statistically based statistical index (SI) and FR, and the ML-based RF, ME, GLM, and FDA-for GESM. SI is a simple and quantitatively suitable model that has been applied to landslide-susceptibility mapping [28,29], but this is the first time it has been used for GESM.
Ensemble models, combinations of two or more statistical and ML techniques, have been proven to work for GESM [3,11]. Theoretically, ensemble models inherit the virtues and eliminate the shortcomings of individual techniques to form more robust models [30,31]. They ensure diversity to guarantee high prediction performance of their models. Ensemble models for gully-erosion modeling can be classified into simple integration models, homogeneous frameworks, and heterogeneous frameworks [32]. Simple integration is a simple assemblage of individual methods. A homogenous framework (i.e., boosting, bagging, rotation forest) creates subsets from the original training set, uses a ML algorithm to generate a classifier for each subset, and groups the classifiers into an ensemble model. This procedure is also used for heterogenous frameworks. The difference is that different ML algorithms are used for each of the subsets to create classifiers. Errors are dramatically reduced by combining independent learners into ensemble models [30]. This study also combines models to achieve a better collective performance to map gully-erosion susceptibility.
The study was conducted in the Dasjard River watershed, Iran. This watershed is an arid region [33] that has experienced severe GE in recent years. The aims of this study are to explore the capabilities of several individual and ensemble approaches for GESM, to evaluate the influences of GECFs on GE, and to validate the modeled susceptibility maps using several criteria. Comparison of MCDM, statistical, and ML approaches and their ensembles with a RS dataset to evaluate several GEFs for GESM is novel to this study.

Description of the Study Area
The Dasjard River watershed is found between 35 • 51 24 and 36 • 22 32 N and between 55 • 29 54 and 56 • 23 14 E (Figure 1), covering 2820.29 km 2 . Elevation ranges from 793 to 2418 m above sea level (m.a.s.l.). The steepest slope in the watershed is 73 • and the mean is 3.8 • . The central portion of the watershed is generally flat. In fact, 36.5% of the study area is a relatively flat plain. Precipitation ranges from 47.34 mm to 230.43 mm annually across the region, but the average is 154.3 mm. More than 75% of the precipitation occurs in December and January [33]. The mean annual temperature is 17.8 • C and the range is from 43 • C to −6 • C [33]. The climates across the watershed are arid and semiarid [33].
Reading from the 1:100,000 scale Toroud sheet from the Geological Survey Department of Iran [34], the region is covered by Quaternary lithotypes. Clayey material, well-sorted sand dunes, salt concretions, mixed terrace deposits, and swamp or marsh deposits are the most important units [34]. The geological structure of the study area is crossed by an important E-to-W Quaternary strike fault (the Toroud fault) that is responsible for uplift of a metamorphic basement in the northern part of the study area. Morphologically, steep slopes (average > 30 • ) dominate the northern third of the area. High local relief is a product of heavy dissection of an uplifted surface. These slopes display the rectilinear-convex profile of V-shaped valleys. The central third of the watershed has gentle slopes where Quaternary deposits are found as outcrops. Denudation from mass wasting and water erosion has significantly affected hill slopes. The profiles are well articulated with concave or convex shapes and are incised by concave valleys. The landscape of the middle of the watershed is characterized by terraces, deeply dissected by V-shaped or concave valleys. More recent fluvial terraces and alluvial fans are also commonplace. Several soil types are found in the Dasjard River watershed. The soils are poorly developed (Aridisols and Entisols) [35] and frequently appear truncated or strongly degraded at the surface by water erosion.
The gully features in the study region range from 0.79 m to 365 m in length, and are at least 0.65 m deep, but can be as much as 7.2 m deep. The widths of gullies range from 0.76 m to 19.4 m. These formations reflect the primary mechanism of landscape degradation in the area, particularly soil erosion. Water erosion occurs relatively slowly over long periods of time, but even relatively small rainfall events can yield significant gully incision and retrogradation. Field monitoring of gully head cuts in the area has distinguished lateral erosion, primarily caused by instability of the perimeter edges of the incision, which leads to gravitational collapse ranging from micro-to meso-scale impacts. The main gullies have V-and U-shaped cross-sections that retrograde into steep, unstable scarps. The river valley contains gullies that formed on both river terraces and slopes. Gullying degrades agricultural land, roadways, and irrigation canals, threatening settlements and local economies.
Piping is an important process related to gully erosion. Piping dissolves soluble soil materials and disaggregates loose soil. Like sinkholes, pipes undercut structures and create tunnels. These features are most often caused by infiltration into susceptible materials and subsequent shallow groundwater flows. The Biarjamand watershed has extensive deposits of fine-grained (silt and clay) soils and in soils with soluble (salt, gypsum, and carbonate) mineral fractions. The former, due to their clay content, expand and contract as they moisten and desiccate. During dry seasons, the soil contracts, weakens, and cracks. During wet seasons, the cracks provide paths for infiltration and subsurface flows. The latter dissolve, chemically removing soil fractions and horizontally transporting minerals in solution with flowing water and then vertically to the surface through leaching.
Tunneling and gullying also occur in formations in low-and flat-lands that contain marls and silts. Erosion begins along gully scarps where water may stagnate and create holes from shrink and swell processes. As a hole grows, it may eventually connect to a main gully, widening and elongating it. Piping, tension-crack development, dispersion, bank collapse, and rill erosion are important mechanisms initiating and developing gullies. Tunneling and gullying also occur in formations in low-and flat-lands that contain marls and silts. Erosion begins along gully scarps where water may stagnate and create holes from shrink and swell processes. As a hole grows, it may eventually connect to a main gully, widening and elongating it. Piping, tension-crack development, dispersion, bank collapse, and rill erosion are important mechanisms initiating and developing gullies.

Data Used
A gully-erosion inventory map (GEIM) was developed from an extensive field survey. Three hundred and six gully erosion events were identified in the region and geolocated with a global positioning system (GPS) device ( Figure 2). They were randomly divided into a training set (70%, 213 gully locations) and a validation set (30%, 92 gully locations). Gullies occupy 141.3 km 2 , comprising 5% of the study area. The gullies, mapped as polygons, were converted to points (locations of the head-cut portion of each gully). The point locations were used in modeling and validation. An

Data Used
A gully-erosion inventory map (GEIM) was developed from an extensive field survey. Three hundred and six gully erosion events were identified in the region and geolocated with a global positioning system (GPS) device ( Figure 2). They were randomly divided into a training set (70%, 213 gully locations) and a validation set (30%, 92 gully locations). Gullies occupy 141.3 km 2 , comprising Sensors 2020, 20, 335 5 of 22 5% of the study area. The gullies, mapped as polygons, were converted to points (locations of the head-cut portion of each gully). The point locations were used in modeling and validation. An equivalent number and percentage of non-gully point locations were randomly chosen and were used in calibration and validation procedures.
impacts significantly increase overland flow and enable more rapid run-off, which easily erodes bare soil and causes gullying [47]. The distance to road measurements were computed from a road network layer that was extracted from a 1:50,000-scale topographic map and Google Earth images. Vegetation protects soil from many types of erosion. Vegetation can decrease the vulnerability of an area by aiding infiltration and holding soil in places with plants' roots [2]. The NDVI was computed from LANDSAT-8 data and Equation (5): where IR is the infrared portion of the electromagnetic spectrum and R is the red portion of the electromagnetic spectrum. The layers were unified using the UTM Zone39N geographic coordinate system at a pixel size of 12.5 m (the DEM's spatial resolution). The classes of the GECFs are presented in Table 2.  The fifteen most important gully erosion conditioning factors (GECFs) were chosen based on a literature review [3,11,[13][14][15], assessments of the physical characteristics of the study area, the scale, and availability of data, and a consideration of the intended purpose of research. These GECFs are elevation, slope, plan curvature (PC), topography wetness index (TWI), convergence index (CI), terrain ruggedness index (TRI), topography position index (TPI), distance to stream, drainage density, distance to road, normalized difference vegetation index (NDVI), rainfall, soil type, land use/land cover (LU/LC), and lithology (Table 1). An ALOS DEM with a spatial resolution of 12.5 m, topography and geology maps with 1:50,000 (www.ngo-org.ir) and 1:100,000 (Toroud sheet) scales, LANDSAT-8 images archived by USGS (https://earthexplorer.usgs.gov/), a soil map with 1:100,000 scale, and rainfall statistics for a 30-year period (1986 to 2016) were used to prepare GECFs (Figure 3a-o).
Elevation affects vegetation and precipitation patterns. They therefore control the spatial distribution of gully erosion and the processes at work [36]. Slope affects surface runoff, soil erosion, and drainage density patterns. The steepness of the slope is also important as it enhances or attenuates the energy of erosive processes and gully erosion [37]. Plan curvature causes convergence or divergence of water flows on slopes and influences downslope flow [36]. These three parameters were extracted from the ALOS DEM.
Erosive-runoff capacity reflects transport capacity and flow velocity, and is determined by TWI. TWI is crucial for identifying areas prone to gully erosion [36]. The CI measures how flow in a cell diverges (negative CI values) or converges (positive CI values) [38]. The TRI indicates convexity and concavity of slopes which influences gully erosion [39]. The TPI compares the height of each pixel in the DEM to the average height of the pixels around it. This factor enables classification of landscapes into morphological classes. Positive and negative values indicate that a pixel is higher or lower in elevation than the pixels that surround it [40]. TWI, SPI, TRI, and TPI were calculated with Equations (1)-(4) [41,42]: Sensors 2020, 20, 335 where A S is the catchment area of the basin (m 2 /m), β is slope steepness (degrees), x is the elevation of each neighbor cell to a specific cell (0,0) (m), and max and min are the largest and smallest elevations among the nine neighboring pixels. E pixel is the elevation of the cell, and E surrounding is the mean elevation of the neighbor pixels.   Gully erosion depends on the lithology of the material at or near the surface [43,44]. The lithology layer was prepared by digitizing a geological map (Geological Survey Department of Iran, Toroud sheet at 1:100,000 scale) [34]. Description of lithology units in the study area are shown in Table S1.
Land use/land cover (LU/LC) impacts slope stability and gully formation. Bare lands are very prone to erosion, but land with vegetative cover has significantly less erosion [45]. An LU/LC map of the study area was produced from Landsat 8 imagery sensed on 9 August 2018. A supervised classification using the maximum likelihood algorithm was used to create the LU/LC map. The map was ground-truthed using 495 ground control points (GCP). The Kappa coefficient of the map generated is 96.23%. Gullies are often linked to the drainage/stream network, and they facilitate the transport of material eroded from upland areas [36]. Higher drainage densities generate greater amounts of surface runoff [46]. To measure the parameter distance to stream and drainage density, the stream network was extracted from a PALSAR DEM in Arc Hydro. To provide more accurate flow direction and flow accumulation measures, holes in the DEM were filled, after which flow direction and accumulation were extracted. A threshold of 500 cells was used to extract the stream network. After stream extraction, the Euclidean Distance and Line Density tools in ArcGIS10.5 were used to calculate distances and densities of streams.
Roads as impenetrable surfaces disrupt natural drainage with improperly constructed culverts, by concentrating surface runoff, and by altering the hydrological functions of hillslopes. These impacts significantly increase overland flow and enable more rapid run-off, which easily erodes bare soil and causes gullying [47]. The distance to road measurements were computed from a road network layer that was extracted from a 1:50,000-scale topographic map and Google Earth images. Vegetation protects soil from many types of erosion. Vegetation can decrease the vulnerability of an area by aiding infiltration and holding soil in places with plants' roots [2]. The NDVI was computed from LANDSAT-8 data and Equation (5): where IR is the infrared portion of the electromagnetic spectrum and R is the red portion of the electromagnetic spectrum. The layers were unified using the UTM Zone39N geographic coordinate system at a pixel size of 12.5 m (the DEM's spatial resolution). The classes of the GECFs are presented in Table 2.

Frequency Ratio (FR) and Statistical Index (SI)
Two bivariate statistical models, FR and SI (descriptions and explanations found in [53][54][55][56]), have high potential for modeling environmental processes [53]. In these models, GECF and GEIM are considered dependent and independent variables, respectively. Each GECF thematic layer was analyzed relative to gullying to generate a weight of classes for that factor. The probability of gullying for each pixel was calculated using the algebraic sum of the weights of classes of all layers.

Random Forest (RF)
The RF model can be used to assess environmental issues and hazards [57]. This model combines several tree algorithms to generate repeated predictions of each phenomenon [58]. It can also learn complicated patterns and factor in the nonlinear relationships between explanatory and dependent variables. It can also incorporate and combine different data types because it does not assume anything about the distributions of the data. This model can incorporate thousands of input variables without deleting any. Details of RF can be found in [59]. In this study, RF analyses were conducted in R 3.3.1 using the 'Randomforest' package [60].

Maximum Entropy (ME)
ME is a prediction model guided by entropy maximization [61]. This model maximizes the probabilities without parametric assumptions about the input variables [62]. A detailed explanation of ME can be found in [63].

Generalized Linear Model (GLM)
GLM is the extension of the classic linear-regression model [64]. A detailed explanation of this model can be found in [64]. The species distribution modeling (SDM) package [65] was used to run GLM in R 3.3.3.

Functional Data Analysis (FDA)
The FDA model, suitable for observation data consisting of a series of real functions, was proposed by Ramsay and Dalzell [66]. A detailed explanation of the FDA model can be found in [67]. The FDA model was used to construct the GESM with the SDM package in R [65]. TOPSIS was introduced by Hwang and Yoon [68]. The underlying logic of TOPSIS is to define the positive and negative ideal solutions. Details of this model are found in [69]. To prepare GESM with TOPSIS, the qualitative parameters were converted into quantitative parameters using the FR method. The parameters should have ascending or descending trends. Parameters that did not follow either of these trends were also weighted using the FR method. Once weighted, GECF values for 500 randomly selected points in the study area were extracted and input into SPSS. A decision matrix with 15 columns and 500 rows was prepared. The TOPSIS model was then applied in SPSS and the final weight of each point was determined. The GIS point layer was populated using interpolation (kriging), thus creating a GESM.

Ensemble Approaches (GLM-FDA, FR-RF and SI-RF)
The consensus is that each model has its apparent advantages and disadvantages [3]. In this study, an ensemble of five models-FR, SI, GLM, FDA and RF-were used to produce GESMs. These integrated methods eliminate several disadvantages of bivariate methods: the failure to calculate the importance of parameters and non-calculation of the spatial relationships between the feature of interest (e.g., gullying) and the parameters that affect their formation.

Methodology
This research consists of several main steps (Figure 4). Data collection occurred either in the library, in the field, or in the laboratory.
Step 2: Multicollinearity analysis. If collinearity occurs among the parameters, the prediction accuracy of a model will decrease [3]. Indices of tolerance (TOL) and variance inflation factor (VIF) were used to evaluate collinearity [70]. If VIF ≤ 5 or 10 and TOL≤ 0.1 or 0.2, then no collinearity exists between factors [71].
Step 3: Configuring and training the GE models.
Step 4: Performance assessment using cutoff dependence (Area under prediction rate curve [AUPRC] and area under success rate curve [AUSRC]) and cutoff independence (accuracy and kappa).

Methodology
This research consists of several main steps (Figure 4). Data collection occurred either in the library, in the field, or in the laboratory.
Step 2: Multicollinearity analysis. If collinearity occurs among the parameters, the prediction accuracy of a model will decrease [3]. Indices of tolerance (TOL) and variance inflation factor (VIF) were used to evaluate collinearity [70]. If VIF ≤ 5 or 10 and TOL≤ 0.1 or 0.2, then no collinearity exists between factors [71].
Step 3: Configuring and training the GE models.
Step 4: Performance assessment using cutoff dependence (Area under prediction rate curve [AUPRC] and area under success rate curve [AUSRC]) and cutoff independence (accuracy and kappa).

Multicollinearity Test (MT)
The MT (Table 2) showed that no collinearity existed amongst conditioning factors. The minimum and maximum of TOL and VIF were (0.214-0.940) and (1.108-4.66), respectively. All

Multicollinearity Test (MT)
The MT (Table 2) showed that no collinearity existed amongst conditioning factors. The minimum and maximum of TOL and VIF were (0.214-0.940) and (1.108-4.66), respectively. All thematic layers were used in the modeling processes.

Spatial Relationship between Conditioning Factors and Gully Locations
The spatial relationships between gully locations and the GECFs calculated with the FR and SI models are shown in Table 3. Elevations below 1005 m, slopes < 5 • , flat plan curvature, TWI > 11.69, and CI > 53.7 are strongly correlated with gullies. Regarding TRI < 1.09, TPI from −2.85 to 2.28, <100 m from a stream, drainage densities ranging from 1.79 to 2.26 km/km 2 , and places between 500 to 1000 m to the nearest road were the most susceptible to GE. NDVI between −0.04 and 0.12, <114.05 mm rain, aridisols, bareland (LU/LC) and Quaternary lithotypes (clayey material, well-sorted sand dunes, salt concretions, mixed terrace deposits and swamp or marsh deposits) were strongly correlated with gullies.

Relative Importance of Conditioning Factors Using the RF Model
The relative importance of conditioning factors was determined using the RF model ( Figure 5). Slope (21.46), TPI (17.96), and elevation (16.89) were keys to GE in the study area. By contrast, NDVI, convergence index, and drainage density are least important determinants of gully formation. The distance to stream, soil type, LU/LC, lithology, distance to road, rainfall, TRI, plan curvature, and TWI rank from 4th to 12th, respectively.

Relative Importance of Conditioning Factors Using the RF Model
The relative importance of conditioning factors was determined using the RF model ( Figure 5). Slope (21.46), TPI (17.96), and elevation (16.89) were keys to GE in the study area. By contrast, NDVI, convergence index, and drainage density are least important determinants of gully formation. The distance to stream, soil type, LU/LC, lithology, distance to road, rainfall, TRI, plan curvature, and TWI rank from 4th to 12th, respectively.

Gully Erosion Susceptibility Mapping (GESM)
The minimum and maximum values (Table 4) of the GESMs (Figure 6a-j) produced with the 10 models are diverse. The proportions of the study area classified into the five susceptibility classes by each model (Figure 7) display significantly different results, with SI-RF and SI generating the most widespread classification of land into high or very high susceptibility and ME classifying the greatest percentage of the area as low or very low. The minimum and maximum values (Table 4) of the GESMs (Figure 6a-j) produced with the 10 models are diverse. The proportions of the study area classified into the five susceptibility classes by each model (Figure 7) display significantly different results, with SI-RF and SI generating the most widespread classification of land into high or very high susceptibility and ME classifying the greatest percentage of the area as low or very low. Table 4. Values of resulted gully erosion maps using ten models.

* Models
Classification with a Natural Break Model Very Low Low Moderate High Very High

Validation of Results
Validation employed AUSRC ( Figure 8) and revealed the RF model (AUSRC = 0.965) ( Table 5) performed best, whereas considering AUPRC, accuracy, and kappa criteria, the ME model (AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699) performed best, followed by RF (Table 5 and Figure 9). Results indicate that combining the RF model with FR and SI models increased the performances of the latter compared to them as stand-alone models.

Validation of Results
Validation employed AUSRC ( Figure 8) and revealed the RF model (AUSRC = 0.965) (Table 5) performed best, whereas considering AUPRC, accuracy, and kappa criteria, the ME model (AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699) performed best, followed by RF (Table 5 and Figure 9). Results indicate that combining the RF model with FR and SI models increased the performances of the latter compared to them as stand-alone models.

Validation of Results
Validation employed AUSRC ( Figure 8) and revealed the RF model (AUSRC = 0.965) (Table 5) performed best, whereas considering AUPRC, accuracy, and kappa criteria, the ME model (AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699) performed best, followed by RF (Table 5 and Figure 9). Results indicate that combining the RF model with FR and SI models increased the performances of the latter compared to them as stand-alone models.

Discussion
In this study, 10 GIS-based statistical, machine-learning, and multicriteria models were integrated with RS data to generate GESMs. The statistical models FR and SI were included because of their simplicity and high efficiency, and to ease of interpretation of the results [72][73][74]. The TOPSIS method was used because it requires only simple calculations and minimal computation time, has the capacity to rank the alternatives, uses both quantitative and qualitative criteria, and can determine the relative importance of alternatives and compliance with the conditions. The TOPSIS method, being local and experimental, is known as one of the best methods for decision making [75]. It is well suited for several scenarios and criteria [67,75]. The RF, MI, FDA, and GLM models were included because of their capacities to predict environmental phenomena [63,66]. In cases in which key data are missing, the FDA model is more efficient than traditional methods. The GLM method solves nonlinear and multiclass problems well [63]. Any of the models in this set of selected models could produce a reliable GESM.
Assessments of the spatial relationships between the GECFs and the gullies show that gullies formed mainly in areas below 1005 m elevation and on slopes angles less than 5 • . Topography affects vegetation types, drainage area, geomorphological processes, weathering, soil moisture, drainage density, soil types, and precipitation. All these directly or indirectly influence GE potential [17,76,77]. Areas with gentle slopes have a high potential for accumulation of overland flows that can initiate gullying [78]. Surface and subsurface water are also key factors for gullying [79]. In both circumstances (above and below ground), the slope is the main factor initiating gully formation [80].
Flat topography is highly correlated with susceptibility to GE. This agrees with the findings of [51]. High TWI is strongly correlated with gullies in the study area. Arabameri et al. [72] used an integrated model to predict GE in the Mahabia watershed and also stated that areas with high TWI also had high positive susceptibility. Drainage density is highly correlated with susceptibility to GE, confirming the findings of [19,81,82]. Proximity to rivers and roads was positively correlated to gullies, as was reported by [11,83,84]. Natural drainage patterns are often disrupted by poorly located or poorly constructed culverts placed during road construction. Subsequently, soil is easily eroded by concentrated runoff created by impervious surfaces [85].
NDVI analysis reveals that areas with more vegetation growth had fewer gullies; areas with less vegetation had higher frequency of gully formation. This finding corroborates the results of [5,11]. Vegetation greatly reduces runoff and limits erosion by increasing infiltration and by protecting soil with root growth [86,87]. Aridisols are highly susceptible to GE in this region, and this agrees with [88].
Land use underpins geomorphological and hydrological processes by affecting runoff generation, sediment dynamics, and overland flows [89]. LU/LC analysis showed that agricultural and bare landscapes, where soil is often disturbed, where surface water is often concentrated [90][91][92][93], and where the surface is often unprotected by vegetation [88], had the highest susceptibility to GE in our study region. Because GE depends on the lithology of materials at or just below the surface [5] the spatial patterns of sediment origins were evaluated. Quaternary lithotypes in the study area have the highest susceptibility to GE, which is coincident with the findings of [88]. Gullying is a natural phenomenon that depends on the thresholds of several conditioning factors (e.g., rainfall, topography, flow hydraulics, pedology and land use). It is more likely to occur at locations where thresholds have been exceeded [94].
Examination of the relative importance of GECFs shows that slope, TPI, and elevation were the most important in the study area and corroborates [11][12][13]23]. Zabihi et al. [13] tested three models (FR, WoE and IoE) to model GE in Iran and found that of 12 GECFs, elevation and LU/LC were the most important in their study area. Meliho et al. [12] used IV and FR for GESM in the Ourika watershed in Morocco, and they found that LU/LC and slope had the most influence on gully formation.
The GESMs were classified into five different gully-erosion susceptibility classes (very low to very high) using four classification methods: geometrical interval, quantile, equal interval, and natural breaks. Comparing the results of each classification method with the high and very high gully-erosion susceptibility classes, it is clear that the natural break method provides the most accurate classification scheme. This agrees with the findings of [3].
The validation results using cutoff-dependent (AUPRC and AUSRC) and cutoff-independent (accuracy and kappa) criteria shows that ML models outperform statistical and MCDM-based models [19,70,88,95]. ML models are advantageous because they do not require a strict set of assumptions as is the case with many statistical methods [22]. ML models also use algorithms to discern the relationships between GECFs and gullies and therefore do not rely on a structural model [19]. These models also benefit from iterative learning algorithms which help them learn and improve [19].
AUSRC shows that RF had the best performance among the 10 models. This confirms the findings of [88,96]. Arabameri et al. [88] used three data-mining models for gully-erosion assessment in the Shahroud watershed in northeastern Iran, and found that RF performed best. When there is considerable noise in data, this method is less sensitive to ANNs and can better assess factors compared with others [97,98]. The most important advantages of RF models are their capacities to learn nonlinear relationships, that they have high predictive accuracy, they are able to determine relative factor importance, they can deal with distorted data, and they have high categorization ability.
Based on the AUPRC, accuracy, and kappa results, ME was shown to have the best overall performance. This reflects similar findings in [98][99][100][101]. ME exceeds other ML models because it uses search-based optimization to determine the relative importance of factors [101]. Results of the ensemble RF-FR and RF-SI statistical models reveal that RF is one of the best classification algorithms to use to considerably improve the performance of single classifiers [102,103]. Moreover, RF can decrease the dependence of statistical models on the relationships among the conditioning factors [30].

Conclusions
GE is a bane of rural sectors in arid and semi-arid regions of the world. To combat the formation of gullies, it is often necessary to diagnose the scope of the problem and the causes endemic to the local environment. GESM is a key tool for sustainable management and use of soil and water resources. The foundation of GESM is the spatial predictive model. There is no consensus among scholars about the best modeling approach (statistical, ML, or MCDM) to generate an accurate spatial assessment of GE. There are constant technological advancements that make data available for incorporation and analysis in ways that are more efficient and economical. These conditions are very important in regions that are more remote or that have fewer financial resources. This study attempted to tackle this question by comparing ten individual and ensemble models-FR, SI, RF, ME, FDA, GLM, TOPSIS, SI-RF, FR-RF and GLM-FDA-from among the extant statistical, ML, and MCDM approaches to model gully-erosion susceptibility in the Dasjard River watershed, Iran. RS data and GIS techniques were used to compile and analyze 15 environmental, geological, geomorphological, and anthropogenic GECFs that were selected according to the MT. VIF and TOL indicate that there is no multicollinearity among them. The results show that the RF ML model performed the best according to the AUROC. In this model, slope, TPI, and elevation were the key factors generating gullying in the study area. The RF model combines several tree algorithms to iteratively predict a phenomenon. RF can learn complicated patterns and can consider nonlinear relationships between dependent and independent variables. Furthermore, it can integrate data of different types, requires no assumptions about the normality of the data used, and can incorporate thousands of variables without discarding any of them. Based on the AUSRC, AUPRC, accuracy, and kappa values, ME outperformed even RF. Validation tests indicate that RF and ME, which are both ML models, outperformed all statistical and MCDM models tested. Ensemble models, particularly the combinations of RF with FR and SI, improved the prediction accuracy and success achieved by the individual models on their own. The scientific achievement of this study is that, by combining ML models with a suitable set of GECFs, data describing extant gullies, RS data, and GIS, one can produce reliably accurate GESMs. These GESMs were achieved with a method that is easy to use and can provide valuable information for planners or managers to prevent or respond to gully-erosion problems. This methodology can be used to assess gully-erosion susceptibility in similar regions of the world, especially in arid and semi-arid environments.