Next Article in Journal
Object-Based Predictive Modeling (OBPM) for Archaeology: Finding Control Places in Mountainous Environments
Next Article in Special Issue
Remote Sensing and Geoscience Information Systems Applied to Groundwater Research
Previous Article in Journal
A New Method Based on a Multilayer Perceptron Network to Determine In-Orbit Satellite Attitude for Spacecrafts without Active ADCS Like UVSQ-SAT
Previous Article in Special Issue
Land Use/Land Cover Changes Impact on Groundwater Level and Quality in the Northern Part of the United Arab Emirates
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea

1
Department of Smart Regional Innovation, Kangwon National University, Gangwon-do, Chuncheon-si 24341, Korea
2
Geoscience Platform Research Division, Korea Institute of Geoscience and Mineral Resources (KIGAM), 124, Gwahak-ro Yuseong-gu, Daejeon 34132, Korea
3
Department of Geophysical Exploration, Korea University of Science and Technology, 217 Gajeong-ro Yuseong-gu, Daejeon 34113, Korea
4
Department of Science Education, Kangwon National University, Gangwon-do, Chuncheon-si 24341, Korea
5
Department of Geophysics, Kangwon National University, Gangwon-do, Chuncheon-si 24341, Korea
*
Author to whom correspondence should be addressed.
Remote Sens. 2021, 13(6), 1196; https://doi.org/10.3390/rs13061196
Submission received: 5 January 2021 / Revised: 22 February 2021 / Accepted: 17 March 2021 / Published: 21 March 2021

Abstract

:
The availability of groundwater is of concern. The demand for groundwater in Korea increased by more than 100% during the period 1994–2014. This problem will increase with population growth. Thus, a reliable groundwater analysis model for regional scale studies is needed. This study used the geographical information system (GIS) data and machine learning to map groundwater potential in Gangneung-si, South Korea. A spatial correlation performed using the frequency ratio was applied to determine the relationships between groundwater productivity (transmissivity data from 285 wells) and various factors. This study used four topography factors, four hydrological factors, and three geological factors, along with the normalized difference wetness index and land use and soil type. Support vector regression (SVR) and metaheuristic optimization algorithms—namely, grey wolf optimization (GWO), and particle swarm optimization (PSO), were used in the construction of the groundwater potential map. Model validation based on the area under the receiver operating curve (AUC) was used to determine model accuracy. The AUC values of groundwater potential maps made using the SVR, SVR_GWO, and SVR_PSO algorithms were 0.803, 0.878, and 0.814, respectively. Thus, the application of optimization algorithms increased model accuracy compared to the standard SVR algorithm. The findings of this study improve our understanding of groundwater potential in a given area and could be useful for policymakers aiming to manage water resources in the future.

Graphical Abstract

1. Introduction

Water resources are an essential aspect for living in this world, including surface water and groundwater, and are recycled through evaporation, precipitation and surface runoff. Recent climate change projections point to increased spatial and temporal heterogeneity in the water cycle, which would lead to water demand outstripping supply [1]. Over the next several decades, demand for water resources, including groundwater, is expected to increase significantly [2,3,4]. Groundwater is defined as water in a saturated area which fills in the pore spaces between mineral grains or cracks and fractured rocks in a rock mass [5]. The World Economic Forum has stated that water shortages will be a global problem in the future. Currently, approximately 20% of water consumed by humans is derived from groundwater, and this proportion is projected to increase over the next several decades [6,7]. Climatic conditions also have an important role in groundwater availability, affecting both spill patterns and runoff time [2]. The widespread use of groundwater in industry, agriculture and everyday life presents challenges related to its management. With the existence of climate anomalies and increasing population development, it will cause a shortage of water resources in various regions.
In Korea, groundwater demand increased by more than 100% from 1994 to 2014 [8]. The increasing demand for high-quality water, coupled with the anticipated pressure of global climate change, urgently requires quantitative methods for assessing groundwater production in aquifers. However, due to this costly and time-consuming method [5], a reliable approach for assessing aquifer productivity has not been well established. Therefore, the development of a reasonable groundwater potential model is essential for future system development, effective management, and sustainable use of groundwater resources [9]. Also, a reliable groundwater analysis model is needed to facilitate resource management and identify additional water sources worldwide.
In general, groundwater exploration programs are mainly based on hydrological tests, field surveys and geophysical methods [4,10]. However, these lines of action are time-consuming, costly, and demand experienced workers [11]. Meanwhile, field exploration based on hydrological or geophysical resistivity surveys cannot always represent factors that influence groundwater conditions and movements [12]. Therefore, it is necessary to develop groundwater analysis methods that can clarify the hydrological relationship with groundwater, such as the use of data specific capacity, transmissivity and yield [13].
A careful study of the literature shows that groundwater potential maps have different meanings to different authors. In general, groundwater potential is defined as optimal zones for groundwater development or how likely groundwater is to be present [14]. Groundwater potential mapping estimates the probability of groundwater occurrence in a given area. In general, this mapping involves statistical analysis of various types of field data. Remote sensing (RS) data collection processes enhance spatial coverage which increasing the type and availability of data [15,16,17,18]. Geographical information system (GIS) technology can be used to assess large areas in a more cost-efficient manner [12,19]. The development of GIS technology has increased in recent years as various spatial modeling techniques having been introduced to evaluate groundwater. Preliminary GIS studies using machine learning and statistical methods can be useful to analyze groundwater availability according to topographical and geographical factors, among others [7]. Thus, potential groundwater wells can be mapped to facilitate groundwater detection more efficiently.
Along with the increasing number and complexity of data in GIS, a reliable model was required to help solve those problems. Various models have been proposed to assess groundwater potential, including frequency ratio (FR) [20,21], weight of evidence (WoE) [22,23] and evidential belief function (EBF) [24,25] models, as well as machine learning models such as artificial neural network (ANN) [26], random forest (RF) [27,28], logistic regression [29,30], and support vector machine (SVM) [13,31] models. Some of these studies still have limitations in their predictions which are influenced by the accuracy of the data set and the internal structure of the model [32]. In addition, the computational process using large data sets and different ranges of validation and training values is a weakness of artificial neural networks [33]. In the context of groundwater mapping, several studies still use indirect indicators such as yield, resistance and spring location compared to hydraulic constants such as transmissivity and specific capacity.
In this study, the groundwater potential mapping was constructed and analyzed based machine learning approach using support vector regression (SVR) with training and datasets was derived from hydraulic dataset of transmissivity. SVR a variant of SVM models have also been developed and widely used for GIS mapping [34,35]. The principles of SVR and SVM are similar, although SVR has additional parameter settings [36]. Parameter values of the SVR can influence the learning process and reliability of model results. Therefore, determining operational parameters becomes a challenge in the process of getting the expected results.
In order to overcome the determining parameter challenges, meta-heuristic optimization algorithms were used in this research, including grey wolf optimization (GWO) [35,37] and particle swarm optimization (PSO) [38,39]. GWO is an optimization algorithm that has been widely used in the process of optimizing models in GIS applications [40]. It has been widely tailored for a wide variety of optimization problems due to its impressive characteristics over other swarm intelligence methods: it has very few parameters, and no derivation information is required in the initial search [41]. Likewise, PSO is considered reliable in the computational process for model optimization which provides a soft computational and quick convergence [39]. It is hoped that the process of using the optimization algorithm can improve the performance of the groundwater model. The meta-heuristic optimization algorithm can increase the accuracy of model predictions by tuning SVR machine learning parameters.
However, the use of SVR and meta-heuristic algorithms is still rarely used in groundwater mapping research. Therefore, in this study, the application of the hybrid algorithm for mapping groundwater potential was carried out. Combined with the use of hydraulic datasets derived from transmissivity as direct indicators to get better results. In this study had the advantage of comparing the accuracy and reliability of SVR and SVR optimization models for groundwater analysis based on the area under the curve (AUC). The AUC denotes the accuracy probability of groundwater occurrence. The FR describes the spatial relationships between dependent and independent variables in the context of groundwater potential; factors are ranked for ease of interpretation. The factors used in this study were obtained from RS data. The results could serve as a reference for policymakers aiming to manage water resources, and for future machine learning-based groundwater potential mapping.

2. Materials and Methods

2.1. Study Area

The study area was the Gangneung-si area of Gangwon-do Province, located on the east coast of the Korean Peninsula at 37°45′ N, 128°54′ E. The total coastline length is approximately 73.72 km. With a population of 213,199 people and a population density of 205/km2, Gangneung is one of the three largest cities in Gangwon-do Province [42]. A Sentinel-2 optical image of the study area is shown in Figure 1. Gangneung has warmer weather in summer and colder weather in winter than other areas. The months with the highest and lowest average temperatures are July (36.1 °C) and January (−1.8 °C), respectively. The mean annual precipitation in Gangneung is 1320.3 mm/year, with 753.4 mm occurring in summer [43].
The geologic distribution in Gangneung-si consists mainly of Jurassic granite, followed by Precambrian and Triassic age sedimentary rocks around the lower coast [44]. In addition, alluvial deposits are distributed along rivers and tributary streams. The aquifer of the study area is dominantly alluvium, with sedimentary rock in the lowlands [43].
In general, the groundwater in Gangneung-si comes from several sources, including rivers and dams. The available capacity is 1,650,000 m3/year. Groundwater in this area is mainly used for agriculture (71.6%), followed by domestic use (28.4%) [45]. The land use distribution of the study area is 80.4% forest, followed by agricultural land and urban areas. Due to the projected increase in groundwater demand in the area, groundwater potential assessment is needed.

2.2. Groundwater Datasets

Groundwater productivity data was calculated based on groundwater transmissivity (T) data obtained from 285 wells in Gangneung-si. T values above the median were included in the inventory dataset used for groundwater potential analysis. Groundwater pumping data were obtained from national- and local government-level groundwater surveys conducted by the Korea Water Resource Corporation (K-Water) [46].
T represents the flow rate under a unit hydraulic gradient through a unit width of a particular thickness’s aquifer. It is the product of the average hydraulic conductivity and the thickness of the aquifer formation, and can be calculated as follows:
K ( x , y ) = 1 b 0 b K ( x , y , z )   d z
T = K b
where T is transmissivity, K is hydraulic conductivity, and b is aquifer thickness. A decrease in the drawdown and a thicker aquifer produce higher T values. By combining Equation (2) and Darcy’s law, we can calculate the amount of water flowing in aquifer units [47].
In this study, the T data were applied to the FR and machine learning models, and used to examine various aspects of groundwater. Groundwater transmissivity data (T) obtained from well locations as mentioned above. In order to apply FR and machine learning models, groundwater productivity data were converted into binary form which is 0 and 1 [48]. The split criteria was the median of groundwater transmissivity, where the value above the transmissivity is designated as “1”, and the other data is expressed as “0” [12]. The groundwater productivity data were randomly extracted with their statistical attributes and divide into a data set with 285 points, where half of the well points meeting the criterion in each dataset for training and testing. The groundwater productivity data were randomly separated into training (70%) and testing (30%) datasets, of which proportions used are typical of machine learning studies [49,50]. T data from 199 and 86 wells were included in the training and testing datasets, respectively.

2.3. Selection of Groundwater-Related Factors

It is important to select the most relevant conditioning factors for groundwater potential mapping. Topographic, geologic, and hydrologic factors can affect the probability of groundwater occurrence, while factors such as land use, climate, and vegetation affect groundwater recharge capacity and demand [51]. Based on a literature review, 13 factors related to groundwater were selected and classified as topographical factors (slope, slope height, elevation, topographic wetness index [TWI]), hydrological factors (precipitation, LS-factor, and water density) [15,31,52]. Moreover, geological factors (lithology, distance to fault, lineament density), land use, soil type, and the normalized difference wetness index (NDWI) are considered in this study [53,54]. However, the relations among the factors with groundwater have not been verified either statistically or quantitatively. In this study, the 13 groundwater related-factors were reviewed with regard to transmissivity using FR. Afterward, the 13 factors were selected and applied to the SVR algorithm to generate the groundwater probability map. The 13 factors were derived from the RS images shown in Figure 2.
Thematic maps (elevation, slope, slope height, LS factor, and TWI) were derived from a digital elevation model (DEM) constructed from digital topographic maps (scale 1:5000) provided by the National Geography Information Institute (NGII) in 2015. We generated the thematic map by using the GIS application. The topographic maps were constructed based on ground control points from digital aerial photographs and ground surveys. Further calibration of the topographic maps was performed based on field surveys. All data were resampled to a pixel size of 30 × 30 m. Topography is important for runoff generation and retention, and water concentrations [55].
Elevation in the study area is denoted by contour lines and is associated with climate, soil and vegetation conditions [56]. In elevated regions, runoff conditions are higher and infiltration is low, even though precipitation is higher [16]. Bare areas have slower runoff, and infiltration leads to groundwater recharge. The elevation map used in this research is shown in Figure 2a.
The infiltration rate is often analyzed when mapping groundwater potential, and is influenced by slope conditions. On a steep slope, the water flow rate is high, which reduces the infiltration rate in the recharge zone [57,58]. Slope height is related to flow rate and can affect slope stability, which lead to the runoff rate. Gentle slope means slower runoff and therefore more time for infiltration. Conversely, steep slope means more erosion and shorter residence time. In groundwater potential mapping, this is usually related to the low probability of unconsolidated sediment accumulation and recharge [55]. GIS data were analyzed to determine slope variation in the study area and its effect on groundwater potential, as shown in Figure 2b,c.
The TWI, used widely in groundwater potential mapping studies, is shown in Figure 2d. The TWI provides information on the spatial distribution of hydrological variables such as infiltration potential or soil moisture. Also, the TWI can reflect the relation between water accumulation on any point area and the gravitational force that drives water down slope [59]. The higher index represents a lower slope and larger area, which provide a positive correlation between groundwater occurrence and TWI [5].
The relationship of LS with groundwater was determined to know the water behavior that leads to groundwater potential, as shown in Figure 2e. Precipitation data are crucial for mapping groundwater potential, and can help reveal the relationship between precipitation and groundwater recharge [60]. The precipitation map in this study was based on average precipitation data obtained from 103 weather stations in South Korea, for the period 2015–2020, by the Korea Meteorological Administration (KMA). We created a spatial map of precipitation—shown in Figure 2f—by using inverse distance weighting (IDW) interpolation.
Water density controlled by different litho-unit, structure and morphology of an area and helps to assess the characteristics of runoff and groundwater infiltration [61]. Water density is related to permeability and surface runoff, which affect groundwater potential. The high-water density means that the runoff can be emptied quickly and the possibility of infiltration is less. However, there are some exceptions because groundwater is expected to accumulate in alluvial sediments in flat areas of the watershed [62]. The water density map for this study is shown in Figure 2g. The NDWI evaluates wetness based on green and near-infrared bands. The use of the NDWI factor can describe the condition of soil moisture related to the humidity by the vegetation cover. This factor can reflect the evapotransport conditions, as well as the infiltration rate associated with the water table conditions. Even so, the annual average precipitation accumulation factor is used to complement the consideration of the potential presence of groundwater [53,63]. In this research, the NDWI of Gangneung-si (Figure 2h) was process from Landsat-8 OLI data in 2019.
Groundwater is closely associated with the landscape and land use; the landscape is affected by anthropogenic activities [64]. Land use affects groundwater resources by influencing recharge and water demand [51]. Land use is a significant factor affecting the groundwater recharge process, as it influences evapotranspiration, runoff and recharge of the groundwater system [65]. Kompsat-2 and 3 satellite images were used to reconstruct the 1:50,000 scale land use map in 2012 of the Ministry of Environment [66]. We divided the land use types into seven categories: urbanized area, agricultural area, forest area, grassland area, marsh area, bare ground area, and water area (Figure 2i). Soil type is often used in groundwater recharge and evaluation studies. This is mainly because soil permeability is directly related to effective porosity, particle shape and size, and porosity, which means that soil type plays an important role in infiltration [67,68]. For this reason, studies that consider soil-related variables are often carried out in conjunction with rainfall and/or recharge [69].The soil map used in this study (Figure 2j) was constructed on a 1:25,000 scale map issued by the Rural Development Administration (RDA), the data was constructed by field research by RDA in 2007.
Lithology is an important factor in the occurrence and distribution of groundwater, and provides information about the water stored in a particular area [64]. Lithology can affect groundwater recharge via its influence on water percolation [50]. The lithology map used in this study was created based on the 1:50,000 scale geological maps of the Korea Institute of Geoscience and Mineral Resources (KIGAM), which was provided in 2015. The lithology of the study area was divided into granite, gneiss, sedimentary rock, alluvium, and limestone categories, as shown in Figure 2k. The occurrence and movement of groundwater are controlled by porosity, permeability, aquifer layout, horizontal distribution, and recharge area. In the east part of the study area, both the Quaternary and Tertiary aquifers have major porosity, which is conducive to the formation of potential groundwater in the coastal aquifers [70].
Lineaments are generally described in studies analyzing fractures, and provide information on the linear properties of geological structures [71]. The distribution of lineaments is related to the location of groundwater, due to its association with the presence of faults, fractures, and joints, all of which impact porosity and permeability [72]. The relationship between lineament density and groundwater productivity can be derived for a given area; the lineament map of our study area is shown in Figure 2l. A fault is a geological structure that should be considered when mapping groundwater potential. Faults influence groundwater potential in a given area. The distribution of faults is related to groundwater storage potential, which in turn affects the groundwater recharge rate [31]. The fault map of this study is displayed in Figure 2m.

2.4. Methodology

To map groundwater potential using a machine learning algorithm, several steps must be performed, as described below:
First, the spatial correlations between the presence of groundwater and related factors (topographical, geological, and hydrological factors) are calculated using the FR. In this research, we analyzed the spatial correlation between groundwater well locations (T points) and 13 factors related to groundwater productivity, based on the FR of each factor. We obtained the spatial correlations between groundwater and related factors by calculating the ratio between the groundwater productivity area and the whole study area. The FR for each factor was calculated by the following Equation (3) [73].
F R = %   o f   c l a s s   o f   r e l a t e d   f a c t o r %   o f   t o t a l   a r e a
When the FR is >1, the spatial correlation of a particular class of a given factor with groundwater is stronger, and vice versa for lower FR values [74]. FR results can be used as a reference for groundwater potential mapping [75].
To apply the machine learning algorithm, T data are converted to binary form, then partitioned randomly to training data (70%) and testing data (30%). The groundwater related factor maps were produced at 30 m resolution. First, the value of T is determined, and included as an independent variable in the training dataset. Then, all data are classified as categorical or continuous. Continuous variables include the TWI, LS factor, water density, lineament density, slope, distance to fault, and elevation. Categorical variables include lithology, soil type, and land use.
In this study, a metaheuristic optimization algorithm (GWO and PSO) was used to determine the operational parameters of the SVR based on a prepared dataset (factors). Then, the SVR machine learning algorithm was used to create a groundwater probability model in Gangneung. In order to validate the mapping results, 285 T data points were randomly divided into training and testing datasets, as discussed above. Model validation was carried out through ROC curve analysis of the testing dataset (30%). Receiver operating characteristic (ROC) curve analysis, as an index of model performance, is commonly used to assess predictive accuracy [76]. To quantitatively determine the accuracy of the model verification, the area under the curve (AUC) of the ROC curve is calculated for the total area and correct predictive accuracy is obtained. AUC values ranges between 0.5 and 1; higher values indicate more reliable algorithm performance. The workflow of the groundwater potential mapping carried out in this study is provided in Figure 3.

2.5. Support Vector Regression

SVR is based on the SVM algorithm rule, as stated previously [77]. SVR is one of the most widely used supervised classification methods uses for regression and classification problems because of its ability to universally approximate the multivariate task at any degree of accuracy [78]. In regression analysis, the correlation, or nonlinear mapping characteristic f ( x ) , of the input and output of the learner is acquired. The SVR seeks to generate a “nonlinear mapping characteristic” to map the training data { x i , y i , ; i = 1 , , n } to an excessively high dimensional characteristic space. The nonlinear mapping of the input and output of the learner can be defined in Equation (4) [79]:
F f ( x ) = w T φ ( x ) + b ,
where w and b are the coefficients to be adjusted. The empirical risk can be defined as Equation (5):
R e m p ( f ) = 1 n i = n n Θ ε ( y i , f ( x ) ) ,
where Θ ε is the insensitive loss function, which can be calculated using Equation (6):
Θ ε ( y i , f ( x ) ) = { | f ( x ) y | ε ,         i f   Θ ε ( y i , f ( x ) )     ε 0 ,         O . W
This equation is used to acquire the best hyperplane for dividing the training data into subsets with the optimal separation distance. SVR is an optimizing problem with the following goal function:
M i n w , b , ξ * , ξ R ε ( w , ξ * , ξ ) = 1 2 w T w + C i = n n ( ξ i + ξ i n ) ,
where C is the trade-off between the first and second equation terms. In this equation the large weights can be adjusted by maximizing the distance between data points. The loss feature (ε) is used to divide the training errors between f(x) and y. The constraints of this optimization problem are shown in Equation (8):
y i w T φ ( x i ) b ξ i * ε , i = 1 , n . y i w T φ ( x i ) b ξ i * ε , i = 1 , n . ξ i , ξ i n 0 , i = 1 , n .
As noted previously, the SVR parameters affect the accuracy of model predictions. Hence, it is essential to select suitable parameters. In this study, the metaheuristic optimizing algorithm was used to determine the SVR parameters.

2.6. Grey Wolf Optimization

GWO is a metaheuristic algorithm that imitates the hunting behavior and social hierarchy of grey wolves [80]. Grey wolves live in groups with a social dominance hierarchy. They engage in various group activities, including hunting prey [81]. Grey wolves are classified according to social status as (from high to low) alpha, beta, delta, or omega. Alpha wolves are tasked with making decisions; beta wolves assist and advise them [82]. Delta wolves obey the leaders, as do omega wolves. In the GWO technique, the alpha, beta, and delta wolves guide the other wolves to the best area for hunting. GWO has several steps based on the hunting behavior of the grey wolf, namely searching for, tracking, chasing approaching and attacking prey. The location of other wolves during exploration (hunting) can be updated according to the location of the leader wolves ( X α ,   X β , X δ ) using Equation (9) [40]:
X 1 ( t ) = X α ( t ) A 1 . | C 1 . X α ( t ) X ( t ) | X 2 ( t ) = X β ( t ) A 2 . | C 2 . X β ( t ) X ( t ) | X 3 ( t ) = X δ ( t ) A 3 . | C 3 . X δ ( t ) X ( t ) | X ( t + 1 ) = { X 1 ( t ) + X 2 ( t ) + X 3 ( t ) } 3
where X represent the position vector of grey wolf and t shows the number of iterations. A and C are coefficient vectors given in Equation (10):
A = 2 a . r 1 a C = 2 r 2
where a is linearly decreased from 2 to 0 during iteration. Meanwhile, r 1 and r 2 indicate random vectors in the range between 0 and 1.

2.7. Particle Swarm Optimization

PSO is an optimization technique that imitates the collective behavior exhibited by a flock of birds, a school of fish, or a swarm of insects. This method is similar to the GA, which uses population fitness data to find an optimal solution to a given problem [83]. PSO is advantageous for optimizing nonlinear problems, showing fast convergence and requiring few computations. These capabilities separate PSO from other evolutionary algorithms, such as the GA. In PSO, each bird in a flock is considered as a particle, which are searched for in n-dimensional space to find the optimal solution (where n is the number of problem parameters) [39]. Particles are scattered randomly within the search space. After every iteration, according to the equation 11 and equation 12 every particle adjusts its location by finding the most specific location that it has ever occupied and also the best one adjacent to its neighbor.
To simulate the behavior of a flock of birds, the location, and rate of change therein, of the i-th iteration are given by x i t = ( x i 1 t , x i 2 t , , x i n t ) and v i t = ( v i 1 t , v i 2 t , , v i n t ) , respectively. During model training, the location, and rate of change therein, of the i-th iteration are updated using the following Equations (11) and (12) [84]:
v i t + 1 = W v i t + C i r 1 ( P i n t x i n t ) + C 2 r 2 ( P g n t x i n t )
x i n t + 1 = x i t + v i t + 1
where W is the inertia weight, Ci and C2 are the personal and social learning factors, respectively, r1 and r2 have random values from 0 to 1, and P i n t and P g n t are the best locations for particle i and the swarm, respectively, at iteration n. The algorithm continues until the best location for each particle is equal to the best position for all particles. All particles are focused on one point in space, and the solution to the problem is thus optimized [38].

3. Results

3.1. Relationships between Groundwater and Related Factors

FR values can provide information on the relationship between groundwater potential and related (topography, hydrology, geology, and land cover) factors. The FR is calculated for each class of each factor based on the T value. The FR values calculated in this study are shown in Table 1.
According to the FR value of 5.34, there was abundant groundwater in areas with low elevation (0–100 m) and its show a strong spatial correlation with groundwater. Slope showed an inverse relationship with groundwater availability. Groundwater probability was highest for the (low) slope class of 0–3.96 (FR = 4.79), where it is more difficult for groundwater to accumulate on steep slopes due to the water flow and velocity conditions [55]. Slope height and LS-factor also showed an inversely proportional relationship with groundwater potential. The FR values were highest for the slope height class of 12.74–14.86 (FR = 3.04) and LS-factor class of 0–5.8 (FR = 4.79).
The presence of groundwater was more likely with a higher TWI. Water-retaining ability and water density are promoted by a high TWI (>13.93). The highest FR value for water density factor was seen in the >11.49 class (FR = 2.79). This spatial correlation of water density is an example of one exception given the geological and lowland characteristics of this area. The highest FR value for NDWI was in the 0.57 to −0.04 class (3.75). Taken together, the results show that the probability of groundwater is higher in areas with bodies of water.
The relationships between the presence of groundwater and geological factors were also analyzed. Regarding the distance from a fault factor, the highest FR value occurred in the class of >12,624 m (FR = 1.86). Lineament density showed an inversely proportional relationship with the presence of groundwater, and the highest FR value occurred in the class of 0.1–0.77 (FR = 2.43). An area with lower lineament density has better recharge potential and is more likely to hold groundwater [71]. And for FR calculation of lithology factor shown the high spatial correlation with Alluvium (FR = 4.802). Alluvium as the dominant aquifer structure in this area has a relation with groundwater presence.
Regarding soil types, the FR value was highest for immature paddy, at 13.1. Among the land use types, the FR value was highest for urban areas, at 5.98. The urban area has a relation with the groundwater condition which indicates by groundwater consumption and the rate of groundwater recharge by anthropogenic activity.

3.2. Construction of Groundwater Potential Maps

Groundwater potential mapping of the Gangneung-si area was performed by applying a machine learning algorithm, as discussed above. A combination of 13 groundwater-related factors served as the dependent variables, and can be mainly classified as topography, hydrology, geology, and land use factors. Optimization algorithms (GWO and PSO) were applied to the SVR machine learning method to produce the groundwater potential maps. In the maps, blue color indicates a very high probability of groundwater, and brown color indicates a very low probability. The mapping result of each algorithm can be seen in Figure 4.
In general, high probability class of groundwater; blue areas were similar among the three algorithms (SVR, SVR_GWO and SVR_PSO). These high probability class are located in the eastern part which is associated with areas composed of alluvium and low elevation. Based on the existing spatial thematic data, accumulated water on flowing to lower areas can increase the infiltration rate in the lowlands. Lowland conditions which are dominated by alluvium and sedimentary rocks that have pores allow the infiltration process that leads to groundwater recharging. However, the infiltration rate is highly dependent on the type of land cover, soil properties and saturation level in absorption. On the other hand, the distribution of moderate class of SVR_GWO and SVR_PSO was found in coastal area. This finding could be related to coastal aquifer conditions which could be influenced by other factors such as sea water intrusion. Besides, low probability (brown) areas are in the western part of the study area, which is characterized by rock and forest cover. Also, the characterized of western part has a weak spatial relationship with groundwater presence than its affect to the mapping result.
A validation step was conducted to assess the reliability of the groundwater potential map from each algorithm. The accuracy of the groundwater potential maps generated using the three algorithms was then evaluated based on ROC curve analysis of the testing dataset (30% of all data). The AUC values were 0.803, 0.878, and 0.814 for SVR, SVR_GWO and SVR_PSO, respectively (Figure 5). The results imply that the algorithms are useful for distinguishing high groundwater potential areas.
Groundwater mapping accuracy was increased by applying the optimization algorithm to the SVR machine learning method. SVR_GWO and SVR_PSO increased mapping accuracy by 8.81% and 1.64%, respectively, and model performance was generally good based on the AUC values of >0.8 [46]. The increased accuracy was achieved by tuning the parameter of SVR algorithm based on optimization results calculation.

3.3. Sensitivity Analysis

Sensitivity analysis was conducted to assess the relative influence of each factor on groundwater potential, and to validate the above-described results. The 13 groundwater-related factors were removed one by one from the dataset, for all three algorithms. The resulting mapping accuracy results allowed us to determine each factor’s influence, as shown in Table 2. The SVR model showed accuracy increases of 0.4% and 0.3% on removal of the precipitation and distance to fault factors, respectively. Mapping accuracy decreased by 5.1%, 2.6% and 2.3% on inclusion of the land-use, soil type, and lithology factors, respectively. Thus, these factors had a considerable influence on the groundwater probability mapping performance of the SVR model.
For the SVR_GWO model, when the soil type, NDWI, and elevation factors were removed, mapping accuracy decreased by 1.3%, 1.3%, and 1.2%, respectively. On the other hand, removal of the TWI, distance to fault, and water density factors had no effect on accuracy, while removal of the precipitation factor increased accuracy by 0.3%. Besides, for the SVR_PSO model, the accuracy decreased when the NDWI and lithology factors were removed, by 1.7% and 1.4%, respectively. These relationships can be explained that NDWI represents the index of wetness in certain area and lithology describe the characteristic of the geological structure which is important in groundwater presence. Overall, the results were acceptable and indicate that the algorithms can be usefully applied to this research area.
Apart from the influence of these factors, sensitivity analysis can evaluate the consistency of the map. This study shows that there are factors that have a dominant contribute on the results of mapping accuracy, namely, land use and lithology conditions of the area. Land use can affect the rate recharge of groundwater, which influences evapotranspiration, runoff and recharge system [54]. Lithology conditions can indicate the potential for storage and aquifer conditions in the area leading to water sources. Also, lithology can influence the infiltration and percolation of waterflow [69]. However, further hydrogeological analysis can be used as a comparison in the context of understanding the factors that influence groundwater recharge conditions.

4. Discussion

Groundwater is essential for human life and livelihoods. Groundwater mapping can be improved, and rendered more cost-effective, by identifying geophysical and hydrological factors associated with subsurface storage. This study developed a machine learning approach for analyzing RS and GIS data to map groundwater potential; combining statistical models and machine learning algorithms (FR and SVR) can facilitate scientific decision-making as it pertains to a variety of problems [85]. FR values are an efficient way to simplify equations and aid interpretation of results [86]. Integration of RS and GIS data can improve time and cost efficiency in the context of groundwater probability mapping [17,57]. Machine learning algorithms can be used to model natural phenomena involving factors with nonlinear relationships, and so were applied to map groundwater potential in this study. These models can handle and analyze the complex nonlinear relationships between groundwater potential and various factors.
The groundwater potential mapping results that we obtained using the SVR machine learning algorithm showed that the study area could be classified based on groundwater probability. Areas with high groundwater potential were associated with alluvium, which was distributed around rivers and tributaries that flow into the sea. Alluvium is a type of aquifer with sufficient porosity and permeability to allow water to accumulate [58,87]. In addition, land use also influences groundwater potential; the high probability areas were detected near agricultural and urban [51], which are also the land use types that use the most groundwater. Groundwater resource availability affected by human activities such as urbanization and changing land cover, it will affect changing recharge rates. Besides, precipitation can affect the potential for groundwater, which is related to groundwater recharging conditions. Furthermore, high precipitation does not always increase groundwater recharge. The higher the precipitation can increase surface water overflow and hinder the process of groundwater recharge [70]. In addition, the infiltration rate which is indirectly influenced by soil conditions and the saturation level of water absorption are other factors that are considered. Meanwhile, for the forest land use areas, it is possible that many wells may contain groundwater, considering that 70% of the study area is forest area. The influence of the groundwater related factor was examined by the sensitivity analysis, in which mapping accuracy decreased by 5.1%, 2.3%, and 2.6% when land use, lithology, and soil type, respectively, were not included in the SVR process. Thus, three related factors commonly used in groundwater study as important role in term of infiltration rate which lead to groundwater presence. On the other hand, the potential for high groundwater potential found on the coastal area requires further observation regarding the effect of sea water intrusion on the presence of groundwater. As we know, sea water can fill pores in the soil structure in coastal areas and this has the potential to occur in Gangneung and affect groundwater conditions [88]. Several studies also argue for the potential for sea water intrusion in coastal areas [89,90]. For that, it is necessary to field investigation regarding groundwater conditions, especially around the coast [70]. As well as evaluation related to other spatial factors that can affect groundwater for further research. However, this is a preliminary study that can provide an overview of the potential for groundwater in Gangneung.
ROC curve analysis was performed to assess the performance of the three algorithms. The AUC values of the SVR, SVR_GWO and SVR_PSO models were 0.803, 0.878 and 0.814, respectively. Thus, SVR_GWO was reach the highest AUC value than another machine algorithm in this research. The accuracy values of all algorithms were good, so these models should be useful for this area of research. Application of the GWO and PSO optimization algorithms increased the SVR model accuracy by 8.6% and 1.6%, respectively. The optimization process tuned the gamma, epsilon, and C parameters in the SVR algorithm [36]. The optimization aimed to minimize the root mean square error (RMSE), which indicates the discrepancy between observation and predictions; lower RMSE values indicate higher model quality [80]. The SVR algorithm was used for groundwater potential mapping because it has several advantages: a good solution is not required at the start of the iterative process, it can be used in conjunction with other optimization methods, and multi-input, nonlinear optimization problems can be solved efficiently [40]. The increased accuracy of SVR achieved by applying the optimization algorithms is consistent with the results of Al-Fugara et al. (2019), who reported that spatial mapping of groundwater was improved by using “SVR-RBF-GA” and “SVR-RBF-GS” methods [34]. Furthermore, Panahi et al. (2020), reported better landslide mapping performance using a hybrid SVR method compared with the adaptive neuro-fuzzy inference system (ANFIS) [40]. However, there are also some disadvantages of SVR; for example, the absence of data mining features means that data are classified immediately, although labeled data can be extracted, which increases classification accuracy [35].
This study focuses on the application of machine learning methods and integration with remote sensing for groundwater potential mapping. However, there are still limitations in this study, especially related to the explanation of the physical water balance and groundwater refill mechanism. Furthermore, hydrological and phreatic surface analysis is still needed to carry out as a crosscheck-validation of the groundwater potential map [91]. This is necessary to obtain an overview of the groundwater recharge and can provide further information to improve understanding of physical water balance. In terms of methodology, the use of an optimization algorithm promises alternative approach in the future in groundwater mapping. In particular, there are still potential advantages in a combination of expert experience and deep learning applications that have data mining features [14,92]. Therefore, this result is a preliminary study initiated in groundwater mapping by combining a hybrid algorithm with groundwater capacity data. Further studies can be initiated by constructing a phreatic surface of aquifer to get a more detailed picture of the relationship between factors and groundwater potential.

5. Summary and Conclusions

A groundwater potential map was constructed for the Gangneung-si area using the machine learning method, based on 13 groundwater-related factors (mainly categorized as topography, hydrology, geology, and land use factors). FR values were used to assess the correlations of the factors with groundwater occurrence. Groundwater data from 285 wells in Gangneung-si were analyzed; these data were divided randomly into training (70%) and testing (30%) datasets. The GWO and PSO optimization algorithms were applied to the SVR. ROC curve analysis was used to assess the accuracy of each model. A sensitivity test was also performed for linkage the related factors with groundwater probability. The land use and lithology factors were the major factors used to construct the groundwater potential maps in this research.
In summary, three new models were developed for reasonable groundwater potential mapping: SVR, SVR_GWO, and SVR_PSO. The AUC values were 0.803, 0.878, and 0.814, respectively. Thus, the SVR_GWO model was more efficient for evaluating groundwater potential than the SVR and SVR_PSO models. Furthermore, the variance in accuracy of each factor could be small because the selected variable is important in the analysis. Nevertheless, this approach using a hybrid algorithm could be useful for groundwater mapping and exploration development. Due to some uncertainties associated with the methods, sample size, and raster spatial resolution, further studies aimed at developing accurate and reliable regional-scale groundwater potential maps are needed. Other related factor analysis methods and further hydrology analysis could be used to improve groundwater potential mapping results.

Author Contributions

Conceptualization, C.-W.L., S.L., and M.F.F.; methodology, C.-W.L., S.L., and M.F.F.; software, S.L., and M.F.F.; validation, C.-W.L. and M.F.F.; formal analysis, M.F.F.; investigation, C.-W.L.; resources, C.-W.L., and S.L.; data curation, C.-W.L., and M.F.F.; writing—original draft preparation, M.F.F.; writing—review and editing, C.-W.L., Y.-C.P., S.L., and M.F.F.; visualization, C.-W.L., Y.-C.P., and M.F.F.; supervision, C.-W.L., and S.L.; project administration, C.-W.L.; funding acquisition, Y.-C.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant from the National Research Foundation of Korea provided by the government of Korea (No. 2019R1A2C1085686) and the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (No. 2019R1A6A1A03033167). This research also was supported by the Basic Research Project of the Korea Institute of Geoscience and Mineral Resources (KIGAM).

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The dataset related to groundwater-related factor analysis can be found on: https://www.bigdata-environment.kr/user/main.do.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study, in the collection, analysis, or interpretation of the data, in the writing of the manuscript, or in the decision to publish the results.

References

  1. IPCC. Climate Change 2013: The Physical Science Basis; Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change; Cambridge University Press (CUP): Cambridge, UK, 2013. [Google Scholar]
  2. WWDR. The United Nations World Water Development Report, 2017: Wastewater: The Untapped Resource; UNESCO: Paris, France, 2017; ISBN 978-92-3-100201-4. [Google Scholar]
  3. Zandi, J.; Ghazvinei, P.T.; Hashim, R.; Yusof, K.B.W.; Ariffin, J.; Motamedi, S. Mapping of regional potential groundwater springs using Logistic Regression statistical method. Water Resour. 2016, 43, 48–57. [Google Scholar] [CrossRef]
  4. Mohamaden, M.I.I.; Ehab, D. Application of electrical resistivity for groundwater exploration in Wadi Rahaba, Shalateen, Egypt. NRIAG J. Astron. Geophys. 2017, 6, 201–209. [Google Scholar] [CrossRef]
  5. Nampak, H.; Pradhan, B.; Manap, M.A. Application of GIS based data driven evidential belief function model to predict groundwater potential zonation. J. Hydrol. 2014, 513, 283–300. [Google Scholar] [CrossRef]
  6. Biswas, S.; Mukhopadhyay, B.P.; Bera, A. Delineating groundwater potential zones of agriculture dominated landscapes using GIS based AHP techniques: A case study from Uttar Dinajpur district, West Bengal. Environ. Earth Sci. 2020, 79, 302. [Google Scholar] [CrossRef]
  7. Kim, J.C.; Jung, H.S.; Lee, S. Groundwater productivity potential mapping using frequency ratio and evidential belief function and artificial neural network models: Focus on topographic factors. J. Hydroinformatics 2018, 20, 1436–1451. [Google Scholar] [CrossRef]
  8. Ministry of Land, Transport and Maritime Affairs. National Groundwater Monitoring Network in Korea Annual Report 2016; MLTM: Seoul, Korea, 2016.
  9. Manap, M.A.; Nampak, H.; Pradhan, B.; Lee, S.; Sulaiman, W.N.A.; Ramli, M.F. Application of probabilistic-based frequency ratio model in groundwater potential mapping using remote sensing data and GIS. Arab. J. Geosci. 2014, 7, 711–724. [Google Scholar] [CrossRef]
  10. Muchingami, I.; Chuma, C.; Gumbo, M.; Hlatywayo, D.; Mashingaidze, R. Review: Approaches to groundwater exploration and resource evaluation in the crystalline basement aquifers of Zimbabwe. Hydrogeol. J. 2019, 27, 915–928. [Google Scholar] [CrossRef]
  11. Yin, H.; Shi, Y.; Niu, H.; Xie, D.; Wei, J.; Lefticariu, L.; Xu, S. A GIS-based model of potential groundwater yield zonation for a sandstone aquifer in the Juye Coalfield, Shangdong, China. J. Hydrol. 2018, 557, 434–447. [Google Scholar] [CrossRef]
  12. Oh, H.J.; Kim, Y.S.; Choi, J.K.; Park, E.; Lee, S. GIS mapping of regional probabilistic groundwater potential in the area of Pohang City, Korea. J. Hydrol. 2011, 399, 158–172. [Google Scholar] [CrossRef]
  13. Lee, S.; Hong, S.M.; Jung, H.S. GIS-based groundwater potential mapping using artificial neural network and support vector machine models: The case of Boryeong city in Korea. Geocarto Int. 2018, 33, 847–861. [Google Scholar] [CrossRef]
  14. Díaz-Alcaide, S.; Martínez-Santos, P. Review: Advances in groundwater potential mapping. Hydrogeol. J. 2019, 27, 2307–2324. [Google Scholar] [CrossRef]
  15. Altafi Dadgar, M.; Zeaieanfirouzabadi, P.; Dashti, M.; Porhemmat, R. Extracting of prospective groundwater potential zones using remote sensing data, GIS, and a probabilistic approach in Bojnourd basin, NE of Iran. Arab. J. Geosci. 2017, 10, 1–11. [Google Scholar] [CrossRef]
  16. Duan, H.; Deng, Z.; Deng, F.; Wang, D. Assessment of groundwater potential based on multicriteria decision making model and decision tree algorithms. Math. Probl. Eng. 2016, 2016. [Google Scholar] [CrossRef] [Green Version]
  17. Adeyeye, O.A.; Ikpokonte, E.A.; Arabi, S.A. GIS-based groundwater potential mapping within Dengi area, North Central Nigeria. Egypt. J. Remote Sens. Sp. Sci. 2019, 22, 175–181. [Google Scholar] [CrossRef]
  18. Achmad, A.R.; Lee, S.; Park, S.; Eom, J.; Lee, C.W. Estimating the potential risk of the Mt. Baekdu Volcano using a synthetic interferogram and the LAHARZ inundation zone. Geosci. J. 2020, 24, 755–768. [Google Scholar] [CrossRef]
  19. Hakim, W.L.; Lee, C.-W. A Review on Remote Sensing and GIS Applications to Monitor Natural Disasters in Indonesia. Korean J. Remote Sens. 2020, 36, 1303–1322. [Google Scholar]
  20. Elmahdy, S.I.; Mohamed, M.M. Probabilistic frequency ratio model for groundwater potential mapping in Al Jaww plain, UAE. Arab. J. Geosci. 2015, 8, 2405–2416. [Google Scholar] [CrossRef]
  21. Jothibasu, A.; Anbazhagan, S. Spatial mapping of groundwater potential in Ponnaiyar River basin using probabilistic-based frequency ratio model. Model. Earth Syst. Environ. 2017, 3, 1–12. [Google Scholar] [CrossRef]
  22. Falah, F.; Zeinivand, H. GIS-Based Groundwater Potential Mapping in Khorramabad in Lorestan, Iran, using Frequency Ratio (FR) and Weights of Evidence (WoE) Models. Water Resour. 2019, 46, 679–692. [Google Scholar] [CrossRef]
  23. Chen, W.; Li, H.; Hou, E.; Wang, S.; Wang, G.; Panahi, M.; Li, T.; Peng, T.; Guo, C.; Niu, C.; et al. GIS-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. Sci. Total Environ. 2018, 634, 853–867. [Google Scholar] [CrossRef] [Green Version]
  24. Tahmassebipoor, N.; Rahmati, O.; Noormohamadi, F.; Lee, S. Spatial analysis of groundwater potential using weights-of-evidence and evidential belief function models and remote sensing. Arab. J. Geosci. 2016, 9, 1–18. [Google Scholar] [CrossRef]
  25. Khoshtinat, S.; Aminnejad, B.; Hassanzadeh, Y.; Ahmadi, H. Groundwater potential assessment of the Sero plain using bivariate models of the frequency ratio, Shannon entropy and evidential belief function. J. Earth Syst. Sci. 2019, 128, 1–16. [Google Scholar] [CrossRef] [Green Version]
  26. Sokeng, V.-C.J.; Kouamé, F.K.; Ngounou Ngatcha, B.; Dibi N’da, H.; Akpa You, L.; Rirabe, D. Delineating Groundwater Potential Zones in Western Cameroon Highlands Using GIS Based Artificial Neural Networks Model and Remote Sensing Data. Int. J. Innov. Appl. Stud. 2016, 15, 747–749. [Google Scholar]
  27. Rahmati, O.; Pourghasemi, H.R.; Melesse, A.M. Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran. Catena 2016, 137, 360–372. [Google Scholar] [CrossRef]
  28. Naghibi, S.A.; Ahmadi, K.; Daneshi, A. Application of Support Vector Machine, Random Forest, and Genetic Algorithm Optimized Random Forest Models in Groundwater Potential Mapping. Water Resour. Manag. 2017, 31, 2761–2775. [Google Scholar] [CrossRef]
  29. Park, S.; Hamm, S.-Y.; Jeon, H.-T.; Kim, J. Evaluation of Logistic Regression and Multivariate Adaptive Regression Spline Models for Groundwater Potential Mapping Using R and GIS. Sustainability 2017, 9, 1157. [Google Scholar] [CrossRef] [Green Version]
  30. Rizeei, H.M.; Pradhan, B.; Saharkhiz, M.A.; Lee, S. Groundwater aquifer potential modeling using an ensemble multi-adoptive boosting logistic regression technique. J. Hydrol. 2019, 579, 124172. [Google Scholar] [CrossRef]
  31. Bui, D.T.; Shirzadi, A.; Chapi, K.; Shahabi, H.; Pradhan, B.; Pham, B.T.; Singh, V.P.; Chen, W.; Khosravi, K.; Ahmad, B.B.; et al. A hybrid computational intelligence approach to groundwater spring potential mapping. Water 2019, 11, 2013. [Google Scholar] [CrossRef] [Green Version]
  32. Chen, W.; Tsangaratos, P.; Ilia, I.; Duan, Z.; Chen, X. Groundwater spring potential mapping using population-based evolutionary algorithms and data mining methods. Sci. Total Environ. 2019, 684, 31–49. [Google Scholar] [CrossRef]
  33. Toth, E.; Brath, A.; Montanari, A. Comparison of short-term rainfall prediction models for real-time flood forecasting. J. Hydrol. 2000, 239, 132–147. [Google Scholar] [CrossRef]
  34. Al-Fugara, A.; Ahmadlou, M.; Shatnawi, R.; AlAyyash, S.; Al-Adamat, R.; Al-Shabeeb, A.A.R.; Soni, S. Novel hybrid models combining meta-heuristic algorithms with support vector regression (SVR) for groundwater potential mapping. Geocarto Int. 2020. [Google Scholar] [CrossRef]
  35. Panahi, M.; Sadhasivam, N.; Pourghasemi, H.R.; Rezaie, F.; Lee, S. Spatial prediction of groundwater potential mapping based on convolutional neural network (CNN) and support vector regression (SVR). J. Hydrol. 2020, 588, 125033. [Google Scholar] [CrossRef]
  36. Al-Fugara, A.; Ahmadlou, M.; Al-Shabeeb, A.R.; AlAyyash, S.; Al-Amoush, H.; Al-Adamat, R. Spatial mapping of groundwater springs potentiality using grid search-based and genetic algorithm-based support vector regression. Geocarto Int. 2020, 1–20. [Google Scholar] [CrossRef]
  37. Tikhamarine, Y.; Souag-Gamane, D.; Najah Ahmed, A.; Kisi, O.; El-Shafie, A. Improving artificial intelligence models accuracy for monthly streamflow forecasting using grey Wolf optimization (GWO) algorithm. J. Hydrol. 2020, 582, 124435. [Google Scholar] [CrossRef]
  38. Jaafari, A.; Zenner, E.K.; Panahi, M.; Shahabi, H. Hybrid artificial intelligence models based on a neuro-fuzzy system and metaheuristic optimization algorithms for spatial prediction of wildfire probability. Agric. For. Meteorol. 2019, 266–267, 198–207. [Google Scholar] [CrossRef]
  39. Chen, W.; Panahi, M.; Tsangaratos, P.; Shahabi, H.; Ilia, I.; Panahi, S.; Li, S.; Jaafari, A.; Ahmad, B. Bin Applying population-based evolutionary algorithms and a neuro-fuzzy system for modeling landslide susceptibility. Catena 2019, 172, 212–231. [Google Scholar] [CrossRef]
  40. Panahi, M.; Gayen, A.; Pourghasemi, H.R.; Rezaie, F.; Lee, S. Spatial prediction of landslide susceptibility using hybrid support vector regression (SVR) and the adaptive neuro-fuzzy inference system (ANFIS) with various metaheuristic algorithms. Sci. Total Environ. 2020, 741, 139937. [Google Scholar] [CrossRef]
  41. Faris, H.; Aljarah, I.; Al-Betar, M.A.; Mirjalili, S. Grey wolf optimizer: A review of recent variants and applications. Neural Comput. Appl. 2018, 30, 413–435. [Google Scholar] [CrossRef]
  42. Statistics Korea. Complete Enumeration Results of the 2010 Population and Housing Census; Statistics Korea: Daejon, Korea, 2011. [Google Scholar]
  43. Ali, A.; Kim, K.Y. Seismic site conditions in Gangneung, Korea, based on Rayleigh-wave dispersion curves and topographic data. Geosci. J. 2016, 20, 781–791. [Google Scholar] [CrossRef]
  44. Kim, M.G.; Lee, Y. Il The stratigraphy and correlation of the upper Paleozoic Pyeongan Supergroup of southern Korean Peninsula—A review. J. Geol. Soc. Korea 2017, 53, 321–338. [Google Scholar] [CrossRef]
  45. Ministry for Food, Agriculture, Forestry and Fisheries. Rural Groundwater Survey Report (Gangneung-si); MFAFF: Seoul, Korea, 2010.
  46. Lee, S.; Hyun, Y.; Lee, S.; Lee, M.-J. Groundwater Potential Mapping Using Remote Sensing and GIS-Based Machine Learning Techniques. Remote Sens. 2020, 12, 1200. [Google Scholar] [CrossRef] [Green Version]
  47. Razack, M.; Huntley, D. Assessing Transmissivity from Specific Capacity in a Large and Heterogeneous Alluvial Aquifer. Ground Water 1991, 29, 856–861. [Google Scholar] [CrossRef]
  48. Arabameri, A.; Roy, J.; Saha, S.; Blaschke, T.; Ghorbanzadeh, O.; Tien Bui, D. Application of Probabilistic and Machine Learning Models for Groundwater Potentiality Mapping in Damghan Sedimentary Plain, Iran. Remote Sens. 2019, 11, 3015. [Google Scholar] [CrossRef] [Green Version]
  49. Arabameri, A.; Rezaei, K.; Cerdà, A.; Conoscenti, C.; Kalantari, Z. A comparison of statistical methods and multi-criteria decision making to map flood hazard susceptibility in Northern Iran. Sci. Total Environ. 2019, 660, 443–458. [Google Scholar] [CrossRef]
  50. Hong, H.; Pradhan, B.; Xu, C.; Tien Bui, D. Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena 2015, 133, 266–281. [Google Scholar] [CrossRef]
  51. Lerner, D.N.; Harris, B. The relationship between land use and groundwater resources and quality. Land Use policy 2009, 26, S265–S273. [Google Scholar] [CrossRef]
  52. Condon, L.E.; Maxwell, R.M. Evaluating the relationship between topography and groundwater using outputs from a continental-scale integrated hydrology model. Water Resour. Res. 2015, 51, 6602–6621. [Google Scholar] [CrossRef] [Green Version]
  53. Duan, H.; Deng, Z.; Deng, F. Classification of groundwater potential in Chaoyang area based on QUEST algorithm. In Proceedings of the International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; Institute of Electrical and Electronics Engineers Inc.: Pistacaway, NJ, USA, 2016; pp. 890–893. [Google Scholar]
  54. Serele, C.; Pérez-Hoyos, A.; Kayitakire, F. Mapping of groundwater potential zones in the drought-prone areas of south Madagascar using geospatial techniques. Geosci. Front. 2020, 11, 1403–1413. [Google Scholar] [CrossRef]
  55. Snowdon, A.P.; Sykes, J.F.; Normani, S.D. Topography scale effects on groundwater-surface water exchange fluxes in a Canadian Shield setting. J. Hydrol. 2020, 585, 124772. [Google Scholar] [CrossRef]
  56. Pourghasemi, H.R.; Beheshtirad, M. Assessment of a data-driven evidential belief function model and GIS for groundwater potential mapping in the Koohrang Watershed, Iran. Geocarto Int. 2015, 30, 662–685. [Google Scholar] [CrossRef]
  57. Oikonomidis, D.; Dimogianni, S.; Kazakis, N.; Voudouris, K. A GIS/Remote Sensing-based methodology for groundwater potentiality assessment in Tirnavos area, Greece. J. Hydrol. 2015, 525, 197–208. [Google Scholar] [CrossRef]
  58. Preeja, K.R.; Joseph, S.; Thomas, J.; Vijith, H. Identification of Groundwater Potential Zones of a Tropical River Basin (Kerala, India) Using Remote Sensing and GIS Techniques. J. Indian Soc. Remote Sens. 2011, 39, 83–94. [Google Scholar] [CrossRef]
  59. Sørensen, R.; Zinko, U.; Seibert, J. On the calculation of the topographic wetness index: Evaluation of different methods based on field observations. Hydrol. Earth Syst. Sci. 2006, 10, 101–112. [Google Scholar] [CrossRef] [Green Version]
  60. Razandi, Y.; Pourghasemi, H.R.; Neisani, N.S.; Rahmati, O. Application of analytical hierarchy process, frequency ratio, and certainty factor models for groundwater potential mapping using GIS. Earth Sci. Inform. 2015, 8, 867–883. [Google Scholar] [CrossRef]
  61. Kumar, P.K.D.; Gopinath, G.; Seralathan, P. Application of remote sensing and GIS for the demarcation of groundwater potential zones of a river basin in Kerala, southwest coast of India. Int. J. Remote Sens. 2007, 28, 5583–5601. [Google Scholar] [CrossRef]
  62. Das, S.; Pardeshi, S.D.; Kulkarni, P.P.; Doke, A. Extraction of lineaments from different azimuth angles using geospatial techniques: A case study of Pravara basin, Maharashtra, India. Arab. J. Geosci. 2018, 11, 1–13. [Google Scholar] [CrossRef]
  63. Singh, K.V.; Setia, R.; Sahoo, S.; Prasad, A.; Pateriya, B. Evaluation of NDWI and MNDWI for assessment of waterlogging by integrating digital elevation model and groundwater level. Geocarto Int. 2015, 30, 650–661. [Google Scholar] [CrossRef]
  64. Yeh, H.F.; Cheng, Y.S.; Lin, H.I.; Lee, C.H. Mapping groundwater recharge potential zone using a GIS approach in Hualian River, Taiwan. Sustain. Environ. Res. 2016, 26, 33–43. [Google Scholar] [CrossRef] [Green Version]
  65. Miraki, S.; Zanganeh, S.H.; Chapi, K.; Singh, V.P.; Shirzadi, A.; Shahabi, H.; Pham, B.T. Mapping Groundwater Potential Using a Novel Hybrid Intelligence Approach. Water Resour. Manag. 2019, 33, 281–302. [Google Scholar] [CrossRef]
  66. Lee, S.; Hyun, Y.; Lee, M.-J. Groundwater Potential Mapping Using Data Mining Models of Big Data Analysis in Goyang-si, South Korea. Sustainability 2019, 11, 1678. [Google Scholar] [CrossRef] [Green Version]
  67. Nhu, V.H.; Mohammadi, A.; Shahabi, H.; Ahmad, B.B.; Al-Ansari, N.; Shirzadi, A.; Geertsema, M.; Kress, V.R.; Karimzadeh, S.; Kamran, K.V.; et al. Landslide detection and susceptibility modeling on cameron highlands (Malaysia): A comparison between random forest, logistic regression and logistic model tree algorithms. Forests 2020, 11, 830. [Google Scholar] [CrossRef]
  68. Lee, S.; Lee, C.-W. Application of Decision-Tree Model to Groundwater Productivity-Potential Mapping. Sustainability 2015, 7, 13416–13432. [Google Scholar] [CrossRef] [Green Version]
  69. Thapa, R.; Gupta, S.; Guin, S.; Kaur, H. Assessment of groundwater potential zones using multi-influencing factor (MIF) and GIS: A case study from Birbhum district, West Bengal. Appl. Water Sci. 2017, 7, 4117–4131. [Google Scholar] [CrossRef]
  70. Mandal, U.; Sahoo, S.; Munusamy, S.B.; Dhar, A.; Panda, S.N.; Kar, A.; Mishra, P.K. Delineation of Groundwater Potential Zones of Coastal Groundwater Basin Using Multi-Criteria Decision Making Technique. Water Resour. Manag. 2016, 30, 4293–4310. [Google Scholar] [CrossRef]
  71. Sander, P. Lineaments in groundwater exploration: A review of applications and limitations. Hydrogeol. J. 2007, 15, 71–74. [Google Scholar] [CrossRef]
  72. Benjmel, K.; Amraoui, F.; Boutaleb, S.; Ouchchen, M.; Tahiri, A.; Touab, A. Mapping of Groundwater Potential Zones in Crystalline Terrain Using Remote Sensing, GIS Techniques, and Multicriteria Data Analysis (Case of the Ighrem Region, Western Anti-Atlas, Morocco). Water 2020, 12, 471. [Google Scholar] [CrossRef] [Green Version]
  73. Hakim, W.; Achmad, A.; Lee, C.-W. Land Subsidence Susceptibility Mapping in Jakarta Using Functional and Meta-Ensemble Machine Learning Algorithm Based on Time-Series InSAR Data. Remote Sens. 2020, 12, 3627. [Google Scholar] [CrossRef]
  74. Pradhan, B.; Abokharima, M.H.; Jebur, M.N.; Tehrany, M.S. Land subsidence susceptibility mapping at Kinta Valley (Malaysia) using the evidential belief function model in GIS. Nat. Hazards 2014, 73, 1019–1042. [Google Scholar] [CrossRef]
  75. Moung-Jin, L.; Won-Kyong, S.; Joong-Sun, W.; Inhye, P.; Saro, L. Spatial and temporal change in landslide hazard by future climate change scenarios using probabilistic-based frequency ratio model. Geocarto Int. 2014, 29, 639–662. [Google Scholar] [CrossRef]
  76. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
  77. Wu, W.; Zucca, C.; Muhaimeed, A.S.; Al-Shafie, W.M.; Fadhil Al-Quraishi, A.M.; Nangia, V.; Zhu, M.; Liu, G. Soil salinity prediction and mapping by machine learning regression in Central Mesopotamia, Iraq. Land Degrad. Dev. 2018, 29, 4005–4014. [Google Scholar] [CrossRef]
  78. Azeez, O.; Pradhan, B.; Shafri, H. Vehicular CO Emission Prediction Using Support Vector Regression Model and GIS. Sustainability 2018, 10, 3434. [Google Scholar] [CrossRef] [Green Version]
  79. Kavousi-Fard, A.; Samet, H.; Marzbani, F. A new hybrid Modified Firefly Algorithm and Support Vector Regression model for accurate Short Term Load Forecasting. Expert Syst. Appl. 2014, 41, 6047–6056. [Google Scholar] [CrossRef]
  80. Chen, W.; Hong, H.; Panahi, M.; Shahabi, H.; Wang, Y.; Shirzadi, A.; Pirasteh, S.; Alesheikh, A.A.; Khosravi, K.; Panahi, S.; et al. Spatial prediction of landslide susceptibility using GIS-based data mining techniques of ANFIS with whale optimization algorithm (WOA) and grey wolf optimizer (GWO). Appl. Sci. 2019, 9, 3755. [Google Scholar] [CrossRef] [Green Version]
  81. Mirjalili, S.M.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
  82. Pradhan, M.; Roy, P.K.; Pal, T. Oppositional based grey wolf optimization algorithm for economic dispatch problem of power system. Ain Shams Eng. J. 2018, 9, 2015–2025. [Google Scholar] [CrossRef]
  83. Soleimani, H.; Kannan, G. A hybrid particle swarm optimization and genetic algorithm for closed-loop supply chain network design in large-scale networks. Appl. Math. Model. 2015, 39, 3990–4012. [Google Scholar] [CrossRef]
  84. Tien Bui, D.; Bui, Q.T.; Nguyen, Q.P.; Pradhan, B.; Nampak, H.; Trinh, P.T. A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agric. For. Meteorol. 2017, 233, 32–44. [Google Scholar] [CrossRef]
  85. Arabameri, A.; Nalivan, O.A.; Pal, S.C.; Chakrabortty, R.; Saha, A.; Lee, S.; Pradhan, B.; Bui, D.T. Novel machine learning approaches for modelling the gully erosion susceptibility. Remote Sens. 2020, 12, 2833. [Google Scholar] [CrossRef]
  86. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  87. Notti, D.; Mateos, R.M.; Monserrat, O.; Devanthéry, N.; Peinado, T.; Roldán, F.J.; Fernández-Chacón, F.; Galve, J.P.; Lamas, F.; Azañón, J.M. Lithological control of land subsidence induced by groundwater withdrawal in new urban AREAS (Granada Basin, SE Spain). Multiband DInSAR monitoring. Hydrol. Process. 2016, 30, 2317–2331. [Google Scholar] [CrossRef]
  88. Costall, A.R.; Harris, B.D.; Teo, B.; Schaa, R.; Wagner, F.M.; Pigois, J.P. Groundwater Throughflow and Seawater Intrusion in High Quality Coastal Aquifers. Sci. Rep. 2020, 10, 1–33. [Google Scholar] [CrossRef] [PubMed]
  89. Jasechko, S.; Perrone, D.; Seybold, H.; Fan, Y.; Kirchner, J.W. Groundwater level observations in 250,000 coastal US wells reveal scope of potential seawater intrusion. Nat. Commun. 2020, 11, 1–9. [Google Scholar] [CrossRef] [PubMed]
  90. Hounsinou, S.P. Assessment of potential seawater intrusion in a coastal aquifer system at Abomey—Calavi, Benin. Heliyon 2020, 6, e03173. [Google Scholar] [CrossRef] [PubMed]
  91. Floris, M.; Fontana, A.; Tessari, G.; Mulè, M. Subsidence Zonation through Satellite Interferometry in Coastal Plain Environments of NE Italy: A Possible Tool for Geological and Geomorphological Mapping in Urban Areas. Remote Sens. 2019, 11, 165. [Google Scholar] [CrossRef] [Green Version]
  92. Maskooni, E.K.; Naghibi, S.A.; Hashemi, H.; Berndtsson, R. Application of advanced machine learning algorithms to assess groundwater potential using remote sensing-derived data. Remote Sens. 2020, 12, 2742. [Google Scholar] [CrossRef]
Figure 1. Optical image of the study area in Gangneung-si acquired from the Sentinel-2 satellite.
Figure 1. Optical image of the study area in Gangneung-si acquired from the Sentinel-2 satellite.
Remotesensing 13 01196 g001
Figure 2. Selected groundwater-related factors: (a) elevation, (b) slope, (c) slope height, (d) TWI, (e) LS, (f) precipitation, (g) water density, (h) NDWI, (i) land use, (j) soil type, (k) lithology, (l) lineament density, and (m) distance to fault.
Figure 2. Selected groundwater-related factors: (a) elevation, (b) slope, (c) slope height, (d) TWI, (e) LS, (f) precipitation, (g) water density, (h) NDWI, (i) land use, (j) soil type, (k) lithology, (l) lineament density, and (m) distance to fault.
Remotesensing 13 01196 g002aRemotesensing 13 01196 g002bRemotesensing 13 01196 g002c
Figure 3. Workflow of groundwater potential mapping.
Figure 3. Workflow of groundwater potential mapping.
Remotesensing 13 01196 g003
Figure 4. Groundwater potential maps generated using three algorithms: (a) SVR algorithm, (b) SVR_GWO algorithm, and (c) SVR_PSO algorithm.
Figure 4. Groundwater potential maps generated using three algorithms: (a) SVR algorithm, (b) SVR_GWO algorithm, and (c) SVR_PSO algorithm.
Remotesensing 13 01196 g004
Figure 5. Receiver operating characteristic (ROC) curves for the SVR, SVR_GWO, and SVR_PSO algorithms.
Figure 5. Receiver operating characteristic (ROC) curves for the SVR, SVR_GWO, and SVR_PSO algorithms.
Remotesensing 13 01196 g005
Table 1. Frequency ratios of groundwater-related factors.
Table 1. Frequency ratios of groundwater-related factors.
FactorClassTotal %Event %Frequency Ratio
Elevation (m)0–1009.8852.725.34
100–22210.4721.742.08
222–35611.118.700.78
356–4959.925.980.60
495–62310.003.800.38
623–7359.901.090.11
735–8469.732.170.22
846–9689.833.260.33
968–11199.710.000.00
>11199.540.540.06
Slope (degrees)0–3.9510.6651.094.79
3.95–9.6410.0230.433.04
9.64–14.8410.1511.411.12
14.84–19.299.944.890.49
19.29–23.509.891.090.11
23.50–27.469.950.540.05
27.46–31.1710.020.540.05
31.17–34.889.790.000.00
34.88–39.349.850.000.00
>39.349.740.000.00
Slope height (m)0–6.379.9924.462.45
6.37–10.6110.0015.761.58
10.61–12.7410.0025.002.50
12.74–14.8610.0030.433.04
14.86–23.3610.003.260.33
23.36–33.9810.000.540.05
33.98–48.8510.000.000.00
48.85–70.0910.000.540.05
70.09–108.3210.000.000.00
>108.3210.000.000.00
TWI1.65–4.7410.000.000.00
4.74–5.1410.000.000.00
5.14–5.5410.000.000.00
5.54–6.0410.001.090.11
6.04–6.7410.002.170.22
6.74–7.6410.005.980.60
7.64–9.2410.006.520.65
9.24–12.5310.0012.501.25
12.53–13.9310.0028.262.83
13.93–27.2110.0043.484.35
LS-factor0–3.5910.6651.094.79
3.59–7.189.9425.542.57
7.18–10.789.9310.331.04
10.78–14.379.935.980.60
14.37–17.979.933.260.33
17.97–21.569.931.090.11
21.56–26.369.921.630.16
26.36–32.259.920.540.05
32.35–38.739.920.000.00
>38.739.920.540.05
Precipitation (mm/year)1198–12454.300.000.00
1245–12756.962.720.39
1275–12946.237.611.22
1294–13099.376.520.70
1309–132212.124.890.40
1322–133514.845.980.40
1335–1350 13.316.520.49
1350–136712.3014.671.19
1367–138313.4730.432.26
1383–14107.1020.652.91
Water density (km/km2)014.108.150.58
0–1.589.545.980.63
1.58–2.779.544.890.51
2.77–3.969.545.430.57
3.96–5.159.546.520.68
5.15–6.349.5511.411.20
6.34–7.529.558.700.91
7.52–9.119.558.700.91
9.11–11.499.5513.591.42
>11.49 9.5426.632.79
NDWI−0.57–−0.0410.0037.503.75
−0.04–0.0210.0029.892.99
0.02–0.0810.0013.041.30
0.08–0.1510.008.150.82
0.15–0.2310.004.890.49
0.23–0.3210.003.260.33
0.32–0.4110.001.090.11
0.41–0.510.001.090.11
0.5–0.6110.010.540.05
>0.6110.010.540.05
Land use Urban 3.6121.585.98
Agricultural 11.9557.554.82
Forest78.1911.870.15
Grassland3.284.681.42
Marsh0.290.722.51
Water body1.522.521.65
Bare ground 1.161.080.93
Soil typeCommon paddy0.606.5210.81
Immature paddy0.354.8913.91
Sandy paddy3.8526.096.78
Wet paddy2.4811.414.60
Common field5.2423.374.46
Immature field1.803.802.11
Sandy field1.252.722.17
Red/yellow forest soil1.042.172.10
Volcanic ash soil2.223.261.47
Rock/soil 78.9713.590.17
Other2.202.170.99
LithologyAlluvium5.8228.064.82
Biotite granite0.100.000.00
Dyke0.010.000.00
Gneiss4.350.000.00
Granite54.8343.880.80
Limestone17.893.600.20
Sedimentary rock16.3424.461.50
Water0.660.000.00
Lineament density (km/km2)0.10–0.1799.62242.112.43
0.17–0.19100.57173.681.73
0.19–0.22100.64100.000.99
0.22–0.24100.6689.470.89
0.24–0.26100.6668.420.68
0.26–0.27100.6542.110.42
0.27–0.29100.6636.840.37
0.29–0.31100.6360.820.60
0.31–0.34100.6781.290.81
>0.34 100.6773.680.73
Distance to fault (m)010.056.520.65
0–105210.028.150.81
1052–210410.007.070.71
2104–315610.005.430.54
3156–420810.009.240.92
4208–52609.9912.501.25
5260–683810.006.520.65
6838–89429.9911.411.14
8942–12,62410.0014.671.47
>12,6249.9618.481.86
Table 2. Sensitivity results for each algorithm (SVR, SVR_GWO, and SVR_PSO).
Table 2. Sensitivity results for each algorithm (SVR, SVR_GWO, and SVR_PSO).
FactorsMapping Accuracy Values (%)
SVRVariationSVR_GWOVariationSVR_PSOVariation
All factors0.803 0.878 0.814
Elevation0.8–0.30.866–1.20.803–1.1
Slope0.783–20.872–0.60.805–0.9
Slope height0.788–0.50.872–0.60.807–0.7
TWI0.788–0.50.87800.81–0.4
LS0.782–2.10.877–0.20.808–0.6
Precipitation0.8070.40.8820.30.820.6
Water density0.788–0.50.8790.10.808–0.6
NDWI0.792–1.10.866–1.30.797–1.7
Land use0.752–5.10.87–0.90.811–0.3
Soil type0.777–2.60.865–1.30.804–1
Lithology0.78–2.30.871–0.70.8–1.4
Lineament Density0.782–2.10.874–0.40.807–0.7
Distance to fault0.8060.30.87800.8150.1
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fadhillah, M.F.; Lee, S.; Lee, C.-W.; Park, Y.-C. Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea. Remote Sens. 2021, 13, 1196. https://doi.org/10.3390/rs13061196

AMA Style

Fadhillah MF, Lee S, Lee C-W, Park Y-C. Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea. Remote Sensing. 2021; 13(6):1196. https://doi.org/10.3390/rs13061196

Chicago/Turabian Style

Fadhillah, Muhammad Fulki, Saro Lee, Chang-Wook Lee, and Yu-Chul Park. 2021. "Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea" Remote Sensing 13, no. 6: 1196. https://doi.org/10.3390/rs13061196

APA Style

Fadhillah, M. F., Lee, S., Lee, C. -W., & Park, Y. -C. (2021). Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea. Remote Sensing, 13(6), 1196. https://doi.org/10.3390/rs13061196

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop