Figure 2 presents the general flowchart in this study to map the GWPZ. The GWPZ was determined by calculating the GWPI for the entire groundwater basin [
25]. The GWPI represents the complex interplay of socioeconomic factors, hydrometeorology, topography, and land resources [
26]. A GWPZ map employs a weighted index overlay concept, where weight values are assigned to each thematic layer [
27]. As shown in the flowchart of
Figure 2, the study determined the GA based on the collected site-specific data, including the aquifer thickness of the first layer, groundwater depths, and the porosity of the shallow aquifer. The results of the GA were for the validation of the GWPZ. We then conducted the Band Collection statistics for normalized thematic maps. The index–overlay method requires weightings and ratings for each specific thematic map. We utilized the FE-AHP to assess the thematic layers and define their weightings. The ratings and weightings of the selected parameters were used to determine the GWPI for each cell in the map. Subsequently, the GWPZ map was generated based on the GWPI values.
2.3.2. Analytic Hierarchy Process (AHP)
The AHP, proposed by Saaty [
6], has found extensive application in multi-criteria evaluation for decision-making scenarios involving conflicting and qualitative criteria. The AHP involves a structured approach of pairwise comparisons, utilizing a standardized nine-level scale to determine the relative importance of criteria. The AHP involves a structured set of pairwise comparisons, utilizing a standardized nine-level scale to evaluate the relative importance of criteria. Each criterion is assigned ratings or weights based on these comparisons, using values ranging from 1 to 9, reflecting the degree of importance (from extremely less important to extremely more influential). The impact of factors or criteria on the decision-making process is quantified by combining this scale with the expertise and knowledge of specialists or users [
31]. The AHP method allows for a comprehensive consideration of both subjective and objective evaluation measures while also providing a means to test the stability of evaluation methods and proposed options through specialists or decision-makers, thus reducing errors in decision making [
31]. This study employed the results of the classical AHP for comparison purposes. Specifically, the maps of the GWPZs obtained from the conventional AHP and the FE-AHP were quantitatively evaluated and assessed.
Most studies utilizing multi-criteria evaluation, such as the AHP, rely on expert judgment to rank the criteria. However, this study introduced a novel approach using a correlation matrix between selected thematic maps derived from seven RS data and an independent variable not included in the model. The study considered GA as the independent variable strongly correlated with the GWPI [
3]. In AHP methods, the initial step involves tabulating the data based on Pearson’s correlation matrix built between the normalized thematic maps and groundwater availability. The correlation matrix among the thematic layers was generated using Band Collection Statistics in ArcGIS 10.8. This method allows for ranking each criterion of the GWPZ, providing a more objective approach than relying solely on expert judgment or the existing literature. Here, the z-score normalization or zero mean normalization method was employed to normalize the data. This normalization method involves subtracting the mean (
) of each feature and dividing it by the standard deviation (
) [
27].
where
is the value of the criteria for every grid of
,
is the expectation of the variable,
is the standard deviation of the variable, and the notation
represents the z-score of the corresponding outlier. Reference data are presented in the form of seven data on the GWPI, which were ranked as reference partners, along with seven thematic maps. The seven thematic maps were then determined by considering the analysis of the needs and availability of RS data. However, the data types could be changed based on different situations and site-specific conditions. In this study, the selected parameters were based on the hydrology processes, human interventions, geological profile and surface profile [
11]. For hydrological processes, they are characterized by specific parameters such as P (precipitation in mm), MNDWI (modified normalized difference water index), and DD (drainage density in km/km
2). Human interventions are identified by parameters, namely, the EVI (enhanced vegetation index) and NDBI (normalized difference building index). Geological profile and surface profile are explained by TRI (terrain ruggedness index) and SL (slope in degrees), respectively.
Note that the AHP involves breaking down the issue into a hierarchy of elements, specifically the criteria and the sub-criteria. In this study, the criteria consisted of the seven parameters, while the sub-criteria were the different ranges within each parameter.
Precipitation (P). Precipitation is the primary source for groundwater recharge in aquifer systems [
11]. Hence, increasing precipitation represents the increasing groundwater potential over a specific area. Precipitation is closely related to the amount of surface water that infiltrates and percolates into the aquifer as the input for the groundwater recharge [
32]. As a result, a variation in the spatial intensity of precipitation is referred to as the variation in groundwater recharge rate across CRGB. The yearly precipitation data from 2006 to 2015 were derived from the Climate Hazards Group InfraRed Precipitation With Station Data (CHIRPS) dataset.
Drainage density (DD). The drainage density is the ratio of the stream segments in a specific area. Drainage is one of the crucial parameters in hydrogeological processes, which controls the interactions near land surfaces. In an aquifer system, the drainage density could significantly influence groundwater recharge. Lower groundwater infiltration happens in the media with higher drainage density. A higher drainage density associated with the lower soil permeability on land surfaces could lead to lower water infiltration and higher surface runoff. In the study, we employed the available line density algorithm in ArcGIS to calculate the drainage density in CRGB. Drainage density with the unit of km/km2 represents the closeness among the stream channels.
Enhanced vegetation index (EVI). Areas with dense vegetation are hydrologically more stable due to their typically better soil infiltration properties, attributed to higher organic matter content. Different vegetation types respond differently to groundwater presence in aquifers [
33]. NDVI is a conventional vegetation index, but recent studies suggest that the EVI, with a more robust profile in areas with atmospheric disturbance, can be derived from Landsat 8–9 data [
34]. Equation (2) outlines the general formula for determining the EVI, covering the canopy background with the L value. Coefficients for atmospheric corrections and the blue band (B) are represented by C values, helping reduce noise induced by background, atmospheric conditions, and saturation in cloud-covered areas.
In Landsat 8–9, the EVI for the study area can be calculated by 2.5 ((Band 5–Band 4)/(Band 5 + 6 Band 4–7.5 Band 2 + 1)). The EVI has a range between −1 and 1. When an area has an EVI closer to 1, then the vegetation over that area is very dense and has a higher value for groundwater potential, and vice versa.
Modified normalized difference water index (MNDWI). The MNDWI was applied to detect the water bodies on the land surface [
35]. The objective of the MNDWI is to reduce the effect of features in the built-up areas that are often detected together with open water like other indices. The algorithm of the MNDWI can be seen in Equation (3):
Moreover, in the case of the modified normalized difference water index (MNDWI), the pixel values extracted from the Green (3rd band) and short-wave infrared SWIR1 (6th band) play a pivotal role. The synergistic utilization of these specific bands from the Landsat 8–9 satellite imagery facilitates a comprehensive analysis of water bodies, aiding in accurately delineating and characterizing aquatic features within the study area.
Terrain ruggedness index (TRI). Topography is essential in controlling the spatial variability of hydrological processes such as surface and groundwater flow and soil moisture. Topographic indices have been applied to represent the spatial pattern of soil moisture [
36]. The topographic ruggedness index to quantify the elevation difference between adjacent cells using DEM obtained from SRTM [
36]. Terrain roughness, such as micro-relief, terrain rugosity, ruggedness, surface roughness, and micro topography, can be defined as the variation in elevation. A higher TRI value of a pixel translates to a more considerable difference in altitude compared to the adjacent areas around that pixel.
Slope (SL). Slope factors intensely affect the lateral and vertical flow of groundwater [
33]. Previous investigations have recognized that the land surface slope considerably controls the infiltration rate of surface water, which is mainly related to the groundwater recharge of an aquifer system (e.g., Ref. [
37]). In principle, the percolation rate of surface water has a negative linear interaction with the land surface slope because of the retention time of the surface runoff. The surface runoff is relatively lower in a gentle slope area than in a sloping area. Therefore, a gentle slope will lead to a higher rate of percolation.
Normalized difference building index (NDBI). The conventional image classification technique is usually used to classify satellite images based on supervised and unsupervised classification methods. However, these methods are ineffective, including steps with complex procedures. Specifically, the operations require composite bands and judgment parameters for the final results. The NDBI technique is more effective than conventional classification methods. The reflectance for built-up areas and bare lands is relatively higher for SWIR than for NIR. For a green surface, the reflection of NIR is higher than that of the SWIR spectrum. In contrast, water bodies cannot be detected by the infrared spectrum. The NDBI gives the following formula (Equation (4)):
For Landsat 8 data, the NDBI can be calculated using the formula (Band 6–Band 5)/ (Band 6 + Band 5). Also, the NDBI has a value range between −1 and +1. A negative value of NDBI represents water bodies, whereas a positive value represents built-up areas. Identifying water bodies is essential to indicate the possible recharge zones for an aquifer system. Note that the NDBI value for vegetation is generally lower than the NDBI value for water (see
Table A13 and
Table A14 for details) [
38].
The AHP requires constructing judgment matrices (
) of size (
n × n) through pairwise comparisons among the n criteria. The diagonal elements are all set to one in these matrices since they represent the same criterion. Subsequently, the relative weights for each matrix are determined by identifying the right eigenvector (
w) corresponding to the largest eigenvalue (
) (Equation (5)).
As the value of
approaches the number of elements (
) in the pairwise comparison matrix, the judgments in the matrix become increasingly consistent. Hence, the difference,
, can be used as an indicator of inconsistency. To assess the coherence of the judgments, Saaty [
30] introduced a consistency index (
) to measure the agreement among the
matrices, where
The formula for calculating the consistency index can be expressed as (Equation (6)):
Random pairwise comparisons on matrices of varying sizes determine the consistency index. The random index (
) can be calculated by averaging the consistency indexes for each matrix size. The consistency ratio (
) is subsequently defined as the ratio between the consistency index and the random index (
) (Equation (7)).
A
value exceeding 0.1 indicates a significant inconsistency in the judgments made during the creation of the pairwise comparison matrix. Therefore, it is necessary to maintain a
at
0.1 to ensure the stability of the array. The
measures how consistent the pairwise comparisons are in the AHP analysis. In addition, the
is used to evaluate the reliability of the judgments made during the comparison process. Note that the significance of the difference between these two consistency ratios depends on the context and the specific threshold used in the model. In general, the lower the consistency ratio, the more reliable the judgments made in the analysis [
39].
2.3.3. Fuzzy Extended AHP (FE-AHP)
Zadeh [
11] introduced the fuzzy set theory in 1965. The study showed that using the membership functions by real numbers [0, 1] is acceptable. The process is called generalization for a classic set theory. In principle, the primary characteristic of fuzziness is individuals grouping into some classes. At that point, it allows unclear boundaries or bias for the threshold of each class [
40]. Then, the ambiguous comparison of the judgment can be characterized by fuzzy numbers. The TFN is a unique class of fuzzy numbers in which three real numbers define the membership: (
l, m, u). A TFN is symbolized by (
l, m, u), where
l, m, and
u refer to the smallest, most promising, and largest possible values, respectively (Equation (8)). In some cases, it could occur when the data are difficult to specify precisely because of measurement or instrument error. However, an accurate height measurement is rarely obtained in practice, with it usually slightly more or slightly less than the real value. Thus, the measurement numbers can be written more accurately as the TFN (
Table 3).
Figure 3a shows the conceptual structure of the TFN of the conventional AHP method [
41].
To create a pairwise comparison of alternatives for each criterion, similar to the concept of the conventional AHP method, a matrix of triangular fuzzy comparison can be defined as follows (Equation (9)):
where
.
The total weights and preferences of alternatives could be developed from other methods. In general, two alternative methods will be modeled in continuation.
Figure 3 summarizes the conceptual structure of the conventional AHP and FE-AHP. The conventional AHP uses a TFN to represent linguistic variables and capture imprecise judgments. It consists of three parameters, including
l (lower bound),
m (modal value), and
u (upper bound) (see
Figure 3a). TFNs are commonly used in the AHP to express linguistic terms, such as “low”, “medium”, and “high”, or to represent uncertainty in pairwise comparisons.
The FE-AHP method was proposed by Chang (1996) [
10,
11]. The extended analysis of Chang (1996) can be concisely described in several steps. First, compute the normalized value for row sums by fuzzy arithmetic operations (Equation (10)):
where the notation ⊗ represents the extended multiplication of two fuzzy numbers. Second, compute the degree of possibility using the following (Equation (11)):
which can be equally written by (Equation (12)):
and d is the ordinate of the highest intersection point between
,
(see
Figure 3b).
The FE-AHP method is an improved version of the AHP that introduces additional features to handle more complex decision problems involving uncertainties. The FE-AHP method incorporates fuzzy numbers to represent imprecise information, but instead of using TFNs, it uses fuzzy number intervals (
Figure 3b). The representation of fuzzy numbers in the FE-AHP is more flexible and allows for a more extensive range of uncertainty [
43].
In the FE-AHP, a fuzzy number is represented as
, where
are lower, modal, and upper values, respectively. Similarly, another fuzzy number
can be defined (
Figure 3b). The FE-AHP allows an intersection point ‘‘d’’ between two fuzzy numbers
and
These fuzzy number intervals allow decision-makers to express their preferences and judgments more detailed and nuancedly [
43]. The FE-AHP also introduces the concept of fuzzy pairwise comparison matrices, which capture the imprecise judgments between criteria or alternatives. These matrices use fuzzy numbers or fuzzy number intervals to accommodate uncertainty and imprecision during the decision-making process.
Third, calculate the degree of possibility of
to be larger than all the other (n − 1) convex fuzzy numbers
(Equation (13)):
Fourth, define the vector of priority
for the fuzzy comparison matrix
as (Equation (14)):
In summary, both TFNs for the AHP and FE-AHP methods involve fuzzy numbers. The TFN relies on a triangular representation (l, m, u) to express imprecise judgments, whereas FE-AHP employs fuzzy number intervals , to provide a more versatile representation for capturing uncertainties and complex decision problems. In the FE-AHP method, the degree of possibility is suggested for the ordering and the weights. In this step, a pairwise comparison is made for every fuzzy weight by other fuzzy weights. The conforming degree of possibility of being higher than other fuzzy weights is defined. The minimum of the possibility is used as the overall score for each criterion.
Table 3.
The triangular fuzzy number of the FE-AHP method [
44].
Table 3.
The triangular fuzzy number of the FE-AHP method [
44].
AHP Scale | Linguistic Variable | FE-AHP Scale |
---|
TFN Number | Reciprocal |
---|
1 | Equally important | (1, 1, 1) | (1, 1, 1) |
2 | Intermediate of 1 to 3 | (1/2, 1, 3/2) | (2/3, 1, 2) |
3 | Slightly important | (1, 3/2, 2) | (1/2, 2/3, 1) |
4 | Intermediate of 3 to 5 | (3/2, 2, 5/2) | (2/5, 1/2, 2/3) |
5 | Important | (2, 5/2, 3) | (1/3, 2/5, 1/2) |
6 | Intermediate of 5 to 7 | (5/2, 3, 7/2) | (2/7, 1/3, 2/5) |
7 | Strongly important | (3, 7/2, 4) | (1/4, 2/7, 1/3) |
8 | Intermediate of 7 to 9 | (7/2, 4, 9/2) | (2/9, 1/4), 2/7) |
9 | Extremely important | (4, 9/2, 9/2) | (2/9, 2/9, 1/4) |
2.3.5. Validation of the GWPZ
First, the GWPZ map was validated using Pearson’s correlation matrix between the GWPI and the direct calculation of the GA for the first aquifer layer in the CRGB. Note that the GA was not included in the analysis of the GWPZ. The GA served as an independent parameter that robustly confirmed the accuracy of the GWPZ. Initially, the GA was solely utilized to determine the ranking and was not included in the GWPZ map. For raster datasets, the correlation matrix provides cell values from one raster layer to another. This correlation between layers enables the measurement of the degree of dependency between them. Correlation values range from +1 to −1. A positive correlation indicates a direct relationship between the layers, whereas a negative correlation signifies an inverse relationship where the variables change in opposite directions. The study also conducted validation based on ground checking. Google Earth provides satellite imagery with varying spatial and temporal resolution [
46]. The spatial resolution of Google Earth imagery varies depending on location and data availability. In densely populated areas or popular tourist destinations, the spatial resolution tends to be higher, enabling more detailed views of buildings, streets, and landmarks [
46]. Conversely, remote or less frequented areas may have a lower spatial resolution, resulting in less detailed information [
46].
Second, delineating groundwater recharge potential zones involves a weighted overlay of different themes in the geospatial environment. This method serves as an indirect measurement to identify potential zones but requires validation through direct observations for accurate planning. The validation process is a crucial element in scientific research. A random sampling technique is employed in the geospatial environment to validate the groundwater recharge potential zones with actual recharge. Using Google Earth, multiple locations were selected randomly to represent each class of the GWPZ, allowing for a comprehensive assessment of the GWPZ’s accuracy. As a model consistency validation, locations were selected based on our knowledge of their real or physical profile. For instance, the downstream area in Mailiao Haipu New Land, western Yunlin County, features diverse land uses, including coastal uplift rejuvenation, siltation sites, saline ponds, aqua fisheries, and sandbanks affected by monsoonal activity. Hypothesizing, Mailiao Haipu New Land is expected to have a poor geopotential zone due to salty water and seawater intrusion. Conversely, the upstream area includes agriculture and water resources, indicating a good groundwater potential zone.