Establishing a Data Fusion Water Resources Risk Map Based on Aggregating Drinking Water Quality and Human Health Risk Indices

Abstract: The Drinking Water Quality Index (DWQI) and the Human Health Risk Index (HHRI) are two of the most promising tools for assessing the impact of water quality on human health. Each index has its own way of determining a specific level of drinking-water safety, and their results may differ. This study aims to develop an aggregated index, using a data fusion technique, to identify areas that are vulnerable with respect to safe drinking water and, subsequently, areas of risk to human health, particularly from non-cancerous diseases, in the Maku–Bazargan–Poldasht area in NW Iran. Nitrate (NO₃⁻) and fluoride (F⁻) are the predominant contaminants that threaten the local population's health. The DWQI revealed that the majority of the study sites fell in the poor to unsuitable drinking water quality classes. Health risk assessments showed an excessive potential for non-carcinogenic health risks because of high NO₃⁻ and F⁻ exposure through drinking water. According to the total hazard index (THI; NO₃⁻ and F⁻), children are at a higher risk of non-carcinogenic effects than adults, suggesting that locals have faced a lifetime risk of non-cancer effects as a consequence of their exposure to these pollutants. Data fusion techniques can assist in developing a comprehensive water resources risk map for decision-making.

Contributions: Conceptualization, A.A.N.; A.A.N. R.B.; curation, Z.S.; writing - original preparation, Z.S.; writing - review and editing, M.R.N.; visualization, M.R.N.; supervision, A.A.N.; project administration, M.R.N.;


Introduction
Water shortages and health problems associated with drinking water have become widespread worldwide. In the context of water quality, fluoride (F⁻) and nitrate (NO₃⁻) concentrations are of particular importance because of their significant impact on human health. High F⁻ and NO₃⁻ concentrations in groundwater have been reported in many developing countries, including Ghana, parts of eastern and southern Africa, Turkey, and Iran [1][2][3][4]. It is well known that excessive F⁻ exposure can damage teeth, bones, and, in some cases, the kidneys. Adverse effects are also known to occur in children who inadvertently consume inappropriate amounts of F⁻ [5]. Moreover, an inadequate intake of F⁻ can increase the risk of dental caries in children, especially where the F⁻ concentration is too low. A key aspect of this study is that it contributes information about F⁻ and NO₃⁻ contamination in the water resources of the study area, as well as valuable evidence that may significantly influence how local authorities manage risk to reduce the adverse effects of toxic elements on citizens' health.
It should be noted that aspects of the hazard aggregation problem have been discussed to varying degrees by different authors (e.g., [15,18,19]), but in general, these functions are still in their infancy, especially those that address the last three dimensions. Table 1 lists selected cases of techniques used in DWQI and HHRI applications.

Description of the Maku-Bazargan-Poldasht Area
The Maku-Bazargan-Poldasht area lies in the north of West Azerbaijan Province, Iran, in the foothills of the Ararat mountain range (Figure 2). It is located between longitudes 44°21′ and 45°10′ E and latitudes 39°13′ and 39°34′ N. In the west, it is bordered by Turkey, whereas in the east, it is bordered by the Aras River. The area covers nearly 1600 km², of which up to one-fourth is covered with basaltic lavas. It has three main cities: Maku, Poldasht, and Bazargan. Temperatures range on average from −16.2 to 35.1 °C, and the mean annual precipitation is 300 mm, with the lowest and highest precipitation occurring in September and May, respectively. During a typical year, there is approximately 1500 mm of evaporation, about five times the mean annual precipitation. The Sari Su and Zangmar rivers are the two main rivers flowing through the study area.

The water resources of the Maku-Bazargan-Poldasht area are used mainly for agriculture, drinking, and industry. In addition, 12 large-scale springs and several withdrawal wells discharge groundwater [12]. According to geoelectrical surveying conducted within the Bazargan Plain area, the thickness of the basalt-alluvium aquifer is estimated to be about 150 m [12]. Most of the high-F⁻ water resources are found in rock formations formed by basaltic magma (Figure 3).
The Maku-Bazargan-Poldasht area is predominantly underlain by non-basaltic and basaltic aquifers. Prior reports have indicated high F⁻ concentrations in the aquifers of the Maku-Bazargan-Poldasht complex. The presence of F⁻ in some areas (called the mixing zone) is caused by the mixing of groundwater of basaltic and non-basaltic origins. Phyllite-schist and gneiss, which are the main water-bearing rocks in the region, have little primary porosity. Their secondary porosity, in the form of fissures or fractures, allows groundwater to move actively through the rock, so that these formations act as a groundwater reservoir. The majority of these zones are found in basaltic aquifers, and some are found in non-basaltic aquifers. As a result of drinking water from basalt springs and wells, residents in the region suffer from dental fluorosis [12].

Water Sampling and Analysis
Sixty samples were collected from springs, rivers, and wells in January 2021. These resources provide a large volume of water for consumption and irrigation. Electrical conductivity (EC) and pH were measured directly in the field during sample collection. Potassium (K⁺) and sodium (Na⁺) were measured with a flame photometer. A UV single-beam spectrophotometer (UV-1200, Labman Scientific Instruments Pvt. Ltd, Chennai, India) was used for sulfate (SO₄²⁻), NO₃⁻, nitrite (NO₂⁻), ammonium (NH₄⁺), and bromide (Br⁻). Bicarbonate (HCO₃⁻), carbonate (CO₃²⁻), chloride (Cl⁻), magnesium (Mg²⁺), and calcium (Ca²⁺) were analyzed by titration [29]. The F⁻ concentration was determined with an ion-selective electrode. The chemical analysis was validated using an ion balance: according to the principle of electroneutrality, the sums of cations and anions must be equal. The cation-anion balance error [30] was calculated as follows:

$$\mathrm{CBE\%} = \frac{C - A}{C + A} \times 100 \qquad (1)$$

where A and C are the concentrations of HCO₃⁻ + Cl⁻ + SO₄²⁻ and Ca²⁺ + Mg²⁺ + Na⁺ + K⁺, respectively, in meq/L. The accuracy of the ionic measurements was assessed through this charge balance error percentage (CBE%). A CBE% within the range of ±5% is accepted as indicating a good analysis [31].
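The ion-balance check described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the commonly used form CBE% = (C − A)/(C + A) × 100 with all concentrations already converted to meq/L; the function names and the sample values are hypothetical and are not taken from Table 2.

```python
def charge_balance_error(cations_meq, anions_meq):
    """Return the charge balance error in percent.

    Both arguments are iterables of concentrations in meq/L.
    Assumed form: (C - A) / (C + A) * 100.
    """
    c = sum(cations_meq)
    a = sum(anions_meq)
    if c + a == 0:
        raise ValueError("total ionic charge is zero; CBE is undefined")
    return (c - a) / (c + a) * 100.0


def is_acceptable(cbe_percent, tolerance=5.0):
    """True when the CBE lies within +/- tolerance percent."""
    return abs(cbe_percent) <= tolerance


# Hypothetical sample in meq/L (illustrative values only).
cations = [4.0, 3.0, 2.5, 0.5]  # Ca2+, Mg2+, Na+, K+
anions = [5.0, 2.5, 2.0]        # HCO3-, Cl-, SO4 2-
cbe = charge_balance_error(cations, anions)
print(f"CBE = {cbe:.2f}%, acceptable: {is_acceptable(cbe)}")
```

A sample whose CBE falls outside the ±5% window would normally be re-analyzed or excluded before any index is computed.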

Physicochemical Characteristics of Water Resources
Summary statistics of the physicochemical parameters of the water resources measured in the field and the laboratory are presented in Table 2. There was a major difference between the median and maximum values of Na⁺, Ca²⁺, Cl⁻, SO₄²⁻, NO₃⁻, NO₂⁻, and CO₃²⁻: the maximum values were more than five times the median values, implying the presence of some external contaminants in the groundwater [32]. The EC value varied between 525 and 5530 µS/cm, with an average value of 1503 µS/cm. According to the EC classification for water samples (i.e., fresh: <1500 µS/cm; brackish: 1500-3000 µS/cm; saline: >3000 µS/cm), 65% of the samples were fresh, 20% were brackish, and 15% were saline. The pH values of the water samples in the Maku-Bazargan-Poldasht area ranged from 7.37 to 8.3, indicating a neutral to slightly alkaline environment.
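As a minimal sketch of how the EC classes above translate into class shares, the following Python snippet assigns each sample to a class and reports the percentage per class. The handling of the exact boundary values (1500 and 3000 µS/cm) is an assumption, and the EC list is hypothetical rather than the study's data.

```python
from collections import Counter


def classify_ec(ec_us_cm):
    """Classify a sample by electrical conductivity in µS/cm.

    Boundary handling (1500 -> brackish, 3000 -> brackish) is an assumption.
    """
    if ec_us_cm < 0:
        raise ValueError("EC cannot be negative")
    if ec_us_cm < 1500:
        return "fresh"
    if ec_us_cm <= 3000:
        return "brackish"
    return "saline"


def class_shares(ec_values):
    """Return the percentage of samples in each EC class."""
    if not ec_values:
        raise ValueError("at least one EC value is required")
    counts = Counter(classify_ec(v) for v in ec_values)
    n = len(ec_values)
    return {label: 100.0 * counts[label] / n
            for label in ("fresh", "brackish", "saline")}


# Hypothetical EC readings in µS/cm (illustrative values only).
example = [525, 980, 1503, 2400, 5530]
shares = class_shares(example)
print(shares)
```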
According to the US EPA, all samples fell within the acceptable limits for pH. Na⁺ concentrations ranged between 16 and 1001 mg/L, with an average value of 221 mg/L. According to EPA standards [33], the maximum allowable concentration of Na⁺ in drinking water is 200 mg/L, and Table 2 shows that 24 sampling sites exceeded this threshold. Ca²⁺ concentrations ranged between 32 and 518 mg/L, with a mean concentration of 102 mg/L, and 10% of the samples exceeded the acceptable limit (i.e., 100 mg/L). Mg²⁺ and K⁺ concentrations varied between 11-245 mg/L and 3-70 mg/L, with mean values of 65 and 12 mg/L, respectively. In total, 95% of the samples exceeded the standard threshold of 30 mg/L. The Cl⁻ concentration in the water resources of the Maku-Bazargan-Poldasht area varied from 4 to 769 mg/L, with a mean of 132 mg/L, and about 15% of the samples exceeded the 250 mg/L drinking water guideline [33]. HCO₃⁻ and CO₃²⁻ concentrations showed wide ranges of 107-536.5 mg/L and 0-80.6 mg/L, respectively; there is no recommended value for either one. The SO₄²⁻ content ranged from 5.5 to 9079 mg/L, with an average of 1263 mg/L, and most of the samples (80%) were within the acceptable drinking limit of 250 mg/L. In this area, high levels of SO₄²⁻ may be attributed to low rainfall and strong evaporation as well as to an aquifer medium rich in sulfate. The SO₄²⁻ concentration was also affected by water-rock contact and by evaporation-induced enrichment. NO₂⁻ concentrations in the samples ranged from 0 to 4.79 mg/L, with about 95% of the samples exceeding the standard limit of 1 mg/L [33]. In summary, the average concentrations of the major cations were in the order Na⁺ > Ca²⁺ > Mg²⁺ > K⁺.
A correlation analysis was conducted to determine whether there was a consistent relationship between the hydrochemical parameters. The normality of the data was first tested in SPSS to decide whether a parametric or a nonparametric correlation approach was more appropriate. Because the hydrochemical data were not normally distributed, Kendall's correlation test, a nonparametric method, was applied.
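For readers who want to see what the rank-based test computes, the following is a minimal pure-Python sketch of Kendall's correlation coefficient. The tau-b variant (which corrects for ties) is assumed here, since the text does not state which variant was used; the function name and the example values are hypothetical.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b for two equally long sequences of numbers.

    Counts concordant and discordant pairs; pairs tied in only one
    variable enter the tie correction, pairs tied in both are ignored.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = ties_x = ties_y = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue
        if dx == 0:
            ties_x += 1
        elif dy == 0:
            ties_y += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denom = sqrt((untied + ties_x) * (untied + ties_y))
    if denom == 0:
        raise ValueError("tau-b is undefined when one variable is constant")
    return (concordant - discordant) / denom


# Hypothetical paired measurements (illustrative values only).
tau = kendall_tau_b([1, 2, 3, 4, 5], [3, 1, 2, 5, 4])
print(f"tau-b = {tau:.2f}")
```

Because the statistic depends only on the ordering of the values, it is unaffected by the skewed distributions that are typical of hydrochemical data.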

Multivariate Statistics
Pre-processing of the data (i.e., normalization, log transformation) was performed to standardize the measured water quality parameters and remove the impact of their diverse units on the multivariate statistics. Then, Pearson correlation coefficients between the water quality parameters were calculated to decipher the relationships between them. Significance (p value) and strength (r) were the essential factors when evaluating these relationships. The higher the r value, the stronger the relationship; in this study, r > 0.7 was considered a strong relationship, while 0.5 < r < 0.7 and r < 0.5 were deemed average and weak relationships, respectively. Factor analysis (FA) is usually utilized to determine hidden dimensions that may not be revealed by direct analysis. In total, 14 water quality parameters, including pH, EC, Ca²⁺, Mg²⁺, Na⁺, K⁺, HCO₃⁻, SO₄²⁻, Cl⁻, NO₃⁻, F⁻, NO₂⁻, Br⁻, and NH₄⁺, were considered when carrying out the FA. Kaiser's criterion and the varimax rotation technique [34] were used to improve factor loadings, achieve a simple structure, and find factors with eigenvalues greater than 1. Factor loadings greater than 0.75 were considered high, whereas factor loadings between 0.50 and 0.75 were considered medium [35]. As mentioned above, NO₃⁻ contamination was severe in the Maku-Bazargan-Poldasht area. The oxidative conditions of the water resources in the area facilitate the conversion of NO₂⁻ and NH₄⁺ contaminants to NO₃⁻ through nitrification [36]. The linear correlation between TDS and (NO₃⁻ + Cl⁻)/HCO₃⁻ [37] yielded a positive correlation coefficient of 0.7 (Figure 4), indicating that the water resources under study were contaminated by anthropogenic activities.
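The correlation-strength classes described above can be illustrated with a short Python sketch that computes Pearson's r and labels it. Two points are assumptions not settled by the text: the absolute value of r is used so that negative correlations are graded too, and the boundary values 0.5 and 0.7 are assigned to the lower class. The function names and sample values are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("r is undefined when one variable is constant")
    return sxy / sqrt(sxx * syy)


def strength(r):
    """Label a coefficient as strong (> 0.7), average (0.5-0.7) or weak."""
    size = abs(r)  # assumption: sign is ignored when grading strength
    if size > 0.7:
        return "strong"
    if size > 0.5:
        return "average"
    return "weak"


# Hypothetical log-transformed concentrations (illustrative values only).
r = pearson_r([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(f"r = {r:.2f} ({strength(r)})")
```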

The NO₃⁻ in water resources can result from anthropogenic and geogenic inputs. Water resources commonly contain nitrogen concentrations below 10 mg/L, and concentrations above this limit are considered anthropogenic. Figure 5 shows that most samples in the Maku-Bazargan-Poldasht area had NO₃⁻ concentrations exceeding the standard limit of 10 mg/L [33], suggesting that anthropogenic NO₃⁻ contamination affected water quality in the study area. Fluorosis is prevalent in tropical climates, although it is not confined to them. High F⁻ concentrations in water across wide geographical belts are related to: (i) sediments of marine origin in mountainous regions; (ii) igneous rocks; and (iii) gneissic and granitic rocks. A classic example of the first case extends from Iran and Iraq through Turkey and Syria to the Mediterranean region, from Algeria to Morocco [38]. The F⁻ contamination was as severe as the NO₃⁻ contamination in the study area. Approximately 50% of the sampling sites revealed F⁻ and NO₃⁻ concentrations higher than the recommendations given by [33] (Figure 5).
Studies have shown that approximately 90% of the F⁻ in drinking water is absorbed in the digestive system, while only 30-60% of the F⁻ in food is absorbed [33]. Therefore, there is a risk of skeletal and dental fluorosis at excessive F⁻ concentrations, e.g., between 1.5 and 5.0 mg/L. High levels of F⁻ in drinking water can also cause other diseases, such as hypertension, neurologic disorders, and Alzheimer's disease, posing a serious threat to human health [39]. Data and information from studies conducted by the Poldasht Health Center confirm the prevalence of bone fluorosis [40].
Water 2022, 14, x FOR PEER REVIEW 9 of 28
F⁻ concentrations often reflect the degree of water-rock contact because F⁻ mainly originates from geogenic sources [41,42]. The study region is primarily occupied by basalts, which contain a large amount of F⁻-bearing minerals [43]. This likely explains the elevated F⁻ concentrations in the study area. Compared with NO₃⁻ contamination, F⁻ contamination was also highly severe in the study area. The NO₃⁻ concentration in the Maku-Bazargan-Poldasht water resources ranged from 0.23 to 167 mg/L, with a mean of 32.2 mg/L.
The public health threshold for NO₃⁻ in drinking water set by the US EPA is 10 mg/L. Overall, 75% of the samples had NO₃⁻ concentrations that exceeded this standard (Table 2). In natural water resources, higher NO₃⁻ concentrations can have anthropogenic origins such as improper waste disposal, intensive agricultural practices, and animal waste [18,19,43]. Figure 5 presents a bar chart of the NO₃⁻ and F⁻ values of the study region relative to the US EPA limits. A wide variation in F⁻ concentrations was observed (Table 2), from 0.39 to 9.89 mg/L, with a mean of 2.94 mg/L. In most samples (54%), the F⁻ concentration exceeded the maximum allowable threshold (1.5 mg/L) for drinking water [33].

Drinking Water Quality Index (DWQI)
The Drinking Water Quality Index (DWQI) expresses the general quality of drinking water. This index is determined by standardizing each hydrogeochemical parameter [44]. The DWQI converts a sample's water quality parameters into a single score, and the water quality data are compared with the World Health Organization standards to check their suitability for drinking (Appendix A, Table A1). Each of the 14 parameters (EC, pH, major and minor ions, and nutrients) receives a weight depending on its relative importance for the general quality of drinking water. The DWQI is then calculated in four steps using Equations (2)-(5):

1. Assign a weight, $w_i$, to each drinking water constituent (i); these weights range from 1 (minimum value) to 5 (maximum value) and are assigned based on expert opinion. The weights utilized in this study are presented in Appendix A (Table A1).

2. Determine the relative weight, $W_i$, considering the number of elements (n):

$$W_i = \frac{w_i}{\sum_{i=1}^{n} w_i} \qquad (2)$$

3. Calculate the quality rating scale ($q_i$) of each parameter [45]:

$$q_i = \frac{c_i}{s_i} \times 100 \qquad (3)$$

where $c_i$ is the ith chemical concentration in the considered water sample (mg/L) and $s_i$ is the corresponding limit according to WHO standards (mg/L). The sub-index of the ith parameter ($SI_i$) can then be determined as follows:

$$SI_i = W_i \times q_i \qquad (4)$$

4. By calculating $SI_i$ for each parameter, the DWQI is determined using the following equation [46]:

$$\mathrm{DWQI} = \sum_{i=1}^{n} SI_i \qquad (5)$$
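The weighted-index procedure described in the steps above can be sketched as follows. This is a minimal illustration that assumes the widely used weighted arithmetic form (relative weight = assigned weight divided by the sum of weights, quality rating = concentration divided by standard times 100, index = sum of weight times rating). The weights, standards, and concentrations below are hypothetical placeholders and are not the values of Table A1.

```python
def dwqi(concentrations, standards, weights):
    """Weighted arithmetic drinking water quality index for one sample.

    All three arguments are dicts keyed by parameter name.
    Concentrations and standards share the same unit (mg/L).
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    index = 0.0
    for name, weight in weights.items():
        relative_weight = weight / total_weight                  # W_i
        rating = concentrations[name] / standards[name] * 100.0  # q_i
        index += relative_weight * rating                        # adds SI_i
    return index


# Hypothetical inputs (illustrative values only, not Table A1).
weights = {"F": 5, "NO3": 5, "Cl": 2}
standards = {"F": 1.5, "NO3": 50.0, "Cl": 250.0}
sample = {"F": 3.0, "NO3": 25.0, "Cl": 125.0}

score = dwqi(sample, standards, weights)
print(f"DWQI = {score:.1f}")
```

In this form, a sample whose every parameter sits exactly at its standard scores 100, so values above 100 signal that heavily weighted parameters exceed their limits.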

Human Health Risk Index (HHRI)
The health impact of water contaminated with toxic chemicals is assessed based on the model developed by the US Environmental Protection Agency [33]. In this regard, a risk assessment map of water resources may include important data to better address both qualitative and quantitative issues [47,48]. An HHRI describes the nature and likelihood of adverse health effects resulting from chemicals found in contaminated environmental media, which may be harmful to humans [49]. In general, oral exposure carries a greater risk than the dermal and inhalation pathways of exposure. Accordingly, a health risk evaluation of non-carcinogenic pollutants (e.g., NO₃⁻ and F⁻) is carried out [50,51]. The US EPA provides a "Regional Screening Levels (RSLs) for Chemical Contaminants" online calculator [13]. HQ values greater than 1 suggest an increased risk of developing non-carcinogenic consequences throughout life. The exposure to F⁻ and NO₃⁻ in these groups is estimated using Equations (6) to (10) [33], which build on the following relations:

$$CDI = \frac{C \times IR \times EF \times ED}{BW \times AT}$$

$$HQ_i = \frac{CDI_i}{RfD_i}$$

$$HI = \sum_{i} HQ_i$$

where CDI is the chronic daily intake through the oral pathway [mg/(kg × day)]; C represents the contaminant concentration (i.e., F⁻ and NO₃⁻) in the water resources (mg/L); IR is the ingestion rate (L/day; IR = 2.5 L/day for adults and 0.78 L/day for children); EF and ED are the exposure frequency (365 days/year) and exposure duration (the standard exposure suggested in the literature is 30 years for adults and 12 years for children), respectively; BW and AT are the average body weight (kg; BW = 57.5 kg and 18.7 kg for adults and children, respectively) and the average exposure time (days; AT = 23,360 days and 4380 days for adults and children, respectively); and HQᵢ and RfD are the hazard quotient of the ith pollutant and the reference dose for non-carcinogenic contaminants, respectively. The RfD values for F⁻ and NO₃⁻ are 0.04 and 1.6 mg/(kg × day), respectively [33]. HI is the hazard index that indicates the total non-carcinogenic risk.
Non-carcinogenic risk values above 1 indicate health risks, while those below 1 indicate no health risks from drinking water containing toxic elements [33]. A detailed list of non-carcinogenic health risks can be found in Appendix A (Tables A2 and A3). Water resources containing high levels of NO₃⁻ and F⁻ may pose high health risks to humans if used for long periods as drinking and bathing water [13,52]. Thus, these two contaminants were considered in assessing the non-carcinogenic risk for children and adults (i.e., females and males). More than 90% of the study region's population consumes untreated water resources for drinking. It was found that 55.81% and 65% of the sampling points exceeded the prescribed levels of NO₃⁻ and F⁻, respectively. Therefore, the consumption of such water in the region posed health risks to people of all ages.
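The exposure calculation can be sketched in Python using the parameter values quoted above. This is a minimal illustration that assumes the standard US EPA oral-intake form, CDI = (C × IR × EF × ED)/(BW × AT), with HQ = CDI/RfD and HI taken as the sum of the HQ values; the function and variable names are hypothetical, and the input concentrations are the mean F⁻ and NO₃⁻ values reported in the text, used purely as an example.

```python
# Reference doses in mg/(kg x day), as quoted in the text.
RFD = {"F": 0.04, "NO3": 1.6}

# Exposure parameters as quoted in the text.
ADULT = {"ir": 2.5, "ef": 365, "ed": 30, "bw": 57.5, "at": 23360}
CHILD = {"ir": 0.78, "ef": 365, "ed": 12, "bw": 18.7, "at": 4380}


def chronic_daily_intake(c, ir, ef, ed, bw, at):
    """Oral chronic daily intake in mg/(kg x day).

    c: concentration (mg/L), ir: ingestion rate (L/day),
    ef: exposure frequency (days/year), ed: exposure duration (years),
    bw: body weight (kg), at: averaging time (days).
    """
    return (c * ir * ef * ed) / (bw * at)


def hazard_quotient(cdi, rfd):
    """Ratio of the intake to the reference dose."""
    return cdi / rfd


def hazard_index(concentrations, group):
    """Sum of hazard quotients over all contaminants for one group."""
    return sum(
        hazard_quotient(chronic_daily_intake(c, **group), RFD[name])
        for name, c in concentrations.items()
    )


# Mean concentrations reported in the text (mg/L), used as an example.
mean_conc = {"F": 2.94, "NO3": 32.2}
hi_child = hazard_index(mean_conc, CHILD)
hi_adult = hazard_index(mean_conc, ADULT)
print(f"HI child = {hi_child:.2f}, HI adult = {hi_adult:.2f}")
```

With these inputs the children's index comes out higher than the adults' index, mainly because of the larger intake per unit of body weight.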
According to Table 3, most samples fell within the F⁻ concentration range of 1-3 mg/L (38.33%), followed by 3-4 mg/L (6.6%). Samples with F⁻ concentrations greater than 4 mg/L also accounted for a high percentage (33%) in the Maku-Bazargan-Poldasht region, which may cause dental fluorosis and joint stiffness and brittleness in the region. The NO₃⁻ concentrations of the samples showed that 21.6% of them were below the permissible limit, 25% were within the safe limit (NO₃⁻ < 10 mg/L), 58% were at health risk (NO₃⁻: 10-50 mg/L), and 8.33% were at a high health risk (NO₃⁻: 50-100 mg/L). In addition, there was a very high health risk from NO₃⁻ (>100 mg/L) in 8.33% of the samples, which can cause methemoglobinemia in children (6 months old) and abortion in pregnant women [53].

Information Fusion
According to Esteban et al. [54], it is essential to formulate a strategy before undertaking any information fusion in order to solve the problem efficiently and robustly. A data fusion architecture is a platform that connects databases with the help of data fusion techniques to create an integrated system. It is a mathematical model that serves as the basis for merging data from several sources into one. The methodology is goal-driven and combines low- and high-level information. The term information fusion refers to a variety of methods and approaches used to combine information in order to enhance quality, reduce uncertainty, or uncover novel knowledge or features from the collected data. Theoretically, information fusion combines data from a number of diverse data sources [11]. Typically, it can be characterized at the signal level, the pixel level, the feature level, or the top level [55], each with its own definitions and associated procedures. Fusion can also be categorized into low-level, medium-level, and high-level fusion [56]. Several techniques support information fusion, including statistical matching, grey relational analysis, moving average filters, and Bayesian inference [2,57].

Table 2 gives a comprehensive statistical summary of the physicochemical parameters (EC, pH, Ca²⁺, Mg²⁺, Na⁺, K⁺, NH₄⁺, Cl⁻, Br⁻, SO₄²⁻, NO₂⁻, NO₃⁻, and F⁻) for the 60 water samples, as well as their comparison with the drinking water quality limits set by the US EPA. The main factors contributing to the significant F⁻ concentration in water resources are low flow velocity, rock chemistry, long water-rock interactions [58], and high HCO₃⁻ and Na⁺ concentrations. According to the correlation analysis (Table 4), F⁻ concentrations were positively correlated with HCO₃⁻, Na⁺, and K⁺ concentrations.
Groundwater with dominant HCO₃⁻, Na⁺, and K⁺ concentrations originates from igneous rocks [59], so the correlation indicates that excessive F⁻ concentrations may have resulted from fluorine-bearing minerals associated with the source volcanic rocks as well as from the application of fertilizers and pesticides in the fields [60,61]. Generally, most ions were positively correlated with Cl⁻; Na⁺, Mg²⁺, and SO₄²⁻ in particular showed a strong correlation with Cl⁻, suggesting that they share the same saline-water origin [62] and that chemical weathering and the subsequent leaching of secondary salts are widespread. Chemical weathering, anthropogenic impacts, and salt leaching were the main factors contributing to the Cl⁻ contamination in the study area. F⁻ was poorly correlated with Ca²⁺ and Mg²⁺ and positively correlated with Na⁺, K⁺, and HCO₃⁻. It can therefore be concluded that high F⁻ concentrations occur in water with low Ca²⁺ and Mg²⁺ levels, as well as in water with high Na⁺ levels. The low Ca²⁺ levels resulted from intense cation exchange between Na⁺ and Ca²⁺. The high HCO₃⁻ concentration and alkaline pH of the samples resulted in the precipitation of Mg²⁺ as dolomite and Ca²⁺ as calcite. According to Sarma and Rao [63], this process leads to a higher concentration of Na⁺ in water resources. HCO₃⁻ is highly correlated with F⁻, indicating that volcanic rock weathering is the major cause of F⁻ enrichment [60]. Low levels of Ca²⁺ and Mg²⁺ in the groundwater of the area may be contributing to the high F⁻ concentrations in the water resources.

Factor Analysis (FA)
Factor analysis (FA) has been successfully used by different authors to assess water quality and chemistry [64], since it helps in distribution analysis as well as in tracing the source(s) of the chemical components in water [65]. The factor analysis for the physicochemical parameters in the water resources of the Maku-Bazargan-Poldasht region is given in Table 5. A total of four components were extracted, which accounted for 81.83% of the variance in the data. Because unrotated factor loadings are difficult to interpret, the factors were rotated, and the rotated factor matrix for the parameters studied can be found in Table 5. The results showed that FA1 described 35.86%, FA2 described 18.07%, FA3 described 10.45%, and FA4 described 7.51% of the total variance. With a variance of 35.86%, FA1 was positively and considerably related to EC, Na⁺, K⁺, Ca²⁺, Mg²⁺, Cl⁻, and SO₄²⁻ concentrations. These associations indicated: (i) the interaction between water and rocks in the study area; and (ii) the general trend of dissolution in waters within the study area. This interaction was not limited to one site; rather, flow through the aquifer encouraged further interactions and dissolution along the flow path. With a total variance of 18.07%, FA2 was associated with the concentrations of F⁻ and HCO₃⁻, and the F⁻ anomalies resulted predominantly from geogenic processes. The F⁻ concentration in the samples with a high HCO₃⁻ concentration was higher than in those with a low HCO₃⁻ concentration. FA3, with a total variance of 10.45%, correlated well with pH and Br⁻ concentrations. The negative loading of NO₃⁻ on this factor indicated anoxic conditions in the study area [66], under which denitrification and NO₃⁻ reduction are geochemically related [67].
A major source of anthropogenic NO 3 − and nitrite is artificial fertilizers, and various industrial processes also produce NO 3 − in their waste streams. In this study, the spatial distributions of nitrate, nitrite, and ammonium were investigated. High-NO 3 − , low-NO 2 − , and high-NH 4 + water resources were observed. The proportions of high-NO 3 − and high-NH 4 + water resources in urbanized areas were nearly or more than twice those in non-urbanized areas (Figure 6). High NO 3 − levels in the Maku-Bazargan-Poldasht aquifers probably originated mainly from industrialization accompanied by wastewater leakage. Urbanization accompanied by the leakage of domestic sewage is likely to be another main driving force for high NO 3 − levels in the water resources. The high loading of NO 3 − ions indicated anthropogenic input to the system via the leaching of fertilizers from farming regions, which is linked to the interaction of surface water with the geological formations in the area. Br − can originate from old rivers and seas, as well as from animal waste; it can strongly affect the quality of the water supply, and the resulting contamination is mainly caused by farming-related human activities, with slight influences from domestic sewage. FA4, which accounts for 7.51% of the total variance, is related to the concentration of NO 2 − .

DWQI
The DWQI was employed to assess the quality of the water resources for drinking purposes in the Maku-Bazargan-Poldasht area. Unlike previous studies on water quality, which used a common classification for drinking purposes, this study determined the ranges between excellent and unsuitable water quality based on a rational classification. The DWQI was assigned to the excellent water quality class if it was smaller than the minimum of the data used to calculate the index; the good class if it lay between the US EPA standards and the average of the data used; the poor class if it ranged from the safe limit to the average of the data; and the unsuitable class if it lay between the average and the maximum of the data. The calculated DWQI ranged from 51.19 to 2200, with a mean of 502.71. In total, 28 samples (46%) were classified as poor and 22 samples (36%) as unsuitable, while the remaining 10 samples were classified as good for drinking (Table 6). The water samples were analyzed for F − and NO 3 − concentrations, and the rational (i.e., proposed classification) and conventional DWQI values were then compared as follows: (i) classify the NO 3 − concentrations into three categories, safe (<10 mg/L), health risk (10-50 mg/L), and high health risk (>50 mg/L), and the F − concentrations into safe (<1 mg/L), dental fluorosis (1-4 mg/L), and knee defects and crippling fluorosis (>4 mg/L); (ii) classify the DWQI (conventional and rational) into "Good", "Poor", and "Unsuitable" bands; (iii) assign a score of three to a sample if the difference between the category of the F − or NO 3 − concentration and the DWQI band is zero, and scores of two or one when the difference is one or two, respectively; and (iv) add the scores for the DWQI and calculate the Correlation Index (CI).
Consider the following example of obtaining the CI for the rational (proposed classification) DWQI. There were 38 and 30 samples in the same class, 19 and 24 samples with a difference of one category, and 3 and 6 samples with a difference of two categories for the F − -DWQI and nitrate-DWQI comparisons, respectively. A higher CI means a higher correlation. The coincidence of the water samples (the F − and NO 3 − concentrations) and the predicted DWQI categories is presented in Table 6. Ultimately, this indicated that a higher percentage of the water samples from the study region were unsuitable in terms of quality. The spatial distribution of the DWQI (Figure 7) showed that the east and southeast of the area had a high DWQI compared to the north and west of the Maku-Bazargan-Poldasht area. The water quality along the Zangmar and Sari Su rivers has deteriorated in recent years, and the worst quality for drinking occurred at the confluence of the two rivers with the Aras River.
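
The scoring in steps (i) to (iii) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the handling of values that fall exactly on a class boundary is an assumption, and only the summed agreement score is computed because the final CI formula is not given in this section.

```python
def nitrate_class(conc_mg_l):
    """0 = safe (<10 mg/L), 1 = health risk (10-50), 2 = high health risk (>50)."""
    if conc_mg_l < 10:
        return 0
    return 1 if conc_mg_l <= 50 else 2  # boundary handling is an assumption

def fluoride_class(conc_mg_l):
    """0 = safe (<1 mg/L), 1 = dental fluorosis (1-4), 2 = crippling fluorosis (>4)."""
    if conc_mg_l < 1:
        return 0
    return 1 if conc_mg_l <= 4 else 2  # boundary handling is an assumption

# DWQI bands mapped to the same 0-2 scale as the concentration classes.
DWQI_BANDS = {"Good": 0, "Poor": 1, "Unsuitable": 2}

def agreement_score(conc_class, dwqi_class):
    """3 for the same category, 2 for a difference of one, 1 for a difference of two."""
    return 3 - abs(conc_class - dwqi_class)

def total_score(n_same, n_diff_one, n_diff_two):
    """Sum of agreement scores given the number of samples at each difference."""
    return 3 * n_same + 2 * n_diff_one + 1 * n_diff_two

# Sample counts reported in the text for the rational classification.
fluoride_total = total_score(38, 19, 3)  # 155
nitrate_total = total_score(30, 24, 6)   # 144
```

With the counts reported above, the F − comparison accumulates a higher total score than the nitrate comparison, that is, the F − categories coincide more closely with the DWQI bands.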

Non-Carcinogenic Health Risk Assessment
Non-carcinogenic hazards are mainly associated with the consumption of potable water and contact with the skin. Three factors influence the CDI values: the concentration of the contaminants, the rate at which the water is ingested, and the individual's body weight. The CDI values in children are comparatively higher than those in adults. The child's HQ oral intake values range from 0.27 to 6.82, and the adult's HQ oral intake ranges from 0.31 to 7.95 (with an average of 2.94). In the case of children, the dermal intake ranges from 0.002 to 1.7, and in the case of adults, the dermal intake ranges from 0.001 to 0.96 (with an average of 0.18). The spatial distribution of human health risk for both children and adults (Figure 8) indicates that high HI values (i.e., high HHR) prevail in the southeast and in patches in the west of the study area. Health risks were assessed using the model developed by the US EPA. A summary of the calculated non-carcinogenic health risks posed by NO 3 − and F − contamination of drinking water for adults and children is presented in Table 6 and Figure 9. It gives the oral and dermal intake for each group of inhabitants in the studied region, as well as the corresponding total hazard index (THI). For the dermal pathway, the HQ values ranged from 0.002 to 1.7 in children and from 0.0013 to 3.2 in adults. The mean dermal contact values for children and adults were 0.32 and 0.18, respectively.
The hazard index values (HI = HQ Oral + HQ Dermal) ranged from 0.000014 to 7.2 for children and from 0.0000062 to 3.2 for adults. Because the body weight of a child is lower than that of an adult, the estimated hazard quotient for children is higher than for adults, and the non-carcinogenic risks to children in this region were higher than those for adults. The mean values for all the groups of people (i.e., children and adults) were, however, within the allowable limits (HI < 1) [53]. According to the results, the majority of the samples were not fit for direct human consumption, as they posed unacceptable non-carcinogenic health risks to both adults and children. Through the ingestion pathway, infants are the most vulnerable group of people. It is evident from the hazard index that the majority of the samples (i.e., 72% and 60%) may pose a risk to adults and children, respectively. Immediate remedial steps are needed in this region to prevent the residents from being exposed to NO 3 − and F − through ingestion. Moreover, the results of the total risk via ingestion and dermal contact showed that ingestion was the predominant pathway. Different strategies can be used to reduce the risk of dental fluorosis, including (a) the use of alternative water sources, (b) improving nutrition, and (c) the defluoridation of water. The defluoridation methods can be divided into adsorption [68][69][70][71][72], precipitation/coagulation [73], electrocoagulation [74][75][76][77][78], and nanofiltration [79][80][81]. In addition, F − -resistant bacteria play a crucial role in the bioremediation and biotransformation of anions, converting them into less available and less harmful forms.
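
The relationship between concentration, intake rate, body weight, and the hazard index described in this section can be sketched in Python. This is a simplified illustration under stated assumptions: the CDI expression omits exposure frequency, exposure duration, and averaging time; the intake rates, body weights, and the dermal HQ values are hypothetical; and the reference dose of 0.06 mg/kg/day for F − is a commonly cited US EPA figure that is assumed here rather than taken from this section.

```python
def chronic_daily_intake(conc_mg_l, intake_l_day, body_weight_kg):
    """Simplified oral CDI in mg/kg/day: concentration x intake rate / body weight."""
    return conc_mg_l * intake_l_day / body_weight_kg

def hazard_quotient(cdi, reference_dose):
    """HQ is the ratio of the chronic daily intake to the reference dose (RfD)."""
    return cdi / reference_dose

def hazard_index(hq_oral, hq_dermal):
    """HI = HQ Oral + HQ Dermal, as defined in the text."""
    return hq_oral + hq_dermal

RFD_FLUORIDE = 0.06  # mg/kg/day, assumed value (not stated in this section)

# Hypothetical exposure parameters; NOT the values used in the study.
CHILD = {"intake_l_day": 1.0, "body_weight_kg": 15.0}
ADULT = {"intake_l_day": 2.0, "body_weight_kg": 70.0}

fluoride_mg_l = 2.0  # hypothetical sample concentration

hq_child = hazard_quotient(
    chronic_daily_intake(fluoride_mg_l, **CHILD), RFD_FLUORIDE)
hq_adult = hazard_quotient(
    chronic_daily_intake(fluoride_mg_l, **ADULT), RFD_FLUORIDE)

# Dermal HQ values below are placeholders to show how HI is assembled.
hi_child = hazard_index(hq_child, 0.3)
hi_adult = hazard_index(hq_adult, 0.2)
```

For the same water sample, the lower body weight of the child gives a higher intake per kilogram and therefore a higher HQ than for the adult, which is the mechanism the text refers to.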

Data Fusion
The main data integration objectives in this research are: (i) refining the data and improving data quality; (ii) deriving additional inferences and gaining more value from the data; and (iii) improving understanding and decisions. To incorporate datasets from numerous sources, specialized data fusion techniques can be incorporated into the HHRI framework recommended earlier by the National Academy of Sciences [82]. In this study, information fusion was performed to combine the index values of the DWQI and HHRI indicators and produce a comprehensive risk map, following the scheme outlined in [83]. The data-fused HI (i.e., aggregating the HI for both children and adults) values ranged between 0.01 and 0.99 in the Maku-Bazargan-Poldasht area. The spatial distribution of the fused HI (Figure 9a) shows that the water resources of the southeast regions had a greater health hazard, followed by the west of the area.
Total data-fused HI values varied from 0.21 to 0.96. Based on the aggregated HI (i.e., combining the DWQI and data-fused HI at Strategy 1), the southeast of the area bore the highest risk to the people consuming water. On the other hand, aggregating the DWQI with the HI may decrease the health risk in the central parts of the Maku-Bazargan-Poldasht area, even though there is a greater risk in Strategy 1 than in Strategy 2. The aggregated index was compiled from the information from Strategy 1 by implementing an unsupervised learning scheme, which was shown to capture information on adverse water quality and the health risks associated with water of poor quality.
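
The general idea of combining two indices on different scales into one value between 0 and 1 can be illustrated with a short Python sketch. This section does not spell out the fusion algorithm, so the following is only a stand-in: it rescales each index with min-max normalization and averages them per sample, which is a simple aggregation and not the unsupervised learning scheme used in the study. All names and values are hypothetical.

```python
def min_max(values):
    """Rescale a series linearly so its minimum maps to 0 and its maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("cannot rescale a constant series")
    return [(v - lo) / (hi - lo) for v in values]

def fuse(*indices):
    """Average several index series, sample by sample, after min-max rescaling."""
    scaled = [min_max(series) for series in indices]
    return [sum(vals) / len(vals) for vals in zip(*scaled)]

# Hypothetical index values at four sampling sites; NOT study data.
dwqi = [60.0, 300.0, 900.0, 2100.0]
hi_children = [0.3, 1.1, 2.5, 6.8]

fused = fuse(dwqi, hi_children)
```

Sites that score high on both indices end up near 1 and sites that score low on both end up near 0, so the fused series can be mapped on a single color scale.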

Performance Metrics
The Receiver Operating Characteristic (ROC) curve and the Area Under the Curve (AUC) can be used to measure the accuracy of a diagnostic system [84]. They were recently used to evaluate the accuracy of a groundwater vulnerability map [18]. Diagnostic outcomes fall into four groups: True Positive (TP), False Positive (FP), True Negative (TN), and False Negative (FN). The ROC curve plots the true positive rate against the false positive rate, and desirable performance bends the curve towards the upper left corner. The AUC quantifies this as the ratio of the area under the ROC curve to the whole area and varies between 0.5 and 1; values of 0.5 and 1 indicate poor and perfect performance, respectively, so the closer the AUC is to one, the higher the accuracy of the model. Table 7 presents the AUC values of both Strategy 1 and Strategy 2. The AUC improved from Strategy 1 (0.92) to Strategy 2 (0.98). Figure 10 shows the ROC curves for both strategies, obtained by plotting TPR (sensitivity) against FPR (1 − specificity). As shown in Figure 10, Strategy 2 has the larger area under the curve and hence the higher accuracy. These results provide evidence of the feasibility of aggregated indices.
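
The construction of a ROC curve and its AUC can be sketched in plain Python. The function names, labels, and scores below are hypothetical and serve only to illustrate the metric; they are not the study's data or its evaluation code. The curve is built by sweeping a threshold over the predicted scores, and the area is obtained with the trapezoidal rule.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) pairs from sweeping a threshold over the scores.

    labels: 1 for a positive (at-risk) sample, 0 for a negative one.
    scores: predicted index values; higher means more likely positive.
    """
    pos = sum(labels)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative sample")
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        # Process all samples sharing the same score as one threshold step.
        j = i
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1] == 1:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / neg, tp / pos))
        i = j
    return points

def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area

# Hypothetical observed classes and predicted index values for six sites.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

area = auc(roc_points(labels, scores))  # 8/9 for this toy data
```

In this toy example, eight of the nine positive-negative pairs are ranked correctly, which gives an AUC of 8/9; a model that ranks every positive site above every negative site reaches 1.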

Figure 10. The ROC curves for Strategies 1 and 2.

Conclusions
This study evaluated water quality and human health risks, considering the hydrogeological and hydrochemical properties of Maku-Bazargan-Poldasht, Iran. The water quality analysis showed that F − and NO 3 − concentrations were higher than the permissible level for drinking. A multivariate analysis combining factor analysis and correlations revealed that both geogenic and anthropogenic agents significantly impacted the quality of the water resources in the study area. Using the US EPA water quality standards for elements in drinking water, this study modified the water quality index classes for the first time. The DWQI results indicated that most of the study area fell within the poor or unsuitable drinking water classes. Based on the CI and on the comparison of how well the rational (proposed) and conventional classifications delineated suitable and unsuitable areas, the rational classification of drinking water quality categories was more accurate than the conventional one. Health risk results demonstrated a considerable non-carcinogenic health risk due to high NO 3 − and F − exposure through drinking water. Children were found to be more vulnerable than adults. A fusion model based on the DWQI and HHRI was developed for the rapid assessment of water quality and health safety. The northwest, southeast, and central portions of Maku-Bazargan-Poldasht were considered to be the most unsafe regions in the study area. A high level of NO 3 − and NH 4 + pollution occurred in the study area, and since there is no effective control and treatment in such a rapidly urbanized region, this pollution is bound to get worse in the future.
For both newly urbanized and long-urbanized areas, especially in developing countries, there is a need for the long-term monitoring of NO 3 − and NH 4 + in the area's water resources. These results suggest that immediate intervention by the governing bodies is required in these areas. Furthermore, the results showed that alternative drinking water sources should be arranged, and people in the affected areas must be made aware of the quality of the water they consume.

Data Availability Statement: The study did not report any data.

Acknowledgments:
The authors would like to thank the Center for International Scientific Studies and Collaboration (CISSC), Ministry of Science, Research and Technology, for their support with this research.

Conflicts of Interest:
The authors declare no conflict of interest.