Spatiotemporal Analysis of Surface Water Quality in Dong Thap Province, Vietnam Using Water Quality Index and Statistical Approaches

Abstract: This study spatiotemporally analyzed surface water quality and identified the locations and critical variables influencing it, using 2019 water monitoring data from the Department of Natural Resources and Environment of Dong Thap province. The water quality parameters, including turbidity, pH, temperature, dissolved oxygen (DO), total suspended solids (TSS), biological oxygen demand (BOD), chemical oxygen demand (COD), nitrite (N-NO₂⁻), nitrate (N-NO₃⁻), ammonium (N-NH₄⁺), total nitrogen (TN), orthophosphate (P-PO₄³⁻), chloride (Cl⁻), oil and grease, sulfate (SO₄²⁻), coliforms, and Escherichia coli (E. coli), were collected at 58 locations four times per year (February, May, August, and November). These parameters were compared with the national technical regulation on surface water quality, QCVN 08-MT: 2015/BTNMT. The water quality index (WQI) was calculated and presented spatially using a geographical information system (GIS) tool. Pearson correlation analysis, cluster analysis (CA), and principal component analysis (PCA) were used to evaluate the correlation among water quality parameters, group and reduce the sampling sites, and identify key parameters and potential water pollution sources. The results showed that TSS, BOD, COD, N-NH₄⁺, P-PO₄³⁻, coliforms, and E. coli were the significant concerns impairing the water quality. WQI analysis rated the water quality from poor to medium. CA suggested that the current monitoring locations could be reduced from 58 sites to 43 sites, which could save up to 25.85% of the total monitoring budget. PCA showed that temperature, pH, TSS, DO, BOD, COD, N-NH₄⁺, N-NO₂⁻, TN, P-PO₄³⁻, coliforms, and E. coli were the key parameters influencing water quality in Dong Thap province's canals and rivers; thus, these parameters should be monitored annually.
The water pollution sources were possibly hydrological conditions, water runoff, riverbank erosion, domestic and urban activities, and industrial and agricultural discharges. Significantly, the municipal and agricultural wastes could be decisive factors to the change of surface water quality in the study area. Further studies need to focus on identifying sources of water pollution for implementing appropriate water management strategies.


Introduction
Rivers play an essential role in creating habitats for many organisms and providing water for human activities. However, wastewater discharged from industrial, urban, and other activities is a constant source of pollution, while surface water quality also changes seasonally. The flow of the main Mekong River in Vietnam is divided into two distinct seasons: flood and dry. The flood season is characterized by an enormous flow of 38,000-40,000 m³/s, which floods about 1.2-1.9 million ha to depths of 0.5 to 4.5 m. In contrast, the dry season flow is 2000-2400 m³/s, which makes water supply difficult for agricultural production in the Winter-Spring and Summer-Autumn crops [1]. The Vietnamese Mekong Delta is at risk of facing a lack of surface water.
Aquaculture has also been considered the second strength of the province after rice cultivation, and the province ranks first in the country in export volume of pangasius. Land use comprises about 2602 km² of agricultural land, 111 km² of forest land, 257 km² of special-use land, and 146 km² of residential land. The climate is tropical, hot, and humid, and is strongly influenced by seasonal monsoons, with two main seasons each year: rainy and dry. The annual average temperature of the province ranged from 26 to 27 °C, with an average temperature variation of 3-4 °C. The average annual rainfall was up to 1500 mm, and the long-term average relative humidity was 82-83%. Therefore, water quality can be affected by anthropogenic sources, mainly agriculture, aquaculture, and population. In addition, the natural impacts recorded in Dong Thap at the beginning of the rainy season are alluvial water and acid sulfate water (water washing away acid sulfate materials on the soil surface), and at the end of the rainy season they are alluvial water and water flowing from upstream (for example, from Cambodia and Laos).

Water Sampling and Analysis
Data on seventeen water quality indicators at 58 sampling sites were collected by the Department of Natural Resources and Environment of Dong Thap province, Vietnam. The People's Committee of Dong Thap authorizes this department to monitor the environment, including water, soil, sediment, and air quality, in the province. The characteristics of the waste sources, as well as the purposes of water use (domestic, agriculture, industry, aquaculture), form the basic objectives of the water quality monitoring program in Dong Thap province. The observed water quality parameters comprised temperature (°C), pH, turbidity (NTU), dissolved oxygen (DO) (mg L⁻¹), total suspended solids (TSS) (mg L⁻¹), BOD (mg L⁻¹), COD (mg L⁻¹), N-NO₂⁻ (mg L⁻¹), N-NO₃⁻ (mg L⁻¹), N-NH₄⁺ (mg L⁻¹), TN (mg L⁻¹), P-PO₄³⁻ (mg L⁻¹), Cl⁻ (mg L⁻¹), SO₄²⁻ (mg L⁻¹), oil and grease (mg L⁻¹), coliforms (MPN/100 mL), and E. coli (MPN/100 mL). The Mekong Delta is located in the central tropical monsoon region of Asia; its climate is divided into the rainy season (May-October) and the dry season (November-April of the following year). Samples were collected four times in 2019 (February, May, August, and November). Specifically, the sampling months were divided into the dry season (February and November) and the rainy season (May and August). The monitoring locations were mostly located along the Tien River, the Hau River, and infield canals in Dong Thap province, as shown in Figure 1. The description of the sampling sites is provided in the supplementary file (Table S1). Sampling, storage, and analysis followed the guidelines in [16]. Turbidity, pH, temperature, and DO were determined in situ with hand-held devices.

Data Analysis
The water quality parameters were compared with QCVN 08-MT: 2015/BTNMT (National technical regulation on surface water quality) [9]. The water quality index (WQI) was calculated following the guidance of the Vietnam Environment Administration (2019) [17] and presented as a map using QGIS version 3.14 (Open Source Geospatial Foundation, OSGeo, Chicago, IL, USA). The map colors were assigned according to the resulting WQI classes. Descriptive statistics, boxplots, one-way ANOVA (with Duncan's post-hoc test), and Pearson correlation analysis were performed using SPSS software (version 20.0, IBM Corp., Armonk, NY, USA).

The parameters used to calculate WQI in the 2019 guidance of the Vietnam Environment Administration are divided into five groups: the pH group, the pesticide group (9 parameters), the heavy metal group (7 parameters), the organic and nutrient group (8 parameters), and the microbiological group (2 parameters). The data must satisfy two conditions: (1) at least three of the five parameter groups must be included in the calculation, and (2) the organic and nutrient group must contain at least three parameters. The data set in this study satisfied these conditions for calculating the WQI. However, under the guidance of the Vietnam Environment Administration, turbidity, TSS, Cl⁻, SO₄²⁻, TN, TP, and oil and grease are not used in the calculation; therefore, the calculated data set included only 10 of the 17 analyzed parameters. WQI values were calculated by formula (1), where WQI_a is the calculated WQI value for the parameters DO, BOD, COD, N-NH₄⁺, N-NO₂⁻, N-NO₃⁻, and P-PO₄³⁻; WQI_b is the calculated WQI value for coliforms and E. coli; and WQI_pH is the calculated value for pH. The WQI results provide general information on suitable water uses at the monitoring sites.
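The aggregation step can be illustrated with a minimal sketch. It assumes the three-group form WQI = (WQI_pH / 100) × [(mean of WQI_a)² × (mean of WQI_b)]^(1/3), which is an assumption about how the guidance [17] reduces when only the pH, organic and nutrient, and microbiological groups are present; the function name and the sub-index values are hypothetical and should be checked against the guidance.

```python
def aggregate_wqi(wqi_ph, wqi_a, wqi_b):
    """Combine sub-indices (each on a 0-100 scale) into one WQI value.

    Assumed form (not confirmed by the text above):
        WQI = (WQI_pH / 100) * [(mean WQI_a)^2 * (mean WQI_b)]^(1/3)
    wqi_a: sub-indices of the organic and nutrient group (DO, BOD, COD, ...)
    wqi_b: sub-indices of the microbiological group (coliforms, E. coli)
    """
    if not wqi_a or not wqi_b:
        raise ValueError("each parameter group needs at least one sub-index")
    mean_a = sum(wqi_a) / len(wqi_a)
    mean_b = sum(wqi_b) / len(wqi_b)
    return (wqi_ph / 100.0) * (mean_a ** 2 * mean_b) ** (1.0 / 3.0)


# Hypothetical sub-index values for one site, for illustration only.
site_wqi = aggregate_wqi(
    wqi_ph=100,
    wqi_a=[75, 50, 50, 50, 25, 100, 50],  # seven organic/nutrient sub-indices
    wqi_b=[50, 25],                       # coliforms, E. coli
)
print(round(site_wqi))
```

Because the organic and nutrient mean enters squared, this group weighs more heavily in the result than the microbiological group, while the pH sub-index scales the whole value.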
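Pearson correlation analysis is a preliminary descriptive technique to estimate the degree of association among the variables involved in the study. The Pearson correlation is calculated by formula (2):
$$ r = \frac{n\sum X_i Y_i - \sum X_i \sum Y_i}{\sqrt{\left[n\sum X_i^2 - \left(\sum X_i\right)^2\right]\left[n\sum Y_i^2 - \left(\sum Y_i\right)^2\right]}} \qquad (2) $$
in which r is the Pearson correlation coefficient between parameter X and parameter Y, n is the number of observations, X_i is the value of X for the ith observation, and Y_i is the value of Y for the ith observation.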
These values vary from −1 to 1, and the sign of each correlation coefficient indicates whether the correlation between the parameters is positive or inverse. The correlation is stronger as the coefficient approaches −1 or 1. The correlation is moderate when the absolute value of the coefficient is between 0.3 and 0.5, strong when it is higher than 0.5, and low when it is below 0.3 [18,19].
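The coefficient and the strength classes described above can be sketched in a few lines of Python. The function names are hypothetical, and the treatment of a coefficient of exactly 0.3 as moderate is an assumption, since the text leaves that boundary open.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_xx = sum(a * a for a in x)
    sum_yy = sum(b * b for b in y)
    denominator = sqrt((n * sum_xx - sum_x ** 2) * (n * sum_yy - sum_y ** 2))
    if denominator == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return (n * sum_xy - sum_x * sum_y) / denominator


def correlation_strength(r):
    """Classify |r| as low (< 0.3), moderate (0.3 to 0.5), or strong (> 0.5)."""
    magnitude = abs(r)
    if magnitude < 0.3:
        return "low"
    if magnitude <= 0.5:
        return "moderate"
    return "strong"


# Illustrative values only: a perfectly inverse relationship.
r = pearson_r([1, 2, 3], [3, 2, 1])
print(r, correlation_strength(r))  # -1.0 strong
```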
Principal component analysis (PCA) was used to determine the main water parameters driving the variation of the data set. This method reduces the baseline parameters that do not contribute significantly to data variability while creating a new set of variables called principal components (PCs) or factors. The eigenvalue of each factor is used to select the principal components: the larger the eigenvalue, the greater the contribution to explaining the variation of the original data set. Varimax rotation was used in the PCA, so that each initial variable is assigned to a factor and each factor represents a subset of the initial variables. Correlations between the principal components and the original variables are indicated by the weighted correlation coefficients (loadings) [11].
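In standard notation, the role of the eigenvalues can be written out as follows. The symbols are not taken from the original text: R is assumed to be the correlation matrix of the p standardized water quality variables, and the loadings shown are the unrotated ones, before the Varimax step.

```latex
R\,v_j = \lambda_j v_j, \qquad \lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_p \ge 0,
\qquad
\text{variance explained by PC}_j = \frac{\lambda_j}{\sum_{k=1}^{p}\lambda_k} = \frac{\lambda_j}{p},
\qquad
\ell_{ij} = \sqrt{\lambda_j}\, v_{ij}
```

Here v_j is the unit-length eigenvector defining the jth component, and the loading ℓ_ij is the correlation between variable i and component j. The eigenvalues sum to p because the trace of a correlation matrix equals the number of variables.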
In addition, cluster analysis (CA) was performed to group the sampling locations based on the similarity of their water properties. The analysis makes no prior assumptions about the similarity of the locations; clusters were formed statistically at (D_link/D_max) × 100 < 60, where D_link is the linkage distance for an individual case and D_max is the maximum linkage distance. The number of clusters was determined from the actual data of this study. Ward's method with Euclidean distance was used as the measure of similarity [10]. CA and PCA were performed using the licensed software Primer 5.2 for Windows (PRIMER-E Ltd., Plymouth, UK).
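The cut-off rule can be sketched as follows, assuming the linkage distances of the dendrogram are already available (for example, exported from the clustering software) and are non-decreasing from one merge to the next, as is the case for Ward's method. The function names and the distances are hypothetical.

```python
def rescaled_distances(linkage_distances):
    """Express each linkage distance as a percentage of the maximum one."""
    d_max = max(linkage_distances)
    return [100.0 * d / d_max for d in linkage_distances]


def clusters_at_cutoff(n_sites, linkage_distances, cutoff=60.0):
    """Count the clusters left when only merges below the cut-off are kept.

    A full dendrogram of n_sites has n_sites - 1 merges; every accepted
    merge, with (D_link / D_max) * 100 < cutoff, reduces the count by one.
    """
    if len(linkage_distances) != n_sites - 1:
        raise ValueError("expected n_sites - 1 linkage distances")
    accepted = sum(1 for d in rescaled_distances(linkage_distances) if d < cutoff)
    return n_sites - accepted


# Hypothetical dendrogram for five sites (four merges).
print(clusters_at_cutoff(5, [1.0, 2.0, 3.5, 10.0]))  # 2
```

Sites that fall in the same cluster at this threshold are candidates for consolidation, which is the basis for reducing the number of monitoring locations.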

Summary of Surface Water Quality in Dong Thap Province in 2019
The mean water temperature in 2019 ranged from 29.56 ± 1.05 °C to 31.08 ± 1.09 °C (Figure 2). ANOVA showed a statistically significant difference in temperature between the observed months (p < 0.05). The temperature recorded in November was higher than that in February, May, and August. According to previous studies, water temperature in Bung Binh Thien, canals in An Giang, and the main rivers and tributaries of Can Tho did not differ significantly from that in the study area [20-22]. This may be because large volumes of water buffer temperature changes, especially in large, deep canals and rivers.
The pH values differed significantly between the wet season (May, August) and the dry season (February, November) (p < 0.05). This is consistent with the seasonal distribution of pH in the Mekong Delta. Monthly mean pH values ranged from 7.15 ± 0.20 to 7.36 ± 0.27 (Figure 2), similar to values reported for comparable water bodies [20-22], and were within the allowable range of QCVN 08-MT: 2015/BTNMT (6.5-8.5).
Turbidity varied seasonally across February, May, August, and November, with average values of 26.63 ± 9.47 NTU, 63.98 ± 20.78 NTU, 59.86 ± 10.49 NTU, and 44.42 ± 13.13 NTU, respectively. The results showed a statistically significant difference (p < 0.05) between May and both February and November. In contrast, there was no difference between May and August (p > 0.05) (Figure 2). High turbidity during the rainy season can be caused by runoff from frequent and heavy rainfall. During the rainy season, sediment carried from upstream, coupled with rainfall-driven erosion of both riverbanks, can increase turbidity [23]. In addition, organic impurities, insoluble inorganic matter, and microplankton also contribute to high turbidity. Previous studies have likewise reported that water turbidity varied considerably between surveys [20,24,25].
Moreover, the concentration of suspended clay particles also affects the TSS in the water. TSS formed by plankton is beneficial, whereas TSS from suspended clay particles is detrimental. In this study, TSS also showed considerable seasonal variation, ranging from 21.71 ± 15.11 to 49.57 ± 33.58 mg L⁻¹, and the difference was statistically significant (p < 0.05). Column A2 of QCVN 08-MT: 2015/BTNMT, which applies to water used for domestic supply after appropriate treatment or for irrigation, drainage, and water transportation, sets a limit of 30 mg L⁻¹; TSS exceeded this limit in all months except May. However, TSS in the present study tended to be lower than the values reported in previous studies [21,22] for canals and rivers in An Giang and Can Tho provinces. TSS differed between the monitoring months because water flowing and flooding from upstream carried varying amounts of sediment, leading to high TSS concentrations. High TSS can increase treatment costs and make the aquatic environment less suitable for aquatic life.
The mean DO concentrations in February, May, August, and November were 5.07 ± 0.63 mg L⁻¹, 5.13 ± 0.12 mg L⁻¹, 5.16 ± 0.15 mg L⁻¹, and 5.18 ± 0.33 mg L⁻¹, respectively. The difference between the observed months was not statistically significant (p > 0.05) (Figure 3). DO concentration tended to increase over the observation months. This could be due to direct diffusion from the air through turbulence or to production by phytoplankton through photosynthesis. DO met the limit of QCVN 08-MT: 2015/BTNMT, column A2 (5 mg L⁻¹). The DO concentrations in this study were found to be higher than those in the water bodies in An Giang (4.0-5.2 mg L⁻¹) [21] and Can Tho (3.5-5.8 mg L⁻¹) [26]. The low DO in An Giang and Can Tho could be due to the presence of biodegradable matter and fertilizers from agricultural land [21,27]. DO may not pose a direct hazard to human health, but it may affect other chemicals in the water [27].
BOD and COD in 2019 ranged from 14.05 ± 1.41 to 15.52 ± 1.67 mg L⁻¹ and from 21.26 ± 1.74 to 23.03 ± 1.77 mg L⁻¹, respectively (Figure 3). ANOVA showed that BOD in August differed significantly (p < 0.05) from that in February, May, and November; however, there was no difference among February, May, and November (p > 0.05). Similarly, COD levels were significantly different between February, August, and November (p < 0.05). The difference between BOD and COD can be assessed as negligible, which means that the organic matter in the water body is mainly biodegradable. BOD and COD exceeded the allowable limits of QCVN 08-MT: 2015/BTNMT, column A2 (6 mg L⁻¹ and 15 mg L⁻¹, respectively), which showed that the water was organically polluted.
The concentration of N-NH₄⁺ tended to increase through the survey periods, fluctuating between 0.36 ± 0.061 and 0.40 ± 0.074 mg L⁻¹, and the difference between February and November was statistically significant (p < 0.05) (Figure 3). N-NH₄⁺ concentration exceeded the prescribed limit of QCVN 08-MT: 2015/BTNMT, which indicated that the surface water was contaminated with nutrients.
Moreover, in August and November, the concentration of N-NO₂⁻ was within the allowable limit of QCVN 08-MT: 2015/BTNMT, column A2 (0.05 mg L⁻¹). In contrast, the concentrations of N-NO₂⁻ in February (0.46 mg L⁻¹) and May (0.46 mg L⁻¹) were higher than this limit, by 9.2 times and 9.1 times, respectively. The study also noted a statistically significant difference between February and May compared with August and November (p < 0.05); N-NO₂⁻ concentrations in the months of the rainy season were higher than those in the months of the dry season. The increase in N-NO₂⁻ can be explained by nitrogen in wastewater and by insufficient DO for nitrifying microorganisms to convert N-NO₂⁻ into N-NO₃⁻. Another possible explanation is the use of fertilizers. N-NO₂⁻ is a product of nitrification and denitrification and can be toxic to aquatic organisms at a concentration of 0.1 mg L⁻¹ [28]; N-NO₂⁻ concentrations in February and May were 4.59 times higher than that level. Water containing N-NO₂⁻ is of great concern because it can cause methemoglobinemia, or blue-skin disease, by limiting oxygen transport in the bloodstream.
In contrast to N-NO₂⁻, N-NO₃⁻ concentrations were highest in November (3.00 ± 0.83 mg L⁻¹) and lowest in May (1.14 ± 0.39 mg L⁻¹). Statistical analysis showed a significant difference (p < 0.05) between May and November (Figure 3). This pattern has also been reported in several water bodies, where N-NO₃⁻ concentration was high in October, November, and December and low in April, May, and June. It is explained by decreased biological activity (bacterial denitrification and algal assimilation) in the last months of the year. However, N-NO₃⁻ in most of the monitoring months was within the allowable limit of QCVN 08-MT: 2015/BTNMT, column A2 (5 mg L⁻¹).
Meanwhile, TN fluctuated at relatively high levels, from 3.75 ± 0.54 to 4.30 ± 0.51 mg L⁻¹, and the difference between February and the other months (May, August, and November) was statistically significant (p < 0.05) (Figure 3). To minimize the risk of eutrophication, TN should not exceed 1.5 mg L⁻¹ [29]; when TN is higher than 1.7 mg L⁻¹, the risk of eutrophication is very high [30]. The TN concentrations recorded throughout the monitoring periods could therefore cause eutrophication.
In addition, P-PO₄³⁻ concentrations in February, May, August, and November were 0.24 ± 0.18 mg L⁻¹, 0.21 ± 0.12 mg L⁻¹, 0.18 ± 0.11 mg L⁻¹, and 0.30 ± 0.30 mg L⁻¹, respectively, with a statistically significant difference (p < 0.05) between November and both May and August. There was no difference between November and February (p > 0.05) (Figure 3). The P-PO₄³⁻ content in February and November was about 1.2-1.5 times higher than the limit of QCVN 08-MT: 2015/BTNMT. Normally, dissolved phosphorus in natural surface water is found at concentrations ranging from 0.005 to 0.02 mg L⁻¹, and water with more than 0.02 mg L⁻¹ is considered nutrient-rich [31]. Similar to TN, P-PO₄³⁻ could result in eutrophication of surface water in Dong Thap province.
Cl⁻ and SO₄²⁻ concentrations had similar fluctuations over the survey periods, ranging from 7.26 ± 3.19 to 19.48 ± 7.80 mg L⁻¹ and from 18.04 ± 11.43 to 28.65 ± 3.77 mg L⁻¹, respectively. For Cl⁻, the results showed a statistically significant difference (p < 0.05) between November versus May and February versus August; however, there was no difference between February and August. Similarly, SO₄²⁻ concentration differed significantly between May versus August or May versus February and November (p < 0.05). A previous study [32] on the surface water quality of the Tien River flowing through Tan Chau, An Giang, found the lowest Cl⁻ value in August 2017 (2.1 mg L⁻¹) and the highest value in December 2017 (19.4 mg L⁻¹). Concentrations of SO₄²⁻ in the study's water bodies in February and November were lower than those in May and August, possibly because some microorganisms use sulfate as an oxygen source. Additionally, sulfate concentrations of 5.3 ± 8.1 to 27.8 ± 5.3 mg L⁻¹ in river water have been linked to water bodies influenced by several human activities [13]. In this study, Cl⁻ and SO₄²⁻ were clearly detected in surface water and could have originated from human activities; therefore, the water needs to be appropriately treated before domestic use and other similar purposes.
The mean density of coliforms in the monitoring months ranged from 4599.31 ± 3019.32 to 8327.41 ± 7685.89 MPN 100 mL⁻¹ (Figure 4). This density differed significantly between February and both August and November (p < 0.05). An increase in coliform density with increasing temperature has been reported previously [33], which may explain the maximum coliform density in August (8327.41 ± 7685.89 MPN 100 mL⁻¹). Compared with the coliform limit in QCVN 08-MT: 2015/BTNMT, column A2 (5000 MPN 100 mL⁻¹), coliform density in the study area exceeded the permitted limit in May, August, and November by approximately 1.3-1.4 times. However, coliform density in the water bodies in Dong Thap was significantly lower than that in An Giang and Can Tho [21,22,26]. The main reason why coliform density is higher in An Giang and Can Tho is the presence of anthropogenic waste from point sources (domestic, industrial, aquaculture) and non-point sources (soil leaching, grazing), as well as other environmental factors such as temperature, pH, salinity, turbidity, nutrients, and hydrological regime [34,35]. In Dong Thap, pollution mainly comes from domestic sources, soil washout, and grazing, whereas in An Giang and Can Tho it mainly derives from domestic and industrial sources. Considering environmental factors, the pH and DO values in the An Giang and Can Tho watersheds are more favorable for the development of coliforms than those in Dong Thap.
The average density of E. coli in the study area was very high and fluctuated seasonally. Specifically, E. coli density differed significantly (p < 0.05) between the two months of the rainy season (May and August) and the two months of the dry season (February and November). The density of E. coli in February, May, August, and November was 548.10 ± 430.41 MPN 100 mL⁻¹, 1728.97 ± 3320.80 MPN 100 mL⁻¹, 520.26 ± 438.64 MPN 100 mL⁻¹, and 1615.17 ± 1124.19 MPN 100 mL⁻¹, respectively (Figure 4). This shows that E. coli in the rainy season was higher than that in the dry season. Compared with QCVN 08-MT: 2015/BTNMT, E. coli in all monitoring months exceeded the allowable limit of column A2 by 10-34 times, making it the parameter that exceeded its limit the most. Therefore, the water in water bodies in Dong Thap province poses a high risk for human uses. Appropriate measures are urgently needed to treat and improve the existing water resources.
Meanwhile, oil and grease concentrations over the observed months were relatively low, ranging from 0.0024 ± 0.00072 to 0.0027 ± 0.00076 mg L⁻¹, with no statistically significant difference (p > 0.05) (Figure 4). These results show that the concentration of oil and grease did not fluctuate greatly among seasons and was within the limit of QCVN 08-MT: 2015/BTNMT, column A2. Oil and grease in the surface water came mainly from domestic waste and leaching of materials; nevertheless, this content was negligible. In addition, the low concentration of oil and grease in the water can be attributed to uptake by algae, because oil and grease are susceptible to biological oxidation.
In short, the surface water in Dong Thap province in 2019 was polluted by suspended solids, organic matter, nutrients, and microbes. This indicates a very high potential risk of eutrophication, which is a leading cause of impairment of many freshwater ecosystems and a threat to human health. Therefore, it is necessary to develop appropriate programs to tackle these problems.

Correlation among Water Quality Variables in Water Bodies in Dong Thap Province in 2019
The correlations among the 17 observed indicators at 58 sampling locations along the Tien River, Hau River, and infield canals in Dong Thap province in 2019 are presented in Table 1. The results show that temperature was positively correlated with BOD, COD, TSS, and N-NO₃⁻ and inversely correlated with DO. Previous studies have shown that the higher the temperature, the less dissolved oxygen the water holds at saturation [36,37]. The study also recorded that pH had a weak negative correlation with Cl⁻ (r = −0.15), turbidity (r = −0.26), and SO₄²⁻ (r = −0.27). In practice, turbidity is related to runoff water and soil erosion, while pH is also related to the leaching of compounds containing Cl⁻ and SO₄²⁻. An inverse correlation between pH and turbidity has also been noted in a previous study [12]. Meanwhile, turbidity was found to correlate positively with TSS, Cl⁻, SO₄²⁻, and TN. This suggests that the water in the study area contained several dissolved ions, especially from fertilizers containing sulfur and chlorine [38].
TSS showed a positive correlation with several parameters, including N-NO₃⁻, P-PO₄³⁻, coliforms, and E. coli, and a negative correlation with N-NO₂⁻, oil and grease, and Cl⁻. Suspended solids in water tend to adsorb P-PO₄³⁻ and N-NO₃⁻ [39]. Similarly, the correlation of TSS with coliforms and E. coli can be explained by soil leaching in husbandry areas, which increases TSS, coliforms, and E. coli together. Therefore, E. coli density and nutrients in water can be reduced by sedimentation with clay particles. In addition, stormwater runoff carrying non-volatile hydrocarbons, animal and vegetable oils, grease, and other related materials can increase the grease content of the water body [40]. This grease can stick to soil particles during leaching and float on the water surface, limiting the amount of suspended solids present in the water.
Moreover, a high DO may increase the nitrification rate [12,41], which helps to explain the positive correlation between DO and N-NO₃⁻ in this study. BOD was strongly and positively correlated with COD (r = 0.84). There was no statistically significant difference between these two parameters, meaning that most of the organic matter was readily biodegradable. N-NO₃⁻ was positively correlated with P-PO₄³⁻ and inversely correlated with N-NO₂⁻ and Cl⁻. The correlations of N-NO₃⁻ with Cl⁻ and P-PO₄³⁻ were moderate, and that with N-NO₂⁻ was weak. An inverse correlation between N-NO₃⁻ and N-NO₂⁻ was expected because the N-NO₃⁻ concentration depends on the nitrification process. Furthermore, there was a moderate positive correlation between Cl⁻ and SO₄²⁻, related to the water-soluble salts in the studied water bodies. This correlation has also been reported in a previous study [42].
Furthermore, coliforms were strongly and positively correlated with E. coli. Water quality has been significantly influenced by residential areas [43], because E. coli is derived from the human digestive system. N-NH₄⁺ showed no correlation with the other parameters. Overall, the results indicated that most of the water quality parameters were correlated; however, the correlations were mostly only weak to moderate. Therefore, the parameters in the studied water bodies may have been strongly influenced by external environmental factors.
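The kind of Pearson correlation matrix discussed above can be sketched as follows. This is a minimal illustration on synthetic values, not the monitoring data: the column names, the generated numbers, and the weak/moderate/strong cut-offs in `strength` (0.3 and 0.7) are all assumptions introduced here, since the text does not state the thresholds it used.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 232  # 58 sites x 4 campaigns, mirroring the sampling design in the text

# Synthetic, illustrative values only; NOT the Dong Thap monitoring data.
bod = rng.normal(8.0, 2.0, n)
cod = 1.5 * bod + rng.normal(0.0, 1.0, n)          # built to track BOD closely
temp = rng.normal(29.0, 1.0, n)
do = 12.0 - 0.25 * temp + rng.normal(0.0, 0.3, n)  # built to fall as temperature rises

df = pd.DataFrame({"Temp": temp, "DO": do, "BOD": bod, "COD": cod})

# Pairwise Pearson correlation coefficients between all columns.
r = df.corr(method="pearson")


def strength(value):
    """Label a coefficient using ASSUMED cut-offs (0.3 and 0.7)."""
    magnitude = abs(value)
    if magnitude >= 0.7:
        return "strong"
    if magnitude >= 0.3:
        return "moderate"
    return "weak"


print(r.round(2))
print("BOD vs COD:", strength(r.loc["BOD", "COD"]))
```

With these assumed cut-offs, the reported BOD and COD coefficient of 0.84 would be labeled strong.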

Spatial Variation of Water Quality Index in the Water Bodies in Dong Thap Province in 2019
The mean values of the ten physical and chemical parameters were used to calculate the water quality index (WQI) at 58 locations, as shown in Figure 5. The WQI values at these monitoring sites ranged from medium (yellow) to poor (red). While nine locations were identified with very poor water quality, poor and medium water quality each accounted for 24 monitoring locations. Water quality was unevenly distributed across the study area. Poor water quality was mostly found in regions with concentrated socio-economic activities. Specifically, the southern regions of Dong Thap had lower water quality than the northern regions; the south of Dong Thap has two main rivers, the Tien and the Hau, which could receive discharges from industrial, domestic, aquacultural, and agricultural activities. In contrast, water quality in the northern part of Dong Thap may be affected by flow and discharge from upstream in Cambodia, given the transboundary character of the Mekong river system. Overall, the water in Dong Thap was considered less polluted than that in the water bodies of An Giang province [15,44]. The southeast region of An Giang was reported to have better water quality, which is consistent with the WQI results for the northwest part of Dong Thap, where water quality was better than elsewhere in the study area. Water quality was, however, similar to that in Can Tho's water bodies in 2018 [22]. The application of GIS incorporating WQI in surface water quality assessment can therefore serve as a basis for reviewing the surface water monitoring network in Dong Thap province in the future.


Key Water Variables Influencing Water Quality in the Water Bodies in Dong Thap Province in 2019
The principal component analysis (PCA) results revealed that 11 PCs contributed significantly and explained 90.7% of the total variation in surface water quality in Dong Thap province in 2019 (Table 2). For the extraction of each component in the PCA, the eigenvalue was used as the criterion to determine the weight or importance of each component [45]. PC1 and PC2 contributed 17.5% and 13.9% of the surface water quality variation, respectively, while PC3, PC4, PC5, PC6, PC7, PC8, PC9, PC10, and PC11 contributed 10.4%, 9.5%, 7.7%, 7.0%, 6.9%, 5.1%, 4.9%, 4.6%, and 3.4%, respectively. Eigenvalues greater than 1 are considered significant, and those below 1 are not [14,46]. In this study, the eigenvalues of PC1 to PC7 were greater than 1, so these PCs were used to evaluate potential pollution sources and key water quality variables. The change in water quality in Dong Thap province in 2019 was evidently complicated and affected by various pollution sources. PC1 was the most important component (17.5%), with weak contributions from TSS, SO₄²⁻, coliforms, and E. coli and a strong contribution from N-NH₄⁺. These conditions suggest that the cause could be an increase in manure-containing waste, overuse of fertilizers, or disturbance to the flow. TSS could come from surface water runoff, riverbank erosion, and phytoplankton growth, given the high risk of eutrophication in the area. PC2 also explained a substantial share of the variation (13.9%) in water quality, with temperature, DO, pH, BOD, COD, and N-NO₃⁻ as the parameters causing the most considerable fluctuation. This component may reflect hydrological conditions and domestic, urban, and agricultural sources. Hydrological factors, including flow velocity, fluctuation in water level, water temperature, flow rate, and catchment area, mainly affect the self-cleaning process of rivers and canals.
Typically, large and deep bodies of water can promote disturbance and self-cleaning, which can directly affect temperature and aquatic ecosystems and indirectly affect oxygen exchange in the water. The inverse correlation of temperature with DO, and of DO with BOD and COD, could mean that as temperature increases, DO decreases and BOD and COD increase [14,47]. The fluctuations caused by turbidity, P-PO₄³⁻, and Cl⁻ in PC3 accounted for 10.4%. This showed that PC3 was affected by salinity, domestic activities, and overflow and erosion [14]. PC4 accounted for 9.5% of the variation, contributed by N-NO₂⁻, N-NO₃⁻, coliforms, and E. coli. The N-NO₂⁻ and N-NO₃⁻ values indicated sources related to nitrogen-containing materials and fertilizers, while coliforms and E. coli originated from animal and fecal materials. PC5 and PC6 explained 7.7% and 7.0% of the water quality variation, respectively, with weak contributions from temperature, N-NH₄⁺, and TN. PC7 showed a moderate contribution from oil and grease and a weak contribution from TN. This implies that water quality in the study area was influenced by several different sources, such as hydrological conditions, stormwater runoff, riverbank erosion, domestic activities, urban areas, and industrial and agricultural zones. Among these, urban and agricultural wastes may be the decisive factors in the change of surface water quality in the study area. The water quality indicators that should be included in the water monitoring program are temperature, pH, TSS, DO, BOD, COD, N-NH₄⁺, N-NO₂⁻, TN, P-PO₄³⁻, coliforms, and E. coli.
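The eigenvalue-greater-than-1 rule used above to retain PC1 to PC7 can be sketched as below. The first part only sums the variance shares listed in the text: PC1 to PC7 together come to about 72.9%, and all 11 listed shares add up to about 90.9%, slightly above the stated 90.7%, presumably because the individual shares are rounded. The second part is a minimal PCA on the correlation matrix of synthetic data; the function name `pca_kaiser`, the synthetic matrix, and the choice to run PCA on standardized variables are assumptions for illustration, not the authors' documented procedure.

```python
import numpy as np

# Shares of variance (%) for PC1..PC11 as listed in the text.
reported_share = [17.5, 13.9, 10.4, 9.5, 7.7, 7.0, 6.9, 5.1, 4.9, 4.6, 3.4]
cum_pc7 = round(sum(reported_share[:7]), 1)  # components with eigenvalue > 1
cum_all = round(sum(reported_share), 1)      # all 11 listed components


def pca_kaiser(data):
    """PCA on the correlation matrix; flag components with eigenvalue > 1."""
    # Standardize each variable so that all parameters share one scale.
    z = (data - data.mean(axis=0)) / data.std(axis=0, ddof=1)
    corr = np.corrcoef(z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)  # returned in ascending order
    order = np.argsort(eigvals)[::-1]        # re-sort to descending order
    eigvals = np.clip(eigvals[order], 0.0, None)
    eigvecs = eigvecs[:, order]
    share = 100.0 * eigvals / eigvals.sum()  # % of total variance per component
    keep = eigvals > 1.0                     # eigenvalue > 1 retention rule
    loadings = eigvecs * np.sqrt(eigvals)    # variable-to-component loadings
    return eigvals, share, keep, loadings


# Synthetic matrix (232 samples x 17 parameters); NOT the monitoring data.
rng = np.random.default_rng(1)
latent = rng.normal(size=(232, 3))
mixing = rng.normal(size=(3, 17))
data = latent @ mixing + rng.normal(size=(232, 17))

eigvals, share, keep, loadings = pca_kaiser(data)
print("Reported cumulative share, PC1 to PC7:", cum_pc7)
print("Components retained on synthetic data:", int(keep.sum()))
```

Because the eigenvalues of a correlation matrix sum to the number of variables, an eigenvalue above 1 marks a component that explains more variance than any single standardized parameter, which is the rationale usually given for this cut-off.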

Clustering Water Quality in the Water Bodies in Dong Thap Province in 2019
In this study, at a Euclidean distance of 5 (red line), the 58 monitoring positions were divided into four clusters (Figure 6). Cluster 1 included only NM43; Cluster 2 included positions NM46, NM28, and NM45; Cluster 3 included locations NM37, NM35, NM77, NM78, NM39, NM57, NM64, NM65, NM69, and NM81; and Cluster 4 comprised the remaining positions. In addition, for a more detailed observation of water quality changes in Dong Thap province, the positions were divided into 12 clusters at a Euclidean distance of 3 (blue line): Cluster 1 (NM43), Cluster 2 (NM46), Cluster 3 (NM28, NM45), Cluster 4 (NM37, NM35, NM77, NM78, NM39, NM57, NM64, NM65), Cluster 5 (NM69, NM81), Cluster 6 (NM26, NM29), Cluster 7 (NM70, NM72, NM61, NM62, NM59, NM42, NM71, NM53, NM60, NM03, NM11), Cluster 8 (NM58), Cluster 9 (NM16, NM66, NM68), Cluster 10 (NM44, NM63), Cluster 11 (NM13, NM67), and Cluster 12 (remaining locations). Water quality characteristics in the clusters were assessed by the mean values of the locations in each cluster and are presented in Table 3.
In general, BOD, COD, N-NH₄⁺, P-PO₄³⁻, and E. coli in all clusters were higher than the limits of QCVN 08-MT: 2015/BTNMT, column A2. Cluster 1 is located upstream on the Tien River, where it flows into Dong Thap province; although BOD, COD, N-NH₄⁺, P-PO₄³⁻, and E. coli exceeded the standards, these parameters had lower values than those in the remaining groups. Water quality in Cluster 2 was reported to be higher than that in Cluster 1. TSS in Cluster 1 far exceeded the limit of QCVN 08-MT: 2015/BTNMT, column A2. Cluster 3 showed a nutrient pollution problem that can be assessed by N-NH₄⁺, N-NO₂⁻, N-NO₃⁻, P-PO₄³⁻, TN, and TP. In Cluster 3, N-NH₄⁺ and P-PO₄³⁻ were higher than the permitted values; this could stem from the fact that these locations are in densely populated areas and at intersections with tributaries, so they may be affected by combined pollution sources. Cluster 4 and Cluster 5 had very high concentrations of coliforms and E. coli, which were 2.3 to 4 times the limits of QCVN 08-MT: 2015/BTNMT, column A2 for coliforms and 26.52 to 97.42 times for E. coli (Table 3). This showed that the pollution in these two clusters was characteristically microbiological, influenced by fecal materials from humans and animals. Clusters 4 and 5 were considered the two clusters with the highest pollution level. Cluster 6 and Cluster 7 showed organic, nutrient, and microbiological pollution, indicated by BOD, COD, TSS, N-NH₄⁺, N-NO₂⁻, P-PO₄³⁻, and E. coli exceeding their limits. Cluster 8, Cluster 9, Cluster 10, and Cluster 11 were polluted because BOD, COD, TSS, N-NO₂⁻, N-NH₄⁺, P-PO₄³⁻, coliforms, and E. coli all exceeded the limit values of QCVN 08-MT: 2015/BTNMT, column A2. Cluster 12 was also polluted by BOD, COD, N-NO₂⁻, N-NH₄⁺, coliforms, and E. coli. However, Cluster 12 had a lower level of microbiological pollution and a higher level of organic matter than Clusters 8 to 11 (Table 3).
The above results showed that the water in Dong Thap province's water bodies was polluted with suspended solids, nutrients, organic matter, and microorganisms. The primary sources of these problems could be hydrological conditions, stormwater runoff, riverbank erosion, domestic activities, urban areas, and industrial and agricultural zones. The reason is that wastewater and wastes from these sources are characterized by organic constituents, manifested by high concentrations of COD and BOD, along with nutrients such as nitrogen and phosphorus, and microorganisms. Moreover, these sources were also relevant to local economic development.
CA results suggested that the number of monitoring locations on the same river or canal within the same cluster can be reduced, so the monitoring points along the Tien River, Hau River, and infield canals can be reduced from 58 to 43 positions, as indicated in Figure 7. This could save 25.85% of the monitoring costs. The sites that could be omitted were NM37 or NM38 in Cluster 4 (Hau River); NM61 or NM72 (Cai Nho River), NM11 or NM53 (Nguyen Van Tiep Canal), and NM59 or NM60 or NM03 (Tien River) in Cluster 7; NM13 or NM67 in Cluster 11 (Nguyen Van Tiep Canal); and NM49 or NM54 or NM73 or NM50 or NM52 or NM55 or NM05 (Hau River), NM82 or NM83 (So Ha River), NM74 or NM06 (Sa Dec River), and NM02 or NM56 (Cao Lanh River) in Cluster 12.
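Cutting a dendrogram at two Euclidean distances, as described for Figure 6, can be sketched with SciPy as below. This is an illustration under stated assumptions: the site matrix is synthetic, Ward linkage is assumed because this section does not name the linkage method, and the cluster counts produced here will not match the 4 and 12 clusters reported for the real data. The final lines compute the share of sites removed when going from 58 to 43, which comes to about 25.9%, in line with the 25.85% saving quoted if costs are assumed to scale with the number of sites.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

rng = np.random.default_rng(2)

# Synthetic matrix: 58 sites x 17 parameters drawn around 4 hypothetical group
# centers. Illustrative only; NOT the Dong Thap monitoring data.
centers = rng.normal(scale=3.0, size=(4, 17))
membership = rng.integers(0, 4, size=58)
sites = centers[membership] + rng.normal(scale=0.3, size=(58, 17))

# ASSUMPTION: Ward linkage on Euclidean distances between sites.
tree = linkage(sites, method="ward", metric="euclidean")


def cut(tree, distance):
    """Assign a cluster label to each site by cutting the tree at `distance`."""
    return fcluster(tree, t=distance, criterion="distance")


coarse = cut(tree, 5.0)  # analogous to the red line in the text
fine = cut(tree, 3.0)    # analogous to the blue line in the text
print("clusters at distance 5:", len(set(coarse)))
print("clusters at distance 3:", len(set(fine)))

# Share of monitoring sites removed when reducing the network from 58 to 43.
reduction_pct = (58 - 43) / 58 * 100
print(f"site reduction: {reduction_pct:.2f}%")
```

A lower cut distance can only split clusters further, never merge them, which is why the finer cut is used in the text for a more detailed view of the same grouping.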

Conclusions
The surface water in Dong Thap in 2019 was polluted, as manifested by TSS, BOD, COD, N-NH₄⁺, N-NO₂⁻, P-PO₄³⁻, coliforms, and E. coli exceeding the limits of QCVN 08-MT: 2015/BTNMT, column A2. ANOVA showed that water quality changed significantly between seasons across the surveys (except for DO and oil and grease). The WQI showed that water quality in the south of Dong Thap was lower than in the north, and that water quality ranged from poor to medium. PCA and Pearson analysis showed 12 water monitoring indicators including