1. Introduction
Water quality remote sensing refers to the use of remote sensing technology to monitor water quality parameters in water bodies [
1]. It is based on the absorption and scattering of solar radiation by the water, leading to distinct spectral characteristics. Since the early 1970s, remote sensing technology has made great strides in research on water quality monitoring of inland waters, from simple qualitative identification of waters to the ability to quantitatively obtain the concentration of water quality parameters and the ability to conduct remote sensing monitoring [
2]. Compared to traditional water quality monitoring, which requires a large number of measurement points, satellite remote sensing monitoring is a low-cost and time-efficient method. It can display the spatial and temporal changes in water quality on a larger regional scale, provide a historical record of water quality in a certain area over a long period, predict future trends and help experimentalists exclude unsuitable sampling points to save investigation time [
3]. The data are also presented in an intuitive manner. These advantages have led to the rapid progress and widespread use of satellite remote sensing in water quality monitoring.
Water color indicators are fundamental components of remote sensing data, making them the basis for remote sensing inversion [
4]. Inversion models transform these water color indicators into the water quality parameters under study. The concentrations of various water quality indicators can be inferred by analyzing data from the water body. On this basis, the mathematical model can be developed to relate water monitoring data with corresponding satellite remote sensing data, and then be applied to satellite imagery in the study area, ultimately allowing for the extraction of the overall spatial distribution characteristics of water quality within the research scope.
Currently, remote sensing monitoring of water quality is mainly achieved through empirical, semi-empirical and machine learning methods. Kloiber et al. [
5] successfully monitored water quality in the Twin Cities area of Minnesota using empirical methods. Machine learning methods are also being increasingly applied in the field of water quality remote sensing monitoring, as they can analyze water quality data to discover new patterns and correlations between water quality parameters. The software produces models to predict water quality that accurately reflect the current situation from multiple perspectives. In recent years, support vector machines (SVMs), neural networks and random forests have been commonly used for water quality inversion through machine learning. Many studies have reported positive results in water quality remote sensing monitoring using these models. Cheng et al. [
6] demonstrated the effectiveness of a convolutional neural network in accurately extracting and classifying fish movement trajectories, which could distinguish between normal and abnormal water quality conditions. This approach uses biological indicators for water quality assessment. Xiao et al. [
7] improved the GA-BP neural network to fit a complex water quality model, reducing absolute error and improving prediction accuracy. Guan [
8] applied the support vector machines theory and methodology to water quality monitoring, resulting in a feasible monitoring model.
There are three main categories of substances that affect the spectral characteristics of water bodies, in addition to the pure water body itself: yellow matter, phytoplankton (chlorophyll-a) and total suspended solids (TSSs). Such water quality indicators with significant spectral characteristics that can be directly inverted by remote sensing are called water color indicators. Non-water color indicators are used to indirectly extrapolate or invert important indicators such as total nitrogen (TN) and total phosphorus (TP), which do not have significant spectral characteristics, in the evaluation and protection of the water environment. TSSs is one of the key parameters in water color indicators. Li et al. [
9] used Landsat TM as a data source to analyze the pollution conditions in the coastal waters of Xiamen. Xingsps et al. [
10] built an inversion model for total suspended matter concentration in the reservoir area using HJ-CCD satellite data, obtaining highly accurate results.
Chlorophyll-a concentration (Chl-a) is an important indicator for evaluating eutrophication in water environments [
11]. Hoogenboom et al. [
12] used a matrix inversion model to monitor Chl-a concentration in water bodies. Li et al. [
13] used the position and value of reflection peaks in the blue-violet band and at 670 nm to indicate Chl-a concentration. Zhu et al. [
14] discovered similarities between the results of GF-1 and HJ-1A CCD satellites. They compared the band information from GF-1 and HJ-1A satellites using different sources’ band response functions and a large amount of measured Taihu Lake water body data. Although TN and TP contents do not directly affect the spectral characteristics of the water body, they are significantly correlated with Chl-a concentration [
15]. Wang et al. [
16] established the relationship between remote sensing data and turbidity, suspended solids, TP and chemical oxygen demand using Landsat TM satellite information. Xu et al. [
17] developed a quantitative inverse model for nitrogen and phosphorus concentrations using MODIS image data. The obtained results were consistent with the actual nitrogen concentrations in the study area.
Existing studies have indicated that, due to the complexity of water quality influencing factors in natural water bodies, it is not rigorous to use a single water quality indicator to evaluate the water quality status. Therefore, it is necessary to use the trophic state index to judge the trophic state of lakes. Lillesand et al. [
18] calculated the Carlson trophic state index using Landsat satellite MSS data and used it to evaluate the trophic state of lakes. For non-water color indicators that do not directly affect the spectral characteristics of water bodies, such as TN, TP, dissolved oxygen (DO), five-day biochemical oxygen demand (BOD5) and chemical oxygen demand (COD), it is generally necessary to indirectly infer the water color parameters [
19]. Remote sensing inversion of non-water color indicators is more difficult compared to water color indicators, and research in this area started later. However, there are still many related studies.
Rodríguez-Molina [
20] introduced a software system called HydroTrack which could simulate pollutant transport and demonstrated a simulation of pollutant discharges with an example in the Maracaibo estuary in Venezuela. Richards et al. [
21] studied different tracers to identify the sources of pollution by tracing contaminants in watercourses and the result suggested that a combination of multiple chemical and physical tracing approaches should be employed. Compared to the methods used in these studies, the inversion model constructed using remote sensing imagery and field-measured data can more clearly and intuitively identify the locations of pollutant discharges, with a shorter processing time.
This paper is based on fundamental theories, including remote sensing inversion and water quality monitoring. This study focuses on East Lake in Wuhan and utilizes satellite remote sensing inversion for water quality simulation. The research yields spatial distributions and temporal variations in key water quality parameters, such as Chl-a, COD, TN and TP. An inversion of pollutant concentration variations within the water quality field can identify potential pollution sources in East Lake.
4. Discussion
The spatial distribution of water quality parameters obtained through remote sensing inversion, as shown in
Figure 6, closely corresponds to prior research findings in East Lake. In general, nitrogen and phosphorus concentrations in East Lake surpass standard limits. The southern part of East Lake exhibits relatively poorer water quality compared to the northern part. Concentration trends for Chl-a, TN, TP and NH3-N are generally consistent, with the central regions of large sub-lakes in East Lake, such as Guozheng Lake (GZL), Tangling Lake (TLH) and Tuan Lake (TL), showcasing relatively better water quality. Conversely, peripheral smaller sub-lakes like Yujia Lake (YJL) and Miao Lake (ML) demonstrate relatively poorer water quality due to limited water exchange [
26]. The distribution trend of COD concentration deviates from other indicators, with lower COD concentrations observed in YJL and ML. Areas near East Lake sewage outlets, regions with frequent human activities and enclosed small sub-lakes with poor water mobility exhibit poorer water quality conditions. These findings align with the regional eutrophication status reported on the official website of the Wuhan Municipal Ecology and Environment Bureau (
Table S1). And they are also consistent with previous research findings that higher concentrations of Chl-a are commonly observed in sub-lakes such as Yujia Lake, Tuan Lake, Miao Lake and Shuiguo Hu [
27]. Notably, Chl-a levels are higher near the edges of these lakes, close to the land, compared to the central regions of the lakes [
28]. Li et al. [
29] utilized Landsat-8 images and measured data to construct a MIKE 21 model and similarly investigated chlorophyll-a concentrations in East Lake. Their findings indicated that chlorophyll-a concentrations were generally poorer near the periphery of the southwest bank of East Lake, with water quality gradually improving toward the central area of the lake. This study’s results are consistent with these results, further confirming the spatial patterns of water quality distribution in East Lake.
Zhang et al. [
30] used Landsat TM/ETM+ and OLI imagery to predict water quality parameters in Danjiangkou Reservoir, and the coefficients of determination R
2 for total nitrogen and chemical oxygen demand were 0.62 and 0.61, respectively, during the dry season. For this study, using Sentinel-2 data and a multiple linear regression model achieved higher R
2 values for the same two water quality parameters (R
2 > 0.75). This suggests that the remote sensing approach used in this study is effective in capturing water quality variations, particularly when compared to similar methodologies in other studies. However, as both studies indicate, the performance of regression models may vary depending on the specific water quality parameters and the characteristics of the study area, highlighting the need for further research across different temporal and spatial scales.
The proximity of stormwater and wastewater outfalls to each of the four identified potential pollution sources, as shown in
Figure 8, demonstrates the effectiveness of this method in pinpointing pollution source locations. This finding demonstrates the potential utility of our model in identifying areas with elevated pollution risks, supporting its broader application in similar contexts.
After analyzing the water quality data for Chl-a, TN, TP, NH₃-N and COD, it was determined that there were no significant outliers that required removal. It is important to note that the handling of outliers can significantly impact the conclusions drawn from a small dataset.
From the remotely sensed concentration of water quality parameters in this study, it is evident that East Lake faces water pollution issues in the year 2021, often falling short of the planned Class III water quality standards. Despite increased efforts by the Wuhan municipal government to address pollution in East Lake, the water quality has frequently reached Class V conditions in recent years, as presented in
Figure S1. During the investigation of East Lake outfalls, a significant amount of industrial and domestic wastewater discharge into the lake was observed. Although the treated discharged wastewater meets standards, the cumulative volume still exceeds the water environmental capacity of East Lake.
However, there are still many limitations to this study that require further exploration and validation in future research.
First, due to the lack of long-term data, the measured data in this study primarily reflect the water quality conditions of East Lake during the winter of 2021, which is not comprehensive enough to represent the conditions in other seasons or years. Additionally, the time scale for simulation and prediction is relatively short. With more extensive water quality data, future research can conduct a more thorough analysis of the spatial and temporal variations in East Lake’s water quality over an extended period. This would allow for a comparison of water quality across different seasons and enable long-term predictions on an annual scale, helping to identify the factors influencing changes in East Lake’s water quality.
Secondly, the satellite-based remote sensing approach used in this study is limited to assessing the uppermost part of the water column, and thus may not fully represent the vertical stratification of the water body. We did not consider the effects of hydrodynamic processes within East Lake, such as hydrodynamic mixing, eddies and water circulation patterns, which can influence the distribution of pollutants. These processes may cause pollutants to disperse and accumulate in areas that do not directly correspond to their source, potentially leading to inaccuracies in pinpointing discharge locations. In addition, factors such as water, grass and sediment can affect spectral data and, consequently, the identification of pollution sources. To address these limitations in future studies, incorporating hydrodynamic modeling alongside remote sensing data would be beneficial. By simulating water flow and mixing patterns within the lake, we could more accurately trace the movement of pollutants from their source to their observed concentrations. Future studies should consider removing pixels from areas near the shore or aquatic zones to enhance the accuracy of the results. Additionally, including data from different depths and integrating in situ measurements could improve our understanding of subsurface discharges and their impact on water quality.
5. Conclusions
This study focuses on East Lake in Wuhan and involves constructing a remote sensing inversion model based on satellite imagery and concurrently collected water quality data. The model is used to infer the spatial distribution of water quality indicators. The accuracy and simulation performance of the model are evaluated by comparing its results with actual water quality data from sampled points. Assessing the distribution of water quality indicators makes it possible to identify regions with elevated concentrations along the boundaries, enabling the determination of potential pollution sources and the calculation of pollutant discharge. Verification is conducted by comparing these findings with the actual discharge points in East Lake. Remote sensing technology is utilized for water quality monitoring in East Lake, and an efficient water quality prediction model is developed.
The multiple linear regression model provided a good relationship between the observed and simulated water quality parameters, with four parameters correlated at R2 > 0.6 and the remaining one correlated at R2 > 0.45. The result shows that the location of the main pollution sources for the inferred water quality parameters was close to the publicly disclosed discharge point. This method can be applied to lakes where the discharge point locations are unknown. A longer period should be investigated to test the multiple linear regression model, determine its performance and assess the impacts of East Lake restoration on water quality parameters.
Considering the analyzed concentration distribution results of remotely sensed parameters in various sub-lakes of East Lake, issues with the current water quality status are identified, such as non-compliance with water quality standards and the frequent occurrence of “algae bloom”. In response to these problems, several remediation recommendations are proposed.
5.1. Improvement in East Lake Water System Structure
Although the “Grand East Lake Water Network Connection Project” was initiated by the Wuhan municipal government in 2009, effectively managing water quality parameters in the large sub-lakes of East Lake, some of the smaller sub-lakes in its vicinity still experience poor water circulation. These smaller sub-lakes are significantly influenced by pollution discharges from surrounding enterprises and human activities. Beyond the existing culverts and sluice gates, enhancing connectivity between the sub-lakes could be considered to improve the self-purification capacity of the water body in East Lake.
5.2. Management of Pollution Sources in East Lake
To tackle the increased pollution in the southern and nearshore areas of East Lake, it is essential to enhance the redevelopment of adjacent sewage treatment plants. This includes expanding the sewage treatment capacity to accelerate the purification of water. Furthermore, a centralized approach for treating pollutants from urban sewage and aquaculture is imperative.
5.3. Enhancement of the Construction Supervision System
The water protection authority responsible for East Lake should actively disseminate information by consistently consolidating and publishing monthly water quality monitoring data on its official website. This initiative aims to uphold transparency, enabling easy public access to crucial water quality information. By involving the community in overseeing the water conditions of East Lake, a collaborative partnership between the government and the public can be established.
Utilizing remote sensing inversion data for East Lake, we inferred potential discharge points. Four locations, characterized by the highest pollutant concentrations with a subsequent decrease in surrounding water bodies, were designated as potential pollution sources. A comparative analysis with publicly available information on East Lake’s discharge points confirmed a reasonable alignment, showcasing the utility of the remote sensing inversion model in pinpointing pollutant source locations. This significantly enhances the model’s potential application to other water bodies.
In conclusion, this model can subsequently be applied to broader watershed studies, including the Yangtze River, for improved assessment of water quality conditions. This approach holds scientific significance and practical value in enhancing water environmental management and remediation efforts.