Next Article in Journal
Ionospheric Inhomogeneities and Their Influences on the Earth’s Remote Sensing from Space
Next Article in Special Issue
Response of Industrial Warm Drainage to Tide Revealed by Airborne and Sea Surface Observations
Previous Article in Journal
Comparative Analysis between Sea Surface Salinity Derived from SMOS Satellite Retrievals and in Situ Measurements
Previous Article in Special Issue
Retrieval of Water Quality Parameters Based on Near-Surface Remote Sensing and Machine Learning Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Satellite and Machine Learning Monitoring of Optically Inactive Water Quality Variability in a Tropical River

1
Environmental and Plant Protection Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
2
Hainan Danzhou Tropical Agro-Ecosystem National Observation and Research Station, Danzhou 571737, China
3
Key Laboratory of Low-Carbon Green Agriculture in Tropical Region of China, Ministry of Agriculture and Rural Affairs, Haikou 571101, China
4
Hainan Key Laboratory of Tropical Eco-Circular Agriculture, Haikou 571101, China
5
National Agricultural Experimental Station for Agricultural Environment, National Long-Term Experimental Station for Agriculture Green Development, Danzhou 571737, China
6
Hainan Provincial Ecological and Environmental Monitoring Centre, Haikou 571126, China
7
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
8
Progoo Research Institute, Tianjin Progoo Information Technology Co., Ltd., Tianjin 300380, China
9
College of Resources and Environmental Sciences, China Agricultural University, Beijing 100193, China
10
Hainan Climate Center, Haikou 570203, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(21), 5466; https://doi.org/10.3390/rs14215466
Submission received: 21 September 2022 / Revised: 20 October 2022 / Accepted: 25 October 2022 / Published: 30 October 2022
(This article belongs to the Special Issue Hyperspectral Remote Sensing Technology in Water Quality Evaluation)

Abstract

:
Large-scale monitoring of water quality parameters (WQPs) is one of the most critical issues for protecting and managing water resources. However, monitoring optically inactive WQPs, such as total nitrogen (TN), ammoniacal nitrogen (AN), and total phosphorus (TP) in inland waters, is still challenging. This study constructed retrieval models to explore the spatiotemporal evolution of TN, AN, and TP by Landsat 8 images, water quality sampling, and five machine learning algorithms (support vector regression, SVR; random forest regression, RFR; artificial neural networks, ANN; regression tree, RT; and gradient boosting machine, GBM) in the Nandu River downstream (NRD), a tropical river in China. The results indicated that these models can effectively monitor TN, AN, and TP concentrations at in situ sites. In particular, TN by RFR as well as AN and TP by ANN had better accuracy, in which the R2 value ranged between 0.44 and 0.67, and the RMSE was 0.03–0.33 mg/L in the testing dataset. The spatial distribution of TN, AN, and TP was seasonal in NRD from 2013–2022. TN and AN should be paid more attention to in normal wet seasons of urban and agricultural zones, respectively. TP, however, should be focus on in the normal season of agricultural zones. Temporally, AN decreased significantly in the normal and wet seasons while the others showed little change. These results could provide a large-scale spatial overview of the water quality, find the sensitive areas and periods of water pollution, and assist in identifying and controlling the non-point source pollution in the NRD. This study demonstrated that multispectral remote sensing and machine learning algorithms have great potential for monitoring optically inactive WQPs in tropical large-scale inland rivers.

Graphical Abstract

1. Introduction

Freshwater is a vital resource for both the environment and humanity. However, in recent decades, water resources have been threatened by anthropogenic contamination with the rapid development of the economy and insufficient awareness of environmental protection [1,2,3]. Due to the indiscriminate discharges of industrial, agricultural, and domestic sewage to water bodies, nutrient concentrations such as nitrogen (N) and phosphorus (P) increase dramatically, leading to non-point source pollution [4,5]. For example, eutrophication, associated algal overgrowth, and declining water quality negatively impact eco-environment services (e.g., drinking water, food supply, and biodiversity) and pose a threat to human health [6,7,8].
The Nandu River, a tropical river and the largest river on Hainan Island, is one of the primary sources of drinking water, livestock and poultry aquaculture, and agricultural irrigation for Hainan Province, China. It is a typical tropical river basin consisting mainly of agriculture and rural areas, without industry. In recent years, however, the Nandu River has also been facing the problem of eutrophication. Due to the hot and humid climate, the Nandu River basin has had a high multiple cropping index (158.6%) [9] with more nutrient average input per crop season than in temperate regions [10]. Nutrient losses along with the frequent strong rainfall in this area result in the concentrations of total nitrogen (TN), ammoniacal nitrogen (AN), and total phosphorus (TP) exceeding the acceptable range and having great spatial heterogeneity [11,12,13], as did other tropical rivers around the world [14,15,16]. Therefore, providing effective strategies to monitor nutrient concentrations of inland water and determining their spatiotemporal dynamics are critical for freshwater resource protection and management [17,18].
In traditional water quality monitoring methods, extensive field monitoring networks are required, including on-site measurements, samplings, and laboratory analysis. Water quality monitoring networks, such as buoys and platforms deployed on some rivers and lakes, can collect data continuously and increase the “temporality” of in situ samplings, but are still spatially limited and costly. Although relatively accurate, most traditional methods are time-consuming, expensive, and limited to point-based data, failing to give a large-scale spatial overview of the water quality [19]. Therefore, researchers and managers aim for a comprehensive, low-cost water quality monitoring method [20].
In recent years, as remote sensing techniques have rapidly developed, satellite remote sensing has become a powerful tool that is widely used for monitoring large-scale variations in water quality [21,22]. Compared to the traditional methods, satellite remote sensing is cost-effective and less time-consuming and has been used as an effective detection tool for the optically active water quality parameters (WQPs), including chlorophyll-a, temperature, turbidity, and suspended solids with high spatiotemporal resolution [23,24,25,26]. However, retrieving the N and P concentrations for inland water remains challenging because these WQPs are non-optically active [27]. Fortunately, some scholars have made quantitative retrievals of N and P in bodies of water with empirical or machine learning algorithms [21,28].
Empirical models and machine learning algorithms have been widely used in retrieving WQPs from remote sensing images. The main principle of empirical methods is to use several spectral band combinations as part of polynomial regression to predict the change in water quality [29,30]. They are developed by long-term and large-scale experiments in the fixed water body with high accuracy [31]. Therefore, an empirical model obtained at one water body is not usually applicable for other water bodies, such as tropical rivers having different climates and ambient conditions. With the advances in algorithm development, computing power, sensor performance, and data availability, machine learning algorithms have been widely used for estimating WQPs [27]. Many scholars have used machine learning algorithms and remote sensing data to solve complex regressions such as nonlinear regression, which has relatively obvious advantages in monitoring optically inactive WQPs [32,33,34,35]. Popular machine learning algorithms including linear regression, regression tree, random forest, boosted regression tree, support vector regression, and artificial neural networks have been used in water quality monitoring along with various remote sensing data such as Landsat 5/7/8, MODIS, Sentinel 2/3, and airborne systems. Most of these studies focused on TN and TP in reservoirs and lakes with lower flow velocity in non-tropical regions [29,34,36], few AN, and did not involve in tropical rivers. In order to improve the robustness, these machine learning algorithms must be carefully made generalizable to tropical regions because they have higher temperatures and more frequent precipitation than the other regions. Therefore, a customized model by machine learning algorithms is needed for the monitoring of tropical rivers’ N and P concentrations.
The goals of this study are to: (1) develop retrieval models of WQPs to monitor the concentrations of TN, AN, and TP in the Nandu River based on Landsat 8 images, machine learning algorithms, and water quality samplings; (2) evaluate the accuracy of these models and determine the best model for a long-term analysis from 2013 to 2022; and (3) explore the spatiotemporal distribution and patterns of N and P concentrations and identify the sensitive area and period in the tropical river.

2. Materials and Methods

2.1. Study Area

The Nandu River downstream (NRD, 19°41′49″–20°4′56″N, 109°58′38″–110°26′7″E) is in the tropical area of northern Hainan Island, China (Figure 1). The waterway, which runs about 110.7 km from the Shankou national cross-section to the river mouth, flows into the Qiongzhou strait at the Haikou City Xinbu delta and occupies a total drainage area of ~2290 km2. It is a primary river system that flows through agricultural, rural, and urban areas, mostly hilly tableland and coastal deltas. This area has a tropical monsoon marine climate with an annual mean temperature of ~24.6 °C, long summers, and no winters. The annual precipitation is ~1880.6 mm (1992–2021), most of which occurs in tropical cyclones or typhoons from July to October, and the spatiotemporal distribution is highly uneven. The annual average runoff is ~6.92 billion m3. The dry season (January–March), wet season (June–October), and normal season (April, May, November, December) recur throughout the year. Therefore, the area has a better agricultural foundation and a high level of intensive cultivation and breeding, with an industrial advantage in the areas of planting, freshwater aquatic, livestock, and poultry. To reveal the spatial variations of the nutrients in the different environments, this study divided the NRD into three subregions (Figure 1). More generally, the NRD was divided into an agriculture and rural mixed zone (A), an agriculture zone (B), and an urban zone (C) based on location, shape, geomorphology, and ecological function.

2.2. Data

2.2.1. Water Quality Sampling

Surface water quality data include the national cross-sections and the in situ site samplings. Five national cross-sections were obtained from the Environmental Knowledges Service System (http://envi.ckcest.cn/environment/data_Integration/data_Integration.jsp accessed on 18 April 2022) at the NRD from January 2021 to March 2022. The data are hourly (4 h) including the concentrations of TN, AN, and TP. The value at 12:00 was used due to being near the Landsat 8 scene center time in the NRD. Based on the five national cross-sections and field surveys, nine in situ water quality sites were established with representativeness, easy sampling, stability, and reliability as supplementary monitoring in the NRD. In situ water samples were collected once a month from the water surface (~20 cm) in the time frame of March 2021 to September 2021, stored in acid-washed Niskin bottles under ice conditions, and delivered to the laboratory on time. Then, WQPs were analyzed, including the concentrations of TN, AN, and TP. In order to test the TN concentration, the solutions were digested with potassium persulfate, and then a UV spectrophotometer was used to characterize the sample at a wavelength of 210 nm. The spectrophotometry running at the wavelength of 620 nm using Nessler’s reagent was used to test the AN concentration. The TP concentration was tested by spectrophotometry at 700 nm using ammonium molybdate. The environmental quality standards for surface water (GB 3838-2002) in China (Table S1) were used to evaluation water quality. There are five levels, including classes I, II, III, IV, and V according to the standard limit values of WQPs from low to high.

2.2.2. Landsat 8 Satellite Images

This study used Level-2 surface reflectance products in Landsat 8 operational land imager collection 2, which can be freely downloaded from the website of the United States Geological Survey (USGS) (http://earthexplorer.usgs.gov accessed on 19 May 2022). These products are radiometrically and geometrically corrected. Landsat 8 was successfully launched by NASA on 11 February 2013 and had a 16-day revisit cycle. This study used bands 1–7 of Landsat 8 with a spatial resolution of 30 m and a spectrum range from visible to short-infrared. In this study, 25 Landsat 8 images were collected with little clouds over the NRD from April 2013 to April 2022 at row 46 and path 124 (Table 1) to explore the spatiotemporal evolution of TN, AN, and TP in the NRD. There were totals of 3, 11, and 11 images at the dry, normal, and wet seasons, respectively.

2.2.3. Dataset for Modeling

There were 67 water samples collected, including national cross-sections (N = 39) and in situ sites (N = 28) with quasi-synchronous Landsat 8 images ranging from January 2021 to March 2022. The concentrations of TN, AN, and TP for all pixel–sample matchups were in the ranges of 0.30–2.97 mg/L, 0.03–0.65 mg/L, and 0.01–0.30 mg/L, respectively, covering the typical range for the NRD. The dataset, used to construct the retrieval models for TN, AN, and TP, was combined with the surface water quality data of 67 samples (output data) and the spectral values of seven bands of Landsat 8 images at the corresponding water samples (input data). The dataset was randomly divided into two groups: the training dataset and testing dataset. The training dataset contained 47 (AN) and 42 (TN and TP) samples, which accounted for 70% of the data and were used to train the parameters of retrieval models. The rest of dataset as the testing dataset contained 20 (AN) and 18 (TN and TP) samples that were used to test the performance of retrieval models.

2.3. Methods

2.3.1. Support Vector Regression (SVR)

SVR is a classical machine learning approach due to its robust ability to capture nonlinear trends. The basic idea of SVR [37] is that the original data points can be mapped from the input space into a feature with higher or even infinite spatial dimensions, where an optimal separating hyperplane with the minimum distance to all data points is established. It has been widely used in water environment prediction [38,39]. In this study, SVR was performed by the radial basis function of package “e1071” [40] in the R language.

2.3.2. Random Forest Regression (RFR)

RFR is a modern nonparametric technique for nonlinear multiple regression [41] and has been one of the best machine learning algorithms in recent years. As an ensemble algorithm, RFR constructs a large set of decision trees (typically hundreds or several thousand) through integrated learning, using one decision tree as the basic unit and each decision tree as a predictor. The composition of the forest is relatively simple, consisting mainly of independent and disconnected decision trees. The result is predicted from the average results of all decision trees with high accuracy and generalization performance. In addition, RFR is efficient on large datasets, can evaluate the significance of each input feature, and can effectively avoid overfitting [42]. The “randomForest” [43] package of R language was used to implement this regression technique.

2.3.3. Artificial Neural Networks (ANNs)

ANNs are similar to the mediation of the human brain, with a nerve cell model as a numerical analysis unit. The back-propagation neural network algorithm [44] used in this study is the most commonly used ANN in simulating the nonlinear relationship between variables. Thus, ANNs play a vital role in artificial intelligence. There are two processes in the primary learning of this algorithm, i.e., a forward computation process and an error back-propagation process. ANNs consist of three main layers: the input layer, hidden layer, and output layer. Neurons in different layers are connected by the corresponding weights, which can be updated in the error back-propagation process by back-propagating the output error in some form layer-by-layer through the hidden layer to the input layer, assigning the error to individual neural units of each neuron in each layer. The details of the ANN can be found in previous studies [21,36,45]. The “neuralnet” package [46] in R language was used to implement the ANN using the “neuralnet” function.

2.3.4. Regression Tree (RT)

RT analysis has been widely used for remote sensing-based environmental monitoring [47]. Generally speaking, the RT model can be considered as a binary splitting process in which the feature space is recursively stratified into subdivisions. Each node has two possible values, i.e., “Yes” or “No”. The basis of stratification is to minimize the deviation from the mean of the response variable. This rule-based regression model consists of four parts, i.e., root nodes, internal nodes, branch nodes, and terminal nodes. The prediction accuracy of the model is highly related to the total number of nodes. The redundant growth of the model leads to a higher possibility of overfitting, which can reduce the prediction accuracy of the test set. Therefore, the necessary pruning of the tree is required. RT analysis can also be applied in nonlinear regression modeling to interpret the relationships between the independent and dependent variables in the form of rulesets. The “rpart” package [48] in the R language was used in this study.

2.3.5. Gradient Boosting Machine (GBM)

The GBM is another popular machine learning algorithm that has the advantages of high accuracy, a fast training process, short prediction time, and a small memory footprint in various applications [49]. Similar to RFR, the GBM [50] also consists of an ensemble of decision trees, but the sequence of trees is created, and each tree in the sequence focuses on the previous tree’s prediction residuals. The innovation of the GBM is its use of a nonparametric approach to estimate the basis function and using gradient descent to approximate the solution in function space. It does not depend on the pre-processing of variables, it can handle missing data, and it can solve overfitting problems. The “gbm” [51] package in the R language was used in this study.

2.4. Accuracy Assessment

The accuracy of these WQP retrieval models was evaluated by the coefficient of determination (R2), the root-mean-square error (RMSE), and the mean absolute percentage error (MAPE). The training and testing of these models, the statistical analysis of model parameters, the calculation of correlation coefficients, and the error analysis were mainly implemented by the R 4.2.0 language with the “raster”, “dplyr”, “e1071”, “randomForest”, “neuralnet”, “rpart”, and “gbm” packages. ArcGIS 10.2.2 and R with the “ggplot2” package were used to map the nutrient concentrations in the NRD spatially and temporally.

3. Results

3.1. Model Training and Testing

The WQP retrieval models for TN, AN, and TP were constructed by SVR, RFR, ANN, RT, and GBM algorithms through training and testing. ANN, RFR, and SVR showed better prediction results than RT and GBM for TN, AN, and TP (Figure 2, Table 2). Since the regression results were close to classification and narrowed the prediction range, the RT and GBM algorithms were unsuitable for the prediction of these WQPs. In addition, the R2 of RT and GBM were below 0.55, and the average was 0.29 in the training and testing datasets (Figure 2(a4,b4,c4,a5,b5,c5), and Table 2).
The ANN had the best prediction for TN in the training dataset, with the R2 above 0.95 and the RMSE around 0.09 mg/L (Figure 2(a3)). However, according to the values of the R2 and RMSE, the ANN performed worse than RFR (R2 = 0.49 and RMSE = 0.33 mg/L) in the testing dataset (Figure 2(a2)). RFR also had a better prediction in the training dataset (R2 = 0.75 and RMSE = 0.34 mg/L). Thus, RFR is the best model for TN. For the prediction of AN, the algorithms are ranked as ANN, RFR, and SVR from the best to the worst in the training and testing datasets. Furthermore, the ANN had the highest R2 values, which were close to 1 and 0.44 for the training and testing datasets, respectively, and the RMSE was less than 0.20 mg/L (Figure 2(b3)). RFR had the highest R2 (0.86) and the lowest RMSE (0.02 mg/L) in the training dataset for TP, but its R2 and RMSE values were the lowest in the testing dataset (Figure 2(c2)). The ANN was the best in the testing dataset (R2 = 0.67 and RMSE = 0.03 mg/L) and similar to RFR in the training dataset (Figure 2(c3)). Therefore, ANN is the best method for TP.
In general, these models constructed by Landsat 8 and machine learning algorithms can consistently monitor TN (by RFR), AN, and TP (both by ANN) at the national cross-section and in situ site samplings. The spread of predicted and measured WQPs was close to the 1:1 line. In addition, these WQPs had significant linear correlations for the testing dataset (Table 2), where the R2 ranged between 0.44 and 0.67, and the RMSE was 0.03–0.33 mg/L. The results indicated that these selected models could quantitatively monitoring TN, AN, and TP in the study area.

3.2. Spatiotemporal Variability of N and P Concentrations

TN, AN, and TP concentrations in the NRD were monitored for the whole region (All) and three subregions (A, B, and C) in the dry, normal, and wet seasons from 2013 to 2022 using the selected models. Temporal change analysis was not conducted in the dry season due to the small amount of Landsat 8 images (N = 3).
The TN concentration, ranging from 0.69 to 1.72 mg/L (Figure 3 and Figure S1) in class III to V water quality, was low in the dry season (mean 1.09 ± 0.11 mg/L) with the lowest concentration in region A, and high in the wet season (mean 1.27 ± 0.20 mg/L) with the highest concentration in region C. In all seasons, the maximum value of TN always occurred in region C, followed by region B. In addition, the spatial distribution of the TN concentration varied enormously in the normal and wet seasons but was more consistent in the dry season. Temporally, the TN had no significant change in the wet and normal seasons (Figure 4, Table S2 and Figure S1). The highest value (1.44 ± 0.17 mg/L) of TN was monitored in region C in 2016, and the lowest value (1.06 ± 0.11 mg/L) was monitored in region A in 2019 in the NRD’s normal season.
The AN concentration, ranging between 0.03 and 1.21 mg/L (Figure 3 and Figure S2) in class I to III water quality, was low in the dry season (mean 0.29 ± 0.21 mg/L) and high in the normal season (mean 0.48 ± 0.21 mg/L); the lowest and the highest concentrations both occurred in region B. The maximum concentration of AN in the dry season was monitored in region C, followed by region A; and in the normal and wet seasons, it was monitored in region B, followed by region C. Moreover, the spatial distribution of AN varied strongly in all seasons, especially in the dry season. Temporally, the AN concentration decreased significantly in the normal and wet seasons, especially in the normal season with a rate of 0.02 mg/a (Figure 5, Table S3 and Figure S2). The highest (0.71 ± 0.21 mg/L) and lowest (0.08 ± 0.14 mg/L) concentrations of AN were in region B’s wet seasons of 2015 and 2020, respectively.
The TP concentration, ranging from 0.01 to 0.28 mg/L (Figure 3 and Figure S3) in class I to IV water quality, was low in the wet season (mean 0.07 ± 0.02 mg/L) with the lowest concentration in region A, and high in the normal season (mean 0.09 ± 0.03 mg/L) with the highest concentration in region B. In the dry and wet seasons, the maximum concentration of TP occurred in region C, followed by region B; and in the normal season, the maximum concentration of TP occurred in region B, followed by region A. In addition, the spatial distribution of TP varied strongly in the dry and normal seasons but was more consistent in the wet season. Temporally, the decrease rate of TP passed the significance test only in the normal season in region A but was close to 0 (Figure 6, Table S4 and Figure S3). The highest value (0.10 ± 0.01 mg/L) of TP was observed in the wet season of 2013 in region C, and the lowest value (0.06 ± 0.01 mg/L) was observed in the normal season of 2015 in region C.

4. Discussion

4.1. Models Comparison and Future Optimization

The results show that the concentrations of TN, AN, and TP, especially TP, can be accurately monitored faster, easier, and more low-cost by machine learning algorithms and Landsat 8 data in the tropical river. In these models, the underlying hydrologic and environmental processes don’t need to be explicitly described in a mathematical form [52]. Therefore, these models can be used without many variables. In other words, these constructed models only need a sufficient amount of surface water quality monitoring data and remote sensing data from the same period.
Using field measurements, sampling, and laboratory analysis to monitor the concentration of WQPs is expensive and time-consuming. Currently, the remote sensing retrieval models have not completely replaced experimental analysis. However, the pixels of Landsat 8 images are equivalent to setting a sampling point in a 30 × 30 m2 water area. Thus, Landsat 8 has a much higher spatial sampling density than conventional sampling. Higher resolution can better indicate large-scale changes in nutrient concentrations [38]. It can further help to optimize the location of water samples in watersheds and maximize the effectiveness of the data collection strategy. In the long term, satellite observations highly reduce uncertainty due to the sampling intervals and the uneven spatial distribution of sampling. It is of great significance to help fully interpret the changes and rules of river nutrient concentrations and the contribution of various non-point source pollution.
As a complex biogeochemical system, the aquatic ecosystem contains numerous chemical, physical, and biological components that undergo a large range of integrated transformation processes. It is challenging to find the associations between WQPs and bands of remote sensing in bodies of water. The existing regression equations had high accuracy when fitted in fixed waters. However, when they were extended and applied to other waters, the results were relatively coarse. This study also compared the monitoring performance of the selected models with previous regression equations [29,30,53] for TN and TP concentrations in the NRD. Since there are few regression equations existing for AN, it is not compared in the study. Figure 7, Tables S5 and S6 indicate that the previous regression equations applicable to specific regions are difficult to apply to monitoring TN and TP in the NRD. Most empirical methods establish statistical relationships between nutrient concentrations and remotely sense reflectance on the basis of linear, exponential, or logarithmic regressions [36,54,55] without strict theoretical foundations. Furthermore, the regression fits for single and multiband combinations were susceptible to changes in the specified bands. Thus, they were poorly generalized and did not apply to other conditions or locations [56].
While excellent model performance was demonstrated in this study, some additional aspects need to be further investigated in future studies. More advanced deep learning and artificial intelligence algorithms with large training and testing datasets from various rivers should be used to avoid overfitting and improve the monitoring accuracy and robustness of the retrieval model. Considering the uncertainties of the model parameters, structure, and input data, monitoring results’ uncertainty should be quantified to facilitate data interpretation [39].

4.2. Water Quality Evaluation

This study area is located in the Haikou, Dingan, and Chengmai regions of Hainan Province, China. From the Hainan statistical yearbook [9], this area has a resident population of ~3.67 million (75.66% in urban), a cultivated land area of ~10.29 × 104 ha, a freshwater aquatic area of ~7742 ha, chemical fertilizer consumption of ~26.46 × 104 t, numbers of livestock and poultry raised in the whole year of ~162.17 and ~5645.26 million heads, and little industry in 2020, respectively. The area is a typical tropical agricultural town non-point source pollution area, and the pollution source is mainly the discharges of agriculture and domestic sewage [57].
The results indicated the seasonality of TN, AN, and TP distribution in the NRD. TN and AN concentrations were high in the wet and normal seasons and low in the dry season. This result was similar to previous studies on the Yiluo River [58], the Ru River [18], and the Yangtze River [30]. The wet and normal seasons almost overlap with the rainy season in this area. Increased rainfall could result in N being exported into the water body through runoff. Rainfall often imports a lot of N from agricultural and domestic sources [59,60]. Another reason may be N deposition. Chen et al. [61] found that there was more atmospheric N deposition in summer than in other seasons. In the urban region, wastewater is probably the largest nutrient source [62,63], and this area has structurally unsound sewers and relatively simple sewage treatment processes. Sewers often leak at low flows and may surcharge and overflow during the inflow and infiltration in wet weather, which is related to the age of infrastructure, the pipe materials, and the dimensions of pipes [64,65]. Thus, the NRD showed a gradually decreasing distribution of N concentration from the estuary to the upstream with the inflow of other tributaries.
However, TP concentration was high in the normal season and low in the wet season. This result was similar to previous studies on the Pearl River [66] and the Fuyang River [67]. In contrast to N, there are more forms of P existing in water bodies, especially dissolved P, which can be adsorbed on mineral surfaces [68]. Thus, P is easily present in sediments by binding to aluminum oxides, interiors of iron, and calcium oxide [69]. Replenishing many clean water bodies in the NRD has diluted the TP of surface water in the wet season. The replenishment decreases during the normal and dry seasons, aquatic plant residue decomposes, and sediments release nutrients into surface water, increasing the TP. In addition, Berthold et al. [70] reported that for coastal waters, P deposition is a considerable nutrient flux. These may be the reasons for the order of TP in these regions, i.e., C > B > A, in the wet and dry seasons.
Temporally, AN had a significant decreasing trend in the normal and wet seasons. The reason for this is the significant investments in wastewater discharge standards and pollution control strategies that were made to address the water pollution crisis and promote eco-environment restoration [5].

4.3. Limitations

Ideally, the more Landsat 8 images are used, the more accurate the retrieval model will be, and the N and P concentrations’ spatial distribution and temporal trends will be more clearly identified in the NRD. However, this study only obtained a limited number of Landsat 8 images due to the coarse temporal (16 days) and spatial (30 m) resolutions and the heavy cloud cover in tropical areas. These disadvantages limit the application of satellite remote sensing for inland water research in tropics [35].
Furthermore, satellite observations are almost instantaneous, and therefore the high temporal variability of nutrient concentrations is an important consideration. The results of 24 h continuous TN, AN, and TP monitoring that grouped different subregions and hydrological seasons in the NRD are shown in Figure 8. Nutrient concentrations varied slightly throughout the day except on a few individual dates, and the scene center time of Landsat 8 does not overlap the time in which the extremum value occurs. Therefore, the satellite monitoring results are representative of the daily mean values. Further studies should focus on achieving high-temporal-resolution (e.g., hourly) monitoring of nutrient concentrations in a river by using geostationary satellite images [71,72] or assimilating remote sensing and water quality models [73,74,75].
In addition, hyperspectral remote sensing is a feasible solution to improve the accuracy of the WQP retrieval model. It can obtain abundant information about inland water bodies, making it possible to accurately identify nutrient concentrations due to high spatial and spectral resolution [76]. It has been applied to inland water quality monitoring, such as TN, TP, AN, cyanobacterial blooms, chlorophyll-a, total suspended matter, and colored dissolved organic matter [33,35,77,78,79,80]. However, portable, proximal, and unmanned aerial vehicle-borne hyperspectral data can only observe limited areas. The entire area of the NRD is difficult to explore because it has about 110.7 km of waterways. Furthermore, trade-offs among spectral, spatial, and radiometric resolutions limit the spatial resolution of the spaceborne hyperspectral images to medium or coarse resolutions [81,82]. Moreover, few long-term series continuous data of hyperspectral remote sensing are now available for temporal variation studies. Therefore, multispectral remote sensing is a better choice for studying the spatiotemporal variation of WQPs in large-scale inland areas.

5. Conclusions

Machine learning algorithms for monitoring optically inactive WQPs were constructed in this study by Landsat 8 images and ground monitoring data. The TN concentration can be effectively predicted by RFR, and AN and TP by ANN. These retrieval models had much higher spatial resolutions than conventional field sampling and were more accurate than previous regression equations for TN and TP in the NRD, which is a large-scale tropical inland river. They can be used to interpret the changes and rules of TN, AN, and TP in the NRD. The spatiotemporal evolution of TN, AN, and TP in the NRD was explored in the dry, normal, and wet seasons at the pixel level from 2013 to 2022. The concentrations of N and P showed significant seasonal changes. In the normal and wet seasons, AN had a significant decreasing trend, while TN and TP had almost no significant change. Considering the supervision and management of non-point source pollution in the NRD, TN and AN should be paid more attention to in the normal and wet seasons of urban and agricultural zones, respectively; TP, however, should be focused on in the normal season of agriculture zones. The results can give a large-scale spatial overview of the water quality, identify the sensitive areas and periods with water pollution, provide an important basis for the supervision and management of non-point source pollution in the whole region, and be applied to other similar tropical rivers. This study demonstrated that multispectral remote sensing, currently, is a better choice for studying the spatiotemporal variation of optically inactive WQPs in large-scale and long-term tropical inland waters. Further studies can use more advanced deep learning and artificial intelligence algorithms with hyperspectral remote sensing to improve the robustness and accuracy of these retrieval models in different water bodies. In the future, the retrieval model could be extended for larger tropical regions and more WQPs, assist in identifying the pollution source, and provide insight into nutrient dynamics and transfers from inland to ocean waters.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/rs14215466/s1, Table S1: classification standards of water quality levels in GB 3838–2002; Table S2: statistical information of TN for all regions and seasons; Table S3: statistical information of AN for all regions and seasons; Table S4: statistical information of TP for all regions and seasons; Table S5: statistical information of RFR for TN compare with previous regression equations in the testing dataset; Table S6: statistical information of ANN for TP compare with previous regression equations in the testing dataset; Figure S1: the spatial distribution of TN in NRD; Figure S2: the spatial distribution of AN in NRD; Figure S3: the spatial distribution of TP in NRD.

Author Contributions

Conceptualization, N.L. and M.C.; methodology, N.L., D.Z. and M.C.; software, N.L. and Z.N.; validation, N.L. and M.C.; formal analysis, N.L.; investigation, N.L., H.L., X.C., W.L., W.Z. and Y.C.; resources, N.L., C.H. and R.B.; data curation, N.L., H.L., W.L., W.Z. and Y.C.; writing—original draft preparation, N.L., Z.N. and M.C.; writing—review and editing, N.L., Z.N., D.Z., M.C., D.W. and R.B.; visualization, N.L. and Z.N.; supervision, M.C., Q.L. and L.Z.; project administration, N.L. and M.C.; funding acquisition, N.L., M.C. and Q.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Science and Technology Plan of Hainan Province, China (ZDKJ2021008), the Key Research and Development Project of Hannan Province, China (ZDYF2022XDNY210), and the Central Public-interest Scientific Institution Basal Research Fund, China (1630042022022).

Data Availability Statement

The data used in this study are available from the corresponding website.

Acknowledgments

The authors would like to acknowledge the data provided by the Environmental Knowledges Service System, the United States Geological Survey (USGS), and the anonymous reviewers for their constructive comments and suggestions to improve this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, J.; Yang, W. Water Sustainability for China and Beyond. Science 2012, 337, 649–650. [Google Scholar] [CrossRef] [PubMed]
  2. Liu, J.; Diamond, J. China’s environment in a globalizing world. Nature 2005, 435, 1179–1186. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, M.; Janssen, A.B.G.; Bazin, J.; Strokal, M.; Ma, L.; Kroeze, C. Accounting for interactions between Sustainable Development Goals is essential for water pollution control in China. Nat. Commun. 2022, 13, 730. [Google Scholar] [CrossRef] [PubMed]
  4. Turner, S.W.D.; Rice, J.S.; Nelson, K.D.; Vernon, C.R.; McManamay, R.; Dickson, K.; Marston, L. Comparison of potential drinking water source contamination across one hundred U.S. cities. Nat. Commun. 2021, 12, 7254. [Google Scholar] [CrossRef] [PubMed]
  5. Ma, T.; Zhao, N.; Ni, Y.; Yi, J.; Wilson, J.P.; He, L.; Du, Y.; Pei, T.; Zhou, C.; Song, C.; et al. China’s improving inland surface water quality since 2003. Sci. Adv. 2020, 6, eaau3798. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Lu, Y.; Song, S.; Wang, R.; Liu, Z.; Meng, J.; Sweetman, A.J.; Jenkins, A.; Ferrier, R.C.; Li, H.; Luo, W.; et al. Impacts of soil and water pollution on food safety and health risks in China. Environ. Int. 2015, 77, 5–15. [Google Scholar] [CrossRef] [Green Version]
  7. Chen, B.; Wang, M.; Duan, M.; Ma, X.; Hong, J.; Xie, F.; Zhang, R.; Li, X. In search of key: Protecting human health and the ecosystem from water pollution in China. J. Clean. Prod. 2019, 228, 101–111. [Google Scholar] [CrossRef]
  8. Schwarzenbach, R.P.; Egli, T.; Hofstetter, T.B.; von Gunten, U.; Wehrli, B. Global Water Pollution and Human Health. Annu. Rev. Environ. Resour. 2010, 35, 109–136. [Google Scholar] [CrossRef]
  9. Hainan Provincial Bureau of Statistics; Survey Office of National Bureau of Statistics in Hainan. Hainan Statistical Yearbook 2021; China Statistics Press: Beijing, China, 2021; Volume 35.
  10. Li, T.; Hong, X.; Liu, S.; Wu, X.; Fu, S.; Liang, Y.; Li, J.; Li, R.; Zhang, C.; Song, X.; et al. Cropland degradation and nutrient overload on Hainan Island: A review and synthesis. Environ. Pollut. 2022, 313, 120100. [Google Scholar] [CrossRef]
  11. Jordan, Y.C.; Ghulam, A.; Hartling, S. Traits of surface water pollution under climate and land use changes: A remote sensing and hydrological modeling approach. Earth-Sci. Rev. 2014, 128, 181–195. [Google Scholar] [CrossRef]
  12. Ebenstein, A. The Consequences of Industrialization: Evidence from Water Pollution and Digestive Cancers in China. Rev. Econ. Stat. 2012, 94, 186–201. [Google Scholar] [CrossRef] [Green Version]
  13. Huddell, A.M.; Galford, G.L.; Tully, K.L.; Crowley, C.; Palm, C.A.; Neill, C.; Hickman, J.E.; Menge, D.N.L. Meta-analysis on the potential for increasing nitrogen losses from intensifying tropical agriculture. Glob. Chang. Biol. 2020, 26, 1668–1680. [Google Scholar] [CrossRef]
  14. Nguyen, T.T.N.; Nemery, J.; Gratiot, N.; Strady, E.; Tran, V.Q.; Nguyen, A.T.; Aime, J.; Peyne, A. Nutrient dynamics and eutrophication assessment in the tropical river system of Saigon—Dongnai (southern Vietnam). Sci. Total Environ. 2019, 653, 370–383. [Google Scholar] [CrossRef] [PubMed]
  15. McDowell, W.H.; McDowell, W.G.; Potter, J.D.; Ramírez, A. Nutrient export and elemental stoichiometry in an urban tropical river. Ecol. Appl. 2019, 29, e01839. [Google Scholar] [CrossRef] [PubMed]
  16. Liu, X.L.; Han, G.; Zeng, J.; Liu, M.; Li, X.Q.; Boeckx, P. Identifying the sources of nitrate contamination using a combined dual isotope, chemical and Bayesian model approach in a tropical agricultural river: Case study in the Mun River, Thailand. Sci. Total Environ. 2021, 760, 143938. [Google Scholar] [CrossRef] [PubMed]
  17. Liang, X.; Zhao, H.; He, Y.; Zhu, L.; Zou, Y.; Ye, C. Spatiotemporal characteristics of agricultural nitrogen and phosphorus emissions to water and its source identification: A case in Bamen BayChina. J. Contam. Hydrol. 2022, 245, 103936. [Google Scholar] [CrossRef] [PubMed]
  18. Yang, X.; Liu, Q.; Fu, G.; He, Y.; Luo, X.; Zheng, Z. Spatiotemporal patterns and source attribution of nitrogen load in a river basin with complex pollution sources. Water Res. 2016, 94, 187–199. [Google Scholar] [CrossRef] [Green Version]
  19. Beretta-Blanco, A.; Carrasco-Letelier, L. Relevant factors in the eutrophication of the Uruguay River and the Rio Negro. Sci. Total Environ. 2021, 761, 143299. [Google Scholar] [CrossRef]
  20. Zhang, D.; Zhang, L.; Sun, X.; Gao, Y.; Lan, Z.; Wang, Y.; Zhai, H.; Li, J.; Wang, W.; Chen, M.; et al. A New Method for Calculating Water Quality Parameters by Integrating Space–Ground Hyperspectral Data and Spectral-In Situ Assay Data. Remote Sens. 2022, 14, 3652. [Google Scholar] [CrossRef]
  21. Zhang, Y.; Wu, L.; Deng, L.; Ouyang, B. Retrieval of water quality parameters from hyperspectral images using a hybrid feedback deep factorization machine model. Water Res. 2021, 204, 117618. [Google Scholar] [CrossRef]
  22. Mansaray, A.S.; Dzialowski, A.R.; Martin, M.E.; Wagner, K.L.; Gholizadeh, H.; Stoodley, S.H. Comparing PlanetScope to Landsat-8 and Sentinel-2 for Sensing Water Quality in Reservoirs in Agricultural Watersheds. Remote Sens. 2021, 13, 1847. [Google Scholar] [CrossRef]
  23. Rotta, L.; Alcântara, E.; Park, E.; Bernardo, N.; Watanabe, F. A single semi-analytical algorithm to retrieve chlorophyll-a concentration in oligo-to-hypereutrophic waters of a tropical reservoir cascade. Ecol. Indic. 2021, 120. [Google Scholar] [CrossRef]
  24. Minnett, P.J.; Alvera-Azcárate, A.; Chin, T.M.; Corlett, G.K.; Gentemann, C.L.; Karagali, I.; Li, X.; Marsouin, A.; Marullo, S.; Maturi, E.; et al. Half a century of satellite remote sensing of sea-surface temperature. Remote Sens. Environ. 2019, 233, 111366. [Google Scholar] [CrossRef]
  25. Kuhn, C.; de Matos Valerio, A.; Ward, N.; Loken, L.; Sawakuchi, H.O.; Kampel, M.; Richey, J.; Stadler, P.; Crawford, J.; Striegl, R.; et al. Performance of Landsat-8 and Sentinel-2 surface reflectance products for river remote sensing retrievals of chlorophyll-a and turbidity. Remote Sens. Environ. 2019, 224, 104–118. [Google Scholar] [CrossRef] [Green Version]
  26. Balasubramanian, S.V.; Pahlevan, N.; Smith, B.; Binding, C.; Schalles, J.; Loisel, H.; Gurlin, D.; Greb, S.; Alikas, K.; Randla, M.; et al. Robust algorithm for estimating total suspended solids (TSS) in inland and nearshore coastal waters. Remote Sens. Environ. 2020, 246, 111768. [Google Scholar] [CrossRef]
  27. Sagan, V.; Peterson, K.T.; Maimaitijiang, M.; Sidike, P.; Sloan, J.; Greeling, B.A.; Maalouf, S.; Adams, C. Monitoring inland water quality using remote sensing: Potential and limitations of spectral indices, bio-optical simulations, machine learning, and cloud computing. Earth-Sci. Rev. 2020, 205, 103187. [Google Scholar] [CrossRef]
  28. He, Y.; Gong, Z.; Zheng, Y.; Zhang, Y. Inland Reservoir Water Quality Inversion and Eutrophication Evaluation Using BP Neural Network and Remote Sensing Imagery: A Case Study of Dashahe Reservoir. Water 2021, 13, 2844. [Google Scholar] [CrossRef]
  29. Shang, W.; Jin, S.; He, Y.; Zhang, Y.; Li, J. Spatial–Temporal Variations of Total Nitrogen and Phosphorus in Poyang, Dongting and Taihu Lakes from Landsat-8 Data. Water 2021, 13, 1704. [Google Scholar] [CrossRef]
  30. He, Y.; Jin, S.; Shang, W. Water Quality Variability and Related Factors along the Yangtze River Using Landsat-8. Remote Sens. 2021, 13, 2241. [Google Scholar] [CrossRef]
  31. Politi, E.; Cutler, M.E.J.; Rowan, J.S. Evaluating the spatial transferability and temporal repeatability of remote-sensing-based lake water quality retrieval algorithms at the European scale: A meta-analysis approach. Int. J. Remote Sens. 2015, 36, 2995–3023. [Google Scholar] [CrossRef]
  32. Pahlevan, N.; Smith, B.; Schalles, J.; Binding, C.; Cao, Z.; Ma, R.; Alikas, K.; Kangro, K.; Gurlin, D.; Hà, N.; et al. Seamless retrievals of chlorophyll-a from Sentinel-2 (MSI) and Sentinel-3 (OLCI) in inland and coastal waters: A machine-learning approach. Remote Sens. Environ. 2020, 240, 111604. [Google Scholar] [CrossRef]
  33. Qun’ou, J.; Lidan, X.; Siyang, S.; Meilin, W.; Huijie, X. Retrieval model for total nitrogen concentration based on UAV hyper spectral remote sensing data and machine learning algorithms—A case study in the Miyun Reservoir, China. Ecol. Indic. 2021, 124, 107356. [Google Scholar] [CrossRef]
  34. Qiao, Z.; Sun, S.; Jiang, Q.o.; Xiao, L.; Wang, Y.; Yan, H. Retrieval of Total Phosphorus Concentration in the Surface Water of Miyun Reservoir Based on Remote Sensing Data and Machine Learning Algorithms. Remote Sens. 2021, 13, 4662. [Google Scholar] [CrossRef]
  35. Sun, X.; Zhang, Y.; Shi, K.; Zhang, Y.; Li, N.; Wang, W.; Huang, X.; Qin, B. Monitoring water quality using proximal remote sensing technology. Sci. Total Environ. 2022, 803, 149805. [Google Scholar] [CrossRef]
  36. Vakili, T.; Amanollahi, J. Determination of optically inactive water quality variables using Landsat 8 data: A case study in Geshlagh reservoir affected by agricultural land use. J. Clean. Prod. 2020, 247, 119134. [Google Scholar] [CrossRef]
  37. Smola, A.J.; Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef] [Green Version]
  38. Yang, Y.; Shang, X.; Chen, Z.; Mei, K.; Wang, Z.; Dahlgren, R.A.; Zhang, M.; Ji, X. A support vector regression model to predict nitrate-nitrogen isotopic composition using hydro-chemical variables. J. Environ. Manag. 2021, 290, 112674. [Google Scholar] [CrossRef]
  39. Noori, R.; Yeh, H.-D.; Abbasi, M.; Kachoosangi, F.T.; Moazami, S. Uncertainty analysis of support vector machine for online prediction of five-day biochemical oxygen demand. J. Hydrol. 2015, 527, 833–843. [Google Scholar] [CrossRef]
  40. Meyer, D.; Dimitriadou, E.; Hornik, K.; Weingessel, A.; Leisch, F.; Chang, C.-C.; Lin, C.-C. Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien [R Package ‘e1071’ Version 1.7-11]. Available online: https://cran.r-project.org/web/packages/e1071/index.html (accessed on 14 July 2022).
  41. Svetnik, V.; Liaw, A.; Tong, C.; Sheridan, J.C.C.R.P.; Feuston, B.P. Random Forest:  A Classification and Regression Tool for Compound Classification and QSAR Modeling. J. Chem. Inf. Comput. Sci. 2003, 43, 1947–1958. [Google Scholar] [CrossRef]
  42. De’ath, G.; Fabricius, K.E. Classification and Regression Trees: A Powerful Yet Simple Technique for Ecological Data Analysis. Ecology 2000, 81, 3178–3192. [Google Scholar] [CrossRef]
  43. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  44. Heermann, P.D.; Khazenie, N. Classification of multispectral remote sensing data using a back-propagation neural network. IEEE Trans. Geosci. Remote Sens. 1992, 30, 81–88. [Google Scholar] [CrossRef]
  45. Balokas, G.; Czichon, S.; Rolfes, R. Neural network assisted multiscale analysis for the elastic properties prediction of 3D braided composites under uncertainty. Compos. Struct. 2018, 183, 550–562. [Google Scholar] [CrossRef]
  46. Fritsch, S.; Guenther, F.; Wright, M.N.; Suling, M.; Mueller, S.M. Neuralnet: Training of Neural Networks. R J. 2019, 2, 30. [Google Scholar]
  47. Johnson, B.; Jozdani, S. Identifying Generalizable Image Segmentation Parameters for Urban Land Cover Mapping through Meta-Analysis and Regression Tree Modeling. Remote Sens. 2018, 10, 73. [Google Scholar] [CrossRef] [Green Version]
  48. Therneau, T.; Atkinson, B.; Ripley, B. Recursive Partitioning and Regression Trees [R Package ‘rpart’ Version 4.1.16]. Available online: https://cran.r-project.org/web/packages/rpart/index.html (accessed on 14 July 2022).
  49. Arumugam, P.; Chemura, A.; Schauberger, B.; Gornott, C. Remote Sensing Based Yield Estimation of Rice (Oryza sativa L.) Using Gradient Boosted Regression in India. Remote Sens. 2021, 13, 2379. [Google Scholar] [CrossRef]
  50. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 1189–1232. [Google Scholar] [CrossRef]
  51. Greenwell, B.; Boehmke, B.; Cunningham, J.; GBM Developers. Generalized Boosted Regression Models [R Package ‘gbm’ Version 2.1.8]. Available online: https://cran.r-project.org/web/packages/gbm/index.html (accessed on 15 July 2022).
  52. Ahmed, A.N.; Othman, F.B.; Afan, H.A.; Ibrahim, R.K.; Fai, C.M.; Hossain, M.S.; Ehteram, M.; Elshafie, A. Machine learning methods for better water quality prediction. J. Hydrol. 2019, 578, 124084. [Google Scholar] [CrossRef]
  53. Lim, J.; Choi, M. Assessment of water quality based on Landsat 8 operational land imager associated with human activities in Korea. Environ. Monit. Assess. 2015, 187, 384. [Google Scholar] [CrossRef]
  54. Li, Y.; Zhang, Y.; Shi, K.; Zhu, G.; Zhou, Y.; Zhang, Y.; Guo, Y. Monitoring spatiotemporal variations in nutrients in a large drinking water reservoir and their relationships with hydrological and meteorological conditions based on Landsat 8 imagery. Sci. Total Environ. 2017, 599–600, 1705–1717. [Google Scholar] [CrossRef]
  55. Gao, Y.; Gao, J.; Yin, H.; Liu, C.; Xia, T.; Wang, J.; Huang, Q. Remote sensing estimation of the total phosphorus concentration in a large lake using band combinations and regional multivariate statistical modeling techniques. J. Environ. Manag. 2015, 151, 33–43. [Google Scholar] [CrossRef] [PubMed]
  56. Palmer, S.C.J.; Kutser, T.; Hunter, P.D. Remote sensing of inland waters: Challenges, progress and future directions. Remote Sens. Environ. 2015, 157, 1–8. [Google Scholar] [CrossRef] [Green Version]
  57. Zou, L.; Liu, Y.; Wang, Y.; Hu, X. Assessment and analysis of agricultural non-point source pollution loads in China: 1978–2017. J. Environ. Manag. 2020, 263, 110400. [Google Scholar] [CrossRef] [PubMed]
  58. Wang, X.; Xu, Y.J.; Zhang, L. Watershed scale spatiotemporal nitrogen transport and source tracing using dual isotopes among surface water, sediments and groundwater in the Yiluo River Watershed, Middle of China. Sci. Total Environ. 2022, 833, 155180. [Google Scholar] [CrossRef] [PubMed]
  59. Chen, N.; Wu, J.; Hong, H. Effect of storm events on riverine nitrogen dynamics in a subtropical watershed, southeastern China. Sci. Total Environ. 2012, 431, 357–365. [Google Scholar] [CrossRef]
  60. Zhang, Y.; Shi, K.; Zhou, Y.; Liu, X.; Qin, B. Monitoring the river plume induced by heavy rainfall events in large, shallow, Lake Taihu using MODIS 250m imagery. Remote Sens. Environ. 2016, 173, 109–121. [Google Scholar] [CrossRef]
  61. Chen, X.; Strokal, M.; Kroeze, C.; Ma, L.; Shen, Z.; Wu, J.; Chen, X.; Shi, X. Seasonality in river export of nitrogen: A modelling approach for the Yangtze River. Sci. Total Environ. 2019, 671, 1282–1292. [Google Scholar] [CrossRef]
  62. Kaushal, S.S.; Groffman, P.M.; Band, L.E.; Elliott, E.M.; Shields, C.A.; Kendall, C. Tracking nonpoint source nitrogen pollution in human-impacted watersheds. Environ. Sci. Technol 2011, 45, 8225–8232. [Google Scholar] [CrossRef]
  63. Delesantro, J.M.; Duncan, J.M.; Riveros-Iregui, D.; Blaszczak, J.R.; Bernhardt, E.S.; Urban, D.L.; Band, L.E. The Nonpoint Sources and Transport of Baseflow Nitrogen Loading across a Developed Rural-Urban Gradient. Water Resour. Res. 2022, 58, e2021WR031533. [Google Scholar] [CrossRef]
  64. Rieckermann, J.; Bares, V.; Kracht, O.; Braun, D.; Gujer, W. Estimating sewer leakage from continuous tracer experiments. Water Res. 2007, 41, 1960–1972. [Google Scholar] [CrossRef]
  65. Rueedi, J.; Cronin, A.A.; Morris, B.L. Estimation of sewer leakage to urban groundwater using depth-specific hydrochemistry. Water Environ. J. 2009, 23, 134–144. [Google Scholar] [CrossRef]
  66. Lu, S.; Deng, R.; Liang, Y.; Xiong, L.; Ai, X.; Qin, Y. Remote Sensing Retrieval of Total Phosphorus in the Pearl River Channels Based on the GF-1 Remote Sensing Data. Remote Sens. 2020, 12, 1420. [Google Scholar] [CrossRef]
  67. Zhang, W.; Jin, X.; Liu, D.; Lang, C.; Shan, B. Temporal and spatial variation of nitrogen and phosphorus and eutrophication assessment for a typical arid river—Fuyang River in northern China. J. Environ. Sci. 2017, 55, 41–48. [Google Scholar] [CrossRef] [PubMed]
  68. Maavara, T.; Chen, Q.; Van Meter, K.; Brown, L.E.; Zhang, J.; Ni, J.; Zarfl, C. River dam impacts on biogeochemical cycling. Nat. Rev. Earth Environ. 2020, 1, 103–116. [Google Scholar] [CrossRef] [Green Version]
  69. Yin, Y.; Zhang, W.; Tang, J.; Chen, X.; Zhang, Y.; Cao, X.; Li, Q. Impact of river dams on phosphorus migration: A case of the Pubugou Reservoir on the Dadu River in China. Sci. Total Environ. 2022, 809, 151092. [Google Scholar] [CrossRef] [PubMed]
  70. Berthold, M.; Wulff, R.; Reiff, V.; Karsten, U.; Nausch, G.; Schumann, R. Magnitude and influence of atmospheric phosphorus deposition on the southern Baltic Sea coast over 23 years: Implications for coastal waters. Environ. Sci. Eur. 2019, 31, 27. [Google Scholar] [CrossRef] [Green Version]
  71. Huang, C.; Shi, K.; Yang, H.; Li, Y.; Zhu, A.x.; Sun, D.; Xu, L.; Zou, J.; Chen, X. Satellite observation of hourly dynamic characteristics of algae with Geostationary Ocean Color Imager (GOCI) data in Lake Taihu. Remote Sens. Environ. 2015, 159, 278–287. [Google Scholar] [CrossRef]
  72. Jiang, H.; Lu, N.; Qin, J.; Tang, W.; Yao, L. A deep learning algorithm to estimate hourly global solar radiation from geostationary satellite data. Renew. Sustain. Energy Rev. 2019, 114. [Google Scholar] [CrossRef]
  73. Cho, K.H.; Pachepsky, Y.; Ligaray, M.; Kwon, Y.; Kim, K.H. Data assimilation in surface water quality modeling: A review. Water Res 2020, 186, 116307. [Google Scholar] [CrossRef]
  74. Wu, S.; Yang, P.; Ren, J.; Chen, Z.; Li, H. Regional winter wheat yield estimation based on the WOFOST model and a novel VW-4DEnSRF assimilation algorithm. Remote Sens. Environ. 2021, 255, 112276. [Google Scholar] [CrossRef]
  75. Baracchini, T.; Chu, P.Y.; Šukys, J.; Lieberherr, G.; Wunderle, S.; Wüest, A.; Bouffard, D. Data assimilation of in situ and satellite remote sensing data to 3D hydrodynamic lake models: A case study using Delft3D-FLOW v4.03 and OpenDA v2.4. Geosci. Model Dev. 2020, 13, 1267–1284. [Google Scholar] [CrossRef]
  76. Bioucas-Dias, J.M.; Plaza, A.; Camps-Valls, G.; Scheunders, P.; Nasrabadi, N.; Chanussot, J. Hyperspectral Remote Sensing Data Analysis and Future Challenges. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–36. [Google Scholar] [CrossRef] [Green Version]
  77. Cao, Q.; Yu, G.; Sun, S.; Dou, Y.; Li, H.; Qiao, Z. Monitoring Water Quality of the Haihe River Based on Ground-Based Hyperspectral Remote Sensing. Water 2021, 14, 22. [Google Scholar] [CrossRef]
  78. Kwon, Y.S.; Pyo, J.; Kwon, Y.-H.; Duan, H.; Cho, K.H.; Park, Y. Drone-based hyperspectral remote sensing of cyanobacteria using vertical cumulative pigment concentration in a deep reservoir. Remote Sens. Environ. 2020, 236, 111517. [Google Scholar] [CrossRef]
  79. Pahlevan, N.; Smith, B.; Binding, C.; Gurlin, D.; Li, L.; Bresciani, M.; Giardino, C. Hyperspectral retrievals of phytoplankton absorption and chlorophyll-a in inland and nearshore coastal waters. Remote Sens. Environ. 2021, 253, 112200. [Google Scholar] [CrossRef]
  80. Park, Y.; Pyo, J.; Kwon, Y.S.; Cha, Y.; Lee, H.; Kang, T.; Cho, K.H. Evaluating physico-chemical influences on cyanobacterial blooms using hyperspectral images in inland water, Korea. Water Res. 2017, 126, 319–328. [Google Scholar] [CrossRef] [PubMed]
  81. Gege, P.; Dekker, A.G. Spectral and Radiometric Measurement Requirements for Inland, Coastal and Reef Waters. Remote Sens. 2020, 12, 2247. [Google Scholar] [CrossRef]
  82. Zhong, Y.; Wang, X.; Xu, Y.; Wang, S.; Jia, T.; Hu, X.; Zhao, J.; Wei, L.; Zhang, L. Mini-UAV-Borne Hyperspectral Remote Sensing: From Observation and Processing to Applications. IEEE Geosci. Remote Sens. Mag. 2018, 6, 46–62. [Google Scholar] [CrossRef]
Figure 1. Location of the Nandu River downstream and distribution of sampling sites. The map of China is produced under the supervised by the Ministry of Natural Resources of China with the map content approval number: GS(2016)1554.
Figure 1. Location of the Nandu River downstream and distribution of sampling sites. The map of China is produced under the supervised by the Ministry of Natural Resources of China with the map content approval number: GS(2016)1554.
Remotesensing 14 05466 g001
Figure 2. Relationships between in situ measured and Landsat 8 predicted TN (a), AN (b), and TP (c) for pixel–sample matchups. (15) indicate different machine learning algorithms: SVR, RFR, ANN, RT, and GBM, respectively. The orange text and circles represent the training results, and the blue text and triangles represent the testing results. The blue line is 1:1, and the red line is 15% variation.
Figure 2. Relationships between in situ measured and Landsat 8 predicted TN (a), AN (b), and TP (c) for pixel–sample matchups. (15) indicate different machine learning algorithms: SVR, RFR, ANN, RT, and GBM, respectively. The orange text and circles represent the training results, and the blue text and triangles represent the testing results. The blue line is 1:1, and the red line is 15% variation.
Remotesensing 14 05466 g002aRemotesensing 14 05466 g002b
Figure 3. Spatial distributions of mean Landsat 8-monitored TN, AN, and TP concentrations in the whole region of the NRD (All) and three subregions (A, B, and C) in the dry, normal, and wet seasons from 2013 to 2022. Different lowercase letters indicate significant differences at the level of 0.05. I, II, III, IV, and V are the quality standards for surface water from low to high.
Figure 3. Spatial distributions of mean Landsat 8-monitored TN, AN, and TP concentrations in the whole region of the NRD (All) and three subregions (A, B, and C) in the dry, normal, and wet seasons from 2013 to 2022. Different lowercase letters indicate significant differences at the level of 0.05. I, II, III, IV, and V are the quality standards for surface water from low to high.
Remotesensing 14 05466 g003
Figure 4. Time series of mean Landsat 8-monitored TN of the NRD in the whole region (All) and three subregions (A, B, and C) in the normal and wet seasons from 2013 to 2022. III, IV, and V are the quality standards for surface water from low to high.
Figure 4. Time series of mean Landsat 8-monitored TN of the NRD in the whole region (All) and three subregions (A, B, and C) in the normal and wet seasons from 2013 to 2022. III, IV, and V are the quality standards for surface water from low to high.
Remotesensing 14 05466 g004
Figure 5. Time series of mean Landsat 8-monitored AN of the NRD for the whole region (All) and three subregions (A, B, and C) in dry, normal, and wet seasons from 2013 to 2022. I, II, and III are the quality standards for surface water from low to high.
Figure 5. Time series of mean Landsat 8-monitored AN of the NRD for the whole region (All) and three subregions (A, B, and C) in dry, normal, and wet seasons from 2013 to 2022. I, II, and III are the quality standards for surface water from low to high.
Remotesensing 14 05466 g005
Figure 6. Time series of mean Landsat 8-monitored TP of the NRD for the whole region (All) and three subregions (A, B, and C) in dry, normal, and wet seasons from 2013 to 2022. II and III are the quality standards for surface water from low to high.
Figure 6. Time series of mean Landsat 8-monitored TP of the NRD for the whole region (All) and three subregions (A, B, and C) in dry, normal, and wet seasons from 2013 to 2022. II and III are the quality standards for surface water from low to high.
Remotesensing 14 05466 g006
Figure 7. Comparison of prediction performance between machine learning algorithms and previous regression equations for TN (a) and TP (b) in the NRD.
Figure 7. Comparison of prediction performance between machine learning algorithms and previous regression equations for TN (a) and TP (b) in the NRD.
Remotesensing 14 05466 g007
Figure 8. Diurnal cycle of TN (a), AN (b), and TP (c) measured at national cross-sections. Vertical dashed lines at 11:05 represent the Landsat 8 scene center time in the NRD.
Figure 8. Diurnal cycle of TN (a), AN (b), and TP (c) measured at national cross-sections. Vertical dashed lines at 11:05 represent the Landsat 8 scene center time in the NRD.
Remotesensing 14 05466 g008
Table 1. Temporal distribution of Landsat 8 images used in this study.
Table 1. Temporal distribution of Landsat 8 images used in this study.
SeasonDryNormalWetNormalTotal
YearJanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
2013\\\1100001014
20140000000000000
20150000120000104
20160010000000012
20170000000100001
20180000001000001
20190000121010005
20201000110100004
20211000100000013
20220001\\\\\\\\1
Total20125522111325
Table 2. Comparison of different machine learning algorithms for the training and testing datasets.
Table 2. Comparison of different machine learning algorithms for the training and testing datasets.
WQPDatasetModelSlopeInterceptR2pRMSE (mg/L)MAPE (%)
TNTraining
N = 42
SVR0.240.700.50<0.010.4424.39
RFR0.420.620.75<0.010.3427.50
ANN0.970.030.97<0.010.097.70
RT0.210.820.21<0.010.4838.69
GBM0.060.990.31<0.010.5140.20
Testing
N = 18
SVR0.180.730.200.060.4135.05
RFR0.300.700.49<0.010.3333.53
ANN1.040.010.45<0.010.4833.11
RT0.250.710.170.080.4038.32
GBM0.070.970.250.040.4043.59
ANTraining
N = 47
SVR0.210.170.31<0.010.14202.56
RFR0.440.130.66<0.010.10153.12
ANN0.990.000.99<0.010.016.09
RT0.370.140.37<0.010.13169.79
GBM0.220.170.55<0.010.13195.37
Testing
N = 20
SVR0.060.190.070.250.16305.05
RFR0.160.180.240.030.15284.84
ANN0.96−0.050.44<0.010.19318.07
RT0.140.180.170.080.15273.77
GBM0.050.200.130.110.16312.73
TPTraining
N = 42
SVR0.580.040.66<0.010.0359.51
RFR0.600.030.86<0.010.0230.01
ANN0.690.020.69<0.010.0346.23
RT0.230.050.23<0.010.0449.24
GBM0.060.060.46<0.010.0459.89
Testing
N = 18
SVR0.260.050.59<0.010.0452.53
RFR0.140.050.210.060.0448.23
ANN0.600.030.67<0.010.0346.44
RT0.210.050.240.040.0454.84
GBM0.060.060.42<0.010.0564.46
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, N.; Ning, Z.; Chen, M.; Wu, D.; Hao, C.; Zhang, D.; Bai, R.; Liu, H.; Chen, X.; Li, W.; et al. Satellite and Machine Learning Monitoring of Optically Inactive Water Quality Variability in a Tropical River. Remote Sens. 2022, 14, 5466. https://doi.org/10.3390/rs14215466

AMA Style

Li N, Ning Z, Chen M, Wu D, Hao C, Zhang D, Bai R, Liu H, Chen X, Li W, et al. Satellite and Machine Learning Monitoring of Optically Inactive Water Quality Variability in a Tropical River. Remote Sensing. 2022; 14(21):5466. https://doi.org/10.3390/rs14215466

Chicago/Turabian Style

Li, Ning, Ziyu Ning, Miao Chen, Dongming Wu, Chengzhi Hao, Donghui Zhang, Rui Bai, Huiran Liu, Xin Chen, Wei Li, and et al. 2022. "Satellite and Machine Learning Monitoring of Optically Inactive Water Quality Variability in a Tropical River" Remote Sensing 14, no. 21: 5466. https://doi.org/10.3390/rs14215466

APA Style

Li, N., Ning, Z., Chen, M., Wu, D., Hao, C., Zhang, D., Bai, R., Liu, H., Chen, X., Li, W., Zhang, W., Chen, Y., Li, Q., & Zhang, L. (2022). Satellite and Machine Learning Monitoring of Optically Inactive Water Quality Variability in a Tropical River. Remote Sensing, 14(21), 5466. https://doi.org/10.3390/rs14215466

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop