Next Article in Journal
Spatio-Temporal Deep Learning-Based Forecasting of Surface Solar Irradiance: Leveraging Satellite Data and Feature Selection
Next Article in Special Issue
Signal Photon Extraction and Classification for ICESat-2 Photon-Counting Lidar in Coastal Areas
Previous Article in Journal
Single-Temporal Sentinel-2 for Analyzing Burned Area Detection Methods: A Study of 14 Cases in Republic of Korea Considering Land Cover
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatio-Temporal Variation of Cyanobacteria Blooms in Taihu Lake Using Multiple Remote Sensing Indices and Machine Learning

1
College of Geography and Remote Sensing, Hohai University, Nanjing 210098, China
2
Jiangsu Province Engineering Research Center of Water Resources and Environment Assessment Using Remote Sensing, Hohai University, Nanjing 211100, China
3
School of Geography, Geology and the Environment, University of Leicester, Leicester LE1 7RH, UK
4
School of Earth Sciences and Engineering, Hohai University, 8 Buddha City West Road, Nanjing 211100, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2024, 16(5), 889; https://doi.org/10.3390/rs16050889
Submission received: 15 January 2024 / Revised: 29 February 2024 / Accepted: 29 February 2024 / Published: 2 March 2024

Abstract

:
In view of the ecological threat posed by cyanobacteria blooms in Taihu Lake (China), this paper presents a study on the area of cyanobacteria extent based on MODIS data using the quantum particle swarm optimization–random forest (QPSO-RF) machine learning algorithm. This paper selects multiple remote sensing input indices that can represent the characteristics of the primary underlying type in Taihu Lake. The proposed method performs best, with an F1 score of 0.91–0.98. Based on this method, the spatio-temporal variation of cyanobacteria blooms in the Taihu Lake complex was analyzed. During 2010–2022, the average area of cyanobacteria blooms in Taihu Lake increased slightly. Severe-scale cyanobacteria blooms occurred in 2015–2019. Cyanobacteria blooms were normally concentrated from May to November. However, the most prolonged extended duration occurred in 2017, lasting for eight months. Spatially, cyanobacteria blooms were mainly identified in the northwestern part of Taihu Lake, with an average occurrence frequency of about 10.0%. The cyanobacteria blooms often began to grow in the northwestern part of the lake and then spread to the Center of the Lake, and also dissipated earliest in the northwestern part of the lake. Our study is also beneficial for monitoring the growth of cyanobacteria blooms in other similar large lakes in long time series.

1. Introduction

Cyanobacteria blooms can cause the deterioration of water quality, endanger human health, and destroy the balance of lake ecosystems [1]. As the water source for one of the most developed regions in China, the water quality of Taihu Lake has been constantly threatened by frequent cyanobacteria blooms since the 1980s [2]. In 2007, the cyanobacteria bloom in Taihu Lake triggered a severe water supply crisis. After more than ten years of sustained high-investment treatment, there is still no significant reduction trend in eutrophication problems, and cyanobacteria blooms still occur with high frequency. Especially in 2017, the cyanobacteria bloom situation was seriously aggravated, which severely challenged the drinking water safety and security of the surrounding cities’ residents [3]. Some studies have found that meteorological conditions have a considerable influence on the occurrence of cyanobacterial blooms in Taihu Lake at multiple time scales, causing cyanobacteria to occur at different time periods every year [4,5]. Before 2007, the largest cyanobacterial bloom in Taihu Lake generally occurred between Jun. and Oct. when temperatures were high [6]. However, after 2007, the largest cyanobacterial bloom in Taihu Lake may occur in any month between May and November, as long as water temperatures and habitats are suitable [3]. Overall, the severity of cyanobacterial blooms in winter is lower than that in the other three seasons. Therefore, it is necessary to quickly and accurately grasp the occurrence patterns of cyanobacteria blooms, which is significant for monitoring, controlling, and preventing them.
Due to the rapid migration and replication speed of cyanobacteria blooms, in situ measurements are usually not suitable for observing the spatio-temporal variations of cyanobacteria blooms [7]. Remote sensing has often been used to monitor the cyanobacteria bloom at a regional scale owing to its high efficiency and broad coverage. In recent years, a series of remote sensing spectral indices were developed from image data to estimate chlorophyll-a (Chla) and recognize cyanobacteria [8,9,10,11,12,13,14,15,16,17]. Based on these remote sensing indices, threshold and machine learning methods were used to extract cyanobacteria blooms. The application of the threshold methods is limited by the selection of thresholds determined by subjective/statistical experience. A series of studies have shown that machine-learning techniques can extract cyanobacteria well [18,19,20,21], including decision trees, random forest (RF), support vector machine (SVM), neural networks, and gradient boosting decision trees (GBDTs). As a convenient model, RF benefits greatly from quick speed, high accuracy, and strong stability [22]. It is widely used to classify land cover, even the extraction of cyanobacteria [23].
The performance of RF methods is greatly affected by their model parameters, and improper parameter selection can lead to underfitting or increased computation time problems. This problem occurs not only in the classification of cyanobacteria, but also in other applications such as land cover classification and crop classification [24,25]. The commonly used parameter-tuning methods include the grid search method and random search method; the former has high accuracy but takes a long time with an extensive range of data, while the latter runs fast but is only suitable for low-dimensional spaces or rough searches [26]. Intelligent optimization algorithms can make the random forest method less likely to fall into a local optimum during the search process and find the global optimal solution with great probability. Particle swarm optimization (PSO) is a data mining optimization method that adjusts meaningful features from the input dataset [27]. However, PSO algorithms have limitations, such as low convergence rates, and tend to fall into local optimization in high-dimensional spaces. At the same time, quantum-particle swarm optimization (QPSO) can deal with these problems. QPSO has the advantages of a few input factors, strong search ability, and easy implementation [28]. It has good performance in solving various optimization problems. In recent decades, QPSO has made great achievements in image research [29,30,31]. Some studies use quantum PSO–random forest (QPSO-RF) to predict borehole spontaneous combustion and lung diseases [32,33]. Specifically, the RF algorithm is suitable for classification, but improper parameter selection can lead to problems, while the QPSO algorithm can optimize the RF algorithm to find the global optimal solution when training the model, and also avoid premature convergence. The combination of the two can achieve more accurate classification, which may perform well in cyanobacterial bloom monitoring.
Remote sensing spectral indices are generally affected by external environments such as water transparency, cyanobacteria bloom concentration, and atmospheric effects. At the same time, the accuracy of RF is greatly affected by its model parameters, and parameter election is crucial [34]. Selecting factors that are inappropriate for the regional environment may affect the accuracy of the extraction, resulting in errors in cyanobacteria bloom identification. To better reveal the spatial and temporal variation of cyanobacteria in Taihu Lake with high accuracy, this paper develops a QPSO-RF method, combines five indices, the Normalized Differential Water Index (NDWI) [35], Water Adjusted Vegetation Index (WAVI) [36], Turbid Water Index (TWI) [7], Floating Algae Index (FAI) [17], and Surface Algal Bloom Index (SABI) [37] as input data, called the Water–Vegetation–Algae (WVA-QPSO-RF) method. This paper considered the turbid water body, aquatic vegetation, and water-floating biomass growth of Taihu Lake when selecting remote sensing input factors, which can present characteristics of the primary underlying type in Taihu Lake. Considering various meteorological, ecological, and environmental conditions in Taihu Lake at the inter-monthly scale, this paper established individual random forest models every month to reduce uncertainty and obtain a model with higher accuracy. Moreover, this method is compared with other methods accordingly. Based on this method, the spatial and temporal variations of cyanobacteria in Taihu Lake are recognized in 2010–2022. Our study is believed to be beneficial for the study of the occurrence and development mechanisms of cyanobacteria blooms in Taihu Lake and similar large lakes worldwide.

2. Methodology

2.1. Study Area

Taihu Lake is located between 30°55′40″~31°32′58″N and 119°52′32″~120°36′10″E, with a total area of 2427.8 km2, a water area of 2338.1 km2, and a shoreline of 393.2 km. It is the third-largest freshwater lake in China [38]. Taihu Lake has a subtropical monsoon climate, with an average annual temperature of 17 °C, an annual precipitation of 1100–1150 mm, an average annual runoff of 7.5 × 1010 m3, and a storage capacity of 4.4 × 1010 m3. Referring to the sub-district boundary of the Taihu Basin Authority of Ministry of Water Resources, we drew lines to divide Taihu Lake into several sections to help the description of the spatial distribution of the blooms (Figure 1); Zhushan Lake (I), Meiliang Lake (II), Gonghu Lake (III), East Taihu Lake (shaded fill), Center of the Lake, Southern Lake, and Western Lake. In this paper, the study area was focused on the entire lake except for East Taihu Lake, which is a hydrophyte distribution area. Hydrophytes can significantly change reflectance characteristics and interfere with cyanobacteria identification [39,40,41].

2.2. Data

2.2.1. MODIS Data

The Moderate-Resolution Imaging Spectroradiometer (MODIS), as a low- and medium-resolution satellite, has 36 bands (0.405~14.385 μm) with spatial resolutions of 250 m, 500 m, and 1 km, and a temporal resolution of 1 day [42]. The MODIS band information used in this paper is shown in Table 1.
MODIS data were derived from the NASA website (https://ladsweb.modaps.eosdis.nasa.gov, accessed on 30 March 2023). MODIS Reproject Tools enables pre-processing of MODIS data, including image format conversion, stitching, and cropping.
From 2010 to 2022, 693 images were manually selected from a total of 4748 images of MYD09 data, considering the need for cloud-free images of the lake area. The selected data are shown in Supplementary Material Table S1.

2.2.2. Sentinel-2 Data

The Sentinel-2 satellite provides high-spatial- and -temporal-resolution multispectral imagery of the world, with 13 bands with resolutions of 10 m, 20 m, and 60 m [43]. Considering the relatively high spatial resolution, the Sentinel-2 image was selected as the validation reference for extracting the cyanobacteria-affected water surface from the MODIS data [44]. The nearest-neighbor resampling algorithm was used to unify the spatial resolution of band 3, band 4, and band 8 of Sentinel-2 images to synthesize pseudo-color images. Based on visual inspection, we verified whether the MODIS image pixels were clear water bodies or cyanobacteria blooms. In order to make the sample representative and make it available for each month, this paper established individual random forest models every month using the available data. The pseudo-color images of the Sentinel-2 data were used for visual interpretation to verify whether the image pixels were clear water bodies or cyanobacteria blooms. Table 2 presents the Sentinel-2 data and MODIS data we used for building the model.

3. Methods

3.1. WVA-QPSO-RF Method

QPSO-RF is an integrated algorithm that selects the best features for classification [33]. The QPSO algorithm is a bionic intelligence computing method based on group intelligence by finding the global optimal solution through collaboration and information sharing among individuals in a group [28]. First, a subset of candidate features is selected and the RF classifier is iteratively trained to optimize the objective function. Then, the iterative search and training continue to obtain a better objective function value than found from the previously selected feature subset. We selected the appropriate predicted factors for QPSO-RF based on the characteristics of the study area. Models based on RF methods can show better performance when the input factors can represent different types of underlying surface covers, as can be proved in many other applications of RF, such as the downscaling of land surface temperature and the classification of underlying surface covers [45,46,47].
Our method combines five indices: the NDWI can better identify water bodies [35], the WAVI captures aquatic vegetation features [36], the TWI avoids interference from high-turbidity waters [7], the FAI allows accurate and comprehensive extraction of cyanobacteria [17], and the SABI detects water-floating biomass [37]. These input factors were chosen to fully consider various water conditions in Taihu Lake, thus making the obtained model more applicable to Taihu Lake. To mitigate any concerns about the validity of this approach, the samples used were evenly split (5400 cyanobacteria and 5400 clear water body points in twelve images), and were considered to be of a sufficient number, spatially distributed across all parts of the study area, and temporally distributed across several input scenes over thirteen years.
To train the model, twelve MODIS and Sentinel-2 images were selected separately, representing each month, random points were generated on the low-resolution image (MODIS), and high-resolution pseudo-color images of Sentinel-2 from close moments of the same day were used for visual interpretation judgments to determine the category of image elements at random points. A total of 900 sample points (450 cyanobacteria and 450 clear water body points) of each image were selected evenly by hand from the random points. The corresponding multifactors at the 600 training sample points (300 cyanobacteria and 300 clear water body points) of each image were brought into the model and trained for MODIS. Then, the trained model was used for cyanobacteria extraction in the entire image. Finally, the remaining 300 sample points (150 cyanobacteria and 150 clear water body points) of each image were used as validation data to verify the accuracy of the extraction results. We constructed twelve models that simulated/represented each month.
The random forest model was built with the following equation.
Model = RF_train(NDWI, WAVI, TWI, FAI, SABI, Value)
Result = RF_class(NDWI, WAVI, TWI, FAI, SABI, Model)
N D W I = ρ G R E E N ρ N I R ρ G R E E N + ρ N I R
W A V I = ( 1 + L ) ρ N I R ρ B L U E ρ N I R + ρ B L U E + L ( L = 0.5 )
T W I = ρ R E D ρ S W I R
F A I = ρ N I R ρ R E D ρ S W I R ρ R E D × λ N I R λ R E D λ S W I R λ R E D
S A B I = ρ N I R ρ R E D ρ B L U E + ρ G R E E N
To evaluate our method, the NDVI threshold method, NDWI threshold method, FAI threshold method [9,17,35], RF (NDVI & NDWI) method, Anomalous Behavior Detection-RF (ABD-RF) method [48], WVA-RF method, and WVA-QPSO-RF-A method were used to extract cyanobacteria to compare with our model. The WVA-RF method used the same remote sensing indices as our method, but only used an RF model. The WVA-QPSO-RF-A method used the same remote sensing indices and QPSO-RF method as our method, but the model covered all months. The ABD-RF method used five remote sensing indices, NDWI, NDVI, SABI, FAI, and SAred−NIR, as input data to build the model.

3.2. Trend Analysis

This paper used the Theil–Sen method [49] and the Mann–Kendall method [50,51] to explore the inter-annual trends of cyanobacteria blooms. First, the Theil–Sen method was used to calculate trend values and the Mann–Kendall method was used to determine the significance of the trend. The significant increase/decrease, no variation, and slight increase/decrease trends were then obtained. The Theil–Sen method is less influenced by outliers in the study of long-term series variation. However, it cannot achieve serial trend significance judgment, while the advantage of the Mann–Kendall method is that the samples do not need to follow a specific distribution rule and are not influenced by a small number of outliers. The introduction of this method can complete the test for significance of the trend of the series. Combining the two methods provides a robust evasive power against the interference of data errors, providing a solid statistical theoretical basis for the significance level test and making the test results more scientific and reliable [52].

3.3. Precision Evaluation Index

In order to evaluate the accuracy of the WVA-QPSO-RF method to extract cyanobacteria, this paper compared it with the threshold methods of NDVI/NDWI/FAI, RF (NDVI & NDWI), ABD-RF, WVA-RF, and WVA-QPSO-RF-A, and the extraction results were evaluated by introducing the confusion matrix to calculate 3 indices: precision, recall, and F1 score. Precision was judged by the prediction results, which is the proportion of correct predictions in samples where the prediction was positive. Recall was judged on the basis of actual samples, which is the proportion of actual positive samples that were predicted correctly as a proportion of the total actual positive samples. F1 score was the harmonic mean of precision and recall.
The formulas for precision, recall, and F1 score are as follows.
p r e c i s i o n = T P T P + F P
r e c a l l = T P T P + F N
F 1   s c o r e = 2 × p r e c i s i o n × r e c a l l p r e c i s i o n + r e c a l l

3.4. Occurrence Frequency of Cyanobacteria Blooms

The occurrence frequency was used to explore the spatial variation patterns of cyanobacteria in Taihu Lake. The occurrence frequency of cyanobacteria blooms was calculated by the following equation.
F = d / D × 100 %
F represents the occurrence frequency of cyanobacteria blooms in Taihu Lake, d represents the number of days of cyanobacteria blooms in each region of Taihu Lake, and D represents the total number of days counted. The probability of cyanobacteria bloom occurring in each pixel within a year is the inter-annual cyanobacteria bloom occurrence frequency. The probability of cyanobacteria bloom in an image element in a month is the intra-year cyanobacteria bloom occurrence frequency.

4. Results

4.1. Comparison of Cyanobacteria Extraction Methods

In this paper, visual recognition of cyanobacteria based on Sentinel-2 data was used as the reference. The accuracy of the WVA-QPSO-RF method was evaluated by comparing it with the result of the NDVI threshold method, NDWI threshold method, FAI threshold method, RF (NDVI & NDWI) method, ABD-RF method, WVA-RF method, and WVA-QPSO-RF-A method on 3 June 2019 (Figure 2).
It can be found that the eight methods differed significantly in the classification of Zhushan Lake (I), Southern Lake, Western Lake, and the Center of the Lake (in color frames). The NDVI, NDWI, and FAI threshold methods all failed to identify some slight cyanobacterial blooms (in purple, yellow, and grey frames). The RF (NDVI & NDWI) method identified some water as cyanobacteria (in grey, red, yellow, and purple color frames). The ABD method identified some water as cyanobacteria (in grey and red frames), and also identified some slight cyanobacteria as water (in purple frames). The WVA-RF method identified some cyanobacteria as water (in grey and yellow frames). The WVA-QPSO-RF-A method identified some water as cyanobacteria (in five color frames). Compared with the other seven methods, the WVA-QPSO-RF method can better identify clear water and cyanobacteria in Taihu Lake, and it can be seen that not only were the severe cyanobacteria extracted accurately, but the slight clear cyanobacteria in the Center of the Lake were also identified.
To further demonstrate the performance of the method in this paper, evaluations were conducted in each month. Table 3 shows the accuracy of the eight methods. Overall, the accuracy of the three RF methods was higher than the three threshold methods. The WVA-QPSO-RF method had the highest F1 score of 0.91–0.98, indicating that the accuracy of the WVA-QPSO-RF method was high. Therefore, the extraction of cyanobacteria using the WVA-QPSO-RF method is better than other methods.

4.2. Temporal Variation Pattern

4.2.1. Changes in Coverage Areas of Cyanobacteria Blooms

Based on the WVA-QPSO-RF method, the cyanobacteria in Taihu Lake were extracted during 2010–2022. The temporal evolution of cyanobacteria coverage in the lake study area calculated from all cloud-free MODIS images with the WVA-QPSO-RF model is shown in Figure 3. The trend line shows that the coverage of cyanobacteria blooms increased slightly during 2010–2022. The coverage of cyanobacteria bloom decreased from 2010 to 2011, generally increased during 2011–2017, slightly decreased, and was more stable from 2020 to 2022, returning to the levels of a decade ago. In 2017, the coverage of cyanobacteria blooms was broader than those existing during the other years. The coverage of cyanobacteria blooms was relatively small in 2011. The broadest coverage of cyanobacteria blooms occurred in May 2017, at about 62.3%.

4.2.2. Changes in the Spatial Areas of Cyanobacteria

As shown in Figure 4, from December to April, the average monthly coverage of cyanobacteria in Taihu Lake was low, below 5.0%. Then, the average coverage of cyanobacteria increased, especially from April to May, and rose significantly faster, reaching a peak of 9.7% in Jun. From May to November, the average and maximum monthly coverage of cyanobacteria in Taihu Lake remained high. It then gradually decreased in December.
As shown in Figure 5, during 2010–2014, the average annual coverage of cyanobacteria in Taihu Lake was below 5.0%, then it increased, and by 2017, it peaked at 11.1%. Later, the average annual coverage of cyanobacteria declined and was below 5.0% by 2020–2022. The maximum coverage of cyanobacteria occurred in 2017, at 62.3%.

4.2.3. Changes in the Occurrence Frequency of Cyanobacteria Blooms

The scales of cyanobacteria blooms in Taihu Lake were graded with reference to Liu [53]. There are four different scales of cyanobacteria blooms in Taihu Lake: small-scale cyanobacteria blooms, medium-scale cyanobacteria blooms, large-scale cyanobacteria blooms, and severe large-scale cyanobacteria blooms. The available data from 2010 to 2022 were counted to determine the days each cyanobacteria bloom scale occurred. The number of days per year that various scales of cyanobacteria occurred in Taihu Lake was counted. The occurrence frequency of each Taihu Lake cyanobacteria bloom scale in each year was calculated to obtain Figure 6.
A total of 693 days of cyanobacteria area results were selected, with various scales of large-scale cyanobacteria blooms accounting for 63.4%, 29.2%, 6.5%, and 1.0%. From 2010 to 2015, small-scale cyanobacteria blooms mainly occurred and no severe-scale cyanobacteria blooms occurred. Severe-scale cyanobacteria blooms began in 2016 and also occurred in 2017 and 2019. As seen from Figure 7, 2017 was the year with the most significant cyanobacteria cover and the highest proportion of large-scale and severely large-scale blooms, consistent with the actual situation [3,14]. Although there was no severe-scale cyanobacteria bloom in 2018, the proportion of medium-scale cyanobacteria blooms was high. Then, the scale of cyanobacteria blooms decreased in 2020–2022.
The duration of large-scale cyanobacteria blooms in Taihu Lake from 2010 to 2022 is shown in Figure 7. It can be seen that most of the large-scale cyanobacteria blooms in Taihu Lake from 2010 to 2022 started in April and May, and ended in November and December. The shortest blooms occurred in 2011 and 2021, which lasted one month. The most prolonged extended duration occurred in 2017, which lasted eight months.

4.3. Spatial Variation Patterns

1.
Spatial distribution of average cyanobacteria blooms occurrence frequency from 2010 to 2022
As shown in Figure 8, cyanobacteria blooms are relatively common in the Western Lake and the coastal areas of Taihu Lake from 2010 to 2022. The average occurrence frequency of cyanobacteria blooms in the northwestern part of Taihu Lake was high, at about 10.0%, exceeding 20.0% in some places. Moreover, the occurrence frequency of cyanobacteria blooms in Zhushan Lake (I), Meiliang Lake (II), and the coastal areas of Gonghu Lake (III) was high, with an average occurrence frequency of more than 5.0%. The occurrence of cyanobacteria blooms in the Center of the Lake and Southern Lake was relatively slight, with an occurrence frequency of almost zero.
In order to further reveal the spatial variation of cyanobacteria in Taihu Lake during 2010–2022, a combination of the Theil–Sen method and the Mann–Kendall method were used to investigate the spatial variation of cyanobacteria in Taihu Lake. The results were classified into five types of changes with reference to previous studies [52]. According to the previous study in this paper, the trend in two phases was analyzed. As shown in Figure 9a, during 2010–2017, there were more areas showing a mild increase in cyanobacteria blooms in Taihu Lake, concentrated in Zhushan Lake (I), Meiliang Lake (II), and Western Lake. There were few areas of mild decrease in cyanobacteria blooms in the coastal area of Gonghu Lake (III). As shown in Figure 9b, during 2017–2022, there were more areas showing a mild decrease in cyanobacteria blooms in Taihu Lake, especially in the Western Lake and Zhushan Lake (I), which shows that Taihu Lake’s eutrophication has improved in these six years.
2.
Changes in the spatial distribution of cyanobacteria bloom occurrence frequency during 2010–2022
As shown in Figure 10, cyanobacteria blooms are frequent in the coastal areas of Taihu Lake every year. The occurrence frequency of cyanobacteria blooms in Zhushan Lake (I) and Western Lake was high, sometimes exceeding 40.0%. In 2015–2019, Taihu Lake had more severe cyanobacteria bloom coverage. In 2015 and 2016, cyanobacteria blooms occurred in most areas of Taihu Lake, and then in 2017, the most widespread year of cyanobacteria blooms in Taihu Lake occurred, covering even whole areas. In 2020–2022, cyanobacteria blooms in the Center of the Lake and the three northern arms of the lake reduced, especially in the coastal areas of Gonghu Lake (III), which had been severe most years before.
3.
Changes in the spatial distribution of cyanobacteria bloom occurrence frequency during the year
As shown in Figure 11, the occurrence frequency of cyanobacteria was low throughout the lake from December to April. In May, cyanobacteria blooms began growing in the Western Lake, and three northern arms of the lake then spread to the Center of the Lake. From May to November, the occurrence frequency of cyanobacteria blooms was high, and sometimes cyanobacteria blooms covered the whole area. In October, the cyanobacteria bloom in the northwestern part of the lake began to dissipate. During the year, the Western Lake showed a high occurrence frequency of cyanobacteria blooms, while the Southern Lake had a relatively low occurrence frequency, and the area with the highest occurrence frequency was in Zhushan Lake (I).

5. Discussion

5.1. Analysis of Cyanobacteria Bloom during 2010–2020

During 2010–2020, the frequency and spread of cyanobacterial blooms have increased. In terms of time, the duration of large-scale blooms has increased, and in some years, they have even occurred during the winter, such as in 2013 and 2017 [6]. In terms of location, the blooms can be found in most areas of Taihu Lake. Since 2015, a large number of lakes in the world have experienced the phenomenon of large-scale and high-intensity growth of cyanobacteria [54]. A possible reason is that variations in meteorological conditions are rapid, which further cause changes in water temperature and thus affect the occurrence of cyanobacterial blooms [4,5]. Specifically, the intensity of blooms in Taihu Lake is positively correlated with the average daily water temperature in winter and early spring, as well as the effective cumulative temperature in the same period [55]. In 2017, there was an abnormally high water temperature during winter and early spring which contributed to the high intensity of the blooms, and it was the year when blooms covered almost the entire lake.
During 2010–2020, it can be found that cyanobacteria blooms were more severe in the northwestern part of Taihu Lake. The northwestern part of Taihu Lake encompasses the Yixing, Wujin, and Wuxi city areas, which are economically developed, densely distributed with industries, and highly intensified with agriculture. A great deal of industrial wastewater, domestic wastewater, and agricultural pollution is discharged into Taihu Lake from these areas [56]. These areas have higher concentrations of nutrients, such as nitrogen and phosphorus in Taihu Lake, serious eutrophication problems, and vigorous algal growth [57,58].

5.2. Applicability of WVA-QPSO-RF Method in Other Lakes

This study aimed to extract cyanobacteria blooms, and the five input factors of NDWI, TWI, FAI, WAVI, and SABI were selected to train our model. NDWI is suitable for clear water, TWI is for turbid water, FAI is used to extract cyanobacteria, WAVI is used to detect aquatic vegetation, and SABI detects water-floating biomass. Waters in Taihu Lake are consistently turbid all year [59], and when large amounts of suspended sediment appear, they may be identified as cyanobacteria, so the TWI index was added to avoid interference in such high-turbidity waters. In Taihu Lake, there is more aquatic vegetation in the Southern Lake, with lower turbidity and few cyanobacteria, and aquatic vegetation can easily be misidentified as cyanobacteria [60]. Using the NDWI, FAI, and SABI indexes can effectively distinguish lake water and cyanobacteria, but they are not effective in distinguishing cyanobacteria and aquatic vegetations [8], so the WAVI index was added to monitor aquatic vegetation. The model in this paper was proposed for the shallow eutrophic lake Taihu Lake, and is also suitable for lakes with similar environments, such as Chaohu Lake [61] However, whether it applies to other types of cyanobacteria-growing lakes remains to be discussed.
For lakes with different underlying surface covers, certain input factors need to be replaced according to the reality of the lakes. For example, the TWI index should not be used in deep lakes or lakes less affected by turbidity [7], such as Lake Erie (in the United States and Canada), Erhai Lake (in southwestern China), and Fuxian Lake (in southwestern China) [53,62,63]. For lakes with macrophytes and cyanobacteria, training samples were needed to be added to distinguish them. When the existing indices in our method could not characterize the macrophytes in the lake, other corresponding remote sensing indices and training samples were needed. For example, Cuiping Lake (in northern China) contains cyanobacteria and much submerged vegetation [64]. Since the existing indices in our model cannot identify submerged vegetation, the submerged vegetation index (SAVI) can be added to our model as an input factor [65]. Then, samples of submerged vegetation can be added to reconstruct the model.
Furthermore, some lakes have high water inflow and significant water changes, and the large discharge takes away more cyanobacteria and is not conducive to the growth of cyanobacteria. Usually, large cyanobacteria blooms are rare in these lakes, such as Schlachtensee (in northeastern Germany), Onondaga Lake (in the northeastern United States), Poyang Lake (in eastern China), and Dongting Lake (in central China) [65,66,67]. Since the FAI index is used to detect surface cyanobacteria scum, it may not be applicable if the cyanobacteria concentration is so low that it cannot form surface scum and mixes in the water column [18]. However, the concentration of cyanobacteria that cannot form surface scum is difficult to determine, and the applicability of our model to such lakes needs further study.

6. Conclusions

Based on MODIS data, a method combining QPSO-RF and multi-factors (NDWI, WAVI, TWI, FAI, and SABI) was developed to extract cyanobacteria in Taihu Lake, called the WVA-QPSO-RF method. It showed better cyanobacteria recognition than the threshold methods and other RF methods. Our method revealed the spatio-temporal variation of cyanobacteria blooms in Taihu Lake during 2010–2022.
Among the methods for extracting cyanobacteria from Taihu Lake suitable for MODIS data, the WVA-QPSO-RF method, NDVI threshold method, NDWI threshold method, FAI threshold method, RF (NDVI & NDWI) method, ABD-RF method, WVA-RF method, and WVA- QPSO -RF-A method were used to compare and analyze the results of cyanobacteria extraction from Taihu Lake. The F1 score reached 0.91–0.98, so it is the most accurate method among the eight for cyanobacteria extraction from MODIS data for Taihu Lake.
During 2010–2022, the average area of cyanobacteria blooms increased slightly. Severe-scale Cyanobacteria blooms occurred in 2015–2019. Annually, in 2017, the coverage of cyanobacteria blooms was broader than those existing during the other years, and the broadest coverage occurred in May 2017, at about 62.3%. The coverage was relatively small in 2011. Monthly, the average and maximum monthly coverage of cyanobacteria in Taihu Lake remained high from May to November. Spatially, cyanobacteria blooms occurred more frequently in the northwestern part of Taihu Lake, with an average occurrence frequency of about 10.0%. The occurrence frequency in the Center of the Lake and Southern Lake was low, with an occurrence frequency of almost zero. The cyanobacteria blooms often began to grow in the northwestern part of the lake and then spread to the Center of the Lake, and dissipation also occurred first in the northwestern part of the lake.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/rs16050889/s1, Table S1: Distribution of MODIS data.

Author Contributions

X.P. wrote this paper and designed the experiments. J.Y. performed the experiments. Z.Y., W.X. and H.S. performed the experiments and performed careful data analysis. Y.W. performed careful data analysis. K.T. and Y.Y. offered invaluable suggestions for the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Nature Science Foundation of China (41701487, 42071346, and 42371397), the State Scholarship Funds of China, and the Royal Society IEC\NSFC\223292—International Exchanges 2022 Cost Share (NSFC) grant of the UK.

Data Availability Statement

The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. The data are not publicly available due to privacy restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhang, W.; Liu, J.; Xiao, Y.; Zhang, Y.; Yu, Y.; Zheng, Z.; Liu, Y.; Li, Q. The Impact of Cyanobacteria Blooms on the Aquatic Environment and Human Health. Toxins 2022, 14, 658. [Google Scholar] [CrossRef]
  2. Yang, S.Q.; Liu, P.W. Strategy of Water Pollution Prevention in Taihu Lake and Its Effects Analysis. J. Gt. Lakes Res. 2010, 36, 150–158. [Google Scholar] [CrossRef]
  3. Jia, T.; Zhang, X.; Dong, R. Long-Term Spatial and Temporal Monitoring of Cyanobacteria Blooms Using MODIS on Google Earth Engine: A Case Study in Taihu Lake. Remote Sens. 2019, 11, 2269. [Google Scholar] [CrossRef]
  4. Qi, L.; Hu, C.; Visser, P.M.; Ma, R. Diurnal Changes of Cyanobacteria Blooms in Taihu Lake as Derived from GOCI Observations. Limnol. Oceanogr. 2018, 63, 1711–1726. [Google Scholar] [CrossRef]
  5. Wang, S.; Zhang, X.; Wang, C.; Chen, N. Temporal Continuous Monitoring of Cyanobacterial Blooms in Lake Taihu at an Hourly Scale Using Machine Learning. Sci. Total Environ. 2023, 857, 159480. [Google Scholar] [CrossRef] [PubMed]
  6. Wu, L.; Zhu, Y.; Zhu, X. Analysis on Influencing Factors of Continuous Cyanobacteria Outbreak in Taihu Lake from 2007 to 2020. Water Resour. Dev. Manag. 2023, 9, 43–49+84. [Google Scholar] [CrossRef]
  7. Liang, Q.; Zhang, Y.; Ma, R.; Loiselle, S.; Li, J.; Hu, M. A MODIS-Based Novel Method to Distinguish Surface Cyanobacterial Scums and Aquatic Macrophytes in Lake Taihu. Remote Sens. 2017, 9, 133. [Google Scholar] [CrossRef]
  8. Feng, L.; Hu, C.; Han, X.; Chen, X.; Qi, L. Long-Term Distribution Patterns of Chlorophyll-a Concentration in China’s Largest Freshwater Lake: MERIS Full-Resolution Observations with a Practical Approach. Remote Sens. 2014, 7, 275–299. [Google Scholar] [CrossRef]
  9. Malahlela, O.E.; Oliphant, T.; Tsoeleng, L.T.; Mhangara, P. Mapping Chlorophyll-a Concentrations in a Cyanobacteria-and Algae-Impacted Vaal Dam Using Landsat 8 OLI Data. S. Afr. J. Sci. 2018, 114, 1–9. [Google Scholar] [CrossRef]
  10. Mishra, S.; Mishra, D.R. Normalized Difference Chlorophyll Index: A Novel Model for Remote Estimation of Chlorophyll-a Concentration in Turbid Productive Waters. Remote Sens. Environ. 2012, 117, 394–406. [Google Scholar] [CrossRef]
  11. Mishra, S.; Stumpf, R.P.; Schaeffer, B.A.; Werdell, P.J.; Loftin, K.A.; Meredith, A. Measurement of Cyanobacterial Bloom Magnitude Using Satellite Remote Sensing. Sci. Rep. 2019, 9, 18310. [Google Scholar] [CrossRef] [PubMed]
  12. Ogashawara, I.; Li, L.; Moreno-Madriñán, M.J. Slope Algorithm to Map Algal Blooms in Inland Waters for Landsat 8/Operational Land Imager Images. J. Appl. Remote Sens. 2017, 11, 012005. [Google Scholar] [CrossRef]
  13. Oyama, Y.; Fukushima, T.; Matsushita, B.; Matsuzaki, H.; Kamiya, K.; Kobinata, H. Monitoring Levels of Cyanobacterial Blooms Using the Visual Cyanobacteria Index (VCI) and Floating Algae Index (FAI). Int. J. Appl. Earth Obs. Geoinform. 2015, 38, 335–348. [Google Scholar] [CrossRef]
  14. Qin, X.; Xia, W.; Hu, X.; Shao, Z. Dynamic Variations of Cyanobacterial Blooms and Their Response to Urban Development and Climate Change in Lake Chaohu Based on Landsat Observations. Environ. Sci. Pollut. Res. 2022, 29, 33152–33166. [Google Scholar] [CrossRef] [PubMed]
  15. Shi, K.; Zhang, Y.; Zhou, Y.; Liu, X.; Zhu, G.; Qin, B.; Gao, G. Long-Term MODIS Observations of Cyanobacterial Dynamics in Lake Taihu: Responses to Nutrient Enrichment and Meteorological Factors. Sci. Rep. 2017, 7, 40326. [Google Scholar] [CrossRef] [PubMed]
  16. Tomlinson, M.C.; Stumpf, R.P.; Wynne, T.T.; Dupuy, D.; Burks, R.; Hendrickson, J.; Fulton III, R.S. Relating Chlorophyll from Cyanobacteria-Dominated Inland Waters to a MERIS Bloom Index. Remote Sens. Lett. 2016, 7, 141–149. [Google Scholar] [CrossRef]
  17. Hu, C. A Novel Ocean Color Index to Detect Floating Algae in the Global Oceans. Remote Sens. Environ. 2009, 113, 2118–2129. [Google Scholar] [CrossRef]
  18. Ananias, P.H.M.; Negri, R.G.; Dias, M.A.; Silva, E.A.; Casaca, W. A Fully Unsupervised Machine Learning Framework for Algal Bloom Forecasting in Inland Waters Using MODIS Time Series and Climatic Products. Remote Sens. 2022, 14, 4283. [Google Scholar] [CrossRef]
  19. Cao, H.; Han, L.; Li, L. A Deep Learning Method for Cyanobacterial Harmful Algae Blooms Prediction in Taihu Lake, China. Harmful Algae 2022, 113, 102189. [Google Scholar] [CrossRef]
  20. Song, Z.; Xu, W.; Dong, H.; Wang, X.; Cao, Y.; Huang, P.; Hou, D.; Wu, Z.; Wang, Z. Research on Cyanobacterial-Bloom Detection Based on Multispectral Imaging and Deep-Learning Method. Sensors 2022, 22, 4571. [Google Scholar] [CrossRef]
  21. Yan, K.; Li, J.; Zhao, H.; Wang, C.; Hong, D.; Du, Y.; Mu, Y.; Tian, B.; Xie, Y.; Yin, Z.; et al. Deep Learning-Based Automatic Extraction of Cyanobacterial Blooms from Sentinel-2 MSI Satellite Data. Remote Sens. 2022, 14, 4763. [Google Scholar] [CrossRef]
  22. Bouslihim, Y.; Kharrou, M.H.; Miftah, A.; Attou, T.; Bouchaou, L.; Chehbouni, A. Comparing Pan-Sharpened Landsat-9 and Sentinel-2 for Land-Use Classification Using Machine Learning Classifiers. J. Geovis. Spat. Anal. 2022, 6, 35. [Google Scholar] [CrossRef]
  23. Pan, X.; Yang, Z.; Yang, Y.; Sun, Y.; Liu, S.; Xie, W.; Li, T. Comparison and Applicability Analysis of Methods for Extracting Cyanobacteria from Lake Taihu Based on GF-6 Data. J. Lake Sci. 2022, 34, 1866–1876. [Google Scholar]
  24. Isaac, E.; Easwarakumar, K.; Isaac, J. Urban Landcover Classification from Multispectral Image Data Using Optimized AdaBoosted Random Forests. Remote Sens. Lett. 2017, 8, 350–359. [Google Scholar] [CrossRef]
  25. Sonobe, R.; Yamaya, Y.; Tani, H.; Wang, X.; Kobayashi, N.; Mochizuki, K. Assessing the Suitability of Data from Sentinel-1A and 2A for Crop Classification. GISci. Remote Sens. 2017, 54, 918–938. [Google Scholar] [CrossRef]
  26. Mantovani, R.G.; Rossi, A.L.D.; Vanschoren, J.; Bischl, B.; De Carvalho, A.C.P.L.F. Effectiveness of Random Search in SVM Hyper-Parameter Tuning. In Proceedings of the 2015 IEEE International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; pp. 1–8. [Google Scholar]
  27. Sheng, X.; Xi, M.; Sun, J.; Xu, W. Quantum-Behaved Particle Swarm Optimization with Novel Adaptive Strategies. J. Algorithms Comput. Technol. 2015, 9, 143–161. [Google Scholar] [CrossRef]
  28. Islam, A.R.M.T.; Saha, A.; Ghose, B.; Pal, S.C.; Chowdhuri, I.; Mallick, J. Landslide Susceptibility Modeling in a Complex Mountainous Region of Sikkim Himalaya Using New Hybrid Data Mining Approach. Geocarto Int. 2022, 37, 9021–9046. [Google Scholar] [CrossRef]
  29. Zhang, C.; Xie, Y.; Liu, D.; Wang, L. Fast Threshold Image Segmentation Based on 2D Fuzzy Fisher and Random Local Optimized QPSO. IEEE Trans. Image Process. 2016, 26, 1355–1362. [Google Scholar] [CrossRef]
  30. Mergin, A.A.; Premi, M.G. Convolutional Neural Networks (CNN) with Quantum-Behaved Particle Swarm Optimization (QPSO)-Based Medical Image Fusion. Int. J. Image Graph. 2022, 2340005. [Google Scholar] [CrossRef]
  31. Xu, X.; Shan, D.; Wang, G.; Jiang, X. Multimodal Medical Image Fusion Using PCNN Optimized by the QPSO Algorithm. Appl. Soft Comput. 2016, 46, 588–595. [Google Scholar] [CrossRef]
  32. Qi, Y.; Xue, K.; Wang, W.; Cui, X.; Liang, R. Prediction Model of Borehole Spontaneous Combustion Based on Machine Learning and Its Application. Fire 2023, 6, 357. [Google Scholar] [CrossRef]
  33. Kim, G.H.J.; Shi, Y.; Yu, W.; Wong, W.K. A Study Design for Statistical Learning Technique to Predict Radiological Progression with an Application of Idiopathic Pulmonary Fibrosis Using Chest CT Images. Contemp. Clin. Trials 2021, 104, 106333. [Google Scholar] [CrossRef]
  34. Zhou, X.; Wen, H.; Zhang, Y.; Xu, J.; Zhang, W. Landslide Susceptibility Mapping Using Hybrid Random Forest with GeoDetector and RFE for Factor Optimization. Geosci. Front. 2021, 12, 101211. [Google Scholar] [CrossRef]
  35. McFeeters, S.K. The Use of the Normalized Difference Water Index (NDWI) in the Delineation of Open Water Features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
  36. Villa, P.; Mousivand, A.; Bresciani, M. Aquatic Vegetation Indices Assessment through Radiative Transfer Modeling and Linear Mixture Simulation. Int. J. Appl. Earth Obs. Geoinform. 2014, 30, 113–127. [Google Scholar] [CrossRef]
  37. Alawadi, F. Detection of Surface Algal Blooms Using the Newly Developed Algorithm Surface Algal Bloom Index (SABI). In Proceedings of the Remote Sensing of the Ocean, Sea Ice, and Large Water Regions 2010, Toulouse, France, 21–23 September 2010; SPIE: Bellingham, WA, USA, 2010; Volume 7825, pp. 45–58. [Google Scholar]
  38. Ma, J.; Li, X.; Ma, S.; Zhang, X.; Li, G.; Yu, Y. Temporal Trends of “Old” and “New” Persistent Halogenated Organic Pollutants in Fish from the Third Largest Freshwater Lake in China during 2011–2018 and the Associated Health Risks. Environ. Pollut. 2020, 267, 115497. [Google Scholar] [CrossRef] [PubMed]
  39. Luo, J.; Li, X.; Ma, R.; Li, F.; Duan, H.; Hu, W.; Qin, B.; Huang, W. Applying Remote Sensing Techniques to Monitoring Seasonal and Interannual Changes of Aquatic Vegetation in Taihu Lake, China. Ecol. Indic. 2016, 60, 503–513. [Google Scholar] [CrossRef]
  40. Xu, J.; Lyu, H.; Xu, X.; Li, Y.; Li, Z.; Lei, S.; Bi, S.; Mu, M.; Du, C.; Zeng, S. Dual Stable Isotope Tracing the Source and Composition of POM during Algae Blooms in a Large and Shallow Eutrophic Lake: All Contributions from Algae? Ecol. Indic. 2019, 102, 599–607. [Google Scholar] [CrossRef]
  41. Yang, X.; Wei, X.; Xu, X.; Zhang, Y.; Li, J.; Wan, J. Characteristics of Dissolved Organic Nitrogen in the Sediments of Six Water Sources in Taihu Lake, China. Int. J. Environ. Res. Public Health 2019, 16, 929. [Google Scholar] [CrossRef] [PubMed]
  42. Swain, R.; Sahoo, B. Mapping of Heavy Metal Pollution in River Water at Daily Time-Scale Using Spatio-Temporal Fusion of MODIS-Aqua and Landsat Satellite Imageries. J. Environ. Manag. 2017, 192, 1–14. [Google Scholar] [CrossRef] [PubMed]
  43. Khaleal, F.M.; El-Bialy, M.Z.; Saleh, G.M.; Lasheen, E.S.R.; Kamar, M.S.; Omar, M.M.; El-Dawy, M.N.; Abdelaal, A. Assessing Environmental and Radiological Impacts and Lithological Mapping of Beryl-Bearing Rocks in Egypt Using High-Resolution Sentinel-2 Remote Sensing Images. Sci. Rep. 2023, 13, 11497. [Google Scholar] [CrossRef]
  44. Mondal, P.; McDermid, S.S.; Qadir, A. A Reporting Framework for Sustainable Development Goal 15: Multi-Scale Monitoring of Forest Degradation Using MODIS, Landsat and Sentinel Data. Remote Sens. Environ. 2020, 237, 111592. [Google Scholar] [CrossRef]
  45. Hutengs, C.; Vohland, M. Downscaling Land Surface Temperatures at Regional Scales with Random Forest Regression. Remote Sens. Environ. 2016, 178, 127–141. [Google Scholar] [CrossRef]
  46. Wu, H.; Li, W. Downscaling Land Surface Temperatures Using a Random Forest Regression Model with Multitype Predictor Variables. IEEE Access 2019, 7, 21904–21916. [Google Scholar] [CrossRef]
  47. Zhao, W.; Duan, S.-B.; Li, A.; Yin, G. A Practical Method for Reducing Terrain Effect on Land Surface Temperature Using Random Forest Regression. Remote Sens. Environ. 2019, 221, 635–649. [Google Scholar] [CrossRef]
  48. Ananias, P.H.M.; Negri, R.G. Anomalous Behaviour Detection Using One-Class Support Vector Machine and Remote Sensing Images: A Case Study of Algal Bloom Occurrence in Inland Waters. Int. J. Digit. Earth 2021, 14, 921–942. [Google Scholar] [CrossRef]
  49. Sen, P.K. Estimates of the Regression Coefficient Based on Kendall’s Tau. J. Am. Stat. Assoc. 1968, 63, 1379–1389. [Google Scholar] [CrossRef]
  50. Kendall, M.G. Rank Correlation Methods; Griffin: London, UK, 1948. [Google Scholar]
  51. Mann, H.B. Nonparametric Tests against Trend. Econom. J. Econom. Soc. 1945, 13, 245–259. [Google Scholar] [CrossRef]
  52. Jiang, W.; Yuan, L.; Wang, W.; Cao, R.; Zhang, Y.; Shen, W. Spatio-Temporal Analysis of Vegetation Variation in the Yellow River Basin. Ecol. Indic. 2015, 51, 117–126. [Google Scholar] [CrossRef]
  53. Liu, L.; Xiang, A.; Feng, Y.; Wei, D.Q.; Yang, H.; Xia, X.S. Cyanobacteria Diversity in Eutrophic Lake of Yunnan, China. Adv. Mater. Res. 2012, 343, 914–919. [Google Scholar] [CrossRef]
  54. Wu, H.; Jia, G.; Xu, B.; Shao, Y.; Zhao, X. Analysis of Variation and Driving Factors of Total Phosphorus in Lake Taihu, 1980–2020. J. Lake Sci. 2021, 33, 974–991. [Google Scholar]
  55. Zhu, G.; Shi, K.; Li, W.; Li, N. Seasonal Forecast Method of Cyanobacterial Bloom Intensity in Eutrophic Lake Taihu, China. J. Lake Sci. 2020, 32, 1421. [Google Scholar] [CrossRef]
  56. Han, X.; Zhu, G.; Xu, H.; Wilhelm, S.W.; Qin, B.; Li, Z. Source Analysis of Urea-N in Lake Taihu during Summer. Environ. Sci. 2014, 35, 2547–2556. [Google Scholar]
  57. Li, S.; Liu, C.; Sun, P.; Ni, T. Response of Cyanobacterial Bloom Risk to Nitrogen and Phosphorus Concentrations in Large Shallow Lakes Determined through Geographical Detector: A Case Study of Taihu Lake, China. Sci. Total Environ. 2022, 816, 151617. [Google Scholar] [CrossRef]
  58. Wang, Y.; Guo, Y.; Zhao, Y.; Wang, L.; Chen, Y.; Yang, L. Spatiotemporal Heterogeneities and Driving Factors of Water Quality and Trophic State of a Typical Urban Shallow Lake (Taihu, China). Environ. Sci. Pollut. Res. 2022, 29, 53831–53843. [Google Scholar] [CrossRef]
  59. Shi, W.; Zhang, Y.; Wang, M. Deriving Total Suspended Matter Concentration from the Near-Infrared-Based Inherent Optical Properties over Turbid Waters: A Case Study in Lake Taihu. Remote Sens. 2018, 10, 333. [Google Scholar] [CrossRef]
  60. Liu, X.; Zhang, Y.; Shi, K.; Zhou, Y.; Tang, X.; Zhu, G.; Qin, B. Mapping Aquatic Vegetation in a Large, Shallow Eutrophic Lake: A Frequency-Based Approach Using Multiple Years of MODIS Data. Remote Sens. 2015, 7, 10295–10320. [Google Scholar] [CrossRef]
  61. Qi, Y.; Hu, S.; Huo, S.; Xi, B.; Zhang, J.; Wang, X. Spatial Distribution and Historical Deposition Behaviors of Perfluoroalkyl Substances (PFASs) in Sediments of Lake Chaohu, a Shallow Eutrophic Lake in Eastern China. Ecol. Indic. 2015, 57, 1–10. [Google Scholar] [CrossRef]
  62. Huang, J.; Xu, Q.; Wang, X.; Ji, H.; Quigley, E.J.; Sharbatmaleki, M.; Li, S.; Xi, B.; Sun, B.; Li, C. Effects of Hydrological and Climatic Variables on Cyanobacterial Blooms in Four Large Shallow Lakes Fed by the Yangtze River. Environ. Sci. Ecotechnol. 2021, 5, 100069. [Google Scholar] [CrossRef] [PubMed]
  63. Lin, S.S.; Shen, S.L.; Zhou, A.; Lyu, H.M. Assessment and Management of Lake Eutrophication: A Case Study in Lake Erhai, China. Sci. Total Environ. 2021, 751, 141618. [Google Scholar] [CrossRef] [PubMed]
  64. Wang, Z.; Xin, C.; Sun, Z.; Luo, J.; Ma, R. Automatic Extraction Method of Aquatic Vegetation Types in Small Shallow Lakes Based on Sentinel-2 Data: A Case Study of Cuiping Lake. Remote Sens. Inform. 2019, 34, 132–141. [Google Scholar]
  65. Fastner, J.; Abella, S.; Litt, A.; Morabito, G.; Vörös, L.; Pálffy, K.; Straile, D.; Kümmerlin, R.; Matthews, D.; Phillips, M.G.; et al. Combating Cyanobacterial Proliferation by Avoiding or Treating Inflows with High P Load—Experiences from Eight Case Studies. Aquat. Ecol. 2016, 50, 367–383. [Google Scholar] [CrossRef]
  66. Liu, X.; Li, Y.L.; Liu, B.G.; Qian, K.M.; Chen, Y.W.; Gao, J.F. Cyanobacteria in the Complex River-Connected Poyang Lake: Horizontal Distribution and Transport. Hydrobiologia 2016, 768, 95–110. [Google Scholar] [CrossRef]
  67. Wang, W.; Yang, P.; Xia, J.; Zhang, S.; Hu, S. Changes in the Water Environment and Its Major Driving Factors in Poyang Lake from 2016 to 2019, China. Environ. Sci. Pollut. Res. 2023, 30, 3182–3196. [Google Scholar] [CrossRef]
Figure 1. Location of study area.
Figure 1. Location of study area.
Remotesensing 16 00889 g001
Figure 2. Sentinel-2 pseudo-color image (a), and comparison between the NDVI method (b), NDWI method (c), FAI method (d), RF (NDVI & NDWI) method (e), ABD-RF method (f), WVA-RF method (g), WVA-QPSO-RF-A method, (h) and WVA-QPSO-RF method (i) of extracting cyanobacteria on 3 June 2019. (I)—Zhushan Lake, (II)—Meiliang Lake, (III)—Gonghu Lake. Different colored wireframes represent areas with inconsistent classification results.
Figure 2. Sentinel-2 pseudo-color image (a), and comparison between the NDVI method (b), NDWI method (c), FAI method (d), RF (NDVI & NDWI) method (e), ABD-RF method (f), WVA-RF method (g), WVA-QPSO-RF-A method, (h) and WVA-QPSO-RF method (i) of extracting cyanobacteria on 3 June 2019. (I)—Zhushan Lake, (II)—Meiliang Lake, (III)—Gonghu Lake. Different colored wireframes represent areas with inconsistent classification results.
Remotesensing 16 00889 g002
Figure 3. Temporal variation of the cyanobacteria areas of Taihu Lake during 2010–2022 using WVA-QPSO-RF model, red line is the trend line.
Figure 3. Temporal variation of the cyanobacteria areas of Taihu Lake during 2010–2022 using WVA-QPSO-RF model, red line is the trend line.
Remotesensing 16 00889 g003
Figure 4. Monthly coverage of cyanobacteria blooms in Taihu Lake (x-axis caption—month in year).
Figure 4. Monthly coverage of cyanobacteria blooms in Taihu Lake (x-axis caption—month in year).
Remotesensing 16 00889 g004
Figure 5. Annual coverage of cyanobacteria blooms in Taihu Lake.
Figure 5. Annual coverage of cyanobacteria blooms in Taihu Lake.
Remotesensing 16 00889 g005
Figure 6. Proportion of various scales of cyanobacteria blooms in Taihu Lake from 2010 to 2022.
Figure 6. Proportion of various scales of cyanobacteria blooms in Taihu Lake from 2010 to 2022.
Remotesensing 16 00889 g006
Figure 7. The duration of the large-scale cyanobacteria blooms in Taihu Lake.
Figure 7. The duration of the large-scale cyanobacteria blooms in Taihu Lake.
Remotesensing 16 00889 g007
Figure 8. Spatial map of average cyanobacteria bloom occurrence frequency in Taihu Lake from 2010 to 2022.
Figure 8. Spatial map of average cyanobacteria bloom occurrence frequency in Taihu Lake from 2010 to 2022.
Remotesensing 16 00889 g008
Figure 9. The trend change distribution of cyanobacteria blooms in Taihu Lake during 2010–2017 (a) and 2017–2022 (b).
Figure 9. The trend change distribution of cyanobacteria blooms in Taihu Lake during 2010–2017 (a) and 2017–2022 (b).
Remotesensing 16 00889 g009
Figure 10. Spatial map of cyanobacteria bloom occurrence frequency in Taihu Lake during 2010–2022.
Figure 10. Spatial map of cyanobacteria bloom occurrence frequency in Taihu Lake during 2010–2022.
Remotesensing 16 00889 g010
Figure 11. Spatial distributions of the occurrence frequency of cyanobacteria blooms in Taihu Lake during the year.
Figure 11. Spatial distributions of the occurrence frequency of cyanobacteria blooms in Taihu Lake during the year.
Remotesensing 16 00889 g011
Table 1. Subset of bands of MODIS data used in this study.
Table 1. Subset of bands of MODIS data used in this study.
BandNameWavelength Range/nmSpatial Resolution/m
1Red620–670500
2Near-Infrared841–876500
3Blue459–479500
4Green545–565500
5Short-Wave Infrared1230–1250500
Table 2. Usage date of Sentinel-2 data and MODIS data.
Table 2. Usage date of Sentinel-2 data and MODIS data.
MonthDate
118 January 2023
228 February 2017
329 March 2022
49 April 2018
524 May 2019
63 June 2019
718 July 2018
812 August 2019
930 September 2021
1010 October 2022
1120 November 2019
1214 December 2022
Table 3. Accuracy evaluation of cyanobacteria extraction results based on MODIS data using eight methods.
Table 3. Accuracy evaluation of cyanobacteria extraction results based on MODIS data using eight methods.
MonthEvaluation IndexNDVI ThresholdNDWI ThresholdFAI ThresholdRF (NDVI & NDWI)ABD-RFWVA-RFWVA-QPSO-RF-AWVA-QPSO-RF
Januaryprecision0.72 0.76 0.77 0.87 0.93 0.90 0.920.95
recall0.85 0.89 0.84 0.89 0.94 0.93 0.930.94
F1 score0.78 0.82 0.80 0.88 0.93 0.91 0.920.95
Februaryprecision0.72 0.70 0.75 0.88 0.92 0.89 0.910.94
recall0.83 0.90 0.86 0.90 0.91 0.92 0.900.93
F1 score0.77 0.79 0.80 0.89 0.92 0.90 0.900.94
Marchprecision0.84 0.65 0.71 0.87 0.94 0.84 0.940.97
recall0.74 0.87 0.65 0.87 0.88 0.96 0.730.94
F1 score0.79 0.74 0.68 0.87 0.91 0.90 0.820.95
Aprilprecision0.63 0.56 0.63 0.88 0.95 0.87 0.940.97
recall0.82 0.88 0.83 0.88 0.91 0.95 0.940.96
F1 score0.71 0.68 0.72 0.88 0.93 0.91 0.940.96
Mayprecision0.53 0.53 0.49 0.61 0.84 0.74 0.820.90
recall0.84 0.56 0.83 0.56 0.79 0.80 0.940.97
F1 score0.65 0.55 0.62 0.58 0.82 0.77 0.880.93
Juneprecision0.93 0.87 0.83 0.91 0.99 0.99 0.930.96
recall0.76 0.76 0.86 0.76 0.86 0.89 0.750.99
F1 score0.84 0.81 0.84 0.83 0.92 0.94 0.830.98
Julyprecision0.68 0.73 0.65 0.77 0.80 0.88 0.930.88
recall0.91 0.82 0.91 0.82 0.89 0.91 0.880.95
F1 score0.78 0.77 0.76 0.79 0.84 0.90 0.900.91
Augustprecision0.86 0.68 0.86 0.82 0.82 0.88 0.760.97
recall0.73 0.87 0.70 0.87 0.77 0.94 0.880.94
F1 score0.79 0.76 0.77 0.85 0.79 0.91 0.820.96
Septemberprecision0.79 0.78 0.60 0.92 0.94 0.95 0.900.96
recall0.94 0.96 0.92 0.96 0.90 0.93 0.960.96
F1 score0.86 0.86 0.73 0.94 0.92 0.94 0.930.96
Octoberprecision0.32 0.26 0.28 0.86 0.90 0.80 0.780.92
recall0.55 0.96 0.67 0.96 0.94 0.74 0.930.96
F1 score0.41 0.41 0.39 0.91 0.92 0.77 0.850.94
Novemberprecision0.53 0.58 0.49 0.72 0.91 0.80 0.830.95
recall0.57 0.60 0.31 0.89 0.95 0.97 0.950.99
F1 score0.55 0.59 0.38 0.80 0.93 0.88 0.880.97
Decemberprecision0.66 0.53 0.60 0.93 0.96 0.98 0.960.98
recall0.97 0.89 0.94 0.89 0.87 0.88 0.930.95
F1 score0.78 0.66 0.73 0.91 0.92 0.93 0.950.97
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pan, X.; Yuan, J.; Yang, Z.; Tansey, K.; Xie, W.; Song, H.; Wu, Y.; Yang, Y. Spatio-Temporal Variation of Cyanobacteria Blooms in Taihu Lake Using Multiple Remote Sensing Indices and Machine Learning. Remote Sens. 2024, 16, 889. https://doi.org/10.3390/rs16050889

AMA Style

Pan X, Yuan J, Yang Z, Tansey K, Xie W, Song H, Wu Y, Yang Y. Spatio-Temporal Variation of Cyanobacteria Blooms in Taihu Lake Using Multiple Remote Sensing Indices and Machine Learning. Remote Sensing. 2024; 16(5):889. https://doi.org/10.3390/rs16050889

Chicago/Turabian Style

Pan, Xin, Jie Yuan, Zi Yang, Kevin Tansey, Wenying Xie, Hao Song, Yuhang Wu, and Yingbao Yang. 2024. "Spatio-Temporal Variation of Cyanobacteria Blooms in Taihu Lake Using Multiple Remote Sensing Indices and Machine Learning" Remote Sensing 16, no. 5: 889. https://doi.org/10.3390/rs16050889

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop