3.2. Algorithm Development
The empirical algorithm was developed using a forward step-wise regression of the values from the two field campaigns (
n = 50). This procedure was conducted since significance F was low and one
p-value was high, then the forward stepwise regression can be used to develop the best model. This procedure was followed by a multiple linear regression of the most correlated bands. The forward step-wise regression (
Table 3) revealed that the spectral Bands 1, 4, 5 and 6 were the most important MODIS channels for the correlation with chl-
a concentrations. Band 1 is located in the red channel related to the chl-
a absorption features. Band 4 is located in the green channel related to the reflectance peak of chl-
a (
Figure 2). Band 5 is usually used for atmospheric correction, because of its proximity to the water absorption feature and also for the cirrus feature. Band 6 is generally used in inland water algorithms, because the water-surface-leaving radiance in this spectral band is insignificant, so this band is less affected by water types and depth. Bands 3 and 2 were excluded, because their inclusion in the analysis did not improve the R
2 value. The reflectance from each of these bands for each sample point pixel were used in a multiple linear regression that retrieved a relationship among MODIS bands and chl-
a concentration, as shown in
Equation (2).
For the semi-empirical model, we evaluated the
Rrs and bandwidth of MODIS (
Figure 2). Because the only spectral bands from MODIS sensor in the visible spectral range are Bands 1, 3 and 4, it was possible to plot them on the
Rrs. Because chl-
a absorbs the blue light peak at approximately 440 nm, this range has been used for Case 1 water algorithms to estimate chl-
a from space. A typical semi-empirical model used in Case 1 waters is the OC4v4 algorithm [
32], which uses the band at 555 nm as the denominator and one of the three bands located near the green region (443, 490, and 510 nm) as the numerator. This algorithm, which was developed for Case 1 waters, is based on the absorption of phytoplankton in the blue region and the reflectance of phytoplankton in the green region. For the Itumbiara Reservoir, the reflectance in the green region was higher than in the near-infrared (NIR). Thus, a typical chl-
a red-NIR algorithm would not work for these waters, which is consistent with Nascimento [
20], who found that in the Itumbiara Reservoir, the total absorption was dominated by the absorption of detritus (60%) and chl-
a (40%). The band ratio between Bands 4 (545–565 nm) in the green region and 3 (459–479 nm) in the blue region was shown to be the most accurate, because of their proximity to the reflectance and absorption peaks of phytoplankton, which is observed in
Figure 2.
The empirical (O14a) and semi-empirical (O14b) algorithms were defined as in
Equations (2) and
(3), respectively:
where
B1,
B2,
B3,
B4,
B5 and
B6 are the reflectance values from MODIS Bands 1, 2, 3, 4, 5 and 6, respectively, and
Chla is the chl-
a concentration.
3.3. Calibration and Validation
Calibration for both models was conducted as described in Section 2.5. Thus, we applied
Equations (2) and
(3) to the MOD09GA product. The model values for each sample point were then used in a linear regression with the measurements of chl-
a concentrations. The first campaign calibration showed that the best result was produced with O14a (R
2 = 0.206). For the second campaign dataset, the best R
2 was also found with O14a (R
2 = 0.065). A mixed dataset combining the other two algorithms was also calibrated and showed the highest R
2 values for both models: 0.223 for the O14a and 0.074 for the O14b. Calibration results are shown in
Table 4, which not only includes R
2 values, but also the adjusted R
2 (Adj. R
2) values, intercept and slope of each linear calibration. The Adj. R
2 presented similar results, confirming the consistency in their performance. Calibration also showed the intercept and slope for each model, and the mixed dataset presented almost perfect intercept and slope values, with an intercept near zero and the slope at almost one. For each separate campaign, the values for the intercept and slope did not have the best performance.
These results were similar to the ones found by Wu
et al. [
15], who judged that the R
2 of multiple linear regressions showed a better performance when compared to a simple linear regression of a single bands or band ratios. The authors attributed the improvement of the multiple linear regression to the fact that it used more bands, which reduced the model’s degree of freedom.
For accuracy assessment, models were validated by applying the calibrated equations from two datasets to the other dataset according to
Equation (4).
Error estimators were calculated according to the equations in
Table 2 and are shown in
Table 5. Shaded areas enhanced the best error estimator for each campaign. For both cases, the use of a mixed calibration equation improved the accuracy of the algorithm. The use of a calibration with a smaller range of chl-
a concentration showed better accuracy results than the one with a large range. Using the May calibration, the RMSE was 42.052 and 36.018% for O14a and O14b, respectively, while the use of the September calibration produced an RMSE of 47.973% and 72.998% for the same algorithms. The use of O14b instead of O14a was appropriated for the second campaign using the May calibration and showed that when the empirical algorithm for the study area was used without changes to the water’s biogeochemical composition, then the empirical algorithm is useful. It was also observed that the use of algorithms with a higher R
2 in the calibration produced the lowest RMSE values; however, the exception was the May calibration for the O14b model in the second campaign, which produced a lower RMSE (36.018%) and a lower R
2 (0.157) compared to the 014a algorithm (R
2 of 0.206 and RMSE of 42.052) for the same calibration.
The use of three different datasets showing different conditions of flow regimes and chl-a concentration allowed us to analyze the calibration that is more accurate for different environmental conditions. For a rising flow regime, in which the mixing process of the water column decreases the chl-a content, the use of a mixed calibration produced better accuracy. For the low flow regime in which the stratification process is dominant, increasing the chl-a concentration and the use of mixed calibration was also more accurate.
Compared to other studies, our error estimators were lower than other studies that also used spectral bands that were inappropriate for water studies. Le
et al. [
33] tested various MODIS band combinations, and their best performance produced an RMSE (%) of 36.5% using bands with better spectral resolution for aquatic studies (Bands 11, 12, 14L and 14H). Because of the difficulties of bio-optical modeling of inland waters, the use of the MODIS 500-m product to estimate chl-
a concentrations produced reasonable errors (
Table 5). This assumption was based on the SeaWiFS program, which is attempting to estimate chl-
a concentrations in open ocean waters that are within a 35% accuracy [
34]. In the case of estuaries, Le
et al. [
33] noted that an RMSE of 39.6% for the red-green ratio algorithms was acceptable. Thus, the use of O14a and O14b with mixed calibrations is acceptable for tropical reservoirs with low chl-
a (varying from zero to 50 μg/L).
We also compared our studies to the chl-
a algorithms implemented in the SeaWiFS Data Analysis System (SeaDAS); however, only two algorithms (OC2 and OC3) were able to retrieve the chl-
a data from the Itumbiara Reservoir. The algorithms were implemented using MODIS 1-km products resampled for 500 m that were atmospherically corrected by the Management Unit of the North Sea Mathematical Models (MUMM) algorithm using its default settings [
35]. This algorithm was chosen because of its application for turbid waters.
Table 6 shows the results of the OC2 and OC3 algorithms for the Itumbiara Reservoir in the September campaign that produced the best results for our model.
Comparing the results for the same campaign, a better performance was observed for both algorithms (O14a and O14b) when compared to the OC2 and OC3. The RMSE (%) for OC2 and OC3 was approximately 37 and 44% respectively, whereas for the same campaign, O14a and O14b produced an RMSE (%) of approximately 30 and 32%.
3.4. Time Series
A time series of chl-
a concentrations of the Itumbiara Reservoir was derived from the O14a algorithm and applied to MOD09GA. The time-series was filtered using HANTS to exclude the pixels with interference by clouds and shadows [
36] and non-imagery days. Alcantara [
37] analyzed the MODIS time series from 2003 to 2008 over the Itumbiara Reservoir and showed that from the 4,380 images, 2,976 images were cloud-free, which means that approximately 68% of the time, the images were useful for the time-series analysis. Non-imagery images occurred every 16 days, and in 2009, there were 23 non-imagery days for MODIS. HANTS was also used to interpolate the remaining data to a daily frequency.
Figure 3 shows the filtered estimated chl-
a concentration for two points of the reservoir of the Itumbiara Reservoir for 2009. The first point is located near the reservoir’s entrance, which is close to the Paranaíba and Corumbá Rivers, and the second point is located near the hydroelectric dam (
Figure 1).
Both points showed low chl-
a concentrations during the dry season (from middle April to the end of August). However, during the dry season, a peak of the estimated chl-
a concentration was observed on 11 May for both points. To evaluate this, we reviewed the meteorological analysis of frontal events in Brazil and observed that among Days 4, 5 and 6 of May, intense winds from the east transported humidity from the ocean to the continent and caused precipitation in several regions of Brazil. The wind disturbance on the surface of the aquatic system can provoke a mixing of the water column [
38]. In the Southern Hemisphere, wind-induced mixing is common during the passage of cold fronts [
39,
40]. After these events, mixing and several stratification processes occur on the water column [
41]. These stability processes provoke disturbances of the thermal stratification, chemical stratification and ecological succession. Ogashawara
et al. [
42] showed that these cold front events could cause mixing in the water column, and a sudden variability of weather types could provoke such a mixing. Tundisi
et al. [
40] proposed a relationship between the stability of the water column and phytoplankton response. Thus, during mixing processes, phytoplankton is not predominant, whereas during the stratification of the water column, it is possible to have pre-bloom conditions.
Therefore, the low frequency of chl-
a peaks during the winter period was also related to external forces, because there were a high number of cold front entrances. Alcântara [
30] showed that during the spring and summer when the heat balance was positive (heat gain), the water column stratifies. However, when the heat balance was negative, the water column exhibits mixing. The main effect of this differential heat and cooling of the water in the reservoir is on its quality. Thus, the response of chl-
a in a time series was affected by external variables, such as precipitation, wind intensity, wind direction and temperature.
Figure 4 shows the meteorological variables during the first two weeks of May used to analyze the estimated chl-
a peak on 11 May. Precipitation, maximum and minimum temperature, atmospheric pressure and wind speed were collected from a meteorological station located near the reservoir. There was a precipitation event on 5 May that occurred after a peak of wind speed on 4 May and with an increase of atmospheric pressure. This instability in the atmosphere occurred until 6 May, when the wind speed decreased along with the atmospheric pressure and the air temperature began to rise. During the high event of chl-
a (showed on
Figure 3), the atmospheric dynamics were stable, which could be related to a stratification process of the water column.
Figure 4 shows that the rain event caused the mixing near the surface layer and reduced the chl-
a concentration. However, after the atmospheric and water column instability, the stable and stratified period enhanced the chl-
a concentration. This relationship was described by Tundisi
et al. [
40], who indicated that a precipitation event might drain the nutrients from an agricultural to aquatic environment. This action increased the pool of nutrients available for phytoplankton growth.
These processes of mixing and stratification of the water column associated with phytoplankton proposed by Tundisi
et al. [
40] were used to analyze the coherence of the estimated chl-
a time series. Thus, a heat balance analysis proposed by Alcântara [
30] to identify the mixing and stratification periods was used to evaluate the estimated chl-
a time-series. This analysis ascribed a positive heat balance (heat gain) to the stratification of the water column and a negative heat balance to a mixing in the water column.
Figure 5 shows the heat balance analysis from March 2009, to February 2010, that was calculated from the data at the Integrated System for Environmental Monitoring (SIMA) installed at the Itumbiara Reservoir [
43,
44] near the dam. In the figure, the estimated chl-
a time series for the same pixels of the SIMA were plotted.
Figure 5 illustrates the importance of mixing and stratification processes on the phytoplankton succession changes of a tropical reservoir and the stability of the water column. During the austral winter, the cold front frequency and wind speed are the main parameters for the mixing processes in the Itumbiara Reservoir [
41]. Because of the entrance of cold fronts, the relationship established by Tundisi
et al. [
40] could be observed, which showed that during cold front dissipation, pre-algal bloom conditions occur; therefore, the entrance of cold fronts could be related to a high concentration of chl-
a. This relationship is observed in
Figure 5, as well; during austral winter, when the frequency of cold fronts is higher, the dissipation of these cold fronts from the water column causes peaks of chl-
a concentrations. Therefore, when the water column is cold and mixed, the peaks of chl-
a could be related to meteorological events promoting the instability of the water column. This same relationship was observed in tropical aquatic systems in Africa, where the algal concentrations in an integrated column before and after strong winds were compared [
45]. This study found that the concentration increased two- to three-fold after strong winds; however, during one particular event, the concentration increased more than five-fold. During stratified periods, it is possible to observe more chl-
a peaks; however, during the maximum stratification at the end of November, chl-
a peaks do not occur, which might be related to the lack of nutrients in the water column, because of the decantation resulting from the strong stratification process.
In situ measurements of chl-a concentration from the Itumbiara Reservoir measured by the Federal University of Juiz de Fora showed that in a field campaign in November 2004, chl-a concentrations varied from 7.17 to 57.91 μg/L from different sampling points. In March of 2005, chl-a concentrations ranged from 1.33 to 143.55 μg/L. Another field campaign in August 2005, produced chl-a concentrations from 11.52 to 40.12 μg/L. Considering the same thermal structure of 2009, the smaller range was found in August, during the mixing period, while the larger range was found in March during the stratification period.
The spatial variability of the estimated chl-
a was analyzed by cropping the reservoir mask from the O14a product and performing a density slice among the chl-
a concentrations in the image. The period analyzed occurred during the mixing period from 2–10 August (
Figure 6), and the influence of cloud cover [
36] on the image of August 10 can be noticed, because most of the reservoir was unclassified because of the negative values. The pixels near the borders were unclassified, which showed that the algorithm worked well in not incorporating the influence from the soil. High concentrations of chl-
a were found in a linear stripe in various parts of the reservoir, because of a noise detector on the Band 5 (1,230∼1,250 nm) image of the Terra MODIS geolocated data, which has severe and typical strips throughout the whole image. Because it can detect cirrus clouds and retrieve water vapor amounts [
46], this spectral band is important, and it is worthwhile to consider it in the algorithm. Thus, the values of this stripe should not be considered in the analysis.
On 2 August, high chl-
a concentration were observed if compared to others images, which is explained by a variation of the weather conditions on 30 July, when wind speed and atmospheric pressure increased and air temperature decreased. These meteorological characteristics showed an instability with a mixing process in the water column, followed by a stratification in the bottom of the reservoir (
Figure 5). This sequence of events [
40] was responsible for the high chl-
a concentrations observed on 2 August. Therefore, these findings suggest that algal blooms have a rhythm that is cross-correlated to meteorological parameters. Time-series analysis (
Figure 3) showed that during spring (September) until mid-autumn (May to June), the monitoring of chl-
a should be the concern of the reservoir management administration. However, because of the high frequency of winter cold front entrances promoting the mixing of the water column, the concentration of chl-
a decreases along with its risks to human health.