Next Article in Journal
Coin.AI: A Proof-of-Useful-Work Scheme for Blockchain-Based Distributed Deep Learning
Next Article in Special Issue
Decision-Making of Irrigation Scheme for Soybeans in the Huaibei Plain Based on Grey Entropy Weight and Grey Relation–Projection Pursuit
Previous Article in Journal
Structure Extension of Tree-Augmented Naive Bayes
Previous Article in Special Issue
A Method for Improving Controlling Factors Based on Information Fusion for Debris Flow Susceptibility Mapping: A Case Study in Jilin Province, China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of the Entropy Spectral Method for Streamflow and Flood-Affected Area Forecasting in the Brahmaputra River Basin

1
Key Laboratory of Ecosystem Network Observation and Modeling, Chinese Academy of Sciences, Beijing 100101, China
2
Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
3
University of Chinese Academy of Sciences, Beijing 100049, China
*
Authors to whom correspondence should be addressed.
Entropy 2019, 21(8), 722; https://doi.org/10.3390/e21080722
Submission received: 4 June 2019 / Revised: 9 July 2019 / Accepted: 23 July 2019 / Published: 25 July 2019
(This article belongs to the Special Issue Entropy Applications in Environmental and Water Engineering II)

Abstract

:
Reliable streamflow and flood-affected area forecasting is vital for flood control and risk assessment in the Brahmaputra River basin. Based on the satellite remote sensing from four observation sites and ground observation at the Bahadurabad station, the Burg entropy spectral analysis (BESA), the configurational entropy spectral analysis (CESA), maximum likelihood (MLE), ordinary least squares (OLS), and the Yule–Walker (YW) method were developed for the spectral analysis and flood-season streamflow forecasting in the basin. The results indicated that the BESA model had a great advantage in the streamflow forecasting compared with the CESA and other traditional methods. Taking 20% as the allowable error, the forecast passing rate of the BESA model trained by the remote sensing data can reach 93% in flood seasons during 2003–2017, which was significantly higher than that trained by observed streamflow series at the Bahadurabad station. Furthermore, the segmented flood-affected area function with the input of the streamflow forecasted by the BESA model was able to forecast the annual trend of the flood-affected area of rice and tea but needed further improvement in extreme rainfall years. This paper provides a better flood-season streamflow forecasting method for the Brahmaputra River basin, which has the potential to be coupled with hydrological process models to enhance the forecasting accuracy.

1. Introduction

The Brahmaputra River basin in Northeastern India annually suffers from large-scale flooding and a large number of agricultural losses owing to the heavy monsoon rainfall and the low-lying terrain [1]. Timely and reliable forecasting of the flood-affected agricultural area is vital for flood control and risk assessment in the Brahmaputra River Basin. Furthermore, it can also provide guidelines for local farmers and policy makers in the utilization of agricultural water resources and adaptive crop planting. The flood-affected agricultural area can be investigated by employing the spatial distribution of the flood-affected area (FAA), the land use, and the topography of the river’s basin [2,3,4]. The flood-affected area can be indirectly estimated from the forecasting of streamflow series. For long-term streamflow prediction, the most frequently used methods to forecast streamflow are time series analyses such as autoregressive (AR) or autoregressive moving average (ARMA) models, which are based on the observed streamflow derived from hydrological stations or satellite remote sensing sites in the flood season [5,6,7]. However, such models suppose that the streamflow time series is stochastic and linear, which limits their application in terms of the strong seasonality that exists in the monthly streamflow series of the Brahmaputra River [8,9]. Entropy spectral analysis has the advantage of extracting periodic and seasonal change information from time series data [10]. Hence, applying entropy spectral theory to time series analysis may provide a reasonable method for streamflow and flood forecasting.
Burg proposed the concept of information entropy in the frequency domain and calculated it by using the power spectral density function of the time series, which has the property of a probability density so that the maximum Burg entropy can obtain the best estimation of the spectral density function of the time series [11]. The method termed Burg entropy spectral analysis (BESA) has been widely used in flood forecasting [12], streamflow forecasting [13], and groundwater level forecasting [14]. Compared with the traditional spectral analysis methods, BESA has the advantage of less assumptions, a higher resolution, and short sequences analysis [15]. Nonetheless, the spectral estimate obtained by the Burg algorithm sometimes displays line splitting and peak shifting. Given that the timing of the monthly streamflow of the Brahmaputra River has presented multi-peak characteristics [16], which is important for forecasting the agricultural flood-affected area, the maximum configurational entropy spectral method (CESA) is likely to be a substitute for the forecasting of multi-peak spectra series.
CESA was initially proposed by Frieden [17] for image recognition. In the field of time series analysis, CESA performs better than BESA in the determination of the spectral density function in the ARMA model and the MA model but has no practical advantage in the AR model [9]. Cui et al. [18] inversely deduced a CESA-based streamflow forecast model by the spectral density estimation formula constructed by Wu [19] and tested CESA using the monthly streamflow data from 19 river sites in the United States. Testing shows that CESA captures streamflow seasonality, satisfactorily forecasts both high and low flows, and shows a higher resolution than BESA.
The above time series method requires observation to train the model and to calculate the parameters, while observations from a few river sites generally do not meet the needs to estimate the flood-affected area of the whole river basin. Actually, on account of the complex geographical, economic, and political factors in the Brahmaputra River basin, it is impractical to obtain continuous streamflow series with full details from hydrological stations from upstream to downstream [20]. In this regard, microwave remote sensing can be used as an alternative source of surface water information for hydrological applications in the Brahmaputra River basin because of its continual, rapid, and dynamic monitoring of the large-scale environment. Spaceborne passive microwave remote sensing such as AMSR-E (Advanced Microwave Scanning Radiometer for Earth observing system) can provide the approximated river streamflow signals using the techniques proposed by Brakenridge et al. [21], and the applicability of assimilating AMSR-E to improve flood prediction in the sparsely gauged Cubango River basin has been previously demonstrated [7]. Hirpa et al. [22] found that the cross-validation regression model derived from the satellite remote sensing series produces the lower streamflow forecast error for high flows compared with low flows, and it has considerably better skill than ARMA model trained by in situ discharge for the Brahmaputra River. Because of the effect of river geometry, topography, and precipitation spatial scale [22], remote sensing streamflow signals may not share the same patterns and neither do their spectra with in situ observations.
Many studies have reported the application of entropy spectral analysis in monthly streamflow and flood forecasting based on hydrological stations, but these methods have been rarely used in spaceborne passive microwave streamflow signals. Therefore, the main objectives of this paper are (1) to compare the forecast performances of BESA and CESA with traditional methods including maximum likelihood (MLE), ordinary least squares (OLS), and the Yule–Walker (YW) method for monthly streamflow forecasting based on the ground observed data and microwave remote sensing data and (2) to simulate the annual agricultural flood-affected area by forecasting the monthly streamflow in the Brahmaputra River Basin in India.

2. Methods

The research workflow scheme is shown in Figure 1 and methodology of key processes are reviewed in this section. First, BESA, CESA, MLE, OLS, and YW were applied in parameters estimation of AR models to forecast streamflow in this study. Second, we extracted FAA of rice and tea by spatial analysis of the normalized differences water index (NDWI) and agricultural land use data. Lastly, a FAA function was established based on the sub-basin and whole-basin scale analysis and streamflow series forecasted by an optimal model was used to predict the FAA in the Brahmaputra River Basin.

2.1. Streamflow Forecasting Based on Maximum Burg Entropy Spectrum Analysis

As introduced by Burg [11] and Singh [23], maximum Burg entropy spectrum analysis regarded the frequency f of time series as a random variable and defined Burg entropy as
H B ( f ) = W W ln [ P ( f ) ] d f
where W = ½ × Δt is the Nyquist fold-over frequency and Δt is the sampling period. P(f) is a power spectral density function and it is also a probability density function.
Maximum entropy is obtained with constraints formed from the power spectral density function. For a stationary random sequence, the autocorrelation function ρ(n) and the power spectral density function P(f) are Fourier transforms of each other. The constraints of discrete time series can be expressed as
ρ ( n ) = W W P ( f ) e i 2 π f n Δ t d f , N n N
where i is an imaginary number, and n is the time-lag of the autocorrelation function and also represents the n-th harmonic wave of the discrete Fourier transform.
When n = 0, Equation (2) reduces to
1 = W W P ( f ) d f
In order to solve the power spectral density function corresponding to the maximum Burg entropy, the Lagrange multiplier method is used to construct the Lagrange function. The Lagrange function based on the Burg entropy is expressed as
L B ( f ) = W W ln [ P ( f ) ] d f n = N N λ n [ W W P ( f ) e i 2 π f n Δ t d f ρ ( n ) ]
where λn is the Lagrange multiplier.
The partial differential of the Lagrange function LB(f) for P(f) is calculated and made equal to 0 so that the power spectral density function PB(f) corresponding to the maximum Burg entropy is obtained:
P B ( f ) = 1 n = N N λ n exp ( i 2 π f n Δ t )
The form of power spectral density is equivalent to that of an autoregressive model when PB(f) is transformed as
P B ( f ) = σ m 2 [ 1 + n = 1 m a m ( n ) exp ( i 2 π f n Δ t ) ] 2
In the formula, m is the order of the model and σ m 2 is the variance of the m-order model. If the mean value of the input time series is 0, σ m 2 is equal to the output power.
Burg proposed a recursive method to solve the autoregressive model by using the reflection coefficient. The estimation formula of the i-th parameter ak(i) of the k-order autoregressive model is
a k ( i ) = { a k 1 ( i ) + k k a k 1 ( k i ) ; i = 1 , 2 , , k 1 k k ; i = k
According to the objective function of minimum power of forward prediction error and backward prediction error, the reflection coefficient kk of the model can be derived as follows:
k k = 2 i = k N 1 e k 1 f ( i ) e k 1 b ( i 1 ) i = k N 1 | e k 1 f ( i ) | 2 + i = k N 1 | e k 1 b ( i 1 ) | 2
e k f ( i ) = e k 1 f ( i ) + k k e k 1 b ( i 1 )
e k b ( i ) = e k 1 b ( i 1 ) + k k e k 1 f ( i )
The best order m is determined by Akaike information criterion (AIC) criteria, and the power spectral density function based on BESA can be obtained by substituting {am(1), am(2), …, am(m)} into Equation (6). The m-order autoregressive model becomes
ρ ( n ) = k = 1 m a m ( k ) ρ ( n k )
Therefore, the autoregressive model can be applied to monthly streamflow forecasting as below:
x ( t ) = k = 1 m a m ( k ) x ( t k ) , t > N

2.2. Streamflow Forecasting Based on Configurational Entropy Spectrum Analysis

Configurational entropy and Shannon entropy are similar in definition [24]. As with Burg entropy, configurational entropy takes the frequency of streamflow time series as the independent variable:
H C ( f ) = W W P ( f ) ln [ P ( f ) ] d f
where W = ½ × Δt is the Nyquist fold-over frequency and Δt is the sampling period.
The Lagrange multiplier method is also used to construct the Lagrange function based on the configurational entropy:
L C ( f ) = W W P ( f ) ln [ P ( f ) ] d f n = N N λ n [ W W P ( f ) e i 2 π f n Δ t d f ρ ( n ) ]
where λn is the Lagrange multiplier.
To obtain the spectral density maximizing the entropy, the partial differential of the Lagrange function LC(f) for P(f) is calculated and made equal to 0 so that the power spectral density function PC(f) corresponding to the configurational entropy is obtained:
P C ( f ) = exp ( 1 n = N N λ n e i 2 π f n Δ t )
The form of the spectral density function of maximum configurational entropy is an exponential function, which is different from the form of the spectral density function of the maximum Burg entropy. Wu [19] applied cepstrum analysis to the solution of the Lagrange multiplier and extension of the autocorrelation function and proposed the explicit solution of the spectral density function based on the maximum configurational entropy. The inverse Fourier transform of the logarithmic solution of Equation (14) can be obtained as
W W { 1 + ln [ P ( f ) ] } exp ( i 2 π k Δ t ) d f = W W [ n = N N λ n exp ( i 2 π f n Δ t ) ] exp ( i 2 π f k Δ t ) d f
where k is the k-th harmonic of the inverse discrete Fourier transform. Defining
e ( k ) = W W ln [ P ( f ) ] exp ( i 2 π f k Δ t ) d f
The Equation (18) can be simplified as
δ k + e ( k ) = n = N N λ n δ k n
where δk is a unit pulse function.
For k with different values, a set of equations for solving Lagrange multipliers can be obtained:
λ 0 = e ( 0 ) 1 λ 1 = e ( 1 ) λ N = e ( N )
In order to solve the spectral density function conveniently, Nadeu [25] proposed a simple method for cepstrum e(n) estimation. Its principle is to minimize the difference between the estimated spectral density and prior information. When −N ≤ n ≤ N, the cepstrum formula is expressed as
e ( n ) = { 2 [ ρ ( n ) k = 1 n 1 k n e ( k ) ρ ( n k ) ] ; n > 0 0 ; n 0
The autocorrelation coefficient is deduced from cepstrum:
ρ ( n ) = e ( n ) 2 + k = 1 n 1 k n e ( k ) ρ ( n k )
If the order of an autoregressive model is determined to be m, the autocorrelation coefficient ρ(m) is
ρ ( m ) = k = 1 m a m ( k ) ρ ( m k ) , m N
a m ( k ) = k m e ( k ) , k < m
The autoregressive coefficient is extended to forecast streamflow:
ρ ( t ) = k = 1 m a m ( k ) ρ ( t k ) , t > T
where T is the total length of streamflow series. Therefore, the forecast model of streamflow x(t) can be expressed as
x ( t ) = k = 1 m a k x ( t k ) , t > T

2.3. Streamflow Forecasting Based on MLE, OLS, and YW

Principles of parameter estimation methods including MLE, OLS, and YW are briefly reviewed in this sub-section. MLE is a method to estimate the parameter θ by maximizing the likelihood function L(θ;x), so that the observed data is most probable under the assumed statistical model. To maximize L(θ;x) with respect to θ:(1) first calculate the derivative of L(θ;x) with respect to θ, (2) set the derivative equal to zero, and (3) solve the resulting equation for θ. Contrary to MLE, OLS chooses the parameter θ of a set of explanatory variables by minimizing the sum of the squares of the differences between the observed streamflow in the given dataset and those predicted by the linear function.
The Yule–Walker estimation is based on the Yule–Walker equation of the AR model [26,27,28]. The specific steps are as follows:
ρ ( m ) = k = 1 p a ( k ) ρ ( m k ) + σ e 2 δ m , 0
where m = 1, 2, …, p, yielding p = 1 equations and a(k) are parameters of explanatory variables. Here σ e is the standard deviation of the input noise process, and δ m , 0 is the Kronecker delta function.
Because the last part of an individual equation is non-zero only if m = 0, the set of equations can be solved by representing the equations for m > 0 in matrix form, thus getting the equation
[ ρ ( 1 ) ρ ( 2 ) ρ ( 3 ) ρ ( p ) ] = [ ρ ( 0 ) ρ ( 1 ) ρ ( 2 ) ρ ( 1 ) ρ ( 0 ) ρ ( 1 ) ρ ( 2 ) ρ ( 1 ) ρ ( 0 ) ρ ( p 1 ) ρ ( p 2 ) ρ ( p 3 ) ] [ a ( 1 ) a ( 2 ) a ( 3 ) a ( p ) ]
which can be solved for all {a(m); m = 1, 2, …, p}. The remaining equation for m = 0 is
ρ ( 0 ) = k = 1 p a ( k ) ρ ( k ) + σ e 2
in which, once {a(m); m = 1, 2, …, p} are known, can be solved for σ e 2 .

2.4. Determination of the Order of the Autoregressive Model

This paper used the Akaike information criterion (AIC) to determine the order m [29]:
AIC = N ln σ ^ 2 + m
where N is the total length of the training series and σ ^ 2 is the variance of the residual of the model simulation.

2.5. Flood-Affected Area Estimation

This paper estimated the flood-affected area by subtracting the spatial distribution of flood inundated pixels in the dry season from that in the flood season, as shown in Equation (27).
Sfaa = Sflood − Sdry
Flood-inundated pixels were identified by the normalized differences water index (NDWI) [30] extracted from remote sensing data:
N D W I = G N I R G + N I R
where G and NIR are the green light and near-infrared reflectance of ground objects, respectively.
The flood-affected area of rice and tea plantations in the basin was obtained by spatial overlay analysis of the flood-affected area and harvest area of rice and tea in the flood season, and it was estimated by single variable regression between the flood-affected area and the flood-season streamflow based on the microwave remote sensing data.
Evaluation indicators used for the forecast accuracy of streamflow and the flood-affected area in this paper were the Pearson correlation coefficient (R), root mean square error (RMSE), and Nash–Sutcliffe efficiency coefficient (NSE), which can be expressed as
R = t = 1 n ( x ( t ) x ¯ ) ( f ( t ) f ¯ ) [ t = 1 n ( x ( t ) x ¯ ) 2 t = 1 n ( f ( t ) f ¯ ) 2 ] 0.5
R M S E = t = 1 n [ x ( t ) f ( t ) ] 2 n 1
N S E = 1 t = 1 n | x ( t ) f ( t ) | 2 t = 1 n | x ( t ) x ¯ | 2
where x represents the average value of observed streamflow x(t), f ¯ represents the average value of the forecasted streamflow f(t), and n is the forecasting period.

3. Application

3.1. Data Sources and Preprocessing

The Brahmaputra River, which originates from the Yarlung–Zangbo River in China, has a total length of 2880 km, and 34% of the basin area is located in India. The climate in the basin can be divided into a dry season (November–April) and a monsoon season (May–October), and precipitation in the monsoon season accounts for more than 85% of the total annual precipitation. The flood season of the Brahmaputra River is between July and October. Rice and tea are the main food crops and cash crops in the Brahmaputra Basin. According to the MAPSPAM2010 dataset, the physical area of rice and tea plantations are 2.33 and 0.38 million hectares respectively, and most plantation areas are located in the Brahmaputra Valley in the Assam state (Figure 2).
In this paper, the spatial distribution of the rice and tea harvest area in Northeast India was obtained from the MAPSPAM2010 database and annual flood-inundated areas during 2001–2017 in the basin were extracted from the Moderate Resolution Imaging Spectroradiometer (MODIS) data. Ground observed streamflow data from the Bahadurabad station (located 25.15° N/89.70° E) in Bangladesh, which controls the whole area of the Brahmaputra River basin in India, and satellite streamflow measurement data from four sites along the river (located 25.51° N/89.78° E, 26.32° N/92.03° E, 27.13° N/94.55° E, and 27.76° N/95.54° E, respectively) were selected to compare the performance of different models. Satellite streamflow data was derived from the River and Reservoir Watch Version 3.5 dataset, which is based on the Advanced Microwave Scanning Radiometer - Earth Observing System (AMSR-E), the Tropical Rainfall Measuring Mission (TRMM), Advanced Microwave Scanning Radiometer-2 (AMSR-2), and Global Precipitation Measurement (GPM) sensors data [31].
Streamflow data from the Bahadurabad station and remote sensing were processed on the monthly scale and raster datasets of the harvest area were interpolated with a spatial resolution of 250 m. Given that the input data of autocorrelation models should be a standardized stationary random series, the streamflow series were standardized and transformed using the Box–Cox method. Locations of the Bahadurabad station and satellite river discharge measurement sites are shown in Figure 2. Input data types and sources are shown in Table 1.

3.2. Historical Streamflow and Agricultural Flood-Affected Area in the Brahmaputra River Basin

In order to determine the streamflow indicator suitable for forecasting the agricultural flood-affected area in the Brahmaputra River basin, observed historical annual maximum monthly streamflow, maximum daily streamflow, maximum seven-day streamflow, total flood streamflow, annual streamflow, and mean streamflow in the flood season (July–October) from 2001 to 2017 were analyzed. The result showed that the mean streamflow in the flood season has the strongest correlation with the agricultural flood-affected area.
Inter-annual patterns of mean streamflow in the flood season at different sites and flood-affected areas (FAA) in the Brahmaputra basin are shown in Figure 3. Streamflow in the flood season at five stations during 2001–2017 increased progressively from upstream sites to downstream sites, which ranged from 4411 m3/s at Satellite-4 to 36,797 m3/s at Bahadurabad, and the average flood-season streamflow of the four satellite sites was 14,782 m3/s. According to MODIS data, the maximum annual flood-affected area of rice (FAArice) and tea (FAAtea) during 2001~2017 in Northeast India occurred in 2004, which was 115,685 ha and 10,074 ha respectively.
We divided the Brahmaputra basin in Northeast India into four sub-basins based on the locations of streamflow observation sites to calculate the correlation between FAA and mainstream flow (Figure 4). Owing to the impact of both local and upstream runoff to mainstream flood, the observation site satellite-1 controls the streamflow of sub-basins I + II + III + IV, satellite-2 controls sub-basins II + III + IV, satellite-3 controls sub-basins III + IV, and satellite-4 controls the sub-basin IV. Pearson correlation coefficients between the flood-season streamflow and FAA at the sub-basin scale are shown in Table 2 and at the whole-basin scale are shown in Table 3. At the sub-basin scale, the streamflow from satellite-1 and satellite-3 both has a significant correlation with FAArice (0.667 and 0.691, p < 0.01) and FAAtea (0.671 and 0.690, p < 0.01) in corresponding sub-regions. At the whole-basin scale, the average streamflow of the four satellite sites has a higher correlation coefficient (0.798 with FAArice and 0.747 with FAAtea, p < 0.01) than the individual satellite observation site. Compared with the satellite multi-site observation along the river, the streamflow at Bahadurabad shows less correlation with the annual flood-affected area (0.494 with FAArice and 0.486 with FAAtea, p < 0.05). Beyond that, determining the coefficient of multivariate regression between FAA and four satellite-derived streamflow series based on sub-basin analysis is lower than that between FAA and mean streamflow series based on the whole-basin analysis. Thus, we chose mean streamflow series of four satellite sites rather than individual sites to test entropy spectral models and forecast FAA in the Brahmaputra basin.

3.3. Monthly Streamflow Forecasting

In order to examine whether the entropy spectral method will improve the forecasting precision of flood-season streamflow in the Brahmaputra River basin, we used BESA, CESA, MLE, OLS, and YW methods to forecast the monthly streamflow for selecting the optimal model and compared the model training effects of remote sensing data and hydrological observation series. The average streamflow of the four satellite sites along the Brahmaputra River and monthly streamflow at Bahadurabad hydrological station were chosen as the training data of AR models and the best order was selected under Akaike information criterion for all forecasting experiments in the paper.
Before calibrating the parameters of the AR model, the optimal training period was determined. In this paper, we selected the observed streamflow data from the length of 36 months to 120 months as a training period to evaluate the influence of the training period on the forecasted results. According to the calculation results, when the training period is less than 60 months, the NSE of the forecasting models using BESA/CESA/traditional methods is less than 0.72, and the NSE of models tends to be stable with the increase of training period (Figure 5). Therefore, the streamflow of 12 months per year are forecasted by the previous 60-month period in AR models with feedback in this paper. The forecast results of mean monthly streamflow at the Brahmaputra River based on microwave remote sensing series data during 2003–2017 is shown in Figure 6 and Table 4. Streamflow at Bahadurabad forecasted using hydrological station series data during 2003–2017 is shown in Figure 7 and Table 5.
At the monthly scale from January to December, we found that the rank of forecast accuracy with the evaluation criteria of R, RMSE, and NSE was in the order BESA > traditional methods (MLE, OLS, and YW) > CESA for satellite sites and the Bahadurabad station by comparing the monthly streamflow from the five models. Furthermore, the streamflow forecasting accuracy for Bahadurabad station by the five models was higher than the upstream four satellite observation sites with the evaluation criteria of R and NSE. The reason for this is that streamflow signals derived from remote sensing were disturbed by external factors such as irrigation in the dry season, which made remote sensing signals show weaker autocorrelation than the streamflow measured at the hydrological station.
At the flood-season scale, on the other hand, it can be discovered that the models trained with Bahadurabad station data overestimated the flood-season streamflow in August and September of 2003 and 2005, resulting in more outliers and a lower accuracy than the models trained by remote sensing data (Figure 8). Taking 20% of the measured streamflow as the allowable error, the average passing rate of the flood-season streamflow forecasting was 65% at Bahadurabad station and was 76% at the satellite sites. Among all the methods, the BESA method had the highest flood-season streamflow forecast passing rate, which reached 73% and 93% when using the Bahadurabad station data and remote sensing data training, respectively. The forecast passing rate of CESA, MLE, OLS, and YW methods was 60%–67% at the Bahadurabad station and 60%–80% at the satellite sites.
In summary, the monthly streamflow forecast accuracy of the BESA method is significantly higher than the traditional methods, regardless of whether it is on a monthly scale or flood-season scale. However, there is no significant difference between the CESA and traditional methods in the forecast accuracy of streamflow at the Brahmaputra River. At the monthly scale from January to December, the models trained by the Bahadurabad station perform better than those trained by microwave remote sensing series. Nevertheless, at the flood-season scale, the models trained by the Bahadurabad station perform worse than those trained by remote sensing series. Given that remote sensing series of streamflow in flood season has a stronger correlation with the flood-affected area, we selected forecasting results from the BESA model based on the remote sensing observation sites during the flood season to forecast the agricultural flood-affected area in the Brahmaputra River basin.

3.4. Flood-Affected Area Functions and Forecasting

In order to convert the forecasted flood-season streamflow into the agricultural FAA, regression functions between monthly mean streamflow in flood seasons and the flood-affected area of rice and tea were established based on remote sensing data from 2001 to 2017. The calibration period is 2001–2007 and the validation period is 2008–2017.
We first tested three forms of the flood-affected area functions according to a previous study: linear, exponential, and logistic [1]. The results showed that the R2 value of the fitted linear function for the flood-affected area of rice and tea is 0.718 and 0.633 respectively; the fitted exponential curve is very close to the fitted logistic function curve, hence their R2 values for the FAA of rice and tea are 0.773 and 0.647, respectively. Second, considering that there is an abrupt change point in the variation curve of the agricultural flood-affected area with increased streamflow, the segmented function form was also selected to fit the agricultural flood-affected area in the Brahmaputra River basin. The fitted segmented equations for the flood-affected area of rice and tea are shown in Figure 9, where X is the mean streamflow in the flood season and FAArice and FAAtea refer to the flood-affected area of rice and tea, respectively. From Figure 5, we can see that when the average streamflow in the flood season was lower than 11,218 m3/s, FAArice did not change with the streamflow. When the flood-season streamflow was higher than 11,218 m3/s, FAArice increased rapidly with the increase of streamflow. The FAAtea curve was similar to the FAArice curve, but it had a higher abrupt change point with the flood-season streamflow. The R2 values of the fitted segmented functions are 0.819 and 0.824, respectively, significantly higher than the linear, exponential, and logistic functions. Therefore, we used the segmented functions to forecast the flood-affected area of rice and tea in the Brahmaputra River basin during 2008–2017.
The validation results showed a significant linear correlation between the forecasted and observed FAA (R = 0.60, p < 0.01) during 2008–2017, which indicated that the FAA function can basically estimate the annual trend of the FAA of rice and tea based on the flood-season streamflow. The absolute relative error was 20.12% of forecast FAArice and 26.45% of forecast FAAtea. As a whole, the regression function tends to overestimate the FAA in a dry year and underestimate the FAA in a flood year, especially for tea, which needs further improvement for forecasting the FAA in extreme rainfall years.

4. Discussion

Previous studies have manifested that BESA and CESA can both accurately forecast monthly streamflow, and CESA showed a better performance than BESA when fitting streamflow at Iowa River [10], Greenbrier River, Upper Colorado River, and Green River in the United States [18] and rivers in Northwest China [9]. Nevertheless, we found that BESA performed better than CESA in the Brahmaputra River basin. There are three reasons for the results. First, the streamflow series at the Brahmaputra River during 2001–2017 exhibited a strong autocorrelation (R was 0.71 at the 12th lag at the Bahadurabad station) and a previous study discovered that BESA had better forecast results for the streamflow data, with a strong autocorrelation coefficient of more than 0.60 at the lag of 12 months [8]. Second, the order of the CESA model (m = 20) determined by AIC was larger than BESA and other traditional models (13 ≤ m ≤ 16), which will introduce more uncertainties of streamflow signal and induce over-fit of the CESA model. Third, the temporal distribution of the streamflow at the Brahmaputra River showed bi-modal characteristics at a monthly scale from 2010 to 2017. The model results support the view that the BESA model can more accurately forecast the bi-modal values of the flood season for the lead time [8]. In addition, CESA can be derived with spectral power (CESAS) or frequency (CESAF) as a random variable and it was found that CESAS performed very similarly with BESA in Minnesota River, Upper Mississippi River, Des Moines River, and Illinois River [13]. Performance differences between CESAS and CESAF in Brahmaputra River is not discussed in this paper and need a further study in future.
In this study, the relative coarse spatial resolution (250 m) but temporally dense time series (8-day composition) of inundation information derived from the MODIS data allowed for the revelation of inter-annual FAA dynamics and overall inundation patterns for large areas. Nevertheless, discrepancies between the MODIS derived and the actual inundation areas exist in the boundary regions of the water bodies due to the mixed pixel effects. Kuenzer et al. found that the accuracy of inundation patterns derived from the MODIS data ranged between 79% and 99% in the Ganges–Brahmaputra Delta [35], which may induce errors of flood area estimation. Moreover, the length of the accessible MODIS data is less than 20 years, which maybe not enough for regression analysis. Additionally, the fitted flood-affected area function in the paper can hardly represent the whole basin in the context of climate change and land cover change. Strong increasing trends were predicted for the total water yield and streamflow at the Brahmaputra River basin, indicating the possibility of the sudden change of the streamflow temporal pattern and exacerbation of flooding potential in the future [32,36], while regression models trained with historical observed data are not able to simulate these variations. Therefore, the models developed in the paper are suitable for medium- or short-term forecasts and should be coupled with hydrological process models to prolong the forecast period.

5. Conclusions

From the perspective of the river basin planning and management, a reliable streamflow forecast is critical for estimating the agricultural loss due to floods. Using remote sensing data from four observation sites and hydrological station data, we developed the BESA, CESA, and other traditional models for the spectral analysis and flood-season streamflow forecasting of the Brahmaputra River basin. The results indicated that forecast models trained by microwave remote sensing series performed better in the flood season than those trained by hydrological station data at Bahadurabad. The streamflow forecast accuracy of the BESA model was significantly higher than the traditional methods, while the CESA model did not show any advantage in terms of the forecast accuracy of streamflow at the Brahmaputra River, regardless of whether a monthly scale or flood-season scale was being considered. Taking 20% of the measured flood-season streamflow as the allowable error, the forecast passing rate of the BESA model trained by the remote sensing data reached 93% during 2003–2017, which met the basic needs of flood control and risk assessment in the Brahmaputra River basin.
Moreover, the segmented flood-affected area (FAA) function and the streamflow forecast output from the BESA model were utilized to forecast the FAA of rice and tea in the basin during 2008–2017. The prediction results showed that the FAA function was able to forecast the annual trend of the flood-affected area of rice and tea and the absolute relative error of the forecast flood-affected area of rice and tea was 20.12% and 26.45%, respectively. In future research, the streamflow forecast model trained by the microwave remote sensing series with the BESA method can be coupled with hydrological process models to prolong the forecast period and enhance the forecast accuracy of the agricultural loss due to floods.

Author Contributions

X.W. designed the study, developed the models, and wrote the manuscript. S.W. collected and preprocessed the data and reviewed and edited the manuscript. H.C. wrote the MATLAB code for CESA and BESA models and analyzed the results. All the authors have read and approved the final manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Project No. 31,861,143,015, Project No. 51,809,251) and the Key Program of the Chinese Academy of Sciences (Project No. ZDRW-ZS-2016-6, Project No. KGFZD-135-17-009).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yang, Y.E.; Ray, P.A.; Brown, C.M.; Khalil, A.F.; Winston, H.Y. Estimation of flood damage functions for river basin planning: A case study in Bangladesh. Nat. Hazards 2015, 75, 2773–2791. [Google Scholar] [CrossRef]
  2. Posthumus, H.; Hewett, C.; Morris, J.; Quinn, P. Agricultural land use and flood risk management: Engaging with stakeholders in North Yorkshire. Agric. Water Manag. 2008, 95, 787–798. [Google Scholar] [CrossRef]
  3. Tapia-Silva, F.-O.; Itzerott, S.; Foerster, S.; Kuhlmann, B.; Kreibich, H. Estimation of flood losses to agricultural crops using remote sensing. Phys. Chem. Earth 2011, 36, 253–265. [Google Scholar] [CrossRef]
  4. Son, N.T.; Chen, C.F.; Chen, C.R.; Chang, L.Y. Satellite-based investigation of flood-affected rice cultivation areas in Chao Phraya River Delta, Thailand. ISPRS J. Photogramm. 2013, 86, 77–88. [Google Scholar] [CrossRef]
  5. Khan, S.I.; Hong, Y.; Wang, J.; Yilmaz, K.K.; Gourley, J.J.; Adler, R.F.; Brakenridge, G.R.; Policelli, F.; Habib, S.; Irwin, D. Satellite remote sensing and hydrologic modeling for flood inundation mapping in Lake Victoria basin: Implications for hydrologic prediction in ungauged basins. IEEE Trans. Geosci. Remote Sens. 2011, 49, 85–95. [Google Scholar] [CrossRef]
  6. Bajracharya, S.; Shrestha, M.; Shrestha, A. Assessment of high-resolution satellite rainfall estimation products in a streamflow model for flood prediction in the Bagmati basin, Nepal. J. Flood Risk Manag. 2017, 10, 5–16. [Google Scholar] [CrossRef]
  7. Zhang, Y.; Hong, Y.; Wang, X.; Gourley, J.J.; Gao, J.; Vergara, H.J.; Yong, B. Assimilation of passive microwave streamflow signals for improving flood forecasting: A first study in Cubango River Basin, Africa. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 2375–2390. [Google Scholar] [CrossRef]
  8. Zhou, Z.; Ju, J.; Su, X.; Singh, V.; Zhang, G. Comparison of Two Entropy Spectral Analysis Methods for Streamflow Forecasting in Northwest China. Entropy 2017, 19, 597. [Google Scholar] [CrossRef]
  9. Zhang, G.; Zhou, Z.; Su, X.; Ayantobo, O.O. Application of Entropy Spectral Method for Streamflow Forecasting in Northwest China. Entropy 2019, 21, 132. [Google Scholar] [CrossRef]
  10. Singh, V.P.; Cui, H. Entropy theory for streamflow forecasting. Environ. Process. 2015, 2, 449–460. [Google Scholar] [CrossRef]
  11. Burg, J.P. Maximum entropy spectral analysis. In Proceedings of the 37th Annual International Meeting, Oklahoma, OK, USA, 31 October 1967; pp. 34–41. [Google Scholar]
  12. Krstanovic, P.F.; Singh, V.P. A real-time flood forecasting model based on maximum-entropy spectral analysis: I. Development. Water Resour. Manag. 1993, 7, 109–129. [Google Scholar] [CrossRef]
  13. Cui, H.; Singh, V.P. Maximum entropy spectral analysis for streamflow forecasting. Physics A 2016, 442, 91–99. [Google Scholar] [CrossRef]
  14. Huo, C.; Chen, N.; Liu, H. Application of time series free regression model in dynamic simulation and prediction of groundwater in irrigation area. Geotech. Investig. Surv. 1990, 1, 36–38. [Google Scholar]
  15. Zhou, Z.; Li, X. Monthly streamflow forecasting based on entropy spectral theory. J. Northwest A F Univ. (Nat. Sci. Ed.) 2019, 47, 146–154. Available online: http://kns.cnki.net/kcms/detail/61.1390.S.20181106.1657.038.html (accessed on 19 May 2019). (In Chinese).
  16. Webster, P.J.; Jian, J.; Hopson, T.M.; Hoyos, C.D.; Agudelo, P.A.; Chang, H.-R.; Curry, J.A.; Grossman, R.L.; Palmer, T.N.; Subbiah, A. Extended-range probabilistic forecasts of Ganges and Brahmaputra floods in Bangladesh. Bull. Am. Meteorol. Soc. 2010, 91, 1493–1514. [Google Scholar] [CrossRef]
  17. Frieden, B.R. Restoring with maximum likelihood and maximum entropy. J. Opt. Soc. Am. 1972, 62, 511–518. [Google Scholar] [CrossRef]
  18. Cui, H.; Singh, V.P. Configurational entropy theory for streamflow forecasting. J. Hydrol. 2015, 521, 1–17. [Google Scholar] [CrossRef]
  19. Wu, N.-L. An explicit solution and data extension in the maximum entropy method. IEEE Trans. Acoust. Speech Signal Process. 1983, 31, 486–491. [Google Scholar]
  20. Samaranayake, N.; Limaye, S.; Wuthnow, J. Water Resource Competition in the Brahmaputra River Basin: China, India, and Bangladesh; CAN: Arlington, VA, USA, 2016; Available online: https://www.cna.org/centers/cna/strategic-studies/brahmaputra (accessed on 1 July 2018).
  21. Brakenridge, G.R.; Nghiem, S.V.; Anderson, E.; Mic, R. Orbital microwave measurement of river discharge and ice status. Water Resour. Res. 2007, 43, 576. [Google Scholar] [CrossRef]
  22. Hirpa, F.A.; Hopson, T.M.; De Groeve, T.; Brakenridge, G.R.; Gebremichael, M.; Restrepo, P.J. Upstream satellite remote sensing for river discharge forecasting: Application to major rivers in South Asia. Remote Sens. Environ. 2013, 131, 140–151. [Google Scholar] [CrossRef]
  23. Singh, V.P. The use of entropy in hydrology and water resources. Hydrol. Process. 1997, 11, 587–626. [Google Scholar] [CrossRef]
  24. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  25. Nadeu, C. Finite length cepstrum modelling—A simple spectrum estimation technique. Signal Process. 1992, 26, 49–59. [Google Scholar] [CrossRef]
  26. Mills, T.C. Time Series Techniques for Economists; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
  27. Percival, D.B.; Walden, A.T. Spectral Analysis for Physical Applications; Cambridge University Press: Cambridge, UK, 1993. [Google Scholar]
  28. Pandit, S.M.; Wu, S.-M. Time Series and System Analysis with Applications; John Wiley & Sons: New York, NY, USA, 1983. [Google Scholar]
  29. Akaikei, H. Information theory and an extension of maximum likelihood principle. In Proceedings of the 2nd International Symposium on Information Theory, Tsahkadsor, Armenia, 2–8 September 1971; pp. 267–281. [Google Scholar]
  30. McFeeters, S.K. The use of the normalized difference water index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
  31. Brakenridge, G.R.; Kettner, A.; Syvitski, J.; Overeem, I. River and Reservoir Watch Version 3.5, Experimental Satellite-Based River Discharge and Reservoir Area Measurements: Technical Summary. 2016. Available online: http://floodobservatory.colorado.edu/technical.html (accessed on 5 January 2019).
  32. Pervez, M.S.; Henebry, G.M. Assessing the impacts of climate and land use and land cover change on the freshwater availability in the Brahmaputra River basin. J. Hydrol. 2015, 3, 285–311. [Google Scholar] [CrossRef] [Green Version]
  33. Palash, W.; Jiang, Y.; Akanda, A.S.; Small, D.L.; Nozari, A.; Islam, S. A Streamflow and Water Level Forecasting Model for the Ganges, Brahmaputra, and Meghna Rivers with Requisite Simplicity. J. Hydrometeorol. 2018, 19, 201–225. [Google Scholar] [CrossRef]
  34. International Food Policy Research Institute. Global Spatially-Disaggregated Crop Production Statistics Data for 2010 Version 1.0. Harvard Dataverse, V1. 2019. Available online: https://doi.org/10.7910/DVN/PRFF8V (accessed on 4 January 2019).
  35. Kuenzer, C.; Klein, I.; Ullmann, T.; Georgiou, E.F.; Baumhauer, R.; Dech, S. Remote Sensing of River Delta Inundation: Exploiting the Potential of Coarse Spatial Resolution, Temporally-Dense MODIS Time Series. Remote Sens. 2015, 7, 8516–8542. [Google Scholar] [CrossRef] [Green Version]
  36. Immerzeel, W.W.; Van Beek, L.P.; Bierkens, M.F. Climate change will affect the Asian water towers. Science 2010, 328, 1382–1385. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The research workflow scheme of streamflow and flood-affected area forecasting.
Figure 1. The research workflow scheme of streamflow and flood-affected area forecasting.
Entropy 21 00722 g001
Figure 2. Locations of the hydrological station and satellite river discharge measurement sites in the Brahmaputra River basin. (a) Rice planting area; (b) tea planting area.
Figure 2. Locations of the hydrological station and satellite river discharge measurement sites in the Brahmaputra River basin. (a) Rice planting area; (b) tea planting area.
Entropy 21 00722 g002
Figure 3. Mean streamflow in the flood season at different sites and flood-affected areas in the Brahmaputra basin during 2001–2017.
Figure 3. Mean streamflow in the flood season at different sites and flood-affected areas in the Brahmaputra basin during 2001–2017.
Entropy 21 00722 g003
Figure 4. Cross-sections of spatial distribution pattern of the flood-affected area of rice (FAArice) and flood-affected area of tea (FAAtea) in 2004 and 2012.
Figure 4. Cross-sections of spatial distribution pattern of the flood-affected area of rice (FAArice) and flood-affected area of tea (FAAtea) in 2004 and 2012.
Entropy 21 00722 g004
Figure 5. Nash–Sutcliffe efficiency coefficient (NSE) values of models trained by different training periods.
Figure 5. Nash–Sutcliffe efficiency coefficient (NSE) values of models trained by different training periods.
Entropy 21 00722 g005
Figure 6. Observed and forecasted streamflow based on the remote sensing series data.
Figure 6. Observed and forecasted streamflow based on the remote sensing series data.
Entropy 21 00722 g006
Figure 7. Observed and forecasted streamflow based on the Bahadurabad station data.
Figure 7. Observed and forecasted streamflow based on the Bahadurabad station data.
Entropy 21 00722 g007
Figure 8. Relative errors of streamflow forecasting in the flood season (blue dots are outliers of forecast results).
Figure 8. Relative errors of streamflow forecasting in the flood season (blue dots are outliers of forecast results).
Entropy 21 00722 g008
Figure 9. Flood-affected area function curves with observed flood-season streamflow: (a) Observed flood-affected area of rice and the segmented function of FAArice; (b) observed flood-affected area of tea and the segmented function of FAAtea.
Figure 9. Flood-affected area function curves with observed flood-season streamflow: (a) Observed flood-affected area of rice and the segmented function of FAArice; (b) observed flood-affected area of tea and the segmented function of FAAtea.
Entropy 21 00722 g009
Table 1. Basic information of datasets used in the paper.
Table 1. Basic information of datasets used in the paper.
Data TypeTemporal/Spatial ScaleData Source
Ground observed streamflow dataTime seriesYear 2001–2017, monthlyBWDB [16,32,33]
Satellite streamflow measurement dataTime seriesYear 2001–2017, monthly (from daily)River and Reservoir Watch Version 3.5 [31]
Harvest area of rice and teaRaster datasetYear 2010, 250 m (interpolated from 10 km)MAPSPAM2010 [34]
Flood-inundated AreaRaster datasetYear 2001–2017, 250 mMODIS
Table 2. Pearson correlation coefficient table of flood-season streamflow and flood-affected area at the sub-basin scale.
Table 2. Pearson correlation coefficient table of flood-season streamflow and flood-affected area at the sub-basin scale.
Flood-Season Streamflow and Flood-Affected AreaSatellite-1 and Sub-Basin I + II + III + IVSatellite-2 and Sub-Basin II + III + IVSatellite-3 and Sub-Basin III + IVSatellite-4 and Sub-Basin IV
FAAriceCorrelation Coefficient0.6670.4040.6910.385
Significance0.0030.0970.0010.115
FAAteaCorrelation Coefficient0.6710.2670.6900.518
Significance0.0020.2850.0010.028
Table 3. Pearson correlation coefficient table of flood-season streamflow and flood-affected area at the whole-basin scale.
Table 3. Pearson correlation coefficient table of flood-season streamflow and flood-affected area at the whole-basin scale.
Flood-Affected AreaSatellite-1Satellite-2Satellite-3Satellite-4Satellite-MeanBahadurabad Station
FAAriceCorrelation Coefficient0.7490.4430.6220.3060.7980.510
Significance0.0010.0750.0080.2330.0000.036
FAAteaCorrelation Coefficient0.7400.3080.7140.1410.7470.487
Significance0.0010.2290.0010.5880.0010.047
Table 4. The performances of the five models for the remote sensing series.
Table 4. The performances of the five models for the remote sensing series.
BESACESAMLEOLSYW
R0.8580.7510.8070.7830.776
RMSE (m3/s)24663393279929902985
NSE0.7230.4760.6430.5930.594
Table 5. The performances of the five models at the Bahadurabad station.
Table 5. The performances of the five models at the Bahadurabad station.
BESACESAMLEOLSYW
R0.9120.8400.8660.8400.855
RMSE (m3/s)67129902799888128755
NSE0.8230.6150.7490.6950.699

Share and Cite

MDPI and ACS Style

Wang, X.; Wang, S.; Cui, H. Application of the Entropy Spectral Method for Streamflow and Flood-Affected Area Forecasting in the Brahmaputra River Basin. Entropy 2019, 21, 722. https://doi.org/10.3390/e21080722

AMA Style

Wang X, Wang S, Cui H. Application of the Entropy Spectral Method for Streamflow and Flood-Affected Area Forecasting in the Brahmaputra River Basin. Entropy. 2019; 21(8):722. https://doi.org/10.3390/e21080722

Chicago/Turabian Style

Wang, Xiaobo, Shaoqiang Wang, and Huijuan Cui. 2019. "Application of the Entropy Spectral Method for Streamflow and Flood-Affected Area Forecasting in the Brahmaputra River Basin" Entropy 21, no. 8: 722. https://doi.org/10.3390/e21080722

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop