Development of a Maximum Entropy-Archimedean Copula-Based Bayesian Network Method for Streamflow Frequency Analysis — A Case Study of the Kaidu River Basin , China

Xiangming Kong 1, Xueting Zeng 2, Cong Chen 3, Yurui Fan 4,*, Guohe Huang 5,*, Yongping Li 5 and Chunxiao Wang 1 1 School of Fundamental Science, Beijing Polytechnic, Beijing 100176, China; xiangming0913@126.com (X.K.); chunxiao20083366@163.com (C.W.) 2 School of Labor Economics, Capital University of Economics and Business, Beijing 100070, China; zxt1231@sina.com 3 Donlinks School of Economics and Management, University of Science and Technology Beijing, Beijing 100083, China; chencong@ustb.edu.cn 4 Department of Civil and Environmental Engineering, Brunel University, London, Uxbridge UB8 3PH, UK 5 Center for Energy, Environment and Ecology Research, Beijing Normal University, Beijing 100875, China; yongping.li33@gmail.com * Correspondence: yurui.fan@gmail.com (Y.F.); huangg@iseis.org (G.H.)


Introduction
Many regions especially in developing countries are suffering from severe water stresses resulting from heterogenous precipitation, extreme hydrological events, water shortages, as well as numerous demands from socio-economic and natural systems [1][2][3][4][5].However, extensive uncertainties in hydrological processes are among the major challenges to deriving reliable water system management Water 2019, 11, 42 3 of 16 then applied to a case study of the Kaidu River Basin in the Xinjiang Uyghur Autonomous Region, an arid region in northwest China.In the MECBN method, the maximum entropy-Archimedean copula approach is incorporated into a Bayesian network framework according to the dependences between adjacent monthly streamflows in the Kaidu River Basin.MECBN is used for: (i) assessing the uncertainties which exist in the model representation; (ii) addressing the correlation structure between streamflows in two adjacent months, and (iii) presenting the key statistical characters (i.e., mean, standard deviation, skewness and kurtosis) for the predictive streamflow.

Maximum Entropy Method
The determination of marginal distribution for monthly streamflow is critical.The accuracy of marginal distribution significantly affects the follow-up study for interactions of adjacent monthly streamflows.As a method to generate distribution of random variable, the maximum entropy method has been widely used in hydrology.It can present the key statistical characters of monthly streamflow properly without any hypothesis [31].The marginal probability density function (PDF) for random variable X can be obtained by maximizing the Shannon entropy with mean, standard deviation, skewness and kurtosis as constraints.The marginal cumulative density function (CDF) can be obtained by computing the integral for PDF.Then, the generated marginal PDF and CDF can be expressed as follows: where a and b are the lower and upper bounds of random variable X, respectively.λ i (i = 1, 2, . . ., m) are the Lagrange multipliers which could be determined by the conjugate gradient (CG) method [31]; h i (x) is a known function of X, which can be specified as h 1 = x, h 2 = x 2 , h 3 = x 3 and h 4 = x 4 for the constraints of mean, standard deviation, skewness and kurtosis Hao et al. [32].

Archimedean Copula
After obtaining marginal distribution of monthly streamflow, joint distributions should be further constructed in order to further investigate interactions between adjacent monthly streamflows.The Archimedean copula is widely used in hydrologic frequency analysis.It can generate joint distribution through combining marginal distributions into a copula function.The superior property of the Archimedean copula is that it can be applied when the correlation amongst hydrologic variables is positive or negative [33][34][35].In this study, one-parameter Archimedean copulas including the Gumbel-Hougaard, Clayton and Frank copulas are used to generate the joint distributions of adjacent monthly streamflows.The expressions for two random variables, X and Y, can be defined as follows [34]: where C(u 1 , u 2 ) is the one-parameter Archimedean copula in general, ϕ(u) is the generating function of C(u 1 , u 2 ); ϕ −1 (u) is the inverse function of ϕ(u); u 1 = F X (x) and u 2 = F Y (y) are the CDFs of X and Y, respectively.C GH θ (u 1 , u 2 ), C CT θ (u 1 , u 2 ) and C FK θ (u 1 , u 2 ) are the Gumbel-Hougaard, Clayton and Frank copulas, respectively.θ , θ and θ are the unknown parameters to be estimated for the Gumbel-Hougaard, Clayton and Frank copulas, respectively.

Maximum Entropy-Archimedean Copula Method
A maximum entropy-Archimedean copula (MEAC) method can be proposed through integrating techniques of maximum entropy and Archimedean copula into a general framework.In MEAC, the marginal distribution of monthly streamflow is constructed by the maximum entropy method and the joint distribution of two adjacent monthly streamflows is constructed by an Archimedean copula.The joint distribution of the MEAC method can be formulated as follows: where C θ (u 1 , u 2 ) is the Archimedean copula with parameter θ; u 1 = F X (x; λ) and u 2 = F Y (y; λ ) are the marginal distributions of random variables X and Y, respectively.h i (x) and h i (y) are known functions of X and Y; a 1 and b 1 are the lower and upper bounds of the random variable X, respectively; a 2 and b 2 are the lower and upper bounds of random variable Y, respectively; λ i (i = 1, 2, . . ., m) and λ i (i = 1, 2, . . ., m ) are the Lagrange multipliers.The parameters of MEAC could be estimated by the inference functions for margins (IFM) method [31].The parameter of the Archimedean copula could be estimated by Kendall's tau.The CG method could be used for the parameter estimation of marginal distribution.

Maximum Entropy-Archimedean Copula-Based Bayesian Network
Based on the MEAC method, the joint distribution of adjacent monthly streamflows could be generated.However, the frequency analysis of monthly streamflow also needs to construct the conditional dependency of adjacent monthly streamflows.Therefore, a Bayesian network is introduced in this study in order to obtain the conditional dependencies of monthly streamflow.As a probabilistic model, a Bayesian network can describe the conditional dependencies of a set of random variables via directed acyclic graphs [29,30].The joint PDF of different random variables with a level of dependency (x t 1 , x t 2 , . . . ,x t n ) within a Bayesian network can be defined as follows [30]: Based on Equation (10), the expression of conditional PDF of two adjacent monthly streamflows, X and Y, can be described as follows: where c θ (u 1 , u 2 ) is the PDF of the Archimedean copula with parameter θ, which describes the joint PDF of two adjacent monthly streamflows; u 1 = F X (x; λ) and u 2 = F Y (y; λ ) are the marginal distributions of monthly streamflows whose marginal PDFs are f X (x) and f Y (y), respectively.The PDF of copula c θ (u 1 , u 2 ) can be defined as follows: The general framework of MECBN for modeling hydrological processes is shown in Figure 1.The MECBN method would be used for monthly streamflow frequency analysis.The application involves the construction of marginal, joint and conditional distributions, and the measurement of dependence.In detail, (a) the marginal distributions of monthly streamflows would be constructed by the maximum entropy method; (b) the joint distributions of two adjacent monthly streamflows would be estimated using Archimedean copula based on the generated marginal distributions; (c) the performance of marginal and joint distributions would be tested by the goodness-of-fit statistics; (d) the conditional distributions of two adjacent monthly streamflows would be generated by the Bayesian network combined with generated marginal and joint distributions; (d) the dependence between two adjacent monthly streamflows would be measured by methods including Spearman's rho, Kendall's tau and the tail dependence coefficient.

Historical records of monthly streamflow
Generate marginal distributions of monthly streamflow using the maximum entropy method 1. Parameters are determined by CG method.2. Statistics including mean, standard deviation, skewness and kurtosis are specified as constraints.3. Goodness-of-fit statistic tests including RMSE and K-S are applied to evaluate the marginal distributions.

Generate joint distributions of two adjacent monthly streamflow pairs using the Archimedean copula
Generate conditional distributions of monthly streamflow using the Bayesian network Maximum entropy-Archimedean copula (MEAC) method ( )

Study Area
The Kaidu River Basin (Figure 2), located in the middle reach of the Tarim River Basin, plays a key role in water supply to municipality, industry, agricultural, stockbreeding and forestry sectors in Xinjiang province of China.It lies between the latitudes of 42°14' N and 43°21' N and longitudes

Study Area
The Kaidu River Basin (Figure 2), located in the middle reach of the Tarim River Basin, plays a key role in water supply to municipality, industry, agricultural, stockbreeding and forestry sectors in Xinjiang province of China.It lies between the latitudes of 42 • 14 N and 43 • 21 N and longitudes of 82 • 58 E and 86 • 05 E [36].The Kaidu River Basin experiences a typical inner-continental climate, with an average annual temperature of 4.16 • C. The spatial and temporal distribution of precipitation in the Kaidu River Basin is heterogeneous with an average rainfall of 273 mm/year Huang et al. [37].It exhibits plentiful snowmelt water in spring, concentrated ice-melt water and rainfall recharge during the summer, significant temperature differences during the autumn, and frequent early frost in winter.More than 80% of the rainfall occurs between May and September [38].The study site is Dashankou hydrological station, which is located at the Kaidu River Basin outlet.The Kaidu River Basin above Dashankou hydrological station has a catchment area of 18,827 km 2 with the mean elevation of 3100 m.The data of streamflow records are obtained from the Dashankou hydrological station for the period of 1957-2009.The unit for streamflow is in cubic meter per second (CMS).The monthly variation of streamflow is shown in Figure 3, with a maximum value of 413, a minimum value of 33.2, a mean of 110.54, a standard deviation of 68.77, a skewness of 1.37, and a kurtosis of 5.07.
The Kaidu River Basin has important functions for agricultural irrigation and ecological sustainability.It also plays a critical role in protecting the Bosten Lake.However, the regional water-resources systems have been destroyed because of excessive exploitation and environmental degradation, leading to amounts of ecological and environmental problems.Moreover, the streamflow is replenished by snowmelt in spring, sometimes causing floods.Therefore, it is necessary to propose an effective method to analyze the inherent uncertain characteristics on streamflow of the Kaidu River Basin.In this paper, potential streamflow interactions are explored and formalized into a copula function combined with a Bayesian network framework.An application using such frequency analysis method could be developed as part of a strategy for water resources management and precaution of hydrological extremes.maximum value of 413, a minimum value of 33.2, a mean of 110.54, a standard deviation of 68.77, a skewness of 1.37, and a kurtosis of 5.07.The Kaidu River Basin has important functions for agricultural irrigation and ecological sustainability.It also plays a critical role in protecting the Bosten Lake.However, the regional water-resources systems have been destroyed because of excessive exploitation and environmental degradation, leading to amounts of ecological and environmental problems.Moreover, the streamflow is replenished by snowmelt in spring, sometimes causing floods.Therefore, it is necessary to propose an effective method to analyze the inherent uncertain characteristics on streamflow of the Kaidu River Basin.In this paper, potential streamflow interactions are explored and formalized into a copula function combined with a Bayesian network framework.An application using such frequency analysis method could be developed as part of a strategy for water resources management and precaution of hydrological extremes.

Dependence Measures
In streamflow analysis, one may be interested in dependence structure between adjacent streamflows, especially for the extreme behavior of the streamflows for risk analysis.Therefore, the rank-based coefficients of correlation including Spearman's rho and Kendall's tau, which are well-known non-parametric measures of dependence, are applied to examine the dependence structure of adjacent monthly streamflows.The tail dependence coefficient is used to analyze the extreme behavior of adjacent monthly streamflows [39].The simplification of Spearman's rho can be expressed as: where ρs ∈ [−1, 1], (X i , Y i ), i = 1, 2, . . ., n are the sample of (X, Y); R i stands for the rank of X i among (X 1 , X 2 , . . . ,X n ), and Q i stands for the rank of Y i among (Y 1 , Y 2 , . . . ,Y n ).For Kendall's tau, it can be expressed as: where c and d are the number of concordant of discordant pairs, respectively; (X i , Y i ) and X j , Y j are said to be concordant when The lower and upper tail dependence coefficients are introduced as follows: where λ lo and λ up are the lower and upper dependence coefficients, respectively; F(x) and G(x) are the marginal distributions of random variable X and Y, respectively.

Goodness-of-Fit (GOF)
The goodness-of-fit (GOF) statistic tests for marginal and joint distributions can be performed separately.In this study, the GOF tests for marginal distributions include the root mean square error (RMSE) and the Kolmogorov-Smirnov (K-S) test.The GOF tests for joint distributions consist of the Rosenblatt transformation with Cramér von Mises statistic, the Akaike information criterion (AIC), and RMSE.The expression of RMSE is Willmott and Matsuura [40]: where x est k is the estimated value from the fitted distribution; x obs k is the corresponding observed value; N is the sample size.
As a non-parametric probability distribution free test, the K-S test quantifies the largest vertical difference between the specified and empirical distributions.The statistic of the K-S test is defined as follows: where data points x are in an increasing order; F * (x) and F n (x) are the specified distribution and empirical distribution, respectively; If T exceeds the 1 − α quantile, then we reject the null hypothesis H 0 (The sample data follow the hypothesized distribution); α is the level of significance.
The GOF tests based on the Rosenblatt transformation with Cramér von Mises statistic, Akaike information criterion (AIC) and RMSE are employed in this study.The empirical distribution for the Rosenblatt transformation and the corresponding Cramér von Mises statistic can be expressed as follows [31]: where E i stand for the pseudo-observations from the independence copula C ⊥ ; υ is the vector of marginal distributions; n is the sample size; E il ∨ E jl = max E il , E jl , (l = 1, 2).The corresponding p-value of the Cramér von Mises test statistic exceeds the 1 − α quantile, then we reject the null hypothesis H 0 (the hypothesized copula function is the fitted one); α is the level of significance.Then, AIC can be used for identifying the most appropriate probability distribution, which can be expressed below: The RMSE value for joint distribution can also be determined using Equation (17).The best fitted copula function is the one that has the minimum values of AIC and RMSE.

Marginal Distributions
Firstly, marginal distributions of monthly streamflows are constructed.The Lagrange multipliers for the marginal distributions are estimated by the CG method.The results of estimation are shown in Table 1.Based on the Lagrange multipliers, the PDFs and CDFs of monthly streamflows in the Kaidu River Basin could be determined based on Equations ( 6) and (7).In order to evaluate the marginal distributions of monthly streamflow in Kaidu River Basin generated by the MECBN method, GOF tests including RMSE and K-S test are applied in this study.The results are shown in Table 2.As a measure which is regularly employed in model evaluation studies, RMSE indicates the deviation between generated monthly streamflow and observed monthly streamflow.Most RMSE values in Table 2 are small.The maximum value of RMSE is 10.28.This means that the maximum deviation between generated monthly streamflow and observed monthly streamflow occurs in July, while the minimum RMSE can be observed in March with a value of 1.20.The K-S test can quantify the largest vertical difference between the hypothesized and empirical distributions [31,41,42].Therefore, it can be used to verify if the sample data of monthly streamflow follow the marginal distribution generated by the MECBN method.As presented in Table 2, all the p-Values calculated from K-S test are higher than the significant level α = 0.05, where T is the K-S test statistic.The results indicate that observed monthly streamflows in the Kaidu River Basin could be appropriately represented by the marginal distributions generated by MECBN.
Figure 4 shows the comparison of theoretial and empirical marginal PDF and CDF for monthly streamflow in Kaidu River Basin.Numbers (1) to (12) stand for monthly streamflow from January to December.It could be concluded that the marginal PDF and CDF generated by MECBN are able to capture the shape of empirical histograms and empirical CDF.Therefore, MECBN is effective for generating marginal distributions of monthly streamflows of the Kaidu River Basin.In addition, Figure 4 indicates that most streamflow records in April, May, June, July, August and September are greater than 100 CMS.The streamflow values from April to September, corresponding to the 0.5 value of CDF, are greater than 100 CMS.That is mainly because the distribution of precipitation is uneven in Kaidu River Basin.More than 80% of the total precipitation in a year falls from May to September, and the rest falls from October to April.However, there is a low rainfall but a high streamflow in April.This is due to the fact that snow melting is the main sources in April in the Kaidu River Basin.

11
value of CDF, are greater than 100 CMS.That is mainly because the distribution of precipitation is uneven in Kaidu River Basin.More than 80% of the total precipitation in a year falls from May to September, and the rest falls from October to April.However, there is a low rainfall but a high streamflow in April.This is due to the fact that snow melting is the main sources in April in the Kaidu River Basin.

Joint Distributions
In this study, Archimedean copulas including Gumbel-Hougaard, Clayton and Frank copulas would be used to generate the joint probability distributions for adjacent monthly streamflows.In order to evaluate the joint distributions generated by MECBN, the GOF tests of the Rosenblatt transformation with the Cramér von Mises statistic, AIC and RMSE would be employed.As shown in Table 3.All the p-Values of adjacent monthly streamflow pairs are higher than significant level α = 0.05 for the Gumbel-Hougaard and Frank copulas.When employing Clayton copula to generate joint distributions, the p-Values of months 4-5, 6-7 and 9-10 are lower than the significant level.Results in Table 3 mean that both Gumbel-Hougaard and Frank copulas could be used to generate joint distributions of two adjacent monthly streamflows.In order to choose the most appropriate copula for monthly streamflow pairs in the Kaidu River Basin, the AIC and RMSE values are applied to test goodness of fitness for sample data to the joint distribution obtained by the Gumbel-Hougaard and Frank copulas.Results are given in Table 4.The Gumbel-Hougaard copula would be chosen as the most appropriate copula with the minimum values of AIC and RMSE for monthly streamflow pairs.Based on the analysis above, marginal distributions of monthly streamflow were generated and the Gumbel-Hougaard copula was chosen to describe the dependence structures for adjacent monthly streamflows in the Kaidu River Basin.Therefore, joint PDF, joint CDF and the corresponding counters are obtained by the MECBN method.Dependence measures (i.e., Table 5) indicate that the adjacent monthly streamflow pairs in the Kaidu River Basin are highly correlated with significant upper tail dependence.The upper tail dependence between two adjacent monthly streamflows indicates that streamflow of one month in the Kaidu River Basin depends on the streamflow of its adjacent month, and also denotes the probability for one monthly streamflow exceeding a high threshold under the situation that its adjacent monthly streamflow exceeds that threshold.It is important to determine this in order to provide vital information for the prevention and control of hydrological extremes in the Kaidu River Basin.

Conditional Distributions
Once the marginal and joint distributions are fitted to streamflow records in the Kaidu River Basin, the conditional distributions of adjacent monthly streamflows can be derived according to Equation (11).The streamflow status is classified based on the quantile.In this study, 95, 97.5 and 99 percentile values reflect qualitative information on probability distributions of the scaled monthly streamflow of the Kaidu River Basin.They are used as streamflow classification schemes at high level to explore the conditional distribution.As the streamflow has been scaled into [0,1], 95% means that the streamflow is 0.95.The relationship of conditional distribution of two adjacent months has been transformed to the exploration of the distribution of the streamflow in each month given the streamflow status (0.95, 0.975, 0.99) in the last month demonstrated in Figure 5.
As shown in Figure 5, if the given streamflow percentile value changes in the previous, the probability distribution of the streamflow in the current month varies, which also exhibits the conclusion that there exists upper tail dependence between every two adjacent monthly streamflows in the Kaidu River Basin.Moreover, the probability distributions under a given streamflow percentile value, and the fluctuation ranges under different streamflow percentile values vary considerably between different streamflow pairs.It indicates that the upper tail dependences between each streamflow pairs are different, which are coincide to the upper tail coefficients presented in Table 5.It is also interesting to find that the dependence coefficients including Spearman's rho, Kendall's tau and upper tail dependence coefficients are in inverse proportion to monthly streamflow values in the Kaidu River Basin.In detail, the streamflows in April, May, June, August and September are higher than those in other months.But the dependence coefficients between March and April, April and May, May and June, June and August, and August and September are lower than other monthly streamflow pairs.Among all the streamflow pairs, the dependence between January and February is most significant with the values of Spearman's rho, Kendall's tau and upper tail dependence coefficient being 0.931, 0.784, and 0.838, respectively.It leads to the results that the streamflow in February depends significantly on the streamflow in January and the PDF of streamflow in February varies greatly along with the changes of streamflow status in January.The minimum values of Spearman's rho, Kendall's tau and upper tail dependence coefficient are 0.292, 0.180 and 0.235, respectively.They reveal the interaction of Water 2019, 11, 42 13 of 16 monthly streamflows between April and May.Obviously, f 5|4 indicates the small change of PDF for streamflow in May when the scaled streamflow of April changes from 0.95 to 0.99.These results show that: (a) in general, high magnitudes of two adjacent streamflow corresponds to a low dependence coefficient among them, and vice versa; (b) the two minimum adjacent streamflows exhibit the highest dependence, but the two maximum streamflows do not show the minimum dependent coefficent, which indicates the interaction between streamflow and dependence coefficient is non-linear.That is mainly because (a) there are a variety of factors which affect the streamflow in the Kaidu River Basin, such as rainfall, snow melting, evapotranspiration rate, requirement of water use, and so on; (b) the conditions of streamflow are more complex under flooding season in the Kaidu River Basin, which leads to the situation that months with high streamflow values under flooding season have weak correlations with their adjacent months.In addition, the dependence coefficient between April and May is minimum mainly due to the types of streamflow contributors being different in April and May.It has been concluded before that snow melting is the main source in April in the Kaidu River Basin.However, both rainfall and snow melting make major contributions to the runoff in May.Otherwise, the conditions that the evapotranspiration rate increases with increasing temperature and the requirement of water increases due to the growth of the plants and the impact of human use may also affect the interaction of streamflow pairs between April and May.As shown in Figure 5, if the given streamflow percentile value changes in the previous, the probability distribution of the streamflow in the current month varies, which also exhibits the conclusion that there exists upper tail dependence between every two adjacent monthly streamflows in the Kaidu River Basin.Moreover, the probability distributions under a given streamflow percentile value, and the fluctuation ranges under different streamflow percentile values vary considerably between different streamflow pairs.It indicates that the upper tail

Conclusions
In this study, a maximum entropy-Archimedean copula-based Bayesian network (MECBN) method has been developed in order to analyze the interaction of monthly streamflows.MECBN is effective for representing the uncertainties existing in the model representation, preserving the distributional characteristics of streamflow records and addressing the correlation structure between streamflow pairs.This study is the first attempt at introducing a maximum entropy-copula into the Bayesian network modeling framework.Compared to the conventional Bayesian network approach, MECBN is more effective for addressing the correlation structure between streamflow pairs with arbitrary random characteristics.Moreover, the marginal distribution of monthly streamflow is approximated by the maximum entropy method.Such a method could preserve key statistics, and does not require specific assumptions (e.g., normal, gamma etc.) on the historical streamflow data.The limitations of MECBN is that the selection of copula functions needs more computation to ensure the validity of preserving the non-linear dependence structure.Moreover, the determination of parameters in the MECBN method is computationally cumbersome.
The proposed method has been applied to frequency analysis of monthly streamflow in Kaidu River Basin.The results indicate that (a) MECBN can preserve the key statistical characteristics (i.e., mean, standard deviation, skewness and kurtosis) for each monthly streamflow in the Kaidu River Basin; (b) the Gumbel-Hougaard copula among Archimedean copulas is the most appropriate for reflecting the interactions of monthly streamflow pairs, with the minimum values of AIC and RMSE; (c) the joint distributions, the conditional distributions and the dependence measures including Spearman's Rho, Kendall's Tau and the tail dependence coefficient reveal the upper tail dependence between monthly streamflow pairs in the Kaidu River Basin; (d) the goodness-of-fit statistical tests show a good performance of MECBN in describing the historical data of the Kaidu River Basin, China.
The frequency analysis of monthly streamflow in the Kaidu River Basin indicates that the interactions between two adjacent monthly streamflow pairs are non-linear.Compared with the other months, there is a low rainfall but a high streamflow in April due to the fact that snow melting is the main source for the streamflow in April in the Kaidu River Basin.The upper tail dependence between monthly streamflow pairs in the Kaidu River Basin indicates that streamflow of one month depends on the streamflow of its adjacent month.It also denotes a high probability for one monthly streamflow exceeding a high threshold under the situation that its adjacent monthly streamflow exceeds the threshold.The upper tail dependence varies from different streamflow status (95%, 97.5% and 99% quantiles of streamflow) and is described by the upper tail dependence coefficients.The dependence coefficients of Spearman's rho, Kendall's tau and the upper tail dependence coefficient are in inverse proportion to monthly streamflow values in the Kaidu River Basin.This is because a variety of factors (i.e., rainfall, snow melting, evapotranspiration rate and requirement of water use) would affect the streamflow in the Kaidu River Basin in the flooding season, leading to less impact from the streamflow in the antecedent month.These findings have great significance for the Kaidu River Basin, which is located in the arid region of China.They can be used for providing vital information for the prevention and control of hydrological extremes and further water resources planning in the Kaidu River Basin.

Figure 4 .
Figure 4. Comparison of theoretical and empirical marginal probability density function (PDF) and cumulative density function (CDF).The values are calculated monthly from 1957 to 2009.The abscissa represents streamflow values.The corresponding size varies from month to month.PDF describes the output of variables, and indicates the frequency at a given streamflow value.CDF describes the integrate of PDF, and indicates the probability of a given streamflow value.Lines in PDFs and CDFs show the performance of generated values.Bars and dots depict the empirical values in PDFs and CDFs, respectively.

Figure 4 .
Figure 4. Comparison of theoretical and empirical marginal probability density function (PDF) and cumulative density function (CDF).The values are calculated monthly from 1957 to 2009.The abscissa represents streamflow values.The corresponding size varies from month to month.PDF describes the output of variables, and indicates the frequency at a given streamflow value.CDF describes the integrate of PDF, and indicates the probability of a given streamflow value.Lines in PDFs and CDFs show the performance of generated values.Bars and dots depict the empirical values in PDFs and CDFs, respectively.

Figure 5 .
Figure 5. Distribution of streamflow in each month given the streamflow status (0.95, 0.975, 0.99) in the previous month.The values are calculated monthly from 1957 to 2009.The abscissa represents streamflow values, which are scaled into [0,1].fK+1|K indicates the conditional PDF of two adjacent monthly streamflows, K and K+1.0.95, 0.975 and 0.99 mean the steamflow status of K's streamflow.

Figure 5 .
Figure 5. Distribution of streamflow in each month given the streamflow status (0.95, 0.975, 0.99) in the previous month.The values are calculated monthly from 1957 to 2009.The abscissa represents streamflow values, which are scaled into [0,1].f K+1|K indicates the conditional PDF of two adjacent monthly streamflows, K and K+1.0.95, 0.975 and 0.99 mean the steamflow status of K's streamflow.

Table 1 .
Estimation of Lagrange multipliers for maximum entropy (ME)-based marginal distribution of monthly streamflow.

Table 2 .
The root means square error (RMSE) value and Kolmogorov-Smirnov (K-S) test for marginal distribution of monthly streamflow.

Table 3 .
Comparison of Cramér von Mises statistic for joint distribution of different streamflow pairs.

Table 4 .
Comparison of Akaike information criterion (AIC) and RMSE values for joint distribution of different streamflow pairs.

Table 5 .
Results of dependence measures.