A Forecast-Skill-Based Dynamic Pre-Storm Level Control for Reservoir Flood-Control Operation

: The design and operation of reservoirs based on conventional ﬂood-limited water levels (FLWL) implicitly adopts the assumption of hydrological stationarity. As such, historical-record-based FLWL may not be the best choice for ﬂood-control operations due to the inherent non-stationarity of rainfall inputs. With maturing ﬂood forecasts, this study focuses on establishing linkage between FLWL and skill of forecast, thus developing a “dynamic pre-storm level” approach for reservoir ﬂood-control operations. The approach utilizes forecast ﬂood magnitude, forecast skill and exceedance probability of forecast error to determine the pre-storm reservoir storage for each ﬂood event. The exceedance probability of forecast error for each incoming ﬂood is used as the reservoir ﬂood control standard instead of the probability of a static return interval ﬂood. This approach is demonstrated in a hypothetical situation in the Three Gorges Reservoir in China. The results show that under zero-forecast-skill conditions, the proposed dynamic pre-storm level matches well with the Three Gorges Reservoir-designed FLWL; and, as the forecast accuracy/skill increase, the proposed approach can make better use of the increased forecast accuracy, thereby maximizing ﬂoodwater utilization and reservoir storage. In this way, coupling the new approach with FLWL allows for more efﬁcient and economic day-to-day reservoir operations without adding any ﬂood risk. This study validates the usefulness of dynamic water level control during ﬂood season, considering the improvement of ﬂood forecast accuracy.


Introduction
Most large reservoirs are built for multiple purposes, including flood control, economic benefits and environmental protection. During the flood season, when flood control is the primary objective, reservoir levels are kept low for the entire season to ensure that there is sufficient storage to cater to extreme flood events. This usually static low reservoir level, known as the flood storage level or flood-limited water level (FLWL), is predetermined by reservoir routing of the design flood hydrograph using some given operating strategies and constraints [1,2]. The choice of traditional design flood, as well as FLWL, relies on the historical hydrological conditions (e.g., flood frequency distribution, flood peak) during the flood season. Given the growing non-stationarity of hydrology [3,4] and the rapid progress made in the hydrological forecast, it is imperative to seek new ways for reservoir flood-control operation.
Knowledge of future inflows in terms of streamflow forecasts is undoubtedly valuable for reservoir operation. Streamflow forecasts are greatly improved with the advances period flood volume is thus denoted as "period flood forecast". The use of period flood forecasts would result in quick decisions for reservoir operation and the avoidance of frequent opening and closing of sluice gates.
Focusing on large reservoirs that are unaffected by intraday (e.g., hourly) peak flow fluctuations, this study explores the possibility of flood-control operation based on the forecasts of maximum period flood volume.

Typical Reservoir Flood Control Operation Strategies
At present, reservoir representative water levels include design flood level, normal pool level, flood-limited water level, and dead water level, respectively, from top to bottom. The storage between the normal pool level and the dead water level is the active storage used for water supply, while that between design flood level and flood-limited water level (FLWL) is required for flood control. The engineers designed dams/reservoirs to protect against floods under a particular magnitude. The corresponding frequencies of flood occurrence, or risk level, are known as flood control standards determined by the Ministry of Water Resources of China. Both the probability of exceedance P des and return interval T des (in years), where P des = T −1 des , reflect the cumulative probability of a design magnitude flood being equaled or exceeded (i.e., the so-called flood control standard). The conventional procedure for flood-control operation relies mainly on FLWL. That is, starting from this level, the reservoir could hold any incoming floods below the design magnitude.
The general reservoir flood-control operation trajectories consist of two types of reservoir operation strategies, conventional strategy and DC-FLWL strategy, as shown in Figure 1. The flood-control operation can be split into three periods depending on the occurrence of the flood event: pre-release period, storm-control period, non-storm period. For the conventional flood-control operation strategy (solid line in Figure 1), reservoir water levels are generally operated at FLWL during the pre-release period and non-storm period. When a flood event occurs, the reservoir is permitted to retain floodwater; hence the water level will rise above FLWL (i.e., from level A to some level B based on the reservoir operational practice). Once the flood has subsided, the reservoir water level should return to FLWL. However, this conventional operation does not consider the influences of flood forecasts and often results in uneconomic reservoir operation due to the limitation of low water level. Therefore, Li et al. [19] and follow-up researchers, e.g., [30,31], adopted a range of higher water levels than FLWL during the non-storm period (known as dynamic control of FLWL, DC-FLWL), that allows reservoir water level to fluctuate within the dynamic control bound. When the forecast flood is relatively small, the reservoir is operated with forecast information; when a large flood is forecasted, the water level should be reduced to FLWL at the start of the storm-control period and allowed to rise following the conventional strategy (i.e., from level A to level B). After the storm, the reservoir water level will return to the upper boundary of the dynamic control bound (dashed line in Figure 1).
The method of DC-FLWL has successfully utilized excess reservoir storage capacity during the pre-release period and non-storm period thus increases the hydropower production and floodwater utilization rate without decreasing flood control standard. However, always be ready to lower reservoir level to FLWL can severely constrain the set of DC-FLWL in flood season. Since the FLWL is obtained based on the design flood, it may be pretty low if the design flood is set to a very low probability of occurrence; this is especially the case for some key projects. Nevertheless, most real-world flood events are significantly smaller than the design magnitude; fixing the pre-storm regulating water level at FLWL may be too conservative. Furthermore, the design return interval may not be well-defined because of uncertainty factors such as the sampling amount of flood records and the choice of flood series distribution [32]. These factors can severely affect the precision of return interval prediction. The method of DC-FLWL has successfully utilized excess reservoir storage capacity during the pre-release period and non-storm period thus increases the hydropower production and floodwater utilization rate without decreasing flood control standard. However, always be ready to lower reservoir level to FLWL can severely constrain the set of DC-FLWL in flood season. Since the FLWL is obtained based on the design flood, it may be pretty low if the design flood is set to a very low probability of occurrence; this is especially the case for some key projects. Nevertheless, most real-world flood events are significantly smaller than the design magnitude; fixing the pre-storm regulating water level at FLWL may be too conservative. Furthermore, the design return interval may not be well-defined because of uncertainty factors such as the sampling amount of flood records and the choice of flood series distribution [32]. These factors can severely affect the precision of return interval prediction.
From the above analysis, the DC-FLWL strategy does not give full consideration to the forecasts regarding the magnitude of incoming floods. Instead of employing the probability of a static return interval flood (i.e., design flood) as a reservoir flood control standard, this study proposes a dynamic design-flood probability, indicated by the exceedance probability of forecast error for each incoming flood. A new dynamic pre-storm floodcontrol operation strategy based on this criterion is then proposed (double dashed line in Figure 1) in this study.
The proposed new strategy consists of three key elements: 1. The reservoir water level at the beginning of flood forecasts should be allowed to operate below a limit operation water level; 2. The water level at the start of the storm-control period should be set to a lower water level as compared to the beginning of flood forecasts. This water level (hereinafter referred to as dynamic pre-storm level, D-PSL, i.e., level A' in Figure 1) is related to the magnitude and accuracy of forecasts for the incoming flood. Then reservoir regulates the flood from this level to some level B' based on the difference between flood volume and reservoir release; 3. After the storm-control period, the reservoir should be allowed to return to the limited operation water level. From the above analysis, the DC-FLWL strategy does not give full consideration to the forecasts regarding the magnitude of incoming floods. Instead of employing the probability of a static return interval flood (i.e., design flood) as a reservoir flood control standard, this study proposes a dynamic design-flood probability, indicated by the exceedance probability of forecast error for each incoming flood. A new dynamic pre-storm floodcontrol operation strategy based on this criterion is then proposed (double dashed line in Figure 1) in this study.
The proposed new strategy consists of three key elements:

1.
The reservoir water level at the beginning of flood forecasts should be allowed to operate below a limit operation water level; 2.
The water level at the start of the storm-control period should be set to a lower water level as compared to the beginning of flood forecasts. This water level (hereinafter referred to as dynamic pre-storm level, D-PSL, i.e., level A' in Figure 1) is related to the magnitude and accuracy of forecasts for the incoming flood. Then reservoir regulates the flood from this level to some level B' based on the difference between flood volume and reservoir release; 3.
After the storm-control period, the reservoir should be allowed to return to the limited operation water level.
These elements of the proposed strategy enable the reservoir operators to make full use of the excess reservoir capacity and information provided by the flood forecasts, not just during floods but throughout the flood season. In this study, our analyses focus on the derivation of the dynamic pre-storm level. Flood forecasts are generally associated with some uncertainty because of the stochastic nature of storms and the range of forecast models/products used. We denote by f t s and V t s the forecasting and actual maximum flood volume, respectively, within t s time period. Forecast uncertainty can be indicated by the differences between the two and represented by forecast error ε t s , with: Forecast skill represents the magnitude of forecast uncertainty. One of the commonly adopted methods to evaluate forecast skill is the coefficient of prediction C p proposed by Lettenmaier [28], with: where MSE(·) and var(·) are the mean squared error and statistical variance, respectively. Using multiple linear regression techniques, C p is numerically equal to the square of the correlation coefficient between observed and forecasted period floods [33]. In practice, long-term historical forecasts and observations can be used to estimate C p [34,35]. Consider the fact that hydrologic conditions in more regions have been changed due to the disturbance of intensive human activities and climate change and are expected to continue to change [3,36,37], the non-stationary patterns in streamflow (and precipitation) have brought challenges for flood risk predictions, and thus forecast skill determination. One possible solution is to derive the streamflow statistics on the basis of climate change projections of hydrologic conditions rather than just historical case following, e.g., [9] or [38], which helps mitigate some effects of changes in hydrological variability.
Denoting the variance of forecast error by σ ε 2 , the coefficient of prediction C p can be the connection for a fraction of streamflow variability and forecasts [8,39], with: For a given forecast skill C p , the forecast uncertainty level, σ ε 2 , can be therefore derived from Equation (3). C p ranges from zero to one, with 1.0 for a perfect forecast skill (σ ε 2 becomes zero), and 0.0 for a forecast containing no information other than the mean of historical statistics or projections (σ ε 2 equals to unconditional variability of V t s , σ ε 2 = var(V t s )). Thus, an effective flood forecast error variance cannot be greater than flood variance, i.e., σ ε 2 ≤ var(V t s ). Otherwise, having no forecast would be a better option [39]. Clearly, as forecast skill C p increases, the losses of forecast accuracy improves, and the economic and societal benefits associated with a real-time operating policy should approach the minimum.
For any hydrologic forecast model, the forecast error ε t s may be expressed using a specified distribution obtained via regression or parameter estimation with past forecast error. Knowledge from stochastic hydrology and forecasting studies suggests flood forecast error follows a bounded biased distribution [40]. In practice, ε t s is usually assumed to follow a priori Gaussian, Student's t, or gamma distribution. For the sake of simplicity, the non-Gaussian distributed forecast errors may be transformed and mapped onto Gaussian space by applying the normal quantile transform (NQT) method [41,42], with known transformed variance, σ ε 2 , that is, ε t s ∼ N 0, σ t s 2 , following Ding et al. [23,24] and Zhao et al. [25].

Derivation of Dynamic Pre-Storm Level
Section 2.1 has mentioned that reservoir flood control standard relies mainly on the static design flood with a predefined return interval (T des = 1/P des , event exceeds once in T years). In practice, since the design flood is often far greater than what actually occurs, the design-flood-based water level often tends to be unnecessarily low. Along with maturing flood forecasts, the uncertainty in forecasts for the incoming flood has been considerably decreased. Therefore, we may define a dynamic design-flood probability P des (in terms of exceedance probability of forecast error for each incoming flood) as a new flood control standard, thereby spreading the tiny flood control risk to each flood. The P des should numerically be equal to the primary standard, P des , and be represented by the integral Water 2021, 13, 556 6 of 20 function of forecast error for each incoming flood being beyond a limit δ P_t s under a given C p (as shown in Figure 2), with: where F −1 (·) is the inverse of the cumulative probability function of forecast error. For the normal distribution, a higher/better forecast skill (i.e., larger C p value) would bring a decrease in forecast uncertainty level, σ ε 2 , and thus limit forecast error, δ P_t s .
static design flood with a predefined return interval ( = 1/ , event exceeds once in T years). In practice, since the design flood is often far greater than what actually occurs, the design-flood-based water level often tends to be unnecessarily low. Along with maturing flood forecasts, the uncertainty in forecasts for the incoming flood has been considerably decreased. Therefore, we may define a dynamic design-flood probability (in terms of exceedance probability of forecast error for each incoming flood) as a new flood control standard, thereby spreading the tiny flood control risk to each flood. The should numerically be equal to the primary standard, , and be represented by the integral function of forecast error for each incoming flood being beyond a limit _ under a given (as shown in Figure 2), with: where (•) is the inverse of the cumulative probability function of forecast error. For the normal distribution, a higher/better forecast skill (i.e., larger value) would bring a decrease in forecast uncertainty level, , and thus limit forecast error, _ .  According to the definition of dynamic design-flood probability, the dynamic design period flood, f des_t s , that could cause damage to the reservoir should be the sum of forecast value and limit forecast error, as shown in Figure 2. Note that the dynamic design period flood changes in magnitude with each incoming flood rather than a constant value, with: Both upstream flood and downstream channel capacity influence reservoir floodcontrol operation. Like the common operational practice, we assume the reservoir stores all incoming floodwater while releases according to downstream conveyance capacity (r t s , converted into cumulative volume in t s day). The dynamic pre-storm level D-PSL t s (i.e., the associated pre-storm reservoir storage S 0_t s ) can be expressed as: Water 2021, 13, 556 7 of 20 To ensure flood control safety, reservoir pre-storm regulating water level should not be higher than this D-PSL ts , which is the sum of reservoir capacity K and maximum t s day release volume r t s minus the dynamic design period flood f des_t s .
From the above derivation, the critical pre-storm water level (or storage) for a certain time period can be obtained. Herein, four sets of forecast periods are considered, including maximum 1, 2, 3 and 5 day flood volumes, which bracket the range of useful forecasts. Therefore, the chosen D-PSL for each incoming flood should be the minimum of the multiple time period D-PSL t s , with: The specific calculation steps are as follows: Step 1. Estimate the flood variance, var(V t s ), for multiple periods from historical/projected annual maximum flood series (AM) [43]; Step 2. Evaluate the forecast skill (C p of any applicable hydrological forecast model based on long term retroactive forecasts and observations following hydrological forecasts related research, e.g., [35]; Step 3. Produce the maximum t s day flood volume forecast of an incoming flood using the forecast model, where the forecast is in the form of mean f t s and variance σ ε 2 C p ; Step 4. Calculate the dynamic design period flood, f des_t s , according to the reservoir dynamic design-flood probability, P des , indicated by the exceedance probability of forecast error (e.g., 1% shown in Figure 2); Step 5. Carry out the reservoir flood-control operation according to Equation (7) based on dynamic design period flood, f des_t s , and then obtain the dynamic pre-storm level, D-PSL t s ; Step 6. Repeat step 3 to step 5 with the time period t s varying from 1 day to 2 days, 3 days, and 5 days, and obtain the corresponding D-PSL t s ; Step 7. Define the minimum value in {D-PSL 1 d, D-PSL 2 d, D-PSL 3 d, D-PSL 5 d } as the chosen D-PSL.
When C p = 1.0, the flood volume forecasts with full accuracy, i.e., f t s = V t s . Hence, the limit forecast error δ P_t s obtained from Equation (5) is zero; hence Equation (7) becomes Before proceeding with further analysis, let us consider a special example to illustrate the determination of D-PSL and the regulation process. Let C p = 1.0, K = 20 MCM (million cubic meter), r 1 d = 3 MCM/day. Assume three different types of flood shapes, but with the same period flood volumes to be V 1 d = 7 MCM/day, V 2 d = 7 + 4 = 11 MCM/day, V 3 d = 7 + 4 + 3 = 14 MCM/day, and V 5 d = 7 + 4 + 3 + 1.5 + 1 = 16.5 MCM/day (gray bars in Figure 3). Then we have D-PSL 1 d = 16 MCM, D-PSL 2 d = D-PSL 3 d =15 MCM, D-PSL 5 d = 18.5 MCM according to Equation (9). Following Equation (8), we have the chosen D-PSL = 15 MCM, which should be the initial storage before the flood. Regulating flood with the maximum safety release, the solid lines in Figure 3 show the reservoir storage processes within the storm-control period under three flood shape types. The result shows although the determination of D-PSL is simply based on the flood volume, it is still capable of coping with the daily scale uneven distributed flood hydrograph. Figure 3). Then we have D-PSL1 d = 16 MCM, D-PSL2 d = D-PSL3 d =15 MCM, D-PSL5 d = 18.5 MCM according to Equation (9). Following Equation (8), we have the chosen D-PSL = 15 MCM, which should be the initial storage before the flood. Regulating flood with the maximum safety release, the solid lines in Figure 3 show the reservoir storage processes within the storm-control period under three flood shape types. The result shows although the determination of D-PSL is simply based on the flood volume, it is still capable of coping with the daily scale uneven distributed flood hydrograph.

The Relationship between Forecast Error and Flood Risk
Considering the fact that most reservoir operators are more conservatives than risktakers, it is necessary to analyze the corresponding flood risks. Using this chosen D-PSL as pre-storm regulating water level, the relation between forecast and actual flood volume (regardless of the time period ) can be divided into three cases (see Figure 2): 1. Overestimated flood volume-the probability of such case − ≥ 0 = 0.5.
The reservoir will not be filled to its capacity; thus, the outflow can be reduced according to the flood volume difference between the dynamic design period flood and actual flood volume. Compared to the static FLWL method, the D-PSL-based operation will definitely not increase flood risk, and this conclusion is irrelevant to the magnitude of a flood; 2. Underestimated, but within limit forecast error-the probability of such case − ≤ = 0.5 − . This seems alert condition actually will not face flood risk. This is because the D-PSL is derived from _ rather than the forecast value itself. This means the proposed policy did provide an adequate margin of safety. Indeed, the result is the same as case 1; 3. Significantly underestimated flood volume-the probability of such case − > < ′ . The actual flood is significantly larger than the forecasts. The reservoir storage reserved is insufficient to capture all excess flood. To avoid catastrophic flood damage induced by a dam failure, the reservoir must release water equals to the inflow, thus increase downstream flood risk. However, this kind of probability for each flood event is less than ′ , which is usually as small as 1% to 0.01%.

The Relationship between Forecast Error and Flood Risk
Considering the fact that most reservoir operators are more conservatives than risktakers, it is necessary to analyze the corresponding flood risks. Using this chosen D-PSL as pre-storm regulating water level, the relation between forecast f t s and actual flood volume V t s (regardless of the time period t s ) can be divided into three cases (see Figure 2):

1.
Overestimated flood volume-the probability of such case Prob( f t s − V t s ≥ 0) = 0.5. The reservoir will not be filled to its capacity; thus, the outflow can be reduced according to the flood volume difference between the dynamic design period flood and actual flood volume. Compared to the static FLWL method, the D-PSL-based operation will definitely not increase flood risk, and this conclusion is irrelevant to the magnitude of a flood; 2.
Underestimated, but within limit forecast error-the probability of such case Prob (V t s − f t s ) ≤ δ P ts C p = 0.5 − P des . This seems alert condition actually will not face flood risk. This is because the D-PSL is derived from f des_t s rather than the forecast value itself. This means the proposed policy did provide an adequate margin of safety. Indeed, the result is the same as case 1; 3.
Significantly underestimated flood volume-the probability of such case Prob (V t s − f t s ) > δ P ts C p < P des . The actual flood is significantly larger than the forecasts. The reservoir storage reserved is insufficient to capture all excess flood. To avoid catastrophic flood damage induced by a dam failure, the reservoir must release water equals to the inflow, thus increase downstream flood risk. However, this kind of probability for each flood event is less than P des , which is usually as small as 1% to 0.01%.

Case Study
The proposed methodology for deriving D-PSL is implemented in a case study. Note that this method is only applicable to reservoirs whose sizes are large enough to retain floodpeaks. It is based on the implicit condition that the wave travel time in those reservoirs should be long so that any sudden increase in peak inflow does not adversely affect the water level at the outlet of the reservoir.

The Three Gorges Reservoir
The Three Gorges Reservoir (TGR, 30 • 49 N; 111 • 00 E), one of the largest water resource projects in the world, is selected as a case study in a hypothetical situation. The TGR dam, located in Yichang City of Hubei Province, China, controls the streamflow of the upper Yangtze River. It drains an area of 1.0 million km 2 . The mean annual streamflow at the dam site is 4510 × 10 8 m 3 . The TGR dam is mainly operated for flood control, hydropower generation and navigation improvement. In particular, the flood control target is to protect the downstream areas from frequent and disastrous flood threats in the flood season from June to September.
The designed FLWL of TGR is fixed at 145 m MSL (171.5 × 10 8 m 3 ), which is also the dead water level. The total reservoir storage at the normal pool level (also design flood level) 175 m is 393 × 10 8 m 3 . The current flood-control operation in the flood season is when the flood comes, the release of TGR is hierarchically controlled not to exceed the downstream safety discharge of 56,700 m 3 /s (i.e., r t s /t s ). The flood control storage (storage between FLWL and design flood level) is 221.5 × 10 8 m 3 , which takes up more than half the total storage.
The current flood forecast product for TGR is the Yangtze River real-time flood forecasting system (Yangtze FFS) [44,45]. With the development of weather monitoring and forecasting techniques, reliable flood forecast horizon (i.e., effective forecast lead time) of Yangtze FFS has reaches 3-5 days. Currently, the forecast skill of Yangtze FFS at a three day lead time is roughly estimated at 0.8. Despite such high forecast skill, the current flood-control operation still follows the FLWL, making little use of flood forecasts.

Evaluation of Dynamic Pre-Storm Level
The methodology of deriving the D-PSL from a sequence of simulated forecasts (i.e., expected floods) under a given C p may be summarized in the following steps:

1.
Estimate the period flood variance by reviewing the historical flow using the annual maximum series method; 2.
Simulate flood volume for multiple time periods using the autoregressive model and Gaussian random number generator, and use them as period flood forecasts f t s ; 3. Verify the simulation model by comparing the results between historical observations and simulated forecasts; 4.
Compute the limit forecast error for multiple time periods under a given forecast skill δ P_t s C p using Equations (1) to (5); 5.
Calculate the D-PSL ts based on forecast floods for multiple time periods and obtain the chosen D-PSL using Equations (6) to (9).
Since the detailed forecast information of Yangtze FFS is hard to find, we could not obtain the historical flood forecasts. Moreover, to investigate the relationship between forecast skill and the proposed dynamic pre-storm level, the analysis is based on the simulated forecasts. Using the observed daily inflow records from the year 1882 to 2007 at Yichang hydrologic station, four sets of statistics for maximum daily flow (m 3 /s) and maximum 1, 2, 3 and 5 day flood volumes (10 8 m 3 ) were calculated. Table 1 presents the statistic parameters of design flood volumes for TGR. Hydrological stochastic simulation is widely used in reservoir operation to reduce the impact of hydrological uncertainty [46,47]. Although the TGR flood series of 126 years of observations are long enough for statistical analysis, the observations cannot provide floods of higher magnitude, like 1000 year recurrence events. In addition, because of the intercorrelation between flood volumes of various time periods, stochastic simulation can obtain more combinations that are possible for each incoming flood. Hence there was a need to simulate events of any return interval.
Since the maximum period floods happen in different years are independent, the linear autoregressive order 1 (AR1) model, based on historical records, can be used to simulate floods and take them as forecast floods (i.e., f t s ) [48,49]. The general form of AR1 is: where X t is a vector of the variables of interest, namely Pearson's type III (P3) distributed period floods at year t; ϕ 1 is lag 1 autoregressive coefficient; Φ t is the independent standardized P3 distributed stochastic sequence, expressed as: where ξ is a Gaussian variate with mean zero and standard deviation one; C sΦ , the skewness coefficient of Φ t , is related to that of X t by: The simulated flood now preserves the mean u, variance σ 2 , lag 1 correlation ρ 1 which is assumed as zero, and skewness C sX of each historical flood.
After simulation with 40,000 repetitions (10,000 times each period flow, i.e., maximum 1 day, 2 day, 3 day, and 5 day flood volumes), we have 10,000 sets of simulated floods by randomly grouping the four types of period flows. The boxplots of simulated (forecast) floods and historical period floods are plotted in Figure 4. Compared to the historic daily inflow records from 1882 to 2007, the mean, coefficients of variation and skewness of simulated period flood are calculated and proved to be close to the records. The largest difference in mean, C V and C S are within 2%, 10%, and 6%.  The flood frequency classification of the 10,000 simulated forecast floods is presented in Table 2. Among these simulations, 9 of them predict maximum 1 day flood volumes exceeding the magnitude of 1000 year design flood derived from historical series. Similarly, there are 5 times for maximum 2 day flood volume, 6 times for maximum 3 day The flood frequency classification of the 10,000 simulated forecast floods is presented in Table 2. Among these simulations, 9 of them predict maximum 1 day flood volumes exceeding the magnitude of 1000 year design flood derived from historical series. Similarly, there are 5 times for maximum 2 day flood volume, 6 times for maximum 3 day flood volume and 3 times for maximum 5 day flood volume. As for the low magnitude flood, more than half of the total simulated forecast floods belong to the 2 year flood recurrence or even more frequent. All these comparisons represent the effectiveness of the above flood simulation model, with good reflecting of history and coverage of different magnitude floods.

Forecast-Skill-Based Dynamic Pre-Storm Level
Based on the forecast floods simulated by the AR1 model, this section presents the results of D-PSL ts and chosen D-PSL for each flood event under different levels of forecast skills. The return interval of TGR dam design flood is one in 1000 years (i.e., P des = 0.1%) with a conveyance capacity of 56,700 m 3 /s (i.e., r t s /t s ) for the downstream flood control gauging station Shashi (a once-in-100-year design flood of downstream Jingjiang River reach). The design reservoir storage at the normal level 175 m is 393 × 10 8 m 3 (i.e., K).
Recall from Equation (4) that the reservoir dynamic design-flood probability equals the forecast error exceedance probability, i.e., Prob(ε t s ≥ δ P_t s ) = P des = 0.1%. Under a given forecast skill, the variance of forecast error for each incoming flood is given by Equation (3), i.e., σ ε 2 = 1 − C p var(V t s ). According to Equations (5) to (9), the D-PSL ts for different period flood forecasts, with the forecast skill varying from zero to perfect, can be obtained ( Figure 5).
Focusing only on the result of one certain period (e.g., Figure 5a), the D-PSL 1 d of 10,000 forecast floods ranges from 167 m to 175 m. The water level cannot be higher than 175 m since it is the highest water level permitted. Some D-PSL 1 d obtained from the 10,000 forecast floods (outlier cross sign in Figure 5a) are found to be lower than the D-PSL 1 d obtained from forecasted design flood (blue line in Figure 5a). This is because that some forecast floods are simulated with magnitudes larger than the design flood (see Figure 4). The dispersion of D-PSL ts s reduces with the increase of forecast skill (Xaxis). For the perfect forecast condition (i.e., C p = 1.0), there is no need to vacant any storage for more than 75% of flood events. When comparing the results of multiple periods (e.g., Figure 5a-d), the range of D-PSL ts s becomes wider. Both phenomena are due to the variance of forecast error. For one certain flood event, with the increase of forecast period from maximum 1 day to maximum 5 day, or the decrease of forecast skill C P from one to zero, the uncertainty of forecast (reflected by the variance of forecast error) will increase significantly ( Table 1 or Equation (3)). This means a larger limit forecast error (δ P_t s ) is found from the platykurtic distribution consistently (Equation (5)), and thus, results in a larger dynamic design period flood f des_t s and lower D-PSL ts . Therefore, the chosen D-PSL for one certain flood event, according to Equation (7), obtained from different forecast skill models or different forecast periods can be different.
given forecast skill, the variance of forecast error for each incoming flood is given by Equation (3), i.e., = (1 − )var( ). According to Equations (5) to (9), the D-PSLts for different period flood forecasts, with the forecast skill varying from zero to perfect, can be obtained ( Figure 5).  Table 1, with the change of forecast skill.
Focusing only on the result of one certain period (e.g., Figure 5a), the D-PSL1 d of 10,000 forecast floods ranges from 167 m to 175 m. The water level cannot be higher than 175 m since it is the highest water level permitted. Some D-PSL1 d obtained from the 10,000 forecast floods (outlier cross sign in Figure 5a) are found to be lower than the D-PSL1 d obtained from forecasted design flood (blue line in Figure 5a). This is because that some forecast floods are simulated with magnitudes larger than the design flood (see Figure 4). The dispersion of D-PSLtss reduces with the increase of forecast skill (X-axis). For the perfect forecast condition (i.e., = 1.0), there is no need to vacant any storage for more than 75% of flood events. When comparing the results of multiple periods (e.g., Figure 5a-d), the range of D-PSLtss becomes wider. Both phenomena are due to the variance of forecast error. For one certain flood event, with the increase of forecast period from maximum 1 day to maximum 5 day, or the decrease of forecast skill from one to zero, the uncertainty of forecast (reflected by the variance of forecast error) will increase significantly ( Table 1 or Equation (3)). This means a larger limit forecast error ( _ ) is found from the platykurtic distribution consistently (Equation (5)), and thus, results in a larger dynamic design period flood _ and lower D-PSLts. Therefore, the chosen D-PSL for one certain flood event, according to Equation (7), obtained from different forecast skill models or different forecast periods can be different.
From Figure 5, it seems that the D-PSL5 d derived from the forecasted maximum 5 d flood volume is always the lowest compared to other shorter period results. However, the chosen D-PSL for any particular flood event may not always be D-PSL5 d. For each flood  (Figure 6b), the difference is even clearer. When the forecast skill is low, values from D-PSL 1 d are rarely chosen, and most of the values are from D-PSL 5 d . Take C p = 0.3 for example, the D-PSL 1 d contribution is only 3%, and this percentage is increased to 13% when C p = 0.7. For a near-perfect forecast skill condition (C p = 0.9), four different period D-PSL ts share almost the same proportion, around 15%. In addition, 38% of the values are obtained from more than one period result (denoted as overlap in Figure 6b). forecast floods (Figure 6b), the difference is even clearer. When the forecast skill is low, values from D-PSL1 d are rarely chosen, and most of the values are from D-PSL5 d. Take = 0.3 for example, the D-PSL1 d contribution is only 3%, and this percentage is increased to 13% when = 0.7. For a near-perfect forecast skill condition ( = 0.9), four different period D-PSLts share almost the same proportion, around 15%. In addition, 38% of the values are obtained from more than one period result (denoted as overlap in Figure  6b).  [19]; this is equivalent to a forecast skill of 3.5 (see blue line in Figure 6a).

Impact of Downstream Conveyance Capacity
The previous section assumes that reservoir release varies according to the downstream conveyance capacity. However, it is well-known that reservoir release has a direct impact on downstream safety. When reservoir release exceeds the downstream capacity, An interesting and important observation from Figure 6a is that when a design flood is expected, the chosen D-PSL under zero-forecast-skill conditions is 144.89 m, which is fairly close to the TGR designed FLWL 145 m. This suggests that the result obtained using the proposed methodology conforms to the primary TGR design flood control standard. However, as the forecast skill improves, this chosen D-PSL increases considerably. This water level can be as high as 162.21 m under perfect forecast conditions, indicating the necessity of dynamic water level control during the flood season, depending on the improvement of flood forecast accuracy. As compared to the previous research on TGR, Liu et al. [50] obtained a seasonal FLWL in the pre-flood season of 152.6 m, which is close to a medium forecast skill C p ≈ 0.6; however, in the main flood season of lower than 145 m because the deviation of seasonal FLWL, in fact, does not consider real-time forecast information. Incorporating a 2 day inflow forecast and considering uncertainty, an upper boundary of DC-FLWL was 2.4 m higher than 145 m [19]; this is equivalent to a forecast skill of 3.5 (see blue line in Figure 6a).

Impact of Downstream Conveyance Capacity
The previous section assumes that reservoir release varies according to the downstream conveyance capacity. However, it is well-known that reservoir release has a direct impact on downstream safety. When reservoir release exceeds the downstream capacity, levee failure may occur. An increase or decrease in conveyance capacity usually originates from the enhancement or decaying of the downstream levee. Therefore, it is necessary to investigate the impact of downstream conveyance capacity on the choice of D-PSL. In this section, this effect is presented and analyzed.
To reduce the disturbance of forecast uncertainty, a moderate forecast skill (C p = 0.5) is employed. Twelve representative forecast flood groups were selected from 10,000 simulated floods at constant increment between flood volumes of 30.63 × 10 8 m 3 (F12) and 84.72 × 10 8 m 3 (F1) from the maximum 1 day flood volume forecast ( Table 3). The maximum flood volume forecasted for the 2 day, 3 day and 5 day periods was subsequently determined.
The D-PSLs results computed under the changing downstream conveyance capacity are shown in Figure 7. Here, the capacity is adjusted in the range (30,000, 120,000) m 3 /s, varying from half of primary as the minimum and twice as the maximum. Clearly, D-PSL tends to increase with the enlargement of downstream conveyance capacity. The convex relationship is determined by the nonlinear reservoir elevation-storage relationship. Although the 12 flood events are selected at a regular interval, the resultant D-PSLs do not increase by the same proportion as indicated by the gap between each of the lines shown in Figure 7. This is mainly due to the varying range among the maximum 5 day flood volume forecasts. For example, the water level for F1 almost coincides with that for F2 due to the small difference in the 5 day forecasts between these two (340.02 for F1 and 335.99 for F2); and, conversely, a large gap is found between F1-F6 and F7-F12 due to the large difference in 5 day forecasts among these two groups (minimum in F1-F6 is 256.68 and maximum in F7-F12 is 198.59). Furthermore, if focusing on the results between F4 and F6, we can find the D-PSL for F6 is lower than that for F4 when the downstream conveyance capacity is less than 70,000 m 3 /s, but when release exceeds this critical value, the opposite trend occurs. Generally, from Equation (7), it can be concluded that the capacity will only force linear changes of D-PSL. However, the intersection between the lines of F4 and F6 in Figure 7 suggests the downstream conveyance capacity can also "work" in other ways.    3.2.2 has revealed that forecast skill can be one major factor that influences the choice of D-PSL from four different periods result. A similar plot under the impact of downstream conveyance capacity for the 12 selected floods is shown in Figure 8. When the capacity is less than 40,000 m 3 /s, all chosen D-PSLs for the 12 floods are equal to D-PSL 5 d , which are the minimums compared to the results from the other three periods. However, with the increase of conveyance capacity, the proportion of D-PSL 5 d reduces.
If the reservoir release is large enough (≥120,000 m 3 /s), there is no need to empty any reservoir storage for flood control. Combining results from Figure 7, we could conclude that downstream conveyance capacity not only nearly linearly affects the calculation of D-PSL but also influences the choice of D-PSL among different periods result. Figure 7. The relationship between chosen D-PSL and downstream conveyance capacity ( / ) for 12 selected forecast floods under moderate forecast skill, the vertical dashed line represents the current TGR conveyance capacity of 56,700 m 3 /s. Section 3.2.2 has revealed that forecast skill can be one major factor that influences the choice of D-PSL from four different periods result. A similar plot under the impact of downstream conveyance capacity for the 12 selected floods is shown in Figure 8. When the capacity is less than 40,000 m 3 /s, all chosen D-PSLs for the 12 floods are equal to D-PSL5 d, which are the minimums compared to the results from the other three periods. However, with the increase of conveyance capacity, the proportion of D-PSL5 d reduces. If the reservoir release is large enough (≥120,000 m 3 /s), there is no need to empty any reservoir storage for flood control. Combining results from Figure 7, we could conclude that downstream conveyance capacity not only nearly linearly affects the calculation of D-PSL but also influences the choice of D-PSL among different periods result.

Impact of Reservoir Size
Focusing on reservoirs whose active storages are adequate to regulate flood peaks, this section discussed the impact of reservoir size on the calculation of forecast-skill-based D-PSL. It is compared using the same 12 forecast flood groups in Section 3.3. This effect can be specified using the capacity-flood-volume ratio, which is the reservoir capacity, K, over design maximum 1 day flood volume, V 1 d . Figure 9 provides the relation between chosen D-PSL and capacity-flood-volume ratio, with the latter varying from 2 to 20. Increasing ratios of reservoir capacity over flood volume represent the ability of the system to store floodwater. Since V 1 d is fixed at 83.9 × 10 8 m 3 according to Table 1, an increase in the capacity-flood-volume ratio, K/V 1 d , renders larger reservoir storage capacity K, namely higher highest flood level (i.e., equivalent to the 175 m in the current TGR design, shown as black dashed line in Figure 9). The reservoir elevation-storage curve is fitted to a three-order polynomial function. This allows for extrapolation of a reservoir level for a given reservoir storage capacity. Typically, the maximum capacity-flood-volume ratio (K/V 1 d = 20) stands for a storage capacity of 1678.0 × 10 8 m 3 , which contains a significant extrapolation from the current reservoir magnitude as well as the highest flood level, 248.7 m MSL. The dead water level and minimum level remain at TGR current values, i.e., 145 m and 50 m.
to the 175 m in the current TGR design, shown as black dashed line in Figure 9). The reservoir elevation-storage curve is fitted to a three-order polynomial function. This allows for extrapolation of a reservoir level for a given reservoir storage capacity. Typically, the maximum capacity-flood-volume ratio ( / = 20) stands for a storage capacity of 1678.0 × 10 8 m 3 , which contains a significant extrapolation from the current reservoir magnitude as well as the highest flood level, 248.7 m MSL. The dead water level and minimum level remain at TGR current values, i.e., 145 m and 50 m. The results in Figure 9 illustrate that the D-PSLs tend to increase with the capacityflood-volume ratio, while differences between 12 floods tend to be converged. Recalling Equation (7), these positive correlations and convergent effects are simply due to the change of reservoir storage capacity and nonlinear elevation-storage relationship. Focusing on the case approach of the design flood (F1, black scatter line in Figure 9), the chosen D-PSL is found so high as to be close to the highest flood level (black dashed line) when the capacity-flood-volume ratio is high (e.g., 20). When the capacity-flood-volume ratio decreases to 2, the D-PSL is found as low as a minimum level of 50 m. This means that reservoir should be emptying before the 1000 year design magnitude flood. However, this The results in Figure 9 illustrate that the D-PSLs tend to increase with the capacityflood-volume ratio, while differences between 12 floods tend to be converged. Recalling Equation (7), these positive correlations and convergent effects are simply due to the change of reservoir storage capacity and nonlinear elevation-storage relationship. Focusing on the case approach of the design flood (F1, black scatter line in Figure 9), the chosen D-PSL is found so high as to be close to the highest flood level (black dashed line) when the capacity-flood-volume ratio is high (e.g., 20). When the capacity-flood-volume ratio decreases to 2, the D-PSL is found as low as a minimum level of 50 m. This means that reservoir should be emptying before the 1000 year design magnitude flood. However, this is unrealistic because of the requirement of reservoir refill and economic benefits. Therefore, this suggests that there is a lower limit of a capacity-flood-volume ratio where the D-PSL volume is no longer applicable. The reservoir size is positively correlated with the ability to cope with floods. The larger the reservoir capacity, the greater the level of flood regulation. If small, the occurrence of large floods would result in reservoir failure, and then the joint flood control of cascade reservoirs becomes especially important.

Relationship between D-PSL and Limit Operation Water Level
In practice, D-PSL is affected by the forecast skill and magnitude of the incoming flood, and it, in turn, affects the volume of reservoir pre-release during the pre-release period. Considering the limitation of reservoir pre-release capacity, there should be an upper boundary of reservoir operating water level at the beginning of flood forecast and in the non-storm period, as well (denoted as limit operation water level, shown in Figure 1). The D-PSL is lowest when the forecast skill is zero and highest when the forecast skill is perfect. The D-PSL is high when the magnitude of the forecasted incoming flood is low and vice versa. Therefore, the limit operation water level at the beginning of flood forecasts must be kept sufficiently low to ensure that the water level can be lowered to any required D-PSL at the end of the pre-release period. However, regardless of the magnitude of the forecasted incoming flood, it is of primary importance that there is sufficient reservoir capacity to cope with the design flood. Hence, the limit operation water level is bounded by reservoir pre-release capacity, forecast skill, forecast lead-time for pre-release operation, and the magnitude of the design flood.

Application of the Dynamic Pre-Storm Flood Control Operation Strategy
Regardless of the forecast lead-time, one simplest and most conservative way is to let the limit operation water level equals to the D-PSL derived from the design flood (i.e., the blue line in Figure 6a). Although it can be set to a higher level considering the forecast lead time for pre-release operation theoretically. In this case, the forecast skill triggers the various option of limit operation water level as well as D-PSL. With a stable forecast skill of the reservoir forecast product, we can keep the reservoir operating at the limit operation water level when no flood is forecasted; otherwise, decrease, if low, to the obtained D-PSL when a flood is about to occur.
In comparison to the typical flood-control operation strategies shown in Figure 1, the limit operation water level coincides with FLWL only when both forecast skill and forecast lead-time are zero (i.e., conventional FLWL strategy). Given the existence of forecast leadtime, the limit operation water level becomes the upper boundary of dynamic control bound if fix D-PSL at FLWL (i.e., DC-FLWL strategy). Yet, in the proposed strategy, the D-PSL is relevant with forecast skill, suggesting an even higher limit operation water level is possible as long as the forecast skill is reliable. This strategy not only provides a degree of freedom of reservoir water level in the flood regulation period by using D-PSL but also allows reservoirs to operate at higher levels in non-storm periods by adopting limited operation water levels. This limited operation water level is obviously beneficial for day-to day hydropower, irrigation and other economic operation while not adding any flooding cost other than the design level.
The calculation of D-PSL is a volume-based method, in which the maximum one, two-, three-and five day flood volumes are considered (Equation (8)). However, the shape of a flood hydrograph is a random event, which usually occurs in various combinations of flood peak, volume, and duration [51,52]. A river may have various patterns that are above critical streamflow for one, two, three, or five or more days. The factors that affect the flood hydrograph include, but are not limited to, drainage shape and area, slope and density of stream, soil moisture, type of precipitation, amount of upstream regulation [53,54]. Flood forecast longer than five days would no doubt be operationally useful. We adopted maximum flood volumes within five days because the progressively worse regression fits for longer duration [55] and the relatively small flood volume for rest days.
In case of a significant overestimation in forecast skill, a rolling evaluation of forecast skill is recommended. The skill of forecast a product is first determined from its historical performance and then kept reevaluating in the real-time flood forecasts. The proposed flood-control operation strategy is only adopted if the forecast skill stays consistent throughout the operation. Once the forecast product performs abnormally, this strategy should be terminated immediately, and reservoir water level must be lowered to the designed FLWL on safety grounds. Here, the utilization of the forecast skill stability as triggers for starting or ending the proposed dynamic flood-control operation strategy, but the information used for real-time forecast skill evaluation as well as triggering mechanisms remains a subject for further research.

Conclusions
Contemporary reservoir operations often must balance the risk of downstream flooding, environmental impacts and economic benefits. During flood seasons, the effectiveness of flood-control operation often relies on the accurate determination of the annual static flood-limited water level (FLWL) or improved dynamic control of FLWL. The current state-of-the-art method dictates that the designed FLWL should not be exceeded unless a relatively large flood is forecasted. This approach ignores the flood forecast skill and flood exceedance probability of such event. In this study, a new concept of dynamic FLWL (dynamic pre-storm level, D-PSL) is proposed and critically evaluated using the Three Gorges Reservoir as an example. The D-PSL is based on the forecast flood magnitude, the skill of forecast, and reservoir dynamic design-flood probability represented by the exceedance probability of forecast error.
The results show that D-PSL performs the same as traditional FLWL when the forecast skill is unknown or poor and increases considerably with increasing forecast skill. This demonstrates that D-PSL can be used in conjunction with FLWL without increasing the risk of flooding. By suitably relaxing the pre-storm release, D-PSL maximizes the excess reservoir storage, ensuring that the flood-control operation is both effective and economical. Moreover, for a particular flood event, D-PSL can be determined by flood forecasts of multiple time periods. That is, when forecast skill is low, most of the chosen D-PSL are from D-PSL 5 d ; with the increase of forecast skill, the contribution of different period D-PSL ts becomes a lot closer. In addition, an important parameter to consider when using this method is the reservoir size. A lower limit of capacity-flood-volume ratio is the governing criteria in smaller reservoirs. It must be stressed that the application of the dynamic pre-storm flood-control operation strategy leads to a closer linkage with the current floodcontrol operation strategies. When both forecast skill and forecast lead-time are zero, the conventional FLWL strategy is adopted but reverted to DC-FLWL strategy when fixing D-PSL at FLWL. Thus, the proposed strategy not just provides a degree of freedom of reservoir water level in the storm-control period but also allows reservoir operating at higher levels in non-storm period by adopting limited operation water level. It is expected that such a strategy is beneficial for day-to-day economic reservoir operations.