Next Article in Journal
Developing a Multicriteria Decision Analysis Framework to Evaluate Reclaimed Wastewater Use for Agricultural Irrigation: The Case Study of Maryland
Next Article in Special Issue
A One-Way Coupled Hydrodynamic Advection-Diffusion Model to Simulate Congested Large Wood Transport
Previous Article in Journal
High-Resolution Mapping of Tile Drainage in Agricultural Fields Using Unmanned Aerial System (UAS)-Based Radiometric Thermal and Optical Sensors
Previous Article in Special Issue
Propagating Dam Breach Parametric Uncertainty in a River Reach Using the HEC-RAS Software
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

OpenForecast v2: Development and Benchmarking of the First National-Scale Operational Runoff Forecasting System in Russia

State Hydrological Institute, 199004 Saint Petersburg, Russia
Hydrology 2021, 8(1), 3; https://doi.org/10.3390/hydrology8010003
Submission received: 29 November 2020 / Revised: 30 December 2020 / Accepted: 4 January 2021 / Published: 6 January 2021

Abstract

:
Operational national-scale hydrological forecasting systems are widely used in many countries for flood early warning systems and water management. However, this kind of system has never been implemented in Russia. OpenForecast v2—the first national-scale operational runoff forecasting system in Russia—has been developed and deployed to fill this gap. OpenForecast v2 delivers 7 day-ahead streamflow forecasts for 843 gauges across Russia. The verification study has been carried out using 244 gauges for which operational streamflow data were openly available and quality-controlled for the entire verification period (14 March–6 July 2020). The results showed that the developed system provides reliable and skillful runoff forecasts for up to one week. The benchmark testing against climatology and persistence forecasts showed that the system provides skillful predictions for most analyzed basins. OpenForecast v2 is in operational use and is openly available on the Internet.

1. Introduction

Flood-related hazards are among the most devastating natural disasters [1]. The risks they pose are high for many economic sectors and tend to increase [1,2,3]. Furthermore, climate change amplifies the anticipated risks by making the hydrological cycle more intense and making the prediction of its processes more uncertain [4,5]. Thus, the mitigation of risks by developing timely and reliable streamflow forecasts remains a key focus of the research community [6,7].
Many state-of-the-art runoff forecasting systems operate at regional, national and global scales and provide forecasts at hourly, daily or (sub-) seasonal resolution [8,9,10]; for example, the European Flood Awareness System (EFAS [11]), the Global Flood Awareness System (GloFAS [12]), the FANFAR project (https://fanfar.eu/forecast/, last access: 19 November 2020), the forecasting system at the Australian Bureau of Meteorology [13,14], the Terrestrial Systems Modeling Platform (TSMP [15]), to name a few. These systems routinely derive skillful runoff forecasts that are used for—but are not limited to—early warning systems for extreme events and water management. The latest example of devastating floods in Mozambique in 2019 demonstrated the crucial role of timely and distributed forecasts to minimize human and economic losses [16].
Although global forecasting systems such as GloFAS may provide runoff forecasts for any river basin in the world, the efficacy of that forecast is limited. The reason for this is that the global systems are rarely calibrated against streamflow observations [17]. Moreover, the limited accessibility of regional hydrometric data could also hinder calibration [18]. Thus, the use of available regional information should be a key focus when developing new or advancing existing runoff forecasting services. That is particularly relevant for forecasting systems that operate at a local or national scale as they may explicitly consider the use of specific regional data by design [19,20,21,22].
Over recent decades, many countries or their territorial units (e.g., states or provinces) have developed their own runoff forecasting systems [8,9,10]. Corresponding services are available, e.g., in the US [23], UK [22,24], Australia [13,14], and New Zealand [25]. Still, many barriers hinder the development of runoff forecasting systems in many less developed countries worldwide, of which the most significant barriers are economic [26].
There is an increasing trend in the frequency of flood-related hazards in Russia [27]. However, no runoff forecasting systems operate at the national nor at the territorial level. This is surprising because the devastating flash-floods triggered by extreme rainfall (e.g., in Krymsk in 2012 [28]) and exceptional spring floods cause economic and human losses that are a few orders of magnitude higher than the investment required for establishing a national-scale forecasting system; it is all the more surprising as the building blocks comprising any modern runoff forecasting system—observational data, hydrological models, and numerical weather prediction (NWP) models—have been present in the public domain for years.
To fill this gap, in 2018, Ayzel et al. [21] developed the first version of OpenForecast—the first open-source operational runoff forecasting system in Russia (OpenForecast v1; https://hydrogo.github.io/openforecast/, last access: 20 November 2020). OpenForecast v1 has been in operation since 20 July 2018 and delivers a 3 day-ahead runoff forecast for two test-bed river basins: the Moskva River at Barsuki and Seraya River at Novinki. As an interim and forerunner system, OpenForecast v1 provided the guideline for the further development and scaling of the service [21]. In the presented paper, the development of the second version of OpenForecast (OpenForecast v2; https://openforecast.github.io/, last access: 20 November 2020) is discussed; it is the first national-scale operational forecasting system in Russia that delivers 7 day-ahead streamflow forecasts for 843 gauges across Russia. In particular, this study aims to present the development workflow for the established operational service and provide a benchmark for its efficiency in terms of two widely used conventional approaches: climatology and persistence [29].
Finally, the potential of openly available data and software that serve as a basis for forecasting system development is exploited. Similar to its predecessor, each component of OpenForecast v2 is freely and readily available, ensuring the replicability of the system as well as the reproducibility of the results.

2. Data

2.1. Streamflow and Water Level Observations

Archive streamflow (in m 3 /s) and water level observations (in cm above the “gauge null”) for hundreds of gauges across Russia are available at the website of the Automated Information System for State Monitoring of Water Bodies (AIS; https://gmvo.skniivh.ru, last access: 20 November 2020). The corresponding datasets are available for the period from 2008 to 2017 (10 years) and distributed in a machine-readable format (.csv).
Operational data are available only for water level observations at the Unified State System of Information website regarding the Situation in the World Ocean (ESIMO; http://esimo.ru/dataview/viewresource?resourceId=RU_RIHMI-WDC_1325_1, last access: 20 November 2020). The corresponding data are in open access but are available only for the last seven days. Thus, the author developed a script that routinely downloads all the available data and converts them to a machine-readable format (.csv). It should also be mentioned that operational data in ESIMO does not pass a quality control and may be inconsistent with data from the AIS system. The possible reasons for that inconsistency are the measurement instrument’s change or the change of the “gauge null”.

2.2. Meteorological Data

ERA5 global reanalysis [30] serves as a source of meteorological forcing variables: air temperature (T, ° C) and precipitation (P, mm). ERA5 has a spatial resolution of 0.25 ° × 0.25 ° and an hourly temporal resolution. ERA5 data are available for the period from 1979 to within five days of real-time and are taken from two sources: (1) quality-controlled monthly updates, which are published within three months of real-time (namely ERA5), and (2) daily updates of the dataset, which are available within five days of real-time (namely ERA5T). All the data are available for download at the Climate Data Store (https://cds.climate.copernicus.eu, last access: 20 November 2020). Recently, ERA5 showed high reliability to be used as a reference dataset for hydrological modeling over North America [31].
ICON—the global NWP model of the German Weather Service (DWD) [32]—serves as a source of deterministic meteorological forecasts for air temperature and precipitation. ICON has a spatial resolution of about 13 km and a temporal resolution of 1 h and provides forecast data up to one week [32]. The data are freely available for download at the DWD’s Open Data Portal (https://opendata.dwd.de/weather/nwp/icon/, last access: 20 November 2020).
In this study, ERA5 and ICON precipitation and air temperature data have been aggregated to the daily time step and then averaged at the basin scale for each available basin. The spatial averaging is based on the relative weights of the intersection between basin boundaries and corresponding grid cells. Potential evaporation ( P E , mm) is calculated using the temperature-based equation proposed by Oudin et al. [33].

2.3. Gauge Attributes and Basin Boundaries

Gauge attributes, such as the identification number, name, location, and drainage area, are available in both AIS and ESIMO databases. However, the information for a large portion of gauges is inconsistent between these datasets. Thus, manual expert control has been done to assign a unique set of attributes to each presented gauge. Particular attention has been devoted to the correction of geographical position; i.e., gauge latitude and longitude. For example, some gauges did not match the river network, while some have a substantial difference between actual and reported (in AIS or ESIMO) drainage areas. All these inconsistencies have been manually corrected to derive a quality-controlled set of studied gauges.
For each gauge, the corresponding basin boundaries have been obtained using the standard GIS-instruments [34] and the MERIT Hydro digital elevation model [35]. There are 1004 gauges with basin areas from 10 to 100,000 km 2 in the compiled dataset’s preliminary version.

3. Methods

3.1. OpenForecast Computational Workflow

The schematic illustration of the second version of the OpenForecast computational framework is shown in Figure 1. There is no conceptual difference between the current and preceding versions (for the latter, see Figure 4 in [21]). However, the specific differences are as follows:
  • Although the first version uses ERA-Interim reanalysis, the second one uses ERA5, a gradual development over ERA-Interim.
  • Although there is a single hydrological model—GR4J (in French, modèle du Génie Rural à 4 paramètres Journalier) [36]—that simulates runoff in the first version, the second version is complemented by the HBV (in Swedish, Hydrologiska Byråns Vattenbalansavdelning) hydrological model [37,38].
  • Although the single loss function of the Nash–Sutcliffe efficiency coefficient (NSE [39]) has been used for model calibration in the first version, the second version is complemented using the Kling–Gupta efficiency coefficient (KGE [40]).
  • While the first version of the system derives a forecast for three days ahead, the second version extends this to seven days.
  • The number of gauges increased from two in the first version to 843 in the second version.
The computational workflow is introduced in more detail in the following subsections, describing the underlying computational procedures in a step-by-step fashion. Figure 1 shows that the computational procedures comprising the OpenForecast’s workflow can be classified into two types: (1) offline, which are done once and require only historical data, and (2) online, which run on an everyday basis and require new operational data each time.

3.1.1. Model Calibration

The core offline procedure is a hydrological model calibration. In the presented study, I use two conceptual lumped hydrological models: HBV [37,38] and GR4J [36]. Although the HBV model has an internal module for representing snow-related processes, the GR4J model has been coupled with the Cema–Neige snow accumulation routine [41,42]. Both models represent the water balance at the basin scale using storage reservoirs (three in HBV and two in GR4H) and require only daily precipitation, air temperature, and potential evaporation as inputs (see Section 2.2). HBV and GR4J models have 14 and six free parameters, respectively (Table 1 and Table 2).
For each basin, the optimal values of model parameters have been found by their calibration (numerical optimization) against the entire period of observed runoff time series [43] (see Section 2.1). To this end, a global optimization algorithm of differential evolution is used [44]. This algorithm finds a set of optimal model parameters by minimizing the loss function, which is either 1 N S E or 1 K G E , where NSE and KGE are the Nash–Sutcliffe [39] and Kling–Gupta efficiency coefficients [40], respectively. The calibration ranges of model parameters (Table 1 and Table 2) have been adopted from several studies [42,45,46,47,48,49]. The calibration procedure for each basin ends up with four optimal sets of model parameters, resulting from the use of two models (HBV, GR4J) and two loss functions (NSE, KGE). The model codes and corresponding optimization routines are openly available as a part of the Lumped Hydrological Model Playground (https://github.com/hydrogo/LHMP, last access: 23 November 2020).
Table 2. Description and calibration ranges for HBV model parameters (based on Beck et al. [45]).
Table 2. Description and calibration ranges for HBV model parameters (based on Beck et al. [45]).
ParametersDescriptionCalibration Range
TTThreshold temperature when precipitation is simulated as snowfall ( ° C) 2.5 –2.5
SFCFSnowfall gauge undercatch correction factor1–1.5
CWHWater holding capacity of snow0–0.2
CFMAXMelt rate of the snowpack (mm/(day* ° C))0.5–5
CFRRefreezing coefficient0–0.1
FCMaximum water storage in the unsaturated-zone store (mm)50–700
LPSoil moisture value above which actual evaporation reaches potential evaporation0.3–1
BETAShape coefficient of recharge function1–6
UZLThreshold parameter for extra outflow from upper zone (mm)0–100
PERCMaximum percolation to lower zone (mm/day)0–6
K0Additional recession coefficient of upper groundwater store (1/day)0.05–0.99
K1Recession coefficient of upper groundwater store (1/day)0.01–0.8
K2Recession coefficient of lower groundwater store (1/day)0.001–0.15
MAXBASLength of equilateral triangular weighting function (day)1–3

3.1.2. Generation of Runoff Forecast

The obtained sets of optimal model parameters form a core for further computational procedures of runoff forecast generation. At this stage, a hydrological model was run with a set of optimal parameters, providing coupled archive (ERA5) and operational (ICON) meteorological time series as input (Figure 2). As the ERA5T data has a latency of 5 days (see Section 2.2), the seamless integration of (archive) ERA5 and (operational) ICON data at forecast time t cannot be ensured: there will always be a gap of at least five days. In this way, the use of ICON hindcasts is proposed, which are forecast from previous days to fill the corresponding gap. However, to back up the presented workflow from unpredictable delivery delays of ERA5T data, the period filled with ICON hindcasts is extended to 7 days (t−7 days in Figure 2). Thus, for each river basin (or gauge), the output of the runoff forecast generation stage is an ensemble of four runoff forecasts, two of which are simulated with the HBV model driven with two sets of model parameters (calibrated against NSE and KGE, respectively), and the other two are simulated with the GR4J model and the corresponding sets of its optimal parameters.

3.1.3. Forecast Communication

The simulated four-realization ensemble of runoff forecasts must then be properly communicated to a broad audience. To this end, for each river basin, the bokeh Python library (https://docs.bokeh.org, last access: 23 November 2020) was used to generate a webpage that consists of an interactive plot showing 7 day-ahead runoff forecast, as well as the hindcast for the proceeding seven days (Figure 3). Individual realizations of the forecast are not presented; instead, the ensemble mean (bold line in Figure 3) and ensemble spread (filled area in Figure 3) are displayed. There are numerous approaches to runoff forecast visualization [50]. However, this setup proved its reliability in the first version of OpenForecast [21]. The main webpage of OpenForecast v2 demonstrates all the gauges for which runoff forecasts are available on an interactive map (https://openforecast.github.io/, last access: 29 November 2020).

3.1.4. Computational Details

OpenForecast v2 has been in operational use since 14 March 2020. The OpenForecast computational workflow (Figure 1 and Figure 2) runs on a webserver with a single CPU (two computational threads) and 4 Gb of memory. The forecast cycle begins daily at 07:00 UTC to produce runoff forecasts for the set of used gauges for the next 7 days (the day of the initial run is included). All the computations take around 20 min. Similar to the first version, the second version of the OpenForecast computational framework entirely relies on open-source software packages. The Python programming language (https://www.python.org/, last access: 23 November 2020) is used in this work with several open-source libraries: cdsapi (https://github.com/ecmwf/cdsapi, last access: 23 November 2020), xarray (http://xarray.pydata.org, last access: 23 November 2020), cfgrib (https://github.com/ecmwf/cfgrib, last access: 23 November 2020), and cdo (https://code.mpimet.mpg.de/projects/cdo/, last access: 23 November 2020) to preprocess ERA5 and ICON data, numpy (https://numpy.org/, last access: 23 November 2020), pandas (https://pandas.pydata.org/, last access: 23 November 2020), and geopandas (https://geopandas.org/, last access: 23 November 2020) for general-purpose data analysis and calculation, scipy (https://www.scipy.org/, last access: 23 November 2020) for implementation of global optimization algorithm of differential evolution, and matplotlib (https://matplotlib.org/, last access: 23 November 2020), bokeh (https://docs.bokeh.org, last access: 23 November 2020), and folium (https://python-visualization.github.io/folium/, last access: 23 November 2020) for plotting.

3.2. Benchmarks and Verification Setup

The forecasts from operational systems are typically evaluated in terms of the degree of their similarity with observations [29,51]. To this end, here, three efficiency metrics that are widely used in hydrological studies are employed: the Nash–Sutcliffe Efficiency coefficient (NSE; Equation (1); [39]), the Kling–Gupta Efficiency coefficient (KGE; Equation (2); [40]), and the systematic error (BIAS, Equation (3)).
N S E = 1 Ω ( Q s i m Q o b s ) 2 Ω ( Q o b s Q o b s ¯ ) 2
K G E = 1 ( r 1 ) 2 + ( σ s i m σ o b s 1 ) 2 + ( Q s i m ¯ Q o b s ¯ 1 ) 2
B I A S = Ω ( Q s i m Q o b s ) Ω ( Q o b s ) × 100 , %
where Ω is the period of evaluation, Q s i m and Q o b s are the simulated and observed runoff, Q s i m ¯ and Q o b s ¯ are the mean simulated and observed runoff, r is the correlation component represented by Pearson’s correlation coefficient, σ s i m and σ o b s are the standard deviations in simulations and observations, and cov Ω ( Q o b s , Q s i m ) is the covariance of simulated and observed runoff. NSE and KGE are positively oriented and not limited at the bottom: a value of 1 represents a perfect correspondence between simulations and observations. According to Knoben et al. [52], NSE > 0 and KGE > −0.41 can be considered to be showing efficacy against the mean flow benchmark. Bias is unbounded with the perfect value of 0.
Another important component of the forecast evaluation is whether the forecasts add value or show skill compared to some benchmark [29,53]. Thus, forecast skill (Equation (4)) can be assessed by the direct comparison of the forecast and the benchmark accuracy for a given efficiency metric.
S k i l l = A f o r e c a s t A b e n c h m a r k A p e r f e c t A b e n c h m a r k
where A f o r e c a s t is the forecast accuracy in terms of some efficiency metric, A b e n c h m a r k is the benchmark accuracy in terms of the same efficiency metric, and A p e r f e c t is the perfect accuracy for this metric; i.e., 1 for NSE and KGE, and 0 for bias. Skill score values are assigned qualitative descriptions, such as having positive (skill > 0) and negative (skill < 0) skill [14,54].
Following Pappenberger et al. [29], two benchmarks are considered here: (runoff) persistence and climatology. The rationale behind the use of the corresponding benchmarks and computational details are as follows.
  • Runoff climatology(hereafter climatology) is a naive benchmark that requires only information about historical runoff observations. From the general public’s perspective, this benchmark can be formulated as “The situation will be the same as in the year YYYY”. Although the climatology benchmark can be dynamically calculated for each date of the forecast, here, the use of an a posteriori estimate is proposed; i.e., the single-year realization from the available 10-year climatological sample (2008–2017) that has the highest correlation coefficient with observations from the verification period. In this way, the climatology benchmark here will be “the best guess” one can make based on the available climatological sample; i.e., without any forecasting system at all.
  • The runoff persistence (hereafter persistence) benchmark belongs to the change-signal category of benchmarks. It assumes that for any lead time, the runoff will be the same as the last observation (at forecast time). Despite its simplicity, persistence may be useful for short-range forecasting where the forecast signal is dominated by the auto-regression of flow [29]. Following Pappenberger et al. [29], “the last observation” is not considered here as a measured discharge, rather as the last runoff prediction simulated by the hydrological model. That choice ensures consistency and offers a homogeneous verification data set that is usually not readily available for operational observations. Thus, persistence shows the gain provided by the use of a deterministic meteorological forecast.
The verification period is considered from 14 March 2020 (the launch of OpenForecast v2) to 6 July 2020. For this period, water level observations were collected using the ESIMO system (Section 2.1). Then, for each gauge, the water level was transformed to discharge based on the stage–discharge relationship, which was calculated based on available historical observations from the AIS system (Figure 4). To find a rating curve, which is a functional relationship between water level (H) and discharge (Q), three polynomial approximations—linear ( Q = a H + b ), quadratic ( Q = a H 2 + b H + c ), and cubic ( Q = a H 3 + b H 2 + c H + d )—have been fitted to observations (Figure 4). The approximation with the lowest mean absolute error (MAE) has been selected for each gauge. To minimize the effect of the temporal transformation of the rating curve [55], only the last 365 pairs of water level and discharge observations were considered to rate the curve calculation.

4. Results and Discussion

4.1. Hydrological Model Calibration

The calibration procedure (Section 3) was carried out for each basin from the set of 1004 available river basins (Section 2.3). In detail, two hydrological models, HBV and GR4J, were calibrated using two loss functions, KGE and NSE, for automatic fitting. That resulted in four configurations of “model–optimal parameter” pairs: HBV N S E , HBV K G E , GR4J N S E , and GR4J K G E . For example, HBV N S E represents the HBV model with optimal parameters derived from the calibration using NSE as a loss function. Following recommendations in Knoben et al. [52], only river basins for which each model configuration passed both thresholds for “skillful” predictions—i.e., NSE > 0 and KGE > −0.41—were retained. This choice led to the selection of 843 river basins (gauges) that form the core of OpenForecast v2. The database that contains all the optimal model parameters for these basins is readily available in an open repository (https://doi.org/10.5281/zenodo.4328996, last access: 17 December 2020) and could be used for further analysis beyond the scope of the presented study.
Figure 5 demonstrates the calibration results for the core set of 843 basins in terms of NSE and KGE metrics. The median efficiencies in terms of NSE and KGE metrics are 0.78 and 0.84; 0.76 and 0.87; 0.77 and 0.81; 0.74 and 0.86 for HBV N S E , HBV K G E , GR4J N S E , and GR4J K G E , respectively. According to Moriasi et al. [56] and Knoben et al. [52], runoff simulations can be considered to be satisfactory if NSE > 0.5 or KGE > 0.3, respectively. Based on that, at least 725 (86%; GR4J K G E model configuration) and 841 (99.8%; HBV N S E model configuration) basins passed the corresponding thresholds for NSE and KGE, respectively. Thus, the obtained calibration results are promising: even with meteorological reanalysis data as forcing elements, both hydrological models showed a reliable efficiency for runoff simulation at a daily temporal resolution. That confirms recent findings [18,31,46,47,48,57,58], which also showed the high potential of modern climate reanalysis data for use in hydrological modeling applications. Furthermore, the calibration results confirmed that the used model configurations met a common prerequisite that ensures a strong ensemble—efficient yet different model configurations [59].

4.2. Selection of Reference Gauges

For the verification period (14 March 2020 to 6 July 2020; 115 days), water level observations were collected for every gauge present in the ESIMO system (Section 3.2). Then, a detailed analysis of collected data was performed to select the set of reference gauges, where observed water levels were reliable and consistent with those from the AIS database (Section 2.1). The corresponding analysis included automated tests (e.g., detection of outliers, sudden changes in flow dynamics, correspondence with historical observations), and the manual visual inspection of data. This detailed yet subjective analysis led to 244 gauges being selected from the ESIMO database, which were suitable for OpenForecast v2 verification (Figure 6). The following verification results are shown only for the specified set of gauges (river basins). The selection made excluded many gauges from the further analysis, but I argue that the remaining 244 gauges, which represent 29% out of 843 gauges in operation in OpenForecast v2, form a factual basis for a comprehensive analysis and benchmark of the OpenForecast streamflow forecasting system.

4.3. Benchmark and Verification Results

The set of evaluation metrics (Section 3.2) was calculated for each model configuration and for each gauge in the reference set (Figure 6, Section 4.2). Although individual forecasts produced by different model configurations—i.e., HBV N S E , HBV K G E , GR4J N S E , and GR4J K G E —are available for analysis, in the following subsection, only the analysis of the ensemble means of individual forecasts is presented. The ensemble mean (hereafter ENS) is calculated as an arithmetic mean over individual forecast members and represents the average agreement over them. Furthermore, it is the ensemble mean that is communicated as “runoff prediction” (both forecast and hindcast) in the OpenForecast system (Figure 3, Section 3.1). Thus, this choice is also justified by user experience, as ENS represents the actual forecast, rather individual members forming its uncertainty interval.
Figure 7 demonstrates the ensemble mean forecast (ENS) performance in terms of NSE and KGE metrics as a function of lead time for the set of reference basins. The visually distinct results show that (1) ENS shows a reliable median efficiency over the lead time of 7 days but (2) a gradual loss of efficiency over lead time is present, as expected; (3) the loss of efficiency is more pronounced for the NSE metric, both for a median value and width of the Interquartile Range (IQR). This loss of efficiency is usually referred to as the deterioration of the efficiency of meteorological forecast [8,10,21,29]. However, other factors, such as the cumulative effect of biased initial conditions, may also play a considerable role in the corresponding efficiency loss [29,60]. Still, both median NSE and KGE demonstrate values that are higher than behavioral values: 0.5 for NSE and 0.3 for KGE (after Knoben et al. [52]).
To distinguish the difference in ENS forecast efficiency between river basins of different sizes, all the reference basins were classified into three groups based on basin area: small (<1000 km 2 ), medium (1000–10,000 km 2 ), and large (>10,000 km 2 ). Figure 8 demonstrates the efficiency (in terms of NSE and KGE metrics) of ENS forecast as a function of lead time for reference basins of different sizes. The loss of the forecast efficiency over lead time is more pronounced for small rather than for medium or large basins. Thus, the loss of efficiency from the first to a seventh day-ahead forecasts (in terms of median NSE) for small, medium, and large basins are 0.68, 0.06, and 0.04, respectively. For KGE, these differences are less pronounced, yet still present. Thus, forecasts issued for small basins are less reliable than those for medium and large basins. This is expected as smaller basins tend to react to the changes in weather or/and initial conditions more rapidly [29,61]. Thus, advancing runoff predictions at small spatial scales must have a higher priority for the hydrological modeling community [62].
Although the results of OpenForecast v2 efficiency (Figure 7 and Figure 8) are promising, it is important to estimate the gain provided by OpenForecast v2 compared to two standard benchmarks (Section 3.2): climatology and persistence. To this end, efficiency metrics were compared for both ensemble mean forecast (ENS) and the set of benchmark models; then, respective skill scores were calculated for each basin (Equation (4)). Figure 9 shows whether OpenForecast forecasts were skillful in comparison to climatology (skill > 0) for the set of reference gauges or not (skill < 0). For the sake of brevity, only the skill for lead times of one, three, and seven days is plotted. Results show that for the lead time of one day, the provided forecasts have a positive skill for at least 86% (i.e., for 204 out of 244 basins in terms of NSE) of basins from the reference set. There is a slight decrease in the number of skillful basins with lead time. For the lead time of 7 days, forecasts are skillful for at least 78% of analyzed basins (i.e., for 191 out of 244 basins in terms of KGE). The results do not clearly represent the expected pattern of less skillful predictions in highly seasonal basins [29], where climatology may provide a solid benchmark estimate [63]: there are many adjacent basins across the entire spatial coverage of data where the calculated skills have opposite signs. Moreover, there are also many basins in which the calculated skill may have different signs for calculated metrics (e.g., positive skill in terms of NSE and negative in terms of bias). Thus, the estimate of the number of basins where climatology is skillful compared to OpenForecast is optimistic. Furthermore, it should be mentioned that the implementation of the climatology benchmark in the presented study tends to provide a more optimistic assessment of its efficiency because of a posteriori calculation (Section 3.2).
The climatology benchmark remains efficient for a considerable portion of basins. However, the overall positive skill provided by OpenForecast is clear. Moreover, the skill of the climatology benchmark tends to decrease due to climate change. According to recent studies, the number of extreme events will increase in a warmer climate [64,65,66]; thus, runoff climatology will give less efficient estimates in the future. The rationale behind the use of the climatology benchmark was from the comparison of two scenarios—(1) when a runoff forecasting system was present, and (2) when there was no runoff forecasting system—and one could make predictions based only on an analogy, the naive version of which is “The situation will be the same as in the year YYYY”. The obtained results confirmed that the forecasting system (OpenForecast) ensures the production of more reliable runoff forecasts in comparison to naive approaches. Thus, the urgent need for the development and further distribution of such systems for operational runoff forecasting is confirmed.
Similar to Figure 9, Figure 10 demonstrates the skill of the ensemble mean (ENS) forecast over the runoff persistence benchmark. As expected, runoff forecasts are less skillful compared to persistence than climatology. However, even for the lead time of one day, at least 62%, 57%, and 53% of basins from the reference set demonstrate positive skill in terms of NSE, KGE, and bias, respectively. The percentage of skillful basins progressively increases with lead time. For the lead time of seven days, it reaches 80%, 89%, and 66% in terms of NSE, KGE, and bias, respectively. Moreover, the skill also increases in absolute values, becoming more pronounced with increasing lead time.
The obtained results indicate the crucial value of NWP weather forecast use in the OpenForecast system. Although persistence is a skillful benchmark for very short lead times, it becomes less reliable for longer lead times [7,29]. Thus, the forecast skill compared to persistence benefits more from the use of NWP weather forecasts with increasing lead time. In this way, the further development of runoff forecasting systems should focus on the reliability of weather forecasts due to their high effect on the resulting efficiency.
In summary, the verification results showed the high reliability, skill, and additional value provided by the OpenForecast v2 system. The current computational workflow (Section 3.1, Figure 1) proved its efficiency and demonstrated remarkable progress in comparison to the first version of the OpenForecast system [21]. For example, the verification results of the OpenForecast v1 system showed its limited ability to simulate flood volumes. To compensate for such errors, a data assimilation routine has been developed to update forecasts based on operational streamflow observations; that led to a significant increase in forecast efficiency, from negative values to 0.9 in terms of NSE. Instead, the second version of the OpenForecast system does not include any data assimilation routine yet provides reliable predictions (Figure 7 and Figure 8). I argue that two components ensure the demonstrated high out-of-the-shelf efficiency of OpenForecast v2:
  • The transition from ERA-Interim to ERA5 meteorological reanalysis (including the use of ERA5T product; Section 2.2).
  • The transition from deterministic to ensemble runoff forecast, which is produced by different hydrological model configurations (Section 3.1).
The verification of OpenForecast v2 proved the reliability of the framework on which it is based. However, there are several issues worth further discussion. In detail, I acknowledge that OpenForecast efficiency can be limited by the following factors:
  • The use of meteorological reanalysis data instead of observation-based products.
  • The use of non-homogeneous meteorological data—i.e., ERA5 reanalysis—but ICON NWP forecast.
  • The lack of observational streamflow data assimilation.
  • The lack of an error correction routine.
  • The use of lumped conceptual hydrological models, while far more advanced models exist.
The aforementioned factors could have a major impact on runoff forecasting system improvement, but there are several barriers that limit their implementation potential. The first barrier is the (still) limited availability of both meteorological and streamflow data [18,67]. Although recent studies revealed pronounced biases in meteorological reanalysis products [68], as well as corresponding effects on the results of hydrological modeling [46,69], observation-based products are rarely readily available for use in operational forecasting systems. A similar situation is found with the availability of operational streamflow observations. Forecasting systems significantly benefit from the implementation of observational data assimilation routines [21,70]. However, the data often exhibit large errors and long discontinuities due to absent or insufficient quality control. The issue of the use of non-homogeneous meteorological data also remains unresolvable. Despite many research centers providing both reanalysis and forecast data, there is no openly available and spatiotemporal consistent dataset. The second barrier is more computational: there are many models that can be suitable in an operational flood forecasting framework at a continental scale [71], but the added complexity of their implementation related to the required data and computational resources may not be compensated by the obtained efficiency gains [72]. To some extent, this is also relevant to error correction techniques [73,74]. Thus, the development of runoff forecasting systems should continue to investigate the careful balance between the used data and models in terms of their availability, reliability, and computational complexity.

4.4. Website Traffic and Demand for Forecasts

Although timely and reliable runoff forecast production is the central focus of any forecasting service, the issued forecasts should be properly disseminated and communicated with the general public. OpenForecast v2 delivers runoff forecasts through the website (https://openforecast.github.io/, last access: 28 November 2020). There is an interactive map on the main page of this website that shows the location of all 834 gauges for which a runoff forecast is issued. Clicking on a gauge’s icon on the map redirects the user to the individual webpage for the respective gauge with an interactive plot of the 7-day-ahead streamflow forecast (Figure 3).
Figure 11 demonstrates the time series of the daily number of unique visits to the OpenForecast main page. Results show that the first surge of interest in OpenForecast (16–25 March 2020) was related to its release, while the second surge was related to the flood period in Russia (April–June 2020). The dynamics of the website’s traffic (Figure 11) underline the main difference between weather and runoff forecasts. Although the general public needs (and uses) weather forecasts on an everyday basis, runoff forecasts are needed only during flood periods. Thus, the demand for weather forecasts becomes a habit, while services for runoff forecasts remain marginal. This low and unsteady demand for runoff forecasts also limits their commercial potential, which is confirmed by the absence of private runoff forecast services. The low interest in runoff forecasts can be, to some extent, attributed to the weak communication skills of the hydrologists who develop the service. There are only a few papers (e.g., [50,75]) that put their central focus on communication rather than computational issues. Thus, hydrologists know how to compute forecasts but do not know how to communicate them efficiently.

5. Conclusions and Outlook

In the presented study, the development and efficiency of OpenForecast v2—the first national-scale operational runoff forecasting system in Russia—are explored. OpenForecast v2 is fully based on open data and software and operationally delivers 7 day-ahead streamflow forecasts for 843 gauges across Russia. The verification of OpenForecast has been carried out using 244 gauges for which operational streamflow data were accessible and reliable during the verification period (14 March–6 July 2020). The skill of runoff forecasts, which measures the improvement in OpenForecast v2 over benchmark forecast, has been calculated and compared to two widely used benchmarks: climatology and persistence.
Results show that OpenForecast v2 provides reliable and skillful runoff forecasts up to one week. Thus, the loss of skill with lead time is more pronounced for small (<1000 km 2 ) river basins, rather than for medium (1000–10,000 km 2 ) and large (>10,000 km 2 ) basins. The results of the benchmark compared with climatology and persistence showed that OpenForecast v2 provides skillful predictions for most of the analyzed basins. Thus, it offers prominent additional value and has the potential to serve as a core component for early warning and water management systems.
The demonstrated development and benchmarking of OpenForecast v2 also helped to reveal several problems and questions worth exploring in future research. First, the results of the hydrological model calibration procedure showed that 161 out of 1004 basins (16%) did not pass the “skillful” thresholds in terms of NSE and KGE, and 118 out of the remaining 843 basins (14%) fell below the “satisfactory” threshold in terms of NSE. This shows that additional effort is needed both to understand the possible causes of model failure and to find respective ways to address the identified causes.
Second, an overall problem of data availability is shown. This problem mainly refers to Russian institutions, which are responsible for environmental monitoring. Thus, there is neither a model nor an observation-based, spatially and temporally consistent, national-scale meteorological dataset that can be used for hydrological applications. The situation with hydrological data is similar. Although the AIS database contains historical streamflow observations for more than 1000 gauges for the period of 10 years (2008–2017), it has not yet been updated with data from the preceding historical period. The ESIMO database, which maintains the circulation of the operational streamflow data, is designed in a way that limits its use in hydrological applications. Thus, the publication of streamflow data in Russia is considered rather exceptional than routine. A paradigm change is needed to improve the situation.
Third, there are obvious problems with runoff forecast communication. Many people in Russia may benefit from the timely and reliable forecasts produced by the OpenForecast system, but they do not know that it exists. The collaboration with media providers, such as TV channels, newspapers, web portals, and popular social networks, may help to improve the situation. However, additional (and substantial) resources are needed to arouse interest in cooperation.
In the first paper about the OpenForecast system [21], the authors mentioned that it has the potential to be implemented at a national scale. In the presented paper, this potential is exploited. Furthermore, the potential of the OpenForecast system is further elaborated upon, and it is argued that it could be further extended in the form of a framework: a set of open-source computational procedures that deploy operational runoff forecasting system for any region, country, or continent.

Funding

The reported study was funded by the Russian Foundation for Basic Research (RFBR) according to the research projects Nos. 19-05-00087 and 19-35-60005.

Acknowledgments

The author thanks Liubov Kurochkina for providing river basins’ boundaries.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. CRED. Natural Disasters. 2019. Available online: https://emdat.be/sites/default/files/adsr_2019.pdf (accessed on 19 November 2020).
  2. Pappenberger, F.; Cloke, H.L.; Parker, D.J.; Wetterhall, F.; Richardson, D.S.; Thielen, J. The monetary benefit of early flood warnings in Europe. Environ. Sci. Policy 2015, 51, 278–291. [Google Scholar] [CrossRef]
  3. Ward, P.J.; Blauhut, V.; Bloemendaal, N.; Daniell, J.E.; de Ruiter, M.C.; Duncan, M.J.; Emberson, R.; Jenkins, S.F.; Kirschbaum, D.; Kunz, M.; et al. Review article: Natural hazard risk assessments at the global scale. Nat. Hazards Earth Syst. Sci. 2020, 20, 1069–1096. [Google Scholar] [CrossRef] [Green Version]
  4. Blöschl, G.; Hall, J.; Viglione, A.; Perdigão, R.A.; Parajka, J.; Merz, B.; Lun, D.; Arheimer, B.; Aronica, G.T.; Bilibashi, A.; et al. Changing climate both increases and decreases European river floods. Nature 2019, 573, 108–111. [Google Scholar] [CrossRef] [PubMed]
  5. Blöschl, G.; Kiss, A.; Viglione, A.; Barriendos, M.; Böhm, O.; Brázdil, R.; Coeur, D.; Demarée, G.; Llasat, M.C.; Macdonald, N.; et al. Current European flood-rich period exceptional compared with past 500 years. Nature 2020, 583, 560–566. [Google Scholar] [CrossRef]
  6. Emerton, R.; Zsoter, E.; Arnal, L.; Cloke, H.L.; Muraro, D.; Prudhomme, C.; Stephens, E.M.; Salamon, P.; Pappenberger, F. Developing a global operational seasonal hydro-meteorological forecasting system: GloFAS-Seasonal v1.0. Geosci. Model Dev. 2018, 11, 3327–3346. [Google Scholar] [CrossRef] [Green Version]
  7. Harrigan, S.; Zoster, E.; Cloke, H.; Salamon, P.; Prudhomme, C. Daily ensemble river discharge reforecasts and real-time forecasts from the operational Global Flood Awareness System. Hydrol. Earth Syst. Sci. Discuss. 2020, 2020, 1–22. [Google Scholar] [CrossRef]
  8. Cloke, H.; Pappenberger, F. Ensemble flood forecasting: A review. J. Hydrol. 2009, 375, 613–626. [Google Scholar] [CrossRef]
  9. Emerton, R.E.; Stephens, E.M.; Pappenberger, F.; Pagano, T.C.; Weerts, A.H.; Wood, A.W.; Salamon, P.; Brown, J.D.; Hjerdt, N.; Donnelly, C.; et al. Continental and global scale flood forecasting systems. Wiley Interdiscip. Rev. Water 2016, 3, 391–418. [Google Scholar] [CrossRef] [Green Version]
  10. Wu, W.; Emerton, R.; Duan, Q.; Wood, A.W.; Wetterhall, F.; Robertson, D.E. Ensemble flood forecasting: Current status and future opportunities. Wiley Interdiscip. Rev. Water 2020, 7, e1432. [Google Scholar] [CrossRef]
  11. Bartholmes, J.C.; Thielen, J.; Ramos, M.H.; Gentilini, S. The european flood alert system EFAS—Part 2: Statistical skill assessment of probabilistic and deterministic operational forecasts. Hydrol. Earth Syst. Sci. 2009, 13, 141–153. [Google Scholar] [CrossRef] [Green Version]
  12. Alfieri, L.; Burek, P.; Dutra, E.; Krzeminski, B.; Muraro, D.; Thielen, J.; Pappenberger, F. GloFAS: Global ensemble streamflow forecasting and flood early warning. Hydrol. Earth Syst. Sci. 2013, 17, 1161–1175. [Google Scholar] [CrossRef] [Green Version]
  13. Robertson, D.E.; Shrestha, D.L.; Wang, Q.J. Post-processing rainfall forecasts from numerical weather prediction models for short-term streamflow forecasting. Hydrol. Earth Syst. Sci. 2013, 17, 3587–3603. [Google Scholar] [CrossRef] [Green Version]
  14. Bennett, J.C.; Wang, Q.J.; Robertson, D.E.; Schepen, A.; Li, M.; Michael, K. Assessment of an ensemble seasonal streamflow forecasting system for Australia. Hydrol. Earth Syst. Sci. 2017, 21, 6007–6030. [Google Scholar] [CrossRef] [Green Version]
  15. Kollet, S.; Gasper, F.; Brdar, S.; Goergen, K.; Hendricks-Franssen, H.J.; Keune, J.; Kurtz, W.; Küll, V.; Pappenberger, F.; Poll, S.; et al. Introduction of an experimental terrestrial forecasting/monitoring system at regional to continental scales based on the terrestrial systems modeling platform (v1. 1.0). Water 2018, 10, 1697. [Google Scholar] [CrossRef] [Green Version]
  16. Emerton, R.; Cloke, H.; Ficchi, A.; Hawker, L.; de Wit, S.; Speight, L.; Prudhomme, C.; Rundell, P.; West, R.; Neal, J.; et al. Emergency flood bulletins for Cyclones Idai and Kenneth: A critical evaluation of the use of global flood forecasts for international humanitarian preparedness and response. Int. J. Disaster Risk Reduct. 2020, 50, 101811. [Google Scholar] [CrossRef]
  17. Hirpa, F.A.; Salamon, P.; Beck, H.E.; Lorini, V.; Alfieri, L.; Zsoter, E.; Dadson, S.J. Calibration of the Global Flood Awareness System (GloFAS) using daily streamflow data. J. Hydrol. 2018, 566, 595–606. [Google Scholar] [CrossRef]
  18. Ayzel, G.; Kurochkina, L.; Zhuravlev, S. The influence of regional hydrometric data incorporation on the accuracy of gridded reconstruction of monthly runoff. Hydrol. Sci. J. 2020, 1–12. [Google Scholar] [CrossRef]
  19. Antonetti, M.; Zappa, M. How can expert knowledge increase the realism of conceptual hydrological models? A case study based on the concept of dominant runoff process in the Swiss Pre-Alps. Hydrol. Earth Syst. Sci. 2018, 22, 4425–4447. [Google Scholar] [CrossRef] [Green Version]
  20. Antonetti, M.; Horat, C.; Sideris, I.V.; Zappa, M. Ensemble flood forecasting considering dominant runoff processes – Part 1: Set-up and application to nested basins (Emme, Switzerland). Nat. Hazards Earth Syst. Sci. 2019, 19, 19–40. [Google Scholar] [CrossRef] [Green Version]
  21. Ayzel, G.; Varentsova, N.; Erina, O.; Sokolov, D.; Kurochkina, L.; Moreydo, V. OpenForecast: The First Open-Source Operational Runoff Forecasting System in Russia. Water 2019, 11, 1546. [Google Scholar] [CrossRef] [Green Version]
  22. Robson, A.; Moore, R.; Wells, S.; Rudd, A.; Cole, S.; Mattingley, P. Understanding the Performance of Flood Forecasting Models; Technical Report SC130006; Centre for Ecology and Hydrology, Environment Agency: Wallingford, OX, USA, 2017.
  23. Cohen, S.; Praskievicz, S.; Maidment, D.R. Featured Collection Introduction: National Water Model. JAWRA J. Am. Water Resour. Assoc. 2018, 54, 767–769. [Google Scholar] [CrossRef] [Green Version]
  24. Prudhomme, C.; Hannaford, J.; Harrigan, S.; Boorman, D.; Knight, J.; Bell, V.; Jackson, C.; Svensson, C.; Parry, S.; Bachiller-Jareno, N.; et al. Hydrological Outlook UK: An operational streamflow and groundwater level forecasting system at monthly to seasonal time scales. Hydrol. Sci. J. 2017, 62, 2753–2768. [Google Scholar] [CrossRef] [Green Version]
  25. McMillan, H.K.; Booker, D.J.; Cattoën, C. Validation of a national hydrological model. J. Hydrol. 2016, 541. [Google Scholar] [CrossRef]
  26. Viglione, A.; Borga, M.; Balabanis, P.; Blöschl, G. Barriers to the exchange of hydrometeorological data in Europe: Results from a survey and implications for data policy. J. Hydrol. 2010, 394, 63–77. [Google Scholar] [CrossRef]
  27. Frolova, N.; Kireeva, M.; Magrickiy, D.; Bologov, M.; Kopylov, V.; Hall, J.; Semenov, V.; Kosolapov, A.; Dorozhkin, E.; Korobkina, E.; et al. Hydrological hazards in Russia: Origin, classification, changes and risk assessment. Nat. Hazards 2017, 88, 103–131. [Google Scholar] [CrossRef]
  28. Meredith, E.P.; Semenov, V.A.; Maraun, D.; Park, W.; Chernokulsky, A.V. Crucial role of Black Sea warming in amplifying the 2012 Krymsk precipitation extreme. Nat. Geosci. 2015, 8, 615–619. [Google Scholar] [CrossRef]
  29. Pappenberger, F.; Ramos, M.H.; Cloke, H.L.; Wetterhall, F.; Alfieri, L.; Bogner, K.; Mueller, A.; Salamon, P. How do I know if my forecasts are better? Using benchmarks in hydrological ensemble prediction. J. Hydrol. 2015, 522, 697–713. [Google Scholar] [CrossRef] [Green Version]
  30. Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Schepers, D.; et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
  31. Tarek, M.; Brissette, F.P.; Arsenault, R. Evaluation of the ERA5 reanalysis as a potential reference dataset for hydrological modelling over North America. Hydrol. Earth Syst. Sci. 2020, 24, 2527–2544. [Google Scholar] [CrossRef]
  32. Reinert, D.; Prill, F.; Frank, H.; Denhard, M.; Baldauf, M.; Schraff, C.; Gebhardt, C.; Marsigli, C.; Zängl, G. DWD Database Reference for the Global and Regional ICON and ICON-EPS Forecasting System; Technical Report Version 2.1.1; Deutscher Wetterdienst (DWD): Offenbach, Germany, 2020. [Google Scholar] [CrossRef]
  33. Oudin, L.; Hervieu, F.; Michel, C.; Perrin, C.; Andréassian, V.; Anctil, F.; Loumagne, C. Which potential evapotranspiration input for a lumped rainfall–runoff model? Part 2—Towards a simple and efficient potential evapotranspiration model for rainfall–runoff modelling. J. Hydrol. 2005, 303, 290–306. [Google Scholar] [CrossRef]
  34. Lehner, B. Derivation of Watershed Boundaries for GRDC Gauging Stations Based on the HydroSHEDS Drainage Network; Technical Report 41; Global Runoff Data Centre (GRDC): Koblenz, Germany, 2012; Available online: https://www.bafg.de/GRDC/EN/02_srvcs/24_rprtsrs/report_41.html (accessed on 20 November 2020).
  35. Yamazaki, D.; Ikeshima, D.; Sosa, J.; Bates, P.D.; Allen, G.H.; Pavelsky, T.M. MERIT Hydro: A High-Resolution Global Hydrography Map Based on Latest Topography Dataset. Water Resour. Res. 2019, 55, 5053–5073. [Google Scholar] [CrossRef] [Green Version]
  36. Perrin, C.; Michel, C.; Andréassian, V. Improvement of a parsimonious model for streamflow simulation. J. Hydrol. 2003, 279, 275–289. [Google Scholar] [CrossRef]
  37. Bergström, S.; Forsman, A. Development of a conceptual deterministic rainfall-runoff model. Hydrol. Res. 1973, 4, 147–170. [Google Scholar] [CrossRef]
  38. Lindström, G. A simple automatic calibration routine for the HBV model. Hydrol. Res. 1997, 28, 153–168. [Google Scholar] [CrossRef] [Green Version]
  39. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models part I—A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  40. Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 2009, 377, 80–91. [Google Scholar] [CrossRef] [Green Version]
  41. Valéry, A.; Andréassian, V.; Perrin, C. ‘As simple as possible but not simpler’: What is useful in a temperature-based snow-accounting routine? Part 1—Comparison of six snow accounting routines on 380 catchments. J. Hydrol. 2014, 517, 1166–1175. [Google Scholar] [CrossRef]
  42. Valéry, A.; Andréassian, V.; Perrin, C. ‘As simple as possible but not simpler’: What is useful in a temperature-based snow-accounting routine? Part 2—Sensitivity analysis of the Cemaneige snow accounting routine on 380 catchments. J. Hydrol. 2014, 517, 1176–1187. [Google Scholar] [CrossRef]
  43. Arsenault, R.; Brissette, F.; Martel, J.L. The hazards of split-sample validation in hydrological model calibration. J. Hydrol. 2018, 566, 346–362. [Google Scholar] [CrossRef]
  44. Storn, R.; Price, K. Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  45. Beck, H.E.; van Dijk, A.I.J.M.; de Roo, A.; Miralles, D.G.; McVicar, T.R.; Schellekens, J.; Bruijnzeel, L.A. Global-scale regionalization of hydrologic model parameters. Water Resour. Res. 2016. [Google Scholar] [CrossRef] [Green Version]
  46. Ayzel, G. Runoff predictions in ungauged Arctic basins using conceptual models forced by reanalysis data. Water Resour. 2018, 45, 1–7. [Google Scholar] [CrossRef]
  47. Ayzel, G.; Izhitskiy, A. Coupling physically based and data-driven models for assessing freshwater inflow into the Small Aral Sea. Proc. Int. Assoc. Hydrol. Sci. 2018, 379, 151–158. [Google Scholar] [CrossRef] [Green Version]
  48. Ayzel, G.; Izhitskiy, A. Climate change impact assessment on freshwater inflow into the Small Aral Sea. Water 2019, 11, 2377. [Google Scholar] [CrossRef] [Green Version]
  49. Pushpalatha, R.; Perrin, C.; Le Moine, N.; Mathevet, T.; Andréassian, V. A downward structural sensitivity analysis of hydrological models to improve low-flow simulation. J. Hydrol. 2011, 411, 66–76. [Google Scholar] [CrossRef]
  50. Pappenberger, F.; Stephens, E.; Thielen, J.; Salamon, P.; Demeritt, D.; van Andel, S.J.; Wetterhall, F.; Alfieri, L. Visualizing probabilistic flood forecast information: Expert preferences and perceptions of best practice in uncertainty communication. Hydrol. Process. 2013, 27, 132–146. [Google Scholar] [CrossRef]
  51. Demargne, J.; Mullusky, M.; Werner, K.; Adams, T.; Lindsey, S.; Schwein, N.; Marosi, W.; Welles, E. Application of forecast verification science to operational river forecasting in the US National Weather Service. Bull. Am. Meteorol. Soc. 2009, 90, 779–784. [Google Scholar] [CrossRef] [Green Version]
  52. Knoben, W.J.M.; Freer, J.E.; Woods, R.A. Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores. Hydrol. Earth Syst. Sci. 2019, 23, 4323–4331. [Google Scholar] [CrossRef] [Green Version]
  53. Luo, Y.Q.; Randerson, J.T.; Abramowitz, G.; Bacour, C.; Blyth, E.; Carvalhais, N.; Ciais, P.; Dalmonech, D.; Fisher, J.B.; Fisher, R.; et al. A framework for benchmarking land models. Biogeosciences 2012, 9, 3857–3874. [Google Scholar] [CrossRef] [Green Version]
  54. Harrigan, S.; Prudhomme, C.; Parry, S.; Smith, K.; Tanguy, M. Benchmarking ensemble streamflow prediction skill in the UK. Hydrol. Earth Syst. Sci. 2018, 22, 2023–2039. [Google Scholar] [CrossRef] [Green Version]
  55. Mansanarez, V.; Renard, B.; Le Coz, J.; Lang, M.; Darienzo, M. Shift happens! Adjusting stage-discharge rating curves to morphological changes at known times. Water Resour. Res. 2019. [Google Scholar] [CrossRef]
  56. Moriasi, D.; Arnold, J.; Van Liew, M.; Binger, R.; Harmel, R.; Veith, T. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
  57. Essou, G.R.C.; Sabarly, F.; Lucas-Picher, P.; Brissette, F.; Poulin, A. Can Precipitation and Temperature from Meteorological Reanalyses Be Used for Hydrological Modeling? J. Hydrometeorol. 2016, 17, 1929–1950. [Google Scholar] [CrossRef]
  58. Raimonet, M.; Oudin, L.; Thieu, V.; Silvestre, M.; Vautard, R.; Rabouille, C.; Le Moigne, P. Evaluation of Gridded Meteorological Datasets for Hydrological Modeling. J. Hydrometeorol. 2017, 18, 3027–3041. [Google Scholar] [CrossRef]
  59. Kuncheva, L.I.; Whitaker, C.J. Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach. Learn. 2003, 51, 181–207. [Google Scholar] [CrossRef]
  60. Pinnington, E.; Quaife, T.; Black, E. Impact of remotely sensed soil moisture and precipitation on soil moisture prediction in a data assimilation system with the JULES land surface model. Hydrol. Earth Syst. Sci. 2018, 22, 2575–2588. [Google Scholar] [CrossRef] [Green Version]
  61. Poncelet, C.; Merz, R.; Merz, B.; Parajka, J.; Oudin, L.; Andréassian, V.; Perrin, C. Process-based interpretation of conceptual hydrological model performance using a multinational catchment set. Water Resour. Res. 2017, 53, 7247–7268. [Google Scholar] [CrossRef] [Green Version]
  62. Blöschl, G.; Bierkens, M.F.; Chambel, A.; Cudennec, C.; Destouni, G.; Fiori, A.; Kirchner, J.W.; McDonnell, J.J.; Savenije, H.H.; Sivapalan, M.; et al. Twenty-three Unsolved Problems in Hydrology (UPH)—A community perspective. Hydrol. Sci. J. 2019. [Google Scholar] [CrossRef] [Green Version]
  63. Schaefli, B.; Gupta, H.V. Do Nash values have value? Hydrol. Process. 2007, 21, 2075–2080. [Google Scholar] [CrossRef] [Green Version]
  64. Thober, S.; Kumar, R.; Wanders, N.; Marx, A.; Pan, M.; Rakovec, O.; Samaniego, L.; Sheffield, J.; Wood, E.F.; Zink, M. Multi-model ensemble projections of European river floods and high flows at 1.5, 2, and 3 degrees global warming. Environ. Res. Lett. 2018, 13, 014003. [Google Scholar] [CrossRef]
  65. Bertola, M.; Viglione, A.; Lun, D.; Hall, J.; Blöschl, G. Flood trends in Europe: Are changes in small and big floods different? Hydrol. Earth Syst. Sci. 2020, 24, 1805–1822. [Google Scholar] [CrossRef] [Green Version]
  66. Arnell, N.W.; Gosling, S.N. The impacts of climate change on river flood risk at the global scale. Clim. Chang. 2016, 134, 387–401. [Google Scholar] [CrossRef] [Green Version]
  67. Addor, N.; Do, H.X.; Alvarez-Garreton, C.; Coxon, G.; Fowler, K.; Mendoza, P.A. Large-sample hydrology: Recent progress, guidelines for new datasets and grand challenges. Hydrol. Sci. J. 2019. [Google Scholar] [CrossRef] [Green Version]
  68. Beck, H.E.; Vergopolan, N.; Pan, M.; Levizzani, V.; van Dijk, A.I.J.M.; Weedon, G.P.; Brocca, L.; Pappenberger, F.; Huffman, G.J.; Wood, E.F. Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling. Hydrol. Earth Syst. Sci. 2017, 21, 6201–6217. [Google Scholar] [CrossRef] [Green Version]
  69. Beck, H.E.; van Dijk, A.I.J.M.; de Roo, A.; Dutra, E.; Fink, G.; Orth, R.; Schellekens, J. Global evaluation of runoff from ten state-of-the-art hydrological models. Hydrol. Earth Syst. Sci. Discuss. 2016, 1–33. [Google Scholar] [CrossRef]
  70. Kneis, D.; Abon, C.; Bronstert, A.; Heistermann, M. Verification of short-term runoff forecasts for a small Philippine basin (Marikina). Hydrol. Sci. J. 2017, 62, 205–216. [Google Scholar] [CrossRef]
  71. Kauffeldt, A.; Wetterhall, F.; Pappenberger, F.; Salamon, P.; Thielen, J. Technical review of large-scale hydrological models for implementation in operational flood forecasting schemes on continental level. Environ. Model. Softw. 2016, 75, 68–76. [Google Scholar] [CrossRef]
  72. Hrachowitz, M.; Clark, M.P. HESS Opinions: The complementary merits of competing modelling philosophies in hydrology. Hydrol. Earth Syst. Sci. 2017, 21, 3953–3973. [Google Scholar] [CrossRef] [Green Version]
  73. Li, M.; Wang, Q.J.; Robertson, D.E.; Bennett, J.C. Improved error modelling for streamflow forecasting at hourly time steps by splitting hydrographs into rising and falling limbs. J. Hydrol. 2017, 555, 586–599. [Google Scholar] [CrossRef]
  74. Li, M.; Robertson, D.E.; Wang, Q.J.; Bennett, J.C.; Perraud, J.M. Reliable hourly streamflow forecasting with emphasis on ephemeral rivers. J. Hydrol. 2020, 125739. [Google Scholar] [CrossRef]
  75. Arnal, L.; Anspoks, L.; Manson, S.; Neumann, J.; Norton, T.; Stephens, E.; Wolfenden, L.; Cloke, H.L. “Are we talking just a bit of water out of bank? Or is it Armageddon?” Front line perspectives on transitioning to probabilistic fluvial flood forecasts in England. Geosci. Commun. 2020, 3, 203–232. [Google Scholar] [CrossRef]
Figure 1. Illustration of the OpenForecast computational workflow.
Figure 1. Illustration of the OpenForecast computational workflow.
Hydrology 08 00003 g001
Figure 2. Illustration of data flows during the process of runoff forecast generation.
Figure 2. Illustration of data flows during the process of runoff forecast generation.
Hydrology 08 00003 g002
Figure 3. Example of the issued forecast for the Protva River at Spas-Zagor’e. Transcription from Russian as follows. x-axis: date; y-axis: discharge, m 3 /s. The legend shows the following: bold light blue line: hindcast mean; light blue area: hindcast spread; bold blue line: forecast mean; blue area: forecast spread.
Figure 3. Example of the issued forecast for the Protva River at Spas-Zagor’e. Transcription from Russian as follows. x-axis: date; y-axis: discharge, m 3 /s. The legend shows the following: bold light blue line: hindcast mean; light blue area: hindcast spread; bold blue line: forecast mean; blue area: forecast spread.
Hydrology 08 00003 g003
Figure 4. Example of rating curve calculation (Tom River at Novokuznetsk).
Figure 4. Example of rating curve calculation (Tom River at Novokuznetsk).
Hydrology 08 00003 g004
Figure 5. Cumulative density functions (CDF) for the NSE (left plot) and KGE (right plot) for all model configurations.
Figure 5. Cumulative density functions (CDF) for the NSE (left plot) and KGE (right plot) for all model configurations.
Hydrology 08 00003 g005
Figure 6. The spatial location of OpenForecast v2 gauges (n = 843) and those from the ESIMO database that were selected for the verification procedure (n = 244).
Figure 6. The spatial location of OpenForecast v2 gauges (n = 843) and those from the ESIMO database that were selected for the verification procedure (n = 244).
Hydrology 08 00003 g006
Figure 7. Verification of the ensemble mean forecast (ENS) in terms of NSE (left plot) and KGE (right plot) for the entire verification period and the entire set of reference gauges. The boxplot box represents the interquartile range (IQR, the difference between the 25th and 75th quantiles); the whiskers represent ±1.5×IQR from the 25th and 75th quantiles, respectively; the yellow line denotes the median value.
Figure 7. Verification of the ensemble mean forecast (ENS) in terms of NSE (left plot) and KGE (right plot) for the entire verification period and the entire set of reference gauges. The boxplot box represents the interquartile range (IQR, the difference between the 25th and 75th quantiles); the whiskers represent ±1.5×IQR from the 25th and 75th quantiles, respectively; the yellow line denotes the median value.
Hydrology 08 00003 g007
Figure 8. Verification of the ensemble mean forecast (ENS) in terms of NSE (top row) and KGE (bottom row) for the three groups of river basins: small (left column), medium (middle column), and large (right column). The boxplot box represents the Interquartile Range (IQR, the difference between the 25th and 75th quantiles); the whiskers represent ±1.5×IQR from the 25th and 75th quantiles, respectively; the yellow line denotes the median value.
Figure 8. Verification of the ensemble mean forecast (ENS) in terms of NSE (top row) and KGE (bottom row) for the three groups of river basins: small (left column), medium (middle column), and large (right column). The boxplot box represents the Interquartile Range (IQR, the difference between the 25th and 75th quantiles); the whiskers represent ±1.5×IQR from the 25th and 75th quantiles, respectively; the yellow line denotes the median value.
Hydrology 08 00003 g008
Figure 9. The skill of OpenForecast v2 with NSE (top row), KGE (middle row), and bias (bottom row) metrics for forecasts against the climatology benchmark for lead times of one (left column), three (middle column), and seven (right column) days.
Figure 9. The skill of OpenForecast v2 with NSE (top row), KGE (middle row), and bias (bottom row) metrics for forecasts against the climatology benchmark for lead times of one (left column), three (middle column), and seven (right column) days.
Hydrology 08 00003 g009
Figure 10. The skill of OpenForecast v2 with NSE (top row), KGE (middle row), and bias (bottom row) metrics for forecasts compared to the persistence benchmark for lead times of one (left column), three (middle column), and seven (right column) days.
Figure 10. The skill of OpenForecast v2 with NSE (top row), KGE (middle row), and bias (bottom row) metrics for forecasts compared to the persistence benchmark for lead times of one (left column), three (middle column), and seven (right column) days.
Hydrology 08 00003 g010
Figure 11. Daily attendance of the OpenForecast main webpage (https://openforecast.github.io/).
Figure 11. Daily attendance of the OpenForecast main webpage (https://openforecast.github.io/).
Hydrology 08 00003 g011
Table 1. Description and calibration ranges for GR4J model parameters (based on Ayzel et al. [21]).
Table 1. Description and calibration ranges for GR4J model parameters (based on Ayzel et al. [21]).
ParametersDescriptionCalibration Range
X1Production store capacity (mm)0–3000
X2Intercatchment exchange coefficient (mm/day)−10–10
X3Routing store capacity (mm)0–1000
X4Time constant of unit hydrograph (day)0–20
X5Dimensionless weighting coefficient of the snowpack thermal state0–1
X6Day-degree rate of melting (mm/(day* ° C))0–10
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ayzel, G. OpenForecast v2: Development and Benchmarking of the First National-Scale Operational Runoff Forecasting System in Russia. Hydrology 2021, 8, 3. https://doi.org/10.3390/hydrology8010003

AMA Style

Ayzel G. OpenForecast v2: Development and Benchmarking of the First National-Scale Operational Runoff Forecasting System in Russia. Hydrology. 2021; 8(1):3. https://doi.org/10.3390/hydrology8010003

Chicago/Turabian Style

Ayzel, Georgy. 2021. "OpenForecast v2: Development and Benchmarking of the First National-Scale Operational Runoff Forecasting System in Russia" Hydrology 8, no. 1: 3. https://doi.org/10.3390/hydrology8010003

APA Style

Ayzel, G. (2021). OpenForecast v2: Development and Benchmarking of the First National-Scale Operational Runoff Forecasting System in Russia. Hydrology, 8(1), 3. https://doi.org/10.3390/hydrology8010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop