Next Article in Journal
Assessment of Inter-Sectoral Virtual Water Reallocation and Linkages in the Northern Tianshan Mountains, China
Previous Article in Journal
Dynamic Evaluation and Internal Driving Factors of Water Resources Green Efficiency in China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Uncertainty in the Number of Calibration Repetitions of a Hydrologic Model in Varying Climatic Conditions

1
Institute of Hydrology, Slovak Academy of Sciences, Dúbravská cesta 9, 84104 Bratislava, Slovakia
2
Institute of Hydraulic Engineering and Water Resources Management, Technische Universität Wien, Karlsplatz 13/223, 1040 Vienna, Austria
*
Author to whom correspondence should be addressed.
Water 2020, 12(9), 2362; https://doi.org/10.3390/w12092362
Submission received: 9 July 2020 / Revised: 15 August 2020 / Accepted: 19 August 2020 / Published: 23 August 2020
(This article belongs to the Section Hydrology)

Abstract

:
The objective of this study is to examine the impact of the number of calibration repetitions on hydrologic model performance and parameter uncertainty in varying climatic conditions. The study is performed in a pristine alpine catchment in the Western Tatra Mountains (the Jalovecký Creek catchment, Slovakia) using daily data from the period 1989–2018. The entire data set has been divided into five 6-years long periods; the division was based on the wavelet analysis of precipitation, air temperature and runoff data. A lumped conceptual hydrologic model TUW (“Technische Universität Wien”) was calibrated by an automatic optimisation using the differential evolution algorithm approach. To test the effect of the number of calibrations in the optimisation procedure, we have conducted 10, 50, 100, 300, 500 repetitions of calibrations in each period and validated them against selected runoff and snow-related model efficiency criteria. The results showed that while the medians of different groups of calibration repetitions were similar, the ranges (max–min) of model efficiency criteria and parameter values differed. An increasing number of calibration repetitions tend to increase the ranges of model efficiency criteria during model validation, particularly for the runoff volume error and snow error, which were not directly used in model calibration. Comparison of model efficiencies in climate conditions that varied among the five periods documented changes in model performance in different periods but the difference between 10 and 500 calibration repetitions did not change much between the selected time periods. The results suggest that ten repetitions of model calibrations provided the same median of model efficiency criteria as a greater number of calibration repetitions and model parameter variability and uncertainty were smaller.

1. Introduction

Conceptual rainfall-runoff models are used for a wide range of purposes including reservoir operations, flood and drought prediction, risk analysis, climate change impact studies, etc. [1,2]. These models usually contain parameters that need to be estimated through calibration. However, many parameters sets with different values of the same parameters can provide similar results in terms of model efficiency, which is termed as “the equifinality principle” [3,4]. The problem of equifinality and its reduction in hydrological modelling has been discussed in many studies (e.g., [3,4,5,6,7]). Previous investigations examined the potential of the Monte Carlo approach, multiple-objective calibration or the influence of the length of data series or climatic variability during calibration/validation periods on the improvement of parameter representativeness and hence reduction of uncertainty in parameters of the models. Several authors (e.g., [8,9,10,11]) used the Monte Carlo (MC) calibration approach to examine the links between model consistency and parameter uncertainty. For example, Finger et al. [9] and [10] used the MC approach (10,000 runs) to test the performance of a conceptual hydrologic model in terms of three observational data sets (discharge, snow cover, glacier mass balances) and found that an ensemble of 100 parameters sets adequately represented parameter variability, hydrological regime and seasonal dynamics of discharge. Another approach for selection of representative model parameters was recently evaluated by Sikorska-Senoner et al. [12] who tested three methods (ranking, quantiling, clustering) and found that the reduced parameter ensembles (i.e., three representative parameter sets) obtained by these methods were reliable for the simulation of extreme floods.
The multi-criteria approach, i.e., consideration of additional hydrological processes in model performance evaluation, has been successful in the reduction of parameter uncertainty, particularly for parameters connected to the additional process. For example, Finger et al. [9] and Parajka and Blöschl [13] showed that the combination of discharge data with snow cover related observations was useful in constraining snow model parameters. Similar results were obtained with satellite data of snow cover [11,14,15] or soil moisture [16].
Evaluations of the effect of length of calibration period on model parameter uncertainty indicated that longer calibration periods reduced parameter uncertainty [17]. Generally, longer calibration periods are advised to capture the variability of climatic and flow conditions [18]. Anctil et al. [19] and Brath et al. [20] recommended using two to ten years as optimum for model calibration. Merz et al. [21] stated that since the calibration period of five years captured most of the temporal hydrological variability, it should be the minimum for achieving reasonable predictive model performance.
A number of studies analysed the effects of climatic variability on the variability of model parameters (e.g., [22,23,24,25,26,27]). Several authors observed a decreasing trend in model performance when model parameters were transferred to periods with a different climate, e.g., from wetter to drier periods and vice versa [22,23,28,29,30,31,32,33,34,35]. For example, in Australia, Vaze et al. [28] and Coron et al. [23] found that the transfer of model parameters to a drier climate was problematic. They concluded that the model parameters transferability was more influenced by a change in precipitation than by changes in evaporation or air temperature. In Austria, Merz et al. [22] found that parameters representing snow cover evolution and soil moisture variability showed significant correlations with air temperature. They documented that a model calibrated in a colder/drier decade had a tendency to overestimate runoff in a warmer/wetter decade. Similar results were obtained by [34] who showed that the use of parameters calibrated in a colder decade for a warmer/wetter decade tends to overestimate catchment runoff, particularly in flatland and hilly catchments.
The uncertainties associated with hydrological model calibration (i.e., parameter estimation, choice of the length of calibration and validation periods) remain a challenge for the modellers. Representativeness of model parameters and reproducibility and repeatability of results are fundamental assumptions in any calibration experiment and uncertainty assessment. The study of Ceola et al. [36] showed that even if the same experimental protocol is used for calibration of hydrological models, there can be some variability in model performance and parameter uncertainty. The results of Ceola et al. [36] indicate that repetition of the calibration procedure can help to detect insensitive model parameters and reduce equifinality, but it is still not clear how to estimate the optimal number of repetitions. This is particularly interesting at present when the variability of climate can have a significant impact on the temporal stability of model parameters [22].
The main aim of this study is hence to examine how does a different number of calibration repetitions impact hydrologic model uncertainty in varying climatic conditions. We investigate the following research questions: (a) Does the number of calibrations influence model performance in varying climatic conditions? (b) Is the optimal number of calibrations related to the varying climate conditions? (c) Does the increasing number of calibrations decrease parameter uncertainty? These questions are investigated in a small pristine alpine catchment in the Western Tatra Mountains where runoff generation is affected only by natural processes. Model performance is evaluated by criteria related to catchment runoff snow cover.

2. Study Catchment and Data

2.1. Study Catchment

The Jalovecký Creek catchment in the Western Tatra Mountains (northern Slovakia) is selected as a pilot catchment for this research (Figure 1). It is a small experimental catchment where the Institute of Hydrology SAS has carried out hydrological research since 1986. It is located at altitudes between 800 and 2178 m a.s.l. (meters above sea level) (mean 1500 m a.s.l.) and has a total area of 22.2 km2. The average catchment slope is 30°, and most slopes have a south-eastern orientation. Soils are represented by Cambisol, Podsol, Ranker and Lithosol. All soils have high stoniness (typically 40–50% and more, [37,38]). Forests (mainly spruce) cover 44% of the catchment area. Dwarf pine covers 31% and Alpine meadows and bare rocks cover the rest 25% of the catchment area.

2.2. Data

The study region is in a transitional zone between oceanic western and continental climate and belongs to the tundra (ET) climate class of Köppen classification. Climate observations, i.e., daily precipitation, daily air temperature, daily discharge and snow course measurements are available from the period between 1 November 1988 and 31 October 2018. Point observations of precipitation and air temperature are measured at 7 and 3 sites, respectively. Snow water equivalent is measured at site Červenec, which is located at catchment mean elevation (Figure 1).

2.3. Selection of Periods With Varying Climate

In this study, the entire dataset is divided into calibration/validation periods identified by the wavelet transform method. Based on the analysis of Sleziak et al. [39], we selected five 6-years long periods (i.e., P1 = 1989–1994, P2 = 1995–2000, P3 = 2001–2006, P4 = 2007–2012 and P5 = 2013–2018). The climatic and hydrologic characteristics of the selected periods are presented in Figure 2 and Table 1. The characteristics show that warmer periods (1989–1994 and 2013–2018) are generally drier. The largest difference in air temperature is observed between January and March. In the wettest period P2 (1995–2000), winters are characterized by larger snowpacks and mean an annual maximum of snow water equivalent is about 136 mm larger than in snow poor warmer periods. The largest differences in the seasonal distribution of precipitation and runoff are observed between April and September where the warmer periods tend to have less precipitation and runoff.

3. Methods

3.1. Hydrologic Model

The hydrological model used in this study is the TUW model [40]. It is a conceptual Hydrologiska Byråns Vattenbalansavdelning (HBV) type model that uses daily precipitation totals, mean daily air temperatures, and potential evapotranspiration as the inputs. Mean daily flows observed at the outlet of the catchment and point measurements of snow water equivalent (Figure 1) are used for the evaluation of model simulations.
The structure of the model involves three modules that simulate changes in snow, soil water and groundwater storage. In total, the model has 15 parameters (Table 2). The snow module represents snow accumulation and melting in a catchment. Snowmelt is simulated by the degree-day approach, using a degree-day factor DDF (mm/°C/d) and a melt air temperature parameter Tm (°C). The catch deficit of precipitation gauges during snowfall is corrected by a snow correction factor SCF (–). A threshold temperature interval Tr − Ts (°C) is used to discriminate between rainfall, snowfall and mix of rain and snow [41]. The soil module simulates the processes taking place in the soil profile. It contains three parameters: the field capacity FC (mm), the limit for potential evapotranspiration Lprat (-), and the parameter relating runoff generation to the soil moisture state, termed the nonlinearity parameter BETA (–). If BETA is large, direct runoff is small, and vice versa. Finally, the runoff module consists of two reservoirs that represent hillslope routing. Rainfall enters the upper reservoir and leaves it through three paths: (1) outflow from the reservoir based on a very fast storage coefficient k0 (days) if a threshold of the storage state LSuz (mm) is exceeded in the upper reservoir; (2) outflow from the upper reservoir with a fast storage coefficient k1 (days), and; (3) percolation to the lower reservoir with a constant percolation rate Cperc (mm/day). Water leaves the lower zone based on a slow storage coefficient k2 (days). Channel routing is simulated by a triangular weighting function, where Bmax (days) is the maximum base at low flows, and Croute (day2/mm) is a free scaling parameter. A detailed description of the model structure with particular model equations is given in Parajka et al. [41].
The model is automatically calibrated using a differential evolution algorithm Deoptim [42]. The objective function (OF) used in calibration is selected on the basis of prior analyses performed in different calibration studies [34,39]. It consists of a combination of the Nash-Sutcliffe coefficient (NSE, [43]) and the logarithmic Nash-Sutcliffe coefficient (logNSE, [22]):
O F = 1 N S E 2 + 1 log N S E 2
where NSE and logNSE criteria are mathematically expressed as follows:
N S E = 1 i = 1 n ( Q s i m , i Q o b s , i ) 2 i = 1 n ( Q o b s , i Q ¯ o b s ) 2
log N S E = 1 i = 1 n ( log ( Q s i m , i ) log ( Q o b s , i ) ) 2 i = 1 n ( log ( Q o b s , i ) log ( Q ¯ o b s ) ) 2
Q s i m , i and Q o b s , i indicate the simulated and observed mean daily flows on day i, and Q ¯ o b s is the average of the flows observed. The NSE and logNSE coefficients range between −∞ (poor fit) and 1 (perfect fit of the observed and simulated values).
Model calibrations are additionally evaluated by the runoff volume error (VE) and the root mean square error (RMSE) between measured and simulated snow water equivalents. VE is the measure of bias between the simulated and observed runoff [22]. The VE value equal to 0 indicates no bias, VE smaller than 0 represents an underestimation of the total runoff volume, and VE greater than 0 denotes overestimation of the total runoff volume. The equation is defined as:
V E = i = 1 n Q s i m , i i = 1 n Q o b s , i i = 1 n Q o b s , i
The root mean square error (RMSE) between simulated and observed daily SWE values is calculated as:
R M S E = i = 1 n ( S W E s i m , i S W E o b s , i ) 2 n
S W E s i m , i and S W E o b s , i are simulated and observed SWE values on day i, respectively, n is the number of observations. Smaller RMSE values indicate better agreement between the observed and simulated SWE.
Model performance is evaluated by the differential split-sample test [44], where the model calibrated in one of the six years long periods is validated in the remaining (four) periods.

3.2. Analysis of Uncertainty Resulting from a Different Number of Calibration Repetitions

The model is calibrated automatically 500 times in each time period. The effect of different numbers of calibration repetitions on model performance and parameter uncertainty is examined for five groups of calibration repetitions, i.e., first 10, 50, 100, 300, and finally, all 500 repetitions.
Our experiment looks at the repetitions of the calibrations of a hydrologic model, where the calibration strategy is identical (no difference in the number of calibration runs). The only difference is that the optimisation approach is based on a random generation of the initial population of model parameters, which can have some impact on the final result (i.e., calibrated parameter set). We have thus run the entire calibration strategy 500 times and investigated the impact on different model efficiencies (objective function, its parts and efficiencies not used in objective function).

4. Results

4.1. Uncertainty of Hydrologic Model Performance in Varying Climatic Conditions

Figure 3 shows the variability in the medians and ranges of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE), volume error (VE) and the root mean square error (RMSE) for a different number of calibration repetitions (i.e., 10, 50, 100, 300 and 500 repetitions) in five specific calibration periods. The results show that the median difference between the groups of calibration repetitions was very small even for snow RMSE, which was not used in model calibration. The variability in the range of model efficiency was, however, larger. The differences among the groups were larger for VE and RMSE. It indicates that an increasing number of calibration repetitions increased also the variability of model simulations. Hence, the spread of the efficiency criteria increased as well, particularly of those criteria that were not included in model calibration (runoff VE and snow RMSE). Interestingly, the absolute value of the VE and RMSE ranges differed, but the difference between 10 and 500 calibration repetitions did not vary among the climate periods. While larger VE range was observed in drier periods, the largest snow RMSE range for most of the calibration groups was found in the recent period (2013–2018) that had many snow poor winters with shallower snowpacks and on average smaller SWE maximum (Table 1). Generally, the lowest model performance was obtained in the period 1989–1994, which was the warmest and comparatively dry. The largest efficiency in terms of the objective function, NSE and snow RMSE was obtained in the most recent period 2013–2018, which was second warmest and driest with second smallest average SWEmax. The volume error of model simulations in that period, however, indicated more than 10% underestimation of runoff volumes.
Figure 4 and Figure 5 show the variability in the median (Figure 4) and ranges (Figure 5) of model efficiencies in four specific validation periods, i.e., parameters obtained in each calibration period were validated in the remaining four validation periods. Similarly, as for the calibration model efficiency, the medians in most cases did not change with the number of calibration repetitions. The exception was obtained for the validation efficiency in the coldest period when the model was calibrated in the warmest period (the first column of panels in Figure 4). Medians of runoff objective function and logNSE efficiency for 10 calibration repetitions were greater than the median for groups with a higher number of calibration repetitions. The same pattern was reflected also in the range of the model efficiencies (Figure 5). The variability in model efficiency was the lowest for 10 calibration repetitions and it was increasing with the number of calibration repetitions. The variability was larger for the efficiency criteria that were not used in model calibration (VE and snow RMSE). Larger ranges in the efficiency criteria during model validation were found when the model was validated in colder time periods (i.e., 1995–2000 and 2001–2006). For snow simulations, the calibration in the snow poorer period (2013–2018) lead to greater variability of RMSE if the model was validated in periods with larger snowpacks (1995–2000).

4.2. Uncertainty in Hydrologic Model Parameters in Varying Climatic Conditions

The differences in the median of model parameters and their variability are plotted in Figure 6 and Figure 7. The panels show parameters of the main model modules, i.e., the snow, soil moisture, runoff generation and runoff routine modules. The results indicate that the median of model parameters varied among the time periods but generally did not change much among the calibration groups. Increasing the number of calibration repetitions did not result in changes in the median values for most model parameters. The exceptions include routing module parameters that were generally less sensitive to selected model efficiencies; the snow degree-day parameter calibrated in the coldest time period and the limit for potential evapotranspiration Lprat and very fast storage parameter k0 calibrated in the warmest time period (P1). Medians of these parameters for the calibration group of 10 repetitions were noticeably different than the median obtained from greater numbers of calibration repetitions.
Significantly larger differences among calibration groups were found for the ranges of model parameters (Figure 7) that were increasing with the number of calibration repetitions. Such a pattern was consistent over all calibration periods. There were also large differences between ranges in model parameters calibrated in different time periods. While the difference in ranges of snow module parameters was the smallest for the snow correction factor (SCF), the less sensitive threshold temperatures (Tr, Ts, Tm) have the largest variability among the calibration groups. The largest differences among the calibration groups for degree-day factor (DDF) were found in the warmest period P1 (1989–1994). Soil moisture and runoff generation parameters had the largest differences in the wettest period P2 (1995–2000), where the ranges obtained for 10 calibration repetitions were significantly smaller than those obtained for a greater number of calibration repetitions.

4.3. Simulation of Runoff and Snow Water Equivalent in Varying Climatic Conditions

In order to demonstrate the implications of increasing variability in model parameters caused by the increasing number of calibration repetitions, Figure 8 compares ranges of model simulations obtained by 10 and 500 simulations with observed runoff values. Runoff is simulated in a colder/wetter validation period (2001–2006) with model parameters obtained in a warmer/drier calibration period (1989–1994). Better simulations are indicated by smaller variability between min. and max (i.e., the lines are closer to each other).
For a detailed illustration of the results, one hydrological year (1, November 2000 to 1, October 2001) is plotted (Figure 8). Figure 8 shows that the model reproduced runoff variability. Slightly more realistic simulations of runoff volume, mainly during summer months, were obtained using a smaller number of simulations (i.e., with 10 repetitions of model simulations). Greater differences between simulated and measured runoff were related to the main snowmelt period (April, May).
Figure 9 presents measured and simulated snow water equivalent (SWE) after 10 and 500 simulations. While measured data are represented by point (snow course) measurements of SWE at site Červenec (catchment mean elevation 1500 m a.s.l.), the simulations represent catchment SWE. The simulations were based on model parameters obtained in the snow poorer calibration period (1989–1994). We plotted one hydrological year (1, November 2000 to 1, October 2001) in the snow richer validation period (2001–2006). The range of simulated SWE based on 10 simulations was slightly smaller than that based on 500 simulations. In some cases (e.g., 15 February 2001 to 9 March 2001) the simulated catchment mean SWE was smaller than SWE measured at catchment mean altitude. Such a result could be realistic if the SWE significantly increases with the altitude.

5. Discussion

This study evaluates the impact of a different number of calibration repetitions on model performance and parameter uncertainty in varying climate periods observed in a small pristine mountain catchment. Previous studies showed that an increasing number of calibration runs can be the way to the more robust calibration of conceptual hydrologic models. For example, Finger et al. [9] used the Monte Carlo (MC) approach and found that 100 ensemble parameter sets (out of 10,000 MC runs) are sufficient to obtain an adequate model performance and parameter variability. A similar conclusion is presented in Finger et al. [10] and Konz and Seibert [8] for alpine catchments in Austria and Switzerland. Another example is the results of one experiment of Ceola et al. [36] who compared model performance and uncertainty for 10 and 100 calibration runs. They found some differences in the calibrated model parameters which have been interpreted as a result of parameter insensitivity, equifinality and the problem of model structure to capture the complex runoff generation processes in some catchments. Our results indicate that 10 calibration repetitions resulted in the same model performance, but a smaller range of model parameters and model efficiency and hence model uncertainty compared to a larger number of calibration repetitions. The variability in model performance tends to increase with the increasing number of calibration repetitions, particularly for validation periods and efficiency measures which were not used in model calibration, i.e., in our case runoff volume error and snow modelling error.
Our study extends previous assessments in terms of investigating whether varying climate conditions have an additional impact on the performance of a hydrological model and uncertainty of results evaluated on the basis of a different number of calibration repetitions. Previous studies clearly reported that the performance of conceptual hydrologic models tends to degrade when climatic conditions of model simulations are different from those used for model calibration. For example, Merz et al. [22] showed that the model calibrated in a colder decade had a tendency to overestimate the runoff in a warmer decade and vice versa. Similarly, Coron et al. [23] showed that model calibration in wetter conditions leads to runoff overestimation in drier periods. These results were attributed to the impact of increasing air temperature on the main runoff generation mechanisms, particularly to increasing nonlinearity in runoff response in warmer and drier climate conditions. We found that although the absolute value of model efficiency and parameter ranges varies between the different climate periods, the difference in model performance and parameter variability between 10 and 500 calibration repetitions is generally consistent among all tested periods. The exceptions are larger differences in soil and runoff generation model parameters which are calibrated in the wettest time period. Here the increasing number of calibration repetitions increased the variability in model parameter ranges. For snow model parameters, we found similar results as presented in Merz et al. [22] and Sleziak et al. [34]. Model parameters calibrated in warmer periods resulted in greater differences among the different calibration groups, which could be attributed to the impact of increasing air temperatures on the variability of snowmelt.

6. Conclusions

We explored some uncertainties associated with the calibration of a lumped hydrological model in a small mountain research catchment. The results showed that a relatively small number of 10 calibration repetitions in model parameters optimisation can provide robust model simulations with smaller parameter uncertainty than obtained by a larger number of calibration repetitions. Model performance based on ten calibration repetitions was particularly better when model parameters were optimised in colder/wetter climate conditions.
To use these findings in practice (e.g., when dealing with various water management tasks), it should be noted that the selection of the optimal set of parameters also depends on the type of the task to be addressed (e.g., some sets of parameters may better describe periods of low flow while other can better represent high flows). The results are also dependent on the model used.
More analysis needs to be done in the future to verify these results. In our next effort, we plan to investigate modelling uncertainties in the context of climate change by using distributed hydrological models.

Author Contributions

Conceptualization, P.S., J.P. and L.H.; methodology, P.S., L.H. and J.P.; software, P.S.; validation, P.S.; formal analysis, P.S., J.P., L.H. and M.D.; investigation, P.S., L.H., J.P. and M.D.; writing—original draft preparation, P.S., J.P. and L.H.; writing—review and editing, P.S., J.P. and L.H.; visualization, P.S. All authors have read and agreed to the published version of the manuscript.

Funding

The study was supported by grants from the Slovak Academy of Sciences VEGA (project No. 2/0065/19) and the Slovak Research and Development Agency (APVV) (project No. 15-0497). Data collection was supported by project ITMS 26210120009 Infrastructure completion of hydrological research stations, of the Research & Development Operational Programme funded by the ERDF.

Acknowledgments

We would like to acknowledge the support from the Stefan Schwarz grant of the Slovak Academy of Sciences given to the first author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nester, T.; Komma, J.; Blöschl, G. Real time forecasting in the Upper Danube basin. J. Hydrol. Hydromech. 2016, 64, 404–414. [Google Scholar] [CrossRef] [Green Version]
  2. Farkas, C.; Kværnø, S.H.; Engebretsen, A.; Barneveld, R.; Deelstra, J. Applying profile and catchment-based mathematical models for evaluating the run-off from a Nordic catchment. J. Hydrol. Hydromech. 2016, 64, 218–225. [Google Scholar] [CrossRef] [Green Version]
  3. Beven, K.; Binley, A. The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 1992, 6, 279–298. [Google Scholar] [CrossRef]
  4. Beven, K.J.; Freer, J. Equifinality, data assimilation, and uncertainty estimation in mechanistic modeling of complex environmental systems using the GLUE methodology. J. Hydrol. 2001, 105, 157–172. [Google Scholar] [CrossRef]
  5. Beven, K. The limits of splitting: Hydrology. Sci. Total Environ. 1996, 183, 89–97. [Google Scholar] [CrossRef]
  6. Beven, K. A manifesto for the equifinality thesis. J. Hydrol. 2006, 320, 18–36. [Google Scholar] [CrossRef] [Green Version]
  7. Shen, Z.Y.; Chen, L.; Chen, T. Analysis of parameter uncertainty in hydrological and sediment modeling using GLUE method: A case study of SWAT model applied to Three Gorges Reservoir Region, China. Hydrol. Earth Syst. Sci. 2012, 16, 121–132. [Google Scholar] [CrossRef] [Green Version]
  8. Konz, M.; Seibert, J. On the value of glacier mass balances for hydrological model calibration. J. Hydrol. 2010, 385, 238–246. [Google Scholar] [CrossRef] [Green Version]
  9. Finger, D.; Pellicciotti, F.; Konz, M.; Rimkus, S.; Burlando, P. The value of glacier mass balance, satellite snow cover images, and hourly discharge for improving the performance of a physically based distributed hydrological model. Water Resour. Res. 2011, 47, W07519. [Google Scholar] [CrossRef]
  10. Finger, D.; Vis, M.; Huss, M.; Seibert, J. The value of multiple data set calibration versus model complexity for improving the performance of hydrological models in mountain catchments. Water Resour. Res. 2015, 51. [Google Scholar] [CrossRef]
  11. de Niet, J.; Finger, D.C.; Bring, A.; Egilson, D.; Gustafsson, D.; Kalantari, Z. Benefits of combining satellite-derived snow cover data and discharge data to calibrate a glaciated catchment in sub-arctic Iceland. Water 2020, 12, 975. [Google Scholar] [CrossRef] [Green Version]
  12. Sikorska-Senoner, A.E.; Schaefli, B.; Seibert, J. Downsizing parameter ensembles for simulations of extreme floods. Nat. Hazards Earth Syst. Sci. Discuss. 2020. [Google Scholar] [CrossRef] [Green Version]
  13. Parajka, J.; Blöschl, G. The value of MODIS snow cover data in validating and calibrating conceptual hydrologic models. J. Hydrol. 2008, 358, 240–258. [Google Scholar] [CrossRef]
  14. Finger, D.; Heinrich, G.; Gobiet, A.; Bauder, A. Projections of future water resources and their uncertainty in a glacierized catchment in the Swiss Alps and the subsequent effects on hydropower production during the 21st century. Water Resour. Res. 2012, 48, W02521. [Google Scholar] [CrossRef] [Green Version]
  15. Duethmann, D.; Peters, J.; Blume, T.; Vorogushyn, S.; Gunter, A. The value of satellite-derived snow cover images for calibrating a hydrological model in snow–dominated catchments in Central Asia. Water Resour. Res. 2014, 50, 2002–2021. [Google Scholar] [CrossRef] [Green Version]
  16. Parajka, J.; Naeimi, V.; Blöschl, G.; Wagner, W.; Merz, R.; Scipal, K. Assimilating scatterometer soil moisture data into conceptual hydrologic models at the regional scale. Hydrol. Earth Syst. Sci. 2006, 10, 353–368. [Google Scholar] [CrossRef] [Green Version]
  17. Razavi, S.; Tolson, B.A. An efficient framework for hydrologic model calibration on long data periods. Water Resour. Res. 2013, 49, 8418–8431. [Google Scholar] [CrossRef]
  18. Sorooshian, S.; Gupta, V.K.; Fulton, J.L. Evaluation of maximum likelihood parameter estimation techniques for conceptual rainfall–runoff models: Influence of calibration data variability and length on model credibility. Water Resour. Res. 1983, 19, 251–259. [Google Scholar] [CrossRef]
  19. Anctil, F.; Perrin, C.; Andréassian, V. Impact of the length of observed records on the performance of ANN and of conceptual parsimonious rainfall–runoff forecastting models. Environ. Model. Softw. 2004, 19, 357–368. [Google Scholar] [CrossRef]
  20. Brath, A.; Montanari, A.; Toth, E. Analysis of the effects of different scenarios of historical data availability on the calibration of a spatially-distributed hydrological model. J. Hydrol. 2004, 291, 232–253. [Google Scholar] [CrossRef]
  21. Merz, R.; Blöschl, G.; Parajka, J. Scale effects in conceptual hydrological modelling. Water Resour. Res. 2009, 45, W09405. [Google Scholar] [CrossRef]
  22. Merz, R.; Parajka, J.; Bloschl, G. Time stability of catchment model parameters: Implications for climate impact analyses. Water Resour. Res. 2011, 47. [Google Scholar] [CrossRef] [Green Version]
  23. Coron, L.; Andréassian, V.; Perrin, C.; Lerat, J.; Vaze, J.; Bourqui, M.; Hendrickx, F. Crash testing hydrological models in contrasted climate conditions: An experiment on 216 Australian catchments. Water Resour. Res. 2012, 48. [Google Scholar] [CrossRef] [Green Version]
  24. Brigode, P.; Oudin, L.; Perrin, C. Hydrological model parameter instability: A source of additional uncertainty in estimating the hydrological impacts of climate change? J. Hydrol. 2013, 476, 410–425. [Google Scholar] [CrossRef] [Green Version]
  25. Broderick, C.; Matthews, T.; Wilby, R.L.; Bastola, S.; Murphy, C. Transferability of hydrological models and ensemble averaging methods between contrasting climatic periods. Water Resour. Res. 2016, 52, 8343–8373. [Google Scholar] [CrossRef] [Green Version]
  26. Deng, C.; Liu, P.; Wang, D.; Wang, W. Temporal variation and scaling of parameters for a monthly hydrologic model. J. Hydrol. 2018, 558, 290–300. [Google Scholar] [CrossRef]
  27. Guo, D.; Johnson, F.; Marshall, L. Assessing the potential robustness of conceptual rainfall-runoff models under a changing climate. Water Resour. Res. 2018. [Google Scholar] [CrossRef]
  28. Vaze, J.; Post, D.A.; Chiew, F.H.S.; Perraud, J.M.; Viney, N.R.; Teng, J. Climate non-stationarity—Validity of calibrated rainfall–runoff models for use in climate change studies. J. Hydrol. 2010, 394, 447–457. [Google Scholar] [CrossRef]
  29. Coron, L.; Andréassian, V.; Perrin, C.; Bourqui, M.; Hendrickx, F. On the lack of robustness of hydrologic models regarding water balance simulation: A diagnostic approach applied to three models of increasing complexity on 20 mountainous catchments. Hydrol. Earth Syst. Sci. 2014, 18, 727–746. [Google Scholar] [CrossRef] [Green Version]
  30. Osuch, M.; Romanowicz, R.J.; Booij, M.J. The influence of parametric uncertainty on the relationships between HBV model parameters and climatic characteristics. Hydrol. Sci. J. 2015, 60, 1299–1316. [Google Scholar] [CrossRef] [Green Version]
  31. Saft, M.; Peel, M.C.; Western, A.W.; Perraud, J.M.; Zhang, L. Bias in streamflow projections due to climate-induced shifts in catchment response. Geophys. Res. Lett. 2016, 43, 1574–1581. [Google Scholar] [CrossRef]
  32. Fowler, H.J.A.; Peel, M.C.; Western, A.W.; Zhang, L.; Peterson, T.J. Simulating runoff under changing climatic conditions: Revising an apparent deficiency of conceptual rainfall-runoff models. Water Resour. Res. 2016, 52, 1820–1846. [Google Scholar] [CrossRef] [Green Version]
  33. Dakhlaoui, H.; Ruelland, D.; Tramblay, Y.; Bargaoui, Z. Evaluating the robustness of conceptual rainfall-runoff models under climate variability in northern Tunisia. J. Hydrol. 2017, 550, 201–217. [Google Scholar] [CrossRef]
  34. Sleziak, P.; Szolgay, J.; Hlavčová, K.; Duethmann, D.; Parajka, J.; Danko, M. Factors controlling alterations in the performance of a runoff model in changing climate conditions. J. Hydrol. Hydromech. 2018, 66, 381–392. [Google Scholar] [CrossRef] [Green Version]
  35. Stephens, C.M.; Marshall, L.A.; Johnson, F.M. Investigating strategies to improve hydrologic model performance in a changing climate. J. Hydrol. 2019, 579. [Google Scholar] [CrossRef]
  36. Ceola, S.; Arheimer, B.; Baratti, E.; Blöschl, G.; Capell, R.; Castellarin, A.; Freer, J.; Han, D.; Hrachowitz, M.; Hundecha, Y.; et al. Virtual laboratories: New opportunities for collaborative water science. Hydrol. Earth Syst. Sci. 2015, 19, 2101–2117. [Google Scholar] [CrossRef] [Green Version]
  37. Kostka, Z.; Holko, L. Soil Moisture and Runoff Generation in Small Mountain Basin. Institute of Hydrology SAS; Slovak Committee for Hydrology: Bratislava, Slovakia, 1997; 84p. [Google Scholar]
  38. Hlaváčiková, H.; Holko, L.; Danko, M.; Novák, V. Estimation of macropore flow characteristics in stony soils of a small mountain catchment. J. Hydrol. 2019, 574, 1176–1187. [Google Scholar] [CrossRef]
  39. Sleziak, P.; Danko, M.; Holko, L. Testing an alternative approach to calibration of a hydrological model under varying climatic conditions. Acta Hydrol. Slovaca 2019, 20, 131–138. [Google Scholar] [CrossRef]
  40. Viglione, A.; Parajka, J. TUWmodel: Lumped/Semi-Distributed Hydrological Model for Education Purposes. R Package Version 1.1-1. 2020. Available online: https://cran.r-project.org/web/packages/TUWmodel/index.html (accessed on 20 August 2020).
  41. Parajka, J.; Merz, R.; Blöschl, G. Uncertainty and multiple calibration in regional water balance modelling case study in 320 Austrian catchments. Hydrol. Process. 2007, 21, 435–446. [Google Scholar] [CrossRef]
  42. Ardia, D.; Mullen, K.M.; Peterson, B.G.; Ulrich, J. DEoptim: Diferential evolution in R. J. Stat. Softw. 2015, 40, 1–26. [Google Scholar] [CrossRef] [Green Version]
  43. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models part I—A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  44. Klemeš, V. Operational testing of hydrological simulation models. Hydrol. Sci. J. 1986, 31, 13–24. [Google Scholar] [CrossRef]
Figure 1. Map of Europe (bottom)—the red point indicates the location of the study catchment within Slovakia. The topography of the Jalovecký Creek catchment (top)—the black triangle indicates stream gauge; the black circles are climate stations providing the air temperature (numbers 1, 2 and 5) and precipitation data (numbers 1–7); the blue star represents site with snow water equivalent measurement.
Figure 1. Map of Europe (bottom)—the red point indicates the location of the study catchment within Slovakia. The topography of the Jalovecký Creek catchment (top)—the black triangle indicates stream gauge; the black circles are climate stations providing the air temperature (numbers 1, 2 and 5) and precipitation data (numbers 1–7); the blue star represents site with snow water equivalent measurement.
Water 12 02362 g001
Figure 2. Variability of mean monthly precipitation, air temperature and runoff in five specific periods.
Figure 2. Variability of mean monthly precipitation, air temperature and runoff in five specific periods.
Water 12 02362 g002
Figure 3. Variability of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE), the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 calibration repetitions in five specific periods. The lines represent the medians and ranges (max–min) of different metrics obtained by different calibration repetitions. The right panel shows the variability of climatic characteristics in the five periods.
Figure 3. Variability of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE), the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 calibration repetitions in five specific periods. The lines represent the medians and ranges (max–min) of different metrics obtained by different calibration repetitions. The right panel shows the variability of climatic characteristics in the five periods.
Water 12 02362 g003
Figure 4. Variability of medians of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE) criteria, the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 repetitions of model simulations in four validation periods. The lines represent the medians of different metrics obtained by the different number of repetitions.
Figure 4. Variability of medians of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE) criteria, the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 repetitions of model simulations in four validation periods. The lines represent the medians of different metrics obtained by the different number of repetitions.
Water 12 02362 g004
Figure 5. Variability of the ranges of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE) criteria, the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 repetitions of model simulations in four validation periods. The lines represent the ranges (Max–Min) of different metrics obtained by the different number of repetitions.
Figure 5. Variability of the ranges of the objective function (OF), the logarithmic Nash-Sutcliffe efficiency (logNSE), the Nash-Sutcliffe efficiency (NSE) criteria, the root mean square error (RMSE) and the volume error (VE) after 10, 50, 100, 300, 500 repetitions of model simulations in four validation periods. The lines represent the ranges (Max–Min) of different metrics obtained by the different number of repetitions.
Water 12 02362 g005
Figure 6. Medians of model parameters after 10, 50, 100, 300, 500 calibration repetitions in five specific periods.
Figure 6. Medians of model parameters after 10, 50, 100, 300, 500 calibration repetitions in five specific periods.
Water 12 02362 g006
Figure 7. Ranges of model parameters after 10, 50, 100, 300, 500 calibration repetitions in five specific periods.
Figure 7. Ranges of model parameters after 10, 50, 100, 300, 500 calibration repetitions in five specific periods.
Water 12 02362 g007
Figure 8. Comparison of measured and simulated runoff after 10 (a) and 500 simulations (b). Runoff is simulated in a colder/wetter validation period (2001–2006) with model parameters obtained in a warmer/drier calibration period (1989–1994). The yellow colour indicates the range between the minimum (blue line) and maximum (red line) simulated runoff obtained by 10 and 500 repetitions.
Figure 8. Comparison of measured and simulated runoff after 10 (a) and 500 simulations (b). Runoff is simulated in a colder/wetter validation period (2001–2006) with model parameters obtained in a warmer/drier calibration period (1989–1994). The yellow colour indicates the range between the minimum (blue line) and maximum (red line) simulated runoff obtained by 10 and 500 repetitions.
Water 12 02362 g008
Figure 9. Measured snow water equivalent (SWE) at catchment mean elevation (points) and simulated catchment SWE after 10 (top panel) (a) and 500 repetitions (b). SWE is simulated in a snow richer validation period (2001–2006) with model parameters obtained in the snow poorer calibration period (1989–1994). The blue and red colours indicate min. and max. SWE values obtained by 10 and 500 simulations, respectively.
Figure 9. Measured snow water equivalent (SWE) at catchment mean elevation (points) and simulated catchment SWE after 10 (top panel) (a) and 500 repetitions (b). SWE is simulated in a snow richer validation period (2001–2006) with model parameters obtained in the snow poorer calibration period (1989–1994). The blue and red colours indicate min. and max. SWE values obtained by 10 and 500 simulations, respectively.
Water 12 02362 g009
Table 1. Mean annual and seasonal characteristics of precipitation (P), air temperature (T), runoff (Q) and snow water equivalent (SWE) in the selected periods. Seasonal characteristics of P and T represent the period from April to October, SWEmax is the mean of SWE maximum in each year.
Table 1. Mean annual and seasonal characteristics of precipitation (P), air temperature (T), runoff (Q) and snow water equivalent (SWE) in the selected periods. Seasonal characteristics of P and T represent the period from April to October, SWEmax is the mean of SWE maximum in each year.
Annual CharacteristicsSeasonal Characteristics
PeriodP (mm/year)T (°C)Q (mm/year)P (mm/)T (°C)SWEmax (mm)
1989–199414913.69739978.1284
1995–200016692.8110910957.8420
2001–200615632.1102310397.3408
2007–201215702.9102710267.9416
2013–201814193.4109310248.2344
Table 2. 15 TUW model parameters according to the routines and their calibration ranges. The calibration ranges were taken from the literature [22].
Table 2. 15 TUW model parameters according to the routines and their calibration ranges. The calibration ranges were taken from the literature [22].
ParameterRoutineUnitRange
Snow correction factor (SCF)Snow 0.9–1.5
Degree-day factor (DDF)Snowmm/°C day0–5
Rain threshold temperature (Tr)Snow°C1–3
Snow threshold temperature (Ts)Snow°C−3–1
Melt temperature (Tm)Snow°C−2–2
Limit for potential evapotranspiration (Lprat)Soilday0–l
Maximum soil moisture storage (FC)Soilmm0–600
Nonlinearity parameter (BETA)Soil 0–20
Very fast storage coefficient (k0)Runoffdays0–2
Fast storage coefficient (k1)Runoffdays2–30
Slow storage coefficient (k2)Runoffdays30–250
Upper storage coefficient (Lsuz)Runoffmm1–100
Percolation rate (Cperc)Routingmm/day0–8
Maximum base parameter (Bmax)Routingdays0–30
Free scaling parameter (Croute)Routingday2/mm0–50

Share and Cite

MDPI and ACS Style

Sleziak, P.; Holko, L.; Danko, M.; Parajka, J. Uncertainty in the Number of Calibration Repetitions of a Hydrologic Model in Varying Climatic Conditions. Water 2020, 12, 2362. https://doi.org/10.3390/w12092362

AMA Style

Sleziak P, Holko L, Danko M, Parajka J. Uncertainty in the Number of Calibration Repetitions of a Hydrologic Model in Varying Climatic Conditions. Water. 2020; 12(9):2362. https://doi.org/10.3390/w12092362

Chicago/Turabian Style

Sleziak, Patrik, Ladislav Holko, Michal Danko, and Juraj Parajka. 2020. "Uncertainty in the Number of Calibration Repetitions of a Hydrologic Model in Varying Climatic Conditions" Water 12, no. 9: 2362. https://doi.org/10.3390/w12092362

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop