1. Introduction
Understanding issues related to the functioning of the Earth’s climate system, including the hydrological cycle, is an important challenge of the modern world. Increasingly frequent natural disasters, such as floods and droughts, on the one hand, pose a direct threat to human life, and, on the other hand, contribute to economic losses. Human activity significantly impacts the observed changes in the hydrological cycle [
1]. Extreme phenomena in the form of floods particularly affect mountainous catchments, characterized by a short hydrological response time, making the risk of floods exceptionally high in their area [
2].
Rainfall–runoff hydrological modeling is one of the basic cognitive methods for flood risk analysis. It is a fundamental element of operational hydrology in the forecasting and warning of extreme events. For this reason, hydrological models play an increasingly important role in decision-making processes at local, national, and global levels. One of the most important data sources necessary for reliable outflow simulations using hydrological rainfall–runoff modeling is knowledge of the temporal–spatial structure of the precipitation field over the analyzed area. Available sources of precipitation data (rain gauges, meteorological radars, satellites, and numerical weather predictions) are characterized by varying temporal–spatial resolution due to the technologies used to measure precipitation, which makes it difficult to determine a priori which precipitation field data source provides the most reliable information. In addition, precipitation is one of the main factors in the formation of floods, but its measurement in a mountainous area poses significant challenges due to the varied relief of the terrain [
3]. In hydrological rainfall–runoff modeling, the predictive aspect of the model is crucial. All of the most commonly used precipitation data sources, except numerical weather prediction, provide data with a lag.
The beginning of numerical precipitation forecasting dates back to the turn of the 20th century when Abbe [
4] and Bjerkens [
5] discovered that the laws of physics could be used to create a numerical precipitation model that could be used for predictive purposes using the current state of the atmosphere. In recent decades, significant progress has been made in understanding and describing atmospheric processes. Initially simple models are now reaching a very high level of complexity. This progress is so significant that it is sometimes referred to as a “quiet revolution” [
6], as it takes into account the most important discoveries in atmospheric physics of the past decades.
Nowadays, numerical weather prediction models are systems of nonlinear differential equations, with varying time steps (from minutes to months) and spatial distribution (from regional to global models), taking into account dynamic, thermodynamic, radiative, and chemical processes occurring in the atmosphere [
6]. Forecasts can also be generated probabilistically, allowing for an estimation of the confidence level of the generated forecasts.
The predictive nature of numerical precipitation forecasts means that they can be used to forecast river runoff using hydrological models [
7,
8,
9] or hurricanes [
6]. This is particularly useful for deciding on the projects needed in the event of an emergency (such as flooding) [
10]. Many of the numerical precipitation models that have been developed are used for operational purposes, e.g., the European Center Medium-Range Weather Forecasts (ECMWF) [
11] or the numerical precipitation forecast model of the Japan Meteorological Agency [
12].
This paper aims to use a Global Environmental Multiscale (GEM) numerical precipitation prediction for a hydrological analysis in a small mountainous catchment, as these data have not yet been used for this purpose. To date, GEM data have been mainly used in hydrological modeling that focused on large catchments. Gaborit et al. [
13] have applied the GEM data coupled in the GEM-Hydro runoff modeling platform to the flat watershed of Lake Ontario. There are research studies by Rokata et al. [
14] and Abaza et al. [
15] showing the application of the GEM data for the big watershed to, among other things, perform continuous modeling. GEM data have also been investigated for hydrological purposes but were not strictly related to streamflow simulation. For example, Asong et al. [
16] used these data for the investigation of climate change impact. In this study, the potential of GEM data was investigated for event-based modeling. This was carried out in two aspects: modeling the flow in the catchment and determining moisture conditions through AMC classification.
2. Materials and Methods
2.1. Characteristics of Study Area
The upper part of the Skawa catchment was chosen as a study area. The main reason for flooding in this area is excessive precipitation and precipitation accumulation around neighboring regions, leading to significant surface runoff. The selected catchment is relatively small (~240 km2), and its water network is characterized by a dense dominant short stream with significant slopes, leading to quick response time (around 2.5 h). There are four rain gauges in the catchment area, of which, one is directly located in the investigated study area. The rain gauges are not well distributed over the entire study area, making the areal estimation of precipitation more challenging. Radar data application over this area is difficult due to radar shadow phenomena. Therefore, hydrological and hydraulic modeling over that kind of area is extremely challenging while essential at the same time due to the risk of flooding.
Figure 1 presents the major characteristics of the study area.
Historically, flash floods in this area were already noticed in the XV century. The most recent flood events occurred in 2010, 2014, and 2019, leading to substantial material losses in built-up parts of the catchment and significant changes to the topography in forested areas. The logs transported in the streams have a considerable impact on the accumulation of sediment transport, increasing the risk of flooding. Climate change impact will most likely result in even more frequent and intense precipitation events. Considering that, we can expect even more severe floods in the years to come.
The case study of the upper Skawa catchment, due to its characteristics (relatively small area, mountainous character, quick response time, and a limited number of rain gauges), is representative of similar study areas in Slovakia and the Czech Republic, as well as other parts of Europe.
2.2. Data Acquisition and Processing
2.2.1. Global Environmental Multiscale Model
At Warsaw University of Technology in Poland, in cooperation with Ecoforecast Foundation, an integrated system of numerical models was created based on the operational model of the Canadian Meteorological Center, Global Environmental Multiscale Model (GEM), along with its atmospheric chemistry extension-GEM-AQ [
17,
18].
The model operates in two configuration sets:
Global—characterized by variable resolution numerical grid. This set covers the entire globe and focuses on Europe with 0.135° (15 km) grid spacing;
Mesoscale—located over Poland with 0.0625° (5 km) spacing.
Figure 2 presents the visualization of the GEM computational grid applied in two different configuration sets.
For the purpose of this study, a nested mesoscale model was used. Its characteristics are presented in
Table 1.
Forecasts are generated in an integrated system of numerical dynamics and atmospheric chemistry models. The forecast calculation begins daily at 18 according to UTC time and is performed up to 72 h in advance. One of the operationally generated products is hourly precipitation totals (at ground level)—this product was used in this study.
The provided precipitation data were recorded in the form of point precipitation height information for the coordinates of the center of the computational grid. These data were processed into a numerical grid (
Figure 3).
For the purposes of the hydrological model, individual rainfall hyethograms were created for each sub-catchment representing the weighted average rainfall from the GEM model pixels covering each sub-catchment.
2.2.2. Rain Gauge, Discharge, and Other Data
For the purposes of this study, precipitation data from the meteorological network managed by the Institute of Meteorology and Water Management–National Research Institute in Poland were used. This network consists of 491 telemetry stations, and measurements are made with a time step of 10 min. All measurements are automatically subjected to quality control, taking into account the spatio-temporal consistency and a range check using climatological values [
29]. There are 4 stations in and around the catchment area. Their characteristics are presented in
Table 2.
The acquired data were aggregated to a time step of 1 h before being used in further analyses. The rain gauge data were used to determine AMC groups for sub-catchments, which were then compared with the AMC classification calculated using GEM data.
The discharge data were acquired at the gauging station located in Osielec (
Figure 1). The obtained data were available at hourly time-steps. Similarly, like for the precipitation data, the discharge data came from a network managed by the Institute of Meteorology and Water Management–National Research Institute in Poland.
Precipitation data for rain gauge and GEM, as well as discharge data, were obtained between 2014 and 2016. During this interval, there were six flood events. Three of them were used for the calibration of the model, while the other three were used for its calibration.
Other data used in the study were the Digital Elevation Model (DEM) of 100 m resolution that was provided by the Head Office of Geodesy and Cartography in Poland and land-use data from the CORINE Land Cover Project 2018. Statistical analysis and data processing were performed using R software version 4.2.1 developed at Bell Laboratories (downloaded from the website of Imperial College London, United Kingdom).
2.3. Rainfall-Runoff Modeling and Assessment
Rainfall–runoff hydrological modeling was performed using Hydrologic Engineering Center-Hydrologic Modelling System (HEC-HMS) version 4.9. It is a freeware software developed by the US Army Corps of Engineers that enables continuous and event-based hydrological modeling. Depending on the selected parameters, the model can be used in a lumped or semi-distributed mode and is widely used for flood simulations in various scientific applications [
30,
31,
32]. The HEC-HMS model was linked with HEC-DSSVue software version 2.0.1 developed by the US Army Corps of Engineers to provide precipitation and discharge data. Both softwares. HEC-HMS and HEC-DSSVue, were downloaded from the website of US Army Corps of Engineers.
For the purpose of this work the model was adopted in a semi-distributed scheme using the following parameters for the catchment model:
Rainfall losses—Soil Conservation Service (SCS) Curve Number (CN);
Transformation of effective precipitation—Snyder unit hydrograph;
Baseflow—Recession baseflow;
Routing—Musingum–Cunge.
As for the meteorological model, the specified hyetographs were applied for every sub-catchment for each data source of precipitation (rain gauge and GEM). During the calibration phase, the following parameters were calibrated:
The peak-weighted RMSE metric was applied as an objective function during the calibration of the model. The simulated discharge was compared to the observed flow registered at the gauging station in Osielec. Simulations were performed at hourly time-steps.
To provide multi-aspect analysis during the calibration and validation phase, the following evaluation metrics were chosen (
Table 3).
2.4. Calculation of Antecedent Moisture Conditions
Moisture conditions for each sub-catchment were defined by assigning a so-called Antecedent Moisture Condition (AMC) group. A distinction was made between three AMC groups depending on the amount of antecedent precipitation:
AMC I—the lowest probability of surface runoff;
AMC II—a moderate probability of surface runoff occurrence;
AMC III—the highest probability of surface runoff.
The methodology for assigning an AMC group for a given sub-catchment is to calculate the sum of precipitation from the 5 days preceding the flood event and then attribute an AMC group in accordance with
Table 4. A distinction was made between the growing season and the non-growing season, which were characterized by different precipitation thresholds.
Moisture conditions must be determined for each flood period before running the simulation. The AMC factor does not have tolerance ranges, but rigid limits that cause a change in group assignment. Therefore, it can be assumed that it does not reflect the actual dynamics of changes in effective precipitation. In order to increase the precision of the estimation of moisture conditions, it would be necessary to take into account, among other things, temperature or insolation, which can significantly affect wetness. For this reason, in order to simplify the subsequent calibration process of the CN coefficient and to minimize the spatial variability of moisture content, a common solution is to calculate the antecedent precipitation totals for sub-catchments by assigning to them the precipitation sums from the nearest meteorological station.
AMC conditions were determined for sub-catchments using two data sources of precipitation–rain-gauge data and GEM. It was decided to carry that out due to the fact that precipitation data from meteorological stations are mostly used for this purpose and will serve as a valuable reference to the AMC classification obtained from GEM data. Using the precipitation fields created by the Inverse Distance Weighting (IDW) interpolation, the area average precipitation total for each sub-catchment was calculated, and the corresponding AMC group number was assigned. It was decided to carry that out in order to account for the greater spatial variability of precipitation. The IDW method was chosen for the calculations due to the fact that it is the primary method of interpolating precipitation built into the HEC-HMS hydrological model and because of the very good flow simulation results obtained using this interpolation method [
36,
37]. When using data from the GEM model, a weighted average proportional to the area of the GEM pixel covering a given sub-basin was calculated. For both data sources, 5-day precipitation totals were calculated based on 1 h accumulations. All investigated flood events during the calibration and validation phases corresponded to the growing season.
4. Summary and Conclusions
The purpose of this paper was to present the potential of using numerical weather prediction data from a Global Environmental Multiscale model for rainfall–runoff hydrological modeling in a small mountain catchment. The analysis explored the feasibility of using GEM data to simulate discharge and determine antecedent moisture conditions in the catchment.
It is worth noting that, according to the literature review conducted by the article’s author, this is the first published attempt to use GEM data directly in applied hydrological applications, such as event-based modeling, for a small mountainous catchment. Previous works using GEM data have focused only on strictly meteorological aspects (e.g., [
39,
40,
41]) or continuous hydrological modeling on large catchments.
Based on the analysis, the following conclusions can be drawn:
GEM data can be used for rainfall–runoff modeling with satisfactory results. Since these data are available in advance, they can be used to conduct a preliminary forecast of discharge.
Better results of the projected outflow were obtained for heavy rainfall events (Events 1, 3 and 4). For relatively small rainfall events, the GEM data appear to underestimate the amount of precipitation, resulting in a consequent underestimation of the simulated flow rate.
Overall, the best simulation results were obtained for Event 1, which has the highest value of maximum flow. This may indicate a significant potential for the use of GEM data to forecast large floods.
GEM data can be useful for determining moisture conditions in the catchment area. Assigned AMC groups for sub-catchments were very close to those determined from the rain gauge data. This is a very important advantage of these data, especially if they would allow us to determine in advance that we may be dealing with AMC Group III, where the consequences of which could be very large floods.
Moisture conditions are often a neglected aspect during hydrological modeling, but, as shown during the calibration process, they play a vital role. The correct identification of AMC groups allows us to determine the range of values of the parameters of the rainfall losses model closer to reality. In particular, it is a matter of limiting unrealistic calibration results, which can give parameter values for a higher AMC group.
A significant disadvantage of the SCS-CN model in determining its parameters is the rigid limits in the classification adopted (
Table 4). Another aspect is the uncertainty associated with the correct determination of conditions in the catchment from the point of view of vegetation and land use, which affects the estimated parameters. Therefore, the determined values of the model parameters should be approached not deterministically but with a significant amount of tolerance at the calibration phase.
Despite the fact that GEM data do not perform the best in forecasting small-scale flood events, they can still be successfully used to determine the AMC in such cases.
It should be noted that the presented discharge simulation results were unsatisfactory in several cases. Given that only parameters related to precipitation losses were calibrated in the hydrological model, it seems that the main reason for such results is the inaccuracy of the GEM data. In particular, this is an underestimation of rainfall for small events, which leads to better-projected discharge for heavy rainfall events and worse for small rainfall events. Due to dynamic processes in the atmosphere, forecasting precipitation is complex, and, as shown in
Table 6, there are often underestimations for small events. For significant rainfall events, forecast values can often be higher than those that occur in reality. On the other hand, if the hydrological model is properly calibrated, this does not necessarily lead to an overestimation of the predicted discharge.
In order to accurately analyze the sources of errors that lead to the obtained results, it is necessary to consider the errors directly related to the GEM data separately, as well as the errors resulting from the specifics of the hydrological model. For example, it would be interesting to validate the GEM data using the rain gauge data or to adjust the bias using high-resolution satellite data. As for the hydrological model, the current version used for validation could be recalibrated using other model parameters that were not calibrated. In such an approach, however, it should be kept in mind that there could be a situation of model over-calibration, which could lead to unrealistic results.
In future work, it would be interesting to compare the determination of AMC conditions for the catchment area using other precipitation data, such as satellites and meteorological radars. It would also be necessary to verify what the effect of the interpolation method for rain gauge data on AMC classification is. Another interesting aspect from a hydrological point of view is a separate analysis of errors from GEM’s outputs and the HEC-HMS hydrological model.