1. Introduction
Flood frequency analysis plays a critical role in hydrological design, infrastructure safety, and risk assessment, particularly in the context of increasing hydroclimatic variability [
1].
Romania has historically faced significant challenges related to flood events, which have had severe socio-economic and environmental impacts [
2,
3]. The Danube River, forming the southern border of the country, is prone to extreme flood events due to a combination of upstream contributions from Central Europe and snowmelt-induced high flows during spring. Notable floods occurred in 2006 and 2010, affecting large portions of southern Romania, including urban areas such as Giurgiu and Drobeta-Turnu Severin [
4]. The Siret River, one of the largest tributaries of the Danube, is characterized by rapid runoff from the Carpathian region and has generated destructive flash floods, particularly in northeastern Romania (e.g., in 2005 and 2008) [
5]. The Ialomita River, although smaller in size, also presents a high flood risk due to its torrential regime and the presence of densely populated and agricultural areas along its lower course. These events underscore the need for robust flood frequency modeling to support infrastructure planning and disaster risk reduction strategies.
In Romania, traditional flood frequency analysis methods—typically based on the Pearson III distribution and the method of moments—employ a limited range of statistical tools which may not adequately represent the behavior of extreme hydrological events [
6]. This limitation highlights the need for a more detailed assessment of the applicability and performance of the GEV distribution.
The GEV distribution is widely used to analyze and model extremes in datasets from various fields [
7,
8,
9]. Its flexibility and adaptability make it a valuable resource in hydrology, climate science, and other fields dealing with extreme value analysis.
In hydrology, the GEV distribution is used in frequency analysis to forecast the probability and magnitude of extreme hydrological events, providing vital information for hydrological infrastructure design, water resource management, and flood risk assessment [
10,
11]. Frequency analysis represents a direct method for determining this information, relying exclusively on the use of statistical distributions, particularly the statistical indicators of the analyzed data [
6].
Characterized by three parameters (shape, scale, and location), the GEV distribution is flexible and capable of modeling different hydrological data sets with varied statistical characteristics, being a heavy-tailed distribution with no upper bound.
Alongside the Pearson III, Log-Pearson, Generalized Pareto, Weibull, and Generalized Logistic distributions, the GEV distribution is one of the most widely used three-parameter statistical distributions in frequency analyses of extreme events (maximum and minimum) in hydrological and meteorological processes. It is used to evaluate the probability of occurrence of extraordinary events of varying magnitudes (at site and regional levels), such as maximum discharges [
7,
8,
9,
10,
11], maximum precipitation [
12,
13,
14,
15], the design of Intensity–Duration–Frequency (IDF) curves [
16,
17,
18,
19], and the analysis of extreme temperatures [
20,
21,
22].
Regarding parameter estimation methods, the GEV distribution has been used in frequency analyses primarily with the method of ordinary moments (MOM) [
23,
24,
25,
26,
27], the method of linear moments (L-moments) [
24,
26,
28,
29], the method of maximum likelihood (ML) [
26,
30,
31], the method of least squares (MLS) [
26,
32], the probability weighted moments method [
26,
33,
34], and the Bayesian method [
35,
36,
37]. Of all these estimation methods, the most common are the MOM and L-moments methods, commonly used both for in situ analyses, but especially in analyses regarding the regionalization of extreme events. These methods facilitate the derivation of statistical indicators, such as mean, coefficient of variation, skewness (in the case of MOM), or L-skewness and L-kutozis (in the case of L-moments), which are essential for the regionalization of extreme hydrological events.
In the case of the L-moments method, it is currently one of the most popular and widely used methods in regional frequency analyses of extreme events [
26,
38,
39,
40,
41]. The widespread applicability of this method is due to its advantages and superiority over other methods, the most important of which include the following: (1)
parameter estimation based on statistical indicators that can be determined both in situ
and regionally; (2)
the stability and robustness of this method to the influence of variability in the lengths of available observed data, as well as the presence of outliers [
26,
38,
39]; (3)
the existence of rigorous numerical and graphical criteria for selecting the best distribution [
26,
40,
41]; (4)
the availability of an accessible approach for constructing the confidence interval [
24,
26,
38,
39,
42].
Of course, there are other methods, such as the MOM or Bayesian approaches, but these present certain difficulties in application. The Bayesian method is less used due to its higher mathematical and computational complexity, requiring specialized programming/software. It also introduces additional uncertainties in the choice of the prior distribution of parameter variability, requiring additional data such as precipitation, temperature, etc.
Regarding the MOM, its applicability in the regionalization of extreme events is increasingly limited (especially in Eastern Europe) due to the disadvantages related to the corrections required for higher-order statistical indicators (skewness and kurtosis), which lead to significant uncertainties and errors [
6,
24,
43].
In general, the results of extreme event frequency analysis are characterized by two important categories of uncertainty, namely, the uncertainty in selecting the best distribution and the uncertainty in choosing the most suitable parameter estimation method.
The interpretation of results involves two components: the objective component and the subjective component. These two modes of interpreting results are heavily influenced by the parameter estimation method used for the analyzed distributions, the existence or absence of rigorous criteria for selecting the best model, and the availability of data lengths. For example, in cases with short data lengths and applications requiring the forecasting of values beyond the range of recorded probabilities (such as flood frequency analysis or maximum precipitation analysis), most estimation methods (like maximum likelihood, least squares, Bayesian methods, and mixed moments methods) render statistical tests and performance criteria less relevant. In such situations, the dominant selection criteria are subjective, such as graphical model selection based on the researcher’s experience and available historical information, which can lead to unreliable parameter estimates and misleading conclusions.
Regarding the uncertainties, the analysis presents the uncertainties in selecting the best model and the uncertainties regarding the behavior of the GEV distribution as a “parent” distribution. In many cases the “parent” distributions have already been established. The rigor in choosing these “parent” models is indeed an important issue but will not be explored in depth in the present manuscript. For example, in the U.S., the parent distribution is the Log-Pearson distribution, both for maximum flows and for maximum precipitation [
8,
9,
38,
39]. In Canada, the parent distribution for maximum flows is the two-parameter log-normal distribution [
9], while for maximum precipitation the reference distribution is the Gumbel distribution [
9]. In Finland and Spain, the recommended distribution is the Gumbel distribution [
8]. The generalized Pareto distribution is recommended in Belgium [
8]. In the U.K., the parent distribution is the three-parameter Log-Logistic distribution (generalized Logistic) [
8]. In Italy, the recommended distribution is the Two-Component Extreme Values (TCEVs) distribution [
8,
44]. Regarding the GEV distribution, it is the parent distribution in Austria, and is recommended in Germany, Italy, Spain, and Slovakia [
8].
In Romania, the Pearson Type III distribution is commonly adopted as the parent distribution for modeling hydrological extremes [
24,
45,
46,
47]. However, this choice is often not supported by rigorous and comprehensive statistical validation. In many cases, it is applied using the method of ordinary moments, with skewness values imposed based on assumptions regarding the genesis of maximum flows. Subsequently, quantiles for various annual exceedance probabilities are estimated through standard tables and linear interpolation. Such practices may introduce notable uncertainties in the estimation process.
Another important aspect is the examination of the GEV distribution’s behavior with respect to the variability in the lengths of available data. This includes highlighting theoretical biases for representative samples typically encountered in such analyses. The biases are identified for statistical indicators, estimated parameters, and predicted quantile values, under the assumption that the analyzed samples come from a GEV population. These theoretical biases of the distribution are highlighted for the following: (1) the entire range of definition of statistical indicators; (2) the full range of annual exceedance probabilities, particularly in the domain of rare and very rare events (low and very low annual exceedance probabilities, events considered “impossible” in terms of occurrence).
Such theoretical analyses regarding the influence of data lengths were made by Hosking [
40] for the GEV distribution, who highlighted the theoretical biases of the distribution over a relatively narrow range of the theoretical values matrix (coefficient of L-variation–L-skewness), using the L-moments method; Singh [
48], who identified biases in the two-parameter Log-Logistic distribution using MOM and L-moments for various data lengths; Jokiel Rokita et al. [
49], who identified biases in the three-parameter Weibull distribution for the maximum likelihood method and other methods; Anghel [
24], who highlighted the theoretical biases of the Gumbel distribution. In [
10], Mertins and Stedinger analyzed the biases of the Gumbel distribution for restricted ranges of the shape parameter (between −0.25 and 0.30) and not for specific theoretical values of the higher statistical indicators specific to the L-moments method. In other materials, only the biases reported exclusively for the analyzed data (at-site wind analysis) were highlighted [
50].
Thus, the manuscript aims to improve and expand Hosking’s analysis [
40] by highlighting these theoretical biases across the entire range of definition for the two statistical indicators related to the L-moments method. Additionally, it seeks to introduce a new element by identifying theoretical biases for the method of ordinary moments over a broader range of statistical indicators (coefficient of variation and skewness), so that these reflect the most common cases encountered in the field of flood frequency analysis.
Since the L-moment method is recognized as the most robust and stable method for parameter estimation, it requires the use of a distribution with at least four parameters to correctly calibrate all four linear moments specific to this method. Thus, the Wakeby distribution was chosen as the reference distribution as it is one of the most stable and robust distributions with a large number of parameters [
18,
24,
26,
40,
41].
As novel elements of the manuscript, we highlight the most important ones:
- −
The particularization and highlighting of the behavior of the probability density function (PDF), cumulative distribution function (CDF), and quantile function (x(p)), with a statistical and hydrological approach from Romania (as well as other regions in southeastern Europe, and some regions in China), regarding the choice of data skewness in the genesis function of maximum flows [
51,
52], as well as the representation of these curves based on theoretically relevant values of these statistical indicators.
- −
The presentation of new improved approximate relationships for estimating the parameters of the GEV distribution using MOM, L-moments, and LH-moments.
- −
The presentation of variation curves for the shape parameter as a function of skewness and L-skewness, providing important information for selecting initial values for accurately determining the parameters that require solving a system of three nonlinear equations.
- −
The presentation of exact and approximate relationships for the frequency factors in the inverse function component, an approach that eliminates the need to solve nonlinear equations, within acceptable relative error limits.
- −
The presentation of the influence of available data lengths on the behavior of the GEV distribution, highlighting theoretical biases across the entire matrix defining higher-order statistical indicators. This matrix theoretically covers any statistical and hydrological situation in FFA, thus providing essential information for considering the uncertainties associated with forecasted values. Representative samples from a statistical and hydrological perspective are selected (n = 20, 35, 55, 90).
- −
A detailed analysis regarding the determination of the empirical probability of historical data in the case of the PWMs method and LSM. A significant number of empirical probabilities are comparatively analyzed, highlighting the behavior of the GEV quantile tail in relation to these choices, which generally represent a “subjective choice” into statistical analysis.
This is the first time such a holistic analysis regarding the methods for estimating the parameters of the GEV distribution and the applicability of this distribution is being carried out in Romania, especially considering the specific characteristics of maximum discharge frequency analysis in Romania (i.e., determining skewness based on the genesis of maximum discharges).
Since the most effective way to illustrate the behavior of a statistical distribution is through cases where the estimation of values for extremely rare events is necessary, the behavior of the GEV distribution is demonstrated using four case studies with varied morphometric, hydrological, and statistical characteristics, such as the number of years of analysis, the catchment area, the average altitude, the variability of peak flows, etc. These studies conduct an at-site flood frequency analysis, predicting values for extreme events considered “impossible” [
53] in terms of occurrence probability (T = 10,000 years), but which are crucial for designing Class 1 critical structures like dams [
54].
3. Case Studies
The case studies involve conducting in situ frequency analyses of annual maximum discharges, with the aim of determining extreme maximum values corresponding to annual exceedance probabilities of interest in technical hydrology.
Figure 2 shows the four analyzed locations. Two of them are associated with the Danube River, one location on the Siret River, and one on the Ialomita River, all situated in Romania.
The Danube River is the second longest river in Europe (after the Volga), with a total length of approximately 2860 km. It originates in the Black Forest Mountains (Germany) and flows into the Black Sea through the Danube Delta, crossing the territories of Romania and Ukraine. In Romania, the Danube travels approximately 1075 km and forms the border with Serbia, Bulgaria, Moldova, and Ukraine. Between Baziaș and Drobeta-Turnu Severin, the river passes through a spectacular gorge with rocky cliffs and deep waters. Here, the Iron Gates I and II dams are located. Between Drobeta-Turnu Severin and Călărași, the river flows through a lower area with wide floodplains and meanders. The delta section starts at Brăila and includes the actual delta area, where the river splits into three main branches: Chilia, Sulina, and Sfântu Gheorghe. The waters of the Danube are influenced by the melting of snow from the Alps and Carpathians, as well as by precipitation. The most important rivers that flow into the Danube are the Cerna, Jiu, Olt, Argeș, Ialomita, and Siret.
The Siret River originates in the Obcina Feredeului Mountains (Eastern Carpathians, Ukraine), with a length of approximately 706 km, of which 559 km are in Romanian territory. It has a catchment area of approximately 47,610 km2, making it the largest catchment area among the Danube’s tributaries on Romanian territory, flowing into the Danube near Galați. The Siret has a nival–pluvial hydrological regime, meaning it is fed both by snowmelt and precipitation, with a variable discharge that increases in spring (March-April) due to snowmelt and secondary increases in autumn due to heavy rainfall. It is a river with a high flood risk, having numerous flash floods, especially in years with abundant rainfall. Along its course, dams and reservoirs have been constructed to mitigate floods and produce hydroelectric power. The Siret River basin area is influenced by a temperate continental climate, with significant variations between the mountain regions and the plains. The temperatures are 4–6 °C in the mountainous area and 9–11 °C in the plains. Precipitation is more abundant in spring and autumn, and summer drought episodes can occur, particularly in the southern part of the basin. The average annual precipitation ranges from 800–1200 mm/year in the mountainous area to 450–600 mm/year in the hilly and plain areas.
The Ialomita River originates in the Bucegi Mountains (Southern Carpathians) at an altitude of approximately 2300 m, below the Omu Peak. With a drainage basin of 10,350 km2 and a length of about 417 km, it is one of the most important rivers in southern Romania. The regime is nival–pluvial, primarily fed by the melting of snow from the Bucegi Mountains and precipitation from the subcarpathian and plain areas. Maximum discharges occur in spring (March–April) due to snowmelt and rainfall, while minimum discharges occur in summer, when severe droughts can be recorded. The river poses a high flood risk, especially in the plain areas. The average annual temperature ranges from 0–5 °C in the mountain area to 10–11 °C in the plain area.
Figure 3 shows the chronological series of data analyzed at each hydrometric station.
In
Figure 4, box plot graphs of the analyzed data are presented, showing the distribution and variability of the data.
To verify stationarity, the
t-test was used [
54], and the analyzed data were found to be stationary. No outliers were identified in the analyzed series after applying the Grubbs [
38,
39,
60] test (
Table 1).
Table 1.
The results of the stationarity and outliers check.
Table 1.
The results of the stationarity and outliers check.
Series | t-Test | Grubbs |
---|
Results | Critical Value (10%) | Rezults (Upper) | Max (Q) |
---|
Danube/Drobeta | −0.242 | 1.992 | 18,544 | 16,300 |
Danube/Giurgiu | 1.157 | 2.028 | 16,146 | 15,800 |
Siret/Lungoci | 0.277 | 2.026 | 7040 | 4650 |
Ialomita/Tandarei | −0.087 | 2.04 | 943 | 468 |
Considering that the method of ordinary moments and the method of linear moments are two methods that exclusively use characteristic moments and higher-order statistical indicators, their corresponding values for the analyzed sites are highlighted in
Table 2 and
Table 3.
Table 2.
The statistical indicators of the observed data: MOM.
Table 2.
The statistical indicators of the observed data: MOM.
River | Station | Record Length | | | | | |
---|
[m3/s] | [m3/s] | [−] | [−] | [−] |
---|
Danube | Drobeta | 38 | 10,480.4 | 1816.6 | 0.173 | 0.881 | 6.167 |
Danube | Giurgiu | 78 | 10,972.7 | 2016.5 | 0.184 | 0.379 | 4.853 |
Siret | Lungoci | 39 | 1442.5 | 915.1 | 0.634 | 1.413 | 7.872 |
Ialomita | Tandarei | 33 | 224.1 | 118.1 | 0.527 | 0.327 | 4.074 |
Table 3.
The statistical indicators of the observed data: L-moments method.
Table 3.
The statistical indicators of the observed data: L-moments method.
River/Station | | | | | | | |
---|
[m3/s] | [m3/s] | [m3/s] | [m3/s] | [−] | [−] | [−] |
---|
Danube/Drobeta | 10,480.4 | 999.9 | 172 | 159.1 | 0.095 | 0.172 | 0.159 |
Danube/Giurgiu | 10,972.7 | 1142.3 | 100.1 | 138.5 | 0.104 | 0.088 | 0.121 |
Siret/Lungoci | 1442.5 | 489.5 | 111.5 | 90.6 | 0.339 | 0.228 | 0.185 |
Ialomita/Tandarei | 224.1 | 68.6 | 6.1 | 1.7 | 0.306 | 0.089 | 0.025 |
5. Discussions
In all four case studies, the GEV distribution was compared to the Pearson III distribution (considered the “parent” distribution in Romania, using MOM) and the Wakeby distribution (with five parameters, used as a robust reference model due to its ability to calibrate all four L-moments).
The main objectives of the presented analysis are as follows:
- (1)
To verify the applicability of the GEV distribution in Romania by determining the maximum discharges associated with rare and very rare events required in technical hydrology, assuming apriori that the data come from a GEV distribution.
- (2)
To compare the maximum values predicted by the GEV distribution with those predicted by the Pearson III distribution, which is considered the “parent” distribution in Romania.
- (3)
To compare the values predicted by GEV and PE3 with those predicted by the Wakeby distribution, which has five parameters estimated using the L-moments method, chosen as the reference distribution and method.
- (4)
To highlight the behavior of the GEV distribution, particularly the uncertainties characterizing the predicted values due to the variability in the lengths of available data series.
The results are compared with the distribution considered the “parent” distribution in Romania, namely the Pearson III distribution, with its three parameters (location, scale, and shape) estimated using the method of ordinary moments, which is the parameter estimation method used in Romania. The results are also compared with the Wakeby distribution, which has five parameters estimated using the method of linear moments [
41,
58,
61]. The justification for choosing this distribution as the reference distribution is explained in the previous sections, with the most important reasons being the high number of parameters which manage to properly calibrate all four linear moments specific to the L-moments method. As a result, the uncertainties regarding the values estimated using this method and distribution are characterized by reduced uncertainties, with the predicted values having a high degree of confidence.
Analyzing the obtained results, the GEV distribution showed consistent differences in the predicted maximum discharges compared to the Pearson III and Wakeby distributions.
In the domain of very rare events (probabilities of exceedance p < 1%), the GEV distribution’s tail behavior was noticeably influenced by the estimation method. The L-moments method resulted in a smoother, more reliable extrapolation for extreme quantiles, reducing bias compared to MOM.
For the Danube River, Giurgiu station (for T = 10,000 years), the Wakeby distribution predicted a maximum discharge of 18,372 m3/s. The GEV distribution predicted a maximum discharge ranging from 18,667 m3/s (MLE) to 22,217 m3/s (LSM). Compared to Pearson III (20,096 m3/s), GEV (L-moments) produced a lower value of 19,902 m3/s. By comparing the results to the Wakeby distribution (selected based on the considerations mentioned earlier), the GEV distribution generated higher percentage values, specifically +3% (MOM), +8.3% (L-moments), +7% (PWMs), +20.9% (LSM), +7.9% (LH-moments), +1.6% (MLE), and +11.9% (WLSM). Likewise, the Pearson III distribution produced higher percentage values, ranging from +9.4% (MOM) to +14.2% (L-moments).
In the case of the Danube River, Drobeta station, the Wakeby distribution predicted a maximum discharge of 22,023 m3/s. The GEV distribution predicted a maximum discharge ranging from 17,532 m3/s (MOM) to 23,573 m3/s (LH-moments). Compared to Pearson III (21,586 m3/s), GEV (L-moments) produced a higher value of 23,095 m3/s. By comparing the results to the Wakeby distribution, GEV generated percentage values of −20.4% (MOM), +4.9% (L-moments), −0.8% (PWMs), +5.2% (LSM), +7.0% (LH-moments), −8.8% (MLE), and +2.7% (WLSM). The Pearson III distribution produced lower percentage values, ranging from −2.0% (L-moments) to −15.4% (MOM).
Regarding the Siret River, the Wakeby distribution predicted a maximum discharge of 7916 m3/s. The GEV distribution predicted a maximum discharge ranging from 8069 m3/s (MOM) to 10,946 m3/s (MLE). Compared to Pearson III (7706 m3/s), GEV (L-moments) produced a higher value of 10,193 m3/s. By comparing the results to the Wakeby distribution, GEV generated percentage values of +1.9% (MOM), +28.8% (L-moments), +17.5% (PWMs), +8.3% (LSM), +33.9% (LH-moments), +38.2% (MLE), and +22.0% (WLSM). The Pearson III distribution produced lower percentage values, ranging from −2.7% (L-moments) to −5.8% (MOM).
For the Ialomita River, the Wakeby distribution predicted a maximum discharge of 495 m3/s. The GEV distribution predicted a maximum discharge ranging from 705 m3/s (MLE) to 979 m3/s (MOM). Compared to Pearson III (829 m3/s), GEV (L-moments) produced a lower value of 765 m3/s. By comparing the results to the Wakeby distribution, GEV generated higher percentage values, specifically +97.7% (MOM), +54.5% (L-moments), +47.5% (PWMs), +56.4% (LSM), +21% (LH-moments), +42.4% (MLE), and +60.2% (WLSM). The Pearson III distribution produced higher percentage values, ranging from +67.5% (L-moments) to +90.3% (MOM).
This variability in the predicted values is mainly due to the inability of the three-parameter distributions to calibrate the L-kurtosis [
34]. The statistical indicators L-skewness and L-kurtosis play a crucial role in influencing the predicted maximum discharges. L-kurtosis reflects the variability and extremity of maximum discharge values—higher L-kurtosis values indicate a greater tendency toward extreme events, leading to higher predicted quantiles [
34]. In the Danube River (Drobeta station), with an L-kurtosis of 0.159, the predicted values with GEV (L-moments) were relatively stable, aligning closely with Pearson III but slightly lower than Wakeby. At the Siret River (Lungoci station), where L-kurtosis was 0.185—signaling more extreme variability—the GEV distribution produced significantly higher quantiles than Pearson III, although they were still lower than Wakeby. The Ialomita River (Tandarei station) exhibited a relatively low L-kurtosis (0.025) which corresponded to less variable maximum discharges, resulting in lower quantile predictions across all distributions.
The criteria for choosing the best distribution are in compliance with the conditions imposed by the L-moments method (also chosen in these cases as a reference) but also in the use of some performance indicators that are based on highlighting the relative errors between recorded and forecasted values.
For the analyzed case studies, the RME (relative mean error) and RAE (relative absolute error) performance indicators are used [
60,
62]. In terms of performance indicators RME and RAE, their results are presented in
Table 8,
Table 9,
Table 10 and
Table 11.
However, it should be noted that these performance indicators generally highlight only the calibration within the range of observed values. Outside of this range (for selecting the best distribution for very rare events, T = 10,000 years) these indicators lose their relevance. For this reason, the L-moments method and distributions with at least four parameters are preferred, as it is an extremely stable and robust method and using distributions with a high number of parameters ensures that all calibration conditions associated with the method are met.
For distributions with two or three parameters, the selection of the best model is made by comparing the values of the statistical indicators L-skewness and L-kurtosis of the analyzed data with those characteristics of the statistical distributions, the latter being defined by specific interdependence relationships between these two statistical indicators [
24].
For the Danube River (Giurgiu station), the best result for the GEV distribution was obtained using the MOM method (RME = 0.0016) and the PWMs method (RAE = 0.0119). Analyzing exclusively the scores for L-moments, the GEV distribution provided the best result (RME = 0.0017, RAE = 0.0121).
In the case of the Danube River (Drobeta station), the RME scores were very close, around 0.0039. Regarding RAE, the best result with GEV was achieved using the L-mom and LH-mom methods (RAE = 0.0171). For L-moments, the GEV distribution provided the best result (RME = 0.0039, RAE = 0.0171).
For the Siret River (Lungoci station), the best result for the GEV distribution was obtained using the MLE method (RME = 0.0157) and the L-moments method (RAE = 0.0571). L-moments with GEV distribution gave the best result for RAE (0.0571), while the Pearson III distribution had the best RME score (0.0121).
For the Ialomita River (Tandarei station), the best result for the GEV distribution was obtained using the MOM method (RME = 0.0231 and RAE = 0.1039). Analyzing exclusively the scores for L-moments, the Wakeby distribution provided the best result for RME (0.0105), while the Pearson III distribution had the best RAE score (0.1073).
It is essential that when applying the L-moments method model selection be guided by criteria specifically tailored to this approach. In particular, emphasis should be placed on the accurate calibration of higher-order statistical indicators, namely L-skewness and L-kurtosis, which play a crucial role in characterizing the shape and tail behavior of the distribution. This process should involve both the use of theoretical interdependence relationships between L-moments and a preliminary visual (graphical) assessment, which helps identify candidate distributions that align with the empirical L-moment ratios. Such a dual approach ensures that the selected model not only fits the central tendency of the data but also reliably captures the behavior of extreme values, which is of primary interest in hydrological frequency analysis.
The Influence of Available Data Lengths on Forecasted Values.
In hydrological/hydrotechnical practice there is an apparent but inevitable contradiction between the stringent requirements imposed by national technical standards and international regulations, on the one hand, and the limited reality of available hydrological data, on the other.
Despite the fact that in the vast majority of cases long observation series are not available, especially those with over 100 years of records, standards still require the frequency analysis and estimation of extreme quantiles of annual maximum flows, even for very large return periods, such as T = 1,000, 5,000, or even 10,000 years. This requirement has a solid justification as hydrotechnical works of strategic importance—such as dams, dikes, water intakes, or flood defense works—must be designed to ensure a high degree of safety throughout the lifespan of the construction, considering even events with very low probability but potentially devastating impact.
However, the hydrometric reality in many countries—including Romania—is that the networks of hydrometric stations are relatively sparse or have been reduced in recent decades; the operating period of many stations is under 30–40 years and unmonitored sites or sites with discontinuous data series are common in mountainous areas or in small basins.
In these conditions, performing flood frequency analyses becomes an extremely difficult task and highly vulnerable to uncertainties. However, these analyses cannot be abandoned, and the solution lies in methodological rigor.
It is important to emphasize that estimating extreme flows does not mean exact prediction, but rather a probabilistic assessment of an extreme risk. This assessment, even if not perfect, is preferable to the complete disregard of the hazard. In this context, it is important to offer the confidence intervals associated with the estimates to perform a sensitivity analysis of the results to the length of the series and the choice of model, and an evaluation of residual risk and possible adaptation or protection measures.
The limited availability of sufficiently large data series, ensuring that the obtained values closely approximate those of the population, necessitates highlighting the theoretical biases associated with small- and medium-length data samples. Generally, these are the most common cases in frequency analysis. As previously mentioned, in general this analysis represents a secondary stage following the selection of the most suitable distribution and parameter estimation method. Thus, this section starts from the general assumption that the sample data would originate from a GEV distribution with parameters estimated using the method of moments (MOM) and L-moments.
Biases are presented both in the estimation of parameters and in the estimation of event magnitudes with large return periods. This also illustrates how parameter biases propagate into the biases of the inverse function.
In the sampling process Hazen’s empirical probability was used as the values obtained with this method were the closest to the theoretical curve (n = 1000 values).
Probabilities corresponding to recorded values, representing samples rather than populations—as small or medium data lengths are generally available—should not be directly associated with the relative frequency (i/n) but rather with an empirical probability where the minimum value exceeds the range of relative frequencies of the observed data. Thus, Hazen’s empirical probability generated the smallest errors compared to theoretical values with other empirical probabilities. Similar results were observed for the Gumbel distribution [
24]. The subjective choice of the empirical probability is most clearly highlighted in the case of the PWMs and LSM parameter estimation methods, which generate significant biases, especially in the case of the rarest events (T = 10,000 years).
Figure 6 graphically presents these differences (based on the case study of the Danube River, St. Giurgiu) following the analysis of six empirical probabilities, Hazen, Weibull, Tukey, Cunnane, Beard, and Chegodayev, whose expressions can be found in [
26,
42,
51]. Significant differences can be observed, considering that for this station the length of the records is relatively relevant (n = 78 values). In the case of shorter data lengths, these deviations increase significantly, with generated values being characterized by substantial uncertainties. For this reason these two methods are less commonly used. They can have relevant applicability when the probability range of the forecasted values is closer to that of the recorded data.
To cover as extensive a range as possible regarding statistical indicators and available data lengths, sample sizes of 25, 50, and 80 values are chosen to highlight the theoretical biases in estimating statistical indicators, parameter estimation, and quantile estimation.
To reflect real data applications, scenarios were selected with coefficients of variation ranging between 0.1 and 2 and skewness values between 0.1 and 6 for the MOM. For the L-moments analysis, coefficients of L-variation between 0.01 and 1 and L-skewness values between 0.01 and 1 were used. In the case of L-moments, a detailed analysis is presented with a smaller increment of statistical indicators, specifically samples of 20, 35, 55, and 90 values.
In general, in frequency analysis of extreme events in hydrology—especially for maximum flow rates, precipitation, volumes, and levels—the range of interest corresponds to low exceedance probabilities
p < 1% [
24,
38,
39]. This is also the range where the behavior of distributions varies, and real data extrapolation processes are strongly influenced by the particularities of statistical distributions and parameter estimation methods. Therefore, the results are presented for values characteristic of this range (left-hand, upper part of the graph).
Tables S4 and S5 of the Supplementary Files show the results obtained with the GEV distribution. It can be seen that the influence of data variability is much more pronounced due to the heavy-tail nature of this distribution.
For the rarest event of interest in hydrology (T = 10,000 years), the bi-axes of the GEV distribution, across the entire domain of the parameters obtained using L-moments, are presented in
Table S6 of the Supplementary Files. These values provide crucial insights into the behavior of the GEV distribution when applied to extreme events, particularly those with a very low probability of occurrence. The bi-axes represent the theoretical coefficient of L-variation and L-skewness, which both play a key role in accurately estimating the extreme values of the hydrological data. By using the L-moments method to estimate these statistical indicators, the calibration process ensures that the model remains robust and stable, allowing for more reliable predictions of rare hydrological events.
Table S6 in the Supplementary Files allows for a detailed comparison of these values, providing a deeper understanding of the GEV distribution’s performance in capturing extreme events, which is essential for the design and safety of critical infrastructure like dams and flood control systems.
It can be easily observed that the biases for extreme quantiles (predicted with GEV) varied significantly with data length, with the most notable discrepancies occurring for shorter records, especially for MOM.
These systematic theoretical biases are essential for understanding the uncertainty associated with extreme estimates using the GEV distribution with parameters estimated by MOM and L-moments.
The biases in the tables indicate the systematic deviation (i.e., the average difference between the estimated value and the actual value of the parameter). Negative values indicate a systematic underestimation of the quantiles (i.e., extreme values are smaller than the actual ones). Positive values indicate an overestimation. As n increases the biases tend to approach 0, indicating that the estimates become more precise as more data is available. If the biases are large, the estimates of maximum discharge for a very large T (e.g., 10,000 years) may be significantly wrong—which can have serious implications in the design of hydraulic structures (dams and levees). For small n values (e.g., 20) the deviations are very large, and extremes can be underestimated or overestimated by tens of percent. For large n values (e.g., 90) the estimates become more reliable, and the biases decrease.
These biases (specific percentage values for each distribution) cannot be “avoided,” regardless of how well the method selection was made. Thus, in addition to the uncertainty associated with the process of selecting the best model (distribution and estimation method), it is essential to present these systematic theoretical biases (generally adapted to the statistical indicators of the analyzed series, the distribution model, and the length of the available data series).
In the particular case of the four analyzed case studies and considering the values of the statistical indicator characteristics of the analyzed data series and the lengths of the available data, the biases associated with the values predicted using the GEV distribution (with the L-moments method), solely as a function of this variability in the availability of historical data, are presented in
Table 12. The biases (%, referenced to the theoretical curve values corresponding to a set of 1000 data) are shown both for the estimation of statistical indicators, the three parameters, and for the quantile with an annual exceedance probability of 0.01%.
It can be observed that for similar lengths of the available data series (n = 33, n = 38, and n = 39), the relative differences are still significant, mainly due to the torrentiality and variability of the data, expressed through higher values of the statistical indicators , , and . Nevertheless, the biases in forecasting such a rare event (which in many cases is considered “impossible” in terms of occurrence probability) are extremely small, ranging between 0.53% and 1.94%, once again highlighting the advantages of the L-moments method in flood frequency analysis.
This analysis will help researchers assess how “reliable” the extreme quantiles calculated with GEV are and adjust maximum discharge frequency analyses based on the sample size and chosen method.