Next Article in Journal
Assessing Impacts of Anthropogenic Modification on Surface Soil Moisture Dynamics: A Case Study over Southwest China
Previous Article in Journal
Improving Soil Water Simulation in Semi-Arid Agriculture: A Comparative Evaluation of Water Retention Curves and Inverse Modeling Using HYDRUS-1D
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Flood Frequency Analysis Using the Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals

by
Laura Berbesi-Prieto
and
Carlos Escalante-Sandoval
*
Faculty of Engineering, National Autonomous University of Mexico, Coyoacan, Mexico City 04510, Mexico
*
Author to whom correspondence should be addressed.
Hydrology 2025, 12(11), 274; https://doi.org/10.3390/hydrology12110274
Submission received: 3 September 2025 / Revised: 14 October 2025 / Accepted: 17 October 2025 / Published: 22 October 2025

Abstract

Flood frequency analysis is essential for designing resilient hydraulic infrastructure, but traditional stationary models fail to capture the influence of climate variability and land-use change. This study applies a bivariate logistic model with non-stationary marginals to eight gauging stations in Sinaloa, Mexico, each with over 30 years of maximum discharge records. We compared stationary and non-stationary Gumbel and Generalized Extreme Value (GEV) distributions, along with their bivariate combinations. Results show that the non-stationary bivariate GEV–Gumbel distribution provided the best overall performance according to AIC. Importantly, GEV and Gumbel marginals captured site-specific differences: GEV was most suitable for sites with highly variable extremes, while Gumbel offered a robust fit for more regular records. At station 10086, where a significant increasing trend was detected by the Mann–Kendall and Spearman tests, the stationary GEV estimated a 50-year return flow of 772.66 m3/s, while the non-stationary model projected 861.00 m3/s for 2075. Under stationary assumptions, this discharge would be underestimated, occurring every ~30 years by 2075. These findings demonstrate that ignoring non-stationarity leads to systematic underestimation of design floods, while non-stationary bivariate models provide more reliable, policy-relevant estimates for climate adaptation and infrastructure safety.

1. Introduction

Floods constitute one of the most critical natural hazards, producing extensive economic impacts, infrastructure deterioration, and considerable risks to human life. Reliable flood frequency analysis (FFA) is therefore essential, as it provides the basis for estimating the magnitude of extreme events given a return period (T) and for supporting the safe design and management of hydraulic structures and flood-risk mitigation strategies. In this context, Genest and Favre [1] provided a brief review of univariate and multivariate approaches to FFA.
Building on this perspective, research has increasingly turned to multivariate methods, which allow for a more complete representation of flood phenomena. Multivariate approaches to FFA have been applied globally. These studies often consider flood peaks, volumes, and durations [1,2,3]. While many analyses incorporate two or more hydrological variables to capture joint flood behavior, some studies have focused on a single variable across multiple stations within a multivariate framework [4,5]. These multivariate frameworks improve the realism of flood representation and support more reliable design criteria and risk management strategies in complex hydrological contexts.
A central limitation of conventional FFA, whether univariate or multivariate, is the assumption of stationarity. As defined by Milly et al. [6], stationarity implies that the probability distribution of a variable remains constant over time. However, growing evidence indicates that this assumption is often invalid due to climatic and non-climatic drivers, undermining the applicability of traditional statistical methods [7,8,9]. In response, a large body of research has focused on non-stationary FFA, with applications spanning diverse regions and methodological approaches [10,11].
Most non-stationary flood frequency studies rely on extreme value distributions such as GEV and Gumbel, combined with trend-detection techniques like the Mann–Kendall test. Time is typically used as the main covariate, consistently revealing linear trends, while large-scale climate indices such as ENSO, PDO, NAO, and global mean temperature have also been incorporated, highlighting their significant role in flood frequency dynamics [12].
While these non-stationary approaches provide a more flexible and realistic framework, their adoption also presents important challenges. First, they involve greater mathematical and computational complexity, as the number of parameters increases and convergence of iterative estimation can be difficult. Second, their performance depends critically on the appropriate selection of covariates (e.g., large-scale climatic indices or land-use metrics). When relevant covariates are unavailable or unstable, there is a risk of model misspecification and reduced predictive skill. Third, despite their flexibility, non-stationary models cannot fully eliminate the uncertainty in projecting future extremes, since climatic and land-use conditions evolve in ways that may not be captured by historical relationships [7]. Finally, the assumptions underlying the selected statistical distributions (such as GEV or Gumbel) may not always hold across diverse hydrological settings, requiring careful validation before generalization [13]. In addition, when copula-based approaches are used to model dependence, their validity depends strongly on the copula family selected, which may introduce further subjectivity and potential misspecification [1,2]. Recognizing these limitations is essential to ensure that non-stationary bivariate models are applied responsibly and interpreted with due caution in flood frequency analysis.
In the multivariate domain, copula-based methods have become the prevailing approach for modeling flood dependence structures. Copulas are flexible because they separate marginal behavior from the joint dependence function, and they have been applied extensively in hydrology [2,3,4]. However, they typically require the prior specification of copula families and can be computationally demanding when extended to non-stationary contexts. By contrast, logistic models for extremes [14,15] provide a direct parametric representation of dependence, allowing both marginal distributions and the dependence structure to evolve with covariates. This framework offers a simpler alternative to copulas while retaining the ability to capture changing joint dynamics.
Hydrological regionalization studies traditionally test homogeneity using L-moment-based approaches [16] or related statistical tests [17]. In this study, we complement these classical approaches by applying Andrew’s curves [18], a visualization-based technique that clusters stations according to morphometric attributes and their relationship with return levels, thereby providing an alternative justification of regional homogeneity prior to multivariate modeling [19,20].
Extending FFA to bivariate non-stationary marginals therefore represents a promising avenue to better capture the joint dynamics of flood-related variables under changing conditions. In this study, we address this gap by applying a bivariate logistic model with non-stationary Gumbel and Generalized Extreme Value (GEV) marginals to flood frequency analysis.

2. Materials and Methods

According to Durocher et al. [20], at-site frequency analysis based on annual maxima has a long-standing tradition in hydrology and remains one of the most widely adopted methods for quantile estimation [21]. Theoretical principles support the use of the GEV distribution, which represents the limiting distribution of block maxima [22]. In addition to the GEV distribution, this study also applies the Gumbel distribution in its stationary form, as part of the modeling framework [15].
Gumbel and GEV distributions:
F x = e e x v ,
F x = e 1 x v β 1 β .

2.1. Bivariate Logistic Model

Yue and Wang [14] emphasize that the bivariate logistic model (LM) is more adaptable than the mixed model, as it accommodates a broader range of correlation coefficients and dependency indices. This flexibility makes it a preferable choice for addressing frequency analysis problems in hydrology. The LM is defined as follows [15]:
F x , y , m = e x p l n F x m + l n F y m 1 / m ,
where F x and F ( y ) represent the marginal distribution functions, and m   ( m > 1 ) is the bivariate association parameter. When m = 1 , the bivariate distribution simplifies to the case of independence: F x , y , m = F x F ( y ) .
Using notation 1 for the Gumbel distribution and notation 2 for the GEV distribution, the possible combinations of bivariate extreme value (BEV) distributions are denoted as BEV11, BEV12, BEV21, and BEV22 [4,5]. The probability distribution and density function for each BEV combination are shown in Appendix A along with the return level functions.

2.2. Estimation of Bivariate Parameters

The likelihood function for n random variables is defined as the joint density of those variables and serves as a function of the parameters. Given a random sample x 1 , y 1 from a bivariate density, the corresponding likelihood function is formulated accordingly.
L x , y , θ i = i = 1 n f x i , y i , θ i .
Due to unequal record lengths in the analyzed samples (Figure 1 and Figure 2), a flexible formulation is required to accommodate all possible data arrangements. This formulation is based on the generalization proposed by Anderson [23].
The likelihood function corresponding to the maximum sample arrangement is derived accordingly.
L x , y , θ i = i = 1 n 1 f p i , θ i ( 1 ) I 1 · i = 1 n 2 f x i , y i , θ i 2 I 2 · i = 1 n 3 f q i , θ i 3 I 3 ,
where the following abbreviations are used:
  • n 1 , n 3 = univariate lengths of record before and after the common period n 2
  • n 2 = common bivariate period
  • p = variable with univariate record before the common period n 2
  • q = variable with univariate record after the common period n 2
  • x , y = variable with bivariate record during the common period n 2
  • I i = indicator number such that I i = 1 if n i > 0 or I i = 0 if n i = 0
  • θ i = parameter vector
Since the maximum of a function and its logarithm occur at the same point, and the logarithmic form of the maximum likelihood Equation (5) is easier to manipulate, the logarithmic likelihood function is employed.
  l n L x , y , θ i = I 1 i = 1 n l n f p i , θ i ( 1 ) + I 2 i = 1 n l n f x i , y i , θ i ( 2 ) + I 3 i = 1 n l n f q i , θ i ( 3 ) .
Maximum likelihood estimators for the parameters of bivariate extreme value distributions are obtained by maximizing Equation (6). Due to the complexity of the functions associated with bivariate probability densities, analytical solutions via differential calculus are not feasible. Instead, a direct search optimization process is required. Given the characteristics of the function to be optimized, Rosenbrock [24] recommended a restricted multivariate nonlinear optimization algorithm.

2.3. Non-Stationary GEV Models and Covariates

Coles [22] gives an example of an annual maximum series X t to which the GEV distribution can be applied, even though its record has changed linearly over the observation time, but in other respects, the distribution does not change. Using the notation G E V ν , α , β to denote the GEV distribution with parameters ν , α and β , it follows that a suitable model for X t , the annual maximum value in a year t, can be
X t ~ G E V ν t , α , β ,   where   ν t = ν 0 + ν 1 t ,
for parameters ν0 and ν1. In this way, variations over time in the observed process are modeled as a linear trend in the location parameter of the appropriate extreme value model, which in this case is the GEV distribution. The parameter ν1 corresponds to the annual rate of change in the annual maximum.
According to Coles [22], a different situation that may arise is that the extreme behavior of one series is related to that of another variable, called a covariate. For example, the values of a series of maximums may appear larger in years when the Southern Oscillation Index (SOI) is higher. This suggests the following model for X t :
X t ~ G E V ν t , α , β ,   where   ν t = ν 0 + ν 1 S O I t ,
and SOI(t) denotes the Southern Oscillation Index in year t.
There is a structural unit in all these examples. In each case, the extreme value parameters can be written in the form
θ t = h X T B ,
where θ denotes ν , α , or β , h is a specific function, B is a vector of parameters, and X is a model vector.
One advantage of maximum likelihood over other parameter estimation techniques is its adaptability to changes in model structure. Take, for example, a non-stationary GEV model to describe the distribution X t for t = 1 , , n :
X t ~ G V E ν t , α t , β t ,
where each parameter has an expression in terms of a vector of parameters and covariates of the type (9). Denoting the complete parameter vector as B, the likelihood is simply:
L B = t = 1 n g x t ; ν t , α ( t ) , β ( t ) ,
where g x t ; ν t , α ( t ) , β ( t ) denotes the GEV density function with parameters ν t , α ( t ) , β ( t ) evaluated at x t .

2.4. Covariate Choice and Functional Form

The covariates used to model non-stationarity in the location parameter were time, representing long-term gradual trends, and the large-scale climate indices PDO and SOI, representing interannual variability. The selection of these covariates was informed by preliminary correlation analyses and trend detection tests (Spearman and Mann–Kendall), which consistently indicated significant temporal trends and climatic teleconnections at the regional scale [15,22]. We adopted a linear form in the location parameter as a parsimonious first-order representation of these relationships, consistent with previous non-stationary extreme value studies [7,8,13]. While nonlinear terms or alternative covariates (e.g., precipitation anomalies, land-use metrics) may capture additional variability, our model comparison results showed limited improvements relative to linear formulations, suggesting that linear terms provide a robust and interpretable baseline [9].

2.5. Choice of Non-Stationary Model

Coles [22] presents an alternative method for quantifying uncertainty in maximum likelihood estimators using the deviation function, defined as
D = 2 l 1 M 1 l 0 M 0 ,
Here, l 0 M 0 and l 1 M 1 represent the maximized log-likelihoods under models M0 and M1, respectively. A high value of D indicates that model M1 explains significantly more variation in the data than M0, while a low value suggests that increasing model complexity does not enhance explanatory power.
Model M0 is rejected at a significance level α if D > c α , where c α is the (1−α) quantile of the X k 2 distribution, and k is the difference in dimensionality between models M1 and M0. This provides a formal criterion for preferring model M1 based on the magnitude of D.
To reduce statistical uncertainty and design safer hydraulic structures, this study proposes analyzing annual maximum discharges using a bivariate logistic model with non-stationary marginals. Figure 3 presents a schematic representation of the methodology followed, highlighting the main steps from data collection to flood frequency modeling.

2.6. Regional Homogeneity

The hydrological homogeneity of the study region was assessed using Andrew’s curves [18], which is a multivariate visualization technique that maps each observation into a continuous function of the form
f t = X 1 2 + X 2 s i n t + X 3 c o s t + X 4 s i n 2 t + X 5 c o s 2 t + ,
where X 1 , X 2 , X n are standardized attributes of each station, and the function is plotted over the range −π to π.
Attributes correspond to morphometric and climatological properties and their empirical relation to return levels. Candidate attributes were screened via multiple regression following Nathan and McMahon [19] and standard hydrologic practice [13,16,17], retaining drainage area (A), mean annual precipitation (MAP), and drainage density (DD) as most relevant for regionalization.

3. Results

3.1. Data

Eight stations with at least 30 years of records, located in a homogeneous region of Sinaloa in northern Mexico, were selected to apply the bivariate logistic model with non-stationary marginals for flood frequency analysis. Data were obtained from the official database of the National Commission of Water (CONAGUA) [25]. Table 1 lists record lengths and basic statistics. Outliers were screened (box plot, Grubbs test, three-sigma), independence verified (Anderson–Mantilla–Amigó), homogeneity tested (Helmert, Student’s t, Cramer, Pettit, Standard Normal, Buishand, Von Neumann), and trends assessed (Spearman, Mann–Kendall). Results are shown in Table 2.
Station 10086 exhibited an increasing trend, confirmed by graphical analysis (Figure 4) and statistical tests.

3.2. Delineation of Homogeneous Region

Attributes such as drainage area (A), mean annual precipitation (MAP), and drainage density (DD) were compiled and standardized (Table 3 and Table 4). Stations from a homogeneous region tend to produce Andrews curves that cluster, indicating similar hydrological behavior and flood-generating mechanisms. The clustering observed in Figure 5 confirmed that the eight stations form a hydrologically homogeneous region, justifying joint bivariate non-stationary analysis.
This validated homogeneity provides a solid basis for applying the bivariate non-stationary framework in the subsequent frequency analysis.

3.3. Frequency Analysis

After verifying independence and randomness, the Gumbel (1) and GEV (2) distributions were fitted, followed by four bivariate marginal combinations (6). Models with (D > 3.84) were retained, and the one with highest log-likelihood and lowest AIC was selected at each site.
The Akaike Information Criterion (AIC) is defined as
A I C = 2 l + 2 θ
where l is the maximum log-likelihood and θ represents the number of estimated parameters within the model.
Table 5 presents detailed results for Station 10086 (stationary and non-stationary, univariate and bivariate). Table 6 summarizes return levels across return periods under stationary and non-stationary assumptions. To complement tabular results, Figure 6 visualizes the differences in return levels between stationary GEV and non-stationary BEV21, making the divergence across time explicit. Non-stationary estimates reveal increasing return levels over time, underscoring the importance of incorporating trends into flood risk analysis.
Finally, Table 7 provides a regional overview of the best-fitting distributions for all stations in the homogeneous region, listing the selected covariates and model characteristics.
Although the analysis presented here focuses on point estimates of return levels, confidence intervals were not computed. We acknowledge that this represents a limitation, as measures of uncertainty are essential for decision-making in hydraulic design and flood risk management. The inclusion of confidence bounds would provide stakeholders with not only the expected magnitude of extreme events but also the range of plausible variability. In this study, the emphasis was on illustrating the methodological contribution of applying non-stationary bivariate models, and therefore the computation of uncertainty bands—through bootstrap resampling or Bayesian credible intervals—was beyond the present scope. We note, however, that these approaches constitute a natural extension of the framework and will be explored in future work to provide a more comprehensive quantification of flood frequency uncertainty [7].

4. Discussion

Traditional stationary univariate models assume that the statistical properties of extreme events remain constant over time. While simple to apply, they neglect temporal changes driven by climate variability, land-use, or anthropogenic impacts. As a result, return levels derived from stationary univariate approaches often misrepresent current and future risks [6].
Stationary bivariate models improve on this by accounting for dependence between two variables (for example, peak flow in two gauged stations), leading to more accurate joint return period estimates than univariate analyses [2]. However, they still impose the limitation of time-invariant marginals, which can bias estimates if the underlying processes evolve.
Traditional assumptions of stationarity are increasingly invalidated by anthropogenic influences such as urbanization, deforestation, and climate change, which significantly alter discharge records. Non-stationary univariate models allow distribution parameters to vary with time or covariates like PDO and SOI, capturing observed or expected trends [7,9,12]. Yet, they continue to overlook the dependence between variables, which is critical for compound flood risk.
The most comprehensive approach is the non-stationary bivariate model, which jointly models dependence while allowing marginals to evolve with time or climatic covariates [2,8,22,26,27]. This reduces misspecification bias in joint-tail estimation and improves return-level accuracy, while providing a platform for scenario analysis under future climate/land-use pathways. In practice, non-stationary bivariate models deliver policy-relevant results that reflect evolving hazards and their interactions-advantages confirmed here by the superior log-likelihood and AIC performance.

4.1. Performance of GEV and Gumbel Marginals

The best results were obtained when using GEV and Gumbel marginals to model annual maxima. The GEV distribution, with its shape parameter, offers flexibility for heavy, light, or bounded tails, while the Gumbel distribution, as a special case of the GEV, provides a parsimonious yet robust fit [21,22]. At both gauges, the fitted marginals captured the statistical properties of annual maxima.
The GEV was more suitable at the site with higher variability and occasional extreme peaks, while the Gumbel distribution fit better at the more regular site. This differentiation reflects catchment differences and translates to practice: return period estimates become more reliable when tailored to local discharge behavior.

4.2. Case Study: Station 10086

The analysis of station 10086 revealed a statistically significant increasing trend, detected by both the Spearman rank correlation test and the Mann–Kendall test [28,29]. This trend is reflected in the positive value of the location parameter ( v 11 ) of the non-stationary model. Figure 5 illustrates the importance of explicitly incorporating non-stationarity when modeling extremes.
For example, under stationary assumptions, the GEV distribution estimates a 50-year return flow of 772.66 m3/s. In contrast, the non-stationary BEV21 model projects a 50-year return flow of 861.00 m3/s for the year 2075 scenario. This difference implies that a discharge of 772.66 m3/s, considered a 50-year event under stationary assumptions, would be expected approximately every 30 years by 2075 when non-stationarity is accounted for. Such a misrepresentation has direct implications for hydraulic design, as ignoring trends may lead to underestimation of design floods and compromise the safety and resilience of hydraulic infrastructure [30].

4.3. Practical and Policy Implications

The adoption of non-stationary bivariate models has important implications for both engineering design and water policy. For engineers, it means that hydraulic structures—such as dams, levees, and urban drainage systems—can be designed with greater confidence and safety margins, reflecting realistic compound risks under changing conditions. For policymakers, non-stationary bivariate approaches support adaptive management strategies, allowing the testing of “what-if” scenarios under different climate or socio-economic pathways. This provides a sound basis for long-term investment decisions, disaster risk reduction strategies, and compliance with national and international climate adaptation frameworks [31].
Ultimately, the progression from stationary to non-stationary, and from univariate to bivariate analysis, represents a critical methodological advance in flood frequency analysis. It ensures that design standards and water policies evolve in step with changing climatic and hydrological realities, providing a more resilient and cost-effective basis for flood risk management.
While these findings underscore the value of non-stationary bivariate models, it is also essential to recognize their inherent limitations, which qualify the interpretation of results and indicate directions for future research.

4.4. Limitations of the Proposed Models

Despite the advantages of non-stationary bivariate models, several limitations must be acknowledged. First, the assumption of linear tendencies in the marginal parameters may not be universally valid. Sudden shifts in climate conditions or nonlinear changes in land-use may lead to dynamics that cannot be fully captured by linear formulations, potentially biasing return level estimates. Second, the bivariate logistic model employed here assumes a symmetric dependence structure, which may not adequately represent more complex or asymmetric relationships between flood variables. This limitation could affect joint-tail estimation in catchments where dependence varies across different ranges of extremes. Finally, as with all statistical models, results are constrained by data availability and the inherent uncertainties of projecting future hydrological extremes. Recognizing these limitations is essential for balanced interpretation and highlights the need for future research exploring nonlinear covariate effects and alternative dependence structures (e.g., asymmetric copulas [1]) to further enhance the robustness of flood frequency analysis.

5. Conclusions

This study confirms that non-stationary bivariate models provide the most reliable estimates of return levels, since they capture both the dependence between variables and the temporal evolution of extremes. Compared with stationary or univariate approaches, this framework reduces bias, improves accuracy, and supports scenario-based risk assessment under changing climatic and land-use conditions.
The use of GEV and Gumbel marginals yielded the best overall performance for the maximum discharges at the two gauging sites. The GEV distribution was particularly suited to the site with greater variability and occasional extremes, while the Gumbel distribution offered a parsimonious yet robust fit at the more stable site.
Importantly, the case of station 10086 illustrates how trend detection alters return level estimates. A discharge of 772.66 m3/s, considered a 50-year event under stationary assumptions, is projected by the non-stationary model to occur every ~30 years by 2075. Ignoring such non-stationarity could lead to systematic underestimation of design floods and compromise the safety of hydraulic infrastructure.
The scientific contribution of this work lies in demonstrating the added value of combining a bivariate logistic model with non-stationary marginals for flood frequency analysis. By explicitly incorporating both dependence structures and time- or climate-driven variability, this framework advances beyond traditional copula-based or stationary approaches and provides a transparent, data-driven methodology applicable to regional flood risk management.
Nevertheless, uncertainties remain. The assumption of linear covariate effects and symmetric dependence may not fully represent nonlinear climatic shifts or asymmetric joint behaviors, and future research should address these issues by exploring alternative functional forms, additional covariates, and flexible dependence structures such as asymmetric copulas. Moreover, the lack of confidence intervals around return level estimates is a limitation that will be addressed in future extensions through bootstrap or Bayesian approaches, to provide decision-makers with explicit measures of uncertainty.
In sum, this study underscores the importance of adopting flexible non-stationary approaches for both engineering design and water policy, while also recognizing the methodological challenges that remain. By tailoring flood frequency analysis to evolving hydrological regimes, decision-makers can avoid under- or over-design, improve safety margins, and align planning with climate adaptation frameworks.

Author Contributions

L.B.-P. and C.E.-S. both contributed to all aspects of this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data will be made available at a reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BEVBivariate Extreme Value Distribution
FFAFlood Frequency Analysis
GEVGeneralized Extreme Value
PDOPacific Decadal Oscillation
SOISouthern Oscillation Index
TReturn period

Appendix A. Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals

Let v i denote the linearly varying location parameter as v i = v i 0 + v i 1 c , where c indicates the covariate.

Appendix A.1. Probability Distribution and Density Functions for the BEV11 Distribution

F x i , y i ; v 1 , 1 , v 2 , 2 , m = e x p e m x v 1 1 + e m y v 2 2 1 / m
  f x i , y i ; v 1 , 1 , v 2 , 2 , m = e m x v 1 1 · e m y v 2 2 1 2 · e m x v 1 1 + e m y v 2 2 1 m 2 · e x p e m x v 1 1 + e m y v 2 2 1 m ·   m 1 + e m x v 1 1 + e m y v 2 2 1 m

Appendix A.2. Probability Distribution and Density Functions for the BEV12 Distribution

F x i , y i ; v 1 , 1 , v 2 , 2 , β 2 , m = e x p e m x v 1 1 + 1 y v 2 α 2 β 2 m β 2 1 / m
  f x i , y i ; v 1 , 1 , v 2 , 2 , β 2 , m = e m x v 1 1 · 1 y v 2 α 2 β 2 m β 2 1 1 2 · e m x v 1 1 + 1 y v 2 α 2 β 2 m β 2 1 m 2 · e x p e m x v 1 1 + 1 y v 2 α 2 β 2 m β 2 1 m · m 1 + e m x v 1 1 + 1 y v 2 α 2 β 2 m β 2 1 m

Appendix A.3. Probability Distribution and Density Functions for the BEV22 Distribution

F x i , y i ; v 1 , 1 , β 1 , v 2 , 2 , β 2 , m = e x p 1 x v 1 α 1 β 1 m β 1 + 1 y v 2 α 2 β 2 m β 2 1 / m
  f x i , y i ; v 1 , 1 , β 1 , v 2 , 2 , β 2 , m = 1 x v 1 α 1 β 1 m β 1 1 · 1 y v 2 α 2 β 2 m β 2 1 1 2 1 x v 1 α 1 β 1 m β 1 + 1 y v 2 α 2 β 2 m β 2 1 m 2 e x p 1 x v 1 α 1 β 1 m β 1 + 1 y v 2 α 2 β 2 m β 2 1 m · m 1 + 1 x v 1 α 1 β 1 m β 1 + 1 y v 2 α 2 β 2 m β 2 1 m
The Log-Likelihood function to be maximized:
l x , y , θ i = I 1 n 1 l n p + i = 1 n 1 A 1 A 2   + I 2 n 2 l n 1 + l n 2 + i = 1 n 2 C 2 + D 2 + ln E 3 + ln m 1 + E 2 E 2 + I 3 n 3 l n q + i = 1 n 3 B 1 B 2
where i   > 0 ,   m > 1 ,   n 2 > 10   and   i = 1 ,   2

Appendix A.4. Log-Likelihood Function for the BEV11 DistributionAppendix

A 1 = p i v p 0 v p 1 c P , A 2 = e x p ( A 1 ) B 1 = q i v q 0 v q 1 c q ,   B 2 = e x p ( B 1 ) C 1 = x i v 10 v 11 c 1 ,   C 2 = m C 1 ,     C 3 = e x p ( C 2 ) D 1 = y i v 20 v 21 c 2 ,   D 2 = m D 1 ,   D 3 = e x p ( D 2 ) E 1 = C 3 + D 3 ,                   E 2 = E 1 1 / m ,         E 3 = E 1 1 m 2

Appendix A.5. Log-Likelihood Function for the BEV12 Distribution

A 1 = p i v p 0 v p 1 c P , A 2 = e x p ( A 1 ) B 1 = 1 q i v q 0 v q 1 c q β q 1 β q ,                 B 2 = l n 1 q i v q 0 v q 1 c q β q 1 β q 1 C 1 = x i v 10 v 11 c 1 ,   C 2 = m C 1 ,     C 3 = e x p ( C 2 ) D 2 = l n 1 y i v 20 v 21 c 2 β 2 m β 2 1 ,             D 3 = 1 y i v 20 v 21 c 2 β 2 m β 2 E 1 = C 3 + D 3 ,                   E 2 = E 1 1 / m ,         E 3 = E 1 1 m 2

Appendix A.6. Log-Likelihood Function for the BEV22 Distribution

A 1 = 1 p i v p 0 v p 1 c p β p 1 β p ,               A 2 = l n 1 p i v p 0 v p 1 c p β p 1 β p 1 B 1 = 1 q i v q 0 v q 1 c q β q 1 β q ,               B 2 = l n 1 q i v q 0 v q 1 c q β q 1 β q 1 C 2 = l n 1 x i v 10 v 11 c 1 β 1 m β 1 1 ,             C 3 = 1 x i v 10 v 11 c 1 β 1 m β 1 D 2 = l n 1 y i v 20 v 21 c 2 β 2 m β 2 1 ,             D 3 = 1 y i v 20 v 21 c 2 β 2 m β 2 E 1 = C 3 + D 3 ,                   E 2 = E 1 1 / m ,             E 3 = E 1 1 m 2

Appendix A.7. Return Levels for the Gumbel and GEV Distributions

DistributionFunction
Gumbel X ^ T = v ^ ^ l n l n F ( x )
GEV X ^ T = v ^ + ^ β ^ 1 l n F ( x ) β ^
Non-Stationary Gumbel X ^ T = v ^ 0 + v ^ 1 c ^ l n l n F ( x )
Non-Stationary GEV X ^ T = v ^ 0 + v ^ 1 c + ^ β ^ 1 l n F ( x ) β ^
Where F x = 1 1 T stands for the non-exceedance probability and T represents the return period in years.

References

  1. Genest, C.; Favre, A.-C. Everything You Always Wanted to Know about Copula Modeling but Were Afraid to Ask. J. Hydrol. Eng. 2007, 12, 347–368. [Google Scholar] [CrossRef]
  2. Salvadori, G.; Michele, C.D.; Kottegoda, N.T.; Rosso, R. Extremes in Nature: An Approach Using Copulas; Water Science and Technology Library; Springer: Dordrecht, The Netherlands, 2007; Volume 56, ISBN 978-1-4020-4414-4. [Google Scholar]
  3. Zhang, L.; Singh, V.P. Copulas and Their Applications in Water Resources Engineering, 1st ed.; Cambridge University Press: Cambridge, UK, 2019; ISBN 978-1-108-56510-3. [Google Scholar]
  4. Coles, S.G.; Tawn, J.A. Modelling Extreme Multivariate Events. J. R. Stat. Soc. Ser. B Stat. Methodol. 1991, 53, 377–392. [Google Scholar] [CrossRef]
  5. Naveau, P.; Guillou, A.; Cooley, D.; Diebolt, J. Modelling Pairwise Dependence of Maxima in Space. Biometrika 2009, 96, 1–17. [Google Scholar] [CrossRef]
  6. Milly, P.C.D.; Betancourt, J.; Falkenmark, M.; Hirsch, R.M.; Kundzewicz, Z.W.; Lettenmaier, D.P.; Stouffer, R.J. Stationarity Is Dead: Whither Water Management? Science 2008, 319, 573–574. [Google Scholar] [CrossRef]
  7. Katz, R.W. Statistical Methods for Nonstationary Extremes. In Extremes in a Changing Climate; AghaKouchak, A., Easterling, D., Hsu, K., Schubert, S., Sorooshian, S., Eds.; Water Science and Technology Library; Springer: Dordrecht, The Netherlands, 2013; Volume 65, pp. 15–37. ISBN 978-94-007-4478-3. [Google Scholar]
  8. Serinaldi, F.; Kilsby, C.G. Stationarity Is Undead: Uncertainty Dominates the Distribution of Extremes. Adv. Water Resour. 2015, 77, 17–36. [Google Scholar] [CrossRef]
  9. Villarini, G.; Smith, J.A.; Serinaldi, F.; Bales, J.; Bates, P.D.; Krajewski, W.F. Flood Frequency Analysis for Nonstationary Annual Peak Records in an Urban Drainage Basin. Adv. Water Resour. 2009, 32, 1255–1266. [Google Scholar] [CrossRef]
  10. López, J.; Francés, F. Non-Stationary Flood Frequency Analysis in Continental Spanish Rivers, Using Climate and Reservoir Indices as External Covariates. Hydrol. Earth Syst. Sci. 2013, 17, 3189–3203. [Google Scholar] [CrossRef]
  11. Lima, C.H.R.; Lall, U.; Troy, T.; Devineni, N. A Hierarchical Bayesian GEV Model for Improving Local and Regional Flood Quantile Estimates. J. Hydrol. 2016, 541, 816–823. [Google Scholar] [CrossRef]
  12. Salas, J.D.; Obeysekera, J. Revisiting the Concepts of Return Period and Risk for Nonstationary Hydrologic Extreme Events. J. Hydrol. Eng. 2014, 19, 554–568. [Google Scholar] [CrossRef]
  13. Stedinger, J.R.; Vogel, R.M.; Foufoula-Georgia, E. Frequency Analysis of Extreme Events. In Handbook of Hydrology; Maidment, D.R., Ed.; McGraw Hill: New York, NY, USA, 1993; pp. 18.1–18.66. [Google Scholar]
  14. Yue, S.; Wang, C.Y. A Comparison of Two Bivariate Extreme Value Distributions. Stoch. Environ. Res. Risk Assess. 2004, 18, 61–66. [Google Scholar] [CrossRef]
  15. Gumbel, E.J. Bivariate Logistic Distributions. J. Am. Stat. Assoc. 1961, 56, 335–349. [Google Scholar] [CrossRef]
  16. Hosking, J.R.M.; Wallis, J.R. Regional Frequency Analysis: An Approach Based on L-Moments, 1st ed.; Cambridge University Press: Cambridge, UK, 1997; ISBN 978-0-521-43045-6. [Google Scholar]
  17. Viglione, A.; Laio, F.; Claps, P. A Comparison of Homogeneity Tests for Regional Frequency Analysis. Water Resour. Res. 2007, 43, 2006WR005095. [Google Scholar] [CrossRef]
  18. Andrews, D.F. Plots of High-Dimensional Data. Biometrics 1972, 28, 125. [Google Scholar] [CrossRef]
  19. Nathan, R.J.; McMahon, T.A. Identification of Homogeneous Regions for the Purposes of Regionalisation. J. Hydrol. 1990, 121, 217–238. [Google Scholar] [CrossRef]
  20. Durocher, M.; Burn, D.H.; Mostofi Zadeh, S. A Nationwide Regional Flood Frequency Analysis at Ungauged Sites Using ROI/GLS with Copulas and Super Regions. J. Hydrol. 2018, 567, 191–202. [Google Scholar] [CrossRef]
  21. Bezak, N.; Brilly, M.; Šraj, M. Comparison between the Peaks-over-Threshold Method and the Annual Maximum Method for Flood Frequency Analysis. Hydrol. Sci. J. 2014, 59, 959–977. [Google Scholar] [CrossRef]
  22. Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer Series in Statistics; Springer: London, UK, 2001; ISBN 978-1-84996-874-4. [Google Scholar]
  23. Anderson, T.W. Maximum Likelihood Estimates for a Multivariate Normal Distribution When Some Observations Are Missing. J. Am. Stat. Assoc. 1957, 52, 200–203. [Google Scholar] [CrossRef]
  24. Rosenbrock, H.H. An Automatic Method for Finding the Greatest or Least Value of a Function. Comput. J. 1960, 3, 175–184. [Google Scholar] [CrossRef]
  25. CONAGUA Banco Nacional de Datos de Aguas Superficiales (BANDAS). Available online: https://app.conagua.gob.mx/bandas/ (accessed on 1 April 2024).
  26. Salas, J.D.; Obeysekera, J.; Vogel, R.M. Techniques for Assessing Water Infrastructure for Nonstationary Extreme Events: A Review. Hydrol. Sci. J. 2018, 63, 325–352. [Google Scholar] [CrossRef]
  27. Zhang, L.; Singh, V.P. Bivariate Flood Frequency Analysis Using the Copula Method. J. Hydrol. Eng. 2006, 11, 150–164. [Google Scholar] [CrossRef]
  28. Mann, H.B. Nonparametric Tests Against Trend. Econometrica 1945, 13, 245. [Google Scholar] [CrossRef]
  29. Kendall, M.G. Rank Correlation Methods, 4th ed.; Griffin: Salisbury South, Australia, 1975. [Google Scholar]
  30. Rootzén, H.; Katz, R.W. Design Life Level: Quantifying Risk in a Changing Climate: Design Life Level. Water Resour. Res. 2013, 49, 5964–5972. [Google Scholar] [CrossRef]
  31. Intergovernmental Panel On Climate Change (IPCC). Climate Change 2021—The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, 1st ed.; Cambridge University Press: Cambridge, UK, 2021; ISBN 978-1-009-15789-6. [Google Scholar]
Figure 1. Minimum bivariate sample arrangement.
Figure 1. Minimum bivariate sample arrangement.
Hydrology 12 00274 g001
Figure 2. Maximum bivariate sample arrangement.
Figure 2. Maximum bivariate sample arrangement.
Hydrology 12 00274 g002
Figure 3. The Methodology of the Flood Frequency Analysis applied in this study.
Figure 3. The Methodology of the Flood Frequency Analysis applied in this study.
Hydrology 12 00274 g003
Figure 4. Maximum instantaneous discharges (m3/s) recorded at Station 10086.
Figure 4. Maximum instantaneous discharges (m3/s) recorded at Station 10086.
Hydrology 12 00274 g004
Figure 5. Andrews curves constructed from morphometric and climatological attributes (A, MAP, DD. Clustering supports hydrological homogeneity of the study region.
Figure 5. Andrews curves constructed from morphometric and climatological attributes (A, MAP, DD. Clustering supports hydrological homogeneity of the study region.
Hydrology 12 00274 g005
Figure 6. Return level curves (m3/s) for Station 10086 under stationary GEV and non-stationary BEV21 models.
Figure 6. Return level curves (m3/s) for Station 10086 under stationary GEV and non-stationary BEV21 models.
Hydrology 12 00274 g006
Table 1. Some characteristics of the stations analyzed.
Table 1. Some characteristics of the stations analyzed.
StationPeriodLength of Record (Years)MeanStandard
Deviation
KurtosisSkewnessVariation
Coefficient
100271938–199558285.79260.8313.682.740.91
100651953–1999471218.731081.7613.592.840.89
100661955–200551311.89278.6716.363.180.89
100791959–1999411016.911642.4119.863.741.62
100831960–199233458.64425.135.851.580.93
100861960–199233236.33156.804.481.190.66
101111958–2009521315.931558.2214.233.031.18
101371958–2008511058.15992.917.592.020.94
Table 2. Independence, Homogeneity, and Trend Test Results.
Table 2. Independence, Homogeneity, and Trend Test Results.
StationIndependent?Homogeneous?Trend?Increasing?
10027YesYesNoYes
10065YesYesNoNo
10066YesNoNoNo
10079YesYesYesYes
10083YesYesNoYes
10086YesNoYesYes
10111YesYesNoNo
10137YesYesNoNo
Table 3. Raw values of drainage area (A), mean annual precipitation (MAP), and drainage density (DD) for study stations.
Table 3. Raw values of drainage area (A), mean annual precipitation (MAP), and drainage density (DD) for study stations.
StationA (km2)MAP (mm)DD (km/km2)
10027388.259912.90.2135
100656102.181871.50.2474
100661373.576777.450.2471
100791010.8911031.190.2746
10083827.432754.80.2750
10086226.588831.20.2356
101115277.521993.50.2471
101373300.188982.90.2396
Table 4. Standardized morphometric and climatic attributes for study stations.
Table 4. Standardized morphometric and climatic attributes for study stations.
StationArea (km2)MAP (mm)DD (km/km2)
100270.1698.85510.579
100652.6568.45412.259
100660.5987.54212.244
100790.44010.00313.607
100830.3607.32213.626
100860.0998.06311.674
101112.2979.63712.244
101371.4369.53511.872
Table 5. Parameter estimates for Station 10086 under stationary and non-stationary univariate and bivariate models.
Table 5. Parameter estimates for Station 10086 under stationary and non-stationary univariate and bivariate models.
DistributionCovariateM v 01 v 11 1 β 1 v 02 v 22 2 l AICD
Gumbel--168.54 108.89 −203.88411.75
GEV--155.56 96.66−0.235 −204.12414.23
BEV 21-10137-1.11154.55 92.42−0.094681.09 574.59−200.14406.28
BEV 21-10137Time1.10105.822.7786.74−0.123745.27−2.45573.55−198.55405.1111.13
Table 6. Estimated return levels (m3/s) for Station 10086 under stationary and non-stationary models across different return periods.
Table 6. Estimated return levels (m3/s) for Station 10086 under stationary and non-stationary models across different return periods.
DistributionScenariotT (years)
25102050100
Gumbel--208.45331.86413.58491.96593.42669.44
GEV--192.55329.32442.08570.59772.66955.82
BEV 21-10137--189.01303.43386.20471.27590.31686.62
BEV 21-10137 non-stationary in time202566320.99431.32513.29599.33722.62824.71
205091390.17500.51582.47668.52791.81893.90
2075116459.36569.69651.66737.71861.00963.08
2125166597.74708.07790.03876.08999.371101.46
Table 7. Best-fitting distributions for stations in the homogeneous region.
Table 7. Best-fitting distributions for stations in the homogeneous region.
StationBest-Fitting
Distribution
CovariateM v 01 v 11 1 β 1 v 02 v 22 2 l AIC
10027BEV 21-10065SOI1.18165.48−4.68112.91−0.219825.22−75.56584.65−369.18746.36
10065BEV 21-10027Time1.15804.27−1.31436.67−0.191190.17−0.06148.03−361.49730.98
10066BEV 21-10079SOI1.07192.234.25103.89−0.236527.8026.00637.16−320.53649.06
10079BEV 21-10066Time1.10337.401.80270.49−0.478257.48−1.57142.75−301.79611.58
10083BEV 21-10065Time1.34186.500.59155.50−0.455909.26−2.88584.78−223.72455.45
10086BEV 21-10137Time1.10105.822.7786.74−0.123745.27−2.45573.55−198.55405.11
10111BEV 21-10065Time2.05800.38−6.16338.99−0.485943.79−2.84626.07−395.48798.95
10137BEV 21-10066SOI1.13512.21−44.16344.23−0.524215.82−4.84146.25−389.67787.35
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Berbesi-Prieto, L.; Escalante-Sandoval, C. Flood Frequency Analysis Using the Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals. Hydrology 2025, 12, 274. https://doi.org/10.3390/hydrology12110274

AMA Style

Berbesi-Prieto L, Escalante-Sandoval C. Flood Frequency Analysis Using the Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals. Hydrology. 2025; 12(11):274. https://doi.org/10.3390/hydrology12110274

Chicago/Turabian Style

Berbesi-Prieto, Laura, and Carlos Escalante-Sandoval. 2025. "Flood Frequency Analysis Using the Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals" Hydrology 12, no. 11: 274. https://doi.org/10.3390/hydrology12110274

APA Style

Berbesi-Prieto, L., & Escalante-Sandoval, C. (2025). Flood Frequency Analysis Using the Bivariate Logistic Model with Non-Stationary Gumbel and GEV Marginals. Hydrology, 12(11), 274. https://doi.org/10.3390/hydrology12110274

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop