Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas

Smith, C. Haden; Skahill, Brian; Margo, David A.

doi:10.3390/w18050636

Open AccessArticle

Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas

by

C. Haden Smith

^1,*

,

Brian Skahill

²

and

David A. Margo

³

¹

U.S. Army Corps of Engineers, Risk Management Center, Lakewood, CO 80228, USA

²

Fariborz Maseeh Department of Mathematics & Statistics, Portland State University, Portland, OR 97201, USA

³

HDR, Inc., Pittsburgh, PA 15219, USA

^*

Author to whom correspondence should be addressed.

Water 2026, 18(5), 636; https://doi.org/10.3390/w18050636

Submission received: 28 January 2026 / Revised: 2 March 2026 / Accepted: 5 March 2026 / Published: 7 March 2026

(This article belongs to the Special Issue Urban Flood Frequency Analysis and Risk Assessment, 2nd Edition)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Urban flood frequency analysis faces unique challenges as land development alters watershed hydrology, producing nonstationary flood records. This study demonstrates nonstationary flood frequency analysis (NSFFA) using RMC-BestFit, an open-source Bayesian software, through two Texas case studies. Brays Bayou at Houston (96 years of record) exemplifies an urbanized watershed with increasing flood trends; a step-logistic model captures both the abrupt increase in mean flood magnitude around 1968 and the progressive decrease in log-space variance as urbanization homogenized runoff response. O.C. Fisher Reservoir (169 years of record) exhibits decreasing trends attributed to brush encroachment and groundwater extraction; despite a sinusoidal model achieving best information criteria, a step function was selected based on physical reasoning, demonstrating that statistical fit alone should not dictate model selection. Results reveal contrasting frequency curve patterns: at O.C. Fisher, stationary and nonstationary curves differ uniformly (53% reduction in 100-year flood), while at Brays Bayou, curves differ substantially for frequent events (48% increase in 2-year flood) but converge in the extreme tail due to opposing trends in location and scale parameters. These findings underscore that NSFFA relevance depends on decision context. Bayesian methods offer key advantages including flexible integration of diverse data sources, comprehensive uncertainty quantification, and principled model comparison. Open-source software democratizes access to these methods, promoting transparency and reproducibility.

Keywords:

urban floods; Bayesian analysis; nonstationarity; land use change; uncertainty; risk assessment; open-source software

1. Introduction

Urban areas face disproportionate flood risks globally, with dense populations and critical infrastructure concentrated in floodplains. The Centre for Research on the Epidemiology of Disasters (CRED) reported that floods were the most common disasters worldwide in 2023, affecting several million people and causing over ten thousand deaths [1]. According to CRED, floods and severe storms (including tropical cyclones and convective events) accounted for more than 70% of natural disasters on average between 2003 and 2023. Urban areas experience particularly severe impacts due to concentrated development and hydrologic modifications. Furthermore, flood risk management infrastructure in the U.S. and globally is aging, making many existing structures vulnerable to the compounding impacts of climate change. Urban flood frequency analysis presents unique challenges as ongoing development fundamentally alters watershed runoff characteristics, producing flood records that violate the stationarity assumption underlying traditional frequency methods [2,3].

Both natural and anthropogenic influences can alter flood behavior over time. A warming climate increases the amount of precipitable water in the atmosphere, which could increase the likelihood of extreme floods. Conversely, land use changes such as agricultural intensification and groundwater extraction can reduce runoff generation in some watersheds. Therefore, incorporating climate change and land use evolution into flood hazard and risk analysis is crucial for dam and levee safety professionals, urban planners, and infrastructure managers.

Urbanization increases flood peaks through multiple mechanisms: expansion of impervious surfaces reduces infiltration capacity, channelization and drainage improvements increase conveyance efficiency, and loss of floodplain storage reduces attenuation [4]. These changes can be rapid, with urban watersheds experiencing order-of-magnitude increases in impervious cover over several decades. Consequently, flood frequency distributions estimated from historical data may poorly represent current or future conditions if temporal trends are not explicitly modeled [5,6].

1.1. The Stationarity Problem

Traditional flood frequency analysis assumes that flood magnitudes are independent and identically distributed samples from a stationary probability distribution [7]. This stationarity paradigm has been challenged as observational evidence increasingly demonstrates temporal nonstationarity in hydrologic extremes [2,8]. Bayazit [3] provided a comprehensive review of nonstationarity in hydrological records and methods for trend detection. For urban watersheds undergoing active development, the stationarity assumption is particularly problematic, as the watershed generating flood events today differs fundamentally from the watershed that produced historical flood observations.

Nonstationary flood frequency analysis (NSFFA) provides a framework for explicitly modeling temporally evolving flood risk. Rather than assuming constant distribution parameters, NSFFA allows location, scale, and shape parameters to vary with time or covariates [9,10,11]. Strupczewski et al. [9] introduced the foundational maximum likelihood framework for nonstationary at-site modeling. Recent advances have demonstrated various NSFFA approaches across multiple frameworks. Debele et al. [12,13] applied generalized additive models for location, scale, and shape (GAMLSS) parameters to Polish flood data. Cheng and AghaKouchak [14] developed nonstationary intensity–duration–frequency curves for precipitation extremes under climate change. Chen et al. [15] provided a comprehensive comparison of linear, nonlinear, parametric, and nonparametric regression models for NSFFA. Grego and Yates [16] recently introduced robust local likelihood estimation methods. Wasko et al. [17] provided a systematic review of climate change science relevant to Australian design flood estimation, including NSFFA approaches.

1.2. Bayesian Approaches to Flood Frequency Analysis

Bayesian methods have become increasingly prominent in flood frequency analysis due to their ability to incorporate prior information, quantify uncertainty comprehensively, and provide coherent probabilistic inference [18]. Foundational work by Kuczera [19,20] established Bayesian approaches for regional skew estimation and at-site flood frequency analysis using Monte Carlo methods. Reis and Stedinger [21] extended these methods to incorporate historical flood information through Markov Chain Monte Carlo (MCMC) sampling. Viglione et al. [22] developed a comprehensive Bayesian framework for flood frequency hydrology integrating multiple information sources. O’Connell et al. [23] demonstrated Bayesian analysis with paleohydrologic bound data, enabling incorporation of geologic evidence.

Regional Bayesian approaches have been developed to pool information across sites while accounting for spatial heterogeneity [24,25,26]. Kwon et al. [27] introduced climate-informed hierarchical Bayesian models for Montana flood frequency.

Bayesian approaches to NSFFA have been demonstrated by Renard et al. [10,28], Ouarda and El-Adlouni [29,30], Luke et al. [31], Xu et al. [32], and Bracken et al. [33]. Ouarda and El-Adlouni [30] provided a foundational discussion of Bayesian NSFFA with efficient estimation using generalized maximum likelihood methods. Bracken et al. [33] extended Bayesian hierarchical methods to multivariate nonstationary frequency analysis. International applications include Bossa et al. [34] in West Africa and Xiong et al. [35] analyzing 547 years of Yangtze River flood records under nonstationarity.

Despite these methodological advances, practical application of Bayesian NSFFA remains limited in engineering practice due to software availability, computational complexity, and challenges in selecting appropriate trend structures from numerous possible models.

1.3. Model Selection for Nonstationary Analysis

Model selection presents a critical challenge in both stationary and nonstationary flood frequency analysis. Di Baldassarre et al. [36] and Laio et al. [37] developed comprehensive frameworks for comparing candidate probability distributions using information criteria including AIC and BIC. Haddad and Rahman [38] demonstrated distribution and parameter estimation selection in Australian applications.

For nonstationary analysis, model selection must additionally address trend structure complexity. Parsimonious approaches are essential to avoid overfitting, particularly given typically limited flood record lengths. Serago and Vogel [39] introduced parsimonious NSFFA methods emphasizing the importance of balancing model complexity with predictive performance. Singh et al. [40] developed a diagnostic framework for comparing stationary and nonstationary analyses under changing climate.

1.4. RMC-BestFit Software

The U.S. Army Corps of Engineers (USACE) Risk Management Center developed RMC-BestFit as a Bayesian estimation and fitting software designed to enhance flood hazard assessments [41,42]. The software implements a comprehensive Bayesian framework that can incorporate multiple sources of hydrologic information including systematic streamflow data, historical flood records, paleoflood indicators, regional rainfall-runoff analyses, and expert elicitation [22,43,44].

RMC-BestFit employs MCMC methods to sample from posterior distributions of distribution parameters, providing full characterization of parameter uncertainty. The Differential Evolution Markov Chain with snooker update (DE-MCzs) algorithm [45] enables efficient exploration of complex posterior distributions. Recent enhancements include NSFFA capabilities allowing distribution parameters to vary with time through flexible trend functions: linear, polynomial, exponential, logistic, power, sinusoidal, and step functions. Model selection tools based on Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Deviance Information Criterion (DIC) help identify parsimonious trend structures balancing goodness-of-fit with complexity [37,38].

The software is freely available as open-source code on GitHub at https://github.com/USACE-RMC/RMC-BestFit (accessed on 7 November 2025), promoting transparency, reproducibility, and community development. Version 2.0 Beta-3 was used for all analyses in this study. All figures except for the study area map were created using RMC-BestFit.

1.5. Objectives

This study demonstrates practical application of Bayesian NSFFA using RMC-BestFit through two contrasting Texas case studies selected for their clear physical drivers and strong nonstationary signals. The first case study examines Brays Bayou at Houston, one of the most intensively urbanized watersheds in the United States, where 96 years of flood data (1929–2024) including perception-threshold historical information reveal increasing trends in flood magnitude accompanied by decreasing variance as urbanization homogenized the watershed’s runoff response. The second case study analyzes O.C. Fisher Reservoir, where 169 years of historical and systematic data (1853–2021) show decreasing trends attributed to brush encroachment and groundwater extraction.

The objectives are to (1) outline the Bayesian NSFFA methodology and suite of trend options implemented in RMC-BestFit; (2) demonstrate model selection procedures using multiple information criteria while emphasizing the critical role of physical reasoning; (3) evaluate how changes in both location and scale parameters interact to produce different patterns of difference between stationary and nonstationary frequency curves; (4) illustrate how the relevance of NSFFA depends on decision context (frequent events versus extreme events); and (5) provide practical guidance on trend model selection for different watershed types and nonstationarity mechanisms.

2. Materials and Methods

2.1. Likelihood Function

A key component of Bayesian flood frequency analysis is the likelihood function, which gives the probability of the observed data conditional on the distribution parameters [20]. For stationary analysis with exact (systematic) data only, the likelihood is

L (x | θ) = \prod_{i = 1}^{N} f (x_{i} | θ)

(1)

where

x = (x_{1}, \dots, x_{N})

is the sample of observed annual maximum flows,

θ

represents the constant distribution parameters, and

f (\cdot)

is the probability density function.

2.2. Nonstationary Likelihood Function

When distribution parameters vary with time, each observation

x_{t}

at time t follows a distribution with time-dependent parameters

θ_{t}

[9,30]. The nonstationary likelihood becomes:

L (x | θ) = \prod_{t = 1}^{T} f (x_{t} | θ_{t})

(2)

The likelihood can be extended to accommodate various data types commonly encountered in flood frequency analysis [20,21,23,46]:

L (x | θ) = \prod_{t = 1}^{T} \{\begin{matrix} f (x_{t} | θ_{t}) & exact (systematic) data \\ F (x_{t}^{U} | θ_{t}) & left censored (below threshold) \\ 1 - F (x_{t}^{L} | θ_{t}) & right censored (above threshold) \\ F (x_{t}^{U} | θ_{t}) - F (x_{t}^{L} | θ_{t}) & interval censored \end{matrix}

(3)

where

F (\cdot)

is the cumulative distribution function and superscripts U and L denote upper and lower bounds for censored observations. Left censoring indicates the observation is known to be below a threshold but the exact magnitude is unknown; right censoring indicates the observation exceeded a threshold. This flexibility allows incorporation of historical flood information following the foundational work of Stedinger and Cohn [47], as demonstrated by Xiong et al. [35] for multi-century flood records.

2.3. Bayesian Framework and Prior Information

In the Bayesian approach to flood frequency analysis, distribution parameters

θ = {μ, σ, ξ}

(representing location (

μ

), scale (

σ

), and shape (

ξ

)) are treated as random variables rather than fixed constants [18,20]. This conceptualization enables explicit quantification of parameter uncertainty arising from limited data.

Bayesian inference combines observed data with prior information through Bayes’ theorem [20]:

p (θ | x) = \frac{L (x | θ) \cdot π (θ)}{\int L (x | θ) \cdot π (θ) d θ}

(4)

where

p (θ | x)

is the posterior probability distribution of the parameters given the observed data,

L (x | θ)

is the likelihood function,

π (θ)

is the prior distribution encoding information available before analyzing the current data, and the denominator serves as a normalizing constant.

The prior distribution is not derived from the observed flood data, but instead comes from other sources that can be either subjective (e.g., expert opinion) or objective (e.g., regional analysis, previous studies). This prior information is particularly valuable for constraining parameters that are poorly informed by limited data.

2.3.1. Prior Information on Parameters

Prior information for distribution parameters can include regional hydrologic knowledge [19,24], paleoflood evidence [23], or information from analogous watersheds. RMC-BestFit automatically develops default priors to ensure ease of use while maintaining statistical rigor. Flat (uniform) priors are applied to all distribution parameters (location, scale, and shape), centered near the likelihood with broad spread to remain relatively uninformative. Additionally, a Jeffreys’ prior is applied to the scale parameter by default, which provides appropriate behavior for positive-valued parameters and is equivalent to using a log-link function. Users can override these defaults with informative priors when regional or prior knowledge is available.

While flat priors yield posterior inference that approximates maximum likelihood results for well-identified parameters, the Bayesian framework retains key advantages: proper uncertainty quantification through the full posterior distribution rather than asymptotic approximations, natural handling of complex likelihoods with censored data types, and posterior predictive distributions that integrate parameter uncertainty.

The joint prior probability of model parameters is given by

π (θ) = π (θ_{1} \cap θ_{2} \cap \dots \cap θ_{p})

(5)

where p is the number of parameters. In RMC-BestFit, parameters are treated as independent, so the joint prior simplifies to

π (θ) = \prod_{i = 1}^{p} f (θ_{i})

(6)

where

f (\cdot)

is the probability density function for each parameter’s prior distribution.

2.3.2. Prior Information on Quantiles

Viglione et al. [22] presented a framework for incorporating causal information from rainfall-runoff modeling or regional analysis through quantile priors. Causal information expansion analyzes the generating mechanisms of floods in the catchment and is typically derived using rainfall-runoff models or expert elicitation.

A quantile prior specifies the distribution of a flood magnitude

x_{p}

corresponding to a specific annual exceedance probability p:

h (x_{p}) = f (x_{p} | θ_{h})

(7)

where

h (\cdot)

is a probability density function for the quantile prior and

θ_{h}

represents the hyperparameters of the chosen prior distribution (e.g., mean and standard deviation). RMC-BestFit supports several distribution options for quantile priors including Normal, Log-Normal, and Truncated Normal, allowing users to select the form most appropriate for their prior information. The flood quantile

x_{p}

relates to distribution parameters through the inverse cumulative distribution function:

x_{p} = F^{- 1} (p | θ_{T})

(8)

where

θ_{T}

are the distribution parameters at the final time step. The inverse CDF is then used to evaluate the quantile prior in parameter space:

π_{q} (θ) = h (F^{- 1} (p | θ_{T}))

(9)

During MCMC sampling, the quantile prior acts as a penalty function, rewarding parameter sets that produce flood quantiles consistent with the prior information and discounting those that do not. The complete posterior distribution combines all information sources:

p (θ | x) \propto L (x | θ) \cdot π (θ) \cdot π_{q} (θ)

(10)

This framework allows systematic integration of multiple data types: at-site observations through the likelihood, regional parameter knowledge through parameter priors, and rainfall-runoff or climate projection results through quantile priors.

RMC-BestFit also implements quantile priors following the method of Coles and Tawn [44], which provides a more complete mathematical treatment allowing multiple quantile priors to be specified simultaneously. This approach requires that the number of quantile priors equals the number of distribution parameters; for a three-parameter distribution such as Log-Pearson Type III, exactly three quantile priors must be specified (e.g., at the 0.10, 0.01, and 0.001 exceedance probabilities). The O.C. Fisher case study demonstrates this approach using quantile priors derived from rainfall-runoff modeling.

An important consideration for nonstationary analysis is that prior distributions on parameters and quantiles are applied at the final time step T (representing current or future conditions). This ensures that prior knowledge about current flood behavior appropriately constrains the temporal evolution of the distribution through the trend functions [22]. The trend parameters themselves (slopes, change points, etc.) typically receive uninformative priors.

Global Climate Model (GCM) projections can also be incorporated into Bayesian NSFFA through quantile priors. For sites where climate projections indicate continued changes in runoff, the mean flows of quantile priors (

μ_{p}

in Equation (7)) can be adjusted accordingly to reflect projected future conditions. This approach provides a principled mechanism for integrating climate science with flood frequency analysis, supporting forward-looking infrastructure planning decisions.

2.4. Detecting Nonstationarity

Detecting changes in the time series data is a critical first step in NSFFA. A nonstationary time series will often exhibit a trend or jump that can be increasing or decreasing and may be linear or nonlinear. The cause of the nonstationarity can be gradual changes in hydrological and climatological factors, or anthropogenic changes such as alterations in land use and land cover.

RMC-BestFit provides several widely used hypothesis tests for evaluating flood series properties [48,49]. Some tests assess whether the independence and identical distribution (i.i.d.) assumptions are satisfied, while others specifically detect nonstationarity. These include:

Jarque-Bera test for detecting departures from normality [50];
Ljung-Box Q test for autocorrelation indicating periodicity [51];
Wald-Wolfowitz runs test for independence and stationarity [52,53,54];
Mann-Whitney U test for differences in distribution between subperiods [55];
Mann-Kendall trend test for monotonic trends [56,57];
Linear regression trend test for slope significance [48];
Student’s t-test (equal and unequal variance) for differences in means between subperiods [58];
F-test for differences in variance between subperiods [58];
Mixture model test for unimodality versus multimodal distributions [59].

The autocorrelation function (ACF) provides additional insight into the nature of nonstationarity and informs trend model selection. Persistent positive autocorrelation with slow exponential decay indicates a strong monotonic trend, while oscillatory ACF patterns with peaks at specific lags may suggest periodicity, though such patterns can also arise as artifacts of abrupt regime shifts combined with sampling variability.

Hypothesis test results and ACF plots for each case study are presented in Section 3. Multiple tests rejecting the null hypothesis of stationarity, combined with visual inspection of ACF and flood chronology plots, provide strong motivation for nonstationary modeling.

2.5. Return Period Interpretation and Risk Communication Under Nonstationarity

The interpretation of return periods and risk under nonstationary conditions requires careful reconsideration of traditional concepts. Read and Vogel [60] provided comprehensive analysis of reliability, return periods, and risk under nonstationarity, demonstrating that traditional return period interpretations become problematic when exceedance probabilities vary with time. Their subsequent work [61] introduced hazard function analysis and reliability-based methods as alternative frameworks for flood planning under nonstationary conditions.

This study employs a complementary approach based on the “Time Index” concept [14], which provides annual exceedance probability (AEP) inference conditioned on a specific time step rather than computing time-integrated reliability measures. Since distribution parameters vary with time in NSFFA, the exceedance probability and corresponding return period of a flow value also varies with time. In a stationary frequency analysis, the return period provides a straightforward and consistent measure of the likelihood of an event occurring. For example, a flood with a return period of 100 years has a 1% chance of being exceeded in any given year.

However, in nonstationary frequency analysis where statistical properties change over time, the concept of a return period becomes more complex. The return period can vary over time, reflecting the changing probabilities of event magnitudes [60]. Consequently, the return period in a nonstationary analysis must be carefully interpreted, often focusing on specific time steps or conditions to provide meaningful insights.

The Time Index approach determines which time step to use for evaluating the nonstationary frequency distribution [14]. For dam and levee safety applications, risk is typically evaluated using current conditions rather than forecasted or hindcasted conditions. Thus, parameters from the most recent time step are used for flood hazard prediction in risk analysis, providing AEP estimates conditional on present-day watershed conditions. However, users can choose any time step within the study period and forecast into the future to support scenario-based planning decisions [62]. This time-conditioned AEP inference differs fundamentally from reliability-based methods [60,61], which integrate time-varying exceedance probabilities over a structure’s design life to provide cumulative failure probability, analogous to the stationary expression

1 - {(1 - p)}^{T}

but more complex under nonstationarity. The cumulative probability produced by the reliability-based approach is not an AEP and is less directly interpretable for engineering design standards that are conventionally expressed in AEP terms. Furthermore, the reliability framework has been developed primarily for increasing hazard trends; for decreasing trends such as those at O.C. Fisher, reliability improves over time and the cumulative metric provides less actionable design guidance. The Time Index approach avoids these complications by providing a conditional AEP at a specific time step, preserving the familiar AEP interpretation used throughout engineering practice while naturally accommodating both increasing and decreasing trends.

Traditional return period standards (e.g., the “100-year flood” used as the regulatory basis for the U.S. National Flood Insurance Program) implicitly assume stationarity and require re-evaluation when exceedance probabilities vary with time [60]. Application of the Time Index approach is demonstrated in Section 3.

2.6. Trend Model Formulations

RMC-BestFit implements multiple trend model options for location, scale, and shape parameters. In practice, insufficient data typically precludes reliable estimation of time-varying shape parameters, so trend modeling focuses primarily on location and occasionally scale parameters [30,62]. Serago and Vogel [39] emphasized the importance of parsimony in trend model selection.

Table 1 summarizes trend model options with typical use cases relevant to hydrology and flood frequency analysis.

2.7. Probability Distribution Selection

The Log-Pearson Type III (LP3) distribution was selected for both case studies presented in this paper, consistent with federal guidelines established in Bulletin 17C [7]. The LP3 distribution has been the standard for flood frequency analysis in the United States since 1967 and remains the recommended distribution for federal agencies. This distribution is particularly well-suited for modeling flood data because it: (1) accommodates the positive skewness typically observed in flood records, (2) provides flexibility through three parameters (location, scale, and shape), and (3) has been extensively validated across diverse hydrologic regimes.

In RMC-BestFit’s implementation, the LP3 distribution is fit to the logarithms of annual maximum flows. For nonstationary analysis, the location (

μ

) and scale (

σ

) parameters can vary with time according to specified trend functions, while the shape parameter (skewness) is typically held constant due to data limitations for reliable estimation of time-varying skewness [30,62]. While RMC-BestFit supports multiple probability distributions (Generalized Extreme Value, Generalized Logistic, Normal, etc.), the LP3 was selected here to maintain consistency with standard practice and facilitate comparison with conventional stationary analyses.

2.8. Model Selection Using Multiple Criteria

Selecting appropriate trend structures presents a critical challenge in NSFFA [37,39]. RMC-BestFit implements three information criteria:

Akaike Information Criterion (AIC):

A I C = 2 k - 2 ln (\hat{L})

(11)

Bayesian Information Criterion (BIC):

B I C = k ln (n) - 2 ln (\hat{L})

(12)

Deviance Information Criterion (DIC):

D I C = \bar{D} + p_{D}

(13)

where k is the number of parameters, n is the sample size,

\hat{L}

is the maximum likelihood,

\bar{D}

is the posterior mean deviance, and

p_{D}

is the effective number of parameters. Lower values indicate better models [36,38].

2.9. Posterior Inference

RMC-BestFit employs MCMC methods using the DE-MCzs algorithm [45] to sample from the posterior distribution. Xu et al. [32] demonstrated adaptive Metropolis-Hastings optimization for Bayesian NSFFA.

The posterior predictive distribution integrates over parameter uncertainty to characterize the distribution of future floods [22]:

p (x_{T + k} | x) = \int f (x_{T + k} | θ_{T + k}) \cdot p (θ | x) d θ

(14)

This distribution accounts for both parameter uncertainty (epistemic) and natural variability (aleatory), providing comprehensive uncertainty quantification essential for risk-informed decision making [20,63]. This Bayesian formulation has conceptual parallels to the expected probability distribution, which has a long history in U.S. flood frequency analysis for incorporating parameter uncertainty into quantile estimates [64,65].

The 90% credible intervals reported in this study are derived from the posterior predictive distribution, capturing both parameter uncertainty and natural variability. These intervals provide a probabilistic range within which future flood quantiles are expected to fall, given the observed data and model assumptions.

2.10. Practical Implementation for Engineering Applications

While Bayesian MCMC methods are mathematically sophisticated, RMC-BestFit is designed to shield practitioners from implementation complexity, allowing engineers to focus on hydrologic interpretation rather than computational details. Key features enabling practical application include:

Automatic Prior Generation: The software automatically constructs default priors based on the observed data range and distribution support, enabling immediate use without requiring prior specification. Flat priors are applied to location, scale, and shape parameters, with an additional Jeffreys’ prior on the scale parameter. Users can optionally specify informative priors when regional or causal knowledge is available.
Adaptive Sampling: The DE-MCzs algorithm automatically adapts proposal distributions during burn-in, removing the need for manual tuning of step sizes and acceptance rates that traditionally require MCMC expertise.
Convergence Diagnostics: Built-in diagnostics (Gelman-Rubin statistics, effective sample size, trace plots) are computed automatically, alerting users to potential convergence issues without requiring interpretation of raw MCMC output.
Default Settings: Recommended chain lengths, burn-in periods, and thinning intervals are provided as defaults, based on extensive testing across diverse flood datasets.
Graphical Interface: The user interface guides workflow from data import through model specification, estimation, and results visualization, requiring no programming or scripting knowledge.

These features enable engineers familiar with traditional flood frequency analysis to apply Bayesian NSFFA methods without specialized statistical training, democratizing access to advanced uncertainty quantification approaches.

3. Results

The following case studies integrate watershed description, data characteristics, and frequency analysis results for each site to maintain narrative coherence, following the presentation format common in applied flood hydrology literature.

3.1. Case Study Watersheds

This study examines two watersheds in Texas selected to represent contrasting nonstationary behaviors relevant to urban flood management. Case Study 1 (Brays Bayou, Houston) represents a heavily urbanized watershed exhibiting increasing flood trends resulting from rapid land use conversion, following the analytical framework demonstrated by Villarini et al. [5]. Case Study 2 (O.C. Fisher Reservoir, west Texas) represents a watershed with decreasing flood trends attributed to brush encroachment, groundwater extraction, and climate variability. Figure 1 shows the locations of both case study watersheds within Texas.

3.2. Case Study 1: Brays Bayou at Houston, Texas

The first case study examines Brays Bayou, a major urban waterway in southwest Houston that exemplifies the flood challenges facing rapidly urbanizing watersheds. Annual maximum streamflow data were obtained from USGS gage 08075000 (Brays Bayou at Houston, TX, USA), located near the Main Street Bridge in Harris County. The drainage area upstream of the gage is approximately 246 km² (94.9 mi²).

3.2.1. Flood Data

The systematic record spans water years 1936–2024, providing 89 years of continuous annual maximum instantaneous streamflow data. In addition to the systematic record, a significant flood event occurred in 1929 and was included as exact (uncensored) historical data. For the intervening period 1930–1935, a perception threshold of 994 cubic meters per second (cms) was applied. A perception threshold defines the flow above which an event would have been recorded in any given year [7]; individual flood magnitudes during this period are unknown but inferred to be below the threshold. This treatment follows established methods for incorporating historical information [21,47].

The resulting dataset comprises 90 exact observations (1929 plus 1936–2024) and 6 perception-threshold years (1930–1935), spanning a total of 96 years. The combination of systematic and historical data substantially improves parameter estimation, particularly for the upper tail of the frequency distribution [47]. Figure 2 presents the flood chronology, illustrating the clear increasing trend associated with urbanization.

3.2.2. Watershed Characteristics

Brays Bayou is a principal tributary of Buffalo Bayou, flowing approximately 50 km (31 mi) from the western edge of Harris County, south of Barker Reservoir near the Fort Bend County border, eastward to its confluence with Buffalo Bayou at Harrisburg. The stream corridor traverses some of Houston’s most densely developed neighborhoods and districts, including Meyerland, Braeswood Place, the Texas Medical Center, and Riverside Terrace.

The Brays Bayou watershed represents one of the most intensively urbanized drainage basins in the United States. The broader watershed encompasses approximately 334 km² (129 mi²) and supports a population exceeding 700,000 residents. The drainage network includes over 200 km (124 mi) of open-channel waterways, the majority of which are artificial drainage channels constructed to convey stormwater through the urban landscape. This combination of high population density, extensive impervious surface coverage, and limited flood control infrastructure renders Brays Bayou exceptionally susceptible to flash flooding.

3.2.3. Urbanization History

Urbanization within the Brays Bayou basin, consistent with broader Houston metropolitan development patterns, accelerated dramatically during the mid-20th century. Post-World War II population growth, successive petroleum industry expansions, and the emergence of the Houston Ship Channel as a major industrial corridor drove rapid conversion of agricultural and natural lands to urban uses. The most intensive development occurred between the 1970s and 2010s, during which impervious surface coverage increased substantially as residential subdivisions, commercial centers, and transportation infrastructure replaced permeable landscapes.

This transformation fundamentally altered watershed hydrology. Natural infiltration capacity diminished as concrete, asphalt, and rooftops replaced vegetated surfaces, while channelization projects increased conveyance efficiency but reduced floodplain storage. The combined effect has been a progressive increase in flood peaks and volumes for storms of equivalent magnitude, providing strong physical justification for nonstationary flood frequency analysis [5,6]. The temporal coincidence between the identified 1968 change point and documented urbanization acceleration provides qualitative causal support, though formal covariate-based attribution is beyond the current scope (Section 4.4).

Houston’s flood history includes several catastrophic events that underscore the region’s vulnerability, including Tropical Storm Allison (2001) and Hurricane Harvey (2017). The watershed’s urban development timeline coincides with observed increases in flood frequency and magnitude, motivating explicit modeling of temporal trends in flood risk.

3.2.4. Hypothesis Testing

Table 2 presents hypothesis test results for the Brays Bayou annual maximum flow series. All trend tests reject the null hypothesis of stationarity at the

p < 0.001

level. The significant F-test (

p = 8.76 \times 10^{- 7}

) indicates changing variance between early and late periods, while the mixture model test confirms multimodality, consistent with distinct pre- and post-urbanization flow regimes.

The autocorrelation function (Figure 3) exhibits persistent positive autocorrelation with slow exponential decay, characteristic of a strong monotonic trend. This pattern, combined with the highly significant trend tests, confirms that nonstationarity must be explicitly modeled.

3.2.5. Model Estimation and Selection for Brays Bayou

Given the strong evidence for nonstationarity in both mean and variance (Table 2), seven candidate models were evaluated using the Log-Pearson Type III distribution. The significant F-test (

p = 8.76 \times 10^{- 7}

) and multimodal mixture test results motivated testing trend models on both location (

μ

) and scale (

σ

) parameters. Candidate models included: (1) stationary (constant parameters); (2) linear trend on

μ

; (3) logistic trend on

μ

; (4) step function on

μ

; (5) linear on

μ

with logistic on

σ

; (6) logistic on

μ

with logistic on

σ

; and (7) step function on

μ

with logistic on

σ

.

Table 3 presents model comparison results. The logistic–logistic model achieved the lowest AIC and BIC values, while the step–logistic model (step function on

μ

, logistic on

σ

) achieved the lowest DIC. Given that DIC is specifically designed for Bayesian model comparison and the step function provides an identifiable change point with clear physical interpretation, the step–logistic model was selected.

The step–logistic model is also physically compelling. The estimated mean change point of 1968 corresponds closely to the onset of Houston’s most intensive urbanization period in the early 1970s, when post-war suburban expansion accelerated dramatically. Notably, the logistic trend on the scale parameter indicates decreasing variance in log-space over time (negative slope

β = - 0.014

). While this may seem counterintuitive given the visually apparent spread of flows in recent years, examination of the log-transformed data reveals that early-period floods spanned over one order of magnitude (approximately 20–300 cms), whereas the post-urbanization period shows roughly half an order of magnitude spread (approximately 200–800 cms).

This decreasing log-space variance has a compelling physical interpretation: urbanization homogenizes the watershed’s runoff response, reducing the natural sources of variability that dominated the pre-development era. This mechanism is discussed further in Section 4.2.

Figure 4 illustrates the fitted trends for four representative models, showing the time-varying 2-year flood (median annual maximum): (a) stationary baseline, (b) linear–logistic, (c) logistic–logistic, and (d) the selected step–logistic model. The step–logistic model captures both the abrupt increase in mean flood magnitude around 1968 and the progressive reduction in flood variability as urbanization homogenized the watershed response.

3.2.6. Frequency Analysis Results for Brays Bayou

Table 4 presents flood quantile estimates comparing stationary and nonstationary analyses using current conditions (2024). A notable feature of the Brays Bayou results is the convergence of stationary and nonstationary frequency curves in the extreme right tail, with the largest differences occurring for frequent events. This convergence pattern arises from the opposing trends in location (increasing) and scale (decreasing) parameters, with implications for practice discussed in Section 4.2.

The quantitative differences are substantial for frequent events but diminish toward the tail. The 2-year flood is 48% larger under the nonstationary model (511 versus 346 cms), and the 5-year flood is 9% larger (661 versus 607 cms). However, differences diminish rapidly: the 10-year floods are nearly identical (749 versus 744 cms, less than 1% difference), and the 100-year floods are essentially equivalent (996 versus 993 cms). This convergence pattern contrasts sharply with O.C. Fisher, where uniform differences occur across all return periods.

Figure 5 presents the flood frequency curves comparing stationary and nonstationary analyses. The convergence of curves in the extreme tail is clearly visible, with substantial differences for frequent events.

3.3. Case Study 2: O.C. Fisher Reservoir, West Texas

O.C. Fisher Dam is located on the North Concho River in west Texas with a catchment area of approximately 3885 km². Period-of-record systematic inflows were obtained from 1916 to 2021, providing 106 years of annual maximum 2-day average inflow data. The largest flood during the systematic period occurred on 17 September 1936, and was estimated to be approximately 1150 cms.

The observed decline in flood magnitudes at O.C. Fisher coincides with significant changes in the North Concho watershed during the mid-20th century. Progressive encroachment of mesquite and juniper, which accelerated from the late 1800s through the 1950s due to overgrazing and fire suppression, fundamentally altered watershed hydrology [66]. These deep-rooted woody plants intercept precipitation through transpiration, depleting soil moisture and reducing aquifer recharge. Concurrently, agricultural irrigation pumping from the Edwards–Trinity aquifer lowered regional water tables, with documented declines exceeding 100 feet in portions of the watershed between the 1930s and 1960s. These combined effects depleted shallow aquifers that historically fed springs, converting perennial streams to intermittent channels. The Upper Colorado River Authority documented that streamflow at Carlsbad decreased to less than 22% of pre-1960 levels despite slightly increased rainfall [66].

Based on historical records and reports, the largest flood on the North Concho River occurred in 1853 and was at least as large as the 1936 event. Multiple other large events occurred between 1854 and 1915, but none exceeded the 1936 flood magnitude. In RMC-BestFit, the historical period from 1853 to 1915 was treated using a perception threshold of 1150 cms following methods established by Stedinger and Cohn [47]. One flood (1853) is known to have exceeded this level; exact magnitudes for all other years are unknown and treated as binomial-censored observations in the likelihood function. This treatment has been extended to nonstationary contexts by Machado et al. [46] and Xiong et al. [35]. The resulting dataset spans 169 years (1853–2021), combining systematic observations with perception-threshold historical information. Figure 6 presents the flood chronology, illustrating the clear decreasing trend.

Preliminary hypothesis tests applied to the combined systematic and historical dataset indicate statistically significant nonstationarity (Table 5), consistent with frameworks established by Gül et al. [8]. These statistical tests, combined with visual inspection of the flood chronology showing clear declining pattern, provide strong motivation for nonstationary modeling [3].

The autocorrelation function (Figure 7) exhibits oscillatory behavior with peaks around lags 9–12 years. This lag range is consistent with known climate teleconnection periods that influence Texas hydrology, including ENSO (2–7 years) and PDO (10–20 years) [67,68]. As discussed below, however, the fitted sinusoidal trend model does not successfully capture this potential signal.

3.3.1. Quantile Priors from Rainfall-Runoff Modeling

As discussed in Section 2.3.1, regional precipitation-frequency and rainfall-runoff modeling results can be incorporated in the Bayesian analysis through quantile priors [22,44]. For the O.C. Fisher analysis, the Rainfall-Runoff Frequency Tool (RMC-RRFT), a cloud-based system for stochastically sampling HEC-HMS rainfall-runoff models, was used to estimate quantile priors for the 0.10, 0.01, and 0.001 AEPs [69,70].

Global Climate Model (GCM) projections were also incorporated through the quantile priors. Downscaled hydrology projections from a multi-model ensemble [71] were processed for the watershed, indicating that runoff would continue to decrease by approximately 5% on average over the next 30 years. The mean flows of the quantile priors were reduced accordingly, demonstrating how climate science can be integrated with flood frequency analysis to support forward-looking infrastructure decisions. Quantile priors were specified as log-normal distributions with means and standard deviations estimated from the RMC-RRFT stochastic sampling results.

3.3.2. Model Estimation and Selection for O.C. Fisher

Given the oscillatory ACF pattern (Figure 7) and the non-significant F-test for variance change (

p = 0.42

), four candidate trend models were evaluated for the location parameter only: (1) stationary (constant); (2) linear trend; (3) sinusoidal; and (4) step function.

Table 6 presents model comparison results following the information-theoretic framework of Laio et al. [37]. Notably, the sinusoidal model achieved the lowest AIC, BIC, and DIC values across all candidates. However, the step function model was selected as the preferred model despite inferior information criteria, illustrating a critical principle in NSFFA: statistical fit alone should not dictate model selection when physical justification is lacking.

The sinusoidal model achieved the best statistical fit but presents interpretive challenges. While the ACF exhibits peaks at lags consistent with known teleconnection periods, the fitted sinusoidal model converged to a period of approximately 50–60 years, which aligns with neither the ACF peaks (9–12 years) nor established teleconnection periodicities (2–7 years for ENSO, 10–20 years for PDO). This mismatch suggests the model is not successfully isolating any real periodic signal. Furthermore, if both a secular decline and teleconnection-driven oscillation are present, a standalone sinusoidal function cannot capture both simultaneously. In contrast, the step function model has clear causal support: decades of progressive brush encroachment combined with groundwater extraction for irrigation collectively reduced runoff generation capacity by the 1960s [66]. The encroachment of deep-rooted mesquite and juniper increased transpiration, depleting shallow aquifers that historically fed perennial springs, while agricultural pumping from the Edwards–Trinity aquifer lowered water tables substantially. Given that the sinusoidal model’s fitted period lacks physical justification and the step function aligns with documented watershed changes, the step function was selected. The small DIC difference (

Δ

DIC = 2.28) arises because both models capture the observed decline in flood magnitudes over the period of record, but they imply fundamentally different future trajectories.

Future investigations using covariate-based approaches with teleconnection indices or compound trend formulations could better evaluate potential climate oscillation influences. This case demonstrates that practitioners should always require physical explanations for trend structures, using information criteria to guide rather than dictate model selection [62].

Figure 8 illustrates the fitted trends for all four candidate models. Visual comparison reveals that while the sinusoidal model fits the oscillatory pattern in the data, it implies physically implausible future behavior. The step function model captures the essential regime shift while maintaining interpretability.

3.3.3. Frequency Analysis Results for O.C. Fisher

After estimating and selecting the step function model, flood hazard predictions and risk analyses can be performed for dam safety applications. The parameters from the most recent time step (2021) were used for prediction, representing current watershed conditions. For O.C. Fisher Dam, the Probable Maximum Flood (PMF) provides an important reference point for evaluating spillway capacity and dam safety margins. The PMF was estimated at approximately 7930 cms following standard U.S. practice: Probable Maximum Precipitation (PMP) was derived from Hydrometeorological Reports 51 and 52 (HMR 51/52) and routed through a calibrated HEC-HMS rainfall-runoff model, the same model used to develop the quantile priors described in Section 3.3.1.

The stationary posterior predictive frequency curve suggests the PMF (estimated at approximately 7930 cms) has an annual exceedance probability of approximately 2 × 10⁻⁴ (5000-year return period). In contrast, the nonstationary posterior predictive curve indicates the PMF has an AEP slightly less than 1 × 10⁻⁴ (10,000-year return period), reflecting the reduced flood hazard under current conditions. More significantly for operations planning, the 100-year flood (0.01 AEP) is substantially reduced with the nonstationary model.

Table 7 compares stationary vs. nonstationary flood quantile estimates using current conditions (year 2021). The 100-year flood estimate decreased from 947 cms (stationary) to 444 cms (nonstationary current), representing a 53% reduction. The 90% credible intervals, derived from the posterior predictive distribution, properly reflect substantial parameter uncertainty and natural variability [22,60]. This substantial reduction in design flood estimates presents an opportunity to potentially reallocate flood control storage for water supply, thereby enhancing climate resiliency in this semi-arid region.

Figure 9 presents flood frequency curves comparing stationary and nonstationary analyses. Unlike Brays Bayou, where curves converge in the tail, O.C. Fisher exhibits uniform differences across all return periods because only the location parameter changes.

3.4. Summary of Frequency Curve Contrasts

The two case studies exhibit fundamentally different patterns in how stationary and nonstationary frequency curves differ. At O.C. Fisher, the nonstationary curve is shifted uniformly downward across all return periods (53% reduction in 100-year flood). At Brays Bayou, curves differ substantially for frequent events (48% increase in 2-year flood) but converge in the extreme tail (less than 1% difference in 100-year flood) due to opposing trends in location (increasing) and scale (decreasing) parameters. These contrasting patterns have important implications for practice, discussed in Section 4.

4. Discussion

4.1. Model Selection and Trend Model Use Cases

The two case studies demonstrate the importance of combining statistical model selection with physical understanding [39,40]. At Brays Bayou, the step-logistic model was selected based on lowest DIC and the identifiable 1968 change point corresponding to documented urbanization. At O.C. Fisher, the step function model was selected despite the sinusoidal model achieving superior information criteria across all metrics because the step function has clear causal support from documented brush encroachment and groundwater extraction, while the sinusoidal model’s fitted period does not align with any known physical mechanism.

Model selection under nonstationarity requires balancing parsimony with goodness-of-fit. A recommended practice is to start with a stationary baseline and test incrementally more complex models, checking whether each alternative significantly improves over the previous one. AIC, BIC, and DIC each penalize additional parameters, though BIC imposes a more severe penalty that scales with sample size, and DIC is specifically designed for Bayesian model comparison.

Critically, a model with the smallest information criteria may not always be the most appropriate choice if there is no hydrological or climatological explanation for the trend pattern. The O.C. Fisher case study exemplifies this principle: the sinusoidal model achieved the lowest AIC, BIC, and DIC, yet was rejected because its fitted period of approximately 50–60 years does not align with any known physical mechanism. The step function model, with clear causal support from documented brush encroachment and groundwater extraction, was selected despite its DIC being only 2.28 units higher than the sinusoidal model, a minimal statistical cost for a model with stronger physical justification.

General guidance by watershed type includes (1) early-stage urbanization: start with linear, then test logistic [5]; (2) rapid urban expansion: test power or quadratic; (3) post-disturbance recovery: test exponential for approach to new equilibrium; (4) known regime shift: test step function; (5) long-term records with climate influence: test sinusoidal cautiously [27,72].

4.2. Contrasting Patterns in Frequency Curve Differences

The two case studies exhibit fundamentally different patterns in how stationary and nonstationary frequency curves differ, with important implications for practice.

At O.C. Fisher Reservoir, where only the location parameter (

μ

) changes over time, the nonstationary frequency curve is shifted uniformly downward from the stationary curve across all return periods. The 100-year flood decreases by 53%, and similar proportional reductions occur throughout the frequency range. This pattern arises because a decrease in

μ

with constant

σ

simply translates the entire distribution to lower values.

At Brays Bayou, where both location and scale parameters change (increasing

μ

, decreasing

σ

), the frequency curves exhibit a more complex relationship. The 2-year flood increases by 48%, but this difference diminishes with decreasing AEP: the 10-year floods differ by less than 1%, and the 100-year floods are essentially identical. This convergence occurs because the increasing mean shifts the distribution upward while the decreasing variance produces a narrower distribution. For frequent events near the median, the upward shift dominates; for extreme events, the narrower spread partially offsets the higher mean.

This finding has practical significance: the most appropriate choice between stationary and nonstationary analysis depends on the decision context. For Brays Bayou, stationary analysis may adequately estimate extreme floods relevant to dam safety, but would substantially underestimate the frequent flooding that drives stormwater infrastructure design, flood insurance rates, and routine flood damages experienced by residents. Practitioners should consider which portion of the frequency curve is most relevant to their specific application.

The physical mechanism underlying the decreasing log-space variance merits further explanation. In a pre-urbanization watershed, flood magnitudes are highly variable because antecedent soil moisture, infiltration capacity, and vegetation interception modulate rainfall-runoff partitioning differently for each event. As impervious surfaces replace permeable landscapes, these modulating factors diminish and the watershed’s response approaches a near-deterministic conversion of rainfall to runoff. This homogenization reduces relative variability (variance in log-space) even as mean flood magnitude increases.

The 90% credible intervals also differ between case studies. At Brays Bayou, nonstationary intervals are narrower for frequent events (where the model explains the regime shift) but wider in the extreme tail due to epistemic uncertainty from additional trend parameters. At O.C. Fisher, nonstationary intervals are wider than stationary across all return periods, reflecting the added epistemic uncertainty from trend parameters despite the improved mean estimate. This time-dependent uncertainty structure underscores the value of the full Bayesian posterior predictive framework.

A practical question is: when does NSFFA enhance decision-making compared to a robust stationary Bayesian analysis? The Brays Bayou case is instructive. For the 100-year flood, stationary and nonstationary estimates are nearly identical (993 versus 996 cms) with overlapping credible intervals, and a stationary analysis would suffice for dam safety. NSFFA adds the most value when: (1) the decision involves the frequency range where estimates diverge, such as stormwater design at Brays Bayou where the 2-year flood differs by 48%; (2) trends produce uniform divergence across all return periods, as at O.C. Fisher; or (3) forward-looking projections are needed for long-lived infrastructure.

4.3. Implications for Dam Safety and Storage Reallocation

The NSFFA results have direct implications for dam safety risk analysis and reservoir operations. For dam safety applications, the nonstationary flow-frequency curve must be transformed to reservoir water levels using reservoir routing software such as RMC-RFA [73], with results then imported to quantitative risk analysis software for comprehensive dam safety assessment [74].

In the coming decades, reallocation to balance competing objectives such as flood storage and water supply will undoubtedly be necessary to address climate change impacts. The direction of reallocation depends critically on whether flood hazard is increasing or decreasing at a given site:

For sites where flood hazard is increasing due to climate change or urbanization, more conservative dam safety modifications that provide higher levels of protection will be preferable. Infrastructure investments should anticipate continued growth in flood magnitudes and design for conditions expected at the end of the project life.

For sites where flood hazard is decreasing, as demonstrated at O.C. Fisher, less flood storage capacity will be needed to maintain the same level of flood protection. This reduction in required capacity presents an opportunity to reallocate flood control storage for water supply, thereby enhancing climate resiliency, particularly valuable in the semi-arid western United States where water supplies are increasingly stressed.

Increasing water supply storage will tend to increase the likelihood of spillway discharge at lower water levels, thereby increasing potential flood damages during spillway operations. However, if the flood hazard is decreasing over time, the net increase in these damages could be negligible, allowing for positive net benefits and increased water supply without compromising safety.

4.4. Limitations and Future Research

This study demonstrates practical application of nonstationary flood frequency analysis for urban watersheds, but several important limitations should be acknowledged. First, both case studies employ at-site analysis only, estimating distribution parameters independently at each location. Regional analysis approaches that pool information across multiple sites through hierarchical Bayesian models could substantially improve parameter estimation, particularly for sites with limited record lengths [26]. Regional hierarchical models that share nonstationary trend parameters across climatically or hydrologically similar sites while preserving site-specific characteristics represent an important direction for future research.

Additionally, both case studies feature strong anthropogenic drivers with clear physical mechanisms. Framework performance in data-scarce basins, mixed-driver systems with ambiguous attribution, and regions with weaker trend signals remains untested and represents important future research scope.

Second, this analysis relies on a single probability distribution (Log-Pearson Type III) selected for consistency with federal guidelines. Distribution choice can interact with trend specification under nonstationarity [72], and future analyses should evaluate alternative distributions using DIC-based comparison in RMC-BestFit. For short-record sites, informative priors, as demonstrated at O.C. Fisher using quantile priors from rainfall-runoff modeling, and regional hierarchical approaches become especially valuable for stable trend estimation. Sensitivity of results to different prior information sources, including expert elicitation, also warrants future investigation.

Third, trend models use time as the independent variable rather than physical covariates. While physical reasoning supports the selected trend structures qualitatively, this study does not quantitatively demonstrate causal linkage between physical drivers and flood trends. Covariate-based approaches using impervious area fraction, groundwater levels, or climate indices would enable formal attribution and scenario-based projections [5,17]. In the interim, the quantile prior mechanism demonstrated at O.C. Fisher (Section 3.3.1) provides a complementary pathway for integrating climate projections.

Fourth, this study demonstrates single best model selection based on information criteria and physical plausibility. However, when multiple trend models achieve similar statistical fit (as observed at O.C. Fisher where the sinusoidal and step function models differed by only

Δ

DIC = 2.28), single-model selection discards potentially useful information. Bayesian Model Averaging (BMA) provides a principled alternative, where posterior model probabilities derived from DIC differences weight predictive distributions across competing models [37]. This yields ensemble forecasts that account for both parameter and model structural uncertainty, propagating epistemic uncertainty about the “true” trend structure into final quantile estimates.

The combination of multiple probability distributions with multiple trend models can produce a large candidate model space; BMA offers a rigorous framework for managing this complexity. While available in RMC-BestFit, BMA application is beyond the scope of this introductory demonstration but represents an important direction for comprehensive methodological guidance in subsequent publications.

Fifth, RMC-BestFit does not currently support compound trend models that superimpose multiple signals (e.g., a step function combined with a periodic oscillation). For watersheds potentially influenced by both anthropogenic regime shifts and climate teleconnections, such compound formulations or covariate-based approaches using teleconnection indices may be necessary to isolate and properly attribute different sources of nonstationarity.

Finally, both case studies analyze annual maximum series which may not capture sub-annual flood seasonality changes. Partial duration series approaches that model all peaks above a threshold could provide additional insights into changing flood behavior, particularly for urban watersheds where multiple events per year can cause significant damages [12,13].

This study treats flood discharges as known values, but stage–discharge relationships introduce measurement uncertainty, particularly for extreme flows exceeding the calibrated rating curve range [75]. RMC-BestFit addresses this through an uncertain data likelihood that propagates observation-level distributional uncertainty through both stationary and nonstationary analyses, and through Bayesian estimation of segmented rating curves. Demonstration of these capabilities is planned for a forthcoming publication.

Despite these limitations, the case studies successfully demonstrate that open-source Bayesian software can make sophisticated nonstationary methods accessible to the broader engineering community, promoting transparency and reproducibility in flood risk assessment. These advanced approaches (regional analysis, covariate-based models, BMA, alternative distributions) will be addressed in subsequent publications focused on comprehensive methodological guidance.

5. Conclusions

This study demonstrated practical application of nonstationary flood frequency analysis using RMC-BestFit through contrasting Texas case studies. Key findings include the following:

Nonstationarity substantially impacts flood risk estimates, but the pattern depends on which parameters change and how they interact. At O.C. Fisher, floods decreased uniformly by 40–53% (location shift only). At Brays Bayou, urbanization increased the mean while decreasing log-space variance as impervious surfaces homogenized runoff response, producing a 48% increase in the 2-year flood but less than 1% difference at the 100-year level.
Model selection should integrate information criteria with physical reasoning [37,39]. At O.C. Fisher, the sinusoidal model achieved the best information criteria but was rejected for lack of physical mechanism. Different trend models suit different situations: step functions for regime shifts, logistic trends for saturating processes [5,6].
Historical and perception-threshold data substantially improve analyses [21,35,47]. Extended records at both sites enabled confident detection of long-term trends.
The relevance of NSFFA depends on decision context. NSFFA adds the most value when decisions involve frequency ranges where estimates diverge, when trends produce uniform divergence, or when forward-looking projections are needed for long-lived infrastructure.
Open-source Bayesian software promotes accessibility and reproducibility [41], and the demonstrated workflow provides a practical template for practitioners applying NSFFA.

As climate change continues to impact hydrological patterns, it is crucial to adopt flexible and forward-looking approaches in dam and levee safety and water resource management. The methodologies and insights from this study provide a foundation for adaptive strategies, ensuring better preparedness and optimized resource allocation in the face of evolving flood risks.

Author Contributions

Conceptualization, C.H.S.; methodology, C.H.S. and B.S.; software, C.H.S.; validation, C.H.S.; formal analysis, C.H.S.; investigation, C.H.S.; writing–original draft preparation, C.H.S.; writing–review and editing, C.H.S., B.S. and D.A.M.; supervision, D.A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the U.S. Army Corps of Engineers Risk Management Center.

Data Availability Statement

Brays Bayou streamflow data are publicly available from the U.S. Geological Survey National Water Information System (https://waterdata.usgs.gov/nwis, accessed on 7 November 2025). Restrictions apply to the availability of the O.C. Fisher Reservoir 2-day average inflow data. Data were obtained from the U.S. Army Corps of Engineers Risk Management Center and are available from the corresponding author with the permission of the U.S. Army Corps of Engineers Risk Management Center. RMC-BestFit software and documentation are openly available at GitHub (https://github.com/USACE-RMC/RMC-BestFit, accessed on 7 November 2025). The raw data and RMC-BestFit model files supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The RMC-BestFit software would not exist without the support of Risk Management Center leadership, in particular the former RMC Director Nathan J. Snorteland (retired) and RMC Lead Engineers David A. Margo (retired) and John F. England. RMC-BestFit was developed in collaboration with Brian Skahill (retired) at the USACE Engineer Research and Development Center Coastal and Hydraulics Laboratory. The authors are grateful to all who have contributed to the development of RMC-BestFit and the content of this paper.

Conflicts of Interest

Author David Margo was employed by the company HDR Engineering, Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AEP	Annual Exceedance Probability
AIC	Akaike Information Criterion
BIC	Bayesian Information Criterion
CRED	Centre for Research on the Epidemiology of Disasters
DIC	Deviance Information Criterion
MCMC	Markov Chain Monte Carlo
NSFFA	Nonstationary Flood Frequency Analysis
USACE	U.S. Army Corps of Engineers
USGS	U.S. Geological Survey

References

Centre for Research on the Epidemiology of Disasters (CRED). 2023 Disasters in Numbers; Technical report; CRED: Brussels, Belgium, 2024. [Google Scholar]
Milly, P.C.D.; Betancourt, J.; Falkenmark, M.; Hirsch, R.M.; Kundzewicz, Z.W.; Lettenmaier, D.P.; Stouffer, R.J. Stationarity is dead: Whither water management? Science 2008, 319, 573–574. [Google Scholar] [CrossRef]
Bayazit, M. Nonstationarity of hydrological records and recent trends in trend analysis: A state-of-the-art review. Environ. Processes 2015, 2, 527–542. [Google Scholar] [CrossRef]
Leopold, L.B. Hydrology for Urban Land Planning—A Guidebook on the Hydrologic Effects of Urban Land Use; Circular 554; U.S. Geological Survey: Reston, VA, USA, 1968.
Villarini, G.; Smith, J.A.; Serinaldi, F.; Bales, J.; Bates, P.D.; Krajewski, W.F. Flood frequency analysis for nonstationary annual peak records in an urban drainage basin. Adv. Water Resour. 2009, 32, 1255–1266. [Google Scholar] [CrossRef]
Gilroy, K.L.; McCuen, R.H. A nonstationary flood frequency analysis method to adjust for future climate change and urbanization. J. Hydrol. 2012, 414–415, 40–48. [Google Scholar] [CrossRef]
England, J.F.; Cohn, T.A.; Faber, B.A.; Stedinger, J.R.; Thomas, W.O.; Veilleux, A.G.; Kiang, J.E.; Mason, R.R. Guidelines for Determining Flood Flow Frequency, Bulletin 17C; Techniques and Methods 4-B5; U.S. Geological Survey: Reston, VA, USA, 2018. [CrossRef]
Gül, G.O.; Aşıkoğlu, Ö.L.; Gül, A.; Yaşoğlu, F.G.; Benzeden, E. Nonstationarity in flood time series. J. Hydrol. Eng. 2014, 19, 1349–1360. [Google Scholar] [CrossRef]
Strupczewski, W.G.; Singh, V.P.; Feluch, W. Non-stationary approach to at-site flood frequency modelling I. Maximum likelihood estimation. J. Hydrol. 2001, 248, 123–142. [Google Scholar] [CrossRef]
Renard, B.; Lang, M.; Bois, P. Statistical analysis of extreme events in a non-stationary context via a Bayesian framework: Case study with peak-over-threshold data. Stoch. Environ. Res. Risk Assess. 2006, 21, 97–112. [Google Scholar] [CrossRef]
Skahill, B.E.; AghaKouchak, A.; Cheng, L.; Byrd, A.; Kanney, J. Bayesian Inference of Nonstationary Precipitation Intensity-Duration-Frequency Curves for Infrastructure Design; Technical Note ERDC/CHL CHETN-X-2; U.S. Army Corps of Engineers, Engineer Research and Development Center: Vicksburg, MS, USA, 2016. [Google Scholar]
Debele, S.E.; Strupczewski, W.G.; Bogdanowicz, E. A comparison of three approaches to non-stationary flood frequency analysis. Acta Geophys. 2017, 65, 863–883. [Google Scholar] [CrossRef]
Debele, S.E.; Bogdanowicz, E.; Strupczewski, W.G. Around and about an application of the GAMLSS package to non-stationary flood frequency analysis. Acta Geophys. 2017, 65, 885–892. [Google Scholar] [CrossRef]
Cheng, L.; AghaKouchak, A. Nonstationary precipitation intensity-duration-frequency curves for infrastructure design in a changing climate. Sci. Rep. 2014, 4, 7093. [Google Scholar] [CrossRef]
Chen, M.; Papadikis, K.; Jun, C.; Macdonald, N. Linear, nonlinear, parametric and nonparametric regression models for nonstationary flood frequency analysis. J. Hydrol. 2023, 616, 128772. [Google Scholar] [CrossRef]
Grego, J.M.; Yates, P.A. Robust local likelihood estimation for non-stationary flood frequency analysis. J. Agric. Biol. Environ. Stat. 2024, 29, 571–591. [Google Scholar] [CrossRef]
Wasko, C.; Westra, S.; Nathan, R.; Pepler, A.; Raupach, T.H.; Dowdy, A.; Grose, M.R.; Trenham, C.; Evans, J.P.; Johnson, F.; et al. A systematic review of climate change science relevant to Australian design flood estimation. Hydrol. Earth Syst. Sci. 2024, 28, 1251–1285. [Google Scholar] [CrossRef]
Gaumé, É. Flood frequency analysis: The Bayesian choice. WIREs Water 2018, 5, e1290. [Google Scholar] [CrossRef]
Kuczera, G. A Bayesian surrogate for regional skew in flood frequency analysis. Water Resour. Res. 1983, 19, 821–829. [Google Scholar] [CrossRef]
Kuczera, G. Comprehensive at-site flood frequency analysis using Monte Carlo Bayesian inference. Water Resour. Res. 1999, 35, 1551–1557. [Google Scholar] [CrossRef]
Reis, D.S.; Stedinger, J.R. Bayesian MCMC flood frequency analysis with historical information. J. Hydrol. 2005, 313, 97–116. [Google Scholar] [CrossRef]
Viglione, A.; Merz, R.; Salinas, J.L.; Blöschl, G. Flood frequency hydrology: 3. A Bayesian analysis. Water Resour. Res. 2013, 49, 675–692. [Google Scholar] [CrossRef]
O’Connell, D.R.H.; Ostenaa, D.A.; Levish, D.R.; Klinger, R.E. Bayesian flood frequency analysis with paleohydrologic bound data. Water Resour. Res. 2002, 38, 1058. [Google Scholar] [CrossRef]
Seidou, O.; Ouarda, T.B.M.J.; Barbet, M.; Bruneau, P.; Bobée, B. A parametric Bayesian combination of local and regional information in flood frequency analysis. Water Resour. Res. 2006, 42, W11408. [Google Scholar] [CrossRef]
Ribatet, M.; Sauquet, E.; Grésillon, J.M.; Ouarda, T.B.M.J. A regional Bayesian POT model for flood frequency analysis. Stoch. Environ. Res. Risk Assess. 2008, 22, 327–339. [Google Scholar] [CrossRef]
Guo, S.; Xiong, L.; Chen, J.; Guo, S.; Xia, J.; Zeng, L.; Xu, C.Y. Nonstationary regional flood frequency analysis based on the Bayesian method. Water Resour. Manag. 2022, 36, 6809–6831. [Google Scholar] [CrossRef]
Kwon, H.H.; Brown, C.; Lall, U. Climate informed flood frequency analysis and prediction in Montana using hierarchical Bayesian modeling. Geophys. Res. Lett. 2008, 35, L05404. [Google Scholar] [CrossRef]
Renard, B.; Sun, X.; Lang, M. Bayesian methods for non-stationary extreme value analysis. In Extremes in a Changing Climate; Springer: Berlin/Heidelberg, Germany, 2012; pp. 39–95. [Google Scholar] [CrossRef]
Ouarda, T.B.M.J.; El-Adlouni, S.E. Bayesian inference of non-stationary flood frequency models. In Proceedings of the Proceedings of the World Environmental and Water Resources Congress 2008; American Society of Civil Engineers: Reston, VA, USA, 2008. [Google Scholar] [CrossRef]
Ouarda, T.B.M.J.; El-Adlouni, S. Bayesian nonstationary frequency analysis of hydrological variables. J. Am. Water Resour. Assoc. 2011, 47, 496–505. [Google Scholar] [CrossRef]
Luke, A.; Vrugt, J.A.; AghaKouchak, A.; Matthew, R.A.; Sanders, B.F. Predicting nonstationary flood frequencies: Evidence supports an updated stationarity thesis in the United States. Water Resour. Res. 2017, 53, 5469–5494. [Google Scholar] [CrossRef]
Xu, W.; Jiang, C.; Yan, L.; Li, L.; Liu, S. An adaptive Metropolis-Hastings optimization algorithm of Bayesian estimation in non-stationary flood frequency analysis. Water Resour. Manag. 2018, 32, 1343–1366. [Google Scholar] [CrossRef]
Bracken, C.; Holman, K.D.; Rajagopalan, B.; Moradkhani, H. A Bayesian hierarchical approach to multivariate nonstationary hydrologic frequency analysis. Water Resour. Res. 2017, 53, 4481–4505. [Google Scholar] [CrossRef]
Bossa, A.Y.; Akpaca, J.d.D.; Hounkpè, J.; Yira, Y.; Badou, D.F. Non-stationary flood discharge frequency analysis in West Africa. GeoHazards 2023, 4, 316–327. [Google Scholar] [CrossRef]
Xiong, B.; Xiong, L.; Guo, S.; Xu, C.Y.; Xia, J.; Zhong, Y.; Yang, H. Nonstationary frequency analysis of censored data: A case study of the floods in the Yangtze River from 1470 to 2017. Water Resour. Res. 2020, 56, e2020WR027112. [Google Scholar] [CrossRef]
Di Baldassarre, G.; Laio, F.; Montanari, A. Design flood estimation using model selection criteria. Phys. Chem. Earth 2009, 34, 606–611. [Google Scholar] [CrossRef]
Laio, F.; Di Baldassarre, G.; Montanari, A. Model selection techniques for the frequency analysis of hydrological extremes. Water Resour. Res. 2009, 45, W07416. [Google Scholar] [CrossRef]
Haddad, K.; Rahman, A. Selection of the best fit flood frequency distribution and parameter estimation procedure: A case study for Tasmania in Australia. Stoch. Environ. Res. Risk Assess. 2011, 25, 415–428. [Google Scholar] [CrossRef]
Serago, J.M.; Vogel, R.M. Parsimonious nonstationary flood frequency analysis. Adv. Water Resour. 2018, 112, 1–16. [Google Scholar] [CrossRef]
Singh, J.; Hari, V.; Singh, T.; Karmakar, S.; Ghosh, S. A framework for investigating the diagnostic trend in stationary and nonstationary flood frequency analyses under changing climate. J. Clim. Change 2015, 1, 47–56. [Google Scholar] [CrossRef]
Smith, C.H. RMC-TR-2020-02: Verification of the Bayesian Estimation and Fitting Software (RMC-BestFit); Technical report; U.S. Army Corps of Engineers, Risk Management Center: Lakewood, CO, USA, 2020. [Google Scholar]
Smith, C.H.; Doughty, M. RMC-TR-2020-03: RMC-BestFit Quick Start Guide; Technical report; U.S. Army Corps of Engineers, Risk Management Center: Lakewood, CO, USA, 2020. [Google Scholar]
Smith, C.H.; Skahill, B.E. Estimating design floods with a specified annual exceedance probability. In Proceedings of the NZSOLD ANCOLD Combined Conference 2019, Sydney, Australia, 9–12 October 2019. [Google Scholar]
Coles, S.G.; Tawn, J.A. A Bayesian analysis of extreme rainfall data. J. R. Stat. Soc. Ser. C (Appl. Stat.) 1996, 45, 463–478. [Google Scholar] [CrossRef]
ter Braak, C.J.F.; Vrugt, J.A. Differential evolution Markov chain with snooker updater and fewer chains. Stat. Comput. 2008, 18, 435–446. [Google Scholar] [CrossRef]
Machado, M.J.; Botero, B.A.; López, J.; Francés, F.; Díez-Herrero, A.; Benito, G. Flood frequency analysis of historical flood data under stationary and non-stationary modelling. Hydrol. Earth Syst. Sci. 2015, 19, 2561–2576. [Google Scholar] [CrossRef]
Stedinger, J.R.; Cohn, T.A. Flood frequency analysis with historical and paleoflood information. Water Resour. Res. 1986, 22, 785–793. [Google Scholar] [CrossRef]
Maity, R. Statistical Methods in Hydrology and Hydroclimatology; Springer: Singapore, 2018. [Google Scholar] [CrossRef]
Naghettini, M. Fundamentals of Statistical Hydrology; Springer: Cham, Switzerland, 2017. [Google Scholar] [CrossRef]
Jarque, C.M.; Bera, A.K. A Test for Normality of Observations and Regression Residuals. Int. Stat. Rev. 1987, 55, 163–172. [Google Scholar] [CrossRef]
Ljung, G.M.; Box, G.E.P. On a Measure of Lack of Fit in Time Series Models. Biometrika 1978, 65, 297–303. [Google Scholar] [CrossRef]
Wald, A.; Wolfowitz, J. An Exact Test for Randomness in the Non-Parametric Case Based on Serial Correlation. Ann. Math. Stat. 1943, 14, 378–388. [Google Scholar] [CrossRef]
Meylan, P.; Favre, A.C.; Musy, A. Predictive Hydrology: A Frequency Analysis Approach; CRC Press: Boca Raton, FL, USA, 2012. [Google Scholar] [CrossRef]
Rao, A.R.; Hamed, K.H. Flood Frequency Analysis; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar] [CrossRef]
Mann, H.B.; Whitney, D.R. On a Test of Whether One of Two Random Variables is Stochastically Larger than the Other. Ann. Math. Stat. 1947, 18, 50–60. [Google Scholar] [CrossRef]
Mann, H.B. Nonparametric Tests Against Trend. Econometrica 1945, 13, 245–259. [Google Scholar] [CrossRef]
Kendall, M.G. Rank Correlation Methods, 4th ed.; Charles Griffin: London, UK, 1975. [Google Scholar]
Snedecor, G.W.; Cochran, W.G. Statistical Methods, 8th ed.; Iowa State University Press: Ames, IA, USA, 1989. [Google Scholar]
McLachlan, G.J.; Peel, D. Finite Mixture Models; John Wiley & Sons: New York, NY, USA, 2000. [Google Scholar] [CrossRef]
Read, L.K.; Vogel, R.M. Reliability, return periods, and risk under nonstationarity. Water Resour. Res. 2015, 51, 6381–6398. [Google Scholar] [CrossRef]
Read, L.K.; Vogel, R.M. Hazard function analysis for flood planning under nonstationarity. Water Resour. Res. 2016, 52, 4116–4131. [Google Scholar] [CrossRef]
Smith, C.H. Nonstationary flood frequency analysis with RMC-BestFit. In Proceedings of the ANCOLD Conference 2024, Sydney, Australia, 11–14 November 2024. [Google Scholar]
Apostolakis, G. The Concept of Probability in Safety Assessments of Technological Systems. Science 1990, 250, 1359–1364. [Google Scholar] [CrossRef]
Beard, L.R. Probability Estimates Based on Small Normal-Distribution Samples. J. Geophys. Res. 1960, 65, 2143–2148. [Google Scholar] [CrossRef]
Stedinger, J.R.; Vogel, R.M.; Foufoula-Georgiou, E. Frequency Analysis of Extreme Events. In Handbook of Hydrology; Maidment, D.R., Ed.; McGraw-Hill: New York, NY, USA, 1993; Chapter 18; pp. 18.1–18.66. [Google Scholar]
Upper Colorado River Authority. North Concho River Watershed Brush Control Planning, Assessment and Feasibility Study; Technical report; Upper Colorado River Authority: San Angelo, TX, USA, 1999. [Google Scholar]
Mishra, A.K.; Singh, V.P.; Özger, M. Seasonal streamflow extremes in Texas River basins: Uncertainty, trends, and teleconnections. J. Geophys. Res. Atmos. 2011, 116, D08108. [Google Scholar] [CrossRef]
Tootle, G.A.; Piechota, T.C.; Singh, A. Coupled oceanic-atmospheric variability and U.S. streamflow. Water Resour. Res. 2005, 41, W12408. [Google Scholar] [CrossRef]
Avance, A.; Mahoney, M.; Smith, C.H. Incorporating regional rainfall-frequency into flood frequency using RMC-RRFT and RMC-BestFit. In Proceedings of the ASDSO Conference 2021, Lexington, KY, USA, 12–15 September 2021. [Google Scholar]
Quebbeman, J.; Carney, S.; Denno, M.; Smith, C.H. Integrating USACE RMC-RRFT within a risk-informed framework for SQRAs. In Proceedings of the USSD Conference 2023, Charleston, SC, USA, 17–21 April 2023. [Google Scholar]
Maurer, E.P.; Brekke, L.; Pruitt, T.; Duffy, P.B. Fine-resolution climate projections enhance regional climate change impact studies. Eos Trans. Am. Geophys. Union 2007, 88, 504. [Google Scholar] [CrossRef]
Vasiliades, L.; Galiatsatou, P.; Loukas, A. Nonstationary frequency analysis of annual maximum rainfall using climate covariates. Water Resour. Manag. 2015, 29, 339–358. [Google Scholar] [CrossRef]
Smith, C.H. A robust and efficient stochastic simulation framework for estimating reservoir stage-frequency curves with uncertainty bounds. In Proceedings of the ANCOLD Conference 2018, Sydney, Australia, 11–12 October 2018. [Google Scholar]
Smith, C.H.; Fields, W.L. A New Comprehensive Risk Analysis Software, RMC-TotalRisk. In Proceedings of the ANCOLD Conference 2022, Sydney, Australia, 26–28 October 2022. [Google Scholar]
Kiang, J.E.; Gazoorian, C.; McMillan, H.; Hall, A.; Cohn, T.; Mason, R.R.; Eng, K.; Badruzzaman, M. A comparison of methods for streamflow uncertainty estimation. Water Resour. Res. 2018, 54, 7149–7176. [Google Scholar] [CrossRef]

Figure 1. Study area map showing the locations of the two case study watersheds in Texas. Brays Bayou at Houston (USGS gage 08075000) drains approximately 246 km² in southwest Houston, Harris County. O.C. Fisher Reservoir on the North Concho River drains approximately 3885 km² in west Texas near San Angelo. This figure was created using matplotlib (Version 3.9).

Figure 2. Chronology of annual maximum instantaneous streamflow for Brays Bayou at Houston (1929–2024). The 1929 flood (filled circle) is included as exact historical data. The period 1930–1935 (shaded region) represents a perception threshold of 994 cms; individual flood magnitudes during this period are unknown but are inferred to be below the threshold. A clear increasing trend is evident, associated with progressive urbanization of the watershed.

Figure 3. Autocorrelation function for Brays Bayou at Houston. Persistent positive autocorrelation with slow decay is characteristic of a strong monotonic trend. Dashed lines indicate 95% confidence bounds.

Figure 4. Comparison of trend model fits for Brays Bayou showing the time-varying 2-year flood distribution: (a) stationary, (b) linear–logistic (

μ

–

σ

) [

Δ

DIC = 16.74], (c) logistic–logistic (

μ

–

σ

) [

Δ

DIC = 4.95], and (d) step–logistic (

μ

–

σ

) [selected,

Δ

DIC = 0]. Dashed lines indicate posterior mean; shaded regions show 90% credible intervals. The selected step–logistic model (d) captures both the regime shift in mean flood magnitude around 1968 and the decreasing variance as urbanization homogenized the watershed’s runoff response.

Figure 4. Comparison of trend model fits for Brays Bayou showing the time-varying 2-year flood distribution: (a) stationary, (b) linear–logistic (

μ

–

σ

) [

Δ

DIC = 16.74], (c) logistic–logistic (

μ

–

σ

) [

Δ

DIC = 4.95], and (d) step–logistic (

μ

–

σ

) [selected,

Δ

DIC = 0]. Dashed lines indicate posterior mean; shaded regions show 90% credible intervals. The selected step–logistic model (d) captures both the regime shift in mean flood magnitude around 1968 and the decreasing variance as urbanization homogenized the watershed’s runoff response.

Figure 5. Flood frequency curves for Brays Bayou at Houston comparing stationary (blue) and nonstationary (red) analyses. Dashed lines represent posterior predictive estimates; shaded regions show 90% credible intervals. The nonstationary curve reflects current watershed conditions (2024). Curves differ substantially for frequent events but converge in the extreme tail due to opposing trends in location and scale parameters. Plotting positions for observed data use the Hirsch-Stedinger formula and implicitly assume stationarity, highlighting the inherent challenge of visualizing nonstationary frequency curves.

Figure 6. Chronology of annual maximum 2-day average inflow for O.C. Fisher Reservoir (1853–2021). The historical period 1853–1915 (shaded region) represents a perception threshold of 1150 cms; individual flood magnitudes are unknown but inferred to be below the 1936 flood level, with one flood (1853) known to exceed it. A clear decreasing trend is evident, attributed to brush encroachment and groundwater extraction.

Figure 7. Autocorrelation function for O.C. Fisher Reservoir. Oscillatory behavior with peaks around lags 9–12 years is consistent with known teleconnection periods (ENSO, PDO); however, the fitted sinusoidal model converged to a period of approximately 50–60 years that does not match these signals, informing selection of the step function model. Dashed lines indicate 95% confidence bounds.

Figure 8. Comparison of trend model fits for O.C. Fisher Reservoir showing the time-varying 2-year flood distribution: (a) stationary, (b) linear trend, (c) step function, and (d) sinusoidal. Dashed lines indicate posterior mean; shaded regions show 90% credible intervals. Historical perception-threshold period (1853–1915) shown in pink shading. The sinusoidal model (d) achieves the best statistical fit but implies physically implausible cyclic behavior; the step function model (c) was selected based on physical reasoning.

Figure 9. Flood frequency curves for O.C. Fisher Reservoir comparing stationary (blue) and nonstationary (red) analyses. Dashed lines represent posterior predictive estimates; shaded regions show 90% credible intervals. The nonstationary curve reflects current watershed conditions (2021). The Probable Maximum Flood (PMF) is shown as a horizontal dashed line for dam safety reference. Uniform difference across all return periods reflects the decrease in location parameter only.

Table 1. Trend model formulations and applications for urban watersheds.

Model	Formulation	Use-Case and Hydrologic Example
Constant	$μ_{t} = α$	No detectable trend. Rural watershed with no significant land use changes.
Linear	$μ_{t} = α + β t$	Steady trends in flood magnitude. Increasing flood peaks due to gradual urban expansion [5].
Exponential	$μ_{t} = α e^{- β t}$	Rapidly decelerating trends. Slowing trend as watershed stabilizes after land use changes.
Logistic	$μ_{t} = \frac{α}{1 + e^{- β t}}$	Saturating growth behavior. Urban sprawl drives flood risk upward but plateaus as zoning reaches capacity [6].
Power	$μ_{t} = α t^{β}$	Accelerating trends. Impervious surface growth outpaces population in rapidly urbanizing areas.
Quadratic	$μ_{t} = α + β t + γ t^{2}$	Turning points or parabolic behavior. Floods increase then plateau following green infrastructure.
Cubic	$μ_{t} = α + β t + γ t^{2} + δ t^{3}$	Complex evolution with multiple inflection points. A basin affected by sequential changes: deforestation → partial reforestation → late-stage sprawl.
Sinusoidal	$μ_{t} = α + β sin (2 π γ t + δ)$	Oscillatory behavior. Multidecadal climate cycles influencing rainfall variability [27].
Step Function	$μ_{t} = \{\begin{matrix} μ_{1}, & t \leq t_{c} \\ μ_{2}, & t > t_{c} \end{matrix}$	Sudden shifts in flood regime. Significant land cover change in known year.

Table 2. Hypothesis test results for Brays Bayou at Houston.

Test	Null Hypothesis	p-Value	Sig.
Jarque-Bera	Normality	0.0015	**
Ljung-Box Q	No autocorrelation	<1 × 10⁻¹⁵	***
Wald-Wolfowitz	Independence/stationarity	1.84 × 10⁻¹⁰	***
Mann-Whitney U	Homogeneity/stationarity	4.55 × 10⁻¹¹	***
Mann-Kendall	No monotonic trend	1.33 × 10⁻¹³	***
Linear Trend (slope)	$β_{1} = 0$	<1 × 10⁻¹⁵	***
Equal Variance t-test	$μ_{1} = μ_{2}$	4.23 × 10⁻¹³	***
Unequal Variance t-test	$μ_{1} = μ_{2}$	5.91 × 10⁻¹²	***
F-test	$σ_{1}^{2} = σ_{2}^{2}$	8.76 × 10⁻⁷	***
Mixture Model	Unimodality	0.0004	***

Notes: Standard significance codes are listed for reference: ***

p < 0.001

; **

p < 0.01

; *

p < 0.05

; ·

p < 0.1

. Subperiod tests use 1929–1980 vs. 1981–2024.

Table 3. Model comparison for Brays Bayou case study.

Model	k	AIC	BIC	DIC	$Δ$ DIC
Stationary	3	1233.73	1241.23	1228.36	73.06
Linear ( $μ$ )	4	1182.14	1192.14	1180.76	25.45
Logistic ( $μ$ )	4	1179.94	1189.94	1179.23	23.93
Step ( $μ$ )	5	1177.04	1189.54	1162.28	6.98
Linear–Logistic ( $μ$ – $σ$ )	5	1168.70	1181.20	1172.04	16.74
Logistic–Logistic ( $μ$ – $σ$ )	5	1161.46	1173.96	1160.26	4.95
Step–Logistic ( $μ$ – $σ$ )	6	1165.24	1180.24	1155.30	0.00

Notes: k = number of parameters. Bold indicates selected model (lowest DIC with identifiable change point).

Table 4. Flood quantile estimates for Brays Bayou at Houston.

	Stationary		Nonstationary (2024)
AEP (Return)	Posterior Predictive	90% CI	Posterior Predictive	90% CI
0.50 (2-yr)	346	[297, 400]	511	[462, 560]
0.20 (5-yr)	607	[542, 673]	661	[603, 727]
0.10 (10-yr)	744	[674, 815]	749	[675, 836]
0.02 (50-yr)	940	[865, 1046]	924	[798, 1076]
0.01 (100-yr)	993	[912, 1126]	996	[838, 1172]

Notes: AEP = Annual Exceedance Probability; CI = Credible Interval from posterior distribution; Units: cms.

Table 5. Hypothesis test results for O.C. Fisher Reservoir.

Test	Null Hypothesis	p-Value	Sig.
Jarque-Bera	Normality	0.6571
Ljung-Box Q	No autocorrelation	4.08 × 10⁻⁸	***
Wald-Wolfowitz	Independence/stationarity	0.0544	·
Mann-Whitney U	Homogeneity/stationarity	4.23 × 10⁻⁶	***
Mann-Kendall	No monotonic trend	1.01 × 10⁻⁶	***
Linear Trend (slope)	$β_{1} = 0$	2.23 × 10⁻⁷	***
Equal Variance t-test	$μ_{1} = μ_{2}$	5.25 × 10⁻⁷	***
Unequal Variance t-test	$μ_{1} = μ_{2}$	5.41 × 10⁻⁷	***
F-test	$σ_{1}^{2} = σ_{2}^{2}$	0.4213
Mixture Model	Unimodality	0.5650

Notes: Standard significance codes are listed for reference: ***

p < 0.001

; **

p < 0.01

; *

p < 0.05

; ·

p < 0.1

. Subperiod tests use 1916–1960 vs. 1961–2021.

Table 6. Model comparison for O.C. Fisher case study.

Model	k	AIC	BIC	DIC	$Δ$ DIC
Stationary	3	1061.42	1069.41	1051.80	24.96
Linear	4	1044.61	1055.26	1034.06	7.22
Step Function	5	1054.21	1067.52	1029.12	2.28
Sinusoidal	6	1044.27	1060.25	1026.84	0.00

Notes: k = number of parameters. Bold indicates selected model based on physical reasoning despite not having lowest information criteria.

Table 7. Flood quantile estimates for O.C. Fisher Reservoir.

	Stationary		Nonstationary (2021)
AEP (Return)	Posterior Predictive	90% CI	Posterior Predictive	90% CI
0.50 (2-yr)	20	[15, 26]	12	[8, 16]
0.20 (5-yr)	80	[60, 106]	43	[30, 60]
0.10 (10-yr)	165	[123, 220]	85	[59, 120]
0.02 (50-yr)	598	[414, 844]	285	[166, 458]
0.01 (100-yr)	947	[616, 1405]	444	[232, 764]

Notes: AEP = Annual Exceedance Probability; CI = Credible Interval from posterior distribution; Units: cms.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Smith, C.H.; Skahill, B.; Margo, D.A. Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas. Water 2026, 18, 636. https://doi.org/10.3390/w18050636

AMA Style

Smith CH, Skahill B, Margo DA. Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas. Water. 2026; 18(5):636. https://doi.org/10.3390/w18050636

Chicago/Turabian Style

Smith, C. Haden, Brian Skahill, and David A. Margo. 2026. "Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas" Water 18, no. 5: 636. https://doi.org/10.3390/w18050636

APA Style

Smith, C. H., Skahill, B., & Margo, D. A. (2026). Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas. Water, 18(5), 636. https://doi.org/10.3390/w18050636

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Nonstationary Flood Frequency Analysis for Urban Watersheds Using Open-Source Bayesian Software: Contrasting Case Studies from Texas

Abstract

1. Introduction

1.1. The Stationarity Problem

1.2. Bayesian Approaches to Flood Frequency Analysis

1.3. Model Selection for Nonstationary Analysis

1.4. RMC-BestFit Software

1.5. Objectives

2. Materials and Methods

2.1. Likelihood Function

2.2. Nonstationary Likelihood Function

2.3. Bayesian Framework and Prior Information

2.3.1. Prior Information on Parameters

2.3.2. Prior Information on Quantiles

2.4. Detecting Nonstationarity

2.5. Return Period Interpretation and Risk Communication Under Nonstationarity

2.6. Trend Model Formulations

2.7. Probability Distribution Selection

2.8. Model Selection Using Multiple Criteria

2.9. Posterior Inference

2.10. Practical Implementation for Engineering Applications

3. Results

3.1. Case Study Watersheds

3.2. Case Study 1: Brays Bayou at Houston, Texas

3.2.1. Flood Data

3.2.2. Watershed Characteristics

3.2.3. Urbanization History

3.2.4. Hypothesis Testing

3.2.5. Model Estimation and Selection for Brays Bayou

3.2.6. Frequency Analysis Results for Brays Bayou

3.3. Case Study 2: O.C. Fisher Reservoir, West Texas

3.3.1. Quantile Priors from Rainfall-Runoff Modeling

3.3.2. Model Estimation and Selection for O.C. Fisher

3.3.3. Frequency Analysis Results for O.C. Fisher

3.4. Summary of Frequency Curve Contrasts

4. Discussion

4.1. Model Selection and Trend Model Use Cases

4.2. Contrasting Patterns in Frequency Curve Differences

4.3. Implications for Dam Safety and Storage Reallocation

4.4. Limitations and Future Research

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI