4.1. Model Selection and Trend Model Use Cases
The two case studies demonstrate the importance of combining statistical model selection with physical understanding [
39,
40]. At Brays Bayou, the step-logistic model was selected based on lowest DIC and the identifiable 1968 change point corresponding to documented urbanization. At O.C. Fisher, the step function model was selected despite the sinusoidal model achieving superior information criteria across all metrics because the step function has clear causal support from documented brush encroachment and groundwater extraction, while the sinusoidal model’s fitted period does not align with any known physical mechanism.
Model selection under nonstationarity requires balancing parsimony with goodness-of-fit. A recommended practice is to start with a stationary baseline and test incrementally more complex models, checking whether each alternative significantly improves over the previous one. AIC, BIC, and DIC each penalize additional parameters, though BIC imposes a more severe penalty that scales with sample size, and DIC is specifically designed for Bayesian model comparison.
Critically, a model with the smallest information criteria may not always be the most appropriate choice if there is no hydrological or climatological explanation for the trend pattern. The O.C. Fisher case study exemplifies this principle: the sinusoidal model achieved the lowest AIC, BIC, and DIC, yet was rejected because its fitted period of approximately 50–60 years does not align with any known physical mechanism. The step function model, with clear causal support from documented brush encroachment and groundwater extraction, was selected despite its DIC being only 2.28 units higher than the sinusoidal model, a minimal statistical cost for a model with stronger physical justification.
General guidance by watershed type includes (1) early-stage urbanization: start with linear, then test logistic [
5]; (2) rapid urban expansion: test power or quadratic; (3) post-disturbance recovery: test exponential for approach to new equilibrium; (4) known regime shift: test step function; (5) long-term records with climate influence: test sinusoidal cautiously [
27,
72].
4.2. Contrasting Patterns in Frequency Curve Differences
The two case studies exhibit fundamentally different patterns in how stationary and nonstationary frequency curves differ, with important implications for practice.
At O.C. Fisher Reservoir, where only the location parameter () changes over time, the nonstationary frequency curve is shifted uniformly downward from the stationary curve across all return periods. The 100-year flood decreases by 53%, and similar proportional reductions occur throughout the frequency range. This pattern arises because a decrease in with constant simply translates the entire distribution to lower values.
At Brays Bayou, where both location and scale parameters change (increasing , decreasing ), the frequency curves exhibit a more complex relationship. The 2-year flood increases by 48%, but this difference diminishes with decreasing AEP: the 10-year floods differ by less than 1%, and the 100-year floods are essentially identical. This convergence occurs because the increasing mean shifts the distribution upward while the decreasing variance produces a narrower distribution. For frequent events near the median, the upward shift dominates; for extreme events, the narrower spread partially offsets the higher mean.
This finding has practical significance: the most appropriate choice between stationary and nonstationary analysis depends on the decision context. For Brays Bayou, stationary analysis may adequately estimate extreme floods relevant to dam safety, but would substantially underestimate the frequent flooding that drives stormwater infrastructure design, flood insurance rates, and routine flood damages experienced by residents. Practitioners should consider which portion of the frequency curve is most relevant to their specific application.
The physical mechanism underlying the decreasing log-space variance merits further explanation. In a pre-urbanization watershed, flood magnitudes are highly variable because antecedent soil moisture, infiltration capacity, and vegetation interception modulate rainfall-runoff partitioning differently for each event. As impervious surfaces replace permeable landscapes, these modulating factors diminish and the watershed’s response approaches a near-deterministic conversion of rainfall to runoff. This homogenization reduces relative variability (variance in log-space) even as mean flood magnitude increases.
The 90% credible intervals also differ between case studies. At Brays Bayou, nonstationary intervals are narrower for frequent events (where the model explains the regime shift) but wider in the extreme tail due to epistemic uncertainty from additional trend parameters. At O.C. Fisher, nonstationary intervals are wider than stationary across all return periods, reflecting the added epistemic uncertainty from trend parameters despite the improved mean estimate. This time-dependent uncertainty structure underscores the value of the full Bayesian posterior predictive framework.
A practical question is: when does NSFFA enhance decision-making compared to a robust stationary Bayesian analysis? The Brays Bayou case is instructive. For the 100-year flood, stationary and nonstationary estimates are nearly identical (993 versus 996 cms) with overlapping credible intervals, and a stationary analysis would suffice for dam safety. NSFFA adds the most value when: (1) the decision involves the frequency range where estimates diverge, such as stormwater design at Brays Bayou where the 2-year flood differs by 48%; (2) trends produce uniform divergence across all return periods, as at O.C. Fisher; or (3) forward-looking projections are needed for long-lived infrastructure.
4.3. Implications for Dam Safety and Storage Reallocation
The NSFFA results have direct implications for dam safety risk analysis and reservoir operations. For dam safety applications, the nonstationary flow-frequency curve must be transformed to reservoir water levels using reservoir routing software such as RMC-RFA [
73], with results then imported to quantitative risk analysis software for comprehensive dam safety assessment [
74].
In the coming decades, reallocation to balance competing objectives such as flood storage and water supply will undoubtedly be necessary to address climate change impacts. The direction of reallocation depends critically on whether flood hazard is increasing or decreasing at a given site:
For sites where flood hazard is increasing due to climate change or urbanization, more conservative dam safety modifications that provide higher levels of protection will be preferable. Infrastructure investments should anticipate continued growth in flood magnitudes and design for conditions expected at the end of the project life.
For sites where flood hazard is decreasing, as demonstrated at O.C. Fisher, less flood storage capacity will be needed to maintain the same level of flood protection. This reduction in required capacity presents an opportunity to reallocate flood control storage for water supply, thereby enhancing climate resiliency, particularly valuable in the semi-arid western United States where water supplies are increasingly stressed.
Increasing water supply storage will tend to increase the likelihood of spillway discharge at lower water levels, thereby increasing potential flood damages during spillway operations. However, if the flood hazard is decreasing over time, the net increase in these damages could be negligible, allowing for positive net benefits and increased water supply without compromising safety.
4.4. Limitations and Future Research
This study demonstrates practical application of nonstationary flood frequency analysis for urban watersheds, but several important limitations should be acknowledged. First, both case studies employ at-site analysis only, estimating distribution parameters independently at each location. Regional analysis approaches that pool information across multiple sites through hierarchical Bayesian models could substantially improve parameter estimation, particularly for sites with limited record lengths [
26]. Regional hierarchical models that share nonstationary trend parameters across climatically or hydrologically similar sites while preserving site-specific characteristics represent an important direction for future research.
Additionally, both case studies feature strong anthropogenic drivers with clear physical mechanisms. Framework performance in data-scarce basins, mixed-driver systems with ambiguous attribution, and regions with weaker trend signals remains untested and represents important future research scope.
Second, this analysis relies on a single probability distribution (Log-Pearson Type III) selected for consistency with federal guidelines. Distribution choice can interact with trend specification under nonstationarity [
72], and future analyses should evaluate alternative distributions using DIC-based comparison in RMC-BestFit. For short-record sites, informative priors, as demonstrated at O.C. Fisher using quantile priors from rainfall-runoff modeling, and regional hierarchical approaches become especially valuable for stable trend estimation. Sensitivity of results to different prior information sources, including expert elicitation, also warrants future investigation.
Third, trend models use time as the independent variable rather than physical covariates. While physical reasoning supports the selected trend structures qualitatively, this study does not quantitatively demonstrate causal linkage between physical drivers and flood trends. Covariate-based approaches using impervious area fraction, groundwater levels, or climate indices would enable formal attribution and scenario-based projections [
5,
17]. In the interim, the quantile prior mechanism demonstrated at O.C. Fisher (
Section 3.3.1) provides a complementary pathway for integrating climate projections.
Fourth, this study demonstrates single best model selection based on information criteria and physical plausibility. However, when multiple trend models achieve similar statistical fit (as observed at O.C. Fisher where the sinusoidal and step function models differed by only
DIC = 2.28), single-model selection discards potentially useful information. Bayesian Model Averaging (BMA) provides a principled alternative, where posterior model probabilities derived from DIC differences weight predictive distributions across competing models [
37]. This yields ensemble forecasts that account for both parameter and model structural uncertainty, propagating epistemic uncertainty about the “true” trend structure into final quantile estimates.
The combination of multiple probability distributions with multiple trend models can produce a large candidate model space; BMA offers a rigorous framework for managing this complexity. While available in RMC-BestFit, BMA application is beyond the scope of this introductory demonstration but represents an important direction for comprehensive methodological guidance in subsequent publications.
Fifth, RMC-BestFit does not currently support compound trend models that superimpose multiple signals (e.g., a step function combined with a periodic oscillation). For watersheds potentially influenced by both anthropogenic regime shifts and climate teleconnections, such compound formulations or covariate-based approaches using teleconnection indices may be necessary to isolate and properly attribute different sources of nonstationarity.
Finally, both case studies analyze annual maximum series which may not capture sub-annual flood seasonality changes. Partial duration series approaches that model all peaks above a threshold could provide additional insights into changing flood behavior, particularly for urban watersheds where multiple events per year can cause significant damages [
12,
13].
This study treats flood discharges as known values, but stage–discharge relationships introduce measurement uncertainty, particularly for extreme flows exceeding the calibrated rating curve range [
75]. RMC-BestFit addresses this through an uncertain data likelihood that propagates observation-level distributional uncertainty through both stationary and nonstationary analyses, and through Bayesian estimation of segmented rating curves. Demonstration of these capabilities is planned for a forthcoming publication.
Despite these limitations, the case studies successfully demonstrate that open-source Bayesian software can make sophisticated nonstationary methods accessible to the broader engineering community, promoting transparency and reproducibility in flood risk assessment. These advanced approaches (regional analysis, covariate-based models, BMA, alternative distributions) will be addressed in subsequent publications focused on comprehensive methodological guidance.