Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction

De León Pérez, David; Salazar-Galán, Sergio; Francés, Félix

doi:10.3390/w17202932

Open AccessReview

Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction

by

David De León Pérez

^1,*

,

Sergio Salazar-Galán

²

and

Félix Francés

¹

Research Group of Hydrological and Environmental Modelling (GIHMA), Research Institute of Water and Environmental Engineering (IIAMA), Universitat Politècnica de València, 46022 Valencia, Spain

²

Agroecosystems History Laboratory, Universidad Pablo de Olavide, 41013 Sevilla, Spain

^*

Author to whom correspondence should be addressed.

Water 2025, 17(20), 2932; https://doi.org/10.3390/w17202932

Submission received: 17 September 2025 / Revised: 29 September 2025 / Accepted: 30 September 2025 / Published: 11 October 2025

(This article belongs to the Section Hydrology)

Download

Browse Figures

Versions Notes

Abstract

This Scoping Review methodically synthesizes methodological trends in predictive uncertainty (PU) quantification for short-to-seasonal hydrological modeling-based forecasting. The analysis encompasses 572 studies from 2017 to 2024, with the objective of addressing the central question: What are the emerging trends, best practices, and gaps in this field? In accordance with the six-stage protocol that is aligned with PRISMA-ScR standards, 92 studies were selected for in-depth evaluation. The results of the study indicate the presence of three predominant patterns: (1) exponential growth in the applications of machine learning and artificial intelligence; (2) geographic concentration in Chinese, North American, and European watersheds; and (3) persistent operational barriers, particularly in data-scarce tropical regions with limited flood and streamflow forecasting validation. Hybrid statistical-AI modeling frameworks have been shown to enhance forecast accuracy and PU quantification; however, these frameworks are encumbered by constraints in computational demands and interpretability, with inadequate validation for extreme events highlighting critical gaps. The review emphasizes standardized metrics, broader validation, and adaptive postprocessing to enhance applicability, advocating robust frameworks integrating meteorological input to hydrological output postprocessing for minimizing uncertainty chains and supporting water management. This study provides an updated field mapping, identifies knowledge gaps, and prioritizes research for the operational integration of advanced PU quantification.

Keywords:

predictive uncertainty quantification; hydrological forecasting; streamflow prediction; machine learning hydrology; probabilistic forecasting; AI flood forecasting; postprocessing techniques; scoping review water resources; climate variability forecasting; uncertainty chain reduction

Graphical Abstract

1. Introduction

Hydrological forecasting is fundamental to sustainable water resource management because it enables stakeholders to anticipate hydroclimatic variability and make informed decisions. Short-to-seasonal hydrological prediction encompasses forecast horizons from daily (1-day) to seasonal (up to 8 months) time scales, bridging operational flood forecasting and climate-informed water management planning. This temporal range specifically targets the critical gap between numerical weather prediction limits (~15 days) and long-term climate projections (>1 year), where hydrological memory and initial conditions provide predictability beyond meteorological skill. Predictive uncertainty (PU) quantification has emerged as a cornerstone for enhancing forecast reliability within this critical forecasting window.

Nevertheless, three primary bottlenecks persistently constrain probabilistic uncertainty quantification: (1) Methodological challenges—incomplete propagation of meteorological uncertainty through hydrological systems, lack of standardized validation protocols, and inadequate representation of extreme events; (2) Geographic inequities—concentrated validation in temperate, data-rich regions (72% of studies in Chinese, North American, and European watersheds) while 43% of global hydrological disasters occur in underrepresented tropical and arid regions; (3) Operational implementation gaps—persistent disconnect between academic innovation and real-world adoption due to computational demands, interpretability constraints, and limited operational framework standardization (35% adoption rate for Bayesian frameworks). These limitations systematically undermine early warning systems in data-scarce regions where reliable forecasting is most critically needed [1,2].

Recent advancements in Bayesian frameworks, machine learning, and hybrid methodologies have expanded PU quantification capabilities. However, extant syntheses remain fragmented: specialized reviews have addressed individual components including ML postprocessing [3,4], Bayesian frameworks [5], and ensemble streamflow forecasts [6], yet a comprehensive evaluation of emerging trends, cross-methodological comparisons, and operational integration pathways remains absent. This fragmentation impedes identification of scalable, adaptable operational frameworks suitable for diverse hydroclimatic regions. However, there are significant challenges that need to be addressed. These include the need for improved identification of the primary sources of uncertainty and development of scalable, adaptable operational frameworks for diverse regions [7].

The De León Pérez et al. [8] protocol applied in this Scoping Review (ScR) was designed to ensure reproducibility, given its substantial alignment with PRISMA-ScR standards [9,10]. However, it exhibits enhanced flexibility and is particularly well-suited for hydrological sciences. In this context, this Scoping Review addresses the central question: What are the emerging trends, best practices, and existing gaps in predictive uncertainty quantification for short-to-seasonal hydrological forecasting? The inquiry objectives are threefold: first, evaluate contemporary methodologies; second, identify patterns in statistical and machine learning-based approaches; third, detect limitations in existing frameworks. A complementary question guides operational relevance: How can these methodologies bridge the gap between theoretical advancements and operational implementation in diverse hydroclimatic regions?

This ScR analyzed 572 studies from 2017 to 2024, with 92 selected for comprehensive evaluation. The contribution includes updated field mapping, systematic gaps identification, and prioritized research directions for operational integration of advanced PU quantification. The manuscript organization follows: methodology (Section 2), results including foundational pre-2017 methodologies (Section 3), discussion (Section 4), and conclusions (Section 5). Following, a comprehensive description of the content of each Supplementary Material is presented. The present ScR is noteworthy for its transparent presentation of information, a quality that facilitates reader comprehension and paves the way for future research (or actualization) in the field. Consequently, the supplemental documentation provided here encompasses all extant support documentation to reproduce or enhance this ScR in the future.

2. Methodology

This study utilized the structured framework proposed by De León Pérez et al. [8] which was meticulously designed for systematic syntheses in the domain of hydrological sciences (see Figure 1 for a flowchart of this framework). The protocol aligns with 88% of the PRISMA-ScR guidelines and prioritizes transparency, reproducibility, and minimization of selection bias through sequential filtering. The methodology under consideration integrates semiautomated database queries and rigorous document screening. It was predicated on a series of clearly delineated and predefined steps. This approach is intended to circumvent, or at least minimize, the omission of relevant papers and reduce selection bias.

2.1. Literature Search Strategy

A semi-automated search was conducted in Scopus [11] and Web of Science [12], which were selected for the comprehensive coverage of peer-reviewed literature [13,14,15,16]. The search prioritized Scopus and Web of Science due to their curated, peer-reviewed coverage in hydrology and water resources, standardized metadata, and reproducible export tools. A comprehensive search of complementary sources (e.g., Crossref, DOAJ, PubMed, Google Scholar) was conducted for reference chasing and verification purposes. However, these sources were not integrated into the primary corpus to mitigate potential issues such as non-peer-reviewed leakage, large-scale duplication, and indexing heterogeneity. This approach is consistent with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, which emphasize transparent reporting of sources, criteria, and reproducibility (refer to the PRISMA-ScR flowchart in Figure 2). The Boolean search strategy combines the following three conceptual layers:

Uncertainty components: uncertainty analysis, forecast uncertainty, error analysis
Hydrological focus: hydrological forecasting, streamflow prediction, ensemble forecasts
Methodological scope: probabilistic forecasts, machine learning, Bayesian frameworks

The full search equations optimized for each database’s syntax are detailed in Supplementary Material S1. These keywords were applied to refine the semi-automated search of each database, thereby ensuring the inclusion of relevant documents. The search period, which was concluded on 31 December 2024, was designed to provide the most up-to-date information, given that the objective of this ScR was to identify emerging trends in PU (results archived in Supplementary Material S3), because the aim was to identify current trends in the central topic of this ScR, without excluding pre-2017 seminal methodologies (see Section 3.1). Language filters prioritized English/Spanish publications that cover about 91% of the published papers in both databases used [17,18], while domain filters excluded non-hydrological fields (e.g., medicine, economics, social sciences).

2.2. Inclusion/Exclusion Criteria

The implementation of the inclusion and exclusion criteria necessitates a systematic analysis and invariably entails a certain degree of subjective interpretation. Consequently, the research team engaged in a deliberative process to formulate and meticulously implement a set of general criteria for the inclusion and exclusion of studies.

2.2.1. Inclusion Criteria

Forecasting from days to seasonal;
Research focused on predictive uncertainty in hydrological or meteorological forecasting;
Research that identifies Uncertainty sources;
Quantitative methods;
Models with multiple realizations from different inputs, such as ensemble members;
Application of Statistical, Probabilistic, Stochastic, or ML/AI methodologies that include analyzing or evaluating, or reducing the predictive uncertainty;
Postprocessing methodologies;
Hydrological variables (e.g., streamflow, precipitation, temperature…);
Research with performance probabilistic metrics;
Error models;
Research with clear data sources or access to validating.

2.2.2. Exclusion Criteria

Long-term climate projections (>1 year horizon);
Real-time forecast (sub-daily);
Parametric Uncertainty;
Research that does not identify uncertainty sources;
Qualitative or descriptive methods;
Deterministic simulation models;
Non-hydrological variables or domains (water quality, sediment without nexus with forecast, hydropower engineering…);
Research without (or not standardized) quantitative validation (performance metrics);
Research without data sources or access to validating.

The fundamental search equations are listed in Table 1. The complete and final two search equations applied to each scientific database are presented in Supplementary Material S1 and were configured differently in accordance with the specifications and tools of each database. Readers interested in updating, continuing, or expanding this ScR are encouraged to use these equations on a basis and adapt the terms to their needs. On the other hand, an assessment of the keywords was conducted using VOSViewer 1.6.20 [19].

2.3. Documents Referenced by Colleagues or Other Researchers

In addition to the direct searches conducted in the databases, the methodology employed for this ScR involved evaluating the records referenced by De León Pérez et al. [8], obtained from colleagues’ suggestions or manual searches. These records are incorporated into Supplementary Material S3 in a spreadsheet named “REFERENCED” (Table S3.3: Summary Form 1 with documents that have been cited by colleagues or in other research papers). Furthermore, seminal methodologies developed before the search period were included, taking into account that they have significantly influenced the field (see Section 3.1 Referent Methodologies Prior to 2017).

2.4. Document Selection

To evaluate the application of the inclusion and exclusion criteria (and refine them if necessary), four research team meetings were convened. The preliminary inclusion and exclusion criteria were presented during the preliminary meeting. Subsequently, a random sample of the retrieved documents was subjected to a review by one member of the research team and an external colleague. This sample comprises 56 documents, constituting approximately 10% of the total. The objective of this review was to evaluate Cohen’s kappa statistic [20,21]. A cut-off value of 0.6 was established according to Table 2. This value was selected based on the (aforementioned) degree of subjective interpretation that is inherent to every person, which is inevitable. Table 2 presents the range of the categorical strength of agreement according to the Kappa Statistic.

At this stage, the reviewers were members of the research team and an external peer reviewer. The result of the Cohen’s Kappa Statistic was 0.68 (see Supplementary Material S2 in sheet name: 1st_Evaluation, Table S2.1: Kappa Statistic for first evaluation (Title and Abstract)), which indicates that the strength of agreement (Table 2) is in the “Substantial” range for Cohen’s Kappa Statistic and exceeds the established cut-off.

In the subsequent meeting, the results of the cross-selection were presented, and the criteria were refined. Using these modified criteria, a random sample of the selected documents was subjected to a subsequent review by one member of the research team and an external colleague. The sample comprised 31 documents, constituting approximately 11% of the initial set of documents selected for review. The objective of this second review was to evaluate Cohen’s Kappa Statistic for the complete reading (and final selection) of the documents. The result of the Cohen’s Kappa Statistic was 0.68 (see Supplementary Material S2 in a spreadsheet named: 2nd_Evaluation, Table S2.2: Kappa Statistic for second evaluation (Methodology, Results and Conclusions)), which indicates that the strength of agreement (Table 2) is in the “Substantial” range for Cohen’s Kappa Statistic and exceeds the established cut-off.

The results demonstrate that the criteria inclusion/exclusion were clearly defined (with a strength agreement different from 1). Following the establishment of a set of clearly delineated criteria and their subsequent validation, a comprehensive review of the documents retrieved by one of the researchers was performed. It should be noted that this review was conducted by a single reviewer due to constraints on resource availability. A shorter final evaluation of 10 (11%) selected documents was conducted by the research director. The objective of this third review was to evaluate Cohen’s Kappa Statistic for the final selection of documents. The result of the Cohen’s Kappa Statistic was 0.77 (see Supplementary Material S2 in a spreadsheet named: 3rd_Evaluation, Table S2.3: Kappa Statistic for third evaluation (final selection)), which indicates that the strength of agreement (Table 2) is in the “Substantial” range for Cohen’s Kappa Statistic and exceeds the established cut-off.

3. Results

The application of equation search produced the filtered documents incorporated into Form 1: Supplementary Material S3 in sheet name “SCOPUS” (Table S3.1: Summary Form 1 with Scopus Database search) and sheet name “WoS” (Table S3.2: Summary Form 1 with Web of Science Database search); in the same file, the reader may find the identification by document if was duplicated, eighter was selected in the first read (Title and Abstract), either if was selected in the second read (Methodology, Results, and Conclusions). The documents selected in the present ScR were retrieved using the aforementioned methodology and deposited in Form 2 (See Supplementary Material S4). However, the initial results analysis is devoted to the presentation of referent methodologies prior to 2017, which are foundational to comprehending or assessing contemporary trends in PU.

These foundational methodologies have been incorporated to mitigate some temporal bias of the search period and contextualize developments from 2017 to 2024, since the integration of these foundational methodologies with AI/ML, as well as other statistical approaches, is a driving force behind contemporary hybrid frameworks.

3.1. Referent Methodologies Prior to 2017

This ScR focuses on trends in PU evaluation from 2017 to 2024. However, it is essential to acknowledge the foundational methods developed by important researchers prior to this period, which have significantly influenced the field. The first section provides a concise overview of some established methods for assessing PU. It should be noted that these methods are not exclusive because there are numerous alternative approaches.

3.1.1. Bayesian Forecasting System

In the late 1990s, the Bayesian Forecasting System (BFS) was presented by Roman Krzysztofowicz [22], which is based on the theory of Bayesian Forecast Processors (BFP). It combines an “a priori” distribution representing the uncertainty inherent in a process with a likelihood function related to the uncertainty in the process forecasts. Consequently, a posterior distribution conditional on the estimates is obtained [23]. The mathematical and theoretical foundations of BPF are rooted in Bayesian approaches to inference for stationary time series [24].

BFS represents a probabilistic theoretical framework for separating the uncertainty inherent in a deterministic hydrological model into two distinct components. One is generated by the input forecast, namely the Precipitation Uncertainty Processor—PUP [25], which evaluates the output uncertainty under the assumption of no hydrologic uncertainty. The other is the result of the hydrologic models. The Hydrologic Uncertainty Processor—HUP [26] assesses the output uncertainty under the assumption of no uncertainty in precipitation. This assessment is incorporated into an Uncertainty Integrator (INT), which generates the final probabilistic forecasts [27]. An alternative focus is the Precipitation-Dependent Hydrologic Uncertainty Processor (PD-HUP), which postulates that precipitation is the primary source of hydrological uncertainty. Consequently, uncertainty is not incorporated into the hydrological model [28]. For a comprehensive examination of this methodology, refer to [5].

3.1.2. Bayesian Model Averaging

Bayesian Model Averaging—BMA [29,30,31,32,33] is a robust technique that, like HUP, identifies the primary source of uncertainty as residing within the model itself, rather than in the forcings. BMA is distinguished by its superior reliability and accuracy as a statistical method for multiple model ensembles. This results in more competent and reliable predictive ability, which exceeds those obtained through the individual independent use of the models. Consequently, confidence in the effectiveness of the models is reinforced.

Hoeting Jennifer A. et al. [33] indicate that the researchers select a statistical model and presume that this model generates the data, thereby overlooking the potential for the uncertainty inherent in the model. In contrast, BMA addresses the uncertainty in model selection by employing a set of models and then averaging their inferences or predictions by assigning weights to each model using Bayes’ theorem. In this way, posterior probabilities were calculated, and the predictions were averaged, assigning a higher weight to those models that were more congruent with the observed data [34]. This allowed the uncertainty of the models to be averaged in proportion to the certainty of their prediction probability. However, despite the good results, the computational cost of BMA is high because its evaluation requires the use or operation of several models; in other words, the computational time depends on the number of models to be used because the logic of the method implies that the more models used, the better the result, but also the more time and computing power [35,36].

3.1.3. Model Conditional Processor

In 2008, an Italian researcher from the University of Bologna, Ezio Todini, proposed a novel methodology, designated as the Model Conditional Processor (MCP), which contributes to the assessment of PU by providing a systematic approach to combining observations with model forecasts in a multivariate normal space [37]. MCP permits the estimation of a comprehensive PU through the marginalization of parameter uncertainty, as opposed to relying on a single set of parameter values. This approach recognizes the variability in model parameters and their influence on predictions, thereby facilitating a more comprehensive understanding of the uncertainty. The methodology employs the Normal Quantile Transform [38,39,40] to transform both observations and model forecasts into a multivariate normal space. This transformation allows the derivation of joint and conditional probability distributions, thereby facilitating a more accurate assessment of the PU.

Overall, MCP enhances the understanding and management of PU in flood forecasting, which is crucial for effective decision-making in flood risk management, and represents a robust alternative to existing approaches, including the HUP and BMA models; however, it is subject to key limitations. First, its assumption of homoscedastic error variance frequently fails under conditions of heteroscedasticity, which is a common occurrence in hydrological processes. Second, its meta-Gaussian framework may inadequately capture the nonlinear dependencies among hydrological variables, limiting its applicability to complex systems. Finally, MCP’s reliability diminishes in extrapolative scenarios, such as extreme events or climate change contexts, highlighting its reduced robustness outside calibration conditions [41,42,43].

3.1.4. Generalized Likelihood Uncertainty Estimation

Generalized Likelihood Uncertainty Estimation (GLUE) is a statistical method to quantify the PU popularized by Beven in his pioneering paper “The Future of Distributed Models: Model Calibration and Uncertainty Prediction” [44]. According to Beven [45], this methodology represents a technique for estimating uncertainty in distributed hydrological models by evaluating multiple parameter configurations. In contrast to traditional methodologies that aim to identify an optimal set of parameters, this approach is based on attributing likelihood measures to various groupings of model parameters and employing these measures as weighting factors in the model projections.

The GLUE method has been the subject of criticism in the hydrological literature [46,47] because of its inability to provide an accurate estimate of PU. One of its primary shortcomings is its incompatibility with the Bayesian paradigm, which constrains the “learning” capacity of the method and diminishes the precision of parameter estimation. The use of less formal likelihoods in GLUE allows flexibility in parameter selection; however, this results in the generation of flat and imprecise posterior probability distributions. Furthermore, the method overestimates the PU owing to its inability to fit the observational data well with the model. As Mantovan and Todini [47] observed, while GLUE endeavors to address the phenomenon of “equifinality” (where multiple parameter configurations can describe the same phenomenon), the method fails to calculate the correct predictive probability. This is because the GLUE likelihoods, which are not formal probabilities, do not comply with Bayes’ theorem. Consequently, inconsistencies in the predictive results were observed. Furthermore, this inconsistency affects the comparability of the results, which can ultimately lead to risky management decisions. In the papers “Hydrological forecasting uncertainty assessment: Incoherence of the GLUE methodology” [47] and “Comment on: ‘On undermining the science?’ by Keith Beven” [46], the authors conducted a comprehensive analysis and discussion of the GLUE methodology.

In addressing these critiques, Beven et al. [48] adopt a reflective and constructive stance, proposing that under certain strong assumptions, GLUE can attain coherence, contingent upon the utilization of consistent prior information. While acknowledging the challenges posed by multiple sources of uncertainty in practical applications, the authors underscored the flexibility of GLUE, which enables a more nuanced assessment of the uncertainty in hydrological systems. Nevertheless, they also caution against the utilization of elementary formal likelihood functions as they may generate misleading outcomes. Instead, they proposed GLUE adaptability as a more robust alternative to traditional Bayesian approaches. Finally, they emphasized the necessity of a profound comprehension of the informational content of data to enhance uncertainty estimation in hydrological modeling. Subsequent advancements have demonstrated that this methodology can be updated with new data using a Bayesian procedure, thereby refining the uncertainty distribution of the model [49,50].

3.2. Selected Bibliography from Search Strategies

The flowchart (Figure 2) methodically delineates the trajectory of the document search and the results obtained at each stage up to the database obtained for the ScR. Bibliometric networks were produced in VOSViewer using keyword co-occurrence, revealing a notable correlation between the document’s keywords and the research question (see Figure 3 and Figure 4).

The final parameters were determined to be Resolution = 1.07, Attraction = 2.0, and Repulsion = 0. The Scopus map yielded six clusters, while the WoS map yielded five clusters. Co-occurrence maps (Figure 3 and Figure 4) provide substantiation of the thematic alignment with the research question, accentuating clusters with cores in “Prediction,” “Uncertainty,” “Uncertainty Analysis,” “Hydrological Modeling,” and “Forecasting” with elevated connectivity.

A total of 572 documents (see Figure 5 at panel «a») were retrieved from the semi-automatic search of the databases and included in Form 1 (Supplementary Material S2). Most of these documents were published in 2019, followed by a decline until 2021, followed by a recovery in 2022, reaching a level close to that of 2019. However, there was a sharp drop in 2023, and even more so in 2024. The oscillating pattern of production during the research period rises, falls, recovers, and then falls, suggesting a vulnerability of scientific productivity to external shocks. The significant decline in publications in 2020 (Figure 5 at panel «a») corresponds temporally with the onset of the COVID-19 pandemic, which imposed considerable restrictions, particularly on field projects and international collaborations in non-medical domains [51]. The subsequent recovery (2021–2022) can be attributed to the reactivation of funding for climate conservation and adaptation to virtual modalities. The further post-2022 decline points to factors that may be the subject of future bibliometric investigations into thematic saturation or the migration of researchers to more emerging areas, such as applied artificial intelligence and data science.

As illustrated in Figure 5 at panel «b», the selection process is delineated as such: A total of 51% of the records (282 out of 572) successfully advanced to the initial phase, which involved a title and summary review. However, a more stringent filtration process was evident, as only 16% (92 of 572) satisfied the methodological criteria in the comprehensive evaluation.

About the documentary classification (Figure 5 at panel «c»), it is evident that the most significant documentary sources found were articles, which constituted 95.6% of the 92 final documents retrieved. Of these, 90.2% were research articles, and 5.4% were reviews. Conference papers (2.2%) and book chapters (2.2%) constituted a minority of sources. Similarly, the publishers with the highest number of publications on this research topic are Elsevier, Springer Link, and MDPI, which together account for 60% of the articles retrieved Figure 5 at panel «d». As indicated by (Figure 5 at panel «e»), the Journal of Hydrology is responsible for 15% of the publications retrieved, followed by Water Resources Management (13%), Water (11%), and HESS (10%), which are specialized journals with a higher 5-year Impact Factor (2024) than 3.0 in the field of water resources and with significant prestige in the academic community.

A geographical analysis of the 92 documents retrieved reveals that 97 countries are mentioned as the locations of the cases studied (Figure 5 at panel «f»). China was the most frequently mentioned country, with 36 cases (37%), followed by several blocks of countries, including those in North America (Canada and the USA) with 16 cases (16.5%), and Europe with 12 cases (12.4%). These three regions account for more than 65% of the cases studied, which is consistent with the geographical distribution of the top 100 universities in global rankings.

The regional aggregation of observations, with a focus on China, North America, and Europe, aligns with global reviews identifying areas of elevated flood risk and research and data gaps in Africa and specific regions of Asia. This pattern reflects not only exposure and risk, but also data and infrastructure gaps that determine where research is conducted, validated, and transferred to operation [52]. Recent evidence from global flood risk mapping, adjusted for social vulnerability, has identified regions of high risk in countries with high population density and deprivation. This finding indicates that the accessibility and availability of hydrological and socioeconomic data play a critical role in the prioritization and generalization of methods [53].

3.3. Prevalent Methodologies Found

The extant literature from 2017 to 2024 demonstrates a clear evolution in predictive uncertainty (PU) quantification for hydrological forecasting, with two main methodological streams: statistical approaches and machine learning/artificial intelligence (ML/AI) methods (see Figure 6). Statistical methods, including Bayesian frameworks, ensemble techniques, and quantile-based approaches, offer robust probabilistic representations of the forecast uncertainty [31,33,44,54,55,56]. Concurrently, machine learning (ML) and artificial intelligence (AI) methods have rapidly gained traction because of their capacity to capture nonlinearities and leverage extensive datasets [3,4,57].

A prevalent contemporary technique involves the utilization of ensembles to obtain multiple forecast realizations, thereby considering various initial conditions of atmospheric variables. This approach is appealing because the unstable and chaotic dynamics of the global climate system preclude the determination of the initial state of meteorological forecasts with sufficient precision in advance. This finding indicates that the incorporation of multiple model outputs and observational data through advanced postprocessing methodologies has led to the provision of more comprehensive uncertainty bounds [6,58,59].

4. Discussion

The quantification of predictive uncertainty (PU) has emerged as a cornerstone for improving forecast reliability. This ScR analysis of 92 selected studies reveals a paradigmatic shift in uncertainty quantification methodologies over the 2017–2024 period (according to the current AI Revolution), demonstrating not only technological advancement but also a fundamental reconceptualization of how hydrological uncertainty would be approached. This transition coincides with the geographic concentration (about 74%, see Figure 5) observed in the Chinese, Australian, Indian, North American, and European watersheds, suggesting that methodological innovation is closely linked to computational infrastructure availability and institutional research priorities.

The temporal restriction to post-2017 literature captured this transformative period when AI technologies are rising, while preserving connections to foundational pre-2017 methodologies in Section 3.1 analysis. This approach revealed that while classical frameworks maintain theoretical relevance, their operational implementation is overshadowed by hybrid approaches that combine statistical rigor with ML flexibility. However, significant gaps persist in the operational adoption and methodological standardization of PU quantification. From 2017 to 2024, global operational systems have documented a series of forecast inaccuracies that have been exceeded during weather events. These inaccuracies can be primarily attributed to the presence of unquantified precipitation uncertainties [55,58].

In the subsequent sections, the methodologies found in this ScR for postprocessing and time series forecasting uncertainty enhancement of hydrological variables are discussed. In view of the extensive research on ensembles developed by Troin et al. [6] a specific section is not designated for discussion on this topic in this ScR.

4.1. Statistical Methods

Statistical methods provide transparent, interpretable frameworks for uncertainty quantification, with Bayesian and ensemble approaches offering rigorous probabilistic outputs [5,33]. Bayesian frameworks (e.g., HUP, BMA, MCP) demonstrate robust probabilistic rigor; the efficacy of these methods in integrating prior knowledge and generating probabilistic forecasts has been demonstrated [5,60,61]. However, their operational implementation is frequently constrained by computational demands and the necessity for well-defined priors, particularly in regions with limited data or high variability in data [36,54,62].

Darbandsari and Coulibaly [63] proposed an enhanced streamflow forecasting by integrating initial flow conditions: the HUP-BMA method reduced the forecast interval width by 28.42%, and CRPS by 17.86% compared to HUP; however, reliance on Normal Quantile Transformation (NQT) introduced biases in extreme flows, for this reason recently Cui et al. [64] proposed CHUP-BMA, replacing NQT with copulas (e.g., Student t), eliminating distributional assumptions and reducing interval width by 28.42% and CRPS by 17.86% versus HUP-BMA, achieving superior calibration (CR > 90%) with horizons up to 7 days. This approach resolves transformation-induced distortions and improves computational efficiency, thereby advancing probabilistic hydrology in operational settings and demonstrating iterative BMA refinement with CHUP-BMA, thereby addressing the key limitations of its predecessor [63,64].

MCP established another seminal Bayesian framework for PU quantification by estimating conditional probability densities, assuming joint normality in transformed spaces [37]. Barbetta et al. [65] introduced the multi-temporal/multi-model MCP (MCP-MT), which integrates Truncated Normal Distributions (TNDs) to differentiate hydrological phases (e.g., rising limbs and peak flows). This approach reduced the forecast interval widths by 28.42% and CRPS by 17.86% compared to single-temporal approaches. Additionally, Anele et al. [66], MCP has demonstrated adaptability beyond the domain of hydrology. Specifically, the application of MCP in urban water demand forecasting has demonstrated notable efficacy. Through the integration of autoregressive (ARMA), neural network (FFBP-NN), and hybrid models, MCP has been successful in reducing the validation RMSE from 1.677 to 1.329. Additionally, it enhances the NSE to 0.953, achieving 95% coverage within the 90% uncertainty bands.

Romero-Cuellar et al. [67] breakthrough innovation through Gaussian mixture clustering integration (GMCP) that systematically addresses heteroscedastic errors with quantifiable improvements (36.64% sharpness increase, 10.29% containing ratio improvement, 16.66% NSE enhancement in dry catchments). Barbetta et al. [55] built upon Todini’s foundational MCP, extending its application from single-model flood forecasting to multimodel reservoir inflow prediction. This advancement involved the concurrent utilization of up to five deterministic models, a significant enhancement of the original approach with quantifiable improvements: The 1-day forecast demonstrated a 72% reduction in error, and an NSE that increased from 0.86 to 0.98. For the 3-day forecasts, there was a 50% reduction in the error, resulting in an NSE that improved from 0.64 to 0.93.

The collective works [55,65,66,67] constitute a coherent methodological evolution of the MCP, demonstrating the progression from basic urban applications to complex hydrological systems and advanced theoretical extensions. This highlights MCP flexibility but revealed sensitivity to input data quality and the need for dynamic updating in systems with high seasonal variability; however, critical limitations persist, derived from temporally restricted datasets that compromise statistical robustness, the absence of integration with operational meteorological uncertainties, and dependence on normality assumptions that limit extreme event representation [68], and a lack of exhaustive validation against alternative uncertainty quantification methods.

4.2. AI-Driven Approaches

In the present manuscript, a distinction is established between Artificial Intelligence (AI) as the broader field encompassing techniques that enable machines to mimic human intelligence, and Machine Learning (ML) as a specific subset of AI focused on algorithms that learn patterns from data to make predictions. While these terms are frequently used interchangeably in hydrological literature, ML more precisely describes the data-driven methodologies analyzed in this review, including neural networks, support vector machines, and ensemble methods. The term AI is employed when referring to the broader technological revolution and its comprehensive implications for hydrology.

ML/AI approaches, including deep learning architectures (e.g., Long Short-Term Memory-LSTM, Gated Recurrent Unit-GRU, and bidirectional long short-term memory (BLSTM)) and tree-based algorithms (e.g., Random Forest and XGBoost), in addition to hybrid models, have been shown to offer notable advances in capturing complex nonlinear relationships and integrating diverse predictors.

AI-driven methods handle high-dimensional data, capture complex patterns, and improve forecasting skills, particularly when integrated with statistical postprocessing [4,57,69]. Nonetheless, these methodologies are frequently the subject of critique due to their “black box” nature and the risk of overfitting, particularly in the context of rare or extreme events [70,71].

They are also encumbered by significant technical challenges, including the need for model interpretability, trade-offs in hyperparameter optimization, and requirements for substantial computational resources. The incorporation of uncertainty quantification methods is undoubtedly beneficial; however, it introduces a degree of complexity that can impede operational implementation. In the future, research should focus on the development of computationally efficient algorithms that can be implemented in real time. In addition, there is a need to establish standardized protocols for cross-basin validation. These protocols should combine physics-based constraints with data-driven flexibility to create hybrid frameworks that can effectively address the challenges posed by these constraints. Furthermore, the development of explainable AI techniques and automated hyperparameter optimization would considerably augment the practical utility of these sophisticated methodologies in operational water resource management.

The analysis of AI-Driven Hydrological Forecasting Methods demonstrates evidence of remarkable advancements in probabilistic hydrological forecasting, encompassing Bayesian deep learning frameworks [69], hybrid ensemble approaches [57,72], multi-scale variable integration [72], and explainable machine learning with uncertainty quantification [71]. Nonetheless, it should be noted that these findings are subject to certain limitations.

The BLSTM framework proposed by [69] has achieved significant advancements in the multi-step domain. The framework successfully integrated variational inference, a technique that has been demonstrated to enhance the precision of predictions in intricate systems. This finding aligns with the framework’s ability to quantify epistemic and random uncertainties [4]. This framework achieved PICP values exceeding 0.950 for one-day forecasts. However, this performance comes at a computational cost that may be prohibitive for real-time operational systems in resource-constrained environments.

The XGB-GPR-BOA model proposed by Bai et al. [57] exemplifies the tension between methodological sophistication and practical applicability. While achieving superior deterministic accuracy (RMSE: 1.847 m³/s, R²: 0.965) in the Yangtze River Basin, its geographical specificity and limitation to one-step-ahead forecasts highlight a fundamental challenge: advanced AI methods often sacrifice generalizability for performance optimization. RF-GPR-MV [72] further illustrates this pattern, with 15–25% improvements in longer horizons (1- to 12-month forecasts) offset by computational complexity, which hinders real-time operational implementation. This recurring theme across the reviewed studies suggests that recent acceleration of AI/ML adoption in hydrology prioritizes accuracy over operational viability.

In contrast, Fan et al. [71] presented substantial methodological advancements in reservoir inflow forecasting with PI3NN. The PI3NN method addresses the critical gap between performance and explainability, achieving 90% coverage probability while maintaining interpretability through explicit uncertainty decomposition. However, only 23% of the AI-driven studies in the reviewed corpus provided comparable interpretability frameworks, indicating that the field has largely overlooked operational transparency requirements.

4.3. AI-Driven Plus Statistical Frameworks

The integration of ML/AI with Bayesian frameworks represents the most promising methodological evolution identified in this review; however, implementation challenges persist across the analyzed studies. Emerging hybrid approaches, such as ML/AI plus Bayesian frameworks, enhance performance compared to deterministic models and demonstrate robust performance in hydrological forecasting. Reference [73] employed BMA-ensemble approach achieved 80% NSE/R² improvements but introduces systematic biases through Box–Cox transformations that compromise extreme-flow representation—a critical limitation given the underrepresentation of extreme events across the reviewed corpus (<12% addressing 100-year floods).

In contrast, Cui et al. [74] proposed copula-HUP paired with DA-LSTM-RED, which achieved substantial performance gains (10–17% MAE reduction and 17.86% CRPS improvement compared to traditional HUP-BMA methods) but at elevated computational expenses for copula calibration.

While Li et al. [73] emphasized the capacity of Bayesian frameworks to integrate the strengths of multiple models, their deterministic forecasts are characterized by the absence of explicit uncertainty quantification and sensitivity to input transformations. Conversely, Cui et al. [74] advanced operational applicability by implementing attention mechanisms that prioritize critical variables (e.g., precipitation) and temporal dependencies. Nonetheless, this approach is associated with elevated computational expenses for copula calibration. This exemplifies the fundamental tension identified throughout the ScR. The reviewed studies consistently demonstrate that Bayesian-AI hybrids require specialized expertise for implementation, creating interoperability barriers, such as standardized APIs for basin agencies or operational users. This finding aligns with the geographic concentration observed in this ScR (Figure 5 at panel «f»), where advanced methodologies cluster in well-resourced research institutions rather than in operational water management agencies.

The collective evidence retrieved suggests that while hybrid frameworks successfully harmonize precision and uncertainty resolution, their computational and technical requirements may perpetuate the research-practice gap that limits real-world uncertainty quantification deployment.

4.4. Final Remarks

The ScR provides a comprehensive overview of recent advances in the assessment of predictive uncertainty and innovations in short-term and seasonal hydrological forecasting. The analysis in this ScR, based on 92 selected papers (see Supplementary Material S4), revealed trends, innovative approaches, and best practices that can guide future development in this field. The identified patterns demonstrate both remarkable progress and persistent limitations, which limit operational adoption.

The findings and knowledge gaps identified and derived from the state-of-the-art examination to answer the research question are presented below, according to the records from the consulted databases and documents collected from 2017 to December 2024. Significant progress has been made in developing robust methodologies, including Bayesian, AI-driven, and ensemble techniques. These methods have enhanced the quantification and reduction in the PU, particularly for short- to seasonal forecasting horizons. However, the operational application of these methods remains limited, particularly in regions characterized by complex hydroclimatic conditions or limited resources.

Bayesian approaches have evolved from theoretical frameworks to hybrid implementations that leverage AI-driven capabilities, while maintaining probabilistic rigor. However, the review revealed that computational demands and reliance on well-defined priors can curtail their applicability in data-scarce regions, creating an equity gap between resource-rich and resource-constrained operational environments. In addition, the integration of Bayesian frameworks with machine learning has emerged as a promising trend, offering enhanced predictive accuracy and uncertainty quantification. However, this synergy often suffers from overfitting and interpretability issues, particularly in high-dimensional datasets.

Based on the research developed by Troin et al. [6], it is evident that ensembles represent a highly effective strategy for capturing a broader range of potential outcomes and providing a more comprehensive characterization of PU. By leveraging diverse outputs from ensemble members, ensemble methods can enhance the robustness and reliability of forecasts by incorporating several initial conditions into hydrological applications.

The integration of remote sensing data and global climate predictors has yielded substantial advances in reducing forecasting uncertainties. This approach leverages the extensive spatial coverage and accessibility afforded by satellite-derived data. Nevertheless, the temporal resolution constraints and biases in satellite-derived datasets can impede the efficacy of this approach by amplifying uncertainties rather than mitigating them.

Although postprocessing techniques have demonstrated effectiveness across diverse hydroclimatic contexts, they remain underutilized in operational settings because of the lack of adaptive functions, factors, and parameters that enable regional customization without statistical rigidity. This finding underscores the persistent disconnection between academic innovation and operational implementation.

While AI-driven approaches offer flexibility and the ability to capture nonlinear patterns, they are not devoid of limitations. The dependence on large datasets and computational resources makes their implementation challenging in regions with a limited technical infrastructure. Additionally, the “black box” nature of these models can give rise to concerns regarding their interpretability and reliability, particularly in scenarios where decisions of significant importance are being made.

Finally, the regional concentration of studies may restrict the applicability of their findings to other areas with contrasting hydrological and climatic conditions; however, this limitation extends beyond simple geographic bias. The ScR revealed a systematic underrepresentation of data-scarce regions, where uncertainty quantification is most critical for disaster risk reduction and enhances water management and governance.

4.5. Limitations

The limitations identified in this ScR reflect both the methodological constraints and broader systemic challenges in hydrological uncertainty research. The heterogeneity of the methodological approaches employed in the reviewed studies may present challenges in directly comparing results.

Furthermore, it is important to acknowledge that a comprehensive ScR may be constrained by the lack of comprehensive coverage of all relevant studies owing to limitations in accessing databases and publications. Notwithstanding the implementation of rigorous methodologies and the establishment of inclusion criteria, there is invariably a risk of publication bias. Consequently, studies such as those found in the gray literature that present significant results may not have been published and, therefore, have not been incorporated into this review. Nevertheless, the value of this ScR should not be underestimated, as it provides an overview of the current state of research in this field following a rigorous protocol [8]. Indeed, it is highly relevant because of its timeliness at a time of rapid emergence of AI-driven methods and their combination with statistical methods, with the latter having a longer tradition in the field.

In terms of unifying the criteria for selecting articles, it should be noted that although an initial research question and a protocol of “clear” rules for selecting and including documents were established, there is an inevitable degree of subjectivity involved in the selection. At least four meetings were held to define the final inclusion and exclusion criteria. These were aimed at refining the precise focus of the research and thus the research questions.

4.6. On Future Research Directions

Despite advances in reducing the uncertainty in hydrological forecasting, several knowledge gaps persist and require further research. Some areas that require more attention are as follows:

Choosing the primary source of uncertainty remains a challenge. Therefore, it is necessary to develop clear guidelines for selecting an ideal approach, depending on the situation.
Postprocessing techniques have great potential for refining forecasts; however, their large-scale operational implementation remains limited. Further studies are required for the medium- and long-term horizons.
Most advances have concentrated on forecasting streamflow and precipitation. However, there is a lack of research on reducing uncertainty in the forecasts of other key hydrological variables, such as water quality, soil moisture, and water tables.
The reviewed studies revealed the development of predictors based on remotely sensed data. Integrating sources, such as radar, satellites, and global climate indices, is an underexploited opportunity to reduce uncertainty.

Future research endeavors must concentrate on several pivotal aspects to further understand and administer the uncertainty in seasonal hydrological forecasts. Primarily, it is imperative to formulate and appraise modeling methodologies that can more effectively encapsulate the variability and intricacy of hydrologic systems. This encompasses enhancing the depiction of physical processes, incorporating reliable observational data, and investigating novel modeling techniques such as artificial intelligence and machine learning approaches.

Future research opportunities include the development of hybrid frameworks integrating artificial intelligence and machine learning for capturing complex nonlinear relationships. Among the emerging methodological approaches, distributional regression techniques represent a particularly promising research direction. Recent studies have begun exploring methods that generate full probability distributions of target variables rather than single deterministic values, thereby providing richer representations of uncertainty that capture not only central tendencies but also variability and potential extremes [75,76]. Extension toward nonstationary multi-model approaches with adaptive clustering, integration with ensemble meteorological forecasts, implementation of robust spatio-temporal cross-validation, explicit treatment of meteorological forecast uncertainties, and evaluation of quantifiable socioeconomic benefits, which are fundamental elements for establishing practical operational utility in integrated water resource management under climate change conditions and increasing hydroclimatological variability.

Furthermore, it is necessary to cultivate enhanced interdisciplinary collaboration, thereby addressing uncertainty from multiple perspectives. Experts in hydrology, climatology, mathematical modeling, and water resource management must collaborate to develop integrated and holistic approaches. The establishment of research networks and implementation of collaborative projects can facilitate the exchange of knowledge and application of best practices on a global scale.

It is imperative that future research prioritize the practical applications of these advancements within specific contexts. Conducting case studies across various regions and watershed types can yield invaluable insights into the efficacy of disparate uncertainty reduction methodologies and their relevance to disparate scenarios. This encompasses an assessment of the economic and social ramifications of hydrological forecasts, in addition to the effective communication of uncertainty to stakeholders and decision-makers.

In addressing the central research question, which explores emerging trends, best practices, and gaps in predictive uncertainty quantification, this ScR unveils a landscape of significant advancements in hybrid frameworks. However, the study also reveals persistent limitations in operational adoption and geographic validation. Moreover, the secondary inquiry underscores the significance of hybrid methodologies that seamlessly integrate the rigor of statistical standards with AI adaptability. These methodologies are designed to promote equitable access to data in regions facing scarcity, while concurrently enabling a seamless transition from academic innovation to practical hydrological applications in the face of escalating climatic variability. The findings incorporated in Table 3 and Figure 7 of this study are consistent with the principles of ScR, particularly the “mapping” and “charting” aspects of PRISMA-ScR (with an adapted protocol for the hydrological sciences). These principles advocate that a ScR should culminate in clear findings and identified gaps, thereby establishing a foundation for future research that builds upon the methodological patterns and gaps identified [77]. Below, Table 3 condenses the key gaps, priority research directions, and operational implications, while Figure 7 illustrates their dynamic interconnections through a Sankey diagram, highlighting how methodological gaps can inform future strategies for more robust quantification

5. Conclusions

This comprehensive Scoping Review of 92 studies from 2017 to 2024 elucidates a field undergoing methodological transformation, where traditional statistical frameworks are enhanced by machine learning innovations to address predictive uncertainty in short-to-seasonal hydrological forecasting. In addressing the fundamental question, which delves into emergent trends, optimal practices, and extant lacunae, three pivotal findings emerge: (1) The evolution toward Bayesian-ML hybrid methodologies has been shown to achieve substantial accuracy improvements. However, this evolution has also revealed a systematic disconnect between academic innovation and operational adoption. This disconnect is exacerbated by computational demands that create equity barriers. (2) Geographic and validation biases have been shown to limit generalizability. This limitation is particularly pronounced in data-scarce tropical and semi-arid regions. This limitation underscores the need for inclusive strategies in global disaster risk reduction. (3) Integration challenges have been shown to demand frameworks balanced between methodological rigor and practical viability. These frameworks must prioritize extreme event representation and operational transparency.

The secondary inquiry, which centers on the potential of these methodologies to span the divide between theoretical advancements and operational implementation in diverse hydroclimatic regions, underscores the significance of a robust framework that integrates hydrological postprocessing as a pivotal instrument to augment forecast accuracy and reliability. This framework must span from meteorological input postprocessing to hydrological postprocessing, minimizing the uncertainty chain for optimal water resource management, planning, and handling amid growing hydroclimatic variability.

In summary, while uncertainty quantification has achieved technical sophistication, future progress must emphasize geographic equity, standardized validations, and integrated postprocessing. This will allow for the translation of advancements into equitable and operational tools for global water sustainability.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/w17202932/s1. 1. Supplementary Material S1: Microsoft Excel file containing a spreadsheet (Table S1.1: Search equations applied to each scientific database) that includes the search equations applied. 2. Supplementary Material S2: Microsoft Excel file containing three spreadsheets: a. The first element designated “1st_Evaluation” (Table S2.1: Kappa Statistic for first evaluation (Title and Abstract)), constitutes the Kappa Statistic calculated for the first read. b. The second one, designated “2nd_Evaluation” (Table S2.2: Kappa Statistic for second evaluation (Methodology, Results and Conclusions)), constitutes the Kappa Statistic calculated for the second read. c. The third one, designated “3rd_Evaluation” (Table S2.3: Kappa Statistic for third evaluation (final selection)), constitutes the Kappa Statistic calculated for the final selection; 2. Supplementary Material S3: Microsoft Excel file containing four spreadsheets: a. The first one, designated “SCOPUS” (Table S3.1: Summary Form 1 with Scopus Database search), functions as the synopsis table of Form 1, derived from the Scopus Database search results. It encompasses a document extracted and retrieved from the implementation of semi-automatic filters within the Scopus platform. b. The second one, designated “WoS” (Table S3.2: Summary Form 1 with Web of Science Database search), is a synopsis table of Form 1 derived from the Web of Science Database search results. It encompasses documents that were extracted and retrieved from the implementation of semi-automatic filters on the Web of Science platform. c. The third one, designated “REFERENCED” (Table S3.3: Summary Form 1 with the records referenced by De León Pérez et al. (2024) [8]), constitutes the synopsis table of Form 1 extracted from the records referenced by De León Pérez et al. (2024) [8]. d. The final spreadsheet is designated “SELECTED_SUMMARY” (Table S3.4: Summary form 1 with a comprehensive summary of all retrieved documents), this table constitutes a comprehensive summary of all retrieved documents; 4. Supplementary Materials S4: A Microsoft Excel spreadsheet containing a table (Table S4.1: Summary Form 2) that comprehensively analyzes and discriminates the methods, scales, performance metrics, and most of the important data (about the research topic, PU) of each document was selected to be included in the ScR; 5. Supplementary Materials S5: A Microsoft Excel spreadsheet containing a table (Table S5.1: Summary of Communication approaches for decision-makers) that analyzes several documents found with uncertainty communication approaches for decision-makers; 6. Supplementary Material S6: A Microsoft Excel spreadsheet containing a table (Table S6.1 Summary of the theoretical articles and books found) that analyzes several documents found with PU theoretical approaches; 7. Supplementary Material S7: A PDF document containing an extended narrative description of all documents and methods found in the present ScR [83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180].

Author Contributions

Conceptualization, D.D.L.P., S.S.-G. and F.F.; Methodology, D.D.L.P.; Validation, S.S.-G. and F.F.; Formal Analysis, D.D.L.P. and S.S.-G.; Investigation, D.D.L.P.; Data Curation, D.D.L.P.; Writing—Original Draft, D.D.L.P.; Writing—Review and Editing, D.D.L.P., S.S.-G. and F.F.; Visualization, D.D.L.P.; Supervision, S.S.-G. and F.F.; Project Administration, F.F.; Funding Acquisition, D.D.L.P., S.S.-G. and F.F. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the Colombian Ministry of Science, Technology, and Innovation (MINCIENCIAS) through the Call for Doctorates Abroad 885-2 (D.D.L.P.); by the Valencian Regional Government through the WATER4CAST 2.0 (CIPROM/2023/5) research project (D.D.L.P. and F.F.); Spanish Ministry of Science and Innovation through TETISPREDICT (PID2022-141631OB-I00) research project (D.D.L.P. and F.F.); and S.S.G. by research talent recruitment programme “EMERGIA”, Call 2021, Consejería de Universidad, Investigación e Innovación, Junta de Andalucía, Spain (EMC21_00413). The APC was funded by the Universitat Politècnica de València for open access through the aid program of the Vice-Rectorate for Research.

Data Availability Statement

All the filtered databases and search equations are available in the Supplementary Material. The papers’ access is according to each journal’s policies, but with forms 1 and 2 (Supplementary Materials S2 and S3), the readers can obtain a very good approximation of all the information contained in each selected paper.

Acknowledgments

The authors wish to acknowledge Universitat Politècnica de València for access to the scientific databases used in this ScR. The authors express their gratitude to the 2 anonymous reviewers, the editor, and his/her assistant for their prompt and thorough review of this research article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

DeChant, C.M.; Moradkhani, H. Toward a reliable prediction of seasonal forecast uncertainty: Addressing model and initial condition uncertainty with ensemble data assimilation and Sequential Bayesian Combination. J. Hydrol. 2014, 519, 2967–2977. [Google Scholar] [CrossRef]
Koutsoyiannis, D.; Efstratiadis, A.; Georgakakos, K.P. Uncertainty assessment of future hydroclimatic predictions: A comparison of probabilistic and scenario-based approaches. J. Hydrometeorol. 2007, 8, 261–281. [Google Scholar] [CrossRef]
Papacharalampous, G.; Tyralis, H. A review of machine learning concepts and methods for addressing challenges in probabilistic hydrological post-processing and forecasting. Front. Water 2022, 4, 961954. [Google Scholar] [CrossRef]
Ghobadi, F.; Kang, D. Application of Machine Learning in Water Resources Management: A Systematic Literature Review. Water 2023, 15, 620. [Google Scholar] [CrossRef]
Han, S.; Coulibaly, P. Bayesian flood forecasting methods: A review. J. Hydrol. 2017, 551, 340–351. [Google Scholar] [CrossRef]
Troin, M.; Arsenault, R.; Wood, A.W.; Brissette, F.; Martel, J.L. Generating Ensemble Streamflow Forecasts: A Review of Methods and Approaches Over the Past 40 Years. Water Resour. Res. 2021, 57, e2020WR028392. [Google Scholar] [CrossRef]
McMillan, H.K.; Westerberg, I.K.; Krueger, T. Hydrological data uncertainty and its implications. Wiley Interdiscip. Rev. Water 2018, 5, e1319. [Google Scholar] [CrossRef]
De León Pérez, D.; Acosta Vega, R.; Salazar Galán, S.; Aranda, J.Á.; Francés García, F. Toward Systematic Literature Reviews in Hydrological Sciences. Water 2024, 16, 436. [Google Scholar] [CrossRef]
Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Br. Med. J. 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
Page, M.J.; Moher, D.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. PRISMA 2020 explanation and elaboration: Updated guidance and exemplars for reporting systematic reviews. Br. Med. J. 2021, 372, n160. [Google Scholar] [CrossRef]
Elsevier B.V. Scopus® Scopus Preview. Available online: https://www.scopus.com (accessed on 7 January 2025).
Clarivate Analytics. Web Of Science®. Available online: https://www.webofscience.com/wos (accessed on 7 January 2025).
Singh, V.K.; Singh, P.; Karmakar, M.; Leta, J.; Mayr, P. The journal coverage of Web of Science, Scopus and Dimensions: A comparative analysis. Scientometrics 2021, 126, 5113–5142. [Google Scholar] [CrossRef]
Martín-Martín, A.; Thelwall, M.; Orduna-Malea, E.; Delgado López-Cózar, E. Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: A multidisciplinary comparison of coverage via citations. Scientometrics 2021, 126, 871–906. [Google Scholar] [CrossRef] [PubMed]
Visser, M.; van Eck, N.J.; Waltman, L. Large-scale comparison of bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic. Quant. Sci. Stud. 2021, 2, 20–41. [Google Scholar] [CrossRef]
Martín-Martín, A.; Orduna-Malea, E.; Thelwall, M.; Delgado López-Cózar, E. Google Scholar, Web of Science, and Scopus: A systematic comparison of citations in 252 subject categories. J. Informetr. 2018, 12, 1160–1177. [Google Scholar] [CrossRef]
Zhu, J.; Liu, W. A tale of two databases: The use of Web of Science and Scopus in academic papers. Scientometrics 2020, 123, 321–335. [Google Scholar] [CrossRef]
Mongeon, P.; Paul-Hus, A. The journal coverage of Web of Science and Scopus: A comparative analysis. Scientometrics 2016, 106, 213–228. [Google Scholar] [CrossRef]
Centre for Science and Technology Studies VOSviewer 1.6.20. Available online: https://www.vosviewer.com/ (accessed on 31 January 2025).
Landis, J.R.; Koch, G.G. The measurement of observer agreement for categorical data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef]
Soo, A. Measuring Observer Agreement on Categorical Data. Ph.D. Thesis, University of Calgary, Calgary, AB, Canada, 2015. [Google Scholar]
Krzysztofowicz, R. Bayesian theory of probabilistic forecasting via deterministic hydrologic model. Water Resour. Res. 1999, 35, 2739–2750. [Google Scholar] [CrossRef]
Krzysztofowicz, R. Bayesian Models of Forecasted Time Series. J. Am. Water Resour. Assoc. 1985, 21, 805–814. [Google Scholar] [CrossRef]
Winkler, R.L. A Bayesian Approach to Nonstationary Processes; Technical Report; Stanford University: Stanford, CA, USA, 1975; OLK_NSF_94, 1–32. [Google Scholar]
Kelly, K.S.; Krzysztofowicz, R. Precipitation uncertainty processor for probabilistic river stage forecasting. Water Resour. Res. 2000, 36, 2643–2653. [Google Scholar] [CrossRef]
Krzysztofowicz, R.; Kelly, K.S. Hydrologic uncertainty processor for probabilistic river stage forecasting. Water Resour. Res. 2000, 36, 3265–3277. [Google Scholar] [CrossRef]
Krzysztofowicz, R. Integrator of uncertainties for probabilistic river stage forecasting: Precipitation-dependent model. J. Hydrol. 2001, 249, 69–85. [Google Scholar] [CrossRef]
Krzysztofowicz, R.; Herr, H.D. Hydrologic uncertainty processor for probabilistic river stage forecasting: Precipitation-dependent model. J. Hydrol. 2001, 241, 46–48. [Google Scholar] [CrossRef]
Raftery, A.E.; Gneiting, T.; Balabdaoui, F.; Polakowski, M. Using Bayesian Model Averaging to Calibrate Forecast Ensembles. Am. Meteorol. Soc. 2005, 133, 1155–1174. [Google Scholar] [CrossRef]
Kass, R.E.; Raftery, A.E. Bayes Factors. J. Am. Stat. Assoc. 1995, 90, 773–795. [Google Scholar] [CrossRef]
Raftery, A.E.; Madigan, D.; Hoeting, J.A. Bayesian Model Averaging for Linear Regression Models. J. Am. Stat. Assoc. 1997, 92, 179–191. [Google Scholar] [CrossRef]
Raftery, A.E. Bayesian Model Selection in Structural Equation Models. Sociol. Methodol. 1995, 25, 111–163. [Google Scholar] [CrossRef]
Hoeting Jennifer, A.; Madigan, D.; Raftery Adrian, E.; Volinsky Chris, T. Bayesian Model Averaging: A Tutorial. Stat. Sci. 1999, 14, 382–417. [Google Scholar] [CrossRef]
Vrugt, J.A.; Robinson, B.A. Treatment of uncertainty using ensemble methods: Comparison of sequential data assimilation and Bayesian model averaging. Water Resour. Res. 2007, 43, W01411. [Google Scholar] [CrossRef]
Fragoso, T.M.; Neto, F.L. Bayesian model averaging: A systematic review and conceptual classification. arXiv 2015. [Google Scholar] [CrossRef]
Fragoso, T.M.; Bertoli, W.; Louzada, F. Bayesian Model Averaging: A Systematic Review and Conceptual Classification. Int. Stat. Rev. 2018, 86, 1–28. [Google Scholar] [CrossRef]
Todini, E. A model conditional processor to assess predictive uncertainty in flood forecasting. Int. J. River Basin Manag. 2008, 6, 123–137. [Google Scholar] [CrossRef]
Van der Waerden, B.L. Order tests for the two-sample problem and their power. Indag. Math. Proc. 1952, 55, 453–458. [Google Scholar] [CrossRef]
Van der Waerden, B.L. Order Tests for the Two-Sample Problem (second communication). Indag. Math. Proc. 1953, 56, 303–310. [Google Scholar] [CrossRef]
Van der Waerden, B.L. Order Tests for the Two-Sample Problem (third communication). Indag. Math. Proc. 1953, 56, 311–316. [Google Scholar] [CrossRef]
Romero-Cuellar, J.; Abbruzzo, A.; Adelfio, G.; Francés, F. Hydrological post-processing based on approximate Bayesian computation (ABC). Stoch. Environ. Res. Risk Assess. 2019, 33, 1361–1373. [Google Scholar] [CrossRef]
Berthet, L.; Bourgin, F.; Perrin, C.; Viatgé, J.; Marty, R.; Piotte, O. A crash-testing framework for predictive uncertainty assessment when forecasting high flows in an extrapolation context. Hydrol. Earth Syst. Sci. 2020, 24, 2017–2041. [Google Scholar] [CrossRef]
Coccia, G.; Todini, E. Recent developments in predictive uncertainty assessment based on the model conditional processor approach. Hydrol. Earth Syst. Sci. 2011, 15, 3253–3274. [Google Scholar] [CrossRef]
Beven, K.; Binley, A. The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 1992, 6, 279–298. [Google Scholar] [CrossRef]
Beven, K. Prophecy, reality and uncertainty in distributed hydrological modelling. Adv. Water Resour. 1993, 16, 41–51. [Google Scholar] [CrossRef]
Todini, E.; Mantovan, P. Comment on: “On undermining the science?” by Keith Beven. Hydrol. Process. 2007, 21, 1633–1638. [Google Scholar] [CrossRef]
Mantovan, P.; Todini, E. Hydrological forecasting uncertainty assessment: Incoherence of the GLUE methodology. J. Hydrol. 2006, 330, 368–381. [Google Scholar] [CrossRef]
Beven, K.J.; Smith, P.J.; Freer, J.E. So just why would a modeller choose to be incoherent? J. Hydrol. 2008, 354, 15–32. [Google Scholar] [CrossRef]
Beven, K. Facets of uncertainty: Epistemic uncertainty, non-stationarity, likelihood, hypothesis testing, and communication. Hydrol. Sci. J. 2016, 61, 1652–1665. [Google Scholar] [CrossRef]
Beven, K.; Binley, A. GLUE: 20 years on. Hydrol. Process. 2014, 28, 5897–5918. [Google Scholar] [CrossRef]
Wen, J.; Wan, C.; Ye, Q.; Yan, J.; Li, W. Disaster Risk Reduction, Climate Change Adaptation and Their Linkages with Sustainable Development over the Past 30 Years: A Review. Int. J. Disaster Risk Sci. 2023, 14, 1–13. [Google Scholar] [CrossRef]
Beevers, L.; Popescu, I.; Pregnolato, M.; Liu, Y.; Wright, N. Identifying hotspots of hydro-hazards under global change: A worldwide review. Front. Water 2022, 4, 879536. [Google Scholar] [CrossRef]
Fox, S.; Agyemang, F.; Hawker, L.; Neal, J. Integrating social vulnerability into high-resolution global flood risk mapping. Nat. Commun. 2024, 15, 3155. [Google Scholar] [CrossRef]
Han, S.; Coulibaly, P. Probabilistic flood forecasting using hydrologic uncertainty processor with ensemble weather forecasts. J. Hydrometeorol. 2019, 20, 1379–1398. [Google Scholar] [CrossRef]
Barbetta, S.; Sahoo, B.; Bonaccorsi, B.; Nanda, T.; Chatterjee, C.; Moramarco, T.; Todini, E. Addressing effective real-time forecasting inflows to dams through predictive uncertainty estimate. J. Hydrol. 2023, 620, 129512. [Google Scholar] [CrossRef]
Acharya, S.C.; Babel, M.S.; Madsen, H.; Sisomphon, P.; Shrestha, S. Comparison of different quantile regression methods to estimate predictive hydrological uncertainty in the Upper Chao Phraya River Basin, Thailand. J. Flood Risk Manag. 2020, 13, e12585. [Google Scholar] [CrossRef]
Bai, H.; Li, G.; Liu, C.; Li, B.; Zhang, Z.; Qin, H. Hydrological probabilistic forecasting based on deep learning and Bayesian optimization algorithm. Hydrol. Res. 2021, 52, 927–943. [Google Scholar] [CrossRef]
Valdés-Pineda, R.; Valdés, J.B.; Wi, S.; Serrat-Capdevila, A.; Roy, T. Improving operational short-to medium-range (Sr2mr) streamflow forecasts in the upper zambezi basin and its sub-basins using variational ensemble forecasting. Hydrology 2021, 8, 188. [Google Scholar] [CrossRef]
Xu, J.; Anctil, F.; Boucher, M.A. Exploring hydrologic post-processing of ensemble streamflow forecasts based on affine kernel dressing and non-dominated sorting genetic algorithm II. Hydrol. Earth Syst. Sci. 2022, 26, 1001–1017. [Google Scholar] [CrossRef]
Zhang, X.; Song, S.; Guo, T. Nonlinear Segmental Runoff Ensemble Prediction Model Using BMA. Water Resour. Manag. 2024, 38, 3429–3446. [Google Scholar] [CrossRef]
Zhong, Y.; Guo, S.; Ba, H.; Xiong, F.; Chang, F.J.; Lin, K. Evaluation of the BMA probabilistic inflow forecasts using TIGGE numeric precipitation predictions based on artificial neural network. Hydrol. Res. 2018, 49, 1417–1433. [Google Scholar] [CrossRef]
Zhou, J.; Feng, K.; Liu, Y.; Zhou, C.; He, F.; Liu, G.; He, Z. A Hydrologic Uncertainty Processor Using Linear Derivation in the Normal Quantile Transform Space. Water Resour. Manag. 2020, 34, 3649–3665. [Google Scholar] [CrossRef]
Darbandsari, P.; Coulibaly, P. HUP-BMA: An Integration of Hydrologic Uncertainty Processor and Bayesian Model Averaging for Streamflow Forecasting. Water Resour. Res. 2021, 57, e2020WR029433. [Google Scholar] [CrossRef]
Cui, Z.; Guo, S.; Chen, H.; Liu, D.; Zhou, Y.; Xu, C.Y. Quantifying and reducing flood forecast uncertainty by the CHUP-BMA method. Hydrol. Earth Syst. Sci. 2024, 28, 2809–2829. [Google Scholar] [CrossRef]
Barbetta, S.; Coccia, G.; Moramarco, T.; Brocca, L.; Todini, E. The multi temporal/multi-model approach to predictive uncertainty assessment in real-time flood forecasting. J. Hydrol. 2017, 551, 555–576. [Google Scholar] [CrossRef]
Anele, A.O.; Todini, E.; Hamam, Y.; Abu-Mahfouz, A.M. Predictive uncertainty estimation in water demand forecasting using the model conditional processor. Water 2018, 10, 475. [Google Scholar] [CrossRef]
Romero-Cuellar, J.; Gastulo-Tapia, C.J.; Hernández-López, M.R.; Sierra, C.P.; Francés, F. Towards an Extension of the Model Conditional Processor: Predictive Uncertainty Quantification of Monthly Streamflow via Gaussian Mixture Models and Clusters. Water 2022, 14, 1261. [Google Scholar] [CrossRef]
Beneyto, C.; Vignes, G.; Aranda, J.Á.; Francés, F. Sample Uncertainty Analysis of Daily Flood Quantiles Using a Weather Generator. Water 2023, 15, 3489. [Google Scholar] [CrossRef]
Ghobadi, F.; Kang, D. Multi-Step Ahead Probabilistic Forecasting of Daily Streamflow Using Bayesian Deep Learning: A Multiple Case Study. Water 2022, 14, 3672. [Google Scholar] [CrossRef]
Kasiviswanathan, K.S.; Sudheer, K.P. Methods used for quantifying the prediction uncertainty of artificial neural network based hydrologic models. Stoch. Environ. Res. Risk Assess. 2017, 31, 1659–1670. [Google Scholar] [CrossRef]
Fan, M.; Liu, S.; Lu, D.; Gangrade, S.; Kao, S.C. Explainable machine learning model for multi-step forecasting of reservoir inflow with uncertainty quantification. Environ. Model. Softw. 2023, 170, 105849. [Google Scholar] [CrossRef]
Sun, N.; Zhang, S.; Peng, T.; Zhang, N.; Zhou, J.; Zhang, H. Multi-Variables-Driven Model Based on Random Forest and Gaussian Process Regression for Monthly Streamflow Forecasting. Water 2022, 14, 1828. [Google Scholar] [CrossRef]
Li, G.; Liu, Z.; Zhang, J.; Han, H.; Shu, Z. Bayesian model averaging by combining deep learning models to improve lake water level prediction. Sci. Total Environ. 2024, 906, 167718. [Google Scholar] [CrossRef]
Cui, Z.; Guo, S.; Zhou, Y.; Wang, J. Exploration of dual-attention mechanism-based deep learning for multi-step-ahead flood probabilistic forecasting. J. Hydrol. 2023, 622, 129688. [Google Scholar] [CrossRef]
Huang, Z.; Schepen, A.; Bennett, J.C.; Robertson, D.E.; Zhao, T.; Im, E.; Wang, Q.J. A Distributional Regression Network With Data Transformation for Calibrating Rainfall Forecasts. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, e2025JH000635. [Google Scholar] [CrossRef]
Uttarwar, S.B.; Lerch, S.; Avesani, D.; Majone, B. Performance assessment of neural network models for seasonal weather forecast postprocessing in the Alpine region. Adv. Water Resour. 2025, 204, 105061. [Google Scholar] [CrossRef]
Tricco, A.C.; Lillie, E.; Zarin, W.; O’Brien, K.K.; Colquhoun, H.; Levac, D.; Moher, D.; Peters, M.D.J.; Horsley, T.; Weeks, L.; et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): Checklist and Explanation. Ann. Intern. Med. 2018, 169, 467–473. [Google Scholar] [CrossRef]
Li, W.; Duan, Q.; Miao, C.; Ye, A.; Gong, W.; Di, Z. A review on statistical postprocessing methods for hydrometeorological ensemble forecasting. Wiley Interdiscip. Rev. Water 2017, 4, e1246. [Google Scholar] [CrossRef]
Liu, X.; Zhang, L.; She, D.; Chen, J.; Xia, J.; Chen, X.; Zhao, T. Postprocessing of hydrometeorological ensemble forecasts based on multisource precipitation in Ganjiang River basin, China. J. Hydrol. 2022, 605, 127323. [Google Scholar] [CrossRef]
Biondi, D.; Todini, E. Comparing Hydrological Postprocessors Including Ensemble Predictions Into Full Predictive Probability Distribution of Streamflow. Water Resour. Res. 2018, 54, 9860–9882. [Google Scholar] [CrossRef]
Panchanathan, A.; Ahrari, A.; Ghag, K.S.; Mustafa, S.; Haghighi, A.T.; Kløve, B.; Oussalah, M. An overview of approaches for reducing uncertainties in hydrological forecasting: Progress and challenges. Earth Sci. Rev. 2024, 258, 104956. [Google Scholar] [CrossRef]
Sharma, S.; Raj Ghimire, G.; Siddique, R. Machine learning for postprocessing ensemble streamflow forecasts. J. Hydroinformatics 2023, 25, 126–139. [Google Scholar] [CrossRef]
Esha, R.I.; Imteaz, M.A. Seasonal streamflow prediction using large scale climate drivers for NSW region. In Proceedings of the 22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3–8 December 2017; pp. 1593–1599. [Google Scholar]
Liu, L.; Xie, J.; Gu, H.; Xu, Y.P. Estimating the added value of GRACE total water storage and uncertainty quantification in seasonal streamflow forecasting. Hydrol. Sci. J. 2022, 67, 304–318. [Google Scholar] [CrossRef]
The National Aeronautics and Space Administration GRACE—NASA Science. Available online: https://science.nasa.gov/mission/grace (accessed on 28 September 2025).
Mo, R.; Xu, B.; Zhong, P.A.; Zhu, F.; Huang, X.; Liu, W.; Xu, S.; Wang, G.; Zhang, J. Dynamic long-term streamflow probabilistic forecasting model for a multisite system considering real-time forecast updating through spatio-temporal dependent error correction. J. Hydrol. 2021, 601, 126666. [Google Scholar] [CrossRef]
Patel, A.; Yadav, S.M. Improving the reservoir inflow prediction using TIGGE ensemble data and hydrological model for Dharoi Dam, India. Water Supply 2023, 23, 4489–4509. [Google Scholar] [CrossRef]
Yang, X.; Zhou, J.; Fang, W.; Wang, Y. An ensemble flow forecast method based on autoregressive model and hydrological uncertainty processer. Water 2020, 12, 3138. [Google Scholar] [CrossRef]
Bennett, J.C.; Wang, Q.J.; Robertson, D.E.; Schepen, A.; Li, M.; Michael, K. Assessment of an ensemble seasonal streamflow forecasting system for Australia. Hydrol. Earth Syst. Sci. 2017, 21, 6007–6030. [Google Scholar] [CrossRef]
Hapuarachchi, H.A.P.; Bari, M.A.; Kabir, A.; Hasan, M.M.; Woldemeskel, F.M.; Gamage, N.; Sunter, P.D.; Zhang, X.S.; Robertson, D.E.; Bennett, J.C.; et al. Development of a national 7-day ensemble streamflow forecasting service for Australia. Hydrol. Earth Syst. Sci. 2022, 26, 4801–4821. [Google Scholar] [CrossRef]
Bennett, J.C.; Robertson, D.E.; Wang, Q.J.; Li, M.; Perraud, J.M. Propagating reliable estimates of hydrological forecast uncertainty to many lead times. J. Hydrol. 2021, 603, 126798. [Google Scholar] [CrossRef]
Ba, H.; Guo, S.; Zhong, Y.; He, S.; Wu, X. Quantification of the forecast uncertainty using conditional probability and updating models. Hydrol. Res. 2019, 50, 1751–1771. [Google Scholar] [CrossRef]
Koutsoyiannis, D.; Montanari, A. Bluecat: A Local Uncertainty Estimator for Deterministic Simulations and Predictions. Water Resour. Res. 2022, 58, e2021WR031215. [Google Scholar] [CrossRef]
Yadav, R.; Yadav, S.M. Review on Statistical Post-processing of Ensemble Forecasts. In Innovation in Smart and Sustainable Infraestructure, ISSI 2022; Patel, D., Kim, B., Han, D., Eds.; Springer: Berlin/Heidelberg, Germany, 2024; pp. 469–476. [Google Scholar] [CrossRef]
Valdez, E.S.; Anctil, F.; Ramos, M.H. Choosing between post-processing precipitation forecasts or chaining several uncertainty quantification tools in hydrological forecasting systems. Hydrol. Earth Syst. Sci. 2022, 26, 197–220. [Google Scholar] [CrossRef]
Matthews, G.; Barnard, C.; Cloke, H.; Dance, S.L.; Jurlina, T.; Mazzetti, C.; Prudhomme, C. Evaluating the impact of post-processing medium-range ensemble streamflow forecasts from the European Flood Awareness System. Hydrol. Earth Syst. Sci. 2022, 26, 2939–2968. [Google Scholar] [CrossRef]
Rao, C.R. R.A. Fisher: The Founder of Modern Statistics. Stat. Sci. 1992, 7, 34–48. [Google Scholar] [CrossRef]
Tyralis, H.; Koutsoyiannis, D. On the prediction of persistent processes using the output of deterministic models. Hydrol. Sci. J. 2017, 62, 2083–2102. [Google Scholar] [CrossRef]
Reynolds, D. Gaussian Mixture Models. In Encyclopedia of Biometrics; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
Liang, Z.; Li, Y.; Hu, Y.; Li, B.; Wang, J. A data-driven SVR model for long-term runoff prediction and uncertainty analysis based on the Bayesian framework. Theor. Appl. Clim. 2018, 133, 137–149. [Google Scholar] [CrossRef]
Li, W.; Zhou, J.; Sun, H.; Feng, K.; Zhang, H.; Tayyab, M. Impact of Distribution Type in Bayes Probability Flood Forecasting. Water Resour. Manag. 2017, 31, 961–977. [Google Scholar] [CrossRef]
Tavare, S.; Balding, D.J.; Griffiths, R.C.; Donneuyst, P. Inferring Coalescence Times From DNA Sequence Data. Genetics 1997, 145, 505–518. [Google Scholar] [CrossRef]
Xiang, Y.; Peng, T.; Gao, Q.; Shen, T.; Qi, H. Evaluation of TIGGE Precipitation Forecast and Its Applicability in Streamflow Predictions over a Mountain River Basin, China. Water 2022, 14, 2432. [Google Scholar] [CrossRef]
Xiang, Y.; Liu, Y.; Zou, X.; Peng, T.; Yin, Z.; Ren, Y. Post-Processing Ensemble Precipitation Forecasts and Their Applications in Summer Streamflow Prediction over a Mountain River Basin. Atmosphere 2023, 14, 1645. [Google Scholar] [CrossRef]
Li, X.Q.; Chen, J.; Xu, C.Y.; Li, L.; Chen, H. Performance of Post-Processed Methods in Hydrological Predictions Evaluated by Deterministic and Probabilistic Criteria. Water Resour. Manag. 2019, 33, 3289–3302. [Google Scholar] [CrossRef]
Cai, C.; Wang, J.; Li, Z.; Shen, X.; Wen, J.; Wang, H.; Wu, C. A New Hybrid Framework for Error Correction and Uncertainty Analysis of Precipitation Forecasts with Combined Postprocessors. Water 2022, 14, 3072. [Google Scholar] [CrossRef]
Cai, C.; Wang, J.; Li, Z. Assessment and modelling of uncertainty in precipitation forecasts from TIGGE using fuzzy probability and Bayesian theory. J. Hydrol. 2019, 577, 123995. [Google Scholar] [CrossRef]
Jha, S.K.; Shrestha, D.L.; Stadnyk, T.A.; Coulibaly, P. Evaluation of ensemble precipitation forecasts generated through post-processing in a Canadian catchment. Hydrol. Earth Syst. Sci. 2018, 22, 1957–1969. [Google Scholar] [CrossRef]
Huang, H.; Liang, Z.; Li, B.; Wang, D.; Hu, Y.; Li, Y. Combination of Multiple Data-Driven Models for Long-Term Monthly Runoff Predictions Based on Bayesian Model Averaging. Water Resour. Manag. 2019, 33, 3321–3338. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Studley, D. Algebra of Neural Nets. Math. Mag. 1949, 22, 125–128. [Google Scholar] [CrossRef]
Hearst, M.A.; Dumais, S.T.; Osuna, E.; Platt, J.; Scholkopf, B. Support vector machines. IEEE Intell. Syst. Their Appl. 1998, 13, 18–28. [Google Scholar] [CrossRef]
Liu, Z.; Cheng, L.; Lin, K.; Cai, H. A hybrid bayesian vine model for water level prediction. Environ. Model. Softw. 2021, 142, 105075. [Google Scholar] [CrossRef]
Cho, K.; van Merrienboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv 2014, arXiv:1406.1078. [Google Scholar] [CrossRef]
Bai, S.; Kolter, J.Z.; Koltun, V. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar] [CrossRef]
Darbandsari, P.; Coulibaly, P. Introducing entropy-based Bayesian model averaging for streamflow forecast. J. Hydrol. 2020, 591, 125577. [Google Scholar] [CrossRef]
Darbandsari, P.; Coulibaly, P. Assessing Entropy-Based Bayesian Model Averaging Method for Probabilistic Precipitation Forecasting. J. Hydrometeorol. 2022, 23, 421–440. [Google Scholar] [CrossRef]
Bellier, J.; Bontron, G.; Zin, I. Selecting components in a probabilistic hydrological forecasting chain: The benefits of an integrated evaluation. LHB Hydrosci. J. 2021, 107, 1938352. [Google Scholar] [CrossRef]
Darbandsari, P.; Coulibaly, P. Inter-comparison of different bayesian model averaging modifications in streamflow simulation. Water 2019, 11, 1707. [Google Scholar] [CrossRef]
Li, B.; He, Y.; Ren, L. Multisource hydrologic modeling uncertainty analysis using the IBUNE framework in a humid catchment. Stoch. Environ. Res. Risk Assess. 2018, 32, 37–50. [Google Scholar] [CrossRef]
Shu, Z.; Zhang, J.; Wang, L.; Jin, J.; Cui, N.; Wang, G.; Sun, Z.; Liu, Y.; Bao, Z.; Liu, C. Evaluation of the Impact of Multi-Source Uncertainties on Meteorological and Hydrological Ensemble Forecasting. Engineering 2023, 24, 212–228. [Google Scholar] [CrossRef]
Xu, J.; Anctil, F.; Boucher, M.A. Hydrological post-processing of streamflow forecasts issued from multimodel ensemble prediction systems. J. Hydrol. 2019, 578, 124002. [Google Scholar] [CrossRef]
Zhang, J.; Chen, J.; Li, X.; Chen, H.; Xie, P.; Li, W. Combining Postprocessed Ensemble Weather Forecasts and Multiple Hydrological Models for Ensemble Streamflow Predictions. J. Hydrol. Eng. 2020, 25, 04019060. [Google Scholar] [CrossRef]
Shen, Q.; Mo, L.; Liu, G.; Wang, Y.; Zhang, Y. Interpretable probabilistic modeling method for runoff prediction: A case study in Yangtze River basin, China. J. Hydrol. Reg. Stud. 2024, 52, 101684. [Google Scholar] [CrossRef]
Jahangir, M.S.; Quilty, J. Generative deep learning for probabilistic streamflow forecasting: Conditional variational auto-encoder. J. Hydrol. 2024, 629, 130498. [Google Scholar] [CrossRef]
Hu, W.; Ghazvinian, M.; Chapman, W.E.; Sengupta, A.; Ralph, F.M.; Luca, A.; Monache, D. Deep Learning Forecast Uncertainty for Precipitation over the Western United States. Mon. Weather. Rev. 2023, 151, 1367–1385. [Google Scholar] [CrossRef]
Bogner, K.; Chang, A.Y.Y.; Bernhard, L.; Zappa, M.; Monhart, S.; Spirig, C. Tercile Forecasts for Extending the Horizon of Skillful Hydrological Predictions. J. Hydrometeorol. 2022, 23, 521–539. [Google Scholar] [CrossRef]
Siqueira, V.A.; Weerts, A.; Klein, B.; Fan, F.M.; de Paiva, R.C.D.; Collischonn, W. Postprocessing continental-scale, medium-range ensemble streamflow forecasts in South America using Ensemble Model Output Statistics and Ensemble Copula Coupling. J. Hydrol. 2021, 600, 126520. [Google Scholar] [CrossRef]
McInerney, D.; Thyer, M.; Kavetski, D.; Laugesen, R.; Woldemeskel, F.; Tuteja, N.; Kuczera, G. Improving sub-seasonal streamflow forecasts across flow regimes. In Proceedings of the 24th International Congress on Modelling and Simulation (Invited Paper), Sydney, Australia, 5–10 December 2021; pp. 616–622. [Google Scholar]
He, S.; Guo, S.; Liu, Z.; Yin, J.; Chen, K.; Wu, X. Uncertainty analysis of hydrological multi-model ensembles based on CBP-BMA method. Hydrol. Res. 2018, 49, 1636–1651. [Google Scholar] [CrossRef]
Wang, S.; Gong, J.; Gao, H.; Liu, W.; Feng, Z. Gaussian Process Regression and Cooperation Search Algorithm for Forecasting Nonstationary Runoff Time Series. Water 2023, 15, 2111. [Google Scholar] [CrossRef]
Kopsiaftis, G.; Protopapadakis, E.; Voulodimos, A.; Doulamis, N.; Mantoglou, A. Gaussian Process Regression Tuned by Bayesian Optimization for Seawater Intrusion Prediction. Comput. Intell. Neurosci. 2019, 2019, 2859429. [Google Scholar] [CrossRef]
Ghasemi, P.; Karbasi, M.; Zamani Nouri, A.; Sarai Tabrizi, M.; Azamathulla, H.M. Application of Gaussian process regression to forecast multi-step ahead SPEI drought index. Alex. Eng. J. 2021, 60, 5375–5392. [Google Scholar] [CrossRef]
Nelsen, R.B. An Introduction to Copulas, 2nd ed.; Bickel, P., Diggle, P., Fienberg, S., Gather, U., Olkin, I., Zeger, S., Eds.; Springer: New York, NY, USA, 2006; ISBN 978-0387-28659-4. [Google Scholar]
Boldea, O.; Magnus, J.R. Maximum likelihood estimation of the multivariate normal mixture model. J. Am. Stat. Assoc. 2009, 104, 1539–1549. [Google Scholar] [CrossRef]
McLachlan, G.J.; Lee, S.X.; Rathnayake, S.I. Annual Review of Statistics and Its Application Finite Mixture Models. Annu. Rev. Stat. Appl. 2024, 6, 355–378. [Google Scholar] [CrossRef]
Evensen, G. Data Assimilation; Springer: Berlin/Heidelberg, Germany, 2007; ISBN 978-3-540-38300-0. [Google Scholar]
Huang, C.; Newman, A.J.; Clark, M.P.; Wood, A.W.; Zheng, X. Evaluation of snow data assimilation using the ensemble Kalman filter for seasonal streamflow prediction in the western United States. Hydrol. Earth Syst. Sci. 2017, 21, 635–650. [Google Scholar] [CrossRef]
van Ravenzwaaij, D.; Cassey, P.; Brown, S.D. A simple introduction to Markov Chain Monte–Carlo sampling. Psychon. Bull. Rev. 2018, 25, 143–154. [Google Scholar] [CrossRef]
Marjoram, P.; Molitor, J.; Plagnol, V.; Tavaré, S. Markov chain Monte Carlo without likelihoods. Proc. Natl. Acad. Sci. USA 2003, 100, 15324–15328. [Google Scholar] [CrossRef]
Metropolis, N.; Ulam, S. The Monte Carlo Method. J. Am. Stat. Assoc. 1949, 44, 335–341. [Google Scholar] [CrossRef]
Ulam, S.M. Los Alamos National Laboratory Stanislaw Ulam 1909–1984; Los Alamos National Laboratory, Ed.; Los Alamos science; Los Alamos National Laboratory: Los Alamos, NM, USA, 1987. [Google Scholar]
Billingsley, P. Statistical Methods in Markov Chains. Ann. Math. Stat. 1961, 32, 12–40. [Google Scholar] [CrossRef]
Vrugt, J.A.; ter Braak, C.J.F.; Diks, C.G.H.; Schoups, G. Hydrologic data assimilation using particle Markov chain Monte Carlo simulation: Theory, concepts and applications. Adv. Water Resour. 2013, 51, 457–478. [Google Scholar] [CrossRef]
Woldemeskel, F.; McInerney, D.; Lerat, J.; Thyer, M.; Kavetski, D.; Shin, D.; Tuteja, N.; Kuczera, G. Evaluating post-processing approaches for monthly and seasonal streamflow forecasts. Hydrol. Earth Syst. Sci. 2018, 22, 6257–6278. [Google Scholar] [CrossRef]
Perrin, C.; Michel, C.; Andréassian, V. Improvement of a parsimonious model for streamflow simulation. J. Hydrol. 2003, 279, 275–289. [Google Scholar] [CrossRef]
Hernández-López, M.R.; Francés, F. Bayesian joint inference of hydrological and generalized error models with the enforcement of Total Laws. Hydrol. Earth Syst. Sci. Discuss. 2017, 1–40, preprint. [Google Scholar] [CrossRef]
Zhang, Z.; Zhang, Q.; Singh, V.P.; Shi, P. River flow modelling: Comparison of performance and evaluation of uncertainty using data-driven models and conceptual hydrological model. Stoch. Environ. Res. Risk Assess. 2018, 32, 2667–2682. [Google Scholar] [CrossRef]
Onyutha, C. Randomized block quasi-Monte Carlo sampling for generalized likelihood uncertainty estimation. Hydrol. Res. 2024, 55, 319–335. [Google Scholar] [CrossRef]
Waheed, S.Q.; Alobaidy, M.N.; Grigg, N.S. Forcing Data Organization for the Lesser Zab River Basin in Iraq to Build a Coherent Hydrological Model. J. Hydrol. Eng. 2022, 27, 05022019. [Google Scholar] [CrossRef]
Li, B.; Liang, Z.; He, Y.; Hu, L.; Zhao, W.; Acharya, K. Comparison of parameter uncertainty analysis techniques for a TOPMODEL application. Stoch. Environ. Res. Risk Assess. 2017, 31, 1045–1059. [Google Scholar] [CrossRef]
Tongal, H.; Booij, M.J. Quantification of parametric uncertainty of ANN models with GLUE method for different streamflow dynamics. Stoch. Environ. Res. Risk Assess. 2017, 31, 993–1010. [Google Scholar] [CrossRef]
Hamman, J.J.; Nijssen, B.; Bohn, T.J.; Gergel, D.R.; Mao, Y. The variable infiltration capacity model version 5 (VIC-5): Infrastructure improvements for new applications and reproducibility. Geosci. Model Dev. 2018, 11, 3481–3496. [Google Scholar] [CrossRef]
Liang, X.; Lettenmaier, D.P.; Wood, E.F.; Burges, S.J. A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J. Geophys. Res. 1994, 99, 415–429. [Google Scholar] [CrossRef]
Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 2009, 377, 80–91. [Google Scholar] [CrossRef]
Huang, Z.; Zhao, T.; Xu, W.; Cai, H.; Wang, J.; Zhang, Y.; Liu, Z.; Tian, Y.; Yan, D.; Chen, X. A seven-parameter Bernoulli-Gamma-Gaussian model to calibrate subseasonal to seasonal precipitation forecasts. J. Hydrol. 2022, 610, 127896. [Google Scholar] [CrossRef]
Hamitouche, M.; Molina, J.L. A Review of AI Methods for the Prediction of High-Flow Extremal Hydrology. Water Resour. Manag. 2022, 36, 3859–3876. [Google Scholar] [CrossRef]
Roushangar, K.; Ghasempour, R.; Alizadeh, F. Uncertainty Assessment of the Integrated Hybrid Data Processing Techniques for Short to Long Term Drought Forecasting in Different Climate Regions. Water Resour. Manag. 2022, 36, 273–296. [Google Scholar] [CrossRef]
Tennant, C.; Larsen, L.; Bellugi, D.; Moges, E.; Zhang, L.; Ma, H. The Utility of Information Flow in Formulating Discharge Forecast Models: A Case Study From an Arid Snow-Dominated Catchment. Water Resour. Res. 2020, 56, e2019WR024908. [Google Scholar] [CrossRef]
Ren, W.W.; Yang, T.; Huang, C.S.; Xu, C.Y.; Shao, Q.X. Improving monthly streamflow prediction in alpine regions: Integrating HBV model with Bayesian neural network. Stoch. Environ. Res. Risk Assess. 2018, 32, 3381–3396. [Google Scholar] [CrossRef]
Bergström, S. Utveckling och tillämpning av en digital avrinningsmodell. Hydrol. Byran 1972, 22, 1–28. [Google Scholar]
Quilty, J.; Jahangir, M.S.; You, J.; Hughes, H.; Hah, D.; Tzoganakis, I. Bayesian extreme learning machines for hydrological prediction uncertainty. J. Hydrol. 2023, 626, 130138. [Google Scholar] [CrossRef]
Schreiber, T. Measuring Information Transfer. Phys. Rev. Lett. 2000, 85, 461–464. [Google Scholar] [CrossRef]
Abbaspour, K.C.; Yang, J.; Maximov, I.; Siber, R.; Bogner, K.; Mieleitner, J.; Zobrist, J.; Srinivasan, R. Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT. J. Hydrol. 2007, 333, 413–430. [Google Scholar] [CrossRef]
De León Pérez, D.; Domínguez, E. Determinación de áreas hidroclimáticamente homogéneas. Una propuesta técnica. Ing. del Agua 2021, 25, 97. [Google Scholar] [CrossRef]
Zhu, S.; Luo, X.; Xu, Z.; Ye, L. Seasonal streamflow forecasts using mixture-kernel GPR and advanced methods of input variable selection. Hydrol. Res. 2019, 50, 200–214. [Google Scholar] [CrossRef]
Ngoc Tran, V.; Ivanov, V.Y.; Tien Nguyen, G.; Ngoc Anh, T.; Huy Nguyen, P.; Kim, D.H.; Kim, J. A deep learning modeling framework with uncertainty quantification for inflow-outflow predictions for cascade reservoirs. J. Hydrol. 2024, 629, 130608. [Google Scholar] [CrossRef]
Tanhapour, M.; Soltani, J.; Malekmohammadi, B.; Hlavcova, K.; Kohnova, S.; Petrakova, Z.; Lotfi, S. Forecasting the Ensemble Hydrograph of the Reservoir Inflow based on Post-Processed TIGGE Precipitation Forecasts in a Coupled Atmospheric-Hydrological System. Water 2023, 15, 887. [Google Scholar] [CrossRef]
Liu, Z.; Li, Q.; Zhou, J.; Jiao, W.; Wang, X. Runoff Prediction Using a Novel Hybrid ANFIS Model Based on Variable Screening. Water Resour. Manag. 2021, 35, 2921–2940. [Google Scholar] [CrossRef]
Wu, X.; Lu, G.; Wu, Z. Remote Sensing Technology in the Construction of Digital Twin Basins: Applications and Prospects. Water 2023, 15, 2040. [Google Scholar] [CrossRef]
Peng, T.; Zhang, C.; Zhou, J.; Xia, X.; Xue, X. Multi-Objective Optimization for Flood Interval Prediction Based on Orthogonal Chaotic NSGA-II and Kernel Extreme Learning Machine. Water Resour. Manag. 2019, 33, 4731–4748. [Google Scholar] [CrossRef]
Nourali, M. Improved Treatment of Model Prediction Uncertainty: Estimating Rainfall using Discrete Wavelet Transform and Principal Component Analysis. Water Resour. Manag. 2023, 37, 4211–4231. [Google Scholar] [CrossRef]
Kasiviswanathan, K.S.; He, J.; Tay, J.H.; Sudheer, K.P. Enhancement of Model Reliability by Integrating Prediction Interval Optimization into Hydrogeological Modeling. Water Resour. Manag. 2019, 33, 229–243. [Google Scholar] [CrossRef]
Chu, H.; Wei, J.; Jiang, Y. Middle- and Long-Term Streamflow Forecasting and Uncertainty Analysis Using Lasso-DBN-Bootstrap Model. Water Resour. Manag. 2021, 35, 2617–2632. [Google Scholar] [CrossRef]
Kilinc, H.C.; Haznedar, B.; Katipoğlu, O.M.; Ozkan, F. A comparative study of daily streamflow forecasting using firefly, artificial bee colony, and genetic algorithm-based artificial neural network. Acta Geophys. 2024, 72, 4575–4595. [Google Scholar] [CrossRef]
Katipoğlu, O.M.; Ertugay, N.; Elshaboury, N.; Aktürk, G.; Kartal, V.; Pande, C.B. A novel metaheuristic optimization and soft computing techniques for improved hydrological drought forecasting. Phys. Chem. Earth Parts A/B/C 2024, 135, 103646. [Google Scholar] [CrossRef]
Gupta, S.; Gupta, S.K. Development of AI-based hybrid soft computing models for prediction of critical river water quality indicators. Environ. Sci. Pollut. Res. 2024, 31, 27829–27845. [Google Scholar] [CrossRef]
Riahi-Madvar, H.; Dehghani, M.; Memarzadeh, R.; Gharabaghi, B. Short to Long-Term Forecasting of River Flows by Heuristic Optimization Algorithms Hybridized with ANFIS. Water Resour. Manag. 2021, 35, 1149–1166. [Google Scholar] [CrossRef]
He, Y.; Yan, Y.; Wang, X.; Wang, C. Uncertainty Forecasting for Streamflow based on Support Vector Regression Method with Fuzzy Information Granulation. Energy Procedia 2019, 158, 6189–6194. [Google Scholar] [CrossRef]

Figure 1. Protocol framework applied to ScR. Source: De León Pérez et al. [8].

Figure 2. PRISMA-ScR Flowchart of the documents search framework.

Figure 3. Graph of reported keyword occurrences within the Scopus Database search results. The cycle size schematizes the number of occurrences, while the colors differentiate the clusters. The lines represent connections between keywords. Source: VOSviewer analysis of Scopus keywords.

Figure 4. Graph of reported keyword occurrences within the WoS Database search results. The cycle size schematizes the number of occurrences, while the colors differentiate the clusters. The lines represent connections between keywords. Source: VOSviewer analysis of WoS keywords.

Figure 5. General information pertaining to the documents that constitute the selected bibliography (Quantity). (a) Documents obtained from database filters, (b) Selected documents by stage, (c) Selected documents classified by type, (d) Documents by editorial, (e) Documents by publishing journal (66% higher), and (f) Studies developed by country.

Figure 6. Number of documents classified by year and group by the methods utilized (statistical or Machine Learning/Artificial Intelligence). The totality of the documents included several alternative methods that were present in minimal quantity (e.g., decomposition with Wavelet Transform).

Figure 7. Summary of findings/gaps and research directions with operational implications.

Table 1. Generic search equations applied to each scientific database.

Database	Generic Search Equation
Scopus	(TITLE-ABS-KEY (uncertainty AND hydro* AND forecast) AND PUBYEAR >2016) + language filters + domain filters + Conceptual layers (search refine terms)
WoS	TS = (uncertainty AND hydro* AND forecast) AND PY = (2017–2024) + language filters + domain filters + Conceptual layers (search refine terms)

Table 2. Strength of agreement. Source: Adapted from Landis et al. [20].

Kappa Statistic1	Strength of Agreement
<0.00	Poor
0.00–0.20	Slight
0.21–0.40	Fair
0.41–0.60	Moderate
0.61–0.80	Substantial
0.81–1.00	Almost Perfect

Table 3. Summary of findings/gaps and research directions with operational implications.

Finding/Gap	Research Dir.	Operational Imp.	References *
Incomplete propagation of forcing uncertainty (precipitation) to flow. Sub/Over-dispersed and biased raw ensembles in events.	* Explicitly coupling meteorological-hydrological ensembles with probabilistic postprocessing	Better calibrated prediction bands and better control of false alarms	[6,55,58,59]
	* Estimate predictive densities conditional on horizon and report CRPS, calibration, and coverage by time frame		[6,55,58,59]
Univariate post-processing by time frame ignores temporal correlations between sub-horizons and multivariate incoherences	* Extending hydrological postprocessing to multivariate/horizon-dependent approaches	Consistent exceedance probabilities over the entire forecast window and simultaneous improvement at operational thresholds	[6,37,55,65,78,79]
	* Compare multi and univariate approaches, maintaining time dependence in the predictive distribution		[6,37,55,65,78,79]
Heterogeneity of probabilistic metrics and protocols; limited comparability between studies.	* Establish a minimum battery of probabilistic metrics (CRPS, CRPSS, PICP, BS, BSS, R-Factor…) and reproducible spatio-temporal validation by basin and horizon.	Transparent comparison of methods and clear criteria for operational adoption.	[78,80]
Poor validation in tails/ends and composite events; skill degradation at high percentiles.	* Tailor-made evaluation designs (q95-q99), multiple thresholds, and Threat Score	More reliable flood/drought alerts, reduction in false alarms at peaks.	[6,52,53,55,65,79,81]
	* Use of truncated families and/or copulas for asymmetries and extreme dependencies		[6,52,53,55,65,79,81]
Research-operation disconnection and gap for computational cost and interpretability in AI + statistics hybrids.	* Parsimonious and explainable hybrids (e.g., multi-MCP with ML on residuals)	Robust implementation in real time with limited resources without losing calibration.	[80,82]
	* Explicit cost reporting and real-time deployment guidelines.		[80,82]

Note(s): * Some examples extracted from the Form 2 (Supplementary Material S4) to guide the reader.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

De León Pérez, D.; Salazar-Galán, S.; Francés, F. Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction. Water 2025, 17, 2932. https://doi.org/10.3390/w17202932

AMA Style

De León Pérez D, Salazar-Galán S, Francés F. Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction. Water. 2025; 17(20):2932. https://doi.org/10.3390/w17202932

Chicago/Turabian Style

De León Pérez, David, Sergio Salazar-Galán, and Félix Francés. 2025. "Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction" Water 17, no. 20: 2932. https://doi.org/10.3390/w17202932

APA Style

De León Pérez, D., Salazar-Galán, S., & Francés, F. (2025). Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction. Water, 17(20), 2932. https://doi.org/10.3390/w17202932

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Beyond Deterministic Forecasts: A Scoping Review of Probabilistic Uncertainty Quantification in Short-to-Seasonal Hydrological Prediction

Abstract

1. Introduction

2. Methodology

2.1. Literature Search Strategy

2.2. Inclusion/Exclusion Criteria

2.2.1. Inclusion Criteria

2.2.2. Exclusion Criteria

2.3. Documents Referenced by Colleagues or Other Researchers

2.4. Document Selection

3. Results

3.1. Referent Methodologies Prior to 2017

3.1.1. Bayesian Forecasting System

3.1.2. Bayesian Model Averaging

3.1.3. Model Conditional Processor

3.1.4. Generalized Likelihood Uncertainty Estimation

3.2. Selected Bibliography from Search Strategies

3.3. Prevalent Methodologies Found

4. Discussion

4.1. Statistical Methods

4.2. AI-Driven Approaches

4.3. AI-Driven Plus Statistical Frameworks

4.4. Final Remarks

4.5. Limitations

4.6. On Future Research Directions

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI