A Review of XAI Methods Applications in Forecasting Runoff and Water Level Hydrological Tasks
Abstract
1. Introduction
- planning of hydroelectric power station operating modes;
- efficient water consumption and minimization of idle discharges;
- regulation of water levels in the upper and lower pools of the reservoir.
- The systematic classification of XAI methods for hydrological forecasting by interpretability type (ante-hoc/post-hoc), level (global/local), and intended function (interpretation, control, and trust);
- Identifying methodological limitations in current XAI applications dealing with multidimensional time series and spatial data;
- Formulating directions for advancing XAI in hydrology, including metrics for interpretability assessment and semantic, engineering-oriented visualization interfaces.
2. Article Selection Criteria and Methodology
2.1. Search Sources
2.2. Keywords and Search Syntax
2.3. Inclusion and Exclusion Criteria
- contain a description of AI models applied to hydrological forecasting problems;
- apply or analyze explainability methods (XAI)—SHAP, LIME, Grad-CAM, ICE, etc.;
- contain both qualitative and quantitative verification of forecasts, e.g., root mean squared error (RMSE), mean absolute error (MAE), and interpretability metrics (if available);
- are published in peer-reviewed journals (including preprints).
- those without containing descriptions of AI models;
- those without any explicit connection with the tasks of forecasting hydrological quantities;
- those that do not use XAI or do not attempt to interpret the model.
2.4. Selection Results
- 29 use post-hoc XAI methods (SHAP, LIME, Grad-CAM, and ICE);
- 31 use ante-hoc approaches, mainly attention mechanisms;
- 23 combine both approaches.
- The most frequently used AI models include the following:
- ◦
- LSTM and GRU for working with time series;
- ◦
- CNN and Transformers (including the attention);
- ◦
- XGBoost, Random Forest, and other ensembles;
- ◦
- hybrid architectures combining wavelet transforms, filtering, and neural networks.
2.5. Limitation
3. Key Characteristics of Inflow Forecasting Task
3.1. Types of Forecasts and Forecasting Horizons
- Short-term ones refer to time periods from a few hours to a few days. They are essential for warning and managing emergency situations, such as floods and high water, allowing for rapid protective measures to be taken. These forecasts are also useful for run-of-the-river hydroelectricity regarding the compensation of energy fluctuations in national energy systems.
- Medium-term ones cover a time interval from a few weeks to a month. They are useful for optimizing the operation of irrigation and electricity production.
- Long-term ones extend over a period of months or even years. These forecasts are essential for strategic water resources planning, assessing the storage capacity of reservoirs, and for major investment decisions in hydrotechnical infrastructure.
3.2. Complexity and Multidimensionality of Input Data
- Hydrometeorological stations and posts (heterogeneous in density and frequency of observations);
- Numerical Weather Prediction (NWP) models;
- Satellite remote sensing data;
- Manual measurements and historical series (archives of hydrometeorological services and energy companies).
3.3. Multiscale Nature of the Task
- in time (from minute and hourly intervals to seasonal cycles);
- in space (from observation points to basins with an area of thousands of km2);
- in data structure (from scalar time series to graphs and spatiotemporal matrices).
3.4. Data-Related Issues
- Sparseness of the observation network (low spatial coverage of posts and stations);
- Mixed data format (combination of numerical models, retrospectives, manual measurements, and images);
- Seasonal instability of values (snow reserves, floods, evaporation, and soil freezing);
- Lack of direct observation of key parameters (soil moisture, snow cover structure).
- Indirect indicators (previous runoff as an indicator of humidity);
- Wavelet decompositions of time series;
- Methods of aggregation and reconstruction of missing values.
3.5. Implications for XAI
- Identifying critical factors in small data environments;
- Explanation of transient regimes (e.g., sudden changes in precipitation);
- Construction of confidence intervals and scenario trees for risk assessment.
- Installation of additional meteorological and hydrological posts in the territories under consideration;
4. Methods of Explainability of Artificial Intelligence
4.1. Classification of Interpretability Approaches
- (i)
- Post-hoc XAI: external or model-agnostic methods applied to an already trained predictor to explain its behavior (e.g., SHAP, LIME, ICE, anchors, counterfactuals, and Grad-CAM);
- (ii)
- Ante-hoc interpretability aids: intrinsic signals or transparent models whose parameters/structure are interpretable by design (e.g., attention mechanisms, monotonic constraints, linear/logistic models, shallow trees, and GAM-style additivity). In this paper, we use the term “XAI” strictly for post-hoc methods; attention and other intrinsic signals are reported as ante-hoc aids that may be corroborated by XAI.
4.2. Ante-Hoc Approaches
Attention Mechanisms
4.3. Post-Hoc Methods
- quantitative assessment of the joint influence of features on the model results (H-statistic);
- decomposition of a complex predictive model function into simpler components (functional decomposition);
- assessment of the influence of a feature on the results of a model when changing the order of the features used (permutation feature importance);
- using simpler and more interpretable models to predict the results of the original model (global surrogate models).
Local Interpretation Methods
- Local Interpretable Model-Agnostic Explanation;
- SHapley Additive exPlanations;
- Scoped rules (anchors);
- Individual conditional expectation curves;
- Counterfactual explanations.
4.4. Advantages and Disadvantages of XAI Methods
5. Explainability and Trust in Inflow Forecasting Problems Using AI Models
- Retraining models under data limitations;
- Leak of “future” data;
- Use of incorrect (unfair) dependencies;
- Uncontrolled data generalization.
- Determining the contribution of each input parameter to the formation of the forecast;
- Accompanying each predicted value with a confidence interval.
- relationship between the precipitation volume and slope of territories;
- relationship between the type of land use, housing density, and precipitation volumes.
6. Directions of Further Development
6.1. Developing Unified Interpretability Metrics
- Stability—stability of explanations with small input changes;
- Fidelity—the degree of correspondence between the explanation and the actual solution of the model;
- Consistency—similarity of explanations for models with the same behavior;
- Sparsity—compactness of explanation without loss of meaning.
6.2. Post-Hoc and Ante-Hoc Sharing
- Use of attention mechanisms as a primary interpretation of the feature importances for the model, followed by SHAP analysis to determine the individual contribution of each significant feature;
- Construction of surrogate models from attention-weighted input subsets;
- Interpretation of attention maps using ICE graphs or rules (anchors).
6.3. Adapting XAI to High-Dimensional Data
- identify synergistic features (joint influence);
- process graph structures and spatiotemporal data;
- interpret the outputs of models such as GCN, spatial–temporal LSTM, and transformers.
6.4. Spatial and Seasonal Stability of XAI
- XAI spatial validation methods;
- assessment of explanation transferability;
- analysis of the stability of attention/SHAP values by seasons and regions.
6.5. Physical Meaningfulness of Explanations
- development of physics-aware XAI that imposes constraints on acceptable explanations;
- integration of expert ontologies (for example, “if precipitation + snow reserve > threshold—forecast for level increase”);
- generating explanations in the form of rules or causal diagrams that are understandable to engineers.
6.6. Practical Implications
6.7. Accounting for Uncertainty and Confidence Intervals
- transmit forecasts with confidence intervals;
- assess risk based on interpretable factors;
- generate scenario trajectories that take into account the range of weather conditions and model uncertainty.
6.8. Semantic XAI Interfaces
- visualization interfaces: heat maps, scenario trees, dynamics of contributions over time;
- semantic explanations: generating texts in an engineering style, using domain terms;
- integration with SCADA/ASUE: explanation of anomalies and support for decisions on the operational horizon.
6.9. Operational Applicability
7. Conclusions
- Post-hoc methods—especially SHAP for tabular time series and Grad-CAM for spatial features—currently dominate hydrological XAI; ante-hoc signals such as attention can aid interpretation but should be corroborated by post-hoc analysis.
- When scrutinized seasonally and by regime, explanations frequently align with hydrologic reasoning (e.g., precipitation/temperature drivers, snowmelt thresholds, lagged-flow memory), and they help diagnose model failure modes and data issues.
- XAI is most informative when embedded in the modeling workflow (feature design, sanity checks, and stress tests) rather than used as an after-the-fact visualization.
- Lack of standardized protocols and shared benchmarks prevents fair cross-study comparisons of both accuracy and interpretability.
- Limited spatiotemporal validation (basin transfer, seasonal stratification, and climate-zone reporting) makes generalization uncertain.
- Physics-aware and causally consistent explanations remain rare; risks of spurious associations persist under data scarcity.
- Uncertainty-aware XAI (explaining predictive intervals and risk) is underreported; extremes and flood regimes are insufficiently probed.
- Multimodal fusion (in situ plus remote sensing) and human-in-the-loop evaluation for decision support are still immature.
- Benchmarks and protocols: curate public hydrological datasets with fixed train/validation/test splits and reporting checklists; evaluate XAI configurations on identical folds.
- Evaluation of explanations: report stability, fidelity, consistency, and sparsity alongside RMSE/MAE/NSE/KGE; analyze explanation drift across seasons and basins.
- Physics-aware XAI: encode monotonicity and energy-/mass-balance constraints; pair post-hoc methods with hybrid (theory-guided) models to enforce hydrologic plausibility.
- Spatiotemporal generalization: require basin-transfer tests, seasonal stratification, and climate-class breakdowns for explanations and accuracy.
- Uncertainty and risk: develop probabilistic XAI that attributes drivers of predictive intervals and tail risk relevant to operations (e.g., spill and exceedance probabilities).
- Decision-centric tooling: counterfactual and “what-if” analyses for reservoir operation policies; online monitoring of explanation drift in DSS/SCADA.
- Governance and deployment: document background datasets, XAI hyperparameters, and audit trails to meet traceability needs in water management and hydropower settings.
Author Contributions
Funding
Conflicts of Interest
References
- Sherman, L. Stream Flow from Rainfall by the Unit Graph Method. Eng. News Rec. 1932, 108, 501–505. [Google Scholar]
- Beven, K.J.; Kirkby, M.J. A physically based, variable contributing area model of basin hydrology/Un modèle à base physique de zone d’appel variable de l’hydrologie du bassin versant. Hydrol. Sci. Bull. 1979, 24, 43–69. [Google Scholar] [CrossRef]
- Goderniaux, P.; Brouyère, S.; Fowler, H.J.; Blenkinsop, S.; Therrien, R.; Orban, P.; Dassargues, A. Large scale surface-subsurface hydrological model to assess climate change impacts on groundwater reserves. J. Hydrol. 2009, 373, 122–138. [Google Scholar] [CrossRef]
- Gusev, E.M.; Nasonova, O.N. The simulation of heat and water exchange at the land–atmosphere interface for the boreal grassland by the land-surface model SWAP. Hydrol. Process. 2002, 16, 1893–1919. [Google Scholar] [CrossRef]
- Motovilov, Y.G.; Gottschalk, L.; Engeland, K.; Belokurov, A. ECOMAG—Regional Model of Hydrological Cycle. Application to the NOPEX Region; Institute Report Series no.105; Department of Geophysics University of Oslo: Oslo, Norway, 1999; 88p, ISBN 82-91885-04-4. ISSN 1501-6854. [Google Scholar]
- Refsggard, J.C.; Storm, B. MIKE SHE (Chapter 23). In Computer Models of Watershed Hydrology; Singh, V.P., Ed.; Water Resources Publ.: Lettleton, CO, USA, 1995. [Google Scholar]
- Rigon, R.; Bertoldi, G.; Over, T.M. GEOtop: A distributed hydrological model with coupled water and energy budgets. J. Hydrometeorol. 2006, 7, 371–388. [Google Scholar] [CrossRef]
- Wood, E.F.; Sivapalan, M.; Beven, K.J. Similarity and scale in catchment storm response. Rev. Geophys 1990, 28, 1–18. [Google Scholar] [CrossRef]
- Reggiani, P.; Schellekens, J. Modelling of hydrological responses: The representative elementary watershed as an alternative blueprint for watershed modeling. Hydr. Process 2003, 17, 3785–3789. [Google Scholar] [CrossRef]
- Barzola-Monteses, J.; Gómez-Romero, J.; Espinoza-Andaluz, M.; Fajardo, W. Time series forecasting techniques applied to hydroelectric generation systems. IJEPES 2025, 110424. [Google Scholar] [CrossRef]
- Awol, F.S.; Coulibaly, P.; Tsanis, I.; Unduche, F. Identification of hydrological models for enhanced ensemble reservoir inflow forecasting in a large complex prairie watershed. Water 2019, 11, 2201. [Google Scholar] [CrossRef]
- Li, Y.; Kek, X.Y.; Shafiee, E.; Lin, Z.; Wen, B. A review of recent hybridized machine learning methodologies for time series forecasting on water-related variables. J. Hydrol. 2025, 656, 132909. [Google Scholar] [CrossRef]
- Ibrahim, K.S.M.H.; Huang, Y.F.; Ahmed, A.N.; Koo, C.H.; El-Shafie, A. A review of the hybrid artificial intelligence and optimization modelling of hydrological streamflow forecasting. Alex. Eng. J. 2022, 61, 279–303. [Google Scholar] [CrossRef]
- Wang, W.C.; Chau, K.W.; Cheng, C.T.; Qiu, L. A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J. Hydrol. 2009, 304, 294–306. [Google Scholar] [CrossRef]
- Li, P.-H.; Kwon, H.-H.; Sun, L.; Lall, U.; Kao, J.-J. A modified support vector machine based prediction model on streamflow at the Shihmen Reservoir, Taiwan. Int. J. Climatol 2009, 30, 1256–1268. [Google Scholar] [CrossRef]
- Guo, J.; Zhou, J.; Qin, H.; Zou, Q.; Li, Q. Monthly streamflow forecasting based on improved support vector machine model. Expert Syst. Appl. 2011, 38, 13073–13081. [Google Scholar] [CrossRef]
- Castangia, M.; Grajales, L.M.M.; Aliberti, A.; Rossi, C.; Macii, A.; Macii, E.; Patti, E. Transformer neural networks for interpretable flood forecasting. Environ. Model. Softw. 2023, 160, 105581. [Google Scholar] [CrossRef]
- Kao, I.-F.; Zhou, Y.; Chang, L.-C.; Chang, F.-J. Exploring a long short-term memory based encoder-decoder framework for multi-step-ahead flood forecasting. J. Hydrol. 2020, 583, 124631. [Google Scholar] [CrossRef]
- Ni, L.; Wang, D.; Singh, V.P.; Wu, J.; Wang, Y.; Tao, Y.; Zhang, J. Streamflow and rainfall forecasting by two long short-term memory-based models. J. Hydrol. 2020, 583, 124296. [Google Scholar] [CrossRef]
- Xu, Y.; Zhao, J.; Wan, B.; Cai, J.; Wan, J. Flood forecasting method and application based on informer model. Water 2024, 16, 765. [Google Scholar] [CrossRef]
- Feng, Z.; Niu, W.; Wan, X.; Xu, B.; Zhu, F.; Chen, J. Hydrological time series forecasting via signal decomposition and twin support vector machine using cooperation search algorithm for parameter identification. J. Hydrol. 2022, 612, 128213. [Google Scholar] [CrossRef]
- Noor, F.; Haq, S.; Rakib, M.; Ahmed, T.; Jamal, Z.; Siam, Z.S.; Hasan, R.T.; Adnan, M.S.G.; Dewan, A.; Rahman, R.M. Water level forecasting using spatiotemporal attention-based long short-term memory network. Water 2022, 14, 612. [Google Scholar] [CrossRef]
- Malekpour Heydari, S.; Aris, T.N.M.; Yaakob, R.; Hamdan, H. Data-driven forecasting and modeling of runoff flow to reduce flood risk using a novel hybrid wavelet-neural network based on feature extraction. Sustainability 2021, 13, 11537. [Google Scholar] [CrossRef]
- Wang, X.; Wang, Y.; Yuan, P.; Wang, L.; Cheng, D. An adaptive daily runoff forecast model using VMD-LSTM-PSO hybrid approach. Hydrol. Sci. J. 2021, 66, 1488–1502. [Google Scholar] [CrossRef]
- Saab, S.M.; Othman, F.; Tan, C.G.; Allawi, M.F.; Sherif, M.; El-Shafie, A. Utilizing deep learning machine for inflow forecasting in two different environment regions: A case study of a tropical and semi-arid region. Appl. Water Sci. 2022, 12, 272. [Google Scholar] [CrossRef]
- Herbert, Z.C.; Asghar, Z.; Oroza, C.A. Long-term Reservoir Inflow Forecasts: Enhanced Water Supply and Inflow Volume Accuracy Using Deep Learning. J. Hydrol. 2021, 601, 126676. [Google Scholar] [CrossRef]
- Latif, S.D.; Ahmed, A.N. Streamflow Prediction Utilizing Deep Learning and Machine Learning Algorithms for Sustainable Water Supply Management. Water Resour. Manag. 2023, 37, 3227–3241. [Google Scholar] [CrossRef]
- Chen, S.; Dong, S.; Cao, Z.; Guo, J. A Compound Approach for Monthly Runoff Forecasting Based on Multiscale Analysis and Deep Network with Sequential Structure. Water 2020, 12, 2274. [Google Scholar] [CrossRef]
- Le, X.H.; Nguyen, D.H.; Jung, S.; Yeon, M.; Lee, G. Comparison of Deep Learning Techniques for River Streamflow Forecasting. IEEE Access 2021, 9, 71805–71820. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Hu, H.; Yang, K.; Yang, Z. Adaptive Reservoir Inflow Forecasting Using Variational Mode Decomposition and Long Short-Term Memory. IEEE Access 2021, 9, 119032–119048. [Google Scholar] [CrossRef]
- Lei, K.; Chang, J.; Long, R.; Wang, Y.; Zhang, H. Cascade Hydropower Station Risk Operation under the Condition of Inflow Uncertainty. Energy 2021, 244, 122666. [Google Scholar] [CrossRef]
- Zhou, F.; Wang, Z.; Chen, D.; Zhang, K. Reservoir Inflow Forecasting in Hydropower Industry: A Generative Flow-Based Approach. IEEE Trans. Ind. Inform. 2022, 19, 1196–1206. [Google Scholar] [CrossRef]
- Wang, S.; Peng, H.; Hu, Q.; Jiang, M. Analysis of Runoff Generation Driving Factors Based on Hydrological Model and Interpretable Machine Learning Method. J. Hydrol. Reg. Stud. 2022, 42, 101139. [Google Scholar] [CrossRef]
- Yang, R.; Wu, J.; Gan, G.; Guo, R.; Zhang, H. Combining Physical Hydrological Model with Explainable Machine Learning Methods to Enhance Water Balance Assessment in Glacial River Basins. Water 2024, 16, 3699. [Google Scholar] [CrossRef]
- Mushtaq, H.; Akhtar, T.; Hashmi, M.Z.U.R.; Masood, A.; Saeed, F. Hydrologic Interpretation of Machine Learning Models for 10-Daily Streamflow Simulation in Climate Sensitive Upper Indus Catchments. Theor. Appl. Climatol. 2024, 155, 5525–5542. [Google Scholar] [CrossRef]
- Xiang, X.; Guo, S.; Cui, Z.; Wang, L.; Xu, C.-Y. Improving Flood Forecast Accuracy Based on Explainable Convolutional Neural Network by Grad-CAM Method. J. Hydrol. 2024, 642, 131867. [Google Scholar] [CrossRef]
- Xu, Y.; Lin, K.; Hu, C.; Wang, S.; Wu, Q.; Zhang, J.; Xiao, M.; Luo, Y. Interpretable Machine Learning on Large Samples for Supporting Runoff Estimation in Ungauged Basins. J. Hydrol. 2024, 639, 131598. [Google Scholar] [CrossRef]
- Bai, Y.; Chen, Z.; Xie, J.; Li, C. Daily reservoir inflow forecasting using multiscale deep feature learning with hybrid models. J. Hydrol. 2016, 532, 193–206. [Google Scholar] [CrossRef]
- Wang, T.; Liu, J.; Cheng, Y.; Duan, J.; Zhao, Y.; Zhao, J.; Wang, P.; Zhai, J. Adaptive Rolling Runoff Forecasting Model: Combining Multi-Source Correlated Sequences and Extreme Value Encoding. J. Hydrol. Reg. Stud. 2025, 58, 102241. [Google Scholar] [CrossRef]
- Meydani, A.; Dehghanipour, A.; Schoups, G.; Tajrishy, M. Daily Reservoir Inflow Forecasting Using Weather Forecast Downscaling and Rainfall-Runoff Modeling: Application to Urmia Lake Basin, Iran. J. Hydrol. Reg. Stud. 2022, 44, 101228. [Google Scholar] [CrossRef]
- Chang, J.; Yan, B.; Sun, M.; Gu, D.; Zhou, X. Integrated Forecasting of Monthly Runoff Considering the Combined Effects of Teleconnection Factors. J. Hydrol. Reg. Stud. 2025, 58, 102206. [Google Scholar] [CrossRef]
- Velásquez, J.D.; Dyner, I.; Franco, C.J. Modeling the Effect of Macroclimatic Events on River Inflows in the Colombian Electricity Market. IEEE Lat. Am. Trans. 2016, 14, 4287–4292. [Google Scholar] [CrossRef]
- Maddu, R.; Pradhan, I.; Ahmadisharaf, E.; Singh, S.K.; Shaik, R. Short-Range Reservoir Inflow Forecasting Using Hydrological and Large-Scale Atmospheric Circulation Information. J. Hydrol. 2022, 612, 128153. [Google Scholar] [CrossRef]
- Manshausen, P.; Cohen, Y.; Pathak, J.; Pritchard, M.; Garg, P.; Mardani, M.; Kashinath, K.; Byrne, S.; Brenowitz, N. Generative Data Assimilation of Sparse Weather Station Observations at Kilometer Scales. J. Adv. Model. Earth Syst. (JAMES). 2024. preprint. Available online: https://www.researchgate.net/publication/381704607_Generative_Data_Assimilation_of_Sparse_Weather_Station_Observations_at_Kilometer_Scales (accessed on 7 July 2025).
- Soto, Á.M.; Cervantes, A.; Soler, M. Physics-Informed Neural Networks for High-Resolution Weather Reconstruction from Sparse Weather Stations. Open Res. Eur. 2024, 4, 99. [Google Scholar] [CrossRef]
- Ekeu-Wei, I.T.; Blackburn, G.A.; Pedruco, P. Infilling Missing Data in Hydrology: Solutions Using Satellite Radar Altimetry and Multiple Imputation for Data-Sparse Regions. Water 2018, 10, 1483. [Google Scholar] [CrossRef]
- Corbari, C.; Ravazzani, G.; Perotto, A.; Lanzingher, G.; Lombardi, G.; Quadrio, M.; Mancini, M.; Salerno, R. Weekly Monitoring and Forecasting of Hydropower Production Coupling Meteo-Hydrological Modeling with Ground and Satellite Data in the Italian Alps. Hydrology 2022, 9, 29. [Google Scholar] [CrossRef]
- Fok, H.S.; Chen, Y.; Zhou, L. Daily Runoff and Its Potential Error Sources Reconstructed Using Individual Satellite Hydrological Variables at the Basin Upstream. Front. Earth Sci. 2022, 10, 821592. [Google Scholar] [CrossRef]
- Thapa, S.; Zhao, Z.; Li, B. Snowmelt-Driven Streamflow Prediction Using Machine Learning Techniques (LSTM, NARX, GPR, and SVR). Water 2020, 12, 1734. [Google Scholar] [CrossRef]
- Anderson, S.; Radic, V. Interpreting Deep Machine Learning for Streamflow Modeling Across Glacial, Nival, and Pluvial Regimes in Southwestern Canada. Front. Water 2022, 4, 934709. [Google Scholar] [CrossRef]
- Kumar, A.; Ramsankaran, R.; Brocca, L. A simple machine learning approach to model real-time streamflow using satellite inputs: Demonstration in a data scarce catchment. J. Hydrol. 2021, 595, 126046. [Google Scholar] [CrossRef]
- Machlev, R.; Heistrene, L.; Perl, M.; Levy, K.Y.; Belikov, J.; Mannor, S.; Levron, Y. Explainable Artificial Intelligence (XAI) Techniques for Energy and Power Systems: Review, Challenges and Opportunities. Energy AI 2022, 9, 100169. [Google Scholar] [CrossRef]
- Han, D.; Liu, P.; Xie, K.; Li, H.; Xia, Q.; Cheng, Q.; Wang, Y.; Yang, Z.; Zhang, Y.; Xia, J. An Attention-Based LSTM Model for Long-Term Runoff Forecasting and Factor Recognition. Environ. Res. Lett. 2022, 18, 024004. [Google Scholar] [CrossRef]
- Toubeau, J.-F.; Bottieau, J.; Wang, Y.; Vallee, F. Interpretable Probabilistic Forecasting of Imbalances in Renewable-Dominated Electricity Systems. IEEE Trans. Sustain. Energy 2021, 13, 1267–1277. [Google Scholar] [CrossRef]
- Sheng, Z.; Wen, S.; Feng, Z.-K.; Shi, K.; Huang, T. A Novel Residual Gated Recurrent Unit Framework for Runoff Forecasting. IEEE Internet Things J. 2023, 10, 12736–12748. [Google Scholar] [CrossRef]
- Başağaoğlu, H.; Chakraborty, D.; Lago, C.D.; Gutierrez, L.; Şahinli, M.A.; Giacomoni, M.; Furl, C.; Mirchi, A.; Moriasi, D.; Şengör, S.S. A Review on Interpretable and Explainable Artificial Intelligence in Hydroclimatic Applications. Water 2022, 14, 1230. [Google Scholar] [CrossRef]
- Linardatos, P.; Kotsiantis, S.; Papastefanopolous, V. Explainable AI: A Review of Machine Learning Interpretability Methods. Entropy 2022, 23, 18. [Google Scholar] [CrossRef] [PubMed]
- Molnar, C. Interpretable Machine Learning. A Guide for Making Black Box Models Explainable. Available online: https://christophm.github.io/interpretable-ml-book/ (accessed on 18 June 2025).
- Narkhede, J. Comparative Evaluation of Post-Hoc Explainability Methods in AI: LIME, SHAP, and Grand-CAM. In Proceedings of the International Conference on Sustainable Expert Systems (ICSES-2024), Kaski, Nepal, 15–17 October 2024. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar] [CrossRef]
- Samek, W.; Wiegand, T.; Müller, K.-R. Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models. arXiv 2017. [Google Scholar] [CrossRef]
- Bedi, P.; Thukral, A.; Dhiman, S. Explainable AI in Disease Diagnosis. In Computational Intelligence Methods and Applications; Springer: Singapore, 2024; pp. 87–111. [Google Scholar] [CrossRef]
- Adadi, A.; Berrada, M. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access 2018, 6, 52138–52160. [Google Scholar] [CrossRef]
- Kotagiri, A. Demystifying Machine Learning by Unraveling Interpretability. In Advances in Systems Analysis, Software Engineering, and High Performance Computing Book Series; IGI Global Scientific Publishing: Hershey, PA, USA, 2024; pp. 145–156. [Google Scholar]
- Gabbay, F.; Bar-Lev, S.; Montano, O.; Hadad, N. A LIME-Based Explainable Machine Learning Model for Predicting the Severity Level of COVID-19 Diagnosed Patients. Appl. Sci. 2021, 11, 10417. [Google Scholar] [CrossRef]
- Anshuka, A.; Chandra, R.; Buzacott, A.J.V. Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model. Stoch. Environ. Res. Risk Assess. 2022, 36, 3467–3485. [Google Scholar] [CrossRef]
- Mendes, J.; Maia, R. Evaluation of Ensemble Inflow Forecasts for Reservoir Management in Flood Situations. Hydrology 2023, 10, 28. [Google Scholar] [CrossRef]
- Kratzert, F.; Klotz, D.; Shalev, G. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrol. Earth Syst. Sci. 2019, 23, 5089–5110. [Google Scholar] [CrossRef]
- Chen, Y.; Zhang, Z.; Zhong, W.; Pan, Y.; Zheng, Y. A Multi-Step Water Level Prediction Model Based On CNN-LSTM-ATTENTION Combined With Wavelet Transform. In Proceedings of the 4th International Conference on Neural Networks, Information and Communication Engineering (NNICE), Guangzhou, China, 19–21 January 2024; pp. 992–996. [Google Scholar] [CrossRef]
- Huang, F.; Chen, P.; Yi, J.; Yang, J. A multi-Task Water Level Prediction Method Based on Attention Mechanism and LSTM. In Proceedings of the 6th International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China, 26–29 May 2023; pp. 639–643. [Google Scholar] [CrossRef]
- Lu, J.; Xie, Z.; Chen, J.; Li, M.; Xu, C.; Cao, H. GC-SALM: Multi-Task Runoff Prediction Using Spatial-Temporal Attention Graph Convolution Networks. In Proceedings of the 2023 IEEE International Conference on Systems, Man and Cybernetics (SMC), Honolulu, HI, USA, 1–4 October 2023; pp. 3633–3638. [Google Scholar] [CrossRef]
- Feng, J.; Sha, H.; Ding, Y.; Yan, L.; Yu, Z. Graph Convolution Based Spatial-Temporal Attention LSTM Model for Flood Forecasting. In Proceedings of the 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy, 18–23 July 2022; pp. 1–8. [Google Scholar] [CrossRef]
- Sheng, Z.; Cao, Y.; Yang, Y.; Feng, Z.-K.; Shi, K.; Huang, T.; Wen, S. Residual Temporal Convolutional Network with Dual Attention Mechanism for Multilead-Time Interpretable Runoff Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2024, 36, 8757–8771. [Google Scholar] [CrossRef]
- Yang, S.; Lian, H.; Soltanian, M.R.; Xu, B.; Liu, W.; Thanh, H.V.; Li, Y.; Yin, H.; Dai, Z. Hybrid Approach for Early Warning of Mine Water: Energy Density-Based Identification of Water-Conducting Channels Combined with Water Inflow Prediction by SA-LSTM. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5911312. [Google Scholar] [CrossRef]
- Li, Z.; Gao, T.; Guo, C.; Li, H.-A. A Gated Recurrent Unit Network Model for Predicting Open Channel Flow in Coal Mines Based on Attention Mechanisms. IEEE Access 2020, 8, 119819–119828. [Google Scholar] [CrossRef]
- Chen, S.; Dong, S. A Sequential Structure for Water Inflow Forecasting in Coal Mines Integrating Feature Selection and Multi-Objective Optimization. IEEE Access 2020, 8, 183619–183632. [Google Scholar] [CrossRef]
- Han, C.; Guo, Z.; Sun, X.; Zhang, Y. Dynamic Forecasting and Operation Mechanism of Reservoir Considering Multi-Time Scales. Water 2023, 15, 2472. [Google Scholar] [CrossRef]
- Weekaew, J.; Ditthakit, P.; Kittiphattanabawon, N.; Pham, Q.B. Quartile Regression and Ensemble Models for Extreme Events of Multi-Time Step-Ahead Monthly Reservoir Inflow Forecasting. Water 2024, 16, 3388. [Google Scholar] [CrossRef]
- Ding, Y.; Yin, S.; Dai, Z.; Lian, H.; Bu, C. Multi-Factor Prediction of Water Inflow from the Working Face Based on an Improved SSA-RG-MHA Model. Water 2024, 16, 3390. [Google Scholar] [CrossRef]
- Stefenon, S.F.; Seman, L.O.; Aquino, L.S.; Coelho, L.D.S. Wavelet-Seq2Seq-LSTM with Attention for Time Series Forecasting of Level of Dams in Hydroelectric Power Plants. Energy 2023, 274, 127350. [Google Scholar] [CrossRef]
- Kajári, B.; Tobak, Z.; Túri, N.; Bozán, C.; Van Leeuwen, B. Prediction of Inland Excess Water Inundations Using Machine Learning Algorithms. Water 2024, 16, 1267. [Google Scholar] [CrossRef]
- Núñez, J.; Cortés, C.B.; Yáñez, M.A. Explainable Artificial Intelligence in Hydrology: Interpreting Black-Box Snowmelt-Driven Streamflow Predictions in an Arid Andean Basin of North-Central Chile. Water 2023, 15, 3369. [Google Scholar] [CrossRef]
- Amnuaylojaroen, T.; Ptak, M.; Sojka, M. Assessment of the Impact of Meteorological Variables on Lake Water Temperature Using the SHapley Additive ExPlanations Method. Water 2024, 16, 3296. [Google Scholar] [CrossRef]
Ref. | Authors | Title | Explainable Approach |
---|---|---|---|
[31] | Hu H., Yang K., Yang Z. | Adaptive Reservoir Inflow Forecasting Using Variational Mode Decomposition and Long Short-Term Memory | Amplitude–frequency decomposition based on attention coefficients |
[32] | Lei K. Chang, J., Long R., Wang Y., Zhang H. | Cascade Hydropower Station Risk Operation under the Condition of Inflow Uncertainty | Confidence intervals of the forecast based on the method of constructing scenario trees |
[33] | Zhou F., Wang Z., Chen D., Zhang K. | Reservoir Inflow Forecasting in Hydropower Industry: A Generative Flow-Based Approach | Confidence intervals with probability distribution of attention coefficients |
[34] | Wang S., Peng H., Hu Q., Jiang M. | Analysis of Runoff Generation Driving Factors Based on Hydrological Model and Interpretable Machine Learning Method | SHAP values for the XGBoost model |
[35] | Yang R., Wu J., Gan G., Guo R., Zhang H. | Combining Physical Hydrological Model with Explainable Machine Learning Methods to Enhance Water Balance Assessment in Glacial River Basins | SHAP values for the hybrid physically supported model |
[36] | Mushtaq H., Akhtar T., Hashmi M.Z.U.R., Masood A., Saeed F. | Hydrologic Interpretation of Machine Learning Models for 10-Daily Streamflow Simulation in Climate Sensitive Upper Indus Catchments | SHAP values for detailing the contribution of features to results |
[37] | Xiang X., Guo S., Cui Z., Wang L., Xu C.-Y. | Improving Flood Forecast Accuracy Based on Explainable Convolutional Neural Network by Grad-CAM Method | Grad-CAM for visualizing the importance of CNN inputs |
[38] | Xu Y., Lin K., Hu C., Wang S., Wu Q., Zhang J., Xiao M., Luo Y. | Interpretable Machine Learning on Large Samples for Supporting Runoff Estimation in Ungauged Basins | Heatmaps based on SHAP values for the XGBoost model |
Reference | Target Variable | Input Parameters |
---|---|---|
[39] | Inflow into the reservoir (1–7 days) | Historical inflow values; decomposed time series components |
[40] | Inflow to the reservoir (online) | Historical inflow values, precipitation, and coded inflow extremes |
[41] | Inflow into the reservoir (1 day) | Precipitation and temperature forecast data from The European Centre for Medium-Range Weather Forecasts (ECMWF) and National Centers for Environmental Prediction (NCEP) models, snow storage, and soil moisture parameters |
[42] | Inflow into the reservoir (1 month) | Climate and meteorological data from ENSO and AO, solar flux data, and historical inflow values |
[43] | Inflow into the reservoir (1 month) | Historical inflow values, climate and meteorological data from ENSO, Southern Oscillation Index (SOI), and DMI |
[44] | Inflow into the reservoir (1 day) | Climate and meteorological data from ENSO, AO, and NAO; precipitation; lagged variables |
XAI Method | Type | Explanation Tool |
---|---|---|
SHAP | Post-hoc | Individual quantitative contribution of each feature to the result |
LIME | Post-hoc | Approximation of the forecast by a simple interpretable model |
ICE | Post-hoc | Feature–result curves |
Anchors | Post-hoc | The rules and conditions that determine the outcome |
Counterfactual explanations | Post-hoc | Recommendations for changing the values of attributes |
Attention mechanisms * | Ante-hoc | Feature importances based on weight coefficients |
Hydrological Task (Typical Horizon) | Typical Model Families Seen in the Corpus | Interpretability Tool Used in Practice | Typical Inputs Noted | Practitioner Takeaway (Recurring Across Studies) |
---|---|---|---|---|
Reservoir inflow (1–10 days; 2–4 weeks) | LSTM/GRU; hybrids (e.g., VMD-LSTM); ensembles; Transformers in some cases | SHAP, ICE; probabilistic XAI (scenario trees, intervals); attention as ante-hoc aid | Lagged inflow/streamflow; precipitation; temperature; snow proxies; sometimes climate indices | Precipitation and lagged flow dominate short-horizon drivers; snow/temperature thresholds matter in nival/glacial regimes; confidence intervals useful for risk-aware operation. |
Streamflow/runoff | XGBoost/Random Forest; LSTM/GRU; Transformers (emerging) | SHAP (local/global summaries); occasional Grad-CAM for spatial CNN inputs | Precipitation, evaporation; land use/slope; seasonal factors | Climate/season stratification changes driver ranks; SHAP often confirms hydrologic plausibility (e.g., slope–precipitation synergies, temperature thresholds). |
Water level | CNN; CNN-LSTM-attention; Seq2Seq with attention | Grad-CAM (spatial focus); attention (ante-hoc) | Precipitation; upstream inflow; evaporation; spatial tiles | Spatial hot-spots align with storm periods and contributing areas; combine attention with post-hoc checks for robustness. |
Method | Works Best with | Outputs | Common Pitfalls |
---|---|---|---|
SHAP | Tree ensembles; tabular time series; also used with DL via Kernel/Deep variants | Local and aggregated feature attributions; seasonal/segment analysis | Background set sensitivity; compute cost |
LIME | Local diagnosis around single predictions | Sparse local surrogate with feature weights | Kernel width sensitivity; locality mismatch |
ICE | Monotone/threshold behavior checks | Feature–response curves by instance/aggregate | One-feature-at-a-time; crowding of curves |
Grad-CAM | CNNs on spatial inputs | Saliency maps over input space | Layer/variant dependence |
Reference | XAI Method | Explanation |
---|---|---|
[70] | LSTM attention mechanism | Assignment of different weighting coefficients expressing the influence of wavelet components on the forecast result at different time intervals |
[71] | LSTM attention mechanism + binary masks for selecting input parameters depending on the season | Selection of input data to make a forecast based on the use of an attention mechanism. Additionally, the creation of binary masks for input parameters based on the importance of the contribution of features to the forecast |
[72] | Spatial LSTM attention mechanism + temporal LSTM attention mechanism | An assessment based on a graph structure represents the topological relationships between the considered watershed points and is used to assess the contribution of a data source to the forecast result A temporal LSTM attention mechanism based on the outputs of the hidden layer of the model used to assess the influence of periodicity and seasonality |
[73] | Spatial graph LSTM attention mechanism + temporal LSTM attention mechanism + DropEdge mechanism | The DropEdge mechanism used to limit overfitting and improve the generalization ability of the model. It removes part of the edges of the model graph during its training Assessment of the influence of periodicity and seasonality by means of a temporal LSTM attention mechanism based on the outputs of the hidden layer of the model |
[74] | Dual attention mechanisms (temporal and features) + heatmap visualization | A temporal convolutional network with dual attention mechanisms used to estimate the contribution of each feature and time step Heat maps used to visualize the contribution. |
[75] | Temporal LSTM attention mechanism, SHAP | Sensitivity analysis of selected “energy” features |
[76] | Temporal–spatial LSTM attention mechanism | The analysis of the contribution of data from each sensor to the predicted value expressed by a matrix of weighting coefficients |
[77] | Feature selection protocol (Relief F) + multi-objective optimization | A formed rating of the importance of geological and hydrological indicators |
[78] | Variational mode decomposition (VMD), mode extraction | To justify the forecast, the user is provided with data on the influence of parameters in the form of an amplitude-frequency decomposition |
[79] | Confidence intervals of the forecast | The method of constructing scenario trees used to determine the confidence intervals of the forecast |
[80] | Temporal LSTM attention mechanism with probability distribution construction, confidence intervals | Estimated probability distribution of the hidden layer output for different time intervals and the given forecast with confidence intervals for risk assessment. |
[81] | Rolling-feedback approach | Step-by-step adaptation of the forecast performed, taking into account previous and current inflow values |
[82] | Quartile regression, introduction of forecasting rules | The data sample divided into “normal” and “extreme” conditions, on the basis of which clear rules for explaining the forecast are formed |
[83] | Multi-head attention mechanism | Local features of the data taken into account when extracting time features of the input parameters |
[84] | Seq2Seq attention mechanism, wavelet transform | The Attention layer receives the decoder states and all the encoder states as input, then the alignment layer determines the relationship between the input and output parameters |
Limitation | Hydrological Challenge | Implementation |
---|---|---|
Stability of explanations under data perturbations (SHAP/LIME/attention). | Sparse/noisy gauges and changing input coverage | Bootstrap explanations; stratify SHAP/attention summaries by season and basin; report rank-correlation stability with small perturbations |
Fidelity of local surrogates (LIME) and post-hoc additivity assumptions (SHAP) | Nonlinear, regime-dependent responses (snowmelt vs. rain) | Tune LIME locality kernel by regime; verify SHAP additivity with ICE spot-checks on key features and seasons; flag mismatches as potential model misuse |
Consistency across models and basins | Transfer to new catchments/climates | Basin-transfer tests for explanations; compare attribution patterns for models with matched accuracy; report a consistency score across basins |
Computational cost of explanations | Real-time or near-real-time operation | Use TreeSHAP for ensembles; pre-compute attributions on seasonal prototypes; cache top-k features; employ lightweight ICE for on-the-fly diagnostics; schedule full explanations offline |
Lack of physical meaningfulness | Trustworthy operation decisions | Apply physics-aware constraints/ontologies (e.g., monotonic precipitation-to-inflow within regime) and reject explanations violating known hydrologic relations |
Explanations ignore forecast uncertainty | Risk-aware reservoir operation | Pair explanations with predictive intervals/scenario trees; attribute drivers of interval width or exceedance risk (probabilistic XAI) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bramm, A.M.; Matrenin, P.V.; Khalyasmaa, A.I. A Review of XAI Methods Applications in Forecasting Runoff and Water Level Hydrological Tasks. Mathematics 2025, 13, 2830. https://doi.org/10.3390/math13172830
Bramm AM, Matrenin PV, Khalyasmaa AI. A Review of XAI Methods Applications in Forecasting Runoff and Water Level Hydrological Tasks. Mathematics. 2025; 13(17):2830. https://doi.org/10.3390/math13172830
Chicago/Turabian StyleBramm, Andrei M., Pavel V. Matrenin, and Alexandra I. Khalyasmaa. 2025. "A Review of XAI Methods Applications in Forecasting Runoff and Water Level Hydrological Tasks" Mathematics 13, no. 17: 2830. https://doi.org/10.3390/math13172830
APA StyleBramm, A. M., Matrenin, P. V., & Khalyasmaa, A. I. (2025). A Review of XAI Methods Applications in Forecasting Runoff and Water Level Hydrological Tasks. Mathematics, 13(17), 2830. https://doi.org/10.3390/math13172830