Hybrid Streamflow Forecasting with ERA5 and Machine Learning Across Daily and Monthly Time Scales
Abstract
1. Introduction
2. Study Area and Data
2.1. Study Area and Hydrological Targets
2.2. Streamflow Observations and Temporal Partitioning
2.3. Meteorological Forcing: Transition from BR-DWGD to ERA5
3. Methods
3.1. General Methodological Structure
3.2. Monthly ERA5-Based HHFS
3.3. Daily ERA5-Based Short-Term Forecasting Extension
4. Results
4.1. Operational Motivation and Dataset Transition (BR-DWGD → ERA5)
4.2. Deterministic Performance with ERA5 Forcing
4.3. Probabilistic Performance of GA2 Under ERA5 Forcing
4.4. Comparative Summary of the Monthly Transition
4.5. Daily Short-Term Forecast Performance (Direct Multi-Output, D+1 to D+10)
4.6. Interpretation for Operational Deployment
5. Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- de Oliveira, D.M.; Carvalho, V.S.B.; da Silva, B.C.; Reboita, M.S.; de Campos, B. Hydrological and Precipitation Extremes and Trends over the Paraíba do Sul River Basin, Brazil. Climate 2023, 11, 138. [Google Scholar] [CrossRef]
- França, G.B.; Almeida, V.A.; Senna, M.C.A.; Souza, E.P.; Silva, M.T.; Aranha, T.R.B.T.; Silva, M.S.; de Araujo, A.A.M.; de Almeida, M.V.; de Campos Velho, H.F.; et al. Integrating Regressive and Probabilistic Streamflow Forecasting via a Hybrid Hydrological Forecasting System: Application to the Paraíba do Sul River Basin. Water 2026, 18, 210. [Google Scholar] [CrossRef]
- Xavier, A.C.; Scanlon, B.R.; King, C.W.; Alves, A.I. New Improved Brazilian Daily Weather Gridded Data (1961–2020). Int. J. Climatol. 2022, 42, 8390–8404. [Google Scholar] [CrossRef]
- Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Schepers, D.; et al. The ERA5 Global Reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
- Frota, M.N.; França, G.B.; Souza, E.P.; Araujo, A.A.M.; Godoy, J.M.; Viana, L.Q. The Water Cycle of the Paraíba do Sul River Basin: A Strategic Tool for Integrated Management of the Hydroelectric System; PUC-Rio: Rio de Janeiro, Brazil, 2023. [Google Scholar]
- Nash, J.E.; Sutcliffe, J.V. River Flow Forecasting Through Conceptual Models Part I—A Discussion of Principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
- Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the Mean Squared Error and NSE Performance Criteria: Implications for Improving Hydrological Modelling. J. Hydrol. 2009, 377, 80–91. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Reboita, M.S.; Gan, M.A.; da Rocha, R.P.; Ambrizzi, T. Regimes de Precipitação na América do Sul: Uma Revisão Bibliográfica. Rev. Bras. Meteorol. 2010, 25, 185–204. [Google Scholar] [CrossRef]
- Carvalho, L.M.V.; Jones, C.; Liebmann, B. The South Atlantic Convergence Zone: Intensity, Form, Persistence, and Relationships with Intraseasonal to Interannual Activity and Extreme Rainfall. J. Clim. 2004, 17, 88–108. [Google Scholar] [CrossRef]
- Arnold, J.G.; Srinivasan, R.; Muttiah, R.S.; Williams, J.R. Large Area Hydrologic Modeling and Assessment Part I: Model Development. J. Am. Water Resour. Assoc. 1998, 34, 73–89. [Google Scholar] [CrossRef]
- Bergström, S. The HBV Model—Its Structure and Applications; SMHI Reports RH No. 4; SMHI: Norrköping, Sweden, 1992. [Google Scholar]
- Liang, X.; Lettenmaier, D.P.; Wood, E.F.; Burges, S.J. A Simple Hydrologically Based Model of Land Surface Water and Energy Fluxes for General Circulation Models. J. Geophys. Res. Atmos. 1994, 99, 14415–14428. [Google Scholar] [CrossRef]
- Collischonn, W.; Tucci, C.E.M.; Clarke, R.T.; Chou, S.C.; Guilhon, L.G.; Cataldi, M.; Allasia, D. Medium-Range Reservoir Inflow Predictions Based on Quantitative Precipitation Forecasts. J. Hydrol. 2007, 344, 112–122. [Google Scholar] [CrossRef]
- Paiva, R.C.D.; Collischonn, W.; Tucci, C.E.M. Large Scale Hydrologic and Hydrodynamic Modeling Using Limited Data and a GIS-Based Approach. J. Hydrol. 2011, 406, 170–181. [Google Scholar] [CrossRef]
- Mosavi, A.; Ozturk, P.; Chau, K.W. Flood Prediction Using Machine Learning Models: Literature Review. Water 2018, 10, 1536. [Google Scholar] [CrossRef]
- Kratzert, F.; Klotz, D.; Shalev, G.; Klambauer, G.; Hochreiter, S.; Nearing, G.S. Towards Learning Universal, Regional, and Local Hydrological Behaviours via Machine Learning Applied to Large-Sample Datasets. Hydrol. Earth Syst. Sci. 2019, 23, 5089–5110. [Google Scholar] [CrossRef]
- Frame, J.M.; Kratzert, F.; Klotz, D.; Gauch, M.; Shalev, G.; Gilon, O.; Qualls, L.M.; Gupta, H.V.; Nearing, G.S. Deep Learning Rainfall–Runoff Predictions of Extreme Events. Hydrol. Earth Syst. Sci. 2022, 26, 3377–3392. [Google Scholar] [CrossRef]
- Feng, D.; Fang, K.; Shen, C. Enhancing Streamflow Forecast and Extracting Insights Using Long-Short Term Memory Networks with Data Integration at Continental Scales. Water Resour. Res. 2020, 56, e2019WR026793. [Google Scholar] [CrossRef]
- Ng, G.-H.C.; Huang, Y.F.; Koo, C.H.; Chong, K.L.; El-Shafie, A.; Ahmed, A.N. A Review of Hybrid Deep Learning Applications for Streamflow Forecasting. J. Hydrol. 2023, 625, 130141. [Google Scholar] [CrossRef]
- Papacharalampous, G.; Tyralis, H.; Langousis, A.; Jayawardena, A.W.; Sivakumar, B.; Mamassis, N.; Montanari, A.; Koutsoyiannis, D. Probabilistic Hydrological Post-Processing at Scale: Why and How to Apply Machine-Learning Quantile Regression Algorithms. Water 2019, 11, 2126. [Google Scholar] [CrossRef]
- Zhong, Y.; Guo, S.; Xiong, F.; Liu, D.; Ba, H.; Wu, X. Probabilistic Forecasting Based on Ensemble Forecasts and EMOS Method for TGR Inflow. Front. Earth Sci. 2020, 14, 188–200. [Google Scholar] [CrossRef]
- Delgado, J.M.; Voss, S.; Bürger, G.; Vormoor, K.; Murawski, A.; Pereira, J.M.R.; Martins, E.; Vasconcelos, F.; Francke, T. Seasonal Drought Prediction for Semiarid Northeastern Brazil: Verification of Six Hydrometeorological Forecast Products. Hydrol. Earth Syst. Sci. 2018, 22, 5041–5056. [Google Scholar] [CrossRef]
- Angelopoulos, A.N.; Bates, S. A Conformal Prediction: A Gentle Introduction. Found. Trends Mach. Learn. 2023, 16, 494–591. [Google Scholar] [CrossRef]
- Taieb, S.B.; Atiya, A.F. A Bias and Variance Analysis for Multistep-Ahead Time Series Forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2016, 27, 62–76. [Google Scholar] [CrossRef]
- Ampas, H.; Refanidis, I.; Ampas, V. Hybrid Hydrological Forecasting Through a Physical Model and a Weather-Informed Transformer Model: A Case Study in Greek Watershed. Appl. Sci. 2025, 15, 6679. [Google Scholar] [CrossRef]
- Yeo, I.; Johnson, R.A. A New Family of Power Transformations to Improve Normality or Symmetry. Biometrika 2000, 87, 954–959. Available online: http://www.jstor.org/stable/2673623 (accessed on 24 May 2026). [CrossRef]









| Metric | BR-DWGD (Baseline) | ERA5 (Migration) |
|---|---|---|
| GA1 NSE | 0.77 | 0.798 |
| GA1 KGE | 0.85 | 0.878 |
| GA1 R2 | 0.77 | 0.798 |
| GA1 Pearson | 0.88 | 0.896 |
| GA1 MAE (m3/s) | 14.36 | 13.629 |
| GA1 RMSE (m3/s) | 20.24 | 18.778 |
| GA1 Bias (m3/s) | 0.84 | 1.58 |
| GA2 coverage, COVcomp | 0.881 | 0.838 |
| GA2 HIT rate, p | 0.976 | 0.979 |
| GA2 Relative width, r | 2.425 | 2.388 |
| Lead Time (Days) | NSE | KGE | Pearson r | MAE (m3/s) | RMSE (m3/s) | Bias (m3/s) | q (m3/s) | Coverage |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.881 | 0.832 | 0.944 | 9.04 | 18.42 | −1.12 | 14.8 | 0.842 |
| 2 | 0.749 | 0.737 | 0.872 | 13.92 | 26.74 | −1.46 | 26.24 | 0.863 |
| 3 | 0.634 | 0.645 | 0.802 | 17.5 | 32.29 | −1.49 | 33.86 | 0.871 |
| 4 | 0.542 | 0.582 | 0.74 | 20.28 | 36.09 | −1.34 | 40.51 | 0.872 |
| 5 | 0.481 | 0.564 | 0.696 | 22.45 | 38 | −1.22 | 42.75 | 0.872 |
| 6 | 0.457 | 0.531 | 0.677 | 23.42 | 39 | −0.73 | 45.02 | 0.871 |
| 7 | 0.44 | 0.491 | 0.665 | 23.83 | 39.99 | −0.98 | 47.83 | 0.873 |
| 8 | 0.415 | 0.467 | 0.646 | 24.4 | 41.16 | −1.4 | 48.3 | 0.865 |
| 9 | 0.401 | 0.454 | 0.635 | 24.89 | 41.88 | −1.38 | 49.38 | 0.861 |
| 10 | 0.394 | 0.448 | 0.63 | 25.12 | 42.23 | −1.76 | 49.87 | 0.866 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
França, G.B.; Almeida, V.A.d.; Senna, M.C.A.; Souza, E.P.d.; Silva, M.T.; Aranha, T.R.B.T.; Silva, M.S.d.; Araujo, A.A.M.d.; Melo, G.T.S.d.; Almeida, M.V.d.; et al. Hybrid Streamflow Forecasting with ERA5 and Machine Learning Across Daily and Monthly Time Scales. Water 2026, 18, 1337. https://doi.org/10.3390/w18111337
França GB, Almeida VAd, Senna MCA, Souza EPd, Silva MT, Aranha TRBT, Silva MSd, Araujo AAMd, Melo GTSd, Almeida MVd, et al. Hybrid Streamflow Forecasting with ERA5 and Machine Learning Across Daily and Monthly Time Scales. Water. 2026; 18(11):1337. https://doi.org/10.3390/w18111337
Chicago/Turabian StyleFrança, Gutemberg Borges, Vinícius Albuquerque de Almeida, Mônica Carneiro Alves Senna, Enio Pereira de Souza, Madson Tavares Silva, Thaís Regina Benevides Trigueiro Aranha, Maurício Soares da Silva, Afonso Augusto Magalhães de Araujo, Gabriel Titara Silva de Melo, Manoel Valdonel de Almeida, and et al. 2026. "Hybrid Streamflow Forecasting with ERA5 and Machine Learning Across Daily and Monthly Time Scales" Water 18, no. 11: 1337. https://doi.org/10.3390/w18111337
APA StyleFrança, G. B., Almeida, V. A. d., Senna, M. C. A., Souza, E. P. d., Silva, M. T., Aranha, T. R. B. T., Silva, M. S. d., Araujo, A. A. M. d., Melo, G. T. S. d., Almeida, M. V. d., Velho, H. F. C., Frota, M. N., Freitas, G. G., Anochi, J. A., Moreno Aldana, E. A., & Viana, L. Q. (2026). Hybrid Streamflow Forecasting with ERA5 and Machine Learning Across Daily and Monthly Time Scales. Water, 18(11), 1337. https://doi.org/10.3390/w18111337

