Integrating Hydrological and Machine Learning Models for Enhanced Streamflow Forecasting via Bayesian Model Averaging in a Hydro-Dominant Power System
Abstract
:1. Introduction
- ▪
- It explores the use of dynamic weights that are updated weekly in response to new streamflow measurements (considering a rolling window of 120 days of data). Consequently, the streamflow predictions are more aligned with reality, given that static weights cannot account for the varying predictive performance of rainfall–runoff models as streamflow levels change;
- ▪
- It incorporates Occam’s Razor principle from Raftery et al. [28] and Hoeting et al. [29] to select good ensemble members from a prior multi-model ensemble. Our approach differs from [28,29] because it does not constrain the model-averaging weights to have the same value. As a consequence of Occam’s Razor principle, ensemble members with lower performance are discarded to avoid degradation of the BMA predictions, given that the joint contribution of models with higher forecasting uncertainty strongly affects the BMA results;
- ▪
- This was tested in a real large-scale hydropower system, aiming to produce a reliable forecasting engine that can be used to support operational scheduling of hydropower;
- ▪
- The individual ensemble members and the BMA predictions were assessed taking into account the spatial variability of the Brazilian large-scale hydropower system and the varying days ahead of the forecast horizon. A hypothesis test was also applied to compare the predictive performance of BMA to the ensemble members.
2. Materials and Methods
2.1. Precipitation Forecasts
2.2. Natural Streamflow
2.3. Rainfall–Runoff Models
2.3.1. MGB
2.3.2. SMAP
2.3.3. Artificial Neural Networks
2.4. Bayesian Model Averaging
2.5. Backtesting Simulation and Evaluation Metrics
3. Case Study, Results, and Discussion
3.1. An Overview of the Brazilian Hydropower System
3.2. Time Series Forecasting Considerations
3.3. Overall Performance of BMA and Ensemble Members
3.3.1. The Illustrative Case of Água Vermelha Hydropower Plant
3.3.2. Case Study Results
3.4. Spatial Predictive Performance of BMA
3.5. Weights Applied to Ensemble Members
3.6. Remarks and Discussion
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
References
- Tundisi, J.G. Water Resources in the Future: Problems and Solutions. Estud. Avançados 2008, 22, 7–16. [Google Scholar] [CrossRef]
- Cheng, M.; Fang, F.; Kinouchi, T.; Navon, I.M.; Pain, C.C. Long Lead-Time Daily and Monthly Streamflow Forecasting Using Machine Learning Methods. J. Hydrol. 2020, 590, 125376. [Google Scholar] [CrossRef]
- Schmitt Quedi, E.; Mainardi Fan, F. Sub Seasonal Streamflow Forecast Assessment at Large-Scale Basins. J. Hydrol. 2020, 584, 124635. [Google Scholar] [CrossRef]
- Sohrabi, S.; Brissette, F.P.; Arsenault, R. Coupling Large-Scale Climate Indices with a Stochastic Weather Generator to Improve Long-Term Streamflow Forecasts in a Canadian Watershed. J. Hydrol. 2020, 594, 125925. [Google Scholar] [CrossRef]
- Silva, F.d.N.R.d.; Alves, J.L.D.; Cataldi, M. Climate Downscaling over South America for 1971–2000: Application in SMAP Rainfall-Runoff Model for Grande River Basin. Clim. Dyn. 2019, 52, 681–696. [Google Scholar] [CrossRef]
- Cloke, H.L.; Pappenberger, F. Ensemble Flood Forecasting: A Review. J. Hydrol. 2009, 375, 613–626. [Google Scholar] [CrossRef]
- Moges, E.; Demissie, Y.; Larsen, L.; Yassin, F. Review: Sources of Hydrological Model Uncertainties and Advances in Their Analysis. Water 2021, 13, 28. [Google Scholar] [CrossRef]
- Draper, D. Assessment and Propagation of Model Uncertainty. J. R. Stat. Soc. 1995, 57, 45–97. [Google Scholar] [CrossRef]
- Bourdin, D.R.; Stull, R.B. Bias-Corrected Short-Range Member-to-Member Ensemble Forecasts of Reservoir Inflow. J. Hydrol. 2013, 502, 77–88. [Google Scholar] [CrossRef]
- Zahmatkesh, Z.; Karamouz, M.; Nazif, S. Uncertainty Based Modeling of Rainfall-Runoff: Combined Differential Evolution Adaptive Metropolis (DREAM) and K-Means Clustering. Adv. Water Resour. 2015, 83, 405–420. [Google Scholar] [CrossRef]
- Ajami, N.K.; Duan, Q.; Sorooshian, S. An Integrated Hydrologic Bayesian Multimodel Combination Framework: Confronting Input, Parameter, and Model Structural Uncertainty in Hydrologic Prediction. Water Resour. Res. 2007, 43, 1–19. [Google Scholar] [CrossRef]
- Stergiadi, M.; Di Marco, N.; Avesani, D.; Righetti, M.; Borga, M. Impact of Geology on Seasonal Hydrological Predictability in Alpine Regions by a Sensitivity Analysis Framework. Water 2020, 12, 2255. [Google Scholar] [CrossRef]
- Galletti, A.; Avesani, D.; Bellin, A.; Majone, B. Detailed Simulation of Storage Hydropower Systems in Large Alpine Watersheds. J. Hydrol. 2021, 603, 127125. [Google Scholar] [CrossRef]
- Smajgl, A.; Ward, J.; Pluschke, L. The Water-Food-Energy Nexus—Realising a New Paradigm. J. Hydrol. 2016, 533, 533–540. [Google Scholar] [CrossRef]
- Bahramian, K.; Nathan, R.; Western, A.W.; Ryu, D. Towards an Ensemble-Based Short-Term Flood Forecasting Using an Event-Based Flood Model-Incorporating Catchment-Average Estimates of Soil Moisture. J. Hydrol. 2021, 593, 125828. [Google Scholar] [CrossRef]
- Brito, B.O.; Salgado, R.M.; Beijo, L.A. Intelligent Modeling for Streamflow Forecasting. IEEE Lat. Am. Trans. 2016, 14, 3669–3677. [Google Scholar] [CrossRef]
- Siqueira, V.A.; Fan, F.M.; de Paiva, R.C.D.; Ramos, M.H.; Collischonn, W. Potential Skill of Continental-Scale, Medium-Range Ensemble Streamflow Forecasts for Flood Prediction in South America. J. Hydrol. 2020, 590, 125430. [Google Scholar] [CrossRef]
- Ribeiro, V.H.A.; Reynoso-Meza, G.; Siqueira, H.V. Multi-Objective Ensembles of Echo State Networks and Extreme Learning Machines for Streamflow Series Forecasting. Eng. Appl. Artif. Intell. 2020, 95, 103910. [Google Scholar] [CrossRef]
- Sabzipour, B.; Arsenault, R.; Brissette, F. Evaluation of the Potential of Using Subsets of Historical Climatological Data for Ensemble Streamflow Prediction (ESP) Forecasting. J. Hydrol. 2020, 595, 125656. [Google Scholar] [CrossRef]
- Jeong, D., Il; Kim, Y.O. Rainfall-Runoff Models Using Artificial Neural Networks for Ensemble Streamflow Prediction. Hydrol. Process. 2005, 19, 3819–3835. [Google Scholar] [CrossRef]
- Kasiviswanathan, K.S.; Cibin, R.; Sudheer, K.P.; Chaubey, I. Constructing Prediction Interval for Artificial Neural Network Rainfall-Runoff Models Based on Ensemble Simulations. J. Hydrol. 2013, 499, 275–288. [Google Scholar] [CrossRef]
- Saraiva, S.V.; Carvalho, F.d.O.; Santos, C.A.G.; Barreto, L.C.; de Macedo Machado Freire, P.K. Daily Streamflow Forecasting in Sobradinho Reservoir Using Machine Learning Models Coupled with Wavelet Transform and Bootstrapping. Appl. Soft Comput. 2021, 102, 107081. [Google Scholar] [CrossRef]
- Shamseldin, A.Y.; O’Connor, K.M.; Liang, G.C. Methods for Combining the Outputs of Different Rainfall-Runoff Models. J. Hydrol. 1997, 197, 203–229. [Google Scholar] [CrossRef]
- Kim, Y.-O.; Jeong, D.; Ko, I.H. Combining Rainfall-Runoff Model Outputs for Improving Ensemble Streamflow Prediction. J. Hydrol. Eng. 2006, 11, 578–588. [Google Scholar] [CrossRef]
- Devineni, N.; Sankarasubramanian, A.; Ghosh, S. Multimodel Ensembles of Streamflow Forecasts: Role of Predictor State in Developing Optimal Combinations. Water Resour. Res. 2008, 44, W09404. [Google Scholar] [CrossRef]
- Muhammad, A.; Stadnyk, T.A.; Unduche, F.; Coulibaly, P. Multi-Model Approaches for Improving Seasonal Ensemble Streamflow Prediction Scheme with Various Statistical Post-Processing Techniques in the Canadian Prairie Region. Water 2018, 10, 1604. [Google Scholar] [CrossRef]
- Fragoso, T.M.; Bertoli, W.; Louzada, F. Bayesian Model Averaging: A Systematic Review and Conceptual Classification. Int. Stat. Rev. 2018, 86, 1–28. [Google Scholar] [CrossRef]
- Raftery, A.E.; Madigan, D.; Hoeting, J.A. Bayesian Model Averaging for Linear Regression Models. J. Am. Stat. Assoc. 1997, 92, 179–191. [Google Scholar] [CrossRef]
- Hoeting, J.A.; Madigan, D.; Raftery, A.E.; Volinsky, C.T. Bayesian Model Averaging: A Tutorial. Stat. Sci. 1999, 14, 382–401. [Google Scholar] [CrossRef]
- He, S.; Guo, S.; Liu, Z.; Yin, J.; Chen, K.; Wu, X. Uncertainty Analysis of Hydrological Multi-Model Ensembles Based on CBP-BMA Method. Hydrol. Res. 2018, 49, 1636–1651. [Google Scholar] [CrossRef]
- Wagena, M.B.; Goering, D.; Collick, A.S.; Bock, E.; Fuka, D.R.; Buda, A.; Easton, Z.M. Comparison of Short-Term Streamflow Forecasting Using Stochastic Time Series, Neural Networks, Process-Based, and Bayesian Models. Environ. Model. Softw. 2020, 126, 104669. [Google Scholar] [CrossRef]
- Duan, Q.; Ajami, N.K.; Gao, X.; Sorooshian, S. Multi-Model Ensemble Hydrologic Prediction Using Bayesian Model Averaging. Adv. Water Resour. 2007, 30, 1371–1386. [Google Scholar] [CrossRef]
- See, L.; Openshaw, S. A Hybrid Multi-Model Approach to River Level Forecasting. Hydrol. Sci. J. 2000, 45, 523–536. [Google Scholar] [CrossRef]
- Souza Filho, F.d.A.d.; Rocha, R.V.; Estácio, Á.B.; Rolim, L.Z.R.; Pontes Filho, J.D.d.A.; Porto, V.C.; Guimarães, S.O. Enhancing Streamflow Forecasting for the Brazilian Electricity Sector: A Strategy Based on a Hyper-Multimodel. Braz. J. Water Resour. 2023, 28, e45. [Google Scholar] [CrossRef]
- Box, G.E.P.; Draper, N.R. Empirical Model-Building and Response Surfaces; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1987; Volume 37, ISBN 9780471810339. [Google Scholar]
- Kimura, R. Numerical Weather Prediction. J. Wind Eng. Ind. Aerodyn. 2002, 90, 1403–1414. [Google Scholar] [CrossRef]
- Kanamitsu, M.; Alpert, J.C.; Campana, K.A.; Caplan, P.M.; Deaven, D.G.; Iredell, M.; Katz, B.; Pan, H.-L.; Sela, J.; White, G.H. Recent Changes Implemented into the Global Forecast System at NMC. Weather. Forecast. 1991, 6, 425–435. [Google Scholar] [CrossRef]
- EMC. Global Forecast System. Available online: https://www.emc.ncep.noaa.gov/emc/pages/numerical_forecast_systems/gfs.php (accessed on 22 November 2023).
- Lien, G.Y.; Kalnay, E.; Miyoshi, T.; Huffman, G.J. Statistical Properties of Global Precipitation in the NCEP GFS Model and TMPA Observations for Data Assimilation. Mon. Weather Rev. 2016, 144, 663–679. [Google Scholar] [CrossRef]
- NCEI. Global Forecast System. Available online: https://www.ncei.noaa.gov/products/weather-climate-models/global-forecast#:~:text=The%20Global%20Forecast%20System%20(GFS,moisture%2C%20and%20atmospheric%20ozone%20concentration (accessed on 14 August 2023).
- NCEP. GFS Dataset. Available online: https://ftp.ncep.noaa.gov/data/nccf/com/gfs/prod/ (accessed on 21 May 2023).
- NCEI. Global Ensemble Forecast System. Available online: https://www.ncei.noaa.gov/products/weather-climate-models/global-ensemble-forecast (accessed on 21 September 2023).
- Zhou, X.; Zhu, Y.; Hou, D.; Luo, Y.; Peng, J.; Wobus, R. Performance of the New NCEP Global Ensemble Forecast System in a Parallel Experiment. Weather Forecast 2017, 32, 1989–2004. [Google Scholar] [CrossRef]
- NCEP. GEFS Dataset. Available online: https://ftp.ncep.noaa.gov/data/nccf/com/gens/prod/ (accessed on 21 May 2023).
- Chou, S.C. Regional ETA Model. Climanálise 1996, 1, 203–207. [Google Scholar]
- Mesinger, F. A Blocking Technique for Representation of Mountains in Atmospheric Models. Riv. Meteorol. Aeronaut. 1984, 44, 195–202. [Google Scholar]
- Moreto, V.B.; Rolim, G.d.S.; Esteves, J.T.; Vanuytrecht, E.; Chou, S.C. Sugarcane Decision-Making Support Using Eta Model Precipitation Forecasts. Meteorol. Atmos. Phys. 2021, 133, 181–191. [Google Scholar] [CrossRef]
- ONS. ETA Model Dataset. Available online: https://sintegre.ons.org.br/sites/9/38/paginas/produtos-dinamicos/meteorologia.aspx (accessed on 21 April 2023).
- ECMWF. ENS—Ensemble Forecasts. Available online: https://confluence.ecmwf.int/display/FUG/ENS+-+Ensemble+Forecasts (accessed on 19 April 2023).
- ECMWF. The ECMWF Integrated Forecasting System. Available online: https://confluence.ecmwf.int/display/FUG/2+The+ECMWF+Integrated+Forecasting+System+-+IFS (accessed on 2 April 2023).
- ONS. ENS Dataset. Available online: https://sintegre.ons.org.br/sites/9/38/paginas/produtos-dinamicos/meteorologia.aspx (accessed on 21 April 2023).
- ONS. Update of the Historical Streamflow Time Series—Period from 1931 to 2019; REL 142/2020; ONS: Rio de Janeiro, RJ, Brazil, 2020.
- ONS. Historical Time Series of Daily Natural Streamflows Dataset. Available online: https://sintegre.ons.org.br/sites/9/13/84 (accessed on 2 March 2023).
- ONS. Methodology for Reconstitution and Treatment of Natural Streamflows; NT 144/2018; ONS: Rio de Janeiro, RJ, Brazil, 2018.
- ONS. Hydraulic-Hydrological Reports. Available online: https://sintegre.ons.org.br/sites/9/13/56/paginas/servicos/produtos.aspx (accessed on 2 March 2023).
- Da Silva, B.C.; Tucci, C.E.M.; Collischonn, W. Streamflow Forecast with Hydroclimatic Models. Braz. J. Water Resour. 2006, 11, 15–29. [Google Scholar] [CrossRef]
- Eslamian, S. Handbook of Engineering Hydrology: Modeling, Climate Change, and Variability; Eslamian, S., Ed.; CRS Press: Boca Ratom, FL, USA, 2014; ISBN 9781466552463. [Google Scholar]
- Sitterson, J.; Knightes, C.; Parmar, R.; Wolfe, K.; Muche, M.; Avant, B. An Overview of Rainfall-Runoff Model Types; EPA/600/R-14/152; Environmental Protection Agency: Athens, GA, USA, 2017. [Google Scholar]
- Niu, W.; Feng, Z. Evaluating the Performances of Several Artificial Intelligence Methods in Forecasting Daily Streamflow Time Series for Sustainable Water Resources Management. Sustain. Cities Soc. 2021, 64, 102562. [Google Scholar] [CrossRef]
- De Faria, V.A.D.; de Queiroz, A.R.; Lima, L.M.; Lima, J.W.M.; da Silva, B.C. An Assessment of Multi-Layer Perceptron Networks for Streamflow Forecasting in Large-Scale Interconnected Hydrosystems. Int. J. Environ. Sci. Technol. 2021, 19, 5819–5838. [Google Scholar] [CrossRef]
- Rasmussen, P.F.; Salas, J.D.; Fagherazzi, L.; Rassam, J.-C.; Bobée, B. Estimation and Validation of Contemporaneous PARMA Models for Streamflow Simulation. Water Resour. Res. 1996, 32, 3151–3160. [Google Scholar] [CrossRef]
- Tongal, H.; Booij, M.J. Simulation and Forecasting of Streamflows Using Machine Learning Models Coupled with Base Flow Separation. J. Hydrol. 2018, 564, 266–282. [Google Scholar] [CrossRef]
- Yu, X.; Wang, Y.; Wu, L.; Chen, G.; Wang, L.; Qin, H. Comparison of Support Vector Regression and Extreme Gradient Boosting for Decomposition-Based Data-Driven 10-Day Streamflow Forecasting. J. Hydrol. 2020, 582, 124293. [Google Scholar] [CrossRef]
- Liu, Z.; Zhou, P.; Chen, G.; Guo, L. Evaluating a Coupled Discrete Wavelet Transform and Support Vector Regression for Daily and Monthly Streamflow Forecasting. J. Hydrol. 2014, 519, 2822–2831. [Google Scholar] [CrossRef]
- Lopes, J.E.G.; Braga, B.P.F.; Conejo, J.G.L. SMAP: A Simplified Hydrologic Model. In Applied Modeling in Catchment Hydrology; Singh, V.P., Ed.; Water Resources Publications: Littleton, CO, USA, 1982; pp. 167–176. ISBN 1887201963. [Google Scholar]
- Collischonn, W.; Allasia, D.; da Silva, B.C.; Tucci, C.E.M. The MGB-IPH Model for Large-Scale Rainfall-Runoff Modelling. Hydrol. Sci. J. 2007, 52, 878–895. [Google Scholar] [CrossRef]
- Collischonn, W.; Tucci, C.E.M. Hydrological Simulation of Large Basins. Braz. J. Water Resour. 2001, 6, 95–118. [Google Scholar] [CrossRef]
- Beven, K.J. 122: Rainfall-Runoff Modeling: Introduction. In Encyclopedia of Hydrological Sciences; Anderson, M.G., Ed.; John Wiley & Sons: Hoboken, NJ, USA, 2005; pp. 1–12. [Google Scholar]
- De Queiroz, A.R.; Marangon Lima, L.M.; Marangon Lima, J.W.; Da Silva, B.C.; Scianni, L.A. Climate Change Impacts in the Energy Supply of the Brazilian Hydro-Dominant Power System. Renew. Energy 2016, 99, 379–389. [Google Scholar] [CrossRef]
- De Queiroz, A.R.; Faria, V.A.D.; Lima, L.M.M.; Lima, J.W.M. Hydropower Revenues under the Threat of Climate Change in Brazil. Renew. Energy 2019, 133, 873–882. [Google Scholar] [CrossRef]
- Kuki, C.A.C.; Torres, F.L.R.; de Faria, V.A.D.; de Queiroz, A.R.; Lima, L.M.M.; Lima, J.W.M. Short-Term Streamflow Forecast Strategies: A Case Study in the Grande and Paranaíba River Basins. In Proceedings of the Anais do XXIII Congresso Brasileiro de Automática, Virtual Event, 23–26 November 2020; Sociedade Brasileira de Automática: Campinas, SP, Brazil, 2020; pp. 1–8. [Google Scholar]
- Kuki, C.A.C. Methodology for Forecasting Energy Prices Considering Streamflow Uncertainties. Master’s Thesis, Institute of Electrical Systems of Energy, Federal University of Itajubá, Itajubá, MG, Brazil, 2020. [Google Scholar]
- Da Silva, B.C.; Collischonn, W.; Tucci, C.E.M.; Clarke, R.T.; Corbo, M.D. Short-Term Hydroclimatic Streamflow Forecasting in the São Francisco River Basin. Braz. J. Water Resour. 2007, 12, 21–41. [Google Scholar] [CrossRef]
- Collischonn, W.; Gama, C.H.A.; Siqueira, V.A.; Paiva, R.C.D.; Fleischmann, A.S. MGB Reference Manual 2020; IPH: Porto Alegre, RS, Brazil, 2020. [Google Scholar]
- Kuki, C.A.C.; Torres, F.L.R.; de Faria, V.A.D.; de Queiroz, A.R.; Lima, L.M.M.; Lima, J.W.M. Streamflow Forecasting Strategy for Short-Term Electricity Price Formation. In Proceedings of the Anais do LII Simpósio Brasileiro de Pesquisa Operacional; Galoá: João Pessoa, PB, Brazil, 2020; pp. 1–12. [Google Scholar]
- Maciel, G.M.; Cabral, V.A.; Marcato, A.L.M.; Júnior, I.C.S.; Honório, L.D.M. Daily Water Flow Forecasting via Coupling between SMAP and Deep Learning. IEEE Access 2020, 8, 204660–204675. [Google Scholar] [CrossRef]
- De Paiva, L.F.G.; Montenegro, S.M.; Cataldi, M. Prediction of Monthly Flows for Três Marias Reservoir (São Francisco River Basin) Using the CFS Climate Forecast Model. Braz. J. Water Resour. 2020, 25, 1–18. [Google Scholar] [CrossRef]
- Nunes, F.M.S.; de Farias, C.A.S.; Martins, W.A.; Almeida, R.N.; Leite, J.C.A. Hydrological Modelling Using SMAP for Estimating Monthly Streamflows in Piancó River Basin. Rev. Verde De Agroecol. E Desenvolv. Sustentável 2014, 9, 289–295. [Google Scholar]
- Cavalcante, M.R.G.; da Cunha Luz Barcellos, P.; Cataldi, M. Flash Flood in the Mountainous Region of Rio de Janeiro State (Brazil) in 2011: Part I—Calibration Watershed through Hydrological SMAP Model. Nat. Hazards 2020, 102, 1117–1134. [Google Scholar] [CrossRef]
- ONS. Application of the SMAP/ONS Model to Forecast Streamflows within the Scope of the SIN; NT 97/2018-RV4; ONS: Rio de Janeiro, RJ, Brazil, 2020.
- ONS. Application of the SMAP/ONS Model to Forecast Streamflows within the Scope of the SIN; ONS 97/2018-RV7; ONS: Rio de Janeiro, RJ, Brazil, 2018.
- ONS. SMAP Data. Available online: https://sintegre.ons.org.br/sites/9/13/82/paginas/servicos/produtos.aspx (accessed on 21 April 2023).
- McCulloch, W.S.; Pitts, W.H. A Logical Calculus of the Ideas Immanent in Nervous Activity. Bull. Math. Biophys. 1943, 5, 115–133. [Google Scholar] [CrossRef]
- Rosenblatt, F. The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychol. Rev. 1958, 65, 386–408. [Google Scholar] [CrossRef]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Representations by Back-Propagating Errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Krogh, A. What Are Artificial Neural Networks? Nat. Biotechnol. 2008, 26, 195–197. [Google Scholar] [CrossRef]
- Hinne, M.; Gronau, Q.F.; van den Bergh, D.; Wagenmakers, E.-J. A Conceptual Introduction to Bayesian Model Averaging. Adv. Methods Pract. Psychol. Sci. 2020, 3, 200–215. [Google Scholar] [CrossRef]
- Wasserman, L. Bayesian Model Selection and Model Averaging. J. Math. Psychol. 2000, 44, 92–107. [Google Scholar] [CrossRef]
- Moon, T.K. The Expectation-Maximization Algorithm. IEEE Signal Process. Mag. 1996, 13, 47–60. [Google Scholar] [CrossRef]
- Myung, I.J. Tutorial on Maximum Likelihood Estimation. J. Math. Psychol. 2003, 47, 90–100. [Google Scholar] [CrossRef]
- Box, G.E.P.; Cox, D.R. An Analysis of Transformations. J. Am. Stat. Assoc. 1964, 26, 211–252. [Google Scholar] [CrossRef]
- Chen, M.; Shi, W.; Xie, P.; Silva, V.B.S.; Kousky, V.E.; Higgins, R.W.; Janowiak, J.E. Assessing Objective Techniques for Gauge-Based Analyses of Global Daily Precipitation. J. Geophys. Res. 2008, 113, 1–13. [Google Scholar] [CrossRef]
- Nash, J.E.; Sutcliffe, J.V. River Flow Forecasting through Conceptual Models Part I—A Discussion of Principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
- Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
- Moriasi, D.N.; Zeckoski, R.W.; Arnold, J.G.; Baffaut, C.B.; Malone, R.W.; Daggupati, P.; Guzman, J.A.; Saraswat, D.; Yuan, Y.; Wilson, B.W.; et al. Hydrologic and Water Quality Models: Key Calibration and Validation Topics. Trans. ASABE 2015, 58, 1609–1618. [Google Scholar] [CrossRef]
- ONS. SMAP Application: Methodology Manual; ONS: Rio de Janeiro, RJ, Brazil, 2017.
- Diebold, F.X.; Mariano, R.S. Comparing Predictive Accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263. [Google Scholar] [CrossRef]
- ONS. National Interconnected System. Available online: http://www.ons.org.br/paginas/sobre-o-sin/o-que-e-o-sin (accessed on 27 April 2023).
- ONS. The System in Numbers. Available online: http://www.ons.org.br/paginas/sobre-o-sin/o-sistema-em-numeros (accessed on 27 September 2023).
- ONS. Schematic Diagram of SIN Hydropower Plants. Available online: http://www.ons.org.br/paginas/sobre-o-sin/mapas (accessed on 27 June 2023).
- Raftery, A.E.; Gneiting, T.; Balabdaoui, F.; Polakowski, M. Using Bayesian Model Averaging to Calibrate Forecast Ensembles. Mon. Weather Rev. 2005, 133, 1155–1174. [Google Scholar] [CrossRef]
- Slingo, J.; Palmer, T. Uncertainty in Weather and Climate Prediction. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2011, 369, 4751–4767. [Google Scholar] [CrossRef] [PubMed]
- ONS. Methodology of Ensemble Precipitation Prediction and Bias Correction Using Short-Term Historical Data; ONS: Rio de Janeiro, RJ, Brzil, 2019; NT 53/2019.
- Souza, C.M.; Shimbo, J.Z.; Rosa, M.R.; Parente, L.L.; Alencar, A.A.; Rudorff, B.F.T.; Hasenack, H.; Matsumoto, M.; Ferreira, L.G.; Souza-Filho, P.W.M.; et al. Reconstructing Three Decades of Land Use and Land Cover Changes in Brazilian Biomes with Landsat Archive and Earth Engine. Remote Sens. 2020, 12, 2735. [Google Scholar] [CrossRef]
- Martins, P.R.; Sano, E.E.; Martins, E.S.; Vieira, L.C.G.; Salemi, F.; Vasconcelos, V.; Couto Júnior, A.F. Terrain Units, Land Use and Land Cover, and Gross Primary Productivity of the Largest Fluvial Basin in the Brazilian Amazonia/Cerrado Ecotone: The Araguaia River Basin. Appl. Geogr. 2021, 127, 102379. [Google Scholar] [CrossRef]
- Da Cunha, E.R.; Santos, C.A.G.; da Silva, R.M.; Panachuki, E.; de Oliveira, P.T.S.; Oliveira, N.d.S.; Falcão, K.d.S. Assessment of Current and Future Land Use/Cover Changes in Soil Erosion in the Rio Da Prata Basin (Brazil). Sci. Total Environ. 2022, 818, 151811. [Google Scholar] [CrossRef] [PubMed]
- Collischonn, W.; Haas, R.; Andreolli, I.; Tucci, C.E.M. Forecasting River Uruguay Flow Using Rainfall Forecasts from a Regional Weather-Prediction Model. J. Hydrol. 2005, 305, 87–98. [Google Scholar] [CrossRef]
- Kaufmann de Almeida, I.; Kaufmann Almeida, A.; Garcia Gabas, S.; Alves Sobrinho, T. Performance of Methods for Estimating the Time of Concentration in a Watershed of a Tropical Region. Hydrol. Sci. J. 2017, 62, 2406–2414. [Google Scholar] [CrossRef]
- Hagedorn, R.; Doblas-Reyes, F.J.; Palmer, T.N. The Rationale behind the Success of Multi-Model Ensembles in Seasonal Forecasting—I. Basic Concept. Tellus Ser. A Dyn. Meteorol. Oceanogr. 2005, 57, 219–233. [Google Scholar] [CrossRef]
- Doblas-Reyes, F.J.; Hagedorn, R.; Palmer, T.N. The Rationale behind the Success of Multi-Model Ensembles in Seasonal Forecasting—II. Calibration and Combination. Tellus Ser. A Dyn. Meteorol. Oceanogr. 2005, 57, 234–252. [Google Scholar] [CrossRef]
- Darema, F. Dynamic Data Driven Applications Systems: A New Paradigm for Application Simulations and Measurements. In Computational Science—ICCS 2004: Lecture Notes in Computer Science; Bubak, M., van Albada, G.D., Sloot, P.M.A., Dongarra, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2004; Volume 3038, pp. 662–669. [Google Scholar] [CrossRef]
Model | Details |
---|---|
GFS | The Global Forecast System (GFS) is a global numerical weather prediction system that has been operational at the National Centers for Environmental Prediction (NCEP) since 1991 [37]. The system couples individual numerical models of atmosphere, ocean, land, and sea ice to portray the weather conditions using atmospheric and land–soil variables [38], thereby establishing itself as one of the state-of-the-art numerical systems that are capable of running at various spectral resolutions [39]. At NCEP, this integrated system of models runs four times daily—at 00, 06, 12, and 18 UTC—producing deterministic scenarios with a forecast horizon of up to 16 days [40]. In the present study, we use the total precipitation data from the GFS’s 18 UTC runs [41], provided at a horizontal spatial resolution of 0.25° latitude by 0.25° longitude. |
GEFS | The Global Ensemble Forecast System (GEFS) is a global numerical weather forecasting system developed by NCEP that generates predictions up to 16 days in advance. The 11.0 version of the system was actively used for forecasting from December 2016 until September 2020. During its operational period, GEFS generated 21 different precipitation scenarios (i.e., ensemble prediction). The first one is the base scenario or the control member, and the remaining ones underline uncertainties from the input data, such as limited coverage, instruments, or observing system biases [42], by using a breeding method to introduce perturbations in the initial conditions [43]. In the present study, we use the total precipitation data from the GEFS’s 18 UTC runs [44] in a horizontal spatial resolution of 1° latitude by 1° longitude. |
ETA | The η (ETA, from Greek alphabet) is a regional numerical weather forecasting model developed by the University of Belgrade in collaboration with the Institute of Hydrometeorology of Yugoslavia [45]. The ETA model became operational at CPTEC (Center for Weather Forecasting and Climate Studies) in the 1990s and has been used since then to forecast the South American weather. The model’s name refers to a vertical coordinate η that represents the vertical structure of the atmosphere, taking the sea level as a reference [46]. Initially, the ETA model was developed only to make regional weather predictions, but it was later successfully applied in climate prediction and projection studies [47]. The operational version of the ETA model used to forecast the weather conditions of South America has a horizontal spatial resolution of 0.4° of latitude × 0.4° of longitude, as well as a forecast time horizon of 11 days ahead. Unfortunately, the daily forecasts made by the ETA model were downloaded using a link that is no longer available. The precipitation data from the ETA model are now available through the SINtegre portal [48]. |
ENS | The ensemble forecast (ENS) consists of an ensemble of possible future weather states, generated by the ECMWF Integrated Forecast System [49]. This ensemble includes one control/unperturbed member and fifty disturbance members, all of which are produced twice daily, at 00 and 18 UTC, by the IFS at the European Climate Center. The ECMWF IFS is a numerical system composed of several coupled components that model the atmosphere, ocean, wind-generated ocean waves, sea ice, land surface, and lakes to predict the evolution of the climate system [50]. In this study, we use the ENS total precipitation dataset, which provides forecasts extending 15 days ahead and has a horizontal resolution of 0.20° latitude by 0.20° longitude. Although the original product was an ensemble of predictions, we did not have access to the individual members, but to the average, through the SINtegre portal [51]. |
Metric/Test | Details |
---|---|
MAPE | The mean absolute percentage error (MAPE) (7), as outlined in [92], is a metric to measure the predictive accuracy of a time series. According to criteria established in [92], a MAPE between 0 (0%) and 0.1 (10%) indicates high accuracy, values between 0.1 (10%) and 0.2 (20%) indicate good accuracy, values between 0.2 (20%) and 0.5 (50%) indicate reasonable accuracy, and values above 0.5 (50%) indicate poor accuracy. |
NSE | The Nash–Sutcliffe efficiency coefficient (NSE) [93], (8), is a metric to evaluate the efficiency of rainfall–runoff models. It measures the ratio of the residual variance (i.e., the residual sum of squares) to the variance of the measured value dataset (i.e., the total sum of squares) subtracted from 1. NSE ranges from 1 to . The interpretation of NSE is also based on predefined intervals [94,95]. Values above 0.75 indicate that the model has very good efficiency, values between 0.65 and 0.75 indicate good efficiency, values between 0.5 and 0.65 indicate satisfactory efficiency, and values less than 0.5 indicate unsatisfactory efficiency. Additionally, NSE values below zero also suggest that the mean of measured values is a better forecast than the predictions performed by the model under evaluation. |
MD | Multi-criteria distance (MD) [96], (9), has been the metric officially used by ONS to assess streamflow forecasts since 2010. MD is defined as the Euclidean distance between two points, and , calculated as . Here, the first point represents the ideal values of MAPE and NSE (i.e., and , respectively), while the second point represents the actual values of the model being assessed ( and ). MD ranges from 0 to . The higher its value is, the worse the performance of the assessed model is. Therefore, the optimal MD value is zero. |
Diebold–Mariano test | The Diebold–Mariano hypothesis test [97] is widely used to compare the predictive accuracy of a pair of time series, A and B, using the difference between the measured data and the predictions ( and ). A loss function of absolute or quadratic error over time is computed for each time series, and , and the gap between them is known as differential loss function . In the original version of the Diebold–Mariano test (10), if the expectation of is equal to zero, (i.e., null hypothesis ), it is said that A has the same level of accuracy as B; otherwise, their accuracies differ, (i.e., alternative hypothesis ). On the other hand, in the alternative versions of the Diebold–Mariano test (11) and (12), and are changed in order to determine whether series A is more accurate than series B, or if series B is more accurate than series A, respectively. The results are evaluated through the analysis of the p-value, calculated using the statistic (13). Values of p-value lower than the significance level indicate that there is sufficient evidence to reject and accept , while values of p-value greater than indicate that there is insuficiente evidence for the rejection of and acceptance of . Original hypothesis: |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Torres, F.L.R.; Lima, L.M.M.; Reboita, M.S.; de Queiroz, A.R.; Lima, J.W.M. Integrating Hydrological and Machine Learning Models for Enhanced Streamflow Forecasting via Bayesian Model Averaging in a Hydro-Dominant Power System. Water 2024, 16, 586. https://doi.org/10.3390/w16040586
Torres FLR, Lima LMM, Reboita MS, de Queiroz AR, Lima JWM. Integrating Hydrological and Machine Learning Models for Enhanced Streamflow Forecasting via Bayesian Model Averaging in a Hydro-Dominant Power System. Water. 2024; 16(4):586. https://doi.org/10.3390/w16040586
Chicago/Turabian StyleTorres, Francisca Lanai Ribeiro, Luana Medeiros Marangon Lima, Michelle Simões Reboita, Anderson Rodrigo de Queiroz, and José Wanderley Marangon Lima. 2024. "Integrating Hydrological and Machine Learning Models for Enhanced Streamflow Forecasting via Bayesian Model Averaging in a Hydro-Dominant Power System" Water 16, no. 4: 586. https://doi.org/10.3390/w16040586
APA StyleTorres, F. L. R., Lima, L. M. M., Reboita, M. S., de Queiroz, A. R., & Lima, J. W. M. (2024). Integrating Hydrological and Machine Learning Models for Enhanced Streamflow Forecasting via Bayesian Model Averaging in a Hydro-Dominant Power System. Water, 16(4), 586. https://doi.org/10.3390/w16040586