Hybrid Decomposition Strategies and Model Combinatorial Optimization for Runoff Prediction
Abstract
1. Introduction
2. Method
2.1. Two-Layer Combinatorial Optimization Framework
- (1)
- Data normalization is performed on the raw runoff sequence to eliminate the difference in magnitude between different data features.
- (2)
- The original runoff signal is decomposed using SSA and CEEMDAN, respectively.
- (3)
- The signals are predicted without decomposition, SSA decomposition, and CEEMDAN decomposition using five prediction models, namely, LR, BP, RNN, LSTM, and GRU [5], respectively.
- (4)
- In the optimal combination layer of signal decomposition, the combination coefficients are assigned to the signals without decomposition, SSA decomposition, and CEEMDAN decomposition in each prediction model, and the formulas containing the MAPE, RRMSE, and NSE metrics are used as the fitness function. Intelligent algorithms are applied to optimize the combination coefficients to obtain the optimal combination strategy for each signal decomposition method.
- (5)
- The combination coefficients are allocated at the prediction model layer to obtain the optimal combination of the five prediction models, and MAPE, RRMSE, and NSE are utilized to evaluate the prediction effect of the model. The formula of each metric is as follows:
2.2. Runoff Data Decomposition Methods
2.2.1. SSA
- (1)
- Construct the trajectory matrix: according to the given time series x(n)(n = 1,....., N) and embedding dimension L, construct the trajectory matrix X, where the matrix element .
- (2)
- Singular-value decomposition of the trajectory matrix: X = U∑VT, where U and V are orthogonal matrices, ∑ is a diagonal matrix, and the elements on the diagonal are singular values.
- (3)
- Grouping reconstruction: according to the magnitude of the singular values or other characteristics, the left singular vector and the right singular vector corresponding to the singular values are grouped, and then different components are obtained through reconstruction.
2.2.2. CEEMDAN
- (1)
- A certain amount of white noise vi(t) is introduced into the original signal x(t) to obtain the preprocessing sequence xi(t). Then, the empirical modal decomposition of xi(t) is performed, the mean value obtained from the I experiment is taken as the component of the CEEMDAN decomposition, and then the first-order residuals r1(t) are found.
- (2)
- The noise is added to the decomposed stage j residual signal and an empirical modal decomposition is performed to find the stage j modal component.
- (3)
- Cycle steps (1) and (2); when the extreme value point is less than 2 or the number of components n is set by a human, the decomposition process ends. The IMF components and corresponding residuals that meet the requirements are obtained, at which time the original signal x(t) can be expressed as
2.3. Runoff Prediction Models
2.4. Optimization Solution Algorithm for Two-Layer Combinatorial Optimization Framework
- (1)
- Calculation of snowmelt rate
- (2)
- Dual population division
- (3)
- Exploration phase (subpopulation Pa)
- (4)
- Development phase (subpopulation Pb)
3. Case Study
3.1. Study Area and Data
3.2. Experimental Design
- (1)
- Single-model single-decomposition: 3 types of data processing (raw data, SSA decomposition, CEEMDAN decomposition) × 5 types of models (LR, BP, RNN, LSTM, GRU);
- (2)
- Single-layer optimization: optimization of the data decomposition layer only (fixed model);
- (3)
- Bi-layer optimization (the method adopted in this paper): simultaneous optimization of data layer coefficients (Wsingle, WSSA, WCEEMDAN) and model layer coefficients (WLR, WBP, WRNN, WLSTM, WGRU) using the SAO algorithm.
4. Results
4.1. Prediction Results of Single Model in Different Decomposition Modes
- (1)
- Deep models are better than shallow models. The average adaptation of LSTM and GRU is much lower than that of LR and BP, indicating that they have advantages in extracting complex temporal features.
- (2)
- Decomposition mode has a significant impact. SSA improves the prediction accuracy as a whole, especially on LSTM and GRU; CEEMDAN introduces too much noise in some models (e.g., RNN, LSTM), and the performance decreases dramatically.
- (3)
- There is a coupling effect between model and decomposition. For example, GRU performs better under CEEMDAN than without decomposition (fitness: 0.1264 vs. 0.1334), suggesting that some models are well adapted to high-frequency components.
4.2. Data Decomposition Layer Optimization Results
- (1)
- SSA components generally gained higher weights in each model, reflecting their greater adaptability to Han River runoff characteristics.
- (2)
- CEEMDAN component weights are negative in some models (e.g., LSTM), indicating that there is misleading noise in its decomposition results that can be automatically suppressed by SAO with negative weights.
- (3)
- Intelligent optimization outperforms least squares optimization in all models, with an average reduction in adaptation of 12.2%.
4.3. Model Layer Optimization Results
- (1)
- The fitness value reaches the global minimum, 0.0488, which is 32.84% and 30.21% lower than the optimal single model (0.0726) and single-layer optimization (0.0510), respectively.
- (2)
- The RRMSE is 0.0242, indicating that the framework has extremely high predictive stability and fitting ability.
- (3)
- The model combination has a clear preference: the deep recurrent model (RNN/LSTM) dominates the combination, while the shallow model (BP/LR) is significantly suppressed.
4.4. Comprehensive Comparison Results of Prediction Performance
- (1)
- Peak capture accuracy: At the peak of runoff, the optimal model (Best, red curve) matches the observed values (deviation = −5.94%). The single model (green) and SSA (blue) underestimate the peak by 19.98% and 13%, respectively.
- (2)
- Low-flow stability: In the low-flow segments from 0 to 250 days and after 400 days, all models have low absolute errors, but Best most closely matches the observed value curve.
5. Discussion
5.1. Effectiveness and Innovativeness of Dual-Level Optimization Mechanisms
- (1)
- (2)
- Optimization efficiency improvement
- (3)
- Adaptation function fusion of multiple indicators
5.2. Analysis of Decomposition Model-Matching Mechanism
- (1)
- (2)
- Noise problem of CEEMDAN
5.3. Comparison with Existing Methods and Room for Improvement
- (1)
- The introduction of meteorological forecast data to construct a synergistic prediction system with multi-source inputs.
- (2)
- An exploration of the applicability of other decomposition methods (e.g., VMD, EWT) in arid or monsoon-type watersheds.
- (3)
- The coupling of WRF with other meteorological models to construct a cascading prediction framework of “precipitation forecasting → soil moisture updating → runoff response”.
6. Conclusions
- (1)
- The adaptive decomposition mechanism demonstrates significant improvements in data processing efficacy. SSA decomposition consistently receives high combination weights across most models (e.g., WSSA = 0.6115 in LSTM), effectively capturing the seasonal runoff characteristics of the Hanjiang River. Conversely, CEEMDAN is automatically assigned negative weights in certain models (e.g., WCEEMDAN = −0.0063), which proves effective in suppressing noise interference while still extracting seasonal features. These results confirm the superiority of the proposed decomposition combination strategy over single-method approaches.
- (2)
- The two-layer optimization framework substantially enhances runoff prediction performance. Under SAO-based co-optimization, the fitness function value improves from 0.0726 (optimal single model: LSTM + SSA) to 0.0488. Error metrics (MAPE, RRMSE) show reductions of 32.84% and 30.21%, respectively, outperforming both the best single model and single-layer optimization approaches. The SAO algorithm strengthens deep recurrent models (WRNN = 1.7639, WLSTM = 1.3082) while mitigating weaknesses in shallow models (e.g., BP, LR), demonstrating the framework’s adaptive capability to identify and regulate model effectiveness. Notably, this data-driven approach requires only historical runoff sequences, making it particularly suitable for rapid deployment scenarios with limited parameter information.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Yin, D.; Li, X.; Wang, F.; Liu, Y.; Croke, B.F.; Jakeman, A.J. Water-energy-ecosystem nexus modeling using multi-objective, non-linear programming in a regulated river: Exploring tradeoffs among environmental flows, cascaded small hydropower, and inter-basin water diversion projects. J. Environ. Manag. 2022, 308, 114582. [Google Scholar] [CrossRef]
- Nazeer, A.; Maskey, S.; Skaugen, T.; McClain, M.E. Simulating the hydrological regime of the snow fed and glaciarised Gilgit Basin in the Upper Indus using global precipitation products and a data parsimonious precipitation-runoff model. Sci. Total Environ. 2022, 802, 149872. [Google Scholar] [CrossRef]
- Mengistu, A.G.; van Rensburg, L.D.; Woyessa, Y.E. Techniques for calibration and validation of SWAT model in data scarce arid and semi-arid catchments in South Africa. J. Hydrol. Reg. Stud. 2019, 25, 100621. [Google Scholar] [CrossRef]
- Guduru, J.U.; Mohammed, A.S. Hydrological modeling using HEC-HMS model, case of Tikur Wuha River Basin, Rift Valley River Basin, Ethiopia. Environ. Chall. 2024, 17, 101017. [Google Scholar] [CrossRef]
- Chen, T.; Zou, L.; Xia, J.; Liu, H.; Wang, F. Decomposing the impacts of climate change and human activities on runoff changes in the Yangtze River Basin: Insights from regional differences and spatial correlations of multiple factors. J. Hydrol. 2022, 615, 128649. [Google Scholar] [CrossRef]
- Okkan, U.; Ersoy, Z.B.; Kumanlioglu, A.A.; Fistikoglu, O. Embedding machine learning techniques into a conceptual model to improve monthly runoff simulation: A nested hybrid rainfall-runoff modeling. J. Hydrol. 2021, 598, 126433. [Google Scholar] [CrossRef]
- Attia, M.; Driss, Z.; Abd-Elhamid, H.F.; Ennetta, R. Enhanced water quality prediction: Application of deep neural networks and adaptive neuro-fuzzy inference systems to assess calcium concentration. Desalination Water Treat. 2025, 322, 101193. [Google Scholar] [CrossRef]
- Jajarmizadeh, M.; Lafdani, E.K.; Harun, S.; Ahmadi, A. Application of SVM and SWAT models for monthly streamflow prediction, a case study in South of Iran. KSCE J. Civ. Eng. 2015, 19, 345–357. [Google Scholar] [CrossRef]
- Van, S.P.; Le, H.M.; Thanh, D.V.; Dang, T.D.; Loc, H.H.; Anh, D.T. Deep learning convolutional neural network in rainfall–runoff modelling. J. Hydroinform. 2020, 22, 541–561. [Google Scholar] [CrossRef]
- Zhao, X.; Chen, X.; Xu, Y.; Xi, D.; Zhang, Y.; Zheng, X. An EMD-Based Chaotic Least Squares Support Vector Machine Hybrid Model for Annual Runoff Forecasting. Water 2017, 9, 153. [Google Scholar] [CrossRef]
- Mehr, A.D.; Reihanifar, M.; Alee, M.M.; Ghaffari, M.A.V.; Safari, M.J.S.; Mohammadi, B. VMD-GP: A New Evolutionary Explicit Model for Meteorological Drought Prediction at Ungauged Catchments. Water 2023, 15, 2686. [Google Scholar] [CrossRef]
- Ahmadi, F.; Tohidi, M.; Sadrianzade, M. Streamflow prediction using a hybrid methodology based on variational mode decomposition (VMD) and machine learning approaches. Appl. Water Sci. 2023, 13, 135. [Google Scholar] [CrossRef]
- Zhou, D.; Liu, Y.; Wang, X.; Wang, F.; Jia, Y. Combined ultra-short-term photovoltaic power prediction based on CEEMDAN decomposition and RIME optimized AM-TCN-BiLSTM. Energy 2025, 318, 134847. [Google Scholar] [CrossRef]
- Nguyen, A.D.; Vu, V.H.; Hoang, D.V.; Nguyen, T.D.; Nguyen, K.; Le Nguyen, P.; Ji, Y. Attentional ensemble model for accurate discharge and water level prediction with training data enhancement. Eng. Appl. Artif. Intell. 2023, 126, 107073. [Google Scholar] [CrossRef]
- Zhang, X.; Zheng, Z.; Wang, K. Prediction of runoff in the upper Yangtze River based on CEEMDAN-NAR model. Water Supply 2021, 21, 3307–3318. [Google Scholar] [CrossRef]
- Deng, L.; Liu, S. Snow ablation optimizer: A novel metaheuristic technique for numerical optimization and engineering design. Expert Syst. Appl. 2023, 225, 120069. [Google Scholar] [CrossRef]
- Torres, M.E.; Colominas, M.A.; Schlotthauer, G.; Flandrin, P. A complete ensemble empirical mode decomposition with adaptive noise. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011. [Google Scholar]
- Zhang, Y.; Chiew, F.H.; Li, M.; Post, D. Predicting Runoff Signatures Using Regression and Hydrological Modeling Approaches. Water Resour. Res. 2018, 54, 7859–7878. [Google Scholar] [CrossRef]
- Perry, M.B.; Spoerre, J.K.; Velasco, T. Control chart pattern recognition using back propagation artificial neural networks. Int. J. Prod. Res. 2001, 39, 3399–3418. [Google Scholar] [CrossRef]
- Sheng, Z.; Wen, S.; Feng, Z.-K.; Gong, J.; Shi, K.; Guo, Z.; Yang, Y.; Huang, T. A Survey on Data-Driven Runoff Forecasting Models Based on Neural Networks. IEEE Trans. Emerg. Top. Comput. Intell. 2023, 7, 1083–1097. [Google Scholar] [CrossRef]
- Waqas, M.; Humphries, U.W. A critical review of RNN and LSTM variants in hydrological time series predictions. MethodsX 2024, 13, 102946. [Google Scholar] [CrossRef] [PubMed]
- Shobuzako, K.; Yoshida, S.; Kawada, Y.; Nakashima, R.; Fujioka, S.; Asai, M. A generalized smoothed particle hydrodynamics method based on the moving least squares method and its discretization error estimation. Results Appl. Math. 2025, 26, 100594. [Google Scholar] [CrossRef]
Sign | Model | MAPE | RRMSE | NSE | Fitness |
---|---|---|---|---|---|
single | LR | 0.2384 | 0.2356 | 0.7644 | 0.2365 |
BP | 0.1728 | 0.1966 | 0.8034 | 0.1887 | |
RNN | 0.1211 | 0.1068 | 0.8932 | 0.1116 | |
LSTM | 0.0956 | 0.0668 | 0.9332 | 0.0764 | |
GRU | 0.1373 | 0.1314 | 0.8686 | 0.1334 | |
SSA | LR | 0.1581 | 0.1864 | 0.8136 | 0.1769 |
BP | 0.1407 | 0.1501 | 0.8499 | 0.1470 | |
RNN | 0.1632 | 0.1848 | 0.8152 | 0.1776 | |
LSTM | 0.0915 | 0.0632 | 0.9368 | 0.0726 | |
GRU | 0.1515 | 0.1461 | 0.8539 | 0.1479 | |
CEEMDAN | LR | 0.2684 | 0.1727 | 0.8273 | 0.2046 |
BP | 0.3745 | 0.2248 | 0.7752 | 0.2747 | |
RNN | 0.9499 | 0.4322 | 0.5678 | 0.6048 | |
LSTM | 0.9179 | 0.3754 | 0.6246 | 0.5562 | |
GRU | 0.1924 | 0.0934 | 0.9066 | 0.1264 |
Primary Optimization Method | Model | Data Decomposition Layer Combination Coefficients | Evaluation Metrics | |||||
---|---|---|---|---|---|---|---|---|
Wsingle | WSSA | WCEEMDAN | MAPE | RRMSE | NSE | Fitness | ||
Least squares method | LR | 1.0451 | −0.2152 | 0.4492 | 0.3998 | 0.1002 | 0.8998 | 0.2000 |
BP | −0.8088 | 1.4917 | 0.2661 | 0.1830 | 0.1291 | 0.8709 | 0.1471 | |
RNN | 1.0705 | 0.0174 | −0.0138 | 0.1540 | 0.0990 | 0.9010 | 0.1173 | |
LSTM | 0.8698 | 0.2716 | −0.0820 | 0.1215 | 0.0567 | 0.9433 | 0.0783 | |
GRU | −0.4594 | 0.7268 | 0.5847 | 0.1552 | 0.0682 | 0.9318 | 0.0972 | |
SAO | LR | 0.3819 | 0.4631 | 0.2038 | 0.1777 | 0.1378 | 0.8622 | 0.1511 |
BP | −0.4579 | 1.3616 | 0.0875 | 0.1473 | 0.1363 | 0.8637 | 0.1400 | |
RNN | 1.3278 | −0.3283 | 0.0028 | 0.1184 | 0.1006 | 0.8994 | 0.1065 | |
LSTM | 0.4061 | 0.6115 | −0.0063 | 0.0896 | 0.0601 | 0.9399 | 0.0699 | |
GRU | −0.1179 | 0.5137 | 0.5347 | 0.1112 | 0.0717 | 0.9283 | 0.0849 |
Optimization Method | Model Layer Combination Coefficients | Evaluation Metrics | |||||||
---|---|---|---|---|---|---|---|---|---|
WLR | WBP | WRNN | WLSTM | WGRU | MAPE | RRMSE | NSE | Fitness | |
Least squares method | −0.3835 | −2.0036 | 1.9868 | 1.4872 | −0.0808 | 0.1058 | 0.0236 | 0.9764 | 0.0510 |
SAO | −0.3150 | −1.8176 | 1.7639 | 1.3082 | 0.0613 | 0.0980 | 0.0242 | 0.9758 | 0.0488 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hu, W.; Yuan, X. Hybrid Decomposition Strategies and Model Combinatorial Optimization for Runoff Prediction. Water 2025, 17, 2560. https://doi.org/10.3390/w17172560
Hu W, Yuan X. Hybrid Decomposition Strategies and Model Combinatorial Optimization for Runoff Prediction. Water. 2025; 17(17):2560. https://doi.org/10.3390/w17172560
Chicago/Turabian StyleHu, Wenbin, and Xiaohui Yuan. 2025. "Hybrid Decomposition Strategies and Model Combinatorial Optimization for Runoff Prediction" Water 17, no. 17: 2560. https://doi.org/10.3390/w17172560
APA StyleHu, W., & Yuan, X. (2025). Hybrid Decomposition Strategies and Model Combinatorial Optimization for Runoff Prediction. Water, 17(17), 2560. https://doi.org/10.3390/w17172560