An Adaptive Hybrid Short-Term Load Forecasting Framework Based on Improved Rime Optimization Variational Mode Decomposition and Cross-Dimensional Attention
Abstract
1. Introduction
- (1)
- An adaptive signal decomposition method (IRIME-VMD) is proposed. This study employs an Improved Rime Optimization Algorithm (IRIME) integrated with a Gaussian Mutation strategy to perform global adaptive optimization of the key parameter combination—the number of decomposed modes (K) and the penalty factor (α)—in Variational Mode Decomposition (VMD). By introducing Gaussian Mutation, this method enhances population diversity, effectively avoiding the problem of traditional optimization algorithms easily falling into local optima during the solution process. Consequently, it achieves an optimal decomposition of the original load sequence, transforming it into a set of more stationary and more predictable Intrinsic Mode Functions (IMFs).
- (2)
- A multi-dimensional feature matrix was constructed, and a prediction model based on Cross-Dimensional Attention (CDA-LSTNet) was designed. Subsequent to the signal decomposition, this study utilizes the Maximal Information Coefficient (MIC) method to screen for climatic features that are highly correlated with the load. These features are then combined with the decomposed IMF components to construct a multi-dimensional information matrix. This matrix serves as the input for a novel CDA-LSTNet prediction model. The key innovation of this model lies in the introduction of the Cross-Dimensional Attention (CDA) mechanism. This enables the network to dynamically assign importance weights to different input feature dimensions (i.e., each IMF component and climatic variable) at every time step, thereby more intelligently capturing the key driving factors that influence load variations.
- (3)
- An end-to-end synergistic forecasting framework was integrated and validated. This study organically combines adaptive signal decomposition (IRIME-VMD) with dynamic feature attention prediction (CDA-LSTNet) to form a complete and automated hybrid framework, spanning from raw data processing to final result prediction. Through a “decompose first, then fuse and predict” strategy, this framework synergistically leverages the strengths of each module. This not only enhances the overall prediction accuracy of the model but also effectively mitigates the prediction lag problem commonly observed in single deep learning models.
2. The IRIME-VMD-CDA-LSTNet Hybrid Model
2.1. Fundamental Principles
2.1.1. Principles of the RIME Algorithm
2.1.2. Gaussian Mutation
2.1.3. Variational Mode Decomposition
2.2. IRIME-VMD Modal Decomposition
- (1)
- Initialization: The optimization process is initiated by defining the population size N, the maximum number of iterations T, and the boundary constraints [Lb, Ub] for the RIME algorithm. Simultaneously, the initial decomposition parameters for VMD are established.
- (2)
- Data input: The power load sequence intended for decomposition is imported into the system as the target signal.
- (3)
- Position update: The particle adherence coefficient E is computed based on the ratio of the current iteration t to the total iterations T. Subsequently, the particle positions are updated according to Equation (2) when the stochastic condition (r2 < E) is met.
- (4)
- Fitness evaluation: The average sample entropy is used as the fitness evaluation metric to measure the complexity and information redundancy of the VMD decomposition results. For a given parameter combination [K, ], the original load sequence is decomposed into K modal components via VMD. The Hilbert transform envelope signal of each component is calculated, and the sample entropy is computed based on a sliding window. The final fitness value is the mean of the sample entropies of all modal components. Its mathematical expression is:
- (5)
- Boundary constraint and assessment: A boundary check is enforced to constrain updated particle positions within the feasible range [Lb, Ub], ensuring the physical validity of parameters. The fitness of the new position is then evaluated; if it outperforms the original, the global optimal solution is updated accordingly.
- (6)
- Termination and output: The process evaluates whether the termination criteria (maximum iterations T or convergence thresholds) are satisfied. Upon meeting these conditions, the optimal parameter combination is output and applied to the final VMD decomposition. Conversely, if criteria are unmet, the algorithm proceeds to the adaptive mutation adjustment phase.
2.3. CDA-LSTNet
3. Experimental Design and Validation
3.1. Dataset
3.2. Evaluation Metrics
4. Case Study Analysis
4.1. Baseline Model Comparison
4.2. Comparative Study
5. Conclusions
- (1)
- An improved Rime-Optimization Variational Mode Decomposition method (IRIME-VMD) is proposed. Unlike traditional optimizers (e.g., GA, GWO, PSO) that rely on experience or are prone to local optima, the Gaussian-Mutation strategy introduced in this paper significantly enhances the global search capability of the RIME algorithm. The comparative study (Table 2) strongly demonstrates this: compared to GA-VMD, GWO-VMD, and PSO-VMD, IRIME-VMD reduced the RMSE on Substation A by 14.1%, 13.1%, and 18.9%, respectively. This confirms that the method effectively avoids local optima and finds superior VMD parameters, thereby achieving a “cleaner” and more stable signal decomposition, which is the fundamental prerequisite for subsequent accurate prediction.
- (2)
- An LSTNet model optimized by a cross-dimensional attention mechanism (CDA-LSTNet) is constructed. Most attention mechanisms in existing research are limited to the temporal dimension, neglecting the fact that the contributions of different input features (such as various IMF components, temperature, humidity, etc.) are dynamically variable at different time steps. The Cross-Dimensional Attention (CDA) mechanism designed in this paper successfully addresses this issue. It enables the model to adaptively and dynamically assign weights to each feature dimension. This allows the model to intelligently amplify the influence of key driving factors (e.g., a specific high-frequency IMF component or an abrupt temperature change) while suppressing the interference from noise components, thereby significantly enhancing the model’s dynamic perception capabilities and robustness.
- (3)
- The proposed hybrid framework (IRIME-VMD-CDA-LSTNet), through the synergistic interaction of the two aforementioned innovations, effectively resolves the “prediction lag” problem commonly found in traditional models (including single deep learning models and baseline decomposition models). As demonstrated in the baseline comparison, baseline models consistently exhibit significant lag and amplitude underestimation at load peaks. In contrast, the model proposed in this paper, by leveraging its high-quality decomposition and intelligent feature fusion, achieves high-fidelity tracking of the true load with virtually no phase delay. This capability is of critical practical application value for the real-time dispatch and secure operation of power systems, far outweighing the importance of simple aggregated error metrics.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Baur, L.; Ditschuneit, K.; Schambach, M.; Kaymakci, C.; Wollmann, T.; Sauer, A. Explainability and Interpretability in Electric Load Forecasting Using Machine Learning Techniques–A Review. Energy AI 2024, 16, 100358. [Google Scholar] [CrossRef]
- Wang, C.; Zhao, H.; Liu, Y.; Fan, G. Minute-level ultra-short-term power load forecasting based on time series data features. Appl. Energy 2024, 372, 123801. [Google Scholar] [CrossRef]
- Liu, Y.; Liao, J.; Guo, C.; Tan, Z.; Wang, Y.; Wei, N.; Zhou, N.; Song, Y. Fault Severity Classification Based Coordination Control Strategy of Fault Current Limiter and Modular Multilevel Converter for Adaptive Fault Current Limiting. J. Mod. Power Syst. Clean Energy 2025, 13, 1432–1443. [Google Scholar] [CrossRef]
- Liu, Y.; Liao, J.; Guo, C.; Tan, Z.; Wang, Q.; Wang, Y.; Zhou, N. Self-Adaptive Action and Parameter Optimization of DC Series-Parallel Power Flow Controller for Fault Current Limiting in Bipolar DC Distribution Systems. J. Mod. Power Syst. Clean Energy 2025, 13, 732–746. [Google Scholar] [CrossRef]
- Eren, Y.; Küçükdemiral, İ. A comprehensive review on deep learning approaches for short-term load forecasting. Renew. Sustain. Energy Rev. 2024, 189, 114031. [Google Scholar] [CrossRef]
- Ullah, K.; Ahsan, M.; Hasanat, S.M.; Haris, M.; Yousaf, H.; Raza, S.F.; Tandon, R.; Abid, S.; Ullah, Z. Short-Term Load Forecasting: A Comprehensive Review and Simulation Study With CNN-LSTM Hybrids Approach. IEEE Access 2024, 12, 111858–111881. [Google Scholar] [CrossRef]
- Wan, A.; Chang, Q.; AL-Bukhaiti, K.; He, J. Short-term power load forecasting for combined heat and power using CNN-LSTM enhanced by attention mechanism. Energy 2023, 282, 128274. [Google Scholar] [CrossRef]
- Shafiuzzaman, M.; Safayet Islam, M.; Rubaith Bashar, T.M.; Munem, M.; Nahiduzzaman, M.; Ahsan, M.; Haider, J. Enhanced very short-term load forecasting with multi-lag feature engineering and prophet-XGBoost-CatBoost architecture. Energy 2025, 335, 137981. [Google Scholar] [CrossRef]
- Nabavi, S.A.; Mohammadi, S.; Motlagh, N.H.; Tarkoma, S.; Geyer, P. Deep learning modeling in electricity load forecasting: Improved accuracy by combining DWT and LSTM. Energy Rep. 2024, 12, 2873–2900. [Google Scholar] [CrossRef]
- Yang, D.; Guo, J.; Li, Y.; Sun, S.; Wang, S. Short-term load forecasting with an improved dynamic decomposition-reconstruction-ensemble approach. Energy 2023, 263, 125609. [Google Scholar] [CrossRef]
- Xiao, Y.; Wu, S.; He, C.; Hu, Y.; Yi, M. An effective hybrid wind power forecasting model based on “decomposition-reconstruction-ensemble” strategy and wind resource matching. Sustain. Energy Grids Netw. 2024, 38, 101293. [Google Scholar] [CrossRef]
- Zhou, M.; Hu, T.; Bian, K.; Lai, W.; Hu, F.; Hamrani, O.; Zhu, Z. Short-Term Electric Load Forecasting Based on Variational Mode Decomposition and Grey Wolf Optimization. Energies 2021, 14, 4890. [Google Scholar] [CrossRef]
- Hu, T.; Zhou, M.; Bian, K.; Lai, W.; Zhu, Z. Short-Term Load Probabilistic Forecasting Based on Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise Reconstruction and Salp Swarm Algorithm. Energies 2022, 15, 147. [Google Scholar] [CrossRef]
- Yin, C.; Wei, N.; Wu, J.; Ruan, C.; Luo, X.; Zeng, F. An Empirical Mode Decomposition-Based Hybrid Model for Sub-Hourly Load Forecasting. Energies 2024, 17, 307. [Google Scholar] [CrossRef]
- Liu, W.; Bai, Y.; Yue, X.; Wang, R.; Song, Q. A wind speed forcasting model based on rime optimization based VMD and multi-headed self-attention-LSTM. Energy 2024, 294, 130726. [Google Scholar] [CrossRef]
- Mohamed, S.A.; Shaheen, A.M.; Alqahtani, M.H.; Al Faiya, B.M. Enhancement of rime algorithm using quadratic interpolation learning for parameters identification of photovoltaic models. Sci. Rep. 2025, 15, 21166. [Google Scholar] [CrossRef]
- Zhou, D.; Liu, Y.; Wang, X.; Wang, F.; Jia, Y. Combined ultra-short-term photovoltaic power prediction based on CEEMDAN decomposition and RIME optimized AM-TCN-BiLSTM. Energy 2025, 318, 134847. [Google Scholar] [CrossRef]
- Rubasinghe, O.; Zhang, X.; Chau, T.K.; Chow, Y.H.; Fernando, T.; Iu, H.H.-C. A Novel Sequence to Sequence Data Modelling Based CNN-LSTM Algorithm for Three Years Ahead Monthly Peak Load Forecasting. IEEE Trans. Power Syst. 2024, 39, 1932–1947. [Google Scholar] [CrossRef]
- Chu, X.; Jin, H.; Li, Y.; Feng, J.; Mu, W. CDA-LSTM: An evolutionary convolution-based dual-attention LSTM for univariate time series prediction. Neural Comput. Appl. 2021, 33, 16113–16137. [Google Scholar] [CrossRef]
- Ran, J.; Gong, Y.; Hu, Y.; Cai, J. EV load forecasting using a refined CNN-LSTM-AM. Electr. Power Syst. Res. 2025, 238, 111091. [Google Scholar] [CrossRef]
- Zhan, X.; Kou, L.; Xue, M.; Zhang, J.; Zhou, L. Reliable Long-Term Energy Load Trend Prediction Model for Smart Grid Using Hierarchical Decomposition Self-Attention Network. IEEE Trans. Reliab. 2023, 72, 609–621. [Google Scholar] [CrossRef]
- Guo, X.; Gao, Y.; Li, Y.; Zheng, D.; Shan, D. Short-term household load forecasting based on Long- and Short-term Time-series network. Energy Rep. 2021, 7, 58–64. [Google Scholar] [CrossRef]
- Lin, J.; Ma, J.; Zhu, J.; Cui, Y. Short-term load forecasting based on LSTM networks considering attention mechanism. Int. J. Electr. Power Energy Syst. 2022, 137, 107818. [Google Scholar] [CrossRef]
- Li, K.; Huang, W.; Hu, G.; Li, J. Ultra-short term power load forecasting based on CEEMDAN-SE and LSTM neural network. Energy Build. 2023, 279, 112666. [Google Scholar] [CrossRef]
- Feng, Z.; Zhang, X.; Quan, W.; Liu, X.; An, J.; Wang, C.; Ji, X.; Kang, L. A hybrid deep learning model based on Rime optimization and multi-head attention for cooling load prediction in public buildings. Energy 2025, 339, 139100. [Google Scholar] [CrossRef]
- Gu, G.; Lou, J.; Wan, H. A multi-strategy improved rime optimization algorithm for three-dimensional USV path planning and global optimization. Sci. Rep. 2024, 14, 12603. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Zhang, W.; Ma, Y.; Yu, Y.; Chen, H. An improved RIME optimization algorithm based maximum power point tracking method for photovoltaic system under partially shading condition. Sci. Rep. 2025, 15, 19507. [Google Scholar] [CrossRef]
- Shi, J.; Chen, Y.; Heidari, A.A.; Cai, Z.; Chen, H.; Chen, Y.; Liang, G. Environment random interaction of rime optimization with Nelder-Mead simplex for parameter estimation of photovoltaic models. Sci. Rep. 2024, 14, 15701. [Google Scholar] [CrossRef]
- He, X.; Kawaguchi, T.; Hashimoto, S. Intelligent Identification Method of Low Voltage AC Series Arc Fault Based on Using Residual Model and Rime Optimization Algorithm. Energies 2024, 17, 4675. [Google Scholar] [CrossRef]
- Zhao, Z.; Dai, Y.; Li, K.; Zhang, Z.; Fang, Y.; Chen, B.; Zhao, Q. Estimation of Lithium Battery State of Health Using Hybrid Deep Learning with Multi-Step Feature Engineering and Optimization Algorithm Integration. Energies 2025, 18, 5849. [Google Scholar] [CrossRef]
- Wang, W.; Zhang, M.; Zhang, Z.; Du, D.; Tang, Z. Enhancing Photovoltaic Power Forecasting via Dual Signal Decomposition and an Optimized Hybrid Deep Learning Framework. Energies 2025, 18, 6159. [Google Scholar] [CrossRef]
- Li, K.; Yuan, L.; Qian, F.; Song, L.; Wu, X.; Wang, L.; Dai, J.; Shen, L. Short-Term Load Forecasting for Electricity Spot Markets Across Different Seasons Based on a Hybrid VMD-LSTM-Random Forest Model. Energies 2025, 18, 6097. [Google Scholar] [CrossRef]











| Substation | Metric | LSTM | GRU | SVR | Random Forest | ARIMA | Proposed Method |
|---|---|---|---|---|---|---|---|
| A | RMSE | 299.8 | 306.92 | 261.14 | 519.6 | 335.75 | 284.42 |
| MAE | 169.56 | 159.07 | 207.4 | 110.69 | 92.59 | 177.618 | |
| MAPE (%) | 27.42 | 25.59 | 50.2 | 12.87 | 39.27 | 30.13 | |
| B | RMSE | 315.05 | 304.18 | 277.1 | 320.62 | 267.1 | 285.39 |
| MAE | 171.48 | 171.8 | 172.52 | 189.38 | 166.9 | 170.44 | |
| MAPE (%) | 18.74 | 18.14 | 17.742 | 19.97 | 36.04 | 31.69 | |
| C | RMSE | 882.48 | 975.9 | 655.4 | 607.6 | 586.1 | 883.75 |
| MAE | 483.44 | 521.1 | 462.65 | 287.31 | 305.22 | 605.9 | |
| MAPE (%) | 23.46 | 26.8 | 26.38 | 10.91 | 39.37 | 35.62 | |
| D | RMSE | 1110.81 | 1128.7 | 771.5 | 540.1 | 485.1 | 794.8 |
| MAE | 739.48 | 683.7 | 680.77 | 401.6 | 364.5 | 506.1 | |
| MAPE (%) | 18.81 | 17.63 | 21.85 | 10.56 | 25.95 | 14.6 |
| Substation | Metric | GA-VMD | GWO-VMD | PSO-VMD | IRIME-VMD |
|---|---|---|---|---|---|
| A | RMSE | 330.956 | 327.172 | 350.555 | 284.42 |
| MAE | 214.250 | 212.783 | 219.336 | 177.618 | |
| MAPE (%) | 36.847 | 36.770 | 37.016 | 30.13 | |
| B | RMSE | 317.199 | 327.172 | 350.34 | 285.39 |
| MAE | 176.201 | 212.783 | 193.34 | 170.44 | |
| MAPE (%) | 31.599 | 36.770 | 33.964 | 31.69 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhang, A.; Liu, D.; Liao, J. An Adaptive Hybrid Short-Term Load Forecasting Framework Based on Improved Rime Optimization Variational Mode Decomposition and Cross-Dimensional Attention. Energies 2026, 19, 497. https://doi.org/10.3390/en19020497
Zhang A, Liu D, Liao J. An Adaptive Hybrid Short-Term Load Forecasting Framework Based on Improved Rime Optimization Variational Mode Decomposition and Cross-Dimensional Attention. Energies. 2026; 19(2):497. https://doi.org/10.3390/en19020497
Chicago/Turabian StyleZhang, Aodi, Daobing Liu, and Jianquan Liao. 2026. "An Adaptive Hybrid Short-Term Load Forecasting Framework Based on Improved Rime Optimization Variational Mode Decomposition and Cross-Dimensional Attention" Energies 19, no. 2: 497. https://doi.org/10.3390/en19020497
APA StyleZhang, A., Liu, D., & Liao, J. (2026). An Adaptive Hybrid Short-Term Load Forecasting Framework Based on Improved Rime Optimization Variational Mode Decomposition and Cross-Dimensional Attention. Energies, 19(2), 497. https://doi.org/10.3390/en19020497
