Short-Term Power Load Forecasting Based on CEEMDAN-WT-VMD Joint Denoising and BiTCN-BiGRU-Attention
Abstract
:1. Introduction
- Hierarchical denoising strategy: A CEEMDAN-WT-VMD framework is designed to suppress multi-source noise. CEEMDAN first decomposes raw load data into stable IMFs while mitigating mode mixing. Wavelet thresholding (WT) then filters high-frequency noise components, and VMD performs secondary decomposition to enhance feature stability. This joint approach resolves the limitations of single-stage decomposition and residual noise in existing methods.
- Bidirectional spatiotemporal modeling: A BiTCN-BiGRU-Attention network is developed to exploit bidirectional temporal dependencies. The BiTCN captures multi-scale historical patterns through dilated causal convolutions, while the BiGRU extracts forward–backward dynamic features. An attention mechanism dynamically weights critical temporal states, overcoming the unidirectional information bottleneck in traditional RNN-based models.
- Validation on real-world data: Experiments on Australian power load data demonstrate superior performance, with the proposed model achieving a 0.65% MAPE, significantly outperforming benchmarks like VMD-BiTCN-BiGRU-Attention (1.21% MAPE) and non-denoised hybrid models (1.64% MAPE). This validates the effectiveness of the joint denoising and bidirectional architecture in improving both accuracy and robustness.
2. Denoising and Decomposition of Power Load Time Series
2.1. Principle of CEEMDAN
2.2. CEEMDAN Combined with Wavelet Threshold Denoising
- Signal Decomposition: The noisy load data are decomposed into multiple scales using CEEMDAN, yielding N IMF components, each containing distinct frequency characteristics of the signal.
- Noise Processing: High-frequency IMF components are further denoised via wavelet thresholding to obtain refined components . A hybrid soft-hard threshold method is adopted to mitigate the limitations of single-threshold approaches. The soft threshold function is defined as Equation (4):
- 3.
- An appropriate wavelet basis function is selected, and the decomposition level is determined based on signal characteristics. Wavelet decomposition generates coefficients , including noise coefficients and target signal coefficients .
- 4.
- Signal Reconstruction: The denoised high-frequency components for cumulative reconstruction, producing the final denoised power load data. The reconstruction formula is expressed as Equation (5):
- 5.
- Denoising Effect Quantification: The signal-to-noise ratio () is used to evaluate denoising performance, calculated via Equation (6):
2.3. Theory of VMD Algorithm
3. Short-Term Power Load Forecasting Model
3.1. BiTCN Neural Network Architecture
3.2. BiGRU Neural Network
3.3. Attention Mechanism
- Attention Scoring Function:
- 2.
- Softmax Function:
- 3.
- Output Calculation:
4. Hybrid CWVMD-BiTCN-BiGRU-Attention Forecasting
4.1. Input Feature Settings for Load Forecasting Model
4.2. CWVMD-BiTCN-BiGRU-Attention Hybrid Forecasting Model
- Decomposition: The denoised load time series is decomposed into multiple IMF components using the CWVMD method.
- Model Training: For each IMF component, a BiTCN-BiGRU-Attention neural network prediction model is established. Model parameters are initialized and optimized via the Adam optimizer, adjusting weight parameters to enhance performance.
- Iterative Forecasting: The trained models generate predictions iteratively, aggregating results to produce the final forecast.
- BiTCN Layer: IMF components are fed into the BiTCN layer, which consists of forward and reverse TCN modules. These modules process sequential data bidirectionally through dilated causal convolution, batch normalization, Leaky ReLU activation, and dropout operations to extract temporal features.
- BiGRU Layer: The output vectors from BiTCN serve as inputs to the BiGRU layer, which includes forward and reverse GRU modules to capture long-term dependencies in the time series.
- Attention Layer: The outputs from BiGRU are weighted by the attention mechanism to emphasize critical temporal patterns.
- Output Layer: The weighted features are passed to the output layer to generate the final prediction.
4.3. Evaluation Metrics
- : Total number of test samples.
- : Error between the -th predicted value and the corresponding true value.
- : Actual (true) value of the -th sample.
5. Case Analysis
5.1. Data Source and Preprocessing
5.2. Data Denoising
5.3. VMD Decomposition
5.4. Model Parameters and Prediction Results of BiTCN-BiGRU-Attention
5.5. Ablation Experiments and Prediction Model Comparison
6. Conclusions
- The original load data are decomposed into multiple IMF (Intrinsic Mode Function) components using CEEMDAN. Wavelet threshold denoising is applied to remove noise from high-frequency components, followed by secondary feature extraction via VMD on the denoised signal. This process generates load time series with enhanced stationarity and periodicity, enabling the BiTCN model to better capture long-term dependencies and improve fitting capability.
- The VMD-derived IMF components are fed into the BiTCN network using time windows, leveraging BiTCN’s strength in extracting latent features and long-range dependencies. This step provides more accurate feature representations for subsequent modeling.
- The feature vectors extracted by BiTCN are used as inputs for the BiGRU model, which further models complex temporal relationships and nonlinear characteristics of load data. Concurrently, the Attention mechanism dynamically focuses on critical features, enhancing prediction accuracy and model robustness.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zhang, Z. Energy system optimization based on fuzzy decision support system and unstructured data. Energy Inform. 2024, 7, 82. [Google Scholar] [CrossRef]
- Zaboli, A.; Kasimalla, S.R.; Park, K.; Hong, Y.; Hong, J. A Comprehensive Review of Behind-the-Meter Distributed Energy Resources Load Forecasting: Models. Energies 2023, 17, 2534. [Google Scholar] [CrossRef]
- Ullah, K.; Ahsan, M.; Hasanat, S.M.; Haris, M.; Yousaf, H.; Raza, S.F.; Tandon, R.; Abid, S.; Ullah, Z. Short-Term Load Forecasting: A Comprehensive Review and Simulation Study with CNN-LSTM Hybrids Approach. IEEE Access 2024, 12, 111858–111881. [Google Scholar] [CrossRef]
- Buratto, W.G.; Muniz, R.N.; Nied, A.; González, G.V. Seq2Seq-LSTM With Attention for Electricity Load Forecasting in Brazil. IEEE Access 2024, 12, 30020–30029. [Google Scholar] [CrossRef]
- Das, S.; Fouda, M.M.; Abdo, M.G. Short-Term Load Forecasting Using GRU-LGBM Fusion. In Proceedings of the 2024 International Conference on Smart Applications, Communications and Networking (SmartNets), Washington DC, USA, 28–30 May 2024; pp. 1–6. [Google Scholar]
- Wang, X.; Wang, H.; Bhandari, B.; Cheng, L. AI-Empowered Methods for Smart Energy Consumption: A Review of Load Forecasting, Anomaly Detection and Demand Response. Int. J. Precis. Eng. Manuf. Technol. 2024, 11, 963–993. [Google Scholar] [CrossRef]
- Amini, H.; Mehrizi-Sani, A. Similar Days Fuzzy Clustering Load Forecasting. In Proceedings of the IECON 2024—50th Annual Conference of the IEEE Industrial Electronics Society, Chicago, IL, USA, 3–6 November 2024; pp. 1–5. [Google Scholar] [CrossRef]
- Wang, Z.-X.; Ku, Y.-Y.; Liu, J. The Power Load Forecasting Model of Combined SaDE-ELM and FA-CAWOA-SVM Based on CSSA. IEEE Access 2024, 12, 41870–41882. [Google Scholar] [CrossRef]
- Chen, S.; Hao, C.; Dehghanian, P. Short-Term Load Forecasting Model Based on ICCEEMDAN-MRMR and CNN-SVM. In Proceedings of the 2024 4th International Conference on Energy Engineering and Power Systems (EEPS), Hangzhou, China, 9–11 August 2024; pp. 385–389. [Google Scholar] [CrossRef]
- Magalhães, B.; Bento, P.; Pombo, J.; Calado, M.D.R.; Mariano, S. Short-Term Load Forecasting Based on Optimized Random Forest and Optimal Feature Selection. Energies 2023, 17, 1926. [Google Scholar] [CrossRef]
- Wang, Q.; Peng, C.; Zhou, Y. Short-term power load forecasting based on improved back propagation neural network. In Proceedings of the SPIE 13395, International Conference on Optics, Electronics, and Communication Engineering (OECE 2024), 133952R, Foshan, China, 12 November 2024; pp. 698–705. [Google Scholar]
- Jiang, B.; Yang, H.; Wang, Y.; Liu, Y.; Geng, H.; Zeng, H.; Ding, J. Dynamic Temporal Dependency Model for Multiple Steps Ahead Short-Term Load Forecasting of Power System. IEEE Trans. Ind. Appl. 2024, 60, 5244–5254. [Google Scholar] [CrossRef]
- Wu, L.; Kong, C.; Hao, X.; Chen, W. A Short-Term Load Forecasting Method Based on GRU-CNN Hybrid Neural Network Model. Math. Probl. Eng. 2020, 1, 1428104. [Google Scholar] [CrossRef]
- Ahranjani, Y.K.; Beiraghi, M.; Ghanizadeh, R. Short time load forecasting for Urmia city using the novel CNN-LTSM deep learning structure. Electr. Eng. 2025, 107, 1253–1264. [Google Scholar] [CrossRef]
- Murali, S.; Saini, P.; Abhinav, K.; Shankar, R.; Parida, S.K. Improved LSTM-Based Load Forecasting Embedded 3DOF (FOPI)-FOPD Controller for Proactive Frequency Regulation in Power System. IEEE Trans. Ind. Appl. 2024, 60, 8213–8227. [Google Scholar] [CrossRef]
- Guo, D.; Zhang, Z.; Yang, B.; Zhang, J.; Yang, H.; Lin, Y. Integrating spoken instructions into flight trajectory prediction to optimize automation in air traffic control. Nat. Commun. 2024, 15, 15. [Google Scholar] [CrossRef] [PubMed]
- Chen, H.; Sun, Y.; Li, X.; Zheng, B.; Chen, T. Dual-Scale Complementary Spatial-Spectral Joint Model for Hyperspectral Image Classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2025, 18, 6772–6789. [Google Scholar] [CrossRef]
- Mateev, V.; Marinova, I. Modified chromosome pooling genetic algorithm for resource allocation optimization. AIP Conf. Proc. 2023, 2939, 100010. [Google Scholar] [CrossRef]
- Marin-Quintero, J.; Orozco-Henao, C.; Bretas, A.S.; Velez, J.C.; Herrada, A.; Barranco-Carlos, A.; Percybrooks, W.S. Adaptive Fault Detection Based on Neural Networks and Multiple Sampling Points for Distribution Networks and Microgrids. J. Mod. Power Syst. Clean Energy 2022, 10, 1648–1657. [Google Scholar] [CrossRef]
- Tan, D.; Tang, Z.; Zhou, F.; Xie, Y. A Novel Hybrid Model Based on EMD-Improved TCN-Improved TST for Short-Term Railway Traction Load Forecasting. IEEE Trans. Transp. Electrif. 2024, 11, 6418–6427. [Google Scholar] [CrossRef]
- Dong, Y.; Meng, Q.; Fu, J.; Ai, X.; Chen, Z.; Yin, Y.; Liu, K.; Zhang, X. Ultra-Short-Term Load Forecasting Based on EMD Decomposition and CNN-BiGRU Heterogeneous Computing Model. In Proceedings of the 2024 IEEE 4th International Conference on Power, Electronics and Computer Applications (ICPECA), Shenyang, China, 26–28 January 2024; pp. 1266–1272. [Google Scholar] [CrossRef]
- Dong, Y.; Yang, C.; Meng, Q.; Ai, X.; Yin, Y.; Liu, K.; Fu, J.; Chen, Z. Ultra-short-term load forecasting based on the combination of EEMD and Autoformer multi-model. In Proceedings of the 2024 IEEE 4th International Conference on Power, Electronics and Computer Applications (ICPECA), Shenyang, China, 26–28 January; pp. 1273–1279. [CrossRef]
- Yang, M.; Liu, Y.; Mu, Y.; Li, Z.; Zhang, H.; Chen, W.; Rong, F. Short-Term Power Load Forecasting Model Based on EEMD-SE-ERCNN. In Proceedings of the 2024 7th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Hangzhou, China, 15–17 August 2024; pp. 893–898. [Google Scholar] [CrossRef]
- Ye, H.; Zhu, Q.; Zhang, X. Short-Term Load Forecasting for Residential Buildings Based on Multivariate Variational Mode Decomposition and Temporal Fusion Transformer. Energies 2023, 17, 3061. [Google Scholar] [CrossRef]
- Duan, Q.; He, X.; Chao, Z.; Tang, X.; Li, Z. Short-term power load forecasting based on sparrow search algorithm-variational mode decomposition and attention-long short-term memory. Int. J. Low-Carbon Technol. 2024, 19, 1089–1097. [Google Scholar] [CrossRef]
- Xiang, X.; Yuan, T.; Cao, G.; Zheng, Y. Short-Term Electric Load Forecasting Based on Signal Decomposition and Improved TCN Algorithm. Energies 2023, 17, 1815. [Google Scholar] [CrossRef]
- Heng, L.; Hao, C.; Nan, L.C. Load forecasting method based on CEEMDAN and TCN-LSTM. PLoS ONE 2024, 19, e0300496. [Google Scholar] [CrossRef]
- Liu, J.; Luan, J.; Tai, Y.; Gao, C.; Li, S.; Jiao, L. Application of Signal Averaging CEEMDAN and Adaptive Wavelet Threshold Denoising in Signal Processing. In Proceedings of the 2024 5th International Conference on Machine Learning and Computer Application (ICMLCA), Hangzhou, China, 18–20 October 2024; pp. 535–538. [Google Scholar] [CrossRef]
- Liu, W.; Hua, F.; Cui, Y.; Xu, Y.; Liu, H. An Optimized Power Load Forecasting Algorithm Based on VMD-SMA-LSTM. Energy Sci. Eng. 2025, 9, 2857. [Google Scholar] [CrossRef]
- Tian, C.; Liu, Q.; Tian, C.; Ma, X.; Feng, Y.; Han, D. Interval Prediction of Air Conditioning Load Based on Quantile Regression BiTCN-BiGRU-Attention Model. In Proceedings of the 2024 China Automation Congress (CAC), Qingdao, China, 1–3 November 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Ullah, K.; Shakir, D.; Abid, U.; Alahmari, S.; Aslam, S.; Ullah, Z. Hybrid BiGRU-CNN Model for Load Forecasting in Smart Grids with High Renewable Energy Integration. IET Gener. Transm. Distrib. 2024, 19, e70060. [Google Scholar] [CrossRef]
- Zhang, X.; Ye, J.; Gao, L.; Ma, S.; Xie, Q.; Huang, H. Short-term wind power prediction based on ICEEMDAN decomposition and BiTCN–BiGRU-multi-head self-attention model. Electr. Eng. 2025, 107, 2645–2662. [Google Scholar] [CrossRef]
Feature | Description | Value Range |
---|---|---|
LOAD | Previous day load | Load sequence values y |
HOUR | Hour of the day | 00:00–23:00 (0.5 h steps) |
DBT | Dry bulb temperature | Real-time temperature values |
DPT | Dew point temperature | Real-time temperature values |
WBT | Wet bulb temperature | Real-time temperature values |
HMY | Humidity | Real-time humidity values |
Component | IMF_1 | IMF_2 | IMF_3 | IMF_4 | IMF_5 | IMF_6 | … |
---|---|---|---|---|---|---|---|
Percentage (%) | 1.03 | 0.56 | 6.19 | 1.28 | 0.02 | 1.25 | … |
Category | High | High | High | Low | Low | Low | Low |
Component | SNR (Pre-Denoising, dB) 1 | SNR (Post-Denoising, dB) |
---|---|---|
IMF-1 | 0.10 | 0.31 |
IMF-2 | 2.34 | 16.42 |
IMF-3 | 5.67 | 48.98 |
Reconstructed Signal | 41.06 | 227.10 |
Mode | K = 3 | K = 4 | K = 5 | K = 6 |
---|---|---|---|---|
IMF-1 | 7.43 × 10−6 | 7.01 × 10−6 | 6.98 × 10−6 | 6.95 × 10−6 |
IMF-2 | 0.021 | 0.021 | 0.020 | 0.019 |
IMF-3 | 0.046 | 0.043 | 0.041 | 0.040 |
IMF-4 | … | 0.055 | 0.044 | 0.042 |
IMF-5 | … | … | 0.064 | 0.045 |
IMF-6 | … | … | … | 0.065 |
Dropout | Train RMSE (MW) | Val RMSE (MW) | Test RMSE (MW) |
---|---|---|---|
0.1 | 45.2 | 62.8 | 63.5 |
0.2 | 48.6 | 58.3 | 57.0 |
0.3 | 52.1 | 59.7 | 58.9 |
0.5 | 60.4 | 65.2 | 64.1 |
Model | RMSE (MW) | MAPE (%) | MAE (MW) |
---|---|---|---|
CWVMD-BiGRU | 130.84 | 1.15 | 115.32 |
CWVMD-BiTCN | 115.63 | 1.02 | 102.67 |
CWVMD-BiGRU-Attention | 85.42 | 0.83 | 79.45 |
CWVMD-BiTCN-Attention | 93.54 | 0.90 | 83.85 |
CWVMD-BiTCN-BiGRU | 79.84 | 0.78 | 76.71 |
CWVMD-BiTCN-BiGRU-Attention | 57.01 | 0.65 | 54.62 |
Model | RMSE (MW) | MAPE (%) | MAE (MW) |
---|---|---|---|
BiTCN-BiGRU-Attention | 165.39 | 1.64 | 144.79 |
VMD-BiTCN-BiGRU-Attention | 109.15 | 1.21 | 98.21 |
WT-VMD-BiTCN-BiGRU-Attention | 85.63 | 1.05 | 84.37 |
CWVMD-BiTCN-BiGRU-Attention | 57.01 | 0.65 | 54.62 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, X.; Gao, Y.; Song, W.; Zen, Y.; Shi, X. Short-Term Power Load Forecasting Based on CEEMDAN-WT-VMD Joint Denoising and BiTCN-BiGRU-Attention. Electronics 2025, 14, 1871. https://doi.org/10.3390/electronics14091871
Guo X, Gao Y, Song W, Zen Y, Shi X. Short-Term Power Load Forecasting Based on CEEMDAN-WT-VMD Joint Denoising and BiTCN-BiGRU-Attention. Electronics. 2025; 14(9):1871. https://doi.org/10.3390/electronics14091871
Chicago/Turabian StyleGuo, Xincheng, Yan Gao, Wanqing Song, Yi Zen, and Xianhui Shi. 2025. "Short-Term Power Load Forecasting Based on CEEMDAN-WT-VMD Joint Denoising and BiTCN-BiGRU-Attention" Electronics 14, no. 9: 1871. https://doi.org/10.3390/electronics14091871
APA StyleGuo, X., Gao, Y., Song, W., Zen, Y., & Shi, X. (2025). Short-Term Power Load Forecasting Based on CEEMDAN-WT-VMD Joint Denoising and BiTCN-BiGRU-Attention. Electronics, 14(9), 1871. https://doi.org/10.3390/electronics14091871