Multi-Step Ahead Short-Term Electricity Load Forecasting Using VMD-TCN and Error Correction Strategy
Abstract
:1. Introduction
- According to the statistic-based STLF methods, it is assumed that the current observation is a linear combination of its historical values. Typically, those statistic-based methods include the Holts–Winters (HW) method and the autoregressive moving average (ARMA) methods and its variants. The HW method relies on exponential smoothing to encode historical values that are then used to forecast load for the present and future. For example, in 2011, Taylor adopted five exponentially weighted methods to forecast intraday load, with these methods used in combination to produce the best possible result [5]. In 2018, Mi et al., proposed a hybrid model, and applied the HW method to smooth the original load series [6]. The ARMA is aimed to describe the autocorrelation in time series data. For example, in literature [7,8,9], ARMA, ARIMA, and seasonal ARIMA were used, respectively, to forecast hourly load. However, the dynamic nature of load series hindered statistic-based models from representing its dynamics and capturing the inner non-linearity [10].
- The ML-based STLF methods are operated by learning and mining the historical load features and establishing a prediction model for load forecasting. The commonly adopted approaches include support vector regression (SVR) and artificial neural networks (ANNs). For example, Chen et al., put forward an SVR model to predict the demand response baseline for office buildings [11]. ANNs perform well in feature extraction and nonlinear mapping [12], which is effective in addressing the weaknesses of the time series model. The primary neural networks applied for short-term forecasting include the multi-layer perceptron (MLP) [13] and the feedforward deep neural network (DNN) [14]. With the advancement of machine learning, the recurrent neural network (RNN) has been demonstrated as one of the most effective models for processing time series data. For example, in literature [15,16], the electric load forecasting models based on LSTM were proposed, with the results showing that LSTM can produce a better performance in prediction and achieve a higher accuracy compared to primary ANNs. As a variant of LSTM, the gated recurrent unit (GRU) features a simpler network structure. In [17,18], GRU was introduced for short-term load forecasting. However, it remains necessary to enhance the performance in feature extraction due to the lack of convolution in the model. To address these problems, convolutional neural networks (CNN) have been adopted for their excellent capability of feature extraction. In [19,20], a CNN-LSTM model and a CNN-GRU model were proposed, with CNN layers used for feature extraction from the input data and RNN layers used for sequence learning. As suggested by the research results, CNN hybrid methods outperform the RNN-based model in terms of accuracy. More recently, a specialised CNN architecture known as temporal convolutional networks (TCN) has received popularity due to their applicability to process time series data [21], with the results showing that TCN outperforms LSTM in the accuracy of forecasting. Distinct from one-dimensional CNN [22], TCN relies on stacked CNN and dilated casual convolution, which enables the extraction of time series dependencies between the long intervals of historical data [23].
- According to the hybrid (statistics-ML-based) approach, statistical theory is applied to decompose time series data, ML methods are used to analyse and predict the decomposition, and finally the prediction results of each decomposition part are aggregated [24]. For example, Hasmat Malik et al., proposed a hybrid model where empirical mode decomposition (EMD) and neural networks were combined for multi-step ahead load forecasting [25]. Jian Li et al., integrated empirical mode decomposition (EEMD) with LSTM [26]. Zhang et al., put forward a load prediction method based on complete ensemble empirical mode decomposition (CEEMD) and LSTM [27], with the results suggesting that hybrid approaches are conducive to improving the accuracy of forecasting. However, there are two technical problems with the EMD-like decomposition algorithms. On the one hand, the number of their decompositions is inconsistent when a complex time series is decomposed, and it is difficult to determine the number of predictive models for the subsequent compilation of the program, which will lead to interruptions. On the other hand, EMD-like algorithms are usually disadvantaged by such problems as boundary effects, mode overlap, and the sensitivity to noise, which may affect the final outcome of forecasting. As a novel multiresolution technique intended for signal processing, variational mode decomposition (VMD) [28] is entirely non-recursive and has a solid theoretical foundation. According to the experimental results shown in [29], VMD outperforms CEEMD in terms of decomposition. By decomposing the original series into linear and nonlinear parts, VMD reduces the difficulty of forecasting and ensures the high level of prediction accuracy [30]. However, the application of VMD remains problematic in reality, especially when it comes to determining the hyperparameters [31]. To solve this problem, there have been different evaluation methods proposed that take into account the number of decomposition modes. For example, Li Y et al., used the ratio of residual energy to determine the optimal decomposed number [32]. However, this method produces too many decomposed subsequences, which may lead to it being extremely time consuming. In [33], permutation entropy (PE) was applied to minimize the complexity of subsequences. However, PE is not a comprehensive indicator as it ignores the amplitude of subsequences. To a large extent, validating the VMD model for decomposing data hinges on the purpose of the experiment. Therefore, it is essential to adopt an appropriate evaluation method for the VMD model.
- In the process of data pre-processing, a novel approach was proposed to determine the most suitable VMD decomposition number and penalty factor based on weighted permutation entropy (WPE). As a more comprehensive indicator, WPE was used to evaluate the time complexity and amplitude of the decomposed subsequences.
- A VMD-TCN model framework was constructed. VMD, as a competitive method of signal decomposition, was applied to reveal the irregular characteristics of the original series. The TCN network, as a method of deep learning, was adopted to explore the hidden high-level interdependence of the electrical load data, which leads to the highly effective prediction.
- An error correction strategy was applied. For the error series between the primary forecasting results and the actual load, an extra TCN model was established again for the purpose of learning and forecasting. The final forecasting results were obtained by summing the error correction values and primary prediction results.
2. Methodology
2.1. Variational Mode Decomposition
- For each subsequence, Hilbert transform is applied to calculate its analytical signal. Then, obtain its unilateral frequency spectrum
- Estimate the center frequency of each mode by hybrid exponential tuned to shift it to the baseband
- Calculate the bandwidth of each modal signal based on Gaussian smoothing, and the constrained variational problem could be described as
- In order to make the above constrained variational problem unconstrained, quadratic penalty factor and Lagrangian multiplication operator are introduced, where maintains the reconstruction precision of the signal and maintains the strictness of the constraint. The extended Lagrangian expression is shown in (4):
- Alternating direction method of multipliers (ADMM) is adopted to solve the original minimization problem. In the expanded Lagrange expression, , , and are alternately updated to find the “saddle point”, where could be transformed into the frequency domain using the Fourier isometric transformation
2.2. Weighted Permutation Entropy
2.3. Temporal Convolutional Network
3. The Proposed Forecasting Model
- Determine the optimal decomposition number K and penalty factor . Based on permutation entropy, the hyperparameters are determined by minimizing the time complexity of the decomposed subsequences. Considering the variation in amplitudes of the subsequences, the high amplitudes also have a relatively more significant impact on the final prediction results. Therefore, a comprehensive indicator is introduced to evaluate the effect of different hyperparameter values, and the hyperparameter with the minimum is treated as the optimal.
- VMD decomposition. The original load sequence is denoted as , where n represents the number of observations. There are K intrinsic mode functions obtained based on VMD, and they are denoted as . Then, each subsequence is normalized to (0, 1), respectively.
- Split train set and test set. A moving window scheme is used to create the input–output pairs that will be fed into neural network. Figure 6 illustrates the process of applying the moving window over the complete time series. The input data of each sample can be represented as , where L refers to the sequence length. The label of each sample can be represented as , where S represents the forecasting steps. When different values of S are chosen, the S step future loads can be predicted.
- Each subsequence is trained and predicted using a TCN model. For neural networks, weights updating is a non-convex optimization problem, as the 40 values of weights are related to the convergence efficiency and generalization ability of the network [41]. For the convolutional layers in TCN, weights are randomly initialized through a Gaussian distribution with a mean of 0 and a variance of 0.01. The primary forecasting results are obtained by aggregating the prediction of each TCN model. can be expressed as follows.
- Apply the error correction strategy. According to the central limit theorem, a certain pattern of errors between the actual historical data and the primary results can be found out. The distribution of error series conforms to the Gaussian distribution. Thus, an error correction strategy is applied. The trained forecasting model is applied to predict the train set, and forecasting values for the train set are obtained, which denotes as . The error series for the train set can be expressed as
- By comparing the final prediction result with the actual load on the test set, performance indicators are calculated to evaluate the model.
4. Experiment and Discussion
4.1. Experimental Dataset
4.2. Performance Indicators
- RMSE is defined as follows:
- MAPE is defined as follows:
- R2 is defined as follows.
4.3. Training Model
4.4. Experimental Result
4.5. Comparison
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Kuster, C.; Rezgui, Y.; Mourshed, M. Electrical load forecasting models: A critical systematic review. Sustain. Cities Soc. 2017, 35, 257–270. [Google Scholar] [CrossRef]
- Malik, H.; Fatema, N.; Iqbal, A. Intelligent Data-Analytics for Condition Monitoring: Smart Grid Applications; Elsevier: Amsterdam, The Netherlands, 2021. [Google Scholar]
- Soliman, S.A.; Al-Kandari, A.M. Dynamic Electric Load Forecasting: Modeling and Model Construction; Elsevier: Amsterdam, The Netherlands, 2010. [Google Scholar]
- Petropoulos, F.; Spiliotis, E. The wisdom of the data: Getting the most out of univariate time series forecasting. Forecasting 2021, 3, 29. [Google Scholar] [CrossRef]
- Taylor, J.W. Short-term load forecasting with exponentially weighted methods. IEEE Trans. Power Syst. 2011, 27, 458–464. [Google Scholar] [CrossRef]
- Mi, J.; Fan, L.; Duan, X.; Qiu, Y. Short-term power load forecasting method based on improved exponential smoothing grey model. Math. Probl. Eng. 2018, 2018, 3894723. [Google Scholar] [CrossRef] [Green Version]
- Pappas, S.S.; Ekonomou, L.; Karampelas, P.; Karamousantas, D.C.; Katsikas, S.K.; Chatzarakis, G.E.; Skafidas, P.D. Electricity demand load forecasting of the Hellenic power system using an ARMA model. Electr. Power Syst. Res. 2010, 80, 256–264. [Google Scholar] [CrossRef]
- Wu, F.; Cattani, C.; Song, W.; Zio, E. Fractional ARIMA with an improved cuckoo search optimization for the efficient Short-term power load forecasting. Alex. Eng. J. 2020, 59, 3111–3118. [Google Scholar] [CrossRef]
- Tarsitano, A.; Amerise, I.L. Short-term load forecasting using a two-stage sarimax model. Energy 2017, 133, 108–114. [Google Scholar] [CrossRef]
- Lara-Benítez, P.; Carranza-García, M.; Riquelme, J.C. An experimental review on deep learning architectures for time series forecasting. Int. J. Neural Syst. 2021, 27, 458–464. [Google Scholar] [CrossRef]
- Chen, Y.; Xu, P.; Chu, Y.; Li, W.; Wu, Y.; Ni, L.; Bao, Y.; Wang, K. Short-term electrical load forecasting using the Support Vector Regression (SVR) model to calculate the demand response baseline for office buildings. Appl. Energy 2017, 195, 659–670. [Google Scholar] [CrossRef]
- Wang, R.; Li, C.; Fu, W.; Tang, G. Deep learning method based on gated recurrent unit and variational mode decomposition for short-term wind power interval prediction. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 3814–3827. [Google Scholar] [CrossRef]
- Cerjan, M.; Matijaš, M.; Delimar, M. Dynamic hybrid model for short-term electricity price forecasting. Energies 2014, 7, 3304–3318. [Google Scholar] [CrossRef]
- Marcjasz, G.; Lago, J.; Weron, R. Neural networks in day-ahead electricity price forecasting: Single vs. multiple outputs. arXiv 2020, arXiv:2008.08006. [Google Scholar] [CrossRef]
- Muzaffar, S.; Afshari, A. Short-term load forecasts using LSTM networks. Energy Procedia 2019, 158, 2922–2927. [Google Scholar] [CrossRef]
- Kong, W.; Dong, Z.Y.; Jia, Y.; Hill, D.J.; Xu, Y.; Zhang, Y. Short-term residential load forecasting based on LSTM recurrent neural network. IEEE Trans. Smart Grid 2017, 10, 841–851. [Google Scholar] [CrossRef]
- Xiuyun, G.; Ying, W.; Yang, G.; Chengzhi, S.; Wen, X.; Yimiao, Y. Short-term load forecasting model of gru network based on deep learning framework. In Proceedings of the 2018 IEEE Conference on Energy Internet and Energy System Integration, Beijing, China, 20–22 October 2018. [Google Scholar] [CrossRef]
- Zheng, J.; Chen, X.; Yu, K.; Gan, L.; Wang, Y.; Wang, K. Short-term power load forecasting of residential community based on GRU neural network. In Proceedings of the 2018 International Conference on Power System Technology, Guangzhou, China, 6–9 November 2018. [Google Scholar] [CrossRef]
- Alhussein, M.; Aurangzeb, K.; Haider, S.I. Hybrid CNN-LSTM model for short-term individual household load forecasting. IEEE Access 2020, 8, 180544–180557. [Google Scholar] [CrossRef]
- Sajjad, M.; Khan, Z.A.; Ullah, A.; Hussain, T.; Ullah, W.; Lee, M.Y.; Baik, S.W. A novel CNN-GRU-based hybrid approach for short-term residential load forecasting. IEEE Access 2020, 8, 143759–143768. [Google Scholar] [CrossRef]
- Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar] [CrossRef]
- Zhang, J.; Liu, C.; Ge, L. Short-Term Load Forecasting Model of Electric Vehicle Charging Load Based on MCCNN-TCN. Energies 2022, 15, 2633. [Google Scholar] [CrossRef]
- Lara-Benítez, P.; Carranza-García, M.; Luna-Romera, J.M.; Riquelme, J.C. Temporal convolutional networks applied to energy-related time series forecasting. Appl. Sci. 2020, 10, 2322. [Google Scholar] [CrossRef] [Green Version]
- Yan, J.; Mu, L.; Wang, L.; Ranjan, R.; Zomaya, A.Y. Temporal convolutional networks for the advance prediction of ENSO. Sci. Rep. 2020, 10, 8055. [Google Scholar] [CrossRef]
- Malik, H.; Alotaibi, M.A.; Almutairi, A. A new hybrid model combining EMD and neural network for multi-step ahead load forecasting. J. Intell. Fuzzy Syst. 2022, 42, 1099–1114. [Google Scholar] [CrossRef]
- Li, J.; Deng, D.; Zhao, J.; Cai, D.; Hu, W.; Zhang, M.; Huang, Q. A novel hybrid short-term load forecasting method of smart grid using MLR and LSTM neural network. IEEE Trans. Ind. Inform. 2020, 17, 2443–2452. [Google Scholar] [CrossRef]
- Zhang, W.; Huang, W. Multivariate load prediction method for integrated energy system based on CEEMD-LSTM. In Proceedings of the International Symposium on Geographic Information, Energy and Environmental Sustainable Development, Tianjin, China, 26–27 December 2021. [Google Scholar] [CrossRef]
- Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal. Process. 2013, 62, 531–544. [Google Scholar] [CrossRef]
- Yan, X.; Jia, M. Application of CSA-VMD and optimal scale morphological slice bispectrum in enhancing outer race fault detection of rolling element bearings. Mech. Syst. Signal. Process. 2019, 122, 56–86. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhao, Y.; Kong, C.; Chen, B. A new prediction method based on VMD-PRBF-ARMA-E model considering wind speed characteristic. Energy Convers. Manag. 2020, 203, 112254. [Google Scholar] [CrossRef]
- Zhang, Z.; Hong, W.C. Application of variational mode decomposition and chaotic grey wolf optimizer with support vector regression for forecasting electric loads. Knowl. Based Syst. 2021, 228, 107297. [Google Scholar] [CrossRef]
- Liu, Y.; Yang, C.; Huang, K.; Gui, W. Non-ferrous metals price forecasting based on variational mode decomposition and LSTM network. Knowl. Based Syst. 2020, 188, 105006. [Google Scholar] [CrossRef]
- Zhang, X.; Tuo, W.; Song, C. Application of MEEMD-ARIMA combining model for annual runoff prediction in the Lower Yellow River. J. Water Clim. Chang. 2020, 11, 865–876. [Google Scholar] [CrossRef] [Green Version]
- Hong, T.; Pinson, P.; Fan, S.; Zareipour, H.; Troccoli, A.; Hyndman, R.J. Probabilistic energy forecasting: Global energy forecasting competition 2014 and beyond. Int. J. Forecast. 2016, 32, 896–913. [Google Scholar] [CrossRef] [Green Version]
- Liu, H.; Mi, X.; Li, Y. Smart multi-step deep learning model for wind speed forecasting based on variational mode decomposition, singular spectrum analysis, LSTM network and ELM. Energy Convers. Manag. 2018, 159, 54–64. [Google Scholar] [CrossRef]
- Keller, K.; Mangold, T.; Stolz, I.; Werner, J. Permutation entropy: New ideas and challenges. Entropy 2017, 19, 134. [Google Scholar] [CrossRef] [Green Version]
- Fadlallah, B.; Chen, B.; Keil, A.; Príncipe, J. Weighted-permutation entropy: A complexity measure for time series incorporating amplitude information. Phys. Rev. E 2013, 87, 022911. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Hewage, P.; Behera, A.; Trovati, M.; Pereira, E.; Ghahremani, M.; Palmieri, F.; Liu, Y. Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station. Soft Comput. 2020, 24, 16453–16482. [Google Scholar] [CrossRef] [Green Version]
- Gan, Z.; Li, C.; Zhou, J.; Tang, G. Temporal convolutional networks interval prediction model for wind speed forecasting. Electr. Power Syst. Res. 2021, 191, 106865. [Google Scholar] [CrossRef]
- Chen, Y.; Kang, Y.; Chen, Y.; Wang, Z. Probabilistic forecasting with temporal convolutional neural network. Neurocomputing 2020, 399, 491–501. [Google Scholar] [CrossRef] [Green Version]
- Qiu, X.P. Neural Networks and Deep Learning; China Machine Press: Beijing, China, 2020. [Google Scholar]
= 100 | = 500 | = 1000 | = 2000 | = 3000 | = 4000 | = 5000 | |
---|---|---|---|---|---|---|---|
= 2 | 0.9675 | 0.9680 | 0.9686 | 0.9631 | 0.9522 | 0.9355 | 0.9978 |
= 3 | 0.9960 | 0.9970 | 0.9969 | 0.9962 | 0.9939 | 0.9894 | 0.9851 |
= 4 | 0.7349 | 0.7354 | 0.7258 | 0.6921 | 0.9026 | 0.6808 | 0.8088 |
= 5 | 0.6375 | 0.6007 | 0.5921 | 0.6327 | 0.5773 | 0.6236 | 0.5776 |
= 6 | 0.5708 | 0.5738 | 0.5772 | 0.5390 | 0.5104 | 0.5642 | 0.5634 |
= 7 | 0.5696 | 0.5699 | 0.5699 | 0.5659 | 0.5671 | 0.5222 | 0.5360 |
= 8 | 0.5410 | 0.5687 | 0.5667 | 0.5665 | 0.5670 | 0.5644 | 0.5411 |
Model | Forecast Steps | ||||||||
---|---|---|---|---|---|---|---|---|---|
6 Steps | 12 Steps | 24 Steps | |||||||
MAPE/ % | RMSE/ kW | R2 | MAPE/ % | RMSE/ kW | R2 | MAPE/ % | RMSE/ kW | R2 | |
TCN | 1.545 | 66.617 | 0.9826 | 2.553 | 105.911 | 0.9561 | 3.452 | 138.850 | 0.9246 |
VMD + TCN | 0.711 | 29.645 | 0.9965 | 0.861 | 31.873 | 0.9963 | 1.117 | 43.439 | 0.9924 |
VMD + TCN + EC | 0.274 | 12.452 | 0.9994 | 0.326 | 16.296 | 0.9986 | 0.405 | 17.81 | 0.9977 |
Model | 6 Steps | 12 Steps | 24 Steps | ||||||
---|---|---|---|---|---|---|---|---|---|
MAPE/ % | RMSE/ kW | R2 | MAPE/ % | RMSE/ kW | R2 | MAPE/ % | RMSE/ kW | R2 | |
LSTM | 1.875 | 82.325 | 0.9737 | 3.316 | 135.77 | 0.9279 | 3.828 | 170.556 | 0.9146 |
GRU | 1.881 | 84.123 | 0.9723 | 3.244 | 129.238 | 0.9347 | 3.924 | 156.930 | 0.9037 |
TCN | 1.545 | 66.617 | 0.9826 | 2.553 | 105.911 | 0.9561 | 3.452 | 138.850 | 0.9246 |
VMD-LSTM | 1.351 | 63.965 | 0.9839 | 1.856 | 79.211 | 0.9755 | 2.801 | 111.853 | 0.9511 |
VMD-GRU | 1.373 | 64.035 | 9.9836 | 1.735 | 75.846 | 0.9775 | 2.949 | 120.529 | 0.9432 |
VMD-TCN | 0.711 | 29.645 | 0.9965 | 0.861 | 31.873 | 0.9963 | 1.117 | 43.439 | 0.9924 |
Model | 6 Steps | 12 Steps | 24 Steps |
---|---|---|---|
LSTM | 448.74s | 424.59s | 436.17s |
GRU | 374.29s | 361.78s | 368.92s |
TCN | 63.17s | 67.28s | 65.56s |
VMD-LSTM | 2629.76s | 2686.79s | 2647.61s |
VMD-GRU | 2113.85s | 2241.80s | 2286.35s |
VMD-TCN | 275.36s | 280.62s | 280.09s |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, F.; Zhou, H.; Li, Z.; Zhao, K. Multi-Step Ahead Short-Term Electricity Load Forecasting Using VMD-TCN and Error Correction Strategy. Energies 2022, 15, 5375. https://doi.org/10.3390/en15155375
Zhou F, Zhou H, Li Z, Zhao K. Multi-Step Ahead Short-Term Electricity Load Forecasting Using VMD-TCN and Error Correction Strategy. Energies. 2022; 15(15):5375. https://doi.org/10.3390/en15155375
Chicago/Turabian StyleZhou, Fangze, Hui Zhou, Zhaoyan Li, and Kai Zhao. 2022. "Multi-Step Ahead Short-Term Electricity Load Forecasting Using VMD-TCN and Error Correction Strategy" Energies 15, no. 15: 5375. https://doi.org/10.3390/en15155375
APA StyleZhou, F., Zhou, H., Li, Z., & Zhao, K. (2022). Multi-Step Ahead Short-Term Electricity Load Forecasting Using VMD-TCN and Error Correction Strategy. Energies, 15(15), 5375. https://doi.org/10.3390/en15155375