1. Introduction
With ongoing global warming and rising fossil fuel prices, governments have vigorously implemented global carbon neutrality policies to reduce the reliance on fossil fuels and encourage the deployment of renewable energy [
1]. As a mainstream energy source, wind energy is the key to energy transformation because it is recyclable, economical and abundant [
2]. Despite some challenges in 2023, such as global supply chain disruptions and intensifying global energy crises, wind energy maintained high-quality and fast-paced development. For the first time, 1 TW of energy was generated, thereby providing a solid foundation for sustainable social development. However, the randomness of wind speed would result in fluctuations in the voltage and output power of the wind power generation system, resulting in negative effects on the stability of the power grid [
3,
4]. Therefore, it is crucial to research high-precision wind speed prediction models, which can improve power system stability, optimize the wind power generation plan, and reduce the dispatching cost of the power system [
5].
Generally, wind speed forecasting models can be divided into three categories: statistical models, physical models and hybrid models [
6]. Among them, the statistical models are dedicated to calculating the relationship between the input and output variables by mathematical statistical methods, such as the autoregressive moving average (ARMA) [
7], the autoregressive integrated moving average (ARIMA) [
8], and the fractional auto regressive integrated moving average (f-ARIMA) [
9]. Although the statistical models have a high prediction performance for linear data, they are limited in processing nonlinear series [
10]. Contrastively, numerical weather prediction (NWP), as a representative of physical models, is adopted to simulate wind speed trends using meteorological data and geographical parameters, especially for 48–72 h. With the development of NWP, there are now many models for forecasting wind speed, which mainly include high-resolution limited area models (HIRLAMs) [
11], fifth-generation mesoscale models (MM5s) [
12], and weather research and forecasting (WRF) models [
13,
14]. Nevertheless, the model structure, the inputs and the physical scheme of NWP have uncertainty, resulting in a certain error between the forecasting value and the actual data [
15].
To correct the errors of NWP, hybrid models coupling data preprocessing methods, artificial intelligence (AI) models, and parameter optimization have been widely researched in recent years. Among them, data preprocessing consists of correlation analysis and a decomposition technique. Considering that the excessive input of meteorological variables collected by NWP will lead to information redundancy, correlation analysis is adopted to select the correlation factors. For instance, Chen et al. [
16] adopted the Pearson coefficient to evaluate the correlation between meteorological factors and wind speed for NWP correction, thereby selecting the appropriate variables as inputs. Wu et al. [
17] applied PCA to capture input data characteristics from NWP, where the experimental results demonstrate that PCA can reduce the computation complexity and improve the prediction accuracy. Moreover, due to the nonlinear nature of wind speed, decomposition methods are adopted to transform wind speed series into a set of subsequences. For example, Wang et al. [
18] applied several subseries obtain by CEEMDAN and the predicted data from NWP as inputs for a prediction model. Among them, CEEMDAN introduces adaptive white noise to achieve a satisfactory decomposition performance compared with those of EMD, EEMD and CEEMD. However, few studies have applied decomposition techniques to correct NWP, especially in the field of multivariate wind speed forecasting. In our study, CEEMDAN is implemented to transform multivariate series into multiple subsequences.
Furthermore, AI models have been widely adopted to correct NWP due of its strong nonlinear adaptability and learning ability [
19], such as artificial neural networks (ANNs) [
20], support vector regression (SVR) [
21], and long short-term memory (LSTM) [
22]. Among them, ANN achieves nonlinear adaptability and has a short running time. Moosavi et al. [
23] applied an ANN and random forest (RF) to study uncertainty quantification in NWP, which proved that ANNs outperform RF, and the running time of ANNs is shorter than that of RF. Nevertheless, the ANN prediction performance is unstable since the internal structure randomly generates inherent parameters. Contrastively, SVR has a good generalization ability in dealing with small samples. Cai et al. [
24] employed SVR to fuse the forecasting results obtained by NWP, where the experimental results affirm that SVR can effectively correct the error of NWP. However, the computational complexity of SVR surges with an increase in sample size [
25], which is unsuitable for large sample prediction. In contrast, LSTM utilizes memory modules to effectively capture the important parts of time series information in a large sample, thereby overcoming the limited short-term memory ability aroused by recurrent neural networks [
26]. For instance, Xu et al. [
27] employed LSTM for NWP error correction, in which the experimental results depict that LSTM can reduce the wind speed prediction error of NWP significantly. Han et al. [
15] applied bidirectional LSTM to extract the temporal correlation features from NWP, thereby improving accurate results and a better prediction effect. Although LSTM achieve high prediction accuracy, the prediction training time is longer than that of other AI models due to its complex internal structure and many weight parameters [
28]. As an enhanced version of LSTM, a shared weight gated memory network (SWGMN) is proposed for NWP correction in the field of wind speed forecasting, in which the proposed shared gate replaces the traditional forgetting gate, the input gate, and the output gate and shares the weights with different values, but of the same type, in the LSTM as a uniform weight, which leads to a simpler structure in the network and greatly reduces the forecasting time.
Since the model prediction accuracy is greatly affected by its own hyperparameters, it is essential for researchers to select appropriate hyperparameters for models using theoretical methods, which mainly include manual methods and optimization algorithms. Although the operating principle of manual methods are simple, they have strong subjectivity and limited experience, which easily lead to one calibrating a sub-optimal solution; thus, it should not be used in the field of actual wind speed prediction. Conversely, optimization algorithms employ the principle of gradient descent to calibrate the optimal solution route of the inherent parameters and the approaches to the optimal solution iteratively, thus avoiding the subjective deviation of human judgment. In recent years, a large number of optimization algorithms inspired by various biological behaviors have been proposed as research hotspots to improve prediction model accuracy, such as particle swarm optimization (PSO), beetle antennae search (BAS), and northern goshawk optimization (NGO). Among them, NGO has the best performance among all the classical optimization algorithms on 68 different objective functions, proving that NGO is highly capable of solving real-world problems [
29]. Although optimization algorithms have a strong search ability and a fast convergence rate, they cannot always determine the local optimal solution. Therefore, many researchers have studied some improved optimization algorithms to find a global optimal solution. Fu et al. [
30] proposed a combined optimization algorithm based on DE and a slime mold algorithm, which demonstrate that the proposed algorithm can enhance the global optimization ability. Referring to the previous studies, we propose improved northern goshawk optimization (INGO) based on a levy flight disturbance strategy and a nonlinear contraction strategy, in which the nonlinear contraction strategy is employed to speed up the convergence of this algorithm, and the levy flight disturbance strategy is used to enhance the ability of the algorithm to determine the local optimal solution.
In conclusion, a novel hybrid short-term wind speed forecasting framework is proposed based on the MIC, CEEMDAN, and the SWGMN with INGO for NWP correction. Firstly, the MIC is employed to acquire the correlation between the predicted variables and the error to select the correlation factors, in which the predicted variables with different domains, including the predicted wind speed and other meteorological variables, are obtained by NWP, and the error is calculated using the predicted and actual wind speeds. Then, the selected correlation factors and the error are decomposed into multiple subsequences by CEEMDAN. Subsequently, the multiple subsequences are input into the proposed SWGMN to forecast each subsequent error, in which the shared gate is proposed to replace the input gate, the forgetting gate and the output gate in the SWGMN. Furthermore, the proposed INGO coupling NGO, the levy flight disturbance strategy and the nonlinear contraction strategy is employed to optimize the parameters of the SWGMN. Ultimately, the wind speed forecasting values are obtained by accumulating the forecasted error of all the subsequences and the predicted wind speed from NWP. The framework optimizes the utilization of wind energy resources by improving the accuracy of wind speed prediction, thereby contributing to the sustainable development of society. The principal contributions are described as follows:
- (1)
The MIC is deployed to select the meteorological factors with different time domains. By eliminating the irrelevant variables and retaining the main components, the influence of the irrelevant factors on the SWGMN can be avoided to improve the prediction accuracy.
- (2)
The meteorological factors, the historical data and the error are decomposed into multiple subsequences by CEEMDAN to reduce data non-stationarity and boost the prediction performance.
- (3)
An improved network, the SWGMN, as a variant of LSTM, is proposed by replacing the forgetting gate, the input gate and the output gate with the shared gate, which achieves a good prediction accuracy and can avoid the long training process caused by LSTM; thus, it is more suitable for NWP correction in the field of short-term wind speed forecasting.
- (4)
The proposed INGO is developed to optimize the parameters of the SWGMN by combining the levy flight disturbance strategy and the nonlinear contraction strategy, which can determine the local optimal solution and accelerate the convergence speed, contributing to improving the generalization, prediction performance and stability of the SWGMN.
The rest of the paper is ordered as follows: Data preprocessing, the shared weight gated memory network, and improved northern goshawk optimization are described in
Section 2.
Section 3 shows the architecture of the proposed framework. The experimental results and analysis are presented in
Section 4.
Section 5 summarizes the conclusions.
3. Architecture of the Proposed Framework Coupling MIC, CEEMDAN, Shared Weight Gated Memory Network with Improved Northern Goshawk Optimization for NWP Correction
In this section, a compound short-term wind speed forecasting framework for NWP correction coupling the MIC, CEEMDAN and the SWGMN with INGO is proposed for NWP correction, as illustrated in
Figure 4. The implementation steps are depicted as follows:
Step 1: The predicted and actual variables are acquired by NWP and Open Weather, respectively, in which the predicted variables with different domains include the predicted wind speed and other meteorological variables.
Step 2: The MIC is employed to obtain the correlation between the predicted variables and the error, which is calculated using the actual and predicted wind speeds, after which the correlation factors strongly related to the error are selected from all the variables.
Step 3: The error and the correlation factors are decomposed into a series of subsequences by CEEMDAN.
Step 4: The proposed SWGMN is applied to forecast the error of each subsequence, respectively, in which the shared gate of the SWGMN is proposed to replace the input gate, the forgetting gate and the output gate of LSTM.
Step 5: The proposed INGO is employed to optimize the parameters of the SWGMN, which is composed of NGO, the levy flight disturbance strategy and the nonlinear contraction strategy.
Step 6: The wind speed forecasting results are attained by accumulating the forecasting error of all the subsequences and the predicted wind speed to achieve NWP correction.