Article

Wind Speed Forecasting Based on Phase Space Reconstruction and a Novel Optimization Algorithm

1 School of Communication and Information Engineering, Xi’an University of Posts & Telecommunication, Xi’an 710121, China
2 School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450000, China
* Author to whom correspondence should be addressed.
Sustainability 2024, 16(16), 6945; https://doi.org/10.3390/su16166945
Submission received: 8 July 2024 / Revised: 6 August 2024 / Accepted: 7 August 2024 / Published: 13 August 2024

Abstract

The wind power generation capacity is increasing rapidly every year, and the management of wind power needs to develop correspondingly. Accurate wind speed forecasting is essential for a wind power management system. However, it is not easy to forecast wind speed precisely, since wind speed time series data are usually nonlinear and fluctuant. This paper proposes a novel combined wind speed forecasting model based on PSR (phase space reconstruction), NNCT (no negative constraint theory) and a novel GPSOGA (a hybrid optimization algorithm that combines a global elite opposition-based learning strategy, particle swarm optimization and the genetic algorithm). SSA (singular spectrum analysis) is first applied to decompose the original wind speed time series into IMFs (intrinsic mode functions). Then, PSR is employed to reconstruct the intrinsic mode functions into the input and output vectors of the forecasting model. A combined forecasting model is proposed that contains a CBP (cascade back propagation) network, an RNN (recurrent neural network), a GRU (gated recurrent unit), and a CNNRNN (convolutional neural network combined with a recurrent neural network). The NNCT strategy is used to combine the outputs of the four predictors, and a new optimization algorithm is proposed to find the optimal combination parameters. To validate the performance of the proposed algorithm, we compare its forecasting results with those of different models on four datasets. The experimental results demonstrate that the forecasting performance of the proposed algorithm is better than that of the comparison models in terms of different indicators. The DM (Diebold–Mariano) test, Akaike’s information criterion and the Nash–Sutcliffe efficiency coefficient confirm that the proposed algorithm outperforms the comparison models.

1. Introduction

Wind energy is one of the most widely used sources of renewable energy [1]. According to the WWEA Half-Year Report 2023 [2], the wind power industry is slowly regaining momentum. In 2023, the new installed capacity of wind power reached 41.2 gigawatts, an increase of 38% compared to 2022. It is expected that the total new capacity for the entire year of 2023 will be at least 110 gigawatts and that the global wind power installed capacity will soon surpass the threshold of 1 million megawatts. With decreasing installation costs, developing countries are expected to leapfrog directly to wind energy [3]. Climate change is emerging as the world’s most significant environmental challenge due to its adverse impacts on the earth’s ecosystem and human welfare, including wide-ranging economic, ecological and social effects. Successive international commitments relating to energy and climate change [4] illustrate that renewable energies are significant to the worldwide energy supply. Limiting global warming to below 2 °C requires rapid decarbonization towards net-zero greenhouse gas emissions by 2050 [5]. Renewable energies are critical in achieving rapid decarbonization by replacing fossil energy.
Damousis and Dokopoulos [6] have pointed out that if the error of short-term wind speed prediction is kept below 10%, the generation capacity can increase by 30–100 MW and a profit of USD 100,000 can be obtained. The accurate forecasting of wind speed in wind farms is conducive to a timely adjustment of dispatching plans and the better planning of power grid dispatching departments. However, the instability and nonlinearity of wind speed limit the development of wind power and bring many obstacles to the wind power grid. Accurate wind speed prediction technology can reduce the impact of wind speed characteristics, which not only helps power grid operators and decision makers to plan and dispatch the power system in a timely manner but also reduces the failure risk of a wind power system and improves power quality [7]. Therefore, accurate wind speed prediction technology can effectively improve the stability of a wind power generation system.
To improve the accuracy of wind speed forecasting, decomposition algorithms and PSR are widely used in wind speed forecasting. Decomposition algorithms are usually used as feature extraction methods, including WD (wavelet decomposition) [8], EMD (empirical mode decomposition) [9], SSA [10] and others. Among these strategies, it is tough for EMD to achieve satisfactory results in analyzing non-stationary and nonlinear series, as its decomposition efficiency is susceptible to mode mixing problems [11]. SSA has been widely used in many fields, including climate, the environment, geography, social science, and finance. It consists of two complementary stages: decomposition and reconstruction [12]. The SSA technique can separate the original series into several independent components, including the trend, periodic components, oscillations, and noise, yielding a clean series. Thus, we use SSA to decompose the raw wind speed data into IMFs to decrease the non-stationarity. PSR can reconstruct the dynamics of a complex system by mapping its observed time series data into a multidimensional space. This technique is particularly useful for understanding the underlying structure of a time series [13]. PSR is employed to reconstruct the IMFs into the input and output vectors of a forecasting model after the raw wind speed series is decomposed.
In order to predict the wind speed accurately, a wide range of models have been proposed during the last decade. These models can be classified into four categories [14,15]: (1) physical models, (2) standard statistical models, (3) machine learning-based models, and (4) hybrid AI-based approaches. The practical application of current physical models is limited by challenges in coding the detailed physical model and the large computational resources needed to run them [16].
Compared with physical models, statistical models have a simple structure. In addition, a statistical model can effectively exploit the hidden information in historical data. ARMA (auto-regressive moving average) [17] and ARIMA (auto-regressive integrated moving average) [18] are the most widely used statistical models. Jiang et al. [19] proposed a method based on EMD and VAR (vector auto-regression) for wind speed forecasting. Singh et al. [20] proposed a new repeated wavelet transform-based ARIMA (RWT-ARIMA) model that can improve the accuracy of very short-term wind speed forecasting. The performance of these statistical models depends on the nonlinearity and non-stationarity of historical wind speed data. Most statistical models can hardly capture the nonlinear features in historical data, and the intermittent and stochastic characteristics of a wind speed series require more complex functions to capture nonlinear relations. Therefore, the performance of statistical models is not stable.
Artificial intelligence technology is developing rapidly. Many intelligent forecasting methods have been applied to wind power prediction and wind speed prediction. These models can achieve high performance in forecasting nonlinear and non-stationary wind speed time series data. Artificial intelligence models mainly include ANNs (artificial neural networks) [21], an SVM (support vector machine) [22], LSTM (long short-term memory) networks [23], a GRU [24], a GNN [25], and so on. Zhang et al. [23] proposed a shared weight LSTM to decrease the number of variables that need to be optimized and the training time of LSTM without significantly reducing prediction accuracy. Wei et al. designed a wind speed forecasting system consisting of a GRU and SNN (spiking neural network) with error correction and fluctuating featured composition strategies to fill the gaps of hybrid structures based on the SNN [24].
However, a single model cannot always keep up with the changes in a wind speed time series, because it is difficult for a single model to extract all the features of the series. To overcome the shortcomings of a single model, some researchers have proposed the NNCT strategy of integrating multiple prediction models. For example, Xiao et al. [26] proposed a genetic algorithm based on NNCT (GA–CM–NNCT) to forecast the hourly average wind speed at three wind turbines in Chengde, China. Zhang et al. [27] proposed a combined model that combines CEEMDAN (complete ensemble empirical mode decomposition with adaptive noise) and a flower pollination algorithm with chaotic local search. In their model, five neural networks and NNCT are employed for short-term wind speed forecasting. Wang et al. [28] adopted the NNCT method to integrate a BP (back propagation) neural network, SVM, ELM (extreme learning machine) and ARIMA to build a hybrid forecasting system for wind speed point forecasting and fuzzy interval forecasting. Niu et al. [29] proposed a hybrid model, which consists of CEEMDAN, an NNCT-based multi-objective grasshopper optimization algorithm, and several single models. Their experiments show that the NNCT strategy can improve the accuracy of the combined system.
In NNCT-based models, an algorithm is needed to optimize the weight coefficients of each single model. In recent years, the following algorithms have been commonly used to optimize parameters: PSO (particle swarm optimization) [30], a GA (genetic algorithm) [31], SA (simulated annealing) [32], the GWO (grey wolf optimizer) [33], the WOA (whale optimization algorithm) [34], and so on. Some researchers have combined various algorithms and made improvements. Wang et al. [35] proposed an algorithm combining PSO and the GSA (gravitational search algorithm) to optimize the prediction model. He et al. [36] designed the PSOGA, a hybrid particle swarm optimization genetic algorithm, to optimize the hyperparameters of the model. The PSOGA takes advantage of both PSO and a GA, with a fast convergence speed and high accuracy. Wang et al. [37] designed a combined wind speed forecasting system that employs a multiverse optimization algorithm to optimize the weights of each forecasting model. Zhou et al. [38] established a combined model that uses ELM, an RNN, LSTM, MLP (multi-layer perceptron) and SVM to forecast short-term wind speed and designed a modified multi-objective optimization algorithm to optimize the weights of the combined model.
From the above literature, we find that many decomposition algorithms are used for the preprocessing of wind speed data. Although preprocessing alleviates the non-stationary and nonlinear problems, a single prediction model cannot fully capture the characteristics of the data, whereas multiple models can capture them better. Existing optimization algorithms (such as PSO, the GA and the GWO) still have room for improvement in accuracy, convergence speed and stability, so an improved optimization algorithm needs to be proposed.
In this paper, we propose a new hybrid SSA-GPSOGA-NNCT algorithm based on SSA, PSR and the NNCT strategy. There are four forecasting models in the NNCT-based model: CBP, an RNN, a GRU and a CNNRNN. To optimize the hyperparameters in the NNCT-based model, we propose an optimization algorithm named the GPSOGA. The GPSOGA optimization method is composed of PSO, a GA and the GEOLS (global elite opposition-based learning strategy). The overall process of the proposed algorithm is as follows:
The original wind speed series is decomposed into multiple components by SSA. Each component is reconstructed by PSR, which divides it into the input and output vectors of the proposed algorithm. Each component is predicted by CBP, the RNN, the GRU and the CNNRNN, respectively. The prediction results of the four methods are combined by the NNCT strategy. The weight coefficients of each prediction method are optimized by the GPSOGA algorithm, and finally, all components are accumulated to obtain the predicted results. The main innovations and contributions of this paper are as follows:
1. SSA and PSR are used to construct the input and output of the proposed forecasting algorithms. SSA and PSR can effectively understand the underlying structure and extract the hidden features of a nonlinear time series.
2. A new combination strategy is proposed, which adopts the NNCT multi-model fusion strategy to combine CBP, RNN, GRU and CNNRNN prediction models. The proposed hybrid algorithm achieves the mean absolute error (MAE) for one-step-ahead predictions on four datasets as follows: 0.0156, 0.0453, 0.0182, and 0.1025. For three-step-ahead predictions, the MAE values are 0.0435, 0.1028, 0.444, and 0.3071. Five-step-ahead predictions yield MAE values of 0.0767, 0.1819, 0.0816, and 0.486. The combined algorithm can overcome the limitations of a single model in the prediction process and improve the accuracy of the prediction results.
3. A new GPSOGA optimization algorithm is proposed, in which the particles in PSO carry gene sequences. The proposed optimization algorithm combines the advantages of PSO and the GA, with a fast convergence speed and high accuracy.
4. The DM test, Akaike’s information criterion and the Nash–Sutcliffe efficiency coefficient are employed to compare the performance of the proposed algorithm with that of different models.
The rest of this paper is organized as follows. In Section 2, we provide a short overview of the approaches involved in this paper. The proposed GPSOGA optimization algorithm is introduced in Section 3. In Section 4, a detailed description of the proposed forecasting algorithm is presented. Section 5 describes the experiment results and analysis. The discussions are provided in Section 6. Finally, we conclude this paper in Section 7.

2. Methodology

This section introduces the methods used in this paper, including SSA, PSR, CBP, an RNN, a GRU and a CNNRNN.

2.1. Singular Spectrum Analysis

SSA is a nonparametric spectrum estimation algorithm [39]. It is widely used in time series analysis; it decomposes a time series into several meaningful components based on the singular value decomposition of the series. As a mature signal processing method, it has the advantages of being unconstrained by the sine wave assumption and not requiring prior information. It can extract more useful information from the original sequence, thereby improving the signal-to-noise ratio of the original sequence. SSA has been widely used to identify and extract low-frequency and high-frequency components from a time series. Consider the series $\tilde{Y} = [\tilde{y}_1, \tilde{y}_2, \ldots, \tilde{y}_N]$ available at $N$ time points, with the window length $w$ set in the range $2 \le w \le N/2$. If $n = N - w + 1$, then the trajectory matrix $X$ can be defined as follows:
$$X = \begin{pmatrix} \tilde{y}_1 & \tilde{y}_2 & \cdots & \tilde{y}_w \\ \tilde{y}_2 & \tilde{y}_3 & \cdots & \tilde{y}_{w+1} \\ \vdots & \vdots & \ddots & \vdots \\ \tilde{y}_n & \tilde{y}_{n+1} & \cdots & \tilde{y}_N \end{pmatrix} \tag{1}$$
SVD (singular value decomposition) is employed to decompose the trajectory matrix $X$. Let $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_w$ and $U_1, U_2, \ldots, U_w$ be the leading eigenvalues and eigenvectors of the matrix $X X^T$. The right-singular vectors are $V_i = X^T U_i / \sqrt{\lambda_i}\;(i = 1, 2, \ldots, w)$, and the $i$th eigentriple $(\sqrt{\lambda_i}, U_i, V_i)$ is obtained by the SVD of matrix $X$. Each sub-matrix can be derived by $X_i = \sqrt{\lambda_i}\, U_i V_i^T$.
Each matrix is transferred back into a time series by diagonal averaging. Let $x_{jk}\;(1 \le j \le n,\ 1 \le k \le w)$ denote the elements of the sub-matrix $X_i$ (note that $w \le n$ since $w \le N/2$). The construction of the subseries $Z_i = [z_1, z_2, \ldots, z_N]\;(i = 1, 2, \ldots, w)$ via diagonal averaging can be defined by Equation (2), and the subseries are arranged in descending order according to their eigenvalues.
$$z_k = \begin{cases} \dfrac{1}{k} \displaystyle\sum_{m=1}^{k} x_{k-m+1,\,m}, & 1 \le k < w \\[2mm] \dfrac{1}{w} \displaystyle\sum_{m=1}^{w} x_{k-m+1,\,m}, & w \le k \le n \\[2mm] \dfrac{1}{N-k+1} \displaystyle\sum_{m=k-n+1}^{N-n+1} x_{k-m+1,\,m}, & n < k \le N \end{cases} \tag{2}$$
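The two SSA stages above, embedding plus SVD and diagonal averaging, can be illustrated with a minimal NumPy sketch (not the authors' implementation; the window length is a free parameter):

```python
import numpy as np

def ssa_decompose(series, window):
    """Decompose a 1-D series into SSA subseries, one per eigentriple."""
    N = len(series)
    n = N - window + 1
    # Trajectory (Hankel) matrix: row j holds series[j : j + window]
    X = np.column_stack([series[i:i + n] for i in range(window)])  # (n, window)
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    components = []
    for i in range(len(s)):
        Xi = s[i] * np.outer(U[:, i], Vt[i])  # rank-1 sub-matrix X_i
        # Diagonal averaging: mean over each anti-diagonal gives one time point
        comp = np.array([np.mean(Xi[::-1, :].diagonal(k))
                         for k in range(-n + 1, window)])
        components.append(comp)
    return np.array(components)
```

Because SVD is exact and diagonal averaging is linear, the subseries sum back to the original series, which is a useful sanity check.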

2.2. Phase Space Reconstruction

To accurately forecast wind speed, it is necessary to fully grasp the inherent characteristics of wind speed time series. Because wind speed is affected by many factors, it has non-stationary, nonlinear and non-deterministic characteristics. By using PSR to map the wind speed time series into a high-dimensional space, the nonlinear characteristics and interactions of the underlying system can be revealed, which helps to forecast the future trend in wind speed. The essence of PSR is to reconstruct a one-dimensional time series into $d$-dimensional vectors with delay time $\tau$; by setting the delay time $\tau$ and embedding dimension $d$ appropriately, the unidimensional series can be reconstructed into a dynamic chaotic space [40].
In this paper, PSR is employed to construct the corresponding phase space matrices, which are subsequently fed into the forecasting models. The wind speed series are reconstructed by PSR into an input matrix $X_{input}$ and an output matrix $X_{output}$, which are illustrated as follows:
$$X_{input} = [X_1, X_2, \ldots, X_L]^T = \begin{pmatrix} x_1 & x_{1+\tau} & \cdots & x_{1+(d-1)\tau} \\ x_2 & x_{2+\tau} & \cdots & x_{2+(d-1)\tau} \\ \vdots & \vdots & \ddots & \vdots \\ x_L & x_{L+\tau} & \cdots & x_{L+(d-1)\tau} \end{pmatrix} \tag{3}$$
$$X_{output} = [x_{1+(d-1)\tau+t},\; x_{2+(d-1)\tau+t}, \ldots, x_N]^T \tag{4}$$
where $d$ represents the reconstructed dimension, $\tau$ represents the delay time, $t$ represents the number of directly predicted steps, $L$ represents the length of the input data, and $N$ indicates the total length of the time series data.
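The reconstruction described above can be sketched as a small NumPy helper (a hypothetical implementation for illustration; `d`, `tau` and `t` are the embedding dimension, delay time and prediction horizon defined above):

```python
import numpy as np

def phase_space_reconstruct(series, d, tau, t=1):
    """Build (input, output) pairs for t-step-ahead forecasting via PSR."""
    N = len(series)
    span = (d - 1) * tau            # index reach of one embedded vector
    L = N - span - t                # number of usable samples
    # Row i: [x_i, x_{i+tau}, ..., x_{i+(d-1)tau}]  (0-based indexing)
    X = np.array([series[i:i + span + 1:tau] for i in range(L)])
    y = series[span + t: span + t + L]   # targets x_{i+(d-1)tau+t}
    return X, y
```

For example, with `d=3`, `tau=2` and `t=1`, the first input row covers indices 0, 2 and 4, and its target is index 5.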

2.3. Cascade Backpropagation Network

The cascade backpropagation (CBP) network is based on the BP neural network. Cascade refers to connections between arbitrary layers of the neural network, not just between adjacent layers. For example, the input layer has a direct connection with the output layer, and it also has a connection with the hidden layer. This means that each layer not only receives the information provided by the previous layer but also obtains weight connections from the other layers in front of it. CBP also includes the characteristics of layer-by-layer training and progressive expansion, enabling the network to maintain predictive performance while dealing with more complex patterns and relationships.
Suppose the input vector is $X(t) = [x_{t1}, x_{t2}, \ldots, x_{th}]^T$ and the output vector is $Y(t) = [y_{t1}, y_{t2}, \ldots, y_{td}]^T$. The following equation defines CBP with $h$ inputs, $m$ hidden neurons and $d$ outputs:
$$y_{tk} = p\left(\sum_{i=1}^{h} M_{ki}\, x_{ti}\right) + g\left(\sum_{j=1}^{m} W_{kj}\, f\left(\sum_{i=1}^{h} V_{ji}\, x_{ti}\right)\right) \tag{5}$$
where $p$, $f$ and $g$ are activation functions that connect the input layer to the output layer, the input layer to the hidden layer, and the hidden layer to the output layer, respectively. $M_{ki}$ is the weight of the direct connection between input neuron $i$ and output neuron $k$, $W_{kj}$ is the weight between hidden neuron $j$ and output neuron $k$, and $V_{ji}$ is the weight between input neuron $i$ and hidden neuron $j$. $x_{ti}$ represents the input to neuron $i \in [1, h]$, and $y_{tk}$ represents output $k \in [1, d]$ at time $t$.
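A minimal forward pass matching this definition might look like the following sketch (the paper does not specify the activation functions, so `tanh` is assumed here for $p$, $f$ and $g$):

```python
import numpy as np

def cbp_forward(x, M, V, W, p=np.tanh, f=np.tanh, g=np.tanh):
    """CBP forward pass: a direct input-to-output (cascade) path plus the
    usual input -> hidden -> output path, summed at the output layer."""
    direct = p(M @ x)        # cascade connection: input straight to output
    hidden = f(V @ x)        # ordinary hidden-layer activation
    return direct + g(W @ hidden)
```

The design choice illustrated here is exactly the cascade idea: removing the `direct` term recovers an ordinary single-hidden-layer BP network.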

2.4. Recurrent Neural Network

An RNN is a very effective technique for processing time series due to its internal memory, which can remember the important features of the input sequential data [41]. Its hidden state can capture the time dependence in sequence data. By reusing the same network unit at each time-step of the sequence, an RNN enables information to propagate along the time dimension so that the sequence data can be modeled. In addition, the essential feature of an RNN is that it has an internal memory that memorizes previous data. In an RNN, the output from the previous time-step and the input from the current time-step are fed into the RNN cell, so that the current state of the model is influenced by its previous states. The following equation describes the function of a single RNN cell:
$$h_t = \tanh(W[h_{t-1}, x_t] + b) \tag{6}$$
where the hyperbolic tangent function ($\tanh$) scales the values into the range of −1 to +1, $W$ is the weight matrix, $b$ is the bias, and $h_t$ and $h_{t-1}$ are the hidden states at the current time-step and the previous time-step, respectively.
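The single-cell equation above translates directly into code (a sketch; the weight and bias shapes are illustrative):

```python
import numpy as np

def rnn_cell(h_prev, x_t, W, b):
    """One RNN step: h_t = tanh(W [h_{t-1}, x_t] + b).
    W multiplies the concatenation of the previous hidden state and the
    current input, so W has shape (hidden, hidden + input)."""
    return np.tanh(W @ np.concatenate([h_prev, x_t]) + b)
```

Iterating this cell over a sequence (feeding each output back in as `h_prev`) is what lets the current state depend on all previous states.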

2.5. Gated Recurrent Unit

The GRU was proposed by Cho et al. [42]. It is an effective variant of LSTM. LSTM has three gates, while a GRU only has two. A GRU stores and filters information through its update gate and reset gate. Therefore, some information is not deleted over time; rather, part of the information is retained with a certain probability and sent to the next unit. Thus, the problem that an RNN cannot process an overly long time series is solved [43].
A GRU is composed of a reset gate and update gate. The reset gate controls how much inconsequential information of the previous moment is filtered, and the update gate determines whether the most efficient message can enter the next GRU cell. The whole calculation processes of a GRU are expressed as below:
$$z_t = \sigma(W_z \cdot [h_{t-1}, x_t]) \tag{7}$$
$$r_t = \sigma(W_r \cdot [h_{t-1}, x_t]) \tag{8}$$
$$\tilde{h}_t = \tanh(W_h \cdot [r_t \odot h_{t-1}, x_t]) \tag{9}$$
$$h_t = (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t \tag{10}$$
where $r_t$ is the output of the reset gate, $z_t$ is the output of the update gate, $h_{t-1}$ and $h_t$ represent the previous and current hidden states, respectively, $\tanh$ is the activation function, $\tilde{h}_t$ is the candidate hidden state, $x_t$ is the current input, $\odot$ denotes elementwise multiplication, and $W_r$, $W_z$ and $W_h$ are weight matrices.
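The four GRU equations above translate almost line for line into code (a sketch; biases are omitted, as in the equations):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def gru_cell(h_prev, x_t, Wz, Wr, Wh):
    """One GRU step: update gate, reset gate, candidate state, new state."""
    concat = np.concatenate([h_prev, x_t])
    z = sigmoid(Wz @ concat)                                    # update gate z_t
    r = sigmoid(Wr @ concat)                                    # reset gate r_t
    h_tilde = np.tanh(Wh @ np.concatenate([r * h_prev, x_t]))   # candidate state
    return (1 - z) * h_prev + z * h_tilde                       # new hidden state
```

The last line makes the gating behavior concrete: when `z` is near 0 the old state passes through almost unchanged, which is how the GRU retains information over long spans.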

2.6. Convolutional Neural Network Combined with Recurrent Neural Network

A convolutional neural network (CNN) is composed of three kinds of layers: convolutional layers, pooling layers and fully connected layers. Through convolution and pooling operations, a CNN can automatically perform feature extraction and dimensionality reduction, effectively helping the model to capture local patterns and features in a time series. The convolutional layer is applied to excavate representative information from the input datasets; a CNN is usually used to extract features from the original data. The convolution commonly used by CNN models for processing a time series is one-dimensional convolution. The formula of the convolution layer is as follows:
$$R[c, t] = f(R_t \otimes K + b) = f\left(\sum_{j=1}^{h} K_j\, R_{t-j} + b_j\right)$$
where $R[c, t]$ is the output of the convolution layer, $R_t$ is the input data at time $t$, $K$ is a one-dimensional convolution kernel of length $h$, $\otimes$ denotes the convolution operation, and $b$ is the bias.
The pooling layer is used to reduce the complexity of mathematical computation by decreasing the dimension of a target variable. The fully connected layer is employed to forecast the object variable. Here, we use max pooling, and the formula for the pooling layer is as follows:
$$R_P = \max\left(R[c, t]_{[m, n]}\right)$$
where $R_P$ is the output of the pooling layer, and $[m, n]$ is the pooling window size set in the pooling layer.
Finally, the output of the CNN is obtained through the fully connected layer. The formula is as follows:
$$Z = W R_P + a$$
where $W$ is the weight matrix, $a$ is the bias, and $Z$ is the output of the fully connected layer, which serves as the input to the RNN.
In the CNNRNN framework, the CNN is utilized to extract the characteristics of a wind speed series. After the features of the data are extracted by the CNN, the RNN is applied to obtain the final forecasting results. In this paper, the wind speed series processed by two convolutional layers and a pooling layer is flattened into a vector and further transmitted to the RNN layer. Finally, the output is interpreted by the RNN, and a fully connected layer is employed for wind speed prediction. Figure 1 gives the specific procedure of the CNNRNN.
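The two CNN building blocks used here, one-dimensional convolution and max pooling, can be sketched as follows (illustrative helpers, not the authors' network; the kernel values are hypothetical):

```python
import numpy as np

def conv1d(x, kernel, bias=0.0):
    """Valid one-dimensional convolution (cross-correlation) over a series."""
    k = len(kernel)
    return np.array([np.dot(x[i:i + k], kernel) + bias
                     for i in range(len(x) - k + 1)])

def max_pool1d(x, size):
    """Non-overlapping max pooling with the given window size."""
    trimmed = x[: (len(x) // size) * size]   # drop the ragged tail, if any
    return trimmed.reshape(-1, size).max(axis=1)
```

In the CNNRNN, the pooled feature map would be flattened and handed to the RNN cell as its input sequence, mirroring the flow described above.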

3. The Proposed GPSOGA Optimization Algorithm

3.1. Particle Swarm Optimization

The PSO algorithm is a population-based stochastic optimization technique inspired by the collective behavior of natural organisms. It initializes a group of random particles (random solutions) and then finds the optimal solution through iteration. At each iteration, a particle updates itself by tracking two extreme values ($pbest$ and $gbest$). After finding these two optimal values, the particle updates its velocity and position using the formulas below.
$$v_i(t+1) = v_i(t) + c_1 \cdot rand \cdot (pbest_i - x_i(t)) + c_2 \cdot rand \cdot (gbest - x_i(t)) \tag{11}$$
$$x_i(t+1) = x_i(t) + v_i(t+1) \tag{12}$$
where $c_1$ and $c_2$ are two positive constant parameters controlling the step size, $v_i$ is the velocity of particle $i$, $x_i$ represents the position of particle $i$, $gbest$ represents the global optimal position found in the search process so far, $pbest_i$ is the individual optimal position of particle $i$, $rand$ is an independent uniformly distributed random variable in the range [0, 1], $t$ denotes the previous iteration and $t+1$ denotes the current iteration.
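One update step per Equations (11) and (12) can be sketched for a single particle as follows (the `c1` and `c2` values are illustrative, not taken from the paper):

```python
import numpy as np

def pso_step(x, v, pbest, gbest, c1=2.0, c2=2.0, rng=None):
    """One PSO velocity and position update for a single particle."""
    if rng is None:
        rng = np.random.default_rng()
    r1 = rng.random(x.shape)   # rand for the cognitive (pbest) term
    r2 = rng.random(x.shape)   # rand for the social (gbest) term
    v_new = v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
    x_new = x + v_new
    return x_new, v_new
```

Note that when a particle already sits at both `pbest` and `gbest`, the velocity is unchanged and the particle simply coasts.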

3.2. Genetic Algorithm

The genetic algorithm is a kind of heuristic algorithm. It searches for an optimal solution based on the notion of natural evolution and can generally obtain optimal results faster than standard optimization algorithms [44]. The core of the genetic algorithm mainly includes three steps: selection, crossover, and mutation. The optimal individual that conforms to the objective function is selected through loop iteration [45]. The operation process of the algorithm is generally as follows: (1) randomly initialize the population and set the relevant parameters of the model (including the population size, mutation rate, crossover rate, and maximum number of iterations); (2) calculate the fitness of the population, and then select, cross, and mutate all individuals of the population through the genetic operators; (3) iteratively update the fitness, and check the optimization objectives, constraints, and iterations; and (4) finally, generate an output when the maximum number of iterations is reached or the fitness no longer changes.
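The four-step process above can be condensed into a small GA sketch (a generic real-coded variant with tournament selection for illustration; the paper's GA operates on binary DNA strings inside PSO particles, and every parameter here is an assumption):

```python
import random

def genetic_algorithm(fitness, dim, pop_size=30, cx_rate=0.8, mut_rate=0.1,
                      generations=100, lo=-5.0, hi=5.0, seed=0):
    """Minimize `fitness` with selection, crossover and mutation plus elitism."""
    rnd = random.Random(seed)
    # (1) randomly initialize the population
    pop = [[rnd.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    best = min(pop, key=fitness)
    for _ in range(generations):
        nxt = []
        while len(nxt) < pop_size:
            # (2) selection: tournament of 3 picks each parent
            p1 = min(rnd.sample(pop, 3), key=fitness)
            p2 = min(rnd.sample(pop, 3), key=fitness)
            child = list(p1)
            if rnd.random() < cx_rate:            # crossover at a random point
                cut = rnd.randrange(dim)
                child = p1[:cut] + p2[cut:]
            if rnd.random() < mut_rate:           # mutation: reset one gene
                child[rnd.randrange(dim)] = rnd.uniform(lo, hi)
            nxt.append(child)
        pop = nxt
        # (3)/(4) track the best individual seen so far (elitism)
        best = min(pop + [best], key=fitness)
    return best
```

For example, minimizing the sphere function `sum(v * v for v in g)` in two dimensions drives the best individual toward the origin.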

3.3. Global Elite Opposition-Based Learning Strategy

EOL (elite opposition-based learning) is a strategy in the field of computational intelligence. The main idea is that, for a feasible solution, one calculates and evaluates the opposite solution at the same time and chooses the better of the two as the individual of the next generation [46]. The GEOLS is introduced in this paper, and it can promote the searching performance. Let $X_i = (x_{i,1}, x_{i,2}, \ldots, x_{i,D})$ be a point in the current population, where $D$ is the dimension of the problem space. Then, the opposition point $\breve{X}_i = (\breve{x}_{i,1}, \breve{x}_{i,2}, \ldots, \breve{x}_{i,D})$ is defined by the following equation:
$$\breve{x}_{i,j} = S \times (da_j + db_j) - x_{i,j} \tag{13}$$
where $x_{i,j} \in [a_j, b_j]$, $S \sim U[0, 1]$, and $S$ is a generalized factor. $da_j$ and $db_j$ are the dynamic boundaries, which can be defined as
$$da_j = \min_i(x_{i,j}), \qquad db_j = \max_i(x_{i,j}) \tag{14}$$
However, the corresponding opposition point can exceed the search boundary $[a_j, b_j]$. To solve this, the transformed individual is assigned a random value as follows:
$$\breve{x}_{i,j} = rand(da_j, db_j), \quad \text{if } \breve{x}_{i,j} < a_j \text{ or } \breve{x}_{i,j} > b_j \tag{15}$$
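Equations (13)–(15) amount to the following vectorized sketch (assuming, as the dynamic-boundary definition suggests, that $da_j$ and $db_j$ are taken per dimension over the current population):

```python
import numpy as np

def elite_opposition(X, rng=None):
    """Opposition points with dynamic boundaries for a population X (rows =
    individuals, columns = dimensions), per the GEOLS description."""
    if rng is None:
        rng = np.random.default_rng()
    da = X.min(axis=0)            # dynamic lower boundary da_j per dimension
    db = X.max(axis=0)            # dynamic upper boundary db_j per dimension
    S = rng.random()              # generalized factor S ~ U[0, 1]
    opp = S * (da + db) - X       # Equation (13)
    # Repair points that leave [da, db] with a random value inside the bounds
    out = (opp < da) | (opp > db)
    repair = da + (db - da) * rng.random(X.shape)
    opp[out] = repair[out]
    return opp
```

After the repair step every opposition point lies inside the dynamic boundaries, so the candidate can be compared against the original by fitness, as the EOL idea prescribes.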

3.4. The Proposed Optimization Algorithm

By combining the GEOLS algorithm with PSO and the GA, we propose a novel GPSOGA optimization algorithm to expand the global search capability for the proposed SSA-GPSOGA-NNCT algorithm. The details of the GPSOGA algorithm are as follows (Algorithm 1).
Algorithm 1: The pseudo code of the proposed GPSOGA algorithm.
Objective function: $\min \left\{ fitness = SSE = \sum_{i=1}^{M} (y_i - \hat{y}_i)^2 \right\}$
   /* $y_i$ and $\hat{y}_i$ denote the actual value and the forecasting value, respectively. */
   Input: Training set and validation set
   Output: Optimal weight coefficients of corresponding forecasting models
   Parameters:
    Max_iter — maximum iterations
    t — current iteration
    dim — dimensions of particles
    size — number of particles
    rand1, rand2 — random values in [0, 1]
    v_i — particle velocity
    x_i — particle position
    x_max — the maximum value of particle position
    x_min — the minimum value of particle position
    max_vel — maximum particle velocity
    x_best — the best position found by the particles in the population
    Crossover_Rate — the crossover probability
    Mutation_Rate — the mutation probability
    P — agent position generated by GEOLS
Initialize the position (x_i) of each particle according to size, dim, x_max and x_min
Initialize the fitness and velocity (v_i) of each particle according to x_i
t = 0
WHILE t < Max_iter DO
  The position of each particle is encoded in binary
  FOR EACH i (1 ≤ i ≤ size) DO
    IF rand1 < Crossover_Rate DO
      A particle is randomly selected from the population as the mother, a crossover point is randomly selected along the DNA length, and the mother's DNA sequence after that point is assigned to x_i
    END IF
    IF rand2 < Mutation_Rate DO
      A random location of the DNA is chosen and reversed
    END IF
  END FOR
  The position of each particle is decoded into decimal
  Calculate the elite agent position P by Equations (13)–(15)
  IF fitness(P) < fitness(x_i) DO
    x_i = P
  END IF
  FOR EACH i (1 ≤ i ≤ size) DO
    Each particle updates its x_i and v_i by Equations (11) and (12)
    IF v_i > max_vel DO
      v_i = max_vel
    ELIF v_i < −max_vel DO
      v_i = −max_vel
    END IF
    IF fitness(x_i) < fitness(x_best) DO
      x_best = x_i
    END IF
  END FOR
  t = t + 1
END WHILE
RETURN x_best

4. The Proposed SSA-GPSOGA-NNCT Algorithm

In this section, we introduce the overall process of the proposed SSA-GPSOGA-NNCT algorithm. The specific procedures are displayed in Figure 2. As shown in Figure 2, SSA-GPSOGA-NNCT mainly includes the following five steps:
Step 1: the original wind speed series are decomposed into several IMFs components by the SSA decomposition method.
Step 2: we apply PSR on IMFs, and the IMFs are reconstructed into input and output vectors.
Step 3: Four models (CBP, the RNN, the GRU and the CNNRNN) are applied as forecasting models, respectively, to predict the wind speed data that were reconstructed by PSR. The intermediate forecasting results of the four models are combined by the NNCT strategy.
Step 4: the weight coefficients of the corresponding single models in the NNCT strategy are optimized by the proposed GPSOGA optimization algorithm.
Step 5: the final forecasting results are obtained by integrating the weight coefficients and the intermediate forecasting results of the four models.
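The NNCT combination in Steps 3–5 reduces to a weighted sum of the four predictors' outputs, and the weights are exactly what the GPSOGA minimizes the SSE objective over (a sketch; under NNCT the weights may be negative and need not sum to one):

```python
import numpy as np

def nnct_combine(predictions, weights):
    """Weighted sum of several models' forecasts (weights unconstrained)."""
    return np.asarray(weights) @ np.asarray(predictions)

def sse_fitness(weights, predictions, actual):
    """The SSE objective minimized by the GPSOGA over the combination weights."""
    combined = nnct_combine(predictions, weights)
    return float(np.sum((np.asarray(actual) - combined) ** 2))
```

Here `predictions` stacks one row of intermediate forecasts per model; `sse_fitness` is the quantity a candidate GPSOGA particle would be scored by.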

5. Experiment Results and Analysis

5.1. Dataset Information

The wind speed data utilized in this paper were obtained from the National Wind Technology Center (NWTC) of the National Renewable Energy Laboratory (NREL). These data were collected at a frequency of every two seconds, and an average value was recorded every minute. The wind speeds were measured and recorded at six different heights: 2 m, 5 m, 10 m, 20 m, 50 m, and 80 m. For our experiment, we selected the wind speed data from heights of 5 m, 20 m, 50 m, and 80 m. Specifically, we designated the wind speed data at heights of 80 m, 50 m, 20 m, and 5 m as dataset 1, dataset 2, dataset 3, and dataset 4, respectively. The time periods for these datasets are as follows: dataset 1—from 17 May 2020 12:00 to 18 May 2020 4:39, dataset 2—from 25 July 2020 22:40 to 26 July 2020 15:19, dataset 3—from 10 September 2020 8:00 to 11 September 2020 0:39, and dataset 4—from 26 June 2020 18:40 to 27 June 2020 11:19.
Selecting data from days across four different months as the dataset provides a higher resolution of shorter time periods, enabling a more precise understanding of short-term wind speed variations. Daily data also better capture the variability in wind speed characteristics between day and night, which is advantageous compared to seasonal or monthly data.
To provide a visual representation, we have included relevant curves and histograms for the four datasets in Figure 3. Furthermore, Table 1 displays the statistical information for each of the four wind speed datasets.

5.2. Evaluation Criteria

We employed various performance indicators to evaluate the accuracy of the proposed algorithm. These indicators include the mean absolute error (MAE), the mean square error (MSE), the mean absolute percentage error (MAPE), and the coefficient of determination ($R^2$). The formulas for these indicators are as follows:
$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n} \left| \hat{y}_i - y_i \right|$$
$$\mathrm{MAPE} = \frac{100\%}{n}\sum_{i=1}^{n} \left| \frac{\hat{y}_i - y_i}{y_i} \right|$$
$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2$$
$$R^2 = 1 - \frac{\sum_{i} \left( \hat{y}_i - y_i \right)^2}{\sum_{i} \left( \bar{y} - y_i \right)^2}$$
In these formulas, $n$ represents the number of samples, $y_i$ is the original wind speed value, $\hat{y}_i$ is the predicted wind speed value, and $\bar{y}$ represents the average of the observed values. Among these evaluation criteria, smaller values indicate better performance for the MAE, MSE, and MAPE. Conversely, a higher value of $R^2$ signifies a better model.
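The four criteria can be computed directly from the observed and predicted series. The sketch below is a straightforward transcription of the formulas; note that the MAPE term assumes no zero observations:

```python
import numpy as np

def evaluate(y_true, y_pred):
    """Compute MAE, MSE, MAPE (in percent), and R^2 for a forecast."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_pred - y_true
    mae = np.mean(np.abs(err))
    mse = np.mean(err ** 2)
    mape = 100.0 * np.mean(np.abs(err / y_true))  # assumes y_true has no zeros
    r2 = 1.0 - np.sum(err ** 2) / np.sum((y_true - y_true.mean()) ** 2)
    return {"MAE": mae, "MSE": mse, "MAPE": mape, "R2": r2}
```
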

5.3. Comparison Models and Their Parameters

To comprehensively evaluate the forecasting performance of the proposed SSA-GPSOGA-NNCT algorithm, we conducted four experiments, each with specific objectives and comparisons. The details of these experiments are summarized in Table 2.
Experiment I: We compare the proposed algorithm with the individual single models, namely, CBP, the RNN, the GRU, and the CNNRNN.
Experiment II: We apply the SSA decomposition technique to the individual single models used in Experiment I. We compare the performance of SSA-CBP, the SSA-RNN, the SSA-GRU, and the SSA-CNNRNN with the proposed SSA-GPSOGA-NNCT algorithm.
Experiment III: The objective of this experiment is to verify the effectiveness of the proposed GPSOGA optimization algorithm. We compare SSA-SA-NNCT, SSA-ACO-NNCT, SSA-GA-NNCT, and SSA-PSO-NNCT with the proposed SSA-GPSOGA-NNCT algorithm.
Experiment IV: This experiment aimed to validate the suitability of the SSA decomposition technology for the proposed algorithm. We compare the performance of EMD-GPSOGA-NNCT and CEEMDAN-GPSOGA-NNCT with the proposed SSA-GPSOGA-NNCT algorithm.
The parameters used in the four experiments are provided in Table 3. For CBP, the RNN, the GRU, and the CNNRNN, the activation function in the hidden layers is ReLU, and the Adam optimizer is utilized. EMD does not require any specific parameter settings; hence, it is not included in the table. In terms of data preprocessing, the min–max normalization method is applied to enhance the convergence speed of the models.
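The min–max preprocessing mentioned above, and the inverse mapping needed to read forecasts back on the original wind speed scale, can be sketched as follows (a minimal version; the function names are illustrative):

```python
import numpy as np

def minmax_scale(x):
    """Min-max normalization to [0, 1]; returns the scaled series
    and the (min, max) bounds needed for the inverse mapping."""
    x = np.asarray(x, dtype=float)
    lo, hi = x.min(), x.max()
    return (x - lo) / (hi - lo), (lo, hi)

def minmax_inverse(z, bounds):
    """Map normalized forecasts back to the original scale."""
    lo, hi = bounds
    return np.asarray(z, dtype=float) * (hi - lo) + lo
```

In practice, the bounds are computed on the training split only and reused for the test split, so that no test information leaks into preprocessing.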

5.4. Experiment I

In this subsection, we conducted experiments using individual prediction models without decomposition technology. Figure 4 illustrates the forecasting results of the proposed algorithm and the four comparison models for one-step, three-step, and five-step forecasting on dataset 1. The figure visually demonstrates the forecasting results of the proposed algorithm compared to the individual single models, and it is apparent that the proposed algorithm achieves the best fitting results.
The results of the evaluation criteria of CBP, the RNN, the GRU, and the CNNRNN are presented in Table 4. Upon analyzing the evaluation criteria, it is evident that the proposed algorithm achieves higher accuracy than the individual single models. The proposed SSA-GPSOGA-NNCT algorithm consistently outperforms all traditional single models in terms of all evaluation metrics for one-step-ahead forecasting on all datasets. The MAPE values of the individual single models are considerably higher than that of the proposed algorithm. For instance, for one-step-ahead forecasting on dataset 1, the MAPE of the proposed algorithm is 0.5067%, while CBP, the RNN, the GRU, and the CNNRNN achieve 5.1632%, 5.3300%, 5.2373%, and 5.0308%, respectively.
The comparison results clearly reveal that the performance of the individual single models falls significantly short in comparison to the proposed algorithm, regardless of whether it is one-step, three-step or five-step forecasting. Moreover, the superiority of the proposed algorithm becomes more apparent as the number of prediction steps increases. This highlights the effectiveness and superiority of the proposed algorithm in accurately predicting wind speed compared to the individual single models.

5.5. Experiment II

To verify the effectiveness of the NNCT strategy, which combines multiple single models with SSA decomposition technology, we conducted experiments using four single models along with SSA (SSA-CBP, SSA-RNN, SSA-GRU, SSA-CNNRNN). Figure 5 displays the forecasting errors of all the models for three-step forecasting and the results of the performance indicators for one-step, three-step and five-step forecasting on dataset 1. In Figure 5, the SSA in the models' names is omitted to ensure a tight layout. As the figures show, compared with the four comparison models, the proposed algorithm consistently exhibits the smallest prediction errors. It is also evident that as the number of prediction steps increases, the fitting effect of all the models deteriorates.
Table 5 provides the results of the evaluation indices of the models employed in this experiment. Among all the comparative data, it is apparent that the proposed algorithm outperforms the others across multiple indicators. For example, for one-step forecasting on dataset 4, the proposed algorithm achieves an MAPE index of 1.8978%, while SSA-CBP, the SSA-RNN, the SSA-GRU, and the SSA-CNNRNN achieve MAPE indices of 2.3410%, 2.7375%, 2.3346%, and 3.7835%, respectively; the best-performing among them reaches only 2.3346%. Similarly, for five-step forecasting on dataset 3, the proposed algorithm achieves an MAPE index of 7.1916%, while the lowest among the four models is the SSA-GRU model, with an MAPE index of 8.6586%, a difference in accuracy of approximately 1.5% compared to the proposed algorithm.
The comparison results show that the proposed algorithm based on the NNCT strategy, along with SSA decomposition technology, obtains the best forecasting results. This experiment demonstrates that the NNCT strategy improves the accuracy of wind speed forecasting.

5.6. Experiment III

To verify the effectiveness of the proposed GPSOGA algorithm, we compare the proposed SSA-GPSOGA-NNCT algorithm with models using different optimization algorithms (SSA-SA-NNCT, SSA-ACO-NNCT, SSA-GA-NNCT, and SSA-PSO-NNCT). In all of these models, SSA decomposition technology is used to preprocess the data, and the NNCT strategy is utilized to combine the prediction results of the single models.
Figure 6 illustrates the forecasting results of all the models for one-step forecasting and the performance indicators of the proposed algorithm and the comparison models for one-step, three-step and five-step forecasting on dataset 2. From the comparison of these optimization algorithms, it is evident that the model based on the GPSOGA algorithm outperforms the other models.
Table 6 presents the evaluation metrics for the models in this experiment. A comprehensive analysis clearly demonstrates that the proposed algorithm outperforms the comparison models across multiple indicators. For example, for one-step forecasting on dataset 2, the MAPE value of the proposed algorithm is 1.2235%, while the MAPE values of the SA, ACO, GA, and PSO models are 4.7473%, 2.9277%, 2.1252%, and 1.3954%, respectively. It is noteworthy that the MAPE value of the PSO-based model is close to that of the GPSOGA algorithm, while the results of the other three algorithms are clearly inferior to it.
These results clearly indicate that the proposed GPSOGA algorithm achieves better prediction accuracy compared to the other algorithms considered, highlighting its effectiveness in wind speed forecasting.

5.7. Experiment IV

To evaluate the effectiveness of the SSA decomposition method, we conducted a comparative analysis with two classical decomposition techniques, namely EMD and CEEMDAN. These decomposition methods were applied to the combined model, which integrated the NNCT fusion strategy and the GPSOGA algorithm. Figure 7 shows the forecasting results of the proposed algorithm and the comparison models for one-step, three-step, and five-step forecasting on dataset 1, while Table 7 presents the corresponding performance indicators.
From Figure 7 and Table 7, the results clearly demonstrate that the combined method based on SSA decomposition technology consistently outperforms the other decomposition techniques. In particular, in cases where the combined models based on EMD or CEEMDAN do not perform satisfactorily, the combined model based on SSA exhibits superior predictive performance. For example, for three-step forecasting on dataset 3, the MAPE values of the EMD-based and CEEMDAN-based models are 11.7078% and 12.9240%, respectively. In contrast, the MAPE value of the SSA-based model reaches an impressive 3.9608%.
These results demonstrate the efficiency of SSA decomposition technology, especially when the performance of the EMD- and CEEMDAN-based models falls short, and illustrate that SSA decomposition is well suited to the proposed algorithm.

6. Discussion

In this section, we conduct further validation to demonstrate the advantages of the proposed algorithm. We employ the DM test, Akaike’s information criterion, and the Nash–Sutcliffe efficiency coefficient for this purpose. These statistical measures are utilized to rigorously assess the performance of our proposed model compared to other comparison models.

6.1. Diebold–Mariano Test

The DM test is a statistical hypothesis test widely used to assess the relative accuracy of two comparison forecasting models [47]. The test aims to determine whether one forecasting model is significantly different from the other in terms of forecasting accuracy. Its robustness and ease of implementation have made it a popular choice in the comparison of forecasting methods.
Considering a significance level $\alpha$, the null hypothesis $H_0$ states that there is no significant difference between the proposed algorithm and the comparative model, while the alternative hypothesis $H_1$ asserts that a notable difference exists. The hypotheses are represented as follows:
$$H_0: E\left[L(\varepsilon_i^1)\right] = E\left[L(\varepsilon_i^2)\right]$$
$$H_1: E\left[L(\varepsilon_i^1)\right] \neq E\left[L(\varepsilon_i^2)\right]$$
where $L$ denotes the loss function, and $\varepsilon_i^p$ ($p = 1, 2$) are the forecasting errors of the two comparison models. We use the squared-error loss as the loss function in this paper. Furthermore, the DM test statistic is defined as
$$\mathrm{DM} = \frac{\frac{1}{n}\sum_{i=1}^{n} \left( L(\varepsilon_i^1) - L(\varepsilon_i^2) \right)}{\sqrt{s^2/n}}$$
where $s^2$ is an estimate of the variance of $d_i = L(\varepsilon_i^1) - L(\varepsilon_i^2)$.
In this test, under significance level $\alpha$, we compare the computed DM value with $-Z_{\alpha/2}$ and $Z_{\alpha/2}$. The null hypothesis $H_0$ is accepted if the value falls inside $[-Z_{\alpha/2}, Z_{\alpha/2}]$; otherwise, $H_0$ is rejected.
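The DM statistic above can be computed directly from two error series. The sketch below uses squared-error loss and the plain sample variance of $d_i$; it omits the autocovariance correction sometimes applied for multi-step forecasts, and the paper does not state which variance estimator is used:

```python
import numpy as np

def dm_test(e1, e2):
    """Diebold-Mariano statistic with squared-error loss.
    e1, e2 are the forecasting errors of the two compared models.
    A positive DM means model 2's loss is smaller on average."""
    d = np.asarray(e1, dtype=float) ** 2 - np.asarray(e2, dtype=float) ** 2
    n = len(d)
    s2 = d.var(ddof=1)  # sample variance of d_i
    return d.mean() / np.sqrt(s2 / n)
```

The resulting value is then compared against the two-sided normal critical value $Z_{\alpha/2}$ (1.96 at the 5% level).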
The DM test results of the four experiments are presented in Table 8. Upon reviewing the table, it becomes evident that the DM values between the proposed algorithm and the comparison models in experiments I, II, and IV are all greater than the threshold of $Z_{0.05/2} = 1.96$. For example, in experiment II, the DM values for three-step forecasting on dataset 1 are 12.117, 11.511, 8.776, and 9.033, respectively. Therefore, the null hypothesis can be rejected at the 5% significance level.
In experiment III, the DM values of SSA-SA-NNCT and SSA-GA-NNCT are all greater than 1.96, while the DM values of SSA-ACO-NNCT on dataset 4 are lower than 1.96. Comparing the proposed algorithm with SSA-SA-NNCT and SSA-GA-NNCT on dataset 1, dataset 2, and dataset 3, the null hypothesis can be rejected at the 5% significance level. Five DM values of SSA-PSO-NNCT are lower than 1.96, but its DM values for five-step forecasting are greater than 1.96, so for SSA-PSO-NNCT the null hypothesis can be rejected at the 5% significance level for five-step forecasting on all the datasets. All the DM values of the proposed algorithm and the comparison models in experiment III are greater than the threshold of $Z_{0.1/2} = 1.65$.
According to the results of the DM test, the proposed algorithm exhibits a significant difference in performance compared with all the comparison models in experiments I, II, and IV, with a probability of more than 95%. For experiment III, there is a significant difference between the proposed algorithm and the comparison models with a probability of more than 95% in most cases, and of more than 90% in the remainder.

6.2. Akaike’s Information Criterion

AIC (Akaike's information criterion) is a statistical measure used for model comparison in the field of time series forecasting [48]. AIC is derived from the likelihood function of the model, considering the number of model parameters and the quality of the fit to the data. It is particularly valuable when comparing multiple candidate models on the same dataset. AIC is defined as
$$\mathrm{AIC} = 2k - 2\ln(L)$$
where $k$ is the number of parameters in the model, and $L$ is the maximum likelihood of the model. AIC is used to compare the forecasting performance of different models: the model with the lower AIC value is considered to perform better, while the absolute value of the AIC alone cannot determine whether a model's forecasting performance is good or bad.
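For models fitted by least squares with Gaussian errors, the likelihood term reduces to a function of the residual MSE, so AIC can be computed from the forecasting errors. The reduced form below (correct up to an additive constant, which cancels when comparing models on the same data) is a common convention and is not necessarily the exact computation behind Table 9:

```python
import numpy as np

def aic_gaussian(y_true, y_pred, k):
    """AIC = 2k - 2 ln(L); under a Gaussian likelihood this reduces,
    up to an additive constant, to 2k + n * ln(MSE)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    n = len(y_true)
    mse = np.mean((y_true - y_pred) ** 2)
    return 2 * k + n * np.log(mse)
```

Because only differences in AIC matter, this form suffices to rank models that share a dataset.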
The detailed results of AIC are listed in Table 9. From Table 9, the following conclusions can be drawn.
The proposed algorithm consistently has the lowest AIC values across all datasets and steps, suggesting it provides the best trade-off between goodness of fit and simplicity.
The models in experiment I show relatively higher AIC values than the models in experiments II and III, which indicates that decomposition and optimization algorithms are necessary in wind speed forecasting. Moreover, 31 out of 44 AIC values are lower when we compare the models in experiment II with those in experiment III, which implies that the decomposition algorithms are more effective at improving forecasting accuracy. The AIC values of the models in experiment IV are relatively higher than those of the models in experiments II and III, suggesting that EMD and CEEMDAN are not well suited to the datasets used in this paper.
According to the results of AIC, we can conclude that the proposed algorithm consistently outperforms the other models across all datasets and steps, indicating its robustness and suitability for the given data.

6.3. Nash–Sutcliffe Efficiency Coefficient

The NSE (Nash–Sutcliffe efficiency coefficient) is a widely used statistical metric for evaluating the performance of forecasting models [49]. The NSE coefficient is based on the comparison of the model’s simulated values with the mean of the observed data, considering both the variability and bias of the model. In this paper, we apply NSE to evaluate the quality of the proposed algorithm. The formula is described as follows:
$$\mathrm{NSE} = 1 - \frac{\sum_{t=1}^{T} \left( y_o^t - y_m^t \right)^2}{\sum_{t=1}^{T} \left( y_o^t - \bar{y}_o \right)^2}$$
where $y_o^t$ is the observed value at time $t$, $y_m^t$ is the predicted value at time $t$, and $\bar{y}_o$ is the average of the observed values. Higher NSE values indicate better performance. The coefficient ranges from negative infinity to 1, where 1 represents a perfect fit, 0 indicates that the model performs only as well as the mean of the observations used as a predictor, and values less than zero imply that the model performs worse than using the mean value as a predictor.
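The NSE formula translates directly into code; a minimal sketch:

```python
import numpy as np

def nse(observed, modeled):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 means the
    model is no better than predicting the observed mean."""
    o = np.asarray(observed, dtype=float)
    m = np.asarray(modeled, dtype=float)
    return 1.0 - np.sum((o - m) ** 2) / np.sum((o - o.mean()) ** 2)
```

Predicting the constant mean of the observations yields NSE = 0 by construction, which is what makes the coefficient easy to interpret.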
Table 10 shows the detailed results of NSE. From Table 10, the following conclusion can be drawn.
Comparing the average NSE values, the top five models in terms of performance are the proposed algorithm, SSA-ACO-NNCT, SSA-PSO-NNCT, the SSA-CNNRNN, and the SSA-RNN on dataset 1. The proposed algorithm performs best, followed by SSA-PSO-NNCT, SSA-ACO-NNCT, the SSA-RNN, and the SSA-GRU on dataset 2. On dataset 3, the proposed algorithm, SSA-PSO-NNCT, SSA-ACO-NNCT, SSA-GA-NNCT, and SSA-SA-NNCT are the top five models according to their average NSE values. On dataset 4, the proposed algorithm, SSA-PSO-NNCT, SSA-SA-NNCT, SSA-GA-NNCT, and SSA-ACO-NNCT are the best models in terms of performance.
As indicated by the NSE analysis, the proposed algorithm, SSA-ACO-NNCT, and SSA-PSO-NNCT are the most accurate models, and the proposed algorithm consistently outperforms the other models across all datasets and steps.

7. Conclusions and Future Research

The management of wind power relies on precise wind speed forecasting. This paper proposes an innovative wind speed forecasting model to make accurate wind speed predictions. The proposed algorithm uses SSA to decompose the original wind speed time series and PSR to construct samples. Four predictors are employed to forecast the wind speed, and the NNCT strategy is utilized to combine their outputs into the final forecasting results. We develop an optimization algorithm to find the optimal weights of NNCT and compare the proposed algorithm with different models on four datasets. The following conclusions can be drawn.
The prediction results of a single model are significantly limited and fall well short of those of the combined models; the MAPE of the combined models is consistently better than that of the single models. Decomposition technology is necessary to improve forecasting accuracy, and among the three decomposition methods, SSA outperforms EMD and CEEMDAN: compared with EMD and CEEMDAN, the MAPE of SSA is improved by 31.32% and 42.69% for three-step forecasting on dataset 1. The proposed GPSOGA can increase the forecasting performance: compared with SA, ACO, GA, and PSO, the MAPE of GPSOGA is improved by 11.32%, 17.31%, 15.48%, and 7.14% for five-step forecasting on dataset 3.
The DM test shows that the null hypothesis can be rejected at the 5% significance level in most cases; there are significant differences between the proposed algorithm and the 14 models involved in the four experiments. The AIC of the proposed algorithm has the smallest value, demonstrating that the proposed algorithm outperforms the other models and provides a good balance between fitting accuracy and complexity. The NSE is also employed to compare the proposed algorithm with different models on the four datasets, and the results again show that the proposed algorithm outperforms the other models.
The GPSOGA algorithm is used in this paper for optimizing the weights of multiple models. In addition, optimization algorithms are now applied in various fields such as online learning, scheduling, multi-objective optimization, and transportation, among others [50]. The GPSOGA algorithm employs multiple search strategies, dynamically adjusts algorithm parameters based on the current search state, ensures global search capability, and effectively optimizes local regions. In future research, we will explore more advanced optimization algorithms for other applications; specifically, we will consider (1) using the proposed optimization algorithm to optimize model hyperparameters, thereby achieving faster optimal hyperparameter acquisition, and (2) integrating other self-adaptive algorithms [51,52] and hyper-heuristic algorithms [53] to enhance the proposed optimization algorithm.
This study utilized multiple datasets for extensive experiments, showing that the proposed algorithm offers excellent prediction performance and generalization. However, our proposed model ignores other weather factors related to wind speed, such as temperature, humidity, and pressure; we may build a time series forecasting model that considers these features to obtain more accurate results. In future work, we will continue to optimize the hyperparameters, conduct in-depth error analysis, and consider applying the model to other prediction fields.

Author Contributions

Conceptualization, Z.H.; methodology, Y.C.; software, Z.H.; validation, Y.Z. and Z.H.; formal analysis, Y.Z.; investigation, Z.H.; resources, Z.H.; data curation, Y.Z.; writing—original draft preparation, Y.Z.; writing—review and editing, Z.H.; visualization, Y.Z.; supervision, Y.C.; project administration, Y.C.; funding acquisition, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of Henan Province of China, grant numbers 232300421385 and 222300420296.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data was created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Perry, S. Wind energy for sustainable development: Driving factors and future outlook. J. Clean. Prod. 2021, 289, 125779.
2. WWEA. WWEA Half-Year Report 2023: Additional Momentum for Windpower in 2023; WWEA: Bonn, Germany, 2023.
3. IEA. World Energy Outlook 2022; IEA: Paris, France, 2022.
4. Arndt, C.; Arent, D.; Hartley, F.; Merven, B.; Mondal, A.H. Faster than you think: Renewable energy and developing countries. Annu. Rev. Resour. Econ. 2019, 11, 149–168.
5. Rockström, J.; Gaffney, O.; Rogelj, J.; Meinshausen, M.; Nakicenovic, N.; Schellnhuber, H.J. A roadmap for rapid decarbonization. Science 2017, 355, 1269–1271.
6. Damousis, I.G.; Dokopoulos, P. A fuzzy expert system for the forecasting of wind speed and power generation in wind farms. In Proceedings of the PICA 2001. Innovative Computing for Power-Electric Energy Meets the Market. 22nd IEEE Power Engineering Society International Conference on Power Industry Computer Applications (Cat. No. 01CH37195), Sydney, NSW, Australia, 20–24 May 2001; IEEE: Piscataway, NJ, USA, 2001.
7. Attig-Bahar, F.; Ritschel, U.; Akari, P.; Abdeljelil, I.; Amairi, M. Wind energy deployment in Tunisia: Status, drivers, barriers and research gaps—A comprehensive review. Energy Rep. 2021, 7, 7374–7389.
8. Yang, Q.; Huang, G.; Li, T.; Xu, Y.; Pan, J. A novel short-term wind speed prediction method based on hybrid statistical-artificial intelligence model with empirical wavelet transform and hyperparameter optimization. J. Wind Eng. Ind. Aerodyn. 2023, 240, 105499.
9. Karan, S.; Panigrahi, B.K.; Shikhola, T.; Sharma, R. An imputation and decomposition algorithms based integrated approach with bidirectional LSTM neural network for wind speed prediction. Energy 2023, 278, 127799.
10. Yang, Z.; Dong, S. A novel decomposition-based approach for non-stationary hub-height wind speed modelling. Energy 2023, 283, 129081.
11. Jiang, Y.; Liu, S.; Zhao, N.; Xin, J.; Wu, B. Short-term wind speed prediction using time varying filter-based empirical mode decomposition and group method of data handling-based hybrid model. Energy Convers. Manag. 2020, 220, 113076.
12. Hu, J.; Wang, J.; Xiao, L. A hybrid approach based on the Gaussian process with t-observation model for short-term wind speed forecasts. Renew. Energy 2017, 114, 670–685.
13. Fu, W.; Wang, K.; Tan, J.; Zhang, K. A composite framework coupling multiple feature selection, compound prediction models and novel hybrid swarm optimizer-based synchronization optimization strategy for multi-step ahead short-term wind speed forecasting. Energy Convers. Manag. 2020, 205, 112461.
14. Neshat, M.; Nezhad, M.M.; Abbasnejad, E.; Mirjalili, S.; Tjernberg, L.B.; Garcia, D.A.; Alexander, B.; Wagner, M. A deep learning-based evolutionary model for short-term wind speed forecasting: A case study of the Lillgrund offshore wind farm. Energy Convers. Manag. 2021, 236, 114002.
15. Xiong, D.; Fu, W.; Wang, K.; Fang, P.; Chen, T.; Zou, F. A blended approach incorporating TVFEMD, PSR, NNCT-based multi-model fusion and hierarchy-based merged optimization algorithm for multi-step wind speed prediction. Energy Convers. Manag. 2021, 230, 113680.
16. Wang, K.; Qi, X.; Liu, H.; Song, J. Deep belief network based k-means cluster approach for short-term wind power forecasting. Energy 2018, 165, 840–852.
17. Zhang, Y.; Zhao, Y.; Kong, C.; Chen, B. A new prediction method based on VMD-PRBF-ARMA-E model considering wind speed characteristic. Energy Convers. Manag. 2020, 203, 112254.
18. Liu, D.; Ding, L.; Bai, Y.-L. Application of hybrid model based on empirical mode decomposition, novel recurrent neural networks and the ARIMA to wind speed prediction. Energy Convers. Manag. 2021, 233, 113917.
19. Jiang, Z.; Che, J.; Wang, L. Ultra-short-term wind speed forecasting based on EMD-VAR model and spatial correlation. Energy Convers. Manag. 2021, 250, 114919.
20. Singh, S.; Mohapatra, A. Repeated wavelet transform based ARIMA model for very short-term wind speed forecasting. Renew. Energy 2019, 136, 758–768.
21. Pazikadin, A.R.; Rifai, D.; Ali, K.; Malik, M.Z.; Abdalla, A.N.; Faraj, M.A. Solar irradiance measurement instrumentation and power solar generation forecasting based on Artificial Neural Networks (ANN): A review of five years research trend. Sci. Total Environ. 2020, 715, 136848.
22. Salcedo-Sanz, S.; Ortiz-García, E.G.; Pérez-Bellido, Á.M.; Portilla-Figueras, A.; Prieto, L. Short term wind speed prediction based on evolutionary support vector regression algorithms. Expert Syst. Appl. 2011, 38, 4052–4057.
23. Zhang, Z.; Ye, L.; Qin, H.; Liu, Y.; Wang, C.; Yu, X.; Yin, X.; Li, J. Wind speed prediction method using shared weight long short-term memory network and Gaussian process regression. Appl. Energy 2019, 247, 270–284.
24. Wei, D.; Wang, J.; Niu, X.; Li, Z. Wind speed forecasting system based on gated recurrent units and convolutional spiking neural networks. Appl. Energy 2021, 292, 116842.
25. Gao, Z.; Li, Z.; Xu, L.; Yu, J. Dynamic adaptive spatio-temporal graph neural network for multi-node offshore wind speed forecasting. Appl. Soft Comput. 2023, 141, 110294.
26. Xiao, L.; Wang, J.; Dong, Y.; Wu, J. Combined forecasting models for wind energy forecasting: A case study in China. Renew. Sustain. Energy Rev. 2015, 44, 271–288.
27. Zhang, W.; Qu, Z.; Zhang, K.; Mao, W.; Ma, Y.; Fan, X. A combined model based on CEEMDAN and modified flower pollination algorithm for wind speed forecasting. Energy Convers. Manag. 2017, 136, 439–451.
28. Wang, J.; Zhang, H.; Li, Q.; Ji, A. Design and research of hybrid forecasting system for wind speed point forecasting and fuzzy interval forecasting. Expert Syst. Appl. 2022, 209, 118384.
29. Niu, X.; Wang, J. A combined model based on data preprocessing strategy and multi-objective optimization algorithm for short-term wind speed forecasting. Appl. Energy 2019, 241, 519–539.
30. Jian, H.; Lin, Q.; Wu, J.; Fan, X.; Wang, X. Design of the color classification system for sunglass lenses using PCA-PSO-ELM. Measurement 2022, 189, 110498.
31. Leon, A.S.; Bian, L.; Tang, Y. Comparison of the genetic algorithm and pattern search methods for forecasting optimal flow releases in a multi-storage system for flood control. Environ. Model. Softw. 2021, 145, 105198.
32. Lv, H.; Chen, X.; Zeng, X. Optimization of micromixer with Cantor fractal baffle based on simulated annealing algorithm. Chaos Solitons Fractals 2021, 148, 111048.
33. Ghalambaz, M.; Yengejeh, R.J.; Davami, A.H. Building energy optimization using grey wolf optimizer (GWO). Case Stud. Therm. Eng. 2021, 27, 101250.
34. Chakraborty, S.; Saha, A.K.; Chakraborty, R.; Saha, M. An enhanced whale optimization algorithm for large scale optimization problems. Knowl.-Based Syst. 2021, 233, 107543.
35. Wang, Y.; Wang, J.; Wei, X. A hybrid wind speed forecasting model based on phase space reconstruction theory and Markov model: A case study of wind farms in northwest China. Energy 2015, 91, 556–572.
36. He, P.; Fang, Q.; Jin, H.; Ji, Y.; Gong, Z.; Dong, J. Coordinated design of PSS and STATCOM-POD based on the GA-PSO algorithm to improve the stability of wind-PV-thermal-bundled power system. Int. J. Electr. Power Energy Syst. 2022, 141, 108208.
37. Wang, Y.; Wang, J.; Li, Z.; Yang, H.; Li, H. Design of a combined system based on two-stage data preprocessing and multi-objective optimization for wind speed prediction. Energy 2021, 231, 121125.
38. Zhou, Q.; Wang, C.; Zhang, G. A combined forecasting system based on modified multi-objective optimization and sub-model selection strategy for short-term wind speed. Appl. Soft Comput. 2020, 94, 106463.
39. Arteche, J.; García-Enríquez, J. Singular spectrum analysis for signal extraction in stochastic volatility models. Econ. Stat. 2017, 1, 85–98.
40. Fu, W.; Wang, K.; Li, C.; Tan, J. Multi-step short-term wind speed forecasting approach based on multi-scale dominant ingredient chaotic analysis, improved hybrid GWO-SCA optimization and ELM. Energy Convers. Manag. 2019, 187, 356–377.
41. Lin, L.; Li, M.; Ma, L.; Baziar, A.; Ali, Z.M. Hybrid RNN-LSTM deep learning model applied to a fuzzy based wind turbine data uncertainty quantization method. Ad Hoc Netw. 2021, 123, 102658.
42. Li, C.; Tang, G.; Xue, X.; Saeed, A.; Hu, X. Short-term wind speed interval prediction based on ensemble GRU model. IEEE Trans. Sustain. Energy 2019, 11, 1370–1380.
43. Nachaoui, M.; Afraites, L.; Laghrib, A. A regularization by denoising super-resolution method based on genetic algorithms. Signal Process. Image Commun. 2021, 99, 116505.
44. Feng, Y.; Lan, C.; Briseghella, B.; Fenu, L.; Zordan, T. Cable optimization of a cable-stayed bridge based on genetic algorithms and the influence matrix method. Eng. Optim. 2022, 54, 20–39.
45. Katoch, S.; Chauhan, S.S.; Kumar, V. A review on genetic algorithm: Past, present, and future. Multimed. Tools Appl. 2021, 80, 8091–8126.
46. Zhou, Y.; Wang, R.; Luo, Q. Elite opposition-based flower pollination algorithm. Neurocomputing 2016, 188, 294–310.
47. Yang, H.; Zhu, Z.; Li, C.; Li, R. A novel combined forecasting system for air pollutants concentration based on fuzzy theory and optimization of aggregation weight. Appl. Soft Comput. 2020, 87, 105972.
48. Tsvetkova, O.; Ouarda, T.B. Use of the Halphen distribution family for mean wind speed estimation with application to Eastern Canada. Energy Convers. Manag. 2023, 276, 116502.
49. Lu, Y.; Li, T.; Hu, H.; Zeng, X. Short-term prediction of reference crop evapotranspiration based on machine learning with different decomposition methods in arid areas of China. Agric. Water Manag. 2023, 279, 108175.
50. Mojgan, S.; Khayamim, R.; Ozguven, E.E.; Dulebenets, M.A. Sustainable decisions in a ridesharing system with a tri-objective optimization approach. Transp. Res. Part D 2023, 125, 103958.
51. Chen, M.; Tan, Y. SF-FWA: A Self-Adaptive Fast Fireworks Algorithm for effective large-scale optimization. Swarm Evol. Comput. 2023, 80, 101314.
52. Dulebenets, M.A. An Adaptive Polyploid Memetic Algorithm for scheduling trucks at a cross-docking terminal. Inf. Sci. 2021, 565, 390–421.
53. Singh, E.; Pillay, N. A study of ant-based pheromone spaces for generation constructive hyper-heuristics. Swarm Evol. Comput. 2022, 72, 101095.
Figure 1. The flowchart of CNNRNN.
Figure 2. The flow chart of the proposed SSA-GPSOGA-NNCT algorithm.
Figure 3. The statistical information of the four datasets used in this paper.
Figure 4. The forecasting results of the proposed algorithm and four single forecasting models on dataset 1.
Figure 5. The forecasting errors of all the models for three-step forecasting and the performance indicators for one-step, three-step and five-step forecasting on dataset 1.
Figure 6. The forecasting results of all the models for one-step forecasting and the performance indicators for one-step, three-step, and five-step forecasting on dataset 2.
Figure 7. The results of different forecasting models with different decomposition methods on dataset 1.
Table 1. The statistical information of the four wind speed datasets.
| Dataset | Min. | Median | Max. | Mean | Std. |
|---|---|---|---|---|---|
| Dataset 1 | 0.3530 | 2.6625 | 5.5810 | 2.7703 | 1.1196 |
| Dataset 2 | 0.3540 | 4.0460 | 9.7100 | 3.9389 | 1.7143 |
| Dataset 3 | 0.3640 | 1.7760 | 5.1360 | 1.8342 | 0.8890 |
| Dataset 4 | 0.3620 | 4.0540 | 11.8100 | 4.4091 | 2.2651 |
Table 2. The comparison models in the four experiments.
| Experiment | Comparison Models |
|---|---|
| Experiment I | CBP, RNN, GRU, CNNRNN |
| Experiment II | SSA-CBP, SSA-RNN, SSA-GRU, SSA-CNNRNN |
| Experiment III | SSA-SA-NNCT, SSA-ACO-NNCT, SSA-GA-NNCT, SSA-PSO-NNCT |
| Experiment IV | EMD-GPSOGA-NNCT, CEEMDAN-GPSOGA-NNCT |
Table 3. The specific parameter values of the models used in this paper.
| Model | Parameters | Values |
|---|---|---|
| CBP, RNN, GRU | Number of neurons in hidden layers | 100 |
| | Size of batch | 32 |
| | Epochs of training | 200 |
| CNNRNN | Number of kernels in the CNN layer | 10 |
| | Number of parallel filters in the CNN layer | 100 |
| | Number of neurons in the RNN layer | 100 |
| | Size of batch | 32 |
| | Epochs of training | 200 |
| CEEMDAN | Noise standard deviation | 0.05 |
| | Number of realizations | 50 |
| | Maximum sifting iterations | 300 |
| PSR | Reconstruction dimension d | 10 |
| | Time delay τ | 1 |
| SA, ACO, GA, PSO, GPSOGA | Maximum iterations | 100 |
| | Number of searching individuals | 60 |
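The PSR settings in Table 3 (reconstruction dimension d = 10, time delay τ = 1) determine how each decomposed component is turned into input/output pairs for the predictors. A minimal sketch of that delay embedding, assuming a one-step-ahead target convention (the function name and conventions here are illustrative, not taken from the authors' code):

```python
import numpy as np

def phase_space_reconstruct(series, d=10, tau=1):
    """Build delay-embedding inputs X and one-step-ahead targets y.

    Each row of X is [x(t), x(t+tau), ..., x(t+(d-1)*tau)] and the
    matching target is the next observation x(t+(d-1)*tau + 1).
    """
    series = np.asarray(series, dtype=float)
    n = len(series) - (d - 1) * tau - 1
    X = np.array([series[i : i + d * tau : tau] for i in range(n)])
    y = series[(d - 1) * tau + 1 : (d - 1) * tau + 1 + n]
    return X, y

# With d=10, tau=1 (the values in Table 3), a length-20 series
# yields 10 training pairs of 10 lags each.
X, y = phase_space_reconstruct(np.arange(20.0))
```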
Table 4. Results of four evaluation metrics of the proposed model and four single forecasting models.
| Dataset | Model | Step 1 (MAE / MSE / MAPE / R²) | Step 3 (MAE / MSE / MAPE / R²) | Step 5 (MAE / MSE / MAPE / R²) |
|---|---|---|---|---|
| Dataset 1 | CBP | 0.1589 / 0.0378 / 5.1632 / 0.9323 | 0.2851 / 0.1292 / 8.6159 / 0.8687 | 0.3372 / 0.1818 / 10.1662 / 0.8746 |
| | RNN | 0.1692 / 0.0439 / 5.3300 / 0.9213 | 0.3245 / 0.1643 / 10.0029 / 0.9059 | 0.3860 / 0.2335 / 11.6739 / 0.8822 |
| | GRU | 0.1715 / 0.0475 / 5.2373 / 0.9149 | 0.3016 / 0.1364 / 9.3970 / 0.8558 | 0.4250 / 0.2827 / 12.7851 / 0.8940 |
| | CNNRNN | 0.1644 / 0.0424 / 5.0308 / 0.9240 | 0.3315 / 0.1645 / 10.1096 / 0.9055 | 0.4153 / 0.2654 / 12.4034 / 0.8250 |
| | Proposed | 0.0156 / 0.0004 / 0.5067 / 0.9991 | 0.0435 / 0.0031 / 1.4294 / 0.9943 | 0.0767 / 0.0093 / 2.4493 / 0.9833 |
| Dataset 2 | CBP | 0.4909 / 0.4779 / 12.0881 / 0.9251 | 0.5755 / 0.6228 / 15.4712 / 0.9023 | 0.6482 / 0.7939 / 16.8649 / 0.8754 |
| | RNN | 0.3465 / 0.2119 / 9.6922 / 0.9667 | 0.5300 / 0.4997 / 14.1149 / 0.9216 | 0.5960 / 0.6569 / 15.8675 / 0.8969 |
| | GRU | 0.3812 / 0.2208 / 11.3245 / 0.9653 | 0.4902 / 0.4158 / 14.8632 / 0.9347 | 0.8043 / 1.2173 / 19.5752 / 0.8090 |
| | CNNRNN | 0.3753 / 0.2184 / 10.7489 / 0.9657 | 0.5214 / 0.4452 / 16.5091 / 0.9301 | 0.6590 / 0.7657 / 17.776 / 0.8799 |
| | Proposed | 0.0453 / 0.0038 / 1.2235 / 0.9994 | 0.1028 / 0.0191 / 2.9543 / 0.9969 | 0.1819 / 0.0551 / 5.6233 / 0.9913 |
| Dataset 3 | CBP | 0.1710 / 0.5057 / 14.9385 / 0.8639 | 0.3470 / 0.1876 / 30.9241 / 0.8433 | 0.4278 / 0.2868 / 38.6148 / 0.9005 |
| | RNN | 0.1655 / 0.0500 / 15.5006 / 0.8682 | 0.4089 / 0.2524 / 32.7434 / 0.8725 | 0.3569 / 0.2188 / 36.4308 / 0.8501 |
| | GRU | 0.1683 / 0.0516 / 15.1504 / 0.8579 | 0.3047 / 0.1529 / 31.3265 / 0.9131 | 0.3348 / 0.1912 / 34.3368 / 0.8674 |
| | CNNRNN | 0.1898 / 0.0622 / 18.2551 / 0.8872 | 0.3011 / 0.1596 / 28.2073 / 0.8576 | 0.3361 / 0.1887 / 31.6548 / 0.8503 |
| | Proposed | 0.0182 / 0.0006 / 1.5909 / 0.9960 | 0.0444 / 0.0034 / 3.9608 / 0.9769 | 0.0816 / 0.0113 / 7.1916 / 0.9247 |
| Dataset 4 | CBP | 0.9881 / 1.5249 / 18.4462 / 0.9408 | 1.2373 / 2.5485 / 23.8929 / 0.8998 | 1.3806 / 3.1115 / 27.0739 / 0.8672 |
| | RNN | 0.9936 / 1.609 / 19.2608 / 0.921 | 1.2226 / 2.5561 / 23.4273 / 0.898 | 1.3625 / 3.1363 / 25.7829 / 0.8614 |
| | GRU | 0.981 / 1.504 / 18.2036 / 0.8457 | 1.2335 / 2.641 / 24.1534 / 0.878 | 1.3605 / 3.0157 / 25.7006 / 0.8898 |
| | CNNRNN | 1.0329 / 1.6885 / 18.7101 / 0.9023 | 1.2194 / 2.4893 / 23.2502 / 0.8137 | 1.3369 / 2.9483 / 25.1321 / 0.9056 |
| | Proposed | 0.1025 / 0.0204 / 1.8978 / 0.9951 | 0.3071 / 0.1669 / 5.6554 / 0.9606 | 0.4860 / 0.3986 / 8.8072 / 0.9061 |
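The four indicators reported in Tables 4–7 are standard point-forecast metrics. A sketch of how they are commonly computed (the percent scaling of MAPE matches the magnitudes in the tables, but the authors' exact implementation is an assumption):

```python
import numpy as np

def evaluate(y_true, y_pred):
    """Return (MAE, MSE, MAPE in percent, R^2) for a forecast series."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_true - y_pred
    mae = np.mean(np.abs(err))
    mse = np.mean(err ** 2)
    mape = 100.0 * np.mean(np.abs(err / y_true))  # assumes y_true != 0
    r2 = 1.0 - np.sum(err ** 2) / np.sum((y_true - y_true.mean()) ** 2)
    return mae, mse, mape, r2
```

Lower MAE, MSE, and MAPE and an R² closer to 1 indicate better forecasts, which is how the "Proposed" rows dominate the comparison models.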
Table 5. Results of four evaluation metrics of the proposed model and four single forecasting models along with SSA.
| Dataset | Model | Step 1 (MAE / MSE / MAPE / R²) | Step 3 (MAE / MSE / MAPE / R²) | Step 5 (MAE / MSE / MAPE / R²) |
|---|---|---|---|---|
| Dataset 1 | SSA-CBP | 0.0642 / 0.0063 / 2.0743 / 0.9885 | 0.1502 / 0.0317 / 5.2032 / 0.9431 | 0.1556 / 0.0339 / 5.1781 / 0.9392 |
| | SSA-RNN | 0.0169 / 0.0005 / 0.5440 / 0.9990 | 0.0996 / 0.0127 / 3.0550 / 0.9772 | 0.1059 / 0.0174 / 3.4444 / 0.9688 |
| | SSA-GRU | 0.0222 / 0.0008 / 0.7247 / 0.9985 | 0.0823 / 0.0098 / 2.7254 / 0.9822 | 0.1514 / 0.0325 / 4.5651 / 0.9417 |
| | SSA-CNNRNN | 0.0255 / 0.0010 / 0.8192 / 0.9981 | 0.0925 / 0.0124 / 3.0497 / 0.9778 | 0.1094 / 0.0179 / 3.4357 / 0.9679 |
| | Proposed | 0.0156 / 0.0004 / 0.5067 / 0.9991 | 0.0435 / 0.0031 / 1.4294 / 0.9943 | 0.0767 / 0.0093 / 2.4493 / 0.9833 |
| Dataset 2 | SSA-CBP | 0.3461 / 0.2700 / 6.9472 / 0.9576 | 0.6016 / 1.0297 / 9.9864 / 0.8385 | 0.4961 / 0.5266 / 9.8678 / 0.9174 |
| | SSA-RNN | 0.0654 / 0.0076 / 1.8038 / 0.9987 | 0.1882 / 0.051 / 5.4475 / 0.9920 | 0.2131 / 0.0746 / 6.4870 / 0.9882 |
| | SSA-GRU | 0.0736 / 0.0116 / 1.9255 / 0.9981 | 0.1762 / 0.0485 / 5.3744 / 0.9923 | 0.2562 / 0.1386 / 6.1579 / 0.9782 |
| | SSA-CNNRNN | 0.1975 / 0.1067 / 4.1492 / 0.9832 | 0.1915 / 0.0963 / 4.0609 / 0.9848 | 0.2686 / 0.1413 / 7.6861 / 0.9778 |
| | Proposed | 0.0453 / 0.0038 / 1.2235 / 0.9994 | 0.1028 / 0.0191 / 2.9543 / 0.9969 | 0.1819 / 0.0551 / 5.6233 / 0.9913 |
| Dataset 3 | SSA-CBP | 0.0313 / 0.0015 / 2.9107 / 0.9899 | 0.0899 / 0.0123 / 8.9501 / 0.9181 | 0.1830 / 0.0438 / 14.2863 / 0.8796 |
| | SSA-RNN | 0.0178 / 0.0008 / 1.6445 / 0.9946 | 0.0689 / 0.0073 / 6.6673 / 0.9514 | 0.1189 / 0.0217 / 9.5569 / 0.8558 |
| | SSA-GRU | 0.0338 / 0.0017 / 3.2008 / 0.9881 | 0.0554 / 0.0053 / 4.9410 / 0.9644 | 0.1015 / 0.0181 / 8.6586 / 0.8797 |
| | SSA-CNNRNN | 0.0328 / 0.0018 / 3.3472 / 0.9877 | 0.0787 / 0.0095 / 6.6068 / 0.9370 | 0.1033 / 0.0172 / 8.7949 / 0.8857 |
| | Proposed | 0.0182 / 0.0006 / 1.5909 / 0.9960 | 0.0444 / 0.0034 / 3.9608 / 0.9769 | 0.0816 / 0.0113 / 7.1916 / 0.9247 |
| Dataset 4 | SSA-CBP | 0.1273 / 0.0299 / 2.3410 / 0.9929 | 0.4083 / 0.2986 / 7.8789 / 0.9296 | 0.6538 / 0.6985 / 12.4793 / 0.8354 |
| | SSA-RNN | 0.1525 / 0.0403 / 2.7375 / 0.9904 | 0.3893 / 0.3003 / 7.2788 / 0.9292 | 0.5687 / 0.5375 / 10.6628 / 0.8734 |
| | SSA-GRU | 0.1314 / 0.0324 / 2.3346 / 0.9923 | 0.3551 / 0.2318 / 6.6904 / 0.9453 | 0.5641 / 0.5485 / 10.2898 / 0.8708 |
| | SSA-CNNRNN | 0.1982 / 0.6830 / 3.7835 / 0.9839 | 0.3435 / 0.2021 / 6.2534 / 0.9524 | 0.6505 / 0.6932 / 11.2917 / 0.8367 |
| | Proposed | 0.1025 / 0.0204 / 1.8978 / 0.9951 | 0.3071 / 0.1669 / 5.6554 / 0.9606 | 0.4860 / 0.3986 / 8.8072 / 0.9061 |
Table 6. Results of four evaluation metrics of the proposed model and four combined models based on different optimization algorithms.
| Dataset | Model | Step 1 (MAE / MSE / MAPE / R²) | Step 3 (MAE / MSE / MAPE / R²) | Step 5 (MAE / MSE / MAPE / R²) |
|---|---|---|---|---|
| Dataset 1 | SSA-SA-NNCT | 0.1306 / 0.0239 / 4.2301 / 0.9571 | 0.0551 / 0.0049 / 1.7708 / 0.9911 | 0.1240 / 0.0262 / 3.9866 / 0.9529 |
| | SSA-ACO-NNCT | 0.0174 / 0.0006 / 0.5682 / 0.9989 | 0.1480 / 0.0348 / 4.3457 / 0.9376 | 0.0803 / 0.0101 / 2.6024 / 0.9818 |
| | SSA-GA-NNCT | 0.0308 / 0.0016 / 2.7731 / 0.9889 | 0.0621 / 0.0056 / 2.0409 / 0.9899 | 0.1329 / 0.0261 / 4.2545 / 0.9532 |
| | SSA-PSO-NNCT | 0.0158 / 0.0005 / 0.5135 / 0.9991 | 0.0440 / 0.0033 / 1.4468 / 0.9941 | 0.0801 / 0.0098 / 2.5816 / 0.9824 |
| | Proposed | 0.0156 / 0.0004 / 0.5067 / 0.9991 | 0.0435 / 0.0031 / 1.4294 / 0.9943 | 0.0767 / 0.0093 / 2.4493 / 0.9833 |
| Dataset 2 | SSA-SA-NNCT | 0.1951 / 0.0709 / 4.7473 / 0.9888 | 0.3792 / 0.2458 / 9.3127 / 0.9614 | 0.3039 / 0.1646 / 7.9633 / 0.9741 |
| | SSA-ACO-NNCT | 0.1147 / 0.0214 / 2.9277 / 0.9966 | 0.1601 / 0.0518 / 3.8252 / 0.9918 | 0.2014 / 0.0713 / 5.7130 / 0.9888 |
| | SSA-GA-NNCT | 0.0972 / 0.0227 / 2.1252 / 0.9964 | 0.3573 / 0.2134 / 7.8370 / 0.9665 | 0.3476 / 0.2597 / 7.6108 / 0.9592 |
| | SSA-PSO-NNCT | 0.0518 / 0.0051 / 1.3954 / 0.9991 | 0.1060 / 0.0214 / 2.9977 / 0.9966 | 0.1863 / 0.0560 / 5.7998 / 0.9912 |
| | Proposed | 0.0453 / 0.0038 / 1.2235 / 0.9994 | 0.1028 / 0.0191 / 2.9543 / 0.9969 | 0.1819 / 0.0551 / 5.6233 / 0.9913 |
| Dataset 3 | SSA-SA-NNCT | 0.0406 / 0.0028 / 3.5775 / 0.9808 | 0.0509 / 0.0047 / 4.4664 / 0.9687 | 0.0964 / 0.0149 / 8.3017 / 0.9009 |
| | SSA-ACO-NNCT | 0.0199 / 0.0007 / 1.7773 / 0.9952 | 0.0513 / 0.0045 / 4.7902 / 0.9699 | 0.0861 / 0.0123 / 7.4389 / 0.9179 |
| | SSA-GA-NNCT | 0.1491 / 0.0422 / 2.7625 / 0.9900 | 0.0539 / 0.0051 / 4.6864 / 0.9657 | 0.0924 / 0.0133 / 7.7894 / 0.9116 |
| | SSA-PSO-NNCT | 0.0197 / 0.0007 / 1.6873 / 0.9953 | 0.0489 / 0.0041 / 4.2658 / 0.9727 | 0.0842 / 0.0116 / 7.1622 / 0.9228 |
| | Proposed | 0.0182 / 0.0006 / 1.5909 / 0.9960 | 0.0444 / 0.0034 / 3.9608 / 0.9769 | 0.0816 / 0.0113 / 7.1916 / 0.9247 |
| Dataset 4 | SSA-SA-NNCT | 0.1496 / 0.0395 / 2.8636 / 0.9906 | 0.3494 / 0.2039 / 6.8123 / 0.9519 | 0.5185 / 0.4460 / 9.4389 / 0.8949 |
| | SSA-ACO-NNCT | 0.1062 / 0.0214 / 1.9928 / 0.9949 | 0.3187 / 0.1748 / 6.1091 / 0.9588 | 0.4936 / 0.4069 / 8.9811 / 0.9041 |
| | SSA-GA-NNCT | 0.1491 / 0.0422 / 2.7625 / 0.9900 | 0.3403 / 0.2011 / 6.1707 / 0.9526 | 0.5308 / 0.4846 / 9.6930 / 0.8858 |
| | SSA-PSO-NNCT | 0.1066 / 0.0215 / 1.9808 / 0.9949 | 0.3120 / 0.1669 / 5.7963 / 0.9606 | 0.5069 / 0.4333 / 9.0767 / 0.8979 |
| | Proposed | 0.1025 / 0.0204 / 1.8978 / 0.9951 | 0.3071 / 0.1669 / 5.6554 / 0.9606 | 0.4860 / 0.3986 / 8.8072 / 0.9061 |
Table 7. Results of four evaluation metrics of the proposed model and four combined models employing different decomposition algorithms.
| Dataset | Model | Step 1 (MAE / MSE / MAPE / R²) | Step 3 (MAE / MSE / MAPE / R²) | Step 5 (MAE / MSE / MAPE / R²) |
|---|---|---|---|---|
| Dataset 1 | EMD-GPSOGA-NNCT | 0.0678 / 0.0073 / 2.2451 / 0.9869 | 0.0955 / 0.0141 / 3.0810 / 0.9746 | 0.1080 / 0.0186 / 3.5666 / 0.9665 |
| | CEEMDAN-GPSOGA-NNCT | 0.5403 / 0.4525 / 10.2478 / 0.8934 | 0.1002 / 0.0157 / 3.2823 / 0.9718 | 0.1287 / 0.0260 / 4.2743 / 0.9533 |
| | Proposed | 0.0156 / 0.0004 / 0.5067 / 0.9991 | 0.0435 / 0.0031 / 1.4294 / 0.9943 | 0.0767 / 0.0093 / 2.4493 / 0.9833 |
| Dataset 2 | EMD-GPSOGA-NNCT | 0.3035 / 0.1399 / 10.0201 / 0.9780 | 0.3914 / 0.2299 / 13.2703 / 0.9639 | 0.4955 / 0.3701 / 16.4526 / 0.9410 |
| | CEEMDAN-GPSOGA-NNCT | 0.2212 / 0.0835 / 6.2285 / 0.9868 | 0.3587 / 0.1992 / 11.9113 / 0.9687 | 0.4392 / 0.2878 / 15.2696 / 0.9548 |
| | Proposed | 0.0453 / 0.0038 / 1.2235 / 0.9994 | 0.1028 / 0.0191 / 2.9543 / 0.9969 | 0.1819 / 0.0551 / 5.6233 / 0.9913 |
| Dataset 3 | EMD-GPSOGA-NNCT | 0.0933 / 0.0193 / 8.7657 / 0.8715 | 0.1307 / 0.0365 / 11.7078 / 0.8578 | 0.1512 / 0.0478 / 13.8153 / 0.8831 |
| | CEEMDAN-GPSOGA-NNCT | 0.0834 / 0.0140 / 7.5605 / 0.9069 | 0.1455 / 0.0401 / 12.9240 / 0.8339 | 0.1524 / 0.0402 / 13.5900 / 0.8334 |
| | Proposed | 0.0182 / 0.0006 / 1.5909 / 0.996 | 0.0444 / 0.0034 / 3.9608 / 0.9769 | 0.0816 / 0.0113 / 7.1916 / 0.9247 |
| Dataset 4 | EMD-GPSOGA-NNCT | 0.5534 / 0.4928 / 10.3445 / 0.8839 | 0.7939 / 1.0773 / 14.9112 / 0.8462 | 0.8545 / 1.2587 / 16.2717 / 0.8435 |
| | CEEMDAN-GPSOGA-NNCT | 0.5543 / 0.4951 / 10.2425 / 0.8833 | 0.7760 / 1.0356 / 14.1822 / 0.8561 | 0.9265 / 1.4638 / 16.7090 / 0.8552 |
| | Proposed | 0.1025 / 0.0204 / 1.8978 / 0.9951 | 0.3071 / 0.1669 / 5.6554 / 0.9606 | 0.4860 / 0.3986 / 8.8072 / 0.9061 |
Table 8. DM test results of different models for four datasets.
| Experiment | Model | Dataset 1 (Step 1 / 3 / 5) | Dataset 2 (Step 1 / 3 / 5) | Dataset 3 (Step 1 / 3 / 5) | Dataset 4 (Step 1 / 3 / 5) |
|---|---|---|---|---|---|
| Experiment I | CBP | 9.275 / 10.205 / 10.004 | 6.853 / 6.521 / 6.493 | 7.92 / 10.189 / 9.130 | 9.056 / 9.266 / 8.843 |
| | RNN | 7.764 / 9.308 / 10.171 | 7.469 / 6.722 / 6.357 | 6.326 / 10.788 / 8.699 | 9.502 / 8.871 / 8.179 |
| | GRU | 8.455 / 9.822 / 10.738 | 9.937 / 6.775 / 7.671 | 6.474 / 8.622 / 8.633 | 8.888 / 8.489 / 8.601 |
| | CNNRNN | 9.495 / 10.595 / 10.722 | 9.553 / 7.794 / 6.696 | 7.35 / 7.855 / 8.664 | 9.053 / 9.088 / 8.702 |
| Experiment II | SSA-CBP | 9.295 / 12.117 / 9.866 | 7.187 / 6.721 / 6.858 | 5.025 / 6.966 / 11.857 | 5.041 / 5.078 / 5.399 |
| | SSA-RNN | 2.803 / 11.511 / 6.452 | 5.019 / 8.051 / 3.691 | 2.024 / 6.495 / 7.875 | 5.985 / 4.760 / 3.296 |
| | SSA-GRU | 4.829 / 8.776 / 9.577 | 4.849 / 7.709 / 5.123 | 4.610 / 4.882 / 5.634 | 3.602 / 3.728 / 4.467 |
| | SSA-CNNRNN | 6.186 / 9.033 / 8.046 | 5.68 / 5.256 / 5.411 | 4.959 / 8.408 / 6.591 | 8.086 / 3.809 / 6.779 |
| Experiment III | SSA-SA-NNCT | 13.121 / 8.378 / 6.505 | 7.027 / 7.111 / 4.789 | 7.022 / 3.886 / 4.081 | 5.669 / 2.083 / 2.747 |
| | SSA-ACO-NNCT | 3.694 / 6.014 / 2.313 | 8.169 / 4.787 / 3.145 | 2.410 / 4.019 / 2.412 | 1.702 / 1.901 / 1.803 |
| | SSA-GA-NNCT | 11.231 / 6.174 / 6.895 | 4.643 / 7.965 / 5.188 | 5.696 / 3.942 / 2.700 | 4.455 / 2.632 / 2.906 |
| | SSA-PSO-NNCT | 1.653 / 1.689 / 1.975 | 4.108 / 1.880 / 2.097 | 1.971 / 3.443 / 2.821 | 1.723 / 1.834 / 3.241 |
| Experiment IV | EMD-GPSOGA | 9.393 / 8.238 / 5.061 | 11.38 / 9.618 / 8.977 | 4.531 / 5.273 / 4.828 | 9.399 / 6.858 / 6.047 |
| | CEEMDAN-GPSOGA | 9.421 / 8.172 / 6.696 | 8.316 / 9.893 / 9.168 | 6.119 / 6.913 / 5.542 | 9.748 / 6.314 / 6.631 |
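The DM statistics in Table 8 compare each comparison model's forecast losses against the proposed model's; under the null hypothesis of equal accuracy the statistic is asymptotically standard normal, so values beyond 1.96 reject equality at the 5% level. A simplified sketch under a squared-error loss, omitting the long-run variance correction used for multi-step forecasts (the authors' exact variant is not restated here, so this is an assumption):

```python
import numpy as np

def dm_statistic(err_a, err_b, power=2):
    """Diebold–Mariano statistic for equal forecast accuracy.

    err_a, err_b: forecast-error series of two competing models.
    Uses the loss |e|^power; a positive statistic means model A's
    losses are larger, i.e. model B forecasts more accurately.
    """
    d = (np.abs(np.asarray(err_a, dtype=float)) ** power
         - np.abs(np.asarray(err_b, dtype=float)) ** power)
    return d.mean() / np.sqrt(d.var(ddof=1) / len(d))
```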
Table 9. AIC results of different models for four datasets.
| Experiment | Model | Dataset 1 (Step 1 / 3 / 5) | Dataset 2 (Step 1 / 3 / 5) | Dataset 3 (Step 1 / 3 / 5) | Dataset 4 (Step 1 / 3 / 5) |
|---|---|---|---|---|---|
| Experiment I | CBP | −442.5 / −199.1 / −131.5 | 59.8 / 112.3 / 160.3 | −384.3 / −125.3 / −41.3 | 285.9 / 391.2 / 430.8 |
| | RNN | −412.6 / −151.5 / −82.0 | −101.2 / 68.6 / 122.8 | −386.9 / −66.6 / −94.8 | 297.6 / 391.8 / 432.3 |
| | GRU | −397.1 / −188.3 / −44.1 | −93.0 / 32.3 / 244.9 | −380.8 / −165.8 / −121.5 | 279.4 / 398.3 / 424.6 |
| | CNNRNN | −351.4 / −151.3 / −56.6 | −95.2 / 45.8 / 153.2 | −343.6 / −157.3 / −124.2 | 284.3 / 386.6 / 420.1 |
| Experiment II | SSA-CBP | −794.2 / −476.9 / −463.6 | −53.2 / 211.8 / 79.0 | −1078.6 / −663.9 / −413.3 | −488.8 / −33.3 / 135.0 |
| | SSA-RNN | −1294.7 / −658.5 / −595.8 | −757.7 / −383.2 / −307.9 | −1202.8 / −767.4 / −551.9 | −429.4 / −32.2 / 83.1 |
| | SSA-GRU | −1204.5 / −707.8 / −472.0 | −675.0 / −393.0 / −185.2 | −1047.4 / −829.3 / −587.8 | −472.9 / −83.4 / 87.1 |
| | SSA-CNNRNN | −1151.0 / −663.2 / −590.5 | −236.9 / −257.3 / −181.4 | −1039.5 / −716.0 / −598.0 | −325.4 / −110.6 / 133.5 |
| Experiment III | SSA-SA-NNCT | −532.7 / −458.6 / −514.5 | −317.9 / −71.8 / −151.1 | −951.4 / −854.9 / −626.1 | −433.4 / −108.8 / 46.2 |
| | SSA-ACO-NNCT | −1265.7 / −845.8 / −702.5 | −554.6 / −379.9 / −316.6 | −1229.2 / −862.1 / −663.4 | −555.2 / −139.3 / 28.0 |
| | SSA-GA-NNCT | −508.2 / −819.9 / −515.5 | −543.3 / −99.8 / −60.9 | −1034.9 / −836.5 / −648.9 | −420.7 / −111.5 / 62.6 |
| | SSA-PSO-NNCT | −1309.3 / −924.9 / −709.8 | −835.6 / −554.7 / −364.4 | −1230.6 / −882.0 / −675.8 | −554.0 / −148.5 / 40.4 |
| Experiment IV | EMD-GPSOGA | −767.6 / −636.5 / −582.0 | −183.3 / −85.0 / 9.2 | −574.8 / −449.2 / −396.0 | 65.9 / 220.8 / 251.6 |
| | CEEMDAN-GPSOGA | −768.0 / −615.9 / −516.3 | −285.5 / −113.4 / −40.6 | −638.7 / −430.6 / −430.2 | 66.8 / 212.9 / 281.5 |
| | Proposed | −1318.9 / −933.0 / −719.7 | −896.9 / −576.9 / −367.8 | −1262.3 / −915.2 / −680.6 | −564.0 / −148.4 / 23.9 |
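The AIC values in Table 9 trade goodness of fit against model complexity, with lower values preferred. A common least-squares formulation is sketched below; whether the authors use exactly this variant, and what parameter count k they adopt, is not stated in the table, so both are assumptions:

```python
import math

def aic(mse, n_samples, n_params):
    """Akaike's information criterion in its least-squares form:
    AIC = n * ln(MSE) + 2k.

    Lower is better; when MSE << 1, ln(MSE) is strongly negative,
    which is why the best-fitting models in Table 9 show large
    negative AIC values.
    """
    return n_samples * math.log(mse) + 2 * n_params
```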
Table 10. NSE results of different models for four datasets.
| Experiment | Model | Dataset 1 (Step 1 / 3 / 5) | Dataset 2 (Step 1 / 3 / 5) | Dataset 3 (Step 1 / 3 / 5) | Dataset 4 (Step 1 / 3 / 5) |
|---|---|---|---|---|---|
| Experiment I | CBP | 0.932 / 0.769 / 0.675 | 0.925 / 0.902 / 0.875 | 0.664 / 0.243 / 0.901 | 0.647 / 0.400 / 0.267 |
| | RNN | 0.921 / 0.706 / 0.582 | 0.967 / 0.922 / 0.897 | 0.668 / 0.673 / 0.450 | 0.626 / 0.398 / 0.261 |
| | GRU | 0.915 / 0.756 / 0.494 | 0.965 / 0.935 / 0.809 | 0.658 / 0.131 / 0.267 | 0.659 / 0.378 / 0.290 |
| | CNNRNN | 0.893 / 0.706 / 0.525 | 0.966 / 0.930 / 0.880 | 0.587 / 0.577 / 0.250 | 0.650 / 0.414 / 0.306 |
| Experiment II | SSA-CBP | 0.989 / 0.943 / 0.939 | 0.958 / 0.839 / 0.917 | 0.990 / 0.918 / 0.710 | 0.993 / 0.930 / 0.835 |
| | SSA-RNN | 0.993 / 0.977 / 0.969 | 0.995 / 0.992 / 0.988 | 0.995 / 0.951 / 0.856 | 0.990 / 0.929 / 0.873 |
| | SSA-GRU | 0.996 / 0.982 / 0.942 | 0.998 / 0.992 / 0.978 | 0.988 / 0.964 / 0.880 | 0.992 / 0.945 / 0.871 |
| | SSA-CNNRNN | 0.998 / 0.978 / 0.968 | 0.983 / 0.985 / 0.978 | 0.988 / 0.937 / 0.886 | 0.984 / 0.952 / 0.837 |
| Experiment III | SSA-SA-NNCT | 0.957 / 0.938 / 0.953 | 0.989 / 0.961 / 0.974 | 0.981 / 0.969 / 0.901 | 0.991 / 0.952 / 0.895 |
| | SSA-ACO-NNCT | 0.998 / 0.991 / 0.982 | 0.997 / 0.992 / 0.989 | 0.995 / 0.970 / 0.918 | 0.993 / 0.959 / 0.864 |
| | SSA-GA-NNCT | 0.951 / 0.990 / 0.953 | 0.996 / 0.967 / 0.959 | 0.987 / 0.966 / 0.912 | 0.990 / 0.953 / 0.886 |
| | SSA-PSO-NNCT | 0.996 / 0.991 / 0.982 | 0.997 / 0.994 / 0.989 | 0.995 / 0.973 / 0.923 | 0.994 / 0.951 / 0.899 |
| Experiment IV | EMD-GPSOGA | 0.987 / 0.975 / 0.967 | 0.978 / 0.964 / 0.942 | 0.872 / 0.758 / 0.683 | 0.884 / 0.746 / 0.704 |
| | CEEMDAN-GPSOGA | 0.987 / 0.972 / 0.953 | 0.987 / 0.969 / 0.955 | 0.907 / 0.734 / 0.733 | 0.883 / 0.756 / 0.655 |
| | Proposed | 0.999 / 0.994 / 0.983 | 0.999 / 0.997 / 0.991 | 0.996 / 0.977 / 0.925 | 0.995 / 0.961 / 0.906 |
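The NSE values in Table 10 measure how much better each model is than simply predicting the mean of the observed wind speeds: 1 indicates a perfect match and 0 indicates no improvement over the mean. A minimal sketch of the coefficient:

```python
import numpy as np

def nse(observed, simulated):
    """Nash–Sutcliffe efficiency coefficient.

    1.0 = perfect forecast; 0.0 = no better than predicting the
    observed mean; negative = worse than the mean predictor.
    """
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    return 1.0 - (np.sum((observed - simulated) ** 2)
                  / np.sum((observed - observed.mean()) ** 2))
```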
He, Z.; Chen, Y.; Zang, Y. Wind Speed Forecasting Based on Phase Space Reconstruction and a Novel Optimization Algorithm. Sustainability 2024, 16, 6945. https://doi.org/10.3390/su16166945
