Next Article in Journal
Analysis of the Working Characteristics of the Ejector in the Water Heating System
Previous Article in Journal
A Study of the Vibration Characteristics of Flexible Mechanical Arms for Pipe String Transportation in Oilfields
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Wind Power Forecasting Model Using LSTM Optimized by the Modified Bald Eagle Search Algorithm

1
College of Water Conservancy and Hydro-Power Engineering, HoHai University, Nanjing 210098, China
2
College of Hydraulic and Civil Engineering, Xinjiang Agricultural University, Urumqi 830052, China
3
College of Energy and Electrical Engineering, HoHai University, Nanjing 210098, China
4
Institute of Technology, College of Mechanical Engineering Nanchang, Nanchang 330099, China
5
Clean Energy Branch of Huaneng International Power Jiangsu Energy Development Co., Nanjing 210098, China
*
Author to whom correspondence should be addressed.
Postal address: No. 1 Xikang Road, Nanjing 210098, China.
Energies 2022, 15(6), 2031; https://doi.org/10.3390/en15062031
Submission received: 20 January 2022 / Revised: 3 March 2022 / Accepted: 8 March 2022 / Published: 10 March 2022

Abstract

:
High-precision forecasting of short-term wind power (WP) is integral for wind farms, the safe dispatch of power systems, and the stable operation of the power grid. Currently, the data related to the operation and maintenance of wind farms mainly comes from the Supervisory Control and Data Acquisition (SCADA) systems, with certain information about the operating characteristics of wind turbines being readable in the SCADA data. In short-term WP forecasting, Long Short-Term Memory (LSTM) is a commonly used in-depth learning method. In the present study, an optimized LSTM based on the modified bald eagle search (MBES) algorithm was established to construct an MBES-LSTM model, a short-term WP forecasting model to make predictions, so as to address the problem that the selection of LSTM hyperparameters may affect the forecasting results. After preprocessing the WP data acquired by SCADA, the MBES-LSTM model was used to forecast the WP. The experimental results reveal that, compared with the PSO-RBF, PSO-SVM, LSTM, PSO-LSTM, and BES-LSTM forecasting models, the MBES-LSTM model could effectively improve the accuracy of WP forecasting for wind farms.

1. Introduction

The development of renewable energy can effectively reduce the deficiency of global energy and reduce environmental pollution [1]. Notably, wind energy has become one of the fastest-growing renewable energy sources [2], serving as an environmentally friendly and clean energy source that can meet the requirements of human sustainable development [3]. According to statistics related to WT (wind turbine) released by the World Wind Energy Association (WWEA) in early 2021, the total installed capacity of global WT in 2020 reached 744 GW [4].
Despite the extensive promotion of WP, bringing about significant economic benefits, wind farms are also facing several challenges. Since the wind farms’ output power can be volatile and uncertain, after the wind farm is connected to the grid, there will be certain disturbances to the safe operation of the power system [5]. To overcome such problems, accurate forecasting of the wind farm output power is necessary. Through forecasting, WP fluctuations can be determined in advance. As such, corresponding countermeasures can also be prepared in advance, and a reasonable power generation plan can be arranged, which not only ensures the safety of the grid, but also improves its reliability [6].
In recent years, researchers have conducted extensive research on WP forecasting using numerous mainstream models, including physical models [7], statistical models [8,9], and artificial intelligence (AI) models [10,11], among which AI models are the most widely studied.
AI models usually establish a high-dimensional nonlinear function to fit WP by minimizing training errors [12]. The more widely used methods are mainly machine learning (ML) methods and artificial neural network (ANN) methods. ML methods mainly include Support Vector Regression (SVR), Least Square Support Vector Machine (LSSVM), and Extreme Learning Machine (ELM) networks [13]. Kuilin Chen et al. [14] used an unscented Kalman filter (UKF) for integration with SVR to establish an SVR-UKF forecasting model, which improved the forecasting accuracy of the SVR model. The ANN acts as a parallel processor with the ability of efficiently storing and figuring out experimental knowledge; it is suitable for solving complex nonlinear problems [15]. D. Huang et al. [16] used GA and BP neural networks to forecast the WP of a wind farm. This GA-BP model was beneficial in improving the correctness of WP forecasting. P. Guo et al. [17] used GA to optimize the hyper-parameters of RBF and to predict the WP of a wind farm. As the theories and technology of neural networks have gradually developed and matured, more researchers have applied deep neural networks (DNN), such as convolutional neural networks (CNN) [18], deep belief networks (DBN) [19], and RNN [20] to predict the WP of wind farms.
LSTM as a form of RNN, has been demonstrated to be suitable for analyzing long series data, and there have been an increasing number of studies on the forecasting of WP based on LSTM networks. In order to construct an LSTM network model that meets the requirements, it is necessary to adjust the relevant hyperparameters, and researchers often set the parameters according to their actual experience and priori knowledge. For different problems, it may be necessary to repeatedly manually tune the relevant parameters.
To address such issues, the parameters of LSTM are optimized using the MEBS algorithm; thus, constructing the MBES-LSTM forecasting model. The experimental results reveal that the optimized forecasting model can better predict the trend of WP and provide technical support for the refinement of wind farm management. The main contributions are highlighted as follows:
(1)
The original BES algorithm was improved, and the improved BES algorithm was tested.
(2)
The wind farm data collected by the SCADA system was cleaned and filtered to form a sample set and processed by means of an empirical mode decomposition (EMD) method.
(3)
The parameters of the LSTM such as iteration number T i t e r , learning rate L R , number of the first layer L 1 , number of the second layer L 2 , were optimized by the MBES algorithm, and the MBES-LSTM forecasting model was subsequently constructed.
(4)
The processed WP data was used as the test sample with the PSO-RBF, PSO-SVM, LSTM, PSO-LSTM, BES-LSTM, and MBES-LSTM models to predict and compare performance.
The balance of this paper is organized as follows: Section 2 describes the LSTM, BES, and MBES algorithm. Evaluation index, MBES-LSTM forecasting model, SCADA data preprocess, and relevant parameter settings are introduced in Section 3. Section 4 provides some graphical results, along with analysis. Finally, Section 5 concludes this paper.

2. Description of LSTM, BES, and MBES

2.1. LSTM

LSTM is a special type of RNN, of which the hidden layer is composed of one or more memory cells, and each memory cell includes a forget gate, an input gate, and an output gate [21]; its structure is shown in Figure 1.
(1) Forget gate: The forget gate f t is responsible for controlling whether the long-term state c continues to be preserved, and is jointly determined by the input x t of the current moment t and the output h t 1 of the previous moment t 1 . The relevant formula is as follows:
f t = σ ( W f × [ h t 1 ,   x t ] + B f )
where W f represents the weight matrix of the forget gate; B f represents the bias term; and σ ( ) represents the sigmoid function.
(2) Input gate: The function of the input gate is to establish a new unit state c ˜ , perform related processing therein, and control how much information is added. The calculation formula is as follows:
i t = σ ( W i × [ h t 1 ,   x t ] + B i )
c ˜ t = t a n h ( W c × [ h t 1 ,   x t ] + B c )
where i t is the input gate result; W i represents the weight matrix of the input gate; B i represents the bias of the input gate; c ˜ represents the current input cell state; W c represents the weight matrix; and B c represents the bias of the cell state. The matrix [ h t 1 ,   x t ] is composed of two vectors, the output h t 1 at the previous moment t 1 and the input x t at the current moment t . σ ( ) is the sigmoid activation function and tanh ( ) is a double tangent function.
(3) Output gate: In the output gate, the output h t 1 of the previous moment t 1 and the input x t of the current moment t are used to output f t through a sigmoid function σ ( ) , namely:
O t = σ ( W o × [ h t 1 ,   x t ] + B o )
c t = f t × c t 1 + i t c ˜ t
h t = o t × t a n h ( c t )
where W o represents the weight matrix of the output gate; and B o represents the bias term of the output gate.
In the LSTM structure, due to the unique three-gate structure and the existence of the hidden state with storage function, LSTM can better reveal long-term historical data, thereby solving the issue of long-term dependence. First, the hidden state C t of the current moment t uses the forget gate to control which information in the hidden state C t 1 of the last moment t 1 needs to be discarded, and which information can continue to be retained. Second, the structure discards certain information in the hidden state C t and the forget gate, and learns new information through the input gate. Third, after a series of calculations, the cell state C t is updated. Here, LSTM uses the output gate, cell state C t , and a tanh layer to determine the final output value h t .

2.2. BES Algorithm

The BES is a new meta-heuristic optimization algorithm established in 2020 by H. A. Alsattar, who was inspired by the hunting behavior of bald eagles [22]. Bald eagles are found throughout North America and have sharp eyesight and significantly high observation ability during flight [23]. Taking salmon predation as an example, the bald eagle will first select a search space based on the distribution density of individuals and populations to salmon and fly towards a specific area; next, the bald eagle will search the water surface in the selected search space until a suitable prey is found; finally, the bald eagle will gradually change the flying altitude, dive down quickly, and successfully catch salmon and other prey from the water. The BES algorithm simulates the prey-preying behavior of this condor and divides it into three stages: select stage, search stage, and swooping stage, as shown in Figure 2.
(1) Selecting phase: In the selecting phase, the search area is randomly selected by the bald eagle looking for the area with the most prey. The updated description of the bald eagle’s position at this phase is:
L n e w i = L b e s t + α r ( L m e a n L i )
where, L n e w i represents the latest location of the i-th bald eagle, and L b e s t is the current optimal position; L i represents the location of the i-th bald eagle; L m e a n is the evenly distributed bald eagle’s position after the previous search; α [ 1.5 ,   2 ] is the parameter which controls the positions change; and r ( 0 ,   1 ) is an arbitrary number
(2) Searching phase: In the searching phase, the bald eagle flies in a conical spiral in the selected search space, looking for prey. In the conical spiral space, the bald eagle moves in a different course to accelerate its searching speed and find the best dive capture location. The updated description of the spiral flight position of the bald eagle is:
θ ( i ) = β × π × r a n d o m ( 0 , 1 )
r ( i ) = θ ( i ) + γ × r a n d o m ( 0 , 1 )
x × r ( i ) = r ( i ) × s i n [ θ ( i ) ]
y × r ( i ) = r ( i ) × c o s [ θ ( i ) ]
x ( i ) = x × r ( i ) m a x ( | x × r | )
y ( i ) = y × r ( i ) m a x ( | y × r | )
where θ ( i ) is the polar angle of the spiral equation; r ( i ) is the polar diameter of the spiral equation; β ( 0 ,   5 ) and γ ( 0.5 ,   2 ) are the parameters that control the spiral trajectory; x ( i ) and y ( i ) represent the bald eagle’s location in polar coordinates, with both values being (−1, 1). The bald eagle’s position is updated as follows:
L n e w i = L i + x ( i ) × ( L i L m e a n ) + y ( i ) × ( L i L i + 1 )
where L i + 1 is the next updated location of the i-th bald eagle.
(3) Swooping phase: After the bald eagle locks onto a target, it quickly dives from the best position and flies towards the target. The movement state of this phase is still used, and the polar coordinate equation can be denoted as follows:
L n e w i = r a n d ( 0 , 1 ) × L b e s t + δ x + δ y
δ x = x 1 ( i ) × ( L i λ 1 × L m e a n ) ,   λ 1 [ 1 , 2 ]
δ y = y 1 ( i ) × ( L i λ 2 × L b e s t ) ,   λ 2 [ 1 , 2 ]
where, λ 1 and λ 2 represents best position and center position, respectively.

2.3. MBES Algorithm

The MBES algorithm is an improved version of the original BES algorithm. In the selection phase, the parameter α in Formula (7) was optimized to no longer be a fixed value between [1.5, 2], which can be expressed by the following formula:
a t = e x p ( T t T ) 1
L n e w i = L b e s t + a t × r ( L m e a n L i )
where a t is the bald eagle’s position change control parameter; t represents the current iteration number; T is the maximum iteration number.
The MBES algorithm was tested and evaluated using different benchmark functions; the parameters of all compared algorithms, such as particle swarm optimization (PSO), grey wolf optimizer (GWO), whale optimization algorithm (WOA), BES, and MBES are shown in Table 1. Table 2 presents the name and parameter settings of the benchmark functions.
The relevant statistical experiment results of selecting each algorithm to run 20 times independently in the benchmark function are shown in Table 3 (unimodal benchmark functions), Table 4 (multimodal benchmark functions), and Table 5 (composite benchmark functions).
Figure 3 presents the qualitative metrics for the F1–F8 functions, including 2D views of the functions and convergence curves. For the convergence curves in Figure 3, the red line represents the MBES and the blue line represents the BES. As can be seen, the MBES’s speed of convergence is faster than the others.

3. WP Forecast

3.1. MBES-LSTM Forecasting Model

To reduce forecasting errors, SCADA data is decomposed using the EMD method after preprocessing. The flow chart of the MBES-LSTM WP forecasting model based on the EMD is shown in Figure 4, and the specific steps were as follows:
Step 1: The EMD method was used to decompose the pre-processed WP time series data and to decompose the data into the intrinsic mode function (IMF).
Step 2: The data of each IMF was divided into a training set and a test set.
Step 3: The decomposition data from each IMF was normalized.
Step 4: The MBES-LSTM model was used for training and forecasting, respectively, and in this step:
(1)
The LSTM { T i t e r , L R , L 1 , L 2 } parameter and the MBES algorithm parameters, including the bald eagle population N , the maximum iterations M , the upper limit of argument U b , the lower limit of argument L b , the dimension D , and the sample data were initialized.
(2)
The data of the fitness value was calculated. The mean square error obtained by training the LSTM network was used as the fitness value, and the value was updated in real-time as the bald eagle continued to operate; within the iteration range, Formulas (14), (15), and (19) were used to calculate the position of the bald eagle. If the current new position was better, the old position was updated.
(3)
According to the optimal parameter combination, the LSTM network was trained, and testing samples were used to forecast and save the result of each IMF.
Step 5: The results of each IMF from Step 4 were anti-normalized.
Step 6: By linearly superimposing the forecasting results of each IMF, the final forecasting result was obtained.
Step 7: The relevant evaluation metrics were calculated.

3.2. Data Preprocessing and EMD Decomposition

In the present study, the actual operational SCADA data of a wind farm in Yunnan, China, from 1 August 2018 to 31 August 2018, was selected for the analysis of the calculation examples. The time resolution of the data was ten minutes. After data cleaning, the first 2150 groups were taken as experimental samples. The relevant features of the samples are shown in Table 6.
To further reduce the nonlinearity and strong volatility characteristics of the WP signal and enhance the forecasting precision, the EMD algorithm was first used to decompose the preprocessed SCADA data and then perform related calculations. The EMD decomposition results of the original WP data are shown in Figure 5. A total of 10 IMF components and 1 remaining component were decomposed, and they are plotted in blue. Figure 5 shows that the time characteristic scale of the eigenmode function component increased from IMF1 to IMF10, and the frequency changed from high to low.

3.3. Parameters Setting

In the present study, PSO-RBF and PSO-SVM models were established, and algorithms such as PSO, BES, and MBES were used to optimize the hyper-parameters of the LSTM. The relevant parameter values are shown in Table 7.

3.4. Evaluation Indicators

To better assess the performance of the MBES-LSTM forecasting model, RMSE, MAE, MAPE, COV, CC, TIC, EC, and r2 were used in the present study. The specific expressions are as follows:
(1) The root mean square error (RMSE) indicates the deviation between the predicted and actual values.
R M S E = 1 N t = 1 N ( u a c t u a l t u p r e d i c t t ) 2
(2) The mean absolute error (MAE) reflects the actual situation of the errors, and this value also becomes larger when the error is large.
M A E = 1 N t = 1 N | u a c t u a l t u p r e d i c t t |
(3) The mean absolute percentile error (MAPE) is used to the measure forecast accuracy. Smaller MAPE values indicate that the model is more accurate in predicting
M A P E = 1 N t = 1 N | u a c t u a l t u p r e d i c t t u a c t u a l t |
(4) The coefficient of variance (COV) [24] reflects the degree of discretization of the data, with a larger value indicating a higher degree of data scattering.
C O V = R M S E u a c t m e a n × 100 %
(5) The correlation coefficient (CC) represents the relationship between the actual and predicted values. This value is close to 1 when the actual data is strongly correlated with the predicted data.
C C = N t = 1 N ( u a c t u a l t × u p r e d i c t t ) ( t = 1 N u a c t u a l t ) ( t = 1 N u p r e d i c t t ) [ N ( t = 1 N ( u a c t u a l t ) 2 ) ( t = 1 N u a c t u a l t ) 2 ] [ N ( t = 1 N ( u p r e d i c t t ) 2 ) ( t = 1 N u p r e d i c t t ) 2 ]
(6) The Theil’s inequality coefficient (TIC) [25], with a value range between [0, 1]. The smaller the TIC value, the better the prediction accuracy of the model.
T I C = 1 N t = 1 N ( u a c t u a l t u p r e d i c t t ) 2 1 N t = 1 N ( u a c t u a l t ) 2 + 1 N t = 1 N ( u p r e d i c t t ) 2
(7) The efficiency coefficient (EC) [26] is generally used to verify the goodness of fit of the model’s prediction results, and if the EC is close to 1, the prediction quality of the model is good.
E C = 1 1 N t = 1 N ( u a c t u a l t u p r e d i c t t ) 2 1 N t = 1 N ( u a c t u a l t u a c t m e a n ) 2
(8) The coefficient of determination r2 (r2) estimates the combined dispersion against the single dispersion of the observed and predicted series.
r 2 = ( t = 1 N [ ( u a c t u a l t u a c t m e a n ) × ( u p r e d i c t t u p r e m e a n ) ] t = 1 N ( u a c t u a l t u a c t m e a n ) 2 × t = 1 N ( u p r e d i c t t u p r e m e a n ) 2 ) 2
where u a c t u a l t represents the WP observation value at time t; u p r e d i c t t is the WP forecast value at time t; u a c t m e a n is the average WP observation; u p r e m e a n is the average WP forecast; and N is the number of samples in the u t sequence.

4. Experimental Results and Discussion

In WP forecasting, the setting of related parameters directly affects the forecasting precision of the LSTM network. In the present study, the PSO, BES, and MBES algorithms were used to optimize the hyper-parameters such as T i t e r , L R , L 1 , and L 2 . The LSTM-related parameter values and calculated errors in each IMF decomposition are shown in Table 8.
Table 9 indicates the Loss and RMSE of the LSTM, PSO-LSTM, BES-LSTM, and MBES-LSTM models in the training set and the testing set. An observation can be made that the MBES-LSTM model trends towards small Loss and RMSE value in the training set and the testing set.
To further reflect the advantages of the MBES-LSTM forecasting model, some models were contrasted with other models such as the Multi-Regress, LSTM, PSO-RBF, PSO-SVM, PSO-LSTM, and BES-LSTM models. Figure 6 shows a comparison diagram of the direct forecasting results of the models. It can be noted from Figure 6 that the prediction line of the multi-regress model was far away from the actual line, indicating that the prediction results were poor, while the prediction lines of other models were near the actual lines.
Figure 7 shows the forecasting results of different models based on the EMD algorithm.
In Figure 6 and Figure 7, the solid green line represent the actual values, while the remaining six colored lines represent the forecasted values. The closer their location to the green line, the higher the forecasting accuracy of the models. An observation can be made that the forecasting value line of the MBES-LSTM model was closer to the green line, which indicates that the forecasting precision of MBES-LSTM was the highest.
In order to more intuitively reflect the size of the forecasting error, a box-plot was drawn to graphically present the forecasting error, as shown in Figure 8. According to the box-plot, the MBES-LSTM model based on EMD showed less errors.
Formulas (20)–(27) were used to calculate the relevant evaluation indicators of each forecasting model. The calculation results are shown in Table 10, according to which proposed EMD-MBES-LSTM model had the optimal forecasting performance, exhibiting the highest performance indexes.
Figure 9 and Figure 10 describe the relevant evaluation index histograms. In Figure 9, the height of the blue column represents the RMSE of the Multi-Regress, LSTM, PSO-RBF, PSO-SVM, PSO-LSTM, BES-LSTM, and MBES-LSTM forecasting models; the height of the magenta column represents the MAE of every model; the height of the green column represents the TIC. In Figure 10, the height of the red column represents the EC, and the height of the green column represents the r2 of each model. From Figure 9 and Figure 10, an observation can be made that the MBES-LSTM model described in the present study showed the minimum evaluation error and highest evaluation coefficient, regardless of which evaluation standard was used.
A Taylor diagram of the forecasting models is shown in Figure 11. In Figure 11, point D is closest to point B, indicating that point D had the best performance metrics. Specifically, the MBES-LSTM model based on EMD had the optimal predictive performance.
The MBES-LSTM model based on EMD has many advantages. Compared with the other models, there was a significant improvement in the forecasting accuracy in terms of WP forecasting. The MBES-LSTM model can be summarized as follows:
(1) The EMD algorithm was a significant factor in data preprocessing, and it significantly improved the forecasting accuracy. As can be seen from Figure 6 and Figure 7 and Table 10, after EMD decomposition on LSTM, RMSE decreased by about 0.2751, MAE decreased by about 0.1607, and r2 increased by about 0.0649.
(2) The proposed algorithm was evaluated through several different benchmark functions and was applied in the parameter estimation of LSTM. As can be seen from Table 9 and Table 10, compared with the standard bald eagle algorithm, the modified bald eagle algorithm had a positive effect on LSTM hyper-parameter optimization. After optimizing the related parameters of LSTM with the modified bald eagle algorithm, RMSE decreased by about 0.0922, MAE decreased by about 0.0764, and r2 increased by about 0.0188.

5. Conclusions

Accurate forecasting of WP is integral for the safe dispatch of power systems and the operation management of wind farms. In the field of WP forecasting, LSTM is a commonly used in-depth learning algorithm. With the aim of solving the problem that the improper selection of LSTM related parameters { T i t e r , L R , L 1 , L 2 } may adversely affect the forecasting results of the LSTM, an MBES-LSTM WP short-term forecasting model was established in the present study.
(1) In the selection phase, the improvement of parameters is based on creating varied values for the learning parameter in each iteration, which helps to enhance the exploration of the MBES algorithm.
(2) The MBES algorithm was adopted to optimize the relevant parameters { T i t e r , L R , L 1 , L 2 } of the LSTM to form the MBES-LSTM model. In the WP forecasting test on the wind farm, the forecasting accuracy rate was better than that of the PSO-RBF, PSO-SVM, LSTM, PSO-LSTM, and BES-LSTM models.
Only historical WP data are used in the MBES-LSTM forecasting model. The WP is constantly affected by external factors, such as the wind direction, landforms, humidity, air temperature, and atmospheric pressure. Such factors will lead to rapidness, high nonlinearity, and uncertainty of WP changes, and thus, should be considered reasonable when establishing a multi-input forecasting model. In this way, the accuracy of WP forecasting can be further improved, which is also indicated for future study.

Author Contributions

Data curation, H.G., N.Z., Y.G.; methodology, C.X.; resources, L.G.; Investigation and writing, W.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Technology of the Peoples Republic of China (Grant No. 2019YFE0104800), the Special Training Plan for Minority Science and Technology Talents, the Natural Science Foundation of the Xinjiang Uyghur Autonomous Region (Grant No. 2020D03004), the National Natural Science Foundation of China (Grant No. U1865101), the Fundamental Research Funds for the Central Universities, and the Postgraduate Research and Practice Innovation Program of the Jiangsu Province (Grant No. B210201018).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support this manuscript are available from Chang Xu upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gu, B.; Zhang, T.; Meng, H.; Zhang, J. Short-term forecasting and uncertainty analysis of wind power based on long short-term memory, cloud model and non-parametric kernel density estimation. Renew. Energy 2021, 164, 687–708. [Google Scholar] [CrossRef]
  2. Xue, F.; Duan, H.; Xu, C.; Han, X.; Shangguan, Y.; Li, T.; Fen, Z. Research on the Power Capture and Wake Characteristics of a Wind Turbine Based on a Modified Actuator Line Model. Energies 2022, 15, 282. [Google Scholar] [CrossRef]
  3. Gu, C.; Li, H. Review on Deep Learning Research and Applications in Wind and Wave Energy. Energies 2022, 15, 1510. [Google Scholar] [CrossRef]
  4. WWEA. Worldwide Wind Capacity Reaches 744 Gigawatts–An Unprecedented 93 Gigawatts Added in 2020. Available online: https://wwindea.org/worldwide-wind-capacity-reaches-744-gigawatts/ (accessed on 24 March 2021).
  5. Donadio, L.; Fang, J.; Porté-Agel, F. Numerical Weather Prediction and Artificial Neural Network Coupling for Wind Energy Forecast. Energies 2021, 14, 338. [Google Scholar] [CrossRef]
  6. Pichault, M.; Vincent, C.; Skidmore, G.; Monty, J. Short-Term Wind Power Forecasting at the Wind Farm Scale Using Long-Range Doppler LiDAR. Energies 2021, 14, 2663. [Google Scholar] [CrossRef]
  7. Landberg, L. A mathematical look at a physical power prediction model. Wind Energy 1998, 1, 23–28. [Google Scholar] [CrossRef]
  8. Lydia, M.; Kumar, S.S.; Selvakumar, A.I.; Kumar, G.E.P. Linear and non-linear autoregressive models for short-term wind speed forecasting. Energy Convers. Manag. 2016, 112, 115–124. [Google Scholar] [CrossRef]
  9. Han, Q.; Meng, F.; Hu, T.; Chu, F. Non-parametric hybrid models for wind speed forecasting. Energy Convers. Manag. 2017, 148, 554–568. [Google Scholar] [CrossRef]
  10. Lodge, A.; Yu, X.H. Short Term Wind Speed Prediction Using Artificial Neural Networks. In Proceedings of the 2014 4th IEEE International Conference on Information Science and Technology, Shenzhen, China, 26–28 April 2014; pp. 539–542. [Google Scholar] [CrossRef]
  11. Kalogirou, S.; Neocleous, C.; Pashiardis, S.; Schizas, C.N. Wind speed prediction using artificial neural networks. In Proceedings of the European Symposium on Intelligent Techniques, Crete, Greece, 3–4 June 1999. [Google Scholar]
  12. Lorenzo, J.; Mendez, J.; Castrillon, M.; Hernandez, D. Short-Term Wind Power Forecast Based on Cluster Analysis and Artificial Neural Networks. In Proceedings of the 11th International Work-Conference on Artificial Neural Networks (Advances in Computational Intelligence, IWANN 2011, Pt I), Torremolinos-Málaga, Spain, 8–10 June 2011; Cabestany, J., Rojas, I., Joya, G., Eds.; Springer: Berlin/Heidelberg, Germany; Volume 6691, pp. 191–198. [Google Scholar]
  13. An, G.; Jiang, Z.; Cao, X.; Liang, Y.; Zhao, Y.; Li, Z.; Dong, W.; Sun, H. Short-Term Wind Power Prediction Based on Particle Swarm Optimization-Extreme Learning Machine Model Combined with Adaboost Algorithm. IEEE Access 2021, 9, 94040–94052. [Google Scholar] [CrossRef]
  14. Hur, S. Short-term wind speed prediction using Extended Kalman filter and machine learning. Energy Rep. 2021, 7, 1046–1054. [Google Scholar] [CrossRef]
  15. Elsheikh, A.H.; Sharshir, S.W.; Abd Elaziz, M.; Kabeel, A.E.; Guilan, W.; Haiou, Z. Modeling of solar energy systems using artificial neural network: A comprehensive review. Sol. Energy 2019, 180, 622–639. [Google Scholar] [CrossRef]
  16. Huang, D.; Gong, R.; Gong, S. Prediction of Wind Power by Chaos and BP Artificial Neural Networks Approach Based on Genetic Algorithm. J. Electr. Eng. Technol. 2015, 10, 41–46. [Google Scholar] [CrossRef] [Green Version]
  17. Guo, P.; Qi, Z.; Huang, W. Short-term wind power prediction based on genetic algorithm to optimize RBF neural network. In Proceedings of the 2016 Chinese Control and Decision Conference (CCDC), Yinchuan, China, 28–30 May 2016; pp. 1220–1223. [Google Scholar]
  18. Peng, X.; Li, Y.; Dong, L.; Cheng, K.; Wang, H.; Xu, Q.; Wang, B.; Liu, C.; Che, J.; Yang, F.; et al. Short-Term Wind Power Prediction Based on Wavelet Feature Arrangement and Convolutional Neural Networks Deep Learning. IEEE Trans. Ind. Appl. 2021, 57, 6375–6384. [Google Scholar] [CrossRef]
  19. Lin, Z.; Liu, X.; Collu, M. Wind power prediction based on high-frequency SCADA data along with isolation forest and deep learning neural networks. Int. J. Electr. Power Energy Syst. 2020, 118, 105835. [Google Scholar] [CrossRef]
  20. Chen, X.; Zhang, X.; Dong, M.; Huang, L.; Guo, Y.; He, S. Deep Learning-Based Prediction of Wind Power for Multi-turbines in a Wind Farm. Front. Energy Res. 2021, 9, 403. [Google Scholar] [CrossRef]
  21. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  22. Alsattar, H.A.; Zaidan, A.A.; Zaidan, B.B. Novel meta-heuristic bald eagle search optimisation algorithm. Artif. Intell. Rev. 2020, 53, 2237–2264. [Google Scholar] [CrossRef]
  23. Hansen, A.J. Fighting Behavior in Bald Eagles: A Test of Game Theory. Ecology 1986, 67, 787–797. [Google Scholar] [CrossRef]
  24. Elsheikh, A.H.; Panchal, H.; Ahmadein, M.; Mosleh, A.O.; Sadasivuni, K.K.; Alsaleh, N.A. Productivity forecasting of solar distiller integrated with evacuated tubes and external condenser using artificial intelligence model and moth-flame optimizer. Case Stud. Therm. Eng. 2021, 28, 101671. [Google Scholar] [CrossRef]
  25. Tian, Z.; Chen, H. A novel decomposition-ensemble prediction model for ultra-short-term wind speed. Energy Convers. Manag. 2021, 248, 114775. [Google Scholar] [CrossRef]
  26. Elsheikh, A.H.; Katekar, V.P.; Muskens, O.L.; Deshmukh, S.S.; Elaziz, M.A.; Dabour, S.M. Utilization of LSTM neural network for water production forecasting of a stepped solar still with a corrugated absorber plate. Process Saf. Environ. Prot. 2021, 148, 273–282. [Google Scholar] [CrossRef]
Figure 1. Structure of LSTM.
Figure 1. Structure of LSTM.
Energies 15 02031 g001
Figure 2. Diagram of the BES for the three main phases of hunting (S1—selecting phase; S2—searching phase; S3—swooping phase). (a) Selecting phase; (b) Search and swooping phase.
Figure 2. Diagram of the BES for the three main phases of hunting (S1—selecting phase; S2—searching phase; S3—swooping phase). (a) Selecting phase; (b) Search and swooping phase.
Energies 15 02031 g002
Figure 3. Qualitative metrics for the benchmark functions: 2D views and convergence curves of the functions. (a) F1; (b) F2; (c) F3; (d) F4; (e) F5; (f) F6; (g) F7; (h) F8.
Figure 3. Qualitative metrics for the benchmark functions: 2D views and convergence curves of the functions. (a) F1; (b) F2; (c) F3; (d) F4; (e) F5; (f) F6; (g) F7; (h) F8.
Energies 15 02031 g003aEnergies 15 02031 g003b
Figure 4. Forecasting process of the proposed model.
Figure 4. Forecasting process of the proposed model.
Energies 15 02031 g004
Figure 5. Diagram of original WP data and its EMD decomposition.
Figure 5. Diagram of original WP data and its EMD decomposition.
Energies 15 02031 g005
Figure 6. The forecasting results of different models.
Figure 6. The forecasting results of different models.
Energies 15 02031 g006
Figure 7. The forecasting results of different models based on EMD.
Figure 7. The forecasting results of different models based on EMD.
Energies 15 02031 g007
Figure 8. The box-plot of forecasting errors of the different models (1, Multi-Regress; 2, LSTM; 3, EMD-LSTM; 4, PSO-RBF; 5, EMD-PSO-RBF; 6, PSO-SVM; 7, EMD-PSO-SVM; 8, PSO-LSTM; 9, EMD-PSO-LSTM; 10, BES-LSTM; 11, EMD-BES-LSTM; 12, MBES-LSTM; 13, EMD-MBES-LSTM).
Figure 8. The box-plot of forecasting errors of the different models (1, Multi-Regress; 2, LSTM; 3, EMD-LSTM; 4, PSO-RBF; 5, EMD-PSO-RBF; 6, PSO-SVM; 7, EMD-PSO-SVM; 8, PSO-LSTM; 9, EMD-PSO-LSTM; 10, BES-LSTM; 11, EMD-BES-LSTM; 12, MBES-LSTM; 13, EMD-MBES-LSTM).
Energies 15 02031 g008
Figure 9. Evaluation error of different forecasting models.
Figure 9. Evaluation error of different forecasting models.
Energies 15 02031 g009
Figure 10. Evaluation coefficient results of different forecasting models.
Figure 10. Evaluation coefficient results of different forecasting models.
Energies 15 02031 g010
Figure 11. The Taylor diagram of the forecasting results (A: Multi-Regress; B: Actual; C: EMD-LSTM; D: EMD-MBES-LSTM; E: EMD-PSO-RBFF; F: PSO-RB; G: EMD-PSO-SVM; H: PSO-SVM; I: EMD-PSO-LSTM; J: PSO-LSTM; K: EMD-BES-LSTM; L: BES-LSTM; M: LSTM; N: MBES-LSTM).
Figure 11. The Taylor diagram of the forecasting results (A: Multi-Regress; B: Actual; C: EMD-LSTM; D: EMD-MBES-LSTM; E: EMD-PSO-RBFF; F: PSO-RB; G: EMD-PSO-SVM; H: PSO-SVM; I: EMD-PSO-LSTM; J: PSO-LSTM; K: EMD-BES-LSTM; L: BES-LSTM; M: LSTM; N: MBES-LSTM).
Energies 15 02031 g011
Table 1. Parameter value of PSO, GWO, WOA, BES, and MBES.
Table 1. Parameter value of PSO, GWO, WOA, BES, and MBES.
AlgorithmParameter Setting
Common settingMaximum iteration N = 200
population size: P = 10
Runs: r = 20
Probability Pr = 0.5
PSOV1 = 2
V2 = 2
W = 0.9
GWOm1 = 0.3
WOAP1 = 3
P2 = 5
BESα = 2
λ1 = 2, λ2 = 2
a = 10
R = 1.5
MBESλ1 = 2, λ2 = 2
a = 10
R = 1.5
Table 2. Parameter settings of test functions.
Table 2. Parameter settings of test functions.
Test FunctionFunction NameDRangefopt
(1) Unimodal test functions
F 1 ( x ) = i = 1 D ( x i + 0.5 ) 2 Step30[−100, 100]0
F 2 ( x ) = i = 1 D i x i 4 + r a n d o m [ 0 , 1 ) Quartic30[−1.28, 1.28]0
(2) Multimodal test functions
F 3 ( x ) = 20 + e 20 exp ( 20 ( 1 / D ) i = 1 D x i 2 ) exp ( ( 1 / D ) i = 1 D cos ( 2 π x i ) ) Ackley30[−32, 32]0
F 4 ( x ) = ( 1 / 4000 ) i = 1 D ( x i 100 ) 2 ( i = 1 D cos ( x i 100 i ) ) + 1 Griewank30[−600, 600]0
F 5 ( x ) = ( π / D ) { 10 sin 2 ( π y i ) + i = 1 D ( y i 1 ) 2 [ 1 + 10 sin 2 ( π y i + 1 ) ] + ( y D 1 ) } + i = 1 D u ( x i , 10 , 100 , 4 ) Penalized30[−50, 50]0
(3) Composite benchmark functions
F 6 ( x ) = ( ( 1 / 500 ) + j = 1 25 ( 1 / ( j + i = 1 2 ( x i a i j ) 6 ) ) ) 1 Foxholes2[−65.53, 65.53]0.998004
F 7 ( x ) = i = 1 4 ( c i exp ( j 1 6 a i j ( x i p i j ) ) 2 ) Martman 66[0, 1]−3.322
F 8 ( x ) = i = 1 10 [ ( X a i ) ( X a i ) T + c i ] 1 Langerman 104[0, 10]−10.5364
Table 3. Results for unimodal benchmark functions.
Table 3. Results for unimodal benchmark functions.
FunctionStatisticsPSOGWOWOABESMBES
F1Best3.462 × 10−11.122 × 10−13.667 × 10−42.341 × 10−160
Worst5.183 × 1036.135 × 1046.006 × 1047.288 7.279
Average6.021 × 1012.294 × 1022.117 × 1021.981 × 10−18.983 × 10−2
Std.3.101 × 1022.828 × 1032.854 × 1037.723 × 10−16.035 × 10−1
F2Best4.169 × 10−32.570 × 10−46.913 × 10−43.923 × 10−51.557 × 10−5
Worst6.976 × 1011.011 × 1029.312 × 1015.994 × 10−22.665 × 10−2
Average9.507 × 10−12.593 × 10−12.735 × 10−12.968 × 10−41.301 × 10−4
Std.7.123 3.973 4.054 2.175 × 10−31.011 × 10−3
Table 4. Results for multimodal benchmark functions.
Table 4. Results for multimodal benchmark functions.
FunctionStatisticsPSOGWOWOABESMBES
F3Best1.644 × 10−11.066 × 10−143.730 × 10−158.882 × 10−168.882 × 10−16
Worst1.876 × 1012.047 × 1012.057 × 1017.817 × 10−42.325 × 10−8
Average1.812 2.841 × 10−12.297 × 10−17.817 × 10−72.325 × 10−11
Std.2.241 1.889 1.731 2.472 × 10−57.353 × 10−10
F4Best4.4484 × 10−20000
Worst3.34756.3476 × 1025.4016 × 1021.37141.0657
Average3.7099 × 10−12.40482.02911.7522 × 10−31.0663 × 10−3
Std.3.9847 × 10−12.9049 × 1012.8352 × 1014.4912 × 10−23.3700 × 10−2
F5Best9.934 × 10−21.583 × 10−24.714 × 10−54.285 × 10−191.572 × 10−32
Worst8.002 × 1064.691 × 1084.621 × 1081.510 1.462
Average1.095 × 1041.074 × 1061.188 × 1061.265 × 10−28.612 × 10−3
Std.2.603 × 1051.875 × 1071.994 × 1079.636 × 10−28.345 × 10−2
Table 5. Results for composite benchmark functions.
Table 5. Results for composite benchmark functions.
FunctionStatisticsPSOGWOWOABESMBES
F6Best9.980 × 1012.570 1.048 5.633 4.702 × 10−1
Worst1.510 5.387 × 1013.579 × 1011.351 × 1011.150 × 101
Average9.989 × 10−12.669 1.199 5.690 4.738
Std.1.873 × 10−21.676 1.263 3.787 × 10−13.174 × 10−1
F7Best3.979 × 10−13.979 × 10−13.979 × 10−13.979 × 10−13.979 × 10−1
Worst4.269 × 10−18.117 × 10−18.530 × 10−17.572 × 10−16.451 × 10−1
Average3.980 × 10−13.997 × 10−13.997 × 10−13.988 × 10−13.983 × 10−1
Std.1.626 × 10−31.967 × 10−21.910 × 10−21.355 × 10−28.160 × 10−3
F8Best−1.0536 × 101−1.0536 × 101−1.0536 × 101−1.0536 × 101−1.0536 × 101
Worst−1.5670−1.0484−1.7075−2.7215−1.8866
Average−1.0344 × 101−8.7332−1.0424 × 101−1.0488 × 101−1.0495 × 101
Std.1.07421.93668.0145 × 10−13.8490 × 10−15.2253 × 10−1
Table 6. Attributes of training set and test set samples.
Table 6. Attributes of training set and test set samples.
Training Set Testing Set
number of samples2000 (pcs)150 (pcs)
minimum value0.5660 (MW)13.2090 (MW)
maximum value29.3640 (MW)29.3460 (MW)
average value13.2264 (MW)24.4299 (MW)
Table 7. The parameter values of PSO-RBF, PSO-SVM, PSO-LSTM, BES-LSTM, and MBES-LSTM.
Table 7. The parameter values of PSO-RBF, PSO-SVM, PSO-LSTM, BES-LSTM, and MBES-LSTM.
Model NameParameter NameParameter Value
PSO-RBFiterations N100
population P10
V13
V23
W0.6
b[0.585,0.3050,0.969]
c[0.748,0.278,−3]
[1.448,−0.227,3]
w[1,−1,1]
PSO-SVMiterations N100
population P10
V12
V22
W0.9
C87
g0.02
PSO-LSTMiterations N30
population P8
C14
C24
W0.9
lower board lb[1,1,1,0.001]
upper board ub [100,100,100,0.01]
BES-LSTMiterations N30
population P8
lower board lb[1,1,1,0.001]
upper board ub [100,100,100,0.01]
α2
MBES-LSTMiterations N30
population P8
lower board lb[1,1,1,0.001]
upper board ub [100,100,100,0.01]
α α c = e x p ( T t T ) 1
Table 8. The LSTM hyper-parameters value and errors based on PSO, BES, and MBES.
Table 8. The LSTM hyper-parameters value and errors based on PSO, BES, and MBES.
ModelEMDiterLRL1L2RMSEMAPE
PSO-LSTMIMF1430.00507910.7891 0.5498
IMF2950.005893250.2829 0.2046
IMF3520.007986650.1432 0.1125
IMF4880.008664340.2111 0.1589
IMF5690.004820620.1020 0.0784
IMF6700.003966320.2686 0.1637
IMF7480.006338100.1351 0.0854
IMF8720.006470400.5410 0.2376
IMF9920.009334930.3567 0.1505
IMF10860.005961950.3453 0.2160
Res330.007411670.5596 0.4408
BES-LSTMIMF1240.008771200.7884 0.5453
IMF2500.01100440.2782 0.2001
IMF3490.009797970.1266 0.0964
IMF4230.009854780.1097 0.0879
IMF5500.01100940.1098 0.0810
IMF6410.00824320.1945 0.0923
IMF7280.003252120.1179 0.0710
IMF8430.001196510.4455 0.2112
IMF9380.011001000.3035 0.1352
IMF10500.01561000.3198 0.1292
Res480.005792140.3602 0.1307
MBES-LSTMIMF1170.005863270.7909 0.5522
IMF2900.007096890.2829 0.2046
IMF3870.006289820.1145 0.0838
IMF4850.006751830.0995 0.0751
IMF5800.005243230.1020 0.0784
IMF6670.006343510.1856 0.0641
IMF7260.004056410.1006 0.0685
IMF8870.008829430.3528 0.1775
IMF9710.009334930.2875 0.1220
IMF10860.005961950.2804 0.1193
Res670.009254450.3492 0.0980
Table 9. Forecasting errors of LSTM, PSO-LSTM, BES-LSTM, and MBES-LSTM models.
Table 9. Forecasting errors of LSTM, PSO-LSTM, BES-LSTM, and MBES-LSTM models.
Model NameTraining SetTesting Set
RMSELossRMSELoss
LSTM0.19260.02462.0632 1.3091
PSO-LSTM0.17240.01481.7649 1.2462
BES-LSTM0.16130.01111.7079 1.1709
MBES-LSTM0.14510.01141.6158 1.0945
Table 10. Comparison of evaluation metrics.
Table 10. Comparison of evaluation metrics.
Forecasting ModelsRMSEMAECOVMAPETICNSER2CC
Multi-Regress3.0374 2.7887 0.1243 0.1143 0.0648 0.6486 0.6150 0.8212
LSTM2.0632 1.3091 0.0845 0.0592 0.0419 0.7761 0.7393 0.8810
EMD-LSTM1.7881 1.1484 0.0732 0.0539 0.0360 0.8103 0.8042 0.9001
PSO-RBF1.8815 1.3516 0.0770 0.0629 0.0381 0.7839 0.7832 0.8854
EMD-PSO-RBF1.6514 1.1329 0.0676 0.0532 0.0334 0.8441 0.8330 0.9188
PSO-SVM1.7441 1.1274 0.0714 0.0533 0.0352 0.8250 0.8137 0.9083
EMD-PSO-SVM1.5146 1.1253 0.0620 0.0513 0.0307 0.8654 0.8595 0.9302
PSO-LSTM1.7649 1.2462 0.0722 0.0568 0.0358 0.8117 0.8092 0.9009
EMD-PSO-LSTM1.7649 1.2462 0.0722 0.0568 0.0358 0.8117 0.8092 0.9009
BES-LSTM1.7079 1.1709 0.0699 0.0542 0.0346 0.8257 0.8213 0.9087
EMD-BES-LSTM1.1310 0.7508 0.0463 0.0337 0.0228 0.9234 0.9217 0.9609
MBES-LSTM1.6158 1.0945 0.0661 0.0517 0.0327 0.8506 0.8401 0.9223
EMD-MBES-LSTM1.0009 0.6550 0.0410 0.0310 0.0202 0.9391 0.9386 0.9691
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tuerxun, W.; Xu, C.; Guo, H.; Guo, L.; Zeng, N.; Gao, Y. A Wind Power Forecasting Model Using LSTM Optimized by the Modified Bald Eagle Search Algorithm. Energies 2022, 15, 2031. https://doi.org/10.3390/en15062031

AMA Style

Tuerxun W, Xu C, Guo H, Guo L, Zeng N, Gao Y. A Wind Power Forecasting Model Using LSTM Optimized by the Modified Bald Eagle Search Algorithm. Energies. 2022; 15(6):2031. https://doi.org/10.3390/en15062031

Chicago/Turabian Style

Tuerxun, Wumaier, Chang Xu, Hongyu Guo, Lei Guo, Namei Zeng, and Yansong Gao. 2022. "A Wind Power Forecasting Model Using LSTM Optimized by the Modified Bald Eagle Search Algorithm" Energies 15, no. 6: 2031. https://doi.org/10.3390/en15062031

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop