1. Introduction
Due to the worldwide consensus to attain carbon neutrality, expanding renewable energy sources has increasingly become the norm. Renewable energy like solar, wind, and hydro power produce much lower carbon emissions than traditional fossil fuel-based energy. Among renewable options, wind power stands out as having especially prominent low-carbon attributes since it generates fewer emissions than alternatives. Additionally, wind resources are widely available in large quantities, yielding significant reserves of wind power that can be tapped for energy production [
1]. Therefore, wind power has been increasingly valued by countries worldwide. In recent years, the new installed capacity of global wind power has shown a rapid upward trend, rising from 50.7 GW in 2018 to 93 GW in 2021. And as of 2021, the cumulative installed global wind power capacity had arrived at 837 GW. Moreover, according to the forecast of the Global Wind Energy Council, over the next five years (2022–2026), there will be an increase of 557 GW in installed wind power capacity globally, with a compound annual growth rate of 6.6%. By 2026, the new installed capacity of wind power globally will reach 128.8 GW [
2]. However, wind energy exhibits intermittency and stochastic fluctuations, where changes in WS (wind speed) directly affect the active power and reactive power of wind farms, resulting in unstable power output. With the rapid development of wind power generation, the issues caused by the unstable output power of wind farms, such as difficulties in power system dispatching and operation, as well as significant wind curtailment, have become increasingly prominent. These directly impact the safe, stable, and economically efficient operation of the power grid [
3,
4]. The impact of WS prediction on wind power generation is significant, as accurate forecasting allows for the better planning and management of wind farms. By predicting WS with precision, operators can optimize the operation of turbines, adjust the power output accordingly, and anticipate fluctuations in energy production. This leads to the improved efficiency, reduced costs, and increased reliability of wind power generation systems. Additionally, accurate WS prediction enables grid operators to integrate wind energy more effectively into the overall energy mix, contributing to a more sustainable and reliable power supply. Therefore, accurate WS prediction can effectively and economically solve problems caused by the instability of wind power and wind power integration problems.
WS forecasting can be classified from different angles. According to the forecasted time horizon, WS forecasts can be categorized into ultra-short-term, short-term, medium-term, and long-term forecasts [
5]. And the ultra-short term is a few seconds to 30 min, the short term is 30 min to 6 h, the medium term is 6 h to a day, and the long term is a day to a few weeks. In addition, WS forecasts can also be categorized into physical models, statistical models, and artificial intelligence models according to different prediction models [
6]. The physical model is based on computational fluid dynamics and is built using detailed information such as meteorological information, the topographic features of the location of the wind farm, and surface roughness. Therefore, the NWP (numerical weather prediction) model is a typical physical model. Feng et al. [
7] proposed a physics-based prediction method using WS as input data for numerical weather forecasting. Moreover, Liu et al. [
8] greatly enhanced the precision of wind power prediction by improving the accuracy of NWP. And Li et al. [
9] proposed a correction strategy for NWP to further enhance the precision of wind power prediction. The biggest advantage of a physical model is that it does not require historical operation data from wind farms, and it is suitable for newly built farms or those with incomplete data. However, physical models rely on rich and accurate meteorological, terrain, and other environmental data, and need to simulate the changes in meteorological factors such as WS and direction under the local effects of wind farms. The modeling complexity of physical models is high, and there are many uncertain links. Due to the limitations of model accuracy and simulation ability, systematic deviations may occur. Because of the high computational cost of numerical weather forecasting, the physical model based on numerical weather forecasting is not suitable for short-term WS forecasting, so it is mostly used for medium- and long-term WS forecasting [
10]. Statistical models are established by studying the mapping relationship between input data and output data in historical data, and their main processes include model identification, parameter estimation, and model validation. Traditional statistical models include the Kalman filter model [
11], AR (autoregressive) model [
12], MA (moving average) model [
13], ARMA (autoregressive moving average) model [
14], and ARIMA (autoregressive integrated moving average) model [
15]. Compared to physical models, statistical models have simple methods and use a single type of data, but their ability to process sudden changes is poor. At the same time, this model type requires a large amount of historical operating data, so it is not suitable for newly built or incomplete wind farms. In addition, the performance of statistical models deteriorates as the prediction time increases, and it is difficult to obtain excellent forecasting results for nonlinear and unstable WS data [
16]. Moreover, the artificial intelligence model is derived from statistical methods and is built through the repeated learning and training of data relationships based on large amounts of historical data. And common artificial intelligence models include the SVM (support vector machine) [
17], BP (backpropagation) neural network [
18], deep learning model [
19,
20,
21], DBN (deep belief network) [
22], reinforcement learning model [
23], and broad learning models [
24]. Nevertheless, a single artificial intelligence model has the disadvantages of being easily trapped in local minima, overfitting, underfitting, etc., and cannot fully capture all information about WS sequences. In order to overcome these shortcomings and make full use of the advantages of a single model and various methods, building a combined model to predict WS has gradually become a popular topic of research [
25].
At present, the most common combined forecasting methods are the combination of different models and the combination of models and algorithms. In terms of different model combinations, Neto et al. [
26] constructed a hybrid model that combines linear statistical and artificial intelligence (AI) forecasters, which can effectively overcome the shortcomings of single models. Moreover, Cheng et al. [
16] combined the four models of BP, a random vector functional link network, ENN (evolutionary neural network), and GRNN (generalized regression neural network) to form a new hybrid model, which can better utilize the characteristics of each model and make up for the inadequacy of a single model. Moreover, Shahzad et al. [
27] proposed a new combined model based on ARAR (autoregressive–autoregressive) and ANN (artificial neural network) models, which was experimentally demonstrated to better predict WS time series datasets. And Liang et al. [
28] combined the RNN (recurrent neural network) and CNN (convolutional neural network) to propose a new method. On the other hand, there are also many relevant studies on the combination of models and algorithms. According to the different functions of algorithms, they can be divided into two categories: optimization algorithms and decomposition methods. Because of their strong versatility, high efficiency, and simple implementation, swarm intelligence optimization algorithms have become a popular topic of research in the field of artificial intelligence and intelligent computing [
29,
30] and have been widely used in the parameter optimization of WS prediction models. For instance, there are some common optimization algorithms that have been combined with models, which include PSO (particle swarm optimization) [
31], the WOA (whale optimization algorithm) [
32], the DA (dragonfly algorithm) [
33], GWO (grey wolf optimization) [
34], etc. WS data have the characteristics of strong nonlinearity and non-stationarity. Signal decomposition methods have been gradually introduced into WS prediction in recent years because of their outstanding role in denoising and extracting signal features in complex signal processing. They are mainly used in combination with various models for data preprocessing. In WS prediction, common decomposition methods include EMD (empirical mode decomposition) [
35], VMD (variational mode decomposition) [
36], EEMD (ensemble empirical mode decomposition) [
37], CEEMD (complementary ensemble empirical mode decomposition) [
38], etc. Ibrahim et al. [
39] proposed a combined prediction model based on AD-PSO, Guided WOA, and LSTM (long short-term memory); the experiment showed that the LSTM optimized by AD-PSO and Guided WOA has strong predictive performance. Altan et al. [
40] proposed a combined prediction model based on a single model, namely, LSTM, a decomposition method, namely, ICEEMDAN (improved complementary ensemble empirical mode decomposition with adaptive noise), and an advanced optimization algorithm, namely, GWO; the combined model could better capture the nonlinear characteristics of WS data and had good prediction accuracy.
While combination models integrate multiple individual models and techniques to leverage their respective strengths and compensate for limitations, existing combination models still have some issues. First, determining optimal model integration and weighting is challenging when combining multiple model types. Simply combining various models often does not significantly improve combination model performance. Currently, there is a lack of comparative analysis on the roles of various types of techniques in composite model research. The theoretical construction of composite models needs further improvement. Second, when combining models with optimization algorithms, most existing combination models directly introduce the algorithm without improvements, limiting the algorithm’s functionality. Third, when combining models with decomposition methods, decomposition parameter settings significantly impact results. Existing models often use experiential parameter settings rather than effectively optimizing parameter combinations. Finally, combination models’ complex structures and numerous parameters generally cause long prediction times. These issues directly affect model prediction performance.
Based on the above analysis, the primary factors that impact the performance of a WS prediction model are as follows: First, WS data preprocessing is inadequate. WS data sources contain much noise and interference, making complete and precise data collection challenging. Existing preprocessing cannot effectively reduce noise and extract time series information. Second, model parameter optimization is insufficient. The many parameters in combination models often make conventional intelligent optimization algorithms inaccurate and slow in application, negatively affecting model prediction. Optimization algorithm convergence, search efficiency, and parameter settings need further development and improvement. Finally, combination model construction is unreasonable. WS characteristics like strong randomness, intermittency, and volatility require models with robust adaptive abilities. Existing models lack integrated advanced data mining and parameter optimization technologies for performance optimization. Therefore, it is evident that existing models have certain limitations and fail to meet the increasing demand for WS prediction performance, especially in terms of selecting and optimizing individual techniques within combination models, leaving significant room for improvement. To address these issues, this paper proposes a hybrid ultra-short-term WS forecasting model based on a modified pelican optimization algorithm. Firstly, targeting the common problems of swarm intelligence optimization algorithms, such as being easily trapped in local optima and insufficient optimization precision, an MPOA (modified pelican optimization algorithm) is introduced. This algorithm enhances the diversity of the initial population by incorporating chaotic mapping, improves the algorithm’s local exploration capability by introducing Lévy flight strategy, and dynamically adjusts critical parameters to enhance adaptability, thereby achieving overall performance enhancement in algorithm optimization. Secondly, to address the challenges of parameter abundance and the difficulty in combination optimization in VMD, a combined VMD key parameter optimization method using the MPOA and sample entropy is proposed, enabling the deep exploration of WS’s nonlinear variation characteristics. Finally, to tackle the problem of numerous hyperparameters in LSTM and the difficulty in achieving parameter optimization manually, the MPOA is applied to optimize the hyperparameters of LSTM, resulting in a significant improvement in LSTM’s predictive performance. Additionally, this paper conducts comparative simulation experiments between the proposed combination model and other models using WS data from a wind farm in eastern China, validating the predictive performance of the model.
The primary contributions of this paper include the following:
A modified pelican optimization algorithm is proposed. First, the tent map is applied for population initialization and the Lévy flight factor is introduced. Second, algorithm applications to different dimensional problems are classified and optimized. Finally, improved algorithm optimization performance is verified through test functions. The proposed modified pelican optimization algorithm not only effectively enhances the algorithm’s global and local search capabilities but also provides a new avenue for optimizing swarm intelligence algorithms;
The original WS data are modally decomposed by VMD. Through the use of sample entropy as an evaluation metric to determine K (the modal decomposition number) for VMD, and of RMSE (root mean square error) as the objective function for the MPOA, the alpha (moderate bandwidth constraint) and tau (noise tolerance) of the other two parameters of VMD are optimized. Applying VMD with the optimized parameter combination for WS data decomposition significantly reduces the nonlinearity and instability characteristics of the original WS data, effectively exploring the features of WS data. By introducing the composite evaluation metrics of sample entropy and RMSE in the parameter optimization of the WS signal decomposition strategy VMD, the data preprocessing technique effectively removes noise while retaining the main features of the original signal sequence, significantly improving the effectiveness of WS prediction;
The MPOA is applied to optimize LSTM model hyperparameters including the iteration number, first LSTM layer neuron number, second LSTM layer neuron number, and learning rate. Through hyperparameter optimization, the MPOA significantly strengthens LSTM predictive capabilities;
A combined ultra-short-term WS forecasting model based on the modified pelican optimization algorithm is proposed, combining the MPOA, VMD, and LSTM. The WS prediction process based on the combined model is constructed. The proposed VMD-MPOA-LSTM model provides a new effective method to improve ultra-short-term WS prediction model performance;
A comprehensive and effective evaluation of the developed combined model’s performance is carried out. The evaluation system uses three experiments and six performance indicators to effectively assess prediction accuracy and stability. Experimental analysis specifically determines optimization algorithm and data decomposition method impacts on the prediction model, providing a theoretical and practical basis for combined model construction.
The rest of this paper is organized as follows: In
Section 2, the methodology is introduced, including optimization algorithms, data preprocessing techniques, and LSTMs and their combined models. And in
Section 3, the experimental procedures of the proposed combined model are illustrated, and the comparative experimental results of different models are shown. Additionally, the precision and validity of the proposed model are discussed and validated in
Section 4. Concluding remarks are then provided in
Section 5.
5. Conclusions
Wind power generation technology has been rapidly advancing, leading to increases in installed wind power capacity and its contribution to overall power supply. Hence, the impact of wind farms on reliable grid operation is becoming increasingly pronounced. However, the strong time variation in, nonlinearity of, and complexity of WS render the prediction effect unsatisfactory. Therefore, this paper proposes a combined ultra-short-term WS forecasting model utilizing a modified pelican optimization algorithm, composed of the WS decomposition methods VMD, MPOA, and LSTM. Firstly, techniques including tent map-based population initialization and Lévy flight strategy are introduced to form an improved POA, whose superior optimization performance is verified by comparison with other popular optimization algorithms. Secondly, given the difficulty in selecting the objective function in current VMD parameter optimization, the optimal number of decompositions in VMD is determined by the minimum sample entropy value. Concurrently, taking RSME as the objective function, the optimal parameter combination of k, alpha, and tau in VMD is obtained through the MPOA, effectively enhancing the accuracy of VMD signal reconstruction after WS signal decomposition. Additionally, the MPOA is applied to LSTM hyperparameter optimization to attain the optimal combination. Finally, single-month WS data from an eastern China wind farm were selected, where WS fluctuations during this month were most frequent and drastic compared to during other months of the year. The proposed combined prediction model is compared, analyzed, and evaluated against other models through the aforementioned three experiments and six evaluation indicators. The experiments focus on the quantitative analysis of utilizing optimization algorithms and decomposition methods to enhance the prediction capabilities of the combined model, providing a theoretical basis and practical guidance for constructing hybrid models moving forward. The experimental results demonstrate that the combined model achieves a MAPE of 6.49% and R2 of 0.9803, with optimal values on other metrics. This signifies that the combined model possesses superior predictive capabilities compared to conventional single models and earlier combined models. Additionally, through further, concurrent optimization of the parameters of both the decomposition method and the single model, the optimization algorithm, decomposition method, and single model can be organically integrated, markedly improving the WS predictive capabilities of the combined model. This study provides an effective approach to further enhance the predictive capabilities of combined models.
In future work, the following issues should be studied further:
- (1)
The combination of an optimization algorithm, decomposition method, and neural network needs to be further studied based on the mechanism, and effective methods to realize the in-depth integration of different technologies need to be explored. At the same time, it is necessary to integrate more and more advanced technologies into the combination forecasting model to further enhance the predictive capabilities of the combination model;
- (2)
Despite experimental verification of its superior predictive performance, a key limitation of the current model is its prolonged computation time. Reducing calculation time and improving forecasting efficiency should be priority research directions moving forward. Parallel computing, algorithm optimization, model simplification, and data preprocessing will likely be effective solutions to reduce computation time and improve the computational efficiency of the combined model;
- (3)
The model predicts WS through historical data and only considers the magnitude of WS. In fact, meteorological parameters like wind orientation, temperature, moisture, and barometric pressure impact the precision of predictions. In the future, it is necessary to consider combining these external factors to establish a multi-input prediction model for further enhancing WS forecasting accuracy;
- (4)
The proposed model will be tested in additional actual wind farm scenarios and compared with more prediction models to further validate its robustness and adaptability;
- (5)
For the MPOA, although the improvement methods and experimental validation results are detailed, there is a lack of theoretical analysis of the improved algorithm. A mathematical proof of the convergence of the improved algorithm and the analysis of the mechanisms that enhance the model’s prediction performance will be important directions for future research.