1. Introduction
Wind energy, as one of the most widely used renewable energy sources in modern power systems, has experienced rapid growth worldwide in recent years. According to the Global Wind Energy Council, worldwide installed wind capacity grew from 318 GW in 2013 to 1021 GW by 2023 [
1]. However, wind power is inherently random, volatile, and intermittent. Large-scale integration of wind power can introduce random power fluctuations into the grid and threatens the stability of system frequency [
2]. Crucially, in deregulated electricity markets, these fluctuations also impose significant economic burdens, as grid operators often enforce strict penalties for deviations between scheduled and actual power generation. Therefore, developing highly accurate and reliable wind power forecasting (WPF) technologies is crucial for enabling the scientific allocation of spinning reserve capacity, facilitating timely peak load regulation, and ensuring the secure and stable operation of the power grid [
3].
To achieve higher accuracy in the WPF domain, many WPF models have been developed. At present, WPF techniques are commonly categorized into five types: physical models, statistical models, artificial intelligence (AI)-based models, deep learning models, and hybrid models [
4].
Table 1 provides a clear overview of these five categories of models, and outlines their key methods, main contributions, limitations, and related references. Physical methods rely on numerical weather prediction (NWP) data and replicate the physical process of converting wind energy into electricity to develop forecasting models [
5]. They require highly accurate meteorological inputs and detailed turbine parameters, while the complexity of the modeling process restricts their application in ultra-short-term WPF. Statistical methods use historical data to uncover the correlations between input features and the output variable [
6]. Traditional statistical models, such as ARMA and ARIMA [
7], rely on linear assumptions, which limit their ability to capture the nonlinear characteristics of wind power data [
8].
With the rapid advancement of AI technologies, AI-based approaches have been increasingly applied to WPF. These methods are generally classified into machine learning and deep learning methods [
9]. Machine learning methods, including neural networks [
10,
11], extreme learning machines (ELMs) [
12], and support vector machines (SVMs) [
13], can model nonlinear relationships. However, their network structures make them prone to local optima, overfitting, and slow convergence [
14]. In contrast, deep learning models have become the mainstream choice for WPF due to their multilayer architectures and strong feature extraction capability [
15,
16]. Among them, convolutional neural networks (CNNs) [
17], temporal convolutional networks (TCNs) [
18], and long short-term memory networks (LSTM) [
19] have demonstrated strong performance in both wind power and wind speed forecasting. In particular, LSTM networks have gained widespread adoption in recent years because they effectively mitigate the gradient instability issues present in traditional recurrent neural networks (RNNs).
Wind power generation is susceptible to various external meteorological factors such as weather conditions and temperature, making it challenging for a single model to accurately capture the original feature information within wind power series. To address this limitation, hybrid models combining distinct techniques have been widely adopted to enhance prediction accuracy [
20]. Generally, these hybrid models can be categorized into two primary types. The first category involves applying signal decomposition techniques to partition the original signal, thereby mitigating the nonlinear characteristics of the raw wind power time series. Common decomposition methods include wavelet decomposition [
21], singular value decomposition [
22], and empirical mode decomposition along with its variants [
23]. These methods typically decompose the original wind power series into a sequence of sub-modes with different frequencies and model each sub-mode individually. However, a critical defect persists in this methodology. The decomposed high-frequency subsequences often retain significant noise and fluctuations, and directly predicting these components frequently leads to error accumulation. To mitigate this issue, existing literature typically adopts one of two approaches. The first is applying secondary decomposition to the high-frequency components [
24], and the second is directly discarding them [
25]. However, discarding components results in the loss of valid fluctuation details, whereas secondary decomposition inevitably imposes a substantial computational burden and significantly increases the training time.
The second category integrates optimization methods with deep learning predictors or decomposition algorithms to search for optimal hyperparameter settings, as the performance of hybrid models relies heavily on these configurations. For example, reference [
26] developed a hybrid PSO-CNN-LSTM model for short-term WPF, while reference [
27] utilized the firefly algorithm to adjust the parameter settings of the LSTM network to enhance the adaptability and stability of the model. Additionally, reference [
28] proposed a model that employs the Sparrow Search Algorithm (SSA) to optimize the VMD decomposition process. Nevertheless, a critical common limitation persists across these studies. These basic algorithms often suffer from insufficient population diversity and are prone to premature convergence when solving such high-dimensional and non-convex optimization problems. Consequently, the potential for maximizing model accuracy is hindered by the inability to find the global optimal parameters.
Table 1.
Classification of WPF models with their respective advantages and disadvantages.
Table 1.
Classification of WPF models with their respective advantages and disadvantages.
| Category | Model | Refs. | Advantages | Disadvantages |
|---|
| physical methods | NWP | [5] | Effective for long-term forecasting, ability to handle real-time observations. | Require high computational resources and large datasets. |
| Statistical methods | ARIMA-ANN | [6] | Computationally efficient and perform reliably with limited data. | Have limited capability in modeling nonlinear features. |
| ARIMA-Kalman | [7] |
| Machine learning models | BP | [10,11] | Capture complex patterns and non-linear features, adaptability to dynamic trends. | Require higher computation and careful parameter tuning, with risk of overfitting. |
| ELM | [12] |
| SVM | [13] |
| Deep learning models | CNN | [17] | Learn temporal structures and long-range dependencies. | Incur high computation cost and long training time, and remain sensitive to parameter tuning and overfitting. |
| TCN | [18] |
| LSTM | [19] |
| Hybrid models | PSO-CNN-LSTM | [21] | Combine complementary algorithms, resulting in improved generalization and forecasting ability. | More complex to design and dependent on larger, high-quality datasets. |
| FA-LSTM | [22] |
| WT-MLPNN | [23] |
| SVD-TCN | [24] |
| EMD-BaNN | [25] |
| VMD-ConvLSTM | [26] |
| Secondary Decomposition | [27] |
| VMD-IMPA-SVM | [28] |
Despite extensive research in this field, existing WPF models still have some research gaps that need to be filled:
- (1)
Most existing hybrid models rely on standard metaheuristic algorithms to determine key hyperparameters. However, these algorithms lack sufficient capability to escape local optima when addressing complex non-convex optimization problems, limiting the further improvement of prediction accuracy.
- (2)
The processing strategies for high-frequency components in decomposition-based models are often inefficient. Existing methods either sacrifice valid information by discarding components or incur excessive computational costs through secondary decomposition, lacking a strategy that balances efficiency and accuracy.
To address the identified research gaps, this study proposes a novel wind power forecasting framework based on an Improved Triangular Topology Aggregation Optimizer (ITTAO) and a high-frequency adaptive weighting strategy. The main contributions of this work are twofold. address the identified research gaps, this study proposes a novel wind power forecasting framework based on an Improved Triangular Topology Aggregation Optimizer (ITTAO) and a high-frequency adaptive weighting strategy. The main contributions of this work are twofold.
- (1)
An advanced ITTAO algorithm is proposed to solve high-dimensional non-convex optimization problems. By integrating Logistic–Tent chaotic mapping, the Golden-Sine strategy, and lens-imaging learning, ITTAO effectively overcomes the premature convergence of standard metaheuristics, ensuring precise global optimization for model hyperparameters.
- (2)
A high-frequency adaptive weighting strategy is designed to balance reconstruction accuracy and computational efficiency. By dynamically adjusting weights based on error stability, this strategy suppresses high-frequency noise and minimizes error accumulation without the computational burden of secondary decomposition or the information loss of discarding.
The remainder of this paper is organized as follows.
Section 2 describes the algorithmic principles employed in this study.
Section 3 outlines the construction process of the proposed forecasting model.
Section 4 introduces the data sources and presents the experimental results.
Section 5 summarizes the conclusions and key findings.
5. Conclusions
This study proposes an innovative ensemble forecasting method designed to achieve more accurate ultra-short-term WPF by integrating a multi-strategy improved TTAO, signal decomposition and reconstruction, and a high-frequency adaptive weighting technique. Compared with traditional hybrid models, this work optimizes both data decomposition and predictive modeling. Specifically, ITTAO overcomes premature convergence to achieve precise global parameter optimization for VMD and BiLSTM. Simultaneously, the high-frequency adaptive weighting strategy dynamically adjusts volatile sequences, simplifying the prediction task while facilitating the deep mining of intrinsic features.
Experimental results across four seasons demonstrate that the proposed ITTAO-VMD-BiLSTM model consistently outperforms five baselines. Notably, in the highly volatile January dataset, the model achieved optimal metrics with an NMAE of 1.33%, an NRMSE of 2.20%, and an of 98.18%. Compared to the classical BP neural network and the advanced CEEMDAN-BiLSTM model, the proposed method reduced the NMAE by 49.24% and 11.92%, decreased the NRMSE by 55.65% and 14.40%, and improved the by 6.46% and 0.77%, respectively. These quantitative findings confirm that the combined optimization and high-frequency adaptive weighting strategies effectively suppress noise and enhance prediction accuracy, establishing the model as a robust tool for wind power forecasting.
It is important to note that forecast accuracy is not merely a technical metric but a critical economic determinant. In modern electricity markets, large deviations between forecasted and actual power generation incur substantial financial penalties. Therefore, minimizing error metrics such as NMAE and NRMSE directly correlates with reduced penalty costs and optimized operational revenue. While this study focuses on the technical improvement of forecasting models, the translation of these accuracy gains into specific financial values remains a promising avenue for future research.