1. Introduction
As global warming intensifies and energy crises escalate, the need to seek clean and sustainable energy solutions has become an international consensus. Wind power generation, as a green energy technology, is becoming key to the global energy transition due to its renewable nature and low carbon emissions [
1].
Figure 1 provides a detailed depiction of the workflow of a modern wind turbine, from the capture of wind energy to the output of electricity. The left side of the image illustrates the main components of a wind turbine. Wind is captured by the turbine blades and converted into mechanical energy through transmission gears, which drive the generator to produce electricity. The right side of the image then describes the further processing and distribution of electricity. Power is first transmitted to a charge controller, then flows to a battery bank for storage or directly to an inverter, which supplies power to the grid or directly to the load. According to data released by the Global Wind Energy Council (GWEC) [
2], global cumulative installed wind power capacity grew from 433 GW in 2015 to 906 GW in 2022, with an annual compound growth rate of 11.12%. Accurate wind power prediction is crucial for realizing precise scheduling and efficient power supply, thus making wind power prediction technology essential. However, the variability and intermittency of wind power present significant challenges to the planning, operation, and control of power systems [
3].
Wind power prediction methods are primarily divided into physical models, statistical models, and machine learning algorithms. Physical models depend on numerical weather predictions (NWP), utilizing meteorological data and surface information to solve equations of fluid dynamics and thermodynamics for predicting wind speed and direction, and subsequently calculating the power output of wind farms [
4]. However, these models are computationally complex and sensitive to initial conditions, which limits their application in short-term high-precision forecasting. Statistical methods predict by establishing a relationship between input and output based on historical data [
5]. Despite being computationally efficient, statistical models have drawbacks such as difficulty in model order selection and sensitivity to climate changes, which limit their effectiveness in long-term applications and environments with severe climate variability.
With the development of artificial intelligence technology, potential applications in energy forecasting have emerged. Traditional machine learning methods such as Artificial Neural Networks (ANN), Support Vector Machines (SVM), Extreme Gradient Boosting (XGBoost), and Multi-Layer Perceptrons (MLPs) have been widely utilized. The literature [
6,
7,
8] proposes hybrid neural network architectures combining modal decomposition, optimization algorithms, and Extreme Learning Machines (ELM) to further improve prediction accuracy. The literature [
9] introduces a novel wind power curve model integrating Isolation Forest anomaly detection, Asymmetric Fuzzy Mean Radial Basis Function Neural Network modeling, and meta-heuristic algorithm optimization. Although these traditional machine learning methods have high training efficiency, they still require extensive feature engineering and have limitations in handling long-term dependency relationships. Deep learning, as a branch of machine learning, has shown significant advancements in the field of sequence data prediction. The literature [
10,
11,
12] proposes models integrating Recurrent Neural Networks (RNN) and Long Short Term Memory (LSTM) based on reinforcement learning, a wind power prediction method based on Gated Recurrent Units (GRU), and a model combining Self-Attention Temporal Convolutional Networks (SATCN) with LSTM for wind power prediction, respectively. These models aim to enhance prediction accuracy and reduce computational costs. Despite their strong fitting capabilities, they face issues like vanishing gradients, exploding gradients, and sensitivity to noise, which can lead to poor prediction outcomes.
In recent years, due to the ability of Graph Neural Networks (GNNs) to effectively model interactions between nodes through weighted edges, their application in wind power forecasting has gradually increased. The literature [
13] proposed a Multi-dimensional Spatio-Temporal Graph Neural Network, which combines Wind Transformer for single-point wind speed prediction and utilizes GNNs to integrate wind speed information from multiple points in the spatial dimension, thereby improving wind speed prediction accuracy. Additionally, another study [
14] combined Graph Convolutional Networks (GCN) with multiresolution convolution neural networks to dynamically extract spatial and temporal features among input variables, thus enhancing prediction accuracy. The literature [
15] combined GNN with an improved Bootstrap technique to model the spatio-temporal characteristics between wind farms and meteorological factors, improving the precision and reliability of ultra-short-term predictions. The literature [
16] integrated GNN with a Gated Dilated Inception Network, specifically considering the blockage effects between wind turbines, which further improved prediction accuracy.
With the introduction of the Vanilla Transformer [
17], the attention mechanism it employs has greatly enhanced the model’s ability to handle long-distance sequence dependencies, heralding a new era in time series prediction. Models that integrate attention mechanisms and Transformer-based variants have become the focus of current research. The literature [
18,
19,
20,
21] demonstrates that integrating attention mechanisms with models such as CNN, LSTM, and GRU has enhanced the accuracy of wind power prediction to varying degrees. The Vanilla Transformer, due to its need to compute the relevance of every position in the input sequence to all others, exhibits high complexity. In contrast, the Informer [
22] utilizes the Kullback–Leibler divergence distribution measure to filter out dominant queries on the order of
, reducing complexity and designing a generative decoder to directly produce results, avoiding cumulative errors associated with one forward-step predictions. However, its reliance on single-step time calculations may result in the loss of crucial information under highly volatile wind power data. The Autoformer [
23] attempts to decompose seasonal-trend features and auto-correlation attention for patch-level connections, but its essentially manual design may fail to capture all semantic information within the patch. The FEDformer [
24], by transforming time series into the frequency domain to compute attention, achieves linear computational complexity, though this method may overlook detailed temporal dependencies, affecting prediction accuracy. The iTransformer [
25] applies attention mechanisms and feed-forward networks in inverted dimensions, effectively capturing multivariate correlations and learning nonlinear representations.
Through in-depth observation of wind power sequence data, we found that power data present diverse time patterns at different sampling scales. For instance, wind power data recorded hourly reveal intra-day variations in wind speed, while data sampled daily highlight seasonal fluctuations. From a macro perspective, quarterly climate trends dominate the patterns of annual average power. These observations suggest that understanding complex temporal variations requires a multi-scale analysis approach, which can capture both short-term changes and long-term climate patterns, thereby offering a more comprehensive perspective for wind power prediction. Regarding references [
13,
14,
15,
16], most of the GNNs primarily focus on modeling the interactions between wind turbines within wind farms from a spatial dimension. These methods typically adopt a data-driven approach for constructing the graph structure; while capturing correlations between time series to some extent, they exhibit limited flexibility as they often only learn a specific type of dependency between variables. Consequently, they struggle to effectively handle complex temporal features across different time scales. This paper, from the perspective of capturing dependencies at various scales, combines a multi-scale temporal graph neural network with an adaptive graph learning module and optimizes the synergistic effects of features across different time scales through a fusion module. Furthermore, we introduce an improved temporal convolutional network, which enhances prediction capability by aggregating information from different scales and leveraging the complementarity of multi-scale observations.
The contributions of this paper are as follows:
- (1)
We developed a multi-scale frequency decomposition (MSF-Decomp) module that effectively extracts seasonal and trend changes from data of different sampling sizes, transforming them into high and low-frequency components for independent modeling.
- (2)
The Multi-Scale Temporal Graph Convolutional Network (MST-GCN) was designed to use low-frequency components as inputs, capturing correlations across multi- scale sequences.
- (3)
A Bidirectional Temporal Gated Convolution Network (Bi-TGCN) was introduced, utilizing high-frequency components to effectively handle dependencies within multi-scale sequences.
- (4)
By using a multi-head cross-attention mechanism to fuse the prediction results of two models and comparing them with several benchmark models, the advantages of our method in terms of robustness and accuracy were validated.
The remainder of this paper is organized as follows:
Section 2 provides a detailed explanation of the proposed model’s construction and key techniques.
Section 3 introduces the data sources and preprocessing steps.
Section 4 covers the experimental design, result analysis, and performance comparison of the model. Finally,
Section 5 summarizes the research findings and limitations and discusses future research directions.
5. Conclusions
Due to the uncertainty and high volatility of wind power, the accuracy of wind power prediction is often unsatisfactory. To address this issue, this paper proposes a new prediction model. First, the MSF-Decomp module is used to decompose the wind power data into high- and low-frequency components, mitigating the impact of wind power uncertainty. Then, feature extraction is performed using the Bi-TGCN and MST-GCN networks, and the results are fused to obtain the final prediction. Thanks to its excellent multi-scale processing capabilities, the proposed model, in comparison experiments, reduced the MSE error by an average of 7.1% compared to state-of-the-art models and by an average of 48.9% compared to traditional models. The results demonstrate that the proposed model can effectively capture and integrate data dynamics across different time scales, showing significant practical value in short-term data analysis and mid-to-long-term decision support and proving its advantages in handling complex forecasting tasks.
Although the wind power prediction model proposed in this paper performs excellently in handling multi-scale data and capturing temporal dependencies, it still has some limitations. Firstly, the model’s ability to predict zero or negative wind power values has not been fully optimized, which is crucial for ensuring reliability in practical applications as these values can represent critical operational scenarios. Secondly, due to the integration of MSF-Decomp, Bi-TGCN, and MST-GCN, the complexity of the model might pose challenges in real-time processing, particularly when deployed in online systems that require quick decision-making. Finally, although the model shows higher accuracy and stability compared with the baseline model, different application scenarios in actual operation still need to be considered.
Future research will focus on optimizing the model’s accuracy in predicting zero values and negative power and exploring methods to enhance the precision of mid-to-long-term forecasts. Additionally, we plan to develop an online wind power prediction system for integration into actual power scheduling systems, thus further advancing the practicality of wind power technology.