Next Article in Journal
CNNRes-DIndRNN: A New Method for Detecting TLS-Encrypted Malicious Traffic
Next Article in Special Issue
An Efficient Internet-Wide Port Scan Algorithm Based on Deep Reinforcement Learning
Previous Article in Journal
ICU-Transformer: Multi-Head Attention Expert System for ICU Resource Allocation Robust to Data Poisoning Attacks
Previous Article in Special Issue
ICRSSD: Identification and Classification for Railway Structured Sensitive Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research and Application of a Model Selection Forecasting System for Wind Speed and Theoretical Power Generation

1
Guangzhou Power Supply Bureau, Guangdong Power Grid Co., Ltd., Guangzhou 510520, China
2
School of Mechatronic Engineering, Guangdong Polytechnic Normal University, Guangzhou 510450, China
*
Author to whom correspondence should be addressed.
Future Internet 2026, 18(1), 7; https://doi.org/10.3390/fi18010007
Submission received: 15 November 2025 / Revised: 17 December 2025 / Accepted: 17 December 2025 / Published: 22 December 2025

Abstract

Accurate short-term wind speed forecasting is essential for mitigating wind power variability and supporting stable grid operation. This study proposes a model selection forecasting system (MSFS) that dynamically integrates six deep learning models to enhance predictive accuracy and robustness. Using multi-turbine data from a wind farm in northwest China, the framework identifies the optimal model at each time step through iterative evaluation and retrains the selected models to further improve performance. The Kruskal–Wallis test shows that all forecasting models, including MSFS, maintain statistical consistency with the real wind speed distribution at the 95% confidence level. Uncertainty analysis demonstrates that MSFS more reliable forecasting interval. By coupling MSFS-derived wind speed forecasts with turbine-specific power curves, the system enables reliable theoretical power estimation, offering critical reference information for dispatch planning, reserve allocation, and distinguishing resource-driven variability from turbine performance deviations. The slightly conservative yet highly stable forecasting behavior of MSFS reduces overestimation risks and enhances decision reliability. Overall, the proposed MSFS framework provides a robust, interpretable, and operationally valuable solution for short-term wind energy forecasting, with strong potential for wind farm operation and power system management.

1. Introduction

By the end of 2024, global wind power capacity had continued to expand, reaching approximately 1136 GW, thereby consolidating wind energy as a cornerstone in the transformation of the global energy structure (Figure 1 illustrates the overall trajectory of global wind power development). Within this process, the top ten wind power countries made substantial contributions, with China standing out by achieving 520.6 GW of cumulative installed capacity and 79.8 GW of annual additions, accounting for 45.8% and nearly 68% of the global totals, respectively. This not only reinforced China’s leading position but also underscored its pivotal role in shaping the global wind power landscape. Importantly, China’s dominance is not confined to sheer scale; it is equally reflected in the widespread deployment of onshore projects, the rapid rise of offshore wind, and the sustained export of technology and manufacturing capabilities. Meanwhile, countries such as the United States, Germany, India, Brazil, the United Kingdom, Spain, and France continue to play critical roles within their respective regions, collectively driving the multipolar evolution of global wind power [1].
Nevertheless, the expansion of installed capacity does not automatically translate into the efficient utilization of wind resources. The decisive challenge lies in ensuring the reliable and stable integration of wind power into electricity grids, which remains an unresolved issue [2]. The inherent variability and intermittency of wind speed introduce significant uncertainty into grid integration, thereby constraining both the effective absorption of wind energy and the flexibility of system scheduling. As a result, wind speed forecasting has emerged as a central focus of research, aiming to enhance forecasting accuracy as a means to mitigate the destabilizing effects of wind fluctuations on grid operation, and ultimately to improve the controllability and security of energy systems. However, the nonlinear and complex nature of wind speed data continues to pose formidable challenges, limiting the capacity of existing methods to fully capture its dynamic patterns and constituting a fundamental bottleneck to methodological innovation and theoretical advancement in this field.
To address these challenges, a wide variety of wind power forecasting approaches have been developed, which can generally be categorized into four main categories: (1) Statistical models, such as autoregressive integrated moving average (ARIMA), Kalman filters, and regression approaches, which effectively capture temporal dependencies but remain inadequate for nonlinear and non-stationary characteristics [3]; (2) Numerical weather forecasting (NWP) models, which simulate atmospheric dynamics and thermodynamics through physical equations, thereby offering physically consistent large-scale meteorological inputs, yet with high computational costs and limited adaptability to local contexts [4]; (3) Spatio-temporal models, including graph neural networks (GNNs), spatio-temporal convolutions, and Transformers, which capture interdependencies across turbines and sites, significantly improving predictive accuracy, though requiring high-quality data and complex modeling [5]; (4) Artificial intelligence (AI)-based models, covering deep learning, ensemble learning, and hybrid optimization approaches, which provide powerful nonlinear modeling and feature extraction capabilities but are often criticized for limited interpretability, lack of physical consistency, and sensitivity to data drift [6].
Although different forecasting approaches each offer unique strengths, no single method can simultaneously satisfy the requirements of accuracy, physical plausibility, interpretability, and real-time applicability. Statistical models are transparent but less accurate; NWP models are physically consistent but computationally expensive; spatio-temporal models improve performance but suffer from limited transferability; and AI models provide strong nonlinear learning but face interpretability and credibility challenges. As a result, research has increasingly shifted toward multi-model fusion. By combining the interpretability of statistical models, the physical consistency of NWP, the dependency-modeling capability of spatio-temporal methods, and the nonlinear representational power of AI, hybrid forecasting frameworks have emerged as a more robust and practical solution for large-scale wind integration and low-carbon energy development [7].
Recent studies further highlight the growing importance of hybrid statistical–AI methodologies. Wang et al. [8] reviewed uncertainty-modeling strategies and interval metrics, outlining key challenges and future directions. Building on this, innovative frameworks have been proposed to address the nonlinear and uncertain nature of wind data. For example, Zhao et al. [9] introduced an interpretable contrastive learning model (ICoTF), while Li et al. [10] developed a deep fuzzy inference system to manage asymmetric and heavy-tailed errors. Chen et al. [11] combined Bayesian wavelet denoising with deep Gaussian processes, significantly improving accuracy across multiple wind farms. He et al. [12] addressed concept drift via an online probabilistic LSTM framework, and Gao et al. [13] embedded physical power-distribution constraints into LSTM training.
Advances have also been made in privacy-preserving learning and real-time adaptation [14]. Wang and Zhou [15] introduced a federated learning scheme for multi-site forecasting, whereas Nayak et al. [16] proposed an online ensemble regression model for continuous data streams. Additional contributions include Bayesian DLMs for hydrogen-related forecasting [17] and statistical downscaling frameworks for complex-terrain wind modeling [18].
Collectively, these works show that no single technique can ensure accuracy, interpretability, robustness, and physical consistency simultaneously. Instead, hybrid and ensemble methodologies—integrating statistical modeling, physics-based constraints, spatio-temporal learning, and AI—have become the dominant direction for reliable renewable-energy forecasting.
Spatio-temporal (ST) models, in particular, have emerged as an important complement to these methods. By jointly capturing temporal dynamics and spatial dependencies, ST models provide a more comprehensive representation of renewable-energy variability. Zhao et al. [19] developed a multi-site Transformer framework with gated fusion and frequency-enhancement modules, while Ma et al. [20] proposed an ST graph network (MFSGN) to model static and dynamic correlations among turbine clusters. Both significantly outperform traditional approaches. High-resolution regional forecasting has also benefited from deep learning, with Resifi et al. [21] introducing a hybrid recursive–downscaling strategy, Zhang and Yin [22] developing a PCA-enhanced U-Net architecture, and Zhang et al. [23] presenting a multimodal ST-DFNet incorporating meteorological and topographical factors for accurate multi-step prediction.
Further, spatio-temporal models have been applied to offshore wind and interpretability-focused studies. Hu et al. [24] presented a dual-branch attention neural network (DBANN) for offshore wind forecasting, which separately captures spatial and temporal dependencies before combining them in a fusion module, yielding large error reductions in real-world datasets. Y. N. Zhao et al. [25] proposed a framework to interpret LASSO regression by quantifying spatio-temporal correlations, revealing how feature collinearity and spatial distribution impact predictive accuracy. Verdone et al. [26] reviewed advances in multi-site forecasting, emphasizing that integrating spatial information not only improves predictive performance but also enhances model robustness, while noting the pressing need for standardized ST benchmarks to facilitate cross-study comparisons.
Collectively, these studies highlight that spatio-temporal models—often integrated with attention mechanisms, graph learning, and multimodal feature fusion—represent a critical pathway toward more accurate, robust, and interpretable wind forecasting. By combining the ability to model temporal dependencies, capture inter-site correlations, and incorporate diverse data sources, ST-based frameworks extend the capabilities of traditional approaches and provide strong support for applications such as grid scheduling, offshore wind integration, and renewable energy planning. The convergence of spatio-temporal learning with statistical, physics-informed, and AI-based methods thus forms a promising research frontier for achieving high-penetration renewable integration.
Numerical Weather Prediction (NWP) models provide physically consistent, large-scale meteorological inputs and are indispensable for medium- and long-term renewable energy forecasting. However, their high computational cost, limited local adaptability, and cumulative forecast errors motivate the development of hybrid approaches. Recent studies increasingly integrate NWP with statistical, physical, and deep learning methods to enhance forecasting accuracy, robustness, and interpretability.
Wan et al. [27] proposed the NWP-Time2Vec-xLSTM-TCCNN model, which incorporates NWP-derived variables and a hybrid xLSTM–CNN architecture. This method effectively captures nonlinear temporal features and achieved notable improvements, reducing MAE by 12.28–20% for photovoltaic forecasting and 10.33–11.53% for wind power forecasting. Ignatev et al. [28] refined NWP-based forecasts through localized statistical calibration and incorporated operational turbine data to simulate site-specific power output, demonstrating higher reliability across diverse conditions. Michalakopoulos et al. [29] further integrated NWP inputs, physical turbine specifications, and power curve equations into a Temporal Fusion Transformer (TFT). Their framework yielded substantial performance gains, including RMSE reductions of up to 60% and R2 values exceeding 99%.
Overall, the hybridization of NWP with statistical inference, physical modeling, and deep learning effectively compensates for the limitations of standalone models. By combining large-scale atmospheric information with local corrections and nonlinear learning, these integrated frameworks deliver more accurate, robust, and physically coherent wind power forecasts, thereby supporting grid planning, renewable integration, and market optimization.
Artificial intelligence (AI) models has become a cornerstone of data-driven wind resource modeling due to its strong nonlinear representation capability. However, challenges related to interpretability, robustness under nonstationary conditions, and cross-dataset adaptability remain. Recent work therefore increasingly combines AI models with decomposition techniques, optimization algorithms, frequency-domain analysis, graph learning, and interpretability frameworks, forming hybrid approaches with improved accuracy and stability.
Xu et al. [30] provided a cross-dataset benchmark for neural networks in wind power forecasting, showing that autoregressive models perform more stably in short-term horizons, while neural networks excel in long-term forecasting. Hybrid decomposition–AI models have also demonstrated strong gains. Dong et al. [31] applied CEEMD–Hilbert decomposition with LSTM, while Ullah et al. [32] combined TVFEMD, DWT, GRU, and Transformer architectures for multiscale feature extraction. Other studies further integrate VMD, graph convolution, LSTM, evolutionary optimization, or PatchTST to better capture complex, nonstationary wind patterns [33,34,35]. Frequency-domain enhancement has also been explored, such as the dual-path frequency Mamba-Transformer of Hong et al. [36], which reduced RMSE by more than 60%.
Interpretability has become another major direction. Methods combining feature selection, TFT, SHAP [37] analysis, and MVMD-based [38,39] decomposition have provided more transparent forecasting while maintaining accuracy. Ensemble approaches such as ISI-Net [40] also support both interpretability and intelligent model selection. Optimization and ensemble learning continue to reinforce AI frameworks. Examples include offshore short-term models based on EEMD-BO-BiGRU [41], feature-enhanced xLSTM systems with strong transferability [42], and hybrid point–interval forecasting frameworks optimized by multi-objective strategies [43].
In addition to traditional forecasting approaches, recent advances in model and algorithm selection have emerged as an important research direction closely related to this study. Model selection is a fundamental component of automated machine learning (AutoML), where the goal is to automatically identify the most suitable model or ensemble for a given task or data regime [44]. Contemporary AutoML frameworks incorporate a variety of selection strategies, including meta-learning, Bayesian optimization, reinforcement learning, evolutionary search, and multi-objective model merging [45]. For example, the HM3 framework introduces hierarchical multi-objective model merging to integrate pretrained models efficiently, while recent surveys highlight the increasing role of evolutionary computation and neural model selection in large-scale systems [46,47]. Although these methods demonstrate strong adaptability, they are typically designed for offline optimization, require substantial computational resources, and do not account for the nonstationarity and fine-grained temporal dynamics characteristic of wind speed forecasting [48]. In contrast, the proposed MSFS framework provides a lightweight, time-step-wise model selection strategy tailored to rapidly evolving wind conditions, thereby extending AutoML concepts into a real-time, domain-specific forecasting context.
In this study, the six selected forecasting models were chosen to represent the major architectural paradigms that have demonstrated strong performance in nonlinear and nonstationary wind speed forecasting. Hybrid convolution–recurrent structures (CNN-LSTM, CNN-GRU) effectively capture local temporal patterns and sequential dependencies, while dilated temporal convolutions combined with recurrence (TCN-LSTM) provide multi-scale receptive fields for modeling long-range dynamics. The LSTM–XGBoost hybrid integrates deep temporal feature extraction with nonlinear ensemble learning, offering robustness under varying data regimes. Attention-based models such as the Transformer have recently emerged as state-of-the-art for capturing global dependencies, and graph-enhanced architectures (GNN-TCN) further incorporate spatial interactions among turbines. Together, these models offer complementary representational strengths that are well-suited for evaluating a dynamic model-selection framework. Overall, these studies show that AI alone cannot fully satisfy requirements for accuracy, robustness, interpretability, and generalizability. Hybrid AI frameworks—integrating decomposition, optimization, graph learning, frequency-domain modeling, and interpretability modules—have emerged as a dominant direction. Meanwhile, effective wind forecasting increasingly relies on multi-model designs that combine statistical, physical, spatio-temporal, and AI methods. By incorporating multivariable drivers such as meteorology, spatial heterogeneity, and turbine characteristics, these integrated frameworks more accurately capture wind-to-power dynamics, thereby improving forecasting reliability, supporting grid integration, and advancing low-carbon energy systems.
Building on these insights, the main contributions of this paper are summarized as follows:
Multi-model ensemble framework for wind speed forecasting. We employ multiple forecasting models in parallel and identify the time-varying optimal model using absolute error as the selection criterion, thereby exploiting complementary strengths across regimes to improve accuracy and robustness.
Dynamic model selection mechanism. We introduce an online learning–based retraining strategy for the identified optimal models and develop a classification model that automatically selects the most suitable predictor at each time point, enabling intelligent and adaptive model choice.
Multivariable wind speed–power conversion model. We construct a conversion curve that augments the traditional wind speed–power relationship with temperature and altitude, yielding a more faithful representation of the nonlinear mapping from wind speed to power output and enhancing engineering applicability.
Uncertainty analysis for the conversion process. We incorporate probabilistic modeling to quantify uncertainty in predicted power, explicitly accounting for environmental perturbations and model error, thereby strengthening decision support for grid scheduling, risk management, and energy optimization.
Operational integration via theoretical power forecasting. By combining wind speed forecasts with the multivariable conversion model, we derive theoretical power forecasting that provide actionable guidance for dispatch, generation planning, and large-scale wind integration, improving wind-farm operational efficiency and grid stability

2. Conceptual Framework of the MSFSC Model

In the MSFSC design, four principal elements are integrated, consisting of forecasting models, a classifier, and an algorithmic module for optimization.

2.1. Framework of Predictive Models and Classifiers

This section details six advanced sub-models utilized in the model selection component of wind speed prediction. The suite of models comprises Transformer [49], CNN-LSTM [50], TCN-LSTM [51], CNN-GRU [52], LSTM-XGBoost [53], and GNN-TCN [54]. These algorithms are recognized for their robustness, computational effectiveness, and strong forecasting capability. Furthermore, the Transformer is applied as an auxiliary tool to improve the selection of candidate models [55].

2.2. Classification-Integrated Framework for Model Selection Forecasting

To address the challenges of accuracy, adaptability, and reliability in wind speed and power forecasting, this study proposes an integrated forecasting framework that combines data-driven deep learning models with physical and probabilistic modeling techniques. The proposed framework is designed to capture the complex temporal dependencies of wind speed, adaptively select the most suitable forecasting model, and quantify forecasting uncertainty to enhance its practical applicability for wind farm operation and grid management. The methodological process consists of seven main steps, as illustrated in Figure 2.
Step 1 Data Collection and Pre-training: In this study, operational data from six wind turbines were collected, each containing 115 days of 10 min wind speed measurements. The first 108 days were used for model pre-training through 100 rolling-window iterations. In each iteration, a fixed window of 1008 consecutive samples was assigned as the training set, followed by the next 144 samples as the test set. The entire window was then shifted forward by 144 samples (equivalent to one day of data), and the process was repeated until 100 windows had been evaluated. This rolling-window strategy preserves temporal causality, avoids data leakage, and enables model assessment under diverse wind regimes. For each window, six advanced deep learning models (CNN-LSTM, Transformer, TCN-LSTM, CNN-GRU, LSTM-XGBoost, and GNN-TCN) were independently trained and tested. The model with the smallest absolute error on the test portion of each window was recorded as the optimal model label. These labels are used solely for training the classifier and do not involve running all six models during real-time forecasting. Accordingly, for wind turbine i at rolling-window iteration T , the constructed input-output dataset is formulated as shown in Equation (1).
D i , T = d 1 , i d 1008 - p + 1 , i d 1009 - p + 1 , i d 1152 - p + 1 , i d p , i d 1007 , i d 1008 , i d 1151 , i d p + 1 , i d 1008 , i d 1009 , i d 1152 , i Training   data Testing   data
where i denotes the index of the wind turbine i = 1 , 2 , , 6 , and T represents the iteration number of the testing process T = 1 , 2 , , 100 .
Step 2 Retraining of the forecasting models using the optimal datasets: To fully exploit the predictive capability of each model, the optimal datasets associated with the optimal model at each time step were consolidated and used for retraining. This process enabled fine-tuning of model parameters, thereby enhancing the robustness, adaptability, and generalization performance of each model.
Step 3 Development of the model selection mechanism: The model selection mechanism was designed to enable adaptive identification of the most suitable forecasting model at each time step. As formulated in Equation (2), the optimal model label was generated by selecting, at every time point, the forecasting model that yielded the minimal absolute error among the six candidates. Importantly, although these labels are derived from forecasting performance, the classifier uses only the raw input features and does not access forecasting outputs, errors, or model parameters, thereby avoiding any form of information leakage. The classifier is trained to learn the mapping from current input conditions to the most suitable forecasting model prior to prediction, enabling dynamic adaptability during online forecasting.
D i , t C j = d i , t - p + 1 C j , , d i , t - 1 C j min k [ 1 , 6 ] d i , t C j d ^ i , t C j k M i , k
The optimal forecasting model at each time step is selected based on the absolute error (AE). This criterion is particularly suitable for point-wise model selection, as it directly measures the deviation between the predicted and observed values at each individual time step without introducing scale-dependent or distributional biases. Compared with alternative metrics such as RMSE or MAPE, AE offers greater robustness under nonstationary wind conditions, where abrupt fluctuations and occasional extreme values are common. The squared penalty in RMSE tends to overemphasize these rare outliers, which can distort the model ranking and lead to unstable selection decisions. Preliminary comparisons conducted in this study showed that AE provides consistent model rankings across different wind regimes while avoiding excessive sensitivity to extreme deviations. Therefore, AE is adopted as the primary criterion for identifying the best-performing model at each forecasting step.
Step 4: Wind speed forecasting: During the online forecasting stage, the newly received wind speed inputs are first passed to the trained classification model, which rapidly identifies the most suitable forecasting model based on the learned selection mechanism. Only the selected model is then executed to generate the final wind speed prediction, meaning that the forecasting phase requires running a single model rather than all six candidates. The classifier itself is lightweight and adds negligible computational overhead, ensuring real-time responsiveness. This adaptive forecasting procedure enhances prediction accuracy while maintaining computational efficiency and avoiding performance degradation associated with relying on a single model.
Step 5 Wind speed–power curve: Based on the historical operational data, a wind speed–power conversion curve was established to characterize the nonlinear relationship between wind speed and turbine output power. Environmental factors such as air temperature, altitude, and humidity were incorporated into the modeling process to improve the generalization of the conversion law.
The output power of a wind turbine originates from the kinetic energy carried by the moving air mass, and its theoretical value can be derived from the principle of fluid dynamics. Given the rotor swept area A, air density ρ, and wind speed v, the total power available in the wind flow can be expressed as:
P wind = 1 2 ρ A v 3 .
However, a wind turbine cannot convert all the available wind energy into mechanical energy. According to Betz’s Law [56], the theoretical maximum efficiency of energy extraction from the wind flow is limited to 59.3%. This conversion efficiency is characterized by the power coefficient “C” _“p”, defined as the ratio of the actual mechanical output power of the turbine to the theoretical wind power:
C p = P mech P wind .
Therefore, under ideal operating conditions, the mechanical output power of a wind turbine can be expressed as:
P mech = 1 2 ρ A C p v 3 .
Temperature and altitude do not enter the model as arbitrary additional features, but through their physical effect on air density ρ . Using standard thermodynamic relationships, ρ is modeled as a function of temperature and altitude, and variations in ρ induce proportional variations in P mech at a given wind speed. Therefore, the “weights” of temperature and altitude in the multivariable wind speed–power model are not free parameters but are determined explicitly by the air-density correction function.
This wind speed–power conversion model provides a physical foundation for validating and interpreting the forecasting results obtained in the previous steps.
Step 6 Theoretical power forecasting: The wind speed values were input into the established wind speed–power conversion curve to forecast the theoretical power outputs for different wind turbines. This process yielded the estimated mechanical power under ideal operating conditions, serving as a reference for wind farm dispatching and power grid operation. The theoretical power forecasting results provide valuable insights for short-term operational planning and contribute to maintaining grid stability and optimizing energy management strategies.
In practical wind turbine operations, the power coefficient C p may gradually degrade over time due to blade fouling, mechanical wear, and other aging effects. To ensure that the wind speed–power conversion model remains aligned with the turbine’s current operating conditions, this study employs a short-term rolling-window strategy. Specifically, only the most recent 108 days of operational data are used to construct the empirical wind speed–power curve. As new measurements become available, older samples are discarded, and both the forecasting models and the estimated C p curve are continuously updated. This rolling-update mechanism allows C p to reflect the turbine’s recent performance characteristics and implicitly captures gradual efficiency changes without relying on long-term static assumptions. Consequently, long-term degradation effects do not accumulate in our framework, and the theoretical power estimation remains robust and representative of current turbine conditions.
Step 7 Uncertainty analysis: To further enhance the reliability and practical applicability of the proposed forecasting framework, probabilistic modeling was incorporated for uncertainty analysis. Based on the predicted wind speed values, a probability distribution model was constructed to capture the stochastic characteristics of wind variations and model forecasting errors. This evaluation of uncertainty examines the combined effects of environmental disturbances and model-generated variability, thereby reinforcing the stability and dependability of the forecasted results. It also provides a foundation for risk-aware operational planning in power grids and the optimization of renewable energy utilization. The metrics applied in this analysis are summarized in Table 1.
The flowchart representing both the wind speed forecasting system and the wind energy conversion system is presented in Figure 2.

3. Experiment and Analysis

In this section, three experiments are designed to examine both the predictive performance of the proposed MSFSC and the theoretical power generation for individual wind turbines. Before detailing the experimental procedures, the relevant dataset and the criteria for evaluation are introduced.

3.1. Data Source

High-resolution wind speed data, sampled at 10 min intervals from six turbines in a wind farm located in northwestern China, constituted the primary dataset for this investigation. Each dataset encompassed 15,552 continuous measurements over a 108-day period. Figure 3 summarizes the statistical characteristics of these datasets. For model construction, the majority of data (approximately 99%) were utilized for training, while the final 6.5% were set aside for forecasting performance evaluation, ensuring a rigorous validation framework.

3.2. Evaluation Criteria

In this study, multiple statistical indicators were employed to comprehensively evaluate the forecasting performance from different perspectives. The error-based metrics (MAE and RMSE) measure the overall deviation between predicted and observed wind speeds, reflecting the absolute accuracy and robustness of the model. The relative error and stability metrics (MAPE and STDAPE) assess the magnitude and variability of percentage errors, providing insights into model consistency under varying wind conditions. The directional accuracy metric (DA) evaluates the model’s capability to correctly capture the changing trends of wind speed, which is essential for short-term operational control. Finally, the correlation and overall performance metrics (R2 and TIC) quantify the explanatory power and overall efficiency of the forecasting model. These evaluation indices collectively form a comprehensive, multidimensional framework encompassing forecast accuracy, stability, trend-following capacity, and correlation strength, thereby providing a rigorous assessment of wind speed prediction performance and classification accuracy (ACC). The precise definitions of these indices are detailed in Table 2.
The six forecasting models differ in both architectural and computational complexity. CNN-LSTM and CNN-GRU are the most efficient due to their lightweight convolution–recurrent structures, while LSTM-XGBoost and TCN-LSTM exhibit medium complexity. The Transformer and GNN-TCN are the most computationally demanding because of quadratic self-attention and graph-convolution operations.
Using the training dataset (1008 × 7) and test dataset (144 × 7), and running on an Intel i9-13900K CPU, 64 GB RAM, and an NVIDIA RTX 4090 GPU with a batch size of 16 and 150 epochs, all models can be trained within tens of seconds per turbine, and inference remains at the millisecond level. The approximate runtime differences are summarized in Appendix A Table A1. Overall, despite the varying complexity, all six models are computationally feasible for real-time or near real-time short-term wind speed forecasting, supporting the practical deployment of the MSFS framework.

3.3. Experiment I: Forecasting System for Model Selection Driven by Classification

In this experiment, wind speed forecasts were generated using six deep learning models across six wind turbines for 100 iterations, yielding a total of 14,400 test results. The 100 iterations were designed to determine the optimal forecasting model at each time point. Specifically, the model yielding the smallest absolute error among the six forecasting values at each time point was selected as the optimal model. The test results of single model (in Table 3) and the results of forecasting system (in Figure 4 and Table 4).
Table 3 summarizes the wind speed forecasting performance of six wind turbines under different forecasting models. Overall, the performance variations among models across different turbines are relatively small, indicating that each deep learning models exhibit strong generalization capability and stable predictive behavior. Among them, the CNN-GRU, CNN-LSTM, and LSTM-XGBoost models achieve superior accuracy for most turbines, reflected by their lower aggregate error metrics and higher correlation coefficients.
Table 4 summarizes the wind speed forecasting performance for six wind turbines using the classification-driven Model Selection Forecasting System (MSFSC). Over 14,400 test samples, the optimal selection of each model exhibits substantial variability, highlighting the non-stationary and time-dependent behavior of wind speed, which results in ongoing adjustments of the optimal forecasting model over time.
The experimental results for six wind turbines under different forecasting strategies are summarized in Table 3 and Table 4 and Figure 4. Overall, all six deep learning models achieved high forecasting accuracy; Model performance differed across turbines and temporal intervals, illustrating the dynamic and non-stationary properties of wind speed data. For all 14,400 test points, the optimal model at each time step was determined using the criterion of minimum absolute error, providing a systematic approach for adaptive model selection.
For Turbine #1, the distribution of optimal points (OP) shows that CNN-LSTM, Transformers, TCN-LSTM, CNN-GRU, LSTM-XGBoost, GNN-TCN and MSFS generated 1925, 2737, 2638, 2139, 2031 and 2930 optimal points. Similar patterns were observed across other turbines, indicating that different models dominate under different wind conditions. This variation indicates that no single model consistently dominates, as model performance varies with the dynamic characteristics of wind speed data. When the Transformers-based MSFS was applied, the forecasting accuracy improved markedly, achieving an MAE of 0.0650, RMSE of 0.1052, and MAPE of 1.71%, outperforming the average single-model results (MAE ≈ 0.11, RMSE ≈ 0.16). The Directional Accuracy (DA) also rose from about 75% for single models to over 93% under MSFS, confirming the system’s enhanced capability in capturing wind speed fluctuations and improving temporal adaptability.
As shown in Table 4 and Figure 4 (Part C), the proposed system accurately reproduced the real distribution of optimal models. For Turbine #1, the Transformers classifier correctly identified Model Selection Accuracy (SA) of all turbines were 94.59%, 91.34%, 94.61%, 92.89%, 92.00%, and 91.53%, yielding an average model selection accuracy of approximately 92.8%.
For other Turbines the proposed MSFS consistently outperformed all single models across four categories of evaluation metrics. In terms of error-based metrics, the MSFS achieved MAE < 0.07 and RMSE < 0.11, indicating a substantial reduction in overall forecasting errors. Regarding relative error and stability metrics, the MAPE remained below 2%, and STDAPE values were notably lower, reflecting stable and reliable performance under varying wind conditions. The improvement in Directional Accuracy (DA) from around 75% for single models to over 92% demonstrates the system’s superior ability to capture wind speed fluctuations and trend reversals. Moreover, the R2 values exceeding 0.99 and TIC below 0.008 confirm the strong consistency and robustness of the forecasts. Collectively, these results verify that the MSFS provides accurate, stable, and adaptive wind speed forecasting across multiple turbines and operating environments.
Remark. The proposed MSFS successfully integrates six deep learning models, achieving an average model selection accuracy above 92% across 14,400 test instances, while significantly improving overall forecasting precision (R2 > 0.99, MAPE < 2%) and directional consistency. These findings validate the robustness, adaptability, and practical value of the Transformers-driven dynamic model selection approach for wind speed forecasting and subsequent wind power forecasting applications.

3.4. Experiment II: Analysis of Classification and Forecasting Outcomes for Various Wind Turbine Categories

The above results demonstrate that the proposed MSFS substantially enhances model selection accuracy and forecasting stability, outperforming all individual forecasting models. In the forecasting phase of this study, a total of 1008 time points were predicted. Building on this foundation, Table 5 and Figure 5 provides a comprehensive evaluation of the forecasting performance of each individual model and the MSFS framework, focusing on overall accuracy, trend consistency, and error distribution to further assess the effectiveness and applicability of the proposed approach.
In Table 5 and Figure 5 shows the forecasting performance of each model based on their optimal forecasting points and the MSFS which display best performing of different models. However, some standalone models appear to outperform MSFS for certain turbines. For example, in Turbine #2, the Transformers model achieves 170 optimal forecasting points, which the MAE and RMSE are 0.0724, 0.1345, both lower than the MAE of MSFS. Similarly, in Turbine #4, LSTM-XGBoost obtains 145 optimal forecasting points and achieves the MAE (0.1095), outperforming the MAE (0.1419) of MSFS for that turbine. These local advantages do not indicate superior overall performance; rather, they arise because these models happened to be optimal for a larger number of time steps in these turbines, causing metrics of optimal forecasting points to be skewed toward their best-performing intervals and artificially reducing their average errors.
Meanwhile, Table 5 also reveals that some models exhibit relatively high MAPE values. For instance, the MAPE of TCN-LSTM reaches 3.55% in Turbine #2 and 3.44% in Turbine #2. As illustrated in Figure 5 (Part B), these inflated MAPE values primarily occur in the 0–3 m/s low-wind speed region, which is below the cut-in wind speed of the turbines. In this range, the actual wind speed is close to zero, causing proportional error metrics such as MAPE to be artificially magnified due to the small denominator. Importantly, because turbines do not generate power in this wind speed region, such errors do not affect the accuracy of the wind-power conversion process, nor do they pose risks for power system operation or grid dispatching.
Despite the localized advantages observed in certain models at their OP intervals, the MSFS consistently demonstrates superior and more stable performance across all turbines. The MAE remains within 0.105–0.129, the RMSE is maintained between 0.177 and 0.221, and all R2 values exceed 0.98, indicating that the system reliably captures the temporal dynamics of wind speed and produces highly consistent forecasts. Furthermore, the model selection accuracy (SA) of MSFS remains within 91–95% across the six turbines, confirming that the Transformers-based classifier effectively identifies the optimal model at most time steps—an essential factor contributing to MSFS’s overall performance advantages.
Remark: although standalone models may show better performance at their optimal points and proportional errors may appear elevated in the low-wind speed region, these factors do not alter the broader conclusion. Considering that the single-model metrics are inherently OP-based, the MSFS—enabled by its high-precision dynamic model selection mechanism—achieves more stable, accurate, and generalizable forecasting performance across the full time sequence, making it a more practical and reliable approach for real-world wind speed forecasting applications.

3.5. Experiment III: Theoretical Assessment of Power Generation for Individual Wind Turbines

Accurately estimating the theoretical power output is essential for evaluating wind turbine performance, optimizing wind farm operational strategies, and supporting power grid dispatching. As theoretical power is highly sensitive to wind speed typically following a cubic relationship, accurate wind speed forecasting becomes a critical prerequisite for reliable theoretical power estimation. Even minor deviations in predicted wind speed can lead to significant amplification in power calculation errors.
Building on the reliable wind speed forecasting results obtained in the previous section, this study converts the forecast wind speeds into corresponding theoretical power outputs using turbine-specific wind speed–power characteristic curves. The theoretical power represents the maximum mechanical power a wind turbine can achieve under ideal operating conditions, serving as an upper reference bound for actual power generation and providing essential insights for operational state assessment and short-term power planning. By examining how wind speed forecasting accuracy influences the stability and reliability of power conversion, this section further evaluates the applicability of the proposed forecasting framework to short-term wind power forecasting and wind farm operational decision-making.
Based on the above analysis, Table 6 and Table 7 provide a summary of the wind turbine parameters and the corresponding power curve configurations used in this study. The standard power curve is constructed based on the turbine’s aerodynamic characteristics under standard air-density conditions, while the power coefficient is represented by a polynomial function of wind speed obtained from manufacturer data.
To account for variations in atmospheric temperature and altitude, an air-density correction function is applied to adjust the standard curve to actual operating conditions. In addition, three versions of the actual power coefficient curve—representing the lower bound, standard condition, and upper bound—are included to reflect the uncertainty and performance variability of six wind turbines.
Accurate estimation of aerodynamic power requires a physically consistent representation of air density, as mechanical power extraction is directly proportional to ρ . Manufacturers typically assume the standard air density ρ 0 = 1.225   kg / m 3 which is defined at sea-level pressure and a reference temperature of 15   C . However, the turbines investigated in this study are situated at an elevation of H = 1152, where both atmospheric pressure and temperature differ significantly from standard conditions. As a result, using ρ 0 introduces a systematic bias that propagates into the theoretical power estimation.
To account for local atmospheric conditions, air density was recalculated for each time step using the temperature–altitude correction function:
ρ ( T , H ) = 353.1 e 342 H T T ,
where T is the ambient temperature in Kelvin, H is turbine elevation (in kilometers under the model’s empirical scaling), and the coefficient 353.1 arises from the combination of the hydrostatic equation, gas constant, and empirical lapse-rate approximation.
This formulation incorporates two fundamental physical mechanisms:
  • Pressure reduction with altitude:
    p ( H ) e k H ,
    describing the exponential decay of atmospheric pressure as elevation increases.
  • Thermal expansion of air:
    ρ 1 T ,
    reflecting the ideal-gas relationship between density and temperature.
Both factors significantly influence air density in mountainous regions. Applying the above correction to the 144-sample dataset produced density values ranging from 0.88 to 0.95 kg/m3, substantially lower than the constant value ρ 0 = 1.225   kg / m 3 . This reduction aligns with theoretical atmospheric models and field measurements at elevations above 1000 m.
Since aerodynamic mechanical power is given by
P theoretical = 1 2 ρ A C p v 3 ,
where A   =   π R 2 is the swept area and C p ( v ) is estimated empirically from operational data, any deviation in air density directly scales the theoretical power estimate. Substituting ρ 0 into the power equation yields.
P ρ 0 v = 1 2 ρ 0 A C p v 3 ,
whereas using the corrected density gives.
P ρ T , H v = 1 2 ρ ( T , H ) A C p v 3 .
Thus, the ratio of the two theoretical power estimates becomes.
P ρ ( T , H ) P ρ 0 = ρ ( T , H ) ρ 0 ,
which for the observed dataset lies between 0.72 and 0.78, implying that the manufacturer-based power curve overestimates aerodynamic power by approximately 22–28%. Such discrepancies are particularly problematic in wind farms located at high altitudes, where theoretical curves derived from ρ 0 no longer form realistic upper performance bounds.
In contrast, the corrected air density ρ ( T , H ) yields theoretical power curves that closely follow the empirical envelopes observed in SCADA data, thereby improving the physical interpretability of the wind–power conversion model. This correction ensures that theoretical power outputs accurately reflect the turbine’s instantaneous operating environment and prevents systematic bias in the subsequent wind power forecasting analysis.
Based on the forecasting wind speeds of each wind turbine and the turbine parameters of power curve Table 6 and Table 7, the theoretical power output of each wind turbine can be further derived. This step enables a direct evaluation of how forecasting accuracy in wind speed propagates into the estimation of wind energy production. To illustrate this process and assess the reliability of the proposed forecasting framework from an energy perspective, Figure 6 presents the theoretical power curves, the corresponding power forecasting results, and the comparative performance of different forecasting models and the MSFS. The figure provides a comprehensive view of the temporal evolution of predicted theoretical power as well as the accuracy improvements achieved through dynamic model selection.
The accuracy of theoretical power forecasting is strongly governed by the precision of wind speed forecasting. As shown in Table 8, the NRMD values across all turbines remain within 3.8–5.1% for single models, while the MSFS further reduces this range to 3.2–4.5%, indicating smaller deviations between predicted and reference theoretical power. Similarly, RMSE values are consistently lower for MSFS (e.g., 67.69 kW for Turbine #1 and 57.85 kW for Turbine #2), compared with 70–80 kW exhibited by most single models. The R2 values further confirm this improvement, with MSFS achieving 0.995–0.998, compared to 0.982–0.994 for single models. These results indicate that the enhanced wind speed accuracy achieved by MSFS directly translates into more reliable theoretical power estimation.
A notable feature is that the Mean Percentage Error (MPE) is consistently negative for all turbines, typically ranging from −2% to −6%. This indicates that the predicted theoretical power is slightly lower than the actual theoretical output. Such a conservative bias is operationally beneficial: under-forecasting avoids the risk of overestimating available wind power, which could otherwise lead to dispatch imbalance or insufficient reserves in grid scheduling. Importantly, this conservative tendency does not compromise forecasting accuracy, as evidenced by the high R2 and low NRMD values. Therefore, the forecasting framework provides both high-precision and operational safety, making it particularly suitable for real-time power dispatching and short-term wind farm operational planning.
Remark: The experimental results demonstrate that the proposed forecasting framework provides accurate and operationally reliable theoretical power forecasting. By enhancing wind speed forecasting accuracy through dynamic model selection, the framework achieves low conversion errors and high consistency with turbine power characteristics. The consistently negative but small MPE values indicate a conservative bias that avoids overestimation, thereby reducing risks in grid scheduling and short-term dispatch decisions. Overall, the forecasting system not only ensures high forecasting fidelity but also offers a safety margin beneficial for wind farm operation and power system management, making it particularly suitable for practical dispatch applications.

4. Discussion

In this subsection, a thorough examination of the proposed forecasting system is provided, focusing on two key areas: the relevance and uncertainty evaluation of the system, and an analysis of wind energy generation within wind farms.

Analysis of the Proposed Model’s Significance and Uncertainty

To evaluate whether the wind speed forecasts generated by the seven predictive models preserve the statistical characteristics of the observed wind speeds, both the Kruskal–Wallis (KW) test [57] and the Kolmogorov–Smirnov (KS) [58] test were employed. These complementary non-parametric approaches enable a rigorous assessment of central tendency and distributional fidelity, which is essential given that wind speed data follow a Gamma distribution and deviate substantially from normality.
Together, the KW and KS tests provide a comprehensive framework for evaluating whether each forecasting model maintains both the central tendency and the full distributional properties of the observed wind speeds, enabling an assessment that extends beyond pointwise accuracy metrics. The KW statistic approximately follows a chi-square distribution with k 1 degrees of freedom, where k is the number of groups, while the KS statistic is based on the supremum of the absolute CDF differences. In both cases, a p-value below the selected significance level leads to the rejection of the null hypothesis that the predictions and observations share the same statistical characteristics. An increased value of the KW statistic signifies more pronounced differences between groups. When the resulting p-value is less than the selected significance level, the null hypothesis, which posits that all samples are drawn from the same distribution, is rejected. The computation of the test statistic H is expressed by the following formula:
H = 12 N ( N + 1 ) p j 2 n j 3 ( N + 1 ) D n   =   sup x F n     G n .
To rigorously assess the uncertainty associated with wind speed forecasting, the t-Location-scale distribution [59] is utilized to model the forecast error, characterized by the location parameter (μ), scale parameter (σ), and degrees of freedom (ν). The corresponding probability density function (PDF) of this distribution is formulated as follows:
f ( x ; ν , µ , σ ) = Γ v + 1 2 σ v π Γ v 2 v + x μ σ 2 v v + 1 2
where Γ(•) is the gamma function, µ is the location parameter, σ is the scale parameter, and ν is the shape parameter.
The Kruskal–Wallis results show in Table 9 that all model vs. actual data comparisons yield p-values of 1.000 with 0.05 significance level. This indicates that, at the 95% confidence level, none of the six single models exhibit a significant distributional difference from the real wind speed data, confirming that their predictive outputs preserve the underlying wind speed distribution. Building upon this consistent statistical foundation, the MSFS further enhances forecasting performance by dynamically integrating the strengths of these models. As a result, MSFS achieves more accurate and distributionally stable wind speed forecasting, outperforming any single model despite their individually non-significant KW deviations.
In Table 9 and Figure 7, at the 95% confidence level, the uncertainty analysis based on FICP, FINAW, and AWD demonstrates the superior interval performance of the MSFS. Compared with the single models, MSFS consistently achieves the highest coverage, with FICP reaching 93–95%, indicating that its forecasting intervals more reliably encompass actual wind speed variations. Meanwhile, MSFS produces the narrowest normalized interval widths (FINAW ≈ 0.008–0.015), reflecting tighter and more informative uncertainty bounds. In addition, MSFS yields the lowest weighted deviation (AWD ≈ 0.17–0.37), showing minimal discrepancy between the interval forecasting and observed values. Collectively, these results confirm that the MSFS framework provides more accurate, compact, and reliable uncertainty analysis than any single forecasting model.
Remark: The KW test and uncertainty analysis confirm that the MSFS forecasting framework preserves the statistical characteristics of real wind speeds while providing more accurate and reliable forecasting value than any single model.

5. Conclusions

This study proposed an MSFS based on the classification model for short-term wind speed forecasting framework that integrates six deep learning models through time-step model selection. Using multi-turbine operational datasets, the framework was further extended to theoretical power forecasting and uncertainty analysis, addressing the challenges of wind variability and short-term operational decision-making.
The experimental results show that MSFS achieves superior wind-speed prediction performance relative to single-model approaches. The Kruskal–Wallis test shows that all models exhibit no significant deviation from the actual data at the 95% confidence level, confirming strong distributional consistency. The uncertainty analysis further indicates that MSFS achieves higher interval coverage (up to 93–95%), narrower interval widths, and lower weighted deviation than any single model, demonstrating superior interval reliability. When combined with turbine-specific power curves, the high-accuracy wind speed forecasts enable stable and reliable estimation of theoretical power output.
Accurate wind speed and theoretical power forecasting plays a crucial role in power system operation, as wind speed dictates aerodynamic turbine performance and theoretical power serves as an upper bound for dispatch planning and reserve scheduling. The MSFS framework, with its slightly conservative yet highly precise forecasts, effectively reduces the risk of overestimation-induced scheduling imbalance while providing operators with dependable uncertainty bounds for risk-aware decision-making. These characteristics make the proposed approach particularly suitable for wind farm operation, short-term grid scheduling, and safety margin planning.
Future research will explore the integration of multi-source environmental data, the extension of MSFS to actual power forecasting, and the application of the forecasting framework within real dispatch optimization systems. Scaling the approach to multiple wind farms and larger regional grids will also be investigated to enhance its practical applicability.

Author Contributions

Conceptualization, M.Z.; methodology, M.Z. and Q.J.; software, Z.W. and F.M.; validation, H.H. and J.P.; formal analysis, J.P.; data curation, F.M.; writing-original draft preparation, M.Z., Q.J. and Z.W.; writing-review and editing, M.Z., Q.J., Z.W., F.M., H.H. and J.P.; visualization, H.H.; supervision, Q.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The datasets analyzed during the current study are available from the corresponding author on reasonable request. The data are not publicly available due to confidentiality agreements.

Conflicts of Interest

Authors Ming Zeng, Qianqian Jia, Fang Mao, Haotao Huang, and Jingyuan Pan were employed by Guangzhou Power Supply Bureau, a subsidiary of Guangdong Power Grid. The other authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflicts of interest.

Appendix A

1. 
Model Description 
The six forecasting models used in this study span a wide spectrum of architectural and computational complexity, offering complementary strengths for dynamic model selection. Their hyperparameters, structural complexity, computational complexity, and runtime measurements are listed in Table A1. A detailed comparison is provided below.
Table A1. Default model settings and computational cost for the six models. 
Table A1. Default model settings and computational cost for the six models. 
ModelHyperparameterDefaultStructural ComplexityComputational ComplexityTraining Time
CNN-LSTMNumFilters32CNN filters = 32, kernel size = 3
LSTM hidden units = 64
Parameter scale: medium-to-low
Two-stage hybrid structure (CNN → LSTM), a typical mixed model
CNN: O(32 × k × L)
LSTM: O(L × 642)
Overall complexity:
O(L × 4k + L × 4096), medium level
10–20 s
Runs efficiently on GPU/CPU
Suitable for medium-scale real-time forecasting
FilterSize3
NumHiddenUnits64
Dropout0.1
InitialLearnRate1.00 × 10−3
MiniBatchSize16
MaxEpochs150
TransformersNumEncoderLayers2Self-attention + FFN
Parameter scale: medium
Small dmodel, considered a lightweight Transformer
Multi-head attention: O(L2 × dmodel)
FFN: O(L × 64 × 32)
Main bottleneck comes from the quadratic attention term L2
8–18 s
Significantly higher cost than RNN/CNN for long sequences
dModel32
NumHeads4
NumHiddenUnits64
Dropout0.1
InitialLearnRate1.00 × 10−3
TCN-LSTMNumFilters3232 filters × dilation = [1, 2, 4, 8] × 2 residual blocks
LSTM hidden units = 64
Parameter scale: medium-to-high
TCN (dilated convolution):
O(L × 32 × 3 × blocks)
LSTM: O(L × 642)
More complex than CNN-LSTM but does not suffer from the L2 cost of Transformers
15–25 s
Lighter than Transformers
Considered a medium-complexity model
FilterSize3
DilationFactor[1 2 4 8]
NumResidualBlocks2
NumHiddenUnits64
Dropout0.1
CNN-GRUNumFilters32CNN filters = 32
GRU hidden units = 64
Parameter scale similar to CNN-LSTM, but GRU is lighter than LSTM
CNN: O(L × 32 × 3)
GRU: O(L × 642) × 3 gates
Lower than LSTM (which has 4 gates)
15–30 s
Typically, faster than LSTM
Medium-to-low complexity
FilterSize3
NumHiddenUnits64
Dropout0.1
InitialLearnRate1.00 × 10−3
LSTM-XGBoostNumHiddenUnits32LSTM hidden units = 32 (smaller than other hybrid models)
XGBoost trees = 200, depth = 3
Overall architecture: hybrid but not heavy
LSTM: O(L × 322) (small scale)
XGBoost: O(trees × depth × features)
Overall complexity: medium-to-high depending on the number of trees
20–40 s
XGBoost training can be slow on CPU
Dropout0.1
max_depth3
learning_rate0.05
n_estimators200
subsample0.8
colsample_bytree0.8
GNN-TCNNumHiddenUnits32GNN layers = 2, hidden size = 32
TCN filters = 32 with dilation = [1, 2, 4]
Requires graph convolution based on adjacency matrix
Parameter scale: relatively high
GNN: O(Nedges × 32)
TCN: O(L × 32 × 3 × layers)
Combined complexity is higher than pure CNN/RNN models
25–45 s
Runs efficiently on GPU/CPU
NumGraphLayers2
NumFilters32
DilationFactor[1 2 4]
Dropout0.1
InitialLearnRate1.00 × 10−3
(1) 
Low-Complexity Models 
The CNN-LSTM and CNN-GRU represent lightweight hybrid architectures that combine convolutional feature extraction with recurrent units. Owing to their shallow convolutional layers and moderate hidden dimensions, these models exhibit low to medium structural complexity, with computational costs scaling as O(F × k × L) for convolution and O(L × H2) for recurrent operations. Their training times (8–20 s) are the shortest among all models, and inference remains at the sub-millisecond level. These properties make them well suited for scenarios requiring high responsiveness and low resource consumption.
(2) 
Medium-Complexity Models 
The TCN-LSTM and LSTM–XGBoost models fall into the medium to medium–high complexity range.
TCN-LSTM extends the temporal receptive field through dilated causal convolutions, enabling efficient multi-scale pattern extraction. However, combining TCN with LSTM increases computational overhead, resulting in training times of 15–30 s.
LSTM–XGBoost incorporates sequential learning with ensemble-based nonlinear regression, yielding moderate structural complexity and training times of 15–25 s. Both models provide enhanced representational capacity while maintaining practical computational demands.
(3) 
High-Complexity Models: The Transformer and GNN–TCN exhibit the highest structural and computational complexity. 
The Transformer introduces a quadratic self-attention mechanism (O(L2 × d_model)), enabling the learning of global dependencies at the cost of increased computation, though its parallelization keeps training efficient (20–40 s).
The GNN–TCN is the most computationally intensive architecture, combining graph message passing (O(E × H)) with multi-scale temporal convolutions, resulting in training times of 25–45 s. These models offer the strongest expressive power but require greater computational resources.
Despite the significant differences in model complexity, all six models achieve millisecond-level inference latency on the experimental platform, ensuring real-time feasibility. Their diverse structural and computational properties justify the proposed MSFS framework, where the classifier adaptively selects the most appropriate model under varying wind conditions to balance predictive accuracy and computational efficiency.
Table A2. The Transformer Classifier Configuration and Computational Characteristics.
Table A2. The Transformer Classifier Configuration and Computational Characteristics.
CategoryDescription
Model TypeTransformer encoder classifier
Hyperparameters
-
Encoder layers: 2
-
Model dimension dmodel: 32
-
Attention heads: 4
-
FFN hidden size: 64
-
Dropout: 0.1
-
Optimizer: Adam, LR = 1 × 10−3
-
Batch size: 32
-
Epochs: 100
Training Input10,080 samples × 7 features
Testing Input4320 samples × 7 features
Structural Complexity
-
Lightweight encoder architecture—Total parameters ≈ 20k—Significantly smaller than forecasting models
Computational ComplexityO (NL2d + NLddff) With (L = 7, d = 32, dff = 64): extremely low in practice
Training Time3–8 s on RTX 4090 for 100 epochs (few tens of seconds on CPU)
Inference TimeMillisecond-level for 4 320 samples (negligible overhead in MSFS)
Practical Notes
-
Very efficient due to short sequence length (L = 7)
-
Introduces negligible computational cost compared to the forecasting models
-
Well suited for real-time model selection
Table A3. The performance of each distribution for uncertainty estimation.
Table A3. The performance of each distribution for uncertainty estimation.
NODistributionParameterCNN-LSTMCNN-GRUGNN-TCNLSTM-XGBoostTCN-LSTMTransformersMSFS
Turbine 1normal‘mu’−0.0009−0.00280.0010−0.0037−0.0024−0.0012−0.0005
‘sigma’0.14030.13040.04520.13290.14150.12440.0826
SSE22.752523.705536.497429.285219.043025.044548.5762
logistic‘mu’0.0008−0.00230.0011−0.0024−0.0002−0.00080.0002
‘sigma’0.06630.06270.02400.06020.06800.05710.0369
SSE10.977312.197633.016512.62688.67809.638313.5334
generalized extreme value‘k’−0.1654−0.1418−0.1932−0.2102−0.2666−0.2168−0.1488
‘sigma’0.16340.14720.04730.16730.16320.14980.1008
‘mu’−0.0591−0.0580−0.0167−0.0561−0.0541−0.0498−0.0350
SSE33.447534.144461.651144.697327.063438.372285.7774
tlocationscale‘mu’0.00350.00020.0011−0.00010.00160.00060.0014
‘sigma’0.06320.06180.03510.05530.07010.05610.0398
‘nu’1.81061.88915.13601.75132.02011.91762.2681
SSE1.53622.907136.46511.04541.16621.06507.6752
Turbine 2normal‘mu’−0.0031−0.0015−0.0086−0.0018−0.01680.0003−0.0036
‘sigma’0.30630.32970.42030.09460.29390.27980.3412
SSE8.77143.38986.511537.47519.297013.39857.4163
logistic‘mu’−0.0061−0.0064−0.0018−0.0021−0.01460.0002−0.0077
‘sigma’0.14540.15510.18030.04100.13770.13060.1601
SSE5.25441.77943.998710.95005.41408.75004.0357
generalized extreme value‘k’−0.1761−0.1463−0.1541−0.1438−0.1429−0.2046−0.1859
‘sigma’0.33160.36710.48610.11420.32830.31380.3697
‘mu’−0.1252−0.1370−0.1828−0.0411−0.1404−0.1088−0.1378
SSE10.79984.50717.909759.806812.169016.09779.2571
tlocationscale‘mu’−0.0025−0.00600.0029−0.0017−0.00730.0015−0.0087
‘sigma’0.12880.13480.09790.04080.12220.09800.1424
‘nu’1.63611.59621.06502.01721.64811.36341.6557
SSE1.27090.16520.10604.48011.34451.33770.7493
Turbine 3normal‘mu’0.00440.0150−0.0059−0.01300.02680.00860.0023
‘sigma’0.39340.76000.71220.79750.66720.67980.2491
SSE2.85720.05470.02540.04420.47140.06239.7058
logistic‘mu’0.01020.0157−0.0041−0.00570.02250.01290.0049
‘sigma’0.18360.41430.39300.43380.35500.36540.1151
SSE1.40260.02510.01000.02300.26350.01345.1727
generalized extreme value‘k’−0.2296−0.2292−0.2837−0.1937−0.2190−0.2342−0.1907
‘sigma’0.45510.77750.73740.83350.69290.72200.2968
‘mu’−0.1474−0.2726−0.2567−0.3324−0.2290−0.2491−0.0988
SSE3.91630.09580.05300.10730.62440.134713.7061
tlocationscale‘mu’0.01090.0157−0.0029−0.00470.01960.01380.0071
‘sigma’0.16740.62650.61730.65780.49090.53910.1082
‘nu’1.71316.04117.95916.21933.93905.28361.7880
SSE0.16030.02450.00930.02300.19970.01121.4037
Turbine 4normal‘mu’−0.0005−0.0024−0.0012−0.0028−0.0037−0.00090.0010
‘sigma’0.08260.14150.12440.13040.13290.14030.0452
SSE48.576219.043025.044523.705529.285222.752536.4974
logistic‘mu’0.0002−0.0002−0.0008−0.0023−0.00240.00080.0011
‘sigma’0.03690.06800.05710.06270.06020.06630.0240
SSE13.53348.67809.638312.197612.626810.977333.0165
generalized extreme value‘k’−0.1488−0.2666−0.2168−0.1418−0.2102−0.1654−0.1932
‘sigma’0.10080.16320.14980.14720.16730.16340.0473
‘mu’−0.0350−0.0541−0.0498−0.0580−0.0561−0.0591−0.0167
SSE85.777427.063438.372234.144444.697333.447561.6511
tlocationscale‘mu’0.00140.00160.00060.0002−0.00010.00350.0011
‘sigma’0.03980.07010.05610.06180.05530.06320.0351
‘nu’2.26812.02011.91761.88911.75131.81065.1360
SSE7.67521.16621.06502.90711.04541.536236.4651
Turbine 5normal‘mu’−0.0086−0.0080−0.0076−0.0079−0.0104−0.0113−0.0013
‘sigma’0.16560.22000.23540.24570.23880.25500.0923
SSE36.823817.703710.367610.87134.836912.682531.4714
logistic‘mu’−0.0040−0.0072−0.0082−0.0082−0.0082−0.0091−0.0012
‘sigma’0.05920.09490.10750.11110.10250.11830.0343
SSE14.78999.54525.16105.66011.66497.92375.1605
generalized extreme value‘k’−0.1141−0.1097−0.1397−0.1112−0.1733−0.1687−0.0567
‘sigma’0.23310.24300.26700.26890.35520.31800.1090
‘mu’−0.0797−0.0997−0.1056−0.1113−0.1105−0.1181−0.0391
SSE51.410222.362114.279914.36069.034417.239347.7776
tlocationscale‘mu’0.00030.0004−0.0047−0.0033−0.0045−0.0068−0.0002
‘sigma’0.04200.07150.09820.10100.08840.10030.0362
‘nu’1.43761.40751.72771.71811.63421.55502.3046
SSE0.17420.64070.91891.13610.14561.90400.2898
Turbine 6normal‘mu’−0.0059−0.0100−0.0099−0.0088−0.01190.0002−0.0011
‘sigma’0.13430.18890.17880.20560.18600.20070.0695
SSE36.48189.405015.221917.596714.527310.484844.0969
logistic‘mu’−0.0014−0.0044−0.0060−0.0063−0.00480.00060.0005
‘sigma’0.05150.08970.08200.09950.08520.09490.0316
SSE14.09284.43757.753410.08497.54795.49569.3376
generalized extreme value‘k’−0.1777−0.2106−0.1901−0.2204−0.2112−0.2020−0.2341
‘sigma’0.19240.22020.20930.22460.23080.23580.0884
‘mu’−0.0647−0.0854−0.0831−0.0875−0.0872−0.0787−0.0283
SSE53.597313.806820.795822.269320.797414.734686.6710
tlocationscale‘mu’0.0000−0.0016−0.0039−0.0031−0.00150.00060.0005
‘sigma’0.03980.08860.06910.09450.07370.08850.0353
‘nu’1.54591.90401.56691.78441.61051.75062.4024
SSE0.30540.60060.43542.22710.81180.76315.4124

References

  1. Global Wind Energy Council. Global Wind Report 2024; Global Wind Energy Council (GWEC): Brussels, Belgium, 2024. [Google Scholar]
  2. Yang, D.Z.; Wang, W.T.; Hong, T. A historical weather forecast dataset from the European centre for medium-range weather forecasts (ecmwf) for energy forecasting. Sol. Energy 2022, 232, 263–274. [Google Scholar] [CrossRef]
  3. Alsamamra, H.R.; Salah, S.; Shoqeir, J.H. Performance analysis of arima model for wind speed forecasting in Jerusalem, Palestine. Energy Explor. Exploit. 2024, 42, 1727–1746. [Google Scholar] [CrossRef]
  4. Dupuy, F.; Durand, P.; Hedde, T. Downscaling of surface wind forecasts using convolutional neural networks. Nonlin. Process. Geophys. 2023, 30, 553–570. [Google Scholar] [CrossRef]
  5. Daenens, S.; Verstraeten, T.; Daems, P.J.; Nowé, A.; Helsen, J. Spatio-temporal graph neural networks for power prediction in offshore wind farms using scada data. Wind Energy Sci. 2025, 10, 1137–1152. [Google Scholar] [CrossRef]
  6. Liao, W.; Fang, J.; Ye, L.; Bak-Jensen, B.; Yang, Z.; Porte-Agel, F. Can we trust explainable artificial intelligence in wind power forecasting? Appl. Energy 2024, 376, 124273. [Google Scholar] [CrossRef]
  7. Liu, Z.; Guo, H.; Zhang, Y.; Zuo, Z. A comprehensive review of wind power prediction based on machine learning: Models, applications, and challenges. Energies 2025, 18, 350. [Google Scholar] [CrossRef]
  8. Wang, Y.; Zhang, F.; Kou, H.; Zou, R.; Hu, Q.; Wang, J.; Srinivasan, D. A review of predictive uncertainty modeling techniques and evaluation metrics in probabilistic wind speed and wind power forecasting. Appl. Energy 2025, 396, 126234. [Google Scholar] [CrossRef]
  9. Zhao, Y.; Liao, H.; Zhao, Y.; Pan, S. Data-augmented trend-fluctuation representations by interpretable contrastive learning for wind power forecasting. Appl. Energy 2025, 380, 125052. [Google Scholar] [CrossRef]
  10. Li, J.; Jia, L.; Zhou, C. Deep fuzzy inference system fused with probability density function control for wind power forecasting with asymmetric error distribution. J. Clean. Prod. 2025, 511, 145590. [Google Scholar] [CrossRef]
  11. Chen, H.; Jiang, X.; Hui, H.; Zhang, K.; Meng, W.; Cheynet, E. Enhancing probabilistic wind speed forecasting by integrating self-adaptive bayesian wavelet denoising with deep gaussian process regression under uncertainties. Renew. Energy 2026, 256, 123966. [Google Scholar] [CrossRef]
  12. He, Y.Y.; Yu, N.N.; Wang, B. Online probability density prediction of wind power considering virtual and real concept drift detection. Appl. Energy 2025, 396, 126318. [Google Scholar] [CrossRef]
  13. Gao, J.; Cheng, Y.; Zhang, D.; Chen, Y. Physics-constrained wind power forecasting aligned with probability distributions for noise-resilient deep learning☆. Appl. Energy 2025, 383, 125295. [Google Scholar] [CrossRef]
  14. Huang, H.; Xue, S.; Zhao, L.; Wang, W.; Wu, H. Privacy-preserving smart energy management by consumer-electronic chips and federated learning. IEEE Trans. Consum. Electron. 2024, 70, 2200–2209. [Google Scholar] [CrossRef]
  15. Wang, X.R.; Zhou, Y.Z. Privacy-preserving probabilistic wind power forecasting: An adaptive federated approach. Appl. Energy 2025, 396, 126177. [Google Scholar] [CrossRef]
  16. Nayak, A.K.; Sharma, K.C.; Bhakar, R.; Tiwari, H. Probabilistic online learning framework for short-term wind power forecasting using ensemble bagging regression model. Energy Convers. Manag. 2025, 323, 119142. [Google Scholar] [CrossRef]
  17. Leal, J.I.; Pitombeira-Neto, A.R.; Bueno, A.V.; Rocha, P.A.C.; de Andrade, C.F. Probabilistic wind speed forecasting via bayesian dlms and its application in green hydrogen production. Appl. Energy 2025, 382, 125286. [Google Scholar] [CrossRef]
  18. Chen, H. A novel wind model downscaling with statistical regression and forecast for the cleaner energy. J. Clean. Prod. 2024, 434, 140217. [Google Scholar] [CrossRef]
  19. Zhao, L.; Liu, C.; Yang, C.; Liu, S.; Zhang, Y.; Li, Y. A location-centric transformer framework for multi-location short-term wind speed forecasting. Energy Convers. Manag. 2025, 328, 119627. [Google Scholar] [CrossRef]
  20. Ma, J.; Du, J.; Chen, Q.; Jiang, X.; Pan, L. Multi-feature extraction spatio-temporal interaction graph network for wind speed forecasting in windfarm. Energy 2025, 333, 137229. [Google Scholar] [CrossRef]
  21. Resifi, S.; Al Aawar, E.; Dasari, H.P.; Jebari, H.; Hoteit, I. A novel deep learning approach for regional high-resolution spatio-temporal wind speed forecasting for energy applications. Energy 2025, 328, 136356. [Google Scholar] [CrossRef]
  22. Zhang, Z.G.; Yin, J.C. Incremental principal component analysis based depthwise separable unet model for complex wind system forecasting. Energy 2025, 334, 137751. [Google Scholar] [CrossRef]
  23. Zhang, J.; Zhang, Y.; Liu, K.; Zhao, C. Multi-step prediction of spatio-temporal wind speed based on the multimodal coupled st-dfnet model. Energy 2025, 334, 137670. [Google Scholar] [CrossRef]
  24. Hu, D.; He, F.; Fan, W.; Feng, W. Dbann: Dual-branch attention neural networks with hierarchical spatiotemporal-perception for multi-node offshore wind power forecasting. Energy 2025, 334, 137521. [Google Scholar] [CrossRef]
  25. Zhao, Y.; Zhao, Y.; Liao, H.; Pan, S.; Zheng, Y. Interpreting lasso regression model by feature space matching analysis for spatio-temporal correlation based wind power forecasting. Appl. Energy 2025, 380, 124954. [Google Scholar] [CrossRef]
  26. Verdone, A.; Panella, M.; De Santis, E.; Rizzi, A. A review of solar and wind energy forecasting: From single-site to multi-site paradigm. Appl. Energy 2025, 392, 126016. [Google Scholar] [CrossRef]
  27. Wan, H.; Wang, J.; Gan, Q.; Xia, Y.; Chang, Y.; Yan, H. Addressing intermittency in medium-term photovoltaic and wind power forecasting using a hybrid xlstm-tccnn model with numerical weather predictions. Renew. Energy 2025, 253, 123618. [Google Scholar] [CrossRef]
  28. Ignatev, E.; Deriugina, G.; Suslov, K.; Balaban, G. Development of a hybrid model for medium-term wind farm power output forecasting. Renew. Energy 2025, 249, 123200. [Google Scholar] [CrossRef]
  29. Michalakopoulos, V.; Zakynthinos, A.; Sarmas, E.; Marinakis, V.; Askounis, D. Hybrid short-term wind power forecasting model using theoretical power curves and temporal fusion transformers. Renew. Energy 2026, 256, 124008. [Google Scholar] [CrossRef]
  30. Xu, X.; Cao, Q.; Deng, R.; Guo, Z.; Chen, Y.; Yan, J. A cross-dataset benchmark for neural network-based wind power forecasting. Renew. Energy 2025, 254, 123463. [Google Scholar] [CrossRef]
  31. Dong, Y.; Zhou, B.; Zhang, H.; Yang, G.; Ma, S. A deep time-frequency augmented wind power forecasting model. Renew. Energy 2026, 256, 123550. [Google Scholar] [CrossRef]
  32. Ullah, S.; Chen, X.; Han, H.; Wu, J.; Dong, J.; Liu, R.; Ding, W.; Liu, M.; Li, Q.; Qi, H.; et al. A novel hybrid ensemble approach for wind speed forecasting with dual-stage decomposition strategy using optimized gru and transformer models. Energy 2025, 329, 136739. [Google Scholar] [CrossRef]
  33. Li, S.; Guo, L.; Zhu, J.; Liu, M.; Chen, J.; Meng, Z. Short-term multi-step wind speed forecasting with multi-feature inputs using variational mode decomposition, a novel artificial intelligence network, and the polar lights optimizer. Renew. Energy 2026, 256, 123965. [Google Scholar] [CrossRef]
  34. Ma, C.; Zhang, C.; Yao, J.; Zhang, X.; Nazir, M.S.; Peng, T. Enhancement of wind speed forecasting using optimized decomposition technique, entropy-based reconstruction, and evolutionary patchtst. Energy Convers. Manag. 2025, 333, 119819. [Google Scholar] [CrossRef]
  35. Wu, X.; Wang, D.; Yang, M.; Liang, C. Ceemdan-se-hdbscan-vmd-tcn-bigru: A two-stage decomposition-based parallel model for multi-altitude ultra-short-term wind speed forecasting. Energy 2025, 330, 136660. [Google Scholar] [CrossRef]
  36. Hong, J.T.; Han, S.; Yan, J.; Liu, Y.Q. Dual-path frequency mamba-transformer model for wind power forecasting. Energy 2025, 332, 137225. [Google Scholar] [CrossRef]
  37. Wu, B.; Lin, J.; Liu, R.; Wang, L. A multi-dimensional interpretable wind speed forecasting model with two-stage feature exploring. Renew. Energy 2026, 256, 124028. [Google Scholar] [CrossRef]
  38. Zeng, H.; Wu, B.; Fang, H.; Lin, J. Interpretable wind speed forecasting through two-stage decomposition with comprehensive relative importance analysis. Appl. Energy 2025, 392, 126015. [Google Scholar] [CrossRef]
  39. Xu, R.; Fang, H.; Zeng, H.; Wu, B. A novel interpretable wind speed forecasting based on the multivariate variational mode decomposition and temporal fusion transformer. Energy 2025, 331, 136497. [Google Scholar] [CrossRef]
  40. Liang, B.J.; Tian, Z.R. Isi net: A novel paradigm integrating interpretability and intelligent selection in ensemble learning for accurate wind power forecasting. Energy Convers. Manag. 2025, 332, 119752. [Google Scholar] [CrossRef]
  41. Wang, Q.; Xu, F.; He, J.; Luo, K.; Fan, J. A new fusion model for enhanced ultra-short-term offshore wind power forecasting. Renew. Energy 2026, 256. [Google Scholar] [CrossRef]
  42. Li, M.; Zhang, K.; Kou, M.; Ma, Y. An offshore wind speed forecasting system based on feature enhancement, deep time series clustering, and extended lstm. Energy 2025, 333, 137335. [Google Scholar] [CrossRef]
  43. Cui, X.; Yu, X.; Niu, H.; Niu, D.; Liu, D. A novel data-driven multi-step wind power point-interval prediction framework integrating sliding window-based two-layer adaptive decomposition and multi-objective optimization for balancing prediction accuracy and stability. Appl. Energy 2025, 397, 126348. [Google Scholar] [CrossRef]
  44. He, X.; Zhao, K.; Chu, X. AutoML: A survey of the state-of-the-art. Knowl.-Based Syst. 2021, 212, 106622. [Google Scholar] [CrossRef]
  45. Zöller, M.A.; Huber, M.F. Benchmark and survey of automated machine learning frameworks. J. Artif. Intell. Res. 2021, 70, 409–472. [Google Scholar] [CrossRef]
  46. Zhou, Y.; Wu, X.; Wu, J.; Feng, L.; Tan, K.C. HM3: Hierarchical multi-objective model merging for pretrained models. arXiv 2024, arXiv:2409.18893. [Google Scholar] [CrossRef]
  47. Telikani, A.; Tahmassebi, A.; Banzhaf, W.; Gandomi, A.H. Evolutionary machine learning: A survey. ACM Comput. Surv. 2021, 54, 1–35. [Google Scholar] [CrossRef]
  48. Cerqueira, V.; Torgo, L.; Pinto, F.; Soares, C. Arbitrated ensembles for time series forecasting in online learning scenarios. Expert Syst. Appl. 2019, 118, 271–282. [Google Scholar]
  49. Jiang, W.; Liu, B.; Liang, Y.; Gao, H.; Lin, P.; Zhang, D.; Hu, G. Applicability analysis of transformer to wind speed forecasting by a novel deep learning framework with multiple atmospheric variables. Appl. Energy 2024, 353, 122155. [Google Scholar] [CrossRef]
  50. Li, W.; Li, Y.; Garg, A.; Gao, L. Enhancing real-time degradation prediction of lithium-ion battery: A digital twin framework with cnn-lstm-attention model. Energy 2024, 286, 129681. [Google Scholar] [CrossRef]
  51. Liu, X.; Zhang, L.; Wang, J.; Zhou, Y.; Gan, W. A unified multi-step wind speed forecasting framework based on numerical weather prediction grids and wind farm monitoring data. Renew. Energy 2023, 211, 948–963. [Google Scholar] [CrossRef]
  52. Su, T.S.; Weng, X.Y.; Vincent, F.Y.; Wu, C.C. Optimal maintenance planning for offshore wind farms under an uncertain environment. Ocean. Eng. 2023, 283, 115033. [Google Scholar] [CrossRef]
  53. Semmelmann, L.; Henni, S.; Weinhardt, C. Load forecasting for energy communities: A novel lstm-xgboost hybrid model based on smart meter data. Energy Inform. 2022, 5, 24. [Google Scholar] [CrossRef]
  54. Hedegaard, L.; Heidari, N.; Iosifidis, A. Continual spatio-temporal graph convolutional networks. Pattern Recognit. 2023, 140, 109528. [Google Scholar] [CrossRef]
  55. Huang, H.; Lin, L.; Zhao, L.; Ding, S.; Huang, H. Time series focused neural network for accurate wireless human gesture recognition. IEEE Trans. Netw. Sci. Eng. 2025, 113, 118–129. [Google Scholar] [CrossRef]
  56. Goldstein, S. On the vortex theory of screw propeller. Proc. R. Soc. A Math. Phys. Eng. Sci. 1929, 123, 440–465. [Google Scholar]
  57. McKight, P.E.; Najab, J. Kruskal-wallis test. In The Corsini Encyclopedia of Psychology; John Wiley & Sons: Haboken, NJ, USA, 2010; Volume 4. [Google Scholar]
  58. Xiaojia, H.; Wang, C.; Zhang, S. Research and application of a Model selection forecasting system for wind speed and theoretical power generation in wind farms based on classification and wind conversion. Energy 2024, 293, 130606. [Google Scholar] [CrossRef]
  59. Tedeschi, P.; Sciancalepore, S.; Pietro, R. Modelling a communication channel under jamming: Experimental model and applications. In Proceedings of the Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York, NY, USA, 30 September–3 October 2021. [Google Scholar]
Figure 1. Global wind energy development map based on installed capacity.
Figure 1. Global wind energy development map based on installed capacity.
Futureinternet 18 00007 g001
Figure 2. Flowchart Illustrating the Forecasting System Framework. (First, multi-turbine operational data are collected and pre-processed for model pre-training. The models are then retrained using the datasets corresponding to their optimal performance to enhance adaptability and robustness. A classification-based mechanism is developed to adaptively select the most suitable forecasting model at each time step. The trained system generates short-term wind speed forecasts, which are converted into theoretical power outputs through a physical wind speed–power curve. The resulting power forecasting support wind farm dispatching and grid operation. Finally, uncertainty analysis is conducted to assess the effects of environmental variability and model error, thereby improving the framework’s reliability and practical applicability).
Figure 2. Flowchart Illustrating the Forecasting System Framework. (First, multi-turbine operational data are collected and pre-processed for model pre-training. The models are then retrained using the datasets corresponding to their optimal performance to enhance adaptability and robustness. A classification-based mechanism is developed to adaptively select the most suitable forecasting model at each time step. The trained system generates short-term wind speed forecasts, which are converted into theoretical power outputs through a physical wind speed–power curve. The resulting power forecasting support wind farm dispatching and grid operation. Finally, uncertainty analysis is conducted to assess the effects of environmental variability and model error, thereby improving the framework’s reliability and practical applicability).
Futureinternet 18 00007 g002
Figure 3. The statistical analysis of wind speed and corresponding power output for each turbine is presented, offering a comprehensive evaluation across six turbines. The wind speed data were accurately fitted by a Gamma distribution, with determination coefficients (R2) from 0.970 to 0.976, demonstrating an excellent goodness of fit. The derived shape (1.90–2.23) and scale (2.37–2.97) parameters, together with mean values around 5 m/s and standard deviations of 3.6–4.0 m/s, reveal a pronounced right-skew and heavy-tailed behavior. These results confirm that the Gamma distribution provides an appropriate statistical representation of both the stochastic fluctuations and extreme events inherent in wind speed dynamics.
Figure 3. The statistical analysis of wind speed and corresponding power output for each turbine is presented, offering a comprehensive evaluation across six turbines. The wind speed data were accurately fitted by a Gamma distribution, with determination coefficients (R2) from 0.970 to 0.976, demonstrating an excellent goodness of fit. The derived shape (1.90–2.23) and scale (2.37–2.97) parameters, together with mean values around 5 m/s and standard deviations of 3.6–4.0 m/s, reveal a pronounced right-skew and heavy-tailed behavior. These results confirm that the Gamma distribution provides an appropriate statistical representation of both the stochastic fluctuations and extreme events inherent in wind speed dynamics.
Futureinternet 18 00007 g003
Figure 4. Classification-Based Model Selection and Hybrid Model Forecast Results.
Figure 4. Classification-Based Model Selection and Hybrid Model Forecast Results.
Futureinternet 18 00007 g004
Figure 5. Forecasting Performance and Model Selection Accuracy Across Individual Turbines.
Figure 5. Forecasting Performance and Model Selection Accuracy Across Individual Turbines.
Futureinternet 18 00007 g005
Figure 6. (A) Forecast-Driven Wind Energy Conversion Across Individual Turbines. (B) Forecast-Driven Wind Energy Conversion Across Individual Turbines.
Figure 6. (A) Forecast-Driven Wind Energy Conversion Across Individual Turbines. (B) Forecast-Driven Wind Energy Conversion Across Individual Turbines.
Futureinternet 18 00007 g006aFutureinternet 18 00007 g006b
Figure 7. (A) Wind Speed Prediction Intervals Derived from Each Model for Individual Turbines. (B) Wind Speed Prediction Intervals Derived from Each Model for Individual Turbines.
Figure 7. (A) Wind Speed Prediction Intervals Derived from Each Model for Individual Turbines. (B) Wind Speed Prediction Intervals Derived from Each Model for Individual Turbines.
Futureinternet 18 00007 g007aFutureinternet 18 00007 g007b
Table 1. Evaluation Metrics for the Comparative Analysis of Uncertainty Forecasting.
Table 1. Evaluation Metrics for the Comparative Analysis of Uncertainty Forecasting.
MetricDefinitionMathematical Expression
Upper BoundUpper bounds of the wind speed forecasting value U ( i ) = F ( i ) + K 1 0.5 * α × σ N
Lower BoundLower bounds of the wind speed forecasting value L i = F ( i ) K 1 0.5 * α × σ N
FICPForecast interval coverage probability of testing dataset FICP = 1 N i = 1 N c i × 100 %
FINAWForecast interval normalized average width of testing dataset FINAW = 1 N R i = 1 N ( U ( i ) L ( i ) )
AWDiAccumulated width deviation of testing sample i AWD i = ( L ( i ) A u i ) / ( U ( i ) L ( i ) ) , A u i < L ( i ) 0 ,   A u i [ L ( i ) ,   U ( i ) ] ( A u i U ( i ) ) / ( U ( i ) L ( i ) ) , A u i > U ( i )
AWDAccumulated width deviation of testing dataset FINAW = 1 N R i = 1 N AWD i
Note: At each point i, the forecasted value is denoted by F (i). The parameters K and σ define the quantile and scale of the t-Location Scale distribution function. N represents the total forecast length, while NR quantifies the range between the observed maximum and minimum values. The binary indicator ci takes the value 1 if the actual value A u i falls within the interval L i , U i , and 0 otherwise.
Table 2. Evaluation Metrics for the Assessment of Wind Speed Forecasting.
Table 2. Evaluation Metrics for the Assessment of Wind Speed Forecasting.
IndexDescriptionMathematical Expression
MAEMAE measures the average magnitude of forecasting errors without considering their direction. MAE = 1 N i = 1 N y ^ i y i
RMSEEvaluates the square root of the mean squared error; more sensitive to large deviations and outliers. RMSE = 1 N i = 1 N y ^ i y i
MAPEMAPE expresses forecasting accuracy as a percentage; lower MAPE means higher accuracy. MAPE = 1 N i = 1 N y ^ i y i y i × 100 %
STDAPEThe Standard Deviation of the Absolute Percentage Error is used to measure the stability of a forecasting model. STDAPE = 1 N 1 i = 1 N y ^ i y i y i 1 N i = 1 N y ^ i y i y i 2 × 100 %
DAMeasures the ability of the model to correctly forecast the direction (increase/decrease) of changes. Higher DA means better trend following capability. DA = 1 N 1 i = 1 N a i ,   a i = 1 ,   if   ( y i + 1 y ^ i ) ( y ^ i + 1 y i ) 0 ,   otherwise  
TICThis indicator quantifies the relative accuracy of forecasts, with values ranging from 0 to 1; smaller values correspond to higher predictive accuracy. TIC = i = 1 N ( y i y ^ i ) 2 i = 1 N ( y i ) 2
R2Represents the proportion of variance in observations explained by the model. Values close to 1 indicate strong predictive correlation. R 2 = 1 i = 1 N ( y i   y ^ i ) 2 i = 1 N ( y i y - i ) 2
ACCAccuracy (ACC) is a widely used measure in machine learning for assessing the effectiveness of a classification model ACC = TP + TN TP + TN + FP + FN
Note: For each observation, y i corresponds to the true value, while y u ^ represents the predicted value from a given model. N indicates the number of forecasting samples, and the accuracy of model selection is assessed using the ACC metric.
Table 3. Initial Experiment Outcomes of Each Model on Various Wind Turbines.
Table 3. Initial Experiment Outcomes of Each Model on Various Wind Turbines.
NO Model MAE RMSE STDAPE DA TIC MAPE R2
Turbine #1CNN-LSTM0.13820.193617.30%73.88%0.01644.12%0.9587
Transformers0.15150.205526.45%73.86%0.01774.97%0.9487
TCN-LSTM0.15050.187717.61%67.12%0.01555.48%0.9561
CNN-GRU0.10180.155421.51%70.56%0.01593.88%0.9660
LSTM-XGBoost0.10770.154724.47%76.17%0.01514.20%0.9652
GNN-TCN0.11120.166125.66%76.73%0.01684.55%0.9602
Turbine #2CNN-LSTM0.14290.21009.95%74.84%0.01704.09%0.9555
Transformers0.15540.213813.23%74.37%0.01714.70%0.9498
TCN-LSTM0.14300.200420.89%74.45%0.01695.63%0.9591
CNN-GRU0.17110.238416.19%72.34%0.01975.57%0.9396
LSTM-XGBoost0.11280.165421.29%74.67%0.01454.82%0.9703
GNN-TCN0.13090.184322.36%74.20%0.01735.73%0.9590
Turbine #3CNN-LSTM0.12560.17769.18%75.08%0.01604.30%0.9599
Transformers0.13800.179214.59%72.47%0.01504.72%0.9586
TCN-LSTM0.14690.206411.85%69.22%0.01804.80%0.9600
CNN-GRU0.14710.189721.17%74.50%0.01665.21%0.9488
LSTM-XGBoost0.13150.188813.52%76.71%0.01664.47%0.9590
GNN-TCN0.13360.18629.83%75.81%0.01554.56%0.9571
Turbine #4CNN-LSTM0.12510.181114.52%74.27%0.01564.10%0.9597
Transformers0.12770.169014.79%72.50%0.01494.09%0.9616
TCN-LSTM0.13410.175620.08%66.78%0.01585.14%0.9595
CNN-GRU0.14380.208024.13%73.07%0.01835.44%0.9460
LSTM-XGBoost0.09660.152814.49%70.23%0.01483.94%0.9683
GNN-TCN0.09250.138721.73%74.66%0.01394.53%0.9721
Turbine #5CNN-LSTM0.12590.193718.93%70.85%0.01634.67%0.9629
Transformers0.12680.203221.29%67.52%0.01594.75%0.9592
TCN-LSTM0.13740.203722.50%68.06%0.01635.51%0.9562
CNN-GRU0.13710.174923.44%70.66%0.01446.12%0.9671
LSTM-XGBoost0.13940.196717.50%74.43%0.01604.06%0.9582
GNN-TCN0.13470.210317.55%74.79%0.01694.12%0.9553
Turbine #6CNN-LSTM0.10880.167317.63%73.94%0.01424.31%0.9665
Transformers0.12390.163315.84%74.17%0.01614.13%0.9616
TCN-LSTM0.11460.155623.83%73.49%0.01444.91%0.9689
CNN-GRU0.13000.184426.65%74.57%0.01655.31%0.9577
LSTM-XGBoost0.14220.188718.60%68.14%0.01594.75%0.9595
GNN-TCN0.13500.193214.70%73.80%0.01695.15%0.9571
Table 4. Model Selection Outcomes of Each Model.
Table 4. Model Selection Outcomes of Each Model.
NO Model MAE RMSE STDAPE DA TIC MAPE R2 OP SOP SA
Turbine #1CNN-LSTM0.06900.09488.31%88.66%0.00792.15%0.98611925182194.59%
Transformers0.07680.105313.13%88.64%0.00882.50%0.98522737254793.05%
TCN-LSTM0.07480.09769.23%80.54%0.00822.62%0.98022638249294.48%
CNN-GRU0.05230.077210.75%84.67%0.00762.04%0.98872139201794.29%
LSTM-XGBoost0.05300.077912.26%91.40%0.00762.10%0.98872031183190.16%
GNN-TCN0.05740.083012.75%92.07%0.00812.34%0.98602930266891.04%
MSFS0.06500.10525.32%93.83%0.00881.71%0.993414,40013,37692.89%
Turbine #2CNN-LSTM0.06860.10065.10%89.81%0.00842.12%0.98962430222891.69%
Transformers0.07480.10666.67%89.24%0.00892.44%0.98911807168393.13%
TCN-LSTM0.07240.096510.45%89.34%0.00812.77%0.98031948175490.03%
CNN-GRU0.08410.12048.02%86.80%0.01002.79%0.97982432223992.05%
LSTM-XGBoost0.05700.081110.31%89.61%0.00712.46%0.98552764255692.48%
GNN-TCN0.06690.095010.81%89.03%0.00842.85%0.97903019285094.40%
MSFS0.06070.10307.22%90.25%0.00861.66%0.991914,40013,30992.43%
Turbine #3CNN-LSTM0.06530.09044.71%90.09%0.00782.09%0.98992717257594.78%
Transformers0.06930.09046.97%86.97%0.00782.40%0.98861768161391.26%
TCN-LSTM0.07260.10195.90%83.06%0.00882.44%0.98652064194494.20%
CNN-GRU0.07220.093510.87%89.40%0.00822.73%0.98002004189594.54%
LSTM-XGBoost0.06610.09086.55%92.05%0.00792.17%0.98902655245092.28%
GNN-TCN0.06620.09184.99%90.97%0.00792.18%0.98883192294892.36%
MSFS0.06080.09994.59%93.02%0.00861.64%0.995414,40013,42693.23%
Turbine #4CNN-LSTM0.06530.09017.21%89.12%0.00782.11%0.98951892178894.51%
Transformers0.06380.08797.73%87.01%0.00762.13%0.98832435225092.39%
TCN-LSTM0.06960.090210.11%80.13%0.00782.70%0.98211808165891.71%
CNN-GRU0.07540.104112.09%87.68%0.00902.79%0.98162170202093.10%
LSTM-XGBoost0.04950.07457.44%84.28%0.00741.97%0.99452536233592.07%
GNN-TCN0.04840.069710.70%89.59%0.00692.20%0.98603559333593.69%
MSFS0.05730.09549.08%91.77%0.00831.74%0.995014,40013,38692.96%
Turbine #5CNN-LSTM0.06330.09349.15%85.02%0.00792.33%0.98532687244190.84%
Transformers0.06600.098310.50%81.03%0.00832.41%0.98292002189294.51%
TCN-LSTM0.07000.101511.26%81.68%0.00862.67%0.97921936174690.17%
CNN-GRU0.06650.088111.45%84.80%0.00752.93%0.97622099198994.74%
LSTM-XGBoost0.07000.09748.34%89.32%0.00792.13%0.98822517232492.33%
GNN-TCN0.07050.10078.82%89.75%0.00822.16%0.98713159285990.52%
MSFS0.05770.09656.35%91.95%0.00821.73%0.990614,40013,25192.02%
Turbine #6CNN-LSTM0.05620.07978.56%88.73%0.00732.09%0.98952635238190.37%
Transformers0.05950.08548.01%89.01%0.00782.14%0.98891874169090.19%
TCN-LSTM0.05830.076811.70%88.18%0.00712.53%0.98651767164292.90%
CNN-GRU0.06340.089313.02%89.48%0.00822.57%0.98162229205692.25%
LSTM-XGBoost0.07000.09749.01%81.77%0.00822.45%0.98773326309092.90%
GNN-TCN0.07080.10077.60%88.56%0.00852.47%0.98712569232290.38%
MSFS0.04950.08379.63%92.68%0.00771.60%0.991114,40013,18191.53%
Table 5. Forecasting Accuracy of the Proposed MSFSC for Category II Wind Turbine Applications.
Table 5. Forecasting Accuracy of the Proposed MSFSC for Category II Wind Turbine Applications.
NO Model MAE RMSE STDAPE DA TIC MAPE R2 OP SOP SA
Turbine #1CNN-LSTM0.13820.21161.57%89.94%0.01281.66%0.995316915290.21%
Transformers0.11450.17881.89%85.16%0.01441.99%0.995912811892.95%
TCN-LSTM0.13500.21453.87%88.68%0.02763.55%0.960915914390.29%
CNN-GRU0.11010.17422.87%90.70%0.01622.63%0.992317215490.00%
LSTM-XGBoost0.06360.10791.82%85.96%0.01201.56%0.993717816492.42%
GNN-TCN0.12690.18803.05%94.06%0.01982.87%0.977620219094.42%
MSFS0.11470.18342.96%89.38%0.01762.55%0.9908100892191.37%
Turbine #2CNN-LSTM0.22860.38232.88%89.68%0.02242.53%0.980212611591.34%
Transformers0.07240.13452.22%94.12%0.01301.66%0.995317016194.97%
TCN-LSTM0.12580.19173.43%91.88%0.02443.44%0.981716014993.23%
CNN-GRU0.15120.22723.89%89.43%0.02463.72%0.964412311391.96%
LSTM-XGBoost0.12110.18343.35%85.86%0.02173.20%0.978519117792.95%
GNN-TCN0.14440.21413.22%92.44%0.02443.38%0.977223821690.94%
MSFS0.13940.22443.28%90.67%0.02253.15%0.9850100893192.36%
Turbine #3CNN-LSTM0.17030.31182.51%93.72%0.02212.12%0.986219117290.05%
Transformers0.12870.24485.28%87.74%0.02733.33%0.972115514090.76%
TCN-LSTM0.12690.23413.96%88.59%0.03203.28%0.944214914194.85%
CNN-GRU0.15210.23103.52%91.13%0.02293.42%0.975912411492.56%
LSTM-XGBoost0.11840.18893.13%90.76%0.02292.87%0.978918417394.52%
GNN-TCN0.11250.17092.99%93.17%0.02142.95%0.979120519293.68%
MSFS0.12920.22123.50%91.07%0.02392.97%0.9802100893292.46%
Turbine #4CNN-LSTM0.13020.28162.43%90.45%0.02341.82%0.988719918492.56%
Transformers0.19040.28365.22%90.43%0.03224.78%0.957518817492.64%
TCN-LSTM0.11050.17712.90%90.40%0.01842.55%0.989819817890.40%
CNN-GRU0.15120.25564.43%92.44%0.02503.44%0.983211910991.83%
LSTM-XGBoost0.10950.18683.69%86.21%0.02312.86%0.969514513693.85%
GNN-TCN0.15400.24714.05%86.16%0.03594.01%0.754415914893.71%
MSFS0.14190.24014.06%89.38%0.02613.34%0.9767100892992.16%
Turbine #5CNN-LSTM0.14360.25652.12%89.76%0.01321.57%0.993116615493.11%
Transformers0.10270.20913.12%85.38%0.01712.13%0.992217116294.91%
TCN-LSTM0.11770.19443.57%90.28%0.01912.90%0.990721619490.06%
CNN-GRU0.15350.23402.67%85.85%0.02082.93%0.98541069590.26%
LSTM-XGBoost0.10590.14922.78%93.22%0.01822.86%0.985617716492.74%
GNN-TCN0.10490.15033.06%90.70%0.02413.25%0.820617215590.48%
MSFS0.11970.19322.99%89.48%0.01782.77%0.9920100892491.67%
Turbine #6CNN-LSTM0.14640.26812.06%93.87%0.01701.63%0.988916315394.36%
Transformers0.08080.11982.60%91.95%0.01232.34%0.995017416293.34%
TCN-LSTM0.07600.11532.35%93.21%0.01061.89%0.996016214690.18%
CNN-GRU0.13320.21113.83%85.42%0.02193.31%0.988414413292.21%
LSTM-XGBoost0.07270.11572.97%86.21%0.01462.24%0.991417415890.86%
GNN-TCN0.12180.19222.85%90.58%0.02122.97%0.987219117290.15%
MSFS0.10520.17742.93%90.28%0.01742.51%0.9912100892391.57%
Table 6. Analysis of Power Curve and Technical Parameters of the Wind Turbine.
Table 6. Analysis of Power Curve and Technical Parameters of the Wind Turbine.
Power Curve with Standard Air Density
Standard Power Curve P ( v i ) = 1 2 ρ ( T , H ) A v i 3 C p ( v i )
Power Coefficient C p ( v i ) = 0.3801 v i 3 1.924 v i 2 + 33.27 v i 418.5 v i 4 26.24 v i 3 + 267.1 v i 2 1274 v i + 2949
Actual Power Curve for Wind Turbine
Air Density Correction Function ρ ( T , H ) = 353.1 e 342 H T T
Actual Power Coefficient Curve (Low bound) C p ( v i ) = 0.6313 v i 3 12.69 v i 2 + 131.4 v i 250.8 v i 4 22.99 v i 3 + 187.3 v i 2 542.8 v i + 624.1
Actual Power Coefficient Curve (Standard) C p ( v i ) = 0.5466 v i 3 10.64 v i 2 + 121.43 v i 237.0 v i 4 23.08 v i 3 + 191.2 v i 2 547.1 v i + 629.4
Actual Power Coefficient Curve (Upper bound) C p ( v i ) = 0.5689 v i 3 10.93 v i 2 + 124.8 v i 258.1 v i 4 22.41 v i 3 + 176.9 v i 2 414.1 v i + 326.5
Table 7. Parameter of wind turbine.
Table 7. Parameter of wind turbine.
TypeSL1500/82
ManufacturerSinovel
Rated Wind Speed10.5 m/s
Rotor Diameter82 m
Hub Height70 m
Cut-In Wind Speed3 m/s
Cut-Out Wind Speed22 m/s
Table 8. Assessment of Wind Conversion Efficiency by Each Model and the MSFSC.
Table 8. Assessment of Wind Conversion Efficiency by Each Model and the MSFSC.
NOModelNRMDRMSER2MPENOModelRMDRMSER2MPE
Turbine #1CNN-LSTM4.778%71.55220.9928−4.760%Turbine #4CNN-LSTM4.155%61.23870.9958−4.042%
Transformers4.973%75.57930.9936−5.434%Transformers4.561%68.89290.9949−4.371%
TCN-LSTM4.984%78.03040.9943−5.699%TCN-LSTM4.467%66.36240.9968−4.253%
CNN-GRU4.802%72.21870.9920−5.348%CNN-GRU4.400%64.77760.9959−4.186%
LSTM-XGBoost4.886%75.45740.9937−5.413%LSTM-XGBoost4.377%64.56170.9952−4.176%
GNN-TCN4.813%72.81230.9930−5.379%GNN-TCN4.361%64.38250.9960−4.056%
MSFS4.438%67.69010.9957−3.823%MSFS4.037%58.96290.9973−3.253%
Turbine #2CNN-LSTM4.820%71.20500.9932−4.725%Turbine #5CNN-LSTM3.960%57.14720.9975−4.252%
Transformers4.973%74.14800.9934−5.530%Transformers4.006%57.81980.9971−4.276%
TCN-LSTM4.968%73.38960.9936−5.428%TCN-LSTM4.070%58.43510.9973−4.351%
CNN-GRU4.737%68.79010.9938−4.542%CNN-GRU3.823%54.78540.9978−4.194%
LSTM-XGBoost5.038%74.98990.9940−5.598%LSTM-XGBoost3.980%57.39130.9973−4.266%
GNN-TCN4.829%71.52000.9948−4.926%GNN-TCN3.928%56.75330.9974−4.228%
MSFS4.262%64.69740.9961−3.329%MSFS3.808%54.67070.9979−3.253%
Turbine #3CNN-LSTM4.186%59.54430.9974−4.649%Turbine #6CNN-LSTM7.737%115.43820.9836−3.901%
Transformers4.228%59.85090.9978−4.708%Transformers7.901%117.73010.9931−5.837%
TCN-LSTM4.167%59.38600.9974−4.577%TCN-LSTM7.704%114.58720.9828−3.253%
CNN-GRU4.122%58.64050.9976−4.545%CNN-GRU6.092%113.69730.9819−2.887%
LSTM-XGBoost4.213%59.69980.9976−4.707%LSTM-XGBoost7.754%115.74310.9903−5.735%
GNN-TCN4.117%58.60960.9975−4.452%GNN-TCN7.613%113.75320.9828−2.643%
MSFS4.067%57.85200.9979−3.897%MSFS5.458%83.57710.9828−1.882%
Note: N R M D =   1 n i = 1 n x i x actual ¯ 1 n i = 1 n x actual , i where n is the total number of simulated discharges during rainfall events, with x i and x a c t u a l , i representing the i th simulated and actual discharge values, and x a c t u a l ¯ denoting the mean of the observed reference data. While MPE does not incorporate an absolute value, this feature provides its analytical advantage. Due to the cancelation of positive and negative errors, MPE is typically unsuitable for assessing the overall forecasting performance.
Table 9. The model evaluation by kw test and uncertainty analysis of wind speed forecasting.
Table 9. The model evaluation by kw test and uncertainty analysis of wind speed forecasting.
NOComparisonA-p-Value (KW)p-Value (KS)FICPFINAWAWD
Turbine #1CNN-LSTM vs. Actual Data−2.1281.00000.993574.21%0.028270.5032
Transformers vs. Actual Data−0.6041.00000.988173.61%0.030730.5470
TCN-LSTM vs. Actual Data0.0621.00000.996967.06%0.033850.6026
CNN-GRU vs. Actual Data−0.2001.00000.999569.84%0.031630.5631
LSTM-XGBoost vs. Actual Data−1.3751.00000.999987.10%0.017230.3066
GNN-TCN vs. Actual Data1.2581.00001.000068.25%0.033550.5971
MSFS vs. Actual Data−1.3461.00000.993595.24%0.010050.1788
Turbine #2CNN-LSTM vs. Actual Data0.8821.00000.998761.90%0.067321.3600
Transformers vs. Actual Data−0.7641.00000.912569.94%0.064321.2992
TCN-LSTM vs. Actual Data10.3511.00000.979864.98%0.074501.5048
CNN-GRU vs. Actual Data−4.9231.00000.858062.10%0.106782.1569
LSTM-XGBoost vs. Actual Data−2.0861.00000.968466.07%0.068401.3816
GNN-TCN vs. Actual Data−1.2391.00000.988165.58%0.073841.4916
MSFS vs. Actual Data9.6111.00000.953593.45%0.017250.3484
Turbine #3CNN-LSTM vs. Actual Data−0.0301.00000.998787.60%0.075481.6983
Transformers vs. Actual Data−9.7821.00000.912556.15%0.136043.0609
TCN-LSTM vs. Actual Data−0.7651.00000.979858.04%0.126652.8497
CNN-GRU vs. Actual Data8.9041.00000.858052.98%0.141863.1917
LSTM-XGBoost vs. Actual Data0.9541.00000.826565.77%0.121222.7274
GNN-TCN vs. Actual Data8.7911.00000.988162.80%0.121892.7424
MSFS vs. Actual Data−1.8331.00000.993594.94%0.046451.0450
Turbine #4CNN-LSTM vs. Actual Data−2.1281.00000.993587.10%0.017230.3066
Transformers vs. Actual Data−0.6041.00000.988173.61%0.030730.5470
TCN-LSTM vs. Actual Data0.0621.00000.996974.21%0.028270.5032
CNN-GRU vs. Actual Data−0.2001.00000.999569.84%0.031630.5631
LSTM-XGBoost vs. Actual Data−1.3751.00000.999967.06%0.033850.6026
GNN-TCN vs. Actual Data1.2581.00001.000068.25%0.033550.5971
MSFS vs. Actual Data−1.3461.00000.993595.24%0.010050.1788
Turbine #5CNN-LSTM vs. Actual Data0.9901.00001.000086.11%0.029550.5378
Transformers vs. Actual Data4.5401.00000.999972.62%0.051900.9446
TCN-LSTM vs. Actual Data4.5111.00000.998766.57%0.054190.9863
CNN-GRU vs. Actual Data3.5471.00000.999965.38%0.056111.0212
LSTM-XGBoost vs. Actual Data5.3371.00001.000069.54%0.052200.9500
GNN-TCN vs. Actual Data2.4291.00000.999963.89%0.063121.1487
MSFS vs. Actual Data4.8531.00000.999594.35%0.015100.2748
Turbine #6CNN-LSTM vs. Actual Data0.6001.00001.000087.90%0.023690.4595
Transformers vs. Actual Data5.7881.00000.999569.15%0.041550.8061
TCN-LSTM vs. Actual Data7.2811.00000.998775.20%0.040370.7833
CNN-GRU vs. Actual Data7.0081.00000.988166.37%0.047170.9151
LSTM-XGBoost vs. Actual Data0.8171.00000.993571.83%0.041240.8001
GNN-TCN vs. Actual Data6.5541.00000.988168.65%0.045140.8758
MSFS vs. Actual Data2.9521.00000.999993.35%0.013390.2598
Note: When the p-value is greater than 0.05, the model’s forecasts are considered statistically consistent with the verification set observations. In contrast, a p-value less than 0.05 indicates a statistically significant discrepancy between the observed and forecasted values.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zeng, M.; Jia, Q.; Wen, Z.; Mao, F.; Huang, H.; Pan, J. Research and Application of a Model Selection Forecasting System for Wind Speed and Theoretical Power Generation. Future Internet 2026, 18, 7. https://doi.org/10.3390/fi18010007

AMA Style

Zeng M, Jia Q, Wen Z, Mao F, Huang H, Pan J. Research and Application of a Model Selection Forecasting System for Wind Speed and Theoretical Power Generation. Future Internet. 2026; 18(1):7. https://doi.org/10.3390/fi18010007

Chicago/Turabian Style

Zeng, Ming, Qianqian Jia, Zhenming Wen, Fang Mao, Haotao Huang, and Jingyuan Pan. 2026. "Research and Application of a Model Selection Forecasting System for Wind Speed and Theoretical Power Generation" Future Internet 18, no. 1: 7. https://doi.org/10.3390/fi18010007

APA Style

Zeng, M., Jia, Q., Wen, Z., Mao, F., Huang, H., & Pan, J. (2026). Research and Application of a Model Selection Forecasting System for Wind Speed and Theoretical Power Generation. Future Internet, 18(1), 7. https://doi.org/10.3390/fi18010007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop