Dynamic Risk Assessment of Wind Farms Under Extreme Gust Disturbances Using BiLSTM, Adaboost, and Adaptive Kernel Density Estimation for Enhanced Prediction

Ke Shang; Yanbin Yang; Jinsheng Wang; Meng Shuai; Zhongjie Dang; Qing Liu

doi:10.3390/electronics14234584

,

and

¹

State Grid Beijing Electric Power Company, Beijing 100051, China

²

State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, North China Electric Power University (Baoding), Baoding 071003, China

^*

Author to whom correspondence should be addressed.

Electronics2025, 14(23), 4584;https://doi.org/10.3390/electronics14234584
(registering DOI)

This article belongs to the Special Issue Advanced Online Monitoring and Fault Diagnosis of Power Equipment

Version Notes

Order Reprints

Abstract

Under extreme gusty weather conditions, wind power output fluctuates significantly, and the operational risks of wind farms exhibit strong randomness and complexity. Addressing the issue of limited sample data under such extreme wind speed conditions, which leads to low accuracy in risk assessment, this study proposes a multi-dimensional risk assessment method for wind farms based on the BiLSTM-Adaboost-ABKDE algorithm framework and ITA-TPW (Integrated Trend Analysis—Time-Series Probability Weighting) weighted fusion. First, a wind farm operational risk assessment model based on a multi-dimensional risk indicator system is constructed to assess the impact of these extreme weather conditions on wind farm operational risks. Second, to enhance the robustness of wind power output prediction under small-sample conditions, a BiLSTM (Bidirectional Long Short-Term Memory) network is introduced to capture the bidirectional temporal dependencies of wind power output. Through the Adaboost weighted ensemble of sub-models, combined with the ABKDE (Adaptive Bandwidth Kernel Density Estimation) algorithm for probability density estimation of prediction outputs, high-precision point prediction and uncertainty quantification are unified. Furthermore, we combine trend analysis sensitivity analysis using the ITA method with time-series probability weighting using the TPW method to assess key risks such as voltage overlimit, power flow overlimit, and load imbalance in wind farms under extreme wind speeds from multiple perspectives. The dataset used in this study comes from Hebei Province, covering wind power data from 1 March 2020 to 31 May 2023, and the simulation platform used is MATLAB R2023a. Simulation analysis was conducted using the IEEE RTS-79 node system to validate the effectiveness of the proposed method. The results showed that the proposed method improved the accuracy of load shedding risk and voltage overlimit risk indicators by 6.15% and 4.79%, respectively. Additionally, the reliability of the system’s comprehensive risk indicators was significantly enhanced, validating the effectiveness of this method in improving the accuracy and reliability of operational risk assessment for wind farms under extreme weather conditions.

Keywords:

wind power forecasting; risk assessment; BiLSTM-Adaboost-ABKDE; extreme gust conditions; uncertainty quantification; wind farm risk indicators; power flow overlimit; voltage overlimit

1. Introduction

With the intensification of global climate change, the frequency of extreme weather events, especially strong winds and abrupt wind speed changes, has been increasing. These extreme weather phenomena often lead to voltage fluctuations, power flow interruptions, and load imbalances in the power grid, seriously threatening the safety and stability of the grid. Wind power generation is highly sensitive to wind speed, particularly under extreme weather conditions. When wind turbines are disconnected on a large scale, the accuracy of wind power forecasting can significantly decline [,]. This issue not only increases the complexity of grid scheduling but also affects the stable operation of the grid. Therefore, finding an effective method to improve wind power forecasting accuracy under extreme wind conditions and conducting precise risk assessments is crucial for ensuring grid stability and safety [,,,].

In recent years, researchers have begun to adopt data decomposition methods to preprocess wind power data to improve forecasting accuracy. These methods can effectively capture the nonlinear features in the data and adapt to dynamic changes, thereby reducing the complexity of power sequences and improving forecasting accuracy []. For example, EMD (Empirical Mode Decomposition) [] and CEEMD (Complementary Ensemble Empirical Mode Decomposition) [] have been widely used in wind power forecasting, but they still face issues such as mode mixing and high computational complexity. VMD (Variational Mode Decomposition), as an emerging method, can effectively handle high-frequency noise and non-stationary signals, making it a hot topic in current data processing research []. However, under extreme weather conditions, the scarcity of wind power data limits forecasting models in terms of parameter setting and hypothesis testing, making it difficult to capture the instantaneous nonlinear features of wind power, which affects the training and prediction performance of the models.

To address the issue of reduced forecasting accuracy caused by small sample sizes under extreme weather events, some studies have proposed data augmentation techniques. For example, in the literature, ref. [] proposed several data sampling strategies to solve the problem of imbalanced training samples under different weather events, effectively improving the accuracy of wind power ramp-up warnings. Literature [,] proposed data augmentation methods based on Generative Adversarial Networks (GAN) to expand training samples for wind power, thereby improving the accuracy of short-term wind power forecasting. At the same time, Conditional Generative Adversarial Networks (CGAN) and TimeGAN have also been applied to augment small sample data, achieving good forecasting results [,]. Despite these advancements, existing data augmentation methods still fail to fully preserve the physical characteristics and temporal dependencies of wind power data. Therefore, traditional modal decomposition algorithms still face challenges when handling the transient nonlinear dynamic features of wind power under extreme weather, such as insufficient multi-scale feature decoupling and sensitivity to parameter settings, which leaves room for improving the physical interpretability and accuracy of these methods.

To address these issues, this paper proposes a hybrid model based on Bidirectional Long Short-Term Memory (BiLSTM)—Adaptive Boosting (AdaBoost)—Adaptive Bandwidth Kernel Density Estimation (ABKDE). This model combines BiLSTM’s powerful temporal modeling capabilities, AdaBoost’s sample weighting mechanism, and the uncertainty quantification of the ABKDE module to solve the small sample and high volatility issues in extreme gust data. BiLSTM is used to extract temporal features from wind speed data, while AdaBoost optimizes the training process and enhances the recognition ability for hard-to-predict samples. The ABKDE module performs non-parametric probability density estimation on prediction residuals, transforming traditional “point predictions” into “prediction + uncertainty,” effectively improving both prediction accuracy and uncertainty quantification. In extreme gust scenarios with limited historical samples and strong noise, this hybrid structure allows BiLSTM to capture bidirectional temporal dependencies, while AdaBoost emphasizes hard-to-predict extreme ramps during training and ABKDE provides a noise-robust probabilistic description of prediction errors, thereby directly targeting the coexistence of small sample size and high volatility. Similar hybrid models have achieved significant success in other fields. For example, Gong Wenjie et al. [] combined BiLSTM with AdaBoost to extract temporal features via BiLSTM and adjusted sample weights using AdaBoost, improving the accuracy and robustness of heat load forecasting. Zhang Yushan et al. proposed a BiLSTM–AdaBoost model combined with Bayesian optimization, which extracted features from time-series data using BiLSTM and optimized multi-model prediction performance via AdaBoost, significantly improving short-term heat load forecasting accuracy, especially in the face of data fluctuations and noise []. However, these BiLSTM–AdaBoost models were developed for relatively smooth load or time-series data with abundant historical samples; when directly applied to rare, high-amplitude gust events, their ensemble averaging effect tends to smooth sharp power ramps and underrepresent tail behavior, which limits their applicability to operational risk analysis under extreme gust disturbances.

In addition, Yang Tengjie et al. [] proposed an interpretable model based on NRBO-XGBoost and ABKDE, using ABKDE for error correction and successfully improving tunneling speed prediction accuracy. Kang Yan et al. [] proposed a flood interval forecasting method based on an error correction fusion model, which successfully improved forecast accuracy by combining error correction with ABKDE. These studies demonstrate that the combination of BiLSTM–AdaBoost and ABKDE has significant advantages in feature extraction and time-series analysis, providing strong theoretical support for this study.

In recent years, modern deep learning technologies such as Generative Adversarial Networks (GAN) and Transformer have been introduced into the fields of extreme weather event prediction and wind power forecasting. For example, in the literature, ref. [] proposed a hybrid model that combines gust detection with a Conditional LSTM to more accurately forecast wind speed under extreme gust conditions. However, these methods mainly focus on improving wind speed or wind power forecasting performance and do not explicitly consider how real-time wind power disturbances propagate to grid-level operational risks. Similarly, Li Ling et al. [] proposed a wind farm fault and capacity loss prediction method based on ensemble learning, which provides an important reference for wind power grid integration risk assessment. Nevertheless, most of these approaches are still confined to a single link in the chain and fail to comprehensively cover the full process from “extreme wind speed disturbances” through “wind power forecasting” to “grid operation risk assessment.”

Motivated by these limitations, this study applies the BiLSTM–AdaBoost–ABKDE hybrid model to wind power forecasting and grid risk assessment, aiming to improve forecasting accuracy while quantifying uncertainty, thereby providing a new solution for wind power grid scheduling and risk assessment under extreme gust conditions. Compared with existing hybrid and ensemble forecasting methods that mainly focus on point prediction accuracy under normal or moderately fluctuating conditions, or address only one stage of the process, the proposed framework explicitly targets extreme gust disturbances with scarce data and couples wind power forecasting with multi-dimensional operational risk indicators for wind farms and the power grid. To evaluate this framework on a realistic yet standardized benchmark, the IEEE RTS-79 reliability test system is adopted as the background network, providing representative topology, generation, and load data for reproducible dynamic risk assessment under extreme gust conditions.

2. Precise Modeling of Wind Farms Under Extreme Gusts

2.1. Detailed Wind Farm Model

2.1.1. Extreme Wind Event Model

Extreme wind conditions refer to wind conditions with a frequency of once per year that affect the operational safety of wind turbines during operation, as well as those with a frequency of once per 50 years that affect the survival status of wind turbines. Table 1 presents the wind turbine classes and their parameters according to the IEC 61400-1 standard [].

Table 1. Parameters for Wind Turbine Classifications.

(1) Extreme wind speed model: sustained wind model or turbulent wind model. Extreme wind speed models are based on reference wind speeds and specific turbulent standard deviations [].

50-year return period extreme average wind speed:

V_{e 50} = 1.4 V_{r e f} {(\frac{z}{z_{h u b}})}^{0.11}

(1)

1-year return period extreme average wind speed:

V_{e 1} = 0.8 V_{e 50}

(2)

Longitudinal turbulent standard deviation:

σ_{1} = 0.11 V_{hub}

(3)

(2) Extreme operational gust model: Amplitude of gusts at the hub of a standard wind turbine:

V_{g u s t} = \min [1.35 (V_{e 1} - V_{h u b}), 3.3 (\frac{σ_{1}}{1 + 0.1 D / A_{1}})]

(4)

where D is the rotor diameter; Wind speed:

V (z, t) = \{\begin{array}{l} V (z) - 0.37 V_{g u s t} \sin (\frac{3 π t}{T}) (1 - \cos (\frac{2 π t}{T})), & 0 \leq t \leq T \\ V (z), & Others \end{array}

(5)

where the 1-year return period extreme operational gust T is taken as 10.5 s, and the 50-year return period T is taken as 14 s.

(3) Extreme Continuous Gust Model:

V (z, t) = \{\begin{array}{l} V (z), & t \leq 0 \\ V (z) + 0.5 V_{c g} (1 - \cos (\frac{π t}{T})), & 0 \leq t \leq T \\ V (z) + V_{c g}, & t > T \end{array}

(6)

The amplitude of extreme continuous gusts for standard wind turbine classes is 15 m/s, with T = 10 s, representing the rise time.

(4) Extreme continuous gust model with wind direction change: The angle of wind direction change for extreme continuous gusts is within [0°,

θ_{c g}

], where

θ_{c g} = \{\begin{array}{l} 180^{°}, & V_{h u b} \leq 4 m / s \\ \frac{720^{°}}{V_{h u b}}, & 4 m / s < V_{h u b} < V_{r e f} \end{array}

(7)

Additionally, the relationship between wind direction and time is:

θ_{c g} = \{\begin{array}{l} \pm 0.5 θ_{c g} (1 - \cos (\frac{Π t}{T})), & t < 0 \\ \pm θ_{c g}, & 0 \leq t \leq T \\ \pm θ_{c g}, & t > T \end{array}

(8)

where the duration of wind direction change T is taken as 6 s, and the sign (+ or −) should be selected based on the better transient load. The wind direction is considered unchanged after the transient extreme wind direction change ends.

(5) Extreme wind direction change:

θ_{e} = \pm 4 \arctan (\frac{σ_{1}}{V_{h u b} (1 + 0.1 D / A_{1})})

(9)

The amplitude of the extreme wind direction change is where D is the rotor diameter; the value is within the range [−180°, 180°];

The function of extreme wind direction with respect to time:

θ (t) = \{\begin{array}{l} \pm 0.5 θ_{e} (1 - \cos (\frac{Π t}{T})), & t < 0 \\ θ_{e}, & 0 \leq t \leq T \\ θ_{e}, & t > T \end{array}

(10)

(6) Extreme wind shear: The transient wind speed values of extreme wind shear are divided into transient vertical shear and transient horizontal shear.

Transient vertical shear:

V (z, t) = \{\begin{array}{l} V_{h u b} {(\frac{z}{z_{h u b}})}^{α} \pm \frac{z - z_{h u b}}{D} ((2.5 + 0.2 β σ_{1} {(\frac{D}{A_{1}})}^{1 / 4}) (1 - \cos (\frac{2 Π t}{T}))), & 0 \leq t \leq T \\ V_{h u b} {(\frac{z}{z_{h u b}})}^{α}, & Others \end{array}

(11)

Transient horizontal shear:

b (y, z, t) = \{\begin{array}{l} V_{n u b} {(\frac{z}{z_{n u b}})}^{α} \pm \frac{z - z_{n u b}}{D} ((2.5 + 0.2 β σ_{1} {(\frac{D}{A_{1}})}^{1 / 4}) (1 - \cos (\frac{2 Π t}{T}))), & 0 \leq t \leq T \\ V_{n u b} {(\frac{z}{z_{n u b}})}^{α}, & Others \end{array}

(12)

In Equations (11) and (12), a = 0.2, β = 0.64, T = 12 s; D is the diameter of the wind wheel.

To illustrate the parameter ranges adopted in this paper and to validate the physical reasonableness of Equations (4), (11) and (12), we consider a representative IEC Class II wind turbine. According to Table 1, the corresponding 10 min reference mean wind speed is 37.5–42.5 m/s, and the hub-height mean wind speed under normal operation is approximately 10–15 m/s. In our simulations, the rotor diameter D of the turbines installed in the wind farm lies in the range of 80–100 m, and we take D = 90 m as a representative value for calculating the extreme gust and transient shear profiles.

For a hub-height mean wind speed of 15 m/s and D = 90 m, substituting the above parameters into Equations (11) and (12) yields transient vertical and horizontal shear components whose maximum increments are on the order of 1.5–2.0 m/s across the rotor disk, i.e., approximately 10–15% of the hub-height mean wind speed.

2.1.2. Extreme Gust Duration Model

Assessing the impact intensity of extreme weather on power systems requires consideration of both spatial and temporal dimensions. Spatially, system components operate under varying conditions depending on their geographic locations. Temporally, exposure duration differs across affected areas. As shown in Figure 1, the intersection of the extreme weather trajectory with the spatial layout of the power system determines exposure time, which is governed by the event’s path length and propagation speed, as expressed in Equation (13).

T_{l e n} = \frac{D_{w r}}{V_{w}}

(13)

Figure 1. Schematic diagram of the spatial relationship between extreme weather trajectories and the affected power system area.

In Equation (13),

D_{w r}

is the trajectory distance of extreme gusts in the affected power system area, and

V_{w}

is the movement speed of the center of extreme gusts in this area.

Figure 2 provides a schematic illustration of how the trajectory of an extreme gust event leads to different exposure durations for different wind farms. The gust propagates along the dashed path with a translational velocity

V_{w}

, and its influence area at each time instant is represented by the moving circular footprint with radius

D_{w r}

around the gust center. As shown in the figure, the trajectory intersects the influence region of Wind Farm 1 over a longer chord length than that of Wind Farm 2. According to Equation (13), Wind Farm 1 experiences a longer exposure duration, while Wind Farm 2 is exposed for a shorter period.

Figure 2. Extreme gust trajectory and exposure regions of wind farms. The dashed line represents the path of extreme wind gusts, and the arrows indicate the diameter of the circle, which is the parameter Dwr.

2.1.3. Wind Energy Conversion Model

Based on extensive research [], the output power of wind turbines satisfies a specific relationship curve with wind speed, known as the wind turbine power characteristic curve. A typical wind turbine power characteristic curve is shown in Figure 3.

Figure 3. Wind turbine power curve.

In the figure,

v_{c i}

represents the cut-in wind speed,

v_{c o}

represents the cut-out wind speed,

v_{r}

represents the rated wind speed,

P_{w r}

represents the active power generated by the wind turbine at the rated wind speed, and

v

represents the wind speed at the wind farm. According to the principles of aerodynamics, the electrical power generated by a wind turbine is proportional to the cube of the wind speed. The relationship between the output power and the wind speed can be expressed as:

P_{W} (v_{x}) = \{\begin{array}{l} 0, & (v_{x} < v_{c i}) \cup (v_{x} > v_{c o}) \\ \frac{P_{w r}}{v_{r}^{3} - v_{c i}^{3}} (v_{x}^{3} - v_{c i}^{3}), & (v_{c i} \leq v_{x} \leq v_{r}) \\ P_{w r}, & (v_{r} < v_{x} \leq v_{c o}) \end{array}

(14)

In Equation (14),

P_{W} (v_{x})

represents the active power generated by the

v_{x}

fan under wind speed.

In this study, the wind turbines installed in the case study wind farm have operating parameters

v_{c i}

= 3 m/s,

v_{r}

= 12 m/s, and

v_{c o}

= 25 m/s, according to the manufacturer’s technical specifications, and these values are used consistently in all simulations when applying the power curve in Equation (14).

3. BiLSTM-Adaboost-ABKDE Algorithm

To address the challenges posed by small sample sizes, high volatility, and high uncertainty in wind power forecasting under extreme gust conditions, this paper proposes a BiLSTM-Adaboost-ABKDE hybrid prediction framework. In contrast to deep learning models that rely on large sample sizes for training, such as Transformer or GAN, the proposed approach emphasizes model robustness, generalization capability, and computational efficiency.

BiLSTM (Bidirectional Long Short-Term Memory) networks are capable of modeling both forward and backward dependencies in time series data, effectively capturing the nonlinear dynamic features induced by wind speed fluctuations []. Studies have demonstrated that BiLSTM achieves higher accuracy and stability compared to unidirectional LSTM and GRU in small-sample renewable energy forecasting. To mitigate the interference of noisy samples and non-stationary signals during training, this paper employs the Adaboost ensemble algorithm, which integrates multiple BiLSTM sub-models [].

In this study, the ABKDE (Adaptive Bandwidth Kernel Density Estimation) module is utilized to model the probability distribution of the residuals between the BiLSTM predicted values and the actual values. Specifically, BiLSTM generates wind speed forecasts through its temporal modeling capabilities, while Adaboost enhances the model’s performance by weighting multiple BiLSTM models. Subsequently, ABKDE estimates the probability density of these prediction residuals, thereby quantifying the uncertainty in the model’s predictions. Compared to traditional KDE methods, ABKDE offers the advantage of adaptively adjusting the bandwidth parameter based on local data density, resulting in more precise and smoother distribution estimates. This adaptive mechanism is particularly crucial in small-sample environments, ensuring that prediction errors’ uncertainty is accurately quantified, thus providing more reliable information for risk assessment.

Although models such as Transformer and GAN perform well on large datasets [], their reliance on large sample sizes and high computational resources limits their application in small-sample and high-noise data scenarios. In contrast, the BiLSTM-Adaboost-ABKDE framework exhibits superior convergence stability and prediction accuracy in small-sample, high-noise wind power data scenarios. Additionally, it effectively quantifies prediction uncertainty, further enhancing grid risk assessment capabilities.

3.1. BiLSTM Algorithm Principle

The bidirectional long short-term memory network is based on LSTM and captures bidirectional dependencies in time series by combining the outputs of the forward and backward LSTM networks, thereby improving the accuracy of model predictions to a certain extent. The BiLSTM algorithm structure is shown in Figure 4 [].

Figure 4. BiLSTM algorithm structure.

Specific expression of the BiLSTM algorithm:

\begin{array}{l} \vec{h_{t}} = L S T M (x_{t}, \vec{h_{t - 1}}) \\ \overset{\leftarrow}{h_{t}} = L S T M (x_{t}, \overset{\leftarrow}{h_{t - 1}}) \\ y_{t} = σ (W_{y} [\vec{h_{t}}, \overset{\leftarrow}{h_{t}}] + b_{y}) \end{array}

(15)

In Equation (15),

\vec{h_{t}}

is the state of the forward LSTM hidden layer at time t;

\overset{\leftarrow}{h_{t}}

is the state of the backward LSTM hidden layer at time t;

W_{y}

represents the weight;

b_{y}

represents the bias term;

x_{t}

represents the input data at time t;

y_{t}

represents the output data at time t.

3.2. Adaboost Algorithm Principle

Adaboost is an ensemble learning algorithm that iteratively builds multiple weak regressors and aggregates them into a strong predictor []. At each iteration, training sample weights are updated, assigning greater importance to observations with larger previous errors. The reweighted data are used to train the next weak regressor, and the final model is obtained by combining all weak regressors.

The Adaboost training procedure is as follows:

(1): The initial weight distribution of the training data samples, i.e., assigning an initial weight to each training sample.

D_{1} = ⟨ w_{11}, \dots, w_{1 i}, \dots, w_{1 N} ⟩, w_{1 i} = \frac{1}{N}, i = 1, 2, \dots, N

(16)

(2): For t = 1, 2, 3, …, T, perform weight adjustment according to the following steps.

First, when training the tth weak predictor, train the training set containing the weight distribution and obtain the weak predictor

G_{t} (x)

.

Second, calculate the total prediction error Et for each weak predictor.

e_{t} = \sum_{i = 1}^{N} D_{t}^{i} e_{t}^{i} = P (|\frac{G_{t} (x_{i}) - y_{i}}{y_{i}}| > φ)

(17)

In the formula:

G_{t} (x)

is the predicted output;

y_{i}

is the expected output;

φ

is the set threshold.

Third, based on the calculated error and the weight coefficient

α_{t}

of the current weak classifier:

α_{t} = \frac{1}{2} \log \frac{1 - e_{t}}{e_{t}}

(18)

Fourth, update the weight distribution of the training set:

\{\begin{array}{l} D_{t + 1} = (w_{t + 1, 1}, \dots, w_{t + 1, j}, \dots, w_{t + 1, N}) \\ w_{t + 1, i} = \frac{w_{t i}}{Z_{t}} e^{- α_{t} y_{i} G_{t} (x_{i})}, Z_{t} = \sum_{i = 1}^{N} ω_{t i} e^{- α_{t} y_{i} G_{t} (x_{i})} \end{array}

(19)

In the formula:

Z_{t}

is the normalization factor.

(3): Finally, construct a linear combination of T weak predictors to form a strong predictor.

G (x) = sign (f (x)) = sign (\sum_{t = 1}^{T} α_{t} G_{t} (x))

(20)

3.3. ABKDE Algorithm Principle

Kernel density estimation (KDE) is a nonparametric method for estimating the probability density function (PDF) of a random variable from observed data. It assigns a kernel function—such as Gaussian or Epanechnikov—to each data point, sums their contributions, and produces a smooth estimate of the underlying distribution.

KDE depends on two parameters: the kernel function, which defines local influence, and the bandwidth, which controls the degree of smoothing. Small bandwidths may lead to overfitting by capturing noise, whereas large bandwidths may oversmooth and obscure important features.

In addition to marginal density estimation, KDE can estimate joint PDFs, enabling conditional density analysis and quantile-based inference. This capability supports evaluating predictor effects across the distribution and constructing smooth quantile regression curves without strict parametric assumptions.

For KDE-based quantile regression, kernel and bandwidth parameters are typically optimized using cross-validation to capture conditional structures effectively.

The general formula for KDE is:

f_{h} (x) = \frac{1}{n h} \sum_{i = 1}^{n} K (\frac{x - x_{i}}{h})

(21)

In the formula:

x_{1}

,

x_{2}

, …,

x_{n}

are sample data;

K (\cdot)

is the kernel function;

h

is the bandwidth parameter.

There are many types of KDE kernel functions, and the following are commonly used:

(1): Gaussian kernel function:

K (x) = \frac{1}{\sqrt{2 π}} \exp (- \frac{x^{2}}{2})

(22)

(2): Trigonometric kernel function:

K (x) = (1 - | x |) I (| x | \leq 1)

(23)

(3): Epanechnikov kernel function:

K (x) = \frac{3}{4} (1 - x^{2}) I (| x | \leq 1)

(24)

This study adopts an adaptive bandwidth strategy to enhance kernel density estimation (KDE) accuracy. Unlike traditional fixed-bandwidth KDE, which applies a uniform smoothing parameter to all observations, the adaptive approach adjusts the bandwidth according to local data density []. Specifically, bandwidth is defined as a function of the estimated density at each observation: smaller values are applied in dense regions to capture fine-scale variations, while larger values are used in sparse regions to maintain smoothness and reduce variance. This locally adaptive mechanism yields a more accurate representation of the underlying probability density function.

By alleviating over-smoothing in dense areas and overfitting in sparse areas, adaptive bandwidth KDE provides a robust, data-sensitive framework for nonparametric density estimation [].

3.4. BiLSTM-Adaboost-ABKDE Algorithm Process

(1): Data Preparation and Preprocessing:

To account for the temporal characteristics and environmental variability inherent in wind power data, systematic preprocessing is first applied to the raw dataset. In this study, Z-score standardization is employed to normalize environmental variables—such as wind speed and temperature—thereby eliminating dimensional disparities and facilitating subsequent model training and analysis.

X_{std} = \frac{X - μ}{σ}

(25)

where

X

is the original data,

μ

is the mean of the feature, and

σ

is the standard deviation.

To better capture the nonlinear characteristics and periodic variations in wind power, the model incorporates not only the original features such as wind speed and temperature, but also the cube of wind speed, the sine and cosine components of wind direction, and time-lagged features. Furthermore, recursive feature elimination is employed to select the most predictive subset of these features.

On this basis, the sliding window method is used to construct time series samples. The historical window length L and prediction step length H are defined, and the time series data is converted into an L × d-dimensional input matrix and an H-dimensional output vector, which facilitates training of the BiLSTM model:

X_{t} = [F_{t - L}, F_{t - L + 1}, \dots, F_{t - 1}]

(26)

Y_{t} = [P_{t}, P_{t + 1}, \dots, P_{t + H - 1}]

(27)

(2): Time Series Data Modeling:

The bidirectional long short-term memory (BiLSTM) network is an extension of the conventional LSTM architecture, designed to capture both past and future context within sequential data. In the context of wind power forecasting, BiLSTM is particularly effective in modeling long-term temporal dependencies, thereby enhancing prediction accuracy.

Specifically, BiLSTM employs two separate LSTM units. The forward LSTM processes the input sequence in chronological order, capturing information from the current and preceding time steps:

\vec{h_{t}} = σ (W_{\vec{f}} \cdot [x_{t}, \vec{h_{t - 1}}] + b_{\vec{f}})

(28)

The other is a backward LSTM unit, which processes the input sequence at the current moment and thereafter:

\overset{\leftarrow}{h_{t}} = σ (W_{\overset{\leftarrow}{f}} \cdot [x_{t}, \overset{\leftarrow}{h_{t + 1}}] + b_{\overset{\leftarrow}{f}})

(29)

Ultimately, the two are fused through a hyperbolic tangent activation function to form the final feature representation:

h_{t} = \tanh (W_{f} \cdot [\vec{h_{t}}, \overset{\leftarrow}{h_{t}}])

(30)

(3): Ensemble Learning and Accuracy Enhancement:

Within the BiLSTM-Adaboost framework, Adaboost integrates the outputs of multiple BiLSTM submodels through a weighted combination to produce the final prediction. Each BiLSTM submodel is trained with a focus on different subsets of the data at various stages, allowing the ensemble to capture diverse patterns and reduce prediction error. The aggregation of multiple submodels enhances robustness and improves overall forecasting accuracy.

Specifically: First, calculate the weighted average absolute error of the current model:

ϵ_{t} = \frac{\sum_{i = 1}^{N} D_{t} (i) \cdot | y_{i} - {\hat{y}}_{i} |}{\sum_{i = 1}^{N} D_{t} (i)}

(31)

Then calculate the submodel weights based on the error rate:

α_{t} = 0.5 \ln (\frac{1 - ϵ_{t}}{ϵ_{t}})

(32)

Final update sample weight:

D_{t + 1} (i) = \frac{D_{t} (i) \cdot \exp (α_{t} | e_{i} |)}{Z_{t}}

(33)

where

Z_{t}

is the normalization factor.

(4): Uncertainty modeling and probability density estimation:

After obtaining the integrated point prediction values, the ABKDE algorithm constructs an uncertainty model based on the training set prediction error samples: let

y_{t}

denote the actual wind power at time t and

{\overset{\land}{y}}_{t}

denote the corresponding integrated point prediction of the BiLSTM-Adaboost model; the prediction error is computed as

e_{t}

=

y_{t}

−

{\overset{\land}{y}}_{t}

, and the sample set {

e_{t}

for t = 1, …, N} is used as the input to the ABKDE procedure in Equations (34)–(36).

h_{0} = 1.06 {\hat{σ}}_{e} N^{- 0.2}

(34)

Then estimate the local density factor for each sample point:

h_{i} = h_{0} λ_{i}^{β}

(35)

Next, calculate the adaptive bandwidth, and finally use the Gaussian kernel function to generate the probability density function:

\hat{f} (p) = \frac{1}{N} \sum_{i = 1}^{N} \frac{1}{h_{i}} K (\frac{p - \hat{p} - e_{i}}{h_{i}})

(36)

The BiLSTM-Adaboost-ABKDE algorithm ultimately outputs a sequence of predicted wind power values for the next H time steps and their probability density distribution:

\hat{Y} = [{\hat{P}}_{t}, {\hat{P}}_{t + 1}, \dots, {\hat{P}}_{t + H - 1}]

(37)

{\hat{f}}_{P} (p) = \frac{1}{N} \sum_{i = 1}^{N} \frac{1}{h_{i} \sqrt{2 π}} \exp (- \frac{1}{2} {(\frac{p - \hat{p} - e_{i}}{h_{i}})}^{2})

(38)

Consequently, BiLSTM addresses the challenge of modeling temporal dependencies in time series data, while Adaboost enhances model robustness and predictive accuracy. Additionally, ABKDE quantifies the uncertainty associated with the forecast results. The integration of these three components offers significant advantages, effectively compensating for the limitations of individual algorithms. The overall algorithmic workflow is illustrated in Figure 5.

Figure 5. The framework of the proposed BiLSTM-Adaboost-ABKDE algorithm for wind power forecasting.

3.5. Model Training and Validation Process

(1): Model Architecture and Hyperparameter Settings: Each BiLSTM submodel has an input window length of L = 24 h and a prediction stride of H = 6 h.

The network comprises two BiLSTM layers: the first layer contains 64 hidden units, and the second layer contains 32 hidden units, with a single-value regression output. The Adam optimizer is employed, with an initial learning rate of 0.001 adjusted via learning rate decay. The batch size is set to 64, with 100 training epochs. Early stopping is enabled (patience = 10) to prevent overfitting. An ensemble of T = 10 BiLSTM submodels is formed, using a weighted update factor

α_{t} = \frac{1}{2} \ln \frac{1 - ε_{t}}{ε_{t}}

per iteration, where

ε_{t}

weighted error of the current submodel. ABKDE employs a Gaussian kernel function combined with an adaptive bandwidth selection strategy. Bandwidth adjustment is based on Silverman’s empirical rule and adapts to local data density:

h_{i} = h_{0} \cdot {(\frac{p_{i}}{\bar{p}})}^{- γ}, γ = 0.5

(39)

where

h_{0}

is the global bandwidth,

p_{i}

is the local density estimate, and

γ = 0.5

is the bandwidth adjustment coefficient.

In this network, the input window length L = 24 h and the prediction horizon H = 6 h were selected based on preliminary experiments and sensitivity analysis. When L is set to 24 h, the model effectively captures the daily wind speed fluctuation patterns. The prediction horizon of H = 6 h enables the model to forecast short-term power output while maintaining a balance between training time and prediction accuracy. Sensitivity analysis indicates that although increasing the value of L allows the model to capture more historical information, the prediction accuracy gradually stabilizes within a certain range. On the other hand, smaller values of L tend to lead to overfitting.

After conducting experiments with different batch sizes, it was found that a batch size of 64 achieves a good balance between computational efficiency and training stability.

Experimental results confirmed that increasing the number of sub-models contributes to improving the robustness and stability of the model. The final choice of T = 10 sub-models, combined with AdaBoost’s weighting mechanism, significantly enhances the model’s ability to identify difficult-to-predict samples, particularly in scenarios with extreme gusts and rapid wind speed changes.

(2): Training Process and Validation

To evaluate the model’s training effectiveness and generalization capability, experiments were conducted using training and validation sets. To evaluate the model’s performance and generalization ability, a structured data partitioning approach was employed, which divides the dataset into training, validation, and testing subsets. For the BiLSTM base learner, the same 70%/20%/10% split is used: the BiLSTM model is first trained on the 70% training subset and its hyperparameters are tuned on the 20% validation subset before being combined by the Adaboost ensemble, while the remaining 10% test subset is only used for the final performance evaluation.

The training set was used to optimize the model parameters by learning from historical data, while the validation set was utilized for hyperparameter tuning to mitigate overfitting. The testing set was reserved to assess the final predictive performance of the model on unseen data, ensuring an unbiased evaluation.

Furthermore, to enhance the robustness of the model, the validation and testing datasets were selected from different time periods, thus accounting for varying wind conditions and seasonal variations.

Figure 6 illustrates the performance evolution during training and validation.

Figure 6. Loss of training and validation sets over epochs.

Both training loss and validation loss stabilized after approximately 60 epochs, indicating that the model converged during training and did not exhibit overfitting.

To further evaluate the practicality of the proposed BiLSTM-Adaboost-ABKDE model, we compared its training and prediction process in terms of computation time and resource consumption with that of the standalone BiLSTM model. The experimental results are shown in Table 2.

Table 2. Comparison chart of computation time and resource consumption.

As shown in the table, the computational cost of the BiLSTM-Adaboost-ABKDE model is slightly higher than that of the standalone BiLSTM model, with training time and prediction time increasing by 10% and 22%, respectively (approximately 33 s and 4 s). However, despite the modest increase in computational cost, the hybrid model demonstrates a significant improvement in prediction accuracy, particularly under extreme weather conditions, showing superior stability and robustness. Especially in high-noise and small-sample wind power data environments, the BiLSTM-Adaboost-ABKDE model performs better in terms of robustness and generalization, with the average MAPE decreasing from 6.15% to 4.79%. These results suggest that, despite the increased computational cost, the hybrid model’s advantages in accuracy and stability are clear, making it a promising tool for practical wind farm risk assessment and scheduling applications.

4. Risk Assessment Indicators and Calculation Methods for Wind Farms Under Extreme Gusts

4.1. Wind Farm Risk Assessment Indicator System

To comprehensively characterize the operational risks that wind power integration may pose to the power system under extreme gust conditions, this paper builds upon traditional indicators such as voltage overlimit, power flow overlimit, load loss, and power shortage, and further introduces three key risk indicators reflecting the operational limit states of the power grid: load shedding risk (

R_{C U T}

), transmission line active power overlimit risk (

R_{L I N E}

), and node voltage overlimit risk (

R_{V}

). These three indicators, based on the severity of system fault responses, can more accurately reflect the boundary stability and resilience of system operation. Based on this, this paper proposes using a comprehensive risk indicator as the overall risk assessment indicator for wind farm system levels.

(1): Load shedding risk index ( $R_{C U T}$ ): When the power grid experiences insufficient power supply, load shedding may occur. The load shedding risk index can be calculated by weighting the probability and severity of load shedding under various conditions, as expressed by the following formula:

R_{CUT} = \sum_{k = 1}^{K} p_{W} (E_{k}) \cdot p_{g} (E_{k}) \cdot p_{line} (E_{k}) \cdot p_{r} (E_{k}) \cdot S_{ev - CUT} (E_{k})

(40)

Among them,

S_{ev - CUT} (E_{k})

is the severity of load shedding, defined as:

S_{ev - CUT} (E_{k}) = \frac{C_{c} (E_{k})}{P_{load}} \times 100 %

(41)

(2): Transmission line active power limit risk indicator ( $R_{LINE}$ ): If the operating power of the transmission line exceeds the safety limit under certain conditions, it may cause line overload or even tripping. The risk indicators are as follows:

R_{LINE} = \sum_{k = 1}^{K} p_{W} (E_{k}) \cdot p_{g} (E_{k}) \cdot p_{line} (E_{k}) \cdot p_{r} (E_{k}) \cdot S_{ev - LINE} (E_{k})

(42)

Among them:

S_{ev - LINE} (E_{k}) = \sum_{m = 1}^{M} \frac{\max (p_{m} (n) - p_{m, \max} (m), 0)}{e - 1}

(43)

(3): Node voltage limit risk indicator ( $R_{V}$ ): If the node voltage exceeds the set upper or lower limit, it will affect power quality and even trigger protection mechanisms. The risk indicator is defined as follows:

R_{V} = \sum_{k = 1}^{K} p_{W} (E_{k}) \cdot p_{g} (E_{k}) \cdot p_{line} (E_{k}) \cdot p_{r} (E_{k}) \cdot S_{e - V} (E_{k})

(44)

Among them, the severity function is:

S_{ev - V} (E_{k}) = \sum_{i = 1}^{N} \frac{e^{\max (0.9 - U_{i}, U_{i} - 1.0)} - 1}{e - 1}

(45)

(4): Comprehensive risk indicator ( $R_{CR}$ ): The final comprehensive risk indicator is calculated by weighting and combining the three types of risk mentioned above:

R_{CR} = ω_{1} \cdot R_{CUT} + ω_{2} \cdot R_{LINE} + ω_{3} \cdot R_{V}

(46)

Among them,

ω_{1}

,

ω_{2}

and

ω_{3}

are the weighting coefficients for each risk indicator.

In order to link the three risk indices to physical and operational constraints, the following settings are adopted in this study. For the load shedding risk index

R_{C U T}

, the severity at each time step is defined as the ratio between the involuntary load shedding and the total system load at that time. Accordingly,

R_{C U T}

is a dimensionless index with values between 0 and 1, and

R_{C U T}

= 0 corresponds to secure operation without any load shedding. For the transmission line active power limit risk index

R_{L I N E}

, the power flow limit of each branch is set equal to its rated active power

P_{i j, \max}

as given in the IEEE RTS-79 dataset.

R_{L I N E}

remains zero as long as the absolute value of the active power flow of a branch does not exceed its limit P_ij_,max, and starts to increase when this thermal limit is approached or violated. For the node voltage limit risk index

R_{V}

, the admissible per-unit voltage range [

V_{\min}

,

V_{\max}

] is taken as 0.9–1.1 p.u. for all buses, corresponding to the typical ±10% band around the nominal voltage used in transmission-system security analysis.

4.2. Risk Assessment Calculation Method

The BiLSTM-Adaboost-ABKDE model outputs two key dimensions of information: first, point forecasts of wind power output representing the most probable power output under given input conditions; second, the probability density distribution of power values, quantified by ABKDE (Adaptive Bandwidth Kernel Density Estimation) to capture prediction uncertainty. These outputs are coupled with the grid’s real-time operational status through Integrated Trend Analysis (ITA) and Time Series Probability Weighting (TPW) methods, respectively, to calculate various risk metrics.

4.2.1. Predictive Data Preprocessing and Grid Operation Status Simulation

When establishing a risk assessment system for wind farms under extreme gust conditions, the primary task is to effectively convert the predictive outputs of the BiLSTM-Adaboost-ABKDE model into input data suitable for grid analysis.

During the data preprocessing stage, we perform temporal alignment and spatial allocation on the wind power forecast values generated by the model. On the temporal dimension, the 6 h point forecast sequence

{\overset{\land}{p_{w} f (t)}}_{t = 1}^{6}

with 1 h intervals and its corresponding probability density function

{f ({\overset{\land}{p}}_{w} (t))}_{t = 1}^{6}

require interpolation to convert them into 15 min interval data matching the grid simulation step size.

Spatially, the total forecasted wind power is distributed to three wind farm interconnection nodes (BUS7, BUS14, and BUS18) according to a preset ratio, consistent with the network structure of the IEEE RTS-79 system. In this system, BUS7, BUS14, and BUS18 are representative load buses located in different areas of the network and connected to multiple transmission corridors, and are therefore adopted in this study as equivalent interconnection points for the aggregated wind farm, reflecting typical transmission-level locations where large-scale wind power would be integrated.

For grid modeling, we employ an improved random power flow calculation method to inject the processed forecast data into system analysis. Unlike traditional deterministic power flow, this method addresses forecast uncertainty through Monte Carlo simulation. The specific process is as follows: First, based on the probability density functions provided by ABKDE, 1000 equally probable wind power scenarios are generated for each time slice. Then, for each scenario, the following power balance equation is solved:

g (x) = p_{w i n d} + p_{g e n} - p_{l o a d} - p_{l o s s} (x) = 0

(47)

where

x = {[V, θ]}^{T}

represents the system state variables, including node voltage magnitudes and phase angles. Through large-scale scenario calculations, we obtain probability distributions for state variables such as node voltages and line powers, providing a statistical foundation for risk assessment.

4.2.2. Convergence Trend Analysis Method

The core of the Convergence Trend Analysis (ITA) method [] lies in utilizing point prediction results provided by the BiLSTM-Adaboost-ABKDE model, combined with historical operational data, to identify and quantify the dynamic evolution trends of system risk indicators under the influence of extreme gusts. This approach does not simply substitute predicted values into static formulas but instead constructs a time-varying probability distribution that integrates historical patterns with current predictive information, thereby achieving forward-looking perception of risk trends.

Specifically, for line power overlimit risk, the ITA method first constructs a time-varying probability density function

f_{I T A}^{l} (P, t)

. This is achieved by applying sliding window kernel density estimation (KDE) to the historical power data

[P_{l} (t - L), P_{l} (t - L + 1), \dots, P_{l} (t - 1)]

of line l and the point prediction sequence

[{\overset{\land}{P}}_{l} (t), {\overset{\land}{P}}_{l} (t + 1), \dots, {\overset{\land}{P}}_{l} (t + H)]

for the future forecast period.

For illustration, the formulation of the branch power overlimit indicator in ITA is given in Equation (48):

R_{lfA} = \{\begin{array}{l} 0, & P_{l} \leq P_{l}^{\max} \\ \int_{P_{l}^{-}}^{P_{l}} (P_{l} - P_{l}^{\max}) f (P_{l}) d P_{l}, & P_{l} > P_{l}^{\max} \end{array}

(48)

In the formula:

R_{lfA}

is the branch power limit indicator calculated by the ITA method;

f (P_{l})

is the probability distribution value obtained by fitting the historical data and predicted values of the power of the lth branch;

P_{l}

is the power value;

P_{l}^{\max}

is the maximum power of branch l.

4.2.3. Time-Series Probability Weighting Method

The Time-Probability Weighted Method (TPW) is a risk assessment approach based on probabilistic forecast results []. Its core concept is to fully leverage the comprehensive probabilistic forecast information provided by the ABKDE module, expanding risk assessment from a single “most probable scenario” to a “probability-weighted average of all possible scenarios.” This directly incorporates forecast uncertainty into the risk measurement framework.

Specifically, at each prediction time point t, ABKDE provides the probability density function

f (P_{w f} (t))

for wind power output at that moment. Through Monte Carlo simulation, we generate numerous wind power scenarios

{P_{w f}^{(s)} (t)}_{s = 1}^{N}

that follow this distribution. For each scenario s, deterministic power flow calculations are performed to obtain the risk indicator value

R_{f l o w}^{(s), l} (t)

for that system state. Subsequently, the comprehensive risk at time t is characterized by calculating the arithmetic mean of the risk indicators across all scenarios at that time slice. Similarly, taking the power overlimit indicator of a branch in the system as an example, as shown in the equation:

r_{l t} = \{\begin{array}{l} 0, & P_{l} \leq P_{l}^{\max} \\ \int_{P_{l}^{\max}}^{P_{l}} (P_{l} - P_{l}^{\max}) f (P_{l}) d P_{l}, & P_{l} > P_{l}^{\max} \end{array}

(49)

R_{T P W} = \sum_{t = 1}^{n} r_{k} * (1 - \frac{t - 1}{T})

(50)

In the formula:

r_{l t}

represents the branch power overlimit risk assessment index for branch l at time t; T denotes the total number of prediction time points;

R_{T P W}

indicates the branch power overlimit index for this branch calculated using the TPW method.

In practical implementation, the predictive probability distribution of wind farm power obtained from ABKDE is first used to generate a set of wind power scenarios, then the random power flow model is run for each scenario to compute the corresponding values of branch power, bus voltage and load shedding, and finally the TPW method aggregates these scenario-wise risk values into the time-varying risk indices, so that the forecast uncertainty quantified by ABKDE is directly propagated to

R_{C U T}

,

R_{L I N E}

and

R_{V}

.

4.3. Weighted Algorithm for Wind Farm Risk Assessment Indicators

To enhance the accuracy of wind farm risk assessment under extreme gust conditions, this study proposes an evaluation framework based on dynamic weighted fusion. By deeply integrating the Integrated Trend Analysis (ITA) and Time Series Probability Weighting (TPW) methods, it fuses three risk indicators—load curtailment risk, line power overlimit risk, and node voltage overlimit risk—into a unified system-level risk metric.

4.3.1. Risk Indicator Calculation Method Based on Model Output

At the risk indicator calculation level, this study strictly adheres to the risk assessment indicator system defined in Section 4.1, deeply coupling the wind power prediction values output by the BiLSTM-Adaboost-ABKDE model with the grid operational state. Specifically, the model-predicted wind power sequences (including point estimates and probability distributions) serve as inputs. Through time-series power flow calculations using an enhanced IEEE RTS-79 system, the dynamic operational state of the system is obtained, and various risk indicators are calculated based on this state.

4.3.2. Theoretical Basis and Implementation of ITA and TPW Weighted Fusion

The ITA method extracts the evolutionary trend of system risk by constructing a joint probability distribution between historical data and forecast results. The specific implementation includes: selecting historical operational data from the most recent 24 time-slices, combining it with power forecasts for the next 6 h, and employing adaptive bandwidth kernel density estimation to construct a time-varying probability density function. The TPW method focuses on handling instantaneous risk fluctuations caused by forecast uncertainty. Its technical approach involves: generating 1000 equally probable scenarios based on the probability distribution output from ABKDE, performing deterministic power flow calculations for each scenario, and aggregating the expected risk values at each time step using a time-decay weighting function. The time weighting function adopts an exponential decay form:

ϖ (t) = \frac{e^{- λ (t - 1)}}{\sum_{t = 1}^{H} e^{- λ (t - 1)}}

(51)

The attenuation coefficient

λ = 0.4

was determined through optimization using historical data.

During the weighted fusion process, due to differences in data scale and distribution characteristics between the two methods, the prediction results are first normalized:

R_{ITA, norm} (i) = \frac{R_{ITA} (i)}{\max (R_{ITA})}

(52)

R_{TPW, norm} (i) = \frac{R_{TPW} (i)}{\max (R_{TPW})}

(53)

Based on the normalized results, weight coefficients are calculated through a dynamic weight allocation mechanism:

S_{ITA} = \frac{\sum R_{ITA, norm}}{\sum R_{ITA, norm} + \sum R_{TPW, norm}}

(54)

S_{TPW} = \frac{\sum R_{TPW, norm}}{\sum R_{ITA, norm} + \sum R_{TPW, norm}}

(55)

In the formula,

S_{ITA}

and

S_{TPW}

are the weighting coefficients for the ITA and TPW methods, respectively. Based on the above coefficients, the integrated single risk indicator can be calculated as follows:

R_{i}^{fused} = S_{ITA} \cdot R_{ITA, norm} (i) + S_{TPW} \cdot R_{TPW, norm} (i)

(56)

Finally, to construct a system-level comprehensive risk indicator, the Weighted Sum Method (WSM) is employed to synthesize the various fused individual risk indicators, defined as follows:

R_{C R} = ϖ_{1} \cdot R_{C U T}^{fused} + ϖ_{2} \cdot R_{L I N E}^{fused} + ϖ 3 \cdot R_{V}^{fused}

(57)

In the formula,

ϖ_{1}

,

ϖ_{2}

and

ϖ_{3}

represent the weighting coefficients for each category of risk indicators.

In order to examine the robustness of the comprehensive risk indicator with respect to these weighting coefficients, a simple sensitivity check was carried out in the case study: the weights were perturbed within a reasonable range while being renormalized to sum to one, and the resulting

R_{C R}

time series showed changes in absolute magnitude but preserved the main temporal patterns and the relative ordering of risk levels across operating conditions, indicating that the qualitative conclusions drawn from CRR are not overly sensitive to moderate variations in the weights.

Considering the impact of λ and ω on the final risk assessment results, we employed a Bayesian optimization approach to improve the accuracy and stability of the model. The core idea is as follows: In the experiments, λ and ω were searched within the range [0, 1]. Then, the Bayesian optimization algorithm was used to minimize the MAPE (Mean Absolute Percentage Error), with the goal of finding the optimal parameter combination. During the optimization process, cross-validation was introduced to ensure that the optimized values of λ and ω had good generalization ability. We divided the dataset into a training set (70%), a validation set (20%), and a test set (10%), and applied K-fold cross-validation during the training process, calculating the MAPE for each fold to ensure consistency in the MAPE results across different folds.

Figure 7 shows the impact of different combinations of λ and ω on the MAPE. Through Bayesian optimization, we found that when λ = 0.4 and ω = 0.3, the model achieved the lowest MAPE, thus validating the effectiveness of the optimization process.

Figure 7. Impact of λ and ω on MAPE.

4.4. Wind Farm Risk Assessment Process

The wind farm operation risk assessment process developed in this study is shown in Figure 8. The overall process consists of three stages: power prediction under extreme gust disturbance conditions, probabilistic modeling, and multi-dimensional risk indicator assessment. The specific steps are as follows:

Figure 8. Research flowchart of the wind farm operational risk assessment method under extreme gust conditions.

(1): Data initialization and sample construction: Historical meteorological data (wind speed, wind direction, temperature) and operational data (wind power) from the wind farm are collected and subjected to normalized standard processing. A sliding window is used to construct the time series input sample set $X \in R^{T \times d}$ , which is utilized for subsequent model training.
(2): BiLSTM-Adaboost modeling and prediction enhancement: BiLSTM captures the bidirectional time-dependent characteristics of wind power data and constructs multiple submodels. Adaboost dynamically adjusts the weights of the submodels to enhance adaptability to small samples and extreme disturbances, ultimately outputting the wind power prediction value ${\hat{P}}_{t}$
(3): ABKDE probability modeling: Based on the point prediction values output by BiLSTM-Adaboost, kernel density estimation is performed on the residuals to calculate the confidence interval $[{\hat{P}}_{t}^{low}, {\hat{P}}_{t}^{up}]$ and probability density distribution $f ({\hat{P}}_{t})$ of the predicted power, thereby quantifying the uncertainty of the prediction.
(4): System state variable assessment: Predictive power is input into the IEEE RTS-79 node system for power flow simulation to obtain key operating state variables, including node voltage $U_{i}$ , branch power flow $P_{i j}$ , load balance $L_{i}$ , etc.
(5): Multi-dimensional risk indicator calculation: Based on system state variables, a risk assessment indicator system is constructed, including:

Voltage limit risk : R_{vol} = \frac{N_{out}^{U}}{N_{total}^{U}}

Power flow limit risk : R_{flow} = \frac{N_{out}^{P}}{N_{total}^{P}}

Load shedding risk : R_{load} = \frac{L_{unserved}}{L_{total}}

(6): Risk Indicator Trend Assessment (ITA): Analyze the current trends of various risk indicators and their historical means and variances, calculate trend deviation degrees $Δ R_{vol}$ , $Δ R_{flow}$ , and $Δ R_{load}$ , and quantify the degree of deviation in indicator stability.
(7): Time-weighted risk indicator (TPW): Based on the risk probability at each time step in the prediction interval, set a decreasing weight function $w_{t} = e^{- λ (T - t)}$ and calculate the weighted risk score: $R_{tpw} = \sum w_{t} \cdot R_{t}$ .
(8): Fusion processing and weighted output: The ITA results and TPW results are normalized and fused to obtain a comprehensive risk score vector:

R_{final} = α \cdot R_{ita} + (1 - α) \cdot R_{tpw}

(9): Risk status assessment and feedback mechanism: Assess the integrated risk score results. If the score exceeds the set threshold, trigger a “high risk” alert; otherwise, update the historical sequence of status indicators and enter the next assessment cycle.

5. Experimental Results and Analysis

This study uses the IEEE RTS-79 power system as the test case for validation (Figure 8). Widely employed in studies on dispatch optimization, power flow analysis, and operational risk assessment, the RTS-79 system consists of 24 load buses, 38 transmission lines, and 32 generators, with voltage levels organized into two tiers: 230 kV and 138 kV (Figure 9).

Figure 9. Flow chart of coordinated control of PV-based energy storage and black start.

To simulate renewable energy integration, wind farms are placed at buses 7 (300 MW), 14 (150 MW), and 18 (80 MW). Their output profiles are generated from real wind speed data, enabling the modeling of dynamic disturbance effects of wind power in a typical grid environment.

5.1. Dataset Description and Preprocessing

The dataset used in this paper originates from a wind farm in Hebei Province, China. Data collection spanned from 1 March 2020, to 31 May 2023, with a sampling interval of one hour, yielding approximately 3500 data records. The dataset was divided into training, validation, and test sets in a 7:2:1 ratio to ensure temporal continuity and prevent forward information leakage. Data was collected in real-time via the wind farm’s SCADA (Supervisory Control and Data Acquisition) system, encompassing typical meteorological conditions in the region, such as springtime severe convective weather and winter temperature inversions.

The experimental environment configuration is as follows: Operating system: Windows 11; CPU: Intel^® Core™ i7-14650HX 2.20 GHz; GPU: NVIDIA GeForce RTX 4050; Python version: 3.9.21; TensorFlow version: 2.10.0; CUDA version: 11.2.

To enhance data diversity and validate model robustness, this study incorporates the wind energy dataset provided by NREL (National Renewable Energy Laboratory). This dataset contains high-frequency wind speed and energy data from 2019 to 2020, resampled to match the 1 h resolution of the primary dataset. Cross-validation ensures the model’s adaptability and robustness under varying wind resource conditions.

5.2. Wind Power Forecasting Results

To evaluate the effectiveness of the proposed BiLSTM-Adaboost-ABKDE algorithm, this study conducted hourly wind power forecasting experiments using actual wind speed and power output data over a specific timeframe. The data originated from a typical wind farm in Hebei Province, utilizing 100 consecutive hours of historical data from spring (a season characterized by frequent gusts). The data featured a 1 h temporal resolution and primarily focused on the active power output of the wind farm connected to nodes 7, 14, and 18. Considering the impact of extreme gusts on power fluctuations, this study specifically selected periods with more intense wind speed variations for experimentation.

Figure 10 first presents the forecasting results and predictive uncertainty of the proposed model on this representative test segment. The horizontal axis denotes the sampling index and the vertical axis denotes the corresponding normalized wind power output. In Figure 8, the red markers represent the measured values, the black solid line represents the predictive mean of the model, and the shaded regions from dark to light correspond to the 50%, 75% and 95% prediction intervals derived from the ABKDE-estimated predictive distribution at each sampling point. Most observations fall within the 95% band and the majority lie inside the 75% band, indicating that the prediction intervals are reasonably calibrated.

Figure 10. Wind power forecasts with prediction intervals.

Furthermore, to validate the performance of the proposed model, the following representative models were selected as baselines for comparative experiments:

GRU (Gated Recurrent Unit): A lightweight recurrent neural network architecture capable of capturing short-term dependencies, but prone to convergence instability when handling highly volatile samples.

Transformer: An attention-based model suitable for long-sequence modeling, but prone to overfitting in small-sample scenarios.

GAN-based model (Generative Adversarial Network-based time series prediction): Primarily used for data augmentation and uncertainty modeling, but susceptible to noise interference that degrades model performance.

BiLSTM-Adaboost (without ABKDE): A combination of bidirectional temporal learning and ensemble weighted prediction, used to validate the contribution of the ABKDE algorithm in uncertainty modeling.

In comparative experiments, all models employed identical input features, sample partitioning, and hyperparameter search strategies. Predictive performance was quantified using RMSE (Root Mean Square Error) and MAPE (Mean Absolute Percentage Error) metrics to comprehensively evaluate different models’ capabilities in wind power forecasting.

Figure 11 further compares the proposed BiLSTM–Adaboost–ABKDE algorithm with Gated Recurrent Unit (GRU), Transformer and GAN-based models in terms of hourly prediction accuracy and dynamic response performance across a representative 100 h interval. Overall, all models effectively tracked the general fluctuation trends of wind power output. However, significant differences emerged in handling local abrupt changes, such as the rapid wind power increase observed between hours 58 and 61.

Figure 11. Multi-model prediction comparison chart.

Particularly during the rapid wind power increase phase, while the GRU model smoothly tracked power changes, its inherent strong short-term dependencies caused prediction curves to lag when confronting sudden wind speed shifts, failing to capture abrupt features in a timely manner. The Transformer model, while suitable for long-sequence modeling and capable of capturing long-term dependencies through its attention mechanism, is prone to overfitting in small-sample scenarios. This compromises prediction accuracy, leading to substantial errors during specific time periods. The GAN-based model, though effective at enhancing data diversity and modeling uncertainty, suffers from unstable oscillations in its predictions due to susceptibility to noise interference, particularly under extreme wind speed fluctuations.

In contrast, the proposed BiLSTM-Adaboost-ABKDE model rapidly responds to sudden wind speed changes while maintaining smooth and accurate prediction trajectories across the dynamic range of wind power output. This model employs Bidirectional Long Short-Term Memory (BiLSTM) to capture nonlinear dynamic variations in wind power output. Combined with the AdaBoost weighting mechanism, it enhances adaptability to hard-to-predict samples. Utilizing the ABKDE algorithm for quantifying prediction uncertainty modeling, it effectively reduces noise interference, demonstrating superior generalization performance and convergence stability under conditions of small sample size and high data volatility.

5.2.1. Predictive Performance Evaluation Metrics

In the preceding section, the fitting performance of each model during typical periods was evaluated using point predictions. To quantitatively assess prediction accuracy and interval reliability, a standard set of error metrics is employed: Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE), Prediction Interval Coverage Probability (PICP), and Prediction Interval Normalized Average Width (PINAW). These metrics enable a detailed interval-based comparison of the models, thereby validating the accuracy and practical applicability of the proposed method. The definitions are as follows:

MAPE = \frac{1}{n} \sum_{t = 1}^{n} |\frac{y_{t} - {\hat{y}}_{t}}{y_{t}}| \times 100 %

(58)

RMSE = \sqrt{\frac{1}{n} \sum_{t = 1}^{n} {(y_{t} - {\hat{y}}_{t})}^{2}}

(59)

PICP = \frac{1}{n} \sum_{t = 1}^{n} I (y_{t} \in [{\hat{y}}_{t}^{L}, {\hat{y}}_{t}^{U}])

(60)

PINAW = \frac{1}{n R} \sum_{t = 1}^{n} ({\hat{y}}_{t}^{U} - {\hat{y}}_{t}^{L})

(61)

Among them,

y_{t}

is the actual value,

{\hat{y}}_{t}

is the predicted value,

{\hat{y}}_{t}^{L}

and

{\hat{y}}_{t}^{U}

are the lower and upper confidence limits of the prediction, respectively, R is the data value range, and

I (\cdot)

is the indicator function.

Table 3 summarizes the performance comparison of the proposed BiLSTM–Adaboost–ABKDE model with three baseline models (GRU, Transformer, and GAN-based model) under different confidence levels (50%, 75%, and 95%). The evaluation metrics include MAPE (Mean Absolute Percentage Error), RMSE (Root Mean Square Error), PICP (Prediction Interval Coverage Probability), and PINAW (Prediction Interval Normalized Average Width), which comprehensively reflect both point prediction accuracy and interval prediction reliability. Definitions of these metrics are provided in Table 1.

Table 3. Prediction Performance Evaluation Metrics.

Specifically, at the 95% confidence level, the model achieves an RMSE of 7.11, reducing the error by 31.2% compared to GRU (10.34), 22.9% compared to Transformer (9.22), and 11.3% compared to GAN-based model (8.02). The MAPE also drops to 4.32%, which is 36.9% lower than GRU and 29.5% lower than Transformer. This trend holds at both 75% and 50% confidence levels, showing that the proposed model maintains stable prediction accuracy under various uncertainty levels.

Regarding interval prediction, the proposed model achieves the highest PICP (96.36%), indicating better coverage of true wind power values in the prediction intervals, and the smallest PINAW (7.59%), suggesting narrower and more reliable prediction intervals compared to other models. This means that the proposed model strikes a good balance between high coverage and minimal uncertainty in its predictions.

In extreme gust conditions, the BiLSTM–Adaboost–ABKDE model shows less fluctuation in prediction errors, indicating lower variance and better robustness against noise compared to other models. This is attributed to the synergy of BiLSTM capturing the temporal dependencies, Adaboost enhancing model robustness through ensemble learning, and ABKDE effectively modeling the uncertainty of prediction errors.

Furthermore, the model exhibits better performance with fewer data and high volatility, confirming its strong generalization ability in small-sample and high-uncertainty scenarios. The GAN-based model, while capable of generating strong predictions in some cases, suffers from instability and poor convergence under extreme gusts due to its reliance on data distribution learning.

5.2.2. Paired t-Test

To further validate the performance improvement of the proposed BiLSTM-Adaboost-ABKDE model in wind power forecasting, we conducted a paired t-test to determine if the improvements were statistically significant. The paired t-test is a statistical method used to compare the mean differences between two related samples. In this study, we used the paired t-test to compare the prediction performance of the BiLSTM-Adaboost-ABKDE model with that of the GRU, Transformer, and GAN models.

The paired t-test calculates the mean and standard deviation of the differences between the two sets of samples to obtain the t-value. Then, based on the t-value, the p-value is computed, which is used to assess the statistical significance of the difference between the models.

t = \frac{\bar{d}}{\frac{s_{d}}{\sqrt{n}}}

(62)

where

\bar{d}

represents the mean of the differences between the two sets of samples,

s_{d}

represents the standard deviation of the differences, and n denotes the number of samples.

For the GRU, Transformer, and GAN models, the p-values from the paired t-test are 0.03, 0.04, and 0.02, respectively. As shown in Table 4, these results indicate that, at the 95% confidence level, the performance of the BiLSTM-Adaboost-ABKDE model is significantly superior to that of the other baseline models. Specifically, the BiLSTM-Adaboost-ABKDE model achieves a reduction of approximately 5% in the MAPE metric for wind power forecasting compared to the GRU, Transformer, and GAN models. This significant difference demonstrates the advantages of the proposed method.

Table 4. Paired t-test results.

5.3. Calculation of Operational Risk Indicators

5.3.1. Simulation Analysis of Dynamic Evolution Under Extreme Gust Disturbances

The predicted hourly wind speed sequence is employed as the driving variable for a multi-stage, temporally resolved risk assessment of system operation under extreme gust disturbances. Three representative wind power grid connection nodes—BUS14 (regional coupling point), BUS15 (main load node), and BUS16 (peripheral endpoint)—are analyzed. The temporal evolution of the branch power overlimit risk indicator is assessed using rolling time windows.

Branch Circuit Power Limit Risk

Figure 12 shows the branch circuit power limit risk, and the results indicate:

Figure 12. Branch circuit power limit risk.

(1): The BUS14 node consistently exhibits the lowest risk values with minimal fluctuations, reflecting strong voltage regulation and power flow diversion capabilities. It can thus serve as a stable anchor point for system operation, demonstrating high robustness against disturbances.
(2): The BUS15 node shows pronounced volatility, with a notable peak of 0.29 between the 6th and 10th hours, primarily due to high load density and weak voltage support, making it more susceptible to risks during severe wind speed fluctuations.
(3): The BUS16 node remains in a persistently high-risk state, reaching up to 0.31 in certain intervals. This sustained risk is attributed to its peripheral network position, limited voltage support, and insufficient regulatory resources, which hinder effective short-term mitigation during wind power injections in edge regions.

Voltage Limit Risk

Figure 13 shows the Voltage limit risk, and the results indicate:

Figure 13. Voltage limit risk.

(1): BUS14, the system’s central hub, exhibits stable variations with a “slow start–rapid response” risk pattern. At the 15th hour, intensified wind disturbances raise the risk index to 0.13, indicating strong voltage regulation capacity but notable boundary sensitivity.
(2): BUS15 follows a three-phase risk profile: trough (<0.04, 0–10 h), surge (peak 0.12, 10–15 h), and stabilization. The surge results from combined wind and load disturbances, underscoring the need for enhanced voltage support.
(3): BUS16 maintains the lowest overall risk, starting high (>0.08, first 5 h) before converging to ~0.03 as self-balancing power flows restore stability.
(4): Risk propagation initiates at edge node BUS16 and advances inward to BUS14 and BUS15, illustrating a typical disturbance transmission pattern.

Load Shedding Risk

Figure 14 shows the Load shedding risk, and the results indicate:

Figure 14. Load shedding risk.

(1): All risk indicators stay below 0.08, indicating sufficient load redundancy under extreme wind disturbances and a well-maintained supply–demand balance.
(2): BUS14 experiences a sharp risk drop at the 10th hour (0.06 → 0.015), forming a “single-cycle collapse” caused by delayed dispatch response, which temporarily widens the supply gap before rapid recovery.
(3): BUS15 remains within 0.04–0.07 with minimal fluctuations, reflecting steady-state adjustment. Its central position provides multiple control pathways and high load redundancy, effectively absorbing volatility and supporting higher shares of variable generation.
(4): BUS16 exhibits a “sawtooth” pattern, with minor declines at 5–10 h and 17–22 h, driven by branch fluctuation propagation and boundary effects. While no direct imbalance occurs, weak coupling of voltage support and load regulation increases cumulative vulnerability.

5.4. Dynamic Response Sensitivity Analysis and Comparison Based on Multidimensional Risk Assessment

To fully reveal the similarities and differences in the responses of multi-risk assessment methods during system disturbances, this paper further compares the response patterns of branch power overlimit, voltage overlimit, load loss, and three risk indicators in typical time sequence disturbance scenarios. The results are shown in Figure 15.

Figure 15. Node voltage limit risk, branch power limit risk, load shedding risk.

As shown in Figure 15, operational risk indicators from different methods exhibit distinct temporal patterns, reflecting variations in dynamic sensitivity and trend tracking. For branch power overlimit risk, ITA produces sharper peaks during sudden disturbances (6–8 h, 18–20 h), while TPW remains steadier in 12–14 h, indicating respective strengths in early-event detection and long-term evolution tracking. Weighted fusion integrates both, yielding smoother peak transitions and stronger overall representation.

In node voltage overlimit risk, ITA captures early jumps (0–6 h) and TPW emphasizes later fluctuations (14–22 h); fusion reveals delayed peak positions, indicating phase-specific temporal lags. For load loss risk, fusion balances ITA’s rapid response with TPW’s sustained tracking, enhancing regulation under high load-loss conditions. For expected outage duration, ITA is more responsive to early uncertain disturbances, whereas TPW is more stable during system recovery.

5.5. Quantitative Statistical Analysis of Risk Indicators

For each indicator, the mean value, standard deviation and peak-to-average ratio (PAR) were calculated from the same time series used in the risk plots. The results are shown in Table 5.

Table 5. Statistical characteristics of risk indicators.

These numerical results are consistent with the qualitative findings.

R_{C U T}

exhibits the lowest variability, whereas

R_{L I N E}

and

R_{V}

show larger fluctuation ranges, indicating higher sensitivity to power-flow and voltage-stability disturbances during extreme gust periods.

To further demonstrate the computational efficiency of the model, this paper divides the computation time into two parts: model training time and risk assessment computation time (Table 6).

Table 6. Computational time of different risk assessment methods.

Training Time: This refers to the time required for the training process of the BiLSTM-Adaboost-ABKDE algorithm, i.e., the time the model spends learning from the training data.

Risk Computation Time: This is the time required for risk assessment after the model completes its predictions, specifically including the calculation of the wind farm’s risk metrics based on the predicted results.

In this study, the training time of the BiLSTM-Adaboost-ABKDE model was 204.43 s, the risk assessment computation time was 22.13 s, and the total time was 233.65 s. Although the model incorporates the Adaboost and ABKDE modules, which increase its complexity, the computational efficiency remains within a reasonable range. In comparison, the traditional Monte Carlo simulation method had a total computation time of 330.18 s, which is significantly higher than that of the BiLSTM-Adaboost-ABKDE model. This indicates that the proposed BiLSTM-Adaboost-ABKDE method has a significant advantage in computational efficiency while also providing higher performance in terms of accuracy. Particularly in practical applications of wind power forecasting and grid risk assessment, the BiLSTM-Adaboost-ABKDE model not only effectively reduces computational costs but also ensures high-precision predictions under extreme conditions, making it more broadly applicable.

6. Conclusions

This study explores the challenges of evaluating wind farm operational risks under extreme gust conditions, primarily including issues such as model insufficiency, the difficulty in balancing accuracy and uncertainty, and the limitations of evaluation perspectives. To address these challenges, this paper proposes a wind power forecasting and multidimensional risk assessment model based on the BiLSTM-Adaboost-ABKDE algorithm framework. The model’s generalization ability and robustness were validated through performance comparisons on the training, validation, and test datasets. Experimental results show that the model exhibits stable RMSE and MAPE values under high-volatility wind speed conditions, demonstrating its adaptability and stability in complex environments.

Moreover, the ensemble learning and adaptive bandwidth mechanism effectively mitigate the impact of data sparsity and sudden fluctuations, showcasing the model’s robustness under small sample sizes and large fluctuations. Compared to traditional BiLSTM, GRU, Transformer, and GAN models, the BiLSTM-Adaboost-ABKDE model shows significant advantages in wind power forecasting, particularly under high volatility and extreme gust conditions. It not only reduces prediction errors but also maintains a high prediction interval coverage. The experimental results validate the model’s superiority and robustness in small-sample and high-volatility data scenarios.

The innovation of this study lies in the introduction of the ITA-TPW weighted fusion method, which not only enhances the temporal feature extraction capability of the BiLSTM model but also addresses the challenges faced by traditional methods in handling wind power forecasting accuracy and risk assessment under extreme gust conditions. The ITA (Integrated Trend Analysis) method captures rapid wind speed variations more accurately by analyzing the time-series trends of wind farm data, while TPW (Time-Series Probability Weighting) effectively handles high-noise data through a weighting mechanism, ensuring the model maintains high stability and accuracy under extreme weather conditions. The introduction of this weighted fusion method enables the model to provide not only wind power forecasts but also quantify prediction uncertainty, offering more comprehensive risk assessment information, thus supporting grid scheduling and wind farm optimization. Quantitatively, on the Hebei case study the proposed method improves the accuracy of the load shedding risk and voltage overlimit risk indicators by 6.15% and 4.79%, respectively, and reduces the total computation time to 233.65 s (204.43 s for model training and 22.13 s for risk assessment), which is lower than the 330.18 s required by the Monte Carlo simulation benchmark.

Although the experimental results show that the model performs well in short-term wind power forecasting, there are still some limitations. First, the experimental dataset is solely from Hebei Province, and it only focuses on short-term wind power forecasting. Although the model performs excellently on this dataset, its applicability in other regions and long-term forecasting scenarios remains unverified. Therefore, future research should incorporate wind power data from different geographic regions and seasons to assess the model’s adaptability and universality under diverse climate conditions. Additionally, although this study focuses on short-term wind power forecasting, future applications could further extend to multi-timescale forecasting, particularly medium- and long-term predictions. By integrating real-time wind speed, wind direction data, and other information, the model’s accuracy and reliability could be further enhanced. Given the model’s strong scalability, future optimization should include improvements to the network architecture, increasing data dimensions, and introducing online learning algorithms to enhance the model’s adaptability and real-time response capability in large-scale wind farms.

Overall, the BiLSTM-Adaboost-ABKDE model proposed in this paper provides solid support for wind farm risk assessment and grid scheduling. However, further optimization and validation are needed for its scalability in cross-regional and long-term forecasting scenarios to ensure its effectiveness in practical applications.

Author Contributions

Methodology, K.S.; software, Y.Y. and J.W.; validation, M.S., Q.L. and Z.D.; formal analysis, M.S.; investigation, Y.Y.; resources, M.S.; data curation, J.W.; writing—original draft preparation, M.S.; writing—review and editing, Y.Y.; visualization, J.W.; supervision, J.W.; project administration, Z.D.; funding acquisition, Q.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by State Grid Corporation, funded by the Science and Technology Project of State Grid Beijing Electric Power Company (grant number: 520216240001).

Data Availability Statement

The data supporting this study are not publicly available due to participant confidentiality restrictions.

Acknowledgments

The authors gratefully acknowledge the anonymous reviewers for their valuable feedback and expert recommendations.

Conflicts of Interest

Authors Ke Shang, Yanbin Yang, Jinsheng Wang, and Meng Shuai were employed by the State Grid Beijing Electric Power Company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Ye, L.; Li, Y.; Pei, M.; Li, Z.; Xu, X.; Lu, J. Short-Term Wind Power Combined Prediction under Cold Wave Small-Sample Conditions. Proc. CSEE 2023, 43, 543–555. [Google Scholar]
Li, Z.; Xu, Y.; Wu, L.; Zheng, X. A Risk-Averse Adaptively Stochastic Method for Multi-Energy Ship Operation under Diverse Uncertainties. IEEE Trans. Power Syst. 2020, 36, 2149–2161. [Google Scholar] [CrossRef]
Ji, X.; Zhao, G.; Ye, P.; Meng, X.; Yang, M.; Zhang, Y. Short-Term Photovoltaic Power Prediction Method Based on QMD-HBiGRU. High Volt. Eng. 2024, 50, 3850–3859. [Google Scholar]
Meng, Q.; Jin, X.; Luo, F.; Wang, Z.; Hussain, S. Distributionally Robust Scheduling for Benefit Allocation in Regional Integrated Energy System with Multiple Stakeholders. J. Mod. Power Syst. Clean Energy 2024, 12, 1631–1642. [Google Scholar] [CrossRef]
Yang, N.; Xu, G.; Fei, Z.; Li, Z.; Du, L.; Guerrero, J.M.; Huang, Y.; Yan, J.; Xing, C.; Li, Z. Two-Stage Coordinated Robust Planning of Multi-Energy Ship Microgrids Considering Thermal Inertia and Ship Navigation. IEEE Trans. Smart Grid 2025, 16, 1100–1111. [Google Scholar] [CrossRef]
Meng, Q.; Xu, J.; Ge, L.; Wang, Z.; Wang, J.; Xu, L.; Tang, Z. Economic Optimization Operation Approach of Integrated Energy System Considering Wind Power Consumption and Flexible Load Regulation. J. Electr. Eng. Technol. 2024, 19, 209–221. [Google Scholar] [CrossRef]
Wu, Y.; Huang, C.; Wu, S.; Hong, J.-S.; Chang, H.-L. Deterministic and Probabilistic Wind Power Forecasts by Considering Various Atmospheric Models and Feature Engineering Approaches. IEEE Trans. Ind. Appl. 2023, 59, 192–206. [Google Scholar] [CrossRef]
Niu, D.; Wang, K.; Sun, L.; Wu, J.; Xu, X. Short-Term Photovoltaic Power Generation Forecasting Based on Random Forest Feature Selection and CEEMD: A Case Study. Appl. Soft Comput. 2020, 93, 106389. [Google Scholar] [CrossRef]
Fujimoto, Y.; Takahashi, Y.; Hayashi, Y. Alerting to Rare Large-Scale Ramp Events in Wind Power Generation. IEEE Trans. Sustain. Energy 2019, 10, 55–65. [Google Scholar] [CrossRef]
Zhou, B.; Duan, H.; Wu, Q.; Wang, H.; Or, S.W.; Chan, K.W.; Meng, Y. Short-Term Prediction of Wind Power and Its Ramp Events Based on Semi-Supervised Generative Adversarial Network. Int. J. Electr. Power Energy Syst. 2021, 125, 106411. [Google Scholar] [CrossRef]
Huang, Q.; Yan, N.; Zhong, X. Wind Power Ramp Prediction Based on Generative Adversarial Network. Acta Energiae Solaris Sin. 2023, 44, 226–231. [Google Scholar]
Sun, H.; Wan, C.; Cao, Z.; Li, Y.; Ju, P. Short-Term Load Probability Forecasting Based on Conditional Generative Adversarial Network Curve Generation. Autom. Electr. Power Syst. 2023, 47, 189–199. [Google Scholar]
Zheng, Y.; Li, X.; Chen, Y.; Zhao, Y. Short-Term Wind Power Forecasting under Extreme Weather Based on Stacking Multi-Model Fusion. High Volt. Eng. 2024, 50, 3871–3882. [Google Scholar]
Meng, A.; Zhang, H.; Yin, H.; Xian, Z.; Chen, S.; Zhu, Z.; Zhang, Z.; Rong, J.; Li, C.; Wang, C.; et al. A Novel Multi-Gradient Evolutionary Deep Learning Approach for Few-Shot Wind Power Prediction Using Time-Series GAN. Energy 2023, 283, 129139. [Google Scholar] [CrossRef]
Gong, W.; Cheng, Z.; Zhang, C.; Ma, P. Research on Short-Term Heat Load Prediction of Central Heating Heat Exchange Station Based on BiLSTM-AdaBoost. Reg. Heat Supply 2025, 55–63. [Google Scholar]
Zhang, Y. Bayesian Optimization-Based BiLSTM-Adaboost Heat Load Prediction for Thermal Power Plants. Sci. Innov. Appl. 2025, 15, 1–6. [Google Scholar] [CrossRef]
Yang, T.; Gao, X.; Yang, Z.; Kong, C.; Dong, B.; Li, T.; Zhu, Z. TBM Excavation Speed Prediction Based on NRBO-XGBoost and ABKDE Fusion Interpretable Model. J. Henan Univ. Sci. Technol. Nat. Sci. 2025, 73–87, 121–122. [Google Scholar]
Kang, Y.; Ai, H.; Peng, R.; Hu, W.; Wu, W.; Zhang, Z.; You, Y. Adaptive Bandwidth Flood Interval Forecasting Based on Error Correction Fusion Model. Water Resour. Protect. 2025, 106–114, 131. [Google Scholar]
Zhou, L.; Zhang, D.; Zhang, L.; Zhu, J. Dynamic Gust Detection and Conditional Sequence Modeling for Ultra-Short-Term Wind Speed Prediction. Electronics 2024, 13, 4513. [Google Scholar] [CrossRef]
Li, L.; Zhuo, Y.; Meng, W.; Chen, Z.; Wei, H. A Fault and Capacity Loss Prediction Method of Wind Power Station under Extreme Weather. Math. Probl. Eng. 2023, 2023, 8763185. [Google Scholar] [CrossRef]
IEC 61400-1; Wind Turbines—Part 1: Design Requirements. International Electrotechnical Commission: Geneva, Switzerland, 2019.
Jing, T. Research on Elasticity Evaluation and Improvement Strategies of Wind Farms under Extreme Operating Conditions. Master’s Thesis, Shenyang University of Technology, Shenyang, China, 2022. [Google Scholar]
Chen, B. Preventive Maintenance Strategy for Wind Turbines Based on Power Prediction. Master’s Thesis, Shenyang University of Technology, Shenyang, China, 2024. [Google Scholar]
Chen, X.; Yin, L.; Zhan, Z.; Wang, F.; Li, X. Fault Identification of Wind Turbines Based on Attention Mechanism and RCN-BiLSTM Fusion. China Electr. Power 2025, 58, 94. [Google Scholar]
Zhao, H. Source-Load Forecasting and Demand Response of Virtual Power Plant Considering Meteorological Drive and Carbon Emission Mechanism. Master’s Thesis, Guangxi University, Nanning, China, 2025. [Google Scholar]
Li, W.; Li, Z.; Li, S.; Hu, J. Photovoltaic Power Prediction Based on TBiLSTM Hybrid Network with Multi-Temporal Feature Fusion. Acta Energiae Solaris Sin. 2025, 1–10. [Google Scholar]
Yang, X.; Li, Y.; Bai, Y.; Meng, L. Short-Term Photovoltaic Power Forecasting Based on Cluster-Integrated LSSVM–Adaboost Model. J. North China Electr. Power Univ. Nat. Sci. Ed. 2025, 1–12. [Google Scholar]
Chen, A. Short-Term Electricity Price Forecasting and Uncertainty Quantification Based on Multi-Scale Decomposition and Dynamic Weight Optimization. Master’s Thesis, Guangxi University, Nanning, China, 2025. [Google Scholar]
Zhao, Y. Short-Term Uncertainty Prediction of Wind Power Based on Deep Learning. Master’s Thesis, Shenyang Agricultural University, Shenyang, China, 2024. [Google Scholar]
Liao, X.; Wu, Y.; Wei, Z.; Hu, W.; Yang, Z.; Sun, G.; Zang, H. Probabilistic Power Flow Analysis of Integrated Electricity-Heat Energy Systems Based on GMM and Multi-Point Linear Semi-Invariant Method. Electr. Power Autom. Equip. 2019, 39, 55–62. [Google Scholar]

Figure 1. Schematic diagram of the spatial relationship between extreme weather trajectories and the affected power system area.

Figure 2. Extreme gust trajectory and exposure regions of wind farms. The dashed line represents the path of extreme wind gusts, and the arrows indicate the diameter of the circle, which is the parameter Dwr.

Figure 3. Wind turbine power curve.

Figure 4. BiLSTM algorithm structure.

Figure 5. The framework of the proposed BiLSTM-Adaboost-ABKDE algorithm for wind power forecasting.

Figure 6. Loss of training and validation sets over epochs.

Figure 7. Impact of λ and ω on MAPE.

Figure 8. Research flowchart of the wind farm operational risk assessment method under extreme gust conditions.

Figure 9. Flow chart of coordinated control of PV-based energy storage and black start.

Figure 10. Wind power forecasts with prediction intervals.

Figure 11. Multi-model prediction comparison chart.

Figure 12. Branch circuit power limit risk.

Figure 13. Voltage limit risk.

Figure 14. Load shedding risk.

Figure 15. Node voltage limit risk, branch power limit risk, load shedding risk.

Table 1. Parameters for Wind Turbine Classifications.

Fan Class	Ⅰ	Ⅱ	Ⅲ	S
$V_{r e f} (m / s)$	50	42.5	37.5	Designer-specified value
$V_{ave} (m / s)$	10	8.5	7.5
$A I r e f (-)$	0.18	0.18	0.18
$a (-)$	2	2	2
$B I r e f (-)$	0.16	0.16	0.16
$a (-)$	3	3	3

Note (1):

V_{r e f} (m / s)

represents the 10 min reference average wind speed. Note (2):

V_{ave} (m / s)

represents the historical average wind speed at hub height. Note (3):

A I r e f (-)

and

B I r e f (-)

are the reference turbulence intensities at a mean wind speed of 15 m/s for turbulence classes A and B, respectively. Note (4):

a (-)

is the wind shear exponent used to describe the vertical wind speed profile.

Table 2. Comparison chart of computation time and resource consumption.

Model	Training Time (Seconds)	Prediction Time (Seconds)	GPU Utilization (%)	Average MAPE (%)
BiLSTM	305	18	72	6.15
BiLSTM-Adaboost-ABKDE	338	22	83	4.79

Table 3. Prediction Performance Evaluation Metrics.

Confidence Level	Algorithm in This Paper				Gated Recurrent Unit				Transformer				GAN-Based Model
Confidence Level	MAPE	RMSE	PICP	PINAW	MAPE	RMSE	PICP	PINAW	MAPE	RMSE	PICP	PINAW	MAPE	RMSE	PICP	PINAW
95.0	4.32	7.11	96.4	7.59	6.85	10.34	91.2	10.84	6.13	9.22	92.4	9.41	5.79	8.02	92.2	15.35
75.0	4.96	6.45	81.3	5.04	7.42	9.75	72.6	7.32	6.92	8.51	76.0	6.80	6.43	7.45	78.6	10.57
50.0	5.84	5.82	57.3	3.44	8.13	9.11	46.8	5.16	7.85	7.90	50.2	4.42	7.26	6.91	49.2	4.90

Table 4. Paired t-test results.

Model	Average MAPE	p-Value
BiLSTM-Adaboost-ABKDE	4.79%	-
GRU	6.50%	0.03
Transformer	6.25%	0.04
GAN	6.10%	0.02

Table 5. Statistical characteristics of risk indicators.

Indicator	Mean	Standard Deviation	Peak-to-Average Ratio
$R_{C U T}$	0.12	0.05	1.40
$R_{L I N E}$	0.22	0.07	2.20
$R_{V}$	0.27	0.07	2.20

Table 6. Computational time of different risk assessment methods.

Method	Training Time (Seconds)	Risk Computation Time (Seconds)	Total Time (Seconds)
BiLSTM-Adaboost-ABKDE	204.43	22.13	233.65
Monte Carlo Simulation	-	125.75	330.18

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Article metric data becomes available approximately 24 hours after publication online.