Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit

Li, Bin; Lu, Yaping; Meng, Xuguang; Li, Peijie

doi:10.3390/app15052654

Open AccessArticle

Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit

¹

Guangxi Key Laboratory of Power System Optimization and Energy Technology, Guangxi University, Nanning 530004, China

²

Power China Zhongnan Engineering Co., Ltd., Changsha 410014, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(5), 2654; https://doi.org/10.3390/app15052654

Submission received: 10 February 2025 / Revised: 25 February 2025 / Accepted: 26 February 2025 / Published: 1 March 2025

(This article belongs to the Section Electrical, Electronics and Communications Engineering)

Download

Browse Figures

Versions Notes

Abstract

Increasing wind power penetration will profoundly impact a power system’s operating mechanism. It is necessary to study a control strategy so that wind farms can use energy storage to improve their controllability to the level of traditional units. Therefore, this paper proposes a control strategy for wind storage systems based on temporal pattern attention (TPA) and bidirectional gated recurrent units (BiGRUs). The control strategy uses BiGRU to extract the time series information between the energy storage output, the actual output of the wind farm, and the energy storage state, which improves the control stability of a wind storage system. At the same time, TPA is introduced to assign different weights to the hidden layer state of the neural network to highlight the importance of local time series information to the current energy storage output, effectively improving the model performance and reducing the control deviation. Finally, the stability and superiority of the proposed control strategy are verified based on an actual wind farm dataset. The economy of the wind storage system with this control strategy improves significantly.

Keywords:

bidirectional gated recurrent unit; control strategy; data-driven; deep learning; wind storage combined system

1. Introduction

Joint control strategies of wind storage systems play a crucial role in enhancing the competitiveness and regulation of wind power in high-penetration markets [1,2]. Energy storage systems achieve dispatchability comparable to that of conventional units by regulating the active output of wind power to track the dispatch schedule value [3,4,5]. Given energy storage’s cost and capacity constraints, optimizing the wind storage control strategy to improve output accuracy and system economic efficiency is essential.

Much previous research has been conducted on the joint control strategy of wind storage systems. It is mainly divided into two categories: One is the direct control strategy. That is, real-time section control is carried out according to the deviation between the actual output of the current wind farm and the planned value, which mainly includes mode decomposition [6], proportional–integral–derivative (PID) control [7], fuzzy control [8], Fourier transform [9], and so on. The direct control strategy can quickly respond to errors and has strong engineering practicability. However, in the case of large fluctuations in wind power, it is easy to cause problems such as the over-charging and over-discharging of the energy storage system and frequent regulation, which affects the final control effect and economy. The second category is the process optimization control strategy. Aiming at the wind power output error in the control period, this kind of control strategy establishes a mathematical model with the number of energy storage orders and throughput as the objective function and uses the optimizer to solve it. This type of control strategy continuously corrects the energy storage output during the control period through rolling calculations [10,11], effectively reducing the number of energy storage actions and having high control accuracy. However, the solver may not converge in complex environments, which affects the control effect. If proportional–integral (PI) control is used when the solver does not converge, the stability of control can be improved to a certain extent. Therefore, the data after control of the above two control methods can be used as samples, and the excellent control characteristics of both can be learned using deep learning methods, which can enhance the stability and comprehensive performance of control.

Firstly, wind power has high stochasticity and volatility [3]. Deep learning can deal with uncertainty problems well, and it performs well in the research of power system state estimation [12], transient stability [13], fault detection [14], and automatic generation control (AGC) [15]. In the research of wind power generation, deep reinforcement learning extracts uncertain relationships between inputs and outputs from massive, high-dimensional data from wind farms, which significantly reduces the omission of crucial information, and improves the coordination and economy of the wind storage system scheduling as well as the grid connection [16,17]. In addition, deep learning can also cope well with wind power uncertainty [18]. The above research shows that deep learning has applicability and advancement for wind storage combined control strategy research.

Secondly, there exists a certain time series of wind power. Among many neural network algorithms, both the long short-term memory network (LSTM) and gated recurrent unit (GRU) can handle time-series data well and are widely used in wind power prediction [19], wind speed prediction [20], and short-term load prediction [21]. In wind power prediction, the prediction performance of the two is equivalent, but compared with the LSTM network, the structure of the GRU is more concise, and the training speed is faster [21]. In addition, the bidirectional gated recurrent unit (BiGRU) can process the time series data in both directions and fully extract the connection before and after the time series data, which has been shown to improve the model’s accuracy compared to GRU [22,23]. Therefore, this paper uses BiGRU to extract the uncertainty relationship between wind power and energy storage action and realizes the prediction classification of energy storage actions.

The attention mechanism in neural networks can focus on crucial information, reduce attention to other information, and even filter out irrelevant information, thus solving the problem of information overload and improving the efficiency and accuracy of task processing [24,25]. It is mainly used to guide the model focusing on essential features in the power system to improve its performance [26]. If the temporal pattern attention mechanism (TPA), which is more sensitive to temporal features [27], is used in wind power prediction, it can improve the model prediction accuracy [28]. Therefore, this paper uses TPA to extract important temporal information in the BiGRU hidden layer to improve model performance.

Motivated by it, this paper proposes a joint control strategy for the wind storage system based on TPA-BiGRU. The contributions are summarized as follows.

The wind storage system is controlled by adopting the advanced rolling optimization control strategy (AROCS) and PI control strategy, and a dataset with excellent wind storage control characteristics is constructed.
An evaluation standard for the output deviation of wind power active power is proposed. This standard takes the assessment requirements of China Southern Power Grid for wind power integration as an example, which can be consistent with the grid-connected requirements of conventional units.
A wind storage joint control strategy based on the TPA-BiGRU algorithm is proposed, which solves the problem of non-convergence of mathematical modeling methods. It can obtain the storage control quantity in real-time and dynamically, which improves the stability and accuracy of the wind storage system under the premise of ensuring economic benefits.

The rest of this paper is organized as follows: Section 2 describes the structure of the proposed model and the evaluation criteria. Section 3 presents the basic theory and design of the model. Section 4 shows the model’s training process. In Section 5 and Section 6, the effectiveness of the control strategy proposed in this paper is verified by experimental comparison and conclusions are drawn.

2. Structure and Control Criteria

The structure of the joint control strategy of wind storage systems is shown in Figure 1. First, the regulation power is calculated based on the data of the actual output value of wind turbines

P_{A}

, the planned output value

P_{P}

, and the energy storage state of charge (

S O C

). Then, the dead zone is set to judge whether the energy storage is acting or not. The dead zone is defined as a predefined threshold range (e.g., ±5% of the nominal power) where the energy storage system remains inactive. Conversely, the storage system is activated only when the control deviation (e.g., power imbalance) exceeds the thresholds. The exact threshold can be adjusted based on the grid requirements or optimization objectives to balance response frequency and equipment lifespan. When it is judged that the energy storage output is needed, the system will output the regulation power of the energy storage

P_{R}

.

To ensure the safe and efficient integration of large-scale wind power into grid operation, it is stipulated that the active power regulation capacity of grid-connected wind power should meet the requirements of the grid management of new energy field stations. Moreover, to improve the competitiveness of wind power, it is necessary to raise the assessment requirements of the wind power generation program to the same level as conventional units. Taking China Southern Power Grid as an example, it is stipulated that the assessment is carried out every 15 min, and the power deviation rate of conventional grid-connected units should not exceed ±2.5% [29]. The specific requirements are shown in Table 1.

Table 1 shows that the China Southern Power Grid has made high demands on the regulation accuracy and response speed of wind farms’ active output in addition to the need for wind farms to have substantial voltage and frequency adaptability. Accordingly, this paper proposes an evaluation criterion for wind power active power output deviation as follows:

The average value of the control deviation per minute in each assessment period is expressed using

K 1

, which is defined by the following formula:

K 1 = \frac{1}{n} \sum_{i = 1}^{n} \frac{|P_{P i} - P_{A i}|}{C_{c a p}}

(1)

where

C_{c a p}

is the starting capacity of the wind farm,

P_{P i}

and

P_{A i}

are the planned and actual values of the wind farm at sampling point

i

, and

n

is the number of sampling points.

The

K 1

standard is one assessment point per minute and 1440 assessment points throughout the day. The qualified standard is

K 1 \leq 2.5 %

.

3. Designing Control Strategy and Model

3.1. Joint Control Strategy of Wind and Storage Based on TPA-BiGRU

The control process of the wind storage joint control strategy based on advanced rolling optimization control and PI control is divided into two parts. The first part is to determine the type of energy storage action. Both control strategies input the relevant data of wind farms and energy storage and judge the operation of energy storage by calculation. Only when the current control deviation is outside the dead zone does the energy storage need to be charged and discharged. Otherwise, energy storage will not act. The second part calculates the specific value of the charge and discharge of the energy storage. If it has been judged that the energy storage output needs to be adjusted, the adjustment power of the energy storage is determined by using the data of the first part.

In this paper, the proposed wind storage joint control strategy uses three trained TPA-BiGRU models to realize the judgment and calculation functions in the above links to obtain a new wind storage joint control strategy. The function of the first link is to distinguish the action state of energy storage, that is, charging, discharging, and inaction, which can be realized by using a three-classification model. The function of the second link is to calculate the specific value of the charge and discharge power of the energy storage, which can be realized by using two regression calculation models.

Firstly, the energy storage action state is judged by the classification network. A control calculation is performed on each sampling point, and the relevant data at the judgment time are composed of the original data according to the input requirements and standardized. The average and variance of the variables are calculated when training the network. The processed data are input into the classification network to obtain the energy storage control state after the one-hot encoding with a length of 3. After decoding, three control states of energy storage are finally obtained, namely, −1 (charging), 0 (inaction), and 1 (discharging). Then, the regression network calculates the regulated power value under the energy storage’s charge and discharge state. When the energy storage state is in action, the input samples are fed again into the charging or discharging power regression network for calculation, and the network output is anti-standardized to obtain the energy storage charging or discharging power value.

3.2. Joint Control Model of Wind and Storage Based on TPA-BiGRU

Firstly, the proposed model uses the deep BiGRU network to process wind power and energy storage data. Then, the temporal relationship of each feature quantity is extracted and processed for long-term memory. Moreover, it utilizes the TPA mechanism to strengthen the model memory function, highlighting the importance of the local information and ensuring the model’s accuracy and stability. The control model proposed in this paper is shown in Figure 2.

3.2.1. Bidirectional Gated Recurrent Unit

GRU replaces the forgetting gate and input gate in LSTM with an update gate, so the GRU model has fewer parameters and a more straightforward structure, and the performance of the two is comparable. The specific calculations are as follows:

r_{t} = σ (W_{r} \cdot [h_{t - 1}, x_{t}])

(2)

z_{t} = σ (W_{z} \cdot [h_{t - 1}, x_{t}])

(3)

{\tilde{h}}_{t} = \tanh ((W_{h}) \cdot [r_{t} \cdot h_{t - 1}, x_{t}])

(4)

h_{t} = (1 - z_{t}) \cdot h_{t - 1} + z_{t} \cdot {\tilde{h}}_{t}

(5)

where the symbol

σ

represents the sigmoid activation function

σ (z) = \frac{1}{1 + e^{- z}}

;

x_{t}

is the current wind farm data;

h_{t - 1}

is the output of the upper hidden layer;

r_{t}

and

z_{t}

are the reset door and the update door, respectively;

{\tilde{h}}_{t}

is the candidate hidden gate state;

W_{r}

,

W_{z}

, and

W_{h}

are the network parameter matrices; [ ] is vector splicing; and

\cdot

is the multiplication of the matrices by elements.

To better obtain the mapping relationship between the input data and the energy storage regulation, this paper chooses the BiGRU network to capture the relationship between the two in a bidirectional time series. It can be seen from Figure 2 that the output of the BiGRU model contains both historical and future information of the input data, which can avoid the lack of information and improve the prediction accuracy when dealing with rolling optimized data for wind power and energy storage.

3.2.2. Temporal Pattern Attention Mechanism

This paper introduces TPA to strengthen the model’s memory of long-term time series information while reinforcing the key features of local short-term information, highlighting the key factors affecting the energy storage output, and improving the model’s prediction effect. The designed TPA is calculated as follows [26]:

H_{i, j}^{c} = \sum_{l = 1}^{w} H_{i, (t - w - 1 + l)} \times C_{j, T - w + l}

(6)

f (H_{i}^{C}, h_{t}) = {(H_{i}^{C})}^{T} W_{a} h_{t}

(7)

α_{i} = s i g m o i d (f (H_{i}^{C} \cdot h_{t}))

(8)

c_{t} = \sum_{i = 1}^{n} α_{i} H_{i}^{C}

(9)

h_{t}^{A} = W_{h} [c_{t}; h_{t}]

(10)

where

H = [h_{t - w + 1}, h_{t - w + 2, \dots,} h_{t - 1}]

is the hidden state matrix of the original wind storage time series containing multiple moments of information after BiGRU processing, and the length of the time window is

w

;

C

is a convolution kernel of length T, generally taken as the window length;

H_{i, j}^{c}

is the convolution value in row

i

and column

j

;

H^{C}

denotes the temporal pattern matrix;

h_{t}

represents the hidden state information extracted by the neural network from the input wind storage data feature matrix at the current moment;

W_{a}

and

W_{h}

are the weight parameter matrices;

α_{i}

is the attention weight vector, which characterizes the importance of the hidden state information at each moment in the state matrix, and length

w

;

c_{t}

is the feature vector that characterizes the temporal relationship after weighting; and

h_{t}^{A}

is the splicing of the feature vector and the current moment state information, and it is also the final output result of TPA.

4. Training Process of the Model

The model proposed in this paper mainly comprises a deep BiGRU neural network and TPA. As a neural network with supervised learning, BiGRU first needs to obtain the energy storage action data after optimal control and PI control as the dataset label. Since the optimal control has superior performance, it is chosen for most of the examination periods, and the PI control is used when the optimization algorithm does not converge. The control data obtained implies the excellent characteristics of the two typical control strategies.

4.1. Selection of Input and Output Variables for the Network

When using optimal control and PI control to generate datasets, four inputs are used as follows:

P_{A}

,

P_{P}

,

P_{F}

, and energy storage output value in the past period. The optimal control takes the minimum penalty power as the primary goal and the minimum battery throughput as the secondary goal, which the solver solves to obtain the energy storage output value for the period. The first three inputs belong to the characteristic variables reacting to the power state of the wind farm at the sampling moment, and the fourth input belongs to the characteristic variables reacting to the power status of the energy storage in the past period. When selecting the input variables of the neural network, to reflect the actual operating state of the wind storage combined system at the sampling time as much as possible, this paper selects the input variables of the dataset as follows:

P_{A}

,

P_{P}

,

P_{F}

, and the

S O C

in the past period.

The input variables of the three TPA-BiGRU network models are the same, but the selected output variables are different due to the different functions of different networks. The output variables of the classification network are discrete variables of the energy storage action state, which are divided into three categories: −1, 0, and 1 for charging, inactive, and discharging, respectively. The charging and discharging regression networks select the corresponding energy storage charging and discharging power value as the output variable.

4.2. Data Preprocessing

For continuous data, such as

P_{A}

,

P_{P}

,

P_{F}

, and

S O C

, the four types of input variables and the energy storage adjustment power as output variables in the regression network are all processed by the Z-score standardization method. The conversion formula is as follows:

X = \frac{x - μ}{σ^{'}}

(11)

where

X

is the normalized value;

x

is the value to be standardized; and

μ

and

σ^{'}

are the mean and standard deviation of the characteristic variables.

The discrete data are processed by one-hot encoding. The energy storage charging, inaction, and discharge state values are −1, 0, and 1, respectively, corresponding to 100, 010, and 001 after coding.

The TPA-BiGRU network model training requires three-dimensional (3D) supervised learning data, so after the data preprocessing, it is necessary to use a sliding window of sequence length multiplied by the size of the features to frame two-dimensional data in the time series data and superimpose it to obtain 3D data. According to the training effect, the sliding window size taken in this paper is 151 × 4.

The training dataset for the classification network is a 3D dataset labeled with the preprocessed energy storage states. In contrast, the charging and discharging regression network training dataset is a 3D dataset labeled with the corresponding charging and discharging power values.

4.3. Training Process

The structure of the deep TPA-BiGRU network model proposed in this paper is mainly divided into the input, hidden, TPA, and output layers. The data input size of the input layer is 151 × 4, and the hidden layer has four layers, each containing a BiGRU layer, a dropout layer, and an activation function. For the classification network, its output layer outputs a sequence of length 3, and the activation function is Softmax. For the regression network, its output is the energy storage action value, and the length is 1. So its output layer can be a fully connected layer with the number of neurons 1. The specific hyperparameter settings are shown in Table 2.

The hyperparameters listed in Table 2 are selected through empirical validation and domain-specific considerations. For instance, the dropout rate is tuned based on task complexity: a higher rate (0.5) applies to classification to counteract overfitting in multi-class scenarios, while a lower rate (0.3) is used for regression to preserve network capacity. Neurons in BiGRU layers are sized to balance computational efficiency and feature representation needs (256 for classification vs. 128 for regression). The choice of sliding window size balances computational efficiency and feature richness, and the performance metrics of different window sizes are evaluated using sensitivity analysis, which shows that 151 × 4 has the lowest RMSE. The number of BiGRU layers is investigated using an ablation study to quantify the depth-influence relationship, and the 4-layer optimization strikes a balance between accuracy and GPU memory utilization. All the choices are validated via cross-validation on our dataset, with ablation studies confirming their necessity.

The data processing and training processes for the three TPA-BiGRU network models are as follows:

Data preprocessing: Standardize and encode the input and output of the three networks, respectively.
Data sampling: The time series data after preprocessing are sampled by sliding sampling with a size of 151 × 4 window and stored in the form of n × 151 × 4.
Division of training and test sets: The sampled dataset is divided into training and test sets in the ratio of 7:3. The data from the training set is fed into the TPA-BiGRU model, and the predicted values are obtained after neural network black-box computation.
Parameter update: The training set loss is calculated according to the predicted value and the training set label, and the parameters in the recurrent neural network are updated after a single back-propagation computation.
Performance evaluation: The test set loss is obtained by substituting the test set data into the untrained TPA-BiGRU model and comparing it with the training set loss. If overfitting or underfitting occurs, the network structure or hyperparameters need to be adjusted.

After iterative training, three TPA-BiGRU models are constructed, laying the foundation for the simulation of the wind storage joint control strategy based on TPA-BiGRU. The training results are shown in Table 3.

During the simulation process, the data are read in 151 × 4 at the ordered moments and the raw data are normalized using Z-score and one-hot encoding codes. Then, they are inputted into the BiGRU network to extract the bidirectional timing features to memorize the timing relationship between the input variables, and the feature matrix in the last layer of the BiGRU network is inputted into the TPA network to strengthen the model memory function, and at the same time to highlight the importance of the local information to the energy storage output at the current moment. The classification network determines how the energy storage acts at that moment, with output 1 indicating discharge, output −1 indicating charge, and output 0 indicating inaction. When the output of the classification network is 1 (−1), the standardized data will be input into the discharge (charging) regression model to calculate the specific energy storage discharge value (charging value).

5. Case Study

The computational experiments utilized TensorFlow v2.18.0 (Google LLC, Mountain View, CA, USA) under Python 3.8 on hardware comprising an Intel Core i7-10700F CPU (Intel Corporation, Santa Clara, CA, USA) and AMD Radeon R5 430 GPU (Advanced Micro Devices, Inc., Santa Clara, CA, USA). The current implementation uses a minimalist hardware setup and can support the real-time control of a 100-MW wind farm cluster with latency well below the operational threshold. Higher-end hardware will allow for larger batch processing.

The data in this paper comes from the actual historical data of a 100-MW wind farm cluster, which is taken from the operation data of the first half of 2018 for 33 days, with a total of 3168 assessment periods, a sampling period of 2 s, and a total of 43,200 sampling points for the whole day. Among them, a total of 2208 assessment periods in 23 days are used as training sets, and a total of 960 assessment periods in 10 days are used as verification sets. The data used for AROCS and the energy storage parameters are referred to in the literature [11], and the coefficients of PI control are set dynamically according to the control deviation.

5.1. Comparative Control Strategies

To compare the effectiveness of the joint control strategy for the wind storage system based on TPA-BiGRU proposed in this paper, five control strategies are used for simulation and analysis. The experimental data are derived from 31 days of data for the first half of 2019 for the wind farms mentioned above.

TPA-BiGRU: Temporal pattern attention mechanism combined with the bidirectional gated recurrent unit. The new decision model proposed in this paper.
TPA-BiLSTM: The GRU in TPA-BiGRU is replaced with LSTM, and the bidirectional structure and attention mechanism are retained to verify the computational efficiency advantage of GRU in the joint control of the wind storage system by comparison.
BiGRU: The temporal pattern attention (TPA) module in TPA-BiGRU is removed, and only the bidirectional GRU is retained for quantifying the contribution of the attention mechanism to multi-timescale feature extraction.
AROCS: Advanced Rolling Optimal Control Strategy with Model Predictive Control (MPC) framework, the core of which is to dynamically adjust the power allocation of the wind storage system through Rolling Horizon Optimization. AROCS stands for comparative experiments on optimization models.
PI control: Traditional proportional–integral feedback control. The real-time deviation of the wind storage system is used as an input, and through PI control, the regulation command of the energy storage is output to regulate the battery storage output, thus reducing the deviation of the output of the wind storage system. PI control stands for classical feedback control.

5.2. Analysis of Control Effects

The evaluation indexes are RMSE, ABS_MAX, KD, KDB, KS, KSH, K1%, TD, TDA, and TDB. The specific meanings of each index are in the abbreviations.

Table 4 and Table 5 show the effect of wind power and energy storage outputs and the comparison of energy storage output under five control strategies. Table 4 shows that the five control strategies effectively reduce the assessment power of the wind farm and improve the tracking planning ability of the wind storage combined system. However, the wind storage joint control strategy based on TPA-BiGRU performs the best, and it can control the control deviation of the wind storage combined system in a small range. The average value of RMSE is only 0.79%, with high control accuracy. The assessment rate (KSH) is only 10.15%, and the assessed electric quantity (KD) of the wind farm is reduced from the original 3466.96 MWh to 31.76 MWh. Table 5 shows that the wind storage joint control strategy based on TPA-BiGRU has the least average regulated electricity quantity (TDA), which is only 146.48 MWh, and all the storage regulations are better than the other networks. It can be seen from the two tables that the wind storage joint control strategy based on TPA-BiGRU has better control accuracy and stability and is more suitable for the operating conditions of wind farms than the other four control strategies.

Table 6 shows the assessment results of each control strategy. It can be seen from the table that the K1 assessment index of the wind storage joint control strategy based on TPA-BiGRU has the largest number of 100% qualified days, which exceeds the other four control strategies. It shows that the wind storage joint control strategy based on TPA-BiGRU has a better and more stable control effect than the other four control strategies.

To verify the robustness and reliability of the results of the study, this paper uses the following methodology to validate the experimental results:

Statistical significance tests: A paired t-test (α = 0.05) is performed on the prediction error of TPA-BiGRU versus the baseline model (BiGRU) over 100 trials, and the results confirmed a significant difference in the RMSE distributions (p < 0.01), validating the reliability of the proposed model.
K-fold cross-validation: Applying 5-fold cross-validation to assess the stability of the proposed model, the RMSE variance across folds is only 1.8%, ensuring the general applicability of the proposed model in different operational scenarios.
Ablation Studies: Removal of the key model component temporal pattern attention mechanism increases RMSE by 14.8%, confirming its key role in capturing temporal dependencies.

Taking a particular day to analyze the control effect, the RMSE between the actual output of the wind farm and the planned value on that day is 8.61%, and the whole day’s output is 662.17 MWh. Figure 3 shows the wind power output and control deviation curves under the five control strategies. From Figure 3 and Table 7, all five control strategies can effectively track the planned power output. However, the wind storage joint control strategy based on TPA-BiGRU has the best control effect and the highest control accuracy, and the proportion of the assessed electric quantity (KDB) is only 0.33%. It shows that the wind storage joint control strategy based on TPA-BiGRU has a solid ability to adapt to the uncertainty of the wind farm output and can track and regulate energy storage well even when the wind power output changes suddenly.

Table 8 shows the regulation of the energy storage systems with five control strategies. The wind storage joint control strategy based on TPA-BiGRU has the best energy storage regulation performance. In the case that the system can effectively track the planned output, the control strategy proposed in this paper has the least total regulated electricity quantity (TD), the least proportion of the regulated electricity quantity (TDB), and the least number of energy storage operations. It can effectively avoid equipment aging and loss accelerated by frequent actions, and too much regulation power leads to the overuse of energy storage equipment, which affects the equipment’s life.

5.3. Economic Analysis

At present, the cost of energy storage is still high, so it is necessary to evaluate the income level of the wind storage combined system. The economic evaluation of the energy storage system used in this paper is referenced in [11].

Based on the simulation results of 31 days, the annual utilization hours of the wind farm are assumed to be 2300 h. The energy storage battery life and the number of replacements are evaluated using the rain flow counting method [30]. Based on the results of the energy storage system life calculation, it can be seen that during the 20 A life cycle, the PI control strategy needs to replace the equipment twice, while the other four control strategies need to replace the equipment once. The economics of specific energy storage systems are shown in Table 9.

As shown in Table 9, the wind storage joint control strategy based on TPA-BiGRU can extend the lifetime of the storage system by reducing the battery throughput as much as possible while lowering the appraised power during the whole operation cycle. This control strategy achieves a yield of 25.49% over the 20 A life cycle, which is the highest economic benefit among the five control strategies.

In summary, the deep learning control strategy proposed in this paper can obtain the wind storage control results in real-time and quickly and can take into account the instantaneity, stability, and accuracy under various operating conditions while ensuring the economy of the wind storage combined system.

6. Conclusions

This paper presents a joint control strategy for wind storage systems based on TPA-BiGRU. The proposed strategy can improve the economy and stability of the system.

Contribution of this study:
(1)
Based on the bidirectional gated recurrent unit (BiGRU) recurrent neural network, this paper uses its classification and regression calculation functions to construct a deep learning model for wind storage joint control strategy. At the same time, the model introduces TPA to further strengthen its performance. The control strategy can adapt to the complex and variable operating conditions of wind farms and can obtain the wind and storage control results in real-time and quickly.
(2)
The proposed TPA-BiGRU framework operates in an offline training and online deployment paradigm for real-time control. Once the model converges during offline training, its online inference requires only lightweight computations via a pre-optimized approximation model. Specifically, the real-time control latency is reduced to 1.2 ms per decision cycle, which is negligible compared to the 15 s control interval of typical wind storage systems.
(3)
TPA-BiGRU can effectively improve the ability of the wind storage system to track the planned output and reduce the amount of penalty power. One day of TPA-BiGRU’s largest assessment period penalty power accounted for only 1.8%, to meet the requirements of the grid rules of less than 2.5%, and effectively enhanced the market competitiveness of the wind storage system.
(4)
TPA-BiGRU can optimize the battery storage conditioning process and extend the battery storage life. Compared with the PI strategy, TPA-BiGRU extends the battery life by 83.51%, effectively reduces the battery operating cost, and increases the overall yield of the wind storage system by 74.33% over 20 years, thus further enhancing the market competitiveness of the wind storage system.
Inspiration for future study:
(1)
Adaptive hybrid control with dynamic data augmentation:
While the TPA-BiGRU strategy enhances grid compliance metrics (Table 6), residual unqualified intervals persist due to the incomplete coverage of stochastic wind regimes in training datasets. To address wind-ramp edge cases, it is proposed to implement Discrete Fourier Transform (DFT) filtering during transient violations to refine battery dispatch signals, then iteratively update training sets via online learning.
(2)
Context-aware control via transformer-enhanced architecture:
Building on transformer-based breakthroughs in power forecasting, the future is dedicated to deploying lightweight decoders. Integrate FEDformer’s frequency decomposition blocks to handle multi-timescale inertial responses. Develop time-frequency hybrid models: Combine TPA-BiGRU’s temporal efficiency with the transformer’s multi-head attention for global context awareness.
(3)
Resilient operation under extreme disturbances:
Although the proposed model achieves less than 4 ms latency in nominal operation, its generalization capability may deteriorate during unmodeled extreme events (e.g., typhoon-induced turbulence). To ensure contingency robustness, it is recommended to add conditional mode switching. Deploy PI backup controllers at converter stations, activated when deep learning prediction confidence falls below 85%.
(4)
Enhanced scalability:
Current validations on a 100 MW wind farm (20 turbines, 40 MWh energy storage system) reveal computational scaling challenges: Input features grow as O(4N) (time complexity expression) per turbine. To avoid the dimensionality curse and memory bottlenecks, edge-federated architecture can be employed by deploying TPA-BiGRU locally on turbine-level edge devices, coordinated by a lightweight central optimizer.

Author Contributions

Conceptualization, B.L. and Y.L.; methodology, Y.L.; software, Y.L.; validation, B.L., Y.L. and X.M.; formal analysis, Y.L.; investigation, Y.L.; resources, B.L.; data curation, X.M.; writing—original draft preparation, Y.L.; writing—review and editing, X.M.; visualization, Y.L.; supervision, B.L.; project administration, P.L.; funding acquisition, B.L. and P.L. All the authors have read and agreed to the published version of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 52267006.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

Author Xuguang Meng was employed by the company Power China Zhongnan Engineering Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$P_{A}$	Actual output value of wind farm
$P_{P}$	Planned output value of wind farm
$P_{R}$	Energy storage regulating power
$P_{N}$	Installed capacity of wind farm
$P_{F}$	Predicted power of wind farm
K1	average value of the control deviation per minute in each assessment period
SOC	energy storage state of charge
RMSE	Root-mean-square error
ABS_MAX	Absolute value of maximum output deviation, MWh
KD	Appraisal power, MWh
KDB	proportion of the assessed electric quantity, which is a ratio of the assessed electric quantity to the total output electric quantity in all-day
KS	Period to be assessed, min
KSH	assessment rate, which is a ratio of the assessment period to the total assessment periods
K1%	K1 pass rate
TD	Total regulated electricity quantity of energy storage, MWh
TDA	Average regulated electricity quantity of energy storage, MWh
TDB	Proportion of the regulated electricity quantity of energy storage, which is a ratio of the regulated electricity quantity to the total output electricity quantity throughout the day

References

Han, J.S.; Yang, L.; Xu, J.W.; Wang, S.; Liu, D.N. Research on the investment policy of energy storage and other flexible adjustment resources under the scenario of high proportion of new energy. In Proceedings of the 2021 IEEE Sustainable Power and Energy Conference (iSPEC), Nanjing, China, 23–25 December 2021; pp. 2–7. [Google Scholar]
Schrotenboer, A.H.; Veenstra, A.A.; uit het Broek, M.A.; Ursavas, E. A Green Hydrogen Energy System: Optimal control strategies for integrated hydrogen storage and power generation with wind energy. Renew. Sustain. Energy Rev. 2022, 168, 112744. [Google Scholar] [CrossRef]
Wang, J.; Zhou, J.; Wang, L.; Wang, C.; Wu, X.; Dai, L. Study on control strategy of wind farm combined with energy storage system. In Proceedings of the 16th IEEE Conference on Industrial Electronics and Applications (ICIEA 2021), Chengdu, China, 1–4 August 2021; pp. 130–135. [Google Scholar]
Zhang, Z.; Ding, T.; Zhou, Q.; Sun, Y.; Qu, M.; Zeng, Z.; Ju, Y.; Li, L.; Wang, K.; Chi, F. A review of technologies and applications on versatile energy storage systems. Renew. Sustain. Energy Rev. 2021, 148, 111263. [Google Scholar] [CrossRef]
Yi, T.; Ye, H.; Li, Q.; Zhang, C.; Ren, W.; Tao, Z. Energy storage capacity optimization of wind-energy storage hybrid power plant based on dynamic control strategy. J. Energy Storage 2022, 55, 105372. [Google Scholar] [CrossRef]
Wang, Q.; Chen, Y.; Cai, X.; Meng, Z.; Song, D. Real-time control strategy for wind-solar-storage industrial park based on variational modal decomposition. In Proceedings of the 2023 3rd International Conference on Intelligent Power and Systems (ICIPS 2023), Shenzhen, China, 20–22 October 2023; pp. 742–748. [Google Scholar]
Abumeteir, H.A.; Vural, A.M. Design and optimization of fractional order PID controller to enhance energy storage system contribution for damping low-frequency oscillation in power systems integrated with high penetration of renewable sources. Sustainability 2022, 14, 5095. [Google Scholar] [CrossRef]
Rahman, M.J.; Tafticht, T.; Doumbia, M.L.; Mutombo, N.M.A. Dynamic stability of wind power flow and network frequency for a high penetration wind-based energy storage system using fuzzy logic controller. Energies 2021, 14, 4111. [Google Scholar] [CrossRef]
Wu, C.; Gao, S.; Liu, Y.; Han, H.; Jiang, S. Wind power smoothing with energy storage system: A stochastic model predictive control approach. IEEE Access 2021, 9, 37534–37541. [Google Scholar] [CrossRef]
Chen, R.; Gao, C.; Ming, H. Rolling-horizon optimization strategy for wind-storage system in electricity market. IET Renew. Power Gener. 2024, 18, 825–836. [Google Scholar] [CrossRef]
Li, B.; Deng, Y.; Chen, B. Advanced rolling optimization control strategy for wind storage system with enhanced ultra-short-term wind power prediction. Power Grid Tech. 2021, 45, 2280–2287. [Google Scholar]
Yarlagadda, R.; Kosana, V.; Teeparthi, K. Power system state estimation and forecasting using CNN based hybrid deep learning models. In Proceedings of the 2021 IEEE International Conference on Technology, Research, and Innovation for BEtterment of Society (TRIBES), Raipur, India, 17–19 December 2021; pp. 1–6. [Google Scholar]
Zhao, T.; Wang, J.; Lu, X.; Du, Y. Neural Lyapunov control for power system transient stability: A deep learning-based approach. IEEE Trans. Power Syst. 2022, 37, 955–966. [Google Scholar] [CrossRef]
Moradzadeh, A.; Mohammadi-Ivatloo, B.; Pourhossein, K.; Anvari-Moghaddam, A. Data mining applications to fault diagnosis in power electronic systems: A systematic review. IEEE Trans. Power Electron. 2022, 37, 6026–6050. [Google Scholar] [CrossRef]
Zhang, X.; Li, C.; Xu, B.; Pan, Z.; Yu, T. Dropout deep neural network assisted transfer learning for bi-objective pareto AGC dispatch. IEEE Trans. Power Syst. 2022, 38, 1432–1444. [Google Scholar] [CrossRef]
Liu, F.; Liu, Q.; Tao, Q.; Huang, Y.; Li, D.; Sidorov, D. Deep reinforcement learning based energy storage management strategy considering prediction intervals of wind power. Int. J. Electr. Power Energy Syst. 2023, 145, 108608. [Google Scholar]
Xiang, G.; Yang, M.; Huang, S.; Yu, G.; Yin, A.; Liu, X. A deep reinforcement learning based control strategy for combined wind energy storage system. In Proceedings of the 2021 IEEE Sustainable Power and Energy Conference (iSPEC), Nanjing, China, 23–25 December 2021; pp. 33–38. [Google Scholar]
Jalali, S.M.J.; Osório, G.J.; Ahmadian, S.; Lotfi, M.; Campos, V.M.; Shafie-khah, M.; Khosravi, A.; Catalão, J.P. New hybrid deep neural architectural search-based ensemble reinforcement learning strategy for wind power forecasting. IEEE Trans. Ind. Appl. 2021, 58, 15–27. [Google Scholar] [CrossRef]
Pan, C.; Wen, S.; Zhu, M.; Ye, H.; Ma, J.; Jiang, S. Hedge backpropagation based online LSTM architecture for ultra-short-term wind power forecasting. IEEE Trans. Power Syst. 2024, 39, 4179–4192. [Google Scholar] [CrossRef]
Kumar, V.B.; Nookesh, V.M.; Saketh, B.S.; Syama, S.; Ramprabhakar, J. Wind speed prediction using deep learning-LSTM and GRU. In Proceedings of the 2021 2nd International. Conference on Smart Electronics and Communication (ICOSEC), Trichy, India, 7–9 October 2021; pp. 602–607. [Google Scholar]
Hua, H.; Liu, M.; Li, Y.; Deng, S.; Wang, Q. An ensemble framework for short-term load forecasting based on parallel CNN and GRU with improved ResNet. Electr. Power Syst. Res. 2023, 216, 109057. [Google Scholar] [CrossRef]
Liu, F.; Tao, Q.; Yang, D.; Sidorov, D. Bidirectional gated recurrent unit-based lower upper bound estimation method for wind power interval prediction. IEEE Trans. Artif. Intell. 2022, 3, 461–469. [Google Scholar] [CrossRef]
Wang, X.; Wu, Z.; Ge, J.; Zhang, Z.; Han, L.; Wang, S.; Zhang, X. Grid load forecasting based on dual attention BiGRU and DILATE loss function. IEEE Access 2022, 10, 64569–64579. [Google Scholar] [CrossRef]
Zohora, F.T.; Abedin, Z. Bangla image captioning with bidirectional GRU & attention mechanism. In Proceedings of the 2022 International Conference on Innovations in Science, Engineering and Technology (ICISET), Chittagong, Bangladesh, 26–27 February 2022; pp. 306–311. [Google Scholar]
Tan, C.; Gao, Z.; Wu, L.; Xu, Y.; Xia, J.; Li, S.; Li, S.Z. Temporal attention unit: Towards efficient spatiotemporal predictive learning. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 18770–18782. [Google Scholar]
Chen, Q.; Lin, N.; Bu, S.; Wang, H.; Zhang, B. Interpretable time-adaptive transient stability assessment based on dual-stage attention mechanism. IEEE Trans. Power Syst. 2023, 38, 2776–2790. [Google Scholar] [CrossRef]
Huang, B.; Liang, Y.; Qiu, X. Wind power forecasting using attention-based recurrent neural networks: A comparative study. IEEE Access 2021, 9, 40432–40444. [Google Scholar] [CrossRef]
Zhang, H.; Yan, J.; Liu, Y.; Gao, Y.; Han, S.; Li, L. Multi-source and temporal attention network for probabilistic wind power prediction. IEEE Trans. Sustain. Energy 2021, 12, 2205–2218. [Google Scholar] [CrossRef]
National Energy Administration Southern Regulatory Bureau. Implementation Rules for Grid-Connected Operation Management of Power Plants in Southern China [Government Document], Southern Energy Reg. Bureau—Market Reg [2017] no. 440. 25 December 2017. Available Online: http://120.31.132.37:8085/SCSERC_OUTER/temp/examples/upfileattch/70228872_wz_towaiwang.pdf (accessed on 6 September 2024).
Luo, J.; Gao, S.; Wei, X.; Tian, Z. Adaptive energy management strategy for high-speed railway hybrid energy storage system based on double-layer fuzzy logic control. Int. J. Electr. Power Energy Syst. 2024, 156, 109739. [Google Scholar]

Figure 1. The basic process of wind storage joint control.

Figure 2. Joint control model of wind and storage based on TPA-BiGRU.

Figure 3. Wind farm output under five control strategies. (a) Output curves of various control strategies; (b) output deviation curves of various control strategies.

Table 1. Comparison of grid-connected active power requirements between wind turbines and conventional units.

Type	Indices	Requirements
Wind turbine units	Maximum power variation limit	Installed capacity (MW)	10 min change value (MW)	1 min change value (MW)
		<30	10	3
		30–150	$P_{N} / 3$ *	$P_{N} / 10$
		>150	50	15
	Control accuracy	$\leq 3 % P_{N}$
	Response time	$\leq 120 s$
	Overshoot	$\leq 10 %$
Traditional units	15 min power deviation rate	$\pm 2.5 %$

*

P_{N}

is the installed capacity of the wind farm.

Table 2. Neural network hyperparameter setting.

Hyperparameter	Classification Network	Regression Network
Number of BiGRU layers	4	4
Training sample	512	512
Number of neurons in BiGRU	256	128
Dropout layer	0.5	0.3
The activation function of the hidden layer	PReLU	PReLU
Length of the input sequence	151	151
Length of the output sequence	3	1
Number of TPA layers	1	1
The activation function of the output layer	Softmax	None
Objective function	Cross-entropy loss function	Mean square error function

Table 3. Network training results.

Type	Accuracy of the Classification Networks	Errors in Charge Regression Networks	Errors in Discharge Regression Networks
TPA-BiGRU	0.9954	0.5041	0.5019
TPA-BiLSTM	0.9913	0.5236	0.5211
BiGRU	0.9882	0.5528	0.5513

Table 4. Comparison of control effects of wind and storage outputs.

Control Strategy	Range of RMSE	Average of RMSE	KD (MWh)	KS (min) *	KSH
Before control	3.82–10.5%	7.85%	3466.96	2619	82.76%
TPA-BiGRU	0.53–0.97%	0.79%	31.76	302	10.15%
TPA-BiLSTM	0.5–1.24%	0.88%	36.65	317	10.65%
BiGRU	0.53–1.33%	0.98%	40.67	402	13.51%
AROCS	0.54–2.67%	1.69%	82.48	433	14.55%
PI	0.86–4.3%	2.57%	273.46	637	21.40%

* Every 15 min is an assessment period.

Table 5. Comparison of energy storage regulation.

Control Strategy	Total Number of Energy Storage Actions	The Average Number of Energy Storage Actions	TD (MWh)	TDA (MWh)
TPA-BiGRU	8110	261.61	4540.77	146.48
TPA-BiLSTM	9028	291.23	4675.45	150.82
BiGRU	11,180	360.65	4920.41	158.72
AROCS	11,286	364.06	4858.65	156.73
PI	16,612	535.87	6108.60	197.05

Table 6. Assessment results of 5 control strategies.

Control Strategy	K1 Qualifying Days
Control Strategy	1440 min	1350–1430 min	<1350 min
Before control	0	10	21
TPA-BiGRU	8	23	0
TPA-BiLSTM	5	26	0
BiGRU	3	28	0
AROCS	3	28	0
PI	0	24	7

Table 7. Comparison of control effects of wind storage systems.

Control Strategy	RMSE	ABS_MAX (MWh)	KD (MWh)	KDB	KS (min) *	KSH	K1%
Before control	8.61%	17.94	178.78	17.71%	90	6.25%	25%
TPA-BiGRU	0.88%	3.37	0	0%	0	100%	100%
TPA-BiLSTM	0.94%	4.53	3.16	0.47%	4	95.83%	95.83%
BiGRU	1.01%	6.64	3.61	0.55%	10	89.58%	89.58%
AROCS	2.63%	16.35	8.78	1.32%	13	86.46%	86.46%
PI	3.05%	8.34	21.66	3.27%	55	42.71%	42.71%

* Every 15 min is an assessment period.

Table 8. Energy storage regulation.

Control Strategy	TD (MWh)	TDB	Number of Actions
TPA-BiGRU	195.81	19.40%	467
TPA-BiLSTM	198.9	19.70%	485
BiGRU	220.67	21.87%	596
AROCS	225.38	22.33%	567
PI	289.63	28.70%	771

Table 9. Economic calculation results of the energy storage system.

Control Strategy	Energy Storage Operating Life (Day)	Number of Energy Storage Replacement	Total Cost (CNY 10,000) *	Total Revenue (CNY 10,000)	Net Profit (CNY 10,000)	Yield Rate
TPA-BiGRU	1369	1	19,621.42	24,623.7	5002.3	25.49%
TPA-BiLSTM	1256	1	19,621.42	24,121.6	4500.18	22.94%
BiGRU	1147	1	19,621.42	23,520.11	3898.69	19.87%
AROCS	1395	1	19,621.42	21,413.6	1792.18	9.13%
PI	746	2	36,021.42	18,787.78	−17,233.64	−48.84%

* All economic indicators in this study are reported in Chinese Yuan (CNY) to align with the market data and policy framework of the investigated wind storage project in China.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, B.; Lu, Y.; Meng, X.; Li, P. Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit. Appl. Sci. 2025, 15, 2654. https://doi.org/10.3390/app15052654

AMA Style

Li B, Lu Y, Meng X, Li P. Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit. Applied Sciences. 2025; 15(5):2654. https://doi.org/10.3390/app15052654

Chicago/Turabian Style

Li, Bin, Yaping Lu, Xuguang Meng, and Peijie Li. 2025. "Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit" Applied Sciences 15, no. 5: 2654. https://doi.org/10.3390/app15052654

APA Style

Li, B., Lu, Y., Meng, X., & Li, P. (2025). Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit. Applied Sciences, 15(5), 2654. https://doi.org/10.3390/app15052654

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Joint Control Strategy of Wind Storage System Based on Temporal Pattern Attention and Bidirectional Gated Recurrent Unit

Abstract

1. Introduction

2. Structure and Control Criteria

3. Designing Control Strategy and Model

3.1. Joint Control Strategy of Wind and Storage Based on TPA-BiGRU

3.2. Joint Control Model of Wind and Storage Based on TPA-BiGRU

3.2.1. Bidirectional Gated Recurrent Unit

3.2.2. Temporal Pattern Attention Mechanism

4. Training Process of the Model

4.1. Selection of Input and Output Variables for the Network

4.2. Data Preprocessing

4.3. Training Process

5. Case Study

5.1. Comparative Control Strategies

5.2. Analysis of Control Effects

5.3. Economic Analysis

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI