Next Article in Journal
Life Cycle Environmental Impact Assessment of Offshore Wind Power Combined with Hydrogen Energy Storage System
Previous Article in Journal
Coordinated Optimal Dispatch of Source–Grid–Load–Storage Based on Dynamic Electricity Price Mechanism
Previous Article in Special Issue
Process Control via Electrical Impedance Tomography for Energy-Aware Industrial Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data-Driven Based Dynamic State Estimation Method for Regional Integrated Energy Systems Incorporating Multi-Dimensional Generation-Grid-Load Characteristics

1
State Grid Shanxi Electric Power Company Electric Power Research Institute, Taiyuan 030001, China
2
Shanghai University of Electric Power, Shanghai 201399, China
*
Author to whom correspondence should be addressed.
Energies 2025, 18(23), 6278; https://doi.org/10.3390/en18236278 (registering DOI)
Submission received: 23 October 2025 / Revised: 18 November 2025 / Accepted: 25 November 2025 / Published: 28 November 2025

Abstract

The regional integrated energy system (RIES) has emerged as a critical focus in energy systems research. The comprehensive incorporation of renewable energy and inherent multi-energy flow interconnection within RIES markedly elevates the complexity of “generation-load” balance regulation. Traditional model-driven dynamic state estimation methods, however, are constrained by fundamental limitations—complex modeling, inadequate representation of multi-energy flow interdependencies, and poor computational efficiency. This study proposes a data-driven dynamic state estimation method for RIES, utilizing multi-dimensional “generation-grid-load” characteristic information as its primary input and employing a synergistic framework of Empirical Mode Decomposition-Singular Value Decomposition (EMD-SVD) alongside an enhanced Bidirectional Long Short-Term Memory (BiLSTM) network. EMD-SVD preprocesses raw data to remove noise and extract essential features, while the enhanced BiLSTM serves a dual purpose: it first attains high-precision photovoltaic output prediction and multi-energy load forecasting and subsequently evaluates the node states of the multi-energy flow coupling system. A case study on a practical coupled RIES, comprising a 33-node power system, 7-node gas system, and 6-node thermal system, demonstrates that the proposed method achieves high estimation accuracy and remarkable computational efficiency while effectively addressing the inherent limitations of conventional model-driven approaches.

1. Introduction

In light of the rapid global energy transition and increasing environmental concerns, it is essential to build renewable energy systems that ensure energy security, economic feasibility, and sustainable development. The RIES serves as a crucial physical conduit for attaining carbon peaking and carbon neutrality objectives by dismantling obstacles between conventional single-energy systems. It facilitates multi-energy complementarity, cascaded energy usage, and integrated management of “generation-grid-load-storage,” thereby greatly enhancing energy utilization efficiency, promoting the absorption of clean energy and efficiently reducing carbon emissions. Consequently, RIES has emerged as a central emphasis of energy policy and technical research on both national and global scales [1,2,3,4,5]. To attain precise perception, real-time monitoring, and effective energy management of RIES, while facilitating its coordinated dispatching and informed decision-making, it is essential to derive optimal estimated values of the system’s real-time operational status via dynamic state estimation technology [6,7].
State estimation for integrated energy systems is categorized into static and dynamic state estimation. Static state estimation methods provide a reliable basis for evaluating the operational status of integrated energy systems in steady-state conditions. Researchers in [8] delineated the integrated electricity-heating network by partitioning the heating network into a hydraulic model and a thermal model. Utilizing a semi-joint static model, they employed the Weighted Least Squares (WLS) approach to perform a joint state estimate for the electricity-heating network. In [9], the WLS approach was utilized for state estimation in the electricity-gas integrated energy system under incomplete measurement settings. In [10], the Weighted Least Absolute Value (WLAV) method was employed for state assessment of electricity-gas networks; experiments with various forms of erroneous data demonstrated that this method exhibits considerable robustness. Researchers in [11] suggested a Bilinear Weighted Least Absolute Value (BWLAV) method derived from WLAV, which effectively addresses the issue of initial value configuration for gas networks and enhances computing efficiency. A three-stage state estimate technique for integrated energy systems comprising distributed electricity, heating, and gas, utilizing the Alternating Direction technique of Multipliers (ADMM), was proposed in [12]. This approach circumvents the non-convex optimization dilemma arising from nonlinear measurements and enhances the model’s convergence. In [13], state estimation methods for combined gas-electricity networks were explored; a coupling mechanism characterization framework and practical analysis tools were established to lay a theoretical foundation for state estimation optimization in coordinated operation. In [14], gas transients were incorporated into the state estimation of integrated energy systems; gas flow partial differential equations were simplified via discretization criteria, dynamic modeling was optimized, and its accuracy in real-time state assessment outperforms traditional methods. All the aforementioned studies concentrate on the static state estimation of integrated energy systems. Nonetheless, the extensive incorporation of new energy into power grids, along with the pronounced disparities in dynamic operational characteristics among subsystems within RIES, has markedly amplified its volatility and randomness. In a complex and dynamic context, a singular static state assessment is unlikely to provide correct situational knowledge of RIES. Consequently, it is essential to implement dynamic state estimation as an adjunct.
At present, research on dynamic state estimation for RIES is in the developmental phase. Researchers in [15] utilized the node method to simulate the dynamic processes of heating systems and subsequently devised a dynamic state estimation technique appropriate for integrated electricity-heating networks. In [16], the dynamic properties of gas transmission were elucidated via the discretization of natural gas pipeline equations, and a multi-time-scale dynamic estimation framework for integrated electricity-gas energy systems was established. Despite the consideration of dynamic properties in gas and heating networks by these two dynamic state estimation methods, their fundamental algorithms remain dependent on the conventional WLS technique, resulting in inadequate accuracy in monitoring and forecasting dynamic processes. In [17], researchers adeptly employed the unified multi-energy flow theory by transforming time-domain partial differential equations into linear algebraic equations via frequency-domain transformation, thereby constructing a dynamic state estimation model for natural gas systems. In [18], to tackle the multi-time-scale challenge of electricity-gas-heating systems, the nonlinear dynamic model was converted into an algebraic format utilizing the finite element approach, and Kalman Filtering was employed for dynamic state estimation. Nonetheless, the problem of elevated computational complexity persists and requires resolution. The aforementioned works have significantly advanced model-driven state estimates for integrated energy systems. Nevertheless, the intricate computations of dynamic processes in gas and heating networks compromise the accuracy and real-time efficacy of model-driven methodologies. Moreover, gas and heating networks typically lack measurement instruments, and the terminal systems are intricate. These characteristics complicate the acquisition of real-time measurement data for thermal and gas loads, posing significant obstacles to the state estimation of RIES.
This study focuses on “generation-grid-load”, extracts multi-dimensional features, uses data-driven technology to tackle the identified challenges, and performs dynamic state estimation for RIES. EMD-SVD functions as a filtering module for processing raw meteorological data, while the BiLSTM network serves as the framework for forecasting “generation-load” status. During the offline training phase, it utilizes highly correlated feature data to predict photovoltaic power output and historical data to predict node loads, ultimately relying on BiLSTM to execute the dynamic state estimation of the multi-energy flow coupling system in RIES.

2. Algorithm Architecture of Dynamic State Estimation Integrating Multi-Dimensional “Generation-Grid-Load” Features

Figure 1 illustrates the conceptual diagram of the algorithm presented in this research, with the blue section denoting the natural gas system, the red section indicating the heating system, and the black section representing the electricity system. This conceptual diagram encompasses the data input and state estimation model of the electricity-heat-gas network: On the “generation” side, the system achieves high-precision photovoltaic power forecasting by integrating historical photovoltaic power generation data with critical time-series meteorological feature data; on the “load” side, it amalgamates time-series meteorological feature data and historical node load data for load forecasting; and based on the established “generation-load” coordinated state benchmark, it facilitates dynamic state estimation of the electricity-heat-gas coupled RIES on the “grid” side.

3. Generation-Load Prediction Based on EMD-SVD-BiLSTM

3.1. EMD Decomposition

EMD is a technique that disaggregates data according to its distinctive time scales to derive various intrinsic mode functions (IMFs). Various IMF components signify unique characteristic fluctuation sequences, and the decomposed IMFs can elucidate the fluctuation properties of the original data across multiple time scales. The precise decomposition procedure is outlined as follows:
(1)
Take an original data sequence x 0 ( t ) . Find all maximum value points in the sequence. Make an upper envelope with cubic spline interpolation. Find all minimum value points. Make a lower envelope with cubic spline interpolation. Let m 0 ( t ) be the mean of the upper and lower envelopes. Then, the first decomposed component h 0 ( t ) is h 1 ( t ) = x 0 ( t ) m 0 ( t ) . The remaining component is r 1 ( t ) = x 0 ( t ) h 1 ( t ) .
(2)
When decomposing the n - th ( n 2 ) component, take h n 1 ( t ) as the original data. Make upper and lower envelopes with cubic spline interpolation between the maximum and minimum value points. Calculate the mean m n 1 ( t ) . Then, the decomposed vector is h n ( t ) = x n - 1 ( t ) - m n - 1 ( t ) . The remaining component r n ( t ) = x 0 ( t ) k = 1 n h k ( t ) .
(3)
Continue Step 2 until the maximum number of decomposition iterations is attained, or the resultant component from decomposition is a monotonic function; subsequently, conclude the iteration process.
(4)
The EMD decomposition presented in this paper is utilized to decompose original data for photovoltaic power and load prediction. The upper and lower envelopes are generated using data from the time period corresponding to the input of the BiLSTM neural network during the decomposition process, followed by the actual decomposition.

3.2. SVD Data Dimensionality Reduction

SVD is employed to decrease data dimensionality. It can eliminate noise in data and retrieve the majority of the feature information. The preliminary data decomposed by EMD, characterized by numerous sequence dimensions and inconsistent data scales, when directly utilized as input for BiLSTM, may result in certain information being unresponsive to neuronal adjustments during error backpropagation and weight updates, thereby diminishing the predictive efficacy of the neural network.
Let SVD decompose the input matrix A into singular values and corresponding subspaces. It is expressed as follows:
A = U S V T
Here, U is the left singular matrix. It is an orthogonal basis for the column vector space of input matrix A ; S is the singular value matrix. It has singular values sorted from largest to smallest. Matrix V is the right singular matrix. It is an orthogonal basis for the row vector space of input matrix A .
To decompose A into U S V T and use SVD to filter original data, follow the following steps:
(1)
Calculate the eigenvalues and eigenvectors of A A T and A T A , Then sort these eigenvalues and eigenvectors from largest to smallest by eigenvalues.
(2)
Form left singular matrix U . It is made of the eigenvector matrix of A A T after sorting from largest to smallest. Its singular value matrix is diagonal. The diagonal elements are square roots of the sorted eigenvalues of A A T
(3)
Form right singular matrix V . It is made of the eigenvector matrix of A T A after sorting from largest to smallest.
(4)
When using SVD to reduce the dimension of multi-dimensional data, if the required dimension is k :
A k = U k S k
A k is the data after dimensionality reduction. U k is the first k column of the left singular matrix U . S k is the first k dimensional data of the singular value matrix S .
The application of the EMD-SVD method to decompose environmental elements in photovoltaic power and load prediction mitigates the issue of data singularity in prediction outcomes resulting from too intricate input vector information. This procedure aligns more closely with the methodology of this study, which employs projected “generation-load” states for the dynamic state estimate of RIES.
Figure 2 illustrates the sequence diagram of solar irradiance data for photovoltaic power forecasting subsequent to EMD-SVD processing and normalization. “OI” denotes the sequence of original data post-normalization, while “PI1–PI6” signify sequences obtained following EMD-SVD decomposition and filtering, utilized as inputs for the BiLSTM network.

3.3. EMD-SVD-BiLSTM Model

Following the execution of EMD-SVD decomposition on the original dataset, the resultant data have extracted essential information and eliminated certain singular data points. This research proposes a system that uses BiLSTM as the primary framework for solar power and load prediction, utilizing input data processed by EMD-SVD.
BiLSTM is derived from RNN [19] and LSTM [20], which are engineered to manage data exhibiting temporal correlations in sequences. Recurrent Neural Networks (RNNs) encounter the issue of gradient vanishing when processing long-term sequences, but Long Short-Term Memory (LSTM) networks mitigate this problem by incorporating a state mechanism that retains long-term dependencies. BiLSTM represents an enhanced version of unidirectional LSTM. BiLSTM integrates a forward LSTM layer with a backward LSTM layer, generating output by utilizing both layers concurrently in opposite directions [21]. BiLSTM can optimize the utilization of RIES historical data and comprehensively account for both forward and backward information, hence enhancing predictive performance and accuracy.
Figure 3 shows a typical structure diagram of BiLSTM. X t 1 , X t , X t + 1 stand for the input data at each moment. L S T M F and L S T M B stand for the hidden layer states of the forward LSTM layer and backward LSTM network layer, respectively. O t 1 , O t , O t + 1 stand for the corresponding output data. ω 1 , ω 2 , , ω 6 stand for the weight transmission between different layers. The update states of the forward LSTM, backward hidden layer, and the output state of BiLSTM are shown in Formulas (3)–(5):
L S T M F , i = f 1 ( ω 1 X i + ω 4 L S T M F , i 1 )
L S T M B , i = f 1 ( ω 2 X i + ω 3 L S T M B , i 1 )
O i = f 1 ( ω 5 L S T M F , i + ω 6 L S T M B , i )
In the formulas, f 1 , f 2 , f 3 are activation functions between different layers, and i is the corresponding moment when the output responds.
The method presented in this research utilizes a temporal window format for the input during the training and application of BiLSTM, as illustrated in Figure 4. The normalized data from the time window preceding and succeeding the prediction instant is utilized as the input vector, with the time window advancing incrementally as time elapses, so using the attributes of BiLSTM.

3.4. Photovoltaic Prediction Based on EMD-SVD-BiLSTM

Deep learning is extensively employed for modeling photovoltaic power prediction because of its capability to effectively represent the relationship between meteorological variables and solar power output, as well as its adaptability to fluctuations. The literature [22] suggests that incorporating more pertinent climatic variables will significantly enhance the accuracy of photovoltaic power projection. In research [23], the Pearson correlation analysis is typically employed to identify the input variables for the solar power prediction model and to extract highly correlated feature data. It uses the symbol ρ to reflect the degree of linear correlation between two variables, where ρ ranges from −1 to 1. If ρ > 0, it indicates that the two variables are positively correlated; if ρ < 0, they are negatively correlated. The closer the absolute value of ρ is to 1, the stronger the correlation between the two variables. Its calculation formula is as follows:
ρ = i = 1 n ( x i X ¯ ) ( Y i Y ¯ ) i = 1 n ( X i X ¯ ) 2 i = 1 n ( Y i Y ¯ ) 2
where X and Y represent the values of the two variables, respectively; X ¯ and Y ¯ represent the sample means of the two variables, respectively; and n denotes the number of samples.
Table 1 shows the Pearson correlation analysis of the input vector for photovoltaic power output prediction. For the photovoltaic power output prediction in this paper, the selected input variables are: local solar irradiance G , air temperature T a , and humidity R H ; the output is the real-time power output of the photovoltaic power station, i.e.,
I n p u t P V = { G , T a , R H } O u t p u t P V , i = { P P V , i }

3.5. Load Forecasting Model Based on EMD-SVD-BiLSTM

Load forecast resembles photovoltaic power prediction due to its significant link with meteorological parameters. The multi-component loads of the RIES are intricately linked to human lifestyle and production activities, hence exhibiting significant temporal association. Load prediction, while dependent on meteorological data, demonstrates distinct periodicity and characteristics, influenced by seasonal fluctuations, temperature variations, and the evolving patterns of industrial operations and daily activities of people. Utilizing the BILSTM mechanism, which links data in a temporal sequence, enhances the effects of specific dates and seasonal variations in periodic load prediction, thereby increasing the accuracy of load forecasting. Table 2 displays the Pearson correlation analysis of the input vector for multi-component load forecasting.
Table 2 illustrates large variations in the response intensity of multi-component loads to identical climatic influencing factors.
For electrical load prediction, by combining feature correlation and demand logic, select air temperature T a , humidity R H , air pressure P C , date, and specific hour of the current moment as inputs.
Thermal load is affected by meteorological factors like temperature and air pressure, and time-dimension features. So, select temperature T a , humidity R H , air pressure P C , date, and hour. Temperature leads thermal demand change; air pressure relates to heat transfer and building energy consumption environment; and time dimension anchors the thermal load’s periodic fluctuation. They form a multi-dimensional feature space together.
Gas load is affected by temperature, humidity, wind speed, and time features affect gas usage patterns. With the Pearson coefficient, select temperature T a , humidity R H , wind speed V w , date, and hour. Temperature relates directly to gas-fired heating demand; humidity and wind speed affect equipment efficiency and user habits; and time dimension ensures accurate depiction of gas usage cycles.
In summary, the input–output relationships for electrical, thermal, and gas load predictions are as follows:
I n p u t l o a d , e l e c = { T a , R H , P C , D a t e , H o u r } O u t p u t l o a d , e l e c = { L o a d e l e c }
I n p u t l o a d , h e a t = { T a , R H , P C , D a t e , H o u r } O u t p u t l o a d , h e a t = { L o a d h e a t }
I n p u t l o a d , g a s = { T a , R H , V w , D a t e , H o u r } O u t p u t l o a d , g a s = { L o a d g a s }

4. Dynamic State Estimation Model of RIES Based on “Generation-Load” Status

This article initially employs the EMD-SVD-BiLSTM methodology to forecast the solar power output on the power grid side within RIES to ascertain the “generation” state. Subsequently, multi-component load forecasting is conducted on the pertinent nodes of the electricity grid, heating network, and gas network to ascertain the “load” state. Dynamic state estimation is thereafter conducted on the internal nodes of RIES, based on the assessment of the “generation-load” status.

4.1. Algorithm Flow Chart

Figure 5 illustrates the flowchart of the dynamic state estimation model of RIES predicated on the “generation-load” condition. The “generation-load” balance condition of RIES at future intervals is ascertained by predictive approaches, followed by the execution of a dynamic state estimate of RIES.

4.2. Dynamic State Estimation Model of Coupled Nodes

This study utilizes data from RIES coupling devices to train the model for coupled nodes in RIES, with the objective of enhancing state estimation performance for these nodes. The coupling devices utilized in this context are gas turbines (GT) and Combined Heat and Power (CHP) units.
A gas turbine is a standard component for electricity-gas connection. It utilizes gas generated from natural gas combustion to propel itself and perform labor, hence producing electricity. The energy conversion relationship of a gas turbine is articulated as follows:
P GT = 1 3600 η GT G GT q ng
In the formula, P GT represents the power output of the gas turbine, measured in MW. η GT is the power generation efficiency of the gas turbine (a dimensionless quantity). G GT denotes the gas consumption of the gas turbine, with the unit of Nm3/h. q ng stands for the calorific value of natural gas, and its unit is MJ/Nm3.
CHP is among the most prevalent coupling units in electricity, heat, and gas networks. In CHP systems utilizing gas turbines or reciprocating internal combustion engines as prime movers, the heat production power ratio is regarded as constant. The electro-thermal ratio is articulated as:
γ CHP = Q CHP / P CHP  
In the formula: γ chp represents the electro-thermal ratio of CHP; Q CHP is the heat production power of CHP; and P CHP denotes the electricity generation power of the CHP unit.
This paper employs a BiLSTM neural network to develop distinct state estimation models for CHP units and GT units individually. The model for CHP units utilizes the unit’s operational parameters and temperature data from heat supply ports as input, aiming to predict heat power as the output target. For GT units, the input consists of the unit’s parameters and system pressure measurement data, with the output aimed at electricity generation power.
The intentional choice of these input and output characteristics is based on a study of how energy changes from one form to another. The selected feature parameters exhibit significant physical correlations with the target variables, aligning with the principle of energy conservation and accurately reflecting the unit’s operational characteristics, thus enhancing the model’s generalization capability and state estimate precision.

4.3. Model Robustness Optimization and Data Preprocessing

This study employs a noise-added data augmentation technique during the training phase to enhance the model’s robustness, acknowledging the unavoidable existence of measured erroneous data in real-world RIES. It randomly selects samples from the original dataset, introduces Gaussian white noise with a standard deviation of 10%, and replicates measurement abnormalities in actual systems. Training the model on noise-contaminated datasets markedly improves its fault tolerance to erroneous data during actual operation.
Due to the randomness and minimal prevalence of erroneous data in integrated energy systems, merely using flawed data for training complicates the acquisition of a model with substantial robustness. Initially, it conducts outlier detection via the quartile approach, then employs a modified Akima cubic interpolation algorithm for the intelligent substitution of erroneous data, and reintegrates the rectified results into the anomaly identification process for following measurements. To resolve the dimensional disparity in electricity-heat-gas multi-generation heterogeneous data, it standardizes the input characteristics by max–min normalization to achieve uniform dimensions.

5. Case Study

This study presents an integrated system comprising a 33-node electricity network, a 7-node natural gas network, and a 6-node thermal network, collectively referred to as the electricity-gas-heat RIES network. The topology is illustrated in Figure 6, and the system parameters are cited in Reference [24].
Within the power network, nodes 18 and 22 are linked to photovoltaic devices; nodes 8 and 31 are associated with gas network nodes 5 and 1 through gas turbines; and node 33 is connected to gas network node 3 and thermal network node 1 via a CHP unit. Furthermore, gas network nodes 1 and 4 are linked to gas sources, while the thermal load demand of the thermal network is entirely met by the CHP unit.
The solar and load statistics were obtained from an integrated energy system in a Belgian park [25] and subsequently adjusted proportionally based on the RIES parameters utilized in this investigation. Both of these datasets have a sampling interval of 30 min; therefore, the time interval selected for the simulation process in this paper is also set to 30 min.
For the data analysis indicators of the simulation test system, this paper uses the Root Mean Square Error (RMSE) and the coefficient of determination R 2 to evaluate the accuracy of prediction and estimation.
R M S E ( k ) = 1 N i = 1 N ( y ^ k , i y k , i ) 2
R 2 ( k ) = 1 ( y k , i y ^ k , i ) 2 ( y k , i y ¯ k ) 2
In the formulas, k   is the node number, N   is the length of the entire prediction sequence, y ^ k , i and y k , i are the predicted value and actual value of node k at time i , respectively, and y ¯ k is the mean of the actual values for node k . RMSE measures the absolute error between predicted values and true values—a smaller RMSE means a smaller absolute error. Generally, R 2 ranges from 0 to 1; the closer it is to 1, the better the prediction performance. When R 2 is less than 0, the fitting of predicted values is even worse than just using the mean as the prediction.

5.1. Prediction Performance Analysis Based on EMD-SVD-Bilstm for “Generation-Load”

Figure 7 and Figure 8 illustrate the outcomes of solar output prediction and multi-energy load prediction, respectively. In these pictures, the solar power station that is connected to Grid Node 22 is the prediction target.
For multi-energy load forecasting, Grid Node 16, Thermal Network Node 6, and Gas Network Node 3 in the RIES are selected as typical nodes to predict the loads of electricity, heat, and gas, respectively. All outcomes are presented in per unit (pu) value.
Table 3 displays the accuracy metrics for solar output prediction and multi-energy load forecasting.
The data presented in the figures and Table 3 demonstrate that the “generation-load” prediction system employed in this study efficiently monitors fluctuations in photovoltaic output and load at RIES nodes. It is important to acknowledge that the predictions’ accuracy diminishes when there are substantial temporal fluctuations in historical and meteorological data compared to a steady condition. Nonetheless, the overall predictive accuracy is commendable, and regarding load forecasting, this approach surpasses the method employing pseudo-measurements with significant noise to emulate multi-energy load measurements in the absence of real-time measurement setup.

5.2. Under “Generation-Load” Prediction: RIES Dynamic State Estimation

5.2.1. Result Analysis of Power Grid State Estimation

Figure 9 and Figure 10 present comparative diagrams of the estimated voltage amplitude and phase angle of the power grid at Node 18, derived from calculations based on “generation-load” forecasting. Table 4 displays the RMSE and A values for the estimated power grid voltage vector.
Figure 11 shows the statistical R 2 diagram for voltage magnitude and voltage angle estimation of all nodes. The power grid is divided into four segments: Nodes 1–18, Nodes 19–22, Nodes 23–25, and Nodes 26–33. The R 2 estimation accuracy of the voltage vector for all nodes in the power grid is over 0.9. The segment of Nodes 19–22 is close to the main generator nodes and has photovoltaic devices. So, it is more stable than other nodes. This makes the estimated magnitude of the voltage vector closer to the actual value. In all, the method in this paper works well in the dynamic state estimation of the power grid.

5.2.2. Result Analysis of Thermal and Gas Network State Estimation

This paper aims to validate the efficacy of the proposed method for state estimation in thermal and gas networks by selecting Node 6 of the RIES thermal network and Node 3 of the gas network to conduct dynamic state estimation on the heating temperature of thermal network nodes and the pressure of gas network nodes, respectively, as illustrated in Figure 12 and Figure 13. Table 5 and Table 6 displays the RMSE and R 2 values for the predicted temperature of the thermal network nodes and the pressure of the gas network nodes, as well as the RMSE and R 2 values of the coupled nodes.
The dynamic state estimation results for the heating temperature at Thermal Network Node 6 and the pressure at Gas Network Node 3 indicate that the estimated curves closely align with the actual curves, albeit with slightly lower accuracy compared to the power grid state estimation. This is attributed to the influence of pipeline delays on data collecting in gas and heat networks. Nevertheless, in conjunction with the comparatively low RMSE values presented in the table and the coefficient of determination R 2 approaching 0.87, it can be inferred that the methodology proposed in this paper effectively characterizes the dynamic operating states of thermal and gas networks while demonstrating commendable state estimation performance.

6. Conclusions

This paper presents a data-driven dynamic state estimate approach for RIES that incorporates multi-dimensional attributes of “generation-grid-load.” This method, addressing the practical operational conditions of RIES—marked by multi-energy coupling and extensive integration of renewable energy, resulting in a complex “generation-load” balance—successfully predicts photovoltaic output and multi-energy loads through EMD-SVD and a BiLSTM network, thereby establishing a dependable “generation-load” boundary for state estimation. Conversely, utilizing the anticipated “generation-load” states, the BILSTM network dynamically estimates the state variables of each node within the RIES multi-energy coupling system, circumventing the complexities of modeling and the challenges in accurately representing the multi-energy flow coupling characteristics inherent in traditional model-driven approaches. The case analysis results indicate that utilizing the RIES alongside a 33-node power system, a 7-node natural gas system, and a 6-node thermal system as the research subjects, the proposed method demonstrates high accuracy in estimating critical state variables, including thermal network node temperature and gas network node pressure, thereby validating the method’s efficacy and computational efficiency advantages.
The efficacy of the suggested strategy depends on high-quality historical data and meteorological characteristic data. The experiment revealed that substantial temporal oscillations between these two data types result in a deterioration in the prediction accuracy of photovoltaic output and multi-energy loads. Consequently, due to pipeline transmission delays in gas and heat networks, the state estimate accuracy of their nodes is somewhat inferior to that of the power grid, unable to completely mitigate the measurement discrepancies induced by transmission delays. The experimental data are based on proportional changes related to a specific park-level RIES, and the model’s responsiveness to RIES under varying climatic circumstances and energy architectures necessitates further validation. This paper fails to adequately address abrupt variations in renewable energy generation and multi-energy demand under harsh conditions, and the model’s applicability to renewable energy integrated systems of different scales necessitates additional validation. In the future, two primary aspects of work will be prioritized: first, adopting an adaptive strategy to optimize model parameters online and improve resilience under extreme conditions; and second, expanding the technology’s application in cross-regional interconnected RIES and validating its effectiveness in large-scale contexts.

Author Contributions

Conceptualization, S.L.; Methodology, X.C.; Investigation, L.J.; Writing—review and editing, J.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by State Grid Shanxi Electric Power Company Science and Technology Project Research (52053024000V).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Shengwen Li, Xiao Chang were employed by the company State Grid Shanxi Electric Power Company Electric Power Research Institute. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
RIESRegional Integrated Energy System
EMD–SVDEmpirical Mode Decomposition-Singular Value Decomposition
BiLSTMBidirectional Long Short-Term Memory
RNNRecurrent Neural Network
LSTMLong Short-Term Memory
WLSWeighted Least Square
WLAVWeighted Least Absolute Value
BWLAVBilinear Weighted Least Absolute Value
ADMMAlternating Direction Method of Multipliers
GTGas Turbines
CHPCombined Heat and Power
RMSERoot Mean Square Error
MAPEMean Absolute Percentage Error

References

  1. Yuan, Y.; Miao, A.K.; Wu, H.; Zhu, J.P.; Wang, Z.M.; Qian, K. Review of the research framework and key issues for low-carbon integrated energy system. High Volt. Eng. 2024, 50, 4019–4036. [Google Scholar] [CrossRef]
  2. Kang, C.Q.; Du, E.S.; Li, Y.W.; Zhang, N.; Chen, Q.X.; Guo, H.Y.; Wang, P. Key scientific problems and research framework for carbon perspective research of new power systems. Power Syst. Technol. 2022, 46, 821–833. [Google Scholar] [CrossRef]
  3. Wang, Y.Z.; Kang, L.G.; Zhang, J.; Zhao, W.; Zhu, Y.L.; Zhao, J. Development history, typical form and future trend of integrated energy system. Acta Energiae Solaris Sin. 2021, 42, 84–95. [Google Scholar] [CrossRef]
  4. Fan, J.B.; Meng, Z.J.; Shan, M.W.; Lü, L.J. A review of key technologies and international standardization of multi energy coupling systems. Power Syst. Clean Energy 2023, 39, 1–9. [Google Scholar]
  5. Zhang, S.X.; Wang, D.Y.; Cheng, H.Z.; Song, Y.; Yuan, K.; Du, W. Key technologies and challenges of low-carbon integrated energy system planning for carbon emission peak and carbon neutrality. Autom. Electr. Power Syst. 2022, 46, 1–10. [Google Scholar]
  6. Chen, Y.B.; Yao, Y.; Lin, Y.Z.; Yang, X.N. Dynamic state estimation for integrated electricity-gas systems based on Kalman filter. CSEE J. Power Energy Syst. 2022, 8, 293–303. [Google Scholar] [CrossRef]
  7. Sheng, T.T.; Yin, G.X.; Guo, Q.L.; Sun, H.B.; Pan, Z.G. A hybrid state estimation approach for integrated heat and electricity networks considering time-scale characteristics. J. Mod. Power Syst. Clean Energy 2020, 8, 636–645. [Google Scholar] [CrossRef]
  8. Dong, J.N.; Sun, H.B.; Guo, Q.L.; Sheng, T.T.; Qiao, Z. State estimation of combined electric-gas networks for energy internet. Power Syst. Technol. 2018, 42, 400–409. [Google Scholar] [CrossRef]
  9. Dong, J.N.; Sun, H.B.; Guo, Q.L.; Pan, Z.G. State estimation for combined electricity and heat networks. Power Syst. Technol. 2016, 40, 1635–1641. [Google Scholar] [CrossRef]
  10. Chen, Y.B.; Zheng, S.L.; Yang, N.; Yang, X.N.; Liu, K.C. Robust state estimation of electric-gas integrated energy system based on weighted least absolute value. Autom. Electr. Power Syst. 2019, 43, 61–70. [Google Scholar] [CrossRef]
  11. Zheng, S.L.; Liu, J.; Chen, Y.B.; Qi, B. Bilinear robust state estimation based on weighted least absolute value for integrated electricity-gas system. Power Syst. Technol. 2019, 43, 3733–3742. [Google Scholar] [CrossRef]
  12. Du, Y.X.; Zhang, W.; Zhang, T.T. ADMM-based distributed state estimation for integrated energy system. CSEE J. Power Energy Syst. 2019, 5, 275–283. [Google Scholar] [CrossRef]
  13. Zhang, Y.B. Study on the Methods for Analyzing Combined Gas and Electricity Networks. Ph.D. Thesis, China Electric Power Research Institute, Beijing, China, 2005. [Google Scholar]
  14. Liu, W.J.; Li, P.Y.; Yang, W.T.; Chung, C.Y. Optimal energy flow for integrated energy systems considering gas transients. IEEE Trans. Power Syst. 2019, 34, 5076–5079. [Google Scholar] [CrossRef]
  15. Ci, W.B.; Gu, H.F.; Zhu, J.S. Multi-timescale state estimation for integrated electricity and heat system. Therm. Power Gener. 2021, 50, 94–100. [Google Scholar] [CrossRef]
  16. Dong, L.; Wang, C.F.; Li, Y.; Wang, X.Y.; Dong, X.C.; Sun, Y.Y.; Pu, T.J. Multi-snapshot state estimation of electric-gas integrated energy system. Power Syst. Technol. 2020, 44, 3458–3465. [Google Scholar] [CrossRef]
  17. Yin, G.X.; Chen, B.B.; Sun, H.B.; Guo, Q.L. Energy circuit theory of integrated energy system analysis (IV): Dynamic state estimation of the natural gas network. Proc. CSEE 2020, 40, 5827–5835. [Google Scholar] [CrossRef]
  18. Liu, X.R.; Li, Y.; Sun, Q.Y.; Pan, Y.L. Interaction and joint state estimation of electric-gas-thermal coupling network. Power Syst. Technol. 2021, 45, 479–488. [Google Scholar] [CrossRef]
  19. Mike, S.; Paliwal, K.K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 1997, 45, 2673–2681. [Google Scholar] [CrossRef]
  20. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  21. Wu, Y.-K.; Wu, Y.-C.; Hong, J.-S.; Phan, L.H.; Quoc, D.P. Probabilistic forecast of wind power generation with data processing and numerical weather predictions. IEEE Trans. Ind. Appl. 2021, 57, 36–45. [Google Scholar] [CrossRef]
  22. Keddouda, A.; Ihaddadene, R.; Boukhari, A.; Atia, A.; Arıcı, M.; Lebbihiat, N.; Ihaddadene, N. Solar photovoltaic power prediction using artificial neural network and multiple regression considering ambient and operating conditions. Energy Convers. Manag. 2023, 288, 117186. [Google Scholar] [CrossRef]
  23. Chen, H.; Chang, X. Photovoltaic power prediction of LSTM model based on Pearson feature selection. Energy Rep. 2021, 7, 1047–1054. [Google Scholar] [CrossRef]
  24. Liu, X.Z.; Wu, J.Z.; Jenkins, N.S.; Bagdanavicius, A. Combined analysis of electricity and heat networks. Energy Procedia 2014, 61, 155–159. [Google Scholar] [CrossRef]
  25. Wang, D.; Liu, L.; Jia, H.J.; Wang, W.; Zhi, Y.; Meng, Z. Review of key problems related to integrated energy distribution systems. CSEE J. Power Energy Syst. 2018, 4, 130–145. [Google Scholar] [CrossRef]
Figure 1. Algorithm concept diagram.
Figure 1. Algorithm concept diagram.
Energies 18 06278 g001
Figure 2. EMD-SVD decomposition sequence.
Figure 2. EMD-SVD decomposition sequence.
Energies 18 06278 g002
Figure 3. Framework diagram of BiLSTM.
Figure 3. Framework diagram of BiLSTM.
Energies 18 06278 g003
Figure 4. Schematic diagram of the BiLSTM time window.
Figure 4. Schematic diagram of the BiLSTM time window.
Energies 18 06278 g004
Figure 5. Algorithm flow chart.
Figure 5. Algorithm flow chart.
Energies 18 06278 g005
Figure 6. The electric-gas-heat coupling network.
Figure 6. The electric-gas-heat coupling network.
Energies 18 06278 g006
Figure 7. Photovoltaic power output prediction.
Figure 7. Photovoltaic power output prediction.
Energies 18 06278 g007
Figure 8. Multi-load forecasting.
Figure 8. Multi-load forecasting.
Energies 18 06278 g008
Figure 9. Estimated voltage amplitude of node 18.
Figure 9. Estimated voltage amplitude of node 18.
Energies 18 06278 g009
Figure 10. Estimated voltage angle of node 18.
Figure 10. Estimated voltage angle of node 18.
Energies 18 06278 g010
Figure 11. R 2 of estimated voltage vector.
Figure 11. R 2 of estimated voltage vector.
Energies 18 06278 g011
Figure 12. R 2 Heating temperature diagram of Node 6 in thermal network.
Figure 12. R 2 Heating temperature diagram of Node 6 in thermal network.
Energies 18 06278 g012
Figure 13. Pressure diagram of Node 3 in gas network.
Figure 13. Pressure diagram of Node 3 in gas network.
Energies 18 06278 g013
Table 1. Pearson coefficient analysis table of photovoltaic forecast.
Table 1. Pearson coefficient analysis table of photovoltaic forecast.
FeatureCorrelation Coefficient
Temperature0.763
Solar irradiance0.979
Humidity0.338
Wind speed0.187
Table 2. Pearson coefficient analysis table for load forecasting.
Table 2. Pearson coefficient analysis table for load forecasting.
FeatureElectrical LoadThermal LoadGas Load
Hour0.72430.68520.5937
Date0.02630.03150.0289
Month−0.0053−0.0121−0.0086
Temperature0.44020.79650.6538
Humidity−0.2874−0.43280.4876
Wind speed0.16740.0247−0.0289
Air pressure−0.5387−0.58460.0658
Table 3. Prediction accuracy of PV output and multi-energy loads.
Table 3. Prediction accuracy of PV output and multi-energy loads.
Prediction TypeRMSER2MAPE
Photovoltaic output0.77090.95290.0425
Node electrical load0.36420.94060.0538
Node thermal load0.38250.94630.0754
Node gas load0.54170.95870.0508
Table 4. Table of RMSE and R 2 of voltage vector.
Table 4. Table of RMSE and R 2 of voltage vector.
NodeRMSE R 2 MAPE
VmVaVmVaVmVa
Node 100.00270.10620.93150.93130.01850.0191
Node 180.00040.01520.94610.93100.01260.0198
Average0.00220.07350.93220.93240.01740.0174
Table 5. RMSE and R 2 of estimated thermal network node temperatures and gas network node pressures.
Table 5. RMSE and R 2 of estimated thermal network node temperatures and gas network node pressures.
NodesRMSE R 2 MAPE
Thermal Network Node 60.06240.88750.0624
Gas Network Node 30.02470.87630.0647
Average0.04360.88640.0613
Table 6. RMSE and R 2 for coupled node estimation.
Table 6. RMSE and R 2 for coupled node estimation.
NodeRMSE R 2 MAPE
Node 80.01370.09260.0461
Node 310.01760.88610.0611
Node 330.02530.88780.0607
Average0.01850.88830.0594
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, S.; Chang, X.; Ji, L.; Mao, J. Data-Driven Based Dynamic State Estimation Method for Regional Integrated Energy Systems Incorporating Multi-Dimensional Generation-Grid-Load Characteristics. Energies 2025, 18, 6278. https://doi.org/10.3390/en18236278

AMA Style

Li S, Chang X, Ji L, Mao J. Data-Driven Based Dynamic State Estimation Method for Regional Integrated Energy Systems Incorporating Multi-Dimensional Generation-Grid-Load Characteristics. Energies. 2025; 18(23):6278. https://doi.org/10.3390/en18236278

Chicago/Turabian Style

Li, Shengwen, Xiao Chang, Liang Ji, and Junchen Mao. 2025. "Data-Driven Based Dynamic State Estimation Method for Regional Integrated Energy Systems Incorporating Multi-Dimensional Generation-Grid-Load Characteristics" Energies 18, no. 23: 6278. https://doi.org/10.3390/en18236278

APA Style

Li, S., Chang, X., Ji, L., & Mao, J. (2025). Data-Driven Based Dynamic State Estimation Method for Regional Integrated Energy Systems Incorporating Multi-Dimensional Generation-Grid-Load Characteristics. Energies, 18(23), 6278. https://doi.org/10.3390/en18236278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop