Next Article in Journal
Classification of Flow Pathways for Waterflooding Operations in a Hydrocarbon Reservoir in Terms of Displacement Constraints and Incremental Oil Recovery
Next Article in Special Issue
Dynamic Analysis of Virtual Synchronous Generator Control-Based PMSG Considering Low-Voltage Ride-Through Control
Previous Article in Journal
Research on Direct Air Capture: A Review
Previous Article in Special Issue
Lactic Fermentation Spectral Analysis of Target Substrates and Food and Feed Wastes for Energy Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data-Driven Probabilistic Power Flow for Energy-Storage Planning Considering Interconnected Grids

1
National Key Laboratory of Advanced Power Transmission Technology, China Electric Power Research Institute, Beijing 102209, China
2
Economic & Technological Research Institute, State Grid Shandong Electric Power Company, Jinan 255000, China
*
Author to whom correspondence should be addressed.
Energies 2025, 18(24), 6633; https://doi.org/10.3390/en18246633
Submission received: 9 November 2025 / Revised: 16 December 2025 / Accepted: 16 December 2025 / Published: 18 December 2025
(This article belongs to the Special Issue Advances in Power System and Renewable Energy)

Abstract

As renewable energy penetration increases, the volatility and uncertainty of photovoltaic generation and load demand pose significant challenges to power-system stability. This paper proposes a data-driven probabilistic load-flow method that employs a Gaussian mixture model (GMM) to model uncertainties in photovoltaic generation and load demand. Cumulative quantity analysis is then applied to conduct probabilistic load-flow studies, quantifying the impact of these uncertainties on the power system. Building upon this foundation, a two-layer optimization model is constructed to optimize the siting, capacity, and operational strategies of energy storage systems. Experimental results demonstrate that this method effectively reduces the probability of voltage-limit violations, ensures the reliability of supply–demand balance, and enhances system stability and reliability even under fluctuating PV generation and load-demand conditions.

1. Introduction

With profound transformations in the global energy landscape, the adoption of clean renewable energy—particularly wind and solar photovoltaic (PV) power—has become increasingly widespread [1]. As the world’s largest energy producer and consumer, China is actively promoting renewable energy development to align with its “dual-carbon” goals [2]. The growing integration of renewable energy into power systems has led to significant changes in traditional power architectures [3]. However, unlike conventional thermal power generation, wind and solar outputs exhibit significant intermittency, randomness, volatility, and high uncertainty. Their generation is strongly influenced by meteorological conditions, making accurate forecasting challenging [4]. PV output may plummet sharply during cloudy weather, nighttime, or seasonal variations [5]. This uncertainty poses significant challenges to power system stability and reliability and increases the risk of supply–demand imbalances [6].
Meanwhile, complexity on the demand side is intensifying. With the integration of emerging loads such as electric vehicles, the demand side of power systems also exhibits complex stochastic fluctuations [7]. This dual uncertainty from both generation and demand makes maintaining supply–demand balance during real-time operation highly challenging [8,9,10]. Traditional approaches to power-system planning and operation largely rely on deterministic models and simplified assumptions, often overlooking complex dependence structures between PV output and load demand while failing to effectively quantify inherent uncertainties [11,12]. To more accurately assess these risks, academia has proposed various probabilistic modeling approaches. Classical Monte Carlo simulation (MCS) is widely regarded as a benchmark for probabilistic power flow and risk assessment due to its minimal distributional assumptions and its ability to handle strong nonlinearities [13]. However, achieving stable estimates of tail-related metrics typically requires a large number of samples, resulting in a significant computational burden—especially for large-scale systems and long-horizon planning tasks, where direct coupling with iterative optimization solvers becomes challenging [13]. Point estimate methods (PEMs) reduce the number of power-flow evaluations by matching low-order moments using a small set of representative points, thereby improving computational efficiency [14,15]. Nevertheless, when input uncertainties exhibit heavy tails, multimodality, or strong skewness, low-order moment approximations can lead to amplified errors, and quantile approximations such as the Cornish–Fisher expansion may suffer from non-negligible truncation errors when distributions deviate substantially from normality [16].
To overcome these limitations, data-driven scenario generation and machine learning (ML) techniques have become research hotspots. Kernel density estimation (KDE) can nonparametrically approximate complex distributions for scenario construction; however, its performance heavily depends on bandwidth selection, and it is susceptible to the curse of dimensionality with high-dimensional inputs or complex correlations, leading to unstable generalization [17]. Generative adversarial networks (GANs) can fit data distributions through adversarial learning and have been utilized to generate realistic renewable energy scenarios [18]. Improved variants such as Wasserstein GAN (WGAN) can partially mitigate training instability and improve gradient behavior [19]. Even so, GAN-based methods may still suffer from mode collapse and convergence sensitivity, potentially resulting in insufficient coverage of extreme tail risks or neglect of critical patterns in generated samples [20]. Furthermore, deep reinforcement learning (DRL) and end-to-end learned policies demonstrate strong adaptability in online control and scheduling applications [21,22,23]. However, these approaches often lack physical interpretability, and their performance may degrade under out-of-distribution (OOD) conditions caused by sudden disturbances or distribution shifts, which can translate into difficult-to-quantify operational risks [24]. Consequently, significant fluctuations in PV output or abrupt demand changes may still trigger security issues such as voltage violations [25]. Fundamentally, these problems arise because the system fails to maintain a reliable probabilistic balance between supply and demand under uncertainty, i.e., active and reactive power generation cannot match consumption at a specified probability level, thereby compromising grid security and stability [26,27,28]. A comparison of the methods used in this paper with those in other existing studies in this field is shown in Table 1.
Against this backdrop, energy storage systems have garnered increasing attention as an effective solution to supply–demand imbalances [29]. These systems provide bidirectional power regulation by storing surplus electricity during periods of high PV generation and releasing energy when PV output is insufficient or load demand increases. Such functionality mitigates real-time power shortages caused by uncertainty and helps maintain system equilibrium [30,31]. However, the capital and operational costs of ESS remain high [32,33]. In practical applications, how to rationally plan the capacity, location, and operational strategies of ESSs to ensure supply–demand balance has become a critical issue of guaranteeing the long-term stability and economic operation of power systems [34,35,36,37,38,39,40].
It is important to emphasize that lithium-ion batteries inevitably experience capacity degradation and increased internal resistance during long-term operations. Their degradation rates are typically closely related to operational stresses such as the state-of-charge (SOC) range, charge/discharge rates, depth of cycle, and temperature. If degradation effects are overlooked during energy storage planning, it often leads to underestimating the depreciation/replacement costs over the entire lifecycle, causing capacity allocation and operational strategies to deviate from true optimization on a long-term scale. In recent years, research has begun embedding empirical or semi-empirical degradation models into energy storage planning and scheduling optimization as degradation costs. For instance, the authors in [41] systematically reviewed mainstream approaches for degradation modeling, stress factor characterization, and lifetime cost accounting in BESS operational optimization.Reference [42] further explicitly coupled battery aging models into optimal energy management and capacity design for isolated microgrids to quantify trade-offs between degradation and economics. Research indicates that while high-precision energy storage degradation models offer more accurate lifespan predictions, their nonlinear characteristics significantly increase computational complexity in optimization problems. Given this, while this study does not incorporate complex energy storage degradation models within its current scope, we fully recognize the importance of battery state-of-health (SOH) evolution. Future work will explore integrating more refined battery decay models while ensuring computational efficiency.
In summary, the main contributions of this paper are as follows:
(1)
This paper proposes a probabilistic load flow calculation method designed for interconnected power systems. Its principal novelty lies in: the development of a joint Gaussian Mixture Model (GMM) to concurrently model the correlated uncertainties of photovoltaic output and load demand—a step beyond typical single-source or radial-network GMM applications; and the integration of this joint model with the cumulant method for probabilistic propagation, which effectively circumvents the non-physical negative value problem inherent in traditional series expansion techniques. This integrated approach enhances both the modeling fidelity for complex uncertainties and the numerical robustness of the calculation.
(2)
We established a two-layer energy-storage planning framework that explicitly incorporates PV and load uncertainties. The model optimizes both the location and capacity of energy storage systems to mitigate voltage violation risks induced by intermittent renewable generation and variable demand.
(3)
The effectiveness of the proposed planning scheme is rigorously validated through extreme scenario analysis and robustness testing, with comprehensive simulations conducted on the IEEE 33-node partitioned grid standard system.

2. Uncertainty Modeling for Load and Photovoltaic Generation Based on Gaussian Mixture Model

In power systems, load demand and photovoltaic output exhibit significant uncertainty and constitute critical factors affecting the probabilistic balance of supply and demand. Particularly with a high penetration of renewable energy, the supply–demand balance of power systems faces considerable challenges. Traditional forecasting methods of load demand and photovoltaic output are often based on a single probability distribution model or simplified statistical assumptions, making it difficult to accurately capture multimodal, asymmetric, and other complex characteristics. Therefore, accurately characterizing the probability distributions of photovoltaic generation and load demand has become a critical issue.
To address this issue, this study employs the GMM to conduct uncertainty modeling of photovoltaic output and load demand. The GMM is a model that fits complex probability distributions through a weighted sum of multiple Gaussian distributions. It is capable of effectively handling complex randomness and nonlinear distributions, making it suitable for capturing the multimodal characteristics of photovoltaic generation and load demand. The pseudocode of the GMM method is shown in Algorithm 1.
Algorithm 1: PV And Load Uncertainty Modeling Based On GMM
Input: Historical data of PV output or load demand(dimension d): X = x 1 , x 2 , , x n
   Maximum number of Gaussian components: Kmax
Output: Optimal number of Gaussian components: Kopt
    GMM parameters: Weight vector π , mean vector μ , covariance matrix Σ
    Probability density function of PV output or load demand: p ( x )
1: Procedure Use BIC to select the best number of Gaussian components Kopt
2:   Initialize BIC_best = ∞, Kopt = 1
3:   for k = 1 to Kmax do
4:      π k , μ k , Σ k , log ( p ( X ) ) = EMAlgorithm ( X , k )
5:     Calculate BIC = 2 log p ( X ) + p log N
6:     if BIC < BIC_best then
7:       BIC_best = BIC, Kopt = k
8:     end if
9:   end for
10: end Procedure
11: Procedure Using the BIC_best to retrain GMM
12:    π , μ , Σ = EMAlgorithm ( X , k opt )
13:   return Kopt, π , μ , Σ
14: end Procedure
15: function EMAlgorithm (X, K)
16:   if iteration < max_iteration then
17:     for n = 1 to N do
18:       for k = 1 to K do
19:         Calculating Gaussian PDF p r ( k , x n )
20:       end for
21:       Calculate p r ( k , x n ) = p r ( k , x n ) / k = 1 K p r ( k , x n )
22:     end for
23:     Recalculation log ( p ( X ) )
24:     if  Δ log ( p ( X ) ) < ε  then
25:       break
26:     end if
27:     Update π k , μ k , Σ k
28:   end if
29:   return  π k , μ k , Σ k , log ( p ( X ) )
30: end function

2.1. Mathematical Formulation of GMM

The probability density function of a random variable x under the GMM is expressed as a weighted sum of K Gaussian distributions:
p ( x ) = k = 1 K π k N x | μ k , Σ k
where p(x) denotes the probability density function of load demand or photovoltaic output, x represents the corresponding load demand or photovoltaic output, K is the optimal number of Gaussian components, which can be determined using the Bayesian Information Criterion (BIC) method. The value of K corresponding to the smallest BIC value is considered the optimal number of Gaussian components. The BIC formula is given by BIC = 2 log p ( X ) + p log N , where log p ( X ) is the log-likelihood of the dataset X, and p = k ( d + d ( d + 1 ) / 2 + 1 ) 1 is the total number of free parameters in the GMM, d is the data dimensionality, N is the total number of samples in the dataset X . π k is the weight of the Gaussian component, satisfying k = 1 K π k = 1 . N x | μ k , Σ k denotes a Gaussian distribution with mean μ k and covariance Σ k . The probability density function is given by:
N ( x | μ k , Σ k ) = e 1 2 ( x μ k ) TR Σ k 1 ( x μ k ) ( 2 π k ) d / 2   |   Σ k   | 1 / 2
where μ k is the mean of component k, and Σ k is the corresponding covariance matrix.

2.2. Parameter Estimation of GMM Using Expectation–Maximization (EM) Algorithm

In practical applications, the parameters of GMM are unknown and must be estimated from historical data. A commonly used estimation method is the Expectation–Maximization (EM) algorithm, which iteratively updates the parameters to gradually approximate the maximum likelihood estimate. Through iterative updates of parameters, convergence is achieved when the changes in the model parameters between consecutive iterations become negligible. The goal of the EM algorithm is to maximize the maximum likelihood function:
log p ( X ) = n = 1 N log k = 1 K π k N x n   |   μ k , Σ k
The basic steps of the EM algorithm are as follows:
(1)
Initialization: The K-means clustering algorithm is first applied to the data for preliminary clustering, in order to obtain the initial mean, covariance matrix, and weight for each component k.
(2)
E-step (Expectation): Based on the current mean and covariance, the posterior probability of component k for data point is computed as:
P r k , x n = π k Last N x n   |   μ k Last , Σ k Last k = 1 K π k Last N x n   |   μ k Last , Σ k Last
where P r k , x n denotes the posterior probability that sample xn belongs to component k, μ k Last is the mean of component k from the previous iteration, and Σ k Last is the covariance of component k from the previous iteration, and π k Last is the weight of component k from the previous iteration.
(3)
M-step (Maximization): Using the posterior probabilities obtained in the E-step, the parameters, μ k , π k and Σ k for component k are recalculated, thereby updating the maximum likelihood function.
(4)
Iteration: The E-step and M-step are repeated until the maximum likelihood function converges, that is, until the parameter values show no significant change. Once convergence is achieved, the current μ k , π k and Σ k for each component k are the optimal output.

3. Probabilistic Load Flow Calculation Based on the Cumulant Method

The power flow equations of an electrical power system constitute a set of nonlinear equations characterized by complex mathematical relationships between input and output variables. When large amounts of random renewable energy sources and loads are integrated into the system, traditional deterministic power flow calculations can only provide the system state under specific input conditions, failing to capture the system state and probability distribution affected by uncertainty. Probabilistic Load Flow (PLF) calculation, however, is a method for handling load flow under uncertainty. It converts system outputs such as node voltages and bus-to-bus currents into probability distributions.

3.1. Principle of the Cumulant Method

After processing the uncertainties in photovoltaic output and load demand using GMM, their probability density functions can be obtained. This paper employs the cumulative quantity method for probabilistic power flow calculations and utilizes the C-type Gram–Charlier series expansion to compute the probability density functions of node voltages and branch currents.
Based on the mean, variance, and weights of photovoltaic output and load demand obtained from the GMM, calculate the vth-order cumulative photovoltaic output and v-order cumulative load demand. κ ( 1 ) is the first-order cumulant, representing the mean; κ ( 2 ) is the second-order cumulant, representing the variance; κ ( 3 ) is the third-order cumulant, representing the skewness; κ ( 4 ) is the fourth-order cumulant, representing the kurtosis. These cumulants characterize the distributional features of the random variables and provide the basis for subsequent PLF calculations.

3.2. Cumulant Calculation

In PLF analysis, the power balance equations of each interconnected power system region, including nodal voltage magnitudes and branch power flows, can be expressed as:
P i = U i j = 1 J U j ( G i j cos θ i j + B i j sin θ i j ) Q i = U i j = 1 J U j ( G i j sin θ i j B i j cos θ i j )
P i j = U i U j ( G i j cos θ i j + B i j sin θ i j ) G i j U i 2
where J denotes the total number of partitioned sub-grids, Pi and Qi denote the active and reactive powers of node i in the interconnected system, Ui is the voltage magnitude at node i, and θ i j is the voltage phase angle difference between nodes i and j. Gij and Bij represent the real and imaginary parts of the admittance between nodes i and j, respectively. Pij represents the active power on the line between node i and node j.
To incorporate probabilistic load flow into power flow analysis, Equation (5) is linearized, and disturbance terms are introduced to describe variations in voltages and branch power flows. The nonlinear power flow equations in Equation (6) are rewritten in matrix form and expanded into a Taylor series at the reference operating point using the Newton–Raphson method, while higher-order terms are neglected. Consequently, a linearized power flow model is obtained:
L = L 0 + J 0 Δ U K = K 0 + G 0 Δ U
where L denotes the vector of active and reactive power injections of the interconnected system, K is the vector of active and reactive power transmitted through branches, U is the vector of voltage magnitudes and phase angles of the interconnected nodes, and L0 and K0 are the corresponding deterministic values. ΔU represents the perturbation vector of the voltage state, while J0 and G0 are the Jacobian matrices used in the final calculation of the Newton–Raphson method.
By rearranging and simplifying Equation (7), we can obtain the nth-order cumulative quantities of voltage disturbances at each node and branch-flow disturbances in the partitioned power grid:
κ Δ U ( v ) = S 0 ( v ) κ Δ L ( v ) κ Δ K ( v ) = T 0 ( v ) κ Δ L ( v )
where κ Δ L ( v ) = κ PV ( v ) κ Load ( v ) represents the cumulative power injection, calculated from the cumulative photovoltaic output and load demand, S 0 = J 0 1 , T 0 = G 0 J 0 1 . S 0 ( v ) denotes the vth-order sensitivity matrix of the cumulative node voltage disturbance κ Δ U ( v ) to the cumulative power injection κ Δ L ( v ) , which is S 0 raised to the power of v; T 0 ( v ) is the vth-order sensitivity matrix of the cumulative branch power κ Δ K ( v ) to the cumulative power injection κ Δ L ( v ) , and is the vth power of T 0 .
Since the reference operating point U 0 of the node voltage in the partitioned power grid is constant, the cumulative quantities of the node voltage state variable U and the node voltage disturbance Δ U at each order are related as follows:
κ U ( 1 ) = U 0 + κ Δ U ( 1 ) ,   κ U ( v ) = κ Δ U ( v ) κ U ( v ) = κ Δ U ( v ) ( v 2 )
The node voltage state variables and disturbances share identical central moments, differing only in their means by U 0 . Therefore, solving for the cumulative quantity of Δ U yields the probability distribution characteristics of the node voltage state variable U . Similarly, for branch power flow, the reference operating point K 0 of the branch power flow is also constant. The cumulative quantities of the branch power flow state variable K and the branch power flow disturbance Δ K also satisfy the relationship given by Equation (9). Through the above transformation, the cumulative quantity κ U ( v ) of the node voltage in the partitioned grid and the cumulative quantity κ K ( v ) of the branch power flow can be obtained.

3.3. Cumulant Propagation and Probabilistic Load Flow Calculation

By calculating the cumulants of nodal voltages and branch power flows in the interconnected power system, the output probability distributions under uncertainty can be derived. To obtain the probability density functions of nodal voltages and branch power flows, the C-type Gram–Charlier series expansion is employed. This method avoids the shortcomings of the traditional A-type Gram–Charlier expansion, where the probability density function may become negative in some regions, and ensures that the probability density function remains non-negative and normalized. The mathematical formulation of the C-type Gram–Charlier expansion is expressed as:
h ( y ) = exp m + 1 m δ m H m ( y ) exp m + 1 m δ m H m ( y ) d y
δ m denotes the expansion coefficients, δ = δ 1 , δ 2 , δ m max T .where m is the order of expansion and m m a x is the maximum expansion order. H m ( y ) represents the Hermite polynomial, which is defined as
H m ( y ) = ( 1 ) m e y 2 2 d m d y m e y 2 2
Equation (19) does not directly use the cumulants obtained; instead, the cumulants are transformed into expansion coefficients by solving the linear system δ = λ , where is a symmetric matrix and λ is an m -dimensional column vector. Since λ 1 = 0 , the remaining terms can be obtained using Equation (11):
λ m = ( 1 m ) T m 2 σ m 2 ( m = 2 , 3 , , m max )
Here, σ denotes the standard deviation. Because the constructed probability density function follows the standard normal distribution, σ = 1 . T m can be computed from the m -th order cumulant κ m as follows T 0 = 1 ,   T 1 = T 2 = 0 and for m 3 ,
T m = κ m + q = 3 m 3 ( m 1 ) ! q ! ( m q 1 ) ! κ m q T q ( m 3 )
Through this process, the probability density functions of nodal voltages and branch power flows in the interconnected power system can be derived based on limited cumulant information. These probability distributions can further be applied to the assessment of voltage-limit violation risks in the power grid.

4. Bi-Level Optimization Model for Energy-Storage Planning

Energy storage systems can maintain probabilistic supply–demand balance through flexible charging and discharging. However, the high investment cost of storage makes large-scale deployment challenging. Therefore, both technical feasibility and economic factors must be considered when determining the installation location and capacity of storage systems. To ensure probabilistic supply–demand balance in interconnected power systems, while simultaneously considering economic efficiency and system security, this study develops a bi-level optimization model for storage planning. The model consists of two sub-models: the upper level for site selection and the lower level for capacity allocation and operational strategy.

4.1. Upper-Level Optimization Model

For the upper-level optimization, the comprehensive effectiveness coefficient is used to measure the improvement of grid performance by installing energy storage. The objective function is as follows:
min Φ = ( w 1 f 1 + w 2 f 2 + w 3 f 3 )
where f 1 , f 2 , f 3 are the effectiveness coefficients of the net power fluctuation P 1 , voltage exceedance probability P 2 , and network loss P 3 of the partitioned power grid. The coefficients w1, w2, and w3 are weighting factors. The weight coefficients w1, w2, w3 in the objective function reflect the decision-maker’s preference levels for different optimization objectives. To balance system safety and economic efficiency, we assume equal importance among these performance factors, setting w1 = w2 = w3, related studies [43,44,45] have also adopted this approach. These weight coefficients can be flexibly adjusted based on actual engineering requirements.
The power fluctuation P 1 is the average peak-to-valley difference of the net load power during the production simulation period, which serves as an indicator for measuring the peak-shaving and valley-filling capability of the selected energy storage configuration. The calculation formula is as follows:
P 1 = t = 1 T ( P max ( Δ t ) P min ( Δ t ) ) T
where Δ t is the sampling time interval, P max ( Δ t ) and P min ( Δ t ) are the maximum and minimum values of the net power within the sampling time interval.
The voltage limit exceeding probability P 2 refers to the probability that the node voltage exceeds the preset safe-operating range. It is calculated based on the probability density function of the regional network voltage, and the formula is as follows:
P 2 = j = 1 J V min h j ( v ) d v + V max h j ( v ) d v
where h j ( v ) is the voltage probability density function of subnetwork j , and V max and V min are the upper and lower voltage limits. In this study, to ensure the safe operation of the power grid, the voltage limits are set to V min = 0.95   p . u . and V max = 1.05   p . u .
The network loss P 3 is used as an index to evaluate the effectiveness of an energy storage system in reducing network losses, and it is calculated as follows:
P 3 = j = 1 J P loss
The efficacy function method can transform a multi-objective problem into a single-objective one, effectively solving the multi-objective optimization problem. The efficacy coefficient f d is calculated as:
f d = p d p s , d p h , d p s , d × 40 + 60 p d > p h , d 100 p d p h , d
In the above equation, f d is the efficacy coefficient value of the d variable, p d denotes the actual value of the d evaluation index, p h , d presents the satisfactory value of the d evaluation index, and p s , d indicates the unsatisfactory value of the d evaluation index.

4.2. Lower-Level Optimization Model

Once the optimal installation region of storage systems is determined by the upper-level model, the lower-level optimization further determines the storage capacity and operational strategies. The objective function is formulated as:
min C total = CRF i = 1 N ESS ( C E E i + C P P i ) + W j = 1 J ( V j , t + + V j , t ) + 365 i = 1 N ESS ( k o p + C deg ) 0 t ( P i , t ch + P i , t d i s ) d t
where the first term represents the investment cost of energy storage. CE and CP denote the unit investment costs of energy storage capacity and power, respectively; Ei and Pi are the rated capacity and rated power of energy storage unit i; and CRF is the capital recovery factor, which annualizes the investment cost and is calculated as CRF = r ( 1 + r ) N L ( 1 + r ) N L 1 , r is the discount rate and NL is the lifetime of the storage system. The second term represents the voltage-limit violation penalty, where W is the penalty coefficient, and V+j,t and Vj,t represent the upper and lower voltage-limit violations of node j at time t. The third term represents the annual operational cost, where kop is the operation and maintenance cost coefficient for charging and discharging, C deg is the degeneration cost coefficient. This objective function jointly considers investment, operation, and voltage violation risks to ensure both economic efficiency and reliability of the power system. The lower-level model is further subject to a set of physical and operational constraints for storage systems, defined as follows:
E i = ρ P i 20 a 0 P i , t ch P i 20 b 0 P i , t dis P i ( 20 c ) P i , t ch P i , t dis = 0 ( 20 d ) j = 1 J P j , t in = j = 1 J P j , t load j = 1 J P j , t PV + i = 1 N ESS ( η ch P i , t ch P i , t dis / η dis ) ( 20 e ) SOC min SOC i , t SOC max ( 20 f ) SOC i 0 Δ SOC SOC i , 24 SOC i 0 + Δ SOC ( 20 g ) SOC i , t + 1 = SOC i , t + ( η ch P i , t ch P i , t dis / η dis ) / E i ( 20 h )
where ρ is the ratio between storage capacity and power. Equation (20b) represents the charging power limit, and Equation (20c) represents the discharging power limit. P i , t ch and P i , t dis denote the charging and discharging power of storage unit i at time t. Equation (20d) defines the charging–discharging exclusivity constraint, which ensures that storage cannot charge and discharge simultaneously. Equation (20e) denotes the power balance constraint, where P j , t i n represents the net injected active power of node j at time t, and P j , t l o a d and P j , t P V denote the load demand and photovoltaic generation at node j at time t. η ch = 0.95 and η dis = 0.95 represent charging and discharging efficiencies [46]. Equation (20f) defines the state-of-charge (SOC) limits, where SOCi denotes the SOC of storage unit i at time t, and SOC min = 0.1 and SOC max = 0.9 are the minimum and maximum SOC levels, respectively. Equation (20g) defines the end-of-cycle SOC constraint, at the beginning of each scheduling period, the initial SOC of the energy storage system is set to SOC 0 = 0.5 . Equation (20h) describes the SOC update constraint of storage.

5. Results and Discussion

The flowchart of the data-driven probabilistic power flow method for energy storage planning considering interconnected grids is illustrated in Figure 1.
To verify the effectiveness of the proposed data-driven storage planning method for interconnected power systems based on Probabilistic Load Flow (PLF), experiments were conducted on a typical interconnected power system model. The experimental data consist of historical photovoltaic generation and load demand data, as well as the network topology and parameters of the interconnected power system. The photovoltaic and load data are obtained from actual measurements, covering fluctuation characteristics under different seasonal and weather conditions. Hourly load and photovoltaic output data are used for modeling, thereby reflecting typical uncertainties in power systems.
This experiment conducted simulation tests on the IEEE 33-node standard reference system [43]. For pure photovoltaic systems, PV modules are connected to specific nodes; whereas in PV-storage integrated configurations, energy storage units are co-located with PV modules at the same nodes, forming hybrid generation-storage resources. To validate the applicability of the methodology in interconnected grid scenarios, we logically divided the 33-node standard system into three sub-regions and established interconnection lines, its schematic diagram is shown in Figure 2, the energy storage system shown in the figure is derived from simulation results in subsequent content. The PV output data utilized in this study originated from actual historical operational records of a typical regional grid in Northwest China, covering a time span of 8760 h with a time resolution of Δt = 60 min. The computational experiments were performed on a personal computer equipped with an Intel Core i5 processor and 16 GB of RAM, using Python 3.10 as the programming environment. The optimization model was solved with the Gurobi solver. Photovoltaic sources are connected at nodes 10 in Region I, node 3 and10 in Region II, and node 8 in Region III. Historical profiles of load demand and photovoltaic generation are presented in Figure 3 and Figure 4, respectively.
The Bayesian Information Criterion (BIC) is employed to determine the optimal number of components K in the GMM. For photovoltaic or load data, the BIC values differ across nodes and time points. Figure 5 illustrates the variation curve of photovoltaic output at Node 4 in Region I over 2000 h. As K increases, the BIC value first decreases and then rises. The minimum BIC value occurs at K = 4, thus selecting K = 4 as the optimal number of components.
The Monte Carlo simulation method is commonly used as a benchmark for Probabilistic Load Flow (PLF) calculations. In this paper, the accuracy and computation time of the proposed cumulant-based method are compared with those of the Monte Carlo method. Using Monte Carlo simulation, 5000 scenarios are generated for each time period, and conventional power flow calculations are performed for every scenario to obtain the corresponding voltage magnitudes. The resulting probability density curves are then fitted and compared with those obtained using the cumulant-based probabilistic power flow method proposed in this study.
To analyze the differences between the two methods in fitting the probability density functions of system state variables, simulations are conducted for different buses and time periods. The comparison of the mean and variance of the two methods is shown in Table 2, Figure 6 illustrates the probability density function (PDF) curves calculated by the two methods under different conditions. The results indicate that the relative errors between the proposed method and the Monte Carlo method are within 5%, demonstrating that the proposed method is both effective and accurate.
Furthermore, Table 3 presents the comparison of computational speed between the two methods. As can be seen, the computation time of the proposed method is much shorter than that of the Monte Carlo simulation, indicating a substantial reduction in computational burden for power flow analysis.
The final siting results of the optimization model indicate that storage units are installed at node 5 in Region I, node 1 in Region II, and node 3 in Region III. The optimization model prioritizes nodes with the highest voltage sensitivity for energy storage deployment. These are locations where fluctuations from PV generation and load demand cause the most pronounced voltage deviations. Consequently, strategically placing storage at these critical nodes allows the system to mitigate overvoltage issues most effectively with minimal capacity. The optimal energy capacities and rated power capacities of the storage units are summarized in Table 4. The storage units maintain supply–demand balance through charging and discharging operations. The annual charging/discharging profiles and SOC variations of the storage units in Regions I, II, and III are presented in Figure 7 and Figure 8, Figure 9 and Figure 10, and Figure 11 and Figure 12, respectively.
Figure 7, Figure 8, Figure 9, Figure 10, Figure 11 and Figure 12 display the optimized annual charge–discharge power curves for energy storage systems in Region I, II, and III. Direct comparison with the PV generation curve and load demand curve reveals the operational logic underpinning the energy storage strategy. As shown in Figure 3 and Figure 4, PV generation peaks during midday, while load demand typically reaches its highest point in the evening. This time mismatch results in daytime power surpluses and potential power shortages at night. Accordingly, the energy storage units are scheduled to charge during peak PV generation periods and discharge during evening load peaks. This coordinated operation effectively balances power generation and consumption demands.
Figure 13 compares the voltage-limit violation probabilities across different regions before and after the installation of storage, while Figure 14 illustrates the corresponding average voltages. It can be observed that without storage, the voltage violation probability reaches 32% in Region II and 47% in Region III, which seriously affects power quality and supply reliability. In addition, the average voltages in Region II and Region III are both below 0.96 p.u. After the integration of storage systems, the voltage violation probabilities are reduced to 5% and 6% for Region II and Region III, respectively, demonstrating that the occurrence of voltage violations is largely eliminated. Furthermore, the average voltages of these regions are improved to values above 0.96 p.u., confirming that the storage systems effectively mitigate voltage violations by injecting power at appropriate times and locations, thereby enhancing system stability.
To further validate the effectiveness of storage under extreme conditions, the connection between nodes 9 and 10 in Region II was intentionally disconnected. Figure 15 shows the comparison of voltage violation probabilities with and without storage under the extreme scenario, while Figure 16 presents the corresponding average voltages. Without storage, the voltage violation probabilities in Region II and Region III approach 40%, and the average voltages in both regions fall below 0.96 p.u., with the average voltage in Region III approaching the lower limit. After storage deployment, the average voltages in all interconnected regions increase significantly, returning to higher levels, while the voltage violation probabilities in Regions II and III are reduced to approximately 5%. These results confirm that the proposed storage configuration effectively improves voltage performance even under extreme scenarios.
To further validate the robustness of the planning scheme, simulation analysis was conducted under another extreme operating condition. This scenario features a sudden severe weather event causing a sharp drop in photovoltaic power generation while system load remains at peak levels. As shown in Figure 17, node voltage distributions under such extreme scenarios were analyzed. A Monte Carlo simulation generated 2000 random operating scenarios, covering conditions ranging from normal to severe fluctuations. Among these, scenarios featuring significant PV output reduction coupled with surging load demand were specifically selected for detailed evaluation.
Results indicate, no voltage violations occurred under conventional operating conditions, but certain extreme scenarios caused voltage overshoot. As shown in Table 5, the voltage overshoot probabilities for the two selected extreme cases were 4.7% and 11.65%, respectively. Although these operating conditions increased the risk of overshoot and reduced the average node voltage compared to normal operation, the system performance still outperformed configurations without energy storage. This result confirms the enhanced robustness and disturbance resistance of the proposed planning scheme under extreme events.
Additionally, we conducted a comparative analysis using a deterministic planning approach that disregards uncertainty, with results shown in Figure 18 and Figure 19. The results indicate that the deterministic approach reduces investment costs and operational maintenance costs by 54.2% and 48.3%, respectively. However, this method simultaneously increases the probability of voltage violations by 15.5% in Region 2 and 25% in Region 3. It is evident that while the deterministic approach lowers economic costs, the accompanying increase in voltage violation risks significantly compromises the voltage security of the partitioned grid.
Historical PV generation and load curves are used to construct a GMM-based probabilistic distribution, explicitly capturing the uncertainty characteristics of actual source-load data. These data-driven distributions then serve as inputs for probabilistic power flow calculations within a two-layer planning model, which optimizes energy storage siting and capacity. Consequently, the planning outcomes and annual performance projections are intrinsically linked to the statistical patterns of the real PV and load data employed. The effectiveness of this approach is confirmed by the results: after storage deployment, the probability of voltage violations in Region II and Region III decreased by 27% and 41%, respectively, while the average voltages in Regions I, II, and III increased by 0.01 p.u., 0.06 p.u., and 0.07 p.u., respectively. These improvements directly contribute to enhanced system reliability.
In summary, the proposed storage planning strategy achieves remarkable improvements by significantly reducing the probability of voltage-limit violations and maintaining the probabilistic supply–demand balance of the interconnected power system.

6. Conclusions

This paper presents a data-driven probabilistic power flow method for energy storage planning, designed to enhance the reliability of interconnected grids with high penetration of renewable energy. The framework first employs a Gaussian Mixture Model to accurately capture the complex uncertainties in photovoltaic generation and load. A probabilistic power flow analysis, based on an improved Cumulant Method, is then conducted to quantify voltage stability risks. Informed by this risk assessment, a bi-level optimization model is established to determine the optimal location, capacity, and operational strategy for energy storage, effectively balancing system security and economy. Experimental results validate the method’s effectiveness, demonstrating a significant reduction in the probability of voltage-limit violations. The configured energy storage system exhibits strong robustness, maintaining grid stability even under extreme fault scenarios. This study provides a viable solution for the safe and reliable integration of renewable energy.
While the proposed method demonstrates strong performance in energy storage planning, it is subject to several limitations. First, the computational complexity associated with the Gram–Charlier series expansion increases nonlinearly with the scale of the grid nodes, which may constrain its application in larger systems. Second, the current model does not yet incorporate a physics-based state-of-health representation for the storage units, limiting its ability to capture detailed aging dynamics. Future work could focus on improving computational efficiency through sparse-modeling techniques and integrating refined battery health models to enable more accurate and physically representative planning.

Author Contributions

Conceptualization, T.C.; methodology, T.C.; validation, T.C., X.J. and Y.W.; formal analysis, Z.F. and Y.M.; investigation, Z.F.; resources, D.G.; data curation, Y.W.; writing—original draft preparation, T.C., X.J., Z.F. and Y.W.; writing—review and editing, Y.M., D.G., D.Z. and Y.B.; supervision, X.J.; funding acquisition, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research is funded by the Headquarters Management Science and Technology Project of State Grid Corporation of China (Research and Application of Key Technologies for Dynamic Probability Balance and Security-Economic Equilibrium Decision Making Adapting to High Uncertainty of Power Supply and Demand), grant number 1400-202416292A-1-1-ZN.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Authors Ying Mu, Dashun Guan, Dongliang Zhang and Ying Bai were employed by the State Grid Shandong Electric Power Company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Cavus, M. Advancing Power Systems with Renewable Energy and Intelligent Technologies: A Comprehensive Review on Grid Transformation and Integration. Electronics 2025, 14, 1159. [Google Scholar] [CrossRef]
  2. Jiang, B.; Raza, M.Y. Research on China’s renewable energy policies under the dual carbon goals: A political discourse analysis. Energy Strat. Rev. 2023, 48, 101118. [Google Scholar] [CrossRef]
  3. Voropai, N. Electric power system transformations: A review of main prospects and challenges. Energies 2020, 13, 5639. [Google Scholar] [CrossRef]
  4. Zheng, K.; Sun, Z.; Song, Y.; Zhang, C.; Zhang, C.; Chang, F.; Yang, D.; Fu, X. Stochastic scenario generation methods for uncertainty in wind and photovoltaic power outputs: A comprehensive review. Energies 2025, 18, 503. [Google Scholar] [CrossRef]
  5. Nakamoto, Y.; Eguchi, S. How do seasonal and technical factors affect generation efficiency of photovoltaic power plants? Renew. Sustain. Energy Rev. 2024, 199, 114441. [Google Scholar] [CrossRef]
  6. Shen, C.; Zhu, W.; Tang, X.; Du, W.; Wang, Z.; Xu, S.; Yao, K. Risk assessment and resilience enhancement strategies for urban power supply-demand imbalance affected by extreme weather: A case study of Beijing. Int. J. Disaster Risk Reduct. 2024, 106, 104471. [Google Scholar] [CrossRef]
  7. Yang, C.; Zhao, Y.; Li, X.; Zhou, X. Electric vehicles, load response, and renewable energy synergy: A new stochastic model for innovation strategies in green energy systems. Renew. Energy 2025, 238, 121890. [Google Scholar] [CrossRef]
  8. Cao, W.; Zhou, L. Resilient microgrid modeling in Digital Twin considering demand response and landscape design of renewable energy. Sustain. Energy Technol. Assess. 2024, 64, 103628. [Google Scholar] [CrossRef]
  9. Zhao, X.; Zhang, Y. Integrated management of urban resources toward Net-Zero smart cities considering renewable energies uncertainty and modeling in Digital Twin. Sustain. Energy Technol. Assess. 2024, 64, 103656. [Google Scholar] [CrossRef]
  10. Zang, T.; Wang, S.; Wang, Z.; Li, C.; Liu, Y.; Xiao, Y.; Zhou, B. Integrated planning and operation dispatching of source–grid–load–storage in a new power system: A coupled socio–cyber–physical perspective. Energies 2024, 17, 3013. [Google Scholar] [CrossRef]
  11. Hu, H.; Yu, S.S.; Trinh, H. A review of uncertainties in power systems—Modeling, impact, and mitigation. Designs 2024, 8, 10. [Google Scholar] [CrossRef]
  12. Fu, X.; Zhang, C.; Xu, Y.; Zhang, Y.; Sun, H. Statistical machine learning for power flow analysis considering the influence of weather factors on photovoltaic power generation. IEEE Trans. Neural Netw. Learn. Syst. 2024, 36, 5348–5362. [Google Scholar] [CrossRef]
  13. Li, Q.; Xiong, Y.; Sidorov, D.; Murad, M.A.A.; Liu, M. Probabilistic power flow method based on monotonic consistency interpolation and enhanced sample permutation. Electr. Power Syst. Res. 2025, 247, 111821. [Google Scholar] [CrossRef]
  14. Xiao, Q.; Wu, L.; Chen, C. Probabilistic power flow computation using nested point estimate method. IET Gener. Transm. Distrib. 2022, 16, 1064–1082. [Google Scholar] [CrossRef]
  15. Zishan, F.; Akbari, E.; Montoya, O.D. Analysis of probabilistic optimal power flow in the power system with the presence of microgrid correlation coefficients. Cogent Eng. 2024, 11, 2292325. [Google Scholar] [CrossRef]
  16. Luo, Y.; Wang, X.; Yan, S. Risk assessment of photovoltaic distribution network based on adaptive kernel density estimation and cumulant method. Energy Rep. 2022, 8, 1152–1159. [Google Scholar] [CrossRef]
  17. Wang, S.; Wu, S.; Tang, B.; Liu, L.; Cheng, L. Generation method of wind power and photovoltaic output scenarios based on LHS-GRU. Sustain. Energy Grids Netw. 2025, 41, 101602. [Google Scholar] [CrossRef]
  18. Goodfellow, J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar]
  19. Ma, X.; Liu, Y.; Yan, J.; Wang, H. A WGAN-GP-based scenarios generation method for wind and solar power complementary study. Energies 2023, 16, 3114. [Google Scholar] [CrossRef]
  20. Salimans, T.; Goodfellow, I.; Zaremba, W.; Cheung, V.; Radford, A.; Chen, X. Improved techniques for training gans. Adv. Neural Inf. Process. Syst. 2016, 29, 2226–2234. [Google Scholar]
  21. Zhang, Z.; Zhang, D.; Qiu, R.C. Deep reinforcement learning for power system applications: An overview. CSEE J. Power Energy Syst. 2019, 6, 213–225. [Google Scholar]
  22. Huang, J.; Zhang, H.; Tian, D.; Zhang, Z.; Yu, C.; Hancke, G.P. Multi-agent deep reinforcement learning with enhanced collaboration for distribution network voltage control. Eng. Appl. Artif. Intell. 2024, 134, 108677. [Google Scholar] [CrossRef]
  23. Li, J.; Zhang, R.; Wang, H.; Liu, Z.; Lai, H.; Zhang, Y. Deep reinforcement learning for voltage control and renewable accommodation using spatial-temporal graph information. IEEE Trans. Sustain. Energy 2023, 15, 249–262. [Google Scholar] [CrossRef]
  24. Yang, J.; Zhou, K.; Li, Y.; Liu, Z. Generalized out-of-distribution detection: A survey. Int. J. Comput. Vis. 2024, 132, 5635–5662. [Google Scholar] [CrossRef]
  25. Tong, B.; Zhang, L.; Li, G.; Zhang, B.; Xie, F.; Tang, W. An Overvoltage-Averse Model for Renewable-Rich AC/DC Distribution Networks Considering the Sensitivity of Voltage Violation Probability. IEEE Trans. Sustain. Energy 2024, 16, 613–626. [Google Scholar] [CrossRef]
  26. Ejuh Che, E.; Roland Abeng, K.; Iweh, C.D.; Tsekouras, G.J.; Fopah-Lele, A. The impact of integrating variable renewable energy sources into grid-connected power systems: Challenges, mitigation strategies, and prospects. Energies 2025, 18, 689. [Google Scholar] [CrossRef]
  27. Lu, M.; Sun, Y.; Ma, Z. Multi-objective design optimization of multiple energy systems in net/nearly zero energy buildings under uncertainty correlations. Appl. Energy 2024, 370, 123620. [Google Scholar] [CrossRef]
  28. Chen, J.; Liu, F.; Wang, Y.; Li, Y. Emergency scheduling of virtual energy storage based on continuous-time model for resilience enhancement under extreme events. Energy 2025, 342, 139634. [Google Scholar] [CrossRef]
  29. Islam, M.M.; Yu, T.; Giannoccaro, G.; Mi, Y.; la Scala, M.; Nasab, M.R.; Wang, J. Improving reliability and stability of the power systems: A comprehensive review on the role of energy storage systems to enhance flexibility. IEEE Access 2024, 12, 152738–152765. [Google Scholar] [CrossRef]
  30. Arun, M.; Samal, S.; Barik, D.; Chandran, S.S.; Tudu, K.; Praveenkumar, S. Integration of energy storage systems and grid modernization for reliable urban power management toward future energy sustainability. J. Energy Storage 2025, 114, 115830. [Google Scholar] [CrossRef]
  31. Shafiei, K.; Seifi, A.; Hagh, M.T. A novel multi-objective optimization approach for resilience enhancement considering integrated energy systems with renewable energy, energy storage, energy sharing, and demand-side management. J. Energy Storage 2025, 115, 115966. [Google Scholar] [CrossRef]
  32. Yi, Y.; Chang, L.; Wu, B.; Zhao, J.; Peng, H.; Li, L.; Wang, A. Life cycle assessment of energy storage technologies for new power systems under dual-carbon target: A review. Energy Technol. 2024, 12, 2301129. [Google Scholar] [CrossRef]
  33. Yang, H.; Chen, J.J.; Li, Y. Two-layer iterative optimization for enhanced electricity-carbon pricing to promote renewable integration and load smoothing with multi-type prosumer. Energy 2025, 336, 138226. [Google Scholar] [CrossRef]
  34. Yao, M.; Da, D.; Lu, X.; Wang, Y. A review of capacity allocation and control strategies for electric vehicle charging stations with integrated photovoltaic and energy storage systems. World Electr. Veh. J. 2024, 15, 101. [Google Scholar] [CrossRef]
  35. Guo, J.; Jing, Y.; Hou, W.; Wang, T.; Ma, S.; He, G. Demands and challenges of energy storage technology for future power system. Energy Internet 2024, 1, 116–122. [Google Scholar] [CrossRef]
  36. Kiasari, M.; Ghaffari, M.; Aly, H.H. A comprehensive review of the current status of smart grid technologies for renewable energies integration and future trends: The role of machine learning and energy storage systems. Energies 2024, 17, 4128. [Google Scholar] [CrossRef]
  37. Khalaf, M.; Ayad, A.; Tushar, M.H.K.; Kassouf, M.; Kundur, D. A survey on cyber-physical security of active distribution networks in smart grids. IEEE Access 2024, 12, 29414–29444. [Google Scholar] [CrossRef]
  38. Liu, M.; Teng, F.; Zhang, Z.; Ge, P.; Sun, M.; Deng, R.; Cheng, P.; Chen, J. Enhancing cyber-resiliency of der-based smart grid: A survey. IEEE Trans. Smart Grid 2024, 15, 4998–5030. [Google Scholar] [CrossRef]
  39. Collath, N.; Tepe, B.; Englberger, S.; Jossen, A.; Hesse, H. Aging aware operation of lithium-ion battery energy storage systems: A review. J. Energy Storage 2022, 55, 105634. [Google Scholar] [CrossRef]
  40. Li, P.; Chen, J.; Yang, H.; Lin, Z. Peer-to-peer power trading and pricing for rental energy storage shared community microgrid: A coordinated Stackelberg and cooperative game. Renew. Energy 2025, 256, 123963. [Google Scholar] [CrossRef]
  41. Seger, P.V.; Rigo-Mariani, R.; Thivel, P.-X.; Riu, D. A storage degradation model of Li-ion batteries to integrate ageing effects in the optimal management and design of an isolated microgrid. Appl. Energy 2023, 333, 120584. [Google Scholar] [CrossRef]
  42. Zhang, W.; Wang, S. Optimal allocation of Bess in distribution network based on improved equilibrium optimizer. Front. Energy Res. 2022, 10, 936592. [Google Scholar] [CrossRef]
  43. Zhao, P.; Liu, X.; Qu, H.; Liu, N.; Zhang, Y.; Xiao, C. Multi-Objective Cooperative Optimization Model for Source–Grid–Storage in Distribution Networks for Enhanced PV Absorption. Processes 2025, 13, 2841. [Google Scholar] [CrossRef]
  44. Maghami, M.R.; Yaghoubi, E.; Mohamed, M.; Jahromi, M.Z.; Fei, T.K. Multi-objective optimization of unbalanced power distribution systems: A comprehensive approach to address uncertainties and enhance performance. Energy Convers. Manag. X 2025, 27, 101087. [Google Scholar] [CrossRef]
  45. Lai, C.S.; Chen, D.; Zhang, J.; Zhang, X.; Xu, X.; Taylor, G.A.; Lai, L.L. Profit maximization for large-scale energy storage systems to enable fast EV charging infrastructure in distribution networks. Energy 2022, 259, 124852. [Google Scholar] [CrossRef]
  46. Li, H.; Li, L. Bilevel Planning of Distribution Networks with Distributed Generation and Energy Storage: A Case Study on the Modified IEEE 33-Bus System. Energy Eng. 2025, 122, 1337–1358. [Google Scholar] [CrossRef]
Figure 1. Flowchart of the data-driven probabilistic power flow method for energy storage planning considering interconnected grids.
Figure 1. Flowchart of the data-driven probabilistic power flow method for energy storage planning considering interconnected grids.
Energies 18 06633 g001
Figure 2. Schematic diagram of the interconnected power system regions.
Figure 2. Schematic diagram of the interconnected power system regions.
Energies 18 06633 g002
Figure 3. Annual photovoltaic generation profiles of each region.
Figure 3. Annual photovoltaic generation profiles of each region.
Energies 18 06633 g003
Figure 4. Annual load demand profiles of each region.
Figure 4. Annual load demand profiles of each region.
Energies 18 06633 g004
Figure 5. BIC Value Change Curve.
Figure 5. BIC Value Change Curve.
Energies 18 06633 g005
Figure 6. Comparison of Voltage Magnitude Probability Density under Different Cases.
Figure 6. Comparison of Voltage Magnitude Probability Density under Different Cases.
Energies 18 06633 g006
Figure 7. Charging and discharging curves of energy storage in Region I.
Figure 7. Charging and discharging curves of energy storage in Region I.
Energies 18 06633 g007
Figure 8. SOC variation curve of energy storage in Region I.
Figure 8. SOC variation curve of energy storage in Region I.
Energies 18 06633 g008
Figure 9. Charging and discharging curves of energy storage in Region II.
Figure 9. Charging and discharging curves of energy storage in Region II.
Energies 18 06633 g009
Figure 10. SOC variation curve of energy storage in Region II.
Figure 10. SOC variation curve of energy storage in Region II.
Energies 18 06633 g010
Figure 11. Charging and discharging curves of energy storage in Region III.
Figure 11. Charging and discharging curves of energy storage in Region III.
Energies 18 06633 g011
Figure 12. SOC variation curve of energy storage in Region III.
Figure 12. SOC variation curve of energy storage in Region III.
Energies 18 06633 g012
Figure 13. Comparison of voltage violation probabilities in each region before and after storage installation.
Figure 13. Comparison of voltage violation probabilities in each region before and after storage installation.
Energies 18 06633 g013
Figure 14. Comparison of average voltages in each region before and after storage installation.
Figure 14. Comparison of average voltages in each region before and after storage installation.
Energies 18 06633 g014
Figure 15. Comparison of voltage deviation probability in different regions before and after installation of energy storage devices under extreme line disconnection conditions.
Figure 15. Comparison of voltage deviation probability in different regions before and after installation of energy storage devices under extreme line disconnection conditions.
Energies 18 06633 g015
Figure 16. Comparison of average voltage before and after installation of energy storage systems in each region under extreme line disconnection conditions.
Figure 16. Comparison of average voltage before and after installation of energy storage systems in each region under extreme line disconnection conditions.
Energies 18 06633 g016
Figure 17. Node Voltage Distribution Diagram Under Extreme Scenarios.
Figure 17. Node Voltage Distribution Diagram Under Extreme Scenarios.
Energies 18 06633 g017
Figure 18. Economic comparison under our method and deterministic programming.
Figure 18. Economic comparison under our method and deterministic programming.
Energies 18 06633 g018
Figure 19. Probability of overvoltage under our method and deterministic programming.
Figure 19. Probability of overvoltage under our method and deterministic programming.
Energies 18 06633 g019
Table 1. Comparison between this work and relevant literature.
Table 1. Comparison between this work and relevant literature.
MethodHigh AccuracyFast CalculationPhysical InterpretabilityStability/Convergence
[13]
[14]×
[15]×
[16]××
[17]×××
[18]×××
[19]×××
[21]××
[23]××
This Work
Table 2. Comparison of Mean and Covariance between This Work and Monte Carlo Method.
Table 2. Comparison of Mean and Covariance between This Work and Monte Carlo Method.
CaseMeanCovariance
This WorkMonte CarloRelative Error (%)This WorkMonte CarloRelative Error
Node 6, 12:000.9783720.9786450.03%0.0060180.0061692.44%
Node 6, 16:000.9779080.9777690.01%0.0062160.0063632.31%
Node 19, 12:000.9981740.9981960.01%0.0003620.0003753.64%
Node 19, 16:000.9981690.9981670.01%0.0003770.0003672.62%
Table 3. Comparison of Computational Speed between This Work and Monte Carlo Method.
Table 3. Comparison of Computational Speed between This Work and Monte Carlo Method.
MethodNumber of Calculations (Times)Computation Times (s)
Monte Carlo50002331.81
This Work3713.43
Table 4. Regional Energy Storage Planning Results for the Interconnected Power Grid.
Table 4. Regional Energy Storage Planning Results for the Interconnected Power Grid.
Storage LocationRated Energy Capacity (kWh)Rated Power Capacity (kW)
Node 5 in Region I11,1885594
Node 2 in Region II67603380
Node 3 in Region III11,7395869
Table 5. Voltage Levels in Each Area Before and After Energy Storage Installation.
Table 5. Voltage Levels in Each Area Before and After Energy Storage Installation.
Storage LocationProbability of Voltage Exceeding Limits (%)Node Average Voltage (kV)
Node 2 in Region IIExtreme4.70.9616
Normal00.9899
Node 3 in Region IIIExtreme11.650.9658
Normal00.9910
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, T.; Jiang, X.; Fan, Z.; Wu, Y.; Mu, Y.; Guan, D.; Zhang, D.; Bai, Y. Data-Driven Probabilistic Power Flow for Energy-Storage Planning Considering Interconnected Grids. Energies 2025, 18, 6633. https://doi.org/10.3390/en18246633

AMA Style

Cheng T, Jiang X, Fan Z, Wu Y, Mu Y, Guan D, Zhang D, Bai Y. Data-Driven Probabilistic Power Flow for Energy-Storage Planning Considering Interconnected Grids. Energies. 2025; 18(24):6633. https://doi.org/10.3390/en18246633

Chicago/Turabian Style

Cheng, Tingting, Xirui Jiang, Zheng Fan, Yanan Wu, Ying Mu, Dashun Guan, Dongliang Zhang, and Ying Bai. 2025. "Data-Driven Probabilistic Power Flow for Energy-Storage Planning Considering Interconnected Grids" Energies 18, no. 24: 6633. https://doi.org/10.3390/en18246633

APA Style

Cheng, T., Jiang, X., Fan, Z., Wu, Y., Mu, Y., Guan, D., Zhang, D., & Bai, Y. (2025). Data-Driven Probabilistic Power Flow for Energy-Storage Planning Considering Interconnected Grids. Energies, 18(24), 6633. https://doi.org/10.3390/en18246633

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop