1. Introduction
Due to the rapid development of distributed PV, power system operation is facing challenges [
1,
2,
3]. With the deepening advancement of the new power system, renewable energy in China has developed explosively, particularly distributed PV, which has significantly increased the proportion of clean energy consumption [
4]. However, due to climate factors such as cloud cover, the output of distributed PV exhibits significant fluctuations and randomness, which poses severe challenges to the safe and stable operation of the power grid [
5]. On one hand, the large-scale integration of distributed PV has altered the operational landscape of the power grid, transforming traditional distribution grids from purely receiving-end grids to a grid that integrates power generation and consumption, where an excessive reverse load rate can severely impact grid safety. On the other hand, the variability of distributed PV output can affect voltage quality, voltage stability, harmonic content, etc. The unrestricted connection of distributed PV would inevitably lead to a deterioration in grid safety [
6,
7]. Distributed PV bearing capacity refers to the maximum capacity that the grid can accommodate while meeting all operational constraints of the power system and equipment. So, distributed PV bearing capacity assessment is of critical importance.
Regarding the assessment of distributed PV bearing capacity, some scholars have already conducted relevant research. They mainly focus on the key factors caused by distributed PV integration into distribution grids, such as reverse load rate and voltage deviation, extracting relevant constraint indicators for evaluation. References [
8,
9,
10,
11] establish a distributed PV bearing capacity assessment model for distribution grids by considering multi-constraint conditions, with the objective of maximizing the PV capacity that the grid can accommodate; reference [
12] considers the feed-end bus voltage and constructs an evaluation of the access capacity of distributed PV; and references [
13,
14,
15,
16] realize the evaluation and calculation of the PV capacity through various algorithms.
The above method provides an effective approach for the assessment of distributed photovoltaic (PV) capacity. However, it does not take into account the impact of static voltage stability. References [
17,
18,
19] calculate the static voltage stability boundary based on fixed scenarios or operating states and then use the boundary as a constraint to assess the distributed PV capacity. Nevertheless, the system′s static voltage stability is closely related to the system′s operating state and the penetration level of distributed PV—specifically, the penetration level of distributed PV and static voltage stability are tightly coupled. It is necessary to consider the impact of changes in the static voltage stability boundary with the power grid′s operating modes, which has not yet been studied. Meanwhile, the bearing capacity is closely related to the grid operation mode. For example, in grids with heavy loads, the integration of distributed PV can effectively reduce transformer-side power flow, thereby benefiting grid safety. However, when the capacity of distributed PV exceeds a certain proportion, it may face the dual challenges of excessive reverse power flow during periods of low load and heavy PV generation, as well as excessive forward power flow during evening peak load periods when PV generation is absent. Additionally, significant fluctuations in power flow throughout the day pose a severe challenge to the safe and stable operation of the power grid. Therefore, distributed PV bearing capacity assessment needs to be comprehensively judged in conjunction with the actual operating conditions of the grid [
20,
21]. To account for the impact of the source–load coupling characteristic, references [
22,
23] consider the coordinated operation of various regulatory resources within the grid, such as smart inverters and model predictive controllers, to improve distributed PV bearing capacity. References [
24,
25,
26,
27] describe the correlation between photovoltaic (PV) output and load based on the copula function and use sampling methods to extract different typical scenarios. Distributed PV capacity assessment is conducted on this basis. However, when extracting typical scenarios, they ignore the issue that PV output has large fluctuations and may be affected by extreme weather data. For example, in a certain cluster, most data points are concentrated in a specific location, while a few points are relatively far from the center. In such cases, typical scenarios should be more inclined to the location where most data are gathered, and the impact of data points far from the center should be weakened. Reference [
28] assesses the PV bearing capacity considering the uncertainty of source and load by the random optimization method.
Based on the above analysis, current research on the assessment of distributed photovoltaic (PV) capacity is relatively well developed, but there are still two aspects that need improvement: Firstly, regarding the consideration of static voltage stability, at present, the static voltage stability boundary is mostly calculated based on fixed scenarios or operating states and cannot dynamically change with the system’s operating state or the penetration level of distributed PV. This poses a risk that the capacity assessment results may exceed the stability boundary. Secondly, in the generation of source–load-coupled scenarios, some individual data points (not abnormal data but can be understood as atypical cases of normal output, which often occur in PV output scenarios with large fluctuations) are relatively far from the center. This causes the typical scenarios obtained through clustering to fail to reflect the majority of cases, leading to deviations in capacity assessment.
To address the above issues, this paper makes the following contributions:
- (1)
It extracts static voltage stability indicators that take distributed PV integration into account. These indicators can dynamically change according to different PV penetration levels, and optimization is conducted based on them. Compared with the current static voltage stability boundary calculated based on fixed scenarios or operating states, this approach is more accurate.
- (2)
It proposes a weighted-variable K-Means algorithm. When updating the cluster center, weight coefficients are calculated based on Euclidean distance—data points with a larger Euclidean distance are assigned a smaller weight coefficient. This helps mitigate the impact of outliers on clustering results.
- (3)
It establishes a multi-objective assessment model for distributed PV capacity and conducts simulation verification based on measured data. Conclusions are drawn, such as the following: when distributed PV is integrated into nodes where the system’s static voltage stability is weak, the system can accommodate more distributed PV.
3. Source–Load Coupling Scenario Generation Method
Analysis of
Section 2 reveals that the static voltage stability of the system is closely related to node load and photovoltaic output. As these two parameters fluctuate continuously during actual operation, a coupled analysis of load and photovoltaic output is required to better assess power grid bearing capability. Subsequently, typical output scenarios are categorized, with the maximum distributed PV hosting capacity under the most unfavorable scenario serving as the power grid bearing capability.
- (1)
Probability Modeling of Photovoltaic Output
As indicated in references [
29,
30,
31], distributed PV output approximates a Beta distribution, with the probability density function
f(
PPV) defined as
where
Γ denotes the gamma distribution and
and
are shape parameters, which can be obtained through fitting using historical photovoltaic data.
- (2)
Probabilistic Modeling of Load Output
As indicated in references [
32,
33,
34], load data approximates a normal distribution with the following probability density function:
where
PL denotes the active power of the load and
μPL and
σPL denote the mean and standard deviation of the normal distribution, respectively.
- (3)
Coupled Output Modeling for Photovoltaic and Load
Photovoltaic output and load output constitute two random variables. Utilizing copula functions, their distribution functions can be described as a joint distribution function. Sampling based on this joint distribution function fully reflects the correlation between photovoltaic output and active load power. The specific steps are outlined below.
Step 1: Determine the marginal distributions of the two random variables: photovoltaic output and load output.
The marginal distribution functions of photovoltaic output and load power are, respectively,
Step 2: Select an appropriate copula function to combine the marginal distributions of two random variables into a multivariate distribution.
Common copula functions include the Gaussian copula, Frank copula, T-copula, Clayton copula, and Gumbel copula. Different copula functions describe distinct correlation patterns. The T-copula is better suited for handling distributions with heavy tails, while the Gaussian copula is appropriate for simulating correlations in normally distributed data. In practical implementation, the optimal copula function may be determined using the AIC (BIC) criterion, where lower values indicate superior performance. Based on this principle, the BIC values of various types of copula functions are shown in the table.
As can be seen from
Table 1, the BIC value corresponding to the Frank copula function is the lowest, indicating that the Frank copula function is the best in describing the correlation between photovoltaic and load output. Therefore, this paper chooses the Frank copula function. By incorporating the distribution functions of photovoltaic output and load power into the Frank copula function, the joint distribution function
can be expressed as
where
is the correlation parameter, which is the parameter to be estimated.
Step 3: Employ the maximum likelihood method to estimate the correlation parameter , thereby completing the calculation of the joint distribution.
Derive the probability density function of the joint distribution function
, as shown below:
Let
N be the sample size. The likelihood function is then given by
With the objective of maximizing the likelihood function as shown in Formula (17), the estimated value
of the correlation parameter
is obtained as follows:
- (4)
Generation of Typical Scenarios Based on Clustering
Using the joint distribution function fitted via Formula (17), repeated sampling yields a substantial number of new photovoltaic and load output samples . This approach essentially covers all practical operating modes, thus compensating for the shortcomings of insufficient actual sample data. Subsequently, the massive number of scenarios generated above are clustered into several typical scenarios, which are evaluated using the hosting capacity assessment. This paper employs an improved robust K-Means algorithm for clustering. The specific process is as follows:
Step 1: Initialize the cluster centers of
K clusters, where
K is the algorithm′s hyper-parameter. This paper employs the silhouette coefficient method to determine the value of
K. Its principle involves calculating the similarity
a between the samples and its own cluster and the similarity
b between the samples and the nearest other clusters. The silhouette coefficient
s is, then, as follows:
The closer the coefficient is to 1, the better the clustering performance.
The average silhouette coefficient is computed for different K values, and the value of K corresponding to the maximum coefficient is chosen as the final number of clusters.
Step 2: Initial Assignment of Samples
The Euclidean distance is used to quantify the similarity between data samples, and each sample is assigned to its nearest cluster. Let the
K clusters be represented as
, and their corresponding centers are
. Then, the sample
x satisfies
Step 3: Update Cluster Centers
After the initial assignment of all samples, the mean of the samples within each cluster is taken as the new cluster center for re-clustering. The update formula of the cluster center is as follows:
Step 4: Re-cluster using the updated cluster centers as the baseline until the change in cluster center updates falls below the threshold.
The aforementioned clustering process assumes that the data points are relatively uniformly distributed around cluster centers. But practical data analysis indicates that some data points may be away from cluster centers, which can significantly impact clustering results, as shown in the figure below.
In
Figure 2, point A is relatively far away from the nodes of its cluster, and there is a possibility of anomaly. When updating and calculating the cluster center of the cluster, if the weight of point A is treated the same as that of other points, the cluster center may be shifted to point A, resulting in clustering deviation.
However, conventional K-Means algorithms cannot effectively handle this type of data. Therefore, building upon the original algorithm, this paper improves the traditional K-Means by proposing a variable-weight K-Means algorithm. This approach assigns lower weights to samples farther from the cluster center during the updating of cluster centers. The updated formula for cluster center calculation is transformed to
It can be seen that the farther a sample is from the cluster center obtained in the previous iteration, the smaller its weight is. Consequently, the updated cluster center is less affected by possible outliers.
4. Bearing Capacity Assessment Method for Distributed PV in Multi-Level Power Supply Systems
The objective of this paper is to evaluate power grid bearing capability. As demonstrated by the preceding analysis, this capacity is closely tied to the degree of source–load coupling. If a strong positive correlation exists between the source and the load—meaning that during periods of high PV output, the load demand is also high—the electricity produced can be consumed locally, thereby enhancing distributed PV bearing capacity. Conversely, if there is a strong negative correlation—that is, when load demand is high, PV output is low, and when load demand is low, PV output is high—issues such as excessive voltage deviation at the end of the distribution network and a high reverse load rate are inevitably to arise. These adverse conditions hinder the improvement of distributed PV bearing capacity.
In
Section 3,
K typical source–load coupling output scenarios are generated, corresponding to
K distinct source–load output curves. For the bearing capacity assessment, an optimization model is first established with the objectives of maximizing distributed PV hosting capacity, optimizing economic performance, and maximizing system static voltage stability. Then, sequential solutions are performed based on the source–load power curves under different scenarios. The assessment of the most critical scenario and the most vulnerable time instant (which could realistically occur) will be used as the final photovoltaic bearing capacity of the power grid.
4.1. Objective Function
Objective 1: Maximize the total distributed PV capacity connected at each grid node, expressed as
where
denotes the distributed PV capacity connected at node
i at time
t in scenario
k and
N denotes the number of network nodes.
Objective 2: Following the integration of distributed PV into the system, it significantly alters the system power flow and affects the system line losses. The economic objective focuses on minimizing line losses. It is shown as follows:
where
N denotes the number of nodes in the new energy power system;
Ui denotes the voltage at node
i;
Gij and
Bij denote the conductance and admittance of branch
i-
j, respectively; and
is the phase difference between node
i and node
j.
Objective 3: Maximizing System Static Voltage Stability
As the proportion of distributed PV generation increases, the system static voltage stability has become a primary constraint on the distributed PV bearing capacity. While meeting Objectives 1 and 2, it is also necessary to maximize static voltage stability after distributed PV integration:
It should be noted that integrating PV at different locations and capacities will lead to different weak points in the system (to be detailed in the simulation section later). Therefore, when calculating the static voltage stability index shown in Formula (27), it must be recalculated whenever the integration scheme changes. However, this computation is relatively efficient. As indicated by Formula (27), the static voltage stability depends solely on network structure parameters and PV hosting capacity and not on the operational power flow of the system. Hence, once the PV integration scheme changes, algebraic calculations based on the updated PV capacity can be performed directly.
4.2. Constraints
For any scenario m at time t, the objective function must satisfy system power flow constraints, node voltage constraints, branch power flow constraints, and others. For simplicity, subscripts m and t are omitted in the following descriptions:
- (1)
System Power Flow Constraint
Here,
Ui denotes the voltage at node
i;
Gij and
Bij denote the conductance and admittance of branch
i-
j, respectively; and
denotes the phase difference between nodes
i and
j.
- (2)
Node Voltage Constraint
According to GB/T 33593 [
35], the maximum allowable voltage deviation in the grid is
where
and
represent the equivalent resistance and equivalent reactance of the line grid, respectively.
Then, the voltage deviation of the grid after distributed PV integration should be satisfied by
- (3)
Branch Current Constraint
The current in the line must not exceed the upper limit, satisfying
where
Iij,t denotes the branch current magnitude at time
t and
Iij,max denotes the maximum allowable current flowing through the branch.
Additionally, according to the guidelines, the reverse power flow rate is limited to a maximum of 80%, as expressed by the following formula:
The calculation of the reverse power flow rate is given by the following formula:
where
denotes the active power of distributed PV;
denotes the equivalent load active power, i.e., the pure load power excluding distributed PV output; and
denotes the power limit of electrical equipment such as transformers or lines.
4.3. Rigid Indicator Verification
After obtaining the bearing capacity results through optimization evaluation as described in
Section 4.1, a short-circuit current verification is performed under the most severe scenario. If the verification passes, the bearing capacity assessment is completed. Instead, if it fails, the bearing capacity value will be updated to the limit value of the non-compliant indicator.
After integrating distributed PV into the grid, the short-circuit current should comply with the following requirements:
where
denotes the short-circuit current of the system bus and
denotes the short-circuit current limit.
4.4. Model Solving
This paper uses a sequential optimization algorithm to solve the problem because the three objective functions of the proposed distributed photovoltaic multi-power supply capacity assessment method have clear priorities.
Firstly, Objective 1 is to maximize the sum of the distributed PV capacity connected to each grid node. The purpose of this article is to determine the maximum carrying capacity of distributed PV, so this objective has the highest priority.
Secondly, Objective 2 is to maximize system static voltage stability. Generally speaking, system static voltage stability shows a trend of increasing and then decreasing with increasing PV capacity. The capacity calculated by Objective 1 will generally fall within the decreasing trend. Using the distributed PV capacity obtained by Objective 1 as the lower bound, we continue optimizing to meet safety requirements.
Finally, regarding Objective 3, economic efficiency primarily focuses on minimizing line losses. This objective is relatively unimportant. Minimizing line losses is sufficient, provided that Objectives 1 and 2 are met.
6. Simulation Analysis
6.1. Case Study Introduction
- (1)
Grid Topology
The simulation analysis in this paper is based on the modified IEEE 33-node system, and the system structure is shown in
Figure 4. In the figure, arrows and numbers indicate load locations and notes, respectively. According to the guidelines, the permissible voltage deviation at nodes ranges from 0.93 to 1.07 p.u., and the maximum allowable power flow for lines is 5.5 MW.
- (2)
PV and Load Data Simulation
To enhance the practicality of the case study, the example in this paper uses measured data from 16 load nodes and 8 photovoltaic sites in a distribution network in Gansu. The period is 1 h, and the time span is from 1 June to 31 August 2023 and 2024 (summer). A total of 92 data samples are collected at each time point (30 June, 31 July, and 31 September). The Beta distribution parameters of photovoltaic output and the normal distribution parameters of load output are estimated for each time point, as shown in
Figure 5 and
Figure 6. Based on this, scenario design and subsequent case analysis are carried out.
6.2. Analysis of the Impact of Static Voltage Stability on Distributed PV Bearing Capacity
The static voltage stability of the system is calculated under initial load and photovoltaic conditions, as shown in
Figure 7.
As shown in
Figure 7, node 6 and node 28 exhibit the highest
LSVS values of all the nodes, which are the static voltage stability weak nodes in this system.
When the net power load at each node of the distribution network increases proportionally, the LSVS value at node 6 reaches 1 first, followed by node 28. This indicates that voltage instability occurs first at node 6, followed by node 8. Therefore, the constraining effects of these two nodes should be given priority consideration in distributed PV bearing capacity assessment.
- (1)
The impact of connecting photovoltaic capacity
Taking node 6 as an example, the influence of different connecting capacities on static voltage stability is analyzed. The
LSVS values at node 6 are calculated under three scenarios—initial load conditions, connecting a 1 MW load at node 6, and connecting a 5 MW load at node 6—as shown in
Figure 8.
As shown in
Figure 8, after connecting photovoltaic at node 6, the load power margin significantly increases. Moreover, within a certain capacity range, the greater the photovoltaic connection capacity, the more pronounced the improvement effect. Because, in a radial network, the greater the load power of a node, the more severe the voltage drop becomes, resulting in a higher
LSVS value. Therefore, connecting the appropriate photovoltaic capacity in the radial network can increase the voltage amplitude at the end of the line, thereby enhancing the static voltage stability margin. However, as the distributed PV capacity continues to increase, reverse power flow occurs in the grid, causing the voltage at the end of the grid to rise and ultimately reducing the static voltage stability margin of the system. This observation is consistent with the conclusions derived from the aforementioned theoretical analysis.
- (2)
The impact of the connection location
Taking the example of connecting identical distributed PV capacities at nodes 6, 11, and, 33, the static voltage stability indicators for the system were calculated as shown in
Figure 9.
As shown in
Figure 9, when distributed PV is connected at nodes 6 and 11, the static voltage stability of the system is determined by the
LSVS value of node 6. But when the distributed PV is connected at node 33, the system′s static voltage stability is decided by the
LSVS value of node 28. This demonstrates that after distributed PV is connected to the grid, the system′s static voltage stability weak points will vary under different connection schemes. Therefore, during optimizing solutions, care must be taken to avoid consistently using the
LSVS value of the initial weak point as the
LSVS value of the system, which would lead to deviations in the bearing capacity assessment.
Meanwhile, different PV connection location schemes significantly influence the distributed PV bearing capacity. The following three cases are designed for comparative analysis:
Case 1: Distributed PV is permitted to connect only to a single centralized node. The simulation centralized connections to node 6, node 10, or node 11 individually.
Case 2: Distributed PV is permitted to connect to four nodes in a dispersed manner. The simulated node set is {6, 10, 11, 28}.
Case 3: Distributed PV is permitted to connect to eight nodes in a dispersed manner. The simulated node set is {4, 6, 10, 11, 15, 18, 22, 28}.
The simulation results for the three cases are shown in
Figure 10.
As shown in
Figure 10, when distributed PV is integrated at the same node, the maximum bearing capacity occurs at node 6. This shows that integrating distributed PV at the initial weak point allows the power grid to host more distributed PV, because the initial weak point typically experiences heavy load and low terminal voltage. The integration of distributed PV at this location can directly alleviate the terminal voltage issue.
When distributed PV is connected to different nodes in a dispersed manner, the improvement in voltage stability is relatively more effective compared to concentrated integration, because this access method effectively reduces the overall net load of the distribution network. The more dispersed the integration locations are, the more uniformly the net load is reduced at each node, resulting in better voltage improvement effects, a higher PV capacity inflection point as defined in Formula (11), and, ultimately, a greater bearing capacity for distributed PV.
6.3. Analysis of the Impact of Source–Load Correlation on Distributed PV Bearing Capacity
Based on the measured data, the correlation coefficient
ρ between PV output and active load power is calculated, as shown in
Figure 11.
Figure 11 reveals a strong correlation between photovoltaic output and load power.
This paper presents two case studies for simulation analysis, as follows:
Case 1: Based on the PV Beta distribution and load normal distribution functions, parameters are estimated via historical data, and load and PV output data are created by sampling methods.
Case 2: Based on the copula function correlation analysis method proposed in this paper, parameters are estimated from historical data, and sampling methods are applied to generate load and PV output data.
In both cases, the model proposed in
Section 4 was employed to evaluate the distributed PV bearing capacity. The distinction lies in the approach of sequential solutions: Case 2 utilized typical curves generated by considering source–load correlation, while Case 1 employed curves generated without considering such correlation.
The correlation coefficients between photovoltaic and load data under the two cases are shown in the figure below.
As shown in
Figure 12, the load and PV data generated using the copula function correlation analysis method proposed in this paper better align with actual conditions.
To analyze the impact of correlations between photovoltaic output and load random variables on PV bearing capacity, the following two scenarios are provided for comparative analysis.
Scenario 1: The distributed PV bearing capacity is evaluated without considering the correlation between PV and load. That is, typical scenarios of PV and load are generated without accounting for their correlation, and the assessment is performed based on these scenarios.
Scenario 2: The distributed PV hosting capacity is evaluated considering the correlation between PV and load. That is, typical scenarios of PV and load are generated, accounting for their correlation, and the assessment is conducted based on these scenarios.
The distributed PV bearing capacities under the two scenarios are shown in
Figure 13.
As shown in
Figure 13, the calculated PV bearing capacity under Scenario 2 is higher at each node, with a total value of 4.8 MW. Compared to Scenario 1, the calculation results increased by 0.72 MW. This is because when the connected photovoltaic system and the load exhibit a correlation, the load can promptly absorb the photovoltaic output, thereby increasing the grid′s capacity to accommodate distributed photovoltaic.
8. Discussion
In fact, the carrying capacity of distributed photovoltaics is related to many factors. In this article, we not only consider the factors in the “Guidelines for Evaluating the Carrying Capacity of Distributed Power Sources Connected to the Grid” (such as heavy overload of power flow, short-circuit current, voltage exceeding limits, etc.) but also consider static voltage stability, source–load correlation, and line loss economy, making the assessment of distributed photovoltaic carrying capacity more suitable for the current scenario where the proportion of distributed photovoltaic access is increasing.
However, in reality, distributed PV systems employing different grid connection control strategies have varying support capabilities for the system′s static voltage stability. For example, in constant power factor control mode, the inverter operates at a set power factor (e.g., PF = 1.0, meaning only active power is generated) and provides no voltage support to the grid.
In constant voltage control mode, the inverter monitors the grid-connection point voltage and maintains voltage stability by adjusting the reactive power. When the voltage rises, it absorbs reactive power; when the voltage drops, it generates reactive power, which contributes to system static voltage stability.
In active power derating control mode, when the grid-connection point voltage is excessively high, the active power output is proactively reduced, which can alleviate steady-state voltage overshoots but provides relatively insufficient support for static voltage stability.
Due to the complex coupling between distributed PV grid-connection control strategy parameter setting and load capacity assessment, this study assumes that distributed PV systems employ a constant power factor control mode. Subsequent research will focus on the impact of distributed grid-connection control strategies on load capacity assessment.