Considering the characteristics of distributed photovoltaics such as wide distribution area, scattered locations, large quantity, and small individual capacity, directly incorporating them into the operation and control framework of the power system will lead to problems such as a significant increase in solution complexity and high dimensionality of decision variables [
21]. Therefore, this paper aggregates PV resources with adjacent geographical locations and close electrical connections into cluster units through a partitioning approach, and establishes a hierarchical collaborative control architecture to realize cluster-level optimal scheduling and coordinated control of distributed PV [
22]. This paper proposes a method for distributed PV cluster partitioning and flexible peak-shaving based on temporal coupling SOM and a bi-level model, whose overall framework is shown in
Figure 1.
Under the above framework, the comprehensive indicators for distributed PV cluster partitioning used in this paper can be divided into three categories according to their characteristics, namely electrical coupling, power balance capability, and temporal characteristics. Electrical coupling focuses on considering the degree of electrical association between nodes within a cluster and between clusters to better evaluate the integrity and synergy of the clusters; power balance capability calculates the power balance level of each PV cluster and individual PV when participating in peak-shaving, providing a numerical basis for the next step of distributed peak-shaving; and temporal characteristics are used to measure the output variation rules and complementarity of PV clusters in different time periods, and according to the temporal characteristics of the clusters, corresponding peak-shaving tasks can be issued to each cluster.
2.1. Partition Index
The electrical coupling index is used to measure the strength of electrical association between clusters and is comprehensively evaluated using reactive power sensitivity, electrical distance, and node topological connection strength. Among them, reactive power sensitivity represents the degree of influence of load node power changes on voltage changes of other nodes, and its expression is given by Equation (1):
where
S (kV/kW) is the sensitivity matrix, indicating the response of node voltage changes to load node power changes; electrical distance reflects the tightness of electrical connections between nodes, where
is the ratio of voltage fluctuations between the node itself and related nodes when reactive power fluctuates. Detailed parameter information can be found in
Table A1.
By employing the sensitivity matrix, the relationship between a node’s own voltage fluctuation and the resulting voltage fluctuations at surrounding nodes
can be obtained, from which an electrical-distance
expression is derived.
Define the topological connection strength
T between node
i and node
j as the reciprocal of the shortest path length
between the two nodes:
The comprehensive electrical coupling indicator E is calculated as .
Define the net load
at time t as
where
is the system load at time
t and
is the self-consumed output of distributed PV at time
t. Both the system load and the self-consumed output of distributed PV are in KW.
The power balance capability is measured by indicators such as the net load ramp rate and load factor. The net load ramp rate represents the rate of change of net load per unit time, reflecting the intensity of load changes, and its expression is as follows:
where
and
are the net loads at time
t and
t + 1 respectively, and
(h) is the time interval.
The load factor refers to the ratio of the actual output of a photovoltaic cluster to its maximum possible output, reflecting the output utilization level of the photovoltaic cluster. In the calculation, it is necessary to monitor the actual power generation of the photovoltaic cluster in real time and combine it with the maximum generating power under the meteorological conditions of the day, which reflects the output utilization level of the photovoltaic cluster. Its expression is as follows:
where
is the actual output of the photovoltaic cluster and
is the maximum possible output.
The comprehensive power balance capability indicator P is calculated as .
The output curve correlation coefficient is calculated based on the Pearson correlation coefficient, which quantifies the linear correlation between the output sequences of two PV clusters over a continuous time period. For two PV clusters with output sequences
and
, the covariance
and standard deviations
,
are first calculated:
where
and
are the average outputs of the two clusters, respectively.
The output curve correlation coefficient
is shown in Equation (11) of this paper:
Finally, the comprehensive cluster partitioning index is generated by combining the electrical coupling, power balance capability, and temporal characteristic indicators as follows:
2.2. Adaptive SOM Algorithm
Distributed PV cluster partitioning leverages multidimensional features and partitioning algorithms to group PV plants with similar output patterns and close electrical connections, enabling large-scale management and coordinated peak-shaving. The self-organizing map (SOM) neural network [
23]—an unsupervised learning method—excels in data dimensionality reduction, feature extraction, and visualization, making it ideal for this task [
24]; its core idea is mapping high-dimensional input data to a low-dimensional topological structure via competitive learning while preserving data topological relationships. The topological structure of the traditional SOM is shown in
Figure 2.
The SOM algorithm takes the aforementioned comprehensive cluster partitioning indicators of distributed photovoltaics as input and obtains a distance set by calculating the Euclidean distance between each input layer data and the competitive layer neurons, specifically as shown in Equation (13):
where
is the comprehensive cluster partitioning indicator data of the
i-th node and
represents the connection strength between the
i-th neuron in the input layer and the
j-th neuron in the competitive layer, which initially takes a small value. The minimum distance is selected from the obtained distance set, and the competitive layer neuron represented by this distance is the winning neuron. The weight vectors of the winning neuron and the neurons in its neighborhood will be updated according to the rules calculated by Equation (14) [
21]:
where
is the learning rate, which gradually decreases with the training time
t and controls the step size of weight update;
is the neighborhood function, which centers on the winning neuron and gradually reduces the neighborhood range with the training time. Its role is to make the weight vectors of the winning neuron and its surrounding neurons move towards the direction of the input vector, thereby forming a topological representation of the input photovoltaic data distribution in the competitive layer. The neighborhood function
usually adopts the form of a Gaussian function:
where
and
are the positions of the winning neuron c and the neuron j in the competitive layer respectively, and
is the neighborhood width, which also gradually decreases with the training time
t.
However, traditional SOM has notable limitations in handling distributed PVs’ complex data, failing to fully address electrical coupling, power balance capability, and temporal characteristics—key aspects for our clustering task. Thus, we propose the TC-SOM model, tailored to align with our clustering needs through a two-stage division, improved similarity measurement and cluster validation. The two-stage division includes an electrical coupling layer and a power timing layer. The TC-SOM topology is illustrated in
Figure 3.
Among them, the input of the first-stage rough partitioning includes reactive power sensitivity, electrical distance, and node topological connection strength. The goal of this stage is to divide nodes with close electrical connections into the same initial cluster, using a low-dimensional mapping structure. By calculating the similarity of input indicators, nodes with high similarity are mapped to adjacent neuron positions, achieving close electrical coupling within the cluster. The second-stage fine partitioning is carried out on the basis of rough partitioning, with inputs including net load climbing rate, output curve correlation coefficient, and load rate, and a high-dimensional mapping structure is adopted. Through the analysis of these indicators, the power timing complementarity within the cluster is realized, so that the photovoltaic output within the same cluster can complement each other in time, improving the overall peak-shaving capability.
In terms of similarity measurement, dynamic weighted Euclidean distance is used to calculate the similarity between input data and neuron weights. Different weights are assigned according to the importance of different indicators in partitioning, and the weight values are dynamically adjusted based on data characteristics. For the electrical coupling layer, reactive power sensitivity, electrical distance, and node topological connection strength have larger weights; for the power timing layer, net load climbing rate, output curve correlation coefficient, and load rate have larger weights. The expression for dynamic weighted Euclidean distance is
where
is the weight of the
i-th indicator, and
and
are the
i-th indicator values of the input data and neuron weights, respectively.
In order to illustrate the necessity of increasing the hierarchy to meet the different requirements of clustering, the learning rate adaptive formula for the specially added hierarchy and the multi-level domain function formula based on the traditional SOM are presented. In addition, to illustrate the necessity of increasing the hierarchy to meet diverse clustering requirements, this study presents the learning rate adaptive formula for the specially added hierarchy and the multi-level domain function formula improved based on traditional SOM, while it adopts inter-cluster tie-line power fluctuation rate, intra-cluster self-balancing rate, and climbing flexibility deficit as evaluation indicators to verify the effectiveness of cluster division.
Based on the original single SOM layer, a multi-level SOM architecture is constructed. Each layer functions as an independent SOM network, responsible for further abstraction and feature extraction of the data from the previous layer. The specific process is shown in Equation (17).
where
represents the number of neurons in the
l-th layer.
Meanwhile, in response to the problem of slow convergence speed and easy falling into local minima caused by the fixed learning rate in traditional SOM, the proposed improvement scheme in this paper introduces an adaptive learning rate mechanism, as shown in Equation (18). Through dynamically adjusting the learning rate based on the trend of quantization error changes during the training process, the aim is to optimize the training effect.
where
is the initial learning rate,
is the decay period,
is the current quantization error, and
is the maximum quantization error. In the early stage of training,
is relatively large, and this term makes the learning rate larger, thus enabling rapid convergence; as the training progresses,
decreases, and the learning rate gradually becomes smaller, allowing for fine-tuning of weights to avoid getting stuck in local optima.
2.3. Overall Steps
Based on the abovementioned temporal coupling SOM algorithm, combined with three types of key parameters including electrical coupling, power balance capability, and temporal characteristics, the division scheme for each cluster is determined, and the specific process is shown in
Figure 4. Through the two-stage division structure, hierarchical partitioning processing of data is realized, and the accuracy of cluster partitioning is improved. Finally, through the visualized partitioning results, the load cluster partitioning results are output, with the specific steps as follows:
Step 1: Establish a multidimensional evaluation index system based on three types of key parameters, namely, electrical coupling, power balance capability, and temporal characteristics. Eliminate dimensional differences through standardization to form a feature data input set.
Step 2: Perform the first-stage rough. Input reactive power sensitivity, electrical distance, and node topological connection strength into the low-dimensional mapping SOM network to realize the initial partitioning of nodes with close electrical connections.
Step 3: On the basis of the rough partitioning results, perform the second-stage fine partitioning. Input the net load climbing rate, output curve correlation coefficient, and load rate into the high-dimensional mapping SOM network, and calculate the similarity through dynamic weighted Euclidean distance to realize fine partitioning of power timing complementarity within the cluster.
Step 4: Verify the effectiveness of the division results using indicators such as inter-cluster tie-line power fluctuation rate, intra-cluster self-balancing rate, and climbing flexibility deficit. Generate a load cluster partitioning map in combination with visualization analysis tools, and finally output the cluster optimization scheme that meets the peak-shaving demand.