Next Article in Journal
Control Applications with FPGA: Case of Approaching FPGAs for Students in an Intelligent Control Class
Previous Article in Journal
An Overview of Heavy Metals in Cosmetic Products and Their Toxicological Impact
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Level Evaluation for Flexible Load Regulation Potential in Distribution Network Based on Ensemble Clustering

1
Electric Power Research Institute, State Grid Anhui Electric Power Co., Ltd., Hefei 230601, China
2
State Grid Anhui Electric Power Co., Ltd., Hefei 230022, China
3
Anhui Provincial Key Laboratory of New Energy Utilization and Energy Saving, Hefei University of Technology, Hefei 230009, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(24), 12885; https://doi.org/10.3390/app152412885
Submission received: 10 November 2025 / Revised: 1 December 2025 / Accepted: 4 December 2025 / Published: 5 December 2025

Abstract

With the rapid increase in the renewable energy penetration rate in distribution networks, the volatility and uncertainty on the power supply side have become prominent; thus, it is urgent to fully utilize the regulation potential of the flexible load on the user side to maintain the dynamic balance of power. A multi-level evaluation method for flexible load regulation potential based on ensemble clustering is proposed in the paper. First, a data-driven approach based on ensemble clustering is adopted to quantify the user-level regulation potential of flexible load. Second, the bus-level regulation potential of the flexible load is obtained by aggregation calculation. Finally, a quantitative evaluation of the system-level regulation potential of flexible load in the distribution network is realized by constructing three optimization models with different objectives. Case studies show that the proposed method can effectively evaluate the regulation potential of flexible load in the distribution network from multiple levels, i.e., user level, bus level, and system level.

1. Introduction

In the context of the global energy transition, the continuous increase in the proportion of renewable energy sources has posed challenges of volatility and uncontrollability to the power system [1]. To ensure the safe and stable operation of distribution networks, there is an urgent need to tap into the regulation potential of various controllable resources such as battery energy storage devices, flexible load, and soft open points. Among these resources, the flexible load on the user side demonstrates unique advantages by virtue of its flexible response characteristics [2,3]. Therefore, the scientific assessment of the regulation potential of flexible load serves as a key technical foundation for giving full play to its regulation potential and supporting the stable operation of the system [4,5,6].
Scientific evaluation of the regulation potential of flexible load often requires collecting multi-dimensional data such as equipment parameters, user behavior characteristics, and environmental parameters of various types of flexible load, as well as conducting refined modeling of flexible load. On this basis, the evaluation of regulation potential is realized through methods such as the superposition method [7], state sequence model [8], and conditional value-at-risk (CVaR) model [9]. Gao et al. proposed a method combining a hierarchical federated perception algorithm and a hierarchical architecture, which enables the decomposition of flexible load at the substation level without exposing the privacy of individual users [10]. Wang et al. provided a predictive data foundation for assessing the dispatchable capacity and adjustable range of large-scale electric vehicle clusters by accurately characterizing the spatiotemporal aggregation characteristics of electric vehicle load [11]. Some studies further adopt the method of reduced-order modeling to lower the difficulty of modeling. For instance, Ge et al. describe the aggregated regulation potential of large-scale flexible load groups using a highly simplified low-order mathematical model, thereby reducing the difficulty of modeling flexible load groups [12]. Other studies have introduced advanced risk and optimization theories to address the uncertainties brought about by a high proportion of renewable energy: Lu et al. proposed a stochastic unit commitment model based on Conditional Value-at-Risk (CVaR). By quantifying the operational risks of isolated systems under high-proportion renewable energy penetration through CVaR, this model enables accurate evaluation of the risk-averse regulation potential of various flexible resources in the system [13]. Cao et al. utilized the theory of distributionally robust optimization to provide strict mathematical guarantees for addressing the uncertainties encountered in assessing the regulation potential of flexible load, ensuring that the assessed regulation potential remains reliable and usable under various extreme scenarios [14]. Xu et al. established an uncertainty model for renewable energy output using multi-scenario generation technology, and further proposed a comprehensive reliability assessment method for active distribution networks [15].
Although the aforementioned methods based on refined modeling of flexible load theoretically offer advantages in precision, in practical scenarios where large-scale heterogeneous flexible loads are connected, the difficulties in multi-dimensional data collection and high computational complexity lead to a sharp increase in engineering implementation costs and significantly limited operability. Therefore, some scholars at home and abroad have adopted data-driven approaches to bypass complex modeling and tap into the regulation potential of flexible load from their operational characteristics [16,17]. Baoyi et al. proposed a residential electricity load pattern recognition method based on the combination of K-Means clustering and a deep belief network. This method provides reliable distribution network maintenance for residential electricity load based on classification and recognition [18]. Li et al. assessed the spatiotemporal distribution and regulation potential of aggregated charging load through cluster analysis based on real-world electric vehicle travel data [19]. Wang et al. used a density-based clustering algorithm to classify electricity users into different groups, modeled the dynamic characteristics of each user’s electricity consumption using a Markov model, and further determined the demand response potential of users in each category [20]. Chen et al. proposed a two-stage model that integrates non-intrusive load disaggregation and advanced sequence prediction. In this model, the accurate separation of flexible loads is first achieved through a CNN-BiLSTM network, which effectively extracts their dynamic features; subsequently, the informer deep learning model is used for prediction [21]. Wang et al. first applied the singular value decomposition method to reduce the dimensionality of flexible load data, and then adopted a combined approach of K-Means clustering and convolutional neural network deep learning to improve the prediction accuracy of flexible loads [22]. Sun et al. proposed a two-layer clustering model that integrates K-Means, SOM, and BP neural networks, and conducted a comprehensive analysis of users’ load characteristics and regulation potential based on large-scale power load data [23].
However, existing studies still have some limitations. First, most works only focus on a single dimension, such as the user-level or bus-level, failing to construct a complete evaluation chain of “user–bus–system” and making it difficult to quantify the comprehensive value of flexible load from the global perspective of the system. Second, current data-driven evaluation methods mostly rely on a single clustering algorithm or the combination of a single clustering algorithm with deep learning technologies. The former has poor clustering stability, resulting in insufficient credibility of evaluation results; the latter, which combines with deep learning, usually relies on a large amount of data for model training and has low interpretability, which limits its practical application. It is worth noting that some scholars have applied ensemble clustering algorithms to the fields of load pattern identification and feature extraction [24,25]; for example, Sarmas et al. verified the superiority of ensemble clustering algorithms in load pattern identification by comparing multiple clustering methods [26]. Inspired by this, the paper proposes a multi-level evaluation method for flexible load regulation potential based on ensemble clustering. The main contributions of this paper are as follows:
(1)
A “user–bus–system” evaluation framework is established, thereby enabling a comprehensive assessment of flexible load regulation potential across multiple levels.
(2)
The stability and robustness of load clustering are enhanced by the ensemble clustering algorithm, thus providing a reliable foundation for bus-level assessment.
(3)
The system-level regulation effects of flexible loads are quantified by solving multiple system-level optimization models, which address minimizing the average fluctuation of system load, maximizing renewable energy consumption, and reducing the peak-valley difference in system load, respectively, and by a set of system contribution indexes.
The paper is organized as follows. Section 2 introduces the processes of the ensemble clustering algorithm and the user-level regulation potential evaluation of flexible load; Section 3 introduces the processes of the bus-level and system-level regulation potential evaluation of flexible load; and Section 4 presents the case studies for validating the effectiveness of the proposed evaluation method. Finally, Section 5 provides a summary of the paper.

2. Quantitative Evaluation of User-Level Flexible Load Regulation Potential

2.1. Quantitative Evaluation Framework for Multi-Level Flexible Load Regulation Potential

Figure 1 illustrates the “user–bus–system” three-level evaluation framework for the regulation potential of flexible load on the user side of the distribution network. First, at the user level, an ensemble clustering algorithm is utilized to analyze users’ historical load data, to characterize the baseline load of each user and their bidirectional regulation potential. Then, at the bus level, based on the topology of distribution networks, the load regulation potential of all users connected to the same bus is aggregated to generate the regulation potential of each bus in the distribution network. Finally, at the system level, three optimization models with different objective functions, i.e., minimizing the average fluctuation of system load, maximizing renewable energy consumption, and minimizing the system load peak-valley difference, respectively, are constructed. By solving the three optimization models, the value and efficiency of flexible load at the system level are comprehensively evaluated.

2.2. Ensemble Clustering Algorithm

The ensemble clustering algorithm typically involves two important steps: the first step is to generate a set of base clusters, and the second step is to construct a consensus matrix from the generated set of base clusters and ensemble base clusters to obtain the final clustering result [27]. Figure 2 illustrates the basic workflow of the ensemble clustering algorithm.
The specific steps for clustering the original dataset X N using the ensemble clustering algorithm constructed in this paper are detailed below:
(1)
Perform B times of Bootstrap Resampling on the original dataset X N to generate B training sample sets X = { X N 1 , X N 2 , , X N B } . The original dataset X N = { x 1 , x 2 , , x i , , x N } contains N historical load curves, where x i represents the i-th load curve in the dataset X N .
The primary purpose of resampling is to enhance the overall stability of the algorithm by performing clustering on multiple sample sets, thereby mitigating the effects of randomness in initial center selection and interference from data outliers. Furthermore, it is established in the literature that a bootstrap size of 10 (B = 10) is typically sufficient to virtually encompass all distinct data points from the original dataset [24].
(2)
Perform K-Means clustering on each training sample set to obtain B base cluster results Π = { π 1 , π 2 , , π b , , π B } , where π b = { C 1 b , C 2 b , , C i b } represents the b-th base cluster, and C i b denotes the i-th cluster in the b-th base cluster.
(3)
Based on these base clusters, a consensus matrix A is constructed. The consensus matrix A records the frequency at which any two load curves in the original dataset X N are assigned to the same cluster across all base clusters. It is an N × N matrix, and the calculation formula for its element a i j (located at the i-th row and j-th column) is as follows:
a i j = 1 B b = 1 B δ π b x i , x j
δ π b x i , x j = 1 ,   x i   and   x j   belong to the same cluster in the base cluster  π b 0 ,                                                                                                                                                                     other cases
where x i and x j , respectively, represent the i-th and j-th load curves in the original dataset X N .
(4)
Based on the consensus matrix, a hierarchical clustering algorithm is used to perform the final clustering division on the original dataset X N . Each load curve in the original dataset X N is regarded as an independent cluster, and then the two clusters with the highest similarity are merged iteratively until the number of clusters reaches the optimal number of clusters K , at which point the final clustering result is obtained. The calculation formula for the inter-cluster similarity S C p , C q is as follows:
S C p , C q = 1 n p n q i S C p j S C q a i j
where n p and n q represent the number of samples in the cluster C p and C q , and S C p and S C q denote the index sets of samples in the cluster C p and C q , respectively.
The DBI (Davies–Bouldin Index) is used to determine the optimal number of clusters K for the original dataset X N in this paper. The DBI, shown as (4), measures the compactness of intra-cluster samples and the separation of inter-cluster samples and a smaller value indicates a better clustering effect [24].
I DBI = 1 k i = 1 k max i j d C i + d C j d C i , C j
where I DBI denotes the value of DBI; d C i and d C j represent the intra-cluster average distances of clusters C i and C j , respectively; d C i , C j is the centroid distance between C i and C j ; and k is the number of clusters.
When specifically determining the optimal number of clusters K , it is necessary to iterate through the range of k values, perform K-Means clustering on the original dataset X N sequentially, and calculate the DBI. Finally, the value of k corresponding to the minimum DBI is selected as the optimal number of clusters, denoted as K . It should be noted that the value range for the number of clusters, k, is typically determined based on the data characteristics and research objectives. Considering the characteristics of the power load curve data used in this paper and the goal of cluster analysis, the value range for k is set from 2 to 10 [28].
Through the above steps, the original dataset X N can be divided into K clusters using the ensemble clustering algorithm.

2.3. Quantitative Evaluation of User-Level Regulation Potential

The evaluation process of user-level regulation potential is as follows:
(1)
A full year of historical load data prior to the regulation date is first acquired for each user. This period is chosen as it is considered to virtually encompass the entire spectrum of the user’s electricity consumption characteristics [15].
(2)
By calculating the average power of the user’s load during the N days before regulation, the user’s baseline load curve P b a s e ( t ) is constructed:
P b a s e t = i = 1 N P i t N ,   t = 1,2 , , T
where P i  represents the load curve of the i-th day before the regulation day and T denotes the total number of time intervals. If the dataset adopts a 15 min time resolution, then T = 96 .
(3)
The baseline load curve P b a s e ( t ) is classified into the cluster C M with which it has the maximum membership degree. The calculation formula for the membership degree ξ i between the baseline load curve P b a s e ( t ) and cluster C i is as follows:
ξ i = 1 j = 1 K P b a s e ( t ) P ¯ C i ( t ) 2 P b a s e ( t ) P ¯ C j ( t ) 2 2 ,   i = 1,2 , , K
where P ¯ C i ( t )  and P ¯ C j ( t ) represent the sample centroids of cluster C i and C j , respectively.
(4)
After determining the cluster C M to which the user’s baseline load curve P b a s e ( t ) belongs, load power values are extracted at each time t from the historical load data in the cluster C M . The 0.95th and 0.05th percentiles of these values are then calculated, which are defined as the user’s upper power limit P m a x ( t ) and lower power limit P m i n ( t ) at time t , respectively. The interval [ P m i n t , P m a x t ] represents the typical power fluctuation range of the user at time t.
(5)
By comparing the baseline load P b a s e ( t ) on the regulation day with the fluctuation interval [ P m i n t , P m a x t ] , the user-level regulation potential is quantified as follows:
P u p ( t ) = max ( 0 , P m a x t P b a s e ( t ) )
P d o w n ( t ) = max ( 0 , P b a s e ( t ) P m i n t )
where P u p ( t ) and P d o w n ( t ) represent the user’s upward regulation potential and downward regulation potential, respectively.
Finally, the upward and downward regulation potential curves of the user are formed by the sets { P u p t | t = 1,2 , , T } and { P d o w n ( t ) | t = 1,2 , , T } , respectively, thereby completing the quantification of the user-level regulation potential.

3. Quantitative Evaluation of Flexible Load Regulation Potential at Bus-Level and System-Level

3.1. Quantitative Evaluation of Bus-Level Regulation Potential

The quantification of bus-level regulation potential plays a connecting role in the entire evaluation framework: it is not only the spatial aggregation of user-level evaluation results, but also the data foundation for system-level optimization analysis.
The bus-level regulation potential is obtained by aggregating the potential curves of all users connected to the same bus. Specifically, for a given bus m , its bus-level baseline load P b u s , m b a s e ( t ) is obtained by adding the baseline load values of the users connected to bus m at time t , shown as follows:
P b u s , m b a s e ( t ) = i = 1 N m P m i b a s e t
where P m i b a s e t represents the baseline load of the i-th user connected to bus m at time t .
Similarly, the curves of the total upward regulation potential P b u s , m u p t and total downward regulation potential P b u s , m d o w n t of bus m are generated by adding the values of user-level potential curves at the same time t :
P b u s , m u p t = i = 1 N m P m i u p t
P b u s , m d o w n t = i = 1 N m P m i d o w n t
where P m i u p t  and P m i d o w n t represent the upward regulation potential and downward regulation potential of the i-th user connected to bus m at time t , respectively.
Through the above calculations, the spatial aggregation of regulation potential from the user-level to the bus-level can be completed.

3.2. Quantitative Evaluation of System-Level Regulation Potential

Through the aforementioned method, the baseline load, upward regulation potential, and downward regulation potential of flexible load at each bus in the distribution network can be obtained. On this basis, according to the actual operational requirements of the distribution network, three objective functions are established: minimizing the average fluctuation of the system load, maximizing the renewable energy consumption, and minimizing the peak-valley difference in the system load. Comprehensive consideration is given to distribution network operational constraints, flexible load regulation constraints, and other factors, thereby constructing a system-level evaluation model for flexible load regulation potential. Furtherly, the contribution values of flexible load to the distribution network are assessed.

3.2.1. Optimization Objectives

(1)
With the objective of minimizing the average fluctuation of system load,
min  F a = 1 N t 1 i = 1 N b u s t = 1 N t 1 P b u s , i t + 1 P b u s , i t
where F a is the average fluctuation of system load; N b u s  is the number of buses in the distribution network; N t  is the number of sampling time points of the daily load; and P b u s , i t  represents the load value of bus i  at time t .
(2)
With the objective of maximizing the consumption of renewable energy,
max  F c = i = 1 N b u s t = 1 N t P b u s , i n e w ( t )
where F c is the consumption of renewable energy and   P b u s , i n e w ( t ) represents the renewable energy consumption of bus i  at time t .
(3)
With the objective of minimizing the peak-valley difference in system load,
min   F p = max t { 1,2 , , N t } i = 1 N b u s P b u s , i t min t { 1,2 , , N t } i = 1 N b u s P b u s , i t
where F p is the peak-valley difference in system load.

3.2.2. Constraints

The constraint conditions include power flow balance constraints, bus voltage and branch current constraints, regulation potential constraints, etc. The details are as follows:
(1)
Power flow balance constraints. These constraints ensure the real-time balance of active power and reactive power in the distribution network:
k w j P j k t i u j P i j t R i j I i j 2 t = P b u s , j n e w t P b u s , j b a s e t P b u s , j t
k w j Q j k t i u j Q i j t X i j I i j 2 t = ( P b u s , j b a s e t + P b u s , j t ) · tan φ j t
where u j denotes the set of starting bus of branches with j as the ending bus; w j denotes the set of ending bus of branches with j as the starting bus; P i j t and Q i j t represent the active power and reactive power flowing through branch i j at time t , respectively; P j k t and Q j k t represent the active power and reactive power flowing through branch j k at time t , respectively; φ j t is the power factor angle of the load at bus j at time t ; and P b u s , j t is the load adjustment amount of bus j at time t .
(2)
Bus voltage and branch current constraints. These constraints ensure that the voltage of all buses is within the allowable range and that the branch current does not exceed the thermal limit, to prevent equipment overload and ensure power supply safety:
V j 2 t = V i 2 t 2 R i j P i j t + X i j Q i j t + Z i j 2 I i j 2 t
V i , m i n V i t V i , m a x
I i j 2 t I i j , m a x 2
where V i t denotes the voltage amplitude of bus i at time t ; R i j represents the resistance of branch i j ; X i j stands for the reactance of branch i j ; Z i j indicates the impedance of branch i j ; V i , m a x and V i , m i n represent the upper limit and lower limit of voltage of bus i , respectively; I i j t denotes the current-carrying capacity of branch i j at time t ; and I i j , m a x represents the safe current of branch i j .
(3)
Regulation potential constraints. These constraints ensure that the actual load adjustment amount of each bus does not exceed its upper and lower limits, and that the output of renewable energy does not exceed the predicted value:
P b u s , j d o w n t P b u s , j t P b u s , j u p t
0 P b u s , j n e w t P b u s , j n e w , p r e t
where P b u s , j t is the load adjustment amount of bus j at time t ; P b u s , j n e w t is the active power output of renewable energy of bus j at time t ; and P b u s , j n e w , p r e t is the predicted value of renewable energy generation of bus j at time t .

3.2.3. Convex Relaxation of the Optimization Model

To improve the solution efficiency and numerical stability, the original non-convex optimization problem is transformed into a convex programming form for solution using convex relaxation techniques [9], as follows:
First, variables l i j t = I i j 2 t and v i t = V i 2 t are introduced to eliminate the quadratic terms of current and voltage. The active power and reactive power balance constraints are converted into the following forms:
k w j P j k t i u j P i j t R i j l i j t = P b u s , j n e w t P b u s , j b a s e t P b u s , j t
k w j Q j k t i u j Q i j t X i j l i j t = ( P b u s , j b a s e t + P b u s , j t ) · tan φ j t
Next, the big-M method and the second-order cone relaxation (SOCR) method are adopted to convert the power flow constraints and the voltage–current security constraints into the following forms:
v i t v j t 2 R i j P i j t + X i j Q i j t + Z i j 2 I i j 2 t + M 1 α i j t 0
v i t v j t 2 R i j P i j t + X i j Q i j t + Z i j 2 I i j 2 t + M 1 α i j t 0
2 P i j t , 2 Q i j t , l i j t v i t T 2 l i j t + v i t
V i , min 2 v i t V i , max 2
0 l i j t α i j t I i j , max 2
where M  is a sufficiently large number (needing only to be two orders of magnitude larger than v i t ), and α i j t  denotes the state variable for branch switching—taking 0 when the branch is disconnected and 1 when closed.
The SOCR applied to the DistFlow model is known to be exact (i.e., the relaxation is tight) for radial distribution networks under typical operating conditions [29,30]. The key conditions guaranteeing tightness are as follows:
(1)
The network topology is radial.
(2)
The objective function is monotonically increasing with respect to branch power losses.
(3)
The system does not operate at the boundary of the feasible region (e.g., no line power reaches its limit, and node voltages are far from the upper and lower bounds).
In the context of the proposed optimization model for evaluating the regulation potential of flexible load, the aforementioned conditions for tightness are satisfied for the following reasons:
(1)
The IEEE 33-bus and PG&E 69-bus test systems used in the case studies are radial in structure and operate within normal voltage and current limits, thus fulfilling conditions (1) and (3).
(2)
The primary objective functions defined in Equations (12) and (14), which aim to minimize load fluctuation and peak-valley difference, respectively, inherently discourage excessive power flow and associated losses. Since branch power losses ( R i j l i j t ) are monotonically increasing functions of the squared current magnitude l i j t , minimizing system-level power variations implicitly creates pressure to reduce l i j t . Therefore, these objective functions are consistent with the requirement of being non-decreasing with respect to branch losses, ensuring the relaxation’s exactness. For the objective of maximizing renewable consumption (Equation (13)), the relaxation remains exact in radial networks as the solution typically lies within a non-boundary operating point.
Therefore, the solutions obtained from the relaxed SOCR model (Equations (22)–(28)) are globally optimal for the original non-convex problem, ensuring the accuracy of evaluation results.

3.2.4. Contribution Degree Indexes

To quantitatively analyze the effect of flexibility load on smoothing fluctuation of system load, enhancing consumption of renewable energy and minimizing the load peak-valley difference, three contribution degree indexes are set as follows:
(1)
Contribution degree to the average fluctuation of system load:
D a = F a ( 0 ) F a F a ( 0 ) × 100 %
(2)
Contribution degree to renewable energy consumption:
D c = F c F c ( 0 ) F c ( 0 ) × 100 %
(3)
Contribution degree to the peak-valley difference in system load:
D p = F p ( 0 ) F P F p ( 0 ) × 100 %
where D a , D c , and D p represent the contribution degrees of flexible load to minimizing the average fluctuation of system load, maximizing renewable energy consumption, and minimizing the peak-valley difference in system load, respectively; F a ( 0 ) , F c ( 0 ) , and F p ( 0 ) denote the average fluctuation of system load, renewable energy consumption, and peak-valley difference in system load without considering the regulation potential of flexible load, respectively; and F a , F c , and F P stand for the average fluctuation of system load, renewable energy consumption, and peak-valley difference in system load when the regulation potential of flexible load is considered, respectively.

4. Case Study Analysis

4.1. Case Introduction

A modified IEEE 33-bus distribution network, the structure of which is shown in Figure 3, is used for a case study to verify the effectiveness of the evaluation framework proposed in this paper. This system consists of 33 buses and 34 branches, with a base voltage of 12.66 kV and a total maximum system load of (7.617 + j3.473) MVA. Distributed wind power sources with maximum capacities of 1.7 MVA and 2.0 MVA are connected to bus 5 and bus 13 in the distribution network, respectively.
The historical load data used herein is from the annual electricity load dataset of 2009 users in a certain region, provided by the National Basic Discipline Public Science Data Center in China, with a time resolution of 15 min. In this case study, flexible load users are connected to buses 9, 11, 16, 20, 24, and 29, while the load at the remaining buses are rigid load. The number of flexible load users connected to each bus is presented in Table 1.
This case study aims to verify the following two aspects:
(1)
The stability and robustness of the proposed ensemble clustering algorithm in the quantitative evaluation of user-level regulation potential;
(2)
The effectiveness of the proposed system-level optimization model in quantitatively evaluating the system-level contribution of flexible loads to the distribution network.

4.2. Performance Verification of Ensemble Clustering

(1)
Stability analysis of ensemble clustering
Accurate and stable clustering of user load curves is the cornerstone for precise quantification of user-level regulation potential. In view of this, this section conducts a systematic comparison between the ensemble clustering algorithm proposed in this paper and three single clustering algorithms, i.e., K-Means, hierarchical clustering, and density-based spatial clustering of applications with noise (DBSCAN), to verify the stability of the proposed algorithm in load curve clustering.
The DBI is used as the criterion for evaluating clustering quality. The verification is conducted from two perspectives: the stability under repeated clustering for a single user and the stability across different users. First, the K-Means clustering, hierarchical clustering, DBSCAN, and ensemble clustering algorithms are used to conduct multiple clustering experiments on the load data of the same user in the dataset. The boxplots of DBI are shown in Figure 4a. The box height corresponding to the ensemble clustering algorithm is the smallest, indicating that its clustering results have the lowest volatility and the best stability for the specific user. To further validate its generalizability, we extend the experiment to 100 users from the dataset. The DBI distributions of the clustering results across these users are illustrated in Figure 4b. Similarly, the box corresponding to the ensemble clustering algorithm exhibits the smallest height and the lowest position among all methods. This demonstrates that the proposed algorithm not only maintains minimal volatility for a single user but also achieves the most consistent and superior clustering performance across a diverse user population, thereby confirming its exceptional stability.
(2)
Robustness analysis of ensemble clustering
To verify the robustness of the ensemble clustering algorithm in resisting noise interference, simulated load data with known category labels are used to conduct clustering performance tests. The simulated dataset contains 4 categories of daily load curves of users with distinct patterns, with 200 load curves in each category. By injecting noise curves of different noise content into the data, the impact of noise content on the performance of four algorithms—K-Means, hierarchical clustering, DBSCAN and the ensemble clustering algorithm—is evaluated from two aspects: the clustering accuracy rate and DBI. The experimental results are shown in Figure 5.
As can be seen from Figure 5, with the increase in noise content, the clustering accuracy rates decrease and the values of DBI increase, indicating that noise could greatly degrade the clustering effect. However, the accuracy rate and the value of the DBI by ensemble clustering are still the highest and smallest, respectively, which verifies that the ensemble clustering algorithm is the least affected by noise among the four algorithms and has good robustness when facing noise disturbances.

4.3. Evaluation of User-Level Regulation Potential

The above experiments confirm the stability and robustness of the ensemble clustering algorithm in the clustering of load curves, which provides the foundation for the accurate quantitative evaluation of user-level regulation potential. Figure 6 shows the results of clustering the load curves of a certain user at bus 9 by the ensemble clustering algorithm. It can be observed that the load curves are divided into three clusters with significant differences, and each cluster represents a distinct electricity consumption behavior pattern, which intuitively reflects the diversity of user load characteristics.
Based on the aforementioned clustering results, Figure 7 presents the quantitative results of the regulation potential for this user. The figure clearly shows the upper and lower limits of the user’s load power, the upper and lower limits of regulation potential, and the temporal distribution characteristics of the user’s load power and regulation potential, providing data support for the subsequent bus-level and system-level evaluations.

4.4. Evaluation of System-Level Regulation Potential

Figure 8, Figure 9 and Figure 10 show the regulatory effects of flexible load by three optimization objectives: minimizing the average fluctuation of system load, maximizing the consumption of renewable energy, and minimizing the peak-valley difference in system load. The results indicate that flexible load can play an effective regulatory role in reducing the fluctuation of system load, promoting wind power consumption, and decreasing the peak-valley difference in system load.
Figure 11 presents the contribution degree of flexible loads to each operational objective. As can be seen from the figure, in this case study, the contribution of flexible loads to minimizing the average fluctuation of system load is the most prominent, while they also make significant contributions to maximizing wind power consumption and minimizing the peak-valley difference in system load.
The above results verify that by solving and analyzing the system-level flexible load regulation potential evaluation model proposed in this paper, the contribution degree of flexible loads to the distribution network system in aspects such as minimizing the average fluctuation of system load, maximizing the accommodation of renewable energy, and minimizing the peak-valley difference in system load can be effectively quantified.

4.5. Cross-System Validation Based on the PG&E 69-Bus System

This section aims to further validate the effectiveness of the proposed multi-level evaluation framework by applying it to a larger and more complex US PG&E 69-bus distribution system. Compared to the IEEE 33-bus system, the PG&E 69-bus system, which is a simplification of a real-world grid, features a larger scale and more representative topological structure, enabling more effective simulation of real-world distribution networks.

4.5.1. Case Introduction for the Modified PG&E 69-Bus Distribution System

Figure 12 illustrates the structure of the modified PG&E 69-bus distribution system. The modified PG&E 69-bus distribution system consists of 69 buses and 73 branches, with a base voltage of 12.66 kV and a total maximum system load of (18.063 + j7.874) MVA. Distributed wind power sources with maximum capacities of 1.7 MVA and 2.0 MVA are connected to bus 20 and bus 30 in the distribution network, respectively, and a distributed photovoltaic (PV) system with a maximum capacity of 1.7 MVA is connected to bus 5.
In the PG&E 69-bus system, a larger number of flexible load users are integrated. The specific buses to which they are connected are marked in blue in Figure 12, with an average of 10 flexible load users per bus, while the loads at the remaining buses are rigid. The historical load data for these users still originates from the aforementioned dataset.

4.5.2. Evaluation of System-Level Regulation Potential for the Modified PG&E 69-Bus Distribution System

Figure 13, Figure 14 and Figure 15 show the regulatory effects of flexible loads under the three optimization objectives in the PG&E 69-bus system. It can be observed that flexible loads also play an effective regulatory role in this system by reducing load fluctuation, promoting renewable energy consumption, and minimizing the peak-valley difference in system load. Furthermore, due to the integration of a larger number of flexible loads, the regulatory effects are more significant.
Table 2 compares the system contribution indexes between the two test systems. The results clearly show that the PG&E 69-bus system achieves more pronounced regulatory effects across all three aspects. This significant enhancement is attributable to the substantially larger aggregate capacity of flexible loads integrated into the PG&E 69-bus system, which is approximately 2.4 times that of the IEEE 33-bus system. This successful application in a significantly different and larger system demonstrates the method’s independence from a specific test case and its effectiveness in quantifying the system-level regulation potential of flexible loads in networks of varying scale and structure.

5. Conclusions

To conduct quantitative evaluation on the regulation potential of flexible loads in distribution networks, this paper proposes a multi-level evaluation method for flexible load regulation potential based on ensemble clustering. This method constructs a multi-level evaluation framework ranging from the user-level to the bus-level and then to the system-level. By leveraging the ensemble clustering algorithm, the stability and robustness of load clustering are enhanced, thereby ensuring the quality of quantitative evaluation results. Through case analysis, the main conclusions are drawn as follows:
(1)
Compared to some traditional single clustering algorithms, the ensemble clustering algorithm adopted in this paper can effectively improve the stability and robustness of load clustering, thus providing a guarantee for the stability of the quantitative evaluation results of flexible load regulation potential.
(2)
The “user–bus–system” three-level evaluation framework constructed in this paper enables quantitative evaluation of flexible load regulation potential at multiple levels. Firstly, the power regulation range of each user can be accurately characterized based on the ensemble clustering algorithm. Then, the regulation potential of each bus in the distribution network can be obtained through aggregation calculation. Finally, by solving multiple system-level optimization models, the regulation effects of flexible loads in aspects such as minimizing the average fluctuation of system load, maximizing renewable energy consumption, and minimizing the peak-valley difference in system load can be achieved. Moreover, the system-level regulation effects in various aspects can be accurately quantified through multiple system contribution indexes.
The method proposed in this paper still has certain limitations. For instance, the system-level optimization model does not explicitly incorporate the uncertainties associated with renewable energy generation and load forecasts, which are inherent in real-world power system operations. Addressing these uncertainties is crucial for further enhancing the robustness and practical applicability of the evaluation results. Future research will focus on integrating uncertainty modeling techniques, such as stochastic optimization or robust optimization, into the proposed framework to account for forecast errors in renewable energy output and load demand. Additionally, the method may be extended and combined with refined modeling approaches to support the quantitative evaluation of regulation potential for emerging types of flexible loads, such as green hydrogen facilities, 5G base stations, and data centers.

Author Contributions

Conceptualization, W.L.; methodology, W.L. and M.P.; software, C.Z. (Cheng Zhao); validation, C.Z. (Cheng Zhao) and M.P.; formal analysis, C.Z. (Cheng Zhao) and C.Z. (Chao Zhen); investigation, H.L. and X.Q.; resources, W.L.; data curation, C.Z. (Cheng Zhao); writing—original draft preparation, W.L.; writing—review and editing, W.L., C.Z. (Cheng Zhao), C.Z. (Chao Zhen) and X.Q.; visualization, C.Z. (Chao Zhen); supervision, M.P. and X.Q.; project administration, W.L., H.L. and X.Q.; funding acquisition, W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Science and Technology Project of State Grid Anhui Electric Power Co., Ltd. (No. B3120524003K).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Dataset available on request from the authors.

Conflicts of Interest

Authors Wei Lou and Cheng Zhao were employed by the company Electric Power Research Institute, State Grid Anhui Electric Power Co., Ltd. Authors Min Pan and Chao Zhen were employed by the company State Grid Anhui Electric Power Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Nikoobakht, A.; Aghaei, J.; Shafie-Khah, M.; Catalao, J.P.S. Assessing increased flexibility of energy storage and demand response to accommodate a high penetration of renewable energy sources. IEEE Trans. Sustain. Energy 2018, 10, 659–669. [Google Scholar] [CrossRef]
  2. McPherson, M.; Stoll, B. Demand response for variable renewable energy integration: A proposed approach and its impacts. Energy 2020, 197, 117205. [Google Scholar] [CrossRef]
  3. Huang, S.; Abedinia, O. Investigation in economic analysis of microgrids based on renewable energy uncertainty and demand response in the electricity market. Energy 2021, 225, 120247. [Google Scholar] [CrossRef]
  4. Yang, X.; Xu, C.; He, H.; Yao, W.; Wen, J.; Zhang, Y. Flexibility provisions in active distribution networks with uncertainties. IEEE Trans. Sustain. Energy 2020, 12, 553–567. [Google Scholar] [CrossRef]
  5. Huang, C.; Zhang, H.; Song, Y.; Wang, L.; Ahmad, T.; Luo, X. Demand response for industrial micro-grid considering photovoltaic power uncertainty and battery operational cost. IEEE Trans. Smart Grid 2021, 12, 3043–3055. [Google Scholar] [CrossRef]
  6. Jing, Z.; Gao, L.; Mu, Y.; Liang, D. Flexibility-Constrained Energy Storage System Placement for Flexible Interconnected Distribution Networks. Sustainability 2024, 16, 9129. [Google Scholar] [CrossRef]
  7. Wang, G.; Li, Z.; Wang, F. Enhanced sufficient battery model for aggregate flexibility of thermostatically controlled loads considering coupling constraints. IEEE Trans. Sustain. Energy 2021, 12, 2493–2496. [Google Scholar] [CrossRef]
  8. Zhou, Z.; Guo, X.; Xiong, Y.J.; Xia, C.-M. Kalman-SSM: Modeling long-term time series with kalman filter structured state spaces. IEEE Signal Process. Lett. 2024, 31, 2470–2474. [Google Scholar] [CrossRef]
  9. Zhang, J.C.; Li, M.; Liu, Z.W.; Tan, J.; Tao, Y.G.; Luo, T.L. An evaluation method for multi-type flexible resource regulation capability on the user side of distribution networks. Electr. Power 2023, 56, 96–103. [Google Scholar]
  10. Gao, A.; Zheng, J.; Mei, F.; Liu, Y. Substation-Level flexible load disaggregation based on hierarchical federated perception algorithm. IEEE Trans. Smart Grid 2024, 15, 5787–5798. [Google Scholar] [CrossRef]
  11. Wang, B.; Zhao, D.; Dehghanian, P.; Tian, Y.; Hong, T. Aggregated electric vehicle load modeling in large-scale electric power systems. IEEE Trans. Ind. Appl. 2020, 56, 5796–5810. [Google Scholar] [CrossRef]
  12. Ge, W.; Wang, X. Study of source-load interaction in demand response considering the potential of flexible load dispatch. Chin. J. Electr. Eng. 2024, 10, 141–151. [Google Scholar] [CrossRef]
  13. Lu, S.; Ding, J.; Chen, M.; Gu, W.; Zhu, J.; Xu, Y.; Dong, Z.; Sun, Z. On the solution uniqueness of data-driven modeling of flexible loads. IEEE Trans. Smart Grid 2024, 16, 1993–1996. [Google Scholar] [CrossRef]
  14. Cao, Y.; Wei, W.; Mei, S.; Shafie-Khah, M.; Catalao, J.P.S. Analyzing and quantifying the intrinsic distributional robustness of CVaR reformulation for chance-constrained stochastic programs. IEEE Trans. Power Syst. 2020, 35, 4908–4911. [Google Scholar] [CrossRef]
  15. Xu, W.; Zeng, S.; Du, X.; Zhao, J.; He, Y.; Wu, X. Reliability of active distribution network considering uncertainty of distribution generation and load. Electronics 2023, 12, 1363. [Google Scholar] [CrossRef]
  16. Kong, X.; Ma, Y.; Ai, Q.; Zhang, X.; Li, C.; Xiao, B. Review on electricity consumption characteristic modeling and load forecasting for diverse users in new power system. Autom. Electr. Power Syst. 2023, 47, 2–17. [Google Scholar]
  17. Peng, D.; Pei, W.; Xiao, H.; Yang, Y.; Tang, C. Data-driven consumer demand response behavior modelization and application. Power Syst. Technol. 2021, 45, 2577–2586. [Google Scholar]
  18. Baoyi, W.; Jin, L.; Shaomin, Z. Study of residential power load patterns based on clustering and deep belief network. J. Netw. Comput. Appl. 2017, 2, 7–13. [Google Scholar] [CrossRef]
  19. Li, C.; Dong, Z.; Chen, G.; Zhou, B.; Zhang, J.; Yu, X. Data-driven planning of electric vehicle charging infrastructure: A case study of Sydney, Australia. IEEE Trans. Smart Grid 2021, 12, 3289–3304. [Google Scholar] [CrossRef]
  20. Wang, Y.; Chen, Q.; Kang, C.; Xia, Q. Clustering of electricity consumption behavior dynamics toward big data applications. IEEE Trans. Smart Grid 2016, 7, 2437–2447. [Google Scholar] [CrossRef]
  21. Chen, T.; Wan, W.; Li, X.; Qin, H.; Yan, W. Flexible load multi-step forecasting method based on non-intrusive load decomposition. Electronics 2023, 12, 2842. [Google Scholar] [CrossRef]
  22. Wang, K.; Du, H.; Wang, J.; Jia, R.; Zong, Z. An ensemble deep learning model for provincial load forecasting based on reduced dimensional clustering and decomposition strategies. Mathematics 2023, 11, 2786. [Google Scholar] [CrossRef]
  23. Sun, Y.; Mao, Y.; Li, Z.; Zhang, X.; Li, F. A comprehensive clustering method of user load characteristics and adjustable potential based on power big data. Proc. CSEE 2021, 41, 6259–6271. [Google Scholar]
  24. Zhang, B.; Zhuang, C.; Hu, J.; Chen, S.; Zhang, M.; Wang, K.; Zeng, R. Ensemble clustering algorithm combined with dimension reduction techniques for power load profiles. Proc. CSEE 2015, 35, 3741–3749. [Google Scholar]
  25. Pereira, E.C.; Barbosa, C.H.N.R.; Vasconcelos, J.A. Distribution network reconfiguration using iterative branch exchange and clustering technique. Energies 2023, 16, 2395. [Google Scholar] [CrossRef]
  26. Sarmas, E.; Fragkiadaki, A.; Marinakis, V. Explainable AI-based ensemble clustering for load profiling and demand response. Energies 2024, 17, 5559. [Google Scholar] [CrossRef]
  27. Huang, D.; Wang, C.D.; Lai, J.H.; Kwoh, C.-K. Toward multidiversified ensemble clustering of high-dimensional data: From subspaces to metrics and beyond. IEEE Trans. Cybern. 2021, 52, 12231–12244. [Google Scholar] [CrossRef]
  28. Li, X.; Wang, P.; Liu, Y.; Xu, L. Massive load pattern identification method considering class imbalance. Proc. CSEE 2020, 40, 128–137. [Google Scholar]
  29. Farivar, M.; Low, S.H. Branch flow model: Relaxations and convexification—Part I. IEEE Trans. Power Syst. 2013, 28, 2554–2564. [Google Scholar] [CrossRef]
  30. Baran, M.E.; Wu, F.F. Optimal capacitor placement on radial distribution systems. IEEE Trans. Power Deliv. 2002, 4, 725–734. [Google Scholar] [CrossRef]
Figure 1. Multi-level evaluation framework for flexible load regulation potential.
Figure 1. Multi-level evaluation framework for flexible load regulation potential.
Applsci 15 12885 g001
Figure 2. Basic workflow of the ensemble clustering algorithm.
Figure 2. Basic workflow of the ensemble clustering algorithm.
Applsci 15 12885 g002
Figure 3. Structure of the modified IEEE 33-bus distribution network.
Figure 3. Structure of the modified IEEE 33-bus distribution network.
Applsci 15 12885 g003
Figure 4. Boxplots of DBI by different clustering algorithms. (a) DBI distribution of repeated clustering runs for a single user. (b) DBI distribution of clustering results for 100 different users.
Figure 4. Boxplots of DBI by different clustering algorithms. (a) DBI distribution of repeated clustering runs for a single user. (b) DBI distribution of clustering results for 100 different users.
Applsci 15 12885 g004
Figure 5. Experimental results of clustering robustness. (a) Accuracy rate of clustering with different noise content. (b) DBI of clustering with different noise content.
Figure 5. Experimental results of clustering robustness. (a) Accuracy rate of clustering with different noise content. (b) DBI of clustering with different noise content.
Applsci 15 12885 g005
Figure 6. Results of clustering the load curves of a certain user at bus 9 by the ensemble clustering.
Figure 6. Results of clustering the load curves of a certain user at bus 9 by the ensemble clustering.
Applsci 15 12885 g006
Figure 7. Quantitative results of user-level regulation potential. (a) Baseline load and regulation potential. (b) Upward and downward regulation potential.
Figure 7. Quantitative results of user-level regulation potential. (a) Baseline load and regulation potential. (b) Upward and downward regulation potential.
Applsci 15 12885 g007
Figure 8. Fluctuation sequence of system load.
Figure 8. Fluctuation sequence of system load.
Applsci 15 12885 g008
Figure 9. Wind power consumption curve.
Figure 9. Wind power consumption curve.
Applsci 15 12885 g009
Figure 10. Peak-valley difference in system load.
Figure 10. Peak-valley difference in system load.
Applsci 15 12885 g010
Figure 11. System contribution degree indexes.
Figure 11. System contribution degree indexes.
Applsci 15 12885 g011
Figure 12. Structure of the modified PG&E 69-bus distribution network.
Figure 12. Structure of the modified PG&E 69-bus distribution network.
Applsci 15 12885 g012
Figure 13. Fluctuation sequence of system load of PG&E 69-bus system.
Figure 13. Fluctuation sequence of system load of PG&E 69-bus system.
Applsci 15 12885 g013
Figure 14. Renewable energy consumption curve of PG&E 69-bus system.
Figure 14. Renewable energy consumption curve of PG&E 69-bus system.
Applsci 15 12885 g014
Figure 15. Peak-valley difference in system load of PG&E 69-bus system.
Figure 15. Peak-valley difference in system load of PG&E 69-bus system.
Applsci 15 12885 g015
Table 1. Number of flexible load users connected to each bus.
Table 1. Number of flexible load users connected to each bus.
Bus91116202429
Number of flexible load users1191210119
Table 2. Comparison of system contribution degree indexes.
Table 2. Comparison of system contribution degree indexes.
System Contribution Degree IndexesIEEE 33-Bus SystemPG&E 69-Bus System
System load fluctuation18.12%55.96%
Renewable energy consumption8.31%24.45%
Peak-valley difference in system load4.74%24.84%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lou, W.; Zhao, C.; Pan, M.; Zhen, C.; Liu, H.; Qi, X. Multi-Level Evaluation for Flexible Load Regulation Potential in Distribution Network Based on Ensemble Clustering. Appl. Sci. 2025, 15, 12885. https://doi.org/10.3390/app152412885

AMA Style

Lou W, Zhao C, Pan M, Zhen C, Liu H, Qi X. Multi-Level Evaluation for Flexible Load Regulation Potential in Distribution Network Based on Ensemble Clustering. Applied Sciences. 2025; 15(24):12885. https://doi.org/10.3390/app152412885

Chicago/Turabian Style

Lou, Wei, Cheng Zhao, Min Pan, Chao Zhen, Hao Liu, and Xianjun Qi. 2025. "Multi-Level Evaluation for Flexible Load Regulation Potential in Distribution Network Based on Ensemble Clustering" Applied Sciences 15, no. 24: 12885. https://doi.org/10.3390/app152412885

APA Style

Lou, W., Zhao, C., Pan, M., Zhen, C., Liu, H., & Qi, X. (2025). Multi-Level Evaluation for Flexible Load Regulation Potential in Distribution Network Based on Ensemble Clustering. Applied Sciences, 15(24), 12885. https://doi.org/10.3390/app152412885

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop