Enhancing Load Stratification in Power Distribution Systems Through Clustering Algorithms: A Practical Study
Abstract
1. Introduction
2. Background
3. Methodology
3.1. Stage 1: Data Acquisition and Processing
3.1.1. Data Acquisition of Low and Medium Voltage Customers
3.1.2. Data Validation
3.1.3. Calculations
3.1.4. Information Filtering
3.2. Stage 2: Data Management System Export and Evaluation of Clustering Methods
3.2.1. Optimal Number of Clusters
3.2.2. Analysis of Clustering Methods for Characterization
3.2.3. Selection of Methods to Apply
3.3. Stage 3: Classification and Characterization
3.3.1. Customer Classification and Characterization
3.3.2. Analysis
4. Results
4.1. Test Methods
- K-means and GMM: It is essential to determine the appropriate number of clusters to ensure reliable segmentation. For this purpose, the elbow method was applied, as illustrated in Figure 2. The figure represents the average distance between the observations and their corresponding centroids as a function of the number of clusters.
- DBSCAN: To determine this value, the elbow method was used, based on the relationship between the distance among observations and the number of clusters, as illustrated in Figure 3. The optimal number of clusters was established at “5”, as no significant improvement was observed when testing different values around this point.
4.2. Evaluation of the Results Obtained by the Methods Applied and Results of EERCS
4.3. Analysis of GMM Results
4.4. Assessment of the Method for Sizing Transformer Stations
5. Discussion and Analysis of Results
6. Conclusions
7. Recommendations
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| ARCERNNR | Agencia de Regulación y Control de Energía y Recursos Naturales no Renovables |
| EEC | Electric Energy Consumption |
References
- Mora-Alvarez, M.; Contreras-Ortiz, P.; Serrano-Guerrero, X.; Escrivá-Escriva, G. Characterization and Classification of Daily Electricity Consumption Profiles: Shape Factors and k-Means Clustering Technique. E3S Web Conf. 2018, 64, 08004. [Google Scholar] [CrossRef]
- Soto, P.A.; Castro, J.R.; Reategui, R.M.; Castillo, T.D. Partición de una Red Eléctrica de Distribución Aplicando Algoritmos de Agrupamiento K-means y DBSCAN. Rev. Tec. Energ. 2023, 20, 73–81. [Google Scholar] [CrossRef]
- Mahesh, B. Machine Learning Algorithms—A Review. Int. J. Sci. Res. 2020, 9, 381–386. [Google Scholar] [CrossRef]
- Bonaccorso, G. Machine Learning Algorithms; Packt Publishing Ltd.: Birmingham, UK, 2017. [Google Scholar]
- Enriquez-Loja, J.; Castillo-Pérez, B.; Serrano-Guerrero, X.; Barragán-Escandón, A. Performance evaluation method for different clustering techniques. Comput. Electr. Eng. 2025, 123, 110132. [Google Scholar] [CrossRef]
- Germán, A.; Juan, S.P. Optimización de Los Alimentadores de Media TensióN y Transformadores de Distribución de la S/E 17 Los Cerezos Proyectada por la CENTROSUR. Bachelor’s Thesis, UPS, Cuenca, Ecuador, 2019. [Google Scholar]
- Ullah, A.; Haydarov, K.; Haq, I.U.; Muhammad, K.; Rho, S.; Lee, M.; Baik, S.W. Deep Learning Assisted Buildings Energy Consumption Profiling Using Smart Meter Data. Sensors 2020, 20, 873. [Google Scholar] [CrossRef] [PubMed]
- Huang, D.; Wang, C.D.; Peng, H.; Lai, J.; Kwoh, C.K. Enhanced ensemble clustering via fast propagation of cluster-wise similarities. IEEE Trans. Syst. Man Cybern. Syst. 2018, 51, 508–520. [Google Scholar] [CrossRef]
- McLoughlin, F.; Duffy, A.; Conlon, M. A Clustering Approach to Domestic Electricity Load Profile Characterisation Using Smart Metering Data. Appl. Energy 2015, 141, 190–199. [Google Scholar] [CrossRef]
- Seem, J.E. Pattern recognition algorithm for determining days of the week with similar energy consumption profiles. Energy Build. 2005, 37, 127–139. [Google Scholar] [CrossRef]
- Wang, J.; Wang, K.; Jia, R.; Chen, X. Research on Load Clustering Based on Singular Value Decomposition and K-means Clustering Algorithm. In Proceedings of the 2020 Asia Energy and Electrical Engineering Symposium (AEEES), Chengdu, China, 28–31 May 2020; pp. 831–835. [Google Scholar]
- Benavoli, A.; Corani, G.; Demšar, J.; Zaffalon, M. Time for a Change: A Tutorial for Comparing Multiple Classifiers through Bayesian Analysis. J. Mach. Learn. Res. 2017, 18, 136–181. [Google Scholar] [CrossRef]
- Li, Y.; Zhang, Y.; Tang, Q.; Huang, W.; Jiang, Y.; Xia, S.T. tk-means: A robust and stable k-means variant. In Proceedings of the ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 6–11 June 2021; pp. 3120–3124. [Google Scholar]
- Likas, A.; Vlassis, N.; Verbeek, J.J. The global k-means clustering Algorithm. Pattern Recognit. 2003, 36, 451–461. [Google Scholar] [CrossRef]
- García-Santander, L.; San Martín-Ayala, J.; Ulloa-Vásquez, F.; Carrizo, D.; Esparza, V.; Rohten, J.; Mejias, C. Classification of Behavior Profiles for Non-Residential Customers Considering the Variable of Electrical Energy Consumption: Case Study—SAESA Group SA Company. Energies 2022, 15, 6634. [Google Scholar] [CrossRef]
- Rodrigo, J.A. Detección de AnomalíAs Con Gaussian Mixture Model (GMM) y Python. 2020. Available online: https://cienciadedatos.net/documentos/py23-deteccion-anomalias-gmm-python (accessed on 3 February 2023).
- Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. Ser. B Methodol. 1977, 39, 1–22. [Google Scholar] [CrossRef]
- Ohadi, N.; Kamandi, A.; Shabankhah, M.; Fatemi, S.M.; Hosseini, S.M.; Mahmoudi, A. Sw-dbscan: A grid-based dbscan algorithm for large datasets. In Proceedings of the 2020 6th International Conference on Web Research (ICWR), Tehran, Iran, 22–23 April 2020; pp. 139–145. [Google Scholar]
- Pascual, D.; Pla, F.; Sánchez, S. Algoritmos de agrupamiento. In Método Informáticos Avanzados; Publicacions de la Universitat Jaume I: Castelló, Spain, 2007; pp. 164–174. [Google Scholar]
- Jebari, S.; Smiti, A.; Louati, A. AF-DBSCAN: An Unsupervised Automatic Fuzzy Clustering Method Based on DBSCAN Approach. In Proceedings of the 2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI), Budapest, Hungary, 3–5 July 2019; pp. 1–6. [Google Scholar]
- Ciardullo, E.; Quaglino, M. Estudio Comparativo de méTodos de Clasificación no Supervisada en Contextos de Grandes Bases de Datos. 2020. Available online: http://hdl.handle.net/11086/16851 (accessed on 1 July 2025).
- Betancourt Vasco, E.E. Estudio y Planteamiento Para Establecer Una Tarifa Horaria en el Pico del Sistema EléCtrico en el Ecuador Como Incentivo de Eficiencia EnergéTica. Bachelor’s Thesis, EPN, Quito, Ecuador, 2012. [Google Scholar]
- Zambrano, S.; Molina, M. Investigación y Caracterización de la Carga Muestreo Aleatorio Por Estratos; Empresa Eléctrica Regional Centro Sur C.A., Departamento de Estudios Técnicos: Cuenca, Ecuador, 2016. [Google Scholar]
- Shi, C.; Wei, B.; Wei, S.; Wang, W.; Liu, H.; Liu, J. A quantitative discriminant method of elbow point for the optimal number of clusters in clustering algorithm. EURASIP J. Wirel. Commun. Netw. 2021, 2021, 31. [Google Scholar] [CrossRef]











| Category | Tariff |
|---|---|
| General low voltage without demand | RD (Residential) SC (Senior Citizens) SA (Social Assistance) PB (Public Benefit) CO (Commercial) SA (Sports Arena) IA (Industrial Artisanal) OW (Official Entities) WP (Water Pumping) |
| General low voltage without demand | AB (Social assistance in LV with demand) BB (Public benefit in LV with demand) CB (Commercial LV with demand) B (Industrial LV with demand) MB (Municipal entity LV with demand) |
| General low voltage with hourly demand metering | A3 (Social assistance LV with hourly demand) B3 (Public benefit LV with hourly demand) C3 (Commercial LV with hourly demand) E3 (Sports facility LV with hourly demand) HH (Artisanal industry with hourly demand |
| Public lighting | PL (Public Lighting) |
| General with demand | WP (Water Pumping) PB (Public Benefit) CD (Commercial) RW (Religious Worship) SF (Sports Facility) ID (Industrial) ME (Municipal Entity) OE (Official Entities) |
| General with hourly demand | AH (Social assistance with hourly demand) BH (Public benefit with hourly demand) CH (Commercial with hourly demand) SH (Sports facility with hourly demand) IH (Industrial with hourly demand) JH (Industrial with hourly metering and incentives in MV) |
| High voltage service | KH (Industrial with hourly metering and incentives in HV) |
| Consumer Group | Strata kWh> | Strata kWh> | Strata |
|---|---|---|---|
| Residential | 0 | 60 | 1 |
| Residential | 60 | 110 | 2 |
| Residential | 110 | 180 | 3 |
| Residential | 180 | 310 | 4 |
| Residential | 310 | Upper | 5 |
| Commercial | 0 | 290 | 1 |
| Commercial | 290 | 1235 | 2 |
| Commercial | 1235 | Upper | 3 |
| Industrial | 0 | 410 | 1 |
| Industrial | 410 | 2520 | 2 |
| Industrial | 2520 | Upper | 3 |
| Others | 0 | 405 | 1 |
| Others | 405 | 1820 | 2 |
| Others | 1820 | Upper | 3 |
| Methods | Clusters | E [kWh/Month] | Observations | Strata Representation [%] |
|---|---|---|---|---|
| K-Means | 4 | 6.84 43.13 46.44 62.17 | 1647 1008 407 132 | 31.56 |
| DBSCAN | 5 | 54.59 21.49 46.33 31.66 64.05 | 390 2793 5 3 3 | 92.38 |
| GMM | 4 | 0.02 2.73 13.54 41.41 | 436 482 616 1308 | 46.02 |
| Methods | Observations | Strata | Clusters |
|---|---|---|---|
| DBSCAN | 7480 | Residential Commercial Industrial Others | 8 5 2 1 |
| EERCS | 1214 | Residential Commercial Industrial Others | 5 5 3 3 |
| GMM | 8487 | Residential Commercial Industrial Others | 7 4 3 3 |
| Methods | Strata | Observations | E [kW/Month] | Reduction Observations [%] | |
|---|---|---|---|---|---|
| DBSCAN | RD1 RD2 RD3 RD4 RD5 | 2793 1991 1662 1355 662 | 21.5 80.0 135.8 228.3 481.1 | 55.0 57.2 62.1 69.6 74.8 | 33.0 |
| EERCS | Residential 1 Residential 2 Residential 3 Residential 4 Residential 5 | 161 212 173 116 48 | 60 110 180 310 500 | 49.3 54.2 58.2 65.6 75.2 | 24.366 |
| GMM | Residential 1 Residential 2 Residential 3 Residential 4 Residential 5 | 1455 1919 2544 2052 775 | 4.5 44.0 92.4 170.9 346.5 | 42.1 56.7 56.7 65.2 73.4 | 16.6 |
| Element | E [kWh/mes] | Customers | LF [%] |
|---|---|---|---|
| Trafo 31240 | 9443.85 | 71 | 67.948 |
| Residential 1 | 60 | 110 | 2 |
| Residential 3 | 170.94 | 23 | 65.2 |
| Residential 4 | 346.51 | 5 | 73.4 |
| Commercial 1 | 727.59 | 3 | 61.4 |
| Methods | Consumer Group | Energy kWh> | Energy kWh> | Strata |
|---|---|---|---|---|
| DBSCAN | Residential | 0 60 110 180 310 | 60 110 180 310 Upper | 1 2 3 4 5 |
| EERCS | Residential | 0 60 110 180 310 | 60 110 180 310 Upper | 1 2 3 4 5 |
| GMM | Residential | 0 68 131 258 | 68 131 258 Upper | 1 2 3 4 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mendoza-Vitonera, W.; Serrano-Guerrero, X.; Cabrera, M.-F.; Enriquez-Loja, J.; Barragán-Escandón, A. Enhancing Load Stratification in Power Distribution Systems Through Clustering Algorithms: A Practical Study. Energies 2025, 18, 5314. https://doi.org/10.3390/en18195314
Mendoza-Vitonera W, Serrano-Guerrero X, Cabrera M-F, Enriquez-Loja J, Barragán-Escandón A. Enhancing Load Stratification in Power Distribution Systems Through Clustering Algorithms: A Practical Study. Energies. 2025; 18(19):5314. https://doi.org/10.3390/en18195314
Chicago/Turabian StyleMendoza-Vitonera, Williams, Xavier Serrano-Guerrero, María-Fernanda Cabrera, John Enriquez-Loja, and Antonio Barragán-Escandón. 2025. "Enhancing Load Stratification in Power Distribution Systems Through Clustering Algorithms: A Practical Study" Energies 18, no. 19: 5314. https://doi.org/10.3390/en18195314
APA StyleMendoza-Vitonera, W., Serrano-Guerrero, X., Cabrera, M.-F., Enriquez-Loja, J., & Barragán-Escandón, A. (2025). Enhancing Load Stratification in Power Distribution Systems Through Clustering Algorithms: A Practical Study. Energies, 18(19), 5314. https://doi.org/10.3390/en18195314

