Next Article in Journal
A Temporal Graph Network Algorithm for Detecting Fraudulent Transactions on Online Payment Platforms
Previous Article in Journal
Enhancing Intrusion Detection Systems with Dimensionality Reduction and Multi-Stacking Ensemble Techniques
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Ellipsoidal K-Means: An Automatic Clustering Approach for Non-Uniform Data Distributions

by
Alaa E. Abdel-Hakim
1,2,*,
Abdel-Monem M. Ibrahim
3,4,
Kheir Eddine Bouazza
5,6,
Wael Deabes
7,8 and
Abdel-Rahman Hedar
9,10
1
Department of Computer Science in Jamoum, Umm Al-Qura University, Makkah 25371, Saudi Arabia
2
Electrical Engineering Department, Assiut University, Assiut 71516, Egypt
3
Department of Mathematics, Faculty of Science, Al-Azhar University, Assiut Branch, Assiut 71524, Egypt
4
Department of Mathematics and Statistics, Faculty of Science, Thompson Rivers University, Kamloops, BC V2C 0C8, Canada
5
Computer and Information Science Division, Higher Colleges of Technology, Abu Dhabi 25026, United Arab Emirates
6
Laboratoire d’Informatique et des Technologies de l’Information d’Oran (LITIO), University of Oran, Oran 31000, Algeria
7
Department of Computational, Engineering, Mathematical Sciences (CEMS), Texas A&M University-San Antonio, San Antonio, TX 78224, USA
8
Computers and Systems Engineering Department, Mansoura University, Mansoura 35516, Egypt
9
Department of Computer Science, Faculty of Computers & Information, Assiut University, Assiut 71526, Egypt
10
Artificial Intelligence Department, School of Artificial Intelligence and Data Management, Badr University in Assiut, Assiut 19592, Egypt
*
Author to whom correspondence should be addressed.
Algorithms 2024, 17(12), 551; https://doi.org/10.3390/a17120551
Submission received: 8 September 2024 / Revised: 21 October 2024 / Accepted: 26 November 2024 / Published: 3 December 2024
(This article belongs to the Section Analysis of Algorithms and Complexity Theory)

Abstract

Traditional K-means clustering assumes, to some extent, a uniform distribution of data around predefined centroids, which limits its effectiveness for many realistic datasets. In this paper, a new clustering technique, simulated-annealing-based ellipsoidal clustering (SAELLC), is proposed to automatically partition data into an optimal number of ellipsoidal clusters, a capability absent in traditional methods. SAELLC transforms each identified cluster into a hyperspherical cluster, where the diameter of the hypersphere equals the minor axis of the original ellipsoid, and the center is encoded to represent the entire cluster. During the assignment of points to clusters, local ellipsoidal properties are independently considered. For objective function evaluation, the method adaptively transforms these ellipsoidal clusters into a variable number of global clusters. Two objective functions are simultaneously optimized: one reflecting partition compactness using the silhouette function (SF) and Euclidean distance, and another addressing cluster connectedness through a nearest-neighbor algorithm. This optimization is achieved using a newly-developed multiobjective simulated annealing approach. SAELLC is designed to automatically determine the optimal number of clusters, achieve precise partitioning, and accommodate a wide range of cluster shapes, including spherical, ellipsoidal, and non-symmetric forms. Extensive experiments conducted on UCI datasets demonstrated SAELLC’s superior performance compared to six well-known clustering algorithms. The results highlight its remarkable ability to handle diverse data distributions and automatically identify the optimal number of clusters, making it a robust choice for advanced clustering analysis.
Keywords: unsupervised learning; data clustering; simulated annealing; silhouette function; adaptive ellipsoidal K-means; automatic clustering unsupervised learning; data clustering; simulated annealing; silhouette function; adaptive ellipsoidal K-means; automatic clustering

Share and Cite

MDPI and ACS Style

Abdel-Hakim, A.E.; Ibrahim, A.-M.M.; Bouazza, K.E.; Deabes, W.; Hedar, A.-R. Ellipsoidal K-Means: An Automatic Clustering Approach for Non-Uniform Data Distributions. Algorithms 2024, 17, 551. https://doi.org/10.3390/a17120551

AMA Style

Abdel-Hakim AE, Ibrahim A-MM, Bouazza KE, Deabes W, Hedar A-R. Ellipsoidal K-Means: An Automatic Clustering Approach for Non-Uniform Data Distributions. Algorithms. 2024; 17(12):551. https://doi.org/10.3390/a17120551

Chicago/Turabian Style

Abdel-Hakim, Alaa E., Abdel-Monem M. Ibrahim, Kheir Eddine Bouazza, Wael Deabes, and Abdel-Rahman Hedar. 2024. "Ellipsoidal K-Means: An Automatic Clustering Approach for Non-Uniform Data Distributions" Algorithms 17, no. 12: 551. https://doi.org/10.3390/a17120551

APA Style

Abdel-Hakim, A. E., Ibrahim, A.-M. M., Bouazza, K. E., Deabes, W., & Hedar, A.-R. (2024). Ellipsoidal K-Means: An Automatic Clustering Approach for Non-Uniform Data Distributions. Algorithms, 17(12), 551. https://doi.org/10.3390/a17120551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop