Clustering Performance of a Recombinator Hartigan–Wong Algorithm
Abstract
1. Introduction
- Rec-HW design consistently depends on a non-Voronoi organization: each point move, during a trial, actually occurs when it optimizes the objective function cost.
- Rec-HW no longer generates empty clusters, as can instead occur in the preliminary Voronoi version in [29].
- Rec-HW performs a batch of R independent runs (e.g., R = 50), from which confidence intervals are estimated for the various clustering quality indices.
- Rec-HW is thoroughly tested by applying it to many challenging datasets, both synthetic and realistic. Experimental results are also compared with those generated by competitor algorithms.
2. Basic Concepts
2.1. K-Means and Seeding Methods
| Algorithm 1. Pseudo-code of the K-Means algorithm. |
| Input: Dataset X, number of clusters K, maximum number of iterations T 1. initialization: use a seeding method to define initialize centroids 2. refinements: iterations = 0 do{ partition points to clusters according to the nearest centroid rule nc(.) recompute centroids of resultant clusters as mean points: ++iterations }while( centroids differ from the previous ones and iterations < T ) Output: compute SSE and other accuracy clustering indices on the resultant solution. |
2.2. Voronoi Version of Hartigan–Wong
| Algorithm 2. Hartigan–Wong operation. |
| Input: Dataset X, number of clusters K, maximum number of iterations T 1. initialization: define initial centroids with a seeding method partition dataset points according to nc(.) rule and define initial clusters 2. trials: s = true iterations = 0 do { s = true for (each point ){ remove from its source cluster sc and update the centroid of sc assign to other cluster dc according to nc(.) rule, and update centroid of dc if (dc ! = sc) s = false } ++iterations }while ( !s and iterations < T ) Output: compute SSE and other accuracy clustering indices on the resultant solution |
2.3. Clustering Accuracy Indices
2.4. Genetic Concepts for Clustering
3. Recombinator Hartigan–Wong Algorithm
3.1. Non-Voronoi Behaviour
3.2. Operation of Rec-HW
| Algorithm 3. Abstract operation of Recombinator Hartigan–Wong algorithm. |
| Input: Dataset X, number of clusters K, number of solutions J of population, maximum number of trails T, number of repetitions/recombinations R 1. Initialization: create population with J solutions (J*K centroids) repeat J times { C ← seeding(X, K, refine) P ← partition(X, C, K) <C′,P′> ← refine-by-trials(<C,P>) {C′} } 2. Recombination: create R generations of best-cost ← ∞, best ← ? repeat R times{ C ← seeding(, K, g-k-means++) P ← partition(X, C, K) <C′,P′> ← refine-by-trials(<C,P>) cost ← SSE(<C′,P′>) if (cost < best-cost){ best ← <C’,P’> best-cost ← cost } } Output: SSE and other clustering quality indices, including statistical data, of the emerged best solution. |
3.3. Implementation Issues
| Algorithm 4. Java parallel version of data partitioning and centre[] initialization. |
| … //0. initialize array centre[] for (int k = 0; k < K; ++k) {centre[k].reset(); centre[k].setCID(k);} //1. partition dataset points according to current defined centroids[] Stream<DataPoint> p_stream = Stream.of(dataset); if (PARALLEL) p_stream = p_stream.parallel(); p_stream .map( p -> { double md = Double.MAX_VALUE; for( int k = 0; k < K; ++k ){ double d = p.distance(centroids[k]); if (d < md) {md = d; p.setCID(k);} } return p; }) .forEach (p->{}); //2. put in centre[c] the sum of points and cardinality of cluster c Stream<DataPoint> c_stream = Stream.of(centre); if (PARALLEL) c_stream = c_stream.parallel(); c_stream .map( c -> { for( int i = 0; i < N; ++i ){ if (dataset[i].getCID()==c.getCID()) c.add(dataset[i]); } return c; }) .forEach( c->{} ); … |
| Algorithm 5. Trials operations of Rec-HW. |
| … //3. make trials it = 0; boolean s = true; do{ s = true; ++it; //for each data point xi, that is, dataset[i] for( int i = 0; i < N; ++i ){ //remove xi from its source cluster sc sc = dataset[i].getCID(); centre[sc].sub(dataset[i]); double nsc = centre[sc].getN(); //new cardinality of sc DataPoint musc = new DataPoint(centre[sc]); musc.mean(); //new centroid of sc //compute distortion decrement Ddec double d = dataset[i].distance(musc); double Ddec = (nsc/(nsc + 1)) * d * d; //detect cluster dc which would imply the minimum distortion increment Dinc dc = 0; double Dinc = Double.MAX_VALUE; for( int c = 0; c < K; ++c ){ centre[c].add(dataset[i]); double ndc = centre[c].getN(); DataPoint mudc = new DataPoint(centre[c]); mudc.mean(); //compute minimal Dinc d = dataset[i].distance(mudc); double Ddc = (ndc/(ndc + 1)) * d * d; centre[c].sub(dataset[i]); if (Ddc < Dinc) {Dinc = Ddc; dc = c;} } //check if switch has to be carried out if( Ddec > Dinc && dc != sc ){ //move xi to dc centre[dc].add(dataset[i]); dataset[i].setCID(dc); centroids[sc] = new DataPoint(centre[sc]); centroids[sc].mean(); centroids[dc] = new DataPoint(centre[dc]); centroids[dc].mean(); s = false; } else{ //no move centre[sc].add(dataset[i]); dataset[i].setCID(sc); centroids[sc] = new DataPoint(centre[sc]); centroids[sc].mean(); } }//for }while( !s ); … |
4. Series of Clustering Experiments
4.1. Clustering the A3 Dataset [40]
4.2. Dataset Clustering Sensitive to SSE Minimization
4.3. Some Challenging Datasets
4.4. Realistic Datasets
4.5. Sensitivity Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Aggarwal, C.C.; Reddy, C.K. Data Clustering—Algorithms and Applications; CRC Press: Boca Raton, FL, USA; Taylor and Francis Group: London, UK, 2014. [Google Scholar]
- Wang, R. Introduction to Machine Learning: From Math to Code; Cambridge University Press: Cambridge, UK, 2025. [Google Scholar]
- Nielsen, F. Partition-based clustering with k-means. In Introduction to HPC with MPI for Data Science; Springer International Publishing: Cham, Switzerland, 2016; pp. 163–193. [Google Scholar]
- Aurenhammer, F.; Klein, R. Voronoi Diagrams; Fernuniv., Fachbereich Informatik: Berlin, Germany, 1996. [Google Scholar]
- Koivistoinen, H.; Ruuska, M.; Elomaa, T. A Voronoi diagram approach to autonomous clustering. In International Conference on Discovery Science; Springer: Berlin/Heidelberg, Germany, 2006; pp. 149–160. [Google Scholar]
- MacQueen, J. Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability; University of California: Berkeley, CA, USA, 1967; pp. 281–297. [Google Scholar]
- Lloyd, S.P. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
- Jain, A.K. Data clustering: 50 years beyond k-means. Pattern Recognit. Lett. 2010, 31, 651–666. [Google Scholar] [CrossRef]
- Celebi, M.E.; Kingravi, H.A.; Vela, P.A. A comparative study of efficient initialization methods for the k-means clustering algorithm. Expert Syst. Appl. 2013, 40, 200–210. [Google Scholar] [CrossRef]
- Fränti, P.; Sieranoja, S. K-means properties on six clustering benchmark datasets. Appl. Intell. 2018, 48, 4743–4759. [Google Scholar] [CrossRef]
- Fränti, P.; Sieranoja, S. How much can k-means be improved by using better initialization and repeats? Pattern Recognit. 2019, 93, 95–112. [Google Scholar] [CrossRef]
- Fränti, P. Efficiency of random swap algorithm. J. Big Data 2018, 5, 1–29. [Google Scholar] [CrossRef]
- Nigro, L.; Cicirelli, F.; Fränti, P. Parallel Random Swap: An efficient and reliable clustering algorithm in Java. Simul. Model. Pract. Theory 2023, 124, 102712. [Google Scholar] [CrossRef]
- Hartigan, J.A. Clustering Algorithms; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1975. [Google Scholar]
- Hartigan, J.A.; Wong, M.A. Algorithm AS 136: A K-Means clustering algorithm. J. R. Stat. Soc. Ser. C (Appl. Stat.) 1979, 28, 100–108. [Google Scholar] [CrossRef]
- Telgarsky, M.; Vattani, A. Hartigan’s method: K-Means clustering without voronoi. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, Sardinia, Italy, 13–15 May 2010; JMLR Workshop and Conference Proceedings; PMLR: Cambridge, MA, USA; pp. 820–827.
- Slonim, N.; Aharoni, E.; Crammer, K. Hartigan’s K-means vs. Lloyd’s K means—Is it time for a change? In Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 3–9 August 2013. [Google Scholar]
- Vouros, A.; Langdell, S.; Croucher, M.; Vasilaki, E. An empirical comparison between stochastic and deterministic centroid initialization for K-Means variations. Mach. Learn. 2021, 110, 1975–2003. [Google Scholar] [CrossRef]
- Nigro, L. Parallel K-Means algorithms in Java. Algorithms 2022, 15, 117. [Google Scholar] [CrossRef]
- Baldassi, C. Recombinator-k-means: An evolutionary algorithm that exploits k-means++ for recombination. IEEE Trans. Evol. Comput. 2022, 26, 991–1003. [Google Scholar] [CrossRef]
- Baldassi, C. Systematically and efficiently improving K-Means initialization by pairwise-nearest-neighbor smoothing. arXiv 2022, arXiv:2202.03949. [Google Scholar]
- Nigro, L.; Cicirelli, F. Improving clustering accuracy of K-Means and Random Swap by an evolutionary technique based on careful seeding. Algorithms 2023, 16, 572. [Google Scholar] [CrossRef]
- Nigro, L.; Cicirelli, F.; Pupo, F. Genetic Elitist Approach and Density Peaks to Improve K-Means Clustering. Algorithms 2026, 19, 131. [Google Scholar] [CrossRef]
- Fränti, P. Genetic algorithm with deterministic crossover for vector quantization. Pattern Recognit. Lett. 2000, 21, 61–68. [Google Scholar] [CrossRef]
- Nigro, L.; Cicirelli, F. Fast clustering convergence by genetic algorithm. In International Conference on WorldS4; Springer Nature: Singapore, 2024; pp. 331–342. [Google Scholar]
- Rodriguez, R.; Laio, A. Clustering by fast search and find of density peaks. Science 2014, 344, 1492–1496. [Google Scholar] [CrossRef] [PubMed]
- Sieranoja, S.; Fränti, P. Fast and general density peaks clustering. Pattern Recognit. Lett. 2019, 128, 551–558. [Google Scholar] [CrossRef]
- Nigro, L.; Cicirelli, F. ParDP: A parallel density peaks-based clustering algorithm. Mathematics 2025, 13, 1285. [Google Scholar] [CrossRef]
- Nigro, L.; Cicirelli, F. Evolutionary Hartigan-Wong clustering algorithm. In Proceedings of the 10th International Conference WorldS4, London, UK, 28–30 July 2026; Springer: Berlin/Heidelberg, Germany, 2026. [Google Scholar]
- Urma, R.G.; Fusco, M.; Mycroft, A. Modern Java in Action; Manning: Shelter Island, NY, USA; Simon Schuster: New York, NY, USA, 2019. [Google Scholar]
- Subramaniam, V. Functional Programming in Java: Harness the Power of Streams and Lambda Expressions; The Pragmatic Programmers LLC: Dallas, TX, USA, 2023. [Google Scholar]
- Arthur, D.; Vassilvitskii, S. k-means++: The advantages of careful seeding. In Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2007; pp. 1027–1035. [Google Scholar]
- Bradley, P.S.; Fayyad, U.M. Refining initial points for k-means clustering. In Proceedings of the Fifteenth International Conference on Machine Learning, Madison, WI, USA, 24–27 July 1998; pp. 91–99. [Google Scholar]
- Fränti, P.; Kaukoranta, T. Fast implementation of the optimal PNN method. In Proceedings of the 1998 International Conference on Image Processing. ICIP98 (Cat. No. 98CB36269); IEEE: New York, NY, USA, 1998; Volume 3, pp. 104–108. [Google Scholar]
- Rezaei, M.; Franti, P. Set matching measures for external cluster validity. IEEE Trans. Know. Data Eng. 2016, 28, 2173–2186. [Google Scholar] [CrossRef]
- Fränti, P.; Rezaei, M.; Zhao, Q. Centroid index: Cluster level similarity measure. Pattern Recognit. 2014, 47, 3034–3045. [Google Scholar] [CrossRef]
- Fränti, P.; Rezaei, M. Generalized centroid index to different clustering models. In Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2016); Springer International Publishing: Cham, Switzerland, 2016; Volume 10029, pp. 285–296. [Google Scholar]
- Franti, P.; Sieranoja, S. Clustering accuracy. Appl. Comput. Intell. 2024, 4, 24–44. [Google Scholar] [CrossRef]
- Nigro, L.; Fränti, P. Two medoid-based algorithms for clustering Sets. Algorithms 2023, 16, 349. [Google Scholar] [CrossRef]
- Fränti, P. Benchmark Datasets Repository. Available online: http://cs.uef.fi/sipu/datasets/ (accessed on 1 May 2026).
- Nie, F.; Li, Z.; Wang, R.; Li, X. An effective and efficient algorithm for K-means clustering with new formulation. IEEE Trans. Knowl. Data Eng. 2023, 35, 3433–3443. [Google Scholar] [CrossRef]
- UCI Machine Learning Repository. Available online: http://archive.ics.uci.edu/ml (accessed on 1 May 2026).
- Caliński, T.; Harabasz, J. A dendrite method for cluster analysis. Commun. Stat. 1974, 3, 1–27. [Google Scholar] [CrossRef]










| Algorithm | SSE | CI | SI | SR (%) | avIT | ET (s) |
|---|---|---|---|---|---|---|
| R-LKM | 9.37 | 4 | 0.54 | 0 | 27.1 | 0.67 |
| R-V-HW | 6.74 | 0 | 0.60 | 7 | 8.93 | 3.04 |
| R-nV-HW | 6.74 | 0 | 0.60 | 10 | 7.67 | 5.25 |
| Rec-HW | 6.74 | 0 | 0.60 | 100 | 2.5 | 4.18 |
| Dataset | N | D | K |
|---|---|---|---|
| Asymmetric | 1000 | 2 | 5 |
| Overlap | 1000 | 2 | 6 |
| S4 | 5000 | 2 | 15 |
| Unbalance2 | 6500 | 2 | 8 |
| Birch1/2 | 100,000 | 2 | 100 |
| Dataset | SSE | CI | SI | SR (%) | avIT | ET (s) |
|---|---|---|---|---|---|---|
| Asymmetric | 0.98 | 0 | 0.64 | 100 | 1 | 0.23 |
| Overlap | 1.30 | 0 | 0.45 | 100 | 1 | 0.34 |
| S4 | 16.44 | 0 | 0.48 | 100 | 5.72 | 2.41 |
| Unbalance2 | 1.09 | 0 | 0.85 | 100 | 1.97 | 0.81 |
| Birch1 | 92.77 | 0 | 0.46 | 100 | 6.5 | 225.58 |
| Birch2 | 0.23 | 0 | 0.78 | 100 | 2.1 | 110.13 |
| Dataset | N | D | K |
|---|---|---|---|
| Birch3 | 100,000 | 2 | 100 |
| Worms_2D | 105,600 | 2 | 35 |
| Olivetti | 400 | 4096 | 40 |
| Dataset | SSE | CI | SI | SR (%) | avIT | ET (s) |
|---|---|---|---|---|---|---|
| Birch3 | 37.74 | 11 | 0.52 | 0 | 28.07 | 1134.7 |
| Worms_2D | 79.32 | 7 | 0.36 | 0 | 25.45 | 475.6 |
| Olivetti | 11,480.75 | 6 | 0.16 | 0 | 4.56 | 422.2 |
| Dataset | N | D | K |
|---|---|---|---|
| Iris | 150 | 4 | 3 |
| Balance | 625 | 4 | 3 |
| Dermatology | 366 | 34 | 6 |
| Uspst | 2007 | 256 | 10 |
| USPSdata_20 | 1854 | 256 | 10 |
| USPSdata | 9298 | 256 | 10 |
| MSRA25 | 1799 | 256 | 12 |
| PalmData25 | 2000 | 256 | 100 |
| Binalpha | 1404 | 320 | 36 |
| Ecoli | 336 | 343 | 8 |
| Corel_5k | 5000 | 423 | 50 |
| MnistData_05 | 3495 | 784 | 10 |
| MnistData_10 | 6996 | 784 | 10 |
| Coil20Data_25 | 1440 | 1024 | 20 |
| Mpeg7 | 1400 | 6000 | 70 |
| TDT2_10 | 653 | 36,771 | 10 |
| Dataset | SSE | CI | SI | SR (%) | avIT | ET (s) | PCA |
|---|---|---|---|---|---|---|---|
| Iris | 78.95 | 0 | 0.55 | 80 | 4.20 | 0.55 | 4 |
| Balance | 3472.32 | 1 | 0.17 | 0 | 7.48 | 0.98 | 4 |
| Dermatology | 5580.60 | 1 | 0.19 | 12 | 4.96 | 1.18 | 34 |
| Uspst | 63,342.50 | 1 | 0.15 | 0 | 19.12 | 22.09 | 226 |
| USPSdata_20 | 66,241.37 | 1 | 0.16 | 0 | 17.30 | 18.97 | 226 |
| USPSdata | 33,3917.29 | 1 | 0.17 | 0 | 29.24 | 151.92 | 226 |
| MSRA25 | 151,542,870.88 | 2 | 0.19 | 0 | 11.68 | 18.54 | 240 |
| PalmData25 | 502,924,881.30 | 15 | 0.29 | 0 | 8.68 | 138.83 | 252 |
| Binalpha | 67,101.17 | 7 | 0.06 | 0 | 14.98 | 54.5 | 315 |
| Ecoli | 338.90 | 2 | 0.01 | 0 | 7.74 | 3.99 | 339 |
| Corel_5k | 4,367,794.42 | 18 | 0.09 | 0 | 34.78 | 416.31 | 183 |
| MnistData_05 | 8,750,188,511.82 | 1 | 0.07 | 0 | 24.56 | 107 | 434 |
| MnistData_10 | 17,594,185,757.59 | 1 | 0.06 | 0 | 32.90 | 273 | 434 |
| Coil20Data_25 | 2,390,598,975.34 | 4 | 0.23 | 0 | 13.54 | 89.9 | 893 |
| Mpeg7 | 5508.91 | 12 | 0.11 | 0 | 11.58 | 2617.48 | 5248 |
| TDT2_10 | 195,506.52 | 2 | 0.19 | 0 | 7.96 | 815.4 | 11,028 |
| Dataset | SSE | CI | SI | SR (%) | avIT | ET (s) |
|---|---|---|---|---|---|---|
| Iris | 78.95 | 0 | 0.55 | 100 | 1.38 | 0.59 |
| Balance | 3472.32 | 1 | 0.17 | 0 | 5.34 | 1.57 |
| Dermatology | 5585.02 | 1 | 0.22 | 90 | 2.12 | 1.21 |
| Uspst | 63,342.41 | 1 | 0.16 | 0 | 4.06 | 13.6 |
| USPSdata_20 | 66,241.37 | 1 | 0.16 | 0 | 3.86 | 12.17 |
| USPSdata | 333,917.36 | 1 | 0.16 | 0 | 7.74 | 79.42 |
| MSRA25 | 151,542,870.88 | 2 | 0.19 | 0 | 5.3 | 15.24 |
| PalmData25 | 478,967,435.73 | 8 | 0.32 | 0 | 4.4 | 239.68 |
| Binalpha | 66,886.37 | 6 | 0.07 | 0 | 9.46 | 53.62 |
| Ecoli | 338.90 | 2 | 0.01 | 0 | 6.14 | 4.04 |
| Corel_5k | 4,355,292.35 | 18 | 0.09 | 0 | 14.14 | 257.49 |
| MnistData_05 | 875,002,3791.49 | 1 | 0.06 | 0 | 11.5 | 73.37 |
| MnistData_10 | 17,594,181,633.10 | 1 | 0.06 | 0 | 13.22 | 172.19 |
| Coil20Data_25 | 2,360,483,642.93 | 3 | 0.21 | 0 | 6.24 | 72.34 |
| Mpeg7 | 5474.46 | 12 | 0.11 | 0 | 7.74 | 4124.4 |
| TDT2_10 | 195,329.82 | 2 | 0.19 | 0 | 4.92 | 767.8 |
| Dataset | ACC | NMI | F-Score | ARI |
|---|---|---|---|---|
| Iris | 0.8427 ± 0.0246 | 0.7106 ± 0.0175 | 0.7786 ± 0.0182 | 0.6582 ± 0.0326 |
| Balance | 0.5266 ± 0.0078 | 0.1218 ± 0.0112 | 0.4647 ± 0.0066 | 0.1410 ± 0.0105 |
| Dermatology | 0.8324 ± 0.0197 | 0.8404 ± 0.0179 | 0.7659 ± 0.0313 | 0.7019 ± 0.0412 |
| Uspst | 0.7002 ± 0.0043 | 0.6123 ± 0.0036 | 0.5751 ± 0.0053 | 0.5240 ± 0.0060 |
| USPSdata_20 | 0.6934 ± 0.0065 | 0.6193 ± 0.0048 | 0.5700 ± 0.0073 | 0.5183 ± 0.0083 |
| USPSdata | 0.7106 ± 0.0020 | 0.6131 ± 0.0012 | 0.5824 ± 0.0023 | 0.5320 ± 0.0028 |
| MSRA25 | 0.5334 ± 0.0112 | 0.5855 ± 0.0106 | 0.3961 ± 0.0139 | 0.3304 ± 0.0170 |
| PalmData25 | 0.7864 ± 0.0036 | 0.9235 ± 0.0014 | 0.7129 ± 0.0050 | 0.7099 ± 0.0050 |
| Binalpha | 0.4636 ± 0.0045 | 0.5883 ± 0.0026 | 0.3085 ± 0.0035 | 0.2886 ± 0.0037 |
| Ecoli | 0.5165 ± 0.0065 | 0.5626 ± 0.0048 | 0.4668 ± 0.0066 | 0.3545 ± 0.0076 |
| Corel_5k | 0.1896 ± 0.0011 | 0.2712 ± 0.0008 | 0.0836 ± 0.0004 | 0.0626 ± 0.0005 |
| MnistData_05 | 0.5566 ± 0.0080 | 0.4826 ± 0.0056 | 0.4163 ± 0.0073 | 0.3491 ± 0.0080 |
| MnistData_10 | 0.5736 ± 0.0062 | 0.4932 ± 0.0042 | 0.4289 ± 0.0050 | 0.3625 ± 0.0055 |
| Coil20Data_25 | 0.6619 ± 0.0107 | 0.7727 ± 0.0045 | 0.5903 ± 0.0091 | 0.5664 ± 0.0097 |
| Mpeg7 | 0.5883 ± 0.0036 | 0.7547 ± 0.0017 | 0.4398 ± 0.0037 | 0.4316 ± 0.0038 |
| TDT2_10 | 0.4736 ± 0.0062 | 0.4763 ± 0.0066 | 0.2716 ± 0.0030 | 0.1374 ± 0.0038 |
| Dataset | ACC | NMI | F-Score | ARI |
|---|---|---|---|---|
| Iris | 0.8867 ± 0.0000 | 0.7419 ± 0.0000 | 0.8111 ± 0.0000 | 0.7163 ± 0.0000 |
| Balance | 0.5290 ± 0.0061 | 0.1244 ± 0.0076 | 0.4673 ± 0.0051 | 0.1448 ± 0.0081 |
| Dermatology | 0.9340 ± 0.0072 | 0.8938 ± 0.0026 | 0.8979 ± 0.0126 | 0.8728 ± 0.0157 |
| Uspst | 0.7034 ± 0.0005 | 0.6184 ± 0.0002 | 0.5851 ± 0.0004 | 0.5361 ± 0.0005 |
| USPSdata_20 | 0.7042 ± 0.0003 | 0.6318 ± 0.0006 | 0.5928 ± 0.0010 | 0.5446 ± 0.0012 |
| USPSdata | 0.7116 ± 0.0001 | 0.6159 ± 0.0001 | 0.5890 ± 0.0002 | 0.5401 ± 0.0002 |
| MSRA25 | 0.5826 ± 0.0060 | 0.6238 ± 0.0055 | 0.4349 ± 0.0077 | 0.3760 ± 0.0089 |
| PalmData25 | 0.8409 ± 0.0027 | 0.9392 ± 0.0010 | 0.7781 ± 0.0036 | 0.7758 ± 0.0036 |
| Binalpha | 0.4877 ± 0.0029 | 0.5971 ± 0.0015 | 0.3237 ± 0.0023 | 0.3044 ± 0.0024 |
| Ecoli | 0.5346 ± 0.0062 | 0.5733 ± 0.0038 | 0.4796 ± 0.0062 | 0.3680 ± 0.0069 |
| Corel_5k | 0.1904 ± 0.0007 | 0.2695 ± 0.0005 | 0.0841 ± 0.0004 | 0.0628 ± 0.0004 |
| MnistData_05 | 0.5631 ± 0.0036 | 0.4876 ± 0.0022 | 0.4188 ± 0.0015 | 0.3526 ± 0.0016 |
| MnistData_10 | 0.5822 ± 0.0015 | 0.4988 ± 0.0014 | 0.4308 ± 0.0014 | 0.3651 ± 0.0016 |
| Coil20Data_25 | 0.7286 ± 0.0063 | 0.8055 ± 0.0033 | 0.6680 ± 0.0071 | 0.6500 ± 0.0075 |
| Mpeg7 | 0.6049 ± 0.0025 | 0.7594 ± 0.0013 | 0.4488 ± 0.0030 | 0.4407 ± 0.0030 |
| TDT2_10 | 0.4759 ± 0.0041 | 0.4769 ± 0.0044 | 0.2702 ± 0.0027 | 0.1338 ± 0.0035 |
| J | ACC | CI | avIT | ET (s) |
|---|---|---|---|---|
| 5 | 0.4804 ± 0.0024 | 7 | 7.15 | 62.32 |
| 10 | 0.4786 ± 0.0024 | 7 | 8.18 | 73.95 |
| 15 | 0.4787 ± 0.0022 | 7 | 8.4 | 80.4 |
| 20 | 0.4877 ± 0.0024 | 6 | 9.33 | 94.42 |
| 25 | 0.4823 ± 0.0022 | 7 | 9.23 | 98.72 |
| 30 | 0.4789 ± 0.0025 | 6 | 8.78 | 102.68 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Nigro, L.; Cicirelli, F. Clustering Performance of a Recombinator Hartigan–Wong Algorithm. Computers 2026, 15, 394. https://doi.org/10.3390/computers15060394
Nigro L, Cicirelli F. Clustering Performance of a Recombinator Hartigan–Wong Algorithm. Computers. 2026; 15(6):394. https://doi.org/10.3390/computers15060394
Chicago/Turabian StyleNigro, Libero, and Franco Cicirelli. 2026. "Clustering Performance of a Recombinator Hartigan–Wong Algorithm" Computers 15, no. 6: 394. https://doi.org/10.3390/computers15060394
APA StyleNigro, L., & Cicirelli, F. (2026). Clustering Performance of a Recombinator Hartigan–Wong Algorithm. Computers, 15(6), 394. https://doi.org/10.3390/computers15060394

