We employ the Wilcoxon rank-sum test and the Friedman test To evaluate the statistical significance of the experimental results. A significance level of 0.05 is chosen to examine if there are any noteworthy differences in the obtained results.
4.2. Experimental Analysis
Table 3 presents the classification error rates obtained by the compared algorithms on ten benchmark datasets. The experimental results clearly demonstrate that the proposed ABPSO outperforms BPSO, HHO, WOA, and ACO on most datasets. ABPSO achieves the lowest classification error on seven out of the ten datasets, including Dermatology, Ionosphere, Lung, Sonar, Spambase, Statlog, and Wine. Notably, ABPSO attains the best performance on the Ionosphere and Sonar datasets, which are characterized by complex feature structures. This observation further confirms the effectiveness and suitability of ABPSO for challenging feature selection tasks.
Compared with the standard BPSO, ABPSO demonstrates notable improvements across multiple datasets. For instance, on the Lung dataset, ABPSO reduces the classification error, corresponding to a relative improvement of approximately 27.7%. Similarly, on the Sonar and Ionosphere datasets, ABPSO performs better than BPSO by 7.5% and 14.5%, respectively. These consistent performances validate the effectiveness of the adaptive mechanisms incorporated into ABPSO, which enhance search behavior and help prevent premature convergence.
ABPSO outperforms HHO and WOA on most datasets. For example, on the Wine dataset, ABPSO achieves a classification error of 0.0280, compared to 0.0373 for HHO and 0.0358 for WOA. Similarly, on the Statlog dataset, ABPSO attains an error rate of 0.1446, whereas HHO and WOA produce high errors. It is worth noting that on the LSVT dataset, ACO delivers the best performance and it outperforms ABPSO. Nevertheless, ABPSO still performs comparably to, or better than, BPSO, HHO, and WOA on this dataset. These results indicate that the proposed adaptive strategy and hybrid learning mechanism enable ABPSO to achieve a more effective balance between exploration and exploitation than recent metaheuristic algorithms.
To further validate the overall performance differences among the compared feature selection algorithms, two nonparametric Friedman and Wilcoxon rank-sum tests are conducted on the evaluated datasets. The results are presented in
Table 4, where “>” indicates that an algorithm achieves the best performance on a given dataset, “=” denotes no statistically significant difference from the best result, and “-” signifies inferior performance.
As shown in
Table 4, the proposed ABPSO achieves the lowest average rank, which is substantially better than those of BPSO, HHO, WOA, and ACO. These results provide strong statistical evidence that ABPSO delivers consistently superior performance across diverse datasets. Specifically, ABPSO ranks first on seven out of ten datasets (Dermatology, Ionosphere, Lung, Sonar, Spambase, Statlog, and Wine), and acquires a close second place on two additional datasets (Iris and WDBC). The Friedman ranking results, along with the lowest average rank, statistically confirm the superiority and robustness of ABPSO in feature selection tasks.
The Wilcoxon rank-sum results clearly demonstrate the strong competitiveness of ABPSO. On the Iris dataset, ABPSO exhibits performance statistically equivalent (“=”) to the best result (achieved by BPSO), confirming its robustness even when not ranked first. In contrast, HHO and WOA fail to achieve either a best or equivalent outcome on any dataset. While ACO and BPSO achieve the best performance on the LSVT and WDBC datasets, respectively, ABPSO remains statistically competitive or superior across the remaining eight datasets.
Nonparametric tests indicate that ABPSO significantly outperforms BPSO, HHO, WOA, and ACO in most datasets. These findings strongly support the conclusion that ABPSO provides substantial and reliable advantages over other metaheuristic feature selection methods. Its performance can be explained by the integration of adaptive measures, multi-source guidance (utilizing pbest, gbest, and elite solutions), and the local refinement mechanism, which collectively enhance search diversity and enable more effective exploration of the solution space.
In addition to classification accuracy, computational efficiency is a critical factor for practical feature selection.
Table 5 summarizes the execution times (in seconds) of the compared methods on the ten benchmark datasets.
The proposed ABPSO exhibits highly competitive and often superior computational efficiency relative to the other metaheuristic methods. For most datasets, its execution time is comparable to or faster than those of BPSO, HHO, WOA, and ACO. Notably, on the Spambase dataset, which involves a large number of samples, ABPSO completes execution in 319.79 s, significantly outperforming BPSO, HHO, and WOA. Although ACO records the shortest time on this dataset, ABPSO still provides a favorable trade-off between runtime efficiency and classification performance.
While ACO demonstrates the shortest running time on several datasets (e.g., Spambase, LSVT, and Iris), earlier error analyses show that it does not consistently achieve the best classification performance. Across most datasets, ABPSO maintains stable and computationally reasonable running times. For example, on Dermatology, Ionosphere, Iris, Lung, Sonar, Statlog, and WDBC, the execution time of ABPSO approaches the fastest reported results, indicating reliable scalability and efficient computational behavior.
The computational efficiency of ABPSO can be attributed to the relatively simple update operations inherent in the PSO framework and its ability to identify compact feature subsets which significantly reduce classifier evaluation costs. Ultimately, ABPSO can be used in real-world applications that require both high predictive accuracy and acceptable computational costs because it can maintain low error rates while operating within practical time constraints.
Table 6 displays the number of selected features for each algorithm, and ABPSO exhibits a distinct advantage. Compared to BPSO, HHO, and WOA, ABPSO selects fewer features in seven out of nine datasets, with particularly outstanding dimensionality reduction on high-dimensional datasets LSVT, Lung, and Sonar. Although ACO selects the fewest features, it exhibits significantly higher classification error, indicating that excessive feature reduction can adversely affect model performance. In contrast, ABPSO effectively reduces feature dimensionality while maintaining low classification error.
4.3. Parameter Sensitivity Analysis
To investigate the individual contributions of the proposed adaptive and hybrid mechanisms, an ablation study is conducted by systematically removing key components of ABPSO. We evaluate three algorithm variants, including ABPSO-1 (retaining only the adaptive transfer function and adaptive learning coefficients), ABPSO-2 (using only hybrid learning with neighborhood elites), and ABPSO-3 (employing solely SA-based local search).
Table 7 summarizes the classification errors of these variants on the ten benchmark datasets. ABPSO consistently achieves the lowest or highly competitive error rates on most datasets. On the Lung dataset, ABPSO obtains an error of 0.1521, significantly outperforming ABPSO-1, ABPSO-2, and ABPSO-3. Similarly, on the Ionosphere and Sonar datasets, ABPSO delivers the best results among all variants. To achieve optimal feature selection performance, it is crucial to integrate adaptive mechanisms, hybrid learning, and local refinement simultaneously.
ABPSO-1 demonstrates clear improvements over BPSO on several datasets, such as Spambase and WDBC. However, its performance becomes inconsistent on more complex datasets like Ionosphere and Lung, where it underperforms compared to ABPSO. The incorporation of neighborhood elites generally improves performance over BPSO, especially on WDBC. Nevertheless, without adaptive mechanisms or local refinement, the gains achieved by ABPSO-2 remain limited and exhibit lower stability on the datasets. Meanwhile, ABPSO-3 improves accuracy on certain datasets, including Iris and Spambase, although its effect in isolation is modest. These observations indicate that local search operates most effectively when integrated with adaptive global guidance and diversified learning strategies.
The nonparametric tests reveal that ABPSO, ABPSO-2, and ABPSO-4 achieve similar statistical results on the Dermatology dataset. On the LSVT dataset, all four algorithms perform comparably. On the Sonar and Statlog datasets, ABPSO and ABPSO-4 produce similar outcomes. ABPSO, ABPSO-1, ABPSO-2, and ABPSO-4 achieve the best performance on 7, 2, 3, and 6 datasets, respectively.
In summary, ABPSO outperforms all partial variants across most datasets, so the three proposed components complement each other effectively. ABPSO achieve superior generalization performance in high-dimensional feature selection tasks with the proposed mechanisms.