Article

Adapted Multi-Strategy Fractional-Order Relative Pufferfish Optimization Algorithm for Feature Selection

School of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2025, 13(17), 2799; https://doi.org/10.3390/math13172799
Submission received: 9 July 2025 / Revised: 26 August 2025 / Accepted: 28 August 2025 / Published: 31 August 2025
(This article belongs to the Special Issue Advances in Metaheuristic Optimization Algorithms)

Abstract

In the development of artificial intelligence (AI) technology, training models on datasets to achieve higher predictive and reasoning performance has become a common technical approach. However, raw datasets often contain a large number of redundant features (RF), which can compromise the prediction accuracy and generalization ability of models. To effectively reduce RF in datasets, this work proposes a new variant of the Pufferfish Optimization Algorithm (POA), termed AMFPOA. First, by considering the information disparities among different groups of members and incorporating the concept of adaptive learning, an adaptive exploration strategy is introduced to enhance the algorithm's Global Exploration (GE) capability. Second, by dividing the whole swarm into multiple subswarms, a three-swarm search strategy is proposed; it allows targeted optimization schemes for different subswarms and effectively balances the algorithm across multiple metrics. Finally, leveraging the historical-memory property of Fractional-Order theory and the member weighting of Bernstein polynomials, a Fractional-Order Bernstein exploitation strategy is proposed, which significantly strengthens the algorithm's Local Exploitation (LE) capability. Experimental results on 23 real-world Feature Selection (FS) problems demonstrate that AMFPOA achieves an average success rate exceeding 87.5% in fitness function value (FFV), along with optimal rates of 86.5% in Classification Accuracy (CA) and 60.1% in feature subset size reduction. These results highlight its strong capability for RF elimination and establish AMFPOA as a promising FS method.

1. Introduction

Advances in AI technology have significantly propelled progress in fields such as education, economics, and healthcare [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]. Within AI, large-scale model training has gradually become a focal point of research attention [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. Training models on massive amounts of data to achieve higher predictive and reasoning performance has emerged as an effective route to performance gains [35,36,37,38,39,40,41,42,43,44,45,46,47,48]. Unfortunately, the data used for model training often contains a substantial amount of redundant information, which compromises the interpretability and generalization ability of the resulting models [49]. To improve the effectiveness of training, it is often necessary to eliminate RF from the original dataset so that models can be trained accurately and with a light footprint, a process known as feature dimension reduction [50]. Training models on datasets that have undergone feature dimension reduction frees up a significant amount of computational resources and enhances the reliability of AI systems [51]. Currently, employing metaheuristic-based FS methods to clean RF from the original dataset has become an effective technical approach [52]. Nevertheless, as the complexity of datasets continues to increase, these methods exhibit certain limitations. Therefore, to better perform FS on the original dataset, this paper aims to propose an efficient metaheuristic algorithm that achieves superior FS performance, thereby improving the usability of datasets and the reliability of models.
Metaheuristic algorithms are lightweight computational methods, mostly designed by simulating behaviors observed in nature. They are commonly grouped into four categories: swarm-based, human-based, physics-based, and evolutionary-based [53]. Typical representatives of swarm-based optimization algorithms include FFO [54], GWO [55], and AVOA [56]. Notable human-based optimization algorithms are WSO [57], MOA [58], and TOA [59]. Typical physics-based optimization algorithms are EO [60], AOA [61], and BHA [62]. Classic evolutionary-based optimization algorithms include GA [63], ES [64], and DE [65]. Thanks to their efficient search capabilities and relatively low computational costs, researchers have widely applied these algorithms to FS on datasets, aiming to achieve higher CA using fewer features.
In recent years, researchers have proposed numerous optimization-algorithm-based FS methods to improve RF elimination while raising the Classification Accuracy (CA) of feature subsets. For instance, Haouassi et al. introduced a novel binary grasshopper optimization FS algorithm and applied it to the feature subset selection problem. Experimental results demonstrated that their algorithm achieved a success rate of over 95% in terms of feature subset size across twenty datasets, outperforming five state-of-the-art comparative algorithms [66]. Mohan et al. proposed a binary teaching–learning-based optimization FS algorithm (FS-BTLBO). Experimental evidence showed that FS-BTLBO achieved higher accuracy with the fewest features on the Wisconsin Diagnostic Breast Cancer (WDBC) dataset, effectively classifying malignant and benign tumors [67]. Mohammed et al. presented a monarch butterfly optimization (MBO) FS algorithm. Results on eighteen benchmark datasets indicated that, compared to four metaheuristic algorithms, MBO achieved an average CA of up to 93% across all datasets while significantly reducing the number of selected features [68]. Hu et al. proposed an enhanced black widow optimization FS algorithm (SDABWO), whose FS performance was improved by introducing three learning strategies. Results on twelve standard datasets from the UCI repository confirmed that SDABWO can simultaneously improve CA and reduce the dimensionality of the original dataset, making it a promising FS approach [69]. Ewees et al. proposed an improved sine–cosine FS algorithm (ISOA) that gains performance by incorporating Lévy flight and mutation operators. A comparative study on twenty benchmark datasets demonstrated that ISOA attained superior feature subset CA compared to FS methods built on other metaheuristics [70]. Mohammed et al. introduced a binary version of the Horse herd Optimization Algorithm (HOA) for the FS problem, experimentally evaluating three transfer functions and three crossover operators to ensure FS efficiency. Experimental results on twenty-four real-world FS datasets indicated that it can effectively reduce RF in the original dataset, making it a promising FS approach [71]. Pan et al. proposed an improved grey wolf optimization FS algorithm for high-dimensional data, whose FS performance was enhanced by combining the ReliefF algorithm and Copula entropy with two novel search strategies. Results on ten high-dimensional, small-sample gene expression datasets showed that the algorithm selected fewer than 0.67% of the features while improving CA, demonstrating good performance and robustness in high-dimensional FS [72]. The shortcomings and advantages of these related techniques are summarized in Table 1.
The aforementioned studies validate the effectiveness of optimization-algorithm-based FS methods for feature dimension reduction and demonstrate that incorporating learning strategies can improve an algorithm's FS performance. However, despite the promising results achieved in the FS domain, as the dimensionality of the original datasets increases, current FS methods exhibit certain limitations. For instance, they often fail to jointly consider RF elimination and the improvement of CA, compromising classification performance. Therefore, this paper aims to propose an FS method that strikes a good balance among multiple metrics, effectively improving FS performance and eliminating RF. The Pufferfish Optimization Algorithm (POA) has been shown to possess efficient search capabilities [73], along with a simple structure and good application scalability. Nevertheless, as dataset complexity grows, POA still faces several challenges, including a tendency to become trapped in locally suboptimal feature subsets and insufficient LE capability. To further improve its FS performance, this paper introduces an enhanced version of POA, termed AMFPOA, which combines an adaptive exploration strategy, a three-swarm search strategy, and a Fractional-Order Bernstein exploitation strategy. Experimental results on 23 real-world FS problems indicate that AMFPOA balances the improvement of CA and the elimination of RF well. Compared to eight efficient optimization algorithms, AMFPOA demonstrates superior FS performance and can be considered a promising FS approach. The main contributions of this paper are as follows:
  • An adaptive exploration strategy is proposed, which effectively enhances the algorithm's GE capability and improves the CA of feature subsets.
  • A three-swarm search strategy is introduced to keep the algorithm balanced during its run and to improve the metric trade-off when solving FS problems.
  • A Fractional-Order Bernstein exploitation strategy is presented, which strengthens the algorithm's LE capability on FS problems and enables effective elimination of RF.
  • By combining the three improvement strategies above, an enhanced version of the POA, namely AMFPOA, is obtained, which boosts the algorithm's FS performance.
  • Applying AMFPOA to 23 real-world FS problems confirms that it is a promising FS method.
The remainder of this paper is organized as follows: Section 2 introduces the mathematical model and execution logic of the POA. Section 3 proposes an enhanced version of POA, termed AMFPOA, which combines three improvement strategies. Section 4 applies AMFPOA to 23 real-world FS problems, achieving excellent results in RF elimination and CA. Section 5 presents the conclusions and outlines future work. For convenience, all abbreviations used in this paper are summarized in Table 2.

2. Mathematical Model of the POA

This section introduces the mathematical model and implementation logic of the POA, an optimization algorithm developed by simulating the predatory attacks on pufferfish and the pufferfish's defensive behaviors. In POA, the predator's attack on the pufferfish models the GE stage of the algorithm, while the pufferfish's defensive behavior for evading the attack models the LE stage. When using POA to solve an optimization problem, a set of initial candidate solutions with search capability is first generated, a process known as the swarm initialization stage. The GE and LE stages of POA are then employed to iteratively refine the initialized swarm, improving the quality of the candidate solutions. The swarm initialization, GE, and LE stages of POA are described in detail below.

2.1. Swarm Initialization Stage

This section presents the mathematical model of the swarm initialization stage of the POA. When employing an optimization algorithm to solve a problem, an initialized swarm of reasonable quality must first be generated. Each member of the swarm represents a candidate solution to the problem being optimized. A swarm $X$ containing $N$ members is represented as in Equation (1).
$$X = \begin{bmatrix} X_1 \\ \vdots \\ X_i \\ \vdots \\ X_N \end{bmatrix}_{N \times 1} = \begin{bmatrix} x_{1,1} & \cdots & x_{1,j} & \cdots & x_{1,dim} \\ \vdots & & \vdots & & \vdots \\ x_{i,1} & \cdots & x_{i,j} & \cdots & x_{i,dim} \\ \vdots & & \vdots & & \vdots \\ x_{N,1} & \cdots & x_{N,j} & \cdots & x_{N,dim} \end{bmatrix}_{N \times dim} \quad (1)$$
where $X$ represents the initialized swarm, $X_i$ denotes the $i$-th member, $x_{i,j}$ denotes the $j$-th dimension of the $i$-th member, $dim$ is the dimensionality of each member (the number of variables of the problem to be optimized), and $N$ is the swarm size. Each $x_{i,j}$ is generated using Equation (2).
$$x_{i,j} = lb_j + r \cdot (ub_j - lb_j) \quad (2)$$
where $r$ denotes a random number in the interval [0, 1], while $lb_j$ and $ub_j$ represent the lower and upper boundary constraints, respectively, of the $j$-th variable of the problem. During the iterative process, a member's ability to solve the optimization problem is evaluated using the FFV. The vector of FFVs for the swarm's members is expressed as Equation (3).
$$F = \begin{bmatrix} F_1 \\ \vdots \\ F_i \\ \vdots \\ F_N \end{bmatrix}_{N \times 1} = \begin{bmatrix} F(X_1) \\ \vdots \\ F(X_i) \\ \vdots \\ F(X_N) \end{bmatrix}_{N \times 1} \quad (3)$$
where $F$ represents the vector of FFVs and $F_i$ denotes the FFV of the $i$-th member. For a minimization problem, a smaller FFV indicates a higher-quality member. After swarm initialization, POA refines the members through the GE stage and the LE stage, whose mathematical models are described below.
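To make the initialization stage concrete, the following NumPy sketch implements Equations (1)–(3). The paper's experiments were run in MATLAB, so this Python rendering, the helper names, and the toy sphere objective are illustrative assumptions rather than the authors' implementation.

```python
import numpy as np

def initialize_swarm(f, lb, ub, N, dim, rng):
    """Equation (2): x_{i,j} = lb_j + r * (ub_j - lb_j), r ~ U[0, 1]."""
    X = lb + rng.random((N, dim)) * (ub - lb)   # swarm matrix X, Equation (1)
    F = np.apply_along_axis(f, 1, X)            # FFV vector, Equation (3)
    return X, F

# Illustrative usage on a toy minimization problem (sphere function).
rng = np.random.default_rng(seed=42)
lb, ub = np.full(5, -10.0), np.full(5, 10.0)
X, F = initialize_swarm(lambda x: np.sum(x**2), lb, ub, N=40, dim=5, rng=rng)
```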

2.2. GE Stage

This section introduces the mathematical model of the GE stage of the POA. This stage is formulated by simulating the predator's attack on the pufferfish: the predator moves in a direction guided by the pufferfish's position, enabling members of the swarm to "jump" across the global solution space. By abstractly treating the swarm's members as predator positions, the algorithm lets each member explore the entire solution space. The position update mechanism of the GE stage is given by Equation (4).
$$x_{i,j}^{P1} = x_{i,j} + r_{i,j} \cdot (SP_{i,j} - I_{i,j} \cdot x_{i,j}) \quad (4)$$
where $x_{i,j}^{P1}$ represents the updated value of the $j$-th dimension of the $i$-th member after the GE stage, $r_{i,j}$ denotes a random number generated in the interval [0, 1], and $I_{i,j}$ is a constant randomly selected from the set {1, 2}. $SP_{i,j}$ denotes the pufferfish that the $i$-th member selects to attack, drawn from the set of pufferfish that are more likely to be attacked; this set is defined by Equation (5).
$$CP_i = \{ X_k : F_k < F_i \ \text{and} \ k \neq i \}, \quad i = 1, 2, \ldots, N, \ k \in \{1, 2, \ldots, N\} \quad (5)$$
where $CP_i$ denotes the set of candidate pufferfish that the $i$-th member may attack, $X_k$ is a member whose FFV is better than that of $X_i$, and $F_k$ is the FFV of $X_k$. The $i$-th member is then retained or replaced using Equation (6).
$$X_i = \begin{cases} X_i^{P1}, & F_i^{P1} < F_i \\ X_i, & \text{otherwise} \end{cases} \quad (6)$$
where $X_i^{P1}$ represents the updated state of the $i$-th member after the GE stage of the POA, and $F_i^{P1}$ denotes the FFV of $X_i^{P1}$. Updating members through the GE stage effectively secures the algorithm's GE capability and contributes to its optimization accuracy.

2.3. LE Stage

This section introduces the mathematical model of the LE stage of the POA. This stage is developed by simulating the pufferfish's defensive behavior when attacked: the pufferfish inflates itself into a spiky ball, and the predator flees within the pufferfish's vicinity, so its position is updated around the member, realizing the LE stage of the POA. The position update mechanism of the LE stage is given by Equation (7).
$$x_{i,j}^{P2} = x_{i,j} + (1 - 2 r_{i,j}) \cdot \frac{ub_j - lb_j}{t} \quad (7)$$
where $x_{i,j}^{P2}$ represents the updated value of the $j$-th dimension of the $i$-th member after the LE stage, $r_{i,j}$ denotes a random number generated in the interval [0, 1], and $t$ is the current iteration count. The $i$-th member is then retained or replaced using Equation (8).
$$X_i = \begin{cases} X_i^{P2}, & F_i^{P2} < F_i \\ X_i, & \text{otherwise} \end{cases} \quad (8)$$
where $X_i$ represents the updated state of the $i$-th member after the LE stage of the POA, and $F_i^{P2}$ denotes the FFV of $X_i^{P2}$. Updating members through the LE stage effectively secures the algorithm's LE capability, thereby contributing to its optimization accuracy.

2.4. Implementation of the POA

The preceding sections introduced the mathematical models of the swarm initialization, GE, and LE stages of the POA. This section elaborates on the execution logic of POA when solving optimization problems. The pseudocode of POA is presented in Algorithm 1.
Algorithm 1: Pseudocode for POA
Input: Parameters $N$, $dim$, $ub$, $lb$, $T$.
Output: Best solution ($X_{best}$).
1.  Initialize the swarm based on Equation (1) and compute the FFVs based on Equation (3).
2.  for $t = 1 : T$
3.    for $i = 1 : N$
4.      Exploration stage:
5.      for $j = 1 : dim$
6.        Compute the $j$-th dimension of the $i$-th member based on Equation (4).
7.      end for
8.      Use Equation (6) to retain or replace member $X_i$.
9.      Exploitation stage:
10.     for $j = 1 : dim$
11.       Compute the $j$-th dimension of the $i$-th member based on Equation (7).
12.     end for
13.     Use Equation (8) to retain or replace member $X_i$.
14.   end for
15.   Save the best solution $X_{best}$.
16. end for
17. Output the best solution $X_{best}$.
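Algorithm 1 translates almost line-for-line into the following Python sketch, which reuses the `initialize_swarm` helper from the Section 2.1 sketch. The boundary clipping is our addition, since the pseudocode does not state how out-of-range positions are handled, and the whole rendering is illustrative rather than the authors' code.

```python
import numpy as np

def poa(f, lb, ub, N, dim, T, rng):
    """Minimal POA loop following Algorithm 1, Equations (4)-(8)."""
    X, F = initialize_swarm(f, lb, ub, N, dim, rng)
    for t in range(1, T + 1):
        for i in range(N):
            # Exploration stage: attack a better-ranked pufferfish, Eqs. (4)-(5).
            candidates = np.flatnonzero(F < F[i])      # candidate set CP_i, Eq. (5)
            if candidates.size > 0:
                SP = X[rng.choice(candidates)]
                I = rng.choice([1, 2])
                trial = np.clip(X[i] + rng.random(dim) * (SP - I * X[i]), lb, ub)
                ft = f(trial)
                if ft < F[i]:                          # greedy preservation, Eq. (6)
                    X[i], F[i] = trial, ft
            # Exploitation stage: local move that shrinks as t grows, Eq. (7).
            trial = np.clip(X[i] + (1 - 2 * rng.random(dim)) * (ub - lb) / t, lb, ub)
            ft = f(trial)
            if ft < F[i]:                              # greedy preservation, Eq. (8)
                X[i], F[i] = trial, ft
    return X[np.argmin(F)], F.min()
```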

3. Mathematical Model of the AMFPOA

The original POA exhibits deficiencies when addressing the FS problem, including insufficient GE capability, inadequate LE capability, and an imbalance between GE and LE. These shortcomings often trap the algorithm in locally suboptimal feature subsets, reducing redundant-feature elimination and compromising the CA of the selected subsets. To mitigate these limitations, this section proposes an enhanced version of POA, termed AMFPOA, which combines three improvement strategies aimed at raising the algorithm's FS performance and subset CA. First, to address POA's inadequate GE capability, an adaptive exploration strategy is introduced. Its core idea is to strengthen GE by considering information disparities among members from different groups and incorporating adaptive learning, enabling members to explore the solution space effectively. Second, to tackle the imbalance between the GE and LE stages, a three-swarm search strategy is proposed. Considering the diverse characteristics of members, the whole swarm is divided into multiple subswarms, and tailored optimization schemes are applied to each subswarm, balancing the two stages and improving the algorithm's ability to escape local optima. Finally, to address POA's insufficient LE capability, a Fractional-Order Bernstein exploitation strategy is introduced. Leveraging the historical-memory property of Fractional-Order theory and the weighted averaging of Bernstein polynomials, this strategy effectively guides members' exploitation behavior, strengthening the algorithm's LE capability and yielding higher CA on the FS problem. The following sections introduce these three improvement strategies in detail.

3.1. Adaptive Exploration Strategy

The original POA demonstrates inadequate GE capability on complex FS problems, primarily because of the growing number of features in the original dataset. This deficiency reduces Population Diversity (PD), which hampers the algorithm's ability to locate regions that may contain good feature subset combinations and thus diminishes the CA of the subsets. A search strategy with efficient exploration capability is therefore needed. Zhang et al. [74] showed that allowing members to learn from the gaps between different types of members within the swarm can effectively enhance an algorithm's GE ability. Inspired by this, we incorporate this learning concept to strengthen GE on FS problems. At the same time, since boosting GE may reduce the convergence of the solution set, a learning factor is employed to control the gap-based learning so that the convergence speed is not significantly affected while GE improves.
This section introduces the core idea and mathematical model of the adaptive exploration strategy. Its main concept is that members learn from the disparities between themselves and others with different attributes, while also accounting for each member's learning capability and the acceptability of those disparities, in order to enhance the algorithm's GE capability. A schematic of the strategy is shown in Figure 1. Four distinct types of disparities are considered: between the best member and a relatively superior member ($gap_1$), between the best member and a relatively inferior member ($gap_2$), between a relatively superior and a relatively inferior member ($gap_3$), and between two different randomly selected members ($gap_4$). These disparities are expressed in Equation (9).
$$gap_1 = X_{best} - X_{better}, \quad gap_2 = X_{best} - X_{worse}, \quad gap_3 = X_{better} - X_{worse}, \quad gap_4 = X_{rand1} - X_{rand2} \quad (9)$$
where $X_{best}$ denotes the best member in the swarm; $X_{better}$ is a relatively superior member, defined as one randomly selected from the five members with the smallest fitness values; $X_{worse}$ is a relatively inferior member, defined as one randomly selected from the five members with the largest fitness values; and $X_{rand1}$ and $X_{rand2}$ are two distinct random members of the swarm. The acceptability levels of the different disparities vary and are expressed in Equation (10).
$$LF_k = \frac{\left\| gap_k \right\|}{\sum_{k=1}^{4} \left\| gap_k \right\|}, \quad k = 1, 2, 3, 4 \quad (10)$$
where $LF_k$ represents the acceptability level of the $k$-th group of member disparities. Since members with different characteristics possess different capabilities for learning from disparities, the learning capability of a member is defined in Equation (11).
$$SF_i = \frac{F_i}{F_{max}}, \quad 1 \le i \le N \quad (11)$$
where $SF_i$ denotes the learning capability of the $i$-th member. The process by which the $i$-th member learns from the $k$-th group of disparities is expressed in Equation (12).
$$KA_k = \arctan\left(1 - \frac{t}{T}\right) \cdot SF_i \cdot LF_k \cdot gap_k, \quad k = 1, 2, 3, 4 \quad (12)$$
where $KA_k$ represents the amount of knowledge the $i$-th member acquires by learning from the $k$-th group of disparities, and $\arctan(\cdot)$ is the arctangent function. As $t$ increases, $\arctan(1 - t/T)$ decreases non-linearly from $\pi/4$ to 0. This design simulates a gradual reduction of the learning volume in the later iterations: one cannot simply keep boosting GE while neglecting the members' ability to aggregate late in the run, and tapering the learning effectively strengthens convergence in the later stages. $T$ denotes the Maximum Number (MN) of iterations. The new state of the $i$-th member after learning from the four groups of disparities is expressed in Equation (13).
$$X_i^{new} = X_i + KA_1 + KA_2 + KA_3 + KA_4 \quad (13)$$
where $X_i^{new}$ represents the new state of the $i$-th member after the adaptive exploration update. The $i$-th member is then retained or replaced using Equation (14).
$$X_i = \begin{cases} X_i^{new}, & F_i^{new} < F_i \\ X_i, & \text{otherwise} \end{cases} \quad (14)$$
where $F_i^{new}$ denotes the FFV of $X_i^{new}$. By adaptively learning from different information disparities, each member effectively enhances the algorithm's GE capability, improving its ability to eliminate RF.
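A minimal sketch of one adaptive-exploration update follows. Taking the Euclidean norm of each gap when normalizing in Equation (10) is our reading of the formula (the gaps are vectors), and the guard for a degenerate all-zero case is our addition.

```python
import numpy as np

def adaptive_exploration(X, F, i, t, T, rng):
    """One adaptive-exploration update of member i, Eqs. (9)-(13)."""
    order = np.argsort(F)                              # ascending FFV: best first
    X_best = X[order[0]]
    X_better = X[rng.choice(order[:5])]                # one of the five best members
    X_worse = X[rng.choice(order[-5:])]                # one of the five worst members
    r1, r2 = rng.choice(len(X), size=2, replace=False)
    gaps = np.stack([X_best - X_better, X_best - X_worse,
                     X_better - X_worse, X[r1] - X[r2]])          # Eq. (9)
    norms = np.linalg.norm(gaps, axis=1)
    LF = norms / norms.sum() if norms.sum() > 0 else np.full(4, 0.25)  # Eq. (10)
    SF = F[i] / F.max()                                # learning capability, Eq. (11)
    KA = np.arctan(1 - t / T) * SF * LF[:, None] * gaps           # Eq. (12)
    return X[i] + KA.sum(axis=0)                       # Eq. (13); retained via Eq. (14)
```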

3.2. Three-Swarm Search Strategy

As the dimensionality of the FS problem increases, the algorithm must search an exponentially growing number of candidate feature subsets. This requires well-balanced exploration and exploitation stages so the algorithm can locate promising local regions and then probe them further to raise the CA of the feature subset. When the original POA is applied to the FS problem, its CA suffers from an inadequate balance between the GE and LE stages, so a mechanism is needed to improve that balance. Shen et al. [75] introduced the concept of multi-swarm evolution: by dividing the swarm into sub-swarms with distinct characteristics and updating each in a targeted way, they effectively improved the balance between the two stages. Inspired by this idea, this section adopts a three-swarm search strategy. Specifically, members are divided into three sub-swarms according to their FFVs, using an averaging-based split of 3:4:3. This split preserves the members' GE capability while enhancing their LE ability, contributing to balance during the run. The members ranking in the top 30% by FFV form the exploitation sub-swarm; LE operations are applied to it to better tune CA within local regions. The members ranking in the bottom 30% by FFV form the exploration sub-swarm; GE operations are applied to it to explore wide ranges of the solution space and strengthen the algorithm's GE capability. The remaining members form the exploration-exploitation sub-swarm, to which both GE and LE behaviors are applied, achieving a better balance between the two stages and improving the algorithm's ability to escape locally suboptimal feature subset traps. The swarm division is illustrated in Figure 2.
The exploration sub-swarm primarily performs GE operations and updates its positions using Equation (15).
$$X_i^{new} = X_i + \arctan\left(1 - \frac{t}{T}\right) \cdot rand \cdot (X_{rand} - X_i) \quad (15)$$
where $rand$ denotes a random number generated in the interval [0, 1], and $X_{rand}$ is a random member of the swarm different from $X_i$. Mirjalili et al. [76] showed that spiral position updates based on a cosine function can effectively enhance an algorithm's LE capability. To strengthen the exploitation ability of the exploitation sub-swarm, we therefore adopt this cosine-spiral update and control the spiral factor with the iteration proportion, so that the spiral strategy boosts LE without driving the algorithm into local optima. In summary, the exploitation sub-swarm updates its positions using Equation (16).
$$X_i^{new} = \begin{cases} X_{best} + \left(1 - \frac{t}{T}\right) \cdot (rand - 1) \cdot (X_{rand} - X_i), & P \le 0.5 \\ X_{best} + rand \cdot \cos\left(2\pi \cdot \frac{1}{1 + e^{t/T}}\right) \cdot (X_{best} - X_i), & \text{otherwise} \end{cases} \quad (16)$$
where $P$ represents a random number in the interval [0, 1]. The exploration-exploitation sub-swarm leans towards both GE and LE operations and updates its positions using Equation (17).
$$X_i^{new} = \begin{cases} \text{Exploration phase: execute Equation (15)}, & rand \le 0.5 \\ \text{Exploitation phase: execute Equation (16)}, & rand > 0.5 \end{cases} \quad (17)$$
After these multi-swarm search behaviors, each member is retained or replaced using Equation (14). Updating members through the three-swarm search strategy achieves a good balance between the GE and LE stages, increasing the probability that the algorithm escapes locally suboptimal feature subset traps.
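One pass of the three-swarm search can be sketched as follows. The rank thresholds implement the 3:4:3 split described above; the helper name is ours, and the greedy preservation of Equation (14) is assumed to be applied to the returned positions afterwards.

```python
import numpy as np

def three_swarm_update(X, F, t, T, rng):
    """One pass of the 3:4:3 three-swarm search, Eqs. (15)-(17)."""
    N, dim = X.shape
    order = np.argsort(F)                       # ascending FFV: best first
    rank = np.empty(N, dtype=int)
    rank[order] = np.arange(N)
    X_best, k = X[order[0]].copy(), int(0.3 * N)
    X_new = X.copy()
    for i in range(N):
        X_rand = X[rng.choice([j for j in range(N) if j != i])]
        explore = X[i] + np.arctan(1 - t / T) * rng.random() * (X_rand - X[i])   # Eq. (15)
        if rng.random() <= 0.5:                                                  # Eq. (16), first branch
            exploit = X_best + (1 - t / T) * (rng.random() - 1) * (X_rand - X[i])
        else:                                                                    # Eq. (16), cosine-spiral branch
            exploit = X_best + rng.random() * np.cos(2 * np.pi / (1 + np.exp(t / T))) * (X_best - X[i])
        if rank[i] < k:                 # top 30% by FFV: exploitation sub-swarm
            X_new[i] = exploit
        elif rank[i] >= N - k:          # bottom 30%: exploration sub-swarm
            X_new[i] = explore
        else:                           # middle 40%: mixed behavior, Eq. (17)
            X_new[i] = explore if rng.random() <= 0.5 else exploit
    return X_new
```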

3.3. Fractional-Order Bernstein Exploitation Strategy

The original POA suffers from insufficient LE capability on the FS problem, which costs some CA in the selected feature subsets, so an effective LE strategy is needed to improve the algorithm's convergence accuracy. Chen et al. [77] showed that applying fractional-order theory to weight historical members yields members with better representativeness: the weighted members absorb knowledge from past members and improve their own quality. Based on this, we use representative fractional-order weighted members for guidance to strengthen the algorithm's LE capability; since the literature [77] verified that using members from the previous three generations achieves favorable results, we adhere to this proven setting when weighting the fractional-order members. Moreover, Wang et al. [78] and Zhang et al. [79] indicated that weighting members with different characteristics via Bernstein polynomials can also improve member quality and strengthen exploitation performance. We therefore integrate the fractional-order and Bernstein ideas and propose a Fractional-Order Bernstein exploitation strategy to improve the algorithm's LE performance. Compared with traditional exploitation strategies such as Lévy flight and chaos-based schemes, the proposed strategy avoids both the risk of local-optimum entrapment associated with Lévy flight and the irregularity introduced by chaos; it therefore promotes effective, targeted exploitation rather than a blind exploitation process. Specifically, the strategy combines the historical-learning property of fractional-order weighting with the weighted nature of Bernstein weighting, significantly improving the algorithm's LE capability. It guides the swarm through fractional-order weighted members and Bernstein weighted members to raise the CA of the feature subsets. The fractional-order theory enhances a member's LE capability by incorporating its historical positions, fully utilizing this historical data. Because members' positions are refined progressively, the overall quality of the current iteration tends to exceed that of previous generations, so the weight given to historical positions should diminish as the time lapse increases. This both preserves the algorithm's LE capability and effectively leverages historical information. The fractional-order weighted members are calculated using Equation (18).
$$X_i^{frac} = \frac{1}{1!} q \, X_i^t + \frac{1}{2!} q (1-q) \, X_i^{t-1} + \frac{1}{3!} q (1-q)(2-q) \, X_i^{t-2} + \frac{1}{4!} q (1-q)(2-q)(3-q) \, X_i^{t-3} + rand \cdot (X_i^t - X_r^t) \quad (18)$$
where $X_i^{frac}$ represents the fractional-order weighted member; $X_i^t$, $X_i^{t-1}$, $X_i^{t-2}$, and $X_i^{t-3}$ denote the positions of the $i$-th member at iterations $t$, $t-1$, $t-2$, and $t-3$, respectively; and $X_r^t$ is the position of a random member at iteration $t$. The adaptive factor $q$ is calculated using Equation (19).
$$q = \frac{1}{1 + e^{-L}} \cdot \cos(2 \pi L) \quad (19)$$
where $L$ is defined in Equation (20). The value of $q$ as the iteration number $t$ increases is depicted in Figure 3: $q$ exhibits a chaotic, fluctuating behavior, and the fluctuations become more pronounced after the midpoint of the iteration process. This setting mitigates the tendency of fractional-order weighting to fall into local optima; by injecting fluctuations through $q$, the risk of stagnation is reduced, and these fluctuations should be more significant in the later iterations than in the early ones, which motivates the way $q$ is determined.
$$L = -1 \cdot \left(\frac{t}{T}\right)^2 \cdot rand + 1 \quad (20)$$
Moreover, the second-order Bernstein polynomials are given in Equation (21) and visualized in Figure 4.
$$B_{0,2}(p) = (1-p)^2, \quad B_{1,2}(p) = 2 p (1-p), \quad B_{2,2}(p) = p^2 \quad (21)$$
As the figure shows, when $p$ lies in the interval [0, 1], the three polynomials sum to 1, so they can serve as member weights. We therefore employ the second-order Bernstein polynomials to weight the best member, a superior member, and a random member, generating a Bernstein-weighted member as expressed in Equation (22).
$$X^{Bern} = B_{0,2}(p) \cdot X_{best} + B_{1,2}(p) \cdot X_{better} + B_{2,2}(p) \cdot X_{rand} \quad (22)$$
where $X^{Bern}$ represents the second-order Bernstein-weighted member. After the fractional-order weighted member and the Bernstein-weighted member are generated, the swarm's members apply the Fractional-Order Bernstein exploitation strategy through Equation (23), which significantly enhances the algorithm's LE capability and effectively secures the CA of the feature subsets.
$$X_i^{new} = X_i + 0.5 \cdot \left(1 - \frac{t}{T}\right) \cdot (X_i^{frac} - X_i) + 0.5 \cdot \left(1 - \frac{t}{T}\right) \cdot (X^{Bern} - X_i) \quad (23)$$
The $i$-th member is then retained or replaced using Equation (14). Guiding the swarm with the Fractional-Order Bernstein exploitation strategy not only enhances the algorithm's LE capability but also maintains a degree of randomness, preventing the algorithm from easily getting trapped in locally suboptimal feature subsets.
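The strategy can be sketched as follows. The choice to keep the position history as the swarm matrices of the last four iterations and the helper name are our assumptions about implementation details the text leaves open.

```python
import numpy as np

def frac_bernstein_update(X_hist, i, X, F, t, T, rng):
    """Fractional-order Bernstein exploitation for member i, Eqs. (18)-(23).
    X_hist = [X(t), X(t-1), X(t-2), X(t-3)]: swarm matrices of the last four iterations."""
    L = -1 * (t / T) ** 2 * rng.random() + 1                 # adaptive factor L, Eq. (20)
    q = 1 / (1 + np.exp(-L)) * np.cos(2 * np.pi * L)         # chaotic weight q, Eq. (19)
    w = [q,                                                  # 1/1! * q
         q * (1 - q) / 2,                                    # 1/2! * q(1-q)
         q * (1 - q) * (2 - q) / 6,                          # 1/3! * q(1-q)(2-q)
         q * (1 - q) * (2 - q) * (3 - q) / 24]               # 1/4! * q(1-q)(2-q)(3-q)
    X_r = X[rng.integers(len(X))]
    X_frac = sum(wk * H[i] for wk, H in zip(w, X_hist)) \
             + rng.random() * (X_hist[0][i] - X_r)           # Eq. (18)
    p = rng.random()
    B = [(1 - p) ** 2, 2 * p * (1 - p), p ** 2]              # Bernstein weights, Eq. (21)
    order = np.argsort(F)
    X_bern = (B[0] * X[order[0]] + B[1] * X[rng.choice(order[:5])]
              + B[2] * X[rng.integers(len(X))])              # Eq. (22)
    return X[i] + 0.5 * (1 - t / T) * ((X_frac - X[i]) + (X_bern - X[i]))  # Eq. (23)
```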

3.4. Implementation of the AMFPOA

To remedy the shortcomings of the POA on the FS problem, this section has proposed three improvement strategies, yielding an enhanced version of the POA referred to as AMFPOA. To visualize the execution logic of AMFPOA when solving optimization problems, Figure 5 presents its execution flowchart.
Below, we analyze in detail the steps involved in employing AMFPOA to solve optimization problems, as depicted in Figure 5.
Step 1:
Set the operational parameters, namely the swarm size $N$ and the MN of iterations $T$. Initialize the member index $i = 1$, the iteration counter $t = 0$, and the dimension index $j = 1$.
Step 2:
Update the best solution based on the current state of the swarm.
Step 3:
If $rand < 0.5$, update the member's position using Equation (4); otherwise, update it using Equation (13). Here, $rand$ denotes a randomly generated number in the range [0, 1].
Step 4:
Retain or replace the member using Equation (6) or Equation (14), according to which update was applied in Step 3.
Step 5:
Update the member's position using Equations (15) through (17).
Step 6:
If $rand < 0.5$, update the member's position using Equation (7); otherwise, update it using Equation (23).
Step 7:
Retain or replace the member again, this time using Equation (8) or Equation (14), according to which update was applied in Step 6.
Step 8:
If $i < N$, increment $i = i + 1$, reset $j = 1$, and return to Step 3. If $i = N$, save the best solution found in the current iteration and increment the iteration counter $t = t + 1$. Then check the termination condition: if $t \le T$, where $T$ is the MN of iterations, reset $i = 1$ and $j = 1$ and return to Step 2; if $t > T$, terminate the iteration and output the best solution.
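Putting Steps 3-7 together, one pass over member $i$ looks roughly like the sketch below, which reuses `adaptive_exploration` (Section 3.1) and `frac_bernstein_update` (Section 3.3) from the earlier sketches; the `greedy_keep` helper and the boundary clipping are our additions.

```python
import numpy as np

def greedy_keep(X, F, i, trial, f, lb, ub):
    """Steps 4 and 7: keep the trial only if it improves the FFV, Eqs. (6)/(8)/(14)."""
    trial = np.clip(trial, lb, ub)
    ft = f(trial)
    if ft < F[i]:
        X[i], F[i] = trial, ft

def amfpoa_member_pass(X, F, i, t, T, f, lb, ub, X_hist, rng):
    dim = X.shape[1]
    # Step 3: POA exploration (Eq. 4) or adaptive exploration (Eq. 13), each with probability 0.5.
    if rng.random() < 0.5:
        candidates = np.flatnonzero(F < F[i])
        if candidates.size > 0:
            SP, I = X[rng.choice(candidates)], rng.choice([1, 2])
            greedy_keep(X, F, i, X[i] + rng.random(dim) * (SP - I * X[i]), f, lb, ub)
    else:
        greedy_keep(X, F, i, adaptive_exploration(X, F, i, t, T, rng), f, lb, ub)  # Step 4
    # Step 5: three-swarm position update over the whole swarm, Eqs. (15)-(17).
    # Step 6: POA exploitation (Eq. 7) or fractional-order Bernstein exploitation (Eq. 23).
    if rng.random() < 0.5:
        greedy_keep(X, F, i, X[i] + (1 - 2 * rng.random(dim)) * (ub - lb) / t, f, lb, ub)
    else:
        greedy_keep(X, F, i, frac_bernstein_update(X_hist, i, X, F, t, T, rng), f, lb, ub)  # Step 7
```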

3.5. Time Complexity

This subsection compares the time complexity of AMFPOA and POA. The original POA has two main components: swarm initialization and algorithm iteration. We take the evaluation of the FFV as the fundamental step for estimating time complexity. In the initialization stage, the complexity is $O(N \cdot Dim)$, where $N$ is the number of members in the swarm and $Dim$ is the dimensionality of each member. A single iteration costs $O(2 \cdot N \cdot Dim)$, so $T$ iterations, where $T$ is the MN of iterations, cost $O(2 \cdot N \cdot Dim \cdot T)$. The overall time complexity of POA is therefore $O(N \cdot Dim \cdot (1 + 2T))$. The initialization of the improved AMFPOA has the same complexity as POA, $O(N \cdot Dim)$. During iteration, AMFPOA mixes its member position updates with those of the original POA with equal probability, so with FFV evaluation as the basic step, its iteration complexity remains $O(2 \cdot N \cdot Dim \cdot T)$ and its total complexity is likewise $O(N \cdot Dim \cdot (1 + 2T))$, the same as POA. This analysis counts FFV evaluations; from the perspective of program execution, AMFPOA requires additional intermediate-variable calculations before each position update, which increases its computational cost. Nevertheless, the cost difference between AMFPOA and POA does not span orders of magnitude and can be considered negligible: a small amount of extra time is traded for higher accuracy.

4. Results and Discussion

This section evaluates the FS performance of AMFPOA. We conducted experiments on 23 real-world FS datasets and compared the results with eight state-of-the-art algorithms. Detailed information on the 23 FS datasets is presented in Table 3, and the parameter settings of the eight comparison algorithms are outlined in Table 4. To ensure sound statistics, each experiment was run 30 times independently. We evaluate the FS performance of AMFPOA objectively and comprehensively by analyzing the FFV, CA, feature subset size, and Friedman non-parametric test results over these 30 runs. For fairness, the swarm size was set to 40 and the MN of iterations to 100 in all comparative experiments. All experiments were coded and executed in MATLAB R2021b on a Windows 11 operating system. Below, we analyze the FS problem model and the experimental results in detail; first, we describe the comparison algorithms.
  • Detailed description of the comparison algorithms:
(1) POA [73]: A novel swarm-based optimization algorithm proposed in 2024 that demonstrated superior performance against 12 state-of-the-art optimization algorithms on the CEC 2017 test suite and 26 real-world problems. Its execution logic for solving optimization problems is consistent with that of AMFPOA.
(2) ALSHADE [80]: An improved swarm-based optimization algorithm introduced in 2022 that achieved better performance than six champion algorithms on CEC 2014, CEC 2018, and unmanned aerial vehicle resource allocation problems. Its execution logic is in line with that of AMFPOA.
(3) PLO [81]: A new swarm-based optimization algorithm put forward in 2024 that outperformed 17 optimization algorithms on CEC 2022 and multi-threshold image segmentation problems. Its execution logic is consistent with that of AMFPOA.
(4) LSHADE [82]: An improved swarm-based optimization algorithm proposed in 2014, champion of the 2014 CEC competition, that showed better performance than multiple champion algorithms on the CEC 2014 problems. Its execution logic is the same as that of AMFPOA.
(5) BEGJO [83]: An improved binary swarm-based optimization algorithm introduced in 2024 that achieved better performance than various optimization algorithms on the FS problem. Its execution logic is consistent with that of AMFPOA.
(6) IPOA [84]: An improved swarm-based optimization algorithm proposed in 2024 that achieved better performance than multiple optimization algorithms on CEC 2017 and scheduling problems. Its execution logic is in line with that of AMFPOA.
(7) MCOA [85]: An improved swarm-based optimization algorithm put forward in 2024 that outperformed various algorithms on the CEC 2020 problems and the FS problem. Its execution logic is consistent with that of AMFPOA.
(8) QHDBO [86]: An improved swarm-based optimization algorithm introduced in 2024 that demonstrated better performance than multiple algorithms on 37 optimization problems. Its execution logic is consistent with that of AMFPOA.

4.1. Establishment of the FS Problem Model

This section establishes the FS problem model. The FS problem aims to remove redundant feature information from the original dataset while improving the CA of the selected feature subset. Specifically, by searching the numerous feature combinations of the original dataset, we seek a feature subset that balances subset size and CA, thereby representing the original dataset effectively and reducing the cost of model training. The objective function therefore incorporates the classification error rate and the feature subset size, with the fitness function defined in Equation (24).
$$\min f(X_i) = \lambda_1 \cdot error + \lambda_2 \cdot \frac{R}{n} \quad (24)$$
where $X_i$ represents the $i$-th feature subset combination, $error$ denotes the classification error rate obtained with feature subset $X_i$, $n$ is the total number of features in the original dataset, and $R$ is the number of features in the selected subset. $\lambda_1$ is a constant in the interval [0, 1], with $\lambda_2 = 1 - \lambda_1$. For the choice of $\lambda_1$, we follow the literature [87,88], where $\lambda_1$ is consistently set to 0.9 to ensure the model's applicability; this well-established setting improves performance on real-world FS problems and matches practical application requirements, so setting $\lambda_1 = 0.9$ in this paper is reasonably justified. By treating CA as the most important performance metric, our FS model is predisposed to select features that substantially improve CA, which not only reduces RF effectively but also boosts CA, endowing the model with a degree of generalizability in application.
The FS problem is a classic combinatorial optimization problem over discrete variables. When applying AMFPOA to it, the continuous real-valued members generated during the iterations must therefore be converted into binary (0-1) discrete values, where 0 indicates that the corresponding feature is not selected and 1 indicates that it is. Below, we describe the numerical conversion and computational steps of AMFPOA-based FS; the workflow is illustrated in Figure 6.
Step 1:
Randomly sample 80% of the data from the original dataset as the training set, and designate the remaining 20% as the test set.
Step 2:
Convert the continuous real-valued member $X_i = (x_{i,1}, x_{i,2}, \ldots, x_{i,j}, \ldots, x_{i,dim})$ into a discrete member $Y_i = (y_{i,1}, y_{i,2}, \ldots, y_{i,j}, \ldots, y_{i,dim})$ using Equation (25). Here, $y_{i,j} = 1$ indicates that the $j$-th feature is selected in the $i$-th feature subset combination, while $y_{i,j} = 0$ indicates its exclusion.
$$y_{i,j} = \begin{cases} 1, & x_{i,j} < 0.5 \\ 0, & x_{i,j} \ge 0.5 \end{cases}, \quad i = 1, 2, \ldots, N, \; j = 1, 2, \ldots, dim \quad (25)$$
Step 3:
Select the feature subset from the original dataset according to the discrete member $Y_i$ obtained from the continuous-to-discrete conversion.
Step 4:
Compute the CA of the selected feature subset using the K-Nearest Neighbors (KNN) classifier with $K = 5$, evaluated with 5-fold cross-validation.
Step 5:
Calculate the FFV of the selected feature subset combination using Equation (24).
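The evaluation pipeline of Steps 2-5 can be sketched with scikit-learn as follows. The helper name, the guard against empty subsets, and the use of `cross_val_score` are our assumptions about implementation details the text leaves open (the paper's experiments were run in MATLAB).

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def fs_fitness(x, data, labels, lam1=0.9):
    """Steps 2-5: binarize a continuous member (Eq. 25) and score it (Eq. 24)."""
    mask = x < 0.5                                     # Eq. (25): feature selected when x_{i,j} < 0.5
    if not mask.any():                                 # guard: an empty subset cannot be classified
        return 1.0
    acc = cross_val_score(KNeighborsClassifier(n_neighbors=5),
                          data[:, mask], labels, cv=5).mean()   # Step 4: KNN, 5-fold CV
    error = 1.0 - acc
    R, n = mask.sum(), data.shape[1]
    return lam1 * error + (1 - lam1) * R / n           # Step 5: fitness, Eq. (24)

# Step 1 (not shown): an 80/20 split of the original dataset into training and test sets.
```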

4.2. Sensitivity Analysis

This section conducts a sensitivity analysis on the swarm size and the number of iterations for AMFPOA. First, to set the swarm size reasonably, we employ the control-variable method: we fix the MN of iterations at 100 and consider five swarm sizes of 20, 30, 40, 60, and 100, then determine the most suitable swarm size by analyzing AMFPOA's performance under each setting. The results are illustrated in Figure 7. After establishing an appropriate swarm size, we set the MN of iterations to 50, 80, 100, 150, and 200, respectively, to observe the convergence behavior of AMFPOA and select a suitable iteration budget. The results are presented in Figure 8.
Figure 7 displays a bar chart of AMFPOA's average rankings on the 23 FS problems under the different swarm sizes. With swarm sizes of 20 and 30, AMFPOA achieved average rankings of 4.39 and 2.39, respectively, whereas a swarm size of 40 achieved an average ranking of 1.17. The main reason is that an overly small swarm provides too few search agents to explore the feature subset space extensively, making the algorithm prone to local optima and compromising the final CA. With swarm sizes of 60 and 100, the average rankings were 2.57 and 4.48, noticeably inferior to the ranking achieved with 40 members: although a larger swarm explores the solution space thoroughly, the excessive number of search agents hinders effective convergence among members, weakening the ultimate search performance. We therefore chose a swarm size of 40 for the subsequent experiments, which accounts for both the algorithm's global search capability and its convergence performance.
Figure 8 presents the convergence curves of AMFPOA on a selection of FS problems. In most cases, the algorithm begins to exhibit a relatively stable convergence trend by the 70th iteration, and its convergence becomes increasingly stable after the 100th iteration. From the 100th to the 200th iteration, the optimization capability remains essentially unchanged. Therefore, to avoid wasting computational resources, we set the MN of iterations to 100. This configuration ensures that the algorithm converges stably on FS problems while effectively conserving computational resources, so the MN of iterations was fixed at 100 for the subsequent experiments.

4.3. Swarm Diversity Analysis

This section analyzes the PD of AMFPOA when solving FS problems. An algorithm with high PD escapes local-optimum feature subset traps more easily and explores the FS solution space more extensively. We quantify PD with the moment of inertia $I_C$, as defined in Equation (26), which measures the dispersion of the swarm from its centroid $c$ at each iteration; the centroid component $c_d$ is defined in Equation (27). The quantity $x_i^d$ denotes the value of the $d$-th dimension of the $i$-th search agent at the $t$-th iteration [77].
$$I_C(t) = \sum_{i=1}^{N} \sum_{d=1}^{Dim} \left( x_i^d(t) - c_d(t) \right)^2 \quad (26)$$
$$c_d(t) = \frac{1}{N} \sum_{i=1}^{N} x_i^d(t) \quad (27)$$
The experimental results are illustrated in Figure 9, where the blue line shows the PD of AMFPOA on the FS problems and the red line that of POA. As Figure 9 shows, during the early iterations the PD of AMFPOA consistently exceeds that of POA, implying that AMFPOA locates local solution regions within the solution space more effectively during the iterative process and thus has a higher probability of finding the region of the global optimum. This is primarily attributable to the proposed adaptive exploration strategy, which improves the algorithm's GE capability through learning from member disparities. Furthermore, as iterations progress, the PD of AMFPOA remains higher than that of POA even in the later stages, mainly due to the three-swarm search strategy, which balances the GE and LE stages well, enhancing the algorithm's ability to escape locally optimal feature subsets and increasing its PD. In summary, the introduced learning strategies effectively raise the algorithm's PD, strengthening its GE capability and its ability to escape locally optimal feature subsets, which helps improve the CA of the feature subsets and effectively reduces RF in the dataset.
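Equations (26) and (27) amount to only a few lines of NumPy; the centroid is averaged over the $N$ members, and the helper name is ours.

```python
import numpy as np

def inertia_diversity(X):
    """Moment-of-inertia population diversity I_C(t), Eqs. (26)-(27)."""
    c = X.mean(axis=0)            # centroid c_d(t) of the swarm, Eq. (27)
    return np.sum((X - c) ** 2)   # I_C(t), Eq. (26)
```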

4.4. Exploration/Exploitation Balance Analysis

This section analyzes the exploration/exploitation balance of AMFPOA when solving FS problems. A well-performing optimization algorithm should balance these two stages: it first locates potential solution regions through the GE stage and then searches the identified regions more closely through the LE stage, which significantly improves optimization accuracy. We use Equations (28) and (29) to calculate the GE and LE percentages of the algorithm, respectively. Here, $Div(t)$ is the diversity measurement in the dimensional space, computed using Equation (30); $Div_{max}$ denotes the maximum diversity over the entire iterative process, and $median(x^d(t))$ is the median of the $d$-th dimension over the members at the $t$-th iteration [89,90,91].
$$Exploration(\%) = \frac{Div(t)}{Div_{max}} \times 100 \quad (28)$$
$$Exploitation(\%) = \frac{\left| Div(t) - Div_{max} \right|}{Div_{max}} \times 100 \quad (29)$$
$$Div(t) = \frac{1}{Dim} \sum_{d=1}^{Dim} \frac{1}{N} \sum_{i=1}^{N} \left| median(x^d(t)) - x_i^d(t) \right| \quad (30)$$
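A direct transcription of Equations (28)-(30) follows; `div_max` is assumed to be tracked externally as the running maximum of $Div(t)$ over all iterations.

```python
import numpy as np

def balance_percentages(X, div_max):
    """GE/LE percentages from the dimension-wise diversity, Eqs. (28)-(30)."""
    div = np.mean(np.abs(np.median(X, axis=0) - X))    # Div(t), Eq. (30)
    exploration = div / div_max * 100                  # Eq. (28)
    exploitation = abs(div - div_max) / div_max * 100  # Eq. (29)
    return exploration, exploitation
```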
The experimental results are depicted in Figure 10, where the blue line shows the GE rate of AMFPOA and the red line its LE rate. As Figure 10 shows, the GE stage dominates in the early iterations, focusing on locating potentially good solution regions and strengthening the algorithm's ability to explore the whole search space. This is largely due to the proposed adaptive exploration strategy, which enhances GE by adaptively learning from the disparities among members. As the iterations progress, the GE rate gradually declines and reaches a balance with the LE stage; during this phase, AMFPOA searches more closely within the promising regions identified earlier, appropriately improving the CA of feature subsets. This is primarily due to the reasonable balance achieved by the three-swarm search strategy, which applies targeted update schemes to sub-swarms with different attributes. In the later iterations, the LE capability becomes dominant: the algorithm thoroughly exploits the regions identified earlier to improve CA and optimization speed on FS problems, mainly thanks to the Fractional-Order Bernstein exploitation strategy, which strengthens LE by leveraging historical swarm information and the weighted nature of Bernstein polynomials. The algorithm also preserves a certain degree of GE capability late in the run, helping it escape suboptimal feature subset traps. In summary, the proposed strategies enhance the algorithm's GE and LE capabilities from different perspectives and achieve a good balance between the two stages, effectively improving the CA and optimization efficiency of the algorithm on FS problems.

4.5. Ablation Analysis of Strategy Effectiveness

This section conducts ablation experiments to evaluate the effectiveness of the adaptive exploration strategy, the three-swarm search strategy, and the Fractional-Order Bernstein exploitation strategy incorporated in AMFPOA. Specifically, we define APOA as POA with the adaptive exploration strategy, MPOA as POA with the three-swarm search strategy, and FPOA as POA with the Fractional-Order Bernstein exploitation strategy, while AMFPOA incorporates all three learning strategies. These algorithms are applied to the 23 FS problems to assess each strategy's effectiveness. We use the Friedman non-parametric test to compute the algorithms' rankings; the Friedman test is a non-parametric statistical method for comparing performance differences among multiple related samples. The steps for calculating the Friedman rankings are as follows:
Step 1:
Suppose we have $k$ algorithms undergoing efficacy comparison, with each algorithm undergoing $NN$ independent and non-repetitive experiments (in this case, $k = 9$ and $NN = 30$). Record the FFV of each algorithm in each experiment.
Step 2:
For each algorithm $j$ ($j = 1, \ldots, k$), calculate its average rank $R_j$ across all experiments, as explicated in Equation (31).
$R_{j} = \frac{1}{NN} \sum_{i=1}^{NN} r_{i}^{j}$  (31)
Here, $r_{i}^{j}$ represents the rank of algorithm $j$ in the $i$-th experiment (where 1 indicates the best efficacy and $k$ indicates the worst).
Step 3:
Calculate the Friedman statistic $Q$, which is used to assess whether the differences among the algorithms are significant. It is computed as shown in Equation (32).
$Q = \frac{12\,NN}{k(k+1)} \left[ \sum_{j=1}^{k} R_{j}^{2} - \frac{k(k+1)^{2}}{4} \right]$  (32)
Step 4:
Compare the statistic Q with the critical value to determine whether there are significant differences among the algorithms.
Step 5:
Output the Friedman average ranks $R_j$ corresponding to each algorithm and determine whether significant differences exist among the algorithms.
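For reference, Steps 1–3 can be condensed into a short Python sketch; the (NN × k) array layout and the function name are illustrative assumptions.

```python
import numpy as np
from scipy.stats import rankdata

def friedman_ranks(ffv):
    """Friedman average ranks and statistic for an (NN, k) array of FFVs:
    one row per independent run, one column per algorithm (lower FFV is better)."""
    ranks = np.apply_along_axis(rankdata, 1, ffv)  # per-run ranks; rank 1 = best, ties averaged
    R = ranks.mean(axis=0)                         # Equation (31): average rank R_j
    NN, k = ffv.shape
    Q = 12.0 * NN / (k * (k + 1)) * (np.sum(R**2) - k * (k + 1)**2 / 4)  # Equation (32)
    return R, Q
```

The returned $Q$ is then compared against the chi-square critical value with $k - 1$ degrees of freedom, as described in Step 4.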
The experimental results of the Friedman non-parametric test for the FFV of the algorithms are presented in Figure 11. As can be seen from the figure, APOA, MPOA, and FPOA all rank ahead of the primordial POA, confirming that each of the three learning strategies can enhance the algorithm's FS efficacy from a different perspective. Moreover, AMFPOA exhibits a better Friedman ranking than the other algorithms, indicating that introducing all three learning strategies simultaneously into POA yields a further enhancement of FS efficacy compared to introducing any single strategy. These results validate the effectiveness of each learning strategy and show that AMFPOA, which incorporates all three, achieves higher CA and precision.
The above description confirms that the integration of the three learning strategies into the POA can effectively enhance its efficacy in resolving FS problems as a whole. However, the efficacy of each strategy in low-dimensional, medium-dimensional, and high-dimensional FS problems has not been thoroughly validated. Therefore, the following section will provide a detailed discussion on the efficacy improvements of the three enhancements in FS problems of different dimensions. Specifically, by applying APOA, MPOA, and FPOA to resolve 23 FS problems, we analyze the average fitness function value they achieve and summarize their efficacy improvement ratios. The experimental results are presented in Table 5, and the comparative rankings of POA against APOA, MPOA, and FPOA are illustrated in Figure 12.
Firstly, as can be seen from Table 5, when using FPOA to resolve low-dimensional FS problems, its fitness function value improved by 0.53% on the Banana problem compared to POA. Similarly, its efficacy improved by 30.91% on the Iris problem, 9.54% on the Bupa problem, 15.74% on the Glass problem, 14.85% on the Breastcancer problem, 4.17% on the Lipid problem, and 40.23% on the HeartEW problem. By synthesizing the aforementioned experimental results, we found that the introduction of the Fractional-Order Bernstein exploitation strategy into the primordial POA resulted in an average efficacy improvement rate of 14.5%. This is primarily because, when resolving low-dimensional FS problems, the solution space is relatively small, thus requiring the algorithm to possess stronger local exploitation capabilities to effectively explore this solution space and enhance the classification accuracy of the feature subset. The Fractional-Order Bernstein exploitation strategy introduced in this paper, which integrates fractional-order weighted members and Bernstein-weighted members, effectively augments the algorithm’s local exploitation capabilities. The experimental phenomena observed in low-dimensional FS problems confirm this point, validating the improvement effect of the Fractional-Order Bernstein exploitation strategy on local exploitation capabilities when resolving low-dimensional problems. Meanwhile, the advantages of FPOA in low-dimensional FS problems can also be visually observed in Figure 12a.
Secondly, as evident from Table 5, when employing APOA to resolve medium-dimensional FS problems, its fitness function value improved by 7.02% on the Zoo problem compared to POA. Similarly, its efficacy enhanced by 21.32% on the Vote problem, 38.31% on the Congress problem, 18.64% on the Lymphography problem, 13.04% on the Vehicle problem, 22.55% on the WDBC problem, 28.66% on the BreastEW problem, and 67.42% on the SonarEW problem. By synthesizing the aforementioned experimental results, we discovered that the introduction of the adaptive exploration strategy into the primordial POA led to an average efficacy improvement rate of 27.12%. This is primarily because, when addressing medium-dimensional FS problems, as the number of feature elements increases, the search space of the solution region gradually expands, necessitating the algorithm to possess robust global search capabilities to effectively explore the feature subsets. The adaptive exploration strategy introduced in this paper effectively augments the algorithm’s global exploration capabilities by amalgamating knowledge differences among members in the swarm along with adaptability. This enables the algorithm to conduct extensive searches for effective feature subsets as the dimensionality of FS increases, confirming the advantages of the adaptive exploration strategy in improving the algorithm’s global exploration efficacy. Additionally, the advantages of APOA in medium-dimensional FS problems can be visually observed in Figure 12b.
Finally, as can be seen from Table 5, when utilizing MPOA to resolve high-dimensional FS problems, its fitness function value improved by 16.48% on the Libras problem compared to POA. Similarly, its efficacy enhanced by 18.18% on the Hillvalley problem, 24.59% on the Musk problem, 47.34% on the Clean problem, 45.17% on the Semeion problem, 65.64% on the Madelon problem, and 56.71% on the Isolet problem. By synthesizing the aforementioned experimental results, we found that the introduction of the multi-swarm search strategy into the primordial POA resulted in an average efficacy improvement rate of 39.16%, indicating a notably significant enhancement. This is primarily because, when addressing high-dimensional FS problems, the substantial increase in feature elements renders the search space of the solution region exceedingly complex. Consequently, the algorithm may struggle to achieve a favorable trade-off across multiple metrics, leading to the obtained feature subsets being prone to local optima. Therefore, the algorithm is required to possess a strong ability to escape from the trap of local ideal feature subsets, enabling the obtained feature subsets to achieve a better balance across various metrics. The multi-swarm search strategy introduced in this paper integrates swarms with diverse characteristics, allowing different subswarms to undergo targeted update guidance. This effectively strengthens the algorithm’s ability to balance across multiple metrics and augments its capacity to escape from local optima. The experimental results also corroborate this point, demonstrating significant efficacy improvements when resolving high-dimensional FS problems. Additionally, the advantages of MPOA in high-dimensional FS problems can be visually observed in Figure 12c.

4.6. Fitness Function Value Analysis

To evaluate the efficacy of the AMFPOA in resolving FS problems, this section analyzes the FFV obtained by AMFPOA when reducing dimensionality across 23 FS datasets spanning low-dimensional, medium-dimensional, and high-dimensional scenarios. The results are compared with eight state-of-the-art algorithms, with experimental outcomes summarized in Table 6. In Table 6, “MIN”, “AVG”, and “MAX” denote the minimum, average, and maximum FFV obtained by each algorithm over 30 independent non-repetitive trials, respectively. “Mean Rank” represents the algorithm’s average efficacy ranking, while “Final Rank” indicates its ultimate ranking based on the “Mean Rank” metric. Below, we provide a detailed analysis of the obtained FFV.
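As a minimal sketch of how these summary statistics can be derived from the raw runs (the array layout and all names are illustrative assumptions, not the original evaluation code):

```python
import numpy as np
from scipy.stats import rankdata

def summarize(results):
    """results: (n_datasets, n_algorithms, 30) array of FFVs over 30 independent trials."""
    stats = {
        "MIN": results.min(axis=2),   # best FFV per dataset and algorithm
        "AVG": results.mean(axis=2),  # average FFV over the 30 trials
        "MAX": results.max(axis=2),   # worst FFV over the 30 trials
    }
    # "Mean Rank": rank the algorithms on each dataset (rank 1 = lowest FFV),
    # then average the ranks over all datasets.
    mean_rank = {m: rankdata(v, axis=1).mean(axis=0) for m, v in stats.items()}
    # "Final Rank": order the algorithms by their Mean Rank.
    final_rank = {m: rankdata(r) for m, r in mean_rank.items()}
    return stats, mean_rank, final_rank
```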
As shown in Table 6, when resolving low-dimensional FS problems, AMFPOA attained first place in the minimum FFV metric across six FS datasets, with a success rate of 75%, outperforming the second-ranked ALSHADE by 37.5%. This result demonstrates AMFPOA’s superior capability in locating ideal feature subsets for low-dimensional FS problems, primarily due to the Fractional-Order Bernstein exploitation strategy introduced in this study, which effectively augments the algorithm’s LE efficacy and improves optimization precision. Additionally, in terms of the average FFV, AMFPOA ranked first in eight FS datasets for dimensionality reduction, achieving a 100% success rate over the comparative algorithms. This indicates AMFPOA’s superior solution stability when addressing low-dimensional FS problems. To visually illustrate this stability, Figure 13 presents box plots of the algorithm’s efficacy across 30 independent runs, where “+” marks outliers. The figure reveals that AMFPOA consistently exhibits narrower box widths, reflecting higher solution stability under most conditions. Finally, in the maximum FFV metric, AMFPOA secured first place in seven FS datasets, with an 87.5% success rate, demonstrating its enhanced fault tolerance and reduced error rates in FS problem-resolving. In summary, compared to the benchmark algorithms, AMFPOA exhibits superior optimization efficacy and higher solution stability when resolving low-dimensional FS problems.
As shown in Table 6, when resolving medium-dimensional FS problems, AMFPOA attained first place in the minimum FFV metric across seven FS datasets, with a success rate of 87.5%, demonstrating superior redundancy reduction capability compared to benchmark algorithms. This improvement is primarily attributed to the adaptive exploration strategy advanced in this study, which effectively augments the algorithm’s global search capability, enabling AMFPOA to efficiently explore multiple solution regions and thereby improve the quality of selected FS subsets. Additionally, in terms of the average FFV, AMFPOA ranked first in eight FS datasets for dimensionality reduction, achieving a 100% success rate over comparative algorithms. This outstanding efficacy reflects the enhanced stability of AMFPOA due to its optimized search strategy. As illustrated in Figure 13, AMFPOA consistently exhibits higher solution stability under most conditions, indicating its strong practical applicability. Finally, in the maximum FFV metric, AMFPOA secured first place in five FS datasets, with a 62.5% success rate, outperforming the second-ranked algorithm by approximately 50%. This demonstrates AMFPOA’s superior fault tolerance and improved robustness in FS problem-resolving. In summary, compared to benchmark algorithms, AMFPOA exhibits higher optimization efficacy and greater solution stability when addressing medium-dimensional FS problems, making it a promising FS method for real-world applications.
As shown in Table 6, when addressing high-dimensional FS problems, AMFPOA attained first place in the minimum FFV metric across six FS datasets, with a success rate of 85.7%. This demonstrates its superior capability in efficiently exploring subset combinations compared to benchmark algorithms. This improvement is primarily attributed to the three-swarm search strategy advanced in this study, which effectively augments the algorithm’s ability to escape local non-ideal feature subset traps. Consequently, AMFPOA can better tackle the challenges posed by high-dimensional FS problems, improving CA and enhancing dataset utility. Additionally, in terms of the average FFV, AMFPOA ranked first in seven FS datasets for dimensionality reduction, achieving a 100% success rate over comparative algorithms. This reflects its higher solution stability when resolving high-dimensional FS problems. As illustrated in Figure 13, AMFPOA consistently exhibits greater solution stability under most conditions, ensuring its practical applicability to a certain extent. Finally, in the maximum FFV metric, AMFPOA secured first place in five FS datasets, with a 71.4% success rate, demonstrating a clear advantage over benchmark algorithms and showcasing its stronger fault tolerance in problem-resolving. In summary, compared to benchmark algorithms, AMFPOA exhibits higher optimization efficacy and greater solution stability when addressing high-dimensional FS problems, making it a promising FS method for real-world applications.
The aforementioned analysis of FFV confirms that AMFPOA is an algorithm with robust FS efficacy. However, it must be acknowledged that AMFPOA underperforms compared to benchmark algorithms on certain specific FS datasets, indicating room for further improvement in its efficacy. From a comprehensive perspective, Figure 14 illustrates the average ranking of algorithms across 23 FS datasets. Notably, AMFPOA achieves the lowest bar heights in terms of minimum, average, and maximum FFV, demonstrating its strong overall efficacy. These findings suggest that AMFPOA can be considered a promising FS method when evaluated holistically.

4.7. Wilcoxon Rank Sum Test Analysis

The aforementioned analysis delved into the numerical results of the AMFPOA when resolving FS problems. However, numerical results may occasionally be skewed by outliers, potentially impacting the overall findings. To mitigate this randomness, this section presents a Wilcoxon non-parametric test conducted on the results obtained from 30 independent and non-repetitive experiments, with the significance level set at 0.05. The experimental results are summarized in Table 7. In the last row of the table, “+” indicates that the efficacy of the compared algorithm is significantly superior to that of the AMFPOA, “−” signifies that the compared algorithm’s efficacy is significantly inferior to that of the AMFPOA, and “=” signifies that there is no significant difference in efficacy between the compared algorithm and the AMFPOA. As can be seen from Table 7, when addressing 23 FS problems encompassing low-dimensional, medium-dimensional, and high-dimensional scenarios, the POA, ALSHADE, PLO, LSHADE, BEGJO, and QHDBO algorithms exhibit significantly inferior efficacy compared to the AMFPOA on 22 of these FS problems, representing a ratio of 95.6%. Meanwhile, the IPOA and MCOA algorithms also demonstrate significantly worse efficacy than the AMFPOA across all 23 FS problems. Moreover, out of the 184 tests conducted, the AMFPOA significantly outperforms the competing algorithms in 97.2% of the cases. This evidence confirms that the AMFPOA possesses superior FS efficacy. It also validates that the integration of the three learning strategies advanced in this paper effectively augments the efficacy of the POA.
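A minimal sketch of this per-dataset comparison follows; the function name and the use of medians to orient the “+”/“−” labels are illustrative assumptions, since the paper does not state its sign convention.

```python
import numpy as np
from scipy.stats import ranksums

def wilcoxon_label(rival_runs, amfpoa_runs, alpha=0.05):
    """Compare 30 FFVs of a rival against 30 FFVs of AMFPOA (lower is better).
    Returns '+' (rival significantly better), '-' (significantly worse), or '='."""
    _, p = ranksums(rival_runs, amfpoa_runs)  # two-sided Wilcoxon rank sum test
    if p >= alpha:
        return "="                            # no significant difference at the 0.05 level
    return "+" if np.median(rival_runs) < np.median(amfpoa_runs) else "-"
```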

4.8. CA Analysis

CA is a critical metric in FS problems, as it directly reflects the quality of the selected feature subsets. This section primarily analyzes the CA attained by AMFPOA when resolving FS problems. The experimental results are presented in Table 8, where “Mean Rank” signifies the algorithm’s average ranking in terms of CA, and “Final Rank” represents its ultimate ranking based on the “Mean Rank” metric.
As demonstrated in Table 8, when addressing low-dimensional FS problems, AMFPOA secured first place in seven FS problems, achieving the highest success rate compared to benchmark algorithms. This indicates that AMFPOA effectively eliminates RF while enhancing the utility of primordial low-dimensional datasets. This improvement is primarily attributed to the advanced strategy’s enhancement of LE capability, enabling higher optimization precision in low-dimensional FS problem-resolving. Additionally, for medium-dimensional FS problems, AMFPOA attained the highest CA in seven FS problems, with a success rate of 87.5%. This success stems from the advanced learning strategy, which facilitates comprehensive exploration of solution spaces, allowing AMFPOA to maintain efficient search efficacy despite increasing dataset dimensionality, thereby improving data utility. Furthermore, in high-dimensional FS problems, AMFPOA attained the highest CA in six FS problems, with a success rate of 85.7%. This is largely due to the three-swarm search strategy, which improves algorithmic balance during solution exploration, enabling easier escape from local feature subset traps caused by extreme dimensionality increases and yielding higher-quality feature subsets for primordial dataset representation. Figure 15 illustrates the average CA rankings of algorithms across 23 FS datasets, showing that AMFPOA consistently achieves lower average rankings compared to benchmark algorithms. In summary, the three learning strategies advanced in this study significantly enhance AMFPOA’s search efficacy, enabling superior FS problem-resolving capabilities relative to benchmark algorithms. By effectively improving CA, AMFPOA demonstrates strong potential as a promising FS method.

4.9. Feature Subset Size Analysis

This section primarily analyzes feature subset size, another critical metric in FS problems. The experimental results are presented in Table 9, where “Mean Rank” signifies the algorithm’s average ranking in terms of feature subset size across 23 FS problems, and “Final Rank” represents the algorithm’s overall ranking based on the “Mean Rank” metric. A detailed analysis of the experimental results follows.
As shown in Table 9, when addressing low-dimensional FS problems, all algorithms effectively reduced the number of feature elements. Notably, AMFPOA secured first place in six FS problems, achieving a success rate of 75%, demonstrating superior RF elimination capability compared to benchmark algorithms. This advantage primarily stems from AMFPOA’s enhanced LE capability, enabling effective removal of noisy features. Additionally, for medium-dimensional FS problems, AMFPOA ranked first in five FS problems, with a success rate of 62.5%, outperforming the second-ranked QHDBO by 25%. This efficacy highlights AMFPOA’s superior RF elimination in medium-dimensional scenarios and validates the advanced adaptive exploration strategy, which improves global search efficacy by optimizing feature subsets across the entire solution space, thereby enhancing dataset utility. Furthermore, in high-dimensional FS problems, AMFPOA attained first place in three FS problems (42.8%) and second place in four FS problems, exhibiting stronger feature reduction capability than benchmark algorithms. This confirms that AMFPOA’s three-swarm search strategy enables comprehensive consideration of multiple metrics during exploration, effectively eliminating RF and further improving dataset utility and model reliability. To demonstrate AMFPOA’s overall efficacy, Figure 16 illustrates the algorithm’s average ranking across 23 FS problems, showing that AMFPOA consistently achieves lower average rankings, indicating strong feature dimensionality reduction capability. In summary, the integration of learning strategies in AMFPOA establishes it as a highly efficient FS method, capable of effectively reducing RF in datasets and enhancing the utility of data models.

4.10. Convergence Analysis

The aforementioned discussions have substantiated that the AMFPOA exhibits favorable FS efficacy. However, in practical applications, the convergence efficacy of an algorithm is also of critical importance. An appropriate algorithm should not only possess high convergence accuracy but also demonstrate a satisfactory convergence speed. Therefore, in this section, we conduct an analysis of the convergence curves of the AMFPOA to evaluate its convergence efficacy. The experimental results are presented in Figure 17. Here, the x-axis represents the number of iterations, while the y-axis represents the average FFV.
As can be observed from Figure 17, most algorithms are in a state of stable convergence, making it meaningful to analyze the convergence curves under such stable conditions. In the majority of cases, for instance, when resolving FS problems on datasets like Glass, Breastcancer, Lipid, Vote, and Vehicle, the AMFPOA takes the lead in efficacy after approximately the 20th iteration, demonstrating rapid convergence speed and high convergence accuracy. In a small number of cases, such as when addressing FS problems on the HeartEW, Zoo, and Congress datasets, the AMFPOA achieves a leading position in terms of FFV after around the 40th iteration. This reflects the algorithm’s superior global search capabilities, which enhance its convergence speed as the problem dimensionality increases. As the iteration process progresses, the AMFPOA consistently maintains its leading edge while gradually approaching a state of stable convergence. These advantages are primarily attributable to the three learning strategies advanced in this paper, which effectively strengthen the algorithm’s search ability. Consequently, the AMFPOA attains faster convergence speed and higher convergence accuracy, exhibiting a certain degree of algorithmic applicability.

4.11. Comprehensive Metric Analysis

The preceding sections conducted separate analyses of AMFPOA’s efficacy on FS problems across three key metrics: FFV, CA, and feature subset size. These analyses confirmed AMFPOA’s superior feature subset search capability, demonstrating its effectiveness in eliminating RF from raw datasets while significantly improving CA. To evaluate AMFPOA’s efficacy more comprehensively, this section integrates these three metrics into a multi-indicator analysis, visualized through the stacked bar chart shown in Figure 18. As illustrated, AMFPOA exhibits the lowest stacked height across all five aggregated indicators, confirming its strong ability to balance multiple efficacy metrics. This balanced optimization enables AMFPOA to achieve superior overall FS efficacy, establishing it as a promising FS method.

4.12. Extended Experimental Analysis

This section aims to explore the efficacy of the AMFPOA when addressing FS problems with tens of thousands of dimensions. To this end, we employ seven datasets, each containing more than ten thousand features, for experimental evaluation. The specific details of these datasets are presented in Table 10. As before, each evaluation experiment is conducted 30 times independently and non-repetitively to gather statistics on various metrics, including the ideal FFV, the average FFV, and the worst FFV. The experimental results are displayed in Table 11. Here, “Mean Rank” represents the average ranking of an algorithm’s FFV across the seven FS problems, while “Final Rank” indicates the algorithm’s ultimate ranking based on the “Mean Rank” metric.
As can be observed from the table, with regard to the ideal FFV metric, the AMFPOA ranks first across all seven datasets, demonstrating its remarkable ability to search for ideal feature subsets. This also validates that the learning strategies advanced in this paper can still exert a certain positive influence on the algorithm’s efficacy even when the feature dimensionality is extremely high. Moreover, concerning the average FFV metric, the AMFPOA achieves the best values on all seven listed datasets. This showcases the algorithm’s high solution stability. As the feature dimensionality increases, the AMFPOA can still maintain its solution stability. Finally, in terms of the worst FFV metric, the AMFPOA attains the ideal values on five datasets, with a success rate of 71.4%. This indicates that it has better fault-tolerance compared to the competing algorithms. Additionally, to provide a more intuitive demonstration of the AMFPOA’s efficacy in resolving ultra-high-dimensional FS problems, Figure 19 presents the average rankings of the algorithm across the three metrics. It is clearly evident that the AMFPOA significantly outperforms the competing algorithms in all three metrics. This also reflects that the AMFPOA has better applicability and holds advantages in resolving ultra-high-dimensional datasets compared to the competing algorithms.

5. Conclusions and Future Works

This study addresses the limitations of the POA in resolving FS problems, specifically its deficiencies in GE and LE and its imbalance between exploration and exploitation, which compromise CA and dataset utility. To this end, an enhanced POA variant termed AMFPOA is advanced by amalgamating three complementary strategies to improve FS efficacy and subset CA. First, to address POA’s insufficient GE, an adaptive exploration strategy is introduced, which augments GE capability by considering knowledge disparities among different groups of members and incorporating adaptive learning principles, enabling thorough exploration of the feature subset combination space and improving CA. Second, to resolve the imbalance between GE and LE, a three-swarm search strategy is advanced, which divides the swarm into multiple subswarms and applies tailored optimization strategies to each, achieving superior multi-metric balance in FS problem-resolving and optimizing feature subset quality. Third, to strengthen POA’s LE capability, a Fractional-Order Bernstein exploitation strategy is developed, which leverages the historical memory properties of Fractional-Order theory and the multi-member weighting of Bernstein polynomials to significantly enhance LE, yielding higher CA in FS tasks. Experimental results on 23 real-world FS problems demonstrate that AMFPOA achieves an average success rate of over 87.5% in FFV, along with ideal efficacy rates of 86.5% in CA and 60.1% in feature subset size reduction, exhibiting superior FS efficacy compared to benchmark algorithms and establishing itself as a promising FS method.
Based on the work presented in this paper, our follow-up research plans mainly revolve around the following two aspects: (1) The AMFPOA advanced in this paper is a single-objective optimization algorithm. In our subsequent work, we will introduce its corresponding multi-objective version to further expand the algorithm’s application domains. (2) We will apply AMFPOA to resolve more challenging combinatorial optimization problems to further evaluate its efficacy in combinatorial optimization.

Author Contributions

Conceptualization, L.X.; methodology, L.X. and J.L.; software, L.X.; validation, L.X., J.L. and Y.Y.; formal analysis, L.X. and J.L.; investigation, Y.Y.; resources, L.X.; data curation, L.X., J.L. and Y.Y.; writing—original draft preparation, L.X. and J.L.; writing—review and editing, Y.Y.; visualization, L.X.; supervision, Y.Y.; project administration, Y.Y.; funding acquisition, Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All experimental data can be obtained by contacting the corresponding author.

Acknowledgments

We greatly appreciate the efforts of the reviewers and the editorial team in handling our article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gomes, B.; Ashley, E.A. Artificial intelligence in molecular medicine. N. Engl. J. Med. 2023, 388, 2456–2465. [Google Scholar] [CrossRef] [PubMed]
  2. Xu, Q.; Cai, J.R.; Zhang, W.; Bai, J.W.; Li, Z.Q.; Tan, B.; Sun, L. Detection of citrus Huanglongbing (HLB) based on the HLB-induced leaf starch accumulation using a home-made computer vision system. Biosyst. Eng. 2022, 218, 163–174. [Google Scholar] [CrossRef]
  3. Wang, X.; Jiang, H.; Zeng, T.; Dong, Y. An adaptive fused domain-cycling variational generative adversarial network for machine fault diagnosis under data scarcity. Inf. Fusion 2025, 126, 103616. [Google Scholar] [CrossRef]
  4. Wu, Q.; Gu, J. Design and research of robot visual servo system based on artificial intelligence. Agro Food Ind. Hi-Tech 2017, 28, 125–128. [Google Scholar]
  5. Chen, J.; Zhang, M.; Xu, B.; Sun, J.; Mujumdar, A.S. Artificial intelligence assisted technologies for controlling the drying of fruits and vegetables using physical fields: A review. Trends Food Sci. Technol. 2020, 105, 251–260. [Google Scholar] [CrossRef]
  6. Zhu, C.; Hao, S.; Liu, C.; Wang, Y.; Jia, X.; Xu, J.; Wang, W. An Efficient Computer Vision-Based Dual-Face Target Precision Variable Spraying Robotic System for Foliar Fertilisers. Agronomy 2024, 14, 2770. [Google Scholar] [CrossRef]
  7. Zhang, L.; Liao, B.; Liu, D.; Jiang, Q.; Sun, Q. Artificial Intelligence empowered evolution in medicine food homology: Innovations, Challenges, and Future Prospects. Food Biosci. 2025, 69, 106928. [Google Scholar] [CrossRef]
  8. Li, H.; Geng, W.; Hassan, M.M.; Zuo, M.; Wei, W.; Wu, X.; Ouyang, Q.; Chen, Q. Rapid detection of chloramphenicol in food using SERS flexible sensor coupled artificial intelligent tools. Food Control 2021, 128, 108186. [Google Scholar] [CrossRef]
  9. El-Mesery, H.S.; Qenawy, M.; Ali, M.; Hu, Z.; Adelusi, O.A.; Njobeh, P.B. Artificial intelligence as a tool for predicting the quality attributes of garlic (Allium sativum L.) slices during continuous infrared-assisted hot air drying. J. Food Sci. 2024, 89, 7693–7712. [Google Scholar] [CrossRef] [PubMed]
  10. El-Mesery, H.S.; Qenawy, M.; Ali, M.; Rostom, M.; Elbeltagi, A.; Salem, A.; Elwakeel, A.E. Optimization of dried garlic physicochemical properties using a self-organizing map and the development of an artificial intelligence prediction model. Sci. Rep. 2025, 15, 3105. [Google Scholar] [CrossRef]
  11. Chen, Q.; Hu, W.; Su, J.; Li, H.; Ouyang, Q.; Zhao, J. Nondestructively sensing of total viable count (TVC) in chicken using an artificial olfaction system based colorimetric sensor array. J. Food Eng. 2016, 168, 259–266. [Google Scholar] [CrossRef]
  12. Li, H.; Kutsanedzie, F.; Zhao, J.; Chen, Q. Quantifying total viable count in pork meat using combined hyperspectral imaging and artificial olfaction techniques. Food Anal. Methods 2016, 9, 3015–3024. [Google Scholar] [CrossRef]
  13. Li, L.; Xie, S.; Zhu, F.; Ning, J.; Chen, Q.; Zhang, Z. Colorimetric sensor array-based artificial olfactory system for sensing Chinese green tea’s quality: A method of fabrication. Int. J. Food Prop. 2017, 20 (Suppl. 2), 1762–1773. [Google Scholar] [CrossRef]
  14. Li, H.; Zhang, B.; Hu, W.; Liu, Y.; Dong, C.; Chen, Q. Monitoring black tea fermentation using a colorimetric sensor array-based artificial olfaction system. J. Food Process. Preserv. 2018, 42, e13348. [Google Scholar] [CrossRef]
  15. Bai, J.W.; Xiao, H.W.; Ma, H.L.; Zhou, C.S. Artificial neural network modeling of drying kinetics and color changes of ginkgo biloba seeds during microwave drying process. J. Food Qual. 2018, 2018, 3278595. [Google Scholar] [CrossRef]
  16. Dai, C.; Huang, X.; Huang, D.; Lv, R.; Sun, J.; Zhang, Z.; Aheto, J.H. Real-time detection of saponin content during the fermentation process of Tremella aurantialba using a homemade artificial olfaction system. J. Food Process Eng. 2019, 42, e13101. [Google Scholar] [CrossRef]
  17. Yunfeng, X.; Xiliang, Z.; Xiaojia, S.; Jizhang, W.; Jizhan, L.; Zhiguo, L.; Pingping, L. Tensile mechanical properties of greenhouse cucumber cane. Int. J. Agric. Biol. Eng. 2016, 9, 1–8. [Google Scholar]
  18. Jin, Y.; Liu, J.; Xu, Z.; Yuan, S.; Li, P.; Wang, J. Development status and trend of agricultural robot technology. Int. J. Agric. Biol. Eng. 2021, 14, 1–19. [Google Scholar] [CrossRef]
  19. Yüksel, N.; Börklü, H.R.; Sezer, H.K.; Canyurt, O.E. Review of artificial intelligence applications in engineering design perspective. Eng. Appl. Artif. Intell. 2023, 118, 105697. [Google Scholar] [CrossRef]
  20. Liu, S.; Li, H.; Hassan, M.M.; Ali, S.; Chen, Q. SERS based artificial peroxidase enzyme regulated multiple signal amplified system for quantitative detection of foodborne pathogens. Food Control 2021, 123, 107733. [Google Scholar] [CrossRef]
  21. Zareef, M.; Arslan, M.; Hassan, M.M.; Ahmad, W.; Ali, S.; Li, H.; Ouyang, Q.; Wu, X.; Hashim, M.M.; Chen, Q. Recent advances in assessing qualitative and quantitative aspects of cereals using nondestructive techniques: A review. Trends Food Sci. Technol. 2021, 116, 815–828. [Google Scholar] [CrossRef]
  22. Xu, Y.; Hassan, M.M.; Sharma, A.S.; Li, H.; Chen, Q. Recent advancement in nano-optical strategies for detection of pathogenic bacteria and their metabolites in food safety. Crit. Rev. Food Sci. Nutr. 2023, 63, 486–504. [Google Scholar] [CrossRef]
  23. Wang, H.; Gu, J.; Wang, M. A review on the application of computer vision and machine learning in the tea industry. Front. Sustain. Food Syst. 2023, 7, 1172543. [Google Scholar] [CrossRef]
  24. Liang, Z.; Wada, M.E. Development of cleaning systems for combine harvesters: A review. Biosyst. Eng. 2023, 236, 79–102. [Google Scholar] [CrossRef]
  25. El-Mesery, H.S.; Qenawy, M.; Li, J.; El-Sharkawy, M.; Du, D. Predictive modeling of garlic quality in hybrid infrared-convective drying using artificial neural networks. Food Bioprod. Process. 2024, 145, 226–238. [Google Scholar] [CrossRef]
  26. Jing, T.; Tang, Z.; Hao, S.; Shen, C.; Wang, T.; Wang, M. Structure design and rice threshing efficacy of the variable-speed inertial pulley for simulating artificial threshing. Int. J. Agric. Biol. Eng. 2024, 17, 33–40. [Google Scholar]
  27. Ngolong Ngea, G.L.; Yang, Q.; Xu, M.; Ianiri, G.; Dhanasekaran, S.; Zhang, X.; Bi, Y.; Zhang, H. Revisiting the current and emerging concepts of postharvest fresh fruit and vegetable pathology for next-generation antifungal technologies. Compr. Rev. Food Sci. Food Saf. 2024, 23, e13397. [Google Scholar] [CrossRef]
  28. Lakhiar, I.A.; Yan, H.; Zhang, C.; Wang, G.; He, B.; Hao, B.; Han, Y.; Wang, B.; Bao, R.; Rakibuzzaman, M.; et al. A review of precision irrigation water-saving technology under changing climate for enhancing water use efficiency, crop yield, and environmental footprints. Agriculture 2024, 14, 1141. [Google Scholar] [CrossRef]
  29. Cao, Y.; Tang, Z.; Lu, D.; Lin, S. Efficacy Test of Artificial Defoliating Broccoli Conveyor Line and Analysis of Defoliating Broccoli Inflorescences. Agronomy 2024, 14, 1925. [Google Scholar] [CrossRef]
  30. Zhao, S.; Adade, S.Y.S.S.; Wang, Z.; Jiao, T.; Ouyang, Q.; Li, H.; Chen, Q. Deep learning and feature reconstruction assisted vis-NIR calibration method for on-line monitoring of key growth indicators during kombucha production. Food Chem. 2025, 463, 141411. [Google Scholar] [CrossRef]
  31. Ji, T.; Liaqat, F.; Khazi, M.I.; Liaqat, N.; Nawaz, M.Z.; Zhu, D. Lignin biotransformation: Advances in enzymatic valorization and bioproduction strategies. Ind. Crops Prod. 2024, 216, 118759. [Google Scholar] [CrossRef]
  32. Bai, J.; Mujumdar, A.S.; Xiao, H. Ethical and strategic challenges of AI weapons: A call for global action. Int. J. Agric. Biol. Eng. 2024, 17, 293–294. [Google Scholar] [CrossRef]
  33. Wu, P.; Lei, X.; Zeng, J.; Qi, Y.; Yuan, Q.; Huang, W.; Ma, Z.; Shen, Q.; Lyu, X. Research progress in mechanized and intelligentized pollination technologies for fruit and vegetable crops. Int. J. Agric. Biol. Eng. 2024, 17, 11–21. [Google Scholar]
  34. Deng, J.; Jiang, H.; Chen, Q. Enhancing Fourier Transform Near-infrared Spectroscopy with Explainable Ensemble Learning Methods for Detecting Mineral Oil Contamination in Corn Oil. J. Food Compos. Anal. 2025, 143, 107594. [Google Scholar] [CrossRef]
  35. Jan, Z.; Ahamed, F.; Mayer, W.; Patel, N.; Grossmann, G.; Stumptner, M.; Kuusk, A. Artificial intelligence for industry 4.0: Systematic review of applications, challenges, and opportunities. Expert Syst. Appl. 2023, 216, 119456. [Google Scholar] [CrossRef]
  36. Taha, M.F.; Mao, H.; Zhang, Z.; Elmasry, G.; Awad, M.A.; Abdalla, A.; Mousa, S.; Elwakeel, A.E.; Elsherbiny, O. Emerging technologies for precision crop management towards agriculture 5.0: A comprehensive overview. Agriculture 2025, 15, 582. [Google Scholar] [CrossRef]
  37. Jayan, H.; Min, W.; Guo, Z. Applications of Artificial Intelligence in Food Industry. Foods 2025, 14, 1241. [Google Scholar] [CrossRef]
  38. Li, D.; Chen, Q.; Ouyang, Q.; Liu, Z. Advances of Vis/NIRS and imaging techniques assisted by AI for tea processing. Crit. Rev. Food Sci. Nutr. 2025, 1–19. [Google Scholar] [CrossRef]
  39. Wang, Y.; Zhang, Z.; Jia, W.; Ou, M.; Dong, X.; Dai, S. A review of environmental sensing technologies for targeted spraying in orchards. Horticulturae 2025, 11, 551. [Google Scholar] [CrossRef]
  40. Guan, B.; Zhao, J.; Jin, H.; Lin, H. Determination of rice storage time with colorimetric sensor array. Food Anal. Methods 2017, 10, 1054–1062. [Google Scholar] [CrossRef]
  41. Xu, Y.; Hassan, M.M.; Kutsanedzie, F.Y.H.; Li, H.H.; Chen, Q.S. Evaluation of extra-virgin olive oil adulteration using FTIR spectroscopy combined with multivariate algorithms. Qual. Assur. Saf. Crops Foods 2018, 10, 411–421. [Google Scholar] [CrossRef]
  42. Zhao, Z.; Jin, M.; Tian, C.; Yang, S.X. Prediction of seed distribution in rectangular vibrating tray using grey model and artificial neural network. Biosyst. Eng. 2018, 175, 194–205. [Google Scholar] [CrossRef]
  43. Xu, Y.; Chen, Q.; Liu, Y.; Sun, X.; Huang, Q.; Ouyang, Q.; Zhao, J. A novel hyperspectral microscopic imaging system for evaluating fresh degree of pork. Korean J. Food Sci. Anim. Resour. 2018, 38, 362. [Google Scholar]
  44. Chen, H.; Chen, B.; Lu, D. A novel method for detection of camellia oil adulteration based on time-resolved emission fluorescence. Sci. Rep. 2018, 8, 13784. [Google Scholar] [CrossRef]
  45. Jia, W.; Zheng, Y.; Zhao, D.A.; Yin, X.; Liu, X.; Du, R. Preprocessing method of night vision image application in apple harvesting robot. Int. J. Agric. Biol. Eng. 2018, 11, 158–163. [Google Scholar] [CrossRef]
  46. Li, Y.; Sun, J.; Wu, X.; Lu, B.; Wu, M.; Dai, C. Grade identification of tieguanyin tea using fluorescence hyperspectra and different statistical algorithms. J. Food Sci. 2019, 84, 2234–2241. [Google Scholar] [CrossRef]
  47. Li, Y.; Sun, J.; Wu, X.; Chen, Q.; Lu, B.; Dai, C. Detection of viability of soybean seed based on fluorescence hyperspectra and CARS-SVM-AdaBoost model. J. Food Process. Preserv. 2019, 43, e14238. [Google Scholar] [CrossRef]
  48. Wu, X.; Zhu, J.; Wu, B.; Zhao, C.; Sun, J.; Dai, C. Discrimination of Chinese liquors based on electronic nose and fuzzy discriminant principal component analysis. Foods 2019, 8, 38. [Google Scholar] [CrossRef] [PubMed]
  49. Entezari, A.; Aslani, A.; Zahedi, R.; Noorollahi, Y. Artificial intelligence and machine learning in energy systems: A bibliographic perspective. Energy Strategy Rev. 2023, 45, 101017. [Google Scholar] [CrossRef]
  50. Jia, W.; Sun, M.; Lian, J.; Hou, S. Feature dimensionality reduction: A review. Complex Intell. Syst. 2022, 8, 2663–2693. [Google Scholar] [CrossRef]
  51. Zhang, J.; Yu, J.; Tao, D. Local deep-feature alignment for unsupervised dimension reduction. IEEE Trans. Image Process. 2018, 27, 2420–2432. [Google Scholar] [CrossRef] [PubMed]
  52. Abdollahzadeh, B.; Gharehchopogh, F.S. A multi-objective optimization algorithm for FS problems. Eng. Comput. 2022, 38, 1845–1863. [Google Scholar] [CrossRef]
  53. Hashim, F.A.; Hussien, A.G. Snake Optimizer: A novel meta-heuristic optimization algorithm. Knowl.-Based Syst. 2022, 242, 108320. [Google Scholar] [CrossRef]
  54. Zervoudakis, K.; Tsafarakis, S. A global optimizer inspired from the survival strategies of flying foxes. Eng. Comput. 2023, 39, 1583–1616. [Google Scholar] [CrossRef]
  55. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef]
  56. Abdollahzadeh, B.; Gharehchopogh, F.S.; Mirjalili, S. African vultures optimization algorithm: A new nature-inspired metaheuristic algorithm for global optimization problems. Comput. Ind. Eng. 2021, 158, 107408. [Google Scholar] [CrossRef]
  57. Braik, M.; Ryalat, M.H.; Al-Zoubi, H. A novel meta-heuristic algorithm for resolving numerical optimization problems: Ali Baba and the forty thieves. Neural Comput. Appl. 2022, 34, 409–455. [Google Scholar] [CrossRef]
  58. Matoušová, I.; Trojovský, P.; Dehghani, M.; Trojovská, E.; Kostra, J. Mother optimization algorithm: A new human-based metaheuristic approach for resolving engineering optimization. Sci. Rep. 2023, 13, 10312. [Google Scholar] [CrossRef]
  59. Dehghani, M.; Trojovský, P. Teamwork Optimization Algorithm: A New Optimization Approach for Function Minimization/Maximization. Sensors 2021, 21, 4567. [Google Scholar] [CrossRef]
  60. Faramarzi, A.; Heidarinejad, M.; Stephens, B.; Mirjalili, S. Equilibrium optimizer: A novel optimization algorithm. Knowl.-Based Syst. 2020, 191, 105190. [Google Scholar] [CrossRef]
  61. Hashim, F.A.; Hussain, K.; Houssein, E.H.; Mabrouk, M.S.; Al-Atabany, W. Archimedes optimization algorithm: A new metaheuristic algorithm for resolving optimization problems. Appl. Intell. 2021, 51, 1531–1551. [Google Scholar] [CrossRef]
  62. Hatamlou, A. Black hole: A new heuristic optimization approach for data clustering. Inf. Sci. 2013, 222, 175–184. [Google Scholar] [CrossRef]
  63. Goldberg, D.E.; Holland, J.H. Genetic Algorithms and Machine Learning. Mach. Learn. 1988, 3, 95–99. [Google Scholar] [CrossRef]
  64. Reynolds, R.G. An Introduction to Cultural Algorithms. In Proceedings of the Third Annual Conference on Evolutionary Programming, San Diego, CA, USA, 24–26 February 1994; World Scientific: Singapore, 1994; pp. 131–139. [Google Scholar]
  65. Storn, R.; Price, K. Differential evolution—A simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  66. Hichem, H.; Elkamel, M.; Rafik, M.; Mesaaoud, M.T.; Ouahiba, C. A new binary grasshopper optimization algorithm for FS problem. J. King Saud Univ.-Comput. Inf. Sci. 2022, 34, 316–328. [Google Scholar] [CrossRef]
  67. Allam, M.; Nandhini, M. Best FS using binary teaching learning based optimization algorithm. J. King Saud Univ.-Comput. Inf. Sci. 2022, 34, 329–341. [Google Scholar] [CrossRef]
  68. Alweshah, M.; Khalaileh, S.A.; Gupta, B.B.; Almomani, A.; Hammouri, A.I.; Al-Betar, M.A. The monarch butterfly optimization algorithm for resolving FS problems. Neural Comput. Appl. 2022, 34, 11267–11281. [Google Scholar] [CrossRef]
  69. Hu, G.; Du, B.; Wang, X.; Wei, G. An enhanced black widow optimization algorithm for FS. Knowl.-Based Syst. 2022, 235, 107638. [Google Scholar] [CrossRef]
  70. Ewees, A.A.; Mostafa, R.R.; Ghoniem, R.M.; Gaheen, M.A. Improved seagull optimization algorithm using Lévy flight and mutation operator for FS. Neural Comput. Appl. 2022, 34, 7437–7472. [Google Scholar] [CrossRef]
  71. Awadallah, M.A.; Hammouri, A.I.; Al-Betar, M.A.; Braik, M.S.; Abd Elaziz, M. Binary Horse herd optimization algorithm with crossover operators for FS. Comput. Biol. Med. 2022, 141, 105152. [Google Scholar] [CrossRef] [PubMed]
  72. Pan, H.; Chen, S.; Xiong, H. A high-dimensional FS method based on modified Gray Wolf Optimization. Appl. Soft Comput. 2023, 135, 110031. [Google Scholar] [CrossRef]
  73. Al-Baik, O.; Alomari, S.; Alssayed, O.; Gochhait, S.; Leonova, I.; Dutta, U.; Dehghani, M. Pufferfish optimization algorithm: A new bio-inspired metaheuristic algorithm for resolving optimization problems. Biomimetics 2024, 9, 65. [Google Scholar] [CrossRef] [PubMed]
  74. Zhang, Q.; Gao, H.; Zhan, Z.H.; Li, J.; Zhang, H. Growth Optimizer: A powerful metaheuristic algorithm for resolving continuous and discrete global optimization problems. Knowl.-Based Syst. 2023, 261, 110206. [Google Scholar] [CrossRef]
  75. Shen, Y.; Zhang, C.; Gharehchopogh, F.S.; Mirjalili, S. An improved whale optimization algorithm based on multi-swarm evolution for global optimization and engineering design problems. Expert Syst. Appl. 2023, 215, 119269. [Google Scholar] [CrossRef]
  76. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  77. Chen, F.; Ye, S.; Wang, J.; Luo, J. Multi-Strategy Improved Binary Secretarial Bird Optimization Algorithm for FS. Mathematics 2025, 13, 668. [Google Scholar] [CrossRef]
  78. Wang, J.; Bao, Z.; Dong, H. An Improved Northern Goshawk Optimization Algorithm for Mural Image Segmentation. Biomimetics 2025, 10, 373. [Google Scholar] [CrossRef]
  79. Zhang, X.; Lin, Q. Three-learning strategy particle swarm algorithm for global optimization problems. Inf. Sci. 2022, 593, 289–313. [Google Scholar] [CrossRef]
  80. Li, Y.; Han, T.; Zhou, H. A novel adaptive L-SHADE algorithm and its application in UAV swarm resource configuration problem. Inf. Sci. 2022, 606, 350–367. [Google Scholar] [CrossRef]
  81. Yuan, C.; Zhao, D.; Heidari, A.A. Polar lights optimizer: Algorithm and applications in image segmentation and FS. Neurocomputing 2024, 607, 128427–128472. [Google Scholar] [CrossRef]
  82. Tanabe, R.; Fukunaga, A.S. Improving the search efficacy of SHADE using linear swarm size reduction. In Proceedings of the 2014 IEEE Congress on Evolutionary Computation (CEC), Beijing, China, 6–11 July 2014; IEEE: New York, NY, USA, 2014; pp. 1658–1665. [Google Scholar]
  83. Askr, H.; Abdel-Salam, M.; Hassanien, A.E. Copula entropy-based golden jackal optimization algorithm for high-dimensional FS problems. Expert Syst. Appl. 2024, 238, 121582–121607. [Google Scholar] [CrossRef]
  84. SeyedGarmroudi, S.D.; Kayakutlu, G.; Kayalica, M.O. Improved pelican optimization algorithm for resolving load dispatch problems. Energy 2024, 289, 129811–129826. [Google Scholar] [CrossRef]
  85. Jia, H.; Zhou, X.; Zhang, J. Modified crayfish optimization algorithm for resolving multiple engineering application problems. Artif. Intell. Rev. 2024, 57, 127–183. [Google Scholar] [CrossRef]
  86. Zhu, F.; Li, G.; Tang, H. Dung beetle optimization algorithm based on quantum computing and multi-strategy fusion for resolving engineering problems. Expert Syst. Appl. 2024, 236, 121219–121236. [Google Scholar] [CrossRef]
  87. Wu, B.; Luo, J. A Novel Improved Binary Optimization Algorithm and Its Application in FS Problems. Mathematics 2025, 13, 675. [Google Scholar] [CrossRef]
  88. Cao, Q.; Yuan, S.; Fang, Y. Three Strategies Enhance the Bionic Coati Optimization Algorithm for Global Optimization and FS Problems. Biomimetics 2025, 10, 380. [Google Scholar] [CrossRef]
  89. Nadimi-Shahraki, M.H.; Zamani, H. DMDE: Diversity-maintained multi-trial vector differential evolution algorithm for non-decomposition large-scale global optimization. Expert Syst. Appl. 2022, 198, 116895. [Google Scholar] [CrossRef]
  90. Li, Y.; Yang, C.; Zeng, H.; Dong, Z.; An, Z.; Xu, Y.; Wu, H. Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting. arXiv 2025, arXiv:2507.02939. [Google Scholar]
  91. Li, Y.; Dong, J.; Dong, Z.; Yang, C.; An, Z.; Xu, Y. SRKD: Towards Efficient 3D Point Cloud Segmentation via Structure-and Relation-aware Knowledge Distillation. arXiv 2025, arXiv:2506.17290. [Google Scholar]
Figure 1. Schematic of adaptive exploration strategy.
Figure 2. Schematic of three-swarm search strategy.
Figure 3. The variation curve of dynamic function q.
Figure 4. Second-order Bernstein polynomial function curve.
Figure 5. Execution flowchart of AMFPOA.
Figure 6. Calculation process of FFV for FS problem.
Figure 7. Average ranking of AMFPOA corresponding to different swarm sizes.
Figure 8. AMFPOA iterative convergence curve. (a) Breastcancer. (b) Lipid. (c) HeartEW. (d) Zoo. (e) Vote. (f) Congress. (g) Lymphography. (h) Vehicle.
Figure 9. Swarm diversity of algorithms. (a) Aggregation. (b) Glass. (c) Congress. (d) BreastEW. (e) Hillvalley. (f) Isolet.
Figure 10. Exploration and exploitation ratio of algorithms. (a) Aggregation. (b) Glass. (c) Congress. (d) BreastEW. (e) Hillvalley. (f) Isolet.
Figure 11. Comparison of strategy effectiveness of algorithms.
Figure 12. Ranking of algorithms on different types of FS problems. (a) Low. (b) Medium. (c) High.
Figure 13. Box plots for resolving FS problems. (a) Glass. (b) Breastcancer. (c) Congress. (d) Lymphography. (e) WDBC. (f) Hillvalley. (g) Musk. (h) Madelon. (i) Isolet.
Figure 14. Ranking of FFV.
Figure 15. Average ranking of CA.
Figure 16. Average ranking of feature subset size.
Figure 17. Convergence curves of all algorithms. (a) Glass. (b) Breastcancer. (c) Lipid. (d) HeartEW. (e) Zoo. (f) Vote. (g) Congress. (h) Lymphography. (i) Vehicle.
Figure 18. Comprehensive indicator ranking fill chart.
Figure 19. Stacked plot of the ranking of FFV for the FS problem with tens of thousands of dimensions.
Table 1. Comparison of related technologies.
Technology | Efficacy | Limitations
Binary grasshopper optimization FS algorithm | Success rate with feature subset size exceeding 95% | There are limitations in calculating costs
Binary teaching–learning-based optimization FS algorithm | Effectively distinguished malignant tumors from benign tumors | Lack of universality for FS problems
Monarch butterfly optimization FS algorithm | The average CA reaches 93% | There are limitations in resolving high-dimensional FS problems
Black widow optimization FS algorithm | Reduction in RF has been attained | High-dimensional FS problems have limitations
Improved sine–cosine FS algorithm | High CA of feature subsets | The reduction in RF is not obvious
Binary Horse herd optimization FS algorithm | Effectively reduces RF in the primordial dataset | It is difficult to overcome the challenges posed by high-dimensional FS problems
Improved grey wolf optimization FS algorithm | It effectively reduces RF | Not widely applicable
Table 2. Abbreviations of phrases involved in the paper.
No. | Full Name | Abbreviation
1 | Pufferfish Optimization Algorithm | POA
2 | Feature Selection | FS
3 | Artificial Intelligence | AI
4 | Flying Fox Optimization | FFO
5 | Grey Wolf Optimizer | GWO
6 | African Vulture Optimization Algorithm | AVOA
7 | War Strategy Optimization | WSO
8 | Mother Optimization Algorithm | MOA
9 | Teamwork Optimization Algorithm | TOA
10 | Equilibrium Optimizer | EO
11 | Archimedes Optimization Algorithm | AOA
12 | Black Hole Algorithm | BHA
13 | Genetic Algorithm | GA
14 | Cultural Algorithm | CA
15 | Differential Evolution | DE
Table 3. Knowledge of 23 FS datasets.
Type | Name | Feature Number | Instance Size
Low | Banana | 2 | 5300
 | Aggregation | 2 | 788
 | Iris | 4 | 150
 | Bupa | 6 | 345
 | Breastcancer | 9 | 699
 | Glass | 9 | 214
 | Lipid | 10 | 583
 | HeartEW | 13 | 270
Medium | Congress | 16 | 435
 | Vote | 16 | 435
 | Zoo | 16 | 101
 | Vehicle | 18 | 846
 | Lymphography | 18 | 148
 | BreastEW | 30 | 569
 | WDBC | 30 | 569
 | SonarEW | 60 | 208
High | Libras | 90 | 360
 | Hillvalley | 100 | 606
 | Musk | 166 | 476
 | Clean | 167 | 476
 | Semeion | 256 | 1593
 | Madelon | 500 | 2600
 | Isolet | 617 | 1559
Table 4. Parameter settings for eight comparative algorithms.
Algorithms | Year | Parameter Settings
POA [73] | 2024 | $I \in \{1, 2\}$
ALSHADE [80] | 2022 | $NP_{init} = 18 \cdot D$, $NP_{min} = 4$, $|A| = 2.6 \cdot NP$, $p = 0.11$, $H = 6$, $e = 0.5$
PLO [81] | 2024 | $W_1 = \frac{2}{1 + e^{-2(t/T)^4}} - 1$, $W_2 = e^{-(2t/T)^3}$
LSHADE [82] | 2014 | $NP_{init} = 18 \cdot D$, $NP_{min} = 4$, $|A| = 2.6 \cdot NP$, $p = 0.11$, $H = 6$
BEGJO [83] | 2024 | $\beta = 1.5$, $c_1 = 1.5$
IPOA [84] | 2024 | $F_{PK} = 0.4$, $F_{GK} = 0.6$
MCOA [85] | 2024 | $C_2 = 2 - \frac{FEs}{MaxFEs}$
QHDBO [86] | 2024 | $R = 0.5 \cdot \left( \cos\left( \pi \cdot \frac{t}{T_{max}} \right) + 1 \right)$
Table 5. Fitness function value of algorithms formed by introducing different strategies to FS problems of varying dimensions.
Low-dimensional: Datasets | POA | FPOA | Percentage
Aggregation | 0.100 | 0.100 | 0.00%
Banana | 0.190 | 0.189 | 0.53%
Iris | 0.055 | 0.038 | 30.91%
Bupa | 0.324 | 0.293 | 9.54%
Glass | 0.222 | 0.187 | 15.74%
Breastcancer | 0.061 | 0.052 | 14.85%
Lipid | 0.256 | 0.245 | 4.17%
HeartEW | 0.164 | 0.098 | 40.23%
Mean increase | | | 14.50%
Medium-dimensional: Datasets | POA | APOA | Percentage
Zoo | 0.052 | 0.048 | 7.02%
Vote | 0.048 | 0.038 | 21.32%
Congress | 0.037 | 0.023 | 38.31%
Lymphography | 0.107 | 0.087 | 18.64%
Vehicle | 0.291 | 0.253 | 13.04%
WDBC | 0.027 | 0.021 | 22.55%
BreastEW | 0.049 | 0.035 | 28.66%
SonarEW | 0.132 | 0.043 | 67.42%
Mean increase | | | 27.12%
High-dimensional: Datasets | POA | MPOA | Percentage
Libras | 0.170 | 0.142 | 16.48%
Hillvalley | 0.374 | 0.306 | 18.18%
Musk | 0.084 | 0.063 | 24.59%
Clean | 0.072 | 0.038 | 47.34%
Semeion | 0.139 | 0.076 | 45.17%
Madelon | 0.218 | 0.075 | 65.64%
Isolet | 0.215 | 0.093 | 56.71%
Mean increase | | | 39.16%
Table 6. The FFV when resolving FS problems.

| Datasets | Metrics | POA | ALSHADE | PLO | LSHADE | BEGJO | IPOA | MCOA | QHDBO | AMFPOA |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Aggregation | MIN | 0.100 | 0.100 | 0.100 | 0.100 | 0.100 | 0.106 | 0.106 | 0.100 | 0.100 |
|  | AVG | 0.100 | 0.100 | 0.100 | 0.100 | 0.100 | 0.106 | 0.106 | 0.100 | 0.100 |
|  | MAX | 0.100 | 0.100 | 0.100 | 0.100 | 0.100 | 0.106 | 0.106 | 0.100 | 0.100 |
| Banana | MIN | 0.190 | 0.193 | 0.200 | 0.198 | 0.195 | 0.204 | 0.200 | 0.193 | 0.187 |
|  | AVG | 0.190 | 0.193 | 0.200 | 0.198 | 0.195 | 0.204 | 0.200 | 0.193 | 0.187 |
|  | MAX | 0.190 | 0.193 | 0.200 | 0.198 | 0.195 | 0.204 | 0.200 | 0.193 | 0.187 |
| Iris | MIN | 0.055 | 0.055 | 0.055 | 0.080 | 0.055 | 0.055 | 0.080 | 0.080 | 0.025 |
|  | AVG | 0.055 | 0.055 | 0.055 | 0.081 | 0.055 | 0.055 | 0.081 | 0.080 | 0.025 |
|  | MAX | 0.055 | 0.055 | 0.055 | 0.085 | 0.055 | 0.055 | 0.085 | 0.080 | 0.025 |
| Bupa | MIN | 0.324 | 0.298 | 0.346 | 0.307 | 0.288 | 0.333 | 0.307 | 0.307 | 0.281 |
|  | AVG | 0.324 | 0.309 | 0.347 | 0.310 | 0.289 | 0.340 | 0.309 | 0.307 | 0.284 |
|  | MAX | 0.324 | 0.346 | 0.356 | 0.337 | 0.298 | 0.367 | 0.320 | 0.307 | 0.298 |
| Glass | MIN | 0.216 | 0.302 | 0.291 | 0.259 | 0.237 | 0.248 | 0.312 | 0.215 | 0.173 |
|  | AVG | 0.222 | 0.325 | 0.293 | 0.262 | 0.257 | 0.251 | 0.321 | 0.222 | 0.173 |
|  | MAX | 0.237 | 0.344 | 0.302 | 0.270 | 0.323 | 0.258 | 0.355 | 0.279 | 0.173 |
| Breastcancer | MIN | 0.061 | 0.035 | 0.042 | 0.046 | 0.048 | 0.042 | 0.053 | 0.053 | 0.040 |
|  | AVG | 0.061 | 0.050 | 0.042 | 0.047 | 0.052 | 0.045 | 0.058 | 0.053 | 0.040 |
|  | MAX | 0.061 | 0.055 | 0.042 | 0.053 | 0.059 | 0.059 | 0.068 | 0.055 | 0.046 |
| Lipid | MIN | 0.249 | 0.224 | 0.247 | 0.245 | 0.255 | 0.266 | 0.253 | 0.261 | 0.235 |
|  | AVG | 0.256 | 0.248 | 0.249 | 0.259 | 0.264 | 0.274 | 0.259 | 0.264 | 0.241 |
|  | MAX | 0.274 | 0.271 | 0.261 | 0.276 | 0.286 | 0.289 | 0.271 | 0.274 | 0.253 |
| HeartEW | MIN | 0.146 | 0.190 | 0.123 | 0.156 | 0.114 | 0.088 | 0.156 | 0.081 | 0.056 |
|  | AVG | 0.164 | 0.212 | 0.133 | 0.165 | 0.127 | 0.103 | 0.169 | 0.114 | 0.080 |
|  | MAX | 0.208 | 0.240 | 0.155 | 0.190 | 0.155 | 0.156 | 0.214 | 0.191 | 0.140 |
| Zoo | MIN | 0.038 | 0.038 | 0.076 | 0.083 | 0.038 | 0.038 | 0.038 | 0.044 | 0.031 |
|  | AVG | 0.052 | 0.053 | 0.076 | 0.094 | 0.061 | 0.061 | 0.048 | 0.071 | 0.046 |
|  | MAX | 0.076 | 0.076 | 0.076 | 0.128 | 0.095 | 0.108 | 0.063 | 0.115 | 0.076 |
| Vote | MIN | 0.042 | 0.054 | 0.035 | 0.048 | 0.066 | 0.037 | 0.050 | 0.037 | 0.033 |
|  | AVG | 0.048 | 0.064 | 0.036 | 0.048 | 0.068 | 0.039 | 0.060 | 0.037 | 0.034 |
|  | MAX | 0.054 | 0.077 | 0.039 | 0.048 | 0.079 | 0.044 | 0.070 | 0.037 | 0.037 |
| Congress | MIN | 0.037 | 0.027 | 0.058 | 0.037 | 0.046 | 0.035 | 0.027 | 0.058 | 0.017 |
|  | AVG | 0.037 | 0.041 | 0.063 | 0.037 | 0.058 | 0.042 | 0.027 | 0.058 | 0.017 |
|  | MAX | 0.037 | 0.060 | 0.068 | 0.037 | 0.079 | 0.050 | 0.027 | 0.058 | 0.017 |
| Lymphography | MIN | 0.090 | 0.157 | 0.101 | 0.081 | 0.126 | 0.115 | 0.141 | 0.146 | 0.059 |
|  | AVG | 0.107 | 0.200 | 0.142 | 0.089 | 0.136 | 0.128 | 0.159 | 0.162 | 0.066 |
|  | MAX | 0.126 | 0.228 | 0.166 | 0.110 | 0.163 | 0.163 | 0.220 | 0.189 | 0.079 |
| Vehicle | MIN | 0.257 | 0.279 | 0.236 | 0.252 | 0.251 | 0.225 | 0.257 | 0.241 | 0.230 |
|  | AVG | 0.291 | 0.296 | 0.247 | 0.271 | 0.275 | 0.249 | 0.284 | 0.255 | 0.241 |
|  | MAX | 0.310 | 0.315 | 0.263 | 0.294 | 0.311 | 0.263 | 0.305 | 0.273 | 0.257 |
| WDBC | MIN | 0.023 | 0.056 | 0.037 | 0.042 | 0.058 | 0.053 | 0.046 | 0.039 | 0.015 |
|  | AVG | 0.027 | 0.065 | 0.051 | 0.067 | 0.072 | 0.064 | 0.062 | 0.040 | 0.017 |
|  | MAX | 0.031 | 0.072 | 0.059 | 0.078 | 0.090 | 0.074 | 0.083 | 0.054 | 0.039 |
| BreastEW | MIN | 0.036 | 0.062 | 0.036 | 0.029 | 0.035 | 0.046 | 0.044 | 0.028 | 0.021 |
|  | AVG | 0.049 | 0.069 | 0.039 | 0.034 | 0.053 | 0.058 | 0.062 | 0.036 | 0.031 |
|  | MAX | 0.059 | 0.080 | 0.046 | 0.041 | 0.071 | 0.070 | 0.084 | 0.048 | 0.044 |
| SonarEW | MIN | 0.089 | 0.106 | 0.030 | 0.013 | 0.028 | 0.025 | 0.012 | 0.054 | 0.010 |
|  | AVG | 0.132 | 0.146 | 0.043 | 0.042 | 0.052 | 0.047 | 0.095 | 0.073 | 0.020 |
|  | MAX | 0.156 | 0.170 | 0.074 | 0.062 | 0.082 | 0.071 | 0.131 | 0.099 | 0.042 |
| Libras | MIN | 0.147 | 0.154 | 0.141 | 0.109 | 0.134 | 0.200 | 0.242 | 0.174 | 0.109 |
|  | AVG | 0.170 | 0.176 | 0.158 | 0.135 | 0.156 | 0.229 | 0.273 | 0.191 | 0.132 |
|  | MAX | 0.187 | 0.187 | 0.180 | 0.152 | 0.173 | 0.263 | 0.324 | 0.208 | 0.146 |
| Hillvalley | MIN | 0.347 | 0.360 | 0.301 | 0.312 | 0.344 | 0.321 | 0.319 | 0.298 | 0.276 |
|  | AVG | 0.374 | 0.376 | 0.315 | 0.342 | 0.364 | 0.338 | 0.338 | 0.310 | 0.299 |
|  | MAX | 0.389 | 0.384 | 0.334 | 0.383 | 0.373 | 0.363 | 0.356 | 0.323 | 0.317 |
| Musk | MIN | 0.062 | 0.080 | 0.047 | 0.021 | 0.043 | 0.063 | 0.097 | 0.022 | 0.020 |
|  | AVG | 0.084 | 0.104 | 0.065 | 0.045 | 0.059 | 0.077 | 0.123 | 0.052 | 0.035 |
|  | MAX | 0.098 | 0.119 | 0.090 | 0.063 | 0.084 | 0.111 | 0.149 | 0.083 | 0.048 |
| Clean | MIN | 0.057 | 0.089 | 0.040 | 0.023 | 0.047 | 0.047 | 0.056 | 0.032 | 0.020 |
|  | AVG | 0.072 | 0.109 | 0.054 | 0.033 | 0.088 | 0.063 | 0.075 | 0.053 | 0.030 |
|  | MAX | 0.091 | 0.127 | 0.068 | 0.045 | 0.109 | 0.088 | 0.101 | 0.067 | 0.053 |
| Semeion | MIN | 0.134 | 0.122 | 0.072 | 0.059 | 0.109 | 0.081 | 0.124 | 0.069 | 0.070 |
|  | AVG | 0.139 | 0.127 | 0.082 | 0.078 | 0.115 | 0.087 | 0.137 | 0.078 | 0.078 |
|  | MAX | 0.143 | 0.134 | 0.091 | 0.090 | 0.126 | 0.095 | 0.147 | 0.086 | 0.090 |
| Madelon | MIN | 0.186 | 0.228 | 0.102 | 0.107 | 0.189 | 0.184 | 0.123 | 0.099 | 0.060 |
|  | AVG | 0.218 | 0.241 | 0.144 | 0.126 | 0.203 | 0.202 | 0.199 | 0.127 | 0.079 |
|  | MAX | 0.265 | 0.261 | 0.191 | 0.141 | 0.221 | 0.243 | 0.257 | 0.172 | 0.095 |
| Isolet | MIN | 0.198 | 0.151 | 0.085 | 0.075 | 0.124 | 0.130 | 0.138 | 0.112 | 0.070 |
|  | AVG | 0.215 | 0.173 | 0.108 | 0.093 | 0.146 | 0.143 | 0.176 | 0.126 | 0.080 |
|  | MAX | 0.228 | 0.186 | 0.130 | 0.105 | 0.165 | 0.161 | 0.204 | 0.145 | 0.099 |
| Mean Rank | MIN | 5.391 | 6.087 | 4.522 | 4.087 | 5.043 | 5.391 | 6.348 | 4.391 | 1.217 |
|  | AVG | 5.391 | 6.565 | 4.391 | 4.261 | 5.478 | 5.348 | 6.696 | 4.391 | 1.000 |
|  | MAX | 5.174 | 6.348 | 3.826 | 4.043 | 5.522 | 5.565 | 6.826 | 4.217 | 1.261 |
| Final Rank | MIN | 6 | 8 | 4 | 2 | 5 | 6 | 9 | 3 | 1 |
|  | AVG | 6 | 8 | 3 | 2 | 7 | 5 | 9 | 3 | 1 |
|  | MAX | 5 | 8 | 2 | 3 | 6 | 7 | 9 | 4 | 1 |
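The "Mean Rank" and "Final Rank" rows in Table 6 follow the usual Friedman-style aggregation: on each dataset the nine algorithms are ranked by FFV, the per-dataset ranks are averaged over all 23 datasets, and the averages are then ranked once more (the tabulated final ranks are consistent with ties sharing the minimum rank). A sketch of that aggregation, assuming SciPy is available:

```python
import numpy as np
from scipy.stats import rankdata

def mean_and_final_ranks(ffv: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    """ffv: (n_datasets, n_algorithms) matrix; lower FFV is better."""
    per_dataset = np.vstack([rankdata(row, method="min") for row in ffv])
    mean_rank = per_dataset.mean(axis=0)            # the "Mean Rank" row
    final_rank = rankdata(mean_rank, method="min")  # the "Final Rank" row
    return mean_rank, final_rank
```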
Table 7. Wilcoxon rank sum test results.

| Datasets | POA | ALSHADE | PLO | LSHADE | BEGJO | IPOA | MCOA | QHDBO |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Aggregation | 9.110 × 10^−2/= | 6.000 × 10^−2/= | 9.200 × 10^−2/= | 7.280 × 10^−1/= | 8.290 × 10^−2/= | 1.892 × 10^−10/− | 6.956 × 10^−10/− | 7.120 × 10^−2/= |
| Banana | 4.108 × 10^−4/− | 4.574 × 10^−10/− | 7.146 × 10^−10/− | 2.550 × 10^−10/− | 7.087 × 10^−10/− | 4.462 × 10^−10/− | 6.548 × 10^−10/− | 2.392 × 10^−10/− |
| Iris | 7.930 × 10^−5/− | 8.709 × 10^−6/− | 8.512 × 10^−6/− | 3.301 × 10^−7/− | 7.322 × 10^−10/− | 4.947 × 10^−6/− | 5.880 × 10^−8/− | 8.915 × 10^−10/− |
| Bupa | 4.632 × 10^−5/− | 8.900 × 10^−6/− | 1.999 × 10^−10/− | 9.077 × 10^−10/− | 7.419 × 10^−10/− | 5.243 × 10^−10/− | 7.833 × 10^−10/− | 2.413 × 10^−10/− |
| Glass | 3.759 × 10^−5/− | 5.281 × 10^−7/− | 3.112 × 10^−10/− | 3.862 × 10^−10/− | 8.152 × 10^−10/− | 1.258 × 10^−10/− | 2.422 × 10^−10/− | 8.418 × 10^−10/− |
| Breastcancer | 8.523 × 10^−10/− | 4.779 × 10^−6/− | 7.172 × 10^−5/− | 5.753 × 10^−5/− | 7.568 × 10^−10/− | 2.606 × 10^−5/− | 8.336 × 10^−6/− | 6.122 × 10^−10/− |
| Lipid | 4.106 × 10^−10/− | 1.040 × 10^−6/− | 5.537 × 10^−5/− | 3.546 × 10^−5/− | 9.842 × 10^−7/− | 8.518 × 10^−5/− | 5.125 × 10^−5/− | 2.049 × 10^−7/− |
| HeartEW | 6.135 × 10^−10/− | 3.467 × 10^−6/− | 4.884 × 10^−5/− | 4.749 × 10^−5/− | 4.925 × 10^−9/− | 7.337 × 10^−5/− | 7.799 × 10^−5/− | 6.806 × 10^−7/− |
| Zoo | 8.118 × 10^−4/− | 5.376 × 10^−5/− | 3.168 × 10^−6/− | 6.752 × 10^−4/− | 6.953 × 10^−7/− | 4.294 × 10^−5/− | 9.395 × 10^−4/− | 7.198 × 10^−7/− |
| Vote | 8.687 × 10^−4/− | 5.312 × 10^−5/− | 4.068 × 10^−5/− | 6.744 × 10^−4/− | 4.834 × 10^−7/− | 4.891 × 10^−5/− | 8.118 × 10^−4/− | 3.323 × 10^−7/− |
| Congress | 9.206 × 10^−5/− | 5.752 × 10^−5/− | 4.553 × 10^−6/− | 2.470 × 10^−4/− | 2.908 × 10^−7/− | 2.458 × 10^−6/− | 4.939 × 10^−4/− | 8.754 × 10^−7/− |
| Lymphography | 9.863 × 10^−4/− | 2.135 × 10^−5/− | 4.500 × 10^−5/− | 4.351 × 10^−5/− | 3.919 × 10^−7/− | 4.051 × 10^−5/− | 7.508 × 10^−5/− | 5.405 × 10^−7/− |
| Vehicle | 8.115 × 10^−4/− | 8.065 × 10^−5/− | 4.072 × 10^−5/− | 8.227 × 10^−4/− | 7.462 × 10^−7/− | 8.248 × 10^−5/− | 8.966 × 10^−4/− | 9.643 × 10^−7/− |
| WDBC | 5.441 × 10^−4/− | 8.676 × 10^−6/− | 5.874 × 10^−10/− | 8.534 × 10^−5/− | 2.194 × 10^−7/− | 2.735 × 10^−10/− | 8.741 × 10^−5/− | 9.746 × 10^−7/− |
| BreastEW | 4.337 × 10^−4/− | 1.552 × 10^−5/− | 2.468 × 10^−10/− | 5.237 × 10^−5/− | 9.608 × 10^−7/− | 8.287 × 10^−10/− | 6.717 × 10^−5/− | 7.761 × 10^−7/− |
| SonarEW | 7.547 × 10^−4/− | 8.412 × 10^−5/− | 8.319 × 10^−10/− | 3.371 × 10^−5/− | 8.322 × 10^−7/− | 9.829 × 10^−10/− | 3.971 × 10^−5/− | 6.279 × 10^−7/− |
| Libras | 7.958 × 10^−4/− | 4.018 × 10^−5/− | 8.241 × 10^−10/− | 5.606 × 10^−5/− | 1.044 × 10^−7/− | 4.766 × 10^−10/− | 6.978 × 10^−5/− | 4.869 × 10^−7/− |
| Hillvalley | 7.086 × 10^−4/− | 8.037 × 10^−5/− | 3.034 × 10^−10/− | 9.272 × 10^−5/− | 6.400 × 10^−10/− | 6.108 × 10^−10/− | 4.109 × 10^−5/− | 1.616 × 10^−10/− |
| Musk | 8.895 × 10^−4/− | 3.520 × 10^−5/− | 6.301 × 10^−10/− | 8.268 × 10^−5/− | 4.725 × 10^−10/− | 8.507 × 10^−10/− | 3.247 × 10^−5/− | 2.298 × 10^−10/− |
| Clean | 6.316 × 10^−4/− | 8.486 × 10^−10/− | 2.961 × 10^−10/− | 8.470 × 10^−5/− | 5.054 × 10^−10/− | 8.302 × 10^−10/− | 9.154 × 10^−5/− | 3.678 × 10^−10/− |
| Semeion | 3.121 × 10^−10/− | 2.848 × 10^−10/− | 4.749 × 10^−10/− | 2.417 × 10^−10/− | 2.158 × 10^−10/− | 1.291 × 10^−10/− | 5.210 × 10^−10/− | 8.495 × 10^−10/− |
| Madelon | 5.407 × 10^−7/− | 7.818 × 10^−10/− | 5.090 × 10^−10/− | 2.216 × 10^−10/− | 8.459 × 10^−10/− | 8.813 × 10^−10/− | 8.294 × 10^−10/− | 8.301 × 10^−10/− |
| Isolet | 7.445 × 10^−10/− | 9.694 × 10^−10/− | 1.623 × 10^−10/− | 6.925 × 10^−5/− | 7.022 × 10^−10/− | 9.167 × 10^−10/− | 2.646 × 10^−5/− | 6.848 × 10^−10/− |
| +/−/= | 0/22/1 | 0/22/1 | 0/22/1 | 0/22/1 | 0/22/1 | 0/23/0 | 0/23/0 | 0/22/1 |
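Each cell in Table 7 pairs the p-value of a Wilcoxon rank-sum test between the per-run FFV samples of AMFPOA and a competitor with a verdict at α = 0.05; as the table is read here, "−" marks a competitor significantly worse than AMFPOA, "+" significantly better, and "=" no significant difference. A minimal sketch of that test using SciPy (the run data below is illustrative, not the paper's):

```python
import numpy as np
from scipy.stats import ranksums

def wilcoxon_verdict(amfpoa_runs, rival_runs, alpha=0.05):
    """Compare two independent per-run FFV samples; lower FFV is better."""
    _, p = ranksums(amfpoa_runs, rival_runs)
    if p >= alpha:
        return p, "="  # no statistically significant difference
    # Significant difference: direction decided by which sample is lower.
    return p, "-" if np.mean(amfpoa_runs) < np.mean(rival_runs) else "+"

# Illustrative 30-run samples only.
rng = np.random.default_rng(0)
print(wilcoxon_verdict(rng.normal(0.08, 0.01, 30), rng.normal(0.13, 0.02, 30)))
```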
Table 8. CA (%) in resolving FS problems.

| Datasets | POA | ALSHADE | PLO | LSHADE | BEGJO | IPOA | MCOA | QHDBO | AMFPOA |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Aggregation | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 99.36 | 99.36 | 100.00 | 100.00 |
| Banana | 90.00 | 89.72 | 88.87 | 89.15 | 89.43 | 88.49 | 88.87 | 89.62 | 90.28 |
| Iris | 96.67 | 96.67 | 96.67 | 96.33 | 96.67 | 96.67 | 96.00 | 96.67 | 100.00 |
| Bupa | 69.57 | 71.01 | 64.93 | 69.42 | 72.46 | 66.52 | 69.42 | 69.57 | 75.07 |
| Glass | 79.52 | 69.29 | 73.33 | 75.71 | 76.43 | 75.48 | 67.38 | 77.86 | 85.71 |
| Breastcancer | 95.68 | 97.70 | 97.84 | 98.35 | 96.91 | 97.70 | 96.40 | 97.27 | 99.21 |
| Lipid | 75.26 | 74.91 | 75.60 | 73.45 | 73.79 | 70.86 | 73.19 | 72.41 | 75.09 |
| HeartEW | 85.37 | 78.89 | 88.52 | 84.26 | 90.00 | 92.04 | 83.70 | 90.00 | 93.15 |
| Zoo | 98.50 | 99.50 | 95.00 | 94.50 | 98.00 | 98.50 | 100.00 | 96.50 | 98.50 |
| Vote | 96.09 | 94.71 | 97.59 | 95.40 | 95.52 | 96.67 | 94.60 | 96.55 | 98.62 |
| Congress | 96.55 | 97.36 | 95.17 | 96.55 | 96.67 | 98.74 | 97.70 | 94.25 | 98.85 |
| Lymphography | 92.07 | 80.00 | 87.59 | 93.10 | 88.97 | 88.97 | 84.83 | 85.17 | 95.52 |
| Vehicle | 70.95 | 70.83 | 76.21 | 73.25 | 73.31 | 75.74 | 72.37 | 75.50 | 77.16 |
| WDBC | 97.88 | 95.58 | 95.22 | 93.63 | 94.42 | 94.87 | 94.42 | 96.28 | 98.85 |
| BreastEW | 98.14 | 95.58 | 97.70 | 98.05 | 98.05 | 97.17 | 95.04 | 98.32 | 98.32 |
| SonarEW | 88.78 | 86.10 | 98.05 | 97.56 | 98.05 | 98.05 | 92.20 | 93.17 | 99.27 |
| Libras | 83.75 | 83.06 | 84.72 | 87.64 | 87.22 | 78.75 | 71.81 | 80.28 | 87.64 |
| Hillvalley | 59.09 | 59.42 | 66.53 | 64.30 | 63.97 | 66.94 | 63.97 | 66.86 | 67.52 |
| Musk | 93.79 | 91.37 | 95.16 | 97.68 | 98.11 | 96.53 | 89.26 | 96.11 | 98.53 |
| Clean | 93.58 | 90.95 | 98.63 | 95.89 | 95.16 | 97.68 | 93.79 | 95.68 | 99.26 |
| Semeion | 89.59 | 89.81 | 94.72 | 94.47 | 92.48 | 95.50 | 89.69 | 96.73 | 96.67 |
| Madelon | 78.62 | 76.17 | 85.81 | 88.19 | 82.79 | 82.83 | 80.17 | 86.98 | 92.37 |
| Isolet | 79.42 | 83.92 | 90.29 | 92.03 | 89.07 | 89.39 | 83.83 | 88.30 | 93.41 |
| Mean Rank | 5.26 | 6.13 | 4.39 | 4.83 | 4.43 | 4.70 | 7.30 | 4.48 | 1.22 |
| Final Rank | 7 | 8 | 2 | 6 | 3 | 5 | 9 | 4 | 1 |
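CA in Table 8 is the percentage of samples classified correctly when the learner sees only the selected feature subset. The sketch below shows one common way to score a candidate subset; the choice of a 5-nearest-neighbor classifier, 10-fold cross-validation, and scikit-learn is an assumption for illustration, not a restatement of the paper's exact protocol.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def classification_accuracy(X: np.ndarray, y: np.ndarray, mask: np.ndarray) -> float:
    """Mean cross-validated accuracy (%) using only features where mask == 1."""
    if mask.sum() == 0:  # an empty subset cannot be evaluated
        return 0.0
    clf = KNeighborsClassifier(n_neighbors=5)
    scores = cross_val_score(clf, X[:, mask.astype(bool)], y, cv=10)
    return 100.0 * scores.mean()
```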
Table 9. The size of feature subsets in resolving FS problems.

| Datasets | POA | ALSHADE | PLO | LSHADE | BEGJO | IPOA | MCOA | QHDBO | AMFPOA |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Aggregation | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| Banana | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 | 2.00 |
| Iris | 1.00 | 1.00 | 1.00 | 1.90 | 1.00 | 1.00 | 1.80 | 2.00 | 1.00 |
| Bupa | 3.00 | 2.90 | 1.95 | 2.10 | 3.90 | 2.30 | 2.00 | 2.00 | 1.90 |
| Glass | 3.40 | 4.40 | 4.80 | 3.90 | 4.00 | 2.70 | 2.50 | 2.00 | 2.40 |
| Breastcancer | 2.00 | 2.60 | 2.00 | 2.90 | 2.20 | 2.20 | 2.30 | 2.60 | 2.00 |
| Lipid | 3.30 | 2.20 | 2.90 | 2.00 | 2.80 | 1.20 | 1.80 | 1.60 | 1.70 |
| HeartEW | 4.20 | 2.80 | 3.80 | 3.00 | 4.80 | 4.10 | 2.90 | 3.10 | 2.40 |
| Zoo | 6.10 | 7.70 | 5.00 | 7.10 | 6.90 | 7.60 | 7.70 | 6.30 | 5.00 |
| Vote | 2.10 | 2.60 | 3.80 | 1.00 | 4.50 | 1.40 | 1.90 | 2.00 | 1.00 |
| Congress | 1.00 | 2.80 | 3.10 | 1.00 | 4.40 | 4.90 | 1.00 | 1.00 | 1.00 |
| Lymphography | 6.40 | 3.60 | 5.50 | 4.80 | 6.60 | 5.20 | 4.10 | 5.10 | 4.70 |
| Vehicle | 5.30 | 6.10 | 6.00 | 5.40 | 6.30 | 5.50 | 6.30 | 6.30 | 6.30 |
| WDBC | 2.40 | 7.50 | 2.50 | 2.90 | 6.60 | 5.30 | 3.60 | 2.00 | 2.00 |
| BreastEW | 9.70 | 8.70 | 5.50 | 5.80 | 10.70 | 9.90 | 5.10 | 6.20 | 4.10 |
| SonarEW | 18.60 | 12.40 | 15.10 | 11.80 | 20.60 | 17.70 | 14.70 | 7.20 | 7.90 |
| Libras | 21.40 | 21.00 | 18.40 | 21.00 | 37.00 | 33.80 | 17.20 | 11.80 | 17.10 |
| Hillvalley | 5.80 | 10.50 | 14.00 | 21.00 | 39.40 | 40.40 | 14.20 | 12.20 | 6.60 |
| Musk | 45.90 | 43.50 | 35.70 | 39.70 | 69.90 | 75.60 | 44.00 | 27.90 | 35.40 |
| Clean | 28.10 | 46.40 | 29.30 | 43.80 | 73.60 | 70.10 | 32.00 | 24.20 | 24.00 |
| Semeion | 115.00 | 90.10 | 87.70 | 73.30 | 121.40 | 118.80 | 111.80 | 124.00 | 87.00 |
| Madelon | 129.10 | 134.00 | 52.90 | 97.40 | 242.90 | 236.20 | 104.30 | 82.90 | 50.30 |
| Isolet | 182.70 | 175.60 | 130.50 | 131.10 | 295.10 | 291.00 | 188.90 | 126.90 | 125.10 |
| Mean Rank | 4.65 | 5.00 | 4.04 | 4.17 | 7.04 | 5.74 | 4.30 | 3.43 | 1.65 |
| Final Rank | 6 | 7 | 3 | 4 | 9 | 8 | 5 | 2 | 1 |
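Tables 6, 8, and 9 are linked through the fitness function: metaheuristic FS methods typically minimize a weighted sum of the classification error and the relative subset size, which is why a lower FFV goes hand in hand with higher CA and fewer features. The sketch below uses the standard formulation with a weight of α = 0.9; this value is inferred from the tabulated results rather than quoted from the methodology (e.g., Iris under AMFPOA: 0.9 · 0 + 0.1 · (1/4) = 0.025, matching Tables 6, 8, and 9).

```python
def fitness(error_rate: float, n_selected: int, n_total: int,
            alpha: float = 0.9) -> float:
    """FFV = alpha * classification error + (1 - alpha) * subset-size ratio.

    alpha = 0.9 is inferred from the tables, e.g. Banana under AMFPOA:
    0.9 * (1 - 0.9028) + 0.1 * (2 / 2) = 0.187.
    """
    return alpha * error_rate + (1.0 - alpha) * n_selected / n_total

print(fitness(0.0, 1, 4))                    # Iris / AMFPOA  -> 0.025
print(round(fitness(1 - 0.9028, 2, 2), 3))   # Banana / AMFPOA -> 0.187
```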
Table 10. Details of the FS datasets with tens of thousands of dimensions.

| Name | Feature Number | Instance Size | Class |
| --- | --- | --- | --- |
| Lung | 12,533 | 203 | 5 |
| MLL | 12,582 | 72 | 3 |
| Ovarian | 15,154 | 253 | 2 |
| Arcene | 10,000 | 200 | 2 |
| RNA-Seq | 20,531 | 801 | 5 |
| Dorothea | 100,000 | 800 | 2 |
| CNS | 7,129 | 60 | 2 |
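At the scale of Table 10 (up to 100,000 features for Dorothea), each candidate solution is effectively a binary mask over the feature set, and continuous metaheuristics in the POA family are commonly binarized through a sigmoid transfer function. The sketch below illustrates that common binarization step as general background; it is not the operator specified in this paper.

```python
import numpy as np

def binarize(position: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Map a continuous position vector to a 0/1 feature mask.

    Uses the S-shaped transfer S(x) = 1 / (1 + exp(-x)); dimension d is
    selected when a uniform draw falls below S(position[d]).
    """
    probs = 1.0 / (1.0 + np.exp(-position))
    return (rng.random(position.shape) < probs).astype(np.int8)

rng = np.random.default_rng(42)
mask = binarize(rng.normal(size=100_000), rng)  # Dorothea-scale mask
print(mask.sum(), "features selected")
```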
Table 11. FFV for FS problems with tens of thousands of dimensions.

| Datasets | Metrics | POA | ALSHADE | PLO | LSHADE | BEGJO | IPOA | MCOA | QHDBO | AMFPOA |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Lung | MIN | 0.120 | 0.152 | 0.097 | 0.138 | 0.151 | 0.141 | 0.151 | 0.159 | 0.094 |
|  | AVG | 0.141 | 0.166 | 0.118 | 0.162 | 0.182 | 0.162 | 0.177 | 0.174 | 0.113 |
|  | MAX | 0.170 | 0.170 | 0.136 | 0.186 | 0.216 | 0.188 | 0.193 | 0.188 | 0.139 |
| MLL | MIN | 0.344 | 0.333 | 0.336 | 0.310 | 0.293 | 0.372 | 0.316 | 0.292 | 0.270 |
|  | AVG | 0.364 | 0.345 | 0.354 | 0.330 | 0.315 | 0.389 | 0.345 | 0.308 | 0.288 |
|  | MAX | 0.389 | 0.361 | 0.368 | 0.346 | 0.329 | 0.408 | 0.364 | 0.337 | 0.297 |
| Ovarian | MIN | 0.058 | 0.083 | 0.029 | 0.057 | 0.056 | 0.072 | 0.067 | 0.038 | 0.022 |
|  | AVG | 0.089 | 0.106 | 0.050 | 0.070 | 0.085 | 0.089 | 0.092 | 0.047 | 0.031 |
|  | MAX | 0.109 | 0.135 | 0.070 | 0.095 | 0.110 | 0.100 | 0.131 | 0.059 | 0.042 |
| Arcene | MIN | 0.048 | 0.089 | 0.034 | 0.046 | 0.085 | 0.054 | 0.050 | 0.036 | 0.017 |
|  | AVG | 0.070 | 0.106 | 0.054 | 0.066 | 0.097 | 0.066 | 0.064 | 0.046 | 0.030 |
|  | MAX | 0.107 | 0.123 | 0.067 | 0.105 | 0.108 | 0.086 | 0.078 | 0.060 | 0.038 |
| RNA-Seq | MIN | 0.105 | 0.106 | 0.076 | 0.082 | 0.101 | 0.079 | 0.111 | 0.084 | 0.062 |
|  | AVG | 0.109 | 0.116 | 0.089 | 0.089 | 0.111 | 0.089 | 0.124 | 0.091 | 0.069 |
|  | MAX | 0.115 | 0.125 | 0.098 | 0.098 | 0.126 | 0.107 | 0.132 | 0.100 | 0.083 |
| Dorothea | MIN | 0.195 | 0.204 | 0.096 | 0.096 | 0.191 | 0.181 | 0.154 | 0.096 | 0.082 |
|  | AVG | 0.254 | 0.263 | 0.125 | 0.114 | 0.221 | 0.200 | 0.220 | 0.126 | 0.098 |
|  | MAX | 0.285 | 0.305 | 0.162 | 0.133 | 0.266 | 0.224 | 0.275 | 0.173 | 0.135 |
| CNS | MIN | 0.143 | 0.163 | 0.091 | 0.083 | 0.139 | 0.137 | 0.159 | 0.111 | 0.069 |
|  | AVG | 0.160 | 0.179 | 0.116 | 0.096 | 0.160 | 0.151 | 0.178 | 0.128 | 0.077 |
|  | MAX | 0.175 | 0.187 | 0.143 | 0.115 | 0.196 | 0.164 | 0.207 | 0.144 | 0.088 |
| Mean Rank | MIN | 6.286 | 8.286 | 2.857 | 3.857 | 5.857 | 6.143 | 6.571 | 4.143 | 1.000 |
|  | AVG | 6.429 | 8.000 | 3.429 | 3.429 | 6.571 | 5.571 | 6.857 | 3.714 | 1.000 |
|  | MAX | 6.286 | 7.143 | 3.286 | 3.429 | 6.857 | 5.714 | 7.286 | 3.714 | 1.286 |
| Final Rank | MIN | 7 | 9 | 2 | 3 | 5 | 6 | 8 | 4 | 1 |
|  | AVG | 6 | 9 | 2 | 2 | 7 | 5 | 8 | 4 | 1 |
|  | MAX | 6 | 8 | 2 | 3 | 7 | 5 | 9 | 4 | 1 |