Abstract
Metaheuristics have proven to be effective in solving large-scale combinatorial problems by combining global exploration with local exploitation, all within a reasonably short time. The balance between these phases is crucial to avoid slow or premature convergence. We propose binary variants of the Arithmetic Optimization Algorithm for the set cover problem, integrating a two-step binarization scheme based on transfer functions with binarization rules and a greedy repair operator to ensure feasibility. We evaluate the proposed solution using forty-five instances from OR-Beasley and compare it with representative approaches, including genetic algorithms, path-relinking strategies, and Lagrangian-based heuristics. The quality of the solution is evaluated using relative percentage deviation and stability with the coefficient of variation. The results show competitive deviations and consistently low variation, confirming that our approach is a robust alternative with a solid balance between exploration and exploitation.
Keywords:
arithmetic optimization algorithm; binary metaheuristics; binarization techniques; set covering problem; combinatorial optimization; exploration–exploitation balance MSC:
68T20; 68W25; 90C27; 90C59; 68Q25
1. Introduction
The Set Covering Problem [1,2] is a widely studied NP-hard combinatorial optimization problem of great industrial relevance, including applications in service planning, facility location, and network optimization [3]. Despite its straightforward formulation, its computational complexity has led to the development of numerous exact and approximate methods, including metaheuristics such as genetic algorithms, ant colony optimization, GRASP, and chemical reaction algorithms. However, no metaheuristic consistently outperforms others across all instances, highlighting the need for tailored approaches.
In this context, we selected the Arithmetic Optimization Algorithm [4] due to its simplicity, strong balance between exploration and exploitation, and recent success in continuous optimization tasks. Although originally designed for continuous domains, its potential for discrete problems such as SCPs remains largely unexplored. This study presents an adaptation of the AOA to the SCP, incorporating a two-step binarization [5] process that preserves the fundamental structure of the original algorithm while integrating well-established transfer functions and binarization rules used in previous SCP research.
The rest of this article is organized as follows: Section 2 presents the Set Covering Problem (SCP) along with its state of the art and formalization; Section 3 describes the Arithmetic Optimization Algorithm (AOA) metaheuristic, including its main parameters, exploration and exploitation components, and original algorithm; Section 4 introduces the two-step binarization mechanism, transfer functions, and binarization rules along with the reasons for their selection; Section 5 details our proposal, specifically, adaptation of the AOA to a binary version that incorporates a mechanism for validating and repairing candidate solutions; Section 6 reports the experimental results as evaluated using Relative Percentage Deviation (RPD) and Coefficient of Variation (CV), presents comparative analyses with reference techniques, and discusses the effect of different combinations of transfer functions and binarization rules and the statistical analysis of the results; Section 7 reports the computational overhead along with our method’s limitations; Section 8 presents the novel elements and main achievements of our proposal; finally, Section 9 presents the conclusions along with future work that could be developed based on the results of this study.
2. Set Covering Problem
The Set Covering Problem is a classical NP-complete combinatorial optimization problem with broad applications in areas such as facility location, scheduling, and resource allocation. A notable case is crew scheduling, where the goal is to select a cost-efficient set of crews to cover all required trips [6]. Due to its computational complexity, the problem is commonly addressed using exact methods for small instances and heuristic or metaheuristic approaches for larger ones [7].
In recent years, nature-inspired metaheuristics such as genetic algorithms, ACO, and hybrid methods have shown strong performance on this problem [8,9]. However, most of these approaches rely on well-established binary frameworks, leaving room to explore newer algorithms that have not yet been fully adapted to discrete spaces.
Formally, the SCP can be defined as follows. Let be a binary matrix of size , where , and let be a non-negative cost vector associated with the n columns. Define as the set of rows and as the set of columns. Each for represents the cost of selecting column j, which covers row i if . The objective is to find a subset such that every row is covered by at least one column and the total cost of the selected columns is minimized. This formulation is commonly referred to as the column-based representation.
subject to
where is a binary decision variable such that if column j is selected in the solution and otherwise. The constraints [10] ensure that each row i is covered by at least one selected column j.
3. Arithmetic Optimization Algorithm
The Arithmetic Optimization Algorithm (AOA) is a population-based metaheuristic introduced by Abualigah et al. [4], which was designed to solve complex optimization problems using a mathematical framework based on the fundamental arithmetic operations: addition, subtraction, multiplication, and division. Unlike many metaheuristics inspired by biological or physical phenomena, the AOA relies exclusively on these operators, which are applied probabilistically to update candidate solutions and balance the exploration and exploitation phases [11].
3.1. Core Components of the Arithmetic Optimization Algorithm
3.1.1. Random Population Initialization
The initial population matrix is defined as , where m is the number of solutions and n is the number of variables. It is initialized randomly and updated in each iteration according to the exploration and exploitation mechanisms of the Arithmetic Optimization Algorithm.
3.1.2. Math Optimizer Accelerated
The Math Optimizer Accelerated (MOA) is an adaptive parameter that controls the balance between exploration and exploitation during the execution of the AOA algorithm, which is recalculated in each iteration. This parameter is typically set high enough to ensure exploration when starting the process, then decreases to ensure exploitation. The MOA is defined by Equation (5). The parameters and are defined in Table 1.
Table 1.
Parameters used in the AOA.
3.1.3. Exploration Phase
The exploration phase is responsible for searching for solutions in different regions of the search space, with the goal of avoiding becoming trapped in local optima. This is accomplished using the multiplication and division operators, according to Equation (6) and conditioned on the MOA value:
3.1.4. Mathematical Optimizer Probability
The Mathematical Optimizer Probability (MP) coefficient is adaptive, being defined in terms of , and works as a control factor that regulates the transition between the exploration and exploitation stages of the optimization process.
3.1.5. Exploitation Phase
The exploitation phase is activated when a random number , marking the transition from global exploration to local improvement. In this stage, the AOA applies the Subtraction and Addition operators using Equation (8) in order to intensify the search around the current optimum, while a stochastic component is introduced in each iteration to preserve some exploration, especially at the beginning and end:
where denotes the j-th component of the current best solution, while denote the upper and lower bounds of dimension j. For each dimension j, are defined as constant values before the optimization process begins. These bounds delimit the feasible search space for each variable , ensuring that the generated solutions remain within problem-specific limits.
3.1.6. Pseudocode of the Original Arithmetic Optimization Algorithm
The AOA Algorithm 1 is presented below. In the exploratory stage, the algorithm applies multiplication and division operations. In the exploitation phase, it employs addition and subtraction to focus the search around the most promising solutions, gradually improving the quality of the results. This balanced dynamic enables the AOA to effectively guide the optimization process, combining global diversification and local convergence within a numerically simple yet highly efficient framework.
| Algorithm 1 Arithmetic Optimization Algorithm. |
|
4. Two-Step Binarization Scheme
This process consists of two main phases:
- (a)
- Transfer phase: The transformation of a real-valued variable into a value within the range . These functions are shown in Table 2
Table 2. Transfer functions (S-shaped and V-shaped); is a probability, while x is the individual’s continuous position. - (b)
- Binarization phase: The mapping of this probabilistic value into a binary value in . These functions are shown in Table 3 and Table 4.
Table 3. Summary of binarization rules (STD, COM, PS, ELIT, ELITR). Transfer function ; random variate .
Table 4. Binarization rules. Transfer function ; .
Though structurally simple, these two components play a critical role in shaping the exploration and exploitation behavior of a binary metaheuristic, and as such have a significant impact on the overall performance of the algorithm [12].
Here, we adopt S-shaped and V-shaped transfer functions together with the binarization rules STD, COM, PS, ELIT and ELITR. This is because they capture two key behaviors: cases where mapping continuous values to binary decisions cases where changing the sign should flip the decision (S-shaped, with ), and cases where the magnitude matters regardless of the sign (V-shaped, as a function of ). Here, d denotes the continuous value prior to binarization in the current iteration t; applying yields a probability in of the binarization rules mapping to a binary decision (0/1). These families are monotone, smooth, bounded, and computationally efficient, allowing the exploration–exploitation balance to be adjusted gradually over iterations. For the SCP, the results of the OR-Library instances support these choices; comparative analyses show that the choice of binarization is crucial to performance [13], while systematic studies report that the binarization rule explains more performance variability than the transfer function family.
where:
| Resulting bit at position j | |
| Current bit of individual w at position j | |
| Bit j of the best-known individual | |
| Continuous value (input to T) associated with | |
| Transfer function returning a probability | |
| Uniform random number in | |
| Threshold used in PS | |
| Elite set of size , with the selected elite index, bit j of elite s, and its fitness |
5. Binary AOA
In this section, we present the Binary Arithmetic Optimization Algorithm (BAOA), an adaptation of the AOA to the binary domain in order to solve the SCP. This design keeps the operation, structure, and parameters of the original algorithm intact, on which basis we incorporate a two-step binarization scheme consisting of transfer functions and binarization rules to transform continuous positions into vectors. In addition, a greedy repair operator guarantees the feasibility of the solutions and eliminates redundancies without altering the logic of the base method. We detail the execution flow by iterations, the alternation between exploration and exploitation, the notation used, and the inputs and outputs of the procedure. This structure allows us to use the search dynamics of the arithmetic algorithm and makes it operational in discrete spaces, preserving its essence and producing viable solutions for the SCP.
The algorithm description is as follows: the BAOA operates in iterations in which the arithmetic core (kept intact) updates the continuous population, determining the degree of exploration or exploitation through its adaptive control. The parameters are the same as in the original algorithm; see Table 1. Then, the binarization procedure transforms these positions into candidates using the Algorithm 2, while the repair operator ensures feasibility and eliminates redundancies using Algorithm 3.
| Algorithm 2 Two-Step Binarization Scheme with Specific Rule. |
|
Where:
| j | Index of the dimension (loop variable), |
| w | Index of the individual (solution) in the population |
| Continuous solution vector | |
| Transfer function that converts the continuous value into a probability | |
| Resulting binary vector |
| Algorithm 3 Greedy Repair Operator for the SCP. |
|
Finally, the cost is evaluated and the best found solution is updated. The inputs to the procedure are the cost and coverage matrix, the population size, the stopping criterion, and the parameters of the original arithmetic algorithm along with the choice of transfer function and binarization rule (outer layers); the output is the best feasible binary solution and its cost. The complete process is presented in Algorithm 4.
| Algorithm 4 Binary Arithmetic Optimization Algorithm with Greedy Repair. |
|
Computational Complexity Analysis
The computational complexity of the BAOA for the SCP is estimated as , where T is the maximum number of iterations, P is the population size, and are the dimensions of the coverage matrix. This estimate encompasses the full procedure: the arithmetic updates over n variables drive the search, binarization is applied to each individual with a per-iteration cost on the order of , and feasibility is evaluated with greedy repair on the problem matrix with a per-iteration cost on the order of . The latter term becomes dominant in combinatorial covering settings, meaning that the binarization cost is absorbed into the final order. Presenting the complexity in this way aligns with metaheuristic studies that explicitly analyze runtime using Big-O notation and detail the dependence on population size, problem dimensionality, and iteration count [14].
6. Experiments Results
In this section, we discuss both the experimental method and the obtained experimental results.
The experiments were executed on a MacBook Pro (15-inch, 2019) with the following specifications:
- Processor: 2.3 GHz 8-core Intel Core i9
- Graphics: Radeon Pro 560X 4 GB + Intel UHD Graphics 630 1536 MB
- Memory: 16 GB DDR4 at 2400 MHz
- Operating System: macOS Sequoia 15.5
6.1. Experimental Methodology
The experimental analysis of the BAOA was conducted using SCP instances from the OR-Library, which are widely used for their diversity in terms of size and complexity. Table 5 presents the considered set of instances, detailing the number of instances, dimensions of the problem (m rows and n columns), cost range, density of the coverage matrix, and availability of optimal solutions. These instances allow us to evaluate the algorithm’s performance in heterogeneous scenarios, from cases with known optimal solutions (sets 4, 5, 6, A, B, C, D, NRE, and NRF) to more challenging problems for which only the best historical solution is available (NRG, NRH, and unicost).
Table 5.
Description of the OR-Library SCP benchmark sets, including the number of instances, problem size, cost range, density, and optimal solution status.
For each instance and experimental configuration, 30 independent runs were performed; this number is considered sufficient to draw reliable statistical conclusions in metaheuristic studies. The BAOA parameters were established according to a previous limited study, with the results shown in Table 5. The evaluation metrics considered here were the value of the best found solution, the RPD (Relative Percentage Deviation) from the known optimum or best historical solution, the total execution time, and the Coefficient of Variation (CV) as a measure of stability [15].
The stopping criterion was identical across all instances, and was defined by the number of evaluations. This approach ensures fairness compared to alternatives such as elapsed time, which are highly dependent on machine performance.
Finally, the obtained results were analyzed using descriptive statistics and compared using nonparametric tests according to the methodology from [9,13], as shown in Figure 1.
Figure 1.
Statistical decision flow used for test selection.
6.2. Parameter Setting
The Table 6 shows the bounded execution of the BAOA on a sample of one instance per set. The execution parameters of the BAOA are the same as in the original version. The results were of good quality when considering RPD as a quality measure; thus, we used them in our experiments while only increasing the population size and number of iterations. The parameter settings are shown in Table 7.
Table 6.
Comprehensive summary of BAOA results per SCP instance set (V3–ELIT) over 100 iterations, as used in the parameter-setting experiments.
Table 7.
BAOA parameter settings.
6.3. Statistical Indicators for Performance Evaluation
In this section, we present the indicators used to measure the quality of our solutions.
- (a)
- Relative Percentage Deviation (RPD). As a quality indicator, we use the average Relative Percentage Deviation (RPD), which measures the proximity of a candidate solution to the known optimum:where is the objective function value returned by the algorithm under evaluation [16] and is the best known or optimal value for the problem instance.
- (b)
- Coefficient of Variation (CV). The CV evaluates the stability of the algorithm over multiple independent runs [16]. It is defined as the ratio between the standard deviation and the mean of the results. A lower CV indicates greater consistency. The equation for CV isThe CV, defined as the ratio between the standard deviation () and the mean (), enables a relative assessment of stability across instances.
6.4. Performance Analysis of the BAOA
This section presents a comprehensive summary of our BAOA’s performance in all benchmark instances. The statistical analysis is shown in Table 8 and Table 9 (Min, Max, Avg, CV, RPD), while Figure 2, Figure 3, Figure 4 and Figure 5 show the most effective combinations of instance-based binarization.
Table 8.
BAOA results on SCP (Group 1). Columns: Opt, Min (best), Max, Avg, CV, and RPD.
Table 9.
BAOA results on SCP (Group 2). Columns: Opt, Min (best), Max, Avg, CV, and RPD.
Figure 2.
RPD heatmap for TF–binarization combinations; lighter cells indicate lower RPD.
Figure 3.
RPD heatmap for TF–binarization combinations; lighter cells indicate lower RPD.
Figure 4.
RPD heatmap for TF–binarization combinations; lighter cells indicate lower RPD.
Figure 5.
RPD heatmap for TF–binarization combinations; lighter cells indicate lower RPD.
We have used heatmaps to evaluate the results of different combinations between transfer functions and binarization rules; the results are as follows.
Figure 2, Figure 3, Figure 4 and Figure 5 show heatmaps of the RPD values obtained from different combinations of transfer functions and binarization strategies for all SCP instance families; we have selected the best RPD results, in this case those below 5%. Lighter shades correspond to lower RPD values, i.e., solutions closer to the optimum.
The results reveal a clear trend, with the S1+elitist, V1+dynamic, and V2+elitist configurations achieving the lowest RPD values in most instances. In general, combinations with V transfer functions perform best. This indicates that elitist strategies are especially effective when combined with transfer functions that properly balance exploration and exploitation.
The results presented in Table 10 and Table 11 show the average RPD, 95% confidence intervals [17], and Coefficient of Variation (CV) obtained by the V3-elitist combination across various SCP instances. Overall, the performance of this configuration shows stable convergence, with RPD values close to the known optima for each instance. The confidence intervals are narrow, indicating consistency across runs, and the CV is low in most instances.
Table 10.
Average RPD results, 95% confidence intervals, and Coefficient of Variation (CV) (Group 1).
Table 11.
Average RPD results, 95% confidence intervals, and Coefficient of Variation (CV) (Group 2).
In Table 12, considering the density of the OR-Library instances (low/medium in the SCP series (e.g., scp41, scp51, scp61) and high in the SCPA–SCPD series), the data show an increase in average execution time as density grows, from low values in scp61 (2.345 s) and scp51 (5.374 s) to the maximum observed in scpd1 (64.522 s). In contrast, the stagnation indicators display heterogeneous variation across families and do not follow a monotonic pattern with respect to density.
Table 12.
Average time: Average seconds per iteration. No-progress iteration: The iteration at which fitness shows no improvement. Stagnation ratio: No-progress iteration/total iterations.
The boxplots in Figure 6 show the distribution of the final RPD values in groups of ten instances. In most cases, the runs exhibit low variability and consistently converge toward near-optimal solutions, although some instances show greater dispersion, reflecting sensitivity to the problem structure. The Figure 7 and Figure 8 show the performance metrics in terms of iteration time and convergence characteristics.
Figure 6.
Boxplots RPD distribution for the elitist rule with the V3 transfer function (scpnrh2–scpnrh5). The blue box shows the interquartile range, the black line the median, the green whiskers the 1.5×IQR limits, and the white circles the outliers.
Figure 7.
BAOA on (V3–ELIT): Time per iteration, XPL–XPT balance (averages), and convergence curve indicating the point from which the fitness no longer improves.
Figure 8.
BAOA on (V3–ELIT): Time per iteration, XPL–XPT balance (averages), and convergence curve indicating the point from which the fitness no longer improves.
6.5. Benchmarking the BAOA with Competing Approaches
We compared our BAOA against recent competitive metaheuristics selected for their strong results in combinatorial optimization, namely, SCA, PSA, GWO, and BGO [18]. SCA provides a simple design with an effective exploration–exploitation balance [19], PSA is physics-inspired and performs well on discrete tasks [20], GWO is widely used in binary settings with robust convergence, and BGO is a recent variant tailored to binary spaces that attains high-quality solutions [21]. The evaluation consistently compared the best cost achieved by each method on standard SCP instances, ensuring fairness and reproducibility. The experimental data were taken from [18]; the data are reported in Table 13 and Table 14.
Table 13.
Comparative performance analysis of the evaluated metaheuristics.
Table 14.
Performance comparison between the BAOA and recent metaheuristics (SCA, PSA, GWO, BGO) on benchmark SCP instances. The table reports the best cost and the RPD (%) for each method; lower values are better for both metrics, highlighting the BAOA’s competitiveness across instances.
Figure 9 presents a comparative performance analysis of the evaluated metaheuristics. It should be noted that this table does not represent a statistical significance test but rather a descriptive comparison of results across problem instances. Three indicators are reported for each algorithm.
Figure 9.
Barchart comparison of the metaheuristics based on three performance indicators.
The results show that the BAOA exhibits the most competitive overall performance among the evaluated metaheuristics. In terms of solution quality, the BAOA achieves the lowest average minimum RPD (1.51), indicating solutions consistently closer to the optimal values. Regarding consistency, the BAOA also obtains the highest number of best instances (21), outperforming SCA (15), PSA (6), GWO (2), and BGO (1). Although its average rank (2.36) is not the lowest, the BAOA maintains a strong relative position across all instances. These findings highlight the BAOA’s ability to combine high-quality solutions with robustness and reliability, positioning it as a competitive alternative to other state-of-the-art approaches.
6.6. Statistical Analysis
In this study, we used the RPD for the statistical analysis instead of absolute cost values. This indicator allows for fair comparisons between problem instances with different cost scales and provides a more consistent and impartial evaluation. Furthermore, the use of the RPD is well established in the optimization and metaheuristic literature, and is suitable for normality assessments such as the Shapiro–Wilk test [22].
We first assessed normality (Shapiro–Wilk and KS–Lilliefors) [23,24] on the paired differences between the BAOA and each metaheuristic. The hypotheses were as follows: , “the data are normal”; and , “the data are not normal”. In most cases ; thus, was rejected. The results of this test are shown in Table 15.
Table 15.
Normality tests (Shapiro–Wilk and Kolmogorov–Smirnov–Lilliefors) applied to the differences between the BAOA and other algorithms (RPD); indicates rejection of normality.
We then proceeded to the second stage, as the normality assumption was rejected in most cases according to the Shapiro–Wilk and Kolmogorov–Smirnov–Lilliefors tests. Therefore, following the methodological flow presented in Figure 1 and considering that the samples were paired across the same problem instances, we applied the Wilcoxon signed-rank [25] test as the appropriate non-parametric alternative to evaluate the significance of performance differences between our BAOA and the competing algorithms. The hypotheses were as follows: , “there are no statistically significant differences between the compared metaheuristics” (that is, both exhibit similar performance in terms of the minimum RPD value); and , “there are statistically significant differences between at least one pair of metaheuristics” (indicating differences in performance in terms of the minimum RPD value).
No statistically significant differences were found between any pair of algorithms, as all p-values were greater than 0.05 the results are show in Table 16. Therefore, the null hypothesis is accepted in all comparisons, indicating statistically comparable performance. These results suggest that additional evaluation criteria such as convergence speed or robustness could provide further insights into the algorithms’ relative effectiveness.
Table 16.
Wilcoxon signed-rank test p-values for the BAOA compared with other algorithms (RPD); a value of ns indicates (no significant difference).
6.7. Conclusions from the Statistical Tests
The Shapiro–Wilk test showed that most RPD distributions were non-normal, justifying the use of non-parametric methods. The Wilcoxon signed-rank test revealed no statistically significant differences between the metaheuristics, as all p-values were above 0.05. Therefore, the null hypothesis could not be rejected. These results suggest that the algorithms have comparable performance based on minimum RPD; thus, further analysis using additional performance metrics is recommended.
7. Analysis of Computational Overhead and Methodological Limitations
We chose V3-ELIT to develop this section due to its ability to balance exploration and exploitation. The V3 (hyperbolic V-shaped) transfer function introduces smooth and controlled transitions in the binarization, while the ELIT rule preserves the best solutions, which in our experiments translated into low RPDs and practically zero variability (mean = best value; deviation around 0) in most instances. On this basis, Figure 10 shows the time per iteration of BAOA V3-ELIT across several SCP instances, revealing two clear phases: a rapid initial overhead, and a subsequent stabilization phase in which times remain nearly constant. The largest instances, such as scpc1 (400 × 4000) and scpd1 (400 × 4000), reach the highest costs, with 30.0 s and 19.0 s per iteration, respectively, while scpa1 (300 × 3000) and scpb1 (300 × 3000) fall in an intermediate range, with 15.0 s and 11.0 s. In contrast, smaller instances such as scp61 (200 × 1000) and scp51 (200 × 2000) require only 4.5 s and 3.3 s per iteration. This pattern confirms that the computational cost increases with problem size; however, once past the initial phase, the time remains stable, reflecting temporal consistency of the algorithm across instances of different scales.
Figure 10.
Summary of iteration times of AOA V3-ELIT across different SCP instances. Two stages can be observed: an initial overhead, and a stabilization phase with nearly constant time.
Across the hardest SCP cases, BAOA V3-ELIT exhibits a short transient followed by stable timing and a strongly exploitative regime. As shown in Figure 11, the iteration time settles ∼18–19 s after the early iterations. In Figure 12, exploration quickly collapses to about 2.12% while exploitation stabilizes near 97.88%, evidencing minimal search volatility and a persistent bias toward exploitation once the algorithm stabilizes.
Figure 11.
Iteration time per iteration for instance SCPd1 under the AOA V3-ELIT configuration. The curve shows two phases: an initial overhead where computation time increases sharply, followed by a stabilization phase where the iteration time remains nearly constant around 18–19 s.
Figure 12.
Exploration (XPL%) and exploitation (XPT%) dynamics for BAOA V3-ELIT in the scpd1 instance. The curves show a sharp decline in exploration during the first iterations, stabilizing below 2.12%, while exploitation dominates above 97.88%.
Table 17 presents the theoretical computational complexity of the BAOA algorithm expressed in terms of population size (P), number of iterations (T), and dimensions of the coverage matrix (m rows and n columns) for the Set Covering Problem (SCP). As discussed previously, this complexity is represented as , which reflects how the expected execution time increases with the size of the problem instances.
Table 17.
Computational complexity of the BAOA per SCP instance set. The general complexity is , with iterations and individuals.
8. Novelty and Contributions
In this paper, we have presented the first binary adaptation of the AOA to solve an important industry problem. In addition, we have introduced a new index that is complementary to the RPD for measuring stability across runs to quantify solution stability; Figure 2 shows this information. The joint analysis of RPD versus CV shows that 35 instances fall into the high quality zone and 27 are distributed across moderate zones, with only three instances falling into the low quality zone. These results reinforce the robustness of our BAOA in terms of solution quality and stability (see Figure 13).
Figure 13.
Summary of the high-quality zone from the RPD vs. CV analysis.
Following the statistical methodology (normality → nonparametric test for paired samples), the applied tests show that the proposed version is competitive with state-of-the-art techniques (SCA, PSA, GWO, BGO), with no statistically significant unfavorable differences.
In terms of computational efficiency, the results show that our BAOA achieves high-quality solutions with reduced execution times. Table 6 reports total times in the range of 3 s to 19 s per iteration, depending on the instance (e.g., ∼3.3 s on scp51 and ∼18–19 s on scpd1). Furthermore, the dynamic analysis of Figure 8 and Figure 9 reveals stable behavior; after an initial overload phase, the time per iteration remains virtually constant throughout the execution.
9. Conclusions and Future Work
In this work, we have presented a new version for solving the SCP based on the Arithmetic Optimization Algorithm metaheuristic and coverage problems in general. Our design preserves the core mechanisms of the original algorithm while introducing a two-step binarization scheme (transfer functions and binarization rules) and a conventional greedy repair operator to ensure feasibility in discrete spaces. We propose the Coefficient of Variation (CV) as an index of stability across runs, enabling comparisons between configurations with similar averages, which complements the RPD index and allows for a precise evaluation of the solution’s performance across quality zones formed by graphically plotting RPD versus CV. Our BAOA exhibits competitive performance across 45 benchmark instances from the OR-Library when compared to state-of-the-art metaheuristics SCA, PSA, GWO, and BGO, achieving a minimum average RPD (1.51%) and a high number of optimal solutions (21 out of 25 in the subset where the optima are known). After using multiple statistical analyses (the Shapiro-Wilk and KS-Lilliefors tests as well as the unpaired Wilcoxon signed-rank test), no statistically significant differences were revealed, with all results yielding p > 0.05. Therefore, there are no significant differences between our BAOA and the aforementioned metaheuristics. While the results of these tests and descriptive comparisons were good, additional improvements can still be applied. Future research could explore alternative combinations of transfer functions and binarization rules, principle-based hybridization, adaptive parameter control, lightweight intensification strategies, and population diversity mechanisms to better balance exploration and exploitation. These guidelines could further improve the quality and efficiency of the solution.
Author Contributions
Conceptualization, H.C., B.C., and F.C.-C.; methodology, B.C., F.C.-C., and G.A.; software, H.C., G.G., F.S.-P., and F.C.-C.; validation, B.C., R.S., F.C.-C., and G.A.; formal analysis, B.C., R.S., and H.C.; investigation, B.C. and G.A.; resources, H.C., F.C.-C., and G.A.; writing—original draft, H.C., F.S.-P., and G.A.; writing—review and editing, B.C., R.S., G.A., G.G., and F.C.-C.; supervision, B.C., R.S., and F.C.-C.; funding acquisition, B.C. All authors have read and agreed to the published version of the manuscript.
Funding
Felipe Cisternas-Caneo is supported by National Agency for Research and Development (ANID)/Scholarship Program/DOCTORADO NACIONAL/2023-21230203.
Data Availability Statement
The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding authors.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| AOA | Arithmetic Optimization Algorithm |
| BAOA | Binary Arithmetic Optimization Algorithm |
| SCP | Set Covering Problem |
| MOA | Mathematical Optimizer Accelerated |
| RPD | Relative Percentage Deviation |
| SCA | Sine Cosine Algorithm |
| PSA | Pendulum Search Algorithm |
| GWO | Grey Wolf Optimizer |
| BGO | Binary Growth Optimizer |
| ACO | Ant Colony Optimization |
| MOP | Mathematical Optimizer Probability |
References
- Karp, R.M. Reducibility among Combinatorial Problems. In Complexity of Computer Computations; Miller, R.E., Thatcher, J.W., Eds.; Springer: Berlin/Heidelberg, Germany, 1972; pp. 85–103. [Google Scholar] [CrossRef]
- Garey, M.R.; Johnson, D.S. Computers and Intractability: A Guide to the Theory of NP-Completeness; W. H. Freeman: New York, NY, USA, 1979. [Google Scholar] [CrossRef]
- Suh, W.H.; Oh, S.; Ahn, C.W. Metaheuristic-based time series clustering for anomaly detection in manufacturing industry. Appl. Intell. 2023, 53, 21723–21742. [Google Scholar] [CrossRef]
- Abualigah, L.; Diabat, A.; Mirjalili, S.; Abd Elaziz, M.; Gandomi, A.H. The arithmetic optimization algorithm. Comput. Methods Appl. Mech. Eng. 2021, 376, 113609. [Google Scholar] [CrossRef]
- Crawford, B.; Soto, R.; Astorga, G.; García, J.; Castro, C.; Paredes, F. Putting Continuous Metaheuristics to Work in Binary Search Spaces. Complexity 2017, 2017, 8404231. [Google Scholar] [CrossRef]
- Wen, X.; Chung, S.H.; Ma, H.L.; Khan, W.A. Airline crew scheduling with sustainability enhancement by data analytics under circular economy. Ann. Oper. Res. 2023, 342, 959–985. [Google Scholar] [CrossRef]
- Beasley, J.E. OR-library: Distributing test problems by electronic mail. J. Oper. Res. Soc. 1990, 41, 1069–1072. [Google Scholar] [CrossRef]
- Blum, C.; Roli, A. Metaheuristics in combinatorial optimization: Overview and conceptual comparison. ACM Comput. Surv. (CSUR) 2003, 35, 268–308. [Google Scholar] [CrossRef]
- Lanza-Gutiérrez, J.M.; Caballé, N.C.; Crawford, B.; Soto, R.; Gómez-Púlido, J.A.; Paredes, F. Exploring Further Advantages in an Alternative Formulation for the Set Covering Problem. Math. Probl. Eng. 2020, 2020, 5473501. [Google Scholar] [CrossRef]
- Carrabs, F.; Cerulli, R.; Mansini, R.; Moreschini, L.; Serra, D. Solving the Set Covering Problem with Conflicts on Sets: A new parallel GRASP. Comput. Oper. Res. 2024, 166, 106620. [Google Scholar] [CrossRef]
- Al-Himyari, B.; Al-khafaji, H. Exploration-Exploitation Tradeoffs in Metaheuristics: A Review. Asian J. Appl. Sci. 2024, 12, 1–27. [Google Scholar] [CrossRef]
- Saremi, S.; Mirjalili, S.; Lewis, A. How important is a transfer function in discrete heuristic algorithms. Neural Comput. Appl. 2014, 26, 625–640. [Google Scholar] [CrossRef]
- Lanza-Gutiérrez, J.M.; Crawford, B.; Soto, R.; Berrios, N.; Gomez-Pulido, J.A.; Paredes, F. Analyzing the effects of binarization techniques when solving the set covering problem through swarm optimization. Expert Syst. Appl. 2017, 70, 67–82. [Google Scholar] [CrossRef]
- Arora, S.; Barak, B. Computational Complexity: A Modern Approach; Cambridge University Press: New York, NY, USA, 2009. [Google Scholar] [CrossRef]
- Ospina, R.; Marmolejo-Ramos, F.; Medina, M. Performance of Some Estimators of Relative Variability. Front. Appl. Math. Stat. 2019, 5, 43. [Google Scholar] [CrossRef]
- Cook, W.J.; Applegate, D.; Bixby, R.E.; Bixby, R.E.; Chvátal, V. The Traveling Salesman Problem: A Computational Study; Princeton University Press: Princeton, NJ, USA, 2007. [Google Scholar] [CrossRef]
- Hazra, A. Using the confidence interval confidently. J. Thorac. Dis. 2017, 9, 4125–4130. [Google Scholar] [CrossRef] [PubMed]
- Leiva, D.; Ramos-Tapia, B.; Crawford, B.; Soto, R.; Cisternas-Caneo, F. A Novel Approach to Combinatorial Problems: Binary Growth Optimizer Algorithm. Biomimetics 2024, 9, 283. [Google Scholar] [CrossRef]
- Mirjalili, S. SCA: A sine cosine algorithm for solving optimization problems. Knowl.-Based Syst. 2016, 96, 120–133. [Google Scholar] [CrossRef]
- Ab. Aziz, N.A.; Ab. Aziz, K. Pendulum Search Algorithm: An Optimization Algorithm Based on Simple Harmonic Motion and Its Application for a Vaccine Distribution Problem. Algorithms 2022, 15, 214. [Google Scholar] [CrossRef]
- Martín-Santamaría, R.; López-Ibáñez, M.; Stützle, T.; Colmenar, J.M. On the automatic generation of metaheuristic algorithms for combinatorial optimization problems. Eur. J. Oper. Res. 2024, 318, 740–751. [Google Scholar] [CrossRef]
- Yap, B.W.; Sim, C.H. Comparisons of various types of normality tests. J. Stat. Comput. Simul. 2011, 81, 2141–2155. [Google Scholar] [CrossRef]
- Shapiro, S.S.; Wilk, M.B. An analysis of variance test for normality (complete samples). Biometrika 1965, 52, 591–611. [Google Scholar] [CrossRef]
- Lilliefors, H.W. On the Kolmogorov-Smirnov test for normality with mean and variance unknown. J. Am. Stat. Assoc. 1967, 62, 399–402. [Google Scholar] [CrossRef]
- Wilcoxon, F. Individual Comparisons by Ranking Methods. Biom. Bull. 1945, 1, 80–83. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).












