A Statistical Comparison of Metaheuristics for Unrelated Parallel Machine Scheduling Problems with Setup Times

: Manufacturing scheduling aims to optimize one or more performance measures by allocating a set of resources to a set of jobs or tasks over a given period of time. It is an area that considers a very important decision-making process for manufacturing and production systems. In this paper, the unrelated parallel machine scheduling problem with machine-dependent and job-sequence-dependent setup times is addressed. This problem involves the scheduling of tasks on unrelated machines with setup times in order to minimize the makespan. The genetic algorithm is used to solve small and large instances of this problem when processing and setup times are balanced (Balanced problems), when processing times are dominant (Dominant P problems), and when setup times are dominant (Dominant S problems). For small instances, most of the values achieved the optimal makespan value, and, when compared to the metaheuristic ant colony optimization (ACOII) algorithm referred to in the literature, it was found that there were no signiﬁcant differences between the two methods. However, in terms of large instances, there were signiﬁcant differences between the optimal makespan obtained by the two methods, revealing overall better performance by the genetic algorithm for Dominant S and Dominant P problems.


Introduction
In the current 4th Industrial Revolution scenario, and with the underlying gradual transition of the use of exponential technologies and high-performance computing, companies, namely industrial ones, must be aware of the need to be able to progressively update their decision support tools, for instance, regarding manufacturing scheduling decision-making.
Thus, for an industrial company to remain successful and competitive, it is necessary to use effective and efficient methods and optimization algorithms, along with optimized production processes, in order to differentiate itself in the current global market. In this sense, the manufacturing scheduling decision support systems and underlying algorithms continue to play a crucial role by enabling manufacturing companies to obtain best-suited manufacturing schedules while avoiding unnecessary order cancellations or delays as well as the best use of production resources and costs; this has been a major concern of many researchers over the last decades.
Production management in an industrial environment faces several challenges due to dynamic changes in production and market volatility. Companies are increasingly looking to shorten delivery times as an economic measure due to increased competitiveness in emerging markets [1]. Industrial problems such as scheduling jobs or tasks are affected by customer requirements due to the variety of orders and speed of delivery [1,2].
Manufacturing scheduling is, therefore, a well-studied subject area and is considered a very important decision-making process for manufacturing and production systems. It aims to optimize one or more performance measures by allocating a set of resources to a set of jobs or tasks over a certain period of time [2].
Scheduling problems occur in different kinds of manufacturing environments, varying from a single stage of operation to multi-stage ones. In parallel machine scheduling problems, there is a need to assign tasks to machines coupled with sequencing problems. The scheduling problems in parallel machine environments have been the object of many studies in the last decades. However, cases on unrelated parallel machines are less investigated, especially when setup times are used [3][4][5][6][7].
Parallel machines are considered unrelated when the processing times of tasks depend on the machines to which they are assigned and when there is no relationship between the speeds of the machines [5]. The setup times are dependent since their values depend on both the work sequence and the configuration time matrix of each machine.
Many authors have considered the use of algorithms, heuristics, and metaheuristics to solve scheduling problems on unrelated parallel machines [3,6,8,9], although not all proposed approaches and underlying algorithms are equally effective and efficient in providing scheduling solutions. Yang et al. in 2022 [10] proposed an elite learning differential particle swarm optimization (DELPSO) in order to improve particle swarm optimization (PSO) for large problems. For this, two examples of guidance were applied to direct the update of each particle. The authors verified that the proposed model obtained better results when compared to the PSO. In another paper [11], a new approach was applied in order to improve the performance of multi-objective particle swarm optimizers (MOPSOs). The strategy used is called hybrid global leader selection (HGLSS), where Pareto dominance and density estimation are analyzed to verify the effectiveness of the proposed model. For this, the performance of the proposed approach was compared with nine multi-objective metaheuristics in solving several benchmark problems. The results showed an overcoming of the proposed algorithm in terms of the modified inverted generational distance indicator. The authors, Das and Suganthan [12], presented a literature review on the evolutionary algorithm (EA). In addition, an overview of works carried out on differential evolution (DE) was conducted, where subjects such as variants, multi-objective applications, constrained optimization problems, and theoretical studies of DE were addressed. Finally, engineering applications in which DE was applied were presented. In Pan et al. [13], a differential evolution (DE) algorithm was presented to solve a permutation flow shop scheduling problem in order to minimize the makespan. DE is a traditional continuous algorithm, and the lowest position value rule was presented in order to convert the continuous vector to a discrete work permutation.
This paper focuses on single-stage scheduling problems occurring in parallel machine environments. It is intended to evaluate the behavior of the genetic algorithm (GA) when applied to the scheduling problem of unrelated parallel machines in order to minimize the makespan of a set of tasks subject to varying setup times. A set of small and large instances of this problem will be used to assess the GA performance when compared to the solutions obtained by the metaheuristic ant colony optimization (ACOII) [8,9]. Then, statistical analysis of these two metaheuristics is performed.
In order to properly put forward the main contributions of this work, this paper is organized as follows. In the second section, a summarized introduction to manufacturing scheduling is provided. In Section 3, the unrelated parallel machine scheduling problem is briefly described. Section 4 presents the guidelines for the computational study to be carried out, the metaheuristic GA, and some implementation details. In Section 5, a comparative statistical analysis of the metaheuristics GA and ACOII is performed for small and large instances of scheduling problems, and the main conclusions and planned future work are summarized in Section 6.

General Overview
Scheduling plays a very important role in the decision-making process to be carried out in different decision-making environments, from manufacturing to information processing, transportation, and distribution, among other kinds of services [2]. In manufacturing, scheduling problems vary from single-stage or operations problems, occurring in single or parallel machine environments, up to multi-stage or operation ones, typically occurring in flow shop (FSP), job shop (JSP), or open shop scheduling problems (OSPs), to mention some of the most well-known [2]. The multi-stage scheduling problems tend to be more complex than the single-stage ones; for instance, FSPs have some complicated variants, namely, considering energy issues and material handling assumptions or stochastic data [14,15].
Manufacturing scheduling implies the allocation of jobs or tasks to resources or vice versa, its sequencing or order in time for being processed, along with the definition of the initial and final processing times for each job or task in a given resource, made available through a given manufacturing environment [16].
In [17], Brucker points out that the scheduling of a production system is dependent on several factors, such as manufacturing orders, available resources, available machines, and the operations to be performed by each one, and there may be processing times that are different from one machine to another one while satisfying and/or optimizing a single or complex performance measure or criteria.
The programming of parallel machines has been a growing research area since the first works carried out in the early 20th century [18]. This type of problem has, since then, received continuous interest from researchers due to its relevance to manufacturing environments [3,[7][8][9].
Allahverdi, in [26], presents a review of scheduling problems. In his work, the scheduling problems are further classified based on the underlying production environment as a single machine, parallel machines, flow shop, job shop, or open shop. It also classifies them according to the consideration and processing of information regarding their inclusion in family sets, besides the characterization of sequence-dependent jobs/tasks or machine configurations, which also affects production times and/or costs, among other characterization parameters that are also further explored in other publications, such as [3,21,22].
Thus, scheduling requires decisions about jobs/tasks and processing resources. The sequencing corresponds to a permutation of jobs/tasks or the order in which they might be processed on each resource or machine. On the other hand, the allocation of resources or machines refers to the choice of which one will process each job or task [27]. The scheduling problem aims to assign tasks to machines and define the periods that each task is processed on each machine in order to minimize and/or maximize an optimization criterion, usually expressed in the form of an objective function intended to be optimized [11]. The Brucker classification system, in [17], uses the nomenclature α|β|γ, initially introduced by [28]. In this nomenclature, the α represents the scheduling classification factors related to the manufacturing environment, which usually include the type of manufacturing system and the number of machines. The manufacturing environment can range from a simple one, such as a single machine, up to more complex ones, occurring in flow shop, job shop, and open shop environments or flexible manufacturing environments, besides other kinds of manufacturing systems, for instance, taking place in different types of parallel machine environments. These manufacturing environments can have different complexity levels, not just according to the nature of the manufacturing systems' configuration and the underlying production flows themselves but further to other, additional characteristics, about a more or less widened set of conditions and constraints imposed in the scheduling problem, expressed by a corresponding β set of factors in the problem classification nomenclature. Moreover, a simple or combined set of performance measures can also be considered and expressed through the γ factor. Since its introduction, this notation has been used and reformulated by several authors, and many classifications have been added as other problems arise [22].

Scheduling Assumptions
The scheduling problem studied in this work occurs in a parallel-machines scheduling environment, where the processing times of the task are expressed through a task-machine tuple, which varies from one machine to another for processing a given task [29,30].
In scheduling problems arising in manufacturing environments, frequently, multiple machines are available for processing a set of tasks, and the processing times are often even more dependent on task sequences. Thus, there is a sequence-dependent setup time (S i,j,k ) whenever, after processing task j, preparation time S i,j,k is required before processing task k. Moreover, when these times are also dependent on the machines, index i is added [31][32][33].
The scheduling problem considers a set of tasks with setup times that are dependent on the machine used and sequence-dependent on the unrelated parallel machines set, with the goal of minimizing the maximum completion time or makespan. In scheduling theory, the makespan (C max ) is defined as the completion time of the final task of a job to be processed (when the job leaves the system). The scheduling problem is thus based on a set of N tasks that must be processed on a machine from a set of M unrelated parallel machines (R M ). The processing time of the tasks depends on the assigned machine, and there is no relationship between these machines as they are unrelated. The setup times are machine-and task-sequence-dependent (S i,j,k ). Each machine has its setup time matrix, and each matrix is different from the others for the remaining unrelated parallel machines.
The problem studied in this work is classified in the literature as R M |S i,j,k |C max . Minimizing the makespan of a scheduling problem with identical parallel machines and sequence-dependent setup times is categorized as NP-hard. Therefore, the most complex problem of unrelated parallel machines is also considered NP-hard.

Review about Scheduling Sequence-Dependent Setups in Unrelated Parallel Machines
The scheduling problem occurring in unrelated parallel machines for processing sequence-dependent setup times is quite important as it can be found in several areas such as the electronics, steel, and textile industries [34][35][36][37].
Kim et al. [35] used the simulated annealing (SA) algorithm to solve a scheduling problem in the electronics industry. In their work, it was possible to conclude that the proposed SA method significantly outperformed a neighborhood search method regarding the total delay of jobs or tasks.
Tang and Wang [36] formulated a scheduling problem for the steel industry as a mixed nonlinear program and proposed the Tabu search (TS) heuristic to obtain satisfactory solutions. The results showed that their model and heuristic performed more efficiently and effectively than other manual planning approaches.
In the textile industry, Gendreau, Laporte, and Guimarães [34] applied a heuristic to the multiprocessor scheduling problem with sequence-dependent setup times; their results showed that their heuristic was faster than a TS-based one but, at the same time, provided solutions of almost similar quality.
Thus, the complexity of the scheduling problem of unrelated parallel machines has led to increased interest in heuristic procedures to find solutions in a reasonable time interval. Kim and Chen [38] proposed four research heuristics for the aforementioned problem. According to the authors, these heuristics can be easily applied to obtain practical production scheduling solutions. Ghirardi and Potts [39] also studied the problem of unrelated parallel machines for minimizing the makespan; the underlying heuristic used was an application of the recovering beam search technique. The computational results allowed them to generate approximate solutions for large instances of problems (up to 50 machines and 1000 jobs) in just a few minutes.
Another heuristic, called Meta-RaPS, was introduced by Rabadi, Moraga, and Al-Salem [7] to minimize the makespan in unrelated parallel machine problems with sequencedependent setup times. The performance of the proposed heuristic was evaluated by comparing its solutions with those obtained by other existing heuristics for the same problem. The results showed that the Meta-RaPS found optimal solutions for small problems and performed better than other existing heuristics for larger problems.
For the same problem and makespan objective function, Arnaout et al. [8] introduced the ant colony optimization (ACO) approach. To evaluate the performance of the ACO, the authors compared their solutions with the ones obtained by other heuristics, for example, solutions based on Tabu search and a partitioning heuristic with those obtained by Meta-RaPS [7]. They concluded that the ACO outperformed the other algorithms.
Arnaout et al. [9] also proposed an improved ACO algorithm (ACOII) and mentioned achieving better performance than the previous version; further, the algorithm had the ability to solve more difficult combinatorial optimization problems by partitioning them into subproblems.
The minimization of the makespan is one of the most studied criterion in the production scheduling literature, whether in parallel or single machines. For example, Woo, Jung, and Kim, in [23], developed a mixed-integer linear programming (MILP) model to find the optimal solution to the scheduling problem in unrelated parallel machines with the aim of minimizing the makespan. They proposed a new rule based on a genetic algorithm with a chromosome that represents the sequence of assignment of jobs or tasks to a machine, with the scheduling of jobs/tasks for each machine being determined by a heuristic based on completion time during the chromosome decoding process.
Considering that the setup times are dependent on the work sequence, the authors of [6] presented a GA for minimizing the makespan when solving scheduling problems in unrelated parallel machines. Their GA algorithm was further compared with other algorithms found in the literature, and they concluded that their proposed GA outperformed existing ones.
More recently, the scheduling of unrelated parallel machines for green manufacturing purposes, with resource constraints, was proposed by Zheng and Wang [40]. This work aimed at minimizing the makespan and total carbon emissions; to solve the problem, a collaborative multi-objective fruit fly optimization algorithm (CMFOA) was proposed. The results showed that their multi-objective algorithm was able to obtain more and better non-dominated solutions than other existing algorithms in comparison.
Aydilek et al. [41] addressed a scheduling problem to minimize order delays, in which the setup times were independent of the processing times, through the application of algorithms of self-adaptive differential evolution and hybrid and simulated insertion algorithms. A scheduling problem with different approaches to setup times, aiming to minimize the makespan with the application of an enhanced version of the ACO algorithm, was studied in [24,25].
Abreu and Prata [42] presented a hybrid GA for solving the unrelated parallel machine scheduling problem with sequence-dependent setup times. A case study on the granite industry is presented, and the proposed approach outperformed three traditional dispatch rules presented in the current literature. Gedik et al. in [43] studied the nonpreemptive unrelated parallel machine scheduling problem with job/task-sequence-and machine-dependent setup times in order to minimize the makespan. Their study provided a novel constraint programming (CP) model with two customized branching strategies that used CP's global constraints, interval decision variables, and domain filtering algorithms. According to the authors, in terms of average solution quality, the computational results indicated that their CP model slightly outperformed their analyzed contributions from state-of-art algorithms in solving small problem instances and was able to prove the optimality of 283 currently best-known solutions. It is also mentioned to be effective in finding good quality feasible solutions for larger problem instances. Fanjul-Peyro et al. in [44] studied the same problem with the same objective function, but they modeled the problem by means of two integer linear programming problems. One was based on a model previously proposed in the literature, and the other was based on packing problems. According to the authors, since their models were unable to solve medium-sized instances to optimality, they proposed three other metaheuristics for each of these two models. Their results showed that the proposed metaheuristics significantly outperformed the mathematical models. Recently, in [45], Arnaout addressed the unrelated parallel machine scheduling problem with setup times when minimizing the makespan through a worm optimization (WO) algorithm. The performance of the WO algorithm was evaluated by comparing its solutions to solutions of six other known metaheuristics.
Considering the previously presented literature review, most of the research addressed the scheduling problem with different objective functions and algorithm applications in different contexts and industrial environments.
The work underlying this paper was motivated by the work carried out by Arnaout et al. [9], where a comparison was made with the solutions obtained by the Meta-RaPS metaheuristic [46]. Based on data for small and large problem instances, this paper aims to propose a genetic algorithm for solving unrelated parallel machine scheduling problems with setup times for minimizing the makespan, C max . Moreover, in this study, we intend to present an extended evaluation of the behavior of the GA when solving different types of case studies (small and large instances) of unrelated parallel machine scheduling problems with setup times.

The Scheduling Problem
This paper addresses the unrelated parallel machine scheduling problem considering the scheduling of N tasks that are available at time zero on M unrelated machines (R M ). The objective function is the makespan C max , considering machine-dependent and sequencedependent setup times S i,j,k . This problem is classified in the literature as R M |S i,j,k |C max , which is a generalization of the P M ||C max problem of identical speeds for processing a set of tasks on the machines [2,47]. The unrelated parallel machine scheduling problem is known to be NP-hard [2] and can be formulated as a mixed-integer linear programming (MILP) model.
In the following, the problem assumptions are described: • M is the number of parallel machines; • N denotes the number of tasks to be scheduled; • Each machine can only process one task at a time without preemption; • For the initial time instant, which is at time zero, all tasks are available. No restrictions of precedence are imposed among tasks; • For each machine i, each task j has a processing time, p i,j ; • For each machine i, for processing tasks j just after tasks k, there is a setup time, S i,j,k . The setup time is different for each machine; • The objective is to minimize the makespan C max . The term span is used to define the completion time of a machine, while the term makespan is used for the maximum span in the solution of the problem.
The mathematical programming model of the considered problem is presented below, which consists of finding an optimal solution to schedule a set of jobs or tasks in a set of unrelated parallel machines regarding the existence of sequence-dependent setup times, a similar model to the one used by Guinet in [8,48]. This MILP model includes binary variables (x i,j,k ∈ {0, 1}) indicating the assignment of tasks to machines and continuous variables denoting the completion times of tasks (C j ≥ 0 and C max ≥ 0).
x i,j,k ∈ {0, 1}, ∀i = 1, . . . , N, ∀j = 1, . . . , N, ∀k = 1, . . . , M where C j : maximum completion time of task j; p j,i : processing time of task j in machine i; S i,j,k : setup time dependent on the processing sequence of task j after task k in machine i; S 0,j,i : setup time for processing first task j on machine i; x k,j,i : 1 if task j is processed immediately after task k in machine i, and 0 otherwise; x 0,j,i : 1 if task j is the first one to be processed in machine i, and 0 otherwise; x j,0,i : 1 if task j is the last task to be processed in machine i, and 0 otherwise; HV: a large positive number (usually denoted by a capital M).
The objective function (1) intends to minimize the makespan, where C max is the length of time that elapses from the start of jobs to the end of the last job. Constraints (2) ensure that each task is only scheduled once and processed by a single machine. Constraints (3) ensure that each task must not be preceded or succeeded by more than one task. Constraint (4) calculates the completion time and ensures that no task precedes or succeeds the same task. Constraint (5) ensures that only one task can be scheduled first on each machine. There is no need for additional constraints to ensure that only one task is scheduled last on each machine because Constraints (5) and (3) guarantee this. Constraints (6) express the makespan C max as a variable that must be larger than any other job's completion time. Constraints (7) guarantee that the decision variable x is binary in all domains. Constraint (8) states that the completion time for dummy work is zero, and Constraint (9) ensures that the completion time is non-negative. Solving the scheduling problem described above enables optimal solutions to be obtained.

Computational Study
This section presents the way that this computational study will be conducted. Firstly, the scheduling data description will be presented. Then, the genetic algorithm will be briefly described, followed by the implementation details considered to achieve the goals of the research.

Scheduling Data Description
The data available on the Scheduling Research Virtual Center page (available online: https://sites.wp.odu.edu/schedulingresearch/wp-content/uploads/sites/99/201 6/01/Rm-Cmax-ACO-Arnaout-2014.xlsx (accessed on 10 May 2021) [49] were used for comparison with the GA. This work is divided into two large groups: first, a comparative analysis of the solutions obtained by GA for small problems, and then, a statistical analysis of the solutions obtained with large problems. The

Implementation Details
In this work, the implementation of the genetic algorithm [50][51][52] to solve the unrelated parallel machine scheduling problem with machine-dependent and job-sequencedependent setup times is addressed. The ga function from MATLAB ® , which implements the genetic algorithm, was used to solve this combinatorial problem. However, an implementation of a permutation to represent the solutions was previously defined in order to use the ga function. GA works with a population of chromosomes that represent the potential solution of the optimization problem and is adequate to tackle combinatorial problems since the constraints can be handled by permutation representations. For this scheduling problem, a solution is represented by a chromosome (a sequence of genes) that is a permutation of size N + M − 1. In this representation, a solution is a sequence of integer values that can occur only once. In a chromosome, the permutation values that are superior to N divide the chromosome into M subsequences that indicate the tasks and the corresponding order assigned to each machine. The genetic operators implemented to guarantee the feasibility of the solutions during the search were the order-based crossover and swap mutation. In the order-based crossover, the genetic material between two chromosomes is combined in order to generate offspring. Two random positions are selected, and the genes between them are swapped. Then, the remaining empty positions are filled with the other parent's genes while preventing repetitions. In the swap mutation, two positions are generated at random, and the genes in a chromosome are swapped. A stochastic uniform selection operator was used to select chromosomes for the application of genetic operators.
The population size and the maximum number of generations of GA were set to 50 (default value) and 15,000, respectively. The crossover fraction was 0.8, i.e., each generation of the order-based crossover was applied to 80% of the population. The swap mutation was applied to the same fraction of the population. GA parameter values were chosen, taking into account the ACOII parameter values used in [9] in order to guarantee a fair performance comparison between the algorithms. Due to the stochastic nature of the GA, 30 independent runs were performed; the value of MaxStallGenerations was 1000. The case studies were run in the MatLab ® R2021a using a 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80 GHz 2.80 GHz.
After obtaining the results using GA, the pandas and scipy.stats libraries [53,54] for Python version 3.8 were used to perform the statistical analysis.
Firstly, data were imported using the pandas' library by applying the read_csv function. Then, the function groupby was performed to identify the minimum value of the makespan, considering the same number of machines, tasks, and instances.
Thereafter, the library scipy.stats was used to implement the shapiro, ttest_rel and wilcoxon functions to evaluate if data followed a normal distribution and to apply a parametric t-test and non-parametric Wilcoxon test for the related samples, respectively.
In the case of data following a normal distribution, a t-test was considered to compare the metaheuristics GA and ACOII; otherwise, the Wilcoxon test was performed. After identifying the combinations of M and N values where there are differences between the two methods, the confidence interval for paired samples was computed, taking into consideration Equation (10) [55].
First, the differences between the makespan of ACOII and GA must be performed, defined as d. Moreover, d and s d are the mean and standard deviation values of d, respectively. Furthermore, t is a quantile from the t-Student distribution, where n is the number of observations and α is the level of significance.
In order to visualize the obtained results, the seaborn library [56] was implemented to display the boxplot graphs using the boxplot function. Boxplots allow us to compare the distribution of the makespan values obtained by each algorithm for the different combinations of M and N.

Comparative Statistical Analysis
In this section, comparative statistical analysis is performed considering the makespan and the execution time obtained by the metaheuristics GA and ACOII when solving small and large instances arising in the scheduling problem on unrelated parallel machines with sequence-dependent setup times.

Small Problems
The comparison between both algorithms, GA and ACOII, was made considering the performance measures given by Equations (11) and (12) to identify which method achieved better results for small and large problems.
For the small instances, the optimal values are known when M = 2 and N ∈ {6, 7, 8, 9}, M = 4 and N ∈ {6, 7, 8}, and M = 6 and N = 8. Note that if the GA method, in the small instances, achieves the optimal value, then γ must be equal to zero. If γ is positive, it means that the proposed method achieved a worse value (greater value in terms of minimization) than the optimal value; otherwise, a better value was found (smaller). In terms of δ values, if it is positive, then GA achieved better results than ACOII; otherwise, ACOII achieved better results than GA. Table 1 presents the average values of γ for the small instances where the optimal value is known. According to the results, in most of them, GA found the optimal value, except for the Dominant S (M = 4, N = 7) and Dominant P problems (M = 4, N = 6). For this Dominant S problem, the γ value is positive (γ = 0.0005), which means the result achieved is close to the optimal value. Furthermore, in the Dominant P problem, the γ value is negative (γ = −0.00034); thus, a better result than the optimal known value is found.
The δ values for the two algorithms are presented in Table 2 for Balanced, Dominant S, and Dominant P problems. In the Balanced instances, there are three cases where GA achieved better results (δ > 0) and one case where ACOII had a better result (δ < 0). In terms of Dominant S instances, there are three cases where GA had better results than ACOII and two cases where ACOII achieved better results than GA. Finally, in the Dominant P instances, there are also three cases where GA had the best performance and one case where ACOII performed better than GA. Overall, the GA method achieved better results when compared to ACOII, although these are very small differences. Table 1. Average γ deviation from optimal solutions for small problems. Hypothesis testing has been used to test if there are statistically significant differences between the two methods (GA and ACOII). Parametric tests were first considered. Therefore, the underlying assumptions were tested, namely, the normality of data. If so, then the parametric t-test for comparing the means of two related samples was computed. Otherwise, a non-parametric Wilcoxon test was used. The hypotheses to be tested are: H 0 : There are no differences between the GA and ACOII results.
H 1 : There are differences between the GA and ACOII results.
Considering the level of significance as 5%, if the p-value is greater than 0.05, it means that the null hypothesis (H 0 ) is not rejected. Otherwise, it is rejected.
For the small instances, the hypothesis test was performed for M and N values that presented some differences between the GA and ACOII methods (see Table 2). The p-values and the 95% confidence intervals (CI) are shown in Table 3. According to the results achieved, all the p-values were greater than 0.05, considering a significance level of 5%. Thus, the null hypothesis (H 0 ) is not rejected, and consequently, there are no statistically significant differences between GA and ACOII results for small instances. The same conclusion can be drawn using the confidence interval since the value zero belongs to all confidence intervals. In other words, the mean difference between the average δ deviations obtained for GA and ACOII can be equal to zero. Table 4 shows the average execution times, in seconds, for each type of small instance, where the time increases as the number of concurrent machines and the number of tasks increase. It is possible to observe that for the Dominant P problems, a higher average execution time is required when the number of machines is 8. On the contrary, for the Dominant S problems, the lowest average execution time is observed for the number of machines equal to 2.

Large Problems
The same analysis was reproduced for large problems to compare GA and ACOII performances. The computed values for the average makespan deviation from GA (δ) are presented in Table 5. For the Balanced instances, negative values were obtained for all instances, which means that the ACOII method has better performance than GA. However, GA has better results in Dominant P and Dominant S instances, even though ACOII presents better results in Dominant P instances when the number of machines is equal to 12 and in Dominant S instances when M = 12 and N = 40.
Parametric and non-parametric tests were applied to statistically prove if there are significant differences between the average δ deviations achieved by the GA and ACOII algorithms for large instances. The hypotheses to be tested were the same as the ones previously defined for small instances. In terms of results, all p-values were less than 0.05; therefore, for the large instances, the null hypothesis is rejected, and it is possible to conclude that there are significant differences between GA and ACOII results considering the significance level of 5%. Furthermore, it is also possible to conclude that there are significant differences between the two algorithms for the significance level of 0.1% since all p-values are less than 0.001. The next step was to compute the mean and median C max values for each algorithm to identify which one achieved better results. For example, considering M = 12 and N = 40, in Dominant S instances, the mean values were 748.33 and 737.60, and the median values were 749.00 and 739.00 for GA and ACOII, respectively. Thereby, in this case, ACOII performed better than GA since it achieved lower mean and median C max values. This analysis was conducted for all combinations of M and N. Overall, for Balanced instances, the ACOII results were significantly better than the GA results. On the other hand, for the Dominant P and Dominant S instances, the GA algorithm obtained better results than ACOII, except when M = 12 and N = 40 for Dominant S and M = 12 for Dominant P, where ACOII achieved better results.  Figures 1-3 for Balanced, Dominant S, and Dominant P instances, respectively. Using this visualization, it is more perceptible that when the problem becomes more complex (that is, the number of tasks increases), the makespan value also increases. It can also be concluded that most of the GA results are better than the ACOII method.
In Table 6, the average execution time, in seconds, for the genetic algorithm when solving Balanced, Dominant P, and Dominant S large problems is presented. It can be seen that as the number of tasks increases for each number of concurrent machines, the execution time also increases. The lowest average execution time value is observed for problems with six machines. Conversely, the highest average execution time value is obtained for problems with four concurrent machines. It is also observed that the highest average execution time occurred for the Balanced problem with M = 2 and N = 120. For the Dominant P problem with M = 2 and N = 20, the shortest average execution time was obtained.

Conclusions
Decision support systems are a much-needed factor for industries worldwide. For this, raw material production scheduling and mathematical algorithms are widely used methods to help companies make the best decisions. These decisions avoid unnecessary order cancellations or delays and help to better manage product production costs. In parallel machine scheduling problems, there is a need to assign tasks to machines, along with sequencing problems. In this article, a scheduling problem was developed in an environment of unrelated parallel machines. The machines are considered unrelated when the processing times of tasks depend on the machines to which they are assigned and when there is no relationship between the speeds of the machines. Two types of problems were applied, small and large problems (Balanced, Dominant S, and Dominant P), using the genetic algorithm to minimize the makespan.
In terms of small problems, in general, the proposed approach obtained good results in terms of the makespan, achieving the known optimal value. For Dominant S (M = 4, N = 7) and Dominant P (M = 4, N = 6) problems, the optimal solution was not found, despite the makespan of the solutions found being very close to the optimal value. Moreover, a new comparison was made with the ACOII method, and there are nine values where the genetic algorithm achieved better results and only four values where ACOII performed better. However, after applying hypothesis testing, there were no significant differences between GA and ACOII results, considering a level of confidence of 95%. The average execution time was the longest for the Dominant S problems when M = 8 and N = 11. However, the shortest average execution time was obtained for Dominant P problems for M = 2 and N = 6.
For large problems, the optimal solution is unknown, and thus, a comparison was made between the solutions obtained by the genetic algorithm and the ACOII method. For the Balanced problems, the genetic algorithm exhibited the worst performance. On the contrary, for both Dominant S and Dominant P problems, the genetic algorithm performed better, except when M = 12 and N = 40 and for M = 12, respectively. Furthermore, it was proved that there are significant differences between the average makespan values of the genetic algorithm and the ACOII methods. Moreover, it was also possible to observe that the highest average execution time occurred for the Balanced problem with M = 2 and N = 120. For the Dominant P problem with M = 2 and N = 20, the shortest average execution time was obtained.
In conclusion, GA has the ability to effectively solve small and large scheduling problems when minimizing the makespan of unrelated parallel machines with sequencedependent setup times; better performance for the GA was observed in the comparative statistical analysis between the two metaheuristics, especially for the large problems.
In the future, we intend to study the effect of using different chromosome representations and genetic operators in GA performance and to consider other objectives such as completion time and tardiness.