1. Introduction
Organisations from different sectors and domains often face problems with multiple conflicting objectives to optimise. The scientific literature classifies these problems according to the number of objectives as Multiobjective Optimisation Problems (MOPs) for problems with 2–4 objectives and Manyobjective Optimisation Problems (MaOPs) for problems with more than four objectives. Typically, MOPs and MaOPs are addressed through the socalled a posteriori approach, which consists of the following two phases:
Metaheuristic algorithms are promising alternatives to address Phase 1. Multiobjective Evolutionary Algorithms (MOEAs) and Multiobjective Swarm Intelligence Algorithms (MOSIAs) are quite popular because their application does not demand particular mathematical properties on the objective functions, the geometry of the Pareto frontier or the constraints of the problem [
1,
2,
3].
Even though MOEAs and MOSIAs have been widely used in addressing MOPs, most of them cannot adequately approximate the Pareto frontier for MaOPS. They have to cope with severe difficulties as the number of objectives increases; remarkably, the number of dominance resistant solutions, the high cost to ensure diversity, and the low effectiveness of the genetic operators. Studies on the difficulties and challenges found by MOEAs in addressing MaOPs are discussed by Bechikh et al. [
4], López Jaimes and Coello Coello [
5], Sudeng and Wattanapongsakorn [
6], and Ikeda et al. [
7]. One of the main criticisms of the a posteriori approaches is that they require reaching a sufficiently representative approximation of the complete Pareto front before the multicriteria decision making.
Even supposing a good approximation of the Pareto frontier, the DM has to solve a multicriteria selection problem in this set to address Phase 2. An alternative is to make a heuristic selection; the DM is supposed to consistently compare solutions on the approximated Pareto frontier until the best compromise is identified. This task is likely to become pretty hard and impractical in problems with many objective functions because of the human mind cognitive limitations (as stated by Miller [
8]). Another alternative is to apply a MultiCriteria Decision Analysis (MCDA) method that articulates the preferences of the DM. Again, this approach assumes that the solution set effectively contains the most preferred solutions, which is questionable under the presence of many objective functions.
As a consequence of the above discussion, there has been an increasing interest in combining MOEAs and MCDA methods in recent years. One way is to consider the preferences to bias the search toward the Region of Interest (RoI). The RoI is the region of the Pareto frontier with the solutions that best match the DM’s preferences; accordingly, the best compromise is a solution belonging to the RoI.
The preference incorporation requires considering nontrivial aspects such as defining the model of the DM’s preferences, characterising the RoI, and determining the relevance of the solutions [
9]. Most methods for preference incorporation admit at least one of the following kinds of information on the preferences [
10,
11]: weights, e.g., [
12]; ranking of solutions, e.g., [
13]; ranking of objective functions, e.g., [
14]; reference points, e.g., [
15]; tradeoffs between objective functions, e.g., [
16]; desirability thresholds, e.g., [
17]; solution classification, e.g., [
18]; and pairwise comparisons based on preference relations, e.g., [
19,
20].
The two alternatives to incorporate the above strategies are the interactive approach and the a priori approach, and both have advantages over the a posteriori one. On the one hand, better solutions are found because there is an increment in the selective pressure toward the RoI [
21]. On the other hand, preference incorporation can alleviate the DM’s cognitive effort to select the best compromise because the number of candidate solutions is relatively short. Still, the interactive approaches are strongly criticised because they require preference relations with full comparability and transitivity (as the dimensionality increases, these properties become unlikely) [
22]. Contrastingly, the a priori preference incorporation does not mandatorily require such properties. However, it demands a model that reflects the DM’s preferences about the solutions.
In many realworld MaOPs, the models should support imprecision and vagueness in the DM’s preferences. For instance, if the DM is a heterogeneous group (e.g., a board of directors) or an illdefined entity (e.g., a community in social networks). In those circumstances, the task of eliciting the parameters of a preference model is highly difficult and is only reachable with some level of imprecision. If such imperfect knowledge is not considered, the best compromise could hardly be identified among the existing alternatives. Interval mathematics is a straightforward but effective way to express imprecision [
23].
Fernandez et al. [
24] introduced an extension of the outranking method by incorporating interval numbers in the preference parameters. This MCDA method can handle incomparability, veto situations, and nontransitive preferences. These properties become critical to address realworld MaOPs because many DMs have nontransitive and noncompensatory preferences. Additionally, the DM feels more comfortable eliciting the values of the parameters as interval numbers than as precise values. If the DM cannot (or not want to) directly give a value for some required parameters, they may use an indirect elicitation method to infer them, e.g., [
25].
In light of the above discussions, this paper presents a further analysis to observe how incorporating interval outranking in MOSIAs impacts the performance. We propose embedding multicriteria ordinal classification based on interval outranking (strictly speaking, the INTERCLASSnC method [
26]) into manyobjective optimisation algorithms. Similar to previous approaches based on ordinal classification, e.g., [
11], our proposal also requires representative samples of solutions classified by the DM as ‘Satisfactory’ or ‘Dissatisfactory’. However, we extended this notion with two artificial classes (for internal use of the algorithms) that increase the selective pressure toward the RoI. By applying this strategy, we introduce the a priori versions of two relevant a posteriori algorithms based on swarm intelligence; specifically, Multiobjective Grey Wolf Optimisation (MOGWO) [
27] and Indicatorbased Multiobjective Ant Colony Optimisation for continuous domains (
${\mathrm{iMOACO}}_{\mathbb{R}}$) [
28]. The a priori preference incorporation significantly increased the performance of the MOSIAs according to a nonparametric test (Mann–Whitney–Wilcoxon).
The remainder of this paper is organised as follows.
Section 2 includes some preliminaries on multiobjective optimisation, interval outranking, ordinal classification, and the MOSIAs taken as baseline.
Section 3 details the proposed algorithms with preference incorporation.
Section 4 shows the experimental results. Lastly,
Section 5 discusses the conclusions and provides some directions for future research.
2. Background
This section presents an overview of the theoretical foundations.
Section 2.1 presents some preliminaries on optimisation with multiple objectives.
Section 2.2 briefly describes the baseline versions of the MOSIAs used in this paper. Lastly,
Section 2.3 presents the model for multicriteria ordinal classification based on interval outranking.
2.1. Preliminaries on MultiObjective Optimisation
Optimisation refers to finding the values in the decision variables (independent variables) that provoke extreme values of one or more objective functions (dependent variables). It is called ‘monoobjective’ optimisation if a single function is treated. In contrast, it is called ‘multiobjective’ optimisation if a few objective functions are treated (typically, up to four). Farina and Amato [
29] recognised that most MOEAs are severely affected when they address problems with more than four objective functions, named them ‘manyobjective’ optimisation problems.
Realworld applications often involve optimising several functions that are essentially conflicting [
30]. As a consequence, no point is simultaneously optimal in all objective functions.
Here,
$x=\u2329{x}_{1},{x}_{2},{x}_{3},\dots ,{x}_{n}\u232a$ represents a solution of a MOP/MaOP: a vector of decision variables that optimises a vector function
$f\left(x\right)$ whose components represent the values of the objectives. Equation (
1) defines
$f\left(x\right)$, where
m is the number of objectives (dimensionality of the problem), and
n is the number of decision variables. Applied optimisation models usually add constraints to Equation (
1) to reflect real situations.
Pareto dominance is widely accepted to compare two solutions, determining which of them is better. Thus, Pareto dominance discriminates between solutions by comparing their
$f\left(x\right)$. Without loss of generality, let us consider minimising the
m objectives; the Pareto dominance relation, represented by the symbol ≼, may be expressed as [
31]
The nondominated solutions make up the Pareto set, expressed as
where
${R}_{F}$ is the feasible region.
The Pareto frontier, $PF=\left\{f\left(x\right):x\in PS\right\}$, is the image of the Pareto set. In the absence of information on the preferences of the DM, a sufficiently representative sample of the Pareto frontier should be calculated.
Identifying a set of Pareto efficient solutions is indeed necessary to solve MOPs and MaOPS. Still, it is not sufficient since the DM must select the best compromise (the solution to implement). The DM chooses the best compromise according to their personal preferences about the objective functions. In practice, the best compromise is the ultimate solution to the problem.
2.2. An Overview of Two Swarm Intelligence Algorithms to Address MaOPs
In this section, we provide a brief description of MOGWO [
27] and
${\mathrm{iMOACO}}_{\mathbb{R}}$ [
28].
2.2.1. MultiObjective Grey Wolf Optimisation
Mirjalili et al. [
27] proposed MOGWO—MultiObjective Grey Wolf Optimiser—which extends the monoobjective algorithm Grey Wolf Optimiser (GWO) [
32] to treat multiple objectives. This swarm intelligence algorithm is inspired by nature, specifically by the behaviour of grey wolves in tracking and hunting their prey. By analogy, the solution with the best value in the objective function is named the
$\mathsf{\alpha}$ wolf. The secondbest and the thirdbest solutions are named
$\mathsf{\beta}$ and
$\mathsf{\delta}$ wolves, respectively. The remaining solutions are known as Ω wolves. The leaders (
$\mathsf{\alpha}$,
$\mathsf{\beta}$, and
$\mathsf{\delta}$) guide the optimisation process, and the Ω wolves follow the leaders in the search for the global optimum.
Let
n be the number of decision variables,
$\iota $ be the number of the current iteration,
${x}_{i}^{\iota}=\u2329{x}_{i,1}^{\iota},{x}_{i,2}^{\iota},{x}_{i,3}^{\iota},\dots ,{x}_{i,n}^{\iota}\u232a$ be the
ndimensional location point of the
ith wolf during the
$\iota $th iteration, and
${x}^{\iota}=\u2329{x}_{1}^{\iota},{x}_{2}^{\iota},{x}_{3}^{\iota},\dots ,{x}_{\hslash}^{\iota}\u232a$ be the positions of a pack with
ℏ wolves. The following equation simulates how the Ω wolves are relocated to siege their prey during hunt following the social leadership:
where
${\mathfrak{x}}_{1}^{\iota}$,
${\mathfrak{x}}_{2}^{\iota}$, and
${\mathfrak{x}}_{3}^{\iota}$ are
ndimensional vectors reflecting the influence of the three leader wolves during the
$\iota $th iteration. Each component of them is calculated as
In Equations (
4)–(
7),
${x}_{\mathsf{\alpha},j}$,
${x}_{\mathsf{\beta},j}$ and
${x}_{\mathsf{\delta},j}$ are the positions of the leader wolves at the
jth coordinate,
$\iota +1$ is the number of the next iteration, and
A and
${D}_{\mathsf{\alpha}}$ are coefficient vectors modelling the encircling behaviour of wolves as
and
In Equations (
8) and (
9),
a is an
ndimensional vector whose elements linearly decrease from two to zero throughout the run of the algorithm, and
${r}_{1}$ and
${r}_{2}$ are random
ndimensional vectors with values in
$[0,1]$. The components of
${D}_{\mathsf{\beta}}$ and
${D}_{\mathsf{\delta}}$ are similarly calculated as those of
${D}_{\mathsf{\alpha}}$ in Equation (
9).
Exploration is promoted by
A with values greater than one (or less than –1); with that setting, the Ω wolves diverge from the leaders.
C is another component of GWO that favours exploration, whose components represent weights and are generated at random to emphasise (
${C}_{j}>1$) or deemphasise (
${C}_{j}<1$) the influence of the
$\mathsf{\alpha}$,
$\mathsf{\beta}$ and
$\mathsf{\delta}$ wolves in defining the distance in Equations (
5)–(
7). According to Mirjalili et al. [
27], p. 109: “
C is not linearly decreased in contrast to
A. The
C parameter was deliberately required to provide random values at all times in order to emphasise exploration not only during initial iterations but also final iterations”.
GWO exploits the search space if ${A}_{j}<1$ because, when the components of A are in $[1,1]$, the position of the ith wolf in the next iteration will be located between its current position and the position of the leader. This setting assists Ω wolves to converge toward an estimated position of their prey, provided by ${x}_{\mathsf{\alpha}}$, ${x}_{\mathsf{\beta}}$ and ${x}_{\mathsf{\delta}}$.
GWO starts the optimisation process by generating solutions at random during the first population. Then, the three best solutions so far are considered as the
$\mathsf{\alpha}$,
$\mathsf{\beta}$ and
$\mathsf{\delta}$ wolves. In the next iteration, each Ω wolf updates its position by applying Equations (
4)–(
7). Simultaneously, the components of
a are linearly decreased in each iteration. Therefore, the pack of wolves tends to widely explore the search space during the first iterations and intensively exploit it during the last iterations. The algorithm stops when a maximum number of iterations is reached, and
${x}_{\mathsf{\alpha}}$ is returned as the best solution obtained throughout the optimisation process.
To treat MOPs via GWO, MOGWO integrates the following two extensions into GWO: (i) an archive with the Pareto optimal solutions obtained so far, and (ii) a criterion for picking the leaders (${x}_{\mathsf{\alpha}}$, ${x}_{\mathsf{\beta}}$ and ${x}_{\mathsf{\delta}}$) from the archive.
If the archive is full of nondominated solutions (note that there is a predefined size) and there is a new solution to be entered; then, a grid technique determines the region in the Pareto frontier that is the most crowded by the archive. A solution is removed from this region at random; then, the new solution may be added to the archive. Thus, the probability of a solution being deleted is proportional to the number of points in each hypercube (region).
Complementarily, the criterion to select
${x}_{\mathsf{\alpha}}$,
${x}_{\mathsf{\beta}}$ and
${x}_{\mathsf{\delta}}$ favours the least crowded regions in the Pareto front. The selection is based on a roulettewheel method with the following probability for each hypercube:
where
$\mathcal{C}$ is a constant (
$\mathcal{C}\u2a7e1$), and
${N}_{l}$ is the number of solutions in the
lth hypercube. The two extensions of MOGWO jointly promote the representativeness in the sample of the Pareto frontier.
2.2.2. IndicatorBased MultiObjective Ant Colony Optimisation for Continuous Domains
Socha and Dorigo [
33] proposed
${\mathrm{ACO}}_{\mathbb{R}}$, an extension of Ant Colony Optimisation (ACO) [
34] to optimise monoobjective problems with continuous decision variables. In
${\mathrm{ACO}}_{\mathbb{R}}$, the pheromone matrix (
$\tau $) is an archive that stores the bestsofar solutions. Here, a vector
${x}_{l}=\u2329{x}_{l,1},{x}_{l,2},{x}_{l,3},\dots ,{x}_{l,n}\u232a$ represents a solution of a problem with
n decision variables, and
$f\left({x}_{l}\right)$ is the objective function to minimise. The
$\kappa $ bestevaluated solutions are stored in
$\tau $, following the implicit order given by
$f\left({x}_{l}\right)$. The position in
$\tau $ of each
${x}_{l}$ determines a weight (
${\omega}_{l}$) that measures the quality of
${x}_{l}$, defined in Equation (
11).
Equation (
11) defines
${\omega}_{l}$ as a value of the Gaussian function with standard deviation
$\varsigma \xb7\kappa $, mean 1.0, and argument
l. Here,
$\kappa $ is the number of solutions in
$\tau $, and
$\varsigma $ is a parameter (
$0\u2a7d\varsigma \u2a7d1$). The effect of
$\varsigma $ is to establish the proper balance between the pressures exerted by the bestsofar solution (with values close to one) and the iterationbest solutions (with values close to zero).
Furthermore, a Gaussian kernel (
${G}_{j}$) is calculated for each decision variable
$(1\u2a7dj\u2a7dn)$ as
where
According to Equation (
13),
${g}_{l}^{j}\left(x\right)$ is a normal distribution, where
${s}_{l}^{j}$ is the standard deviation, and
${x}_{l,j}$ is the mean. The former is calculated in each iteration as ants construct solutions. Then,
${G}_{j}$ in Equation (
12) is a weighted sum of
${g}_{l}^{j}\left(x\right)\phantom{\rule{0.277778em}{0ex}}\forall l\in \{1,2,3,\dots \kappa \}$, which defines the onedimensional Gaussian function for the
jth decision variable of the
lth solution in
$\tau $.
Ants construct solutions by performing
n steps. The
ith ant sets the value for the variable
${x}_{i,j}$ at the
jth step. Here, only
${G}_{j}$—the resulting Gaussian kernel—is needed. Then, the weights
${\omega}_{l}$ are computed through Equation (
11) and used to sample, by following two phases:
2.3. MultiCriteria Ordinal Classification Based on Interval Outranking
The notion behind outranking is that the credibility of the proposition ‘
x is at least as good as
y’—represented as
$\sigma (x,y)$—may be calculated by analysing each pair of their criteria scores [
35]. ELECTRE is the most representative MCDA method of the outranking approaches. Typically, ELECTRE defines
$\sigma (x,y)=c(x,y)\xb7d(x,y)$, where
$c(x,y)$, the concordance index, cumulates the weights of the criteria in favour of the statement ‘x is at least as good as y’; and
$d(x,y)$, the discordance index, assesses the combined strength of the criteria against ‘x is at least as good as y’.
Both indexes are calculated in function of a series of parameters that must be appropriately inferred so that $\sigma $ models the preferences of the DM. Interval outranking generalises the classic outranking to the framework of interval numbers, providing support when the values of the preference parameters are imprecisely known.
If the reader is be unfamiliar with interval mathematics,
Appendix A compiles the basic notions to understand interval outranking. Note that interval numbers in this paper are written in boldface italic letters.
Let O be a set of alternatives (solutions). Each $x\in O$ is evaluated with an mdimensional objective function $f\left(x\right)=\u2329{f}_{1}\left(x\right),{f}_{2}\left(x\right),{f}_{3}\left(x\right),\dots ,{f}_{m}\left(x\right)\u232a$. Without loss of generality, we suppose that each ${f}_{k}\left(x\right)$ is a minimising objective and, consequently, the preference of the DM increases as the value of ${f}_{k}\left(x\right)$ decreases. The parameters of the outranking model are:
The vector of weights, ${\mathit{w}}_{\mathit{k}}=\left[\underline{{w}_{k}},\overline{{w}_{k}}\right]\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\}$, where $\sum _{k=1}^{m}\underline{{w}_{k}}\u2a7d1$ and $\sum _{k=1}^{m}\overline{{w}_{k}}\u2a7e1$;
The vector of veto thresholds, ${\mathit{v}}_{\mathit{k}}=\left[\underline{{v}_{k}},\overline{{v}_{k}}\right]\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\}$;
The majority threshold, $\mathit{\lambda}=\left[\underline{\lambda},\overline{\lambda}\right]$, where $0.5\u2a7d\underline{\lambda}\u2a7d\overline{\lambda}\u2a7d1$; and
The credibility threshold, $\beta $, where $0.5\u2a7d\beta \u2a7d1$.
The concordance coalition of two solutions
$(x,y)$, denoted as
${C}_{x,y}=\{k\in \{1,2,3,\dots ,m\}:$$P\left({f}_{k}\left(y\right)\u2a7e{f}_{k}\left(x\right)\right)\u2a7e0.5\}$, is the subset of objectives favouring the statement ‘
x is at least as good as
y’. The concordance index for the statement ‘
x is at least as good as
y’ is the interval number
$\mathit{c}(\mathit{x},\mathit{y})=\left[\underline{c(x,y)},\overline{c(x,y)}\right]$, defined as
and
Moreover, the discordance coalition consists of the objectives that are not in the concordance coalition, defined as
${D}_{x,y}=\left\{1,2,3,\dots ,m\right\}\backslash {C}_{x,y}$. These objectives justify arguments invalidating the outranking relation ‘
x is at least as good as
y’. The value
$P\left({f}_{k}\left(x\right){f}_{k}\left(y\right)\u2a7e{\mathit{v}}_{\mathit{k}}\right)$ models the degree of credibility of the proposition ‘the
kth criterion alone vetoes the statement
x outranks y’. The discordance index,
$d(x,y)$, is calculated appraising the credibility of veto of each objective in
${D}_{x,y}$, and is expressed as
Accordingly,
$\sigma (x,y)$ is redefined as
By following Equation (
20), Fernandez et al. [
24] introduced a pair of binary preference relations. First, the crisp outranking relation (
$\mathrm{S}$) is presented in Equation (
21), where
$\beta $ is a threshold on the credibility of ‘
x is at least as good as
y’. Lastly, Equation (
22) presents the crisp relation ‘
x is preferred to
y’.
Fernandez et al. [
26] proposed INTERCLASSnC, an extension of ELECTRE TRInC [
36] in the framework of interval outranking to multicriteria ordinal classification. INTERCLASSnC is applicable in circumstances in which other MCDA methods fail, specifically when the DM does not want to (or cannot) set precise values for the model parameters (majority threshold, veto thresholds, and weights). INTERCLASSnC is especially advantageous if the DM has only a vague idea about the boundaries between adjacent classes; nonetheless, they can quickly identify one representative solution at least in each category, which may be characterised by intervals.
INTERCLASSnC considers the array of classes
$C=\u2329{C}_{1},{C}_{2},{C}_{3},\dots ,{C}_{\varpi}\u232a$, increasingly sorted by preference (
$\varpi \u2a7e2$), and the subset of reference solutions introduced to characterise each
${C}_{\ell}$,
${R}_{\ell}=\left\{{r}_{\ell ,\mathsf{\U0001d6a5}}\right\}$ (where
$1\u2a7d\mathsf{\U0001d6a5}\u2a7d\left{R}_{\ell}\right$, and
$1\u2a7d\ell \u2a7d\varpi $). Additionally,
$\u2329{r}_{0},{R}_{1},{R}_{2},{R}_{3},\dots ,{R}_{\varpi},{r}_{\varpi +1}\u232a$ is the array of all reference solutions sorted by preference, where
${r}_{0}$ is the antiideal point and
${r}_{\varpi +1}$ is the ideal point. Following the interval outranking, the following condition is true for
$1\u2a7d\ell <\varpi $:
Then, Equations (
24) and (
25) are used to define the categorical credibility indices between an alternative
x and the category
${C}_{\ell}$.
The crisp relation of interval outranking is extended to compare actions with the sets of characteristic actions as
$x\mathsf{S}{R}_{\ell}=\left\{\left(x,{R}_{\ell}\right):\vartheta \left(x,{R}_{\ell}\right)\u2a7e\beta \right\}$;
${R}_{\ell}\mathsf{S}x=\left\{\left({R}_{\ell},x\right):\vartheta \left({R}_{\ell},x\right)\u2a7e\beta \right\}$.
To suggest a class for a new alternative x, INTERCLASSnC uses the selection function ${S}_{f}(x,{R}_{\ell})=min\left\{\vartheta \left(x,{R}_{\ell}\right),\vartheta \left({R}_{\ell},x\right)\right\}$ and two heuristics—the ascending rule and the descending rule—that are conjointly used. Each one of these heuristics proposes a class for an alternative x. If the classes do not coincide, they define a range of assignments for x (any category within such a range is admissible as the class of x).
The steps of the ascending rule are the following
Compare x to ${R}_{\ell}$ for $\ell =1,2,3,\dots ,\varpi +1$ until the first ℓ such that ${R}_{\ell}\mathsf{S}x$;
If $\ell =1$, select ${C}_{1}$ as a possible class for x;
If $1<\ell <\varpi +1$, select ${C}_{\ell}$ as a possible class for x if ${S}_{f}(x,{R}_{\ell})\u2a7e{S}_{f}(x,{R}_{\ell 1})$; otherwise, select ${C}_{\ell 1}$.
If $\ell =\varpi +1$, select ${C}_{\varpi}$ as a possible class for x.
The descending rule has the following steps
Compare x to ${R}_{\ell}$ for $\ell =\varpi ,\varpi 1,\varpi 2,\dots ,0$ until the first ℓ such that $x\mathsf{S}{R}_{\ell}$;
If $\ell =\varpi $, select ${C}_{\varpi}$ as a possible class for x;
If $0<\ell <\varpi $, select ${C}_{\ell}$ as a possible class for x if ${S}_{f}(x,{R}_{\ell})\u2a7e{S}_{f}(x,{R}_{\ell +1})$; otherwise, select ${C}_{\ell +1}$.
If $\ell =0$, select ${C}_{1}$ as a possible class for x.
3. Proposed Algorithms
In this section, we describe how ordinal classification based on interval outranking was embedded in MOGWO and
${\mathrm{iMOACO}}_{\mathbb{R}}$ to incorporate the preferences of the DM. The primary strategy is to measure the quality of the solutions in terms of the class suggested by INTERCLASSnC. Consequently, the selective pressure increases toward the solutions the DM is highly satisfied with. Note that both proposed algorithms are not intended for searching representative approximations of the complete Pareto frontier; instead, they search for the RoI: a relatively short subset of Pareto optimal solutions that best match the DM’s preferences. Here, we propose that the RoI is made of solutions classified as ‘Highly Satisfactory’, and the best compromise—the final prescription chosen by the DM—should be a solution belonging to the RoI.
Section 3.1 introduces the Grey Wolf Optimiser with Interval outrankingbased ordinal Classification, abbreviated as GWOInClass.
Section 3.2 introduces Ant Colony Optimisation with Interval outrankingbased ordinal Classification, abbreviated as ACOInClass.
3.1. The GWOInClass Algorithm
GWOInClass extends MOGWO by classifying the solutions in each iteration utilising INTERCLASSnC before picking the leaders from the archive, keeping the solutions ranked in the following order: ‘Highly Satisfactory’, ‘Satisfactory’, ‘Dissatisfactory’, and ‘Strongly Dissatisfactory’.
We suggest using an ordinal classifier that has already been validated in an evolutionary algorithm for manyobjective optimisation [
37]. In this approach, the DM should classify the reference solutions (input) in two classes, ‘Dissatisfactory’ and ‘Satisfactory’ (respectively,
${C}_{1}$ and
${C}_{2}$), the minimum number of classes to apply INTERCLASSnC. Then, the model is extended by artificially adding the classes ‘Highly Satisfactory’ and ‘Strongly Dissatisfactory’. Each new solution
x generated during the evolutionary search is classified according to the following assumptions:
If $x\mathrm{Pr}y\phantom{\rule{0.277778em}{0ex}}\forall y\in {R}_{2}$, then the DM is highly satisfied with x; otherwise, the DM is satisfied with x.
If $y\mathrm{Pr}x\phantom{\rule{0.277778em}{0ex}}\forall y\in {R}_{1}$, then the DM is strongly dissatisfied with the solution x; otherwise, the DM is dissatisfied with x.
Algorithm 1 presents an outline of the ordinal classifier proposed by Balderas et al. [
37].
A crucial process of GWOInClass is to update the archive and select the $\mathsf{\alpha}$, $\mathsf{\beta}$ and $\mathsf{\delta}$ wolves; this process is presented in Algorithm 2. Here, Lines 1–2 initialise the variables; Lines 3–12 rank the solutions according to the class suggested by Algorithm 1; Lines 13–15 make sure that the archive ($\mathcal{A}$) is not larger than the maximum size allowed by the parameter ℏ; and, lastly, Lines 16–20 pick three different solutions from the nonempty bestevaluated class to be the leader wolves.
Algorithm 3 provides an algorithmic outline of GWOInClass. Unlike MOGWO, GWOInClass needs information about the preferences of the DM, which is exploited to become closer to the RoI. The preferences of the DM are articulated through an interval outranking model. We suggest using the proposal by Fernandez et al. [
25] to infer the model parameters that reflect the preferences of the DM. Furthermore, GWOInClass also needs the reference sets with solutions labelled by the DM as ‘Satisfactory’ or ‘Dissatisfactory’. Initially, synthetic solutions are helpful to perform this task; however, if the DM is not confident about those initial sets, they may additionally have some interactions with GWOInClass throughout the optimisation process and directly classify the solutions in the archive, updating the reference sets. Binary classification of solutions is one of the least cognitively demanding ways to interact with the DM.
Algorithm 1 Ordinal classifier based on interval outranking 
Input: Number of objectives (m), parameters of the outranking model ($\mathit{\lambda}$, $\beta $, $\mathit{v}$, $\mathit{w}$), a solution x, representative sets ${R}_{1}$ and ${R}_{2}$ Output: The class for x ($class$)
 1:
${c}_{nc}\leftarrow \mathrm{INTERCLASS}\mathrm{nC}(x,{R}_{1},{R}_{2})$ ▹ Descending and ascending rules  2:
if
${c}_{nc}=\u2018\mathrm{Satisfactory}\u2019$
then  3:
if $x\mathrm{Pr}y\phantom{\rule{0.277778em}{0ex}}\forall y\in {R}_{2}$ then  4:
$class\leftarrow \u2018\mathrm{Highly}\mathrm{Satisfactory}\u2019$  5:
else  6:
if $y\mathrm{Pr}x\phantom{\rule{0.277778em}{0ex}}\forall y\in {R}_{1}$ then  7:
$class\leftarrow \u2018\mathrm{Strongly}\mathrm{Dissatisfactory}\u2019$  8:
return
$class$

Algorithm 2 Selection of the leader wolves 
Input: Archive ( $\mathcal{A}$ ), solutions of the current iteration (${x}^{\iota}$), maximum size of the archive (ℏ) Output: The leader wolves (${x}_{\mathsf{\alpha}},{x}_{\mathsf{\beta}},{x}_{\mathsf{\delta}}$)  1:
$\mathcal{R}\leftarrow \u2329\varnothing ,\varnothing ,\varnothing ,\varnothing \u232a$  2:
$\mathcal{T}\leftarrow PS\left(\mathcal{A}\cup {x}^{\iota}\right)$ ▹ Filtering only Pareto efficient solutions, see Equation (3)  3:
for ${x}_{i}\in \mathcal{T}$ do ▹ Main loop for ranking solutions  4:
$class\leftarrow \mathtt{extended}\_\mathtt{INTERCLASS}\mathtt{nC}\left({x}_{i}\right)$ ▹ Classifying by Algorithm 1  5:
if $class=\u2018\mathrm{Highly}\mathrm{Satisfactory}\u2019$ then  6:
${\mathcal{R}}_{1}\leftarrow {\mathcal{R}}_{1}\cup \left\{{x}_{i}\right\}$  7:
if $class=\u2018\mathrm{Satisfactory}\u2019$ then  8:
${\mathcal{R}}_{2}\leftarrow {\mathcal{R}}_{2}\cup \left\{{x}_{i}\right\}$  9:
if $class=\u2018\mathrm{Dissatisfactory}\u2019$ then  10:
${\mathcal{R}}_{3}\leftarrow {\mathcal{R}}_{3}\cup \left\{{x}_{i}\right\}$  11:
if $class=\u2018\mathrm{Strongly}\mathrm{Dissatisfactory}\u2019$ then  12:
${\mathcal{R}}_{4}\leftarrow {\mathcal{R}}_{4}\cup \left\{{x}_{i}\right\}$  13:
if
$\left\mathcal{T}\right>\hslash $
then  14:
Remove the $\left\mathcal{T}\right\hslash $ worst solutions according to the ranking  15:
$\mathcal{A}\leftarrow \mathcal{T}$  16:
$leaders\leftarrow \u2329{x}_{\mathsf{\alpha}},{x}_{\mathsf{\beta}},{x}_{\mathsf{\delta}}\u232a$  17:
for ${x}_{i}\in leaders$ do ▹ Main loop for picking the leaders  18:
$topRanked\leftarrow \left\{\begin{array}{cc}{\mathcal{R}}_{1}\hfill & \mathrm{if}{\mathcal{R}}_{1}\ne \varnothing \hfill \\ {\mathcal{R}}_{2}\hfill & \mathrm{if}{\mathcal{R}}_{1}=\varnothing \phantom{\rule{0.277778em}{0ex}}\wedge \phantom{\rule{0.277778em}{0ex}}{\mathcal{R}}_{2}\ne \varnothing \hfill \\ {\mathcal{R}}_{3}\hfill & \mathrm{if}{\mathcal{R}}_{1}={\mathcal{R}}_{2}=\varnothing \phantom{\rule{0.277778em}{0ex}}\wedge \phantom{\rule{0.277778em}{0ex}}{\mathcal{R}}_{3}\ne \varnothing \hfill \\ {\mathcal{R}}_{4}\hfill & \mathrm{otherwise}\hfill \end{array}\right.$ ▹ Choosing the best evaluated set that is not empty  19:
${x}_{i}\leftarrow \mathtt{roulette}\_\mathtt{wheel}\left(topRanked\right)$ ▹ Probabilities calculated by Equation (10)  20:
Remove ${x}_{i}$ from $\mathcal{R}$ ▹ To avoid selecting the same solution as ${x}_{\mathsf{\alpha}}$, ${x}_{\mathsf{\beta}}$ or ${x}_{\mathsf{\delta}}$  21:
return
${x}_{\mathsf{\alpha}},{x}_{\mathsf{\beta}},{x}_{\mathsf{\delta}}$

In Algorithm 3, Lines 1–5 initialise the parameters and generate the initial positions of the wolves. Line 6 picks the first leader wolves. Lines 7–12 present the iterated process of GWOInClass; here, Line 8 applies Equations (
4)–(
9) to update the positions of the Ω wolves, following the social leadership of the
$\mathsf{\alpha}$,
$\mathsf{\beta}$ and
$\mathsf{\delta}$ wolves; Line 10 updates the archive and obtains the latest positions of the leader wolves; and lastly, Lines 11–12 update the parameters and variables to perform the next iteration.
Algorithm 3 The Grey Wolf Optimiser with Interval outrankingbased ordinal Classification 
Input Number of objectives (m), number of decision variables (n), parameters of the outranking model (${\mathit{w}}_{\mathit{k}},{\mathit{v}}_{\mathit{k}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\}$, $\mathit{\lambda}$, $\mathit{\beta}$), reference solution sets (${R}_{1}$, ${R}_{2}$), maximum size of the archive (ℏ) Output: An approximation of the RoI ($\mathcal{A}$)
 1:
Initialise the parameters of MOGWO (a, A, and C)  2:
${x}_{i}^{0}\leftarrow \mathtt{random}\_\mathtt{solution}\left(\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall i\in \{1,2,3,\dots ,\hslash \}$ ▹ Generating the initial pack of wolves at random  3:
Calculate ${f}_{k}\left({x}_{i}^{0}\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\},i\in \{1,2,3,\dots ,\hslash \}$ ▹ Getting the values of the objective functions for each wolf  4:
$\mathcal{A}\leftarrow \varnothing $  5:
$\iota \leftarrow 1$  6:
$\left({x}_{\mathsf{\alpha}},{x}_{\mathsf{\beta}},{x}_{\mathsf{\delta}}\right)\leftarrow \mathtt{get}\_\mathtt{leaders}(\mathcal{A},{x}^{0},\hslash )$ ▹ Algorithm 2  7:
while $\iota \u2a7dite{r}_{\mathrm{max}}$ do ▹ Main loop of the optimisation process  8:
${x}_{i}^{\iota}\leftarrow \frac{{\mathfrak{x}}_{1}^{\iota 1}+{\mathfrak{x}}_{2}^{\iota 1}+{\mathfrak{x}}_{3}^{\iota 1}}{3}\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall i\in \{1,2,3,\dots ,\hslash \}$ ▹ Updating the pack of wolves by applying Equations (4)–(9)  9:
Calculate ${f}_{k}\left({x}_{i}^{\iota}\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\},i\in \{1,2,3,\dots ,\hslash \}$  10:
$\left({x}_{\mathsf{\alpha}},{x}_{\mathsf{\beta}},{x}_{\mathsf{\delta}}\right)\leftarrow \mathtt{get}\_\mathtt{leaders}(\mathcal{A},{x}^{\iota},\hslash )$ ▹ Algorithm 2  11:
Update a, A, and C  12:
$\iota \leftarrow \iota +1$  13:
return
$\mathcal{A}$

3.2. The ACOInClass Algorithm
Like GWOInClass, ACOInClass stores the best solutions in the pheromone matrix
$\tau $—its archive—considering the class suggested by Algorithm 1.
Figure 1 depicts the pheromone representation used in ACOInClass; here,
$\kappa $ is the size of the archive,
n is the number of decision variable,
${x}_{l}\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall l\in \{1,2,3,\dots ,\kappa \}$ is the vector of the
lth solution in the archive,
${x}_{l,j}$ is the value of the
jth decision variable of
${x}_{l}$,
${G}_{j}\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall j\in \{1,2,3,\dots ,n\}$ is the Gaussian kernel for the
jth decision variable, and
${\omega}_{l}\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall l\in \{1,2,3,\dots ,\kappa \}$ is the weight of the
lth solution.
The chief difference between
${\mathrm{iMOACO}}_{\mathbb{R}}$ and ACOInClass is the criteria to sort
$\tau $. FalcónCardona and Coello Coello [
28] suggested
${R}_{2}$ scores [
38] as the primary criterion to rank the solutions in
$\tau $. Contrarily, in this paper, we propose the class indicated by Algorithm 1 as the primary criterion, and
${R}_{2}$ scores as the secondary criterion (intraclass solutions are sorted using the
${R}_{2}$ metric). The archive is set to store the
$\kappa $ topranked solutions.
Algorithm 4 provides an algorithmic outline of ACOInClass. Lines 1 and 7 initialise the parameters and variables of the algorithm; Lines 2–6 generate and evaluate the initial random solutions of the colony. Lines 8–16 present the iterated process of ACOInClass; here, Lines 9–10 generate the solutions of the colony by following the equations provided in
Section 2.2.2, particularly Equations (
14)–(
16); Lines 11–14 evaluate and rank the solutions; and, lastly, Lines 15–16 update the data structures to perform the next iteration.
Algorithm 4 Ant Colony Optimisation with Interval outrankingbased ordinal Classification 
Input Number of objectives (m), number of decision variables (n), parameters of the outranking model (${\mathit{w}}_{\mathit{k}},{\mathit{v}}_{\mathit{k}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\}$, $\mathit{\lambda}$, $\mathit{\beta}$), reference solution sets (${R}_{1}$, ${R}_{2}$), maximum size of the archive ($\kappa $) Output: An approximation of the RoI ($\tau $)
 1:
Initialise the parameters of ${\mathrm{iMOACO}}_{\mathbb{R}}$ ($\xi $ and $\varsigma $)  2:
${x}_{i}\leftarrow \mathtt{random}\_\mathtt{solution}\left(\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall {x}_{i}\in \tau $ ▹ Generating the initial solutions of the pheromone trail at random  3:
Calculate ${f}_{k}\left({x}_{i}\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\},{x}_{i}\in \tau $ ▹ Getting the values of the objective functions for each ant  4:
$\tau \leftarrow PS\left(\tau \right)$ ▹ Filtering only Pareto efficient solutions, see Equation (3)  5:
$\mathtt{Normalise}(\tau $)  6:
Rank solutions in $\tau $ ▹ See Algorithm 1  7:
$\iota \leftarrow 1$  8:
while $\iota \u2a7dite{r}_{\mathrm{max}}$ do ▹ Main loop of the optimisation process  9:
for ${x}_{i}\in \Lambda $ do ▹$\Lambda $ is the colony (with $\kappa $ ants)  10:
Generate a new solution based on $\tau $ ▹ Applying Equation (16) 11:
Calculate ${f}_{k}\left({x}_{i}\right)\phantom{\rule{0.277778em}{0ex}}\phantom{\rule{0.277778em}{0ex}}\forall k\in \{1,2,3,\dots ,m\},{x}_{i}\in \Lambda $ ▹ Getting the values of the objective functions for each ant  12:
$\mathcal{O}\leftarrow PS\left(\Lambda \cup \tau \right)$ ▹ Filtering only Pareto efficient solutions, see Equation (3)  13:
$\mathtt{Normalise}(\mathcal{O}$)  14:
Rank solutions in $\mathcal{O}$ ▹ See Algorithm 1  15:
Copy into $\tau $ the $\kappa $ first solutions of $\mathcal{O}$  16:
$\iota \leftarrow \iota +1$  17:
return
$\tau $

4. Experimental Validation
We implemented GWOInClass and ACOInClass in Java using OpenJDK 11.0.10, on a computer with an Intel Core i710510U CPU 1.80 GHz, 16 GB of RAM, and Manjaro 5.10 as operating system. This computer setting applies to all experiments reported in this section.
GWOInClass has three main parameters to be adjusted:
a,
C, and
ℏ. The components of the vector
a are initially set in two, and are linearly decreased to reach zero in the last iteration. The vector
C is dynamically generated during each iteration, providing random numbers (cf. [
27]). The parameter setting of ACOInClass is
$\varsigma =0.1$ and
$\xi =0.5$ (cf. [
28]). The parameter settings of the reference algorithms—NSGAIII, MOGWO and
${\mathrm{iMOACO}}_{\mathbb{R}}$—are also those originally published by the authors in the articles [
27,
28,
39]. A particular case is the size of the population (
ℏ in GWOInClass and
$\kappa $ in ACOInClass), which depends on the number of objective functions (
m):
$\hslash =\kappa =92$ for
$m=3$,
$\hslash =\kappa =212$ for
$m=5$, and
$\hslash =\kappa =276$ for
$m=10$. These values were inspired by the discussion of Deb and Jain [
39] on suitable population sizes for evolutionary algorithms. The maximum number of iterations for all the algorithms reported in this section is
$ite{r}_{\mathrm{max}}=1000$ for
$m\in \{3,5\}$ and
$ite{r}_{\mathrm{max}}=1500$ for
$m=10$.
The adjective ‘significant’ is used in this section if a nonparametric
U test (also know as Mann–Whitney test or Wilcoxon ranksum test) with a 0.95confidence level validates the difference as statistically significant. Furthermore, we performed all tests for statistical significance through STAC [
40].
The rest of this section is organised as follows.
Section 4.1 presents the test suite used to validate the results, as well as the indicators to measure the quality of the solutions.
Section 4.2 describes the results obtained by GWOInClass. Lastly,
Section 4.3 presents the results by ACOInClass.
4.1. Benchmark Problems and Performance Indicators
DTLZ [
41] has become the standard test suite most broadly accepted to assess the performance of MOEAs and MOSIAs. Accordingly, we ran our algorithms on the nine problems in the DTLZ suite, named DTLZ1–DTLZ9. These problems have continuous decision variables, and are scalable regarding the number of decision variables and objective functions, offering Pareto frontiers with challenging properties (e.g., bias, concavity, convexity, degeneration, multifrontality, and separability). We explored the dimensionality of each DTLZ problem by considering 3, 5, and 10 objective functions (
m); accordingly, there are 27 different instances to validate our algorithms. The numbers of positionrelated variables (
k) and decision variables (
n) for each problem are the following:
DTLZ1: $k=5$ and $n=m+k1$.
DTLZ2–DTLZ6: $k=10$ and $n=m+k1$.
DTLZ7: $k=20$ and $n=m+k1$.
DTLZ8 and DTLZ9: $k=m1$ and $n=10m$.
As GWOInClass and ACOInClass consider the DM’s preferences, each of the 27 test instances was validated using ten interval outranking models, representing different DMs. Some stateoftheart studies [
42,
43,
44] have proposed and used these synthetic DMs to validate a priori optimisation methods run on the DTLZ suite. The primary motivation behind this choice is that an acceptable approximation to the true RoI is known for these synthetic DMs, and these Approximated RoIs (abbreviated as ‘ARoI’ from hereon) are in compliance with the interval outranking model described in
Section 2.3, which is the keystone of the interval ordinal classifier GWOInClass and ACOInClass use (see Algorithm 1).
The ARoI should contain the most preferred solutions in terms of interval outranking; Rivera et al. [
44] followed this underlying principle to calculate the ARoI as follows (cf. [
42,
43]):
The proposed algorithms ran 30 times on each test instance using each synthetic DM (as a consequence, each algorithm was run 300 times per instance). The benchmark algorithms—NSGAIII, MOGWO and ${\mathrm{iMOACO}}_{\mathbb{R}}$—were also run 300 times.
There are several indicators for evaluating the performance of a multiobjective optimiser that approximates the complete Pareto frontier (e.g., hypervolume, spacing, spread, and inverted generational distance). However, these indicators are inadequate for assessing the performance of preferencebased multiobjective algorithms because no metric can be directly applied when only a partial Pareto frontier is considered [
45]. Furthermore, some attempts to assess the quality of a preferred solution adapt these indicators in an oversimplified way, making the assessments misleading. In line with this notion, we measured the performance via three indicators that consider the solutions that maximise the preferences of the DM. Let
${X}^{*}$ be the latest set of solutions of an algorithm, the following three indicators are utilised:
Minimum Euclidian distance. The distance from the ARoI to the closest point in ${X}^{*}$.
Average Euclidian distance. The average distance from the points in the ARoI to those in ${X}^{*}$.
Satisfaction. The proportion of solutions in ${X}^{*}$ belonging to the class ‘Highly Satisfactory’.
On the one hand, the distancebased indicators measure the quality in terms of the similarity between ${X}^{*}$ and the ARoI; the minimum distance considers the best solution alone, and the average distance considers the overall trend. For these indicators, the lower the values, the closer the approximation. On the other hand, the satisfactionbased indicator measures the quality of the algorithm considering the number of solutions that could potentially become the best compromise solution. For this indicator, greater values are preferred.
4.2. On the Performance of GWOInClass
Table 1 shows the results of GWOInClass in comparison with MOGWO. Given a number of objectives (Column 1),
Table 1 presents the average performance in terms of the indicators (referred to in Column 2) obtained by each algorithm (referred to in Column 3) on each DTLZ problem (referred to in Columns 4–12); here, the cells were shaded if the difference is statistically significant: in blue if it is in favour of GWOInClass, in yellow if it is in favour of MOGWO.
The information in
Table 1 may be summarised in the following points:
Considering the satisfactionbased indicator: The results of GWOInClass were significantly better than those of MOGWO in DTLZ1, DTLZ2, and DTLZ6 regardless of the number of objectives. In other problems (DTLZ3, DTLZ5, and DTLZ7), the advantage of including ordinal classification only became significant when m increased. On the whole, our strategy had a greater impact on the ‘satisfaction’ indicator in manyobjective optimisation ($m\in \{5,10\}$) than in multiobjective optimisation ($m=3$).
Considering the Euclidean indicators: The results on DTLZ1–3 were particularly encouraging because the averages of GWOInClass were consistently lower than those of MOGWO, having statistical significance in the great majority of the instances. In DTLZ5 and DTLZ6, GWOInClass became closer to the RoI only in manyobjective instances. Contrastingly, when $m=3$, our strategy was inconvenient to treat DTLZ4 and DTLZ6.
Considering DTLZ8 and DTLZ9: The embedding of ordinal classification in MOGWO does not yield any significant benefit. It is worth noting that they are the only problems with side constraints in this benchmark [
46].
As a partial conclusion, we would recommend using GWOInClass instead of MOGWO to address MaOPS. GWOInClass was at least as good as MOGWO; what is more, it was significantly better on a regular basis. A test for statistical significance supports this hypothesis on the DTLZ test suit.
Additionally, we compared the results of GWOInClass with NSGAIII, which has been widely accepted by the scientific community as a benchmark algorithm for evolutionary manyobjective optimisation.
Table 2 presents the results of both algorithms on the DTLZ test suit. Again, the three indicators—the average Euclidean distance, the minimum Euclidean distance, and satisfaction—are considered, and the cells with significant differences are shaded (blue in favour of GWOInClass, yellow in favour of NSGAIII).
According to
Table 2, the worst performance was observed when
$m=3$. Contrarily, GWOInClass approximated the RoI significantly better than NSGAIII in most of the 10objective instances. Considering the satisfactionbased indicator, DTLZ1, DTLZ7, and DTLZ9 are especially challenging for GWOInClass. To conclude, the performance of GWOInClass becomes competitive as the number of objectives increases according to this standard of the literature.
Lastly, we plotted the results of some single runs of GWOInClass in
Figure 2. We took those runs on the threeobjective problems with the results closest to the ARoI. Although such runs are not representative, they clearly depict how GWOInClass biases the search toward a privileged region in the Pareto frontier. The best compromise would be a solution belonging to such region (coloured in red in
Figure 2).
4.3. On the Performance of ACOInClass
Table 3 presents the results of ACOInClass in comparison with
${\mathrm{iMOACO}}_{\mathbb{R}}$; its columns should be interpreted with the same meaning provided for
Table 1. As a summary of the information provided, let us discuss the following remarks:
Considering ten objective functions: The advantage of embedding ordinal classification became statistically significant only when $m=10$. Taking DTLZ1–4 and DTLZ6–8, ACOInClass outperformed ${\mathrm{iMOACO}}_{\mathbb{R}}$ in at least one indicator, performing especially well in DTLZ1, DTLZ3, DTLZ6, and DTLZ8.
Considering three and five objective functions: With the only exception of a specific setting (DTLZ6, average Euclidean distance, and $m=5$), ACOInClass was at least as good as ${\mathrm{iMOACO}}_{\mathbb{R}}$ regardless of the number of objectives.
Considering DTLZ5 and DTLZ9: Like in the case of GWOInClass, no advantage was observed in DTLZ9. Additionally, this situation also occurred in DTLZ5.
We would strongly recommend using ACOInClass instead of ${\mathrm{iMOACO}}_{\mathbb{R}}$ to address MaOPS with about ten objective functions. This insight is relevant because the efficiency of MOSIAs and MOEAs is degraded as the number of objectives increases. The strategy proposed in this paper could become a viable means of mitigating this severe drawback. Still, the DM should be prepared to devote the necessary time to express their preferences.
Furthermore,
Table 4 shows the results of ACOInClass in comparison with those of NSGAIII. The columns of
Table 4 should be analogously interpreted as in
Table 2. These results together with the statistical tests allow concluding that ACOInClass is competitive to address 10objective problems according to the standard established by NSGAIII; in fact, ACOInClass outperformed NSGAIII in the vast majority of these instances. As a welcome side effect, ACOInClass can also support the a posteriori decision analysis; this feature would reduce the DM’s cognitive effort invested in identifying the best compromise. Adversely, these results clearly imply that ACOInClass was unsuitable for treating threeobjective instances.
Again, we plotted the results of some runs of ACOInClass, which are shown in
Figure 3. These runs presented the best results on the threeobjective instances and clearly depicted how ACOInClass biases the search toward the ARoI. Here, the best compromise would be a solution obtained by ACOInClass (coloured in red in
Figure 3).
5. Conclusions and Directions for Future Research
This paper introduces a novel strategy to incorporate preferences into swarm intelligence algorithms. Following the taxonomy of Bechikh et al. [
4], the proposed strategy falls into the category of ‘solution classification’. Initially, the DM should express their preferences about the solutions by classifying them as ‘Satisfactory’ or ‘Dissatisfactory’. The proposed strategy is one of the least cognitively demanding in the framework of solution classification because the DM must merely classify solutions into just two categories.
Then, we suggest that the optimisation algorithms additionally identify two artificial classes—‘Highly Satisfactory’ and ‘Strongly Dissatisfactory’—through an ordinal classifier based on interval outranking to model different levels of intensity in the DM’s preferences. Consequently, the classifier increases its ability to discriminate. Here, we hypothesised that swarm intelligence algorithms can obtain the edge by increasing the selective pressure toward the region of the Pareto frontier containing ‘Highly Satisfactory’ solutions. A straightforward way to achieve it is to consider ordinal classification as the major criterion to rank the solutions in the archive.
Typically, swarm intelligence algorithms use an archive with the best sofar approximation of the Pareto frontier. These solutions are sorted to pick the solution(s) whose patterns will be exploited in the next iteration, hence its relevance.
By applying this strategy, ordinal classification was embedded in two swarm intelligence algorithms, expressly Multiobjective Grey Wolf Optimisation and Indicatorbased Multiobjective Ant Colony Optimisation for continuous domains. The extended versions were called Grey Wolf Optimiser with Interval outrankingbased ordinal Classification (GWOInClass) and Ant Colony Optimisation with Interval outrankingbased ordinal Classification (ACOInClass). We used ten synthetic DMs to validate the results; we also considered each problem in the DTLZ test suite and explored different numbers of objective functions (three, five, and ten). The algorithms ran 30 times for each setting.
The impact of our strategy depended on several factors: the number of objectives, the baseline algorithm, and the properties of the test problem. Despite this, our strategy conferred marked benefits when many objective functions were treated. In the case of GWOInClass, such benefits became significant in six of the nine DTLZ problems when the number of objectives was at least five. In the case of ACOInClass, the ordinal classifier impacted the performance in seven DTLZ problems only when ten objective functions were considered.
Although our strategy requires that the DM is welldisposed to spend the necessary time to elicit their preferences, such an effort can be favourably compensated when problems with many objective functions are treated; especially, keeping in mind that these problems are still highly challenging for a posteriori algorithms (that approximate the complete Pareto frontier). The embedding of ordinal classification based on interval outranking contributed to coping with these difficulties. Numerical results and tests for statistical significance supported these conclusions.
Perhaps, the major criticism of our approach is that it is only applicable when the preferences of the DM are compatible with the underlying principles of outranking. That is, the DM admits veto effects and has a noncompensatory preference about the objectives.
Further research is needed to draft conclusions with a greater generalisation. First, it is necessary to know the impact of this strategy on other swarm intelligence algorithms (e.g., particle swarm optimisation, artificial bee colony, and elephant herding optimisation). Second, it is also necessary to conduct more experimentation with a deeper analysis to connect the performance with the properties of the problem, providing plausible explanations for those problems in which no advantage was observed (e.g., DTLZ9).