1. Introduction
To address the increasing complexity of real-world optimization challenges, the Traveling Thief Problem (TTP) was introduced as a benchmark that integrates two classical combinatorial optimization problems: the Traveling Salesman Problem (TSP) and the Knapsack Problem (KP) [1]. The TTP presents a scenario in which a thief must traverse a series of cities, returning to the starting point while selectively picking valuable items to maximize profit. The challenge lies in the limited capacity of the knapsack and the time-based rental cost, creating a unique interdependence between the TSP and KP. This study aims to develop an optimal travel plan and item selection strategy to maximize the thief’s total benefit.
The TTP combines the path optimization of the TSP with the item selection of the KP, creating a complex problem where solving one subproblem directly influences the outcome of the other. The complexity of TTP arises not only from the NP-hard nature of both the TSP and KP [2], but also from the intricate interdependence between these two problems, making the overall problem even more challenging.
Metaheuristic algorithms have emerged as powerful tools for addressing complex challenges like the TTP. These algorithms incorporate randomness and heuristic rules to search for globally optimal solutions. Metaheuristic algorithms can be broadly categorized based on their operational principles and application scenarios:
- Evolutionary Algorithms (EAs), including Genetic Algorithms (GA) and Differential Evolution (DE): These algorithms generate new solutions by simulating natural selection through operations such as selection, crossover, and mutation. They are widely used in path optimization and resource allocation [ 3- ]. 
- Swarm Intelligence Algorithms, such as Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO): These algorithms solve combinatorial optimization problems by mimicking the collective behavior of swarms. They have shown excellent performance in solving TSP and other path optimization problems [ 4- ]. 
- Simulated Annealing (SA): Inspired by the physical annealing process, this algorithm gradually reduces randomness to help the search process converge to a global optimum [ 5- ]. SA has been widely applied in solving combinatorial optimization problems. 
- Tabu Search (TS): This algorithm maintains a “tabu list” of previously visited solutions to avoid cycling, helping to escape local optima. It is particularly effective in path optimization and scheduling problems [ 6- ]. 
- Hybrid Metaheuristics: These combine two or more metaheuristic algorithms to enhance efficiency and precision. For instance, the hybrid of Genetic Algorithms with Tabu Search (GATS) combines the global search capability of GA with the local search capability of TS, showing outstanding performance in complex problems [ 7- ]. 
The TSP and KP have been extensively studied, with various algorithms developed to address their complexities. Significant progress in TSP has been achieved through heuristic algorithms like ACO and GA, which have demonstrated excellent performance in path optimization [3,4]. Similarly, KP has been tackled using dynamic programming and greedy algorithms, which have proven effective in resource allocation [8,9].
Building on these foundational problems, the TTP not only inherits their complexities but also introduces a new layer of interdependence between path selection and item selection, making it a more challenging and comprehensive optimization problem [10]. Consequently, developing new approaches to manage the interaction between path and item selection in TTP is particularly important.
While TTP shares some similarities with the Capacitated Vehicle Routing Problem (CVRP) in optimizing routes under capacity constraints, there are fundamental differences in their structures. CVRP focuses primarily on optimizing vehicle routes within capacity limits to service customers, whereas TTP adds the complexity of item selection, where the choice of items directly affects overall route efficiency. This interdependence between path selection and item selection makes TTP a more comprehensive and challenging problem [11].
Researchers have proposed various methods to solve TTP instances. Evolutionary approaches, such as GA and Hybrid Genetic Algorithms combined with Tabu Search (GATS), have been widely applied to tackle TTP [12,13]. Ant Colony Algorithms [14] and Hybrid Ant Colony Algorithms [15] have also been employed. Additionally, modified Artificial Bee Colony Algorithms [16] and innovative Differential Evolution Algorithms [17] have shown significant advantages. Typically, these algorithms determine the tour plan first, followed by the item selection plan. However, in some studies [18,19], an enhanced Simulated Annealing Algorithm was used to determine the item selection plan first, followed by the tour plan. These studies emphasize the importance of integrating multiple optimization methods in solving complex TTP instances.
In recent years, heuristic methods for TTP have gained widespread attention, leading to many innovative studies. For instance, Mei et al. [20] introduced Cooperative Coevolution (CC) and the Memetic Algorithm (MA) to solve TTP. CC decomposes TTP into two independent subproblems, solving them separately, while MA addresses TTP as a unified problem, emphasizing the interdependence between the subproblems. Their research highlights the importance of considering the interaction between subproblems in the optimization process, a conclusion further validated by Bonyadi et al. [21]. Bonyadi and colleagues proposed the CoSolver algorithm, which also focuses on this interdependence, and developed the Density-based Heuristic (DH). Building on these frameworks, El Yafrani et al. [22] proposed the CS2SA* and CS2SA-R algorithms, combining the 2-OPT steepest ascent hill-climbing heuristic with Simulated Annealing.
Additionally, Polyakovskiy et al. [10] initially proposed simple heuristic methods, marking the first attempt to address TTP. Building on this foundation, Mei et al. [23] developed an efficient Memetic Algorithm that incorporates a two-stage local search, demonstrating effectiveness in solving large-scale TTP benchmark instances. Mei et al. [24] further advanced the automatic evolution of effective item selection heuristics through Genetic Programming (GP). Martins et al. [25] proposed an Estimation of Distribution Algorithm (EDA)-based heuristic selection method in a hyper-heuristic context, while El Yafrani et al. [26] applied low-level hyper-heuristics to small and mid-sized TTP instances. These studies broadened the range of solution approaches for TTP and validated the effectiveness of combining local search heuristics with other methods [27,28,29,30]. In the experimental section, the results of these methods are compared with those of the proposed algorithm.
The application of mutation operators in optimization algorithms, particularly for solving TSP and other combinatorial optimization problems, has been extensively studied. These operators introduce randomness and diversity into the search process, effectively preventing the algorithm from getting trapped in local optima [31]. In the context of TTP, the application of mutation operators has proven crucial, especially in multiobjective optimization problems [32].
To address these complex optimization challenges, the Five-element Cycle Optimization algorithm (FECO) was introduced [33]. FECO is a heuristic algorithm based on the traditional Chinese Five Elements theory, simulating the generative and restrictive relationships among the five elements to create a dynamically balanced optimization model. Over time, FECO has evolved to tackle more challenging multiobjective optimization problems. For instance, the Local Search-Based Many-Objective FECO (LSMaOFECO) [34] significantly improves FECO’s effectiveness in high-dimensional objective spaces. Additionally, in cold chain logistics, the Dual-Mode Updated FECO (FECO-DMUI) [35] optimizes delivery routes by balancing costs and customer satisfaction, demonstrating superior performance compared with traditional algorithms.
Despite its versatility and robustness in addressing complex optimization problems, FECO’s application to TTP remains underexplored, particularly in scenarios involving highly interdependent objectives. This study aims to bridge this gap by applying an enhanced version of FECO to TTP and integrating mutation operators to further improve solution quality and convergence speed.
In this paper, the Five-element Cycle Optimization algorithm based on Integrated Mutation Operator (FECOIMO) is proposed and applied to solve TTP. FECOIMO enhances the basic FECO method by incorporating various mutation operators to prevent premature convergence in complex combinatorial optimization problems. Our goal is to validate the effectiveness of FECOIMO in solving TTP instances of different scales and demonstrate its superiority by comparing it with other well-established algorithms.
The remainder of this paper is structured as follows: Section 2 provides an overview of the TTP, while Section 3 delves into the principles underlying FECO. In Section 4, the application of FECOIMO in solving TTP instances is elucidated. Subsequently, Section 5 presents the experimental results and analysis derived from our approach. Finally, Section 6 offers concluding remarks and outlines potential avenues for future research.
  2. The Traveling Thief Problem (TTP)
The Traveling Thief Problem (TTP) is a unique combination of two classic combinatorial optimization problems: the Traveling Salesman Problem (TSP) and the Knapsack Problem (KP). TTP presents a scenario where a thief must traverse a set of n cities exactly once, starting and ending at the same city—a structure similar to TSP. However, unlike the traditional TSP, during each visit to a city, the thief has the option to steal items from a pool of m available choices, placing them in a knapsack that has a finite capacity—this aspect resembles KP.
The decision variables in TTP consist of two main components: the tour plan, denoted by , representing a permutation of the n cities including the starting city , and the picking plan, denoted by , where  indicates that item t is picked in city , and  signifies that item t remains unpicked (; ).
The thief’s speed decreases linearly with the cumulative weight of the items in the knapsack. Thus, the departure speed of the thief from city $x_i$ can be expressed as follows:
$$v_{x_i} = v_{\max} - W_{x_i}\,\frac{v_{\max} - v_{\min}}{Q} \qquad (1)$$
where $W_{x_i}$ ($0 \le W_{x_i} \le Q$) represents the total weight of items carried when departing city $x_i$. When the knapsack is empty ($W_{x_i} = 0$), the thief’s speed is at its maximum, $v_{\max}$, whereas when the knapsack reaches full capacity ($W_{x_i} = Q$), the speed decreases to its minimum, $v_{\min}$.
To compute 
, the variable 
 (
, 
) is defined as follows:
      where 
 indicates that item 
t is picked in city 
, while 
 indicates it remains unpicked. Thus, 
 is calculated as:
The distances between the n cities are represented by the symmetric distance matrix , where  denotes the distance from city  to , with . When ,  is replaced by 1 to account for the cyclic nature of the tour.
Additionally, the distribution of the m items across the cities is governed by the availability matrix , a binary matrix where  indicates that item t is present at city , and   signifies its absence. Each item t has a value denoted by  and a weight denoted by , while the knapsack’s capacity is denoted by Q.
The total time spent by the thief on the tour can be calculated as follows:
	  The aggregate value of all picked items can be expressed as follows:
This paper focuses on solving the single-objective version of TTP, where the primary goal is to develop an optimal tour plan, 
, along with an optimal picking plan, 
, to maximize the thief’s total benefit. This benefit comprises two components: the total value of the stolen items and the corresponding knapsack rental costs. The objective function is to balance maximizing the value of the stolen items while minimizing the associated rental costs, thereby ensuring the thief attains the highest possible net benefit. The objective function of TTP can be succinctly represented by Equation (
6):
$$G = V - R \cdot T \qquad (6)$$
where $G$ denotes the total benefit accrued by the thief, $V$ represents the aggregated value of all picked items, $R$ signifies the rental cost of the knapsack per unit of time, and $T$ corresponds to the duration of the thief’s tour.
Subject to:
	  This constraint ensures that the total weight of the items picked by the thief does not exceed the capacity of the knapsack.
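The evaluation of a candidate solution described in this section can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes 0-based city indices, a picking plan stored as a list `picked_at` whose entry for each item is the city where the item is picked (or `None` if unpicked), and the hypothetical names `evaluate_ttp`, `v_max`, `v_min`, and `rent`. The speed rule is the linear decrease between maximum and minimum speed described above, and the benefit is the total value minus the rental cost times the tour duration.

```python
def evaluate_ttp(tour, picked_at, dist, weight, value, capacity, v_max, v_min, rent):
    """Return (benefit, total_time, total_value) for one TTP solution.

    Assumptions of this sketch: cities are 0-based indices, picked_at[t] is
    the city where item t is picked (None if unpicked), and v_min > 0.
    """
    load_at = {}          # weight added to the knapsack in each city
    total_weight = 0.0
    total_value = 0.0
    for item, city in enumerate(picked_at):
        if city is None:
            continue
        load_at[city] = load_at.get(city, 0.0) + weight[item]
        total_weight += weight[item]
        total_value += value[item]

    # Capacity constraint: the picked items must fit in the knapsack.
    if total_weight > capacity:
        raise ValueError("picking plan exceeds knapsack capacity")

    slowdown = (v_max - v_min) / capacity   # speed lost per unit of weight
    carried = 0.0
    total_time = 0.0
    n = len(tour)
    for i, city in enumerate(tour):
        carried += load_at.get(city, 0.0)   # pick up before departing
        nxt = tour[(i + 1) % n]             # the last city returns to the start
        speed = v_max - carried * slowdown
        total_time += dist[city][nxt] / speed

    benefit = total_value - rent * total_time
    return benefit, total_time, total_value
```

Because the carried weight only grows along the tour, the same set of items yields a shorter travel time when heavy items are picked late, which is the interdependence between the tour plan and the picking plan.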
  3. Five-Element Cycle Optimization Algorithm (FECO)
Optimization algorithms are crucial tools for solving complex combinatorial problems. In this context, the Five-element Cycle Optimization algorithm (FECO) emerges as a unique and effective approach, inspired by ancient Chinese philosophy. FECO leverages the Five-element Cycle Model (FECM), which models the dynamic interactions among different elements, to guide the search for optimal solutions. This section delves into the underlying principles of the FECM and explains how it forms the foundation for the FECO, enabling it to address challenging optimization tasks like the Traveling Salesman Problem (TSP) and other multiobjective problems.
  3.1. The Five-Element Cycle Model
The Five-element Cycle Model (FECM) [33] is rooted in the ancient Chinese philosophy of the Five Elements (Wu Xing), which represents the dynamic interactions among the elements of metal, wood, water, fire, and earth. These elements interact through generating (promoting) and restricting (inhibiting) forces, forming a balanced and cyclical relationship. The generating interaction can be likened to a mother nurturing her child, ensuring its growth, while the restricting interaction is similar to a grandparent disciplining a grandchild, maintaining order within the system.
As shown in Figure 1, the outer circle symbolizes generating interactions, while the inner circle represents restricting interactions. For example, wood receives generating force from water and is constrained by metal. At the same time, wood exerts a generating force on fire and restricts earth.
In FECM, these interactions are mathematically modeled to capture the dynamics of these relationships. The mass of each element, denoted as 
, where 
i represents the element and 
k the time step, evolves based on the forces exerted by other elements. The force 
 acting on element 
i at time 
k is calculated considering the generating and restricting influences from other elements, as described by Equation (
8).
        
        where 
, 
, 
, 
, and 
 represent the masses of metal, water, wood, fire, and earth, respectively, at time 
k.
Each element  () in the cycle is influenced by four distinct segments:
- Parent element generation force: The first segment in the equation, , represents the force generated by the parent element on the current element. This force is positive if the mass of the parent element  is greater than the mass of the current element , implying effective generation. Conversely, if the current element’s mass is greater, this force diminishes, indicating a weaker generation effect. 
- Grandparent element inhibition force: The second segment, , represents the inhibitory force exerted by the grandparent element on the current element. This segment is negative, indicating that as the mass of the grandparent element  increases relative to the current element , the inhibitory effect strengthens, reducing the current element’s growth or influence. 
- Child element generation force: The third segment, , denotes the generation force that the current element exerts on its child element. A smaller mass of the current element  compared with its child  implies a weaker generation force, as the element’s ability to generate the next element in the cycle is diminished. 
- Grandchild element inhibition force: The fourth segment, , describes the inhibition force that the current element exerts on its grandchild element. Similar to the child element generation force, inhibition becomes stronger as the mass of the current element decreases relative to the grandchild . 
To illustrate, consider the wood element (i.e., ). Wood receives generating force from water (element ) and is inhibited by metal (element ). Simultaneously, wood exerts a generation force on fire (element ) and an inhibitory force on earth (element ). The first term  reflects water’s positive influence on Wood, as water nourishes wood. The second term  illustrates metal’s inhibitory effect on wood. The third and fourth terms represent Wood’s influence on fire and earth, respectively, showing how wood interacts with its descendants in the cycle.
From the above analysis, it is evident that the forces acting on each element in the Five-element Cycle are intricately tied to the masses of the elements themselves. The greater the mass difference between related elements, the stronger the force exerted, whether generative or inhibitive. This relationship between mass and force is crucial to understanding how the elements dynamically interact within the cycle, providing insight into their balance and overall system behavior.
Generalizing to a case with 
L elements in each cycle, the FECM can be formulated as follows: 
        where 
i ranges from 1 to 
L. When 
, 
 is replaced by 
L; when 
, 
 is replaced by 
L; when 
, 
 is replaced by 1; and when 
, 
 is replaced by 1. The weight coefficients 
, 
, 
 and 
 control the strength of these interactions and are typically set to 1, reflecting equal emphasis on all interaction types.
The FECM is not only a conceptual model but also forms the basis for FECO, which has been effectively applied to solve complex combinatorial optimization problems, such as the Traveling Salesman Problem (TSP) [
33]. Additionally, FECO has evolved to tackle more challenging multiobjective optimization problems. For instance, the Local Search-Based Many-Objective FECO (LSMaOFECO) [
34] significantly improves FECO’s effectiveness in high-dimensional objective spaces. By modeling the interactions among elements as iterative processes, FECO leverages the balance of generating and restricting forces to guide the search for optimal solutions.
This model offers a unique perspective on optimization, enabling a dynamic balance between exploration and exploitation in the search space, mirroring the harmonious interplay found in nature.
  3.2. The Five-Element Cycle Optimization
Building on FECM, FECO has been proposed and effectively applied to solve TSP [
33]. FECO is a nature-inspired metaheuristic algorithm, drawing on the traditional Chinese philosophy of the Five Elements theory. Metaheuristic algorithms are commonly used to solve complex optimization problems, characterized by their use of randomness and heuristic rules to search for global optimal solutions [
36]. In FECO, the population in the optimization algorithm corresponds to the elements within the Five-element Cycle framework, with individuals representing each element. The element space is illustrated in 
Figure 2. It shows that a population of size 
N is divided into 
q cycles, each containing 
L elements, i.e., 
. FECO operates as an iterative algorithm, where 
 denotes the 
i-th element within the 
j-th cycle at the 
k-th iteration (
; 
). Here, 
 represents the mass of 
, and 
 signifies the force exerted by 
.
When applying FECO to solve optimization problems, an element corresponds to a solution. The objective function, or a related variant, can be used as the mass M of the element. Based on the relationship between M and F, F can be used to evaluate the objective function. For instance, in the case of minimizing an objective function, when the objective function value is used as the mass of the element, a larger F implies a smaller corresponding M and hence a smaller objective value.
Consequently, when solving the TSP, solutions where 
 are considered superior and require no further updating, while solutions where 
 are deemed inferior and are subject to update operations [
33]. Through this distinctive mechanism of FECO, in conjunction with appropriate operators, effective resolution of the optimization problem can be achieved.
  4. Five-Element Cycle Optimization Algorithm Based on Integrated Mutation Operator for the TTP
In this section, the Five-element Cycle Optimization algorithm based on the Integrated Mutation Operator (FECOIMO) is presented, specifically designed to address the complex Traveling Thief Problem (TTP). The TTP poses significant challenges due to its dual-component structure, which involves optimizing both a tour plan and a picking plan. FECOIMO combines the principles of the Five-element Cycle Model with advanced mutation operators to efficiently explore the solution space and enhance the quality of the solutions. The algorithm dynamically adjusts its operations based on the current state of the solution, ensuring a balanced approach between exploration and exploitation. The following subsections detail the expression of solutions, initial solution generation, force calculation, integrated mutation operator, heuristic operator, element update process, and the overall implementation of FECOIMO.
  4.1. Expression of Solution and the Mass of the Element
This paper introduces the Five-element Cycle Optimization algorithm based on the Integrated Mutation Operator (FECOIMO). When applying FECOIMO to address the Traveling Thief Problem (TTP), , represents a feasible solution for TTP. Given that the decision variable of TTP comprises two distinct components–a tour plan  and a picking plan –the solution can be expressed as .
The tour plan 
 encompasses 
n decision variables, corresponding to the number of cities, 
n:
The picking plan 
 comprises 
m decision variables, reflecting the number of items, 
m:
In solving the TTP using FECOIMO, the objective function aims to maximize the benefit 
G. Leveraging the relationship between mass 
M and force 
F, the mass of the element is formulated according to Equation (
12), as designed in this paper.
        
		In Equation (
12), 
 denotes the benefit of element 
, while 
 represents the maximum benefit among all elements at the 
k-th iteration. To ensure the validity of Equation (
9), Equation (
12) has been adjusted by adding 1, ensuring that 
. Notably, a smaller value of 
 indicates closer proximity to the optimal solution. Consequently, a larger value of 
 suggests that the 
i-th element in the 
j-th cycle may represent a promising solution. Elements are thus evaluated based on 
 according to FECO.
Based on the relationship among FECM, FECOIMO, and TTP, as shown in 
Table 1, FECOIMO is specifically designed to solve the TTP.
  4.2. Initial Solutions Generation
The process of constructing the initial solution 
) involves generating both an initial tour plan and an initial picking plan. To generate the initial tour plan, a random tour 
 is first created. Following this, a greedy operator is applied to refine the tour and approximate an optimal solution. The greedy operator iteratively adjusts the positions of remaining cities 
 (
) by selecting the position that minimizes the overall tour length. The pseudocode for the greedy operator is provided in Algorithm 1. The goal is to construct a tour where each city’s position is determined based on a local optimization criterion, typical of a greedy algorithm.
        
| Algorithm 1 Greedy operator | 
| Require: n, distance matrix
Ensure: initial tour plan
 1: Randomly select three cities to generate a tour
 2: for each remaining city do
 3:     Find the optimal position for the city by minimizing the additional distance
 4:     Insert the city into the tour at the position that results in the shortest distance
 5: end for
 6: Return the tour
 | 
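The greedy operator can be sketched in Python as a cheapest-insertion procedure. This is an illustrative reading of Algorithm 1 rather than the authors' code: the function name `greedy_insertion_tour`, the 0-based city indices, the random order in which the remaining cities are processed, and the optional `rng` argument are all assumptions of the sketch.

```python
import random


def greedy_insertion_tour(dist, rng=None):
    """Build a tour by inserting each city where it adds the least distance.

    dist is a symmetric distance matrix; cities are 0-based indices
    (an assumption of this sketch).
    """
    rng = rng or random.Random()
    cities = list(range(len(dist)))
    rng.shuffle(cities)

    tour = cities[:3]                 # three randomly selected cities
    for city in cities[3:]:           # remaining cities, in random order
        best_pos, best_extra = 0, float("inf")
        for pos in range(len(tour)):
            a = tour[pos]
            b = tour[(pos + 1) % len(tour)]   # the tour is cyclic
            # Additional distance if `city` is placed between a and b.
            extra = dist[a][city] + dist[city][b] - dist[a][b]
            if extra < best_extra:
                best_pos, best_extra = pos + 1, extra
        tour.insert(best_pos, city)
    return tour
```

The sketch does not pin the starting city to the first position; since the tour is cyclic, it can be rotated afterwards so that the starting city comes first.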
The initial picking plan 
 is then determined based on the generated initial tour plan 
. In the initial picking plan, each item is assigned to a city where it can be collected. A greedy approach can be applied here to maximize the value-to-weight ratio of the items selected while respecting the knapsack’s capacity. If the total weight exceeds the knapsack’s capacity, items are removed, prioritizing those with the lowest value-to-weight ratio. The detailed steps for generating the initial picking plan are provided in Algorithm 2.
        
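| Algorithm 2 Initial picking plan | 
| Require: Q, item weights, item values, availability matrix
Ensure: initial picking plan
 1: Set the total weight to 0 and mark every item as unpicked
 2: for each item t do
 3:     for each city do
 4:         if item t is available in the city then
 5:             Add the city to the candidate cities of item t
 6:         end if
 7:     end for
 8:     Randomly select one city from the candidate cities
 9:     Assign item t to the selected city
10:     Add the weight of item t to the total weight
11: end for
12: while the total weight exceeds Q do
13:     Remove item t with the lowest value-to-weight ratio
14:     Subtract the weight of item t from the total weight
15: end while
16: Return the initial picking plan
 | 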
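A minimal Python sketch of the initial picking plan follows. It is based on the textual description (assign each item to a randomly chosen city where it is available, then drop the items with the lowest value-to-weight ratio until the capacity is respected) and is not the authors' code. The names `initial_picking_plan`, `avail`, and `rng`, the 0-based indices, and the use of `None` for an unpicked item are assumptions.

```python
import random


def initial_picking_plan(avail, weight, value, capacity, rng=None):
    """Return a list whose entry t is the pickup city of item t, or None.

    avail[t][c] is truthy when item t is available in city c; item weights
    are assumed to be positive (assumptions of this sketch).
    """
    rng = rng or random.Random()
    m = len(weight)
    plan = [None] * m
    total = 0.0

    # Assign every item to one randomly chosen city that offers it.
    for t in range(m):
        candidates = [c for c, present in enumerate(avail[t]) if present]
        if not candidates:
            continue
        plan[t] = rng.choice(candidates)
        total += weight[t]

    # Repair: drop the lowest value-to-weight items until the load fits.
    picked = sorted(
        (t for t in range(m) if plan[t] is not None),
        key=lambda t: value[t] / weight[t],
    )
    for t in picked:
        if total <= capacity:
            break
        plan[t] = None
        total -= weight[t]
    return plan
```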
  4.3. Force Calculation
In FECOIMO, the force exerted on each element is calculated using the following formula:
        where 
 represents the force exerted on the 
i-th element by other elements within the 
j-th cycle at the 
k-th iteration.
  4.4. Integrated Mutation Operator
The mutation operator plays a critical role in optimization algorithms by randomly altering the genes of individuals. This mechanism enables the algorithm to explore additional solutions within the search space, thereby reducing the risk of getting trapped in local optima. The novel individuals it introduces also enhance the diversity of the population and mitigate premature convergence, which improves the algorithm’s global search capability. In addition, the randomness introduced by the mutation operator helps accelerate the algorithm’s convergence.
In this study, three distinct mutation operators are selected to update the tour plan of elements: the flip mutation operator (), the insert mutation operator (), and the move mutation operator (). Each operator has unique characteristics and impacts on the optimization process.
  4.4.1. Flip Mutation Operator ()
The flip mutation operator alters a chromosome by reversing a selected gene sequence, which rearranges the gene order and generates new solutions. This operator primarily fosters population diversity, facilitates the escape from local optima, and enhances the exploration of the search space toward the global optimum. By reversing the orientation of the selected genes, it introduces variations that help explore new regions of the search space.
The implementation process of  is outlined as follows:
- Copy  to ; 
- Randomly select two numbers  and , where  and ; 
- Flip the sequence  in ; 
- Return . 
An example of the 
 process is shown in 
Figure 3. The red-colored numbers in the figure represent the selected sequence that are flipped during the process, i.e., 
.
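The flip operation can be sketched in Python as follows. This is a hedged illustration of the steps listed above (copy the tour, select two positions, reverse the sequence between them), not the authors' code. The names `flip_segment` and `flip_mutation`, the 0-based inclusive positions, and the choice to let any position of the tour be selected are assumptions.

```python
import random


def flip_segment(tour, i, j):
    """Return a copy of tour with positions i..j (inclusive) reversed."""
    new_tour = list(tour)                       # work on a copy
    new_tour[i:j + 1] = new_tour[i:j + 1][::-1]
    return new_tour


def flip_mutation(tour, rng=None):
    """Reverse a randomly chosen sequence of the tour (needs >= 2 cities)."""
    rng = rng or random.Random()
    i, j = sorted(rng.sample(range(len(tour)), 2))   # two distinct positions
    return flip_segment(tour, i, j)
```

For instance, `flip_segment([1, 2, 3, 4, 5, 6], 1, 3)` returns `[1, 4, 3, 2, 5, 6]`.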
  4.4.2. Insert Mutation Operator ()
The insert mutation operator mutates a chromosome by relocating a gene from one position to another within the chromosome. This reshuffling produces novel solutions and focuses on enhancing the local search capability of the algorithm. By enabling the population to better adapt to local optimal solutions, this operator ensures that the search process can effectively exploit the local regions of the search space.
The implementation process of  is outlined as follows:
- Randomly select two numbers  and , where  and ; 
- Remove  from  and generate a new tour - ; 
- Randomly select a number , where ; 
- Insert the sequence  into  starting from the -th position; 
- Return . 
An example of the 
 process is shown in 
Figure 4. The red-colored numbers in the figure represent the selected sequence that are inserted during the process, i.e., 
.
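A corresponding Python sketch of the insert operation is given below. It follows the listed steps (remove a randomly selected sequence, then reinsert it at a randomly selected position of the shortened tour) and is an illustration only. The names `insert_segment` and `insert_mutation` and the 0-based positions are assumptions of the sketch.

```python
import random


def insert_segment(tour, i, j, k):
    """Remove positions i..j (inclusive) and reinsert them at position k
    of the shortened tour."""
    tour = list(tour)
    segment = tour[i:j + 1]
    rest = tour[:i] + tour[j + 1:]      # the tour without the segment
    return rest[:k] + segment + rest[k:]


def insert_mutation(tour, rng=None):
    """Relocate a randomly chosen sequence of the tour (needs >= 2 cities)."""
    rng = rng or random.Random()
    n = len(tour)
    i, j = sorted(rng.sample(range(n), 2))
    rest_len = n - (j - i + 1)
    k = rng.randint(0, rest_len)        # any gap of the shortened tour
    return insert_segment(tour, i, j, k)
```

For instance, `insert_segment([1, 2, 3, 4, 5, 6], 1, 2, 3)` returns `[1, 4, 5, 2, 3, 6]`. Note that the random variant may reinsert the sequence at its original position, in which case the tour is unchanged.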
  4.4.3. Move Mutation Operator ()
The move mutation operator displaces a gene segment either left or right within the chromosome, altering its structure and generating new solutions. This operator is designed to diversify local chromosome structures, thereby facilitating broader exploration of the solution space. By modifying the positions of gene segments, it helps discover new configurations that might lead to better solutions.
The implementation process of  is outlined as follows:
- Copy  to ; 
- Randomly select a number , where ; 
- Swap the segments  and the tour  in ; 
- Return . 
An example of the 
 process is shown in 
Figure 5. The red-colored numbers in the figure represent the selected sequence that are moved during the process, i.e., 
.
Although these three operators share the commonality of being mutation operators, their specific operations and mutation effects differ. The concurrent utilization of these operators enhances population diversity and ensures a thorough exploration of the solution space. The flip mutation operator contributes to global exploration by introducing significant changes, the insert mutation operator improves local adaptation by fine-tuning gene positions, and the move mutation operator provides structural diversity to prevent premature convergence. Together, these operators synergize to create a robust optimization process capable of efficiently finding high-quality solutions.
  4.4.4. The Calculation Process of the Effect of Mutation Operators
In this study, the usage probabilities of each mutation operator are dynamically adjusted based on their optimization effects. The process involves the following steps:
		  
- (1) Initial probability assignment: Each mutation operator is initially assigned the same usage probability for evolving the individuals in the population. 
- (2) Optimization effect calculation: For each generation, the optimization effect of each mutation operator is determined by identifying the individual with the greatest improvement due to that operator, where the improvement is measured from the objective function values before and after applying the operator. 
- (3) Average effect over generations: The optimization effects of each mutation operator are averaged over all generations, up to the maximum iteration, to determine its overall effectiveness for each TTP instance. 
- (4) Summing effects across TTP instances: To generalize the effectiveness of each mutation operator, the average effects are summed across the different test instances, i.e., over the total number of TTP instances that need to be solved. 
- (5) Determining usage probabilities: The final step determines the usage probability of each mutation operator based on its cumulative effect. 
In this approach, multiple mutation operators are integrated by assigning different probabilities to each operator. This ensures that the operators are selected based on their assigned likelihoods, allowing for a balanced application of each operator during the optimization process. The pseudocode for this integration is outlined in Algorithm 3.
          
| Algorithm 3 Integrated mutation operator | 
| Require: 
                      Ensure: 
                        1:if  then  2:    ⇐  3:else if  then  4:    ⇐  5:else  6:    ⇐  7:end if  8:Return 
 | 
In Algorithm 3,  is set to the probability  for the flip mutation operator,  is set to the sum of probabilities  for the flip and insert mutation operators.
By following this process, the selection of a mutation operator for each individual in the population is probabilistically determined. This method allows for dynamic and balanced integration of multiple mutation operators, enhancing the exploration and exploitation capabilities of the optimization algorithm. The probabilities  (where ),  (where ), and  (where ) are assigned based on the optimization effects of each operator, ensuring that more effective operators are applied more frequently.
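The probabilistic selection can be illustrated with a short Python sketch. Two points are assumptions rather than statements of the paper: the usage probabilities are obtained by normalizing the (non-negative) cumulative effects so that they sum to 1, and the function names `usage_probabilities` and `choose_operator` are hypothetical. The selection itself follows the thresholds described for Algorithm 3: the first threshold is the flip probability and the second is the sum of the flip and insert probabilities.

```python
import random


def usage_probabilities(cumulative_effect):
    """Map operator name -> usage probability.

    Assumption of this sketch: probabilities are the cumulative effects
    normalized to sum to 1, falling back to equal shares if all are zero.
    """
    total = sum(cumulative_effect.values())
    if total <= 0:
        share = 1.0 / len(cumulative_effect)
        return {name: share for name in cumulative_effect}
    return {name: e / total for name, e in cumulative_effect.items()}


def choose_operator(p_flip, p_insert, rng=None):
    """Pick 'flip', 'insert', or 'move' with cumulative thresholds."""
    rng = rng or random.Random()
    r = rng.random()                 # uniform in [0, 1)
    if r < p_flip:
        return "flip"
    if r < p_flip + p_insert:
        return "insert"
    return "move"                    # remaining probability mass
```

With cumulative effects of 2, 1, and 1 for the flip, insert, and move operators, the normalized probabilities would be 0.5, 0.25, and 0.25, so the flip operator would be applied to about half of the individuals.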
  4.4.5. Conclusion of the Integrated Mutation Operator
A key enhancement in the FECOIMO algorithm over the original FECM is the dynamic adjustment of mutation operator usage based on their effectiveness during the optimization process. By evaluating the contribution of each mutation operator throughout the evolutionary process, FECOIMO can assign usage probabilities that prioritize more effective strategies. This adaptive approach ensures that the algorithm can efficiently explore and exploit the solution space, increasing the likelihood of finding the global optimum.
Rather than relying on a single, static mutation strategy, this probabilistic adjustment allows the algorithm to balance exploration and exploitation dynamically. This flexibility is crucial in navigating complex optimization problems and represents a significant improvement that enhances the overall performance of the FECOIMO algorithm.
  4.5. Heuristic Operator
Heuristic operators are strategies or rules employed during problem-solving to guide the search process. They are designed based on specific knowledge and experience within the problem domain to help algorithms avoid local optima, accelerate the search process, reduce the search space, and thus more efficiently explore the solution space. Heuristic operators aim to find high-quality solutions, improve the efficiency and performance of algorithms, and identify optimal solutions within a reasonable timeframe.
In this paper, when optimizing the picking plan in the TTP, the thief’s speed is directly affected by the weight of the items carried: heavier loads slow the thief down. Additionally, since each item is available in multiple cities, for each item t the last city on the optimized route that contains item t is identified. Picking item t from that city secures the value of the item while minimizing the extra travel time incurred by carrying it, thereby maximizing the thief’s overall profit. Therefore, this paper designs a heuristic operator to optimize the picking plan.
The heuristic operator optimizes the picking plan in the following steps:
- (1)
- Adjustment of picked items: Initially, every previously picked item t in the picking plan is reassigned so that it is picked from the last city on the tour that contains it. 
- (2)
- Random removal of an item: To increase diversity and generate a broader range of solutions, one of the already picked items is randomly removed from the knapsack with a preset probability. 
- (3)
- Calculation of remaining capacity: The remaining capacity of the knapsack is then calculated. 
- (4)
- Selection of unpicked items: Within the knapsack’s remaining capacity, unpicked items are selected in descending order of their value-to-weight ratio. Each selected item t is likewise picked from the last city on the tour that contains it.
             
Overall, the heuristic operator enhances the algorithm’s ability to explore and exploit the solution space effectively, leading to better optimization performance on the picking plan of the TTP.
The pseudocode in Algorithm 4 illustrates the implementation of the heuristic operator for optimizing the picking plan.
        
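| Algorithm 4 Heuristic operator for the picking plan | 
| Require: 
Ensure: 
1: Initialize
2: for each item t in the picking plan do
3:     Find the last city on the tour that contains item t
4:     Update the picking plan to pick item t from that city
5: end for
6: if the random number is less than the removal probability then
7:     Select a random item from the picked items
8:     Remove that item from the picking plan
9: end if
10: Calculate the remaining capacity of the knapsack
11: Sort unpicked items by their value-to-weight ratio in descending order
12: for i = 1 to size(unpicked items) do
13:     if the weight of item i does not exceed the remaining capacity then
14:         Add item i to the picking plan from the last city on the tour that contains it
15:         Update the remaining capacity
16:     end if
17: end for
18: Return the picking plan
 | 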
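A compact Python sketch of the four steps may help make the operator concrete. The data layout is an assumption made for illustration (a tour as a list of city indices, `availability` mapping each item to the cities that offer it, and a picking plan mapping each picked item to the city it is taken from); item weights are assumed to be strictly positive, and every city offering an item is assumed to appear on the tour.

```python
import random
from typing import Dict, List, Optional, Set


def last_city_on_tour(tour: List[int], cities: Set[int]) -> int:
    """Return the city in `cities` that appears latest on `tour`."""
    position = {city: idx for idx, city in enumerate(tour)}
    return max(cities, key=lambda c: position[c])


def heuristic_pick(tour: List[int], availability: Dict[int, Set[int]],
                   value: Dict[int, float], weight: Dict[int, float],
                   capacity: float, picked: Dict[int, int],
                   removal_prob: float,
                   rng: Optional[random.Random] = None) -> Dict[int, int]:
    if rng is None:
        rng = random.Random()
    # Step 1: move every picked item to the last city on the tour offering it.
    plan = {t: last_city_on_tour(tour, availability[t]) for t in picked}
    # Step 2: with probability removal_prob, drop one picked item at random.
    if plan and rng.random() < removal_prob:
        del plan[rng.choice(sorted(plan))]
    # Step 3: remaining knapsack capacity.
    remaining = capacity - sum(weight[t] for t in plan)
    # Step 4: greedily add unpicked items by value-to-weight ratio.
    # Note that an item removed in step 2 counts as unpicked again.
    unpicked = [t for t in availability if t not in plan]
    unpicked.sort(key=lambda t: value[t] / weight[t], reverse=True)
    for t in unpicked:
        if weight[t] <= remaining:
            plan[t] = last_city_on_tour(tour, availability[t])
            remaining -= weight[t]
    return plan


tour = [0, 1, 2, 3]
availability = {0: {1, 3}, 1: {2}, 2: {1, 2}}
value = {0: 10.0, 1: 8.0, 2: 9.0}
weight = {0: 5.0, 1: 2.0, 2: 3.0}
tight_plan = heuristic_pick(tour, availability, value, weight, 5.0, {0: 1}, 0.0)
loose_plan = heuristic_pick(tour, availability, value, weight, 10.0, {0: 1}, 0.0)
```

In the toy data, item 0 was originally picked in city 1 but is moved to city 3, the last city on the tour that offers it, so it is carried for the shortest possible distance.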
  4.6. Update the Elements
As an iterative algorithm, the performance of FECOIMO in solving the TTP relies heavily on how its elements are updated. In FECOIMO, elements are evaluated by the force F acting on each element, which is closely related to the objective function G. Specifically, if the force on an element is positive, the element is considered a good solution and should be retained. As the objective function value of an element approaches that of the optimal element in the population, its mass value, as defined, decreases, leading to a correspondingly larger force value. Conversely, if the force is nonpositive, the element needs to be updated.
New elements are generated in the vicinity of either the optimal element within the j-th cycle or the global optimal element to replace it. The cycle-best element is chosen as the mutation target with the update probability, and the global optimal element is chosen with the complementary probability. Through conditional checks and random selection of mutation targets, the algorithm achieves a balance between exploitation (utilizing the currently known best solutions) and exploration (seeking new solutions). Specifically, when the cycle’s best element is chosen for mutation, the algorithm focuses more on local search; when the global best element is chosen, it emphasizes global search. This balance helps to improve the overall performance of the algorithm. The pseudocode for the element updating process is shown in Algorithm 5.
        
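| Algorithm 5 The update of the element | 
| Require: 
Ensure: 
1: if the force on the element is positive then
2:     retain the element
3: else
4:     if the random number is less than the update probability then
5:         generate a new tour by mutating the cycle-best element
6:         update the picking plan with the heuristic operator
7:     else
8:         generate a new tour by mutating the global best element
9:         update the picking plan with the heuristic operator
10:     end if
11: end if
12: Return the updated element
 | 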
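The update rule can be sketched as a small Python function. This is a simplified illustration, not the paper's code: the solution type is left generic, `mutate` stands in for the integrated mutation operator followed by the heuristic operator, and all parameter names are hypothetical.

```python
import random
from typing import Callable, Optional, TypeVar

S = TypeVar("S")  # any solution representation


def update_element(element: S, force: float, cycle_best: S, global_best: S,
                   update_prob: float, mutate: Callable[[S], S],
                   rng: Optional[random.Random] = None) -> S:
    """Keep a good element, or replace it with a mutated copy of a best one."""
    if rng is None:
        rng = random.Random()
    if force > 0:
        # Positive force: the element is a good solution and is retained.
        return element
    # Nonpositive force: mutate the cycle-best element with probability
    # update_prob, otherwise mutate the global best element.
    parent = cycle_best if rng.random() < update_prob else global_best
    return mutate(parent)


def mark(solution: str) -> str:
    """Toy stand-in for mutation: tag the parent so its origin is visible."""
    return solution + "*"


kept = update_element("x", 0.5, "cycle", "global", 0.9, mark)
from_cycle = update_element("x", -1.0, "cycle", "global", 1.0, mark)
from_global = update_element("x", -1.0, "cycle", "global", 0.0, mark)
```

Setting the update probability to 1 or 0 reproduces the two extreme behaviours, mutation only around the cycle-best element or only around the global best element.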
  4.7. Implementation of FECOIMO
The flowchart of the FECOIMO algorithm for solving the TTP is depicted in Figure 6.
The process begins with setting the algorithm parameters and generating the initial population. The initial objective function values, masses, and forces of the elements are then calculated. The optimal element within the j-th cycle and the current optimal element are identified.
During the k-th generation, the algorithm determines how to update the elements based on the calculated forces. Specifically, the elements are updated using different strategies depending on whether the force is nonpositive and based on a probabilistic decision.
The integrated mutation operator is applied to update the elements, and the specific mutation operator is selected using a random number. If the random number is less than or equal to the first predefined threshold, the flip operator is used. If it lies between the first and second thresholds, the insert operator is used. Otherwise, the remaining operator is applied.
Once the tour plan has been updated, the heuristic operator is used to update the other decision variable, the picking plan.
After updating the elements, the objective function values and masses are recalculated, and the current optimal element is updated accordingly. This process is repeated until the termination criteria are met, at which point the algorithm stops and outputs the optimal solution.
  5. Experimental Results and Analysis
This section presents a comprehensive analysis of the experimental results obtained by applying the proposed Five-Element Cyclic Integrated Mutation Optimization (FECOIMO) algorithm to the Traveling Thief Problem (TTP). The experiments are designed to validate the effectiveness of FECOIMO in solving a variety of TTP instances, ranging from small to large scales, and to compare its performance with that of five well-known metaheuristic algorithms. The analysis covers several aspects, including the operation environment and TTP instances used, the determination of maximum iteration for each instance, the effects of different mutation operators, the determination of update probability, and a comparative analysis of FECOIMO against other algorithms in terms of performance, convergence behavior, and execution time.
  5.1. Operation Environment and Traveling Thief Problem (TTP) Instances
The experiments conducted in this study were implemented in MATLAB R2018a and run on a system equipped with an Intel Xeon-E5645 processor and 32 GB of RAM, running Windows 10. The dataset consists of a representative subset of the TTP instances generated by Bonyadi et al. [1].
The name of each TTP instance encodes four fields, in order: n, the number of cities; m, the number of items; the instance identity; and the tightness of the capacity constraint (i.e., the ratio of the knapsack capacity to the total weight of the items). The values for n are chosen as 10, 20, 50, and 100, with corresponding values for m as described in the paper. For the instances considered in this paper, three values of the capacity tightness are selected: 25, 50 and 75. This selection results in a total of 39 TTP instances of various scales, used to comprehensively validate the performance of FECOIMO.
The parameters of the proposed FECOIMO include the number of cycles q, the number of elements in each cycle L, the maximum iteration, the update probability, and two further parameters. Based on the findings in reference [33], two of these parameters were fixed in advance. The remaining parameters were fine-tuned through experimentation.
  5.2. Determination of the Maximum Iteration Corresponding to Each TTP Instance
Because the number of cities (n) and items (m) varies between TTP instances, the corresponding maximum iteration count also differs. It is determined from the evolutionary process of FECOIMO on each instance: the maximum iteration count is identified as the iteration at which the objective function value converges.
For brevity, Figure 7, Figure 8 and Figure 9 display convergence curves for a subset of 9 of the 39 instances. These figures show the convergence process for small, medium, and large TTP instances. Each subplot illustrates the convergence for an instance of a different type but of the same scale.
The convergence curves show that the number of iterations required for convergence depends primarily on n and m, with larger values leading to a larger maximum iteration count. Based on this convergence behavior, a suitable maximum iteration count is determined for each instance, as outlined in Table 2.
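One way to read a convergence point off a best-so-far curve programmatically is sketched below. The criterion used here (the first iteration whose value lies within a relative tolerance of the final value) and the tolerance itself are assumptions for illustration; the text only states that the maximum iteration is the iteration at which the objective value converges.

```python
from typing import Sequence


def convergence_iteration(best_so_far: Sequence[float],
                          rel_tol: float = 1e-3) -> int:
    """Return the first 1-based iteration within rel_tol of the final value.

    Assumes a non-empty, monotone best-so-far curve, so every later
    iteration is also within the tolerance.
    """
    final = best_so_far[-1]
    scale = max(abs(final), 1e-12)  # guard against a final value of zero
    for k, current in enumerate(best_so_far, start=1):
        if abs(final - current) / scale <= rel_tol:
            return k
    return len(best_so_far)


curve = [1.0, 5.0, 9.0, 10.0, 10.0, 10.0]
strict = convergence_iteration(curve)               # exact plateau
relaxed = convergence_iteration(curve, rel_tol=0.2)  # within 20% of final
```

A looser tolerance yields an earlier convergence point, so the tolerance should be chosen with the scale of the objective values in mind.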
  5.3. Determination of the Integrated Mutation Operator
In this paper, the effect of each operator on each instance is calculated according to the method introduced in Section 4.4.4, and the usage probability of each mutation operator is determined from these effects in order to integrate the operators. Initially, the two selection thresholds are set so that all three mutation operators are used with equal probability. The maximum iteration count is taken from Table 2. To reduce the influence of randomness, FECOIMO is independently run 30 times on each instance, and the final effect of each operator is averaged over these 30 runs.
Figure 10, Figure 11 and Figure 12 illustrate the average effect of the three mutation operators during the evolution process on nine representative TTP instances out of the 39. These nine instances are selected to cover small-scale, medium-scale, and large-scale problems. Each graph shows the average effect of the mutation operators over 30 independent runs of FECOIMO, highlighting the contribution of each operator throughout the evolutionary process.
It can be concluded that two of the mutation operators have comparable effects, while the remaining operator performs better than both, especially as the scale of the instance increases. In the small-scale TTP instances (Figure 10), the two comparable operators have similar effects, while the remaining operator shows a significantly better effect. In the medium-scale instances (Figure 11), it again performs significantly better than the other two operators. In the large-scale instances (Figure 12), its effect is the most prominent, with an optimization capability far exceeding that of the other two operators. This may be due to that operator’s ability to adjust the structure of the solution more effectively, thus finding better solutions in more complex search spaces. This finding indicates that accounting for the roles and strengths of the different operators is crucial when designing the algorithm and improving its overall performance.
Table 3 provides a quantitative analysis of the effect of each mutation operator across all 39 instances. Notably, two of the operators have comparable effects, while the remaining operator becomes more influential as the instance scale increases: on smaller instances it is about twice as effective as its counterpart, whereas on larger instances its effectiveness is nearly quadruple. This underscores the importance of that operator in exploring solutions within a larger search space. The usage probabilities of the three operators in the integrated mutation operator, and hence the two selection thresholds, are obtained from the results in Table 3.
   5.4. Determination of Update Probability
When employing FECOIMO to update elements, the update probability plays a crucial role in determining whether the cycle-best element or the global optimal element is mutated. As delineated in Algorithm 5, the cycle-best element is selected with the update probability, while the global optimal element is selected with the complementary probability. A parameter experiment is conducted in which only the update probability is varied. It ranges from 0 to 1, with the two extremes corresponding to mutation solely on one of the two elements.
To ensure experimental reliability, each instance is independently executed 30 times. Table 4 and Table 5 present the mean benefits and Friedman test ranks from 30 independent runs of FECOIMO with different update probability values. The Friedman test ranks [37] indicate the performance of the algorithm under the various settings, with a lower rank indicating better overall performance.
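For readers unfamiliar with Friedman ranks, the sketch below shows how mean ranks of this kind are commonly computed: on each instance the settings are ranked by mean benefit (rank 1 for the highest benefit, ties sharing the average rank), and the ranks are then averaged over instances. The numbers are made up for illustration and are not taken from Table 4 or Table 5.

```python
from typing import Dict, List


def friedman_mean_ranks(results: Dict[str, List[float]]) -> Dict[str, float]:
    """results[setting][i] = mean benefit on instance i (higher is better)."""
    names = list(results)
    n_instances = len(results[names[0]])
    totals = {name: 0.0 for name in names}
    for i in range(n_instances):
        # Best setting first, so position 0 corresponds to rank 1.
        ordered = sorted(names, key=lambda s: results[s][i], reverse=True)
        pos = 0
        while pos < len(ordered):
            # Extend the group while the next setting ties with this one.
            end = pos
            while (end + 1 < len(ordered)
                   and results[ordered[end + 1]][i] == results[ordered[pos]][i]):
                end += 1
            # Tied settings share the average of the ranks they occupy.
            shared_rank = ((pos + 1) + (end + 1)) / 2.0
            for name in ordered[pos:end + 1]:
                totals[name] += shared_rank
            pos = end + 1
    return {name: totals[name] / n_instances for name in names}


toy_results = {
    "0.5": [10.0, 20.0, 30.0],
    "0.9": [12.0, 25.0, 30.0],
    "1.0": [11.0, 15.0, 28.0],
}
mean_ranks = friedman_mean_ranks(toy_results)
best_setting = min(mean_ranks, key=mean_ranks.get)
```

In this toy example the setting labelled `"0.9"` obtains the lowest mean rank and would therefore be preferred.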
Based on these results, the p-value from the Friedman test indicates significant differences in performance between the different update probability values. The optimal update probability is determined to be 0.9, as it has the lowest Friedman rank (2.36) and the highest final rank (1), indicating the best overall performance of the algorithm.
This observation underscores the efficacy of prioritizing the update of the optimal solution within the cycle in most scenarios, complemented by occasional updates of the current optimal solution. This dual-update strategy enhances the algorithm’s global search capability by facilitating synchronous exploration of all cycles, thereby improving search efficiency. Notably, updating the current optimal solution mitigates the risk of algorithmic convergence to local optima. The synergistic integration of these update methods substantially enhances the algorithm’s search prowess.
  5.5. Comparative Analysis of Algorithms
To validate the effectiveness of the proposed FECOIMO algorithm in solving the TTP, it was compared against five other metaheuristic algorithms: Enhanced Simulated Annealing (ESA) [18], Improved Grey Wolf Optimization Algorithm (IGWO) [38], Improved Whale Optimization Algorithm (IWOA) [39], Genetic Algorithm (GA) [40], and Profit Guided Coordination Heuristic (PGCH) [41]. To ensure a fair comparison, the parameter settings of these algorithms were aligned with those of FECOIMO wherever similar parameters were involved, while other parameters were set according to their original proposals. Each algorithm was independently executed 30 times on each of the 39 TTP instances. Their performance was evaluated based on execution time, solution quality, and statistical significance using the Friedman test.
  5.5.1. Performance Comparison
Table 6, Table 7 and Table 8 show the mean, maximum, and standard deviations of the objective function values obtained from 30 independent runs of the six algorithms on the 39 TTP instances. These tables provide insights into the performance of each algorithm on TTP instances of different sizes. At the bottom of each table, the Friedman test ranks are displayed, offering a statistical comparison of the algorithms’ performances.
It can be observed that FECOIMO consistently ranks first in the Friedman test across Table 6, Table 7 and Table 8, and the corresponding p-values indicate significant differences among the six algorithms. Moreover, as the instance size increases, FECOIMO’s advantage over the other algorithms becomes more pronounced. This suggests that the operators designed specifically for FECOIMO are effective on larger instances, whereas the simpler mechanisms employed by the other algorithms likely limit their ability to find high-quality solutions at that scale.
Given that the Friedman test p-values reported in Table 6, Table 7 and Table 8 indicate significant differences among the six algorithms, multiple comparison tests were applied to identify the specific differences between each pair of algorithms. Table 9, Table 10 and Table 11 show the adjusted p-values from these multiple comparison tests. The detailed comparisons reveal significant differences between specific pairs of algorithms; for example, Table 9 reports the adjusted p-value for the comparison between GA and FECOIMO. In these tables, p-values below 0.05 indicate that the difference between the corresponding row and column algorithms is statistically significant.
The differences between ESA and FECOIMO are not significant, as the corresponding p-values in Table 9, Table 10 and Table 11 are greater than 0.05. This indicates that there are no significant differences between ESA and FECOIMO in terms of the mean, variance, and maximum results. ESA is a mature and well-established algorithm that is particularly effective in path optimization problems, so matching its performance is itself evidence of FECOIMO’s strength. In addition, FECOIMO demonstrates advantages in balancing exploration and exploitation and in avoiding premature convergence, which suggests that it is adaptable and robust in handling complex optimization problems.
From Table 9, Table 10 and Table 11, it can be observed that the differences in the mean results between GA and each of ESA, IWOA, and PGCH are not significant. This indicates that their overall performance is comparable, even though it may vary on specific instances or under certain conditions. Similarly, the differences in the maximum results between GA and each of ESA, IWOA, and PGCH are not significant, suggesting that the best solutions they find are very close. Additionally, the differences in the variance of results among IWOA, IGWO, and PGCH are not significant, indicating that these three algorithms have similar stability: the fluctuation in solution quality across different runs is comparable.
These observations collectively suggest that the classic optimization algorithms (GA, ESA, IWOA, IGWO, and PGCH) behave similarly on specific performance metrics when applied to TTP instances of varying scales, whereas FECOIMO stands out as the more effective approach. FECOIMO’s updating operators are tailored to the characteristics of the TTP, which allows it to handle the interdependence of the tour and the picking plan more efficiently and yields better overall performance across different scales of TTP instances.
  5.5.2. Convergence Comparison
Figure 13, Figure 14 and Figure 15 present the convergence curves of the six algorithms on nine representative instances from the 39 TTP instances, covering various scales. These figures illustrate the differences in convergence behavior among the algorithms, providing insight into their performance characteristics. Note that the starting points of the curves differ because each algorithm was run independently with its own randomly generated initial population.
For smaller-scale instances (Figure 13), FECOIMO consistently achieves better convergence than the other algorithms. GA and ESA show slower convergence rates, while IWOA and IGWO perform relatively well in the initial stages but are eventually surpassed by FECOIMO. In the medium- and large-scale instances shown in Figure 14 and Figure 15, FECOIMO continues to outperform the others, demonstrating faster convergence and higher-quality solutions.
  5.5.3. Execution Time and Complexity Comparison
In the comparison of algorithm complexity and the time taken for a single run of each algorithm across different problem scales, it is observed that while the algorithms have similar theoretical complexities, their actual execution times show significant differences.
The FECOIMO algorithm proposed in this paper demonstrates distinct characteristics in terms of both time complexity and execution time. As shown in Table 12, the time complexity of FECOIMO is expressed in terms of L and q, the number of elements in each cycle and the number of cycles, respectively. The five-element cycle structure allows FECOIMO to conduct a more comprehensive search of the solution space. Although FECOIMO’s time complexity is similar to that of other algorithms such as GA, IWOA, and PGCH, its structure is more intricate, which results in a somewhat higher computational cost, especially on large-scale problems.
In terms of the execution times shown in Table 13, FECOIMO exhibits significantly longer run times than the other algorithms across the various instances. This can be attributed to the additional computational steps and deeper exploration of the solution space inherent in FECOIMO. While these extra computations increase execution time, they also enhance the algorithm’s capability to solve large-scale complex problems. In other words, the additional computational overhead in FECOIMO translates into better solutions, which is particularly important when dealing with complex combinatorial optimization problems.
Overall, despite the increased execution time, FECOIMO’s superior performance on complex problems justifies this additional time investment. Through comparison, it is evident that FECOIMO’s design achieves a new level of balance between computational complexity and solution accuracy, providing an effective approach to solving complex optimization problems.
  6. Conclusions and Future Work
In this paper, we introduced the Five-element Cycle Integrated Mutation Optimization algorithm (FECOIMO), designed specifically to address the complexities of the Traveling Thief Problem (TTP). By integrating the Five-element Cycle Model with a tailored set of mutation and heuristic operators, FECOIMO effectively manages the dual challenges of optimizing both the tour and picking plans across 39 TTP instances of varying scales and complexities. The algorithm’s iterative approach, enhanced by the integration of specialized mutation operators, significantly improves its ability to explore the search space and avoid premature convergence, thereby yielding superior solutions.
The efficacy of FECOIMO was rigorously validated through extensive comparative experiments against five other state-of-the-art metaheuristic algorithms. The results clearly demonstrate FECOIMO’s superior performance, particularly in larger and more complex TTP instances, where it consistently outperformed the alternatives in terms of solution quality and robustness. These findings underscore FECOIMO’s capability to address the diverse challenges posed by TTP instances comprehensively.
However, this study also has its limitations. Notably, the role of each mutation operator varies depending on the scale of the problem instances, suggesting that further refinement could involve adapting these operators more precisely to different problem sizes. Future research should explore these differential effects and consider the development of alternative mutation operators that can further enhance the algorithm’s performance. Additionally, integrating these strategies with other advanced optimization techniques could lead to the creation of even more robust and versatile algorithms.
In conclusion, the FECOIMO algorithm represents a significant advancement in solving the TTP by effectively combining mutation operators and heuristic strategies to address this complex combinatorial optimization problem. Future work will focus on refining these strategies, extending their application to other complex optimization challenges, and continuing to build on the algorithm’s practical and theoretical contributions to the field.