1. Introduction
Navigation is a critical research domain in mobile robotics [
1], facilitating robots to achieve optimal paths without collision from one location to another [
2]. Path planning is widely applied in various fields, including the prevention of robot collision [
3], obstacle avoidance [
4], unmanned aerial vehicle (UAV) flight [
5], urban road planning [
6], and logistics vehicle routing [
7]. At present, numerous scholars have explored various path-planning methods, such as the artificial potential field method [
8,
9], Dijkstra’s algorithm [
10,
11], A* algorithm [
12], genetic algorithm [
13,
14], particle swarm algorithm [
15], artificial neural network [
16], and ant colony optimization [
17,
18], among others. The ant colony optimization algorithm (ACO), proposed by M. Dorigo and colleagues [
19], is a distributed heuristic method inspired by natural processes. As a prominent approach within the realm of swarm intelligence systems, ACO emulates the foraging process of ants to identify optimal solutions [
20]. It is distinguished by its capacity for parallel processing, distributed computing, high robustness, and positive feedback mechanisms. Owing to these characteristics, ACO has been widely applied to a variety of combinatorial optimization problems, including quadratic assignment, scheduling, image edge detection, economic modeling, and mobile robot navigation [
21]. In the field of mobile robotics, ACO has demonstrated significant versatility and effectiveness across diverse applications. For instance, in intelligent agricultural environments, path planning based on the ant colony algorithm has greatly advanced agricultural robotics technology, enabling autonomous agricultural vehicles to accurately navigate complex field terrains during operations, such as sowing, spraying, and harvesting, thereby effectively improving overall operational efficiency [
22]. In the warehousing and logistics industry, ACO can assist Automated Guided Vehicles (AGVs) in global path planning, thereby helping to streamline production workflows and reduce operational costs [
23]. In emergency tasks, such as disaster relief and search and rescue, robots can efficiently perform path planning and navigation in complex and unpredictable environments using the ACO algorithm, thereby better ensuring the safety of human life and property [
24,
25]. Furthermore, in document management domains, like libraries and archives, the ACO algorithm provides an efficient path planning approach, significantly improving the efficiency of document retrieval and storage [
26]. Additionally, in hazardous environments containing radioactive materials, such as nuclear power plants, mobile robots utilize ant colony algorithm-based path planning to develop efficient inspection routes, thereby effectively ensuring the safe and stable operation of critical equipment [
27].
However, in practical application, the ACO exhibits several limitations, including stagnation, premature convergence, low search efficiency, difficulty in determining control parameters, susceptibility to local optima, and a significant number of lost ants [
28,
29,
30]. These shortcomings become increasingly pronounced as the complexity of the environmental map increases. To enhance the performance of ACO, numerous researchers have proposed some improvements. To effectively mitigate the issue of search blindness, Zhao et al. [
31] introduced an approach that involves an uneven distribution of initial pheromones along with directional heuristic information. Similarly, Jiang et al. [
32] developed an improved ACO-based algorithm that enhances the initial pheromone concentration, dynamically regulates the pseudorandom transfer strategy, updates the pheromones of high-quality ants, and adaptively adjusts the volatility coefficient. These modifications have significantly improved global search ability, convergence rate, and computational efficiency. Additionally, Miao et al. [
33] introduced an improvement adaptive ant colony algorithm (IAACO), which refines the transition probabilities and incorporates an adaptive pheromone update mechanism to further optimize performance. To minimize the number of lost ants, Wu et al. [
34] implemented a rollback mechanism within the standard ACO framework to reduce the loss of ants, thereby improving target reachability and mitigating the disruptive effects of invalid pheromones on ACO evolution. Yue et al. [
35] introduced a penalty-based strategy that enhances the ants’ ability to explore uncharted regions by accelerating pheromone evaporation on suboptimal paths. To address the challenge of deadlocks and improve global search efficiency, Jiao et al. [
36] developed an adaptive polymorphic ant colony algorithm. Additionally, Zhang et al. [
37] introduced an improved ant colony system (EACSPGO), which can effectively speed up convergence rate and produce a better solution, as well as further optimize the initial path through the path geometry optimization method.
Based on the aforementioned studies, the performance of ACO has been enhanced using various optimization measures. However, certain limitations persist. One notable issue is the significant number of lost ants—particularly in complex environments—which has not been adequately addressed. Additionally, while the optimization path has seen improvements, there remains considerable potential for further optimization; however, only a few researchers have explored this aspect. Furthermore, the total turning angle of the shortest route requires further investigation to enhance overall efficiency.
This paper proposes an improved ant colony optimization algorithm, called the Enhanced Ant Colony Algorithm based on Islands (EACI). Compared with other variants of ACO, the main contributions of this work are summarized as follows:
- (1)
To achieve efficient pathfinding, the algorithm introduces an island-based division strategy between the starting and ending points. Specifically, grid cells in the original map that are likely to cause deadlocks are first converted into obstacle grids to form an auxiliary map. Then, through using the search circle between the start and end points on the auxiliary map, several intermediate islands are identified and mapped back to the original grid. Ants travel from the starting point, sequentially passing through these intermediate islands, and then finally reach the destination.
- (2)
To improve the efficiency of path planning between adjacent islands, the algorithm applies a non-uniform pheromone distribution during the initialization stage. Different concentrations of pheromones are preset between islands, which effectively accelerates the convergence speed.
- (3)
To further enhance the global optimization ability of the algorithm, an adaptive pheromone evaporation coefficient is introduced to dynamically update pheromone levels. In addition, extra pheromones are added to the optimal path as a reward, while pheromones on the worst path are reduced as a penalty, thereby improving the search direction and stability.
The rest of this paper is structured as follows.
Section 2 focuses on environment modeling.
Section 3 covers the ACO algorithm.
Section 4 introduces the EACI algorithm.
Section 5 presents the process of the EACI algorithm in path planning.
Section 6 details the simulation experiments and analyzes the results.
Section 7 validates the approach through real vehicle experiments. Finally,
Section 8 summarizes this study with conclusions.
2. Modeling of the Environment and Mobile Robots
Due to its simplicity and efficiency, the grid-based approach is commonly employed for environment modeling [
38]. Consequently, the environment is constructed by the grid method in this paper. In this representation, unoccupied grids are represented by 0 and displayed in white, whereas obstacle grids are marked by 1 and shown in black. To precisely identify the location of each grid cell, a Cartesian coordinate system, along with serial number notation, is utilized [
39]. For example,
Figure 1 presents an example of an environment modeled using the grid method. The modeled environment has 10 × 10 grids, where the robot “R” has eight optional grids available for movement during path planning.
In a Cartesian coordinate system, the center of each grid is uniquely determined by its absolute coordinates (
x,
y). Utilizing the serial numbering method, grid cells are sequentially assigned unique numbers from left to right and top to bottom starting from 1, ensuring that each cell is distinctly identified. The complete set of serial numbers assigned to all grids within the environment is expressed by the following (
1):
In Equations (
1)–(
4),
g indicates the grid number;
n and
m denote the total number of rows and columns in the environment, respectively; and
and
refer to the mobile robot’s length and width, respectively. To ensure that the robot can perform in-place rotations and other maneuvers smoothly within free grid cells, the parameter
a is set to the larger of
and
. The
function calculates the remainder of a division operation, whereas the
function rounds a given value up to the nearest integer.
A differentially driven mobile robot (DDMR) is commonly utilized for validating path planning verification [
40].
Figure 2 depicts the motion analysis of a DDMR during a turning maneuver, where linear velocities of the two driving wheels are expressed as follows:
where
and
represent the inner and outer linear velocities of the DDMR, respectively;
signifies the location of the DDMR in the Cartesian coordinate framework;
v denotes the linear velocity of the centroid;
indicates the angular velocity of the DDMR as it turns around point
; the turning radius is
R; and
B signifies the width of the robot. As presented in Equation (
5), the linear velocity of the centroid is formulated as follows:
It can be derived from Equations (
5) and (
6) that
R and
yield the following:
According to Equation (
7), when
and
, the DDMR achieves in-place rotation. During the turning process, if
remains constant, the larger turning angle
, the more time it takes. This observation ensures that the DDMR can smoothly execute both turning and linear motion, providing the necessary conditions for verifying the paths calculated by the algorithm.
3. Brief Introduction of ACO
The ACO is employed to address optimization problems by imitating the natural foraging process of ants, which rely on pheromone trails to identify the shortest path from their nest to food sources. During the early phase of this process, ants explore their surroundings randomly. As they traverse different paths, they deposit pheromones along the path. The pheromone concentration increases as more ants follow a particular trail, reinforcing its attractiveness. This positive feedback loop encourages even more ants to choose the route with the strongest pheromone concentration, ultimately resulting in the identification of the shortest path.
Assuming that the
kth ant moves from position
i to position
j at time
t, where position
i refers to the ant’s current grid and position
j indicates the unvisited grid to which the ant will move. If there are multiple unvisited grids
j around the current grid
i, the transition probability
is determined using Equation (
8), and the grid
j is selected using the roulette model. The following expression describes Equation (
8):
where
represents the pheromone amount between position
i and position
j at time
t;
and
refer to the pheromone concentration factor and expected heuristic factor, respectively;
denotes the collection of unvisited grids around the position
i; and
indicates the heuristic value that can be determined by Equation (
9):
where
indicates the Euclidean distance between the next position
j and the ending position. Once all ants have finished an iterative search, the pheromone on each path is updated using Equation (
10):
where
signifies of the pheromone evaporating rate parameter,
indicates the pheromone increment of
kth ant from position
i to position
j in the current iterative cycle, and
m denotes the total number of ants. The parameter
can be calculated as follows:
where
is the route length searched by the
kth ant, while
Q (the value remains fixed and positive) represents the pheromone accumulation parameter.
4. Introduction of EACI
The conventional ACO has a disadvantage: it is prone to become trapped in local optimization early in the process. This issue arises because the selection of transition probability does not always guarantee the identification of the global optimal solution. In addition, the convergence rate is constrained by the limitations of a heuristic search. Consequently, higher environmental complexity increases the probability of stalemates for the ant. A significant challenge arises when the ants face restricted movement due to the presence of taboo lists, obstacles, and boundaries, leading to a condition called a deadlock. Specifically, the taboo list is a temporary memory maintained by each ant to store the nodes it has already visited during its current traversal, preventing it from revisiting the same nodes and forming cycles. Obstacles refer to impassable grid cells that represent physical barriers in the environment. Boundaries define the outer limits of the grid map, beyond which ants are not allowed to move. Hou et al. [
41] categorized deadlock into three types, as illustrated in
Figure 3. The first type of deadlock occurs when ants enter a dead end, which is indicated by a yellow grid. The second type of deadlock arises due to the combined influence of taboo lists, obstacles, and boundaries, which is denoted by a red grid. The third type of deadlock is exclusively caused by taboo lists, which is represented by a cyan grid.
For the purpose of enhancing the performance of ACO and to minimize the number of lost ants, we propose an enhanced variant algorithm of ACO—known as EACI. This algorithm incorporates several key modifications, including an island-based division strategy, a non-uniform pheromone distribution, an improved heuristic function, and an optimized global pheromone updating rule.
4.1. Island Division Strategy
To reducing the quantity of lost ants in a complex grid map, additional intermediate grids—referred to as “islands”—are introduced between the start (S) and end (E) points before path planning. These island grids represent mandatory waypoints that the optimal path must traverse. Consequently, the original environment map requires modification to ensure that the deadlock grid is not mistakenly selected as the island grid.
4.1.1. Grid Map Preprocessing
To prevent the selection of deadlock grids as island grids, it is essential to assess the status of the four adjacent grids surrounding each free grid. As illustrated in
Figure 4, if two of these adjacent grids are classified as obstacle grids, the free grid is at risk of causing a deadlock. Consequently, it is transformed into an obstacle grid, represented by the number one, as specified in Equation (
12). This preprocessing step converts the potential deadlock grids into obstacles, resulting in the creation known as the auxiliary map. For example, after preprocessing the grid map, as shown in
Figure 1, the resulting auxiliary map, as depicted in
Figure 5, highlights newly added obstacle grids, with black grids framed by blue borders.
4.1.2. Islands Selection
To determine the location of the intermediate islands within the auxiliary map, it is essential to calculate the radius of the search circle (
). This radius can be expressed as follows:
where
signifies the Euclidean distance between
S and
E, and
N refers to the total number of islands, including grids
S and
E. A search circle is defined with its center located at the grid corresponding to the starting grid S and a radius of
. The free grids within the search circle are designated as candidate grids. For each candidate grid, the sum of the Euclidean distances to both the starting and ending points is calculated independently. The detailed formula for this calculation is presented below:
where
indicates the sum of the Euclidean distances from the candidate grid, which is identified by the sequence number
I to the starting point
S and the endpoint
E , and
signifies the set of sequence numbers for selectable grids. The candidate grid with the minimum
value is selected, and its sequence number
I is designated as the next island
. Taking the grid center of
as the reference point, the above steps are repeated until the search circle includes the endpoint
E , thereby determining the sequence of all islands as follows:
where
indicates the serial number of the
zth island.
Taking the auxiliary map of
Figure 1 as an example, as shown in
Figure 6, 3 and 100 represent the starting position and target position, respectively. Assuming that the map requires five islands, the final path planning sequence is given by
. The EACI algorithm sequentially generates the local optimal path between the two adjacent sequence numbers. Connecting all local optimal paths from start to end yields the global optimal path for the entire complex map.
4.2. Modified Heuristic Function
In the basic ACO, the heuristic value only considers the distance between each free grid and the target grid, neglecting the impact of the turning angle in the optimal path calculation. However, a larger turning angle results in higher energy consumption. To address this issue, this study modifies the heuristic function by incorporating the turning angle variable. The specific calculation is shown in Equation (
16):
where
signifies the distance between the next position
j and the ending position, and
indicates the angle coefficient. The distribution of angle coefficients from node
i to node
j is illustrated in
Figure 7 and can be calculated using Equation (
17):
where
signifies the angle coefficient;
refers to the vector that starts with node
and ends with node
i; node
denotes the previous node of node
i in the path; node
i implies the current position of the robot;
refers to the vector that starts with node
i and ends with node
j; node
j indicates the next node of node
i in the path; node
j represents the node to be selected by the robot in the path planning; and
refers to the angle between vector
and vector
.
4.3. Improved Distribution of the Initial Pheromones
During the initial phase of ACO, the pheromone values are uniformly distributed throughout the environment. As a result, ants explore paths with a certain degree of randomness, leading to slow algorithm convergence. To address this issue, this article introduces an uneven initial pheromone distribution strategy, with its calculation principle illustrated in
Figure 8.
The rectangular region (with the adjacent island number
and
as vertices) is designated as the favorable region
. The equation of the straight line passing through points
and
is determined based on their coordinates:
where
A,
B, and
C refer to the relevant parameters of the linear equation.
The formula for calculating the midpoint coordinate
of the adjacent path nodes
and
within the rectangular area
is expressed in Equation (
19). The local initialization pheromone concentration for this region is calculated by Equation (
20). Subsequently, the local initialization pheromone concentration of other favorable regions in the environment is calculated in sequence. Finally, the non-uniform initial pheromone distribution for the entire map can be calculated using Equation (
21):
where
indicates the local initialization pheromone concentration from node
i to node
j in the rectangular area
;
refers to the global pheromone concentration from node
i to node
j in any region of entire map;
implies the grid set of all favorable region;
represents the pheromone expansion coefficient; and
and
signify the maximum and minimum values of pheromone concentration in all favorable region, respectively. The latter can be expressed as follows:
A case of an uneven initial pheromone distribution is shown in
Figure 9, where each red rectangle represents a favorable area between adjacent islands.
4.4. Adaptive Pheromone Update Rule
Once all ants finish an iterative cycle, the pheromones on the original path volatilize at a certain speed, while the new pheromones are deposited by ants traversing through the path. After multiple iterations, the optimal route is identified as the one with highest pheromone concentration. In this study, an improved pheromone update rule is introduced to accelerate pheromone accumulation speed on the optimal route, reduce the pheromone accumulation rate on the worst route, and incorporate a dynamic adaptive volatilization coefficient. The specific calculations are presented as follows:
When all ants complete an iteration, the pheromones of the entire map will be updated by Equation (
23), where
represents the volatilization term, indicating the volatilization of the original pheromone;
implies the item of pheromones addition, signifying the addition of new pheromones to the path the ants have passed; and
refers to the term of reward or punishment, which can be calculated according to Equation (
24). This observation involves enhancing the optimal path by adding extra pheromones to reinforce its guidance for subsequent iterations while reducing pheromones on the worst path to minimize its misleading effect.
represents the optimum route length;
implies the worst route length in current iteration; and
denotes the adaptive volatilization rate, which can be calculated using Equation (
25), where
implies the adjustment coefficient ranging from 0 to 1. From this formula, we can conclude that, at the beginning of the algorithm, a large
leads to a small
and minimal differences in the pheromone values across paths, allowing the ant colony to maintain a strong global path planning capability. As
decreases, the value of
increases gradually, leading to greater differences in pheromone values across paths. Consequently, more ants are guided toward the optimal path, allowing the algorithm to achieve faster convergence in later stages.
5. Process of EACI for Path Planning
In order to improve ACO efficiency, reduce turning angles, and minimize deadlock ants, this study introduces the EACI algorithm. The steps for applying EACI in path planning are outlined below.
Step 1 Initialize the parameters: The parameters, including S, E, m, , , , , N, Q, and , are initialized. Simultaneously, the maximum and current iterations are set to and .
Step 2 Determine the divided island number based on Equations (
13)–(
15).
Step 3 Calculate
, which indicates the initial pheromone, and it is calculated according to Equations (
18)–(
22).
Step 4 Path selection: First, the beginning position
S is designated as the first island, while the ending point
E is set as the
Nth island. The sequence of all islands can be represented as
. Additionally, the number of deadlock ants is initialized to 0. In addition, the ant is initially placed on the first island. Using Equations (
8), (
16), and (
17), the next grid
j is calculated. Once the ant passes through the grid
j, it is added to the taboo list, and this process is repeated. If the ant successfully reaches the next island, that island becomes the new starting position, and the path planning continues, passing through the intermediate islands in sequence until the target point is reached. The complete path formed by connecting all the nodes—which the ant has passed through—is the path discovered by the ant. If the ant has a deadlock between any two adjacent islands, it halts its route search, and the number of lost ants increases by 1. Finally, another ant is placed at the first island to begin path planning from grids
S to
E, following the same process as the previous ant. The iterative cycle continues until all ants have completed their path planning.
Step 5 Global update of the pheromones: Once an iterative search is completed, the paths searched by all ants, from
S to
E, are recorded. The longest and the shortest path lengths are identified from these records. The adaptive volatilization coefficient
can then be calculated using Equation (
25). Following this, the pheromones on shortest route are rewarded, and the pheromones on the longest route are punished, as described in Equation (
24). Finally, the global pheromones for the whole map are updated according to Equation (
23).
Step 6 Search end: If exceeds , the outputs include the optimal route length, its sequence, deadlock ant count, and other associated parameters. Otherwise, let and clear the taboo list. Then, return to Step 4 and continue the path planning process until the stopping condition appears.
Step 7 Output the optimal solution.
The flowchart demonstrating the EACI algorithm for path planning is shown in
Figure 10.
7. Real Vehicle Verification and Results
To validate the effectiveness of the EACI algorithm in a real-world environment, an experimental platform was constructed consisting of a Mini ROS vehicle (which produced by WHEELTEC Co., Ltd. of China in Dongguan) and a personal computer. As shown in
Figure 31, the mini vehicle, measuring 265 mm × 205 mm × 195 mm, served as host computer, while the personal computer acted as the client computer. Both the host and the client were equipped with the 64-bit Ubuntu 18.04 operating system and the Robot Operating System (ROS), Melodic. Communication between the two was established via a Wi-Fi network using the SSH protocol, enabling remote control of the mini ROS vehicle through its IP address.
The schematic diagram of the Mini ROS vehicle is shown in
Figure 31. The main control system of the Mini ROS vehicle includes an STM32 control board (model STM32F407VET6) equipped with an MPU9250 9-axis sensor, a Jetson Nano 4 GB board, a laser radar (M10P), four 12 V DC motors with Giant Magneto Resistance (GMR) encoders, a PlayStation 2 (PS2) wireless controller, and a wireless USB adapter. The laser radar is mounted on top of the vehicle at a height of 180 mm above the ground, and it is primarily used for environmental perception and obstacle detection. Detailed specifications are provided in
Table 4.
After starting the Mini ROS vehicle, the PC connects to it via Wi-Fi and remotely logs into the vehicle system using the Secure Shell (SSH) protocol. The mapping program is launched by entering the command roslaunch turn_on_wheeltec_robot_mapping.launch. Then, a new terminal window is opened on the PC to start the Rviz visualization tool, and the PS2 wireless controller is used to control the vehicle’s movement. The Gmapping SLAM algorithm is employed to construct a grid map. The experimental area measures 480 cm × 300 cm and includes various obstacles, such as L-shaped, U-shaped, and cubic structures. The inner grid regions of the L-shaped and U-shaped obstacles are prone to causing deadlock in the ant colony algorithm, thus providing a suitable test scenario for verifying the proposed algorithm’s ability to avoid deadlocks. To ensure accurate map construction, each grid cell was set to a size of 5 cm × 5 cm. After completing the mapping process, the experimental environment map was exported using the map-saving package, the experimental environment and a map model generated by Laser Radar are displayed in
Figure 32, where black areas represent obstacles and gray areas represent navigable regions.
In the experiment, the robot navigated fully autonomously using the algorithm proposed in this article. Its movement was controlled by a PID controller that followed the path generated by the algorithm. A position feedback system, based on the odometer, precisely measured the robot’s displacement using encoders and relayed this information back to the controller to ensure accurate path tracking.
The EACI algorithm was implemented as a plugin and integrated into the move_base package, with configuration managed through the move_base_params.yaml parameter file. After defining the start points and endpoints set on the map, the robot can execute path planning using the proposed algorithm. As shown in
Figure 33, the DDMR successfully navigated along the designated navigation route and reached the destination without encountering deadlock grids. This finding highlights the robot’s strong navigation capabilities and further validates the effectiveness of the algorithm in specialized environments.
8. Conclusions
This study proposes an EACI algorithm for mobile robot path planning. The algorithm begins by preprocessing the original grid map to generate an auxiliary map, where multiple islands are evenly distributed between the starting and ending points. Non-uniform pheromones are then assigned between adjacent islands, the heuristic function is optimized, and an adaptive volatilization coefficient is introduced to dynamically update the pheromone level. Finally, the paths between adjacent islands are sequentially calculated and connected to form the global path planning scheme on the original map. In the 20 × 20 environment, the average number of iterations for EACI was only 1, representing reductions of 97.53%, 90.48%, and 84.38% compared to the ACO, IACO, and CMEACO algorithms, respectively. Meanwhile, the average number of lost ants was only 9.85, which was 99.21%, 94.28%, and 94.03% lower than that of ACO, IACO, and CMEACO, respectively. In the 30 × 30 environment, the average number of iterations for EACI was 1.4, exhibiting reductions of 98.18%, 93.61%, and 88.03% compared to ACO, IACO, and CMEACO, respectively, while the average number of lost ants was 27.5, representing reductions of 99.12%, 93.38%, and 88.43%. In the 40 × 40 environment, the average number of iterations for EACI was only 6.2, representing reductions of 93.8%, 76.25%, and 60.5% compared to ACO, IACO, and MAACO, respectively. Meanwhile, the average number of lost ants was only 47.6, which was 98.8%, 93.15%, and 74.85% lower than that of ACO, IACO, and MAACO, respectively. In the 50 × 50 environment, the average number of iterations for EACI was only 7.1, showing reductions of 92.9%, 74.54%, and 59.66% compared to ACO, IACO, and MAACO, respectively. At the same time, the average number of lost ants was only 99.2, which was 97.51%, 86.94%, and 74.85% lower than those of ACO, IACO, and MAACO, respectively. These results indicate that the EACI algorithm offers faster convergence and significantly fewer deadlock ants than other models, demonstrating its strong optimization performance. Moreover, real-world experiments confirm that the algorithm can quickly compute the navigation path from the starting to the endpoints, proving its practical application value.