Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach

Huang, Limin; Hu, Tao; Wei, Jiabao; Guo, Yifeng; Tong, Xubin; Ding, Jiaxin; Yang, Hao; Zhong, Bin

doi:10.3390/s26010072

Open AccessArticle

Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach

by

Limin Huang

^1,†,

Tao Hu

¹,

Jiabao Wei

^2,†,

Yifeng Guo

^1,*,

Xubin Tong

¹

,

Jiaxin Ding

¹,

Hao Yang

¹ and

Bin Zhong

¹

School of Mechanical Engineering, Chengdu University, Chengdu 610106, China

²

Multipurpose Utilization of Mineral Resources, Chinese Academy of Geological Sciences, Chengdu 610041, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Sensors 2026, 26(1), 72; https://doi.org/10.3390/s26010072 (registering DOI)

Submission received: 2 November 2025 / Revised: 4 December 2025 / Accepted: 18 December 2025 / Published: 22 December 2025

(This article belongs to the Section Sensors and Robotics)

Download

Browse Figures

Review Reports Versions Notes

Abstract

In meal delivery robot path planning, enabling the robot to find an optimal path that avoids obstacles within its workspace is a crucial step. Usually, the traditional ant colony optimization (ACO) suffers from slow convergence and blind search behavior in path planning, lacking dynamic obstacle avoidance functionality. Meanwhile, the dynamic window approach (DWA) tends to become entrapped in local optima during local path planning. It is therefore proposed that a hybrid path planning algorithm be developed, based on an improved IACO and DWA algorithm. To address issues such as aimless search, slow convergence speed, and low path smoothness in ACO, the concept of gravity from gravity search algorithms is introduced to direct the search. The acceleration of convergence is achieved through the implementation of path sorting and the administration of additional pheromone to superior paths in pheromone updates. The transition paths are optimized to address the issue of excessive path transitions in ACO, resulting in smoother paths. The key nodes of the obtained globally optimal path are used as local target points, serving as multiple target points for DWA operation to enable dynamic obstacle avoidance. Simulation results indicate that compared to the ACO, the IACO reduces path length by up to 30.03% and decreases path turns by up to 71.43% in four different static maps. In other static comparison experiments, the IACO demonstrated superior performance compared to the other tested algorithms. In dynamic experiments, the proposed fusion algorithm can plan smooth paths that successfully avoid both static and dynamic obstacles.

Keywords:

food delivery robot; path planning; ant colony optimization algorithm; dynamic window approach; algorithm combination

1. Introduction

Due to the rapid advancement of intelligent technologies, robotics has found applications across multiple domains [1,2]. Currently, the most prevalent are service robots, capable of performing tasks beneficial to humans. As a subset of service robots, delivery robots are increasingly deployed in venues such as hotels, restaurants, and shopping centers. Given the complexity of restaurant environments, path planning technology has emerged as pivotal element in enabling delivery robots to navigate to their destinations [3]. Path planning algorithms for delivery robots are primarily categorized into two types: global path planning and local path planning [4]. During operation, delivery robots require not only effective global path planning to determine optimal routes but also local path planning with obstacle avoidance capabilities to navigate dynamically changing obstacles.

In robot path planning, common algorithms employed for global path planning include the A* algorithm [5], rapidly exploring random tree (RRT) [6], the genetic algorithm (GA) [7], sparrow search algorithm (SSA) [8], and ant colony optimization (ACO) [9]. These algorithms each possess distinct advantages and are widely applied.

Among them, ACO, as a distributed intelligent bionic algorithm that simulates the mechanism of ants to find the shortest path between nest and food through pheromone communication, is widely cited in various fields. The ACO is robust because of its positive feedback mechanism and also has powerful global search capabilities. However, the ACO suffers from problems such as low search efficiency, slow convergence, and a tendency to fall into local optimization. In order to solve these problems, numerous researchers have extensively studied the ACO. Wu et al. and Li et al. both transformed the path planning problem into a multi-objective optimization problem through multiple comprehensive evaluation metrics. They, respectively, employed the farthest-point optimization strategy and multi-optimization strategies (such as non-uniform initial pheromone distribution and ε-greedy algorithms) to achieve more comprehensive path optimization [10,11]. Wu et al. advanced a novel variant of the ACO, which embeds information such as directional information and path turns into an enhanced heuristic function, improving the quality of paths and search efficiency [12]. Zhang et al. developed an adaptive ant colony algorithm based on population information entropy, while Gao et al. incorporated backtracking and path merging strategies. Both approaches employed similar pheromone diffusion mechanisms for pheromone refinement, achieving a balance between path exploration and convergence speed or enhancing global search capabilities [13,14]. Cui et al. used deterministic state transfer probability rules to accelerate the convergence speed of the ACO and adaptive heuristic functions to optimize the number of turns and path length [15]. Liang et al. applied an ACO to global path planning for deep-sea mining vehicles, while introducing a pheromone updating strategy with a reward-penalty mechanism and an angle-inspired state transfer probability to make the generated paths smoother [16]. Liu et al. enhanced the guidance of pre-selected nodes during the initial search process through a pheromone concentration adaptive setting mechanism and also introduced a pseudo-random transfer strategy and dynamically adjusted pheromone volatilization rate to increase the population diversity and strengthen global search capability [17]. Fu et al. designed a grid environment model based on bidirectional artificial potential fields and introduced potential fields into the environment to provide direction for the ants. They also used the potential energy difference between nodes to optimize the pseudo-random state transfer rule. This improvement enhanced the efficiency of path planning and avoided blind search [18].

In addition to the above algorithms, there are a range of other effective path planning methods. For example, the A*, as a classical global path planning algorithm for static environments, has the advantages of being flexible, efficient, and generating relatively optimal paths, and is often used for path planning of mobile robots. Huang et al. proposed a 5-domain search method with the objective of improving efficiency and ensuring path smoothness [19]. Xu et al. used exploration from both the starting point and the goal point to reduce the number of nodes to be explored, and also improved the A* algorithm by designing a new adaptive cost function and a Slide-Rail corner adjustment method to improve path security and smoothness [20]. The RRT algorithm is a probabilistically complete algorithm for constructing a search tree by random sampling. Ganesan et al. solved the problems of slow convergence with uniform sampling and restricted exploration with non-uniform sampling by utilizing a path planning method with a mixed sampling RRT* that uses both non-uniform and uniform samplers [21]. Li et al. proposed a knowledge-based GA with a novel path representation combining grids and coordinates and a new evaluation method to make it better adapted to very complex environments [22]. The improved same-neighbor crossover operator and fitness function proposed by Lamini et al. for solving GA path planning problems in static environments resulted in a better average number of iterations and average number of transitions [23]. To overcome the susceptibility of SSA to fall into local optima and slow convergence speed, Yan et al. proposed an improved SSA with better search accuracy and faster convergence speed in mobile robot path planning [24].

The local path planning of the food delivery robot needs real-time adjustments according to the data collected by the sensors to ensure that the obstacle avoidance function can be achieved in complex environments. The commonly used local path planning algorithms are artificial potential field (APF) and dynamic window approach (DWA). Researchers enhance the performance of these algorithms by improving them. Liu et al. designed an improved method based on an environment-aware model for RRT guidance, which solves the problems of local minima, target inaccessibility, as well as local trajectory oscillations generated by the APF [25]. In order to address the problem that traditional APF methods are prone to generating zigzag paths in self-driving vehicles, Li et al. designed a real-time path planning method for self-driving cars using the dynamically enhanced fireworks algorithm APF [26]. Shin et al. considered localization accuracy in path planning and proposed a hybrid use of potential and localization risk fields to generate hybrid directed flows to guide driverless vehicles safely and efficiently [27]. The DWA algorithm is also used as a classical and practical local path planning algorithm, which is applied in several fields owing to its strong applicability. Yao et al. advanced a fuzzy logic improved DWA to address the problem of poor robustness of the DWA as well as the non-smoothness of the path, which achieves better path length, smoothness and robustness [28]. Zhang et al. improved the security and stability of local planning by introducing path smoothing coefficients and an improved DWA algorithm with a local goal selection strategy in path planning for USVs [29]. When applying DWA to mobile robot formations, Cao et al. implemented several measures to enhance navigational safety. These included analyzing formation patterns and surrounding obstacle environments, evaluating velocity change coefficients, and designing safety obstacle avoidance distance evaluation coefficients [30]. Wang et al. enabled agricultural robots to navigate more safely and autonomously in complex agricultural environments by integrating a dual-delay depth deterministic strategy gradient (TD3) into DWA and introducing an obstacle motion estimation module [31]. Wang et al. designed a distributed optimization method and an event-triggered behavior switching strategy to enable UGV formations to harmonize formation maintenance, global navigation, and local obstacle avoidance [32]. Abubakr et al. proposed an objective function weight that can optimize the DWA, considering the dynamic characteristics of the obstacles and the use of a fuzzy logic control system that can move quickly toward the target [33]. Niu et al. enhanced the DWA algorithm by using the data acquired from high-precision sensors on the vehicle and utilizing an ACO to update the speed objective function in real time to address the DWA algorithm’s problem of irrational path planning and incapacity to balance speed and driving safety during the task of crossing substantial obstacles [34]. Zhong et al. proposed a hybrid path planning algorithm based on an improved DWA, which enhances the obstacle avoidance effect and real-time path optimization of the DWA algorithm by using the A* algorithm to plan paths and avoid static obstacles in the operating environment, as well as by designing a target selection strategy and an improved method incorporating the speed barrier method [35].

Figure 1 illustrates the general workflow of path planning integrated with control. For a mobile robot to accomplish a navigation process, it generally requires a closed-loop sequence involving perception, positioning, planning, and control. The robot moves from the starting point to the target point while avoiding obstacles and ensuring the optimal path. The perception module acquires the robot’s surrounding environment and its current pose; Path planners are employed to identify optimal routes. Hybrid path planning integrates the global optimization capabilities of global planners with the dynamic responsiveness of local planners, thereby achieving the dual objectives of global optimality and local safety. The execution module will perform real-time motion based on the path planner’s output (linear and angular velocities), while simultaneously feeding back the motion status and pose to the perception module and local planner, thereby forming a closed-loop control system.

The global and local path planning algorithms described in the aforementioned literature each exhibit certain limitations. Scene adaptability is limited, with most improved algorithms designed for specific environments (such as static grids), exhibiting insufficient generalization capabilities in complex dynamic scenarios featuring multiple moving obstacles and abrupt terrain changes. In environments with dense dynamic obstacles, there is insufficient integrated capability regarding real-time performance, safety, and path smoothness. This paper proposes an approach that combines an improved ACO with a DWA to address the constraints in both global and local path planning for delivery robots. The main contributions are as follows:

(1): Improvements to the ACO employ the following strategies: A distribution strategy for initial pheromone concentration is determined by integrating node location data with obstacle information. New heuristics are constructed by incorporating a gravitational mechanism and directional angle heuristics to refine state transition probability rules. Pheromone update rules are implemented by sorting paths and augmenting superior paths with additional pheromone. Simultaneously, redundant turning points in paths are optimized to enhance path smoothness.
(2): Key nodes in global path planning are employed as target points for local paths. Dynamic obstacle distance evaluation subfunctions, linear velocity change evaluation subfunctions, and path proximity evaluation subfunctions are introduced. These are integrated with an IACO.
(3): Experimental verification was conducted on both the IACO and the fusion algorithm. Simulation experiments demonstrated that the IACO exhibits superior advantages in path length, smoothness, and convergence time. In obstacle avoidance experiments, the fusion algorithm effectively achieves obstacle avoidance planning.

The content of this paper is as follows: Section 2 introduces the fundamental theory of traditional ACO and IACO algorithms. Section 3 describes enhancements to the DWA and the overall rules for the hybrid algorithm. Section 4 details simulation experiments and result analysis. Section 5 presents the conclusions.

2. An Improved ACO (IACO) for Global Path Planning

2.1. Establish a Grid-Based Environmental Model

Before applying path planning algorithms to robotic path planning, the surrounding environment must be abstracted into a mathematical model. For two-dimensional environments, using the grid method for modeling is a more widely adopted and effective approach. In grid maps, the surrounding environment is treated as a two-dimensional plane, which is divided into an array of squares of equal size containing binary information. Typically, black squares represent obstacles or impassable areas, with a value of 1; white squares represent passable areas, with a value of 0. Each grid has a corresponding serial number, which is sorted sequentially from left to right and top to bottom, as shown in Figure 2.

For grid maps with coordinates, each grid pixel has unique coordinates. Assuming the grid has

M_{x}

rows and

M_{y}

columns, the position of the i-th. grid pixel is obtained using the following Equation (1):

\{\begin{matrix} x_{i} = a [\mod (i, M_{y}) - a / 2] \\ y_{i} = a [x + a / 2 - ceil (i / M_{x})] \end{matrix}

(1)

where

a

denotes the side length of the small square;

m o d

denotes the modulo operation;

ceil

denotes the ceiling function.

2.2. Traditional Ant Colony Optimization

ACO was proposed by Dorigo, an Italian researcher, as a population-based heuristic algorithm with strong robustness [36]. It finds the optimal path by simulating the exploratory behavior of ants searching for food using distributed computing and pheromone updating mechanisms. The ant’s pathfinding starts from a certain point and chooses the next step according to a specific rule, then again chooses the next step according to a specific rule after reaching each next node until it reaches the goal. Pheromones are then left on the path traveled to guide subsequent ants; the higher the concentration of pheromones left behind, the greater the chance that the ants will choose that path.

The specific rules governing an ant’s choice of next step are called transition probabilities. For instance, an ant’s movement from node

i

to node

j

is determined by the transition probability

P_{i j} (t)

.

P_{i j} (t) = \{\begin{matrix} \frac{{[τ_{i j} (t)]}^{α} {[η_{i j} (t)]}^{β}}{\sum_{j \in a l l o w e d (k)} {[τ_{i j} (t)]}^{α} {[η_{i j} (t)]}^{β}} & j \in a l l o w e d (k) \\ 0 & j \notin a l l o w e d (k) \end{matrix}

(2)

In Equation (2),

τ_{i j} (t)

denotes the pheromone concentration from node

i

to node

j

;

η_{i j} (t)

represents the heuristic function;

α

is the importance factor for

τ_{i j} (t)

;

β

is the heuristic expectation factor for

η_{i j} (t)

;

a l l o w e d (k)

indicates the set of nodes that node

i

can visit next. Where

η_{i j} (t)

is commonly expressed as:

η_{i j} (t) = \frac{1}{d_{i j}}

(3)

In Equation (3),

d_{i j}

denotes the Euclidean distance from node

i

to node

j

.

When ants move, they release pheromones between the nodes they traverse. As ants traverse more paths, pheromone concentration increases, and it also evaporates with each iteration. After each iteration, pheromone concentration is updated using Equations (4)–(6).

τ_{i j} (t + 1) = (1 - ρ) τ_{i j} (t) + Δ τ_{i j} (t)

(4)

Δ τ_{i j} = \sum_{k = 1}^{M} Δ τ_{i j}^{k} (t)

(5)

τ_{i j}^{k} = \{\begin{matrix} \frac{Q}{L_{k}} & i f a n t k t r a v e l s o n n o d e i t o n o d e j \\ 0 & o t h e r w i s e \end{matrix}

(6)

where

ρ

denotes the pheromone evaporation factor;

Δ τ_{i j} (t)

denotes the total amount of pheromone released along the path from node

i

to node

j

;

L_{k}

indicates the path length traversed by the k-th ant;

Q

is a constant.

2.3. Improvements to Ant Colony Optimization

2.3.1. Initial Pheromone Distribution Rules

In the traditional ACO, the initial pheromone distribution is typically uniform and constant. Thus, the absence of differences between any two nodes results in blind exploration by ants during the initial phase and slows convergence speed. To address this uniform distribution and improve ants’ aimless exploration in the early stages, the initial pheromone concentration is determined by integrating node location information and obstacle information, as shown in Equation (7).

τ_{i j} (0) = \frac{10 d_{S T}}{d_{S i} + d_{i E} + b_{i}}

(7)

where

d_{S i}

denotes the Euclidean distance from the current point to the starting node;

d_{i E}

denotes the Euclidean distance from the current node to the target point;

d_{S T}

denotes the Euclidean distance from the starting point to the target point;

b_{i}

denotes obstacle information surrounding the current node, as shown in the following Equation (8).

b_{i} = \frac{\sum_{s = 1}^{n} \sqrt{{(x_{s} - x_{i})}^{2} + {(y_{s} - y_{i})}^{2}}}{8}

(8)

where

(x_{i}, y_{i})

denotes the coordinates of the current node;

(x_{s}, y_{s})

denotes the coordinates of surrounding obstacles;

n

denotes the number of obstacles surrounding node

i

. The number 8 is a normalization constant used to convert the sum of obstacle distances into a standardized density metric. As shown in Equation (7), a larger value of

d_{S i} + d_{i E}

results in lower pheromone concentration, reducing the likelihood of selecting that node. Simultaneously, higher obstacle information

b_{i}

indicates denser surrounding obstacles, which also diminishes pheromone concentration. The initial pheromone distribution is illustrated in Figure 3, where color visually represents pheromone concentration variations. According to the initial pheromone distribution rules, pheromone concentration is higher near the line between the starting point S and the target point E, appearing as a deeper red hue. In areas with more obstacles, surrounding pheromones decrease, resulting in lower pheromone concentration and a bluer color.

2.3.2. Improved State Transition Probability Rules

(1): Introducing universal gravitation as an inspiration function

In traditional ACO, the heuristic function used for transition probabilities only represents the reciprocal of the distance between the current node and each of the next available nodes. Nevertheless, since the distance between any two nodes in a grid map is always either 1 or

\sqrt{2}

, this approach suffers from blind search and lacks global optimization. To sovle this issue, the gravity search algorithm (GSA) [37] is introduced to replace the original heuristic function. It calculates the heuristic function by determining the gravitational pull between the current node and the next node.

The gravitational search algorithm is predicated on the mutual attraction between objects through universal gravitation. This force causes all objects to move toward those with greater mass.

F_{i j}^{d} (t) = G (t) \frac{M_{i} (t) M_{j} (t)}{R_{i j} (t) + ε}

(9)

In Equation (9),

F_{i j}^{d} (t)

denotes the gravitational force exerted by object

i

on object

j

in the d-th dimension at time

t

;

G (t)

denotes the gravitational constant at time

t

, which decreases over time;

M_{i} (t)

and

M_{j} (t)

denotes the gravitational masses of objects

i

and

j

, respectively;

R_{i j} (t)

indicates the Euclidean distance between objects

i

and

j

;

ε

is a very small constant.

G (t) = G_{0} e^{- σ \frac{t}{T}}

(10)

In Equation (10),

G_{0}

denotes;

t

denotes the current iteration step;

T

denotes the total number of iterations;

σ

is a constant.

The greater the mass

M

, the stronger the gravitational pull. The value of mass can be easily calculated using the corresponding fitness function.

f i t_{i} (t) = norm (p_{i} (t) - E)

(11)

M_{i} (t) = \frac{f i t_{i} (t)}{\sum_{i = 1}^{S} f i t_{i} (t)}

(12)

In Equations (11) and (12),

f i t_{i} (t)

denotes the fitness value at position

i

at time

t

;

p_{i} (t)

denotes the coordinates of the surrounding passable grid cells including the current position

i

;

E

denotes the target point coordinates;

S

is the total number of grid cells comprising the current position and its surrounding passable grid cells;

norm

denotes the Euclidean distance between two points.

(2): Angle-inspired function for introducing directional guidance

In most cases, path planning prioritizes smoother trajectories with fewer turns, making robotic motion more rational. To guide the robot toward optimal paths, a directional guidance angle heuristic is introduced into path planning. This enhances path smoothness and accelerates convergence, thereby improving robotic motion efficiency. The directional guidance angle heuristic function

O_{i j} (t)

is expressed as shown in Equation (13).

O_{i j} (t) = \frac{1}{e^{θ / 2}}

(13)

where

θ

denotes the angle between the vector connecting the current node and the next node and the vector connecting the current node and the target node, which can be calculated using the following Equation (14).

θ = \cos^{- 1} (\frac{(x_{2} - x_{1}) (x_{3} - x_{2}) + (y_{2} - y_{1}) (y_{3} - y_{2})}{\sqrt{{(x_{2} - x_{1})}^{2} + {(y_{2} - y_{1})}^{2}} \sqrt{{(x_{3} - x_{2})}^{2} + {(y_{3} - y_{2})}^{2}}})

(14)

The coordinates

(x_{1}, y_{1})

,

(x_{2}, y_{2})

, and

(x_{3}, y_{3})

denotes the current node, the next node, and the target node, respectively. A smaller value of

θ

denotes a smoother path that is closer to the shortest path, as shown in Figure 4.

Therefore, integrating the above rules yields the following new transition probability rules:

P_{i j} (t) = \{\begin{matrix} \frac{{[τ_{i j} (t)]}^{α} {[F_{i j} (t)]}^{β} {[O_{i j} (t)]}^{γ}}{\sum_{j \in a l l o w d e (k)} {[τ_{i j} (t)]}^{α} {[F_{i j} (t)]}^{β} {[O_{i j} (t)]}^{γ}} & j \in a l l o w e d (k) \\ 0 & j \notin a l l o w e d (k) \end{matrix}

(15)

In Equation (15),

γ

is the heuristic factor for

O_{i j} (t)

.

2.3.3. Improve the Pheromone Update Method

In ACO, the update method typically involves updating all paths uniformly, without prioritizing higher-quality paths. Over time, this approach tends to converge to local optima. To address this problem, paths are sorted by length and selected at a certain ratio. Superior paths receive additional rewards to increase their pheromone concentration, thereby accelerating convergence.

For pheromone updates, a proportion-based strategy is employed to modify pheromone concentrations along paths. Ants reaching the target point in each generation are evaluated, with paths ranking within the top

Z (N) %

receiving enhanced pheromone concentrations. The specific update formulas are shown in Equations (16)–(18):

τ_{i j} (t + 1) = (1 - ρ) τ_{i j} (t) + Δ τ_{i j} (t) + e_{1} Δ τ_{i j}^{b s} + e_{2} \frac{1}{L_{g b}}

(16)

Δ τ_{i j}^{b s} = \{\begin{matrix} 1 / f_{m} & S_{f} (n) \geq S_{f} (ceil (Z (N) Z)) \\ 0 & o t h e r w i s e \end{matrix}

(17)

Z (N) = \max (u - v \times N, Z_{\min})

(18)

where

S_{f}

denotes the paths sorted in descending order;

N

denotes the total number of paths reaching the target point in each generation, where

n = 1,2, 3 \dots N

;

Z (N)

is the dynamic threshold;

u

and

v

are constants, with

u = 0.4

,

v = 0.002

, and

Z_{m i n} = 0.05

;

f_{m}

is the average score of the top

Z (N) %

paths;

L_{g b}

is the global shortest path;

e_{1}

and

e_{2}

are constants;

ρ

denotes the pheromone volatilization factor, calculated using Equation (19).

ρ = \frac{1}{1 + e^{A \times K / k - B}}

(19)

where K denotes the maximum iteration times; k denotes the current iteration times; A and B are constants.

During the iteration process, the quantity of pheromones changes as ants explore. Early on, ants are encouraged to explore extensively. As exploration continues, pheromones begin to accumulate along high-quality paths. Later, search efforts are concentrated to conduct detailed searches near the optimal solution, accelerating convergence.

To prevent ants from clustering along a single path and avoid premature convergence to local optima that could cause the algorithm to stall, upper and lower bounds are set for pheromone concentration.

τ_{i j} (t) = \{\begin{matrix} τ_{\max} (t) & τ_{i j} (t) \geq τ_{\max} (t) \\ τ_{i j} (t) & τ_{\min} (t) < τ_{i j} (t) < τ_{\max} (t) \\ τ_{\min} (t) & τ_{i j} (t) \leq τ_{\min} (t) \end{matrix}

(20)

\{\begin{matrix} τ_{\max} (t) = τ_{\max} (t - 1) + \frac{1}{k} (\frac{23 Q}{L_{g b} + 0.625} - τ_{\max} (t - 1)) \\ τ_{\min} (t) = \frac{τ_{\max} (t - 1)}{100} \end{matrix}

(21)

In Equations (20) and (21),

τ_{m a x}

and

τ_{m i n}

denotes the upper and lower bounds of pheromone concentration, respectively;

τ_{m a x} (t - 1)

denotes the maximum pheromone concentration of the previous generation;

k

denotes the current iteration count; and

Q

is a constant.

2.3.4. Path Redundancy Inflection Point Optimization

Due to the inherent nature of grid maps, paths generated by algorithms are always diagonally connected. This typically results in paths with numerous turning points and insufficient smoothness. Therefore, this paper employs an optimization technique to reduce redundant turning points, aiming to shorten path lengths and enhance path smoothness. The optimized and original paths are shown in Figure 5.

The specific approach for optimizing the method is as follows:

Step 1: Identify the turning points along the original path and designate them as nodes to be optimized.

Step 2: Starting from the origin, sequentially search each subsequent turning point. Determine whether there are obstacles between the line segments connecting two turning points. If no obstacles exist, continue searching the next turning point. If obstacles are present, terminate the search and adopt the preceding turning point as the new node in the path.

Step 3: Using the newly obtained node as the starting point, repeat Step 2 until the endpoint is reached.

3. Fusion Path Planning Algorithm

3.1. Dynamic Window Approach for Local Path Planning

The DWA is a common local path planning algorithm. Its principle involves generating multiple predicted trajectories by selecting velocity combinations in the velocity sampling space, then scoring each trajectory. It determines the optimal trajectory and corresponding velocity combination for the robot at the current time step. The resulting velocity and angular velocity combinations are provided to the respective robot as control parameters. Due to limitations in global path planning capability, DWA is generally combined with global path planning algorithms. This combination enables the delivery robot to possess not only global path planning but also local dynamic obstacle avoidance capabilities.

3.1.1. Establishment of a Robot Motion Model

The DWA algorithm primarily controls motion and determines position information through the robot’s linear velocity, angular velocity, and heading angle. Different kinematic models exist for different robot drive configurations. The robot motion model employed by delivery robots is a two-wheel differential motion model, as shown in Figure 6.

Assume the robot’s position at time

t

is

{[\begin{matrix} x (t) & y (t) & θ (t) \end{matrix}]}^{T}

. Over the time interval

Δ t

, the position increment is given by Equation (22):

[\begin{matrix} x (t + Δ t) \\ y (t + Δ t) \\ θ (t + Δ t) \end{matrix}] = [\begin{matrix} x (t) \\ y (t) \\ θ (t) \end{matrix}] + [\begin{matrix} Δ t \cos (θ_{t}) & 0 \\ Δ t \sin (θ_{t}) & 0 \\ 0 & Δ t \end{matrix}] [\begin{matrix} v_{t} \\ ω_{t} \end{matrix}]

(22)

where

v_{t}

denotes the robot’s velocity at time

t

, and

ω_{t}

denotes the robot’s angular velocity at time

t

.

3.1.2. Establish a Speed Zone

After establishing the robot’s motion trajectory space, the velocity space must be defined. Within this defined velocity space, multiple velocity combinations are sampled at a specified resolution. Based on these sampled velocity combinations, multiple corresponding motion trajectories are simulated over a defined time period. These trajectories will be evaluated by the evaluation function to determine relatively optimal velocity combinations. The velocity space represents the robot’s speed range, constrained by factors associated with the delivery robot’s performance capabilities and environmental conditions, ultimately limiting it to a specific velocity range.

(1): Constrained by its maximum and minimum speeds, the delivery robot can reach a speed of $V_{m}$ .

V_{m} = \{(v, ω) | v \in [v_{\min}, v_{\max}], ω \in [ω_{\min}, ω_{\max}]\}

(23)

In Equation (23),

v_{m i n}

and

v_{m a x}

denote the minimum and maximum linear velocities achievable by the delivery robot, respectively;

ω_{m i n}

and

ω_{m a x}

denote the minimum and maximum angular velocities achievable by the delivery robot, respectively.

(2): Constrained by motor performance, the delivery robot can achieve a speed of $V_{d}$ .

V_{d} = \{(v, ω) | v \in [v_{c} - {\dot{v}}_{b} Δ t, v_{c} + {\dot{v}}_{a} Δ t] \land ω \in [ω_{c} - {\dot{ω}}_{b} Δ t, ω_{c} + {\dot{ω}}_{a} Δ t]\}

(24)

In Equation (24),

v_{c}

and

ω_{c}

denote the linear velocity and angular velocity at the current time;

{\dot{v}}_{a}

and

{\dot{v}}_{b}

represent the maximum acceleration and maximum deceleration achievable in linear velocity under motor performance;

{\dot{ω}}_{a}

and

{\dot{ω}}_{b}

denote the maximum acceleration and maximum deceleration achievable in angular velocity under motor performance.

(3): Constrained by obstacles, the delivery robot can achieve a speed of $V_{a}$ .

V_{a} = \{(v, ω) | v \leq \sqrt{2 dist (v, ω) {\dot{v}}_{b}} \land ω \leq \sqrt{2 dist (v, ω) {\dot{ω}}_{b}}\}

(25)

In Equation (25),

dist (v, ω)

denotes the distance between the simulation trajectory corresponding to

(v, ω)

and the nearest obstacle.

Taking the intersection of the above constraints yields the robot’s feasible velocity space, with the velocity range given by Equation (26):

V_{r} = V_{m} \cap V_{d} \cap V_{a}

(26)

3.1.3. Optimization of the Evaluation Function

After sampling the velocity space, multiple predicted trajectories can be obtained based on the established robot motion model. At this point, these trajectories must be evaluated to select the optimal motion path, i.e., the optimal velocity combination. The evaluation function for the traditional DWA is shown in Equation (27):

G (v, ω) = a H e a d i n g (v, ω) + b D i s t (v, ω) + c V e l (v, ω)

(27)

where

H e a d i n g (v, ω)

denotes the heading evaluation function, which assesses the angular difference between the predicted trajectory endpoint orientation and the target point at the current sampled velocity in velocity space;

D i s t (v, ω)

is the distance evaluation function, representing the distance between the estimated trajectory endpoint and the nearest obstacle;

V e l (v, ω)

is the velocity evaluation function, representing the robot’s current linear velocity; a, b, and c are weighting coefficients.

To enhance the robot’s capability to avoid dynamic obstacles, dynamic obstacles are initially identified by comparing the obstacle’s position at the current time step with its position at the next time step. If the obstacle is determined to be dynamic, dynamic obstacle avoidance rules are applied. If the obstacle is static, static obstacle avoidance rules are applied. The distance evaluation subfunction is optimized within the evaluation function, specifically:

D i s t (v, ω) = \{\begin{matrix} J D i s t 1 (v, ω) + K D i s t 2 (v, ω) & d_{s} \neq 0 \\ D i s t 1 (v, ω) & d_{s} = 0 \end{matrix}

(28)

In Equation (28),

D i s t 1 (v, ω)

denotes the nearest distance between the endpoint of the predicted trajectory and static obstacles;

D i s t 2 (v, ω)

denotes the nearest distance between the endpoint of the predicted trajectory and dynamic obstacles;

d_{s}

represents the displacement of dynamic obstacles, as shown in Equation (29).

d_{s} = \sqrt{{(x_{o} (t) - x_{o} (t - 1))}^{2} + {(y_{o} (t) - y_{o} (t - 1))}^{2}}

(29)

When robots operate in areas with numerous obstacles, fluctuations in linear velocity become more frequent and pronounced, leading to reduced operational efficiency and unnecessary motor wear. Therefore, introducing a linear velocity variation sub-evaluation function into the assessment function, namely

V e l_c (v, ω) = e^{- w (v_{i + 1} - v_{i})}

(30)

In Equation (30),

V e l_c (v, ω)

denotes the linear velocity change evaluation function;

v_{i}

represents the linear velocity at the robot’s current position;

v_{i + 1}

denotes the linear velocity at the end of the predicted trajectory.

To ensure that the robot can better approach the global path and obtain a smoother path during its movement, a path proximity evaluation subfunction is introduced into the evaluation function, namely:

p a t h (v, ω) = \frac{a b s ((x_{2} - x_{1}) (y_{p} - y_{1}) - (y_{2} - y_{1}) (x_{p} - x_{1}))}{\sqrt{{(x_{2} - x_{1})}^{2} + {(y_{2} - y_{1})}^{2}}}

(31)

In Equation (31),

(x_{p}, y_{p})

denotes the position at the end of the predicted trajectory,

(x_{1}, y_{1})

represents the previous path key node, and

(x_{2}, y_{2})

denotes the subsequent path key node.

Finally, the improved evaluation function is given by Equation (32):

\begin{matrix} G (v, ω) = a H e a d i n g (v, ω) + b D i s t (v, ω) + c V e l (v, ω) \\ + f V e l_c (v, ω) + g P a t h (v, ω) \end{matrix}

(32)

where

f

denotes the weight parameter for the linear velocity change assessment function;

g

denotes the weight parameter for the path proximity assessment function.

3.2. Dynamic Path Planning Algorithm Fusion

In traditional DWA, the problem often gets stuck in local optima. While IACO can achieve optimal solutions for global path planning in static environments, it lacks obstacle avoidance capabilities in complex environments with dynamic obstacles. To address this issue, the global path generated by the IACO is introduced as a guidance target for local dynamic planning.

By integrating the IACO with the enhanced DWA, the path turning points gained from the IACO serve as the required local sub-goal points for DWA. DWA performs segmented path planning using these sub-goal points to attain obstacle avoidance for the robot. The fusion algorithm enables globally optimal dynamic path planning and real-time obstacle avoidance. The path planning process of the fusion algorithm is illustrated in Figure 7.

The specific implementation steps are as follows:

Step 1: Create an environmental map by establishing the robot’s working environment using a grid map, and set the robot’s starting point and destination.

Step 2: Initialize relevant parameters such as ant population size, maximum iteration count, and various importance factors; initialize the initial pheromones on the map.

Step 3: Employ the IACO for global path planning to obtain a globally optimal path, and extract the sequence of key nodes along the path.

Step 4: Perform velocity sampling and generate predicted trajectories based on the robot’s kinematic model.

Step 5: According to the key nodes of the extracted path and the generated predicted trajectory, evaluate using the improved assessment function and select the optimal speed control as the output.

Step 6: Determine whether a local target point has been reached. If reached, select the next node as the new local target point and execute Step 4. If not reached, return to Step 4 to continue iteration.

Step 7: Determine whether the global target point has been reached. If the global target point has not been reached, return to step 6. If the global target point has been reached, terminate the loop.

4. Simulation Experiments and Results Analysis

In this section, all simulation experiments were conducted in a Windows 11 64-bit operating environment with a 3.2 GHz processor, 16 GB of RAM, and the simulation software was Matlab R2022b.

4.1. Determination of Key Parameters for the IACO

For IACO, different parameter combinations can significantly impact algorithm performance. However, no comprehensive theoretical method exists for determining optimal parameter configurations. In this paper, to identify key parameter combinations for IACO, we conducted multiple simulation experiments on primary parameters and performed statistical analysis.

To find the optimal parameter combination, the testing method involves altering one parameter at a time and all others are kept fixed. As shown in Equation (15), the primary parameters requiring testing are the pheromone importance factor α, the heuristic function expectation factor β, and the angular guidance heuristic factor γ. Under the condition of altering one parameter, four data metrics were analyzed: the optimal path length, the average path length at convergence, the average convergence iteration count, and the average iteration to reach the shortest path. Other parameters are listed in Table 1.

Simulation experiments were conducted on the grid map shown in Figure 8. The starting point was located at the blue grid in the upper left corner, and the target point was at the red grid in the lower right corner. To minimize the impact of errors, each parameter combination underwent 10 simulation runs.

For parameter α, its variation range is set to [1,8] with a step size of 0.5. Other parameters remain fixed: β = 2, γ = 5.5. Experimental results are shown in Figure 9. As seen in Figure 9a, the optimal path length consistently remains at 28.4132, indicating no significant variation in optimal path length regardless of α value. As shown in Figure 9b, the average path length at convergence exhibits an increasing trend as the value of α increases. Figure 9c,d reveal that α significantly influences the number of iterations. As α increases, both the average convergence iterations and the average iterations to reach the shortest path exhibit a pattern of first decreasing significantly and then increasing. The minimum average convergence iterations occur at α = 5.5. Therefore, the parameter value is determined as α = 5.5.

For parameter β, its variation range is set to [1,8] with a step size of 1. Other parameters remain fixed: α = 2, γ = 5.5. Experimental results are shown in Figure 10. Figure 10a indicates that the optimal path length consistently remains at 28.4132, demonstrating that the optimal path length shows no significant variation regardless of β value. Figure 10b reveals that when β ranges from 2 to 6, the average path length at convergence maintains a stable value, suggesting that β values within this range have no substantial impact on path length. Figure 10c,d reveal that as β increases, the average convergence iterations and the average iterations to reach the shortest path first decrease and then increase, with a minimum occurring at β = 4. Therefore, β = 4 is selected.

For parameter γ, its variation range is set to [1,10] with a step size of 0.5. Other parameters remain fixed: α = 2, β = 2. Experimental results are shown in Figure 11. Figure 11a indicates that when γ ranges from 1 to 8, the optimal path length remains constant at 28.4132. However, γ exhibits significant changes beyond 8. As shown in Figure 11b, the average path length at convergence first decreases and then increases as γ increases. The average path length remains relatively stable within the range of γ = 2.5 to 5.5. Figure 11c,d demonstrate that γ significantly affects the average convergence iterations and the average iterations to reach the shortest path, with a minimum value occurring at γ = 7.5. However, at γ = 7.5, the average path length at convergence falls outside the optimal range, making it prone to local optima. Finally, through experimental validation of γ within the range of 2.5 to 5.5, γ = 4.5 was ultimately selected to achieve better performance.

In summary, after statistical experimentation, the optimal configuration of parameter for the IACO was determined as α = 5.5, β = 4, and γ = 4.5. Substituting this parameter set into the algorithm, its impact on path planning is illustrated in Figure 12. The optimal path generated by the IACO achieved a length of 28.4132, with 6 turns, and 4 iterations.

4.2. Comparative Simulation Experiments of the IACO

In this section, multiple groups of experiments were conducted to better verify the performance of the IACO in path planning. These experiments involved various maps and were performed across multiple distinct experimental environments to validate the efficiency and adaptability of the IACO. Furthermore, the IACO was compared with other algorithms to demonstrate its effectiveness.

4.2.1. Performance Validation in Different Complex Environments

Four distinct environmental maps were designed to test the adaptability of the proposed IACO. This set of experiments compared the proposed algorithm with the traditional ACO on different maps. Four distinct maps are illustrated in Table 2 and Figure 13. On the map, the blue grid serves as the starting point, while the red grid serves as the destination. In the algorithm parameter configuration, the proposed IACO shares the same common parameter settings as the traditional ACO. Specifically, the traditional ACO parameters are M = 50, K = 100, α = 5.5, β = 4, ρ = 0.3, Q = 1. The proposed IACO parameters are set as M = 50, K = 100, α = 5.5, β = 4, γ = 4.5. Other parameters are shown in Table 1. To minimize the impact of random errors, each algorithm was run independently 20 times. The experimental results are shown in Figure 13 and Table 3. Figure 13 displays the optimal paths generated by each algorithm. Additionally, Table 3 presents three evaluation metrics: the percentage improvement in optimal path length (Length improve Best), the percentage improvement in average path length (Length improve Mean), and the percentage improvement in optimal path turning points (Turn improve Best).

As shown in Figure 13 and Table 3, the IACO demonstrates excellent adaptability across various environments. Across four distinct map characteristics, the improved algorithm outperforms the traditional ACO. It exhibits significantly superior performance in terms of fewer turns required. Across the four maps, the improved algorithm reduced the number of turns by 25%, 41.67%, 25%, and 71.43%, respectively, compared to the traditional ACO. With respect to path length performance metrics, the IACO demonstrated significant advantages in optimal path length, average path length, and path length standard deviation. The IACO achieved optimal path lengths of 27.218, 29.458, 28.951, and 33.066 across four maps, all of which were shorter than those obtained by the traditional ACO. The average path lengths achieved by the IACO across the four maps were 27.218, 29.521, 29.206, and 33.416, respectively, all lower than those of the traditional ACO. Notably, in Map 4, the average path length improvement reached 30.03%. The standard deviation of paths obtained across the four maps is relatively small, with Map 1 exhibiting a standard deviation of zero, indicating that the proposed IACO demonstrates high stability. Experimental results demonstrate that the IACO possesses advantages and adaptability in diverse environmental conditions.

4.2.2. Comparative Experiments with Other Algorithms

(1): Comparative experiment 1.

In Comparative experiment 1, the grid map used in Reference [17] served as the experimental environment. The proposed IACO algorithm was compared against six other intelligent algorithms: the improved ACO algorithms (IACO and APACA) mentioned in Reference [38], the IHMACO algorithm proposed in Reference [17], the A*, the Dijkstra, and the best-first-search. To minimize errors, all experiments were independently executed 20 times. The obtained optimal path trajectories and convergence curves are shown in Figure 14 and Figure 15, respectively, while the statistical data are presented in Table 4.

The data in Figure 14 and Figure 15 and Table 4 demonstrate that the proposed IACO is superior to the other six algorithms in terms of path length, turn times, and convergence speed. Regarding path length, the proposed IACO achieves the minimum values for both optimal and average path lengths, both at 27.63. Additionally, the standard deviation is zero, indicating algorithmic stability. Except for matching the number of turns with IHMACO, the proposed IACO exhibits the fewest turns at 4, indicating superior path smoothness. Despite sharing the same number of turns as IHMACO, the improved algorithm achieves shorter paths. Furthermore, Figure 15 demonstrates that the improved algorithm converges faster. Compared to the IACO in the literature [38], APACA, IHMACO, A*, Dijkstra, and best-first-search algorithms, the proposed IACO demonstrates superior performance advantages. This demonstrates that the proposed improvement mechanism is effective and exhibits superior overall performance.

(2): Comparative experiment 2.

In Comparative experiment 2, the proposed IACO was compared with five other intelligent algorithms, including the traditional ACO, the ZACO mentioned in [39], the MAACO proposed in [12], the A*, and the Dijkstra. Experiments were performed on the 20 × 20 map model described in Reference [12], as shown in Figure 15. To minimize error, all experiments were independently executed 20 times. The obtained optimal path trajectories and convergence curves are illustrated in Figure 16 and Figure 17, respectively, with the corresponding statistical data provided in Table 5.

The data in Figure 16 and Figure 17 and Table 5 show that, except for the Dijkstra algorithm, the other four algorithms all converge stably to 29.21. The IACO proposed in this paper, by adopting the strategy of redundant turning point optimization, further reduces the path length to 27.88. It also further reduces the number of turning points to 5. Figure 17 and Table 5 demonstrate that while ensuring optimal solutions, the algorithm achieves a faster convergence rate with an average of 4.7 iterations—the swiftest among all methods. This acceleration stems from the incorporation of an initial pheromone strategy and an enhanced transition mechanism. In summary, the experimental results indicate that the IACO attains shorter path lengths and faster convergence rates compared to traditional ACO, improved ACO in [39], and MAACO algorithms. Furthermore, it yields superior solutions relative to the A* and Dijkstra algorithms.

(3): Comparative experiment 3.

In experiment 3, to further verify the performance of the IACO, performance tests were conducted in a more complex environment. The 30 × 30 map model described in [12] was employed for testing. The IACO was compared against the traditional ACO, the MRCACO proposed in [40], the AIACSE proposed in [13], the MAACO proposed in [12], the A* algorithm, and the Dijkstra algorithm. To minimize error, all experiments were independently executed 20 times. The obtained optimal path trajectories are shown in Figure 18, and the statistical data are presented in Table 6.

Based on the data in Table 6, despite larger and more complex maps, the IACO algorithm still demonstrates superior performance. Regarding path length, the IACO algorithm achieved the shortest optimal path length of 41.4635. Additionally, the standard deviation of path length was zero, indicating both the ability to find optimal paths and excellent stability. Algorithms such as MRCACO, MAACO, A*, and Dijkstra also achieve relatively optimal paths with good stability. However, the paths they find contain a higher number of turning points. The IACO, on the other hand, achieves fewer turning points, with only 4 turns. This demonstrates that the strategies of redundant turning point optimization, improved transition mechanism, and enhanced pheromone update strategy effectively enhance the algorithm’s performance, leading to superior path planning.

4.3. Fusion Algorithm Simulation Experiments

In the hybrid algorithm simulation experiment, the kinematic parameters of the delivery robot are shown in Table 7. The evaluation parameters for the DWA algorithm are: a = 0.3, b = 0.12, c = 0.2, f = 0.15, g = 0.2, J = 0.35, K = 0.65. The prediction period is 3 s. In the following experiment, the starting point is the grid cell marked by the blue triangle, the endpoint is the grid cell marked by the red circle, the solid blue line represents the trajectory of the fusion algorithm, and the dashed black line denotes the global path planned by the ACO.

4.3.1. Experimental Evaluation and Analysis of Fusion Algorithm Effectiveness

To evaluate the effectiveness of the fusion algorithm, comparisons were conducted on the same map between the traditional DWA and the fusion algorithm incorporating global path planning. Figure 19 illustrates the paths generated by the traditional DWA algorithm and the improved fusion algorithm, while Table 8 presents the statistical data from the comparative experiments.

4.3.2. Experimental Analysis of Fusion Algorithm Obstacle Avoidance

Taking the actual layout diagram of a cafeteria as an example, as shown in Figure 20a, a fusion path planning algorithm is applied to perform a global search for the robot’s walking path, identifying an optimal route that is relatively short, smooth, and highly secure. Obstacles in the space are arranged according to patterns based on the actual restaurant layout (height is not considered). During robot operation, the structure, volume, and position information of obstacles remain fixed and known. Based on this, the robot’s workspace model defines a simulated grid map according to the actual restaurant scale.

Determine the grid map size based on the delivery robot’s dimensions and the actual environment. Assuming the robot’s projected footprint on the floor is 0.5 m × 0.5 m and the restaurant area is 20 m × 20 m, the grid size is set slightly larger than the robot’s maximum diameter to allow free movement within the grid. Therefore, a grid map size of 20 × 20 is adopted, as shown in Figure 20b.

In a 20 × 20 map of a cafeteria dining hall, the feasibility of the fusion algorithm is validated by setting static and dynamic obstacles. In the experimental map, two static obstacles are placed, as shown by the gray circles in Figure 21. Additionally, two dynamic obstacles are placed, as shown by the orange circles in Figure 22.

In Figure 21 and Figure 22, The black circle denotes the path key point. The path planning data results are shown in the diagram Table 9 and Figure 23. Table 9 displays the path length, planning time, number of control nodes, and minimum distance from obstacles during navigation achieved by the fusion algorithm throughout the path planning process. Figure 23 illustrates the variation in speed during obstacle avoidance and the path deviation relative to the distance key point.

(1): Static obstacle avoidance experiment analysis.

Figure 21 illustrates the obstacle avoidance process of the fusion algorithm in a map containing static obstacles. Before encountering obstacles, the robot effectively navigates along key nodes specified by the global path. When static obstacles appear ahead, the robot detects them and executes avoidance maneuvers. As shown in Figure 21, the robot successfully bypasses both obstacles on straight paths and those at corners. After obstacle avoidance, it resumes following key nodes along the global path until reaching the destination. The path results are shown in Table 9. The final planned path length is 28.452 m, with 766 control nodes and a simulation time of 386.98 s. During navigation, the minimum distance to obstacles is 0.2 m, occurring between control nodes 0 and 30, at which point the robot operates at a slower speed.

(2): Dynamic obstacle avoidance experiment analysis.

Figure 22 illustrates the obstacle avoidance process of the fusion algorithm in a map containing dynamic obstacles. During the movement, the robot dynamically detects obstacles, determining whether they are moving. For moving obstacles, it follows dynamic obstacle avoidance rules; for stationary obstacles, it follows static obstacle avoidance rules. As shown in Figure 22, the robot successfully navigated around both dynamic and static obstacles at two locations, respectively, and safely reached the destination. The path results are shown in Table 9. The final planned path length is 28.94 m, with 800 control nodes and a simulation time of 419.25 s. During navigation, the minimum distance to obstacles is 0.2 m, occurring between control nodes 0 and 30 as in Experiment (1), at which point the robot operates at a slower speed.

Figure 23a illustrates the variation in linear velocity of the robot during the simulation experiment. Linear velocity 1 represents the linear velocity variation in Experiment (1), while linear velocity 2 denotes the linear velocity variation in Experiment (2). As can be seen from the diagram, the linear velocity exhibits relatively stable variation, enabling timely deceleration upon encountering obstacles and navigating around them at a comparatively safe speed. Figure 23b illustrates the path deviation during the simulation experiment. Error1 denotes the degree of path deviation in Experiment (1), while Error2 denotes the degree of path deviation in Experiment (2). In Experiment (1), the maximum deviation of the path distance from the target point was 0.29 m, with an average deviation of 0.188 m. In Experiment (2), the maximum deviation of the path distance from the target point was 0.74 m, with an average deviation of 0.318 m. This indicates that the robot must deviate from the target point to avoid obstacles, overall continues to travel along the global path.

5. Conclusions

Robot navigation relies heavily on path planning, hence studying path planning algorithms is necessary. This paper overcomes the shortcomings and limitations of traditional ACO and DWA algorithms by optimizing both approaches. It then integrates the IACO and DWA algorithms to improve the overall performance and obstacle avoidance capabilities of the path planning system.

To address the limitations of traditional ACO, the following improvement strategies are proposed: A strategy for initial pheromone distribution based on node positions and obstacle information is introduced. A gravitational mechanism and directional angle heuristic function are adopted as new heuristic information for state transition rules. A new pheromone update rule is implemented that sorts paths by length and adds extra pheromone to paths with optimal lengths. By reducing redundant turning points in global paths, the algorithm optimizes the issue of excessive path turns in traditional ACO, resulting in smoother paths.

To optimize the local path planning performance of the DWA, this study enhances its evaluation function to better track the global path while simultaneously improving dynamic obstacle avoidance capabilities. To this end, three evaluation subfunctions have been introduced based on the evaluation function: the dynamic obstacle distance evaluation subfunction, the linear velocity change evaluation subfunction, and the path proximity evaluation subfunction.

Simulation experiments demonstrate that compared to the traditional ACO across four environments, the IACO decreased the average path length by 13.98%, 17.53%, 20.32%, and 30.03%, respectively. It also decreased the number of path turns by 25%, 41.67%, 25%, and 71.43%, respectively. Comparisons with several other intelligent algorithms further demonstrated that the IACO exhibits superior advantages in path length, smoothness, and convergence iterations. Finally, the integration of the IACO and DWA algorithms enabled the fusion algorithm to effectively plan optimal paths and execute superior obstacle avoidance strategies in both static and dynamic obstacle avoidance experiments.

Author Contributions

Conceptualization, L.H. and Y.G.; Methodology, T.H., J.W. and B.Z.; Software, T.H.; Validation, J.W.; Data curation, X.T., J.D. and H.Y.; Writing—original draft, T.H.; Writing—review & editing, L.H.; Supervision, Y.G. and B.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Sichuan Provincial Regional Innovation Cooperation Project, grant number 2024YFHZ0209, the Sichuan Provincial Regional Innovation Cooperation Project, grant number 2023YFQ0092 and the Natural Science Foundation of Sichuan, grant number 2023NSFSC0368.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lu, Y.; Da, C. Global and Local Path Planning of Robots Combining ACO and Dynamic Window Algorithm. Sci. Rep. 2025, 15, 9452. [Google Scholar] [CrossRef]
Dang, T.-V.; Tan, P.X. Hybrid Mobile Robot Path Planning Using Safe JBS-A*B Algorithm and Improved DWA Based on Monocular Camera. J. Intell. Robot. Syst. 2024, 110, 151. [Google Scholar] [CrossRef]
Liu, Y.; Guo, S.; Tang, S.; Song, J.; Zhang, J. Path Planning for Robots Based on Adaptive Dual-Layer Ant Colony Optimization Algorithm and Adaptive Dynamic Window Approach. IEEE Sens. J. 2025, 25, 19694–19708. [Google Scholar] [CrossRef]
Sun, S.; Zhou, Q.; He, Y. Path Planning for Mobile Robots in Complex Environments Using a Fusion Algorithm by Combining an Optimized A-Star Algorithm with a Dynamic Window Approach. In Proceedings of the Canadian Society for Mechanical Engineering International Congress, 31st Annual Conference of the Computational Fluid Dynamics Society of Canada, Toronto, ON, Canada, 26–29 May 2024. [Google Scholar]
Wang, Y.; Fu, C.; Huang, R.; Tong, K.; He, Y.; Xu, L. Path Planning for Mobile Robots in Greenhouse Orchards Based on Improved A* and Fuzzy DWA Algorithms. Comput. Electron. Agric. 2024, 227, 109598. [Google Scholar] [CrossRef]
Zhang, R.; Guo, H.; Andriukaitis, D.; Li, Y.; Królczyk, G.; Li, Z. Intelligent Path Planning by an Improved RRT Algorithm with Dual Grid Map. Alex. Eng. J. 2024, 88, 91–104. [Google Scholar] [CrossRef]
Ab Wahab, M.N.; Nazir, A.; Khalil, A.; Ho, W.J.; Akbar, M.F.; Noor, M.H.M.; Mohamed, A.S.A. Improved Genetic Algorithm for Mobile Robot Path Planning in Static Environments. Expert Syst. Appl. 2024, 249, 123762. [Google Scholar] [CrossRef]
Zhang, J.; Zhu, X.; Li, J. Intelligent Path Planning with an Improved Sparrow Search Algorithm for Workshop UAV Inspection. Sensors 2024, 24, 1104. [Google Scholar] [CrossRef] [PubMed]
Sun, H.; Zhu, K.; Zhang, W.; Ke, Z.; Hu, H.; Wu, K.; Zhang, T. Emergency Path Planning Based on Improved Ant Colony Algorithm. J. Build. Eng. 2025, 100, 111725. [Google Scholar] [CrossRef]
Wu, S.; Dong, A.; Li, Q.; Wei, W.; Zhang, Y.; Ye, Z. Application of Ant Colony Optimization Algorithm Based on Farthest Point Optimization and Multi-Objective Strategy in Robot Path Planning. Appl. Soft Comput. 2024, 167, 112433. [Google Scholar] [CrossRef]
Li, P.; Wei, L.; Wu, D. An Intelligently Enhanced Ant Colony Optimization Algorithm for Global Path Planning of Mobile Robots in Engineering Applications. Sensors 2025, 25, 1326. [Google Scholar] [CrossRef]
Wu, L.; Huang, X.; Cui, J.; Liu, C.; Xiao, W. Modified Adaptive Ant Colony Optimization Algorithm and Its Application for Solving Path Planning of Mobile Robot. Expert Syst. Appl. 2023, 215, 119410. [Google Scholar] [CrossRef]
Zhang, S.; Pu, J.; Si, Y. An Adaptive Improved Ant Colony System Based on Population Information Entropy for Path Planning of Mobile Robot. IEEE Access 2021, 9, 24933–24945. [Google Scholar] [CrossRef]
Gao, W.; Tang, Q.; Ye, B.; Yang, Y.; Yao, J. An Enhanced Heuristic Ant Colony Optimization for Mobile Robot Path Planning. Soft Comput. 2020, 24, 6139–6150. [Google Scholar] [CrossRef]
Cui, J.; Wu, L.; Huang, X.; Xu, D.; Liu, C.; Xiao, W. Multi-Strategy Adaptable Ant Colony Optimization Algorithm and Its Application in Robot Path Planning. Knowl.-Based Syst. 2024, 288, 111459. [Google Scholar] [CrossRef]
Liang, W.; Lou, M.; Chen, Z.; Qin, H.; Zhang, C.; Cui, C.; Wang, Y. An Enhanced Ant Colony Optimization Algorithm for Global Path Planning of Deep-Sea Mining Vehicles. Ocean Eng. 2024, 301, 117415. [Google Scholar] [CrossRef]
Liu, C.; Wu, L.; Xiao, W.; Li, G.; Xu, D.; Guo, J.; Li, W. An Improved Heuristic Mechanism Ant Colony Optimization Algorithm for Solving Path Planning. Knowl.-Based Syst. 2023, 271, 110540. [Google Scholar] [CrossRef]
Fu, B.; Chen, Y.; Quan, Y.; Zhou, X.; Li, C. Bidirectional Artificial Potential Field-Based Ant Colony Optimization for Robot Path Planning. Robot. Auton. Syst. 2025, 183, 104834. [Google Scholar] [CrossRef]
Huang, J.; Chen, C.; Shen, J.; Liu, G.; Xu, F. A Self-Adaptive Neighborhood Search A-Star Algorithm for Mobile Robots Global Path Planning. Comput. Electr. Eng. 2025, 123, 110018. [Google Scholar] [CrossRef]
Xu, X.; Zeng, J.; Zhao, Y.; Lü, X. Research on Global Path Planning Algorithm for Mobile Robots Based on Improved A*. Expert Syst. Appl. 2024, 243, 122922. [Google Scholar] [CrossRef]
Ganesan, S.; Ramalingam, B.; Mohan, R.E. A Hybrid Sampling-Based RRT* Path Planning Algorithm for Autonomous Mobile Robot Navigation. Expert Syst. Appl. 2024, 258, 125206. [Google Scholar] [CrossRef]
Li, J.; Hu, Y.; Yang, S.X. A Novel Knowledge-Based Genetic Algorithm for Robot Path Planning in Complex Environments. IEEE Trans. Evol. Comput. 2025, 29, 375–389. [Google Scholar] [CrossRef]
Lamini, C.; Benhlima, S.; Elbekri, A. Genetic Algorithm Based Approach for Autonomous Mobile Robot Path Planning. Procedia Comput. Sci. 2018, 127, 180–189. [Google Scholar] [CrossRef]
Yan, Z.; Luo, W.; Yu, L.; Zhang, D.; Wang, B.; Li, Q. A Mobile Robot Path Planning Based on Modified Sparrow Search Algorithm. In Proceedings of the 2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC), Beijing, China, 19–20 November 2022; IEEE: New York, NY, USA, 2022; pp. 1537–1541. [Google Scholar]
Liu, J.; Yan, Y.; Yang, Y.; Li, J. An Improved Artificial Potential Field UAV Path Planning Algorithm Guided by RRT Under Environment-Aware Modeling: Theory and Simulation. IEEE Access 2024, 12, 12080–12097. [Google Scholar] [CrossRef]
Li, H.; Liu, W.; Yang, C.; Wang, W.; Qie, T.; Xiang, C. An Optimization-Based Path Planning Approach for Autonomous Vehicles Using the DynEFWA-Artificial Potential Field. IEEE Trans. Intell. Veh. 2022, 7, 263–272. [Google Scholar] [CrossRef]
Shin, Y.; Kim, E. Hybrid Path Planning Using Positioning Risk and Artificial Potential Fields. Aerosp. Sci. Technol. 2021, 112, 106640. [Google Scholar] [CrossRef]
Yao, M.; Deng, H.; Feng, X.; Li, P.; Li, Y.; Liu, H. Improved Dynamic Windows Approach Based on Energy Consumption Management and Fuzzy Logic Control for Local Path Planning of Mobile Robots. Comput. Ind. Eng. 2024, 187, 109767. [Google Scholar] [CrossRef]
Zhang, J.; Ling, H.; Tang, Z.; Song, W.; Lu, A. Path Planning of USV in Confined Waters Based on Improved A∗ and DWA Fusion Algorithm. Ocean Eng. 2025, 322, 120475. [Google Scholar] [CrossRef]
Cao, Y.; Mohamad Nor, N. An Improved Dynamic Window Approach Algorithm for Dynamic Obstacle Avoidance in Mobile Robot Formation. Decis. Anal. J. 2024, 11, 100471. [Google Scholar] [CrossRef]
Wang, J.; Yang, L.; Cen, H.; He, Y.; Liu, Y. Dynamic Obstacle Avoidance Control Based on a Novel Dynamic Window Approach for Agricultural Robots. Comput. Ind. 2025, 167, 104272. [Google Scholar] [CrossRef]
Wang, J.; Wang, J.; Chen, R.; Yang, K.; Wu, B.; Qi, Y. Formation Collaborative Obstacle Avoidance with Multiple UGVs in Restricted Environments Based on Adaptive DWA. Expert Syst. Appl. 2025, 265, 125870. [Google Scholar] [CrossRef]
Abubakr, O.A.; Jaradat, M.A.; Abdel-Hafez, M.F. Intelligent Optimization of Adaptive Dynamic Window Approach for Mobile Robot Motion Control Using Fuzzy Logic. IEEE Access 2022, 10, 119368–119378. [Google Scholar] [CrossRef]
Niu, J.; Shen, C.; Wei, J.; Liu, S.; Lin, C. Path Planning of Self-Driving Vehicles Combining Ant Colony and DWA Algorithms in Complex Dense Obstacles. EAI Endorsed Trans. Energy Web 2024, 11. [Google Scholar] [CrossRef]
Zhong, L.; Tang, X.; Liu, L.; Yang, G.; Guo, K.; Sun, H. Hybrid Path Planning Algorithm Based on Improved Dynamic Window Approach. In Proceedings of the 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), Nanchang, China, 26–28 March 2021; IEEE: New York, NY, USA, 2021; pp. 57–62. [Google Scholar]
Dorigo, M.; Maniezzo, V.; Colorni, A. Ant System: Optimization by a Colony of Cooperating Agents. IEEE Trans. Syst. Man Cybern. Part B Cybern. 1996, 26, 29–41. [Google Scholar] [CrossRef]
Rashedi, E.; Nezamabadi-pour, H.; Saryazdi, S. GSA: A Gravitational Search Algorithm. Inf. Sci. 2009, 179, 2232–2248. [Google Scholar] [CrossRef]
Jiao, Z.; Ma, K.; Rong, Y.; Wang, P.; Zhang, H.; Wang, S. A Path Planning Method Using Adaptive Polymorphic Ant Colony Algorithm for Smart Wheelchairs. J. Comput. Sci. 2018, 25, 50–57. [Google Scholar] [CrossRef]
Zhao, J.; Cheng, D.; Hao, C. An Improved Ant Colony Algorithm for Solving the Path Planning Problem of the Omnidirectional Mobile Vehicle. Math. Probl. Eng. 2016, 2016, 7672839. [Google Scholar] [CrossRef]
Zhang, D.; You, X.; Liu, S.; Pan, H. Dynamic Multi-Role Adaptive Collaborative Ant Colony Optimization for Robot Path Planning. IEEE Access 2020, 8, 129958–129974. [Google Scholar] [CrossRef]

Figure 1. Path Planning and Fusion Control Framework.

Figure 2. Grid map environment model.

Figure 3. Schematic diagram of initial pheromone concentration distribution.

Figure 4. Angle diagram.

Figure 5. Path turning point optimization diagram.

Figure 6. Delivery robot kinematic model.

Figure 7. Fusion algorithm flowchart.

Figure 8. Parameter-optimized environment map.

Figure 9. Effect of different α values on path: (a) optimal path length; (b) average path length at convergence; (c) average convergence iterations; (d) average iterations to reach the shortest path.

Figure 10. Effect of different β values on the path: (a) optimal path length; (b) average path length at convergence; (c) average convergence iterations; (d) average iterations to reach the shortest path.

Figure 11. Effect of different γ values on path: (a) optimal path length; (b) average path length at convergence; (c) average convergence iterations; (d) average iterations to reach the shortest path.

Figure 12. Results validation: (a) Optimal path; (b) convergence curve.

Figure 13. Optimal path across four types of maps: (a) Map 1; (b) Map 2; (c) Map 3; (d) Map 4.

Figure 14. Optimal paths: (a) IACO in the literature [38], APACA, IHMACO, and IACO; (b) A*, Dijkstra, Best-First-Search, and IACO.

Figure 15. Convergence iteration curves for IACO in the literature [38], APACA, IHMACO, and IACO.

Figure 16. Optimal paths: (a) ACO, ZACO, MAACO and IACO; (b) A*, Dijkstra, and IACO.

Figure 17. Convergence iteration curves for ACO, ZACO, MAACO, and IACO.

Figure 18. Optimal paths: (a) ACO; (b) MRCACO; (c) AIACSE; (d) MAACO; (e) A*; (f) Dijkstra.

Figure 19. Path planning results comparison: (a) Traditional DWA; (b) Improved fusion algorithm.

Figure 20. Experimental environment in the cafeteria: (a) Actual layout of the cafeteria dining area (partial view); (b) cafeteria grid map (20 × 20).

Figure 21. Static obstacle avoidance process: (a) Avoid the first static obstacle; (b) Avoid the second static obstacle; (c) Reaching the destination.

Figure 22. Dynamic obstacle avoidance process: (a) Avoid the first dynamic obstacle; (b) Avoid the second dynamic obstacle; (c) Reaching the destination.

Figure 23. Speed Variations and Path Deviations During Obstacle Avoidance: (a) Linear Speed Output Results; (b) Path Distance Deviation.

Table 1. Relevant parameters involved in the IACO.

Parameters	K	M	G₀	σ	A	B	e₁	e₂	Q
Value	100	50	10	2	0.5	0.05	0.5	200	100

Table 2. Characteristics and design objectives of different complex environments.

Map	Features	Design Objectives
1	Simple rectangular obstacle	Testing the capability to handle simple obstacles
2	A large number of rectangular obstacles distributed uniformly and discretely	Testing the for handling multi-channel and multi-turn scenarios
3	Obstacles of varying sizes and shapes that are randomly distributed	Testing the capability to handle obstacles of varying sizes and shapes that are randomly distributed
4	Irregularly shaped complex obstacles with traps	Testing the IACO ability to handle traps and complex maps

Table 3. Performance validation results of traditional ACO and IACO on four types of maps.

Map	Algorithm	Path Length			Turn Times	Length Improve		Turn Improve
		Optimal	Mean	Std.	Best	Best	Mean	Best
1	ACO	28.042	31.642	2.583	4	2.94%	13.98%	25%
	IACO	27.218	27.218	0	3	2.94%	13.98%	25%
2	ACO	32.385	35.797	1.704	12	9.04%	17.53%	41.67%
	IACO	29.458	29.521	0.130	7	9.04%	17.53%	41.67%
3	ACO	30.385	36.655	3.502	8	4.72%	20.32%	25%
	IACO	28.951	29.206	0.223	6	4.72%	20.32%	25%
4	ACO	37.284	47.759	3.861	14	11.31%	30.03%	71.43%
	IACO	33.066	33.416	0.344	4	11.31%	30.03%	71.43%

Table 4. Comparative results of experiment 1.

Algorithm	Path Length			Turn Times	Length Improve		Turn Improve
	Optimal	Mean	Std.	Best	Best	Mean	Best
IACO in the literature [38]	30.380	35.357	4.3764	24	9.05%	21.85%	83.33%
APACA	28.038	29.184	1.0020	9	1.46%	5.32%	55.56%
IHMACO	28.038	28.038	0	4	1.46%	1.46%	0
A*	28.038	28.038	0	7	1.46%	1.46%	42.86%
Dijkstra	28.038	28.038	0	7	1.46%	1.46%	42.86%
Best-First-Search	28.038	28.038	0	7	1.46%	1.46%	42.86%
IACO	27.630	27.630	0	4	-	-	-

Table 5. Comparative results of comparative experiment 2.

Algorithm	Path Length			Iteration			Turn Times	Path Improve		Turn Improve
Algorithm	Optimal	Mean	Std.	Best	Mean	Std.	Best	Best	Mean	Best
ACO	29.21	33.55	1.58	49	54.35	2.26	11	4.54%	16.89%	54.55%
ZACO	29.21	-	-	5	-	-	10	4.54%	-	50%
MAACO	29.21	29.21	0	9	9.8	0.72	7	4.54%	4.54%	28.57%
A*	29.21	29.21	0	-	-	-	9	4.54%	4.54%	44.44%
Dijkstra	30.04	30.04	0	-	-	-	9	7.18%	7.18%	44.44%
IACO	27.88	27.88	27.88	3	4.7	1.1	5	-	-	-

Table 6. Comparative results of experiment 3.

Algorithm	Path Length			Turn Times	Path Improve		Turn Improve
Algorithm	Optimal	Mean	Std.	Best	Best	Mean	Best
ACO	43.3553	47.9413	1.3199	13	4.36%	13.51%	69.23%
MRCACO	42.7696	42.7696	0	9	3.05%	3.05%	55.56%
AIACCSE	44.5269	44.5269	0	17	6.88%	6.88%	76.47%
MAACO	42.7696	42.7696	0	7	3.05%	3.05%	42.86%
A*	42.7696	42.7696	0	9	3.05%	3.05%	55.56%
Dijkstra	42.7696	42.7696	0	9	3.05%	3.05%	55.56%
IACO	41.4635	41.4635	0	4	-	-	-

Table 7. Kinematics Parameters of Delivery Robots.

Parameters	Value
Maximum linear velocity (m/s)	1
Maximum angular velocity ((°)/s)	20
Acceleration (m/s²)	0.2
Angular acceleration ((°)/s²)	50
Linear velocity resolution (m/s)	0.01
Angular velocity resolution (°/s)	1

Table 8. Experimental data for the traditional DWA algorithm and the improved fusion algorithm.

	Path Length (m)	Number of Control Nodes	Times (s)
Traditional DWA	21.362	487	188.2463
Improved fusion algorithm	19.940	479	185.5991

Table 9. Obstacle Avoidance Experiment Statistics.

	Path Length (m)	Number of Control Nodes	Time (s)	Minimum Distance (m)
Experiment (1)	28.452	766	386.98	0.2
Experiment (2)	28.940	800	419.25	0.2

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, L.; Hu, T.; Wei, J.; Guo, Y.; Tong, X.; Ding, J.; Yang, H.; Zhong, B. Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach. Sensors 2026, 26, 72. https://doi.org/10.3390/s26010072

AMA Style

Huang L, Hu T, Wei J, Guo Y, Tong X, Ding J, Yang H, Zhong B. Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach. Sensors. 2026; 26(1):72. https://doi.org/10.3390/s26010072

Chicago/Turabian Style

Huang, Limin, Tao Hu, Jiabao Wei, Yifeng Guo, Xubin Tong, Jiaxin Ding, Hao Yang, and Bin Zhong. 2026. "Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach" Sensors 26, no. 1: 72. https://doi.org/10.3390/s26010072

APA Style

Huang, L., Hu, T., Wei, J., Guo, Y., Tong, X., Ding, J., Yang, H., & Zhong, B. (2026). Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach. Sensors, 26(1), 72. https://doi.org/10.3390/s26010072

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Path Planning for Delivery Robots Based on an Improved Ant Colony Optimization Algorithm Combined with Dynamic Window Approach

Abstract

1. Introduction

2. An Improved ACO (IACO) for Global Path Planning

2.1. Establish a Grid-Based Environmental Model

2.2. Traditional Ant Colony Optimization

2.3. Improvements to Ant Colony Optimization

2.3.1. Initial Pheromone Distribution Rules

2.3.2. Improved State Transition Probability Rules

2.3.3. Improve the Pheromone Update Method

2.3.4. Path Redundancy Inflection Point Optimization

3. Fusion Path Planning Algorithm

3.1. Dynamic Window Approach for Local Path Planning

3.1.1. Establishment of a Robot Motion Model

3.1.2. Establish a Speed Zone

3.1.3. Optimization of the Evaluation Function

3.2. Dynamic Path Planning Algorithm Fusion

4. Simulation Experiments and Results Analysis

4.1. Determination of Key Parameters for the IACO

4.2. Comparative Simulation Experiments of the IACO

4.2.1. Performance Validation in Different Complex Environments

4.2.2. Comparative Experiments with Other Algorithms

4.3. Fusion Algorithm Simulation Experiments

4.3.1. Experimental Evaluation and Analysis of Fusion Algorithm Effectiveness

4.3.2. Experimental Analysis of Fusion Algorithm Obstacle Avoidance

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI