1. Introduction
With the development of intelligent robotics technology, multi-robot systems are increasingly applied in fields such as industry, logistics, agriculture, and rescue operations. In modern logistics warehouses, multiple autonomous robots often need to collaborate to complete tasks such as goods handling and sorting. These robots must not only avoid shelves and walls but also navigate in real-time to avoid other robots and workers. In modern agriculture, robots frequently move efficiently in complex field environments to perform tasks such as crop monitoring, fertilizing, and spraying. However, path planning for multi-robot systems in complex dynamic environments remains a challenging research problem. Path planning not only needs to ensure that robots can successfully reach their target points but also must address a series of issues, such as avoiding unknown obstacles, real-time avoidance of dynamic obstacles, and coordination of robot-to-robot conflicts. Traditional path planning algorithms, such as the A* algorithm [
1], genetic algorithms [
2], ant colony optimization [
3], zebra optimization algorithm [
4], grey wolf optimization [
5,
6], and whale optimization algorithms [
7,
8], excel in static environments but exhibit limitations in dynamic settings. They struggle to effectively manage real-time obstacle avoidance and multi-robot coordination. In such ever-changing environments, planning algorithms must also possess the robust capability to counter various uncertainties [
9,
10].
Yang et al. [
11,
12,
13] were the pioneers in applying the Shunting neural network model to mobile robot path planning, proposing a bio-inspired neural network algorithm. This algorithm, which does not require any neural network learning or training process, has demonstrated good real-time performance and has found widespread application. Building on the foundation of bio-inspired neural networks, Tang [
14] introduced the template model method and jump point search algorithm to address the original algorithm’s inability to fully cover and lock adjacent obstacles. Luo [
15] incorporated the multi-scale mapping method into bio-inspired neural networks, reducing the time cost and mathematical complexity of the path planning algorithm. Han et al. [
16] proposed a novel CCPP strategy, where the planned path of the robot depends not only on neuron activity but also on the distribution of obstacles in the environment map. They used a proposed path backtracking algorithm to reduce path redundancy. Qu et al. [
13] addressed the issue of neuron signals being confined to local propagation by proposing a method to propagate neuron waveforms, linking the propagation distance to signal strength. Wang [
16] tackled the problem of low neuron activity values near boundaries and obstacles by proposing an improved BINN, effectively optimizing the movement paths of multiple robots. Analysis reveals that past research has, to varying degrees, improved the neural network algorithm’s tendency to fall into local optima. However, these improvements have not adequately balanced the algorithm’s local and global optimization capabilities, and in complex environments, the algorithm fails when dynamic obstacles appear.
The Dynamic Window Approach (DWA) algorithm, introduced by Fox et al. [
17], is a classic method that ingeniously transforms the problem of positional constraints into a velocity constraint problem, thus converting obstacle avoidance into an optimal velocity execution issue. Xu et al. [
18] proposed a Parameter-Adaptive DWA (PA-DWA) algorithm, addressing the limitations of the traditional DWA in complex environments with numerous obstacles, where fixed parameters fail to balance safety and speed effectively. Gong et al. [
19] improved the original DWA by enhancing the speed cost function and adding a distance cost function for the target point, enabling the robot to continuously adjust and adapt to unknown environments. Chang et al. [
20] modified and expanded the existing evaluation function by introducing two new evaluation functions to enhance global navigation performance. Liu et al. [
21] addressed the poor adaptability of fixed DWA evaluation function weights in complex environments by designing a fuzzy controller to improve the algorithm’s adaptability. Mai et al. [
22] introduced an obstacle density-related evaluation factor into the evaluation function for environments with dense obstacles, further enhancing the efficiency of the DWA in complex settings, though it still falls short of achieving globally optimal paths. Currently, the standalone DWA algorithm struggles with processing complex maps and is prone to falling into “traps” and local optima.
The existing research has improved neural network algorithms and the DWA algorithm from different perspectives. However, studies that simultaneously consider environmental information, obstacle avoidance capabilities, and coordination between multiple robots in multi-robot path planning are relatively scarce. To address these challenges, this paper proposes a dual-layer symmetric path planning method based on an improved neural network-DWA algorithm. This method not only enables robots to achieve more efficient and real-time path planning in complex dynamic environments but also significantly enhances the collaborative capabilities and safety of multi-robot systems. The main contributions of this paper are as follows:
(1) In the traditional GBNN algorithm, the propagation of neuronal activation signals is typically limited to a local range, causing the algorithm to easily fall into local optima and fail to effectively explore the global space, resulting in low global optimization capability. To address this issue, we introduce the concept of single-neuron signal waveform propagation. The signals from the target neuron propagate like waves across the global space, allowing the activation signals to span a larger area and influence more neurons. This expands the search range, enhances the global optimization ability, and establishes a path optimization function to remove redundant points from the path, improving its smoothness.
(2) To address the obstacle avoidance requirements in complex dynamic environments, a reward function is introduced to improve the traditional Score function in the DWA algorithm. This effectively reduces detour paths and minimizes the probability of deviation from the planned trajectory, ensuring real-time obstacle avoidance while maintaining the optimality of the global path.
(3) When multiple robots encounter each other, traditional priority assignment methods usually rely on static rules, which can lead to low-priority robots waiting for long periods or result in unreasonable path planning. To address this issue, the dynamic priority rule proposed in this paper adjusts the priority in real-time based on the robot’s distance to the target point. Robots closer to the target point are assigned higher priority and are given tasks first. When two robots are at the same distance from the target point, the robot with the smaller ID will have a higher priority, indicating a more urgent task. This allows robots to flexibly adjust their travel order based on the actual situation, effectively avoiding conflicts between robots and improving the overall efficiency and coordination of the system.
(4) This paper proposes an improved dual-layer symmetric path planning algorithm based on the improved GBNN algorithm and the enhanced DWA, enabling multi-robots to achieve real-time obstacle avoidance in dynamic environments while effectively addressing the conflicts between robots.
For the multi-robot path planning problem, this paper employs a dual-layer symmetric path planning method based on an improved neural network-DWA algorithm. First, a signal waveform propagation model for single-neuron signals is established. Then, the Score function of the DWA algorithm is improved. Next, a dynamic priority rule is applied to effectively resolve conflicts between robots. Finally, the improved GBNN algorithm is combined with the enhanced DWA.
The structure of the rest of this paper is as follows:
Section 2 introduces the conflict model for multi-robot systems and the improved priority algorithm.
Section 3 presents the improved global path planning method.
Section 4 introduces the improved local path planning method.
Section 5 discusses the proposed method.
Section 6 provides the experimental simulation results.
Section 7 concludes with the conclusion and outlook.
4. Local Path Planning
4.1. DWA Algorithm
The traditional Dynamic Window Approach (DWA) is an obstacle avoidance algorithm grounded in local path planning, widely utilized in the navigation of mobile robots. The essence of the DWA algorithm lies in its real-time dynamic sampling of the robot’s velocity space, generating a set of potential velocity pairs (linear and angular velocities). From this set, the algorithm selects the most optimal velocity pair, ensuring that the robot can advance towards its target while deftly circumventing obstacles.
Within a specified time window, the DWA algorithm initially samples the velocity space based on the robot’s current speed, generating multiple candidate velocity pairs. This velocity space is constrained by the robot’s maximum acceleration, deceleration, and current speed limitations.
Here, and represent the robot’s minimum and maximum linear velocities, respectively.
Following the sampling of the velocity space, the DWA algorithm predicts trajectories for each velocity pair (
v,
w), simulating the robot’s motion over a future time interval T. The robot’s trajectory can be described by the following differential equations:
Here, denote the robot’s coordinates at time t, while represents the robot’s orientation angle. After conducting a search and sampling within the velocity space, the DWA algorithm simulates several feasible trajectories corresponding to multiple velocity combinations (v, ). Subsequently, each trajectory is assessed using a scoring function. The traditional DWA evaluation function encompasses three criteria: the goal orientation function, the obstacle clearance function, and the velocity function, expressed as follows:
Obstacle avoidance function:
Velocity smoothness function:
Comprehensive evaluation function:
Here, , , and serve as weighting parameters to balance the significance of various evaluation metrics.
The DWA algorithm assigns scores to all possible speed pairs (
v,
) using the evaluation function and selects the pair with the highest score as the control input for the robot’s movement at the next moment.
This study focuses on non-holonomic mobile robots, which are restricted to forward movement, turning, and waiting in place. As illustrated, we assume that the robot’s trajectory can be segmented into multiple time slices. Within each time slice
, the robot can be approximated as moving in uniform straight-line motion, and its kinematic model can be expressed as follows:
As depicted in
Figure 7, the variables
,
, and
represent the robot’s position along the x and y axes and its orientation at time
t, respectively. The range of linear velocity
is contingent upon the robot’s closest distance to obstacles and its maximum linear deceleration. Meanwhile, the variation in angular velocity
is determined by both the robot’s proximity to obstacles and its maximum angular deceleration.
4.2. Enhanced DWA Algorithm
The conventional DWA algorithm’s comprehensive evaluation function primarily takes into account the distance to the target, obstacle avoidance capability, and smoothness of velocity. However, this approach can lead the robot to opt for longer detours or excessive and unnecessary maneuvers in complex environments, thereby diminishing the efficiency of path planning.
In path planning, factors such as the shortest route and minimal turns should be considered. This paper’s reward function is based on an improved neural network algorithm for static path planning. Specifically, during local path tracking, we establish a search circle with a radius r around the robot’s current position (
,
). If the path point generated by the neural network (
,
) lies within this search circle, the robot receives a reward score for that local path. This reward mechanism effectively guides the robot to prioritize movement along the static planned path, thereby reducing the likelihood of deviating from the optimal route and minimizing the occurrence of detours and unnecessary turns. The reward function can be expressed as
Here,
represents the reward weight parameter, which adjusts the influence of the reward score within the comprehensive evaluation function;
is the indicator function, taking a value of 1 when the neural network path point (
,
) is within the search circle, and 0 otherwise. The final comprehensive evaluation function can be expressed as:
represents the distance score, indicating the robot’s progression towards the target point. denotes the obstacle avoidance score, signifies the velocity smoothness score, and is the newly introduced reward function, designed to encourage the robot to advance along a more optimal path. The parameters , , , and serve as weights for these four evaluation functions, balancing the influence of various factors on path selection.
5. Improved Neural Network-DWA Dual-Layer Symmetric Path Planning Algorithm
While the standalone improved neural network algorithm can effectively generate a collision-free path from the starting point to the target, its performance often falls short when confronted with dynamic or unknown obstacles in the environment. Furthermore, in maps characterized by complex static obstacles, relying solely on the DWA algorithm for local path planning may lead the robot to become trapped in local optima, ultimately preventing it from reaching its destination.
To address this shortcoming, this paper presents a method that integrates the improved neural network algorithm with the enhanced DWA algorithm, leveraging the strengths of both approaches. In this hybrid algorithm, the improved neural network is tasked with global path planning, ensuring the generation of a collision-free route in static environments, while the enhanced DWA algorithm focuses on dynamic obstacle avoidance, managing moving or unknown obstacles in real-time settings. With the support of this integrated approach, the robot can not only rely on the global path provided by the neural network but also benefit from the dynamic adjustments of the DWA algorithm, enabling a responsive navigation through complex environments.
The specific algorithm flow is shown in
Figure 8. In the first layer of path planning, the robot first models the complex static environment using a grid-based method. Then, after the target neuron is activated, the waveform propagates the neuron activity. Using the neural network algorithm, a global collision-free shortest path is planned for the robot, which it will follow to start moving. The waveform propagation model of the target neuron is represented by Equations (6)–(9), and the optimal path point set is defined by Equation (
13). During the robot’s movement along the global path, if dynamic obstacles are encountered, the process transitions to the second layer of path planning—local path planning based on the improved DWA algorithm. First, the robot determines its current state, and then selects the trajectory with the optimal evaluation function value based on its current state. Once the robot determines that the distance to the obstacle has reached a safe range, it exits the second layer of path planning and continues moving along the global path toward the target point. The final comprehensive evaluation function is defined by Equation (
23). If the robot encounters conflicts due to interactions with other robots during movement, a priority allocation is performed, and the robot continues moving toward the target point.
6. Experiments
To validate the path planning method of the improved neural network and DWA algorithm under complex road conditions and to better illustrate the algorithm’s performance, this paper conducts simulation experiments from three perspectives. The first direction focuses on performance analysis, which includes a comparative study of the algorithm’s effectiveness before and after improvements in a static environment, as well as comparisons with other algorithms to confirm its overall superiority. The second direction examines dynamic environments, assessing the flexibility of this method in addressing robot collision conflicts and encountering unknown or dynamic obstacles. The third direction explores complex environments, where the introduction of more intricate known obstacles, unknown obstacles, and dynamic obstacles within a multi-robot system demonstrates the comprehensive advantages of this algorithm in tackling complex challenges. The simulations were conducted using MATLAB R2022B on a Windows 11 platform.
6.1. Algorithm Performance Experiments
Due to the limitations of traditional neural network algorithms, robots often generate excessive turns in complex environments, resulting in increased path lengths. As illustrated in
Figure 9, the dashed line represents the path planned by the improved neural network model proposed in this paper, while the solid line denotes the path planned using the optimized function incorporated into the improved neural network model. The experimental setup comprises a 21 × 21 grid map with 51 obstacles and 3 robots.
R1–R3 represent the starting points of each robot, T1–T3 represent the target points of each robot, the dashed lines represent the original paths of the robots, and the solid lines represent the optimized paths.By analyzing
Figure 9, we can obtain the data presented in
Table 1.
Through
Table 1, we observe that the improved neural network optimization demonstrates an increase in turning frequency of as much as 100% and no less than 75% compared to the previous optimization. The path distance shows a maximum enhancement of 10% and a minimum of 6%, significantly enhancing both the smoothness and length of the path.
To validate the performance of the improved neural network—DWA algorithm, we compared it with traditional approaches such as the Genetic Algorithm (GA), Ant Colony Optimization (ACO), Hybrid Genetic Algorithm (HGA), and Bio-inspired Neural Network Algorithm (BINN). The results are shown in
Figure 10.
From
Figure 10, it is evident that the path planned by our method significantly outperforms those generated by other algorithms in terms of path distance and turning frequency. To mitigate the effects of randomness and ensure the accuracy of our results, we conducted 20 repeated simulation experiments. The analyzed experimental data is presented in
Figure 11 and
Table 2.
Through the repeated experimental data presented in
Table 2, we found that the paths planned using the proposed method achieve the shortest path length and the lowest total number of turns compared to other methods. In terms of path length, the proposed method demonstrates improvements of up to 19.74% over other algorithms, and in terms of the number of turns, improvements reach up to 93.17%. Additionally, in the repeated experiments, the variance, standard error, interquartile range (IQR), and coefficient of variation (CV) of the proposed method are all zero. Compared to other methods, the proposed method significantly improves path quality while exhibiting excellent robustness and stability.
6.2. Experiments in Dynamic Environments
In dynamic environments, robots must navigate various uncertainties, such as encountering conflicts, unknown obstacles, and dynamic barriers. These challenges are particularly pronounced in multi-robot systems. The improved neural network proposed in this paper—the DWA algorithm—introduces a dynamic priority mechanism, enabling robots to automatically slow down, brake, or change direction when approaching other robots, unknown obstacles, or dynamic barriers, thereby preventing collisions.
In the experimental environment, we designate black as known obstacles, gray as unknown obstacles, yellow as dynamic obstacles, and the blue dashed line represents the global path planned by the robot in a static environment, triangles represent the starting points of the robots, and asterisks represent the target points of the robots. As illustrated in
Figure 12, at time t = t0, Robot 1 (R1) and Robot 3 (R3) encounter an unknown obstacle. Utilizing the improved DWA algorithm, both robots executed turning maneuvers, successfully avoiding the obstacle and continuing toward their target. A dynamic obstacle suddenly appeared in the path of Robot 2 (R2), moving to the right at a certain speed. To prevent a collision with this dynamic obstacle, R3 automatically activated its braking mechanism as it approached, temporarily halting its movement.
As depicted in
Figure 13, at time t = t1, R2 has successfully navigated around the dynamic obstacle and continues to advance toward the target point along its planned path.
As illustrated in
Figure 14, at time t = t2, the three robots converge, with their current distances from the target point as follows: R3 < R1 < R2. Consequently, the dynamic priority among the robots is established as R3 > R1 > R2. R1 and R2 automatically engage their braking mechanisms, momentarily halting their movements, while R3 proceeds toward the target point.
As depicted in
Figure 15, at time t = t3, the dynamic priority is established as R1 > R2. To prevent a collision with R1, R2 automatically activates its braking mechanism, temporarily halting its movement, while R1 and R3 continue their advance toward the target.
As illustrated in
Figure 16 and
Figure 17, at time t = t4, the three robots have successfully navigated around all obstacles and adeptly managed any potential conflicts during their encounters. They continue their journey toward the target point and arrive there at time t = t5.
6.3. Experiments in Complex Environments
To further verify the practicality and reliability of the proposed method, an in-depth study was conducted in a complex environment. In this study, several unknown static obstacles were placed in a known complex environment, and dynamic obstacles were also introduced, further increasing the complexity and risk of the environment. Finally, the proposed method was validated using three mobile robots, labeled as R1, R2, and R3, with their movements illustrated in
Figure 18,
Figure 19,
Figure 20 and
Figure 21. In the figures, gray represents temporarily added unknown obstacles, and yellow represents dynamic obstacles moving to the right.
Table 3 shows the kinematic parameters of the mobile robots.
As shown in
Figure 18, at time t = t0, R1, R2, and R3 are moving toward their target points along the paths planned by the improved neural network algorithm.
Robots R1, R2, and R3 travel along their pre-planned paths. When a robot detects unknown obstacles during its movement, it will automatically replan a local path to avoid the unknown obstacles. As shown in
Figure 19, at time t = t1, robots R1, R2, and R3 successfully avoid static unknown obstacles by replanning their local paths. After avoiding the obstacles, they continue along their original paths toward their target points. Meanwhile, when robot R3 detects a dynamic obstacle, it triggers the automatic braking function. R3 stops briefly by braking and resumes movement after detecting that the dynamic obstacle has moved away, thereby avoiding the dynamic obstacle and improving the safety of the mobile robot’s operation.
During the movement of multiple robots, conflict issues may arise when robots encounter each other. We determine the dynamic priority of the robots based on their current distance from their target points. As shown in
Figure 20, at time t = t2, R1 and R2 encounter each other. The current distance to the target point is R1 < R2, and therefore, the dynamic priority is R1 > R2. R1 continues its movement, while R2 triggers the automatic braking function, stopping briefly. Once R2 detects that R1 has left, it resumes moving forward, thereby resolving the conflict issue when multiple robots meet.
As shown in
Figure 21, at time t = t3, R1, R2, and R3 have all successfully reached their target points, successfully avoiding both dynamic and unknown obstacles. Additionally, the coordination issues between the robots have been successfully resolved.
In summary, the experimental results demonstrate that the dual-layer symmetric path planning algorithm, which integrates the improved neural network algorithm and the DWA, can efficiently and effectively plan global paths for robots in complex dynamic environments. Furthermore, when dynamic or unknown obstacles appear on the path, robots can bypass these obstacles using local path planning and braking mechanisms, ensuring their safety. Finally, for conflict issues arising from encounters between multiple robots, the proposed method resolves these conflicts based on the dynamic priority rule. The experimental results indicate that the proposed algorithm is well-suited for complex dynamic environments, enabling robots to plan collision-free paths and significantly improving the adaptability and safety performance of multi-robot systems.