Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA)

Li, Juan; Lu, Houtong; Zhang, Honghan; Zhang, Zihao

doi:10.3390/jmse13020221

Open AccessArticle

Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA)

¹

Key Laboratory of Underwater Robot Technology, Harbin Engineering University, Harbin 150001, China

²

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(2), 221; https://doi.org/10.3390/jmse13020221

Submission received: 26 December 2024 / Revised: 17 January 2025 / Accepted: 23 January 2025 / Published: 24 January 2025

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Versions Notes

Abstract

A dynamic distributed target hunting method is proposed for the problem of distributed moving target hunting by multiple Autonomous Underwater Vehicles (AUVs). By integrating the improved Dynamic Window Approach (DWA) with the Rapidly-exploring Random Tree (RRT) algorithm and incorporating collision avoidance rules between AUVs into the evaluation system of the DWA, the collision avoidance rules are quantified, and corresponding evaluation functions are established. This allows for the selection of motion trajectories that comply with the collision avoidance rules from the predicted trajectory set, improving the obstacle avoidance capability during AUV motion planning and enhancing the reliability of the target hunting task. The introduction of a consistency algorithm maintains the consistency of the group task information and ensures that the hunting strategy can be adjusted promptly in the event of an AUV failure, allowing the target hunting task to continue. Polynomial regression algorithms are used to predict the moving target’s trajectory. Based on a polygonal hunting formation, the hunting potential points are dynamically allocated, and, finally, each AUV executes distributed motion planning towards the hunting potential points to form the hunting formation. Simulation results show that the proposed method achieves efficient multi-AUV-distributed dynamic target hunting.

Keywords:

autonomous underwater vehicle; motion planning; obstacle voidance; target hunting

1. Introduction

The Autonomous Underwater Vehicle (AUV) is a complex system integrating sensor detection, information fusion, vehicle control, and other technologies and has no physical connection with the mother ship [1]. AUVs can play an increasingly important role in the exploration and exploitation of marine resources, and AUVs can fulfill a wide range of underwater tasks in both civil and military applications [2]. Motion planning is the foundation of AUVs for underwater missions, but underwater environments are harsh, with not only static but also dynamic obstacles, so it is important for AUVs to have localized planning capabilities [3]. AUVs can accomplish many tasks once motion planning is achieved, such as dynamic target hunting tasks, which have an important role in the military field [4].

The Artificial Potential Field (APF) [5], Velocity Obstacle (VO) [6], Dynamic Window Approach (DWA) [7], and Time Elastic Band (TEB) [8] algorithms can all be used for autonomous collision avoidance in robots. Among them, the DWA algorithm has dynamic planning capabilities, but its collision avoidance performance is poor in complex environments [9]. Molinos et al. proposed the DW4DO algorithm and the DW4DOT algorithm to handle dynamic obstacles. These algorithms increase the data dimensions stored in the grid and can predict multiple velocities, making dynamic obstacle avoidance safer and more efficient compared to basic algorithms [10]. Liu et al. optimized the trajectory evaluation function of the DWA algorithm to guide the DWA algorithm for local path planning and dynamic obstacle avoidance by using the critical nodes extracted from the global path planned by the A* algorithm [11]. Song et al. used an improved ant colony algorithm for global path planning in static environments and then improved the DWA algorithm by removing redundant nodes, optimizing the initial direction, and improving the evaluation function to achieve path planning in the case of avoiding static obstacles and dynamic obstacles [12]. By analyzing the above literature, it can be concluded that reference [10] can predict multiple velocities, but it still does not fully cover the entire dynamic window. The improved DWA algorithm proposed in reference [11] achieves better obstacle avoidance, but its performance in avoiding unknown dynamic obstacles is poor, leading to a higher risk of collisions. Reference [12] suggests combining the ant colony algorithm with DWA, which works well in known environments but is not suitable for environments with unknown obstacles.

The advantage of the RRT algorithm lies in its ability to efficiently find feasible paths in high-dimensional, complex, and unknown environments [13]. Wang et al. proposed the HA-RRT algorithm, which realizes two-dimensional path planning by introducing dynamic factors to improve the convergence efficiency of the algorithm as well as adaptive adjustment strategies to adapt to the complex marine environment [14]. Li et al. proposed an algorithm combining DWA and RRT to realize path planning in complex static obstacle environments by improving the period of the whole dynamic window to generate the velocity space and planning the guidance points in the localized region with the RRT algorithm [15]. From the analysis of the above literature, it can be concluded that the RRT algorithm can perform path planning to avoid obstacles and reach the global goal point in unknown environments. However, the generated path does not consider the constraints of the moving objects, and, in some cases, the robot may not be able to follow the planned path completely, leading to task failure. Additionally, the algorithm lacks the ability to avoid dynamic obstacles.

In terms of dynamic target hunting technology, the research not only includes accurate prediction of the target’s trajectory but also covers how AUV groups work together to form a hunting formation and how to adjust tactics and strategies when the target dynamics change to ensure mission success.

In the area of moving target trajectory tracking and prediction, Zhu et al. developed a set of target trajectory prediction models based on LSTM networks, which can predict the posture in the battlefield environment and provide tactical decision-making references for combatants [16]. Wang et al. developed a modified Mean Shift tracking algorithm to solve the difficult problem of position prediction when the target is fast moving or occluded and accurately predicted the position by inter-frame clustering motion and scale estimation [17]. Zhang et al. proposed a heterogeneous model integration method of the inverse optimization propagation neural network and physical model for whale algorithm (WOA) optimization, which can predict the underwater position and surface position of an underwater glider [18]. It can be seen that advances in trajectory prediction techniques are closely dependent on artificial intelligence algorithms, especially neural-network-based methods, which in turn cannot succeed without adequate training data.

In terms of target interception point computation, Meng et al. proposed a prediction and planning-based interception algorithm in the field of harbor protection, which mainly operates on potentially hazardous targets approaching the harbor, where an artificial potential field algorithm is utilized to design the interception path. In addition, the authors consider the influence of environmental factors such as the direction and speed of the sea currents and propose corresponding coping strategies [19]. Lan et al. proposed a distributed control strategy for the multi-target interception problem of evacuation targets, using a dynamic assignment method to assign a tracking robot to each target. By analyzing the target trajectory and the movement paths of other robots, the interception target of each robot is determined, and a one-to-one interception task is performed [20]. Tahir et al. adopted an improved proportional navigation guidance algorithm to intercept maneuvering targets by estimating the acceleration of the maneuvering target through a prediction algorithm, estimating the target state after obtaining the acceleration information, and then performing the interception [21].

The article proposes the following basic assumptions:

Assumption 1: Obstacles and AUVs are on the same horizontal plane.
Assumption 2: AUVs can obtain environmental information using sonar and other sensor equipment, including the positions of obstacles and the size and speed of dynamic obstacles.
Assumption 3: AUVs navigate along the planned path, and their motion parameters are consistent with the planned parameters.
Assumption 4: The motion state information of AUVs, including position, heading angle, and speed, can be shared within the cluster.

Existing hunting research emphasizes inter-robot collaborative approaches to enhance hunting efficiency and reduce resource waste. For potential collision risks, traditional local planning strategies such as artificial potential fields are mostly used to avoid them, the motion planning is ineffective, the prediction accuracy and validity of interception points need to be improved, and the strategy of the hunting formation needs to be improved. Aiming at the above problems, this paper proposes a dynamic target hunting strategy under AUV motion planning based on improved DWA. The main innovations include the following:

A motion planning algorithm based on improved DWA is proposed to enhance the collision avoidance performance of AUVs in complex static obstacle environments and dynamic obstacle environments.
Setting up multi-AUV-distributed collision avoidance rules and integrating them into the evaluation system of DWA, quantifying the collision avoidance rules, and establishing the corresponding rule evaluation function so as to optimize the motion trajectories conforming to the collision avoidance rules among the predicted set of trajectories.
A consistency algorithm is introduced to ensure the consistency of multi-AUV information and mission continuity in the case of leader failure. Dynamic target trajectories are predicted by polynomial regression, and hunting potential points are dynamically assigned according to the polygonal hunting formation, which is formed by combining the distributed motion planning of each AUV.

2. Task Element Modeling

2.1. AUV Kinematic Modeling

This paper simplifies the modeling by taking the Minesniper MkII torpedo-type AUV as the research object [22], and an AUV kinematic model was introduced. Define

S_{k} = \{x_{k}, y_{k}, φ_{k}\}

as the AUV state vector. In the above equation,

\{x_{k}, y_{k}\}

represents the position of the AUV at moment

k

, and

φ_{k}

represents the AUV bow angle. The AUV kinematic equation can be simplified to

S_{k + 1} = S_{k + 1} + f (S_{k}, v_{k}, {\dot{φ}}_{k})

, and

\begin{array}{l} f (S_{k}, v_{k}, {\dot{φ}}_{k}) = \\ [\begin{matrix} (v_{k} + {\dot{v}}_{k}) \cos (φ_{k} + {\dot{φ}}_{k} \cdot Δ t) \cdot Δ t \\ (v_{k} + {\dot{v}}_{k}) \sin (φ_{k} + {\dot{φ}}_{k} \cdot Δ t) \cdot Δ t \\ φ_{k} + {\dot{φ}}_{k} \cdot Δ t \end{matrix}] \end{array}

In the equation,

v_{k}

represents the velocity of AUV movement,

{\dot{v}}_{k}

represents the acceleration of AUV movement, and

{\dot{φ}}_{k}

represents the angular velocity of AUV movement. The following are the kinematic constraints of the AUV in this paper:

\{\begin{matrix} 0 \leq v_{k} \leq 2 \\ - \frac{π}{3} \leq {\dot{φ}}_{k} \leq \frac{π}{3} \end{matrix}

2.2. Forward-Looking Sonar Detection Model

Considering the underwater environment, a mathematical model was developed based on the working principle of sonar [23]. Therefore, in this paper, the radius of detection of the forward-looking sonar equipped by the navigator is 100 m in which the horizontal detection angle is 120°, the operating frequency is 2 KHz, and the method of matrix array statistics is used to deal with the information within the range of the sonar opening angle. The forward-looking sonar can be simply expressed as shown in Figure 1:

Objects that meet the following conditions can be detected by sonar:

\{\begin{matrix} \frac{|y_{t}|}{\sqrt{x_{t}^{2} + y_{t}^{2}}} \leq \sin \frac{α}{2} \\ \sqrt{x_{t}^{2} + y_{t}^{2}} \leq R \end{matrix}

Additionally,

(x_{t}, y_{t})

can be expressed as follows:

\{\begin{cases} x_{t} = x - x_{0} \\ y_{t} = y - y_{0} \end{cases}

In the equation,

(x, y)

is the coordinates of the target in the hull coordinate system.

In addition,

(x_{0}, y_{0})

is the coordinates of the AUV configured sonar in the hull coordinate system, and

(x_{t}, y_{t})

indicates the relative position of the target in relation to this AUV.

2.3. Static Obstacle Modeling

This paper uses grid map modeling and distinguishes between passable and impassable areas using black and white colors. The former represents obstacle regions, while the latter represents feasible areas. The obstacle model consists of irregular single or continuous static grids, which can simulate complex underwater environments with obstacles [24].

2.4. Dynamic Obstacle Modeling

In order to enhance the safety of avoiding dynamic obstacles, the dynamic obstacle is expanded into a circle of equal diameter based on its body length, and assuming that the motion is linear in the localized region, the kinematic equation can be expressed as follows:

[\begin{array}{l} x_{k + 1} \\ {\dot{x}}_{k + 1} \\ y_{k + 1} \\ {\dot{y}}_{k + 1} \end{array}] = [\begin{matrix} 1 & T & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & T \\ 0 & 0 & 1 & 1 \end{matrix}] [\begin{array}{l} x_{k} \\ {\dot{x}}_{k} \\ y_{k + 1} \\ {\dot{y}}_{k + 1} \end{array}] + [\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{matrix}] [\begin{array}{l} ω_{k} \\ {\ddot{x}}_{k} \\ ω_{k} \\ {\ddot{y}}_{k} \end{array}]

In the equation,

x_{k}

,

{\dot{x}}_{k}

, and

{\ddot{x}}_{k}

represent the position, velocity, and acceleration of the target in the x-axis direction at moment k;

y_{k}

,

{\dot{y}}_{k}

, and

{\ddot{y}}_{k}

represent the position, velocity, and acceleration of the target in the x-axis direction at moment k;

ω_{k}

is Gaussian white noise;

T

is the sampling time.

3. Obstacle Avoidance Motion Planning for AUV Based on DWA and RRT

3.1. Basic DWA Algorithm

The basic DWA selects the optimal trajectory to maximize the distance between the robot and obstacles, maximize speed, and achieve a heading closest to the target [25]. The robot moves with the velocity combination of the selected trajectory into the next motion cycle. This process is repeated to achieve online real-time motion planning. The pseudocode for the DWA algorithm (Algorithm 1) is as follows:

Algorithm 1 Dynamic Window Approach

Input: current position robotPose, target point robotGoa, model parameter robotModel
Output: motion trajectory dataset robotTrajectory

1: BEGIN
2:   desiredV = calculateV(robotPose,robotGoal)
3:   laserscan = readScanner()
4:   allowable_v = generateWindow(robotV, robotModel)
5:   allowable_w = generateWindow(robotW, robotModel)
6:   for each v in allowable_v
7:     for each w in allowable_w
8:     dist = find_dist(v,w,laserscan,robotModel)
9:     breakDist = calculateBreakingDistance(v)
10:    if (dist > breakDist)
11:     heading = hDiff(robotPose,goalPose, v,w)
12:     clearance = (dist − breakDist)/(dmax − breakDist)
13:     cost = costFunction(heading,clearance, abs(desired_v −v))
14:       if (cost > optimal)
15:       best_v = v
16:       best_w = w
17:       optimal = cost
18: set robotTrajectory to best_v, best_w
19:END

The functions in the code are explained as follows:

DWA(): The main function of the DWA algorithm.

calculateV(): Calculates the desired velocity based on the robot’s current position and the target position.

readScanner(): Reads data from the distance sensor and returns information about obstacles in the surrounding environment.

generateWindow(): Generates a feasible velocity window based on the current speed and the robot’s model.

generateWindow(): Generates a feasible angular velocity window based on the current angular velocity and the robot’s model.

find_dist(): Calculates the safe driving distance where the robot will not collide with any obstacles using sensor data and the robot’s model.

calculateBreakingDistance(): Computes the distance required for the robot to come to a complete stop.

hDiff(): Evaluates the heading cost as the angular difference between the robot’s heading and the target position for a given velocity and angular velocity.

costFunction(): Calculates a total evaluation value by considering the heading cost, obstacle clearance cost, and the cost of the difference between the current velocity and the desired velocity.

3.2. Improvement of Velocity Space

Traditional DWA limits velocity space to the first motion cycle, potentially missing feasible trajectories. In the dynamic window, the improved DWA algorithm expands the reachable velocity range of a single motion cycle to the reachable velocity range of all motion cycles as the velocity space. This improvement can significantly increase the number and breadth of assessable trajectories within each prediction cycle. The following equation is for improving the speed space:

V_{w} = \{{(v_{w}, ω_{w})}^{T}\}

and

\{\begin{cases} v_{w} \in [\max (0, v_{a 0} - a T), \min (v_{\max}, v_{a 0} + a T)] \\ ω_{w} \in [\max (- ω_{\max}, ω_{a 0} - α T), \min (ω_{\max}, ω_{a 0} + α)] \end{cases}

In the above equation,

v_{w}

and

ω_{w}

are the AUV velocity and bow angular velocity in the dynamic window,

v_{a 0}

and

ω_{a 0}

are the initial AUV velocity and bow angular velocity in the window,

a

and

α

are the maximum acceleration of the AUV and the maximum bow angular acceleration,

v_{\max}

is the maximum velocity of the AUV,

T

is the prediction period, max(∙) is the function of maximum value, and min(∙) is the function of minimum value.

After generating the velocity space, samples are taken at a sampling rate of

ls

, and each sampling velocity state

V_{s} = \{v_{s}, ω_{s}\}

is taken as the expected velocity state of a trajectory. Combined with the AUV motion model, trajectories within the prediction period are generated.

3.3. Obstacle Sampling Window

Conventional DWA can stagnate in complex or confined spaces [26], and it is not possible to plan a path to bypass the obstacles. To address this issue, this paper proposes an obstacle sampling window, as shown in Figure 2, where the red asterisk represents the target point and the blue lines represent the sampling lines, which also reflect the sonar-detectable area. The obstacle sampling window is used to determine whether the AUV will encounter concave continuous obstacles and predict if it may lead to the AUV becoming stagnant. The formula for the obstacle sampling window space is as follows:

S_{w} = \{{(θ_{w}, d_{w})}^{T} ∣ θ_{w} \in [0, θ_{r}] \cup (0, 2 π - θ_{r}], d_{w} \in [0, ρ]\}

In the above equation,

θ_{w}

is the sampling angle;

d_{w}

is the sampling distance;

θ_{r}

is the opening angle parameter;

ρ

is the distance parameter.

If the sampling line intersects with the obstacle, it is defined as

l_{o i} = 1

. The occupancy value of the obstacle window is as follows:

F_{a} = \{\begin{array}{l} 1, \sum_{i = 1}^{\frac{2 θ_{r}}{θ_{w}} + 1} | l_{o i} - 1 | = 0 \\ 0, otherwise \end{array}

In the above equation,

l_{o i}

is the occupancy value of the sampling line. When

F_{a} = 1

, if the AUV continues moving in the current direction, it will not be able to avoid the obstacle and will become stagnant.

3.4. Complex Static Obstacle Avoidance Incorporating SAGRRT

In order to overcome DWA’s limitations in complex environments, this section proposes the SAGRRT algorithm. The SAGRRT algorithm, when there is a risk of stagnation for the AUV, uses guidance points to guide DWA in planning the path, thereby escaping complex obstacles and achieving efficient obstacle avoidance.

3.4.1. SAGRRT Static Obstacle Avoidance Guidepost Planning

When there is a risk of stagnation, SAGRRT searches for feasible region nodes, and the nodes are used as local target points

P_{l o c a l goal}

. As shown in Figure 3, all the guidance points are composed of all the connected nodes in the path from

P_{l o c a l goal}

to

P_{n o w}

.

3.4.2. Extraction Rules for Key Guiding Points

This paper proposes a key point selection rule to eliminate redundant guide points. The principle is shown in Figure 4; between the start point and point 7, starting from the starting point, the path between each pair of guide points is checked in turn. If the path does not collide with an obstacle, the previous guide point is redundant. If a collision occurs at a connection, the previous guide point is the critical point, and the checking of subsequent connections continues from that point. Traversing all of the guide points generates the set of critical guide points {3, 4, 6, 7}, as shown in Figure 4.

3.5. Dynamic Obstacle Avoidance Incorporating DAGRRT

The traditional DWA struggles with fast-moving obstacles due to its short-sighted trajectory prediction. In order to solve this problem, this section proposes a dynamic obstacle avoidance method incorporating the DAGRRT algorithm, which introduces the DAGRRT algorithm to plan local guidance points when it is determined that the DWA algorithm cannot effectively avoid dynamic obstacles. The DAGRRT algorithm uses safety guidance points as the target points for DWA, which can plan safer dynamic obstacle avoidance trajectories.

The dynamic obstacle is first inflated to a circle with its volume length as the diameter, and then the forward space of the dynamic obstacle is divided into three regions based on the relative state between the AUV and the dynamic obstacle: the collision risk region, the collision danger region, and the safety region.

If the AUV enters the collision risk area, it means that it has the risk of collision with dynamic obstacles. If the AUV enters the collision risk zone, the current velocity, maximum acceleration, and relative velocity of the AUV to the obstacle need to be taken into account to calculate the minimum distance threshold that should be maintained between the AUV and the obstacle

d_{s}

, which is expressed as follows:

d_{s} = v_{o} [\frac{v_{\max} + v_{o}}{a} + t_{ω} + \sqrt{\frac{2 d_{o}}{a}}] + \frac{v_{\max^{2}} + v_{o}^{2}}{2 a} + |d_{p}|

In the equation,

d_{o}

is the diameter of the dynamic obstacle expansion circle;

d_{p}

is the distance threshold adjustment parameter;

t_{ω}

is the time required for the AUV to complete all steering maneuvers. The expression is as follows:

t_{ω} = \{\begin{array}{l} \frac{3 π}{2 ω} + \frac{2 ω}{α}, α > \frac{π^{2}}{ω} \\ \sqrt{\frac{2 π}{α}} + \sqrt{\frac{4 π}{α}}, otherwise \end{array}

ω

represents the angular velocity of AUV, and

α

represents the angular acceleration of AUV.

When the AUV enters the boundary of the hazardous area, the DAGRRT algorithm plans dynamic obstacle avoidance guide points, and the principle is shown in Figure 5. The search node is discarded if it is within the hazardous area; on the contrary, this node is set as the local goal point

P_{l o c a l goal}

of dynamic obstacle avoidance, and a set of dynamic obstacle avoidance guide points is established by backtracking the path nodes, which is used to guide the planning algorithm to avoid the dynamic obstacles safely.

4. Introduction of DAGRRT Dynamic Obstacle Avoidance

We propose a distributed collision avoidance framework for AUVs using an improved DWA. This framework employs a standardized rule-based approach to ensure safe and efficient multi-AUV operations. This framework can improve the safety and efficiency of collision avoidance planning between multiple AUVs.

4.1. Bump Avoidance Space Division

The space around the AUVs is divided into three different areas as shown in Figure 6: safe space, risk space, and collision space.

R_{s}

,

R_{a}

, and

R_{d}

, respectively, correspond to the boundaries of safety space, risk space, and collision space in the risk space division. When the distance between two AUVs is

d_{a o} \in (R_{s}, + \infty)

, it is in the safe space, and no active collision avoidance is required; when

d_{a o} \in (R_{d}, R_{s}]

, it is in the risk space, and active collision avoidance is performed when there is a risk of collision avoidance; when

d_{a o} \in (R_{d}, R_{a}]

, it is necessary to comply with the rules of collision avoidance; and when

d_{a o} \leq R_{d}

, it is regarded as a collision. By dividing the collision avoidance space and setting the threshold value, the multi-AUV system can significantly improve the safety of collision avoidance while guaranteeing the efficiency of task execution.

4.2. DWA Collision Avoidance Rule Evaluation Function

In this paper, we propose a collision avoidance rule in which an AUV needs to avoid to the right side, and, in Figure 7,

A_{a}

takes the non-collision direction with a clockwise magnitude of

Δ θ_{a v}

in the

θ_{a o}

direction at this time as the base avoidance direction, which is calculated by the following equation:

φ_{n_{i}} = \{\begin{array}{l} θ_{a o_{i}} - Δ θ_{a v}, θ_{a o_{i}} - Δ θ_{a v} > - π \\ 2 π + θ_{a o_{i}} - Δ θ_{a v}, θ_{a o_{i}} - Δ θ_{a v} \leq - π \end{array} (Δ θ_{a v} \leq \frac{π}{4})

In the equation,

θ_{a o_{i}}

is the direction of the i-th AUV to be avoided relative to the AUV

A_{a}

,

φ_{n_{i}}

is the corresponding i-th avoidance direction, and

Δ θ_{a v}

is the avoidance magnitude.

When

A_{a}

needs to avoid multiple AUVs at the same time, a predetermined avoidance direction should be calculated for each potential collision target, from which the largest avoidance direction in the clockwise direction should be selected as the final avoidance direction, as shown in Figure 8. The calculation expression for the optimal avoidance direction is shown as follows:

\begin{array}{l} φ_{o b} = \min (\min (|φ_{r b} - φ_{n_{1}}|, 2 π - |φ_{r b} - φ_{n_{1}}|), \min (|φ_{r b} - φ_{n_{2}}|, 2 π - |φ_{r b} - φ_{n_{2}}|), \\ \dots, \min (|φ_{r b} - φ_{n_{m}}|, 2 π - |φ_{r b} - φ_{n_{m}}|)) \end{array}

and

φ_{r b} = \{\begin{array}{l} φ_{a} - π, φ_{a} \geq 0 \\ - φ_{a}, φ_{a} < 0 \end{array}

φ_{a}

is the bow angle of AUV

A_{a}

.

The closer the end bow of the trajectory is to

φ_{o b}

, the higher the evaluation value of the rule evaluation function is, and the expression of the rule evaluation function is as follows:

avoid (v, ω) = 2 π - \min (|φ_{a_{\frac{T}{τ}}} - φ_{o b}|, 2 π - |φ_{a_{\frac{T}{τ}}} - φ_{o b}|)

In the above equation,

φ_{a_{\frac{T}{τ}}}

is the bow angle at the end of the predicted trajectory of the AUV of this side.

5. Distributed Dynamic Target Hunting by AUVs Based on CPPPEA

This section proposes the CPPPEA (Consensus-Based Polynomial Prediction and Polygonal Encirclement Algorithm), in which the consistency algorithm ensures the reliability of AUV cluster distributed cooperation, the polynomial regression algorithm predicts the trajectory of dynamic targets in real time, providing data support for encirclement, and the standard polygon strategy can achieve the generation of encirclement potential points and efficient encirclement of targets.

5.1. AUV Swarm Consistency Algorithm

We introduce a leader–follower architecture with a robust leader election mechanism to ensure information consistency and fault tolerance within the AUV swarm. The consistency algorithm ensures that even when the leader AUV fails, the cluster of capture tasks can maintain high availability and stability.

A distributed consistency scheme is designed based on BASE theory, as shown in Figure 9. In the distributed swarm, all nodes can be in one of three states: follower, candidate, or leader. The leader node is elected to handle and maintain the consistency of distributed information. The leader periodically sends heartbeat messages to synchronize information within the swarm and maintain its term. If the leader node fails, a new leader is elected, and the term number is incremented. If the previous leader recovers and receives heartbeat data with a higher term number from the new leader, it will automatically revert to the follower state. When selecting a new leader, candidates will first vote for themselves and send a message requesting voting to other nodes in the cluster. When a candidate node receives more than half of the votes, it will be promoted to a leader node, and its term will be increased by 1. This mechanism ensures the information consistency and high availability of the distributed system.

5.2. Formation Method of the Hunting Formation

We propose a novel hunting formation strategy that leverages target trajectory prediction and real-time AUV status to achieve efficient encirclement. This method includes the trajectory prediction of moving targets, the generation of capture base points, the and generation of capture potential points.

5.2.1. Nonlinear Regression Fitting of Moving Target Trajectories

The polynomial regression method is a flexible regression technique that models data by establishing a polynomial relationship between the independent and dependent variables [27]. For the trajectory of a moving target in the ocean over a period of time, the trajectory can be fitted using a univariate polynomial regression model with the following equation:

f (x) = β_{0} + β_{1} x + β_{2} x^{2} + β_{3} x^{3} + \dots β_{n} x^{n}

In the equation,

β

is the coefficient of each term.

For the trajectory fitting of moving targets, the method of decoupling each coordinate by time step is used to reduce the number of polynomials in each dimension in order to improve the fitting accuracy, and the relationship between the target position and the time step can be obtained, which is defined in the model as follows:

x_{i} = β_{0} + β_{1} t_{i} + β_{2} t_{i}^{2} + β_{3} t_{i}^{3} + \dots β_{n} t_{i}^{n}

The model can be written as a system of linear equations since the dataset has multiple steps of data:

\begin{matrix} \vec{X} = A \overset{⇀}{β} \\ \Rightarrow [\begin{matrix} x_{s 1} \\ x_{s 2} \\ x_{s 3} \\ ⋮ \\ x_{s n} \end{matrix}] = [\begin{array}{l} 1 & t_{1} & t_{1}^{2} & \dots & t_{1}^{n} \\ 1 & t_{2} & t_{2}^{2} & \dots & t_{2}^{n} \\ 1 & t_{3} & t_{3}^{2} & \dots & t_{3}^{n} \\ 1 & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & t_{m} & t_{m}^{2} & \dots & t_{m}^{n} \end{array}] [\begin{matrix} \begin{matrix} β_{0} \\ β_{1} \\ β_{2} \\ ⋮ \\ β_{n} \end{matrix} \end{matrix}] \end{matrix}

The vector of polynomial regression coefficients estimated using least squares is as follows:

\hat{\vec{β}} = {(A^{T} A)}^{- 1} A^{T} \overset{⇀}{X}

Assuming that n < m is a necessary condition for matrix invertibility, then, since

A

is a Vandermont matrix, the invertibility condition is guaranteed to hold if all

t_{i}

values are not the same, which is the only least squares solution.

In order to accelerate the speed of parameter convergence, the coordinates of the existing trajectory data are standardized as training data, and the mean

σ

and standard deviation

μ

of all coordinate values in the data are calculated and standardized according to the following equation:

x_{s i} = \frac{x_{i} - μ}{σ}

Let the prediction function be

f_{β} (t) = β_{0} + β_{1} t + β_{2} t^{2} + \dots β_{n} t^{n}

Then the objective function of the regression algorithm is as follows:

E (β) = \frac{1}{2} \sum_{i = 1}^{m} {(x_{s i} - f_{β} (t_{i}))}^{2}

All parameters are updated in a synchronized loop to ensure that the gradient direction remains stable, and the parameter update expression is as follows:

β_{j}^{'} = β_{j} - η \sum_{i = 1}^{m} (f_{β} (t_{i}) - x_{s i}) t_{i}^{j}

In the equation,

η

is the learning rate when the model parameters are updated.

The dataset is approximated by increasing the higher terms of the independent variables until the error is minimized, and, finally, the trajectory equation of the target position with respect to time is generated.

5.2.2. Calculation Method for Hunting Base Point

The hunting base point is defined as the location where the AUV swarm can encounter the moving target in the shortest time. After fitting the trajectory equation of the moving target using a regression method, the target’s motion trajectory is converted into a discrete set of points

{P_{i}}

, with intervals corresponding to the AUV’s motion cycle

τ

. The task is to calculate the earliest point

P_{k}

, where the AUV, starting from its current position

X_{a}

and accelerating with maximum acceleration, can encounter the dynamic target. A decision variable

d_{i}

is introduced to indicate whether the i-th point is selected as the encounter point. The problem can then be further formulated as finding the value of

i

that minimizes

t_{i}

. The formula is as follows:

\begin{array}{l} \min \sum_{i = 1}^{n} d_{i} t_{i} \\ s . t . \sum_{i = 1}^{n} d_{i} = 1 and t_{i} \geq t_{P_{i}} \\ d_{i} \in {0, 1}, \forall i \end{array}

In the equation,

t_{h}

is the time required for the AUV to reach point

t_{P_{i}}

.

The time for the AUV to reach the encounter point is mainly considered to calculate the displacement time

t_{d}

and the steering time

t_{φ}

.

The calculation of

t_{d}

needs to consider the current position

X_{a}

, current speed

v_{a}

, maximum acceleration

a

, and maximum cruise speed

v_{m}

of the AUV, which are calculated separately according to whether the maximum cruise speed can be reached during the voyage and are expressed as follows:

\begin{array}{l} \{\begin{cases} t_{d} = \frac{2 a {‖P_{i} - X_{a}‖}_{2} + {(v_{m} - v_{a})}^{2}}{2 a v_{m}}, {‖P_{i} - X_{a}‖}_{2} > \frac{v_{m}^{2} - v_{a}^{2}}{2 a} \\ t_{d} = \frac{- v_{a} \pm \sqrt{v_{a}^{2} + 2 a {‖P_{i} - X_{a}‖}_{2}}}{a}, otherwise \end{cases} (t_{d} \geq 0) \end{array}

The calculation of

t_{φ}

needs to consider the current bow angle

φ_{a}

, current bow angular velocity

ω_{a}

, maximum bow angular acceleration

α

, and maximum bow angular velocity

ω_{m}

of the AUV, which are calculated according to whether the maximum bow angular velocity can be reached or not during steering, and their expressions are as follows:

\{\begin{cases} t_{φ} = \frac{|ω_{m} - ω_{a}| + ω_{m}}{α} + \frac{|φ_{p_{i}} - φ_{a}| - φ_{m}}{ω_{m}}, |φ_{p_{i}} - φ_{a}| > φ_{m} \\ t_{φ} = \frac{|ω_{m i d} - ω_{a}| + |ω_{m i d}|}{α}, otherwise \end{cases} (t_{φ} \geq 0)

φ_{m} = \frac{2 ω_{m}^{2} + s ω_{a}^{2}}{2}

ω_{m i d} = \sqrt{\frac{2 α |φ_{p_{i}} - φ_{a}| + s ω_{a}^{2}}{2}}

In the equation,

φ_{m}

is the total change of bow direction during acceleration from the current bow angular velocity to the maximum bow angular velocity and deceleration from the maximum bow angular velocity to 0;

ω_{m i d}

is the intermediate bow angular velocity, if it is not possible to reach the maximum bow angular velocity during steering, deceleration is made

ω_{m i d}

when it is reached, and the bow is facing

φ_{p_{i}}

when deceleration is made to 0;

s

is the sign factor.

The time

t_{P_{i}}

to reach point

P_{i}

can be approximated as follows:

t_{P_{i}} = \max (t_{d}, t_{φ})

The above calculation yields the set of fastest encounter points

{p_{K_{j}}}

between each AUV and a moving target and the fastest encounter time

t_{K_{j}}

between each AUV and a moving target. Then the slowest encounter point

P_{h}

is the roundup base point of the AUV cluster.

5.2.3. Moving Target Hunting Formation Method

Collaborative roundup by AUVs aims to construct a polygonal roundup formation centered on the roundup target and formed by the participating AUVs to surround the target. The leader AUV, elected by the coherence algorithm, acts as the roundup planning organizer and is responsible for planning the potential points of other follower AUVs when rounding up the moving target. The method for determining the roundup potential point is demonstrated in Figure 10.

Based on the standard polygon strategy to generate the roundup potential points, the previously calculated roundup base point

P_{h} (x_{h}, y_{h})

is used as the polar coordinate origin, the potential points are assigned on the roundup encirclement circle with radius

r_{h}

, and the angle between neighboring roundup potential points is

θ_{h} = 2 π / m

.

T_{i} (x_{T_{i}}, y_{T_{i}}) (i = 1, 2, \dots, m)

is the potential points around the rounded up target, the set of roundup potential points is

T = {T_{1}, T_{2}, \dots, T_{n}} (i = 1, 2, \dots, m)

, and the formula for roundup point coordinates is as follows:

\{\begin{cases} x_{T_{i}} = x_{h} + r_{h} \cdot \cos (φ_{d} + (i - 1) \frac{2 π}{m}) \\ y_{T_{i}} = y_{h} + r_{h} \cdot \sin (φ_{d} + (i - 1) \frac{2 π}{m}) \end{cases}

In the above equation,

φ_{d}

is the heading of the moving target when it moves to the roundup base

P_{h}

, and

m

is the number of AUVs participating in the roundup mission.

To achieve optimal allocation of capture potential points for each AUV, the motion direction of the moving target is taken as the polar-axis 0-degrees direction, and clockwise direction is taken as the positive direction. Calculate the polar angle

θ_{a_{i}}

of each AUV’s current position. Then, sort each AUV in ascending order of the polar angle to obtain set

A = \{A_{1}, A_{2}, A_{3}, \dots, A_{i}\} (i = 1, 2, \dots, m)

. Correspond the elements in sets

A

and

T

one by one in order, and the

T_{i}

element corresponding to each AUV is the encirclement potential point that the AUV needs to reach.

The maximum radius of the rounding potential point is

r_{p}

, the maximum radius of the target to be rounded up is

r_{d}

, the safety distance between AUVs is set to

d_{s}

, and conditions

d_{s} > r_{p}

,

d_{s} > r_{d}

need to be set in order to ensure the safety of the rounding up task in which the radius of the rounding potential point

r_{p}

and the arc length between neighboring potential points

l_{p}

need to meet the following requirements:

\{\begin{cases} 2 r_{p} + β_{1} d_{s} \leq l_{p} \\ r_{d} + r_{p} + β_{2} d_{s} \leq r_{h} \end{cases} (β_{1}, β_{2} \geq 1)

β_{1}

and

β_{2}

are safe distance factors.

Due to the constraint of arc length

l_{p}

, the distance between the AUVs and the moving target will limit the size of the roundup radius so the number of AUVs participating in the roundup will be limited. Assuming that there are

m

AUVs, the roundup radius is

m l_{p} = 2 π d_{h}

, and solving for the minimum roundup radius based on the conditions of the above inequality yields the relationship between the number of AUVs and each parameter as follows:

m = [\frac{2 π (r_{p} + r_{d} + β_{2} d_{s})}{2 r_{p} + β_{1} d_{s}}]

When the AUV clusters enter the state of predetermined conversion to a roundup formation, they will move toward their respective designated roundup potential points to form an effective roundup formation, as shown in Figure 11.

6. Simulation Results

In order to validate the proposed method and its adaptability and efficiency under different conditions, a series of simulation experiments are carried out to set up a total of five AUVs within the mission range of 500 m × 500 m to participate in the encirclement mission and to encircle the moving target as an enemy AUV. The moving target is driven to the target point before this if the AUV cluster forms the formation of encirclement, and then encirclement will be successful, or else it is regarded as a failure of the encirclement.

6.1. Simulation of Target Hunting Task When the Moving Target’s Speed Is Less than the AUV’s

In conventional hunting methods, a prerequisite for successful hunting is that the hunter’s (AUV’s) speed must be greater than that of the target. If this condition is not met, the success rate of the hunting task will be significantly reduced. To address this issue, this section presents a simulation experiment aimed at comparing the effectiveness of the traditional tracking hunting method with the hunting method proposed in this paper for capturing a moving target in an environment with obstacles. The initial states of the AUV and the moving target in the experiment are detailed in Table 1.

The simulation results are shown in Figure 12, visually illustrating the differences in the motion trajectories between the traditional tracking hunting method and the hunting method proposed in this paper. The proposed method performs excellently in reducing redundant travel paths and is able to form an effective hunting formation more quickly. Through the quantitative analysis of the simulation data, and, in conjunction with the results in Table 2, it is evident that the proposed method significantly outperforms the traditional tracking hunting method in key performance indicators such as travel distanceand turning angle cost.

6.2. Simulation of the Hunting Task When the Moving Target’s Speed Equals the AUV’s

This section tests the adaptability and effectiveness of the proposed hunting method under the challenge of equal speeds, breaking the traditional condition where the hunter’s speed exceeds that of the target. The test is conducted under the scenario where the moving target’s speed equals the AUV’s speed. This setup aims to explore the performance of the traditional tracking hunting method and the proposed hunting method when the hunting AUV has no speed advantage. The initial states of the AUV and the moving target in the experiment are detailed in Table 3.

The simulation results, as shown in Figure 13, demonstrate the challenges faced by the traditional tracking hunting method under the condition where the hunting AUV and the moving target have the same speed. Specifically, when the AUV needs to perform maneuvering and obstacle avoidance, it is unable to catch up with the moving target again, preventing the formation of an effective hunting formation and ultimately leading to a failure of the hunting task. In contrast, the method proposed in this paper breaks the traditional limitation that the hunter’s speed must be greater than that of the target. By pre-planning hunting positions and adjusting strategies in real time, it effectively overcomes the hunting challenge under equal speed conditions. Based on the analysis of various indicators in Table 4, the key to the success of the proposed algorithm in executing the hunting task lies in its ability to rapidly form an effective hunting formation with relatively low motion and adjustment costs.

6.3. Simulation of Hunting Task When the Leader Fails

To test the availability of the consistency algorithm proposed in this paper when facing leader failure, this section simulates the failure of the leader AUV and a randomly selected AUV during the hunting process, causing them to be unable to maintain the original course. Additionally, a scenario where the moving target’s speed equals the AUV’s speed is set up for testing. The aim is to evaluate the usability of the proposed hunting method under the complex condition of no speed advantage and leader failure. The initial setup for the experiment is detailed in Table 5.

As shown in Figure 14a, the hunting task swarm that does not use the consistency algorithm proposed in this paper faces difficulty in effectively updating decisions and synchronizing information after the leader node fails. As a result, the swarm can only continue the hunting task based on the potential point allocation before the failure. Unable to adjust in time to match the actual trajectory of the moving target and failing to form an effective hunting formation, this situation ultimately leads to the failure of the hunting task. In contrast, as shown in Figure 14b, the hunting task swarm that applies the consistency algorithm proposed in this paper is able to adapt to changes by re-electing a leader and making task decisions after the leader node fails. By setting up a reformation strategy for the AUVs in normal operation and reallocating the hunting potential points, the continuity of the task is ensured. Ultimately, three functioning AUVs cooperatively completed the effective interception of the target and the formation of the hunting formation. Based on the analysis of the various indicators in Table 6, the proposed algorithm can ensure the completion of the hunting task even when a few AUVs experience failures.

6.4. Simulation of Hunting Task in a Complex Obstacle Environment

In this section’s hunting experiment, an environment with concave obstacles, dynamic obstacles, and trap areas is set up to comprehensively test the effectiveness of the motion planning algorithm and hunting method proposed in this paper. The initial setup for the experiment is detailed in Table 7.

The simulation results, as shown in Figure 15, demonstrate that when the AUVs use the traditional DWA for hunting motion planning, they face several challenges. Due to the lack of foresight, AUV1 and AUV4 become trapped in local areas when encountering concave obstacles with trap zones, while AUV2, with a relatively high relative speed, is unable to avoid a dynamic obstacle in time, resulting in a collision. As a result, the number of remaining AUVs capable of completing the hunting task is insufficient to form an effective hunting formation. In contrast, the motion planning algorithm based on the improved DWA proposed in this paper is able to bypass local obstacle areas and avoid dynamic obstacles in time. After obstacle avoidance, the algorithm adjusts the strategy in real time, ensuring that all AUVs successfully avoid obstacles and can form an effective hunting formation. Based on the analysis of the various indicators in Table 8, the proposed algorithm can accomplish the hunting task in environments with complex static and dynamic obstacles.

7. Conclusions

This paper investigates the dynamic target hunting problem under AUV motion planning based on the improved DWA. Firstly, by optimizing the speed space and evaluation function of the DWA algorithm and incorporating an obstacle detection window and integrating SAGRRT planning for static obstacle guidance points, the issue of DWA stagnating due to the evaluation value falling into local optima is effectively addressed. At the same time, after establishing a collision risk zone, dynamic obstacle avoidance guidance points are planned through DAGRRT to guide the DWA motion planning, enabling dynamic obstacle avoidance.

Secondly, a dynamic collision avoidance algorithm based on consistency collision avoidance rules is proposed, and the collision avoidance rules are integrated into the DWA evaluation system. The rule evaluation function is used to select motion trajectories that comply with the collision avoidance rules, significantly improving the safety and efficiency of distributed AUV collision avoidance.

Next, a consistency algorithm is introduced to ensure the consistency of multi-AUV information and the continuity of tasks in the case of leader failure. Polynomial regression is used to predict the moving target’s trajectory, hunting potential points are dynamically allocated based on a polygonal hunting formation, and each AUV’s distributed motion planning is combined to form the hunting formation.

Finally, simulations verify that this series of systematic methods can effectively enhance the usability and efficiency of multi-AUV distributed dynamic target hunting. In fact, underwater communication issues have a significant impact on AUV collaborative operations, and this aspect should be considered in future research.

Author Contributions

Conceptualization, J.L. and H.L.; methodology, J.L.; software, J.L.; validation, J.L. and H.L.; formal analysis, J.L.; investigation, J.L. and H.L.; resources, J.L.; data curation, H.Z.; writing—original draft preparation, J.L.; writing—review and editing, Z.Z. and H.L.; visualization, H.Z. and H.L.; supervision, J.L.; project administration, J.L.; funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This search work was supported by the National Natural Science Foundation of China (Grant No. 5217110503), the Research Fund from Science and Technology on Underwater Vehicle Technology (grant No. JCKYS2021SXJQR-09), and the Natural Science Foundation of Shandong Province (grant No. ZR202103070036).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Wang, L.; Zhu, D.; Pang, W.; Zhang, Y. A survey of underwater search for multi-target using Multi-AUV: Task allocation, path planning, and formation control. Ocean. Eng. 2023, 278, 114393. [Google Scholar] [CrossRef]
Cheng, C.; Sha, Q.; He, B.; Li, G. Path planning and obstacle avoidance for AUV: A review. Ocean. Eng. 2021, 235, 109355. [Google Scholar] [CrossRef]
Hao, K.; Zhao, J.; Li, Z.; Liu, Y.; Zhao, L. Dynamic path planning of a three-dimensional underwater AUV based on an adaptive genetic algorithm. Ocean. Eng. 2022, 263, 112421. [Google Scholar] [CrossRef]
Xiang, X.; Yu, C.; Zhang, Q. On intelligent risk analysis and critical decision of underwater robotic vehicle. Ocean. Eng. 2017, 140, 453–465. [Google Scholar] [CrossRef]
Zhang, W.; Wang, N.; Wu, W. A hybrid path planning algorithm considering AUV dynamic constraints based on improved A* algorithm and APF algorithm. Ocean. Eng. 2023, 285, 115333. [Google Scholar] [CrossRef]
Liu, T.; Zhao, J.; Huang, J.; Li, Z. A hybrid RVO-MPPI approach for efficient collision avoidance for multiple autonomous underwater vehicles. Ocean. Eng. 2024, 312, 119205. [Google Scholar] [CrossRef]
Zhang, J.; Zhu, Z.; Xue, Y.; Deng, Z.; Qin, H. Local path planning of under-actuated AUV based on VADWA considering dynamic model. Ocean. Eng. 2024, 310, 118705. [Google Scholar] [CrossRef]
Rösmann, C.; Feiten, W.; Wösch, T.; Hoffmann, F.; Bertram, T. Trajectory modification considering dynamic constraints of autonomous robots. In Proceedings of the ROBOTIK 2012—7th German Conference on Robotics, Munich, Germany, 22–25 May 2012; pp. 74–79. [Google Scholar]
Guo, Y.; Liu, H.; Fan, X.; Lyu, W. Research progress of path planning methods for autonomous underwater vehicle. Math. Probl. Eng. 2021, 2021, 8847863. [Google Scholar] [CrossRef]
Molinos, E.J.; Llamazares, A.; Ocaña, M. Dynamic window based approaches for avoiding obstacles in moving. Robot. Auton. Syst. 2019, 118, 112–130. [Google Scholar] [CrossRef]
Liu, Y.; Wang, C.; Wu, H.; Wei, Y. Mobile Robot Path Planning Based on Kinematically Constrained A-Star Algorithm and DWA Fusion Algorithm. Mathematics 2023, 11, 4552. [Google Scholar] [CrossRef]
Song, B.; Tang, S.; Li, Y. A new path planning strategy integrating improved ACO and DWA algorithms for mobile robots in dynamic environments. Math. Biosci. Eng. MBE 2024, 21, 2189–2211. [Google Scholar] [CrossRef] [PubMed]
Xu, T. Recent advances in Rapidly-exploring random tree: A review. Heliyon 2024, 10, e32451. [Google Scholar] [CrossRef] [PubMed]
Hu, W.; Chen, S.; Liu, Z.; Luo, X.; Xu, J. HA-RRT: A heuristic and adaptive RRT algorithm for ship path planning. Ocean. Eng. 2024, 316, 119906. [Google Scholar] [CrossRef]
Li, J.; Zhang, Z. AUV Local Path Planning Based on Fusion of Improved DWA and RRT Algorithms. In Proceedings of the ICMA 2023—2023 IEEE International Conference on Mechatronics and Automation, Harbin, China, 6–9 August 2023; pp. 935–941. [Google Scholar]
Zhu, Y.; Liu, J.; Guo, C.; Song, P.; Zhang, J.; Zhu, J. Prediction of Battlefield Target Trajectory Based on LSTM. In Proceedings of the ICCA 2020 IEEE International Conference on Control and Automation, Singapore, 9–11 October 2020; pp. 725–730. [Google Scholar]
Wang, H.; Wang, X.; Yu, L.; Zhong, F. Design of Mean Shift Tracking Algorithm Based on Target Position Prediction. In Proceedings of the ICMA 2019—2019 IEEE International Conference on Mechatronics and Automation, Tianjin, China, 4–7 August 2019; pp. 1114–1119. [Google Scholar]
Zhang, R.; Wang, Y.; Wan, X.; Ming, Y.; Yang, S. Position prediction of underwater gliders based on a new heterogeneous model ensemble method. Ocean. Eng. 2024, 309, 118312. [Google Scholar] [CrossRef]
Meng, X.; Sun, B.; Zhu, D. Harbour protection: Moving invasion target interception for multi-AUV based on prediction planning interception method. Ocean. Eng. 2021, 219, 108268. [Google Scholar] [CrossRef]
Lan, Y. Multiple mobile robot cooperative target intercept with local coordination. In Proceedings of the CCDC 2012—2012 24th Chinese Control and Decision Conference, Taiyuan, China, 23–25 May 2012; pp. 145–151. [Google Scholar]
Tahir, M.N.; Ren, Z. Maneuvering target interception using retrospective-cost-based adaptive input and state estimation. In Proceedings of the CGNCC 2016—2016 IEEE Chinese Guidance, Navigation and Control Conference, Nanjing, China, 12–14 August 2016; pp. 603–608. [Google Scholar]
Refsnes, J.E.; Sorensen, A.J.; Pettersen, K.Y. Model-based output feedback control of slender-body underactuated AUVs: Theory and experiments. IEEE Trans. Control. Syst. Technol. 2008, 16, 930–946. [Google Scholar] [CrossRef]
Cao, X.; Sun, H.; Jan, G.E. Multi-AUV cooperative target search and tracking in unknown underwater environment. Ocean. Eng. 2018, 150, 1–11. [Google Scholar] [CrossRef]
Han, L.; Wu, X.; Sun, X. Hybrid path planning algorithm for mobile robot based on A* algorithm fused with DWA. In Proceedings of the ICIBA 2023—2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence, Chongqing, China, 26–28 May 2023; pp. 1465–1469. [Google Scholar]
Wu, B.; Chi, X.; Zhao, C.; Zhang, W.; Lu, Y.; Jiang, D. Dynamic Path Planning for Forklift AGV Based on Smoothing A* and Improved DWA Hybrid Algorithm. Sensors 2022, 22, 7079. [Google Scholar] [CrossRef] [PubMed]
Zhong, X.; Tian, J.; Hu, H.; Peng, X. Hybrid Path Planning Based on Safe A* Algorithm and Adaptive Window Approach for Mobile Robot in Large-Scale Dynamic Environment. J. Intell. Robot. Syst. Theory Appl. 2020, 99, 65–77. [Google Scholar] [CrossRef]
Yu, Q.; Lin, Q.; Zhu, Z.; Wong, K.C.; Coello, C.A. A dynamic multi-objective evolutionary algorithm based on polynomial regression and adaptive clustering. Swarm Evol. Comput. 2022, 71, 101075. [Google Scholar] [CrossRef]

Figure 1. Sonar sensor model.

Figure 2. Obstacle sampling window.

Figure 3. Global target points linearly connected nodes as SAGRRT local target points.

Figure 4. Extract key guiding points.

Figure 5. Introduction of DAGRRT dynamic obstacle avoidance.

Figure 6. Bump avoidance space division.

Figure 7. Baseline avoidance directions and evaluation intervals.

Figure 8. Multi-avoidance condition baseline avoidance direction and evaluation interval.

Figure 9. Consistency algorithm.

Figure 10. Distribution of the potential points in the hunting process.

Figure 11. Diagram of hunting formation and maintenance.

Figure 12. (a) Simulation of tracking hunting method when the moving target’s speed is less than the AUV’s; (b) simulation of tracking hunting method when the moving target’s speed is less than the AUV’s.

Figure 13. (a) Simulation of tracking hunting method when the moving target’s speed equals the AUV’s; (b) simulation of the hunting method proposed in this paper when the moving target’s speed equals the AUV’s.

Figure 14. (a) Simulation of hunting without using the consistency algorithm when a failure occurs; (b) simulation of hunting using the consistency algorithm proposed in this paper when a failure occurs.

Figure 15. (a) Simulation of hunting motion planning using traditional DWA; (b) simulation of hunting motion planning using traditional DWA.

Table 1. Initial state of AUV and moving target.

Mission Elements	Initial Position/m	Bow Angle /rad	Maximum Speed/kn
AUV1	50,250	−0.1	4
AUV2	11,090	0.7	4
AUV3	25,050	1.5	4
AUV4	410,110	2.4	4
AUV5	450,290	−2.8	4
Moving target	450,390	−2.5	2

Table 2. Simulation results of hunting motion planning when the moving target’s speed is less than the AUV’s.

Hunting Method	Hunting Success	Travel Distance/m	Turning Cost/rad
Tracking hunting	Yes	1692.3	126.7
The proposed method	Yes	1123.8	36.1

Table 3. Initial state of AUV and moving target.

Mission Elements	Initial Position/m	Bow Angle /rad	Maximum Speed/kn
AUV1	50,250	−0.1	4
AUV2	11,090	0.7	4
AUV3	25,050	1.5	4
AUV4	410,110	2.4	4
AUV5	450,290	−2.8	4
Moving target	450,390	−2.5	4

Table 4. Simulation results of hunting motion planning when the moving target’s speed equals the AUV’s.

Hunting Method	Hunting Success	Travel Distance/m	Turning Cost/rad
Tracking hunting	No	2088.5	63.9
The proposed method	Yes	976.6	44.7

Table 5. Initial state of AUV and moving target.

Mission Elements	Initial Position/m	Bow Angle /rad	Maximum Speed/kn
AUV1	50,230	0	4
AUV2	17,050	1.1	4
AUV3	29,050	1.7	4
AUV4	43,070	2.4	4
AUV5	450,290	−2.8	4
Moving target	410,230	−2.2	4

Table 6. Simulation results of hunting motion planning when a failure occurs.

Hunting Method	Hunting Success	Travel Distance/m	Turning Cost/rad
Tracking hunting	No	707.6	20.5
The proposed method	Yes	796.6	22.8

Table 7. Initial states of the AUV, moving target, and dynamic obstacle.

Mission Elements	Initial Position/m	Bow Angle /rad	Initial Speed /kn	Maximum Speed/kn
AUV1	30,250	−0.1	0	4
AUV2	7070	0.7	0	4
AUV3	25,070	1.4	0	4
AUV4	410,110	2.4	0	4
AUV5	450,310	−2.7	0	4
Moving target	390,430	−2.2	0	2
Dynamic obstacle	210,230	−2.3	11	11

Table 8. Simulation results of hunting motion planning in a complex obstacle environment.

Hunting Method	Hunting Success	Travel Distance/m	Turning Cost/rad
DWA + tracking hunting	No	1835.9	178.9
The proposed method	Yes	1346.2	44.3

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, J.; Lu, H.; Zhang, H.; Zhang, Z. Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA). J. Mar. Sci. Eng. 2025, 13, 221. https://doi.org/10.3390/jmse13020221

AMA Style

Li J, Lu H, Zhang H, Zhang Z. Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA). Journal of Marine Science and Engineering. 2025; 13(2):221. https://doi.org/10.3390/jmse13020221

Chicago/Turabian Style

Li, Juan, Houtong Lu, Honghan Zhang, and Zihao Zhang. 2025. "Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA)" Journal of Marine Science and Engineering 13, no. 2: 221. https://doi.org/10.3390/jmse13020221

APA Style

Li, J., Lu, H., Zhang, H., & Zhang, Z. (2025). Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA). Journal of Marine Science and Engineering, 13(2), 221. https://doi.org/10.3390/jmse13020221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dynamic Target Hunting Under Autonomous Underwater Vehicle (AUV) Motion Planning Based on Improved Dynamic Window Approach (DWA)

Abstract

1. Introduction

2. Task Element Modeling

2.1. AUV Kinematic Modeling

2.2. Forward-Looking Sonar Detection Model

2.3. Static Obstacle Modeling

2.4. Dynamic Obstacle Modeling

3. Obstacle Avoidance Motion Planning for AUV Based on DWA and RRT

3.1. Basic DWA Algorithm

3.2. Improvement of Velocity Space

3.3. Obstacle Sampling Window

3.4. Complex Static Obstacle Avoidance Incorporating SAGRRT

3.4.1. SAGRRT Static Obstacle Avoidance Guidepost Planning

3.4.2. Extraction Rules for Key Guiding Points

3.5. Dynamic Obstacle Avoidance Incorporating DAGRRT

4. Introduction of DAGRRT Dynamic Obstacle Avoidance

4.1. Bump Avoidance Space Division

4.2. DWA Collision Avoidance Rule Evaluation Function

5. Distributed Dynamic Target Hunting by AUVs Based on CPPPEA

5.1. AUV Swarm Consistency Algorithm

5.2. Formation Method of the Hunting Formation

5.2.1. Nonlinear Regression Fitting of Moving Target Trajectories

5.2.2. Calculation Method for Hunting Base Point

5.2.3. Moving Target Hunting Formation Method

6. Simulation Results

6.1. Simulation of Target Hunting Task When the Moving Target’s Speed Is Less than the AUV’s

6.2. Simulation of the Hunting Task When the Moving Target’s Speed Equals the AUV’s

6.3. Simulation of Hunting Task When the Leader Fails

6.4. Simulation of Hunting Task in a Complex Obstacle Environment

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI