Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments

Chen, Tao; Wang, Kai; Wang, Qingzhe

doi:10.3390/jmse13122221

Open AccessArticle

Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments

by

Tao Chen

,

Kai Wang

and

Qingzhe Wang

^*

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(12), 2221; https://doi.org/10.3390/jmse13122221

Submission received: 18 October 2025 / Revised: 19 November 2025 / Accepted: 20 November 2025 / Published: 21 November 2025

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Versions Notes

Abstract

Formation rendezvous is a critical phase during the deployment or recovery of multiple unmanned underwater vehicles (UUVs) in cooperative missions, and represents one of the core problems in multi-UUV cooperative planning. In practical marine environments with obstacles and currents, multiple constraints must be simultaneously satisfied, including obstacle avoidance, inter-UUV collision prevention, kinematic limitations, and specified initial and terminal states. These requirements make energy-optimal trajectory planning for multi-UUV formation rendezvous highly challenging. Traditional integrated cooperative planning methods often struggle to obtain optimal or even feasible solutions due to the complexity of constraints and the vastness of the solution space. To address these issues, a dual-layer planning framework for multi-UUV formation rendezvous trajectory planning in environments with obstacles and currents is proposed in this paper. The framework consists of an initial individual trajectory planning layer and a secondary cooperative planning layer. In the initial individual trajectory planning stage, the Grey Wolf Optimization (GWO) algorithm is employed to optimize high-order terms of polynomial curves, generating initial trajectories for individual UUVs that satisfy obstacle avoidance, kinematic constraints, and state requirements. These trajectories are then used as inputs to the secondary cooperative planning stage. In the cooperative stage, a Self-Adaptive Particle Swarm Optimization (SAPSO) is introduced to explicitly address inter-UUV collision avoidance while incorporating all individual constraints, ultimately producing a cooperative rendezvous trajectory that minimizes overall energy consumption. To validate the effectiveness of the proposed method, a simulation environment incorporating vortex flow fields and real-world island topography was constructed. Simulation results demonstrate that the proposed hierarchical trajectory planning method is capable of generating energy-optimal formation rendezvous trajectories that satisfy multiple constraints for multi-UUV systems in environments with obstacles and ocean currents, highlighting its strong potential for practical engineering applications.

Keywords:

UUVs; formation rendezvous; trajectory planning; energy consumption optimization

1. Introduction

In recent years, with the continued expansion of fields such as exploration, resource development, and underwater search [1], operations involving UUVs [2] have become a prominent research focus. Formation rendezvous constitutes a critical phase during the deployment or recovery stages of cooperative UUV missions and represents one of the fundamental challenges in UUVs cooperative planning. The trajectory planning for multi-UUV formation rendezvous [3] aims to generate trajectories for a group of initially dispersed UUVs from their respective starting points to a designated rendezvous location. This process requires all UUVs to arrive at the target point simultaneously at a specified time, achieve the desired formation upon convergence, and maintain consistency in motion states such as heading and velocity.

Regarding cooperative trajectory or path planning for multiple unmanned systems—such as UUVs, Unmanned Surface Vehicles (USVs), Unmanned Aerial Vehicles (UAVs), and Unmanned Ground Vehicles (UGVs)—current research efforts have predominantly concentrated on constraint management, optimization objectives, and the effective exploitation of environmental information.

Satisfying constraint conditions represents the primary consideration in formation rendezvous trajectory or path planning. These constraints typically encompass environmental limitations, vehicle-specific physical restrictions, and inter-agent relational requirements. Regarding obstacle avoidance constraints involving both static and dynamic obstacles during planning. Zhang et al. [4] proposed a two-phase UUVs rendezvous path planning method. Their approach initially employs a fusion of artificial potential field and fish swarm trust search algorithms for global planning to identify critical sub-goal points. Subsequently, for addressing unknown threats, a bidirectional Rapidly-exploring Random Tree Star (RRT*) algorithm incorporating incomplete constraint rolling optimization is adopted for local real-time replanning. Addressing UUVs cooperative path planning and collision avoidance in obstacle-rich environments. Wu et al. [5] developed a novel algorithm integrating Artificial Potential Field (APF) with A* search. By introducing inter-vehicle repulsive forces and temporary virtual target points, their method effectively resolves the local minimum problem inherent to traditional Artificial Potential Field (APF) methods, while leveraging the A* algorithm to ensure path feasibility in complex obstacle configurations. For constrained navigation through narrow spatial passages, Xing et al. [6] presented an enhanced deep reinforcement learning scheme. Their solution incorporates an LSTM-enhanced MATD3 network, a novel dense reward function, and hierarchical training mechanism, significantly improving formation adaptability for UAVs. To address multi-task point access problems constrained by complex urban road networks and diversified mission durations, Yang et al. [7] developed an Ant Colony Optimization (ACO) based coordinated path planning method for truck-mounted UAV swarms, incorporating kinematic constraints and mission time limitations. Their methodology first computes optimal UGV paths satisfying road network constraints, then discretizes them into docking points where optimal paths are planned for UAV swarms to visit multiple mission points. Concerning inter-agent constraints within clusters, Ma et al. [8] established an online cooperative obstacle avoidance path planning method for AUVs. Their approach considers both temporal and spatial coordination aspects by establishing information sharing mechanisms to obtain real-time path data for coordination requirement assessment, with specifically formulated cooperative avoidance strategies ensuring spatiotemporal safety constraints among Autonomous Underwater Vehicle (AUV) clusters. Under comprehensive constraints including terrain threats, radar exposure, UAV turning angles, maximum flight distance, and altitude limitations, Geng et al. [9] implemented crucial improvements to traditional Particle Swarm Optimization (PSO) by introducing opposition-based learning mechanisms. This strategy significantly enhances algorithmic search efficiency and global exploration capability, enabling rapid convergence to superior flight paths while strictly satisfying multiple complex constraints.

The objectives of cooperative path and trajectory planning vary significantly depending on the specific mission context [10]. Y. Volkan et al. [11] proposed an improved Genetic Algorithm (GA) solution. Their approach incorporated revisit time intervals and multiple runway strategies to enhance planning efficiency and feasibility in complex scenarios. With optimization objectives focusing on path length under constraints including minimum and maximum flight range, altitude limitations, and maximum turning angle. Lu et al. [12] developed an Adaptive Differential Evolution (ADE) method to address modeling complexity and computational intensity in UAVs cooperative trajectory planning. For cooperative implementation, they adopted a distributed framework where PSO-ADE handles individual trajectory planning tasks, with trajectory feedback incorporated into the PSO velocity update formula to guide particle movement direction through cooperative path optimization. Luo et al. [13]. proposed a self-adaptive optimization method considering key objectives including flight duration, path length, and energy consumption. Their methodology first constructs a multi-objective planning model centered on these core parameters, then designs an adaptive optimization algorithm enabling UAVs to autonomously select optimal paths through real-time adjustment of search step sizes and weight distributions. Anikó Kopacz et al. [14], for the Multi Robot Path Planning (MPP) problem, collision free obstacles and collisions are used as constraints, and the shortest path is also taken as the optimization objective. A hybrid method combining A adaptive path planning and local search has been proposed. On the basis of ensuring the optimality of path search, this method extends to multi-agent scenarios through heuristic strategies to plan collision free optimal paths for multiple robots in a static environment. This study focuses on static conditions and does not consider other dynamic environments. V. H. A. Nguyen et al. [15], take the length of the entire path as the optimization objective and add an optimization term for the smoothness of the path. Propose an improved RRT algorithm that combines the advantages of RRT PSO and Informed RRT*, and adopts a trapezoidal turning optimization strategy to replace path corners, significantly improving path smoothness and robot motion feasibility. Similarly, it is based on a completely static environment. In multi-system cooperative planning, the strategic utilization of environmental elements as active resources has emerged as a novel research direction. For AUV operations in current environments, Gong et al. [16] modeled current influences as longitudinal and lateral velocity components affecting AUV navigation, while incorporating trajectory length into energy consumption optimization objectives. Their approach employs ant colony optimization to resolve multi-trajectory planning problems, enabling flexible trajectory selection for AUVs according to specific mission metrics. Addressing the energy disparity challenge in airborne UAV formation rendezvous caused by varying initial potential energy under diverse wind conditions, Wang et al. [17] proposed a wind energy harvesting strategy integrated into collective trajectory planning. Through analysis of tailwind, headwind, no-wind, and crosswind scenarios, their method enhances solution efficiency via simplified collocation and intelligent initialization strategies, enabling individual members to compensate for energy costs through independent wind energy harvesting while planning respective trajectories. For AUVs cooperative operations in marine environments with multiple vortex currents and obstacles, Li et al. [18] developed a hybrid algorithmic framework combining K-means preprocessing, distributed auction-based task allocation, and Deep Neural Network (DNN) path planning. This methodology integrates path length with current compliance to modulate neuronal activity intensity during post-allocation path planning, significantly improving mission execution effectiveness in complex scenarios. Wang et al. [19] proposed a method for assigning intersection points and planning formation intersection trajectories based on dynamic parameter particle swarm optimization (DPPSO) to optimize polynomial trajectories has been proposed. Taking into account kinematic constraints, collision avoidance constraints between clusters, and other constraints, the energy optimization objective is to minimize the cumulative trajectory length. However, obstacle avoidance constraints and energy consumption requirements of ocean currents were not included in the evaluation of trajectory planning. Shao [20] et al. To meet the kinematic constraints of maximum curvature and continuous path curvature for unmanned aerial vehicles (UAVs), and considering collision avoidance constraints between obstacles and clusters, a path planning method based on pH curve was proposed. The distributed collaborative particle swarm optimization (DCPSO) algorithm with elite preservation strategy was adopted to generate a safe and flight path for each UAV.

Previous studies have rarely comprehensively considered multiple constraint conditions and utilized environmental flow fields to achieve energy optimal collaborative trajectory or path planning. At the same time, when various constraints and optimization terms are coupled with each other, there is room for improvement in this area of research To address this gap, this paper proposes a dual-layer planning framework for multi-UUV formation rendezvous trajectory planning in environments with obstacles and currents. This method decouples the complex multi-UUV trajectory planning problem into two sequential phases: initial individual trajectory planning layer and a secondary cooperative planning layer, effectively separating individual and collaborative trajectory planning, ultimately generating formation rendezvous trajectories that satisfy all constraints while achieving optimal energy consumption.

The remainder of this paper is organized as follows: Section 2 formulates the multi-UUV formation rendezvous problem and specifies the corresponding constraints and optimization objectives. Section 3 presents the core methodology of this work—the dual-layer planning framework. Section 4 provides a detailed exposition of the initial individual trajectory planning approach. Section 5 introduces the secondary cooperative trajectory planning approach. Section 6 presents and analyzes the simulation results. Finally, Section 7 concludes the paper with a summary and concluding remarks.

2. Problem Statement

2.1. Preliminaries

This study investigates the trajectory planning problem for formation rendezvous of UUVs in environments with obstacles and currents. Consider a UUV numbered

i

with an initial state

q_{i S} = (x_{i S}, y_{i S}, u_{i S}, v_{i S}, ψ_{i S})

at the initial point and a terminal state

q_{i E} = (x_{i E}, y_{i E}, u_{E}, v_{E}, ψ_{E})

at the terminal point, where

(x, y)

denotes the UUV position,

(u, v)

represents the surge and sway velocity, and

ψ

denotes the heading angle. The trajectory planning problem involves determining a set of trajectories

γ_{i} (t)

connecting the initial and terminal points, which is mathematically formulated as:

q_{i S} = (x_{i S}, y_{i S}, u_{i S}, v_{i S}, ψ_{i S}) \overset{∐ γ_{i} (t)}{\to} q_{i E} = (x_{i E}, y_{i E}, u_{E}, v_{E}, ψ_{E})

(1)

where

∐

represents the constraints during the formation rendezvous process, including initial and terminal state constraints, obstacle avoidance constraints, inter-UUV collision avoidance constraints, and kinematic constraints. As illustrated in Figure 1, the trajectory planning problem is exemplified using a system of three UUVs operating in an environment with multiple obstacles and currents. The blue arrow in Figure 1 represents the direction of the current, the red curve represents the trajectory of the UUV, Obs represents obstacles in the environment, the green line represents the minimum distance, and

d_{s a f e}

represents the safe distance of the UUV. UUV1, UUV2, and UUV3 are initially located at distinct initial points with different initial states

q_{1 S}

,

q_{2 S}

, and

q_{3 S}

. After rendezvous time

T

, they arrive at their respective rendezvous points

(x_{1} (T), y_{1} (T))

,

(x_{2} (T), y_{2} (T))

, and

(x_{3} (T), y_{3} (T))

, achieving consistent and identical motion states

(u_{E}, v_{E}, ψ_{E})

.

2.2. Constraints

The trajectory planning for multi-UUV formation rendezvous must simultaneously satisfy multiple stringent constraints, which can be categorized into the following key aspects:

(1): Formation rendezvous initial and terminal state constraints

In UUVs formation rendezvous, the initial states of individual UUVs are typically dispersed. The rendezvous process requires all UUVs to reach their designated rendezvous points within the specified time

T

while achieving consensus in velocity vectors and heading angles to form the prescribed formation configuration [21].

(2): Obstacle avoidance

In environments containing obstacles and currents, although obstacles exhibit diverse geometries, they can all be approximated using finite grids [22]. During the UUVs formation rendezvous process, no trajectory points

p_{i} (t)

of any UUV shall intersect with obstacle grids, which can be formulated as:

\forall t \in [0, T], p_{i} (t) \in W, p_{i} (t) \notin \underset{(m, n) \in ο}{\cup} G_{m, n}

(2)

In the mathematical formulation,

ο

denotes the complete obstacle index set,

W

defines the entire map domain boundaries, and

G_{m, n}

specifically represents the precise grid region corresponding to index

(m, n)

, formally as

G_{m, n} = [m Δ x, (m + 1) Δ x)] \times [n Δ y, (n + 1) Δ y)]

.

(3): Inter-UUV collision avoidance constraints

During the UUVs formation rendezvous process, collision risks between UUVs must be thoroughly considered [23]. Given the formation rendezvous UUV set

U

, at any time instant

t

, the position of UUVs is

p_{i} (t)

, the distance between any two distinct UUVs

i

and

j

must exceed the minimum safe distance

d_{s a f e}

, formulated as:

\forall t \in [0, T], d_{i j} = ‖p_{i} (t) - p_{j} (t)‖ > d_{s a f e}, \forall i, j \in U

(3)

Based on the comprehensive consideration of navigation positioning uncertainty, control dynamic characteristics, and system safety redundancy, this article sets a safety distance dimension of 40 m.

(4): Kinematic constraints

To enhance the feasibility of formation rendezvous trajectory planning [24], it is essential to account for the kinematic constraints of UUVs, including velocity constraints and heading angular velocity constraints. The planning framework of this study is based on a basic control assumption, that is, when the speed of the UUV is within the range of 1 to 6 knots, its propulsion system and underlying controller can provide sufficient control torque and steering effect, thereby ensuring that the UUV has stable and accurate tracking ability for the longitudinal velocity u and lateral velocity v generated by the plan.

All UUVs must comply with these constraint conditions.

V_{M i n} \leq \sqrt{u^{2} + v^{2}} \leq V_{M a x}

(4)

where

V_{M i n}

and

V_{M a x}

denote the minimum and maximum velocity constraints of the UUV. The value of

V_{M i n}

is 0.6 m/s, The value of

V_{M a x}

is 3 m/s. For the turning angular velocity

r

of UUV, it also satisfies:

r \leq r_{M a x}

(5)

where

r_{M a x}

represents the maximum turning rate of the UUV.

2.3. Problem Formulation

The multi-UUV cooperative trajectory planning problem can be formulated as an optimization problem [25], As mentioned in reference [24], the concept of multi-objective optimization is to achieve the ideal value of the objective function to be optimized while satisfying multiple constraint conditions. The research content of this article is to achieve formation rendezvous of multi-UUV systems within a specified time limit with the goal of optimal energy consumption, while satisfying all constraints during the rendezvous process and planning the rendezvous trajectory

γ_{i}^{o p t} (t)

reasonably.

UUVs typically carry power batteries with limited capacity. Their navigation energy consumption E, is mainly reflected in the power consumption term,

P_{p r o p}

and heading time T of the propulsion system, which is calculated by

E \propto \int_{T} P_{p r o p} d t

. Propulsion power

P_{p r o p}

is mainly used to overcome water resistance

F_{r e s}

which is related to velocity and head in angle relative to the surrounding current. Therefore, this article considers the Voyage length and the angle between the UUV and the direction of the current as factors affecting energy consumption. To support this approach conceptually, power consumption models that similarly relate dynamic and kinematic variables to the battery load are available [26].

The core of this article lies in the trajectory planning method, with a focus on verifying the effectiveness of the proposed method in environments where obstacles and ocean currents coexist. Therefore, based on the above propulsion system energy consumption model, we transform it into an expression form of influencing factors, that is selecting trajectory length and current energy expenditure as components of the objective function

J

, which facilitates the generation of energy-optimal formation rendezvous trajectories. The multi-UUV cooperative trajectory planning problem can be mathematically formulated as follows:

\begin{array}{l} J (γ_{i}^{o p t} (t)) = m i n [a_{1} \sum_{i} \int_{0}^{T} \sqrt{u^{2} + v^{2}} d t + a_{2} \sum_{i} F_{i C u r}], \\ s . t . (2) - (4), \forall i \in U \end{array}

(6)

where

a_{1}

and

a_{2}

are weighting coefficients, and

F_{i C u r}

represents the current compliance degree of the trajectory for UUV

i

, specifically calculated as:

F_{i C u r} = \{\begin{cases} \sum_{k = 1}^{N} \frac{ε Δ ψ_{i} (k)}{\sqrt{u^{2} (k) + v^{2} (k)}}, 0^{°} < Δ ψ_{i} (k) < 9 0^{°} \\ \sum_{k = 1}^{N} \frac{Δ ψ_{i} (k)}{\sqrt{u^{2} (k) + v^{2} (k)}}, 9 0^{°} < Δ ψ_{i} (k) < 180^{°} \end{cases}

(7)

where

Δ ψ_{i} (k)

represents the angle between the heading direction of UUV

i

and the current direction at the

k

-th trajectory point, while

u (k)

and

v (k)

denote the surge and sway velocity components induced by the current at the

k

-th trajectory point, respectively.

ε

is the reward factor, this article takes 0.01. According to Equation (7), a UUV is considered to be leveraging the current for enhanced energy efficiency when a smaller acute angle between its heading and the current direction leads to a greater resultant velocity. Therefore, the energy consumption values mentioned in the subsequent sections of this article and in the table (such as Table 3) are normalized dimensionless values calculated according to this model and do not have physical units. Intended as a unified comparative indicator for subsequent comparative analysis.

3. Dual-Layer Planning Framework

This section is the core part of the article, aiming to achieve decoupling of various constraints and optimization objectives in the planning of individual and cluster trajectories in UUV rendezvous trajectory planning. Conventional integrated cooperative planning methods typically address all motion constraints and optimization objectives for multi-UUV trajectories at a single level [19]. However, in marine environments where obstacles and currents coexist, the pursuit of energy-optimal solutions under multiple coupled constraints—including obstacle avoidance, inter-UUV collision prevention, kinematic limitations, and terminal rendezvous consistency—renders the formation rendezvous trajectory planning problem highly complex. These high-dimensional, strongly constrained optimization problems impose significant computational burdens and are prone to local optima, often failing to yield feasible solutions that satisfy all constraints within limited timeframes. These limitations severely restrict the practical applicability of such methods in marine environments. Therefore, this article transforms the complex multi-objective optimization problem mentioned above into the hierarchical optimization problem mentioned in reference [24]. Divide the optimization objectives and constraints during the formation rendezvous process into two planning layers: individual and group. This paper proposes a dual-layer planning framework for multi-UUV formation rendezvous trajectory planning. The initial individual trajectory planning layer generates input information for the secondary cooperative planning layer. In the secondary collaborative planning layer, as the rendezvous trajectories of each UUV are fine tuned, emphasis is placed on considering collision avoidance constraints between groups, while also taking into account all individual constraint terms such as starting and ending point state constraints. For any set of trajectories that do not meet the UUV constraint conditions, they should all be discarded and iteratively searched for the next set of trajectories that can meet them. All “optimizable individual trajectories” should be synchronized and optimized as a whole “cluster trajectory”. This is the core idea of ensuring that all team rendezvous constraints are met. The overall architecture is illustrated in Figure 2.

The proposed dual-layer planning framework consists of an upper initial individual trajectory planning layer and a lower secondary cooperative planning layer. The core idea is to separate individual constraints from group constraints, with optimal energy consumption as the optimization objective. The second layer (collaborative planning) serves as the upper layer optimization, with the goal of coordinating the solutions (individual trajectories) provided by the lower layer to meet the constraints and objectives of the formation. In the initial individual trajectory planning layer, the trajectory

γ_{i} (t)

of each individual UUV is input into the single-UUV trajectory evaluation function

F_{S} [γ_{i} (t)]

, and the GWO algorithm is employed to optimize high-order terms of polynomial curves. This generates initial single-UUV trajectories

{γ_{i}}^{*} (t)

that aim for energy optimality while satisfying obstacle avoidance, kinematic constraints, and state requirements, which then serve as input information for the secondary cooperative planning layer. In the secondary cooperative planning layer, in addition to maintaining the constraints from the single-UUV trajectory planning phase, inter-UUV collision avoidance constraints are additionally considered. Based on the optimal individual trajectories

{γ_{i}}^{*} (t)

of each UUV, the secondary layer correspondingly generates new optimizable individual trajectories

{γ_{i}}^{+} (t)

. Subsequently, during the formation rendezvous trajectory planning process, all optimizable individual trajectories

{γ_{i}}^{+} (t)

are simultaneously optimized as a collective cluster trajectory

γ_{G} (t) = {{γ_{i}}^{+} (t), {γ_{i}}^{+} (t), \dots, {γ_{i}}^{+} (t)}

, where all UUV trajectories collectively form the optimization variables instead of single UUV trajectories, and are input into the secondary cooperative UUV trajectory evaluation function

F_{G} [γ_{G} (t)]

. The SAPSO algorithm is then applied for iterative optimization to obtain the optimal collective formation rendezvous trajectory

{γ_{G}}^{o p t} (t)

, from which the optimal rendezvous trajectories for each individual UUV

{{γ_{1}}^{o p t} (t), {γ_{2}}^{o p t} (t), \dots, {γ_{i}}^{o p t} (t)}

are derived. This is the difference between the methods used in this article and other literature such as [19], which no longer consider multiple constraints and optimization conditions at the same level, but instead achieve decoupling of constraints and optimization in individual and group planning.

4. Initial Layer for Individual UUV Polynomial Trajectory Planning Optimized by GWO

This section presents the trajectory planning methodology for the initial individual trajectory planning layer. The layer integrates the GWO with high-order polynomial curves, targeting energy-optimal performance while addressing UUV-specific constraints. The optimization results serve as critical inputs to the secondary cooperative planning layer, establishing the foundation for subsequent coordinated trajectory planning.

4.1. Polynomial Trajectory Design

Considering that the UUV system must satisfy strict motion state constraints at both the initial and terminal points, polynomial curves with continuous differentiability characteristics [27] are selected for trajectory construction. Meanwhile, the planning method for high-order polynomial trajectory curves essentially involves planning a continuous and differentiable trajectory curve. The high-order differentiability inherent in this curve ensures the continuity of the derivative of each order of the trajectory, making it highly user-friendly for the control system of UUVs. By adjusting the values of high-order terms, precise control over key motion parameters such as heading angular velocity and velocity can be achieved, enabling the trajectories to fulfill various constraints and optimization requirements.

The trajectory

γ_{i} (t)

of a UUV numbered

i

can be represented by a high-order power series in discrete mathematical space, expressed as:

γ_{i} (t) = \sum_{k = 0}^{M_{i}} a_{k} t^{k}

(8)

For formation trajectory planning, the order of the trajectory power series is typically determined by the following boundary conditions:

\begin{matrix} M_{i} = d_{S} + d_{E} + 1 \end{matrix}

(9)

where

d_{S}

and

d_{E}

represent the higher-order derivatives of the boundary constraints at the initial point and terminal point in point-to-point trajectory planning, respectively. Considering the following boundary conditions:

\{\begin{cases} x_{i} (t_{S}) = x_{i S}, y_{i} (t_{S}) = y_{i S} \\ {\dot{x}}_{i} (t_{S}) = {\dot{x}}_{i S}, {\dot{y}}_{i} (t_{S}) = {\dot{y}}_{i S} \\ x_{i} (t_{E}) = x_{i E}, y_{i} (t_{E}) = y_{i E} \\ {\dot{x}}_{i} (t_{E}) = {\dot{x}}_{i E}, {\dot{y}}_{i} (t_{E}) = {\dot{y}}_{i E} \end{cases}

(10)

This article imposes constraints on velocity and heading at the starting and ending points, From this, it can be concluded that

d_{S} = d_{E} = 1

, meaning the highest-order term

M_{i}

of the polynomial trajectory is of order 3. Therefore, the trajectory planning for the UUV in the horizontal plane can be derived using the following formula:

\begin{matrix} \{\begin{cases} x (t) = a_{0} + a_{1} t + a_{2} t^{2} + a_{3} t^{3} \\ y (t) = b_{0} + b_{1} t + b_{2} t^{2} + b_{3} t^{3} \end{cases} \end{matrix}

(11)

Based on this foundation, a fourth-order high-degree optimization term is incorporated into the trajectory of UUV numbered

i

, as follows:

\{\begin{cases} x_{i} (t) = a_{i 0} + a_{i 1} t + a_{i 2} t^{2} + a_{i 3} t^{3} + a_{i 4} t^{4} \\ y_{i} (t) = b_{i 0} + b_{i 1} t + b_{i 2} t^{2} + b_{i 3} t^{3} + b_{i 4} t^{4} \end{cases}

(12)

Substituting the boundary conditions from Equation (10) into the expression, where

t_{S}

and

t_{E}

represent the start time and end time respectively, we obtain:

\{\begin{cases} x_{i} (t_{S}) = a_{i 0} + a_{i 1} t_{S} + a_{i 2} t_{S}^{2} + a_{i 3} t_{S}^{3} + a_{i 4} t_{S}^{4} = x_{i S} \\ {\dot{x}}_{i} (t_{S}) = a_{i 1} + 2 a_{i 2} t_{S} + 3 a_{i 3} t_{S}^{2} + 4 a_{i 4} t_{S}^{3} = {\dot{x}}_{i S} \\ x_{i} (t_{E}) = a_{i 0} + a_{i 1} t_{E} + a_{i 2} t_{E}^{2} + a_{i 3} t_{E}^{3} + a_{i 4} t_{E}^{4} = x_{i E} \\ {\dot{x}}_{i} (t_{E}) = a_{i 1} + 2 a_{i 2} t_{E} + 3 a_{i 3} t_{E}^{2} + 4 a_{i 4} t_{E}^{3} = {\dot{x}}_{i E} \end{cases}

(13)

This article abstracts each UUV as a particle (i.e., its geometric center point) for processing. The conversion relationship between the UUV carrier coordinate system and the fixed coordinate system is shown in Figure 3 [28]. The fixed coordinate system is represented as

{B}

, and the carrier coordinate system is represented as

{E}

. The simplified underactuated horizontal plane UUV kinematic model [19] is adopted as follows:

\{\begin{array}{l} \dot{x} = u c o s ψ - ν s i n ψ \\ \dot{y} = u s i n ψ + ν c o s ψ \\ \dot{ψ} = r \end{array}

(14)

where

x

and

y

represent the position coordinates of the UUV in the global coordinate system,

ψ

denotes the heading angle,

\dot{ψ}

is the heading angular velocity. In the carrier coordinate system of UUV,

u

and

v

indicate the surge and sway velocities of the UUV, respectively. By using Formula (14) for coordinate transformation, the geodetic coordinate system and the carrier coordinate system are connected, providing a unified standard for the planning of various UUVs.

Substituting the model into Equation (13) yields:

\{\begin{cases} x_{i} (t_{S}) = a_{i 0} + a_{i 1} t_{S} + a_{i 2} t_{S}^{2} + a_{i 3} t_{S}^{3} + a_{i 4} t_{S}^{4} = x_{i S} \\ {\dot{x}}_{i} (t_{S}) = a_{i 1} + 2 a_{i 2} t_{S} + 3 a_{i 3} t_{S}^{2} + 4 a_{i 4} t_{S}^{3} = u_{i S} \cos ψ_{i S} - v_{i S} \sin ψ_{i S} \\ x_{i} (t_{E}) = a_{i 0} + a_{i 1} t_{E} + a_{i 2} t_{E}^{2} + a_{i 3} t_{E}^{3} + a_{i 4} t_{E}^{4} = x_{i E} \\ {\dot{x}}_{i} (t_{E}) = a_{i 1} + 2 a_{i 2} t_{E} + 3 a_{i 3} t_{E}^{2} + 4 a_{i 4} t_{E}^{3} = u_{i E} \cos ψ_{i E} - v_{i E} \sin ψ_{i E} \end{cases}

(15)

Solving the above equation,

[\begin{matrix} 1 & t_{S} & t_{S}^{2} & t_{S}^{3} \\ 0 & 1 & 2 t_{S}^{2} & 3 t_{S}^{2} \\ 1 & t_{E} & t_{E}^{2} & t_{E}^{3} \\ 0 & 1 & 2 t_{E}^{2} & 3 t_{E}^{2} \end{matrix}] [\begin{matrix} a_{i 0} \\ a_{i 1} \\ a_{i 2} \\ a_{i 3} \end{matrix}] = [\begin{matrix} x_{i S} - a_{i 4} t_{S}^{4} \\ u_{i S} \cos ψ_{i S} - v_{i S} \sin ψ_{i S} - 4 a_{i 4} t_{i S}^{3} \\ x_{i E} - a_{i 4} t_{E}^{4} \\ u_{i E} \cos ψ_{i E} - v_{i E} \sin ψ_{i E} - 4 a_{i 4} t_{i E}^{3} \end{matrix}]

(16)

Since

a_{i 4}

represents the optimization term coefficients with known numerical values, the expression for

x_{i} (t)

can be derived. Using the same methodology and substituting accordingly,

[b_{i 0}, b_{i 1}, b_{i 2}, b_{i 3}, b_{i 4}]

can be determined, yielding the expression for

y_{i} (t)

.

4.2. Initial Monomer UUV Trajectory Planning

This subsection achieves optimal formation rendezvous trajectory planning by encoding trajectory parameters in the GWO and guiding the population to emulate hunting behavior through iterative optimization.

4.2.1. Grey Wolf Optimizer

The GWO [29] is a metaheuristic algorithm that simulates the leadership structure and cooperative hunting mechanisms of grey wolf packs. The algorithm classifies the population into four hierarchical levels:

α, β, δ, θ

, modeling the encircling and attacking phases of wolf pack behavior mathematically.

(1): The grey wolf encircling process

The core of the GWO Algorithm lies in the movement of grey wolves, which is represented in mathematical space as:

\{\begin{cases} \vec{X} (t + 1) = {\vec{X}}_{p} (t) - \vec{A} \cdot \vec{D} \\ \vec{D} = |\vec{C} \cdot {\vec{X}}_{p} (t) - \vec{X} (t)| \end{cases}

(17)

where

t

denotes the current iteration number,

\vec{A}

and

\vec{C}

are coefficient vectors,

{\vec{X}}_{p} (t)

represents the position vector of the prey, and

\vec{X} (t)

indicates the position vector of the grey wolf at the

t

iteration. The coefficient vectors

\vec{A}

and

\vec{C}

are computed as follows:

\{\begin{cases} \vec{A} = 2 \vec{a} \cdot {\vec{r}}_{1} - \vec{a} \\ \vec{C} = 2 \cdot {\vec{r}}_{2} \end{cases}

(18)

where

{\vec{r}}_{1}

and

{\vec{r}}_{2}

are random values within the range

[0, 1]

. To simulate the process of gradually approaching the prey,

\vec{A}

represents a random vector within the interval

[- \vec{a}, \vec{a}]

, where the value of

\vec{a}

decreases linearly from 2 to 0 as the number of iterations increases.

(2): Grey wolf attacking process

To mathematically abstract the attacking behavior of grey wolves [30], within the entire population, the

α, β, δ

wolves serve as leaders that guide the hunting activities of the lower-level wolves. They are considered closest to the prey. The other grey wolves are influenced by them and conduct search and attack operations based on their positions. Similarly, the

α, β, δ

wolves update their positions based on information from the other wolves, as illustrated in Figure 4.

In Figure 4, the

α, β, δ

wolves possess distinct random values and consequently maintain different distances to the

θ

wolves. For the other

θ

grey wolves, the position information generated under the influence of the

α, β, δ

wolves are expressed as:

\begin{matrix} X_{1} = X_{α} - A_{1} \cdot (D_{a}) \\ X_{2} = X_{β} - A_{2} \cdot (D_{β}) \\ X_{3} = X_{δ} - A_{3} \cdot (D_{δ}) \end{matrix}

(19)

where

X_{1}

,

X_{2}

, and

X_{3}

represent the position information generated by the influence of the

α, β, δ

wolves on the

θ

level wolves, respectively;

D_{a}

,

D_{β}

, and

D_{δ}

denote the distances between other grey wolves and the

α, β, δ

wolves, respectively, expressed as:

\begin{matrix} D_{α} = |C_{1} \cdot X_{α} - X| \\ D_{β} = |C_{2} \cdot X_{β} - X| \\ D_{δ} = |C_{3} \cdot X_{δ} - X| \end{matrix}

(20)

where

C_{1}

,

C_{2}

, and

C_{3}

represent random values, and

X

denotes the current position of the

θ

grey wolf. Finally, by averaging the position information influenced by the

α, β, δ

wolves, the final adjusted position of the

θ

grey wolf is obtained as:

\begin{matrix} \vec{X} (t + 1) = \frac{{\vec{X}}_{1} + {\vec{X}}_{2} + {\vec{X}}_{3}}{3} \end{matrix}

(21)

4.2.2. Trajectory Planning Process

The core of the initial individual trajectory planning lies in integrating the GWO with the high-order polynomial curve method. This is achieved by employing the GWO to search for optimal values within the defined ranges of parameters

a_{i 4}

and

b_{i 4}

, thereby adjusting the UUV’s trajectory curve. The specific procedural steps are as follows:

(1): Grey wolf population model

Let the grey wolf population size be

N

. The grey wolf model

Γ

selects the high-order optimization term

Γ_{i} (k) = [a_{i 4} (k), b_{i 4} (k)], k = 1, 2, 3 \dots N

for the

k

-th trajectory

γ_{i} (t)

of UUV

i

, with a value range of

[(a_{i M i n}, a_{i M a x}), (b_{i M i n}, b_{i M a x})]

. Let

Z

denote the mapping from the high-order optimization term to the trajectory, as referenced in Equation (15), expressed as:

Z [Γ_{i} (k)] = Z [(a_{i 4} (k), b_{i 4} (k))] = γ_{i} (t) (k)

(22)

(2): Population initialization

Initialize the

θ

wolf in the population as:

θ = {Γ_{i} (1), Γ_{i} (2), Γ_{i} (3), \dots, Γ_{i} (k)}, k = 1, 2, 3 \dots N

(23)

where the value of

Γ_{i} (k)

is assigned as follows:

Γ_{i} (k) = (a_{i 4} (k), b_{i 4} (k)) = \{\begin{cases} a_{i 4} (k) = a_{i M i n} + ξ (a_{i M a x} - a_{i M i n}) \\ b_{i 4} (k) = b_{i M i n} + ξ (b_{i M a x} - b_{i M i n}) \end{cases}

(24)

ξ

satisfies a normal distribution within the interval (0, 1). The corresponding single-UUV trajectory evaluation function

J_{S} (θ)

for the initial

θ

wolf is then given by:

J_{S} (θ) = {J_{S} [Z (Γ_{i} (1))], J_{S} [Z (Γ_{i} (2))], \dots, J_{S} [Z (Γ_{i} (k))]}, k = 1, 2, 3 \dots N

(25)

From the initialized

θ

wolves, three grey wolves—

J_{i} (α)

,

J_{i} (β)

, and

J_{i} (δ)

—are selected based on the criterion of minimal evaluation function values, serving as the

α, β, δ

wolves that represent the three optimal trajectories in the current single-UUV trajectory planning, namely:

\begin{array}{c} J_{S} [γ_{i} (t) (α)] < J_{S} [γ_{i} (t) (β)] < J_{S} [γ_{i} (t) (δ)] \leq J_{S} [γ_{i} (t) (k)], \forall k \neq α \neq β \neq δ \\ s . t . (2), (4) - (5) \end{array}

(26)

(3): Iterative optimization

After one iteration, the three optimal trajectories are selected as the current best trajectories—

Z [Γ_{i}^{i t e r} (α)]

,

Z [Γ_{i}^{i t e r} (β)]

, and

Z [Γ_{i}^{i t e r} (δ)]

. Following the encircling and attacking process of the GWO, these guide the update of the

θ

wolf positions

Γ_{i}^{i t e r + 1} (k)

in the next iteration. This process continues until the maximum number of iterations is reached, at which point the optimal trajectory for UUV

i

is derived based on the optimal high-order optimization terms

Γ_{i}^{o p t}

.

5. Cooperative Layer for Multi-UUV Polynomial Trajectory Planning Optimized by SAPSO

This section introduces the secondary cooperative planning layer within the dual-layer planning framework. Building upon the initial single-UUV trajectories, this study further incorporates the SAPSO algorithm for secondary cooperative trajectory planning. Aiming to minimize energy consumption, the secondary cooperative planning layer comprehensively considers individual UUV constraints while emphasizing collision avoidance among multiple UUVs.

5.1. SAPSO

The performance of the PSO algorithm [31] is inherently constrained by its control parameters. Traditional static parameter configurations exhibit significant adaptability limitations—they are prone to premature convergence from insufficient early-stage exploration and limited convergence precision from inadequate late-stage exploitation. The SAPSO algorithm proposed in this study addresses these issues by establishing an iteration-aware mechanism that enables dynamic parameter adjustment. This approach achieves optimal balance between exploration and exploitation in the solution space through real-time parameter modulation. The parameter update strategy for the particle swarm is detailed as follows.

(1): Adaptive inertia weight adjustment

The inertia weight

ω

, as a key parameter in the PSO [32], determines the degree to which a particle retains its previous velocity. When

ω

is set to a larger value, particles tend to maintain their original motion state, enhancing global exploration capability; when

ω

is set to a smaller value, particles become more influenced by individual and global optimal solutions, facilitating local refinement. The update formula is as follows:

\begin{matrix} ω (k) = ω_{s t a r t} - (ω_{s t a r t} - ω_{e n d}) \cdot [\frac{2 t}{T} - {(\frac{t}{T})}^{2}] \end{matrix}

(27)

where

ω_{s t a r t}

and

ω_{e n d}

represent the initial and terminal inertia weight, respectively.

(2): Adaptive learning factor adjustment

The learning factors

c_{1}

and

c_{2}

are critical parameters in the PSO. Excessively large

c_{1}

values cause particles to over-rely on individual experience and trap them in local search, while disproportionately large

c_{2}

values drive premature convergence toward the swarm’s global best solution. Therefore, during initial iterations, the strategy employs relatively large

c_{1}

and small

c_{2}

values, then linearly decreases

c_{1}

while increasing

c_{2}

throughout the optimization process to guide particles toward the global optimum.

\begin{matrix} c_{1}^{t} = c_{1}^{i n i} + (c_{1}^{f i n} - c_{1}^{i n i}) \cdot \frac{t}{T} \end{matrix}

(28)

\begin{matrix} c_{2}^{t} = c_{2}^{i n i} + (c_{2}^{f i n} - c_{2}^{i n i}) \cdot \frac{t}{T} \end{matrix}

(29)

where

c_{1}^{t}

represents the value of the individual learning factor at the t-th iteration,

c_{1}^{i n i}

denotes the initial value of

c_{1}

at the start of iteration,

c_{1}^{f i n}

indicates the terminal value of

c_{1}

at the end of iteration, while

c_{2}^{t}

,

c_{2}^{i n i}

, and

c_{2}^{f i n}

follow analogous definitions.

(3): Adaptive velocity factor parameter adjustment

During the iterative process of the PSO algorithm, the value of the velocity factor

ρ

critically influences optimization performance. Adopting a larger

ρ

in the early iterations enhances the global search capability of particles, while a smaller

ρ

in later stages facilitates refined local search, thereby improving solution accuracy.

\begin{matrix} ρ (t) = \frac{ρ_{m a x}}{ρ_{m i n} + \exp (γ \cdot (t - \frac{T}{2}))} + ρ_{i n t} \end{matrix}

(30)

where

ρ_{m a x}

and

ρ_{m i n}

represent the maximum and minimum values of the velocity factor, respectively,

ρ_{i n t}

denotes the initial value of the velocity factor, and

γ

serves as the velocity factor parameter.

5.2. Secondary Cooperative Trajectory Planning Process

Secondary cooperative trajectory planning extends the initial single UUV planning by incorporating inter-UUV collision avoidance constraints while maintaining the energy-optimal objective, thereby holistically addressing all operational constraints.

(1): Particle Swarm Model

Let the optimization population size be

N

. The particle model selects

S (k)

. Unlike the optimization population model in the initial single UUV planning, this layer no longer treats the high-order optimization terms

a_{i 4} (k)

and

b_{i 4} (k)

of a single UUV trajectory

γ_{i} (t)

as the optimization object. Instead, it considers the set of high-order optimization terms corresponding to all UUV trajectories participating in the formation rendezvous as a single optimization particle, denoted as

S (k)

.

S (k) = \{[(a_{14} (k), b_{14} (k)], [(a_{24} (k), b_{24} (k)], [(a_{34} (k), b_{34} (k)] \dots, [(a_{i 4} (k), b_{i 4} (k)]\}, i = 1, 2, 3, \dots, \in U

(31)

(2): Population initialization

Unlike the single UUV trajectory planning, the optimization object during initialization no longer selects values from a fixed range as in Formula (24). Instead, it leverages the optimal results from the single UUV trajectories, where

Γ_{i}^{+}

represents the high-order optimization terms for each UUV trajectory generated based on the optimal single UUV trajectories. The initialization is performed as follows:

S (k) = \{(Γ_{1}^{+} (k), Γ_{2}^{+} (k), \dots, Γ_{i}^{+} (k)\} = \{\begin{cases} Γ_{1}^{+} (k) = 0.5 Γ_{1}^{o p t} + 0.5 ξ Γ_{1}^{o p t} \\ Γ_{2}^{+} (k) = 0.5 Γ_{2}^{o p t} + 0.5 ξ Γ_{2}^{o p t} \\ \dots \\ Γ_{i}^{+} (k) = 0.5 Γ_{i}^{o p t} + 0.5 ξ Γ_{i}^{o p t} \end{cases} \begin{matrix} i = 1, 2, \dots, \in U \\ k = 1, 2, 3, \dots, N \end{matrix}

(32)

Equation (32) indicates that after obtaining the high-order optimization terms

Γ_{i}^{o p t}

for each UUV trajectory optimized in the initial layer, new optimizable terms

Γ_{i}^{+}

are randomly generated within a small range. This approach enables fine-tuning of the trajectory curves while satisfying the single UUV constraint conditions, thereby further fulfilling inter-UUV distance constraints and achieving decoupling between single UUV trajectory planning and swarm trajectory planning.

The collective trajectory is subsequently evaluated using the swarm trajectory evaluation function, expressed as:

\begin{array}{l} J_{G} (Z [S (k)]) = {J_{G} [γ_{1} (t) (k)], J_{G} [γ_{2} (t) (k)], \dots, J_{G} [γ_{i} (t) (k)]}, \\ i = 1, 2, \dots, \in U, k = 1, 2, 3 \dots, N, s . t . (2) - (5) \end{array}

(33)

(3): Optimization process of SAPSO

After one iteration, the locally optimal particle

S^{*} (k)

is selected from all

N

particles in the current generation, corresponding to the rendezvous trajectories

{{γ^{*}}_{1} (t), {γ^{*}}_{2} (t), {γ^{*}}_{3} (t), \dots {γ^{*}}_{i} (t)}

of each UUV participating in the formation rendezvous. Guided by the particle swarm movement rules, the particles of the next generation are updated. After recalculating the evaluation function, the optimal rendezvous trajectory

Z [S^{o p t} (k)] = {γ_{1}^{o p t} (t), γ_{2}^{o p t} (t), γ_{3}^{o p t} (t), \dots γ_{i}^{o p t} (t)}

for each UUV is ultimately obtained.

6. Simulation Verification

To validate the effectiveness of the proposed dual-layer planning framework for multi-UUV formation rendezvous trajectory planning, this section presents simulation experiments. Initially, a realistic marine environment with coexisting obstacles and currents is constructed. A 3 km × 3 km satellite map of a selected area in the East China Sea is utilized to create a grid-based map model [33] using the grid method. Subsequently, a current model is established based on the classical Lamb dipole vortex model, which accurately captures key characteristics of full-range intensity variations in practical currents. The impact of vortex flow is abstracted as surge and sway velocity components

u_{c}

and

v_{c}

generated at specific points.

u_{C} ({\vec{c}}_{0}) = - S \frac{y - y_{0}}{2 π {(\vec{c} - {\vec{c}}_{0})}^{2}} [1 - e^{- (\frac{{(\vec{c} - {\vec{c}}_{0})}^{2}}{R^{2}})}]

(34)

v_{C} (\vec{c}) = - S \frac{x - x_{0}}{2 π {(\vec{c} - {\vec{c}}_{0})}^{2}} [1 - e^{- (\frac{{(\vec{c} - {\vec{c}}_{0})}^{2}}{R^{2}})}]

(35)

ω (\vec{c}) = \frac{S}{π R^{2}} e^{- (\frac{{(\vec{c} - {\vec{c}}_{0})}^{2}}{R^{2}})}

(36)

Assume the constant value of the eddy current intensity

S_{1}

in the first vortex field is 1000, with an effective radius

R_{1}

of 500, and the vortex center is located at (750 m, 750 m). For the second vortex field, the constant value of vortex intensity

S_{2}

is 1200, with an effective radius

R_{2}

of 400 m, and the vortex center is positioned at (2000 m, 2000 m). The plotting scale is set to 300 m, meaning currents are visualized at 300 m intervals. The obstacles and current environments constructed in this article are shown in Figure 5, where black areas represent obstacles and blue arrows represent currents.

Assume five UUVs are deployed to execute a formation rendezvous mission, to form a pentagonal formation at the designated area. The relevant rendezvous boundary conditions are specified in Table 1, and the associated parameter configurations are detailed in Table 2, Among them, values without units are constant parameters.

Considering the four cases presented in the table, four simulation experiments were conducted to validate the dual-layer formation rendezvous trajectory planning method under different weighting coefficient scenarios. The simulation results are shown in Table 3.

Table 3. Evaluation results of trajectory length and current energy consumption.

Case	Trajectory Length	Current Energy Consumption Value
Case 1	11.61 × 10³ m	6.26 × 10³
Case 2	11.31 × 10³ m	7.16 × 10³
Case 3	11.01 × 10³ m	8.65 × 10³
Case 4	10.11 × 10³ m	20.74 × 10³

Considering the four cases presented in the table, four simulation experiments were conducted to validate the dual-layer formation rendezvous trajectory planning method under different weighting coefficient scenarios. The simulation results are shown in Table 3.

As shown in Figure 6 and Table 3, the energy consumption values of currents in Table 3 are normalized dimensionless values without physical units and are used for subsequent comparative analysis. with the progressive increase in the weighting coefficient for UUV trajectory length, trajectory length increasingly dominates the planning process. Simultaneously, the formation rendezvous trajectory of the UUVs aligns with the current direction to achieve energy conservation objectives.

In Case 1, where the trajectory length proportion is minimized at 50%, the configuration achieves the optimal current energy consumption evaluation value among Cases 1–4.

As the trajectory length weighting increases to 75% in Case 2, the total trajectory distance shortens from 11.63 × 10³ in Case 1 to 11.31 × 10³, accompanied by noticeable modifications in the trajectory curvature. Consequently, the corresponding energy consumption attributed to currents changes to 7.16 × 10³.

In Case 3, with the trajectory length weighting set at 83%, the UUV trajectory becomes noticeably shorter in the graphical representation, and the specific trajectory length value decreases to 11.01 × 10³. Meanwhile, the energy consumption evaluation value for currents increases as the trajectory length weighting grows, indicating a reduced compliance with currents.

In Case 4, where the trajectory length weighting reaches 100% and thus completely disregards current factors, the resulting trajectory achieves the shortest possible length but corresponds to the worst current energy consumption evaluation value, recorded as 20.74 × 10³.

The subsequent analysis of constraint conditions and optimization results will be based on the outcomes from Case 3.

As shown in Figure 7, the number represents the number of the UUV and is distinguished by color. All UUVs depart from their initial points without colliding with obstacles and arrive at their designated rendezvous positions to form a pentagonal formation. To better visualize the minimum distance between each UUV, let

d_{\min} (i, t)

represent the minimum distance between the

i

-th UUV and any other

j

-th UUV at any time

t

, with the specific expression given by Formula (37).

d_{\min} (i, t) = \underset{i \neq j}{m i n} d (i, j, t), \forall i, j \in U

(37)

As shown in Figure 8, the vertical axis Inter-UUV distance(m) represents the minimum distance from each UUV to the other UUVs, and the vertical axis represents time(s). The minimum distance between any two UUVs is 59.10 m, and at any given moment, all inter-UUV distances exceed the predefined safety threshold

d_{s a f e}

.

As shown in Figure 9, The vertical axis represents the navigation speed(m/s) of UUV, the vertical axis represents time(t). The velocities of all UUVs range between 0.68 m/s and 2.95 m/s, which aligns with the achievable speed range of UUVs. Upon reaching the rendezvous points, all UUVs achieve the preset desired velocities, satisfying the velocity constraints at the rendezvous points.

According to Figure 10, the horizontal axis represents angular velocity r(°/s), and the vertical axis represents time(t). The maximum heading angular velocity observed among all UUVs is 0.46°/s, demonstrating compliance with the turning motion constraints in their trajectories.

Figure 11 displays the minimum distances from each UUV to all obstacle grids. The vertical axis UUV-Obs distance(m) represents the minimum distance from each UUV to the obstacles, and the vertical axis represents time(s). The UUV5 maintains the closest proximity to obstacles with a minimum distance of 6.45 m, confirming that all UUVs successfully avoid collisions with obstacles.

During the simulation, the predefined optimization objective function served as the evaluation criterion for the SAPSO algorithm. When the algorithm iterated until the evaluation function converged to a stable minimum value, this indicated that the high-order optimization terms yielding the optimal objective function had been obtained, thereby determining the optimal formation rendezvous trajectory for the multi-UUV system.

As shown in Figure 12, the algorithm ultimately converged to solution 6.36 × 10⁴. The comprehensive analysis of these results verifies the feasibility of the obtained formation rendezvous trajectory.

In the simulation experiment, we found that we discussed a key unexpected finding: in the ocean current environment, simply pursuing the geometric shortest path (i.e., assigning excessive weight to the “trajectory length” in our method) can actually lead to a significant increase in total energy consumption. In contrast, by taking reasonable values for length and energy consumption, UUVs can be guided to plan moderately circuitous trajectories that can effectively “leverage” the ocean current, thereby achieving better energy consumption performance globally.

7. Conclusions

Formation rendezvous trajectory planning represents a critical challenge in multi-UUV system coordination, particularly in environments with obstacles and currents. The core complexity lies in simultaneously addressing multiple constraints including obstacle avoidance, inter-UUV collision prevention, kinematic limitations, and rendezvous timing synchronization, while maintaining energy-optimal performance as the primary objective. This paper systematically investigates this problem and innovatively proposes a dual-layer planning framework for cooperative trajectory planning.

This method decouples the conventional integrated cooperative planning problem into two hierarchical stages: initial individual trajectory planning and secondary cooperative trajectory planning. In the initial planning layer, the GWO is employed to optimize high-order polynomial curves, comprehensively considering multiple objectives including obstacle avoidance, kinematic constraints, collision prevention, and energy consumption. This process generates initial trajectories for each UUV, serving as inputs to the cooperative layer. In the secondary cooperative planning layer, building upon the initial trajectory information, the SAPSO algorithm performs coordinated exploration within the solution space, ultimately producing globally optimal formation rendezvous trajectories that satisfy all constraint conditions.

Simulation results demonstrate that the proposed dual-layer planning method effectively handles multiple constraints, generating formation rendezvous trajectories that satisfy velocity and angular velocity constraints while ensuring collision avoidance between UUVs and between UUVs and obstacles. Furthermore, by adjusting the weighting coefficients for currents and trajectory length, the method achieves coordinated optimization of UUVs trajectories to minimize energy consumption. These findings validate the effectiveness and superiority of the proposed approach in addressing the multi-UUV formation rendezvous trajectory planning problem.

The current research of the article focuses on verifying the use of a two-layer optimization method to solve the problem of obstacle and multi-UUV rendezvous trajectory planning in the current environment. In practical applications, based on the research in this article, more attention should be paid to the feasibility of planning and real-time planning time to evaluate whether the planning effect is ideal. A broader comparison with general and state-of-the-art multi-layer optimization techniques is an important research direction for the future. In future research, we will increase ocean experiments to consider the impact of the ocean on the solid hull control of unmanned underwater vehicles, while also increasing the number of UUVs performing tasks. Through three-dimensional trajectory planning, we will study the impact of different optimization methods on the overall efficiency of task execution, in order to further improve its engineering practicality.

Author Contributions

Conceptualization, T.C. and K.W.; methodology, T.C. and K.W.; software, K.W.; validation, T.C., K.W. and Q.W.; formal analysis, T.C., K.W. and Q.W.; investigation, T.C. and K.W.; resources, T.C. and K.W.; data curation, T.C. and K.W.; writing—original draft preparation, K.W.; writing—review and editing, T.C., K.W. and Q.W.; visualization, T.C. and K.W.; supervision, Q.W.; project administration, T.C.; funding acquisition, T.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Youth Support Program of China under Grant 002040130635, and in part by the National Natural Science Foundation of China under Grant 52101347.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Martynova, L.; Pashkevich, I.; Bykova, V. Development of a Digital Twin of an Autonomous Underwater Vehicle to Assess the Effectiveness of Searching for Bottom Objects. In Proceedings of the 2024 International Russian Smart Industry Conference (SmartIndustryCon), Sochi, Russia, 25–29 March 2024; pp. 243–248. [Google Scholar]
Luo, N.; Wang, H.; Huang, S.; Gao, W.; Zhong, B.; Huang, Y.; Li, B. Multi-UUV Dynamic Cooperative Task Planning Method Based on Multi-Objective Genetic Algorithm. In Proceedings of the 2023 62nd IEEE Conference on Decision and Control (CDC), Singapore, 13–15 December 2023; pp. 8836–8843. [Google Scholar]
Xun, Y.; Liu, Y.; Wang, Y.; Fan, Z.; Xu, H.; Ma, S. Formation Assembly and Transformation Controller of Multi-UAV based on Sliding Mode Control. In Proceedings of the 2024 36th Chinese Control and Decision Conference (CCDC), Xi’an, China, 25–27 May 2024; pp. 142–148. [Google Scholar]
Zhang, W.; Zhang, K.; Li, Y.; Han, P. Research on Multi-UUV Path Planning Method Based on Recycling Task. In Proceedings of the OCEANS 2022, Hampton Roads, VA, USA, 7–20 October 2022; pp. 1–7. [Google Scholar]
Wu, X.; Long, X.; Yuan, S.; Hu, Q.; Xie, P. Multi-UUV Coordinated Path Planning with Collision Avoidance (CPP/CA) Based on Combination of Improved APF and A*. In Proceedings of the 2022 8th International Conference on Control, Automation and Robotics (ICCAR), Xiamen, China, 8–10 April 2022; pp. 218–223. [Google Scholar]
Xing, X.; Zhou, Z.; Li, Y.; Xiao, B.; Xun, Y. Multi-UAV Adaptive Cooperative Formation Trajectory Planning Based on an Improved MATD3 Algorithm of Deep Reinforcement Learning. IEEE Trans. Veh. Technol. 2024, 73, 12484–12499. [Google Scholar] [CrossRef]
Yang, S.; Yu, J.; Zhang, Z.; Zhao, G. Cooperative Path Planning Method Based on Road Network Constraints for Vehicle-Mounted Multi-Rotor UAV Swarm. In Proceedings of the 2023 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; pp. 1779–1784. [Google Scholar]
Ma, X.; Chen, W.; Sun, R.; Cao, J. Online Collaborative Obstacle Avoidance Path Planning Based on Multi-AUV. In Proceedings of the 2023 9th International Conference on Mechanical and Electronics Engineering (ICMEE), Xi’an, China, 17–19 November 2023; pp. 340–345. [Google Scholar]
Geng, L.; Dong, C.; Han, J.; Jia, J.; Zhao, R. Unmanned Aerial Vehicle Path Planning Based on Inverse Learning Strategy Particle Swarm Optimization Algorithm. In Proceedings of the 2025 IEEE 20th Conference on Industrial Electronics and Applications (ICIEA), Yantai, China, 3–6 August 2025; pp. 1–5. [Google Scholar]
Qu, J.; Li, X.; Sun, G. Optimal Formation Configuration Analysis for Cooperative Localization System of Multi-AUV. IEEE Access 2021, 9, 90702–90714. [Google Scholar] [CrossRef]
Pehlivanoğlu, Y.V.; Bekmezci, İ; Pehlivanoğlu, P. Efficient Strategy for Multi-UAV Path Planning in Target Coverage Problems. In Proceedings of the 2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE), Istanbul, Turkey, 29 September–1 October 2022; pp. 110–115. [Google Scholar]
Lu, L.; Dai, J.; Ying, J. Distributed multi-UAV cooperation for path planning by an NTVPSO-ADE algorithm. In Proceedings of the 2022 41st Chinese Control Conference (CCC), Hefei, China, 25–27 July 2022; pp. 5973–5978. [Google Scholar]
Luo, R. Design of Multi-Objective Path Planning for UAV Based on Adaptive Optimization. In Proceedings of the 2024 6th International Conference on Frontier Technologies of Information and Computer (ICFTIC), Qingdao, China, 13–15 December 2024; pp. 717–720. [Google Scholar]
Kopacz, A.; González, E.G.; Chira, C.; Flecha, J.R.V. Hybrid Adaptive Greedy Algorithm Addressing the Multi-Robot Path Planning Problem. IEEE Lat. Am. Trans. 2025, 23, 856–864. [Google Scholar] [CrossRef]
Nguyen, V.H.A.; Tuong, V.C.; Nguyen, T.T.T.; Le, T.M.; Tran, H.M.; Wang, K.; Tran, L.V.; Dao, S.V.T. ITE-RRT*: Intelligent Path Planning for Autonomous Cars with Intermediary Trees, Triangle Inequality, and Equal Distance Optimization. IEEE Access 2025, 13, 192958–192980. [Google Scholar] [CrossRef]
Gong, Y.J.; Huang, T.; Ma, Y.N.; Jeon, S.W.; Zhang, J. MTrajPlanner: A Multiple-Trajectory Planning Algorithm for Autonomous Underwater Vehicles. IEEE Trans. Intell. Transp. Syst. 2023, 24, 3714–3727. [Google Scholar] [CrossRef]
Wang, X.; Ma, T.; Zhang, L. Rendezvous Trajectory Planning for Air-Launched UAV Swarms Using Wind Energy. IEEE Access 2024, 12, 168531–168546. [Google Scholar] [CrossRef]
Li, H.; Chen, M. Task allocation and path planning problems of multi-AUV system based on auction-dynamic neural network. In Proceedings of the 2023 35th Chinese Control and Decision Conference (CCDC), Yichang, China, 20–22 May 2023; pp. 2945–2949. [Google Scholar]
Wang, Q.; Xu, D.; Liu, X.; Zhang, G.; Han, Z. Trajectory Planning Method for Formation Rendezvous of Underactuated Multi-UUV Under Multiple Constraints. J. Mar. Sci. Eng. 2024, 12, 2118. [Google Scholar] [CrossRef]
Shao, Z.; Yan, F.; Zhou, Z.; Zhu, X. Path Planning for Multi-UAV Formation Rendezvous Based on Distributed Cooperative Particle Swarm Optimization. Appl. Sci. 2019, 9, 2621. [Google Scholar] [CrossRef]
Singh, P.; Kumar, V.; Maurya, H.L.; Kamath, A.K. Velocity Estimator and Twisting Control Based Formation of Mobile Robots in Presence of Delay. In Proceedings of the 2023 IEEE 3rd International Conference on Smart Technologies for Power, Energy and Control (STPEC), Bhubaneswar, India, 10–13 December 2023; pp. 1–6. [Google Scholar]
Louda, S.; Karkar, N.; Seghir, F.; Refoufi, S. Mobile Robot Path Planning Based on A-Star Algorithm and Artificial Potential Field Method for Autonomous Navigation. In Proceedings of the 2024 12th International Conference on Systems and Control (ICSC), Batna, Algeria, 3–5 November 2024; pp. 441–446. [Google Scholar]
Muslimov, T.; Kozlov, E.; Munasypov, R. Drone Swarm Movement without Collisions with Fixed Obstacles Using a Hybrid Algorithm Based on Potential Functions. In Proceedings of the 2023 International Russian Automation Conference (RusAutoCon), Sochi, Russia, 10–16 September 2023; pp. 781–785. [Google Scholar]
Ueda, Y.; Motoi, N. Local Path Planning Based on Velocity Obstacle Considering Collision Probability and Kinematic Constraint for Mobile Robot. In Proceedings of the IECON 2022–48th Annual Conference of the IEEE Industrial Electronics Society, Brussels, Belgium, 17–20 October 2022; pp. 1–6. [Google Scholar]
Mejía-De-Dios, J.A.; Rodríguez-Molina, A.; Mezura-Montes, E. Multiobjective Bilevel Optimization: A Survey of the State-of-the-Art. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 5478–5490. [Google Scholar] [CrossRef]
Cebeci, C.; Grimble, M.J. Speed Tracking of an Electric Vehicle Using a Restricted Structure NGMV Control Algorithm. In Proceedings of the 2022 European Control Conference (ECC), London, UK, 12–15 July 2022; pp. 790–795. [Google Scholar]
Vinayak, A.; Zakaria, M.A.; Baarath, K.; Majeed, A.P.P.A. A novel Bezier curve control point search algorithm for autonomous navigation using N-order polynomial search with boundary conditions. In Proceedings of the 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 19–22 September 2021; pp. 3884–3889. [Google Scholar]
Xu, Z.; Shen, Y.; Xie, Z.; Liu, Y. Research on Autonomous Underwater Vehicle Path Optimization Using a Field Theory-Guided A* Algorithm. J. Mar. Sci. Eng. 2024, 12, 1815. [Google Scholar] [CrossRef]
Lou, L.; Zhang, H. Grey Wolf Optimization algorithm based on Hybrid Multi-strategy. In Proceedings of the 2023 8th International Conference on Intelligent Computing and Signal Processing (ICSP), Xi’an, China, 21–23 April 2023; pp. 1342–1345. [Google Scholar]
Liu, Y.; Wang, J. Optimized Machine Learning Traffic Flow Prediction Model Based on Improved Gray Wolf Algorithm. In Proceedings of the 2022 7th International Conference on Intelligent Informatics and Biomedical Science (ICIIBMS), Nara, Japan, 24–26 November 2022; pp. 355–358. [Google Scholar]
Bin, W. A Novel Supply Chain Multi-level Inventory Model based on Improved PSO Algorithm. In Proceedings of the 2023 8th International Conference on Communication and Electronics Systems (ICCES), Coimbatore, India, 1–3 June 2023; pp. 1733–1737. [Google Scholar]
Boutalbi, O.; Seghir, F.; Boutalbi, A.; Guerra, L. A PSO-Based Global Path Planning Approach for Mobile Robots. In Proceedings of the 2024 12th International Conference on Systems and Control (ICSC), Batna, Algeria, 3–5 November 2024; pp. 354–359. [Google Scholar]
Moreira, L.G.; Brandão, A.S. SLAM-Based 2D Mapping and Route Planning for Autonomous Mobile Robot Navigation. In Proceedings of the 2025 Brazilian Conference on Robotics (CROS), Belo Horizonte, Brazil, 28–30 April 2025; pp. 1–6. [Google Scholar]

Figure 1. Schematic diagram of UUV formation rendezvous.

Figure 2. Dual-layer planning framework diagram.

Figure 3. Schematic diagram of UUV coordinate system relationship.

Figure 4. GWO algorithm diagram.

Figure 5. Formation rendezvous environment diagram.

Figure 6. Schematic diagram of trajectories of different energy consumption ratios. Among them, (a) corresponds to weighting coefficient Case 1, (b) to weighting coefficient Case 2, (c) to weighting coefficient Case 3, and (d) to weighting coefficient Case 4.

Figure 7. Schematic diagram of formation rendezvous trajectory.

Figure 8. Schematic diagram of distance between individual UUV.

Figure 9. Schematic diagram of single UUV velocity.

Figure 10. Schematic diagram of heading angular velocity of single UUV.

Figure 11. Minimum obstacle distance diagram.

Figure 12. Schematic diagram of optimization iteration process.

Table 1. Formation rendezvous boundary conditions.

Point Types	Position $(x, y) / m$	Velocity $(u, ν) / (m / s)$	Heading $ψ / °$
Initial points	(2000, 2500)	(2, 0.03)	−120°
	(500, 2000)	(2, 0.05)	120°
	(1000, 1000)	(2, 0.01)	120°
	(1100, 2900)	(2, 0.06)	180°
	(600, 1100)	(2, 0.04)	90°
Terminal points	(2770.7, 1770.7)	(1, 0)	45°
	(2789.1, 1654.6)	(1, 0)	45°
	(2584.4, 1601.2)	(1, 0)	45°
	(2501.2, 1684.4)	(1, 0)	45°
	(2554.6, 1789.1)	(1, 0)	45°

Table 2. Design of formation rendezvous parameters.

Parameter	Symbols	Value
Population size	$N$	50
Optimization iterations	$i t e r_{\max}$	100
$c_{1}$ initial value	$c_{1}^{i n i}$	0.5
$c_{1}$ terminal value	$c_{1}^{f i n}$	1
$c_{2}$ initial value	$c_{2}^{i n i}$	1.25
$c_{2}$ terminal value	$c_{2}^{f i n}$	2.25
Maximum velocity factor	$ρ_{\max}$	1.5
Minimum velocity factor	$ρ_{\min}$	1
Velocity factor parameter	$γ$	0.4
Initial velocity factor	$ρ_{int}$	0.4
Initial inertia weight	$ω_{s t a r t}$	0.9
Terminal inertia weight	$ω_{e n d}$	0.5
Search range	$S e a r c h_{\lim}$	(−10⁻⁸, 10⁻⁸)
Weighting coefficient	$a_{1}$ $, a_{2}$	Case 1: $a_{1} = 1$ $, a_{2} = 0$
		Case 2: $a_{1} = 1$ $, a_{2} = 1$
		Case 3: $a_{1} = 1$ $, a_{2} = 3$
		Case 4: $a_{1} = 1$ $, a_{2} = 5$
Maximum velocity constraints	$V_{M a x}$	3 m/s
Minimum velocity constraints	$V_{M i n}$	0.6 m/s
Safety distance	$d_{s a f e}$	40 m
Maximum angular velocity	$r_{M a x}$	$°$ /s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, T.; Wang, K.; Wang, Q. Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments. J. Mar. Sci. Eng. 2025, 13, 2221. https://doi.org/10.3390/jmse13122221

AMA Style

Chen T, Wang K, Wang Q. Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments. Journal of Marine Science and Engineering. 2025; 13(12):2221. https://doi.org/10.3390/jmse13122221

Chicago/Turabian Style

Chen, Tao, Kai Wang, and Qingzhe Wang. 2025. "Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments" Journal of Marine Science and Engineering 13, no. 12: 2221. https://doi.org/10.3390/jmse13122221

APA Style

Chen, T., Wang, K., & Wang, Q. (2025). Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments. Journal of Marine Science and Engineering, 13(12), 2221. https://doi.org/10.3390/jmse13122221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Trajectory Planning Method for Multi-UUV Formation Rendezvous in Obstacle and Current Environments

Abstract

1. Introduction

2. Problem Statement

2.1. Preliminaries

2.2. Constraints

2.3. Problem Formulation

3. Dual-Layer Planning Framework

4. Initial Layer for Individual UUV Polynomial Trajectory Planning Optimized by GWO

4.1. Polynomial Trajectory Design

4.2. Initial Monomer UUV Trajectory Planning

4.2.1. Grey Wolf Optimizer

4.2.2. Trajectory Planning Process

5. Cooperative Layer for Multi-UUV Polynomial Trajectory Planning Optimized by SAPSO

5.1. SAPSO

5.2. Secondary Cooperative Trajectory Planning Process

6. Simulation Verification

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI