Research on Multi-UUVs Dynamic Formation Reconfiguration Considering Underwater Acoustic Communication Characteristics

Wan, Chuang; Chen, Tao; Liu, Zhenghong; Fan, Yunyao

doi:10.3390/jmse13122388



College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

* Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(12), 2388; https://doi.org/10.3390/jmse13122388

Submission received: 13 November 2025 / Revised: 9 December 2025 / Accepted: 12 December 2025 / Published: 16 December 2025

(This article belongs to the Section Ocean Engineering)


Abstract

This study investigates the dynamic formation reconfiguration problem for multi-UUV (multi-Unmanned Underwater Vehicle) systems, with a particular focus on the challenges posed by underwater acoustic communication. A two-dimensional grid model is established in the horizontal plane, taking the leader vehicle as a reference point. Based on this model, fundamental motion strategies for formation reconfiguration are proposed. To facilitate reconfiguration, the Particle Swarm Optimization (PSO) algorithm is utilized to assign desired position points to the follower UUVs within the new formation, enabling dynamic target point planning during reconfiguration. Furthermore, the process of generating motion guidance commands and the impact of acoustic communication delays during command transmission are analyzed. To address these delays, a fuzzy logic-based delay compensation method is proposed. Simulation experiments were conducted to validate the proposed approach. The results demonstrate that the formation reconfiguration planning method and the centralized command communication compensation strategy are both effective and practical for multi-UUV systems.

Keywords: multi-UUVs; formation reconfiguration; particle swarm optimization; underwater acoustic communication; fuzzy predictive

1. Introduction

Recent advancements in sensor technology, computing devices, and marine science have led to an increased application of Unmanned Underwater Vehicles (UUVs) in fields such as pipeline monitoring, marine surveying, underwater sampling, and mine detection [1,2,3]. Multi-UUV systems have shown significant advantages in functional redundancy, cooperative operations, and flexibility, making them a prominent focus of current research. Formation control, a central challenge in these systems, involves formation generation, maintenance, and reconfiguration, and has garnered significant attention [4,5]. In practical applications, multi-UUV systems often need to dynamically reconfigure their formations to achieve specific objectives, such as minimizing energy consumption, safeguarding designated targets, or enabling coordinated marine environmental monitoring. Consequently, the development of effective and safe formation reconfiguration strategies is of paramount importance for advancing both the theoretical foundations and engineering applications of multi-UUV systems.

Multi-UUV formation reconfiguration refers to the process of transitioning a multi-UUV system from an arbitrary initial formation to a target formation through information exchange and cooperative movement. Current research on formation reconfiguration in multi-agent systems primarily employs optimization techniques, including artificial potential fields, affine transformations, reinforcement learning, and differential evolution. For instance, graph rigidity and affine transformations (GR-AT) have been utilized to address formation reconfiguration challenges in multi-UUV systems [6,7]. To address dynamic environmental factors, an improved artificial potential field-based approach enables UUVs to avoid both static and dynamic obstacles under external disturbances [8]. Additionally, artificial potential fields have been applied to reconfigure formations after individual failures, ensuring collision avoidance within the multi-UUV system [9]. Other methods combine sensor data-driven transformation rules with control matrices and artificial potential fields to achieve reconfiguration [10]. Beyond UUV systems, innovative techniques have been explored in other domains. For example, model predictive control combined with differential evolution has been applied to unmanned aerial vehicles, enabling distributed planning and control for formation reconfiguration [11]. In spacecraft systems, reinforcement learning-based approaches, such as Q-learning, have been employed to facilitate reconfiguration while ensuring inter-agent collision avoidance through shared learning [12]. Less conventional methods have also been introduced, such as a model-based interfered fluid dynamical system (MIFDS), which balances reconfiguration and obstacle avoidance while respecting kinematic constraints [13], and control barrier functions with quadratic programming to optimize reconfiguration trajectories and ensure collision avoidance [14].

Multi-UUV systems are distinct from other multi-agent systems as they operate in underwater environments. Due to strong absorption and scattering of electromagnetic waves in seawater, transmission ranges are significantly shortened, making underwater acoustic communication the primary method for information exchange in multi-UUV systems [15]. The propagation characteristics of sound in water and the bandwidth of sonar channels lead to substantial delays and intermittency [16,17]. Consequently, designing effective communication compensation methods under acoustic conditions has become a critical research focus in multi-UUV systems [18,19]. Moreover, underwater acoustic communication delays are time-varying and susceptible to packet loss. Reference [20] introduces a gradient descent-based delay estimator and validates its effectiveness through pool experiments. In contrast, reference [21] utilizes kernel density estimation and curve fitting to address issues related to communication discretization and packet loss. Reference [22] develops a discretized UUV control method, proving the consistency of formation control under packet loss via matrix and Schuler theory. Reference [23] derives a nonlinear time-varying delay model for cooperative localization systems, converting delays into measurement deviations and reconstructing equations using Kalman filtering. However, most studies have focused on formation control and consistency analysis under underwater acoustic communication. Furthermore, the communication intervals and delay times of acoustic devices used in these studies are not consistent with actual operating conditions.

The research is structured into the following parts. First, the foundation of formation reconfiguration lies in designing control methods enabling follower UUVs to execute coupled movements while tracking the leader. A grid-based horizontal space model is established with the leader as a reference, and a basic motion strategy for followers is defined. This intuitive and effective approach facilitates deriving control commands for follower movements.

Next, the core focus is collision avoidance and path planning. This paper addresses the multi-objective optimization of formation reconfiguration in two stages. The first stage uses Particle Swarm Optimization (PSO) to allocate desired positions for follower UUVs in the target formation (“which point to go to”), while the second applies PSO to generate guide path points (“how to get there”). The first stage minimizes path intersections and movement distances, and the second ensures accurate and efficient reconfiguration.

Finally, communication constraints inherent in formation reconfiguration are addressed. Considering the practical limitations of underwater acoustic communication, this paper explores a centralized data transmission model and proposes a fuzzy theory-based delay estimation algorithm. This algorithm minimizes the impact of delays and failures on control command transmission, improving system robustness and efficiency.

The main contributions of this paper are:

A horizontal geometric grid model for multi-UUV dynamic formation reconfiguration is established, with dispersion and maneuver-based methods for follower movement. A collision-free planning scheme is proposed based on specific formation requirements.
A PSO-based method is developed for allocating desired formation positions and movement path points. The algorithm determines target coordinates under various constraints, guiding followers during reconfiguration.
A fuzzy theory-based underwater communication delay estimation method is proposed, addressing centralized formation control under communication delays and bandwidth constraints. A control command transmission method is designed to enhance formation reconfiguration robustness.

The paper is structured as follows: Section 2 presents the problem statement. Section 3 establishes the horizontal grid model and defines follower motion strategies. Section 4 discusses maneuvering methods and the PSO-based planning approach. Section 5 introduces the fuzzy-based communication delay estimation method. Section 6 provides simulation results, and Section 7 concludes the paper.

2. Problem Statement

This paper primarily investigates the issue of formation reconfiguration for follower UUVs while tracking the leader UUV. In this study, the leader UUV is assumed to maintain constant speed and linear motion throughout the reconfiguration process. During the reconfiguration process, the leader UUV applies a PSO algorithm to compute the maneuvering position of each follower UUV. This allows the problem of formation reconfiguration to be transformed from a global coordinate system into the moving coordinate frame of the leader UUV, and subsequently into a grid-based space. The focus is then on the allocation of relative positions for each follower UUV, which correspond to the desired point in the target formation. These relative positions are then converted into velocity and heading instructions, with the length of each instruction sequence dynamically adjusted based on communication delays. Upon receiving these commands, the followers complete the reconfiguration using their speed and heading controllers. This study centers on path planning for multi-UUV formation reconfiguration, where the output consists of velocity and heading commands for the followers. External factors like sea currents or chaotic fluctuations may affect control performance but are not the primary focus of this research.

2.1. Communication Topology and Delay Analysis

First, the communication model of the multi-UUV system studied in this paper is described. The multi-UUV system under study is based on a leader-follower centralized formation structure, as illustrated in Figure 1. The system utilizes a hybrid measurement-communication method. The leader UUV serves as the communication hub, equipped with an ultra-short baseline (USBL) positioning sonar array, while the followers are equipped with responders. The leader UUV periodically transmits acoustic pulses, which are received and responded to by the responders on the follower UUVs. These response pulses are then utilized by the leader to determine the relative positions of all follower UUVs [24,25]. The leader then broadcasts the measured relative positions of the followers and other necessary instructions to all followers. This dual-mode communication system not only minimizes system cost but also reduces communication cycles compared to systems that rely solely on acoustic communication.

Under the described communication method, the follower UUVs are unable to transmit information externally. As a result, the leader UUV is designated as the decision-maker, centrally planning the movements of all followers. Specifically, the control commands for the follower UUVs are determined by the leader and transmitted to the followers via acoustic communication. The acoustic communication employs a broadcast mode, where the leader UUV transmits control command information to all followers simultaneously during each communication instance. Considering the characteristics of underwater robotic formation communication, the following issues are addressed:

Unlike radio communication, acoustic sonar cannot sustain high communication frequencies, resulting in relatively long intervals between data transmissions. The time interval between each acoustic broadcast is denoted as $Δ t$ . In other words, the leader UUV broadcasts control command information to the followers once every $Δ t$ seconds.
Due to the complexity and uncertainty of the ocean environment, communication failures may occur. It is assumed in this study that consecutive failures in acoustic broadcast communication will not occur. Specifically, if a communication failure occurs during the broadcast at time t, the broadcast at time $t + Δ t$ will be successful.
The conversion of data into acoustic signals and the propagation of sound in water require a certain amount of time, leading to a time delay in the transmission of information. That is, the broadcast communication transmitted by the leader UUV at time t will be received by the followers at time $t + Δ t_{delay}^{i}$ , where $Δ t_{delay}^{i}$ represents the transmission delay.
The total amount of data transmitted in a single instance is constrained by the communication bandwidth. Thus, an appropriate amount of communication data must be selected. From a practical perspective, the minimum data volume that ensures the continuity of control commands should be chosen.

Let $T_{cmd}^{i}$ represent the length of the control command sequence sent by the leader UUV to the followers during the i-th broadcast cycle. This sequence includes heading and velocity commands for a time duration of $T_{cmd}^{i}$. Based on the above considerations, the problem of acoustic communication can be described as follows:

Firstly, the issue of communication intervals in underwater acoustic communication is considered. To ensure the continuity of control instructions for the follower UUV, the leader UUV transmits a control command sequence of duration $T_{cmd}^{i} = Δ t$ to the follower UUV at intervals of $Δ t$.

Secondly, the problem of communication failure in underwater acoustic communication is addressed. If a communication failure occurs at time $t = t_{i}$ and the length of each control command sequence is $T_{cmd}^{i} = Δ t$, a command vacuum period of duration $Δ t$ will arise for the follower UUV between times $t = t_{i}$ and $t = t_{i + 1}$. To mitigate the impact of communication failure while considering the bandwidth limitations, the duration of the transmitted control command sequence must be extended. Under the condition that the communication interval remains $Δ t$, the leader UUV can ensure the temporal continuity of the follower UUV’s control instructions by transmitting a control command sequence of duration $T_{cmd}^{i} = 2 \cdot Δ t$ during each interval. As illustrated in Figure 2, if the length of each control command sequence is set to $2 \cdot Δ t$, the continuity of control commands for the followers can be ensured even under intermittent communication failures.

Finally, the issue of acoustic propagation delay in underwater communication is considered. The control command sequence must also account for the acoustic delay time, $Δ t_{delay}$. As shown in Figure 3, the control command sequence transmitted by the leader UUV at time $t = t_{i - 1}$ can only be received by the follower UUV at time $t = t_{i - 1} + Δ t_{delay}^{i - 1}$. Although the follower UUV receives a control command sequence of duration $2 \cdot Δ t$, the effective command sequence only spans from $t_{i - 1} + Δ t_{delay}^{i - 1}$ to $t_{i + 1}$. If the broadcast at $t_{i}$ fails, the next sequence, transmitted at $t_{i + 1}$, does not arrive until $t_{i + 1} + Δ t_{delay}^{i + 1}$. Therefore, to ensure the continuity of the follower UUV’s control instructions, the command sequence transmitted by the leader UUV at time $t_{i - 1}$ must be of duration $T_{cmd}^{i - 1} = 2 \cdot Δ t + Δ t_{delay}^{i + 1}$, which includes both the acoustic communication interval $2 \cdot Δ t$ and the acoustic delay time $Δ t_{delay}^{i + 1}$.

In summary, considering the various issues that may arise during underwater acoustic communication, the length of the control command sequence transmitted by the leader UUV to the follower UUV in a single transmission should be set to $T_{cmd}^{i} = 2 \cdot Δ t + Δ t_{delay}^{i + 2}$. Determining how to calculate the acoustic delay time $Δ t_{delay}^{i}$ to ensure uninterrupted command transmission during the multi-UUV formation reconfiguration process will be the primary focus of the subsequent sections.
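The effect of the three choices of command length discussed above can be checked with a small sketch. The Python snippet below is illustrative only and is not the authors' implementation. It assumes a constant delay (the paper treats the delay as time-varying), and the function names `command_duration` and `has_command_gap`, as well as the numeric values in the usage example, are hypothetical.

```python
def command_duration(dt, delay_estimate):
    """Command length T_cmd = 2*dt + estimated delay (constant-delay assumption)."""
    return 2.0 * dt + delay_estimate


def has_command_gap(dt, delay, t_cmd, failed, n_cycles):
    """Return True if the follower is ever left without a valid command.

    Broadcast k is sent at k*dt, arrives at k*dt + delay, and its command
    sequence stays valid until k*dt + t_cmd. `failed` is the set of indices
    of lost broadcasts (assumed never consecutive, as in the text).
    """
    covered_until = None
    for k in range(n_cycles):
        if k in failed:
            continue  # this broadcast never reaches the follower
        arrival = k * dt + delay
        expiry = k * dt + t_cmd
        # A gap appears when the new sequence arrives after the old one expired.
        if covered_until is not None and arrival > covered_until + 1e-9:
            return True
        covered_until = expiry if covered_until is None else max(covered_until, expiry)
    return False


if __name__ == "__main__":
    dt, delay = 4.0, 1.5   # hypothetical values, in seconds
    failed = {2, 5}        # non-consecutive lost broadcasts
    for t_cmd in (dt, 2.0 * dt, command_duration(dt, delay)):
        print(t_cmd, has_command_gap(dt, delay, t_cmd, failed, n_cycles=8))
```

Under these assumptions, only the longest sequence keeps the follower continuously supplied with commands when single broadcasts are lost.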

2.2. Technical Method of Formation Reconfiguration

In this study, the multi-UUV system consists of one leader and four followers, all of which are located on the same horizontal plane. To avoid collisions, individual vehicles in a formation can be spread across different depths. However, multi-UUV systems often need to operate at the same depth in practice, such as during horizontal-plane or deep-diving operations. Depth variations are therefore not considered in this study; this restriction is necessary for the study’s scope and does not simplify the formation reconfiguration problem. During the formation reconfiguration process, the leader UUV maintains a constant linear motion with a heading angle of $h_{l}$ and a velocity of $u_{l}$ ($u_{l} < u_{f}^{\max}$). The desired relative positions of the followers before and after the formation reconfiguration are pre-defined and known to the leader UUV. At the initial moment, all followers are located at their respective desired positions and have the same heading angle and velocity as the leader UUV. Under these conditions, the followers are required to follow the leader while avoiding collisions with other UUVs, ultimately reaching their desired positions in the new formation.

To enable the followers to simultaneously follow the leader and perform formation reconfiguration, a basic motion strategy based on a grid space is defined for the follower UUVs. Based on this motion strategy, a basic motion sequence generation approach is designed to prevent collisions between UUVs.

Given the communication topology of the UUV formation, a centralized decision-making approach is adopted in this study. The overall technical framework is illustrated in Figure 4. Before the initiation of the formation reconfiguration, the leader UUV employs the PSO method to optimize the basic motion sequences for all followers. The leader UUV then broadcasts the optimization results to the follower UUVs. Based on the received motion sequences, each follower generates its respective control and guidance commands. Formation reconfiguration is initiated at the same time for all UUVs to minimize the impact of communication delays on the formation system.

3. Grid Space and Motion Mode

In the context of the multi-UUV formation reconfiguration problem, avoiding collisions among UUVs is of utmost importance. To generate control commands for the follower UUVs more safely and effectively while ensuring successful formation reconfiguration, the grid method is employed in this study to model the horizontal plane where the leader UUV operates.

3.1. Establishment of 2D Grid Space Model

As illustrated in Figure 5, in a relative coordinate system, horizontal and vertical lines are used to divide the plane into row and column regions with uniform spacing, which are then encoded. Straight lines parallel to the y-axis, denoted as $x = k_{row} L_{res}$, are used to divide the 2D space into multiple row regions along the x-axis direction, where $L_{res}$ represents the interval between adjacent lines. The row regions are assigned positive encoding values in the positive x-axis direction. Similarly, straight lines parallel to the x-axis, denoted as $y = k_{col} L_{res}$, divide the 2D space into multiple column regions along the y-axis direction, where $L_{res}$ represents the interval between adjacent lines. The column regions are assigned positive encoding values in the positive y-axis direction.

Here, $k_{col}$ and $k_{row}$ are integers representing the dimensions of the grid space. Based on the positions of all UUVs in the system and their desired positions in the formation, the dimensions of the grid space are restricted as follows:

\begin{array}{l} N_{r_\min} - N_{f} \leq k_{row} \leq N_{r_\max} + N_{f} \\ N_{c_\min} - N_{f} \leq k_{col} \leq N_{c_\max} + N_{f} \end{array}

(1)

where $N_{c_\min}$, $N_{c_\max}$, $N_{r_\min}$, and $N_{r_\max}$ denote the minimum and maximum column and row indices, respectively. Their specific values are determined by the grid spacing $L_{res}$ and the positions of UUVs before and after the formation reconfiguration. These constraints ensure that the follower UUVs remain within the grid space. Without loss of generality, an additional parameter $N_{f} \geq 1$, representing the number of followers, is included in the constraints. For this study, $N_{f} = 4$.
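One possible reading of the bounds in Equation (1) is sketched below in Python. It is an assumption, not the authors' procedure: the minimum and maximum row and column indices are taken over all initial and desired follower positions in the relative frame, using the index rule that appears later as Equation (3), and are then padded by $N_{f}$. The names `grid_index` and `grid_bounds` and the sample positions are hypothetical.

```python
import math


def grid_index(v, l_res):
    """Signed grid index sign(v) * ceil(|v| / l_res); an index of 0 is mapped to 1."""
    k = math.ceil(abs(v) / l_res)
    if k == 0:
        return 1
    return k if v > 0 else -k


def grid_bounds(points, l_res, n_f):
    """Return ((row_min, row_max), (col_min, col_max)) padded by n_f.

    `points` holds (x_r, y_r) positions in the leader-relative frame, covering
    both the initial and the desired formation (assumed interpretation).
    """
    rows = [grid_index(x, l_res) for x, _ in points]
    cols = [grid_index(y, l_res) for _, y in points]
    return ((min(rows) - n_f, max(rows) + n_f),
            (min(cols) - n_f, max(cols) + n_f))


if __name__ == "__main__":
    # Hypothetical relative positions in metres, with a 10 m grid spacing.
    pts = [(-15.0, -25.0), (-15.0, 25.0), (-35.0, 5.0), (-5.0, -45.0)]
    print(grid_bounds(pts, l_res=10.0, n_f=4))
```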

In this manner, the planar space, with the leader UUV as the origin, is divided into $(k_{row} - 1) \times (k_{col} - 1)$ regions. To reduce the computational complexity of the subsequent optimization algorithm, a unified numbering scheme is applied to the regions. As illustrated in Figure 5, the regions are sequentially numbered from left to right and top to bottom. This ensures that each region is assigned a unique identifier, denoted as G. To facilitate subsequent calculations, a method for converting any point’s coordinates in the global coordinate system to the region identifier G is provided below:

For a point P located at $P (x, y)$ in the global coordinate system, with the leader UUV positioned at $P_{l} (x_{l}, y_{l})$ and oriented at an angle $ψ_{l}$ relative to the north direction, the corresponding coordinates of P in the relative coordinate system are denoted as $P_{r} (x_{r}, y_{r})$. The row and column indices of P in the horizontal grid space are denoted as $P_{m} (x_{m}, y_{m})$. The relationship among these coordinates is described as follows:

[\begin{matrix} x_{r} \\ y_{r} \end{matrix}] = [\begin{matrix} \cos (ψ_{l}) & \sin (ψ_{l}) \\ - \sin (ψ_{l}) & \cos (ψ_{l}) \end{matrix}] \cdot [\begin{array}{l} x - x_{l} \\ y - y_{l} \end{array}]

(2)

[\begin{array}{l} x_{m} \\ y_{m} \end{array}] = [\begin{array}{l} sign (x_{r}) ⌈|x_{r}| / L_{res}⌉ \\ sign (y_{r}) ⌈|y_{r}| / L_{res}⌉ \end{array}]

(3)

If $x_{m} = 0$, then $x_{m} = 1$ is specified; likewise, if $y_{m} = 0$, then $y_{m} = 1$ is specified. The relationship between $P_{m} (x_{m}, y_{m})$ and the region identifier G is described as follows:

G = \begin{cases} 2 \cdot k_{\max} (k_{\max} - x_{m}) + (k_{\max} + y_{m}), & \text{if } x_{m} > 0 \text{ and } y_{m} > 0 \\ 2 \cdot k_{\max} (k_{\max} - x_{m}) + (k_{\max} + y_{m} + 1), & \text{if } x_{m} > 0 \text{ and } y_{m} < 0 \\ 2 \cdot k_{\max} (k_{\max} - x_{m} - 1) + (k_{\max} + y_{m}), & \text{if } x_{m} < 0 \text{ and } y_{m} > 0 \\ 2 \cdot k_{\max} (k_{\max} - x_{m} - 1) + (k_{\max} + y_{m} + 1), & \text{if } x_{m} < 0 \text{ and } y_{m} < 0 \end{cases}

(4)

where $k_{\max}$ represents the maximum row and column values in the regions.
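The chain of conversions in Equations (2) to (4) can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code. The function names `to_relative`, `to_grid`, and `region_id` are hypothetical, angles are assumed to be in radians, and the sample numbers are invented.

```python
import math


def to_relative(x, y, x_l, y_l, psi_l):
    """Equation (2): rotate the offset from the leader into the leader frame."""
    dx, dy = x - x_l, y - y_l
    x_r = math.cos(psi_l) * dx + math.sin(psi_l) * dy
    y_r = -math.sin(psi_l) * dx + math.cos(psi_l) * dy
    return x_r, y_r


def to_grid(x_r, y_r, l_res):
    """Equation (3), with an index of 0 replaced by 1 as stated in the text."""
    def index(v):
        k = math.ceil(abs(v) / l_res)
        if k == 0:
            return 1
        return k if v > 0 else -k
    return index(x_r), index(y_r)


def region_id(x_m, y_m, k_max):
    """Equation (4): identifier G, numbered left to right and top to bottom."""
    row_offset = k_max - x_m if x_m > 0 else k_max - x_m - 1
    col = k_max + y_m if y_m > 0 else k_max + y_m + 1
    return 2 * k_max * row_offset + col


if __name__ == "__main__":
    # Hypothetical case: leader at (10, 20) with zero heading, 10 m grid, k_max = 3.
    x_r, y_r = to_relative(25.0, 12.0, 10.0, 20.0, 0.0)
    x_m, y_m = to_grid(x_r, y_r, 10.0)
    print((x_m, y_m), region_id(x_m, y_m, 3))
```

With indices restricted to $\pm 1, \dots, \pm k_{\max}$, this numbering assigns each of the $4 k_{\max}^{2}$ cells a distinct identifier between 1 and $4 k_{\max}^{2}$.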

3.2. Basic Motion Behavior in Grid Space

Due to the complexity of the underwater environment and poor communication conditions, UUVs face an increased risk of collision during the process of formation reconfiguration. Therefore, the selection of maneuvering strategies for follower UUVs during the formation reconfiguration process is critically important.

As shown in Figure 6b, within the horizontal grid space model based on a relative coordinate system, follower UUVs are restricted to only two basic motion modes:

Row motion: motion parallel to the x-axis within the same column region, which changes only the row coordinate of the follower UUV.
Column motion: motion parallel to the y-axis within the same row region, which changes only the column coordinate of the follower UUV.

It can be readily deduced that using these basic motion modes allows follower UUVs to reach any point in the horizontal grid space model. Based on the defined motion methods, multiple motions are required for a follower UUV to avoid collisions. This article proposes a four-step motion planning method, including dispersal motion for individual separation and maneuvering motion to guide individuals towards target locations. This article specifies two possible motion sequences for follower UUVs during formation reconfiguration:

Column dispersion, Row dispersion, Column maneuvering, Row maneuvering.
Row dispersion, Column dispersion, Row maneuvering, Column maneuvering.

As shown in Figure 6a, when a follower UUV moves from its initial position to the desired target point, the motion sequence can be divided into column dispersion, row dispersion, column maneuvering, and row maneuvering. The two types of motion sequences are equivalent; the first (column dispersion, row dispersion, column maneuvering, row maneuvering) is preferred in this article.
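The preferred four-step sequence can be written out as a list of grid waypoints. The Python sketch below is an assumption-based illustration: it takes each step to change exactly one grid coordinate, in line with the row and column motion modes defined above. The function name `four_step_waypoints` and the sample indices are hypothetical.

```python
def four_step_waypoints(start, dispersion, target):
    """Waypoints (row, col) for the first motion sequence.

    start      -- initial (row, col) region of the follower
    dispersion -- (row, col) region reached after the two dispersion steps
    target     -- (row, col) region of the desired position in the new formation
    """
    r0, c0 = start
    r_d, c_d = dispersion
    r_m, c_m = target
    return [
        (r0, c0),    # initial region
        (r0, c_d),   # column dispersion: only the column index changes
        (r_d, c_d),  # row dispersion: only the row index changes
        (r_d, c_m),  # column maneuvering
        (r_m, c_m),  # row maneuvering
    ]


if __name__ == "__main__":
    for waypoint in four_step_waypoints((-2, -2), (-4, -5), (-3, 2)):
        print(waypoint)
```

Swapping the order of the row and column steps in this function would give the second motion sequence.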

In this study, a four-step maneuvering strategy is proposed instead of the traditional two-step maneuvering approach, which includes only row and column motions. This is because, in certain special cases, such as when a follower UUV and the leader UUV are in the same row or column, the follower UUV cannot reach its target position while avoiding the leader UUV with only two motions. The proposed strategy ensures that all follower UUVs can move to their desired positions in the coordinate system established with the leader UUV as the origin, forming the desired formation during reconfiguration.

To optimize the paths of the follower UUVs during formation reconfiguration and ensure the safety of both the follower and leader UUVs, this study employs the PSO algorithm to plan the region indices for the follower UUVs’ dispersion motions and their target positions.

4. Path Point Planning for Formation Reconfiguration

The PSO algorithm is a population-based optimization method inspired by the foraging behavior of bird flocks. It is well suited for our highly nonlinear and multimodal optimization problem, providing an effective balance between exploration and exploitation without requiring gradient information. Compared with other population-based metaheuristics, PSO uses fewer control parameters and typically achieves faster convergence, and its extensive record of success in structurally similar problems further supports its adoption in this study. In PSO, the movement of each particle is influenced by both its own experience and the behavior of the entire swarm, while the quality of each particle is evaluated through a predefined fitness function. Consequently, particle selection and fitness-function design are critical components of the optimization process. Based on these considerations, this section first analyzes the formation-reconfiguration path-planning problem from the perspectives of particle representation and fitness-function construction, and then presents the specific steps of the dynamic PSO-based path-planning method.
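For readers unfamiliar with PSO, the update rule described above (each particle pulled toward its own best position and the swarm's best position) is sketched below in Python. This is the textbook continuous form with an inertia weight, not the specific variant used in this paper, which operates on region identifiers. The function name `pso_minimize`, the parameter values, and the test function are assumptions chosen for illustration.

```python
import random


def pso_minimize(fitness, dim, bounds, n_particles=30, n_iter=200,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimise `fitness` over a box using a basic global-best PSO."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                 # personal best positions
    pbest_val = [fitness(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]  # swarm best

    for _ in range(n_iter):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])   # own experience
                             + c2 * r2 * (gbest[d] - pos[i][d]))     # swarm behaviour
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))  # clamp to box
            val = fitness(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


if __name__ == "__main__":
    best, value = pso_minimize(lambda p: sum(v * v for v in p), dim=2, bounds=(-10.0, 10.0))
    print(best, value)
```

A discrete problem such as region assignment would additionally need the particle positions mapped to valid integer identifiers, for example by rounding, which is left out of this sketch.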

4.1. Selection and Dimensionality Reduction of PSO Particles

In the planning process of the grid-based multi-UUV formation reconfiguration method, collision avoidance among UUVs must be ensured. Therefore, the motion paths of UUVs must be carefully designed. The quality of the planned paths is evaluated using the fitness function of the PSO algorithm, which serves as the sole evaluation criterion. The efficiency of the fitness function directly affects the optimization results produced by the PSO algorithm. Based on the proposed motion strategy for formation reconfiguration, the PSO algorithm is applied to uniformly plan four maneuvers for each follower UUV. Theoretically, four target points need to be planned for each follower UUV. Let the position of a particle in the PSO algorithm be denoted as $X = (x (1), x (2), \dots, x (m)), m = 1, 2, \dots, N_{f}$. The motion target point of the m-th follower is represented as $x (m) = {[\begin{matrix} c_{d} (m) & r_{d} (m) & c_{m} (m) & r_{m} (m) \end{matrix}]}^{T}$. This includes the column region coordinate for column dispersion $c_{d} (m)$, the row region coordinate for row dispersion $r_{d} (m)$, the column region coordinate for column maneuvering $c_{m} (m)$, and the row region coordinate for row maneuvering $r_{m} (m)$. The four motion target points’ row and column region coordinates are then derived based on the initial row and column positions of the follower UUVs, the desired target positions in the formation, and the properties of the four-maneuver strategy.

In the proposed PSO-based reconfiguration path planning method, the final particle position information serves as a feasible solution to the formation reconfiguration problem and constitutes the smallest unit of the algorithm. The dimensionality of the particle corresponds to the dimensionality of the solution space. According to the above analysis, the particles in the optimization algorithm are represented as a $4 \times N_{f}$ matrix. However, the high dimensionality of the particles reduces the efficiency of the optimization process and negatively impacts the results.

In Section 3.1, the regions were assigned unique identifiers, denoted as G. It is evident that each identifier G uniquely corresponds to a pair of row and column values. Therefore, the region identifiers $G_{d} (m)$ and $G_{m} (m)$ can be used to replace $[c_{d} (m) \ r_{d} (m)]$ and $[c_{m} (m) \ r_{m} (m)]$, respectively. As a result, in the PSO algorithm, a particle $X = (x (1), x (2), \dots, x (m))$ can be represented as a $2 \times N_{f}$ matrix, reducing the number of entries per follower from 4 to 2. Specifically, $x (m) = {[G_{d} (m) \ G_{m} (m)]}^{T}$.
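The dimensionality reduction relies on the identifier G being invertible. The Python sketch below illustrates this under the numbering of Equation (4): it encodes each follower's four grid coordinates as the pair $(G_{d}, G_{m})$ and decodes an identifier back into row and column indices. The inverse mapping is derived here for illustration and is not stated in the paper; the names `region_id`, `region_to_grid`, and `encode_particle` are hypothetical.

```python
def region_id(x_m, y_m, k_max):
    """Equation (4): row index x_m and column index y_m to identifier G."""
    row_offset = k_max - x_m if x_m > 0 else k_max - x_m - 1
    col = k_max + y_m if y_m > 0 else k_max + y_m + 1
    return 2 * k_max * row_offset + col


def region_to_grid(g, k_max):
    """Inverse of region_id (derived for this sketch): G back to (x_m, y_m)."""
    row_offset, col0 = divmod(g - 1, 2 * k_max)
    col = col0 + 1
    x_m = k_max - row_offset if row_offset < k_max else k_max - row_offset - 1
    y_m = col - k_max if col > k_max else col - k_max - 1
    return x_m, y_m


def encode_particle(targets, k_max):
    """Map per-follower (c_d, r_d, c_m, r_m) tuples to (G_d, G_m) pairs."""
    return [(region_id(r_d, c_d, k_max), region_id(r_m, c_m, k_max))
            for c_d, r_d, c_m, r_m in targets]


if __name__ == "__main__":
    particle = encode_particle([(-3, 2, 1, -1)], k_max=3)
    print(particle)
    print([tuple(region_to_grid(g, 3) for g in pair) for pair in particle])
```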

4.2. Design of Fitness Function for PSO Algorithm

During the formation reconfiguration process, the primary evaluation criterion is safety, followed by efficiency. Safety is ensured by preventing collisions between the follower UUVs and other UUVs during the reconfiguration process. Efficiency is evaluated based on the maneuvering distance of all follower UUVs within the relative coordinate system. The design of the fitness function is approached from two perspectives: safety and efficiency. For convenience of description, $(x_{f} (i), y_{f} (i))$ is used to represent the coordinates of the i-th follower UUV in the relative coordinate system, which can be calculated using Equation (2). Let $(x_{d} (i), y_{d} (i))$ and $(x_{m} (i), y_{m} (i))$ denote the coordinates of the region centers of $G_{d} (i)$ and $G_{m} (i)$, respectively, in the relative coordinate system.

Efficiency Metric: The formation reconfiguration method proposed in this paper is achieved by the follower UUVs maneuvering relative to the leader. Therefore, the evaluation of reconfiguration efficiency can be conducted by calculating the maneuvering distance of each follower UUV relative to the leader. To achieve rapid and efficient multi-UUV formation reconfiguration, the total maneuvering path length of all UUVs must be minimized. Therefore, the efficiency metric function $f_{e}$ is defined as follows:

f_{e} = \sum_{i = 1}^{N_{f}} [|x_{d} (i) - x_{f} (i)| + |y_{d} (i) - y_{f} (i)| + |x_{m} (i) - x_{d} (i)| + |y_{m} (i) - y_{d} (i)|]

(5)
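Equation (5) is a sum of Manhattan distances over the two legs of each follower's path (current position to dispersion region center, then dispersion region center to target region center). A minimal Python sketch follows; the function name `efficiency_metric` and the sample coordinates are hypothetical.

```python
def efficiency_metric(followers, dispersion, target):
    """Equation (5): total Manhattan path length over all followers.

    Each argument is a list of (x, y) points in the leader-relative frame:
    current follower positions, dispersion region centers, target region centers.
    """
    total = 0.0
    for (x_f, y_f), (x_d, y_d), (x_m, y_m) in zip(followers, dispersion, target):
        total += abs(x_d - x_f) + abs(y_d - y_f)   # leg 1: dispersion
        total += abs(x_m - x_d) + abs(y_m - y_d)   # leg 2: maneuvering
    return total


if __name__ == "__main__":
    followers = [(-20.0, -20.0), (-20.0, 20.0)]
    dispersion = [(-40.0, -50.0), (-30.0, 40.0)]
    target = [(-30.0, 20.0), (-10.0, -20.0)]
    print(efficiency_metric(followers, dispersion, target))
```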

Safety Metric: To ensure collision-free formation reconfiguration, the horizontal workspace is discretized into grid regions, and potential collision scenarios, both follower–follower and leader–follower, are analyzed. Corresponding constraint functions are incorporated into the optimization model to restrict unsafe dispersion values and avoid trajectory intersections. These constraints, together with the grid-region discretization, provide inherent separation among UUVs. A detailed description of the safety constraint design is given below.

Different row and column regions: In the four-step maneuvering method for followers defined in Section 3.2, the first two dispersion motions aim to increase the dispersion of followers, thereby preventing collisions. Therefore, it is desired that the follower UUVs remain as separated as possible, avoiding the same row and column regions after the row and column spreading maneuvers. A function $f_{dif_rc}$ for different row and column dispersion is defined as:

$\begin{cases} f_{dif_rc} = \sum_{i = 1}^{N_{f}} \sum_{j = 1}^{N_{f}} (f_{dif_r} (i, j) + f_{dif_c} (i, j)) \\ f_{dif_r} (i, j) = \begin{cases} 1, & \Delta f_{dif_r} (i, j) = 0 \text{ and } i \neq j \\ 0, & \text{otherwise} \end{cases} \\ f_{dif_c} (i, j) = \begin{cases} 1, & \Delta f_{dif_c} (i, j) = 0 \text{ and } i \neq j \\ 0, & \text{otherwise} \end{cases} \\ \Delta f_{dif_r} (i, j) = x_{d} (i) - x_{d} (j) \\ \Delta f_{dif_c} (i, j) = y_{d} (i) - y_{d} (j) \end{cases}$

(6)
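
Equation (6) can be read as counting ordered pairs of followers whose dispersion points share a row or a column coordinate. The sketch below is one possible reading; the function name and data layout are assumptions for illustration only.

```python
from typing import Sequence, Tuple


def different_row_col_penalty(dispersed: Sequence[Tuple[int, int]]) -> int:
    """Equation (6): count ordered pairs (i, j), i != j, sharing x_d or y_d."""
    penalty = 0
    n = len(dispersed)
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # f_dif_r(i, j) = 1 when x_d(i) - x_d(j) = 0.
            if dispersed[i][0] == dispersed[j][0]:
                penalty += 1
            # f_dif_c(i, j) = 1 when y_d(i) - y_d(j) = 0.
            if dispersed[i][1] == dispersed[j][1]:
                penalty += 1
    return penalty
```

Because the double sum runs over ordered pairs, each conflicting pair contributes twice; this only scales the penalty and does not change which assignments score zero.
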
Assignment of Desired Positions: At the end of the reconfiguration process, each follower UUV must occupy a distinct desired position in the target formation. To ensure this one-to-one correspondence between follower UUVs and target positions, a conflict function for desired positions $f_{m}$ is defined as:

$f_{m} = \sum_{i = 1}^{N_{f}} \sum_{j = 1}^{N_{f}} (G_{m} (i) - G_{m} (j)), i \neq j$

(7)
Non-One Row and Column Dispersion Values: To prevent collisions between follower UUVs and the leader during the column and row maneuvering phases, it must be ensured that the absolute values of the row and column dispersion coordinates are non-one, because the leader UUV is always in the first row and first column. A constraint function $f_{no}$ is defined as:

$\begin{cases} f_{no} = \sum_{i = 1}^{N_{f}} (f_{no_row} (i) + f_{no_col} (i)) \\ f_{no_row} (i) = \begin{cases} 1, & |x_{d} (i)| \leq 1 \\ 0, & \text{otherwise} \end{cases} \\ f_{no_col} (i) = \begin{cases} 1, & |y_{d} (i)| \leq 1 \\ 0, & \text{otherwise} \end{cases} \end{cases}$

(8)
Safety During Column Dispersion: During column dispersion, the starting and ending positions of the follower UUV may intersect with the leader UUV’s path, posing a significant collision risk. To mitigate this risk, a collision function $f_{cd_lf}$ (leader with follower) for column dispersion is defined as follows:

$\begin{cases} f_{cd_lf} = \sum_{i = 1}^{N_{f}} f_{col_dis_lf} (i) \\ f_{col_dis_lf} (i) = \begin{cases} 1, & y_{d} (i) \cdot y_{f} (i) \leq 0 \text{ and } |y_{d} (i)| \leq 1 \\ 0, & \text{otherwise} \end{cases} \end{cases}$

(9)

During row dispersion, follower UUVs move along different column regions without risk of collision. However, during column dispersion, changes in the relative positions of follower UUVs within the same row region could lead to path intersections and collisions. To ensure relative positions are maintained during column dispersion, a conflict function

f_{cd_ff}

(follower with follower) is defined as:

\begin{cases} f_{cd_ff} = \sum_{i = 1}^{N_{f}} \sum_{j = 1}^{N_{f}} f_{col_dis_ff} (i, j), \; i \neq j \\ f_{col_dis_ff} (i, j) = \begin{cases} 1, & (y_{d} (i) - y_{d} (j)) \cdot (y_{f} (i) - y_{f} (j)) \leq 0 \text{ and } x_{d} (i) = x_{d} (j) \\ 0, & \text{otherwise} \end{cases} \end{cases}

(10)
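
The order-preservation idea behind Equation (10) can be sketched as follows: two followers that end column dispersion in the same row region must keep the same relative order along the y-axis that they had at the start. The function name and argument layout are illustrative assumptions.

```python
from typing import Sequence, Tuple

Point = Tuple[int, int]


def column_dispersion_ff_penalty(followers: Sequence[Point],
                                 dispersed: Sequence[Point]) -> int:
    """Equation (10): count ordered pairs whose y-order is not preserved."""
    penalty = 0
    n = len(followers)
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            same_row = dispersed[i][0] == dispersed[j][0]  # x_d(i) = x_d(j)
            # Product <= 0 means the relative y-order flipped or collapsed.
            order_product = ((dispersed[i][1] - dispersed[j][1])
                             * (followers[i][1] - followers[j][1]))
            if same_row and order_product <= 0:
                penalty += 1
    return penalty
```

The row-maneuvering conflict function in Equation (12) follows the same pattern with the roles of the x and y coordinates exchanged.
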

Safety During Row Maneuvering: During row maneuvering, the starting and ending positions of the follower UUV may intersect with the leader UUV’s path, posing a significant collision risk. To mitigate this risk, a collision function $f_{rm_lf}$ (leader with follower) for row maneuvering is defined as follows:

$\begin{cases} f_{rm_lf} = \sum_{i = 1}^{N_{f}} f_{row_mov_lf} (i) \\ f_{row_mov_lf} (i) = \begin{cases} 1, & x_{d} (i) \cdot x_{m} (i) \leq 0 \text{ and } |y_{m} (i)| \leq 1 \\ 0, & \text{otherwise} \end{cases} \end{cases}$

(11)

During column maneuvering, follower UUVs move along different row regions without risk of collision. However, at the end of column maneuvering, multiple follower UUVs may occupy the same column region, and any changes in their relative positions during row maneuvering could lead to path intersections and collisions. To ensure safety, relative positions within the same column region must remain unchanged during row maneuvering. A conflict function

f_{rm_ff}

(follower with follower) is defined as:

\begin{cases} f_{rm_ff} = \sum_{i = 1}^{N_{f}} \sum_{j = 1}^{N_{f}} f_{row_mov_ff} (i, j), \; i \neq j \\ f_{row_mov_ff} (i, j) = \begin{cases} 1, & (x_{d} (i) - x_{d} (j)) \cdot (x_{m} (i) - x_{m} (j)) \leq 0 \text{ and } y_{m} (i) = y_{m} (j) \\ 0, & \text{otherwise} \end{cases} \end{cases}

(12)

Considering the evaluation functions, the comprehensive particle fitness function

f_{pso}

is defined as follows:

\begin{cases} f_{pso} = f_{s} \cdot k_{s} + f_{e} \cdot k_{e} \\ f_{s} = f_{dif_rc} + f_{m} + f_{no} + f_{cd_lf} + f_{cd_ff} + f_{rm_lf} + f_{rm_ff} \end{cases}

(13)

Here,

k_{s}

and

k_{e}

are adaptive weighting coefficients, with their values ranging between 0 and 1, and satisfying the condition

k_{s} + k_{e} = 1

.
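To make the weighting in Equation (13) concrete, the sketch below combines already-computed safety terms with the efficiency metric. The function name is hypothetical, and the default weights are simply the values reported in Section 6 (0.7 and 0.3); how the weights are adapted during optimization is not specified here and is not modeled.

```python
from typing import Sequence, Tuple


def pso_fitness(safety_terms: Sequence[float], f_e: float,
                k_s: float = 0.7, k_e: float = 0.3) -> Tuple[float, float]:
    """Equation (13): return (f_pso, f_s) for one candidate solution.

    safety_terms holds the seven penalties
    (f_dif_rc, f_m, f_no, f_cd_lf, f_cd_ff, f_rm_lf, f_rm_ff).
    """
    if abs(k_s + k_e - 1.0) > 1e-9:
        raise ValueError("weights must satisfy k_s + k_e = 1")
    f_s = sum(safety_terms)           # total safety penalty
    f_pso = f_s * k_s + f_e * k_e     # weighted fitness to be minimized
    return f_pso, f_s
```

Returning the safety total alongside the fitness is a convenience of this sketch, since the first termination condition in Section 4.3 tests the safety total directly.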

4.3. Steps for Path Point Generation Based on PSO

By integrating the basic PSO algorithm with the previously defined particle fitness function, the PSO-based allocation of the region coordinates of the four maneuvering path points for the follower UUVs during formation reconfiguration proceeds as follows:

Step 1: Space Grid Discretization. Based on the coordinates of the initial positions of the follower UUVs and the desired formation positions in the relative coordinate system, the space is discretized into grid regions, resulting in the determination of the number and indexing of regions.

Step 2: Initialization of Algorithm Parameters. Prior to iterative execution, the PSO algorithm necessitates the initialization of several key parameters. These include the learning factors

c_{1}

and

c_{2}

, the maximum number of iterations

i_{\max}

, the maximum allowable iterations for result stabilization

i_{nc}

, and the random initial positions of the particles. Additionally, the maximum velocity of the particles in the solution space is defined as:

v_{\max} = \begin{cases} k_{\max} \times 0.4 \\ N_{f} \times 0.2 \end{cases}

(14)

where

k_{\max}

represents the maximum row and column values in the regions.

Step 3: Calculation of Particle Fitness. At iteration k, the position of the i-th particle is represented as

X_{i}^{k} = (x_{i, 1}^{k}, x_{i, 2}^{k}, \dots, x_{i, d}^{k})

, and its velocity is denoted as

V_{i}^{k} = (v_{i, 1}^{k}, v_{i, 2}^{k}, \dots, v_{i, d}^{k})

, where d represents the dimensionality of the solution space,

i = 1, 2, 3, \dots, n

, and n is the number of particles in the solution space. Based on the defined fitness function

f_{pso}

and the particle position information

X_{i}^{k}

, the corresponding fitness value is calculated.

Step 4: Best Position Updates. After obtaining the fitness values of the particles, each particle updates and records the position

P_{pb}

, corresponding to its current minimum fitness value. Subsequently, the fitness values of all particles are compared to determine the optimal position

P_{gb}

of the entire population.

Step 5: Update of Particle Velocity and Position. Using the following equations, the inertia weight

w

at the current iteration is calculated, followed by the velocity

V_{i, d}^{k + 1}

and position

X_{i, d}^{k + 1}

of the particle at the next iteration:

w = \frac{(w_{\max} - w_{\min}) \cdot (i_{\max} - i_{now})}{i_{\max}} + w_{\min}

(15)

V_{i}^{k + 1} = w \cdot V_{i}^{k} + c_{1} r_{1} (P_{pb}^{k} - X_{i}^{k}) + c_{2} r_{2} (P_{gb}^{k} - X_{i}^{k})

(16)

X_{i}^{k + 1} = X_{i}^{k} + 〈V_{i}^{k + 1}〉

(17)

where

i_{now}

is the current iteration,

w_{\max}

and

w_{\min}

are the maximum and minimum inertia weights, respectively.

r_{1}

and

r_{2}

are random numbers uniformly distributed in the interval [0, 1].

c_{1}

and

c_{2}

are the learning factors initialized in Step 2, and

〈\cdot〉

denotes the floor function, which rounds a number down to the nearest integer. The velocity of all particles must satisfy the following constraint:

v_{i, d} = \begin{cases} v_{\max}, & v_{i, d} > v_{\max} \\ v_{i, d}, & - v_{\max} \leq v_{i, d} \leq v_{\max} \\ - v_{\max}, & v_{i, d} < - v_{\max} \end{cases}

(18)
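
A minimal sketch of the update in Equations (15)–(18) is given below. Several details are assumptions of this example rather than statements from the text: random factors are drawn per dimension, the velocity is clamped before the position is updated, and the velocity limit is passed as one value per dimension. The default parameter values are those listed in Section 6.

```python
import math
import random
from typing import List, Sequence, Tuple


def inertia_weight(i_now: int, i_max: int,
                   w_max: float = 0.9, w_min: float = 0.2) -> float:
    """Equation (15): linearly decreasing inertia weight."""
    return (w_max - w_min) * (i_max - i_now) / i_max + w_min


def clamp(v: float, v_max: float) -> float:
    """Equation (18): limit a velocity component to [-v_max, v_max]."""
    return max(-v_max, min(v_max, v))


def update_particle(x: Sequence[int], v: Sequence[float],
                    p_best: Sequence[int], g_best: Sequence[int],
                    w: float, v_max: Sequence[float],
                    c1: float = 1.494, c2: float = 1.494,
                    rng=random) -> Tuple[List[int], List[float]]:
    """Equations (16)-(17): one velocity and position update."""
    new_x, new_v = [], []
    for d in range(len(x)):
        r1, r2 = rng.random(), rng.random()  # uniform in [0, 1)
        vd = (w * v[d]
              + c1 * r1 * (p_best[d] - x[d])
              + c2 * r2 * (g_best[d] - x[d]))
        vd = clamp(vd, v_max[d])
        new_v.append(vd)
        # The floor keeps positions on integer region indices.
        new_x.append(x[d] + math.floor(vd))
    return new_x, new_v
```

Flooring the velocity rather than the position means a small negative velocity still moves the particle one region backward, which is worth keeping in mind when choosing the velocity limits.
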

Step 6: Termination Condition Check. The following termination conditions are checked. If either condition is satisfied, the result corresponding to the optimal fitness value is output, providing the region indices of the follower UUVs after the dispersion maneuver. If neither condition is satisfied, the algorithm proceeds to Step 4.

Termination condition 1: $f_{s} = 0$
Termination condition 2: The fitness value of the global best position has not changed for $i_{nc}$ consecutive iterations.
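
The two termination conditions can be checked with a small helper such as the one below. This is only a sketch: the function name is hypothetical, and "unchanged for i_nc iterations" is interpreted as the last i_nc + 1 recorded global-best fitness values being identical.

```python
from typing import Sequence


def should_terminate(f_s_of_best: float,
                     best_fitness_history: Sequence[float],
                     i_nc: int) -> bool:
    """Return True if either termination condition of Step 6 holds."""
    # Condition 1: the best solution has no safety violations.
    if f_s_of_best == 0:
        return True
    # Condition 2: global-best fitness unchanged over the last i_nc iterations.
    if len(best_fitness_history) > i_nc:
        recent = best_fitness_history[-(i_nc + 1):]
        return all(value == recent[0] for value in recent)
    return False
```
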

The flowchart of the planning algorithm is shown in Figure 7. After completing the above operations, the optimal results obtained from the PSO algorithm can be transformed using the method described in Section 3.1, yielding the desired path point coordinates for the follower UUVs in the relative coordinate system at each stage.

5. Formation Reconfiguration Instruction Generation Under Communication Delay

In the previous sections, the basic motion modes for follower UUVs in grid space were defined. During formation reconfiguration, the leader UUV centrally generates motion commands for each follower and adjusts them according to communication delays. A delay-adaptive mechanism dynamically determines the length of each command sequence based on the measured latency, reducing minor timing variations and enabling robust execution despite small asynchronous effects. The methods for computing the motion command sequences and the duration of each communication cycle are detailed below.

5.1. Method for Solving Control Instructions for the Behavior

In the two-dimensional grid space, multiple follower UUVs within the same row (or column) region may require row (or column) motion simultaneously. To achieve the spatial transformation of follower UUVs relative to the leader UUV while ensuring safety among the follower UUVs, it is required that all follower UUVs start their row (or column) motions simultaneously. Furthermore, identical accelerations are applied during the acceleration and deceleration phases. Taking the column motion of follower UUVs as an example, the displacement occurs along the y-axis of the relative coordinate system, while the position along the x-axis remains unchanged. Since the leader UUV performs uniform linear motion, the velocity command

u_{cmd_i}

and heading command

h_{cmd_i}

for follower-i during column motion are computed as follows:

\begin{cases} u_{cmd_i} = \sqrt{u_{col}^{2} + u_{l}^{2}} \\ h_{cmd_i} = h_{l} + \arctan (\frac{u_{col}}{u_{l}}) \end{cases}

(19)
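
Equation (19) is a vector sum of the leader velocity and the lateral column-motion component. The sketch below evaluates it with headings in degrees; the function name and the use of degrees are assumptions of this example, and `atan2` is used in place of the arctangent of the ratio, which gives the same result whenever the leader speed is positive.

```python
import math
from typing import Tuple


def column_motion_command(u_l: float, h_l_deg: float,
                          u_col: float) -> Tuple[float, float]:
    """Equation (19): speed and heading command during column motion."""
    u_cmd = math.hypot(u_col, u_l)  # sqrt(u_col^2 + u_l^2)
    h_cmd = h_l_deg + math.degrees(math.atan2(u_col, u_l))
    return u_cmd, h_cmd
```

With the simulation values of Section 6 (leader speed 1.5 m/s, heading 0°) and the maximum relative velocity of 1 m/s, this gives a commanded speed of about 1.80 m/s and a heading offset of about 33.7°.
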

where

u_{l}

and

h_{l}

are the velocity and heading angle of the leader UUV, respectively, and

u_{col}

represents the velocity component of the follower UUVs along the y-axis during column motion. For convenience, the following definitions are introduced:

Definition 1.

The sign of the column motion velocity component of the follower UUV is the same as the positive y-axis direction of the relative coordinate system.

Definition 2.

The column motion distance

d_{col}

of a follower UUV in the relative coordinate system can be calculated from the path points determined by the PSO algorithm. The threshold

d_{tvc}

is used to segment the phases of column motion. If

d_{col} \leq d_{tvc}

, the column motion is divided into two phases: acceleration and deceleration. If

d_{col} > d_{tvc}

, the column motion consists of three phases: acceleration, constant velocity, and deceleration. The threshold

d_{tvc}

is calculated as follows:

d_{tvc} = t_{\max} \times u_{\max}

(20)

where

t_{\max}

and

u_{\max}

are the user-defined maximum acceleration time and maximum relative velocity, respectively. To ensure synchronized motion and prevent collisions among follower UUVs, all follower UUVs use the same acceleration during the acceleration and deceleration phases, and the acceleration and deceleration times are identical. As shown in Figure 8a, when

d_{col} \leq d_{tvc}

,

t_{acc}

and

t_{fin}

represent the end times of the acceleration and deceleration phases, respectively. When

d_{col} = d_{tvc}

,

t_{acc} = t_{acc}^{\max}

and

t_{fin} = t_{fin}^{\max}

are adjusted, and the following conditions are satisfied:

\begin{cases} t_{acc}^{\max} = t_{\max} \\ t_{fin}^{\max} = 2 \cdot t_{acc}^{\max} = 2 \cdot t_{\max} \end{cases}

(21)

The actual acceleration time

t_{acc} = \sqrt{d_{col} \cdot t_{\max} / u_{\max}}

, and

t_{fin} = 2 \sqrt{d_{col} \cdot t_{\max} / u_{\max}}

. The relationships between

u_{col}

, and time t are given as follows:

u_{col} = \begin{cases} \frac{u_{\max}}{t_{\max}} \cdot t, & 0 \leq t \leq t_{acc}, \; t_{acc} = \sqrt{\frac{d_{col} \cdot t_{\max}}{u_{\max}}} \\ - \frac{u_{\max}}{t_{\max}} \cdot t + 2 \sqrt{\frac{d_{col} \cdot u_{\max}}{t_{\max}}}, & t_{acc} \leq t \leq t_{fin}, \; t_{fin} = 2 \sqrt{\frac{d_{col} \cdot t_{\max}}{u_{\max}}} \end{cases}

(22)

As shown in Figure 8b, when

d_{col} > d_{tvc}

,

t_{acc}

and

t_{fin}

remain the end times of the acceleration and deceleration phases, respectively, while

t_{hoir}

represents the end time of the constant velocity phase. The following equation can be derived:

\begin{cases} t_{acc} = t_{\max} \\ t_{fin} = \frac{d_{col}}{u_{\max}} + t_{\max} \\ t_{hoir} = \frac{d_{col}}{u_{\max}} \end{cases}

(23)

Subsequently, the relationship between the column motion velocity component and time t can be expressed as:

u_{col} = \begin{cases} \frac{u_{\max}}{t_{\max}} \cdot t, & 0 \leq t \leq t_{acc}, \; t_{acc} = t_{\max} \\ u_{\max}, & t_{acc} \leq t < t_{hoir}, \; t_{hoir} = \frac{d_{col}}{u_{\max}} \\ - \frac{u_{\max}}{t_{\max}} \cdot t + (u_{\max} + \frac{d_{col}}{t_{\max}}), & t_{hoir} \leq t \leq t_{fin}, \; t_{fin} = \frac{d_{col}}{u_{\max}} + t_{\max} \end{cases}

(24)
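
The two velocity profiles in Equations (22) and (24) can be checked numerically: integrating the profile over the maneuver should recover the column motion distance. The sketch below implements the magnitude of the column velocity component (the sign follows Definition 1) together with a midpoint-rule integration; both function names are assumptions introduced for this example.

```python
import math


def column_velocity(t: float, d_col: float, t_max: float, u_max: float) -> float:
    """Magnitude of u_col at time t, following Equations (20)-(24)."""
    a = u_max / t_max                 # common acceleration
    d_tvc = t_max * u_max             # Equation (20)
    if d_col <= d_tvc:                # two phases, Equation (22)
        t_acc = math.sqrt(d_col * t_max / u_max)
        t_fin = 2.0 * t_acc
        if 0.0 <= t <= t_acc:
            return a * t
        if t_acc < t <= t_fin:
            return -a * t + 2.0 * math.sqrt(d_col * u_max / t_max)
        return 0.0
    t_acc = t_max                     # three phases, Equations (23)-(24)
    t_hoir = d_col / u_max
    t_fin = t_hoir + t_max
    if 0.0 <= t <= t_acc:
        return a * t
    if t_acc < t < t_hoir:
        return u_max
    if t_hoir <= t <= t_fin:
        return -a * t + (u_max + d_col / t_max)
    return 0.0


def travelled_distance(d_col: float, t_max: float, u_max: float,
                       steps: int = 20000) -> float:
    """Midpoint-rule integral of the profile over the whole maneuver."""
    if d_col <= t_max * u_max:
        t_fin = 2.0 * math.sqrt(d_col * t_max / u_max)
    else:
        t_fin = d_col / u_max + t_max
    dt = t_fin / steps
    return sum(column_velocity((k + 0.5) * dt, d_col, t_max, u_max) * dt
               for k in range(steps))
```

With the Section 6 parameters (maximum acceleration time 200 s, maximum relative velocity 1 m/s), the threshold distance is 200 m, so a 50 m column motion uses the two-phase profile and a 300 m motion uses the three-phase profile.
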

Similarly, during row motion, the follower UUV generates displacement along the x-axis of the relative coordinate system, while the position along the y-axis remains unchanged. The velocity command

u_{cmd_i}

and heading command

h_{cmd_i}

for follower-i during row motion can be expressed as:

\begin{cases} u_{cmd_i} = u_{l} + u_{row} \\ h_{cmd_i} = h_{l} \end{cases}

(25)

where

u_{row}

represents the velocity component of the follower UUV along the x-axis during row motion. For convenience, the following definitions are introduced:

Definition 3.

The sign of the row motion velocity component of the follower UUV is the same as the positive direction of the x-axis in the relative coordinate system.

Definition 4.

The row motion distance

d_{row}

of the follower UUV in the relative coordinate system can be calculated. The threshold

d_{tvr}

is used to segment the phases of row motion. As shown in Figure 9a, when

d_{row} \leq d_{tvr}

, the row motion is divided into two phases. As shown in Figure 9b, when

d_{row} > d_{tvr}

, the row motion consists of three phases. The threshold

d_{tvr}

is calculated as follows:

d_{tvr} = t_{\max} \times (u_{\max} + 2 \cdot u_{l})

(26)

where

t_{\max}

and

u_{\max}

are defined as the same parameters used in column motion.

Following a process similar to that used for column motion, the relationship between the row motion velocity component

u_{row}

and time t can be derived as follows for the two-phase case (Equation (27)) and the three-phase case (Equation (28)):

u_{row} = \begin{cases} \frac{u_{\max}}{t_{\max}} \cdot t, & 0 \leq t \leq t_{acc}, \; t_{acc} = \sqrt{\frac{d_{row} \cdot t_{\max}}{u_{\max}}} \\ - \frac{u_{\max}}{t_{\max}} \cdot t + 2 \cdot \sqrt{\frac{d_{row} \cdot u_{\max}}{t_{\max}}} + u_{l}, & t_{acc} \leq t \leq t_{fin}, \; t_{fin} = 2 \cdot \sqrt{\frac{d_{row} \cdot t_{\max}}{u_{\max}}} \end{cases}

(27)

u_{row} = \begin{cases} \frac{u_{\max}}{t_{\max}} \cdot t + u_{l}, & 0 \leq t \leq t_{acc}, \; t_{acc} = t_{\max} \\ u_{l} + u_{\max}, & t_{acc} \leq t \leq t_{vert}, \; t_{vert} = \frac{d_{row}}{u_{\max}} \\ - \frac{u_{\max}}{t_{\max}} \cdot t + u_{\max} + \frac{d_{row}}{t_{\max}} + u_{l}, & t_{vert} \leq t \leq t_{fin}, \; t_{fin} = \frac{d_{row}}{u_{\max}} + t_{\max} \end{cases}

(28)

When the followers deviate from their expected motion, the leader can periodically update its velocity and heading commands to guide them back toward their target points. As defined in Section 3.2, each follower completes the formation reconfiguration after executing four planned row/column transformations. Although strict synchronization among multiple UUVs cannot be guaranteed, the system only proceeds to the next stage after all followers finish the current one. Since the planned trajectories are intersection-free within each stage, the safety of the reconfiguration is maintained despite minor timing variations.

5.2. Time Sequence Instruction Length Calculation Under Communication Delay

The fuzzy algorithm is utilized in this study to determine the compensation for underwater acoustic delay time. The primary factors influencing underwater acoustic communication delay and quality include the relative distance and velocity between the leader UUV and the follower UUV. Therefore, the relative distance and velocity between the leader and the follower UUV are selected as the inputs for the algorithm.

First, the distance between the follower UUV and the leader UUV is used as an input variable. The range of the input variable, referred to as the universe of discourse, is defined as (0, 2400) with units in meters. In fuzzy control, natural language is typically used. Consequently, input and output variables are expressed as linguistic variables, whose values are referred to as linguistic terms. The linguistic terms for the variable “distance” can be defined as {“Near (NL)”, “Relatively Near (NS)”, “Moderate (ZO)”, “Relatively Far (PS)”, and “Far (PL)”}, which can also be represented by integers −2, −1, 0, 1, and 2, respectively. Membership functions

μ (\cdot)

are then used to describe linguistic terms. These membership functions represent the degree to which an element in the universe of discourse belongs to a linguistic term. The membership function

μ (d_{lf})

for

d_{lf}

between the follower UUV and the leader UUV is illustrated in Figure 10 and is represented by triangular membership functions.

Second, the velocity of the follower UUV relative to the leader UUV

u_{lf}

is used as another input variable in fuzzy control. The universe of discourse for

u_{lf}

is defined as (0, 3) with units in meters per second. The linguistic terms for the variable “velocity” can be defined as {“Slow (NS)”, “Moderate (ZO)”, and “Fast (PS)”}. The membership function

μ (u_{lf})

for

u_{lf}

is shown in Figure 11.

Finally, the compensation for the underwater acoustic delay time

Δ t_{delay}

is the output of the algorithm. The universe of discourse for the output variable is defined as (0, 3) with units in seconds. The linguistic terms for the variable “time” can be defined as {“Minimal Compensation (NL)”, “Small Compensation (NS)”, “Moderate Compensation (ZO)”, “Relatively Large Compensation (PS)”, and “Large Compensation (PL)”}. The membership function

μ (Δ t_{delay})

for

Δ t_{delay}

is illustrated in Figure 12.
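
To illustrate how a triangular membership function maps a crisp input to linguistic terms, the sketch below fuzzifies the leader–follower distance over the universe of discourse (0, 2400) m. The breakpoints (evenly spaced peaks at 0, 600, 1200, 1800, and 2400 m) are hypothetical values chosen for this example; the actual shapes are those shown in Figure 10.

```python
from typing import Dict, Tuple


def triangular(x: float, left: float, peak: float, right: float) -> float:
    """Triangular membership; left == peak or peak == right gives a shoulder."""
    if x < left or x > right:
        return 0.0
    if x == peak:
        return 1.0
    if x < peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


# Hypothetical breakpoints (left, peak, right) in meters, for illustration only.
DISTANCE_TERMS: Dict[str, Tuple[float, float, float]] = {
    "NL": (0.0, 0.0, 600.0),        # Near
    "NS": (0.0, 600.0, 1200.0),     # Relatively Near
    "ZO": (600.0, 1200.0, 1800.0),  # Moderate
    "PS": (1200.0, 1800.0, 2400.0), # Relatively Far
    "PL": (1800.0, 2400.0, 2400.0), # Far
}


def fuzzify_distance(d_lf: float) -> Dict[str, float]:
    """Degree of membership of d_lf in each linguistic term."""
    return {term: triangular(d_lf, *abc) for term, abc in DISTANCE_TERMS.items()}
```

Under these assumed breakpoints, a distance of 900 m belongs equally (degree 0.5) to "Relatively Near" and "Moderate", so both sets of rules would fire with the same strength.
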

Once the input and output variables for the fuzzy control system are determined, a fuzzy rule base must be established based on empirical knowledge. Each fuzzy rule is formulated in an “if-then” format with two input variables and one output variable, and the rule base is listed in Table 1. The general form of a rule is:

R_{i} : \text{if } d_{lf} \text{ is } A_{i} \text{ and } u_{lf} \text{ is } B_{i} \text{ then } Δ t_{delay} \text{ is } C_{i}

(29)

For the defuzzification process, the weighted average method is applied to obtain precise instructions, with the corresponding formula provided as follows:

Δ t_{delay} = \frac{\int_{Δ t_{delay}} Δ t_{delay} μ_{C^{'}} (Δ t_{delay}) d Δ t_{delay}}{\int_{Δ t_{delay}} μ_{C^{'}} (Δ t_{delay}) d Δ t_{delay}}

(30)
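
Equation (30) can be approximated numerically by sampling the aggregated output membership function over the output universe of discourse. The sketch below is a discrete stand-in for the integral; the function name, the sample count, and the example membership function are assumptions for illustration and do not reproduce the rule base in Table 1.

```python
from typing import Callable


def defuzzify(mu: Callable[[float], float], lo: float = 0.0, hi: float = 3.0,
              samples: int = 3001) -> float:
    """Discrete approximation of Equation (30) on evenly spaced samples."""
    step = (hi - lo) / (samples - 1)
    numerator = 0.0
    denominator = 0.0
    for k in range(samples):
        z = lo + k * step           # candidate delay compensation in seconds
        m = mu(z)                   # aggregated membership at z
        numerator += z * m
        denominator += m
    if denominator == 0.0:
        raise ValueError("no rule fired: aggregated membership is zero")
    return numerator / denominator
```

For example, a symmetric output set centered at 1.5 s and clipped at a firing strength of 0.4 yields a compensation of 1.5 s, as expected from symmetry.
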

6. Simulations

To validate the feasibility of the proposed multi-UUV formation reconfiguration method under underwater acoustic communication delays, simulation experiments were conducted using specialized simulation software.

First, the relevant parameters for the simulation experiments were defined. The adjacent interval between row and column regions in the horizontal grid model was set to

L_{res} = 10 m

. The multi-UUV system consisted of one leader and four followers. The underwater acoustic communication interval was

Δ t = 5 s

. The leader UUV had a heading angle

h_{l} = 0 °

and a velocity

u_{l} = 1.5 m / s

in the global coordinate system. The maximum acceleration time for row and column motions was

t_{\max} = 200 s

, and the maximum relative velocity for row and column motions was

u_{\max} = 1 m / s

.

In this formation reconfiguration simulation, the initial formation of the multi-UUV system was rectangular, while the desired formation was triangular. The initial position information of the follower UUVs in the relative coordinate system is presented in Table 2, and the desired formation position information is shown in Table 3.

For the PSO algorithm, the dimensionality of particles was set to 2, and the number of particles in the swarm was

n = 30

. The learning factors were

c_{1} = c_{2} = 1.494

, and the maximum and minimum inertia weights were

w_{\max} = 0.9, w_{\min} = 0.2

. The fitness function weighting coefficients were

k_{s} = 0.7, k_{e} = 0.3

. The stabilization iteration limit was

i_{nc} = 200

, and the maximum number of iterations was

i_{\max} = 200

.

The movement trajectories of the UUVs during the simulation are shown in Figure 13. At the initial moment (t = 0 s), the follower UUVs were distributed around the leader UUV, forming a rectangular formation. At this time, the leader UUV assigned initial positions and planned the target points for formation reconfiguration for each follower. Navigation and velocity control commands were periodically transmitted to the followers via simulated underwater acoustic communication. After 1912.5 s, the formation reconfiguration was completed, transforming the initial rectangular formation into the desired triangular formation.

During formation reconfiguration, the distances between the leader UUV and the follower UUVs are depicted in Figure 14, while the distances among the follower UUVs are shown in Figure 15. The simulation results indicate that the distances between the individual UUVs within the multi-UUV system consistently remained within a safe range, with no risk of collision observed. Based on the data presented in Figure 13, Figure 14 and Figure 15, it can be concluded that the proposed formation reconfiguration method successfully achieves dynamic formation adjustments for multiple UUVs while preventing collisions.

The convergence process of the fitness function in the PSO-based path planning method utilized in this study is illustrated in Figure 16. The fitness value of the particle swarm reached its optimal value of 22.2 at the 19th optimization iteration, indicating that the optimal set of path points for formation reconfiguration had been determined. These findings validate the effectiveness of the PSO-based path planning method proposed in this study.

Figure 17 displays the length of the control command sequences received by each follower UUV from the leader UUV. The bar segments in the figure represent the time duration encompassed in the control command sequences received by the follower UUVs at regular intervals. The red line indicates the variation trend of a single control instruction sequence length during the experiment. It can be observed that the time duration of the control command sequences transmitted by the leader UUV varied due to the relative distances between the leader and the follower UUVs and the speeds of the follower UUVs. The simulation results indicate that, although control command sequences were sent at regular intervals, the lengths of the received sequences were sufficient to ensure the continuity of control commands without any loss.

The changes in speed commands and actual speeds of the follower UUVs during the formation reconfiguration process are presented in Figure 18, while the changes in heading commands and actual headings are shown in Figure 19. From Figure 19, it can be observed that the heading control commands for followers 1 and 2 remained constant at 0°, indicating that these two UUVs performed linear motions only during the reconfiguration process. This reflects the effectiveness of the proposed target position allocation method. Furthermore, Figure 18 and Figure 19 confirm that the speed and heading control commands received by the follower UUVs were continuous over time, with no loss of control commands observed. These results indirectly demonstrate the feasibility of the proposed delay compensation method.

7. Conclusions

Multi-UUV systems often require formation reconfiguration based on the specific tasks. However, achieving multi-UUV formation reconfiguration involves several challenges. First, when the formation is in motion, coordinating the control of the UUVs to reconfigure while following the overall movement of the formation presents a challenge. Second, designing a simple and effective method to prevent collisions among individual UUVs within the system is crucial. Lastly, under acoustic communication conditions, designing a feasible information exchange method that satisfies the formation reconfiguration requirements is also a critical issue. To address these challenges, a multi-UUV formation reconfiguration method based on PSO and fuzzy theory is proposed in this paper.

This study takes into account the characteristics of acoustic communication delays in marine environments and designs a formation reconfiguration method to address both the safety and speed of multi-UUV maneuvers. A two-dimensional spatial model of the horizontal plane is established using a grid-based method, and basic motion patterns for the follower UUVs within this model are designed. The PSO method is then used to plan the maneuvering target points and desired formation positions for each follower UUV, ensuring the safety of the UUVs during the formation reconfiguration process. Furthermore, a corresponding UUV control command transmission method is implemented, considering the issues associated with acoustic communication. Simulation results demonstrate the effectiveness of the proposed method. The simulations indicate that the method can successfully achieve multi-UUV formation reconfiguration even under acoustic communication delays, with the leader UUV maintaining a constant speed.

This study focuses on two-dimensional multi-UUV formation reconfiguration due to project requirements. Formation reconfiguration in three-dimensional space is also of great importance and will be explored in future work, together with the effects of model uncertainties, sea currents, and other environmental disturbances in more complex marine conditions.

Author Contributions

Methodology, C.W.; Software, C.W. and Y.F.; Investigation, C.W.; Writing—original draft, C.W. and Y.F.; Writing—review & editing, C.W., T.C. and Z.L.; Supervision, T.C.; Funding acquisition, T.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Youth Support Program of China under Grant 002040130635 and, in part, by the National Natural Science Foundation of China under Grant 52101347.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.


Figure 1. Schematic of communication in the multi-UUV system. The red dots on the leader UUV indicate the installation positions of the USBL positioning sonar array; the red dot on each follower indicates the installation position of the responder.

Figure 2. Control instruction transmission length with communication intervals and failures.

Figure 3. Control instruction transmission length with intervals, delays, and failures.

Figure 4. Framework for multi-UUV formation reconfiguration.

Figure 5. Two-dimensional grid space model.

Figure 6. Two types of formation reconfiguration based on grid space motion modes. (a) Column dispersion, row dispersion, column maneuvering, row maneuvering; (b) row dispersion, column dispersion, row maneuvering, column maneuvering.

Figure 7. Path planning for formation reconfiguration based on the PSO algorithm.

Figure 8. Stages of column motion for followers. (a) Two stages at shorter distances; (b) three stages at longer distances. The solid line denotes motion distance; the dashed line denotes threshold distance.

Figure 9. Stages of row motion for followers. (a) Two stages at shorter distances; (b) three stages at longer distances. The solid line denotes motion distance; the dashed line denotes threshold distance.

Figure 10. Membership function of the distance between followers and leader.

Figure 11. Membership function of relative velocity between followers and leader.

Figure 12. Membership function for compensation of underwater acoustic delay time.

Figure 13. Trajectory of formation reconfiguration during simulation.

Figure 14. Distance between the leader and followers.

Figure 15. Distance among followers.

Figure 16. Convergence process of PSO fitness.

Figure 17. Length of a single control instruction sequence received by followers.

Figure 18. Variation in actual and instructed velocities of followers.

Figure 19. Variation in actual and instructed headings of followers.

Table 1. Fuzzy reasoning table.

Acoustic delay time $\Delta t_{delay}$		Distance between the leader and followers $d_{lf}$
		NL	NS	ZO	PS	PL
Relative velocity $u_{lf}$	NS	NL	NL	NS	ZO	PS
	ZO	NL	NS	ZO	PS	PL
	PS	NS	ZO	PS	PL	PL
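
As a reading aid for Table 1, the rule base can be written as a small lookup. The sketch below is illustrative only: the names `D_LABELS`, `RULES`, and `lookup_rule` are hypothetical, and it encodes only the linguistic rule table. The membership functions of Figures 10–12 and the defuzzification step are not reproduced here.

```python
# Linguistic labels for the leader-follower distance d_lf (columns of Table 1).
D_LABELS = ["NL", "NS", "ZO", "PS", "PL"]

# Rows of Table 1: relative-velocity label u_lf -> delay-compensation labels,
# listed in the same order as D_LABELS. Names here are assumptions for illustration.
RULES = {
    "NS": ["NL", "NL", "NS", "ZO", "PS"],
    "ZO": ["NL", "NS", "ZO", "PS", "PL"],
    "PS": ["NS", "ZO", "PS", "PL", "PL"],
}


def lookup_rule(u_label: str, d_label: str) -> str:
    """Return the delay-compensation label for one (u_lf, d_lf) rule of Table 1."""
    if u_label not in RULES:
        raise KeyError(f"unknown relative-velocity label: {u_label}")
    if d_label not in D_LABELS:
        raise KeyError(f"unknown distance label: {d_label}")
    return RULES[u_label][D_LABELS.index(d_label)]


if __name__ == "__main__":
    # Print the rule fired when the relative velocity is ZO and the distance is PS.
    print(lookup_rule("ZO", "PS"))
```

In a full fuzzy controller each input would fire several rules with different membership degrees; this lookup only shows which output label each individual rule maps to.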

Table 2. The initial position information of followers.

Initial Formation		y-Axis Coordinate (m)	x-Axis Coordinate (m)
Rectangular formation	Follower-1	150	150
	Follower-2	−150	150
	Follower-3	150	−150
	Follower-4	−150	−150

Table 3. Expected multi-UUVs formation position information.

Expected Formation		y-Axis Coordinate (m)	x-Axis Coordinate (m)
Triangle formation	Expected position 1	−200	−150
	Expected position 2	−200	150
	Expected position 3	−400	−250
	Expected position 4	−400	250

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
