Next Article in Journal
Multi-Branch Towed Array System: Systematic Analysis of Modeling Methods, Environmental Responses and Mechanical Properties in Fracture Analysis
Previous Article in Journal
Empirical Research to Design Rule-Based Strategy Control with Energy Consumption Minimization Strategy of Energy Management Systems in Hybrid Electric Propulsion Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Formation Control of Multiple UUVs Based on GRU-KF with Communication Packet Loss

1
Key Laboratory of Underwater Robot Technology, Harbin Engineering University, Harbin 150001, China
2
College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2025, 13(9), 1696; https://doi.org/10.3390/jmse13091696
Submission received: 24 July 2025 / Revised: 22 August 2025 / Accepted: 27 August 2025 / Published: 2 September 2025
(This article belongs to the Section Ocean Engineering)

Abstract

In response to the problem of decreased collaborative control performance in underwater unmanned vehicles (UUVs) with communication packet loss, a GRU-KF method for multi-UUV control that integrates a gated recurrent unit (GRU) and a Kalman filter (KF) is proposed. First, a UUV feedback linearization model and a current model are established, and a multi-UUV controller-based leader–follower method is designed, using a neural network-based radial basis function (RBF) to counteract the uncertainty effects in the model. For scenarios involving packet loss in multi-UUV collaborative communication, the GRU network extracts historical temporal features to enhance the system’s adaptability to communication uncertainties, while the KF performs state estimation and error correction. The simulation results show that, compared to compensation by the GRU network, the proposed method significantly reduces the jitter level and convergence time of errors, enabling the formation to exhibit good robustness and accuracy in communication packet loss scenarios.

1. Introduction

Unmanned underwater vehicles (UUVs) serve as indispensable autonomous platforms for ocean exploration, performing tasks including marine resource sampling [1,2], subsea pipeline inspection [3,4,5], and military reconnaissance [6,7] without human intervention. Their operational scope remains constrained by endurance and payload limitations, driving significant research interest in cooperative UUV formations.
To achieve the cooperative operation of UUV formations, information exchange between UUVs needs to be carried out through underwater communication. Due to the long transmission distance and strong penetrability, hydroacoustic communication is the most widely used communication method at present. However, due to the specific characteristics of the underwater environment, hydroacoustic communication has certain limitations: the bandwidth of hydroacoustic communication is limited and susceptible to interference, which can cause time delays and packet loss during information transmission between UUVs [8]. Yan et al. constructed a discrete-time control architecture based on the virtual leader and established a sufficient condition for the convergence of the system based on Shuler’s theory [9]; Chen and Guo proposed an AUV formation-control algorithm based on consistency theory and the leader–follower method, and introduced a hybrid communication topology to adapt to the control of large formations [10]; Li et al. categorized communication into effective and ineffective categories through delay limitation, and designed an AUV formation-control method based on the switching of communication topology to address the problem of ineffective communication [11]; Jiang et al. designed a formation-control protocol with deterministic switching signals to solve the problem of weak communication [12]; Kuniyoshi et al. designed a sequential inter-carrier interference countermeasure to improve the channel transfer function [13]; Yan and Pan considered communication delay to be a specific function with upper and lower bounds, and proposed three coordinated control protocols to achieve UUV formation control [14]. Kiran et al. used artificial intelligence and reinforcement learning to optimize underwater communication protocols, and demonstrated the improved efficiency of AI-assisted communication compared to traditional methods by employing experimental design metrics [15]; Yan et al. designed a state message buffer to synchronize asynchronous messages with communication delays [16]; Cai et al. applied event-triggered leader–follower communication mechanisms to a formation-control model that reduces the number of unnecessary communications between multiple UUVs [17], but has strict requirements regarding the triggering conditions and constraints; and Tang et al. introduced hierarchical reinforcement learning to improve the communication rate of a maritime system through reinforcement learning [18]. Determining how to achieve stable and efficient UUV formation control with underwater communication constraints is a key direction in current UUV formation research.
Although significant progress has been made in UUV formation control under communication constraints, several critical challenges persist: first, existing studies often employ oversimplified multi-UUV models, inadequately accounting for model uncertainty and external disturbances; second, research on communication constraints predominantly focuses on time delay issues, while information gaps caused by communication packet loss remain insufficiently explored.
On this basis, this paper proposes a GRU-KF-based UUV formation-control method for the problem of communication packet loss in the cooperative control of UUV formations. The main contributions of this study are as follows:
(1)
In order to achieve the cooperative control of UUV formations, a backstepping sliding mode controller based on the leader–follower method is designed by combining the relative position constraints of the leader and the follower.
(2)
In order to solve the problem of communication packet loss, a GRU network is utilized to predict the time series of the leader’s trajectory; then, a Kalman filter is introduced, and the predicted value of the GRU network is used as the measured value of the state in the Kalman filtering algorithm to update the leader’s state in real time. This method can optimize the control performance of the UUV formation under the influence of communication packet loss and reduce the control errors caused by external environmental interference.

2. UUV Model

Mathematical models of UUVs represent the basis for studying their motion characteristics and control problems, which directly affect the accuracy and performance of their control algorithms. In practical applications, UUVs are usually underactuated, and to reduce model complexity and align with the actual mission requirements, we consider the rolling of UUVs to be self-stabilizing [19]. Given the prevalence of ocean currents in underwater environments, establishing a current disturbance model is essential to investigating their impact on UUV motion control and enhancing operational precision.

2.1. UUV Linearization Model

In order to study the motion of UUVs, it is necessary to establish a coordinate system that can accurately describe changes in UUV state variables; in this study, an inertial coordinate system { E } and a carrier coordinate system { B } are established, as shown in Figure 1.
The inertial coordinate system takes the earth or the seabed as the reference point and is a fixed global coordinate system. The axes of this coordinate system are defined as follows: X E points to the north; Y E is on the horizontal plane, coplanar and perpendicular to X E , and points to the east; and Z E points vertically downward.
The carrier coordinate system generally takes the center of mass of the UUV as the origin, and the coordinate axes are defined as follows: X B is oriented along the yaw direction of the UUV, pointing forward; Y B is oriented along the starboard direction, pointing to the right side; and Z B points vertically downward.
η = [ x y z θ ψ ] T R 3 × S 2 denotes the attitude vector in the fixed coordinate system, ν = [ u v w q r ] T R 5 denotes the velocity vector in the carrier coordinate system, and T = [ X Y Z M N ] T R 6 denotes the force and moment in the carrier coordinate system.
Combining the UUV kinematics and dynamics models, the vector expression of the nonlinear model of the UUV can be obtained as follows:
η ˙ = J ( η ) v M R v ˙ + M A v ˙ + C R ( v ) v + Y ( v ) + g ( η ) = T + ω
where ω is the external perturbation of the UUV, J ( η ) is the coordinate system transformation matrix, M R and M A are the rigid-body inertia matrix and the additional mass matrix, C R ( v ) is the Coriolis matrix, Y ( v ) is the fluid damping matrix, and g ( η ) is the vector of restoring force exerted by gravity and buoyancy simultaneously.
To achieve feedback linearization of the nonlinear mathematical model of the UUV, the second-order integral model is obtained as follows:
z ˙ 1 = z 2 z ˙ 2 = u
where z 1 is the position information of the UUV after coordinate transformation, z 2 is its velocity information, and u is its control input.

2.2. Current Model

In an actual marine environment, ocean currents are common and affected by many factors. During motion control, vehicles experience not only actuator inputs but also hydrodynamic disturbances from currents. Current forces have hydrodynamic effects on vehicles, including perturbation forces on the hull and perturbation torques. These forces and torques affect the kinematic performance of the UUV. Therefore, the effect of current disturbances needs to be considered when designing UUV controllers.
A current is a vector with magnitude and direction, and the mathematical model of a current in an inertial coordinate system is
d c u r E = u c E v c E w c E
where u c E is the northward velocity of the current in the inertial coordinate system, v c E is its eastward velocity, and w c E is its vertical velocity.
The expression of the current in the carrier coordinate system is given by
u c B v c B w c B = J v ( η r ) T u c E v c E w c E
where u c B is the northward velocity of the current in the carrier coordinate system, v c B is its eastward velocity, and w c B is its vertical velocity.
The relative velocity of the UUV with respect to the current is
v r e l a t i v e = u r v r w r = u u c B v v c B w w c B

3. UUV Controller Design

In this section, a multi-UUV formation controller is designed based on the leader–follower method, where the follower UUVs can dynamically adjust according to the state information of the leader UUV to maintain the preset formation. In addition, we consider that in practical applications, dynamics models of UUVs often have uncertainties, such as changes in hydrodynamic coefficients and uncertain parts of the models. Therefore, we also design a backstepping sliding mode controller [20] based on an RBF neural network to offset the effect of model uncertainty on UUV control.

3.1. Design of UUV Controller Based on Backstepping Sliding Mode

In practical applications, there is uncertainty in UUV models; the UUV model with uncertainty and current disturbance based on Equation (2) is
z ˙ 1 = z 2 z ˙ 2 + Δ = u + d c u r ( t )
where Δ is the model uncertainty constraint, and d c u r ( t ) is the current disturbance.
Assuming the position and attitude information of the leader z 1 L = η L , the information on the desired position and attitude of the ith follower is given as z 1 d F i = z 1 L + d i . Define the position and attitude tracking error as
e 1 i = z 1 d F i z 1 F i = z 1 L + d i z 1 F i
Define the Lyapunov function [21] as
V 1 = 1 2 e 1 i 2
The differentiation of V1 in Equation (8) can be obtained as follows:
V ˙ 1 = e 1 i e ˙ 1 i = e 1 i ( z ˙ 1 d F i z 2 F i ) = e 1 i ( z 2 L + d v i z 2 F i )
From Equation (9), when z 2 F i tends to z 2 L + d v i + c 1 i e 1 i ( c 1 i is a constant greater than 0), V ˙ 1 = c 1 i e 1 i 2 0 . When V ˙ 1 0 , e 1 i 0 , the closed-loop system is asymptotically stabilized.
Let z 2 d F i = z 2 L + d v i + c 1 i e 1 i . Then the next control objective is
e 2 i = z 2 d F i z 2 F i
Replace z 2 d F i in Equation (10) with z 2 L + d v i + c 1 i e 1 i , while e ˙ 1 i = z 2 L + d v i z 2 F i
e ˙ 1 i = e 2 i c 1 i e 1 i
Define the sliding mode as
σ i = k 1 i e 1 i + e 2 i
where k 1 i is a constant greater than 0. Then,
σ i = ( k 1 i + c 1 i ) e 1 i + e ˙ 1 i
Define the Lyapunov function as
V 2 = V 1 + 1 2 σ i 2
Derive V2 and substitute Equation (11) into it as follows:
V ˙ 2 = V ˙ 1 + σ i σ ˙ i = e 1 i ( e 2 i c 1 i e 1 i ) + σ i ( k 1 i e ˙ 1 i + e ˙ 2 i ) = e 1 i e 2 i c 1 i e 1 i 2 + σ i ( k 1 i ( e 2 i c 1 i e 1 i ) + z ˙ 2 d F i z ˙ 2 F i ) = e 1 i e 2 i c 1 e 1 2 + σ i ( k 1 i ( e 2 i c 1 i e 1 i ) + z ˙ 2 d F i u i d c u r i ( t ) + Δ i )
To ensure that the derivative of the Lyapunov function is negative definite, the control law is designed as follows:
u i = k 1 i ( e 2 i c 1 i e 1 i ) + z ˙ 2 d F i + η i sgn ( σ i ) + Δ i + e 1 i
where η i is a constant greater than 0, and sgn ( ) is the sign function.
Substituting Equation (16) into Equation (15), it can be determined that
V ˙ 2 = c 1 i e 1 i 2 k 1 i e 1 i 2 η i σ i σ i d c u r i ( t )
By taking the values of c 1 i , k 1 i , and η i , we can obtain η i D i and thus ensure that V ˙ 2 0 , so the controller designed in this section is stable.

3.2. Design of UUV Controller Based on RBF Neural Network

To address the uncertainty in the dynamics model, the nonlinear fitting ability of the RBF neural network is utilized to estimate and compensate for the model uncertainty term [22]. RBF neural networks typically consist of an input layer, a hidden layer, and an output layer. The input layer receives input signals and transmits them to the hidden layer. In this study, the hidden layer uses a Gaussian function as the activation function, which is expressed as follows [23]:
h j = exp ( x c j 2 2 b j 2 ) j = 1 , 2 , p
f ( x ) = W T h ( x ) + ε
where h = h j T is the output of the Gaussian function; p is the number of nodes in the hidden layer of the RBF neural network; W is the weight of the RBF neural network; and ε is the approximation error of the RBF network.
The input of RBF neural network is taken as x i = e 1 i e 2 i T , and the output of RBF neural network is
f ^ i ( x ) = W ^ i T h ( x i )
where W ^ i is the estimated weight of the RBF network.
Replacing the model uncertainty term Δ i in the controller with the output of the RBF neural network, the designed control law can be written as
u i = k 1 i ( e 2 i c 1 i e 1 i ) + z ˙ 2 d F i + η i sgn ( σ i ) + f ^ i ( x ) + e 1 i
Substitute the controller into Equation (15) as follows:
V ˙ 2 = c 1 i e 1 i 2 k 1 i e 1 i 2 η i σ i σ i d c u r i ( t ) + σ i ( f ^ i ( x ) + Δ i )
Define f ˜ i ( x ) as follows:
f ˜ i ( x ) = f i ( x ) f ^ i ( x ) = W i T h ( x i ) + ε i W ^ i T h ( x i ) = W ˜ i T h ( x i ) + ε i
where W ˜ i = W i W ^ i .
Define the Lyapunov function as
V 3 = V 1 + 1 2 σ i 2 + 1 2 γ i W ˜ i T W ˜ i
where γ i is a constant greater than 0.
Derive V3 as follows:
V ˙ 3 = V ˙ 1 + σ i σ ˙ i + γ i W ˜ i T W ˜ ˙ i = e 1 i e 2 i c 1 i e 1 i 2 + σ i ( k 1 i ( e 2 i c 1 i e 1 i ) + z ˙ 2 d F i u i d c u r i ( t ) + Δ i ) γ i W ˜ i T W ^ ˙ i
Substituting the control law into Equation (25) and Δ i = f i ( x ) , we can obtain
V ˙ 3 = c 1 i e 1 i 2 k 1 i e 1 i 2 + σ i ( η i sgn ( σ i ) d c u r i ( t ) f ^ i ( x ) + f i ( x ) ) γ i W ˜ i T W ^ ˙ i = c 1 i e 1 i 2 k 1 i e 1 i 2 + σ i ( η i sgn ( σ i ) d c u r i ( t ) + W ˜ i T h i ( x ) + ε i ) γ i W ˜ i T W ^ ˙ i = c 1 i e 1 i 2 k 1 i e 1 i 2 + σ i ( η i sgn ( σ i ) d c u r i ( t ) + ε i ) + W ˜ i T ( σ i h i ( x ) γ i W ^ ˙ i )
The design of the adaptive law is
W ^ ˙ i = h i ( x ) σ i γ i
Then,
V ˙ 3 = c 1 i e 1 i 2 k 1 i e 1 i 2 η i σ i ( d c u r i ( t ) ε i ) σ i
Since the approximation error ε i can be limited a small-enough value, V ˙ 3 0 can be determined by taking η i ( d c u r i ( t ) ε i ) .
Assume that η 0 i > 0 , η i η 0 i + d c u r i ( t ) ε i ; then,
V ˙ 3 c 1 i e 1 i 2 k 1 i e 1 i 2 η 0 i σ i 0
Since V 3 0 , V ˙ 3 0 , σ i , and W ˜ i are bounded. When the time tends to infinity, σ i tends to 0, which makes e 1 i and e 2 i tend to 0. Therefore, the designed controller is stable and convergent.

3.3. Design of UUV Formation Controller-Based Leader–Follower Method

The leader–follower method designates one UUV as the leader, responsible for trajectory planning and transmitting navigation states (position, attitude, and velocity) to follower vehicles within the formation [24]. Followers execute trajectory-tracking control based on received data to maintain preset relative positions, thereby achieving cooperative formation control. A schematic diagram of the leader–follower method is shown in Figure 2.
In this section, we implement a leader-based UUV triangular formation, which requires the ability to maintain the formation in 3D space, as shown in Figure 3.
Here, L is the distance between the leader and the follower, and β is the attitude angle of the formation.
Taking the spiral dive motion as an example, the constraint method designed in this study is as follows:
η L + d 1 = η F 1 η L + d 2 = η F 2 η ˙ L + d v 1 = η ˙ F 1 η ˙ L + d v 2 = η ˙ F 2
where η L is the position and attitude information of the leader, η ˙ L is the velocity information of the leader in the inertial coordinate system, η F 1 and η F 2 are the position and attitude information of the two followers, η ˙ F 1 and η ˙ F 2 are the velocity information of the two followers in the inertial coordinate system, d 1 and d 2 are the corresponding attitude offsets, and d v 1 and d v 2 are the corresponding velocity offsets in the inertial coordinate system. The expressions of d 1 , d 2 , d v 1 , and d v 2 are as follows:
d 1 = L cos ( ψ β ) L sin ( ψ β ) 0 0 arctan ( L cos β , R L sin β ) d 2 = L cos ( ψ + β ) L sin ( ψ + β ) 0 0 arctan ( L cos β , R + L sin β ) d v 1 = L sin ( ψ β ) ψ ˙ L cos ( ψ β ) ψ ˙ 0 0 0 d v 2 = L sin ( ψ + β ) ψ ˙ L cos ( ψ + β ) ψ ˙ 0 0 0
where R is the radius of gyration of the leader in a spiral dive motion, and ψ ˙ is the yaw angular velocity of the UUV in the inertial coordinate system. Due to the difference in the radius of gyration, the expected yaw angles of the leader and the follower are not the same when the UUV formation is in a spiral dive motion with a fixed deviation.

4. Design of GRU-KF-Based Formation Controller with Communication Packet Loss

4.1. Analysis of Formation Control with Communication Packet Loss

In UUV formation control, instability in hydroacoustic communication links can cause communication packet loss—failure to transmit the leader’s state information to followers. The main reasons for communication packet loss include the attenuation of underwater acoustic signals and environmental interference. When communication packet loss occurs, followers lack access to the leader’s state data for extended periods, resulting in accumulated trajectory prediction errors that ultimately compromise formation integrity.
The main impacts of communication packet loss are as follows: First, communication packet loss renders the follower unable to obtain the state information of the leader in real time, which leads to the error accumulation and thus affects the trajectory-tracking accuracy. Second, the error accumulation may not significantly affect the formation stability in a short period of time, but in the case of high packet loss, the follower may gradually deviate from the expected trajectory due to the lack of state feedback error correction. Finally, in extreme cases, if the packet loss rate is too high and the follower cannot update the state information of the leader for a long time, it may lead to obvious changes in the formation, and even the risk of follower collision, thus affecting the stability of the whole formation system.

4.2. GRU-KF-Based Formation Predictive Control

In this section, a GRU-KF-based formation predictive control method is designed for the problem of formation control with communication packet loss by combining the modeling capability of the GRU network with Kalman filter (KF) estimation for state optimization.

4.2.1. Formation Predictive Control Framework

In practical applications, the loss of UUV communication packets is a key factor affecting formation stability. With the aim of addressing this problem, a GRU-KF-based formation-control method is proposed; this method utilizes the GRU network to predict the time series of the leader’s trajectory, which is used to make up for the effect of communication packet loss on formation control. Then, Kalman filtering is introduced, and the predicted value of the GRU network is used as a state measurement value in the Kalman filter algorithm, which is used to update the leader’s state in real-time, in order to improve the UUV formation’s adaptability to communication packet loss. The overall control framework of the proposed method is shown in Figure 4.

4.2.2. GRU Neural Network

A GRU neural network is an improved recurrent neural network (RNN). A GRU is used to predict the future state of the leader UUV in this design. It is a variant of an RNN with a simpler structure and higher computational efficiency, and it also effectively mitigates the problem of gradient vanishing in the RNN. The internal structure of the GRU network is shown in Figure 5.
The process of the GRU network is as follows:
(1) The reset gate r t receives the state information h t 1 output by the network from the leader UUV at a time of t 1 and the state information x t input by the network at a time of t , and calculates the degree of forgetfulness of the network for the information at a time of t 1 ; the calculation formula is
r t = σ W r h t 1 , x t
where σ is the Sigmoid function, which is the activation function of the network, and W r is the weight matrix of the reset gate r t .
(2) The update gate z t receives the same state information as the reset gate r t and calculates the degree of information inflow from moment r t to moment t ; the formula is
z t = σ W z h t 1 , x t
where W z is the weight matrix of the update gate z t .
(3) To obtain the t moment activation state h ˜ t , we combine the result of the reset gate r t acting on the t 1 moment h t 1 with that of the t moment network input x t , and then introduce these results into the weight matrix W h multiplication, and obtain the final result using the tanh activation function. The formula for calculating the momentary activation state h ˜ t is as follows:
h ˜ t = tanh W h r t h t 1 , x t
where W h is the weight matrix of the activation state h ˜ t .
(4) We add the information retained by the network at moment t 1 with the activation state after the selection of the update gate z t at moment t ; then, the output of the network at moment t can be obtained, and is denoted as h t t . The output of moment h t is calculated using the formula
h t = 1 z t h t 1 + h ˜ t z t
(5) The predicted state of the next moment of the output layer of the GRU network is calculated using the formula
y = σ W y h t
where W y is the weight matrix of the output layer, and y is the output of the GRU network.

4.2.3. Kalman Filter

A Kalman filter (KF) is a recursive algorithm for estimating the state of a dynamic system and is used in this design to fuse the predicted and observed system state data to achieve optimal estimation of the system state; the flow of this algorithm is shown in Figure 6.
Assume that the state space equation of the leader UUV is
x k = A x k 1 + B u k 1 + w k 1
where x k is the state vector at moment k ; A is the system state transfer matrix; B is the system input matrix; u k 1 is the system control input vector; and w k 1 is the process noise.
Suppose the observation equation of the system is
z k = H x k + v k
where z k is the observation vector at the time k ; H is the observation matrix; v k is the measurement noise, which is assumed to be zero-mean Gaussian white noise; and the covariance matrix is R .
v N ( 0 , R )
In the prediction stage, Kalman filtering utilizes the system model to make a priori estimate of the next moment x ^ k | k 1 :
x ^ k | k 1 = A x ^ k 1 | k 1 + B u k 1
where x ^ k | k 1 is the state prediction at moment k , also known as the a priori estimate, and x ^ k 1 | k 1 is the state estimate at moment k 1 .
The a priori estimate x ^ k | k 1 lacks the influence of the process noise w k 1 , unlike the state variable x k , and it is also calculated by replacing the true value of the previous moment with the a posteriori estimate x ^ k 1 | k 1 at the previous moment.
At the same time, Kalman filtering also uses the a posteriori error covariance matrix P k 1 | k 1 of the previous moment and the process noise covariance matrix array Q to calculate the a priori error covariance matrix P k | k 1 .
P k | k 1 = A P k 1 | k 1 A T + Q
where P k 1 | k 1 is the a posteriori error covariance matrix at k 1 , and Q is the process noise covariance matrix.
We calculate the Kalman gain K k as follows:
K k = P k | k 1 H T H P k | k 1 H T + R 1
where R is the covariance matrix of the measurement noise.
Next, we update the a posteriori estimate x ^ k | k based on the a priori estimate x ^ k | k 1 , the measured value z k , and the Kalman gain K k .
x ^ k | k = x ^ k | k 1 + K k z k H x ^ k | k 1
The a posteriori estimate x ^ k | k is the optimal estimate output by the Kalman filtering algorithm after fusing the a priori estimate x ^ k | k 1 and the measurement value z k .

4.2.4. Formation Prediction Model Based on GRU-KF

The GRU-KF-based formation prediction model is primarily composed of a GRU network prediction module and a Kalman filter module, as shown in Figure 7.
The equation updating the state of the leader UUV can be expressed as
x k = η k η ˙ k = I 5 I 5 t 0 O 5 I 5 η k 1 η ˙ k 1 + w k 1
where η k = x k y k z k θ k ψ k T represents the position and attitude of the leader UUV in the inertial coordinate system; η ˙ k = x ˙ k y ˙ k z ˙ k θ ˙ k ψ ˙ k T represents the velocity and angular velocity of the leader UUV in the inertial coordinate system; I 5 is the five-dimensional unit matrix; O 5 is the five-dimensional zero matrix; and t 0 is the control period of the leader UUV’s control system.
The transformation relationship between η ˙ k and the velocity of the UUV in the carrier coordinate system is
η ˙ k = x ˙ k y ˙ k z ˙ k θ ˙ k ψ ˙ k = cos θ cos ψ sin ψ sin θ cos ψ 0 0 cos θ sin ψ cos ψ sin θ sin ψ 0 0 sin θ 0 cos θ 0 0 0 0 0 1 0 0 0 0 0 1 / cos θ u v w q r
The system model is established as follows:
z k = η k η k v = I 5 O 5 O 5 J ( η ) η k η ˙ k + v k
where η k v = u v w q r T is the velocity of the leader UUV in the carrier coordinate system, and J ( η ) is the transformation matrix from the carrier coordinate system to the inertial coordinate system of the leader UUV.

5. Simulation

To verify the effectiveness of the GRU-KF-based UUV formation-control strategy proposed in this paper, we conduct simulation verification of UUV formation control based on the leader–follower method and in a communication packet loss environment.

5.1. Simulation and Analysis of UUV Formation Control Based on Leader–Follower Method

This section focuses on the simulation verification of the UUV formation controller based on the leader–follower method and analyzes the formation-control effect. In the simulation experiment, the desired trajectory of the leader UUV1 is
x d = 250 cos ( 2 π t / 1000 ) y d = 250 sin ( 2 π t / 1000 ) 0 t 1500 z d = 0.015 t 5
Initial state of UUVs:
Leader UUV1: northward coordinate x 1 = 249.5   m , eastward coordinate y 1 = 1   m , vertical coordinate z 1 = 3   m , yaw angle ψ 1 = π / 2   rad , longitudinal velocity u 1 = 1   m / s , and remaining initial states have value of 0.
Follower UUV2: northward coordinate x 2 = 205.5   m , eastward coordinate y 2 = 49   m , vertical coordinate z 2 = 3.5   m , yaw angle ψ 2 = π / 2   rad , longitudinal velocity u 2 = 1   m / s , and remaining initial states have value of 0.
Follower UUV3: northward coordinate x 3 = 298.5   m , eastward coordinate y 3 = 50.1   m , vertical coordinate z 3 = 4.5   m , yaw angle ψ 3 = π / 2   rad , longitudinal velocity u 3 = 1   m / s , and remaining initial states have value of 0.
Formation parameters: L = 50 2 and β = π / 4 .
The uncertainty term of the UUV model is
Δ 1 = 0.5 ( sin ( 0.3 t × 0.1 ) + cos ( 0.5 t × 0.1 ) ) Δ 2 = 0.5 ( sin ( 0.3 t × 0.1 ) + cos ( 0.05 t × 0.1 ) ) Δ 3 = 0.5 ( sin ( 0.02 t × 0.1 ) + cos ( 0.4 t × 0.1 ) ) 0 t 1500 Δ 4 = 0.2 ( sin ( 0.03 t × 0.1 ) + cos ( 0.04 t × 0.1 ) ) Δ 5 = 0.2 ( sin ( 0.03 t × 0.1 ) + cos ( 0.4 t × 0.1 ) )
The parameters of the backstepping sliding mode controller based on the RBF neural network are c 1 = 1.5 , k 1 = 1.2 , and η = 0.3 . The initial weights of the RBF network are all 0.02, and the adaptive parameters are c i = 10 3 1 0.5 0.1 0 0.1 0.5 1 3 10 , γ = 50 , and b j = 6 . The parameter K = 0.8 , the current interference is v c u r r e n t 1 E = 0.2 0 0   m / s , and the simulation time is 1500 s.
Figure 8a–e show the position and attitude errors of the UUV formation. At 46 s, the formation gradually stabilizes. The maximum absolute errors for all three axes are less than 0.1 m, the longitudinal tilt steady-state error is less than 0.001 rad, and the maximum absolute error for the heading angle is less than 0.03 rad. Even under the influence of the current disturbance and model uncertainty constraints, the preset UUV formation can still be formed and maintained, and both the position and the attitude errors are limited and kept within a small range. Figure 8f–j show the velocity and angular velocity errors of the UUV formation. At 36 s, the angular velocity error converges, and the maximum absolute error of the three-axis speed is less than 0.05 m/s. The velocity and angular velocity of the UUV formation are still able to track the upper desired value, even under the influence of the current interference and model uncertainty constraints, and the errors are all limited and kept within a small range, which validates the effectiveness of the formation-control strategy.
Figure 9 shows a two-dimensional trajectory diagram of the UUV formation, and Figure 10 shows a three-dimensional trajectory diagram of the UUV formation. Even under the influence of current interference and model uncertainty, the preset UUV formation is able to be maintained, and the spiral dive motion in 3D space can be completed, further verifying the effectiveness of the formation-control strategy presented in this section.

5.2. Simulation and Analysis with Communication Packet Loss

To analyze the effect of communication packet loss on UUV formation control, we consider simulation verification with a 10% packet loss rate and set the leader to broadcast its own state information every 4 s. The other condition settings of the simulation are kept consistent with those described in Section 5.1. The key parameters for the GRU network training process are as follows: the maximum number of training rounds is set to 5, and the gradient clipping threshold is set to 1. To optimize the learning rate adjustment strategy, the initial learning rate is set to 0.001, and a piecewise learning rate decay mode is adopted, where the learning rate is reduced by 90% every two training rounds; i.e., the decay factor is 0.1. Additionally, to improve computational efficiency, mini-batch training is employed during the training process, with the batch size set to 16. The simulation results are as follows:
Figure 11a–e compare the follower’s position and attitude errors after the prediction compensation based on the GRU-KF network prediction model with a 10% packet loss rate. It can be seen that when relying solely on the GRU network for compensation, because of communication packet loss, more obvious jitter occurred in several of the follower’s state variables at the time step where communication packet loss occurred, and the maximum error reached 13.8 m. This kind of obvious jitter is not allowed in application scenarios that require high control accuracy, as it may have a large impact on the control accuracy of the system. However, with GRU-KF algorithm compensation, the formation stabilized after 28 s, with the maximum absolute error of the three axes reaching less than 0.4 m. The pitch angle and heading angle error converged after 42 s, with the maximum angle error not exceeding 0.02 rad.
Figure 11f–j compare the follower’s velocity and angular velocity errors after prediction compensation based on the GRU-KF network prediction model with a 10% packet loss rate. Similarly to the follower’s position error and attitude error, there is sudden jitter in the error curve at the time step where communication packet loss occurs when only the GRU network is used for compensation. When the GRU-KF network prediction model is used for compensation, the absolute values of the maximum errors in both longitudinal and transverse velocities are both less than 0.3 m/s. The error in vertical velocity converges at 37 s, with the maximum error not exceeding 0.01 m/s. The error in angular velocity converges at 39 s, with the maximum errors all being less than 0.01 rad/s.
It can be concluded that with the constraint of a 10% packet loss rate, the GRU-KF network prediction model can effectively compensate for the problem of data loss caused by communication packet loss, and the error curves of the follower’s position, attitude, velocity, and angular velocity are smoother. In addition, error fluctuation is significantly reduced, preventing the generation of sudden errors and improving the stability of formation control.

6. Conclusions

In this study, the problem of formation control in UUVs with the constraint of packet loss is investigated, and a predictive control method based on the GRU-KF network prediction model is proposed to improve the control accuracy and stability of the system in complex environments. To address these problems, a GRU-KF-based formation predictive control strategy is constructed, where the GRU network is used for short-term prediction of the leader trajectory, while the KF is used to combine the GRU prediction results with the a priori estimation of the system state equations to compensate for the uncertainty caused by communication packet loss and positioning errors. Finally, the effectiveness of the GRU-KF network prediction model is verified through simulation experiments. With communication packet loss, the GRU-KF network prediction model can effectively prevent the generation of sudden errors.
In practical applications, the problem of collisions between UUVs cannot be ignored. A collision avoidance algorithm is not proposed in this study; however, future research should endeavor to develop an effective solution to this problem.

Author Contributions

Conceptualization, J.L. and H.Z.; Software, R.L. and Z.T.; Writing—original draft, Z.T.; Writing—review & editing, R.L. All authors have read and agreed to the published version of the manuscript.

Funding

This search work was funded by the National Natural Science Foundation of China (Grant No. 5217110503), the Research Fund from Science and Technology on Underwater Vehicle Technology (grant No. JCKYS2021SXJQR-09), and the Natural Science Foundation of Shandong Province (grant No. ZR202103070036).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, H.; Cui, S.; Wang, X.; Yu, B.; Wang, G.; Cao, C.; Shen, Y.; Zeng, Q. Development of a bionic jellyfish robot for collecting polymetallic nodules. Ocean. Eng. 2025, 324, 120655. [Google Scholar] [CrossRef]
  2. Subin Raj, V.B.; Sarun Lal, M.A.; Gowri, R.S.; Ramesh, S.; Ramesh, N.R.; Ramesh, R.; Deepak, V.; Vadivelan, A.; Ramadass, G.A. Assessment of polymetallic nodules in the Central Indian Ocean Basin: Factors influencing distribution patterns through high-resolution autonomous underwater vehicle (AUV) survey. Geo-Mar. Lett. 2025, 45, 7. [Google Scholar] [CrossRef]
  3. Zhu, H.; Chen, J.; Lin, Y.; Zhou, P.; Wang, K.; Lin, P.; Peng, X.; Li, H.; Guo, J.; Ren, X.; et al. The application of structured light for external subsea pipeline inspection based on the underwater dry cabin. Appl. Ocean. Res. 2025, 155, 104431. [Google Scholar] [CrossRef]
  4. Sang, I.-C.; Norris, W.R. An Adaptive Image Thresholding Algorithm Using Fuzzy Logic for Autonomous Underwater Vehicle Navigation. IEEE J. Sel. Top. Signal Process. 2024, 18, 358–367. [Google Scholar] [CrossRef]
  5. Patel, S.; Abdellatif, F.; Alsheikh, M.; Trigui, H.; Outa, A.; Amer, A.; Sarraj, M.; Al Brahim, A.; Alnumay, Y.; Felemban, A.; et al. Multi-robot system for inspection of underwater pipelines in shallow waters. Int. J. Intell. Robot. Appl. 2024, 8, 14–38. [Google Scholar] [CrossRef]
  6. Feng, W.; Ma, Y.; Li, H.; Liu, H.; Meng, X.; Zhou, M. Optimal search path planning of UUV in battlefeld ambush scene. Def. Technol. 2024, 32, 541–552. [Google Scholar] [CrossRef]
  7. sub Song, K.; Chu, P.C. Conceptual design of future undersea unmanned vehicle (UUV) system for mine disposal. IEEE Syst. J. 2012, 8, 43–51. [Google Scholar] [CrossRef]
  8. Yan, Z.; Yan, J.; Cai, S.; Yu, Y.; Wang, Y.; Hou, S. Distributed TMPC formation trajectory tracking of multiple underwater unmanned vehicles with uncertainties and external perturbations. Ocean Eng. 2024, 298, 117160. [Google Scholar] [CrossRef]
  9. Yan, Z.; Yang, Z.; Pan, X.; Zhou, J.; Wu, D. Virtual leader based path tracking control for multi-UUV considering sampled-data delays and packet losses. Ocean Eng. 2020, 216, 108065. [Google Scholar] [CrossRef]
  10. Chen, Y.; Guo, X.; Luo, G.; Liu, G. A Formation Control Method for AUV Group Under Communication Delay. Front. Bioeng. Biotechnol. 2022, 10, 848641. [Google Scholar] [CrossRef] [PubMed]
  11. Li, J.; Tian, Z.; Zhang, H. Discrete-time AUV formation control with leader-following consensus under time-varying delays. Ocean Eng. 2023, 286, 115678. [Google Scholar] [CrossRef]
  12. Jiang, X.; Xia, G. Nonfragile formation control of leaderless unmanned surface vehicles with memory sampling data and packet loss. IEEE Syst. J. 2022, 17, 3026–3035. [Google Scholar] [CrossRef]
  13. Kuniyoshi, S.; Oshiro, S.; Saotome, R.; Wada, T. An Iterative Orthogonal Frequency Division Multiplexing Receiver with Sequential Inter-Carrier Interference Canceling Modified Delay and Doppler Profiler for an Underwater Multipath Channel. Mar. Sci. Eng. 2024, 12, 1712. [Google Scholar] [CrossRef]
  14. Yan, Z.P.; Pan, X.L.; Yang, Z.W.; Yue, L.D. Formation Control of Leader-Following Multi-UUVs With Uncertain Factors and Time-Varying Delays. IEEE Access 2019, 7, 118792–118805. [Google Scholar] [CrossRef]
  15. Saleem, K.; Wang, L.; Bharany, S. Survey of AI-driven routing protocols in underwater acoustic networks for enhanced communication efficiency. Ocean Eng. 2024, 314, 119606. [Google Scholar] [CrossRef]
  16. Yan, Z.; Yan, J. Self-triggered formation control of asynchronous multiple underwater unmanned vehicle under communication delay. Unmanned Syst. 2024. [Google Scholar] [CrossRef]
  17. Cai, W.; Ju, Y.; Liu, Z.; Zhang, M.; Lv, S. Event-triggered leader-follower communication based formation control with dual-predictor for multiple AUVs. Ocean Eng. 2025, 334, 121552. [Google Scholar] [CrossRef]
  18. Tang, C.; Shi, H.; Zhang, L. Trajectory planning aided unmanned surface vehicle optimization communication method with hierarchical reinforcement learning. Ocean Eng. 2024, 307, 118225. [Google Scholar] [CrossRef]
  19. Liu, Y. Research on Coordinated Control Methods for Ocean Surveying by Underwater Vehicle Fleets. Ph.D. Dissertation, Harbin Engineering University, Harbin, China, 2017. [Google Scholar]
  20. Nettari, Y.; Kurt, S.; Labbadi, M. Adaptive Robust Control based on Backstepping Sliding Mode techniques for Quadrotor UAV under external disturbances. IFAC-PapersOnLine 2022, 55, 252–257. [Google Scholar] [CrossRef]
  21. Du, P.; Yang, W.; Wang, Y.; Hu, R.; Chen, Y.; Huang, S. A novel adaptive backstepping sliding mode control for a lightweight autonomous underwater vehicle with input saturation. Ocean Eng. 2022, 263, 112362. [Google Scholar] [CrossRef]
  22. Yan, Z.; Jiang, A.; Lai, C.; Li, H. Velocity-Free Formation Control and Collision Avoidance for UUVs via RBF: A High-Gain Approach. Electronics 2022, 11, 1170. [Google Scholar] [CrossRef]
  23. Wang, H.; Ren, J.; Han, M.; Wang, Z.; Zhang, K.; Wang, X. Robust adaptive three-dimensional trajectory tracking control for unmanned underwater vehicles with disturbances and uncertain dynamics. Ocean Eng. 2023, 289, 116184. [Google Scholar] [CrossRef]
  24. Huang, C.; Xu, H.; Batista, P.; Zhang, X.; Soares, C.G. Fixed-time leader-follower formation control of underactuated unmanned surface vehicles with unknown dynamics and ocean disturbances. Eur. J. Control 2023, 70, 100784. [Google Scholar] [CrossRef]
Figure 1. Coordinate system.
Figure 1. Coordinate system.
Jmse 13 01696 g001
Figure 2. The leader–follower method.
Figure 2. The leader–follower method.
Jmse 13 01696 g002
Figure 3. Leader–follower formation.
Figure 3. Leader–follower formation.
Jmse 13 01696 g003
Figure 4. Formation predictive control framework.
Figure 4. Formation predictive control framework.
Jmse 13 01696 g004
Figure 5. Internal structure of GRU network.
Figure 5. Internal structure of GRU network.
Jmse 13 01696 g005
Figure 6. Kalman filtering algorithm.
Figure 6. Kalman filtering algorithm.
Jmse 13 01696 g006
Figure 7. Formation prediction model based on GRU-KF.
Figure 7. Formation prediction model based on GRU-KF.
Jmse 13 01696 g007
Figure 8. (a) North direction error; (b) east direction error; (c) vertical error; (d) pitch angle error; (e) yaw angle error; (f) longitudinal speed error; (g) lateral velocity error; (h) vertical velocity error; (i) pitch angular velocity error; (j) yaw angular velocity error.
Figure 8. (a) North direction error; (b) east direction error; (c) vertical error; (d) pitch angle error; (e) yaw angle error; (f) longitudinal speed error; (g) lateral velocity error; (h) vertical velocity error; (i) pitch angular velocity error; (j) yaw angular velocity error.
Jmse 13 01696 g008aJmse 13 01696 g008b
Figure 9. Two-dimensional trajectory of UUV formation.
Figure 9. Two-dimensional trajectory of UUV formation.
Jmse 13 01696 g009
Figure 10. Three-dimensional trajectory of UUV formation.
Figure 10. Three-dimensional trajectory of UUV formation.
Jmse 13 01696 g010
Figure 11. (a) North direction error; (b) east direction error; (c) vertical error; (d) pitch angle error; (e) yaw angle error; (f) longitudinal speed error; (g) lateral velocity error; (h) vertical velocity error; (i) pitch angular velocity error; (j) yaw angular velocity error.
Figure 11. (a) North direction error; (b) east direction error; (c) vertical error; (d) pitch angle error; (e) yaw angle error; (f) longitudinal speed error; (g) lateral velocity error; (h) vertical velocity error; (i) pitch angular velocity error; (j) yaw angular velocity error.
Jmse 13 01696 g011aJmse 13 01696 g011b
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, J.; Luo, R.; Zhang, H.; Tian, Z. Formation Control of Multiple UUVs Based on GRU-KF with Communication Packet Loss. J. Mar. Sci. Eng. 2025, 13, 1696. https://doi.org/10.3390/jmse13091696

AMA Style

Li J, Luo R, Zhang H, Tian Z. Formation Control of Multiple UUVs Based on GRU-KF with Communication Packet Loss. Journal of Marine Science and Engineering. 2025; 13(9):1696. https://doi.org/10.3390/jmse13091696

Chicago/Turabian Style

Li, Juan, Rui Luo, Honghan Zhang, and Zhenyang Tian. 2025. "Formation Control of Multiple UUVs Based on GRU-KF with Communication Packet Loss" Journal of Marine Science and Engineering 13, no. 9: 1696. https://doi.org/10.3390/jmse13091696

APA Style

Li, J., Luo, R., Zhang, H., & Tian, Z. (2025). Formation Control of Multiple UUVs Based on GRU-KF with Communication Packet Loss. Journal of Marine Science and Engineering, 13(9), 1696. https://doi.org/10.3390/jmse13091696

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop