Article

Prescribed Performance-Based Formation Control for Multiple Autonomous Underwater Helicopters with Complex Dynamic Characteristics

1 Donghai Laboratory, Zhoushan 316021, China
2 Ocean College, Zhejiang University, Zhoushan 316021, China
3 Laboratory for Marine Geology, Qingdao Marine Science and Technology Center, Qingdao 266061, China
4 Hainan Institute of Zhejiang University, Sanya 572025, China
* Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2024, 12(12), 2246; https://doi.org/10.3390/jmse12122246
Submission received: 15 November 2024 / Revised: 30 November 2024 / Accepted: 2 December 2024 / Published: 6 December 2024
(This article belongs to the Special Issue Unmanned Marine Vehicles: Perception, Planning, Control and Swarm)

Abstract

This research addresses the challenge of formation control among multiple homogeneous autonomous underwater helicopters (AUHs) in the presence of external disturbances and complex dynamic characteristics. The study introduces a novel approach by integrating both disturbance and state observers within the control law framework to manage external disturbances and the immeasurability of velocity, respectively. Concurrently, localized radial basis function neural networks (RBFNNs) of identical configurations are incorporated into the formation control law to assimilate model uncertainties. Building upon this integration, an experience-based formation control strategy is developed, leveraging accumulated knowledge to diminish computational demands while maintaining stipulated performance criteria. Furthermore, the incorporation of a finite-time prescribed performance control (FTPPC) technique enhances the learning process’s efficiency by expediting convergence. Numerical simulations are presented to validate the efficacy of the proposed methodology.

1. Introduction

Underwater exploration plays a vital role in advancing our understanding of marine ecosystems, resource management, and climate regulation [1]. Over the past few decades, humans have sought to explore the vast “universe under oceans” to uncover and utilize hidden resources for the benefit of mankind. Despite these efforts, only 5% of the world’s oceans have been explored to date, leaving an immense frontier of unknowns [2]. Moreover, conducting underwater exploration poses significant challenges, including harsh environmental conditions, limited visibility, and high-pressure zones, which can impede data collection and system functionality. The development of robust and efficient underwater vehicles, such as autonomous underwater vehicles (AUVs), gliders, and remotely operated systems, has been instrumental in overcoming these challenges [3,4].
Among these technologies, AUVs have attracted significant attention, particularly in formation control, which is critical for cooperative and efficient underwater missions. Researchers have applied various methods to achieve cooperative formation of AUVs, including leader–follower, virtual structure, and behavior-based methods [5,6,7,8]. Compared to conventional AUVs, the autonomous underwater helicopter (AUH), shown in Figure 1, provides more flexible steering, hovering, and landing thanks to its special shape and propulsion arrangement, making it more suitable for underwater formation operations [9,10,11].
The complexity of the underwater environment, including unmeasurable velocities, uncertain model dynamics, and external disturbances, poses significant challenges for designing control laws for AUVs. Much work has been carried out to address these issues. An AUV velocity estimation method derived from the Nussbaum state observer was investigated to tackle the problem of unmeasurable velocity [12]. Li et al. [13] presented a finite-time sliding-mode disturbance observer to accurately estimate unknown disturbances in finite time. For disturbances, Gao et al. [14] and Li et al. [15] combined external disturbances with model uncertainties into lumped disturbances and introduced a disturbance observer to estimate them in finite time. In [16,17], extended state observers were designed to estimate velocity and disturbances simultaneously. Neural network (NN) methods have been developed to deal with model uncertainties owing to their excellent generalization ability. RBFNNs were used to approximate nonlinear uncertainties in [18,19]. An estimation method derived from a neural-based robust controller was used to approximate the lumped uncertainty composed of uncertain dynamics, disturbances, and approximation error [20]. Building upon coordinate transformation techniques and a hybrid linear–nonlinear differentiator to estimate the derivative of surge displacement, Zhang et al. [21] introduced an output-feedback adaptive backstepping control strategy tailored for AUVs operating without direct velocity measurements.
The NN methods mentioned above are based on purely online adaptive neural networks, which means that the knowledge-storing capability of the NN is not exploited. Therefore, the adaptive process has to be repeated even when AUVs face similar model uncertainties, which increases the computational burden. Wang et al. [22] proposed the deterministic learning theory to address this problem, which fully utilizes the spatially localized learning capability of the localized RBFNN. Based on this, a distributed learning strategy composed of a distributed observer and a learning controller was proposed in [23]. A cooperative learning controller was presented in [24], which learns the model uncertainties cooperatively and uses the acquired knowledge to tackle them. In [25], reinforcement learning (RL) is employed to handle partially observable systems by compensating for the lack of state measurements; the algorithm uses a finite history of input–output data to approximate the system dynamics and optimize control actions. In [26], an RL algorithm is utilized to achieve optimal low-level control for individual agents within a multi-agent formation; it transforms the time-varying system into an equivalent autonomous system, ensuring stability and optimality of the control strategy.
It is important to note that AUHs start to learn only after entering the steady state. Therefore, the error convergence rate directly affects the learning speed, which places demands on the transient performance. Prescribed performance control constrains the transient convergence of the tracking errors by applying an error transformation based on a performance function. In [27,28,29,30], prescribed performance control was proposed to achieve faster convergence, realized through an exponentially decaying performance function. The traditional method was improved by introducing a finite-time performance function to ensure that the tracking error enters a compact set within a specified time [31,32,33].
This study focuses on the formation control problem for homogeneous AUHs with external disturbance and model uncertainty. A cooperative formation event-triggered controller is established based on prescribed performance. Multiple observers estimate the unmeasurable high-order states and external disturbances separately. Finally, the dynamic uncertainties are learned cooperatively, and the learned knowledge is used to establish an empirically based controller.
To contextualize the advancements and challenges in formation control techniques, we conducted a comprehensive review of recent related works. Table 1 provides a detailed comparison of these studies, focusing on key aspects such as the dynamic models considered, robustness to disturbances, the integration of learning mechanisms, and performance metrics. This comparison highlights the strengths and limitations of existing approaches, serving as a foundation to underscore the unique contributions of the proposed method in addressing current gaps in the field.
The main contributions of this work are summarized as follows. Hybrid observer design: In contrast to the observers proposed in [14,15], which estimate the lumped uncertainties composed of model uncertainty and disturbance, and the extended state observer presented in [17], which estimates the higher-order states and disturbances simultaneously, an NN-based Luenberger observer and a disturbance observer are used to estimate the high-order states and the disturbance separately, which achieves higher estimation accuracy.
Experience-based control: Compared to the online adaptive neural networks used in [18,19], this study exploits the learning ability of the localized RBFNN, which enables the AUH to learn the dynamic uncertainties while moving along the periodic reference trajectory. Based on this, an experience-based control law is designed using the acquired knowledge, which means that similar dynamic uncertainties are dealt with by experience rather than by reusing the adaptive method, thus reducing the computational burden.
Finite-time prescribed performance control: Compared to the current results [23,24,26,34], the finite-time prescribed performance control (FTPPC) is proposed to guarantee that the tracking error converges to a compact set in a finite time, which accelerates the learning process.
The rest of this work is organized as follows. Section 2 contains the preliminaries and control objectives. Section 3 investigates the cooperative formation controller. Section 4 establishes an experience-based cooperative formation controller. Section 5 shows the effectiveness of the strategy through simulation experiments. Finally, Section 6 presents the conclusions.

2. Preliminaries and Problem Formulation

2.1. AUHs Dynamics

This study examines the control problem for a set of N AUHs. The dynamic model of the AUH is based on references [19,35], and its specific expressions are as follows:
$$\dot{\eta}_i = J(\eta_i)\nu_i$$
$$M\dot{\nu}_i + C(\nu_i)\nu_i + D(\nu_i)\nu_i + \Delta(\eta_i,\nu_i) + g = \tau_{d,i}(t) + \tau_i$$
where $i \in \mathcal{N} = \{1,\ldots,N\}$ denotes the $i$-th AUH. $M$ denotes the inertia matrix. $\eta_i = [x_i, y_i, z_i, \phi_i, \theta_i, \psi_i]^T$ denotes the position and orientation of the AUH in the earth-fixed frame. $\nu_i = [u_i, v_i, w_i, p_i, q_i, r_i]^T$ represents the linear and angular velocities of the AUH in the body-fixed frame. $C(\nu_i)$ represents the uncertain Coriolis and centripetal force matrix. $D(\nu_i)$ denotes the uncertain hydrodynamic damping matrix. $\Delta(\eta_i,\nu_i)$ represents the unmodeled dynamics. $g$ is the vector of gravity and buoyancy forces and moments. $\tau_{d,i}(t)$ and $\tau_i$ represent the external disturbances and the control inputs, respectively.
The rotation matrix J ( η i ) between the earth-fixed frame and body-fixed frame can be defined as:
$$J(\eta_i) = \begin{bmatrix} J_a & 0_{3\times 3} \\ 0_{3\times 3} & J_b \end{bmatrix}$$
with
$$J_a = \begin{bmatrix} \cos\psi\cos\theta & \cos\psi\sin\theta\sin\phi - \sin\psi\cos\phi & \sin\psi\sin\phi + \cos\psi\sin\theta\cos\phi \\ \sin\psi\cos\theta & \sin\psi\sin\theta\sin\phi + \cos\psi\cos\phi & \sin\psi\sin\theta\cos\phi - \cos\psi\sin\phi \\ -\sin\theta & \cos\theta\sin\phi & \cos\theta\cos\phi \end{bmatrix}$$
and
$$J_b = \begin{bmatrix} 1 & \sin\phi\tan\theta & \cos\phi\tan\theta \\ 0 & \cos\phi & -\sin\phi \\ 0 & \sin\phi/\cos\theta & \cos\phi/\cos\theta \end{bmatrix}$$
According to Fossen’s handbook [35], $J_b$ exhibits a representation singularity at $\theta = \pm\pi/2$, at which the inverse matrix $J^{-1}(\eta_i)$ does not exist.
The dynamic model can be reformatted as:
$$\dot{x}_{1i} = x_{2i}$$
$$\dot{x}_{2i} = J_1(\eta_i)\tau_i + J_1(\eta_i)\tau_{d,i}(t) + F_i$$
where $[x_{1i}^T, x_{2i}^T]^T = [\eta_i^T, \dot{\eta}_i^T]^T$ and $J_1(\eta_i) = J(\eta_i)M^{-1}$; to simplify the notation, $J(\eta_i)$ and $J_1(\eta_i)$ are denoted as $J$ and $J_1$, respectively, and $F_i = \dot{J}J^{-1}\dot{\eta}_i - JM^{-1}C(\nu_i)J^{-1}\dot{\eta}_i - JM^{-1}D(\nu_i)J^{-1}\dot{\eta}_i - JM^{-1}\Delta(\eta_i,\nu_i) - JM^{-1}g$ represents the lumped dynamic uncertainty.
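To make the second-order form concrete, the following minimal Python sketch (an illustration only, not the authors' implementation; the inertia matrix M, the control and disturbance inputs, and the lumped uncertainty F are placeholders supplied by the caller) evaluates the kinematic transformation J(η) and the right-hand side of system (3).

```python
import numpy as np

def J_matrix(eta):
    """Kinematic transformation J(eta) built from the Euler angles (phi, theta, psi)."""
    phi, theta, psi = eta[3], eta[4], eta[5]
    cph, sph = np.cos(phi), np.sin(phi)
    cth, sth, tth = np.cos(theta), np.sin(theta), np.tan(theta)
    cps, sps = np.cos(psi), np.sin(psi)
    Ja = np.array([[cps*cth, cps*sth*sph - sps*cph, sps*sph + cps*sth*cph],
                   [sps*cth, sps*sth*sph + cps*cph, sps*sth*cph - cps*sph],
                   [-sth,    cth*sph,               cth*cph]])
    Jb = np.array([[1.0, sph*tth,  cph*tth],
                   [0.0, cph,     -sph],
                   [0.0, sph/cth,  cph/cth]])
    J = np.zeros((6, 6))
    J[:3, :3], J[3:, 3:] = Ja, Jb
    return J

def auh_dynamics(x1, x2, tau, tau_d, M, F):
    """Right-hand side of system (3): x1_dot = x2, x2_dot = J1*(tau + tau_d) + F."""
    J1 = J_matrix(x1) @ np.linalg.inv(M)   # J1(eta) = J(eta) M^{-1}
    return x2, J1 @ tau + J1 @ tau_d + F   # F lumps Coriolis, damping, and unmodeled terms
```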

2.2. Formation Structure and Graph Theory

The ideal AUH formation consists of a virtual leader and N followers; the virtual leader follows the reference path η d and the i-th follower travels along the reference path η r , i .
$$\eta_{r,i} = \eta_d + \eta_i^*$$
where $\eta_i^*$ is the relative offset between the $i$-th follower and the virtual leader; it establishes the formation structure and can be specified by the designer.
The communication topology is described by an undirected graph $\mathcal{G} = (\mathcal{V}, \varepsilon)$, where $\mathcal{V} = \{\vartheta_1,\ldots,\vartheta_N\}$ represents the node set and $\varepsilon \subseteq \{(\vartheta_i,\vartheta_k) \mid \vartheta_i,\vartheta_k \in \mathcal{V},\ \vartheta_i \neq \vartheta_k\}$ represents the edge set. The adjacency matrix $\mathcal{A} = [a_{ik}]$ is determined by $a_{ik} = 1$ if the $i$-th AUH is capable of receiving information from the $k$-th AUH; otherwise, $a_{ik} = 0$. Consider $\mathcal{V}_i = \{\vartheta_k \mid (\vartheta_i,\vartheta_k) \in \varepsilon\}$ to be the neighbor set of $\vartheta_i$, with $k \in \mathcal{N}_i$ if $\vartheta_k \in \mathcal{V}_i$. The Laplacian matrix is defined by $L = [l_{ik}]_{N\times N}$, where $l_{ii} = \sum_{k\in\mathcal{N}_i} a_{ik}$ and $l_{ik} = -a_{ik}$ for $k \neq i$.
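As an illustration of these definitions (a sketch only; the adjacency values correspond to the four-AUH topology used later in Section 5), the adjacency and Laplacian matrices can be built as follows.

```python
import numpy as np

# a[i, k] = 1 if the i-th AUH can receive information from the k-th AUH.
# Topology of Section 5: AUH 1 <-> {2, 3} and AUH 4 <-> {2, 3}.
A = np.array([[0, 1, 1, 0],
              [1, 0, 0, 1],
              [1, 0, 0, 1],
              [0, 1, 1, 0]], dtype=float)

D = np.diag(A.sum(axis=1))   # degree matrix, l_ii = sum_k a_ik
L = D - A                    # Laplacian, l_ik = -a_ik for i != k

# For a connected undirected graph, L is symmetric positive semidefinite
# with a single zero eigenvalue (cf. Assumption 1).
print(np.round(np.linalg.eigvalsh(L), 3))
```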

2.3. RBFNN

Neural networks are frequently used to estimate unknown or complex functions. This study employed RBFNN to approximate the dynamic uncertainties F i ( x i ) .
$$F_i(x_i) = W_i^{*T}S(x_i) + \varepsilon_i$$
where $x_i = [x_{1i}^T, \hat{x}_{2i}^T]^T \in \mathbb{R}^{12}$ stands for the input of the $i$-th RBFNN and $S(x_i) = [S_1^T(x_i),\ldots,S_6^T(x_i)]^T$ represents the vector of Gaussian basis functions
$$S_j(x_i) = \exp\left(-\frac{(x_i-\mu_j)^T(x_i-\mu_j)}{\sigma^2}\right)$$
where $\mu_j = [\mu_{j,1},\ldots,\mu_{j,q}]^T \in \mathbb{R}^q$ indicates the center vector of the $j$-th node, $\sigma$ denotes the basis width, $W_i^{*T} = \mathrm{blockdiag}(W_{i,1}^{*T},\ldots,W_{i,6}^{*T}) \in \mathbb{R}^{6\times 6q}$ is the ideal weight coefficient matrix with $W_{ij}^* \in \mathbb{R}^q$, and $\varepsilon_i = [\varepsilon_{i,1},\ldots,\varepsilon_{i,6}]^T$ represents the inherent approximation error.
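The Gaussian regressor and the network output can be evaluated with a few lines of Python (a sketch under the definitions above; the node count, spacing, and width mirror the values later used in Section 5, and the one-dimensional input is our own simplification for brevity).

```python
import numpy as np

def gaussian_regressor(x, centers, sigma):
    """S(x): Gaussian basis functions exp(-||x - mu_j||^2 / sigma^2), one per node."""
    diff = centers - x                      # (q, n): x - mu_j for every node center
    return np.exp(-np.sum(diff**2, axis=1) / sigma**2)

def rbfnn_output(W_hat, x, centers, sigma):
    """One output channel W_hat^T S(x) approximating a component of F_i(x_i)."""
    return W_hat @ gaussian_regressor(x, centers, sigma)

# 200 nodes with centers evenly spaced on [-4, 4] and width 6 (cf. Section 5).
centers = np.linspace(-4.0, 4.0, 200).reshape(-1, 1)
W_hat = np.zeros(200)                       # weights before adaptation
print(rbfnn_output(W_hat, np.array([0.5]), centers, 6.0))
```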
The control law design is based on the following assumptions.
Assumption 1. 
All followers can access the information of the virtual leader, and the communication among the followers is described by a connected undirected graph, which means that there exists an undirected path between every pair of nodes.
Assumption 2. 
The disturbances  τ d , i ( t )  and their derivative  τ ˙ d , i ( t )   are bounded.
Assumption 3. 
The tracking path $\eta_d$ and its derivative are bounded and periodic or semi-periodic.
The control objectives are listed as follows: the AUHs are driven to maintain the preset formation configuration while tracking the reference trajectory, and the specified performance can be achieved within the limits of the performance function. The experience-based control law is designed to reduce the computing burden by using the experience obtained from learning to approximate the dynamic uncertainty.

3. NN-Based AUH Formation Control Mechanism

3.1. FTPPC

Definition 1 
[19]. A smooth function $\beta(t)$ can be defined as a finite-time performance function (FTPF) if it satisfies $\beta(t) = \beta(t_f)$ for $t \geq t_f$, where $\beta(t_f)$ is a small constant and $t_f$ is the specified settling time.
According to Definition 1, an FTPF candidate is proposed:
$$\dot{\beta}(t) = \begin{cases} -k_1\left[(\beta(t)-\beta_\infty)^{2-k_2} + (\beta(t)-\beta_\infty)^{k_2}\right], & t \leq t_f \\ 0, & t > t_f \end{cases}$$
where $k_1 = \frac{1}{(1-k_2)t_f}\tan^{-1}\left((\beta_0-\beta_\infty)^{1-k_2}\right)$, $0 < k_2 < 1$, and $\beta_0$ and $\beta_\infty$ are design parameters that constrain the overshoot and the steady-state error, respectively. Next, we will prove that (7) is an FTPF.
Proof of Definition 1. 
Consider the following Lyapunov function:
$$V_\beta = \frac{1}{2}e_\beta^2$$
where $e_\beta = \beta(t) - \beta_\infty$. Combined with (7), we obtain:
$$\dot{V}_\beta = -k_1 e_\beta\left[(\beta(t)-\beta_\infty)^{2-k_2} + (\beta(t)-\beta_\infty)^{k_2}\right] = -k_1\left(e_\beta^{3-k_2} + e_\beta^{1+k_2}\right) = -k_1\left(\mu_1 V_\beta^{\frac{3-k_2}{2}} + \mu_2 V_\beta^{\frac{1+k_2}{2}}\right)$$
where $\mu_1 = 2^{\frac{3-k_2}{2}}$ and $\mu_2 = 2^{\frac{1+k_2}{2}}$. Since $0 < k_2 < 1$, we have $\frac{1+k_2}{2} < 1$. Moreover, when $e_\beta \neq 0$, $V_\beta^{1-k_2} > 0$. Thus, $e_\beta$ converges to 0 in finite time.
Let $x_\beta = V_\beta^{\frac{1-k_2}{2}}$; then, (9) can be rewritten as follows:
$$V_\beta^{-\frac{1+k_2}{2}}\frac{dV_\beta}{dt} = -k_1\left(\mu_1 V_\beta^{1-k_2} + \mu_2\right)$$
$$\frac{2}{1-k_2}\frac{dV_\beta^{\frac{1-k_2}{2}}}{dt} = -k_1\left(\mu_1 V_\beta^{1-k_2} + \mu_2\right)$$
$$\frac{2}{1-k_2}\frac{dx_\beta}{dt} = -k_1\left(\mu_1 x_\beta^2 + \mu_2\right)$$
$$\frac{1}{\mu_1 x_\beta^2 + \mu_2}\,dx_\beta = -\frac{(1-k_2)k_1}{2}\,dt$$
By integrating both sides of (10), we obtain:
$$(\mu_1\mu_2)^{-\frac{1}{2}}\tan^{-1}\left((\mu_1/\mu_2)^{\frac{1}{2}}x_\beta(t)\right) = (\mu_1\mu_2)^{-\frac{1}{2}}\tan^{-1}\left((\mu_1/\mu_2)^{\frac{1}{2}}x_\beta(0)\right) - \frac{(1-k_2)k_1}{2}t$$
Substituting $\mu_1\mu_2 = 4$ into (11) yields:
$$\tan^{-1}\left((\mu_1/\mu_2)^{\frac{1}{2}}x_\beta(t)\right) = \tan^{-1}\left((\mu_1/\mu_2)^{\frac{1}{2}}x_\beta(0)\right) - (1-k_2)k_1 t$$
Therefore, there exists $t_f$ satisfying $x_\beta(t_f) = 0$:
$$t_f = \frac{2}{(\mu_1\mu_2)^{\frac{1}{2}}(1-k_2)k_1}\tan^{-1}\left((\mu_1/\mu_2)^{\frac{1}{2}}x_\beta(0)\right) = \frac{1}{(1-k_2)k_1}\tan^{-1}\left((\beta_0-\beta_\infty)^{1-k_2}\right)$$
Then, we know that $\beta(t)$ tends to $\beta_\infty$ in the finite time $t_f$. □
The tracking error transformations are described in the following formula:
$$e_{1,ij} = \beta_{ij}(t)\,T_{ij}(z_{1,ij}),\quad j = 1,\ldots,6$$
where $\beta_{ij}(t)$ is the FTPF, whose update law is provided in (7), $z_{1,ij}$ are the transformed errors, and $T_{ij}(z_{1,ij})$ refers to the error transformation function, which can be chosen as:
$$T_{ij}(z_{1,ij}) = \frac{\exp(z_{1,ij}) - \exp(-z_{1,ij})}{\exp(z_{1,ij}) + \exp(-z_{1,ij})}$$
Substituting (15) into (14) yields:
$$z_{1,ij} = \frac{1}{2}\ln\left(1 + \frac{e_{1,ij}}{\beta_{ij}(t)}\right) - \frac{1}{2}\ln\left(1 - \frac{e_{1,ij}}{\beta_{ij}(t)}\right)$$
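The FTPF and the error transformation are straightforward to implement; the sketch below (our own illustration, using the parameter values reported later in Section 5, i.e., beta0 = 5, beta_inf = 0.03, k2 = 0.8, tf = 20 s) integrates (7) with a forward Euler scheme and evaluates (16).

```python
import numpy as np

def ftpf_gain_k1(beta0, beta_inf, k2, tf):
    """k1 chosen so that beta(t) settles at beta_inf at t = tf, cf. (13)."""
    return np.arctan((beta0 - beta_inf)**(1.0 - k2)) / ((1.0 - k2) * tf)

def ftpf_trajectory(beta0, beta_inf, k2, tf, dt=1e-3):
    """Forward-Euler integration of the FTPF dynamics (7); beta is frozen after tf."""
    k1 = ftpf_gain_k1(beta0, beta_inf, k2, tf)
    t = np.arange(0.0, 1.2 * tf, dt)
    beta = np.empty_like(t)
    beta[0] = beta0
    for n in range(1, t.size):
        e = max(beta[n - 1] - beta_inf, 0.0)
        rate = -k1 * (e**(2.0 - k2) + e**k2) if t[n] <= tf else 0.0
        beta[n] = max(beta[n - 1] + dt * rate, beta_inf)
    return t, beta

def transformed_error(e1, beta_t):
    """Transformed error z1 of (16); only valid while |e1| < beta(t)."""
    r = np.clip(e1 / beta_t, -0.999999, 0.999999)
    return 0.5 * np.log(1.0 + r) - 0.5 * np.log(1.0 - r)

t, beta = ftpf_trajectory(beta0=5.0, beta_inf=0.03, k2=0.8, tf=20.0)
print(beta[0], beta[-1])                 # starts at 5 and settles near 0.03 around t = tf
print(transformed_error(0.02, beta[-1])) # transformed error near the performance bound
```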

3.2. State Observer Design

We introduce a state observer to estimate the high-order state $x_{2i}$. Owing to the uncertainties and disturbances in the dynamic model, an RBFNN, a disturbance observer, and a Luenberger observer are combined to estimate the state variable $x_{2i}$. The state observer based on the RBFNN and the disturbance observer is designed as follows:
$$\dot{\hat{x}}_{1i} = \hat{x}_{2i} + L_{1i}(y_i - \hat{y}_i)$$
$$\dot{\hat{x}}_{2i} = J_1\tau_i + J_1\hat{\tau}_{d,i} + \hat{F}_i + L_{2i}(y_i - \hat{y}_i)$$
$$\hat{y}_i = \hat{x}_{1i}$$
where $\hat{x}_{1i}$, $\hat{x}_{2i}$, and $\hat{y}_i$ represent the estimates of the state variables and the output variable, respectively. $\hat{\tau}_{d,i}$ denotes the estimate of the disturbance, $\hat{F}_i$ stands for the RBFNN approximation of the lumped uncertainty, and $L_{1i}$ and $L_{2i}$ are the gain matrices to be designed. The state estimation error dynamics can be calculated as follows:
$$\dot{\tilde{x}}_{1i} = -L_{1i}\tilde{x}_{1i} + \tilde{x}_{2i}$$
$$\dot{\tilde{x}}_{2i} = -L_{2i}\tilde{x}_{1i} + J_1\tilde{\tau}_{d,i} + \tilde{F}_i$$
where $\tilde{x}_{1i} = x_{1i} - \hat{x}_{1i}$ and $\tilde{x}_{2i} = x_{2i} - \hat{x}_{2i}$ stand for the state estimation errors, $\tilde{\tau}_{d,i}$ stands for the disturbance estimation error, and $\tilde{F}_i$ denotes the approximation error. Defining $\tilde{x}_i = [\tilde{x}_{1i}^T, \tilde{x}_{2i}^T]^T$, (18) can be expressed in matrix form:
$$\dot{\tilde{x}}_i = A\tilde{x}_i + B\bar{\tilde{F}}_i$$
where $A = \begin{bmatrix} -L_{1i} & I \\ -L_{2i} & 0 \end{bmatrix}$, $B = \begin{bmatrix} 0 \\ I \end{bmatrix}$, and $\bar{\tilde{F}}_i = J_1\tilde{\tau}_{d,i} + \tilde{F}_i$.
Remark 1. 
Although the Luenberger observer is traditionally applied to linear systems, this study extends its application by integrating it with an RBFNN to address the nonlinearities in the AUH model. Specifically, the RBFNN approximates the nonlinear components of the system dynamics, while the Luenberger observer estimates the linearized states. This hybrid design leverages the Luenberger observer’s efficiency in linear dynamics estimation and the RBFNN’s capability to handle nonlinearities.
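As a sketch of how (17) can be discretized in practice (an illustration under our own conventions, not the authors' implementation), one Euler step of the observer reads:

```python
import numpy as np

def state_observer_step(x1_hat, x2_hat, y, tau, tau_d_hat, F_hat, J1, L1, L2, dt):
    """One forward-Euler step of the NN-based state observer (17).

    y         : measured position/attitude eta_i (the system output)
    tau_d_hat : disturbance estimate produced by the disturbance observer
    F_hat     : RBFNN approximation of the lumped uncertainty F_i
    L1, L2    : observer gain matrices
    """
    innovation = y - x1_hat                                   # y_i - y_hat_i
    x1_dot = x2_hat + L1 @ innovation
    x2_dot = J1 @ tau + J1 @ tau_d_hat + F_hat + L2 @ innovation
    return x1_hat + dt * x1_dot, x2_hat + dt * x2_dot
```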

3.3. NN-Based AUH Formation Controller

The tracking errors are defined as:
$$e_{1,i} = \eta_i - \eta_{r,i} = x_{1i} - \eta_{r,i}$$
Combined with (16), $\dot{z}_{1,i}$ is calculated as:
$$\dot{z}_{1,i} = \Upsilon_i\dot{e}_{1,i} - \Theta_i e_{1,i} = \Upsilon_i\left(x_{2i} - \dot{\eta}_{r,i}\right) - \Theta_i e_{1,i}$$
where $\Upsilon_i = \mathrm{diag}\{\chi_{i1},\ldots,\chi_{i6}\}$, $\Theta_i = \mathrm{diag}\{\theta_{i1},\ldots,\theta_{i6}\}$, $\chi_{ij} = \frac{1}{2}\left(\frac{1}{\beta_{ij}(t)+e_{1,ij}} + \frac{1}{\beta_{ij}(t)-e_{1,ij}}\right)$, and $\theta_{ij} = \frac{1}{2}\left(\frac{\dot{\beta}_{ij}(t)}{\beta_{ij}(t)\left(\beta_{ij}(t)+e_{1,ij}\right)} + \frac{\dot{\beta}_{ij}(t)}{\beta_{ij}(t)\left(\beta_{ij}(t)-e_{1,ij}\right)}\right)$.
Step 1: The Lyapunov function is selected as $V_1 = \frac{1}{2}\sum_{i=1}^{N}\tilde{x}_i^T\tilde{x}_i + \frac{1}{2}\sum_{i=1}^{N}z_{1,i}^T z_{1,i}$. According to (19) and (21), the derivative of $V_1$ can be calculated as:
$$\dot{V}_1 = \sum_{i=1}^{N}\left\{\tilde{x}_i^T A\tilde{x}_i + \tilde{x}_i^T B\bar{\tilde{F}}_i + z_{1,i}^T\left[\Upsilon_i\left(\hat{x}_{2i} + \tilde{x}_{2i} - \dot{\eta}_{r,i}\right) - \Theta_i e_{1,i}\right]\right\}$$
Consider $\hat{x}_{2i}$ as the virtual control variable, and define $z_{2,i}$ as:
$$z_{2,i} = \hat{x}_{2i} - \alpha_i$$
where the virtual control law $\alpha_i$ is designed as:
$$\alpha_i = -\Upsilon_i^{-1}K_{1,i}z_{1,i} + \dot{\eta}_{r,i} + \Upsilon_i^{-1}\Theta_i e_{1,i}$$
where $K_{1,i} = \mathrm{diag}[k_{1,i1},\ldots,k_{1,i6}] > 0$ is the gain matrix. Combining (22), (23), and (24), we obtain:
$$\dot{V}_1 = \sum_{i=1}^{N}\left(\tilde{x}_i^T A\tilde{x}_i + \tilde{x}_i^T B\bar{\tilde{F}}_i - z_{1,i}^T K_{1,i}z_{1,i} + z_{1,i}^T\Upsilon_i z_{2,i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i}\right)$$
Next, a nonlinear observer is proposed to estimate $\dot{\alpha}_i$, which avoids its direct computation:
$$\dot{\varpi}_i = \varrho_i - \varsigma_1\,\mathrm{sig}^{1-1/\varsigma_3}(\varpi_i - \alpha_i)$$
$$\dot{\varrho}_i = -\varsigma_2\,\mathrm{sig}^{1-2/\varsigma_3}(\varpi_i - \alpha_i)$$
where $\varpi_i$ and $\varrho_i$ are the estimates of $\alpha_i$ and $\dot{\alpha}_i$, respectively. The function $\mathrm{sig}$ is defined by $\mathrm{sig}^a(m) = \mathrm{sign}(m)|m|^a$, and $\varsigma_1 > 0$, $\varsigma_2 > 0$, and $\varsigma_3 > 2$ are design parameters.
Considering the estimation errors $\tilde{\varpi}_i = \varpi_i - \alpha_i$ and $\tilde{\varrho}_i = \varrho_i - \dot{\alpha}_i$, the error dynamics can be calculated as:
$$\dot{\tilde{\varpi}}_i = \tilde{\varrho}_i - \varsigma_1\,\mathrm{sig}^{1-1/\varsigma_3}(\tilde{\varpi}_i)$$
$$\dot{\tilde{\varrho}}_i = -\varsigma_2\,\mathrm{sig}^{1-2/\varsigma_3}(\tilde{\varpi}_i) - \ddot{\alpha}_i$$
According to [36], the error dynamics (27) are stable, which means that $\tilde{\varrho}_i$ converges to a compact set and satisfies $\|\tilde{\varrho}_i\| \leq \bar{\varrho}_i$, with $\bar{\varrho}_i$ being a small constant.
Remark 2. 
Note that the direct computation of $\dot{\alpha}_i$ requires the use of the higher-order state $x_{2i}$, which is unmeasurable and unavailable, and the $\dot{\alpha}_i$ obtained directly by a differentiator is often not smooth. To address this problem, the nonlinear observer is used to estimate the signal $\dot{\alpha}_i$, which avoids both the direct computation of $\dot{\alpha}_i$ and the use of a differentiator and is more practical for control law design.
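A minimal discrete-time sketch of the observer (26) is given below (illustration only; varpi and varrho denote the estimates of alpha and alpha_dot, and the step size and gains are arbitrary placeholders).

```python
import numpy as np

def sig(m, a):
    """sig^a(m) = sign(m) * |m|^a, applied elementwise."""
    return np.sign(m) * np.abs(m)**a

def alpha_dot_observer_step(varpi, varrho, alpha, s1, s2, s3, dt):
    """One Euler step of the nonlinear observer (26): varpi -> alpha, varrho -> alpha_dot."""
    err = varpi - alpha
    varpi_dot = varrho - s1 * sig(err, 1.0 - 1.0 / s3)
    varrho_dot = -s2 * sig(err, 1.0 - 2.0 / s3)
    return varpi + dt * varpi_dot, varrho + dt * varrho_dot
```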
Step 2: The derivative of $z_{2,i}$ can be computed as:
$$\dot{z}_{2,i} = J_1\tau_i + J_1\tau_{d,i}(t) + F_i - \dot{\tilde{x}}_{2i} - \dot{\alpha}_i = J_1\tau_i + d_i + W_i^{*T}S(x_i) - \dot{\tilde{x}}_{2i} - \dot{\alpha}_i$$
where $d_i = J_1\tau_{d,i} + \varepsilon_i$ represents the sum of the disturbance and the RBFNN inherent error, which can be estimated by the following disturbance observer:
$$\dot{\xi}_i = -l_i\xi_i - l_i\left[J_1\tau_i - \varrho_i + \hat{W}_i^T S(x_i) + p_i(z_{2,i})\right]$$
$$\hat{d}_i = \xi_i + p_i(z_{2,i})$$
where $l_i > 0$ is a design parameter, $p_i(z_{2,i}) = \int l_i\dot{z}_{2,i}\,dt$, and $\xi_i$ is the observer state variable.
The estimation error dynamics are computed as follows:
$$\dot{\tilde{d}}_i = \dot{d}_i - \dot{\hat{d}}_i = l_i\xi_i + l_i\left[J_1\tau_i - \varrho_i + \hat{W}_i^T S(x_i) + p_i(z_{2,i})\right] - l_i\dot{z}_{2,i} + \dot{d}_i = -l_i\tilde{d}_i - l_i\tilde{W}_i^T S(x_i) + l_i\dot{\tilde{x}}_{2i} - l_i\tilde{\varrho}_i + \dot{d}_i$$
where $\tilde{d}_i = d_i - \hat{d}_i$ and $\tilde{W}_i = W_i^* - \hat{W}_i$.
Based on multi-agent consensus and graph theory, and taking full advantage of the information exchange between neighboring AUHs, the weight update law of the cooperative neural network is devised as follows:
$$\dot{\hat{W}}_{ij} = \Gamma_{1,ij}\left[S_j(x_i)z_{2,ij} - \sigma_{ij}\hat{W}_{ij}\right] - \gamma_{2,ij}\sum_{k=1}^{N}a_{ik}\left(\hat{W}_{ij} - \hat{W}_{kj}\right)$$
where $\Gamma_{1,ij}$, $\sigma_{ij}$, and $\gamma_{2,ij}$ are positive constants, $\Gamma_{1,ij}[S_j(x_i)z_{2,ij} - \sigma_{ij}\hat{W}_{ij}]$ represents the adaptive term of the individual AUH, and $-\gamma_{2,ij}\sum_{k=1}^{N}a_{ik}(\hat{W}_{ij} - \hat{W}_{kj})$ stands for the consensus-based cooperative adaptation term of the AUHs.
Remark 3. 
Although this work focuses on formation control, the consensus problem is introduced as a critical step to ensure consistent information sharing among the AUHs. Specifically, the connection and interaction facilitated through the cooperative neural network ensure rapid convergence of the weight coefficients, thereby enabling a robust and efficient formation control strategy. This perspective aligns with the broader understanding of consensus as a means to achieve coordinated behavior in multi-agent systems.
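To illustrate how (31) couples the agents (a sketch under our own array conventions, not the authors' code), one Euler step of the cooperative update for a single output channel j can be written as:

```python
import numpy as np

def cooperative_weight_step(W_hat, S, z2, A, Gamma1, sigma, gamma2, dt):
    """One Euler step of the cooperative adaptation law (31) for output channel j.

    W_hat  : (N, q) weight estimates of the N AUHs
    S      : (N, q) regressors S_j(x_i) of the N AUHs
    z2     : (N,)   virtual-control errors z_{2,ij}
    A      : (N, N) adjacency matrix a_ik
    """
    N = W_hat.shape[0]
    W_next = np.empty_like(W_hat)
    for i in range(N):
        adaptive = Gamma1 * (S[i] * z2[i] - sigma * W_hat[i])          # individual term
        consensus = gamma2 * sum(A[i, k] * (W_hat[i] - W_hat[k])       # cooperative term
                                 for k in range(N))
        W_next[i] = W_hat[i] + dt * (adaptive - consensus)
    return W_next
```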
The NN-based cooperative formation controller is designed using the state observer (17), the disturbance observer (29), and the update law (31):
$$\tau_i = J_1^{-1}\left[-K_{2,i}z_{2,i} - \Upsilon_i z_{1,i} + \varrho_i - \hat{W}_i^T S(x_i) - \hat{d}_i\right]$$
Theorem 1. 
If we consider the AUHs’ formation system (3), the state observer (17), the disturbance observer (29), the cooperative RBFNN adaptive update law (31), and the NN-based cooperative formation controller (32), then (i) all the state variables of the system are uniformly ultimately bounded (UUB), and (ii) the system achieves the specified performance, with the tracking errors converging in finite time within the bounds of the performance function.
The relevant proof procedure for Theorem 1 can be seen in Proof A1 in Appendix A.
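Putting the pieces together, the control law (32) amounts to one matrix solve per time step. The sketch below (an illustration only; every argument is assumed to be produced by the observers and the adaptation law described above) shows the computation.

```python
import numpy as np

def nn_formation_control(z1, z2, Upsilon, K2, J1, varrho, W_hat_S, d_hat):
    """NN-based cooperative formation control law (32).

    varrho  : estimate of alpha_dot from the nonlinear observer (26)
    W_hat_S : RBFNN output W_hat_i^T S(x_i)
    d_hat   : disturbance estimate from the disturbance observer (29)
    """
    # tau_i = J1^{-1} (-K2 z2 - Upsilon z1 + varrho - W_hat_S - d_hat)
    return np.linalg.solve(J1, -K2 @ z2 - Upsilon @ z1 + varrho - W_hat_S - d_hat)
```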

4. AUH Formation Control Using Experience

We propose a learning mechanism that enables the NN-based formation controller (32) to learn the dynamic uncertainties $F_i(x_i)$ and store the acquired knowledge during the adaptive process. Furthermore, the gained experience is utilized to establish an experience-based cooperative formation controller, which deals with similar dynamic uncertainties $F_i(x_i)$ using experience instead of applying the adaptive method repeatedly.

4.1. Learn from Formation Track Control

In this section, our goal is to establish a cooperative formation controller using the localized RBFNN which can learn dynamic uncertainties over a local range along the periodic reference trajectory.
Lemma 1 
([22]). In localized RBF networks, each individual basis function has a limited impact on the network output. In other words, the localized RBFNN only has a localized representation during deterministic learning. Therefore, for any bounded orbit $x_i(t)$, the model uncertainty $F_i(x_i)$ can be approximated using the neurons located in a local region along the trajectory:
$$F_i(x_i) = W_\zeta^{*T}S_\zeta(x_i) + \varepsilon_{\zeta,i}$$
where $\varepsilon_{\zeta,i}$ is the inherent error and $S_\zeta(x_i) \in \mathbb{R}^{q_\zeta}$ is the regression subvector formed by the neural nodes whose centers are placed close to the trajectory, i.e., $\|\mu_j - x_i\| < d_N$, where $\mu_j$ is the center vector defined in Section 2.
Lemma 2 
([22]). Consider an arbitrary continuous periodic orbit $x_i(t)$ within a bounded set $\Omega_{x_i} \subset \mathbb{R}^d$. Then, for an RBFNN $W^T S(x_i)$ with node centers located on a sufficiently large regular lattice $\Omega_\zeta$ satisfying $\Omega_{x_i} \subset \Omega_\zeta$, the regression subvector $S_\zeta(x_i)$ defined in Lemma 1 satisfies the persistently exciting (PE) condition.
Remark 4. 
It is important to note that satisfaction of the PE condition is one of the bases of deterministic learning. In other words, deterministic learning occurs when the system tracks the periodic orbit and the regression subvector satisfies the PE condition simultaneously.
Theorem 2. 
Consider the AUH system (3) with Assumptions 1~3. For each AUH, if there exists a sufficiently large compact set such that $x_i \in \Omega_{x_i}$ holds at all times, then the following conclusions hold provided that $[x_{1,i}^T(0), x_{2,i}^T(0)]^T \in \Omega_{x_i}(0)$ and $\hat{W}_{ij}(0) = 0$.
(1) The partial PE condition of the regression subvector is satisfied.
(2) Along the periodic reference trajectory $\varphi_{\zeta,i}(x_i(t))|_{t\geq T_i}$, the weights of the regression subvector converge to the optimal values $W_{\zeta,ij}^*$.
(3) A locally accurate approximation of the dynamic uncertainty $F_i(x_i)$ can be realized by both $\hat{W}_i^T S(x_i)$ and $\bar{W}_i^T S(x_i)$:
$$F_i(x_i) = \hat{W}_i^T S(x_i) + \tilde{W}_i^T S(x_i) + \varepsilon_i = \bar{W}_i^T S(x_i) + \bar{\varepsilon}_i$$
where $\bar{W}_i^T = \mathrm{blockdiag}(\bar{W}_{i,1}^T,\ldots,\bar{W}_{i,6}^T) \in \mathbb{R}^{6\times 6q}$ with
$$\bar{W}_{ij} = \mathrm{mean}\left\{\hat{W}_{ij}(t),\ t\in[t_{a,i},\,t_{b,i}]\right\}$$
where $[t_{a,i}, t_{b,i}]$ is a time interval after the transient of system (3) has died out, and $\bar{\varepsilon}_i = [\bar{\varepsilon}_{i,1},\ldots,\bar{\varepsilon}_{i,6}]^T$ is close to $\varepsilon_i$.
Remark 5. 
The time interval $[t_{a,i}, t_{b,i}]$ should span at least one orbit period after the system converges, which means that enough dynamic information is expressed and the RBFNN is sufficiently trained.
The relevant proof procedure for Theorem 2 can be seen in Proof A2 in Appendix A.
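The experience-extraction step of (35) is a simple time average of the logged weights; a possible implementation (a sketch only, assuming the weight history has been recorded during the adaptive phase) is:

```python
import numpy as np

def extract_experience(W_hat_history, t_history, t_a, t_b):
    """Constant experience weights W_bar of (35): the mean of the adaptive weights
    over [t_a, t_b], taken after the tracking error has converged (Remark 5).

    W_hat_history : array of logged W_hat samples, time along the first axis
    t_history     : matching array of time stamps
    """
    mask = (t_history >= t_a) & (t_history <= t_b)
    return W_hat_history[mask].mean(axis=0)
```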

4.2. Experience-Based AUH Formation Controller

According to Theorem 2, the experience-based RBFNN can locally and accurately approximate the dynamic uncertainties along the periodic reference trajectory. The RBFNN weights obtained by the adaptive method are learned by the controller and stored in $\bar{W}_i^T$.
By using the experience, the experience-based disturbance observer can be specified as follows:
$$\dot{\xi}_i = -l_i\xi_i - l_i\left[J_1\tau_i - \varrho_i + \bar{W}_i^T S(x_i) + p_i(z_{2,i})\right]$$
$$\hat{d}_i = \xi_i + p_i(z_{2,i})$$
The experience-based cooperative formation controller can be designed as:
$$\tau_i = J_1^{-1}\left[-K_{2,i}z_{2,i} - \Upsilon_i z_{1,i} + \varrho_i - \bar{W}_i^T S(x_i) - \hat{d}_i\right]$$
Theorem 3. 
If we consider the AUH formation system (3) with Assumptions 1~3, the state observer (17), the experience-based disturbance observer (36), and the experience-based cooperative formation controller (37), then (i) all states of the system are UUB, and (ii) the system achieves the specified performance, with the tracking errors converging in finite time within the bounds of the performance function.
The relevant proof procedure for Theorem 3 can be seen in Proof A3 in Appendix A.

5. Simulation Results

To validate the proposed method, a group of four homogeneous AUHs was simulated in a lozenge formation tracking a predefined reference trajectory. The AUHs were modeled with identical dynamic parameters, operating in an environment with external disturbances and communication constraints. This setup represents practical applications such as collaborative underwater inspections, environmental monitoring, and resource exploration. The AUH model dynamics are given in [14]. Table 2 presents the initial conditions of the AUH formation. The desired formation shape is a lozenge, and each vertex of the lozenge corresponds to an AUH’s barycenter. We choose the desired relative offsets as $\eta_1^* = [0, 5, 0]^T$, $\eta_2^* = [5, 0, 0]^T$, $\eta_3^* = [0, -5, 0]^T$, and $\eta_4^* = [-5, 0, 0]^T$ to achieve the lozenge formation shape. The neighbor sets $\mathcal{V}_1 = \{\vartheta_2, \vartheta_3\}$, $\mathcal{V}_2 = \{\vartheta_1, \vartheta_4\}$, $\mathcal{V}_3 = \{\vartheta_1, \vartheta_4\}$, and $\mathcal{V}_4 = \{\vartheta_2, \vartheta_3\}$ describe the communication topology. The modeling uncertainties are given by $\Delta(\eta_i, \nu_i) = [\Delta_1, \Delta_2, \Delta_3, \Delta_4, \Delta_5, \Delta_6]^T$, where $\Delta_1 = \cos(u_i^2) + 1$, $\Delta_2 = \sin(v_i^2) + 1$, $\Delta_3 = 0.1w_i^2 + 1$, $\Delta_4 = 1 - \cos(p_i^2)$, $\Delta_5 = 1 - \sin(q_i^2)$, and $\Delta_6 = 0.2r_i^2$.
We construct the RBFNNs $\hat{W}_{ij}^T S_j(x_i)$, $i = 1,\ldots,4$, $j = 1,\ldots,6$, to approximate the modeling uncertainties, using 200 nodes with the centers evenly spaced on $[-4, 4]$ and the widths set to 6.
The parameters of the FTPPC are chosen as $\rho_{i0} = 5$, $\rho_{i\infty} = 0.03$, $k_2 = 0.8$, and $t_f = 20$. The parameters of the state observer (17), the disturbance observer (29), the adaptive law (31), and the formation control law (32) are given as $L_{1i} = \mathrm{diag}\{15, 15, 15, 15, 15, 15\}$, $L_{2i} = \mathrm{diag}\{50, 50, 50, 50, 50, 50\}$, $K_{1,i} = \mathrm{diag}\{5, 5, 5, 5, 5, 5\}$, $K_{2,i} = \mathrm{diag}\{4, 4, 4, 4, 4, 4\}$, $\varsigma_1 = 4$, $\varsigma_2 = 4$, and $\varsigma_3 = 2.5$. The training interval $[t_{a,i}, t_{b,i}]$ is set to $[20, 20 + 20\pi]$, where 20 s is the convergence time specified by the FTPPC and $20\pi$ s is the orbit period.
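For reference, the leader trajectory of Table 2 and the follower reference paths of (4) used in this scenario can be reproduced with the short sketch below (our own illustration; the angular components of the offsets are padded with zeros as an assumption).

```python
import numpy as np

def eta_d(t):
    """Reference trajectory of the virtual leader (Table 2)."""
    return np.array([30*np.sin(0.1*t) + 20*np.cos(0.1*t),
                     20*np.sin(0.1*t) - 30*np.cos(0.1*t),
                     0.0, 0.0, 0.0, 0.1*t])

# Lozenge offsets eta_i^* (positions only, padded to 6 DOF).
offsets = [np.array([ 0.0,  5.0, 0.0, 0.0, 0.0, 0.0]),
           np.array([ 5.0,  0.0, 0.0, 0.0, 0.0, 0.0]),
           np.array([ 0.0, -5.0, 0.0, 0.0, 0.0, 0.0]),
           np.array([-5.0,  0.0, 0.0, 0.0, 0.0, 0.0])]

# Follower reference paths eta_{r,i} = eta_d + eta_i^* (Equation (4)), evaluated at t = 10 s.
eta_r = [eta_d(10.0) + off for off in offsets]
print(np.round(eta_r[0], 2))
```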
Figure 2 illustrates the shape and trajectory of the AUH formation. Figure 3 shows the tracking error under the proposed experience-based cooperative formation control law (37), which converges in finite time within the limit of the performance function. Furthermore, the tracking error remains in a compact set when $t > 20 + 20\pi$, which means that the tracking performance is guaranteed when experience is used to deal with the dynamic uncertainty. Figure 4 illustrates the RBFNN approximation error for the dynamic uncertainty, which converges to a compact set during the adaptive process. Moreover, it can be seen that the approximation error remains in an admissible set after the experience is used at $t = (20 + 20\pi)$ s.
Several comparison experiments are provided to further demonstrate the superiority of the proposed algorithm. The experience-based control method in this paper is compared with the adaptive NN-based control method used in [18]; both were run in MATLAB 2021a on a computer with an Intel i5-11400 (2.60 GHz) and 16 GB of RAM, with the solver step set to 0.01 s. The results in Table 3 show that the solver running time is shorter with the experience-based control method under the same tracking accuracy, which means that the computing burden is reduced. The NN-based state observer (NN-SO) is compared with the ESO proposed in [17], and the result is shown in Figure 5. The NN-SO has better estimation accuracy and a smaller oscillation magnitude than the ESO. Figure 6 provides the comparison between the disturbance observer (DO) used in this paper and the ESO proposed in [17]; the disturbance observer has better estimation accuracy than the ESO. A comparison between the FTPPC used in our work and the PPC in [18] is shown in Figure 7. The FTPPC leads to a faster convergence speed, which is more suitable for deterministic learning control methods.
To highlight the advantages of the proposed formation control algorithm, we conducted a comparative simulation study against a recently published method [37]. This method also employs a prescribed performance control strategy and was chosen for its relevance and novelty in addressing formation control challenges.
To ensure a fair and rigorous comparison, the following conditions were maintained for both algorithms: (1) both controllers were initialized with identical formation configurations and tracking errors. (2) The parameters defining the prescribed performance functions were set equivalently. (3) Both simulations included the same model uncertainty and external disturbances.
The comparative results, presented in Figure 8, demonstrate the following advantages of the proposed algorithm: the proposed method achieved faster error decay during the transient phase compared to the reference algorithm, indicating quicker adaptation to the desired trajectory. The steady-state tracking error achieved by the proposed method was significantly smaller, reflecting higher accuracy and stability under disturbances and uncertainties. These findings confirm that the proposed control algorithm outperforms the reference method in both transient and steady-state performance.

6. Conclusions

This study presents a novel formation control framework for AUHs that integrates prescribed performance control, hybrid observers, and experience-based learning. The proposed method offers several advantages: the hybrid observer, which combines RBFNN with a Luenberger observer, effectively addresses the nonlinear dynamics of AUHs, resulting in higher estimation accuracy. Additionally, the incorporation of experience-based learning reduces the need for continuous adaptation, thereby enhancing computational efficiency. Furthermore, the adoption of finite-time prescribed performance control ensures faster convergence of tracking errors compared to conventional approaches, contributing to improved overall performance. Nevertheless, the proposed method has certain limitations. It relies on reliable communication among AUHs, necessitates sufficient training data to support the experience-based framework, and imposes moderate computational demands on onboard systems. Moreover, a key limitation of this study is the absence of experimental validation using real robots.
To address these limitations, future research will focus on developing decentralized communication frameworks to enhance robustness and designing training methods suitable for highly dynamic environments. In parallel, we are actively pursuing the miniaturization and manufacturing of additional AUH prototypes. These efforts aim to facilitate future experiments with real robots, which will validate the proposed control strategy under realistic operating conditions and further demonstrate its practical applicability.

Author Contributions

Conceptualization, Z.W.; methodology, Z.W. and Z.S.; validation, Z.W. and H.H.; writing—original draft preparation, Z.W. and Z.S.; supervision, H.H.; project administration, H.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Project of the Donghai Laboratory, grant number DH-2022ZY0004.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Proof A1. 
Consider the Lyapunov function with the following expressions:
$$V_2 = V_1 + \sum_{i=1}^{N}\left(\frac{1}{2}z_{2,i}^T z_{2,i} + \frac{1}{2}\tilde{d}_i^T\tilde{d}_i + \frac{1}{2}\sum_{j=1}^{6}\tilde{W}_{ij}^T\Gamma_{1,ij}^{-1}\tilde{W}_{ij}\right)$$
whose derivative can be calculated as:
$$\dot{V}_2 = \sum_{i=1}^{N}\Big\{\tilde{x}_i^T A\tilde{x}_i + \tilde{x}_i^T B\bar{\tilde{F}}_i - z_{1,i}^T K_{1,i}z_{1,i} + z_{1,i}^T\Upsilon_i z_{2,i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} + z_{2,i}^T\left[J_1\tau_i + d_i + W_i^{*T}S(x_i) - \dot{\tilde{x}}_{2i} - \dot{\alpha}_i\right] + \tilde{d}_i^T\left[-l_i\tilde{d}_i - l_i\tilde{W}_i^T S(x_i) + l_i\dot{\tilde{x}}_{2i} - l_i\tilde{\varrho}_i + \dot{d}_i\right] - \sum_{j=1}^{6}\tilde{W}_{ij}^T\Gamma_{1,ij}^{-1}\dot{\hat{W}}_{ij}\Big\}$$
Substituting (32) and (31) into (A2) yields:
$$\dot{V}_2 = \sum_{i=1}^{N}\Big\{\tilde{x}_i^T A\tilde{x}_i - z_{1,i}^T K_{1,i}z_{1,i} - l_i\tilde{d}_i^T\tilde{d}_i - z_{2,i}^T K_{2,i}z_{2,i} + z_{2,i}^T\tilde{d}_i + z_{2,i}^T\tilde{\varrho}_i + \sum_{j=1}^{6}\sigma_{ij}\tilde{W}_{ij}^T\hat{W}_{ij} - z_{2,i}^T\dot{\tilde{x}}_{2i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} - l_i\tilde{d}_i^T\tilde{W}_i^T S(x_i) + l_i\tilde{d}_i^T\dot{\tilde{x}}_{2i} - l_i\tilde{d}_i^T\tilde{\varrho}_i + \tilde{d}_i^T\dot{d}_i + \tilde{x}_{2i}^T\tilde{W}_i^T S(x_i) + \tilde{x}_{2i}^T\tilde{d}_i\Big\} - \sum_{j=1}^{6}\Gamma_{1,j}^{-1}\gamma_{2,j}\tilde{W}_j^T(L\otimes I)\tilde{W}_j$$
where $\gamma_{2,j} = \mathrm{diag}[\gamma_{2,1j},\ldots,\gamma_{2,Nj}] > 0$, $\Gamma_{1,j}^{-1} = \mathrm{diag}[\Gamma_{1,1j}^{-1},\ldots,\Gamma_{1,Nj}^{-1}] > 0$, and $\tilde{W}_j = [\tilde{W}_{1j}^T,\ldots,\tilde{W}_{Nj}^T]^T$. The Laplacian $L$ is positive semidefinite according to Assumption 1, which leads to $-\sum_{j=1}^{6}\Gamma_{1,j}^{-1}\gamma_{2,j}\tilde{W}_j^T(L\otimes I)\tilde{W}_j \leq 0$. Equation (A3) can therefore be bounded as:
$$\dot{V}_2 \leq \sum_{i=1}^{N}\Big\{\tilde{x}_i^T A\tilde{x}_i - z_{1,i}^T K_{1,i}z_{1,i} - l_i\tilde{d}_i^T\tilde{d}_i - z_{2,i}^T K_{2,i}z_{2,i} + z_{2,i}^T\tilde{d}_i + z_{2,i}^T\tilde{\varrho}_i + \sum_{j=1}^{6}\sigma_{ij}\tilde{W}_{ij}^T\hat{W}_{ij} - z_{2,i}^T\dot{\tilde{x}}_{2i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} - l_i\tilde{d}_i^T\tilde{W}_i^T S(x_i) + l_i\tilde{d}_i^T\dot{\tilde{x}}_{2i} - l_i\tilde{d}_i^T\tilde{\varrho}_i + \tilde{d}_i^T\dot{d}_i + \tilde{x}_{2i}^T\tilde{W}_i^T S(x_i) + \tilde{x}_{2i}^T\tilde{d}_i\Big\}$$
By Young’s inequality, we obtain:
$$-l_i\tilde{d}_{ij}\tilde{W}_{ij}^T S_j(x_i) \leq l_i|\tilde{d}_{ij}|\|\tilde{W}_{ij}\| \leq \frac{\kappa_1 l_i^2\tilde{d}_{ij}^2}{2} + \frac{\|\tilde{W}_{ij}\|^2}{2\kappa_1},\quad \tilde{d}_i^T\dot{d}_i \leq \frac{\kappa_2\|\tilde{d}_i\|^2}{2} + \frac{\bar{\dot{d}}_i^2}{2\kappa_2},\quad z_{2,i}^T\tilde{d}_i \leq \frac{\kappa_3\|\tilde{d}_i\|^2}{2} + \frac{\|z_{2,i}\|^2}{2\kappa_3},$$
$$z_{2,i}^T\tilde{\varrho}_i \leq \frac{\kappa_4\bar{\varrho}_i^2}{2} + \frac{\|z_{2,i}\|^2}{2\kappa_4},\quad \tilde{x}_{2,i}^T\tilde{W}_i^T S(x_i) \leq \|\tilde{x}_i\|\|\tilde{W}_i\| \leq \frac{\kappa_5\|\tilde{x}_i\|^2}{2} + \frac{\|\tilde{W}_i\|^2}{2\kappa_5},\quad \tilde{x}_{2,i}^T\tilde{d}_i \leq \frac{\kappa_6\|\tilde{d}_i\|^2}{2} + \frac{\|\tilde{x}_i\|^2}{2\kappa_6},$$
$$l_i\tilde{d}_i^T\dot{\tilde{x}}_{2i} \leq \frac{\kappa_7 l_i^2\|\tilde{d}_i\|^2}{2} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_7},\quad -z_{2,i}^T\dot{\tilde{x}}_{2i} \leq \frac{\kappa_8\|z_{2,i}\|^2}{2} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_8},\quad -l_i\tilde{d}_i^T\tilde{\varrho}_i \leq \frac{\kappa_9 l_i^2\|\tilde{d}_i\|^2}{2} + \frac{\bar{\varrho}_i^2}{2\kappa_9},$$
$$z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} \leq \frac{\kappa_{10}\|z_{1,i}\|^2}{2} + \frac{\|\Upsilon_i\tilde{x}_i\|^2}{2\kappa_{10}},\quad \sigma_{ij}\tilde{W}_{ij}^T\hat{W}_{ij} \leq -\frac{\sigma_{ij}\|\tilde{W}_{ij}\|^2}{2} + \frac{\sigma_{ij}\|W_{ij}^*\|^2}{2}$$
where $\kappa_1,\ldots,\kappa_{10}$ are positive constants and $\bar{\dot{d}}_i$ is a constant satisfying $\|\dot{d}_i\| \leq \bar{\dot{d}}_i$. Substituting (A5) into (A4) yields:
$$\dot{V}_2 \leq \sum_{i=1}^{N}\left(\sum_{j=1}^{6}\frac{\sigma_{ij}\|W_{ij}^*\|^2}{2} + \frac{\bar{\dot{d}}_i^2}{2\kappa_2} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_7} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_8} + \frac{\kappa_4\bar{\varrho}_i^2}{2} + \frac{\bar{\varrho}_i^2}{2\kappa_9}\right) - \frac{1}{2}\sum_{i=1}^{N}\left(\tilde{x}_i^T K_{\tilde{x},i}\tilde{x}_i + z_{1,i}^T K_{z_1,i}z_{1,i} + \tilde{d}_i^T K_{\tilde{d},i}\tilde{d}_i + z_{2,i}^T K_{z_2,i}z_{2,i} + \sum_{j=1}^{6}\tilde{W}_{ij}^T K_{W,i}\tilde{W}_{ij}\right)$$
where $K_{\tilde{x},i}$, $K_{z_1,i}$, $K_{\tilde{d},i}$, $K_{z_2,i}$, and $K_{W,i}$ are given as follows:
$$K_{\tilde{x},i} = -2A - \left(\kappa_5 + \frac{1}{\kappa_6} + \frac{\|\Upsilon_i\|^2}{\kappa_{10}}\right)I,\quad K_{z_1,i} = 2K_{1,i} - \kappa_{10}I,\quad K_{\tilde{d},i} = \left(2l_i - \kappa_1 l_i^2 - \kappa_2 - \kappa_3 - \kappa_6 - \kappa_7 l_i^2 - \kappa_9 l_i^2\right)I,$$
$$K_{z_2,i} = 2K_{2,i} - \left(\frac{1}{\kappa_3} + \frac{1}{\kappa_4} + \kappa_8\right)I,\quad K_{W,i} = \left(\sigma_{ij} - \frac{1}{\kappa_1} - \frac{1}{\kappa_5}\right)I$$
According to (A6) and (A1), we have:
$$\dot{V}_2 \leq -J V_2 + \delta$$
where $J = \min_{i\in\mathcal{N},\,j=1,\ldots,6}\left\{\lambda_{\min}(K_{\tilde{x},i}),\ \lambda_{\min}(K_{z_1,i}),\ \lambda_{\min}(K_{\tilde{d},i}),\ \lambda_{\min}(K_{z_2,i}),\ \lambda_{\min}(K_{W,i})\lambda_{\min}(\Gamma_{1,ij})\right\}$ and $\delta = \sum_{i=1}^{N}\left(\sum_{j=1}^{6}\frac{\sigma_{ij}\|W_{ij}^*\|^2}{2} + \frac{\bar{\dot{d}}_i^2}{2\kappa_2} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_7} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_8} + \frac{\kappa_4\bar{\varrho}_i^2}{2} + \frac{\bar{\varrho}_i^2}{2\kappa_9}\right)$.
Inequality (A7) implies that:
$$V_2 \leq V_2(0)e^{-Jt} + C$$
where $C = \delta/J$. According to (A1) and (A8), when $t$ tends to infinity, we have:
$$\|\tilde{x}_i\| \leq \sqrt{2C},\quad \|z_{1,i}\| \leq \sqrt{2C},\quad \|z_{2,i}\| \leq \sqrt{2C},\quad \|\tilde{d}_i\| \leq \sqrt{2C},\quad \|\tilde{W}_{ij}\| \leq \sqrt{\frac{2C}{\lambda_{\min}(\Gamma_{1,ij}^{-1})}}$$
According to (A8) and (A9), we have the following: (i) the transformed error converges to an adjustable compact set of radius $\sqrt{2C}$, which can be tuned through the parameters $A$, $K_{1,i}$, $K_{2,i}$, $l_i$, $\sigma_{ij}$, and $\kappa_1,\ldots,\kappa_{10}$; (ii) the state estimation error $\tilde{x}_i$, the disturbance estimation error $\tilde{d}_i$, the weight coefficient estimation error $\tilde{W}_{ij}$, and the virtual control error $z_{2,i}$ are UUB; (iii) according to (14), $e_{1,i}$ converges to the adjustable compact set in finite time. □
Proof A2. 
From Theorem 1, we know that $x_{1i}$ converges to $\eta_{r,i}$ within the predefined time $T_i$, which indicates that $x_{1i}$ is periodic for $t > T_i$ since $\eta_{r,i}$ is periodic. In addition, $\hat{x}_{2i}$ converges to $\dot{\eta}_{r,i}$ since $e_{1,i}$, $z_{1,i}$, and $z_{2,i}$ converge to a compact set near 0, which means that $\hat{x}_{2i}$ is also (nearly) periodic. Therefore, the RBFNN input $x_i = [x_{1i}^T, \hat{x}_{2i}^T]^T$ is periodic and restricted to a compact set $\Omega_{x_i}$ for $t > T_i$. According to Lemma 2, the regression subvector $S_\zeta(x_i)$ is persistently exciting.
Next, we prove that the partial weight coefficients $\hat{W}_{\zeta,ij}$ corresponding to $S_\zeta(x_i)$ converge to the optimal values $W_{\zeta,ij}^*$ when the partial PE condition is satisfied.
Based on (21), (23), and (24), we have:
$$\dot{z}_{1,i} = -K_{1,i}z_{1,i} + \Upsilon_i z_{2,i} + \Upsilon_i\tilde{x}_{2i}$$
According to (5), (28), and (32), we obtain:
$$\dot{z}_{2,i} = -K_{2,i}z_{2,i} - \Upsilon_i z_{1,i} - \dot{\tilde{x}}_{2i} + \tilde{d}_i + F_i(x_i) - \varepsilon_i - \hat{W}_i^T S(x_i)$$
where $\hat{W}_i^T S(x_i)$ can be decomposed based on the localization property of the Gaussian function:
$$\hat{W}_i^T S(x_i) = \hat{W}_{\zeta,i}^T S_\zeta(x_i) + \hat{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i)$$
where $S_\zeta(x_i)$ and $S_{\bar{\zeta}}(x_i)$ are the regression subvectors formed by the neural nodes whose centers are placed close to and far from the trajectory, respectively, and $\hat{W}_{\zeta,i}^T$ and $\hat{W}_{\bar{\zeta},i}^T$ are the corresponding weights.
Based on (33) and (A12), (A11) can be further expressed as:
$$\dot{z}_{2,i} = -K_{2,i}z_{2,i} - \Upsilon_i z_{1,i} - \dot{\tilde{x}}_{2i} + \tilde{d}_i + \tilde{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon'_{\zeta,i}$$
where $\varepsilon'_{\zeta,i}$ is the approximation error of the localized RBFNN, which is computed as:
$$\varepsilon'_{\zeta,i} = \varepsilon_{\zeta,i} - \hat{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i) - \varepsilon_i$$
where $\varepsilon_{\zeta,i}$ is the inherent error defined in Lemma 1, and $\hat{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i)$ is the output component that is not persistently exciting and that remains small according to the localization property of the Gaussian function [38]. Therefore, we can conclude that $\varepsilon'_{\zeta,i} \approx 0$ with enough nodes.
According to $\tilde{W}_{\zeta,ij} = W_{\zeta,ij}^* - \hat{W}_{\zeta,ij}$ and (31), we have:
$$\dot{\tilde{W}}_{\zeta,ij} = -\Gamma_{1,ij}\left[S_{\zeta,j}(x_i)z_{2,ij} - \sigma_{ij}\hat{W}_{\zeta,ij}\right] + \gamma_{2,ij}\sum_{k=1}^{N}a_{ik}\left(\hat{W}_{\zeta,ij} - \hat{W}_{\zeta,kj}\right)$$
Then, (A10), (A13), and (A15) can be written in the compact form:
$$\begin{bmatrix}\dot{z}_{1,i}\\ \dot{z}_{2,i}\\ \dot{\tilde{W}}_{\zeta,i,1}\\ \vdots\\ \dot{\tilde{W}}_{\zeta,i,6}\end{bmatrix} = \begin{bmatrix} -K_{1,i} & \Upsilon_i & 0\\ -\Upsilon_i & -K_{2,i} & \mathcal{S}_i\\ 0 & -\Gamma_{1,i,1}S_{\zeta,1}(x_i) & -\gamma_{2,i,1}(L\otimes I)\\ \vdots & \vdots & \vdots\\ 0 & -\Gamma_{1,i,6}S_{\zeta,6}(x_i) & -\gamma_{2,i,6}(L\otimes I)\end{bmatrix}\begin{bmatrix}z_{1,i}\\ z_{2,i}\\ \tilde{W}_{\zeta,i,1}\\ \vdots\\ \tilde{W}_{\zeta,i,6}\end{bmatrix} + \begin{bmatrix}\Upsilon_i\tilde{x}_{2i}\\ -\dot{\tilde{x}}_{2i} + \tilde{d}_i + \varepsilon'_{\zeta,i}\\ \Gamma_{1,i,1}\sigma_{i,1}\hat{W}_{\zeta,i,1}\\ \vdots\\ \Gamma_{1,i,6}\sigma_{i,6}\hat{W}_{\zeta,i,6}\end{bmatrix}$$
where $\mathcal{S}_i = \mathrm{diag}\{S_{\zeta,1}(x_i),\ldots,S_{\zeta,6}(x_i)\}$. The exponential stability of the nominal part of system (A16) is proved in [37] under the partial PE condition. In addition, $\varepsilon'_{\zeta,i} \approx 0$ from the previous discussion, $\Gamma_{1,ij}\sigma_{ij}\hat{W}_{\zeta,ij}$ can be kept small by tuning $\sigma_{ij}$, and $\tilde{x}_{2i}$ and $\tilde{d}_i$ converge to a compact set, which means that the perturbation part of (A16) is small. Consequently, $z_{1,i}$, $z_{2,i}$, and $\tilde{W}_{\zeta,i}$ converge to a small compact set, i.e., $\hat{W}_{\zeta,ij}$ converges to a neighborhood of the optimal value $W_{\zeta,ij}^*$.
When the AUH moves along the periodic reference trajectory, we have the following conclusion according to Lemma 1:
$$F_i(x_i) = W_{\zeta,i}^{*T}S_\zeta(x_i) + \varepsilon_{\zeta,i} = \hat{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon_{\zeta 1,i}$$
where $\varepsilon_{\zeta 1,i} = \tilde{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon_{\zeta,i} \approx 0$, since $\tilde{W}_{\zeta,ij} \to 0$ and $\varepsilon_{\zeta,i} \approx 0$ from the above analysis. Owing to the convergence of $\hat{W}_{\zeta,ij}$, the constant weights $\bar{W}_{\zeta,ij}$ can be obtained from (35); based on the localized representation property of the RBFNN [24], (A17) can be further expressed as:
$$F_i(x_i) = \hat{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon_{\zeta 1,i} = \bar{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon_{\zeta 2,i}$$
where $\varepsilon_{\zeta 2,i}$ is the approximation error when the experience is used, and $\varepsilon_{\zeta 2,i} \approx \varepsilon_{\zeta 1,i}$ when the system tracks the periodic reference trajectory and reaches the steady state [22], which means that $\varepsilon_{\zeta 2,i}$ is small.
For the RBFNN nodes whose centers are far away from the periodic reference trajectory, the regression subvector $S_{\bar{\zeta}}(x_i)$ is only slightly activated due to the localization property of the Gaussian function, which means that $\hat{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i)$ remains small while tracking the reference trajectory; thus, $\bar{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i)$ also remains small according to (35).
From the above discussion and (A18), the global RBFNNs $\hat{W}_i^T S(x_i)$ and $\bar{W}_i^T S(x_i)$ can both provide a locally accurate approximation of the dynamic uncertainties along the trajectory $\varphi_{\zeta,i}(x_i(t))|_{t\geq T_i}$:
$$F_i(x_i) = \bar{W}_{\zeta,i}^T S_\zeta(x_i) + \varepsilon_{\zeta 2,i} = \bar{W}_i^T S(x_i) + \bar{\varepsilon}_i$$
where $\bar{\varepsilon}_i = \varepsilon_{\zeta 2,i} - \bar{W}_{\bar{\zeta},i}^T S_{\bar{\zeta}}(x_i)$ is a small approximation error. This ends the proof. □
Proof A3. 
According to (28), (34), and (37), we have:
$$\dot{z}_{2,i} = J_1\tau_i + J_1\tau_{d,i}(t) + F_i - \dot{\alpha}_i - \dot{\tilde{x}}_{2i} = J_1\tau_i - \dot{\alpha}_i + d_i + \bar{W}_i^T S(x_i) - \dot{\tilde{x}}_{2i}$$
where $d_i = J_1\tau_{d,i}(t) + \bar{\varepsilon}_i$.
From (36) and (A20), we have:
$$\dot{\tilde{d}}_i = -l_i\tilde{d}_i + l_i\dot{\tilde{x}}_{2i} - l_i\tilde{\varrho}_i + \dot{d}_i$$
The Lyapunov function is selected as:
$$V_3 = V_1 + \sum_{i=1}^{N}\left(\frac{1}{2}z_{2,i}^T z_{2,i} + \frac{1}{2}\tilde{d}_i^T\tilde{d}_i\right)$$
whose derivative is calculated as:
$$\begin{aligned}\dot{V}_3 &= \sum_{i=1}^{N}\Big\{\tilde{x}_i^T A\tilde{x}_i + \tilde{x}_i^T B\bar{\tilde{F}}_i - z_{1,i}^T K_{1,i}z_{1,i} + z_{1,i}^T\Upsilon_i z_{2,i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} + z_{2,i}^T\left[J_1\tau_i - \dot{\alpha}_i + d_i + \bar{W}_i^T S(x_i) - \dot{\tilde{x}}_{2i}\right] + \tilde{d}_i^T\left[-l_i\tilde{d}_i + l_i\dot{\tilde{x}}_{2i} - l_i\tilde{\varrho}_i + \dot{d}_i\right]\Big\}\\ &\leq \sum_{i=1}^{N}\Big\{\tilde{x}_i^T A\tilde{x}_i - z_{1,i}^T K_{1,i}z_{1,i} - l_i\tilde{d}_i^T\tilde{d}_i - z_{2,i}^T K_{2,i}z_{2,i} + z_{2,i}^T\tilde{d}_i + z_{2,i}^T\tilde{\varrho}_i - z_{2,i}^T\dot{\tilde{x}}_{2i} + z_{1,i}^T\Upsilon_i\tilde{x}_{2,i} + l_i\tilde{d}_i^T\dot{\tilde{x}}_{2i} - l_i\tilde{d}_i^T\tilde{\varrho}_i + \tilde{d}_i^T\dot{d}_i + \tilde{x}_{2i}^T\tilde{d}_i\Big\}\end{aligned}$$
According to (A23), we have:
$$\dot{V}_3 \leq -\bar{J}V_3 + \bar{\delta}$$
where $\bar{J} = \min_{i\in\mathcal{N}}\left\{\lambda_{\min}(K_{\tilde{x},i}),\ \lambda_{\min}(K_{z_1,i}),\ \lambda_{\min}(K_{\tilde{d},i}),\ \lambda_{\min}(K_{z_2,i})\right\}$ and $\bar{\delta} = \sum_{i=1}^{N}\left(\frac{\bar{\dot{d}}_i^2}{2\kappa_2} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_7} + \frac{\|\dot{\tilde{x}}_i\|^2}{2\kappa_8} + \frac{\kappa_4\bar{\varrho}_i^2}{2} + \frac{\bar{\varrho}_i^2}{2\kappa_9}\right)$. Inequality (A24) implies that:
$$V_3 \leq V_3(0)e^{-\bar{J}t} + \bar{C}$$
where $\bar{C} = \bar{\delta}/\bar{J}$. From (A22) and (A25), when $t$ tends to infinity, the following inequalities hold:
$$\|\tilde{x}_i\| \leq \sqrt{2\bar{C}},\quad \|z_{1,i}\| \leq \sqrt{2\bar{C}},\quad \|z_{2,i}\| \leq \sqrt{2\bar{C}},\quad \|\tilde{d}_i\| \leq \sqrt{2\bar{C}}$$
Following a proof similar to that of Theorem 1, (i) $e_{1,i}$ converges to a compact set in finite time, (ii) all states of the system are UUB, and (iii) within the limits of the FTPF, the system meets the prescribed performance. The proof is thus complete. □

References

  1. Joshi, B.; Xanthidis, M.; Roznere, M.; Burgdorfer, N.J.; Mordohai, P.; Li, A.Q.; Rekleitis, I. Underwater exploration and mapping. In Proceedings of the 2022 IEEE/OES Autonomous Underwater Vehicles Symposium (AUV), Singapore, 19–21 September 2022; pp. 1–7.
  2. Mirza, J.; Kanwal, F.; Salaria, U.A.; Ghafoor, S.; Aziz, I.; Atieh, A.; Almogren, A.; Haq, A.U.; Kanwal, B. Underwater temperature and pressure monitoring for deep-sea SCUBA divers using optical techniques. Front. Phys. 2024, 12, 1417293.
  3. Mirza, J.; Atieh, A.; Kanwal, B.; Ghafoor, S.; Almogren, A.; Kanwal, F.; Aziz, I. Relay aided UWOC-SMF-FSO based hybrid link for underwater wireless optical sensor network. Opt. Fiber Technol. 2025, 89, 104045.
  4. Wibisono, A.; Piran, M.J.; Song, H.K.; Lee, B.M. A survey on unmanned underwater vehicles: Challenges, enabling technologies, and future research directions. Sensors 2023, 23, 7321.
  5. Thuyen, N.A.; Thanh, P.N.N.; Anh, H.P.H. Adaptive finite-time leader-follower formation control for multiple AUVs regarding uncertain dynamics and disturbances. Ocean Eng. 2023, 269, 113503.
  6. Wang, J.; Wang, C.; Wei, Y.; Zhang, C. Bounded neural adaptive formation control of multiple underactuated AUVs under uncertain dynamics. ISA Trans. 2020, 105, 111–119.
  7. Zhang, Y.; Wang, Q.; Shen, Y.; Dai, N.; He, B. Multi-AUV cooperative control and autonomous obstacle avoidance study. Ocean Eng. 2024, 304, 117634.
  8. Zhuang, Y.; Huang, H.; Sharma, S.; Xu, D.; Zhang, Q. Cooperative path planning of multiple autonomous underwater vehicles operating in dynamic ocean environment. ISA Trans. 2019, 94, 174–186.
  9. Li, H.; An, X.; Feng, R.; Chen, Y. Motion control of autonomous underwater helicopter based on linear active disturbance rejection control with tracking differentiator. Appl. Sci. 2023, 13, 3836.
  10. Wang, Q.; Wu, Z.; Xie, M.; Wu, F.; Huang, H. Finite-time prescribed performance trajectory tracking control for the autonomous underwater helicopter. Ocean Eng. 2023, 280, 114628.
  11. Wu, Z.; Wang, Q.; Huang, H. Low-complexity tracking for autonomous underwater helicopters with event-triggered mechanism. Ocean Eng. 2023, 280, 114633.
  12. Liu, X.; Zhang, M.; Yao, F.; Chu, Z. Observer-based region tracking control for underwater vehicles without velocity measurement. Nonlinear Dyn. 2022, 108, 3543–3560.
  13. Li, J.; Tian, Z.; Zhang, H.; Li, W. Robust finite-time control of a multi-AUV formation based on prescribed performance. J. Mar. Sci. Eng. 2023, 11, 897.
  14. Gao, Z.; Guo, G. Fixed-time sliding mode formation control of AUVs based on a disturbance observer. IEEE/CAA J. Automat. Sin. 2020, 7, 539–545.
  15. Li, X.; Qin, H.; Li, L. Fixed-time formation control for AUVs with unknown actuator faults based on lumped disturbance observer. Ocean Eng. 2023, 269, 113495.
  16. Li, Z.; Wang, M.; Ma, G.; Zou, T. Adaptive reinforcement learning fault-tolerant control for AUVs with thruster faults based on the integral extended state observer. Ocean Eng. 2023, 271, 113722.
  17. Kong, S.; Sun, J.; Qiu, C.; Wu, Z.; Yu, J. Extended State Observer-Based Controller with Model Predictive Governor for 3-D Trajectory Tracking of Underactuated Underwater Vehicles. IEEE Trans. Ind. Inf. 2021, 17, 6114–6124.
  18. Fang, K.; Fang, H.; Zhang, J.; Yao, J.; Li, J. Neural adaptive output feedback tracking control of underactuated AUVs. Ocean Eng. 2021, 234, 109211.
  19. Wu, Z.; Wang, Q.; Huang, H. Adaptive neural networks trajectory tracking control for autonomous underwater helicopters with prescribed performance. Ocean Eng. 2022, 264, 112519.
  20. Wang, J.; Wang, C.; Wei, Y.; Zhang, C. Observer-Based Neural Formation Control of Leader-Follower AUVs with Input Saturation. IEEE Syst. J. 2021, 15, 2553–2561.
  21. Zhang, Y.; Xu, O. Adaptive Backstepping Axial Position Tracking Control of Autonomous Undersea Vehicles with Deferred Output Constraint. Appl. Sci. 2023, 13, 2219.
  22. Wang, C.; Hill, D.J. Learning from neural control. IEEE Trans. Neural Netw. 2006, 17, 130–146.
  23. Yuan, C.; Licht, S.; He, H. Formation Learning Control of Multiple Autonomous Underwater Vehicles with Heterogeneous Nonlinear Uncertain Dynamics. IEEE Trans. Cybern. 2018, 48, 2920–2934.
  24. Dai, S.-L.; He, S.; Ma, Y.; Yuan, C. Cooperative Learning-Based Formation Control of Autonomous Marine Surface Vessels with Prescribed Performance. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 2565–2577.
  25. Yaghmaie, F.A.; Modares, H.; Gustafsson, F. Reinforcement Learning for Partially Observable Linear Gaussian Systems Using Batch Dynamics of Noisy Observations. IEEE Trans. Autom. Control 2024, 69, 6379–6404.
  26. Nguyen, K.; Dang, V.T.; Pham, D.D.; Dao, P.N. Formation control scheme with reinforcement learning strategy for a group of multiple surface vehicles. Int. J. Robust. Nonlinear Control 2024, 34, 2252–2279.
  27. Huang, H.; He, W.; Li, J.; Xu, B.; Yang, C.; Zhang, W. Disturbance Observer-Based Fault-Tolerant Control for Robotic Systems with Guaranteed Prescribed Performance. IEEE Trans. Cybern. 2022, 52, 772–783.
  28. Li, Z.; Ma, Y.; Yue, D.; Zhao, J. Adaptive Tracking for Uncertain Switched Nonlinear Systems with Prescribed Performance Under Slow Switching. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 7279–7288.
  29. Liu, L.; Liu, Y.-J.; Tong, S. Fuzzy-Based Multierror Constraint Control for Switched Nonlinear Systems and Its Applications. IEEE Trans. Fuzzy Syst. 2019, 27, 1519–1531.
  30. Xu, Z.; Sun, C.; Liu, Q. Output-Feedback Prescribed Performance Control for the Full-State Constrained Nonlinear Systems and Its Application to DC Motor System. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 3898–3907.
  31. Qiu, J.; Wang, T.; Sun, K.; Rudas, I.J.; Gao, H. Disturbance Observer-Based Adaptive Fuzzy Control for Strict-Feedback Nonlinear Systems with Finite-Time Prescribed Performance. IEEE Trans. Fuzzy Syst. 2022, 30, 1175–1184.
  32. Sui, S.; Chen, C.L.P.; Tong, S. A Novel Adaptive NN Prescribed Performance Control for Stochastic Nonlinear Systems. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 3196–3205.
  33. Wang, H.; Bai, W.; Zhao, X.; Liu, P.X. Finite-Time-Prescribed Performance-Based Adaptive Fuzzy Control for Strict-Feedback Nonlinear Systems with Dynamic Uncertainty and Actuator Faults. IEEE Trans. Cybern. 2022, 52, 6959–6971.
  34. Dai, S.-L.; Wang, M.; Wang, C. Neural Learning Control of Marine Surface Vessels with Guaranteed Transient Tracking Performance. IEEE Trans. Ind. Electron. 2016, 63, 1717–1727.
  35. Fossen, T.I. Handbook of Marine Craft Hydrodynamics and Motion Control; John Wiley & Sons: Hoboken, NJ, USA, 2011.
  36. Yi, S.; Wang, J.; Li, B. Composite backstepping control with finite-time convergence. Optik 2017, 142, 260–272.
  37. Xie, M.; Wu, Z.; Huang, H. Low-complexity formation control of marine vehicle system based on prescribed performance. Nonlinear Dyn. 2024, 112, 18311–18332.
  38. Wang, C.; Hill, D.J. Deterministic Learning Theory for Identification, Recognition, and Control; CRC Press: Boca Raton, FL, USA, 2009.
Figure 1. Autonomous underwater helicopter (AUH).
Figure 2. The shape and trajectory of the AUH formation.
Figure 3. The tracking error. (a) Surge. (b) Lateral. (c) Heave. (d) Pitch. (e) Roll. (f) Yaw.
Figure 4. The NN approximation error. (a) AUH 1 in translational degrees, (b) AUH 1 in rotational degrees, (c) AUH 2 in translational degrees, and (d) AUH 2 in rotational degrees.
Figure 5. The observer estimation for the high-order state using the NN-SO and the ESO. (a) Surge. (b) Lateral. (c) Heave. (d) Pitch. (e) Roll. (f) Yaw.
Figure 6. The observer estimation for the disturbance using the DO and the ESO. (a) Estimation error in translational degrees. (b) Estimation error in rotational degrees.
Figure 7. The tracking error using the PPC method and the FTPPC method. (a) Tracking error in translational degrees. (b) Tracking error in rotational degrees.
Figure 8. Comparison with state-of-the-art formation control [37]. (a) Tracking error in surge. (b) Tracking error in lateral. (c) Tracking error in heave. (d) Tracking error in yaw.
Table 1. Comparison of related works on formation control techniques.
Study | Dynamic Model Consideration | Robustness to Disturbances | Learning Mechanism | Performance Metrics
[12,18,22] | Unknown dynamics | Moderate | None | UUB errors
[13] | Feedback linearization | High | None | Finite-time convergence
[14,15] | Unknown dynamics, external disturbances | High | None | Fixed-time convergence
[16] | Thruster faults, unknown disturbances | Moderate | Reinforcement learning | UUB errors
[17,27,29] | Unknown disturbances | Moderate | None | UUB errors
[19] | Thruster faults, unknown disturbances | Low | None | Specified convergence time
[20] | Unknown dynamics, input saturation | Low | None | UUB errors
[23] | Uncertainties | High | Deterministic learning | UUB errors
[24] | Unknown disturbances, uncertainties | High | Cooperative learning | UUB errors
[26] | Dynamic models | Moderate | Reinforcement learning | UUB errors
[30] | Full-state constraints, disturbances | Moderate | None | Specified convergence time
Proposed | Unknown dynamics, uncertainties, external disturbances | High | Experience-based learning | Specified convergence time
Table 2. The initial information on the AUH formation.
Terms | Values
The initial states | $\eta_i(0) = [20, 30, 0, 0, 0, 0]^T$, $\nu_i(0) = [0, 0, 0, 0, 0, 0]^T$
The reference trajectory of the virtual leader $\eta_d$ | $\eta_d = [30\sin(0.1t) + 20\cos(0.1t),\ 20\sin(0.1t) - 30\cos(0.1t),\ 0,\ 0,\ 0,\ 0.1t]^T$
The external disturbance $\tau_{d,i}$ | $\tau_{d,i} = [20 + 5\cos(0.2t + \pi/4),\ 15 + 10\sin(0.2t),\ 0,\ 5\cos(0.2t + \pi/4),\ 0,\ 10 + 5\sin(0.5t)]^T$
Table 3. The comparison experiment results.
Methods | The Adaptive NN-Based Control Method | The Experience-Based Control Method
Simulation time setting (s) | 200 | 200
Actual running time (s) | 97.62 | 83.67
Tracking error in translational degrees (m) | <0.03 | <0.03
Tracking error in rotational degrees (rad) | <0.03 | <0.03
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

