Distributed Path Tracking for Autonomous Underwater Vehicles Based on Pseudo Position Feedback

Gao, Huanli; Li, Wei; Cai, He; Gu, Zekai

doi:10.3390/jmse10101477

by Huanli Gao, Wei Li, He Cai* and Zekai Gu

School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China

*Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2022, 10(10), 1477; https://doi.org/10.3390/jmse10101477

Submission received: 1 September 2022 / Revised: 29 September 2022 / Accepted: 2 October 2022 / Published: 11 October 2022

(This article belongs to the Section Ocean Engineering)

Abstract:

In this paper, we consider the distributed polynomial path tracking problem for a swarm of autonomous underwater vehicles (AUVs) modeled by second-order uncertain multi-agent systems. The application scenario has three distinguishing characteristics. First, the communication network of the multi-agent system is unreliable and switching; under the jointly connected condition, the network may be disconnected at every time instant. Second, only the relative positions between AUVs are assumed to be available for trajectory tracking control. Third, the AUV dynamics are subject to uncertain system parameters. By applying the cooperative output regulation framework, a novel distributed robust control scheme consisting of three parts is proposed to solve the distributed path tracking problem. First, to cope with the communication network uncertainty, a distributed observer is invoked to recover the polynomial path for each AUV. Second, based on the relative position measurements between AUVs, a pseudo position estimator is adopted to generate a pseudo position for each AUV. Finally, based on the estimated polynomial path and the pseudo position, a certainty equivalent robust internal model control law is synthesized to achieve asymptotic reference trajectory tracking, where the internal model compensator handles the uncertain system parameters. Numerical simulations are provided to validate the effectiveness of the proposed control scheme.

Keywords:

autonomous underwater vehicle; cooperative output regulation; distributed path tracking; multi-agent system; switching communication network

1. Introduction

Over the past two decades, tremendous attention has been paid to the cooperative control of swarm systems, which has led to fruitful research results [1,2,3], and has found broad applications in the area of unmanned swarm systems, such as collaborative searching, escorting, localization and mapping by unmanned aerial vehicles [4,5,6] and unmanned mobile robots [7,8,9]. In particular, the motion in formation for unmanned swarm systems is the foundation of many tasks, where all of the individuals in the swarm are supposed to cooperate to maintain some desired relative position and/or orientation with respect to each other while following some common reference path. The existing control approaches dealing with the formation problem can be classified roughly into three categories [10]. The first category is the behavior-based approach, which was first proposed by Balch and Arkin in [11] for four kinds of formation, namely, line, column, wedge and diamond. The guidance and formation are fulfilled by designing and weighting motor schemas. Typical works on the behavior-based approach can be further found in [12,13]. The second category is the leader–follower approach, which defines the global formation by a series of pre-specified leader-following tracking problems. Since the classic leader-following tracking problem has been well treated in the literature, the leader–follower approach has become an effective way to solve the formation problem [14,15]. The third category is the virtual structure approach, where the formation is defined by the static or dynamic relative position/orientation of each agent with respect to a virtual leader. Recently, owing to the application of graph theory in the control of multi-agent systems, the virtual structure approach has become the most popular control strategy for distributed formation control over sparse and vulnerable communication networks [16].

The rapid development of autonomous underwater vehicles (AUVs) has greatly enabled underwater tasks, such as environment inspection, military surveillance, oceanographic observation, site searching and so on [17,18,19]. In contrast to a single AUV, AUV swarms show a better system reliability against failure and system adaptability for complex tasks [20,21,22]. There have thus far been extensive works devoted to the formation control of an AUV swarm. A group of torpedo-type AUVs was investigated in [23] under the centralized management by an unmanned surface vessel, constituting a star-like command and communication system, with the unmanned surface vessel being the center. The formation is achieved by assigning each AUV its corresponding reference trajectory. Both references [24,25] adopted the leader–follower control structure. Environmental disturbances and input saturation were considered in [24], tackled by dynamic surface control and a hyperbolic tangent function, respectively. To simultaneously deal with communication delay, packet discreteness and dropout, a curve fitting method was developed by [25] to make a prediction for the states of the AUV. On the other hand, the virtual structure approach based on graph theory was invoked in [26,27,28,29,30,31]. Li et al. considered prescribed performance control for underactuated AUVs subject to uncertain dynamics and disturbances [26]. By using radial basis functions, the lumped uncertainties of the entire system were transformed into a linearly parameterized form with a single uncertain parameter. Different from [26], model predictive control and an extended state observer were employed in [27] to cope with unknown ocean current disturbances. Time delay issues were discussed in [28,29]. Both bounded and unbounded communication delays were considered in [28], and two separate communication networks for both position and velocity were implemented to realize formation trajectory tracking. 
A consistent control algorithm was proposed in [29] to deal with communication delays using the Gershgorin disk theorem and the Nyquist criterion. To recover the information of the global trajectory, distributed observers were employed in [30,31]. A neural-network-based deterministic learning approach was established in [30] to tackle the nonlinear uncertain dynamics of the AUVs. To reduce the risk of actuator saturation, a hyperbolic tangent function was adopted in [31] to design a decentralized formation tracking control law that can also handle system nonlinearities and measurement noise.

From the perspective of the system structure, swarm systems can be categorized as a leaderless swarm and leader–follower swarm, where the formation control of swarms is mainly involved with the latter one by viewing the formation command as the leader. The cooperative output regulation theory proves to be an effective control framework tackling the distributed control of leader–follower swarm systems [32], which can also be viewed as an extension of the virtual structure approach. In the cooperative output regulation framework, the exosystem is viewed as the leader, and each agent is viewed as a follower. The control objectives are twofold: (1) the stability of the closed-loop system should be guaranteed; (2) the output of each agent should track a class of reference signals while rejecting a class of external disturbances. So far, there have been some attempts applying the cooperative output regulation theory to solve the distributed formation problem [33,34,35,36,37]. Wang considered a linear swarm system over a static communication network [33]. By taking the system matrix of the exosystem as prior knowledge, a distributed internal model was constructed for each agent utilizing virtual errors. Hua et al. studied the case where the virtual leader has an external input [34]. A sign-function-based distributed observer was proposed to recover the state of the virtual leader. Like [33], the system matrix of the virtual leader should be known in advance so as to obtain the solution to the regulator equations. The situation of multiple virtual leaders was considered in [35], and the formation problem was integrated with the objective of containment control. Through a thorough analysis on the Laplacian of the associated communication graph, an adaptive distributed control law was proposed to solve the formation problem. Similar to [33], Li et al. 
also made use of the virtual-error-based distributed internal model [36], while both the swarm system and the virtual leader considered in [36] are nonlinear. Local stability results were obtained by performing Jacobian linearization around the origin of the closed-loop system. In practice, the communication network for the swarm might not always be safe or reliable. Huang and Dong investigated the scenario of false data injection into the communication network [37]. By adopting linear matrix inequality techniques, a robust output regulation control law was proposed so that the tracking errors can be guaranteed to be uniformly ultimately bounded under uniform feedback quantization.

In this paper, we consider the distributed path tracking problem for a swarm of AUVs modeled by second-order multi-agent systems. The AUV swarm should track a class of polynomial path signals while keeping a desired relative position with respect to each other. Different from the existing results, the application scenario of this paper has three distinguished characteristics. First, the unreliable communication network for the multi-agent system was modeled as a switching graph satisfying the jointly connected condition, which allows the communication network topology to be disconnected the entire time. Second, in an underwater environment, absolute position measurement might not be feasible. To address this issue, in this paper, we consider the case where only the relative position between AUVs can be obtained for trajectory tracking control. Third, in the presence of uncertain mass, inertia and velocity damping, the AUV dynamics are assumed to contain uncertain system parameters. By applying the cooperative output regulation control framework, a novel distributed robust control scheme is proposed to solve the distributed path tracking problem, which consists of three parts. First, to cope with communication network uncertainty, the distributed observer was invoked to recover the polynomial path for each AUV, which decouples the dynamics of the AUVs and the virtual leader, thus making it easy to conceive a feasible distributed control scheme under the unreliable communication environment. Second, based on the relative position measurement between AUVs, a pseudo position estimator was adopted to generate the pseudo position for each AUV. 
The interesting fact regarding the pseudo position lies in that, though it will not converge to the absolute position of the AUV, the differences between the absolute positions and pseudo positions of all of the AUVs will converge to a common constant vector, which paves the way to the success of the proposed distributed control scheme. Finally, based on the estimated polynomial path and the pseudo position, a certainty equivalent robust internal model control law was synthesized to achieve asymptotic reference trajectory tracking, where the internal model compensator aims to tackle uncertain system parameters. It was rigorously proven that the distributed path tracking problem can be solved by the proposed distributed robust control scheme. In contrast to the existing results, the main contributions of this paper are twofold:

(1) In terms of communication topology, reference [23] adopted a centralized structure with the unmanned surface vessel as the communication center, whereas [24,25] adopted the leader–follower approach, whose communication topology is essentially a tree. In [26,27,28,29,30,31], based on graph theory, the communication topology can be freely designed as long as the associated communication graph is connected. However, all of these works require the communication topology to be connected at all times, which might be impractical in certain application scenarios. In contrast, this paper only requires the communication network to be jointly connected, so that it may be disconnected at every time instant, which greatly relaxes the requirement imposed on the communication network.

(2) In [23,24,25,26,27,28,29,30,31], absolute position feedback of the AUV is necessary to stabilize the closed-loop dynamics so that the tracking or formation objective can be fulfilled. However, as indicated by [18,20,22], the global localization of AUVs can be costly in an underwater environment and is sometimes impossible, for example in the deep ocean. To address this issue, the proposed control method only needs relative position measurements between neighboring AUVs over the communication network to achieve reference path tracking and maintain the relative formation, which makes it more practical and cost-competitive than the existing results.

2. Graph Notation

A graph $G = (V, E)$ is defined by a node set $V = \{1, \dots, N\}$ and an edge set $E \subseteq V \times V$. For $i, j = 1, 2, \dots, N$, $i \neq j$, $(i, j) \in E$ means that there exists an edge in $E$ from node $i$ to node $j$. If $(i, j) \in E$, then node $i$ is called a neighbor of node $j$. Let $N_{i} = \{j : (j, i) \in E\}$ denote the neighbor set of node $i$. If $(i, j) \in E$ if and only if $(j, i) \in E$, then the edge $(i, j)$ is called undirected. If all of the edges of a graph are undirected, then the graph is called undirected. If $G$ contains a set of edges of the form $(i_{1}, i_{2}), (i_{2}, i_{3}), \dots, (i_{k}, i_{k + 1})$, then the set $\{(i_{1}, i_{2}), (i_{2}, i_{3}), \dots, (i_{k}, i_{k + 1})\}$ is called a path of $G$ from node $i_{1}$ to node $i_{k + 1}$, and node $i_{k + 1}$ is said to be reachable from node $i_{1}$. A graph $G$ is said to contain a spanning tree if there exists a node in $G$ such that all of the other nodes are reachable from it, and this node is called the root of the spanning tree. Given a set of $m$ graphs $G_{k} = (V, E_{k})$, $k = 1, \dots, m$, the graph $G = (V, E)$ with $E = \bigcup_{k = 1}^{m} E_{k}$ is called the union of $G_{k}$, denoted by $G = \bigcup_{k = 1}^{m} G_{k}$.

A time signal $\sigma(t) : [0, +\infty) \to M = \{1, \dots, m\}$ for some positive integer $m$ is called a piecewise constant switching signal with dwell time $\tau$ for some $\tau > 0$ if there exists a time sequence $\{t_{k}, k = 0, 1, 2, \dots\}$ satisfying $t_{0} = 0$; for any positive integer $k$, $t_{k} - t_{k - 1} \geq \tau$; and $\sigma(t) = p$, $p \in M$, for all $t \in [t_{k - 1}, t_{k})$. Given a node set $V = \{1, \dots, N\}$ and a piecewise constant switching signal $\sigma(t)$, define a switching graph $G_{\sigma(t)} = (V, E_{\sigma(t)})$ where $E_{\sigma(t)} \subseteq V \times V$ for all $t \geq 0$. For a switching graph, let $N_{i}(t)$ denote the neighbor set of node $i$ at time instant $t$. Associated with a switching graph $G_{\sigma(t)}$, the matrix $A_{\sigma(t)} = [a_{ij}(t)] \in \mathbb{R}^{N \times N}$ is called a time-varying weighted adjacency matrix of $G_{\sigma(t)}$ if $a_{ii}(t) = 0$; $a_{ij}(t) > 0 \Leftrightarrow (j, i) \in E_{\sigma(t)}$; and $a_{ij}(t) = 0$ otherwise. Let $L_{\sigma(t)} = [l_{ij}(t)] \in \mathbb{R}^{N \times N}$ be such that $l_{ii}(t) = \sum_{j = 1}^{N} a_{ij}(t)$ and $l_{ij}(t) = -a_{ij}(t)$ if $i \neq j$. Then, $L_{\sigma(t)}$ is called the Laplacian of $G_{\sigma(t)}$ associated with $A_{\sigma(t)}$.
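The adjacency and Laplacian conventions above can be sketched in a few lines of Python. This is only an illustration for a single graph mode: the function names, the dictionary layout for weighted edges and the example weights are assumptions, not part of the paper.

```python
import numpy as np

def adjacency_from_edges(num_nodes, weighted_edges):
    """Weighted adjacency matrix with a_ij > 0 iff (j, i) is an edge.

    weighted_edges maps an edge (j, i), meaning "from node j to node i",
    to its positive weight; nodes are numbered 1..num_nodes as in the text.
    """
    adj = np.zeros((num_nodes, num_nodes))
    for (j, i), weight in weighted_edges.items():
        adj[i - 1, j - 1] = weight          # row i receives from column j
    return adj

def laplacian(adj):
    """l_ii = sum_j a_ij and l_ij = -a_ij for i != j."""
    return np.diag(adj.sum(axis=1)) - adj

# Illustrative graph: an undirected edge between nodes 1 and 2,
# and a directed edge from node 3 to node 2.
edges = {(1, 2): 2.0, (2, 1): 2.0, (3, 2): 0.5}
A = adjacency_from_edges(3, edges)
Lap = laplacian(A)
```

Every row of the resulting Laplacian sums to zero, which is the property exploited by the consensus-type estimators later in the paper.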

3. Problem Statement

In this paper, we consider the distributed polynomial path tracking control problem for an AUV swarm system consisting of $N$ AUVs. As in [38], we suppose that the AUVs are fully actuated. Then, for $i = 1, \dots, N$, the dynamics of the $i$th AUV take the following form:

$$\dot{p}_{i} = v_{i} \tag{1a}$$

$$\dot{v}_{i} = \gamma_{i1} p_{i} + \gamma_{i2} v_{i} + \gamma_{i3} u_{i} \tag{1b}$$

where $p_{i}, v_{i}, u_{i} \in \mathbb{R}^{n}$ denote the generalized position, velocity and control input of the $i$th AUV, respectively, and $\gamma_{i1}, \gamma_{i2}, \gamma_{i3} \in \mathbb{R}$ are unknown system parameters satisfying $\gamma_{ij} = \gamma_{ij}^{o} + \Delta\gamma_{ij}$, with $\gamma_{ij}^{o}$ and $\Delta\gamma_{ij}$ denoting the nominal part and the uncertain part of $\gamma_{ij}$, respectively, $j = 1, 2, 3$. Without loss of generality, suppose $\gamma_{i3}^{o} \neq 0$. For $i = 1, \dots, N$, let $w_{i} = \mathrm{col}(\Delta\gamma_{i1}, \Delta\gamma_{i2}, \Delta\gamma_{i3}) \in \mathbb{R}^{3}$ denote the vector of uncertain parameters of the $i$th AUV. Obviously, $w_{i} = 0$ if and only if $\gamma_{ij} = \gamma_{ij}^{o}$ for $j = 1, 2, 3$, i.e., there is no system uncertainty. In what follows, let $W \subseteq \mathbb{R}^{3}$ be a compact set containing the origin of $\mathbb{R}^{3}$.

For $i = 1, \dots, N$, define the state of the $i$th AUV as $x_{i}(t) = \mathrm{col}(p_{i}(t), v_{i}(t))$. Then, system (1) can be rewritten in the following compact form:

$$\dot{x}_{i} = A_{i} x_{i} + B_{i} u_{i} \tag{2a}$$

$$p_{i} = C_{i} x_{i} \tag{2b}$$

where

$$A_{i} = \begin{bmatrix} 0 & I_{n} \\ \gamma_{i1} I_{n} & \gamma_{i2} I_{n} \end{bmatrix}, \quad B_{i} = \begin{bmatrix} 0 \\ \gamma_{i3} I_{n} \end{bmatrix}, \quad C_{i} = \begin{bmatrix} I_{n} & 0 \end{bmatrix}.$$

Here, $I_{n}$ denotes the $n$-dimensional identity matrix.

Moreover, define the nominal parts of the system matrices $A_{i}$ and $B_{i}$ as follows:

$$A_{i}^{o} = \begin{bmatrix} 0 & I_{n} \\ \gamma_{i1}^{o} I_{n} & \gamma_{i2}^{o} I_{n} \end{bmatrix}, \quad B_{i}^{o} = \begin{bmatrix} 0 \\ \gamma_{i3}^{o} I_{n} \end{bmatrix}.$$

Since $\gamma_{i3}^{o} \neq 0$, it can be easily verified that $(A_{i}^{o}, B_{i}^{o})$ is controllable.
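For completeness, the controllability claim can be checked directly from the first two blocks of the controllability matrix. The short derivation below is a sketch of that standard argument, written with the nominal matrices defined above.

```latex
\begin{bmatrix} B_{i}^{o} & A_{i}^{o} B_{i}^{o} \end{bmatrix}
= \begin{bmatrix} 0 & \gamma_{i3}^{o} I_{n} \\
                  \gamma_{i3}^{o} I_{n} & \gamma_{i2}^{o} \gamma_{i3}^{o} I_{n} \end{bmatrix}
\in \mathbb{R}^{2n \times 2n}
% The anti-diagonal blocks \gamma_{i3}^{o} I_{n} are invertible because
% \gamma_{i3}^{o} \neq 0, so this square block matrix is nonsingular and
\mathrm{rank} \begin{bmatrix} B_{i}^{o} & A_{i}^{o} B_{i}^{o} \end{bmatrix} = 2n .
```

Since these $2n$ columns already span $\mathbb{R}^{2n}$, the full controllability matrix also has rank $2n$.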

Consider the following polynomial path:

$$r_{0}(t) = a_{m} t^{m} + a_{m - 1} t^{m - 1} + \dots + a_{1} t + a_{0} \tag{3}$$

where $m$ is some non-negative integer, and $a_{i} \in \mathbb{R}^{n}$, $i = 0, 1, \dots, m$, are constant vectors.

The distributed polynomial path tracking of the swarm system (2) is defined in the following way. For each AUV, denote the local formation command by $r_{fi} \in \mathbb{R}^{n}$, which defines the desired relative position of the $i$th AUV with respect to the polynomial path. Then, the absolute trajectory tracking error for the $i$th AUV is defined by

$$e_{i} = p_{i} - r_{0} - r_{fi}. \tag{4}$$

Note that an important property of the polynomial path (3) is that it can be generated by a virtual leader system of the following form:

$$\dot{\ell}_{0} = \Xi \ell_{0} \tag{5a}$$

$$r_{0} = \Pi \ell_{0} \tag{5b}$$

where

$$\Xi = I_{n} \otimes \xi, \quad \Pi = I_{n} \otimes \pi \tag{6}$$

with

$$\xi = \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ 0 & 0 & 0 & \cdots & 0 \end{bmatrix} \in \mathbb{R}^{(m + 1) \times (m + 1)}, \quad \pi = \begin{bmatrix} 1 & 0 & 0 & \cdots & 0 \end{bmatrix} \in \mathbb{R}^{1 \times (m + 1)}. \tag{7}$$

Here, $\otimes$ denotes the Kronecker product of matrices, and $\ell_{0} \in \mathbb{R}^{n(m + 1)}$ denotes the internal state of the virtual leader. Note that $(\pi, \xi)$ is observable, and so is $(\Pi, \Xi)$.
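To make the link between (3) and (5) concrete, the following sketch builds the matrices in (6) and (7), chooses an initial leader state from the coefficients of the path, and evaluates the leader output. The function names and the example coefficients are illustrative assumptions. The initial state is chosen as $k!\,a_{k}$ in the $k$th entry of each coordinate block, which follows from the fact that $\xi$ is nilpotent, so its matrix exponential is a finite series.

```python
import math
import numpy as np

def leader_matrices(n, m):
    """Return xi, pi of (7) and Xi, Pi of (6)."""
    xi = np.diag(np.ones(m), k=1)                 # ones on the superdiagonal
    pi = np.zeros((1, m + 1))
    pi[0, 0] = 1.0
    return xi, pi, np.kron(np.eye(n), xi), np.kron(np.eye(n), pi)

def expm_nilpotent(xi, t):
    """exp(xi * t) as a finite series, exact because xi^(m+1) = 0."""
    size = xi.shape[0]
    out = np.zeros_like(xi)
    term = np.eye(size)
    for k in range(size):
        out = out + term
        term = term @ xi * t / (k + 1)
    return out

def leader_initial_state(coeffs):
    """coeffs[k] is a_k in R^n; block c of the state is (0! a_0[c], ..., m! a_m[c])."""
    coeffs = np.asarray(coeffs, dtype=float)      # shape (m + 1, n)
    scale = np.array([math.factorial(k) for k in range(coeffs.shape[0])], dtype=float)
    return (coeffs * scale[:, None]).T.reshape(-1)

def leader_output(coeffs, t):
    """r_0(t) = Pi exp(Xi t) l_0(0) for the virtual leader (5)."""
    coeffs = np.asarray(coeffs, dtype=float)
    m, n = coeffs.shape[0] - 1, coeffs.shape[1]
    xi, _, _, Pi = leader_matrices(n, m)
    state_t = np.kron(np.eye(n), expm_nilpotent(xi, t)) @ leader_initial_state(coeffs)
    return Pi @ state_t

# Illustrative quadratic path in the plane: r_0(t) = a_0 + a_1 t + a_2 t^2.
a = [[1.0, 0.0], [2.0, -1.0], [0.5, 3.0]]
r_at_t = leader_output(a, 1.5)
```

Evaluating the polynomial directly at the same time instant gives the same vector, which is a quick sanity check on the choice of initial state.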

Remark 1.

In this paper, we consider the case where the polynomial path cannot be known in advance by any AUV. Instead, it will be estimated in a distributed way by each AUV depending solely on neighboring information exchange by the distributed observer.

The communication network for the virtual leader system (5) and the swarm system (2) is described by a switching graph $\bar{G}_{\sigma(t)} = (\bar{V}, \bar{E}_{\sigma(t)})$ with $\bar{V} = \{0, 1, \dots, N\}$ and $\bar{E}_{\sigma(t)} \subseteq \{(i, j) : i, j \in \bar{V}, i \neq j\}$. Here, node 0 is associated with the virtual leader, and node $i$, $i = 1, \dots, N$, is associated with the $i$th AUV. For $i, j = 1, \dots, N$, $i \neq j$, $(i, j) \in \bar{E}_{\sigma(t)}$ if and only if the $j$th AUV can receive information from the $i$th AUV. Moreover, for $i = 1, \dots, N$, $(0, i) \in \bar{E}_{\sigma(t)}$ if and only if the $i$th AUV can receive information from the virtual leader. Let the weighted adjacency matrix of the digraph $\bar{G}_{\sigma(t)}$ be $\bar{A}_{\sigma(t)} = [a_{ij}(t)] \in \mathbb{R}^{(N + 1) \times (N + 1)}$, $i, j = 0, 1, \dots, N$. Define a subgraph $G_{\sigma(t)}$ of $\bar{G}_{\sigma(t)}$ as $G_{\sigma(t)} = (V, E_{\sigma(t)})$ with $V = \{1, \dots, N\}$ and $E_{\sigma(t)} = \bar{E}_{\sigma(t)} \cap (V \times V)$. Let $L_{\sigma(t)}$ be the Laplacian of $G_{\sigma(t)}$ and $H_{\sigma(t)} = L_{\sigma(t)} + \mathrm{diag}\{a_{10}(t), \dots, a_{N0}(t)\}$.

The following assumptions are imposed on the communication graphs $\bar{G}_{\sigma(t)}$ and $G_{\sigma(t)}$, respectively.

Assumption 1.

There exists a subsequence $\{\varpi_{k} : k = 0, 1, 2, \dots\}$ of $\{k = 0, 1, 2, \dots\}$ satisfying $t_{\varpi_{k + 1}} - t_{\varpi_{k}} < \nu_{\varpi}$ for some $\nu_{\varpi} > 0$, such that every node $i$, $i = 1, \dots, N$, is reachable from node 0 in the union graph $\bigcup_{r = \varpi_{k}}^{\varpi_{k + 1} - 1} \bar{G}_{\sigma(t_{r})}$.

Assumption 2.

The switching graph $G_{\sigma(t)}$ is undirected. Moreover, there exists a subsequence $\{\vartheta_{k} : k = 0, 1, 2, \dots\}$ of $\{k = 0, 1, 2, \dots\}$ satisfying $t_{\vartheta_{k + 1}} - t_{\vartheta_{k}} < \nu_{\vartheta}$ for some $\nu_{\vartheta} > 0$, such that the union graph $\bigcup_{r = \vartheta_{k}}^{\vartheta_{k + 1} - 1} G_{\sigma(t_{r})}$ contains a spanning tree.

Remark 2.

Both Assumptions 1 and 2 are referred to as the jointly connected condition in the literature [32], where Assumption 1 is used for modeling the communication networks for leader–follower-type multi-agent systems, whereas Assumption 2 is used for modeling the communication networks for leaderless-type multi-agent systems. The jointly connected condition is possibly the mildest condition ever imposed on communication networks, and can tolerate the extreme case where the communication network topology is disconnected for all of the time instants.
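The extreme case mentioned in Remark 2 can be illustrated with a small reachability check. In the sketch below, the three edge sets, the node numbering and the function names are hypothetical; each individual graph fails to connect the leader (node 0) to every AUV, while their union over one window does.

```python
def reachable_from(root, edges):
    """Nodes reachable from root along directed edges (i, j), read as i -> j."""
    seen, stack = {root}, [root]
    while stack:
        node = stack.pop()
        for (src, dst) in edges:
            if src == node and dst not in seen:
                seen.add(dst)
                stack.append(dst)
    return seen

def union_reaches_all(edge_sets, nodes, root=0):
    """True if every node is reachable from root in the union of the graphs."""
    union = set().union(*edge_sets)
    return reachable_from(root, union) == set(nodes)

# Three graph modes visited within one window of the switching signal.
nodes = {0, 1, 2, 3}
mode_1 = {(0, 1)}                 # only AUV 1 hears the leader
mode_2 = {(1, 2), (2, 1)}         # AUVs 1 and 2 talk to each other
mode_3 = {(2, 3), (3, 2)}         # AUVs 2 and 3 talk to each other

each_alone = [union_reaches_all([mode], nodes) for mode in (mode_1, mode_2, mode_3)]
all_together = union_reaches_all([mode_1, mode_2, mode_3], nodes)
```

Here `each_alone` contains only `False` entries while `all_together` is `True`, which is the situation that Assumption 1 permits.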

Now, we are ready to formulate the polynomial path tracking problem as follows.

Problem 1.

Given systems (2), (5) and the communication graph $\bar{G}_{\sigma(t)}$, design a distributed control law $u_{i}$ of the following form:

$$u_{i} = f_{i}(v_{i}, \varsigma_{i}, \varsigma_{j}, p_{j} - p_{i}, \; j \in N_{i}(t)) \tag{8a}$$

$$\dot{\varsigma}_{i} = g_{i}(v_{i}, \varsigma_{i}, \varsigma_{j}, p_{j} - p_{i}, \; j \in N_{i}(t)) \tag{8b}$$

such that there exists $W \subseteq \mathbb{R}^{3}$ for which, for any $w_{i} \in W$ and any system initial condition, there exists some constant vector $e_{c} \in \mathbb{R}^{n}$, known or not, satisfying

$$\lim_{t \to \infty} e_{i}(t) = e_{c}, \quad i = 1, \dots, N. \tag{9}$$

Remark 3.

In this paper, it is assumed that the absolute positions $p_{i}$ of the AUVs are not available. Therefore, it is, in general, impossible to drive $e_{i}$ to zero. In practice, however, absolute position tracking is often immaterial; what matters is formation generation and keeping. Thus, instead of regulating $e_{i}$ to zero, it suffices for the absolute tracking errors to converge to a common constant vector, which, in turn, implies the achievement of formation generation and keeping.
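The last implication in Remark 3 follows by subtracting the tracking errors (4) of any two AUVs. The short derivation below spells this out using only (4) and (9).

```latex
p_{i}(t) - p_{j}(t)
  = \bigl(e_{i}(t) - e_{j}(t)\bigr) + \bigl(r_{fi} - r_{fj}\bigr)
  \;\longrightarrow\; (e_{c} - e_{c}) + (r_{fi} - r_{fj})
  = r_{fi} - r_{fj}
  \quad \text{as } t \to \infty,
% and, for each AUV individually,
p_{i}(t) - r_{0}(t) \;\longrightarrow\; r_{fi} + e_{c} .
```

The relative positions therefore converge to the prescribed formation offsets, and the whole formation follows the polynomial path up to the common constant shift $e_{c}$.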

4. Main Results

The distributed robust control scheme proposed in this paper is composed of three parts. First, an output-based distributed observer is invoked to recover the polynomial path for each AUV. Second, a pseudo position estimator is adopted to generate the pseudo position for each AUV. Finally, based on the estimated polynomial path and the pseudo position, a certainty equivalent robust internal model control law is synthesized to achieve reference trajectory tracking. The details of these three parts are presented in sequence as follows.

First, we introduce the output-based distributed observer for the virtual leader.

Since $(\pi, \xi)$ is observable, let $\chi \in \mathbb{R}^{(m + 1) \times (m + 1)}$ be the positive definite solution to the following algebraic Riccati equation:

$$\chi \xi^{T} + \xi \chi - \chi \pi^{T} \pi \chi + I_{m + 1} = 0. \tag{10}$$

Let

$$L = I_{n} \otimes (\chi \pi^{T}). \tag{11}$$
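One possible way to compute the gain in (10) and (11) numerically is sketched below. The paper does not prescribe a solver; the Hamiltonian eigenvector method used here is a standard technique substituted for illustration, the function names are hypothetical, and a dedicated Riccati solver from a numerical library would be preferable for larger $m$.

```python
import numpy as np

def solve_riccati_10(xi, pi):
    """Positive definite chi with chi xi^T + xi chi - chi pi^T pi chi + I = 0.

    Equation (10) is the standard Riccati equation a^T X + X a - X b b^T X + q = 0
    with a = xi^T, b = pi^T and q = I, solved here through the stable invariant
    subspace of the associated Hamiltonian matrix.
    """
    k = xi.shape[0]
    a, b, q = xi.T, pi.T, np.eye(k)
    ham = np.block([[a, -b @ b.T], [-q, -a.T]])
    eigvals, eigvecs = np.linalg.eig(ham)
    stable = eigvecs[:, eigvals.real < 0]         # k eigenvectors with Re < 0
    upper, lower = stable[:k, :], stable[k:, :]
    chi = np.real(lower @ np.linalg.inv(upper))
    return 0.5 * (chi + chi.T)                    # symmetrize against round-off

def observer_gain(n, xi, pi):
    """L = I_n kron (chi pi^T) as in (11)."""
    chi = solve_riccati_10(xi, pi)
    return np.kron(np.eye(n), chi @ pi.T)

# Illustrative case m = 1 (straight-line path), n = 2.
xi = np.array([[0.0, 1.0], [0.0, 0.0]])
pi = np.array([[1.0, 0.0]])
chi = solve_riccati_10(xi, pi)
L_gain = observer_gain(2, xi, pi)
```

For $m = 1$ the equation can also be solved by hand, giving $\chi$ with diagonal entries $\sqrt{3}$ and off-diagonal entries $1$, which agrees with the numerical result.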

Then, for $i = 1, \dots, N$, design

$$\dot{\ell}_{i} = \Xi \ell_{i} + \mu_{\ell} L \sum_{j = 0}^{N} a_{ij}(t) (r_{j} - r_{i}) \tag{12a}$$

$$r_{i} = \Pi \ell_{i} \tag{12b}$$

where $\ell_{i}$ and $r_{i}$ are the estimates of $\ell_{0}$ and $r_{0}$, respectively, and $\mu_{\ell} > 0$ is the observer gain.

For $i = 1, \dots, N$, define $\tilde{\ell}_{i} = \ell_{i} - \ell_{0}$ and $\tilde{r}_{i} = r_{i} - r_{0}$, and note that $\tilde{\ell}_{0} = 0$ and $\tilde{r}_{0} = 0$. Then,

$$\begin{aligned} \dot{\tilde{\ell}}_{i} &= \Xi \tilde{\ell}_{i} + \mu_{\ell} L \sum_{j = 0}^{N} a_{ij}(t) (\tilde{r}_{j} - \tilde{r}_{i}) \\ &= \Xi \tilde{\ell}_{i} + \mu_{\ell} L \Pi \sum_{j = 0}^{N} a_{ij}(t) (\tilde{\ell}_{j} - \tilde{\ell}_{i}). \end{aligned} \tag{13}$$

For $x_{i} \in \mathbb{R}^{n_{i}}$, $i = 1, \dots, N$, define the notation $\mathrm{col}(x_{1}, \dots, x_{N}) = [x_{1}^{T}, \dots, x_{N}^{T}]^{T}$. Let $\tilde{\ell} = \mathrm{col}(\tilde{\ell}_{1}, \dots, \tilde{\ell}_{N})$. Then, we have

$$\dot{\tilde{\ell}} = (I_{N} \otimes \Xi - H_{\sigma(t)} \otimes (\mu_{\ell} L \Pi)) \tilde{\ell}. \tag{14}$$

Using Theorem 4.1 of [32], we have the following result.

Lemma 1.

Given systems (5) and (12), under Assumptions 1 and 2, for any system initial condition and any $\mu_{\ell} > 0$, both $\tilde{\ell}_{i}(t)$ and $\tilde{r}_{i}(t)$ tend to zero exponentially as $t \to \infty$.

Remark 4.

The dynamic compensator (12) is called an output-based distributed observer of the virtual leader system (5) in the sense that: (1) it relies solely on the output $r_{0}$ of the virtual leader; (2) it only requires local information of neighboring AUVs; (3) it can recover the information of the virtual leader for each AUV. In this way, the information of the virtual leader is transmitted to each AUV over the unreliable jointly connected switching network $\bar{G}_{\sigma(t)}$.

Next, we introduce the pseudo position estimator. For $i = 1, \dots, N$, design

$$\dot{\wp}_{i} = v_{i} + \mu_{\wp} \sum_{j = 1}^{N} a_{ij}(t) (\wp_{j} - \wp_{i} - p_{ji}) \tag{15a}$$

$$p_{ji} = p_{j} - p_{i} \tag{15b}$$

where $\mu_{\wp} > 0$ is the consensus gain.

Define $\tilde{\wp}_{i} = \wp_{i} - p_{i}$. Then, we have

$$\begin{aligned} \dot{\tilde{\wp}}_{i} &= v_{i} - v_{i} + \mu_{\wp} \sum_{j = 1}^{N} a_{ij}(t) (\wp_{j} - \wp_{i} - (p_{j} - p_{i})) \\ &= \mu_{\wp} \sum_{j = 1}^{N} a_{ij}(t) (\tilde{\wp}_{j} - \tilde{\wp}_{i}). \end{aligned} \tag{16}$$

Since (16) is a leaderless consensus system over the graph $G_{\sigma(t)}$, under Assumption 2, using Theorem 2.8 of [1], it follows that, for any $\mu_{\wp} > 0$,

$$\lim_{t \to \infty} \tilde{\wp}_{i}(t) = \wp_{c} \tag{17}$$

exponentially for some constant vector $\wp_{c} \in \mathbb{R}^{n}$.

Remark 5.

For $i = 1, \dots, N$, define $\check{p}_{i} = p_{i} + \wp_{c}$ and $\tilde{p}_{i} = \wp_{i} - \check{p}_{i}$. Then, using (17), it follows that

$$\lim_{t \to \infty} \tilde{p}_{i}(t) = 0 \tag{18}$$

exponentially. Equation (18) means that, though the absolute position $p_{i}$ is not available, it is possible to generate a pseudo position $\wp_{i}$ for each AUV such that the differences between the pseudo positions and the absolute positions of all of the AUVs converge to a common constant vector. It is this very result that leads to the solution of the polynomial path formation problem by a distributed control law in the form of (8).
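The behavior described in Remark 5 can be reproduced with a short simulation of the offset dynamics (16). The sketch below assumes forward-Euler integration, a periodic switching signal, unit symmetric edge weights and three AUVs in the plane; none of these choices come from the paper. Each graph mode is disconnected, but the union of the two modes is connected. With symmetric weights the average of the offsets is invariant, so in this sketch the common limit is the average of the initial offsets.

```python
import numpy as np

def laplacian(adj):
    """Graph Laplacian associated with a weighted adjacency matrix."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj

def simulate_offsets(offsets0, adjacencies, mu=1.0, dwell=0.5, dt=0.01, t_end=60.0):
    """Forward-Euler integration of (16) under a periodic switching graph.

    Row i of the state holds the offset between the pseudo position and the
    absolute position of AUV i; the graph mode changes every `dwell` seconds.
    """
    x = np.array(offsets0, dtype=float)
    laps = [laplacian(a) for a in adjacencies]
    steps_per_mode = int(round(dwell / dt))
    for k in range(int(round(t_end / dt))):
        lap = laps[(k // steps_per_mode) % len(laps)]
        x = x - dt * mu * lap @ x
    return x

# Mode 1 links AUVs 1 and 2 only; mode 2 links AUVs 2 and 3 only.
mode_1 = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]
mode_2 = [[0, 0, 0], [0, 0, 1], [0, 1, 0]]
initial_offsets = [[3.0, 0.0], [0.0, -3.0], [-6.0, 9.0]]
final_offsets = simulate_offsets(initial_offsets, [mode_1, mode_2])
```

All rows of `final_offsets` agree to numerical precision, although no AUV ever learns its absolute position.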

In terms of $\check{p}_{i}$, the system dynamics (1) become

$$\dot{\check{p}}_{i} = v_{i} \tag{19a}$$

$$\begin{aligned} \dot{v}_{i} &= \gamma_{i1} (\check{p}_{i} - \wp_{c}) + \gamma_{i2} v_{i} + \gamma_{i3} u_{i} \\ &= \gamma_{i1} \check{p}_{i} + \gamma_{i2} v_{i} + \gamma_{i3} u_{i} - \gamma_{i1} \wp_{c}. \end{aligned} \tag{19b}$$

Moreover, we define the new tracking error as

$$\begin{aligned} \check{e}_{i} &= \check{p}_{i} - r_{0} - r_{fi} \\ &= p_{i} + \wp_{c} - r_{0} - r_{fi} \\ &= e_{i} + \wp_{c}. \end{aligned} \tag{20}$$

Then, $\check{e}_{i} = 0$ if and only if $e_{i} = -\wp_{c}$. As a result, to solve Problem 1, it suffices to solve the following problem.

Problem 2.

Given systems (5), (19) and the communication graph $\bar{G}_{\sigma(t)}$, design a distributed control law $u_{i}$ in the form of (8) such that there exists $W \subseteq \mathbb{R}^{3}$ for which, for any $w_{i} \in W$ and any system initial condition,

$$\lim_{t \to \infty} \check{e}_{i}(t) = 0, \quad i = 1, \dots, N. \tag{21}$$

To solve Problem 2, we rewrite the system dynamics in a new compact form. First, define the following augmented exosystem:

$$\dot{v} = \begin{bmatrix} 0 & 0 \\ 0 & \Xi \end{bmatrix} v \triangleq \bar{\Xi} v, \quad v(0) = \begin{bmatrix} 1 \\ \ell_{0}(0) \end{bmatrix}. \tag{22}$$

Here, the exosystem state $v$ (without subscript) should not be confused with the velocity $v_{i}$ of the $i$th AUV. Therefore, $v(t) = \mathrm{col}(1, \ell_{0}(t))$. Then, letting $\check{x}_{i} = \mathrm{col}(\check{p}_{i}, v_{i})$ gives

$$\dot{\check{x}}_{i} = A_{i} \check{x}_{i} + B_{i} u_{i} + E_{i} v \tag{23a}$$

$$\check{e}_{i} = C_{i} \check{x}_{i} + F_{i} v \tag{23b}$$

where

$$E_{i} = \begin{bmatrix} 0_{n \times 1} & 0_{n \times n(m + 1)} \\ -\gamma_{i1} \wp_{c} & 0_{n \times n(m + 1)} \end{bmatrix}, \quad F_{i} = \begin{bmatrix} -r_{fi} & -\Pi \end{bmatrix}.$$

Note that the minimal polynomials of $\xi$, $\Xi$ and $\bar{\Xi}$ are the same. Therefore, let

$$\mathcal{G}_{1} = \xi, \quad \mathcal{G}_{2} = \begin{bmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix} \in \mathbb{R}^{m + 1} \tag{24}$$

and thus the following matrix pair

$$G_{1} = I_{n} \otimes \mathcal{G}_{1}, \quad G_{2} = I_{n} \otimes \mathcal{G}_{2} \tag{25}$$

incorporates an $n$-copy internal model of $\bar{\Xi}$.

Since all of the eigenvalues of $\bar{\Xi}$ are zero, for any eigenvalue $\lambda$ of $\bar{\Xi}$, note that $\gamma_{i3}^{o} \neq 0$ gives

$$\mathrm{rank} \begin{bmatrix} A_{i}^{o} - \lambda I_{2n} & B_{i}^{o} \\ C_{i} & 0 \end{bmatrix} = \mathrm{rank} \begin{bmatrix} 0 & I_{n} & 0 \\ \gamma_{i1}^{o} I_{n} & \gamma_{i2}^{o} I_{n} & \gamma_{i3}^{o} I_{n} \\ I_{n} & 0 & 0 \end{bmatrix} = 3n.$$

Thus, by Lemma 1.26 of [39], the matrix pair $(\bar{A}_{i}^{o}, \bar{B}_{i}^{o})$ is stabilizable, where

$$\bar{A}_{i}^{o} = \begin{bmatrix} A_{i}^{o} & 0 \\ G_{2} C_{i} & G_{1} \end{bmatrix}, \quad \bar{B}_{i}^{o} = \begin{bmatrix} B_{i}^{o} \\ 0 \end{bmatrix}. \tag{26}$$

Furthermore, let $K_{i} = \begin{bmatrix} K_{i1} & K_{i2} \end{bmatrix}$ be such that

$$\bar{A}_{i}^{o} + \bar{B}_{i}^{o} K_{i} = \begin{bmatrix} A_{i}^{o} + B_{i}^{o} K_{i1} & B_{i}^{o} K_{i2} \\ G_{2} C_{i} & G_{1} \end{bmatrix}$$

is Hurwitz.
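The paper only requires the closed-loop matrix above to be Hurwitz and does not say how the gain is chosen. As one possible illustration, the sketch below builds the augmented pair (26) for the scalar case $n = 1$, $m = 1$ and places the closed-loop eigenvalues with Ackermann's formula. The nominal parameters, the perturbed parameters, the pole locations and the function names are all assumptions made for this example.

```python
import numpy as np

def augmented_pair(gamma, m=1):
    """Augmented pair of (26) for n = 1 and parameters gamma = (g1, g2, g3)."""
    g1, g2, g3 = gamma
    A = np.array([[0.0, 1.0], [g1, g2]])
    B = np.array([[0.0], [g3]])
    C = np.array([[1.0, 0.0]])
    G1 = np.diag(np.ones(m), k=1)                 # internal model matrix (xi)
    G2 = np.zeros((m + 1, 1))
    G2[-1, 0] = 1.0
    A_bar = np.block([[A, np.zeros((2, m + 1))], [G2 @ C, G1]])
    B_bar = np.vstack([B, np.zeros((m + 1, 1))])
    return A_bar, B_bar

def ackermann_gain(A, B, poles):
    """Single-input gain K such that A + B K has the requested eigenvalues."""
    size = A.shape[0]
    ctrb = np.hstack([np.linalg.matrix_power(A, k) @ B for k in range(size)])
    coeffs = np.poly(poles)                       # highest power first
    phi = sum(c * np.linalg.matrix_power(A, size - k) for k, c in enumerate(coeffs))
    last_row = np.zeros((1, size))
    last_row[0, -1] = 1.0
    return -last_row @ np.linalg.inv(ctrb) @ phi  # sign chosen for u = K x

# Nominal design with illustrative parameters and pole locations.
A_nom, B_nom = augmented_pair((0.0, -1.0, 1.0))
K = ackermann_gain(A_nom, B_nom, [-1.0, -2.0, -3.0, -4.0])
nominal_eigs = np.linalg.eigvals(A_nom + B_nom @ K)

# The same gain applied to a perturbed plant (uncertain parameters).
A_pert, B_pert = augmented_pair((0.2, -1.2, 1.1))
perturbed_eigs = np.linalg.eigvals(A_pert + B_pert @ K)
```

In this example the gain designed for the nominal parameters keeps the perturbed closed-loop matrix Hurwitz, which is the kind of robustness the internal model design relies on. For $n > 1$ a multi-input pole placement or LQR routine would be needed instead of Ackermann's formula.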

For $i = 1, \dots, N$, let $\hat{x}_{i} = \mathrm{col}(\wp_{i}, v_{i})$ and $\tilde{x}_{i} = \hat{x}_{i} - \check{x}_{i} = \mathrm{col}(\tilde{p}_{i}, 0_{n \times 1})$. Therefore, $\lim_{t \to \infty} \tilde{x}_{i}(t) = 0$ exponentially by (18). Now, we are ready to present the certainty equivalent robust internal model control law as follows:

$$u_{i} = K_{i1} \hat{x}_{i} + K_{i2} z_{i} \tag{27a}$$

$$\dot{z}_{i} = G_{1} z_{i} + G_{2} \hat{e}_{i} \tag{27b}$$

$$\hat{e}_{i} = \wp_{i} - r_{i} - r_{fi}. \tag{27c}$$

The overall distributed robust control scheme proposed in this paper is composed of the distributed observer (12), the pseudo position estimator (15) and the certainty equivalent robust internal model control law (27). The information flow among different parts of the distributed robust control scheme is illustrated in Figure 1. A pseudo code used to calculate the control input by the distributed robust control scheme for the $i$th AUV, $i = 1, \dots, N$, is given by Algorithm 1.

Algorithm 1 Calculating the control input by the distributed robust control scheme for the ith AUV

Input:

r_{j}

,

p_{j} - p_{i}

,

j \in N_{i} (t)

Output:

u_{i}

1: According to the dimension of the position output n and the order of the polynomial path m, determine the matrix pairs (7) and (6).

2: Solve the Riccati Equation (10) and design L using (11).

3: Select any

μ_{ℓ} > 0

and implement the distributed observer (12).

4: Select any

μ_{℘} > 0

and implement the pseudo position estimator (15) based on

v_{i}

and

p_{j} - p_{i}

.

5: Determine the single-axis internal model pair using (24), the full pair

(G_{1}, G_{2})

using (25) and

({\bar{A}}_{i}^{o}, {\bar{B}}_{i}^{o})

using (26).

6: Select

K_{i}

such that

{\bar{A}}_{i}^{o} + {\bar{B}}_{i}^{o} K_{i}

is Hurwitz.

7: Based on

r_{i}

from the distributed observer (12) and

℘_{i}

from the pseudo position estimator (15), implement the certainty equivalent robust internal model control law (27).
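The runtime part of Algorithm 1 (steps 3, 4 and 7) can be sketched as a single update function. This is a minimal sketch under stated assumptions: the observer and estimator are taken in the forms written out as (44) and (45), the continuous-time laws are discretized with forward Euler, all edge weights of current neighbours are 1, and the function name, argument layout and placeholder gains are ours.

```python
import numpy as np


def controller_step(ell, pp, z, v, nbrs, mats, r_f, mu_l, mu_p, dt):
    """One forward-Euler step of (12), (15) and (27) for a single AUV.

    ell, pp, z : observer state, pseudo position, internal-model state
    v          : measured velocity of this AUV
    nbrs       : list of (r_j, pp_j, p_j_minus_p_i), one per current neighbour;
                 use pp_j = None for the leader, which only feeds the observer
    mats       : dict with the matrices Xi, Pi, L, G1, G2, K1, K2
    """
    r = mats["Pi"] @ ell                                    # estimated path
    obs_sum = sum((r_j - r for r_j, _, _ in nbrs), np.zeros_like(r))
    est_sum = sum(
        (pp_j - pp - rel for _, pp_j, rel in nbrs if pp_j is not None),
        np.zeros_like(pp),
    )
    e_hat = pp - r - r_f                                    # (27c)
    u = mats["K1"] @ np.concatenate([pp, v]) + mats["K2"] @ z   # (27a)
    ell_next = ell + dt * (mats["Xi"] @ ell + mu_l * mats["L"] @ obs_sum)
    pp_next = pp + dt * (v + mu_p * est_sum)
    z_next = z + dt * (mats["G1"] @ z + mats["G2"] @ e_hat)     # (27b)
    return u, ell_next, pp_next, z_next


# Usage with a first-order path in 3D; the zero gains are placeholders only.
I3 = np.eye(3)
mats = {
    "Xi": np.kron(I3, [[0.0, 1.0], [0.0, 0.0]]),
    "Pi": np.kron(I3, [[1.0, 0.0]]),
    "L": np.kron(I3, [[1.7321], [1.0]]),
    "G1": np.kron(I3, [[0.0, 1.0], [0.0, 0.0]]),
    "G2": np.kron(I3, [[0.0], [1.0]]),
    "K1": np.zeros((3, 6)),  # NOT a stabilizing design
    "K2": np.zeros((3, 6)),
}
ell = np.array([1.0, 2.0, 0.0, 0.0, 0.0, 0.0])
leader_only = [(np.array([2.0, 0.0, 0.0]), None, None)]
u, ell_next, pp_next, z_next = controller_step(
    ell, np.zeros(3), np.zeros(6), np.array([1.0, 0.0, 0.0]),
    leader_only, mats, np.zeros(3), mu_l=10.0, mu_p=10.0, dt=0.1,
)
```

Steps 1, 2, 5 and 6 are offline design steps, so they appear here only through the matrices passed in `mats`.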

The main result of this paper is presented as follows.

Theorem 1.

Given systems (5), (23) and the communication graph

{\bar{G}}_{σ (t)}

, under Assumptions 1 and 2, Problem 2 is solvable by the control law composed of (12), (15) and (27) for any

μ_{ℓ}, μ_{℘} > 0

.

Proof.

Define, for

i = 1, \dots, N

,

{\bar{A}}_{i} = [\begin{matrix} A_{i} & 0 \\ G_{2} C_{i} & G_{1} \end{matrix}], {\bar{B}}_{i} = [\begin{matrix} B_{i} \\ 0 \end{matrix}] .

Since

{\bar{A}}_{i}^{o} + {\bar{B}}_{i}^{o} K_{i}

is Hurwitz, there exists an open neighborhood

W \subseteq R^{3}

, containing the origin of

R^{3}

such that

{\bar{A}}_{i} + {\bar{B}}_{i} K_{i} = [\begin{matrix} A_{i} + B_{i} K_{i 1} & B_{i} K_{i 2} \\ G_{2} C_{i} & G_{1} \end{matrix}]

is Hurwitz for any

w_{i} \in W

. Then, using Lemma 1.27 of [39], the following matrix equations

\begin{matrix} X_{i} \bar{Ξ} & = (A_{i} + B_{i} K_{i 1}) X_{i} + B_{i} K_{i 2} Z_{i} + E_{i} \end{matrix}

(28a)

\begin{matrix} Z_{i} \bar{Ξ} & = G_{1} Z_{i} + G_{2} (C_{i} X_{i} + F_{i}) \end{matrix}

(28b)

have a unique solution pair

(X_{i}, Z_{i})

, which, in addition, satisfies

0 = C_{i} X_{i} + F_{i} .

(29)

Substituting (27) into (23) gives

\begin{matrix} {\dot{\overset{ˇ}{x}}}_{i} & = A_{i} {\overset{ˇ}{x}}_{i} + B_{i} u_{i} + E_{i} v \\ = A_{i} {\overset{ˇ}{x}}_{i} + B_{i} K_{i 1} {\hat{x}}_{i} + B_{i} K_{i 2} z_{i} + E_{i} v \\ = A_{i} {\overset{ˇ}{x}}_{i} + B_{i} K_{i 1} {\overset{ˇ}{x}}_{i} + B_{i} K_{i 2} z_{i} + E_{i} v + B_{i} K_{i 1} {\tilde{x}}_{i} \\ = (A_{i} + B_{i} K_{i 1}) {\overset{ˇ}{x}}_{i} + B_{i} K_{i 2} z_{i} + E_{i} v + B_{i} K_{i 1} {\tilde{x}}_{i} \end{matrix}

(30)

and

\begin{matrix} {\dot{z}}_{i} & = G_{1} z_{i} + G_{2} {\hat{e}}_{i} \\ = G_{1} z_{i} + G_{2} ({\overset{ˇ}{e}}_{i} + r_{0} - r_{i} + ℘_{i} - {\overset{ˇ}{p}}_{i}) \\ = G_{1} z_{i} + G_{2} {\overset{ˇ}{e}}_{i} + G_{2} {\tilde{p}}_{i} - G_{2} {\tilde{r}}_{i} . \end{matrix}

(31)

Next, for

i = 1, \dots, N

, define

\begin{matrix} {\bar{x}}_{i} & = {\overset{ˇ}{x}}_{i} - X_{i} v \\ {\bar{z}}_{i} & = z_{i} - Z_{i} v . \end{matrix}

Then, using (28), we have

\begin{matrix} {\dot{\bar{x}}}_{i} & = (A_{i} + B_{i} K_{i 1}) {\overset{ˇ}{x}}_{i} + B_{i} K_{i 2} z_{i} + E_{i} v + B_{i} K_{i 1} {\tilde{x}}_{i} - X_{i} \bar{Ξ} v \\ & = (A_{i} + B_{i} K_{i 1}) ({\bar{x}}_{i} + X_{i} v) + B_{i} K_{i 2} Z_{i} v + B_{i} K_{i 2} {\bar{z}}_{i} + E_{i} v - X_{i} \bar{Ξ} v + B_{i} K_{i 1} {\tilde{x}}_{i} \\ & = (A_{i} + B_{i} K_{i 1}) {\bar{x}}_{i} + B_{i} K_{i 2} {\bar{z}}_{i} + B_{i} K_{i 1} {\tilde{x}}_{i} \end{matrix}

(32)

and

\begin{matrix} {\dot{\bar{z}}}_{i} & = G_{1} z_{i} + G_{2} {\overset{ˇ}{e}}_{i} + G_{2} {\tilde{p}}_{i} - G_{2} {\tilde{r}}_{i} - Z_{i} \bar{Ξ} v \\ = G_{1} ({\bar{z}}_{i} + Z_{i} v) + G_{2} (C_{i} {\overset{ˇ}{x}}_{i} + F_{i} v) + G_{2} {\tilde{p}}_{i} - G_{2} {\tilde{r}}_{i} - Z_{i} \bar{Ξ} v \\ = G_{1} ({\bar{z}}_{i} + Z_{i} v) + G_{2} (C_{i} ({\bar{x}}_{i} + X_{i} v) + F_{i} v) + G_{2} {\tilde{p}}_{i} - G_{2} {\tilde{r}}_{i} - Z_{i} \bar{Ξ} v \\ = G_{2} C_{i} {\bar{x}}_{i} + G_{1} {\bar{z}}_{i} + G_{2} {\tilde{p}}_{i} - G_{2} {\tilde{r}}_{i} . \end{matrix}

(33)

For

i = 1, \dots, N

, define

ρ_{i} = col ({\bar{x}}_{i}, {\bar{z}}_{i})

. Then, it follows that

\begin{matrix} {\dot{ρ}}_{i} = Υ_{i} ρ_{i} + ψ_{i} \end{matrix}

(34)

with

\begin{matrix} Υ_{i} & = [\begin{matrix} A_{i} + B_{i} K_{i 1} & B_{i} K_{i 2} \\ G_{2} C_{i} & G_{1} \end{matrix}] \\ ψ_{i} & = [\begin{matrix} B_{i} K_{i 1} {\tilde{x}}_{i} \\ G_{2} ({\tilde{p}}_{i} - {\tilde{r}}_{i}) \end{matrix}] . \end{matrix}

(35)

Since

Υ_{i}

is Hurwitz for any

w_{i} \in W

, and

{\tilde{x}}_{i} (t), ψ_{i} (t)

decay to zero exponentially as

t \to \infty

, using Lemma 2.5 of [32], it follows that

lim_{t \to \infty} ρ_{i} (t) = 0 .

(36)

Moreover, using (29), it follows that

\begin{matrix} {\overset{ˇ}{e}}_{i} & = C_{i} {\overset{ˇ}{x}}_{i} + F_{i} v \\ = C_{i} ({\bar{x}}_{i} + X_{i} v) + F_{i} v \\ = C_{i} {\bar{x}}_{i} . \end{matrix}

(37)

Therefore, using (36),

{lim}_{t \to \infty} {\overset{ˇ}{e}}_{i} (t) = 0

and thus the proof is complete. □

5. Numerical Simulations

In this section, we will use numerical simulations to illustrate and validate the proposed control scheme. The control task is taken from [38], where a fleet of AUVs executed the formation task in Monterey Bay during August 2003 to observe and predict ocean processes.

5.1. Aim of the Experiment

In [38], the aim of the experiment was to drive the AUV fleet to cruise in a static triangular formation while following a straight-line reference trajectory, as illustrated in Figure 2.

In this paper, we consider a task similar to that in [38]. Consider a swarm of four AUVs whose dynamics are given by

\begin{matrix} {\dot{p}}_{i} & = v_{i} \end{matrix}

(38a)

\begin{matrix} {\dot{v}}_{i} & = γ_{i 1} p_{i} + γ_{i 2} v_{i} + γ_{i 3} u_{i} \end{matrix}

(38b)

where

p_{i}, v_{i}, u_{i} \in R^{3}

. For

i = 1, \dots, 4

, suppose the system parameters are given by

γ_{i 1}^{o} = 5

,

γ_{i 2}^{o} = - 10

,

γ_{i 3}^{o} = 8

,

Δ γ_{i 1} \in [- 1, 1]

,

Δ γ_{i 2} \in [- 2, 2]

,

Δ γ_{i 3} \in [- 1, 1]

.
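Whether a gain designed for the nominal parameters stays stabilizing over these uncertainty ranges can be spot-checked numerically. The sketch below treats one axis and uses a hypothetical gain that places the nominal closed-loop poles at -5, -6, -7 and -8; it evaluates only the eight corners of the parameter box, so it is a sanity check rather than a proof.

```python
import itertools
import numpy as np

# Hypothetical per-axis gain [K1, K2] acting on (p, v, z1, z2).
K = np.array([[-32.0, -2.0, -210.0, -133.25]])


def closed_loop(g1, g2, g3):
    """Single-axis augmented closed-loop matrix for parameters (g1, g2, g3)."""
    A_bar = np.array([
        [0.0, 1.0, 0.0, 0.0],
        [g1, g2, 0.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],   # double-integrator internal model
        [1.0, 0.0, 0.0, 0.0],   # driven by the position output
    ])
    B_bar = np.array([[0.0], [g3], [0.0], [0.0]])
    return A_bar + B_bar @ K


nominal = (5.0, -10.0, 8.0)
half_widths = (1.0, 2.0, 1.0)   # bounds on the three parameter perturbations

# Largest real part of any closed-loop eigenvalue over the corners of the box.
worst = -np.inf
for signs in itertools.product((-1.0, 1.0), repeat=3):
    params = [n + s * h for n, s, h in zip(nominal, signs, half_widths)]
    worst = max(worst, np.linalg.eigvals(closed_loop(*params)).real.max())

nominal_eigs = np.sort(np.linalg.eigvals(closed_loop(*nominal)).real)
print("worst real part over corners:", worst)
```

A negative `worst` indicates that every tested corner gives a Hurwitz closed-loop matrix for this particular gain; a finer grid over the box would strengthen the check.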

The polynomial path is given by

r_{0} (t) = [\begin{matrix} 2 t - 10 \\ - 4 t + 5 \\ - t \end{matrix}] .

The local formation vector is given by

r_{f 1} = [\begin{matrix} 0 \\ 0 \\ 0 \end{matrix}], r_{f 2} = [\begin{matrix} 0 \\ 0 \\ 5 \end{matrix}], r_{f 3} = [\begin{matrix} 5 \\ 0 \\ - 3 \end{matrix}], r_{f 4} = [\begin{matrix} - 5 \\ 0 \\ - 3 \end{matrix}] .
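As a small worked example, the path and the formation vectors can be combined into per-AUV target positions. The sketch assumes that the desired position of the ith AUV is the path shifted by its local formation vector, as the error definition (27c) suggests; the function names are ours.

```python
import numpy as np


def r0(t):
    """Straight-line polynomial path of the virtual leader."""
    return np.array([2.0 * t - 10.0, -4.0 * t + 5.0, -t])


# Local formation vectors, one row per AUV.
r_f = np.array([
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 5.0],
    [5.0, 0.0, -3.0],
    [-5.0, 0.0, -3.0],
])


def targets(t):
    """Target positions of all four AUVs at time t, shape (4, 3)."""
    return r0(t) + r_f  # broadcasting adds the path point to every row


print(targets(0.0))
print(targets(2.0))
```

Because the formation vectors are constant, the offsets between any two targets do not change with time, which is what makes the formation static.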

The communication network is shown by Figure 3. In particular, the communication network

{\bar{G}}_{σ (t)}

is assumed to switch among six subgraphs

{\bar{G}}_{1}

, …,

{\bar{G}}_{6}

, periodically every

T_{c}

sec. Suppose that

T_{c} = 0.1

. It can be verified that Assumptions 1 and 2 are both satisfied. The distinguishing characteristic of the communication network

{\bar{G}}_{σ (t)}

is that it is disconnected the entire time.
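The idea of a network that is disconnected at every instant yet jointly connected can be illustrated with a small reachability check. Figure 3 is not reproduced in this text, so the six edge sets below are purely hypothetical and are not the subgraphs used in the paper; node 0 plays the role of the virtual leader, and the zero-based index returned by `sigma` is our convention.

```python
from collections import deque


def reachable_from_leader(edges, n_nodes, leader=0):
    """Breadth-first search over directed edges (sender, receiver)."""
    adj = {k: [] for k in range(n_nodes)}
    for src, dst in edges:
        adj[src].append(dst)
    seen = {leader}
    queue = deque([leader])
    while queue:
        node = queue.popleft()
        for nxt in adj[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def both(a, b):
    """Bidirectional link between two followers."""
    return [(a, b), (b, a)]


# HYPOTHETICAL subgraphs: one link is active in each.
subgraphs = [[(0, 1)], both(1, 2), both(2, 3), both(3, 4), both(1, 3), both(2, 4)]


def sigma(t, T_c=0.1):
    """Zero-based index of the subgraph active at time t (periodic switching)."""
    return int(t // T_c) % len(subgraphs)


all_nodes = set(range(5))
each_connected = [reachable_from_leader(g, 5) == all_nodes for g in subgraphs]
union_edges = [e for g in subgraphs for e in g]
union_connected = reachable_from_leader(union_edges, 5) == all_nodes
print(each_connected, union_connected)
```

No single subgraph lets the leader's information reach every follower, but the union over one switching cycle does.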

5.2. Methodology

Next, we follow Algorithm 1 to design the distributed robust control scheme.

1.: The dimension of the position output is 3, and the order of the polynomial path is 1. Therefore, we have

$ξ = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}], π = [\begin{matrix} 1 & 0 \end{matrix}]$

(39)

and

$\begin{matrix} Ξ = I_{3} \otimes ξ, Π = I_{3} \otimes π . \end{matrix}$

(40)
2.: The solution to the following Riccati equation

$χ ξ^{T} + ξ χ - χ π^{T} π χ + I_{2} = 0$

(41)

is

$χ = [\begin{matrix} 1.7321 & 1 \\ 1 & 1.7321 \end{matrix}]$

(42)

and thus

$L = I_{3} \otimes (χ π^{T}) = I_{3} \otimes [\begin{matrix} 1.7321 \\ 1 \end{matrix}] .$

(43)
3.: Select $μ_{ℓ} = 10$ and design the distributed observer as follows

$\begin{matrix} {\dot{ℓ}}_{i} & = Ξ ℓ_{i} + μ_{ℓ} L \sum_{j = 0}^{N} a_{i j} (t) (r_{j} - r_{i}) \end{matrix}$

(44a)

$\begin{matrix} r_{i} & = Π ℓ_{i} \end{matrix}$

(44b)

where $ℓ_{i} \in R^{6}$ .
4.: Select $μ_{℘} = 10$ and design the pseudo position estimator as follows

$\begin{matrix} {\dot{℘}}_{i} & = v_{i} + μ_{℘} \sum_{j = 1}^{N} a_{i j} (t) (℘_{j} - ℘_{i} - p_{j i}) \end{matrix}$

(45a)

$\begin{matrix} p_{j i} & = p_{j} - p_{i} \end{matrix}$

(45b)

where $℘_{i} \in R^{3}$ .
5.: Using (24)

$\mathcal{G}_{1} = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}], \mathcal{G}_{2} = [\begin{matrix} 0 \\ 1 \end{matrix}]$

(46)

and, using (25),

$G_{1} = I_{3} \otimes \mathcal{G}_{1}, G_{2} = I_{3} \otimes \mathcal{G}_{2} .$

(47)

Using (38), it follows that

$A_{i}^{o} = [\begin{matrix} 0_{3 \times 3} & I_{3} \\ 5 I_{3} & - 10 I_{3} \end{matrix}], B_{i}^{o} = [\begin{matrix} 0_{3 \times 3} \\ 8 I_{3} \end{matrix}], C_{i} = I_{3} \otimes [\begin{matrix} 1 & 0 \end{matrix}]$

(48)

and, thus, using (26),

${\bar{A}}_{i}^{o} = [\begin{matrix} A_{i}^{o} & 0_{6 \times 6} \\ G_{2} C_{i} & G_{1} \end{matrix}] = [\begin{matrix} A_{i}^{o} & 0_{6 \times 6} \\ I_{3} \otimes [\begin{matrix} 0 & 0 \\ 1 & 0 \end{matrix}] & G_{1} \end{matrix}], {\bar{B}}_{i}^{o} = [\begin{matrix} B_{i}^{o} \\ 0_{6 \times 3} \end{matrix}] .$

(49)
6.: By letting

$K_{i} = [\begin{matrix} K_{i 1} & K_{i 2} \end{matrix}]$

with

$K_{i 1} = [\begin{matrix} - 39.6591 & - 0.9323 & 0.4840 & - 2.3884 & - 0.0400 & 0.0213 \\ - 1.1874 & - 48.0646 & - 3.1420 & - 0.0519 & - 2.7540 & - 0.1312 \\ 0.8779 & - 3.1444 & - 47.5545 & 0.0378 & - 0.1308 & - 2.7325 \end{matrix}]$

and

$K_{i 2} = [\begin{matrix} - 314.8926 & - 182.7741 & - 17.9939 & - 7.1494 & 8.7643 & 3.6004 \\ - 21.7605 & - 8.8925 & - 472.3295 & - 246.2606 & - 63.8461 & - 24.72 \\ 16.8854 & 6.7136 & - 64.4250 & - 24.8377 & - 461.9890 & - 242.2642 \end{matrix}]$

it follows that the eigenvalues of ${\bar{A}}_{i}^{o} + {\bar{B}}_{i}^{o} K_{i}$ are located at

$\{- 5, - 5.5, - 6, - 6.5, - 7, - 7.5, - 8, - 8.5, - 9, - 9.5, - 10, - 10.5\}$

that is, ${\bar{A}}_{i}^{o} + {\bar{B}}_{i}^{o} K_{i}$ is Hurwitz.
7.: Let ${\hat{x}}_{i} = col (℘_{i}, v_{i})$ and the certainty equivalent robust internal model control law be designed as follows:

$\begin{matrix} u_{i} & = K_{i 1} {\hat{x}}_{i} + K_{i 2} z_{i} \end{matrix}$

(50a)

$\begin{matrix} {\dot{z}}_{i} & = G_{1} z_{i} + G_{2} {\hat{e}}_{i} \end{matrix}$

(50b)

$\begin{matrix} {\hat{e}}_{i} & = ℘_{i} - r_{i} - r_{f i} . \end{matrix}$

(50c)
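The values reported in steps 1 and 2 can be checked numerically. The sketch below substitutes the closed-form candidate with entries sqrt(3) and 1 into the Riccati equation (41), forms the observer gain as in (43), and assembles the Kronecker-structured matrices of (40). The final eigenvalue check on the single-axis matrix is an extra sanity test of ours, not a step of the paper.

```python
import numpy as np

xi = np.array([[0.0, 1.0], [0.0, 0.0]])
pi = np.array([[1.0, 0.0]])

# Candidate solution of (41); 1.7321 in (42) is sqrt(3) rounded to four decimals.
s3 = np.sqrt(3.0)
chi = np.array([[s3, 1.0], [1.0, s3]])

# Residual of the Riccati equation (41); should vanish up to rounding.
residual = chi @ xi.T + xi @ chi - chi @ pi.T @ pi @ chi + np.eye(2)

# Observer gain for one axis and the full three-axis matrices of (40) and (43).
L_axis = chi @ pi.T
Xi = np.kron(np.eye(3), xi)
Pi = np.kron(np.eye(3), pi)
L = np.kron(np.eye(3), L_axis)

# Extra check (ours): chi is positive definite and xi - L_axis pi is Hurwitz.
chi_eigs = np.linalg.eigvalsh(chi)
obs_eigs = np.linalg.eigvals(xi - L_axis @ pi)
print(np.abs(residual).max(), chi_eigs, obs_eigs)
```

The shapes confirm the dimension stated after (44): `Xi` is 6 by 6 and `L` is 6 by 3, so each observer state lives in a six-dimensional space.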

5.3. Results

Now, we examine the system performance using simulation results. Suppose that the initial positions and velocities of the AUVs are given by

\begin{matrix} p_{1} (0) = [\begin{matrix} 5 \\ 5 \\ 0 \end{matrix}], & p_{2} (0) = [\begin{matrix} 5 \\ - 5 \\ 0 \end{matrix}], & p_{3} (0) = [\begin{matrix} - 5 \\ 5 \\ 0 \end{matrix}], & p_{4} (0) = [\begin{matrix} - 5 \\ - 5 \\ 0 \end{matrix}] \\ v_{1} (0) = [\begin{matrix} 0 \\ 0.6 \\ - 0.6 \end{matrix}], & v_{2} (0) = [\begin{matrix} 0.5 \\ 0 \\ - 0.2 \end{matrix}], & v_{3} (0) = [\begin{matrix} 0.2 \\ 0.3 \\ - 0.1 \end{matrix}], & v_{4} (0) = [\begin{matrix} - 0.1 \\ - 0.5 \\ - 0.4 \end{matrix}] . \end{matrix}

The initial values of the control laws, i.e., the components of

℘_{i} (0)

,

ℓ_{i} (0)

and

z_{i} (0)

, take random values from the interval

[0, 0.5]

.

5.3.1. Standard Case

The simulation results are shown in Figure 4, Figure 5, Figure 6 and Figure 7. In particular, the performance of the distributed observer over the unreliable switching communication network

{\bar{G}}_{σ (t)}

is shown in Figure 4. It can be seen that the polynomial trajectory has been successfully recovered by the distributed observer of each AUV. The performance of the pseudo position estimator is shown in Figure 5. As proved, the differences between the actual positions and the pseudo positions of all of the AUVs converge to a common constant vector, which, in our case, is very close to zero. The absolute tracking errors for all of the AUVs are shown in Figure 6. These tracking errors also converge to a common constant vector, as required by the control objective (9), i.e., the distributed tracking problem has been successfully solved by the proposed distributed robust control scheme. Finally, the 3D trajectories of all of the AUVs are plotted in Figure 7, where the process of formation generation and keeping can be seen directly.
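The common-offset behaviour of the pseudo position estimator follows from (45): subtracting the true position dynamics shows that the estimation errors obey a consensus law driven only by the differences between neighbouring errors. A minimal simulation of that error dynamics is sketched below. It assumes a static undirected path graph among the four AUVs instead of the switching network of the paper, unit edge weights and forward-Euler integration.

```python
import numpy as np

rng = np.random.default_rng(0)
N, mu, dt, steps = 4, 10.0, 1e-3, 5000

# Assumed static path graph 1-2-3-4 and its Laplacian.
adj = np.zeros((N, N))
for i in range(N - 1):
    adj[i, i + 1] = adj[i + 1, i] = 1.0
lap = np.diag(adj.sum(axis=1)) - adj

# Initial errors (pseudo position minus true position), one row per AUV.
eps = rng.uniform(-0.5, 0.5, size=(N, 3))
eps0_mean = eps.mean(axis=0)

# Error dynamics implied by (45): each error moves toward its neighbours' errors.
for _ in range(steps):
    eps = eps - dt * mu * (lap @ eps)

spread = (eps.max(axis=0) - eps.min(axis=0)).max()
print("common offset:", eps.mean(axis=0), "spread:", spread)
```

For an undirected graph the average error is preserved, so under these assumptions the common offset equals the mean of the initial errors.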

5.3.2. Comparative Study

In this case, we compare the proposed distributed robust control scheme with a typical existing work [29], exclusively from the perspective of the communication network. For simplicity, for the control scheme proposed in [29], it is assumed that the absolute position of the AUV is available for control feedback and that the system parameters are fully known. In contrast, for the control scheme proposed in this paper, the absolute position of the AUV is not available and the system parameters are unknown. Suppose that the polynomial path considered in this case is given by

r_{0} (t) = [\begin{matrix} t^{2} + 2 t - 10 \\ 2 t^{2} - 4 t + 5 \\ - t \end{matrix}] .

(51)

First, we consider the ideal static and connected communication network for the result in [29]. Suppose the communication network is the union of the six subgraphs of Figure 3, which is shown in Figure 8. Under the communication graph

\bar{G}

, the simulation results using the control method in [29] are shown in Figure 9. It can be seen that the tracking errors have been driven to zero asymptotically. Under the communication graph

{\bar{G}}_{σ (t)}

defined by Figure 3 with different switching periods

T_{c}

, the simulation results using the control method in [29] are shown in Figure 10 and Figure 11. When the communication network becomes unreliable and switching, the tracking errors no longer converge to zero. This indicates that the control method in [29] is effective for a static and connected communication network, but cannot effectively deal with an unreliable, jointly connected switching communication network. Following the same design process as in the standard case, we can design the distributed robust control scheme proposed in this paper for the new

r_{0}

given by (51). The simulation results using the proposed control scheme of this paper are given in Figure 12 and Figure 13, which show that successful tracking has been achieved for both cases.

6. Methodology Discussion

In this section, we further examine the effectiveness of the proposed control scheme in the presence of measurement noise. There are two sources of measurement noise associated with the distributed robust control scheme proposed in this paper.

The first one is the velocity measurement noise imposed on

v_{i}

. Suppose that

v_{i}^{m} = v_{i} + n_{v i}

(52)

where

v_{i}^{m}, v_{i}, n_{v i}

denote the measured velocity, true velocity and velocity measurement noise, respectively. In what follows, we will show, using simulation results, the effect of

n_{v i}

on the system performance. In the simulations, suppose that the entries of

n_{v i}

take random values uniformly from

[- n_{V}, n_{V}]

, with

n_{V} > 0

being the magnitude of the velocity measurement noise. Note that the velocity measurement noise will affect the pseudo position estimator (15) and the robust internal model control law (27). Simulation results with

n_{V} = 1

are shown in Figure 14 and Figure 15. Due to the velocity measurement noise, the differences between the pseudo positions and the actual positions of the AUVs converge only approximately to a common vector, and this common vector is time-varying rather than constant. As a result, the steady-state trajectory tracking errors of all of the AUVs also converge approximately to a time-varying common vector.

The second one is the relative position measurement noise imposed on

p_{j} - p_{i}

. In this scenario, suppose the pseudo position estimator takes the following form

\begin{matrix} {\dot{℘}}_{i} & = v_{i} + μ_{℘} \sum_{j = 1}^{N} a_{i j} (t) (℘_{j} - ℘_{i} - p_{j i}) + μ_{℘} n_{℘ i} \end{matrix}

(53a)

\begin{matrix} p_{j i} & = p_{j} - p_{i} \end{matrix}

(53b)

where

n_{℘ i}

denotes the lumped relative position measurement noise for the ith AUV. Again, we will show, using simulation results, the effect of

n_{℘ i}

on the system performance. Similarly, suppose that the entries of

n_{℘ i}

take random values uniformly from

[- n_{P}, n_{P}]

, with

n_{P} > 0

being the magnitude of the relative position measurement noise. Note that the relative position measurement noise will affect the pseudo position estimator (15), and thus the tracking performance of the system. Simulation results with

n_{P} = 0.1

are shown in Figure 16 and Figure 17. Similar to the case of velocity measurement noise, the differences between the pseudo positions and the actual positions of the AUVs converge approximately to a time-varying common vector, which, in turn, makes the approximate common path tracking error time-varying too. Note that the pseudo position estimator gain

μ_{℘}

will amplify the magnitude of the relative position measurement noise. As a result,

μ_{℘}

should not be chosen too large in the presence of relative position measurement noise.
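The amplification effect can be made concrete with the error dynamics implied by (53a), in which the lumped noise enters multiplied by the estimator gain. The sketch below assumes a static undirected path graph, forward-Euler integration and noise held constant over each step; it reuses the same random noise sequence for two gains so that the comparison is deterministic. The function name and all numerical settings are ours.

```python
import numpy as np


def mean_error_drift(mu, n_p=0.1, dt=1e-3, steps=2000, seed=0):
    """Drift of the average pseudo-position error under lumped noise."""
    rng = np.random.default_rng(seed)
    N = 4
    # Assumed static path graph 1-2-3-4 and its Laplacian.
    adj = np.zeros((N, N))
    for i in range(N - 1):
        adj[i, i + 1] = adj[i + 1, i] = 1.0
    lap = np.diag(adj.sum(axis=1)) - adj

    eps = np.zeros((N, 3))  # start with perfect pseudo positions
    for _ in range(steps):
        noise = rng.uniform(-n_p, n_p, size=(N, 3))
        # Consensus term plus noise, both scaled by the estimator gain mu.
        eps = eps + dt * mu * (-(lap @ eps) + noise)
    return eps.mean(axis=0)


drift_small = mean_error_drift(mu=1.0)
drift_large = mean_error_drift(mu=10.0)
print(drift_small, drift_large)
```

In this setting the consensus term cancels in the average, so the drift of the common offset scales linearly with the gain: a tenfold larger gain yields a tenfold larger drift for the same noise realization.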

7. Conclusions

In this paper, a distributed robust control scheme is proposed to solve the polynomial path tracking problem for a swarm of uncertain AUVs facing three application challenges. First, the communication network is unreliable, satisfying merely the jointly connected condition. Second, only the relative position measurement between neighboring AUVs over the communication network is available for control feedback. Third, the second-order model dynamics of the AUV contain uncertain system parameters. To address these issues, the distributed robust control scheme was built from three parts, namely, the distributed observer, the pseudo position estimator and the certainty equivalent robust internal model control law. Simulation results have validated the effectiveness of the proposed control scheme, especially its robustness against the unreliable and switching communication network compared with existing results. Moreover, the effect of measurement noise on the system performance was discussed, and simulation results showed that the proposed control scheme exhibits a certain resiliency with respect to velocity and relative position measurement noise. In this paper, it is assumed that the AUVs are fully actuated, which results in a linear system dynamic model. In the future, it would be interesting to consider the case of underactuated AUVs with complex nonlinear system dynamics.

Author Contributions

Conceptualization, H.G. and H.C.; methodology, W.L. and H.C.; software, W.L. and Z.G.; validation, W.L. and Z.G.; investigation, H.G. and H.C.; writing—original draft preparation, H.G. and H.C.; visualization, W.L. and Z.G.; supervision, H.C.; funding acquisition, H.G. and H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the National Natural Science Foundation of China under grant numbers 62173149 and 62276104, and in part by the Guangdong Natural Science Foundation under grant numbers 2021A1515012584 and 2022A1515011262.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Ren, W.; Beard, R. Distributed Consensus in Multi-Vehicle Cooperative Control: Theory and Applications; Springer: London, UK, 2008.
2. Ahn, H.S. Formation Control: Approaches for Distributed Agents; Springer: Cham, Switzerland, 2020.
3. Ren, W.; Chen, F. Distributed Average Tracking in Multi-Agent Systems; Springer: Cham, Switzerland, 2020.
4. Dong, X.; Zhou, Y.; Ren, Z.; Zhong, Y. Time-varying formation control for unmanned aerial vehicles with switching interaction topologies. Control Eng. Pract. 2016, 46, 26–36.
5. Matveev, A.S.; Semakova, A.A. Distributed 3D Navigation of Swarms of Non-Holonomic UAVs for Coverage of Unsteady Environmental Boundaries. Drones 2022, 6, 33.
6. Zhou, Y.; Rao, B.; Wang, W. UAV Swarm Intelligence: Recent Advances and Future Trends. IEEE Access 2020, 8, 183856–183878.
7. Liu, T.; Jiang, Z.P. Distributed formation control of nonholonomic mobile robots without global position measurements. Automatica 2013, 49, 592–600.
8. Zheng, R.; Liu, Y.; Sun, D. Enclosing a target by nonholonomic mobile robots with bearing-only measurements. Automatica 2015, 53, 400–407.
9. Feng, Z.; Hu, G.; Sun, Y.; Soon, J. An overview of collaborative robotic manipulation in multi-robot systems. Annu. Rev. Control 2020, 49, 113–127.
10. Kamel, M.A.; Yu, X.; Zhang, Y. Formation control and coordination of multiple unmanned ground vehicles in normal and faulty situations: A review. Annu. Rev. Control 2020, 49, 128–144.
11. Balch, T.; Arkin, R. Behavior-based formation control for multirobot teams. IEEE Trans. Robot. Autom. 1998, 14, 926–939.
12. Lawton, J.; Beard, R.; Young, B. A decentralized approach to formation maneuvers. IEEE Trans. Robot. Autom. 2003, 19, 933–941.
13. Antonelli, G.; Arrichiello, F.; Chiaverini, S. Experiments of Formation Control With Multirobot Systems Using the Null-Space-Based Behavioral Control. IEEE Trans. Control Syst. Technol. 2009, 17, 1173–1182.
14. Xiao, H.; Li, Z.; Philip Chen, C.L. Formation Control of Leader–Follower Mobile Robots’ Systems Using Model Predictive Control Based on Neural-Dynamic Optimization. IEEE Trans. Ind. Electron. 2016, 63, 5752–5762.
15. Dai, S.L.; He, S.; Cai, H.; Yang, C. Adaptive Leader–Follower Formation Control of Underactuated Surface Vehicles with Guaranteed Performance. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 1997–2008.
16. Dong, X. Formation and Containment Control for High-Order Linear Swarm Systems; Springer: Berlin/Heidelberg, Germany, 2016.
17. Petillot, Y.R.; Antonelli, G.; Casalino, G.; Ferreira, F. Underwater Robots: From Remotely Operated Vehicles to Intervention-Autonomous Underwater Vehicles. IEEE Robot. Autom. Mag. 2019, 26, 94–101.
18. González-García, J.; Gómez-Espinosa, A.; Cuan-Urquizo, E.; García-Valdovinos, L.G.; Salgado-Jiménez, T.; Cabello, J.A.E. Autonomous Underwater Vehicles: Localization, Navigation, and Communication for Collaborative Missions. Appl. Sci. 2020, 10, 1256.
19. Tholen, C.; El-Mihoub, T.A.; Nolle, L.; Zielinski, O. Artificial Intelligence Search Strategies for Autonomous Underwater Vehicles Applied for Submarine Groundwater Discharge Site Investigation. J. Mar. Sci. Eng. 2022, 10, 7.
20. Das, B.; Subudhi, B.; Pati, B.B. Cooperative formation control of autonomous underwater vehicles: An overview. Int. J. Autom. Comput. 2016, 13, 199–225.
21. Hadi, B.; Khosravi, A.; Sarhadi, P. A Review of the Path Planning and Formation Control for Multiple Autonomous Underwater Vehicles. J. Intell. Robot. Syst. 2021, 101, 1–26.
22. Yang, Y.; Xiao, Y.; Li, T. A Survey of Autonomous Underwater Vehicle Formation: Performance, Formation Control, and Communication Capability. IEEE Commun. Surv. Tutor. 2021, 23, 815–841.
23. Li, J.H.; Kang, H.; Kim, M.G.; Lee, M.J.; Cho, G.R.; Jin, H.S. Adaptive Formation Control of Multiple Underactuated Autonomous Underwater Vehicles. J. Mar. Sci. Eng. 2022, 10, 1233.
24. Li, J.; Du, J.; Chang, W.J. Robust time-varying formation control for underactuated autonomous underwater vehicles with disturbances under input saturation. Ocean Eng. 2019, 179, 180–188.
25. Li, L.; Li, Y.; Zhang, Y.; Xu, G.; Zeng, J.; Feng, X. Formation Control of Multiple Autonomous Underwater Vehicles under Communication Delay, Packet Discreteness and Dropout. J. Mar. Sci. Eng. 2022, 10, 920.
26. Li, J.; Du, J.; Lewis, F.L. Distributed three-dimension time-varying formation control with prescribed performance for multiple underactuated autonomous underwater vehicles. Int. J. Robust Nonlinear Control 2021, 31, 6272–6287.
27. Wei, H.; Shen, C.; Shi, Y. Distributed Lyapunov-Based Model Predictive Formation Tracking Control for Autonomous Underwater Vehicles Subject to Disturbances. IEEE Trans. Syst. Man Cybern. Syst. 2021, 51, 5198–5208.
28. Yan, Z.; Zhang, C.; Tian, W.; Zhang, M. Formation trajectory tracking control of discrete-time multi-AUV in a weak communication environment. Ocean Eng. 2022, 245, 110495.
29. Chen, Y.; Guo, X.; Luo, G.; Liu, G. A Formation Control Method for AUV Group Under Communication Delay. Front. Bioeng. Biotechnol. 2022, 10.
30. Yuan, C.; Licht, S.; He, H. Formation Learning Control of Multiple Autonomous Underwater Vehicles With Heterogeneous Nonlinear Uncertain Dynamics. IEEE Trans. Cybern. 2018, 48, 2920–2934.
31. Yan, Z.; Zhang, M.; Zhang, C.; Zeng, J. Decentralized formation trajectory tracking control of multi-AUV system with actuator saturation. Ocean Eng. 2022, 255, 111423.
32. Cai, H.; Su, Y.; Huang, J. Cooperative Control of Multi-Agent Systems: Distributed-Observer and Distributed-Internal-Model Approaches; Springer: Cham, Switzerland, 2022.
33. Wang, X. Distributed formation output regulation of switching heterogeneous multi-agent systems. Int. J. Syst. Sci. 2013, 44, 2004–2014.
34. Hua, Y.; Dong, X.; Hu, G.; Li, Q.; Ren, Z. Distributed Time-Varying Output Formation Tracking for Heterogeneous Linear Multiagent Systems with a Nonautonomous Leader of Unknown Input. IEEE Trans. Autom. Control 2019, 64, 4292–4299.
35. Hua, Y.; Dong, X.; Li, Q.; Ren, Z. Distributed adaptive formation tracking for heterogeneous multiagent systems with multiple nonidentical leaders and without well-informed follower. Int. J. Robust Nonlinear Control 2020, 30, 2131–2151.
36. Li, W.; Chen, Z.; Liu, Z. Formation control for nonlinear multi-agent systems by robust output regulation. Neurocomputing 2014, 140, 114–120.
37. Huang, X.; Dong, J. Reliable Leader-to-Follower Formation Control of Multiagent Systems Under Communication Quantization and Attacks. IEEE Trans. Syst. Man Cybern. Syst. 2020, 50, 89–99.
38. Fiorelli, E.; Leonard, N.E.; Bhatta, P.; Paley, D.A.; Bachmayer, R.; Fratantoni, D.M. Multi-AUV Control and Adaptive Sampling in Monterey Bay. IEEE J. Ocean. Eng. 2006, 31, 935–948.
39. Huang, J. Nonlinear Output Regulation: Theory and Applications; SIAM: Philadelphia, PA, USA, 2004.

Figure 1. Information flow among different parts of the distributed robust control scheme. “VL” denotes virtual leader. “

C M_{j}

”, “

D O_{j}

”, “

P P E_{j}

” and “

I M C_{j}

” denote the communication module, distributed observer, pseudo position estimator and internal model control for the jth AUV, respectively. The communication module is in charge of information exchange over the communication network.

Figure 2. AUV fleet cruises in the ocean in a static triangular formation while following the reference trajectory of a straight line.

Figure 3. The switching topology of the communication network

{\bar{G}}_{σ (t)}

.

Figure 4. Performance of the distributed observer.

Figure 5. Performance of the pseudo position estimator.

Figure 6. Absolute tracking errors of all of the AUVs.

Figure 7. Three dimensional trajectories of all of the AUVs.

Figure 8. The reliable and connected communication network

\bar{G}

.

Figure 9. Performance of the control method in [29] under communication network

\bar{G}

.

Figure 10. Performance of the control method in [29] under communication network

{\bar{G}}_{σ (t)}

with

T_{c} = 0.1

s.

Figure 11. Performance of the control method in [29] under communication network

{\bar{G}}_{σ (t)}

with

T_{c} = 0.3

s.

Figure 12. Performance of the control method proposed in this paper under communication network

{\bar{G}}_{σ (t)}

with

T_{c} = 0.1

s.

Figure 13. Performance of the control method proposed in this paper under communication network

{\bar{G}}_{σ (t)}

with

T_{c} = 0.3

s.

Figure 14. Performance of the pseudo position estimator subject to velocity measurement noise with

n_{V} = 1

.

Figure 15. Performance of the robust internal model control law subject to velocity measurement noise with

n_{V} = 1

.

Figure 16. Performance of the pseudo position estimator subject to relative position measurement noise with

n_{P} = 0.1

.

Figure 17. Performance of the robust internal model control law subject to relative position measurement noise with

n_{P} = 0.1

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gao, H.; Li, W.; Cai, H.; Gu, Z. Distributed Path Tracking for Autonomous Underwater Vehicles Based on Pseudo Position Feedback. J. Mar. Sci. Eng. 2022, 10, 1477. https://doi.org/10.3390/jmse10101477
