Trajectory Modeling by Distributed Gaussian Processes in Multiagent Systems

Dongjin Xin; Lingfeng Shi

doi:10.3390/s22207887

and

School of Electronic Engineering, Xidian University, Xi’an 710071, China

^*

Author to whom correspondence should be addressed.

Sensors2022, 22(20), 7887;https://doi.org/10.3390/s22207887

This article belongs to the Collection Signal Processing, Control, and Estimation for Intelligent Sensor Systems

Version Notes

Order Reprints

Review Reports

Abstract

This paper considers trajectory a modeling problem for a multi-agent system by using the Gaussian processes. The Gaussian process, as the typical data-driven method, is well suited to characterize the model uncertainties and perturbations in a complex environment. To address model uncertainties and noises disturbances, a distributed Gaussian process is proposed to characterize the system model by using local information exchange among neighboring agents, in which a number of agents cooperate without central coordination to estimate a common Gaussian process function based on local measurements and datum received from neighbors. In addition, both the continuous-time system model and the discrete-time system model are considered, in which we design a control Lyapunov function to learn the continuous-time model, and a distributed model predictive control-based approach is used to learn the discrete-time model. Furthermore, we apply a Kullback–Leibler average consensus fusion algorithm to fuse the local prediction results (mean and variance) of the desired Gaussian process. The performance of the proposed distributed Gaussian process is analyzed and is verified by two trajectory tracking examples.

Keywords:

trajectory modeling; data-driven approach; distributed Gaussian processes; Lyapunov function; model predictive control (MPC)

1. Introduction

Trajectory tracking is a common problem in control and robotics, and its generation systems represent a large class of dynamical physical models. In the past few decades, various control schemes have been investigated and modeled, and most of them can be considered as a subset of computed torque control laws [1]. Generally speaking, in order to track trajectories, one needs to know the system model, such as the kinematic model, observation model, and motion model [2]. However, in many practical applications, one usually cannot obtain the model information/knowledge, or the system model is dynamical and is difficult to characterize. The system model is often filled with a high degree of uncertainty, nonlinearity, and dependency, which makes it difficult to model accurately. Therefore, traditional modeling methods are no longer suitable for the actual dynamical environment [3]. More recently, data-driven approaches are getting more and more attention in many fields, such as the control and machine learning communities [4,5]. Since data-driven methods can train the system model with high efficiency and precision, they have become the most popular choice for system modeling [6,7]. In particular, the Gaussian process (GP) is the most representative one, and it has been successfully applied to many fields.

A research frontier in the realm of GP is the trajectory modeling issue. Due to its capability to tackle complex perturbations, uncertainties, dependencies, and nonlinearities, GP is becoming a popular choice in various systems, such as in solar power forecasting [8], permanent magnet spherical motors [9], iterative learning control [10], and in swarm kinematic model [11]. In particular, GP has been proven to be effective in improving the learning accuracy and the learning effectiveness of uncertainties and dependencies in low data regimes [12]. More recently, a non-parametric Gaussian process (GP) was proposed for modeling with quantifiable uncertainty and nonlinearity [13,14] based on implicit variance trade-off [15,16]. This bridges the system modeling and data-driven methods. However, computational burden and hardware requirements make GP impractical for big data sets. Furthermore, the high cost of GP also severely hinders the application to an actual physical system. The engineering community has acknowledged these limitations and has attempted to address the problem. Since one can decompose the learning process into a part for a solution, this inspires one to address it in a distributed manner. Accordingly, a distributed GP is an urgent need [17,18].

Generally speaking, the processing is called distributed manner if it is carried out by a cooperative strategy among nodes without central coordination [19]. The distributed method aims at minimizing the amount of computation and communication required by each node as well as making these requirements scalable in the number of nodes [20]. Distributed methods are available for parameter estimation [14], Kalman filtering [21], control [22], optimization [23], learning [13], etc. A major division among distributed methods is based on whether all nodes estimate the full system state [24] or whether each node only estimates a subset of the state variables [25]. The challenge consists in how to execute the update and fusion step in a distributed manner. Existing fusion strategies are usually from the perspective of state estimation and estimation error covariance. Since GP is indeed a Gaussian probability density function (PDF), the trajectory model constructed by GP requires us to consider fusion strategy from the view of PDF [26]. Therefore, this paper is targeted to design a novel GP fusion strategy for multi-agent systems. Generally, the strategy is organized as follows: after obtaining the local predicted results of GP, we perform a fusion of the Kullback–Leibler average consensus on local predictions of GP among neighbors. The distributed GP model can then be developed and successfully applied in large-scale multi-agent systems.

1.1. Related Works

Gaussian process-based modeling and based trajectory tracking have been widely investigated and applied over the past two decades. In the first place, most focus on the centralized GP and the multi-input–output GP. In addition, they are developed based on the need for engineering applications in learning and control fields such as GP-based tracking control, state space model learning, and their applications to trajectory tracking. For example, Beckers et al. studied the stable Gaussian process-based tracking control of Lagrangian systems [1]. Umlauft et al. learned stable Gaussian process state space models [27], while Mohammad et al. learned stable nonlinear dynamical systems with Gaussian mixture models [5]. In addition, Pushpak et al. designed control barrier functions for unknown nonlinear systems using Gaussian processes [28]. Umlauft et al. considered human motion tracking with stable Gaussian process state space models in ref. [29] and proposed an uncertainty-based control Lyapunov approach for control-affine systems modeled by the Gaussian process [30]. They also calculated uniform error bounds of Gaussian process regression with application to safe control [31]. Even Gaussian process-based trajectory tracking and control are becoming research hotspots; they focus mainly on one agent and are seldom involved in multi-agent systems. In the second place, distributed and centralized GPs are flourishing in solving data-driven learning algorithms for multi-agent systems. Generally speaking, the main research results are organized as follows: (1) For contributions of models and theories, the unknown map function was modeled and characterized as a GP but with zero-mean assumption, and a distributed parameter and non-parameter Gaussian regression was proposed by using Karhunen–Loeve expansion in refs. [13,14]. To scale GP to large datum, Deisenroth et al. introduced a robust Bayesian committee machine, a practical and scalable product-of-experts model for large-scale distributed GP regression [32]. To address the hyperparameter optimization problem in big data processing, Xie et al. proposed an alternative distributed GP hyperparameter optimization scheme using the efficient proximal alternating direction method of multipliers [33]. Multiple-task GP was studied in ref. [34], while multi-out regression by GP was studied in ref. [35]. Both of them were centralized approaches and could not be extended to a large-scale problem. GP networks were flexible and effective to be used in multi-output regression by combining with variational inference and distributed variational inference in ref. [36], which involved applications to settle non-linear dimension reduction and regression, and provided a powerful tool to address uncertainty and over-fitting problems. (2) For engineering applications, Nerurkar et al. [37] presented a distributed conjugate gradient algorithm for cooperative localization. Franceschelli and Gasparri [38] presented a distributed gossip-based approach to address the pose estimation problem. Cunnigham et al. [39] developed an approach for robot smoothing and mapping by using Gaussian elimination. Distributed localization from distance measurements is studied in [40]. The distributed position estimation was considered in [41]. Distributed rotation estimation algorithm was developed in various engineering [42,43,44]. Distributed Gauss–Seidel algorithm was studied in [45]. GP for data learning in robotic control was considered in [46]. (3) For trajectory tracking in a multi-agent system, an efficient algorithm was presented in ref. [47] to generate trajectory. Gaussian mixture models were used to learn stable trajectory in ref. [5]. The centralized GP for human motion tracking was studied [48]. (4) For distributed model predictive control (MPC), an overview and future research opportunities were discussed in ref. [49]. A cooperative distributed model predictive control for nonlinear systems was studied in [50], for tracking was studied in ref. [51], for linear systems was studied in ref. [52], and for event-based communication and parallel optimization, it was developed in ref. [53]. Additionally, non-cooperative distributed model predictive control was investigated in ref. [54]. More recently, explicit distributed and localized model predictive control via system-level synthesis was investigated in refs. [55,56] and was applied to the trajectory generation of a multi-agent system in ref. [57]. In short, the study of distributed GP is scarce, especially in the trajectory modeling problem.

1.2. Contributions

More recently, GP was widely used to model tracking systems and applied to track the target in a real-world environment, such as speed racing (quadrotors) [58], trajectory tracking for wheeled mobile robots [59], 3D people tracking [60], and Simultaneous Localization and Mapping (SLAM) [61,62]. However, these applications focus on one agent, which ignores the advantages of multi-agent systems. After surveying these related references, we find that the trajectory tracking problem is mainly solved by control methods, not data-driven methods, and we focus on one agent, not a multi-agent collaboration. In addition, these existing GP-based learning algorithms are limited by training manners. Motivated by the above discussion, we investigate the distributed GP to learn the trajectory system model in this paper. More specifically, the main contributions of the paper are four-fold. (1) Compared with GP in Lagrangian systems [1,58], this paper considers a general state-space model for both discrete-time and continuous-time. (2) Compared with centralized GP in the state space model [27,28,29,30], PD control and model predictive control are combined together with GP to achieve tracking system modeling and estimating, which can make the estimation error globally uniformly bounded. (3) Compared with existing multiple GP-based centralized approaches, such as collaborative GP [32,33,34,35,63,64], this paper achieves a distributed GP manner to estimate the state, and we apply Kullback–Leibler (KL) average consensus to fuse local training results of GPs, which is different with the Wasserstein metric for measuring GP [65]. (4) Compared with the centralized GP without giving the performance bound [32,33,34,35,63,64] or only providing Kullback–Leibler average analysis [26], this paper analyzes the probabilistically globally ultimate bound of distributed GP.

1.3. Paper Structure

The remainder of the paper is organized as follows. Section 2 introduces some preliminaries, including notations, graph theory, Gaussian process, and Kullback–Leibler average consensus. Section 3 states the considered systems. Section 4 designs the local control strategy and proposes a Kullback–Leibler (KL) average consensus to fuse the local predictions of GP. Section 5 provides two tracking experiments. Finally, Section 6 concludes the paper.

2. Preliminaries

2.1. Notation

Throughout the paper, vectors and vector-valued functions are denoted with bold characters. Matrices are described with capital letters.

t r a c e (\cdot)

,

l o g (\cdot)

,

d e t (\cdot)

,

⟨ \cdot, \cdot ⟩, ∥ \cdot ∥

,

\oplus

,

⊙

,

N (\cdot, \cdot)

, and

G P (\cdot, \cdot)

denote, respectively, the trace operation, the logarithm operation, the determinate operation, the inner product, the 2-norm of a matrix or vector, the addition operation of probability density functions (PDFs), the multiplication operation of PDFs, a Gaussian distribution, and a Gaussian process. Moreover,

\dot{x}

,

\ddot{x}

,

\hat{x}

, and

\bar{x}

denote, respectively, the first-order differential operation on

x

, the second-order differential operation, the prediction, and the mean operation of

x

. In addition,

D_{KL} (p ∥ q)

,

E

V

,

I_{n}

denote, respectively, the KL divergence/distance between probabilities

p

and

q

, the expectation operation, the variance operation, and an n-by-n identity matrix. In addition,

\frac{d x}{d u}

,

\frac{\partial x}{\partial u}

, and

O (\cdot)

denote, respectively, the derivative operation, the partial derivative operation, and the complexity.

2.2. Graph Theory

A graph is defined as

G = (N, ε)

; where

N

is a set of nodes and

ε \subseteq N \times N

a set of edges. In particular, graph

G

is undirected iff

(u, v) \in ε \Leftrightarrow (v, u) \in ε

for all

u, v \in N

. The order is

| N |

and the size of

G

is

| ε |

. Further, let

N^{i} = {j \in N : (j, i)} \in ε

denote the set of neighbors for node

i

.

2.3. Gaussian Process

Definition 1.

([66]). A Gaussian process is a collection of random variables, any finite number of which have a joint Gaussian distribution.

A Gaussian process is completely specified by its mean function and covariance function. We define mean function

m (x) : x \in χ \to ℝ

and the covariance function (kernel)

k (x, x^{'}) : χ \times χ \to ℝ

of a real process

f (x)

as

m (x) = E (f (x))

,

k (x, x^{'}) = E [(f (x) - x) (f (x^{'}) - x^{'})]

, and denote the Gaussian process as

f (x) \sim G P (m (x), k (x, x^{'}))

. When

f : χ \to ℝ^{n}

is an n-dimensional map, the GP can be denoted by

f_{j} (x) \sim G P (m (x), k (x, x^{'})), j \in {1, \dots, n}

.

Then, the Gaussian process can be organized as [29]

f (x) = {\begin{matrix} f_{1} (x) \sim G P (m_{1} (x), k_{1} (x, x^{'})) \\ ⋮ \\ f_{n} (x) \sim G P (m_{n} (x), k_{n} (x, x^{'})) \end{matrix} .

(1)

In addition, the covariance function (kernel) measures similarity between any two states/variables

x, x^{'} \in χ

and the common kernel functions include the linear kernel, squared-exponential (SE) kernel, the polynomial kernel, the Gaussian kernel, and the Matèrn kernel.

Assumption 1

. Suppose the measurement equation is

y = f (x) + ϵ

, where

y \in ℝ^{m}

is the observed vector,

x \in X \subset ℝ^{n}

is the state vector defined in a compact set

X

, and

ϵ

is the measurement noise obeying a Gaussian distribution with zero mean and variance

σ^{2} I_{n}

(denoted by

ϵ \sim N (0, σ^{2} I_{n})

). In addition,

f

is the unknown mapping function (

f : χ \to ℝ^{n}

) and is assumed to be a GP (denoted by

f \sim G P (0, k_{θ} (x, x^{'}))

). Here

k_{θ} (x, x^{'})

is a kernel function with respect to hyper-parameters

θ

,

K_{X X} = k_{θ} (X, X)

denotes the covariance matrix of set

X

, and

K_{x X} = k_{θ} (x, X)

denotes the covariance matrix between

x

and

X

.

Generally speaking, given a training set

D = (X, y)

, (input

X

and output

y

) [29], the log-likelihood can be computed by

\begin{array}{l} \log p (y | X, θ) = - \frac{1}{2} y^{T} {(K_{X X} + σ^{2} I_{n})}^{- 1} y \\ - \frac{1}{2} \log | K_{X X} + σ^{2} I_{n} | - \frac{n}{2} \log 2 π . \end{array}

(2)

Then, when a new input

x^{*}

is introduced, the posterior prediction of the Gaussian process [29] is

p (f (x^{*})) = N (μ (x^{*}), \sum (x^{*}))

, where

\begin{matrix} μ (x^{*}) = K_{x^{*} X} {(K_{X X} + σ^{2} I_{n})}^{- 1} y, \\ \sum (x^{*}) = K_{x^{*} X} - K_{x^{*} X} {(K_{X X} + σ^{2} I_{n})}^{- 1} K_{X x^{*}} . \end{matrix}

(3)

To summarize, the likelihood maximization of (2) is performed to compute gradients for training, and the mean and covariance functions (3) are used for fast predictions.

More specifically, given an arbitrary new testing input

x^{*} \in χ

conditioning a dataset

D

described above, the prediction response

y^{*}

is jointly Gaussian distributed with the training set, which is given by

\begin{array}{l} [\begin{matrix} y_{j}^{*} \\ y_{j} \end{matrix}] \sim N ([\begin{matrix} m_{j} (x^{*}) \\ m_{j} \end{matrix}], [\begin{matrix} k_{j}^{*} & k_{j}^{T} \\ k_{j} & K_{j} + σ^{2} I \end{matrix}]), \\ y_{j} = {[\begin{matrix} y_{j}^{(1)} & \dots & y_{j}^{(N)} \end{matrix}]}^{T} \in ℝ^{n}, \\ m_{j} = {[m_{j} (x_{(1)}) \dots m_{j} (x_{(N)})]}^{T} \in ℝ^{n}, \\ k_{j}^{*} = k_{j} (x^{*}, x^{*}) \in ℝ, \\ k_{j} = {[k_{j} (x_{(1)}, x^{*}) \dots k_{j} (x_{(N)}, x^{*})]}^{T}, \\ K_{j} = [\begin{matrix} k_{j} (x_{(1)}, x_{(1)}) & \dots & k_{j} (x_{(1)}, x_{(N)}) \\ ⋮ & ⋱ & ⋮ \\ k_{j} (x_{(N)}, x_{(1)}) & \dots & k_{j} (x_{(N)}, x_{(N)}) \end{matrix}] \in ℝ^{n \times n} . \end{array}

(4)

For j = 1,…, n, the posterior distribution corresponding to fj(·) at x^∗ yields a Gaussian distribution with mean function and covariance function as

\begin{array}{l} E [y_{j}^{*} | D, x^{*}] = m_{j} (x^{*}) + k_{j}^{T} {(K_{j} + σ^{2} I_{n})}^{- 1} (y_{j} - m_{j}), \\ V [y_{j}^{*} | D, x^{*}] = k_{j}^{*} - k_{j}^{T} {(K_{j} + σ^{2} I_{n})}^{- 1} k_{j} . \end{array}

(5)

Furthermore, in order to learn the hyper-parameter

θ_{j}

given a chosen kernel, we can use the maximum likelihood function based on Bayes’ rules as

\max_{θ_{j}} \log p (y_{j} | x_{(1 : n)}, θ_{j}) = \max_{θ_{j}} (- \frac{1}{2} y_{j}^{T} K_{j}^{- 1} y_{j} - \frac{1}{2} \log d e t K_{j} - \frac{n}{2} \log (2 π)) .

(6)

which can be solved by gradient-based approaches [29].

2.4. Kullback–Leibler Average Consensus Algorithm

This section introduces the consensus/fusion algorithm of GPs. A Gaussian process is a Gaussian probability density function over mean function and covariance function. Therefore, the fusion of GPs is indeed the fusion of probabilities. It raises a problem: How to achieve consensus/fusion of probabilities among multiple agents?

Before proceeding on, we first introduce some definitions.

Definition 2.

(Probability space [26]). Let

P ≜ {p (•) : ℝ^{n} \to ℝ such that \int_{ℝ^{n}} p (x) d x = 1 and p (x) \geq 0, \forall x \in ℝ^{n}}

denote the set of probabilities (PDFs) over

ℝ^{n}

and let

p^{i} (\cdot) \in P (i \in N)

denote the local probability/PDF of agent

i

.

Definition 3.

(Kullback–Leibler divergence [26]). In statistics, the Kullback–Leibler divergence,

D_{K L} (p ∥ q)

(also called relative entropy), is a statistical distance: a measure of the probability distribution p(·) is different from the probability distribution q(·), which is defined as (for distributions

p (\cdot)

and

q (\cdot)

of a continuous random variable

x

)

D_{K L} (p ∥ q) = \int p (x) \log \frac{p (x)}{q (x)} d x .

(7)

Definition 4.

(Probabilistic operation [26]). Define the plus

\oplus

and multiplicative

⊙

operators over probabilities (

p (\cdot)

and

q (\cdot)

) for a variable (

x

) and a real constant as a

\begin{matrix} p (x) \oplus q (x) ≜ \frac{p (x) q (x)}{\int p (x) q (x) d x}, \\ a ⊙ p (x) ≜ \frac{{[p (x)]}^{a}}{\int {[p (x)]}^{a} d x} . \end{matrix}

(8)

Then, we attempt to find a Kullback–Leibler average consensus/fusion algorithm over probabilities obtained by multiple agents.

First, according to [26], Kullback–Leibler average (KLA) is to average over probabilities. Motivated by this, we define the weighted KLA (

\bar{p}

) among the probabilities

{p^{i}}_{i = 1}^{N}

as

\bar{p} = \underset{p \in P}{a r g \inf} \sum_{i \in N} a^{i} D_{KL} (p ∥ p^{i}),

(9)

where

a^{i} \geq 0

denotes the weight of agent

i

and satisfies

\sum_{i \in N} a^{i} = 1

.

Then, the average consensus/fusion problem is to achieve

\lim_{l \to \infty} p_{l}^{i} = \bar{p},

(10)

for all agents

i \in N

, where l is the consensus step and

\bar{p}

represents the asymptotic KLA with uniform weights.

Second, we attempt to find the solution of the average consensus

\bar{p}

in (9). Based on [26], the solution is

\bar{p} (x) = \frac{\prod_{i \in N} {[p^{i} (x)]}^{a^{i}}}{\int \prod_{i \in N} {[p^{i} (x)]}^{a^{i}} d x} ≐ \underset{i \in N}{\oplus} (a^{i} ⊙ p^{i} (x)),

(11)

with

a^{i} = 1 / | N |

. In addition, the local consensus of agent

i

at the l-th consensus step can be obtained by

p_{l}^{i} (x) = \underset{i \in N}{\oplus} (a^{i, j} ⊙ p_{l - 1}^{j} (x)), \forall i \in N,

(12)

where

a^{i j}

is the consensus weight satisfying

a^{i j} \geq 0

,

\sum_{i \in N} a^{i, j} = 1

and

a^{i j}

represents the (

i, j

)-th component of the consensus matrix

A

(if

j \notin N^{i} a^{i, j} = 0

). Therefore, when the

l

-th iteration of the consensus algorithm is initialized by

p_{0}^{i} (\cdot) = p^{i} (\cdot)

, we can finally obtain the consensus as

p_{l}^{i} (x) = \underset{i \in N}{\oplus} (a_{l}^{i, j} ⊙ p^{j} (x)), \forall i \in N

(13)

Third, for special Gaussian case, the local probability

p^{i} (\cdot)

takes the form as

p^{i} (x) = N (x; μ^{i}, \sum^{i}) ≜ \frac{1}{\sqrt{d e t (2 π \sum^{i})}} e^{- \frac{1}{2} {(x - μ^{i})}^{T} {(\sum^{i})}^{- 1} (x - μ^{i})},

(14)

where

μ^{i} \in ℝ^{n}

and

\sum^{i} \in ℝ^{n \times n}

denote the mean vector and the covariance matrix, respectively. In view of this case, the KLA can be directly obtained by operating the means and the covariances instead of probabilities. The following lemma states the KLA on Gaussian distributions.

Lemma 1.

([26]). Given

N

Gaussian distributions

(p^{i} (x), i = 1, \dots, N)

defined in (14), with corresponding weigh

a^{i}

, then the weighted KLA

\bar{p} (\cdot) = N (\cdot; \bar{μ,} \sum^{¯})

can be calculated by directly fusing the means

μ^{i}

and the covariance matrices

\sum^{i}

as

\begin{matrix} {\sum^{¯}}^{- 1} = \sum_{i = 1}^{N} a^{i} {(\sum^{i})}^{- 1}, \\ {\sum^{¯}}^{- 1} \bar{μ} = \sum_{i = 1}^{N} a^{i} {(\sum^{i})}^{- 1} μ^{i} . \end{matrix}

(15)

Lemma 1 indicates that the consensus/fusion of Gaussian probabilities can directly operate their means and covariance matrices. Note that a Gaussian process is indeed a Gaussian probability. Therefore, the KLA consensus/fusion on GPs can be directly obtained by fusing the mean functions and the covariance functions.

2.5. Uniform Error Bounds

This section analyzes the probabilistic uniform error bounds.

Definition 5.

(Probabilistic uniform error bound [31]).

\forall x \in X

, if there exists a function

η (x)

such that

∥ μ (x) - f (x) ∥ \leq η (x)

, then, on a compact set

X \subset ℝ^{n}

, GP has a uniformly bounded error. A probabilistic uniform error bound is one that holds with a probability of at least

1 - δ

for any

δ \in (0, 1)

.

Definition 6.

(Lipschitzconstant of the kernel [64]). The Lipschitz constant of a differentiable covariance kernel

k (\cdot, \cdot)

is

L_{k} : = \underset{x, x^{'} \in X}{m a x} ‖ {[\begin{matrix} \frac{\partial k (x, x^{'})}{\partial x_{1}} & \dots & \frac{\partial k (x, x^{'})}{\partial x_{n}} \end{matrix}]}^{T} ‖ .

(16)

Next, we show that the posterior prediction (3) of GP is continuous. Given the continuous unknown

f

with Lipschitz constant

L f

and the Lipschitz continuous kernel

k

with Lipschitz constant

L_{k}

, we then have the following theorem.

Theorem 1.

([31]). Given a GP defined by the continuous covariance kernel function

k

with Lipschitz constant

L_{k}

, a continuous unknown map

f

with Lipschitz constant

L_{f}

and measurements satisfying Assumption 1. Then, the posterior predictions

(μ (\cdot) a n d \sum (\cdot))

of the GP conditioning on the training date set

D = {x_{t}, y_{t}^{i}}_{t = 1, \dots, N}^{i = 1, \dots, N}

are continuous with Lipschitz constant

L_{μ}

and modulus of continuity

ω_{\sum}

such that

\begin{matrix} L_{μ} \leq L_{k} \sqrt{N} | {(K + σ^{2} I)}^{- 1} y |, \\ ω_{\sum (τ)} \leq \sqrt{2 τ L_{k} (1 + N | {(K + σ^{2} I)}^{- 1} | \underset{x, x^{'} \in X}{m a x} k (x, x^{'}))} . \end{matrix}

(17)

for any

τ \in ℝ_{+}

and

δ \in (0, 1)

with

\begin{array}{l} β (τ) = 2 \log (\frac{M (τ, X)}{δ}), \\ γ (τ) = (L_{μ} + L_{f}) τ + \sqrt{β (τ)} ω_{\sum (τ)} . \end{array}

(18)

In addition,

\forall x \in X

, it follows that

p (‖ f (x) ‖ - μ (x) \leq γ (τ) + \sqrt{β (τ) \sum (x)}) \geq 1 - δ .

(19)

Proof of Theorem 1.

The proof is given in Appendix A. □

Asymptotic Analysis

The asymptotic analysis of the error bound (19) in the limit

N \to \infty

is organized as the following theorem.

Theorem 2.

([31]). Given a GP defined by the continuous covariance kernel function

k

with Lipschitz constant

L_{k}

, and an infinite data response of measurements

(X_{t}, y_{t}^{i})

of the continuous unknown map

f

with Lipschitz constant

L_{f}

and the maximum absolute value

∥ f^{m a x} ∥

. The first N measurements inform the posterior predictions of the GP as

(μ (\cdot) a n d \sum (\cdot))

. If there exists a

ϵ > 0

such that

\sum (x) \in O (\log {(N)}^{- \frac{1}{2} - ϵ})

,

\forall x \in X

for any

δ \in (0, 1)

, it follows that

p (\sup_{x \in X} ‖ f (x) - μ_{N} (x) ‖ \in O (\log {(N)}^{- \frac{1}{2} - ϵ})) \geq 1 - δ .

(20)

Proof of Theorem 2.

The proof is given in Appendix B. □

3. Problem Formulation

The trajectories generate from a continuous dynamical system

\dot{x} = f (x, u) + w,

(21)

where

x \in χ \subset ℝ^{n}

in a compact set,

χ

denotes the state (location),

u \in U \subseteq ℝ^{n}

denotes the control input,

ω

denotes the process noise with

w \sim N (0, σ_{ω}^{2} I)

and the initial state is

x (0) = x_{0}

. We have

N

agents/sensors connected with a network to acquire measurements (location or velocity). In particular, sensor

i

measures

y_{j}^{i} = f^{i} (x) + ϵ_{j}^{i},

(22)

where

y_{j}^{i}

is the observed vector of sensor

i (i = 1, \dots, N)

at the

j

-th step

(j = 1, \dots, N)

,

ϵ_{j}^{i}

is the measurement noise with

ϵ_{j}^{i} \sim N (0, σ^{2} I)

.

Suppose that a training data set

D

of trajectories is given.

D

contains the state (current location) and the measurement, which is denoted by

D = {x^{i}, y_{j}^{i}}_{i = 1 \dots N}^{j = 1 \dots N} .

The nonlinear map function

f : χ \to ℝ^{n}

is unknown and is assumed to be a Gaussian process. In addition, the following assumption is satisfied.

Assumption 2.

f (x)

Suppose the measurement is Lipschitz continuous and has a bounded RKHS (reproducing kernel Hilbert space) norm with respect to fixed common kernel

k

,

{∥ f ∥}_{k} = \sqrt{f, f_{k}} < \infty

.

The objective is to find an estimated

\hat{f}

of

f

, for which the output trajectory x tracks the desired trajectory

x_{d} = {[\begin{matrix} x_{d} & {\dot{x}}_{d} \end{matrix}]}^{T}

such that the tracking error

e = x - x_{d} = {[\begin{matrix} e_{1} & e_{2} \end{matrix}]}^{T}

vanishes over time, i.e.,

\underset{t \to \infty}{l i m} ∥ e ∥ = 0

. l. Since the noises

ω

and

ϵ_{j}^{i}

and the uncertain dynamics affect the system and control, we use multiple agents to eliminate the influence of stochastic uncertainty, i.e., given local

{\hat{f}}^{i}

, the goal is also to fuse them and to find a fused/consensus

\bar{f}

such that the uncertainty also vanishes over time.

4. Control Design and Analysis

Classical control uses static feedback gains. Low feedback gains are designed to avoid saturation of the actuators and good noise suppression. However, the considered unknown dynamics require a more minimal feedback gain to keep the tracking error under a defined limit. After performing a training procedure, we use the mean function of GP to adapt the gains. For this purpose, the uncertainty of the GP and multiple agents are employed to scale the feedback gains.

Before proceeding on, the following natural assumptions and lemmas are given.

Assumption 3.

The desired trajectory

x_{d} (t)

is bounded by

∥ x_{d} ∥ = {∥ [\begin{matrix} x_{d} & {\dot{x}}_{d} \end{matrix}]}^{T} ∥ \leq q_{d} = {[\begin{matrix} q_{d} & {\dot{q}}_{d} \end{matrix}]}^{T}

.

Lemma 2.

([67]) If there exist a positive constant

b \in ℝ_{+}

such that

\forall a \in ℝ_{+}

, if there exists a function

T = T (a, b)

satisfying

∥ x (t_{0}) ∥ \leq a

, then we have that

\forall t \geq t_{0} + T, ∥ x (t) ∥ \leq b

, i.e., the trajectory

x (t)

of the dynamics (21) is globally ultimately bounded.

Lemma 3.

([67]) If there exists a Lyapunov function

V

such that

\dot{V} (x) < 0

for all

x \in X \ B

, the dynamical system

\dot{x} = f (x, u)

is globally ultimately bounded to a set

B \subset X

.

Next, we design the controller and the control law such that stability and high-performance tracking are achieved. The controller is designed as

u = - \hat{f} (x) + ρ,

(23)

where

\hat{f}

is the model estimation of nonlinear dynamics

f

and

\hat{f}

is obtained by utilizing the posterior mean function

µ_{N}

of GP trained by the data set

D

, and

ρ

is the control law.

In addition,

ρ

is designed as a Proportional–Derivative (PD) type controller

ρ = {\ddot{x}}_{d} - k_{d} r - k_{p} e_{2},

(24)

where

r = k_{p} e_{1} + e_{2}

is the filtered state with

\dot{r} = f (x) - \hat{f} (x) - k_{c} r

,

k_{p} \in ℝ_{+}

, and the control gain

k_{p} \in ℝ_{+}

.

Given the above controller, one needs to verify the effectiveness of the model estimation

\hat{f}

and the choices of the parameters

k_{d}

and

k_{p}

. The following theorem states the control law with guaranteed boundedness of the tracking error.

Theorem 3.

Consider the system (23), where f satisfies Assumption 2 and admits a Lipschitz constant

L_{f}

. If Assumption 3 is satisfied, then the controller with

\hat{f} = µ_{N}

and the control law guarantee that the tracking error is globally ultimately bounded and converges to a ball

B = {∥ e ∥ \leq \frac{γ (τ) + \sqrt{β (τ) \sum_{N} (x)}}{k_{c} \sqrt{λ^{2} + 1}}}, \forall x \in X

, where

β

and

γ

are given in Theorem 1, with a probability of at least

1 - δ, δ \in (0, 1)

.

Proof of Theorem 3.

The proof is given in Appendix C. □

Remark 1.

From Theorem 3, it can be seen that trajectory tracking with high probability is achieved with the proposed GP-based controller. Compared with most existing results where only uniformly ultimate boundedness of the trajectory tracking errors was achieved [1,68], the proposed control law ensures high control precision in the presence of the estimation errors from GP.

Proof of Remark 1.

The proof is given in Appendix C. □

4.1. Consensus

The aforementioned control law focuses one agent/sensor. Since the noises

ω

and

ϵ_{j}^{i}

affect the measurements and the dynamical system, the proposed controller will fluctuate for different agents. Furthermore, since the uncertainty of dynamics exists, the proposed controller may also change for different agents. Therefore, this section will fuse them and make them reach a consensus, i.e., given local

{\hat{f}}^{i}

(

µ_{N}^{i}

and

\sum_{N}^{i}

), the goal is to fuse them and to find a fused/consensus

\bar{f} ({\bar{µ}}_{N} and {\sum^{¯}}_{N})

such that the uncertainty and the disturbance can vanish over time. Obviously, the controller (23) and the control law (24) for different agents can reach a consensus

\bar{f} ({\bar{µ}}_{N} and {\sum^{¯}}_{N})

.

More specially, after training the local

{\hat{f}}^{i}

by using GP, node

i

sends the result to its neighbors

N^{i}

. After collecting the training results from neighbors, it performs the following dynamic consensus/fusion step. Given weights

a^{i}

satisfying

a^{i} \geq 0

and

\sum_{i \in N^{i}} a^{i} = 1

based on the Kullback–Leibler average consensus given in Section 2.4, the desired weighted KLA takes the Gaussian form as

\bar{f} (\cdot) = N (\cdot; \bar{μ}, \sum^{¯})

, in which the fusion of the mean function

\bar{μ}

and the fusion of the covariance function

\sum^{¯}

can be calculated by

\begin{matrix} {\sum^{¯}}^{- 1} = \sum_{i = 1}^{M} a^{i} {(\sum_{N}^{i})}^{- 1}, \\ {\sum^{¯}}^{- 1} \bar{μ} = \sum_{i = 1}^{M} a^{i} {(\sum_{N}^{i})}^{- 1} μ_{N}^{i}, \end{matrix}

(25)

while the global/centralized fusion using

i = 1, \dots, N

. The flowchart is given in Figure 1.

Figure 1. The flowchart of consensus/fusion.

After obtaining the consensus mean function, the controllers of different agents can be designed to be a unified controller.

Remark 2.

The main advantage of the distributed method lies in that local nodes can only receive part of the training data or even missing data. Neighboring nodes can make the prediction faster and keep high accuracy through information interaction and consensus algorithm, which can also avoid processor failure caused by data loss or node/sensor failure.

4.2. GP-Based Model Predictive Control for Discrete-Time System

The above discussion discusses the continuous-time system. Usually, we need to discretize the system in an actual physical system. This section designs the control strategy for discrete-time by using GP-based model predictive control (MPC).

First, the considered system (21) is assumed to be discrete-time and can be modeled by GP, where the control tuple

x_{k} = {[\begin{matrix} x_{k} & u_{k} \end{matrix}]}^{T}

and the state difference

δ x_{k} = x_{k + 1} - x_{k}

are, respectively, designed as the training input and the desired target. Given the training date set

D = {x_{t}, y = δ x_{k}}_{k = 1 \dots N}

, according to (4) and (5), at a new training input

x^{*}

, we can obtain the mean function and covariance function as follows

\begin{array}{l} E [δ x_{k} | D, x^{*}] = k^{T} {(K + σ^{2} I_{N})}^{- 1} y, \\ V [δ x_{k} | D, x^{*}] = k^{*} - k^{T} {(K + σ^{2} I_{N})}^{- 1} k, \end{array}

(26)

where

k^{*} = k (x^{*}, x^{*})

,

k = {[\begin{matrix} k (x_{(1)}, x^{*}) & \dots & k (x_{(n)}, x^{*}) \end{matrix}]}^{T}

, and

K

is defined in (4). Therefore, (26) is given to predict the next step. By using the moment matching approach [46], the mean function and covariance function of the training target at time k can be calculated by

\begin{array}{l} μ_{k}^{δ} = E [E [δ x_{k}]], \\ \sum_{k}^{δ} = [\begin{matrix} k (δ x_{k_{1}}, δ x_{k_{1}}) & \dots & k (δ x_{k_{n}}, δ x_{k 1}) \\ ⋮ & ⋱ & ⋮ \\ k (δ x_{k_{1}}, δ x_{k_{n}}) & \dots & k (δ x_{k_{n}}, δ x_{k_{n}}) \end{matrix}] . \end{array}

(27)

At time

k + 1

, the mean and covariance functions are updated as

\begin{array}{l} μ_{k + 1} = μ_{k} + μ_{k}^{δ}, \\ \sum_{k + 1} = \sum_{k} + \sum_{k}^{δ} + k (x_{k}, δ x_{k}) + k (δ x_{k}, x_{k}) . \end{array}

(28)

For more details, please refer to [46].

Then, based on (6), we next attempt to learn the hyper-parameters

θ

. A distributed GP-based MPC scheme is presented to address this problem. First, we design the objective function as

J_{k} = \min_{u} E [V (x_{k}, u_{k - 1})],

(29)

where the cost function is

E [V (x, u)] = \sum_{l = 1}^{L} E [{(x_{k + l} - p_{k + l})}^{T} Q (x_{k + l} - p_{k + l}) + u_{k + l - 1}^{T} R u_{k + l - 1}],

(30)

where

p

is the desired trajectory (desired state),

Q

and

R

are positive definite weight matrices, and

L

is the prediction horizon and also the control horizon. According to GP in Section 2, (30) can be rewritten as

E [V (x, u)] = \sum_{l = 1}^{L} E [\begin{array}{l} {(u_{k + l} - p_{k + l})}^{T} Q (u_{k + l} - p_{k + l}) \\ + trace (Q \sum_{k + l}) + u_{k + l - 1}^{T} R u_{k + l - 1} \end{array}],

(31)

Next, to address the optimization problem (29), a gradient-based method is used. Set

F_{l} = {(u_{k + l} - p_{k + l})}^{T} Q (u_{k + l} - p_{k + l}) + trace (Q \sum_{k + l}) + u_{k + l - 1}^{T} R u_{k + l - 1}

and

E [V (x, u)] = \sum_{l = 1}^{L} E F_{l}

. Using the chain rule, the gradient can be calculated by

\begin{array}{l} \frac{d}{d u_{k - 1}} E [V (x_{k}, u_{k - 1})] = \sum_{l = 1}^{L} \frac{d F_{l}}{d u_{k + l - 1}}, \\ \frac{d F_{l}}{d u_{k + l - 1}} = \frac{\partial F_{l}}{\partial u_{k + l}} \frac{\partial u_{k + l}}{\partial u_{k + l - 1}} + \frac{\partial F_{l}}{\partial \sum_{k + l}} \frac{\partial \sum_{k + l}}{\partial u_{k + l - 1}} + \frac{\partial F_{l}}{\partial u_{k + l - 1}} . \end{array}

(32)

where

\frac{\partial F_{l}}{\partial u_{l}}, \frac{\partial F_{l}}{\partial \sum_{l}} and \frac{\partial F_{l}}{\partial u_{l - 1}}

are easy to calculate. In addition,

\begin{array}{l} \frac{\partial u_{k + l}}{\partial u_{k + l - 1}} = \frac{\partial u_{k + l}}{\partial u_{k + l - 1}} \frac{\partial u_{k + l - 1}}{\partial u_{k + l - 1}}, \\ \frac{\partial \sum_{k + l}}{\partial u_{k + l - 1}} = \frac{\partial \sum_{k + l}}{\partial \sum_{k + l - 1}} \frac{\partial \sum_{k + l - 1}}{\partial u_{k + l - 1}}, \end{array}

(33)

where

\frac{\partial u_{k + l - 1}}{\partial u_{k + l - 1}} and \frac{\partial \sum_{k + l - 1}}{\partial u_{k + l - 1}}

are easy to calculate.

Finally, the gradient-based algorithm is formulated as Algorithm 1.

Algorithm 1 Gradient-based optimization method

Input: learning GP,

L

,

p

,

Q

, and

R

Output: Optimal control

u^{*}

1: Initialization: Max iteration number

N = 1000

, threshold

ε = 10^{- 8}

, the initialized input

u_{0}

and optimal control

u^{*} = u_{0}

;

2: for

k = 1

to

N

do

3: if

E [V] < ε

then

4:

u^{*} = u_{k}

;

5: end Loop;

6: else

7: Calculate the gradient

\frac{d E [V (u_{k})]}{d u_{k - 1}}

by (32);

8: Update search step size based on [69];

9: Update control

u_{k + l} = u_{k} + α_{l} \frac{d E [V (u_{l})]}{d u_{l - 1}}

;

10 Go next

l \to l + 1

end
end

11: return Optimal control

u^{*}

.

Remark 3.

Similarly, due to the stochastic uncertainty caused by the noises and model perturbations, we can use multiple agents to address this problem. The consensus/fusion algorithm is given above, which is similar to the continuous system. Therefore, we will not introduce it any more.

Remark 4.

The GP has been widely applied in various real-world applications such as quadrotor tracking, 3D people tracking, localization and mapping, and control-based application models. These applications have attracted much attention from engineers and researchers. As for the limitations, in our opinion, the first is that the model needs real-world data to achieve perfect training and application. The second is that the dynamics are Gaussian distributed or Gaussian-approximate distributed.

5. Simulations

To evaluate the performance and to verify the effectiveness of the proposed algorithms, this section provides two trajectory tracking examples, where one is the trajectory tracking of a robotic manipulator and the other one is the trajectory tracking of an unmanned quadrotor. All simulations are conducted on a computer with 2.6 GHz Intel(R) Core(TM) i7-5600U CPU and MATLAB R2015b.

5.1. Trajectory Tracking of Robotic Manipulator

First, we consider the trajectory of a Puma 560 robot arm manipulator in

x

-

y

-

z

plane with 6 degrees of freedom (DoFs), which is shown in Figure 2. The Puma 560 robot was designed to have approximately the dimensions and reach of a human worker. It also had a spherical joint at the wrist, just as humans have. Roboticists use like waist, shoulder, elbow, and wrist when describing serial link manipulators. For the Puma, these terms correspond respectively to joints 1, 2, 3, and 4–6, which is shown in Figure 2.

Figure 2. The diagram of the Puma 560 robot arm manipulator (6 DoFs).

For the considered robot arm,

τ 1

,

τ 2

, and

τ 3

are the control torques of the motors controlling the joint angles

ϕ

,

θ

,

ψ

. The trajectory of the robotic manipulator can be controlled by these torques. The motion can be described by the following Lagrangian system [1]

H (q) \ddot{q} + C (q, \dot{q} + G (q) + κ (\overset{˘}{q}) = τ,

(34)

where

q

denotes the generalized coordinates with their time derivatives

\dot{q}

,

\ddot{q}

and

τ

denotes the generalized input.

H

is the mass matrix,

C

is the Coriolis matrix and

G

is the potential energy matrix. An additional unknown dynamic

κ (\overset{˘}{q})

(train to obtain its form), which depends on

\overset{˘}{q} = {[{\ddot{q}}^{T}, {\dot{q}}^{T}, q^{T}]}^{T}

, affects the system as a generalized force, in which one can refer to [2] for details. The process methodology is illustrated in Figure 3.

Figure 3. The process methodology and flowchart.

Tracking error is the error between the actual value of joint angle or velocity with the desired values

\begin{array}{l} \tilde{q} = [\begin{matrix} ϕ - ϕ_{d} \\ θ - θ_{d} \\ ψ - ψ_{d} \end{matrix}] = q (t) - q_{d} (t), \\ \dot{\tilde{q}} = \dot{q} (t) - {\dot{q}}_{d} (t) . \end{array}

(35)

The following controllers are tested. (1) Computed torque (CT) controller:

τ_{i n} = H (q) {\ddot{q}}_{d} + C (q, \dot{q}) {\dot{q}}_{d} + G (q) - K_{p} (\tilde{q}) - K_{d} (\dot{\tilde{q}})

. The gains for this controller is

K_{p}

= 50 and

K_{d}

= 40. (2) The proposed PD controller (24): the composite error is

S = \dot{\tilde{q}} +

λ

\tilde{q}

, a reference velocity

{\dot{q}}_{r}

is

{\dot{q}}_{r} = {\dot{q}}_{d} - λ \tilde{q} \tilde{q} = q (t) - q_{d} (t)

, and the control torque is

τ_{i n} = H (q) {\dot{q}}_{r} + C (q, \dot{q}) {\dot{q}}_{r} + G (q) - K_{p} (\tilde{q}) - K_{d} S

. The gains for this controller are λ = 30 and

K_{d}

= 20. (3) The adaptive controller: the control torque is

τ_{i n}

=

Y_{0} + Y_{1} {\hat{m}}_{3} + Y_{2} {\hat{I}}_{x x_{3}} + Y_{3} {\hat{I}}_{y y_{3}} + Y_{4} {\hat{I}}_{z z_{3}} - K_{d} S

, where

{\hat{m}}_{3}

,

{\hat{I}}_{x x_{3}}

,

{\hat{I}}_{y y_{3}}

,

{\hat{I}}_{z z_{3}}

are the unknown parameters.

Y_{0}

Y_{1}

Y_{2}

Y_{3}

Y_{4}

are called the regressor vectors defined as [70].

{\hat{m}}_{3}

=

- (S^{T} Y_{1}) ∕ γ_{1}

,

{\hat{I}}_{x x_{3}}

=

- (S^{T} Y_{2}) ∕ γ_{2}

,

{\hat{I}}_{y y_{3}}

=

- (S^{T} Y_{3}) ∕ γ_{3}

,

{\hat{I}}_{z z_{3}}

=

- (S^{T} Y_{4}) ∕ γ_{4}

, where

γ_{1}

,

γ_{2}

,

γ_{3}

,

γ_{4}

are controller gains, in addition to

γ_{1}

= 50,

γ_{2}

= 30,

γ_{3}

= 20,

γ_{4}

= 50, and

K_{d}

= 20.

Data. Speeds: 5 s, 10 s, 15 s, and 20 s completion times; 4 paths × 4 speeds with 16 different trajectories; 15 loads (0.2 kg…3.0 kg), various shapes and sizes; 10 agents. Training Data. One desired trajectory common to handling of all loads; one trajectory has no data for any context; sixteen unique training trajectories, one for each load. Test Data. Interpolation data sets for testing on reference trajectory and the unique trajectory for each load. Extrapolation data sets for testing on all trajectories.

From Figure 4, it can be seen that the PD controller action is able to hold the position of the robot arm at the desired joint angles. λ = 30 and

K_{d}

= 20 are the gains associated with holding the respective positions necessary. The convergence of the plots is achieved in about 10 s. The first couple runs of the robot can be used for tuning the robot, and the robot should have good repeatability after that. In addition, from the position and velocity plots of a computed torque controller (Figure 5) and adaptive controller (Figure 6), it is observed that both controllers are able to achieve convergence of parameters to the desired values. However, the proposed PD torque controller has a quicker convergence time and also has lesser gains compared to the computed torque controller. The error results from Table 1 further verify the effectiveness. Therefore, the proposed PD torque controller is better for the considered application. In addition, we can also obtain that the distributed GP can effectively eliminate the uncertainty and disturbance caused by the system model and the noises.

Figure 4. The proposed PD controller: (a) Position Plot; (b) Velocity Plot.

Figure 5. The CT controller: (a) Position Plot; (b) Velocity Plot.

Figure 6. The Adaptive controller: (a) Position Plot; (b) Velocity Plot.

Table 1. Mean absolute error.

To further compare with the multiple agent processing methods, these approaches are tested: (1) Independent GP (IGP): model trained independently for each input [6]; (2) Combined GP (CGP): one agent to train GP by combining data across inputs [34]; (3) Proposed distributed GP with BIC (Bayesian Information Criterion) criterion. The training results (interpolation and extrapolation manners) of NMSE (normalized mean square error) with regard to the number of training data points are demonstrated in Figure 7. Note that IGP and CGP, i.e., existing GP, are centralized methods, which are different from the proposed GP, which is a distributed method for training. From Figure 7, the first line displays the training results of the interpolation manner for the three methods. As we can see from it, for joint 1, the proposed distributed GP achieves the best performance for any number of training points. For joint four and joint six, the performance of the proposed GP is close to IGP and also better than CGP when the training points increase. The second line displays the training results of the extrapolation manner for the three methods. As we can see from it, for joint 1, the proposed distributed GP achieves the best performance for small numbers of training points (<500). However, when training points are increased further, the CGP is better than the proposed distributed GP (note that they are very close). For joint four and joint six, the performance of the proposed GP is close to IGP and also better than CGP when the training points increase. To sum up, the proposed distributed GP model can reach the performance of the centralized method and is close to (even outperforms) the existing state-of-the-art centralized-based multiple combined and multi-task methods.

Figure 7. Training results using different methods.

5.2. Trajectory Tracking of an Unmanned Quadrotor

This section tests the proposed distributed GP-based model predictive control (GPMPC) of an unmanned quadrotor. The trajectory of an unmanned quadrotor is generated by a discrete-time Euler–Lagrange dynamical system [2]. The goal is to track its positions (X, Y, Z) and Euler angles (

ϕ

,

θ

,

ψ

). To compare with the state-of-the-art controllers, the efficient MPC (EMPC) [71] and the efficient nonlinear MPC (ENMPC) [72] are also tested in the simulations. The parameters are selected as

L

= 5 and

Q = R = d i a g (1, 1, 1)

.

In the first scenario, the unmanned quadrotor tracks a “Lorenz” trajectory with Gaussian white noise (zero mean and unit variance), which is shown in Figure 8. To train the system model, we use the efficient MPC design proposed in [71]. One hundred seventy measurements, states, and controls are used to train the GP. The datum from the rotational system is with the range [0, 1], the angle

φ

is with a range

[- 1.6, 1.6]

, and the input is with the range

[- 4 \times 10^{- 8}, 7 \times 10^{- 8}]

. The training of GP takes 5 s, and we use 10 agents to train. The values of mean squared error (MSE) trained by GP are very small. The mean squared error (MSE) for the positions is

4.3618 \times 10^{4}

close to the stable GPMPC (GPMPC1) [73] with

4.3498 \times 10^{4}

; MSE for the angels is

1.5743 \times 10^{8}

also close to GPMPC1 with

4.3030 \times 10^{9}

. This indicates that the proposed distributed GP (GPMPC2) is efficient and well-trained, which is illustrated in Figure 9 (with different confidences). Note that the stable GPMPC (GPMPC1) is the most recent best method at present and is also a method for one agent. Therefore, the training results are very close, which indicates that the proposed distributed GP can achieve a good training performance.

Figure 8. Lorenz trajectory tracking.

Figure 9. Training performance with 60%, 80%, and 90% confidence: (a)

Y_{1} (k)

; (b)

Y_{2} (k)

.

The positions and attitudes tracking results are demonstrated in Figure 10, and the tracking errors are displayed in Figure 11 and Table 2. As we can see from Figure 10 and Figure 11, the proposed distributed GP can learn the system model well and track the trajectories with high precision, which is close to the state-of-the-art controllers. This also indicates that as long as the training sets are introduced, we can track the trajectory without model knowledge (model-free), i.e., the proposed GP can learn the system model well. In addition, as long as multiple agents are introduced, the model uncertainties and noise disturbances can be eliminated and suppressed.

Figure 10. Tracking results: (a) Positions tracking; (b) Attitudes tracking.

Figure 11. Tracking errors: (a) Positions errors; (b) Attitudes errors.

Table 2. Training errors and tracking errors.

Furthermore, the covariance on positions and attitudes by different GP models (the stable GPMPC (GPMPC1) [69] and the proposed distributed GPMPC (GPMPC2)) is displayed in Figure 12. This indicates that the proposed distributed GP can also reach the performance of the state-of-the-art GP model.

Figure 12. Covariance results: (a) Positions covariance; (b) Attitudes covariance.

In the second scenario, the unmanned quadrotor tracks an “Elliptical” trajectory with Gaussian white noise (zero mean and unit variance), which is shown in Figure 13. The tracking performance and the tracking errors are shown in Figure 14 and Figure 15 and Table 3. From Figure 13, Figure 14 and Figure 15, we can also see that the proposed distributed GP can learn the trajectory model effectively, which is very close to the desired reference trajectory and is close to the state-of-the-art controllers. The covariance results by GPMPC1 and GPMPC2 are shown in Figure 16, which further verifies the effectiveness of the proposed distributed GP.

Figure 13. Elliptical trajectory tracking.

Figure 14. Tracking results: (a) Positions tracking; (b) Attitudes tracking.

Figure 15. Tracking errors: (a) Positions errors; (b) Attitudes errors.

Table 3. Training errors and tracking errors.

Figure 16. Covariance results: (a) Positions covariance; (b) Attitudes covariance.

6. Conclusions

This paper used the Gaussian process to learn the trajectory model, and a distributed GP-based model learning strategy was proposed. For the continuous- and discrete-time system, we, respectively, designed a GP-based PD controller and a GP-based MPC controller to address the problem. To address the uncertainties of the model and the disturbances of the noises, a distributed multiple-agent system was used to train the model. In addition, since data-driven algorithms needed a large number of training sets, the distributed GP model could also be employed to address this problem by using a Kullback–Leibler average consensus fusion criterion.

The proposed GP can solve the actual model-free problem as long as the training data sets are given. Since the considered multi-agent is interconnected and it is only used to eliminate the uncertainties of the model and disturbances of the noises, future research mainly focuses on the efficiency of distributed Gaussian process and the robustness of the multi-agent network. In the future, we will focus on the application deployment of an unmanned aerial vehicle (UAV) and its usage in UAV detection and location. UAV racing is a challenging problem to overcome.

Author Contributions

Conceptualization, L.S. and D.X.; methodology, D.X.; software, D.X.; validation, D.X.; formal analysis, L.S. and D.X.; investigation, L.S. and D.X.; resources, D.X.; data curation, D.X.; writing—original draft preparation, D.X.; writing—review and editing, L.S. and D.X.; supervision, L.S.; funding acquisition, L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Shaanxi Provincial Fund under Grant 2020JM-185 and the National Natural Science Foundation of China under grant 62171338.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

Proof.

First, we prove the Lipschitz constant of

μ (\cdot)

and the modulus of continuity of

\sum (\cdot)

. At two different states

x

and

x^{'}

, the norm of the difference of

μ (\cdot)

is

∥ μ (x) - μ (x^{'}) ∥ = ∥ (K_{x X} - K_{x^{'} X}) α ∥

with

α = {(K_{X X} + σ^{2} I)}^{- 1} y .

Then, based on the Lipschitz continuity of the kernel and the Cauchy–Schwarz inequality, we have

∥ μ (x) - μ (x^{'}) ∥ \leq L_{k} \sqrt{N} ∥ α ∥ ∥ x - x^{'} ∥,

which proves that

µ

is Lipschitz continuous.

Similarly, we have

∥ Σ (x) - Σ (x^{'}) ∥ \leq 2 L_{k} ∥ x - x^{'} ∥ + ∥ K_{x X} - K_{x' X} ∥ ∥ {(K_{X X} + σ^{2} I)}^{- 1} ∥ ∥ K_{X x} + K_{{X x}^{'}} ∥ .

Due to the Lipschitz continuity of kernel

k

(also

K

), we have

∥ K_{x X} - K_{x^{'} X} ∥ \leq L_{k} \sqrt{N} ∥ x - x^{'} ∥, and ∥ K_{X x} + K_{{X x}^{'}} ∥ \leq 2 \sqrt{N} m a x k (x, x^{'})

Therefore, the continuous modulus

ω_{\sum}

can be obtained by combing the above equations and taking the square root.

Finally, we prove the probabilistic uniform error bound. According to [74], for every grid

X_{τ}

with the

| X_{τ} |

-th number of grid points and

\max_{x \in X} \min_{x^{'} \in X_{τ}} ∥ x - x^{'} ∥ \leq τ, \forall . x \in X_{τ},

it follows that

‖ f (x) - μ (x^{'}) ‖ \leq \sqrt{β (τ) \sum (x)},

(36)

with a probability of at least

1 - | X_{τ} | e^{- β (τ) ∕ 2}

. Then, set

β (τ) = 2 \log (\frac{| X_{τ} |}{δ}),

then (36) holds with a probability of at least

1 - δ

. Furthermore, since

f

,

µ

, and

\sum

are continuous,

\forall x \in X

,

\forall x^{'} \in X_{τ}

, it follows

τ L_{f} \geq m i n ∥ f (x) - f (x^{'}) ∥, τ L_{μ} \geq m i n ∥ μ (x) - μ (x^{'}) ∥ and ω_{Σ (x)} \geq m i n ∥ \sqrt{Σ (x)} - \sqrt{Σ (x^{'})} ∥

. In addition, based on [74], the minimum number of grid points is denoted by the covering number

M (τ, X)

. Therefore,

\forall x \in X

it follows that

p (∥ f (x ∥) - μ (x) \leq γ (τ) + \sqrt{β (τ) \sum^{} (x)}) \geq 1 - δ

where

γ (τ) = (L_{μ} + L_{f}) τ + \sqrt{β (τ)} ω_{\sum^{} (τ)} and β (τ) = 2 \log (\frac{M (τ, X)}{δ})

. This concludes the proof. □

Appendix B. Proof of Theorem 2

Proof.

According to Theorem 1, given

β_{N} (τ) = 2 l o g (\frac{M (τ, x) π^{2} N^{2}}{3 δ})

and

N > 0

, it follows that

\sup_{x \in X} | f (x) - μ_{N} (x) | \leq γ_{N} (τ) + \sqrt{β_{N} (τ) \sum_{N} (x)} .

(37)

with a probability of at least

1 - δ / 2

. In addition, given the distance

r = \underset{x, x^{'} \in X}{m a x} ∥ x - x^{'} ∥,

we can obtain a trivial bound

M (τ, X) \leq {(1 + \frac{r}{τ})}^{n} .

Then, we can obtain that

β_{N} (τ) \leq 2 n \log (1 + \frac{r}{τ}) + 4 \log π N - 2 \log 3 δ .

On the one hand, Lipschitz constant

L_{µ}

is bounded by

L_{μ} \leq L_{k} \sqrt{N} {(K_{X_{N} X_{N}} + σ^{2} I)}^{- 1} y_{N} .

On the other hand, since f is bounded by

∥ f^{m a x} ∥

and

K_{X_{N} X_{N}}

is positive semi-definite,

∥ {(K_{X_{N} X_{N}} + σ^{2} I)}^{- 1} y_{N} ∥

is bounded by

∥ {(K_{X_{N} X_{N}} + σ^{2} I)}^{- 1} y_{N} ∥ \leq \frac{∥ y_{N} ∥}{ρ {(K_{X_{N} X_{N}} + σ^{2} I)}_{m i n} \frac{\sqrt{N} f^{m a x} + ∥ φ_{N} ∥}{σ^{2}}}

where vector

φ_{N}

composed of

N

variables is a Gaussian disturbance with zero mean and covariance

σ^{2}

. This indicates that

\frac{∥ φ_{N} ∥^{2}}{σ^{2}}

obeys a chi-square distribution, i.e.,

\frac{∥ φ_{N} ∥^{2}}{σ^{2}} \sim χ_{N}^{2} .

Note that with a probability of at least

1 - η_{N}

we have

∥ φ_{N} ∥^{2} \leq (2 \sqrt{N η_{N}} + 2 η_{N} + N) σ^{2} .

Then, by using the union bounds over all

N > 0

and setting

η_{N} = \log (\frac{π^{2} N^{2}}{3 δ}),

we can obtain that

{∥ (K_{X_{N} X_{N}} + σ^{2} I_{N})}^{- 1} y_{N} ∥ \leq \frac{\sqrt{N} ∥ f^{m a x} ∥ + \sqrt{2 \sqrt{N η_{N}} + 2 η_{N} + N} σ}{σ^{2}}

with a probability of at least

1 - δ ∕ 2

. Therefore, the Lipschitz constant

L_{μ}

of the posterior mean function

u_{N} (\cdot)

has

L_{μ} \leq \frac{\sqrt{N} ∥ f^{m a x} ∥ + \sqrt{N (2 \sqrt{N η_{N}} + 2 η_{N} + N)} σ}{σ^{2}}

In addition, since ηN is logarithmically increased with the number of training samples

N

, it follows that

L_{μ} \in O (N)

with a probability of at least

1 - δ ∕ 2

.

Furthermore, based on (17) and

∥ {(K_{X_{N} X_{N}} + σ^{2} I_{N})}^{- 1} ∥

, we can bound the modulus of continuity

ω_{\sum_{N}} (\cdot)

as

ω_{\sum (τ)} \leq \sqrt{2 τ L_{k} (1 + \frac{N \underset{x, x^{'} \in X}{m a x} k (x, x^{'})}{σ^{2}})}

According to (37), the uniform bound holds with a probability of at least

1 - δ

with

γ_{N} (τ) \leq \sqrt{2 τ L_{k} β_{N} (τ) (1 + \frac{N \underset{x, x^{'} \in X}{m a x} k (x, x^{'})}{σ^{2}})} + L_{f} τ + L_{k} \frac{\sqrt{N} ∥ f^{m a x} ∥ + \sqrt{N (2 \sqrt{N η_{N}} + 2 η_{N} + N)} σ}{σ^{2}}

Note that if the error is designed to vanish, the above equation should be guaranteed convergence to 0 as

N \to \infty

. This indicates that

τ (N)

decreases faster than

O ({(Nlog (N))}^{- 1})

. Therefore, we can set

τ (N) \in O (N^{- 2})

, then we can obtain that

\underset{N \to \infty}{l i m} γ_{N} (τ_{N}) = 0

. Furthermore, this indicates that

β_{N} (τ (N)) \in O (\log (N))

. Therefore, there exists an

ϵ > 0

such that

\sum_{N} (x) \in O (\log {(N)}^{- \frac{1}{2} - ϵ})

, it follows that

\sqrt{β_{N} (τ (N)) \sum_{N} (x)} \in O (\log {(N)}^{- ϵ})

, which concludes the proof. □

Appendix C. Proof of Theorem 3

Proof.

According to Lemma 2 and 3, and recalling that the noise w is the stationary Gaussian process, we use the following Lyapunov candidate

\dot{V} (x) = 1 / 2 r^{2}

.

\forall | r | > \frac{f (x) - μ_{N} (x)}{k_{c}}

, it follows that

\dot{V} (x) = \frac{\partial V}{\partial r} \dot{r} = r (f (x) - \hat{f} (x) - k_{c} r) \leq | r | | f (x) - μ_{N} (x) | - k_{c} {| r |}^{2} \leq 0

. According to Theorem 1, the model error is bounded with probability

p (\dot{V} (x) < 0) \geq 1 - δ

. According to Lemma 3, we can obtain the global ultimate boundedness. □

References

Beckers, T.; Umlauft, J.; Kulic, D.; Hirche, S. Stable Gaussian process based tracking control of Lagrangian systems. In Proceedings of the 2017 IEEE 56th Annual Conference on Decision and Control (CDC), Melbourne, Australia, 12–15 December 2017; pp. 5180–5185. [Google Scholar]
Corke, P.I.; Khatib, O. Robotics, Vision and Control: Fundamental Algorithms in MATLAB; Springer: Berlin/Heidelberg, Germany, 2011; Volume 73. [Google Scholar]
Wang, D.; Mu, C. Adaptive-critic-based robust trajectory tracking of uncertain dynamics and its application to a spring-mass-damper system. IEEE Trans. Ind. Electron. 2018, 65, 654–663. [Google Scholar] [CrossRef]
Choi, J.; Jung, J.; Park, I. Area-efficient approach for generating quantized gaussian noise. IEEE Trans. Circuits Syst. I Regul. Pap. 2016, 63, 1005–1013. [Google Scholar] [CrossRef]
Khansari-Zadeh, S.M.; Billard, A. Learning stable nonlinear dynamical systems with Gaussian mixture models. IEEE Trans. Robot. 2011, 27, 943–957. [Google Scholar] [CrossRef]
Choi, J. Data-aided sensing for Gaussian process regression in iot systems. IEEE Internet Things 2021, 8, 7717–7726. [Google Scholar] [CrossRef]
Diaz-Rozo, J.; Bielza, C.; Larranaga, P. Clustering of data streams with dynamic Gaussian Mixture Models: An IoT application in industrial processes. IEEE Internet Things J. 2018, 5, 3533–3547. [Google Scholar] [CrossRef]
Sheng, H.; Xiao, J.; Cheng, Y.; Ni, Q.; Wang, S. Short-term solar power forecasting based on weighted Gaussian process regression. IEEE Trans. Ind. Electron. 2018, 65, 300–308. [Google Scholar] [CrossRef]
Wen, Y.; Li, G.; Wang, Q.; Guo, X.; Cao, W. Modeling and analysis of permanent magnet spherical motors by a multi-task Gaussian process method and finite element method for output torque. IEEE Trans. Ind. Electron. 2021, 68, 8540–8549. [Google Scholar] [CrossRef]
Jin, X. Fault tolerant nonrepetitive trajectory tracking for mimo output constrained nonlinear systems using iterative learning control. IEEE Trans. Cybern. 2019, 49, 3180–3190. [Google Scholar] [CrossRef] [PubMed]
Fedele, G.; D’Alfonso, L. A kinematic model for swarm finite-time trajectory tracking. IEEE Trans. Cybern. 2019, 49, 3806–3815. [Google Scholar] [CrossRef]
Wilson, A.G.; Knowles, D.A.; Ghahramani, Z. Gaussian process regression networks. In Proceedings of the 29th International Conference on Machine Learning, Edinburgh, UK, 26 June–1 July 2012. [Google Scholar]
Pillonetto, G.; Schenato, L.; Varagnolo, D. Distributed multi-agent gaussian regression via finite-dimensional approximations. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 2098–2111. [Google Scholar] [CrossRef]
Varagnolo, D.; Pillonetto, G.; Schenato, L. Distributed parametric and nonparametric regression with on-line performance bounds computation. Automatica 2012, 48, 2468–2481. [Google Scholar] [CrossRef]
Krivec, T.; Papa, G.; Kocijan, J. Simulation of variational Gaussian process NARX models with GPGPU. ISA Trans. 2021, 109, 141–151. [Google Scholar] [CrossRef]
Aman, K.; Kocijan, J. Application of Gaussian processes for black-box modelling of biosystems. ISA Trans. 2007, 46, 443–457. [Google Scholar] [CrossRef] [PubMed]
Hensman, J.; Durrande, N.; Solin, A. Variational fourier features for Gaussian processes. J. Mach. Learn. Res. 2017, 18, 5537–5588. [Google Scholar]
Damianou, A.C.; Titsias, M.K.; Lawrence, N.D. Variational inference for latent variables and uncertain inputs in Gaussian processes. J. Mach. Learn. Res. 2016, 17, 1425–1486. [Google Scholar]
Meng, Z.; Lin, Z.; Ren, W. Robust cooperative tracking for multiple non-identical second-order nonlinear systems. Automatica 2013, 49, 2363–2372. [Google Scholar] [CrossRef]
Pu, S.; Yu, X.; Li, J. Distributed Kalman filter for linear system with complex multichannel stochastic uncertain parameter and decoupled local filters. Int. J. Adapt. Control. Signal Process. 2021, 35, 1498–1512. [Google Scholar] [CrossRef]
Yu, X.; Li, J. Adaptive Kalman filtering for recursive both additive noise and multiplicative noise. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 1634–1649. [Google Scholar] [CrossRef]
Huang, Y.; Meng, Z. Bearing-based distributed formation control of multiple vertical take-off and landing UAVs. IEEE Trans. Control. Netw. Syst. 2021, 8, 1281–1292. [Google Scholar] [CrossRef]
Yang, T.; Yi, X.; Wu, J.; Yuan, Y.; Wu, D.; Meng, Z.; Hong, Y.; Wang, H.; Lin, Z.; Johansson, K.H. A survey of distributed optimization. Annu. Rev. Control. 2019, 47, 278–305. [Google Scholar] [CrossRef]
Li, X.; Caimou, H.; Haoji, H. Distributed filter with consensus strategies for sensor networks. J. Appl. Math. 2013, 2013, 683249. [Google Scholar] [CrossRef]
Zhou, T. Coordinated one-step optimal distributed state prediction for a networked dynamical system. IEEE Trans. Autom. Control. 2013, 58, 2756–2771. [Google Scholar] [CrossRef]
Battistelli, G.; Chisci, L. Kullback-Leibler average, consensus on probability densities, and distributed state estimation with guaranteed stability. Automatica 2014, 50, 707–718. [Google Scholar] [CrossRef]
Umlauft, J.; Lederer, A.; Hirche, S. Learning stable Gaussian process state space models. In Proceedings of the 2017 American Control Conference (ACC), Seattle, DC, USA, 24–26 May 2017; pp. 1499–1504. [Google Scholar]
Jagtap, P.; Pappas, G.J.; Zamani, M. Control barrier functions for unknown nonlinear systems using Gaussian processes. In Proceedings of the 2020 59th IEEE Conference on Decision and Control (CDC), Jeju Island, Korea, 14–18 December 2020; pp. 3699–3704. [Google Scholar]
Pöhler, L.D.; Umlauft, J.; Hirche, S. Uncertainty-based Human Motion Tracking with Stable Gaussian Process State Space Models. IFAC-Pap. 2019, 51, 8–14. [Google Scholar] [CrossRef]
Umlauft, J.; Pöhler, L.D.; Hirche, S. An uncertainty-based control Lyapunov approach for control-affine systems modeled by Gaussian process. IEEE Control. Syst. Lett. 2018, 2, 483–488. [Google Scholar] [CrossRef]
Lederer, A.; Umlauft, J.; Hirche, S. Uniform error bounds for Gaussian process regression with application to safe control. Adv. Neural Inf. Process. Syst. 2019, 32, 659–669. [Google Scholar]
Deisenroth, M.; Ng, J.W. Distributed Gaussian processes. In Proceedings of the 32nd International Conference on Machine Learning, Lille, France, 7–9 July 2015; pp. 1481–1490. [Google Scholar]
Xie, A.; Yin, F.; Xu, Y.; Ai, B.; Chen, T.; Cui, S. Distributed Gaussian processes hyperparameter optimization for big data using proximal ADMM. IEEE Signal Processing Lett. 2019, 26, 1197–1201. [Google Scholar] [CrossRef]
Bonilla, E.V.; Chai, K.M.; Williams, C. Multi-task Gaussian process prediction. In Proceedings of the Advances in Neural Information Processing Systems 20 (NIPS 2007), Vancouver, BC, Canada, 3–5 December 2008; pp. 153–160. [Google Scholar]
Alvarez, M.; Lawrence, N.D. Sparse convolved Gaussian processes for multi-output regression. In Proceedings of the Advances in Neural Information Processing Systems 21 (NIPS 2008), Vancouver, BC, Canada, 8 December 2009. [Google Scholar]
Gal, Y.; van der Wilk, M.; Rasmussen, C.E. Distributed variational inference in sparse Gaussian process regression and latent variable models. In Proceedings of the Advances in Neural Information Processing Systems 27 (NIPS 2014), Montreal, Canada, 8–13 December 2014; pp. 3257–3265. [Google Scholar]
Nerurkar, E.D.; Roumeliotis, S.I.; Martinelli, A. Distributed maximum a posteriori estimation for multi-robot cooperative localization. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan, 6 July 2009; pp. 1402–1409. [Google Scholar]
Franceschelli, M.; Gasparri, A. On agreement problems with gossip algorithms in absence of common reference frames. In Proceedings of the 2010 IEEE International Conference on Robotics and Automation, Anchorage, AK, USA, 15 July 2010; pp. 4481–4486. [Google Scholar]
Cunningham, A.; Indelman, V.; Dellaert, F. DDF-SAM 2.0: Consistent distributed smoothing and mapping. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; pp. 5220–5227. [Google Scholar]
Anderson, B.D.; Shames, I.; Mao, G.; Fidan, B. Formal theory of noisy sensor network localization. SIAM J. Discret. Math. 2010, 24, 684–698. [Google Scholar] [CrossRef]
Carron, A.; Todescato, M.; Carli, R.; Schenato, L. An asynchronous consensus-based algorithm for estimation from noisy relative measurements. IEEE Trans. Control. Netw. Syst. 2014, 1, 283–295. [Google Scholar] [CrossRef]
Thunberg, J.; Montijano, E.; Hu, X. Distributed attitude synchronization control. In Proceedings of the 2011 50th IEEE Conference on Decision and Control and European Control Conference, Orlando, FL, USA, 12–15 December 2011; pp. 1962–1967. [Google Scholar]
Piovan, G.; Shames, I.; Fidan, B.; Bullo, F.; Anderson, B.D. On frame and orientation localization for relative sensing networks. Automatica 2013, 49, 206–213. [Google Scholar] [CrossRef]
Sarlette, A.; Sepulchre, R. Consensus optimization on manifolds. SIAM J. Control. Optim. 2009, 48, 56–76. [Google Scholar] [CrossRef]
Choudhary, S.; Carlone, L.; Nieto, C.; Rogers, J.; Christensen, H.I.; Dellaert, F. Distributed trajectory estimation with privacy and communication constraints: A two-stage distributed Gauss-seidel approach. In Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden, 16–21 May 2016; pp. 5261–5268. [Google Scholar]
Deisenroth, M.P.; Fox, D.; Rasmussen, C.E. Gaussian processes for data-efficient learning in robotics and control. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 408–423. [Google Scholar] [CrossRef] [PubMed]
Robinson, D.R.; Mar, R.T.; Estabridis, K.; Hewer, G. An efficient algorithm for optimal trajectory generation for heterogeneous multi-agent systems in non-convex environments. IEEE Robot. Autom. Lett. 2018, 3, 1215–1222. [Google Scholar] [CrossRef]
Wang, J.M.; Fleet, D.J.; Hertzmann, A. Gaussian process dynamical models for human motion. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 30, 283–298. [Google Scholar] [CrossRef]
Negenborn, R.R.; Maestre, J.M. Distributed model predictive control: An overview and roadmap of future research opportunities. IEEE Control. Syst. Mag. 2014, 34, 87–97. [Google Scholar]
Stewart, B.T.; Wright, S.J.; Rawlings, J.B. Cooperative distributed model predictive control for nonlinear systems. J. Process Control. 2011, 21, 698–704. [Google Scholar] [CrossRef]
Ferramosca, A.; Limón, D.; Alvarado, I.; Camacho, E.F. Cooperative distributed MPC for tracking. Automatica 2013, 49, 906–914. [Google Scholar] [CrossRef]
Conte, C.; Jones, C.N.; Morari, M.; Zeilinger, M.N. Distributed synthesis and stability of cooperative distributed model predictive control for linear systems. Automatica 2016, 69, 117–125. [Google Scholar] [CrossRef]
Groß, D.; Stursberg, O. A cooperative distributed MPC algorithm with event-based communication and parallel optimization. IEEE Trans. Control. Netw. Syst. 2015, 3, 275–285. [Google Scholar] [CrossRef]
Alrifaee, B.; Heßeler, F.J.; Abel, D. Coordinated non-cooperative distributed model predictive control for decoupled systems using graphs. IFAC-Pap. 2016, 49, 216–221. [Google Scholar]
Alonso, C.A.; Matni, N. Distributed and localized closed loop model predictive control via system level synthesis. In Proceedings of the 2020 59th IEEE Conference on Decision and Control (CDC), Jeju, Korea, 14–18 December 2020; pp. 5598–5605. [Google Scholar]
Alonso, C.A.; Matni, N.; Anderson, J. Explicit distributed and localized model predictive control via system level synthesis. In Proceedings of the 2020 59th IEEE Conference on Decision and Control (CDC), Jeju, Korea, 14–18 December 2020; pp. 5606–5613. [Google Scholar]
Luis, C.E.; Schoellig, A.P. Trajectory generation for multiagent point-to-point transitions via distributed model predictive control. IEEE Robot. Autom. Lett. 2019, 4, 375–382. [Google Scholar] [CrossRef]
Torrente, G.; Kaufmann, E.; Föhn, P.; Scaramuzza, D. Data-driven MPC for quadrotors. IEEE Robot. Autom. Lett. 2021, 6, 3769–3776. [Google Scholar] [CrossRef]
Liu, D.; Tang, M.; Fu, J. Robust adaptive trajectory tracking for wheeled mobile robots based on Gaussian process regression. Syst. Control. Lett. 2022, 163, 105210. [Google Scholar] [CrossRef]
Akbari, B.; Zhu, H. Tracking Dependent Extended Targets Using Multi-Output Spatiotemporal Gaussian Processes. IEEE Trans. Intell. Transp. Syst. 2022, 23, 18301–18314. [Google Scholar] [CrossRef]
Hidalgo-Carrió, J.; Hennes, D.; Schwendner, J.; Kirchner, F. Gaussian process estimation of odometry errors for localization and mapping. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore, 29 May–3 June 2017; pp. 5696–5701. [Google Scholar]
Brossard, M.; Bonnabel, S. Learning wheel odometry and IMU errors for localization. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; pp. 291–297. [Google Scholar]
Nguyen, T.V.; Bonilla, E.V. Collaborative multi-output Gaussian processes. In Proceedings of the UAI’14: Thirtieth Conference on Uncertainty in Artificial Intelligence, Citeseer, Quebec City, QC, Canada, 23–27 July 2014; pp. 643–652. [Google Scholar]
Carron, A.; Todescato, M.; Carli, R.; Schenato, L.; Pillonetto, G. Multi-agents adaptive estimation and coverage control using Gaussian regression. In Proceedings of the 2015 European Control Conference (ECC), Linz, Austria, 15–17 July 2015; pp. 2490–2495. [Google Scholar]
Mallasto, A.; Feragen, A. Learning from uncertain curves: The 2-wasserstein metric for gaussian processes. In Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Rasmussen, C.E.; Williams, C.K. Gaussian Processes for Machine Learning. In Adaptive Computation and Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
Khalil, H.K. Nonlinear Systems: International Edition. Bull. Am. Acad. Arts Sci. 2002, 53, 20–24. [Google Scholar]
Umlauft, J.; Hirche, S. Feedback linearization based on Gaussian processes with event triggered online learning. IEEE Trans. Autom. Control. 2020, 65, 4154–4169. [Google Scholar] [CrossRef]
Zhou, B.; Gao, L.; Dai, Y.H. Gradient methods with adaptive step-sizes. Comput. Optim. Appl. 2006, 35, 69–86. [Google Scholar] [CrossRef]
Ivanov, S.E.; Zudilova, T.; Voitiuk, T.; Ivanova, L.N. Mathematical modeling of the dynamics of 3-DOF robot-manipulator with software control. Procedia Comput. Sci. 2020, 178, 311–319. [Google Scholar] [CrossRef]
Abdolhosseini, M. Model Predictive Control of an Unmanned Quadrotor Helicopter: Theory and Flight Tests. Ph.D. Thesis, Concordia University, Montreal, QC, Canada, 2012. [Google Scholar]
Cannon, M. Efficient nonlinear model predictive control algorithms. Annu. Rev. Control. 2004, 28, 229–237. [Google Scholar] [CrossRef]
Beckers, T.; Kulic, D.; Hirche, S. Stable Gaussian process based tracking control of Euler-Lagrange systems. Automatica 2019, 103, 390–397. [Google Scholar] [CrossRef]
Srinivas, N.; Krause, A.; Kakade, S.M.; Seeger, M.W. Information-theoretic regret bounds for Gaussian process optimization in the bandit setting. IEEE Trans. Inf. Theory 2012, 58, 3250–3265. [Google Scholar] [CrossRef]

Figure 1. The flowchart of consensus/fusion.

Figure 2. The diagram of the Puma 560 robot arm manipulator (6 DoFs).

Figure 3. The process methodology and flowchart.

Figure 4. The proposed PD controller: (a) Position Plot; (b) Velocity Plot.

Figure 5. The CT controller: (a) Position Plot; (b) Velocity Plot.

Figure 6. The Adaptive controller: (a) Position Plot; (b) Velocity Plot.

Figure 7. Training results using different methods.

Figure 8. Lorenz trajectory tracking.

Figure 9. Training performance with 60%, 80%, and 90% confidence: (a)

Y_{1} (k)

; (b)

Y_{2} (k)

.

Figure 10. Tracking results: (a) Positions tracking; (b) Attitudes tracking.

Figure 11. Tracking errors: (a) Positions errors; (b) Attitudes errors.

Figure 12. Covariance results: (a) Positions covariance; (b) Attitudes covariance.

Figure 13. Elliptical trajectory tracking.

Figure 14. Tracking results: (a) Positions tracking; (b) Attitudes tracking.

Figure 15. Tracking errors: (a) Positions errors; (b) Attitudes errors.

Figure 16. Covariance results: (a) Positions covariance; (b) Attitudes covariance.

Table 1. Mean absolute error.

Controller	$ϕ$	$θ$	$ψ$	$\dot{\emptyset}$	$\dot{θ}$	$\dot{ψ}$
CT	0.0077	0.0284	0.0488	0.0135	0.0239	0.0289
Adaptive	0.0216	0.1226	0.1009	0.0378	0.0451	0.1969
The Proposed PD	0.0209	0.0205	0.1392	0.0072	0.0137	0.0349

Table 2. Training errors and tracking errors.

Training Methods\Training Errors	Positions	Attitudes
GPMPC1	4.3496 × 10⁻⁴	4.3030 × 10⁻⁹
GPMPC2	4.3618 × 10⁻⁴	1.5746 × 10⁻⁸
Controllers\Mean Absolute Errors	Positions	Attitudes
EMPC	0.0287	9.6239 × 10⁻¹¹
ENMPC	0.0050	7.6412 × 10⁻¹¹
GPMPC	0.0049	2.2407 × 10⁻¹¹

Table 3. Training errors and tracking errors.

Training Methods\Training Errors	Positions	Attitudes
GPMPC1	6.1494 × 10⁻⁸	3.8282 × 10⁻¹⁰
GPMPC2	5.1161 × 10⁻⁷	1.1889 × 10⁻⁹
Controllers\Mean Absolute Errors	Positions	Attitudes
EMPC	0.0287	0.0263
ENMPC	0.0050	1.8769 × 10⁻⁴
GPMPC	0.0049	8.4033 × 10⁻⁵

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Trajectory Modeling by Distributed Gaussian Processes in Multiagent Systems

Abstract

1. Introduction

1.1. Related Works

1.2. Contributions

1.3. Paper Structure

2. Preliminaries

2.1. Notation

2.2. Graph Theory

2.3. Gaussian Process

2.4. Kullback–Leibler Average Consensus Algorithm

2.5. Uniform Error Bounds

Asymptotic Analysis

3. Problem Formulation

4. Control Design and Analysis

4.1. Consensus

4.2. GP-Based Model Predictive Control for Discrete-Time System

5. Simulations

5.1. Trajectory Tracking of Robotic Manipulator

5.2. Trajectory Tracking of an Unmanned Quadrotor

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Proof of Theorem 1

Appendix B. Proof of Theorem 2

Appendix C. Proof of Theorem 3

References

Article Metrics

Citations

Article Access Statistics