Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors

Xie, Yichen; Li, Yuzhu; Dong, Wei

doi:10.3390/drones6120371

Open AccessArticle

Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors

by

Yichen Xie

,

Yuzhu Li

and

Wei Dong

^*

The Department of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

^*

Author to whom correspondence should be addressed.

Drones 2022, 6(12), 371; https://doi.org/10.3390/drones6120371

Submission received: 19 October 2022 / Revised: 18 November 2022 / Accepted: 20 November 2022 / Published: 22 November 2022

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Without proper treatment, a malfunctional quadrotor may bring severe consequences, e.g., becoming out of control, to the whole swarm. To tackle this problem, we develop a trust evaluations based consensus protocol. Specifically, each quadrotor in the swarm communicates with its connected neighbors, exchanging behavior predictions. By comparing the predicted and the actual behaviors of its neighbor regarding a pre-defined tolerance, each quadrotor assigns trust values to determine potentially legitimate or malfunctional companions. On this basis, an online adaptive controller adjusts each weight in the protocol corresponding to the trust evaluations designed before. We prove that, within proper tolerance, it is almost sure that the legitimate quadrotors can identify the malfunctional quadrotors through trust evaluations and ameliorate their effects on the whole system. Almost surely, the legitimate quadrotors can converge to their center in a finite time. We verify our method through MATLAB and GAZEBO. In particular, with our proposed method, the swarm system discussed in this paper is able to reach position and velocity consensus in the presence of malfunctional quadrotors.

Keywords:

quadrotor; trust; malfunction; adaptative controller; consensus protocol

1. Introduction

Quadrotors have been a topic of particular interest in recent years. They have been widely used in military, industry, and disaster rescuing, owing to their low cost, small size, and autonomous navigation capabilities. Compared with a single quadrotor, a swarm with distributed multi-agent systems can provide better robustness. To establish a well-performed swarm, the distributed consensus is a basic but crucial problem.

Distributed consensus has been widely discussed for a long time. Many works have been conducted on both the consensus itself and the application of distributed consensus. Ref. [1] provides a new algorithm that deals with link failures and channel noise. Ref. [2] discusses the topic of filtering in sensor networks. Ref. [3] provides a new time-discrete algorithm for wireless sensor networks. Refs. [4,5] investigate the consensus problem for second-order multi-agent systems. In addition, Ref. [6] proposes a relative position measurement method to reach consensus. Ref. [7] discusses the convergent speed of consensus.

However, many of these Algorithms mentioned above are based on the assumption that all agents are legitimate ones, but they are vulnerable when there are malfunctional agents occur in the system discussed in Refs. [8,9,10], which leads to the problem of resilient consensus. Resilient consensus refers to consensus in a case, where agents become malfunctional due to either physical faulty or communication failure. As malfunction can be found in many multi-agent systems such as autonomous vehicles, delivery drones, or partners in security and defense as described in Refs. [11,12,13], the resilient consensus has become a very important problem worthy of investigation.

To deal with resilient consensus, many theories and control protocols have been proposed. Ref. [14] provides a consensus protocol that enables a system of n asynchronous processes, some of them malfunctional, to reach an agreement. This article has improved the t-resilient protocol to t < n/3. Refs. [9,15] conclude a basic result that well-behaving agents can always agree upon a parameter if and only if the number of malfunctional agents is less than 1/2 of the network connectivity. Grounded on the previous knowledge, Ref. [16] proposes a distributed function calculation iterative method to overcome the weakness brought by malfunctional agents. Ref. [15] focus on local strategies that provide resilience to faults and compromised nodes. Ref. [17] proposes DP-MSR Algorithm to reach resilient consensus. Refs. [18,19] consider trust evaluation applications in consensus protocols. Ref. [20] has further successfully proved that their protocol is useful even if t = n/2 and it emphasized the interagent trust. However, much of the research is based on the hypothesis that the malfunctional agents will directly broadcast their abnormal behavior to others without detection or observation from others and these consensus protocols are simplified and not suitable for quadrotor systems.

For real quadrotors discussed in this paper, such a hypothesis is not suitable and we need to formulate a method to detect such abnormal behavior of malfunctional quadrotors. Ref. [21] chooses to utilize visual techniques to detect faulty robots but with a strictly demanding requirement that robots should be within close range, typically 50 cm. To overcome this limitation, Ref. [22] focuses on the expected behavior of other agents offline to predict other’s possible failure. However, if the other robot faces a sudden fault after broadcasting a health status, it will not be able to react in time. To reach in-time reactions, Ref. [23] analyzes the coordination data and the signal strength of neighboring agents to detect failure. However, the noise was not taken into consideration. Refs. [24,25] eliminate noise and detect failure using the PCA-EWMA method, but may lose features in data or lead to misleading indications by increasing false alarms and missed detections, and yet they might not perform well in time response situations. Therefore, Therefore, this article mainly deals with the sudden and severe situations that one quadrotor may encounter such as collisions with obstacles in Section 5.2, which could possibly lead to sensor failure or even system failure. In addition, the possible malicious attacks [20] may also be taken into our consideration. Under these situations, we need a novel method that based on trust evaluations between quadrotors in order to improve the robustness of the whole swarm system.

Therefore, motivated by existing research [20], our work extends the trust value evaluation method to an online adaptive controller which is designed for multiple quadrotors systems. Specifically, each quadrotor in the swarm will monitor its nearby quadrotors. Based on the collected information, a trust value is assigned to each connected nearby quadrotor depending on whether the latter is within or out of the established tolerance, and therefore, the trust matrix is formed as trust evaluations for the whole swarm system. On this basis, an adaptive controller will adjust the weight according to the trust matrix. Finally, simulations are performed to verify the feasibility and expandability of our proposed method.

The main contributions of this work are of three aspects: (1) Based on the proposed trust value evaluation method, legitimate quadrotors are able to identify the malfunctional quadrotor. (2) Based on the proposed trust value evaluation method and the adaptive consensus protocol, legitimate quadrotors are able to converge to the correct position and maintain velocity consensus in the presence of malfunctional quadrotors. (3) Based on the proposed trust value evaluation method, the swarm system can outperform the single-quadrotor detecting method in terms of robustness and accuracy.

This article is addressed as the following: in Section 1, we introduce the background of consensus and related works; in Section 1, we introduce the model and consensus protocol we use; in Section 3, we analyze the properties of our adaptive consensus protocols; in Section 4, we briefly illustrate a simple 2-D plane simulation utilizing the method we proposed in Section 2; in Section 5, we discuss multiple quadrotors system in realistic situations. Section 6 concludes the paper.

2. Problem Description

We consider the problem of trust evaluations for the swarm system consensus protocol, where malfunctional quadrotors exist. We firstly establish a general mathmatic frame [18,19,20] to describe the problem. The quadrotors in the system communicate with each other over a network, which in this case, is represented by an undirected graph

G = (V, E)

, where

V = {1, \dots, N}

indicates the set of node indices for agents and

E \in V \times V

indicates the set of undirected edges in this graph. We use

{i, j}

to denote the edge connecting quadrotors i and j. Since it’s an undirected graph, once quadrotor i views quadrotor j as its neighbor, j also recognizes i as its neighbor. Therefore, we donate the neighbors of i as a set

N_{i} = {j | i, j \in E}

. With the graph

G

, we introduce the trust matrix

W (t)

and we associate time-varying trust values

W_{i j} (t), i, j \in V

, which captures the value of trust i to j, in another word, to which extent, quadrotor i views quadrotor j as a legitimate quadrotor. Additionally, the matrix

W (t)

need not be symmetric. We next define

N (t)

as a time-varying weight matrix and use

N_{i j} (t)

to donate the

i j^{t h}

entry of the weight matrix, which can be later derived from the trust matrix

W (t)

. Note that

N (t)

must have nonnegative entries, and will have positive entry

N_{i j} (t)

only if quadrotor i and j are connected and i views j as a trusted neighbor. We consider the case where a subset of nodes with indices denoted by the set

M

,

M

∈V, is either malicious or malfunctional and, thus, does not reliably follow the order in the consensus protocol. The set

M

is assumed to be unknown, with cardinality |

M

|. At the same time, a quadrotor that is not malicious or malfunctional is termed legitimate, and the set of legitimate quadrotors is denoted

L

,

L

, with cardinality |

L

|.

In what follows, we use

0

to denote the zero matrix, where the dimension of

0

is to be understood from the context. A nonnegative matrix is row-stochastic if the sum of its entries in every row is equal to 1, and it is row-substochastic if the sum of its entries in every row is equal to or less than 1. We use

1

to denote the vector with all entries equal to 1. Given a vector

X

, we use

X_{i}

or

{[X]}_{i}

(when

X

has a subscript) to denote its ith entry. A similar notation is used for the entries of a matrix.

2.1. Model

We are concerned with the problem of consensus in a two-dimensional XY-plane, where each legitimate quadrotor updates its value by the following general and widely-used equations for all time t:

X (t + 1) = X (t) + V (t)

(1)

V (t + 1) = V (t) + U (t)

(2)

where

X (t)

represents the position of quadrotors at time t,

V (t)

indicates the velocity of quadrotors at time t and

U (t)

stands for the acceleration of quadrotors at time t, which is the control input in the simulation and swarm system discussed later as well. All three symbols mentioned above are n-dimensional vectors, in which n represents the number of quadrotors discussed in this article. Remark that for the 2D-plane distributed system that will be discussed later in this article, we assume that each quadrotor will maintain its own attitude. Therefore, we only need to separate the vector into two directions X and Y.

Besides, for the

U (t)

mentioned above. We choose the following equations [26] as the consensus protocol:

\begin{matrix} u_{i} (t) & = - p_{0} v_{i} (t) + p_{1} \sum_{j \in N_{i} (t)} a_{i j} [x_{j} (t) - x_{i} (t)] \\ + p_{2} \sum_{j \in N_{i} (t)} a_{i j} [v_{j} (t) - v_{i} (t)] \end{matrix}

(3)

where

a_{i j}

comes from

N (t)

,

i \in V

. We donate the

N_{0} (t)

as the original weight matrix that contains the initial weights of every quadrotor to their neighbors.

p_{0}

,

p_{1}

and

p_{2}

are user-defined parameters, which illustrate the control gain. Unlike Refs. [18,20], the consensus protocol we choose here is a more typical and more general second-order consensus protocol and suitable for swarm systems discussed in this paper.

Definition 1

(Malfunctional quadrotors:). Quadrotor i is said to be malfunctional if it does not follow the consensus protocol rule in

(1) (2) (3)

to update its value

x_{i} (t)

after some time t. To be more specific, we refer Malfunctional quadrotors to those which may deviate from the original calculated route whether resulting from communication failure, collision or hardware fault.

Let

X (t)

∈

R^{n}

denote the vector of values

x_{i} (t)

∈ R for all the quadrotors at time t.

Suppose that

Φ (t) = {(x_{1} (t), V_{1} (t) \dots x_{n} (t), v_{n} (t))}^{T}

(4)

E = [\begin{matrix} 1 & 1 \\ 0 & 1 - p_{0} \end{matrix}] F = [\begin{matrix} 0 & 0 \\ p_{1} & p_{2} \end{matrix}]

(5)

Without loss of generality, we assume that the quadrotor indices are ordered in such a way that the vector

Φ (t)

can be separated into two components: the first part of representing the legitimate quadrotors’ values

Φ_{L} (t)

, the second part of the component representing the malfunctional quadrotors’ values

Φ_{M} (t)

. In this way, the consensus dynamics become the following forms:

[\begin{matrix} Φ_{L} (t + 1) \\ Φ_{M} (t + 1) \end{matrix}] = [\begin{matrix} Λ_{L} (t) & Λ_{M} (t) \\ Θ & Ω \end{matrix}] [\begin{matrix} Φ_{L} (t) \\ Φ_{M} (t) \end{matrix}]

(6)

In addition,

Λ_{L} (t)

and

Λ_{M} (t)

are defined as follows:

Λ_{L} (t) = I_{|} L | \otimes E - (L_{|} L | (t) - N_{L} (t)) \otimes F

(7)

Λ_{M} (t) = 0 - (L_{| L | \times | M |} (t) - N_{M} (t)) \otimes F

(8)

where

N_{L} (t) \in R^{| L | \times | L |}

is the matrix multiplying the state component corresponding to the legitimate quadrotors’ values, and

N_{M} (t) \in R^{| L | \times | M |}

is the matrix multiplying the state component corresponding to the malfunctional quadrotors’ values.

I_{n}

is defined as the identity matrix. The symbol ⊗ represents the Kronecker multiplication.

L_{|} L | (t)

and

L_{| L | \times | M |} (t)

are the specific area of the Laplacian matrix of graph G. The matrices

Θ (t)

and

Ω (t)

dictate the dynamics of the malfunctional quadrotors’ values, which will not be necessary to give specific formulas.

Definition 2

(Trust evaluation

w_{i j} (t)

:). For every

i \in L

and

j \in N_{i}

, the parameter

w_{i j}

represents to which extent the quadrotor i views quadrotor j as a trustworthy neighbor at a certain time t.

We assume the availability of such observations

w_{i j}

(t) throughout this article. In the next section, we will discuss how to determine

w_{i j}

through quadrotors’ communication systems.

In this article, we assume that

w_{i j} (t) > 0

indicates quadrotor j is temporarily regarded as a trusted(legitimate) neighbor by quadrotor i at time t and vice versa. Note that

w_{i j} (t)

= 0 means at time t, quadrotor j’s role is ambiguous to quadrotor i, which will not play a role in the iteration Equations (3) and (6).

Our ultimate goal is to reach convergence under the influence of malfunctions and design available observations of trust. Toward this target, our first priority is to decide which information should be sent through networks. By

(1)

and

(2)

, we need the location of quadrotor i and the location of

j \in N_{i}

at time t to determine the velocity and the expected location at time

t + 1

. In addition, for the determination of

w_{i j}

, we need the predicted speed of quadrotor i and the location of

j \in N_{i}

at the time t.

2.2. Noise

When it comes to the real system, whether we use drones or vehicles, there must be unmodeled factors such as friction, magnetic field, and data lagging, which may lead to the deviation between the actual movement and the computational one. Therefore, we set up noise equations that simulate the real environment as follows:

x_{i t, n o i s e} = K_{0} \times α_{i t, x} + \frac{1}{2} C r h o S V^{2}

(9)

where

x_{i t, n o i s e}

indicates the instability of location caused by noise.

α_{i t, x}

is an arbitrary variable that has uniform distribution with a certain interval that the user can set up. In addition,

K_{0}

is also a user-decided parameter that reduces or enlarges the influence of noise depending on the requirement of the real system. For simplicity, we assume that for both directions of every quadrotor, the parameter is the same

α

. In addition, the second term is the basic wind disturbances, where C is the air drag coefficient,

ρ

is the density of air and S is the windward area. Noise is important in this article as it gives a hint on how to determine the trust matrix. In addition, we include the term in the later simulations to serve as disturbances that the swarm system could possibly encounter in reality.

When taking noise into consideration, the iteration Formulas

(1)

and

(2)

evolve into the following modified equations:

X_{r e f, i} (t + 1) = X_{r e f, i} (t) + V_{i} (t) + X_{i t, n o i s e}

(10)

where

X_{r e f, i} (t)

stands for the actuals position of quadrotors at time t.

2.3. The Trust Matrix and the Weight Matrix

In this subsection, firstly, we will discuss the methods to determine the trust matrix, which is obtained from the communication of quadrotors. After that, we directly derive the weight matrix.

Let

d_{(i, j)} (t)

be the difference between the distance of quadrotor i and j and that after iteration (1) and (6) has been conducted, which can be written as:

d_{(i, j)} (t) = | x_{i} (t) - x_{j} (t) - (x_{r e f, i} (t) - x_{r e f, j} (t)) |

(11)

See that the computed

x_{i} (t)

and the actual movement

x_{r e f, i} (t)

, are directly obtained by computing the following equations:

Φ (t) = (\prod_{t = 0}^{t - 1} [\begin{matrix} Λ_{L} (t) & Λ_{M} (t) \\ Θ & Ω \end{matrix}]) Φ (0)

(12)

So we consider the trust matrix entry

w_{i, j} (t)

of

W (t)

as follows:

w_{i, j} (t) = \{\begin{matrix} w_{i, j} (t - 1) + β; if d_{(i, j)} (t) \leq ζ (α) \\ w_{i, j} (t - 1) - β; if d_{(i, j)} (t) > ζ (α) \end{matrix}

(13)

where

β

is a trust value changing constant and, the equation represents the overall trust of i to j considering the information gathered from

t = 0

to

t = t

. In addition,

ζ (α)

indicates the tolerance of error that will affect trust values. The equation provides some tolerance for a quadrotor to judge its neighbor. It will not simply classify a neighboring quadrotor as a legitimate or a malfunctional one based on one prediction as the trust value must go below zero to change its neighbor’s status from legitimate to malfunctional. It views the overall performance of its neighbors so as to resist certain extent of false prediction that the disturbances may generate in the process.

We now define a time-dependent trusted neighborhood for each quadrotor i as

T_{i} (t + 1) = {j \in N_{i} : W_{i, j} (t + 1) \geq 0}

. This is the set of neighbors that legitimate quadrotor i identifies as its legitimate neighbors at time

t + 1

. Denote

N_{w i} (t + 1) = m a x {κ, | T_{i} (t + 1) | + 1} \geq 1

for all

i \in L

where,

κ

>0 can be thought of as a parameter limiting the maximum influence that the neighbors of quadrotor i are allowed to have on quadrotor i’s update values (i.e., a value of

κ

approaching infinity would preclude neighbors from influencing quadrotor i’s updated values). Using these quantities, we define the weight matrix

N (t)

by choosing its entries

N_{i j} (t)

as follows: for every

i \in L, j \in N_{i}

, we have

N_{i, j} (t + 1) = \{\begin{matrix} \frac{1}{N_{w i} (t + 1)}; if j \in T_{i} \\ 0; if j \notin T_{i} \cup {i} \\ 1 - \sum_{m \in N_{i}} N_{i m} (t), if i = j \end{matrix}

(14)

In this case, we do not consider the difference of weight between cooperative ones for simplicity, which might be improved in the future.

3. Analysis

Proposition 1.

There exists a finite time instant

T_{f} < 0

such that

N_{L} (k) = {\bar{N}}_{L}

for all

k \geq T_{f}

almost surely when

ζ (α)

and β are chosen properly.

Proof.

Let

M = \sum_{1}^{t} w_{i, j} (t)

, then by Chernoff-Hoeffding Inequality [27] TH1.1, we have the following:

\Pr [| M - E (M) > η] \leq 2 e x p (\frac{- 2 η^{2}}{\sum_{k = 1}^{t} Δ_{k}^{2}})

(15)

where

Δ = 2 β

,

η \in (0, 1 / 2)

. By the linearity of the expectation, we have

E (M) = \sum_{k = 1}^{t} E (w_{i, j} (k))

. In addition,

E (W_{i, j} (k))

can be illustrated by the following equations:

\begin{matrix} E [w_{i, j} (k)] & = \int_{0}^{ζ (α)} \frac{ζ (α)}{2 \sqrt{2} k_{0} α} \times β d α \\ + \int_{ζ (α)}^{2 \sqrt{2} k_{0} α} \frac{2 \sqrt{2} k_{0} α - ζ (α)}{2 \sqrt{2} k_{0} α} \times (- β) d α \end{matrix}

(16)

for every

k > 0

and every

i \in L

,

j \in N_{i} \cap L

.

\begin{matrix} E [w_{i, j} (k)] & = \int_{0}^{ζ (α)} \frac{ζ (α)}{2 \sqrt{2} k_{0} α} \times β d α \\ + \int_{ζ (α)}^{+ \infty} \frac{2 \sqrt{2} k_{0} α - ζ (α)}{2 \sqrt{2} k_{0} α} \times (- β) d α \end{matrix}

(17)

for every

k > 0

and every

i \in L

,

j \in N_{i} \cap M

.

The two equations above indicate to which extent the M reaches to

E (M)

as time t increases. In other words, when we choose proper

ζ (α)

and

β

, that M in (14) reaches

E (M) > 0

for every

i \in L

,

j \in N_{i} \cap L

and reaches

E (M) < 0

for every

i \in L

,

j \in N_{i} \cap M

occurs at a finite time almost surely by Borel–Cantelli Lemma. Thus, there exists a (random) finite time

T_{f} > 0

such that

N_{L} (k) = {\bar{N}}_{L}

for all

k \geq T_{f}

almost surely. □

In addition, we will give out an explicit example in the next section to prove there is such proper

ζ (α)

and

β

.

Next, we will state the convergence of legitimate agents’ values. In particular, we use the decomposition

x_{L} (t) = x_{L}^{'} (t) + X_{M}^{'} (t)

of the legitimate quadrotor states [see(4)] to formulate the following propositions.

Lemma 1

([26,28]). Let

H = p_{2} / p_{1}

,

d_{m a x}

be the max eigenvalue of graph

G

, then under the following conditions:

\frac{1 + (1 - H p 0)}{H} > p_{2} d_{m a x}, 1 < H a n d p_{0} H > 1 .

(18)

The matrix in (6) is a stochastic matrix.

Proposition 2

(Convergence of legitimate quadrotors’ values). Let

x_{L} (0)

be the vector of the initial value of legitimate agents. We have the following equations:

x_{L}^{\infty} = (\prod_{t = 0}^{\infty} Λ_{L} (t)) x_{L} (0)

(19)

Proof.

By Proposition 1, there exists a finite time instant

T_{f} < 0

such that

N_{L} (k) = {\bar{N}}_{L}

for all

k \geq T_{f}

almost surely. Thus, by Lemma 1 there exists a finite time instant

T_{f} < 0

such that

Λ_{L} (t) = {\bar{Λ}}_{L}

directly from the definition in (6). In addition, by Perron–Frobenius theorem, we have

lim_{t \to + \infty} Λ_{L} (t) = 1 v

where v is a a stochastic column vector.

Therefore, we have the following equations:

\begin{matrix} x_{L}^{\infty} (T_{0}) & = (\prod_{t = 0}^{\infty} Λ_{L} (t)) x_{L} (0) \\ = (\prod_{t = T_{f}}^{\infty} Λ_{L} (t)) (\prod_{t = 0}^{t = T_{f} - 1} Λ_{L} (t)) x_{L} (0) \\ = 1 v (\prod_{t = 0}^{t = T_{f} - 1} Λ_{L} (t)) x_{L} (0) \end{matrix}

(20)

where

(\prod_{t = 0}^{t = T_{f} - 1} Λ_{L} (t)) x_{L} (0)

is a specific finite vector. Thus, Proposition 2 has been proved. □

Proposition 3

(Convergence of malfunctional quadrotors’ values). Consider

ϕ_{M}^{\infty}

defined by

ϕ_{M}^{\infty} = \sum_{k = 0}^{\infty} (\prod_{t = k + 1}^{\infty} Λ_{L} (t)) Λ_{M} (k) x_{M} (k)

(21)

Then, the entries of this vector are all the same almost surely.

Proof.

By Proposition 1 and Lemma 1, this imply

N_{M} (k) = 0

and

Λ_{M} (k) = 0

for all

k \geq T_{f}

almost surely.

Therefore, we have the following equations:

\begin{matrix} ϕ_{M}^{\infty} & = \sum_{k = 0}^{\infty} (\prod_{t = k + 1}^{\infty} Λ_{L} (t)) Λ_{M} (k) x_{M} (k) \\ = \sum_{k = 0}^{T_{f} - 1} (\prod_{t = k + 1}^{\infty} Λ_{L} (t)) Λ_{M} (k) x_{M} (k) \\ = 1 v [\sum_{k = 0}^{T_{f} - 1} (\prod_{t = k + 1}^{m a x {T_{f}, k + 1} - 1} Λ_{L} (t)) Λ_{M} (k) x_{M} (k)] \end{matrix}

(22)

for some finite

T_{f}

, indicating that all the coordinates of the vector

ϕ_{M}^{\infty}

are identical almost surely. □

By the result of Propositions 2 and 3, we directly get the following result.

Proposition 4.

There exists, almost surely, a random variable

z (T_{0})

that the consensus algorithm converges almost surely,

lim_{t = 0 \to \infty} x_{L} (t) = z (T_{0}) 1

(23)

Proof.

The result is directly obtained from Propositions 2 and 3 and the statement that

x_{L} (t) = x_{L}^{'} (t) + x_{M}^{'} (t)

. □

4. MATLAB Simulation

In this section, we evaluate the performance of convergence of legitimate quadrotors through MATLAB. For simplicity, we set the graph

G

as a fully connected graph, and consider a system with 30 quadrotors among which there are 15 malfunctional ones. The initial location X, Y of each quadrotor is randomly chosen and the malfunctional quadrotors are also randomly chosen in order to guarantee generality. In addition, we consider the two most common cases in that the malfunctional quadrotors cannot move or in a circular movement.

4.1. Static Situation

In the first case, we discuss the situation where malfunctional quadrotors are set as static ones, which means they will remain in their initial locations and hover during the whole simulation as if they fail to take off. We set

K_{0} = 0.05

, the X and Y noise

α_{i t, x}, α_{i t, y} \in [- 0.5, 0.5]

, tolerance

ζ

(

α

) = 0.05, and we set the least trusted limitation

κ = 9

. Besides, we set

[p_{0}, p_{1}, p_{2}] = [0.1, 0.01, 0.2]

to fulfill the requirement of convergence theory.

We demonstrate the following 2D-time results and

X Y

properties:

Figure 1 demonstrates the

X Y

movement with the time of all the quadrotors in the swarm system. As mentioned in the previous conditions, the malfunctional agents won’t move so it remains in their initial positions. The legitimate ones successfully reach their consensus.

Figure 2 demonstrates the velocity and location of agents with time. As can be seen, the X and Y position converges and their velocities reach zero in the end, which fulfills the consensus requirement.

4.2. Movement Situation

In the second case, we discuss the situation where malfunctional quadrotors are set as moving ones, they will start a circular movement with radius

r_{0} = \sqrt{(x_{i}^{2} + y_{i}^{2})}

and the center is located at

(x, y) = (0, 0)

. We set

K_{0} = 0.05

, the noise

α_{i t, x}, α_{i t, y} \in [- 0.5, 0.5]

, tolerance

ζ

(

α

) = 0.05. In addition, we design

κ = 9

.

[p_{0}, p_{1}, p_{2}] = [0.1, 0.01, 0.2]

.

Figure 3 and Figure 4 demonstrate the situation that the malfunctional quadrotors rotate around the origin with the radius of their original distance with it. It’s clearly showed on the graph that the actual trajectory of each quadrotor and their correspondent speeds and positions. Judging from the graphs, we conclude that even if the malfunctional ones are moving in circles, it will not affect the consensus of the legitimate ones, and they will converge to their center, which proves the robustness of our method.

5. Gazebo Simulation

In this section, we apply our theory to the quadrotor swarm in Gazebo. We consider the four-quadrotor-situation where one of them is out of control. In the first case, the malfunctional quadrotor will hover at the original place after taking off, and in the second case will hit the obstacles. We firstly provide a pseudocode of our proposed scheme (Algorithm 1).

Algorithm 1 online adapative controller weight

Input:: Initial location of each quadrotor
Output:: Next-step Trust and Weight
1:: functionTrustevaluation( $x_{j} (t), u_{j} (t)$ )
2:: while not reach consensus do
3:: if Quadrotor j fall within the tolerance of predicted location then $w_{i, j} (t + 1) = w_{i, j} (t) + β$
4:: else $w_{i, j} (t + 1) = w_{i, j} (t) - β$
5:: end if
6:: if
7:: $w_{i, j} (t + 1) \leftarrow 0$ then $n_{i, j} (t + 1) = 0$
8:: end if
9:: end while
10:: end function

Each quadrotor will examine its relationship with its neighbours utilizing the above algorithm at each step, which means that they will online adapt the weight asigned to each other in the proposed protocol. As result, they will be able to distinguish the malfunctional quadrotors and eliminate their impacts on the whole robotic system.

5.1. Position Consensus

In this subsection, we consider the case where one of the four quadrotors hovers at the original place during the experiment. The location of the four quadrotors are

(5, 3), (0, 0), (5, - 3), and (- 5, 0)

separately as shown in Figure 5. The quadrotor at

(- 5, 0)

is designed as the malfunctional one. The four quadrotors are set in offboard mode and act upon our consensus protocol after they reach different heights in order to avoid Z-direction collision.

In the Figure 6, we present the actual movement of the four quadrotors with time under the adaptive consensus protocol. As the malfunctional quadrotor will hover at its original place after it takes off, its trust value will continue going down because its location prediction doesn’t match with its movement. The legitimate quadrotors are able to identify the malfunctional one to eliminate its effect on the whole swarm system by setting its weight to zero in the consensus protocol. Eventually, the legitimate quadrotors reach their center nearly at

X = 2.48, Y = 0.02

.

We next present the actual trajectory of the four quadrotors under the proposed protocol through the rosbag record in Figure 7. The black one which was designed as a malfunctional one will hover at the original place. In addition, the other three will reach their center, not affected by the malfunctional one which in this case is UAV3.

We next present the comparison test. We use the normal consensus protocol that each quadrotor assigns the same weight to its connected neighbors and they will not change weight during the experiment.

We can see from the Figure 8 that if the presence of the malfunctional quadrotor is not taken into consideration or the swarm system fail to identify the malfunctional one. The remaining three legitimate quadrotors will be severely affected as they mistakenly classify the malfunctional quadrotor as a trustworthy one. As the weight of the malfunctional quadrotors is not zero, others will try to reach consensus with it, leading to deviation. However, the swarm system will gradually fly towards

(- 5, 0)

, exactly the place where the malfunctional one is hovering.

We also present the actual trajectory of the four quadrotors under normal consensus protocol through the rosbag record. Figure 9 clearly shows that with the malfunctional quadrotor UAV3 hovering at

(- 5, 0)

, the remaining three legitimate quadrotors drift to UAV3’s location, indicating the severe consequence the misclassification brings about.

Notice that UAV0 whose original location is

(0, 0)

will first move forward due to two quadrotors locating ahead of it, and then drift backward to the malfunctional UAV3.

Compared with the actual movement under adaptive consensus, in which the true consensus point should be near the center of the three legitimate quadrotors’ original spawn location, that is

(2.5, 0)

, the normal consensus leads to severe deviation.

In this case, our protocol gives out only a

0.8 %

relative error at X-location, which is quite acceptable when considering a physical swarm system. While the comparison test gives out a relative error of

250.4 %

, which is unacceptable. This numerically demonstrates the improvement of our method.

5.2. Velocity Consensus

In this section, we extend our theory to velocity consensus and a more typical case will be discussed. We consider a four quadrotors swarm system that tries to fly through a forest while maintaining velocity consensus, and unfortunately, one of the quadrotors hit the tree and is trapped on the branches. Based on the same adaptive consensus protocol, we redesign the trust value and update the trust matrix based on the new function.

We list out the new trust design as the following equation:

\begin{matrix} d_{i, j} (t) = \frac{1}{(X_{i} (t) - X_{i} (t - 1)) / V_{i} (t - 1) + 0.01} \times 0.5 \\ + \frac{1}{(X_{j} (t) - X_{j} (t - 1)) / V_{j} (t - 1) + 0.01} \times 0.5 \end{matrix}

(24)

Notice that the trust function is designed specifically for our experiment. When the malfunctional occurs, i.e., the quadrotor hit the obstacle, the movement of it per second will be very close to zeros, and when we place the measurement in the denominator, the reciprocal will be very large. As result, it can tell the difference between those which are still functioning and the one which hit the trees. Besides, the extra value

0.01

is specialized for the situation when the movement is zero. And with its strong relation with

β

, the greater the

d_{i, j} (t)

will lead to a faster descent in trust value, guaranteeing the quick response to the collision.

We provide the result by MATLAB and GAZEBO separately. In addition, the UAV3 is designed as the one will hit trees after flying for a while:

We set the swarm flying along the X-direction and maintain a lozenge formation. Y-direction velocity and position are not considered in the simulation. We can observe from the Figure 10 that with the unexpected malfunction at

t = 20

, the X-velocity of UAV3 will become zero and resulting in the velocity decrease of the other three legitimate quadrotors. With our proposed trust values based consensus protocol, the remaining three are able to identify the malfunctional quadrotors, restore their velocity and reach velocity consensus.

We next present the GAZEBO simulation to illustrate a more realistic case.

As can be seen from the graphics, at

t = 5

, UAV3 hit the trees and fell to the ground. The three legitimate quadrotors would slow down as they were lowering the trust values, but eventually, they identified the malfunctional quadrotor and disconnected from it. As result, they were able to continue their flying and maintained their consensus, even if the fourth one hit the tree and stopped moving, which indicates the effectiveness of our proposed method (Figure 11).

6. Conclusions

This work considers the problem of quadrotor swarm consensus in the presence of malfunctional quadrotors. Specifically, we propose a consensus protocol based on the trust value evaluation of each quadrotor’s neighboring quadrotors. We show that with the proper trust value design and the proposed consensus protocol:

(1) Correct classification of malfunctional and legitimate quadrotors can be achieved. The legitimate quadrotors can avoid potential risks by eliminating the effect of the malfunctional quadrotor.

(2) Almost sure that the remaining legitimate quadrotors are able to reach convergence even when malfunctional quadrotors occur.

(3) The deviation of the converged limit is much more improved than the one with a simple protocol not taking trust into consideration, which is acceptable and can be applied in physical swarms system.

Additionally, we show that our performance can be applied to a wide range of situations, such as velocity consensus, with different designed trust values and will guarantee more accuracy with more information observed. Taken together, these results point to the inherent value of quantifying and exploiting trust between quadrotors for consensus. Furthermore, with the theoretical derivations and mathematical proofs to guarantee the feasibility of our method and the certain noises to simulate the possible disturbance in reality, we believe that the tests in reality which we are currently working on will still be promising, such as the simulations as they have much in similarity such as gravity, collisions and sensors. All in all, we believe that the proposed method and framework of this article hold promise for a wide range of different purpose swarm systems.

Author Contributions

Conceptualization, Y.X., Y.L. and W.D.; Methodology, Y.X., Y.L. and W.D.; Software, Y.X.; Validation, Y.X.; Formal analysis, Y.X.; Investigation, Y.X.; Resources, W.D.; Data curation, Y.X.; writing—original draft preparation, Y.X.; writing—review and editing, Y.X., Y.L. and W.D.; Supervision, W.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by the Shanghai Rising-Star Program (22QA1404400) and the National Natural Science Foundation of China (Grant No. 51975348).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kar, S.; Moura, J.M. Distributed consensus algorithms in sensor networks: Link failures and channel noise. arXiv 2007, arXiv:0711.3915. [Google Scholar]
Yu, W.; Chen, G.; Wang, Z.; Yang, W. Distributed consensus filtering in sensor networks. IEEE Trans. Syst. Man Cybern. Part B 2009, 39, 1568–1577. [Google Scholar]
Xiong, G.; Kishore, S. Discrete-time second-order distributed consensus time synchronization algorithm for wireless sensor networks. EURASIP J. Wirel. Commun. Netw. 2008, 2009, 1–12. [Google Scholar] [CrossRef] [Green Version]
Mei, J.; Ren, W.; Chen, J. Distributed consensus of second-order multi-agent systems with heterogeneous unknown inertias and control gains under a directed graph. IEEE Trans. Autom. Control 2015, 61, 2019–2034. [Google Scholar] [CrossRef]
Sun, F.; Guan, Z.H.; Zhan, X.S.; Yuan, F.S. Consensus of second-order and high-order discrete-time multi-agent systems with random networks. Nonlinear Anal. Real World Appl. 2012, 13, 1979–1990. [Google Scholar] [CrossRef]
Mei, J.; Ren, W.; Ma, G. Distributed coordination for second-order multi-agent systems with nonlinear dynamics using only relative position measurements. Automatica 2013, 49, 1419–1427. [Google Scholar] [CrossRef]
Zhou, J.; Wang, Q. Convergence speed in distributed consensus over dynamically switching random networks. Automatica 2009, 45, 1455–1461. [Google Scholar] [CrossRef]
Fischer, M.J.; Lynch, N.A.; Paterson, M.S. Impossibility of distributed consensus with one faulty process. J. ACM (JACM) 1985, 32, 374–382. [Google Scholar] [CrossRef]
Lamport, L.; Shostak, R.; Pease, M. The Byzantine generals problem. Concurrency: The Works of Leslie Lamport. 2019, pp. 203–226. Available online: https://lamport.azurewebsites.net/pubs/byz.pdf (accessed on 18 October 2022).
Lamport, L. Paxos made simple. ACM SIGACT News (Distributed Computing Column) 32, 4 (Whole Number 121, December 2001). 2001, pp. 51–58. Available online: https://lamport.azurewebsites.net/pubs/paxos-simple.pdf (accessed on 18 October 2022).
Higgins, F.; Tomlinson, A.; Martin, K.M. Threats to the swarm: Security considerations for swarm robotics. Int. J. Adv. Secur. 2009, 2. Available online: https://www.semanticscholar.org/paper/Threats-to-the-Swarm%3A-Security-Considerations-for-Higgins-Tomlinson/81cfc4dcdfe9940920eae0adaa55dee67bf563df (accessed on 18 October 2022).
Sargeant, I.; Tomlinson, A. Modelling malicious entities in a robotic swarm. In Proceedings of the 2013 IEEE/AIAA 32nd Digital Avionics Systems Conference (DASC), East Syracuse, NY, USA, 5–10 October 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 7B1-1–7B1-12. [Google Scholar]
Higgins, F.; Tomlinson, A.; Martin, K.M. Survey on security challenges for swarm robotics. In Proceedings of the 2009 Fifth International Conference on Autonomic and Autonomous Systems, Valencia, Spain, 20–25 April 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 307–312. [Google Scholar]
Bracha, G. An asynchronous [(n-1)/3]-resilient consensus protocol. In Proceedings of the Third Annual ACM Symposium on Principles of Distributed Computing, Vancouver, BC, Canada, 27–29 August 1984; pp. 154–162. [Google Scholar]
LeBlanc, H.J.; Zhang, H.; Koutsoukos, X.; Sundaram, S. Resilient asymptotic consensus in robust networks. IEEE J. Sel. Areas Commun. 2013, 31, 766–781. [Google Scholar] [CrossRef] [Green Version]
Sundaram, S.; Hadjicostis, C.N. Distributed function calculation via linear iterative strategies in the presence of malicious agents. IEEE Trans. Autom. Control 2010, 56, 1495–1508. [Google Scholar] [CrossRef] [Green Version]
Dibaji, S.M.; Ishii, H. Resilient consensus of second-order agent networks: Asynchronous update rules over robust graphs. In Proceedings of the 2015 American Control Conference (ACC), Chicago, IL, USA, 1–3 July 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1451–1456. [Google Scholar]
Liu, X.; Baras, J.S. Using trust in distributed consensus with adversaries in sensor and other networks. In Proceedings of the 17th International Conference on Information Fusion (FUSION), Salamanca, Spain, 7–10 July 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–7. [Google Scholar]
Tong, W.; Dong, X.; Zheng, J. Trust-pbft: A peertrust-based practical byzantine consensus algorithm. In Proceedings of the 2019 International Conference on Networking and Network Applications (NaNA), Daegu, Korea, 10–13 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 344–349. [Google Scholar]
Yemini, M.; Nedić, A.; Goldsmith, A.J.; Gil, S. Characterizing Trust and Resilience in Distributed Consensus for Cyberphysical Systems. IEEE Trans. Robot. 2021, 38, 71–91. [Google Scholar] [CrossRef]
Christensen, A.L.; OGrady, R.; Dorigo, M. From fireflies to fault-tolerant swarms of robots. IEEE Trans. Evol. Comput. 2009, 13, 754–766. [Google Scholar] [CrossRef]
Millard, A.G.; Timmis, J.; Winfield, A.F. Towards exogenous fault detection in swarm robotic systems. In Proceedings of the Conference towards Autonomous Robotic Systems; Springer: Berlin/Heidelberg, Germany, 2013; pp. 429–430. [Google Scholar]
Khadidos, A.; Crowder, R.M.; Chappell, P.H. Exogenous fault detection and recovery for swarm robotics. IFAC-PapersOnLine 2015, 48, 2405–2410. [Google Scholar] [CrossRef]
Khaldi, B.; Harrou, F.; Sun, Y.; Cherif, F. A measurement-based fault detection approach applied to monitor robots swarm. In Proceedings of the 2017 6th International Conference on Systems and Control (ICSC), Batna, Algeria, 7–9 May 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 21–26. [Google Scholar]
Khaldi, B.; Harrou, F.; Cherif, F.; Sun, Y. Monitoring a robot swarm using a data-driven fault detection approach. Robot. Auton. Syst. 2017, 97, 193–203. [Google Scholar] [CrossRef] [Green Version]
Lin, P.; Jia, Y. Consensus of second-order discrete-time multi-agent systems with nonuniform time-delays and dynamically changing topologies. Automatica 2009, 45, 2154–2158. [Google Scholar] [CrossRef]
Phillips, J.M. Chernoff-hoeffding inequality and applications. arXiv 2012, arXiv:1209.6396. [Google Scholar]
Wolfowitz, J. Products of indecomposable, aperiodic, stochastic matrices. Proc. Am. Math. Soc. 1963, 14, 733–737. [Google Scholar] [CrossRef]

Figure 1. 15 malfunctional vs. 15 legitimate Time and XY-graph.

Figure 2. 15 malfunctional vs. 15 legitimate X,Y,VX,VY-time.

Figure 3. 15 malfunctional vs. 15 legitimate in moving Time and XY-graph.

Figure 4. 15 malfunctional vs. 15 legitimate X,Y,VX,VY-time in moving.

Figure 5. Original location before takeoff.

Figure 6. Correct Consensus—malfunctional quadrotor doesn’t affect others.

Figure 7. Correct Consensus Trajectory From ROSBAG.

Figure 8. Incorrect Consensus—malfunctional quadrotor severely affects others.

Figure 9. Incorrect Consensus Trajectory From ROSBAG.

Figure 10. Velocity Consensus.

Figure 11. Velocity Consensus—After one quadrotor hit the trees, the rest continue functioning.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xie, Y.; Li, Y.; Dong, W. Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors. Drones 2022, 6, 371. https://doi.org/10.3390/drones6120371

AMA Style

Xie Y, Li Y, Dong W. Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors. Drones. 2022; 6(12):371. https://doi.org/10.3390/drones6120371

Chicago/Turabian Style

Xie, Yichen, Yuzhu Li, and Wei Dong. 2022. "Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors" Drones 6, no. 12: 371. https://doi.org/10.3390/drones6120371

APA Style

Xie, Y., Li, Y., & Dong, W. (2022). Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors. Drones, 6(12), 371. https://doi.org/10.3390/drones6120371

Article Menu

Behavior Prediction Based Trust Evaluation for Adaptive Consensus of Quadrotors

Abstract

1. Introduction

2. Problem Description

2.1. Model

2.2. Noise

2.3. The Trust Matrix and the Weight Matrix

3. Analysis

4. MATLAB Simulation

4.1. Static Situation

4.2. Movement Situation

5. Gazebo Simulation

5.1. Position Consensus

5.2. Velocity Consensus

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI