PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication

Wu, Yuqi; Sun, Xu; Wang, Ting; Wang, Jie

doi:10.3390/act14060267

Open AccessArticle

PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication

¹

School of Intelligence Science and Technology, University of Science and Technology Beijing, Beijing 100083, China

²

Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing 100083, China

^*

Author to whom correspondence should be addressed.

Actuators 2025, 14(6), 267; https://doi.org/10.3390/act14060267

Submission received: 2 April 2025 / Revised: 15 May 2025 / Accepted: 22 May 2025 / Published: 28 May 2025

(This article belongs to the Special Issue Advances in Intelligent Control of Actuator Systems)

Download

Browse Figures

Versions Notes

Abstract

In this paper, a new consensus tracking control algorithm is proposed for discrete multi-agent systems under binary communication with noise and a time-varying reference state. Unlike previous studies, the leader’s reference state is time-varying and convergent. Each agent estimates its neighbors’ states using a recursive projection algorithm based on noisy binary-valued information. The controller design incorporates both the error between the current and estimated states and the rate of change of the estimated state, resulting in a proportional–derivative-like algorithm (PD-like algorithm). The algorithm achieves consensus tracking with a convergence rate of

O (1 / t^{ε})

under certain conditions. Finally, numerical simulations demonstrate the algorithm’s effectiveness and validate the theoretical results.

Keywords:

multi-agent systems; tracking control problem; time-varying reference state; binary-valued communication; convergence; PD-like algorithm

1. Introduction

Multi-agent systems (MASs) consist of multiple interacting agents and are widely applied in areas such as autonomous vehicle platooning, multi-UAV systems, microgrid frequency control, and other domains [1,2,3]. The consensus control problem is one of the core issues in MAS, aiming to design distributed controllers to make the states of all agents converge to a consensus. The theoretical study of consensus problems began in [4], where A. Jadbabaie et al. provided a theoretical explanation for the phenomenon observed in the Vicsek model [5] using graph connectivity, thus improving the coordination theory based on the nearest neighbor rule in MASs. In [6], R. Olfati-Saber et al. used the Laplacian matrix to study the consensus problem in MASs, providing a consensus analysis framework based on graph theory. Ref. [7] pointed out that under dynamic interaction topologies, if the union of interaction graphs has spanning trees frequently enough during certain time periods, MASs can asymptotically reach consensus, and the restrictions on weighting factors are relaxed. These laid the theoretical foundation for subsequent research on consensus in MAS.

In the consensus tracking control problem of MASs, leader-following consensus is an important cooperative control strategy. Ref. [8] studied the problem of achieving finite-time leader-following consensus in second-order multi-agent systems with both fixed and switching topologies. For situations where the leader’s state information cannot be directly obtained, ref. [9] proposed a tracking control protocol based on distributed observers using relative output measurements and information received from neighbors. Ref. [10] combined neighbor-based feedback control laws and dynamic estimation rules with Lyapunov stability analysis to investigate the consensus problem in MASs with an active leader and variable topologies.

With the practical application of MAS, the influence of noise such as sensor noise, channel fading, and electromagnetic interference on measurement and information communication cannot be ignored. Ref. [11] introduced a stochastic approximation algorithm for models with noisy measurements and proposed to combine the decay property of the stochastic Lyapunov function with the so-called invariance direction to achieve convergence analysis. Ref. [12] proposed a new velocity decomposition technique for noisy measurements and time-varying directed topologies and designed a distributed estimation algorithm based on it. Furthermore, with limited communication bandwidth resources, quantized communication has been introduced to improve communication efficiency among MASs. The study of quantized consensus began in [13], where each agent’s state was set as an integer, and a quantized gossip algorithm was proposed for the distributed averaging problem to achieve quantized consensus. Quantized communication has increasingly attracted the interest of researchers. Refs. [14,15,16,17] used different types of quantizers to study consensus problems.

However, there is relatively little research on binary communication, as a special form of quantized communication, in MASs. However, its simplicity, extremely low communication bandwidth requirements, and strong anti-interference ability have attracted many researchers. Refs. [18,19] study system identification problems under binary-valued observations and cyber-attacks. Ref. [20] proposed a two-time-scale consensus protocol for consensus problems with random noise and binary measurements under undirected and fixed topologies. However, the state estimation and control in it are carried out alternately. Inspired by the recursive projection algorithm, refs. [21,22] proposed a new consensus algorithm that can simultaneously perform state estimation and control and update the state in real time, considering noisy measurement and binary quantizers. The convergence rate of this algorithm is

O (1 / t)

, which is faster than that in [20]. Ref. [23] further explored the consensus tracking problem in systems with time-invariant leaders under directed topologies. In this paper, the leader’s state discussed is time-varying, which is more challenging than in [22]. Meanwhile, in [24], the authors separately investigated the cases of a convergent leader and bounded leader. Decaying gains and constant gains in the control algorithms are designed for the two cases, respectively.

In this paper, the state of the leader is time-varying, and the uniform boundedness of the agent’s time-varying state cannot be guaranteed, requiring a reanalysis of its uniform boundedness properties. This increases the difficulty of analysis compared with the constant leader in Reference [23]. Moreover, most of the consensus algorithms with binary-valued communication only use proportional (P-like) control strategies based on the estimated states of neighbors. For consensus tracking problems with accurate information of neighbors, PD-like algorithms were designed in [25,26]. In this paper, a PD-like algorithm is introduced, taking into account the rate of change of the estimated state of the agents’ neighbors. How to analyze the influence of the derivative term on consensus error is more challenging.

The contributions of this paper are as follows:

This paper proposes a novel online algorithm consisting of estimation and control for the consensus tracking problem with a time-varying reference state and binary-valued communication. In the estimation part, the recursive projection algorithm (RPA) is used to deal with binary-valued observations. In the control part, a differential term is added to track the varying of neighbors in addition to a proportional term, which results in a PD-like algorithm.
Due to the introduction of differential terms, the PD-like algorithm increases the complexity of convergence analysis. The differential terms describe an estimate of the rate of change of the neighbors. Using the properties of the estimation algorithm, the differential term can be handled using the rate of change of the leader. If the state of the leader is convergent, it is proved that the followers can asymptotically track the leader using a dual Lyapunov function analysis framework constructed based on the estimation and tracking control.
The PD-like algorithm has significant advantages in terms of convergence speed. It introduces the difference in state estimation in the control input and integrates the recursive projection estimation and differential feedback mechanisms, compensating for the loss of quantized information and enhancing the system’s adaptability to dynamic leaders. This enables the system to adjust its own state quickly and accelerate the convergence speed towards the leader’s state. Moreover, through theoretical analysis, it can be seen that the convergence rate of the algorithm depends on the rate of change of the leader. Compared with algorithms that only use proportional control strategies in [24], the convergence rate of this algorithm is faster, which ensures that the multi-agent system can quickly achieve consensus tracking with binary-valued communication.

The remainder of this paper is structured as follows. Nomenclature summarizes some important symbols. Section 2 describes the consensus tracking control problem with binary-valued communication. Section 3 presents the consensus tracking control algorithm. Section 4 provides the main conclusions of the algorithm, including convergence and the convergence rate. Section 5 verifies the effectiveness of the theory through simulation. Section 6 summarizes this paper and looks ahead to future work.

2. Problem Formulation

Consider a discrete-time multi-agent system consisting of n agents:

\begin{matrix} x_{i} (t + 1) = x_{i} (t) + u_{i} (t), i = 1, \dots, n, \end{matrix}

(1)

where

x_{i} (t) \in R

represents the state of agent i at time t, and

u_{i} (t) \in R

is the corresponding control input. This system also includes a leader agent

n + 1

, whose state is

x_{n + 1} (t)

and is expressed as follows:

\begin{matrix} x_{n + 1} (t + 1) = x_{n + 1} (t) + u_{n + 1} (t), \end{matrix}

(2)

where

u_{n + 1} (t)

is the rate of change of the leader’s state. The remaining n agents are called followers. So, the vector update equation of this system is as follows:

\begin{matrix} X (t + 1) = X (t) + U (t), t = 1, 2, \dots \end{matrix}

(3)

with

X (t) = {[x_{1} (t), \dots, x_{n + 1} (t)]}^{T}

. Let

g (t) = u_{n + 1} (t)

and

U (t) = {[u_{1} (t), \dots, u_{n} (t), g (t)]}^{T}

.

Consider that the multi-agent system is represented by a directed topological network structure

G = (N, E)

, where

N = {1, 2, \dots, n + 1}

is the set of nodes, and each node corresponds to an agent.

E = N \times N

is the set of edges, which represent the channels for information interaction among agents. Matrix

A = {(a_{i j})}_{(n + 1) \times (n + 1)}

is the adjacency matrix of topology G. If there is an edge from agent j to agent i, which means that agent i can directly obtain information from agent j, then

a_{i j} = 1

, and agent j is called a neighbor of agent i, denoted as

j \in N_{i}

; otherwise,

a_{i j} = 0

, and agent j is not a neighbor of agent i.

D = d i a g (d_{1}, d_{2}, \dots, d_{n + 1})

is the degree matrix, where

d_{i}

represents the number of neighbors of agent i. Specifically, since the leader cannot receive feedback from any follower, for adjacency matrix A, we have

a_{(n + 1) j} = 0

,

j = 1, 2, \dots, n

, and

d_{(n + 1)} = 0

. The Laplacian matrix of the directed graph is

L = D - A

.

When agent i receives information from its neighbor agent j, it is affected by random noise, and its observed value is as follows:

\begin{matrix} \{\begin{matrix} y_{i j} (t) = x_{j} (t) + σ_{i j} (t), j = 1, 2, \dots, n + 1, \\ s_{i j} (t) = I_{{y_{i j} (t) \leq C}}, j \in N_{i}, \end{matrix} \end{matrix}

(4)

where

N_{i}

is the set of all neighbors of agent i,

x_{j} (t)

is the state of agent j at time t,

σ_{i j} (t) \in R

is the communication noise,

y_{i j} (t) \in R

is the unmeasurable output, C is the binary sensor threshold, and

s_{i j} (t)

is the binary information obtained by agent i from neighbor agent j.

I_{{.}}

is an indicative function, defined as

\begin{matrix} I_{{v \in V}} = \{\begin{matrix} 1, if v \in V, \\ 0, others . \end{matrix} \end{matrix}

(5)

Assumption 1.

Network structure G is connected and is a directed spanning tree with the leader as the root node.

Remark 1.

Let L be the Laplacian matrix of directed graph G. It has eigenvalues with non-negative real parts and has a unique eigenvalue of 0. The corresponding eigenvector has all elements equal to 1.

Remark 2.

According to the condition of Assumption 1, there exists a matrix

χ_{(n + 1) \times n}

that satisfies the following conditions:

r a n k {χ_{(n + 1) \times n}} = r a n k {L} .

Let an invertible matrix

Ω = [1_{(n + 1) \times 1}, χ_{(n + 1) \times n}]

, with

Ω^{- 1} = (\begin{matrix} τ \\ ξ_{n \times (n + 1)} \end{matrix})

and

Ω^{- 1} L Ω = (\begin{matrix} 0 & 0_{1 \times n} \\ 0_{n \times 1} & \tilde{L} \end{matrix})

, where all eigenvalues of matrix

\tilde{L}

are the non-zero eigenvalues of matrix L with positive real parts. And matrix

\tilde{L}

is a Jordan matrix.

Assumption 2.

In Equation (4), the noise

{σ_{i j} (t), i, j \in N}

follows a normal distribution with a mean of 0 and is independent of i, j, and t. Its distribution function is

F (x)

, and its probability density function is

f (x) \neq 0

.

Remark 3.

We assume that the distribution of this noise is known from prior knowledge.

The reference state considered in this paper is time-varying and convergent, which is described by the following assumption.

Assumption 3.

There exists a constant

x^{*}

such that

lim_{t \to \infty} x_{n + 1} (t) = x^{*} .

From Assumption 3, the following lemma can be derived.

Lemma 1.

The rate of change of the leader satisfies that

\sum_{i = 1}^{\infty} g (i)

is convergent [24].

The main task of the algorithm is to design the control

u_{i} (t)

so that the followers can track the leader’s state through binary communication, that is,

\begin{matrix} lim_{t \to \infty} ∥x_{i} (t) - x_{n + 1} (t)∥ = 0, i = 1, . . ., n . \end{matrix}

(6)

3. Control Algorithm

We use the recursive projection algorithm (RPA) in [21] to estimate the states of neighbors, where the step size decays over time. An attenuation gain and an estimated differential term are introduced into the control law. The consensus algorithm and the control law are as follows:

Initialization: The initial state of each agent and its estimations of the initial states of its neighbors are as follows:

$x_{i} (1) = x_{i}^{0}, {\hat{x}}_{i j} (0) = {\hat{x}}_{i j}^{0},$

for $j \in N_{i}, i = 1, 2, \dots, n$ , $|x_{i}^{0}| \leq W$ , and $|{\hat{x}}_{i j}^{0}| \leq W$ . Here, $W > 0$ , which is a known boundary for the states.
Observation: Each agent observes the binary information of its neighbors, as shown in Equation (4).
Estimation: Each agent uses the observed binary information to calculate the estimation of its neighbors through the RPA algorithm:

$\begin{matrix} {\hat{x}}_{i j} (t) = & Π_{W} {{\hat{x}}_{i j} (t - 1) \\ + \frac{β}{t} (F (C - {\hat{x}}_{i j} (t - 1)) - s_{i j} (t))}, \end{matrix}$

(7)

where $β$ is a coefficient in the step size of estimation, and $Π_{W} \{.\}$ is a projection operator defined as follows:

$Π_{W} (x) = arg min_{|w| \leq W} |x - w| = \{\begin{matrix} - W, & if x < - W; \\ x, & if | x | \leq W; \\ W, & if x > W . \end{matrix}$
Update: Each agent designs a controller to update its own state based on the estimation of its neighbors.

$\begin{matrix} u_{i} (t) = & - \frac{1}{t + 1} \sum_{j = 1}^{n + 1} a_{i j} [γ (x_{i} (t) - {\hat{x}}_{i j} (t)) \\ - ({\hat{x}}_{i j} (t) - {\hat{x}}_{i j} (t - 1))], i = 1, 2, \dots, n, \end{matrix}$

(8)

where $γ > 0$ is a constant. The proportional (P) term and the derivative (D) term are $γ (x_{i} (t) - {\hat{x}}_{i j} (t))$ and $- ({\hat{x}}_{i j} (t) - {\hat{x}}_{i j} (t - 1))$ , respectively. By designing the controller, the state of agent i is updated as follows:

$\begin{matrix} x_{i} (t + 1) = & x_{i} (t) - \frac{γ}{t + 1} \sum_{j = 1}^{n + 1} a_{i j} (x_{i} (t) - {\hat{x}}_{i j} (t)) \\ + \frac{1}{t + 1} \sum_{j = 1}^{n + 1} a_{i j} ({\hat{x}}_{i j} (t) - {\hat{x}}_{i j} (t - 1)), \end{matrix}$

(9)
Repeat: $t = t + 1$ .

Remark 4.

According to the recursive projection operator, it can be known that the states estimated by the agents for their neighbors are bounded.

|{\hat{x}}_{i j} (t)| \leq W, \forall i = 1, 2, \dots, n, j \in N_{i} .

Proposition 1.

The states of all the followers updated using (9) satisfy

|x_{i} (t)| \leq M_{0} + \frac{d_{*} β π^{2}}{6}, \forall i = 1, \dots, n, t \geq ⌊ γ d_{*} ⌋ + 1,

where

M_{0} = max \{{max}_{i = 1, \dots, n} |x_{i} (⌊ γ d_{i} ⌋)|, W\}

,

d_{i}

is the number of agent i’s neighbors, γ is the control parameter,

d_{*} = {max}_{i = 1, \dots, n} \{d_{1}, d_{2}, \dots, d_{n}\}

, and β is the coefficient of estimation.

Proof of Proposition 1.

By updating the state of (9), we can obtain

\begin{matrix} |x_{i} (t)| = & | x_{i} (t - 1) - \frac{1}{t} \sum_{j = 1}^{n + 1} a_{i j} [γ (x_{i} (t - 1) - {\hat{x}}_{i j} (t - 1)) \\ - {\hat{x}}_{i j} (t - 1) + {\hat{x}}_{i j} (t - 2)] | \\ = & | (1 - \frac{γ d_{i}}{t}) x_{i} (t - 1) + \frac{1}{t} \sum_{j = 1}^{n + 1} a_{i j} [γ {\hat{x}}_{i j} (t - 1) \\ + {\hat{x}}_{i j} (t - 1) - {\hat{x}}_{i j} (t - 2)] | \\ \leq & |(1 - \frac{γ d_{i}}{t}) x_{i} (t - 1) + \frac{1}{t} \sum_{j = 1}^{n + 1} a_{i j} γ {\hat{x}}_{i j} (t - 1)| + \\ \frac{1}{t} \sum_{j = 1}^{n + 1} a_{i j} |{\hat{x}}_{i j} (t - 1) - {\hat{x}}_{i j} (t - 2)| \\ \leq & |(1 - \frac{γ d_{i}}{t}) x_{i} (t - 1)| + \frac{γ d_{i}}{t} W + \\ \frac{1}{t} \sum_{j = 1}^{n + 1} a_{i j} |{\hat{x}}_{i j} (t - 1) - {\hat{x}}_{i j} (t - 2)| . \end{matrix}

(10)

The change rate of the neighbor’s estimated state can be obtained using projection algorithm (7):

\begin{matrix} |{\hat{x}}_{i j} (t) - {\hat{x}}_{i j} (t - 1)| \leq & \frac{β}{t} | ((F (C - {\hat{x}}_{i j} (t - 1)) - s_{i j} (t))) | \\ \leq & \frac{β}{t}, \end{matrix}

(11)

Then, we can obtain the following using (10):

\begin{matrix} |x_{i} (t)| \leq |(1 - \frac{γ d_{i}}{t}) x_{i} (t - 1)| + \frac{γ d_{i}}{t} W + \frac{d_{i} β}{t^{2}} . \end{matrix}

Here, in order to further determine the upper bound of

x_{i} (t)

, let

t = t_{i} = ⌊ γ d_{i} ⌋ + 1

. Since

γ

and

d_{i}

are fixed parameters related to the agents, choosing this specific time point is helpful for facilitating subsequent derivations in combination with the previous inequalities and initial conditions.

Then, we have

\begin{matrix} |x_{i} (t_{i})| \leq & |(1 - \frac{γ d_{i}}{⌊ γ d_{i} ⌋ + 1}) x_{i} (⌊ γ d_{i} ⌋)| \\ + \frac{γ d_{i}}{⌊ γ d_{i} ⌋ + 1} |{\hat{x}}_{i j} (⌊ γ d_{i} ⌋)| + \frac{d_{i} β}{t_{i}^{2}} \\ \leq & max \{|x_{i} (⌊ γ d_{i} ⌋)|, W\} + \frac{d_{i} β}{t_{i}^{2}} . \end{matrix}

Since the initial state of agent i and the estimates of the neighbors’ states are bounded, agent i’s state will be bounded through updating with a finite step. So, there exists a constant

M_{0}

such that

{max}_{i = 1, \dots, n} \{|x_{i} (⌊ γ d_{i} ⌋)|, W\} = M_{0} < \infty

. Hence,

|x_{i} (t_{i})| \leq M_{0} + \frac{d_{i} β}{t_{i}^{2}} .

Assuming that

|x_{i} (t_{i} + m)| \leq M_{0} + d_{i} β \sum_{j = t_{i}}^{t_{i} + m} \frac{1}{j^{2}},

we can obtain the following inequality:

\begin{matrix} |x_{i} (t_{i} + m + 1)| \\ \leq & |(1 - \frac{γ d_{i}}{t_{i} + m + 1}) x_{i} (t_{i} + m) \\ + \frac{γ d_{i}}{t_{i} + m + 1} {\hat{x}}_{i j} (t_{i} + m)| + \frac{d_{i} β}{{(t_{i} + m + 1)}^{2}} \\ \leq & (1 - \frac{γ d_{i}}{t_{i} + m + 1}) |x_{i} (t_{i} + m)| \\ + \frac{γ d_{i}}{t_{i} + m + 1} |{\hat{x}}_{i j} (t_{i} + m)| + \frac{d_{i} β}{{(t_{i} + m + 1)}^{2}} \\ \leq & max \{|x_{i} (t_{i} + m)|, W\} + \frac{d_{i} β}{{(t_{i} + m + 1)}^{2}} \\ \leq & M_{0} + d_{i} β \sum_{j = t_{i}}^{t_{i} + m} \frac{1}{j^{2}} + \frac{d_{i} β}{{(t_{i} + m + 1)}^{2}} \\ = & M_{0} + d_{i} β \sum_{j = t_{i}}^{t_{i} + m + 1} \frac{1}{j^{2}} . \end{matrix}

According to mathematical induction, we have

|x_{i} (t)| \leq M_{0} + d_{i} β \sum_{j = t_{i}}^{t} \frac{1}{j^{2}}, t \geq t_{i} .

Let

d_{*} = max_{i = 1, \dots, n} d_{i}, t_{*} = max_{i = 1, . . ., n} t_{i},

It follows that

\begin{matrix} |x_{i} (t)| \leq M_{0} + d_{i} β \sum_{j = t_{i}}^{t} \frac{1}{j^{2}} \leq M_{0} + d_{*} β \sum_{j = 1}^{t} \frac{1}{j^{2}}, \forall t \geq t_{*} . \end{matrix}

Due to

\sum_{j = 1}^{\infty} \frac{1}{j^{2}} = \frac{π^{2}}{6}

, we have

\begin{matrix} |x_{i} (t)| \leq M_{0} + \frac{d_{*} β π^{2}}{6}, \forall i = 1, \dots, n, t \geq t_{*} = ⌊ γ d_{*} ⌋ + 1 . \end{matrix}

□

In order to analyze the impact of the difference between the estimated value and the true value on the system, we define error vectors

ϵ (t)

. They combine the estimation errors of each agent for the states of their neighbors. Hence, let

ϵ_{i j} (t) = {\hat{x}}_{i j} (t) - x_{j} (t)

. Define the error vectors

ϵ (t)

as follows:

\begin{matrix} ϵ (t) = & (ϵ_{1 r_{1}} (t), \dots, ϵ_{1 r_{d_{1}}} (t), ϵ_{2 r_{d_{1} + 1}} (t), \dots, ϵ_{2 r_{d_{1} + d_{2}}} (t) \\ \dots, ϵ_{n r_{d_{1} + \dots + d_{n - 1} + 1}} (t), \dots, ϵ_{n r_{d_{1} + \dots + d_{n}}} {(t))}^{T}, \end{matrix}

(12)

with

r_{1}, \dots, r_{d_{1}} \in N_{1}, r_{d_{1} + 1}, \dots, r_{d_{1} + d_{2}} \in N_{2}, \dots, r_{d_{1} + \dots + d_{n - 1} + 1}, \dots, r_{d_{1} + \dots + d_{n}} \in N_{n}

. For any error vector

ϵ (t)

, we define the following

(n + 1)

-dimensional vectors

u_{i j}

and

v_{i j}

as the starting point and the ending point, respectively:

\begin{matrix} u_{i j} = & {(0, \dots, 0, \underset{i th position}{\underset{︸}{1}}, 0, \dots, 0)}^{T} . \end{matrix}

v_{i j} = {(0, \dots, 0, \underset{j th position}{\underset{︸}{1}}, 0, \dots, 0)}^{T},

for

j \in N_{i}, i = 1, . . ., n

. Place

{u_{i j},, j \in N_{i}, i = 1, . . ., n}

and

{v_{i j},, j \in N_{i}, i = 1, . . ., n}

in the order of the error vector

ϵ (t)

, and the following two matrices are obtained:

\begin{matrix} U = & [u_{1 r_{1}}, \dots, u_{1 r_{d_{1}}}, u_{2 r_{d_{1} + 1}}, \dots, u_{2 r_{d_{1} + d_{2}}}, \dots, \\ u_{n r_{d_{1} + \dots + d_{n - 1} + 1}}, \dots, u_{n r_{d_{1} + \dots + d_{n}}}] \end{matrix}

(13)

\begin{matrix} = & {[\begin{matrix} \underset{d_{1}}{\underset{︸}{1 \dots 1}} \\ \underset{d_{2}}{\underset{︸}{1 \dots 1}} & 0 \\ 0 & \dots \\ \underset{d_{n}}{\underset{︸}{1 \dots 1}} \\ 0 & 0 & \dots & 0 \end{matrix}]}_{(n + 1) \times (d_{1} + \dots + d_{n})}, \end{matrix}

(14)

\begin{matrix} V = & {[\begin{matrix} v_{1 r_{1}}^{T} \\ ⋮ \\ v_{r_{d_{1}} 1}^{T} \\ v_{r_{d_{1} + 1} 2}^{T} \\ ⋮ \\ v_{r_{d_{1} + \dots + d_{n + 1}} (n + 1)}^{T} \end{matrix}]}_{(d_{1} + \dots + d_{n + 1}) \times (n + 1)} . \end{matrix}

(15)

Let

x (t) = {[x_{1} (t), x_{2} (t), \dots, x_{n + 1} (t)]}^{T}

. The vector form of system updating can be given as follows by Equations (2) and (9):

\begin{matrix} x (t + 1) = & (I - \frac{γ L}{t + 1}) x (t) + \frac{γ U}{t + 1} ϵ (t) \\ + \frac{U}{t + 1} R (t) + Υ (t), \end{matrix}

(16)

where L is the

(n + 1) \times (n + 1)

Laplacian matrix of network G. U is defined in (13),

Υ (t) = {[0, 0, \dots, 0, g (t)]}^{T}

,

ϵ (t)

is defined in (12), and

R (t) = [r_{i j} (t)]

with

r_{i j} (t) = {\hat{x}}_{i j} (t) - {\hat{x}}_{i j} (t - 1)

placed in the same order as that in

ϵ (t)

.

4. Main Result

In this section, the tracking error and the estimation error are first defined, and the conditional inequalities they satisfy are given. Finally, the convergence and convergence rates of the tracking error and the estimation error are obtained.

For a discrete multi-agent system, when Equation (6) is satisfied, it indicates that the system has reached a consensus. In Remark 2,

τ = (τ_{1}, τ_{2}, . . ., τ_{n + 1})

is the left eigenvector corresponding to eigenvalue 0 of Laplacian matrix L, and it satisfies

τ 1_{n + 1} = 1

. Define a new convergence index:

\begin{matrix} δ (t) & = (x_{1} - \sum_{j = 1}^{n + 1} τ_{j} x_{j}, x_{2} - \sum_{j = 1}^{n + 1} τ_{j} x_{j}, . . ., x_{n + 1} - \sum_{j = 1}^{n + 1} τ_{j} x_{j}) \\ = (I - J_{n + 1}) x (t), \end{matrix}

where

J_{n + 1} = 1_{n \times 1} τ

. Since

lim_{t \to \infty} ∥ δ (t) ∥ = lim_{t \to \infty} \sqrt{\sum_{i = 1}^{n + 1} {∥\sum_{j = 1}^{n + 1} τ_{j} (x_{i} - x_{j})∥}^{2}}

, we can conclude that

lim_{t \to \infty} ∥x_{i} (t) - x_{n + 1} (t)∥ = 0 \Leftrightarrow lim_{t \to \infty} ∥ δ (t) ∥ = 0,

where

i = 1, . . ., n

. Since

(I - J_{n + 1}) (I - \frac{γ L}{t + 1}) = (I - \frac{γ L}{t + 1}) (I - J_{n + 1})

, we can obtain

\begin{matrix} δ (t + 1) = & (I - \frac{γ L}{t + 1}) δ (t) \\ + \frac{(I - J_{n + 1}) γ U}{t + 1} ϵ (t) + \frac{(I - J_{n + 1}) U}{t + 1} R (t) + (I - J_{n + 1}) Υ (t) . \end{matrix}

Define

\tilde{δ} (t) = {[{\tilde{δ}}_{1} (t), {\tilde{δ}}_{2} (t) . . ., {\tilde{δ}}_{n + 1} (t)]}^{T} = Ω^{- 1} δ (t)

and

ψ (t) = {[{\tilde{δ}}_{2} (t), {\tilde{δ}}_{3} (t), . . ., {\tilde{δ}}_{n + 1} (t)]}^{T}

. We have

{\tilde{δ}}_{1} (t + 1) = τ δ (t + 1) = τ (I - J_{n + 1}) x (t) = 0,

and

\begin{matrix} ψ (t + 1) \\ = & (I - \frac{γ \tilde{L}}{t + 1}) ψ (t) + \frac{ξ_{n \times (n + 1)} (I - J_{n + 1}) γ U}{t + 1} ϵ (t) \\ + \frac{ξ_{n \times (n + 1)} (I - J_{n + 1}) U}{t + 1} R (t) + ξ_{n \times (n + 1)} (I - J_{n + 1}) Υ (t), \end{matrix}

(17)

Thus, we can deduce that

∥δ (t)∥ = ∥\tilde{δ} (t)∥ = ∥ψ (t)∥

. And to enable the system to achieve consensus tracking, we can prove that

lim_{t \to \infty} ∥ψ (t)∥ = 0

, where

∥ x ∥ = \sqrt{E (x^{T} x)}

for any random variable x.

Matrix

- \tilde{L}

is a Hurwitz matrix. There exists a positive definite symmetric matrix

K > 0

that satisfies the following equation:

K \tilde{L} + {\tilde{L}}^{T} K = I .

(18)

Due to

λ_{m i n} (K) E (ψ {(t)}^{T} ψ (t)) \leq E (ψ {(t)}^{T} K ψ (t)) \leq λ_{m a x} (K) E (ψ {(t)}^{T} ψ (t))

, we have

lim_{t \to \infty} E (ψ {(t)}^{T} ψ (t)) = 0 \Leftrightarrow lim_{t \to \infty} E (ψ {(t)}^{T} K ψ (t)) = 0 .

(19)

where

λ_{m i n} (K)

and

λ_{m a x} (K)

are the minimum and maximum eigenvalues of matrix K respectively.

Define

P_{1} (t) = E (ψ {(t)}^{T} K ψ (t))

and

P_{2} (t) = E (ϵ {(t)}^{T} ϵ (t))

, which represent the tracking error and the estimation error in the mean square sense, respectively.

In comparison to state updating [24] using the P-like control algorithm, parameter

γ

and item

\frac{1}{t} R (t)

are added to state updating Equation (16) with the PD-like control algorithm. The parameter is constant, and we can obtain the property of item

\frac{1}{t} R (t)

with (11) as follows:

\begin{matrix} \frac{1}{t} R (t) = O (\frac{1}{t^{2}}) . \end{matrix}

(20)

We have the following lemmas on

P_{1} (t)

and

P_{2} (t)

.

Lemma 2.

Under Assumptions 1 and 2, tracking error

P_{1} (t)

satisfies the error of estimates

\begin{matrix} P_{1} (t) \leq & (1 - \frac{γ}{4 λ_{K} t}) P_{1} (t - 1) + \frac{4 λ_{K} λ_{ξ} d_{*}}{γ t} P_{2} (t - 1) \end{matrix}

\begin{matrix} + O (g (t - 1)) + O (\frac{1}{t^{2}}), t > T_{1}, \end{matrix}

(21)

where

λ_{K} = λ_{m a x} (K)

,

λ_{ξ}

is the maximum eigenvalue of matrix

{(I - J_{n + 1})}^{T} ξ^{T} K ξ (I - J_{n + 1})

, and

T_{1}

is a constant.

Lemma 3.

Under Assumptions 1 and 2, estimate error

P_{2} (t)

satisfies

\begin{matrix} P_{2} (t) \leq & (1 - \frac{2 β f_{C} - \frac{γ λ_{U} λ_{L}}{α} - 2 d_{*}}{t}) P_{2} (t - 1) \\ + \frac{α k γ}{t} P_{1} (t - 1) + O (g (t - 1)) + O (\frac{1}{t^{2}}), \end{matrix}

(22)

as

t > T_{2}

, where

α > 0, f_{C} = f (| C | + W)

, W is the bound of the projection of estimation,

α > 0

,

λ_{U} = λ_{max} (U U^{T})

,

λ_{L} = λ_{max} (L L^{T})

,

k = \frac{1}{λ_{min} (K)}

, and

T_{2}

is a constant.

Due to Equation (20), the proofs of Lemmas 2 and 3 are similar to the proofs of Lemmas 2 and 3 in [24].

To prove the convergence of the algorithm, we give a lemma as follows.

Lemma 4

(Theorem 1.2.23, [27]). If

x_{n}

satisfies the iterative equation

x_{t + 1} = (1 - a_{t}) x_{t} + b_{t}, t \geq 0,

where

a_{t} \in [0, 1)

,

\sum_{t = 1}^{\infty} b_{t}

converges, then

x_{t} \to 0, \forall x_{0} \neq 0 \Leftrightarrow \sum_{t = 1}^{\infty} a_{t} = \infty .

By Lemmas 1–4, we can obtain the convergence and convergence rate of the algorithm as follows.

Theorem 1

(Convergence). For the algorithm with Assumptions 1–3, if the coefficient β in estimate satisfies

β > \frac{l}{f_{C}}

where

l = \frac{γ^{3} λ_{U} λ_{L} k}{8 λ_{K} λ_{ξ} d_{*}} + \frac{32 λ_{K}^{3} λ_{ξ}^{2} d_{*}^{2}}{γ^{3}} + d_{*}

,

λ_{K}, λ_{ξ}, k, λ_{U}, λ_{L}, d_{*}

and

f_{C}

are the same as those in Lemmas 2 and 3, we can obtain

E {({\hat{x}}_{i j} (t) - x_{j} (t))}^{2} \to 0

for

j \in N_{i}, i = 1, 2, \dots n + 1, i \neq j

and

E {(x_{i} (t) - x_{n + 1} (t))}^{2} \to 0

for

\forall i = 1, . . ., n + 1

.

Proof of Theorem 1.

Since error functions (21) and (22) are interrelated, we consider them together:

\{\begin{matrix} P_{1} (t) \leq & (1 - \frac{γ}{4 λ_{K} t}) P_{1} (t - 1) + \frac{4 λ_{K} λ_{ξ} d_{*}}{γ t} P_{2} (t - 1) \\ + O (g (t - 1)) + O (\frac{1}{t^{2}}), \\ P_{2} (t) \leq & (1 - \frac{2 β f_{C} - \frac{γ λ_{U} λ_{L}}{α} - 2 d_{*}}{t}) P_{2} (t - 1) \\ + \frac{α k γ}{t} P_{1} (t - 1) + O (g (t - 1)) + O (\frac{1}{t^{2}}), \end{matrix}

(23)

Let

Z (t) = (\begin{matrix} P_{1} (t) \\ P_{2} (t) \end{matrix}), W = (\begin{matrix} w_{1} & w_{2} \\ w_{4} & w_{3} \end{matrix}),

where

w_{1} = \frac{γ}{4 λ_{K}}, w_{2} = \frac{- 4 λ_{K} λ_{ξ} d_{*}}{γ}, w_{3} = 2 β f_{C} - \frac{λ_{U} λ_{L}}{α} - 2 d_{*},

and

w_{4} = - α k γ

. Then, by (23), we can obtain

\begin{matrix} ∥Z (t)∥ \leq & ∥(I - \frac{W}{t}) Z (t - 1) + O (g (t - 1)) + O (\frac{1}{t^{2}})∥ \\ \leq & ∥I - \frac{W}{t}∥ ∥Z (t - 1)∥ + O (g (t - 1)) + O (\frac{1}{t^{2}}), \end{matrix}

Let

α = \frac{4 λ_{K} λ_{ξ} d_{*}}{k γ^{2}}

. Then,

w_{2} = w_{4}

. We utilize the symmetric property of matrix

W

.

∥I - \frac{W}{t}∥ \leq 1 - \frac{λ_{min} (W)}{t}, if t > λ_{max} (W) .

Then, we can obtain

\begin{matrix} ∥Z (t)∥ \leq & (1 - \frac{λ_{min} (W)}{t}) ∥Z (t - 1)∥ + O (g (t - 1)) \\ + O (\frac{1}{t^{2}}) . \end{matrix}

(24)

Then, we can obtain

w_{1} w_{3} = \frac{γ}{4 λ_{K}} (2 β f_{C} - λ_{U} λ_{L} / α - 2 d_{*}) > \frac{16 λ_{K}^{2} λ_{ξ}^{2} d_{*}^{2}}{γ^{2}} = w_{2}^{2},

Hence,

λ_{min} (W) = \frac{w_{1} + w_{3} - \sqrt{{(w_{1} + w_{3})}^{2} - 4 (w_{1} w_{3} - w_{2}^{2})}}{2} > 0,

where

λ_{min} (W)

is the minimum eigenvalue of matrix

W

. Due to

\sum_{t = 1}^{\infty} \frac{1}{t^{2}} < \infty

, we have, by Lemma 4,

∥Z (t)∥ \to 0 .

Thus,

P_{1} (t) \to 0, P_{2} (t) \to 0,

which implies the theorem. □

Theorem 2

(Convergence rate). Let the changing of the leader be

g (t) = \frac{1}{t^{1 + ε}} (0 < ε < 1)

. For the PD-like algorithm with Assumptions 1–3, we can obtain

E {({\hat{x}}_{i j} (t) - x_{j} (t))}^{2} = \{\begin{matrix} O (\frac{1}{t^{λ_{min} (W)}}), & if ε > λ_{min} (W); \\ O (\frac{log t}{t^{ε}}), & if ε = λ_{min} (W); \\ O (\frac{1}{t^{ε}}), & if ε < λ_{min} (W), \end{matrix}

and

E {(x_{i} (t) - x_{n + 1} (t))}^{2} = \{\begin{matrix} O (\frac{1}{t^{λ_{min} (W)}}), & if ε > λ_{min} (W); \\ O (\frac{log t}{t^{ε}}), & if ε = λ_{min} (W); \\ O (\frac{1}{t^{ε}}), & if ε < λ_{min} (W), \end{matrix}

for

j \in N_{i}, i = 1, \dots, n

, where

W = (\begin{matrix} \frac{γ}{4 λ_{K}} & - \frac{4 λ_{K} λ_{ξ} d_{*}}{γ} \\ - \frac{4 λ_{K} λ_{ξ} d_{*}}{γ} & 2 β f_{C} - \frac{γ^{3} k λ_{U} λ_{L}}{4 λ_{K} λ_{ξ} d_{*}} - 2 d_{*} \end{matrix}),

λ_{K}, λ_{ξ}, k, f_{C}, λ_{U}, λ_{L},

and

d_{*}

are the same as those in Lemmas 2 and 3.

Proof of Theorem 2.

The proof of the theorem is similar to that in Theorem 2 of [24]. □

Remark 5.

Theorem 2 indicates that the convergence rate of the PD-like algorithm depends on

λ_{min} (W)

(the communication topology) and ε (the rate of change of the leader).

5. Simulation

The simulation consisted of a multi-agent system with the network topology illustrated in Figure 1. The system included four follower agents and one leader agent, where agent 5 was designated as the leader.

Each follower agent can obtain information from its respective neighbors, with the neighbor sets defined as follows:

N_{1} = {2, 4}, N_{2} = {1, 3}, N_{3} = {2, 5}, N_{4} = {1, 5}

, where

N_{i}

is the set of agent i’s neighbors. In contrast, agent 5, as the leader, does not receive information from any other agents. Laplacian matrix L corresponding to this topology is defined as

L = [\begin{matrix} 2 & - 1 & 0 & - 1 & 0 \\ - 1 & 2 & - 1 & 0 & 0 \\ 0 & - 1 & 2 & 0 & - 1 \\ - 1 & 0 & 0 & 2 & - 1 \\ 0 & 0 & 0 & 0 & 0 \end{matrix}]

.

The Jordan canonical form of Laplacian matrix L is

\tilde{L} = [\begin{matrix} 0.382 & 0 & 0 & 0 \\ 0 & 3.618 & 0 & 0 \\ 0 & 0 & 1.382 & 0 \\ 0 & 0 & 0 & 2.618 \end{matrix}]

. The corresponding transformation matrices are

Ω = [\begin{matrix} 1 & 0.6015 & 0.6015 & - 0.3717 & - 0.3717 \\ 1 & 0.6015 & - 0.6015 & 0.3717 & - 0.3717 \\ 1 & 0.3717 & 0.3717 & 0.6015 & 0.6015 \\ 1 & 0.3717 & - 0.3717 & - 0.6015 & 0.6015 \\ 1 & 0 & 0 & 0 & 0 \end{matrix}]

, and

Ω^{- 1} = [\begin{matrix} 0 & 0 & 0 & 0 & 1 \\ 0.6015 & 0.6015 & 0.3717 & 0.3717 & - 1.9465 \\ 0.6015 & - 0.6015 & 0.3717 & - 0.3717 & 0 \\ - 0.3717 & 0.3717 & 0.6015 & - 0.6015 & 0 \\ - 0.3717 & - 0.3717 & 0.6015 & 0.6015 & - 0.4596 \end{matrix}]

.

Based on

K \tilde{L} + {\tilde{L}}^{T} K = I

, matrix K is obtained as

[\begin{matrix} 1.3089 & 0 & 0 & 0 \\ 0 & 0.1382 & 0 & 0 \\ 0 & 0 & 0.3618 & 0 \\ 0 & 0 & 0 & 0.1910 \end{matrix}]

. Then, based on related definitions of Lemma 2 and Lemma 3, we can calculate

k = \frac{1}{λ_{m i n} (K)} = 7.2359, λ_{U} = λ_{m a x} (U U^{T}) = 2, λ_{L} = λ_{max} (L L^{T}) = 13.0902, λ_{K} = λ_{m a x} (K) = 1.3089,

and

λ_{ξ} = λ_{max} ({(I - J_{n + 1})}^{T} ξ^{T} K ξ (I - J_{n + 1})) = 6.3005

.

Let the initial values for the states of the agents be

x (0) = {[- 6, - 7, 10, 7, 5]}^{T}

, and the initial estimates of the neighbors’ states for each agent be

{[{\hat{x}}_{1} (0), {\hat{x}}_{2} (0), {\hat{x}}_{3} (0), {\hat{x}}_{4} (0), {\hat{x}}_{5} (0)]}^{T} = {[1, - 4, 7, 2, 4]}^{T}

. The threshold is given as

C = 0

, and the state boundary conditions are defined as

W = 8

. The noise follows a normal distribution

N (0, 8^{2})

, and

f_{C}

can be computed as

f_{C} = f (|C| + W) = 0.0302

.

The state of the leader is convergent, and its state update formula is given by

x_{5} (t + 1) = x_{5} (t) + g (t),

where

g (t) = \frac{1}{t^{2}}

, which satisfies

\sum_{t = 1}^{\infty} g (t) < \infty

. In the simulation, the experimental parameters were set as

β = 7000

and

γ = 4

. The symbol l defined in Theorem 2 can be computed as

l = \frac{γ^{3} λ_{U} λ_{L} k}{8 λ_{K} λ_{ξ} d_{*}} + \frac{32 λ_{K}^{3} λ_{ξ}^{2} d_{*}^{2}}{γ^{3}} + d_{*}

= 203.0038. Choosing

β

satisfies

β > \frac{l}{f_{C}} = 6, 721.9801

, and the trajectories of the agent states and their estimates are shown in Figure 2 and Figure 3, respectively. Figure 4 and Figure 5 present the tracking error and estimation error, respectively. The results demonstrate that the followers converge to the leader’s state, confirming the theoretical findings in Theorem 1. Figure 6 shows the relationship between the tracking error and the rate of change of the leader. The faster the leader changes, the slower the tracking error converges, which is consistent with Theorem 2. Figure 7 shows the state–time curves of agents under the P-like algorithm. It can be seen that the convergence to consensus is slower than that of the PD-like algorithm. At the same time, Figure 8 compares the tracking errors of the two algorithms. The red line represents the PD-like algorithm, and the blue line represents the P-like algorithm. It can also be seen that the PD-like algorithm converges faster.

6. Conclusions

This paper presents a consensus tracking algorithm for discrete-time multi-agent systems under binary-valued communication, considering measurement noise and a time-varying reference state. Each agent estimates the states of its neighbors using RPA and designs a controller to update its state. An estimated differential term is introduced into the controller. Subsequently, the agents can converge to the leader’s reference state, and the convergence speed is closely related to the rate of change of the leader’s state and the system parameters.

There is a lot of work that deserves attention in the future. Presently, the academic community is actively engaged in developing safer and more efficient control algorithms. Notable examples include the design of cooperative control protocols for nonlinear multi-agent systems against different attacks [28,29], and the integration of event-triggered mechanisms into multi-agent consensus tracking problems to significantly reduce system resource consumption while maintaining tracking performance—all of which represent cutting-edge research frontiers [30]. Nevertheless, in practical applications (such as the collective behavior of mobile robots [31]), agents often encounter complex non-matching input constraints, nonlinear dynamic characteristics, and unknown time-varying disturbances. Addressing these challenges by designing robust and adaptive control strategies remains a critical hurdle that demands urgent breakthroughs.

Author Contributions

Conceptualization, T.W.; methodology, T.W.; software, Y.W.; validation, X.S.; formal analysis, Y.W.; writing—original draft preparation, Y.W.; writing—review and editing, J.W.; supervision, T.W. All authors have read and agreed to the published version of the manuscript.

Funding

The research was funded by the National Natural Science Foundation of China (Grants No. 62473040 and 62473042).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

Notation	Meaning
$R$	The set of real numbers
$∥ \cdot ∥$	The Euclidean norm
$x_{i} (t)$	The state of the i-th agent at time t
$x_{n + 1} (t)$	The state of leader with time variation
$g (t)$	The leader’s state change rate
$u_{i} (t)$	The control input of the i-th agent at time t
${\hat{x}}_{i j} (t)$	The estimated value of the state of neighbor j at time t by agent i
$ϵ (t)$	The error vectors
$β$	The coefficient in the estimation step size
$γ$	The constant in the control law
$λ_{U}$	The eigenvalues of matrix U

References

Wang, X.; Li, Y.; Huang, L.; Huang, X.; Zhao, H.; Han, X.; Xiang, J.; Wang, H. Cooperative Control for Connected Automated Vehicle Platoon with C-V2X PC5 Interface. In Proceedings of the 2023 IEEE International Conference on Unmanned Systems (ICUS), Hefei, China, 13–15 October 2023; pp. 635–640. [Google Scholar]
Yang, H.; Jiang, B.; Yang, H. Fault Tolerant Cooperative Control for Heterogeneous Multiple UAVs. In Proceedings of the 2018 IEEE CSAA Guidance, Navigation and Control Conference (CGNCC), Xiamen, China, 10–12 August 2018; pp. 1–6. [Google Scholar]
Bidram, A.; Lewis, F.L.; Davoudi, A.; Qu, Z. Frequency control of electric power microgrids using distributed cooperative control of multi-agent systems. In Proceedings of the 2013 IEEE International Conference on Cyber Technology in Automation, Control and Intelligent Systems, Nanjing, China, 26–29 May 2013; pp. 223–228. [Google Scholar]
Jadbabaie, A.; Lin, J.; Morse, A. Coordination of groups of mobile autonomous agents using nearest neighbor rules. IEEE Trans. Autom. Control 2003, 48, 988–1001. [Google Scholar] [CrossRef]
Vicsek, T.; Czirók, A.; Ben-Jacob, E.; Cohen, I.; Shochet, O. Novel Type of Phase Transition in a System of Self-Driven Particles. Phys. Rev. Lett. 1995, 75, 1226–1229. [Google Scholar] [CrossRef] [PubMed]
Olfati-Saber, R.; Murray, R. Consensus problems in networks of agents with switching topology and time-delays. IEEE Trans. Autom. Control 2004, 49, 1520–1533. [Google Scholar] [CrossRef]
Ren, W.; Beard, R.W. Consensus seeking in multiagent systems under dynamically changing interaction topologies. IEEE Trans. Autom. Control 2005, 50, 655–661. [Google Scholar] [CrossRef]
Guan, Z.H.; Sun, F.L.; Wang, Y.W.; Li, T. Finite-Time Consensus for Leader-Following Second-Order Multi-Agent Networks. IEEE Trans. Circuits Syst. I Regul. Pap. 2012, 59, 2646–2654. [Google Scholar] [CrossRef]
Hajshirmohamadi, S.; Sheikholeslam, F.; Meskin, N.; Ghommam, J. Observer-Based Leader-Following Consensus for Linear Multiagent Systems with a Leader of Unknown Input. IEEE Syst. J. 2021, 15, 95–104. [Google Scholar] [CrossRef]
Hong, Y.; Hu, J.; Gao, L. Tracking control for multi-agent consensus with an active leader and variable topology. Automatica 2006, 42, 1177–1182. [Google Scholar] [CrossRef]
Huang, M.; Manton, J.H. Stochastic Lyapunov Analysis for Consensus Algorithms with Noisy Measurements. In Proceedings of the 2007 American Control Conference, New York, NY, USA, 9–13 July 2007; pp. 1419–1424. [Google Scholar]
Hu, J.; Feng, G. Distributed tracking control of leader–follower multi-agent systems under noisy measurement. Automatica 2010, 46, 1382–1387. [Google Scholar] [CrossRef]
Kashyap, A.; Başar, T.; Srikant, R. Quantized consensus. Automatica 2007, 43, 1192–1203. [Google Scholar] [CrossRef]
Ma, J.; Chen, Z.; Ji, H. Distributed output consensus of heterogeneous linear multi-agent systems with dynamic quantization. Int. J. Robust Nonlinear Control 2024, 34, 10251–10275. [Google Scholar] [CrossRef]
Zhu, S.; Chen, B. Quantized Consensus by the ADMM: Probabilistic Versus Deterministic Quantizers. IEEE Trans. Signal Process. 2016, 64, 1700–1713. [Google Scholar] [CrossRef]
Ma, J.; Ji, H.; Sun, D.; Feng, G. An approach to quantized consensus of continuous-time linear multi-agent systems. Automatica 2018, 91, 98–104. [Google Scholar] [CrossRef]
Liu, H.; Cao, M.; De Persis, C. Quantization effects on synchronized motion of teams of mobile agents with second-order dynamics. Syst. Control Lett. 2012, 61, 1157–1167. [Google Scholar] [CrossRef]
Guo, J.; Zhang, Q.; Zhao, Y. Identification of FIR Systems with binary-valued observations under replay attacks. Automatica 2025, 172, 112001. [Google Scholar] [CrossRef]
Guo, J.; Wang, X.; Xue, W.; Zhao, Y. System Identification With Binary-Valued Observations Under Data Tampering Attacks. IEEE Trans. Autom. Control 2021, 66, 3825–3832. [Google Scholar] [CrossRef]
Zhao, Y.; Wang, T.; Bi, W. Consensus Protocol for Multiagent Systems With Undirected Topologies and Binary-Valued Communications. IEEE Transactions on Automatic Control 2019, 64, 206–221. [Google Scholar] [CrossRef]
Guo, J.; Zhao, Y. Recursive projection algorithm on FIR system identification with binary-valued observations. Automatica 2013, 49, 3396–3401. [Google Scholar] [CrossRef]
Wang, T.; Zhang, H.; Zhao, Y. Consensus of Multi-Agent Systems Under Binary-Valued Measurements and Recursive Projection Algorithm. IEEE Trans. Autom. Control 2020, 65, 2678–2685. [Google Scholar] [CrossRef]
Qiu, Z.; Ren, Z.; Wang, T. Consensus Tracking Algorithm for Multi-Agent Systems with Binary-Valued Measurements. In Proceedings of the 2022 China Automation Congress (CAC), Xiamen, China, 25–27 November 2022; pp. 4030–4035. [Google Scholar]
Wang, T.; Qiu, Z.; Lu, X.; Zhao, Y. Consensus Tracking Control of Multi-agent Systems with A Time-varying Reference State under Binary-valued Communication. arXiv 2025, arXiv:2503.15955. [Google Scholar]
Ren, W. Multi-vehicle consensus with a time-varying reference state. Syst. Control Lett. 2007, 56, 474–483. [Google Scholar] [CrossRef]
Cao, Y.; Ren, W.; Li, Y. Distributed discrete-time coordinated tracking with a time-varying reference state and limited communication. Automatica 2009, 45, 1299–1305. [Google Scholar] [CrossRef]
Guo, L. Time-Varying Stochastic Systems: Stability, Estimation and Control; Jilin Science and Technology Press: Changchun, China, 1993. [Google Scholar]
Liu, G.; Sun, Q.; Su, H.; Wang, M. Adaptive Cooperative Fault-Tolerant Control for Output-Constrained Nonlinear Multi-Agent Systems Under Stochastic FDI Attacks. IEEE Trans. Circuits Syst. I Regul. Pap. 2025, 1–12. [Google Scholar] [CrossRef]
Liu, G.; Sun, Q.; Su, H.; Hu, Z. Adaptive Tracking Control for Uncertain Nonlinear Multi-Agent Systems With Partially Sensor Attack. IEEE Trans. Autom. Sci. Eng. 2025, 22, 6270–6279. [Google Scholar] [CrossRef]
Lu, X.; Wang, T.; Zhao, Y.; Zhang, J.F. Consensus of multi-agent systems under binary-valued measurements: An event-triggered coordination approach. Automatica 2025, 176, 112255. [Google Scholar] [CrossRef]
Ning, B.; Han, Q.L.; Zuo, Z.; Jin, J.; Zheng, J. Collective Behaviors of Mobile Robots Beyond the Nearest Neighbor Rules With Switching Topology. IEEE Trans. Cybern. 2018, 48, 1577–1590. [Google Scholar] [CrossRef]

Figure 1. Network topology of multi-agent system.

Figure 2. The state-time curves of agents under the PD-like algorithm.

Figure 3. The estimation-time curves of agents under the PD-like algorithm.

Figure 4. The tracking error curve between the followers and the leader under the PD-like algorithm.

Figure 5. The estimation error curve between the agents’ estimated values and the true states under the PD-like algorithm.

Figure 6. Tracking error curves

g (t) = \frac{1}{t^{6 / 5}}

and

g (t) = \frac{1}{t^{2}}

under the PD-like algorithm.

Figure 6. Tracking error curves

g (t) = \frac{1}{t^{6 / 5}}

and

g (t) = \frac{1}{t^{2}}

under the PD-like algorithm.

Figure 7. The state-time curves of agents under the P-like algorithm.

Figure 8. Tracking error comparison of P-like and PD-like algorithms.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wu, Y.; Sun, X.; Wang, T.; Wang, J. PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication. Actuators 2025, 14, 267. https://doi.org/10.3390/act14060267

AMA Style

Wu Y, Sun X, Wang T, Wang J. PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication. Actuators. 2025; 14(6):267. https://doi.org/10.3390/act14060267

Chicago/Turabian Style

Wu, Yuqi, Xu Sun, Ting Wang, and Jie Wang. 2025. "PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication" Actuators 14, no. 6: 267. https://doi.org/10.3390/act14060267

APA Style

Wu, Y., Sun, X., Wang, T., & Wang, J. (2025). PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication. Actuators, 14(6), 267. https://doi.org/10.3390/act14060267

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

PD-like Consensus Tracking Algorithm for Discrete Multi-Agent Systems with Time-Varying Reference State Under Binary-Valued Communication

Abstract

1. Introduction

2. Problem Formulation

3. Control Algorithm

4. Main Result

5. Simulation

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Nomenclature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI