Article

Hidden Markov Model-Based Control for Cooperative Output Regulation of Heterogeneous Multi-Agent Systems under Switching Network Topology

Department of Electrical, Electronic and Computer Engineering, University of Ulsan, Daehak-ro 93, Nam-Gu, Ulsan 680-749, Republic of Korea
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(16), 3481; https://doi.org/10.3390/math11163481
Submission received: 8 July 2023 / Revised: 7 August 2023 / Accepted: 9 August 2023 / Published: 11 August 2023
(This article belongs to the Special Issue Control Theory and Applications)

Abstract
This paper investigates the problem of stochastically cooperative output regulation of heterogeneous multi-agent systems (MASs) subject to hidden Markov jumps using observer-based distributed control. In order to address a more realistic situation than prior studies, this paper focuses on the following issues: (1) asynchronous phenomena in the system mode’s transmission to the controller; (2) the impact of system mode switching on network topology; and (3) the emergence of coupled terms between the mode-dependent Lyapunov matrix and the control gain in control design conditions. Specifically, to reduce the complexity arising from the asynchronous controller-side mode, the leader–state observer is developed so that the solution pair of regulator equations can be integrated into the observer. Furthermore, a linear decoupling method is proposed to handle the emergence of the aforementioned coupled terms; this provides sufficient LMI conditions to achieve stochastically cooperative output regulation for heterogeneous MASs. Finally, the validity of the proposed method is shown through two illustrative examples.

1. Introduction

Multi-agent systems (MASs) refer to complex systems composed of multiple autonomous agents that interact with each other and with their environment; they have been used in various research fields, including robotics [1,2], automated vehicles [3,4], unmanned autonomous vehicles [5,6,7,8], and urban networks [9,10]. Recently, with the advent of these systems, effective techniques for cooperating MASs with different structures and parameters (referred to as heterogeneous MASs) have been rapidly developed for various purposes, including leader following and formation.
Over the past few years, the cooperative output regulation problem has also been regarded as one of the most fundamental consensus problems for MASs. In this problem, one essential requirement is to develop a control strategy that ensures stable and efficient cooperation between agents while achieving the desired overall performance. Additionally, the control strategy should be able to achieve appropriate behaviors with MASs by explicitly considering the interconnections and interactions between multiple agents. Following this, various methods have been proposed to deal with the cooperative output regulation problem of heterogeneous MASs on the premise of a fixed network topology and system parameters (see [11,12,13,14,15] and references therein). However, the network topology and system parameters can randomly change due to obstacles posed by network sizes, functional connectivity disturbances, limited communication ranges, and random packet losses.
As a mathematical model for handling the aforementioned random changes, Markov jump multi-agent systems have been widely utilized in many control problems, such as leader–following consensus control [16,17], scaled consensus control [18], formation control [19], and cooperative output regulation [20,21,22,23]. The above studies mainly focused on control problems for MASs with deterministic dynamics (with no sudden changes), where the Markov process was only used to model sudden changes in the network topology, or vice versa. However, little effort has been devoted to research on realistic cases in which rapid changes in the system modes of MASs affect the network topology. A more serious problem is that, due to network issues such as packet dropout and data transmission delay, the controller modes cannot be designed in accurate synchronization with the system or network topology modes. Thus, it is necessary to carefully consider the impact of this asynchronous problem when designing a controller that operates in such an environment. Motivated by this need, the authors of [24] used a hidden Markov model (HMM) to deal with the leader–following consensus problem for MASs with asynchronous control modes. However, in [24], random changes in the network topology are modeled as a Markov process, while the system parameters of the MASs are assumed to be deterministic. Hence, to overcome these weaknesses, more progress needs to be made toward addressing the impact of changes in both the system parameters and the network topology while achieving HMM-based cooperative output regulation for continuous-time heterogeneous MASs.
Based on the above discussion, the main goal of this paper is to address the problem of stochastically cooperative output regulation for continuous-time heterogeneous MASs with hidden Markov jumps in the system mode and network topology. First, a mode-dependent leader–state observer is designed to transmit an estimated leader–state to each follower agent. After that, an asynchronous mode-dependent distributed controller is designed so that it can ensure the stochastically cooperative output regulation of MASs. To be specific, the main contributions of this paper can be summarized as follows.
  • This paper makes a first attempt to reflect the influence of the asynchronous mode between heterogeneous MASs and observer-based distributed controllers while achieving stochastically cooperative output regulation subject to Markov jumps. Different from [22,23,25,26], the realistic case where rapid changes in the system modes of MASs affect the network topology is considered in the control design processes.
  • This paper proposes a method to design a continuous-time leader–state observer capable of estimating the leader–state value for each agent under abrupt changes in both systems and network topology. Also, it introduces an alternative mechanism by integrating system-mode-dependent solutions of regulator equations into the output of the leader–state observer to reduce the complexity arising from the asynchronous controller-side mode.
  • In the control design process, the asynchronous mode-dependent control gain is coupled with the system-mode-dependent Lyapunov matrix, which makes it difficult to directly use the well-known variable replacement technique [27]. For this reason, this paper suggests a suitable linear decoupling method that is capable of handling the aforementioned coupling problem.
The rest of the paper is organized as follows. Section 2 presents a class of heterogeneous MASs with hidden Markov jumps under our consideration. Next, Section 3 presents methods for designing a mode-dependent leader–state observer and asynchronous mode-dependent distributed controllers for MASs. In Section 4, two illustrative examples are provided to demonstrate the validity of the proposed method. Finally, the concluding remarks are given in Section 5.
Notations: $\mathrm{spec}(A)$ denotes the eigenvalue set of matrix $A$. In symmetric block matrices, $(\ast)$ is used as an ellipsis for terms induced by symmetry. $\mathrm{diag}(\cdot)$ stands for a block-diagonal matrix; $\mathrm{col}(v_1, v_2, \ldots, v_n) = [v_1^T\ v_2^T\ \cdots\ v_n^T]^T$ for scalars or vectors $v_i$; $\mathrm{He}\{Q\} = Q + Q^T$ for any square matrix $Q$; $\otimes$ denotes the Kronecker product; $I_n$ denotes the $n$-dimensional identity matrix; $\|x\|_2$ denotes the Euclidean norm of vector $x$; $\lambda_{\max}(A)$ denotes the maximum eigenvalue of matrix $A$; $\|A\|_2^2 = \lambda_{\max}(A^TA)$ denotes the squared induced 2-norm of matrix $A$; $\mathrm{Re}(\lambda)$ denotes the real part of $\lambda$; $\mathbb{N}_n$ denotes the set $\{1, 2, \ldots, n\}$; $\mathbb{E}\{\cdot\}$ denotes the mathematical expectation; $T$ and $-1$ represent matrix transposition and inversion, respectively. The triplet $(\Omega, \mathcal{F}, \Pr)$ denotes a probability space, where $\Omega$, $\mathcal{F}$, and $\Pr$ represent the sample space, the $\sigma$-algebra of events, and the probability measure defined on $\mathcal{F}$, respectively.

2. Preliminaries and Problem Statement

2.1. Heterogeneous Multi-Agent System Description

Let us consider the following continuous-time Markov jump dynamics of the ith follower and the leader (or exogenous system), defined on a complete probability space ( Ω , F , Pr ) :
$$\begin{aligned}
\dot{x}_i(t) &= A_i(\phi(t))x_i(t) + B_i(\phi(t))u_i(t) + D_i(\phi(t))x_0(t), \\
y_i(t) &= C_i(\phi(t))x_i(t), \quad \text{for } i \in \mathbb{N}_N, \\
\dot{x}_0(t) &= A_0(\phi(t))x_0(t), \\
y_0(t) &= C_0(\phi(t))x_0(t),
\end{aligned} \tag{1}$$
where $x_i(t) \in \mathbb{R}^n$, $u_i(t) \in \mathbb{R}^m$, and $y_i(t) \in \mathbb{R}^p$ denote the state, the control input, and the output of the $i$th follower, respectively; $x_0(t) \in \mathbb{R}^l$ and $y_0(t) \in \mathbb{R}^p$ are the state and the output of the leader, respectively; $\mathbb{N}_N = \{1, 2, \ldots, N\}$ is the set of agents; $\phi(t) \in \mathbb{N}_\alpha$ denotes the time-varying switching system mode, with $\mathbb{N}_\alpha$ denoting the set of system modes; and $N$ is the number of follower agents. In (1), $A_i(\phi(t)=g) = A_{ig}$, $B_i(\phi(t)=g) = B_{ig}$, $C_i(\phi(t)=g) = C_{ig}$, $D_i(\phi(t)=g) = D_{ig}$, $A_0(\phi(t)=g) = A_{0g}$, and $C_0(\phi(t)=g) = C_{0g}$ are known system matrices with appropriate dimensions, where the pairs $(A_{ig}, B_{ig})$ and $(C_{ig}, A_{ig})$ are assumed to be controllable and detectable, respectively. In addition, the process $\{\phi(t) \in \mathbb{N}_\alpha,\ t \geq 0\}$ is characterized by a continuous-time homogeneous Markov process with the following transition probabilities (TPs):
$$\Pr\{\phi(t+\delta)=h \,|\, \phi(t)=g\} = \begin{cases} \pi_{gh}\delta + o(\delta), & h \neq g, \\ 1 + \pi_{gg}\delta + o(\delta), & h = g, \end{cases} \tag{2}$$
where $o(\delta)$ is the little-$o$ notation defined by $\lim_{\delta\to 0} o(\delta)/\delta = 0$; and $\pi_{gh}$ indicates the transition rate (TR) from mode $g$ at time $t$ to mode $h$ at time $t+\delta$, satisfying $\sum_{h\in\mathbb{N}_\alpha}\pi_{gh} = 0$ and $\pi_{gh} \geq 0$ for $h \in \mathbb{N}_\alpha\setminus\{g\}$.
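For illustration, a trajectory of the switching signal governed by TPs of this form can be generated by drawing exponentially distributed sojourn times from the transition rates. The following is a minimal pure-Python sketch; the 2-mode rate matrix is hypothetical and not taken from the paper's examples.

```python
import random

def simulate_ctmc(Q, mode0, t_end, rng):
    """Simulate a continuous-time Markov chain with transition-rate
    matrix Q (rows sum to zero, off-diagonal entries >= 0).
    Returns a list of (switch_time, mode) pairs starting at t = 0."""
    t, mode = 0.0, mode0
    path = [(0.0, mode)]
    while True:
        rate = -Q[mode][mode]           # total exit rate of the current mode
        if rate <= 0.0:                  # absorbing mode: no further switches
            break
        t += rng.expovariate(rate)       # exponential sojourn time
        if t >= t_end:
            break
        # choose the next mode proportionally to the exit rates
        r = rng.random() * rate
        for h, q in enumerate(Q[mode]):
            if h == mode:
                continue
            r -= q
            if r <= 0.0:
                mode = h
                break
        path.append((t, mode))
    return path

# Hypothetical 2-mode rate matrix (illustrative values only)
Q = [[-0.5, 0.5],
     [0.7, -0.7]]
path = simulate_ctmc(Q, 0, 50.0, random.Random(1))
print(path[:3])
```

The sojourn time in mode $g$ is exponential with rate $-\pi_{gg}$, which is exactly what the first-order expansion in the TPs encodes.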
Remark 1. 
Different from other studies [22,23,25,26,28], this paper deals with the case where changes in system parameters (affected by the life span of system components, the increase in heat of motors, wear and tear, etc.) directly lead to signal loss or degradation in the transmission quality between agents, inducing changes in the network topology. In addition, external influences could also impact the system parameters and communication network of agents at the same time. Therefore, in this paper, the network topology mode is set to be the same as the system mode ϕ ( t ) .
To be specific, Figure 1 shows a block diagram of the cooperative output regulation of multi-agent systems under our consideration, which contains four main parts: the multi-agent system ($v_i$), the network analyzer, the leader–state observer, and the distributed controller ($c_i$). Functionally, the network analyzer finds the current network topology mode using data transmitted over a wireless local area network (black dashed lines), and the mode information is transmitted to the observer directly and to the distributed controller over a wireless wide area network (blue dashed lines). As in [23,29], the leader–state observer provides the estimated leader–state $\hat{x}_{0i}$ for the $i$th agent to overcome the difficulty that some followers cannot receive information directly from the leader. Next, based on $x_i$, $\Pi_{ig}\hat{x}_{0i}$, and $\Lambda_{ig}\hat{x}_{0i}$, the controller $c_i$ generates the control input $u_i$ and transmits it to the $i$th agent so that the output $y_i$ approaches $y_0$ as time increases. In a real environment, since the transmission of the network topology mode can be affected by various network issues arising from a wireless wide area network, such as network-induced delay, packet dropout, network congestion, and interference, this paper employs an additional asynchronous observation mode $\rho(t) \in \mathbb{N}_\beta$. In other words, this paper considers a hidden Markov model (HMM) concept with $\phi(t)$ and $\rho(t)$ when designing the distributed controller, where the asynchronism between the two is described by the conditional probability $\Pr(\rho(t)=s \,|\, \phi(t)=g) = \varpi_{gs}$, which satisfies $\sum_{s\in\mathbb{N}_\beta}\varpi_{gs} = 1$ and $\varpi_{gs} \geq 0$ for $g \in \mathbb{N}_\alpha$ and $s \in \mathbb{N}_\beta$.
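The asynchronous observation can be mimicked by sampling the controller-side mode from the conditional distribution $[\varpi_{gs}]$ each time the controller queries the mode. The sketch below uses a hypothetical conditional probability matrix, not one from the paper's examples.

```python
import random

def observe_mode(sys_mode, W, rng):
    """Sample the controller-side mode rho given the system mode phi,
    using a conditional probability matrix W whose rows sum to 1."""
    r = rng.random()
    acc = 0.0
    for s, p in enumerate(W[sys_mode]):
        acc += p
        if r < acc:
            return s
    return len(W[sys_mode]) - 1  # guard against floating-point rounding

# Hypothetical conditional probability matrix [varpi_gs]
W = [[0.7, 0.3],
     [0.5, 0.5]]
rng = random.Random(0)
samples = [observe_mode(0, W, rng) for _ in range(10000)]
freq = samples.count(0) / len(samples)
print(freq)
```

With the identity matrix in place of `W`, the sampler reduces to the synchronous special case of Remark 2, since the observed mode then always equals the system mode.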
Remark 2. 
If the network topology mode observed by the network analyzer is not affected by network issues in the transmission process, there will be no asynchronous phenomenon between $\phi(t)$ and $\rho(t)$, i.e., $\rho(t) = \phi(t)$ and $\mathbb{N}_\beta = \mathbb{N}_\alpha$. To cover this special case, the conditional probability matrix $[\varpi_{gs}]_{g\in\mathbb{N}_\alpha,\,s\in\mathbb{N}_\alpha} \in \mathbb{R}^{\alpha\times\alpha}$ can be set to the identity matrix $I \in \mathbb{R}^{\alpha\times\alpha}$.
Assumption A1. 
For $\phi(t) = g$: $\mathrm{Re}(\lambda_{0g}) \geq 0$ for all $\lambda_{0g} \in \mathrm{spec}(A_{0g})$.
Assumption A2. 
For $\phi(t) = g$, there are pairs of solutions $(\Lambda_{ig}, \Pi_{ig})$ for the following regulator equations:
$$0 = A_{ig}\Lambda_{ig} + B_{ig}\Pi_{ig} + D_{ig} - \Lambda_{ig}A_{0g}, \tag{3}$$
$$0 = C_{ig}\Lambda_{ig} - C_{0g}, \quad i \in \mathbb{N}_N, \tag{4}$$
where $\Lambda_{ig} \in \mathbb{R}^{n\times l}$ and $\Pi_{ig} \in \mathbb{R}^{m\times l}$.
Remark 3. 
Assumption 1 is made only for convenience and loses no generality. In fact, if the linear output regulation problem is solvable by any controller under Assumption 1, it is also solvable by the same controller even if Assumption 1 is violated. More explanations can be found in [30]. Assumption 2 provides the well-known regulator equations [23,31] whose solutions impose a necessary condition for the control design process. Furthermore, the feasibility of Assumption 2 is guaranteed according to Remark 4.
Remark 4 
([30]). For $\phi(t) = g$, the solution pair $(\Lambda_{ig}, \Pi_{ig})$ of Assumption 2 exists if and only if all eigenvalues $\lambda \in \mathrm{spec}(A_{0g})$ satisfy
$$\mathrm{rank}\begin{bmatrix} A_{ig} - \lambda I & B_{ig} \\ C_{ig} & 0 \end{bmatrix} = n + p, \quad i \in \mathbb{N}_N. \tag{5}$$
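Since the regulator equations of Assumption 2 are linear in the unknowns $(\Lambda_{ig}, \Pi_{ig})$, they can be solved by vectorization. The sketch below assumes $p = m$ (so the resulting linear system is square, consistent with the rank condition of Remark 4) and uses a hypothetical single-state follower tracking a double-integrator-type leader; none of these matrices come from the paper's examples.

```python
def solve_linear(M, b):
    """Solve M x = b by Gaussian elimination with partial pivoting."""
    n = len(M)
    A = [row[:] + [bi] for row, bi in zip(M, b)]
    for k in range(n):
        p = max(range(k, n), key=lambda r: abs(A[r][k]))
        A[k], A[p] = A[p], A[k]
        for r in range(k + 1, n):
            f = A[r][k] / A[k][k]
            for c in range(k, n + 1):
                A[r][c] -= f * A[k][c]
    x = [0.0] * n
    for k in range(n - 1, -1, -1):
        x[k] = (A[k][n] - sum(A[k][c] * x[c] for c in range(k + 1, n))) / A[k][k]
    return x

def solve_regulator(A, B, C, D, A0, C0):
    """Solve 0 = A Lam + B Pi + D - Lam A0 and 0 = C Lam - C0 for
    (Lam, Pi) by stacking the unknown entries into one vector.
    Assumes p == m so that the linear system is square."""
    n, m, l, p = len(A), len(B[0]), len(A0), len(C)
    nu = (n + m) * l
    li = lambda r, c: r * l + c            # position of Lam[r][c]
    pi = lambda r, c: n * l + r * l + c    # position of Pi[r][c]
    rows, rhs = [], []
    for r in range(n):                     # entries of A Lam + B Pi - Lam A0 = -D
        for c in range(l):
            row = [0.0] * nu
            for k in range(n):
                row[li(k, c)] += A[r][k]
            for k in range(m):
                row[pi(k, c)] += B[r][k]
            for k in range(l):
                row[li(r, k)] -= A0[k][c]
            rows.append(row)
            rhs.append(-D[r][c])
    for r in range(p):                     # entries of C Lam = C0
        for c in range(l):
            row = [0.0] * nu
            for k in range(n):
                row[li(k, c)] += C[r][k]
            rows.append(row)
            rhs.append(C0[r][c])
    z = solve_linear(rows, rhs)
    Lam = [[z[li(r, c)] for c in range(l)] for r in range(n)]
    Pi = [[z[pi(r, c)] for c in range(l)] for r in range(m)]
    return Lam, Pi

# Hypothetical data: scalar follower (n = m = p = 1), 2-state leader (l = 2)
A, B, C = [[-1.0]], [[1.0]], [[1.0]]
D = [[0.0, 0.0]]
A0, C0 = [[0.0, 1.0], [0.0, 0.0]], [[1.0, 0.0]]
Lam, Pi = solve_regulator(A, B, C, D, A0, C0)
print(Lam, Pi)
```

For this hypothetical instance, the output regulation equations force $\Lambda = [1\ 0]$ (so that $C\Lambda = C_0$) and then $\Pi = [1\ 1]$.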

2.2. Communication Topology

As mentioned in Remark 1 and Figure 1, the network topology of (1) with $\phi(t) = g \in \mathbb{N}_\alpha$ is represented as a mode-dependent weighted and directed graph (digraph) $\mathcal{G}_g = (\mathcal{V}, \mathcal{E}_g, \mathcal{A}_g)$ formed by the $N$ follower agents. Here, $\mathcal{V} = \{v_1, v_2, \ldots, v_N\}$ denotes the node set; $\mathcal{E}_g \subseteq \{(v_j, v_i) \,|\, v_i, v_j \in \mathcal{V},\ j \neq i\}$ denotes the edge set, where the ordered pair $(v_j, v_i)$ indicates information flow leaving agent $v_j$ toward agent $v_i$ at time $t$; and $\mathcal{A}_g = [a_{ij,g}]_{i,j\in\{1,2,\ldots,N\}}$ denotes the adjacency matrix with $a_{ij,g} > 0$ if and only if $(v_j, v_i) \in \mathcal{E}_g$, and $a_{ij,g} = 0$ otherwise. In addition, the neighbor set and degree of $v_i \in \mathcal{V}$ are defined as $\mathcal{N}_{i,g} = \{v_j \,|\, (v_j, v_i) \in \mathcal{E}_g\}$ and $d_{i,g} = \sum_{j=1}^N a_{ij,g}$, respectively, and the Laplacian matrix of $\mathcal{G}_g$ is given by $L_g = D_g - \mathcal{A}_g \in \mathbb{R}^{N\times N}$, where $D_g = \mathrm{diag}(d_{1,g}, \ldots, d_{N,g})$. Furthermore, a directed path leaving node $v_j$ toward node $v_i$ is a sequence of ordered edges [24]. In particular, the multi-agent system under our consideration consists of $N$ follower agents and one leader $v_0$ with a spanning tree from the leader to each agent in the network topology. Thus, to represent such a system, this paper considers an extended graph $\mathcal{G}_{0,g} = (\mathcal{V}_0, \mathcal{E}_{0,g}, M_g)$ with $\mathcal{V}_0 = \mathcal{V} \cup \{v_0\}$, $\mathcal{E}_{0,g} \subseteq \mathcal{E}_g \cup \{(v_0, v_i) \,|\, v_i \in \mathcal{V}\}$, and $M_g = \mathrm{diag}(m_{1,g}, \ldots, m_{N,g}) \in \mathbb{R}^{N\times N}$, where $M_g$ denotes the leader adjacency matrix with $m_{i,g} > 0$ if and only if the leader $v_0$ transmits information to $v_i$, and $m_{i,g} = 0$ otherwise. That is, the union of graphs is given by $\mathcal{G}_0 := \bigcup_{g=1}^{\alpha}\mathcal{G}_{0,g}$, and its node set is equal to $\mathcal{V}_0$.
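The Laplacian $L_g = D_g - \mathcal{A}_g$ and the leader adjacency matrix $M_g$ for one topology mode can be assembled directly from the adjacency data, as the sketch below shows for a hypothetical 3-agent mode (not one of the paper's example topologies):

```python
def laplacian(A):
    """L = D - A for a weighted digraph adjacency matrix A (a_ij: j -> i)."""
    n = len(A)
    return [[sum(A[i]) - A[i][j] if i == j else -A[i][j]
             for j in range(n)] for i in range(n)]

# Hypothetical mode-g adjacency: a_ij > 0 iff agent j sends to agent i
A_g = [[0.0, 1.0, 0.0],
       [0.0, 0.0, 2.0],
       [1.0, 0.0, 0.0]]
# Leader adjacency: only agent 1 (index 0) hears the leader in this mode
M_g = [[1.0 if i == j == 0 else 0.0 for j in range(3)] for i in range(3)]

L_g = laplacian(A_g)
print(L_g)   # every row of a Laplacian sums to zero
```

The zero row sums of $L_g$ are what make the observer error dynamics below depend only on relative quantities between neighboring agents.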
Remark 5. 
By considering the bidirectional information link between agents, this paper ensures that all agents have the opportunity to communicate with each other, depending on the network topology modes.
The following definitions and lemma will be adopted in this paper.
Definition 1 
([32,33]). Let us consider a Markov jump linear system with state χ ( t ) . For any initial conditions, χ ( 0 ) , ϕ ( 0 ) , and ρ ( 0 ) , if the state χ ( t ) satisfies
$$\lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t \|\chi(\tau)\|_2^2\,d\tau \,\Big|\, \chi(0), \phi(0), \rho(0)\Big\} < \infty, \tag{6}$$
then the Markov jump linear system is stochastically stable.
Definition 2 
([20,23,34]). The heterogeneous MAS (1) is said to achieve the stochastically cooperative output regulation if it holds that
  • System (1) is stochastically stable when  x 0 ( t ) = 0 ,
  • For any initial conditions,  x i ( 0 ) and  x 0 ( 0 ) ,
    $$\lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t \|e_i(\tau)\|_2^2\,d\tau \,\Big|\, e_i(0), \phi(0), \rho(0)\Big\} < \infty, \tag{7}$$
     where e i ( t ) = y i ( t ) y 0 ( t ) represents the error between the output of the ith agent and the output of the leader.
Lemma 1 
([35]). For any matrix U > 0 , and matrices X and Y of compatible dimensions, the following inequality holds:
$$\mathrm{He}\{X^TY\} \leq X^TUX + Y^TU^{-1}Y.$$
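In the scalar case, Lemma 1 reduces to Young's inequality $2xy \leq ux^2 + u^{-1}y^2$ for $u > 0$, since the gap is the perfect square $(\sqrt{u}\,x - y/\sqrt{u})^2$. The quick check below uses illustrative values only:

```python
def young_gap(x, y, u):
    """Gap u*x^2 + y^2/u - 2*x*y = (sqrt(u)*x - y/sqrt(u))^2 >= 0."""
    return u * x * x + y * y / u - 2 * x * y

vals = [(-2.0, 3.0, 0.5), (1.5, 1.5, 1.0), (4.0, -0.3, 2.0)]
gaps = [young_gap(x, y, u) for x, y, u in vals]
print(gaps)
assert all(g >= 0.0 for g in gaps)
```

The middle test point $(1.5, 1.5, 1.0)$ attains equality, mirroring the tightness of the matrix bound when $UX = Y$.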

3. Main Results

As in [29], this paper first designs a mode-dependent leader–state observer that provides an estimated leader–state for each agent. Following that, this paper designs a distributed controller that achieves cooperative output regulation for heterogeneous multi-agent systems (1).

3.1. Leader–State Observer Design

As depicted in Figure 1, the leader–state observer directly receives g and x 0 from the network analyzer and the leader. Thus, the mode-dependent observer for the ith agent can be established as follows: for ϕ ( t ) = g N α ,
$$\dot{\hat{x}}_{0i}(t) = A_{0g}\hat{x}_{0i}(t) + F_g\Big(\sum_{j=1}^{N}a_{ij,g}\big(\hat{x}_{0i}(t)-\hat{x}_{0j}(t)\big) + m_{i,g}\big(\hat{x}_{0i}(t)-x_0(t)\big)\Big), \tag{8}$$
where x ^ 0 i ( t ) R l is the estimated leader–state for the ith agent; and F g R l × l is the observer gain. Thus, based on the ith observation error state e ^ 0 i ( t ) = x ^ 0 i ( t ) x 0 ( t ) , it is obtained that
$$\dot{\hat{e}}_{0i}(t) = A_{0g}\hat{e}_{0i}(t) + F_g\Big(\sum_{j=1}^{N}a_{ij,g}\big(\hat{e}_{0i}(t)-\hat{e}_{0j}(t)\big) + m_{i,g}\hat{e}_{0i}(t)\Big). \tag{9}$$
That is, the resultant error system is represented as follows:
$$\dot{\hat{e}}_0(t) = \big(I_N\otimes A_{0g}\big)\hat{e}_0(t) + \big((L_g+M_g)\otimes F_g\big)\hat{e}_0(t) = \big(I_N\otimes A_{0g} + (L_g+M_g)\otimes F_g\big)\hat{e}_0(t), \tag{10}$$
where $\hat{e}_0(t) = \mathrm{col}\big(\hat{e}_{01}(t), \hat{e}_{02}(t), \ldots, \hat{e}_{0N}(t)\big)$.
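The Kronecker-product stacking above can be checked numerically: collecting the per-agent error dynamics reproduces $(I_N\otimes A_{0g} + (L_g+M_g)\otimes F_g)\hat{e}_0$. The sketch below uses plain-list matrix helpers and a hypothetical 2-agent topology and gain, not data from the paper.

```python
def matvec(A, x):
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def madd(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def kron(A, B):
    n, m = len(B), len(B[0])
    return [[A[i // n][j // m] * B[i % n][j % m]
             for j in range(len(A[0]) * m)] for i in range(len(A) * n)]

# Hypothetical data: 2 agents, 2-dimensional leader dynamics and observer gain
A0 = [[0.0, 1.0], [-1.0, 0.0]]
F = [[-2.0, 0.0], [0.0, -2.0]]
L = [[1.0, -1.0], [-1.0, 1.0]]   # Laplacian of a 2-agent graph
M = [[1.0, 0.0], [0.0, 0.0]]     # only agent 1 hears the leader
e1, e2 = [0.5, -0.3], [0.2, 0.4]

# Per-agent form: de_i = A0 e_i + F(sum_j a_ij (e_i - e_j) + m_i e_i)
d1 = [d + f for d, f in zip(matvec(A0, e1),
      matvec(F, [(x - y) + x for x, y in zip(e1, e2)]))]
d2 = [d + f for d, f in zip(matvec(A0, e2),
      matvec(F, [(x - y) for x, y in zip(e2, e1)]))]

# Stacked form: de = (I_N (x) A0 + (L + M) (x) F) e
I2 = [[1.0, 0.0], [0.0, 1.0]]
big = madd(kron(I2, A0), kron(madd(L, M), F))
stacked = matvec(big, e1 + e2)
print(d1 + d2, stacked)
```

Both evaluations agree componentwise, which is the identity used to pass from the agent-level error equation to the stacked system analyzed in Theorem 1.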
The following theorem provides the stabilization condition for system (10), formulated in terms of LMIs.
Theorem 1. 
Suppose there exist matrices $0 < Q_g = Q_g^T \in \mathbb{R}^{l\times l}$ and $\bar{F}_g \in \mathbb{R}^{l\times l}$ such that the following condition holds for $g \in \mathbb{N}_\alpha$:
$$0 > I_N \otimes \Big(\mathrm{He}\{Q_gA_{0g}\} + \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}Q_h\Big) + \mathrm{He}\big\{(L_g+M_g)\otimes\bar{F}_g\big\}. \tag{11}$$
Then, system (10) is stochastically stable under the Markov network topology, and the observer gain is obtained by F g = Q g 1 F ¯ g .
Proof. 
Let us choose a mode-dependent Lyapunov function of the following form:
$$V(t, \phi(t)=g) = \hat{e}_0^T(t)\big(I_N\otimes Q_g\big)\hat{e}_0(t), \tag{12}$$
where $I_N\otimes Q_g = \mathrm{diag}(Q_g, \ldots, Q_g) > 0$. Then, the weak infinitesimal operator $\mathcal{L}$ acting on $V(t, \phi(t))$ provides
$$\begin{aligned}
\mathcal{L}V(t) &= \lim_{\delta\to 0}\frac{1}{\delta}\mathbb{E}\big\{V(t+\delta, \phi(t+\delta)=h \,|\, \phi(t)=g) - V(t, \phi(t)=g)\big\} \\
&= \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}V(t,h) + \dot{V}(t,g) \\
&= \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}\,\hat{e}_0^T(t)\big(I_N\otimes Q_h\big)\hat{e}_0(t) + \mathrm{He}\big\{\hat{e}_0^T(t)\big(I_N\otimes Q_g\big)\dot{\hat{e}}_0(t)\big\} \\
&= \hat{e}_0^T(t)\Big(\mathrm{He}\big\{(I_N\otimes Q_g)\big(I_N\otimes A_{0g} + (L_g+M_g)\otimes F_g\big)\big\} + I_N\otimes\sum_{h\in\mathbb{N}_\alpha}\pi_{gh}Q_h\Big)\hat{e}_0(t). \tag{13}
\end{aligned}$$
Thus, by (13), it can be seen that $\mathcal{L}V(t) < 0$ holds if
$$0 > \mathrm{He}\big\{(I_N\otimes Q_g)\big(I_N\otimes A_{0g} + (L_g+M_g)\otimes F_g\big)\big\} + I_N\otimes\tilde{Q}_g, \tag{14}$$
where $\tilde{Q}_g = \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}Q_h$. That is, (14) implies that there exists a small scalar $\kappa > 0$ such that $\mathcal{L}V(t) \leq -\kappa\|\hat{e}_0(t)\|_2^2$, and by a generalization of Dynkin's formula, it is obtained that
$$\mathbb{E}\{V(t)\} - V(0) = \mathbb{E}\Big\{\int_0^t \mathcal{L}V(\tau)\,d\tau \,\Big|\, \hat{e}_0(0), \phi(0)\Big\} \leq -\kappa\,\mathbb{E}\Big\{\int_0^t \|\hat{e}_0(\tau)\|_2^2\,d\tau \,\Big|\, \hat{e}_0(0), \phi(0)\Big\},$$
which results in
$$\lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t \|\hat{e}_0(\tau)\|_2^2\,d\tau \,\Big|\, \hat{e}_0(0), \phi(0)\Big\} \leq \kappa^{-1}V(0) < \infty.$$
Hence, based on Definition 1, system (10) can be said to be stochastically stable. Moreover, defining $\bar{F}_g = Q_gF_g$, condition (14) can be converted into
$$0 > I_N\otimes\mathrm{He}\{Q_gA_{0g}\} + \mathrm{He}\big\{(L_g+M_g)\otimes\bar{F}_g\big\} + I_N\otimes\tilde{Q}_g,$$
which becomes (11). □
Remark 6. 
It is worth noting that, under the switching network topology, not all follower agents can obtain the leader–state values directly, but only those connected to the leader. For distributed control purposes, the mode-dependent leader–state observer is designed for each agent such that the estimated states can track the leader–state values through intermittent communication. Indeed, following Theorem 1, system (10) is stochastically stable, which leads to $\lim_{t\to\infty}\hat{e}_{0i}(t) = \lim_{t\to\infty}\big(\hat{x}_{0i}(t) - x_0(t)\big) = 0$.
Remark 7. 
As shown in Figure 1, x ^ 0 i ( t ) is multiplied by each component in the solution pair ( Λ i g , Π i g ) of regulator equations as the output of the leader–state observer and then sent to the controller to reduce the complexity arising from the asynchronous controller-side mode.

3.2. Distributed Controller Design

Let us define $\tilde{x}_i(t) = x_i(t) - \Lambda_{ig}x_0(t)$. Then, the error system of the $i$th agent is given as follows:
$$\dot{\tilde{x}}_i(t) = A_{ig}x_i(t) + B_{ig}u_i(t) + D_{ig}x_0(t) - \Lambda_{ig}A_{0g}x_0(t). \tag{17}$$
Furthermore, for (17), the following distributed control law is considered:
$$u_i(t) = K_{is}\big(x_i(t) - \Lambda_{ig}\hat{x}_{0i}(t)\big) + \Pi_{ig}\hat{x}_{0i}(t), \tag{18}$$
where K i s = K i ( ρ ( t ) = s ) denotes the asynchronous mode-dependent controller gain with s N β .
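For a single agent, the distributed control law above is a direct matrix-vector computation on the local state and the observer output. The sketch below uses hypothetical gains and regulator solutions purely for illustration (they are not taken from the paper's examples).

```python
def matvec(A, x):
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def control_input(K, Lam, Pi, x_i, xhat0):
    """u_i = K (x_i - Lam xhat0) + Pi xhat0 -- the distributed law."""
    dev = [a - b for a, b in zip(x_i, matvec(Lam, xhat0))]
    return [k + p for k, p in zip(matvec(K, dev), matvec(Pi, xhat0))]

# Hypothetical data: n = l = 2, m = 1
K = [[-3.0, -2.0]]               # asynchronous mode-dependent gain K_is
Lam = [[1.0, 0.0], [0.0, 1.0]]   # regulator solution Lambda_ig
Pi = [[0.0, 0.1]]                # regulator solution Pi_ig
x_i = [0.4, -0.2]
xhat0 = [0.1, 0.1]
u = control_input(K, Lam, Pi, x_i, xhat0)
print(u)
```

Note that the controller only needs $\Lambda_{ig}\hat{x}_{0i}$ and $\Pi_{ig}\hat{x}_{0i}$, which is why (as Remark 7 explains) these products can be formed at the observer side before transmission.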
Thus, based on Assumption 2, the closed-loop system with (17) and (18) is described as follows:
$$\begin{aligned}
\dot{\tilde{x}}_i(t) &= A_{ig}x_i(t) + B_{ig}\big[K_{is}\big(x_i(t)-\Lambda_{ig}\hat{x}_{0i}(t)\big) + \Pi_{ig}\hat{x}_{0i}(t)\big] + D_{ig}x_0(t) - \Lambda_{ig}A_{0g}x_0(t) \\
&= A_{ig}\big(\tilde{x}_i(t)+\Lambda_{ig}x_0(t)\big) + B_{ig}K_{is}\big(\tilde{x}_i(t)+\Lambda_{ig}x_0(t)\big) - B_{ig}K_{is}\Lambda_{ig}\hat{x}_{0i}(t) \\
&\quad + B_{ig}\Pi_{ig}\hat{x}_{0i}(t) + D_{ig}x_0(t) - \Lambda_{ig}A_{0g}x_0(t) \\
&= \big(A_{ig}+B_{ig}K_{is}\big)\tilde{x}_i(t) - \big(B_{ig}K_{is}\Lambda_{ig} - B_{ig}\Pi_{ig}\big)\hat{e}_{0i}(t). \tag{19}
\end{aligned}$$
The following theorem presents the LMI-based cooperative output regulation conditions of (19).
Theorem 2. 
For any scalars $\mu > 0$, $\epsilon > 0$, and $\delta > 0$, suppose that there exist matrices $0 < \bar{P}_{ig} = \bar{P}_{ig}^T \in \mathbb{R}^{n\times n}$, $0 < Z_{igh} = Z_{igh}^T \in \mathbb{R}^{n\times n}$, $W_i = W_i^T \in \mathbb{R}^{n\times n}$, and $\bar{K}_{is} \in \mathbb{R}^{m\times n}$ such that the following conditions hold for $i \in \mathbb{N}_N$ and $g \in \mathbb{N}_\alpha$:
$$0 > \begin{bmatrix} -2W_i & (\ast) & (\ast) & (\ast) & (\ast) & (\ast) \\ (2,1) & (2,2) & (\ast) & (\ast) & (\ast) & (\ast) \\ 0 & 0 & -I & (\ast) & (\ast) & (\ast) \\ 0 & \sum_{s\in\mathbb{N}_\beta}\varpi_{gs}\bar{K}_{is}^TB_{ig}^T & 0 & -\delta^{-1}W_i & (\ast) & (\ast) \\ 0 & 0 & -I & 0 & -\delta W_i & (\ast) \\ W_i & 0 & 0 & 0 & 0 & -\mu\bar{P}_{ig} \end{bmatrix}, \tag{20}$$
$$0 \leq \begin{bmatrix} Z_{igh} & (\ast) \\ \bar{P}_{ig} & \bar{P}_{ih} \end{bmatrix}, \quad h \in \mathbb{N}_\alpha\setminus\{g\}, \tag{21}$$
where
$$(2,1) = A_{ig}W_i + \sum_{s\in\mathbb{N}_\beta}\varpi_{gs}B_{ig}\bar{K}_{is} + \bar{P}_{ig}, \qquad (2,2) = (\epsilon+\pi_{gg})\bar{P}_{ig} + \sum_{h\in\mathbb{N}_\alpha\setminus\{g\}}\pi_{gh}Z_{igh} + B_{ig}B_{ig}^T - \mu^{-1}\bar{P}_{ig}.$$
Then, system (1) achieves the cooperative output regulation, where the controller gain is constructed by $K_{is} = \bar{K}_{is}W_i^{-1}$.
Proof. 
Let us consider the following mode-dependent Lyapunov function:
V i ( t , ϕ ( t ) = g ) = x ˜ i T ( t ) P i g x ˜ i ( t ) ,
where P i g = P i g T > 0 . Then, applying the weak infinitesimal operator to V i ( t , ϕ ( t ) ) leads to
$$\begin{aligned}
\mathcal{L}V_i(t) + \epsilon V_i(t) &= \lim_{\delta\to 0}\frac{1}{\delta}\mathbb{E}\big\{V_i(t+\delta, \phi(t+\delta)=h \,|\, \phi(t)=g) - V_i(t, \phi(t)=g)\big\} + \epsilon V_i(t) \\
&= \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}V_i(t,h) + \mathbb{E}\{\dot{V}_i(t,g)\} + \epsilon V_i(t) \\
&= 2\tilde{x}_i^T(t)P_{ig}\sum_{s\in\mathbb{N}_\beta}\varpi_{gs}\Big[\big(A_{ig}+B_{ig}K_{is}\big)\tilde{x}_i(t) - \big(B_{ig}K_{is}\Lambda_{ig} - B_{ig}\Pi_{ig}\big)\hat{e}_{0i}(t)\Big] \\
&\quad + \tilde{x}_i^T(t)\Big(\sum_{h\in\mathbb{N}_\alpha}\pi_{gh}P_{ih}\Big)\tilde{x}_i(t) + \epsilon V_i(t).
\end{aligned}$$
Also, by Lemma 1, it is obtained that
$$\begin{aligned}
\mathcal{L}V_i(t) + \epsilon V_i(t) &\leq \tilde{x}_i^T(t)\mathrm{He}\big\{P_{ig}\big(A_{ig}+B_{ig}K_{ig}\big)\big\}\tilde{x}_i(t) + \tilde{x}_i^T(t)P_{ig}B_{ig}K_{ig}K_{ig}^TB_{ig}^TP_{ig}\tilde{x}_i(t) \\
&\quad + \hat{e}_{0i}^T(t)\Lambda_{ig}^T\Lambda_{ig}\hat{e}_{0i}(t) + \tilde{x}_i^T(t)P_{ig}B_{ig}B_{ig}^TP_{ig}\tilde{x}_i(t) + \hat{e}_{0i}^T(t)\Pi_{ig}^T\Pi_{ig}\hat{e}_{0i}(t) \\
&\quad + \tilde{x}_i^T(t)\Big(\sum_{h\in\mathbb{N}_\alpha}\pi_{gh}P_{ih}\Big)\tilde{x}_i(t) + \epsilon\tilde{x}_i^T(t)P_{ig}\tilde{x}_i(t) \\
&\leq \tilde{x}_i^T(t)Y_{ig}\tilde{x}_i(t) + \big(\|\Lambda_{ig}\|_2^2 + \|\Pi_{ig}\|_2^2\big)\|\hat{e}_{0i}(t)\|_2^2,
\end{aligned}$$
where $K_{ig} = \sum_{s\in\mathbb{N}_\beta}\varpi_{gs}K_{is}$, and
$$Y_{ig} = \epsilon P_{ig} + \sum_{h\in\mathbb{N}_\alpha}\pi_{gh}P_{ih} + \mathrm{He}\big\{P_{ig}\big(A_{ig}+B_{ig}K_{ig}\big)\big\} + P_{ig}B_{ig}B_{ig}^TP_{ig} + P_{ig}B_{ig}K_{ig}K_{ig}^TB_{ig}^TP_{ig}.$$
Thus, the condition $Y_{ig} < 0$ implies
$$\mathcal{L}V_i(t) < -\epsilon V_i(t) + \gamma\|\hat{e}_{0i}(t)\|_2^2,$$
where $\gamma = \|\Lambda_{ig}\|_2^2 + \|\Pi_{ig}\|_2^2$. Furthermore, using a generalization of Dynkin's formula, it is obtained that
$$\mathbb{E}\{V_i(t)\} - V_i(0) = \mathbb{E}\Big\{\int_0^t \mathcal{L}V_i(\tau)\,d\tau \,\Big|\, \tilde{x}_i(0), \phi(0), \rho(0)\Big\}.$$
Thus, it is valid that
$$\mathbb{E}\{V_i(t)\} - V_i(0) < -\epsilon\,\mathbb{E}\Big\{\int_0^t V_i(\tau)\,d\tau \,\Big|\, \tilde{x}_i(0), \phi(0), \rho(0)\Big\} + \gamma\,\mathbb{E}\Big\{\int_0^t \|\hat{e}_{0i}(\tau)\|_2^2\,d\tau \,\Big|\, \hat{e}_{0i}(0), \phi(0)\Big\}. \tag{25}$$
Also, Theorem 1 ensures $\mathbb{E}\big\{\int_0^t\|\hat{e}_{0i}(\tau)\|_2^2\,d\tau \,|\, \hat{e}_{0i}(0), \phi(0)\big\} \leq \eta_i < \infty$, where the scalar $\eta_i$ stands for a finite constant. Accordingly, by considering a sufficiently small scalar $\vartheta > 0$ such that $\epsilon V_i(\tau) \geq \vartheta\|\tilde{x}_i(\tau)\|_2^2$, it follows from (25) that
$$\lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t\|\tilde{x}_i(\tau)\|_2^2\,d\tau \,\Big|\, \tilde{x}_i(0), \phi(0), \rho(0)\Big\} \leq \vartheta^{-1}\big(V_i(0) + \gamma\eta_i\big) < \infty, \tag{26}$$
which guarantees that system (19) is stochastically stable according to Definition 1. As a result, since $e_i(t) = y_i(t) - y_0(t) = C_{ig}\tilde{x}_i(t) + C_{ig}\Lambda_{ig}x_0(t) - C_{0g}x_0(t) = C_{ig}\tilde{x}_i(t)$, condition (26) leads to
$$\lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t\|e_i(\tau)\|_2^2\,d\tau \,\Big|\, e_i(0), \phi(0), \rho(0)\Big\} = \lim_{t\to\infty}\mathbb{E}\Big\{\int_0^t\|C_{ig}\tilde{x}_i(\tau)\|_2^2\,d\tau \,\Big|\, \tilde{x}_i(0), \phi(0), \rho(0)\Big\} < \infty.$$
Hence, based on Definition 2, it can be seen that the condition $Y_{ig} < 0$ guarantees the stochastically cooperative output regulation of system (1). Meanwhile, the condition $Y_{ig} < 0$ can be converted into
$$0 > (\epsilon + \pi_{gg})\bar{P}_{ig} + \bar{\mathcal{P}}_{ig} + \mathrm{He}\big\{\big(A_{ig}+B_{ig}K_{ig}\big)\bar{P}_{ig}\big\} + B_{ig}B_{ig}^T + B_{ig}K_{ig}K_{ig}^TB_{ig}^T, \tag{28}$$
where $\bar{P}_{ig} = P_{ig}^{-1}$ and $\bar{\mathcal{P}}_{ig} = \sum_{h\in\mathbb{N}_\alpha\setminus\{g\}}\pi_{gh}\bar{P}_{ig}P_{ih}\bar{P}_{ig}$. Furthermore, by the Schur complement, condition (21) ensures $\bar{P}_{ig}P_{ih}\bar{P}_{ig} \leq Z_{igh}$ for $h \in \mathbb{N}_\alpha\setminus\{g\}$. Thus, based on (21), condition (28) holds if it is satisfied that
$$0 > (\epsilon+\pi_{gg})\bar{P}_{ig} + \sum_{h\in\mathbb{N}_\alpha\setminus\{g\}}\pi_{gh}Z_{igh} + \mathrm{He}\big\{\big(A_{ig}+B_{ig}K_{ig}\big)\bar{P}_{ig}\big\} + B_{ig}B_{ig}^T + B_{ig}K_{ig}K_{ig}^TB_{ig}^T. \tag{29}$$
Also, applying the Schur complement to (29) yields
$$0 > \begin{bmatrix} \mathrm{He}\{(A_{ig}+B_{ig}K_{ig})\bar{P}_{ig}\} + X_{ig} & (\ast) \\ 0 & -I \end{bmatrix} + \mathrm{He}\Bigg\{\begin{bmatrix} B_{ig}K_{ig}W_i \\ 0 \end{bmatrix}\begin{bmatrix} 0 & W_i^{-1} \end{bmatrix}\Bigg\}, \tag{30}$$
where
$$X_{ig} = (\epsilon+\pi_{gg})\bar{P}_{ig} + \sum_{h\in\mathbb{N}_\alpha\setminus\{g\}}\pi_{gh}Z_{igh} + B_{ig}B_{ig}^T.$$
Accordingly, by Lemma 1, condition (30) holds if
$$0 > \begin{bmatrix} \mathrm{He}\{(A_{ig}+B_{ig}K_{ig})\bar{P}_{ig}\} + X_{ig} & (\ast) \\ 0 & -I \end{bmatrix} + \delta^{-1}\begin{bmatrix} 0 \\ W_i^{-1} \end{bmatrix}W_i\begin{bmatrix} 0 & W_i^{-1} \end{bmatrix} + \delta\begin{bmatrix} B_{ig}K_{ig}W_i \\ 0 \end{bmatrix}W_i^{-1}\begin{bmatrix} W_iK_{ig}^TB_{ig}^T & 0 \end{bmatrix}, \tag{31}$$
which is converted by the Schur complement as follows:
$$0 > \begin{bmatrix} \mathrm{He}\{(A_{ig}+B_{ig}K_{ig})\bar{P}_{ig}\} + X_{ig} & (\ast) & (\ast) & (\ast) \\ 0 & -I & (\ast) & (\ast) \\ \sum_{s\in\mathbb{N}_\beta}\varpi_{gs}\bar{K}_{is}^TB_{ig}^T & 0 & -\delta^{-1}W_i & (\ast) \\ 0 & -I & 0 & -\delta W_i \end{bmatrix}. \tag{32}$$
Indeed, performing the congruence transformation on (20) with
$$S = \begin{bmatrix} A_{ig}+B_{ig}K_{ig} & I & 0 & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & 0 & 0 & I \end{bmatrix},$$
we can obtain
$$0 > \begin{bmatrix} \mathcal{Q}_{ig} & (\ast) & (\ast) & (\ast) & (\ast) \\ 0 & -I & (\ast) & (\ast) & (\ast) \\ \sum_{s\in\mathbb{N}_\beta}\varpi_{gs}\bar{K}_{is}^TB_{ig}^T & 0 & -\delta^{-1}W_i & (\ast) & (\ast) \\ 0 & -I & 0 & -\delta W_i & (\ast) \\ \bar{P}_{ig} & 0 & 0 & 0 & -\mu\bar{P}_{ig} \end{bmatrix}, \tag{33}$$
where $\bar{K}_{is} = K_{is}W_i$, and
$$\mathcal{Q}_{ig} = \mathrm{He}\big\{\big(A_{ig}+B_{ig}K_{ig}\big)\bar{P}_{ig}\big\} + (\epsilon+\pi_{gg})\bar{P}_{ig} + \sum_{h\in\mathbb{N}_\alpha\setminus\{g\}}\pi_{gh}Z_{igh} + B_{ig}B_{ig}^T - \mu^{-1}\bar{P}_{ig}.$$
That is, since (33) is equivalent to (32) by the Schur complement, it can be seen that (20) and (21) imply (28), i.e., $Y_{ig} < 0$. □
Remark 8. 
In the synchronous case ( ϕ ( t ) ρ ( t ) = g ), the bilinear problems (the third and fifth terms on the right-hand side) of condition (29) could be addressed using the conventional variable replacement technique by denoting K ˜ i g = K i g P ¯ i g , and then applying the Schur complement method to derive the final LMI conditions. However, because of the difference between the system and controller modes in the asynchronous case ( ϕ ( t ) = g ρ ( t ) = s ), it is impossible to use the aforementioned approach. Hence, additional numerical processes and equivalent LMI conditions are introduced here to handle the coupling problems in condition (29).
Remark 9. 
The most recent studies have concentrated on addressing the cooperative output regulation problem of heterogeneous Markov jump MASs in which both the system and controller modes operate synchronously [23,34], or on deterministic system parameters with a switching network topology [20,21]. Meanwhile, the control strategy proposed in this work can cover both of these problems. Furthermore, we make a first attempt to reflect the influence of the asynchronous mode between the continuous-time heterogeneous MASs and the observer-based distributed controllers on achieving stochastically cooperative output regulation subject to Markov jumps. Thus, it is hard to draw a direct comparison, given the lack of comparable results developed in a framework similar to that of this study.

4. Illustrative Examples

To show the validity of the proposed method, this paper provides two examples.
Example 1. 
Let us consider the following heterogeneous multi-agent system with g N 2 and i N 4 , adopted in [23]:
A 11 = [ 0.3 1.4 1.1 0.3 ] , A 21 = [ 2 1 1 0 ] , A 31 = [ 2 1 1 0 ] , A 41 = [ 0.2 1.5 1.2 0.3 ] , A 12 = [ 0 0.9 0.5 0.3 ] , A 22 = [ 2 1 1 1 ] , A 32 = [ 2 1 1 1 ] , A 42 = [ 0.1 1 0.6 0.4 ] , B i g = [ 1 1 ] , D 11 = [ 0.4 0.1 1.1 0.5 ] , D 21 = [ 1.9 0.5 1 0.2 ] , D 31 = [ 1.9 0.5 1 0.2 ] , D 41 = [ 0.3 0 1.2 0.5 ] , D 12 = [ 0.1 0.6 0.5 0.1 ] , D 22 = [ 2.1 0.5 1 1.2 ] , D 32 = [ 2.1 0.5 1 1.2 ] , D 42 = [ 0 0.5 0.6 0.2 ] , C i g = [ 1 0.1 ] ,
A 01 = [ 0.1 1.5 0 0.2 ] , A 02 = [ 0.1 1.4 0 0.1 ] , C 0 g = [ 1 0.1 ] .
In addition, the used network topology is depicted in Figure 2 (each agent is a numbered circle), which can be characterized as follows:
$$L_1 = \begin{bmatrix} 3 & -3 & 0 & 0 \\ 0 & 2 & 0 & -2 \\ -1 & 0 & 1 & 0 \\ 0 & -3 & 0 & 3 \end{bmatrix}, \quad L_2 = \begin{bmatrix} 1 & 0 & 0 & -1 \\ -3 & 3 & 0 & 0 \\ 0 & -3 & 3 & 0 \\ 0 & -2 & 0 & 2 \end{bmatrix},$$
M 1 = diag ( 0 , 2 , 0 , 0 ) ,   M 2 = diag ( 3 , 0 , 0 , 0 ) .
Also, based on Assumption 2, Λ i g = I 2 , Π i 1 = 0 , Π i 2 = [ 0   0.1 ] . Furthermore, to handle two synchronous and asynchronous cases between the system and controller modes, we consider the following transition rate π g h and conditional probability ϖ g s :
$$[\pi_{gh}]_{g,h\in\mathbb{N}_2} = \begin{bmatrix} -0.5 & 0.5 \\ 0.7 & -0.7 \end{bmatrix}, \tag{36}$$
Case 1 (synchronous case):
$$[\varpi_{gs}]_{g\in\mathbb{N}_2,\,s\in\mathbb{N}_2} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \tag{37}$$
Case 2 (asynchronous case):
$$[\varpi_{gs}]_{g\in\mathbb{N}_2,\,s\in\mathbb{N}_2} = \begin{bmatrix} 0.7 & 0.3 \\ 0.5 & 0.5 \end{bmatrix}. \tag{38}$$
Thereupon, for μ = 0.05 , δ = 0.05 , and ϵ = 0.1 , Theorems 1 and 2 provide the following leader–state observer gain F g and controller gain K i s :
F 1 = [ 1.8266 0.5765 0.5059 0.1793 ] , F 2 = [ 1.0958 0.4640 0.3766 0.2335 ] ,
Case 1 (Synchronous case):
K 11 = [ 7.1527   6.1036 ] ,   K 21 = [ 10.5399   5.1965 ] ,   K 31 = [ 10.5399   5.1965 ] ,   K 41 = [ 7.3693   6.0404 ] ,   K 12 = [ 8.0258   6.1664 ] ,   K 22 = [ 8.3219   4.9543 ] ,   K 32 = [ 8.3219   4.9543 ] ,   K 42 = [ 8.3020   6.1352 ] ,
Case 2 (Asynchronous case):
K 11 = [ 5.8430   6.0093 ] ,   K 21 = [ 13.8669   5.5598 ] , K 31 = [ 13.8669   5.5598 ] ,   K 41 = [ 5.9701   5.8983 ] , K 12 = [ 10.2085   6.3236 ] ,   K 22 = [ 2.7768   4.3487 ] , K 32 = [ 2.7768   4.3487 ] , K 42 = [ 10.6340   6.3720 ] .
The details of the hardware used in this experiment are provided in Table 1, and the experiment is conducted in MATLAB for both cases. Averaged over 10,000 trials, the computation times are approximately 0.0363 s for the synchronous case and 0.0394 s for the asynchronous case. Despite this slight difference, the computation times needed to derive the final results are approximately the same for both cases.
Let us consider the following initial conditions: $x_0(0) = [0.1, 0.1]^T = x_i(0)$, $\hat{x}_{01}(0) = [0.2, 0.2]^T$, $\hat{x}_{02}(0) = [0.3, 0.2]^T$, $\hat{x}_{03}(0) = [0.2, 0.3]^T$, and $\hat{x}_{04}(0) = [0.1, 0.2]^T$. As shown in Figure 3, the hidden mode (also called the system mode) and the observed mode (also called the control mode) are generated according to (36)–(38). Then, based on (39), Figure 4 shows the leader–state estimation error $\hat{e}_{0i} = \hat{x}_{0i} - x_0$, where $\hat{e}_{0i} = [\hat{e}_{0i1}\ \hat{e}_{0i2}]^T$. From Figure 4, it can be seen that $\hat{x}_{0i}$ steadily approaches $x_0$ as time increases, which reveals that the observer (8) with (39) can accurately estimate the leader–state despite abrupt changes in the system mode (related to the network topology mode). Next, based on (40), Figure 5a shows the leader and agent outputs for the synchronous case, which demonstrates the cooperative output regulation: all agent outputs (solid lines) follow the leader's output (green dotted line). Furthermore, Figure 6a shows the output error $e_i = y_i - y_0$, clearly illustrating how the agent outputs approach the leader's output from the given initial conditions. Meanwhile, based on (41), Figure 5b shows the leader and agent outputs for the asynchronous case, which illustrates that all agent outputs follow the leader's output as time increases, despite the emergence of Markov switching and the asynchronous phenomenon. Figure 6b shows that the output error $e_i$ steadily converges to zero as time increases, which validates the results presented in Figure 5b. Eventually, from Figure 5a,b, it can be seen that the proposed method can be successfully used to achieve the cooperative output regulation for heterogeneous multi-agent systems with hidden Markov jumps, covering both the synchronous and asynchronous cases.
Example 2 
(Practical application). Consider the following double-integrator dynamics driven by different types of actuators, adopted from [36], for $g \in \mathbb{N}_2$ and $i \in \mathbb{N}_4$:
$$
A_{ig} = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & c_i \\ 0 & -d_{ig} & -a_i \end{bmatrix}, \quad
B_{ig} = \begin{bmatrix} 0 \\ 0 \\ b_i \end{bmatrix}, \quad
C_{ig} = \begin{bmatrix} 1 & 0 & 0 \end{bmatrix}, \quad D_i = 0,
$$
where the system state $x_i = [x_{1i}, x_{2i}, x_{3i}]^T$ consists of the position $x_{1i}$, the velocity $x_{2i}$, and the actuator state $x_{3i}$; $a_i > 0$ denotes the speed of the actuator; $b_i$ and $c_i > 0$ are gains; and $d_{ig} \geq 0$ represents the influence of the velocity on the actuator. Specifically, the value of $d_{ig}$ changes according to the Markov process of the system mode $g \in \mathbb{N}_2$, which reflects environmental effects on the actuator or internal factors such as temperature, humidity, and lifespan.
If $d_{ig} > 0$, the actuator is influenced by the velocity; otherwise, $d_{ig} = 0$ (refer to [36] for more details). In this paper, the system parameters are set to $a_1 = 1$, $a_2 = 10$, $a_3 = a_4 = 2$, $b_1 = b_3 = b_4 = 1$, $b_2 = 2$, $c_i = 1$, $d_{11} = d_{21} = d_{42} = 0$, $d_{41} = d_{12} = d_{22} = 1$, $d_{31} = 10$, and $d_{32} = 5$. Furthermore, to synchronize the positions and velocities of the agents, the leader is described according to [36] as follows:
$$
A_{0g} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \quad C_{0g} = \begin{bmatrix} 1 & 0 \end{bmatrix}.
$$
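The agent and leader matrices with the listed parameters can be assembled numerically as follows. This is a hedged Python/NumPy sketch, not code from the paper: the function name `agent_matrices` is ours, and the negative signs on $d_{ig}$ and $a_i$ in the last row are an assumption (a stable actuator pole at $-a_i$ with velocity feedback $-d_{ig}$), consistent with the regulator-equation solution given later in this example.

```python
import numpy as np

# Example-2 parameters from the text (agents i = 1..4, modes g = 1, 2).
a = {1: 1.0, 2: 10.0, 3: 2.0, 4: 2.0}
b = {1: 1.0, 2: 2.0, 3: 1.0, 4: 1.0}
c = {i: 1.0 for i in range(1, 5)}
d = {(1, 1): 0.0, (2, 1): 0.0, (4, 2): 0.0,
     (4, 1): 1.0, (1, 2): 1.0, (2, 2): 1.0,
     (3, 1): 10.0, (3, 2): 5.0}

def agent_matrices(i, g):
    """Double-integrator-plus-actuator dynamics for agent i in mode g
    (sign convention assumed: actuator pole at -a_i, velocity term -d_ig)."""
    A = np.array([[0.0, 1.0, 0.0],
                  [0.0, 0.0, c[i]],
                  [0.0, -d[(i, g)], -a[i]]])
    B = np.array([[0.0], [0.0], [b[i]]])
    C = np.array([[1.0, 0.0, 0.0]])
    return A, B, C

# Leader: a double integrator whose position is the regulated output.
A0 = np.array([[0.0, 1.0], [0.0, 0.0]])
C0 = np.array([[1.0, 0.0]])
```

Switching the mode argument `g` changes only the velocity-to-actuator coupling $d_{ig}$, which is exactly how the Markov process enters the plant here.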
In addition, the network topology used here is depicted in Figure 7 (each agent is a numbered circle) and can be characterized as follows:
$$
L_1 = \begin{bmatrix} 1 & 0 & 0 & -1 \\ -1 & 1 & 0 & 0 \\ 0 & -1 & 2 & -1 \\ -1 & 0 & -1 & 2 \end{bmatrix}, \quad
L_2 = \begin{bmatrix} 2 & -1 & 0 & -1 \\ 0 & 1 & -1 & 0 \\ 0 & -1 & 1 & 0 \\ -1 & 0 & 0 & 1 \end{bmatrix},
$$
$$
M_1 = \operatorname{diag}(1, 0, 0, 0), \quad M_2 = \operatorname{diag}(0, 1, 0, 0).
$$
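The mode-dependent Laplacians and pinning matrices can be checked numerically. In the sketch below (a Python illustration; the off-diagonal signs are written under the standard convention that every Laplacian row sums to zero), we also verify that $L_g + M_g$ is nonsingular, i.e., that in each mode the leader's information can reach every follower, which is what leader-state estimation over the switching topology relies on.

```python
import numpy as np

# Mode-dependent Laplacians of the follower graph and pinning matrices.
L1 = np.array([[ 1,  0,  0, -1],
               [-1,  1,  0,  0],
               [ 0, -1,  2, -1],
               [-1,  0, -1,  2]], dtype=float)
L2 = np.array([[ 2, -1,  0, -1],
               [ 0,  1, -1,  0],
               [ 0, -1,  1,  0],
               [-1,  0,  0,  1]], dtype=float)
M1 = np.diag([1.0, 0.0, 0.0, 0.0])   # agent 1 pinned in mode 1
M2 = np.diag([0.0, 1.0, 0.0, 0.0])   # agent 2 pinned in mode 2

for L, M in ((L1, M1), (L2, M2)):
    assert np.allclose(L.sum(axis=1), 0)        # Laplacian row sums are zero
    assert np.linalg.matrix_rank(L + M) == 4    # L_g + M_g nonsingular
```

Both checks pass for the two modes above, so each switching topology admits leader-state estimation by all four agents.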
Also, based on Assumption 2, we can see that
$$
\Lambda_{ig} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \\ 0 & 0 \end{bmatrix}, \quad
\Pi_{ig} = \begin{bmatrix} 0 & \dfrac{d_{ig}}{b_i} \end{bmatrix}.
$$
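The pair $(\Lambda_{ig}, \Pi_{ig})$ can be verified against the standard regulator equations $\Lambda_{ig} A_{0g} = A_{ig}\Lambda_{ig} + B_{ig}\Pi_{ig}$ and $C_{ig}\Lambda_{ig} = C_{0g}$. The following is a sketch under the sign convention assumed for $A_{ig}$ in this example (actuator pole at $-a_i$, velocity term $-d_{ig}$); the helper name `regulator_pair` is ours.

```python
import numpy as np

def regulator_pair(c_i, a_i, b_i, d_ig):
    """Candidate solution (Lambda, Pi) of the regulator equations
    Lambda*A0 = A*Lambda + B*Pi and C*Lambda = C0 (assumed form)."""
    Lam = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
    Pi = np.array([[0.0, d_ig / b_i]])
    return Lam, Pi

A0 = np.array([[0.0, 1.0], [0.0, 0.0]])
C0 = np.array([[1.0, 0.0]])

# Check the pair for agent 3 in mode 1 (a = 2, b = 1, c = 1, d = 10).
c_i, a_i, b_i, d_ig = 1.0, 2.0, 1.0, 10.0
A = np.array([[0.0, 1.0, 0.0], [0.0, 0.0, c_i], [0.0, -d_ig, -a_i]])
B = np.array([[0.0], [0.0], [b_i]])
C = np.array([[1.0, 0.0, 0.0]])
Lam, Pi = regulator_pair(c_i, a_i, b_i, d_ig)
assert np.allclose(Lam @ A0, A @ Lam + B @ Pi)   # state equation
assert np.allclose(C @ Lam, C0)                  # output equation
```

Both identities hold for every $(i, g)$ pair in this example, confirming the closed form $\Pi_{ig} = [\,0 \;\; d_{ig}/b_i\,]$.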
Furthermore, to handle both the synchronous and asynchronous cases between the system and controller modes, we consider the following transition rate $\pi_{gh}$ and conditional probability $\varpi_{gs}$:
$$
[\pi_{gh}]_{g,h \in \mathbb{N}_2} = \begin{bmatrix} -0.6 & 0.6 \\ 0.4 & -0.4 \end{bmatrix},
$$
$$
\text{Case 1 (Synchronous case):}\quad [\varpi_{gs}]_{g,s \in \mathbb{N}_2} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix},
$$
$$
\text{Case 2 (Asynchronous case):}\quad [\varpi_{gs}]_{g,s \in \mathbb{N}_2} = \begin{bmatrix} 0.2 & 0.8 \\ 0.6 & 0.4 \end{bmatrix}.
$$
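A hidden-mode trajectory together with the asynchronously observed controller-side mode can be simulated as follows. This is a minimal illustrative sketch, not the paper's mode-generation code: holding times are drawn as exponentials from the leave rates of the generator above, and the observed mode is sampled from the conditional probability row of the current hidden mode.

```python
import random

# Leave rate of each hidden mode (off-diagonal of the generator) and
# the conditional probability of the observed mode (asynchronous case).
RATES = {1: 0.6, 2: 0.4}
EMIT = {1: [0.2, 0.8], 2: [0.6, 0.4]}   # P(observed = 1 or 2 | hidden = g)

def simulate_modes(t_end, g0=1, seed=0):
    """Sample a continuous-time hidden-mode path and the observed mode.

    Returns a list of (jump_time, hidden_mode, observed_mode) tuples.
    With two modes, every jump goes to the other mode.
    """
    rng = random.Random(seed)
    t, g, path = 0.0, g0, []
    while t < t_end:
        s = 1 if rng.random() < EMIT[g][0] else 2   # observed mode
        path.append((t, g, s))
        t += rng.expovariate(RATES[g])              # exponential holding time
        g = 2 if g == 1 else 1
    return path

path = simulate_modes(20.0)
```

In the synchronous case, `EMIT` would be replaced by the identity assignment `s = g`, recovering the mode evolution of Case 1.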
Then, for $\mu = 0.05$, $\delta = 0.05$, and $\epsilon = 0.1$, Theorems 1 and 2 provide the following observer gain $F_g$ and controller gain $K_{is}$:
$$
F_1 = \begin{bmatrix} 0.8526 & 0.4293 \\ 0.3945 & 0.4292 \end{bmatrix}, \quad
F_2 = \begin{bmatrix} 0.9814 & 0.5368 \\ 0.5160 & 0.4497 \end{bmatrix},
$$
Case 1 (Synchronous case):
$$
\begin{aligned}
K_{11} &= \begin{bmatrix} 6.8769 & 13.0047 & 14.1589 \end{bmatrix}, &
K_{21} &= \begin{bmatrix} 3.4643 & 6.5518 & 2.6852 \end{bmatrix}, \\
K_{31} &= \begin{bmatrix} 6.8790 & 3.0301 & 13.1795 \end{bmatrix}, &
K_{41} &= \begin{bmatrix} 6.8890 & 12.0268 & 13.1877 \end{bmatrix}, \\
K_{12} &= \begin{bmatrix} 6.7689 & 11.8254 & 13.9587 \end{bmatrix}, &
K_{22} &= \begin{bmatrix} 3.3989 & 5.9427 & 2.5656 \end{bmatrix}, \\
K_{32} &= \begin{bmatrix} 6.7734 & 7.8439 & 12.9812 \end{bmatrix}, &
K_{42} &= \begin{bmatrix} 6.7824 & 12.8468 & 12.9897 \end{bmatrix},
\end{aligned}
$$
Case 2 (Asynchronous case):
$$
\begin{aligned}
K_{11} &= \begin{bmatrix} 6.6609 & 10.6460 & 13.7584 \end{bmatrix}, &
K_{21} &= \begin{bmatrix} 3.3334 & 5.3336 & 2.4461 \end{bmatrix}, \\
K_{31} &= \begin{bmatrix} 6.6678 & 12.6577 & 12.7828 \end{bmatrix}, &
K_{41} &= \begin{bmatrix} 6.6759 & 13.6669 & 12.7918 \end{bmatrix}, \\
K_{12} &= \begin{bmatrix} 6.9308 & 13.5943 & 14.2590 \end{bmatrix}, &
K_{22} &= \begin{bmatrix} 3.4971 & 6.8563 & 2.7450 \end{bmatrix}, \\
K_{32} &= \begin{bmatrix} 6.9318 & 0.6321 & 13.2787 \end{bmatrix}, &
K_{42} &= \begin{bmatrix} 6.9422 & 11.6168 & 13.2866 \end{bmatrix}.
\end{aligned}
$$
Let us consider the following initial conditions: $x_0(0) = [0.2, 0.1]^T$, $x_i(0) = [0.1, 0.1, 0.2]^T$ ($i \in \mathbb{N}_4$), $\hat{x}_{01}(0) = [0.2, 0.3]^T$, $\hat{x}_{02}(0) = [0.3, 0.2]^T$, $\hat{x}_{03}(0) = [0.1, 0.1]^T$, and $\hat{x}_{04}(0) = [0.3, 0.2]^T$. As shown in Figure 8, suppose that the hidden mode (also called the system mode) and the observed mode (also called the control mode) are generated according to (44)–(46). Then, based on (47), Figure 9 shows the leader–state estimation error $\hat{e}_{0i} = \hat{x}_{0i} - x_0$, where $\hat{e}_{0i} = [\hat{e}_{0i1}\ \hat{e}_{0i2}]^T$. As shown in Figure 9, $\hat{x}_{0i}$ steadily approaches $x_0$ as time increases, which reveals that the observer (8) with (47) can accurately estimate the leader state regardless of the abrupt changes in the system (42). Subsequently, Figure 10a shows the leader and agent outputs for the synchronous case, which verifies that (48) achieves the cooperative output regulation of (42), since all agent outputs (solid lines) follow the leader's output (green dotted line). Meanwhile, based on (49), Figure 10b shows the leader and agent outputs for the asynchronous case; all agent outputs again follow the leader's output as time increases, despite the Markov switching and the asynchronous phenomenon. Overall, Figure 10a,b confirm that the proposed method can be effectively used to realize the cooperative output regulation of (42) with hidden Markov jumps in both the synchronous and asynchronous cases.
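To illustrate the regulation mechanism on a single agent, the following sketch simulates agent 1 with the mode fixed at $g = 1$ ($a = b = c = 1$, $d = 0$), assuming the leader state is perfectly estimated. The gain $K$ here is a hypothetical stabilizing gain chosen for illustration (poles at $-1, -2, -3$), not one of the gains synthesized by Theorems 1 and 2; the point is that the feedback-plus-feedforward structure $u = K(x - \Lambda w) + \Pi w$ drives the output error to zero whenever $K$ stabilizes $A + BK$ and $(\Lambda, \Pi)$ solves the regulator equations.

```python
import numpy as np

# Agent 1, mode 1 (assumed sign convention: actuator pole at -a).
A = np.array([[0., 1., 0.], [0., 0., 1.], [0., 0., -1.]])
B = np.array([[0.], [0.], [1.]])
C = np.array([[1., 0., 0.]])
A0 = np.array([[0., 1.], [0., 0.]])      # leader: double integrator
C0 = np.array([[1., 0.]])
Lam = np.array([[1., 0.], [0., 1.], [0., 0.]])  # regulator pair for d = 0
Pi = np.array([[0., 0.]])
K = np.array([[-6., -11., -5.]])         # hypothetical gain: poles -1, -2, -3

dt, T = 1e-3, 10.0
x = np.array([0.1, 0.1, 0.2])            # agent initial state
w = np.array([0.2, 0.1])                 # leader initial state
for _ in range(int(T / dt)):             # forward-Euler integration
    u = K @ (x - Lam @ w) + Pi @ w       # feedback on x - Lam*w + feedforward
    x = x + dt * (A @ x + B.flatten() * u.item())
    w = w + dt * (A0 @ w)
e = (C @ x - C0 @ w).item()              # output regulation error at t = T
```

Since the tracking error $\xi = x - \Lambda w$ obeys $\dot{\xi} = (A + BK)\xi$ under the regulator equations, $e = C\xi$ decays exponentially; after 10 s it is numerically negligible.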

5. Concluding Remarks

In this paper, we investigated the stochastic cooperative output regulation problem of heterogeneous multi-agent systems subject to hidden Markov jumps. In particular, we also considered a time-varying network topology that changes according to the system operation mode. First, a leader–state observer was designed using a mode-dependent Lyapunov function to ensure that all agents can accurately estimate the leader state. Then, an asynchronous mode-dependent distributed controller was designed to ensure stochastic cooperative output regulation for heterogeneous multi-agent systems with hidden Markov jumps. Motivated by recent studies [37,38,39], future work will extend the proposed strategy to more practical control problems, such as stochastic time delay, input saturation, and unknown system dynamics, in the continuous-time (discrete-time) domain for a wider range of applications.

Author Contributions

Conceptualization, G.-B.H.; methodology, G.-B.H.; software, G.-B.H.; validation, S.-H.K.; formal analysis, G.-B.H. and S.-H.K.; writing—review & editing, G.-B.H. and S.-H.K.; supervision, S.-H.K.; funding acquisition, S.-H.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the 2023 Research Fund of the University of Ulsan under Grant 2023-0331.

Data Availability Statement

The authors confirm that the data supporting the findings of this study are available within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Jiménez, A.C.; García-Díaz, V.; Bolaños, S. A decentralized framework for multi-agent robotic systems. Sensors 2018, 18, 417.
2. Dai, S.; Wu, Z.; Zhang, P.; Tan, M.; Yu, J. Distributed formation control for a multi-robotic fish system with model-based event-triggered communication mechanism. IEEE Trans. Ind. Electron. 2023, 70, 11433–11442.
3. Huang, Z.; Chu, D.; Wu, C.; He, Y. Path planning and cooperative control for automated vehicle platoon using hybrid automata. IEEE Trans. Intell. Transp. Syst. 2018, 20, 959–974.
4. Xiao, S.; Ge, X.; Han, Q.L.; Zhang, Y. Dynamic event-triggered platooning control of automated vehicles under random communication topologies and various spacing policies. IEEE Trans. Cybern. 2021, 52, 11477–11490.
5. Cui, J.; Liu, Y.; Nallanathan, A. Multi-agent reinforcement learning-based resource allocation for UAV networks. IEEE Trans. Wirel. Commun. 2019, 19, 729–743.
6. Wang, Y.; Cheng, Z.; Xiao, M. UAVs' formation keeping control based on multi-agent system consensus. IEEE Access 2020, 8, 49000–49012.
7. Yan, Z.; Han, L.; Li, X.; Dong, X.; Li, Q.; Ren, Z. Event-triggered formation control for time-delayed discrete-time multi-agent system applied to multi-UAV formation flying. J. Frankl. Inst.-Eng. Appl. Math. 2023, 360, 3677–3699.
8. Chen, Y.J.; Chang, D.K.; Zhang, C. Autonomous tracking using a swarm of UAVs: A constrained multi-agent reinforcement learning approach. IEEE Trans. Veh. Technol. 2020, 69, 13702–13717.
9. Pham, V.H.; Sakurama, K.; Mou, S.; Ahn, H.S. Distributed control for an urban traffic network. IEEE Trans. Intell. Transp. Syst. 2022, 23, 22937–22953.
10. Qu, Z.; Pan, Z.; Chen, Y.; Wang, X.; Li, H. A distributed control method for urban networks using multi-agent reinforcement learning based on regional mixed strategy Nash-equilibrium. IEEE Access 2020, 8, 19750–19766.
11. Ma, Q.; Xu, S.; Lewis, F.L.; Zhang, B.; Zou, Y. Cooperative output regulation of singular heterogeneous multiagent systems. IEEE Trans. Cybern. 2015, 46, 1471–1475.
12. Li, Z.; Chen, M.Z.; Ding, Z. Distributed adaptive controllers for cooperative output regulation of heterogeneous agents over directed graphs. Automatica 2016, 68, 179–183.
13. Hu, W.; Liu, L. Cooperative output regulation of heterogeneous linear multi-agent systems by event-triggered control. IEEE Trans. Cybern. 2016, 47, 105–116.
14. Zhang, J.; Zhang, H.; Cai, Y.; Lu, Y. Distributed cooperative output regulation of heterogeneous linear multi-agent systems based on event- and self-triggered control with undirected topology. ISA Trans. 2020, 99, 191–198.
15. Yuan, C. Cooperative H∞ output regulation of heterogeneous parameter-dependent multi-agent systems. J. Frankl. Inst.-Eng. Appl. Math. 2017, 354, 7846–7870.
16. Wang, Y.; Xia, J.; Wang, Z.; Zhou, J.; Shen, H. Reliable consensus control for semi-Markov jump multi-agent systems: A leader-following strategy. J. Frankl. Inst.-Eng. Appl. Math. 2019, 356, 3612–3627.
17. Zhang, G.; Li, F.; Wang, J.; Shen, H. Mixed H∞ and passive consensus of Markov jump multi-agent systems under DoS attacks with general transition probabilities. J. Frankl. Inst.-Eng. Appl. Math. 2023, 360, 5375–5391.
18. Li, M.; Deng, F.; Ren, H. Scaled consensus of multi-agent systems with switching topologies and communication noises. Nonlinear Anal.-Hybrid Syst. 2020, 36, 100839.
19. Li, B.; Wen, G.; Peng, Z.; Wen, S.; Huang, T. Time-varying formation control of general linear multi-agent systems under Markovian switching topologies and communication noises. IEEE Trans. Circuits Syst. II-Express Briefs 2020, 68, 1303–1307.
20. Liu, Z.; Yan, W.; Li, H.; Zhang, S. Cooperative output regulation problem of discrete-time linear multi-agent systems with Markov switching topologies. J. Frankl. Inst.-Eng. Appl. Math. 2020, 357, 4795–4816.
21. Meng, M.; Liu, L.; Feng, G. Adaptive output regulation of heterogeneous multiagent systems under Markovian switching topologies. IEEE Trans. Cybern. 2017, 48, 2962–2971.
22. Li, D.; Li, T. Cooperative output feedback tracking control of stochastic linear heterogeneous multi-agent systems. IEEE Trans. Autom. Control 2021, 33, 7154–7180.
23. Dong, S.; Chen, G.; Liu, M.; Wu, Z.G. Cooperative adaptive H∞ output regulation of continuous-time heterogeneous multi-agent Markov jump systems. IEEE Trans. Circuits Syst. II-Express Briefs 2021, 68, 3261–3265.
24. Nguyen, N.H.A.; Kim, S.H. Leader-following consensus for multi-agent systems with asynchronous control modes under nonhomogeneous Markovian jump network topology. IEEE Access 2020, 8, 203017–203027.
25. Ding, L.; Guo, G. Sampled-data leader-following consensus for nonlinear multi-agent systems with Markovian switching topologies and communication delay. J. Frankl. Inst.-Eng. Appl. Math. 2015, 352, 369–383.
26. Nguyen, N.H.A.; Kim, S.H. Asynchronous H∞ observer-based control synthesis of nonhomogeneous Markovian jump systems with generalized incomplete transition rates. Appl. Math. Comput. 2021, 411, 126532.
27. Dong, J.; Yang, G.H. Robust H2 control of continuous-time Markov jump linear systems. Automatica 2008, 44, 1431–1436.
28. Sakthivel, R.; Sakthivel, R.; Kaviarasan, B.; Alzahrani, F. Leader-following exponential consensus of input saturated stochastic multi-agent systems with Markov jump parameters. Neurocomputing 2018, 287, 84–92.
29. He, G.; Zhao, J. Cooperative output regulation of T-S fuzzy multi-agent systems under switching directed topologies and event-triggered communication. IEEE Trans. Fuzzy Syst. 2022, 30, 5249–5260.
30. Huang, J. Nonlinear Output Regulation: Theory and Applications; SIAM: Philadelphia, PA, USA, 2004.
31. Yaghmaie, F.A.; Lewis, F.L.; Su, R. Output regulation of linear heterogeneous multi-agent systems via output and state feedback. Automatica 2016, 67, 157–164.
32. Arrifano, N.S.; Oliveira, V.A. Robust H∞ fuzzy control approach for a class of Markovian jump nonlinear systems. IEEE Trans. Fuzzy Syst. 2006, 14, 738–754.
33. Nguyen, T.B.; Kim, S.H. Nonquadratic local stabilization of nonhomogeneous Markovian jump fuzzy systems with incomplete transition descriptions. Nonlinear Anal.-Hybrid Syst. 2021, 42, 101080.
34. He, S.; Ding, Z.; Liu, F. Output regulation of a class of continuous-time Markovian jumping systems. Signal Process. 2013, 93, 411–419.
35. Wang, Y.; Xie, L.; De Souza, C.E. Robust control of a class of uncertain nonlinear systems. Syst. Control Lett. 1992, 19, 139–149.
36. Wieland, P.; Sepulchre, R.; Allgöwer, F. An internal model principle is necessary and sufficient for linear output synchronization. Automatica 2011, 47, 1068–1074.
37. Yan, S.; Gu, Z.; Park, J.H.; Xie, X. Distributed-delay-dependent stabilization for networked interval type-2 fuzzy systems with stochastic delay and actuator saturation. IEEE Trans. Syst. Man Cybern.-Syst. 2022, 53, 3165–3175.
38. Yan, S.; Gu, Z.; Park, J.H.; Xie, X. A delay-kernel-dependent approach to saturated control of linear systems with mixed delays. Automatica 2023, 152, 110984.
39. Zhang, T.; Li, Y. Global exponential stability of discrete-time almost automorphic Caputo–Fabrizio BAM fuzzy neural networks via exponential Euler technique. Knowl.-Based Syst. 2022, 246, 108675.
Figure 1. Block diagram of the cooperative output regulation of multi-agent systems.
Figure 2. Network topology: (a) for g = 1 , and (b) for g = 2 .
Figure 3. Mode evolution: (a) synchronous case, and (b) asynchronous case.
Figure 4. Estimation error $\hat{e}_{0i} = \hat{x}_{0i} - x_0 = [\hat{e}_{0i1}\ \hat{e}_{0i2}]^T$.
Figure 5. Output of leader and agents: (a) synchronous case, and (b) asynchronous case.
Figure 6. Output error between leader and agents: (a) synchronous case, and (b) asynchronous case.
Figure 7. Network topology: (a) for g = 1 , and (b) for g = 2 .
Figure 8. Mode evolution: (a) synchronous case, and (b) asynchronous case.
Figure 9. Estimation error $\hat{e}_{0i} = \hat{x}_{0i} - x_0 = [\hat{e}_{0i1}\ \hat{e}_{0i2}]^T$.
Figure 10. Output of leader and agents: (a) synchronous case, and (b) asynchronous case.
Table 1. System specifications.
Hardware Resources    Information
Operating System      Microsoft Windows 10 Pro
RAM                   8 GB
Processor             3.20 GHz
Hard Drive            120 GB SSD

Hong, G.-B.; Kim, S.-H. Hidden Markov Model-Based Control for Cooperative Output Regulation of Heterogeneous Multi-Agent Systems under Switching Network Topology. Mathematics 2023, 11, 3481. https://doi.org/10.3390/math11163481