Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications

Liu, Yueying; Wang, Zhen; Lin, Xiangyun

doi:10.3390/axioms12090882

Open AccessArticle

Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications

by

Yueying Liu

^*

,

Zhen Wang

and

Xiangyun Lin

College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China

^*

Author to whom correspondence should be addressed.

Axioms 2023, 12(9), 882; https://doi.org/10.3390/axioms12090882

Submission received: 24 July 2023 / Revised: 8 September 2023 / Accepted: 11 September 2023 / Published: 15 September 2023

(This article belongs to the Special Issue Advances in Analysis and Control of Systems with Uncertainties II)

Download Versions Notes

Abstract

:

This paper is to study finite horizon linear quadratic (LQ) non-zero sum Nash game for discrete-time infinite Markov jump stochastic systems (IMJSSs). Based on the theory of stochastic analysis, a countably infinite set of coupled generalized algebraic Riccati equations are solved and a necessary and sufficient condition for the existence of Nash equilibrium points is obtained. From a new perspective, the finite horizon mixed robust

H_{2} / H_{\infty}

control is investigated, and summarize the relationship between Nash game and

H_{2} / H_{\infty}

control problem. Moreover, the feasibility and validity of the proposed method has been proved by applying it to a numerical example.

Keywords:

infinite Markov jump stochastic systems (IMJSSs); Nash equilibrium points; coupled generalized algebraic Riccati equations; Nash game; H₂/H_∞ control

MSC:

93E03

1. Introduction

Markov jump stochastic systems (MJSSs), as a typical class of stochastic hybrid dynamical systems, are widely used in practical engineering control systems. At the same time, a great deal of related research on the stability, ergodicity, robust control and filtering, and so on [1,2,3,4,5,6,7,8,9] has been undertaken. However, for the Markov chain, when the state space takes values in infinite countably sets, from the application point of view, it may be more extensive. Consequently, plentiful theory literature and research achievements have emerged for infinite Markov jump stochastic systems (IMJSSs). In recent years, various efforts have been made to cope with IMJSSs in a wide variety of systems. To be specific, via the operator spectrum method, exponential stabilities for discrete-time [10] and continuous-time [11] IMJSSs have been investigated respectively. Aiming at the practical time-delay factors, with the aid of Lyapunov stability theory, [12] discussed the stability analysis for IMJSSs with time-delay; further, stabilities for uncertain discrete-time IMJSSs with time-delay have been developed in [13], and on this basis, [14] addressed the finite horizon

H_{2} / H_{\infty}

control problems.

H_{2} / H_{\infty}

fuzzy filtering has been solved for nonlinear IMJSSs by the T-S fuzzy model approach in [15].

Dynamic game theory, which has come into wider use in many fields such as engineering, economics, and management science, has attracted great attention, and a large number of results have been obtained in the literature [16,17,18,19,20,21]. Furthermore, the Nash game problem has been studied for MJSSs, and further, a unified treatment approach for

H_{2}

,

H_{\infty}

and

H_{2} / H_{\infty}

control design problems was presented in [22]. In [23,24], the authors revealed the relationship between the Nash equilibrium points and

H_{2} / H_{\infty}

control for continuous-time MJSSs. While many theoretical results have been established to study the stochastic system governed by a finite-state Markov chains, more research efforts are required to discuss IMJSSs. However, the causal and anticausal Lyapunov operators of IMJSSs are no more adjoint, which is the essential difference between finite and infinite MJSSs [10,25]. In most practical applications, IMJSSs have broader application prospects than MJSSs with finite jumps. As a consequence, the related research about IMJSSs is of critical importance. In the game field, [26] discussed an infinite horizon linear quadratic (LQ) Nash game for continuous-time IMJSSs. Unfortunately, to the best of our knowledge, there is almost no research about the Nash game problem for discrete-time IMJSSs.

In this article, we study a finite horizon LQ non-zero sum Nash game for discrete-time IMJSSs, which considers a more general system. From a new perspective, the finite horizon mixed

H_{2} / H_{\infty}

control is further investigated. On the one hand, this is an extension of the previous study from MJSSs [22], to IMJSSs; on the other, it is a discrete-time case of [26]. Concretely, the main work and contribution of this thesis is as follows. First, the existence of Nash equilibrium points, which can be boiled down to the solvability of a countably infinite set of coupled generalized algebraic Riccati equations, is shown. For infinite Markov jump systems, the causal and anticausal Lyapunov operators are no more adjoint, which leads to the inequivalence between stochastic stability, asymptotic mean square stability, and exponential mean square stability. Further, this indicates the essential difference between the finite and infinite Markov jump systems. To this end, we introduce the infinite dimension Banach spaces, where the elements are countably infinite sequences of linear and bounded operators. Thus, the source of the problem resides in the way to solve this kind of equations, which are harder to solve than those in [22]. Second, the finite horizon mixed

H_{2} / H_{\infty}

control is solved from the new view of a Nash game, and further, the relationship between the Nash game and

H_{2} / H_{\infty}

control problem is summarized with some remarks. Finally, the typical example cited in the paper demonstrates the validity of the proposed method well.

The rest of this article is arranged as below. Some useful preliminary results are introduced in Section 2. The existence of Nash equilibrium points is solved by a necessary and sufficient condition in Section 3. In Section 4, some special cases with some remarks are given. In Section 5, a numerical example is illustrated, and a summary is provided in Section 6.

For convenience, the following notations are adopted.

R^{n}

: n-dimensional real Euclidean space;

R^{m \times n}

: the linear space of all m by n real matrices;

∥ \cdot ∥ :

the Euclidean norm of

R^{n}

or the operator norm of

R^{m \times n}

;

I_{n}

: the

n \times n

identity matrix;

N^{'}

: the transpose of a matrix (or vector) N;

N^{†}

: the pseudo-inverse of a matrix N;

N > 0

(\geq 0)

: N is positive (semi-positive) definite;

N_{T} : = {0, 1, \dots, T}

;

D : = {1, 2, \dots}

.

2. Preliminaries

On a complete probability space (

Ω

,

F

,

P

), we consider the following discrete-time IMJSS:

\{\begin{matrix} y (t + 1) & = Q_{0} (t, ξ_{t}) y (t) + R_{0} (t, ξ_{t}) u (t) + U_{0} (t, ξ_{t}) v (t) \\ + \sum_{k = 1}^{r} [Q_{k} (t, ξ_{t}) y (t) + R_{k} (t, ξ_{t}) u (t) + U_{k} (t, ξ_{t}) v (t)] w_{k} (t), \\ z (t) = & [\begin{matrix} L (t, ξ_{t}) y (t) \\ M (t, ξ_{t}) u (t) \end{matrix}], M {(t, ξ_{t})}^{'} M (t, ξ_{t}) = I_{n_{u}}, \\ y (0) = & y_{0} \in R^{n}, ξ (0) = ξ_{0} \in D, t \in N_{T}, \end{matrix}

(1)

where

y (t) \in R^{n}

represents the system state, and

u (t) \in R^{n_{u}}

and

v (t) \in R^{n_{v}}

are the control processes of two different players, respectively.

z (t) \in R^{n_{z}}

stands for the controlled output.

w (t) = {w (t) | w (t) = (w_{1} (t), w_{2} (t), \dots, w_{r} (t))}

,

t \in N_{T} = {0, 1, \dots, T}

is a standard r-dimensional Brownian motion.

{ξ_{t}}_{t \in N_{T}}

denotes a infinite Markov jump process taking values in

D = {1, 2, \dots}

, and the transition probability matrix is

P = [p (ς, ι)]

with

p (ς, ι) = P (ξ_{t + 1} = ι | ξ_{t} = ς)

. In this paper, we assume

P

is nondegenerate,

π_{0} (ς) = P (ξ_{0} = ς) > 0

for all

ς \in D

, and the stochastic processes

{w_{t}}_{t \in N_{T}}

and

{ξ_{t}}_{t \in N_{T}}

are mutually independent. Let

F_{t} = {ξ_{k}, w_{s} | 0 \leq k \leq t, 0 \leq s \leq t - 1}, F_{0} = σ (ξ_{0})

.

l^{2} (N_{T}; R^{m}) : = {x \in R^{m} | x

is

F_{t}

-measurable, and

(\sum_{t = 0}^{T} {E ∥ x (t) ∥}^{2})^{\frac{1}{2}} < \infty

}.

H_{\infty}^{m \times n}

is defined as the Banach space of the set

{H | H = (H (1), H (2), \dots), H (ς) \in R^{m \times n}}

with the norm

{∥ H ∥}_{\infty} = {sup}_{ς \in D} ∥ H (ς) ∥ < \infty .

In the sequel, we assume that all coefficients of considered systems have a finite norm

{∥ \cdot ∥}_{\infty}

. We make use of the notation of

H_{\infty}^{n}

for

m = n

. Further,

H_{\infty}^{n +}

denotes the subspace of

H_{\infty}^{n}

formed by all

H (ς) \in S_{n}

and

H (ς) \geq 0, ς \in D

. For

L, M \in H_{\infty}^{n +}

,

L \leq M

is equivalent to

L (ς) \leq M (ς), ς \in D

, which means

{∥ L ∥}_{\infty} \leq {∥ M ∥}_{\infty}

. We set

Q_{k} (t) = {Q_{k} (t, ς)}_{ς \in D}

,

0 \leq k \leq r

,

R_{k} (t) = {R_{k} (t, ς)}_{ς \in D}, 0 \leq k \leq r

,

U_{k} (t) = {U_{k} (t, ς)}_{ς \in D}, 0 \leq k \leq r

,

L (t) = {L (t, ς)}_{ς \in D}

,

M (t) = {M (t, ς)}_{ς \in D}

, and assume

Q_{k} (t) \in H_{\infty}^{n},

R_{k} (t) \in H_{\infty}^{n \times n_{u}},

U_{k} (t) \in H_{\infty}^{n \times n_{v}},

L (t) \in H_{\infty}^{n_{z} \times n},

M (t) \in H_{\infty}^{n_{z} \times n_{u}}

for

t \in N_{T}

.

The two cost functions for the Nash game problem are given by

\begin{matrix} J_{1} (x_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) = \sum_{t = 0}^{T} E [γ^{2} {∥ v (t) ∥}^{2} - {∥ z (t) ∥}^{2}], \end{matrix}

(2)

\begin{matrix} J_{2} (x_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot)) = \sum_{t = 0}^{T} {E [∥ z (t) ∥}^{2}], \end{matrix}

(3)

where

γ > 0

is a given prescribed disturbance attenuation level. For simplicity, let the Nash game problem for cost Functions (2) and (3) with Equation (1) by the Nash game problem

P

. Hence, the Nash game problem

P

is addressed, which is to find admissible control

(u^{*} (\cdot), v^{*} (\cdot))

to minimize cost Functions (2) and (3) subject to Equation (1).

Next, we list some definitions and lemmas that are needed for the follow-up procedures.

Definition 1.

A strategy pair

(u^{*} (\cdot), v^{*} (\cdot)) \in l^{2} (N_{T}; R^{n_{u}}) \times l^{2} (N_{T}; R^{n_{v}})

is a Nash equilibrium point if

\begin{matrix} J_{1} (x_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) \leq J_{1} (x_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)), \end{matrix}

(4)

\begin{matrix} J_{2} (x_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) \leq J_{2} (x_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot)) \end{matrix}

(5)

for all

(u (\cdot), v (\cdot)) \in l^{2} (N_{T}; R^{n_{u}}) \times l^{2} (N_{T}; R^{n_{v}})

.

Lemma 1

([27]). For a symmetric matrix S, we have

(i): $S^{†} = {(S^{†})}^{'}$ ;
(ii): $S S^{†} = S^{†} S$ ;
(iii): $S \geq 0$ if and only if $S^{†} \geq 0$ .

Lemma 2

([27]). Let matrices

F = F^{'}

, H, and

G = G^{'}

with appropriate dimensions consider the following quadratic form:

\begin{matrix} q (y, u) = E [y^{'} F y + 2 y^{'} H u + u^{'} G u], \end{matrix}

where y and u are random variables defined on a probability space (Ω,

F

,

P

). Then, the following conditions are equivalent:

(i): $inf_{u} q (y, u) > - \infty$ for any random variable y.
(ii): There exists a symmetric matrix $S = S^{'}$ such that $inf_{u} q (y, u) = E [y^{'} S y]$ for any random variable y.
(iii): $G \geq 0$ and $H (I - G G^{†}) = 0$ .
(iv): $G \geq 0$ and $K e r (G) \subseteq K e r (H)$ .
(v): There exists a symmetric matrix $T = T^{'}$ such that $[\begin{matrix} F - T & H \\ H^{'} & G \end{matrix}] \geq 0$ .
Moreover, if any of the above conditions hold, then (ii) is satisfied by $S = F - H G^{†} H^{'}$ . In addition, $S \geq T$ for any T satisfying (v). Finally, for any random variable y, the random variable $u^{*} = - G^{†} H^{'} y$ is optimal and the optimal value is $q (y, u^{*}) = E [y^{'} (F - H G^{†} H^{'}) y]$ .

The following LQ result for discrete-time IMJSSs can be directly yielded from Theorem 3.1 [14], which is the continuous-time case.

Lemma 3.

For the following standard LQ optimal control problem with discrete-time IMJSSs,

\begin{matrix} min_{u (\cdot) \in l^{2} (N_{T}; R^{n_{u}})} {J_{2} (y_{0}, ξ_{0}, u (\cdot)) = \sum_{t = 0}^{T} {E [∥ z (t) ∥}^{2}]}, \end{matrix}

(6)

subject to

\{\begin{matrix} y (t + 1) = Q_{0} (t, ξ_{t}) y (t) + R_{0} (t, ξ_{t}) u (t) + \sum_{k = 1}^{r} [Q_{k} (t, ξ_{t}) y (t) + R_{k} (t, ξ_{t}) u (t)] w_{k} (t), \\ z (t) = [\begin{matrix} L (t, ξ_{t}) y (t) \\ M (t, ξ_{t}) u (t) \end{matrix}], M {(t, ξ_{t})}^{'} M (t, ξ_{t}) = I_{n_{u}}, \\ y (0) = y_{0} \in R^{n}, ξ (0) = ξ_{0} \in D, t \in N_{T}, \end{matrix}

(7)

then, we find that

P (t, ς), (t, ς) \in N_{T} \times D

solves the following coupled generalized algebraic Riccati equations:

\begin{matrix} \{\begin{matrix} P (t, ς) = \sum_{k = 0}^{r} Q_{k} {(t, ς)}^{'} ε_{ς} (t, P) Q_{k} (t, ς) + L {(t, ς)}^{'} L (t, ς) \\ - {[\sum_{k = 0}^{r} Q_{k} {(t, ς)}^{'} ε_{ς} (t, P) R_{k} (t, ς)]}^{'} \\ \cdot H {(t, ς, P)}^{- 1} [\sum_{k = 0}^{r} Q_{k} {(t, ς)}^{'} ε_{ς} (t, P) R_{k} (t, ς)], \\ P (T + 1, ς) = 0, H (t, ς, P) > 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(8)

where

H (t, ς, P) = I_{n_{u}} + \sum_{k = 0}^{r} R_{k} {(t, ς)}^{'} ε_{ς} (t, P) R_{k} (t, ς),

ε_{ς} (t, P) = \sum_{ι = 1}^{\infty} p (ς, ι) P (t + 1, ι),

and

min_{u (\cdot) \in l^{2} (N_{T}; R^{n_{u}})} J_{2} (y_{0}, ξ_{0}, u (\cdot)) = J_{2} (y_{0}, ξ_{0}, u^{*} (\cdot)) = \sum_{ς = 1}^{\infty} π_{0} (ς) y_{0}^{'} P (0, ς) y_{0},

u^{*} (t) = - H {(t, ς, P)}^{- 1} [\sum_{k = 0}^{r} Q_{k} {(t, ς)}^{'} ε_{ς} (t, P) R_{k} (t, ς)] y (t) .

3. Nash Equilibrium Points

This section focuses on solving the Nash gameproblem

P

, and we assume that the linear, memoryless feedback strategy has the following form [28]:

\begin{matrix} u (t) = G_{2} (t, ξ_{t}) y (t), v (t) = G_{1} (t, ξ_{t}) y (t), \end{matrix}

(9)

under this assumption, we obtain the feedback Nash equilibrium points by a countably infinite set of coupled generalized algebraic Riccati equations.

Theorem 1.

The Nash game problem

P

has unique Nash equilibrium points

(u^{*} (t) = G_{2} (t, ξ_{t}, X_{2}) y (t), v^{*} (t) = G_{1} (t, ξ_{t}, X_{1}) y (t))

iff the following coupled generalized algebraic Riccati equations,

\begin{matrix} \{\begin{matrix} Λ_{1} (t, ς) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})]}^{'} ε_{ς} (t, Λ_{1}) \\ \cdot [Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})] - L {(t, ς)}^{'} L (t, ς) \\ - G_{2} {(t, ς, Λ_{2})}^{'} G_{2} (t, ς, Λ_{2}) - G_{3} {(t, ς, Λ_{1})}^{'} H_{1} {(t, ς, Λ_{1})}^{†} G_{3} (t, ς, Λ_{1}), \\ (I - H_{1} (t, ς, Λ_{1}) H_{1} {(t, ς, Λ_{1})}^{†}) G_{3} (t, ς, Λ_{1}) = 0, \\ Λ_{1} (T + 1, ς) = 0, H_{1} (t, ς, Λ_{1}) \geq 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(10)

\begin{matrix} G_{1} (t, ς, Λ_{1}) = - H_{1} {(t, ς, Λ_{1})}^{†} G_{3} (t, ς, Λ_{1}), \end{matrix}

(11)

\begin{matrix} \{\begin{matrix} Λ_{2} (t, ς) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + U_{k} (t, ς) G_{1} (t, ς, Λ_{1})]}^{'} ε_{ς} (t, Λ_{2}) \\ \cdot [Q_{k} (t, ς) + U_{k} (t, ς) G_{1} (t, ς, Λ_{1})] + L {(t, ς)}^{'} L (t, ς) \\ - G_{4} {(t, ς, Λ_{2})}^{'} H_{2} {(t, ς, Λ_{2})}^{- 1} G_{4} (t, ς, Λ_{2}), \\ Λ_{2} (T + 1, ς) = 0, H_{2} (t, ς, Λ_{2}) > 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(12)

\begin{matrix} G_{2} (t, ς, Λ_{2}) = - H_{2} {(t, ς, Λ_{2})}^{- 1} G_{4} (t, ς, Λ_{2}) \end{matrix}

(13)

admit a group of solutions

(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2}))

with

Λ_{1} (t, ς) \leq 0

,

Λ_{2} (t, ς) \geq 0

for

(t, ς) \in N_{T} \times D

, where

\begin{matrix} H_{1} (t, ς, Λ_{1}) = γ^{2} I_{n_{v}} + \sum_{k = 0}^{r} U_{k} {(t, ς)}^{'} ε_{ς} (t, Λ_{1}) U_{k} (t, ς), \\ H_{2} (t, ς, Λ_{2}) = I_{n_{u}} + \sum_{k = 0}^{r} R_{k} {(t, ς)}^{'} ε_{ς} (t, Λ_{2}) R_{k} (t, ς), \\ G_{3} (t, ς, Λ_{1}) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})]}^{'} ε_{ς} (t, Λ_{1}) U_{k} (t, ς), \\ G_{4} (t, ς, Λ_{2}) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + U_{k} (t, ς) G_{1} (t, ς, Λ_{1})]}^{'} ε_{ς} (t, Λ_{2}) R_{k} (t, ς) . \end{matrix}

Proof.

Sufficiency: Since Equations (10)–(13) have a group of solutions

(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2})),

by constructing

u^{*} (t) = G_{2} (t, ξ_{t}, Λ_{2}) y (t)

, then, we substitute

u^{*} (t)

into Equation (1) to get the equation as follows:

\{\begin{matrix} y (t + 1) = [Q_{0} (t, ξ_{t}) + R_{0} (t, ξ_{t}) G_{2} (t, ξ_{t}, Λ_{2})] y (t) + U_{0} (t, ξ_{t}) v (t) \\ + \sum_{k = 1}^{r} {[Q_{k} (t, ξ_{t}) + R_{k} (t, ξ_{t}) G_{2} (t, ξ_{t}, Λ_{2})] y (t) + U_{k} (t, ξ_{t}) v (t)} w_{k} (t), \\ z (t) = [\begin{matrix} L (t, ξ_{t}) y (t) \\ M (t, ξ_{t}) G_{2} (t, ξ_{t}, Λ_{2}) y (t) \end{matrix}], M {(t, ξ_{t})}^{'} M (t, ξ_{t}) = I_{n_{u}}, \\ y (0) = y_{0} \in R^{n}, ξ (0) = ξ_{0} \in D, t \in N_{T}, \end{matrix}

(14)

using similar processing, Lemma 1 in [29] can be generalized to the infinite Markov jump systems. In fact, via the assumption that

{w_{t}}_{t \in N_{T}}

and

{ξ_{t}}_{t \in N_{T}}

are mutually independent, and

{w_{t}}_{t \in N_{T}}

is also irrelevant to

v (t)

, we have

\begin{matrix} E [y {(t + 1)}^{'} Λ_{1} (t + 1, ξ_{t + 1}) y (t + 1) - y {(t)}^{'} Λ_{1} (t, ξ_{t}) y (t) | F_{t}, ξ_{t} = ς] \\ = {[\begin{matrix} y (t) \\ v (t) \end{matrix}]}^{'} [\begin{matrix} C (t, ς, Λ_{1}) - Λ_{1} (t, ς) & G_{3} (t, ς, Λ_{1}) \\ G_{3} {(t, ς, Λ_{1})}^{'} & H_{1} (t, ς, Λ_{1}) - γ^{2} I_{n_{v}} \end{matrix}] [\begin{matrix} y (t) \\ v (t) \end{matrix}], \end{matrix}

(15)

it follows from taking summation from 0 to T on both sides of (15) that

\begin{matrix} E [y_{T + 1}^{'} Λ_{1} (T + 1, ξ_{T + 1}) y_{T + 1} - y_{0}^{'} Λ_{1} (0, ξ_{0}) y_{0}] \\ = \sum_{t = 0}^{T} E {[\begin{matrix} y (t) \\ v (t) \end{matrix}]}^{'} [\begin{matrix} C (t, ς, Λ_{1}) - Λ_{1} (t, ς) & G_{3} (t, ς, Λ_{1}) \\ G_{3} {(t, ς, Λ_{1})}^{'} & H_{1} (t, ς, Λ_{1}) - γ^{2} I_{n_{v}} \end{matrix}] [\begin{matrix} y (t) \\ v (t) \end{matrix}] . \end{matrix}

Further, we can obtain the following result:

\begin{matrix} J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) = \sum_{t = 0}^{T} E [γ^{2} {∥ v (t) ∥}^{2} - {∥ z (t) ∥}^{2}] \\ = E [y_{0}^{'} Λ_{1} (0, ξ_{0}) y_{0}] - E [y_{T + 1}^{'} Λ_{1} (T + 1, ξ_{T + 1}) y_{T + 1}] \\ + \sum_{t = 0}^{T} E {[\begin{matrix} y (t) \\ v (t) \end{matrix}]}^{'} D (t, ξ_{t}, Λ_{1}) [\begin{matrix} y (t) \\ v (t) \end{matrix}], \end{matrix}

(16)

where

\begin{matrix} D (t, ξ_{t}, Λ_{1}) = D (t, ς, Λ_{1}) = [\begin{matrix} C (t, ς, Λ_{1}) - L {(t, ς)}^{'} L (t, ς) - Λ_{1} (t, ς) & G_{3} (t, ς, Λ_{1}) \\ G_{3} {(t, ς, Λ_{1})}^{'} & H_{1} (t, ς, Λ_{1}) \end{matrix}], \\ C (t, ς, Λ_{1}) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})]}^{'} ε_{ς} (t, Λ_{1}) \\ \cdot [Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})], \end{matrix}

for

ξ_{t} = ς .

A combination of the method of completing square and (10) turns Equation (16) into

\begin{matrix} J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) = E [y_{0}^{'} Λ_{1} (0, ξ_{0}) y_{0}] \\ + \sum_{t = 0}^{T} E \{E \{{[\begin{matrix} y (t) \\ v (t) \end{matrix}]}^{'} D (t, ξ_{t}, Λ_{1}) [\begin{matrix} y (t) \\ v (t) \end{matrix}] | F_{t - 1}, ξ_{t} = ς\}\} \\ = E [y_{0}^{'} Λ_{1} (0, ξ_{0}) y_{0}] \\ + \sum_{t = 0}^{T} E \{E \{{[v (t) - v^{*} (t)]}^{'} H_{1} (t, ς, Λ_{1}) [v (t) - v^{*} (t)] | F_{t - 1}, ξ_{t} = ς\}\}, \end{matrix}

(17)

where

v^{*} (t) = G_{1} (t, ξ_{t}, Λ_{1}) y (t)

; therefore, the Nash equilibrium inequality (4) occurs naturally, that is to say

\begin{matrix} J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) \geq J_{1} (Λ_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) = E [y_{0}^{'} Λ_{1} (0, ξ_{0}) y_{0}], \end{matrix}

additionally, by plugging

v^{*} (t) = G_{1} (t, ξ_{t}, Λ_{1}) y (t)

into Equation (1), Equation (1) can be converted into

\{\begin{matrix} y (t + 1) = [Q_{0} (t, ξ_{t}) + U_{0} (t, ξ_{t}) G_{1} (t, ξ_{t}, Λ_{1})] y (t) + R_{0} (t, ξ_{t}) u (t) \\ + \sum_{k = 1}^{r} {[Q_{k} (t, ξ_{t}) + U_{k} (t, ξ_{t}) G_{1} (t, ξ_{t}, Λ_{1})] y (t) + R_{k} (t, ξ_{t}) u (t)} w_{k} (t), \\ z (t) = [\begin{matrix} L (t, ξ_{t}) y (t) \\ M (t, ξ_{t}) u (t) \end{matrix}], M {(t, ξ_{t})}^{'} M (t, ξ_{t}) = I_{n_{u}}, \\ y (0) = y_{0} \in R^{n}, ξ (0) = ξ_{0} \in D, t \in N_{T}, \end{matrix}

(18)

in the meantime, we also have

\begin{matrix} J_{2} (y_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot)) = \sum_{t = 0}^{T} E [y {(t)}^{'} L {(t, ξ_{t})}^{'} L (t, ξ_{t}) y (t) + u {(t)}^{'} u (t)] . \end{matrix}

(19)

Obviously, to illustrate that

u^{*} (t)

minimizes (3), we only need to prove a kind of LQ optimal control problem of IMJSSs that causes (19) minimization under the condition of (18). Via Lemma 3, we can easily prove the Nash equilibrium inequality (5).

Necessity: Assume that

(u^{*} (t), v^{*} (t)) = (G_{2} (t, ξ_{t}) y (t), G_{1} (t, ξ_{t}) y (t))

is a linear feedback Nash equilibrium point for the Nash game (4) and (5). Putting

u^{*} (t)

into Equation (1), we gain

\{\begin{matrix} y (t + 1) = [Q_{0} (t, ξ_{t}) + R_{0} (t, ξ_{t}) G_{2} (t, ξ_{t})] y (t) + U_{0} (t, ξ_{t}) v (t) \\ + \sum_{k = 1}^{r} {[Q_{k} (t, ξ_{t}) + R_{k} (t, ξ_{t}) G_{2} (t, ξ_{t})] y (t) + U_{k} (t, ξ_{t}) v (t)} w_{k} (t), \\ z (t) = [\begin{matrix} L (t, ξ_{t}) y (t) \\ M (t, ξ_{t}) G_{2} (t, ξ_{t}) y (t) \end{matrix}], M {(t, ξ_{t})}^{'} M (t, ξ_{t}) = I_{n_{u}}, \\ y (0) = y_{0} \in R^{n}, ξ (0) = ξ_{0} \in D, t \in N_{T}, \end{matrix}

(20)

now discover that the above problem is transformed into the following indefinite LQ optimal control with optimal solution

v^{*} (t) = K_{1} (t, ξ_{t}) y (t)

:

\{\begin{matrix} min_{v (\cdot) \in l^{2} (N_{T}; R^{n_{v}})} {J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) = \sum_{t = 0}^{T} E [γ^{2} {∥ v (t) ∥}^{2} - {∥ z (t) ∥}^{2}]}, \\ s u b j e c t t o (20), \end{matrix}

(21)

it is obvious that the above indefinite LQ problem (21) is well-posed. Next, we prove that the coupled generalized algebraic Riccati Equation (22) is solvable by mathematical induction:

\begin{matrix} \{\begin{matrix} {\tilde{Λ}}_{1} (t, ς) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς)]}^{'} ε_{ς} (t, {\tilde{Λ}}_{1}) [Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς)] \\ - L {(t, ς)}^{'} L (t, ς) - G_{2} {(t, ς)}^{'} G_{2} (t, ς) \\ - G_{3} {(t, ς, {\tilde{Λ}}_{1})}^{'} H_{1} {(t, ς, {\tilde{Λ}}_{1})}^{†} G_{3} (t, ς, {\tilde{Λ}}_{1}), \\ (I - H_{1} (t, ς, {\tilde{Λ}}_{1}) H_{1} {(t, ς, {\tilde{Λ}}_{1})}^{†}) G_{3} (t, ς, {\tilde{Λ}}_{1}) = 0, \\ {\tilde{Λ}}_{1} (T + 1, ς) = 0, H_{1} (t, ς, {\tilde{Λ}}_{1}) \geq 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(22)

to this end, the value function is introduced as follows:

\begin{matrix} V (τ, y (τ), ξ_{τ}) = inf_{v (\cdot) \in l^{2} ([τ, T]; R^{n_{v}})} J_{1} (y (τ), ξ_{τ}, u^{*} (\cdot), v (\cdot)) \\ = inf_{v (\cdot) \in l^{2} ([τ, T]; R^{n_{v}})} \sum_{t = τ}^{T} {E {y (t)}^{'} [- L {(t, ξ_{t})}^{'} L (t, ξ_{t}) - G_{2} {(t, ξ_{t})}^{'} G_{2} (t, ξ_{t})] y (t) \\ + γ^{2} v {(t)}^{'} v (t)}, \end{matrix}

(23)

for

τ = T

, we can obtain from

{\tilde{Λ}}_{1} (T + 1, i) = 0

that the existence of

{\tilde{Λ}}_{1} (T, i)

motivates

\begin{matrix} V (T, y (T), ξ_{T}) = inf_{v (T)} {E {y (T)}^{'} [- L {(T, ξ_{T})}^{'} L (T, ξ_{T}) - G_{2} {(T, ξ_{T})}^{'} G_{2} (T, ξ_{T})] y (T) \\ + γ^{2} v {(T)}^{'} v (T)} \\ = inf_{v (T)} E [y {(T)}^{'} {\tilde{Λ}}_{1} (T, ς) y (T) + γ^{2} v {(T)}^{'} v (T)], \end{matrix}

(24)

it is noteworthy that Equation (22) is available for

t = T

with

{\tilde{Λ}}_{1} (T, ς)

, and the optimal value function is

\begin{matrix} V (T, y (T), ξ_{T}) = E [y {(T)}^{'} {\tilde{Λ}}_{1} (T, ξ_{T}) y (T)], \end{matrix}

(25)

in the process,

\begin{matrix} v^{*} (T) = 0 = - H_{1} {(T, ξ_{T}, {\tilde{Λ}}_{1})}^{†} G_{3} (T, ξ_{T}, {\tilde{Λ}}_{1}) y (T) . \end{matrix}

(26)

then, for

t = τ

, assume that the coupled generalized algebraic Riccati Equation (22) has a solution

{\tilde{Λ}}_{1} (τ, ς), ς \in D

. Meanwhile, the optimal value function is

\begin{matrix} V (τ, y (τ), ξ_{τ}) = \sum_{i = 1}^{\infty} π_{τ} (i) y {(τ)}^{'} {\tilde{Λ}}_{1} (τ, ξ_{τ}) y (τ), \end{matrix}

and the optimal control is

\begin{matrix} v^{*} (τ) = - H_{1} {(τ, ξ_{τ}, {\tilde{Λ}}_{1})}^{†} G_{3} (τ, ξ_{τ}, {\tilde{Λ}}_{1}) y (τ), \end{matrix}

now, our goal is to illustrate that for

t = τ - 1

, the existence of solution

{\tilde{Λ}}_{1} (τ - 1, ς), ς \in D

to Equation (22). By the aid of (20), the dynamic programming optimality principle applies to

\begin{matrix} V (τ - 1, y (τ - 1), ξ_{τ - 1}) = inf_{v (τ - 1)} E {y {(τ - 1)}^{'} [- L {(τ - 1, ξ_{τ - 1})}^{'} L (τ - 1, ξ_{τ - 1}) \\ - G_{2} {(τ - 1, ξ_{τ - 1})}^{'} G_{2} (τ - 1, ξ_{τ - 1})] y (τ - 1) \\ + γ^{2} v {(τ - 1)}^{'} v (τ - 1) + V (τ, y (τ), ξ_{τ})}, \\ = inf_{v (τ - 1)} {E [y (τ - 1)}^{'} Γ (τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1}) y (τ - 1) \\ + 2 v {(τ - 1)}^{'} G_{3} {(τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1})}^{'} y (τ - 1) \\ + v {(τ - 1)}^{'} H_{1} (τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1}) v (τ - 1)], \end{matrix}

(27)

where

\begin{matrix} Γ (τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1}) = \sum_{k = 0}^{r} {[Q_{k} (τ - 1, ξ_{τ - 1}) + R_{k} (τ - 1, ξ_{τ - 1}) G_{2} (τ - 1, ξ_{τ - 1})]}^{'} \\ \cdot ε_{ξ_{τ - 1}} (τ - 1, {\tilde{Λ}}_{1}) [Q_{k} (τ - 1, ξ_{τ - 1}) + R_{k} (τ - 1, ξ_{τ - 1}) G_{2} (τ - 1, ξ_{τ - 1})] \\ - L {(τ - 1, ξ_{τ - 1})}^{'} L (τ - 1, ξ_{τ - 1}) - G_{2} {(τ - 1, ξ_{τ - 1})}^{'} G_{2} (τ - 1, ξ_{τ - 1}), \end{matrix}

similar results to Lemma 2 apply to the infinite Markov jump case, it is demonstrated that

{\tilde{Λ}}_{1} (τ - 1, ς), ς \in D

satisfies the following equation:

\{\begin{matrix} {\tilde{Λ}}_{1} (τ - 1, ς) = Γ (τ - 1, ς, {\tilde{Λ}}_{1}) - G_{3} {(τ - 1, ς, {\tilde{Λ}}_{1})}^{'} H_{1} {(τ - 1, ς, {\tilde{Λ}}_{1})}^{†} G_{3} (τ - 1, ς, {\tilde{Λ}}_{1}), \\ (I - H_{1} (τ - 1, ς, {\tilde{Λ}}_{1}) H_{1} {(τ - 1, ς, {\tilde{Λ}}_{1})}^{†}) G_{3} (τ - 1, ς, {\tilde{Λ}}_{1}) = 0, \\ {\tilde{Λ}}_{1} (T + 1, ς) = 0, H_{1} (τ - 1, ς, {\tilde{Λ}}_{1}) \geq 0, ς \in D, \end{matrix}

(28)

the following conclusions can be handled in the same manner as (17), that is

\begin{matrix} V (τ - 1, y (τ - 1), ξ_{τ - 1}) = \sum_{ς = 1}^{\infty} π_{τ - 1} (i) y {(τ - 1)}^{'} P (τ - 1, ς) y (τ - 1), \end{matrix}

(29)

and

\begin{matrix} v^{*} (τ - 1) = - H_{1} {(τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1})}^{†} G_{3} (τ - 1, ξ_{τ - 1}, {\tilde{Λ}}_{1}) y (τ - 1), \end{matrix}

(30)

up to the present, one can infer that there exists

{\tilde{Λ}}_{1} (t, ς), (t, ς) \in N_{T} \times D

satisfying Equation (22). Furthermore, an optimal solution of the indefinite LQ problem (21) is

v^{*} (t) = - H_{1} {(t, ξ_{t}, {\tilde{Λ}}_{1})}^{†} G_{3} (t, ξ_{t}, {\tilde{Λ}}_{1}) y (t)

with

G_{1} (t, ξ_{t}) = - H_{1} {(t, ξ_{t}, {\tilde{Λ}}_{1})}^{†} G_{3} (t, ξ_{t}, {\tilde{Λ}}_{1}) = G_{1} (t, ξ_{t}, {\tilde{Λ}}_{1})

. Amalgamating the above results into Equation (22), it is stated that

{\tilde{Λ}}_{1} (t, ς) = Λ_{1} (t, ς), (t, ς) \in N_{T} \times D

. It only remains to indicate

Λ_{1} (t, ς) \leq 0, (t, ς) \in N_{T} \times D

. In practice,

\begin{matrix} J_{1} (y_{0}, ς, u^{*} (\cdot), v^{*} (\cdot)) = \sum_{ς = 1}^{\infty} π_{0} (ς) y_{0}^{'} Λ_{1} (0, ς) y_{0} \leq J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), 0) = \sum_{t = 0}^{T} {E [- ∥ z (t) ∥}^{2}]} \leq 0, \end{matrix}

in addition, if we plug

v^{*} (t) = G_{1} (t, ξ_{t}) y (t)

into Equation (1), then (18) is gained. We can deduce that (5) is a standard LQ control subject to (18). Using Lemma 3,

Λ_{2} (t, ς) \geq 0, (t, ς) \in N_{T} \times D

can be easily accessible. Moreover,

J_{2} (y_{0}, ς, u^{*} (\cdot), v^{*} (\cdot)) = \sum_{ς = 1}^{\infty} π_{0} (ς) y_{0}^{'} Λ_{2} (0, ς) y_{0}

with

u^{*} (t) = G_{2} (t, ξ_{t}) y (t) = G_{2} (t, ξ_{t}, Λ_{2}) y (t) = - H_{2} {(t, ξ_{t}, Λ_{2})}^{- 1} G_{4} (t, ξ_{t}, Λ_{2}) y (t) .

The proof has been completed. □

Remark 1.

Note that Theorem 1 can be considered as an extension of [22] to infinite jumps and multiplicative noise and a discrete-time version of [23].

Remark 2.

If the infinite horizon cost function is concerned, it is much more challenging owing to the requirement of stabilization limitation for the closed-loop system. As discussed in [26], the infinite horizon LQ Nash game has been considered.

4. Application to Special Case

In the previous section, the Nash game problem for discrete-time IMJSSs is solved. It should be noted that when

γ

takes different values or

v (t)

is regarded as exogenous disturbance. As a special case, we discuss the finite horizon robust

H_{2} / H_{\infty}

control problem from a new perspective and further explore the relationship between Nash equilibrium points and finite horizon

H_{2} / H_{\infty}

control with some remarks.

4.1. Finite Horizon $H_{2} / H_{\infty}$ Control

v (t)

in system (1) is seen as one of the players in the Nash game problem; as a matter of fact, it is more a case of considering

v (t)

as an exogenous disturbance in most control systems. Let

γ > 0

be a prescribed disturbance attenuation. In consequence, the original Nash game problem is turned into finding a controller

u^{*} (\cdot) \in l^{2} (N_{T}; R^{n_{u}})

such that

(i): $∥ L_{T} ∥ < γ$ for the closed system of Equation (1) with $x_{0} = 0$ ;
(ii): if $v^{*} (\cdot)$ exists, $u^{*} (\cdot)$ minimizes the output energy:

$\begin{matrix} J_{2} (y_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot)) = \sum_{t = 0}^{T} {E [∥ z (t) ∥}^{2}], \end{matrix}$

(31)

where $v^{*} (\cdot)$ is the worst case disturbance and

$\begin{matrix} v^{*} (\cdot) & = arg max_{\begin{matrix} v \end{matrix}} J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) \\ = arg max_{\begin{matrix} v \end{matrix}} \sum_{t = 0}^{T} E [γ^{2} {∥ v (t) ∥}^{2} - {∥ z (t) ∥}^{2}], \end{matrix}$

(32)

in other words, the above problem is called the finite horizon $H_{2} / H_{\infty}$ control problem. We need to pay attention to the definition of perturbed operator $∥ L_{T} ∥$ given in [29,30].

As an applications of Nash game problem, it can be obtained from Theorem 1 that

\begin{matrix} ∥ L_{T} ∥ \leq γ, \end{matrix}

(33)

and

\begin{matrix} J_{2} (y_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) \leq J_{2} (y_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot)), \end{matrix}

(34)

where

u^{*} (t)

and

v^{*} (t)

are defined in Theorem 1.

Remark 3.

In accordance with the definition of

H_{2} / H_{\infty}

control problem, it is crucial to note that

∥ L_{T} ∥ \leq γ

does not come down to

∥ L_{T} ∥ < γ

. If we can confirm that

∥ L_{T} ∥ \leq γ

can be replaced by

∥ L_{T} ∥ < γ

, then the following result would be given naturally.

Theorem 2.

For system (1), assume the following coupled generalized algebraic Riccati equations:

\begin{matrix} \{\begin{matrix} Λ_{1} (t, ς) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})]}^{'} ε_{ς} (t, Λ_{1}) \\ \cdot [Q_{k} (t, ς) + R_{k} (t, ς) G_{2} (t, ς, Λ_{2})] - L {(t, ς)}^{'} L (t, ς) \\ - G_{2} {(t, ς, Λ_{2})}^{'} G_{2} (t, ς, Λ_{2}) - G_{3} {(t, ς, Λ_{1})}^{'} H_{1} {(t, ς, Λ_{1})}^{- 1} G_{3} (t, ς, Λ_{1}), \\ Λ_{1} (T + 1, ς) = 0, H_{1} (t, ς, Λ_{1}) > 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(35)

\begin{matrix} G_{1} (t, ς, Λ_{1}) = - H_{1} {(t, ς, Λ_{1})}^{- 1} G_{3} (t, ς, Λ_{1}), \end{matrix}

(36)

\begin{matrix} \{\begin{matrix} Λ_{2} (t, ς) = \sum_{k = 0}^{r} {[Q_{k} (t, ς) + U_{k} (t, ς) G_{1} (t, ς, Λ_{1})]}^{'} ε_{ς} (t, Λ_{2}) \\ \cdot [Q_{k} (t, ς) + U_{k} (t, ς) G_{1} (t, ς, Λ_{1})] + L {(t, ς)}^{'} L (t, ς) \\ - G_{4} {(t, ς, Λ_{2})}^{'} H_{2} {(t, ς, Λ_{2})}^{- 1} G_{4} (t, ς, Λ_{2}), \\ Λ_{2} (T + 1, ς) = 0, H_{2} (t, ς, Λ_{2}) > 0, (t, ς) \in N_{T} \times D, \end{matrix} \end{matrix}

(37)

\begin{matrix} G_{2} (t, ς, Λ_{2}) = - H_{2} {(t, ς, Λ_{2})}^{- 1} G_{4} (t, ς, Λ_{2}) \end{matrix}

(38)

admit solutions

(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2}))

with

Λ_{1} (t, ς) \leq 0

,

Λ_{2} (t, ς) \geq 0

for

(t, ς) \in N_{T} \times D

, then the finite horizon

H_{2} / H_{\infty}

control optimal controller is

u^{*} (t) = G_{2} (t, ς, Λ_{2}) y (t)

,

v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t)

. Conversely, if the finite horizon

H_{2} / H_{\infty}

control problem has the solution

u^{*} (t) = G_{2} (t, ς, Λ_{2}) y (t)

,

v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t)

, and

H_{1} (t, ς, Λ_{1}) > 0

, then the coupled generalized algebraic Riccati Equations (35)–(38) admit a group of solutions

(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2}))

with

Λ_{1} (t, ς) \leq 0

,

Λ_{2} (t, ς) \geq 0

for

(t, ς) \in N_{T} \times D

.

Proof.

Sufficiency: It is obvious from the sufficiency part in Theorem 1 that we need only to explain

∥ L_{T} ∥ < γ

. In fact, in the light of the definition of perturbed operator

∥ L_{T} ∥

, when

x_{0} = 0

, note that the condition

H_{1} (t, ς, Λ_{1}) > 0

of (35), Equation (17) means

J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)) = 0

iff

v (t) = v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t)

. Putting

v^{*} (t)

into Equation (14), the obtained closed-loop system with initial state

y_{0} = 0

leads to the state response

y (t) \equiv 0, t \in N_{T}

. One step further,

v (t) = v^{*} (t) = 0

naturally occurs. Then, the inescapable conclusion is that

J_{1} (0, ξ_{0}, u^{*} (\cdot), v (\cdot)) > 0

iff

v (t) = v^{*} (t) \neq 0

, which stands for

∥ L_{T} ∥ < γ

.

Necessity: Assume that the finite horizon

H_{2} / H_{\infty}

control problem has the solution

u^{*} (t) = G_{2} (t, ς, Λ_{2}) y (t)

,

v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t)

, a combination of (31) and (32) causes

J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) \leq J_{1} (y_{0}, ξ_{0}, u^{*} (\cdot), v (\cdot)), J_{2} (y_{0}, ξ_{0}, u^{*} (\cdot), v^{*} (\cdot)) \leq J_{2} (y_{0}, ξ_{0}, u (\cdot), v^{*} (\cdot))

for all

(u (\cdot), v (\cdot)) \in l^{2} (N_{T}; R^{n_{u}}) \times l^{2} (N_{T}; R^{n_{v}})

. By now, it can be deduced from the necessity of Theorem 1 that Equations (10)–(13) have solutions. Keep in mind that

H_{1} (t, ς, Λ_{1}) > 0

, hence, Equations (35)–(38) admit a group of solutions

(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2}))

with

Λ_{1} (t, ς) \leq 0

,

Λ_{2} (t, ς) \geq 0

for

(t, ς) \in N_{T} \times D

. The proof has been completed. □

Remark 4.

Although the finite horizon

H_{2} / H_{\infty}

control problem has been solved in [30], we yield a similar result from the new perspective of a Nash game.

Remark 5.

By comparing Theorem 1 with Theorem 2, surely the existence of Nash equilibrium points and the solvability of finite horizon

H_{2} / H_{\infty}

control are not equivalent for system (1), which differs from the continuous-time case as described in [26]. The main cause of the inequivalence lies in that the condition

H_{1} (t, ς, Λ_{1}) > 0

of Equation (35) cannot meet the Nash equilibrium problem, namely,

∥ L_{T} ∥ < γ

is not equivalent to

∥ L_{T} ∥ \leq γ

.

4.2. Some Remarks on Nash Equilibrium Points

As a point of fact, only when

H_{1} (t, ς, Λ_{1}) > 0

is the equivalence between Nash equilibrium points and finite horizon

H_{2} / H_{\infty}

control valid. Accordingly, the relationship between the Nash game and

H_{2} / H_{\infty}

control problem can be discussed in the following theorem.

Theorem 3.

For system (1), under the condition of

H_{1} (t, ς, Λ_{1}) = γ^{2} I_{n_{v}} + \sum_{k = 0}^{r} U_{k} {(t, ς)}^{'} ε_{ς} (t, Λ_{1}) U_{k} (t, ς) > 0,

the following statements are equivalent:

(i): There exists a linear memoryless Nash equilibrium points $(u^{*} (\cdot), v^{*} (\cdot))$ with $(u^{*} (t) = G_{2} (t, ς, Λ_{2}) y (t), v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t));$
(ii): The finite horizon $H_{2} / H_{\infty}$ control is solvable with $u^{*} (t) = G_{2} (t, ς, Λ_{2}) y (t), v^{*} (t) = G_{1} (t, ς, Λ_{1}) y (t);$
(iii): The coupled generalized algebraic Riccati Equations (35)–(38) have a group of solutions $(Λ_{1} (t, ς), G_{1} (t, ς, Λ_{1}); Λ_{2} (t, ς), G_{2} (t, ς, Λ_{2}))$ with $Λ_{1} (t, ς) \leq 0, Λ_{2} (t, ς) \geq 0$ for $(t, ς) \in N_{T} \times D$ .

Proof.

Theorem 3 can be demonstrated through Theorem 1 and Theorem 2. □

Remark 6.

Keep in mind that when

H_{1} (t, ς, Λ_{1}) > 0

,

H_{1} {(t, ς, Λ_{1})}^{†} = H_{1} {(t, ς, Λ_{1})}^{- 1}

, at this point, Equations (10)–(13) are consistent with Equations (35)–(38). In other words, for system (1), the existence of Nash equilibrium problem, the solvable of finite horizon

H_{2} / H_{\infty}

control and the solvability of Equations (35)–(38) are equivalent. In addition, under the limitation of

H_{1} (t, ς, Λ_{1}) > 0

, a unified treatment for

H_{2}

,

H_{\infty}

and

H_{2} / H_{\infty}

control can be investigated such as [22].

5. Numerical Example

In this section, to solve the coupled generalized algebraic Riccati Equations (35)–(38), we provide an iterative algorithm, which can be summarized as follows:

(i): When $t = T$ , the terminal condition $Λ_{1} (T + 1, ς) = 0$ and $Λ_{2} (T + 1, ς) = 0$ can obtain $H_{1} (T, ς, Λ_{1})$ and $H_{2} (T, ς, Λ_{2})$ ;
(ii): Working out (36) and (38), $G_{1} (T, ς, Λ_{1})$ and $G_{2} (T, ς, Λ_{2})$ can be computed;
(iii): To compute (35) and (37), it is found that $Λ_{1} (T, ς) \leq 0$ and $Λ_{2} (T, ς) \geq 0$ ;
(iv): Repeating the above procedures, for $t = T - 1, T - 2, \dots, 0,$ we can compute that $Λ_{1} (t, ς) \leq 0$ , $Λ_{2} (t, ς) \geq 0$ , $G_{1} (t, ς, Λ_{1})$ and $G_{2} (t, ς, Λ_{2})$ .

Remark 7.

It should be noted that for the coupled generalized algebraic Riccati Equations (35)–(38),

H_{1} (t, ς, Λ_{1}) > 0

and

H_{2} (t, ς, Λ_{2}) > 0

are prerequisites for the effectiveness of the above iterative algorithm. Similarly, we can easily derive the algorithm used to solve the coupled generalized algebraic Riccati Equations (10)–(13).

Next, a numerical example will be presented to show the validity of the proposed method.

Example 1.

Consider a three-stage one-dimensional discrete-time IMJSS with coefficients in Table 1.

For

{ξ_{t}}_{t \in N_{T}}

, the element of

P = [p (ς, ι)]

is given by

p (ς, ς) = \frac{1}{2}, p (ς, ς + 1) = \frac{1}{2}

and

p_{ς ι} = 0

,

ς \in D

,

ι \in D / {ς, ς + 1}

. Set

γ = \frac{\sqrt{2}}{2}

, the coupled generalized algebraic Riccati Equations (10)–(13) are solved by the above iterative algorithm with

\begin{matrix} G_{1} (0, ς, Λ_{1}) = - \frac{3 (ς + 1)}{3 {(ς + 1)}^{2} - 4}, G_{2} (0, ς, Λ_{2}) = - 0.53, \\ Λ_{1} (0, ς) = - 0.36 - \frac{3}{6 {(ς + 1)}^{2} - 8} - \frac{ς}{ς + 1} \leq 0, \\ Λ_{2} (0, ς) = 0.55 + \frac{110 - 33 {(ς + 1)}^{2}}{27 {(ς + 1)}^{4} - 72 {(ς + 1)}^{2} + 48} + \frac{ς}{ς + 1} \geq 0 . \end{matrix}

Therefore, the Nash equilibrium points and optimal

H_{2} / H_{\infty}

controller of the considered system will be obtained naturally.

6. Conclusions

This paper mainly explores a finite horizon LQ non-zero sum Nash game for discrete-time IMJSSs, whose system is governed by a countable Markov chain. The Nash equilibrium points for the considered system are solved by a countably infinite set of coupled generalized algebraic Riccati equations. Then, some special cases are given via a new perspective of the Nash game, which is the finite horizon mixed

H_{2} / H_{\infty}

control with some remarks, and we summarized the relationship between Nash game and

H_{2} / H_{\infty}

control problem. The contents in this paper are an extension and improvement of the previous works [22,23] in the MJSSs case. In fact, to solve the difficulties caused by the countable Markov chain, we introduce the infinite dimension Banach spaces, where the elements are countably infinite sequences of linear and bounded operators. Besides, to overcome the difficulty of solving a countably infinite set of coupled generalized algebraic Riccati equations, we present an iterative algorithm. In the future, the infinite horizon Nash game for IMJSSs can be considered.

Author Contributions

Conceptualization, Y.L.; methodology, Y.L.; formal analysis, Y.L.; investigation, Z.W. and X.L.; writing—original draft preparation, Y.L.; writing—review and editing, Y.L.; supervision, X.L. and Z.W.; project administration, Y.L.; funding acquisition, Y.L., Z.W. and X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Natural Science Foundation of Qingdao under Grant 23-2-1-7-zyyd-jch, Social Science Planning and Research Special Project of Shandong Province under Grant 22CSDJ43, Natural Science Foundation of China under grant 62273212, the Natural Science Foundation of Shandong Province under grant ZR2020MF062 and People Benefit Project of Qingdao under Grant 22-3-7-smjk-16-nsh.

Data Availability Statement

Not applicable.

Acknowledgments

We would like to thank the anonymous reviewers for their constructive suggestions to improve the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shafieepoorfard, E.; Raginsky, M.; Meyn, S.P. Rationally inattentive control of Markov processes. SIAM J. Control Optim. 2016, 54, 987–1016. [Google Scholar] [CrossRef]
Veretennikov, A.Y.; Veretennikova, M.A. On improved bounds and conditions for the convergence of Markov chains. Izv. Math. 2022, 86, 92–125. [Google Scholar] [CrossRef]
Khasminskii, R.Z. Stability of regime-switching stochastic differential equations. Probl. Inform. Transm. 2012, 48, 259–270. [Google Scholar] [CrossRef]
Li, F.; Xu, S.; Shen, H.; Zhang, Z. Extended dissipativity-based control for hidden Markov jump singularly perturbed systems subject to general probabilities. IEEE Trans. Syst. Man Cybern. Syst. 2021, 51, 5752–5761. [Google Scholar] [CrossRef]
Wang, L.; Wu, Z.G.; Shen, Y. Asynchronous mean stabilization of positive jump systems with piecewise-homogeneous Markov chain. IEEE Trans. Circuits Syst. II Exp. Briefs 2021, 68, 3266–3270. [Google Scholar] [CrossRef]
Wang, B.; Zhu, Q. Stability analysis of discrete-time semi-Markov jump linear systems with partly unknown semi-Markov kernel. Syst. Control Lett. 2020, 140, 104688. [Google Scholar] [CrossRef]
Zhao, X.; Deng, F.; Gao, W. Exponential stability of stochastic Markovian jump systems with time-varying and distributed delays. Sci. China Inf. Sci. 2021, 64, 209202:1–209202:3. [Google Scholar] [CrossRef]
Han, X.; Wu, K.N.; Niu, Y. Asynchronous boundary control of Markov jump Neural networks with diffusion terms. IEEE Trans. Cybern. 2023, 53, 4962–4971. [Google Scholar] [CrossRef]
Xue, M.; Yan, H.; Zhang, H.; Shen, H.; Peng, S. Dissipativity-based filter design for Markov jump systems with packet loss compensation. Automatica 2021, 133, 109843. [Google Scholar] [CrossRef]
Hou, T.; Ma, H. Exponential stability for discrete-time infinite Markov jump systems. IEEE Trans. Autom. Control. 2016, 61, 4241–4246. [Google Scholar] [CrossRef]
Ma, H.; Jia, Y. Stability analysis for stochastic differential equations with infinite Markovian switchings. J. Math. Anal. Appl. 2016, 435, 593–605. [Google Scholar] [CrossRef]
Song, R.; Zhu, Q. Stability of linear stochastic delay differential equations with infinite Markovian switchings. Int. J. Robust Nonlinear Control 2018, 28, 825–837. [Google Scholar] [CrossRef]
Hou, T.; Liu, Y.; Deng, F. Stability for discrete-time uncertain systems with infinite Markov jump and time-delay. Sci. China Inf. Sci. 2021, 64, 152202:1–152202:11. [Google Scholar] [CrossRef]
Hou, T.; Liu, Y.; Deng, F. Finite horizon H₂/H_∞ control for SDEs with infinite Markovian jumps. Nonlinear Anal. Hybrid Syst. 2019, 34, 108–120. [Google Scholar] [CrossRef]
Liu, Y.; Hou, T. Robust H₂/H_∞ fuzzy filtering for nonlinear stochastic systems with infinite Markov jump. J. Syst. Sci. Complex. 2020, 33, 1023–1039. [Google Scholar] [CrossRef]
Dockner, E.J.; Jorgensen, S.; Long, N.V. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
Chen, B.S.; Tseng, C.S.; Uang, H.J. Fuzzy differential games for nonlinear stochastic systems: Suboptimal approach. IEEE Trans. Fuzzy Syst. 2002, 10, 222–233. [Google Scholar] [CrossRef]
Lin, Y.; Zhang, T.; Zhang, W. Infinite horizon linear quadratic Pareto game of the stochastic singular systems. J. Frankl. Inst. 2018, 355, 4436–4452. [Google Scholar] [CrossRef]
Moon, J. Linear-quadratic stochastic leader-follower differential games for Markov jump-diffusion models. Automatica 2023, 147, 110713. [Google Scholar] [CrossRef]
Gao, X.; Deng, F.; Zeng, P. Zero-sum game-based security control of unknown nonlinear Markov jump systems under false data injection attacks. Int. J. Robust Nonlinear Control 2022. Early Access. [Google Scholar] [CrossRef]
Dufour, F.; Prieto-Rumeau, T. Stationary Markov Nash equilibria for nonzero-sum constrained ARAT Markov games. SIAM J. Control Optim. 2022, 60, 945–967. [Google Scholar] [CrossRef]
Hou, T.; Zhang, W. A game-based control design for discrete-time Markov jump systems with multiplicative noise. IET Control Theory Appl. 2013, 7, 773–783. [Google Scholar] [CrossRef]
Sheng, L.; Zhang, W.; Gao, M. Relationship between Nash equilibrium strategies and H₂/H_∞ control of stochastic Markov jump systems with multiplicative noise. IEEE Trans. Autom. Control. 2014, 59, 2592–2597. [Google Scholar] [CrossRef]
Sheng, L.; Zhang, W.; Gao, M. Some remarks on infinite horizon stochastic H₂/H_∞ control with (x, u, v) dependent noise and Markov jumps. J. Frankl. Inst. 2015, 352, 3929–3946. [Google Scholar] [CrossRef]
Dragan, V.; Morozan, T.; Stoica, A.M. Mathematical Methods in Robust Control of Linear Stochastic Systems, 2nd ed.; Springer: New York, NY, USA, 2013. [Google Scholar]
Liu, Y.; Hou, T. Infinite horizon LQ Nash Games for SDEs with infinite jumps. Asian J. Control 2021, 23, 2431–2443. [Google Scholar] [CrossRef]
Rami, M.A.; Chen, X.; Zhou, X. Discrete-time indefinite LQ control with state and control dependent noises. J. Glob. Optim. 2002, 23, 245–265. [Google Scholar] [CrossRef]
Basar, T.; Olsder, G.J. Dynamic Noncooperative Game Theory; SIAM: Philadelphia, PA, USA, 1999. [Google Scholar]
Hou, T.; Zhang, W.; Ma, H. Finite horizon H₂/H_∞ control for discrete-time stochastic systems with Markovian jumps and multiplicative noise. IEEE Trans. Autom. Control. 2010, 55, 1185–1191. [Google Scholar]
Wang, J.; Hou, T. Finite horizon H₂/H_∞ control for discrete-time time-varying stochastic systems with infinite Markov jumps. In Proceedings of the 36th Chinese Control Conference, Dalian, China, 26–28 July 2017. [Google Scholar]

Table 1. Coefficients of considered system.

Coefficients	$t = 0$	$t = 1$	$t = 2$
$Q_{0} (t, ς)$	$\frac{1}{2}$	1	$\frac{1}{3 (ς + 1)}$
$Q_{1} (t, ς)$	1	1	$\frac{1}{ς + 1}$
$R_{0} (t, ς)$	1	$- \frac{1}{ς + 1}$	$\frac{ς}{ς + 1}$
$R_{1} (t, ς)$	1	$\frac{1}{ς + 1}$	$\frac{1}{2 {(ς + 1)}^{2}}$
$U_{0} (t, ς)$	$- \frac{1}{ς + 1}$	1	$\frac{1}{ς + 1}$
$U_{1} (t, ς)$	$\frac{1}{ς + 1}$	1	1
$L (t, ς)$	$\sqrt{\frac{ς}{ς + 1}}$	1	1
$M (t, ς)$	1	1	1

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, Y.; Wang, Z.; Lin, X. Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications. Axioms 2023, 12, 882. https://doi.org/10.3390/axioms12090882

AMA Style

Liu Y, Wang Z, Lin X. Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications. Axioms. 2023; 12(9):882. https://doi.org/10.3390/axioms12090882

Chicago/Turabian Style

Liu, Yueying, Zhen Wang, and Xiangyun Lin. 2023. "Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications" Axioms 12, no. 9: 882. https://doi.org/10.3390/axioms12090882

APA Style

Liu, Y., Wang, Z., & Lin, X. (2023). Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications. Axioms, 12(9), 882. https://doi.org/10.3390/axioms12090882

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications

Abstract

1. Introduction

2. Preliminaries

3. Nash Equilibrium Points

4. Application to Special Case

4.1. Finite Horizon $H_{2} / H_{\infty}$ Control

4.2. Some Remarks on Nash Equilibrium Points

5. Numerical Example

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Non-Zero Sum Nash Game for Discrete-Time Infinite Markov Jump Stochastic Systems with Applications

Abstract

1. Introduction

2. Preliminaries

3. Nash Equilibrium Points

4. Application to Special Case

4.1. Finite Horizon H 2 / H ∞ Control

4.2. Some Remarks on Nash Equilibrium Points

5. Numerical Example

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.1. Finite Horizon $H_{2} / H_{\infty}$ Control