Mathematics
  • Article
  • Open Access

1 December 2025

Hybrid Partial-Data-Driven H∞ Robust Tracking Control for Linear Stochastic Systems with Discrete-Time Observation of Reference Trajectory

1
College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China
2
College of Electronic and Information Engineering, Shandong University of Science and Technology, Qingdao 266590, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Stochastic System Analysis and Control

Abstract

A hybrid robust H∞ tracking-control design method is studied for linear stochastic systems in which the parameters of the reference system are unknown but inferred from discrete-time observations. First, the reference system parameters are estimated by the least-squares method, and a corresponding data-dependent augmented system is constructed. Second, a Riccati matrix inequality is established for these systems, and a state-feedback H∞ controller is designed to improve tracking performance. Third, to mitigate large tracking errors, an error-feedback control scheme is introduced to compensate for dynamic tracking deviations. These results yield a hybrid control framework that integrates data observation, state-feedback H∞ control, and error-feedback H∞ control to address the tracking problem more effectively. Two numerical examples and one practical example demonstrate the effectiveness of the proposed method.

1. Introduction

With advancements in artificial intelligence and data-acquisition technology, data-driven control has become a prominent paradigm in control engineering [1]. Numerous important results have emerged. For instance, Shen et al. studied iterative learning control (ILC) for discrete-time linear systems without prior probability information on randomly varying iteration length [2]. The authors of Ref. [3] proposed a proportional–integral–derivative (PID) control scheme based on adaptive updating rules and data-driven techniques. In [4], bounded-input bounded-output stability, monotonic convergence of tracking-error dynamics, and internal stability of the full-form dynamic-linearization-based model-free adaptive control (MFAC) scheme were analyzed using the contraction mapping principle. The authors of [5] proposed a data-driven adaptive control method based on the incremental triangular dynamic-linearization data model. Recently, data-driven modeling methods have advanced rapidly, yielding notable results in several areas, including predictive control for switched linear systems [6], predictive control for a modular multilevel converter [7], optimal output tracking control [8], self-triggered control [9], and distributed predictive control [10].
The H∞ control technique is an effective method for enhancing robustness against exogenous disturbances [11] and has been widely applied in aerospace, robotics, and wireless communications [12,13,14,15]. Robust H∞ control theory has evolved over more than 40 years. The foundational works include [16,17], which introduce the frequency-domain and algebraic Riccati equation (ARE) methods, respectively, for linear deterministic systems. The linear matrix inequality (LMI) approach was later developed in [18]. For systems subject to random noise [19], stochastic differential equations have been adopted [20], and corresponding H∞ results have been extended to linear Itô systems [21]. H∞ control theories for nonlinear systems have also been established, such as those in [22] for deterministic settings and those in [23]. State-feedback H∞ control for affine stochastic Itô systems was studied in [24] using the completing-squares method. More recently, we proposed an H∞ control design method for general nonlinear discrete-time stochastic systems using the disintegration property of conditional expectation and convex functions [25]. Additional results on H∞ control can be found in [26,27,28] and the references therein.
Tracking-control design techniques are widely used in autonomous underwater vehicles (AUVs), unmanned surface vehicles, and unmanned aerial vehicles (UAVs). For example, Ref. [29] investigated trajectory-tracking control for underactuated autonomous underwater vehicles subject to input constraints and arbitrary attitude maneuvers. A delay-compensated control framework with prescribed performance guarantees was proposed in [30] for trajectory-tracking control of unmanned underwater vehicles (UUVs) subject to uncertain time-varying input delays. The distributed model-predictive control (MPC) framework in [31] was designed for tracking-control problems involving multiple unmanned aerial vehicles (UAVs) interconnected through a directed communication graph. To address size-related limitations in computing invariant sets and to simplify the offline model-predictive control (MPC) design, the authors of Ref. [32] developed an MPC technique based on implicit terminal components. Optimal H∞ adaptive fuzzy observer-based indirect reference tracking-control designs were investigated in [33] for uncertain SISO and MIMO nonlinear stochastic systems with nonlinear uncertain measurement functions and measurement noise.
This paper investigates a hybrid partial-data-driven H∞ robust tracking-control method for the following linear stochastic system to be controlled:
$$dx(t) = [Ax(t) + Bu(t) + B_1 v_1(t)]\,dt + [A_1 x(t) + B_2 v_2(t)]\,dW(t), \quad t \in [t_0, T_f],$$
together with the unknown reference model:
x ˙ r ( t ) = A r x r ( t ) + v 3 ( t ) ,
where x(t) is the system state, x_r(t) is the reference trajectory, u(t) ∈ R^{n_u} is the control input, and v_1(t), v_2(t), and v_3(t) are the exogenous disturbances. The matrices A, B, B_1, A_1, and B_2 of the system to be controlled are known, whereas the matrix A_r of the reference system is unknown. Only discrete-time observations of x_r(t) are available, at t = t_0, t_1, …, t_N. To reduce the tracking error, i.e., the distance between x(t) and x_r(t), the control input u(t) must be properly designed.
A data-dependent augmented system is constructed, and state-feedback and error-feedback H∞ controllers are developed accordingly. The resulting hybrid partial-data-driven H∞ control design scheme comprises three stages: data observation, state-feedback H∞ control, and error-feedback H∞ control. In the first stage, information on x_r(t) is obtained, the reference system parameters are estimated, and the corresponding data-dependent augmented system is constructed, from which a state-feedback H∞ controller is derived. In the second stage, under the action of the state-feedback controller, the tracking error is reduced, although its magnitude remains relatively large because the control depends only on the system state and does not incorporate tracking-error values. In the third stage, an error-feedback H∞ controller is designed to further reduce the tracking error.
Compared with the method of [33], the proposed approach has two main innovations:
  • Tracking errors are incorporated not only in the performance index but also directly in the control input through an error-feedback term.
  • The control input u ( t ) is a hybrid partial-data-driven controller with a piecewise structure.
This paper is organized as follows: In Section 2, some results on H∞ control for linear stochastic systems are reviewed. In Section 3, the data-dependent augmented system is constructed, and the corresponding state-feedback and error-feedback H∞ control design methods are studied by solving algebraic Riccati inequalities. In Section 4, the three-stage hybrid control scheme is proposed, and the corresponding implementation procedure is provided. In Section 5, two numerical examples and one practical example are discussed to illustrate the effectiveness of the proposed method.
Notations: A^T: the transpose of the matrix A; |x|: the Euclidean norm of the vector x ∈ R^n; tr(A): the trace of the square matrix A = (a_{ij}) ∈ R^{n×n}, with tr(A) = Σ_{i=1}^{n} a_{ii}; ‖M‖: the norm of the matrix M ∈ R^{m×n}, defined by ‖M‖ = √(tr(M^T M)); n_y: the dimension of the vector y ∈ R^{n_y}; S^n(R): the set of n-order real symmetric matrices; S_+^n(R): the set of n-order positive definite matrices; M > 0 (M ≥ 0): the matrix M is positive definite (semi-definite); M^{1/2}: the square root of a positive definite (semi-definite) matrix M, i.e., M^{1/2} is a positive definite (semi-definite) matrix satisfying M = M^{1/2} M^{1/2}; ‖v‖_M: the norm of the vector v ∈ R^{n_v} with weighting matrix M > 0, defined by ‖v‖_M = √(v^T M v).

2. Preliminaries

Let {W(t), t ≥ 0} be a 1-dimensional standard Brownian motion on a complete probability space (Ω, F, 𝔽, P) with E[W(t)] = 0 and E[W(t)²] = t. The filtration 𝔽 = {F_t, t ≥ 0} satisfies the usual conditions, and F_t is generated by the Brownian motion, i.e.,
$$F_t = \sigma\{W_s : 0 \le s \le t\} \vee N,$$
where N is the totality of P-null sets. Denote by L²(Ω, F_t, P; R^n) the set of F_t-measurable random variables ξ with
$$E[|\xi|^2] < \infty.$$
Denote by M_F²([t_0, T_f]; R^{n_y}) the set of 𝔽-adapted stochastic processes y = {y_t, t_0 ≤ t ≤ T_f} with norm
$$\|y\|_{M_F^2([t_0, T_f]; R^{n_y})} = \Big(E\int_{t_0}^{T_f} |y_t|^2\,dt\Big)^{1/2} < \infty.$$
The stochastic linear system with exogenous disturbance v ( t ) is given as
$$dx(t) = (Ax(t) + B_1 v(t))\,dt + (A_1 x(t) + B_2 v(t))\,dW(t), \quad x(t_0) = x_0 \in R^n, \quad t \in [t_0, T_f],$$
where A, A_1 ∈ R^{n×n} and B_1, B_2 ∈ R^{n×n_v} are coefficient matrices, v(t) ∈ R^{n_v} is the exogenous disturbance, and T_f is the terminal time with 0 < T_f < ∞. We first review some results on the H∞ problem for system (1) that will be used later.
Denote by x(t; v, t_0, x_0) the solution of (1) starting at t_0 from initial state x_0 under the exogenous disturbance v = {v(t), t_0 ≤ t ≤ T_f}. For every v(t) ∈ M_F²([t_0, T_f]; R^{n_v}), let
z ( t ; v , t 0 , x 0 ) = C x ( t ; v , t 0 , x 0 ) + D v ( t )
be the output of system (1). Define the operator L : M_F²([t_0, T_f]; R^{n_v}) → M_F²([t_0, T_f]; R^{n_z}) by
$$L(v) = z,$$
where z(t) = z(t; v, t_0, 0). The H∞ norm of L is defined as
$$\|L\| = \sup_{v \ne 0} \frac{\|z\|_{M_F^2([t_0, T_f]; R^{n_z})}}{\|v\|_{M_F^2([t_0, T_f]; R^{n_v})}}.$$
If there exists a constant ρ > 0 such that, for every v(t) ∈ M_F²([t_0, T_f]; R^{n_v}),
$$\|z\|_{M_F^2([t_0, T_f]; R^{n_z})} \le \rho \|v\|_{M_F^2([t_0, T_f]; R^{n_v})},$$
i.e.,
$$E\int_{t_0}^{T_f} |z(t)|^2\,dt \le \rho^2 E\int_{t_0}^{T_f} |v(t)|^2\,dt,$$
then (2) and (3) are equivalent to
$$\|L\| \le \rho.$$
The following lemma is the bounded real lemma from [34], which also applies to the H∞ problem for system (1).
Lemma 1. 
For a given scalar ρ > 0, suppose there exists a positive definite matrix P ∈ S_+^n(R) such that
$$\begin{bmatrix} PA + A^T P + A_1^T P A_1 & PB_1 + A_1^T P B_2 & C^T \\ \ast & -(\rho^2 I - B_2^T P B_2) & D^T \\ \ast & \ast & -I \end{bmatrix} \le 0,$$
where ∗ denotes the symmetric part; then ‖L‖ ≤ ρ. Moreover, there also holds
$$E\int_{t_0}^{T_f} |z(t; v, t_0, x_0)|^2\,dt \le E[x_0^T P x_0] + \rho^2 E\int_{t_0}^{T_f} |v(t)|^2\,dt,$$
for all x_0 ∈ R^n and v(t) ∈ M_F²([t_0, T_f]; R^{n_v}).

3. State-Feedback and Error-Feedback H∞ Tracking Control for Linear Systems Based on Partially Observable Data

The following linear control systems are considered:
d x ( t ) = [ A x ( t ) + B u ( t ) + B 1 v 1 ( t ) ] d t + [ A 1 x ( t ) + B 2 v 2 ( t ) ] d W ( t ) , x ( 0 ) = x 0 R n , t [ 0 , T f ] ,
where A, A_1 ∈ R^{n×n}, B ∈ R^{n×n_u}, B_1 ∈ R^{n×n_{v_1}}, B_2 ∈ R^{n×n_{v_2}}, C ∈ R^{n_z×n}, and D ∈ R^{n_z×n} are coefficient matrices, x(t) ∈ R^n is the system state, u(t) ∈ R^{n_u} is the controller, and v_1(t) ∈ R^{n_{v_1}} and v_2(t) ∈ R^{n_{v_2}} are the exogenous disturbances.
Suppose the tracking target of system (5) can be described by the following reference model:
x ˙ r ( t ) = A r x r ( t ) + v 3 ( t ) ,
where x_r(t) ∈ R^n is the desired reference state tracked by x(t) in (5), A_r ∈ R^{n×n} is the coefficient matrix, and v_3(t) ∈ R^{n_{v_3}} is the exogenous disturbance of reference system (6). In system (6), the coefficient A_r is unknown, but x_r(t) can be observed at the discrete times t_0, t_1, …, t_N with t_0 < t_1 < … < t_N ≤ T_f; i.e., the observation values x_r(t_0), x_r(t_1), …, x_r(t_N) can be obtained.
Denote by e_r(t) the tracking error between x(t) and x_r(t); i.e.,
$$e_r(t) = x(t) - x_r(t).$$
Our goal, given that the reference-system coefficients are unknown and based only on the obtained discrete-time observations x_r(t_0), x_r(t_1), …, x_r(t_N), is to find an H∞ controller u(t) such that, for the given ρ > 0, the tracking performance satisfies the following inequality over the coming interval [t_N, T_f]:
$$\int_{t_N}^{T_f} E[e_r^T(t) Q e_r(t) + u^T(t) R u(t)]\,dt \le \rho^2 \int_{t_N}^{T_f} E[v^T(t) v(t)]\,dt,$$
for all exogenous disturbances v(t) = [v_1(t)^T, v_2(t)^T, v_3(t)^T]^T ∈ M_F²([t_N, T_f]; R^{n_v}), where Q ≥ 0 is the weighting matrix of the tracking error e_r(t), R > 0 is the weighting matrix of the controller u(t), T_f > 0 is the terminal time of control, and ρ is the attenuation level of the external disturbance acting on the system.
Denote
$$\xi_r(t_k) = \frac{x_r(t_{k+1}) - x_r(t_k)}{t_{k+1} - t_k}, \quad k = 0, 1, \ldots, N-1.$$
To obtain the H∞ control u(t), the unknown coefficient matrix A_r in system (6) must be estimated first. Because the trajectory x_r(t) of reference system (6) can be observed at the discrete times t_0, t_1, …, t_N, the observation values x_r(t_0), x_r(t_1), …, x_r(t_N) are used to estimate the unknown matrix A_r and thereby overcome the uncertainty in (6). This requires the discrete form of (6) as follows:
$$\frac{x_r(t_{k+1}) - x_r(t_k)}{t_{k+1} - t_k} = A_r x_r(t_k) + \epsilon_k, \quad k = 0, 1, 2, \ldots, N-1,$$
i.e.,
$$\xi_r(t_k) = A_r x_r(t_k) + \epsilon_k, \quad k = 0, 1, 2, \ldots, N-1,$$
where {ϵ_k, k = 0, 1, …, N − 1} are the fitting errors (residuals) of this model. The least-squares method is used to estimate the matrix A_r, which satisfies
$$\hat{A}_r = \arg\min_{A_r} \mathrm{TSSE}(A_r),$$
where TSSE is the total sum of squared errors,
$$\mathrm{TSSE}(A_r) = \sum_{k=0}^{N-1} \|\epsilon_k\|^2.$$
Theorem 1. 
Let X_r = [x_r(t_0), x_r(t_1), …, x_r(t_{N−1})] and Ξ_r = [ξ_r(t_0), ξ_r(t_1), …, ξ_r(t_{N−1})]; then the least-squares estimator of the matrix A_r is given by
$$\hat{A}_r = \Xi_r X_r^T (X_r X_r^T)^{\dagger},$$
which satisfies (9), where (X_r X_r^T)^† is the Moore–Penrose inverse of X_r X_r^T.
Proof. 
Applying the discrete-time model (8), we obtain
$$\epsilon_k = \xi_r(t_k) - A_r x_r(t_k), \quad k = 0, 1, 2, \ldots, N-1.$$
By the definition of TSSE(A_r), there holds
$$\mathrm{TSSE}(A_r) = \sum_{k=0}^{N-1} [\xi_r(t_k) - A_r x_r(t_k)]^T [\xi_r(t_k) - A_r x_r(t_k)] = \mathrm{tr}\big((\Xi_r - A_r X_r)^T (\Xi_r - A_r X_r)\big) = \|\Xi_r - A_r X_r\|^2.$$
Minimizing this quadratic form in A_r yields the normal equations A_r X_r X_r^T = Ξ_r X_r^T, whose minimum-norm solution is the least-squares estimator (11). □
Remark 1. 
In Theorem 1, the Moore–Penrose inverse is used. As noted by an anonymous reviewer, the Moore–Penrose procedure alone may be insufficient. However, if the sample size N + 1 is large enough, the matrix X_r X_r^T is typically invertible, as shown in Examples 1 and 2. Invertibility of X_r X_r^T is equivalent to rank(X_r X_r^T) = n. Because rank(X_r X_r^T) ≤ rank(X_r) and X_r is an n × N matrix, a necessary condition for X_r X_r^T to be invertible is N ≥ n. So, if X_r X_r^T is invertible, the result of (11) can be replaced by
$$\hat{A}_r = \Xi_r X_r^T (X_r X_r^T)^{-1}.$$
So, based on the results of Theorem 1, the parameter-unknown reference system (6) can be replaced by its data-based estimate
$$\dot{x}_r(t) = \Xi_r X_r^T (X_r X_r^T)^{\dagger} x_r(t) + v_3(t).$$
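The estimation step of Theorem 1 and Remark 1 can be sketched in Python with NumPy; the helper name and the synthetic check below are ours, not the paper's:

```python
import numpy as np

def estimate_Ar(xr, dt):
    """Least-squares estimate of the reference matrix A_r from discrete
    observations xr = [x_r(t_0), ..., x_r(t_N)] (an n x (N+1) array),
    sampled with constant step dt (Theorem 1 / Remark 1)."""
    Xr = xr[:, :-1]                      # [x_r(t_0), ..., x_r(t_{N-1})]
    Xi = (xr[:, 1:] - xr[:, :-1]) / dt   # finite-difference slopes xi_r(t_k)
    G = Xr @ Xr.T
    # Ordinary inverse when X_r X_r^T is invertible (Remark 1);
    # Moore-Penrose pseudoinverse otherwise (Theorem 1).
    if np.linalg.matrix_rank(G) == G.shape[0]:
        return Xi @ Xr.T @ np.linalg.inv(G)
    return Xi @ Xr.T @ np.linalg.pinv(G)

# Sanity check on synthetic data: simulate x_r' = A_r x_r by Euler steps
# (no disturbance v_3), so the finite differences satisfy the discrete
# model exactly and the estimator recovers A_r.
A_r = np.array([[-1.5, 0.4], [0.0, -0.8]])
dt, N = 0.05, 40
xr = np.zeros((2, N + 1))
xr[:, 0] = [1.0, -2.0]
for k in range(N):
    xr[:, k + 1] = xr[:, k] + dt * (A_r @ xr[:, k])
A_hat = estimate_Ar(xr, dt)
```

With noisy observations (v_3 ≠ 0), the estimate is only approximate, which is why the paper treats it as a data-dependent coefficient rather than the true A_r.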
The augmented system of (5) and (12) is written as
$$d\tilde{x}(t) = [\tilde{A}\tilde{x}(t) + \tilde{B}u(t) + \tilde{B}_1 v(t)]\,dt + [\tilde{A}_1\tilde{x}(t) + \tilde{B}_2 v(t)]\,dW(t), \quad \tilde{x}(t_N) = \begin{bmatrix} x(t_N) \\ x_r(t_N) \end{bmatrix}, \quad t \in [t_N, T_f],$$
where
$$\tilde{x}(t) = \begin{bmatrix} x(t) \\ x_r(t) \end{bmatrix}, \quad v(t) = \begin{bmatrix} v_1(t) \\ v_2(t) \\ v_3(t) \end{bmatrix}, \quad \tilde{A} = \begin{bmatrix} A & 0 \\ 0 & \Xi_r X_r^T (X_r X_r^T)^{\dagger} \end{bmatrix}, \quad \tilde{B} = \begin{bmatrix} B \\ 0 \end{bmatrix},$$
$$\tilde{A}_1 = \begin{bmatrix} A_1 & 0 \\ 0 & 0 \end{bmatrix}, \quad \tilde{B}_1 = \begin{bmatrix} B_1 & 0 & 0 \\ 0 & 0 & I_{n_{v_3}} \end{bmatrix}, \quad \tilde{B}_2 = \begin{bmatrix} 0 & B_2 & 0 \\ 0 & 0 & 0 \end{bmatrix}.$$
So, the corresponding performance index for (13) is rewritten as
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le \rho^2 \int_{t_N}^{T_f} E[v^T(t) v(t)]\,dt,$$
where
$$\tilde{Q} = \begin{bmatrix} Q & -Q \\ -Q & Q \end{bmatrix},$$
so that x̃^T(t)Q̃x̃(t) = e_r^T(t)Qe_r(t).
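The block construction above can be sketched as a small NumPy helper (a sketch of ours; it assumes the displayed block layout, with v = [v_1; v_2; v_3] and v_3 of dimension n, and the sign pattern of Q̃ that makes x̃ᵀQ̃x̃ = e_rᵀQe_r):

```python
import numpy as np

def build_augmented(A, B, B1, A1, B2, Ar_hat, Q):
    """Assemble the data-dependent augmented coefficients of (13)
    and the weighting matrix Q_tilde from the blocks of (5) and the
    estimate of A_r (the function name is ours)."""
    n = A.shape[0]
    nu, nv1, nv2, nv3 = B.shape[1], B1.shape[1], B2.shape[1], n
    A_t  = np.block([[A, np.zeros((n, n))],
                     [np.zeros((n, n)), Ar_hat]])
    B_t  = np.vstack([B, np.zeros((n, nu))])
    A1_t = np.block([[A1, np.zeros((n, n))],
                     [np.zeros((n, 2 * n))]])
    B1_t = np.block([[B1, np.zeros((n, nv2 + nv3))],
                     [np.zeros((n, nv1 + nv2)), np.eye(nv3)]])
    B2_t = np.block([[np.zeros((n, nv1)), B2, np.zeros((n, nv3))],
                     [np.zeros((n, nv1 + nv2 + nv3))]])
    Q_t  = np.block([[Q, -Q], [-Q, Q]])   # x_tilde' Q_t x_tilde = e_r' Q e_r
    return A_t, B_t, A1_t, B1_t, B2_t, Q_t

# Scalar illustration (numbers are ours, not from the paper):
A_t, B_t, A1_t, B1_t, B2_t, Q_t = build_augmented(
    np.array([[-2.0]]), np.array([[1.0]]), np.array([[1.0]]),
    np.array([[0.16]]), np.array([[1.0]]), np.array([[-1.5]]),
    np.array([[1.0]]))
```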
Remark 2. 
Denote the observations of the reference system (6) at the discrete times t_0, t_1, …, t_N as
$$X_r = [x_r(t_0), x_r(t_1), \ldots, x_r(t_N)];$$
then the coefficients Ξ_r and X_r in (12) depend on these observation data. Consequently, some coefficients of the augmented system (13), such as Ã, are also data-dependent, where the data X_r are only the observations of x_r(t) at the discrete times t_0, t_1, …, t_N. Because X_r contains observations only of the reference system (6), not of the to-be-controlled system (5), and only at the discrete times t_0, t_1, …, t_N rather than over the continuous interval [t_0, t_N], the control to be designed, which depends on these data X_r, is called partial-data-driven. We now define the partial-data-driven H∞ tracking control of systems (5) and (6), whose performance index is more general than (14).
Definition 1. 
For the given scalar ρ > 0 and the observations X_r of reference system (6), if there exist a positive definite matrix P and a control u(t), t ∈ [t_N, T_f], such that the solution x̃(t) of augmented system (13) satisfies the inequality
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 \int_{t_N}^{T_f} E[v^T(t) v(t)]\,dt,$$
for all x̃(t_N) ∈ L²(Ω, F_{t_N}, P; R^{2n}) and v(t) ∈ M_F²([t_N, T_f]; R^{n_v}), then u(t) is called the partial-data-driven H∞ tracking control of systems (5) and (6).
Theorem 2. 
For the given positive number ρ > 0, suppose the positive definite matrix P > 0 satisfies the following Riccati inequality:
$$H_1(P) := \tilde{A}^T P + P\tilde{A} + \tilde{A}_1^T P \tilde{A}_1 + \tilde{Q} - P\tilde{B}R^{-1}\tilde{B}^T P + (\tilde{A}_1^T P \tilde{B}_2 + P\tilde{B}_1)(\rho^2 I - \tilde{B}_2^T P \tilde{B}_2)^{-1}(\tilde{B}_2^T P \tilde{A}_1 + \tilde{B}_1^T P) \le 0, \qquad \rho^2 I - \tilde{B}_2^T P \tilde{B}_2 > 0.$$
Then, the state-feedback control u(t) = −R^{−1}B̃^T P x̃(t) is a partial-data-driven H∞ tracking control of systems (5) and (6).
Proof. 
Let V ( x ˜ ) = x ˜ T P x ˜ . Applying Itô’s formula to V ( x ˜ ( t ) ) , we have
$$V(\tilde{x}(T_f)) - V(\tilde{x}(t_N)) = \int_{t_N}^{T_f} \big[\tilde{x}^T(t)(\tilde{A}^T P + P\tilde{A} + \tilde{A}_1^T P \tilde{A}_1)\tilde{x}(t) + 2\tilde{x}^T(t) P\tilde{B}u(t) + 2\tilde{x}^T(t)(P\tilde{B}_1 + \tilde{A}_1^T P \tilde{B}_2)v(t) + v^T(t)\tilde{B}_2^T P \tilde{B}_2 v(t)\big]\,dt + \int_{t_N}^{T_f} 2\tilde{x}^T(t) P[\tilde{A}_1\tilde{x}(t) + \tilde{B}_2 v(t)]\,dW(t).$$
Taking expectations on both sides, the stochastic integral vanishes and we get
$$E[V(\tilde{x}(T_f))] - E[V(\tilde{x}(t_N))] = E\int_{t_N}^{T_f} \big[\tilde{x}^T(t)(\tilde{A}^T P + P\tilde{A} + \tilde{A}_1^T P \tilde{A}_1)\tilde{x}(t) + 2\tilde{x}^T(t) P\tilde{B}u(t) + 2\tilde{x}^T(t)(P\tilde{B}_1 + \tilde{A}_1^T P \tilde{B}_2)v(t) + v^T(t)\tilde{B}_2^T P \tilde{B}_2 v(t)\big]\,dt.$$
Since P > 0, we have V(x̃(T_f)) ≥ 0. So, adding and subtracting ρ²|v(t)|², we have
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le E[V(\tilde{x}(t_N))] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt + E\int_{t_N}^{T_f} \big[\tilde{x}^T(t)(\tilde{A}^T P + P\tilde{A} + \tilde{A}_1^T P \tilde{A}_1 + \tilde{Q})\tilde{x}(t) + 2\tilde{x}^T(t) P\tilde{B}u(t) + u^T(t) R u(t) + 2\tilde{x}^T(t)(P\tilde{B}_1 + \tilde{A}_1^T P \tilde{B}_2)v(t) - v^T(t)(\rho^2 I - \tilde{B}_2^T P \tilde{B}_2)v(t)\big]\,dt.$$
By the completing-squares method, we have
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le E[V(\tilde{x}(t_N))] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt + E\int_{t_N}^{T_f} \big[\tilde{x}^T(t) H_1(P)\tilde{x}(t) + \|u(t) + R^{-1}\tilde{B}^T P\tilde{x}(t)\|_R^2 - \|v(t) - (\rho^2 I - \tilde{B}_2^T P \tilde{B}_2)^{-1}(\tilde{B}_1^T P + \tilde{B}_2^T P \tilde{A}_1)\tilde{x}(t)\|_M^2\big]\,dt,$$
where M = ρ²I − B̃_2^T P B̃_2. Taking u(t) = −R^{−1}B̃^T P x̃(t) and combining with inequality (17), the following inequality is obtained:
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le E[V(\tilde{x}(t_N))] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt,$$
i.e.,
$$\int_{t_N}^{T_f} E[\tilde{x}^T(t)\tilde{Q}\tilde{x}(t) + u^T(t) R u(t)]\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt,$$
for all x ˜ ( t N ) L 2 ( Ω , F t N , P ; R 2 n ) , v ( t ) M F 2 ( [ t N , T f ] ; R n v ) . This ends the proof. □
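The conditions of Theorem 2 are straightforward to check numerically once a candidate P is available. The sketch below (our own helper, not from the paper) evaluates H₁(P) and the condition ρ²I − B̃₂ᵀPB̃₂ > 0 by eigenvalue tests, and forms the gain of the state-feedback control u(t) = −R⁻¹B̃ᵀPx̃(t) from the completing-squares step; the scalar data at the bottom are illustrative only:

```python
import numpy as np

def riccati_residual(P, A_t, B_t, A1_t, B1_t, B2_t, Q_t, R, rho):
    """Evaluate H_1(P) of Theorem 2 and check the two conditions
    H_1(P) <= 0 and rho^2 I - B2_t' P B2_t > 0."""
    nv = B2_t.shape[1]
    M = rho ** 2 * np.eye(nv) - B2_t.T @ P @ B2_t
    S = A1_t.T @ P @ B2_t + P @ B1_t          # coupling term
    H1 = (A_t.T @ P + P @ A_t + A1_t.T @ P @ A1_t + Q_t
          - P @ B_t @ np.linalg.solve(R, B_t.T) @ P
          + S @ np.linalg.solve(M, S.T))
    ok = (np.linalg.eigvalsh(M).min() > 0
          and np.linalg.eigvalsh((H1 + H1.T) / 2).max() <= 1e-9)
    return H1, ok

def state_feedback_gain(P, B_t, R):
    """Gain of the state-feedback control u(t) = -R^{-1} B_t' P x_tilde."""
    return -np.linalg.solve(R, B_t.T @ P)

# Illustrative scalar data (ours): a stable augmented pair for which
# P = 0.5 satisfies the inequality with rho = 1.
P = np.array([[0.5]])
H1, ok = riccati_residual(P, np.array([[-2.0]]), np.array([[1.0]]),
                          np.array([[0.1]]), np.array([[0.5]]),
                          np.array([[0.3]]), np.array([[1.0]]),
                          np.array([[1.0]]), 1.0)
K = state_feedback_gain(P, np.array([[1.0]]), np.array([[1.0]]))
```

In practice, a feasible P would be searched for with an LMI/Riccati solver; the function above only verifies a given candidate.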
Remark 3. 
Let
$$\tilde{z}(t) = \begin{bmatrix} \tilde{Q}^{1/2}\tilde{x}(t) \\ R^{1/2}u(t) \end{bmatrix},$$
so that |z̃(t)|² = x̃^T(t)Q̃x̃(t) + u^T(t)Ru(t),
and then the inequality (16) can be rewritten as
$$E\int_{t_N}^{T_f} |\tilde{z}(t)|^2\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 \int_{t_N}^{T_f} E[v^T(t) v(t)]\,dt.$$
In particular, if x̃(t_N) = 0, we have
$$E\int_{t_N}^{T_f} |\tilde{z}(t)|^2\,dt \le \rho^2 \int_{t_N}^{T_f} E[v^T(t) v(t)]\,dt.$$
This is just the performance (14). In this situation, we can define the operator L as
$$L : M_F^2([t_N, T_f]; R^{n_v}) \to M_F^2([t_N, T_f]; R^{n_{\tilde{z}}})$$
with L(v)(t) = z̃(t), and the H∞ norm of L satisfies ‖L‖ ≤ ρ. So, inequality (16) in Definition 1 is a generalization of the performance given by (14).
Now, we consider the error-feedback case and suppose the corresponding control u(t) is designed in the form
$$u(t) = K e_r(t),$$
where K is a to-be-designed matrix taking values in R^{n_u×n}. Denote
$$\tilde{A}_K = \begin{bmatrix} A + BK & -BK \\ 0 & \Xi_r X_r^T (X_r X_r^T)^{\dagger} \end{bmatrix}.$$
Then, the augmented system of (5) and (12) under the control u(t) can be described as
$$d\tilde{x}(t) = [\tilde{A}_K \tilde{x}(t) + \tilde{B}_1 v(t)]\,dt + [\tilde{A}_1 \tilde{x}(t) + \tilde{B}_2 v(t)]\,dW(t), \quad \tilde{x}(t_N) = \begin{bmatrix} x(t_N) \\ x_r(t_N) \end{bmatrix}, \quad t \in [t_N, T_f],$$
where the feedback term B̃u(t) = B̃Ke_r(t) has been absorbed into Ã_K. So, our target is to find proper matrices K and P satisfying the following inequality for some given ρ > 0:
$$E\int_{t_N}^{T_f} e_r^T(t) Q e_r(t)\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt, \quad \forall\, \tilde{x}(t_N) \in L^2(\Omega, F_{t_N}, P; R^{2n}),\ v(t) \in M_F^2([t_N, T_f]; R^{n_v}).$$
Theorem 3. 
Suppose there exist a matrix K ∈ R^{n_u×n} and a positive definite matrix P > 0 satisfying the following Riccati inequality:
$$H_2(P) := \tilde{A}_K^T P + P\tilde{A}_K + \tilde{A}_1^T P \tilde{A}_1 + \tilde{Q} + (\tilde{A}_1^T P \tilde{B}_2 + P\tilde{B}_1)(\rho^2 I - \tilde{B}_2^T P \tilde{B}_2)^{-1}(\tilde{B}_1^T P + \tilde{B}_2^T P \tilde{A}_1) \le 0, \qquad \rho^2 I - \tilde{B}_2^T P \tilde{B}_2 > 0.$$
Then, the error-feedback control u(t) = K e_r(t) satisfies (19).
Proof. 
Similar to the proof of Theorem 2, applying Itô's formula to V(x̃(t)) = x̃^T(t)Px̃(t), we obtain
$$E\int_{t_N}^{T_f} e_r^T(t) Q e_r(t)\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt + E\int_{t_N}^{T_f} \tilde{x}^T(t) H_2(P) \tilde{x}(t)\,dt - E\int_{t_N}^{T_f} \|v(t) - (\rho^2 I - \tilde{B}_2^T P \tilde{B}_2)^{-1}(\tilde{B}_1^T P + \tilde{B}_2^T P \tilde{A}_1)\tilde{x}(t)\|_M^2\,dt,$$
where M = ρ²I − B̃_2^T P B̃_2. Since H₂(P) ≤ 0 and the last term is nonpositive, there holds
$$E\int_{t_N}^{T_f} e_r^T(t) Q e_r(t)\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 E\int_{t_N}^{T_f} |v(t)|^2\,dt,$$
for all v(t) ∈ M_F²([t_N, T_f]; R^{n_v}). This ends the proof. □

4. Hybrid Partial-Data-Driven H∞ Robust Tracking Control Scheme for Linear Stochastic Systems

Based on the results of Theorems 1–3 in Section 3, the hybrid partial-data-driven H∞ robust tracking-control scheme for (5) and (6) is proposed in this section. The hybrid control scheme includes three stages: the interval [t_0, T_f] is divided into the subintervals [t_0, t_N], [t_N, T^(1)], and [T^(1), T_f], where t_0 < t_N < T^(1) < T_f. The control input of system (5) is designed in piecewise form, one piece per stage, as follows:
Stage 1: Observing the state x_r(t) in [t_0, t_N] at t_0, t_1, t_2, …, t_N.
In this stage, the coefficient A_r of reference system (6) is unknown, but the observation values of x_r(t) can be obtained at t_0, t_1, t_2, …, t_N. The observations x_r(t_0), x_r(t_1), x_r(t_2), …, x_r(t_N) can be rewritten as a matrix
$$X_r = [x_r(t_0), x_r(t_1), x_r(t_2), \ldots, x_r(t_N)].$$
Denote
$$\Xi_r = \left[\frac{x_r(t_1) - x_r(t_0)}{t_1 - t_0}, \frac{x_r(t_2) - x_r(t_1)}{t_2 - t_1}, \ldots, \frac{x_r(t_N) - x_r(t_{N-1})}{t_N - t_{N-1}}\right].$$
By the results of Theorem 1, the estimator of A_r can be obtained:
$$\hat{A}_r = \Xi_r X_r^T (X_r X_r^T)^{\dagger}.$$
Because the main objective of this stage is to observe the state of system (6), there is no control input; i.e.,
$$u^*(t) \equiv 0,$$
when t ∈ [t_0, t_N].
Stage 2: Designing the state-feedback H∞ control in [t_N, T^(1)].
Based on the estimated reference system (12), the augmented control system of (5) and (12) is obtained. For the given positive scalar ρ > 0, solving the Riccati inequality (17) yields the positive definite matrix P corresponding to the augmented system (13). By the results of Theorem 2, the state-feedback H∞ control is designed as
$$u^*(t) = K_2 \tilde{x}(t),$$
when t ∈ [t_N, T^(1)], where K₂ = −R^{−1}B̃^T P and x̃(t) is the state of the augmented system (13). This control satisfies the following performance:
$$E\int_{t_N}^{T^{(1)}} [e_r^T(t) Q e_r(t) + u^T(t) R u(t)]\,dt \le E[\tilde{x}^T(t_N) P \tilde{x}(t_N)] + \rho^2 E\int_{t_N}^{T^{(1)}} |v(t)|^2\,dt, \quad \forall\, \tilde{x}(t_N) \in L^2(\Omega, F_{t_N}, P; R^{2n}),\ v(t) \in M_F^2([t_N, T^{(1)}]; R^{n_v}).$$
Stage 3: Designing the error-feedback H∞ control in [T^(1), T_f].
To further decrease the error between x(t) and x_r(t), the error-feedback control is designed. Solving the Riccati inequality (20) yields the positive definite matrix P, and the corresponding error-feedback control is
$$u^*(t) = K_3 (x(t) - x_r(t)),$$
when t ∈ [T^(1), T_f], where K₃ is part of the solution of Riccati inequality (20).
Finally, the three-stage H∞ control u*(t) of (5) and (6) is obtained in piecewise form with respect to t ∈ [t_0, T_f], which can be written as follows:
$$u^*(t) = \begin{cases} 0, & t \in [t_0, t_N], \\ K_2 \tilde{x}(t), & t \in [t_N, T^{(1)}], \\ K_3 (x(t) - x_r(t)), & t \in [T^{(1)}, T_f]. \end{cases}$$
This piecewise control u*(t) satisfies the following performance:
$$E\int_{T^{(1)}}^{T_f} e_r^T(t) Q e_r(t)\,dt \le E[\tilde{x}^T(T^{(1)}) P \tilde{x}(T^{(1)})] + \rho^2 E\int_{T^{(1)}}^{T_f} |v(t)|^2\,dt, \quad \forall\, \tilde{x}(T^{(1)}) \in L^2(\Omega, F_{T^{(1)}}, P; R^{2n}),\ v(t) \in M_F^2([T^{(1)}, T_f]; R^{n_v}).$$
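The piecewise control law maps directly to code. A minimal sketch of ours (the gains K₂ and K₃ are assumed to have been computed offline from Theorems 2 and 3):

```python
import numpy as np

def hybrid_control(t, x, xr, t_N, T1, K2, K3):
    """Three-stage hybrid control u*(t): stage 1 is observation only
    (zero input), stage 2 applies state feedback on the augmented
    state [x; x_r], and stage 3 applies error feedback on x - x_r.
    K2 is n_u x 2n and K3 is n_u x n, both precomputed offline."""
    if t < t_N:                          # stage 1: no control input
        return np.zeros(K3.shape[0])
    if t < T1:                           # stage 2: state feedback
        return K2 @ np.concatenate([x, xr])
    return K3 @ (x - xr)                 # stage 3: error feedback
```

In a simulation loop, this function would be evaluated at each integration step of (5), with x_r(t) produced by the (estimated) reference system.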
The profiles of the hybrid control u*(t) and the corresponding performance are shown in Examples 1–3 below.
In summary, the implementation procedure of the suggested hybrid three-stage H∞ control is organized in Table 1.
Table 1. The implementation procedure of the hybrid partial-data-driven H∞ control.

5. Examples and Simulation

Example 1. 
Consider the following 1-dimensional controlled system:
$$dx(t) = (-2x(t) + u(t) + v_1(t))\,dt + (0.16x(t) + v_2(t))\,dW(t), \quad x(0) = 3, \quad t \in [0, T_f],$$
and the reference system is also a 1-dimensional system:
x ˙ r ( t ) = a r x r ( t ) + v 3 ,
where a_r is an unknown number, but the values of x_r(t) are observed at t = t_0, t_1, …, t_N with observation vector X_r as follows:
X r = [ 1.0000 , 0.9306 , 0.8793 , 0.8420 , 0.7828 , 0.7024 , 0.6394 , 0.5906 , 0.4946 , 0.4727 ,   0.3722 , 0.2397 , 0.2241 , 0.2115 , 0.2014 , 0.2699 , 0.3046 , 0.3673 , 0.3416 , 0.3529 , 0.2880 ] ,
where t_0 = 0, t_k = kΔt, k = 1, 2, …, 20, and Δt = 0.05. By Theorem 1, we get the estimator â_r = −1.5182. So, the augmented system is obtained:
$$dx(t) = [-2x(t) + u(t) + v_1(t)]\,dt + [0.16x(t) + v_2(t)]\,dW(t), \quad dx_r(t) = [-1.5182\,x_r(t) + v_3(t)]\,dt, \quad t \in [t_N, T_f].$$
The inverse of X_r X_r^T is (X_r X_r^T)^{−1} = 0.1507, and the corresponding augmented matrices are
$$\tilde{A} = \begin{bmatrix} -2 & 0 \\ 0 & -1.5182 \end{bmatrix}, \quad \tilde{B} = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad \tilde{B}_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad \tilde{B}_2 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad \tilde{B}_3 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}, \quad Q = R = 1.$$
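The scalar estimate can be reproduced directly from the listed observations (a check of ours; dt = 0.05, and in the scalar case the least-squares formula of Theorem 1 reduces to a ratio of inner products). The recovered value matches the reported magnitude 1.5182 and comes out negative under the convention ξ_r = a_r x_r + ϵ, i.e., the estimated reference system is stable:

```python
import numpy as np

# Observations x_r(t_0), ..., x_r(t_20) from Example 1, dt = 0.05.
xr = np.array([1.0000, 0.9306, 0.8793, 0.8420, 0.7828, 0.7024, 0.6394,
               0.5906, 0.4946, 0.4727, 0.3722, 0.2397, 0.2241, 0.2115,
               0.2014, 0.2699, 0.3046, 0.3673, 0.3416, 0.3529, 0.2880])
dt = 0.05
X = xr[:-1]                    # x_r(t_0), ..., x_r(t_19)
Xi = (xr[1:] - xr[:-1]) / dt   # finite-difference slopes xi_r(t_k)
a_hat = (Xi @ X) / (X @ X)     # scalar least-squares estimate
```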
For ρ = 0.9 , it is easy to check that the following matrix
$$P_2 = \begin{bmatrix} 0.5066 & -0.2990 \\ -0.2990 & 0.7361 \end{bmatrix}$$
satisfies the Riccati inequality (17). Let T^(1) ∈ (t_N, T_f). By Theorem 2, the corresponding state-feedback H∞ control
$$u^*(t) = -0.5066\,x(t) + 0.2990\,x_r(t), \quad t \in [t_N, T^{(1)}],$$
is designed. Similarly, it is easy to check that the matrix P₃ = P₂ and K₃ = −0.4267 also satisfy the Riccati inequality (20). By Theorem 3, we can obtain the corresponding error-feedback H∞ control
$$u^*(t) = -0.4267\,[x(t) - x_r(t)], \quad t \in [T^{(1)}, T_f].$$
In summary, the control u(t) of system (21) is designed in three stages, with t in the intervals [t_0, t_N], [t_N, T^(1)], and [T^(1), T_f]. So, the control u*(t) has the piecewise form
$$u^*(t) = \begin{cases} 0, & t \in [t_0, t_N], \\ -0.5066\,x(t) + 0.2990\,x_r(t), & t \in [t_N, T^{(1)}], \\ -0.4267\,[x(t) - x_r(t)], & t \in [T^{(1)}, T_f]. \end{cases}$$
This control is the suggested hybrid H∞ control of (21) and (22). The trajectory of the piecewise control u*(t) is shown in Figure 1.
Figure 1. Trajectories of u * ( t ) in Example 1.
Figure 2 shows the profiles of exogenous disturbances and Brownian motion. The trajectories of x ( t ) and x r ( t ) are illustrated in Figure 3, which are the solutions of (21) and (22) under the control u ( t ) given by (24).
Figure 2. Profiles of Brownian motion and exogenous disturbance in systems (25) and (26) in Example 1.
Figure 3. Trajectories of x ( t ) and x r ( t ) under the control u * ( t ) in Example 1.
From Figure 3 and Figure 4, we see that, under the control u*(t), the distance between x(t) and x_r(t) decreases and the error e_r(t) = x(t) − x_r(t) becomes smaller as t ranges over [t_0, T_f]. Comparing the errors across the stages, Figure 4 illustrates that the errors e_r(t) in the third stage (t ∈ [T^(1), T_f]) are smaller than in the other two stages, t ∈ [t_0, t_N] and t ∈ [t_N, T^(1)].
Figure 4. The changes in errors | e r ( t ) | under the effect of control u * ( t ) in Example 1.
Example 2. 
Consider the following 3-dimensional controlled system:
$$dx(t) = (Ax(t) + Bu(t) + B_1 v_1(t))\,dt + (A_1 x(t) + B_2 v_2(t))\,dW(t), \quad x(0) = \begin{bmatrix} 3 \\ 1 \\ 1 \end{bmatrix}, \quad t \in [0, T_f],$$
with the coefficient matrices
$$A = \begin{bmatrix} 1.6000 & 0.8000 & 0 \\ 0.8000 & 2.0000 & 1.6000 \\ 0.8000 & 0 & 1.6000 \end{bmatrix}, \quad B = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad B_1 = \begin{bmatrix} 0.1300 \\ 0.1000 \\ 0.1400 \end{bmatrix}, \quad A_1 = \begin{bmatrix} 0.1280 & 0.0640 & 0 \\ 0.0640 & 0.1600 & 0.1280 \\ 0.0640 & 0 & 0.1280 \end{bmatrix}, \quad B_2 = \begin{bmatrix} 0.0980 \\ 0.1000 \\ 0.1200 \end{bmatrix}, \quad B_3 = \begin{bmatrix} 0.1500 \\ 0.1700 \\ 0.1200 \end{bmatrix}, \quad Q = R = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix},$$
and the reference system is also a 3-dimensional system:
$$\dot{x}_r(t) = A_r x_r(t) + B_3 v_3(t),$$
where A_r is an unknown matrix. Suppose the interval [t_0, T_f] is divided into three segments, i.e., [t_0, t_N], [t_N, T^(1)], and [T^(1), T_f]. Now, we apply the suggested hybrid three-stage control method to design the H∞ control of (25) and (26), which proceeds in three stages:
Stage 1: Observing the state x_r(t) in [t_0, t_N] at t_0, t_1, t_2, …, t_N.
In this stage, the states of reference system (26) can be observed at t_0, t_1, t_2, …, t_N. Suppose the observations of reference system (26) at t_0 = 0, t_1 = Δt, t_2 = 2Δt, …, t_N = NΔt with N = 20 are arranged as a 3 × (N + 1) matrix X_r = [x_{t_0}, x_{t_1}, …, x_{t_N}], whose observed values are as follows:
$$X_r = \begin{bmatrix} 2.0099 & 1.9772 & 1.9242 & 1.8567 & 1.7744 & 1.6807 & 1.6077 & 1.5152 & 1.4263 & 1.3366 & 1.2687 & 1.1921 & 1.1177 & 1.0487 & 0.9799 & 0.8970 & 0.8369 & 0.7756 & 0.7252 & 0.6752 & 0.6380 \\ 3.3307 & 2.8971 & 2.5053 & 2.1834 & 1.8992 & 1.6630 & 1.4555 & 1.2870 & 1.1427 & 1.0174 & 0.9122 & 0.8072 & 0.7265 & 0.6451 & 0.5843 & 0.5281 & 0.4838 & 0.4713 & 0.4487 & 0.4153 & 0.3883 \\ 1.5543 & 1.5005 & 1.4424 & 1.4031 & 1.3478 & 1.2990 & 1.2610 & 1.2263 & 1.1559 & 1.1126 & 1.0681 & 1.0206 & 0.9567 & 0.9056 & 0.8605 & 0.8039 & 0.7567 & 0.7249 & 0.6946 & 0.6643 & 0.6421 \end{bmatrix},$$
where Δt = 0.05 is the sampling period. The inverse of X_r X_r^T is
$$(X_r X_r^T)^{-1} = \begin{bmatrix} 15.6800 & 2.3095 & 16.5304 \\ 2.3095 & 0.5003 & 2.2326 \\ 16.5304 & 2.2326 & 17.7210 \end{bmatrix}.$$
Applying the results of Theorem 1 in this stage, the estimator of A_r is obtained as
$$\hat{A}_r = \begin{bmatrix} 2.6897 & 1.0497 & 0.8069 \\ 2.0047 & 2.8509 & 3.0743 \\ 0.9389 & 0.1091 & 2.1192 \end{bmatrix}.$$
In this stage, there is no control input u(t); i.e., u^{1,*}(t) ≡ 0.
Stage 2: Designing state-feedback H control in [ t N , T ( 1 ) ] .
Taking ρ = 0.9 and solving the Riccati inequality (17) corresponding to systems (25) and (26) with observation X_r, we obtain the positive definite matrix
$$
P_2 = \begin{bmatrix}
0.7505 & 0.0796 & 0.1836 & 0.2306 & 0.0259 & 0.0978 \\
0.0796 & 0.5933 & 0.2790 & 0.0234 & 0.1879 & 0.1338 \\
0.1836 & 0.2790 & 0.9542 & 0.0218 & 0.0622 & 0.3467 \\
0.2306 & 0.0234 & 0.0218 & 0.4721 & 0.0116 & 0.1373 \\
0.0259 & 0.1879 & 0.0622 & 0.0116 & 0.3991 & 0.2849 \\
0.0978 & 0.1338 & 0.3467 & 0.1373 & 0.2849 & 0.9738
\end{bmatrix}.
$$
Furthermore, by Theorem 2, we obtain the state-feedback H∞ control u^{2,*}(t) = K_2 [x^T(t), x_r^T(t)]^T with
$$
K_2 = \begin{bmatrix}
0.7505 & 0.0796 & 0.1836 & 0.2306 & 0.0259 & 0.0978 \\
0.0796 & 0.5933 & 0.2790 & 0.0234 & 0.1879 & 0.1338 \\
0.1836 & 0.2790 & 0.9542 & 0.0218 & 0.0622 & 0.3467
\end{bmatrix}.
$$
Stage 3: Designing the error-feedback H∞ control on [T^{(1)}, T_f].
By solving the Riccati inequality (20), the positive definite matrix P_3 is obtained as
$$
P_3 = \begin{bmatrix}
1.4993 & 0.0444 & 0.0603 & 0.0000 & 0.0000 & 0.0000 \\
0.0444 & 1.5181 & 0.0488 & 0.0000 & 0.0000 & 0.0000 \\
0.0603 & 0.0488 & 1.4884 & 0.0000 & 0.0000 & 0.0000 \\
0.0000 & 0.0000 & 0.0000 & 1.4926 & 0.0701 & 0.0495 \\
0.0000 & 0.0000 & 0.0000 & 0.0701 & 1.4750 & 0.0560 \\
0.0000 & 0.0000 & 0.0000 & 0.0495 & 0.0560 & 1.5149
\end{bmatrix},
$$
and, by the results of Theorem 3, the corresponding error-feedback H∞ control u^{3,*}(t) = K_3 (x(t) − x_r(t)) can be obtained, where K_3 is solved as
$$
K_3 = \begin{bmatrix}
0.0368 & 0.2511 & 0.0297 \\
0.2503 & 0.0603 & 0.2802 \\
0.0346 & 0.1502 & 0.0227
\end{bmatrix}.
$$
Combining the results of Stage 1, Stage 2, and Stage 3, the control input of system (25) can be written in the segmented form
$$
u^{*}(t) =
\begin{cases}
0, & t \in [t_0, t_N], \\
K_2 [x^{T}(t), x_r^{T}(t)]^{T}, & t \in [t_N, T^{(1)}], \\
K_3 (x(t) - x_r(t)), & t \in [T^{(1)}, T_f].
\end{cases}
$$
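In implementation, the segmented control law is just a time-indexed switch between the three stages. A minimal sketch, with placeholder gains, states, and switching times rather than the values computed above:

```python
import numpy as np

def u_star(t, x, x_r, K2, K3, t_N, T1):
    """Three-stage hybrid control: no input, then state feedback, then error feedback."""
    if t < t_N:                                   # Stage 1: pure observation, no control
        return np.zeros(K3.shape[0])
    elif t < T1:                                  # Stage 2: state-feedback H-infinity control
        return K2 @ np.concatenate([x, x_r])      # K2 acts on the augmented state [x; x_r]
    else:                                         # Stage 3: error-feedback H-infinity control
        return K3 @ (x - x_r)

# Placeholder gains and states, purely for illustration.
K2 = np.ones((3, 6))
K3 = np.eye(3)
x, x_r = np.array([1.0, 2.0, 3.0]), np.zeros(3)

u1 = u_star(0.5, x, x_r, K2, K3, t_N=1.0, T1=2.0)  # stage 1: zero input
u2 = u_star(1.5, x, x_r, K2, K3, t_N=1.0, T1=2.0)  # stage 2: K2 @ [x; x_r]
u3 = u_star(2.5, x, x_r, K2, K3, t_N=1.0, T1=2.0)  # stage 3: K3 @ (x - x_r)
```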
Figure 5 shows the profiles of the exogenous disturbances v_1(t), v_2(t), v_3(t) and the Brownian motion W(t) to which systems (25) and (26) are subjected. The trajectories of each component of the suggested H∞ control u^*(t) of (25) are illustrated in Figure 6, divided into three stages. Figure 7 compares x(t) with x_r(t), and Figure 8 shows the evolution of the error |e_r(t)| = |x(t) − x_r(t)|. In the first stage, i.e., t ∈ [t_0, t_N], system (25) has no control input: u_1(t) = u_2(t) = u_3(t) = 0. Figure 7 and Figure 8 show that the distance between x(t) and x_r(t) is largest in this stage; i.e., the error |e_r(t)| is worst there. In the second stage, i.e., t ∈ [t_N, T^{(1)}], the control input of system (25) is the state-feedback H∞ control; compared with the first stage, the error |e_r(t)| becomes smaller but is still not satisfactory. In the third stage, t ∈ [T^{(1)}, T_f], the error between x(t) and x_r(t) becomes the smallest, as shown in Figure 7 and Figure 8.
Figure 5. Profiles of Brownian motion W ( t ) and exogenous disturbance v 1 ( t ) , v 2 ( t ) , and v 3 ( t ) in Example 2.
Figure 6. Trajectories of u * ( t ) in Example 2.
Figure 7. Trajectories of x ( t ) and x r ( t ) under the control u * ( t ) in Example 2.
Figure 8. The changes in errors | e r ( t ) | under the effect of control u * ( t ) in Example 2.
Example 3. 
Consider the robot-manipulator system discussed in [35], whose dynamic equation is given by
$$
M(q)\ddot{q} + V(q,\dot{q}) + G(q) = \tau + \tau_d, \qquad (28)
$$
where q(t) ∈ R^n is the vector of generalized coordinates, M(q) ∈ R^{n×n} is the inertia matrix, V(q, q̇) ∈ R^n is the Coriolis and centrifugal torque vector, G(q) ∈ R^n is the gravitational torque vector, τ ∈ R^n is the generalized control input vector, and τ_d is the disturbance.
As shown in Figure 9, to complete the task of moving the robot arm along a given trajectory from A to B in the time interval [t_0, T_f], the control τ in (28) should be designed so that q(t) takes proper values at every moment t ∈ [t_0, T_f]; that is, q(t) should track a given reference q_d(t). The error between them is defined as
$$
e(t) = q(t) - q_d(t).
$$
Figure 9. Sketch of robot manipulator moving along the given reference trajectory.
Now, suppose M(q) is an invertible positive definite constant matrix and V(q, q̇) and G(q) are linear in q and q̇; i.e., there exist matrices V_1, V_2, G ∈ R^{n×n} such that V(q, q̇) = V_1 q + V_2 q̇ and G(q) = Gq.
Let
$$
x = \begin{bmatrix} q \\ \dot{q} \end{bmatrix}, \quad
x_r = \begin{bmatrix} q_d \\ \dot{q}_d \end{bmatrix}.
$$
Then, system (28) is equivalent to
$$
dx(t) = [A x(t) + B \tau(t) + B_1 \tau_d(t)]\, dt.
$$
Now, we extend it to the stochastic case. Denote the control u ( t ) = τ ( t ) and the exogenous disturbance v 1 ( t ) = τ d ( t ) . Suppose the stochastic system with control input has the following form:
$$
dx(t) = [A x(t) + B u(t) + B_1 v_1(t)]\, dt + [A_1 x(t) + B_2 v_2(t)]\, dW(t), \qquad (29)
$$
where
$$
A = \begin{bmatrix} 0 & I \\ -M^{-1}(V_1 + G) & -M^{-1}V_2 \end{bmatrix}, \quad
B = \begin{bmatrix} 0 \\ M^{-1} \end{bmatrix}, \quad
A_1 = \lambda \begin{bmatrix} 0 & 0 \\ -M^{-1}(V_1 + G) & -M^{-1}V_2 \end{bmatrix}, \quad
B_1 = \begin{bmatrix} 0 \\ M^{-1} \end{bmatrix}.
$$
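The block matrices above can be assembled mechanically from the manipulator parameters, since (28) with the linearity assumption gives q̈ = −M^{-1}(V_1 + G)q − M^{-1}V_2 q̇ + M^{-1}(τ + τ_d). The sketch below uses simple placeholder matrices (not the values of this example) purely to check the block structure:

```python
import numpy as np

def manipulator_state_space(M, V1, V2, G, lam):
    """Build A, B, A1 of the linearized model M q_dd + V1 q + V2 q_dot + G q = tau + tau_d."""
    n = M.shape[0]
    Minv = np.linalg.inv(M)
    A = np.block([[np.zeros((n, n)), np.eye(n)],
                  [-Minv @ (V1 + G), -Minv @ V2]])        # drift matrix on x = [q; q_dot]
    B = np.vstack([np.zeros((n, n)), Minv])               # input enters through M^{-1}
    A1 = lam * np.block([[np.zeros((n, n)), np.zeros((n, n))],
                         [-Minv @ (V1 + G), -Minv @ V2]]) # diffusion scaled by lambda
    return A, B, A1

# Placeholder parameters with n = 2, chosen so the blocks are easy to verify.
M, V1, V2, G = 2.0 * np.eye(2), np.eye(2), np.eye(2), np.eye(2)
A, B, A1 = manipulator_state_space(M, V1, V2, G, lam=0.05)
# Lower-left block of A is -M^{-1}(V1 + G) = -I; lower-right is -M^{-1}V2 = -0.5 I.
```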
In practice, system (29) is seen as a version of (28) subjected to Brownian motion W ( t ) and exogenous disturbances v 1 ( t ) and v 2 ( t ) .
We also suppose the reference trajectory x r ( t ) given by
d x r ( t ) = [ A r x r ( t ) + B 3 v 3 ( t ) ] d t ,
where A_r ∈ R^{2n×2n} is unknown, but the values of x_r(t) can be observed at t_0, t_1, …, t_N.
Now, let n = 3; i.e., q is a 3-dimensional vector with
$$
q = \begin{bmatrix} q_1 \\ q_2 \\ q_3 \end{bmatrix}, \quad
q_d = \begin{bmatrix} q_{d1} \\ q_{d2} \\ q_{d3} \end{bmatrix}.
$$
Suppose the corresponding matrices M, V_1, V_2, and G are provided as follows:
$$
M = \begin{bmatrix} 10.2470 & 1.2741 & 4.7213 \\ 1.2741 & 14.3984 & 0.3849 \\ 4.7213 & 0.3849 & 10.5878 \end{bmatrix}, \quad
V_1 = \begin{bmatrix} 23.4859 & 0.5090 & 6.4912 \\ 6.3166 & 29.0514 & 0.3357 \\ 9.8896 & 6.6483 & 20.1972 \end{bmatrix},
$$
$$
V_2 = \begin{bmatrix} 36.0462 & 0.1154 & 11.2630 \\ 15.8835 & 52.9585 & 17.5902 \\ 13.5789 & 8.6582 & 34.6113 \end{bmatrix}, \quad
G = \begin{bmatrix} 20.6781 & 0.1502 & 5.5988 \\ 11.2692 & 29.6475 & 2.1779 \\ 9.9908 & 6.5910 & 19.7089 \end{bmatrix},
$$
and λ = 0.05 , B 2 = B 3 = B . Suppose the observations of x r at t = t 0 , t 1 , , t N are obtained:
$$
X_r = \left[\begin{array}{*{31}{c}}
0.2000 & 0.2250 & 0.2417 & 0.2514 & 0.2556 & 0.2552 & 0.2512 & 0.2444 & 0.2355 & 0.2250 & 0.2134 & 0.2011 & 0.1885 & 0.1757 & 0.1630 & 0.1506 & 0.1386 & 0.1270 & 0.1160 & 0.1056 & 0.0958 & 0.0867 & 0.0781 & 0.0702 & 0.0630 & 0.0563 & 0.0501 & 0.0445 & 0.0394 & 0.0348 & 0.0306 \\
0.2200 & 0.2450 & 0.2656 & 0.2825 & 0.2961 & 0.3068 & 0.3151 & 0.3212 & 0.3253 & 0.3276 & 0.3285 & 0.3279 & 0.3262 & 0.3233 & 0.3194 & 0.3147 & 0.3091 & 0.3029 & 0.2960 & 0.2886 & 0.2807 & 0.2724 & 0.2638 & 0.2549 & 0.2458 & 0.2365 & 0.2271 & 0.2176 & 0.2081 & 0.1986 & 0.1892 \\
0.7000 & 0.7050 & 0.6999 & 0.6865 & 0.6666 & 0.6415 & 0.6126 & 0.5808 & 0.5471 & 0.5122 & 0.4767 & 0.4412 & 0.4061 & 0.3717 & 0.3384 & 0.3063 & 0.2756 & 0.2465 & 0.2190 & 0.1932 & 0.1691 & 0.1467 & 0.1260 & 0.1069 & 0.0895 & 0.0736 & 0.0592 & 0.0462 & 0.0345 & 0.0242 & 0.0150 \\
0.5000 & 0.3330 & 0.1953 & 0.0829 & 0.0078 & 0.0797 & 0.1358 & 0.1784 & 0.2097 & 0.2315 & 0.2456 & 0.2531 & 0.2554 & 0.2535 & 0.2484 & 0.2407 & 0.2311 & 0.2201 & 0.2082 & 0.1958 & 0.1831 & 0.1704 & 0.1579 & 0.1457 & 0.1339 & 0.1226 & 0.1119 & 0.1019 & 0.0925 & 0.0837 & 0.0756 \\
0.5000 & 0.4126 & 0.3373 & 0.2719 & 0.2150 & 0.1651 & 0.1211 & 0.0822 & 0.0476 & 0.0167 & 0.0108 & 0.0355 & 0.0576 & 0.0775 & 0.0951 & 0.1109 & 0.1249 & 0.1373 & 0.1481 & 0.1575 & 0.1655 & 0.1723 & 0.1779 & 0.1823 & 0.1857 & 0.1881 & 0.1896 & 0.1903 & 0.1901 & 0.1892 & 0.1876 \\
0.1000 & 0.1029 & 0.2674 & 0.3985 & 0.5010 & 0.5788 & 0.6356 & 0.6745 & 0.6984 & 0.7095 & 0.7102 & 0.7023 & 0.6872 & 0.6666 & 0.6415 & 0.6131 & 0.5823 & 0.5497 & 0.5161 & 0.4821 & 0.4480 & 0.4143 & 0.3812 & 0.3491 & 0.3180 & 0.2882 & 0.2599 & 0.2330 & 0.2076 & 0.1838 & 0.1615
\end{array}\right],
$$
where t_0 = 0, t_k = kΔt, k = 1, 2, …, 30, with Δt = 0.05. By Theorem 1, we can obtain the estimate of A_r:
$$
\hat{A}_r = \begin{bmatrix}
1.8585 & 0.2796 & 0.9931 & 0.9700 & 0.4932 & 0.0032 \\
1.6188 & 0.1667 & 0.8638 & 0.1287 & 0.6264 & 0.0664 \\
0.8675 & 0.1626 & 0.4663 & 0.0249 & 0.2543 & 1.0276 \\
2.8913 & 0.1483 & 1.6393 & 3.8331 & 0.6145 & 0.3722 \\
0.4257 & 4.2336 & 0.7023 & 1.4485 & 4.0145 & 1.1032 \\
0.5798 & 1.0479 & 3.9362 & 0.6826 & 0.5743 & 3.2811
\end{bmatrix}.
$$
Construct the augmented system of (29) and (30) with Â_r given above. Then, taking ρ = 0.8 and applying the results of Theorems 2 and 3, the state-feedback H∞ control and error-feedback H∞ control are designed as follows:
$$
\tau^{*}(t) = u^{*}(t) =
\begin{cases}
0, & t \in [t_0, t_N], \\
K_2 [x^{T}(t), x_r^{T}(t)]^{T}, & t \in [t_N, T^{(1)}], \\
K_3 (x(t) - x_r(t)), & t \in [T^{(1)}, T_f],
\end{cases}
$$
where
$$
K_2 = \left[\begin{array}{*{12}{c}}
0.0266 & 0.0039 & 0.0122 & 0.0397 & 0.0029 & 0.0141 & 0.0056 & 0.0000 & 0.0096 & 0.0141 & 0.0010 & 0.0081 \\
0.0021 & 0.0192 & 0.0007 & 0.0038 & 0.0262 & 0.0099 & 0.0076 & 0.0101 & 0.0050 & 0.0015 & 0.0103 & 0.0021 \\
0.0092 & 0.0121 & 0.0277 & 0.0097 & 0.0098 & 0.0456 & 0.0097 & 0.0056 & 0.0103 & 0.0053 & 0.0031 & 0.0188
\end{array}\right]
$$
and
$$
K_3 = \begin{bmatrix}
1.0647 & 0.0374 & 0.3934 & 1.3155 & 0.2722 & 0.6980 \\
0.1236 & 1.2073 & 0.3950 & 0.3407 & 1.6582 & 0.2415 \\
0.5678 & 0.3900 & 0.9147 & 0.6441 & 0.1463 & 1.4583
\end{bmatrix}.
$$
Figure 10 illustrates the trajectories of the control τ*(t) over the three stages. Figure 11 shows the trajectories of q(t) and the reference q_d(t) under the control τ*(t) designed above; each component of q(t) tracks the corresponding component of q_d(t) well. Figure 12 illustrates the changes in the error between q(t) and q_d(t), showing that the error is smallest in the third stage, which verifies the effectiveness of the proposed method.
Figure 10. Trajectories of u * ( t ) in Example 3.
Figure 11. Trajectories of x ( t ) and x r ( t ) under the control u * ( t ) in Example 3.
Figure 12. The changes in errors | e r ( t ) | under the effect of control u * ( t ) in Example 3.

6. Conclusions

The robust H∞ tracking problem has been investigated for linear stochastic systems in which the parameters of the reference system are unknown but discrete-time observations of its state are available. A hybrid data-driven H∞ tracking-control design scheme has been proposed for such problems. Using the least-squares method, the parameters of the reference system are estimated, and the corresponding data-dependent augmented system is constructed. Based on solutions of algebraic Riccati inequalities, state-feedback and error-feedback H∞ controllers are designed to enhance system performance and reduce the tracking error. Moreover, the overall procedure of the hybrid control method, which integrates data observation, state-feedback H∞ control, and error-feedback H∞ control, has been summarized.

Author Contributions

Conceptualization, X.L. and R.Z.; methodology, X.L. and Y.Z.; software, X.L. and Y.Z.; validation, X.L., Y.Z., and R.Z.; formal analysis, X.L.; investigation, R.Z.; resources, X.L.; writing—original draft preparation, Y.Z. and X.L.; writing—review and editing, X.L. and R.Z.; visualization, R.Z.; supervision, X.L.; project administration, X.L.; funding acquisition, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by NSF of China, grant number 62273212.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Persis, C.D.; Tesi, P. Formulas for data-driven control: Stabilization, optimality, and robustness. IEEE Trans. Autom. Control 2020, 65, 909–924.
2. Shen, D.; Zhang, W.; Wang, Y.; Chien, C. On almost sure and mean square convergence of P-type ILC under randomly varying iteration lengths. Automatica 2016, 63, 359–365.
3. Yu, H.; Guan, Z.; Chen, T.; Yamamoto, T. Design of data-driven PID controllers with adaptive updating rules. Automatica 2020, 121, 109185.
4. Hou, Z.; Xiong, S. On model-free adaptive control and its stability analysis. IEEE Trans. Autom. Control 2019, 64, 4555–4569.
5. Pang, Z.; Ma, B.; Liu, G.; Han, Q. Data-driven adaptive control: An incremental triangular dynamic linearization approach. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 4949–4953.
6. Wang, Z.; Liu, K.; Cheng, X.; Sun, X. Online data-driven model predictive control for switched linear systems. IEEE Trans. Autom. Control 2025, 70, 6222–6229.
7. Wu, W.; Qiu, L.; Rodriguez, J.; Liu, X.; Ma, J.; Fang, Y. Data-driven finite control-set model predictive control for modular multilevel converter. IEEE J. Emerg. Sel. Top. Power Electron. 2023, 11, 523–531.
8. Sun, T.; Sun, X.; Sun, A. Optimal output tracking of aircraft engine systems: A data-driven adaptive performance seeking control. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 1467–1471.
9. Liu, W.; Sun, J.; Wang, G.; Bullo, F.; Chen, J. Data-driven self-triggered control via trajectory prediction. IEEE Trans. Autom. Control 2023, 68, 6951–6958.
10. Zhan, J.; Ma, Z.; Zhang, L. Data-driven modeling and distributed predictive control of mixed vehicle platoons. IEEE Trans. Intell. Veh. 2023, 8, 572–582.
11. Zhang, W.; Xie, L.; Chen, B.S. Stochastic H2/H∞ Control: A Nash Game Approach; Taylor & Francis: Boca Raton, FL, USA, 2017.
12. Chen, B.S.; Wang, C.P.; Lee, M.Y. Stochastic robust team tracking control of multi-UAV networked system under Wiener and Poisson random fluctuations. IEEE Trans. Cybern. 2021, 51, 5786–5799.
13. Liu, H.; Ma, T.; Lewis, F.L.; Wan, Y. Robust formation control for multiple quadrotors with nonlinearities and disturbances. IEEE Trans. Cybern. 2020, 50, 1362–1371.
14. Zhao, J.; Zhao, H.; Song, Y.; Sun, Z.Y.; Yu, D. Fast finite-time consensus protocol for high-order nonlinear multi-agent systems based on event-triggered communication scheme. Appl. Math. Comput. 2026, 508, 129631.
15. Boshkovska, E.; Ng, D.W.K.; Zlatanov, N.; Koelpin, A.; Schober, R. Robust resource allocation for MIMO wireless powered communication networks based on a non-linear EH model. IEEE Trans. Commun. 2017, 65, 1984–1999.
16. Zames, G. Feedback and optimal sensitivity: Model reference transformations, multiplicative seminorms, and approximate inverses. IEEE Trans. Autom. Control 1981, 26, 301–320.
17. Doyle, J.C.; Glover, K.; Khargonekar, P.; Francis, B. State-space solutions to standard H2 and H∞ control problems. IEEE Trans. Autom. Control 1989, 34, 831–847.
18. Gahinet, P.; Apkarian, P. A linear matrix inequality approach to H∞ control. Int. J. Robust Nonlinear Control 1994, 4, 421–448.
19. Dai, X.; Zuo, H.; Deng, F. Mean square finite-time stability and stabilization of impulsive stochastic distributed parameter systems. IEEE Trans. Syst. Man Cybern. Syst. 2025, 55, 4064–4075.
20. Zhao, J.; Yuan, Y.; Sun, Z.Y.; Xie, X. Applications to the dynamics of the suspension system of fast finite time stability in probability of p-norm stochastic nonlinear systems. Appl. Math. Comput. 2023, 457, 128221.
21. Hinrichsen, D.; Pritchard, A.J. Stochastic H∞. SIAM J. Control Optim. 1998, 36, 1504–1538.
22. van der Schaft, A.J. On a state-space approach to nonlinear H∞ control. Syst. Control Lett. 1991, 16, 1–8.
23. Ball, J.A.; Helton, J.W.; Walker, M.L. H∞ control for nonlinear systems with output feedback. IEEE Trans. Autom. Control 1993, 38, 546–559.
24. Zhang, W.; Chen, B.S. State feedback H∞ control for a class of nonlinear stochastic systems. SIAM J. Control Optim. 2006, 44, 1973–1991.
25. Lin, X.; Zhang, T.; Zhang, W.; Chen, B.S. New approach to general nonlinear discrete-time stochastic H∞ control. IEEE Trans. Autom. Control 2019, 64, 1472–1486.
26. Chen, B.S.; Yang, C.T.; Lee, M.Y. Multiplayer noncooperative and cooperative minimax H∞ tracking game strategies for linear mean-field stochastic systems with applications to cyber-social systems. IEEE Trans. Cybern. 2022, 52, 2968–2980.
27. Xin, Y.; Zhang, W.; Wang, D.; Zhang, T.; Jiang, X. Stochastic H2/H∞ control for discrete-time multi-agent systems with state and disturbance-dependent noises. Int. J. Robust Nonlinear Control 2025, in press.
28. Zhao, J.; Wang, Z.; Lv, Y.; Na, J.; Liu, C.; Zhao, Z. Data-driven learning for H∞ control of adaptive cruise control systems. IEEE Trans. Veh. Technol. 2024, 73, 18348–18362.
29. Liao, Y.; Zhang, T.; Yan, X.; Jiang, D. Integrated guidance and tracking control for underactuated AUVs on SE(3) with singularity-free global prescribed performance attitude control. Ocean Eng. 2025, 342, 122770.
30. Zhang, L.; Sun, Y.; Chai, P.; Tan, J.; Zheng, H. Prescribed-performance time-delay compensation control for UUV trajectory tracking in main-branch water conveyance tunnel transitions under unknown input delays. Ocean Eng. 2025, 342, 122941.
31. Xu, B.; Dai, Y.; Suleman, A.; Shi, Y. Distributed fault-tolerant control of multi-UAV formation for dynamic leader tracking: A Lyapunov-based MPC framework. Automatica 2025, 175, 112179.
32. Luque, I.; Chanfreut, P.; Limón, D.; Maestre, J.M. Model predictive control for tracking with implicit invariant sets. Automatica 2025, 179, 112436.
33. Chen, B.S.; Hsueh, C.H.; Wu, R.S. Optimal H∞ adaptive fuzzy observer-based reference tracking control of nonlinear stochastic systems under uncertain measurement function and noise. Fuzzy Sets Syst. 2025, 521, 109597.
34. Lin, X.; Zhang, R. H∞ control for stochastic systems with Poisson jumps. J. Syst. Sci. Complex. 2011, 24, 683–700.
35. Song, G.; Park, H.; Kim, J. The H∞ robust stability and performance conditions for uncertain robot manipulators. IEEE/CAA J. Autom. Sin. 2025, 12, 270–272.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
