Multi-Model Reliable Control for Variable Fault Systems under LQG Framework

Liu, Lei; Qian, Fucai; Xie, Guo; Wang, Min

doi:10.3390/electronics8060632

Open AccessArticle

Multi-Model Reliable Control for Variable Fault Systems under LQG Framework

by

Lei Liu

¹,

Fucai Qian

^1,2,*,

Guo Xie

¹ and

Min Wang

³

¹

School of Automation and Information Engineering, Xi’an University of Technology, Xi’an 710048, China

²

Autonomous Systems and Intelligent Control International Joint Research Center, Xi’an Technological University, Xi’an 710021, China

³

State Key Laboratory of Astronautic Dynamics, Xi’an 710043, China

^*

Author to whom correspondence should be addressed.

Electronics 2019, 8(6), 632; https://doi.org/10.3390/electronics8060632

Submission received: 14 April 2019 / Revised: 23 May 2019 / Accepted: 31 May 2019 / Published: 5 June 2019

(This article belongs to the Section Systems & Control Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

The problem of reliable control for variable fault systems under linear quadratic Gaussian (LQG) framework is studied in this paper. Firstly, a cluster of models is used to cover the dynamic behaviors of different fault modes of a system and, for each model, LQG control is implemented. By using the a posteriori probability of model innovation as the weight information, a multi-model reliable control (MMRC) is proposed. Secondly, it is proved that MMRC can enable the controller to learn the real operating mode of the system. When the controller is in a deadlock state, a deadlock avoidance strategy is given and its convergence of the a posteriori probability is proved. Finally, the validity of MMRC is verified by an example simulation of the lateral-directional control system of an aircraft. Simulation results show that MMRC guarantees an acceptable performance of the closed-loop system. In addition, since the controller fuses the control law of each model according to the weight information, when the system model is switched, the controller implements a soft switching, which avoids the jitter caused by frequent hard switching to the system.

Keywords:

LQG; variable fault system; a posteriori probability; reliable controller

1. Introduction

In the design of LQG controllers, due to the separation property between state estimation and control gain calculation, the design method of control law has received extensive attention from theorists and engineers. To date, a large number of problems in aerospace, aviation, industrial and socio-economic system have achieved satisfactory control effects under LQG framework, and many new methods have been deduced [1,2,3]. However, all of these methods hold that the system’s actuators, sensors, etc. can work properly. When faults occur in the system, these methods not only fail to optimize system performance, but also destabilize the system and even put the system at risk. Therefore, it is thought to be of great theoretical and practical engineering significance to study the design method of the controller under LQG framework in the case of faults appearing in the system.

The LQG framework contains two aspects. On the one hand, the system dynamics are linear, while the disturbance acting on the process and the error affecting the measurements are white Gaussian noise. On the other hand, the performance index is quadratic in state and control with the form of convex function. The existing research results show that a linear state feedback control which makes the performance index of closed-loop system optimal and has the separation property can be determined under the condition that the system is normal. In the general case, the aging of the components, the varying of the environment and other unpredictable factors in the process of control will cause the system to deviate from normal operating conditions, and even cause system faults. The existing control algorithms are powerless. Faced with this challenge, this paper hopes to design a reliable controller. The reliable controller is thus a controller that can make a closed-loop system with an optimal performance index when the system is in normal conditions and can also make the closed-loop system with an acceptable performance under certain conditions when faults appear in the system. Notice that, when the system is under normal conditions, the controller is the conventional LQG optimal controller. A controller having this property is referred to as a reliable controller in this paper.

The primary role of a reliable controller is to handle system faults. Generally speaking, faults will inevitably occur in a system during its life cycle, such as the measurement deviation of sensor, the actuator stuck, etc. When these faults are reflected in the system model, they cannot be mathematically described by model parameters change, but by the model parameters jump in the high-dimensional discrete space. Essentially, each point in the discrete space corresponds to a model, and different control strategies are designed based on different models [4,5,6]. If the fault model corresponding to any point in the discrete space is known, the number of models will have a dimension disaster, and it is impossible to refine the faults to different degrees to obtain all the models. A viable strategy is to build a cluster of models that traverse all possible faults as much as possible. The research results in this paper show that the reliable controller can be designed for a limited system model set. The controller tends to LQG optimal feedback control over time when the system is normal, and it can make the system have acceptable performance when the faults occur in the system.

A prerequisite for the reliable controller design is to approximate the various conditions of the system with a cluster of models which includes a normal model with no faults and several known fault models. The a priori information of the fault model is based on an understanding and mastery of history of the system, especially for systems that operate repeatedly in different cycles, such as aeronautical vehicles. There have been some methods for the problem of controller design with a model set that has multi-model control strategies [7,8]. These methods require that the controller is able to detect the fault model immediately. As system faults differ, the control laws matching different models are constantly switched according to the switching indicators, which often leads to strong jitter in the system at the switching point [9,10]. This is a hard switching method. Although it can make the system response faster, the jitter is unavoidable. In fact, a variety of methods that are accompanied by the development of sliding mode variable structure technology appear to solve the jitter.

In order to avoid the jitter, soft switching methods came into being [11,12]. In this paper, a soft switching among multiple models is realized by means of the dual adaptive control method. Since Feldbaum first proposed the dual control method for the autoregressive moving average (ARMA) model with unknown parameters, after more than half a century of research, it has been proved that dual control can optimize the system toward the desired target on one hand, and on the other hand, it can actively collect information and guarantee the unknown parameter estimation process [13,14,15].

Based on the above analysis, the contribution in this paper is as follows: (1) It is assumed in this paper that the normal model and the possible fault models are known. LQG optimal control is applied for each model. By using the a posteriori probability of each model as the weight information, an MMRC is proposed; (2) It is the optimal LQG controller when the system is normal, and the controller performs reliably when faults occur in the system; (3) It neither needs to detect the fault model, nor needs to detect the fault time; (4) It implements a soft switching among the multiple models, which avoids the jitter caused by hard switching.

The remainder of this paper is organized as follows: the problem to be solved in this paper will be presented in Section 2. The theoretical basis of MMRC will be established in Section 3. Section 4 illustrates the validity of the control algorithm through an example of the lateral-directional control system of an aircraft, and, finally, the conclusions will be offered in Section 5.

2. Problem Statement

Consider the following discrete-time stochastic linear systems:

\begin{matrix} x (k + 1) & = & [G (k) + Δ G_{j}] x (k) + [H (k) + Δ H_{j}] u (k) + ω (k), \end{matrix}

(1)

\begin{matrix} z (k) & = & [C (k) + Δ C_{j}] x (k) + υ (k), \end{matrix}

(2)

where

x (k)

is an n-dimensional vector of state,

z (k)

is a p-dimensional vector of output and

u (k)

is an m-dimensional vector of control input. The process noise

ω (k)

, observation noise

υ (k)

and initial condition

x (0)

are mutually independent white Gaussian, whose statistics are

ω (k) \sim N (0, W)

,

υ (k) \sim N (0, V)

, and

x (0) \sim N (0, P_{0})

.

G (k)

,

H (k)

and

C (k)

are matrices of appropriate dimensions, and their variable quantities

\{Δ G_{j}, Δ H_{j}, Δ C_{j}\}

represent the deviation of system components, actuators and sensors when the system is in operating mode j,

j \in Ω = \{0, 1, \dots, s\}

. When

j = 0

, it represents that the system is in normal mode and there is no fault, and the corresponding

Δ G_{0} = Δ H_{0} = Δ C_{0} = 0

. When

j = 1, 2, \dots, s

, it represents that the system is in different fault modes respectively, and, for any fault mode

\{Δ G_{j}, Δ H_{j}, Δ C_{j}\}

corresponds to a set of determined values. In this paper, it is assumed that

\{Δ G_{j}, Δ H_{j}, Δ C_{j}\}

are known a priori; that is, the model parameters corresponding to the mode j are known, but which mode the system is in is unknown. In addition, the system can only be in one mode during the same stage, and, in different stages, the system mode is variable.

The performance index for the system is quadratic in the state and control

\begin{matrix} J & = & x^{T} (N) A x (N) + \sum_{k = 0}^{N - 1} [x^{T} (k) A x (k) + u^{T} (k) B u (k)], \end{matrix}

(3)

where A and B are positive semi-definite and positive definite symmetric matrices of appropriate dimensions, respectively.

Define the information set at time k to be

I^{k}

\begin{matrix} I^{k} = \{u (0), \dots, u (k - 1); z (1), \dots, z (k)\} . \end{matrix}

The problem to be solved in this paper is to find a feedback control law

u (k) = f_{k} (I^{k})

such that the expected performance index

E \{J\}

of system (1) and (2) is minimized, namely,

\begin{matrix} (P) & min E \{J\} \\ s . t . & x (k + 1) = [G (k) + Δ G_{j}] x (k) + [H (k) + Δ H_{j}] u (k) + ω (k), \\ z (k) = [C (k) + Δ C_{j}] x (k) + υ (k), j \in Ω . \end{matrix}

Note that, if the process model parameters are completely known, the above control problem becomes the classical LQG control problem, for which there is already a mature solution. However, the problem

(P)

solved in this paper is different from LQG, namely, the fault at current time is unknown and which fault is also unknown.

3. Reliable Controller Design

For the convenience of problem statement and notation, systems (1) and (2) can be rewritten as follows:

\begin{matrix} x_{j} (k + 1) & = & G_{j} x_{j} (k) + H_{j} u (k) + ω (k), \end{matrix}

(4)

\begin{matrix} z (k) & = & C_{j} x_{j} (k) + υ (k), \end{matrix}

(5)

where

G_{j} = G (k) + Δ G_{j}

,

H_{j} = H (k) + Δ H_{j}

,

C_{j} =

C (k) + Δ C_{j}

,

x_{j} (k)

is the state of the system under mode j, and

z (k)

is the output.

Definition 1.

When the system is in mode j, the state estimate based on real-time information set

I^{k}

is

\begin{matrix} {\hat{x}}_{j} (k) = E \{x_{j} (k) | I^{k}\}, \end{matrix}

{\hat{x}}_{j} (k)

can be obtained by the Kalman filter, [16]

\begin{matrix} {\hat{x}}_{j} (k) = G_{j} {\hat{x}}_{j} (k - 1) + H_{j} u (k - 1) + K_{j} (k) {\tilde{z}}_{j} (k), \end{matrix}

(6)

where

\begin{matrix} K_{j} (k) = P_{j} (k | k - 1) {C_{j}}^{T} {[C_{j} P_{j} (k | k - 1) {C_{j}}^{T} + V]}^{- 1}, \end{matrix}

(7)

\begin{matrix} P_{j} (k | k - 1) = G_{j} P_{j} (k - 1) {G_{j}}^{T} + W, \end{matrix}

(8)

\begin{matrix} P_{j} (k) = [I - K_{j} (k) C_{j}] P_{j} (k | k - 1), \end{matrix}

(9)

\begin{matrix} {\tilde{z}}_{j} (k) = z (k) - C_{j} [G_{j} {\hat{x}}_{j} (k - 1) + H_{j} u (k - 1)], \end{matrix}

(10)

with initial condition

{\hat{x}}_{j} (0) = \hat{x} (0)

and

P_{j} (0) = P_{0}

.

K_{j} (k)

being the filter gain.

Theorem 1.

When the system is in mode j, the control law that minimizes performance index

E \{J\}

is given by

\begin{matrix} u_{j}^{*} (k) = - L_{j} (k) {\hat{x}}_{j} (k), \end{matrix}

(11)

where, for

k = N - 1, N - 2, \dots, 0

,

\begin{matrix} L_{j} (k) & = & {D_{j}}^{- 1} (k) {H_{j}}^{T} S_{j} (k + 1) G_{j}, \end{matrix}

(12)

\begin{matrix} D_{j} (k) & = & {H_{j}}^{T} S_{j} (k + 1) H_{j} + B, \end{matrix}

(13)

\begin{matrix} S_{j} (k) & = & {G_{j}}^{T} S_{j} (k + 1) G_{j} + A - {L_{j}}^{T} (k) D_{j} (k) L_{j} (k), \end{matrix}

(14)

with the boundary condition

S_{j} (N) = A

.

{\hat{x}}_{j} (k)

can be obtained from Definition 1.

Proof of Theorem 1.

Let the optimal cost-to-go of the operating mode j at time k be

J_{j}^{*} (k)

\begin{matrix} J_{j}^{*} (k) = min_{u_{j} (k), u_{j} (k + 1), \dots, u_{j} (N - 1)} E \{x_{j}^{T} (N) A x_{j} (N) + \sum_{i = k}^{N - 1} [x_{j}^{T} (i) A x_{j} (i) + u_{j}^{T} (i) B u_{j} (i)] |I^{k}\} . \end{matrix}

According to the smoothing property of expectations and optimal theory of stochastic dynamic programming, we have

\begin{matrix} J_{j}^{*} (k) = min_{u_{j} (k)} E \{x_{j}^{T} (k) A x_{j} (k) + u_{j}^{T} (k) B u_{j} (k) + J_{^{j}}^{*} (k + 1) | I^{k}\}, \end{matrix}

(15)

with the boundary condition

\begin{matrix} J_{j}^{*} (N) = E \{x_{j}^{T} (N) A x_{j} (N) | I^{N}\} . \end{matrix}

(16)

The solution to

(P)

can be obtained by using dynamic programming. Let l denote the time. If (11) is true at the initial time

l = N - 1

, let it be true when

l = k + 1

. If it can be proved that it is true when

l = k

, then the theorem is proved.

When

l = N - 1

, we have

\begin{matrix} J_{j}^{} (N - 1) = E \{x_{j}^{T} (N - 1) A x_{j} (N - 1) + u_{j}^{T} (N - 1) B u_{j} (N - 1) + J_{j}^{*} (N) | I^{N - 1}\} . \end{matrix}

(17)

Substituting (16) into (17), according to the nature of trace and Definition 1, yields

\begin{matrix} J_{j} (N - 1) & = & t r (A W) + {\hat{x}}_{j}^{T} (N - 1) (G_{j}^{T} A G_{j}^{} + A) {\hat{x}}_{j}^{} (N - 1) + 2 u_{j}^{T} (N - 1) H_{j}^{T} A G_{j}^{} {\hat{x}}_{j}^{} (N - 1) + \\ u_{j}^{T} (N - 1) (H_{j}^{T} A H_{j}^{} + B) u_{j}^{} (N - 1) + t r ((G_{j}^{T} A G_{j}^{} + A) P_{j}^{} (N - 1)) . \end{matrix}

(18)

Equation (18) is quadratic in control

u_{j} (N - 1)

. Letting

\frac{d J_{j} (N - 1)}{d u_{j} (N - 1)} = 0

, we can get the LQG optimal control

u_{j}^{*} (N - 1)

at time

N - 1

,

\begin{matrix} u_{j}^{*} (N - 1) = - L_{j} (N - 1) {\hat{x}}_{j} (N - 1), \end{matrix}

(19)

where

L_{j} (N - 1)

is identical to (12). Substituting (19) into (18), then the optimal cost-to-go at the time

N - 1

is given as

\begin{matrix} J_{j}^{*} (N - 1) & = & {\hat{x}}_{j}^{T} (N - 1) S_{j} (N - 1) {\hat{x}}_{j} (N - 1) + s_{j} (N - 1), \end{matrix}

where

S (N - 1)

is identical to (11), and

s (N - 1)

is uncorrelated to control and state variables

\begin{matrix} s_{j} (N - 1) = t r [A (G_{j} P_{j} (N - 1) G_{j}^{T} + W)] . \end{matrix}

Assume that at time

l = k + 1

we have

\begin{matrix} J_{j}^{*} (k + 1) & = & {\hat{x}}_{j}^{T} (k + 1) S_{j} (k + 1) {\hat{x}}_{j} (k + 1) + s_{j} (k + 1) . \end{matrix}

(20)

According to (15), we can get

\begin{matrix} J_{j}^{*} (k) & = & min_{u_{j} (k)} E \{x_{j}^{T} (k) A x_{j} (k) + u_{j}^{T} (k) B u_{j} (k) + J_{j}^{*} (k + 1) |I^{k}\} . \end{matrix}

(21)

Substituting (20) into (21) by mathematical induction yields the following:

\begin{matrix} J_{j} (k) & = & {\hat{x}}_{j}^{T} (k) (G_{j}^{T} S_{j} (k + 1) G_{j} + A) {\hat{x}}_{j} (k) + 2 u_{j}^{T} (k) H_{j}^{T} S_{j} (k + 1) G_{j} {\hat{x}}_{j} (k) + \\ u_{j}^{T} (k) (H_{j}^{T} S_{j} (k + 1) H_{j} + B) u_{j} (k) + t r ((G_{j}^{T} S_{j} (k + 1) G_{j} + A) P_{j} (k) + A W) . \end{matrix}

Similarly, letting

\frac{d J_{j} (k)}{d u_{j} (k)} = 0

, we get the LQG optimal control

\begin{matrix} u_{j}^{*} (k) = - L_{j} (k) {\hat{x}}_{j} (k), \end{matrix}

where

L_{j} (k)

and (12) are identical. Substituting

u_{j}^{*} (k)

into performance index

J_{j} (k)

, we get

\begin{matrix} J_{j}^{*} (k) = {\hat{x}}_{j}^{T} (k) S_{j} (k) {\hat{x}}_{j} (k) + s_{j} (k), \end{matrix}

where

S_{j} (k)

is identical to (14), and

s_{j} (k)

is uncorrelated to control and state variables

\begin{matrix} s_{j} (k) = s_{j} (k + 1) + t r [A (G_{j} P_{j} (k) G_{j}^{T} + W)] . \end{matrix}

This completes the proof. □

During the operation, the system may be in the normal mode, or switch back and forth among s fault modes. According to Theorem 1, each mode corresponds to an LQG optimal control. How to apply reliable control to the system based on

s + 1

LQG optimal control, the following theorem answers this question.

Theorem 2.

The control law that minimizes the performance index

E \{J\}

of the system (1) and (2) at time k is

\begin{matrix} u^{*} (k) = \sum_{j = 0}^{s} q_{j} (k) u_{j}^{*} (k), \end{matrix}

(22)

where

u_{j}^{*} (k)

is the LQG optimal control law when the system is in mode j,

q_{j} (k)

is the a posteriori probability of mode j and satisfies the following recursive equation:

\begin{matrix} q_{j} (k) = \frac{M_{j} (k)}{\sum_{i = 0}^{s} M_{i} (k) q_{i} (k - 1)} q_{j} (k - 1), \end{matrix}

(23)

where

\begin{matrix} M_{j} (k) & = & {|Δ_{j} (k)|}^{- \frac{1}{2}} exp [- \frac{1}{2} {\tilde{z}}_{j}^{T} (k) Δ_{j}^{- 1} (k) {\tilde{z}}_{j} (k)], \end{matrix}

(24)

\begin{matrix} Δ_{j} (k) & = & C_{j} P_{j} (k | k - 1) {C_{j}}^{T} + V, \end{matrix}

(25)

with the boundary condition

q_{j} (0) = 1 / (s + 1)

.

P_{j} (k | k - 1)

and

{\tilde{z}}_{j}

are identical to (8) and (10), respectively.

Proof of Theorem 2.

According to the stability theorem of filter, it can be obtained that, when

k \to \infty

, the limit of

Δ_{j} (k)

is a constant matrix that is denoted as

Δ_{j}

. Let i be the real system, for the non-real system j,

j \neq i

, define

π_{j} (k) = \frac{q_{j} (k)}{q_{i} (k)}

, according to (23), we have

\begin{matrix} π_{j} (k) & = & \frac{q_{j} (k)}{q_{i} (k)} = \frac{M_{j} (k)}{M_{i} (k)} π_{j} (k - 1) = π_{j} (0) \prod_{n = 1}^{k} \frac{M_{j} (n)}{M_{i} (n)} . \end{matrix}

(26)

Substituting (24), (25) and the boundary condition

q_{j} (0) = 1 / (s + 1)

into (26)

\begin{matrix} π_{j} (k) & = & (\prod_{n = 1}^{k} {|\frac{Δ_{i}}{Δ_{j}}|}^{\frac{1}{2}}) exp \{- \frac{1}{2} \sum_{n = 1}^{k} [{\tilde{z}}_{j}^{T} (n) Δ_{j}^{- 1} {\tilde{z}}_{j} (n) - {\tilde{z}}_{i}^{T} (n) Δ_{i}^{- 1} {\tilde{z}}_{i} (n)]\}, \end{matrix}

(27)

where

Δ_{j} (k)

is replaced by its limit

Δ_{j}

. Taking the logarithm on both sides of (27), and, according to the nature of trace, we get

\begin{matrix} ln π_{j} (k) & = & \frac{n}{2} ln |\frac{Δ_{i}}{Δ_{j}}| - \frac{1}{2} t r \{\sum_{n = 1}^{k} {\tilde{z}}_{j} (n) {\tilde{z}}_{j}^{T} (n) Δ_{j}^{- 1}\} + \frac{1}{2} t r \{\sum_{n = 1}^{k} {\tilde{z}}_{i} (n) {\tilde{z}}_{i}^{T} (n) Δ_{i}^{- 1}\} . \end{matrix}

(28)

Since i is a real system, we have

\begin{matrix} E \{{\tilde{z}}_{i} (k) {\tilde{z}}_{i}^{T} (k)\} = Δ_{i} (k) . \end{matrix}

According to the ergodicity of the innovation sequence

{\tilde{z}}_{i} (k),

it can be obtained that, when

k \to \infty

,

\begin{matrix} \frac{1}{k} \sum_{n = 1}^{k} {\tilde{z}}_{i} (n) {\tilde{z}}_{i}^{T} (n) Δ_{i}^{- 1} \to I . \end{matrix}

(29)

For the non-real system j, innovation

\begin{matrix} {\tilde{z}}_{j} (k) = z (k) - C_{j} {\hat{x}}_{j} (k | k - 1) = C_{i} x_{i} (k) + υ_{i} (k) - C_{i} {\hat{x}}_{i} (k | k - 1) + C_{i} {\hat{x}}_{i} (k | k - 1) - C_{j} {\hat{x}}_{j} (k | k - 1), \end{matrix}

(30)

let

\begin{matrix} e_{i} (k | k - 1) = {\hat{x}}_{i} (k | k - 1) - x_{i} (k), \end{matrix}

(31)

\begin{matrix} F_{j} (k) = C_{i} {\hat{x}}_{i} (k | k - 1) - C_{j} {\hat{x}}_{j} (k | k - 1) . \end{matrix}

(32)

Substituting (31) and (32) into (30) yields

\begin{matrix} {\tilde{z}}_{j} (k) = υ_{i} (k) - C_{i} e_{i} (k | k - 1) + F_{j} (k) . \end{matrix}

Since

υ_{i} (k)

and

e_{i} (k | k - 1)

are uncorrelated, according to (25), we get

\begin{matrix} E \{{\tilde{z}}_{j} (k) {\tilde{z}}_{j}^{T} (k)\} & = & E \{C_{i} e_{i} (k | k - 1) e_{i}^{T} (k | k - 1) {C_{i}}^{T} + υ_{i} (k) {υ_{i}}^{T} (k) + F_{j} (k) {F_{j}}^{T} (k)\} \\ = & C_{i} P_{i} (k - 1 | k) {C_{i}}^{T} + V + E \{F_{j} (k) F_{j}^{T} (k)\} \\ = & Δ_{i} (k) + E \{F_{j} (k) F_{j}^{T} (k)\} . \end{matrix}

Notice that

F_{j} (k) F_{j}^{T} (k)

is a semi-positive matrix such that

E \{{\tilde{z}}_{j} (k) {\tilde{z}}_{j}^{T} (k)\} \geq Δ_{i} (k)

holds if and only if

j = i

; then, we have

\begin{matrix} lim_{k \to \infty} \frac{1}{k} \sum_{n = 1}^{k} {\tilde{z}}_{j} (n) {\tilde{z}}_{j}^{T} (n) \geq Δ_{i} . \end{matrix}

(33)

Substituting (29), (33) into (28), when

k \to \infty

, we get

\begin{matrix} \frac{2}{k} ln π_{j} (k) \to ln |\frac{Δ_{i}}{Δ_{j}}| - t r (Δ_{j}^{- 1} Δ_{i}) + t r (I) - t r (Δ_{j}^{- 1} Σ_{j}), \end{matrix}

(34)

where

\begin{matrix} Σ_{j} = lim_{k \to \infty} \frac{1}{k} \sum_{n = 1}^{k} {\tilde{z}}_{j} (n) {\tilde{z}}_{j}^{T} (n) - Δ_{i} . \end{matrix}

Since the right side of (34) is negative, when

k \to \infty

\begin{matrix} \frac{2}{k} ln π_{j} (k) \to - c, \end{matrix}

where

c > 0

. Then, we have

\begin{matrix} π_{j} (k) = K exp (- \frac{k c}{2}), \end{matrix}

(35)

where K is a constant. Therefore, when

k \to \infty

, we have

\frac{q_{j} (k)}{q_{i} (k)} \to 0

, so we get

q_{j} (k)

\to 0

, since

\sum_{j = 0}^{s} q_{j} (k)

=1, we get

q_{i} (k) \to 1

.

At this time, the reliable control law

u^{*} (k)

=

\sum_{j = 0}^{s} q_{j} (k) u_{j}^{*} (k)

applied to the real system will tend to

u_{i}^{*} (k)

. This completes the proof. □

The above assumption of the boundary condition

q_{j} (0) = 1 / (s + 1)

indicates that the reliable controller does not have any preference for the system modes at the initial time, which is the worst case from the viewpoint of probability, even though the reliable controller will still tend to the control law of the real system. Theorem 2 demonstrates the reliability of the controller, and also shows that MMRC implements a soft switching strategy.

Note that, when

k \to \infty

,

q_{i} (k)

, which is the a posteriori probability of true model i, tends to 1, while

q_{j} (k) (j \in Ω, j \neq i)

tends to 0. Suppose at some time unit k,

q_{i} (k) = 1, q_{j} (k) = 0

; it can be seen from (23) that

q_{i} (k)

and

q_{j} (k)

will no longer change. According to (22), it yields

u^{*} (k) \equiv u_{i}^{*} (k)

, namely,

u^{*} (k)

always equals the control law of model i. If the true model changes from i to j in the next stage,

u^{*} (k)

will not change to

u_{j} (k)

, which is referred to as “lock out”. The following theorem answers the question of how to unlock a posteriori probability.

Theorem 3.

Assume that if the a posteriori probability of the system is locked out at time k, that is, the a posteriori probability of real system i,

q_{i} (k) = 1

, and the a posteriori probabilities of non-real system j,

q_{j} (k) = 0

,

j \neq i

, then, through the following transformation,

\begin{matrix} q_{i} (k) & = & 1 - ε, \\ q_{j} (k) & = & ε / s, j \neq i, \end{matrix}

(36)

where

0 < ε ≪ 1

, the a posteriori probability of the system after time k not only can be unlocked, but also does not affect the convergence.

Proof of Theorem 3.

According to (23), we have

\begin{matrix} q_{i} (k + m) = \frac{M_{i} (k + m) q_{i} (k + m - 1)}{\sum_{j = 0}^{s} M_{j} (k + m) q_{j} (k + m - 1)} = \frac{1}{1 + \sum_{j = 0, j \neq i}^{s} \frac{M_{j} (k + m) q_{j} (k + m - 1)}{M_{i} (k + m) q_{i} (k + m - 1)}} = \frac{1}{1 + \frac{q_{j} (k)}{q_{i} (k)} \sum_{j = 0, j \neq i}^{s} \prod_{n = 1}^{m} \frac{M_{j} (k + n)}{M_{i} (k + n)}} . \end{matrix}

(37)

Substituting (36) into (37), we get

\begin{matrix} q_{i} (k + m) = \frac{1}{1 + \frac{ε}{s (1 - ε)} \sum_{j = 0, j \neq i}^{s} \prod_{n = 1}^{m} \frac{M_{j} (k + n)}{M_{i} (k + n)}}, \end{matrix}

where

ε / s (1 - ε)

is a constant. Let

ξ_{j} (k + m)

=

\prod_{n = 1}^{m} \frac{M_{j} (k + n)}{M_{i} (k + n)}

, according to (24) and (25), we have

\begin{matrix} ξ_{j} (k + m) & = & exp \{- \frac{1}{2} \sum_{n = 1}^{m} [{\tilde{z}}_{j}^{T} (k + n) Δ_{j}^{- 1} (k + n) {\tilde{z}}_{j} (k + n) - {\tilde{z}}_{i}^{T} (k + n) Δ_{i}^{- 1} (k + n) {\tilde{z}}_{i} (k + n)]\} \times \\ (\prod_{n = 1}^{m} {|\frac{Δ_{i} (k + n)}{Δ_{j} (k + n)}|}^{\frac{1}{2}}) . \end{matrix}

(38)

Comparing (38) with (27), similarly, we can prove that, when

m \to \infty

, if i is still real system, we have

ξ_{j} (k + m) \to 0

and then we can get

q_{i} (k + m) \to 1

; if i becomes non-real system, then

ξ_{j} (k + m) \to \infty

and

q_{i} (k + m) \to 0

. This completes the proof. □

One can see from Theorem 3 that

ε

is a constant, whose value only affects the convergence speed of the a posteriori probability and does not affect the convergence. That is, after the time k, no matter whether the system mode changes, the a posteriori probability of the real system will always tend to 1, and a posteriori probability of the non-real system will tend to 0.

4. Simulation Analysis

In summary, the design of the reliable controller can be implemented by the following algorithm:

Step 1: Calculate the Kalman gain $K_{j} (k)$ offline according to (7)–(9).
Step 2: Calculate control gain $L_{j} (k)$ offline according to (12)–(14).
Step 3: Set $k = 0$ .
Step 4: Calculate ${\tilde{z}}_{j} (k)$ and ${\hat{x}}_{j} (k | k)$ according to (10) and (6), respectively.
Step 5: Calculate $u_{j}^{*} (k)$ according to (11).
Step 6: Calculate $q_{j} (k)$ according to (23).
Step 7: Calculate control law $u^{*} (k)$ according to (22), and apply this control to the system.
Step 8: Update $q_{j} (k)$ according to (36).
Step 9: If $k = N - 1$ , stop. Otherwise, set k := $k + 1$ , go back to Step 4.

In order to illustrate the characteristics of the controller designed in this paper, an example of an aircraft lateral-directional control system [17] is given as follows:

\begin{matrix} x (k + 1) & = & G_{j} x (k) + H_{j} u (k) + ω (k), \\ z (k) & = & x (k) + υ (k), \end{matrix}

where system state

x = {[φ β p_{s} r_{s}]}^{T}

and control input

u = {[δ_{a} δ_{r}]}^{T}

.

φ, β, p_{s}

and

r_{s}

represent the bank angle, the sideslip angle, the body roll rate, and the body yaw rate, respectively.

δ_{a}

and

δ_{r}

denote the differential aileron and the rudder, which control the roll and yaw motion, respectively. The statistics for the process noise, observation noise and initial condition are

ω (k) \sim N (0, 0.01 I_{4})

,

υ (k) \sim N (0, 0.01 I_{4})

and

x (0) \sim N (0, I_{4})

respectively where

I_{4}

is an identity matrix of

4 \times 4

dimension.

The performance index is

\begin{matrix} J & = & E \{x^{T} (50) x (50) + \sum_{k = 0}^{49} [x^{T} (k) x (k) + u^{T} (k) u (k)]\} . \end{matrix}

Assume that the airspeed is excessive at fault time

k_{f} = 25

due to the abnormality of the pitot tube system, and the process model parameters related to pitot tube are attenuated to 90% or 80%. That is, there are three potential operating modes in the system, i.e.,

j \in Ω = \{0, 1, 2\}

, and the operation process of system is divided into two stages by

k_{f}

. Before

k_{f}

, we call it the first stage, and, after

k_{f}

we call it the second stage. It is further assumed that the real system is

j = 0

in the first stage and in the second stage the real system is

j = 2

. When the system is normal, i.e.,

j = 0

, its model parameters are given as follows:

\begin{matrix} G_{0} = [\begin{matrix} 1 & 0 & 0.1 & 0 \\ 0.0049 & 0.9917 & 0 & - 0.1 \\ 0 & - 0.4545 & 0.8331 & 0.0172 \\ 0 & 0.3882 & - 0.0065 & 0.9911 \end{matrix}], H_{0} = [\begin{matrix} 0 & 0 \\ 0 & 0.0001 \\ 0.2728 & 0.0058 \\ 0.0040 & - 0.0136 \end{matrix}] . \end{matrix}

When the faults appear in the system, the relevant model parameters are 90% of the normal system, i.e.,

j = 1

,

\begin{matrix} G_{1} (2, 1) = G_{0} (2, 1) \times 90 %, \\ G_{1} (2, 2) = G_{0} (2, 2) \times 90 %, \\ H_{1} (2, 2) = H_{0} (2, 2) \times 90 %, \end{matrix}

and 80% of the normal system, i.e.,

j = 2

,

\begin{matrix} G_{2} (2, 1) = G_{0} (2, 1) \times 80 %, \\ G_{2} (2, 2) = G_{0} (2, 2) \times 80 %, \\ H_{2} (2, 2) = H_{0} (2, 2) \times 80 % \end{matrix}

where

G_{j} (2, 1)

,

G_{j} (2, 2)

and

H_{j} (2, 1)

represent the elements of the second row and the first column, the second row and the second column in the matrix G, and the second row and second column in the matrix H respectively when the system is in mode j. Note that we do not know which mode the system is in, and we do not know if the system mode is switched. We only know no matter which mode the system is in, its corresponding model is covered by the known model set

Ω

.

In this paper, two control strategies are used to implement example simulation. One is MMRC under the above conditions, the other is optimal control (OC) under the condition that the process model parameters are completely known. The performance index corresponding to OC should be the lower bound of the other controls. Simulation results are shown in Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5. Figure 1 illustrates the variation curve of the a posteriori probabilities of different modes in

Ω

under MMRC. Figure 2 and Figure 3 show the system control components under OC and MMRC, respectively. Figure 4 indicates the response curve of a representative system state. Performance index of the two control strategies are demonstrated in Figure 5.

It is observed in Figure 1 that in the above two stages the a posteriori probabilities of different system modes both have experienced change and stability. In the first stage, it is because the system is assumed with equal probability of

1 / 3

at the initial time, and, in the second stage, it is because a fault occurs in the system. As the system continuously obtains measurement results, the a posteriori probability of

j = 0

tends to 1 and the a posteriori probabilities of

j = 1

,

j = 2

tend to 0 in the first stage, and in the second stage the a posteriori probability of

j = 2

tends to 1 and the a posteriori probabilities of

j = 0

,

j = 1

tend to 0. It is noted that the change of the a posteriori probability of system is not 0 or 1, but varies between 0 and 1, which indicates that MMRC implements soft switching.

Figure 2 and Figure 3 demonstrate that, in the first stage, MMRC follows closely OC except for the initial transient period. When fault occurs in the system in the second stage, a certain error lasts for a while, but then MMRC fluctuates around OC and eventually MMRC follows closely OC. It is observed in Figure 4 that, in the first stage, there is only a slight difference in the state response curves under OC and MMRC, and, in the second stage, there is a deviation. However, the state response curve of MMRC finally follows closely that of OC in the control horizon.

Figure 5 demonstrates that only in the second stage is there a certain deviation between the performance index of MMRC and OC. These phenomena are due to two aspects. One is the switching of control law between multiple models, the other is the pursuit of control objective. Notice that it is an inevitable cost for MMRC.

5. Conclusions

In this paper, an MMRC, which is under the condition that the models are known, is given for controlling the variable fault system under LQG framework. The controller fuses the control law of each model by using their a posteriori probabilities as weight information. When the controller is in a deadlock state, a deadlock avoidance strategy is given and the convergence of the corresponding a posteriori probability is proved. The simulation results show that, when the system is normal, the controller is the optimal LQG control, and, when faults occur in the system, the controller is able to track optimal control quickly, which enables the performance of the fault system to closely follow the performance of that under OC. In addition, the controller neither needs to detect the system fault model, nor needs to detect the fault time. Soft switching strategy is implemented among the multiple models, which avoids the jitter caused by frequent hard switching to the system. If the system noise is non-Gaussian, extending the above algorithm to a non-Gaussian stochastic system will be future work.

Author Contributions

L.L. and F.Q. conceived and designed the experiments; L.L. performed the experiments and wrote the original draft preparation. F.Q contributed to the writing and review of this paper; G.X. participated in the editing of the paper. M.W. contributed to system model correction.

Funding

This research has received no external funding.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grant No. 61773016, 61873201).

Conflicts of Interest

The authors declare no conflict of interest.

References

Tanaka, T.; Esfahani, P.M.; Mitter, S.K. LQG Control with Minimum Directed Information: Semidefinite Programming Approach. IEEE Trans. Autom. Control 2017, 63, 37–52. [Google Scholar] [CrossRef]
Gibson, T.E.; Qu, Z.; Annaswamy, A.M. Adaptive output feedback based on closed-loop reference models. IEEE Trans. Autom. Control 2015, 60, 2728–2733. [Google Scholar] [CrossRef]
Yuksel, S. Jointly Optimal LQG Quantization and Control Policies for Multi-Dimensional Systems. IEEE Trans. Autom. Control 2014, 59, 1612–1617. [Google Scholar] [CrossRef]
Zhang, Y.M.; Li, X.R. Detection and diagnosis of sensor and actuator failures using IMM estimator. IEEE Trans. Aerosp. Electron. Syst. 1998, 34, 1293–1313. [Google Scholar] [CrossRef]
Meskin, N.; Naderi, E.; Khorasani, K. A Multiple Model-based Approach for Fault Diagnosis of Jet Engines. IEEE Trans. Control Syst. Technol. 2013, 21, 254–262. [Google Scholar] [CrossRef]
Pourbabaee, B.; Meskin, N.; Khorasani, K. Sensor Fault Detection, Isolation, and Identification Using Multiple-Model-Based Hybrid Kalman Filter for Gas Turbine Engines. IEEE Trans. Control Syst. Technol. 2016, 24, 1184–1200. [Google Scholar] [CrossRef]
Zhang, L.; Zhuang, S.; Braatz, R.D. Switched model predictive control of switched linear systems: Feasibility, stability and robustness. Automatica 2016, 67, 8–21. [Google Scholar] [CrossRef]
Buchstaller, D.; French, M. Robust Stability for Multiple Model Adaptive Control: Part I—The Framework. IEEE Trans. Autom. Control 2016, 61, 677–692. [Google Scholar] [CrossRef]
Tan, C.; Tao, G.; Qi, R.Y. An adaptive control scheme using multiple reference models. Int. J. Adapt. Control Signal Process. 2015, 28, 1290–1298. [Google Scholar] [CrossRef]
Tao, X.Y.; Li, N.; Li, S.Y. Multiple model predictive control for large envelope flight of hypersonic vehicle systems. Inf. Sci. 2016, 328, 115–126. [Google Scholar] [CrossRef]
Wu, H.F.; Sun, K.; Chen, L.Q.; Zhu, L.; Xing, Y. High Step-Up/Step-Down Soft-Switching Bidirectional DC-DC Converter With Coupled-Inductor and Voltage Matching Control for Energy Storage Systems. IEEE Trans. Ind. Electron. 2016, 63, 2892–2903. [Google Scholar] [CrossRef]
Qian, F.C.; Diao, J.; Yang, H.Z.; Huang, J.R. Multiple model learn and control optimization design method for complex systems. Syst. Eng. Theory Pract. 2016, 36, 200–208. [Google Scholar]
Li, D.; Fu, P.L.; Qian, F.C. Optimal nominal dual control for discrete-time LQG problem with unknown parameters. Automatica 2003, 44, 119–127. [Google Scholar] [CrossRef]
Li, D.; Qian, F.C.; Fu, P.L. Variance minimization approach for a class of dual control problems. IEEE Trans. Autom. Control 2003, 47, 2010–2020. [Google Scholar]
Qian, F.C.; Huang, J.R.; Liu, D. Adaptive Dual Control of Discrete-Time Lqg Problems with Unknown-But- Bounded Parameter. Asian J. Control 2015, 17, 942–951. [Google Scholar] [CrossRef]
Shang, T.; Qian, F.C.; Zhang, X.Y.; Xie, G. Research on Dual Control Algorithm for LQG with Unknown Parameters. Acta Autom. Sin. 2017, 43, 1478–1484. [Google Scholar]
Lavretsky, E.; Wise, K.A. Output Feedback Control. In Robust and Adaptive Control with Aerospace Applications; National Defense Industry Press: Beijing, China, 2015; pp. 118–155. [Google Scholar]

Figure 1. The a-posteriori probabilities of different modes of the system under MMRC.

Figure 2. Control component

u_{1}

.

Figure 2. Control component

u_{1}

.

Figure 3. Control component

u_{2}

.

Figure 3. Control component

u_{2}

.

Figure 4.

p_{s}

under MMRC and OC.

Figure 4.

p_{s}

under MMRC and OC.

Figure 5. The performance index under MMRC and OC.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, L.; Qian, F.; Xie, G.; Wang, M. Multi-Model Reliable Control for Variable Fault Systems under LQG Framework. Electronics 2019, 8, 632. https://doi.org/10.3390/electronics8060632

AMA Style

Liu L, Qian F, Xie G, Wang M. Multi-Model Reliable Control for Variable Fault Systems under LQG Framework. Electronics. 2019; 8(6):632. https://doi.org/10.3390/electronics8060632

Chicago/Turabian Style

Liu, Lei, Fucai Qian, Guo Xie, and Min Wang. 2019. "Multi-Model Reliable Control for Variable Fault Systems under LQG Framework" Electronics 8, no. 6: 632. https://doi.org/10.3390/electronics8060632

APA Style

Liu, L., Qian, F., Xie, G., & Wang, M. (2019). Multi-Model Reliable Control for Variable Fault Systems under LQG Framework. Electronics, 8(6), 632. https://doi.org/10.3390/electronics8060632

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multi-Model Reliable Control for Variable Fault Systems under LQG Framework

Abstract

1. Introduction

2. Problem Statement

3. Reliable Controller Design

4. Simulation Analysis

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI