Next Article in Journal
Anomaly Detection with Feature Extraction Based on Machine Learning Using Hydraulic System IoT Sensor Data
Next Article in Special Issue
MEDEP: Maintenance Event Detection for Multivariate Time Series Based on the PELT Approach
Previous Article in Journal
Using Artificial Intelligence for Assistance Systems to Bring Motor Learning Principles into Real World Motor Tasks
Previous Article in Special Issue
Batch Process Monitoring Based on Quality-Related Time-Batch 2D Evolution Information
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Distributed Fault Diagnosis and Cooperative Fault-Tolerant Control Design Framework for Distributed Interconnected Systems

School of Electrical Engineering, Nantong University, Nantong 226019, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(7), 2480; https://doi.org/10.3390/s22072480
Submission received: 25 January 2022 / Revised: 14 March 2022 / Accepted: 21 March 2022 / Published: 23 March 2022

Abstract

:
This paper investigates a design framework for a class of distributed interconnected systems, where a fault diagnosis scheme and a cooperative fault-tolerant control scheme are included. First of all, fault detection observers are designed for the interconnected subsystems, and the detection results will be spread to all subsystems in the form of a broadcast. Then, to locate the faulty subsystem accurately, fault isolation observers are further designed for the alarming subsystems in turn with the aid of an adaptive fault estimation technique. Based on this, the fault estimation information is used to compensate for the residuals, and then isolation decision logic is conducted. Moreover, the cooperative fault-tolerant control unit, where state feedback and cooperative compensation are both utilized, is introduced to ensure the stability of the whole system. Finally, the simulation of intelligent unmanned vehicle platooning is adopted to demonstrate the applicability and effectiveness of the proposed design framework.

1. Introduction

With the rapid development of sensing and communication technologies, modern engineering systems are increasingly networked and distributed [1]. Further, the large-scale distributed systems such as power grid and vehicle platooning are generally interconnected, physically or informationally [2,3,4,5]. These kinds of systems are thus referred to as distributed interconnected systems, which are composed of several subsystems in different locations through coupling mechanisms. On the other hand, the increasing size and complexity of distributed interconnected systems makes the occurrence of faults easier. Besides, due to the characteristics of interconnection, the fault diagnosis for distributed interconnected systems is challenging as an incipient fault occurring in any subsystem can potentially propagate from one subsystem to another and even result in the collapse of the whole system. The research on fault diagnosis and fault-tolerant control for distributed interconnected systems is receiving remarkable attention [6,7,8,9].
For the most part, the fault diagnosis approaches for distributed interconnected systems can be divided, according to the information used by the diagnostic units, into three categories: centralized, decentralized, and distributed fault diagnosis [10]. The centralized fault diagnosis approach employs a centralized diagnostic unit to collect the information of the whole system and then conducts fault diagnosis for all subsystems. In [11], an interconnected system with disconnected interconnections and packet dropouts was augmented into a switched system, and then a centralized robust fault detection filter was further designed. Obviously, the centralized approach requires high computation as well as communication and is not easy to expand, so it is not suitable for large-scale distributed interconnected systems. In the decentralized fault diagnosis approach, each subsystem is equipped with a local diagnostic unit which diagnoses its own faults with its own information; thus, the approach to interconnections among subsystems is of great importance. The conventional idea is to regard interconnections as external disturbances and design robust local observers to make local residuals insensitive to interconnections. In [12], a decentralized sensor fault isolation approach was investigated for a class of large-scale interconnected nonlinear systems. Some prior known reference signals were utilized to estimate interconnections and then the maximum impact of the estimation error on the local residual was assessed for an adaptive threshold setting. Although the decentralized approach does not need to consider information transmission or security issues, because of no interactions among subsystems, it is conservative to think of interconnections as disturbances or estimate interconnections based on prior information, which can result in a low fault detection rate.
In the distributed fault diagnosis approach, the local diagnostic unit for each subsystem can not only use information about the subsystem itself, but also interact with other subsystems. In other words, the distributed fault diagnosis approach can improve the fault diagnosis performance by partial information interaction which can obtain interconnection characteristics, and thus has received much more attention [13,14,15,16,17]. Fault isolation is the key to fault diagnosis for the distributed interconnected systems as interconnection characteristics can lead to fault propagation. In [15], fault isolation for a class of fuzzy interconnected systems was considered for the first time under the framework of interval observers, and piecewise interval observers were constructed to characterize the unknown interconnections among subsystems and the residual intervals were further used to realize fault isolation. In [16], fault isolation was achieved through the decoupling method. The unknown input consisting of faults in the other subsystems and disturbance in the whole system could be partitioned into the decoupled and the non-decoupled part, and a bank of finite-frequency H / H unknown input observers were further constructed. Moreover, a set of linear matrix inequalities were also used to ensure that the generated residual was sensitive to the fault, while remaining robust against the unknown input. Although the method provides design with a degree of freedom, the appropriate computation capacity and resources are demanded. In addition, information interaction among subsystems can exacerbate fault propagation, especially for sensor faults. In [17], the problem of fault isolation for sensor faults was studied. The influence of local and propagated sensor faults on the residuals was analyzed to realize distributed fault isolation for multiple sensor faults in interconnected systems. However, one of the main focuses in some distributed systems is to minimize the number of measurements shared among subsystems to reduce the communication cost. Meanwhile, different strategies including the fault-driven minimal structurally overdetermined set strategy, the minimal hitting set strategy, the equation-based strategy, and a set of fault-driven minimal structurally overdetermined sets strategies have been explored [8,18,19,20]. Distinguished from what has been mentioned above, the broadcasting communication is used for information exchange in our work, and the means of communication is not the focus.
Less research effort has been made, in comparison with fault diagnosis, to study fault-tolerant control for distributed interconnected systems. The traditional fault-tolerant control approach is to estimate the faults, or the changes of the subsystems caused by the faults, approximately via the adaptive or neural networked method, and then design the distributed or decentralized local fault-tolerant control law to compensate for the faults, so that the subsystems or the whole system can recover to an acceptable performance [21,22,23]. In these schemes, fault-tolerant control only regulates the controllers of the faulty subsystems, so it is called independent fault-tolerant control. From the perspective of globally distributed interconnected systems, another fault-tolerant control approach, called cooperative fault-tolerant control, is to make full use of subsystems and the cooperative effect of their coupling mechanisms to ensure the performance of the faulty system. In [24], a novel fault-tolerant control scheme for switched and interconnected nonlinear systems was designed to guarantee the stability of the state based on “fault-tolerant control Lyapunov–Barrier functions”. In [25], the cycle-small-gain theorem was utilized to ensure the closed-loop stability of interconnected systems, and a fault-tolerant control scheme that considered both rigid and flexible component faults was proposed. However, the use of the small gain theorem generally leads to a conservative result, and the fault-tolerant objective is only to guarantee the stability of faulty systems. To the best of our knowledge, most investigations on fault-tolerant control for the distributed interconnected systems are limited to basic stability analysis, whereas other dynamic and static properties have not been covered in great detail.
Inspired by the above considerations, a distributed fault diagnosis and cooperative fault-tolerant control design framework for distributed interconnected systems is proposed in this paper. Specifically, the contributions of this paper are as follows:
  • A novel fault diagnosis framework, which is mainly composed of fault detection observers and fault isolation observers, is developed for a general class of distributed interconnected systems with actuator faults. By transmitting the state estimation information in the form of a broadcast communication and carrying out several decision logic schemes in the cloud processing unit based on the residuals to achieve fault detection, isolation, and estimation, the problem of fault propagation can be solved as well;
  • A cooperative fault-tolerant control scheme, where LQR controllers for the healthy subsystems and a cooperative fault-tolerant controller for the faulty subsystem are utilized respectively, is also proposed to guarantee the stability and performance of the whole system;
  • Different from the conventional isolation decision logic, the adaptive method is employed to estimate the fault and the fault estimation information is used to modify the residuals. In this way, the subsystem with an actuator fault can be located where the residual value is less than the threshold rather than exceeding the threshold as usual.
This paper is organized as follows. In Section 2, the framework of distributed fault diagnosis and cooperative fault-tolerant control is introduced briefly, followed by the corresponding design objective. Section 3 presents the main results, including the design of fault detection observer, fault isolation observer, and cooperative fault-tolerant controller. Section 4 is dedicated to the simulation of intelligent unmanned vehicle platooning to demonstrate the applicability and effectiveness of the proposed design scheme. Ultimately, some conclusions and possible future research directions are presented in Section 5.

2. Problem Description

The design framework of distributed fault diagnosis and cooperative fault-tolerant control for distributed interconnected systems is depicted in Figure 1 and mainly includes the monitoring and control units (MCUs) and cloud processing unit. The whole distributed interconnected system consists of p subsystems and is modeled as
{ x ( k + 1 ) = A x ( k ) + B u ( k ) + B f f ( k ) + B v v ( k ) y ( k ) = C x ( k ) + E w ( k )
where x ( k ) = [ x 1 T ( k ) x i T ( k ) x p T ( k ) ] T n denotes the state vector, with x i ( k ) n / p the i th subsystem state. u ( k ) = [ u 1 T ( k ) u i T ( k ) u p T ( k ) ] T m denotes the input vector, with u i ( k ) m / p the ith subsystem input. y ( k ) = [ y 1 T ( k ) y i T ( k ) y p T ( k ) ] T r denotes the output vector, with y i ( k ) the i t h subsystem output. f ( k ) = [ f 1 T ( k ) f i T ( k ) . f p T ( k ) ] T m represents the actuator failure to be isolated, with f i ( k ) m / p the ith actuator failure. v ( k ) = [ v 1 T ( k ) v i T ( k ) v p T ( k ) ] T m stands for the process noise, with v i ( k ) m / p the ith subsystem process noise. w ( k ) = [ w 1 T ( k ) w i T ( k ) w p T ( k ) ] T s denotes the measurement noise, with w i ( k ) s / p the ith subsystem measurement noise. A , B , B f , B v , C and E in Equation (1) can be decomposed into
A = [ A 11 A 12 A 1 p A 21 A 22 A 2 p A p 1 A p 2 A p p ] , B = [ B 11 B 12 B 1 p B 21 B 22 B 2 p B p 1 B p 2 B p p ] , B f = [ B f 1 0 0 0 B f 2 0 0 0 B f p ] , B v = [ B v 1 0 0 0 B v 2 0 0 0 B v p ] , C = [ C 1 0 0 0 C 2 0 0 0 C p ] , E = [ E 1 0 0 0 E 2 0 0 0 E p ]
The ith subsystem can be further given as
{ x i ( k + 1 ) = A i i x i ( k ) + j N i ¯ A i j x j ( k ) + l N B i l u l ( k ) + B f i f i ( k ) + B v i v i ( k ) y i ( k ) = C i x i ( k ) + E i w i ( k )
where N is the set of all subsystems and N i ¯ is the set of subsystems other than the ith subsystem. x j ( k ) and u l ( k ) represent the jth ( j i ) subsystem state and lth subsystem input. Note that f i ( k ) denotes the actuator failure and, in general, B f i = B i i .
It can be found that each subsystem is equipped with an MCU which consists of the following components:
(1)
A fault detection observer (FDO), which is governed by
{ x ^ i ( k + 1 ) = A i i x ^ i ( k ) + j N i ¯ A i j x ^ j ( k ) + l = 1 N B i l u l ( k ) + L i ( y i ( k ) y ^ i ( k ) ) y ^ i ( k ) = C i x ^ i ( k ) r i ( k ) = y i ( k ) y ^ i ( k )
where x ^ i ( k ) and x ^ j ( k ) are the state estimations of the ith and jth subsystem respectively. x ^ i ( k ) represents the output estimation of the ith subsystem. L i is the detection observer gain and r i ( k ) stands for the residual of the ith subsystem generated by the FDO.
(2)
A fault isolation observer (FIO), which is activated when there is an alarm provided by the corresponding FDO and can be described by
{ x ^ i q ( k + 1 ) = A i i x ^ i q ( k ) + j A l a r m A i j x ^ j ( k ) + j ¯ A l a r m ¯ A i j ¯ x ^ j ¯ ( k ) + l = 1 N B i l u l ( k )         + B fi f ^ i ( k ) + G i ( y i ( k ) y ^ i q ( k ) ) y ^ i q ( k ) = C i x ^ i q ( k ) r i q ( k ) = y i ( k ) y ^ i q ( k ) f ^ i ( k + 1 ) = f ^ i ( k ) + Γ i ( y i ( k ) y ^ i q ( k ) )
where A l a r m + A l a r m ¯ = N i ¯ and x ^ i q ( k ) is the state estimation of subsystem given by the FIO. G i is the isolation observer gain of the ith subsystem. y ^ i q ( k ) represents the corresponding output estimation and r i q ( k ) is the residual of the ith subsystem generated by the FIO. f ^ i ( k ) stands for the fault estimation and Γ i is the weighting matrix.
(3)
A controller, which can keep the faulty system stable and is constructed as
u i ( k ) = { R i 1 B i i T K i x ^ z i ( k ) , f a u l t f r e e R i 1 B i i T K i x ^ z i ( k ) R i 1 B i i T S i ( k ) , f a u l t y
where R i is a positive definite matrix, and K i is the local optimal gain determined from a standard LQR Riccati equation. x ^ z i ( k ) = { x ^ i , f a u l t f r e e x ^ i q , f a u l t y is the real state estimation of x i ( k ) and is provided by the cloud processing unit. S i ( k ) represents the cooperative compensation vector from other subsystems in the faulty case.
The processing flow of the cloud processing unit is shown in detail in Figure 2. It can be seen that the clouding processing unit shoulders the responsibility of receiving, processing, and broadcasting information. Further, it mainly perform three functions: (i) obtaining the state estimations x ^ i and x ^ i q from the monitoring unit; (ii) accomplishing fault detection and isolation based on the residual signals and spreading results; and (iii) providing the corresponding state estimation to the control unit.
Based on this, the fault detection and isolation schemes in particular are given in Figure 3. The conventional fault detection observer is used to detect whether a fault occurs, and a residual value exceeding the threshold indicates that there is a fault in the process. Meanwhile, the fault isolation observer based on the adaptive fault estimation method is adopted to achieve fault isolation by the combination of an unconventional isolation decision logic. Specifically, since the adaptive method is employed to estimate the fault and the fault estimation information is used to modify the residuals. In this way, the subsystem with actuator faults can be located where the residual value is less than the threshold rather than exceed the threshold as usual. It is noteworthy that isolation decision logic in this paper is contrary to the detection decision logic and different from the conventional method [26].
In this paper, the design objective is to locate the fault accurately and achieve a cooperative fault-tolerant control. Hence, this paper studies the design of a novel fault diagnosis framework, and the detection and isolation observer gain L i and G i , the controller gain K i , and the cooperative compensation vector S i ( k ) .
Remark 1.
It is worthwhile to note that only the single fault case is considered in this paper. Meanwhile, a cooperative controller with fault-tolerant ability is introduced to keep the faulty subsystem stable, and LQR controllers are employed so that the healthy subsystems, which may be affected by the faulty subsystem, can remain stable.
Remark 2.
The subsystems in Figure 1 are physically interconnected. From the mathematical viewpoint, the physical interconnection can be seen from the state matrix A .If the matrix A i j ( i j ) ,the non-diagonal block of the matrix A ,is not equal to zero, it means that the ithsubsystem and the jthsubsystem are physically interconnected. In addition, the monitoring units in Figure 1 are informationally interconnected. To be specific, the monitoring units acquire the state estimation information from other interconnected subsystems through broadcast communication, and all subsystems can use them once the state estimation information has been broadcast.
Remark 3.
Fault isolation is achieved by making use of the adaptive fault estimation observer which is not applicable for a sensor fault. This is the reason we do not consider a sensor fault. If the fault isolation observer based on the adaptive fault estimation method is replaced by some other sensor fault estimation observer, the problem of sensor fault diagnosis can be considered.
Remark 4.
Similar to what has been given in [27], the necessary conditions for the existence of the observer are ( A i i , C i ) ( i = 1 , p ) being observable and ( A , C ) being observable, which can be guaranteed by the PBH rank criteria r a n k [ C i T s I i A i i T ] = n / p and r a n k [ C T s I A T ] = n respectively.

3. Main Results

The presentation of the main results is divided into three sections: (i) the design of fault detection observer; (ii) the design of fault isolation observer; and (iii) the design of cooperative fault-tolerant controller.

3.1. The Design of Fault Detection Observer

In the design of the fault detection observer, the only design parameter is the observer gain L i in Equation (3). To this end, we define the state error as e i ( k ) = x i ( k ) x ^ i ( k ) and the dynamics of the error system are obtained from Equations (2) and (3) as
{ e i ( k + 1 ) = ( A i i L i C i ) e i ( k ) + j N i ¯ A i j e j ( k ) + B f i f i ( k ) + B d i d i ( k ) r i ( k ) = C i e i ( k ) + D i d i ( k )
where B di = [ B v i L i E i ] , D i = [ 0 E i ] and d i ( k ) = [ v i T ( k ) w i T ( k ) ] T , and the residual r i ( k ) is related to v i ( k ) and w i ( k ) . Thus, the asymptotic stability and H performance of the error system given in Equation (6) are ensured in the following theorem.
Theorem 1.
For a given scalar γ i > 0 , if there exists a symmetric matrix P i = P i T > 0 and Y i such that
[ P i P i A i i Y i C i P i B v i P i L i E i P i A i m P i A i p 0 0 0 P i 0 0 0 0 C i T 0 0 γ i 2 0 0 0 0 0 0 γ i 2 0 0 E i T 0 0 ε m 0 0 2 ε m 0 0 ε p 0 0 0 2 ε p I 0 0 0 I 0 0 I 0 I ] < 0
where 0 < ε m , , ε p 1 ( m , p N i ¯ ). Then the error system (6) is asymptotically stable and satisfies H performance r i ( k ) 2 < γ i d i ( k ) 2 , and the detection observer gain can be obtained as L i = P i 1 Y i .
Proof of Theorem 1.
Consider the following Lyapunov function candidate:
V i ( k ) = e i T ( k ) P i e i ( k )
and the stability of the error system (6) is satisfied if and only if
J i = V i ( k + 1 ) V i ( k ) + r i T ( k ) r i ( k ) γ i 2 d i T ( k ) d i ( k ) < 0
It yields
J i < e i T ( k ) ( A ¯ i T P i A ¯ i P i + C i T C i ) e i ( k ) + 2 e i T ( k ) ( A ¯ i T P i B d i + C i T D i ) d i ( k ) + 2 j N i ¯ e i T ( k ) A ¯ i T P i A i j e j ( k ) + d i T ( k ) ( B d i T P i B d i + D i T D i γ i 2 ) d i ( k ) + j N i ¯ e j T ( k ) ( A i j T P i A i j + ε j ) e j ( k ) + 2 j N i ¯ e j T ( k ) A i j T P i B d i d i ( k )
or equivalently
ξ i T ( k ) Ξ i ξ i ( k ) < 0
where A ¯ i = A i i L i C i and ξ ( k ) = [ e i T ( k ) d i T ( k ) e m T ( k ) e p T ( k ) ] T , and
Ξ i = [ A ¯ i T P i A ¯ i P i + C i T C i A i i T P i B d i + C i T D i A ¯ i i T P i A i m A ¯ i i T P i A i p * B d i T P i B d i + D i T D i γ i 2 B d i T P i A i m B d i T P i A i p * * A i m T P i A i m + ε m A i m T P i A i P * * * * * * A i p T P i A i p + ε p ]
With applying the Schur complement twice, Ξ i < 0 can be further expressed as:
[ P i P i A ¯ i P i B d i P i A i m P i A i p 0 * P i 0 0 0 C i T * * γ i 2 0 0 D i T * * * ε m 0 0 * * * * * * * * * ε p 0 * * * * * * I ] < 0
based on this, ε m , , ε p are further managed with the use of Schur complement p 1 times. Hence, Theorem 1 can be proven. □
The root mean square (RMS) norm is selected as the residual evaluation function with
J i , R M S = r i ( k ) R M S
The threshold, the maximum influence of disturbance on the residual evaluation function without faults, can be computed by
J i , t h = sup fault - free r i ( k ) R M S
Then, the fault detection decision logic can be described by
{ J i , R M S > J i , t h ε i = 1 J i , R M S J i , t h ε i = 0
where ε i = 1 denotes that the ith subsystem will generate an alarm signal which will be broadcast by the cloud processing unit, and ε i = 0 denotes that the ith subsystem is healthy.

3.2. The Design of Fault Isolation Observer

According to the above fault detection results, the alarming subsystems are firstly put into the fault set and then fault isolation is only conducted in the fault set. Meanwhile, the FIO given in Equation (4) takes advantage of the output signals from sensors as well as the input signals to generate state estimation x ^ i q ( k ) and fault estimation f ^ i ( k ) . Based on this, a novel framework of fault isolation is proposed to locate the fault accurately. For this purpose, some error expressions are defined as follows:
e i q ( k ) = x i ( k ) x ^ i q ( k ) ,   e f i ( k ) = f i ( k ) f ^ i ( k ) .
Then, from Equations (2) and (4), the error dynamical system is described by
{ e i q ( k + 1 ) = ( A i i G i C i ) e i q ( k ) + j A l a r m A i j e j q ( k ) + j ¯ A l a r m ¯ A i j ¯ e j ¯ ( k ) + B fi e f i ( k )         + B v i v i ( k ) G i E i w i ( k ) e f i ( k + 1 ) = Γ i C i e i q ( k ) + e f i ( k ) + Δ f i ( k ) Γ i E i w i ( k )
where Δ f i ( k ) = f i ( k + 1 ) f i ( k ) denotes the variation in the fault.
According to the error system (8), the augmented system is as follows:
{ e ˜ i ( k + 1 ) = A ˜ i e ˜ i ( k ) + B ˜ d i d ˜ i ( k ) e f i ( k ) = E ˜ i e ˜ i ( k )
where e ˜ i ( k ) = [ e i q T ( k ) e f i T ( k ) e i T ( k ) e m T ( k ) e p T ( k ) ] T , d ˜ i ( k ) = [ w ˜ i T ( k ) v ˜ i T ( k ) Δ f ˜ i T ( k ) ] T , and w ˜ i ( k ) = [ w i ( k ) w m ( k )   w p ( k ) ] T , and v ˜ i ( k ) = [ v i ( k ) v m ( k )   v p ( k ) ] T , and Δ f ˜ i ( k ) = [ Δ f i ( k ) Δ f m ( k )   Δ f p ( k ) ] T , and E ˜ i = [ 0 1 0 0 0 ] , and A ˜ i = [ A i i G i C i B f i 0 A i m A i p Γ i C i 1 0 0 0 0 0 A i i L i C i A i m A i p 0 0 A m i A m m L m C m A m p 0 0 A p i A p m A p p L p C p ] , and B ˜ d i = [ G i E i 0 0 B v i 0 0 0 0 0 Γ i E i 0 0 0 0 0 1 0 0 L i E i 0 0 B v i 0 0 0 0 0 0 L m E m 0 0 B v m 0 0 0 0 0 0 L p E p 0 0 0 B v p 0 0 0 ] . Thus, the asymptotic stability and H performance of the error system given in Equation (9) are ensured in the following theorem.
Theorem 2.
For a given scalar δ i > 0 , if there exist symmetric matrices P ˜ i i = P ˜ i i T > 0 , Q ˜ i i = Q ˜ i i T > 0 such that the following condition holds
[ P ˜ i ψ i Θ i 0 P ˜ i 0 E ˜ i δ i 0 δ i ] < 0
where P ˜ i = d i a g { P ˜ i i Q ˜ i i I I I } , ψ i = [ P ˜ i i A i i X ˜ i C i P ˜ i i B f i 0 P ˜ i i A i m P ˜ i i A i p Y ˜ i C i Q ˜ i i 0 0 0 0 0 A i i L i C i A i m A i p 0 0 A m i A m m L m C m A m p 0 0 A p i A p m A p p L p C P ] , Θ i = [ X ˜ i E i 0 0 P ˜ i i B v i 0 0 0 0 0 Y ˜ i E i 0 0 0 0 0 Q ˜ i i 0 0 L i E i 0 0 B v i 0 0 0 0 0 0 L m E m 0 0 B v m 0 0 0 0 0 0 L p E p 0 0 B v p 0 0 0 ] , then the error dynamics (9) satisfy the H performance index e f i ( k ) 2 < δ i d ˜ i ( k ) 2 . Further, the weighting matrix and isolation observer gain are given as Γ i = Q ˜ i i 1 Y ˜ i and G i = P ˜ i i 1 X ˜ i respectively.
Proof of Theorem 2.
Consider the following Lyapunov function candidate:
V ˜ i ( k ) = e ˜ i T ( k ) P ˜ i e ˜ i ( k )  
and the stability of the error system Equation (9) is satisfied if and only if
J ˜ i = V ˜ i ( k + 1 ) V ˜ i ( k ) + e f i T ( k ) e f i ( k ) δ i 2 d ˜ i T ( k ) d ˜ i ( k ) < 0  
It yields
e ˜ i T ( k ) ( A ˜ i T P ˜ i A ˜ i + E ˜ i T E ˜ i P ˜ i ) e ˜ i ( k ) + e ˜ i T ( k ) A ˜ i T P ˜ i B ˜ d i d ˜ i ( k ) + d ˜ i T ( k ) B ˜ d i T P ˜ i A ˜ i e ˜ i ( k ) + d ˜ i T ( k ) ( B ˜ d i T P ˜ i B ˜ d i δ i 2 ) d ˜ i ( k ) < 0  
or equivalently
[ P ˜ i P ˜ i A ˜ i P ˜ i B ˜ d i 0 P ˜ i 0 E ˜ i T δ i 0 δ i ] < 0
Theorem 2 can be proven by substituting P ˜ i , A ˜ i and B ˜ d i into Equation (11). □
Furthermore, the residual evaluation in the fault set is carried out again. The RMS norm is chosen as the residual evaluation function with
J i , R M S q = r i q ( k ) R M S
and the threshold, similar to previous subsections, is calculated as
J i , t h q = sup fault - free r i q ( k ) R M S
Different from the conventional scheme of decision logic, fault isolation decision logic in this subsection is described by
{ J i , R M S q J i , t h q f a u l t y J i , R M S q > J i , t h q f a u l t f r e e
and fault compensation is responsible for this.
Remark 5.
Alarm signals from FDOs are broadcasted by the cloud processing unit. In order to realize fault isolation, the alarming subsystems then further employ FIOs in turn after the use of FDOs. Meanwhile, the healthy subsystems, not generating alarm signals, use FDOs at all times.

3.3. The Design of Cooperative Fault-Tolerant Controller

In the two previous subsections, fault detection and isolation have been realized by the design of the fault detection and isolation observer. However, the stability of the whole system cannot be guaranteed because the fault is unsolved during fault diagnosis. For this reason, a cooperative fault-tolerant control scheme for the distributed interconnected system is presented in this subsection.
To ensure that a distributed interconnected system with an actuator failure can maintain stability, the optimal cooperative fault-tolerant control law is designed as u i ( k ) = R i 1 B i i T K i x ^ z i ( k ) R i 1 B i i T S i ( k ) , where the local control gain and cooperative compensation vector are determined from
K i = Q i + A i i T K i A i i A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T K i A i i
S i ( k + 1 ) = [ A i i T A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T ] 1 S i ( k ) K i j N i ¯ ( A i j x j ( k ) + B i j u j ( k ) )
Proof of Theorem 3.
The global optimization problem for the control of the distributed interconnected system with a failure is a quadratic function related to the state and input vectors, which can be given as
min M u i = i = 1 N 1 2 ( x i T ( k ) Q i x i ( k ) + u i T ( k ) R i u i ( k ) )
where Q i is a positive semidefinite matrix.
When one subsystem fails, the effect of cooperative compensation can be achieved by the Hamiltonian for each subsystem
H i = 1 2 ( x i T ( k ) Q i x i ( k ) + u i T ( k ) R i u i ( k ) ) + λ i T ( k + 1 ) ( A i i x i ( k ) + B i i u i ( k )     + j N i ¯ A i j x j ( k ) + j N i ¯ B i j u j ( k ) )
where λ i ( k + 1 ) is the adjoint vector.
According to Equation (15), the necessary optimality conditions, obtained from the optimal control theory [28], are
H i x i = Q i x i ( k ) + A i i T λ i ( k + 1 ) = λ i ( k )
H i u i = R i u i ( k ) + B i i T λ i ( k + 1 ) = 0
Further, the cooperative compensation is calculated through a feedback control, which can be described by
λ i ( k ) = K i x ^ i ( k ) + S i ( k )
Substituting Equations (18) and (2) without the fault f i ( k ) and the noise v i ( k ) into Equation (16) yields
λ i ( k ) = ( Q i + A i i T K i A i i ) x i ( k ) + A i i T K i B i i u i ( k ) + j N i ¯ A i i T K i A i j x j ( k )       + j N i ¯ A i i T K i B i j u j ( k ) + A i i T S i ( k + 1 )
By combining Equations (18) and (2) without the fault f i ( k ) and the noise v i ( k ) , Equation (17) can be re-written as
u i ( k ) = ( R i + B i i T K i B i i ) 1 B i i T K i A i i x i ( k ) j N i ¯ ( R i + B i i T K i B i i ) 1 B i i T K i A i j x j ( k )     j N i ¯ ( R i + B i i T K i B i i ) 1 B i i T K i B i j u j ( k ) ( R i + B i i T K i B i i ) 1 B i i T S i ( k + 1 )
Moreover, substituting Equation (20) into Equation (19) leads to
λ i ( k ) = [ Q i + A i i T K i A i i A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T K i A i i ] x i ( k ) + [ j N i ¯ A i i T K i A i j A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T K i A i j ] x j ( k ) + [ j N i ¯ A i i T K i B i j A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T K i B i j ] u j ( k ) + [ A i i T A i i T K i B i i ( R i + B i i T K i B i i ) 1 B i i T ] S i ( k + 1 )
Theorem 3 can be proven by comparing Equations (18) and (21). □

4. Simulation Example

In this section, the proposed fault diagnosis and cooperative fault-tolerant control scheme is applied to the simplified model of intelligent unmanned vehicle platooning [29], which is shown in Figure 4. A desired separation distance Δ o between adjacent vehicles, and a desired average velocity V o should be assigned under normal operating conditions. Furthermore, the variable Δ D i ( k ) ( i = 2 , 3 , 4 ) represents the deviation from the desired separation distance while the variable Δ V i ( k ) ( i = 1 , 2 , 3 , 4 ) represents the deviation from the desired velocity. Δ d i ( k ) is the real separation distance between the ith and i 1th vehicle at time k , and V i ( k ) is the real velocity of the ith vehicle at time k . Therefore, the state vector and output vector in Equation (1) are x ( k ) = [ Δ V 4 ( k ) Δ D 4 ( k ) Δ V 3 ( k ) Δ D 3 ( k ) Δ V 2 ( k ) Δ D 2 ( k ) Δ V 1 ( k ) ] T and y ( k ) = [ Δ D 4 ( k ) Δ D 3 ( k )   Δ D 2 ( k ) Δ V 1 ( k ) ] T respectively, where Δ D i ( k ) = Δ d i ( k ) Δ o and Δ V i ( k ) = V i ( k ) V o .
The motion of each vehicle is characterized firstly by differential equations with the help of Newton’s second law, and then the state-space representation of the four vehicles platooning can be acquired through expanding the nonlinear term in a Taylor series expansion. The system matrices of the four vehicles platooning are given as follows:
A = [ 0.9048 0 0 0 0 0 0 0.0952 1 0.0952 0 0 0 0 0 0 0.9048 0 0 0 0 0 0 0.0952 1 0.0952 0 0 0 0 0 0 0.9048 0 0 0 0 0 0 0.0952 1 0.0952 0 0 0 0 0 0 0.9048 ] B = [ 0.0952 0 0 0 0.0048 0.0048 0 0 0 0.0952 0 0 0 0.0048 0.0048 0 0 0 0.0952 0 0 0 0.0048 0.0048 0 0 0 0.0952 ] B v = 0.1 B f ,   C = [ 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 ] ,   E = [ 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 ] .  
According to Theorem 1, the four parameters γ 1 , γ 2 , γ 3 , and γ 4 are computed as 1.1339, 1.1339, 1.1339, and 1.0003 respectively, and the detection observer gains are as follows:
L 1 = [ 1.1439 × 10 4 0.2222 ] ,   L 2 = [ 1.1439 × 10 4 0.2222 ] ,   L 3 = [ 1.1439 × 10 4 0.2222 ] ,   L 4 = 4.5233 × 10 4
Then, by solving the condition in Theorem 2, we can obtain the H performance levels δ 4 = 1.8338 , δ 3 = 1.8633 , and the isolation observer parameters
G 4 = 0.9048 ,   G 3 = [ 0.0044 0.6059 ] T ,   Γ 4 = 1 ,   Γ 3 = 8.652 × 10 6 .  
Further, by solving Riccati Equation (9), the local optimal gains can be obtained as:
K 1 = [ 0 . 3112 0 . 3113 ] ,   K 2 = [ 0 . 3112 0 . 3113 ] ,   K 3 = [ 0 . 3112 0 . 3113 ] ,   K 4 = 0 . 0463 .  
In the simulation, the process and measurement noise are assumed as v ( k ) = w ( k ) = [ 0.1 0.1 0.1 0.1 ] T U [ 1 1 ] . Meanwhile, a fault has occurred in the 1st vehicle and is chosen as
f 1 ( k ) = { 0 , k < 30 s 0.8 , k 30 s
The simulation results of fault detection are depicted in Figure 5a–d. It can be observed that the 1st and 2nd vehicles generate alarm signals and the 3rd and 4th vehicles do not generate alarm signals. Thus, the fault detection logic table can be listed as follows:
The simulation results of fault detection are depicted in Figure 5a–d. It can be observed that the 1st and 2nd vehicles generate alarm signals and the 3rd and 4th vehicles don’t generate alarm signals. So the fault detection logic table can be listed as Table 1.
It indicates that the faulty vehicle is located in alarming vehicles and the 3rd and 4th vehicles are healthy because of no alarm signals. Hence, the fault set is defined as {the 1st vehicle, the 2nd vehicle}. The next step is to determine whether the faulty vehicle is the 1st vehicle or the 2nd vehicle. For this purpose, fault isolation for the 1st vehicle and the 2nd vehicle are carried out in turn.
The simulation results of fault isolation for the 1st vehicle and 2nd vehicle are shown in Figure 6 and Figure 7 respectively. It can be seen from Figure 6a,b that the residual assessment values of the 1st and 2nd vehicle are both less than the threshold. However, the residual assessment values of the 1st and 2nd vehicle are both over the threshold in Figure 7a,b after the occurrence of the fault. Based on this, the fault isolation logic table can be listed as Table 2.
Combing the simulation results and the fault isolation logic, it can be found that the faulty vehicle is the 1st vehicle.
Meanwhile, it also can be found from Figure 8 that the fault estimation value of the 1st vehicle can follow the fault value rapidly and accurately in a short time.
In order to guarantee the stability of the whole intelligent unmanned vehicle platooning, a cooperative controller with fault-tolerant ability is applied to the 1st vehicle and LQR controllers are used for the other three vehicles, and fault-tolerant results are further shown in Figure 9. It is obvious that the malfunction of the 1st vehicle brings about fault propagation among intelligent unmanned vehicle platooning, so that the displacement curves of the other three vehicles are no longer parallel with each other for a period of time. However, the displacement curves of the four vehicles are parallel again under the action of cooperative fault-tolerant control, which demonstrates the effectiveness of the fault-tolerant control scheme proposed in this paper.

5. Conclusions

In this paper, a distributed fault diagnosis and cooperative fault-tolerant control framework was developed. To be specific, a fault detection observer was first designed for each subsystem, and the generated alarm signals were broadcast by the cloud processing unit. After that, fault isolation observers and isolation decision logic were used for alarming subsystems in turn to locate the fault accurately. Furthermore, the control unit with the effect of cooperative compensation was constructed to avoid the system instability caused by the faulty subsystem.
It is notable that the scheme in this paper can be employed if and only if a single fault occurs in the system, so it is challenging to study actuator faults as well as sensor faults simultaneously for a distributed interconnected system. Meanwhile, it is very meaningful to combine our proposed method with the distinguished technologies in [8,18,19,20], focusing on optimizing shared information among subsystems, so as to reduce communication costs. These may represent the directions of our future work.

Author Contributions

Conceptualization, A.Q. and S.W.; methodology, X.L. and Z.F.; validation, X.L. and Z.F.; formal analysis, A.Q. and X.L.; writing—original draft preparation, X.L.; writing—review and editing, A.Q., S.W., and J.M.; supervision, A.Q. and S.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 61473159, U2066203), the Six Talent Peaks Project in Jiangsu Province, China (Grant No. XYDXX-091), the Key Program of Science and Technology of Nantong (Grant Nos. MS22020030, MS22020022), and the Natural Science Research Program of Jiangsu Colleges and Universities(Grant No. 20KJA470002).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ding, S.X. Advanced Methods for Fault Diagnosis and Fault-Tolerant Control; Springer: New York, NY, USA, 2021. [Google Scholar] [CrossRef]
  2. Qin, L.; He, X.; Yan, R.; Deng, R.; Zhou, D. Distributed sensor fault diagnosis for a formation of multi-vehicle systems. J. Frankl. Inst. 2017, 356, 791–818. [Google Scholar] [CrossRef]
  3. Ren, W.; Beard, R.W. Distributed Consensus in Multi-Vehicle Cooperative Control: Theory and Applications; Springer: London, UK, 2007; pp. 3–274. [Google Scholar]
  4. Hui, Q.; Yang, J.; Yang, X.; Chen, Z.; Li, Y.; Teng, Y.; Shenyang University of Technology. A robust control strategy to improve transient stability for AC-DC interconnected power system with wind farms. CSEE J. Power Energy Syst. 2019, 5, 259–265. [Google Scholar] [CrossRef]
  5. Zhou, X.; Wang, Y.; Zhang, W.; Yang, W.; Zhang, Y.; Xu, D. Actuator fault-tolerant load frequency control for interconnected power systems with hybrid energy storage system. Energy Rep. 2020, 6, 1312–1317. [Google Scholar] [CrossRef]
  6. Qiu, A.; Yan, Z.; Deng, Q.; Liu, J.; Shang, L.; Wu, J. Modeling of HVAC Systems for Fault Diagnosis. IEEE Access 2020, 8, 146248–146262. [Google Scholar] [CrossRef]
  7. Xia, J.P.; Jiang, B.; Zhang, K. Fault Diagnosis Based on Adaptive Observer for Interconnected Systems. Control. Eng. China 2020, 27, 1452–1457. [Google Scholar]
  8. Blank, M.; Kinnaert, M.; Lunze, J.; Staroswiecki, M. Diagnosis and Fault-Tolerant Control, 3rd ed.; Springer: New York, NY, USA, 2015; pp. 467–518. [Google Scholar]
  9. Antonelli, G. Interconnected dynamic systems: An overview on distributed control. IEEE Control Syst. 2013, 33, 76–88. [Google Scholar] [CrossRef]
  10. Yang, H.; Jiang, B.; Zhou, D.H. Review and perspectives on fault tolerant control for interconnected systems. Acta Autom. Sinica. 2017, 43, 9–19. [Google Scholar]
  11. Li, J.; Wu, C.; Su, Q. Robust fault detection filter design for interconnected systems subject to packet dropouts and structure changes. IET Control Theory Appl. 2018, 12, 368–376. [Google Scholar] [CrossRef]
  12. Reppa, V.; Polycarpou, M.; Panayiotou, C. Decentralized isolation of multiple sensor faults in large-scale interconnected non-linear systems. IEEE Trans. Automat. Contr. 2015, 60, 1582–1596. [Google Scholar] [CrossRef] [Green Version]
  13. Xu, D.Z.; Zhu, F.L.; Zhou, Z.P.; Yan, X.G. Distributed fault detection and estimation in cyber-physical systems subject to actuators. ISA Trans. 2020, 104, 162–174. [Google Scholar] [CrossRef]
  14. Khan, A.S.; Khan, A.Q.; Iqbal, N.; Sarwar, M. Distributed fault detection and isolation in second order networked systems in a cyber-physical environment. ISA Trans. 2020, 103, 131–142. [Google Scholar] [CrossRef] [PubMed]
  15. Zhang, Z.H.; Yang, G.H. Interval Observer-based fault isolation filter for discrete-time fuzzy interconnected systems with unknown interconnections. IEEE Trans Cybern. 2017, 47, 2413–2424. [Google Scholar] [CrossRef] [PubMed]
  16. Liang, D.; Yang, Y.; Li, R.; Liu, R. Finite-frequency H−/H∞ unknown input observer-based distributed fault detection for multi-agent systems. J. Frankl. Inst. 2021, 358, 3258–3275. [Google Scholar] [CrossRef]
  17. Reppa, V.; Polycarpou, M.M.; Panayiotou, C.G. Distributed sensor fault diagnosis for a network interconnected Cyber-Physical Systems. IEEE Trans. Control. Netw. Syst. 2015, 2, 11–23. [Google Scholar] [CrossRef]
  18. Pérez-Zuñiga, C.G.; Chanthery, E.; Travé-Massuyѐs, L.; Sotomayor, J. Fault-driven structural diagnosis approach in a dis-tributed context. IFAC-PapersOnLine 2017, 50, 14254–14259. [Google Scholar] [CrossRef]
  19. Khorasgani, H.; Biswas, G.; Jung, D. Structural Methodologies for Distributed Fault Detection and Isolation. Appl. Sci. 2019, 9, 1286. [Google Scholar] [CrossRef]
  20. Pérez-Zuñiga, G.; Sotomayor-Moriano, J.; Rivas-Perez, R.; Sanchez-Zurita, V. Distributed Fault Detection and Isolation Approach for Oil Pipelines. Appl. Sci. 2021, 11, 11993. [Google Scholar] [CrossRef]
  21. Tong, S.C.; Huo, B.Y.; Li, Y.M. Observer-Based Adaptive Decentralized Fuzzy Fault-Tolerant Control of Nonlinear Large-Scale Systems with Actuator Failures. IEEE Trans Fuzzy Syst. 2014, 22, 1–15. [Google Scholar] [CrossRef]
  22. Yadergar, M.; Meskin, N. Fault-tolerant control of nonlinear heterogeneous multi-agent systems. Automatica 2021, 127, 109514. [Google Scholar] [CrossRef]
  23. Qiu, A.B.; Al-Dabbagh, A.W.; Yu, H.; Chen, T.W. A design framework for event-triggered active fault-tolerant control systems. Int. J. Control. 2020, 94, 2508–2519. [Google Scholar] [CrossRef]
  24. Ren, W.J.; Yang, H.; Jiang, B. Fault-tolerant safe control design of switched and interconnected nonlinear systems. J. Franklin Inst. 2019, 35, 8929–8951. [Google Scholar] [CrossRef]
  25. Li, Q.; Yang, H.; Zhao, D.; Jiang, B. Fault-tolerant control and vibration suppression of flexible spacecraft: An interconnected system approach. Chin. J. Aeronaut. 2020, 33, 2014–2023. [Google Scholar] [CrossRef]
  26. Shi, J.T.; He, X.; Wang, Z.D.; Zhou, D.H. Distributed fault detection for a class of second-order multi-agent systems: An optimal robust observer approach. IET Control Theory Appl. 2014, 8, 1032–1044. [Google Scholar] [CrossRef]
  27. Habibi, H.; Howard, I.; Simani, S.; Fekih, A. Decoupling Adaptive Sliding Mode Observer Design for Wind Turbines Subject to Simultaneous Faults in Sensors and Actuators. IEEE-CAA J. Autom. Sin. 2021, 8, 837–847. [Google Scholar] [CrossRef]
  28. Larbah, E.; Patton, R.J. Fault tolerant plug and play vibration control in building structures. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010; pp. 2462–2467. [Google Scholar] [CrossRef]
  29. Levine, W.S.; Athans, M. On the optimal error regulation of a string of moving vehicles. IEEE Trans. Automat. Contr. 1966, 11, 355–361. [Google Scholar] [CrossRef]
Figure 1. The distributed fault diagnosis and cooperative fault-tolerant control design framework.
Figure 1. The distributed fault diagnosis and cooperative fault-tolerant control design framework.
Sensors 22 02480 g001
Figure 2. The function of the cloud processing unit.
Figure 2. The function of the cloud processing unit.
Sensors 22 02480 g002
Figure 3. Fault detection and isolation schemes, where (a) represents fault detection scheme; (b) represents fault isolation scheme.
Figure 3. Fault detection and isolation schemes, where (a) represents fault detection scheme; (b) represents fault isolation scheme.
Sensors 22 02480 g003
Figure 4. Intelligent unmanned vehicle platooning made up of four vehicles.
Figure 4. Intelligent unmanned vehicle platooning made up of four vehicles.
Sensors 22 02480 g004
Figure 5. The results of fault detection, where (a) represents the 1st vehicle; (b) represents the 2nd vehicle; (c) represents the 3rd vehicle; (d) represents the 4th vehicle.
Figure 5. The results of fault detection, where (a) represents the 1st vehicle; (b) represents the 2nd vehicle; (c) represents the 3rd vehicle; (d) represents the 4th vehicle.
Sensors 22 02480 g005
Figure 6. The results of fault isolation, where (a) represents the 1st vehicle with the use of FIO; (b) represents the 2nd vehicle with the use of FDO.
Figure 6. The results of fault isolation, where (a) represents the 1st vehicle with the use of FIO; (b) represents the 2nd vehicle with the use of FDO.
Sensors 22 02480 g006
Figure 7. The results of fault isolation, where (a) represents the 1st vehicle with the use of FDO; (b) represents the 2nd vehicle with the use of FIO.
Figure 7. The results of fault isolation, where (a) represents the 1st vehicle with the use of FDO; (b) represents the 2nd vehicle with the use of FIO.
Sensors 22 02480 g007
Figure 8. The actual value and estimation value of f 1 ( k ) .
Figure 8. The actual value and estimation value of f 1 ( k ) .
Sensors 22 02480 g008
Figure 9. Displacements of four vehicles with the use of fault-tolerant control scheme.
Figure 9. Displacements of four vehicles with the use of fault-tolerant control scheme.
Sensors 22 02480 g009
Table 1. Fault detection logic.
Table 1. Fault detection logic.
NumbersDecision Results
The 1st vehicle1
The 2nd vehicle1
The 3rd vehicle0
The 4th vehicle0
Table 2. Fault isolation logic.
Table 2. Fault isolation logic.
TypesNumbersDecision Results
Fault isolation for the 1st vehicleThe 1st vehicle (FIO)0
The 2nd vehicle (FDO)0
Fault isolation for the 2nd vehicleThe 1st vehicle (FDO)1
The 2nd vehicle (FIO)1
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, X.; Fan, Z.; Wang, S.; Qiu, A.; Mao, J. A Distributed Fault Diagnosis and Cooperative Fault-Tolerant Control Design Framework for Distributed Interconnected Systems. Sensors 2022, 22, 2480. https://doi.org/10.3390/s22072480

AMA Style

Li X, Fan Z, Wang S, Qiu A, Mao J. A Distributed Fault Diagnosis and Cooperative Fault-Tolerant Control Design Framework for Distributed Interconnected Systems. Sensors. 2022; 22(7):2480. https://doi.org/10.3390/s22072480

Chicago/Turabian Style

Li, Xue, Zhikang Fan, Shengfeng Wang, Aibing Qiu, and Jingfeng Mao. 2022. "A Distributed Fault Diagnosis and Cooperative Fault-Tolerant Control Design Framework for Distributed Interconnected Systems" Sensors 22, no. 7: 2480. https://doi.org/10.3390/s22072480

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop