Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints

Yang, Shuai; Zou, Zhihui; Li, Yingchao; Shi, Haodong; Fu, Qiang

doi:10.3390/drones7020107

Open AccessArticle

Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints

by

Shuai Yang

^1,2,

Zhihui Zou

^1,2,

Yingchao Li

^1,2,*,

Haodong Shi

^1,2 and

Qiang Fu

^1,2

¹

School of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China

²

Jilin Provincial Key Laboratory of Space Optoelectronics Technology, Changchun 130022, China

^*

Author to whom correspondence should be addressed.

Drones 2023, 7(2), 107; https://doi.org/10.3390/drones7020107

Submission received: 30 December 2022 / Revised: 29 January 2023 / Accepted: 2 February 2023 / Published: 4 February 2023

(This article belongs to the Special Issue Advanced Intelligent Decision-Making and Flight Control of Unmanned Aerial Vehicles)

Download

Browse Figures

Versions Notes

Abstract

:

This paper presents a study on a quadrotor unmanned aerial vehicle (UAV) fault-tolerant control scheme. According to the attitude model and safety control of the aircraft under the uncertainty of inertial matrix, the attitude state constraint by reinforcement learning is designed to ensure safety. Even if the boundary is crossed, it can be pulled back to the boundary by means of a designed penalty function with reinforcement learning. Meanwhile, in order to inhibit the oscillation caused by immediate reward as usual, an adaptive update law is proposed. Furthermore, considering the coupled actuator fault and system input saturation due to uncertainty of inertial matrix, the Nussbaum-type function is utilized in this work to handle this challenge, which likely causes the singularity of inertia matrix. As a consequence, combined with the Lyapunov stability theory, it is confirmed that the proposed FTC scheme ensures that all the closed-loop signals are bounded. Simulation results are carried out to illustrate the effectiveness and advantage of the proposed control scheme.

Keywords:

quadrotor UAV; fault-tolerant control; uncertainty inertial matrix; RBF neural network (RBFNN); backstepping control; state constraint

1. Introduction

As a classic unmanned aerial vehicle (UAV), quadrotor has attracted the attention of many researchers [1,2]. Because of its excellent performance in fast mobility, high convenience, and low structural complexity, it is widely used in military and civilian fields, such as rescue [3], aerial photography, map [4], detection [5], etc. The motion control scenarios for traditional quadrotors include trajectory tracking and attitude tracking. Attitude tracking is an important component to achieve the above complex tasks. Due to the uncertainty of inherent parameters and internal and external interference, designing a high-precision attitude tracking controller is a challenging problem in engineering practice.

In the recent decade, for the problem of quadrotor attitude tracking control, the control algorithms commonly used in China include: PID, sliding mode control, fuzzy control, backstepping method, optimization algorithm, data-driving control [6], etc. The purpose is to improve the anti-interference ability of the quadrotor UAV during flight, making the flight more stable. Zhang et al. [7] designed a PID controller based on small disturbances for three channels, and achieved better anti-interference performance and control stability within a certain error range. Hu et al. [8] designed a particle swarm optimization algorithm with variable weight and hybridization. The inertia weight is controlled by iteration and setting coefficients, and hybrid evolution is introduced to optimize the parameter settings and improve the flight control performance of the quadrotor. Based on the proximal strategy optimization algorithm, Jia et al. [9] improved the reinforcement learning algorithm combined with the model, and the improved algorithm achieved rapid convergence in quadrotor attitude control. Chen [10] proposes an attitude tracking control scheme that combines the integral backstepping sliding mode algorithm with an extended observer. The attitude feedback control is performed in the inner and outer loops, and the integration link is introduced for the uncertainty error, so that the steady-state error of the system is reduced and the robustness is improved. Labbadi et al. [11] proposed an adaptive inversion sliding mode control algorithm. According to the estimated compensation value of the uncertainty, the inversion sliding mode control outputs the state of the flight attitude, which enhances the robustness and realizes fast response and small tracking error. Wu et al. [12] proposed an inner and outer loop control algorithm, in which sliding mode control and active disturbance rejection control were fused in the outer loop for compound control. The simulation results verified the superiority of the inner and outer loop control algorithms and improved the anti-interference ability. Existing research, such as that of Zhang et al. [7], is based on the PID algorithm of error elimination error, a model which is simple and easy to understand, and is widely used. However, when faced with a quadrotor model with a high degree of nonlinearity and large external uncertainty disturbances, the stability of the quadrotor’s attitude is inefficient, the robustness is greatly reduced, and it is difficult to meet high-precision and high-level tracking requirements. The optimization algorithms that were improved and applied in [8,9] involve a large amount of iterative training. Offline learning is difficult to cope with highly dynamic environments, cannot respond in real time, and lacks heuristic tuning rules to guide engineers. The sliding mode controllers used in [10,11] have a complex structure, and the symbolic function introduced leads to a large overshoot and serious chattering of the control value, which is not conducive to the driving and long-term service of the actual actuator.

With the wide application of quadrotors, their corresponding fault-tolerant control attracts a large number of scholars for research, forming many research achievements. Wen et al. [13] designed an adaptive fuzzy neural approximator to estimate faults, and designed a sliding mode fault-tolerant controller. Ductian et al. [14] designed a comprehensive fault diagnosis and fault-tolerant control method based on two-stage Kalman filter and gain-scheduled control synthesis to solve actuator faults in quadrotor aircraft. Nian et al. [15] designed a robust adaptive fault estimation observer, and designed a dynamic output feedback fault-tolerant controller for UAV systems with faults and uncertainties to achieve a stable state. Zhu et al. [16] proposed a state fault estimation with switching PI observer controller, and proposed a fault-tolerant controller for nonlinear switching systems to achieve the goal of asymptotic convergence to both system states and faults. The above literature all propose solutions for the single fault of UAV actuators. However, in practical applications, quadrotor UAVs may also have partial actuator failures and bias faults at the same time. This kind of failure may reduce the tracking performance of the UAV system, make the controller invalid, and even deteriorate the stability of the system. To solve this problem, Rudin et al. [17] proposed a robust fault diagnosis method of

H \infty

filtering, which is used to estimate the size of the fault. Liu et al. [18], utilizing the strong approximation characteristics of radial basis neural network, proposed a fault-tolerant control scheme based on radial basis neural network to compensate for parameter uncertainty, external disturbance, and actuator failure. Wen et al. [13] proposed an adaptive predetermined performance control scheme for quadrotor UAVs with actuator failures. However, the control scheme in this literature only solves the problem of constant faults, and does not solve the problem of trajectory tracking under time-varying faults.

According to the analysis of the above literature, the better the robustness of the quadrotor UAV, the better the tracking performance in practical applications. When a quadrotor UAV actuator has complex faults—uncertainties of inertial matrix, inducing system uncertainties and eccentric moment—it will seriously affect the performance of the flight control system of the quadrotor UAV, which greatly increases the difficulty of fault-tolerant controller design; for instance, the singularity of inertia matrix of a quadrotor UAV. In this paper, motivated by the complex faults combined with system input saturation, an adaptive fault-tolerant tracking controller consisting of states constraint, reinforcement learning [19], and backstepping control framework is proposed to achieve the desired control objective. The main innovation points are summed up as follows:

(1) Contrary to the usual log-type barrier Lyapunov function [20], a novel state-constraint mechanism is proposed, which can ensure that the system states maintain in the designed constraints. Even if the boundary is crossed, it can be pulled back to the boundary by means of a designed penalty function with reinforcement learning. Meanwhile, in order to inhibit the oscillation caused by immediate reward as usual, an adaptive update law is designed in this work.

(2) Based on the backstepping fault-tolerant control framework and the state constraint obtained in (1), the eccentric torque and actuator partial failure faults suffered by a quadrotor UAV are input into the backstepping fault-tolerant control framework through Nussbaum-type function combined with adaptive control method using the norm bound method to achieve the bounded stability.

The article is organized as follows: Section 2 is devoted to establish the model of a quadrotor UAV with uncertainties of inertial matrix and its inducing system uncertainties and eccentric moment. Section 3 delivers the definition of Nussbaum-type function, which is crucial to handle the uncertainty of inertial matrix. Subsequently, the adaptive fault-tolerant control strategy is proposed in Section 4. In Section 5, numerical simulations are made to illustrate the effectiveness of the designed fault-tolerant tracking control (FTC) algorithm. In Section 6, the conclusion is summarized.

2. Problem Formulation

2.1. Attitude Dynamics of Quadrotor UAV

2.1.1. Attitude Angle Dynamic Equation

In order to describe the attitude states of the quadrotor UAV, the aircraft-body coordinate frame

O_{B} X_{B} Y_{B} Z_{B}

and inertial coordinate frame

O_{E} X_{E} Y_{E} Z_{E}

are brought into this work, as shown in Figure 1.

\begin{matrix} \dot{γ} = ℜ (γ) ω \end{matrix}

(1)

where

γ = {[α, β, μ]}^{T}

represents the attitude angle vector of a quadrotor UAV.

ω = {[p, q, r]}^{T}

stands for the attitude angular velocity vector of a quadrotor UAV in the aircraft-body coordinate system.

ℜ (γ)

represents the transformation matrix from the aircraft-body coordinate system to the inertial coordinate system, and yields

\begin{matrix} ℜ (γ) = [\begin{matrix} 1 & \sin α \tan β & \cos α \tan β \\ 0 & \cos α & - \sin α \\ 0 & \sin α \sec β & \cos α \sec β \end{matrix}] \end{matrix}

(2)

2.1.2. Attitude Angular Rate Dynamics

As depicted in Figure 1, the attitude angular rate dynamic of a quadrotor UAV with uncertainty of inertial matrix, system uncertainty, and external disturbance, and one has

\begin{matrix} (J^{*} + Δ J) \dot{ω} = - ω^{\times} (J^{*} + Δ J) ω + Λ + υ + d \end{matrix}

(3)

where

J^{*} \in R^{3 \times 3}

represents the nominal inertial matrix, and the external disturbance moment is denoted by d. Furthermore, the operator

ω^{\times}

working on the vector

ω = {[p, q, r]}^{T}

results in that

\begin{matrix} ω^{\times} = [\begin{matrix} 0 & - r & q \\ r & 0 & - p \\ - q & p & 0 \end{matrix}] \end{matrix}

(4)

Through a close inspection of (3), the challenges of a quadrotor UAV considered in this work distinguish the usual fault-tolerance of a quadrotor UAV, which is the uncertainty of inertial matrix (

Δ J

) combined with its inducing system uncertainty as well as the eccentric moment. The details are analyzed as follows:

(a) Uncertainty of inertial matrix. The uncertainty

Δ J

, as shown in (3), represents an uncertain part of

J^{*}

, which is derived from a movement of the mass center of a quadrotor UAV, denoted as

\bar{ρ} = {[Δ x, Δ y, Δ z]}^{T}

. By the aid of Varignon’s theorem and Parallel-Axis Theorem, it yields

\begin{matrix} Δ J = [\begin{matrix} Δ J_{x x} & - J_{x y} & - J_{x z} \\ - J_{x y} & Δ J_{y y} & - J_{y z} \\ - J_{x z} & - J_{y z} & Δ J_{z z} \end{matrix}] \end{matrix}

(5)

where

J_{x y}, J_{x z}

, and

J_{y z}

are the products of inertia, and

Δ x, Δ y

, and

Δ z

stand for the three components of the offset vector

\bar{ρ}

along the air-craft body coordinate frame

O_{B} X_{B} Y_{B} Z_{B}

, as shown in Figure 1. The details are shown as follows [21]:

\begin{matrix} Δ J_{x x} & = m (Δ y^{2} + Δ z^{2}), Δ J_{y y} = m (Δ x^{2} + Δ z^{2}), \\ Δ J_{z z} & = m (Δ x^{2} + Δ y^{2}), J_{x y} = m Δ x Δ y, J_{x z} = m Δ x Δ z, \\ J_{y z} & = m Δ y Δ z \end{matrix}

(6)

where m denotes the mass of a quadrotor UAV. It should be pointed that the inverse matrix of the inertial matrix with unknown

Δ J

suffers from the risk of being a singular matrix, which causes the failure of usual control algorithms that depend on the inverse matrix of J, for instance, adaptive control, sliding mode control, etc.

(b) System uncertainty. Based on (3), it yields that the system uncertainty caused by

Δ J

is hard to separate from

- {(J^{*} + Δ J)}^{- 1} ω^{\times} (J^{*} + Δ J) ω

. In addition, because of products of inertia caused by

Δ J

, the system uncertainties analyzed above further aggravate the coupling between the longitudinal and lateral dynamics of a quadrotor UAV, making a huge challenge for the FTC controller design.

(c) Eccentric moment induced by

Δ J

. The eccentric moment

Λ

, derived from

Δ_{x}, Δ_{y}, Δ_{z}

, can be modeled as

Λ = [\begin{matrix} 0 & - υ_{z} & υ_{y} \\ υ_{z} & 0 & - υ_{x} \\ - υ_{y} & υ_{x} & 0 \end{matrix}] [\begin{matrix} Δ x \\ Δ y \\ Δ z \end{matrix}] = Θ^{\times} ς

(7)

where

υ = {[υ_{x}, υ_{y}, υ_{z}]}^{T}

are the control moment, which is produced by

F_{1}, F_{2}, \dots, F_{4}

, as shown in Figure 1. In terms of the control moment,

υ = {[υ_{x}, υ_{y}, υ_{z}]}^{T}

,

υ_{x}, υ_{y}

, and

υ_{z}

denote the rolling, pitching, yawing moments, respectively. The corresponding details are delivered in the following subsection.

2.2. Actuator Fault and System Input Saturation

In this work, the partial loss of efficiency fault of actuator combined with system input saturation constraints is taken into consideration. Thus, the system control input of (3) is delivered as

\begin{matrix} υ = F u (τ) + \bar{u} \end{matrix}

(8)

where

u (τ)

stands for the control input with saturation constraints.

F = diag {l_{1}, \dots, l_{8}}

with

0 \leq l_{i} \leq 1 (l_{i} = 1, \dots, 8)

represents the fault matrix that reflects the health condition of the corresponding actuator, and

\bar{u}

represents the stuck fault of a actuator. Subsequently,

u (τ)

can be formulated as

u (τ) = {[sat (τ_{1}), sat (τ_{2}), sat (τ_{3})]}^{T}

, which has

u (τ_{i}) = sat (τ_{i}) = \{\begin{matrix} u_{τ i \max}, \\ τ_{i}, \\ u_{τ i \min}, \end{matrix} \begin{matrix} τ_{i} > u_{τ i \max} \\ |τ_{i}| \leq u_{τ i \max} \\ τ_{i} < - u_{τ i \max} \end{matrix}

(9)

where

u_{τ i \max}

is the maximum moment produced by

F_{1}, F_{2}, \dots, F_{4}

, as shown in Figure 1. For streamlining the analysis, a smooth function is adopted to approximate (9), and we have

\begin{matrix} u (τ) = κ (τ) + ε (τ) \end{matrix}

(10)

where

κ (τ) = {[κ_{1} (τ_{1}), κ_{2} (τ_{2}), κ_{3} (τ_{3})]}^{T}

. With the aid of hyperbolic tangent function,

κ_{i} (τ_{i})

is obtained as

\begin{matrix} κ_{i} (τ_{i}) = u_{τ i \max} tanh (τ_{i} / u_{τ i \max}) \end{matrix}

(11)

where

ε (τ i)

is the approximation error vector satisfying

|ε_{τ i} (ν_{i})| = |s a t (ν_{i}) - κ_{τ i} (ν_{i})| \leq u_{τ i max} (1 - tanh (1))

. Furthermore, drawing support from mean-value theorem combined with

κ (0) = 0

, according to [20],

κ_{i} (τ_{i})

is further modified as

\begin{matrix} κ_{i} (τ_{i}) = \frac{\partial κ_{i} (\cdot)}{\partial τ_{i}} τ_{i} = h_{i} ν_{i} \end{matrix}

(12)

As a result, the system control input

υ

can be further modified as

\begin{matrix} υ = Υ τ + F ε (τ) + \bar{u} \end{matrix}

(13)

where

Υ = d i a g (l_{i} h_{i}), i = 1, 2, \dots, 3

. Based on the definition

Υ

shown in (13),

Υ

is a time-varying coefficient matrix reflecting the information of actuator fault and input saturation.

2.3. Problem Statement

This work is devoted to proposing an adaptive fault-tolerant control for attitude tracking of a quadrotor UAV to achieve the the following two targets, despite the presence of uncertainty of

Δ J

combined with its seducing system uncertainties and eccentric moment, and actuator fault and input saturation:

Q_{1}

: The system output

γ

tracks the desired trajectory

γ_{c}

, while the steady-state behavioral boundedness of the attitude angles (

ϕ, θ

, and

ψ

) is preserved.

Q_{2}

: All signals in the closed-loop systems are bounded.

Before proceeding further, the following assumptions should be made.

Assumption A1.

The desired tracking command signals

γ_{c}

is continuous and bounded.

Assumption A2.

The inverse of the inertial matrix

J = (J^{*} + Δ J)

exists.

Assumption A3.

The effects of eccentric moment Λ is bounded. In addtiton, the disturbance satisfies

∥d∥ \leq ℓ_{d}

, where

ℓ_{d}

is the unknown constant satisfying

ℓ_{d} > 0

.

Remark 1.

Under Assumption 2, the inertial matrix

(J + Δ J)

is an invertible matrix, but the inverse of

(J + Δ \hat{J})

may not exist because of the estimation of

Δ \hat{J}

by estimator.

3. Preliminary Knowledge

In this section, some definitions and preliminary results, applied for the control design and the closed-loop stability analysis, are delivered as follows.

A Nussbaum gain works as a control-direction selector that can swing from positive to negative based on the control performance [22]. On account of

Υ

(13) being a time-varying coefficient matrix, the Nussbaum gain technique is adopted in this work to handle this challenge.

Definition 1.

A function

N (\cdot)

, named as Nussbaum-type function, possesses the following characters [20,22]:

\begin{matrix} \lim_{θ \to \infty} \inf \frac{1}{θ} \int_{0}^{θ} N (Φ) Φ = - \infty, \lim_{θ \to \infty} \sup \frac{1}{θ} \int_{0}^{θ} N (Φ) Φ = + \infty \end{matrix}

(14)

According to [20], the Nussbaum-type function of this work is selected as

\begin{matrix} N (Φ) = e^{Φ^{2} / 2} (Φ^{2} + 2) sin (Φ) \end{matrix}

(15)

where Φ is the state of a Nussbaum-type function.

Lemma 1

([22]).

V (t)

and

Φ_{i} (t) (i = 1, 2, \dots, N)

are smooth functions in

[0, t_{f})

, satisfying

V (t) \geq 0

,

Φ_{i} (0) = 0

. If

N (\cdot)

is chose as (15) and the following inequality maintains

\begin{matrix} V (t) \leq ℏ_{0} + e^{- ℏ_{1} t} \sum_{i = 1}^{N} \int_{0}^{t} (- σ_{i} (λ) N (Φ_{i} (λ)) + 1) {\dot{Φ}}_{i} (λ) e^{ℏ_{1} λ} d λ \end{matrix}

(16)

where

ℏ_{0}

is a bounded constant, and parameter

ℏ_{1}

satisfies

ℏ_{1} > 0

.

σ_{i} (t) \neq 0

is a time-varying parameter which is selected from the unknown set

Π_{σ} : = [ψ^{-}, ψ^{+}]

(all

σ_{i} (t)

have the same sign). And then, it indicates that

V (t)

,

Φ_{i} (t)

,

\sum_{i = 1}^{N} \int_{0}^{t} σ_{i} (λ) N (Φ_{i} (λ)) {\dot{Φ}}_{i} (λ) d λ

are bounded on

[0, t_{f})

.

4. Integral Reinforcement Learning-Based Adaptive Neural Network Fault-Tolerant Control

As shown in Figure 2, an adaptive FTC scheme based on integral reinforcement learning-based (IRL-based) adaptive neural network (NN) fault-tolerant control under the backstepping frame is proposed, ensuring that the system states can maintain in the designed constraints. Even if the boundary is crossed, it can be pulled back to the boundary by means of a designed penalty function with reinforcement learning.

For this purpose, in the light of backstepping derivation, the following coordinate transformation is taken into consideration:

z_{1 i} = \frac{γ_{i}^{}}{k_{b i}^{}}, i = 1, 2, 3

,

z_{2} = γ - γ_{c}

and

z_{3} = ω - ω_{c}

, where

ω_{c}

is a virtual control law to be designed at a later stage. The details are shown as follows:

4.1. State Constraints Penalty Function by Critic NN

For the control target that the attitude states remain in a constraint region without violation due to the demand of FTC control, it can be achieved by means of constraining

z_{1}

satisfies

z_{1}^{T} z_{1} = \sum_{i = 1}^{3} \frac{γ_{i}^{2}}{k_{b i}^{2}} < c_{κ}

all the time. Even if the boundary is crossed, it can still be pulled back. To this aim, inspired by the idea of reinforcement learning algorithm, the following discount returns are designed:

\begin{matrix} Γ (t) = \int_{t}^{\infty} {\bar{λ}}^{\frac{- ζ + t}{T}} κ (z_{1} (ζ)) d ζ \end{matrix}

(17)

where

T > 0

denotes a small integral reinforcement interval. A discount factor in this work is denoted as

\bar{λ} \in (0, 1)

, which can decrease the effects of a current reward for the future. When

z_{1}^{T} z_{1} = \sum_{i = 1}^{3} \frac{γ_{i}^{2}}{k_{b i}^{2}} < c_{κ}

is satisfied, it means that the state constraint objective is achieved. The

Γ (t)

will not increase, and it can be inferred that the smaller

Γ (t)

, the better. Conversely, the controller should be adjusted to make

z_{1}, z_{2}

, and

Γ (t)

smaller even if there exists uncertainties caused by

Δ J

, eccentric moment, actuator fault, etc. Therefore, it infers that the desired value

Γ_{d}

is

Γ_{d} = {[0, 0, 0]}^{T}

, and the immediate reward

κ (z_{1}) = {[κ (z_{11}), κ (z_{12}), κ (z_{13})]}^{T}

can be designed as follows:

κ (z_{1 i} (ζ)) = \{\begin{matrix} 0 \\ 1 \end{matrix} \begin{matrix}  \end{matrix} \begin{matrix}  \end{matrix} \begin{matrix} i f \\ i f \end{matrix} \begin{matrix} {z_{1 i}}^{2} \leq c_{κ i} \\ {z_{1 i}}^{2} > c_{κ i} \end{matrix}, ζ \in [t - T, t)

(18)

where a small threshold is represented by

c_{κ i} > 0

. In this work, the control strategy is made that

κ (z_{1 i}) = 0

reflects a good control performance, while

κ (z_{1 i}) = 1

results in a bad control performance, which means that

Γ (t)

will be increased. The current control should be adjusted to decrease the increase of

Γ (t)

so that the out of bounds state

z_{1}

can return to the constraint area again.

Afterwards, resorting to Bellman error iteration,

Γ (t - T)

and

Γ (t)

yield

\begin{matrix} Γ (t - T) & = \int_{t - T}^{\infty} ϱ^{\frac{- ζ + t - T}{T}} κ (z_{1} (ζ)) d ζ = {\bar{λ}}^{- 1} Γ (t) + \int_{t - T}^{t} {\bar{λ}}^{\frac{- ζ + t - T}{T}} κ (z_{1} (ζ)) d ζ \\ = {\bar{λ}}^{- 1} (Γ (t) + κ_{c}) \end{matrix}

(19)

where

κ_{c}

is defined as

κ_{c} = max {0, \int_{t - T}^{t} {\bar{λ}}^{\frac{- ζ + t}{T}} κ (z_{1} (ζ)) d ζ}

that is the value cost in the interval

[t - T, t)

, where

∥κ_{c}∥ \leq b_{κ_{c}}

with a positive constant

b_{κ_{c}}

. In addition, to overcome the vibration caused by operator max, in this work, the approximation of

{\hat{κ}}_{c} = {[{\hat{κ}}_{c 1}, {\hat{κ}}_{c 2}, {\hat{κ}}_{c 3}]}^{T}

is introduced by means of adaptive control.

On the basis of (18),

\int_{t - T}^{t} {\bar{λ}}^{\frac{- ζ + t}{T}} κ (z_{1 i} (ζ)) d ζ

can be further deduced:

\int_{t - T}^{t} {\bar{λ}}^{\frac{- ζ + t}{T}} κ (z_{1 i} (ζ)) d ζ = \{\begin{matrix} 0 \\ \frac{T}{\ln \bar{λ}} (\bar{λ} - 1) \end{matrix} \begin{matrix} i f \\ i f \end{matrix} \begin{matrix} z_{1 i} {(t)}^{2} \leq c_{κ i} \\ z_{1 i} {(t)}^{2} > c_{κ i} \end{matrix}

(20)

Since the future system information is involved in

Γ (t)

, as shown in (17), it is difficult to solve. According to [23,24,25], based on the value function approximation technique and Bellman Optimality Equation, a critic RBFNN is utilized in this work to handle this solving problem, and we have

\begin{matrix} Γ (t) = W_{c}^{*} H_{c} (x_{c} (t)) + O_{c} (x_{c} (t)) \end{matrix}

(21)

where

W_{c}^{* T}

stands for the ideal weight, satisfying that

{∥W_{c}^{*}∥}_{F} \leq b_{W c}

and

b_{W c}

is a positive constant.

l_{c}

is the number of hidden layers, and

x_{c}

stands for the input of the RBFNN applied in this work.

H_{c} (x_{c})

is the Gaussian basis function of RBFNN. It is also assumed that

∥H_{c} (x_{c})∥ \leq b_{H_{c}}

, where

b_{H_{c}}

is a positive constant, so is

O_{c} (x_{c})

, satisfying

∥O_{c} (x_{c})∥ \leq b_{O_{c}}

with constrained boundary

b_{O_{c}} > 0

. Owing to the ideal weight

W_{c}^{*}

being unknown,

Γ (t)

is estimated in real time with following form:

\begin{matrix} \hat{Γ} (t) = {\hat{W}}_{c}^{T} H_{c} (x_{c} (t)) \end{matrix}

(22)

In addition, the estimation of

Γ (t - T)

follows:

\begin{matrix} \hat{Γ} (t - T) = {\hat{W}}_{c}^{T} H_{c} (x_{c} (t - T)) \end{matrix}

(23)

Then, the temporal difference error is denoted as

\begin{matrix} e_{Γ_{c}} & = \hat{Γ} (t) - \bar{λ} \hat{Γ} (t - T) + {\hat{κ}}_{c} \\ = {\tilde{W}}_{c}^{T} Δ H_{c} (t) + {\hat{κ}}_{c} + W_{c}^{* T} Δ H_{c} (t) \end{matrix}

(24)

where

Δ H_{c} (t) = [H_{c} (x_{c} (t)) - \bar{λ} H_{c} (x_{c} (t - T))]

, resulting in

∥Δ H_{c} (t)∥ \leq (1 + \bar{λ}) b_{H_{c}}

. The adaptive laws of

{\hat{W}}_{c}

and

{\hat{κ}}_{c}

are designed as follows:

{\hat{W}}_{c}

is updated by

\begin{matrix} {\dot{\hat{W}}}_{c} & = - Λ_{c} Δ H_{c} (t) {[{\hat{W}}_{c}^{T} Δ H_{c} (t) + {\hat{κ}}_{c}]}^{T} - ℓ_{c} Λ_{c} {\hat{W}}_{c} \\ {\dot{\hat{κ}}}_{c} & = - η_{κ} l_{W_{c}} {\hat{W}}_{c}^{T} Δ H_{c} (t) - η_{κ} l_{κ_{c}} {\hat{κ}}_{c} - η_{κ} p {(z_{2}^{T} ℓ_{Γ} {\hat{W}}_{c}^{T} H_{c} (x_{c}))}^{T} \end{matrix}

(25)

where

Λ_{c} = d i a g {Λ_{c 1}, Λ_{c 2}, Λ_{c 3}} > 0

denotes the learning rate matrix, combined with

ℓ_{c} > 0

. Besides,

{\tilde{W}}_{c} = {\hat{W}}_{c} - W_{c}^{*}

stands for the weight error for critic NN, so does

{\tilde{κ}}_{c}

. The estimation error is defined as

{\tilde{κ}}_{c} = 0 - {\hat{κ}}_{c}

.

4.2. Attitude Angle Controller Design with State Constraints by Critic NN

As for the attitude tracking error

z_{2}

under the constraints of system states (namely,

z_{1}^{T} z_{1} < c_{κ}

), a control law based on the backstepping control is designed to ensure that the state constraints

z_{1}

are not violated and the tracking error

z_{2}

is small enough. To this aim, the candidate Lyapunov function is delivered as

\begin{matrix} V_{1} & = \frac{1}{2} z_{2}^{T} z_{2} + \frac{ℓ_{W_{c}}}{2} tr ({\tilde{W}}_{c}^{T} Λ_{c}^{- 1} {\tilde{W}}_{c}) + \frac{1}{2 η_{κ_{c}}} {\tilde{κ}}_{c}^{T} {\tilde{κ}}_{c} \\ = V_{11} + V_{12} \end{matrix}

(26)

where

ℓ_{W_{c}}

is a positive coefficient used for theoretical analysis. Besides,

V_{11} = \frac{1}{2} z_{2}^{T} z_{2}

, and the remaining items make up

V_{12}

. And then, taking the time derivative of

V_{11}

gives

\begin{matrix} {\dot{V}}_{11} = z_{2}^{T} {\dot{z}}_{2} = z_{2}^{T} (ℜ (z_{3} + ω_{c}) - {\dot{γ}}_{c}) \end{matrix}

(27)

where

i = 1, 2, 3

. Ideally, the intermediate controller

ω_{c}

is designed as

\begin{matrix} ω_{c} & = ℜ {(γ)}^{- 1} (- k_{1} z_{2} + {\dot{γ}}_{c} - ℓ_{Γ} {\hat{W}}_{c}^{T} H_{c} (x_{c} (t)) p^{T} {\hat{κ}}_{c}) \end{matrix}

(28)

where

k_{1}

is symmetric positive definite.

p = {[\begin{matrix} 1 & 1 & 1 \end{matrix}]}^{T}

.

As for

V_{11}

, by substituting (28) into (27), one has

\begin{matrix} {\dot{V}}_{11} = - z_{2}^{T} k_{1} z_{2} + z_{2}^{T} ℜ z_{3} - z_{2}^{T} ℓ_{Γ} {\hat{W}}_{c}^{T} H_{c} (x_{c}) p^{T} {\hat{κ}}_{c} \end{matrix}

(29)

Furthermore, the derivative of

V_{12}

is obtained as

\begin{matrix} {\dot{V}}_{12} & = l_{W_{c}} tr ({\tilde{W}}_{c}^{T} Λ_{c}^{- 1} {\dot{\tilde{W}}}_{c}) + \frac{1}{η_{κ}} {\tilde{κ}}_{c}^{T} {\dot{\tilde{κ}}}_{c} = - l_{W_{c}} tr ({\tilde{W}}_{c}^{T} Δ Φ_{c} (t) {[{\tilde{W}}_{c}^{T} Δ Φ_{c} (t) + W_{c}^{* T} Δ Φ_{c} (t)]}^{T}) \\ - l_{W_{c}} ℓ_{c} tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) - l_{W_{c}} ℓ_{c} tr ({\tilde{W}}_{c}^{T} W_{c}^{*}) - l_{W_{c}} tr ({\tilde{W}}_{c}^{T} Δ Φ_{c} (t) {\hat{κ}}_{c}^{T}) + \frac{1}{η_{κ}} {\tilde{κ}}_{c}^{T} {\dot{\hat{κ}}}_{c} \\ \leq - l_{W_{c}} tr ({\tilde{W}}_{c}^{T} (Δ Φ_{c} (t) Δ Φ_{c} {(t)}^{T} + ℓ_{c} I) {\tilde{W}}_{c}) + l_{W_{c}} {∥{\tilde{W}}_{c}∥}_{F} (∥Δ Φ_{c} (t)∥ ∥W_{c}^{* T} Δ Φ_{c} (t)∥ + ℓ_{c} {∥{\tilde{W}}_{c}^{*}∥}_{F}) \\ - l_{W_{c}} tr ({\tilde{W}}_{c}^{T} Δ Φ_{c} (t) {\hat{κ}}_{c}^{T}) + \frac{1}{η_{κ}} {\tilde{κ}}_{c}^{T} {\dot{\hat{κ}}}_{c} \\ \leq - l_{V} k_{c} tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) + b_{V_{c}} {∥{\tilde{W}}_{c}∥}_{F} - l_{W_{c}} tr ({\tilde{W}}_{c}^{T} Δ Φ_{c} (t) {\hat{κ}}_{c}^{T}) - \frac{1}{η_{κ}} {\tilde{κ}}_{c}^{T} {\dot{\hat{κ}}}_{c} \end{matrix}

(30)

where

b_{V_{c}} = l_{W_{c}} {(1 + λ)}^{2} b_{H_{c}}^{2} b_{W_{c}} + l_{W_{c}} ℓ_{c} b_{W_{c}}

.

Further,

{\dot{V}}_{1}

is obtained as

\begin{matrix} {\dot{V}}_{1} & = {\dot{V}}_{11} + {\dot{V}}_{12} \\ \leq - z_{2}^{T} k_{1} z_{2} + z_{2}^{T} ℜ z_{3} - l_{W_{c}} k_{c} tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) + b_{V_{c}} {∥{\tilde{W}}_{c}∥}_{F} + l_{W_{c}} tr (W_{c}^{* T} Δ Φ_{c} (t) {\tilde{κ}}_{c}^{T}) + l_{κ_{c}} {\tilde{κ}}_{c}^{T} {\hat{κ}}_{c} \\ \leq - z_{2}^{T} k_{1} z_{2} + z_{2}^{T} ℜ z_{3} - l_{W_{c}} k_{c} tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) + b_{V_{c}} {∥{\tilde{W}}_{c}∥}_{F} - \frac{l_{κ_{c}}}{2} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + \frac{l_{κ_{c}}}{2} {∥κ_{c}∥}_{F}^{2} \\ + l_{W_{c}} b_{W_{c}} b_{H_{c}} (1 + \bar{λ}) {∥{\tilde{κ}}_{c}^{}∥}_{F} \end{matrix}

(31)

where

\begin{matrix} l_{κ_{c}} {\tilde{κ}}_{c}^{T} {\hat{κ}}_{c} \leq - \frac{l_{κ_{c}}}{2} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + \frac{l_{κ_{c}}}{2} {∥κ_{c}∥}_{F}^{2} \\ l_{W_{c}} tr (W_{c}^{* T} Δ Φ_{c} (t) {\tilde{κ}}_{c}^{T}) \leq l_{W_{c}} b_{W_{c}} b_{H_{c}} (1 + \bar{λ}) {∥{\tilde{κ}}_{c}^{}∥}_{F} \\ - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + l_{W_{c}} b_{W_{c}} b_{H_{c}} (1 + \bar{λ}) {∥{\tilde{κ}}_{c}^{}∥}_{F} \leq l_{W_{c}}^{2} b_{W_{c}}^{2} b_{H_{c}}^{2} {(1 + \bar{λ})}^{2} / l_{κ_{c}} \end{matrix}

(32)

In what follows,

{\dot{V}}_{1}

is further deduced as

\begin{matrix} {\dot{V}}_{1} & \leq - z_{2}^{T} k_{1} z_{2} + z_{2}^{T} ℜ z_{3} - l_{W_{c}} k_{c} tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) + b_{V_{c}} {∥{\tilde{W}}_{c}∥}_{F} \\ - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + l_{W_{c}}^{2} b_{W_{c}}^{2} b_{H_{c}}^{2} {(1 + \bar{λ})}^{2} / l_{κ_{c}} + \frac{l_{κ_{c}}}{2} {∥κ_{c}∥}_{F}^{2} \\ \leq - λ_{min} (k_{1}) {∥z_{2}∥}^{2} + z_{2}^{T} ℜ z_{3} - (l_{W_{c}} k_{c} - l_{c}) tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) \\ - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + b_{V_{1}} \end{matrix}

(33)

where

\begin{matrix} - l_{c} {∥{\tilde{W}}_{c}∥}_{F}^{2} + b_{V_{c}} {∥{\tilde{W}}_{c}∥}_{F} \leq b_{V_{c}}^{2} / 2 l_{c} \\ b_{V_{1}} = b_{V_{c}}^{2} / 2 l_{c} + l_{W_{c}}^{2} b_{W_{c}}^{2} b_{H_{c}}^{2} {(1 + \bar{λ})}^{2} / l_{κ_{c}} + \frac{l_{κ_{c}}}{2} {∥κ_{c}∥}_{F}^{2} \end{matrix}

4.3. Attitude Angular Rate Controller Design Resorting to Action NN

In this part, the final control law is designed for

τ

to drive

z_{3} \to 0

, where the tracking error

z_{3}

is defined by

\begin{matrix} z_{3} = ω - ω_{c} \end{matrix}

(34)

where

ω_{c}

and

{\dot{ω}}_{c}

are available for controller.

By taking (34) and (13) into consideration, we have

\begin{matrix} J {\dot{z}}_{3} = - ω^{\times} J ω + Λ + Υ τ + D - J {\dot{ω}}_{c} \end{matrix}

(35)

where the complex disturbance is defined as

D = F ε (τ) + \bar{u} + d

.

Uncertainties of of system: Then, in order to facilitate the subsequent derivation, we define that $R = - ω^{\times} J ω - J {\dot{ω}}_{c}$ , drawing support from the operation rule $Δ J x = L (x) θ$ [20], where $θ = {[J_{11}, J_{12}, J_{13}, J_{22}, J_{23}, J_{33}]}^{T}$ . In this work, R can be made by can $R = ℑ (\cdot) θ$ , where $ℑ (\cdot) \in R^{3 \times 6}$ is delivered as follows:

$\begin{matrix} ℑ (\cdot) = - ω^{\times} L (ω) - L ({\dot{ω}}_{c}) \end{matrix}$

(36)

Furthermore, by the aid of synthesized adaptive control technology, the following expression is made to reduce the calculated load problem caused by too many estimated variables ( $θ$ ). The details are shown as follows:

$\begin{matrix} ∥ℑ (\cdot) θ∥ \leq Ψ ℏ \end{matrix}$

(37)

where $Ψ = {∥ℑ (\cdot)∥}_{F}, ℏ = ∥θ∥$ . The estimation of $\tilde{ℏ}$ is defined as $\tilde{ℏ} = ℏ - \hat{ℏ}$ .
Eccentric moment and Disturbance: For unknown disturbance and eccentric moment, an action RBFNN is established to approximate it and we have

$\begin{matrix} f = Λ + D = W_{a}^{*} Φ_{a} (x_{a} (t)) + O_{a} (x_{a} (t)) \end{matrix}$

(38)

where $W_{a}^{*}$ stands for the ideal weight. $x_{a}$ represents the input vector for actor NN. $ϕ_{a} (x_{a})$ denotes the Gaussian basis function. As for $ε_{a} (x_{a})$ , it is assumed that $∥ε_{a} (x_{a})∥ < b_{ε_{a}}$ , where $b_{ε_{a}}$ is a positive constant. And then, we have the real time estimation as

$\begin{matrix} \hat{f} = {\hat{W}}_{a}^{T} ϕ_{a} (x_{a}) \end{matrix}$

(39)

Define the weight error for action NN:

$\begin{matrix} {\tilde{W}}_{a} = {\hat{W}}_{a} - W_{a}^{*} \end{matrix}$

(40)
Uncertainties of inertial matrix: In order to conquer the challenge caused by time-varying coefficient matrix $Υ$ , as shown in (13), which is caused by actuator fault, input saturation combined with $Δ J$ , recalling Nussbaum-type function, the final control law and adaptive law are proposed as follows:

$\begin{matrix} τ & = N_{a} (χ) ϖ_{a} \\ ϖ_{a} & = - k_{a} z_{3} - Ψ \hat{ℏ} T a n h (\frac{z_{3}}{ϑ}) - ℜ z_{2} - {\hat{W}}_{a}^{T} ϕ_{a} (x_{a}) \\ \dot{χ} & = - k_{N} diag (z_{3}) ϖ_{a} \\ \dot{\hat{ℏ}} & = η Ψ z_{3}^{T} Tanh (z_{3} / ϑ) - η l_{ℏ} \hat{ℏ} \end{matrix}$

(41)

where $ϑ = k_{ϑ} / (1 + {∥ℑ (\cdot)∥}_{\infty})$ with $ϑ$ and $k_{ϑ}$ being design parameters.
Action NN design: There are two objectives for action NN design under the uncertainties caused by $Δ J$ and actuator fault. One is to make $z_{3}$ follow $ω_{c}$ well. The other one is to make $Γ (t)$ minimized to its desired value $Γ_{d} = 0$ . As a consequence, the following action error is defined as

$\begin{matrix} e_{a} = z_{3} + \hat{Γ} (t) - Γ_{d} \end{matrix}$

(42)

And then, the corresponding update law of ${\hat{W}}_{a}$ is designed as

$\begin{matrix} \dot{\hat{W_{a}}} & = Λ_{a} Φ_{a} (x_{a}) {[z_{3} + {\hat{W}}_{c}^{T} Φ_{c} (x_{c})]}^{T} - k_{a} Λ_{a} {\hat{W}}_{a} \end{matrix}$

(43)

where $Λ_{a} = diag (Λ_{a 1}, Λ_{a 2}, Λ_{a 3})$ is a learning rate matrix to be designed, which is a positive definite matrix.

5. Stability Analysis

In this section, the main result of this work is summarized as the following theorem.

Theorem 1.

Take a quadrotor UAV attitude tracking system depicted by (1)–(4), suffers from the uncertainty caused by

Δ J

and its corresponding system uncertainties and eccentric moment, with actuator faults and input saturation in consideration. When the Assumptions 1–3 hold, an adaptive fault-tolerant control strategy for attitude tracking to a quadrotor UAV is proposed in this work, consisted of (25) and (41). The following two targets are achieved, despite the presence of uncertainty of

Δ J

combined with its inducing system uncertainties and eccentric moment, and actuator fault and input saturation:

Q_{1}

: The system output γ tracks the desired trajectory

γ_{c}

, while the steady-state behavioral boundedness of the attitude angles (

ϕ, θ

, and ψ) is preserved.

Q_{2}

: All signals in the closed-loop systems are bounded.

Proof.

The following Lyapunov function candidate is selected as:

\begin{matrix} V_{2} & = V_{1} + \frac{1}{2} z_{3}^{T} J z_{3} + \frac{1}{2 η} {\tilde{ℏ}}^{T} \tilde{ℏ} + \frac{1}{2} tr ({\tilde{W}}_{a}^{T} Λ_{a}^{- 1} {\tilde{W}}_{a}) \end{matrix}

(44)

□

Furthermore, for the first item of (44) with the properties

tr (A^{T} A) = {∥A∥}_{F}^{2}

,

{∥a b^{T}∥}_{F} \leq ∥a∥ ∥b∥

and

∥A a∥ \leq {∥A∥}_{F} ∥a∥

, one has:

\begin{matrix} {\dot{V}}_{2} & = {\dot{V}}_{1} + z_{3}^{T} J {\dot{z}}_{3} - \frac{1}{η} {\tilde{ℏ}}^{T} \dot{\hat{ℏ}} + tr ({\tilde{W}}_{a}^{T} Λ_{a}^{- 1} {\dot{\hat{W}}}_{a}) \\ \leq (z_{3}^{T} P Ψ ℏ - \frac{1}{η} {\tilde{ℏ}}^{T} \dot{\hat{ℏ}}) + z_{3}^{T} (W_{a}^{*} ϕ_{a} + O_{a}) + tr ({\tilde{W}}_{a}^{T} Λ_{a}^{- 1} {\dot{\hat{W}}}_{a}) \\ + z_{3}^{T} Υ τ - λ_{min} (k_{1}) {∥z_{2}∥}^{2} + z_{2}^{T} ℜ z_{3} - (l_{W_{c}} k_{c}) \\ - l_{c}) tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c} - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + b_{V_{1}} \end{matrix}

(45)

After substituting the control law and its adaptive laws (41) into (45), it follows:

\begin{matrix} {\dot{V}}_{2} & = {\dot{V}}_{1} + z_{3}^{T} J {\dot{z}}_{3} - \frac{1}{η} {\tilde{ℏ}}^{T} \dot{\hat{ℏ}} + tr ({\tilde{W}}_{a}^{T} Λ_{a}^{- 1} {\dot{\hat{W}}}_{a}) \\ \leq - λ_{min} (k_{1}) {∥z_{2}∥}^{2} - λ_{min} (k_{a}) {∥z_{3}∥}^{2} - (l_{W_{c}} k_{c} - l_{c}) tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) \\ - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + \sum_{i = 1}^{3} \frac{1}{k_{N i}} (- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i} \\ + (z_{3}^{T} P Ψ ℏ - \frac{1}{η} {\tilde{ℏ}}^{T} \dot{\hat{ℏ}}) - z_{3}^{T} Ψ \hat{ℏ} tanh (\frac{z_{3}}{ϑ}) \\ + z_{3}^{T} (W_{a}^{*} ϕ_{a} + O_{a}) + tr ({\tilde{W}}_{a}^{T} Λ_{a}^{- 1} {\dot{\hat{W}}}_{a}) \\ - z_{3}^{T} {\hat{W}}_{a}^{T} ϕ_{a} (x_{a}) + b_{V_{1}} \end{matrix}

(46)

And then, it is further deduced that

\begin{matrix} {\dot{V}}_{2} & \leq - λ_{min} (k_{1}) {∥z_{2}∥}^{2} - λ_{min} (k_{a}) {∥z_{3}∥}^{2} - (l_{W_{c}} k_{c} - l_{c}) tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) \\ - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} + \sum_{i = 1}^{3} \frac{1}{k_{N i}} (- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i} + b_{V_{1}} \\ + ℏ^{T} Ψ \sum_{i = 1}^{3} (|z_{3 i}| (1 - tanh (\frac{|z_{3 i}|}{ϑ}))) - \frac{l_{ℏ}}{2} {∥\tilde{ℏ}∥}_{F}^{2} + \frac{l_{ℏ}}{2} {∥ℏ∥}_{F}^{2} \\ + b_{O_{a}} ∥z_{3}∥ + \frac{k_{a} b_{W_{a}}^{2}}{2} - \frac{k_{a}}{2} tr ({\tilde{W}}_{a}^{T} {\tilde{W}}_{a}) \\ + b_{Φ_{a}} b_{Φ_{c}} {∥{\tilde{W}}_{a}^{}∥}_{F} ({∥{\tilde{W}}_{c}^{}∥}_{F} + b_{W_{c}}) \end{matrix}

(47)

Furthermore, it is obtained that

\begin{matrix} {\dot{V}}_{2} & \leq - λ_{min} (k_{1}) {∥z_{2}∥}^{2} - λ_{\min} (k_{a 2}) {∥z_{3}∥}^{2} - \frac{l_{κ_{c}}}{4} {∥{\tilde{κ}}_{c}∥}_{F}^{2} \\ - (l_{W_{c}} k_{c} - \frac{b_{Φ_{a}} b_{Φ_{c}}}{2} - l_{c}) tr ({\tilde{W}}_{c}^{T} {\tilde{W}}_{c}) - \frac{l_{ℏ}}{2} {∥\tilde{ℏ}∥}_{F}^{2} \\ - (\frac{k_{a}}{2} - \frac{b_{Φ_{a}} b_{Φ_{c}}}{2} - l_{a}) tr ({\tilde{W}}_{a}^{T} {\tilde{W}}_{a}) \\ + \sum_{i = 1}^{3} \frac{1}{k_{N i}} (- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i} + b_{V_{1}} + b_{V_{2}} \end{matrix}

(48)

where

b_{V_{2}} = \frac{b_{O_{a}}^{2}}{4 λ_{min} (k_{a 1})} + \frac{k_{a} b_{W_{a}}^{2}}{2} + \frac{b_{Φ_{a}}^{2} b_{Φ_{c}}^{2} b_{W_{c}}^{2}}{4 l_{a}} + \frac{l_{ℏ}}{2} {∥ℏ∥}_{F}^{2} + 3 α k_{ϑ} ∥θ∥

. The following inequalities are used:

\begin{matrix} - l_{a} {∥{\tilde{W}}_{a}∥}_{F}^{2} & + b_{Φ_{a}} b_{Φ_{c}} b_{W_{c}} {∥{\tilde{W}}_{a}∥}_{F} \leq \frac{{(b_{Φ_{a}} b_{Φ_{c}} b_{W_{c}})}^{2}}{4 l_{a}} \\ - λ_{min} (k_{a 1}) {∥z_{3}∥}^{2} & + b_{O_{a}} ∥z_{3}∥ \leq \frac{b_{O_{a}}^{2}}{4 λ_{min} (k_{a 1})} \\ l_{ℏ} {\tilde{ℏ}}^{T} \hat{ℏ} & \leq - \frac{l_{ℏ}}{2} {∥\tilde{ℏ}∥}_{F}^{2} + \frac{l_{ℏ}}{2} {∥ℏ∥}_{F}^{2} \end{matrix}

(49)

Finally, we have

\begin{matrix} {\dot{V}}_{2} \leq - c V_{2} + b_{V} + \sum_{i = 1}^{3} \frac{1}{k_{N i}} (- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i} \end{matrix}

(50)

where

\begin{matrix} \begin{matrix} c = min {2 λ_{min} (k_{1}), λ_{min} (k_{a 2}), \frac{2 l_{W_{c}} k_{c} - b_{Φ_{a}} b_{Φ_{c}} - 2 l_{c}}{λ_{min} (Λ_{c}^{- 1})}, \\ \frac{k_{a} - b_{Φ_{a}} b_{Φ_{c}} - 2 l_{a})}{λ_{min} (Λ_{a}^{- 1})}} \end{matrix} \\ b_{V} = b_{V_{1}} + b_{V_{2}} \end{matrix}

Let us define

ν = b_{V} / c

. Then, multiplying both sides of (50) by

e^{c t}

, and integrating the resulting inequality over

[0, t]

, we have (in the set Z1)

\begin{matrix} V_{2} (t) & \leq ν + V_{2} (t) e^{- c t} + e^{- c t} \\ \times \sum_{i = 1}^{3} \frac{1}{k_{N i}} \int_{0}^{t} (- Υ_{i} N (χ_{i} (λ)) + 1) {\dot{χ}}_{i} (λ) e^{c t} d λ \end{matrix}

(51)

Since

Υ_{i}, i = 1, 2, 3

are constrained to the closed interval

[ψ^{-}, ψ^{+}]

, then from (51) and Lemma 1, it is obtained that

χ_{i} (t), i = 1, 2, 3, \sum_{i = 1}^{3} \int_{0}^{t} (- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i} d λ, V_{2} (t)

are bounded on

[0, t_{f})

. Then, from the positive definition of

V_{2} (t)

, it can be shown that

Γ, z_{2}, z_{3}, {\tilde{κ}}_{c}, \tilde{ℏ}, {\tilde{W}}_{c}, {\tilde{W}}_{a}

are also bounded on

[0, t_{f})

. Based on the above arguments, all closed-loop signals are bounded.

For convenience of analysis, we denote

c_{B}

as the upper bound of

\frac{1}{k_{N i}} \sum_{i = 1}^{3} \int_{0}^{t} |(- Υ_{i} N (χ_{i}) + 1) {\dot{χ}}_{i}| d λ,

then, after straightforward algebraic manipulations, (51) reduces to

\begin{matrix} V_{2} (t) \leq ν + c_{B} + V_{2} (0) e^{- c t} \end{matrix}

(52)

where when

t = 0

, we have

\begin{matrix} V_{2} (0) & = \frac{1}{2} z_{2}^{T} (0) z_{2} (0) + \frac{ℓ_{W_{c}}}{2} tr ({\tilde{W}}_{c}^{T} (0) Λ_{c}^{- 1} {\tilde{W}}_{c} (0)) + \frac{1}{2 η_{κ_{c}}} {\tilde{κ}}_{c}^{T} (0) {\tilde{κ}}_{c} (0) + \frac{1}{2} z_{3}^{T} (0) J z_{3} (0) \\ + \frac{1}{2 η} {\tilde{ℏ}}^{T} (0) \tilde{ℏ} (0) + \frac{1}{2} tr ({\tilde{W}}_{a}^{T} (0) Λ_{a}^{- 1} {\tilde{W}}_{a} (0)) \end{matrix}

By invoking the boundedness of

\frac{ℓ_{W_{c}}}{2} tr ({\tilde{W}}_{c}^{T} Λ_{c}^{- 1} {\tilde{W}}_{c})

and the relationship of

Γ (t)

(17) and

\hat{Γ} (t)

(22), one may notice that

z_{1} (t)

remains in the designed constrained area

z_{1}^{T} z_{1} = \sum_{i = 1}^{3} \frac{γ_{i}^{2}}{k_{b i}^{2}} < c_{κ}

for all time. Then, with respect to the tracking error, it can be easily verified that

\frac{1}{2} z_{2}^{T} z_{2} \leq V_{2} (t) \leq \bar{V}

, where

\bar{V} = ν + c_{B} + V_{2} (0)

, which would lead to [20,23]:

\begin{matrix} ∥z_{2}∥ \leq \sqrt{2 \bar{V}} \end{matrix}

(53)

under the

min Γ (t)

. Furthermore, based on

λ_{min} (J) {∥z_{3}∥}^{2} \leq z_{3}^{T} J z_{3} \leq 2 \bar{V}

, we have that

∥z_{3}∥ \leq \sqrt{\frac{2 \bar{V}}{λ_{min} (J)}}

.

In what follows, with the properties

tr (A^{T} A) = {∥A∥}_{F}^{2}

,

{∥a b^{T}∥}_{F} \leq ∥a∥ ∥b∥

and

∥A a∥ \leq {∥A∥}_{F} ∥a∥

, taking the

∥{\tilde{W}}_{c}∥ \leq \sqrt{\frac{2 \bar{V}}{ℓ_{W_{c}} λ_{min} (Λ_{c}^{- 1})}}

, for example, it is deduced by the following process:

\begin{matrix} \frac{ℓ_{W_{c}}}{2} tr ({\tilde{W}}_{c}^{T} Λ_{c}^{- 1} {\tilde{W}}_{c}) \leq \frac{ℓ_{W_{c}}}{2} ∥{\tilde{W}}_{c}^{T} Λ_{c}^{- 1} {\tilde{W}}_{c}∥ \leq \frac{ℓ_{W_{c}}}{2} λ_{max} (Λ_{c}^{- 1}) {∥{\tilde{W}}_{c}∥}^{2} \leq \bar{V} \end{matrix}

(54)

Similarly, we have that

\begin{matrix} ∥{\tilde{W}}_{a}∥ \leq \sqrt{\frac{2 \bar{V}}{λ_{min} (Λ_{a}^{- 1})}}, ∥\tilde{ℏ}∥ \leq \sqrt{2 η \bar{V}}, ∥{\tilde{κ}}_{c}∥ \leq \sqrt{2 η_{κ_{c}} \bar{V}} \end{matrix}

(55)

This completes the proof of Theorem 1, and the achievement of R1 and R2 are realized.

6. Simulation Studies

Considering that a quadrotor UAV with the speed of

3 m / s

and an altitude of

10 m

, the initial attitude vector is

γ = [0.017, 0.026, 0.017] rad

, and the angular rate is

ω = {[0, 0, 0]}^{T} rad / s

. The reference signals are set that

α_{c} = 0.087 rad

during 1–8 s,

α_{c} = 0.02 rad

during 8–20 s and

α_{c} = 0.05 rad

during

t > 20

s, besides,

β

is always

0 rad

and

μ

is keeping

0.035 rad

. In addition, the loss of efficiency fault of actuator is set to

λ = d i a g {0.2, 0.17, 0.2}

as the 5th second. The external disturbances of a quadrotor UAV is described as follows:

d_{f 1} = 2.5 sin (3 t + 0.2) N \cdot m, d_{f 2} = 5.5 sin (4 t - 0.2) N \cdot m

,

d_{f 3} = 5.5 sin (4 t + 0.2) N \cdot m

. The design parameters of controller of a quadrotor UAV are that

k_{1} = 1.23 * I_{3}

,

ℓ_{Γ} = 0.15

.

c_{κ} = 0.23

.

λ = 0.97

.

k_{a} = 1.23

,

k_{N} = 0.8

,

η = 0.5

.

Λ_{c} = η_{κ} = 0.75

,

Λ_{a} = 0.82

. In addition, the initial center of the RBFNN is

\begin{matrix} c = [\begin{matrix} - 1.5 & - 1 & - 0.5 & 0 & 0.5 & 1 & 1.5 \\ - 1.5 & - 1 & - 0.5 & 0 & 0.5 & 1 & 1.5 \\ - 1.5 & - 1 & - 0.5 & 0 & 0.5 & 1 & 1.5 \end{matrix}] \end{matrix}

and the initial width is selected as

b = 10 {[1, 1, 1, 1, 1, 1, 1]}^{T}

.

In order to study the influence of uncertainty of inertial matrix caused by

Δ J

on the motion of a quadrotor UAV and validate the effectiveness of designed fault-tolerant controller, the simulation is carried out in two cases. In case 1, for analyzing the effects of uncertainty of inertial matrix on the attitude tracking performacne of quadrotor UAV, simulation conditions are set that the offsets of mass of center have different values along the x-axis. In case 2, under the similar simulation conditions, compared with the tracking effects of FTC strategy consisted of sliding mode control combined with nonlinear disturbance observer, the simulation results demonstrate the effectiveness of the designed state constraints strategy.

Case 1: Under the same fault-tolerant control conditions, the offset of mass of center just alone the x-axis is set that

\bar{ρ} = {[Δ x, Δ y, Δ z]}^{T} = {[- 4, 0, 0]}^{T} c m

in the 10th second, and the corresponding simulation curves are represented by

{(\cdot)}_{3}

. Similarly,

{(\cdot)}_{4}

denotes the conditions that

{[Δ x, Δ y, Δ z]}^{T} = {[- 1, 0, 0]}^{T} c m

.

{(\cdot)}_{5}

and

{(\cdot)}_{6}

stand for the conditions that

{[Δ x, Δ y, Δ z]}^{T} = {[3, 0, 0]}^{T} c m

and

{[Δ x, Δ y, Δ z]}^{T} = {[4.5, 0, 0]}^{T} c m

, respectively. The simulation results are shown as follows.

Based on the analysis with respect to the influence of uncertainty of inertial matrix caused by

Δ J

on the motion of quadrotor UAV, it covers three aspects: system uncertainties, eccentric moment, and variation of inertial matrix

J^{*} + Δ J

. For the variation of

J^{*}

, by taking the tracking curves of attitude angles depicted in Figure 3, Figure 4 and Figure 5 for example, tracking curves of

α_{c o m}

,

β_{c o m}

, and

μ_{c o m}

show a trend of divergence after the 10th second, which displays that the controller based on sliding mode control technology that rests on the inverse matrix of inertia matrix

{(J^{*})}^{- 1}

can not handle the variation of inertial matrix J due to

Δ J

. The FTC strategy of this work without the exact knowledge of

{(J^{*} + Δ J)}^{- 1}

displays good tracking effects, as shown in

{(\cdot)}_{3}

of Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. As a result, the simulation results reveal that the FTC strategy of this work is effective. Furthermore, in order to investigate the

Δ J

on the attitude tracking of quadrotor UAV, some other simulations are made, as shown in the attitude tracking curves of

{(\cdot)}_{4}

,

{(\cdot)}_{5}

, and

{(\cdot)}_{6}

in Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8.

With the center of mass moving away from the original centroid along the positive direction of x-axis, the tracking effects of

α_{i}, β_{i}, μ_{i}, i = 3, \dots, 6

of Figure 3, Figure 4 and Figure 5 are gradually declining. When the center of mass is further from the aerodynamic center (from

{(\cdot)}_{3}

to

{(\cdot)}_{6}

), the larger oscillation of the tracking curves occur, as shown in tracking curves of

{(\cdot)}_{α}

and other response curves of Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. Furthermore, it would be inferred that the further center of mass leaving from the aerodynamic center leads to the larger extra control moment due to the longer arm of force. The greater offset of center of mass can result in the greater

Δ J

inducing system uncertainties and eccentric moment, etc., leading to huge challenges for the designed FTC controller.

Case 2: The state constrained fault-tolerant control method of quadrotor UAV based on reinforcement learning is simulated, and the rigid body attitude tracking control effect of quadrotor UAV affected by the unknown

Δ J

and the safety state constraint can be obtained, as shown in Figure 9 and Figure 10. Figure 9 and Figure 10 show the desired attitude and attitude angle tracking curve of a quadrotor UAV from top to bottom, as well as the comparison group (without state safety constraints). The three figures in Figure 9 and Figure 10 are the attack angle tracking curve, pitch angle tracking curve, and roll angle tracking curve of the aircraft from top to bottom. From the corresponding reference attitude command and the actual tracking effect, it can be seen that even when there is an unknown

Δ J

caused by

\bar{ρ}

, the control algorithm designed in this paper can still maintain a good attitude tracking and maintenance effect. However, from the perspective of the control group, when an unknown

Δ J

caused by

\bar{ρ}

occurs, an abnormal eccentric moment is generated. When the system state is unconstrained, the influence of the eccentric moment will be superimposed, until 22 s, the fault-tolerant control of the system will fail, as shown in Figure 9 and Figure 10. Under the state constrained fault-tolerant control method of quadrotor UAV based on reinforcement learning, even if the boundary is crossed, it can be pulled back to the boundary by means of a designed penalty function.

7. Conclusions

In this paper, a fault-tolerant controller considering unknown inertial matrix, actuator fault, and input saturation is proposed. Contrary to the usual log-type BLF literature, a novel state-constraint mechanism is proposed, which can ensure that the system states maintain in the designed constraints. Even if the boundary is crossed, it can be pulled back to the boundary by means of a designed penalty function with reinforcement learning. Meanwhile, in order to inhibit the algorithm oscillation caused by maximum operation of

{\hat{κ}}_{c}

, the immediate reward

{\hat{κ}}_{c}

is obtained with adaptive control law. Furthermore, based on the backstepping fault-tolerant control framework, the eccentric torque and actuator partial failure faults suffered by a quadrotor UAV are input into the backstepping fault-tolerant control framework through Nussbaum-type function combined with adaptive control method using the norm bound method to achieve the bounded stability. In future work, we will focus on the study of interpretable intelligent fault-tolerant controller [24] of quadrotor UAV with the structural faults caused by changeable center of mass, which means that the structure and the magnitude of the elements of inertial matrix are both uncertain.

Author Contributions

Methodology and writing—original draft preparation, S.Y.; writing—review and editing, Z.Z. and Y.L.; validation, H.S. and Q.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (NSFC) (No. 61890964, No. 62127813), Changchun Science and Technology Development Plan (No. 21ZY36).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Acknowledgments

Thank each author for his efforts and innovative suggestions in the process of writing the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Avram, R.C.; Zhang, X.; Muse, J. Nonlinear adaptive fault-tolerant quadrotor altitude and attitude tracking with multiple actuator faults. IEEE Trans. Control. Syst. Technol. 2017, 26, 701–707. [Google Scholar] [CrossRef]
Han, S.K.; Hwang, S.; Joo, Y.H. Interval type-2 fuzzy-model-based fault-tolerant sliding mode tracking control of a quadrotor UAV under actuator saturation. IET Control. Theory Appl. 2021, 20, 3663–3675. [Google Scholar]
Alarcon, C.; Jamett, M. Autonomous multirrotor design and simulation for search and rescue missions. In Proceedings of the 2018 IEEE International Conference on Automation/XXIII Congress of the Chilean Association of Automatic Control (ICA-ACCA), Concepcion, Chile, 17–19 October 2018; pp. 1–6. [Google Scholar]
Yang, L.; Li, B.; Li, W.; Brand, H.; Jiang, B.; Xiao, J. Concrete defects inspection and 3D mapping using CityFlyer quadrotor robot. IEEE/CAA J. Autom. Sin. 2020, 7, 991–1002. [Google Scholar] [CrossRef]
Zhong, S.; Chirarattananon, P. Direct visual-inertial ego-motion estimation via iterated extended kalman filter. IEEE Robot. Autom. Lett. 2020, 5, 1476–1483. [Google Scholar] [CrossRef]
Chen, H.; Jiang, B.; Ding, S.X.; Huang, B. Data-driven fault diagnosis for traction systems in high-speed trains: A survey, challenges, and perspectives. IEEE Trans. Intell. Transp. Syst. 2022, 23, 1700–1716. [Google Scholar] [CrossRef]
Zhang, P. Modeling and simulation of attitude control of quadrotor aircraft. Electr. Mach. Control. Appl. 2019, 46, 70–74. [Google Scholar]
Hu, W.; Cao, R.R. An improved PSO algorithm of quadrotor ADRC attitude control. Electron. Control. 2019, 26, 12–16. [Google Scholar]
Jia, Z.Y.; Liu, Z.L. Quadrotor attitude control algorithm based on reinforcement learning. J. Chin. Comput. Syst. 2021, 10, 2074–2078. [Google Scholar]
Chen, Z.; Peng, Z.; Zhang, F. Attitude control of coaxial tri-rotor UAV based on linear extended state observer. In Proceedings of the 26th Chinese Control and Decision Conference (2014 CCDC), Changsha, China, 31 May–2 June 2014; pp. 4204–4209. [Google Scholar]
Labbadi, M.; Cherkaoui, M. Robust adaptive nonsingular fast terminal sliding-mode tracking control for an uncertain quadrotor UAV subjected to disturbances. ISA Trans. 2020, 99, 290–304. [Google Scholar] [CrossRef]
Wu, W.Y.; Zheng, B.C.; Li, H. Attitude controller for quadrotor via active disturbance rejection control and sliding mode control. Electron. Opt. Control. 2022, 29, 93–98. [Google Scholar]
Wen, S.; Chen, M.Z.; Zeng, Z.; Huang, T.; Li, C. Adaptive neural-fuzzy sliding-mode fault-tolerant control for uncertain nonlinear systems. IEEE Trans. Syst. Man. Cybern. Syst. 2017, 47, 2268–2278. [Google Scholar] [CrossRef]
Ductian, N.; David, S.; Lahcen, S. Robust self-scheduled fault tolerant control of a quadrotor UAV. IFAC-PapersOaLiae 2017, 50, 5761–5767. [Google Scholar]
Nian, X.; Chen, W.; Chu, X.; Xu, Z. Robust adaptive fault estimation and fault tolerant control for quadrotor attitude systems. Int. J. Control. 2020, 93, 725–737. [Google Scholar] [CrossRef]
Zhu, F.L.; Hou, Y.J.; Zhao, X.D. Observer-based fault-tolerant controller design for nonlinear switched systems. Control. Decis. 2017, 32, 1855–1863. [Google Scholar]
Rudin, K.; Ducard, G.J.J.; Siegwart, R.Y. Active fault toler- ant control with imperfect fault detection information: Applications to UAVs. IEEE Trans. Aerosp. Electron. Syst. 2019, 56, 2792–2805. [Google Scholar] [CrossRef]
Liu, K.; Wang, R.; Wang, X.; Wang, X. Anti-saturation adaptive finite- time neural network based fault-tolerant tracking control for a quadrotor UAV with external disturbances. Aerosp. Sci. Technol. 2021, 115, 106790. [Google Scholar] [CrossRef]
Chen, H.; Luo, H.; Huang, B.; Jiang, B.; Kaynak, O. Transfer learning-motivated intelligent fault diagnosis designs: A survey, insights, and perspectives. TechrXiv 2022. [Google Scholar] [CrossRef]
Hu, Q.; Shao, X.; Guo, L. Adaptive fault-Tolerant attitude tracking control of spacecraft with prescribed performance. IEEE/ASME Trans. Mechatron. 2018, 23, 331–341. [Google Scholar] [CrossRef]
Nguyen, N.; Krishnakumar, K.; Kaneshige, J.; Nespeca, P. Flight dynamics and hybrid adaptive control of damaged aircraft. J. Guid. Control. Dyn. 2008, 31, 751–764. [Google Scholar] [CrossRef]
Ge, S.S.; Wang, J. Robust adaptive tracking for time-varying uncertain nonlinear systemswith unknown control coefficients. IEEE Trans. Autom. Control. 2003, 48, 1463–1469. [Google Scholar]
Guo, X.; Yan, W.; Cui, R. Integral reinforcement learning-based adaptive NN control for continuous-time nonlinear MIMO systems with unknown control directions. IEEE Trans. Syst. Man, Cybern. Syst. 2020, 50, 4068–4077. [Google Scholar] [CrossRef]
Chen, H.; Liu, Z.; Alippi, C.; Huang, B.; Liu, D. Explainable intelligent fault diagnosis for nonlinear dynamic systems: From unsupervised to supervised learning. IEEE Trans. Neural Netw. Learn. Syst. 2022; early access. [Google Scholar] [CrossRef] [PubMed]
Zhao, G.L.; Gao, R.S.; Chen, J.N. Adaptive prescribed performance control of quadrotor with unknown actuator fault. Control. Decis. 2021, 36, 2103–2112. [Google Scholar]

Figure 1. Three-dimensional view of quadrotor attitude definition.

Figure 2. Adaptive fault-tolerant control diagram of quadrotor UAVs against uncertainties of inertial matrices and state constraints.

Figure 3. Tracking effects caused by the attack angle

α

.

Figure 3. Tracking effects caused by the attack angle

α

.

Figure 4. Tracking effects caused by the bank angle

μ

.

Figure 4. Tracking effects caused by the bank angle

μ

.

Figure 5. Tracking effects caused by the sideslip angle

β

.

Figure 5. Tracking effects caused by the sideslip angle

β

.

Figure 6. Control effects of pitch moment on the pitch motion of quadrotor UAVs

δ_{e}

.

Figure 6. Control effects of pitch moment on the pitch motion of quadrotor UAVs

δ_{e}

.

Figure 7. Control effects of rolling moment on the rolling motion of quadrotor UAVs

δ_{a}

.

Figure 7. Control effects of rolling moment on the rolling motion of quadrotor UAVs

δ_{a}

.

Figure 8. Control effects of yaw moment on the yaw motion of quadrotor UAVs

δ_{r}

.

Figure 8. Control effects of yaw moment on the yaw motion of quadrotor UAVs

δ_{r}

.

Figure 9. Tracking effects caused by the attack angle

α

based on reinforcement learning state constraint

α, β

.

Figure 9. Tracking effects caused by the attack angle

α

based on reinforcement learning state constraint

α, β

.

Figure 10. Tracking effects caused by the bank angle

β

based on reinforcement learning state constraint

μ

.

Figure 10. Tracking effects caused by the bank angle

β

based on reinforcement learning state constraint

μ

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, S.; Zou, Z.; Li, Y.; Shi, H.; Fu, Q. Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints. Drones 2023, 7, 107. https://doi.org/10.3390/drones7020107

AMA Style

Yang S, Zou Z, Li Y, Shi H, Fu Q. Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints. Drones. 2023; 7(2):107. https://doi.org/10.3390/drones7020107

Chicago/Turabian Style

Yang, Shuai, Zhihui Zou, Yingchao Li, Haodong Shi, and Qiang Fu. 2023. "Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints" Drones 7, no. 2: 107. https://doi.org/10.3390/drones7020107

APA Style

Yang, S., Zou, Z., Li, Y., Shi, H., & Fu, Q. (2023). Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints. Drones, 7(2), 107. https://doi.org/10.3390/drones7020107

Article Menu

Adaptive Fault-Tolerant Tracking Control of Quadrotor UAVs against Uncertainties of Inertial Matrices and State Constraints

Abstract

1. Introduction

2. Problem Formulation

2.1. Attitude Dynamics of Quadrotor UAV

2.1.1. Attitude Angle Dynamic Equation

2.1.2. Attitude Angular Rate Dynamics

2.2. Actuator Fault and System Input Saturation

2.3. Problem Statement

3. Preliminary Knowledge

4. Integral Reinforcement Learning-Based Adaptive Neural Network Fault-Tolerant Control

4.1. State Constraints Penalty Function by Critic NN

4.2. Attitude Angle Controller Design with State Constraints by Critic NN

4.3. Attitude Angular Rate Controller Design Resorting to Action NN

5. Stability Analysis

6. Simulation Studies

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI