Article

Composite Learning-Based Inverse Optimal Fault-Tolerant Control for Hierarchy-Structured Unmanned Helicopters

1 College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
2 National Key Laboratory of Helicopter Dynamics, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
3 School of Applied Mathematics, Nanjing University of Finance and Economics, Nanjing 210023, China
* Author to whom correspondence should be addressed.
Drones 2025, 9(6), 391; https://doi.org/10.3390/drones9060391
Submission received: 25 March 2025 / Revised: 13 May 2025 / Accepted: 20 May 2025 / Published: 23 May 2025

Abstract

This article investigates the inverse optimal fault-tolerant formation-containment control problem for a group of unmanned helicopters, where the leaders form a desired formation pattern under the guidance of a virtual leader while the followers move toward the convex hull spanned by the leaders. To facilitate control design and stability analysis, each helicopter's dynamics are separated into an outer-loop (position) subsystem and an inner-loop (attitude) subsystem by exploiting their multi-time-scale characteristics. Next, a serial-parallel estimation model, designed to account for the prediction error, is developed. On this foundation, the composite updating law for the network weights is derived. Using these intelligent approximations, a fault estimation observer is constructed. The estimated fault information is further incorporated into an inverse optimal fault-tolerant control framework that avoids solving either the Hamilton–Jacobi–Bellman or the Hamilton–Jacobi–Isaacs equation. Finally, simulation results are presented to demonstrate the superior control performance and accuracy of the proposed method.

1. Introduction

Unmanned helicopters (UHs) have been extensively applied in various surveillance and inspection tasks due to their exceptional abilities in hovering, vertical take-off and landing, and accessing constrained environments where other aerial vehicles may be unable to operate. Nevertheless, UHs are complex, multivariable, nonlinear, and under-actuated systems, often characterized by strong coupling effects and uncertain or unidentifiable aerodynamic parameters [1,2]. To address these inherent limitations, control designs for UHs typically adopt a cascaded structure consisting of an outer-loop subsystem and an inner-loop subsystem (i.e., translational and rotational). The authors of [3,4,5] discussed trajectory tracking control methods for a single UH with unknown uncertainties and external disturbances. As mission complexity and environmental challenges escalate, employing helicopter formations rather than individual units has become crucial for meeting operational demands and enhancing overall system performance. The authors of [6] studied the intelligent formation control problem by utilizing recurrent neural networks, and the authors of [7,8,9] provided fast formation tracking control protocols for multiple UHs to improve the convergence rate/time.
Compared to fixed-wing aircraft, helicopters are more prone to mechanical faults and operational challenges due to their unique structural characteristics, dynamic flight mechanics, and complex operating conditions. If a fault (e.g., performance degradation) occurring in an actuator, sensor, or other component is not detected and addressed, the helicopter may fail to perform its tasks. To keep system performance within acceptable bounds when a fault occurs, a fault-tolerant control (FTC) scheme must be implemented to accommodate the presumed faults [10,11]. Depending on whether faults are handled proactively, existing FTC methods can be generally categorized into passive FTC (PFTC) and active FTC (AFTC) [12]. PFTC does not modify the system structure or the control unit; it only employs a controller that is resilient against the expected and considered fault categories, without requiring accurate fault information [13,14,15]. However, this approach has inherent limitations in fault accommodation and may compromise system performance. An AFTC is developed by reconfiguring the controller, or selecting a pre-devised controller, through a fault diagnosis technique that can detect and identify faults [16,17]. Along this line, the authors of [18,19] developed an output feedback-based AFTC scheme for a single three-degree-of-freedom (3-DOF) helicopter suffering from angular position sensor faults. Next, the authors of [20] developed a decentralized fault estimation and distributed fault-tolerant control hierarchy for a number of 3-DOF helicopters with nonlinearities, uncertainties, and both actuator and sensor faults. As helicopter autonomy continues to improve, designing intelligent AFTC schemes to compensate for potential faults remains a crucial area for further investigation, and it serves as the primary motivation for this article.
It should be pointed out that the results above only require the helicopters to follow a desired trajectory under fault conditions, without imposing additional performance requirements such as minimizing tracking time, tracking distance, or energy consumption. Optimal control problems have therefore received considerable attention in practice, and they typically require solving either the Hamilton–Jacobi–Bellman (HJB) or the Hamilton–Jacobi–Isaacs (HJI) equation to obtain an analytical solution [21,22,23]. Unlike the conventional optimal control method, which first defines a cost function and then minimizes it to design an optimized controller, the inverse optimal control method takes the opposite route: it starts by designing a stabilizing controller and then identifies a corresponding cost function for which that controller is optimal. Consequently, it no longer hinges on solving the HJB or HJI equation but instead searches inversely for an appropriate objective function. This property makes inverse optimal control an essential tool for improving system performance and achieving optimization without solving partial differential equations. The authors of [24,25,26,27] investigated the inverse optimal consensus, formation, and containment control problems for multi-agent systems. The authors of [28] devised a fuzzy-based inverse optimal output feedback strategy to resist actuator and sensor network attacks. The authors of [29,30,31] later incorporated disturbance estimation into inverse optimal control. Given these capabilities, it is also worthwhile to equip UHs with an inverse optimal FTC, which is the main motivation for investigating this problem in the subsequent sections.
Enlightened by the discussions above, the inverse optimal fault-tolerant formation-containment control strategy is proposed for multi-UHs with actuator faults. The primary innovations and contributions are outlined below:
(1)
Different from the previous control schemes in [6,7,8,9], multiple leaders and multiple followers coexist in the formation system, and the leaders collaborate with each other to form a desired formation. Each UH model includes nonlinearities and actuator faults (simultaneous partial loss of effectiveness and bias), and its dynamics are separated into outer-loop (position) and inner-loop (attitude) subsystems based on their multiple-time-scale features.
(2)
The serial-parallel estimation model is constructed with intelligent approximation techniques. Accordingly, the composite learning (CL)-based updating law is designed for both the network weight and the fault estimation observer.
(3)
The estimated fault information is subsequently incorporated into the inverse optimal FTC to develop a controller that ensures both optimality and effective fault compensation. Owing to this, the formation of the leaders and reference tracking are obtained. Further, the followers not only stay inside the convex hull established by the leaders but also maintain a specific formation determined by the combination of the leaders.
The remaining sections of this article are structured as follows. Section 2 gives the problem formulation and related lemmas, definitions, and assumptions. Section 3 and Section 4 separately design the CL-based inverse optimal fault-tolerant formation and containment controller for leaders and followers. Finally, Section 5 presents the simulation flight results of the hierarchy-structured UHs, and Section 6 concludes this article.

2. The UH Flight Dynamics Model and Problem Formulation

Each UH can be recognized as a rigid body when executing formation flight tasks. Let O_I = {x_I, y_I, z_I} be the inertial reference frame positioned at the take-off location, and O_ι = {x_ι, y_ι, z_ι} be the body-fixed coordinate frame located at the center of gravity of the helicopter. Figure 1a presents the spatial configuration of the inertial and body-fixed frames. As depicted in Figure 1b, a flying system with N + M helicopters (including N leaders and M followers) is considered, where H = {1, 2, …, N + M} is the set of serial numbers of the helicopters, and L = {1, 2, …, N} and F = {N + 1, N + 2, …, N + M} are the sets of serial numbers of the leaders and followers, respectively. Also, the virtual leader is labeled as 0. The 6-DOF rigid body model of the ιth UH is described by a position subsystem and an attitude subsystem as follows:
\[
\dot{P}_\iota = V_\iota, \qquad \dot{V}_\iota = -g\,e_3 + \frac{1}{m_\iota} R_\iota f_\iota, \qquad \iota = 1, 2, \ldots, N+M \tag{1}
\]
\[
\dot{\Phi}_\iota = \Pi_\iota \Lambda_\iota, \qquad \dot{\Lambda}_\iota = -J_\iota^{-1} \Lambda_\iota^{\times} J_\iota \Lambda_\iota + J_\iota^{-1} \tau_\iota \tag{2}
\]
where P ι = [ x ι , y ι , z ι ] T and V ι = [ u ι , v ι , ω ι ] T are the position and the velocity vectors of the inertial reference frame O I , respectively. Φ ι = [ ϕ ι , θ ι , ψ ι ] T and Λ ι = [ p ι , q ι , r ι ] T denote the Euler angle and angular rate vectors of the body-fixed coordinate frame O ι , respectively. e 3 = [ 0 , 0 , 1 ] T is the unitary vector. g indicates the gravity acceleration, m ι denotes the mass of UH, and J ι = diag { J x ι , J y ι , J z ι } represents the diagonal inertia matrix. Further, the rotation matrix R ι , the anti-symmetric matrix Λ ι × , and the rotational kinematic matrix Π ι are defined below:
\[
R_\iota =
\begin{bmatrix}
\cos\theta_\iota\cos\psi_\iota & \sin\phi_\iota\sin\theta_\iota\cos\psi_\iota-\cos\phi_\iota\sin\psi_\iota & \cos\phi_\iota\sin\theta_\iota\cos\psi_\iota+\sin\phi_\iota\sin\psi_\iota\\
\cos\theta_\iota\sin\psi_\iota & \sin\phi_\iota\sin\theta_\iota\sin\psi_\iota+\cos\phi_\iota\cos\psi_\iota & \cos\phi_\iota\sin\theta_\iota\sin\psi_\iota-\sin\phi_\iota\cos\psi_\iota\\
-\sin\theta_\iota & \sin\phi_\iota\cos\theta_\iota & \cos\phi_\iota\cos\theta_\iota
\end{bmatrix},
\]
\[
\Lambda_\iota^{\times}=
\begin{bmatrix}
0 & -r_\iota & q_\iota\\
r_\iota & 0 & -p_\iota\\
-q_\iota & p_\iota & 0
\end{bmatrix},
\qquad
\Pi_\iota=
\begin{bmatrix}
1 & \sin\phi_\iota\tan\theta_\iota & \cos\phi_\iota\tan\theta_\iota\\
0 & \cos\phi_\iota & -\sin\phi_\iota\\
0 & \sin\phi_\iota/\cos\theta_\iota & \cos\phi_\iota/\cos\theta_\iota
\end{bmatrix}
\]
In the translational dynamics, f_ι = [0, 0, T_mι]^T is the force vector, with T_mι ≈ m_ι(g + Z_ωι ω_ι + Z_coι δ_coι) as the main rotor thrust controlled by the collective pitch δ_coι. Z_ωι and Z_coι denote the constants associated with the main rotor rotation speed and the collective pitch δ_coι, respectively.
For the rotational dynamics, τ ι is the moment vector applied to a helicopter, which is formed by
τ ι = J ι L a ι a ι + L b ι b ι M a ι a ι + M b ι b ι N r ι r ι + N t a ι δ t a ι + N c o ι δ c o ι
where L a ι , L b ι and M a ι , M b ι are the lateral and longitudinal wave motion coefficients of the main rotor. N r ι denotes the damping coefficient of the yaw angle. N t a ι and N c o ι are damping coefficients related to the tail rotor and main rotor (e.g., rotation speed and blade radius). δ t a ι is the collective pitch of the tail rotor. a ι and b ι denote the flapping angles of the main rotor along with the longitudinal and lateral axes, which are mainly controlled by longitudinal and lateral cyclic δ l o ι and δ l a ι . Their relationships can be approximated as
a ι = τ m ι q ι + A l a ι δ l a ι + A l o ι δ l o ι b ι = τ m ι p ι + B l a ι δ l a ι + B l o ι δ l o ι
Therefore, combining Equations (3) and (4), a modified moment vector can be rewritten as
τ ι = J ι τ m ι L a ι q ι τ m ι L b ι p ι τ m ι M a ι q ι τ m ι M b ι p ι N r ι r ι + J ι L l a ι δ l a ι + L l o ι δ l o ι M l a ι δ l a ι + M l o ι δ l o ι N t a ι δ t a ι + J ι 0 0 N c o ι δ c o ι = J ι A ι Ω ι + J ι B ι U ι + J ι N c o ι δ c o ι e 3
where U ι = [ δ l o ι , δ l a ι , δ t a ι ] T . τ m ι denotes the time constant of the main rotor. The parameters in the above matrices are L l a ι = L a ι A l a ι + L b ι B l a ι , M l a ι = M a ι A l a ι + M b ι B l a ι , L l o ι = L a ι A l o ι + L b ι B l o ι , and M l o ι = M a ι A l o ι + M b ι B l o ι , where the coefficients A l a ι , B l a ι and A l o ι , B l o ι can be obtained by the system model identification. The matrices A ι and B ι are given as
A ι = τ m ι L b ι τ m ι L a ι 0 τ m ι M b ι τ m ι M a ι 0 0 0 N r ι , B ι = L l o ι L l a ι 0 M l o ι M l a ι 0 0 0 N t a ι
Remark 1. 
Note that the representations of force vector f ι , moment vector τ ι , and blade flapping angles (both longitudinal a ι and lateral b ι ) are simplified in the above-mentioned model. The force and moment vectors generated by the main rotor, tail rotor, and fuselage can be highly complex due to unsteady aerodynamics and dynamic interactions. However, under steady or quasi-steady flight conditions (such as hover or low-speed cruise), these effects can be approximated using linearized or lumped-parameter models, significantly reducing computational complexity. Similarly, the blade flapping dynamics can be simplified under the small-angle assumption. In many control-oriented models, the flapping angles are approximated using first-order dynamics or even static gain relationships, assuming a linear response to cyclic pitch. These simplifications are reasonable in flight regimes, where the flapping angles remain small and high-frequency dynamics are negligible. Such simplifications keep a balance between model fidelity and computational efficiency, making them suitable for control design.
Remark 2. 
In this article, the virtual leader serves as a reference point that defines the desired trajectory and formation pattern for the actual leaders to follow and maintain. Although it does not physically exist, the virtual leader provides position or yaw angle information that coordinates the motions in the flying system. By tracking the virtual leader, each helicopter can maintain proper spacing and movement direction, thereby ensuring the overall geometric structure.
Let δ_coι, δ_loι, δ_laι, and δ_taι be the actual input signals, namely the collective pitch of the main rotor, the longitudinal and lateral cyclic pitches, and the collective pitch of the tail rotor. Each component plays a critical role in ensuring the helicopter's stability and maneuverability. Due to hydraulic or electrical malfunctions, as well as environmental factors such as extreme temperatures or corrosion, these control inputs may suffer from various types of faults. Among the most common are loss-of-effectiveness and bias faults, which allow the system to continue operating but with reduced performance. Such faults can result in sluggish or unresponsive control behavior, ultimately leading to degraded or even lost control authority. In this article, the fault models are formulated as
\[
\delta_{\upsilon}^{f} = \rho_{\iota m}\, \delta_{\upsilon} + \zeta_{\iota m} = \delta_{\upsilon} + \left( \rho_{\iota m} - 1 \right) \delta_{\upsilon} + \zeta_{\iota m}, \qquad m = 1, 2, 3, 4 \tag{6}
\]
where υ ∈ {coι, loι, laι, taι}, and ρ_ιm ∈ (0, 1] and ζ_ιm denote the partial loss of actuator effectiveness and the additive bias fault, respectively.
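As a quick illustration, the following Python sketch (with hypothetical fault values) applies the fault model (6) to a commanded input signal; the helper name faulty_input and all numeric values are placeholders, not quantities from this article.

import numpy as np

def faulty_input(delta_cmd, rho, zeta):
    # Fault model (6): delta_f = rho * delta_cmd + zeta
    #                          = delta_cmd + (rho - 1) * delta_cmd + zeta
    return rho * delta_cmd + zeta

# Hypothetical scenario: 80% effectiveness plus a small time-varying bias
# on the collective pitch after t = 5 s (values chosen only for illustration).
t = np.linspace(0.0, 20.0, 2001)
delta_co = 0.05 * np.sin(0.5 * t)               # commanded collective pitch (rad)
rho = np.where(t < 5.0, 1.0, 0.8)               # partial loss of effectiveness
zeta = np.where(t < 5.0, 0.0, 0.1 * np.cos(t))  # additive bias fault
delta_co_f = faulty_input(delta_co, rho, zeta)  # input actually applied to the plant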
Subsequently, the flight dynamics (1) and (2) of the ι th UH are separated into outer-loop (position) and inner-loop (attitude) subsystems for the control design. Since the design processes of three directions are similar, define [ P ι 1 , P ι 2 , P ι 3 ] T = [ x ι , y ι , z ι ] T and [ V ι 1 , V ι 2 , V ι 3 ] T = [ u ι , v ι , ω ι ] T , and the outer-loop (position) subsystem can be rewritten as
\[
\dot{P}_{\iota n} = V_{\iota n}, \qquad \dot{V}_{\iota n} = f_{\iota 1n} + u_{\iota 1n} + \Theta_{\iota 1n}, \qquad n = 1, 2, 3 \tag{7}
\]
where f ι 11 = ( cos ϕ ι sin θ ι cos ψ ι + sin ϕ ι sin ψ ι ) ( g + Z ω i ω ι ) , f ι 12 = ( cos ϕ ι sin θ ι sin ψ ι sin ϕ ι cos ψ ι ) ( g + Z ω ι ω ι ) and f ι 13 = cos ϕ ι cos θ ι ( g + Z ω ι ω ι ) are nonlinear functions. u ι 1 n and Θ ι 1 n imply the input signal to be designed and the lumped fault to be estimated, which are constructed as
u ι 11 = ( cos ϕ ι sin θ ι cos ψ ι + sin ϕ ι sin ψ ι ) Z c o ι δ c o ι u ι 12 = ( cos ϕ ι sin θ ι sin ψ ι sin ϕ ι sin ψ ι ) Z c o ι δ c o ι u ι 13 = g + cos ϕ ι cos θ ι Z c o ι δ c o ι
and
Θ ι 11 = ( cos ϕ ι sin θ ι cos ψ ι + sin ϕ ι sin ψ ι ) Z c o ι ( ( ρ ι 1 1 ) δ c o ι + ξ ι 1 ) Θ ι 12 = ( cos ϕ ι sin θ ι sin ψ ι sin ϕ ι sin ψ ι ) Z c o ι ( ( ρ ι 1 1 ) δ c o ι + ξ ι 1 ) Θ ι 13 = cos ϕ ι cos θ ι Z c o ι ( ( ρ ι 1 1 ) δ c o ι + ξ ι 1 )
Similarly, we denote [ Φ ι 1 , Φ ι 2 , Φ ι 3 ] T = [ ϕ ι , θ ι , ψ ι ] T and [ Λ ι 1 , Λ ι 2 , Λ ι 3 ] T = [ p ι , q ι , r ι ] T to reformulate the inner-loop (attitude) subsystem as
\[
\dot{\Phi}_{\iota n} = f_{\iota 2n} + g_{\iota n} \Lambda_{\iota n}, \qquad \dot{\Lambda}_{\iota n} = f_{\iota 3n} + u_{\iota 2n} + \Theta_{\iota 2n}, \qquad n = 1, 2, 3 \tag{10}
\]
where f ι 21 = sin ϕ ι tan θ ι q ι + cos ϕ ι tan θ ι r ι , f ι 22 = sin ϕ ι r ι , f ι 23 = q ι sin ϕ ι / cos θ ι , g i 1 = 1 , g ι 2 = cos ϕ ι , g ι 3 = cos ϕ ι / cos θ ι , f ι 31 = q ι r ι ( J z ι J y ι ) / J x ι τ m ι L b ι p ι τ m ι L a ι q ι , f ι 32 = p ι r ι ( J x ι J z ι ) / J y ι τ m ι M b ι p ι τ m ι M a ι q ι , and f ι 33 = p ι q ι ( J y ι J x ι ) / J z ι + N r ι r ι are nonlinear functions. u ι 2 n and Θ ι 2 n denote the input signal and the lumped fault as follows:
u ι 21 = L l o ι δ l o ι + L l a ι δ l a ι u ι 22 = M l o ι δ l o ι + M l a ι δ l a ι u ι 23 = N t a ι δ t a ι + N c o ι δ c o ι
and
Θ ι 21 = L l o ι ( ( ρ ι 2 1 ) δ l o ι + ξ ι 2 ) + L l a ι ( ( ρ ι 3 1 ) δ l a ι + ξ ι 3 ) Θ ι 22 = M l o ι ( ( ρ ι 2 1 ) δ l o ι + ξ ι 2 ) + M l a ι ( ( ρ ι 3 1 ) δ l a ι + ξ ι 3 ) Θ ι 23 = N t a ι ( ( ρ ι 4 1 ) δ t a ι + ξ ι 4 ) + N c o ι ( ( ρ ι 1 1 ) δ c o ι + ξ ι 1 )
Remark 3. 
It should be noted that the direct control inputs of the helicopter are δ_coι, δ_loι, δ_laι, and δ_taι. These inputs determine the control effectiveness in all flight directions and can be computed through the transformation relationships defined in (8) and (11). The fault model (6) covers the following four cases: (1) when ρ_ιm = 1 and ζ_ιm = 0, the actuator is fault-free (normal); (2) when ρ_ιm ∈ (0, 1) and ζ_ιm = 0, only a partial loss of actuator effectiveness occurs; (3) when ρ_ιm = 1 and ζ_ιm ≠ 0, only a bias fault occurs; (4) when ρ_ιm ∈ (0, 1) and ζ_ιm ≠ 0, both the partial loss of actuator effectiveness and a bias fault occur. Note that when ρ_ιm = 0 and ζ_ιm ≠ 0, the actuator is stuck, i.e., δ_υ sticks at a bounded function ζ_ιm. This case cannot be compensated for by the designed AFTC scheme; thus, we only consider the four cases above in the subsequent sections.
Before presenting the formation-containment control scheme, we make the following fundamental graph theory and some related lemmas, definitions, and assumptions.
The communication network among the N + M helicopters is represented by a directed graph G = (V, E, A), where V is the node set and E ⊆ V × V is the edge set. The vertex sets of the leaders and the followers are denoted by V_L and V_F, respectively. Consequently, V = V_L ∪ V_F and V_L ∩ V_F = ∅. The edge (i, j) ∈ E means that the ith UH can receive data from the jth UH. The adjacency matrix A = [a_ij] ∈ R^{(N+M)×(N+M)} is defined by a_ij = 1 if (i, j) ∈ E, and a_ij = 0 otherwise. The Laplacian matrix is L = D − A = [l_ij] ∈ R^{(N+M)×(N+M)}, where l_ij = −a_ij for i ≠ j and l_ii = Σ_{j=1, j≠i}^{N+M} a_ij. Here, D = diag{d_1, d_2, …, d_{N+M}} denotes the degree matrix with d_i = Σ_{j=1}^{N+M} a_ij. The Laplacian matrix can be partitioned as
\[
L = \begin{bmatrix} L_{ll} & 0_{N \times M} \\ L_{lf} & L_{ff} \end{bmatrix} \tag{13}
\]
where L l l R N × N signifies the communication topology among the leader UHs, L l f R M × N denotes the communication interaction from the leaders to the followers, and L f f R M × M implies the communication topology among the follower UHs.
In addition, B = diag{b_1, b_2, …, b_N}, where b_i characterizes the connection between the ith leader UH and the virtual leader: b_i = 1 if the ith leader can receive data from the virtual leader, and b_i = 0 otherwise.
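For concreteness, the Python sketch below builds the adjacency matrix, the Laplacian and its block partition, and the pinning matrix B for a hypothetical topology with N = 3 leaders and M = 2 followers; the chosen edges are illustrative only and are not the topology of Figure 3.

import numpy as np

N, M = 3, 2                              # leaders, followers
A = np.zeros((N + M, N + M))
# a_ij = 1 means node i receives data from node j (illustrative edges)
A[1, 0] = A[2, 1] = 1.0                  # leader 2 <- leader 1, leader 3 <- leader 2
A[3, 0] = A[3, 2] = A[3, 4] = 1.0        # follower 4 <- leader 1, leader 3, follower 5
A[4, 1] = A[4, 3] = 1.0                  # follower 5 <- leader 2, follower 4

D = np.diag(A.sum(axis=1))               # degree matrix, d_i = sum_j a_ij
L = D - A                                # Laplacian, l_ij = -a_ij for i != j

L_ll, L_lf, L_ff = L[:N, :N], L[N:, :N], L[N:, N:]
B = np.diag([1.0, 0.0, 0.0])             # only leader 1 is pinned to the virtual leader

# Lemma 1 checks: eigenvalues of L_ll + B have positive real parts, and the rows of
# the containment weight matrix -inv(L_ff) @ L_lf are nonnegative and sum to one.
print(np.linalg.eigvals(L_ll + B))
print(-np.linalg.inv(L_ff) @ L_lf)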
Assumption 1 
([32]). As for the leader UHs, the virtual leader has at least one path to each leader UH. Also, there exists at least one path from the leader UHs to each follower UH.
Lemma 1 
([32]). All the eigenvalues of the matrix L_ll + B have positive real parts. Each entry of −L_ff^{-1} L_lf is nonnegative, and each row sum of −L_ff^{-1} L_lf is equal to 1.
Definition 1 
([32]). (Convex Hull) Suppose that C is a subset of the real vector space Z ⊆ R^ρ. The set C is said to be convex if (1 − λ)α + λβ ∈ C holds for any λ ∈ [0, 1] and any two elements α, β ∈ C. The convex hull of a group of points X = {x_1, x_2, …, x_n} in Z is the minimal convex set containing all points in X, and it is represented as Co(X) = { Σ_{k=1}^{n} λ_k x_k | x_k ∈ X, λ_k ≥ 0, Σ_{k=1}^{n} λ_k = 1 }.
In this article, the control objective is to ensure the desired flight performance of the N + M UHs. As shown in Figure 2, a two-layer framework is employed to address the inverse optimal fault-tolerant formation-containment control problem: one layer is the formation layer of the N leaders, and the other is the containment layer of the M followers. Specifically, the leaders in the first layer simultaneously take action and form a desired reference formation pattern, while the followers in the second layer are contained in the convex hull constructed by the leaders. Thus, the inverse optimal fault-tolerant control of the hierarchy-structured UHs is completed if and only if both layers achieve their own control objectives. That is, for i, j ∈ L, there exists
\[
\lim_{t \to \infty} \left\| P_i - P_0 - c_i^{Pd} \right\| = 0, \qquad
\lim_{t \to \infty} \left\| P_i - P_j - \left( c_i^{Pd} - c_j^{Pd} \right) \right\| = 0 \tag{14}
\]
and for k ∈ F, j ∈ L, there exists
\[
\lim_{t \to \infty} \left\| P_k - \sum_{j \in L} \alpha_{kj} P_j \right\| = 0 \tag{15}
\]
where P 0 = [ x 0 , y 0 , z 0 ] T is the position of the virtual leader. c i P d = [ c i P 1 d , c i P 2 d , c i P 3 d ] T is the relative position of the desired configuration between the ith leader UH and the virtual leader.
Remark 4. 
Owing to the under-actuated mechanics of the ιth UH, the proposed control algorithm is separated into two cascaded control modules. The outer-loop (position) controller calculates the input signal u_ι1n and the appropriate attitude commands ϕ_ι^d and θ_ι^d to navigate the helicopter along the target trajectories, where ϕ_ι^d and θ_ι^d are calculated as
\[
\phi_\iota^{d} = \arcsin\!\left( \frac{\sin\psi_\iota^{d}\, u_{\iota 11} - \cos\psi_\iota^{d}\, u_{\iota 12}}{\sqrt{u_{\iota 11}^{2} + u_{\iota 12}^{2} + \left( u_{\iota 13} - g \right)^{2}}} \right), \qquad
\theta_\iota^{d} = \arctan\!\left( \frac{\cos\psi_\iota^{d}\, u_{\iota 11} + \sin\psi_\iota^{d}\, u_{\iota 12}}{u_{\iota 13} - g} \right), \qquad \iota = 1, 2, \ldots, N+M \tag{16}
\]
and ψ_ι^d = ψ_0, where ψ_0 denotes the yaw angle of the virtual leader. Then, the command Φ_ι^d = [ϕ_ι^d, θ_ι^d, ψ_ι^d]^T is delivered to the inner-loop (attitude) controller that calculates u_ι2n.
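A minimal Python sketch of this command extraction is given below. It follows the reconstructed form of (16) (the minus signs were lost in extraction and are reintroduced here as an assumption), and the numeric inputs are placeholders.

import numpy as np

G = 9.81  # gravitational acceleration (m/s^2)

def attitude_commands(u11, u12, u13, psi_d):
    # Reconstructed Eq. (16): map outer-loop inputs to roll/pitch commands
    den = np.sqrt(u11**2 + u12**2 + (u13 - G)**2)
    phi_d = np.arcsin((np.sin(psi_d) * u11 - np.cos(psi_d) * u12) / den)
    theta_d = np.arctan((np.cos(psi_d) * u11 + np.sin(psi_d) * u12) / (u13 - G))
    return phi_d, theta_d

phi_d, theta_d = attitude_commands(u11=0.5, u12=-0.3, u13=0.2, psi_d=1.0)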
Assumption 2 
([33]). The position P 0 = [ x 0 , y 0 , z 0 ] T and the yaw angle reference ψ 0 from the virtual leader and their derivatives are bounded and continuous functions.
Assumption 3 
([7]). The Euler angles (i.e., roll angle ϕ_ι, pitch angle θ_ι, and yaw angle ψ_ι) are bounded and satisfy ϕ_ι ∈ (−π/2, π/2), θ_ι ∈ (−π/2, π/2), and ψ_ι ∈ (−π, π) for all t.
Lemma 2 
([30]). (Inverse Optimal Control) Consider a nonlinear system as follows:
x ˙ = f ( x ) + g ( x ) u
where x ∈ R^n and u ∈ R^m denote the state and control input vectors, while f(x) and g(x) are smooth functions. If there exist a positive-definite control Lyapunov function V(x) and a control strategy u = −R(x)^{-1}(L_g V)^T stabilizing the system (17), then u* = βu with β ≥ 2 is an inverse optimal control input for the system (17), and the minimized performance index function is
\[
J = \lim_{t \to \infty} \left\{ \int_{0}^{t} \left( l(x) + u^{T} R(x)\, u \right) \mathrm{d}\tau \right\} \tag{18}
\]
where l ( x ) = 2 β ( L f V L g V R ( x ) 1 ( L g V ) T + β ( β 2 ) L g V R ( x ) 1 ( L g V ) T ) with L f V = V f ( x ) / x and L g V = V g ( x ) / x .
Lemma 3 
([7]). (Neural Network Approximation) For an unknown nonlinear function f ( x ) , there is a neural network w * T s ( x ) satisfying the following condition:
\[
f(x) = w^{*T} s(x) + \epsilon_x, \qquad |\epsilon_x| \le \bar{\epsilon}_x \tag{19}
\]
where w* represents the ideal neural network weight matrix, s(x) denotes the activation function vector, and ε̄_x is the bound on the approximation error ε_x. In this article, s(x) is taken as the Gaussian function, i.e., s(x) = exp(−‖x − c‖²/(2ς²)), where ς > 0 represents the width and c = [c_1, c_2, …, c_μ]^T is the center vector, with μ as the node number.
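As a concrete illustration of Lemma 3, the sketch below evaluates a Gaussian radial-basis-function network ŵᵀs(x) in Python; the centers, width, and weights are arbitrary placeholders rather than values used in this article.

import numpy as np

def rbf_features(x, centers, width):
    # Gaussian activations s_j(x) = exp(-||x - c_j||^2 / (2 * width^2))
    diff = x[None, :] - centers                      # shape (mu, dim)
    return np.exp(-np.sum(diff**2, axis=1) / (2.0 * width**2))

mu, dim = 7, 2
centers = np.linspace(-1.0, 1.0, mu)[:, None] * np.ones((1, dim))  # placeholder centers
width = 0.5
w_hat = np.zeros(mu)                                 # weights, adapted online

x = np.array([0.3, -0.2])
f_hat = w_hat @ rbf_features(x, centers, width)      # approximation of f(x)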

3. CL-Based Inverse Optimal Fault-Tolerant Formation Controller

In this section, a CL-based inverse optimal fault-tolerant formation control scheme is put forward for the N leader UHs to track the virtual leader and form a desired geometric pattern even though actuator faults exist. Based on this, P_i − P_j → c_i^{Pd} − c_j^{Pd} and P_i − P_0 → c_i^{Pd} are achieved in the outer-loop (position) subsystem, and Φ_i → Φ_i^d is obtained in the inner-loop (attitude) subsystem.

3.1. Outer-Loop (Position) Controller Design for the Leaders

For the formation layer, we define the tracking errors of each direction in the outer-loop (position) subsystem (7) as follows:
\[
e_{iPn} = \sum_{j \in L} a_{ij} \left[ P_{in} - P_{jn} - \left( c_{iPn}^{d} - c_{jPn}^{d} \right) \right] + b_i \left( P_{in} - P_{0n} - c_{iPn}^{d} \right), \qquad e_{iVn} = V_{in} - \bar{a}_{i1n}, \qquad n = 1, 2, 3 \tag{20}
\]
where i, j ∈ L, a_ij is the connection weight between the ith and jth helicopters, and ā_i1n denotes the virtual controller to be designed in the following.
Subsequently, we elaborate an inverse optimal fault-tolerant control scheme for the ith leader UH via the CL method under the backstepping framework.
In light of Equation (20), the derivative of e i P n with respect of time can be attained as
\[
\dot{e}_{iPn} = \pi_i e_{iVn} + \pi_i \bar{a}_{i1n} - \sum_{j \in L} a_{ij} \dot{P}_{jn} - b_i \dot{P}_{0n} \tag{21}
\]
where π_i = Σ_{j∈L} a_ij + b_i.
Then, the virtual controller a ¯ i 1 n is defined as
\[
\bar{a}_{i1n} = \pi_i^{-1} \left( -k_{i1n} e_{iPn} + \sum_{j \in L} a_{ij} \dot{P}_{jn} + b_i \dot{P}_{0n} \right) \tag{22}
\]
where k i 1 n denotes a positive constant.
By substituting Equation (22) into Equation (21), we obtain \dot{e}_{iPn} = -k_{i1n} e_{iPn} + \pi_i e_{iVn}.
For the unknown nonlinear function f_i1n, we employ the neural network f_i1n = w*_i1n^T s_i1n + ε_i1n to approximate it, where ε_i1n denotes the neural network approximation error with |ε_i1n| ≤ ε̄_i1n. Then, the derivative of e_iVn is obtained as
\[
\dot{e}_{iVn} = w_{i1n}^{*T} s_{i1n} + \epsilon_{i1n} + u_{i1n} + \Theta_{i1n} - \dot{\bar{a}}_{i1n} \tag{23}
\]
To compensate for the influence of lumped fault Θ i 1 n , the following CL-based fault estimation observer is utilized to estimate the exact information of Θ i 1 n :
\[
\hat{\Theta}_{i1n} = L_{i1n} \left( V_{in} - \eta_{i1n} \right), \qquad
\dot{\eta}_{i1n} = \hat{w}_{i1n}^{T} s_{i1n} + u_{i1n} + \hat{\Theta}_{i1n} - L_{i1n}^{-1} \left( r_{fi1n} \varpi_{fi1n} + e_{iVn} \right) \tag{24}
\]
where L_i1n denotes a positive constant, and ϖ_fi1n = V_in − V̂_in indicates the prediction error, where V̂_in is generated by the following serial-parallel estimation model:
\[
\dot{\hat{V}}_{in} = \hat{w}_{i1n}^{T} s_{i1n} + u_{i1n} + \hat{\Theta}_{i1n} + \gamma_{i1n} \varpi_{fi1n} \tag{25}
\]
where γ i 1 n is a user-defined positive constant.
For the neural network updating law, the prediction error ϖ f i 1 n is also applied to construct the learning law design:
\[
\dot{\hat{w}}_{i1n} = e_{iVn} s_{i1n} + r_{fi1n} \varpi_{fi1n} s_{i1n} - \kappa_{i1n} \hat{w}_{i1n} \tag{26}
\]
where r f i 1 n and κ i 1 n represent the positive constants to be designed.
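To make the composite-learning loop concrete, the Python sketch below performs one Euler integration step of the fault observer (24), the serial-parallel estimation model (25), and the composite weight update (26) for a single scalar channel. The step size, gains, and signals are placeholders, the regressor s can be taken as the Gaussian features sketched after Lemma 3, and the minus sign in front of L⁻¹(·) follows the reconstruction implied by (27).

import numpy as np

def composite_learning_step(V, V_hat, eta, w_hat, s, u, e_V,
                            L1, r_f, gamma, kappa, dt):
    # One Euler step of (24)-(26) for one outer-loop channel of one leader.
    Theta_hat = L1 * (V - eta)                        # fault estimate, Eq. (24)
    varpi = V - V_hat                                 # prediction error varpi_f
    # observer internal state, Eq. (24); minus sign consistent with Eq. (27)
    eta_dot = w_hat @ s + u + Theta_hat - (r_f * varpi + e_V) / L1
    # serial-parallel estimation model, Eq. (25)
    V_hat_dot = w_hat @ s + u + Theta_hat + gamma * varpi
    # composite weight update driven by tracking and prediction errors, Eq. (26)
    w_hat_dot = e_V * s + r_f * varpi * s - kappa * w_hat
    return (eta + dt * eta_dot,
            V_hat + dt * V_hat_dot,
            w_hat + dt * w_hat_dot,
            Theta_hat)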
With Equations (23)–(26), the time-derivative of Θ ^ i 1 n and e i V n are rewritten as follows:
\[
\dot{\hat{\Theta}}_{i1n} = L_{i1n} \left( \tilde{w}_{i1n}^{T} s_{i1n} + \epsilon_{i1n} + \tilde{\Theta}_{i1n} \right) + r_{fi1n} \varpi_{fi1n} + e_{iVn} \tag{27}
\]
\[
\dot{e}_{iVn} = \hat{w}_{i1n}^{T} s_{i1n} - \dot{\bar{a}}_{i1n} + \tilde{w}_{i1n}^{T} s_{i1n} + \epsilon_{i1n} + \bar{u}_{i1n} + \tilde{\Theta}_{i1n} \tag{28}
\]
where ū_i1n = u_i1n + Θ̂_i1n, and w̃_i1n = w*_i1n − ŵ_i1n and Θ̃_i1n = Θ_i1n − Θ̂_i1n are the estimation errors of the network weight and the lumped fault, respectively.
Remark 5. 
In [6,7,34], only the tracking error is used to construct the neural network updating law and the fault estimation observer. Here, the prediction error ϖ_fi1n is also employed, which improves the approximation accuracy of the network weight and the fault estimate.
The design of the inverse optimal fault-tolerant controller u i 1 n * is summarized in the next theorem.
Theorem 1. 
Under the designed virtual controller (22), the CL-based fault estimation observer (24), and the updating law of network weight (26), the inverse optimal fault-tolerant controller u i 1 n * is designed with a positive constant k ¯ i 1 n :
\[
u_{i1n}^{*} = -2 \left( \bar{k}_{i1n} + \frac{1}{2} + \frac{\pi_i^{2}}{2 k_{i1n}} + \frac{\Psi_{i1n}^{2}}{2 \bar{k}_{i1n}} \right) e_{iVn} - \hat{\Theta}_{i1n} \tag{29}
\]
which can make the tracking errors e i P n and e i V n as well as the errors Θ ˜ i 1 n , w ˜ i 1 n , and ϖ f i 1 n of the outer-loop (position) subsystem approach a small region of the origin, and achieve the minimum cost function as follows:
\[
J_1 = \sum_{i=1}^{N} \sum_{n=1}^{3} J_{i1n} = \sum_{i=1}^{N} \sum_{n=1}^{3} \lim_{t \to \infty} \int_{0}^{t} \left( l_{i1n} + \bar{u}_{i1n}^{T} R_{i1n} \bar{u}_{i1n} \right) \mathrm{d}\tau \tag{30}
\]
where u ¯ i 1 n = ( k ¯ i 1 n + 1 2 + π i 2 2 k i 1 n + Ψ i 1 n 2 2 k ¯ i 1 n ) e i V n , l i 1 n = 2 k i 1 n e i P n 2 + 2 k ¯ i 1 n e i V n 2 + ( 4 L i 1 n 2 2 L i 1 n ρ i 1 n s ¯ i 1 n 2 ) Θ ˜ i 1 n 2 + ( 2 κ i 1 n 2 ρ i 1 n L i 1 n ) w ˜ i 1 n T w ˜ i 1 n + ( 4 r f i 1 n γ i 1 n 2 ) ϖ f i 1 n 2 + 2 k i 1 n ( e i P n π i k i 1 n e i V n ) 2 + 2 k ¯ i 1 n ( e i V n Ψ i 1 n k ¯ i 1 n e i V n ) 2 4 ε i 1 n .
Proof of Theorem 1
The Lyapunov functional is chosen as follows:
\[
V_1 = \sum_{i=1}^{N} \sum_{n=1}^{3} V_{i1n} = \sum_{i=1}^{N} \sum_{n=1}^{3} \left( \frac{1}{2} e_{iPn}^{2} + \frac{1}{2} e_{iVn}^{2} + \frac{1}{2} \tilde{\Theta}_{i1n}^{2} + \frac{1}{2} \tilde{w}_{i1n}^{T} \tilde{w}_{i1n} + \frac{1}{2} r_{fi1n} \varpi_{fi1n}^{2} \right) \tag{31}
\]
Note that L g V i 1 n = V i 1 n / e i V n = e i V n . From Lemma 2, there exists a controller u ¯ i 1 n = R i 1 n 1 e i V n that can stabilize the whole subsystem. By calculating the time-derivative of V 1 and combining Equations (24)–(28), we have
V ˙ 1 = i = 1 N n = 1 3 { k i 1 n e i P n 2 + π i e i P n e i V n + ( Ψ i 1 n R i 1 n 1 ) e i V n 2 + e i V n ϵ i 1 n + Θ ˜ i 1 n Θ ˙ i 1 n Θ ˜ i 1 n L i 1 n w ˜ i 1 n T s i 1 n Θ ˜ i 1 n L i 1 n ϵ i 1 n L i 1 n Θ ˜ i 1 n 2 + κ i 1 w ˜ i 1 n T w ^ i 1 n + r f i 1 n ϖ f i 1 n ϵ i 1 n r f i 1 n γ i 1 n ϖ f i 1 n 2 }
where Ψ_i1n is a smooth function satisfying \hat{w}_{i1n}^{T} s_{i1n} - \dot{\bar{a}}_{i1n} = \Psi_{i1n} e_{iVn}.
The following facts exist by using Young’s inequality:
e i V n ϵ i 1 n 1 2 e i V n 2 + 1 2 ϵ ¯ i 1 n 2
Θ ˜ i 1 n Θ ˙ i 1 n 1 2 Θ ˜ i 1 n 2 + 1 2 χ i 1 n 2
r f i 1 n ϖ f i 1 n ϵ i 1 n 1 2 ϖ f i 1 n 2 + 1 2 r f i 1 n 2 ϵ ¯ i 1 n 2
Θ ˜ i 1 L i 1 n ϵ i 1 1 2 Θ ˜ i 1 2 + 1 2 L i 1 n 2 ϵ ¯ i n 2
κ i 1 n w ˜ i 1 n T w ^ i 1 n 1 2 κ i 1 n w ˜ i 1 n T w ˜ i 1 n + 1 2 κ i 1 n w i 1 n * 2
Θ ˜ i 1 n L i 1 n w ˜ i 1 n T s i 1 n 1 2 ρ i 1 n L i 1 n s ¯ i 1 n 2 Θ ˜ i 1 n 2 + 1 2 ρ i 1 n L i 1 n w ˜ i 1 n T w ˜ i 1 n
where | Θ ˙ i 1 n | χ i 1 n , s i 1 n s ¯ i 1 n , with χ i 1 n , s ¯ i 1 n , and ρ i 1 n as the positive constants.
Integrating the above-mentioned inequality (33)–(38) into Equation (32), we obtain
V ˙ 1 i = 1 N n = 1 3 { k i 1 n e i P n 2 ( R i 1 n 1 Ψ i 1 n 1 2 ) e i V n 2 ( L i 1 n 1 1 2 ρ i 1 n L i 1 n s ¯ i 1 n 2 ) Θ ˜ i 1 n 2 + π i e i P n e i V n + ε i 1 n ( 1 2 κ i 1 n 1 2 ρ i 1 n L i 1 n ) w ˜ i 1 n T w ˜ i 1 n ( r f i 1 n γ i 1 n 1 2 ) ϖ f i 1 n 2 }
where ε i 1 n = 1 2 χ i 1 n 2 + 1 2 κ i 1 n w i 1 n * T w i 1 n * + 1 2 r f i 1 n 2 ϵ ¯ i 1 n 2 + 1 2 ϵ ¯ i 1 n 2 + 1 2 L i 1 n 2 ϵ ¯ i 1 n 2 .
For a given R i 1 n 1 = k ¯ i 1 n + 1 2 + π i 2 2 k i 1 n + Ψ i 1 n 2 2 k ¯ i 1 n , V ˙ 1 can be further obtained as
V ˙ 1 i = 1 N n = 1 3 { 1 2 k i 1 n e i P n 2 1 2 k ¯ i 1 n e i V n 2 ( L i 1 n 1 1 2 ρ i 1 n L i 1 n s ¯ i 1 n 2 ) Θ ˜ i 1 n 2 ( 1 2 κ i 1 n 1 2 ρ i 1 n L i 1 n ) w ˜ i 1 n T w ˜ i 1 n ( r f i 1 n γ i 1 n 1 2 ) ϖ f i 1 n 2 1 2 k i 1 n ( e i P n π i k i 1 n e i V n ) 2 1 2 k ¯ i 1 n ( e i V n Ψ i 1 n k ¯ i 1 n e i V n ) 2 + ε i 1 n } i = 1 N n = 1 3 { 1 2 k i 1 n e i P n 2 1 2 k ¯ i 1 n e i V n 2 ( L i 1 n 1 1 2 ρ i 1 n L i 1 n s ¯ i 1 n 2 ) Θ ˜ i 1 n 2 ( 1 2 κ i 1 n 1 2 ρ i 1 n L i 1 n ) w ˜ i 1 n T w ˜ i 1 n ( r f i 1 n γ i 1 n 1 2 ) ϖ f i 1 n 2 + ε i 1 n }
this means that if the conditions L i 1 n 1 1 2 ρ i 1 n L i 1 n s ¯ i 1 n 2 > 0 , 1 2 κ i 1 n 1 2 ρ i 1 n L i 1 n > 0 and r f i 1 n γ i 1 n 1 2 > 0 hold, it follows that the controller u ¯ i 1 n = R i 1 n 1 e i V n can ensure that the following condition holds:
\[
\dot{V}_1 \le -c_1 V_1 + \varepsilon_1 \tag{41}
\]
where c 1 = min 1 i N , 1 n 3 { k i 1 n , k ¯ i 1 n , 2 L i 1 n 2 L i 1 n ρ i 1 n s ¯ i 1 n 2 , κ i 1 n 1 ρ i 1 n L i 1 n , 2 r f i 1 n γ i 1 n 1 } , and ε 1 = i = 1 N n = 1 3 ε i 1 n .
Then, from Lemma 2, when β i = 2 , we can derive that the CL-based inverse optimal controller u ¯ i 1 n * = 2 u ¯ i 1 n = 2 ( k ¯ i 1 n + 1 2 + π i 2 2 k i 1 n + Ψ i 1 n 2 2 k ¯ i 1 n ) e i V n can minimize the cost function (30). Moreover, the tracking errors e i P n and e i V n , along with the errors Θ ˜ i 1 n , w ˜ i 1 n , and ϖ f i 1 n of the ith leader UH, converge to a small region around the origin. Accordingly, the optimal fault-tolerant controller is derived as (29).
Next, we will verify whether the control objective (14) is achieved for the N leaders’ formation system. Define P ˜ i n = P i n P 0 n + c i P n d , and from Equation (20), we have
e i P n = j L a i j ( P ˜ i n P ˜ j n ) + b i P ˜ i n
which can be rewritten as e i P = j L a i j ( P ˜ i P ˜ j ) + b i P ˜ i with P ˜ i = P i P 0 + c i P d . As for the N leader UHs, the following is derived:
\[
e_{lP} = \left( \left( L_{ll} + B \right) \otimes I_3 \right) \tilde{P}_l \tag{43}
\]
where e_lP = [e_1P, e_2P, …, e_NP]^T and P̃_l = [P̃_1, P̃_2, …, P̃_N]^T. From Lemma 1, the matrix L_ll + B is non-singular, and P̃_l = ((L_ll + B) ⊗ I_3)^{-1} e_lP is derived; i.e., the formation control objective is achieved. □
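Putting the pieces of Theorem 1 together, the sketch below assembles the outer-loop control signal for one leader and one axis from the tracking errors, the fault estimate, and the inverse optimal gain of (29); the leading minus signs follow the sign reconstruction discussed above, and Psi is treated as a supplied smooth-function value as in the proof.

def leader_outer_loop_control(e_V, Theta_hat, pi_i, Psi, k1, k1_bar):
    # Inverse optimal fault-tolerant law, reconstructed Eq. (29):
    # u* = -2 * (k1_bar + 1/2 + pi_i^2/(2 k1) + Psi^2/(2 k1_bar)) * e_V - Theta_hat
    R_inv = k1_bar + 0.5 + pi_i**2 / (2.0 * k1) + Psi**2 / (2.0 * k1_bar)
    return -2.0 * R_inv * e_V - Theta_hat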

3.2. Inner-Loop (Attitude) Controller Design for the Leaders

Define the tracking errors of the inner-loop subsystem as follows:
\[
e_{i\Phi n} = \Phi_{in} - \Phi_{idn}, \qquad e_{i\Lambda n} = \Lambda_{in} - \bar{a}_{i2n} \tag{44}
\]
where [ Φ i d 1 , Φ i d 2 , Φ i d 3 ] T = [ ϕ i d , θ i d , ψ i d ] T , and a ¯ i 2 n is the virtual controller to be designed.
According to Lemma 3, the neural network is utilized to estimate the unknown function f i 2 n = w i 2 n * T s i 2 n + ϵ i 2 n , where ϵ i 2 n denotes the bounded approximation error satisfying | ϵ i 2 n | ϵ ¯ i 2 n . Similarly, the time-derivative of e i Φ n is derived as follows:
\[
\dot{e}_{i\Phi n} = w_{i2n}^{*T} s_{i2n} + \epsilon_{i2n} - \dot{\Phi}_{idn} + g_{in} \left( e_{i\Lambda n} + \bar{a}_{i2n} \right) \tag{45}
\]
where the network weight updating law is designed as
\[
\dot{\hat{w}}_{i2n} = e_{i\Phi n} s_{i2n} + r_{fi2n} \varpi_{fi2n} s_{i2n} - \kappa_{i2n} \hat{w}_{i2n} \tag{46}
\]
where r f i 2 n and κ i 2 n are positive constants. The prediction error is denoted as ϖ f i 2 n = Φ i n Φ ^ i n , where Φ ^ i n is estimated by the serial-parallel estimation model as follows:
\[
\dot{\hat{\Phi}}_{in} = \hat{w}_{i2n}^{T} s_{i2n} + g_{in} \left( e_{i\Lambda n} + \bar{a}_{i2n} \right) + \gamma_{i2n} \varpi_{fi2n} \tag{47}
\]
where γ i 2 n represents a positive constant to be designed.
Furthermore, the virtual controller a ¯ i 2 n is designed as
\[
\bar{a}_{i2n} = g_{in}^{-1} \left( -k_{i2n} e_{i\Phi n} + \dot{\Phi}_{idn} - \hat{w}_{i2n}^{T} s_{i2n} \right) \tag{48}
\]
where k i 2 n is a given positive constant.
Integrating Equation (48) into Equation (45), the time-derivative of e i Φ has the form
\[
\dot{e}_{i\Phi n} = -k_{i2n} e_{i\Phi n} + g_{in} e_{i\Lambda n} + \tilde{w}_{i2n}^{T} s_{i2n} + \epsilon_{i2n} \tag{49}
\]
where w̃_i2n = w*_i2n − ŵ_i2n is the approximation error of the network weight.
Similarly, with the help of the neural network approximation, e ˙ i Λ n is achieved as
\[
\dot{e}_{i\Lambda n} = w_{i3n}^{*T} s_{i3n} + \epsilon_{i3n} + u_{i2n} + \Theta_{i2n} - \dot{\bar{a}}_{i2n} \tag{50}
\]
where ϵ i 3 n is the bounded network approximation error with | ϵ i 3 n | ϵ ¯ i 3 n .
Correspondingly, the CL-based fault estimation observer is given below:
\[
\hat{\Theta}_{i2n} = L_{i2n} \left( \Lambda_{in} - \eta_{i2n} \right), \qquad
\dot{\eta}_{i2n} = \hat{w}_{i3n}^{T} s_{i3n} + u_{i2n} + \hat{\Theta}_{i2n} - L_{i2n}^{-1} \left( r_{fi3n} \varpi_{fi3n} + e_{i\Lambda n} \right) \tag{51}
\]
where L i 2 n denotes a positive constant, and the network weight training law of w ^ i 3 n with the prediction error ϖ f i 3 n = Λ i n Λ ^ i n is designed as follows:
\[
\dot{\hat{w}}_{i3n} = e_{i\Lambda n} s_{i3n} + r_{fi3n} \varpi_{fi3n} s_{i3n} - \kappa_{i3n} \hat{w}_{i3n} \tag{52}
\]
\[
\dot{\hat{\Lambda}}_{in} = \hat{w}_{i3n}^{T} s_{i3n} + u_{i2n} + \hat{\Theta}_{i2n} + \gamma_{i3n} \varpi_{fi3n} \tag{53}
\]
where γ i 3 n , r f i 3 n , and κ i 3 n denote designed positive constants.
On this basis, we have
\[
\dot{\hat{\Theta}}_{i2n} = L_{i3n} \left( \tilde{w}_{i3n}^{T} s_{i3n} + \epsilon_{i3n} + \tilde{\Theta}_{i2n} \right) + r_{fi3n} \varpi_{fi3n} + e_{i\Lambda n} \tag{54}
\]
\[
\dot{e}_{i\Lambda n} = \hat{w}_{i3n}^{T} s_{i3n} - \dot{\bar{a}}_{i2n} + \tilde{w}_{i3n}^{T} s_{i3n} + \epsilon_{i3n} + \bar{u}_{i2n} + \tilde{\Theta}_{i2n} \tag{55}
\]
where ū_i2n = u_i2n + Θ̂_i2n. Also, w̃_i3n = w*_i3n − ŵ_i3n and Θ̃_i2n = Θ_i2n − Θ̂_i2n are the approximation errors.
Theorem 2. 
Under the designed virtual controller (48), the fault estimation observer (51), and the composite updating law of network weight (52), the inverse optimal fault-tolerant controller u i 2 n * is designed with a positive constant k ¯ i 2 n :
\[
u_{i2n}^{*} = -2 \left( \bar{k}_{i2n} + \frac{g_{in}^{2}}{2 k_{i2n} - 1} + \frac{\Psi_{i2n}^{2}}{2 \bar{k}_{i2n}} \right) e_{i\Lambda n} - \hat{\Theta}_{i2n} \tag{56}
\]
which can make the tracking errors e i Φ n and e i Λ n as well as the estimation errors Θ ˜ i 2 n , w ˜ i 2 n , ϖ f i 2 n , w ˜ i 3 n , and ϖ f i 3 n of the inner-loop (attitude) subsystem approach a small region of the origin, and achieve the minimum cost function below:
J 2 = i = 1 N n = 1 3 J i 2 n = i = 1 N n = 1 3 lim t 0 ( l i 2 n + u ¯ i 2 n T R i 2 n u ¯ i 2 n ) d t
where  u ¯ i 2 n = ( k ¯ i 2 n + g i n 2 2 k i 2 n 1 + Ψ i 2 n 2 2 k ¯ i 2 n ) e i Λ n , l i 2 n = ( 2 k i 2 n 1 ) e i Φ n 2   + ( 4 r f i 2 n γ i 2 n 2 ) ϖ f i 2 n 2 + 2 κ i 2 n w ˜ i 2 n T w ˜ i 2 n   + 2 k ¯ i 2 n e i Λ n 2 + ( 4 L i 3 n 4 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 ) Θ ˜ i 2   + ( 2 κ i 3 n 2 ρ i 3 n L i 3 n ) w ˜ i 3 n T w ˜ i 3 n +   ( 4 r f i 3 n γ i 3 n 2 ) ϖ f i 3 n 2 + ( 2 k i 2 n 1 ) ( e i Φ n g i n k i 2 n 1 2 e i Λ n ) 2   + 2 k ¯ i 2 n ( e i Λ n Ψ i 2 n k ¯ i 2 n e i Λ n ) 2 4 ε i 2 n .
Proof of Theorem 2. 
For Equations (45) and (50), we select the candidate Lyapunov functional as follows:
V 2 i = 1 N n = 1 3 V i 2 n = i = 1 N n = 1 3 { 1 2 e i Φ n 2 + 1 2 w ˜ i 2 n T w ˜ i 2 n + 1 2 r f i 2 n ϖ f i 2 n 2 + 1 2 e i Λ n 2 + 1 2 Θ ˜ i 2 n 2 + 1 2 w ˜ i 3 n T w ˜ i 3 n + 1 2 r f i 3 n ϖ f i 3 n 2 }
Note that L g V i 2 n = V i 2 n / e i Λ n = e i Λ n . From Lemma 2, there exists a controller u ¯ i 2 n = R i 2 n 1 e i Λ n that can stabilize the whole subsystem:
V 2 = i = 1 N n = 1 3 { k i 2 n e i Φ n 2 + g i n e i Φ n e i Λ n + e i Φ n ϵ i 2 n + κ i 2 n w ˜ i 2 n T w ^ i 2 n + r f i 2 n ϖ f i 2 n ϵ i 2 n r f i 2 n γ i 2 n ϖ f i 2 n 2 + ( Ψ i 2 n R i 2 n 1 ) e i Λ n 2 + e i Λ n ϵ i 3 n + Θ ˜ i 2 n Θ ˙ i 2 n L i 3 n Θ ˜ i 2 n w ˜ i 3 n T s i 3 n L i 3 n Θ ˜ i 2 n ϵ i 3 n L i 3 n Θ ˜ i 2 n 2 + κ i 3 n w ˜ i 3 n T w ^ i 3 n + r f i 3 n ϖ f i 3 n ϵ i 3 n r f i 3 n γ i 3 n ϖ f i 3 n 2 }
where Ψ_i2n is a smooth function satisfying \hat{w}_{i3n}^{T} s_{i3n} - \dot{\bar{a}}_{i2n} = \Psi_{i2n} e_{i\Lambda n}.
Using Young’s inequality, one determines that
V 2 i = 1 N n = 1 3 { ( k i 2 n 1 2 ) e i Φ n 2 + g i n e i Φ n e i Λ n ( r f i 2 n γ i 2 n 1 2 ) ϖ f i 2 n 2 1 2 κ i 2 n w ˜ i 2 n T w ˜ i 2 n ( R i 2 n 1 1 2 Ψ i 2 n ) e i Λ n 2 ( L i 3 n 1 1 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 ) Θ ˜ i 2 n 2 ( 1 2 κ i 3 n 1 2 ρ i 3 n L i 3 n ) w ˜ i 3 n T w ˜ i 3 n ( r f i 3 n γ i 3 n 1 2 ) ϖ f i 3 n 2 + ε i 2 n }
where ε i 2 n = 1 2 ϵ ¯ i 2 n 2 + 1 2 κ i 2 n w i 2 n * T w i 2 n * + 1 2 r f i 2 n 2 ϵ ¯ i 2 n 2 + 1 2 ϵ ¯ i 3 n 2 + 1 2 χ i 2 n 2 + 1 2 L i 3 n 2 ϵ ¯ i 3 n 2 + 1 2 κ i 3 n w i 3 n * T w i 3 n * + 1 2 r f i 3 n 2 ϵ ¯ i 3 n 2 . ρ i 3 n , χ i 2 n , and s ¯ i 3 n are positive constants satisfying | Θ ˙ i 2 n | χ i 2 n and s i 3 n s ¯ i 3 n .
Defining R i 2 n 1 = k ¯ i 2 n + g i n 2 2 k i 2 n 1 + Ψ i 2 n 2 2 k ¯ i 2 n derives
V ˙ 2 i = 1 N n = 1 3 { 1 2 ( k i 2 n 1 2 ) e i Φ n 2 ( r f i 2 n γ i 2 n 1 2 ) ϖ f i 2 n 2 1 2 κ i 2 n w ˜ i 2 n T w ˜ i 2 n 1 2 k ¯ i 2 n e i Λ n 2 ( L i 3 n 1 1 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 ) Θ ˜ i 2 n 2 ( 1 2 κ i 3 n 1 2 ρ i 3 n L i 3 n ) w ˜ i 3 n T w ˜ i 3 n ( r f i 3 n γ i 3 n 1 2 ) ϖ f i 3 n 2 + ε i 2 n 1 2 ( k i 2 n 1 2 ) ( e i Φ n g i n k i 2 n 1 2 e i Λ n ) 2 1 2 k ¯ i 2 n ( e i Λ n Ψ i 2 n k ¯ i 2 n e i Λ n ) 2 } i = 1 N n = 1 3 { 1 2 ( k i 2 n 1 2 ) e i Φ n 2 ( r f i 2 n γ i 2 n 1 2 ) ϖ f i 2 n 2 1 2 κ i 2 n w ˜ i 2 n T w ˜ i 2 n 1 2 k ¯ i 2 n e i Λ n 2 ( L i 3 n 1 1 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 ) Θ ˜ i 2 n 2 ( 1 2 κ i 3 n 1 2 ρ i 3 n L i 3 n ) w ˜ i 3 n T w ˜ i 3 n ( r f i 3 n γ i 3 n 1 2 ) ϖ f i 3 n 2 + ε i 2 n }
which means that if the inequalities k i 2 n 1 2 > 0 , r f i 2 n γ i 2 n 1 2 > 0 , L i 3 n 1 1 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 > 0 , 1 2 κ i 3 n 1 2 ρ i 3 n L i 3 n > 0 , and r f i 3 n γ i 3 n 1 2 > 0 hold, the controller u i 2 n = R i 2 n 1 e i Λ n can stabilize the whole subsystem, that is,
\[
\dot{V}_2 \le -c_2 V_2 + \varepsilon_2 \tag{62}
\]
where c 2 = min 1 i N , 1 n 3 { k i 2 n 1 2 , 2 r f i 2 n γ i 2 n 1 , κ i 2 n , k ¯ i 2 n , 2 L i 3 n 2 L i 3 n ρ i 3 n s ¯ i 3 n 2 , κ i 3 n 1 ρ i 3 n L i 3 n , 2 r f i 3 n γ i 3 n 1 } and ε 2 = i = 1 N n = 1 3 ε i 2 n .
From Lemma 2, the optimized controller u ¯ i 2 n * = 2 u ¯ i 2 n = 2 ( k ¯ i 2 n + g i n 2 2 k i 2 n 1 + Ψ i 2 n 2 2 k ¯ i 2 n ) e i Λ n can minimize the performance index as (57). Further, the optimized fault-tolerant controller is calculated as (56), and the tracking errors e i Φ n and e i Λ n as well as the errors Θ ˜ i 2 n , w ˜ i 2 n , ϖ f i 2 n , w ˜ i 3 n , and ϖ f i 3 n of the ith leader UH approach a small region of the origin. □

4. CL-Based Inverse Optimal Fault-Tolerant Containment Controller

As for the second layer, the optimal position containment controller and optimal attitude tracking controller are separately designed for each follower UH in this subsection. Based on this, the followers will approach the convex region constructed by the leaders.
Firstly, the containment errors and the attitude tracking errors of the kth helicopter are defined as
\[
e_{kPn} = \sum_{l \in F} a_{kl} \left( P_{kn} - P_{ln} \right) + \sum_{j \in L} a_{kj} \left( P_{kn} - P_{jn} \right), \qquad e_{kVn} = V_{kn} - \bar{a}_{k1n} \tag{63}
\]
and
\[
e_{k\Phi n} = \Phi_{kn} - \Phi_{kdn}, \qquad e_{k\Lambda n} = \Lambda_{kn} - \bar{a}_{k2n} \tag{64}
\]
where k, l ∈ F and j ∈ L; a_kl and a_kj are the connection weights between the kth follower and the lth follower or the jth leader, respectively. [Φ_kd1, Φ_kd2, Φ_kd3]^T = [ϕ_kd, θ_kd, ψ_kd]^T, and ā_k1n and ā_k2n denote the virtual controllers to be designed.
Next, the entirety of the CL-based inverse optimal control schemes with fault compensation design for the follower UHs are presented in the subsequent steps.
For the outer-loop (position) subsystem, differentiating e k P n yields
e ˙ k P n = π k e k V n + π k a ¯ k 1 n l F a k l P ˙ l n j L a k j P ˙ j n
where π_k = Σ_{l∈F} a_kl + Σ_{j∈L} a_kj. Furthermore, we design the following feedforward virtual controller ā_k1n for target enclosing:
\[
\bar{a}_{k1n} = \pi_k^{-1} \left( -k_{k1n} e_{kPn} + \sum_{l \in F} a_{kl} \dot{P}_{ln} + \sum_{j \in L} a_{kj} \dot{P}_{jn} \right) \tag{66}
\]
where k_k1n is a positive constant, which yields \dot{e}_{kPn} = -k_{k1n} e_{kPn} + \pi_k e_{kVn}.
Similarly, the CL-based fault estimation observer Θ ^ k 1 n with the adaptive learning law of network weight w ^ k 1 n is constructed as
\[
\dot{\hat{\Theta}}_{k1n} = L_{k1n} \left( \tilde{w}_{k1n}^{T} s_{k1n} + \epsilon_{k1n} + \tilde{\Theta}_{k1n} \right) + r_{fk1n} \varpi_{fk1n} + e_{kVn} \tag{67}
\]
\[
\dot{\hat{w}}_{k1n} = e_{kVn} s_{k1n} + r_{fk1n} \varpi_{fk1n} s_{k1n} - \kappa_{k1n} \hat{w}_{k1n} \tag{68}
\]
where L k 1 n , r f k 1 n , and κ k 1 n are positive constants. ϖ f k 1 n = V k n V ^ k n is the prediction error with V ^ ˙ k n = w ^ k 1 n T s k 1 n + u k 1 n + Θ ^ k 1 n + γ k 1 n ϖ f k 1 n , with γ k 1 n as a positive constant. w ˜ k 1 n = w k 1 n * w ^ k 1 n and Θ ˜ k 1 n = Θ k 1 n Θ ^ k 1 n are the estimation errors of network weight and lumped faults.
Then, the derivative of e k V n is rewritten as
\[
\dot{e}_{kVn} = \hat{w}_{k1n}^{T} s_{k1n} - \dot{\bar{a}}_{k1n} + \tilde{w}_{k1n}^{T} s_{k1n} + \epsilon_{k1n} + \bar{u}_{k1n} + \tilde{\Theta}_{k1n} \tag{69}
\]
where u ¯ k 1 n = u k 1 n + Θ ^ k 1 n .
Similarly, for the inner-loop (attitude) subsystem, the time-derivation of e ˙ k Φ n is calculated as e ˙ k Φ n = w k 2 n * T s k 2 n + ϵ k 2 n Φ ˙ k d n + g k n ( e k Λ n + a ¯ k 2 n ) , and then, the virtual controller is
\[
\bar{a}_{k2n} = g_{kn}^{-1} \left( -k_{k2n} e_{k\Phi n} + \dot{\Phi}_{kdn} - \hat{w}_{k2n}^{T} s_{k2n} \right) \tag{70}
\]
where k_k2n is a given positive constant, and the neural network weight ŵ_k2n is updated by
\[
\dot{\hat{w}}_{k2n} = e_{k\Phi n} s_{k2n} + r_{fk2n} \varpi_{fk2n} s_{k2n} - \kappa_{k2n} \hat{w}_{k2n} \tag{71}
\]
where r f k 2 n and κ k 2 n are positive constants. ϖ f k 2 n = Φ k n Φ ^ k n denotes the prediction error with Φ ^ ˙ i n = w ^ k 2 n s k 2 n + g i n ( e k Λ n + a ¯ k 2 n ) + γ k 2 n ϖ f k 2 n , with γ k 2 n as a positive constant.
Combining (71) and (70), the time-derivative of e k Φ n has the form
\[
\dot{e}_{k\Phi n} = -k_{k2n} e_{k\Phi n} + g_{kn} e_{k\Lambda n} + \tilde{w}_{k2n}^{T} s_{k2n} + \epsilon_{k2n} \tag{72}
\]
where w ˜ k 2 n = w k 2 n * w ^ k 2 n is the approximation error of the network weight.
By differentiating e k Λ n with respect to the time, we obtain e ˙ k Λ n = w k 3 n * T s k 3 n + ϵ k 3 n + u k 2 n + Θ k 2 n a ¯ ˙ k 2 n ; then, the CL-based fault estimation observer with the network weight training law is formulated as
\[
\dot{\hat{\Theta}}_{k2n} = L_{k3n} \left( \tilde{w}_{k3n}^{T} s_{k3n} + \epsilon_{k3n} + \tilde{\Theta}_{k2n} \right) + r_{fk3n} \varpi_{fk3n} + e_{k\Lambda n} \tag{73}
\]
\[
\dot{\hat{w}}_{k3n} = e_{k\Lambda n} s_{k3n} + r_{fk3n} \varpi_{fk3n} s_{k3n} - \kappa_{k3n} \hat{w}_{k3n} \tag{74}
\]
where L k 3 n , r f k 3 n and κ k 3 n denote the positive constants. ϖ f k 3 n = Λ k n Λ ^ k n is the prediction error with Λ ^ ˙ k n = w ^ k 3 n T s k 3 n + u k 2 n + Θ ^ k 2 n + γ k 3 n ϖ f k 3 n . Also, w ˜ k 3 n = w k 3 n * w ^ k 3 n and Θ ˜ k 2 n = Θ k 2 n Θ ^ k 2 n are the approximation errors.
Then, we have
\[
\dot{e}_{k\Lambda n} = \hat{w}_{k3n}^{T} s_{k3n} - \dot{\bar{a}}_{k2n} + \tilde{w}_{k3n}^{T} s_{k3n} + \epsilon_{k3n} + \bar{u}_{k2n} + \tilde{\Theta}_{k2n} \tag{75}
\]
where u ¯ k 2 n = u k 2 n + Θ ^ k 2 n .
Theorem 3. 
Under the designed virtual controllers (66) and (70), the fault estimation observers (67) and (73) with the updating laws of network weights (68), (71), and (74), the inverse optimal fault-tolerant controllers u k 1 n * and u k 2 n * are designed with the positive constants k ¯ k 1 n and k ¯ k 2 n :
\[
u_{k1n}^{*} = -2 \left( \bar{k}_{k1n} + \frac{1}{2} + \frac{\pi_k^{2}}{2 k_{k1n}} + \frac{\Psi_{k1n}^{2}}{2 \bar{k}_{k1n}} \right) e_{kVn} - \hat{\Theta}_{k1n} \tag{76}
\]
\[
u_{k2n}^{*} = -2 \left( \bar{k}_{k2n} + \frac{g_{kn}^{2}}{2 k_{k2n} - 1} + \frac{\Psi_{k2n}^{2}}{2 \bar{k}_{k2n}} \right) e_{k\Lambda n} - \hat{\Theta}_{k2n} \tag{77}
\]
which can make the tracking errors e k P n , e k V n , e k Φ n , and e k Λ n , as well as the errors Θ ˜ k 1 n , w ˜ k 1 n , ϖ f k 1 n , Θ ˜ k 2 n , w ˜ k 2 n , ϖ f k 2 n , w ˜ k 3 n , and ϖ f k 3 n approach a small region of the origin and achieve the minimum cost functions as follows:
\[
J_3 = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} J_{k1n} = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} \lim_{t \to \infty} \left\{ \int_{0}^{t} \left( l_{k1n} + \bar{u}_{k1n}^{T} R_{k1n} \bar{u}_{k1n} \right) \mathrm{d}\tau \right\} \tag{78}
\]
\[
J_4 = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} J_{k2n} = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} \lim_{t \to \infty} \left\{ \int_{0}^{t} \left( l_{k2n} + \bar{u}_{k2n}^{T} R_{k2n} \bar{u}_{k2n} \right) \mathrm{d}\tau \right\} \tag{79}
\]
where u ¯ k 1 n = ( k ¯ k 1 n + 1 2 + π i 2 2 k k 1 n + Ψ k 1 n 2 2 k ¯ k 1 n ) e k V n , u ¯ k 2 n = ( k ¯ k 2 n + g k n 2 2 k k 2 n 1 + Ψ k 2 n 2 2 k ¯ k 2 n ) e k Λ n . l k 1 n = 2 k k 1 n e k P n 2 + 2 k ¯ k 1 n e k V n 2 + ( 4 L k 1 n 2 2 L k 1 n ρ k 1 n s ¯ k 1 n 2 ) Θ ˜ k 1 n 2 + ( 2 κ k 1 n 2 ρ k 1 n L k 1 n ) w ˜ k 1 n T w ˜ k 1 n + ( 4 r f k 1 n γ k 1 n 2 ) ϖ f k 1 n 2 + 2 k k 1 n ( e k P n π k k k 1 n e i V n ) 2 + 2 k ¯ k 1 n ( e k V n Ψ k 1 n k ¯ k 1 n e k V n ) 2 4 ε k 1 n , and l k 2 n = ( 2 k k 2 n 1 ) e k Φ n 2 + ( 4 r f k 2 n γ k 2 n 2 ) ϖ f k 2 n 2 + 2 κ k 2 n w ˜ k 2 n T w ˜ k 2 n + ( 4 L k 3 n 4 2 L k 3 n ρ k 3 n s ¯ k 3 n 2 ) Θ ˜ i 2 + ( 2 κ k 3 n 2 ρ k 3 n L k 3 n ) w ˜ k 3 n T w ˜ k 3 n + 2 k ¯ k 2 n e k Λ n 2 + ( 2 k k 2 n 1 ) ( e k Φ n g i n k k 2 n 1 2 e k Λ n ) 2 + 2 k ¯ k 2 n ( e k Λ n Ψ k 2 n k ¯ k 2 n e k Λ n ) 2 + ( 4 r f k 3 n γ k 3 n 2 ) ϖ f k 3 n 2 4 ε k 2 n .
Proof of Theorem 3. 
For the outer-loop (position) subsystem, select the Lyapunov function as follows:
\[
V_3 = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} V_{k3n} = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} \left( \frac{1}{2} e_{kPn}^{2} + \frac{1}{2} e_{kVn}^{2} + \frac{1}{2} \tilde{\Theta}_{k1n}^{2} + \frac{1}{2} \tilde{w}_{k1n}^{T} \tilde{w}_{k1n} + \frac{1}{2} r_{fk1n} \varpi_{fk1n}^{2} \right) \tag{80}
\]
By employing the aforementioned procedures in (32)–(39), we can know that for a given R k 1 n 1 = k ¯ k 1 n + 1 2 + π k 2 2 k k 1 n + Ψ k 1 n 2 2 k ¯ k 1 n , V ˙ 3 becomes
V ˙ 3 k = N + 1 M n = 1 3 { 1 2 k k 1 n e k P n 2 1 2 k ¯ k 1 n e k V n 2 ( L k 1 n 1 1 2 ρ k 1 n L k 1 n s ¯ k 1 n 2 ) Θ ˜ k 1 n 2 ( 1 2 κ k 1 n 1 2 ρ k 1 n L k 1 n ) w ˜ k 1 n T w ˜ k 1 n ( r f k 1 n γ k 1 n 1 2 ) ϖ f k 1 n 2 + ε k 1 n }
where ε k 1 n = 1 2 χ k 1 n 2 + 1 2 κ k 1 n w k 1 n * T w k 1 n * + 1 2 r f k 1 n 2 ϵ ¯ k 1 n 2 + 1 2 ϵ ¯ k 1 n 2 + 1 2 L k 1 n 2 ϵ ¯ k 1 n 2 . ρ k 1 n , χ k 1 n , and s ¯ k 1 n are positive constants satisfying | Θ ˙ k 1 n | χ k 1 n and s k 1 n s ¯ k 1 n . Note that if the conditions L k 1 n 1 1 2 ρ k 1 n L k 1 n s ¯ k 1 n 2 > 0 , 1 2 κ k 1 n 1 1 2 ρ k 1 n L k 1 n > 0 and r f k 1 n γ k 1 n 1 2 > 0 hold, it follows that the controller u ¯ k 1 n = R k 1 n 1 e k V n can ensure that the following condition holds:
\[
\dot{V}_3 \le -c_3 V_3 + \varepsilon_3 \tag{82}
\]
where c 3 = min N + 1 k M , 1 n 3 { k k 1 n , 2 L k 1 n 2 L k 1 n ρ k 1 n s ¯ k 1 n 2 , κ k 1 n 1 ρ k 1 n L k 1 n , 2 r f k 1 n γ k 1 n 1 , k ¯ k 1 n } and ε 3 = k = N + 1 M n = 1 3 ε k 1 n . From Lemma 2, when β k = 2 , the CL-based inverse optimal controller u ¯ k 1 n * = 2 u ¯ k 1 n = 2 ( k ¯ k 1 n + 1 2 + π i 2 2 k k 1 n + Ψ k 1 n 2 2 k ¯ k 1 n ) e k V n can minimize the cost function (78). Moreover, the tracking errors e k P n and e k V n along with the estimation errors Θ ˜ k 1 n , w ˜ k 1 n , and ϖ f k 1 n will converge to a small region around the origin. Then, the inverse optimal fault-tolerant controller is designed as (76).
For the inner-loop (attitude) subsystem, the Lyapunov functional is chosen as
\[
V_4 = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} V_{k2n} = \sum_{k=N+1}^{N+M} \sum_{n=1}^{3} \left( \frac{1}{2} e_{k\Phi n}^{2} + \frac{1}{2} \tilde{w}_{k2n}^{T} \tilde{w}_{k2n} + \frac{1}{2} r_{fk2n} \varpi_{fk2n}^{2} + \frac{1}{2} e_{k\Lambda n}^{2} + \frac{1}{2} \tilde{\Theta}_{k2n}^{2} + \frac{1}{2} \tilde{w}_{k3n}^{T} \tilde{w}_{k3n} + \frac{1}{2} r_{fk3n} \varpi_{fk3n}^{2} \right) \tag{83}
\]
and we can know that, if R k 2 n 1 = k ¯ k 2 n + g k n 2 2 k k 2 n 1 + Ψ k 2 n 2 2 k ¯ k 2 n , we have
V ˙ 4 k = N + 1 M n = 1 3 { 1 2 ( k k 2 n 1 2 ) e k Φ n 2 ( r f k 2 n γ k 2 n 1 2 ) ϖ f k 2 n 2 1 2 κ k 2 n w ˜ k 2 n T w ˜ k 2 n 1 2 k ¯ k 2 n e k Λ n 2 ( L k 3 n 1 1 2 L k 3 n ρ k 3 n s ¯ k 3 n 2 ) Θ ˜ k 2 n 2 ( 1 2 κ k 3 n 1 2 ρ k 3 n L k 3 n ) w ˜ k 3 n T w ˜ k 3 n ( r f k 3 n γ k 3 n 1 2 ) ϖ f k 3 n 2 + ε k 2 n }
where ε k 2 n = 1 2 ϵ ¯ k 2 n 2 + 1 2 κ k 2 n w k 2 n * T w k 2 n * + 1 2 r f k 2 n 2 ϵ ¯ k 2 n 2 + 1 2 ϵ ¯ k 3 n 2 + 1 2 L k 3 n 2 ϵ ¯ k 3 n 2 + 1 2 κ k 3 n w k 3 n * T w k 3 n * + 1 2 r f k 3 n 2 ϵ ¯ k 3 n 2 + 1 2 χ k 2 n 2 . ρ k 3 n , χ k 2 n , and s ¯ k 3 n are positive constants satisfying | Θ ˙ k 2 n | χ k 2 n and s k 3 n s ¯ k 3 n . This means that if the inequalities k k 2 n 1 2 > 0 , r f k 2 n γ k 2 n 1 2 > 0 , L k 3 n 1 1 2 L k 3 n ρ k 3 n s ¯ k 3 n 2 > 0 , 1 2 κ k 3 n 1 2 ρ k 3 n L k 3 n > 0 , and r f k 3 n γ k 3 n 1 2 > 0 hold, the controller u k 2 n = R k 2 n 1 e k Λ n can stabilize the whole subsystem such that
\[
\dot{V}_4 \le -c_4 V_4 + \varepsilon_4 \tag{85}
\]
where c 4 = min N + 1 k M , 1 n 3 { k k 2 n 1 2 , 2 r f k 2 n γ k 2 n 1 , κ k 2 n , k ¯ k 2 n , 2 L k 3 n 2 L k 3 n ρ k 3 n s ¯ k 3 n 2 , κ k 3 n 1 ρ k 3 n L k 3 n , 2 r f k 3 n γ k 3 n 1 } and ε 4 = k = N + 1 M n = 1 3 ε k 2 n . Moreover, the tracking errors e k Φ n and e k Λ n alongside the errors Θ ˜ k 2 n , w ˜ k 2 n , ϖ f k 2 n , w ˜ k 3 n , and ϖ f k 3 n approach a small neighborhood of the origin.
The containment tracking objective is verified as follows. From (15), the containment error is described as e_cP = P_f + (L_ff^{-1} L_lf ⊗ I_3) P_l, where P_l = [P_1, P_2, …, P_N]^T and P_f = [P_{N+1}, P_{N+2}, …, P_{N+M}]^T. In terms of the condition (63), we have e_kP = Σ_{l∈F} a_kl (P_k − P_l) + Σ_{j∈L} a_kj (P_k − P_j), and the following equation can be derived:
\[
e_{fP} = \left( L_{ff} \otimes I_3 \right) P_f + \left( L_{lf} \otimes I_3 \right) P_l = \left( L_{ff} \otimes I_3 \right) \left( P_f + \left( L_{ff}^{-1} L_{lf} \otimes I_3 \right) P_l \right) = \left( L_{ff} \otimes I_3 \right) e_{cP} \tag{86}
\]
where e_fP = [e_{(N+1)P}, e_{(N+2)P}, …, e_{(N+M)P}]^T. It follows that e_cP = (L_ff ⊗ I_3)^{-1} e_fP; i.e., the containment objective is achieved accordingly. □
Remark 6. 
For the flying system containing N leaders and M followers, if we choose the Lyapunov function as V = V_1 + V_2 + V_3 + V_4, it follows that V̇ ≤ −cV + ε with c = min{c_1, c_2, c_3, c_4} and ε = ε_1 + ε_2 + ε_3 + ε_4, and the inverse optimal fault-tolerant controllers u*_i1n, u*_i2n, u*_k1n, and u*_k2n minimize the cost function J = J_1 + J_2 + J_3 + J_4.
Remark 7. 
Among the above-mentioned parameters, larger values of k ι 1 n , k ¯ ι 1 n , k ι 2 n , k ¯ ι 2 n , γ i 1 n , γ i 2 n , and γ i 3 n along with smaller values of χ i 1 n , ϵ ¯ i 1 n , s ¯ i 1 n , χ i 2 n , ϵ ¯ i 2 n , ϵ ¯ i 3 n , and s ¯ i 3 n can be chosen such that L ι 1 n 1 1 2 ρ ι 1 n L ι 1 n s ¯ ι 1 n 2 > 0 , 1 2 κ ι 1 n 1 1 2 ρ ι 1 n L ι 1 n > 0 , r f ι 1 n γ ι 1 n 1 2 > 0 , k ι 2 n 1 2 > 0 , r f ι 2 n γ ι 2 n 1 2 > 0 , L ι 3 n 1 1 2 L ι 3 n ρ ι 3 n s ¯ ι 3 n 2 > 0 , 1 2 κ ι 3 n 1 2 ρ ι 3 n L ι 3 n > 0 and r f ι 3 n γ ι 3 n 1 2 > 0 hold, keeping a balance between the faster control convergence time/rate and the input characteristic.

5. Simulation Results

The flight performance of the inverse optimal fault-tolerant formation-containment control protocol was examined using a numerical example. A UH system composed of three leaders and two followers was considered. This leader-follower platoon was required to accomplish a fault reconfiguration mission, moving from arbitrary positions in the air to an assigned pattern. The related physical parameters of each UH are given in Table 1.
The control objective of this theoretical case analysis is to enable the three leaders (labeled 1, 2, and 3) to maintain a triangular formation and encircle a particular target or reference provided by the virtual leader. Afterward, the position trajectories of the two followers (labeled 4 and 5) need to enter the convex hull spanned by the leaders. The information transmission topology among the members of the helicopter system is exhibited in Figure 3. The reference trajectory of the virtual leader is expressed as x_0 = 15 sin(0.2t), y_0 = 15 cos(0.2t), z_0 = 0.1t, and ψ_0 = 1, and a desired formation pattern of the leaders is set with the relative distances c_x1^d = 0, c_x2^d = a, c_x3^d = a, c_y1^d = 0, c_y2^d = a, c_y3^d = a, and c_z1^d = c_z2^d = c_z3^d = 0 with a = 5.
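For reference, the virtual leader trajectory and the desired leader offsets used above can be encoded as in the Python sketch below; note that the sign pattern of the nonzero offsets is an assumption (the extraction dropped any minus signs), so only the magnitudes should be read as authoritative.

import numpy as np

def virtual_leader(t):
    # Reference position and yaw of the virtual leader
    return np.array([15.0 * np.sin(0.2 * t),
                     15.0 * np.cos(0.2 * t),
                     0.1 * t]), 1.0

a = 5.0
# Desired offsets c_i^{Pd} of leaders 1-3 relative to the virtual leader (triangle
# pattern); the signs below are assumed, the magnitudes follow the text.
c_d = np.array([[0.0, 0.0, 0.0],
                [  a,  -a, 0.0],
                [ -a,  -a, 0.0]])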
The other related simulation parameters are given as k i 1 n = k ¯ i 1 n = 15 , k i 2 n = k ¯ i 2 n = 10 , k k 1 n = k ¯ k 1 n = 15 , k k 2 n = k ¯ k 2 n = 8 , L i 1 n = L i 1 n = 50 , L k 1 n = L k 1 n = 50 , γ i 1 n = γ i 2 n = γ i 3 n = 20 , γ k 1 n = γ k 2 n = γ k 3 n = 20 , r f i 1 n = r f i 2 n = r f i 3 n = 1 , r f k 1 n = r f k 2 n = r f k 3 n = 1 , κ i 1 n = κ i 2 n = κ i 3 n = 1 , and κ k 1 n = κ k 2 n = κ k 3 n = 1 . The faults are assumed to occur in leaders 2 and 3 and follower 5, while the others operate under normal conditions. The detailed fault parameters are set as follows:
Before the fault occurrence times, all actuators are healthy, i.e., ρ_ιm = 1 and ζ_ιm = 0. After the faults occur (t ≥ 5 s for leaders 2 and 3, and t ≥ 10 s for follower 5), the fault parameters take the following values:
Leader 2 (t ≥ 5 s): ρ_21 = 0.9, ζ_21 = 0.1 cos(t); ρ_22 = 0.8, ζ_22 = 0.1 cos(t); ρ_23 = 1, ζ_23 = 0.2 cos(1.5t); ρ_24 = 0.6, ζ_24 = 0.3 cos(2.5t).
Leader 3 (t ≥ 5 s): ρ_31 = 0.9, ζ_31 = 0.1 sin(2t); ρ_32 = 1, ζ_32 = 0.2 cos(0.5t); ρ_33 = 0.6, ζ_33 = 0.3 cos(t); ρ_34 = 0.7, ζ_34 = 0.4 cos(1.5t).
Follower 5 (t ≥ 10 s): ρ_51 = 0.9, ζ_51 = 0.1 sin(1.5t); ρ_52 = 1, ζ_52 = 0.2 cos(t); ρ_53 = 0.7, ζ_53 = 0.2 cos(0.5t); ρ_54 = 0.4, ζ_54 = 0.4 cos(t).
Under the initial conditions of P 1 = [ 0 , 0 , 0 ] T , P 2 = [ 0 , 1 , 0 ] T , P 3 = [ 1 , 0 , 0 ] T , P 4 = [ 1 , 1 , 0 ] T , and P 5 = [ 0 , 2 , 0 ] T , the simulation results are provided in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9. Figure 4a–d show the moving trajectories (i.e., x ι , y ι , z ι , and ψ ι ) of the five UHs. Figure 5 depicts the formation-containment performance in the 3-orthogonal plane. Figure 6 depicts the formation-containment performance in the XY plane. From this, it can be concluded that the designed protocol can simultaneously make the multi-UHs reach and preserve the target geometric pattern from any position and track the global reference signal by a virtual leader with a specific position. In Figure 7, Figure 8 and Figure 9, it can be observed that the proposed fault estimation observer will approximate the lumped actuator faults Θ 21 n , Θ 22 n , Θ 31 n , Θ 32 n , Θ 51 n , and Θ 52 n well even though the faults occur. Through the above analysis of these helicopter formations and containment statuses, we can conclude that the system realizes the expected formation-containment flight performance under the designed inverse optimal fault-tolerant control protocol, and the fault effects can be compensated well based on the relevant estimations.
Further, to validate the advantages of the proposed inverse optimal fault-tolerant formation-containment control strategy, comparative simulations were conducted under the same initial conditions and parameters. Specifically, two alternative schemes were considered: one using a non-optimal control law (Case I, i.e., $u_{\iota 1}^n = -(\bar{k}_{\iota 1}^n + \tfrac{1}{2}) e_{\iota P}^n - \pi_\iota e_{\iota P}^n - \Psi_{i1}^n e_{\iota P}^n - \hat{\Theta}_{\iota 1}^n$ and $u_{\iota 2}^n = -(\bar{k}_{\iota 2}^n + \tfrac{1}{2}) e_{\iota \Phi}^n - g_\iota^n e_{\iota \Phi}^n - \bar{\Psi}_{i2}^n e_{\iota \Phi}^n - \hat{\Theta}_{\iota 2}^n$), and the other omitting the serial-parallel estimation model (Case II, i.e., $r_{fi1}^n = r_{fi2}^n = r_{fi3}^n = 0$ and $r_{fk1}^n = r_{fk2}^n = r_{fk3}^n = 0$). The corresponding formation and containment errors $e_{\iota P}^n$ and $e_{\iota \Phi}^n$ are presented in Figure 10, Figure 11 and Figure 12: Figure 10 shows the performance of the proposed strategy, Figure 11 displays the error trajectories under Case I, and Figure 12 shows those under Case II. As illustrated, the proposed control protocol improves the tracking accuracy in both the formation and containment tasks compared with Cases I and II. These improvements highlight the role of optimality and accurate estimation in enhancing system performance.
To quantitatively evaluate the control performance, two performance indices are introduced: the comprehensive performance index (CPI), defined as $\big(\sum_{n=1}^{3} \|e_{\iota P}^n\|^2 + \sum_{n=1}^{3} \|e_{\iota \Phi}^n\|^2\big)^{1/2}$, and the mean-squared error (MSE), given by $\tfrac{1}{T}\sum_{t=0}^{T}\big(\sum_{n=1}^{3} \|e_{\iota P}^n\|^2 + \sum_{n=1}^{3} \|e_{\iota \Phi}^n\|^2\big)$. The calculated values of these indices further confirm the effectiveness and superiority of the proposed control algorithm for formation-containment flight.
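For reference, both indices can be evaluated from logged error data as in the sketch below. The array layout (time samples along the first axis and the three channels $n = 1, 2, 3$ along the second) and the use of the Euclidean norm are assumptions made for illustration.

```python
import numpy as np

def cpi(e_P, e_Phi):
    """Pointwise comprehensive performance index:
    sqrt(sum_n ||e_P^n(t)||^2 + sum_n ||e_Phi^n(t)||^2) at every logged instant."""
    return np.sqrt(np.sum(e_P**2, axis=1) + np.sum(e_Phi**2, axis=1))

def mse(e_P, e_Phi):
    """Mean-squared error of the combined formation/containment errors over T logged samples."""
    T = e_P.shape[0]
    return float(np.sum(e_P**2 + e_Phi**2) / T)

# Example with dummy logged errors: 500 time samples x 3 channels (n = 1, 2, 3)
rng = np.random.default_rng(0)
e_P = rng.normal(scale=0.05, size=(500, 3))
e_Phi = rng.normal(scale=0.05, size=(500, 3))
print(cpi(e_P, e_Phi).max(), mse(e_P, e_Phi))
```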

6. Conclusions

This article investigated a composite learning-based inverse optimal fault-tolerant control algorithm for multiple UHs with unknown nonlinearities and actuator faults. To solve this problem, a two-layer control framework was constructed, consisting of the leaders' formation control and the followers' containment control. To improve the approximation accuracy for the compound nonlinearity, the network weight updating law was constructed from both the tracking error and the prediction error derived from the serial-parallel estimation model. Furthermore, the fault estimation observer was designed using this updating law and the prediction error. On this basis, the active fault-tolerant formation-containment controller was devised to realize formation reference tracking for the leaders while the followers converged to the convex hull spanned by the leaders. Finally, a numerical formation-containment experiment with five UHs was carried out to demonstrate the effectiveness and advantages of the obtained theoretical results.
Future work will mainly concentrate on the observer-based control problem for more general helicopter models with disturbances and dynamic flapping, as in [15,31,35].

Author Contributions

Conceptualization, Q.L. and K.Z.; methodology, Q.L.; software, Q.L.; validation, Q.L. and K.Z.; data curation, Q.L.; writing—original draft preparation, Q.L.; writing—review and editing, Q.L., K.Z. and B.J.; supervision, K.Z., B.J. and Y.T.; funding acquisition, K.Z. and Y.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Natural Science Foundation of China under Grants 62173180, 62020106003, and 62441310; the Science Center Program of National Natural Science Foundation of China under Grant 62188101; Natural Science Foundation of Jiangsu Province of China under Grants BZ2024037 and BK20222012; the National Key Laboratory Foundation of Helicopter Aeromechanics under Grant 2023-HA-LB-067-04; and the Postgraduate Research and Practice Innovation Program of Jiangsu Province (KYCX24_0589).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fang, X.; Wu, A.; Shang, Y.; Dong, N. Robust control of small-scale unmanned helicopter with matched and mismatched disturbances. J. Frankl. Inst. 2016, 353, 4803–4820. [Google Scholar] [CrossRef]
  2. Hernández-González, O.; Ramírez-Rasgado, F.; Farza, M.; Guerrero-Sánchez, M.-E.; Astorga-Zaragoza, C.-M.; M’Saad, M.; Valencia-Palomo, G. Observer for Nonlinear Systems with Time-Varying Delays: Application to a Two-Degrees-of-Freedom Helicopter. Aerospace 2024, 11, 206. [Google Scholar] [CrossRef]
  3. Marantos, P.; Bechlioulis, C.P.; Kyriakopoulos, K.J. Robust trajectory tracking control for small-scale unmanned helicopters with model uncertainties. IEEE Trans. Control Syst. Technol. 2017, 25, 2010–2021. [Google Scholar] [CrossRef]
  4. Jiang, T.; Lin, D.; Song, T. Finite-time control for small-scale unmanned helicopter with disturbances. Nonlinear Dyn. 2019, 96, 1747–1763. [Google Scholar] [CrossRef]
  5. Xian, B.; Zhang, X.; Zhang, H.; Gu, X. Robust adaptive control for a small unmanned helicopter using reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 7589–7597. [Google Scholar] [CrossRef]
  6. Kuo, C.W.; Tsai, C.C.; Lee, C.T. Intelligent leader-following consensus formation control using recurrent neural networks for small-size unmanned helicopters. IEEE Trans. Syst. Man Cybern. 2019, 51, 1288–1301. [Google Scholar] [CrossRef]
  7. Wang, D.; Zong, Q.; Tian, B.; Shao, S.; Zhang, X.; Zhao, X. Neural network disturbance observer-based distributed finite-time formation tracking control for multiple unmanned helicopters. ISA Trans. 2018, 73, 208–226. [Google Scholar] [CrossRef]
  8. Wang, J.; Wang, D.; Wang, C.; Deng, F. Robust formation control for unmanned helicopters with collision avoidance. J. Frankl. Inst. 2020, 357, 11997–12018. [Google Scholar] [CrossRef]
  9. Zhao, Z.; Zhang, J.; Chen, S.; He, W.; Hong, K.S. Neural-network-based adaptive finite-time control for a two-degree-of-freedom helicopter system with an event-triggering mechanism. IEEE/CAA J. Autom. Sin. 2023, 10, 1754–1765. [Google Scholar] [CrossRef]
  10. Chen, M.; Shi, P.; Lim, C.-C. Adaptive neural fault-tolerant control of a 3-DOF model helicopter system. IEEE Trans. Syst. Man Cybern. Syst. 2015, 46, 260–270. [Google Scholar] [CrossRef]
  11. Liu, Q.; Zhang, K.; Jiang, B.; Xu, J. Prescribed-time fault-tolerant formation control for collision-free unmanned helicopters: A high-order fully actuated system approach. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 4715–4727. [Google Scholar] [CrossRef]
  12. Wang, Y.; Rotondo, D.; Puig, V.; Cembrano, G. Fault-tolerant control based on virtual actuator and sensor for discrete-time descriptor systems. IEEE Trans. Circuits Syst. I Regul. Pap. 2020, 67, 5316–5325. [Google Scholar] [CrossRef]
  13. Wang, B.; Zhang, Y. An adaptive fault-tolerant sliding mode control allocation scheme for multirotor helicopter subject to simultaneous actuator faults. IEEE Trans. Ind. Electron. 2017, 65, 4227–4236. [Google Scholar] [CrossRef]
  14. Yang, H.; Jiang, B.; Liu, H.H.; Yang, H.; Zhang, Q. Attitude synchronization for multiple 3-DOF helicopters with actuator faults. IEEE/ASME Trans. Mechatron. 2019, 24, 597–608. [Google Scholar] [CrossRef]
  15. Chen, M.; Yan, K.; Wu, Q. Multiapproximator-based fault-tolerant tracking control for unmanned autonomous helicopter with input saturation. IEEE Trans. Syst. Man Cybern. Syst. 2021, 52, 5710–5722. [Google Scholar] [CrossRef]
  16. Ai, S.; Song, J.; Cai, G.; Zhao, K. Active fault-tolerant control for quadrotor UAV against sensor fault diagnosed by the auto sequential random forest. Aerospace 2022, 9, 518. [Google Scholar] [CrossRef]
  17. Wang, B.; Shen, Y.; Zhang, Y. Active fault-tolerant control for a quadrotor helicopter against actuator faults and model uncertainties. Aerosp. Sci. Technol. 2020, 99, 105745. [Google Scholar] [CrossRef]
  18. Wang, X.; Tan, C.P. Output feedback active fault tolerant control for a 3-DOF laboratory helicopter with sensor fault. IEEE Trans. Autom. Sci. Eng. 2023, 21, 2689–2700. [Google Scholar] [CrossRef]
  19. Wang, X.; Wang, Y.; Zhang, Z.; Wang, X.; Patton, R. Sensor fault tolerant control for a 3-DOF helicopter considering detectability loss. IEEE Trans. Circuits Syst. I Regul. Pap. 2023, 70, 4112–4125. [Google Scholar] [CrossRef]
  20. Liu, C.; Jiang, B.; Zhang, K.; Ding, S.X. Hierarchical structure-based fault-tolerant tracking control of multiple 3-DOF laboratory helicopters. IEEE Trans. Syst. Man Cybern. Syst. 2021, 52, 4247–4258. [Google Scholar] [CrossRef]
  21. Li, K.; Li, Y. Fuzzy adaptive optimal consensus fault-tolerant control for stochastic nonlinear multiagent systems. IEEE Trans. Fuzzy Syst. 2021, 30, 2870–2885. [Google Scholar] [CrossRef]
  22. Liu, Q.; Zhang, K.; Jiang, B. Zero-sum differential game-based optimal fault-tolerant control for interconnected systems with actuator faults. IEEE Trans. Control Netw. Syst. 2023, 11, 1287–1299. [Google Scholar] [CrossRef]
  23. Li, R.; Yang, Z.; Yan, G.; Jian, L.; Li, G.; Li, Z. Robust approximate optimal trajectory tracking control for quadrotors. Aerospace 2024, 11, 149. [Google Scholar] [CrossRef]
  24. Lin, Z.; Liu, Z.; Zhang, Y.; Chen, C.P. Adaptive neural inverse optimal tracking control for uncertain multi-agent systems. Inf. Sci. 2022, 584, 31–49. [Google Scholar] [CrossRef]
  25. Liu, Y.; Li, Y. Application of inverse optimal formation control for Euler-Lagrange systems. IEEE Trans. Intell. Transp. Syst. 2023, 24, 5655–5662. [Google Scholar] [CrossRef]
  26. An, C.; Su, H.; Chen, S. Inverse-optimal consensus control of fractional-order multiagent systems. IEEE Trans. Syst. Man Cybern. Syst. 2021, 52, 5320–5331. [Google Scholar] [CrossRef]
  27. Yan, F.; Liu, X.; Feng, T. Distributed minimum-energy containment control of continuous-time multi-agent systems by inverse optimal control. IEEE/CAA J. Autom. Sin. 2024, 11, 1533–1535. [Google Scholar] [CrossRef]
  28. Chen, Z.; Yu, Z.; Li, S. Output feedback adaptive fuzzy inverse optimal security control against sensor and actuator attacks for nonlinear cyber-physical systems. IEEE Trans. Fuzzy Syst. 2024, 32, 2554–2566. [Google Scholar] [CrossRef]
  29. Lungu, M.; Dinu, D.-A.; Chen, M.; Flores, G. Inverse optimal control for autonomous carrier landing with disturbances. Aerosp. Sci. Technol. 2023, 139, 108382. [Google Scholar] [CrossRef]
  30. Fan, Z.; Adhikary, A.C.; Li, S.; Liu, R. Disturbance observer based inverse optimal control for a class of nonlinear systems. Neurocomputing 2022, 500, 821–831. [Google Scholar] [CrossRef]
  31. Ma, H.; Chen, M.; Feng, G.; Wu, Q. Disturbance-observer-based adaptive fuzzy tracking control for unmanned autonomous helicopter with flight boundary constraints. IEEE Trans. Fuzzy Syst. 2022, 31, 184–198. [Google Scholar] [CrossRef]
  32. Xu, J.; Cui, Y.; Xing, W.; Huang, F.; Du, X.; Yan, Z.; Wu, D. Distributed active disturbance rejection formation containment control for multiple autonomous underwater vehicles with prescribed performance. Ocean Eng. 2022, 259, 112057. [Google Scholar] [CrossRef]
  33. Yu, Z.; Qu, Y.; Zhang, Y. Distributed fault-tolerant cooperative control for multi-UAVs under actuator fault and input saturation. IEEE Trans. Control Syst. Technol. 2018, 27, 2417–2429. [Google Scholar] [CrossRef]
  34. Chen, L.; Liu, M.; Shi, Y.; Zhang, H.; Zhao, E. Adaptive fault estimation for unmanned surface vessels with a neural network observer approach. IEEE Trans. Circuits Syst. I Regul. Pap. 2020, 68, 416–425. [Google Scholar] [CrossRef]
  35. Campos-Martínez, S.-N.; Hernández-González, O.; Guerrero-Sánchez, M.-E.; Valencia-Palomo, G.; Targui, B.; López-Estrada, F.-R. Consensus tracking control of multiple unmanned aerial vehicles subject to distinct unknown delays. Machines 2024, 12, 337. [Google Scholar] [CrossRef]
Figure 1. The diagrammatic sketch of a single UH and the hierarchy structure of multiple UHs. (a) is the inertial reference frame and the body-fixed coordinate frame for a single UH. (b) is the formation-containment structure for multiple UHs.
Figure 2. The inverse optimal fault-tolerant formation-containment control framework for UHs.
Figure 3. The communication topology among the three leader UHs and two follower UHs.
Figure 4. Tracking trajectories of the position variables $x_\iota$, $y_\iota$, $z_\iota$ and the yaw angle $\psi_\iota$. (a–c) are the lateral axis $x_\iota$, longitudinal axis $y_\iota$, and vertical axis $z_\iota$; (d) is the yaw angle $\psi_\iota$.
Figure 5. Formation-containment trajectories in three-dimensional space.
Figure 6. Formation-containment trajectories in the XY plane.
Figure 7. Estimation trajectories of the lumped faults $\Theta_{21}^n$ and $\Theta_{22}^n$ in UH 2.
Figure 8. Estimation trajectories of the lumped faults $\Theta_{31}^n$ and $\Theta_{32}^n$ in UH 3.
Figure 9. Estimation trajectories of the lumped faults $\Theta_{51}^n$ and $\Theta_{52}^n$ in UH 5.
Figure 10. Formation errors $e_{iP}^n$ and $e_{i\Phi}^n$, and containment errors $e_{kP}^n$ and $e_{k\Phi}^n$, under the proposed method.
Figure 11. Comparative results of formation errors $e_{iP}^n$ and $e_{i\Phi}^n$, and containment errors $e_{kP}^n$ and $e_{k\Phi}^n$, under Case I.
Figure 12. Comparative results of formation errors $e_{iP}^n$ and $e_{i\Phi}^n$, and containment errors $e_{kP}^n$ and $e_{k\Phi}^n$, under Case II.
Table 1. The physical parameters of each helicopter model.
Symbol | Value and Unit
$m_\iota$ | 8.2 kg
$g$ | 9.8 m/s$^2$
$Z_{\omega\iota}$ | $-0.7615$ s$^{-1}$
$Z_{co\iota}$ | $-131.4125$ m/(rad$\cdot$s$^2$)
$N_{co\iota}$ | $-0.3705$ s$^{-2}$
$J_\iota$ | diag$\{0.18, 0.34, 0.28\}$ kg$\cdot$m$^2$
$A_\iota$ | diag$\{48.1757, 25.5048, 0.9808\}$ s$^{-1}$
$B_\iota$ | $\begin{bmatrix} 0 & 1689.5 & 0 \\ 894.5 & 0 & 0 \\ 0 & 0 & 135.8 \end{bmatrix}$ s$^{-2}$
