1. Introduction
Unmanned helicopters (UHs) have been extensively applied in various surveillance and inspection tasks due to their exceptional abilities in hovering, vertical take-off and landing, and in accessing constrained environments where other aerial vehicles may be unable to operate. Nevertheless, UHs require complex, multivariable, nonlinear, and under-actuated systems, often characterized by strong coupling effects and uncertain or unidentifiable aerodynamic parameters [
1,
2]. To address these inherent limitations, control designs for UHs typically adopt a cascaded structure, consisting of an outer-loop subsystem and an inner-loop subsystem (i.e., translational and rotational). The authors of [
3,
4,
5] discussed the trajectory tracking control methods for a single UH with unknown uncertainties and external disturbances. As mission complexity and environmental challenges escalate, employing helicopter formations rather than individual units has become crucial for meeting operational demands and enhancing overall system performance. The authors of [
6] studied the intelligent formation control problem by utilizing recurrent neural networks, and the authors of [
7,
8,
9] provided fast formation tracking control protocols for multiple UHs to improve convergence rate/time.
Compared to fixed-wing aircraft, helicopters are more prone to mechanical faults and operational challenges due to their unique structural characteristics, dynamic flight mechanics, and complex operational conditions. If a fault (e.g., performance degradation) occurring in an actuator, sensor, or other component is not detected and addressed, the helicopter may fail to perform tasks. To ensure that system performance remains within acceptable bounds when a fault occurs, the fault-tolerant control (FTC) scheme must be implemented to settle the presumed faults [
10,
11]. Depending on whether to address the faults proactively, the existing FTC methods can be generally categorized into passive FTC (PFTC) and active FTC (AFTC) [
12]. PFTC does not modify the system structure and control unit. It only employs a resilient controller against the expected and considered fault categories without requiring accurate fault information [
13,
14,
15]. However, this approach may have inherent limitations in fault accommodation, potentially compromising system performance. An AFTC is developed by reconstructing the related controller, or selecting a pre-devised controller through a fault diagnosis technique that can detect and identify faults [
16,
17]. In this manner, the authors of [
18,
19] developed an output feedback-based AFTC scheme for a single three-degree-of-freedom (3-DOF) helicopter suffering from angular position sensor faults. Next, the authors of [
20] developed a decentralized fault estimation and a distributed fault-tolerant control hierarchy for a number of 3-DOF helicopters with nonlinearities, uncertainties, and both actuator and sensor faults. As the autonomy of the helicopter continues to improve, designing intelligent AFTC schemes to compensate for potential faults remains a crucial area for further investigation, serving as the primary motivation for this article.
It should be pointed out that the results above only require the helicopters to follow a desired trajectory under fault conditions, without imposing additional performance constraints such as minimizing tracking time, tracking distance, or energy consumption. Therefore, optimal control problems have received considerable attention in practice, which often need either the Hamilton–Jacobi–Bellman (HJB) or Hamilton–Jacobi–Issacs (HJI) equation to obtain an analytical solution [
21,
22,
23]. Unlike the conventional optimal control method that initially defines the cost performance function and then minimizes it to design an optimized controller, the inverse optimal control method takes an alternative way. It starts by designing a stabilizing controller and then identifies a corresponding cost function for which the controller is optimal. Also, it no longer focuses on the solution of the HJB or HJI equation but searches inversely for a certain undetermined objective function. This capability makes the inverse optimal control an essential tool in improving system performance and achieving optimization without solving partial differential equations. The authors of [
24,
25,
26,
27] investigated the inverse optimal consensus, formation, and containment control problem for multi-agent systems. The authors of [
28] devised a fuzzy-based inverse optimal output feedback strategy to resist actuator and sensor network attacks. The authors of [
29,
30,
31] later incorporated disturbance estimation into an inverse optimal control. From the perspective of the remarkable ability, it is also necessary for UHs to equip the inverse optimal control with an FTC, which is the main motivation for investigating this problem in the subsequent sections.
Enlightened by the discussions above, the inverse optimal fault-tolerant formation-containment control strategy is proposed for multi-UHs with actuator faults. The primary innovations and contributions are outlined below:
- (1)
- Different from the previous control schemes in [ 6- , 7- , 8- , 9- ], multiple leaders and multiple followers exist in the formation system, and the leaders collaborate with each other to form a desired formation. Each UH modeling is equipped with nonlinearities and faults (simultaneous partial loss of effectiveness and bias), which are separated into outer-loop (position) and inner-loop (attitude) subsystems based on multiple-time-scale features. 
- (2)
- The serial-parallel estimation model is constructed with intelligent approximation techniques. Accordingly, the composite learning (CL)-based updating law is designed for both the network weight and the fault estimation observer. 
- (3)
- The estimated fault information is subsequently incorporated into the inverse optimal FTC to develop the controller that ensures both optimality and effective fault compensation. Owing to this, the formation of the leaders and reference tracking are obtained. Further, the followers are not only inside the convex hull established by leaders but maintain a specific formation based on the combination of the leaders. 
The remaining sections of this article are structured as follows. 
Section 2 gives the problem formulation and related lemmas, definitions, and assumptions. 
Section 3 and 
Section 4 separately design the CL-based inverse optimal fault-tolerant formation and containment controller for leaders and followers. Finally, 
Section 5 presents the simulation flight results of the hierarchy-structured UHs, and 
Section 6 concludes this article.
  2. The UH Flight Dynamics Model and Problem Formulation
Each UH can be recognized as a rigid body in executing formation flight tasks. Let 
 be the inertial reference frame positioned at the take-off location, and 
 be the body-fixed coordinate frame located at the center of gravity of the helicopter. 
Figure 1a presents the spatial configuration of the inertial and body-fixed frames. As depicted in 
Figure 1b, a flying system with 
 helicopters (including 
N leaders and 
M followers) is considered, where 
 is the set of serial numbers for the helicopters, and 
 and 
 are the sets of serial numbers for the leaders and followers, respectively. Also, the virtual leader is labeled as 0. The 6-DOF rigid body model of the 
th UH is modeled as a position subsystem and an attitude subsystem as follows:
      where 
 and 
 are the position and the velocity vectors of the inertial reference frame 
, respectively. 
 and 
 denote the Euler angle and angular rate vectors of the body-fixed coordinate frame 
, respectively. 
 is the unitary vector. 
g indicates the gravity acceleration, 
 denotes the mass of UH, and 
 represents the diagonal inertia matrix. Further, the rotation matrix 
, the anti-symmetric matrix 
, and the rotational kinematic matrix 
 are defined below:
In translational dynamics,  implies the force vector, with  as the main rotor thrust controlled by the collective pitch .  and  denote the constants associated with the main rotor rotation speed and the collective pitch , respectively.
For the rotational dynamics, 
 is the moment vector applied to a helicopter, which is formed by
      where 
, 
 and 
, 
 are the lateral and longitudinal wave motion coefficients of the main rotor. 
 denotes the damping coefficient of the yaw angle. 
 and 
 are damping coefficients related to the tail rotor and main rotor (e.g., rotation speed and blade radius). 
 is the collective pitch of the tail rotor. 
 and 
 denote the flapping angles of the main rotor along with the longitudinal and lateral axes, which are mainly controlled by longitudinal and lateral cyclic 
 and 
. Their relationships can be approximated as
Therefore, combining Equations (3) and (4), a modified moment vector can be rewritten as
      where 
. 
 denotes the time constant of the main rotor. The parameters in the above matrices are 
, 
, 
, and 
, where the coefficients 
, 
 and 
, 
 can be obtained by the system model identification. The matrices 
 and 
 are given as
Remark 1.  Note that the representations of force vector , moment vector , and blade flapping angles (both longitudinal  and lateral ) are simplified in the above-mentioned model. The force and moment vectors generated by the main rotor, tail rotor, and fuselage can be highly complex due to unsteady aerodynamics and dynamic interactions. However, under steady or quasi-steady flight conditions (such as hover or low-speed cruise), these effects can be approximated using linearized or lumped-parameter models, significantly reducing computational complexity. Similarly, the blade flapping dynamics can be simplified under the small-angle assumption. In many control-oriented models, the flapping angles are approximated using first-order dynamics or even static gain relationships, assuming a linear response to cyclic pitch. These simplifications are reasonable in flight regimes, where the flapping angles remain small and high-frequency dynamics are negligible. Such simplifications keep a balance between model fidelity and computational efficiency, making them suitable for control design.
 Remark 2.  In this article, the virtual leader serves as a reference point that defines the desired trajectory and formation pattern for the actual leaders to follow and maintain. Although it does not physically exist, the virtual leader provides position or yaw angle information that coordinates the motions in the flying system. By tracking the virtual leader, each helicopter can maintain proper spacing and movement direction, thereby ensuring the overall geometric structure.
 Let 
, 
, 
 and 
 be the actual input signal, which is combined with the collective pitch of the main and tail rotor, and the lateral and longitudinal cyclic. Each component plays a critical role in ensuring the helicopter’s stability and maneuverability. Due to the hydraulic or electrical malfunctions, and some environmental factors like extreme temperatures or corrosion, these control inputs may suffer from various types of faults. Among the most common are loss of effectiveness and bias faults, which allow the system to continue operating but with reduced performance. Such faults can result in sluggish or unresponsive control behavior, ultimately leading to degraded or even lost of control authority. In this article, the fault models are formulated as
      where 
, 
, and 
 denote the partial loss of actuator effectiveness and the additional bias faults, respectively.
Subsequently, the flight dynamics (
1) and (
2) of the 
th UH are separated into outer-loop (position) and inner-loop (attitude) subsystems for the control design. Since the design processes of three directions are similar, define 
 and 
, and the outer-loop (position) subsystem can be rewritten as
      where 
, 
 and 
 are nonlinear functions. 
 and 
 imply the input signal to be designed and the lumped fault to be estimated, which are constructed as
      and
Similarly, we denote 
 and 
 to reformulate the inner-loop (attitude) subsystem as
      where 
, 
, 
, 
, 
, 
, 
, 
, and 
 are nonlinear functions. 
 and 
 denote the input signal and the lumped fault as follows:
      and
Remark 3.  It should be noted that the direct control inputs of the helicopter are , , , and . These inputs determine the control effectiveness in all flight directions and can be computed through the transformation relationships defined in (8) and (11). The above-mentioned (6) satisfies the following four cases: (1) When  and , this indicates the fault-free (normal) case. (2) When  and , it implies the partial loss of actuator effectiveness only. (3) When  and , it indicates the bias fault only. (4) When  and , it reveals both the partial loss of actuator effectiveness and bias faults. Note that when  and , this refers to the stuck fault, i.e.,  sticks at a bounded function . However, this case cannot be compensated for by designing the AFTC scheme; thus, we only consider the other four cases in the subsequent sections.  Before presenting the formation-containment control scheme, we make the following fundamental graph theory and some related lemmas, definitions, and assumptions.
The communication network among the 
 helicopters is represented by a directed graph 
, where 
 is the node set and 
 is the edge set. The vertex sets of the leaders and the followers are defined as 
 and 
, respectively. Consequently, we have 
 and 
. The edge pair 
 means that the 
ith UH can receive data from the 
jth UH. The adjacent matrix 
 is a position matrix that can be defined as 
 when 
, otherwise 
. Denote the Laplacian matrix as 
, where 
, 
, and 
. 
 represents the degree matrix, 
. Here, the Laplacian matrix is described as
      where 
 signifies the communication topology among the leader UHs, 
 denotes the communication interaction from the leaders to the followers, and 
 implies the communication topology among the follower UHs.
In addition, , with  being the connections between the ith leader UH and the virtual leader, where  means that it can receive data from the virtual leader, or otherwise, .
Assumption 1  ([
32]). 
As for the leader UHs, the virtual leader has at least one path to each leader UH. Also, there exists at least one path from the leader UHs to each follower UH. Lemma 1  ([
32]). 
All the eigenvalues of matrix  have positive real parts. Each entry of  is positive and each row sum of  is equal to 1. Definition 1  ([
32]). 
(Convex Hull) Suppose that  is a subset in the real vector space . The set  is said to be convex if the point  holds for any  and arbitrary two elements α, β. The convex hull regarding a group of points  in Z is the minimal convex set consisting of all points in X, and then, it is represented as . In this article, the control objective is to ensure the desired flight performance of 
 UHs. As shown in 
Figure 2, a two-layer structure framework is employed to settle the inverse optimal fault-tolerant formation-containment control problem: one is the formation layer of 
N leaders, and another is the containment layer of 
M followers. Specifically, the leaders in the first layer can simultaneously take action and form a desired reference formation pattern, while the followers in the second layer are contained in a convex hull constructed by the leaders. Thus, the inverse optimal fault-tolerant control of the hierarchy-structured UHs is completed if and only if both two layers have achieved their own control objectives. That is, for 
, there exists
      and for 
, 
, there exists
      where 
 is the position of the virtual leader. 
 is the relative position of the desired configuration between the 
ith leader UH and the virtual leader.  
Remark 4.  Owing to the under-actuated mechanics of the ιth UH, the proposed control algorithm is separated into two cascaded control modules. The outer-loop (position) controller calculates the input signal  and appropriate attitude commands  and  to navigate the helicopter along the target trajectories. It can be calculated that  and  areand  with  denotes the yaw angle of the virtual leader. Then, the command  is delivered to the inner-loop (attitude) controller that calculates .  Assumption 2  ([
33]). 
The position  and the yaw angle reference  from the virtual leader and their derivatives are bounded and continuous functions. Assumption 3  ([
7]). 
The Euler angle vectors (i.e., roll angle , pitch angle , and yaw angle ) are bounded and satisfy ,  and  for all t. Lemma 2  ([
30]). 
(Inverse Optimal Control) Consider a nonlinear system as follows:where  and  denote the state and control input vectors, while  and  are the smooth functions. If there exist a positive-definite control Lyapunov function , and a control strategy  stabilizing the whole system (17), then we have , where  is the inverse optimal control input for the system (17), and the minimized performance index function iswhere  with  and . Lemma 3  ([
7]). 
(Neural Network Approximation) For an unknown nonlinear function , there is a neural network  satisfying the following condition:where  represents the ideal neural network weight matrix,  denotes the activation function, and  depicts the approximation error  bounded by . In this article,  is assumed as the Gaussian function, i.e., , where  represents the width, and , with μ as the node number.   4. CL-Based Inverse Optimal Fault-Tolerant Containment Controller
As for the second layer, the optimal position containment controller and optimal attitude tracking controller are separately designed for each follower UH in this subsection. Based on this, the followers will approach the convex region constructed by the leaders.
Firstly, the containment error 
 and the tracking error 
 of the 
kth helicopter are defined as
      and
      where 
k, 
, 
, 
, and 
 are the connective relation between the 
kth follower and the 
lth follower or the 
jth leader. 
. 
 and 
 denote the virtual controller to be designed.
Next, the entirety of the CL-based inverse optimal control schemes with fault compensation design for the follower UHs are presented in the subsequent steps.
For the outer-loop (position) subsystem, differentiating 
 yields
      where 
. Furthermore, we design the following feedforward virtual controller 
 for target enclosing:
      where 
 is a positive constant and has 
.
Similarly, the CL-based fault estimation observer 
 with the adaptive learning law of network weight 
 is constructed as
      where 
, 
, and 
 are positive constants. 
 is the prediction error with 
, with 
 as a positive constant. 
 and 
 are the estimation errors of network weight and lumped faults.
Then, the derivative of 
 is rewritten as
      where 
.
Similarly, for the inner-loop (attitude) subsystem, the time-derivation of 
 is calculated as 
, and then, the virtual controller is
      where 
 is a given positive constant, and the neural network weight 
 is formed by
      where 
 and 
 are positive constants. 
 denotes the prediction error with 
, with 
 as a positive constant.
Combining (71) and (70), the time-derivative of 
 has the form
      where 
 is the approximation error of the network weight.
By differentiating 
 with respect to the time, we obtain 
; then, the CL-based fault estimation observer with the network weight training law is formulated as
      where 
, 
 and 
 denote the positive constants. 
 is the prediction error with 
. Also, 
 and 
 are the approximation errors.
Then, we have
      where 
.
Theorem 3.  Under the designed virtual controllers (66) and (70), the fault estimation observers (67) and (73) with the updating laws of network weights (68), (71), and (74), the inverse optimal fault-tolerant controllers  and  are designed with the positive constants  and :which can make the tracking errors , , , and , as well as the errors , , , , , , , and  approach a small region of the origin and achieve the minimum cost functions as follows:where , . , and .  Proof of Theorem 2.  For the outer-loop (position) subsystem, select the Lyapunov function as follows:
By employing the aforementioned procedures in (
32)–(
39), we can know that for a given 
, 
 becomes
        where 
. 
, 
, and 
 are positive constants satisfying 
 and 
. Note that if the conditions 
, 
 and 
 hold, it follows that the controller 
 can ensure that the following condition holds:
        where 
 and 
. From Lemma 2, when 
, the CL-based inverse optimal controller 
 can minimize the cost function (
78). Moreover, the tracking errors 
 and 
 along with the estimation errors 
, 
, and 
 will converge to a small region around the origin. Then, the inverse optimal fault-tolerant controller is designed as (
76).
For the inner-loop (attitude) subsystem, the Lyapunov functional is chosen as
        and we can know that, if 
, we have
        where 
. 
, 
, and 
 are positive constants satisfying 
 and 
. This means that if the inequalities 
, 
, 
, 
, and 
 hold, the controller 
 can stabilize the whole subsystem such that
        where 
, 
 and 
. Moreover, the tracking errors 
 and 
 alongside the errors 
, 
, 
, 
, and 
 approach a small neighborhood of the origin.
The containment tracking objective is illustrated as follows. From (
15), the containment error is described as 
, where 
, 
. In terms of the condition (
63), it has 
, and the following equation can be derived:
        where 
, which can be written as 
, i.e., the containment objective is achieved accordingly.    □
 Remark 6.  For the flying system containing N leaders and M followers, if we choose the Lyapunov function as , it derives that  with  and , and the inverse optimal fault-tolerant controller , , and ,  can minimize the cost function .
 Remark 7.  Among the above-mentioned parameters, larger values of , , , , , , and  along with smaller values of , , , , , , and  can be chosen such that , , , , , ,  and  hold, keeping a balance between the faster control convergence time/rate and the input characteristic.
   5. Simulation Results
The flight performance of the inverse optimal fault-tolerant formation-containment control protocol was examined using a numerical example. A UH system composed of three leaders and two followers were considered. This leader-following platoon was supposed to accomplish a fault reconfiguration mission, changing from arbitrary positions in the air to an assigned pattern. The related physical parameters of each UH are given in 
Table 1.
The control objective of this theoretical case analysis is to enable three leaders (labeled as 1, 2, and 3) to maintain a triangle formation and encircle around a particular target or reference provided by a virtual leader. Afterward, the position trajectories of two followers (labeled as 4 and 5) need to be entered into the convex hull spanned by the leaders. The information transmission topology among members in the helicopter system is exhibited in 
Figure 3. The reference trajectory of the virtual leader can be expressed as 
, 
, 
, and 
, and a desired formation pattern of leaders is set with the relative distances as 
, 
, 
, 
, 
, 
 and 
 with 
.
The other related simulation parameters are given as 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, and 
. The faults are assumed to occur in leaders 2 and 3 and follower 5, while the others operate under normal conditions. The detailed fault parameters are set as follows: 
Under the initial conditions of 
, 
, 
, 
, and 
, the simulation results are provided in 
Figure 4, 
Figure 5, 
Figure 6, 
Figure 7, 
Figure 8 and 
Figure 9. 
Figure 4a–d show the moving trajectories (i.e., 
, 
, 
, and 
) of the five UHs. 
Figure 5 depicts the formation-containment performance in the 3-orthogonal plane. 
Figure 6 depicts the formation-containment performance in the XY plane. From this, it can be concluded that the designed protocol can simultaneously make the multi-UHs reach and preserve the target geometric pattern from any position and track the global reference signal by a virtual leader with a specific position. In 
Figure 7, 
Figure 8 and 
Figure 9, it can be observed that the proposed fault estimation observer will approximate the lumped actuator faults 
, 
, 
, 
, 
, and 
 well even though the faults occur. Through the above analysis of these helicopter formations and containment statuses, we can conclude that the system realizes the expected formation-containment flight performance under the designed inverse optimal fault-tolerant control protocol, and the fault effects can be compensated well based on the relevant estimations.
Further, to validate the advantages of the proposed inverse optimal fault-tolerant formation-containment control strategy, comparative results were conducted under the same initial conditions and parameters. Specifically, two schemes were considered: one using a non-optimal control law (Case I, i.e., 
 and 
), and the other omitting the serial-parallel estimation model (Case II, i.e., 
, 
). The corresponding formation error 
 and containment error 
 are presented in 
Figure 10, 
Figure 11 and 
Figure 12, where 
Figure 10 presents the performance of the proposed inverse optimal fault-tolerant formation-containment control strategy, 
Figure 11 displays the error trajectories under Case I while 
Figure 12 shows its performance. As illustrated, the proposed control protocol improves the tracking accuracy in both formation and containment tasks, compared with Case I and Case II. These improvements highlight the optimality and accurate estimation in enhancing system performance.
To quantitatively evaluate control performance, two reliability indices are introduced: the comprehensive performance index (CPI), defined as , and the mean-squared error (MSE), given by . The calculation results of these reliability indices also illustrate the effectiveness and the superiority of our proposed control algorithms on formation-containment flight.