Abstract
In this paper, the problem of pitching attitude finite-horizon optimization for aircraft is posed with system uncertainties, external disturbances, and input constraints. First, a neural network (NN) and a nonlinear disturbance observer (NDO) are employed to estimate the value of system uncertainties and external disturbances. Taking input constraints into account, an auxiliary system is designed to compensate for the constrained input. Subsequently, the backstepping control containing NN and NDO is used to ensure the stability of systems and suppress the adverse effects caused by the system uncertainties and external disturbances. In order to avoid the derivation operation in the process of backstepping, a dynamic surface control (DSC) technique is utilized. Simultaneously, the estimations of the NN and NDO are applied to derive the backstepping control law. For the purpose of achieving finite-horizon optimization for pitching attitude control, an adaptive method termed adaptive dynamic programming (ADP) with a single NN-termed critic is applied to obtain the optimal control. Time-varying feature functions are applied to construct the critic NN in order to approximate the value function in the Hamilton–Jacobi–Bellman (HJB) equation. Furthermore, a supplementary term is added to the weight update law to minimize the terminal constraint. Lyapunov stability theory is used to prove that the signals in the control system are uniformly ultimately bounded (UUB). Finally, simulation results illustrate the effectiveness of the proposed finite-horizon optimal attitude control method.
1. Introduction
In flight control, the problem of finite-horizon optimization for pitching attitude tracking control of aircraft can be treated as the pitching attitude tracking the command signal with the desired finite-horizon optimal index. To resolve the above problem, both the tracking control and finite-horizon optimization should be taken into consideration, which makes the overall problem significantly difficult.
In order to achieve attitude tracking for aircraft, much research has been conducted. In [1], an adaptive second order sliding model control method was presented in order to improve the performance in the presence of external disturbances. In [2], a gain-scheduling control method was proposed for linear parameter-varying systems with multi-input multi-output in order to obtain a satisfactory performance in the bank-to-turn control of aircraft. In [3], nonlinear dynamic inversion technology was proposed for the design of a supermaneuverable aircraft control. In [4], backstepping control was employed to execute the attitude tracking control of a mini unmanned aerial vehicle.
Among the control methods above, the backstepping control scheme is widely used owing to some advantages. First, virtual control is designed separately for each subsystem in the process of backstepping control to reduce the complexity of high-order system control. Subsequently, the backstepping control can be combined with other control methods such as sliding mode control, NN, adaptive control method, and disturbance observer to improve the control performance. In [5], a robust backstepping control scheme combining sliding mode control and neural network (NN) was proposed to achieve the reentry attitude tracking control of a near-space hypersonic vehicle in the presence of parameter variations and external disturbances. In [6], a deep convolutional NN-based backstepping method was used to identify system uncertainties and hidden states in attitude control in order to enhance the robustness. In [7], an auxiliary system-based backstepping control was constructed for the aircraft subject to the input saturation problem caused by wing rock. In [8], a finite-time convergence backstepping control scheme was designed. In the scheme, a finite-time observer and finite-time auxiliary system were used to suppress the effects of unsteady aerodynamic disturbances and compensate for the effect of input saturation, respectively. However, the aforementioned attitude control methods do not take into account the optimal control that meets some desired index. Actually, the optimal control problem is intractable, especially for a nonlinear system. Hence, there is relatively little research on the attitude control of aircraft in an optimal way, which is a nonlinear optimization problem in nature, not to mention the finite-horizon optimization for attitude tracking control. Thus, how to control the aircraft attitude in an optimal way should be further studied.
Quadratic optimal control is an optimal control method applied earlier in flight control. A given quadratic index is used to control the system with a desired optimal performance. In [9], a nonlinear system was divided into two parts of a linear nominal system and compound disturbances. Then, a linear quadratic regulator was designed to control the linear nominal system, while a robust control was derived to compensate for the effects caused by compound disturbances. In order to cope with the problem of recovering open-loop singular values in the quadratic optimal control, the LQG/LTR technique was applied for a multivariable vertical short take-off and landing aircraft linear system in [10]. However, the above control methods can only be applied to linear systems. For a nonlinear system, Hamilton–Jacobi–Bellman (HJB) equations without analytical solutions need to be solved, which makes it intractable to execute optimal control.
To cope with the problem of solving HJB equations for a nonlinear system, some numerical methods were applied to approximate the solution. In [11], a dynamic programming algorithm was presented, which is supposed to be solved in an off-line manner. In [12], a recursive optimization approach was proposed for a nonlinear system. In [13], a state-dependent Riccati equation (SDRE) method, which used a parameterization technique to convert the nonlinear system into a linear structure with state-dependent coefficients, was proposed to deal with the problem of nonlinear optimization. Nevertheless, a heavy computational burden is the main obstacle to applying the above three methods to nonlinear optimization. Inspired by the dynamic programming algorithm, the ADP algorithm was proposed in [14]. Compared with the dynamic programming algorithm, a critic NN was constructed to approximate the value function in order to solve the HJB equation forward-in-time in the ADP algorithm. Thus, the heavy computational burden was avoided, and on-line optimization was achieved.
The ADP algorithm, characterized by strong abilities of self-learning and adaptivity, has received significantly increased attention and has become an important intelligent optimal control method for nonlinear systems [15]. Due to its advantage of a low calculation cost, ADP has been applied in flight control. In [16], an adaptive critic design (ACD)-based optimal control algorithm was proposed. Under the premise of ensuring system stability, the ACD algorithm was utilized to improve the control performance of the system. In [17], a constrained ADP approach and linear parameter-varying technique were employed to guarantee the closed-loop stability and excellent control performance of the flight with aerodynamic parameter uncertainties and actuator failures. In [18], an incremental ADP algorithm was proposed to control the attitude tracking of spacecraft. In [19], an integral sliding-mode control based adaptive actor–critic algorithm was developed to guarantee the optimal control for sliding-mode dynamics online. As discussed above, the backstepping control method is favored by researchers due to its advantages. Because of its feature of easy combination with other control methods, the backstepping-based ADP scheme has been applied in many works. In [20], a backstepping-based ADP algorithm was developed to solve the problem of missile-target interception with state and input constraints. In the scheme of backstepping, a barrier Lyapunov function was used in the virtual controller design process for each subsystem to guarantee the state constraints, and an auxiliary system was designed to compensate for the constrained input. In [21], a backstepping-based ADP algorithm with zero-sum differential game method was applied to the zero-sum game problem for a missile and target. The zero-sum differential game technique was applied in the scheme of ADP algorithm in order to control the missile and target in an optimal way, and a critic network was constructed to approximate the value function in Hamilton–Jacobi–Isaacs (HJI) in order to achieve optimization online. In [22], an NN-based optimal control scheme was proposed for the near-space vehicle attitude tracking control. In the scheme, the NN and NDO were designed in a backstepping scheme to approximate the system uncertainties and external disturbances, while the critic network was constructed to approximate the value function in the HJB equation. However, it should be noted that the developments of the ADP algorithm above mainly address only the problem of infinite-horizon optimization. In fact, it is required to control in a finite-horizon optimal way for many systems, especially for flight systems.
Compared with infinite-horizon optimal control, finite-horizon optimal control is considered to be more challenging. First, the value function of the finite-horizon optimal control system is time-to-go-dependent, which leads to a time-varying associated HJB equation. Hence, it is more difficult to solve the HJB equation. Second, the terminal constraint should be taken into account for infinite-horizon optimization [23]. For the purpose of addressing the issues above, some research has been conducted. In [24], time-dependent weights and state-dependent feature functions were incorporated to construct an NN in order to approximate the time-to-go-dependent value function, and the least-square-based gradient descent method was utilized to update the weights off-line. In contrast, an NN consisting of constant weights and time-state-dependent feature functions was designed to achieve the approximation of the value function in the HJB function online in [25]. Nevertheless, the constrained input and system uncertainties were not taken into account. Considering input constraints, a non-quadratic function was utilized to eliminate the input constraints in [26]. Regarding system uncertainties, an online NN identifier was designed to approximate system uncertainties, and an actor–critic algorithm was introduced to solve the HJB equation to guarantee that the system was controlled with the optimal index in finite-horizon in [27]. Unfortunately, most of the research considered discrete systems. Furthermore, the constrained input, system uncertainties, and external disturbances are not considered in the finite-horizon optimal control together, which limits its application in flight control.
In order to address the problem of finite-horizon optimization pitching attitude tracking with system uncertainties, external disturbances, and input constraints, a novel backstepping-based finite-horizon optimization is developed in this work. The backstepping scheme, in which NN and NDO are employed to estimate the value of system uncertainties and external disturbances and an auxiliary system is designed to compensate for the constrainted input, is introduced to ensure the stability of systems. The ADP algorithm with a critic NN that consists of constant weights and time-state-dependent feature functions is employed to obtain finite-horizon optimal control. A novel updating law of the critic NN weights is derived to solve the HJB equation and minimize the terminal constraints. Furthermore, the Lyapunov stability method is applied to prove that the signals in the control system are UUB. Finally, simulation results illustrate the effectiveness of the proposed control scheme.
The main contributions of this paper include the following:
- (1)
- A backstepping-based ADP scheme is used to achieve finite-horizon optimal control. In the backstepping control scheme, NN is applied to approximate system uncertainties, while NDO is employed to estimate external disturbances. The ADP is used to control the nominal system in a finite-horizon optimal manner. Due to the integration of the backstepping method and the advantages of ADP, the backstepping-based finite-horizon optimization ADP scheme is promising for pitching attitude tracking control.
- (2)
- A novel updating law of the critic NN weights is derived in order to satisfy the terminal constraints, relaxing the requirement of an initial admissible control and guarantee the stability of system.
The rest of the paper is organized as follows. Section 2 formally states the preliminaries of the research object of this paper. The desigsn for backstepping control and finite-horizon optimal control are given in Section 3 and Section 4, respectively. Then, the stability analysis is developed in Section 5. The simulation results are presented in Section 6. Finally, the conclusions of the paper are given in the last section.
Notations. Throughout the paper, stands for the set of all real matrices. stands for the gradient of f with respect to x such as . stands for the sign function.
2. Problem Descriptions and Preliminaries
Before giving the attitude dynamics of an aircraft, a diagram illustrating the model parameters of the aircraft is given in Table 1. Regardless of the unsteady aerodynamics, the longitudinal attitude dynamics of an aircraft with system uncertainties and external disturbances can be described as follows [7].
where and are the known internal system dynamics. is the known control coefficient. is the unconstrainted control to be designed. and are the unknown system uncertainties. and are the unknown external disturbances. , , , are given by
where is a known boundary of u.
Table 1.
The illustration of aircraft model parameters.
In this paper, the control objective is to design a controller u, so that the angle of attack is driven to track a desired signal in a finite-horizon optimal manner, and all the signals in the control system (1) and (2) are uniformly ultimately bounded (UUB). To illustrate the design of the proposed method, the control block diagram is shown in Figure 1. To deal with the external disturbances and the system uncertainties, the NDO and the NN are designed together with the backstepping control. To solve the problem of the input constraints, the auxiliary is used in the forward control, which transforms the longitudinal attitude dynamics of the aircraft into the nominal form. To carry out optimal control, the ADP-based finite-horizon optimal control method is designed with the critic NN.
Figure 1.
The control block diagram of the proposed method.
For the design of the controller hereinafter, the following assumption is required.
Assumption 1.
Remark 1.
Investigating the expression of shown as Equation (5), the boundeness of , , and T yield the boundness of , and the non-zero values of and T make invertible. Thus, Assumption 1 is reasonable.
3. Design for Backstepping Control
In this section, a backstepping method with NN and NDO is derived to design a forward controller. The NN is constructed to approximate the system uncertainties while the NDO is designed to approximate the external disturbances. Then, the system comprising (1) and (2) is transformed into a nominal system to be controlled in a finite-horizon optimal manner by ADP.
In order to obtain satisfying control performance, the negative effects caused by unknown system uncertainties must be eliminated. According to the NN theory, the system uncertainties and can be approximated as [22]
where , are the parameters to be designed. and are basis functions. and are the numbers of the basis functions of and , respectively. and are the desired weight vectors. and are the approximation errors of the NN.
Assumption 2
([20]). and are both bounded as , .
Invoking Equations (7) and (8) into (1) and (2) yields
where and are treated as compound disturbances [22].
For the purpose of compensating for input constraints, an auxiliary system is designed as
where . is an auxiliary control signal. is the parameter to be designed.
Assumption 3
([8]). Δ is bounded such as .
The error system is defined as [20,22]
where is the virtual control law to be designed.
For the purpose of achieving the backstepping control and finite-horizon optimal control, both of the virtual control and the unconstrained control u are divided into two parts as
where and are the virtual control input and unconstrained control input in the backstepping scheme, respectively, and and are the virtual control input and unconstrained control input in the finite-horizon optimal control scheme, respectively.
To estimate the compound disturbances, NDOs are designed as [22]
where and are the estimations of and , respectively. and are functions to be designed that satisfy and . and are the estimations of and , respectively.
We define the estimation errors as
Then, the backstepping control law can be designed as follows.
Step 1: Taking Equation (14) into account, the virtual control input is designed as
where is the parameter to be designed.
The weights vector is updated as
where is the positive definite symmetric matrice to be designed [22]. is the parameter to be designed.
In the normal backstepping scheme, it is inevitable to differentiate . Nevertheless, due to the unknown information in the partial derivative of , it is intractable to obtain the derivation. In order to avoid the derivation operation, a dynamic surface control (DSC) technique is applied as [28]
where τ is the parameter to be designed. λ is a first-order filter in nature to approximate such that can substitute for .
We define the error as
Then, we have
where . is the weight vector of critic NN designed hereinafter.
Assumption 4
([28]). is a continuous function. For any and , the sets and are compact in and , respectively. Hence, is also compact. Considering the continuous property, the function is bounded for the given initial conditions in the compact set such as .
We consider the Lyapunov function candidate as
In addition, we have
Taking Assumption 3.3 and Young’s inequality into account and invoking inequality (34) yields
where . is the parameter to be designed.
Step 2: Taking Equation (15) into account, the control input is designed as
where is the parameter to be designed.
The weights vector is updated as
where represents the positive definite symmetric matrices to be designed [22]. is the parameter to be designed.
We consider the Lyapunov function candidate as
In addition, we have
Taking Assumptions 3 and 4 and Young’s inequality into account and invoking inequality (41) yields
where . , are the parameters to be designed.
The nominal affine nonlinear system is defined as
where
We consider the Lyapunov function candidate in the backstepping scheme as
Remark 2.
Based on Assumption 1, the first derivatives of and are bounded such as , . In addition, considering the optimal approximation property of the NN, the desired weight vectors are bounded. Thus, Assumption 2 is reasonable. Furthermore, if the difference Δ between the desired control input and saturation input is unbounded, the desirable attitude motion will be uncontrollable. Thus, Assumption 3 is reasonable. In addition, in terms of the compact property and the continuous property, which have been detailed in [28], Assumption 4 is reasonable. Detailed derivations of some equations above is provided in Appendix A.
4. Design for Finite-Horizon Optimal Control
In this section, an ADP based finite-horizon optimal control method is designed to make the nominal system (43) controlled in a finite-horizon optimal manner. In order to approximate the value function in the HJB equation, an NN consisting of constant weights and a time-state-dependent feature function is constructed. A novel weight updating law is proposed in order to minimize the objective function, remove the requirement for the initial admissible control, and guarantee the Lyapunov stability.
The objective of the finite-horizon optimal control is to maximize the finite-horizon cost function defined as
where is the terminal constraint of the terminal state . is the cost-to-go function defined as
where , are symmetric positive matrices.
Similarly, the terminal cost function is defined as
Then, the optimal cost function satisfies the equation as [20]
According to Equation (54), the optimal control input meets the conditions as
Hence, the optimal control input can be obtained as
We rewrite the optimal cost function by NN as
where is the basis functions vector, L is the number of the basis functions. is the weights vector. is the approximate error.
Similarly, the terminal optimal cost function can be written as
The gradients of with respect to t and Z are
Assumption 5
([20,22,25,26,29,30]). , , , , , , and are all bounded such as , , , , , , .
Lemma 1
Assumption 6.
and are both bounded as and .
Since is unknown, a critic NN is constructed to approximate the optimal cost function as
where is the estimation of .
We define the estimation error
Invoking Equation (67), the gradients of the optimal cost function with respect to t and Z can be approximated as
Invoking Equations (56) and (70), the estimation of the optimal control input can be written as
where .
Similar to Equation (57), the estimation of the Hamiltonian function can be expressed as
Then, the optimal terminal cost function can be estimated as
where is the estimation of [25,26,32].
We define the terminal constraints estimation error as
Prior to designing the weight updating law for the critic NN, an assumption is given.
Assumption 7.
Considering system (43) with the optimal control input (56), we can always find a Lyapunov function that satisfies . Furthermore, there is always a positive function that satisfies the following inequality.
In order to minimize the total squared error (75), a novel weight updating law based on the gradient descent theory is developed as
where is the parameter to be designed. is designed in Remark 4. , are the vector and matrix to be designed, respectively. , , , are expressed as , , , , where and are written as
Φ is given as
Remark 3.
Considering the optimal approximation property of the NN, Assumption 5 is reasonable. In addition, the optimal control can be obtained when in Equation (66). Taking Lemma 1 into account, can stabilize the nominal system (43); that is, is bounded. Simultaneously, considering Assumptions 1 and 5, Assumption 6 is reasonable. Taking Remark 3 into consideration, the optimal control input can stabilize the nominal system (43). Hence, we can always find a Lyapunov function where the derivative of with respect to t is negative and bounded. In general, can be designed as . Thus, Assumption 7 is reasonable.
Remark 4.
According to the expression of , and , we have that , and are bounded as , , , respectively.
Remark 5.
The first and second terms in Equation (77) are employed to minimize the total squared error based on gradient descent theory. Moreover, the third term is used to enhance the stabilizing ability of the controller. In more detail, according to Equation (80), the third term disappears when , which can be treated as a stability characteristic of the system, while the third term is activated to reinforce the stability of the system when the stability characteristic is gone. Thus, the requirement of an initial admissible control is avoided. In addition, the fourth term is designed for the UUB stability of the system in the subsequent process of the proof.
5. Stability Analysis
In this section, Theorem 1 is proposed to analyze the stability of the closed system controlled by the backstepping and finite-horizon optimal control methods. Theorem 1 is given as follows.
Theorem 1.
For the system comprising (1) and (2) with associated finite-horizon cost function (50), the backstepping control inputs and finite-horizon optimal control inputs are designed as Equations (26), (36), and (71), respectively. The virtual control input and unconstrained input control input u are designed as Equations (16) and (17), respectively. The NN weights vectors tuning laws are given by Equations (27), (37), and (77). Then, the closed-loop system states errors , , the weights vectors estimation errors , , , the disturbance estimation errors , , the DSC system state error e, and the auxiliary system state error are UUB with appropriate designed parameters.
Proof.
We consider the following Lyapunov function as
where is defined as Equation (48).
Next, is derived as follows.
Invoking Equation (63) yields
Invoking Equation (87) yields
Invoking Equation (89) yields
In addition, we have
Let parameters , , , , , , , , , be chosen such that , and invoking Equation (98) yields
where stands for the minimum eigenvalue of .
Case 1: If , then ; that is, satisfies that . Thus, we have
Taking Assumptions 1 and 5 into account, is bounded such as
Combining Case 1 and Case 2, we can obtain the conclusion that the augmented state vector is UUB. This completes the proof. □
Remark 6.
Recalling Assumptions 5, 6, and Remark 5 yields that is bounded.
Remark 7.
d is expressed in quadratic form, as shown in Equation (111). Hence, we have that . Simultaneously, taking into account, . In addition, recalling Assumptions 1, 2–4 and Remark 1 yields that d is bounded.
Remark 8.
Recalling Assumption 7, we have that . Simultaneously, taking Remark 6 into account yields that , . Thus, it can be guaranteed that , , and .
6. Simulation Results
In this section, the simulation of the pitching attitude tracking of the aircraft controlled by the backstepping finite-horizon optimal control is described to illustrate the effectiveness of the proposed scheme. The parameters of the aircraft model and the designed control system are given in Table 2 and Table 3, respectively. The finite-horizon is selected as . The terminal constraint is chosen as . The basis functions are designed as and . The objective of the designed control law is to obtain an optimized performance in the finite-horizon under the guarantee of basic tracking ability.
Table 2.
The parameters of the aircraft model.
Table 3.
The parameters of the control system.
In order to simulate system uncertainties , , aerodynamic parameter variations are set to 1.1 times the nominal value. The disturbances are given as . NN and NDO are designed as the estimators of the system uncertainties and external disturbances, respectively. In order to illustrate the effectiveness of the NN and NDO, the estimations of the sums of , and , are shown as Figure 2 and Figure 3, respectively. In addition, the comparison of the response of the angle of attack controlled with and without the designed estimators are shown in Figure 4. As shown in Figure 2 and Figure 3, the system uncertainties and external disturbances can be estimated accurately and quickly under the utility of NN and NDO. From Figure 4, it can be depicted that owing to the designed estimators, the adverse effects caused by the system uncertainties and external disturbances are greatly reduced so that the angle of attack can track the command signal more precisely.
Figure 2.
The estimations of the sum of and .
Figure 3.
The estimations of the sum of and .
Figure 4.
The response of the angle of attack controlled with and without estimators.
In order to achieve the finite-horizon optimal control, the ADP algorithm is applied. The objective function is given as in Equation (75). The objective of the ADP algorithm is to minimize the objective function. For the purpose of illustrating the effectiveness of the designed ADP algorithm, the response of is shown in Figure 5. It can be observed that gradually decreases and eventually converges to zero. Hence, the designed ADP algorithm is efficient.
Figure 5.
The response of the objective function.
The virtual control inputs defined in Equation (26), the control inputs defined in Equation (36), and and defined in Equation (71) are shown in Figure 6 and Figure 7, respectively. Under the action of the backstepping based finite-horizon optimal control inputs, the response of the angle of attack is shown in Figure 8. Furthermore, the response of the angle of attack controlled by the backstepping method with the same parameters is given in Figure 8 as a contrast. It can be illustrated that the system controlled by the designed backstepping based finite-horizon optimal control method can evolve in a finite-horizon optimal way. Thus, the conclusion can be drawn that a better performance can be obtained under the control of the backstepping-based finite-horizon optimal control method.
Figure 6.
The response of and .
Figure 7.
The response of and .
Figure 8.
The response of the angle of attack with and without finite-horizon optimization.
7. Conclusions
In this paper, a backstepping-based finite-horizon optimal control scheme is proposed to complete the task of angle of attack tracking in a finite-horizon optimal manner. An auxiliary system is designed to compensate for the input constraints. NN and NDO are applied to estimate the system uncertainties and external disturbances. Furthermore, the backstepping method containing NN and NDO is employed to ensure the stability of the system and suppress the adverse effects caused by the system uncertainties and external disturbances. In addition, the DSC technique is utilized to avoid the derivation operation in the process of the backstepping control. Moreover, the ADP algorithm is used to control the system in a finite-horizon optimal manner. In the design of the ADP, a critic NN is constructed by time-state-dependent feature functions to approximate the value function in the HJB equation. Finally, simulation results illustrate the effectiveness of the proposed backstepping-based finite-horizon optimal control scheme.
In future work, a practical experiment will be constructed and carried out to verify the proposed control method.
Author Contributions
Conceptualization, A.L., Y.S. and B.D.; methodology, A.L. and Y.S.; writing—original draft preparation, A.L. and Y.S.; writing—review and editing, A.L., Y.S. and B.D.; visualization, A.L. and Y.S.; supervision, B.D. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Dataset available on request from the authors.
Conflicts of Interest
Author Ang li was employed by the company Shenyang Aircraft Design and Research Institute Yangzhou Collaborative Innovation Research Institute Co., Ltd., Yangzhou, China. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Appendix A. Derivation of Equation
Appendix A.1. The Derivation of Equation (33) Is as Follows
Appendix A.2. The Derivation of Equation (34) Is as Follows
Appendix A.3. The Derivation of Equation (35) Is as Follows
Appendix A.4. The Derivation of Equation (38) Is as Follows
References
- Castañeda, H.; Salas-Peña, O.S.; de León-Morales, J. Extended observer based on adaptive second order sliding mode control for a fixed wing UAV. ISA Trans. 2017, 66, 226–232. [Google Scholar] [CrossRef] [PubMed]
- Lee, C.H.; Chung, M.J. Gain-scheduled state feedback control design technique for flight vehicles. IEEE Trans. Aerosp. Electron. Syst. 2001, 37, 173–182. [Google Scholar]
- Snell, S.A.; Enns, D.F.; Garrard, W.L. Nonlinear inversion flight control for a supermaneuverable aircraft. J. Guid. Control. Dyn. 1992, 15, 976–984. [Google Scholar] [CrossRef]
- Lungu, M. Stabilization and control of a UAV flight attitude angles using the backstepping method. World Acad. Sci. Eng. Technol. 2012, 6, 241–248. [Google Scholar]
- Zhang, J.; Sun, C.; Zhang, R.; Qian, C. Adaptive sliding mode control for re-entry attitude of near space hypersonic vehicle based on backstepping design. IEEE/CAA J. Autom. Sin. 2015, 2, 94–101. [Google Scholar] [CrossRef]
- Kang, Y.; Chen, S.; Wang, X.; Cao, Y. Deep convolutional identifier for dynamic modeling and adaptive control of unmanned helicopter. IEEE Trans. Neural Netw. Learn. Syst. 2018, 30, 524–538. [Google Scholar] [CrossRef] [PubMed]
- Wu, D.; Chen, M.; Gong, H.; Wu, Q. Robust backstepping control of wing rock using disturbance observer. Appl. Sci. 2017, 7, 219. [Google Scholar] [CrossRef]
- Wu, D.; Chen, M.; Gong, H. Robust control of post-stall pitching maneuver based on finite-time observer. ISA Trans. 2017, 70, 53–63. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Lu, G.; Zhong, Y. Robust LQR Attitude Control of a 3-DOF Laboratory Helicopter for Aggressive Maneuvers. IEEE Trans. Ind. Electron. 2013, 60, 4627–4636. [Google Scholar] [CrossRef]
- Zarei, J.; Montazeri, A.; Motlagh, M.R.J.; Poshtan, J. Design and comparison of LQG/LTR and H controllers for a VSTOL flight control system. J. Frankl. Inst. 2007, 344, 577–594. [Google Scholar] [CrossRef]
- Bellman, R. Dynamic programming. Science 1966, 153, 34–37. [Google Scholar] [CrossRef] [PubMed]
- Chanane, B. Optimal control of nonlinear systems: A recursive approach. Comput. Math. Appl. 1998, 35, 29–33. [Google Scholar] [CrossRef]
- Mracek, C.P.; Cloutier, J.R. Control designs for the nonlinear benchmark problem via the state-dependent Riccati equation method. Int. J. Robust Nonlinear Control 1998, 8, 401–433. [Google Scholar] [CrossRef]
- Werbos, P. Approximate Dynamic Programming for Real-Time Control and Neural Modeling; Academic Press: New York, NY, USA, 1977. [Google Scholar]
- Wei, Q.; Song, R.; Yan, P. Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP. IEEE Trans. Neural Netw. Learn. Syst. 2017, 27, 444–458. [Google Scholar] [CrossRef] [PubMed]
- Ferrari, S.; Stengel, R.F. Online adaptive critic flight control. J. Guid. Control. Dyn. 2004, 27, 777–786. [Google Scholar] [CrossRef]
- Ferrari, S.; Steck, J.E.; Chandramohan, R. Adaptive feedback control by constrained approximate dynamic programming. IEEE Trans. Syst. Man Cybern. Part 2008, 38, 982–987. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Y.; Kampen, E.J.V.; Chu, Q.P. Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability. J. Guid. Control. Dyn. 2018, 41, 1–14. [Google Scholar] [CrossRef]
- Fan, Q.Y.; Yang, G.H. Adaptive actor–critic design-based integral sliding-mode control for partially unknown nonlinear systems with input disturbances. IEEE Trans. Neural Netw. Learn. Syst. 2017, 27, 165–177. [Google Scholar] [CrossRef] [PubMed]
- Sun, J.; Liu, C. Backstepping-based adaptive dynamic programming for missile-target guidance systems with state and input constraints. J. Frankl. Inst. 2018, 355, 8412–8440. [Google Scholar] [CrossRef]
- Sun, J.; Liu, C.; Zhao, X. Backstepping-based zero-sum differential games for missile-target interception systems with input and output constraints. IET Control Theory Appl. 2018, 12, 243–253. [Google Scholar] [CrossRef]
- Xia, R.; Chen, M.; Wu, Q. Neural network based optimal adaptive attitude control of near-space vehicle with system uncertainties and disturbances. Proc. Inst. Mech. Eng. Part J. Aerosp. Eng. 2019, 233, 641–656. [Google Scholar] [CrossRef]
- Cui, X.; Zhang, H.; Luo, Y.; Zu, P. Online finite-horizon optimal learning algorithm for nonzero-sum games with partially unknown dynamics and constrained inputs. Neurocomputing 2016, 185, 37–44. [Google Scholar] [CrossRef]
- Cheng, T.; Lewis, F.L.; Abu-Khalaf, M. A neural network solution for fixed-final time optimal control of nonlinear systems. Automatica 2007, 43, 482–490. [Google Scholar] [CrossRef]
- Zhao, Q.; Xu, H.; Jagannathan, S. Neural network-based finite-horizon optimal control of uncertain affine nonlinear discrete-time systems. IEEE Trans. Neural Netw. Learn. Syst. 2014, 26, 486–499. [Google Scholar] [CrossRef] [PubMed]
- Sun, J.; Liu, C. Finite-horizon differential games for missile–target interception system using adaptive dynamic programming with input constraints. Int. J. Syst. Sci. 2018, 49, 264–283. [Google Scholar] [CrossRef]
- Xu, H.; Jagannathan, S. Neural network-based finite horizon stochastic optimal control design for nonlinear networked control systems. IEEE Trans. Neural Netw. Learn. Syst. 2014, 26, 472–485. [Google Scholar] [CrossRef] [PubMed]
- Chen, M.; Tao, G.; Jiang, B. Dynamic surface control using neural Networks for a class of uncertain nonlinear systems with input saturation. IEEE Trans. Neural Netw. Learn. Syst. 2015, 26, 2086–2097. [Google Scholar] [CrossRef] [PubMed]
- Abu-Khalaf, M.; Lewis, F.L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005, 41, 779–791. [Google Scholar] [CrossRef]
- Dierks, T.; Jagannathan, S. Optimal control of affine nonlinear continuous-time systems. In Proceedings of the 2010 American Control Conference, Baltimore, MD, USA, 30 June–2 July 2010; pp. 1568–1573. [Google Scholar]
- Wang, D.; Liu, D.; Li, H.; Ma, H. Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming. Inf. Sci. 2014, 282, 167–179. [Google Scholar] [CrossRef]
- Xu, H.; Zhao, Q.; Dierks, T.; Jagannathan, S. Neural network-based finite-horizon approximately optimal control of uncertain affine nonlinear continuous-time systems. In Proceedings of the 2014 American Control Conference, Portland, OR, USA, 4–6 June 2014; pp. 1243–1248. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).







