Abstract
This paper uses the adaptive dynamic programming (ADP) method to achieve optimal trajectory tracking control for quadrotors. Relying on an established mathematical model of a quadrotor, the approximate optimal trajectory tracking control, which consists of the steady-state control input and the approximate optimal feedback control input, is designed for a nominal system. Considering the compound disturbances in position and attitude dynamic models, disturbance observers are introduced. The estimated values are used to design robust compensation inputs to suppress the effect of the compound disturbances for good trajectory tracking performance. Theoretically, the Lyapunov theorem demonstrates the stability of a closed-loop system. The robustness and effectiveness of the proposed controller are confirmed by the simulation results.
1. Introduction
The miniaturization and reduction in cost of the relevant control components in aircraft, as well as the development and progress of computer and sensing and measurement technologies, have improved the stability of flight control systems and greatly facilitated the development of quadrotors [1]. The high operability, strong mobility and flexibility of quadrotors allow them to meet the specific needs of many projects, generally used in military, industrial and other fields [2,3]. A quadrotor system is multivariable, nonlinear and strongly coupled, and quadrotors will also be disturbed by the surrounding environment during flight [4]. These factors can affect the accuracy of quadrotor control systems. The requirements for high-accuracy and robust flight control in the design of controllers for quadrotors are stringent, and the design of a core control algorithm is a prerequisite for quadrotors to achieve a stable and high-precision flight performance. Therefore, the research and development of controllers for quadrotor systems is of great significance.
At present, it is no longer a problem to ensure the uniformity of quadrotors through control algorithms. Many controllers for quadrotors have been designed and are already in application [5]. Since the dynamics of quadrotors can be linearized around the equilibrium point, traditional linear control methods are used for a designed controller [6]. On this basis, linear techniques are employed in the flight control of quadrotors, such as linear quadratic regulator (LQR) control [7]. However, quadrotors need to be controlled away from the equilibrium point to accomplish complex control tasks and withstand external disturbances. As a result, a technique has been devised that is regarded as a robust feedback linearization method that uses extended state observers to estimate the nonlinear state feedback term online, containing aerodynamic forces, moments and unknown disturbances, and obtains the desired closed-loop dynamics via pole assignment [8]. Moreover, several robust controllers relying on nonlinear techniques have been proposed, such as sliding mode control [9], adaptive control [10], backstepping-based control [11] and robust control [12]. These control methods ensure the stability and robustness of nonlinear systems and have generally been used for the tracking control of these systems, but their optimal properties have not been considered. Therefore, the concept of optimization has been introduced into control design.
To derive the optimal control policy for the infinite horizon optimal control problem, solving the Hamilton–Jacobi–Bellman (HJB) equation or the Hamilton–Jacobi–Isaacs (HJI) equation for the optimal control problem considering uncertainties is essential. Nevertheless, it is difficult to mathematically derive the corresponding analytical solutions in most cases. Neural networks are an optional method to overcome this problem [13,14,15]. The approximation property of neural networks makes it possible to find approximate solutions to partial differential equations. The convergence of neural networks can be ensured by penalizing them to ensure they satisfy the given partial differential equations. The ADP method is the combination of reinforcement learning, dynamic programming and neural network adaptive methods to derive approximate solutions of the HJB/HJI equations using function approximate structures to address nonlinear optimal control problems [16,17]. The ADP method is used for control design with suitable performance index functions to derive the desired dynamic performance and stabilize a nominal system with uncertainties. However, most nonlinear optimal control methods using the ADP method are aimed at nominal systems or uncertain systems satisfying specific conditions [18,19,20], while the immunity to disturbances is still weak for such systems with external time-varying disturbances independent of the state, and the control effect under stronger disturbances is not ideal. The ADP method has been used in the design of controllers for quadrotors and efforts have been made to improve the robustness, but the designed controllers are more geared towards linear systems and design uncertainty is a unique problem [21,22]. Quadrotors will often experience various external effects in flight, requiring strong adaptive and anti-disturbance capabilities in flight control. The disturbance observer technique achieves disturbance suppression of the target utilizing feedback regulation [23], which can attenuate compound disturbances containing external disturbances and model uncertainties, thus improving the system robustness. A disturbance observer can accurately estimate compound disturbances in a system, which greatly reduces the conservatism of the control. In addition, since a disturbance observer can usually be designed independently of the controller, this ensures that the method can be easily combined with other advanced control methods and more flexible in its application. There are experiments suggesting that the introduction of a disturbance observer significantly improves performance, which is a good reference for methods for quadrotors to overcome disturbances [24].
Considering the above analysis, a robust approximate optimal trajectory tracking control method is proposed for quadrotors to solve the optimal control problem under the conditions of compound disturbances. The main contributions are summarized as follows:
- (1)
- The combination of modeling uncertainties and external time-varying disturbances is considered as compound disturbances. Disturbance observers are introduced to estimate the compound disturbances in the position and attitude subsystems, and the estimated values are used to design robust compensation inputs to suppress the effects of the compound disturbances and ensure the stability of a quadrotor system under the ADP method.
- (2)
- To obtain optimal trajectory tracking control for a quadrotor without composite disturbances, the ADP method is used to design approximate optimal control inputs for the nominal system of a quadrotor.
The rest of the paper is organized as follows. In Section 2, the quadrotor mathematical model is developed, and the quadrotor system is divided into two subsystems. Section 3 describes the design of the robust approximate optimal trajectory tracking control and the stability analysis of the closed-loop system. Section 4 describes the robust approximate optimal trajectory tracking control for the quadrotor. The results of the corresponding simulation and the results of the comparative simulation without disturbance observer are presented in Section 5. Section 6 gives the conclusion of the paper.
2. Mathematical Modeling of a Quadrotor
The quadrotor has four evenly spaced, cross-symmetrical brushless motors in the plane, The rotors of motor 1 and motor 3 rotate clockwise, while the rotors of motor 2 and motor 4 rotate counterclockwise. By changing the rotational speed of the four rotors, the quadrotor generates different magnitudes of lift forces and torques, which can control the takeoff, landing and attitude motions of the quadrotor. As a result, the location of the quadrotor can be altered in the three-dimensional space. Figure 1 depicts the basic structure of the quadrotor.
Figure 1.
Basic structure of the quadrotor.
To clarify the mathematical model of the quadrotor system and satisfy the implementation of the control method, the earth-fixed inertial frame and the body-fixed body frame are established. To ensure that the constructed mathematical model does not lose the generality, it is assumed that the deformation and elastic vibration properties of the rotors and body are neglected, and the quadrotor is considered as an ideal rigid body; the quadrotor’s structure is symmetrical, its mass is uniformly distributed, and its center of mass is located at the geometric center. The translational and rotational motions of the quadrotor are satisfied by [25]
where represents the position of the quadrotor in the inertial frame and represents the corresponding velocity. denotes the vector of Euler angles. denotes the angular velocity of the quadrotor in the body frame. is the rotation matrix for the angular velocity in the form of
in which , and .
Relying on the Newton–Euler method, the dynamical equation of quadrotor with compound disturbances is represented by [26]
where represents the mass of the quadrotor. represents the inertia matrix of the quadrotor. As the assumptions of the quadrotor structure, its inertia matrix can be defined as the diagonal array . is the drag coefficient matrix. represents the resultant force consisting of the gravity and the total lift in the inertial frame. represents the torque in the body frame. and are the compound disturbances in position and attitude dynamic models, which contain modeling uncertainties and external time-varying disturbances.
According to the mechanical analysis, the quadrotor is affected by gravity and lift forces. Since the special structure of the quadrotor, the lift forces are along the z-axis direction of the body frame. Then, the resultant force expressed in the inertial frame is [27]
where represents the total lift force and represents the gravity acceleration. . is the rotation matrix of the body frame transformed into the inertial frame in the form of
Assumption 1
([28]). The pitch and roll angles hold the conditions and to avoid the singularities of the matrices and .
Assumption 2
([29]). In the control process, the total compound disturbance has finite energy. In addition, is a continuous function and its norm is bounded such that , where is an unknown positive constant. Simultaneously, the compound disturbances are usually considered to be superimposed by the low-frequency period signals. Hence, it is assumed that the total compound disturbance has a low change rate and its rate of change is slow compared to the dynamic properties of the disturbance observer, which can be considered that .
Assumption 3
([30]). The desired trajectory of position and the desired trajectory of yaw angle and their higher order derivatives are known, continuous and bounded.
Remark 1.
Assumption 2 is common in control studies using disturbance observers [31,32,33], while there are different considerations for compound disturbances in [34]. In the case of this paper, the considerations in Assumption 2 are used. Assumption 3 ensures that the ADP method can be utilized for the control design and the stability analysis.
The total lift and torque of the quadrotor are related to the force and torque of the four rotors as follows [35]:
where and are the lift and torque generated by the four rotors of the quadrotor, respectively. l represents the length from each rotor to the center of the body.
The rotor speeds are related to pulse-width modulated (PWM) signals through the motors. The lift forces and torques generated by the four motors are related to the pulse width of the input signals as follows [36]:
where and are the positive gains of the lift coefficient and the inverse torque coefficient, respectively. is the motor bandwidth and represents the PWM signals of each corresponding motor, which should be limited between 0 and 1.
Assuming that the motors have a sufficiently fast response speed, then the motor model can be simplified as [37]
Considering the trajectory tracking control for the quadrotor, the control objective is to design a controller that allows the position and attitude to track the desired trajectory asymptotically within a small error.
Combining (1), (2), (4) and (5), the overall model of the quadrotor can be decomposed into a position subsystem and an attitude subsystem. The position subsystem can be represented as
with
While the attitude subsystem is expressed in the form of
with
3. Robust Approximate Optimal Trajectory Tracking Control Design
Considering the convenience of describing the control design process, (12) and (13) is represented in the uniform form
in which and represent the drift dynamics and the input dynamics of the system, respectively. denotes the observable state vector, denotes the control input, and denotes the compound disturbance.
Definition 1
([39]). A state vector is said to be uniformly ultimately bounded (UUB) if there exists a compact set , a positive number and a time such that for all state variable initial value and all .
Lemma 1
([40]). is UUB if the time derivative of a positive definite function is negative when for a positive constant .
To realize the trajectory tracking control with robustness for the system, the designed controller consists of two parts, the form of which is as follows:
where is the robust compensation input designed through the disturbance observer for suppressing the effect of compound disturbances in the system. is the control input designed based on the ADP method for the nominal system, which takes the form of
where represents the steady-state control input and represents the feedback control input.
3.1. Disturbance Observer Design
The disturbance observer is applied to derive the estimate of the compound disturbance. The estimated value is then used for the design of the robust compensation input to improve robustness. The disturbance observer is designed as
in which represents the estimate of the unknown compound disturbance, represents the designed vector-valued function, is the observer gain and represents the auxiliary variable vector of the disturbance observer.
Remark 2.
In the disturbance observer (17), the derivative of the state is required, which is unknown because the compound disturbance is unknown. Then, the auxiliary variable vector is given to avoid calculating the derivative of the state.
Define the estimation error of compound disturbance as . With regard to Assumption 2 and the disturbance observer (17), the time derivative of is developed as
Combined with (14), we have
Then, is convergent by appropriately designing the vector-valued function .
Theorem 1.
Proof.
Select the candidate Lyapunov function as follows:
Combined with (18), the time derivative of is
In the case where is positive definite, then we derive
where and denotes the minimum eigenvalue. Obviously, when . Hence, the disturbance observer (17) can estimate D and will converge to zero. This completes the proof. □
Then, the robust compensation input is designed as
3.2. Optimal Trajectory Tracking Control Design and Analysis
The compound disturbance is estimated by the disturbance observer. The robust compensation input is designed by the estimated value to suppress the effect of the compound disturbances. As a result, converting the trajectory tracking control problem of the nonlinear system with the compound disturbance into the trajectory tracking control problem of the nominal system is possible. In order to derive the optimal control for the nominal system, deriving the solution of the associated HJB equation is essential. Unfortunately, deriving the analytical solution is difficult for the nonlinear system by the direct solution method. Then, the ADP method is utilized for achieving the approximate optimal control by constructing the critic network. The weight update law designed for the critic network ensures the convergence of the weight and the stability of the closed-loop system.
For System (14), the nominal system is represented by
Given the desired trajectory , the steady-state control input is obtained from (24) as
in which denotes the pseudo-inverse of .
Let and , then we have
Noting that , the norm of is bounded such that for the positive constants and .
As a result of Theorem 1, the disturbance observer (20) can successfully estimate the compound disturbance D and the estimation error of compound disturbance can converge to zero. Therefore, it is possible to neglect in the error system (27) for the optimal control design [41,42]. However, would still be considered in the stability analysis. Then, the nominal error system is represented by
Define the cost function as
where and are the designed symmetric positive definite matrices.
The nonlinear Lyapunov equation for (29) is achieved as
where and .
Definition 2
The Hamiltonian function takes the following form
The optimal cost function is represented by
and the following relation is satisfied
where .
Under the existence condition of the optimal solution , the optimal feedback control input is derived by
3.3. Approximate Optimal Control Design
Clearly, it is necessary to derive by solving the HJB Equation (35) for deriving the optimal feedback control input (34). However, (35) is a typical nonlinear partial differential equation and its solution is difficult to derive in the analytic form [44,45]. To overcome the difficulty, the ADP method relying on the policy iteration technique is utilized to derive the approximate solution.
Assumption 4
([46]). The continuously differentiable Lyapunov function candidate for the nominal error system (28) satisfies , where . Meanwhile, there exists a symmetric positive definite matrix such that . Moreover, the relation holds for positive constants , .
Remark 3.
Assumption 4 is a common assumption that has been used for the ADP method. Generally, it is assumed that the closed-loop dynamics with the optimal feedback control is bounded by a function of the system state on the compact set. In such a situation, there exists a positive constant such that . Hence, we can further derive . Furthermore, the function can be correctly selected as a quadratic polynomial [47], such as .
Considering the uniform estimation property of neural networks, the optimal cost function is approximated by
where represents the unknown ideal constant weight, represents the activation function, represents the approximate error, and N represents the number of neurons. This neural network is called the critic network in the ADP method.
Lemma 2
([48]). The estimation error is expected to be bounded when the approximated function is bounded.
Then, by the definition of , it is developed as follows
where and .
Substituting (37) into (35), the HJB equation is developed as
where . represents the residual error, which takes the form of
Since is bounded, there exists the positive constants and such that .
Define the estimate of as , then the estimate of is derived as follows:
Moreover, the approximate optimal feedback control input is derived as
Remark 4.
The classical ADP method utilizes the critic network and the actor network to approximate the optimal cost function and the optimal feedback control, respectively [43,49,50]. Considering the association between the optimal cost function and the optimal feedback control for the continuous affine nonlinear system, it is possible to omit the actor network and only use the critic network [51,52]. This framework provides smaller computational effort, faster convergence and compared to the actor–critic network framework, which has a better practical value.
Define the objective function as
Moreover, the weight update law is designed as
where , are the learning rates to be designed. and . is given in Assumption 4. in the last term is defined as
where is a designed positive constant.
Remark 5.
The first term in (45) is employed for minimizing the objective function (44). To ensure that will converge to , the existence of the persistence of excitation (PE) condition is essential during the learning process is necessary [49]. In addition, the probing noise is typically introduced to the control input for satisfying this condition, which may enable the closed-loop system to become unstable during the learning process [53,54]. The second term in (45) is employed for the stability of the closed-loop system.
3.4. Stability Analysis
Assumption 5
([50]). The ideal weight have bound over the compact set Ø such that for a positive constant . Meanwhile, the activation function and the approximate error are bounded such that , for positive constants and , and their derivatives are also bounded such that and for positive constants and . Moreover, the residual error will converge to zero when the number of neurons N is sufficiently large, as suggested by Remark 3 and the bound of . That is, the relation exists for the positive constant .
Theorem 2.
Considering System (14), the robust approximate optimal controller for the trajectory tracking control is designed as (15), which consists of the robust compensation input (23) and the nominal system control input (16), and the weight update law is designed as (45) for the critic network, then it is ensured that the tracking error E of the closed-loop system and the weight estimation error are UUB.
Proof.
Select the candidate Lyapunov function as follows
where is designed as (20), and .
Since the first two terms in the final form of (50) are negative semi-definite, we then derive
According to Remark 3 and Assumption 5, and considering the bound of , we assume that , , , , and . Noticing that the PE condition guarantees to be bounded, there exists a positive constant such that . In addition, based on Young’s inequality, there exists the relation , where c is a nonzero constant. Then, we have
Then, (51) is developed as
where
and are all non-zero constants whose selection guarantees . Combining the results of (22), (49) and (51), we have
By using Young’s inequality, the relation exists. Then, (61) is developed as
The following discussion is divided into two cases.
Case 1.
In this case, . Since , we can derive that . According to the dense property of , there exists a positive constant such that for all . Then, (62) becomes
By selecting and , such that , then is satisfied provided that one of the following conditions holds:
or
Case 2.
Considering the case , (62) is developed as
Based on Assumption 4, and considering , we have
Similarly, by selecting and such that and , then it means that holds as long as
or
In conclusion, when or . Relying on Lemma 1 and the standard Lyapunov extension theorem [55], it is further concluded that the tracking error E of the closed-loop system and the weight estimation error are UUB. This completes the proof. □
Remark 6.
As a result of Theorem 2, the approximate optimal cost function in (41) and the approximate optimal feedback control input in (42) can, respectively, converge to the neighborhoods of the optimal cost function and the optimal feedback control input within finite bounds when the PE condition holds [41].
4. Robust Approximate Optimal Trajectory Tracking Control for a Quadrotor
Position and yaw angle are the system outputs for the quadrotor that tracks the desired trajectory of position and the desired trajectory of yaw angle. The desired trajectories of roll and pitch angles required by the attitude subsystem are generated according to the position subsystem control inputs. The tracking errors in lateral and longitudinal positions are eliminated by the attitude subsystem tracking the desired trajectories of roll and pitch angles. According to the description of the control design in the previous section, the control design for the quadrotor is shown in Figure 2, which can guarantee that the tracking error of the quadrotor remains within a small range.
Figure 2.
Control design of the quadrotor.
4.1. Position Control Design
The estimated value of unknown compound disturbance in the position subsystem is derived by the following disturbance observer
where denotes the observer gain of the disturbance observer in the position subsystem and F is derived by (6). Then, the position subsystem robust compensation input is designed as
The steady-state control input for the position nominal system is designed as
where and . denotes the pseudo-inverse of . Then, define the position subsystem tracking error as
The cost function of the position subsystem is represented as
where and are the designed symmetric definite matrices. The approximate optimal feedback control input in the position subsystem is
where and . is the activation function and represents the estimate of the ideal weight for the critic network of the position subsystem. The corresponding weight update law is designed as
where , are the designed learning rates. , and . , where is the Lyapunov function candidate that satisfies Assumption 4.
Then, the robust approximate optimal trajectory tracking control in the position subsystem is designed as
4.2. Attitude Resolution
Since the system of the quadrotor is underactuated and strongly coupled, the information of the position subsystem is used to calculate the total lift force. The desired trajectories of roll and pitch angles are determined by the position subsystem through the relation between the kinematic equation and the Euler equation and passed to the attitude subsystem. For the position subsystem, the generated tracking error and the received compound disturbance can be eliminated by the attitude subsystem. By a matrix operation on (6), the following equations are derived:
The actual total lift force for the quadrotor system is designed as
The desired trajectories of the pitch and roll angles are derived by the following equations:
Then, we have
4.3. Attitude Control Design
Similarly, the estimated value of unknown compound disturbance in the attitude subsystem is derived by the following disturbance observer
where denotes the observer gain of the disturbance observer in the attitude subsystem. Then, the attitude subsystem robust compensation input is
The desired trajectory for the angular velocity is given by [56]
in which is the desired trajectory of Euler angles and is the desired trajectory of the angular velocity. The steady-state control input for the attitude nominal system is designed as
where and denotes the pseudo-inverse of . Then, define the attitude subsystem tracking error as
While the cost function of the attitude subsystem is represented as
where and are the designed symmetric definite matrices. The approximate optimal feedback control input in the attitude subsystem is
where and . is the activation function and represents the estimate of the ideal weight for the critic network of the attitude subsystem. The corresponding weight update law is designed as
where , are the learning rates, , and . , where is the Lyapunov function candidate that satisfies Assumption 4.
Then, the robust approximate optimal trajectory tracking control in the attitude subsystem is designed as
5. Simulation Results
In this section, the robustness and effectiveness of the designed controller are evaluated through numerical simulations. The quadrotor is considered to be in a flight environment with slow-changing disturbances. The parameters of the quadrotor model are presented in Table 1 [24].
Table 1.
Parameters of quadrotor model.
A representative desired trajectory is selected to emulate the trajectory tracking performance of the quadrotor. The desired trajectory is designed as and . In addition, referring to [57,58], the unknown compound disturbances considered are described as and . In this way, the performance of the disturbance observers is reflected by comparing them with the estimates. The initial states of the quadrotor are all set to zero.
The vector-valued functions of the disturbance observers are designed as , , while the observer gains are selected as
Clearly, and are positive definite and satisfy the design requirements of Theorem 1. To derive the appropriate dynamic performance, the parameters of the performance index functions are designed as , , . The activation functions are designed as , . The relevant constants of the weight update laws are selected as , , , , , . The Lyapunov function candidates are selected as and . The initial weights are assigned values within the interval .
The PE condition is ensured by the method mentioned in Remark 5 to excite the system states. The weights gradually vary to become slower and stabilize during the learning process. The converged weights are already very close to the ideal weights after sufficient learning. The convergence of the whole critic network weights , in the learning processes are depicted in Figure 3. The final converged values of , are as follows
Figure 3.
Convergence of critic network weights.
The converged weights are used to design the approximate feedback optimal control inputs. Figure 4 and Figure 5 present the variation of states in trajectory tracking control, revealing the corresponding tracking errors in Figure 6 and Figure 7. In addition, Figure 8 visualizes the path in three-dimensional space, whereas Figure 9 illustrates the PWM signals for the motors. The figures clearly demonstrate that the quadrotor system effectively tracks the desired trajectory and achieves a small convergence bound for the tracking error. These results highlight the rapidity and accuracy of the designed controller in the control process.
Figure 4.
Variation of states in the position subsystem.
Figure 5.
Variation of states in the attitude subsystem.
Figure 6.
Tracking errors in the position subsystem.
Figure 7.
Tracking errors in the attitude subsystem.
Figure 8.
Results of three-dimensional path.
Figure 9.
Pulse-width of input signals.
The estimates for the compound disturbances are depicted in Figure 10. It shows that the estimated values from the disturbance observers can quickly follow the actual compound disturbances. Moreover, the trajectory tracking control performs well in the presence of compound disturbances, which implies the robustness of the designed controller.
Figure 10.
Estimates of compound disturbances.
In order to verify that the designed controller rejects the compound disturbances, a comparative simulation is performed without the disturbance observers in the position subsystem and the attitude subsystem. The control inputs use only the control inputs designed for the nominal system. Under such control, the variation of states is presented in Figure 11 and Figure 12, while Figure 13 and Figure 14 show the corresponding tracking errors.
Figure 11.
Variation of states in the position subsystem without disturbance observers.
Figure 12.
Variation of states in the attitude subsystem without disturbance observers.
Figure 13.
Tracking errors in the position subsystem without disturbance observers.
Figure 14.
Tracking errors in the attitude subsystem without disturbance observers.
By comparing the simulation results, it is clear that the trajectory tracking control of the quadrotor cannot be realized without the robust compensation inputs. Thus, further demonstrating the robustness of the designed controller. Moreover, the corresponding path in three-dimensional space and the PWM signals of the motors are shown in Figure 15 and Figure 16, respectively.
Figure 15.
Results of three-dimensional path without disturbance observers.
Figure 16.
Pulse-width of input signals without disturbance observers.
In summary, the controller designed for quadrotor trajectory tracking control has good dynamic performance, high tracking accuracy and strong robustness when the quadrotor is subjected to compound disturbances.
6. Conclusions
This paper proposes a robust approximate optimal controller for the trajectory tracking control of the quadrotor with unknown compound disturbances. By incorporating the estimated values of compound disturbances that are estimated by the disturbance observers into the control design, the effect of compound disturbances can be suppressed, resulting in ensured tracking accuracy and improved robustness. Moreover, the ADP method can then be utilized in the nominal system for ensuring the performance index of the control. The stability of the closed-loop system is analyzed by the Lyapunov theorem, which demonstrates that the tracking errors are UUB. Simulation results further confirm the robustness and effectiveness of the designed controller. In future work, experiments will be considered to validate the performance of the proposed controller.
Author Contributions
Conceptualization, R.L. and Z.Y.; methodology, R.L.; software, Z.Y.; validation, R.L., Z.Y. and G.Y.; formal analysis, R.L.; investigation, Z.Y.; resources, R.L.; data curation, L.J.; writing—original draft preparation, Z.Y.; writing—review and editing, R.L., Z.Y. and G.Y.; visualization, G.L.; supervision, Z.L.; project administration, R.L.; funding acquisition, R.L. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the National Natural Science Foundation of China (Grant No. 62003233), the Fundamental Research Program of Shanxi Province (Grant Nos. 201901D211083 and 20210302124552), and the Science and Technology Innovation Project of Higher Education Institutions in Shanxi Province (Grant No. 2019L0236).
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Hassanalian, M.; Abdelkefi, A. Classifications, applications, and design challenges of drones: A review. Prog. Aerosp. Sci. 2017, 91, 99–131. [Google Scholar] [CrossRef]
- Salem, K.A.; Palaia, G.; Chiarelli, M.R.; Bianchi, M. A simulation framework for aircraft take-off considering ground effect aerodynamics in conceptual design. Aerospace 2023, 10, 459. [Google Scholar] [CrossRef]
- Salem, K.A.; Palaia, G.; Quarta, A.A. Review of hybrid-electric aircraft technologies and designs: Critical analysis and novel solutions. Prog. Aerosp. Sci. 2023, 141, 100924. [Google Scholar] [CrossRef]
- Shao, S.; Chen, M.; Hou, J.; Zhao, Q. Event-triggered-based discrete-time neural control for a quadrotor UAV using disturbance observer. IEEE/ASME Trans. Mechatronics 2021, 26, 689–699. [Google Scholar] [CrossRef]
- Idrissi, M.; Salami, M.; Annaz, F. A review of quadrotor unmanned aerial vehicles: Applications, architectural design and control algorithms. J. Intell. Robot. Syst. 2022, 104, 22. [Google Scholar] [CrossRef]
- Rinaldi, F.; Chiesa, S.; Quagliotti, F. Linear quadratic control for quadrotors UVAs dynamics and formation flight. J. Intell. Robot. Syst. 2013, 70, 203–220. [Google Scholar] [CrossRef]
- Dharmawan, A.; Priyambodo, T.K. Model of linear quadratic regulator (lqr) control method in hovering state of quadrotor. J. Telecommun. Electron. Comput. Eng. (JTEC) 2017, 9, 135–143. [Google Scholar]
- Alonge, F.; D’Ippolito, F.; Fagiolini, A.; Garraffa, G.; Sferlazza, A. Trajectory robust control of autonomous quadcopters based on model decoupling and disturbance estimation. Int. J. Adv. Robot. Syst. 2021, 18, 1729881421996974. [Google Scholar] [CrossRef]
- Yang, Y.; Yan, Y. Attitude regulation for unmanned quadrotors using adaptive fuzzy gain-scheduling sliding mode control. Aerosp. Sci. Technol. 2016, 54, 208–217. [Google Scholar] [CrossRef]
- Avram, R.C.; Zhang, X.; Muse, J. Nonlinear adaptive fault-tolerant quadrotor altitude and attitude tracking with multiple actuator faults. IEEE Trans. Control. Syst. Technol. 2017, 26, 701–707. [Google Scholar] [CrossRef]
- Chen, F.; Lei, W.; Zhang, K.; Tao, G.; Jiang, B. A novel nonlinear resilient control for a quadrotor UVA via backstepping control and nonlinear disturbance observer. Nonlinear Dyn. 2016, 85, 1281–1295. [Google Scholar] [CrossRef]
- Liu, H.; Xi, J.; Zhong, Y. Robust attitude stabilization for nonlinear quadrotor systems with uncertainties and delays. IEEE Trans. Ind. Electron. 2017, 64, 5585–5594. [Google Scholar] [CrossRef]
- Liu, E.; Yan, Y.; Yang, Y. Neural network approximation-based backstepping sliding mode control for spacecraft with input saturation and dynamics uncertainty. Acta Astronaut. 2022, 191, 1–10. [Google Scholar] [CrossRef]
- Li, R.; Chen, M.; Wu, Q. Robust control for an unmanned helicopter with constrained flapping dynamics. Chin. J. Aeronaut. 2018, 31, 2136–2148. [Google Scholar] [CrossRef]
- Li, R.; Chen, M.; Wu, Q. Adaptive neural tracking control for uncertain nonlinear systems with input and output constraints using disturbance observer. Neurocomputing 2017, 235, 27–37. [Google Scholar] [CrossRef]
- Yang, Y.; Modares, H.; Vamvoudakis, K.G.; He, W.; Xu, C.Z.; Wunsch, D.C. Hamiltonian-driven adaptive dynamic programming with approximation errors. IEEE Trans. Cybern. 2021, 52, 13762–13773. [Google Scholar] [CrossRef]
- Xue, S.; Luo, B.; Liu, D. Event-triggered adaptive dynamic programming for zero-sum game of partially unknown continuous-time nonlinear systems. IEEE Trans. Syst. Man Cybern. Syst. 2018, 50, 3189–3199. [Google Scholar] [CrossRef]
- Du, Y.; Jiang, B.; Ma, Y.; Cheng, Y. Robust ADP-based sliding-mode fault-tolerant control for nonlinear systems with application to spacecraft. Appl. Sci. 2022, 12, 1673. [Google Scholar] [CrossRef]
- Huang, Y.; Wang, D.; Liu, D. Bounded robust control design for uncertain nonlinear systems using single-network adaptive dynamic programming. Neurocomputing 2017, 266, 128–140. [Google Scholar] [CrossRef]
- Wang, D.; Liu, D.; Li, H. Policy iteration algorithm for online design of robust control for a class of continuous-time nonlinear systems. IEEE Trans. Autom. Sci. Eng. 2014, 11, 627–632. [Google Scholar] [CrossRef]
- Dou, L.; Su, X.; Zhao, X.; Zong, Q.; He, L. Robust tracking control of quadrotor via on-policy adaptive dynamic programming. Int. J. Robust Nonlinear Control 2021, 31, 2509–2525. [Google Scholar] [CrossRef]
- Mu, C.; Zhang, Y. Learning-based robust tracking control of quadrotor with time-varying and coupling uncertainties. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 259–273. [Google Scholar] [CrossRef]
- Chen, W.H.; Yang, J.; Guo, L.; Li, S. Disturbance-observer-based control and related methods—An overview. IEEE Trans. Ind. Electron. 2015, 63, 1083–1095. [Google Scholar] [CrossRef]
- Chen, M.; Xiong, S.; Wu, Q. Tracking flight control of quadrotor based on disturbance observer. IEEE Trans. Syst. Man Cybern. Syst. 2019, 51, 1414–1423. [Google Scholar] [CrossRef]
- Chen, F.; Jiang, R.; Zhang, K.; Jiang, B.; Tao, G. Robust backstepping sliding-mode control and observer-based fault estimation for a quadrotor UVA. IEEE Trans. Ind. Electron. 2016, 63, 5044–5056. [Google Scholar]
- Shao, X.; Liu, J.; Cao, H.; Shen, C.; Wang, H. Robust dynamic surface trajectory tracking control for a quadrotor UVA via extended state observer. Int. J. Robust Nonlinear Control 2018, 28, 2700–2719. [Google Scholar] [CrossRef]
- Mofid, O.; Mobayen, S. Adaptive sliding mode control for finite-time stability of quad-rotor UAVs with parametric uncertainties. ISA Trans. 2018, 72, 1–14. [Google Scholar] [CrossRef] [PubMed]
- Lei, W.; Li, C.; Chen, M.Z. Robust adaptive tracking control for quadrotors by combining PI and self-tuning regulator. IEEE Trans. Control Syst. Technol. 2018, 27, 2663–2671. [Google Scholar] [CrossRef]
- Maqsood, H.; Qu, Y. Nonlinear disturbance observer based sliding mode control of quadrotor helicopter. J. Electr. Eng. Technol. 2020, 15, 1453–1461. [Google Scholar] [CrossRef]
- Hua, H.; Fang, Y.; Zhang, X.; Lu, B. A novel robust observer-based nonlinear trajectory tracking control strategy for quadrotors. IEEE Trans. Control Syst. Technol. 2020, 29, 1952–1963. [Google Scholar] [CrossRef]
- Song, R.; Lewis, F.L. Robust optimal control for a class of nonlinear systems with unknown disturbances based on disturbance observer and policy iteration. Neurocomputing 2020, 390, 185–195. [Google Scholar] [CrossRef]
- Lee, D. Nonlinear disturbance observer-based robust control for spacecraft formation flying. Aerosp. Sci. Technol. 2018, 76, 82–90. [Google Scholar] [CrossRef]
- Yuan, W.; Gao, G. Sliding mode control of the automobile electro-coating conveying mechanism with a nonlinear disturbance observer. Adv. Mech. Eng. 2018, 10, 1687814018795748. [Google Scholar] [CrossRef]
- Orozco Soto, S.M.; Cacace, J.; Ruggiero, F.; Lippiello, V. Active Disturbance Rejection Control for the Robust Flight of a Passively Tilted Hexarotor. Drones 2022, 6, 258. [Google Scholar] [CrossRef]
- Wang, Y.; Sun, J.; He, H.; Sun, C. Deterministic policy gradient with integral compensator for robust quadrotor control. IEEE Trans. Syst. Man Cybern. Syst. 2019, 50, 3713–3725. [Google Scholar] [CrossRef]
- Li, C.; Wang, Y.; Yang, X. Adaptive fuzzy control of a quadrotor using disturbance observer. Aerosp. Sci. Technol. 2022, 128, 107784. [Google Scholar] [CrossRef]
- Fan, Y.; Guo, H.; Han, X.; Chen, X. Research and verification of trajectory tracking control of a quadrotor carrying a load. Appl. Sci. 2022, 12, 1036. [Google Scholar] [CrossRef]
- Wang, B.; Yu, X.; Mu, L.; Zhang, Y. Disturbance observer-based adaptive fault-tolerant control for a quadrotor helicopter subject to parametric uncertainties and external disturbances. Mech. Syst. Signal Process. 2019, 120, 727–743. [Google Scholar] [CrossRef]
- Fei, Y.; Shi, P.; Lim, C.C. Robust and collision-free formation control of multiagent systems with limited information. IEEE Trans. Neural Netw. Learn. Syst. 2021, 34, 4286–4295. [Google Scholar] [CrossRef]
- Fei, Y.; Shi, P.; Lim, C.C. Robust formation control for multi-agent systems: A reference correction based approach. IEEE Trans. Circuits Syst. Regul. Pap. 2021, 68, 2616–2625. [Google Scholar] [CrossRef]
- Xia, R.; Wu, Q.; Shao, S. Disturbance observer-based optimal flight control of near space vehicle with external disturbance. Trans. Inst. Meas. Control 2020, 42, 272–284. [Google Scholar] [CrossRef]
- Sun, J.; Liu, C. Disturbance observer-based robust missile autopilot design with full-state constraints via adaptive dynamic programming. J. Frankl. Inst. 2018, 355, 2344–2368. [Google Scholar] [CrossRef]
- Zhang, H.; Cui, L.; Zhang, X.; Luo, Y. Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Trans. Neural Netw. 2011, 22, 2226–2236. [Google Scholar] [CrossRef] [PubMed]
- Xu, N.; Niu, B.; Wang, H.; Huo, X.; Zhao, X. Single-network ADP for solving optimal event-triggered tracking control problem of completely unknown nonlinear systems. Int. J. Intell. Syst. 2021, 36, 4795–4815. [Google Scholar] [CrossRef]
- Xia, R.; Wu, Q.; Chen, M. Disturbance observer-based optimal longitudinal trajectory control of near space vehicle. Sci. China Inf. Sci. 2019, 62, 1–3. [Google Scholar] [CrossRef]
- Sun, J.; Liu, C. Backstepping-based adaptive dynamic programming for missile-target guidance systems with state and input constraints. J. Frankl. Inst. 2018, 355, 8412–8440. [Google Scholar] [CrossRef]
- Wang, D.; Liu, D.; Li, H.; Ma, H. Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming. Inf. Sci. 2014, 282, 167–179. [Google Scholar] [CrossRef]
- Zheng, S.; Shi, P.; Wang, S.; Shi, Y. Adaptive neural control for a class of nonlinear multiagent systems. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 763–776. [Google Scholar] [CrossRef] [PubMed]
- Fan, Q.Y.; Yang, G.H. Adaptive actor–critic design-based integral sliding-mode control for partially unknown nonlinear systems with input disturbances. IEEE Trans. Neural Netw. Learn. Syst. 2015, 27, 165–177. [Google Scholar] [CrossRef] [PubMed]
- Vamvoudakis, K.G.; Lewis, F.L. Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 2010, 46, 878–888. [Google Scholar] [CrossRef]
- Liu, D.; Xue, S.; Zhao, B.; Luo, B.; Wei, Q. Adaptive dynamic programming for control: A survey and recent advances. IEEE Trans. Syst. Man Cybern. Syst. 2020, 51, 142–160. [Google Scholar] [CrossRef]
- Zhao, B.; Liu, D.; Luo, C. Reinforcement learning-based optimal stabilization for unknown nonlinear systems subject to inputs with uncertain constraints. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 4330–4340. [Google Scholar] [CrossRef] [PubMed]
- Wang, D.; Liu, D.; Zhang, Y.; Li, H. Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems. Neural Netw. 2018, 97, 11–18. [Google Scholar] [CrossRef] [PubMed]
- Liu, D.; Wei, Q.; Wang, D.; Yang, X.; Li, H. Adaptive Dynamic Programming with Applications in Optimal Control; Springer International Publishing: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Lewis, F.L.; Jagannathan, S.; Yesildirek, A. Neural Network Control of Robot Manipulators and Nonlinear Systems; Taylor & Francis: London, UK, 1999. [Google Scholar]
- Castillo, A.; Sanz, R.; Garcia, P.; Qiu, W.; Wang, H.; Xu, C. Disturbance observer-based quadrotor attitude tracking control for aggressive maneuvers. Control Eng. Pract. 2019, 82, 14–23. [Google Scholar] [CrossRef]
- Mobayen, S.; El-Sousy, F.F.; Alattas, K.A.; Mofid, O.; Fekih, A.; Rojsiraphisal, T. Adaptive fast-reaching nonsingular terminal sliding mode tracking control for quadrotor UAVs subject to model uncertainties and external disturbances. Ain Shams Eng. J. 2023, 14, 102059. [Google Scholar] [CrossRef]
- Shao, X.; Yue, X.; Li, J. Event-triggered robust control for quadrotors with preassigned time performance constraints. Appl. Math. Comput. 2021, 14, 102059. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).