1. Introduction
Unmanned aerial vehicles (UAVs) have attracted significant attention from research institutes, companies and industries over the last decade, owing to their wide applications in environment mapping, safety monitoring, disaster prevention and management, service delivery, etc. [1]. The quadrotor UAV is the most widely used vertical take-off and landing aircraft with flexible maneuverability.
For the quadrotor UAV, the highly nonlinear, underactuated dynamics pose essential challenges in designing an effective flight controller with robust stability and high performance. Many linear techniques have been designed to achieve full control of the quadrotor UAV, such as cascade PID control [2,3] and LQR control [4]. However, for such a highly nonlinear system, linearized control methods are of limited use for analyzing system robustness and anti-interference ability. Therefore, to overcome the limitations of the linear control methods, nonlinear control methods, such as feedback linearization [5,6,7], the backstepping approach [8,9], sliding mode control (SMC) [10,11,12], observer-based robust control [13,14,15] and adaptive control approaches [16,17,18,19], have been investigated to achieve good stability and tracking performance. Additionally, research works such as refs. [20,21,22] introduced the prescribed performance method to keep the convergence rate and the steady-state error inside prescribed bounds. Among the aforementioned methods, observer-based backstepping approaches have been widely adopted by researchers and demonstrate great advantages in the control of the quadrotor UAV system [9,23]. However, a significant drawback is that backstepping control is sensitive to the initial error due to the repeated differentiation of the virtual controls [24]. To overcome this limitation, dynamic surface control (DSC) [25,26] is adopted in this work and integrated with the prescribed performance technique to guarantee bounded convergence and steady-state errors.
With the integration of micro-electro-mechanical systems (MEMS), positioning technologies and fusion algorithms, the real-time translational and rotational states of the quadrotor UAV can be obtained for state and attitude estimation [27,28,29]. Various state or disturbance observers have been integrated with controllers to enhance stability and performance robustness [30]. For instance, in ref. [31], Liu et al. designed two novel finite-time disturbance observers for the position dynamics and attitude dynamics separately. The control gains need only be slightly greater than the observation error rather than the disturbances, which is more practical for implementation, but the system states are obtained directly from measurements. To observe both the states and disturbances, in [32], Xi et al. proposed an adaptive sliding mode disturbance observer for a robot manipulator system, in which a backstepping-based auxiliary system with error feedback is used as a system state observer. Compared to the disturbance observer, the extended state observer (ESO) [33], which has a concise structure and can estimate both the states of the dynamics and the lumped disturbances from the system outputs, is widely studied [22,34,35]. In ref. [15], Zhang et al. applied six conventional ESOs to estimate the disturbances acting on the quadrotor UAV system. Liu et al. [36] designed a finite-time ESO for attitude tracking of a quadrotor UAV using angular rates as the observer feedback. In the work of [37], Niu et al. applied a finite-time ESO to estimate the disturbance for the terminal sliding mode surface. Although the above-mentioned schemes achieve their observation purpose, all these ESOs are only effective on integral-chain systems with matched disturbances, using linear observer gains and a single measurement. The quadrotor UAV dynamics are usually decomposed into an underactuated subsystem and a fully actuated (UF) subsystem [38], where the former is a fourth-order pure-feedback system with both matched and unmatched disturbances. To remove the effect of matched or unmatched disturbances from the output, Chen et al. proposed two generalized approaches for the unmatched-disturbance case, based on an ESO [39] and a nonlinear disturbance observer [40]; the unmatched disturbance is effectively attenuated from the output via a disturbance compensation gain. However, the estimation of both matched and unmatched disturbances with only one observer is still challenging, especially for the quadrotor UAV system. Consequently, in this work, we propose a nonlinear ESO to estimate both the matched and unmatched disturbances of the quadrotor UAV system.
The input saturation caused by limited propeller thrust forces often constrains the performance severely, and can even dominate the stability of the quadrotor UAV [41]. Thus, input constraints are a practically important issue for control problems, and extensive research works have considered stabilizing a nonlinear system with input saturation [42,43,44]. In ref. [41], Wang et al. introduced the hyperbolic function and the Nussbaum function to design a priori bounded control inputs for the trajectory tracking control of a quadrotor UAV. Liu et al. in [23] applied two input saturation functions for the position dynamics and attitude dynamics separately, with two auxiliary control systems to compensate for the saturation effects. However, all these researchers focused only on the synthetic torque and force control inputs, whereas for the quadrotor UAV system the limited force generated by each single rotor presents a more practical constraint. Thus, an input saturation algorithm on each single rotor of the quadrotor UAV is proposed, and an auxiliary system is designed in this work to compensate for the saturation effects.
A quadrotor UAV mounted with a precise inspection instrument can meet the requirements for conducting health monitoring in limited spaces, such as tunnel safety patrolling, as shown in Figure 1. Highly aggressive maneuvers are not allowed, and the body inclination angle
has to be within safety ranges. A large inclination angle of the quadrotor UAV will also cause insufficient lift force and further affect the system stability and performance. Thus, the attitude constraint turns out to be an important issue. Input saturation for the position dynamics was considered in refs. [23,41], and it can be seen as a bound for calculating the desired attitude angles; however, these works did not directly consider the attitude constraints when designing the trajectory tracking controller for the quadrotor UAV. To realize the attitude constraint for the quadrotor UAV, a saturation function is added to the inclination angle of the body with an auxiliary system.
As discussed above, input saturation and prescribed performance control have been taken into account in most existing research works. It is also of practical significance to consider the attitude constraint in the control of the quadrotor UAV. Compared to other control schemes, a backstepping-type DSC approach is more suitable for implementing the attitude constraint with a compensating auxiliary system. In this work, we focus on the prescribed performance control issue for the quadrotor UAV in the presence of both input saturation and attitude constraints, where the uncertainties and disturbances are approximated and attenuated through a novel nonlinear ESO-based DSC. The main contributions of this paper are as follows:
- 1. The control scheme is developed by the DSC technique with two auxiliary systems designed for the attitude and input saturation constraints. Additionally, the prescribed performance method provides a more intuitive way to adjust the tracking speed and steady-state error.
- 2. Considering the limitations of existing ESOs, two nonlinear ESOs are developed for approximating the pure-feedback subsystems of the quadrotor UAV. Under this scheme, only one ESO is utilized for each quadrotor UF subsystem to estimate both the matched and unmatched disturbances with multiple state observations. Thus, a uniform convergence speed can be obtained, and the complexity of the observer's parameter adjustment is reduced compared to conventional ESO designs.
- 3. To improve the control stability of the quadrotor UAV, the input saturation constraint is modified to act on the thrust force generated by each rotor rather than on the synthetic torque and force control inputs. Furthermore, the attitude constraint is taken into account for the first time for stabilizing a quadrotor UAV. The constraint is realized by a saturation function with an auxiliary system as compensation to keep the inclination of the quadrotor UAV within the safety region.
The rest of this paper is organized as follows. In Section 2, the problem formulation and preliminaries are illustrated for a typical configuration of the quadrotor UAV. Section 3 introduces and proves a nonlinear ESO with multi-measurement feedback and varying observer gain. The main results, which are the design of the proposed controller and the stability analysis of the closed-loop system including the ESO system, are provided in Section 4. Then, several cases are simulated to validate the effectiveness of the proposed method in Section 5. The conclusion is given in Section 6.
3. ESO Design
Most existing ESOs are only effective on integral-chain systems with matched disturbances, using linear observer gains and a single measurement. For the UF subsystems of the quadrotor UAV, however, the difficulty lies in approximating both the matched and unmatched disturbances. By taking advantage of the coordinate transformation from the feedback linearization technique, the quadrotor UF subsystems can be reformulated into Brunovsky systems. Then, the matched and unmatched disturbances of the quadrotor system are added as augmented states, and the whole system is estimated using the ESO technique.
Considering the UF subsystems (2) and (3), the dynamics are augmented with the lumped disturbances and rewritten in ESO form as
with
where
are the observation matrices,
are the distribution matrix of disturbances,
are the observations,
and
are the observer feedback matrices.
Remark 2. From the state space dynamics, one can find that (2) is an affine-in-control pure-feedback system, and (3) is a strict-feedback system. Additionally, (2) and (3) contain unmatched disturbances. In order to compute the nonlinear feedback matrices, a coordinate transformation is used to transform the pure-feedback system (2) and the strict-feedback system (3) into canonical systems. According to the feedback linearization method [49], define alternative state variables
and
as
where
,
,
and
. The lumped disturbances
are considered independent of the state variables
. Therefore, according to (17) and (18), the state transformation matrices that map
to
can be obtained as
where
,
,
,
and
.
is given as
and the details of variables
are shown in Appendix A.
Introducing the observer bandwidth
in the high-gain observer method [50], the bandwidth matrices are defined as
Using the coordinate transformation matrices (19) and bandwidth matrices (21), the nonlinear ESO observer feedback matrices in (15) and (16) are obtained as
where
are gain matrices with positive entries.
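The role of the observer bandwidth can be illustrated on the simplest linear case. The following is a minimal sketch, not the nonlinear ESO of this paper: it assumes a single-output integrator chain and the common binomial gain pattern, so that all observer poles sit at the negative bandwidth. The function names `eso_gains` and `eso_error_matrix` and the numerical values are hypothetical.

```python
import numpy as np
from math import comb


def eso_gains(order: int, bandwidth: float) -> np.ndarray:
    """Gains that place every pole of a linear ESO at -bandwidth (binomial pattern)."""
    return np.array(
        [comb(order, i) * bandwidth**i for i in range(1, order + 1)], dtype=float
    )


def eso_error_matrix(order: int, bandwidth: float) -> np.ndarray:
    """Error-dynamics matrix A - L C for an integrator chain measured at its first state."""
    A = np.diag(np.ones(order - 1), k=1)      # chain of integrators
    C = np.zeros((1, order))
    C[0, 0] = 1.0                             # only the first state is measured
    L = eso_gains(order, bandwidth).reshape(-1, 1)
    return A - L @ C


# Assumed example: third-order ESO (two states plus one lumped disturbance).
poles_slow = np.linalg.eigvals(eso_error_matrix(3, 5.0))
poles_fast = np.linalg.eigvals(eso_error_matrix(3, 10.0))
print(poles_slow.real, poles_fast.real)
```

Doubling the bandwidth in this sketch moves every pole twice as far into the left half-plane, which is the scalar amplification of the pole locations that the bandwidth matrices are meant to provide.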
To prove the convergence of the proposed ESO, the canonical form of (15) and (16) can be computed using the following relationship as
Define the observer estimation error as
Using the mean value theorem [51], the error dynamics of the augmented subsystems (2) and (3) under the ESOs (15) and (16) can be obtained from (15)–(26) as
with
where
are the columns of
and
are the columns of
.
are the transfer matrices in canonical form as
Remark 3. The result of adding the observer bandwidth is a scalar amplification of the pole locations, that is, of all eigenvalues, which can be seen from (27) and (28). Since multiple disturbances appear in the subsystems (2) and (3), observability is essential and can be checked from
where
and
are the transfer matrix and observation matrix of the augmented systems, respectively. From subsystems (2), (3) and Assumption 1, the observability matrices can be calculated and have a rank of 7 for (2) and 4 for (3). Therefore, the corresponding states and lumped disturbances are observable. Note that the number of observations can be reduced as long as observability is preserved.
The gain matrices
are selected as follows such that
and
are Hurwitz matrices.
Hence, the observer bandwidth becomes the only tuning parameter of the observer and is determined by the system dynamics. The stability of the proposed ESO is established as follows.
Theorem 1. Considering the ESO error dynamics in (27) and (28), there exist two positive definite matrices and satisfying and . Under Assumptions 1, 2 and 3, the uniformly bounded stability of the proposed nonlinear ESOs for the quadrotor UAV system can be guaranteed if the observer bandwidth is selected such that where , and .
Proof of Theorem 1. Choose the following Lyapunov candidate function as
Substituting (27) and (28) into the derivative of V yields
where
with
,
and
. From (31),
if
, and therefore, it can be concluded that the error dynamics (27) and (28) are uniformly bounded as
where
and
denote the maximum and minimum eigenvalues of the matrix P. The proof is finished. □
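The existence of a positive definite matrix for a Hurwitz error matrix can be checked numerically. The sketch below assumes the standard Lyapunov equation AᵀP + PA = −Q and solves it through a Kronecker-product linear system; the matrix `A_obs` (a linear ESO error matrix with all poles at −2) and the function name `solve_lyapunov` are illustrative assumptions rather than quantities taken from the paper.

```python
import numpy as np


def solve_lyapunov(A: np.ndarray, Q: np.ndarray) -> np.ndarray:
    """Solve A^T P + P A = -Q for P using column-major vectorization."""
    n = A.shape[0]
    identity = np.eye(n)
    # vec(A^T P) = (I kron A^T) vec(P) and vec(P A) = (A^T kron I) vec(P)
    K = np.kron(identity, A.T) + np.kron(A.T, identity)
    p = np.linalg.solve(K, -Q.flatten(order="F"))
    return p.reshape((n, n), order="F")


# Assumed Hurwitz matrix: third-order linear ESO error dynamics, poles at -2.
A_obs = np.array([
    [-6.0, 1.0, 0.0],
    [-12.0, 0.0, 1.0],
    [-8.0, 0.0, 0.0],
])
P = solve_lyapunov(A_obs, np.eye(3))
eig_P = np.linalg.eigvalsh(0.5 * (P + P.T))
print(eig_P.min(), eig_P.max())  # extreme eigenvalues of P used in the error bound
```

The minimum and maximum eigenvalues printed at the end play the role of the eigenvalue bounds that appear in the final estimate of the proof.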
4. Controller Design
In this section, the prescribed performance tracking control strategy for the quadrotor UAV is proposed based on the DSC technique with attitude and input constraints. The estimated state vector , and lumped disturbances, , are obtained from the ESO proposed in the previous section. For clarity, the following notations are defined in this section:
- (1)
are the estimated variables,
- (2)
are the estimated errors,
- (3)
and are the maximum and minimum eigenvalues of matrix A.
4.1. Controller Design for Underactuated System
Define
as the first surface error. To achieve guaranteed tracking performance of
, the transformed error in (10) and its derivative are
where
and
.
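A prescribed performance bound and an error transformation can be sketched as follows. The exponential bound and the inverse hyperbolic tangent transformation used here are common choices in the prescribed performance literature and are assumptions; the exact functions in (8) and (10) are not reproduced in this section. The names `performance_bound` and `transformed_error` and all numerical values are hypothetical.

```python
import math


def performance_bound(t: float, rho0: float, rho_inf: float, decay: float) -> float:
    """Exponentially shrinking bound: starts at rho0 and settles at rho_inf."""
    return (rho0 - rho_inf) * math.exp(-decay * t) + rho_inf


def transformed_error(e: float, rho: float) -> float:
    """Map an error constrained to (-rho, rho) onto the whole real line."""
    ratio = e / rho
    if abs(ratio) >= 1.0:
        raise ValueError("tracking error is outside the prescribed bound")
    return math.atanh(ratio)


# Assumed values: initial bound 2.0, final bound 0.1, decay rate 1.5 per second.
for t in (0.0, 1.0, 4.0):
    rho = performance_bound(t, 2.0, 0.1, 1.5)
    print(t, rho, transformed_error(0.05, rho))
```

Keeping the transformed error bounded forces the original error to stay inside the shrinking bound, which is how the convergence rate and steady-state error are shaped.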
The virtual control variable
is designed as
where
is a positive definite matrix.
Following the principle of the DSC technique, introduce a variable
and pass the virtual control
through a first-order low-pass filter with the positive definite time constant matrix
as
Let
denote the filtering error, then the filtering error dynamics can be derived as
where
is a continuous function.
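The filtering step can be sketched in scalar form. The example assumes the standard DSC filter, in which the time constant times the filter-state derivative plus the filter state equals the virtual control, integrated with forward Euler; the test signal sin(t), the time constant and the step size are arbitrary assumptions.

```python
import math


def dsc_filter_step(x_f: float, alpha: float, tau: float, dt: float):
    """One Euler step of tau * x_f' + x_f = alpha; returns the new state and its rate."""
    x_f_dot = (alpha - x_f) / tau
    return x_f + dt * x_f_dot, x_f_dot


def run_filter(tau: float = 0.01, dt: float = 0.001, t_end: float = 1.0):
    """Filter alpha(t) = sin(t), starting from x_f(0) = alpha(0)."""
    x_f = math.sin(0.0)
    x_f_dot = 0.0
    for k in range(int(round(t_end / dt))):
        x_f, x_f_dot = dsc_filter_step(x_f, math.sin(k * dt), tau, dt)
    return x_f, x_f_dot


x_final, rate_final = run_filter()
print(x_final, math.sin(1.0))      # filter output against the virtual control
print(rate_final, math.cos(1.0))   # filter rate against the analytic derivative
```

The filter rate stands in for the analytic derivative of the virtual control, so the next design step needs no repeated differentiation; the price is the filtering error analyzed in the text.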
Consider the Lyapunov function candidate
, and its derivative with respect to time is
Invoking Young’s inequality, error transform (26) and bounds (14), the inequality above can be further expressed as
Define
as the second surface error, and its derivative is
The virtual control variables
are designed as
where
are positive definite matrices, and
is a variable from the following auxiliary system for compensating the effect of state constraints.
where
is the error of the state constraints,
is a positive definite matrix, and
is a small positive constant.
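A scalar sketch of such an auxiliary system is given below. It assumes a switching form that is widely used for saturation compensation, since equation (41) itself is not reproduced here: the state is frozen once its magnitude drops below the small threshold, and otherwise it is driven by the constraint error. The function name `auxiliary_rate`, its arguments and the numbers are hypothetical.

```python
def auxiliary_rate(xi: float, s: float, delta: float, gain: float, eps: float) -> float:
    """Rate of the auxiliary state xi for surface error s and constraint error delta."""
    if abs(xi) < eps:
        return 0.0  # below the threshold the auxiliary state is held constant
    damping = (abs(s * delta) + 0.5 * delta**2) / xi**2
    return -gain * xi - damping * xi + delta


def settle(xi0: float, gain: float, eps: float, dt: float, steps: int) -> float:
    """Euler simulation with zero constraint error: xi decays, then freezes."""
    xi = xi0
    for _ in range(steps):
        xi += dt * auxiliary_rate(xi, s=0.0, delta=0.0, gain=gain, eps=eps)
    return xi


xi_final = settle(xi0=1.0, gain=2.0, eps=0.01, dt=0.01, steps=2000)
print(xi_final)  # a small constant just below eps
```

With zero constraint error the state decays and then stays at a small constant, which matches the qualitative behavior described for the case where the derivative of the auxiliary variable is zero.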
Remark 4. As for the variable in (40), which is defined in (41), its value is dependent on . If the derivative of is not equal to zero in the auxiliary system, the result of might render the virtual control smaller and close to . Therefore, the saturation error can be compensated by . When the derivative of is equal to zero, the result of is a small constant value, and it may affect the virtual control slightly since is a small constant.
Remark 5. The Euler angles ϕ, θ and ψ are combined in a fixed order to form the rotation matrix in (1). Thus, the actual inclination angles along the x-axis and y-axis with respect to the earth frame are ϕ and , respectively. Here, the trigonometric variables are used as and , which is in (2). The state constraint is exerted on to ensure that the total inclination angle stays within .
Introducing a variable
, we pass the constrained virtual control
through a first-order low-pass filter with the positive definite time constant matrix
as
Let
denote the filtering error, then the filtering error dynamics can be derived as
where
is a continuous function.
Consider the Lyapunov function candidate
, when
. Its derivative with respect to time is
According to the definition of matrix
, one can conclude that
. Invoking Young’s inequality, the
can be further expressed as
When
, the last term is
, thus, its derivative with respect to time is
Synthesizing (45) and (46), the inequality becomes
Define
as the third surface error, and its derivative is
where
is the Jacobian matrix of
in (20), whose determinant equals
. Therefore,
is non-singular under Assumption 3.
The virtual control variable
is designed as
where
is a positive definite matrix.
Introducing a variable
, we pass the constrained virtual control
through a first-order low-pass filter with the positive definite time constant matrix
as
Let
denote the filtering error, then the filtering error dynamics can be derived as
where
is a continuous function.
Consider the Lyapunov function candidate
. Its derivative is
From (2) and (20), after the matrix operation, one can conclude that
and
are satisfied. Invoking Young’s inequality,
can be further expressed as
Define
as the fourth surface error, and its derivative is
The virtual control variable
is designed as
where
are positive definite matrices. The control input
represents the input torques along the x-axis and y-axis with respect to the body frame of the quadrotor UAV, which are compound control variables of the four thrust forces generated by the rotors shown in (5). Thus, the saturation constraints cannot be implemented directly. The design of the input saturation is introduced in Section 4.3.
4.2. Controller Design for Fully Actuated System
Define
as the fifth surface error. To achieve guaranteed tracking performance of
, the transformed error in (10) and its derivative are
where
and
.
The virtual control variable
is designed as
where
is a positive definite matrix.
Introduce a variable
, and pass the virtual control
through a first-order low-pass filter with the positive definite time constant matrix
as
Let
denote the filtering error. Following the same procedure as in (36), the filtering error dynamics can be derived with (12) and (57) as
where
is a continuous function.
Consider the Lyapunov function candidate
. Its derivative with respect to time is
Define
and
since the entry
in matrix
is bounded under the state constraints. Invoking Young’s inequality, the
can be further expressed as
Define
as the sixth surface error, and its derivative is
The virtual control variable
is designed as
where
are positive definite matrices. The control input
represents the input torque and input force along the z-axis with respect to the body frame of the quadrotor UAV in which the saturation constraints cannot be implemented directly. The design of the input saturation is introduced in the next step.
4.3. Constraint Design for Actuator System
The input saturation is related to the thrust force generated by each rotor of the quadrotor UAV. The virtual control variables and represent the input torques along the three axes and the input force along the z-axis, on which saturation constraints cannot be imposed directly. Thus, a switch matrix and an auxiliary system are proposed to fulfill the actuator saturation constraints.
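The idea of saturating each rotor rather than the synthetic inputs can be sketched as follows. The mixing convention (plus configuration, rotor 1 at the front, rotor 2 on the left), the arm length, the drag-to-thrust ratio and the thrust limit are all assumptions made for illustration; the actual matrix in (5) is not reproduced here.

```python
ARM = 0.2     # hypothetical arm length [m]
DRAG = 0.02   # hypothetical drag-to-thrust ratio [m]
F_MAX = 5.0   # hypothetical per-rotor thrust limit [N]


def allocate(thrust, tau_x, tau_y, tau_z):
    """Invert the assumed mixer: commanded wrench -> four rotor forces."""
    base = thrust / 4.0
    roll = tau_x / (2.0 * ARM)
    pitch = tau_y / (2.0 * ARM)
    yaw = tau_z / (4.0 * DRAG)
    return [base - pitch + yaw, base + roll - yaw, base + pitch + yaw, base - roll - yaw]


def mix(forces):
    """Assumed mixer: four rotor forces -> [thrust, tau_x, tau_y, tau_z]."""
    f1, f2, f3, f4 = forces
    return [f1 + f2 + f3 + f4, ARM * (f2 - f4), ARM * (f3 - f1),
            DRAG * (f1 - f2 + f3 - f4)]


def constrained_wrench(command, f_min=0.0, f_max=F_MAX):
    """Clip every rotor force, then report the realized wrench and saturation error."""
    forces = [min(max(f, f_min), f_max) for f in allocate(*command)]
    realized = mix(forces)
    error = [r - c for r, c in zip(realized, command)]
    return forces, realized, error


forces, realized, error = constrained_wrench([8.0, 2.0, 0.0, 0.0])
print(forces, realized, error)
```

In this sketch the saturation error is the difference between the realized and commanded wrench, which is the kind of signal an auxiliary system would compensate. Clipping two rotors also perturbs thrust and yaw, which a limit placed only on the synthetic inputs would not reveal.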
Define the final real control variable
as
where
is the matrix in (5) and
is a variable from the following auxiliary system for compensating the effect of input saturation.
where
,
is the error of the input saturation,
is a positive definite matrix and
is a small positive constant.
Remark 6. As for the variable in (64), which is defined in (65), its value is dependent on . If the derivative of is not equal to zero in the auxiliary system, the result of might render the real control smaller and close to . Therefore, the saturation error can be compensated by . When the derivative of is equal to zero, the result of is a small constant value, and it may affect the virtual control slightly since is a small constant.
Consider the Lyapunov function candidate
, when
. Its derivative with respect to time is
Define
,
. Invoking Young’s inequality, the
can be further expressed as
When
, the last term is
, thus, its derivative with respect to time is
Synthesizing (67) and (68), the inequality becomes
According to the above procedures, the structure of the proposed controller is shown in Figure 3.
4.4. Stability Analysis
For clarity, define vectors , , and .
The whole Lyapunov function candidate is considered as
where
is a positive definite matrix satisfying
and
.
For the low-pass filter error dynamics
, according to the bound properties in Assumptions 1, 2, 3 and 4, there exist positive constants
such that
[14,24].
From (17), (18) and (30), it follows that
, the maximum value of
is 1, and the minimum values of
,
and
are related to the inclination angles of the quadrotor UAV, which are predefined within valid ranges as in Assumption 3. Therefore, define
as the minimum value of
where
.
Define a set of known constants as
Theorem 2. Consider the quadrotor UAV system in the UF frame (2), (3) under Assumptions 1, 2 and 3, with the ESO designed with the proposed feedback matrices in (22), (23), the controller designed in (34), (40), (49), (55), (57), (63), (64), the low-pass filters designed in (35), (42), (50), (58), and the auxiliary systems designed in (41), (65). By properly selecting the gains , , and bandwidth , such that the matrix
in the following is positive definite, the error dynamics are uniformly ultimately bounded, and the prescribed performance of the states and is satisfied.
where
Proof of Theorem 2. Substituting the corresponding
and observer error dynamics (27), (28) into the derivative of (70) results in
where
. Since the matrix
defined in (72) is positive definite,
where
and
. Thus, the right-hand side of (70) is upper bounded as
Therefore, (70) and (76) indicate that
s,
z,
and
are uniformly ultimately bounded. According to Lemma 1, the tracking errors
and
remain within the prescribed bounds defined in (8). By appropriately choosing the constants
,
and performance function
, the states
, which are
and
, can track the desired trajectory
with guaranteed errors. The proof is finished. □
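The step from a differential inequality on the Lyapunov function to uniform ultimate boundedness follows the standard comparison argument. It is restated here in generic form, where α, β and P are placeholder symbols (assumptions for illustration) rather than the specific constants and matrices of this proof.

```latex
\dot V \le -\alpha V + \beta,\qquad \alpha > 0,\ \beta \ge 0
\;\Longrightarrow\;
V(t) \le \Bigl(V(0) - \frac{\beta}{\alpha}\Bigr) e^{-\alpha t} + \frac{\beta}{\alpha},
\qquad
\lambda_{\min}(P)\,\lVert x \rVert^{2} \le V
\;\Longrightarrow\;
\limsup_{t \to \infty} \lVert x(t) \rVert \le \sqrt{\frac{\beta}{\alpha\,\lambda_{\min}(P)}} .
```

Under these assumptions, a larger decay rate α or a smaller residual β shrinks the ultimate bound, which is why the gains and the observer bandwidth affect the size of the residual set.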
5. Simulation Results
In this section, simulations are carried out to demonstrate the effectiveness of the prescribed performance with the attitude and input saturation controller (PPAISC) for the quadrotor UAV. Additionally, the performance of the proposed nonlinear ESO based DSC is also illustrated.
The dynamics of a typical quadrotor UAV are governed by Equation (1) with the physical parameters shown in Table 1, taken from a small-size quadrotor UAV platform [52]. All the physical parameters were obtained through experimental parameter identification, specifications and theoretical analysis.
The control parameters are selected, satisfying Theorem 2, as the following values:
In order to show the effectiveness of the proposed control scheme under attitude and input saturation, three other controllers are also simulated for comparison: the prescribed performance control using the traditional DSC approach (PPDSC) without saturation constraints, the active disturbance rejection control (ADRC) strategy in the work of Zhang et al. [15], and the cascade-PID (CPID) control strategy [2], which is the most widely used in open-source projects and real applications. To this end, the CPID controller is designed as follows:
where
are the states and control inputs of quadrotor UAV. Additionally, the CPID controller needs a conversion algorithm between position control output
and attitude control input
as
For a fair comparison, the control parameters of the CPID and ADRC are tuned by trial and error such that the four controllers have nearly identical convergence rates, and the parameters of the PPDSC are chosen to be the same as those of the PPAISC. The initial position is at zero, and the target position is given at with the heading angle held at zero.
The disturbances can be caused by the external wind, air drag, blade flapping and installation error of four rotors, etc., and will cause both extra force and torque on the quadrotor UAV. Therefore, two kinds of disturbances are considered here: (1) external forces
on the translational dynamics, which are unmatched disturbances for the UF subsystem (2), and (2) external torques
on the rotational dynamics, which are matched disturbances for the UF subsystem (2). These two variables are defined as follows:
The comparative results are shown in Figure 4, Figure 5, Figure 6 and Figure 7. The time response of the positions is depicted in Figure 4a. With the prescribed performance approach, PPDSC and PPAISC stay within the required bounds both during the transient convergence and at steady state. The response errors of all controllers converge to a small neighborhood of the target position; here, we take the x-axis as representative, as shown in Figure 4b. The steady-state errors for PPDSC and PPAISC are largely reduced at about 4.5 s compared to CPID and ADRC, as shown in the zoomed section in Figure 4b. Comparing PPDSC and PPAISC in the zoomed section of Figure 4b, one can conclude that the steady-state errors are slightly affected by the two auxiliary systems added for the attitude and input saturation constraints, which introduce the bounded variables
and
.
Although the backstepping-type control scheme shows a strong capability in stabilizing nonlinear systems, its obvious drawback is the steep variation of the control variables at the start. For a quadrotor UAV, this phenomenon appears as a large inclination angle of the body, as shown in Figure 5. Under the same convergence rates, our proposed attitude saturation method effectively constrains the inclination angle,
in (4), within the predefined requirement (15 degrees).
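Checking a simulated trajectory against such an inclination limit can be sketched as below. The sketch assumes the usual ZYX Euler convention, for which the tilt of the body z-axis from the vertical has cosine cos ϕ cos θ; the paper's own definition in (4) is not reproduced here, and the sample angles are invented for illustration.

```python
import math


def inclination_deg(roll: float, pitch: float) -> float:
    """Total tilt of the body z-axis from vertical, in degrees (angles in radians)."""
    cosine = max(-1.0, min(1.0, math.cos(roll) * math.cos(pitch)))
    return math.degrees(math.acos(cosine))


def max_inclination_deg(samples) -> float:
    """Largest tilt over a sequence of (roll, pitch) samples."""
    return max(inclination_deg(roll, pitch) for roll, pitch in samples)


# Hypothetical attitude samples in radians.
trajectory = [(0.0, 0.0),
              (math.radians(10.0), math.radians(10.0)),
              (math.radians(5.0), math.radians(-3.0))]
print(max_inclination_deg(trajectory))
```

Note that a roll and a pitch of 10 degrees each already give a total tilt of about 14.1 degrees, so limiting each Euler angle separately is not the same as limiting the total inclination.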
Figure 6 shows the simulation results of disturbance estimation for ADRC, PPDSC and PPAISC. All three controllers integrated with an ESO successfully estimate
and
at a steady state. The main difference is that, in ADRC, the six ESOs are independent of each other, one inside each control channel, and only a single measurement is considered for the observation feedback. For the proposed nonlinear ESO in PPAISC, the translational and rotational dynamics are considered together in one high-order ESO. Thus, using multiple observation feedback, a uniform convergence speed of estimation can be reached. The simulation results exhibit a small variation during the response and fast tracking of the disturbance estimation compared to the ADRC method. In Figure 6b, the disturbance estimation for PPDSC shows large deviations from 0 to 0.3 s, which are caused by the unmodeled limitation of the input variables, as shown in Figure 7. In contrast, in the proposed PPAISC with input saturation, the out-of-range control variables are handled by the output of the auxiliary systems, so that constrained control variables are used for the estimation of disturbances in the ESO. Furthermore, the system stability is also enhanced.
To quantitatively compare the response performance of the various controllers, six performance indices are used as follows:
- (1) Integral squared errors (ISE) of position are defined as [31]
where
,
and
are the position errors; a controller with a lower ISE index therefore converges faster.
- (2) Integral time-multiplied absolute errors (ITAE) of position are defined as [31]
Unlike ISE, ITAE weights the steady-state error more heavily than the initial response, so a controller with a lower ITAE index has smaller steady-state errors.
- (3) Maximum inclination angle (MIA) is .
- (4) Variance of thrust force (VTF). A controller with a lower VTF index has a smoother output and less aggressive maneuvers.
- (5) Root mean square error of estimated force disturbances (RMSEEFD). An observer with a lower RMSEEFD index has a faster convergence speed and fewer oscillations during estimation of the force disturbances on the translational dynamics.
- (6) Root mean square error of estimated torque disturbances (RMSEETD). An observer with a lower RMSEETD index has a faster convergence speed and fewer oscillations during estimation of the torque disturbances on the rotational dynamics.
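The integral and root-mean-square indices can be computed from sampled data as sketched below. The summation over the three position axes and the trapezoidal integration are assumptions, since the defining equations are not reproduced here; the function names `ise`, `itae` and `rmse` and the sample data are hypothetical.

```python
import math


def trapezoid(values, times):
    """Trapezoidal integral of sampled values over possibly non-uniform times."""
    total = 0.0
    for k in range(1, len(times)):
        total += 0.5 * (values[k] + values[k - 1]) * (times[k] - times[k - 1])
    return total


def ise(errors_xyz, times):
    """Integral of the squared position error, summed over the three axes."""
    squared = [ex * ex + ey * ey + ez * ez for ex, ey, ez in errors_xyz]
    return trapezoid(squared, times)


def itae(errors_xyz, times):
    """Integral of time multiplied by the absolute position error, summed over axes."""
    weighted = [t * (abs(ex) + abs(ey) + abs(ez))
                for t, (ex, ey, ez) in zip(times, errors_xyz)]
    return trapezoid(weighted, times)


def rmse(estimates, truths):
    """Root mean square error between estimated and true disturbance samples."""
    n = len(estimates)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(estimates, truths)) / n)


# Assumed test data: constant unit error on the x-axis over one second.
times = [k * 0.01 for k in range(101)]
errors = [(1.0, 0.0, 0.0) for _ in times]
print(ise(errors, times), itae(errors, times), rmse([3.0, 4.0], [0.0, 0.0]))
```

Because ITAE multiplies the error by time, an error that persists late in the run is penalized more than a large initial transient, which is why it is read as a steady-state measure.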
The quantitative results of the performance indices are collected in Table 2. Since there is no observer in the CPID method, the RMSEEFD and RMSEETD items are omitted for it. The PPDSC control method is the same as PPAISC except for the attitude and input saturation constraints. Therefore, the lowest values of ISE and ITAE reflect the faster convergence and smaller steady-state errors of PPAISC and PPDSC compared with ADRC and CPID. The effectiveness of our proposed attitude and input saturation constraints is shown in the values of MIA and VTF. The values of RMSEEFD and RMSEETD show that the proposed ESO design for the quadrotor UAV system has faster convergence and fewer oscillations compared to conventional ESOs.