1. Introduction
Micro air vehicles (MAVs) were proposed by the US Defense Advanced Research Projects Agency (DARPA) in 1992 [
1,
2] and were expected to become a new generation of military weapons in the future. Compared to the conventional propulsion systems of MAVs, the interest in flapping flight has increased in recent years due to the rapid development of mechanics and measurement technology in biology and aerospace science [
3,
4]. Observations of the natural flapping flight of birds and insects have provided tremendous inspiration for the development of MAVs in aeronautics. Flapping-wing micro air vehicles (FWMAVs), especially clapping-wing micro air vehicles with a double clap-fling mechanism, combine the advantages of fixed wing and rotor wing MAVs with better maneuverability, higher energy conversion rates, lower noise level, better concealment performance, and higher safety [
5,
6,
7]. For the CWMAV, the realization of high-precision and high-stability flight control is the most important guarantee for its practical application. Although several CWMAV or FWMAV types exist, rare prototypes can realize an autonomous flight. Owing to the requirement for lightweight design, the size, weight, and power are limited, which leads to the degradation of the performance of sensors and actuators. These issues make CWMAV more susceptible to internal uncertainties and external disturbances, which place higher demands on control algorithms.
Numerous studies on the control algorithm of CWMAV have been conducted to meet these demands, mainly including PID, adaptive sliding mode control, linear quadratic regulator, model predictive control, etc.
Owing to difficulties in modeling MAVs with flapping wings, the control system of the mainstream prototype still uses the PID algorithm, a model-free algorithm [
8] that works well for simple control tasks with low precision requirements. Verboom et al. [
9] employed three independent PID controllers to control the flapping frequency, yaw tail angle, and pitch tail angle and realized the hovering flight of the FWMAV.
Modern state-space control methods can be more flexible and accurate when compared to PID control methods, especially in controlling complex nonlinear and time-dependent systems [
10]. Among state-space control methods, the adaptive sliding mode control has good adaptability in the face of modeling uncertainties and external disturbances and is capable of adjusting the controller parameters [
11,
12]. Banazadeh and Taymourtash [
13] designed an adaptive sliding mode control algorithm for the attitude and position control of the FWMAV in uncertain environments. Based on the quasi-steady assumption, they developed the average theory to convert the time-varying system to a time-invariant system. Bluman et al. [
14] used a second-order sliding-mode control method to realize the hovering control of CWMAVs in the longitudinal plane. In the non-optimal initial state, they introduced external gust disturbance into the control algorithm, whose validation was proved by the experiment of hovering state within seconds.
The linear quadratic regulator (LQR) is another state-space-based method for designing an optimal controller considering disturbances [
15,
16]. Deng et al. [
17] established a three-degree-of-freedom linearization model under the hovering condition of the CWMAV and proposed a top-down three-layer control framework to simplify the flight control design. Meanwhile, they designed a robust output feedback controller based on linear quadratic regulation, which only needs to pass sensor data to achieve a stable hovering flight by a simple affine transformation. Kim et al. [
18] developed an aerodynamic model that considered the interaction of the leading vortices on the wing and tail. The LQR method was adopted to calculate the optimal control gain matrix at each flight speed. Simulation experiments show that their proposed control method can not only achieve the hovering flight of the CWMAV but also control the flight speed well.
Previous studies have primarily focused on the CWMAV control problem. However, they are not robust or efficient enough in the face of real-world uncertainties such as wind. In contrast to the aforementioned advanced control algorithms, model predictive control (MPC) emphasizes the function of the model and does not consider the internal structure of the model. MPC is an advanced control algorithm that has been successfully applied in the field of engineering control [
19,
20]. It calculates the control input by line optimization. The basic principle is to first measure the output and system state at a given time and use the system model to predict the system output in the future, and then calculate the optimal control input at the current time by minimizing the difference between the predicted output and the desired output. Its rolling optimization strategy can adapt well to the trajectory tracking system with internal uncertainties and external disturbances. Dutta and Das [
21] proposed an explicit nonlinear predictive control technique model using multiple estimation models with a convex combination framework for a class of nonlinear MIMO systems. Yao et al. [
22] presented the MPC strategy for building dynamic systems, which are generally nonlinear. They presented different challenges for its optimized control of energy-saving and thermal comfort. Zhu et al. [
23] proposed the MPC scheme for a three-degrees-of-freedom flapping-wing unmanned air vehicle (UAV) to track the reference set point. In addition, the adaptive MPC extends the standard MPC by adapting the prediction model and controller parameters online in real-time to account for system changes; however, it is often cited only for MAVs with rotor wings.
A review of the literature indicates that the research considering internal uncertainties and external disturbances for MAVs with flapping wings, and especially for MAVs with clapping wings, is rarely included. Conversely, the mapping relationship between the dynamic model and actuator parameters in the control assignment module is often ignored, causing an incomplete closed-loop control. Therefore, a trajectory control model combining control assignment and adaptive model predictive control for CWMAVs is a research gap that motivated this study.
The novelty of the study is threefold: (1) a model predictive control framework for trajectory tracking of CWMAVs, based on the system state transition equation and three desired Euler angles; (2) an adaptively predictive control model with two observers to estimate internal uncertainties and external disturbances; and (3) a radial basis function model-based control assignment strategy to obtain the actuator control outputs of CWMAVs.
The remaining sections of this paper are organized as follows:
Section 2 describes a radial basis function model-based adaptive model predictive control model of the CWMAV, considering internal uncertainties and external disturbances.
Section 3 presents the tracking results of validation studies and discussions about accuracy.
Section 4 presents the conclusion and recommendations for future studies.
2. Methodology
Uncertainties are considered in the proposed RAMPC framework. The procedure can be summarized as follows.
Step 1: Construct the system state transition equation at the time step to evaluate the current system state of the CWMAV.
Step 2: Evaluate the desired attitude angle using the equation for the desired trajectory and the current flight state of the CWMAV. This ensures the real-time adjustment of the attitude angle during the trajectory tracking process.
Step 3: Calculate the future system vector that is based on the current system state and system measurement.
Step 4: Minimize the difference between the future system vector and the desired vector using system incremental gain.
Step 5: Transfer the nonlinear trajectory tracking to linear quadratic programming.
Step 6: Design two observers to compensate for the system’s model uncertainties (internal uncertainties) and external disturbances.
Step 7: Obtain the AMPC output at the time step and proceed to the time step .
Step 8: Develop an aerodynamic radial basis function model based on the experimental data.
Step 9: Transfer the nonlinear control assignment to a constraint optimization problem to obtain the output evolution of actuators .
The schematic diagram is shown in
Figure 1, and the details will be introduced in the following sections.
2.1. Dynamic Modeling of CWMAV
2.1.1. Coordinate Definition
The inertial coordinate system can describe the absolute velocity and position of the CWMAV. The flight attitude and direction need to be defined by the body coordinate system and the track coordinate system, respectively. In
Figure 2a,
,
,
are the Euler angles based on the angle between the body coordinate system and the inertial coordinate system. In
Figure 2b,
represents the forward direction, where
and
are the angle of attack and sideslip angle, respectively.
2.1.2. Nonlinear Dynamics System of CWMAV
According to the law of conservation of momentum, the force
and torque
in the body coordinate on the CWMAV can be expressed as:
where
is the total mass of CWMAV.
where
are the velocity components for the three axes in the body coordinate system.
are the components of angular velocity.
are the moments of inertia.
are the products of inertia
The dynamic system of the CWMAV in the body coordinate system can be expressed as:
Considering the axisymmetric traits of CWMAV’s moment of inertia,
, the nonlinear dynamic equation of the system is introduced into the solver based on Equations (1)–(3), which can be written as follows:
The position and orientation of the CWMAV in actual flight cannot be described by the body coordinate system. In order to determine the transformation matrix from the inertial coordinate system to the body coordinate system, the Euler angular rate
needs to be calculated by the angular velocity component
.
Similarly, the absolute velocity component can be determined by the Euler angle
and the velocity component
in the body coordinates. The absolute velocity can be expressed as:
where
There are several sources of uncertainties to be considered in flapping propulsion, including manufacturing errors, environmental disturbances, and morphological changes in the structure and wings during flapping flight. In addition, the CWMAV usually encounters unstable factors, such as low-altitude gusts. The design of the CWMAV usually focuses on lightweight construction and is made of non-metallic material. These design methods reduce the robustness of the CWMAV and increase the risk of failure. Therefore, considering the uncertainties and the disturbance, the internal uncertainties and external disturbance terms are parameterized and added to Equations (4) and (5). Then, the state equation of the system can be rewritten as follows:
where
is the linear acceleration generated by the resultant external force.
is the obliquely symmetric matrix of angular velocity;
and
are the uncertain parts of line acceleration and angular acceleration;
and
are the external disturbances in the translational and rotational flights.
The system state transition equation is performed to realize the trajectory tracking control of the CWMAV, and the inputs
and outputs
of the system are defined as follows:
where
are the force and moment of the CWMAV in the body coordinate system.
are the position coordinate components in the inertial coordinate system.
In addition, the desired velocity and angular velocity components are tracked to achieve a better control effect [
24]. Therefore, the entire system output is rewritten as follows:
2.1.3. Desired System State Solving
For the trajectory tracking problem, the input of the system is generally the equation of motion of the desired trajectory, and the attitude motion equation of the CWMAV is not specified. The attitude angle of the CWMAV tremendously influences its aerodynamic performance, so the desired attitude angle (including roll, yaw, and pitch angles) is well calculated in this study, based on the desired trajectory equation and actual flight state of the CWMAV.
In order to estimate the desired roll angle, the force analysis in the body coordinate system
is shown in
Figure 3, where
is the centripetal force and
is the desired roll angle. Due to the limited lateral aerodynamic capabilities of the CWMAV, the centripetal force
during the steering motion is mainly generated by the centripetal component of aerodynamic lift. During the roll motion, the yaw moment is created by the vertical tail, which generates a sideslip angle so that the wings have a lift difference for this roll motion. Since the lateral force generated by the tail is small compared to the wings, it is assumed that the centripetal force and gravity are provided by the normal aerodynamic force. Then, the desired roll angle can be expressed as follows:
where
is the curvature radius at the actual flight position of the aircraft at time
k, and
are the velocities in the inertial coordinate system at time
k.
In order to calculate the desired yaw angle, the force analysis is shown in
Figure 4a in the inertial coordinate system. In
Figure 4a,
are the positions at time k. According to the measurement results of the wind tunnel experiment, the sideslip angle during actual flight is particularly small due to the weak lateral aerodynamic performance of the CWMAV. Based on the difference between the actual position and the desired position in the inertial coordinate system, the desired yaw angle in the next time step can be expressed as follows:
The force analysis in the inertial coordinate system to estimate the desired pitch angle is shown in
Figure 4b. The angle of attack during flight cannot be ignored for the CWMAV. Therefore, the desired pitch angle
should be expressed by the sum of the desired track angle and the angle of attack:
where
is the angle of attack at time
k.
2.2. Adaptive Model Predictive Control for Trajectory Tracking of CWMAV
2.2.1. MPC Trajectory Tracking Controller for CWMAV
In order to obtain the control variable of the system in the MPC framework, an optimization problem must be solved in each prediction time domain. By adjusting the future control input to minimize the difference between the future output and the desired output, the optimization problem in the prediction time domain can be expressed as follows:
where
represents the predicted output at time
based on time
;
is the desired out time at
;
and
are the system input and the increment of the system input at time
, respectively.
is the weighted diagonal matrix with the tracking error and the smooth control input, respectively, and
are the time domains of the predicted output and the initial input. To increase the accuracy of the system tracking,
is generally used to provide a larger time domain for the control system.
and
are the upper and lower boundaries of the control input and control input increment, respectively.
is the terminal constraint used to ensure the progressive stability of the closed-loop system.
In order to obtain the desired attitude angle at the current time, the relative position difference between the CWMAV location and the desired trajectory is calculated by Equations (12)–(14). The control quantity, system output, control increment, and control expectation in the same time window are expressed as follows:
Then, the cost function of Equation (15) can be re-expressed as:
where,
are the extended diagonal positive definite matrices for
and
, respectively.
Considering the nonlinear dynamic system of the CWMAV from Equations (4) and (5), the state equation of the input and output of the system is expressed as:
where
is a nonlinear function by the dynamic equations of CWMAV. Then, the discretization of Equation (22) is expressed as follows:
The prediction output vector of the prediction time domain is obtained by Equations (18) and (23) as:
where the incremental control vector is expressed as:
From Equation (25), it can be seen that the proposed trajectory tracking controller becomes a nonlinear structure due to this nonlinear prediction model. In order to increase the efficiency of the traditional low-order Taylor expansion linearization method, a nonlinear predictive linearization (NPL) is proposed in [
25] to approximate the nonlinear output, where
can be represented as a linear combination of the control increments:
where
is the steady-state output trajectory when the system input is unchanged and
is the step response coefficient matrix of each input corresponding to the output:
where
is the step response coefficient.
Finally, the cost function of Equation (20) can be rewritten as:
The solution to the trajectory tracking control problem is transformed into a quadratic programming problem, which can be used to solve the system input quickly by a linear controller.
2.2.2. Adaptive Model Predictive Control Model of CWMAV
As mentioned earlier about the MPC framework of the CWMAV, this section discusses the model parameter’s uncertainty and external disturbances to perform adaptive model prediction control (AMPC).
AMPC Model for CWMAV
The prediction vector in AMPC can be defined as follows:
where
and
are the uncertain inputs of the system model,
is the high-order nonlinear input of the external disturbance, and the total disturbance is
. It is worth noting that all model uncertainties and external disturbances are bounded in this study. The optimization model of Equation (20) can be transferred to:
where
are the maximum values of disturbances and uncertainties, respectively.
Since the disturbances and uncertainty boundaries cannot be easily estimated, two observers were developed in this study to compensate for the system’s model uncertainties and external disturbances.
Designing of the Compensation Observer
In order to transform the optimization problem into a quadratic programming problem, a compensation observer is introduced to compensate for the uncertainty of the system, and the error
between the actual output and the predicted output of the system is defined as:
where
,
.
and
are the estimated values of internal uncertainty and external disturbance, respectively. The error vector can be expressed by collecting
input and output data in the prediction time domain as follows:
and
can be inversely solved using
and
at each time step by Equation (31) to ensure that the error of the model is zero at time step
. In order to reduce the computational cost of the proposed model, the least mean square (LMS) algorithm is used instead of the inverse matrix solver in Equation (34).
where
is the searching step length. The convergence study of this search algorithm can be found in [
26]. Then, using the uncertainty compensation observer, Equation (24) can be modified by this uncertainty compensation observer as follows:
where
where
and
are the modified steady-state output vector and step response output matrix, respectively.
Designing of the Disturbance Observer
External disturbances are another factor for the input error in this study. The entire prediction model can be expressed as follows:
where
is the higher-order disturbance term. A disturbance observer is proposed to estimate the difference between the predicted output and the actual output of the system.
In the classical MPC model, the disturbance term
is generally assumed to be a constant over the entire prediction time window, i.e.,
, where
and
are the system output at time k and the predicted system output at time k, respectively. These models ignore the time-dependent disturbance, which easily leads to control errors. In this study, another observer—disturbance observer—is used, where the predicted disturbance is obtained from the previous output history. Therefore, the total disturbance at time
can be expressed as follows:
where
is the
ith row vector at time
and
is the
ith row and
jth column value of the step response matrix
at time
. In contrast to the simple assumption of a constant disturbance, Equation (36) shows a more accurate assumption, namely that the changing trend of the disturbance in the future
steps is the same as the last step. The disturbance term in the predicted time domain can be represented as:
Based on the compensation of the two proposed observers, the optimization model shown in Equation (30) is transformed into:
where
,
, and
are determined by the two proposed observers. For this convex quadratic programming model, the Lagrange multiplier method is applied to obtain the optimal control increment vector
at time
k:
By considering that the trajectory tracking system is time-dependent, to increase the accuracy of control, only the first control component
acting on the controlled object in the optimal control vector
is selected, and the control amount at the current time is expressed as:
The asymptotic stability of the AMPC trajectory tracking system is verified [
27].
2.3. Radial Basis Function Model-Based Adaptive Model Predictive Control Model of CWMAV
Most of the current control methods only consider the force and torque of the airframe, but the relationship between the control algorithm and actuators is ignored. Here, a control assignment method is proposed, where an experimental measurement and RBF model place the system state of the CWMAV into the inner loop of the proposed control model.
2.3.1. Control Assignment Method
In the cascaded trajectory tracking control system structure, shown in
Figure 5, the outer loop control law realizes the calculation of the trajectory command to the desired control aerodynamic force and torque (
), while the inner loop control assignment realizes the aerodynamic force and torque to the control command (flapping frequency, pitch angle, and yaw angle). This strategy is explained in detail in the following subsections.
By defining the output of the trajectory tracking controller as a virtual control quantity
, the entire control system of the CWMAV can be expressed as:
where
is the full state variable of the system,
represents the state variable that affects the aerodynamic force and torque,
is the control parameter of the actual actuator, and
are the system function and the control assignment mapping function, respectively.
After obtaining the mapping relationship between the control command and aerodynamic data, the nonlinear control assignment problem of the CWMAV can be transformed into a constrained optimization problem:
where
is the objective function of the optimization problem, selecting
as the performance index to reduce overfitting.
is the weight matrix of the virtual control variable
.
and
are the strict lower and upper limits of the system actuator, respectively.
Considering the output interval and the maximum rate of change with the system actuator,
and
are expressed as:
where
and
are the upper and lower limit vectors of the actuator output,
represents the maximum change rate of the actuators, and
is the controller update rate.
To solve no solutions or multiple solution problems of Equation (44), a regularization term is introduced to obtain a hybrid optimization to control the smoothness of the objective function:
where
is the penalty factor for error minimization and control minimization. The value of
has a great influence on the results of the model.
represents the weight matrix output by the actuators and
is the preference vector of the control output. The purpose of adding a regularization item is to punish overfitting and avoid overfitting, thereby improving the generalization ability of the model. In this way, the complexity and accuracy of the optimization model can be balanced, and better results can be obtained.
2.3.2. Experimental Measurement
The numerical aerodynamic model is generally not accurate enough, with a much larger error than experimental data. To obtain the mapping function
in Equation (43), a force measurement experiment is conducted in a low-speed tunnel (
Figure 6).
The experiments were conducted in a low-speed wind tunnel. This is a closed-circuit facility with a rectangular cross-section of 1.2 m wide, 1.2 m high, and 3 m long. The entrance section is equipped with a regular hexagonal honeycomb and a five-layer 24 mesh/inch damping net. The four corners of the wind tunnel are installed with eight baffles. The fan in the power section of the wind tunnel has a diameter of 0.8 m and is composed of eight blades. It is driven by a variable frequency speed motor, and the rated speed of the motor is 986 r/min. Before the test section, the tunnel has an entry section with screens and a contraction ratio of 5:1 to ensure that the angle of wind deflection is lower than 0.5 degrees and the turbulence level is lower than 0.5 %. The tunnel flow speed was monitored by a pitot tube installed in the tunnel, and the free-stream speed was controlled by a manual speed controller that drove the tunnel fan, providing an experimental flow speed from 0.1 m/s to 5 m/s, which was also measured by a Pitot tube. The mechanical data acquisition system included a six-dimensional mechanical sensor (Nano 17), a data acquisition card (National Instruments, NI), and PC terminal mechanical data acquisition software. The number of bits, number of channels, protocols, sample rate, and connection type of the data acquisition card are 16, 16 single-ended, multifunction I/O, 250 KS/s, and USB, respectively.
The test prototype is a developed clapping-wing micro air vehicle. The initial state parameters include:
Initial position: ;
Initial velocity: ;
Initial Euler angle: .
As mentioned before, the rolling moment could be technologically ignored. and are the force and moment constraints of the optimization.
2.3.3. Radial Basis Function Modeling
Due to the nonlinear characteristics and unsteady effects of the CWMAV, the radial basis function is adopted in this study to conduct the mapping function
in Equation (43). The proposed aerodynamic modeling scheme is illustrated in
Figure 7.
For the aerodynamic model of the CWMAV in this paper, the angle of attack and the incoming flow velocity are the state variables; the flapping frequency , pitch angle , and yaw angle are the design variables of the actuators. Therefore, the entire design variables are five-dimensional, and the outputs are six-dimensional.