Robust Collision-Free Guidance and Control for Underactuated Multirotor Aerial Vehicles

Jorge A. Ricardo Jr; Davi A. Santos

doi:10.3390/drones7100611

and

Mechatronics Department, Aeronautics Institute of Technology (ITA), Praça Marechal Eduardo Gomes, 50-Vila das Acacias, São José dos Campos 12228-900, SP, Brazil

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Drones2023, 7(10), 611;https://doi.org/10.3390/drones7100611

This article belongs to the Topic Target Tracking, Guidance, and Navigation for Autonomous Systems

Version Notes

Order Reprints

Review Reports

Abstract

This paper is concerned with the robust collision-free guidance and control of underactuated multirotor aerial vehicles in the presence of moving obstacles capable of accelerating, linear velocity and rotor thrust constraints, and matched model uncertainties and disturbances. We address this problem by using a hierarchical flight control architecture composed of a supervisory outer-loop guidance module and an inner-loop stabilizing control one. The inner loop is designed using a typical hierarchical control scheme that nests the attitude control loop inside the position one. The effectiveness of this scheme relies on proper time-scale separation (TSS) between the closed-loop (faster) rotational and (slower) translational dynamics, which is not straightforward to enforce in practice. However, by combining an integral sliding mode attitude control law, which guarantees instantaneous tracking of the attitude commands, with a smooth and robust position control one, we enforce, by construction, the satisfaction of the TSS, thus avoiding the loss of robustness and use of a dull trial-and-error tweak of gains. On the other hand, the outer-loop guidance is built upon the continuous-control-obstacles method, which is incremented to respect the velocity and actuator constraints and avoid multiple moving obstacles that can accelerate. The overall method is evaluated using a numerical Monte Carlo simulation and is shown to be effective in providing satisfactory tracking performance, collision-free guidance, and the satisfaction of linear velocity and actuator constraints.

Keywords:

multirotor aerial vehicles; integral sliding mode control; velocity obstacles; collision avoidance

1. Introduction

Multirotor aerial vehicles (MAVs) have been extensively researched in recent decades and are expected to be widely used for a variety of new, challenging applications, such as delivery [1] and air taxis [2]. These applications demand a high level of safety and are usually carried out in disturbed environments where the MAV may fly among obstacles such as buildings, other manned or unmanned aircraft, and birds. To enable dependable operation, the MAV flight control system must provide outstanding performance, robustness, and collision avoidance. Aiming at contributing to this topic, this paper focuses on underactuated MAVs equipped with fixed rotors (which includes the conventional and widespread quadrotor). We start by adopting a typical hierarchical guidance and control architecture [3], such as the one shown in Figure 1. This architecture is composed of an outer-loop guidance system whose goal is to conduct the vehicle to the desired state while avoiding collisions and respecting operational and physical constraints. It also contains an inner control loop providing stability and robustness.

Figure 1. A typical hierarchical guidance and control architecture for MAVs.

To address the robustness requirement of MAV flight control systems against model uncertainties and disturbances, the sliding mode control [4,5] (SMC) strategy has been widely adopted to design the position and attitude control laws [6,7,8,9,10,11]. This technique provides a high-frequency switching control signal to drive and keep a system output on a sliding manifold, where the system ideally becomes insensitive to bounded model uncertainties and disturbances of the matched type. In particular, there are also SMC design techniques based on sliding manifolds suitably constructed to eliminate the reaching phase, i.e., to enable a sliding motion (with the insensitivity property) from the very beginning [12,13,14,15,16]. These design techniques can be found in the literature under two main alternative designations: the integral sliding mode control (ISMC) [14] and sometimes the global sliding mode control (GSMC) [13,16].

To deal with the underactuated dynamics of the considered class of MAVs, most of the literature relies on the assumption of time-scale separation (TSS) between the (slower) position and (faster) attitude closed-loop dynamics [6,7,8,9,17,18,19,20,21], which allows us to nest the attitude control loop inside the position one. As a consequence, the position and attitude control laws can be designed separately, but their gains have to be suitably tuned to achieve a sufficient TSS. In particular, the use of SMC in underactuated control systems is still a challenge [22]. In fact, under this nested control architecture, by using a high-frequency switching signal in the outer-loop position control law, it is not possible to properly guarantee the required TSS since such a signal is too fast to be considered slow in the design of the inner-loop attitude control. Nevertheless, there is no restriction on the use of such a policy in the attitude control. In this context, Besnard et al. [6] designed flight controllers using a continuous SMC driven by a sliding mode disturbance observer (SMDO). A procedure to tune the controllers’ gains in order to respect the TSS between the control loops was presented. Muñoz Palacios et al. [7] adopted a modified super-twisting SMC driven by a high-order sliding mode observer. Silva and Santos [8] and Labbadi and Cherkaoui [9] designed nonsingular fast terminal sliding mode flight control laws, but, to avoid the TSS violation, the position control loop was smoothed using sigmoid functions at the cost of robustness. It is worth noting that all the above-cited methods demand a trial-and-error procedure for the tuning of the flight control laws to respect the TSS assumption and avoid eventual instability.

The guidance of MAVs among obstacles has been addressed in the literature using different methods, such as nonlinear model predictive control (NMPC) [23,24], sampling-based search [25], and velocity obstacles (VO) [26,27,28]. In particular, Kamel et al. [23] and Pereira et al. [24] employed an NMPC to guide an MAV through static obstacles, addressing the collision avoidance problem by using a penalty in the cost function. Furthermore, Bouzid et al. [25] guided an MAV among static obstacles by combining an optimal rapidly exploring random tree (RRT) method with a genetic algorithm to compute the shortest path to a given target. On the other hand, Bareiss and Van Den Berg [27] proposed the linear–quadratic regulator obstacles (LQR obstacles) to address collision avoidance for mobile robots described by linear dynamics, rather than by single integrators as in the original VO [26]. The method was evaluated using quadcopters whose nonlinear dynamics were supposed to be known and linearized around the hover state. Other methods, such as the acceleration velocity obstacles (AVO) [29] and continuous control obstacles (CCO) [30], were also developed as generalizations of the VO to deal with more complicated known linear dynamics. Among the references [23,24,25,27], only [27] addresses the guidance problem for MAVs in the presence of moving obstacles. However, they use a simplified dynamic model and, like the VO-based methods [26,29,30], predict the obstacles’ future positions assuming that they keep a constant velocity, which is generally not satisfied in practice, but is addressed using a quick replanning loop with no guarantee that collision can be avoided when the obstacles accelerate.

This paper tries to fill the aforementioned gaps in the current state of the art by addressing the guidance and control for underactuated MAVs subject to matched model uncertainties and disturbances, as well as velocity and rotor thrust constraints, in the presence of obstacles that can accelerate. To tackle this problem, we start by adopting the hierarchical architecture shown in Figure 1 [3]. To design the position and attitude control laws, we propose a new hierarchical SMC scheme that, differently from [6,7,8,9,10], enforces the TSS without losing robustness and using a dull trial-and-error tweak of gains. Under this scheme, the attitude control law is designed using an ISMC strategy, in such a way as to guarantee exact tracking capability in the attitude control loop during the entire time, under the condition that the attitude command is sufficiently smooth. Then, this smoothness requirement is fulfilled by designing the position control law using a proportional–derivative (PD) approach combined with an SMDO to endow the position control loop with robustness. By doing so, the TSS is instantaneously enforced since the inner loop is made infinitely fast. On the other hand, we propose a nonlinear and robust guidance strategy based on the CCO method [30]. The guidance considers the use of smooth reference filters and, rather than relying on overly simplified nominal dynamics, it is able to deal with the uncertain nonlinear system dynamics. These system dynamics, as seen by the guidance algorithm, encompass the reference filters and the translational and rotational control loops, including the uncertainties and disturbances. Then, using the fact that the TSS is enforced by the control strategy, we tighten the admissible sets according to the bounds of the MAV tracking errors as well as the bounds of the disturbance force and torque. By doing so, different from that of Bareiss and Van Den Berg [27], the proposed strategy is able to guarantee collision avoidance and the satisfaction of velocity and rotor thrust constraints for underactuated MAVs with uncertain dynamics. Furthermore, to reliably address the problem of collision avoidance in the presence of obstacles that can accelerate, we adopt our previous method [28] that uses a high-order sliding mode differentiator (SMD) [31,32] to robustly estimate the obstacles’ maximum accelerations. Then, using these estimates, the obstacles’ future positions are not predicted using first-order models, as generally done in the VO-based methods [26,27,29,30], but considering the obstacles’ maximum accelerations. To summarize, the main contributions of this paper are the proposal of

a hierarchical SMC scheme to design the position and attitude control laws for underactuated MAVs that, by construction, enforces the TSS assumption and
robust guidance, based on the CCO and designed using a constraint-tightening approach, for underactuated MAVs with uncertain dynamics in the presence of velocity and rotor thrust constraints and multiple moving obstacles capable of accelerating.

The remaining text is structured as follows. Section 1.1 presents the notation. Section 2 defines the problem. Section 3 and Section 4 present, respectively, the control and guidance methods. Section 5 evaluates the proposed method using numerical simulations. Lastly, Section 6 concludes the paper.

1.1. Notation

The sets of real numbers, positive real numbers, and non-negative real numbers are denoted by

R

,

R_{> 0}

, and

R_{\geq 0}

, respectively. In the same manner, the sets of integer numbers, positive integer numbers, and non-negative integer numbers are denoted by

Z

,

Z_{> 0}

, and

Z_{\geq 0}

, respectively. Uppercase and lowercase boldface letters are used, respectively, to denote matrices and algebraic vectors, while geometric (Euclidean) vectors are denoted as in

\vec{a}

. The symbols

I_{n}

and

0_{n \times m}

denote, respectively, the

n \times n

identity matrix and the

n \times m

zero matrix. Moreover, the vectors of ones and zeros of dimension n are denoted, respectively, by

1_{n}

and

0_{n}

. A Cartesian coordinate system (CCS) is represented as

S_{b} ≜ {B; {\vec{x}}_{b}, {\vec{y}}_{b}, {\vec{z}}_{b}}

, with B denoting its origin and

{\vec{x}}_{b}

,

{\vec{y}}_{b}

, and

{\vec{z}}_{b}

representing the unit geometric vectors along its orthogonal axes. The algebraic vectors corresponding to the projection of an arbitrary physical vector

\vec{a} \in R^{3}

onto

S_{b}

and

S_{r}

are denoted, respectively, by

a_{b} \in R^{3}

and

a_{r} \in R^{3}

. The relation between

a_{r}

and

a_{b}

is

a_{b} = D^{b / r} a_{r}

, where

D^{b / r} \in SO (3)

is the attitude matrix of

S_{b}

relative to

S_{r}

and

SO (3) ≜ {D \in R^{3 \times 3} | D^{T} D = I_{3}}

denotes the special orthogonal group. The inverse of

D^{b / r}

is equal to its transpose, being denoted by

D^{r / b}

. Let

{\vec{a}}^{b / r}

represent an arbitrary quantity of

S_{b}

relative to

S_{r}

, e.g., throughout this paper,

{\vec{r}}^{b / r}

will denote the position of

S_{b}

with respect to

S_{r}

. Consider two arbitrary algebraic vectors

x = (x_{1}, \dots, x_{n})

and

y = (y_{1}, \dots, y_{n})

. The vector inequality

x < y

means that

x_{i} < y_{i}

,

\forall i \in {1, \dots, n}

. The kth-order time derivative of

x

is denoted by

x^{(k)}

. Furthermore, we define the vector signum function as

sign (x) ≜ (sign (x_{1}), \dots, sign (x_{n}))

, where

sign (x_{i}) ≜ \{\begin{matrix} 1 & x_{i} > 0, \\ 0 & x_{i} = 0, \\ - 1 & x_{i} < 0, \end{matrix}

with

i \in {1, \dots, n}

. The Euclidean norm and component-wise absolute value of

x

are denoted, respectively, by

∥ x ∥ ≜ \sqrt{x_{1}^{2} + \dots + x_{n}^{2}}

and

| x | ≜ (| x_{1} |, \dots, | x_{n} |)

. On the other hand, the component-wise absolute value of a generic matrix

A \in R^{m \times n}

is

|A| ≜ (| A_{i, j} |) \in R^{m \times n}

, where

A_{i, j}

is the element of the ith row and jth column of

A

. The standard basis vectors of

R^{3}

are denoted by

e_{1} ≜ (1, 0, 0)

,

e_{2} ≜ (0, 1, 0)

, and

e_{3} ≜ (0, 0, 1)

. A closed sphere of radius

ρ \in R_{> 0}

centered at

p \in R^{3}

, the Minkowski sum of two sets, and the set subtraction are denoted, respectively, by

\begin{matrix} B (p, ρ) & = \{x \in R^{3} | ∥x - p∥ \leq ρ\}, \\ X \oplus Y & = \{x + y | x \in X, y \in Y\}, \\ X ∖ Y & = \{x | x \in X, x \notin Y\} . \end{matrix}

Finally, consider the

S_{r}

representations

a_{r}

and

b_{r} ≜ (b_{1}, b_{2}, b_{3})

of

\vec{a}

and

\vec{b}

, respectively. The vector product

\vec{c} = \vec{b} \times \vec{a}

is represented in

S_{r}

by

c_{r} = [b_{r} \times] a_{r}

, where

[b_{r} \times]

is the following skew-symmetric matrix:

[b_{r} \times] ≜ [\begin{matrix} 0 & - b_{3} & b_{2} \\ b_{3} & 0 & - b_{1} \\ - b_{2} & b_{1} & 0 \end{matrix}] .

2. Problem Definition

This section defines the main problem of the paper. First, the MAV translational and rotational dynamics are derived in Section 2.1. Second, the control and guidance problems are stated in Section 2.2.

2.1. MAV Dynamic Modeling

Consider an inertial reference CCS

S_{r} ≜ \{R; {\vec{x}}_{r}, {\vec{y}}_{r}, {\vec{z}}_{r}\}

fixed on the ground at a known point R, with

{\vec{z}}_{r}

oriented upwards, parallel to the local vertical. Moreover, consider a body-fixed CCS

S_{b} ≜ \{B; {\vec{x}}_{b}, {\vec{y}}_{b}, {\vec{z}}_{b}\}

located at a point B fixed to the MAV center of mass, as shown in Figure 2, along with an illustration of a general underactuated MAV equipped with

n_{r} \geq 4

fixed rotors parallel to

{\vec{z}}_{b}

.

Figure 2. The adopted CCSs and a general underactuated MAV equipped with

n_{r}

fixed rotors parallel to

{\vec{z}}_{b}

.

The attitude kinematics of

S_{b}

with respect to

S_{r}

can be described in SO(3) by [33]

{\dot{D}}^{b / r} = - [ω_{b}^{b / r} \times] D^{b / r},

(1)

where

D^{b / r} \in SO (3)

and

ω_{b}^{b / r} \in R^{3}

are, respectively, the attitude matrix and the angular velocity of the MAV.

Using the Euler equation [34], the MAV rotational dynamics can be described by

{\dot{h}}_{b} + ω_{b}^{b / r} \times h_{b} = τ_{b}^{c} + τ_{b}^{d},

(2)

where

h_{b} \in R^{3}

is the angular momentum of the MAV relative to point B,

τ_{b}^{c} \in R^{3}

is the control torque, and

τ_{b}^{d} \in R^{3}

is the disturbance torque, which can include state-dependent terms related to parametric and model uncertainties [11]; both torques are with respect to B. Assuming a rigid airframe and negligible rotor inertias, the angular momentum

h_{b}

can be written as

h_{b} = J_{b} ω_{b}^{b / r},

(3)

where

J_{b} \in R^{3 \times 3}

is the MAV inertia tensor.

By substituting (3) into (2), the rotational dynamics can be finally described by

{\dot{ω}}_{b}^{b / r} = J_{b}^{- 1} [J_{b} ω_{b}^{b / r} \times] ω_{b}^{b / r} + J_{b}^{- 1} (τ_{b}^{c} + τ_{b}^{d}) .

(4)

The translational kinematics of

S_{b}

relative to

S_{r}

, on the other hand, are represented by

{\dot{r}}_{r}^{b / r} = v_{r}^{b / r},

(5)

where

r_{r}^{b / r} \in R^{3}

and

v_{r}^{b / r} \in R^{3}

are, respectively, the position and linear velocity of the MAV.

Using Newton’s second law, the translational dynamics of the MAV can be described in

S_{r}

by

{\dot{v}}_{r}^{b / r} = - g e_{3} + m^{- 1} (f_{r}^{c} + f_{r}^{d}),

(6)

where

g \in R_{> 0}

is the gravity acceleration magnitude,

m \in R_{> 0}

is the MAV mass,

f_{r}^{c} \in R^{3}

is the control force, and

f_{r}^{d} \in R^{3}

is the disturbance force, which can include state-dependent terms related to parametric and model uncertainties.

The set of Equations (1) and (4)–(6) can be used to represent the six-degrees-of-freedom (DOF) dynamics of any fixed-rotor rigid MAV with negligible rotor inertia, regardless of its rotor configuration. For the considered class of underactuated MAVs, the control force is given by

f_{r}^{c} = f n_{r},

(7)

where

n_{r} ≜ D^{r / b} e_{3}

is the

S_{r}

representation of

{\vec{z}}_{b}

and

f ≜ ∥ f_{r}^{c} ∥

is the resultant thrust magnitude.

Denote by

f_{i} \in R

the thrust of the ith rotor and define the vector

f^{r} ≜ (f_{1}, f_{2}, \dots, f_{n_{r}})

. The control allocation equation relating f and

τ_{b}^{c}

with

f^{r}

is [35]

[\begin{matrix} f \\ τ_{b}^{c} \end{matrix}] = Γ f^{r},

(8)

where

Γ \in R^{4 \times n_{r}}

is the control allocation matrix, which depends on the rotors’ coefficients and arrangement. For the considered class of underactuated MAVs,

Γ

is always of full-row rank, i.e.,

rank (Γ) = 4

, thus allowing the rotors to produce, within certain limits, a three-dimensional torque and a resultant thrust in the

{\vec{z}}_{b}

direction.

The resulting system (1), (4)–(8) is said to be underactuated since it has only four control inputs,

τ_{b}^{c}

and f, to control a six-DOF motion. Furthermore, from (6) and (7), it can be seen that the translational dynamics depend on the MAV attitude. For this reason, we can say that the rotational and translational dynamics are cascaded.

2.2. Problem Statement

Consider that the MAV is subject to velocity and rotor thrust constraints and flies among moving obstacles. These circumstances give rise to the following constraints:

\begin{matrix} r_{r}^{b / r} & \in P (t), \end{matrix}

(9)

\begin{matrix} v_{r}^{b / r} & \in V, \end{matrix}

(10)

\begin{matrix} f^{r} & \in F, \end{matrix}

(11)

where

P (t) \subseteq R^{3}

denotes the collision-free space, which changes as the obstacles move and is generally non-convex;

V \subseteq R^{3}

is an admissible convex set; and

F

is assumed to be

F ≜ \{f^{r} \in R^{n_{r}} | f_{i} \in [f_{i}^{-}, f_{i}^{+}], \forall i \in {1, \dots, n_{r}}\},

being

f_{i}^{-} \in R_{> 0}

and

f_{i}^{+} \in R_{> 0}

, respectively, the given lower and upper bounds for the ith rotor thrust.

Let us define the heading angle

ψ \in [- π, π)

as the third rotation in the 1-2-3 Euler angle representation of the MAV attitude.

Denoting the MAV’s desired (constant) position and desired (constant) heading, respectively, by

{\overset{ˇ}{r}}_{r}^{b / r} \in R^{3}

and

\overset{ˇ}{ψ} \in [- π, π)

, we now define the main problem of the paper.

Problem 1.

Guide the underactuated MAV described by (1), (4)–(8) to precisely reach the desired position

{\overset{ˇ}{r}}_{r}^{b / r}

and the desired heading

\overset{ˇ}{ψ}

, while satisfying constraints (9)–(11).

Problem 1 is a nonlinear guidance and control problem of an underactuated MAV subject to disturbances and constraints, operating in a dynamic environment with multiple moving obstacles that can accelerate. To tackle this problem, we adopt the guidance and control architecture shown in Figure 1. The adopted strategy is to design the control module to provide robustness with respect to disturbances and uncertainties, and the guidance module to satisfy the position, velocity, and rotors’ thrust constraints while conducting the vehicle to reach the desired position and heading. The control and guidance methods are, respectively, detailed in Section 3 and Section 4. It is worth mentioning that the objective of Problem 1 can be immediately generalized for a sequence of desired positions and headings [3].

3. Control Design

This section presents the design of the position and attitude control laws to address Problem 1. Specifically, Section 3.1 introduces the adopted hierarchical flight control architecture. Section 3.2 formulates the attitude control law. Section 3.3 formulates the position control law. Lastly, Section 3.4 presents the control allocation.

Before proceeding to the next subsection, let us denoted the command of a given variable by using an overbar symbol, e.g.,

{\bar{r}}_{r}^{b / r}

and

\bar{ψ}

denote the position and heading commands, respectively.

3.1. Hierarchical Flight Control Architecture

Most of the literature on underactuated MAVs [6,7,8,9,17,18,19,20,21] has designed the position and attitude controllers relying on a TSS between the (slower) position and (faster) attitude closed-loop dynamics, which allows us to nest the attitude control loop inside the position one, as shown in the block diagram of Figure 3. In this architecture, the position controller receives

{\bar{r}}_{r}^{b / r}

as the command input,

r_{r}^{b / r}

and

v_{r}^{b / r}

as feedback, and produces

{\bar{f}}_{r}^{c}

as the output. The attitude command generator (ACG) converts

{\bar{n}}_{r}

and

\bar{ψ}

into the three-dimensional attitude command

{\bar{D}}^{b / r}

. The attitude controller receives

{\bar{D}}^{b / r}

as the command input,

D^{b / r}

and

ω_{b}^{b / r}

as feedback, and produces

{\bar{τ}}_{b}^{c}

as output. Lastly, the control allocation converts

\bar{f}

and

{\bar{τ}}_{b}^{c}

into individual thrust commands

{\bar{f}}^{r} ≜ ({\bar{f}}_{1}, \dots, {\bar{f}}_{n_{r}})

.

Figure 3. Hierarchical control architecture for underactuated MAVs equipped with fixed rotors. ACG stands for attitude command generator.

Consider the following assumption regarding the rotors.

Assumption 1.

The rotor dynamics are instantaneous and their static parameters are exactly known.

Assumption 1 is common in practice since the rotor dynamics are much faster than the attitude and position ones, thus allowing us to suppose, in the design of the controllers, that the thrust commands are instantaneously achieved, i.e.,

f^{r} \equiv {\bar{f}}^{r}, \forall t \geq 0

. Consequently, from (8), one can say that

f \equiv \bar{f}

and

τ_{b}^{c} \equiv {\bar{τ}}_{b}^{c}

.

By assuming that the rotational dynamics are faster than the translational ones, the TSS allows consideration in the design of the position controller that

n_{r} = {\bar{n}}_{r}

. Then, by also considering Assumption 1, Equation (7) becomes

f_{r}^{c} = \bar{f} {\bar{n}}_{r} ≜ {\bar{f}}_{r}^{c},

(12)

where

{\bar{n}}_{r} \equiv {\bar{D}}^{r / b} e_{3}

, which in theory removes the underactuation of the system since the dependency of

f_{r}^{c}

on the actual attitude is removed. Then, the attitude and position control laws can be separately designed by considering the resulting fully actuated system described by Equations (1), (4)–(6), and (12). The critical point is that, when using this strategy, the controllers’ gains have to be carefully tuned in order to respect the TSS assumption, whereas, in practice, this tuning is a cumbersome trial-and-error process, thus requiring improved safety procedures to deal with the eventual instability that may occur in the case of insufficient TSS [36].

To avoid this drawback, we propose a hierarchical sliding mode control scheme for underactuated MAVs that enforces the attitude-position TSS. To this end, we design a multi-input attitude control law using an ISMC strategy, in such a way as to guarantee exact tracking capability in the attitude control loop during all the time. Such exact tracking makes

n_{r} = {\bar{n}}_{r}

without the need for tuning to reach the TSS. Therefore, the position control law can be designed using the fully actuated system described by (5), (6), and (12).

3.2. Integral Sliding Mode Attitude Control Law

Consider the objective of tracking a time-varying attitude command

{\bar{D}}^{b / r}

that satisfies the following assumption.

Assumption 2.

The time-varying attitude command

{\bar{D}}^{b / r}

is such that, at the initial time,

{\bar{D}}^{b / r} (0) = D^{b / r} (0)

and

{\bar{ω}}_{b}^{b / r} (0) = ω_{b}^{b / r} (0)

. Moreover, it is

(h - 1)

th-order differentiable with respect to time, where

h \geq 2

, such that its first-time derivative

{\dot{\bar{D}}}^{b / r}

is Lipschitz-continuous.

Assumption 2 is not restrictive; it only requires the knowledge of the MAV’s initial attitude and angular velocity and the use of a suitably smooth attitude command. These conditions can be fulfilled by properly designing the heading command

\bar{ψ}

and the position control law (see Figure 3).

The attitude and angular velocity control errors can be defined, respectively, as [33]

\begin{matrix} \tilde{D} & ≜ D^{b / \bar{b}} \equiv D^{b / r} {({\bar{D}}^{b / r})}^{T} \in SO (3), \end{matrix}

(13)

\begin{matrix} \tilde{ω} & ≜ ω_{b}^{b / \bar{b}} \equiv ω_{b}^{b / r} - \tilde{D} {\bar{ω}}_{b}^{b / r} \in R^{3}, \end{matrix}

(14)

where

{\bar{D}}^{b / r} ≜ D^{\bar{b} / r}

,

{\bar{ω}}_{b}^{b / r} ≜ ω_{\bar{b}}^{\bar{b} / r}

, and

\bar{b}

refers to a CCS

S_{\bar{b}}

representing the commanded attitude for

S_{b}

.

The attitude and angular velocity errors (13) and (14) allow the description of the attitude error kinematics by a conventional attitude kinematics differential equation, such as (1), i.e.,

\dot{\tilde{D}} = - [\tilde{ω} \times] \tilde{D} .

(15)

Besides the attitude matrix

\tilde{D}

, a three-dimensional attitude representation is required for the proposed control design. Here, we adopt the Gibbs vector [33]

\tilde{g} ≜ \tilde{ε} tan (\tilde{ϑ} / 2),

where

\tilde{ε} \in S^{2} ≜ {x \in R^{3} | ∥ x ∥ = 1}

and

\tilde{ϑ} \in R

are, respectively, the principal Euler axis and angle corresponding to

\tilde{D}

. Note that the Gibbs vector is singular at the angles

\tilde{ϑ} = (2 i + 1) π

rad,

\forall i \in Z

. However, since

\tilde{g}

represents the attitude error (not the full attitude), an effective control design will keep

\tilde{ϑ} < < π

and singularities will not be reached in practice [8]. The direct and inverse relations between

\tilde{D}

and

\tilde{g}

are, respectively, given by

\begin{matrix} \tilde{D} & = \frac{(1 - {\tilde{g}}^{T} \tilde{g}) I_{3} + 2 \tilde{g} {\tilde{g}}^{T} - 2 [\tilde{g} \times]}{1 + {\tilde{g}}^{T} \tilde{g}}, \\ \tilde{g} & = \frac{1}{1 + tr (\tilde{D})} [\begin{matrix} {\tilde{D}}_{23} - {\tilde{D}}_{32} \\ {\tilde{D}}_{31} - {\tilde{D}}_{13} \\ {\tilde{D}}_{12} - {\tilde{D}}_{21} \end{matrix}], \end{matrix}

where

{\tilde{D}}_{i j}

is the element of the ith row and jth column of

\tilde{D}

, and

tr (\cdot)

is the trace operator.

The attitude error kinematics (15) can alternatively be described using the Gibbs vector as [33]

\dot{\tilde{g}} = \frac{1}{2} (\tilde{g} {\tilde{g}}^{T} + [\tilde{g} \times] + I_{3}) \tilde{ω} .

(16)

On the other hand, by replacing (13) and (14) into (4) and considering Assumption 1, the attitude error dynamics can be described by

\begin{matrix} \dot{\tilde{ω}} & = J_{b}^{- 1} [J_{b} ω_{b}^{b / r} \times] ω_{b}^{b / r} + J_{b}^{- 1} ({\bar{τ}}_{b}^{c} + τ_{b}^{d}) - \tilde{D} {\dot{\bar{ω}}}_{b}^{b / r} + [\tilde{ω} \times] \tilde{D} {\bar{ω}}_{b}^{b / r} . \end{matrix}

(17)

By defining

x_{1} ≜ \tilde{g}

,

x_{2} ≜ \tilde{ω}

,

u ≜ {\bar{τ}}_{b}^{c}

,

d ≜ τ_{b}^{d}

, the error kinematics (16) and dynamics (17) can be inserted into the state-space model

\begin{matrix} {\dot{x}}_{1} & = f_{1} (x_{1}, x_{2}), \end{matrix}

(18)

\begin{matrix} {\dot{x}}_{2} & = f_{2} (x_{1}, x_{2}) + B u + B d, \end{matrix}

(19)

where

B ≜ J_{b}^{- 1}

,

\begin{matrix} f_{1} (x_{1}, x_{2}) & ≜ \frac{1}{2} (x_{1} x_{1}^{T} + [x_{1} \times] + I_{3}) x_{2}, \\ f_{2} (x_{1}, x_{2}) & ≜ B [B^{- 1} ω_{b}^{b / r} \times] ω_{b}^{b / r} - \tilde{D} {\dot{\bar{ω}}}_{b}^{b / r} + [x_{2} \times] \tilde{D} {\bar{ω}}_{b}^{b / r} . \end{matrix}

(20)

Now, let us define the attitude sliding variable

s ≜ {\dot{x}}_{1} + C x_{1},

(21)

where

C \in R^{3 \times 3}

. The corresponding sliding set is

S ≜ \{(x_{1}, {\dot{x}}_{1}) \in R^{6} | s = 0_{3}\} .

From Assumption 2, one can see that the system is in the sliding set

S

at the initial time instant. Therefore, by designing a control

u

such that the inequality

\dot{V} \leq - β V^{1 / 2}

[37], with

β \in R_{> 0}

, is satisfied from the very beginning, it holds that

(x_{1}, {\dot{x}}_{1}) \in S

during the entire time, and, consequently,

(x_{1} (t), {\dot{x}}_{1} (t)) = (0_{3}, 0_{3}), \forall t \geq 0

. Therefore, from (20), it holds that

x_{2} (t) = 0_{3}, \forall t \geq 0

. In other words, the system is capable of exactly tracking the attitude and angular velocity commands during the entire time, i.e.,

D^{b / r} (t) = {\bar{D}}^{b / r} (t)

and

ω_{b}^{b / r} (t) = {\bar{ω}}_{b}^{b / r} (t), \forall t \geq 0

.

By differentiating (21) and using (18) and (19), we can obtain the dynamic equation for

s

(for conciseness, we omit the function-independent variables):

\dot{s} = C f_{1} + \frac{\partial f_{1}}{\partial x_{1}} f_{1} + \frac{\partial f_{1}}{\partial x_{2}} f_{2} + \frac{\partial f_{1}}{\partial x_{2}} B u + \frac{\partial f_{1}}{\partial x_{2}} B d .

(22)

Regarding the disturbance

d

, consider the following assumption.

Assumption 3.

The disturbance

d

is bounded according to

| d | \leq τ^{\max}

, where

τ^{\max} \in R^{3}

is a known vector with positive components.

The boundedness in Assumption 3 is reasonable in practice, but one can rarely obtain a non-conservative estimate of the bound

τ^{\max}

without a switching-gain adaptation scheme [38].

Lemma 1 gives a control law that ensures that the system (18) and (19) has an integral sliding mode in

S

.

Lemma 1.

Under Assumptions 2 and 3, the following control law guarantees the integral sliding mode of the system (18) and (19) in

S

:

u = - Ξ^{- 1} (u_{0} + K_{1} s i g n (s)),

(23)

where

u_{0} ≜ C f_{1} + \frac{\partial f_{1}}{\partial x_{1}} f_{1} + \frac{\partial f_{1}}{\partial x_{2}} f_{2},

Ξ ≜ \frac{\partial f_{1}}{\partial x_{2}} B

, and

K_{1} \in R^{3 \times 3}

is a positive-definite diagonal matrix satisfying

K_{1} 1_{3} \geq |Ξ| τ^{\max} + 1_{3} \frac{β}{\sqrt{2}} .

(24)

Proof.

Consider the Lyapunov candidate function

V (s) = s^{T} s / 2

. From Assumption 2, the system (18) and (19) starts the motion in the sliding set

S

. Therefore, to prove the integral sliding mode, it is sufficient to show the satisfaction of the inequality

\dot{V} \leq - β V^{1 / 2}

. To this end, differentiating

V (s)

with respect to time, using (22) and (23), and choosing

K_{1}

according to (24), one can show that

\begin{matrix} \dot{V} (s) & = - s^{T} (K_{1} sign (s) - Ξ d), \\ = - {| s |}^{T} diag (sign (s)) (K_{1} sign (s) - Ξ d), \\ = - {| s |}^{T} (K_{1} 1_{3} - diag (sign (s)) Ξ d), \\ \leq - {| s |}^{T} (K_{1} 1_{3} - |Ξ| τ^{\max}), \\ \leq - {| s |}^{T} 1_{3} \frac{β}{\sqrt{2}} \leq - \frac{β}{\sqrt{2}} ∥ s ∥ = - β V^{1 / 2} . \end{matrix}

Thus, we complete the proof. □

Attitude Command Generator

The attitude command generator (ACG) converts

{\bar{n}}_{r}

and

\bar{ψ}

into

{\bar{D}}^{b / r}

. Since the heading angle

ψ

is defined as the third rotation in the 1-2-3 Euler angle representation of

D^{b / r}

, it is appropriate to parameterize

{\bar{D}}^{b / r}

also using the Euler angles

(\bar{ϕ}, \bar{θ}, \bar{ψ})

in the 1-2-3 sequence, i.e.,

{\bar{D}}^{b / r} ≜ [\begin{matrix} c_{\bar{ψ}} c_{\bar{θ}} & c_{\bar{ψ}} s_{\bar{θ}} s_{\bar{ϕ}} + s_{\bar{ψ}} c_{\bar{ϕ}} & - c_{\bar{ψ}} s_{\bar{θ}} c_{\bar{ϕ}} + s_{\bar{ψ}} s_{\bar{ϕ}} \\ - s_{\bar{ψ}} c_{\bar{θ}} & - s_{\bar{ψ}} s_{\bar{θ}} s_{\bar{ϕ}} + c_{\bar{ψ}} c_{\bar{ϕ}} & s_{\bar{ψ}} s_{\bar{θ}} c_{\bar{ϕ}} + c_{\bar{ψ}} s_{\bar{ϕ}} \\ s_{\bar{θ}} & - c_{\bar{θ}} s_{\bar{ϕ}} & c_{\bar{θ}} c_{\bar{ϕ}} \end{matrix}],

(25)

where

c_{*}

and

s_{*}

are short notations for

\cos (*)

and

\sin (*)

, respectively.

From the definition of

{\bar{n}}_{r}

given after Equation (12), it can be seen that its transpose is equal to the third line of the attitude command

{\bar{D}}^{b / r}

. Then, the commands

\bar{ϕ}

and

\bar{θ}

in (25) can be calculated from

{\bar{n}}_{r}

, respectively, as

\bar{ϕ} = - \tan^{- 1} (\frac{e_{2}^{T} {\bar{n}}_{r}}{e_{3}^{T} {\bar{n}}_{r}}),

(26)

\bar{θ} = \sin^{- 1} (e_{1}^{T} {\bar{n}}_{r}) .

(27)

Regarding the smoothness of

{\bar{D}}^{b / r}

in Assumption 2, consider the following remark.

Remark 1.

For the attitude command

{\bar{D}}^{b / r}

to be in fact

(h - 1)

th-order differentiable with respect to time, such that its first-time derivative is Lipschitz-continuous, as supposed by Assumption 2, it can be seen from (25)–(27) that

{\bar{n}}_{r}

and

\bar{ψ}

must have the same smoothness degree. In turn, the required smoothness of

{\bar{n}}_{r}

and

\bar{ψ}

can be achieved by properly designing the guidance method and the position control law.

3.3. Position Control Law

Using Assumption 1 and the fact that the attitude loop has an integral sliding mode, which ideally imply that

D^{b / r} (t) \equiv {\bar{D}}^{b / r} (t)

and

n_{r} (t) \equiv {\bar{n}}_{r} (t), \forall t \geq 0

, the position control law can be designed using the fully actuated model (5) and (6) with

f_{r}^{c}

given by (12) instead of (7).

Therefore, consider the objective of tracking a time-varying position command

{\bar{r}}_{r}^{b / r} (t)

. To this end, define the position and linear velocity errors, respectively, by

\begin{matrix} \tilde{r} & ≜ r_{r}^{b / r} - {\bar{r}}_{r}^{b / r}, \end{matrix}

(28)

\begin{matrix} \tilde{v} & ≜ v_{r}^{b / r} - {\bar{v}}_{r}^{b / r} . \end{matrix}

(29)

Then, the translational error kinematics and dynamics are obtained, respectively, by substituting (28) into (5), and (12) and (29) into (6), yielding

\begin{matrix} \dot{\tilde{r}} & = \tilde{v}, \end{matrix}

(30)

\begin{matrix} \dot{\tilde{v}} & = - g e_{3} + m^{- 1} ({\bar{f}}_{r}^{c} + f_{r}^{d}) - {\dot{\bar{v}}}_{r}^{b / r} . \end{matrix}

(31)

Consider the following assumption regarding the disturbance force

f_{r}^{d}

.

Assumption 4.

The disturbance force is bounded according to

| f_{r}^{d} | \leq f^{\max}

, with

f^{\max} \in R^{3}

being a known vector with positive components. Moreover,

f_{r}^{d}

is

(h - 1)

th-order differentiable with respect to time, being h the smoothness parameter defined in Assumption 2, such that

f_{r}^{d (1)}

is Lipschitz and

f_{r}^{d (h - 1)}

has a known Lipschitz constant vector

γ_{p} > 0_{3}

.

Assumption 4 is not very restrictive. It essentially states that the disturbance is limited and has a Lipschitz-continuous first and

(h - 1)

th derivative with respect to time, which is reasonable in practice. In fact, model uncertainties and external aerodynamic disturbances usually fulfill such an assumption [39].

In order to satisfy the smoothness requirements for

{\bar{D}}^{b / r}

as discussed in Remark 1, while ensuring robustness with respect to the disturbance force, we design the position control law by combining a proportional–derivative (PD) policy with an SMDO. This control law is given by

\begin{matrix} {\bar{f}}_{r}^{c} & = m (g e_{3} + {\dot{\bar{v}}}_{r}^{b / r} - K_{2} \tilde{r} - K_{3} \tilde{v}) - {\hat{f}}_{r}^{d}, \end{matrix}

(32)

where

K_{2} \in R^{3 \times 3}

and

K_{3} \in R^{3 \times 3}

are positive-definite diagonal matrices, and

{\hat{f}}_{r}^{d} \in R^{3}

is the disturbance force estimate to be provided by the SMDO.

To estimate the disturbance force, let us define an auxiliary sliding variable

\begin{matrix} σ & ≜ m \tilde{v} + β, β (0) = - m \tilde{v} (0), \end{matrix}

(33)

\begin{matrix} \dot{β} & = - {\bar{f}}_{r}^{c} + m g e_{3} + m {\dot{\bar{v}}}_{r}^{b / r} . \end{matrix}

(34)

By substituting (31) and (34) into the time derivative of (33), we obtain

σ^{(i)} = f_{r}^{d (i - 1)},

\forall i \in {1, \dots, h}

. Then, the following multi-input high-order sliding mode differentiator (SMD) [31] is used to estimate

σ^{(i)}

and, consequently,

f_{r}^{d (i - 1)}

:

\{\begin{matrix} {\dot{z}}_{0}^{d} = w_{0}^{d}, \\ w_{0}^{d} = - Λ_{0}^{d} Ψ {(γ_{p})}^{1 / (h + 1)} Ψ (| z_{0}^{d} - σ {|)}^{h / (h + 1)} sign (z_{0}^{d} - σ) + z_{1}^{d}, \\ {\dot{z}}_{1}^{d} = w_{1}^{d}, \\ w_{1}^{d} = - Λ_{1}^{d} Ψ {(γ_{p})}^{1 / h} Ψ (| η_{1}^{d} {|)}^{(h - 1) / h} sign (η_{1}^{d}) + z_{2}^{d}, \\ ⋮ \\ {\dot{z}}_{h - 1}^{d} = w_{h - 1}^{d}, \\ w_{h - 1}^{d} = - Λ_{h - 1}^{d} Ψ {(γ_{p})}^{1 / 2} Ψ (| η_{h - 1}^{d} {|)}^{1 / 2} sign (η_{h - 1}^{d}) + z_{h}^{d}, \\ {\dot{z}}_{h}^{d} = - Λ_{h}^{d} Ψ (γ_{p}) sign (η_{h}^{d}), \end{matrix}

(35)

where

η_{k}^{d} ≜ z_{k}^{d} - w_{k - 1}^{d}

, with

k \in {1, \dots, h}

,

Λ_{j}^{d} \in R^{3 \times 3}

is a positive-definite diagonal matrix, with

j \in \{0, \dots, h\}

, and

Ψ (| \cdot |) ≜ diag (| \cdot |)

maps a vector into a diagonal matrix. The SMD (35) initial conditions are set as

z_{0}^{d} (0) = σ (0)

and

z_{i}^{d} (0) = 0_{3}

,

\forall i \in {1, \dots, h}

.

Lemma 2 establishes the convergence properties of the SMD (35).

Lemma 2

([31]). Consider the SMD of Equation (35). For any given

Λ_{h}^{d}

satisfying

\begin{matrix} Λ_{h}^{d} 1_{3} > 1_{3}, \end{matrix}

there exists a set of positive-definite diagonal matrices

{Λ_{0}^{d}, \dots, Λ_{h - 1}^{d}}

that provides the finite-time convergence of

z_{0}^{d}

and

z_{i}^{d}

, respectively, to σ and

σ^{(i)}

,

\forall i \in \{1, \dots, h\}

.

Then, since

σ^{(i)} = f_{r}^{d (i - 1)}

and

z_{i}^{d} \to σ^{(i)}

,

\forall i \in {1, \dots, h}

, the estimate of the disturbance force and its time derivatives are simply given by

{\hat{f}}_{r}^{d (i - 1)} = z_{i}^{d}, \forall i \in {1, \dots, h} .

(36)

The proposed SMDO is represented by Equations (33)–(36) and exhibits low sensitivity to parameter variations, making it easy to tune. The tuning trade-off is that the larger the gains

Λ_{j}^{d}

, the faster the convergence and the higher the sensitivity to measurement noise, time discretization, and unmodeled dynamics [31].

Substituting the control law (32) into (31), the closed-loop position dynamics can be described by

\begin{matrix} \dot{\tilde{v}} & = - K_{2} \tilde{r} - K_{3} \tilde{v} + m^{- 1} (f_{r}^{d} - {\hat{f}}_{r}^{d}) . \end{matrix}

(37)

Given the disturbance force bound of Assumption 4, it holds that the disturbance estimation error is bounded during the SMDO (33)–(36) convergence phase and vanishes after a finite time. Then, the origin

(\tilde{r}, \tilde{v}) = (0_{3}, 0_{3})

of the system described by (30) and (37) can be made asymptotically stable by suitably choosing the gains

K_{2}

and

K_{3}

.

Now, to satisfy the smoothness requirement for

{\bar{n}}_{r}

as discussed in Remark 1,

{\bar{f}}_{r}^{c}

must have the same smoothness degree. Regarding the smoothness of

{\bar{f}}_{r}^{c}

, consider the following remark.

Remark 2.

For the control force command

{\bar{f}}_{r}^{c}

to be

(h - 1)

th-order differentiable with respect to time and have a Lipschitz-continuous first-time derivative, it can be seen from (32) that, since

{\hat{f}}_{r}^{d}

already satisfies this requirement (see (33)–(36)), the acceleration command

{\dot{\bar{v}}}_{r}^{b / r}

must have the same smoothness degree. Such a command will be properly generated by the guidance strategy presented in Section 4.

Once we have fulfilled the smoothness requirement of

{\bar{D}}^{b / r}

by properly designing

\bar{ψ}

and

{\bar{f}}_{r}^{c}

, the initial conditions

{\bar{D}}^{b / r} (0) = D^{b / r} (0)

and

{\bar{ω}}_{b}^{b / r} (0) = ω_{b}^{b / r} (0)

of Assumption 2 must also be assured. Consider the initial conditions

D^{b / r} (0)

,

ω_{b}^{b / r} (0)

, and the initial total thrust magnitude

f (0)

. The latter is generally not available for measurement, but, from Assumption 1, we can replace

f (0)

with

\bar{f} (0)

. From (25)–(27), it can be seen that, by choosing

\bar{ψ} (0) = ψ (0)

and

{\bar{n}}_{r} (0) = n_{r} (0)

, it holds that

{\bar{D}}^{b / r} (0) = D^{b / r} (0)

. The condition

{\bar{n}}_{r} (0) = n_{r} (0)

is satisfied if

{\bar{f}}_{r}^{c} (0) = \bar{f} (0) n_{r} (0)

. Therefore, by choosing

{\bar{r}}_{r}^{b / r} (0) = r_{r}^{b / r} (0)

and

{\bar{v}}_{r}^{b / r} (0) = v_{r}^{b / r} (0)

, given that

{\hat{f}}_{r}^{d} (0) = 0_{3}

, the initial acceleration command

{\dot{\bar{v}}}_{r}^{b / r} (0)

that makes

{\bar{f}}_{r}^{c} (0) = \bar{f} (0) n_{r} (0)

and consequently

{\bar{n}}_{r} (0) = n_{r} (0)

can be calculated from (32) as

\begin{matrix} {\dot{\bar{v}}}_{r}^{b / r} (0) & = \frac{\bar{f} (0) n_{r} (0)}{m} - g e_{3} . \end{matrix}

(38)

In summary, by calculating

{\dot{\bar{v}}}_{r}^{b / r} (0)

according to (38) and choosing

\bar{ψ} (0) = ψ (0)

, we indirectly ensure that

{\bar{D}}^{b / r} (0) = D^{b / r} (0)

.

The relation between the 1-2-3 Euler angles

α^{b / r} ≜ (ϕ, θ, ψ)

that parameterize

D^{b / r}

and the angular velocity

ω_{b}^{b / r}

is

{\dot{α}}^{b / r} = A_{α} (α^{b / r}) ω_{b}^{b / r},

(39)

where

A_{α} ≜ [\begin{matrix} \frac{\cos (ψ)}{\cos (θ)} & - \frac{\sin (ψ)}{\cos (θ)} & 0 \\ \sin (ψ) & \cos (ψ) & 0 \\ - \cos (ψ) \tan (θ) & \sin (ψ) \tan (θ) & 1 \end{matrix}] .

From (39), it can be seen that, since

{\bar{D}}^{b / r} (0) = D^{b / r} (0)

, by making

{\dot{\bar{α}}}^{b / r} (0) = {\dot{α}}^{b / r} (0)

, we have that

{\bar{ω}}_{b}^{b / r} (0) = ω_{b}^{b / r} (0)

. The equality

{\dot{\bar{α}}}^{b / r} (0) = {\dot{α}}^{b / r} (0)

holds true if

\dot{\bar{ψ}} (0) = e_{3}^{T} A_{α} (α^{b / r} (0)) ω_{b}^{b / r} (0)

,

\dot{\bar{θ}} (0) = \dot{θ} (0)

, and

\dot{\bar{ϕ}} (0) = \dot{ϕ} (0)

. However, it can be seen from (26) and (27) that

\dot{\bar{θ}} (0) = \dot{θ} (0)

and

\dot{\bar{ϕ}} (0) = \dot{ϕ} (0)

hold true if

{\dot{\bar{n}}}_{r} (0) = {\dot{n}}_{r} (0)

. The initial condition

{\dot{n}}_{r} (0)

is known and can be calculated from the definition of

n_{r}

, given immediately after Equation (7), using

D^{b / r} (0)

and

ω_{b}^{b / r} (0)

. Therefore, the equality

{\dot{\bar{n}}}_{r} (0) = {\dot{n}}_{r} (0)

must be ensured by the position control law. Knowing that

{\bar{n}}_{r} = {\bar{f}}_{r}^{c} / ∥ {\bar{f}}_{r}^{c} ∥

,

{\dot{\bar{n}}}_{r} (0)

can be calculated in terms of the initial control force command as

{\dot{\bar{n}}}_{r} (0) = Φ (0) {\dot{\bar{f}}}_{r}^{c} (0),

(40)

where

Φ (0) ≜ \frac{I_{3}}{∥ {\bar{f}}_{r}^{c} (0) ∥} - \frac{{\bar{f}}_{r}^{c} (0) {({\bar{f}}_{r}^{c} (0))}^{T}}{∥ {\bar{f}}_{r}^{c} {(0) ∥}^{3}} .

The time derivative of the control force command

{\dot{\bar{f}}}_{r}^{c} (0)

can be calculated by differentiating (32) and using (37). The resulting expression is

\begin{matrix} {\dot{\bar{f}}}_{r}^{c} (0) & = m {\ddot{\bar{v}}}_{r}^{b / r} (0) - K_{3} f_{r}^{d} (0) . \end{matrix}

(41)

Substituting (41) into (40), one can see that the initial jerk command

{\ddot{\bar{v}}}_{r}^{b / r} (0)

that makes

{\dot{\bar{n}}}_{r} (0) = {\dot{n}}_{r} (0)

is

{\ddot{\bar{v}}}_{r}^{b / r} (0) = \frac{{(Φ (0))}^{- 1} {\dot{n}}_{r} (0)}{m} + K_{3} f_{r}^{d} (0) .

(42)

By calculating

{\ddot{\bar{v}}}_{r}^{b / r} (0)

according to (42) and choosing

\dot{\bar{ψ}} (0) = e_{3}^{T} A_{α} (α^{b / r} (0)) ω_{b}^{b / r} (0)

, we ensure that

{\bar{ω}}_{b}^{b / r} (0) = ω_{b}^{b / r} (0)

. However, the initial disturbance force

f_{r}^{d} (0)

appearing on the right-hand side of (42) is unknown. To deal with this lack of information, we gradually increase the derivative gain

K_{3}

from

K_{3} (0) = 0_{3}

.

The adaptation of

K_{3}

must be quick enough to provide good performance to the position control loop. To guarantee the position control smoothness (see Remark 1), the adaptive gain

K_{3}

must be

(h - 1)

th-order differentiable with respect to time, such that its first-time derivative is Lipschitz. To achieve this, we increase

K_{3} (t)

during a prescribed time

t_{s} \in R_{> 0}

according to

K_{3} (t) = {\bar{K}}_{3} - \frac{{\bar{K}}_{3}}{t_{s}^{h}} {(t_{s} - t)}^{h} I_{[0, t_{s})} (t),

(43)

where

{\bar{K}}_{3} \in R^{3 \times 3}

is a positive-definite diagonal matrix corresponding to the final desired value of

K_{3}

, and

I_{[0, t_{s})} (t)

is an indicator function that is equal to one if

t \in [0, t_{s})

and equal to zero otherwise.

3.4. Control Allocation

The control allocation calculates

{\bar{f}}^{r}

from

\bar{f}

and

{\bar{τ}}_{b}^{c}

as a solution to the control allocation Equation [35]:

[\begin{matrix} \bar{f} \\ {\bar{τ}}_{b}^{c} \end{matrix}] = Γ {\bar{f}}^{r} .

(44)

Assuming that the rotor set is configured in such a way that

Γ

is always of full-row rank, i.e.,

rank (Γ) = 4

, the solution of (44) is unique when

n_{r} = 4

and is simply calculated by inverting

Γ

. For

n_{r} > 4

, a simple solution can also be obtained, but now using the pseudo-inverse method [40].

4. Guidance Design

Consider that the MAV flies among

n_{o} \in Z_{> 0}

obstacles and define a set

I

containing the identification of the obstacles. Moreover, consider a point

C_{i}

fixed to the center of mass of obstacle

i \in I

and denote its position and velocity by

r_{r}^{i / r} \in R^{3}

and

v_{r}^{i / r} \in R^{3}

, respectively.

To guarantee the smoothness requirement of

\bar{ψ}

and

{\bar{r}}_{r}^{b / r}

as discussed in Remarks 1 and 2, the proposed guidance algorithm generates these commands using reference filters, as depicted in Figure 4. The heading reference filter is designed using an overdamped LPF to make

\bar{ψ}

converge smoothly to a target heading

ψ^{*} \in (- π, π]

provided by the CCO method. Similarly, the position reference filter consists of an overdamped LPF to make

{\bar{v}}_{r}^{b / r}

smoothly converge to a target velocity

v_{r}^{*} \in R^{3}

, also given by the CCO, and an integrator for calculating

{\bar{r}}_{r}^{b / r}

. We assume that the MAV is aware of the obstacles’ position and velocity

O ≜ \{(r_{r}^{i / r}, v_{r}^{i / r}), \forall i \in I\}

. Then, to avoid collision with accelerated obstacles, we use a high-order SMD [31] to provide an estimate of the acceleration of obstacle i, denoted by

z_{1}^{i} \in R^{3}

, from observations of

v_{r}^{i / r}

. In summary, the CCO receives the desired position

{\overset{ˇ}{r}}_{r}^{b / r}

and the desired heading

\overset{ˇ}{ψ}

as input and the MAV states

r_{r}^{b / r}

,

v_{r}^{b / r}

,

D^{b / r}

, and

ω_{b}^{b / r}

, as well as

z_{1}^{i}, \forall i \in I

, and

O

, as feedback. Using this information, it chooses

v_{r}^{*}

and

ψ^{*}

aiming at avoiding collisions with the obstacles and respecting the linear velocity constraint (10) and the rotor thrust constraint (11).

Figure 4. Block diagram of the proposed guidance strategy.

The heading reference filter consists of a hth-order overdamped LPF to guarantee the necessary smoothness of

\bar{ψ}

and ensure that it has no overshoot with respect to the input

ψ^{*}

. This reference filter can be represented by the state-space model

{\dot{y}}_{ψ} = E y_{ψ} + F ψ^{*},

(45)

where

y_{ψ} ≜ (\bar{ψ}, \dot{\bar{ψ}}, \dots, {\bar{ψ}}^{(h - 1)}) \in R^{h}

,

F ≜ (0_{h - 1}, τ_{ψ}^{- h}) \in R^{h}

,

τ_{ψ} \in R_{> 0}

is a time constant, and

\begin{matrix} E & ≜ [\begin{matrix} \begin{matrix} 0_{h - 1 \times 1} & I_{h - 1} \end{matrix} \\ E_{1} E_{2} \dots E_{h} \end{matrix}] \in R^{h \times h}, \\ E_{k + 1} & ≜ - \frac{h!}{k! (h - k)!} τ_{ψ}^{- (h - k)}, \forall k \in {0, 1, \dots, h - 1} . \end{matrix}

The initial heading and heading rate commands are set equal to the respective MAV initial conditions, while the remaining filter states are set equal to zero, i.e.,

y_{ψ} (0) ≜ (ψ (0), \dot{ψ} (0), 0, \dots, 0) .

(46)

It is worth noting that the target heading

ψ^{*} (t)

given by the CCO may be discontinuous [16]. Then, designing the heading reference filter using an LPF of order h guarantees that

\bar{ψ}

is minimally

(h - 1)

-times differentiable with respect to time and has a Lipschitz-continuous first-time derivative. As a result, the smoothness requirement for

\bar{ψ}

can be satisfied.

Similarly, to guarantee the necessary smoothness of

{\dot{\bar{v}}}_{r}^{b / r}

, the position reference filter is designed using an overdamped LPF of order

h + 1

and an integrator, being represented by the state-space model

{\dot{y}}_{p} = A y_{p} + B v_{r}^{*},

(47)

where

y_{p} ≜ ({\bar{r}}_{r}^{b / r}, {\dot{\bar{r}}}_{r}^{b / r}, \dots, {\bar{r}}_{r}^{b / r (h + 1)}) \in R^{3 (h + 2)}

,

B ≜ {[0_{3 \times 3 (h + 1)} I_{3} τ_{p}^{- (h + 1)}]}^{T} \in R^{3 (h + 2) \times 3},

τ_{p} \in R_{> 0}

is a time constant, and

\begin{matrix} A & ≜ [\begin{matrix} 0_{3 (h + 1) \times 3} & I_{3 (h + 1)} \\ 0_{3 \times 3} & A_{1} A_{2} \dots A_{h + 1} \end{matrix}] \in R^{3 (h + 2) \times 3 (h + 2)}, \\ A_{k + 1} & ≜ - \frac{(h + 1)!}{k! (h + 1 - k)!} I_{3} τ_{p}^{- (h + 1 - k)}, k \in {0, 1, \dots, h} . \end{matrix}

The initial position and velocity commands are set equal to the corresponding MAV initial conditions, while the remaining states of the filter are set equal to zero, i.e.,

y_{p} (0) ≜ (r_{r}^{b / r} (0), v_{r}^{b / r} (0), 0_{3}, \dots, 0_{3}) .

(48)

Analogous to

ψ^{*}

, the target velocity

v_{r}^{*} (t)

calculated by the CCO may be discontinuous [16]. Then, by using an LPF of order

h + 1

, we guarantee that the acceleration command

{\dot{\bar{v}}}_{r}^{b / r}

is minimally (

h - 1

)-times differentiable with respect to time and has a Lipschitz-continuous first-time derivative. As a result, the smoothness requirement of

{\dot{\bar{v}}}_{r}^{b / r}

is satisfied.

From the choice of

y_{p} (0)

, we have that

\tilde{r} (0) = 0_{3}

and

\tilde{v} (0) = 0_{3}

. Therefore, by properly designing the position control law (32),

\tilde{r}

and

\tilde{v}

can be limited, respectively, as

\begin{matrix} ∥ \tilde{r} (t) ∥ & \leq ϵ^{r}, \end{matrix}

(49)

\begin{matrix} ∥ \tilde{v} (t) ∥ & \leq ϵ^{v}, \end{matrix}

(50)

where

ϵ^{r} \in R_{\geq 0}

and

ϵ^{v} \in R_{\geq 0}

can be chosen as time-dependent functions. It is worth noting that

ϵ^{r}

and

ϵ^{v}

cannot be analytically calculated, but they can be approximated from numerical simulations of the position closed-loop system (30) and (37) using a great amount of values of the disturbance force inside the bounds provided in Assumption 4.

Assume that obstacle i and the MAV can be contained, respectively, in the closed spheres

B (C_{i}, ρ_{i})

and

B (B, ρ_{b})

, where

ρ_{i} \in R_{> 0}

and

ρ_{b} \in R_{> 0}

are of a given radius. Therefore, a collision between the MAV and obstacle i is assumed to take place if and only if

∥r_{r}^{b / r} (t) - r_{r}^{i / r} (t)∥ \leq ρ_{b i},

(51)

where

ρ_{b i} ≜ ρ_{b} + ρ_{i}

.

Using the position error bound (49) and the position error definition (28), condition (51) can be tightened to obtain

∥{\bar{r}}_{r}^{b / r} (t) - r_{r}^{i / r} (t)∥ \leq ρ_{b i} + ϵ^{r},

(52)

whose satisfaction implies that condition (51) is true.

To prevent collisions, the adopted strategy is to avoid satisfying (52) in a future time horizon of finite length

τ \in R_{> 0}

. To this end, we must predict

{\bar{r}}_{r}^{b / r} (t)

and

r_{r}^{i / r} (t)

inside the time interval

[t_{0}, t_{0} + τ]

, where

t_{0} \in R_{\geq 0}

is the current time. By using the length

τ

, we only consider the obstacles that can cause imminent collisions, i.e., collisions that can occur inside

[t_{0}, t_{0} + τ]

. This can be effective to reduce the computational burden when there are many obstacles. However, the parameter

τ

must be chosen based on the physical bounds of the obstacles’ trajectories and the MAV dynamics in such a way that the MAV can avoid a collision when it is detected as imminent.

From (47), one can see that the future values of

{\bar{r}}_{r}^{b / r}

are unknown once they depend on the future values of

v_{r}^{*}

, which are calculated in real time by the CCO and dependent on the unknown future behavior of the obstacles. In this context, we predict

{\bar{r}}_{r}^{b / r}

on the time interval

[t_{0}, t_{0} + τ]

by considering

v_{r}^{*}

as a constant. Therefore, to support the forthcoming derivations, Lemma 3 provides a unique solution to (47) with a given initial condition

y_{p} (t_{0})

, while considering

v_{r}^{*}

as a constant.

Lemma 3.

Considering

v_{r}^{*}

as a constant input, the solution of (47) in

t \in [t_{0}, \infty)

with initial condition

y_{p} (t_{0})

is given by

y_{p} (t) = e^{A δ t} y_{p} (t_{0}) + G (δ t) v_{r}^{*},

(53)

where

δ t ≜ t - t_{0}

,

G (δ t) ≜ [I_{3 (h + 2)} 0_{3 (h + 2) \times 3}] e^{\bar{A} δ t} {[0_{3 \times 3 (h + 2)} I_{3}]}^{T},

\bar{A} ≜ [\begin{matrix} A & B \\ 0_{3 \times 3 (h + 2)} & 0_{3 \times 3} \end{matrix}] \in R^{3 (h + 3) \times 3 (h + 3)} .

Proof.

See Appendix A. □

Considering

v_{r}^{*}

as a constant input, the predicted values of the MAV position command

{\bar{r}}_{r}^{b / r} (t)

, which we denote by

{\hat{\bar{r}}}_{r}^{b / r} (t) \in R^{3}

, can be obtained from (53) as

\begin{matrix} {\hat{\bar{r}}}_{r}^{b / r} (t) & = C_{1} e^{A δ t} y_{p} (t_{0}) + G_{1} (δ t) v_{r}^{*}, \end{matrix}

(54)

where

G_{1} (δ t) ≜ C_{1} G (δ t)

and

C_{1} ≜ [I_{3} 0_{3 \times 3 (h + 1)}] \in R^{3 \times 3 (h + 2)}

.

The earlier VO-based approaches generally assume that the obstacles keep a constant velocity. In practice, however, this assumption only ensures collision avoidance against obstacles with low acceleration capacity or under a large

τ

. Here, instead of predicting the obstacles’ trajectories using a first-order approximation, we consider a set of possible future trajectories for the obstacles based on estimates of their acceleration bounds calculated from

z_{1}^{i}

provided by the SMD. In this context, the predicted position

{\hat{r}}_{r}^{i / r} \in R^{3}

of obstacle i inside the time interval

[t_{0}, t_{0} + τ]

belongs to

\begin{matrix} R_{i} ≜ & \{r (t) = r_{r}^{i / r} (t_{0}) + δ t v_{r}^{i / r} (t_{0}) + {\hat{a}}_{i}^{\max} ϑ \frac{δ t^{2}}{2}, δ t \in [0, τ], ϑ \in B (0_{3}, 1)\}, \end{matrix}

(55)

where

{\hat{a}}_{i}^{\max} \in R_{\geq 0}

is a bound estimate of obstacle i acceleration.

Let

v_{r}^{i / r}

be n-times differentiable with respect to time, where

n \geq 1

, such that

v_{r}^{i / r (n)}

has a known Lipschitz constant vector

γ_{i} > 0_{3}

. Define a vector

γ \in R^{3}

such that

γ > γ_{i},

\forall i \in I

. To estimate the obstacle i acceleration, we use the high-order SMD [11,31]

\{\begin{matrix} {\dot{z}}_{0}^{i} = w_{0}^{i}, \\ w_{0}^{i} = - Λ_{0} Ψ {(γ)}^{1 / (n + 1)} Ψ (| z_{0}^{i} - v_{r}^{i / r} {|)}^{n / (n + 1)} sign (z_{0}^{i} - v_{r}^{i / r}) + z_{1}^{i}, \\ {\dot{z}}_{1}^{i} = w_{1}^{i}, \\ w_{1}^{i} = - Λ_{1} Ψ {(γ)}^{1 / n} Ψ (| η_{1}^{i} {|)}^{(n - 1) / n} sign (η_{1}^{i}) + z_{2}^{i}, \\ ⋮ \\ {\dot{z}}_{n - 1}^{i} = w_{n - 1}^{i}, \\ w_{n - 1}^{i} = - Λ_{n - 1} Ψ {(γ)}^{1 / 2} Ψ (| η_{n - 1}^{i} {|)}^{1 / 2} sign (η_{n - 1}^{i}) + z_{n}^{i}, \\ {\dot{z}}_{n}^{i} = - Λ_{n} Ψ (γ) sign (η_{n}^{i}), \end{matrix}

(56)

where

η_{k}^{i} ≜ z_{k}^{i} - w_{k - 1}^{i}

, with

k \in {1, \dots, n}

;

Λ_{j} \in R^{3 \times 3}

is a positive-definite diagonal matrix, with

j \in \{0, \dots, n\}

; and

Ψ (| \cdot |) ≜ diag (| \cdot |)

maps a vector into a diagonal matrix. The SMD (56) initial conditions are set to

z_{0}^{i} (0) = v_{r}^{i / r} (0)

and

z_{j}^{i} (0) = 0_{3}

,

\forall j \in {1, \dots, n}

.

Lemma 4

([31]). Consider the SMD (56). For any

Λ_{n}

satisfying

\begin{matrix} Λ_{n} 1_{3} > 1_{3}, \end{matrix}

there exists a set of positive-definite diagonal matrices

{Λ_{0}, \dots, Λ_{n - 1}}

that provides the finite-time convergence of

z_{k}^{i}

to

v_{i}^{(k)}

,

\forall k \in \{0, \dots, n\}

and

\forall i \in I

.

Note that, before the differentiator (56)’s convergence time, denoted by

t_{c}^{i} \in R_{> 0}

, there is no accurate information about the acceleration bound of obstacle i. In this context, we choose to calculate

{\hat{a}}_{i}^{\max}

as

{\hat{a}}_{i}^{\max} (t) ≜ \{\begin{matrix} \max_{t_{c}^{i} \leq δ \leq t} ∥ z_{1}^{i} (δ) ∥ & \forall t \geq t_{c}^{i}, \\ ∥ z_{1}^{i} (t) ∥ & \forall t < t_{c}^{i} . \end{matrix}

(57)

According to (57), before

t_{c}^{i}

,

{\hat{a}}_{i}^{\max}

is converging according to the SMD (56), and after

t_{c}^{i}

, by maximizing

∥ z_{1}^{i} ∥

over the time interval

[t_{c}^{i}, t]

,

{\hat{a}}_{i}^{\max}

becomes an accurate estimate of obstacle i’s acceleration bound inside the considered time interval. However, since

t_{c}^{i}

is difficult to calculate without conservativeness, a very intuitive choice to approximate it is by verifying when the SMD states enter a small neighborhood of the sliding manifold, i.e., when

∥ z_{0}^{i} - v_{r}^{i / r} ∥ + ∥ η_{1}^{i} ∥ + \dots + ∥ η_{n}^{i} ∥ < α,

(58)

where

α \in R_{> 0}

is satisfied.

Using

{\hat{\bar{r}}}_{r}^{b / r} (t)

and

{\hat{r}}_{r}^{i / r} (t)

given, respectively, by (54) and (55), the collision condition (52) can be further tightened to obtain

∥G_{1} (δ t) v_{r}^{*} - c_{i} (δ t) - {\hat{a}}_{i}^{\max} ϑ \frac{δ t^{2}}{2}∥ \leq ρ_{b i} + ϵ^{r},

(59)

where

c_{i} (δ t) ≜ - C_{1} e^{A δ t} y_{p} (t_{0}) + r_{r}^{i / r} (t_{0}) + δ t v_{r}^{i / r} (t_{0})

. In turn, we can also notice that the collision condition (59) is always satisfied if

∥G_{1} (δ t) v_{r}^{*} - c_{i} (δ t)∥ \leq ρ_{b i}^{+} (δ t),

(60)

where

ρ_{b i}^{+} (δ t) ≜ ρ_{b i} + ϵ^{r} + {\hat{a}}_{i}^{\max} \frac{δ t^{2}}{2}

.

From the definition of

G_{1} (δ t)

, one can see that it is nonsingular

\forall δ t > 0

. Then, by multiplying both sides of (60) by

∥ G_{1}^{- 1} (δ t) ∥

, it can be shown that (60) is satisfied if

∥v_{r}^{*} - G_{1}^{- 1} (δ t) c_{i} (δ t)∥ \leq ∥ G_{1}^{- 1} (δ t) ∥ ρ_{b i}^{+} (δ t) .

(61)

Now, using (61), define a set of target velocities

v_{r}^{*}

that may result in a collision with obstacle i within

(t_{0}, t_{0} + τ]

as

{CCO}_{b i}^{τ} ≜ ⋃_{0 < δ t \leq τ} B (G_{1}^{- 1} (δ t) c_{i} (δ t), ∥ G_{1}^{- 1} (δ t) ∥ ρ_{b i}^{+} (δ t)) .

Therefore, the set of target velocities

v_{r}^{*}

that may result in a collision with any obstacle within

(t_{0}, t_{0} + τ]

is given by

{CCO}_{b}^{τ} ≜ ⋃_{\forall i \in I} {CCO}_{b i}^{τ} .

In other words, we can guarantee collision avoidance against accelerated obstacles by merely continuously choosing

v_{r}^{*} \notin {CCO}_{b}^{τ}

, thus satisfying the position constraint (9).

To consider the linear velocity constraint (10), we can rewrite it in terms of the target velocity

v_{r}^{*}

. To this end, note that, given the choice of

y_{p} (0)

in (48) and the design of the position reference filter (47) using the adopted overdamped LPF,

{\bar{v}}_{r}^{b / r}

presents no overshoot relative to the input

v_{r}^{*}

. Consequently, by choosing

v_{r}^{*} \in V

, we guarantee that

{\bar{v}}_{r}^{b / r} \in V

since

V

is a convex set. Then, from the velocity tracking error definition (29) and bound (50), one can see that by choosing

v_{r}^{*} \in V ⊖ B (0_{3}, ϵ^{v}),

(62)

it holds that

v_{r}^{b / r} \in V

, thus respecting the velocity constraint (10). Then, the position and velocity constraints (9) and (10) can be respected by continuously choosing

v_{r}^{*} \in V_{R} ≜ (V ⊖ B (0_{3}, ϵ^{v})) ∖ {CCO}_{b}^{τ} .

(63)

It should be noted that the set

V_{R}

is generally non-convex.

On the other hand, to account for the rotor thrust constraint (11), it can be seen from the control allocation Equation (8) and Assumption 1 that (11) results in the following control command constraints:

[\begin{matrix} \bar{f} \\ {\bar{τ}}_{b}^{c} \end{matrix}] \in U,

(64)

where

U ≜ \{ν \in R^{n_{r}} | ν = Γ f^{r}, f^{r} \in F\}

.

Knowing that

\bar{f} = ∥ {\bar{f}}_{r}^{c} ∥

and the attitude control loop has an integral sliding mode, i.e.,

D^{b / r} \equiv {\bar{D}}^{b / r}

and

ω_{b}^{b / r} \equiv {\bar{ω}}_{b}^{b / r}

, by substituting the control torque command (23) and the control force command (32) into (64), we obtain

[\begin{matrix} ∥g e_{3} + {\dot{\bar{v}}}_{r}^{b / r} - K_{2} \tilde{r} - K_{3} \tilde{v} - m^{- 1} {\hat{f}}_{r}^{d}∥ \\ {\dot{\bar{ω}}}_{b}^{b / r} - J_{b}^{- 1} [J_{b} {\bar{ω}}_{b}^{b / r} \times] {\bar{ω}}_{b}^{b / r} - 2 K_{1} sign (s) \end{matrix}] \in H U,

(65)

where

H ≜ [\begin{matrix} m^{- 1} & 0_{1 \times 3} \\ 0_{3 \times 3} & J_{b}^{- 1} \end{matrix}] .

To respect the rotor thrust constraint (11), our strategy is to guarantee the satisfaction of (65) in the time horizon

τ

. To this end, we have to predict the left side of (65). However, note that we cannot predict the terms

K_{2} \tilde{r}

,

K_{3} \tilde{v}

, and

{\hat{f}}_{r}^{d}

. In this sense, using the tracking error bounds (49) and (50), Assumption 4, and the definition of

sign (s)

, condition (65) can be tightened to obtain

[\begin{matrix} ∥{\dot{\bar{v}}}_{r}^{b / r} + g e_{3}∥ \\ {\dot{\bar{ω}}}_{b}^{b / r} - J_{b}^{- 1} [J_{b} {\bar{ω}}_{b}^{b / r} \times] {\bar{ω}}_{b}^{b / r} \end{matrix}] \in H U ⊖ [\begin{matrix} Δ F \\ K_{1} \end{matrix}],

(66)

where

K_{1} ≜ \{γ \in R^{3} | | γ | = 2 K_{1} 1_{3}\}

and

\begin{matrix} Δ F & ≜ \{x \in R | | x | \leq ∥ K_{2} ∥ ϵ^{r} + ∥ K_{3} ∥ ϵ^{v} + m^{- 1} ∥ f^{\max} ∥\} . \end{matrix}

A prediction for

{\dot{\bar{v}}}_{r}^{b / r}

inside the time interval

[t_{0}, t_{0} + τ]

can be directly obtained from (53). On the other hand, a prediction for

{\bar{ω}}_{b}^{b / r}

cannot be exactly obtained. To support this statement, note that for the considered class of underactuated MAVs, the angular velocity command is given by

{\bar{ω}}_{b}^{b / r} = e_{3} \times {\bar{D}}^{b / r} {\dot{\bar{n}}}_{r} + \dot{\bar{ψ}} e_{3} .

(67)

Then, note that

{\bar{D}}^{b / r}

depends on

{\bar{n}}_{r}

(see (25)–(27)), which is a unit vector that has the same direction and orientation of the control force command

{\bar{f}}_{r}^{c}

(see (12)). However, it can be seen from (32) that

{\bar{f}}_{r}^{c}

cannot be exactly predicted since the future values of the terms

K_{2} \tilde{r}

,

K_{3} \tilde{v}

, and

{\hat{f}}_{r}^{d}

are unknown. As a result, we cannot obtain an exact prediction for

{\bar{ω}}_{b}^{b / r}

inside

[t_{0}, t_{0} + τ]

.

On the basis of the above discussion,

{\bar{n}}_{r}

can be rewritten in terms of a nominal part

{\bar{n}}_{r}^{n} \in R^{3}

and an unknown part

Δ {\bar{n}}_{r} \in R^{3}

, i.e.,

{\bar{n}}_{r} = {\bar{n}}_{r}^{n} + Δ {\bar{n}}_{r},

(68)

where

{\bar{n}}_{r}^{n} ≜ \frac{{\dot{\bar{v}}}_{r}^{b / r} + g e_{3}}{∥ {\dot{\bar{v}}}_{r}^{b / r} + g e_{3} ∥} .

Using (68), Equation (67) can be rewritten as

{\bar{ω}}_{b}^{b / r} = {\bar{ω}}_{b}^{n} + Δ {\bar{ω}}_{b},

(69)

where

{\bar{ω}}_{b}^{n} ≜ e_{3} \times {\bar{D}}^{b / r, n} {\dot{\bar{n}}}_{r}^{n} + \dot{\bar{ψ}} e_{3}

, being

{\bar{D}}^{b / r, n}

calculated in the same way as

{\bar{D}}^{b / r}

in (25) but considering

{\bar{n}}_{r}^{n}

instead of

{\bar{n}}_{r}

, and

Δ {\bar{ω}}_{b} \in R^{3}

is an unknown part.

Substituting (69), condition (66) can be rewritten as

[\begin{matrix} ∥{\dot{\bar{v}}}_{r}^{b / r} + g e_{3}∥ \\ {\dot{\bar{ω}}}_{b}^{n} - J_{b}^{- 1} [J_{b} {\bar{ω}}_{b}^{n} \times] {\bar{ω}}_{b}^{n} + δ \bar{ω} \end{matrix}] \in H U ⊖ [\begin{matrix} Δ F \\ K_{1} \end{matrix}],

(70)

where

δ \bar{ω} ≜ Δ {\dot{\bar{ω}}}_{b} - J_{b}^{- 1} [J_{b} Δ {\bar{ω}}_{b} \times] {\bar{ω}}_{b}^{b / r} - J_{b}^{- 1} [J_{b} {\bar{ω}}_{b}^{n} \times] Δ {\bar{ω}}_{b} .

Considering that

δ \bar{ω} \in Δ \bar{W}

, condition (70) can be tightened one last time to obtain

[\begin{matrix} ∥{\dot{\bar{v}}}_{r}^{b / r} + g e_{3}∥ \\ {\dot{\bar{ω}}}_{b}^{n} - J_{b}^{- 1} [J_{b} {\bar{ω}}_{b}^{n} \times] {\bar{ω}}_{b}^{n} \end{matrix}] \in H U ⊖ Δ U,

(71)

where

Δ U ≜ [\begin{matrix} Δ F \\ K_{1} \oplus Δ \bar{W} \end{matrix}] .

Regarding the set

Δ \bar{W}

, consider the following remark.

Remark 3.

An analytic expression for

Δ {\bar{ω}}_{b}

is difficult to obtain since the relation between

{\bar{D}}^{b / r}

and

{\bar{n}}_{r}

is strongly nonlinear (see (25)–(27)). Consequently, it is complex to obtain an analytical expression for the set

Δ \bar{W}

. Therefore, we approximate it based on computer simulations.

Now, we are able to predict the left side of (71) inside the time interval

[t_{0}, t_{0} + τ]

. In this sense, let us rewrite (71) in terms of the predicted values for each variable, i.e.,

[\begin{matrix} ∥{\hat{\bar{a}}}_{r}^{b / r} + g e_{3}∥ \\ {\dot{\hat{\bar{ω}}}}_{b}^{n} - J_{b}^{- 1} [J_{b} {\hat{\bar{ω}}}_{b}^{n} \times] {\hat{\bar{ω}}}_{b}^{n} \end{matrix}] \in H U ⊖ Δ U,

(72)

where

{\hat{\bar{a}}}_{r}^{b / r} \in R^{3}

and

{\hat{\bar{ω}}}_{b}^{n} \in R^{3}

are, respectively, the predictions of

{\dot{\bar{v}}}_{r}^{b / r}

and

{\bar{ω}}_{b}^{n}

inside

[t_{0}, t_{0} + τ]

.

The acceleration command prediction

{\hat{\bar{a}}}_{r}^{b / r}

can be obtained from (53) using the same strategy adopted to predict the position command

{\bar{r}}_{r}^{b / r} (t)

, i.e., considering

v_{r}^{*}

as a constant input inside the time interval

[t_{0}, t_{0} + τ]

, yielding

\begin{matrix} {\hat{\bar{a}}}_{r}^{b / r} (t) & = C_{3} e^{A δ t} y_{p} (t_{0}) + G_{3} (δ t) v_{r}^{*}, \end{matrix}

(73)

where

G_{3} (δ t) ≜ C_{3} G (δ t)

and

C_{3} ≜ [0_{3 \times 6} I_{3} 0_{3 \times 3 (h - 1)}] \in R^{3 \times 3 (h + 2)}

.

On the other hand, note that the angular velocity prediction

{\hat{\bar{ω}}}_{b}^{n}

can be calculated by differentiating

{\bar{D}}^{b / r, n}

with respect to time. Here, we choose to predict

{\bar{D}}^{b / r, n} (t)

inside the interval

[t_{0}, t_{0} + τ]

using the Euler angles 1-2-3 parameterization, denoted by

{\hat{\bar{α}}}^{b / r} (t) ≜ (\hat{\bar{ϕ}} (t), \hat{\bar{θ}} (t), \hat{\bar{ψ}} (t))

. The prediction of the heading command

\hat{\bar{ψ}}

is obtained by adopting the same strategy used to predict

{\bar{r}}_{r}^{b / r} (t)

and

{\dot{\bar{v}}}_{r}^{b / r} (t)

, i.e., considering

ψ^{*}

as a constant input. Under this assumption,

\hat{\bar{ψ}} (t)

can be calculated from the actual time instant

t_{0}

by solving Equation (45), yielding

\begin{matrix} \hat{\bar{ψ}} (t) & = C_{1}^{ψ} e^{E δ t} y_{ψ} (t_{0}) + C_{1}^{ψ} E^{- 1} (e^{E δ t} - I_{h}) F ψ^{*}, \end{matrix}

(74)

where

C_{1}^{ψ} ≜ (1, 0_{1 \times (h - 1)})

.

The predictions

\hat{\bar{ϕ}}

and

\hat{\bar{θ}}

inside

[t_{0}, t_{0} + τ]

can be obtained from Equations (26) and (27) and the definition of

{\bar{n}}_{r}^{n}

given after (68), being given by

\begin{matrix} \hat{\bar{ϕ}} & = - {tan}^{- 1} \frac{e_{2}^{T} {\hat{\bar{n}}}_{r}^{n}}{e_{3}^{T} {\hat{\bar{n}}}_{r}^{n}}, \\ \hat{\bar{θ}} & = \sin^{- 1} (e_{1}^{T} {\hat{\bar{n}}}_{r}^{n}), \end{matrix}

where

{\hat{\bar{n}}}_{r}^{n} = \frac{{\hat{\bar{a}}}_{r}^{b / r} + g e_{3}}{∥ {\hat{\bar{a}}}_{r}^{b / r} + g e_{3} ∥},

being

{\hat{\bar{a}}}_{r}^{b / r}

the acceleration command prediction defined in Equation (73). Then, the angular velocity prediction

{\hat{\bar{ω}}}_{b}^{n}

can be immediately calculated by

{\hat{\bar{ω}}}_{b}^{n} = A_{α} {\dot{\hat{\bar{α}}}}^{b / r},

(75)

where

A_{α} ≜ [\begin{matrix} \cos (\hat{\bar{θ}}) \cos (\hat{\bar{ψ}}) & \sin (\hat{\bar{ψ}}) & 0 \\ - \sin (\hat{\bar{ψ}}) \cos (\hat{\bar{θ}}) & \cos (\hat{\bar{ψ}}) & 0 \\ \sin (\hat{\bar{θ}}) & 0 & 1 \end{matrix}] .

The angular acceleration prediction

{\dot{\hat{\bar{ω}}}}_{b}^{n}

is simply obtained by differentiating (75) with respect to time.

The continuous selection of

v_{r}^{*}

and

ψ^{*}

such that condition (72) is satisfied in

[t_{0}, t_{0} + τ]

guarantees the satisfaction of the control command constraints (64) and, consequently, the rotor thrust constraint (11).

Lastly, to completely solve Problem 1, we formulate a bilevel optimization problem [41] that calculates

v_{r}^{*}

and

ψ^{*}

, giving priority to collision avoidance. In this sense, we compute

v_{r}^{*}

and

ψ^{*}

as the solution of the following minimization that aims to satisfy the linear velocity and rotor thrust constraints, respectively, given by (10) and (11), and make the MAV reach its desired position

{\overset{ˇ}{r}}_{r}^{b / r}

and desired heading

\overset{ˇ}{ψ}

without collision:

\begin{matrix} (v_{r}^{*}, ψ^{*}) = \underset{(ν, β)}{\arg \min} & ∥ v^{pref} - ν ∥ \\ s . t . & ν \in V_{R}, \\ β \in Z, \end{matrix}

(76)

where

v^{pref} \in R^{3}

is a preferred velocity and

Z

is the set of solutions to the

ν

-parameterized problem

\begin{matrix} min_{z \in (π, π]} & | \overset{ˇ}{ψ} - z | \\ s . t . & \hat{U} (t; ν, z) \in H U ⊖ Δ U, \forall t \in (t_{0}, t_{0} + τ], \end{matrix}

with

\hat{U} (t; ν, β) ≜ [\begin{matrix} ∥{\hat{\bar{a}}}_{r}^{b / r} (t; ν) + g e_{3}∥ \\ {\dot{\hat{\bar{ω}}}}_{b}^{n} (t; ν, z) - J_{b}^{- 1} [J_{b} {\hat{\bar{ω}}}_{b}^{n} (t; ν, z) \times] {\hat{\bar{ω}}}_{b}^{n} (t; ν, z) \end{matrix}] .

Here,

v^{pref}

is a vector that points to

{\overset{ˇ}{r}}_{r}^{b / r}

and has a magnitude equal to the maximum admissible velocity in this direction. To provide a smooth deceleration phase, the magnitude of

v^{pref}

is gradually decreased according to the remaining distance when the vehicle is near

{\overset{ˇ}{r}}_{r}^{b / r}

. However, we highlight that the design of

v^{pref}

can be done using other strategies [26,42].

When obstacles are present, the set

V_{R}

is generally non-convex and can even become empty in very dense scenarios. In this case, it is appropriate to choose a target velocity and heading that result in the largest time to collide, while respecting constraints (10) and (11). The shortest time for a collision to occur, denoted by

t_{col} \in R_{> 0}

, as a result of choosing a certain target velocity

ν

is given by

\begin{matrix} t_{col} (ν) = min_{0 < δ \leq τ} & δ \\ s . t . & ν \in {CCO}_{b}^{δ} . \end{matrix}

Therefore, in case there is no solution to (76), i.e.,

V_{R} = \emptyset

, we propose to choose the target velocity and heading that imply the largest

t_{col}

, while respecting constraints (10) and (11), i.e.,

\begin{matrix} (v_{r}^{*}, ψ^{*}) = \underset{(ν, β)}{\arg \max} & t_{col} (ν) \\ s . t . & ν \in V ⊖ B (0_{3}, ϵ^{v}), \\ β \in Z . \end{matrix}

(77)

In general, finding the global minima of a non-convex optimization problem, such as the one in (76), is computationally intensive. Attempting to reduce the computational burden, we approximate the solution of (76) using a fixed number of velocity samples calculated from a uniform distribution over

V ⊖ B (0_{3}, ϵ^{v})

and creating an equally spaced fixed number of heading angle samples inside the interval

(- π, π]

. When (76) has no solution, we use the same sampling strategy to solve the optimization problem (77). Algorithm 1 is proposed to solve (76) or (77).

Algorithm 1: Proposed sampling algorithm to solve the optimization problem (76) (or (77) if (76) has no solution)

Data:

n_{v} \leftarrow

number of velocity samples

n_{ψ} \leftarrow

number of heading angle samples

v^{pref} \leftarrow

preferred velocity

\overset{ˇ}{ψ} \leftarrow

desired heading

{CCO}_{b}^{τ} \leftarrow ⋃ {CCO}_{b i}^{τ}, \forall i \in I

H U ⊖ Δ U \leftarrow

admissible set

Result:

V^{*} \leftarrow

a set of

n_{v}

random samples inside

V ⊖ B (0_{3}, ϵ^{v})

V_{o}^{*} \leftarrow

sort

V^{*}

in ascending cost order

A^{*} \leftarrow

a set of

n_{ψ}

equally spaced samples inside

(- π, π]

A_{o}^{*} \leftarrow

sort

A^{*}

in ascending cost order

k \leftarrow 0

return the pair

(v, ψ)

that implies the largest

t_{col}

Algorithm 1 generates

n_{v}

random velocity samples inside

V ⊖ B (0_{3}, ϵ^{v})

and

n_{ψ}

equally spaced samples inside

(- π, π]

, sorting them in ascending cost order. Then, it sequentially checks that each velocity sample does not result in a collision, i.e., if it does not belong to

{CCO}_{b}^{τ}

. When a velocity sample is collision-free, the algorithm tries to choose a heading sample that, given this collision-free velocity, respects the rotors’ thrust constraints. If such heading exists, the algorithm returns the corresponding pair of velocity and heading samples. On the other hand, when a velocity sample is not collision-free, the algorithm tries to choose a heading sample that respects the rotors’ thrust constraints and, if such heading exists, it calculates the time to collision for this particular pair of samples. Then, in the event that any velocity sample is free of collision, the algorithm returns the pair of samples that results in the minimum time to collision and respects the velocity and rotors’ thrust constraints. For this case, the main for loop in Algorithm 1 iterates

n_{v}

times and, within each iteration, the nested else condition is executed. On the other hand, when the first velocity sample in

V_{o}^{*}

is collision-free and there is a heading sample that, given this first velocity sample, respects the rotors’ thrust constraints, the main for loop is executed only one time.

5. Numerical Simulation

The effectiveness of the proposed method is evaluated in a numerical simulation implemented on the basis of the nonlinear equations of motion (1) and (4)–(8) and coded in MATLAB utilizing the first-order explicit Euler method with a sampling time of 0.01 s. The vehicle adopted for the simulation is an x-shaped quadcopter with a mass of

1 kg

, arm length of 0.5 m, and inertia matrix

J_{b} = diag (0.015, 0.015, 0.03)

kg m

^{2}

.

The proposed method is compared with another one that uses the same control strategy and the original CCO [30] as the guidance strategy. To compare the methods, we conduct a Monte Carlo simulation where the quadcopter has an initial position

(0, 0, 5)

m, zero initial velocity, and has to go to the desired position

(38, 0, 5)

m, while avoiding collision with 270 quadcopters that are used to represent the obstacles, as depicted in Figure 5. The obstacles are arranged between the MAV’s initial and desired positions in thirty evenly spaced groups of nine. The kth group, where

k \in {1, \dots, 30}

, contains a static obstacle that has the randomly uniformly selected position offsets

Δ y_{b, k} \in [- 6, 6]

m and

Δ z_{b, k} \in [- 2, 7]

m relative to the line connecting the MAV’s initial and desired positions in the directions of

{\vec{y}}_{r}

and

{\vec{z}}_{r}

, respectively. The remaining obstacles are circularly and uniformly distributed in the plane

{\vec{y}}_{r}

–

{\vec{z}}_{r}

and rotate around the static obstacle with a randomly uniformly selected radius

ρ_{b, k} \in [2, 5]

and angular velocity

{\vec{ω}}_{k} \in ([- 0.4, - 0.1] \cup [0.1, 0.4]) {\vec{x}}_{r}

.

Figure 5. Schematic illustration of the proposed Monte Carlo simulation scenario. (a) Overview of the scenario showing the MAV, its target position, and the groups of obstacles. (b) The kth group of obstacles in detail, where

k \in {1, \dots, 30}

.

The vehicle can be circumscribed by a sphere of radius

0.5

m, while all obstacles can be circumscribed by spheres of radius

0.7

m. The controlled quadcopter has the following linear velocity and rotor thrust admissible sets:

\begin{matrix} V & ≜ \{v \in R^{3} | ∥ v ∥ \leq 5\}, \\ F & ≜ \{f^{r} \in R^{4} | f_{i} \in [1.25, 4], \forall i \in {1, \dots, 4}\} . \end{matrix}

Moreover, the vehicle is subject to the disturbances

\begin{matrix} f_{r}^{d} (t) & = 0.25 {[1, - 1, - 1]}^{T} \sin (0.5 t) N, \\ τ_{b}^{d} (t) & = 0.005 {[1, - 1, - 1]}^{T} \sin (0.5 t) Nm . \end{matrix}

Table 1 shows the adopted parameters for the proposed guidance and control strategies.

Table 1. Parameters of the proposed method.

Moreover, we have used in Algorithm 1 a total of 800 target velocity samples calculated using a uniform distribution over

V ⊖ B (0_{3}, ϵ^{v})

and 50 equally spaced target heading samples inside the interval

(- π, π]

.

The MAV receives the desired heading signal (in degrees)

\overset{ˇ}{ψ} (t) = \{\begin{matrix} 0, & if 0 \leq t < 2, \\ 50, & if 2 \leq t < 5, \\ 10, & if 5 \leq t < 8, \\ - 30, & if t \geq 8 . \end{matrix}

In order to conduct a more comprehensive analysis of the proposed method, we present in Figure 6 and Figure 7 the relevant results from a single iteration of the Monte Carlo simulation. A three-dimensional animation of this simulation is available at the following link: https://youtu.be/d7K4e6Ytn8M (accessed on 23 September 2023).

Figure 6. Constraint plots for the proposed method. (a) Distance from the quadcopter to the obstacles. (b) Euclidean norm of the quadcopter velocity and its bound. (c) Rotors’ thrusts and their bounds.

Figure 7. Position tracking performance, attitude tracking performance, and control commands for the proposed method. (a) Position tracking performance. (b) Attitude tracking performance. (c) Control force command. (d) Control torque command. Legend: ― vector first component, ― vector second component, and ― vector third component.

Figure 6a–c, respectively, show the distance between the quadcopter and the obstacles, the Euclidean norm of the quadcopter’s velocity and its bound, and the rotors’ thrust and their bounds. It can be seen that the proposed method guarantees the satisfaction of the position constraint (9), the linear velocity constraint (10), and the rotors’ thrust constraint (11), which demonstrates the effectiveness and robustness of the proposed method.

Figure 7 presents the position and attitude tracking performance as well as the force and torque control commands. In Figure 7a, it can be seen that the position commands are satisfactorily tracked despite the presence of the disturbance force. Figure 7b shows that the attitude commands are exactly tracked during the entire time despite the presence of the disturbance torque, thus confirming that by combining the proposed sliding mode attitude and smooth position control laws, an integral sliding mode exists in the attitude loop. This fact is also confirmed by Figure 8, which shows that the Euclidean norm of the attitude sliding variable

s

is restricted to a relatively small neighborhood of the origin during the entire time. Figure 7c and Figure 7d, respectively, show the force and torque control commands. These plots confirm the smoothness and the switching behavior of the force and torque control commands, respectively.

Figure 8. Euclidean norm of the attitude sliding variable

s

.

5.1. Monte Carlo Simulation

We have performed, for each method, 100 iterations of the above-described Monte Carlo simulation. For both methods, we have analyzed at each iteration whether collisions have occurred and their number. For the proposed method, we have also analyzed whether the linear velocity and rotors’ thrust constraints are satisfied. Table 2 shows the Monte Carlo simulation results. It presents, for both methods, the collision rate (

C R

)

C R ≜ \frac{n_{c}}{100},

where

n_{c} \in Z_{\geq 0}

is the number of iterations where collisions have occurred, and it also shows the average number of collisions per iteration (

A C I

)

A C I ≜ \frac{n_{col}}{100} C R,

where

n_{col} \in Z_{\geq 0}

is the total number of collisions that have occurred in the 100 iterations. For the proposed method, Table 2 also shows the percentage of iterations in which the linear velocity and rotor thrust constraints are respected.

Table 2. Monte Carlo simulation results for the proposed and original CCO methods. N/A: not applicable.

The results in Table 2 confirm the effectiveness of the proposed method in respecting the linear velocity and rotors’ thrust constraints. Moreover, one can see that, by observing the obstacles’ acceleration and considering the vehicle position tracking error bound

ϵ^{r}

, the proposed method is more effective than the original CCO in preventing collisions. We emphasize that this is remarkable since the original CCO, by not considering the rotors’ thrust constraints, has more control force and torque available to avoid collisions. To support this statement, we also ran 100 iterations for the proposed method, disregarding the rotors’ thrust constraints. In this simulation, the proposed method had a

C R = 0 %

, demonstrating that the previous

C R

of 25% was only related to the inability to avoid collisions due to the control force and torque limitations imposed by the rotors’ thrust constraints.

5.2. Discussion

The results of the numerical simulations demonstrate the efficiency of the proposed method in guiding underactuated MAVs subject to model uncertainties/disturbances as well as velocity and rotors’ thrust constraints in the presence of obstacles that can accelerate. These results have practical importance given the recent growing interest in autonomous flight and the number of aircraft sharing the same air space. On the other hand, the performed simulation study, although comprehensive, did not consider sensor noise and inaccuracies. These uncertainties present in real-world scenarios could be immediately considered in the proposed strategy by suitably increasing the MAV tracking error bounds, the obstacles’ radii, and the guidance time horizon

τ

. Moreover, it is worth mentioning that the proposed formulation considers that the MAV and the obstacles can be enclosed by spheres. While this is not a significant limitation, as more complex shapes can be approximated using a collection of spheres [43], there are situations where employing this approach might lead to an increased level of complexity to represent the obstacles.

6. Conclusions

This paper proposes robust guidance and control methods for underactuated MAVs equipped with fixed rotors subject to model uncertainties/disturbances and linear velocity and rotor thrust constraints, in the presence of accelerated obstacles. The attitude and position control laws are designed using a hierarchical SMC scheme that enforces the attitude-position TSS. The former is designed using a multi-input ISMC strategy and the latter is designed using a proportional–derivative policy combined with an SMDO. On the other hand, the proposed guidance strategy is based on the CCO and designed using a constraint-tightening approach for robustness against uncertainties/disturbances and a high-order SMD to robustly estimate the maximum accelerations of the obstacles. A Monte Carlo numerical simulation was performed using a quadcopter to compare the proposed method with another one that uses the same attitude and position control design and the original CCO in the guidance level. The proposed guidance method outperformed the original CCO in avoiding collisions with moving obstacles that can accelerate and, additionally, has been shown to be very effective in respecting linear velocity and rotor thrust constraints. In future works, the proposed method can be evaluated experimentally and applied to other mobile robotics systems. Moreover, Algorithm 1 can be further improved in performance, aiming at its implementation in embedded systems.

Author Contributions

Methodology, J.A.R.J.; conceptualization, J.A.R.J. and D.A.S.; writing—original draft preparation, J.A.R.J.; writing—review and editing, J.A.R.J. and D.A.S.; supervision, D.A.S.; funding acquisition, D.A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the São Paulo Research Foundation (FAPESP) under grant 2019/053340; the Coordination of Superior Level Staff Improvement (CAPES), EMBRAER S.A., and the Aeronautics Institute of Technology (ITA) under the doctorate scholarship of the Academic-Industrial Graduate Program (DAI); the National Council for Scientific and Technological Development (CNPq), under grant 304300/2021-7; and the Funding Authority for Studies and Projects (FINEP) under grant 01.22.0069.00.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

AVO	Acceleration Velocity Obstacles
CCO	Continuous Control Obstacles
GSMC	Global Sliding Mode Control
ISMC	Integral Sliding Mode Control
LPF	Low-Pass Filter
LQR	Linear Quadratic Regulator
MAV	Multirotor Aerial Vehicle
NMPC	Nonlinear Model Predictive Control
PD	Proportional–Derivative
RRT	Rapidly Exploring Random Tree
SMC	Sliding Mode Control
SMDO	Sliding Mode Disturbance Observer
SMD	Sliding Mode Differentiator
TSS	Time-Scale Separation
VO	Velocity Obstacles

Appendix A

Proof of Lemma 3.

The solution of the position reference filter differential Equation (47) is given by

y_{p} (t) = e^{A δ t} y_{p} (t_{0}) + \int_{t_{0}}^{t} e^{A (t - τ)} B v_{r}^{*} d τ,

(A1)

where

δ t ≜ t - t_{0}

.

The integral present in (A1) cannot be directly calculated since matrix

A

is singular due to the position reference filter integrator. To analytically calculate (A1), we consider

v_{r}^{*}

as constant and define the vector

y_{p}^{a} ≜ (y_{p}, v_{r}^{*}) \in R^{3 (h + 3)}

. Therefore, using (47), we can write the dynamic model

{\dot{y}}_{p}^{a} = \bar{A} y_{p}^{a},

(A2)

where

\bar{A} ≜ [\begin{matrix} A & B \\ 0_{3 \times 3 (h + 2)} & 0_{3 \times 3} \end{matrix}] \in R^{3 (h + 3) \times 3 (h + 3)} .

Then, the solution of (A2) is

y_{p}^{a} (t) = e^{\bar{A} δ t} y_{p}^{a} (0)

, where

e^{\bar{A} δ t} = [\begin{matrix} e^{A δ t} & \int_{t_{0}}^{t} e^{A (t - τ)} B d τ \\ 0_{3 \times 3 (h + 2)} & I_{3} \end{matrix}] .

(A3)

Using (A3), Equation (A1) can be rewritten as

y_{p} (t) = e^{A δ t} y_{p} (t_{0}) + G (δ t) v_{r}^{*},

where

G (δ t) ≜ [I_{3 (h + 2)}, 0_{3 (h + 2) \times 3}] e^{\bar{A} δ t} [\begin{matrix} 0_{3 (h + 2) \times 3} \\ I_{3} \end{matrix}] .

Thus, we complete the proof. □

References

Singireddy, S.R.R.; Daim, T.U. Technology Roadmap: Drone Delivery—Amazon Prime Air. In Infrastructure and Technology Management: Contributions from the Energy, Healthcare and Transportation Sectors; Springer International Publishing: Cham, Switzerland, 2018; pp. 387–412. [Google Scholar]
Rajendran, S.; Srinivas, S. Air taxi service for urban mobility: A critical review of recent developments, future challenges, and opportunities. Transp. Res. Part Logist. Transp. Rev. 2020, 143, 102090. [Google Scholar] [CrossRef]
Santos, D.A.; Lagoa, C.M. Wayset-based guidance of multirotor aerial vehicles using robust tube-based model predictive control. ISA Trans. 2022, 128, 123–135. [Google Scholar] [CrossRef] [PubMed]
Utkin, V.I. Sliding Modes in Control and Optimization; Springer: Berlin/Heidelberg, Germany, 1992. [Google Scholar]
Drajunov, S.V.; Utkin, V.I. Sliding mode control in dynamic systems. Int. J. Control 1991, 55, 1029–1037. [Google Scholar] [CrossRef]
Besnard, L.; Shtessel, Y.B.; Landrum, B. Quadrotor vehicle control via sliding mode controller driven by sliding mode disturbance observer. J. Frankl. Inst. 2012, 349, 658–684. [Google Scholar] [CrossRef]
Muñoz Palacios, F.; Espinoza Quesada, E.S.; González, I.; Salazar, S.; Lozano, R. Robust Trajectory Tracking for Unmanned Aircraft Systems using a Nonsingular Terminal Modified Super-Twisting Sliding Mode Controller. J. Intell. Robot. Syst. 2019, 93, 55–72. [Google Scholar] [CrossRef]
Silva, A.L.; Santos, D.A. Fast Nonsingular Terminal Sliding Mode Flight Control for Multirotor Aerial Vehicles. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 4288–4299. [Google Scholar] [CrossRef]
Labbadi, M.; Cherkaoui, M. Robust adaptive nonsingular fast terminal sliding-mode tracking control for an uncertain quadrotor UAV subjected to disturbances. ISA Trans. 2020, 99, 290–304. [Google Scholar] [CrossRef]
Wang, X.; Sun, S.; van Kampen, E.J.; Chu, Q. Quadrotor Fault Tolerant Incremental Sliding Mode Control driven by Sliding Mode Disturbance Observers. Aerosp. Sci. Technol. 2019, 87, 417–430. [Google Scholar] [CrossRef]
Ricardo, J.A., Jr.; Santos, D.A. Smooth second-order sliding mode control for fully actuated multirotor aerial vehicles. ISA Trans. 2022, 129, 169–178. [Google Scholar] [CrossRef]
Slotine, J.J.E. The Robust Control of Robot Manipulators. Int. J. Robot. Res. 1985, 4, 49–64. [Google Scholar] [CrossRef]
Lu, Y.S.; Chen, J.S. Design of a global sliding-mode controller for a motor drive with bounded control. Int. J. Control 1995, 62, 1001–1019. [Google Scholar] [CrossRef]
Utkin, V.; Shi, J. Integral sliding mode in systems operating under uncertainty conditions. In Proceedings of the 35th IEEE Conference on Decision and Control, Kobe, Japan, 11–13 December 1996; Volume 4, pp. 4591–4596. [Google Scholar]
Bartoszewicz, A. Time-varying sliding modes for second-order systems. IEE Proc.—Control Theory Appl. 1996, 143, 455–462. [Google Scholar] [CrossRef]
Ricardo, J.A., Jr.; Santos, D.A. Robot Guidance and Control Using Global Sliding Modes and Acceleration Velocity Obstacles. In Proceedings of the International Workshop on Variable Structure Systems and Sliding Mode Control, Rio de Janeiro, Brazil, 11–14 September 2022. [Google Scholar]
Altug, E.; Ostrowski, J.; Mahony, R. Control of a quadrotor helicopter using visual feedback. In Proceedings of the 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292), Washington, DC, USA, 11–15 May 2002; Volume 1, pp. 72–77. [Google Scholar]
Bertrand, S.; Guénard, N.; Hamel, T.; Piet-Lahanier, H.; Eck, L. A hierarchical controller for miniature VTOL UAVs: Design and stability analysis using singular perturbation theory. Control Eng. Pract. 2011, 19, 1099–1108. [Google Scholar] [CrossRef]
Liu, H.; Bai, Y.; Lu, G.; Shi, Z.; Zhong, Y. Robust tracking control of a quadrotor helicopter. J. Intell. Robot. Syst. 2014, 75, 595–608. [Google Scholar] [CrossRef]
Antonelli, G.; Cataldi, E.; Arrichiello, F.; Robuffo Giordano, P.; Chiaverini, S.; Franchi, A. Adaptive Trajectory Tracking for Quadrotor MAVs in Presence of Parameter Uncertainties and External Disturbances. IEEE Trans. Control. Syst. Technol. 2018, 26, 248–254. [Google Scholar] [CrossRef]
Hou, Z.; Yu, X.; Lu, P. Terminal Sliding Mode Control for Quadrotors with Chattering Reduction and Disturbances Estimator: Theory and Application. J. Intell. Robot. Syst. 2022, 105, 1–21. [Google Scholar] [CrossRef]
Fridman, L. Recent Achievement and Perspective Directions in Sliding Mode Control. In Proceedings of the 22st IFAC World Congress, Yokohama, Japan, 9–14 July 2023; pp. 1–5. [Google Scholar]
Kamel, M.; Alonso-Mora, J.; Siegwart, R.; Nieto, J. Robust collision avoidance for multiple micro aerial vehicles using nonlinear model predictive control. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 236–243. [Google Scholar]
Pereira, J.C.; Leite, V.J.S.; Raffo, G.V. An ellipsoidal-polytopic based approach for aggressive navigation using nonlinear model predictive control. In Proceedings of the 2021 International Conference on Unmanned Aircraft Systems (ICUAS), Athens, Greece, 15–18 June 2021; pp. 827–835. [Google Scholar]
Bouzid, Y.; Bestaoui, Y.; Siguerdidjane, H. Quadrotor-UAV optimal coverage path planning in cluttered environment with a limited onboard energy. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 979–984. [Google Scholar]
Fiorini, P.; Shiller, Z. Motion planning in dynamic environments using velocity obstacles. Int. J. Robot. Res. 1998, 17, 760–772. [Google Scholar] [CrossRef]
Bareiss, D.; Van Den Berg, J. Reciprocal collision avoidance for robots with linear dynamics using LQR-Obstacles. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, German, 6–10 May 2013; pp. 3847–3853. [Google Scholar]
Ricardo, J.A., Jr.; Santos, D.A. Robust Collision Avoidance for Mobile Robots in the Presence of Moving Obstacles. IEEE Control Syst. Lett. 2023, 7, 1584–1589. [Google Scholar] [CrossRef]
Van Den Berg, J.; Snape, J.; Guy, S.J.; Manocha, D. Reciprocal collision avoidance with acceleration-velocity obstacles. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 3475–3482. [Google Scholar]
Rufli, M.; Alonso-Mora, J.; Siegwart, R. Reciprocal Collision Avoidance With Motion Continuity Constraints. IEEE Trans. Robot. 2013, 29, 899–912. [Google Scholar] [CrossRef]
Levant, A. Higher-order sliding modes, differentiation and output-feedback control. Int. J. Control 2003, 76, 924–941. [Google Scholar] [CrossRef]
Levant, A. Sliding order and sliding accuracy in sliding mode control. Int. J. Control 1993, 58, 1247–1263. [Google Scholar] [CrossRef]
Markley, F.L.; Crassidis, J.L. Fundamentals of Spacecraft Attitude Determination and Control; Space Technology Library; Springer: New York, NY, USA, 2014. [Google Scholar]
Goldstein, H. Classical Mechanics; Addison-Wesley: Boston, MA, USA, 1980. [Google Scholar]
Bezerra, J.A.; Santos, D.A. Optimal Exact Control Allocation for Under-Actuated Multirotor Aerial Vehicles. IEEE Control Syst. Lett. 2022, 6, 1448–1453. [Google Scholar] [CrossRef]
Sakurama, K.; Verriest, E.I.; Egerstedt, M. Effects of insufficient time-scale separation in cascaded, networked systems. In Proceedings of the 2015 American Control Conference (ACC), Chicago, IL, USA, 1–3 July 2015; pp. 4683–4688. [Google Scholar]
Bhat, S.P.; Bernstein, D.S. Finite-Time Stability of Continuous Autonomous Systems. SIAM J. Control. Optim. 2000, 38, 751–766. [Google Scholar] [CrossRef]
Ricardo, J.A., Jr.; Santos, D.A. Attitude Tracking Control for a Quadrotor Aerial Robot Using Adaptive Sliding Modes. In Proceedings of the XLI Ibero-Latin-American Congress on Computational Methods in Engineering, Foz do Iguaçu, Brazil, 16–19 November 2020. [Google Scholar]
Escareño, J.; Salazar, S.; Romero, H.; Lozano, R. Trajectory control of a quadrotor subject to 2D wind disturbances. J. Intell. Robot. Syst. 2013, 70, 51–63. [Google Scholar] [CrossRef]
Ducard, G.; Hua, M.D. Discussion and practical aspects on control allocation for a multi-rotor helicopter. Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2011, 38, 95–100. [Google Scholar] [CrossRef]
Sinha, A.; Malo, P.; Deb, K. A Review on Bilevel Optimization: From Classical to Evolutionary Approaches and Applications. IEEE Trans. Evol. Comput. 2018, 22, 276–295. [Google Scholar] [CrossRef]
Van Den Berg, J.; Lin, M.; Manocha, D. Reciprocal Velocity Obstacles for Real-Time Multi-Agent Navigation. IEEE Trans. Robot. 2007, 23, 834. [Google Scholar] [CrossRef]
O’Rourke, J.; Badler, N. Decomposition of three-dimensional objects into spheres. IEEE Trans. Pattern Anal. Mach. Intell. 1979, 3, 295–305. [Google Scholar] [CrossRef]

Figure 1. A typical hierarchical guidance and control architecture for MAVs.

Figure 2. The adopted CCSs and a general underactuated MAV equipped with

n_{r}

fixed rotors parallel to

{\vec{z}}_{b}

.

Figure 3. Hierarchical control architecture for underactuated MAVs equipped with fixed rotors. ACG stands for attitude command generator.

Figure 4. Block diagram of the proposed guidance strategy.

Figure 5. Schematic illustration of the proposed Monte Carlo simulation scenario. (a) Overview of the scenario showing the MAV, its target position, and the groups of obstacles. (b) The kth group of obstacles in detail, where

k \in {1, \dots, 30}

.

Figure 6. Constraint plots for the proposed method. (a) Distance from the quadcopter to the obstacles. (b) Euclidean norm of the quadcopter velocity and its bound. (c) Rotors’ thrusts and their bounds.

Figure 7. Position tracking performance, attitude tracking performance, and control commands for the proposed method. (a) Position tracking performance. (b) Attitude tracking performance. (c) Control force command. (d) Control torque command. Legend: ― vector first component, ― vector second component, and ― vector third component.

Figure 8. Euclidean norm of the attitude sliding variable

s

.

Table 1. Parameters of the proposed method.

Control Law	Variable	Value
Attitude control law	$K_{1}$	diag(0.5, 0.5, 0.25)
	$C$	$5 I_{3}$
	h	2
Position control law	$K_{2}$	$2.5 I_{3}$
	${\bar{K}}_{3}$	$10 I_{3}$
	$t_{s}$	0.2 s
	$Λ_{0}^{p}$	$4 I_{3}$
	$Λ_{1}^{p}$	$2 I_{3}$
	$Λ_{2}^{p}$	$I_{3}$
	$γ_{p}$	$I_{3}$
Guidance method	$τ_{p}$	$0.15$ s
	$ϵ^{r}$	$0.06$ m
	$ϵ^{v}$	$0.1$ m/s
	$τ$	5 s
	n	2
	$Λ_{0}$	$4 I_{3}$
	$Λ_{1}$	$3 I_{3}$
	$Λ_{2}$	2 $I_{3}$
	$γ$	$1.5 I_{3}$
	$τ_{ψ}$	$0.2$ s

Table 2. Monte Carlo simulation results for the proposed and original CCO methods. N/A: not applicable.

	Proposed	Original CCO
$C R$	25%	50%
$A C I$	2.16	1.74
Respected velocity constraint	100%	N/A
Respected rotor thrust constraint	100%	N/A

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Robust Collision-Free Guidance and Control for Underactuated Multirotor Aerial Vehicles

Abstract

1. Introduction

1.1. Notation

2. Problem Definition

2.1. MAV Dynamic Modeling

2.2. Problem Statement

3. Control Design

3.1. Hierarchical Flight Control Architecture

3.2. Integral Sliding Mode Attitude Control Law

Attitude Command Generator

3.3. Position Control Law

3.4. Control Allocation

4. Guidance Design

5. Numerical Simulation

5.1. Monte Carlo Simulation

5.2. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

References

Article Metrics

Citations

Article Access Statistics