Development of a Controlled Dynamics Simulator for Reusable Launcher Descent and Precise Landing

Alice De Oliveira; Michèle Lavagna

doi:10.3390/aerospace10120993


Department of Aerospace Science & Technology, Politecnico di Milano, Via La Masa 34, 20156 Milan, Italy

* Author to whom correspondence should be addressed.

Aerospace 2023, 10(12), 993; https://doi.org/10.3390/aerospace10120993

This article belongs to the Section Astronautics & Space Science


Abstract

This paper introduces a Reusable Launch Vehicle (RLV) descent dynamics simulator coupled with closed-loop guidance and control (G&C) integration. The studied vehicle’s first-stage booster, evolving in the terrestrial atmosphere, is steered by a Thrust Vector Control (TVC) system and planar fins through gain-scheduled Proportional–Integral–Derivative controllers, correcting the trajectory deviations until precise landing from the reference profile computed in real time by a successive convex optimisation algorithm. Environmental and aerodynamic models that reproduce realistic atmospheric conditions are integrated into the simulator for enhanced assessment. Comparative performance results were achieved in terms of control configuration (TVC-only, fins-only, and both) for nominal conditions as well as with external disturbances such as wind gusts or multiple uncertainties through a Monte Carlo analysis to assess the G&C system. These studies demonstrated that the configuration combining TVC and steerable planar fins has sufficient control authority to provide stable flight and adequate uncertainties and disturbance rejection. The developed simulator provides a preliminary assessment of G&C techniques for the RLV descent and landing phase, along with examining the interactions that occur. In particular, it paves the way towards the development and assessment of more advanced and robust algorithms.

Keywords:

RLV; G&C; aerodynamic and powered descent; precise landing; re-entry dynamics; successive convex optimisation; gain-scheduled PID controllers; TVC; aerodynamic steering

1. Introduction

Over the last decade, launcher reusability has become the new paradigm for reducing the cost of access to space and enabling future manned missions, such as a return to the Moon or, even more ambitiously, the first steps on Mars. This technology was already developed in the Space Shuttle era; however, unanticipated costs and risks led to the cancellation of the programme in 2011. Nevertheless, some years ago, private companies, such as SpaceX and Blue Origin, completely disrupted the space sector and demonstrated the cost effectiveness and technical feasibility of reusable rockets. More specifically, SpaceX’s Falcon 9 became in 2017 the first Vertical Take-Off Vertical Landing (VTVL) vehicle, having its first stage recovered after launch and reused for another mission, and then became in 2020 the first private rocket to take astronauts to the International Space Station thanks to its spacecraft Dragon [1]. Today, SpaceX has flown reusable boosters more than 100 times, with some single boosters reused more than 10 times, proving the feasibility and economic sustainability of such a technology. This leading company is now successfully testing its Super Heavy rocket equipped with the Starship spacecraft with the objective of carrying both crew and cargo on long-duration interplanetary flights, achieving humanity’s return to the Moon, and travelling to Mars and beyond. Meanwhile, Blue Origin is also developing advanced reusable launchers such as New Shepard, a suborbital launch vehicle designed for space tourism, and New Glenn, a heavy-lift reusable rocket that should be able to carry heavy payloads to Earth’s orbit and beyond [2]. Consequently, national agencies and intergovernmental institutions are following the same path, increasing research and development related to launcher reusability.

The descent and precision soft-landing of Reusable Launch Vehicles (RLVs) on Earth are very challenging, mainly due to the presence of the atmosphere. Indeed, during this phase, the vehicle is subjected to fast system dynamics changes induced by external loads such as lift and drag, unpredictable wind gusts, and control-induced actuation commands to comply with the landing requirements, allowing so-called pinpoint landing while preserving the vehicle’s integrity. All of these factors involve uncertainties and nonlinearities, which lead to vehicle instability and therefore justify the implementation of a high-performance guidance, navigation, and control (GNC) system. A solution to this demanding problem became feasible in the past decade with the development of convex optimisation: a particular class of methods that allow one to compute, in real time and based on the current flight conditions, optimal trajectories to be followed satisfying the desired constraints (which must be convex). This technology was demonstrated by the Masten Space Systems’ VTVL demonstrator Xombie, which used a vision-based system and a fuel-optimal convex guidance algorithm for precision landing [3].

Research on convex optimisation for the entry, descent, and soft pinpoint landing of VTVL reusable launchers has actively been carried out in recent years with the development of advanced techniques such as successive convex optimisation [4] and pseudospectral convex optimisation [5,6]. In Ref. [7], Liu extended this first method by combining aerodynamic forces and propulsion as control inputs to gain optimality with the consideration of vehicle aerodynamics, which had previously been ignored. Then, in Ref. [8], Sagliano et al. combined both methods and proposed separating the aerodynamic descent and powered landing into two different optimal control problems, using aerodynamic forces as the control input for the first phase and a combination of aerodynamic and propulsive control for the second phase. Finally, in Ref. [9], Simplício et al. solved a simplified optimal control problem in a first step and passed the solution to a second step involving successive convex optimisation to include aerodynamic effects.

The coupled flight mechanics involved in the reusable launcher descent and landing (D&L) phase are in fact usually not considered in the design of optimal guidance algorithms. The disturbances and uncertainties acting on the vehicle and arising from the nonlinear dynamics; external events (e.g., wind and aerodynamics); the actuation system; and the environment are counteracted by a properly designed robust control system. Classic techniques involve the use of linear control theory based on linearising the equations of motion and feedback of defined control parameters with gain scheduling [10]. However, these techniques require an extensive verification and validation campaign with Monte Carlo analyses, which render the process very time-consuming and costly. Lately, advanced robust control methods have been studied in both academia and industry, such as the Linear Parameter-Varying (LPV) approach [11] and the

H_{\infty}

family of methods, specifically the structured

H_{\infty}

technique [12].

The steering of a VTVL reusable rocket during the D&L phase is generally achieved by a Thrust Vector Control (TVC) system, which actuates by deflecting the engine nozzle along the two body axes perpendicular to the vehicle’s longitudinal axis through specific gimbal angles computed using the guidance and control (G&C) algorithms. To increase the control authority of the RLV, especially at low thrust during aerodynamic descent, steerable fins are crucial. They are typically placed above the vehicle’s centre of pressure, with one pair usually applied for controlling the pitch motion and another pair for controlling the yaw motion. Finally, a Reaction Control System (RCS) based on cold gas thrusters is often added for use at a high altitude in low-dynamic-pressure conditions or to provide roll control capabilities.

To understand the interactions between G&C and D&L flight mechanics, an RLV controlled dynamics simulator is proposed herein. This could serve as a baseline for the design and analysis of more advanced G&C methods for the D&L phase of reusable launchers. It covers the descent and soft pinpoint landing of a VTVL vehicle first-stage booster with closed-loop guidance and control integration. It includes the six-degrees-of-freedom (6-DoF) descent dynamics of a rigid-body model with a varying mass, evolving in the terrestrial atmosphere with varying environmental parameters, uncertainties, and disturbances (atmospheric density, ambient pressure, and wind) and subjected to external forces (gravity and aerodynamics). The steering of the spacecraft is carried out by a TVC system and planar fins, correcting the trajectory deviations with respect to the reference profile. The G&C system consists of a successive convex optimisation guidance algorithm updated several times during the flight and a control system composed of gain-scheduled Proportional–Integral–Derivative (PID) controllers. The main contributions of the proposed work can be summarised as follows:

The development of a 6-DoF RLV controlled dynamics simulator with closed-loop guidance and control integration for the descent and precise landing phase. This tool allows one to assess G&C methods for realistic scenarios, more specifically with respect to environmental models (aerodynamics, wind, and atmospheric parameters) and the actuation system (TVC and steerable planar fins). Moreover, it has a modular architecture and therefore can be easily modified to integrate more complex models (e.g., propulsion and aerodynamics). To the best of the authors’ knowledge, such a simulator is not publicly available and therefore provides the opportunity to understand the challenges involved in designing G&C algorithms for reusable launcher descent and precise landing and perform preliminary assessments of multiple recovery strategies.
The implementation and assessment of a successive convex optimisation guidance algorithm that solves the 6-DoF equations of motion for the powered descent and pinpoint landing problem.
The generation of corrections using classical linear feedback control through gain-scheduled PID controllers. Then, commands are allocated between the TVC system and the steerable planar fins according to the level of thrust. This feature also allows a certain modularity for studying different actuation configurations according to the mission requirements (e.g., propellant consumption) and the flight phase: TVC-only, planar fins-only, or both.

The paper is organised as follows. Section 2 introduces the reusable launcher controlled dynamics simulator with a description of all the building blocks: from the reference frames, environmental and aerodynamic models, and vehicle dynamics to the definition of the different actuation systems. Then, the successive convex optimisation guidance algorithm is introduced in Section 3. In addition, Section 4 presents the preliminary control method using classic linear control theory with gain-scheduled PID controllers and explains how the command is then allocated to the TVC system and/or the steerable planar fins. Subsequently, several simulations are performed in Section 5 with different actuation configurations. A sensitivity analysis is also carried out, adding wind and dispersion to several parameters in order to study their impact on the D&L performance and better address them for future developments in advanced G&C methods. Finally, conclusions are provided in Section 6.

2. Reusable Launcher Controlled Dynamics Modelling

The RLV controlled dynamics simulator developed in this paper relies on the nonlinear 6-DoF dynamics of a VTVL vehicle first-stage booster modelled as a rigid body with a varying mass subjected to external forces induced by the terrestrial atmosphere and controlled through embedded closed-loop guidance and control strategies. Therefore, it is made up of several building blocks with interconnections. A description of the developed architecture is provided in Figure 1. The elements were implemented through MATLAB/Simulink R2021b and will be briefly presented in the following subsections. A performance analysis of the simulator described below with a simplified aerodynamic model and TVC actuation only was carried out in Ref. [13].

Figure 1. 6-DoF RLV re-entry controlled dynamics simulator description.

The reference frames and environmental models adopted for gravity, atmospheric parameters, and wind are explained in Section 2.1. Then, the equations of motion and the centre of gravity (CG) and inertia estimations are described in Section 2.2. The developed aerodynamic model is presented in Section 2.3. The vehicle is steered via TVC and planar fins depending on their level of control authority. These actuators are introduced in Section 2.4 and Section 2.5, respectively.

Finally, the G&C algorithms are organised into two subsystems. First, “D&L Guidance” is responsible for the real-time generation of the reference control values, here in terms of thrust magnitude and attitude angles. Note that this feature is executed at frequency

f_{gui}

, which differs from the simulator time step. A dedicated passage on the development of the guidance algorithm is provided in Section 3. Then, the “Control” subsystem, responsible for the computation of the commands allocated among the aforementioned actuators, is defined in Section 4.

2.1. Reference Frames and Environmental Models

This subsection describes the reference frames and environmental models that are adopted in the RLV controlled dynamics simulator. They are essential to simulating the re-entry of a reusable rocket into the terrestrial atmosphere.

Two reference frames are considered and are shown in Figure 2. The first is the landing-site-centred reference frame. Its origin is at the landing site and it is an up–east–north reference frame, such that the

x_{I}

-axis points up, the

y_{I}

-axis east, and the

z_{I}

-axis north. This reference frame is considered inertial, and the equations of motion refer to it. Simulations start from an initial position in this reference frame

r_{I} (0)

, with an initial velocity

v_{I} (0)

. The second reference frame is the vehicle’s body-fixed reference frame. This is fixed to the vehicle’s CG, and the basis vectors can be defined as follows: the

x_{B}

-axis lies along the vehicle’s longitudinal axis, the

y_{B}

-axis is defined so as to remain perpendicular to the pitch plane, and the

z_{B}

-axis completes the right-handed system (and thus remains perpendicular to the yaw plane). Following these definitions, the roll, pitch, and yaw angles (

ϕ (t)

,

θ (t)

, and

ψ (t)

, respectively) represent the orientation of the body-fixed reference frame with respect to the landing-site-centred inertial reference frame. These angles are useful for controlling the vehicle trajectory. However, in the formulation of the equations of motion, the rotation quaternion

q_{B}^{I} (t)

is used to translate the attitude of the vehicle. Therefore,

R_{B}^{I} (t)

represents the rotation matrix from the inertial reference frame to the vehicle’s body-fixed reference frame. The angular velocity is defined in the body-fixed reference frame with an initial value

ω_{B} (0)

.

Figure 2. Reference frames.

The atmosphere model adopted in this study, available in the MATLAB Aerospace Toolbox [14], implements the mathematical representation of the 1976 Committee on Extension to the Standard Atmosphere (COESA) [15], which provides, as a function of altitude

h (t)

, the atmospheric density

ρ (h (t))

, the speed of sound

a (h (t))

, and the ambient atmospheric pressure

P_{amb}(h(t))

. Then, the gravitational field is defined in the inertial frame by

g_{I}(h(t)) = \begin{bmatrix} -g(h(t)) & 0 & 0 \end{bmatrix}^{T}

, where

g (h (t))

is obtained as a function of the altitude and expressed by

g (h (t)) = g_{0} {(\frac{R_{E}}{R_{E} + h (t)})}^{2}

(1)

Here,

g_{0} \approx 9.81 m / s^{2}

is the standard gravity of Earth, and

R_{E} = 6378 km

is the radius of the Earth. For conciseness, these values will now be written as a function of time t.
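As a quick illustration of Equation (1), the following Python sketch evaluates the altitude-dependent gravity magnitude with the constants quoted above. The function name `gravity_magnitude` is hypothetical and is not taken from the simulator, which is implemented in MATLAB/Simulink.

```python
G0 = 9.81         # standard gravity g_0, m/s^2 (value quoted in the text)
R_EARTH = 6378e3  # Earth radius R_E, m (6378 km in the text)


def gravity_magnitude(altitude_m):
    """Gravitational acceleration g(h) of Equation (1), in m/s^2.

    altitude_m is the altitude h above the landing site, in metres.
    """
    return G0 * (R_EARTH / (R_EARTH + altitude_m)) ** 2
```

Calling `gravity_magnitude(50e3)` returns a value slightly below g_0, which shows how weak the altitude dependence is over a typical first-stage descent.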

Finally, the constant wind is computed with the US Naval Research Laboratory model Horizontal Wind Model 14, also available in Ref. [14], which generates the meridional

w_{mer}(t)

and zonal

w_{zon}(t)

components of the wind for a set of geophysical data. Wind gusts are modelled as a cosine-shaped function, so the user can define the amplitude of the gust and the altitude at which it occurs. The function is expressed as follows:

V_{gust}(h(t)) = \frac{A_{gust}}{2} \left(1 - \cos\left(\frac{π (h(t) - h_{1})}{0.5 \, Δh}\right)\right)

(2)

where

A_{gust} \in \mathbb{R}^{3}

specifies the amplitude of the gust in three directions,

h (t)

is the current altitude of the spacecraft,

h_{1}

specifies the altitude at which the gust starts, and

Δ h

is the altitude range in which the gust is applied. Therefore, the maximum intensity of the gust is reached in the middle of the specified altitude region. Consequently, the wind vector is written in the inertial reference frame as follows:

w_{I}(t) = \begin{bmatrix} 0 & w_{mer}(t) & w_{zon}(t) \end{bmatrix}^{T} + V_{gust}(h(t)).

(3)

Note that the wind model is not considered in the descent dynamics of the guidance algorithm described in Section 3.
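The gust profile of Equation (2) and the wind vector of Equation (3) can be sketched in Python as follows. This is a minimal illustration, not the simulator's implementation: the function names are hypothetical, and setting the gust to zero outside the altitude band between h_1 and h_1 + Δh is an assumption inferred from the statement that the gust is applied over that range.

```python
import math


def gust_velocity(h, amplitude, h_start, delta_h):
    """Cosine-shaped gust of Equation (2), evaluated per axis.

    amplitude is the 3-component gust amplitude A_gust (m/s), h_start is h_1
    and delta_h is the altitude range of the gust (both in metres).
    Assumption: the gust is zero outside [h_start, h_start + delta_h].
    """
    if not (h_start <= h <= h_start + delta_h):
        return (0.0, 0.0, 0.0)
    shape = 0.5 * (1.0 - math.cos(math.pi * (h - h_start) / (0.5 * delta_h)))
    return tuple(a * shape for a in amplitude)


def wind_inertial(h, w_mer, w_zon, amplitude, h_start, delta_h):
    """Wind vector of Equation (3): constant wind plus gust, inertial frame."""
    gust = gust_velocity(h, amplitude, h_start, delta_h)
    return (gust[0], w_mer + gust[1], w_zon + gust[2])
```

Evaluating `gust_velocity` at the middle of the band returns the full amplitude, which matches the remark that the maximum gust intensity is reached halfway through the specified altitude region.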

2.2. Equations of Motion and CG/Inertia Estimations

The equations of motion are written using the reference frames previously defined in Section 2.1. They are based on

x_{I} (0) = [\begin{matrix} m (0) & r_{I}^{T} (0) & v_{I}^{T} (0) & q_{B}^{I} {(0)}^{T} & ω_{B}^{T} (0) \end{matrix}]

, the initial state vector, and the assumption that the vehicle is a rigid body with no effects induced by the varying mass (e.g., propellant sloshing) and structural flexibility.

The mass depletion dynamics are modelled by an affine function of the thrust magnitude as follows:

\dot{m}(t) = -\frac{\| F_{TVC,I}(t) \|_{2}}{I_{sp} g_{0}} - \frac{A_{nozzle} P_{amb}(t)}{I_{sp} g_{0}}

(4)

where

I_{sp} = 282 \, s

is the vacuum specific impulse of the engine, which is assumed to be constant for simplicity, and

A_{nozzle} = 3.1416 \, m^{2}

is the nozzle exit area of the engine.

F_{TVC,I}(t) \in \mathbb{R}^{3}

is the thrust vector coming from the TVC system, introduced in Section 2.4. The second term is related to the reduction in the specific impulse due to the atmospheric back pressure [4].
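For illustration, Equation (4) can be transcribed into Python as shown below, using the specific impulse and nozzle exit area quoted above. The function name is hypothetical, and the thrust and pressure values passed to it in any usage are the caller's own.

```python
import math

G0 = 9.81          # standard gravity g_0, m/s^2
ISP = 282.0        # vacuum specific impulse I_sp, s (value quoted in the text)
A_NOZZLE = 3.1416  # nozzle exit area A_nozzle, m^2 (value quoted in the text)


def mass_flow_rate(thrust_inertial, p_amb):
    """Mass rate of Equation (4), in kg/s (negative while the engine burns).

    thrust_inertial is the thrust vector F_TVC,I in newtons and p_amb is the
    ambient pressure in pascals.
    """
    thrust_mag = math.sqrt(sum(c * c for c in thrust_inertial))
    return -(thrust_mag + A_NOZZLE * p_amb) / (ISP * G0)
```

Because the back-pressure term does not vanish at zero thrust, a sketch like this would only be evaluated while the engine is burning.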

The translational states, position, and velocity of the vehicle in the inertial reference frame,

r_{I} (t) \in R^{3}

and

v_{I} (t) \in R^{3}

, are governed by the following dynamics:

\begin{aligned} \dot{r}_{I}(t) &= v_{I}(t) \\ \dot{v}_{I}(t) &= \frac{1}{m(t)} \left[ F_{TVC,I}(t) + F_{aero,I}(t) + F_{fins,I}(t) \right] + g_{I}(t) \end{aligned}

(5)

where

F_{aero,I}(t) \in \mathbb{R}^{3}

describes the aerodynamic force acting on the vehicle in the inertial reference frame (Section 2.3), and

F_{fins,I}(t) \in \mathbb{R}^{3}

represents the control force generated by the planar fins (Section 2.5).

Then, the attitude states are governed by the following rotational dynamics, using the quaternion-based kinematics equation:

\begin{aligned} \dot{q}_{B}^{I}(t) &= \frac{1}{2} \begin{bmatrix} q_{4}(t) & -q_{3}(t) & q_{2}(t) \\ q_{3}(t) & q_{4}(t) & -q_{1}(t) \\ -q_{2}(t) & q_{1}(t) & q_{4}(t) \\ -q_{1}(t) & -q_{2}(t) & -q_{3}(t) \end{bmatrix} ω_{B}(t) \\ \dot{ω}_{B}(t) &= J^{-1}(t) \left[ M_{TVC,B}(t) + M_{aero,B}(t) + M_{fins,B}(t) - ω_{B}(t) \times J(t) ω_{B}(t) \right] \end{aligned}

(6)

where

J (t)

is the inertia matrix of the vehicle, introduced below.

M_{aero,B}(t) \in \mathbb{R}^{3}

,

M_{TVC,B}(t) \in \mathbb{R}^{3}

, and

M_{fins,B}(t) \in \mathbb{R}^{3}

(Section 2.3, Section 2.4 and Section 2.5) represent the aerodynamic and control torques acting on the vehicle. In Equation (6), the coupling between angular velocity and inertia along the three axes and the effect of centroid movement on the inertia caused by mass consumption are ignored.
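The rotational dynamics of Equation (6) can be sketched in Python as follows. This is a minimal transcription of the equation as written, with a scalar-last quaternion and the diagonal inertia tensor introduced in this subsection; the function names are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def quaternion_rate(q, omega_b):
    """Quaternion kinematics of Equation (6); q = (q1, q2, q3, q4), scalar last."""
    q1, q2, q3, q4 = q
    wx, wy, wz = omega_b
    return (0.5 * (q4 * wx - q3 * wy + q2 * wz),
            0.5 * (q3 * wx + q4 * wy - q1 * wz),
            0.5 * (-q2 * wx + q1 * wy + q4 * wz),
            0.5 * (-q1 * wx - q2 * wy - q3 * wz))


def angular_acceleration(j_diag, omega_b, moment_b):
    """Angular acceleration of Equation (6) for a diagonal inertia tensor.

    j_diag holds the diagonal entries (J_A, J_N, J_N) and moment_b is the sum
    of the TVC, aerodynamic and fin moments in the body frame.
    """
    j_omega = tuple(j * w for j, w in zip(j_diag, omega_b))
    gyro = cross(omega_b, j_omega)
    return tuple((m - g) / j for m, g, j in zip(moment_b, gyro, j_diag))
```

A useful sanity check on the kinematics is that the quaternion rate is orthogonal to the quaternion itself, so the unit norm is preserved up to integration error.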

Finally, because of the propellant mass and the level variations throughout the flight, the total vehicle CG and the moments of inertia also vary. The CG is considered to lie along the vehicle body’s longitudinal axis, i.e.,

x_{CG}(t) = \begin{bmatrix} x_{CG}(t) & 0 & 0 \end{bmatrix}^{T}

, while the inertia tensor is assumed to be diagonal, i.e.,

J (t) = diag ([\begin{matrix} J_{A} (t) & J_{N} (t) & J_{N} (t) \end{matrix}])

. Following the model and data available in Ref. [16], the vehicle’s mass is broken down into structural mass and time-dependent propellant mass, which is updated via Equation (4) during engine burn. The reader is referred to Ref. [16] for details of the parameters defining the inertial and CG properties and their numerical values.

2.3. Aerodynamic Model

The aerodynamic forces and moments generated by the vehicle depend on its structure, as well as the instantaneous dynamic pressure. This atmospheric parameter is usually given by

Q (t) = \frac{1}{2} ρ (t) V^{2} (t)

(7)

where

V(t) = \| v_{air,I}(t) \|_{2}

is the airspeed and

v_{air,I}(t) = v_{I}(t) - w_{I}(t)

is the air-relative velocity vector written in the inertial reference frame, which accounts for the wind

w_{I} (t)

.

For the computation of aerodynamic loads, it is common to define a velocity reference frame that is fixed to the vehicle’s CG but directed along the air-relative velocity written in the body-fixed reference frame

v_{air,B}(t)

. This reference frame enables the definition of the two aerodynamic angles, the angle of attack

α (t)

and the sideslip angle

β (t)

, in order to illustrate the rotation from the body-fixed to the velocity reference frame

R_{V}^{B} (t)

, as follows:

R_{V}^{B}(t) = \begin{bmatrix} \cos α(t) \cos β(t) & \sin β(t) & \sin α(t) \cos β(t) \\ -\cos α(t) \sin β(t) & \cos β(t) & -\sin α(t) \sin β(t) \\ -\sin α(t) & 0 & \cos α(t) \end{bmatrix}

(8)

where the aerodynamic angles are given by

\begin{aligned} α(t) &= \operatorname{atan2}(v_{air,B,z}(t), v_{air,B,x}(t)) \\ β(t) &= \arcsin\left(\frac{v_{air,B,y}(t)}{V(t)}\right). \end{aligned}

(9)
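The dynamic pressure of Equation (7) and the aerodynamic angles of Equation (9) can be computed from the body-frame air-relative velocity as in the Python sketch below. The function name is hypothetical, and returning zero angles at zero airspeed is an assumption made here only to keep the sketch well defined at touchdown.

```python
import math


def air_data(v_air_body, rho):
    """Airspeed, dynamic pressure, angle of attack and sideslip angle.

    v_air_body is the air-relative velocity in the body-fixed frame (m/s) and
    rho is the atmospheric density (kg/m^3). Angles are returned in radians.
    """
    vx, vy, vz = v_air_body
    speed = math.sqrt(vx * vx + vy * vy + vz * vz)
    q_dyn = 0.5 * rho * speed ** 2                      # Equation (7)
    if speed == 0.0:
        # Assumption: the angles are undefined at rest, so zero is returned.
        return speed, q_dyn, 0.0, 0.0
    alpha = math.atan2(vz, vx)                          # Equation (9)
    beta = math.asin(max(-1.0, min(1.0, vy / speed)))   # clamped against round-off
    return speed, q_dyn, alpha, beta
```

For a booster descending tail-first, the air-relative velocity points along the negative body x-axis, and the sketch returns an angle of attack of π.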

With these definitions and assuming that the vehicle has an axisymmetric shape, the aerodynamic forces and moments generated by the vehicle are expressed in the body-fixed reference frame as

\begin{aligned} F_{aero,B}(t) &= -Q(t) S_{ref} R_{B}^{V}(t) \begin{bmatrix} C_{D}(α_{eff}(t), M(t)) \\ 0 \\ C_{L}(α_{eff}(t), M(t)) \end{bmatrix} \\ M_{aero,B}(t) &= [x_{CP}(t) - x_{CG}(t)] \times F_{aero,B}(t) \end{aligned}

(10)

where

S_{ref} = 7.14 \, m^{2}

is the vehicle reference area;

x_{CP}(t) = \begin{bmatrix} x_{CP}(t) & 0 & 0 \end{bmatrix}^{T}

is the vehicle’s centre of pressure (CP); and

\{C_{D}, C_{L}\}

are the drag and lift coefficients, respectively. These parameters are estimated as functions of the effective angle of attack

α_{eff}(t) = \sqrt{α^{2}(t) + β^{2}(t)}

and the Mach number

M (t) = V (t) / a (t)

, where

a (t)

is the speed of sound, also obtained from COESA as a function of altitude.

Aerodynamic parameters are obtained using the Supersonic/Hypersonic Arbitrary-Body Program (S/HABP) for a cylindrical-shape first-stage rocket, with an angle of attack from 0 to 180 deg and a Mach number from 0.8 to 5. This programme, which was developed in 1973 by the United States Air Force Flight Dynamics Laboratory [17] and used by the National Aeronautics and Space Administration, has been adapted to obtain an aerodynamic database composed of the aerodynamic coefficients and the CP as function of the Mach number and the aerodynamic angles. More details on the development of the aerodynamic database and its validation are given in Ref. [18]. These coefficients are then linearly interpolated in the simulator according to the current flight conditions. The variation of

C_{D}

,

C_{L}

and

x_{CP}

with respect to

α_{eff}(t)

and

M (t)

is illustrated in Figure 3.

Figure 3. Aerodynamic coefficient database. Note that the values of

x_{CP}

are found to be independent of the Mach number M.
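The linear interpolation of the aerodynamic database with respect to the effective angle of attack and the Mach number can be sketched in Python as follows. The grid and coefficient values below are placeholders chosen for illustration only and are not the S/HABP data; holding the boundary value outside the tabulated range is likewise an assumption of this sketch.

```python
from bisect import bisect_right

# Hypothetical placeholder grid and drag-coefficient table (NOT the S/HABP data).
ALPHA_GRID_DEG = [0.0, 90.0, 180.0]
MACH_GRID = [0.8, 2.0, 5.0]
CD_TABLE = [[0.5, 0.8, 0.6],    # rows: effective angle of attack
            [8.0, 10.0, 9.0],   # columns: Mach number
            [0.9, 1.4, 1.1]]


def _bracket(grid, x):
    """Return index i and weight t so that x = grid[i] + t * (grid[i+1] - grid[i]).

    Assumption: x is clamped to the grid limits (boundary value is held).
    """
    x = min(max(x, grid[0]), grid[-1])
    i = min(bisect_right(grid, x) - 1, len(grid) - 2)
    return i, (x - grid[i]) / (grid[i + 1] - grid[i])


def interpolate_coefficient(table, alpha_eff_deg, mach):
    """Bilinear interpolation of a coefficient table in (alpha_eff, Mach)."""
    i, ta = _bracket(ALPHA_GRID_DEG, alpha_eff_deg)
    j, tm = _bracket(MACH_GRID, mach)
    low = table[i][j] * (1.0 - tm) + table[i][j + 1] * tm
    high = table[i + 1][j] * (1.0 - tm) + table[i + 1][j + 1] * tm
    return low * (1.0 - ta) + high * ta
```

With this clamping choice, any query below Mach 0.8 returns the Mach 0.8 value, which is one simple way to handle the final part of the descent that lies outside the tabulated Mach range.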

Note that this aerodynamic database has some limitations. In fact, S/HABP was designed to operate from about Mach 2 to the hypersonic range [19]. However, for the RLV descent phase, and particularly for this study, the Mach number range starts around Mach 5 and then drops below Mach 1 until reaching zero velocity at landing. In addition, the aerodynamic coefficients are assumed to be independent of the thrust level. This approximation is very rough for retro-propulsive flight, where there are significant interactions between the exhaust plume of the engine and the oncoming flow that substantially impact the drag coefficient and the heat loads [20]. Therefore, the approximations obtained for the aerodynamic coefficients might diverge from the true values [18]. However, the goal of this simulator is not to gather high-fidelity models but to study the interactions and challenges that exist in the design of an RLV controlled dynamics simulator and assess the advanced and robust G&C methods that must be developed accordingly.

2.4. TVC System

The trajectory of the vehicle during descent is controlled by adjusting the magnitude and direction of the thrust vector generated by the main engine. This is achieved by the TVC actuator deflecting the engine nozzle by

β_{TVC,y}(t)

and

β_{TVC,z}(t)

, respectively, along the

y_{B}

-axis and

z_{B}

-axis. The required thrust magnitude

T_{ref}(t)

and deflection angles

\{β_{TVC,y}(t), β_{TVC,z}(t)\}

are obtained from the guidance algorithm (Section 3) and the control method (Section 4), respectively. Decoupling between translational and rotational dynamics is common for TVC control due to the fact that the attitude of the vehicle can change faster than its trajectory [16]. Thus, the TVC-generated force and moment can be expressed in the body-fixed frame by

\begin{aligned} F_{TVC,B}(t) &= T_{ref}(t) \begin{bmatrix} \cos(β_{TVC,y}(t)) \cos(β_{TVC,z}(t)) \\ \cos(β_{TVC,y}(t)) \sin(β_{TVC,z}(t)) \\ -\sin(β_{TVC,y}(t)) \end{bmatrix} \\ M_{TVC,B}(t) &= [x_{PVP} - x_{CG}(t)] \times F_{TVC,B}(t) \end{aligned}

(11)

where

x_{PVP} = \begin{bmatrix} x_{PVP} & 0 & 0 \end{bmatrix}^{T}

is the TVC pivot position (

x_{PVP} = 0.96 \, m

).
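Equation (11) can be illustrated with the short Python sketch below, which returns the TVC force and moment in the body-fixed frame. The function name is hypothetical, and the CG position passed by the caller is an illustrative input since its numerical values are given in Ref. [16] rather than here.

```python
import math

X_PVP = 0.96  # TVC pivot position along the body x-axis, m (value quoted in the text)


def tvc_force_moment(thrust, beta_y, beta_z, x_cg):
    """TVC force and moment of Equation (11) in the body-fixed frame.

    thrust is the reference thrust magnitude (N), beta_y and beta_z are the
    nozzle deflections (rad) and x_cg is the CG position along the body
    x-axis (m), measured from the same origin as X_PVP.
    """
    force = (thrust * math.cos(beta_y) * math.cos(beta_z),
             thrust * math.cos(beta_y) * math.sin(beta_z),
             -thrust * math.sin(beta_y))
    arm = X_PVP - x_cg  # lever arm along x only, since pivot and CG lie on the axis
    # Cross product (arm, 0, 0) x force written out component by component.
    moment = (0.0, -arm * force[2], arm * force[1])
    return force, moment
```

With zero deflection the thrust acts along the longitudinal axis and produces no moment, so any attitude correction has to come from a non-zero gimbal angle.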

2.5. Steerable Planar Fins Model

The implementation of planar fins for a G&C strategy has already been studied in the literature. Usually, two pairs of fins are placed above the vehicle’s CG: one pair, with deflections

\{β_{fin,1}(t), β_{fin,2}(t)\}

, controls the motion in the pitch plane, while the other, with

\{β_{fin,3}(t), β_{fin,4}(t)\}

, controls the motion in the yaw plane. Therefore, it is considered that there is no roll perturbation, meaning that the two pairs always remain in the trajectory yaw and pitch planes, respectively. In Ref. [8], Sagliano et al. used aerodynamic coefficient lookup tables that directly considered the state of the vehicle (angle of attack

α (t)

, sideslip angle

β (t)

, and Mach number

M (t)

) and fin deflections

\{β_{fin,1}(t), β_{fin,2}(t), β_{fin,3}(t), β_{fin,4}(t)\}

. In Ref. [21], the authors developed a fin model with a corresponding lookup table for the axial coefficient and the derivative of the normal coefficient, depending on only the Mach number. Therefore, the lookup tables were the same for the four fins, and the generated force was determined by the fin’s local angle of attack, defined as a function of the fin deflection and the vehicle’s angle of attack or sideslip angle. Finally, in Ref. [16], Simplício et al. also developed a fin model, but it only considered the normal force, which was calculated as a function of the fin’s local angle of attack. The same approach is used in this paper, and the obtained planar fins model was validated in Ref. [22].

Table 1 defines the fin positions with the corresponding deflections.

Table 1. Position of the fins’ CP with respect to the base of the RLV and corresponding deflections.

Furthermore, due to the reduced fin area compared to the RLV body, only the normal force contribution is considered [16]. Then, the value of the normal coefficient of the fin is estimated using lifting-line theory [23]. In fact, for a symmetric airfoil, the lift coefficient can be approximated by

c_{l} (α (t)) = 2 π α (t) .

(12)

To obtain the lift coefficient

C_{L}

of the corresponding wing, it is necessary to define the aspect ratio, denoted by

AR

and defined as

AR = \frac{b^{2}}{S} = \frac{b}{c}

(13)

where b is the wing span, S is the wing reference area, and c is the wing chord. Therefore, the following approximation is obtained [24]:

C_{L}(α(t)) = \left(\frac{AR}{AR + 2}\right) c_{l}(α(t)).

(14)

This theory is then adapted for the fins of the RLV. Because flow separation is neglected and the angle of attack of the rocket is around

π

during descent, the normal fin coefficient has a sinusoidal dependence on the fin angle of attack

γ_{fin,i}(t)

and can be approximated by

C_{N,fin,i}(γ_{fin,i}(t)) = 2π \left(\frac{AR_{fin}}{AR_{fin} + 2}\right) \sin(γ_{fin,i}(t)), \quad i = \{1, 2, 3, 4\}.

(15)

It remains to define the ith fin’s angle of attack and its associated force

F_{fin,i}(t)

and moment

M_{fin,i}(t)

in the vehicle’s body-fixed reference frame. Figure 4 shows the motion of the vehicle in the pitch plane; from this figure and Ref. [16], it is possible to state the following:

\begin{cases} γ_{fin,i}(t) = β_{fin,i}(t) - α(t) \\ F_{fin,i}(t) = \frac{1}{2} ρ(t) \| v_{air,I}(t) \|_{2}^{2} S_{fin} C_{N,fin,i}(γ_{fin,i}(t)) \begin{bmatrix} -\sin(β_{fin,i}(t)) & 0 & \cos(β_{fin,i}(t)) \end{bmatrix}^{T} \\ M_{fin,i}(t) = [x_{fin,i} - x_{CG}] \times F_{fin,i}(t) \end{cases}, \quad i = \{1, 2\}

(16)

where

α (t)

is the vehicle’s angle of attack, and

S_{fin}

is the fin reference area. Similarly, the following formula is obtained in the yaw plane:

\begin{cases} γ_{fin,i}(t) = -β_{fin,i}(t) - β(t) \\ F_{fin,i}(t) = \frac{1}{2} ρ(t) \| v_{air,I}(t) \|_{2}^{2} S_{fin} C_{N,fin,i}(γ_{fin,i}(t)) \begin{bmatrix} \sin(β_{fin,i}(t)) & \cos(β_{fin,i}(t)) & 0 \end{bmatrix}^{T} \\ M_{fin,i}(t) = [x_{fin,i} - x_{CG}] \times F_{fin,i}(t) \end{cases}, \quad i = \{3, 4\}

(17)

where

β (t)

is the vehicle’s sideslip angle.
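The pitch-plane fin model of Equations (15) and (16) can be sketched in Python as follows. The function names are hypothetical, and the fin aspect ratio, reference area and positions are left as arguments because their values come from Tables 1 and 2; any numbers supplied by the caller are therefore illustrative only.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def fin_normal_coefficient(gamma, aspect_ratio):
    """Normal-force coefficient of one fin, Equation (15); gamma in radians."""
    return 2.0 * math.pi * aspect_ratio / (aspect_ratio + 2.0) * math.sin(gamma)


def pitch_fin_load(beta_fin, alpha, rho, airspeed, s_fin, aspect_ratio, x_fin, x_cg):
    """Force and moment of a pitch-plane fin (i = 1, 2), Equation (16).

    beta_fin is the fin deflection and alpha the vehicle angle of attack (rad);
    x_fin and x_cg are 3-vectors in the body-fixed frame (m).
    """
    gamma = beta_fin - alpha  # local angle of attack of the fin
    magnitude = (0.5 * rho * airspeed ** 2 * s_fin
                 * fin_normal_coefficient(gamma, aspect_ratio))
    force = (-magnitude * math.sin(beta_fin), 0.0, magnitude * math.cos(beta_fin))
    arm = tuple(xf - xc for xf, xc in zip(x_fin, x_cg))
    return force, cross(arm, force)
```

The yaw-plane fins of Equation (17) would follow the same pattern, with the local angle of attack built from the sideslip angle and the force direction rotated into the body x-y plane.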

Figure 4. Fin model.

Finally, the total force generated by the steerable planar fins in the inertial reference frame and the total moment generated in the vehicle’s body-fixed reference frame are given by

F_{fins,I}(t) = R_{I}^{B}(t) \sum_{i=1}^{4} F_{fin,i}(t)

(18)

M_{fins,B}(t) = \sum_{i=1}^{4} M_{fin,i}(t)

(19)

Table 2 specifies the parameters of the planar fins that are implemented in the simulator.

Table 2. Planar fins’ model parameters.

3. Guidance Strategy

For the RLV D&L simulator introduced in the previous section, the guidance algorithm is responsible for the real-time generation of a reference trajectory to be followed by the vehicle with thrust and attitude commands. Here, a direct method is used within the convex optimisation framework. This consists in transforming the fuel-optimal trajectory problem into a convex one—more precisely, into a Second-Order Cone Programming (SOCP) problem, which can be solved with efficient solvers in polynomial time. These challenging tasks rely on converting nonconvex state and control constraints into the convex form, requiring high computational power. Recently, the so-called lossless convexification method [25] and advances in computational development have enabled these issues to be overcome and therefore allow real-time trajectory generation in a closed-loop fashion.

Moreover, a particular class of convex optimisation, successive convex optimisation, can be applied to approximate the remaining nonlinearities in the optimal landing problem, such as the aerodynamic effects, which have previously been ignored. This consists in iteratively solving convex optimisation SOCP subproblems in which the nonconvex dynamics and constraints are repeatedly linearised using information originating from the previous iteration’s solution. This algorithm was first developed by Szmuk et al. in Ref. [4] and then adapted in different ways in Refs. [7,9]. In this paper, the successive convex optimisation algorithm relies on the work achieved by Guadagnini et al. in Ref. [26], where the strategy defined in Ref. [4] was improved to be applicable in a closed-loop fashion for a 6-DoF controlled dynamics simulator.

In this study, the successive convex optimisation guidance algorithm is implemented in MATLAB using the CVX library [27] to formulate the convex problem and the ECOS routine [28] to solve it. At each simulation instance defined by the simulation rate

f_{s i m}

, the reference thrust profile

T_{B, r e f} (t)

and the reference attitude angles

{θ_{r e f} (t), ψ_{r e f} (t)}

are calculated from the most recent guidance solution by linear interpolation. In fact, this solution is stored as an online lookup table, which is updated at each guidance step, with the guidance update frequency

f_{g u i} = 0.1

Hz, that is, every 10 s. The guidance algorithm inside the “D&L Guidance” building block of the simulator (recall Figure 1) is schematised in Figure 5.
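The lookup-table interpolation described above can be sketched as follows. This is a minimal illustration rather than the simulator's MATLAB implementation: the function name `interpolate_reference` and the clamping at the table ends are assumptions made here for the example.

```python
from bisect import bisect_right

def interpolate_reference(t_table, values, t):
    """Linearly interpolate a stored guidance profile at simulation time t.

    t_table: increasing time nodes of the most recent guidance solution.
    values:  reference quantity (e.g. thrust magnitude) at those nodes.
    Clamping outside the table is an assumption of this sketch.
    """
    if t <= t_table[0]:
        return values[0]
    if t >= t_table[-1]:
        return values[-1]
    j = bisect_right(t_table, t) - 1          # node at or just before t
    w = (t - t_table[j]) / (t_table[j + 1] - t_table[j])
    return (1.0 - w) * values[j] + w * values[j + 1]
```

Between two guidance updates the table is left unchanged and only the query time advances at the simulation rate.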

Figure 5. “D&L Guidance” block description.

Before describing the algorithm, a description of the adopted notation is provided. In the following paragraphs and subsections, the discrete time instant is specified with the parameter k. Consequently, a variable a at the time instant k is represented as

a [k]

. Then, since we are handling an iterative process, the considered iterative solution is specified with the superscript i. Therefore, the solution a obtained at iteration i is specified as

a^{i}

. Thus, a variable a at a time instant k, relative to iteration i, is denoted

a^{i} [k]

.

First, the process must be initialised with a guess solution, which does not need to be dynamically consistent. The simplest approach for the state vector is to interpolate the discrete state variables linearly between the initial and final conditions. Regarding the control vector, a good guess for the 6-DoF D&L problem is a thrust that balances the gravitational force at each time step. In this study, the time of flight, which is the final time

t_{f}

, is also an optimisation variable and therefore must be initialised. The initial guess for the state and control vector solutions at each time instant, starting at time

t_{c}

and for the time of flight

t_{f}

, are defined by

\begin{matrix} x^{0} [k] & = \frac{K - k}{K - 1} x (t_{c}) + \frac{k - 1}{K - 1} x (t_{f}), k \in [1, K] \\ u^{0} [k] & = m^{0} [k] \cdot {[\begin{matrix} g_{0} & 0 & 0 \end{matrix}]}^{T}, k \in [1, K - 1] \\ t_{f}^{0} & = 120 s . \end{matrix}

(20)

The algorithm is not particularly sensitive to the initial guess, but a poor guess can increase the convergence time [4].
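As an illustration of Equation (20), the initial guess can be built as in the sketch below. It assumes that the mass is the first component of the state vector (as in the state definition of Section 3.2) and that states are stored as plain lists; the function name and the default value of g_0 are choices made for this example only.

```python
def initial_guess(x_c, x_f, K, g0=9.80665, t_f0=120.0):
    """Initial guess of Equation (20): linear state interpolation,
    gravity-balancing thrust, and a fixed initial time of flight.

    x_c, x_f: current and final state vectors (mass assumed first).
    K:        number of discretisation nodes.
    """
    states = []
    for k in range(1, K + 1):
        a = (K - k) / (K - 1)                  # weight of the current state
        b = (k - 1) / (K - 1)                  # weight of the final state
        states.append([a * xc + b * xf for xc, xf in zip(x_c, x_f)])
    # u0[k] = m0[k] * [g0, 0, 0]^T for k = 1 .. K-1
    controls = [[states[k][0] * g0, 0.0, 0.0] for k in range(K - 1)]
    return states, controls, t_f0
```

The guess is not dynamically consistent: nothing forces the interpolated states to satisfy the equations of motion, which is why virtual controls are needed in the first iterations.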

Once the initial guess is defined, we enter the successive convex optimisation loop, which consists of solving the SOCP problem several times until reaching the user-defined maximum iteration number

i_{m a x}

or the tolerance relative to the trust region radius

Δ_{t o l}

, defined in the next subsection. Note that several exit conditions can be defined, such as a tolerance with respect to the norm of the virtual controls or the norm of the difference in the cost function between two iterations. Those defined here lead to satisfactory results and enable the coupling of the guidance algorithm with the other building blocks of the 6-DoF RLV controlled dynamics simulator, which is the main focus of this paper.

Then, to enable the formulation of the SOCP subproblems, the optimal control problem must be converted into a finite-dimensional parameter optimisation problem. Therefore, the trajectory and optimisation variables are discretised into K uniformly spaced points, ranging from the current instant of time

t_{c}

to the final time

t_{f}

. At each guidance step, the time vector is divided in the following way:

t [k] = \frac{k - 1}{K - 1} t_{f}, k \in [1, K]

(21)

Additionally, because the estimated time of flight

t_{f} \to 0

as

t \to T o F

, where

T o F

is the actual time of flight achieved by the simulation, the accuracy of the discretisation becomes more precise towards the end. More specifically, the sampling time is given by

T_{s} = t_{f} / (K - 1)

. The linearisation and discretisation methods are explained in the next subsection, together with the definition of the SOCP problem.
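The effect of a shrinking time of flight on the grid of Equation (21) can be seen with a short sketch. The value K = 25 used in the usage note is an arbitrary illustrative choice, not a parameter taken from Table 3.

```python
def time_grid(t_f, K):
    """Uniform grid of Equation (21) and the sampling time T_s = t_f / (K - 1)."""
    T_s = t_f / (K - 1)
    return [(k - 1) * T_s for k in range(1, K + 1)], T_s
```

With K = 25, a remaining time of flight of 120 s gives T_s = 5 s, whereas 24 s gives T_s = 1 s: the same number of nodes covers a shorter horizon as the vehicle approaches the landing site.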

When the optimisation algorithm converges to an optimal solution, this reference trajectory is saved to be used for the next iteration, or, if the exit criterion of the successive convex optimisation routine is met, it is transferred to the online look-up table from which the actual reference parameters corresponding to the simulation instance can be generated. In this study, this involves the reference thrust magnitude profile

T_{r e f} (t)

and the reference pitch and yaw angle profiles

θ_{r e f} (t)

and

ψ_{r e f} (t)

, respectively.

3.1. Nonconvex Optimal Control Problem

The guidance law relies on solving an optimal control problem with dynamic constraints. These involve the descent dynamics, but it is also possible to add several state and control constraints. The following paragraphs describe the optimisation problem implemented in the successive convex optimisation loop. Note that the superscript i that defines the current iteration loop is omitted from the following description for the sake of clarity. Figure 6 shows the nonconvex optimisation problem defined for this study.

Figure 6. Nonconvex optimisation problem.

It can be observed that the 6-DoF nonlinear descent dynamics displayed in Equations (4)–(6) are re-adapted to the 6-DoF descent of a powered-only first-stage booster, meaning that only the thrust vector of the main engine, denoted hereafter as

T_{r e f, B} (t)

, is considered as the control input

u (t)

. In fact, the steerable planar fins are not included in the optimisation problem for the rocket D&L in order to avoid adding complexity due to the nonlinearities generated by the addition of these aerodynamic loads. This is common practice for launcher re-entry, since the thrust vector (magnitude and direction) is a good indicator for reference trajectory generation. The allocation between the actuators, TVC, and steerable planar fins is achieved afterwards by the control subsystem using the reference values obtained in terms of thrust magnitude and attitude angles.

In addition, the aerodynamics are modelled through a so-called spherical aerodynamic model. This model, introduced by Szmuk et al. in Ref. [4], approximates the relationship between the aerodynamic force and the velocity vector and has the advantage of being easily implementable with the successive convex optimisation guidance method. More specifically, the aerodynamic force

A_{B} (t)

is considered to be always anti-parallel with respect to the velocity

v_{B} (t)

as if the vehicle were subjected to a pure drag force. Assuming that the rocket is axisymmetric, the aerodynamic forces and moments in the vehicle’s body-fixed reference frame are expressed by

\begin{aligned} A_{B}(t) &= -\frac{1}{2} ρ(t) \| v_{I}(t) \|_{2} S_{ref} C_{aero}(t) R_{B}^{I}(t) v_{I}(t) \\ M_{A,B}(t) &= [x_{CP} - x_{CG}(t)] \times A_{B}(t) \end{aligned}

(22)

Here,

C_{a e r o} (t) = diag ([\begin{matrix} c_{a, x} (t) & c_{a, x} (t) & c_{a, x} (t) \end{matrix}])

is the aerodynamic coefficient matrix, where

c_{a, x} (t)

is a positive scalar defined as follows

c_{a, x} (t) = C_{D} (α = π, M (t))

(23)

Here,

C_{D} (α (t), M (t))

is the drag coefficient, which is estimated from the available lookup tables defined in Section 2.3.
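A minimal numerical sketch of the spherical aerodynamic model of Equation (22) is given below. It takes the velocity already rotated into the body frame (a rotation preserves the norm, so the speed is unchanged) and assumes that the lever arm between the centre of pressure and the centre of gravity lies along the body x-axis; both simplifications, and all names, are assumptions of this example.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def spherical_aero(rho, v_body, s_ref, c_a, x_cp, x_cg):
    """Aerodynamic force and moment in the body frame with C_aero = c_a * I.

    v_body is R_B^I v_I; x_cp and x_cg are scalar positions along the
    body x-axis (assumption of this sketch).
    """
    speed = math.sqrt(sum(c * c for c in v_body))
    scale = -0.5 * rho * speed * s_ref * c_a
    force = [scale * c for c in v_body]        # anti-parallel to velocity
    moment = cross([x_cp - x_cg, 0.0, 0.0], force)
    return force, moment
```

Because the coefficient matrix is a multiple of the identity, the force is a pure drag for any attitude, which is what keeps the model simple to linearise.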

Regarding the state constraints, the first is a lower bound of the mass: for any time

t \in [t_{c}, t_{f}]

, the mass cannot be lower than the dry mass of the vehicle. This constraint is expressed as follows:

m (t) \geq m_{d r y} .

(24)

The second constraint is the so-called glide-slope constraint: it restricts the inertial position to lie within a glide-slope cone with half-angle

γ_{g s} \in [0, 90 \deg)

and a vertex at the landing site. This constraint is enforced by

e_{1} \cdot r_{I}(t) \geq \tan(γ_{gs}) \left\| \begin{bmatrix} e_{2} & e_{3} \end{bmatrix}^{T} r_{I}(t) \right\|_{2}

(25)

where

e_{i}, i \in [1, 3]

are the unit vectors (versors) of the reference axes. The third constraint then concerns the tilt angle, that is, the angle between the

x

-axes of the two reference frames, which is limited to a maximum of

θ_{m a x} \in (0, 90 \deg]

. It is defined by

cos (θ_{m a x}) \leq e_{I, 1}^{T} R_{I}^{B} (t) e_{B, 1} .

(26)

Then, the fourth constraint limits the angular rate of the vehicle and is enforced by

\| ω_{B}(t) \|_{2} \leq ω_{max} .

(27)

Finally, an additional constraint preserves the unit norm of the quaternion as follows:

\| q_{B}^{I}(t) \|_{2} = 1 .

(28)
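The state constraints of Equations (25)–(27) can be checked pointwise as in the sketch below. The first component of each vector is taken along e_1, and the tilt check receives the body x-axis already expressed in the inertial frame, so no rotation matrix is built here; these conventions and the function names are assumptions of the example.

```python
import math

def glide_slope_ok(r_inertial, gamma_gs_deg):
    """Equation (25): component along e_1 versus norm of the e_2, e_3 components."""
    x, y, z = r_inertial
    return x >= math.tan(math.radians(gamma_gs_deg)) * math.hypot(y, z)

def tilt_ok(body_x_in_inertial, theta_max_deg):
    """Equation (26): cos(theta_max) <= e_1 . (R_I^B e_1)."""
    return math.cos(math.radians(theta_max_deg)) <= body_x_in_inertial[0]

def rate_ok(omega_body, omega_max):
    """Equation (27): bound on the Euclidean norm of the angular rate."""
    return math.sqrt(sum(w * w for w in omega_body)) <= omega_max
```

All three are second-order cone or linear constraints in the state, which is why they need no further convexification in Section 3.2.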

Moreover, a so-called State-Triggered Constraint (STC) [4] is added. In the present case, it consists in imposing an angle of attack

α

constraint,

α_{m a x}

, when the dynamic pressure

Q (t)

is larger than a prescribed value

Q_{m a x}

. This constraint is written in a continuous formulation with a trigger function

g_{α}

and a constraint function

c_{α}

as follows:

\begin{aligned} h_{α}(r_{I}(t), v_{I}(t), q_{B}^{I}(t)) &= -\min(g_{α}(r_{I}(t), v_{I}(t)), 0) \cdot c_{α}(v_{I}(t), q_{B}^{I}(t)) \leq 0 \\ c_{α}(v_{I}(t), q_{B}^{I}(t)) &= e_{1} \cdot R_{B}^{I}(t) v_{I}(t) + \cos(α_{max}) \| v_{I}(t) \|_{2} \\ g_{α}(r_{I}(t), v_{I}(t)) &= Q_{max} - \frac{1}{2} ρ(t) \| v_{I}(t) \|_{2}^{2} . \end{aligned}

(29)
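To make the trigger logic of Equation (29) concrete, the sketch below evaluates h, g, and c for a single state. It assumes the velocity is supplied in the body frame (that is, R_B^I v_I has already been computed); the function name and the numerical values in the usage note are illustrative only.

```python
import math

def stc_angle_of_attack(rho, v_body, q_max, alpha_max_deg):
    """Evaluate the state-triggered constraint of Equation (29).

    Returns (h, g, c). The constraint h <= 0 is inactive while g >= 0
    (dynamic pressure below q_max) and requires c <= 0 once g < 0.
    """
    speed = math.sqrt(sum(c * c for c in v_body))
    g = q_max - 0.5 * rho * speed ** 2                       # trigger
    c = v_body[0] + math.cos(math.radians(alpha_max_deg)) * speed
    h = -min(g, 0.0) * c
    return h, g, c
```

For example, with a speed of 100 m/s and Q_max = 1000 Pa, a density of 0.1 kg/m^3 leaves the trigger positive and h equal to zero for any attitude, whereas a density of 1 kg/m^3 activates the constraint and h <= 0 then holds only if the velocity stays within the angle-of-attack cone.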

Two control constraints are considered to bound the direction and magnitude of the thrust force. The direction is bounded by limiting the TVC up to a maximum gimbal angle

δ_{m a x}

. It is enforced by

\cos(δ_{max}) \| T_{B,ref}(t) \|_{2} \leq e_{1} \cdot T_{B,ref}(t) .

(30)

Then, the thrust magnitude is bounded between minimum and maximum values, i.e.,

0 < T_{min} \leq \| T_{B,ref}(t) \|_{2} \leq T_{max}

(31)

where

T_{m i n}

and

T_{m a x}

are the lower and upper bounds, respectively.

The objective of the optimal control problem defined herein is to find the optimal trajectory subject to the defined re-entry dynamics and state and control constraints while minimising the vehicle’s fuel consumption, which corresponds to maximising the vehicle’s final mass. Therefore, the cost function can be written as follows at each ith SOCP iteration:

J = - m (t_{f}) .

(32)

3.2. SOCP Problem

The optimisation problem subject to the described dynamics and state and control constraints is not convex and must therefore be convexified. To achieve this, the first step is to convert the free-final-time nonlinear continuous-time optimal control problem into an equivalent fixed-final-time nonlinear continuous-time problem. This is achieved by normalising the time of flight from

t \in [t_{c}, t_{f}]

to

τ \in [0, 1]

, where

τ

is the normalised time of flight. The nonlinear dynamics are summarised as

\dot{x} (t) = f (x (t), u (t))

with

x (t) = {[\begin{matrix} m (t) & r_{I}^{T} (t) & v_{I}^{T} (t) & q_{B}^{I} {(t)}^{T} & ω_{B}^{T} (t) \end{matrix}]}^{T}

as the state vector and

u (t) = T_{B, r e f} (t)

as the control vector, which can be rewritten as follows:

\dot{x} (t) = \frac{d τ}{d t} \frac{d}{d τ} x (t) .

(33)

Therefore, with

σ = {(d τ / d t)}^{- 1}

, the normalised nonlinear dynamics are expressed by

\frac{d}{d τ} x (τ) = σ \cdot f (x (τ), u (τ))

(34)

where

σ = t_{f}

, since

τ \in [0, 1]

.

Then, the nonlinear descent dynamics equations, defined above, are linearised and discretised about the solution of the previous iteration through a first-order Taylor approximation and using a zero-order-hold interpolation scheme. First, the original continuous-time problem is transformed into a Linear Time-Varying (LTV) problem defined by

\frac{d}{d τ} x (τ) = A (τ) x (τ) + B (τ) u (τ) + Σ (τ) σ + z (τ)

(35)

where the parameters are evaluated about a reference trajectory corresponding to the previous (

i - 1

)th SOCP solution:

\begin{matrix} A (τ) & : = σ^{i - 1} \cdot \frac{\partial f}{\partial x} |_{x^{i - 1} (τ), u^{i - 1} (τ)} \\ B (τ) & : = σ^{i - 1} \cdot \frac{\partial f}{\partial u} |_{x^{i - 1} (τ), u^{i - 1} (τ)} \\ Σ (τ) & : = f (x^{i - 1} (τ), u^{i - 1} (τ)) \\ z (τ) & : = - A (τ) x^{i - 1} (τ) - B (τ) u^{i - 1} (τ) . \end{matrix}

(36)

Second, the discretised LTV system is given for each

k \in [1, K - 1]

by

\begin{matrix} x [k + 1] & = \bar{A} [k] x [k] + \bar{B} [k] u [k] + \bar{Σ} [k] σ + \bar{z} [k], \\ \bar{A} [k] & : = I_{n_{x} \times n_{x}} + T_{s} A [k], \\ \bar{B} [k] & : = T_{s} B [k], \\ \bar{Σ} [k] & : = T_{s} Σ [k], \\ \bar{z} [k] & : = T_{s} z [k] . \end{matrix}

(37)
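The update of Equation (37), as written, is a first-order step with the matrices held constant over each interval. The sketch below reproduces it with plain nested lists; the function names and the double-integrator test system are illustrative assumptions, not part of the simulator.

```python
def discretise(A, B, Sigma, z, T_s):
    """Discrete matrices of Equation (37) from their continuous counterparts at node k."""
    n = len(A)
    A_bar = [[(1.0 if r == c else 0.0) + T_s * A[r][c] for c in range(n)]
             for r in range(n)]
    B_bar = [[T_s * b for b in row] for row in B]
    Sigma_bar = [T_s * s for s in Sigma]
    z_bar = [T_s * zi for zi in z]
    return A_bar, B_bar, Sigma_bar, z_bar

def propagate(x, u, sigma, A_bar, B_bar, Sigma_bar, z_bar):
    """x[k+1] = A_bar x[k] + B_bar u[k] + Sigma_bar sigma + z_bar."""
    n = len(x)
    return [sum(A_bar[r][c] * x[c] for c in range(n))
            + sum(B_bar[r][j] * u[j] for j in range(len(u)))
            + Sigma_bar[r] * sigma
            + z_bar[r]
            for r in range(n)]
```

For a double integrator with T_s = 0.5, the state [1, 2] under the input [4] is propagated to [2, 4].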

Once the descent dynamics are linearised and discretised, the next step is the convexification of the nonconvex constraints. This concerns two state constraints, the norm of the quaternion (Equation (28)) and the STC (Equation (29)), and one control constraint, the lower bound of the thrust magnitude (Equation (31)). The convexification of Equation (28) is obtained through a first-order Taylor expansion approximation evaluated about the previous

(i - 1)

th SOCP iteration:

\| q_{B}^{I,i-1}[k] \|_{2} + \frac{(q_{B}^{I,i-1}[k])^{T}}{\| q_{B}^{I,i-1}[k] \|_{2}} (q_{B}^{I,i}[k] - q_{B}^{I,i-1}[k]) = 1 .

(38)
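The linearised unit-norm condition of Equation (38) can be evaluated numerically as follows. This is a small sketch with a hypothetical function name; it returns the residual of the equality, which the SOCP solver would drive to zero.

```python
import math

def linearised_unit_norm_residual(q_prev, q_new):
    """Left-hand side of Equation (38) minus one.

    q_prev: quaternion from iteration i-1 (linearisation point).
    q_new:  candidate quaternion at iteration i.
    """
    norm_prev = math.sqrt(sum(c * c for c in q_prev))
    correction = sum(p * (n - p) for p, n in zip(q_prev, q_new)) / norm_prev
    return norm_prev + correction - 1.0
```

A candidate such as [1, 0.1, 0, 0] satisfies the linearised condition about [1, 0, 0, 0] exactly, although its true norm is slightly above one. This first-order error is one reason why the deviation between consecutive iterations is limited by trust regions.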

The same method is used for the STC (Equation (29)). However, due to the

min (\cdot)

function, the constraint is approximated as follows:

\begin{cases} h_{α}(ξ^{i-1}[k]) + \left. \frac{\partial h_{α}}{\partial ξ} \right|_{ξ^{i-1}[k]} (ξ^{i}[k] - ξ^{i-1}[k]) \leq 0, & \text{if } g_{α}(ξ^{i-1}[k]) < 0 \\ 0, & \text{otherwise} \end{cases}

(39)

where

ξ^{i} [k] = {[\begin{matrix} v_{I}^{i} {[k]}^{T} & q_{B}^{I, i} {[k]}^{T} \end{matrix}]}^{T}, \forall k \in [1, K]

are the reference trajectory parameters obtained from the ith SOCP iteration. Lastly, it is applied to the lower bound of the thrust magnitude, obtaining the following expression for

k \in [1, K - 1]

:

\begin{aligned} h_{T}(u[k]) &= T_{min} - \| T_{B,ref}[k] \|_{2} \\ h_{T}(u^{i-1}[k]) + \left. \frac{\partial h_{T}}{\partial u} \right|_{u^{i-1}[k]} (u^{i}[k] - u^{i-1}[k]) &\leq 0 . \end{aligned}

(40)

The successive convex optimisation strategy involves the use of trust regions and virtual controls to prevent unboundedness and artificial infeasibility, respectively. In fact, these issues are due to the linearisation process. They could be avoided using the nonlinearity preservation and linearisation approach instead of the direct linearisation approach adopted in this guidance law to reduce complexity [29,30]. The implementation of trust regions allows one to limit the deviation between two consecutive iterations responsible for artificial unboundedness. They consist of quadratic inequality constraints. The aim is to define a region near the previous iteration so that the deviation is mitigated. As a consequence, this involves the radius being penalised in the cost function. In this optimisation problem, the trust regions are defined first for the state and control vectors and then for the time of flight as follows:

\begin{aligned} \| x^{i}[k] - x^{i-1}[k] \|_{2} + \| u^{i}[k] - u^{i-1}[k] \|_{2} &\leq Δ_{x,u}^{i}[k] \\ \| σ^{i} - σ^{i-1} \|_{2} &\leq Δ_{σ}^{i} . \end{aligned}

(41)

Δ_{x, u}^{i} = {[\begin{matrix} Δ_{x, u}^{i} [1], & \dots, & Δ_{x, u}^{i} [K] \end{matrix}]}^{T} \in R^{K}

is then defined as the state and control trust region vector. To convert this trust region vector into the SOCP formulation, it is necessary to define a joint state and control vector at each time instant,

ξ^{i} [k] = {[\begin{matrix} {(x^{i} [k])}^{T} & {(u^{i} [k])}^{T} \end{matrix}]}^{T}

,

k \in [1, K - 1]

so that Equation (41) can be rewritten as

\left\| \begin{bmatrix} \left(1 - 2 (ξ^{i-1}[k])^{T} ξ^{i}[k] + \left((ξ^{i-1}[k])^{T} ξ^{i-1}[k] - Δ_{x,u}^{i}[k]\right)\right) / 2 \\ I_{n_{ξ} \times n_{ξ}} ξ^{i}[k] \end{bmatrix} \right\|_{2} \leq \left(1 + 2 (ξ^{i-1}[k])^{T} ξ^{i}[k] - \left((ξ^{i-1}[k])^{T} ξ^{i-1}[k] - Δ_{x,u}^{i}[k]\right)\right) / 2 .

(42)
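The second-order cone form of Equation (42) can be checked numerically. Writing t = 2 ξ_prev·ξ - (ξ_prev·ξ_prev - Δ), the inequality ‖[(1 - t)/2; ξ]‖ ≤ (1 + t)/2 is algebraically equivalent to ξ·ξ ≤ t, that is, to a bound Δ on the squared Euclidean deviation of the joint vector from the previous iterate. The sketch below compares both forms; the function names are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def deviation_bound(xi, xi_prev, delta):
    """Direct form: squared distance to the previous iterate bounded by delta."""
    return sum((a - b) ** 2 for a, b in zip(xi, xi_prev)) <= delta

def soc_form(xi, xi_prev, delta):
    """Second-order cone form as written in Equation (42)."""
    t = 2.0 * dot(xi_prev, xi) - (dot(xi_prev, xi_prev) - delta)
    lhs = math.sqrt(((1.0 - t) / 2.0) ** 2 + dot(xi, xi))
    return lhs <= (1.0 + t) / 2.0
```

The equivalence follows from ((1 + t)/2)^2 - ((1 - t)/2)^2 = t, which is the standard way of expressing a quadratic bound as a cone constraint.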

Finally, the size of the trust regions must be bounded; therefore, the norms

Δ_{x, u}^{i}

and

Δ_{σ}^{i}

must be inserted into the cost function. Regarding the state and control trust region vector, a slack variable

S_{Δ_{x, u}}^{i}

must be introduced in order to avoid a quadratic term in the cost function. This implies the addition of the following inequality constraint [26]:

\| Δ_{x,u}^{i} \|_{2} \leq S_{Δ_{x,u}}^{i} .

(43)

Virtual controls are additional control inputs

ν^{i} \in R^{n_{x}}

that relax the dynamics so that every point of the solution domain can be reached, thereby avoiding artificial infeasibility. They are typically active during the first iterations of the algorithm because of the dynamically inconsistent initial guess, but they also compensate for the high-order terms neglected by the discretisation process. Therefore, the linear discrete dynamics of Equation (37) become

x^{i} [k + 1] = \bar{A} [k] x^{i} [k] + \bar{B} [k] u^{i} [k] + \bar{Σ} [k] σ + {\bar{z}}^{i} [k] + ν^{i} [k] .

(44)

We can then define a concatenated vector

{\bar{ν}}^{i} : = {[\begin{matrix} {(ν^{i} [1])}^{T}, & \dots, & {(ν^{i} [K - 1])}^{T} \end{matrix}]}^{T} \in R^{n_{x} \times (K - 1)}

. Similarly to the trust regions, all these terms must be penalised in the cost function, and to avoid a quadratic term, a slack variable

S_{ν}^{i}

must again be defined in conjunction with the following inequality constraint:

\| \bar{ν}^{i} \|_{2} \leq S_{ν}^{i} .

(45)

Finally, the cost function of Equation (32) is augmented with the previously defined features and becomes:

J = - m^{i} [K] + w_{ν} S_{ν}^{i} + w_{Δ_{x, u}} S_{Δ_{x, u}}^{i} + w_{Δ_{σ}} Δ_{σ}^{i}

(46)

where

w_{ν}

,

w_{Δ_{x, u}}

, and

w_{Δ_{σ}}

are penalisation weights.

The obtained SOCP optimisation problem, which is solved iteratively in the successive convex optimisation algorithm, is summarised in Figure 7. Table 3 provides the SOCP problem parameters.

Figure 7. SOCP problem.

Table 3. SOCP optimisation problem parameters.

4. Control Approach

From the reference trajectory computed by the previously defined guidance algorithm and the current states of the vehicle, the control algorithm must be able to generate the necessary commands in terms of the thrust magnitude

T_{r e f} (t)

; TVC deflection angles

{β_{T V C, y} (t), β_{T V C, z} (t)}

; and fin deflections

{β_{f i n, 1} (t), β_{f i n, 2} (t), β_{f i n, 3} (t), β_{f i n, 4} (t)}

to be applied by the actuators in order to correct the trajectory of the vehicle. For this study, we assume

β_{f i n, 1} (t) = β_{f i n, 2} (t) = β_{f i n, y} (t)

and

β_{f i n, 3} (t) = β_{f i n, 4} (t) = β_{f i n, z} (t)

. The method adopted here considers the use of two gain-scheduled PID controllers to compute the respective deflection angles. In fact, the thrust magnitude command is taken directly from the guidance algorithm

T_{ref}(t) = \| T_{B,ref}(t) \|_{2}

. This approximation is penalised by a low-pass filter, which simulates the intrinsic physics of the device, and the induced delay is compensated for by a PI controller. The descent control problem is more complex than that of the ascent phase because the thrust generated by the rocket’s main engine is throttleable. If the thrust magnitude were considered as a control input, the pitch and yaw motion could not be decoupled, as is usually done in preliminary rocket attitude control design. We followed this approach herein since the objective was primarily to study the interactions between all the subsystems rather than to develop a highly accurate, high-performance control system.

Usually, the 6-DoF problem is separated into two 3-DoF problems. One is characterised by the motion in the

x_{B} z_{B}

plane with the controller on the pitch angle

θ (t)

through the deflection angles

β_{T V C, y} (t)

and

β_{f i n, y} (t)

. The second problem is characterised by the motion in the

x_{B} y_{B}

plane with the controller on the yaw angle

ψ (t)

through the deflection angles

β_{T V C, z} (t)

and

β_{f i n, z} (t)

. An assumption is made that the roll angle

ϕ (t)

is small so that no coupling effects can arise in the dynamics. Therefore, two linear systems are built using a reference trajectory precomputed offline. This reference trajectory corresponds to the solution of the successive convex optimisation algorithm in its first run, meaning that the initial conditions of the studied problem are used. These can be rewritten in terms of the perturbed variables

\tilde{x} (t) = x (t) - \bar{x} (t)

and

\tilde{u} (t) = u (t) - \bar{u} (t)

, where

\bar{x} (t)

and

\bar{u} (t)

are the reference state and control vectors, respectively, to finally obtain

\begin{matrix} \dot{\tilde{x}} (t) = A (t) \tilde{x} (t) + B (t) \tilde{u} (t) \\ y (t) = C (t) \tilde{x} (t) \end{matrix}

(47)

where

A (t) \in R^{10 \times 10}

and

B (t) \in R^{10 \times 4}

are the Jacobian matrices of the nonlinear equations with respect to the state and control variables respectively, computed with the function

jacobian

in MATLAB, and

C (t) \in R^{2 \times 10}

enables the extraction of the pitch angle error

\tilde{θ} (t)

and the yaw angle error

\tilde{ψ} (t)

. Therefore, the decoupling into two 3-DoF is achieved, and the following linear systems are obtained:

\begin{aligned} x_{pitch}(t) &= \begin{bmatrix} m(t) & v_{x}(t) & v_{z}(t) & ω_{y}(t) & θ(t) \end{bmatrix}^{T} \in R^{5}, \\ u_{pitch}(t) &= \begin{bmatrix} β_{TVC,y}(t) & β_{fin,y}(t) \end{bmatrix}^{T} \in R^{2}, \quad y_{pitch}(t) = θ(t) \in R \\ x_{yaw}(t) &= \begin{bmatrix} m(t) & v_{x}(t) & v_{y}(t) & ω_{z}(t) & ψ(t) \end{bmatrix}^{T} \in R^{5}, \\ u_{yaw}(t) &= \begin{bmatrix} β_{TVC,z}(t) & β_{fin,z}(t) \end{bmatrix}^{T} \in R^{2}, \quad y_{yaw}(t) = ψ(t) \in R \end{aligned}

(48)

where

v_{x} (t)

,

v_{y} (t)

, and

v_{z} (t)

are the x, y, and z components of

v_{B} (t)

, respectively, and

ω_{y} (t)

and

ω_{z} (t)

are the y and z components of

ω_{B} (t)

. The corresponding Jacobian matrices are computed similarly to the linear system defined in Equation (47). This decoupling of the dynamics was validated in [26].

With these definitions, and because two control inputs are considered in each linear system (TVC and fin deflections), each system is a Multiple-Input Multiple-Output (MIMO) control system, to which classical linear control theory is difficult to apply since every channel must be addressed iteratively in a single-loop fashion. One way to overcome this drawback would be to use advanced robust control methods such as the

H_{\infty}

family of methods or the LPV approach. A preliminary study of structured

H_{\infty}

control synthesis within this simulator is available in Ref. [31]. In this study, to develop a baseline simulator and stay in line with the current state of the art in control design for launchers [10,32], the linear systems are adapted to Single-Input Single-Output (SISO) control systems, for which it is possible to use gain-scheduled PID controllers. Two configurations are chosen and are explained in the next subsections. The first is the TVC-only configuration, for which the fins are considered fixed and the only input is therefore the TVC deflection. The second configuration lies in the definition of a control moment, introduced in Ref. [16], which gathers TVC and fin control authorities and then allocates the necessary command to each actuator according to the level of thrust.

4.1. TVC-Only SISO Configuration

In this case, the only control inputs are

β_{T V C, y} (t)

for the pitch plane and

β_{T V C, z} (t)

for the yaw plane. Therefore, the two linear systems consider the following parameters:

\begin{matrix} x_{p i t c h} (t) & = {[\begin{matrix} m (t) & v_{x} (t) & v_{z} (t) & ω_{y} (t) & θ (t) \end{matrix}]}^{T} \in R^{5}, \\ u_{p i t c h} (t) & = β_{T V C, y} (t), y_{p i t c h} (t) = θ (t) \\ x_{y a w} (t) & = {[\begin{matrix} m (t) & v_{x} (t) & v_{y} (t) & ω_{z} (t) & ψ (t) \end{matrix}]}^{T} \in R^{5}, \\ u_{y a w} (t) & = β_{T V C, z} (t), y_{y a w} (t) = ψ (t) \end{matrix}

(49)

where

v_{x} (t)

,

v_{y} (t)

, and

v_{z} (t)

are the x, y, and z components of

v_{B} (t)

, respectively, and

ω_{y} (t)

and

ω_{z} (t)

are the y and z components of

ω_{B} (t)

. The corresponding Jacobian matrices are computed similarly to the linear system defined in Equation (47).

Due to the time-varying nature of the problem, a single PID controller might be unable to stabilise the system over the whole trajectory. Therefore, the reference altitude profile is discretised into 25 slots, and a linearisation is performed in each. The altitude was chosen as the scheduling parameter since it evolves monotonically with time and has been well validated in the literature [33,34]. Moreover, it captures the variations in thrust magnitude. In this way, the problem is divided into regions in which it is possible to analyse whether the controller is able to stabilise the system. The controllers can therefore be considered gain-scheduled PID controllers, as the gains can be changed to achieve the desired level of performance in every region. For each system, the gains are tuned to meet the following performance requirements: an overshoot below 10%, a settling time strictly below 1 s, a gain margin above 6 dB, and a phase margin above 60 deg. The tuning is performed with the MATLAB application

PID tuner

.

4.2. TVC and Fin SISO Configuration

Here, the MIMO formulation is translated into a SISO formulation by defining a surrogate variable that gathers the gimbal and fin deflection angles and performing the control synthesis on it. More specifically, following Ref. [16], the control moment

m_{c t r} (t)

is defined as a parameter that specifies the necessary pitch or yaw moment to correct the trajectory of the vehicle. Knowing the control effectiveness level of each actuator, a control allocation algorithm is then used to determine the actual control inputs

{β_{T V C, y} (t)

,

β_{f i n, y} (t)}

and

{β_{T V C, z} (t)

,

β_{f i n, z} (t)}

.

The control effectiveness levels are expressed as follows. The effectiveness of TVC in generating control moments is quantified by

μ_{T V C} (t) = [x_{C G} (t) - x_{P V P}] \frac{T_{r e f} (t)}{J_{N} (t)} .

(50)

Regarding the fins, the control effectiveness is given by

μ_{f i n} (t) = 2 [x_{f i n} - x_{C G} (t)] \frac{Q (t) S_{f i n} C_{N, f i n ∖ α} (t)}{J_{N} (t)}

(51)

where

C_{N, f i n ∖ α} (t) = 2 π (\frac{A R_{f i n}}{A R_{f i n} + 2}) cos (γ_{f i n, i} (t))

is the normal fin force gradient with

γ_{f i n, i} (t)

computed from Equation (16) for the pitch plane and Equation (17) for the yaw plane. The relationship between the control moment and the control inputs is then expressed as

m_{c t r, #} (t) = - μ_{T V C} (t) β_{T V C, #} (t) - μ_{f i n} (t) β_{f i n, #} (t)

(52)

where

# = {y, z}

for the pitch plane and the yaw plane, respectively.
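Equations (50)–(52) translate directly into a few scalar functions, sketched below. The symbols x_PVP and J_N are not defined in this section; they are treated here simply as given scalars (presumably the TVC pivot position and a transverse moment of inertia), and angles are assumed to be in radians.

```python
import math

def fin_normal_force_gradient(aspect_ratio, gamma_fin):
    """Normal fin force gradient: 2*pi * AR / (AR + 2) * cos(gamma_fin)."""
    return 2.0 * math.pi * aspect_ratio / (aspect_ratio + 2.0) * math.cos(gamma_fin)

def tvc_effectiveness(x_cg, x_pvp, thrust, j_n):
    """Equation (50)."""
    return (x_cg - x_pvp) * thrust / j_n

def fin_effectiveness(x_fin, x_cg, q_dyn, s_fin, c_n_alpha, j_n):
    """Equation (51); the factor 2 accounts for the fin pair acting in one plane."""
    return 2.0 * (x_fin - x_cg) * q_dyn * s_fin * c_n_alpha / j_n

def control_moment(mu_tvc, beta_tvc, mu_fin, beta_fin):
    """Equation (52) for one plane (y or z)."""
    return -mu_tvc * beta_tvc - mu_fin * beta_fin
```

The TVC effectiveness scales with the thrust and the fin effectiveness with the dynamic pressure, so their ratio changes along the descent; this is what the allocation logic has to manage.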

Therefore, these parameters are obtained from the reference trajectory, and similarly to Equation (47), the following linear systems are built for the pitch and the yaw planes:

\begin{matrix} x_{p i t c h} (t) & = {[\begin{matrix} m (t) & v_{x} (t) & v_{z} (t) & ω_{y} (t) & θ (t) \end{matrix}]}^{T} \in R^{5}, \\ u_{p i t c h} (t) & = m_{c t r, y} (t), y_{p i t c h} (t) = θ (t) \\ x_{y a w} (t) & = {[\begin{matrix} m (t) & v_{x} (t) & v_{y} (t) & ω_{z} (t) & ψ (t) \end{matrix}]}^{T} \in R^{5}, \\ u_{y a w} (t) & = m_{c t r, z} (t), y_{y a w} (t) = ψ (t) . \end{matrix}

(53)

The Jacobian matrices and the corresponding PIDs for the given altitude slots are computed in the same manner as for the previous configuration. Note that the obtained controllers must be robust enough to cope with a range of trajectories, since the guidance is recomputed several times during the descent whereas the gains are not retuned. However, the updated guidance trajectories are observed to follow the same pattern, which is enforced by the boundary constraint on the quaternion (recall Figure 7), and since the controllers are scheduled with respect to the altitude (and not the time of flight, which is unknown), the obtained gains provide satisfactory results throughout the descent.

Finally, the commanded control moment

m_{c t r} (t)

is allocated between the TVC system and the planar fins following the algorithm in Ref. [16], repeated in Algorithm 1. More specifically, if the commanded thrust magnitude

T_{r e f} (t)

is above the user-defined high thrust limit

T_{H T L}

, then the TVC system is used as the primary actuator, and the planar fins are used only if the maximum authority

β_{T V C, m a x}

of the TVC system is reached. In contrast, if the thrust magnitude command

T_{r e f} (t)

is below the user-defined high thrust limit

T_{H T L}

, then the planar fins are used as the primary actuator, and the TVC system is used as the secondary actuator if the maximum authority

β_{f i n, m a x}

of the planar fins is reached. Here,

β_{T V C, m a x} = 10 \deg

and

β_{f i n, m a x} = 20 \deg

.

Algorithm 1 Control allocation [16]

if $T_{ref} \geq T_{HTL}$ then
  $β_{TVC} \leftarrow - m_{ctr} / μ_{TVC}$
  $β_{fin} \leftarrow 0$
  if $| β_{TVC} | > β_{TVC,max}$ then
    $β_{TVC} \leftarrow β_{TVC,max} \times sign(β_{TVC})$
    $β_{fin} \leftarrow - (m_{ctr} + μ_{TVC} \times β_{TVC}) / μ_{fin}$
  end if
else
  $β_{fin} \leftarrow - m_{ctr} / μ_{fin}$
  $β_{TVC} \leftarrow 0$
  if $| β_{fin} | > β_{fin,max}$ then
    $β_{fin} \leftarrow β_{fin,max} \times sign(β_{fin})$
    $β_{TVC} \leftarrow - (m_{ctr} + μ_{fin} \times β_{fin}) / μ_{TVC}$
  end if
end if
OUTPUTS: $β_{TVC}$, $β_{fin}$
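Algorithm 1 can be transcribed into a runnable form as in the sketch below. It follows the listed steps one by one; the function and argument names are chosen for this example, the effectiveness values are assumed to be nonzero, and, as in Algorithm 1, the secondary actuator command is not saturated.

```python
import math

def allocate_control(m_ctr, t_ref, t_htl, mu_tvc, mu_fin,
                     beta_tvc_max, beta_fin_max):
    """Split the commanded control moment between TVC and fins (Algorithm 1).

    Angles and limits must use the same unit (radians assumed here).
    Returns (beta_tvc, beta_fin).
    """
    if t_ref >= t_htl:
        # High thrust: TVC is the primary actuator.
        beta_tvc = -m_ctr / mu_tvc
        beta_fin = 0.0
        if abs(beta_tvc) > beta_tvc_max:
            beta_tvc = math.copysign(beta_tvc_max, beta_tvc)
            # Fins supply the moment the saturated TVC cannot.
            beta_fin = -(m_ctr + mu_tvc * beta_tvc) / mu_fin
    else:
        # Low thrust: fins are the primary actuator.
        beta_fin = -m_ctr / mu_fin
        beta_tvc = 0.0
        if abs(beta_fin) > beta_fin_max:
            beta_fin = math.copysign(beta_fin_max, beta_fin)
            beta_tvc = -(m_ctr + mu_fin * beta_fin) / mu_tvc
    return beta_tvc, beta_fin
```

In both branches the returned pair reproduces the commanded moment through Equation (52), since the secondary command is computed from the residual left by the saturated primary actuator.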

Note that this control configuration also enables a fin-only actuation configuration by setting a high thrust limit

T_{H T L}

greater than the maximum thrust magnitude allowed by the guidance algorithm. Note also that the thrust magnitude was selected as the criterion for switching the actuator allocation after further analyses. Other criteria were tested, such as the dynamic pressure or the control effectiveness levels, that is, allocation primarily to the TVC system if

μ_{T V C} (t) > μ_{f i n} (t)

and to the planar fins otherwise. However, the dynamic pressure was not a reliable criterion: at the beginning of the trajectory, both the dynamic pressure and the thrust magnitude are high, so the planar fins are efficient but still less so than the TVC system. The control effectiveness criterion was not optimal either, since intervals in which both actuators had a similar control authority were observed; these would cause rapid switching of the commands given to the actuators and could lead to convergence issues. Moreover, since the reference thrust magnitude is among the control inputs and is completely decoupled from the TVC system by design, this criterion is simpler to implement and prevents coupling effects, and it led to the best results.

Once verified through linear analysis, the controllers were implemented in the nonlinear simulator and selected according to the actual altitude, following the scheme described in Figure 8. No interpolation was performed: a controller was selected as soon as the vehicle entered the altitude region in which that controller had been defined. Note that the controllers’ gains could have been interpolated linearly with respect to the altitude using a finite-difference method as in Ref. [33]. However, this solution was not adopted, since the gains of two adjacent gain-scheduled controllers were considerably different, leading to inaccuracies in the interpolation. Another strategy would be to use a so-called signal blending scheme to mitigate this issue [34]. However, this could cause large transients in the switching regions and would be quite complex to implement. Therefore, this technique was not studied, since the objective was primarily the design of a closed-loop baseline simulator. The gain-scheduling method should be investigated more thoroughly in future work, since an improved scheduling strategy would substantially enhance robustness.

Figure 8. Gain-scheduling method description.
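
The region-based selection described above (no interpolation between adjacent controllers) can be sketched as a simple table lookup. This is only an illustrative sketch: the altitude breakpoints, the gain values, and the names `PIDGains`, `REGION_LOWER_BOUNDS_M`, and `select_gains` are hypothetical and are not taken from the paper.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class PIDGains:
    kp: float
    ki: float
    kd: float


# Hypothetical schedule: lower altitude bound of each region (m, ascending)
# and the gains designed for that region. All values are placeholders.
REGION_LOWER_BOUNDS_M = [0.0, 5_000.0, 10_000.0, 15_000.0]
REGION_GAINS = [
    PIDGains(2.0, 0.10, 0.8),
    PIDGains(1.5, 0.08, 0.6),
    PIDGains(1.0, 0.05, 0.4),
    PIDGains(0.6, 0.02, 0.3),
]


def select_gains(altitude_m: float) -> PIDGains:
    """Return the gains of the region containing the altitude.

    The gains switch as soon as a region boundary is crossed; nothing is
    interpolated or blended between adjacent regions.
    """
    index = bisect_right(REGION_LOWER_BOUNDS_M, altitude_m) - 1
    # Clamp so altitudes outside the table reuse the nearest region.
    index = max(0, min(index, len(REGION_GAINS) - 1))
    return REGION_GAINS[index]
```

With such a lookup, the commanded gains jump at each region boundary, which is the behaviour that interpolation or signal blending would smooth out.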

5. Simulation Results

This section illustrates the results obtained with the proposed G&C architecture coupled with the RLV controlled dynamics simulator under different control configurations: TVC-only, fins-only, and both (Section 5.1). Then, a sensitivity analysis is carried out to assess the impact on the obtained trajectory from disturbances such as wind gusts as well as multiple uncertainties through a Monte Carlo approach (Section 5.2).

5.1. Nominal Trajectory Simulations for Different Actuation Configurations

For this study, no wind was considered, and neither propellant sloshing effects nor flexible bending modes were included, since the described simulator is still at an early design stage and more complex studies are necessary for future developments. Three different actuation configurations were tested. The first one with TVC actuation only used the control architecture defined in Section 4.1 and considered fixed planar fins with 0 deg deflection. The second used only planar fins actuation with the control architecture defined in Section 4.2 (with $T_{HTL} = T_{max} = 600$ kN). Finally, the third configuration used TVC and planar fins actuation with a thrust magnitude limit of $T_{HTL} = 70$ kN. The initial and final conditions are described in Table 4. The initial conditions allowed us to study a trajectory evolving mainly in the pitch plane. Other simulations were also carried out for a trajectory mainly in the yaw plane and for a trajectory in both planes, showing similar results; therefore, they are not displayed in this paper.
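
The thrust-based allocation used in the second and third configurations can be sketched as a threshold test on the reference thrust: TVC is the primary effector while the thrust exceeds the limit, and the fins are used otherwise. This is a minimal sketch; the function name `select_actuator` and the strict inequality at the boundary are assumptions made for illustration.

```python
T_MAX_KN = 600.0  # maximum thrust magnitude


def select_actuator(thrust_ref_kn: float, t_htl_kn: float) -> str:
    """Return "TVC" when the reference thrust exceeds the limit, else "fins"."""
    return "TVC" if thrust_ref_kn > t_htl_kn else "fins"


# Fins-only configuration: the limit equals T_max, so the reference thrust
# can never exceed it and the fins are always selected.
fins_only_choice = select_actuator(T_MAX_KN, t_htl_kn=T_MAX_KN)

# TVC-and-fins configuration: 70 kN limit.
high_thrust_choice = select_actuator(250.0, t_htl_kn=70.0)
low_thrust_choice = select_actuator(40.0, t_htl_kn=70.0)
```

Setting the limit to the maximum thrust is what turns the same allocation logic into a fins-only configuration.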

Table 4. Initial and final conditions.

Figure 9 shows the converged trajectories for the different cases, as well as the control contributions of the vehicle through the TVC and fin deflection angles and the thrust magnitude level. The forces acting on the vehicle as well as the vertical axes of the vehicle and fins are represented at different times during the descent. Table 5 summarises the performance results obtained for each configuration through the final vehicle mass, the final downrange error, and the final velocity error. Performance criteria were defined to evaluate the different simulation cases. In this study, a precise soft landing was considered satisfactory when the final mass of the vehicle was greater than the dry mass, when the downrange error was lower than 300 m, and when the final velocity was lower than 10 m/s.

Figure 9. Nominal trajectory simulations for different actuation configurations: TVC-only, Fins-only, and TVC & Fins. Wind is not considered. No propellant sloshing effects or flexible modes are included.

Table 5. Performance results for the different actuation configurations.
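
As a worked check, the three landing criteria stated above (final mass above the dry mass, downrange error below 300 m, final velocity below 10 m/s) can be applied to the Table 5 values, using the dry mass of 2750 kg from Table 3. The class and function names are hypothetical and serve only to illustrate the check.

```python
from dataclasses import dataclass

DRY_MASS_KG = 2750.0       # m_dry from Table 3
MAX_DOWNRANGE_M = 300.0    # downrange criterion
MAX_VELOCITY_MPS = 10.0    # final velocity criterion


@dataclass(frozen=True)
class LandingResult:
    final_mass_kg: float
    final_downrange_m: float
    final_velocity_mps: float


def is_precise_soft_landing(result: LandingResult) -> bool:
    """Apply the three criteria for a satisfactory precise soft landing."""
    return (
        result.final_mass_kg > DRY_MASS_KG
        and abs(result.final_downrange_m) < MAX_DOWNRANGE_M
        and abs(result.final_velocity_mps) < MAX_VELOCITY_MPS
    )


# Values reported in Table 5.
TABLE_5 = {
    "TVC-only": LandingResult(2775.0, 77.0, 4.96),
    "Fins-only": LandingResult(2761.0, 354.0, 4.86),
    "TVC and fins": LandingResult(2767.0, 84.0, 6.53),
}

verdicts = {name: is_precise_soft_landing(r) for name, r in TABLE_5.items()}
```

Only the fins-only configuration fails the check, because its 354 m downrange error exceeds the 300 m bound.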

From these simulations, some observations could be made. For the TVC-only configuration in Figure 9a, we noticed that the commanded thrust vector (red) was not anti-parallel to the velocity vector (magenta), since the TVC system was activated to counteract the deviations caused by the aerodynamic force (orange). No saturation was observed, since the TVC deflections remained between $-10$ and $10$ deg, and the rocket managed to reach the landing site vertically, satisfying the landing requirements quite accurately.

For the fins-only configuration in Figure 9b, however, the trajectory obtained was considerably different. The fins’ deflection can be observed through the pitch fins’ vertical body axis (dark blue), which is no longer aligned with the rocket vertical body axis (green). This deflection created the normal force of the corresponding fins, which corrected the trajectory of the vehicle. Even though saturation was not reached, the performance results were not as good as those of the TVC-only configuration, since the final downrange error was higher and exceeded the aforementioned criterion for precise landing. This lack of precision was compensated for by a slight reduction in propellant use. This suggests that TVC is essential for precise landing.

This observation was supported by the last configuration, which used TVC as the primary effector when the thrust magnitude level was higher than 70 kN and fin control otherwise; the results are shown in Figure 9c. The obtained trajectory resembled a combination of the two previous trajectories: the TVC-only trajectory until 80 s of flight and around 4 km of altitude, and then the fins-only profile. However, we observed a saturation of the fins between 80 and 95 s of flight, which was likely a consequence of the control allocation switch. In terms of performance, this configuration gave a more accurate final downrange position than the fins-only configuration, again with the advantage of a reduction in propellant mass use. The saturation due to the control allocation switching was likely responsible for the higher final velocity error, although this remained within the desired bounds.
Therefore, we observed the limitations of the adopted control law, since a rapid change in control allocation could generate undesired transients that could degrade the final performance. However, this method enabled us to easily notice the advantages of combining TVC and steerable planar fins for the aerodynamic and powered descent phase of reusable launchers. Note that in the problem studied, the steerable planar fins were used at a relatively low altitude compared to standard scenarios; under 5 km of altitude, the TVC system is typically preferred. This is due to the thrust magnitude profile given by the guidance algorithm, which does not follow so-called bang-bang behaviour and therefore causes the control authority of the TVC system to be higher than that of the steerable planar fins during most of the descent flight. In Ref. [35], the authors analysed the guidance strategy to obtain this bang-bang profile and compared the global performance using the same simulator. This led to a significant increase in performance, with a trajectory for which the fins were primarily used in the middle of the flight, between the two thrust burns from the main engine.

5.2. Sensitivity Analyses

In this section, the simulator was extended by adding external forces such as wind, as well as dispersions on specific parameters. This study enabled us to demonstrate how the combination of TVC and steerable planar fins managed to counteract these forces and to assess the robustness of the current G&C architecture against disturbances and uncertainties.

5.2.1. Wind

In this study, we considered three different wind cases that modified the gust amplitude and the altitude range at which the gust occurred (recall Equation (2)). Case 1 corresponded to $A_{gust} = 15$ m/s, $h_{1} = 7$ km, $h_{2} = 4$ km; Case 2 corresponded to $A_{gust} = 25$ m/s, $h_{1} = 17$ km, $h_{2} = 10$ km; and Case 3 corresponded to $A_{gust} = 30$ m/s, $h_{1} = 17$ km, $h_{2} = 14$ km. Figure 10 displays these cases in the up-north plane, as well as the horizontal wind. Note that the same wind conditions were also considered in the up-east plane to study the impact on the yaw motion. This led to the creation of an out-of-plane component along the east direction and a 3D trajectory. Note also that the wind gust model used here was not realistic and that using noise-coloring Dryden filters as in Ref. [36] would be more accurate. However, for this baseline analysis, the goal was only to analyse the behaviour of the G&C system in counteracting external events such as wind, and more accurate models remain to be developed in future work.

Figure 10. Description of the wind cases studied.
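
To make the three cases concrete, the sketch below evaluates a gust that is non-zero only between the altitudes $h_{2}$ and $h_{1}$. Equation (2) is not reproduced in this section, so the "1-cosine" shape used here is an assumption (a common discrete gust form) and may differ from the model actually used; the names `gust_speed` and `WIND_CASES` are hypothetical.

```python
import math

# (A_gust in m/s, h_1 in km, h_2 in km) for the three cases in the text.
WIND_CASES = {
    1: (15.0, 7.0, 4.0),
    2: (25.0, 17.0, 10.0),
    3: (30.0, 17.0, 14.0),
}


def gust_speed(altitude_km: float, amplitude_mps: float,
               h1_km: float, h2_km: float) -> float:
    """Assumed 1-cosine gust between h2 (lower) and h1 (upper), zero elsewhere."""
    if not (h2_km <= altitude_km <= h1_km):
        return 0.0
    phase = (altitude_km - h2_km) / (h1_km - h2_km)
    # Rises smoothly from zero at h2 to the amplitude mid-band, back to zero at h1.
    return 0.5 * amplitude_mps * (1.0 - math.cos(2.0 * math.pi * phase))


# Gust speed in the middle of each altitude band.
mid_band = {
    case: gust_speed(0.5 * (h1 + h2), amp, h1, h2)
    for case, (amp, h1, h2) in WIND_CASES.items()
}
```

Under this assumed shape, the gust reaches its full amplitude halfway through the altitude band and vanishes at both edges.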

The three wind cases were tested under nominal initial conditions with the enhanced aerodynamic model and the TVC and fins control configuration corresponding to Figure 9c of the previous section. Figure 11 presents the simulation results showing the altitude versus downrange and velocity profiles and the control contributions in terms of deflection angles for each control configuration. The deflection angles in the yaw plane, $β_{TVC, z}$ and $β_{fin, z}$, are also represented to show that the consideration of the wind also led to the emergence of trajectory corrections in the yaw plane. Table 6 summarises the performance results.

Figure 11. Study of the impact of the wind in three different cases using the nominal conditions.

Table 6. Performance results for three wind cases.

From these simulations, it is possible to observe how the trajectory was modified by the corresponding wind gust by examining the altitude versus downrange profile in Figure 11a. Case 3, with a strong gust at a higher altitude, did not impact the trajectory profile considerably, since it followed the nominal profile relatively well. However, Cases 1 and 2 showed that at a lower altitude such gusts could modify the trajectory quite significantly, even if they were not particularly strong ($A_{gust} = 15$ m/s for Case 1). This statement was confirmed by the performance results in Table 6, where Case 1, in which the wind gust occurred at the lowest altitude, shows the largest final downrange error (260 m). Furthermore, looking at the control commands generated in Figure 11b, slight changes in the deflection angles compared to the nominal profile can be observed at the times of the gusts. Since the gusts also occurred in the yaw plane, we also noticed deflections arising from the actuators controlling the yaw motion. These also impacted the rest of the trajectory, since yaw fin deflections were still generated after the wind gusts had stopped. Overall, even though the controller was not designed to specifically counteract the wind (which could be achieved by including the wind as an exogenous input in the control synthesis [36]), it still provided satisfactory performance results within the desired bounds for precise landing defined previously, and therefore enabled us to study G&C interactions in the presence of wind.

5.2.2. Monte Carlo Analyses

Finally, the G&C system was tested within the 6-DoF controlled dynamics simulator in the presence of multiple uncertainties and disturbances through a 100-run Monte Carlo analysis. Note that 100 cases might not be sufficient to properly assess the robustness of the present control system. However, the objective of the study was not to provide a high-performance control system, but rather a relevant tool to perform controllability analyses of reusable rockets during the D&L phase. Therefore, the robustness analysis carried out here first had to ensure that the present tool could adapt to a range of different trajectories, and this could be evaluated with this number of runs. The corresponding dispersions are indicated in Table 7. Note again that neither sloshing effects nor flexible modes were included in this analysis. The results of the analysis are depicted in Figure 12, showing the errors in terms of position, velocity, and pitch angle, as well as the corresponding control commands in terms of thrust magnitude, TVC gimbal angle, and fin deflection angle profiles. At the bottom of the figure, a table gives the number of cases belonging to each of three categories: (i) those for which a convergence issue occurred or the final mass obtained was lower than the dry mass of the vehicle, which were therefore considered failures; (ii) those for which the final velocity or downrange did not satisfy the criteria defined in Section 5.1; and (iii) those whose results satisfied these criteria.

Table 7. Perturbations considered for the Monte Carlo analysis.

Figure 12. Results of the 100-run Monte Carlo analysis for the nominal case. Wind was not considered. No propellant sloshing effects or flexible modes were included.
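
The three-way classification of the Monte Carlo runs can be sketched as follows. The sketch assumes that a run is a failure when it does not converge or when it ends at or below the dry mass, consistent with the Section 5.1 criterion that the final mass must exceed the dry mass; the names `RunResult`, `classify`, and `tally`, and the sample runs, are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass

DRY_MASS_KG = 2750.0       # m_dry from Table 3
MAX_DOWNRANGE_M = 300.0    # Section 5.1 criterion
MAX_VELOCITY_MPS = 10.0    # Section 5.1 criterion


@dataclass(frozen=True)
class RunResult:
    converged: bool
    final_mass_kg: float
    final_downrange_m: float
    final_velocity_mps: float


def classify(run: RunResult) -> str:
    """Assign a run to category (i), (ii), or (iii)."""
    # (i) Failure: no convergence, or no propellant margin left (assumed).
    if not run.converged or run.final_mass_kg <= DRY_MASS_KG:
        return "failure"
    # (ii) Usable run that misses the downrange or velocity criterion.
    if (abs(run.final_downrange_m) >= MAX_DOWNRANGE_M
            or abs(run.final_velocity_mps) >= MAX_VELOCITY_MPS):
        return "criteria_not_met"
    # (iii) Precise soft landing.
    return "success"


def tally(runs: list[RunResult]) -> Counter:
    """Count the runs in each category."""
    return Counter(classify(run) for run in runs)


# Made-up runs, only to exercise the three categories.
example_runs = [
    RunResult(False, 2800.0, 10.0, 1.0),
    RunResult(True, 2700.0, 10.0, 1.0),
    RunResult(True, 2760.0, 350.0, 5.0),
    RunResult(True, 2760.0, 100.0, 12.0),
    RunResult(True, 2760.0, 100.0, 5.0),
]
example_counts = tally(example_runs)
```

Checking failures first keeps the categories mutually exclusive, so the three counts always sum to the number of runs.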

These results confirmed that the G&C system was not highly robust to uncertainties. Of the 100 cases, 41 were failures, 31 of them due to a convergence issue (not shown in the figure). This meant that 31% of the tested cases were not usable, showing that the current G&C solution could not be applied to real scenarios. However, all other cases could be used to study the controllability of reusable rockets, which was the main objective of the simulator. Among them, 34 cases satisfied the criteria for a precise soft landing, showing the system’s relative flexibility to undertake the necessary corrections and counteract the existing uncertainties. In terms of pitch angle error, we noticed some cases where the error was greater than for the nominal case, but the controllers and actuators managed to correct it well and land with a pitch angle within $[-1, 2]$ deg. Looking at the control contributions, we observed that as soon as the pitch angle error grew, the controller quickly compensated for it by generating the corresponding actuator deflection angles. We also observed significant differences in the control command profiles, because the thrust reference profile generated by the guidance algorithm was sensitive to the disturbances and uncertainties considered. In some cases, this profile showed a higher commanded thrust at the beginning and a lower one in the second part of the flight, causing the actuator switch from the TVC to the steerable planar fins to occur earlier. Consequently, the deflection profiles obtained from the actuators were significantly different. However, for some cases this behaviour did not reduce overall performance, confirming that even if this control strategy is not optimal, it manages to overcome the challenging task of combining TVC and steerable planar fins for the descent phase and precise landing of reusable launchers.

6. Conclusions

This paper described the development of a controlled dynamics simulator with closed-loop guidance and control integration for the D&L phase of reusable launchers. We considered a VTVL first-stage booster descent and soft pinpoint landing. The simulator included the 6-DoF descent dynamics of a rigid-body model with a varying mass, evolving in the terrestrial atmosphere with varying environmental parameters, uncertainties, and disturbances and subjected to external forces. To steer the spacecraft towards a controlled descent and a soft pinpoint landing, the vehicle is equipped with a TVC system and steerable planar fins controlled by gain-scheduled PID controllers, which correct the trajectory deviations with respect to the reference profile generated by a successive convex optimisation guidance algorithm. More specifically, the simulator involved a modular control architecture, allowing us to study different actuation configurations according to the mission requirements and the flight phase: TVC-only, planar fins-only, or both.

Several simulations were carried out that allowed us to provide preliminary assessments of the controllability challenges encountered by a rocket during the D&L phase while highlighting the necessary improvements for enhanced robustness to uncertainties. The combination of the TVC system and steerable planar fins was critical to provide a fuel-optimal trajectory and a precise landing for the reusable rocket while counteracting the possible disturbances and uncertainties existing in the terrestrial atmosphere. Despite the simplifying assumptions used in the simulator design and the low complexity of the control and allocation laws adopted, the tool obtained represents a powerful and versatile baseline for the development of more sophisticated G&C techniques. For example, as mentioned in the previous section, the guidance could be leveraged to generate the so-called bang-bang thrust magnitude profile, likely leading to less propellant consumption. Advanced approaches such as pseudospectral convex optimisation could be assessed and compared with the current successive convex optimisation strategy. Concerning the control system synthesis, methods based on robust algorithms such as structured $H_{\infty}$ could also be assessed in the simulator and are expected to provide improved performance.

Author Contributions

Conceptualisation, A.D.O. and M.L.; methodology, A.D.O.; software, A.D.O.; validation, A.D.O.; formal analysis, A.D.O.; investigation, A.D.O.; resources, A.D.O.; data curation, A.D.O.; writing—original draft preparation, A.D.O.; writing—review and editing, A.D.O.; visualisation, A.D.O.; supervision, M.L.; project administration, M.L.; funding acquisition, M.L. All authors have read and agreed to the published version of the manuscript.

Funding

The project leading to this research received funding from the European Union H2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 860956.

Data Availability Statement

No new data were created or analysed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

6-DoF	Six-degrees-of-freedom
CG	Centre of gravity
CP	Centre of pressure
D&L	Descent and landing
G&C	Guidance and control
GNC	Guidance, navigation, and control
LPV	Linear Parameter-Varying
LTV	Linear Time-Varying
MIMO	Multiple-Input Multiple-Output
PID	Proportional–Integral–Derivative
RCS	Reaction Control System
RLV	Reusable Launch Vehicle
SISO	Single-Input Single-Output
SOCP	Second-Order Cone Programming
STC	State-Triggered Constraint
TVC	Thrust Vector Control
VTVL	Vertical Take-Off Vertical Landing

References

1. Howell, E. SpaceX: Facts about Elon Musk’s Private Spaceflight Company. 2022. Available online: https://www.space.com/18853-spacex.html (accessed on 23 May 2022).
2. Blue Origin. New Glenn: Our Next (Really) Big Step—An Orbital Reusable Launch Vehicle That Will Build the Road to Space. 2019. Available online: https://www.blueorigin.com/new-glenn (accessed on 23 May 2022).
3. Scharf, D.P.; Açıkmeşe, B.; Dueri, D.; Benito, J.; Casoliva, J. Implementation and Experimental Demonstration of Onboard Powered-Descent Guidance. J. Guid. Control Dyn. 2017, 40, 213–229.
4. Szmuk, M.; Reynolds, T.P.; Açıkmeşe, B. Successive Convexification for Real-Time Six-Degree-of-Freedom Powered Descent Guidance with State-Triggered Constraints. J. Guid. Control Dyn. 2020, 43, 1399–1413.
5. Sagliano, M. Pseudospectral Convex Optimization for Powered Descent and Landing. J. Guid. Control Dyn. 2018, 41, 320–334.
6. Huang, J.; Zeng, Y. An hp-Legendre Pseudospectral Convex Method for 6-Degree-of-Freedom Powered Landing Problem. Aerospace 2023, 10, 849.
7. Liu, X. Fuel-Optimal Rocket Landing with Aerodynamic Controls. J. Guid. Control Dyn. 2019, 42, 65–77.
8. Sagliano, M.; Heidecker, A.; Hernández, J.M.; Farì, S.; Schlotterer, M.; Woicke, S.; Seelbinder, D.; Dumont, E. Onboard Guidance for Reusable Rockets: Aerodynamic Descent and Powered Landing. In Proceedings of the AIAA Scitech 2021 Forum, Virtual Event, 11–15 and 19–21 January 2021; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2021.
9. Simplício, P.; Marcos, A.; Bennani, S. Guidance of Reusable Launchers: Improving Descent and Landing Performance. J. Guid. Control Dyn. 2019, 42, 2206–2219.
10. Mooij, E. Linear Quadratic Regulator Design for an Unpowered, Winged Re-Entry Vehicle; Number 03 in 08 Astrodynamics and Satellite Systems; Delft University Press: Delft, The Netherlands, 1998.
11. Navarro-Tapia, D.; Marcos, A.; Bennani, S.; Roux, C. Structured H-infinity and Linear Parameter Varying Control Design for the VEGA Launch Vehicle. In Proceedings of the 7th European Conference for Aeronautics and Space Sciences, Milan, Italy, 3–6 July 2017.
12. Sagliano, M.; Tsukamoto, T.; Heidecker, A.; Maces Hernandez, J.A.; Farì, S.; Schlotterer, M.; Woicke, S.; Seelbinder, D.; Ishimoto, S.; Dumont, E. Robust Control for Reusable Rockets via Structured H-infinity Synthesis. In Proceedings of the 11th International ESA Conference on Guidance, Navigation & Control Systems, Virtual Event, 22–25 June 2021.
13. De Oliveira, A.; Lavagna, M. Reusable Launch Vehicles Re-entry: Preliminary Architecture towards Optimal Guidance and Robust Control. In Proceedings of the XXVI International Congress of the Italian Association of Aeronautics and Astronautics (AIDAA), Virtual Event. Pisa, Italy, 31 August–3 September 2021.
14. MATLAB Aerospace Toolbox User’s Guide; MathWorks: Natick, MA, USA, 2017.
15. Committee on Extension to the Standard Atmosphere. U.S. Standard Atmosphere 1976; Technical Memorandum NASA-TM-X-74335; NASA: Washington, DC, USA, 1976.
16. Simplício, P.; Marcos, A.; Bennani, S. Reusable Launchers: Development of a Coupled Flight Mechanics, Guidance, and Control Benchmark. J. Spacecr. Rockets 2020, 57, 74–89.
17. Gentry, A.E.; Smyth, D.N.; Oliver, W.R. The Mark IV Supersonic-Hypersonic Arbitrary-Body Program, Volume I, User’s Manual; Technical Report AFFDL-TR-73-159; USAF Flight Dynamics Laboratory: Dayton, OH, USA, 1973.
18. De Oliveira, A.; Lavagna, M. Assessment of Reusable Launch Vehicles Re-entry Dynamics Control Effectiveness with Enhanced Aerodynamics Modelling. In Proceedings of the 73rd International Astronautical Congress (IAC), Paris, France, 18–22 September 2022.
19. Gentry, A.E.; Smyth, D.N.; Oliver, W.R. The Mark IV Supersonic-Hypersonic Arbitrary-Body Program, Volume II, Program Formulation; Technical Report AFFDL-TR-73-159; USAF Flight Dynamics Laboratory: Dayton, OH, USA, 1973.
20. Ecker, T.; Karl, S.; Dumont, E.; Stappert, S.; Krause, D. A Numerical Study on the Thermal Loads during a Supersonic Rocket Retro-propulsion Maneuver. In Proceedings of the 53rd AIAA/SAE/ASEE Joint Propulsion Conference, Atlanta, GA, USA, 10–12 July 2017; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2017.
21. Sagliano, M.; Seelbinder, D.; Theil, S.; Im, S.; Lee, J.; Lee, K. Booster Dispersion Area Management through Aerodynamic Guidance and Control. In Proceedings of the AIAA SCITECH 2022 Forum, San Diego, CA, USA, 3–7 January 2022; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2022.
22. De Oliveira, A.; Lavagna, M. Reusable Launchers Re-entry Controlled Dynamics Simulator. In Proceedings of the 9th European Conference for Aeronautics and Aerospace Sciences, Lille, France, 27 June–1 July 2022.
23. Anderson, J. Fundamentals of Aerodynamics, 6th ed.; McGraw-Hill Education: New York, NY, USA, 2017.
24. Nelson, R.C. Flight Stability and Automatic Control; McGraw-Hill Education: New York, NY, USA, 1989.
25. Açıkmeşe, B.; Ploen, S.R. Convex Programming Approach to Powered Descent Guidance for Mars Landing. J. Guid. Control Dyn. 2007, 30, 1353–1366.
26. Guadagnini, J.; Lavagna, M.; Rosa, P. Model predictive control for reusable space launcher guidance improvement. Acta Astronaut. 2022, 193, 767–778.
27. Grant, M.; Boyd, S. CVX: MATLAB Software for Disciplined Convex Programming, Version 2.1. 2014. Available online: http://cvxr.com/cvx (accessed on 16 October 2023).
28. Domahidi, A.; Chu, E.; Boyd, S. ECOS: An SOCP solver for embedded systems. In Proceedings of the 2013 European Control Conference (ECC), Zurich, Switzerland, 17–19 July 2013.
29. Yang, R.; Liu, X. Comparison of Convex Optimization-Based Approaches to Solve Nonconvex Optimal Control Problems. In Proceedings of the AIAA Scitech 2019 Forum, San Diego, CA, USA, 7–11 January 2019; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2019.
30. Yang, R.; Liu, X. Fuel-optimal powered descent guidance with free final-time and path constraints. Acta Astronaut. 2020, 172, 70–81.
31. De Oliveira, A.; Lavagna, M. Robust Control Design via Structured H-infinity for the Atmospheric Re-entry of Reusable Launchers. In Proceedings of the 12th International ESA Conference on Guidance, Navigation and Control Systems, Sopot, Poland, 12–16 June 2023.
32. Roux, C.; Cruciani, I. Scheduling Schemes and Control Law Robustness in Atmospheric Flight of VEGA. In Proceedings of the 7th International ESA Conference on Guidance, Navigation and Control Systems, Tralee, County Kerry, Ireland, 2–5 June 2008.
33. Sagliano, M.; Hernández, J.A.M.; Fari, S.; Heidecker, A.; Schlotterer, M.; Woicke, S.; Seelbinder, D.; Krummen, S.; Dumont, E. Unified-Loop Structured H-Infinity Control for Aerodynamic Steering of Reusable Rockets. J. Guid. Control Dyn. 2023, 46, 815–837.
34. Iannelli, A.; Gkouletsos, D.; Smith, R.S. Robust Control Design for Flexible Guidance of the Aerodynamic Descent of Reusable Launchers. In Proceedings of the AIAA SCITECH 2023 Forum, National Harbor, MD, USA, 23–27 January 2023; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2023.
35. De Oliveira, A.; Lavagna, M. Advanced Guidance Design via Successive Convex Optimization for the 6-DoF Atmospheric Re-entry of Reusable Launchers. In Proceedings of the 2023 AAS/AIAA Astrodynamics Specialist Conference, Big Sky, MT, USA, 13–17 August 2023.
36. Simplício, P.; Bennani, S.; Marcos, A.; Roux, C.; Lefort, X. Structured Singular-Value Analysis of the VEGA Launcher in Atmospheric Flight. J. Guid. Control Dyn. 2016, 39, 1342–1355.

Figure 1. 6-DoF RLV re-entry controlled dynamics simulator description.

Figure 2. Reference frames.

Figure 3. Aerodynamic coefficient database. Note that the values of $x_{CP}$ are found to be independent of the Mach number M.

Figure 4. Fin model.

Figure 5. “D&L Guidance” block description.

Figure 6. Nonconvex optimisation problem.

Figure 7. SOCP problem.

Table 1. Position of the fins’ CP with respect to the base of the RLV and corresponding deflections.

	Fin CP Position $x_{fin, i}$	Fin Deflection $β_{fin, i} (t)$
Fin 1	$[x_{fin} \;\; yz_{fin} \;\; 0]^{T}$	$β_{fin, 1}(t)$
Fin 2	$[x_{fin} \;\; -yz_{fin} \;\; 0]^{T}$	$β_{fin, 2}(t)$
Fin 3	$[x_{fin} \;\; 0 \;\; yz_{fin}]^{T}$	$β_{fin, 3}(t)$
Fin 4	$[x_{fin} \;\; 0 \;\; -yz_{fin}]^{T}$	$β_{fin, 4}(t)$

Table 2. Planar fins’ model parameters.

Parameter	Value	Unit
$x_{fin}$	11.1	m
$yz_{fin}$	2.5	m
$b_{fin}$	1.2	m
$c_{fin}$	0.8	m
$S_{fin}$	0.96	m$^{2}$
$AR_{fin}$	1.5	-

Table 3. SOCP optimisation problem parameters.

Parameter	Value	Units	Parameter	Value	Units
$ω_{Δ}$	1	-	$T_{max}$	600	kN
$ω_{ν}$	1000	-	$T_{min}$	0	kN
$ω_{σ}$	0.75	-	$ω_{max}$	28.6	deg/s
$i_{max}$	10	-	$θ_{max}$	75	deg
$Δ_{tol}$	0.001	-	$γ_{gs}$	10	deg
$K$	100	-	$δ_{max}$	10	deg
$t_{f}^{0}$	120	s	$α_{max}$	5	deg
$m_{dry}$	2750	kg	$Q_{max}$	$4 \times 10^{4}$	Pa

Table 4. Initial and final conditions.

Parameter	Value	Parameter	Value
$r_{I}[0]$	$[25 \;\; 0 \;\; -15]^{T}$ km	$r_{I}[K]$	$[0 \;\; 0 \;\; 0]^{T}$ m
$v_{I}[0]$	$[-850 \;\; 0 \;\; 950]^{T}$ m/s	$v_{I}[K]$	$[-5 \;\; 0 \;\; 0]^{T}$ m/s
$ω_{B}[0]$	$[0 \;\; 0 \;\; 0]^{T}$ rad/s	$ω_{B}[K]$	$[0 \;\; 0 \;\; 0]^{T}$ rad/s
$m[0]$	14,000 kg	$q_{B}^{I}[K]$	$[0 \;\; 0 \;\; 0 \;\; 1]^{T}$

Table 5. Performance results for the different actuation configurations.

	TVC-Only	Fins-Only	TVC and Fins
Final mass	2775 kg	2761 kg	2767 kg
Final downrange	77 m	354 m	84 m
Final velocity	4.96 m/s	4.86 m/s	6.53 m/s

Table 6. Performance results for three wind cases.

	Case 1	Case 2	Case 3
Final mass	2751 kg	2764 kg	2758 kg
Final downrange	260 m	133 m	201 m
Final velocity	7.90 m/s	8.46 m/s	8.07 m/s

Table 7. Perturbations considered for the Monte Carlo analysis.

Perturbation	Variable	Distribution	Value
Initial lateral velocity	$v_{z}[0]$	Normal	$σ = 20$ m/s
Initial mass	$m[0]$	Uniform	2%
Moments of inertia	$J_{A}(t), J_{N}(t)$	Uniform	2%
Reference thrust	$T_{ref}(t)$	Uniform	10%
Atmospheric density	$ρ(t)$	Uniform	20%
Ambient pressure	$P_{amb}(t)$	Uniform	10%
Drag coefficient	$C_{D}(t)$	Uniform	20%
Lift coefficient	$C_{L}(t)$	Uniform	20%
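
The dispersions of Table 7 could be drawn as in the sketch below for a 100-run campaign. Two interpretations are assumed here and are not stated in the table: each uniform percentage is treated as a symmetric multiplicative band around the nominal value (for example, 2% gives a factor in [0.98, 1.02]), and the normal dispersion is treated as a zero-mean offset added to the nominal initial lateral velocity. All names are hypothetical.

```python
import random

# Uniform dispersions from Table 7, read as +/- fractions of the nominal
# value (assumed interpretation).
UNIFORM_DISPERSIONS = {
    "initial_mass": 0.02,
    "moments_of_inertia": 0.02,
    "reference_thrust": 0.10,
    "atmospheric_density": 0.20,
    "ambient_pressure": 0.10,
    "drag_coefficient": 0.20,
    "lift_coefficient": 0.20,
}
LATERAL_VELOCITY_SIGMA_MPS = 20.0  # sigma of the normal dispersion


def sample_perturbations(rng: random.Random) -> dict[str, float]:
    """Draw one set of scale factors plus a lateral velocity offset."""
    sample = {
        name: rng.uniform(1.0 - fraction, 1.0 + fraction)
        for name, fraction in UNIFORM_DISPERSIONS.items()
    }
    # Zero-mean offset on the nominal initial lateral velocity (assumed).
    sample["lateral_velocity_offset_mps"] = rng.gauss(
        0.0, LATERAL_VELOCITY_SIGMA_MPS
    )
    return sample


def sample_campaign(n_runs: int = 100, seed: int = 0) -> list[dict[str, float]]:
    """Draw a reproducible set of perturbations, one dictionary per run."""
    rng = random.Random(seed)
    return [sample_perturbations(rng) for _ in range(n_runs)]


campaign = sample_campaign()
```

Seeding the generator makes a campaign repeatable, which helps when comparing control configurations on the same set of dispersed cases.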

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
