1. Introduction
Developing reusable launchers has become one of the key aspects of the space race for any country seeking independent and sustainable access to space. In December 2015, the US private company SpaceX showed the technical feasibility of launcher reusability by landing its
Falcon 9 first stage after having delivered a payload into orbit [
1]. Two years later, the same company demonstrated the cost effectiveness of such a technology by reusing a recovered first stage for another mission [
2]. SpaceX is today one of the major space companies and is currently developing its
Super Heavy rocket equipped with the
Starship spacecraft with the objective of carrying both crew and cargo on long-duration interplanetary flights and to allow humanity to return to the Moon and travel to Mars and beyond. In June 2024,
Starship 29 (S29) and its
Super Heavy Booster (B11) marked the first integrated test flight, where both stages successfully re-entered and performed a powered vertical landing over the ocean surface. Meanwhile, Rocket Lab’s
Electron microlauncher is well integrated in the market, and the company is currently developing its medium-lift rocket
Neutron [
3]. Blue Origin is also focusing on advanced reusable launchers such as
New Shepard, a suborbital launch vehicle designed for space tourism, and
New Glenn, a heavy-lift reusable rocket that should be able to carry heavy payloads to Earth orbit and beyond [
4]. Outside of the United States and in what can be described as the new space race, China [
5] and India [
6] are actively working on their own reusable launchers. In Europe, various programmes are underway, such as
Themis from the European Space Agency (ESA), National Center for Space Studies (CNES), and ArianeGroup, which is a first-stage rocket demonstrator aimed at paving the way for the future European reusable launcher,
Ariane Next [
7]. Additionally, the collaborative project CALLISTO, involving the German Aerospace Center (DLR), Japan Aerospace Exploration Agency (JAXA), and CNES, is focused on the development of a European/Japanese reusable rocket demonstrator [
8].
Following similar guidelines, the European project Advancing Space Access Capabilities—Reusability and Multiple Satellite Injection (ASCenSIon) has been initiated as an innovative training network involving multiple public institutions and companies across Europe in order to study the critical technologies for the development of a Reusable Launch Vehicle (RLV) capable of injecting multiple payloads into multiple orbits. More particularly, one of the research fields involved in this project aims to study the design of the Guidance and Control (G&C) system for the vehicle descent and precise landing on Earth, essential for reusability. In fact, this flight phase is very challenging as it depends on multiple parameters, which are further complicated by the dense terrestrial atmosphere [
9]. During descent, the vehicle is subjected to fast, dynamics changes, partly induced by external loads such as lift, drag, and wind but also by the actuation commands to fulfil the landing constraints satisfaction and vehicle integrity preservation. Recovering a first-stage launcher was made possible in the last decade by the development of advanced and robust computational methods able to generate in real time the reference trajectory to be followed according to the actual flight conditions, and then to command the optimal vehicle’s actuator deflections to steer the vehicle to the landing site. Despite the success of the aforementioned commercial space companies, some standing problems, such as the aerodynamic and powered descent of the launcher, require further understanding.
One of the critical aspects of G&C design for the descent and precise landing of reusable rockets is the development of a robust control strategy capable of counteracting disturbances and uncertainties while satisfying the strict accuracy requirements associated with pinpoint landing. This synthesis is further complicated by the need for a real-time guidance algorithm to update onboard the optimal trajectory to be followed by the vehicle, which therefore requires that the controller be capable of tracking multiple types of possible references. In fact, thanks to the increase in computational power available onboard, recent progress has shown that convex optimization is among the key technologies to enable autonomous and onboard real-time trajectory planning, and therefore pinpoint landing. More particularly, advanced methods such as successive convex optimization [
10] and pseudospectral convex optimization [
11] enable the definition of a fuel-optimal trajectory problem in which nonlinearities (e.g., aerodynamics) or non-convex constraints can be integrated [
8,
12,
13] and that can be solved in polynomial times with efficient solvers. When implemented in a closed-loop fashion, this advanced guidance design enables the correction of potential trajectory tracking discrepancies caused by uncertainties within the algorithm’s embedded models or external disturbances.
As demonstrated by the current state-of-the-art in launcher control design [
14,
15], classical linear control theory represents a rich heritage with many applications. This choice was motivated by its relative ease of implementation and the possibility of using gain-scheduling techniques to adapt to nonlinear systems. Nevertheless, these methods are well-adapted to the control system design of Single-Input Single-Output (SISO) systems, such as, for example, a reusable rocket using a Thrust Vector Control (TVC) system as the unique actuator. The implementation of Multiple-Input Multiple-Output (MIMO) control systems then becomes complex since every channel is addressed in a single-loop fashion. However, this capability is required for the future generation of reusable rockets, which also commonly use fin-based aerodynamic steering in addition to the TVC system to enhance control authority. Furthermore, model uncertainties are not accurately considered in the design process, developed only with nominal conditions and stability margin requirements. All these issues result in an extensive (in terms of both time and cost) verification and validation campaign with many iterations and Monte-Carlo analyses to assess the performance and robustness of the control system.
To overcome these drawbacks, the
family of methods, introduced a few years ago [
16], provides a powerful solution for robust control design. It relies on defining the control requirements in the frequency domain in terms of weighting functions and minimizing the maximum gain of the resulting weighted system from the exogenous inputs to the outputs to be controlled. The control–plant interaction is modelled through a Linear Fractional Transformation (LFT) that represents the feedback action. Furthermore, the structured
method [
17] allows one to directly impose a specific control structure—like a Proportional-Integral-Derivative (PID), enabling the reuse of gain-scheduling techniques—and to consider parametric uncertainties for enhanced robustness. This technology was studied in the United States for the
Ares-I programme, later for the
Space Launch System programme [
18], and in Europe for
Ariane 5 [
19] and the future generation of European launchers [
20]. In recent years, several studies have emerged regarding this method for the descent and landing phases of vehicles and have shown promising results. Although structured
was first studied mainly for the ascent phase of the
VEGA launcher recovering the baseline control structure [
21], some analyses were further performed in the descent phase [
22]. Furthermore, interesting cases for the aerodynamic descent of reusable rockets have been exploited within the framework of CALLISTO where, first, decoupled attitude and translational channels were considered for the design of robust controllers on each control axis [
23], before a unified control was considered for both position and attitude with a robustness analysis to account for uncertainties [
24]. Finally, a multi-plant control design approach with fully-coupled translational and attitude dynamics was studied as a solution to better consider the range of trajectories coming from the online guidance algorithm during flight [
25].
It is clear from the available literature that, while this technology has been largely explored for the ascent phase of conventional (i.e., non-reusable) launch vehicles, there is still room for exploiting its capabilities during the descent and precise landing phase of reusable rockets. Furthermore, among most of the aforementioned literature, only the aerodynamic descent phase using steerable planar fins was considered, and the powered-descent phase combining fins and TVC was treated only in Ref. [
22]. This paper investigates the problem of coupling advanced guidance and robust control systems for the descent and precise landing of RLVs, involving fin-based aerodynamic steering and the TVC system simultaneously in the control action. It exploits the room for improvement that comes from the reusable launcher controlled dynamics simulator, developed in previous work under the ASCenSIon project [
26]. This simulator precisely models the existing interactions between the environment with its potential disturbances and uncertainties, the corresponding nonlinear Six-Degree-of-Freedom (6-DoF) equations of motion, the actuators, and the G&C system. In that paper, the latter was proposed as the baseline for preliminary assessments, which involved a successive convex optimization algorithm maximizing the vehicle final mass and a control system for which the MIMO formulation induced by the coupling of TVC and steerable planar fins is simplified to a series of SISO systems to apply classical linear control theory. In this paper, from the G&C system of Ref. [
26], first the guidance algorithm is improved by using a cost function strategy that involves both minimization of time of flight and maximization of final mass. The latter still copes with mission requirements and can also be efficiently coupled with the other building blocks. Second, a new architecture is introduced for control synthesis through structured
. The latter is simultaneously tuning gain-scheduled PID controllers for both planar fins and the TVC system through feedback on attitude and velocity (with the main focus on attitude). The robustness of the control system obtained is assessed through linear analysis and nonlinear simulations with the aforementioned tool, under nominal and dispersed conditions. Therefore, the main contribution of this work is twofold: the design of a robust control strategy through
synthesis combining two actuators, TVC, and steerable planar fins, followed by the performance assessment of the integrated G&C design through a 6-DoF nonlinear simulator. To the best of the authors’ knowledge, this is the first time that such a control strategy is coupled with an advanced guidance algorithm for the entire descent and landing problem of a reusable rocket and assessed in a realistic scenario. Furthermore, this paper takes the perspective of integrated analysis rather than a compartmentalized design of guidance and control strategies. In fact, such strategies are emerging in the literature: Ref. [
27] combines an optimal guidance strategy with a linear quadratic regulator control approach for the rocket-powered descent phase, while Ref. [
28] proposes an integrated and adaptive G&C design via reinforcement meta-learning for air-to-air missiles.
The paper is organised as follows.
Section 2 introduces the modelling of the nonlinear controlled dynamics involved in reusable launcher descent and precise landing. More particularly,
Section 3 describes the guidance method embedded in a closed-loop fashion in the simulator. Then,
Section 4 formulates the robust control design via structured
, with weighting functions adequately chosen according to the available requirements. The synthesized controllers are assessed through classical linear analysis, and robustness stability and performance are assessed via structured singular value
-analysis. Subsequently, in
Section 5, the synthesized controllers are embedded in the controlled dynamics simulator, therefore coupled with the guidance system, and assessed through nonlinear analysis under both nominal and dispersed conditions. Finally, conclusions regarding the work performed are provided in
Section 6.
2. Reusable Launcher Controlled Dynamics Simulator
This section describes the nonlinear 6-DoF descent dynamics of a Vertical Take-off Vertical Landing (VTVL) vehicle first-stage booster modelled through rigid-body motion with varying mass, subjected to external forces induced by the terrestrial atmosphere and controlled through embedded closed-loop guidance and control strategies. This paper relies on the controlled dynamics simulator developed by the authors and adapted from Ref. [
29] to study the efficiency of aerodynamic steering and conventional G&C techniques [
26]. The latter is illustrated in
Figure 1, showcasing the interactions between all building blocks, from the G&C systems to the actuators, vehicle dynamics, and environment.
The latter includes standard G&C algorithms, where a thrust vector is commanded by the guidance subsystem and then converted to the reference pitch angle,
, and yaw,
, rates
and
, lateral body velocities
and
, and thrust magnitude
. The control subsystem then generates the necessary commands to correct the deviations between the reference and actual attitude angles in terms of TVC gimbal deflections,
, and fin deflections,
. The guidance subsystem is based on a successive convex optimization algorithm. The reference trajectory generated is updated with a user-specified frequency,
, at which the guidance subsystem is re-executed. Concerning the control subsystem, it relies on the use of feedback control through gain-scheduled PID controllers synthesized via structured
synthesis, decoupling pitch and yaw axes based on the assumption of low roll rate. These two subsystems will be better defined in
Section 3 and
Section 4, respectively.
The equations of motion are written in the landing site-centred inertial and the vehicle’s body-fixed reference frames using the following initial state vector,
, and, for the sake of simplicity, based on the assumptions that the vehicle is a rigid body with no effect induced by the varying mass (e.g., propellant sloshing) and structural flexibilities. The mass depletion dynamics are modelled by an affine function of the thrust magnitude as follows:
where
is the vacuum specific impulse of the engine, assumed to be constant for simplicity, and
is the nozzle exit area of the engine.
is the thrust vector coming from the TVC system, represented in the inertial reference frame. The second term is related to the reduction of the specific impulse due to the atmospheric back-pressure [
10].
The translational states, position, and velocity of the vehicle in the inertial reference frame,
and
, are governed by the following dynamics:
where
described the aerodynamic force acting on the vehicle in the inertial reference frame,
represents the control force generated by the planar fins, and
is the gravitational field defined in the inertial frame.
Then, the attitude states are governed by the following rotational dynamics, using the following quaternion-based kinematics equation:
where
is the inertia matrix of the vehicle and
,
, and
represent the aerodynamic and control torques acting on the vehicle. In Equation (
3), the coupling between angular velocity and inertia along the three axes, and the effect of centroid movement on the inertia caused by mass consumption, are ignored.
For the computation of the aerodynamic forces and moments generated by the vehicle, it is necessary to define an additional reference frame, the so-called velocity reference frame. The latter is fixed to the vehicle’s Center of Gravity (CG), with its
x-axis directed along the wind-relative vector,
, so that the transformation from the body-fixed to the velocity reference frame can be represented by two aerodynamic angles: the angle of attack,
, and the sideslip angle,
[
30]. Then, the aerodynamic characteristics depend on the vehicle’s external shape with its reference area and on the instantaneous dynamic pressure, which is defined as follows:
where
accounts for the vehicle’s inertial velocity and wind gusts, and the atmospheric density
is generated using the Committee on Extension to the Standard Atmosphere model [
31] (like the ambient pressure,
, and the speed of sound,
). The aerodynamic drag and lift coefficients, as well as the Center of Pressure (CP) position, are estimated from available look-up tables as a function of the effective angle of attack,
, and the Mach number,
[
32].
3. Guidance Approach
This section describes the Descent and Landing (D&L) guidance strategy adopted in the previously defined controlled dynamics simulator, which is responsible for the real-time generation of a fuel-optimal reference trajectory with thrust, attitude, and velocity commands to be sent to the robust control subsystem. As mentioned in
Section 1, the literature of guidance methods for the reusable launcher descent and landing phase is now focusing on advanced computational techniques, among them successive convex optimization or pseudospectral convex optimization, therefore giving up to the traditional guidance schemes commonly implemented for the ascent phase and for which path constraints could not be enforced. In this paper, the direct method, successive convex optimization, is employed and analysed before being implemented in the controlled dynamics simulator for closed-loop integration assessment.
Convex optimization guidance consists of transforming the fuel-optimal trajectory problem into a convex one, more particularly into a Second-Order Cone Programming (SOCP) problem, which can be solved with efficient solvers in polynomial time. The recent increase in the computational power available onboard made the real-time implementation of these algorithms possible. The challenging task relies on converting the non-convex state and control constraints into convex forms [
33]. Then, successive convex optimization can be applied to approximate the remaining nonlinearities in the optimal landing problem, such as the aerodynamic effects previously ignored. It consists of iteratively solving SOCP convex optimization subproblems in which the non-convex dynamics and constraints are repeatedly linearised using information coming from the previous iteration solution. This algorithm was first developed in Ref. [
34] and then adapted in different ways [
12,
13]. In this study, the strategy defined in Ref. [
10] is leveraged to be applicable in a closed-loop fashion. The algorithm is thoroughly detailed in Ref. [
26] from the same authors and summarized hereafter. First, the guidance strategy is detailed in
Section 3.1, and the corresponding SOCP problem is described in
Section 3.2.
3.1. Successive Convex Optimization Strategy
The successive convex optimization guidance algorithm was implemented in MATLAB (R2021b) using the CVX library [
35] to formulate the convex problem and the ECOS routine [
36] to solve them. In each simulation instance defined by the simulation rate,
Hz, the reference thrust profile,
, the reference attitude angles,
, rates,
, and the reference body velocities,
, are calculated from the most recent guidance solution by linear interpolation. In fact, as mentioned above, this solution is stored as an online lookup table, which is updated at each guidance step, with the guidance update frequency
Hz, that is, every 10 s. Note that, to avoid adding complexity, the steerable planar fins are not considered as control inputs in the guidance problem. The guidance algorithm inside the “D&L Guidance” building block of the simulator is schematized in
Figure 2.
First, the state and control vectors, as well as the final time,
, which is also an optimization variable in this problem, are initialized. A linear interpolation of the discrete state variables under the initial and final conditions is used for the initial state vector, while the initial control vector is taken as matching the gravitational force [
26]. Once the initial guess is defined, we enter the successive convex optimization loop, which consists of solving the SOCP problem several times until reaching the user-defined maximum iterations number,
, or the tolerance relative to the trust regions radius,
. Then, to enable the formulation of the SOCP subproblems, the optimal control problem must be converted into a finite-dimensional parameter optimization problem. Therefore, the trajectory and optimization variables are discretized into
K uniformly spaced points, ranging from the current instant of time,
, to the final time,
. At each guidance step, the time vector is divided in that way:
and because the estimated time of flight
as
, where
is the actual time of flight achieved by the simulation, the accuracy of the discretization becomes more precise towards the end. More particularly, the sampling time is given by
. The linearization and discretization methods are explained in the next subsection, together with the definition of the SOCP problem.
When the optimization algorithm converges to an optimal solution, this reference trajectory is saved to be used for the next iteration, or, if the exit criterion of the successive convex optimization routine is met, is transferred to the online look-up table from where the actual reference parameters corresponding to the simulation instance can be generated. In this study, this involves the reference thrust magnitude profile, , the reference pitch and yaw angle profiles, respectively and , rates, respectively and , and the reference body velocities, and .
3.2. SOCP Problem
The SOCP optimization problem obtained in Ref. [
26], which is solved iteratively in the successive convex optimization algorithm, is summarized in
Figure 3.
The 6-DoF descent dynamics of a powered-only, first-stage booster are linearized and discretized about the solution of the previous iteration through a first-order Taylor expansion approximation and using a zero-order-hold interpolation scheme. Note that only the thrust force is considered as the control input, so that the contribution of the fins is ignored through the guidance logic. Before this process, the time of flight is normalized from
to
, where
is the normalized time of flight, to obtain a fixed-final-time optimization problem. Summarizing the nonlinear dynamics as
, with
as the state vector and
as the control vector, they can be rewritten as follows:
Therefore, having
, the normalized nonlinear dynamics are expressed by:
where
since
. Furthermore, for the sake of simplicity, aerodynamics are modeled as if the vehicle were subjected to a pure drag force. Assuming that the rocket is axisymmetric, the aerodynamic forces and moments in the vehicle’s body-fixed reference frame are expressed by:
where
is the actual drag coefficient, which is estimated from the available lookup tables. Note that, within the guidance, the velocity vector,
, does not account for wind.
Several state and control constraints are enforced in the optimization problem. Among the state constraints, we consider the lower bound of the mass, the so-called glide-slope constraint, the tilt angle constraint, the higher bound of the angular rate, and the preservation of the unit norm of the quaternions. Note that, in the latter case, the linearization was used for simplicity, but more advanced strategies can be implemented. For example, Ref. [
37] describes the augmented convex–concave decomposition for this particular case. Finally, control constraints involve bounding the thrust direction and magnitude (higher and lower bounds). All the constraints in
Figure 3 are thoroughly described in Ref. [
26].
Due to the linearization process involved in the successive convex optimization strategy, trust regions and virtual controls are implemented to prevent unboundedness and artificial infeasibility, respectively. More particularly, trust regions limit the deviation between the two consecutive iterations responsible for artificial unboundedness. They are defined for the state and control vectors, as well as for the time of flight, and are penalized in the cost function. Virtual controls are additional control inputs that allow for reaching each point of the solution domain through dynamics relaxation and therefore avoid artificial infeasibility. Therefore, they are added in the linear discrete dynamics and then penalized in the cost function. The reader is referred to Ref. [
26] for a better understanding of trust regions and virtual control implementation and their convexification through slack variables definition.
Finally, the cost function is defined as follows:
where
,
, and
are penalization weights, respectively penalizing the virtual controls, the trust region of the state and control vectors, and the trust region of the time of flight. Furthermore,
and
are slack variables defined to avoid a quadratic term in the cost function. Whereas in Ref. [
26] only the final mass (
) was considered to be maximized, this study also includes the minimum-time strategy through the parameter
. In fact, due to the monotonic behavior of the propellant consumption with respect to time in this study and since the final time is also an optimization variable, the latter can also be selected as the value to be minimized. This choice leads to better performance when integrated in the closed-loop G&C simulator, as demonstrated in
Section 5.
4. Structured Control Synthesis
In this section, gain-scheduled structured
controllers are designed for the aerodynamic and powered descent phase of an RLV along a reference trajectory computed offline (corresponding to the first run of the guidance algorithm studied above) using the state-space representation described below. In fact, this control technique has been demonstrated as a successful candidate in space applications to cope with the closed-loop requirements needed to enable robustness and performance [
21,
24,
25]. In Ref. [
38] from the same authors, gain-scheduled structured
controllers were already designed with the objective to recover a more basic control architecture. Even if the controllers obtained validated the control approach, more stability and robustness against uncertainties were needed. In this direction, a re-tuning of the controller gains is achieved in this study, also involving attitude rate and trajectory tracking (only Euler angle tracking previously). Tighter stability margins are considered, and a robustness analysis via structured singular value
-analysis, which directly considers uncertainties in the linear RLV model, is included. More particularly, the requirements include closed-loop stability, attitude and trajectory tracking, and actuation limitation (
,
) taken from Ref. [
29], as well as stability margins (the gain margin must be superior to 6 dB and the phase margin must be superior to 30 deg). Note that the maximum actuator deflection considered in this paper is high (especially for the fins), and was selected for this analysis for simplicity, but at later design stages more realistic values should be integrated.
4.1. State-Space Representation of the RLV Descent Dynamics
Before proceeding with the linearization of the equations of motion, several assumptions must be made. First, the launch vehicle is considered axisymmetric with a negligible roll rate, therefore allowing for the decoupling of the motion in the pitch and yaw planes. Wind disturbances are not considered at this stage. Additionally, for the sake of simplicity, the effects of actuators, sensors, bending, and sloshing dynamics are ignored. Note that wind and all these features could be added in future work following the scheme available in the literature [
24,
39]. Concerning the neglect of bending modes, the reader is referred to Ref. [
40], where relevant analyses have been carried out to ensure that their frequency is sufficiently larger than the closed-loop bandwidth of the controller. In this section, only the pitch dynamics will be defined for conciseness, but the expressions obtained for the yaw dynamics are available in
Appendix A.3.
The pitch dynamics are described with the following state-space realization obtained by the linearization of the perturbed equations of motion translated in the vehicle’s body fixed reference frame. Details of the linearization process are provided in
Appendix A. Therefore, linearized perturbations are represented by the linear time invariant model
, which is defined as:
where
,
, and
represent pitch angle perturbations and first/second-order derivatives and
,
, and
are the lateral drift and derivatives. The matrices
and
are defined by:
Before synthesizing the control system, the assumption stating that the 6-DoF dynamics can be decoupled between the pitch and yaw planes is verified by eigenvalue analysis. More particularly, the poles of the 6-DoF decomposition, whose state-space representation is available in
Appendix A.1 through Equation (A14), are computed and compared with the poles of
, as well as its equivalent in the yaw plane (see Equations (
A20)–(
A22)).
Figure 4 displays the poles of the decoupled systems (superposition of the poles of both systems, translating pitch and yaw dynamics) and the 6-DoF system for which the couplings have not been neglected.
From this figure, we can observe that the eigenvalue distribution with respect to time is highly similar. More particularly, in
Table 1 are reported the values of the poles for
. As expected, the structure of the 6-DoF system is similar to the ones of the reduced model, and also the discrepancy between the yaw and pitch models is small due to the axial symmetry of the rocket.
4.2. Structured Control Problem Formulation
The system constituted by the RLV linear dynamics model developed previously is subjected to significant changes during the descent flight, mainly due to the variations associated with thrust and aerodynamics. In fact,
Figure 5 shows as an example the Bode plot of the system
, where the linear dynamics are discretized according to the altitude with
points. Therefore, it justifies the use of gain-scheduling to increase the performance and robustness of the control system.
The altitude has been chosen as the scheduling parameter since it is monotonically evolving with respect to time and has been well validated in the literature [
24,
25]. The 15 points were equally distributed with respect to the altitude vector, which allows us to capture the variations well in terms of thrust magnitude and dynamic pressure.
In the framework of structured
, the augmented plant must be defined. It is usually made up of the linear dynamics model of the system, the controller to be designed, other linear systems describing the effects of the actuators (TVC and fins) or disturbances such as wind, and a set of weights that include design specifications. Note that, in this study, for simplicity, the effects of sensors, actuators, and wind disturbances are not considered. A PID structure was used for the controllers designed in Ref. [
38], taking the pitch angle error,
, as input and giving the actuator deflections,
, as outputs. In this work, attitude rate and trajectory tracking are also included for enhanced stability, so controllers take pitch angle, rate, and lateral body velocity errors
as inputs and actuator deflections as outputs. Note that here the synthesis is depicted for the pitch dynamics, but the same methodology is followed for the yaw dynamics.
Figure 6 shows the augmented plant named
.
The exogenous inputs are the reference pitch angle,
, and lateral body velocity,
, scaled by the input weighting function
, which translates them into the signals
and
, respectively. The scaled pitch rate command,
, is obtained using a first-order derivative filter with time constant
s. The comparison of the scaled references {
and the outputs
generates the errors
entering the controller
. Then, the deflection angles
enter the RLV linear dynamics model. The mixed
sensitivity approach is employed for design tuning, where the output weighting functions
,
, and
shape the tracking performance, the disturbance rejection capability, and the control efforts, respectively. More particularly, the controller is defined by:
Therefore, the objective of the
optimal control problem is to find the gains
,
,
,
and
,
,
,
, which constitute a stabilising controller,
, such that the
-norm of the augmented system is minimized. The optimal problem is defined by:
where
,
, with
and
. The non-smooth, non-convex optimization problem of Equation (
13) is solved with
systune since it enables one to directly include stability margin requirements through the command
TuningGoal.Margins. Note that the chosen structural template and the actuation limitations result in
-norm values that slightly exceed 1, depending on the altitude range. This behaviour is typical in rocket applications [
24,
41]. However, the overall tuning strategy effectively prevents the actuation system from reaching saturation, while also satisfying the mission requirements related to tracking and disturbance rejection, as described in the next subsections.
The command
TuningGoal.WeightedGain is used to define the weighting functions. The input weighting function
, used for reference scaling, is determined based on the allowed maximum value for the input reference, here, the pitch angle and the lateral body velocity. For this study, we set:
The output weighting functions
,
, and
must complete the
mixed sensitivity synthesis problem. In fact, their inverse bound, respectively, the complementary sensitivity function,
T, the sensitivity function,
S, and the control sensitivity function,
. Usually,
is chosen as a high-pass filter, whereas
and
are chosen as low-pass filters [
21]. Therefore, the pitch angle error weighting function is defined as follows:
According to Refs. [
25,
41], the low-frequency attitude sensitivity is limited when controlling translational degrees of freedom; therefore, the parameter
, supposed to reduce the steady-state tracking error, is chosen to be larger. Then, the high-frequency
to keep small the maximum peak of the sensitivity function, which is critical to ensure good stability margins.
is the desired bandwidth, varying according to the altitude range studied from
to
. Then,
is selected as:
with
to keep the fundamental relationship between complementary sensitivity and sensitivity functions and reduce the parameters to be tuned, while
. The tracking bandwidth is set to
. Finally, the control input weighting function is here used to impose signal limitations in order to prevent actuator saturation. They are set using constant weighting functions as follows:
Note that the values of the control weighting functions have been adequately selected with the maximum actuator deflections to ensure that the actuators are not saturated during descent flight.
4.3. Linear Analysis of the Controllers
This analysis is necessary to validate the controllers obtained according to the requirements set previously. The latter are meant to be applicable to the nominal case for the classical linear analysis, while we will deal with a perturbed formulation through a robustness stability and performance analysis via
-analysis in
Section 4.4.
To study the robustness and performance of a system (plant + controller), it is usual to analyse the closed-loop transfer functions. Therefore, the frequency domain is analysed by looking at the sensitivity, complementary sensitivity, and control sensitivity functions. The corresponding bode plots are shown in
Figure 7 with respect to the
channel, since the focus is on attitude tracking.
The performance is quite good overall; the sensitivity function (
Figure 7a) does not show peaks at low frequency, which is a good indicator for stability margins. Note that the sensitivity functions for all controllers are well below the corresponding weighting functions,
, which validates the disturbance rejection requirement. The complementary sensitivity functions (
Figure 7b) are well below the weighting function,
, for all controllers, but there are some peaks that could represent a lack of stability. This will be checked in the next sections through robustness analysis and nonlinear simulations. Concerning the control sensitivity functions, shown in
Figure 7c for the TVC system and in
Figure 7d for the steerable planar fins, the requirements are also well met since the functions are below the weighting functions
and
, so saturation should be avoided. It is already possible to see that the last three altitude slots from orange to yellow show reduced performance.
Then, the time domain is also analysed by plotting the step response in
Figure 8, again from the
channel.
The dotted lines represent the steady-state values of the corresponding input–output pairs, which are desired to be 1 if associated with a command tracking or 0 in the case of disturbance. From this graph, we can observe good tracking capabilities without significant overshoot and with a relatively fast response in the to channel. It is possible to see that the response is slower at the beginning of the flight and faster towards the end. This is related to the selected value of for the weighting function that bounds the sensitivity function, . At the beginning of the flight, both the thrust magnitude and the dynamic pressure are high; therefore, high control capabilities are available. The stability margins are high, and an increased was seen to reduce them. Consequently, because in this analysis we want to focus on increased stability and robustness to uncertainties, a settling time of 40–60 s was considered acceptable. However, towards the end of the flight, the dynamic pressure is decreased, leading to lower control capabilities, but the vehicle mass is also reduced, which increases the risk of instabilities. In this case, to obtain satisfactory results with respect to the requirements set, it was necessary to reduce the stability margins and improve the settling time of the step response (making it shorter). This is particularly the case for the last three sets of controller gains, which were already noticed as being less stable previously. Disturbances are well rejected with respect to the attitude rate, . However, the input lateral body velocity, , was more difficult to control. This behaviour is observed because it was purposely decided not to tightly constrain the translational motion, since the focus was set on the attitude tracking. In the next sections, we will see the consequences of this choice on the vehicle’s stability.
Figure 9 explicitly shows the stability margins obtained, with the gain margins on top and the phase margins below. Note that the Nichols charts have also been generated, but it was decided to directly display the stability margins for better readability. When a star is shown instead of a dot, it means that the gain margin is infinite. For the steerable planar fins, the gain margins are always above the requirement of 6 dB, with a minimum gain margin of 17 dB. However, for some altitude ranges, the phase margin is below the requirement of 30 deg, with a minimum phase margin of 23 deg. Nevertheless, the stability margin requirements are more difficult to satisfy for the TVC system, where a minimum gain margin of 3 dB is reached at 1.5 km altitude and with several phase margins below the requirement. We obtained a minimum phase margin of 6 deg at 2.1 km altitude. In fact, the requirement for the maximum deflection angle of the planar fins,
, is not strict; therefore, no particular difficulty was observed to obtain the stability margins required. However, it was harder for the TVC system, first because the maximum allowed gimbal angle,
, is lower but also because of the definition of the actuation system itself, which is less stable due to its similarity to an inverted pendulum. Consequently, small stability margins are noticed at the beginning of the flight, which is more likely related to the high thrust magnitude at this stage. Moreover, this behaviour can also be seen at the end of the flight since the thrust magnitude becomes high again and the vehicle mass is significantly reduced. It is particularly important to mention how critical this phase is since the fins’ effectiveness is significantly reduced due to low dynamic pressure. In the controller gain tuning, it was not possible to improve these margins without significantly increasing the settling time of the step response of the closed-loop system. Therefore, we decided to conserve these gains and assess them through robustness analysis and nonlinear simulations in the following sections. Note that, in future works, the aerodynamic and powered descent phases should be separated with control systems that are properly designed for each phase in order to increase robustness.
4.4. Robust Stability and Performance Analysis
The classical stability margins that come from the control design were quantified in the previous section. However, nothing was said about the robust stability and performance of such a design in the presence of uncertainties. In fact, the LFTs can be exploited to inject several uncertainties into the design. Specifically, the uncertainties and perturbations considered and gathered in the uncertainty block
are summarized in
Table 2.
These perturbations are included by the command
ureal in MATLAB; they can be interpreted as multiplicative perturbations with a uniformly distributed multiplicative factor. Perturbations in forces and torques are considered through variations in mass, dynamic pressure, and moments of inertia, as well as the corresponding coefficients. Furthermore, uncertainties have also been induced through the positions CG and CP, again affecting the torques generated. Finally, errors in estimated speed and acceleration are also considered. Note that the controller gains could also be tuned, accounting for these possible uncertainties, as was performed in Ref. [
21]. However, in this study, the uncertainties are included through block
only to assess the robustness stability and performance of the previously synthesized gains [
24,
39].
With these uncertainties inserted in the nominal plant, the robustness of the designed system can now be assessed using the tools provided by the analysis of the structural singular value (
-analysis) by inserting these uncertainties into the nominal plant. More particularly, this is done by reshaping the system in the classical
format, with
embedding all the uncertainties defined in
Table 2 in a diagonal matrix and
representing the deterministic part of the augmented closed-loop system defined in
Figure 6.
Figure 10 describes the robust standard
interconnection augmented with the uncertainty blocks. Note that, in this configuration, the tunable controller
is removed from the generalised plant
, with
representing the controller output and
the controller input.
The structural singular value,
, is usually defined as follows [
42]:
where
represents the transfer function from the uncertainty channel
to
. Then, Robust Stability (RS) is ensured by the following criterion:
Furthermore, the structured singular value can also be used for Robust Performance (RP) analysis. In that context, the robust interconnection of
Figure 10, including the weighting functions and denoted as
, is closed using a fictitious full-complex perturbation matrix,
, which does not represent any actual perturbation of the system. The RP criterion is then defined as follows [
43]:
For both, instead of directly, we estimate their upper and lower bounds for the system under analysis. Therefore, when these bounds are close to each other, they uniquely identify the real value of the system. These analyses are achieved using the MATLAB tools robstab, for robust stability, and robgain, for robust performance.
First,
Figure 11 shows the results of the robust stability analysis. On the top in
Figure 11a, the upper bounds (in solid lines) and the lower bounds (in dashed lines) are represented for all linear design points. On the bottom in
Figure 11b, the sensitivities associated with each parameter of
Table 2 are highlighted for the linear design point at
, which corresponds to the worst-case point in terms of controllability. The
bounds for this specific case are repeated at the top to better facilitate the analysis.
From
Figure 11a, we can see that the bounds are well below 1, therefore ensuring a good robust stability against uncertainties. It can be observed that, at high altitudes, the controller gains generated larger bounds at a relatively high frequency (around 13 rad/s), whereas it is the opposite at lower altitudes. In the linear analysis of the previous section, it was noticed that the last three altitude slots from orange to yellow were more difficult to manage. The observed reduced performance is confirmed by looking at the
-bound peaks obtained in
Figure 11a at a frequency of 2.4 rad/s. However, these bounds remain largely below 1, so the robustness stability of the controllers is satisfied for the uncertainties considered. From the sensitivities in
Figure 11b, it can be noticed that the normal force gradient of the fin,
, and the dynamic pressure,
Q, are the main causes of degradation at
due to the large control authority that fins have compared to the TVC system. Moreover, the
-bound peak corresponds to a peak of the
sensitivities of the normal force gradient of the fin,
, and the mass,
m. In fact, at low altitude, the vehicle is light since most of the propellant has been consumed, and consequently any movement of the fins, even slight, significantly affects its behaviour.
Figure 12 on the bottom left shows the results of the robust performance analysis. Note that the lower bounds are omitted for ease of readability. From this figure, it is possible to notice that the robust performance criterion is not met since, for some frequencies, the
upper bound is bigger than 1. However, this result is still interesting to understand the key factors that lead to poorer performance. In these cases, we observe a peak above 1 at 30 rad/s for the linear design point at
as well as three peaks around 2–4 rad/s, corresponding to the linear design points at low altitude (from 2110 to 0 m). For the latter, they are more likely due to a lack of controllability, since the dynamic pressure and thrust power available are low (due to the small propellant mass remaining). Concerning the peak at 25 km altitude, it is more likely due to a lack of controllability of the TVC system at the starting point of the simulation.
To conclude this section, the robustness analysis and the performance of the gains from the synthesized rigid body controllers through
-analysis were satisfactory, as only lack of performance was observed, which is common for launcher control system design [
21,
24]. The latter is particularly noticed at low altitude and correlated with low stability margins. For the study carried out in this paper, the control system obtained is sufficient, but for future developments this lack of robustness should be corrected since the integration of delays and actuator models on the nonlinear simulator would significantly increase the risk of instability.
6. Conclusions
In Ref. [
26], a 6-DoF nonlinear controlled dynamics simulator for the aerodynamic powered descent and precise landing of reusable launchers was developed to assess G&C systems. As the baseline system for preliminary performance assessment, it involves a successive convex optimization algorithm, maximizing the vehicle final mass and a control system for which the MIMO formulation induced by the coupling of TVC and steerable planar fins was simplified to a series of SISO systems to apply classical linear control theory. The present paper is a follow up of that work, where the improvements are twofold: First, more robust gain-scheduled PID controllers are synthesized via the structured
method. The latter enables the proper combination of both actuators. Then, the assessment of the performance of the integrated G&C design enables the selection of a better cost function strategy.
In fact, the successive convex optimization algorithm implemented into the guidance system offers a modular architecture, enabling several cost function strategies to be evaluated. Among these, optimizing both the time of flight and the vehicle final mass increases the performance of the system. Furthermore, the structured method enables one to directly consider the MIMO formulation, which means that the actuation corrections through TVC and planar fin deflections are optimized simultaneously throughout the descent flight. The robust framework associated with the structured method allows one to directly consider control requirements when tuning the scheduled controllers. In addition, the control architecture for synthesis can be augmented with parametric uncertainty via LFT modelling. This asset enables a direct assessment of the robustness to uncertainties of the controllers obtained via -analysis without the need to use an extensive Monte-Carlo analysis campaign.
It is shown that such a G&C system, when assessed through closed-loop nonlinear simulations with the 6-DoF controlled dynamics simulator, provides significantly improved launcher performance and robustness to uncertainties and disturbances with respect to the baseline system developed in Ref. [
26]. More particularly, a propellant mass saving of 1816 kg is performed while still ensuring a precise and soft landing of the reusable launcher.
The overall results show that the proposed G&C system represents a relevant candidate realistic descent flight and precise landing phase for reusable launch vehicles.