Fault-Tolerant Controller Design for Reusable Launch Vehicle

Xu, Jian; Guo, Chenguang; Wang, Yuewen; Xiao, Yong; Hu, Xiaoxiang

doi:10.3390/act14110565

Open AccessArticle

Fault-Tolerant Controller Design for Reusable Launch Vehicle

by

Jian Xu

¹,

Chenguang Guo

^2,*,

Yuewen Wang

³,

Yong Xiao

^3,* and

Xiaoxiang Hu

³

¹

Xi’an Institute of Microelectronics Technology, Xi’an 710065, China

²

Beijing Microelectronics Technology Institute, Beijing 100076, China

³

School of Automation, Northwestern Polytechnical University, Xi’an 710072, China

^*

Authors to whom correspondence should be addressed.

Actuators 2025, 14(11), 565; https://doi.org/10.3390/act14110565

Submission received: 8 October 2025 / Revised: 15 November 2025 / Accepted: 17 November 2025 / Published: 19 November 2025

(This article belongs to the Section Aerospace Actuators)

Download

Browse Figures

Versions Notes

Abstract

A fault-tolerant controller design for reusable launch vehicles (RLVs) is discussed in this paper. The control precision of RLVs is very important, since it must be ensured that an RLV’s speed reaches zero while flying to the target point. More seriously, the rocket’s thrust system may suffer from faults, so the fault-tolerant control of RLVs is very important. The landing dynamic model of RLVs is very complex, and the thrust is coupled with time-varying states, which make the controller design of RLVs very difficult. Based on the specific control requirements of rocket landing, the control design problem is first transformed into a normal model in this paper. Then, considering potential thrust faults, an optimal fault-tolerant controller is designed using reinforcement learning. Considering sensor faults and actuator faults, this paper presents the corresponding fault-tolerant controller design method. Considering that the analytical problem of the proposed fault-tolerant controller is difficult to solve, this paper presents an approximation method for the analytical solution based on a neural network. The simulation results demonstrate that the proposed controller ensures the safe and stable landing of the rocket in both nominal and fault scenarios.

Keywords:

reusable launch vehicle (RLV); underactuated system; fault-tolerant control (FTC); reinforcement learning

1. Introduction

With the global development of outer space, RLVs have become a prominent focus of research [1]. RLVs have significant advantages, including rapid deployment, large load carrying capacity, and reusability, and all of these advantages substantially reduce the cost of space transportation and exploration. However, to enable reuse ability, RLVs must perform vertical landings with high precision, placing stringent demands on their guidance and control systems [2,3].

In the vertical recovery of RLVs, the control force is provided by the rocket’s thrust. During the landing process, the control objective of RLVs is to ensure that they reach the target landing point smoothly while minimizing fuel consumption. In this phase, the only adjustable control input is the magnitude and direction of the thrust. However, the system must simultaneously control the rocket’s position to the target and reduce its velocity to zero, which means that there are six outputs to be controlled and three control inputs. This mismatch between the number of control inputs (thrust magnitude and direction) and controlled outputs (position and velocity states) makes the rocket landing problem a typical underactuated control problem. However, to ensure a smooth landing, both the position and velocity of the rocket must be simultaneously regulated. The recovery of launch vehicles represents a typical underactuated control problem.

The design of the optimal controller for an underactuated system is challenging [4,5]. To achieve this, a trajectory tracking control law based on the dynamic inversion method combined with an online trajectory update strategy is proposed in [6]; by using this method, effective reentry guidance tracking is achieved. Sliding mode dynamic surface control is employed to design a precise vertical recovery control strategy, ensuring high control accuracy during vertical recovery [7]. A two-point boundary value approach, formulating the rocket’s initial and desired states as boundary conditions to optimize the trajectory, is introduced in [8,9]; subsequently, an indirect method was used to design the rocket’s optimal flight path. However, the indirect method faces challenges in real-time implementation and convergence guarantees. A direct method for designing controllers for rocket recovery by solving convex optimization problems to achieve landing control is proposed in [10,11]. Nevertheless, both direct and indirect methods require accurate knowledge of the rocket’s nonlinear model and disturbances. In practice, due to the complex atmospheric environment and machining accuracy limitations, the RLV model inevitably contains uncertainties. Moreover, the rocket operates over a large envelope, and states of RLVs are coupled with each other, which leads to the complexity of the controller design of RLVs [12,13].

For the rocket landing control problem under uncertainties and model deviations, the indirect method is improved by neural networks [14,15]. A large set of rocket landing sample data is obtained by extensive training, and then neural networks are used to fit the data; thus a new empirical control law that achieves rocket landing control is derived. By applying reinforcement learning, a staged reward function is built [16]. The staged reward function is solved by Q-learning, and then a rocket landing controller is optimized under energy constraints. Ref. [17] analyzes the uncertain factors of RLVs and presents a design method for parameterized controllers.

The rocket landing control problem has attracted widespread attention. Researchers have designed landing controllers using methods such as direct approaches, indirect approaches, neural networks, and reinforcement learning. However, in practical applications, rocket thrust systems may experience faults, which significantly complicate landing control under fault conditions. The attitude FTC problem for faulty rocket systems is proposed, and a robust control method based on fixed-time observers is proposed [18]. An adaptive FTC technique for rocket attitude control systems is presented in [19]. The FTC problem of the attitude control system during rocket reentry is discussed in [20]. Nevertheless, fault-tolerant control in the rocket landing and recovery process remains relatively underexplored and lacks a comprehensive solution. In summary, although the rocket landing problem has received considerable attention, existing studies have yet to address fault-tolerant control in rocket landing guidance.

Based on the above discussion, this paper discusses the fault-tolerant landing controller designing of RLVs. Uncertainties such as inaccurate nonlinear rocket models and unknown external disturbances are taken into account. While considering these uncertainties and disturbances, neural networks are employed to construct the performance function and control law within the rocket controller, and a model-free fault-tolerant control design method is proposed. The contributions of this paper can be summarized as follows:

A data-based FTC method for RLVs is proposed in this paper. The proposed method can realize the accurate control of RLVs without complete information on the RLV’s nonlinear model.
Both the uncertainties and external interference of RLVs are considered in this paper. The proposed method can deal with not only the faults of RLVs but also their uncertainties and external interference.
Both sensor faults and partial failure faults are considered and addressed simultaneously within the proposed FTC method.

2. Landing Model of Reusable Launch Vehicle

2.1. Nonlinear Landing Model of Reusable Launch Vehicle

The rocket landing problem can be described by the following nonlinear dynamic equations:

\{\begin{array}{l} \dot{r} = V \\ \dot{V} = g + \frac{T + D}{m} \\ \dot{m} = \frac{‖T‖}{V_{e x}} \end{array}

(1)

where

r

represents the position vector of the rocket;

V

represents the velocity vector;

m

denotes the mass of the rocket;

\dot{m}

represents the consumption rate of mass, that is, the consumption rate of fuel; and

g

is the gravitational acceleration at the rocket’s location.

T

denotes the thrust vector of the rocket, and

V_{e x}

represents the rocket’s exhaust velocity, which is a constant.

D

stands for the aerodynamic drag experienced by the rocket, which depends on the vehicle’s velocity and is calculated simultaneously using the following equation:

D = - \frac{1}{2} ρ V^{2} S_{r e f} C_{D}

(2)

where

ρ

is the atmospheric density, which depends on the rocket’s position;

S_{r e f}

is the aerodynamic reference area of the rocket;

C_{D}

is the drag coefficient, which is related to the rocket’s Mach number.

To ensure control accuracy,

g

is closely related to the rocket’s altitude. In practical applications,

g

is calculated using the following formula:

g = - \frac{μ}{{‖R_{E} + r‖}^{3}} (R_{E} + r)

(3)

where

μ

is the Earth’s gravitational constant, and

R_{E}

is the vector from the Earth’s center to the surface.

2.2. Control Requirements

The control objective can be formulated as the following optimal control problem:

\begin{array}{l} \max m \\ s u b j e c t t o \\ \{\begin{cases} \dot{r} = V \\ \dot{V} = g + \frac{T + D}{m} \\ \dot{m} = - \frac{‖T‖}{V_{e x}} \\ r (t_{0}) = r_{0}, V (t_{0}) = V_{0}, m (t_{0}) = m_{0} \\ r (t_{f}) = 0, V (t_{f}) = 0 \end{cases} \end{array}

(4)

where

t_{0}

denotes the initial time, and

t_{f}

represents the final landing time.

r_{0}, V_{0}, m_{0}

correspond to the position, velocity, and mass at the initial time, respectively.

V_{e x}

denotes the exhaust velocity. The rocket’s intrinsic constraints are taken into account; specifically, the thrust is bounded as

T_{\min} \leq ‖T‖ \leq T_{\max}

, where

T_{\min}, T_{\max}

are positive real numbers.

3. Model Transformation

By examining Equations (1)–(3) and considering the control requirements, the variables to be controlled include the position and velocity in three directions, as well as the rocket’s mass. However, the available control input is limited to the rocket’s thrust, so this problem is a typical underactuated control problem. To facilitate controller design, the model is transformed according to the control requirements.

3.1. Selection of Control Variables

To facilitate the controller design, the RLV’s position and velocity are decomposed in the North–East–Up coordinate frame. In this coordinate system, both the position and velocity of the rocket are represented as three-dimensional state variables. Meanwhile, the thrust of the rocket is also decomposed accordingly. Assuming that the rocket does not rotate in space, the thrust can be expressed as a three-dimensional control input vector.

\begin{array}{l} u = {[\begin{matrix} u_{1}, & u_{2}, & u_{3} \end{matrix}]}^{T} \\ = {[\begin{matrix} ‖T‖ \cos α \sin β, & ‖T‖ \sin α \sin β, & ‖T‖ \cos β \end{matrix}]}^{T} \end{array}

where

α

denotes the angle between the thrust vector and the east direction in the “north-east” plane, while

β

represents the angle between the thrust vector and the “upward” (zenith) axis. Under this configuration, the constraints on the three input variables of the rocket are given as follows:

\{\begin{cases} T_{\min} \leq ‖T‖ \leq T_{\max} \\ 0 \leq α \leq \frac{π}{2} \\ 0 \leq β \leq \frac{π}{2} \end{cases}

3.2. Fully Actuated System

To facilitate controller design, the rocket landing model is transformed based on control requirements. Let

r = \sqrt{x^{2} + y^{2} + z^{2}}

and

V = \sqrt{V_{x}^{2} + V_{y}^{2} + V_{z}^{2}}

, where

x, y, z

denotes the position vector of the rocket expressed in the local East–North–Up coordinate system, and

V_{x}, V_{y}, V_{z}

represents the velocity vector in the same coordinate frame. The nonlinear dynamics of the rocket can then be rewritten as follows:

\{\begin{cases} \dot{r} = \frac{x \dot{x} + y \dot{y} + z \dot{z}}{\sqrt{x^{2} + y^{2} + z^{2}}} = \frac{x V_{x} + y V_{y} + z V_{z}}{\sqrt{x^{2} + y^{2} + z^{2}}} \\ \dot{V} = \frac{V_{x} {\dot{V}}_{x} + V_{y} {\dot{V}}_{y} + V_{z} {\dot{V}}_{z}}{\sqrt{V_{x}^{2} + V_{y}^{2} + V_{z}^{2}}} \\ \dot{m} = - \frac{‖T‖}{V_{e x}} \end{cases}

(5)

where

\begin{array}{l} {\dot{V}}_{x} = \frac{\frac{V_{x} D}{\sqrt{V_{x}^{2} + V_{y}^{2} + V_{z}^{2}}}}{m} + \frac{u_{1}}{m} \\ {\dot{V}}_{y} = \frac{\frac{V_{y} D}{\sqrt{V_{x}^{2} + V_{y}^{2} + V_{z}^{2}}}}{m} + \frac{u_{2}}{m} \\ {\dot{V}}_{z} = g + \frac{\frac{V_{z} D}{\sqrt{V_{x}^{2} + V_{y}^{2} + V_{z}^{2}}}}{m} + \frac{u_{3}}{m} \end{array}

At this stage, the rocket landing problem is transformed into a control problem with three inputs and three outputs, where the control objectives achieve

r = 0

and

V = 0

while maximizing the remaining fuel.

To address the above control problem, new output variables are selected as follows:

\{\begin{cases} χ_{1} = r + λ \dot{r} \\ χ_{2} = V \\ χ_{3} = m \end{cases}

(6)

where

λ

represents the coupling between position and velocity. The value of r is a positive number, and the specific value is determined by the importance of position control accuracy and speed control accuracy. Then the new equation can be expressed as follows:

\dot{χ} = f (χ) + g (χ) u

(7)

The specific form of

f (χ), g (χ)

is determined by Equations (1), (2) and (5).

4. Fault Description of RLVs

The RLV’s thrust is controlled by an onboard computer. The thrust system itself may exhibit deviations. In particular, the erosion of the thrust system’s nozzle may occur, causing the rocket’s thrust vector to deviate from the desired direction; then the control accuracy of the rocket is affected. In this paper, two common fault modes are considered for the design of a fault-tolerant controller for the rocket.

4.1. Partial Failure Fault

The rocket propulsion system may produce a thrust lower than the expected value. This type of fault can be described as follows:

{‖T‖}_{r} = η (t) {‖T‖}_{d}

(8)

where

{‖T‖}_{d}

denotes the desired thrust of the rocket, i.e., the thrust required under fault-free conditions;

{‖T‖}_{x}

represents the actual thrust that the rocket can provide; and

η (t)

indicates the thrust efficiency of the rocket, with

0 < η (t) \leq 1

. In this scenario, the rocket experiences a partial failure fault. Considering the actual occurrence of faults in the rocket, the efficiency

η (t)

is unknown. Under this fault mode, the rocket control problem can be transformed into the following:

\begin{array}{l} \max m \\ S u b j e c t t o \\ \{\begin{matrix} \dot{χ} = f (χ) + g (χ) u \\ {‖T‖}_{r} = η (t) {‖T‖}_{d} \\ 0 < η (t) \leq 1 \\ 0 \leq α \leq \frac{π}{2} \\ 0 \leq β \leq \frac{π}{2} \end{matrix} \end{array}

(9)

4.2. Sensor Fault

The rocket’s thrust command is generated by converting the electrical signals received from the onboard computer. However, during signal transmission, signal attenuation may occur, causing the electrical signal received by the thrust generation system to be lower than the expected value. As a result, the rocket’s thrust will deviate. The thrust can be expressed as follows:

{‖T‖}_{r} = {‖T‖}_{d} - {‖T‖}_{f}

(10)

where

{‖T‖}_{d}

denotes the desired thrust of the rocket, and

{‖T‖}_{f}

represents the thrust error caused by the sensor fault.

{‖T‖}_{f}

is a time-varying variable. This type of fault is defined as a sensor fault, with a critical threshold for thrust loss caused by the fault given by

{‖T‖}_{f} \leq \bar{T}

.

{‖T‖}_{r}

represents the actual thrust of the rocket. Under this fault mode, the rocket’s control problem can be formulated as follows:

\begin{array}{l} \max m \\ S u b j e c t t o \\ \{\begin{cases} \dot{χ} = f (χ) + g (χ) u \\ {‖T‖}_{r} = {‖T‖}_{d} - {‖T‖}_{f} \\ T_{\min} \leq {‖T‖}_{d} \leq T_{\max} \\ {‖T‖}_{f} \leq \bar{T} \\ 0 \leq α \leq \frac{π}{2} \\ 0 \leq β \leq \frac{π}{2} \end{cases} \end{array}

(11)

5. Design of Optimal Fault-Tolerant Controller

By analysis, the aforementioned FTC problem is a nonlinear time-varying control problem. Moreover, the inputs are not given in an explicit form, and there is coupling among the input variables; all of this makes the direct FTC design of RLVs challenging. To achieve optimal control for (9) and (11), a model-free optimal fault-tolerant controller is considered, and a data-driven online solution method based on reinforcement learning is employed to implement the optimal controller.

5.1. Optimal Fault-Tolerant Controller Design for Partial Actuator Failures

5.1.1. Optimal Fault-Tolerant Controller Design

Considering the partial failure model of the rocket described in Equation (8), the controllable inputs available from the rocket can be represented as follows:

\begin{array}{l} u_{r} = {[\begin{matrix} u_{1 r}, & u_{2 r}, & u_{3 r} \end{matrix}]}^{T} \\ = {[\begin{matrix} {‖T‖}_{r} \cos α \sin β, & {‖T‖}_{r} \sin α \sin β, & {‖T‖}_{r} \cos β \end{matrix}]}^{T} \end{array}

Since

η (t)

and

{‖T‖}_{r}

are unknown, the exact form of

u_{r}

cannot be determined. Considering that the optimization objective of rocket landing is to minimize fuel consumption while ensuring a stable landing, the following performance index is selected as the optimization criterion:

\begin{array}{l} J (χ (t), u (t)) = \int_{0}^{\infty} (χ^{T} (τ) Q χ (τ) + {u_{r}}^{T} (τ) R u_{r} (τ)) d τ \\ = \int_{0}^{\infty} Θ (χ (τ), u_{r} (τ)) d τ \end{array}

(12)

where

Q

and

R

are positive definite weighting matrices

Θ (χ (τ), u (τ)) = χ^{T} (τ) Q χ (τ) + {u_{r}}^{T} (τ) R u_{r} (τ) .

Remark 1.

Regarding the optimization criterion (9), the control objective is to ensure that the rocket reaches the designated landing point while minimizing fuel consumption. In performance index (12), by appropriately selecting the weighting matrices

Q

and

R

, the optimization criterion (9) can be effectively optimized. Therefore, the performance index (12) is reasonable.

For the control system (9), combined with the performance index function established in (12), the following Hamilton–Jacobi–Bellman (HJB) function is defined:

\begin{array}{l} H (χ (t), u_{r} (t), J (t)) \\ = Θ (χ (t), u_{r} (t)) + \nabla J^{T} (χ (t), u_{r} (t)) \dot{χ} (t) \\ = χ^{T} (t) Q χ (t) + {u_{r}}^{T} (t) R u_{r} (t) + \\ \nabla J^{T} (χ (t), u_{r} (t)) (f (χ) + g (χ) u_{r}) \end{array}

(13)

where

\nabla J^{T} (χ (t), u_{r} (t))

is the gradient of the performance index

J (χ (t), u_{r} (t))

. According to optimal control theory, the required optimal controller can guarantee the minimization of the following performance index:

\begin{array}{l} J^{*} (χ (τ), u_{r} (t)) \\ = \min_{u \in Ω} \int_{0}^{\infty} Θ (χ (τ), u_{r} (τ)) d τ \\ = \int_{0}^{\infty} Θ (χ (τ), u_{r}^{*} (τ)) d τ \end{array}

Under the action of this controller,

\begin{array}{l} H (χ (τ), {u_{r}}^{*} (t), J^{*} (t)) \\ = χ^{T} (t) Q χ (t) + u_{r}^{* T} (t) R {u_{r}}^{*} (t) + \\ \nabla J^{* T} (χ (τ), u_{r} (t)) (f (χ) + g (χ) {u_{r}}^{*} (t)) \\ = 0 \end{array}

To solve the above HJB equation, the partial derivatives should equal zero:

\frac{\partial H (χ (τ), {u_{r}}^{*} (t), J (t))}{\partial {u_{r}}^{*} (t)} = 0

The optimal control input which can ensure that the performance index (12) reaches the minimum under given constraints

{u_{r}}^{*} (t)

can be obtained

{u_{r}}^{*} (t) = - \frac{1}{2} R^{- 1} g^{T} (χ) \nabla J (χ (τ), u_{r} (t))

(14)

Substituting the optimal control input into Equation (13) yields

\begin{array}{l} H (χ (τ), {u_{r}}^{*} (t), J^{*} (t)) \\ = χ^{T} (t) Q χ (t) + \nabla J^{* T} (χ (t), u_{r} (t)) f (χ) \\ - \frac{1}{4} \nabla J^{* T} (χ (t), u_{r} (t)) g (χ) R^{- 1} g^{T} (χ) \nabla J^{*} (e (t), u_{r} (t)) \\ = 0 \end{array}

By substituting the optimal control input into Equation (13), the optimal control law (14) is obtained, which serves as the optimal controller for the system under partial actuator failures. However, this optimal controller cannot be directly implemented since the gradient of the performance index function (12), denoted as

\nabla J (χ (τ), u_{r} (t))

, cannot be solved analytically. To better solve the optimal control input, a policy iteration method from reinforcement learning is employed to compute the system’s optimal control.

The online solution of the optimal controller can be achieved through Algorithm 1.

Algorithm 1: Policy iterative algorithm of FTC

Initialization: Select

u (0), J (0),

Step1: pass

\begin{array}{l} \nabla J^{(k + 1) T} (χ (t), u (t)) (f (x) + g (x) u) \\ + χ^{T} (t) Q χ (t) + u (t) R u (t)) \\ = 0 \end{array}

Calculate

J^{(k + 1)} (χ (t), u (t))

Step2: Updated Control Input

u^{k + 1} (t) = - \frac{1}{2} R^{- 1} g^{T} (χ) \nabla J^{(k)} (χ (t), u (t))

If

‖J^{(k + 1)} (χ (t), u (t)) - J^{(k)} (χ (t), u (t))‖ \leq ε

, then stop the computation; otherwise, proceed to Step 1. Here,

ε

is a sufficiently small positive constant.

5.1.2. Online Solution Method for Fault-Tolerant Controller

For the nonlinear model (9) proposed in this paper, directly applying Algorithm 1 to iteratively compute the controller solution is highly challenging, since the model parameters are time-varying and the fault variable

η (t)

is unknown. Under such circumstances, this paper proposes an online iterative solution method. Considering the performance index (12) and the controller (14), the following neural network is selected for online approximation:

\begin{array}{l} J (t) = W_{J}^{T} φ (t) + e_{J} (t) \\ u (t) = W_{u}^{T} ψ (t) + e_{v} (t) \end{array}

(15)

where

W_{J}^{T}

denotes the weight matrix of the performance evaluation neural network, and

φ (t)

represents its basis function;

W_{u}^{T}

is the weight matrix of the controller neural network, and

ψ (t)

denotes its basis function. The terms

e_{J} (t)

and

e_{v} (t)

represent the approximation errors.

Assuming that both the weights and basis functions in (15) are bounded, the rocket dynamics described in (12) can be rewritten under the conditions of the controller (14). The following can be obtained:

\dot{χ} (t) = f (χ) + g (χ) W_{u}^{T} ψ (t)

(16)

The update law for the optimal control is given by the following:

\begin{array}{l} J^{k + 1} (t + Δ t) = - \int_{t}^{t + Δ t} [ψ^{T} (t) W_{u} R W_{u}^{T} ψ (t) + χ^{T} (τ) Q χ (τ)] d τ \\ - \int_{t}^{t + Δ t} 2 {(u^{k + 1} (t))}^{T} R_{χ} d τ + J^{k + 1} (t) \\ u^{k + 1} (t) = - \frac{1}{2} R^{- 1} g^{T} (x) \nabla J^{k + 1} (t) \end{array}

The HJB equation is as follows:

\begin{array}{l} H (χ, u, J) = χ^{T} (t) Q χ (t) + ψ {(t)}^{T} W_{u}^{*} R W_{u}^{* T} ψ (t) \\ + W_{J}^{* T} \nabla φ (t) (f (χ) + g (χ) W_{v}^{* T} ψ (t)) \\ = 0 \end{array}

where

W_{u}^{*}

and

W_{v}^{*}

represent the optimal weights of the chosen performance approximation neural network and the controller neural network, respectively. Since the optimal weights are unknown, let

{\hat{W}}_{v}

and

{\hat{W}}_{J}

denote the estimates of these optimal weights. Therefore,

\begin{array}{l} \hat{J} (t) = {\hat{W}}_{J}^{T} φ (t) \\ \hat{u} (t) = {\hat{W}}_{v}^{T} ψ (t) \end{array}

(17)

Then the HJB equation is

\begin{array}{l} J (χ, u, J) = χ^{T} (t) Q χ (t) + ψ {(t)}_{u}^{T} {\hat{W}}_{v} R {\hat{W}}_{v}^{T} ψ (t) \\ + {\hat{W}}_{J}^{T} \nabla φ (t) (f (χ) + g (χ) {\hat{W}}_{v}^{T} ψ (t)) + ε_{H J I} \\ = 0 \end{array}

where

ε_{H J I}

denotes the estimated total error.

\begin{array}{l} ε_{H J I} = ψ {(t)}^{T} W_{u} R χ_{u} (t) + χ^{T} (t) R_{χ} (t) + \\ χ_{u}^{T} (t) R W_{u}^{T} ψ (t) + \nabla J_{u} (t) (f (χ) + g (χ) W_{u}^{T} ψ (t)) \end{array}

The residual error of the system at this moment is

\begin{array}{l} ϒ = \int_{t}^{t + Δ t} [χ^{T} (τ) Q χ (τ) + ψ {(t)}^{T} {\hat{W}}_{u}^{(k)} R {\hat{W}}_{u}^{(k) T} ψ (ς)] d τ \\ - \int_{t}^{t + Δ t} 2 {({\hat{W}}_{u}^{(k + 1) T} ψ (t))}^{T} R χ_{1} d τ \\ + {\hat{W}}_{J}^{(k + 1) T} (φ (t + Δ t) - φ (t)) \\ = \int_{t}^{t + Δ t} [χ^{T} (τ) Q χ (τ) + ψ {(t)}^{T} {\hat{W}}_{u}^{(k)} R {\hat{W}}_{u}^{(k) T} ψ (ς)] d τ \\ - \int_{t}^{t + Δ t} 2 (χ^{T} R) \otimes ψ^{T} (t) d τ \cdot v e c ({\hat{W}}_{u}^{(k + 1)}) \\ + {(φ (t + Δ t) - φ (t))}^{T} {\hat{W}}_{J}^{(k + 1)} \end{array}

To better represent the residual error and facilitate subsequent derivations, a new weight vector is defined as follows:

{\hat{W}}^{(k + 1)} = [\begin{matrix} {\hat{W}}_{J}^{(k + 1)} \\ v e c ({\hat{W}}_{u}^{(k + 1)}) \end{matrix}]

(18)

The residual error can be expressed as

ϒ = {\bar{H}}_{1}^{(k)} + \bar{H} {\hat{W}}^{(k + 1)}

(19)

where

\begin{array}{l} {\bar{H}}_{1}^{(k)} = - \int_{t}^{t + Δ t} [χ^{T} (τ) Q χ (τ) + ψ {(t)}^{T} {\hat{W}}_{u}^{(k)} R {\hat{W}}_{u}^{(k) T} ψ (ς)] d τ \\ \bar{H} = {[\begin{matrix} (φ (t + Δ t) - φ (t)) \\ \int_{t}^{t + Δ t} 2 (χ^{T} R) \otimes ψ^{T} (t) d τ \end{matrix}]}^{T} \end{array}

To address the above issues, the following weight update law is designed:

\dot{\hat{W}} = - τ \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} ϒ

(20)

where

τ

denotes the adjustable learning rate.

Theorem 1.

For the RLV landing system (9) subject to partial actuator failures, to achieve the optimal performance index (12), the controller defined by Equation (15) is employed, and the update law for the controller and performance approximation network follows Equation (20). Under these conditions, the landing system remains stable and reaches the desired landing point.

Proof.

Choose the Lyapunov function as follows:

V (t) = J (t) + \frac{1}{δ} t r ({\tilde{W}}^{T} \tilde{W})

where

\tilde{W} = W^{*} - \hat{W}

; taking the derivative of the above equation yields the following:

\dot{V} (t) = \nabla^{T} J (t) (f (χ) + g (χ) {\hat{W}}_{u}^{T} ψ (t)) + \frac{1}{2 δ} t r ({\tilde{W}}^{T} \tilde{W})

Based on Equation (12),

\begin{array}{l} \dot{V} (t) = {\hat{W}}_{J}^{T} \nabla φ (t) (f (χ) + g (χ) {\hat{W}}_{v}^{T} ψ (t)) \\ + e V_{J} (ς) + \frac{1}{2 δ} {\bar{W}}^{T} {\tilde{W}}^{T} \end{array}

where

e V_{J} (ς) = ε_{J} (ς)

. Substituting this into the Hamiltonian function yields

\begin{array}{l} H (χ, u^{*}, J^{*}) = χ^{T} (t) Q χ (t) + ψ {(ς)}^{T} W_{v}^{*} R W_{v}^{* T} ψ (ς) \\ + W_{J}^{* T} \nabla φ (ς) (f (χ) + g (χ) W_{v}^{* T} ψ (ς)) + ε_{H J I} \\ = 0 \end{array}

Based on the neural network weight formulation,

\begin{array}{l} {\hat{W}}_{J}^{T} \nabla φ (t) (f (χ) + g (χ) {\hat{W}}_{v}^{T} ψ (t)) \leq \\ - χ^{T} (t) Q χ (t) - ψ {(ς)}^{T} W_{v}^{*} R W_{v}^{* T} ψ (ς) - ε_{H J I} \end{array}

For Equation (18),

E = H_{1}^{*} - H_{1} + \bar{H} (Ψ^{*} - \hat{Ψ})

Let

e_{H_{1}} = H_{1}^{*} - H_{1}

and

e_{H_{1}} = \int_{t}^{t + Δ t} 2 ψ {(t)}^{T} Ξ ψ (ς) d τ = (2 ψ {(t)}^{T} Ξ ψ (ς)) Δ t

, where

Ξ = [W_{u}^{*} R W_{u}^{* T} - {\hat{W}}_{u}^{(k)} R {\hat{W}}_{u}^{(k) T}]

; therefore

e_{H_{1}} \leq d_{1} ‖{\tilde{W}}_{u}^{(k)}‖ + d_{2}

, and

d_{1}

and

d_{2}

are bounded normal variables.

For

t r ({\tilde{W}}^{T} \hat{W}),

\begin{array}{l} \frac{1}{2 δ} t r ({\tilde{W}}^{T} \hat{W}) = t r ({\tilde{W}}^{T} \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} Ϝ) \\ \leq t r ({\tilde{W}}^{T} \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} \bar{H} (Ψ^{*} - \hat{Ψ}) + e_{H_{1}}) \\ = t r ({\tilde{W}}^{T} \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} \bar{H} W^{*} - {\tilde{W}}^{T} \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} \bar{H} \hat{Ψ} \\ + {\tilde{W}}^{T} \frac{\bar{H}}{{(\bar{H} {\bar{H}}^{T} + 1)}^{2}} e_{H_{1}}) \\ \leq \frac{d_{η}}{Δ t} ‖\tilde{W}‖ \end{array}

Therefore,

\dot{V} (t) \leq d_{η} ‖\tilde{W}‖ + d_{1} ‖\tilde{W}‖ + d_{2}

. According to the theory of bounded stability of the system [20], the system is boundedly stable. The proof is completed. □

Remark 2.

In the design of partially faulty controllers, thrust efficiency

η (t)

is unknown. However, during system derivation, neural network approximation treats the input matrix and efficiency as a combined entity, thereby simultaneously addressing the uncertainties in both the input matrix and efficiency.

5.2. Design of Fault-Tolerant Controller for Sensor Failures

In Section 5.1, the optimal controller for the landing rocket under partial failure conditions was obtained. This section analyzes the fault-tolerant control problem of the rocket under sensor failure conditions.

The model of the rocket considering sensor failures, as established in Section 3.2, can represent the overall landing rocket system under this fault mode as follows:

\dot{χ} = f (χ) + g (χ) (u - f_{a})

(21)

where

f_{a}

represents the thrust loss of the rocket caused by the fault. At this point, according to the performance function in Section 5.1, its derivative is taken.

\dot{V} (t) = \nabla^{T} J (t) (f (χ) + g (χ) {\hat{W}}_{u}^{T} ψ (t) - g (χ) f_{a})

Based on the HJB equation, a further simplification of the above expression yields the following:

\begin{array}{l} {\hat{W}}_{J}^{T} \nabla φ (t) (f (χ) + g (χ) {\hat{W}}_{v}^{T} ψ (t) - f_{a}) \\ \leq - e^{T} (t) Q e (t) - ψ {(ς)}^{T} W_{v}^{*} R W_{v}^{* T} ψ (ς) - ε_{H J I} + {f_{a}}^{T} R f_{a} \end{array}

It can be seen that the designed fault-tolerant controller needs to compensate for the efficiency loss of the actuators caused by faults. Under the premise that the optimal controller for the nominal condition is known, the fault-tolerant controller for rocket landing is implemented as follows:

Theorem 2.

For the RLV landing system described by (9), to satisfy the optimal performance criterion in (12), a controller as given in (22) is employed. The update laws for both the controller and the performance approximation network follow (20), while the adaptive estimation parameter update law for faults is given by (23). Under these conditions, the landing system remains stable and successfully reaches the desired landing point.

u (t) = u^{*} (t) + {\hat{f}}_{a} u^{*} (t) = W_{u}^{T} ψ (t)

(22)

{\dot{\hat{f}}}_{a} = σ (2 u *^{T} R + {‖x^{T} g (χ)‖}_{\max})^{T}

(23)

where

{‖\cdot‖}_{\max}

denotes the maximum value of the function.

Proof.

Choose the Lyapunov function as follows:

V (t) = J (t) + \frac{1}{δ} t r ({\tilde{W}}^{T} \tilde{W}) + \frac{1}{σ} {\tilde{f}}_{a}^{T} {\tilde{f}}_{a}

where

{\tilde{f}}_{a} = f_{a} - {\hat{f}}_{a}

, and

σ

is a constant. Taking the derivative of the above equation yields the following:

\begin{array}{l} \dot{V} (t) = \nabla^{T} J (t) (f (χ) + g (χ) {\hat{W}}_{u}^{T} ψ (t) + {\hat{f}}_{a}) \\ + \frac{1}{2 δ} t r ({\tilde{W}}^{T} \tilde{W}) - \frac{1}{σ} {\dot{\hat{f}}}_{a}^{T} {\tilde{f}}_{a} \end{array}

Based on Equation (12), considering

\nabla^{T} J (t) = {\hat{W}}_{J}^{T} \nabla φ (t)

,

\begin{array}{l} {\hat{W}}_{J}^{T} \nabla φ (t) (f (x) + g (x) {\hat{W}}_{v}^{T} ψ (t) - f_{a}) \\ \leq - e^{T} (t) Q e (t) - ψ {(ς)}^{T} W_{v}^{*} R W_{v}^{* T} ψ (ς) + {f_{a}}^{T} R f_{a} - ε_{H J I} \end{array}

Then, combined with the proof of Theorem 1,

\dot{V} (t) = d_{η} ‖\tilde{W}‖ + d_{1} ‖\tilde{W}‖ + d_{2} - \frac{1}{σ} (2 u *^{T} R {\dot{\hat{f}}}_{a}^{T} - x^{T} g (χ) {\dot{\hat{f}}}_{a}^{T}) {\tilde{f}}_{a}

By combining this with the update law of

{\hat{f}}_{a}

, the system is shown to be bounded and stable, thus completing the proof. □

6. Simulation Results

To verify the performance of the proposed method, it is applied to the landing control of a recoverable rocket. In the simulation, the rocket’s initial position is set to

[- 3000 m, 300 m, 4000 m]

, and the initial velocity is

[300 m / s, 30 m / s, - 300 m / s]

. The mass of the rocket is 48,200 kg, and the rocket’s fault conditions are configured as follows:

(1) For partial failure faults,

η (t) = 0.8

;

(2) For the sensor fault,

{‖T‖}_{f} = 20,000 (1 - e^{- t})

N.

The basis function selected in the controller design is

φ (t) = ψ (t) = [\begin{matrix} x & y & z & V_{x} & V_{y} & V_{z} r V \end{matrix}]

. According to the controller designed in this paper, simulations were first conducted under fault-free conditions. The proposed FTC is marked as

u_{f}

, while a normal controller without considering the fault is marked as

u_{n}

. The following three cases are considered in the simulation:

Case I: Only the partial failure fault is considered;

Case II: Only the sensor fault is considered;

Case III: Both the partial failure fault and the sensor fault are considered.

The simulation results for the above three cases are shown in the following figures.

Case I: Figure 1 represents the position variation curve, Figure 2 represents the velocity variation curve, Figure 3 shows the rocket’s thrust variation curve, and Figure 4 depicts the rocket’s mass variation curve. As shown in the figures, the designed FTC

u_{f}

achieves the stable control of the rocket under fault-free conditions and ensures a stable landing.

Case II: Figure 5 represents the position variation curve, Figure 6 represents the velocity variation curve, Figure 7 shows the rocket’s thrust variation curve, and Figure 8 depicts the rocket’s mass variation curve. As shown in the figures, under sensor fault, the designed FTC

u_{f}

achieves the stable control of the rocket under fault-free conditions and ensures a stable landing.

Case III: Figure 9 represents the position variation curve, Figure 10 represents the velocity variation curve, Figure 11 shows the rocket’s thrust variation curve, and Figure 12 depicts the rocket’s mass variation curve. As shown in the figures, the designed FTC

u_{f}

achieves the stable control of the rocket under fault-free conditions and ensures a stable landing under both the partial failure fault and the sensor fault.

Based on the simulation results, it can be observed that when the system operates without faults, the designed controller is capable of achieving the safe landing control of the reusable launch vehicle. Furthermore, when faults occur, the designed controller can still ensure the safe landing control of the launch vehicle under fault conditions.

7. Conclusions

This paper addresses the landing problem of RLVs. By analyzing the landing rocket model, the typical underactuated control challenge of the rocket’s control system is identified. Subsequently, through model transformation, a controllable rocket landing model is derived. Considering the presence of uncertainties and external disturbances in the rocket model, a reinforcement learning-based controller is designed. Additionally, a fault-tolerant controller is developed to handle sensor faults in the rocket. The simulation results demonstrate that the proposed controllers ensure the safe and precise landing of the reusable launch vehicle under both fault-free and faulty conditions.

Author Contributions

Conceptualization, J.X. and X.H.; methodology, X.H.; software, Y.X.; validation, Y.W. and J.X.; investigation, X.H.; resources, Y.W.; data curation, C.G. and Y.X.; writing—original draft preparation, J.X.; writing—review and editing, C.G.; visualization, X.H.; supervision, X.H.; project administration, J.X.; funding acquisition, X.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under Grant 61833016, Grant 62073265.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

We would like to acknowledge the reviewers for their careful reading, helpful comments, and constructive suggestions, which have significantly improved the presentation of our manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hu, X.; Xiao, B.; Hu, C.; Si, X. Unmeasurable flexible dynamics monitoring and tracking controller design for guidance and control system of hypersonic flight vehicle. J. Frankl. Inst. 2024, 361, 958–977. [Google Scholar] [CrossRef]
Wu, X.; Xiao, B.; Wu, C.; Guo, Y. Centroidal voronoi tessellation and model predictive control–based macro-micro trajectory optimization of microsatellite swarm. Space Sci. Technol. 2022, 2022, 9802195. [Google Scholar] [CrossRef]
Gulczynski, M.T.; Vennitti, A.; Scarlatella, G.; Calabuig, G.J.D.; Blondel-Canepari, L.; Weber, F.; Sarritzu, A.; Bach, C.; Deeken, J.C.; Pasini, A.; et al. RLV applications: Challenges and benefits of novel technologies for sustainable main stages. In Proceedings of the International Astronautical Congress, IAC (No. 64293), Dubai, United Arab Emirates, 25–29 October 2021; International Astronautical Federation, IAF: Paris, France, 2021. [Google Scholar]
Ye, L.; Tian, B.; Liu, H.; Zong, Q.; Liang, B.; Yuan, B. Anti-windup robust backstepping control for an underactuated reusable launch vehicle. IEEE Trans. Syst. Man Cybern. Syst. 2020, 52, 1492–1502. [Google Scholar] [CrossRef]
Singh, S.; Stappert, S.; Buckingham, S.; Lopes, S.; Kucukosman, Y.C.; Simioana, M.; Pripasu, M.; Wiegand, A.; Sippel, M.; Planquart, P. Dynamic Modelling and control of an aerodynamically controlled capturing device for ‘in-air-capturing’ of a reusable launch vehicle. In Proceedings of the 11th International ESA Conference on Guidance, Navigation & Control Systems, Sopot, Poland, 21–25 June 2021; pp. 22–25. [Google Scholar]
Cheng, G.; Jing, W.; Gao, C. Recovery trajectory planning for the reusable launch vehicle. Aerosp. Sci. Technol. 2021, 117, 106965. [Google Scholar] [CrossRef]
Javaid, U.; Dong, H.; Ijaz, S.; Alkarkhi, T.; Haque, M. High-performance adaptive attitude control of spacecraft with sliding mode disturbance observer. IEEE Access 2022, 10, 42004–42013. [Google Scholar] [CrossRef]
Lu, P. Propellant-optimal powered descent guidance. J. Guid. Control Dyn. 2018, 41, 813–826. [Google Scholar] [CrossRef]
Liu, X.F.; Lu, P.; Pan, B.F. Survey of convex optimization for aerospace applications. Astrodynamics 2017, 1, 23–40. [Google Scholar] [CrossRef]
Xue, X.P.; Wen, C.Y. Review of unsteady aerodynamics of supersonic parachutes. Prog. Aerosp. Sci. 2021, 125, 100728. [Google Scholar] [CrossRef]
Zhang, X.; Mu, R.; Chen, J.; Wu, P. Hybrid multi-objective control allocation strategy for reusable launch vehicle in re-entry phase. Aerosp. Sci. Technol. 2021, 116, 106825. [Google Scholar] [CrossRef]
An, S.; Liu, K.; Fan, Y.; Guo, J.; She, Z. Control design for the autonomous horizontal takeoff phase of the reusable launch vehicles. IEEE Access 2020, 8, 109015–109027. [Google Scholar] [CrossRef]
Cheng, L.; Wang, Z.B.; Jiang, F.H.; Li, J. Fast generation of optimal asteroid landing trajectories using deep neural networks. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 2642–2655. [Google Scholar] [CrossRef]
Xue, S.; Wang, Z.; Bai, H.; Yu, C.; Li, Z. Research on Self-Learning Control Method of Reusable Launch Vehicle Based on Neural Network Architecture Search. Aerospace 2024, 11, 774. [Google Scholar] [CrossRef]
Kim, G.S.; Chung, J.; Park, S. Realizing stabilized landing for computation-limited reusable rockets: A quantum reinforcement learning approach. IEEE Trans. Veh. Technol. 2024, 73, 12252–12257. [Google Scholar] [CrossRef]
Liang, X.; Wang, Q.; Hu, C.; Dong, C. Fixed-time observer based fault tolerant attitude control for reusable launch vehicle with actuator faults. Aerosp. Sci. Technol. 2020, 107, 106314. [Google Scholar] [CrossRef]
Wang, C.; Chen, J.; Jia, S.; Chen, H. Parameterized design and dynamic analysis of a reusable launch vehicle landing system with semi-active control. Symmetry 2020, 12, 1572. [Google Scholar] [CrossRef]
Zhang, L.; Wei, C.; Wu, R.; Cui, N. Adaptive fault-tolerant control for a VTVL reusable launch vehicle. Acta Astronaut. 2019, 159, 362–370. [Google Scholar] [CrossRef]
Liang, X.; Xu, B.; Hong, R.; Sang, M. Quaternion observer-based sliding mode attitude fault-tolerant control for the Reusable Launch Vehicle during reentry stage. Aerosp. Sci. Technol. 2022, 129, 107855. [Google Scholar] [CrossRef]
Liang, Y.; Zhang, H.; Duan, J.; Sun, S. Event-triggered reinforcement learning H∞ control design for constrained-input nonlinear systems subject to actuator failures. Inf. Sci. 2021, 543, 273–295. [Google Scholar] [CrossRef]

Figure 1. Variation in rocket position under Case I.

Figure 2. Variation in rocket velocity under Case I.

Figure 3. Variation in rocket thrust under Case I.

Figure 4. Variation in rocket mass under Case I.

Figure 5. Variation in rocket position under Case II.

Figure 6. Variation in rocket velocity under Case II.

Figure 7. Variation in rocket thrust under Case II.

Figure 8. Variation in rocket mass under Case II.

Figure 9. Variation in rocket position under Case III.

Figure 10. Variation in rocket velocity under Case III.

Figure 11. Variation in rocket thrust under Case III.

Figure 12. Variation in rocket mass under Case III.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xu, J.; Guo, C.; Wang, Y.; Xiao, Y.; Hu, X. Fault-Tolerant Controller Design for Reusable Launch Vehicle. Actuators 2025, 14, 565. https://doi.org/10.3390/act14110565

AMA Style

Xu J, Guo C, Wang Y, Xiao Y, Hu X. Fault-Tolerant Controller Design for Reusable Launch Vehicle. Actuators. 2025; 14(11):565. https://doi.org/10.3390/act14110565

Chicago/Turabian Style

Xu, Jian, Chenguang Guo, Yuewen Wang, Yong Xiao, and Xiaoxiang Hu. 2025. "Fault-Tolerant Controller Design for Reusable Launch Vehicle" Actuators 14, no. 11: 565. https://doi.org/10.3390/act14110565

APA Style

Xu, J., Guo, C., Wang, Y., Xiao, Y., & Hu, X. (2025). Fault-Tolerant Controller Design for Reusable Launch Vehicle. Actuators, 14(11), 565. https://doi.org/10.3390/act14110565

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Fault-Tolerant Controller Design for Reusable Launch Vehicle

Abstract

1. Introduction

2. Landing Model of Reusable Launch Vehicle

2.1. Nonlinear Landing Model of Reusable Launch Vehicle

2.2. Control Requirements

3. Model Transformation

3.1. Selection of Control Variables

3.2. Fully Actuated System

4. Fault Description of RLVs

4.1. Partial Failure Fault

4.2. Sensor Fault

5. Design of Optimal Fault-Tolerant Controller

5.1. Optimal Fault-Tolerant Controller Design for Partial Actuator Failures

5.1.1. Optimal Fault-Tolerant Controller Design

5.1.2. Online Solution Method for Fault-Tolerant Controller

5.2. Design of Fault-Tolerant Controller for Sensor Failures

6. Simulation Results

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI