Adaptive Dynamic Programming-Based Spacecraft Attitude Control Under a Tube-Based Framework

Shiyi Li; Kerun Liu; Ming Liu

doi:10.3390/electronics13224575

,

and

School of Aeronautics, Harbin Institute of Technology, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

Electronics2024, 13(22), 4575;https://doi.org/10.3390/electronics13224575

This article belongs to the Section Systems & Control Engineering

Version Notes

Order Reprints

Abstract

This paper investigates the control problem of a spacecraft attitude manoeuvrer with external disturbances. Firstly, the spacecraft attitude dynamical model is introduced; then, the tube-based framework is constructed, which includes a nominal system and an error system. Based on that, the control law design would be a two-step process. To start with, the nominal control law is developed via an adaptive dynamic programming technique and a neural network approximation in order to provide a nominal trajectory to the desired attitude. Moreover, based on the nonsingular terminal sliding mode control scheme, the error controller is derived to lead the actual system to track the nominal trajectory and suppress disturbances. The stability of the closed-loop system is analyzed via the Lyapunov approach and the simulation results could verify the effectiveness of the proposed control scheme.

Keywords:

spacecraft attitude control; tube-based framework; adaptive dynamic programming; nonsingular terminal sliding mode

1. Introduction

Recent years have witnessed prosperous developments in the field of aerospace engineering, especially in terms of the attitude control of spacecrafts, contributing to the success of a wide range of space missions, such as on-orbit monitoring [1], on-orbit inspection [2], and formation flights [3]. Among previous works in this area, many control schemes have been proven to be effective in reaching the goal of precise spacecraft attitude control, such as the back stepping method [4,5], sliding mode control [6,7], adaptive control [8,9], and observer-based control [10,11]. However, in most practical scenarios, electrical power is considered as the major energy for small spacecrafts, which can only carry very limited energy storage systems [12]. Considering the energy consumption utilized during attitude manoeuvrers of the spacecraft, only applying the above methods can guarantee optimal control performance and the minimizing of energy consumption; therefore, the optimal control theory plays an important role in many practical cases.

Various kinds of methods are included in the optimal control theory, such as inverse optimal control [13,14], H∞ optimal tracking control [15,16], and the online-learning technique [17,18]. Among these methodologies, adaptive dynamic programming (ADP) [19] has been proven to be a powerful data-driven method that is capable of ensuring optimal control performance through iteratively solving the Hamilton–Jacobi–Bellman (HJB) equation. This optimal control scheme has been widely adopted by many scholars to solve optimal control problems regarding spacecraft [20,21] and other objectives [22,23]. For the attitude dynamics of spacecraft with high nonlinearity and complexity, the corresponding HJB equation, subject to the pre-defined cost function, would be a complicated differential equation; thus, it is difficult to obtain its analytical solution. To address this obstacle efficiently, an adaptive neural network (ANN) can be adopted to actively approximate the HJB function. Through the ANN learning technique, the optimal control policy could be easily obtained.

Additionally, the on-orbit spacecraft would also suffer external disturbances caused by atmospheric drag, the Earth’s geomagnetic and solar radiation pressure, etc. A wide range of methods have been studied to deal with such a problem, among which the sliding mode control scheme is a major choice for spacecraft attitude control and suppressing disturbances. In [24], an adaptive nonsingular terminal sliding mode (NTSM) control scheme is proposed for spacecraft attitude tracking with actuator faults. Qiao et al. proposed a novel spacecraft composite attitude stabilization scheme in [25] using a nonsingular sliding mode technique, which could compensate for the estimated disturbances and attenuate the influence of estimated errors, showing the effectiveness of the NTSM. Furthermore, a tube-based control framework is also an effective method for improving the control performance for spacecraft attitude manoeuvring, and it includes a nominal system and an error system. In the nominal system, where the external disturbance is not considered, a nominal controller would be designed to draw the nominal states to the desired point, which provides a nominal trajectory. Additionally, the error controller for the error system would lead the actual system to the nominal trajectory and suppress unknown disturbances. In [26], a new tube-based framework is developed to design a guaranteed cost control law for spacecraft attitude reorientation, indicating the effectiveness of the tube-based framework.

Inspired by all the above methodologies, this article would consider the attitude reorientation control problem of a rigid spacecraft under external disturbances, with three reaction wheels being the actuators, and focus on the design of a tube-based control scheme via ADP and the NTSM technique. To be specific, the nominal system and the error system would be firstly constructed based on the attitude dynamical model of a spacecraft, which would be in the next section. Then, the tube-based control laws would be designed in Section 3, which would include an ADP-based nominal control law that ensures optimal control performance and convergence of the nominal system and a NTSM-based error control law that serves to stabilize the error system and deal with unknown disturbances. The stability of the closed-loop control system would be analyzed via the Lyapunov approach. The effectiveness of the proposed method would be verified through a numerical simulation conducted in Section 4. Section 5 would present the conclusion of this paper. The main contributions of this paper are concluded as follows:

(1): A tube-based framework that includes a nominal system and an error system is constructed for spacecraft attitude control, allowing for “two degrees of freedom” for controller design. Moreover, with the generated nominal trajectory and a small error set, the knowledge of the actual states can be determined prior to control being applied.
(2): The adaptive dynamic programming technique is adopted for the design of nominal control law, aiming to optimize the control performance and minimize energy costs while ensuring the convergence of the nominal system.
(3): The nonsingular terminal sliding mode control scheme is used to derive the error control law, which serves to suppress external disturbances and lead the actual system to track the nominal system.

Notations: We denote by

I_{n}

the identity matrix of

n \times n

.

| \cdot |

stands for the absolute value of a scalar, and

∥ \cdot ∥

is the standard Euclidean norm of a vector;

s i g n (\cdot)

represents the standard sign function; for any

x = {[x_{1}, \dots, x_{n}]}^{T} \in R^{n}

, we define

diag (x) = diag (x_{1}, \dots, x_{n})

as a diagonal matrix, and

s i g^{m} (x) = [| x_{1} |^{m} s i g n (x_{1}), \dots, | x_{n} {|^{m} s i g n (x_{n})]}^{T}, 0 < m < 1

. Additionally,

\forall a \in R^{3}

,

a^{\times}

is the cross-product operating element that transforms vector

a = {[a_{1}, a_{2}, a_{3}]}^{T}

into a skew-symmetric matrix:

\begin{matrix} a^{\times} = [\begin{matrix} 0 & - a_{3} & a_{2} \\ a_{3} & 0 & - a_{1} \\ - a_{2} & a_{1} & 0 \end{matrix}] . \end{matrix}

2. Problem Formulation and Preliminaries

In this section, we start with analyzing the attitude kinematics and dynamics of rigid spacecraft and then construct an error attitude dynamical model. Additionally, the tube-based control framework is also introduced to construct a nominal system and an error system. The control objective is to design a nominal controller and an error controller, respectively, for each system and ensure that the nominal system would be stabilized while the actual system could track the optimized nominal trajectory while all system state errors are guaranteed to be bounded.

2.1. Error Attitude Dynamical Model of Rigid Spacecraft

To begin, we introduce the Modified Rodriguez Parameters (MRPs) to describe the attitude kinematics and dynamics as follows [27]

\begin{matrix} \dot{σ} & = \frac{1}{4} M (σ) Ω \end{matrix}

(1a)

\begin{matrix} J \dot{Ω} & = - Ω^{\times} J Ω + u + d \end{matrix}

(1b)

with

\begin{matrix} M (σ) = (1 - σ^{T} σ) I_{3} + 2 σ^{\times} + 2 σ σ^{T}, \end{matrix}

(1c)

where

σ \in R^{3}

denotes the MRPs describing the attitude orientation with respect to the inertia frame

I

;

Ω \in R^{3}

represents the angular velocity and

J \in R^{3 \times 3}

is the inertia matrix of the spacecraft; and

d \in R^{3}

is the external disturbance torque and the control input is denoted by

u \in R^{3}

. Before proceeding further, we shall make the following assumption:

Assumption 1.

The disturbance d is unknown but bounded by a unknown constant

d_{m} > 0

, i.e.,

∥ d ∥ \leq \bar{d}

.

Remark 1.

According to [28], the matrix

M (σ)

is invertible as it satisfies

M {(σ)}^{- 1} = \frac{16}{{(1 + σ^{T} σ)}^{2}} M {(σ)}^{T}

.

By defining

σ_{d} \in R^{3}

as the desired attitude trajectory, the relative attitude described by the error MRPs could be written as

\begin{matrix} σ_{e} & = \frac{(1 - {∥ σ_{d} ∥}^{2}) σ - (1 - {∥ σ ∥}^{2}) σ_{d} + 2 σ \times σ_{d}}{1 + {∥ σ ∥}^{2} {∥ σ_{d} ∥}^{2} + 2 σ_{d}^{T} σ}, \end{matrix}

(2)

then, the error kinematics and dynamics of the spacecraft could be represented in the following form

\begin{matrix} {\dot{σ}}_{e} & = \frac{1}{4} M (σ_{e}) ω \end{matrix}

(3a)

\begin{matrix} J \dot{ω} & = - {(ω + C Ω_{d})}^{\times} J (ω + C Ω_{d}) + J (ω^{\times} C Ω_{d} - C {\dot{Ω}}_{d}) + u + d \end{matrix}

(3b)

with

\begin{matrix} C = I_{3} - \frac{4 (1 - σ_{e}^{T} σ_{e})}{{(1 + σ_{e}^{T} σ_{e})}^{2}} σ_{e}^{\times} + \frac{8 {(σ_{e}^{\times})}^{2}}{{(1 + σ_{e}^{T} σ_{e})}^{2}}, \end{matrix}

(3c)

where

ω \in R^{3}

is the relative angular velocity satisfying

ω = Ω - C Ω_{d}

;

Ω_{d}

and

{\dot{Ω}}_{d}

are the desired angular velocity and its derivative, respectively.

Remark 2.

Based on the error attitude dynamical model above, the primary objective of control in this paper is to find the input signal for the spacecraft model (1) in order to transition the state

σ (0), Ω (0)

to

σ (t_{f}), Ω (t_{f})

, where

σ (t_{f}) a n d Ω (t_{f})

are equal to the desired values and

t_{f} > 0

is the task completion time. Additionally, the error MRPs and the relative angular velocity could converge to 0.

2.2. Tube-Based Control Framework

In what follows, by introducing a tube-based control framework, the original attitude model (3) would be split into a nominal system and an error system, where the external disturbance is only considered in the error system.

To start with, the spacecraft error attitude dynamical model (3) can be rewritten as

\begin{matrix} [\begin{matrix} {\dot{σ}}_{e} \\ \dot{ω} \end{matrix}] = g (σ_{e}, ω) + [\begin{matrix} 0 \\ J^{- 1} u \end{matrix}] + [\begin{matrix} 0 \\ J^{- 1} d \end{matrix}] \end{matrix}

(4)

with

\begin{matrix} g (σ_{e}, ω) = [\begin{matrix} \frac{1}{4} M (σ_{e}) ω \\ - J^{- 1} {(ω + C Ω_{d})}^{\times} J (ω + C Ω_{d}) + ω^{\times} C Ω_{d} - C {\dot{Ω}}_{d} \end{matrix}] . \end{matrix}

(5)

Then, we could define a nominal system in the following form where the external disturbance is not considered

\begin{matrix} [\begin{matrix} \dot{\bar{σ_{e}}} \\ \dot{\bar{ω}} \end{matrix}] = g ({\bar{σ}}_{e}, \bar{ω}) + [\begin{matrix} 0 \\ J^{- 1} \bar{u} \end{matrix}], \end{matrix}

(6)

where

\bar{u}

is the nominal control law to be designed. Additionally, the error system that includes the external disturbance is defined as follows

\begin{matrix} \dot{e} = [\begin{matrix} {\dot{e}}_{1} \\ {\dot{e}}_{2} \end{matrix}] = g (σ_{e}, ω) - g ({\bar{σ}}_{e}, \bar{ω}) + [\begin{matrix} 0 \\ J^{- 1} v \end{matrix}] + [\begin{matrix} 0 \\ J^{- 1} d \end{matrix}], \end{matrix}

(7)

where

e_{1} = σ_{e} - {\bar{σ}}_{e}

,

e_{2} = ω - \bar{ω}

, and v is the error control law to be designed.

Combining the above two systems, the actual control input could be written as

\begin{matrix} u = \bar{u} + v . \end{matrix}

(8)

Remark 3.

It should be noticed that the tube-based control framework mainly includes the nominal system (6) to be optimized and the error system (7) that serves to suppress the external disturbances. In this control scheme, the nominal control law is designed to optimize the nominal system without considering the external disturbances and the error controller is designed to lead the actual system, where the disturbances exist, to track the nominal system. In order to ensure that the actual system would track the optimized trajectory given by the nominal system with relatively small errors, the initial states of the nominal system should be set as the same as the actual states, which means that

e = 0

.

3. Main Results

In this section, the nominal control law would be developed based on the adaptive dynamic programming technique, where the HJB equation would be solved via an ANN approximation in order to further derive the optimal control policy that is capable of optimizing the control performance for the convergence and stabilization of the nominal system. Moreover, the terminal sliding mode control technique would be used to derive the error control law, which guarantees that the actual system can accurately track the optimized nominal trajectory.

3.1. ADP-Based Control Law for Nominal System

Consider the nominal system (6); to ensure that the original point is the only equilibrium, we define

\begin{matrix} w = \frac{4 p_{1}}{∥ {\bar{σ}}_{e} ∥^{2} + 1} {\bar{σ}}_{e}, \end{matrix}

(9)

where

p_{1}

is a positive constant to be designed and the derivative of w can be easily calculated as

\begin{matrix} \dot{w} = p_{1} \frac{M ({\bar{σ}}_{e}) - 2 {\bar{σ}}_{e} {\bar{σ}}_{e}^{T}}{1 + ∥ {\bar{σ}}_{e} ∥^{2}} (- w + \bar{ω}), \end{matrix}

(10)

then the coordinate transformation system could be written as

\begin{matrix} \dot{y} = [\begin{matrix} {\dot{y}}_{1} \\ {\dot{y}}_{2} \end{matrix}] = [\begin{matrix} {\dot{\bar{σ}}}_{e} \\ {\dot{y}}_{2} \end{matrix}] = [\begin{matrix} \frac{1}{4} M ({\bar{σ}}_{e}) (y_{2} - w) \\ - J^{- 1} {(y_{2} - w)}^{\times} J (y_{2} - w) + \dot{w} \end{matrix}] + [\begin{matrix} 0 \\ J^{- 1} \bar{u} \end{matrix}], \end{matrix}

(11)

where

y_{2} = \bar{ω} + w

and

y \in R^{6}

are the nominal states. Additionally, (11) could be further written as

\begin{matrix} \dot{y} = G + K \bar{u} \end{matrix}

(12)

with

\begin{matrix} G = [\begin{matrix} \frac{1}{4} M ({\bar{σ}}_{e}) (y_{2} - w) \\ - J^{- 1} {(y_{2} - w)}^{\times} J (y_{2} - w) + \dot{w} \end{matrix}], K = [\begin{matrix} 0 \\ J^{- 1} \end{matrix}] \in R^{6 \times 3} . \end{matrix}

(13)

Consider the following performance function:

\begin{matrix} T = \int_{0}^{t_{f}} (y^{T} y + {\bar{u}}^{T} A \bar{u}) d t, \end{matrix}

(14)

where

t_{f}

is the convergence time of the system and

A = 4 J^{- T} J^{- 1} \in R^{3 \times 3}

.

To derive the optimal nominal control law that can stabilize the nominal system and minimize the performance function, we define the optimal function as follows

\begin{matrix} T^{*} = \min (\int_{0}^{t_{f}} (y^{T} y + {\bar{u}}^{T} A \bar{u}) d t) . \end{matrix}

(15)

Based on the optimal control theory, the Hamilton–Jacobi–Bellman (HJB) equation and the optimal control policy can be given as follows

\begin{matrix} B_{H} = y^{T} y - \frac{1}{4} {(\frac{\partial T^{*}}{\partial y})}^{T} K A^{- 1} K^{T} (\frac{\partial T^{*}}{\partial y}) + {(\frac{\partial T^{*}}{\partial y})}^{T} G = 0 \end{matrix}

(16)

\begin{matrix} {\bar{u}}^{*} = - \frac{1}{2} A^{- 1} K^{T} \frac{\partial T^{*}}{\partial y} . \end{matrix}

(17)

To further derive the optimal control law, solving of the HJB equation is required to obtain the analytical form of

T_{y}^{*} = \frac{\partial T^{*}}{\partial y}

. However, due to the fact that the HJB equation is a complex nonlinear differential equation, it is difficult to directly obtain its solution. Thus, an adaptive neural network is introduced to approximate the solution of the HJB equation. According to the universal approximation property of the neural network, we have

\begin{matrix} T^{*} = D^{* T} h (y) + δ, \end{matrix}

(18)

where

D^{*} \in R^{18 \times 1}

is the optimal weight and

h (y) \in R^{18 \times 1}

is the activation function. Taking the derivative with respect to the nominal state y yields

\begin{matrix} T_{y}^{*} = \nabla h^{T} D^{*} + \nabla δ, \end{matrix}

(19)

where

\nabla h = \frac{\partial h}{\partial y} \in R^{18 \times 6}

and

\nabla δ = \frac{\partial δ}{\partial y}

. Substituting (19) into (17) and (16) gives

\begin{matrix} {\bar{u}}^{*} = - \frac{1}{2} A^{- 1} K^{T} (\nabla h^{T} D^{*} + \nabla δ) \end{matrix}

(20)

\begin{matrix} B_{H} = y^{T} y - \frac{1}{4} D^{* T} \nabla h K A^{- 1} K^{T} \nabla h D^{*} + D^{* T} \nabla h G + δ_{1} = 0, \end{matrix}

(21)

where

δ_{1} = \nabla δ^{T} G - \frac{1}{2} \nabla δ^{T} K A^{- 1} K^{T} \nabla h^{T} D^{*} - \frac{1}{4} \nabla δ^{T} K A^{- 1} K^{T} \nabla δ

.

Then, the ANN can be implemented to approximate the performance function

T^{*}

:

\begin{matrix} \hat{T} = {\hat{D}}^{T} h, \end{matrix}

(22)

where

\hat{D}

is the estimation of

D^{*}

. Additionally, by taking the derivative with respect to the nominal state, it can obtain

\begin{matrix} {\hat{T}}_{y} = \nabla h^{T} \hat{D}, \end{matrix}

(23)

the approximated optimal control law can be given as

\begin{matrix} \hat{\bar{u}} = - \frac{1}{2} A^{- 1} K^{T} \nabla h^{T} \hat{D} \end{matrix}

(24)

with the approximated Bellman function being

\begin{matrix} {\hat{B}}_{H} = y^{T} y - \frac{1}{4} {\hat{D}}^{T} \nabla h K A^{- 1} K^{T} \nabla h \hat{D} + {\hat{D}}^{T} \nabla h G . \end{matrix}

(25)

and the Hamiltonian error could be derived as

\begin{matrix} e = {\hat{B}}_{H} - B_{H} = {\hat{B}}_{H} . \end{matrix}

(26)

To minimize the above error, the update law for the weight of the ANN is designed as follows:

\begin{matrix} \dot{\hat{D}} = - α \frac{E}{{(E^{T} E)}^{2}} e + β γ \nabla h K A^{- 1} K^{T} (\frac{\partial V_{1}}{\partial y}) \end{matrix}

(27)

\begin{matrix} E = \nabla h (G - K A^{- 1} K^{T} \nabla h^{T} \hat{D} / 2), \end{matrix}

(28)

where

α > 0

and

β > 0

are constant parameters to be designed and

V_{1}

and

γ

are defined as

\begin{matrix} V_{1} = \frac{1}{2} y^{T} y \end{matrix}

(29)

\begin{matrix} γ = \{\begin{matrix} 1, {\dot{V}}_{1} > 0 \\ 0, {\dot{V}}_{1} \leq 0 \end{matrix} . \end{matrix}

(30)

If we define the estimation error of the ANN weight as

\begin{matrix} \tilde{D} = D^{*} - \hat{D}, \end{matrix}

(31)

e could be rewritten as

\begin{matrix} e = - \frac{1}{4} {\tilde{D}}^{T} K_{1} \tilde{D} - {\tilde{D}}^{T} \nabla h R - δ_{1} \end{matrix}

(32)

\begin{matrix} K_{1} = \nabla h K A^{- 1} K^{T} \nabla h^{T} \end{matrix}

(33)

\begin{matrix} R = G + K {\bar{u}}^{*} + \frac{1}{2} K A^{- 1} K^{T} \nabla δ, \end{matrix}

(34)

and we could also obtain the derivative of

\tilde{D}

as follows

\begin{matrix} \dot{\tilde{D}} = - \frac{α}{E_{1}^{2}} (\nabla h R + \frac{1}{2} K_{1} \tilde{D}) ({\tilde{D}}^{T} \nabla h R + \frac{1}{4} {\tilde{D}}^{T} K_{1} \tilde{D} + δ_{1}) - β γ \nabla h K A^{- 1} K^{T} \frac{\partial V_{1}}{\partial y} \end{matrix}

(35)

\begin{matrix} E_{1} = E^{T} E + 1 . \end{matrix}

(36)

Assumption 2.

It is assumed that

∥ \nabla h ∥ \leq M_{1}

,

∥ K A^{- 1} K^{T} ∥ \leq j_{\max}

,

∥ \nabla δ ∥ \leq k_{\max}

.

Theorem 1.

Consider the nominal system of the spacecraft (6) and the performance function selected as (14); if the approximative optimal control law is designed as (24) and the update law of the ANN weight is designed as (27), then the nominal state y and the estimation error

\tilde{D}

are guaranteed to be uniformly ultimately bounded.

Proof.

Select a Lyapunov function as follows

\begin{matrix} V_{2} = 2 β V_{1} + \frac{1}{2} {\tilde{D}}^{T} \tilde{D}, \end{matrix}

(37)

Taking the time derivative of (37), it gives

\begin{matrix} {\dot{V}}_{2} = 2 β {(\frac{\partial V_{1}}{\partial y})}^{T} \dot{y} + {\tilde{D}}^{T} \dot{\tilde{D}} + X \end{matrix}

(38)

with the last term X satisfying

\begin{matrix} X = & - \frac{α}{E_{1}^{2}} ({({\tilde{D}}^{T} \nabla h R)}^{2} + \frac{3}{4} {\tilde{D}}^{T} \nabla h R {\tilde{D}}^{T} K_{1} \tilde{D} + \frac{1}{8} {({\tilde{D}}^{T} K_{1} \tilde{D})}^{2} + {\tilde{D}}^{T} \nabla h R δ_{1} + \frac{1}{2} {\tilde{D}}^{T} K_{1} \tilde{D} δ_{1}) \\ \leq & - \frac{α}{16 E_{1}^{2}} ∥ {\tilde{D}}^{T} K_{1} \tilde{D} ∥^{2} + \frac{4 α}{E_{1}^{2}} {∥ {\tilde{D}}^{T} \nabla h R ∥}^{2} + \frac{5 α}{2 E_{1}^{2}} δ_{1}^{2} \\ \leq & (\frac{2 α}{E_{1}^{2} a_{1}^{2}} - \frac{α λ_{2}^{2}}{16 E_{1}^{2} λ_{1}^{2}}) ∥ {\tilde{D}}^{T} {\nabla h ∥}^{4} + \frac{5 α}{2 E_{1}^{2}} δ_{1}^{2} + \frac{2 α a_{1}^{2}}{E_{1}^{2}} {∥ R ∥}^{4} . \end{matrix}

(39)

where

λ_{1}

is the maximum eigenvalue of A and

λ_{2} = r_{1}^{2}

,

r_{1} \leq ∥ K ∥ \leq r_{2}

.

a_{1}

is a parameter to be designed, satisfying that

\begin{matrix} \frac{2 α}{E_{1}^{2} a_{1}^{2}} - \frac{α λ_{2}^{2}}{16 E_{1}^{2} λ_{1}^{2}} < 0 . \end{matrix}

(40)

Thus,

{\dot{V}}_{2}

satisfies that

\begin{matrix} {\dot{V}}_{2} = & (\frac{2 α}{E_{1}^{2} a_{1}^{2}} - \frac{α λ_{2}^{2}}{16 E_{1}^{2} λ_{1}^{2}}) ∥ {\tilde{D}}^{T} {\nabla h ∥}^{4} + \frac{5 α}{2 E_{1}^{2}} δ_{1}^{2} + \frac{2 α a_{1}^{2}}{E_{1}^{2}} {∥ R ∥}^{4} + 2 β {(\frac{\partial V_{1}}{\partial y})}^{T} \dot{y} \\ - β γ {\tilde{D}}^{T} \nabla h K A^{- 1} K^{T} \frac{\partial V_{1}}{\partial y} . \end{matrix}

(41)

Let

K_{2} = β γ {\tilde{D}}^{T} \nabla h K A^{- 1} K^{T} \frac{\partial V_{1}}{\partial y}

, while

{\dot{V}}_{1} > 0

,

γ = 1

; then,

K_{2}

would function to stabilize the nominal state y. Then, it can obtain

\begin{matrix} ∥ R ∥ \leq φ + γ_{1}, \end{matrix}

(42)

where

\begin{matrix} φ = \sqrt[4]{γ_{2} ||\frac{\partial V_{1}}{\partial y}||} . \end{matrix}

(43)

And there exists a positive-definite matrix

K_{2}

, such that

\begin{matrix} {\dot{V}}_{1} = - {(\frac{\partial V_{1}}{\partial y})}^{T} K_{2} (\frac{\partial V_{1}}{\partial y}), \end{matrix}

(44)

Additionally, it can be further derived that

\begin{matrix} {\dot{V}}_{2} \leq l_{1} M_{1}^{4} {∥ \tilde{D} ∥}^{4} + \frac{5 α}{2 E_{1}^{2}} δ_{1}^{2} + \frac{16 α a_{1}^{2}}{E_{1}^{2}} (γ_{2} ||\frac{\partial V_{1}}{\partial y}|| + γ_{1}^{4}) + 2 β {(\frac{\partial V_{1}}{\partial y})}^{T} \dot{y} - β γ {\tilde{D}}^{T} \nabla h K A^{- 1} K^{T} \frac{\partial V_{1}}{\partial y}, \end{matrix}

(45)

where

\begin{matrix} l_{1} = \frac{2 α}{E_{1}^{2} a_{1}^{2}} - \frac{α λ_{2}^{2}}{16 E_{1}^{2} λ_{1}^{2}} . \end{matrix}

(46)

By adding and subtracting the term

β {(\frac{\partial V_{1}}{\partial y})}^{T} K A^{- 1} K^{T} (\nabla δ + \nabla h^{T} D^{*})

on the right hand side of (45), it gives

\begin{matrix} {\dot{V}}_{2} = l_{1} M_{1}^{4} {∥ \tilde{D} ∥}^{4} + l_{2} ||\frac{\partial V_{1}}{\partial y}|| - 2 {(\frac{\partial V_{1}}{\partial y})}^{T} K_{2} (\frac{\partial V_{1}}{\partial y}) + β {(\frac{\partial V_{1}}{\partial y})}^{T} K A^{- 1} K^{T} \nabla δ + K_{3}, \end{matrix}

(47)

where

\begin{matrix} K_{3} = \frac{5 α}{2 E_{1}^{2}} δ_{1}^{2} + \frac{16 α a_{1}^{2} γ_{1}^{4}}{E_{1}^{2}} \end{matrix}

(48)

\begin{matrix} l_{2} = \frac{16 α a_{1}^{2} γ_{2}}{E_{1}^{2}} . \end{matrix}

(49)

Moreover, it can be derived that

\begin{matrix} {\dot{V}}_{2} \leq & l_{1} M_{1}^{4} {∥ \tilde{D} ∥}^{4} + \frac{2 β}{λ_{\min} (K_{2})} (\frac{l_{2}^{2}}{β^{2}} + \frac{λ_{\min}^{2} (K_{2})}{4} {||\frac{\partial V_{1}}{\partial y}||}^{2}) - 2 β λ_{\min} (K_{2}) {||\frac{\partial V_{1}}{\partial y}||}^{2} \\ + β λ_{\min} (K_{2}) (\frac{1}{2} {||\frac{\partial V_{1}}{\partial y}||}^{2} + \frac{{(j_{\max} k_{\max})}^{2}}{2 λ_{\min}^{2} (K_{2})}) + K_{3} \\ \leq & l_{1} M_{1}^{4} {∥ \tilde{D} ∥}^{4} - β λ_{\min} (K_{2}) {||\frac{\partial V_{1}}{\partial y}||}^{2} + K_{3} + K_{4}, \end{matrix}

(50)

where

\begin{matrix} K_{4} = \frac{2 l_{2}^{2}}{β λ_{\min} (K_{2})} + \frac{β {(j_{\max} k_{\max})}^{2}}{2 λ_{\min} (K_{2})} . \end{matrix}

(51)

Then, it could be concluded that, when the following inequality is satisfied

\begin{matrix} ||\frac{\partial V_{1}}{\partial y}|| \geq \sqrt{\frac{K_{3} + K_{4}}{β λ_{\min} (K_{2})}} = Φ_{1} . \end{matrix}

(52)

it can be ensured that

{\dot{V}}_{2} \leq 0

, y and

\tilde{D}

are ultimately uniformly bounded.

Additionally, while

{\dot{V}}_{1} \leq 0

,

γ = 0

, then

{\dot{V}}_{2}

satisfies that

\begin{matrix} {\dot{V}}_{2} \leq l_{1} M_{1}^{4} {∥ \tilde{D} ∥}^{4} + (\frac{16 α a_{1}^{2} γ_{2}}{E_{1}^{2}} - 2 β {∥ \dot{y} ∥}_{\min}) ||\frac{\partial V_{1}}{\partial y}|| + K_{3} . \end{matrix}

(53)

When the following inequality is satistied

\begin{matrix} ||\frac{\partial V_{1}}{\partial y}|| \geq \frac{K_{3}}{2 β ∥ \dot{y} ∥_{\min} - \frac{16 α a_{1}^{2} γ_{2}}{E_{1}^{2}}} = Φ_{2} . \end{matrix}

(54)

it can be ensured that

{\dot{V}}_{2} \leq 0

, y and

\tilde{D}

are ultimately uniformly bounded. This completes the proof. □

3.2. Sliding Mode Control Law for Error System

Consider the error system as follows

\begin{matrix} \dot{e} = [\begin{matrix} {\dot{e}}_{1} \\ {\dot{e}}_{2} \end{matrix}] = g (σ_{e}, ω) - g ({\bar{σ}}_{e}, \bar{ω}) + [\begin{matrix} 0 \\ J^{- 1} v \end{matrix}] + [\begin{matrix} 0 \\ J^{- 1} d \end{matrix}], \end{matrix}

(55)

where we let

\begin{matrix} {\dot{e}}_{2} = g_{1} + J^{- 1} v + d \end{matrix}

(56)

\begin{matrix} g_{1} = - J^{- 1} ω^{\times} J ω + J^{- 1} {\bar{ω}}^{\times} J \bar{ω} . \end{matrix}

(57)

Select the nonsingular terminal sliding mode surface as follows

\begin{matrix} s = e_{2} + k n (e_{1}), \end{matrix}

(58)

where k is a positive parameter to be designed and

n (e_{1})

is defined as

\begin{matrix} n (e_{1 i}) = \{\begin{matrix} s i g^{q} (e_{1 i}), i f {\hat{S}}_{i} = 0 o r {\hat{S}}_{i} \neq 0, |e_{1 i} |> Θ \\ q_{1} e_{1 i} + q_{2} s i g^{2} (e_{1 i}), i f {\hat{S}}_{i} \neq 0, |e_{1 i} |\leq Θ \end{matrix}, \end{matrix}

(59)

where

\begin{matrix} {\hat{S}}_{i} = e_{2 i} + k {| e_{1 i} |}^{q} s i g n (e_{1 i}) \end{matrix}

(60)

\begin{matrix} q_{1} = (2 - q) Θ^{q - 1} \end{matrix}

(61)

\begin{matrix} q_{2} = (q - 1) Θ^{q - 2} . \end{matrix}

(62)

and

0 < q < 1

is a constant parameter to be designed.

Θ

is a small positive constant.

Taking the derivative of s yields

\begin{matrix} \dot{s} = k \dot{n} (e_{1}) + g_{1} + J^{- 1} v + J^{- 1} d, \end{matrix}

(63)

where

\begin{matrix} \dot{n} (e_{1 i}) = \{\begin{matrix} q |e_{1 i} |{}^{q - 1}{\dot{e}}_{1 i}, i f {\hat{S}}_{i} = 0 o r {\hat{S}}_{i} \neq 0, |e_{1 i} |> Θ \\ q_{1} {\dot{e}}_{1 i} + 2 q_{2} |e_{1 i} |{\dot{e}}_{1 i}, i f {\hat{S}}_{i} \neq 0, |e_{1 i} |\leq Θ \end{matrix} . \end{matrix}

(64)

Additionally, it could further obtain

\begin{matrix} J \dot{s} = k J \dot{n} (e_{1}) + J g_{1} + v + d . \end{matrix}

(65)

Then, the error control law could be designed as follows

\begin{matrix} v = - k J \dot{n} (e_{1}) - J g_{1} - m_{1} s - m_{2} s i g^{q} (s) - \frac{\hat{d} s}{κ} \end{matrix}

(66)

\begin{matrix} \dot{\hat{d}} = k_{1} (\frac{{∥ s ∥}^{2}}{κ} - k_{2} \hat{d}), \end{matrix}

(67)

where

m_{1}

,

m_{2}

,

k_{1}

and

k_{2}

are positive constant parameters to be designed.

\hat{d}

is the estimation of the upper bound of the external disturbance

\bar{d}

and

\tilde{d} = \hat{d} - \bar{d}

is defined as the estimation error.

Theorem 2.

Consider the error system (7) with external disturbances; the designed error control law (66) with the adaptive law is capable of guaranteeing the finite-time convergence of

e_{1}

and

e_{2}

to a small region around the equilibrium.

Proof.

Select a Lyapunov function as follows

\begin{matrix} V_{3} = \frac{1}{2 k_{1}} {\tilde{d}}^{2} + \frac{1}{2} s^{T} J s, \end{matrix}

(68)

Regarding the time derivative, it presents

\begin{matrix} {\dot{V}}_{3} = & - \frac{1}{k_{1}} \tilde{d} \dot{\hat{d}} + s^{T} J \dot{s} \\ \leq & - \tilde{d} (\frac{{∥ s ∥}^{2}}{κ} - k_{2} \hat{d}) + s^{T} (- m_{1} s - m_{2} s i g^{q} (s) - \frac{\hat{d} s}{κ} + d) \\ \leq & - \frac{\bar{d} {∥ s ∥}^{2}}{κ} + k_{2} \tilde{d} \hat{d} - m_{1} {∥ s ∥}^{2} - m_{2} \sum_{i = 1}^{3} {| s_{i} |}^{q + 1} + ∥ s ∥ ∥ d ∥ \\ \leq & - m_{1} {∥ s ∥}^{2} - m_{2} \sum_{i = 1}^{3} {| s_{i} |}^{q + 1} + \frac{κ}{4} + k_{2} \tilde{d} \hat{d} \\ \leq & - m_{1} {∥ s ∥}^{2} + \frac{κ}{4} + \frac{k_{2}}{2} {\bar{d}}^{2} - \frac{k_{2}}{2} {\tilde{d}}^{2} \\ \leq & - χ V_{3} + ξ, \end{matrix}

(69)

where

\begin{matrix} χ = \min \{\frac{2 m_{1}}{λ_{\max} (J)}, k_{1} k_{2}\} \end{matrix}

(70)

\begin{matrix} ξ = \frac{κ}{4} + \frac{k_{2}}{2} \bar{d}, \end{matrix}

(71)

Then, it can be concluded that s and

\tilde{d}

are ensured to be ultimately uniformly bounded, and there exists a positive constant

d_{0}

such that

\tilde{d} \leq d_{0}

.

Select another Lyapunov function as follows

\begin{matrix} V_{4} = \frac{1}{2} s^{T} J s, \end{matrix}

(72)

Taking the derivative of it presents

\begin{matrix} {\dot{V}}_{4} = & - m_{1} {∥ s ∥}^{2} - m_{2} \sum_{i = 1}^{3} {| s_{i} |}^{q + 1} + ∥ s ∥ ∥ d ∥ - \frac{\hat{d} {∥ s ∥}^{2}}{κ} \\ \leq & - m_{1} {∥ s ∥}^{2} - m_{2} \sum_{i = 1}^{3} {| s_{i} |}^{q + 1} - \frac{\hat{d} {∥ s ∥}^{2}}{κ} + \frac{\bar{d} {∥ s ∥}^{2}}{κ} + \frac{κ}{4} \\ \leq & - m_{1} {∥ s ∥}^{2} + \frac{d_{0} {∥ s ∥}^{2}}{κ} + \frac{κ}{4} - \frac{m_{2} 2^{\frac{q + 1}{2}}}{λ_{\max}^{\frac{q + 1}{2}} (J)} V_{4}^{\frac{q + 1}{2}} \\ = & (\frac{d_{0}}{κ} - m_{1}) {∥ s ∥}^{2} - (\frac{m_{2} 2^{\frac{q + 1}{2}}}{λ_{\max}^{\frac{q + 1}{2}} (J)} - \frac{\frac{κ}{2}}{2 V_{4}^{\frac{q + 1}{2}}}) V_{4}^{\frac{q + 1}{2}} . \end{matrix}

(73)

If it is satisfied that

(\frac{d_{0}}{κ} - m_{1}) \leq 0

, the system is ensured to reach the sliding mode surface within finite time and

e_{1}

and

e_{2}

are guaranteed to converge to a small region around the equilibrium. □

4. Simulation Results

In this section, a numerical simulation regarding the problem of spacecraft reorientation control is carried out to verify the effectiveness of the proposed tube-based control scheme. The simulation parameters are selected as follows. To start with, the inertia matrix of the spacecraft is chosen as

\begin{matrix} J = [\begin{matrix} 350 & 3 & 4 \\ 3 & 280 & 10 \\ 4 & 10 & 190 \end{matrix}] (kg \cdot m^{2}) . \end{matrix}

Additionally, we set the initial value of the error MRPs and the angular velocity as

{[0.2, - 0.1, 0.1]}^{T}

and

{[- 1, 2, - 3]}^{T} (^{\circ} / s)

, respectively. The desired angular velocity is 0. The control torque is bounded by

| u_{i} | \leq 0.5 (N \cdot m), i = 1, 2, 3

and the external disturbance is selected as

\begin{matrix} d = 10^{- 3} \times [\begin{matrix} 5 + 2.5 sin (0.1 t) \\ - 4 + 2 cos (0.05 t) \\ 3 - 8 sin (0.3 t) \end{matrix}] (N \cdot m) . \end{matrix}

Moreover, the parameters for the tube-based controller are selected as follows.

α = 800

,

β = 600

,

p_{1} = 0.1

;

k = 0.1

,

m_{1} = 500

,

m_{2} = 0.1

,

k_{1} = 10

,

k_{2} = 0.01

and

κ = 0.5

. The initial value of the adaptive parameter

\hat{d}

is set as 0 and the activation function for the adaptive neural network is selected as

\begin{matrix} h = & [y_{11}^{2}, y_{11} y_{12}, y_{11} y_{13}, y_{11} y_{21}, y_{11} y_{22}, y_{11} y_{23}, y_{12}^{2}, \\ y_{12} y_{13}, y_{12} y_{21}, y_{12} y_{22}, y_{12} y_{23}, y_{13}^{2}, y_{13} y_{21}, y_{13} y_{22}, y_{13} y_{23}, \\ 10 y_{21} arctan (10 y_{21}) - 0.5 ln (1 + 100 y_{21}^{2}), \\ 10 y_{22} arctan (10 y_{22}) - 0.5 ln (1 + 100 y_{22}^{2}), \\ 10 y_{23} arctan (10 y_{23}) - 0.5 ln (1 + 100 y_{23}^{2})] . \end{matrix}

The simulation results have been shown in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6. Figure 1 and Figure 2 indicate the convergence of the error MRPs

σ_{e}

and the relative angular velocity of the spacecraft with the subplot showing that the steady-state errors are at the level of

10^{- 5}

. Additionally, the nominal error MRP

{\bar{σ}}_{e}

is plotted in Figure 3, where its convergence is clearly indicated. The control torque that is bounded by

0.5 N \cdot m

is shown in Figure 4 and the adaptive parameter is plotted in Figure 5. Figure 6 indicates the estimation of the ANN weight.

Figure 1. Time responses of the error MRPs

σ_{e} (t)

.

Figure 2. Time responses of the relative angular velocity

ω (t)

(rad/s).

Figure 3. Time responses of the nominal error MRPs

{\bar{σ}}_{e} (t)

.

Figure 4. The control torque

u (t)

(N·m).

Figure 5. Time responses of the adaptive parameter

\hat{d} (t)

.

Figure 6. The estimation of the ANN weight

\hat{D} (t)

.

5. Conclusions

This article has proposed a novel control method for spacecraft attitude reorientation with external unknown disturbances. Based on the tube-based framework formed by a nominal system and an error system, the design of the final control law has been divided into two parts: the nominal control law and the error control law. The adaptive dynamic programming technique is applied to the design of the nominal controller, which serves to provide a nominal trajectory to the desired attitude, and the nonsingular terminal sliding mode scheme is adopted when developing the error controller, which could lead the actual states to track the nominal trajectory. Through the Lyapunov approach, we have analyzed the control system stability and then verified its effectiveness via a numerical simulation. Compared to other methodologies, such as adaptive control, back stepping control, etc., which might cause overshooting of the system states during manoeuvrer control, the proposed ADP-based approach for spacecraft attitude reorientation is conducive to improving the optimal control performance and thus minimizing the energy consumption, while the tube-based framework and the NTSM scheme contribute to enhancing system stability and suppressing disturbances at the same time.

Author Contributions

S.L.: conceptualization; investigation; methodology. K.L.: validation; visualization; writing—original draft, review and editing. M.L.: conceptualization; funding acquisition; resources; supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Science Center Program of National Natural Science Foundation of China (Grant No. 62188101), National Natural Science Foundation of China (Grant No. 62273116), the Guangdong Major Project of Basic and Applied Basic Research (Grant No. 2019B030302001), the SiYuan Collaborative Innovation Alliance of Artificial Intelligence Science and Technology (Grant No. HTKJ2023SY502003), and the Heilongjiang Touyan Team.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Li, L.; Zhou, X.; Hu, Z.; Gao, L.; Li, X.; Ni, X.; Chen, F. On-orbit monitoring flying aircraft day and night based on SDGSAT-1 thermal infrared dataset. Remote Sens. Environ. 2023, 298, 113840. [Google Scholar] [CrossRef]
Jiao, B.; Sun, Q.; Han, H.; Dang, Z. A parametric design method of nanosatellite close-range formation for on-orbit target inspection. Chin. J. Aeronaut. 2023, 36, 194–209. [Google Scholar] [CrossRef]
Xiao, Y.; de Ruiter, A.; Ye, D.; Sun, Z. Attitude Coordination Control for Flexible Spacecraft Formation Flying with Guaranteed Performance Bounds. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 1534–1550. [Google Scholar] [CrossRef]
Chen, Z.; Chen, Q.; He, X.; Sun, M. Adaptive Backstepping Control Design for Uncertain Rigid Spacecraft with Both Input and Output Constraints. IEEE Access 2018, 6, 60776–60789. [Google Scholar] [CrossRef]
Wang, Y.; Tang, S.; Guo, J.; Wang, X.; Liu, C. Fuzzy-Logic-Based Fixed-Time Geometric Backstepping Control on SO(3) For Spacecraft Attitude Tracking. IEEE Trans. Aerosp. Electron. Syst. 2019, 55, 2938–2950. [Google Scholar] [CrossRef]
Wang, Y.; Ji, H. Integrated relative position and attitude control for spacecraft rendezvous with ISS and finite-time convergence. Aerosp. Sci. Technol. 2019, 85, 234–245. [Google Scholar] [CrossRef]
Hou, Z.; Lan, X. Adaptive sliding mode and RBF neural network based fault tolerant attitude control for spacecraft with unknown uncertainties and disturbances. Adv. Space Res. 2024, 74, 1680–1692. [Google Scholar] [CrossRef]
Gao, J.; Fu, Z.; Zhang, S. Adaptive Fixed-Time Attitude Tracking Control for Rigid Spacecraft with Actuator Faults. IEEE Trans. Ind. Electron. 2019, 66, 7141–7149. [Google Scholar] [CrossRef]
Kang, Z.; Shen, Q.; Wu, S.; Damaren, C.J. Saturated adaptive pose tracking control of spacecraft on SE(3) under attitude constraints and obstacle-avoidance constraints. Automatica 2024, 159, 111367. [Google Scholar] [CrossRef]
Liu, Q.Z.; Zhang, L.; Sun, B.; Xiao, Y.; Fan, G.W. Fixed-Time Disturbance Observer-Based Attitude Prescribed Performance Predictive Control for Flexible Spacecraft. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 3209–3220. [Google Scholar] [CrossRef]
Xuan-Mung, N.; Golestani, M. Energy-Efficient Disturbance Observer-Based Attitude Tracking Control with Fixed-Time Convergence for Spacecraft. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 3659–3668. [Google Scholar] [CrossRef]
Marshall, M.A.; Goel, A.; Pellegrino, S.R.M. Power-Optimal Guidance for Planar Space Solar Power Satellites. J. Guid. Control Dyn. 2020, 43, 518–535. [Google Scholar] [CrossRef]
Li, Q.; Gao, D.; Sun, C.; Song, S.; Niu, Z.; Yang, Y. Prescribed performance-based robust inverse optimal control for spacecraft proximity operations with safety concern. Aerosp. Sci. Technol. 2023, 136, 108229. [Google Scholar] [CrossRef]
Wang, P.; Zhang, X. Optimized Bézier-curve-based command generation and robust inverse optimal control for attitude tracking of spacecraft. Aerosp. Sci. Technol. 2022, 121, 107183. [Google Scholar] [CrossRef]
Luo, W.; Chu, Y.C.; Ling, K.V. H-infinity Inverse Optimal Attitude-Tracking Control of Rigid Spacecraft. J. Guid. Control Dyn. 2005, 28, 481–494. [Google Scholar] [CrossRef]
Huang, Y.; Zhang, Z.; Yang, X. Backstepping based neural H-infinite optimal tracking control for nonlinear state constrained systems with input delay and disturbances. Neurocomputing 2024, 595, 127869. [Google Scholar] [CrossRef]
Liu, Y.; Ma, G.; Lyu, Y.; Wang, P. Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers. Neurocomputing 2022, 484, 67–78. [Google Scholar] [CrossRef]
Wang, R.; Zhuang, Z.; Tao, H.; Paszke, W.; Stojanovic, V. Q-learning based fault estimation and fault tolerant iterative learning control for MIMO systems. ISA Trans. 2023, 142, 123–135. [Google Scholar] [CrossRef]
Dierks, T.; Jagannathan, S. Optimal control of affine nonlinear continuous-time systems. In Proceedings of the 2010 American Control Conference, Baltimore, MD, USA, 30 June–2 July 2010; pp. 1568–1573. [Google Scholar] [CrossRef]
Yang, H.; Hu, Q.; Dong, H.; Zhao, X. ADP-Based Spacecraft Attitude Control Under Actuator Misalignment and Pointing Constraints. IEEE Trans. Ind. Electron. 2022, 69, 9342–9352. [Google Scholar] [CrossRef]
Xiao, B.; Zhang, H.; Chen, Z.; Cao, L. Fixed-Time Fault-Tolerant Optimal Attitude Control of Spacecraft with Performance Constraint via Reinforcement Learning. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 7715–7724. [Google Scholar] [CrossRef]
Yuan, L.; Wang, L.; Zhang, J. Adaptive dynamic programming base on MMC device of a flexible high-altitude long endurance aircraft. Aerosp. Sci. Technol. 2024, 151, 109305. [Google Scholar] [CrossRef]
Wei, Q.; Yang, Z.; Su, H.; Wang, L. Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems with Disturbances. IEEE Trans. Autom. Sci. Eng. 2024, 21, 343–352. [Google Scholar] [CrossRef]
Jing, C.; Xu, H.; Niu, X.; Song, X. Adaptive Nonsingular Terminal Sliding Mode Control for Attitude Tracking of Spacecraft with Actuator Faults. IEEE Access 2019, 7, 31485–31493. [Google Scholar] [CrossRef]
Qiao, J.; Li, Z.; Xu, J.; Yu, X. Composite Nonsingular Terminal Sliding Mode Attitude Controller for Spacecraft with Actuator Dynamics Under Matched and Mismatched Disturbances. IEEE Trans. Ind. Inform. 2020, 16, 1153–1162. [Google Scholar] [CrossRef]
Zhang, L.; Wang, H.; Zhu, Y.; Yang, J. Tube-based attitude control of rigid-bodies with magnitude-bounded disturbances. Automatica 2021, 133, 109845. [Google Scholar] [CrossRef]
Arjun Ram, S.P.; Akella, M.R. Uniform Exponential Stability Result for the Rigid-Body Attitude Tracking Control Problem. J. Guid. Control Dyn. 2020, 43, 39–45. [Google Scholar] [CrossRef]
Li, Q.; Yuan, J.; Zhang, B. Extended state observer based output control for spacecraft rendezvous and docking with actuator saturation. ISA Trans. 2019, 88, 37–49. [Google Scholar] [CrossRef]

Figure 1. Time responses of the error MRPs

σ_{e} (t)

.

Figure 2. Time responses of the relative angular velocity

ω (t)

(rad/s).

Figure 3. Time responses of the nominal error MRPs

{\bar{σ}}_{e} (t)

.

Figure 4. The control torque

u (t)

(N·m).

Figure 5. Time responses of the adaptive parameter

\hat{d} (t)

.

Figure 6. The estimation of the ANN weight

\hat{D} (t)

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Adaptive Dynamic Programming-Based Spacecraft Attitude Control Under a Tube-Based Framework

Abstract

1. Introduction

2. Problem Formulation and Preliminaries

2.1. Error Attitude Dynamical Model of Rigid Spacecraft

2.2. Tube-Based Control Framework

3. Main Results

3.1. ADP-Based Control Law for Nominal System

3.2. Sliding Mode Control Law for Error System

4. Simulation Results

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics