Next Article in Journal
Modal Vibration Suppression for Magnetically Levitated Rotor Considering Significant Gyroscopic Effects and Interface Contact
Next Article in Special Issue
Dynamic Parameter Identification Method for Space Manipulators Based on Hybrid Optimization Strategy
Previous Article in Journal
Efficient and High-Precision Method of Calculating Maximum Singularity-Free Space in Stewart Platform Based on K-Means Clustering and CNN-LSTM-Attention Model
Previous Article in Special Issue
Distributed Passivity-Based Control for Multiple Space Manipulators Holding Flexible Beams
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nonlinear Optimal Control for Spacecraft Rendezvous and Docking Using Symplectic Numerical Method

by
Zhengtao Wei
,
Jie Yang
,
Hao Wen
,
Dongping Jin
and
Ti Chen
*
State Key Laboratory of Mechanics and Control for Aerospace Structures, Nanjing University of Aernautics and Astronautics, Nanjing 210016, China
*
Author to whom correspondence should be addressed.
Actuators 2025, 14(2), 75; https://doi.org/10.3390/act14020075
Submission received: 31 December 2024 / Revised: 25 January 2025 / Accepted: 5 February 2025 / Published: 6 February 2025
(This article belongs to the Special Issue Dynamics and Control of Aerospace Systems—2nd Edition)

Abstract

This paper addresses the autonomous rendezvous and docking between a chaser spacecraft and a target spacecraft. An optimal control method is employed to plan the rendezvous and docking maneuver, considering various constraints, including force, velocity, field of view, and collision avoidance with a diamond-shaped obstacle. The optimal trajectories are derived using a symplectic algorithm, which ensures high accuracy and enhances computational efficiency. These trajectories serve as the reference for the maneuver. A PD-based tracking control method is proposed to enable real-time feedback control. An air-bearing experimental system, encompassing state measurement, data transmission, and processing, is established to conduct ground-based tracking experiments. Furthermore, specialized simulators for the chaser and target spacecraft, equipped with a docking mechanism, are designed. Experimental results validate both the feasibility of the reference trajectories and the effectiveness of the PD tracking control approach.

1. Introduction

Autonomous Rendezvous and Docking (ARVD) plays a crucial role in emerging space missions such as on-orbit assembly, on-orbit servicing, and deep space exploration [1,2]. As a fundamental operation, the ARVD of two spacecraft has garnered significant attention from both academia and industry. For instance, in the context of ARVD for micro-satellites in on-orbit assembly, Fear et al. proposed a guidance algorithm based on model predictive control [3]. Gao et al. applied time-synchronized control theory to the stable maneuvering of a six-degree-of-freedom service spacecraft, enhancing control performance while reducing energy consumption [4]. Colagrossi et al. developed a comprehensive guidance, navigation, and control framework tailored to ARVD requirements in the Cislunar environment [5]. In terms of engineering applications, NASA’s Demonstration for Autonomous Rendezvous Technology (DART) project tested both the software and hardware involved in the mission [6]. Additionally, the International Space Station (ISS), as a typical example of a large spacecraft assembled in orbit, has completed over forty construction flights, providing extensive validation of ARVD technology [7].
To achieve stable and safe relative maneuvers between the Chaser Spacecraft (CS) and the Target Spacecraft (TS), it is crucial to incorporate various mission or path constraints into the guidance control algorithm. Sun explicitly integrated control input saturation into the design of a fault-tolerant controller [8]. Zhao et al. further addressed field-of-view constraints for optical sensors during the final phase of docking [9]. Li et al. introduced boundary constraints on the maneuvering velocity of the CS to ensure its capability to avoid emergencies during the process [10]. Collision avoidance is another critical consideration. Boyarko et al. imposed a constraint on the minimum distance between the two spacecraft, ensuring it remains above a predefined threshold to prevent collisions [11]. Xu et al. proposed a method to avoid unexpected collisions between the CS and TS by utilizing a circular envelope representation [12]. Beyond direct collision with the TS, Weiss et al. tackled the issue of obstacle avoidance by approximating the obstacle’s envelope using an ellipsoid and representing the collision avoidance constraint geometrically with rotating hyperplanes [13]. Similarly, Wei et al. addressed collision avoidance between the TS and space debris by calculating the distance between two circular envelopes [14]. However, in the studies mentioned above, collision avoidance objects were often described using simplified geometric shapes. While this improves computational efficiency, it may limit the planning space for feasible paths. Therefore, methods for contour characterization that strike a balance between complex shape descriptions and computational simplicity merit further attention.
The implementation of ARVD is fundamentally a problem of trajectory planning and control under multiple constraints. Optimal control theory provides an effective approach to solving such problems. For nonlinear control optimization, solution methods can be broadly categorized into direct and indirect methods. The direct method first converts the optimal control problem into a finite-dimensional nonlinear programming problem, which is then solved using established numerical tools. The primary advantage of the direct method lies in its ability to handle any optimization problem within a unified algorithmic framework, making it widely applicable in engineering. For instance, Sanjeev et al. studied the trajectory optimization of low-thrust spacecraft using the Hermite–Legendre–Gauss–Lobatto method. They introduced the use of the Edelbaum trajectory to set bounds, scale the problem, and generate an appropriate initial guess [15]. In a similar vein, Zhong et al. developed a piecewise two-phase optimal control strategy for rapid nanosatellite deorbiting, leveraging a short electrodynamic tether. They utilized a direct collocation method grounded in the Hermite–Simpson approach with discretization schemes that feature coincident nodes. This method effectively reduces the problem size and improves computational efficiency [16].
Unlike the direct method, the indirect method begins by applying variational techniques or Pontryagin’s Maximum Principle to derive the first-order necessary conditions for the optimal control problem [17]. These conditions are subsequently discretized and converted into a two-point boundary value problem for solving. The main advantage of the indirect method is that it directly satisfies the first-order necessary conditions of the optimal control problem. However, its drawback is the complex and tedious derivation required for specific optimization problems [18]. Fabien introduced a multiple shooting method to solve boundary-value problems involving both differential and algebraic equations, presenting two examples to demonstrate the effectiveness of the approach [19]. Despite this, most numerical algorithm research tends to focus primarily on numerical approximation accuracy and computational speed, often neglecting the inherent properties of optimal control systems. Symplectic algorithms, derived from appropriate difference schemes or variational principles, maintain an indirect symplectic structure. This structure is highly efficient due to its sparsity, which enhances both solution speed and convergence rate. Consequently, symplectic algorithms not only preserve the intrinsic properties of the optimal control system but also ensure numerical accuracy and computational efficiency through the sparse structure of the associated matrices [20,21,22]. Peng et al. proposed three different symplectic numerical approaches based on the variational principle to solve the optimal rendezvous control problem. The numerical results demonstrated that, although the three approaches exhibited different numerical performances, they all converged to the same results as the number of time intervals increased [23,24].
To verify the applicability of the guidance algorithm, a ground-based experiment is essential. For spacecraft proximity rendezvous and docking tasks, Wei et al. developed an air-bearing experimental system, which consists of air-bearing spacecraft simulators, a granite platform, and a binocular vision measurement system. This setup was used to validate the proposed analytical rendezvous and docking control algorithm [25]. Xu et al. extended this work by incorporating a collision avoidance scenario with an obstacle simulator, successfully achieving collision-free autonomous rendezvous and docking between a chaser spacecraft simulator and a stationary target spacecraft simulator [26]. Additionally, in ground-based experiments, it is crucial to account for the hardware performance of the simulators and the potential impact of the measurement systems on the results.
This paper focuses on the numerical simulation and ground-based experiment of ARVD between a chaser spacecraft and a target spacecraft, where the entire process is limited to the orbital plane [1,14,25,27,28,29,30]. In this study, the symplectic algorithm is employed to solve the optimal control problems of ARVD, ensuring that the characteristics of the original system are preserved, while maintaining high accuracy and improving computational efficiency. Additionally, a superquadratic curve is used to describe the shape of the obstacle, leveraging its ability to represent complex shapes while remaining simple. An analytical expression is applied to avoid numerical differentiation and reduce calculation errors. To address experimental challenges, this paper introduces a ground-based experimental system and proposes an indirect method to meet the requirements of real-time feedback control. The optimal trajectories obtained from the symplectic algorithm serve as reference trajectories, and a PD-based tracking control method is introduced to ensure that the simulator follows these reference trajectories.
This paper is organized as follows. Section 2 presents the dynamic model for the ARVD system, along with the superquadratic curve, the relevant constraints, and the formulation of the optimal control problem. Section 3 introduces the symplectic algorithm for generating optimal trajectories and the PD tracking control method for the ground-based experiment. The numerical simulation is provided in Section 4, followed by the experimental studies in Section 5. Finally, Section 6 concludes the paper.

2. Problem Formulation

2.1. System Dynamics

The ARVD mission between a chaser spacecraft and a target spacecraft is considered. When the target spacecraft is in a circular orbit and the distance between the chaser spacecraft and the target spacecraft is much smaller than the orbital radius, the relative translational motion of the chaser with respect to the target is described by the Clohessy–Wiltshire (CW) equations, as follows [1]:
x ¨ H 2 n y ˙ H 3 n 2 x H = F x H m y ¨ H + 2 n x ˙ H = F y H m z ¨ H + n 2 z H = F z H m
where x H , y H , and z H are the projection coordinates of the relative position vector in the Hill coordinate system; F (   ) H is the control force; m is the mass of the chaser spacecraft; and n is the orbital rate of the target spacecraft.
The rotational dynamics of a single spacecraft lead to the following Euler’s equations [14]:
ω ˙ i x = ( J i y J i z ) ω i y ω i z + τ i x J i x ω ˙ i y = ( J i z J i x ) ω i z ω i x + τ i y J i y ω ˙ i z = ( J i x J i y ) ω i x ω i y + τ i z J i z
where J C x ,   J C y ,   J C z and J T x ,   J T y ,   J T z denote the principal moments of inertia of the chaser and target spacecraft, respectively.
Additionally, if the mission duration is much shorter than the orbital period, orbital effects are neglected. When the entire process is confined to the orbital plane, the resulting dynamic equations simplify to a double integral form [1,14,28]. As illustrated in Figure 1, the target spacecraft is treated as a stationary object, while the dynamic equations of the chaser spacecraft, with one rotational and two translational degrees of freedom, are given by:
x ¨ = F x m ,   y ¨ = F y m ,   θ ¨ = T I z
where F x , F y , and T are the control forces and torque; m is the mass of the chaser spacecraft; and I z is the moment of inertia about the vertical axis. The state-space form of the chaser spacecraft model can be written as:
x ˙ = A x + B u
where x = [ x , y , θ , x ˙ , y ˙ , θ ˙ ] T denotes the state vector, u = [ F x , F y , T ] T denotes the control force, and A and B are the corresponding state and control constant matrices, respectively.
The mission consists of two phases: the obstacle avoidance phase and the docking phase. The shape of the diamond obstacle is represented by a superquadratic curve, while the chaser spacecraft is assumed to have a circular appearance with a specified radius R C . The final system configuration of the docking phase is shown in Figure 2. The corresponding states of the chaser spacecraft are given by:
x f = x T + L total cos γ ,   y f = y T L total sin γ ,   θ f = π + θ T x ˙ f = x ˙ T ,   y ˙ f = y ˙ T ,   θ ˙ f = θ ˙ T
with:
L total = L EDM + L CS 2 + L TS 2
where x f = [ x f , y f , θ f , x ˙ f , y ˙ f , θ ˙ f ] T denotes the final state of the chaser spacecraft, x T = [ x T , y T , θ T , x ˙ T , y ˙ T , θ ˙ T ] T denotes the position of the target spacecraft, γ denotes the angle between the line connecting the centroids of the two spacecraft and the X-axis, L EDM = 0.068   m is the length of the Electromagnetic Docking Mechanism (EDM), and L CS and L TS are the side lengths of the chaser spacecraft and target spacecraft, respectively. Specifically, L CS and L TS hold the same value, equal to L spacecraft .

2.2. Constraints Modeling

Due to the complex shapes of obstacles and the target spacecraft, it is challenging to represent these shapes using a general curve. The superquadratic curve offers a flexible way to describe such complex shapes using a simple formula while also improving calculation accuracy and avoiding the need for numerical differentiation [31,32,33]. In this subsection, the superquadratic curve is used to model the obstacle, as shown in Figure 1.
The standard expression of the superquadratic curve can be expressed as:
x sq 2 a 2 s + y sq 2 b 2 s = 1
where x sq and y sq are the independent variables; a , b , and s are positive real numbers.
Figure 3 shows a group of superquadratic curves. The value of s is from 0 to . As shown in the figure, the shape changes among the following seven conditions: (1) “+” line ( s = 0 ); (2) concave asteroid ( 0 < s < 0.5 ); (3) diamond ( s = 0.5 ); (4) thin ellipse ( 0.5 < s < 1 ); (5) ellipse ( s = 1 ); (6) fat ellipse ( s > 1 ); (7) rectangle ( s ). Table 1 further introduces some special superquadratic curves.
To ensure that the chaser spacecraft follows a safe and feasible trajectory until it makes contact with the target spacecraft through the docking mechanism, the following constraints are imposed.
Sensor field-of-view constraint: Sensors are installed on the target spacecraft to detect the relative distance and attitude of the chaser spacecraft. Therefore, the centroid of the chaser spacecraft must remain within the sensor’s field of view throughout the entire process. This constraint can be expressed mathematically as:
tan θ T α s 2 y y s x x s tan θ T + α s 2
where ( x s , y s ) denotes the origin of the sensor.
Collision avoidance: Any collision with obstacles or target spacecraft must be avoided. Based on Equation (7) and Table 1, by setting a = b = l o , the concerned diamond-shaped obstacle constraint can be expressed as:
x x o + y y o l o + R C
where ( x o , y o ) denotes the position of the center of the obstacle.
Thruster force Constraints: Considering the spacecraft design, the control force is subject to finite upper and lower bounds:
u u max
where u max = [ F x max , F y max , T max ] T denotes the bound of the control force.
Velocity and angular velocity constraints: To ensure the spacecraft’s controllability in the event of an emergency, both the velocity and angular velocity are constrained by finite upper and lower bounds:
v v max
where v = [ x ˙ , y ˙ , θ ˙ ] T and v max = [ x ˙ max , y ˙ max , θ ˙ max ] T denote the bound of the velocity and angular velocity.

2.3. Optimal Control Problem Formulation

The performance index function is defined as:
J = 1 2 t 0 t f [ ( x x f ) T P ( x x f ) + ( u u f ) T R ( u u f ) ] d t
where t 0 and t f denote the initial and final times, respectively, and x f can be obtained from Equations (5) and (6). u f = [ F x f ,   F yf ,   T f ] T represents the final control variable, which is given as u f = [ 0 ,   0 ,   0 ] T . The matrices P and R represent the performance index function weights associated with the final states and control variables, respectively.
The optimal control problem for the ARVD maneuver can be formulated as follows: find the control histories u and state histories x that minimize the performance index J , while simultaneously transferring the system described by Equation (4) from an initial state x 0 = [ x 0 , y 0 , θ 0 , x ˙ 0 , y ˙ 0 , θ ˙ 0 ] T to a final state x f = [ x f ,   y f ,   θ f ,   x ˙ f ,   y f ,   θ ˙ f ] T that satisfies Equations (5) and (6), and respecting the path constraints outlined in Equations (8)–(11).

3. Trajectory Generation and Tracking

In this section, the symplectic algorithm is used to obtain the solutions to the Nonlinear Optimal Control (NOC) problem [34,35], and based on the derived optimal trajectories, a PD-based tracking control method is then proposed.

3.1. Theory of Symplectic Algorithm

The performance index of the NOC problem is defined as:
J = t 0 t f L ( x , u , t ) d t
where the detailed formulation of L ( x , u , t ) is given by Equation (12). The dynamics equations, boundary conditions, and state-control constraints are as follows:
x ˙ = f ( x , u , t ) ,     x ( t 0 ) = x 0 ,     x ( t f ) = x f h ( x , u , t ) 0
where f ( x , u , t ) is system dynamics shown in Equation (4); h ( x , u , t ) denotes the inequality constraints presented by Equations (8)–(11).
To reduce computation and complexity, a quasi-linearization method is adopted in this work to linearize the nonlinear optimal control problem defined in Equations (13) and (14). By extending the performance index, dynamic equations, and state-control constraints, the iterative process of the linearized optimal control problem is obtained as follows.
The performance index function can be written as:
J k + 1 = t 0 t f L k + 1 d t
where:
L ( k + 1 ) = L ¯ ( k ) + ( x ( k + 1 ) x ( k ) ) T L x ( k ) + ( u ( k + 1 ) u ( k ) ) T L u ( k ) + 1 2 ( x ( k + 1 ) x ( k ) ) T L x x ( k ) ( x ( k + 1 ) x ( k ) ) + ( x ( k + 1 ) x ( k ) ) T L x u ( k ) ( u ( k + 1 ) u ( k ) ) + 1 2 ( u ( k + 1 ) u ( k ) ) T L u u ( k ) ( u ( k + 1 ) u ( k ) )
with
L ¯ ( k ) = L ( x ( k ) , u ( k ) , t )
L x ( k ) = L ¯ ( x , u , t ) x | x k , u k ,   L u ( k ) = L ¯ ( x , u , t ) x | x k , u k
L x x ( k ) = L ¯ 2 ( x , u , t ) x 2 | x k , u k ,   L x u ( k ) = L ¯ ( x , u , t ) x u | x k , u k ,   L u u ( k ) = L ¯ ( x , u , t ) u 2 | x k , u k
The state-control constraints are:
C ( k ) x ( k + 1 ) + D ( k ) u ( k + 1 ) + v ( k ) 0
where:
C ( k ) = h ( x , u , t ) x | x ( k ) , u ( k ) ,   D ( k ) = h ( x , u , t ) u | x ( k ) , u ( k ) v ( k ) = h ( x ( k ) , u ( k ) , t ) C ( k ) x ( k ) D ( k ) u ( k )
The variables x k + 1 and u k + 1 in the above equations represent the state and control vectors for the current ( k + 1 )th iteration, while x k and u k are the state and control vectors from the previous k th iteration, which serve as reference values for the current computation. As a result, the NOC problem is transformed into a series of Linear Quadratic Optimal Control (LQOC) problems, and the symplectic approach is applied to solve each LQOC problem.
To transform the inequality constraints defined in Equation (20) into equality constraints, the parameter variables α ( k + 1 ) are introduced as:
C ( k ) x ( k + 1 ) + D ( k ) u ( k + 1 ) + v ( k ) + α ( k + 1 ) = 0 ,       α ( k + 1 ) 0
By introducing the cosatate variables λ ( k + 1 ) and parameter variables β ( k + 1 ) , the Hamilton function can be defined as follows:
H ( k + 1 ) = L ( x ( k + 1 ) , u ( k + 1 ) , t )   +   ( λ ( k + 1 ) ) T f ( x ( k + 1 ) , u ( k + 1 ) , t ) + ( β ( k + 1 ) ) T ( h ( x ( k + 1 ) , u ( k + 1 ) , t ) + α ( k + 1 ) )
The variations of the augmented performance index function are computed, and the first-order necessary conditions are subsequently derived.
H ( k + 1 ) u ( k + 1 ) = 0
x ˙ ( k + 1 ) = H ( k + 1 ) λ ( k + 1 )
λ ˙ ( k + 1 ) = H ( k + 1 ) x ( k + 1 )
H ( k + 1 ) β ( k + 1 ) = 0
It is important to note that the following complementary conditions must be satisfied, according to Pontryagin’s minimum principle [34,35].
α ( k + 1 ) 0 ,   β ( k + 1 ) 0 ,   ( α ( k + 1 ) ) T β ( k + 1 ) = 0
The optimal solutions can be obtained by solving Equations (24)–(28) with the initial conditions and the final conditions specified in Equations (5) and (6).

3.2. PD Tracking Control

To track the position and velocity, the formulation of the PD control can be described as follows
u ( t ) = K p e ( t ) K d d e ( t ) d t
where e ( t ) and d e ( t ) d t are the error of position and velocity; K p and K d are the proportional and derivative coefficients, respectively.
According to Equation (29), the control law for translational motion can be expressed as
u i ( t ) = K p ( x i x d i ) K d ( x ˙ i x ˙ d i )     i = 1 , 2 , 3 N
where N is the number of points, K p and K d are the control coefficients, and x d i = [ x d i ,   y d i ] T and x ˙ d i = [ x ˙ d i ,   y ˙ d i ] T denote the desired position and velocity at the i th point. x i = [ x i ,   y i ] T and x ˙ i = [ x ˙ i ,   y ˙ i ] T represent the tracking trajectory and velocity of the translational motion from the ( i 1 ) th point to the i th point, while u i ( t ) = [ F x i , F y i ] T is the required thrust from the ( i 1 ) point to the i th point.
Similarly, the control law for rotational motion can be given as:
T i ( t ) = K p ( θ i θ d i ) K d ( θ ˙ i θ ˙ d i )       i = 1 , 2 , 3 N
where K p and K d are the control coefficients; θ d i and θ ˙ d i denote the desired attitude and angular velocity at the i th point. θ i and θ ˙ i represent the tracking trajectory and angular velocity of the rotational motion from the ( i 1 ) th point to i th point, while T i ( t ) is the required torque from the ( i 1 ) th point to i th point.

4. Numerical Results

The results of the numerical simulation are presented in this section. Table 2 lists the physical parameters of the chaser spacecraft and the target spacecraft. The initial states of the chaser spacecraft and the position of the target spacecraft are summarized in Table 3. Table 4 provides the parameters of the constraints.
The matrices for the performance index are chosen as:
P = diag ( 10 , 10 , 0 . 01 , 600 , 600 , 0 . 02 ) ,   R = diag ( 400 , 400 , 800 )
The simulation is conducted in the Matlab Simulink (R2022b) environment, with a simulation duration of 45 s. Figure 4, Figure 5 and Figure 6 present the results obtained using the symplectic algorithm. Figure 4 shows the translational and rotational trajectories of the chaser spacecraft. From these figures, the following conclusions can be drawn: (1) at all times, the centroid of the chaser spacecraft remains within the sensor’s field of view; (2) the chaser successfully avoids collision while passing by the obstacle. Moreover, the constraints defined in Equations (10) and (11) are satisfied, as shown in Figure 5 and Figure 6.

5. Experimental Studies

5.1. Experimental System

As shown in Figure 7, the experimental system primarily consists of a granite platform, a binocular camera, an Image Processing Workstation (IPW), and two spacecraft simulators that float above the granite platform using air bearings. The binocular camera, mounted on the ceiling, captures the positions of fluorescent markers in real time. The acquired image data is then transmitted to the IPW, where the digital image is processed to obtain the three-dimensional coordinates. This information is forwarded to the Industrial PC (IPC) via a ZigBee module. The IPC calculates the control law in real time, converting the continuous control law into a series of fully equivalent switching signals using the pulse width pulse frequency modulation method [36,37]. These switching signals control the motion of the chaser spacecraft simulator through the solenoid valves. Notably, the electromagnet on the Electromagnetic Docking Mechanism (EDM) is also controlled by the IPC. When the distance between the two spacecraft simulators drops below a predefined threshold, a turn-on signal is sent to the relay, activating the electromagnet and generating suction. Figure 8 presents the workflow of the experimental system. For more detailed information about the customized spacecraft simulator, one can refer to [14,25].

5.2. Experimental Results

Table 5 presents the values of the control gains designed for the PD tracking controller. Figure 9 shows the translational and rotational tracking trajectories of the chaser spacecraft simulator in the ground experiment. The release location of the tracking simulator is ( 0.82   m ,   0.19   m ,   88 ° ) . As can be seen from the figure, the tracking trajectories almost completely coincide with the reference trajectories, except for the initial phase of the experiment. Figure 10 further illustrates the velocities time history of the chaser spacecraft simulator during the experiment. Based on these experimental results, it can be concluded that the PD tracking controller designed in Section 3.2 is effective.

6. Conclusions

In this paper, the symplectic algorithm was employed to execute the ARVD maneuver between a chaser spacecraft and a target spacecraft. The NOC problem was transformed into a series of LQOC problems through quasi-linearization iterations, which could then be solved using symplectic discretization. Additionally, a diamond-shaped obstacle constraint, represented by a superquadratic curve, and a sensor field-of-view constraint were integrated into the optimal control problem formulation. The simulation results demonstrated that the optimal control law derived from the symplectic algorithm effectively guides the chaser spacecraft. Since the guidance algorithm cannot provide real-time control inputs due to hardware limitations, this paper also present a ground-based experimental system and introduces an indirect method using PD control to track the reference trajectory. The experimental results validate the applicability of the reference trajectories and the effectiveness of the PD tracking control.

Author Contributions

Conceptualization, Z.W., J.Y. and T.C.; methodology, Z.W. and J.Y.; software, Z.W. and J.Y.; validation, Z.W.; formal analysis, Z.W. and J.Y.; investigation, Z.W. and J.Y.; resources, Z.W. and J.Y.; data curation, Z.W. and J.Y.; writing—original draft preparation, Z.W. and J.Y.; writing—review and editing, T.C., H.W. and D.J.; visualization, Z.W.; supervision, T.C.; project administration, T.C.; funding acquisition, T.C. and Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the China Postdoctoral Science Foundation under Grant No. 2024M764231, in part by the Basic Research Program of Jiangsu under Grant No. BK20241390, in part by the Fundamental Research Funds for the Central Universities under Grant No. NS2024004, and in part by the Jiangsu Funding Program for Excellent Postdoctoral Talent under Grant No. 2024ZB114.

Data Availability Statement

The datasets generated during and analyzed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Bashnick, C.; Ulrich, S. Fast model predictive control for spacecraft rendezvous and docking with obstacle avoidance. J. Guid. Control Dyn. 2023, 46, 998–1007. [Google Scholar] [CrossRef]
  2. Pirat, C.; Ribes-Pleguezuelo, P.; Keller, F.; Marchi, A.Z.; Walker, R. Toward the autonomous assembly of large telescopes using CubeSat rendezvous and docking. J. Spacecr. Rockets 2022, 59, 375–388. [Google Scholar] [CrossRef]
  3. Fear, A.; Lightsey, E.G. Autonomous rendezvous and docking implementation for small satellites using model predictive control. J. Guid. Control Dyn. 2024, 47, 539–547. [Google Scholar] [CrossRef]
  4. Gao, Y.F.; Li, D.Y.; Ge, S.S. Time-synchronized tracking control for 6-DOF spacecraft in rendezvous and docking. IEEE Trans. Aerosp. Electron. Syst. 2021, 58, 1676–1691. [Google Scholar] [CrossRef]
  5. Colagrossi, A.; Pesce, V.; Bucci, L.; Colombi, F.; Lavagna, M. Guidance, navigation and control for 6DOF rendezvous in Cislunar multi-body environment. Aerosp. Sci. Technol. 2021, 114, 106751. [Google Scholar] [CrossRef]
  6. Rumford, T.E. Demonstration of autonomous rendezvous technology (dart) project summary. Space Syst. Technol. Oper. SPIE 2003, 5088, 10–19. [Google Scholar]
  7. Bedrossian, N. International space station assembly and operation control challenges. In Proceedings of the AAS Guidance and Control Conference, Breckenridge, CO, USA, 2–6 February 2000; pp. 22–36. [Google Scholar]
  8. Sun, L. Adaptive fault-tolerant constrained control of cooperative spacecraft rendezvous and docking. IEEE Trans. Ind. Electron. 2019, 67, 3107–3115. [Google Scholar] [CrossRef]
  9. Zhao, X.T.; Zhang, S.J. Adaptive saturated control for spacecraft rendezvous and docking under motion constraints. Aerosp. Sci. Technol. 2021, 114, 106739. [Google Scholar] [CrossRef]
  10. Li, Q.; Yuan, J.P.; Zhang, B.; Gao, C. Model predictive control for autonomous rendezvous and docking with a tumbling target. Aerosp. Sci. Technol. 2017, 69, 700–711. [Google Scholar] [CrossRef]
  11. Boyarko, G.; Yakimenko, O.; Romano, M. Optimal rendezvous trajectories of a controlled spacecraft and a tumbling object. J. Guid. Control Dyn. 2011, 34, 1239–1252. [Google Scholar] [CrossRef]
  12. Xu, Z.Y.; Chen, Y.K.; Xu, Z.X. Optimal guidance and collision avoidance for docking with the rotating target spacecraft. Adv. Space Res. 2019, 63, 3223–3234. [Google Scholar] [CrossRef]
  13. Weiss, A.; Baldwin, M.; Erwin, R.S.; Kolmanovsky, I. Model predictive control for spacecraft rendezvous and docking: Strategies for handling constraints and case studies. IEEE Trans. Control Syst. Technol. 2015, 23, 1638–1647. [Google Scholar] [CrossRef]
  14. Wei, Z.T.; Wen, H.; Hu, H.Y.; Jin, D.P. Ground experiment on rendezvous and docking with a spinning target using multistage control strategy. Aerosp. Sci. Technol. 2020, 104, 105967. [Google Scholar] [CrossRef]
  15. Sanjeev, N.; Christopher, J.D. Comparison of the Legendre–Gauss pseudospectral and Hermite–Legendre–Gauss–Lobatto methods for low-thrust spacecraft trajectory optimization. Aerosp. Syst. 2020, 3, 53–70. [Google Scholar]
  16. Zhong, R.; Zhu, Z.H. Optimal Control of Nanosatellite Fast Deorbit Using Electrodynamic Tether. J. Guid. Control Dyn. 2014, 37, 1182–1194. [Google Scholar] [CrossRef]
  17. Bokov, G.V. Pontryagin’s maximum principle of optimal control problems with time-delay. J. Math. Sci. 2011, 172, 623–634. [Google Scholar] [CrossRef]
  18. Nikoobin, A.; Moradi, M. Indirect solution of optimal control problems with state variable inequality constraints: Finite difference approximation. Robotica 2017, 35, 50–72. [Google Scholar] [CrossRef]
  19. Fabien, B. Numerical solution of constrained optimal control problems with parameters. Appl. Math. Comput. 1996, 80, 43–62. [Google Scholar] [CrossRef]
  20. Feng, K.; Qin, M. Symplectic Geometric Algorithms for Hamiltonian Systems; Springer: Berlin/Heidelberg, Germany, 2010; pp. 1–690. [Google Scholar]
  21. Park, C.; Scheeres, D.J. Determination of optimal feedback terminal controllers for general boundary conditions using generating functions. Automatica 2006, 42, 869–875. [Google Scholar] [CrossRef]
  22. Guibout, V.M.; Scheeres, D.J. Solving Relative Two Point Boundary Value Problems: Spacecraft Formulation Flight Transfers Application. J. Guid. Control Dyn. 2004, 27, 693–704. [Google Scholar] [CrossRef]
  23. Peng, H.; Gao, Q.; Wu, Z. Efficient sparse approach for solving receding-horizon control problems. J. Guid. Control Dyn. 2013, 36, 1864–1872. [Google Scholar] [CrossRef]
  24. Peng, H.; Gao, Q.; Wu, Z. Symplectic approaches for solving two-point boundary-value problems. J. Guid. Control Dyn. 2012, 35, 653–658. [Google Scholar] [CrossRef]
  25. Wei, Z.T.; Chen, T.; Wen, H.; Jin, D.P.; Hu, H.Y. Experimental study on autonomous assembly of multiple spacecraft simulators in a spinning scenario. Acta Astronaut. 2023, 207, 106–117. [Google Scholar] [CrossRef]
  26. Xu, X.N.; Wen, H.; Wei, Z.T. Ground-based experiment on spacecraft rendezvous and docking using superquadric curve for obstacle representation. J. Dyn. Control 2020, 18, 42–49. [Google Scholar]
  27. Hovell, K.; Ulrich, S. Deep reinforcement learning for spacecraft proximity operations guidance. J. Spacecr. Rockets 2021, 58, 254–264. [Google Scholar] [CrossRef]
  28. Zappulla, R., II; Park, H.; Virgili-Llop, J.; Romano, M. Real-time autonomous spacecraft proximity maneuvers and docking using an adaptive artificial potential field approach. IEEE Trans. Control Syst. Technol. 2018, 27, 2598–2605. [Google Scholar] [CrossRef]
  29. Mammarella, M.; Lorenzen, M.; Capello, E.; Park, H.; Dabbene, F.; Guglieri1, G.; Romano, M.; Allgower, F. An offline-sampling SMPC framework with application to autonomous space maneuvers. IEEE Trans. Control Syst. Technol. 2018, 28, 388–402. [Google Scholar] [CrossRef]
  30. Yang, Y. Coupled orbital and attitude control in spacecraft rendezvous and soft docking. Proc. Inst. Mech. Eng. Part G J. Aerosp. Eng. 2019, 233, 3109–3119. [Google Scholar] [CrossRef]
  31. Badawy, A.; McInnes, C.R. On-orbit assembly using superquadric potential fields. J. Guid. Control Dyn. 2008, 31, 30–43. [Google Scholar] [CrossRef]
  32. Badawy, A.; McInnes, C.R. Small spacecraft formation using potential functions. Acta Astronaut. 2009, 65, 1783–1788. [Google Scholar] [CrossRef][Green Version]
  33. Wen, H.; Chen, T.; Jin, D.P.; Hu, H.Y. Passivity-based control with collision avoidance for a hub-beam spacecraft. Adv. Space Res. 2017, 59, 425–433. [Google Scholar] [CrossRef]
  34. Peng, H.; Jiang, X. Nonlinear receding horizon guidance for spacecraft formation reconfiguration on libration point orbits using a symplectic numerical method. ISA Trans. 2016, 60, 38–52. [Google Scholar] [CrossRef]
  35. Cheng, L.; Wen, H.; Jin, D.P. Reconfiguration control of satellite formation using online quasi-linearization iteration and symplectic discretization. Aerosp. Sci. Technol. 2020, 107, 106348. [Google Scholar] [CrossRef]
  36. Guglieri, G.; Maroglio, F.; Pellegrino, P. Design and development of guidance navigation and control algorithms for spacecraft rendezvous and docking experimentation. Acta Astronaut. 2014, 94, 395–408. [Google Scholar] [CrossRef]
  37. Huang, Z.; Lu, Y.; Wen, H.; Jin, D.P. Ground-based experiment of capturing space debris based on artificial potential field. Acta Astronaut. 2018, 152, 235–241. [Google Scholar] [CrossRef]
Figure 1. Schematic of the ARVD mission.
Figure 1. Schematic of the ARVD mission.
Actuators 14 00075 g001
Figure 2. Schematic of the final system configuration in the docking phase.
Figure 2. Schematic of the final system configuration in the docking phase.
Actuators 14 00075 g002
Figure 3. A group of superquadratic curves of a = 5, b = 3.
Figure 3. A group of superquadratic curves of a = 5, b = 3.
Actuators 14 00075 g003
Figure 4. The pose trajectory of the chaser spacecraft obtained by the symplectic algorithm.
Figure 4. The pose trajectory of the chaser spacecraft obtained by the symplectic algorithm.
Actuators 14 00075 g004
Figure 5. Time history of the translational and rotational velocities obtained by the symplectic algorithm.
Figure 5. Time history of the translational and rotational velocities obtained by the symplectic algorithm.
Actuators 14 00075 g005
Figure 6. Time histories of the thrust force and torque generated by the symplectic algorithm.
Figure 6. Time histories of the thrust force and torque generated by the symplectic algorithm.
Actuators 14 00075 g006
Figure 7. Schematic of the ground-based experimental system.
Figure 7. Schematic of the ground-based experimental system.
Actuators 14 00075 g007
Figure 8. Schematic of the workflow of the experimental system.
Figure 8. Schematic of the workflow of the experimental system.
Actuators 14 00075 g008
Figure 9. The pose tracking trajectory of the chaser spacecraft in the experiment.
Figure 9. The pose tracking trajectory of the chaser spacecraft in the experiment.
Actuators 14 00075 g009
Figure 10. Time history of the translational and rotational tracking velocities in the experiment.
Figure 10. Time history of the translational and rotational tracking velocities in the experiment.
Actuators 14 00075 g010
Table 1. Some special superquadratic curves.
Table 1. Some special superquadratic curves.
CurveEquationShape
“+” (s = 0) x -axis and y -axisActuators 14 00075 i001
Asteroid (s = 1/3 and a = b) x 2 3 + y 2 3 = c 2 3 Actuators 14 00075 i002
Diamond (s = 0.5) ± x a ± y b = 1 Actuators 14 00075 i003
Ellipse (s = 1) x 2 a 2 + y 2 b 2 = 1 Actuators 14 00075 i004
Rectangle (s )s = 6Actuators 14 00075 i005
Table 2. The physical parameters of the chaser and target spacecraft.
Table 2. The physical parameters of the chaser and target spacecraft.
ParameterValue
L spacecraft 200 mm
R C 0.15 m
m C 9 kg
I C 0.05 kg m 2
Table 3. The initial states of the chaser spacecraft and the position of target spacecraft.
Table 3. The initial states of the chaser spacecraft and the position of target spacecraft.
ParameterValue
x 0 , y 0 , θ 0 0.8   m , 0.2   m , π 2
x ˙ 0 , y ˙ 0 , θ ˙ 0 0, 0, 0
x T , y T , θ T , x ˙ T , y ˙ T , θ ˙ T 0.3   m , 1.5   m , π 4
Table 4. The parameters of constraints.
Table 4. The parameters of constraints.
ParameterValue
Origin of FOV 0.3 + 2 20 , 1.5 2 20 m
α s 60 °
Roadblock location 0 . 6 ,   0 . 7 m
a , b , s a = b = 0.15 ,   s = 0.5
F x max , F y max 0.03 N
T max 0.001 N m
x ˙ max , y ˙ max 0.035 m/s
θ ˙ max 0.15 rad/s
Table 5. The values of the control gains in the PD tracking controller.
Table 5. The values of the control gains in the PD tracking controller.
ParameterValue
K p diag ( 20 , 20 )
K d diag ( 50 , 50 )
K p 0.1
K d 0.3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wei, Z.; Yang, J.; Wen, H.; Jin, D.; Chen, T. Nonlinear Optimal Control for Spacecraft Rendezvous and Docking Using Symplectic Numerical Method. Actuators 2025, 14, 75. https://doi.org/10.3390/act14020075

AMA Style

Wei Z, Yang J, Wen H, Jin D, Chen T. Nonlinear Optimal Control for Spacecraft Rendezvous and Docking Using Symplectic Numerical Method. Actuators. 2025; 14(2):75. https://doi.org/10.3390/act14020075

Chicago/Turabian Style

Wei, Zhengtao, Jie Yang, Hao Wen, Dongping Jin, and Ti Chen. 2025. "Nonlinear Optimal Control for Spacecraft Rendezvous and Docking Using Symplectic Numerical Method" Actuators 14, no. 2: 75. https://doi.org/10.3390/act14020075

APA Style

Wei, Z., Yang, J., Wen, H., Jin, D., & Chen, T. (2025). Nonlinear Optimal Control for Spacecraft Rendezvous and Docking Using Symplectic Numerical Method. Actuators, 14(2), 75. https://doi.org/10.3390/act14020075

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop