1. Introduction
Recently, there have been many space systems that contain the master spacecraft and the slave spacecraft, such as CubeSats deployed from the space station [
1], the lander or rover released from a deep-space probe on the orbit of a planet [
2] and the smart impactor launched from a mother flyby spacecraft [
3]. Prior to the release of the slave spacecraft, its attitude has to be initialized and the systematic errors of its gyroscope unit (GU) should be calibrated, such that the slave spacecraft can perform its space mission independently. The attitude determination accuracy of the slave spacecraft depends on the calibration accuracy of the systematic errors, such as gyroscope bias, scale factor error and misalignment [
4,
5,
6]. Although the ground calibration method is convenient to achieve high accuracy [
7,
8], the values of the calibration parameters may change due to the difference between the ground and space environments [
9]. The primary cause for the change in the misalignment, which is the most dominant error that affects the attitude determination accuracy, is the thermal distortion of the GU bracket and the spacecraft body, which yields a pointing change in the GU axis in the space environment. To cope with this problem, the attitude determination system (ADS) of the master spacecraft can be utilized as the reference for the on-orbit calibration [
10]. Typically, the high-precision gyroscopes and the star sensors are mounted on the master spacecraft for attitude determination [
11,
12,
13]. An accurate and rapid transfer alignment is crucial to guarantee the attitude determination performance of the slave spacecraft [
14,
15].
The main problem of the traditional inter-spacecraft transfer alignment method is that the convergence rate of the algorithm is rather slow. Typically, tens of minutes are required for the traditional transfer alignment process [
16], which is not satisfactory for quick response situations. To achieve faster convergence and increased accuracy, the rapid transfer alignment techniques are investigated in this paper.
The measurement information matching scheme is an important research object in the field of transfer alignment. Many scholars focus on the on-orbit calibration method for the GU based on the attitude matching scheme [
17,
18]. The traditional method is to estimate the attitude and the calibration parameters of the GU on the slave spacecraft using the attitude measurement data provided by the star sensors on the master spacecraft with a Kalman filter (KF) [
19,
20]. The high-accuracy measurement information obtained from the satellite payload and GNSS (global navigation satellite system) are taken into account for the calibration [
21,
22]. In addition, for tactical weapon applications, the velocity plus attitude matching scheme [
23,
24] is developed as an improvement of the conventional velocity matching scheme [
25,
26,
27]. The calibration methods based on position matching and acceleration matching schemes are studied in [
28,
29].
On the basis of measurement information matching, the performance of the inter-spacecraft transfer alignment method depends on the filtering algorithm. The KF is the most widely used state estimation approach for the space missions [
30,
31,
32]. The estimation accuracy of the KF depends on the process and measurement noise covariance matrices
and
in the system model. If the prior statistical characteristics of the process and measurement noises are inaccurate, the performance of the KF may be degraded evidently. For the inter-spacecraft transfer alignment system, the statistical characteristics of the gyroscopes and the star sensors can be achieved from the manufacturer, while the process noise covariance related to the calibration parameters are often not known exactly. To cope with the problem, the common approach is to estimate the unknown noise covariance matrix with an adaptive Kalman filter (AKF) [
33,
34]. In the AKF, the estimate of
is updated recursively based on the measurement innovation, which is calculated with the previous state estimate in each iteration. However, when the previous state estimate is inaccurate, and the prior statistical information is far from the actual situation, it is difficult to obtain the proper
with the AKF [
35]. To the best of the authors’ knowledge, the optimal approach to tune the process noise covariance matrix is still an open question.
Motivated by the key idea of the transfer alignment method for tactical weapon applications and the AKF algorithm, to improve the accuracy and rapidity of the inter-spacecraft transfer alignment, a practical method is presented in this paper. The main contributions of the paper are given as follows:
- (1)
An attitude plus angular rate matching scheme is presented, where the fused information from the star sensors and the gyroscopes on the master spacecraft is adopted for the calibration of the GU on the slave spacecraft. Accordingly, the state equation and the measurement equation are derived as the transfer alignment system model. Compared with the traditional attitude matching scheme, the main advantage of the presented scheme is that more measurement information is utilized, such that the alignment performance is improved in limited time.
- (2)
A framework of the Q-learning Kalman filter (QKF) that combines the celebrated Q-learning approach [
36,
37,
38] with the KF is developed to fine tune the process noise covariance matrix related to the calibration parameters. Instead of the recursive estimation in the AKF, the Q-learning approach is designed to explore for the appropriate
. Once the appropriate
is learned, it is plugged into the model-based KF to enhance its performance. Compared with our previous works [
39,
40], the main advantage of the presented QKF is that only one explorative filter (instead of a group of parallel filters) is required in the Q-learning process so as to simplify the implementation of the algorithm. Correspondingly, the computational load of the algorithm is decreased, which facilitates the application of the algorithm on the spacecraft with limited computing resource.
This study is structured as follows. In
Section 2, the attitude plus angular rate matching scheme and the inter-spacecraft rapid transfer alignment system model are presented. In
Section 3, the QKF algorithm for rapid transfer alignment is provided. In
Section 4, the potential performance of the transfer alignment system is analyzed via the calculation of the Cramer–Rao lower bounds (CRLB) [
41]. In
Section 5, the estimation performance of the KF, the AKF and the presented QKF designed based on the transfer alignment system model are compared via simulations. Finally, the conclusions are drawn in the last section.
2. Attitude Plus Angular Rate Matching Scheme
2.1. Main Idea
The inter-spacecraft rapid transfer alignment model is derived to calibrate the GU on the slave spacecraft based on the ADS on the master spacecraft. The basic principle of the considered transfer alignment system is illustrated in
Figure 1.
As shown in
Figure 1, on the master spacecraft, the ADS consisting of gyroscopes and star sensors are utilized to obtain the attitude reference information. On the slave spacecraft, the transfer alignment filter incorporates the attitude reference information and the GU measurement to estimate the attitude, the GU bias and the calibration parameters. The transfer alignment filter is designed based on the transfer alignment model composed of the state equation and the measurement equation. The calibration parameters include the scale factor error and the misalignment of the GU on the slave spacecraft. The construction of the transfer alignment model and the parameterization of the systematic errors fit within the spacecraft attitude determination framework [
10].
In this paper, the spacecraft attitude describes the rotation of the spacecraft body frame relative to the geocentric equatorial inertial frame. For the geocentric equatorial inertial frame, the origin is the Earth’s center, the X axis points to the Equinox direction, the Y axis points toward the North pole and the Z axis forms a right-handed coordinate with the X axis and Y axis. For the spacecraft body frame, the origin is the center mass of the spacecraft, and the X axis, Y axis and Z axis are parallel to the principal axis of inertia and form a right-handed coordinate.
2.2. State Equation
For the design of the transfer alignment filter, the state equation of the transfer alignment model is established based on the attitude kinematics and the error model of the GU on the slave spacecraft. In this paper, the GU is supposed to be three gyroscopes which measure the angular rate of the slave spacecraft relative to the inertial frame. The GU error model includes gyroscope bias, scale factor error, misalignment and random noise composed of angle random walk (ARW) and rate random walk (RRW), as shown in
Figure 2.
In the GU error model, the measured angular rate
is related to the true angular rate
of the slave spacecraft relative to inertial frame in the body frame by [
21]
where
denotes the unit matrix with compatible dimension,
is the scale factor error matrix,
is the misalignment matrix,
is the attitude transformation matrix from the body frame to the sensor frame,
is the GU bias,
is the random noise called angular random walk, the subscript
denotes discrete time. The true angular rate is written as
, where
,
and
are the elements of 3-axis angular rate. The scale factor error matrix is given by
where
,
and
are the scale factor error parameters. The misalignment matrix is described as
where
,
,
,
,
and
are elements in the misalignment matrix. From Equations (2) and (3), we have
with
where
,
,
,
,
and
are the misalignment parameters. Note that the products of scale factors and misalignments are combined in Equation (5) to simplify the formulation. Substituting Equation (4) into Equation (1), the GU error model is reformulated as
To derive the attitude error equation, the estimate of the spacecraft angular rate
is written as
where
is the estimate of the GU bias,
is the attitude transformation matrix from the sensor frame to the body frame. Substituting Equation (12) into Equation (13), we have
where
is the GU bias error. The angular rate estimation error is defined as
Substituting Equation (14) into Equation (16) yields
Equation (17) is reformulated as
with
and
is the calibration parameter vector.
For the spacecraft attitude determination, the error quaternion
is defined as
where
is the attitude quaternion that describes the spacecraft attitude relative to the inertial frame,
is the estimate of
. According to the spacecraft attitude kinematics, the propagation of the error quaternion is described by the following perturbation equation [
11]:
where
is the vector part of the error quaternion,
denotes the time interval of discretization,
is the skew symmetric matrix defined as
Substituting Equation (18) into Equation (22), the perturbation equation is modified as
It is evident that the effects of the GU bias, scale factor error, misalignment and random noise to the propagation of the vector part of the error quaternion are described in Equation (24).
For transfer alignment, the spacecraft attitude, the GU bias, scale factor error and misalignment should be estimated. Accordingly, the state vector is constructed as the combination of the vector part of the error quaternion
, the GU bias error
and the calibration parameter vector
, which is given by
From Equations (24) and (25), we obtain the state equation:
with the state transition matrix
The process noise
is given by
where
is the random noise that drive the rate random walk,
is introduced to describe the drift of the calibration parameters. It is often assumed that
is the Gaussian white noise with zero mean. The process noise covariance matrix
is a positive definite symmetric matrix with the following structure
where
and
are the sub-matrices related to the vector part of the error quaternion
and the GU bias error
respectively,
is the sub-matrix related to the calibration parameter vector
.
2.3. Measurement Model
For the attitude plus angular rate matching scheme, both the attitude and the angular rate reference information achieved from the ADS on the master spacecraft are utilized for the on-orbit calibration. It is expected that the calibration parameters can be estimated effectively with the attitude reference information from the master spacecraft. When the attitude and the angular reference information is available, the measurement equation is written as
with the measurement
the measurement matrix
and the measurement noise
where
is the vector part of the error quaternion between the quaternions obtained from the ADS on the master spacecraft and the GU on the slave spacecraft,
is the difference between the angular rates obtained from the ADS on the master spacecraft and the GU on the slave spacecraft,
is the attitude measurement noise with the covariance matrix
,
is the angular rate measurement noise with the covariance matrix
. The measurement noise covariance matrix
is the mixture of
and
:
Note that both the measurement noises of the ADS on the master spacecraft and the GU on the slave spacecraft are contained in Equation (33). Generally, to implement the inter-spacecraft rapid transfer alignment effectively, the ADS on the master spacecraft should be more accurate than the GU on the slave spacecraft. It is expected that the systematic errors in the ADS on the master spacecraft have been compensated before the implementation of the inter-spacecraft rapid transfer alignment.
The state Equation (26) and the measurement Equation (30) compose the system model for the attitude plus angular rate matching scheme. With the system model, the transfer alignment KF is designed to implement the state estimation. It should be mentioned that, for the considered attitude determination system composed of the star sensors and the gyroscopes, as it is widely used in current satellites, the feasibility of the system model has been verified through multiple space missions. In general, for a novel navigation system, the hardware-in-loop experiment with the simulation of the operational environment is an effective approach for model verification.
2.4. Transfer Alignment KF
On the basis of the system model, the transfer alignment KF is designed to estimate the state vector
based on the measurement
. The procedure of the inter-spacecraft rapid transfer alignment method based on the KF with the prediction and update procedures is collected in Algorithm 1.
Algorithm 1: Transfer alignment Kalman filter |
1: Initialize attitude quaternion estimate , bias estimate , state estimate and its estimation error covariance matrix |
2: for k = 1, 2, …, K, do |
3: |
4: |
5: |
6: |
7: |
8: |
9: if the measurement is available, then |
10: |
11: |
12: |
13: |
14: end if |
15: |
16: |
17: end for |
18: return , , , |
In the algorithm,
and
are the prediction and the estimate of the state vector,
and
are their corresponding estimation error covariance matrices, K is the length of the measurement data,
is the Kalman gain,
is the measurement innovation. The matrix
is constructed with the estimate of the calibration parameter vector as the estimate of
. The expression of
is
To facilitate the implementation of the algorithm, similar to the method presented in [
11], as the attitude quaternion
is propagated based on the spacecraft attitude kinematics equation, the state transition matrix
in the state Equation (26) is only used for the estimation error covariance propagation in the KF. Note that the propagation of the error quaternion
is not beneficial to improve the filtering performance. To deal with the problem caused by the discontinuity of the measurement data, for the filtering algorithm shown in Algorithm 1, the propagation is performed in each time step, while the update is performed in the case that the measurement data are available.
It is seen from Algorithm 1 that the efficiency of the update to the prediction
with the measurement innovation
depends on the Kalman gain
, which is adjusted through the noise covariance matrices
and
. In the measurement noise covariance matrix,
and
are symmetric positive definite matrices determined according to the measurement error behavior of the star sensors and the GU specified by the manufacturer. In the process noise covariance matrix, the elements related to
and
are determined with the ARW and RRW coefficients, which are achievable through the Allan variance analysis [
42]. However, it is difficult to determine the accurate noise covariance for the calibration parameters
in the absence of prior knowledge. As mentioned in the introduction, the proper choice for the process noise covariance matrix is critical for accurate transfer alignment. The state estimate of the KF may deviate from its actual value if
is not set appropriately. In order to identify the unknown elements in
, a Q-learning-based filtering algorithm is presented in the next section as a modification of the standard KF.
4. CRLB of Transfer Alignment System
The feasibility of the inter-spacecraft rapid transfer alignment method based on the attitude plus angular rate matching scheme is analyzed through the CRLB. The CRLB is a theoretical bound on the achievable state estimation accuracy for certain system models. It facilitates the potential performance analysis of the transfer alignment method before the numerical simulation of the filtering algorithm. For the linear discrete-time stochastic system formulated in Equations (26) and (30), the calculation process of the CRLB is described in Algorithm 3.
Algorithm 3: Calculation of CRLB |
1: |
2: for k = 1, 2, …, K, do |
3: |
4: if the measurement is available, then |
5: |
6: end if |
7: |
8: end for |
9: return |
In the algorithm, is the fisher information matrix calculated with the considered system model. The square roots of the diagonal elements of the calculated matrix provide the theoretical bound of the state estimation root mean square (RMS) error.
The CRLB analysis is implemented under the following conditions. The master spacecraft is an Earth satellite with an orbit altitude of 700 km and its attitude keeps in orientation to the Earth. The slave spacecraft is mounted on the master spacecraft. For the ADS on the master spacecraft, the attitude determination accuracy is 3″ and the angular rate determination accuracy is 0.02°/h. For the GU on the slave spacecraft, the ARW and RRW coefficients are 4 × 10−4°/h0.5 and 1 × 10−3°/h1.5 respectively. The scale factor error parameter vector is set as (parts per million) and the misalignment parameter vector is set as . The update rate of the filter is 1 Hz. The total simulation time of the transfer alignment is 3600 s. The calibration maneuver is the sequential rotation around the three orthogonal axes of the master spacecraft body frame. Generally, the transfer alignment performance can be improved when the calibration maneuver angular rate is increased. Considering that the dynamic measurement performance of the typical star sensor may be degraded if its angular rate is larger than 2°/s, the rotation angular rate of the master spacecraft is set as 1°/s.
The CRLB for the inter-spacecraft rapid transfer alignment system described in
Section 2 is calculated using Algorithm 3.
Figure 4,
Figure 5 and
Figure 6 give the theoretical error bounds of the calibration parameters when the time length of the rotation around each axis is 100 s, 200 s, 400 s and 600 s, respectively.
From the CRLB curves, it is evident that the rapidity of the transfer alignment is guaranteed when a shorter rotation time is adopted. According to the analysis results, in the following numerical simulation, the time length of the rotation around each axis is set as 100 s for the calibration maneuver.
5. Simulation Results
To illustrate the high performance of the inter-spacecraft rapid transfer alignment method based on the QKF, the simulation results are shown in this section. For the data generation of the sensors, the simulation conditions are same as those in
Section 4. In the transfer alignment KF, the initial state estimation error covariance matrix is set as follows.
where
,
. The elements in
are larger than triple the magnitude of the calibration parameters given in
Section 4. Similarly to the attitude determination KF [
21], in the process noise covariance matrix
, the sub-matrices
and
are designed according to the ARW and RRW coefficients of the GU. The sub-matrix
is set as
, where the magnitude of the parameter
is
. In the measurement noise covariance matrix
, the sub-matrices are set as
and
, where
and
. For the inter-spacecraft rapid transfer alignment, the initial attitude estimate
is obtained from the ADS on the master spacecraft, and the state vector related to the calibration parameters is initialized as zero.
The first simulation is performed to illustrate the high performance of the attitude plus angular rate matching scheme presented in
Section 2. The presented attitude plus angular rate matching scheme is compared with the traditional attitude matching scheme via the simulation. The average RMS errors of the attitude plus angular rate matching scheme and the attitude matching scheme for the estimation of the attitude and the calibration parameters obtained from 10 individual trials are plotted together in
Figure 7,
Figure 8 and
Figure 9.
It is seen from
Figure 7,
Figure 8 and
Figure 9 that the presented scheme performs better than the traditional scheme. The reason is that more measurement information is available for the transfer alignment in the attitude plus angular rate matching scheme. With the simulation results shown in the figures above, we conclude that the presented attitude plus angular rate matching scheme is efficient for the inter-spacecraft transfer alignment.
The aim of the second simulation is to illustrate the performance of the QKF presented in
Section 3. In the QKF algorithm, the Q-learning parameters are set as
,
and
. Generally, the parameters can be designed with a trial-and-error method via the numerical simulation. For the considered noise covariance adaptation problem in the filtering algorithm, it was found in previous works [
39,
44] that the influence of the design parameters is not significant when they are chosen in certain scopes. In the pre-determined set related to the state space
for the tuning of the process noise covariance matrix, the sub-matrix
is set as
where
is a scalar factor in the range of
, with 11 different values inside this interval. The cardinality of the pre-determined set is rather small, which is beneficial for the Q-learning approach in order to achieve a reasonable result. In fact, for the considered scenario, a small parameter set is effective to improve the filtering performance. Nevertheless, a sophisticated Q-learning approach may be required for more complicated problems.
The estimation error curves of the attitude and the calibration parameters as well as their corresponding
error bounds computed from the filter’s error covariance matrix are shown in
Figure 10,
Figure 11 and
Figure 12.
It is seen from the figures that the estimation errors of the calibration parameters diminish rapidly after the calibration maneuver is performed. Both the scale factor and the misalignment estimation errors converge in about 600 s. All the estimation error curves of the QKF are contained in the corresponding error bounds, which indicate the consistency of the filtering algorithm. The simulation result illustrates that all the systematic errors of the GU on the slave spacecraft in the system model can be calibrated rapidly and accurately with the ADS on the master spacecraft. Simultaneously, it shows that the QKF is feasible for the inter-spacecraft rapid transfer alignment.
To facilitate the comparison, the average RMS errors of different transfer alignment methods are listed in
Table 1.
Obviously, the presented attitude plus angular rate matching scheme outperforms the traditional attitude matching scheme. The state estimation accuracy can be improved when the presented QKF is used instead of the standard KF.
Furthermore, to demonstrate the advantage of the QKF algorithm, the simulation of the inter-spacecraft rapid transfer alignment methods based on different filtering algorithms are performed, including the standard KF, the Sage-Husa AKF and the QKF. For a fair comparison, three filtering algorithms share the basic filtering parameters, including the initialization parameters, the state transition matrix, the measurement matrix and the initial noise covariance matrices. The average RMS errors of the KF, the AKF and the QKF for the estimation of the attitude and the calibration parameters obtained from 10 individual trials are plotted in
Figure 13,
Figure 14 and
Figure 15.
The above figures illustrate that the QKF obtains the highest state estimation accuracy in comparison with the KF and the AKF. This result is distinguishable in terms of both the attitude and the calibration parameters estimation. It indicates that the QKF is more effective than the AKF to identify the appropriate process noise covariance matrix and consequently improves the filtering performance.
To illustrate the performance of the presented QKF algorithm in different scenarios, the numerical simulation is implemented with different measurement noise levels. In the inter-spacecraft rapid transfer alignment system, when the attitude measurement noise standard deviation increases from 1″ to 10″, the average attitude estimation RMS errors of the KF and the QKF obtained from 10 individual trials are plotted in
Figure 16. It indicates that the QKF algorithm is less sensitive to the variation in the measurement noise levels.
To sum up, it is apparent from the simulation results that the presented inter-spacecraft rapid transfer alignment method is valuable to satisfy the rapidity and accuracy requirements for the on-orbit calibration. The presented attitude plus angular rate matching scheme performs better than the traditional one. The QKF has considerable potential for practical applications in space missions as both the KF algorithm and the Q-learning approach are familiar for aerospace engineers. The basic principle of the presented method can be further applied to calibrate other attitude sensors, although the system model should be modified separately depending on the specific application.