Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation

Liu, Xi; Qu, Hua; Zhao, Jihong; Yue, Pengcheng; Wang, Meng

doi:10.3390/s16091530

Open AccessArticle

Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation

by

Xi Liu

^1,*

,

Hua Qu

^1,2,

Jihong Zhao

¹,

Pengcheng Yue

¹ and

Meng Wang

²

¹

School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an 710049, China

²

School of Software Engineering, Xi’an Jiaotong University, Xi’an 710049, China

^*

Author to whom correspondence should be addressed.

Sensors 2016, 16(9), 1530; https://doi.org/10.3390/s16091530

Submission received: 6 August 2016 / Revised: 6 September 2016 / Accepted: 8 September 2016 / Published: 20 September 2016

(This article belongs to the Section Physical Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

A new algorithm called maximum correntropy unscented Kalman filter (MCUKF) is proposed and applied to relative state estimation in space communication networks. As is well known, the unscented Kalman filter (UKF) provides an efficient tool to solve the non-linear state estimate problem. However, the UKF usually plays well in Gaussian noises. Its performance may deteriorate substantially in the presence of non-Gaussian noises, especially when the measurements are disturbed by some heavy-tailed impulsive noises. By making use of the maximum correntropy criterion (MCC), the proposed algorithm can enhance the robustness of UKF against impulsive noises. In the MCUKF, the unscented transformation (UT) is applied to obtain a predicted state estimation and covariance matrix, and a nonlinear regression method with the MCC cost is then used to reformulate the measurement information. Finally, the UT is adopted to the measurement equation to obtain the filter state and covariance matrix. Illustrative examples demonstrate the superior performance of the new algorithm.

Keywords:

unscented Kalman filter (UKF); unscented transformation (UT); maximum correntropy criterion (MCC)

1. Introduction

The relative state estimation problem is rather important in space communication networks, including inter-satellite link (ISL) establishment, communication support, multiple spacecraft formation flying, and spacecraft rendezvous and docking [1,2,3,4]. Many relative motion scenes make use of the radar or/and ladar to obtain the relative measurement information among spacecrafts, which is necessary to determine the relative motions.

As for the state estimation problem, the Kalman filter (KF) is often used for a linear dynamic system, which is, in essence, a recursive minimum

l_{2}

-norm linear filter [5,6,7]. However, from [8], we know that the relative state estimation in a circular reference orbit involves a linear state Clohessy-Wiltshire (CW) equation and a nonlinear measurement equation. Especially, when the reference orbit is elliptic, the dynamic state equation is also nonlinear [9]. To handle the nonlinear problems, the widely used methods are the extended Kalman filter (EKF) [10,11,12] and unscented Kalman filter (UKF) [11,12,13]. The EKF is a popular nonlinear extension of KF, which makes use of the first order Taylor series expansions to approximate the nonlinear system and applies the KF to this approximation. Nevertheless, the approximation is crude and it may lead to filter divergence if the function exhibits highly non-linear characteristics. Furthermore, the derivation of the Jacobian matrices is cumbersome and often results in the implementation difficulties. In UKF, the probability distribution of the state is approximated by a set of deterministically selected sigma points and propagated though the non-linear process and measurement equations. UKF does not need to calculate the cumbersome Jacobian matrices and provides derivative-free higher-order approximations to achieve more accurate performances than the EKF. Unfortunately, the EKF and UKF may fail to perform well when the measurements are disturbed by some heavy-tailed non-Gaussian noises, which arises naturally in many real applications of engineering. The main reason for this is that both methods are based on the minimum

l_{2}

-norm technique and thus exhibit sensitivity to heavy-tailed noises [14]. To address this problem, some robust methods have been proposed. The Huber’s generalized maximum likelihood methodology is very common [15,16], which is a combined minimum

l_{1}

and

l_{2}

-norm technique.

Besides Huber’s robust statistics, the optimization criteria in information theoretic learning (ITL) [17,18] provide an alternative effective way, using information theoretic quantities estimated directly from the data as the optimization costs instead of using the usual second-order statistical measures (such as variance and covariance). The ITL costs can capture higher-order statistics of data, and can be used as a robust adaptation cost to achieve excellent performance in a number of applications [19,20,21,22,23,24,25,26,27,28,29,30,31]. Particularly, an optimization criterion based on correntropy, called the maximum correntropy criterion (MCC) [17,18], has recently been successfully applied in robust adaptive filtering in the presence of heavy-tailed non-Gaussian noises [17,28,29,30,31,32].

In this paper, a non-linear Kalman type filter based on MCC has been derived, namely the maximum correntropy unscented Kalman filter (MCUKF). In MCUKF, the UT is applied to obtain a predicted state estimation and covariance matrix, and a non-linear regression model under MCC is used to reformulate the measurement information. Then, the UT is adopted to the measurement equation to obtain the filter state and covariance matrix. Similar to the original UKF, the new filter also has a recursive structure and is suitable for online implementation. It is worth noting that the proposed MCUKF is different from the algorithm in [33].

The organization of the paper is as follows. In Section 2, we provide a short review of the MCC and the traditional UKF. In Section 3, the MCUKF algorithm is derived. Two illustrative examples are then provided in Section 4 to show the excellent performance of the MCUKF. Finally, Section 5 gives the conclusion.

2. Preliminaries

2.1. Maximum Correntropy Criterion

Given two random variables

X, Y \in R

with joint distribution function

F_{X Y} (x, y)

, the correntropy is a generalized similarity measure between X and Y defined by

V (X, Y) = E [κ (X, Y)] = \int κ (x, y) d F_{X Y} (x, y),

(1)

where E is the expectation operator, and

κ (\cdot, \cdot)

denotes a shift-invariant kernel following the Mercer condition. In this paper, the kernel function is chosen as the Gaussian kernel given by

κ (x, y) = G_{σ} (e) = \exp (- \frac{e^{2}}{2 σ^{2}}),

(2)

where

e = x - y

, and

σ > 0

is the kernel bandwidth.

In many practical applications, we have only a limited amount of data and the joint distribution

F_{X Y}

is unknown. In these cases, the correntropy can be estimated by the sample mean estimator:

\hat{V} (X, Y) = \frac{1}{N} \sum_{i = 1}^{N} G_{σ} (e (i)),

(3)

where

e (i) = x (i) - y (i)

, with

{\{x (i), y (i)\}}_{i = 1}^{N}

being N samples drawn from

F_{X Y}

.

Taking the Taylor series expansion of the Gaussian kernel yields

V (X, Y) = \sum_{n = 0}^{\infty} \frac{{(- 1)}^{n}}{2^{n} σ^{2 n} n!} E [{(X - Y)}^{2 n}] .

(4)

It can be seen that the correntropy is a weighted sum of all even order moments of the error variable

X - Y

. In addition, the kernel bandwidth appears as a parameter weighting the second order and higher order moments. It is noted that the correntropy will be dominated by the second order moment when the kernel bandwidth is very large compared to the dynamic range of the data.

Given a sequence of error data

{\{e (i)\}}_{i = 1}^{N}

, the cost function of MCC is given by

J_{M C C} = \frac{1}{N} \sum_{i = 1}^{N} G_{σ} (e (i)) .

(5)

Assume

W

is a parameter vector of an adaptive system to learn, and let

x (i)

and

y (i)

denote the model output and the desired response, respectively. The MCC based learning can be formulated as the following optimization problem:

\hat{W} = \underset{W \in Ω}{\arg max} \frac{1}{N} \sum_{i = 1}^{N} G_{σ} (e (i)),

(6)

where

\hat{W}

denotes the optimal solution, and Ω denotes a feasible set of the parameter.

2.2. Unscented Kalman Filter

The UKF provides a suitable alternative for EKF to deal with nonlinear systems. In this paper, we discuss a discrete-time nonlinear system described as

x (k) = f (k - 1, x (k - 1)) + q (k - 1),

(7)

y (k) = h (k, x (k)) + r (k),

(8)

where

x (k) \in R^{n}

denotes the n-dimensional state vector of the system at time step k,

y (k) \in R^{m}

denotes an m-dimensional measurement vector. f represents a nonlinear system function, and h is a nonlinear measurement function and they are assumed to be continuously differentiable. The process noise

q (k - 1)

and measurement noise

r (k)

are mutually uncorrelated, with zero mean values and covariance matrices

E [q (k - 1) q^{T} (k - 1)] = Q (k - 1), E [r (k) r^{T} (k)] = R (k) .

(9)

In general, similar to KF, the UKF includes two steps, namely predict and update:

2.2.1. Predict

First, a set of

2 n + 1

sigma points are computed by the estimated state and covariance matrix of last time step:

\begin{matrix} χ^{0} (k - 1 | k - 1) = & \hat{x} (k - 1 | k - 1), \\ χ^{i} (k - 1 | k - 1) = & \hat{x} (k - 1 | k - 1) \\ + {(\sqrt{(n + λ) P (k - 1 | k - 1)})}_{i}, i = 1, \dots, n, \\ χ^{i} (k - 1 | k - 1) = & \hat{x} (k - 1 | k - 1) \\ - {(\sqrt{(n + λ) P (k - 1 | k - 1)})}_{i - n}, i = n + 1, \dots, 2 n, \end{matrix}

(10)

where

{(\sqrt{(n + λ) P (k - 1 | k - 1)})}_{i}

is the i-th column of the matrix square root of

(n + λ) P (k - 1 | k - 1)

, n denotes the dimension of the state vector, and λ represents a compound scaling factor defined by

λ = α^{2} (n + ε) - n,

(11)

where α determines the extent of the spread of the sigma points, which is usually chosen as

0 < α \leq 1

, ε is a scaling factor and is usually set to

3 - n

.

The transformed set is given through the process model

{χ^{i}}^{*} (k | k - 1) = f (k - 1, χ^{i} (k - 1 | k - 1)), i = 0, \dots, 2 n .

(12)

Then, the predicted state mean and associated covariance matrix are calculated by

\hat{x} (k | k - 1) = \sum_{i = 0}^{2 n} w_{m}^{i} {χ^{i}}^{*} (k | k - 1),

(13)

\begin{matrix} P (k | k - 1) = & \sum_{i = 0}^{2 n} w_{c}^{i} [{χ^{i}}^{*} (k | k - 1) - \hat{x} (k | k - 1)] \\ \times {[{χ^{i}}^{*} (k | k - 1) - \hat{x} (k | k - 1)]}^{T} + Q (k - 1), \end{matrix}

(14)

where the corresponding weights of the state and covariance matrix are defined as

\begin{matrix} w_{m}^{0} = \frac{λ}{(n + λ)}, \\ w_{c}^{0} = λ / (n + λ) + (1 - α^{2} + β), \\ w_{m}^{i} = w_{c}^{i} = \frac{1}{2 (n + λ)}, i = 1, \dots, 2 n . \end{matrix}

(15)

with β being related to prior knowledge of the distribution of

x (k)

(for Gaussian noises,

β = 2

is optimal).

2.2.2. Update

Similarly, a set of

2 n + 1

sigma points are computed by the predicted state mean and covariance matrix

\begin{matrix} χ^{0} (k | k - 1) = & \hat{x} (k | k - 1), \\ χ^{i} (k | k - 1) = & \hat{x} (k | k - 1) \\ + & {(\sqrt{(n + λ) P (k | k - 1)})}_{i}, i = 1, \dots, n, \\ χ^{i} (k | k - 1) = & \hat{x} (k | k - 1) \\ - & {(\sqrt{(n + λ) P (k | k - 1)})}_{i - n}, i = n + 1, \dots, 2 n . \end{matrix}

(16)

The transformed set is given through the measurement model

γ^{i} (k) = h (k, χ^{i} (k | k - 1)), i = 0, \dots, 2 n .

(17)

Then, the predicted measurement mean and covariance matrix are calculated by

\hat{y} (k) = \sum_{i = 0}^{2 n} w_{m}^{i} γ^{i} (k),

(18)

P_{yy} = \sum_{i = 0}^{2 n} w_{c}^{i} [γ^{i} (k) - \hat{y} (k)] {[γ^{i} (k) - \hat{y} (k)]}^{T} + R (k) .

(19)

In addition, the state-measurement cross-covariance matrix is computed by

P_{xy} (k) = \sum_{i = 0}^{2 n} w_{c}^{i} [χ^{i} (k | k - 1) - \hat{x} (k | k - 1)] {[γ^{i} (k) - \hat{y} (k)]}^{T} .

(20)

Next, the Kalman gain is given as

K (k) = P_{xy} P_{yy}^{- 1} .

(21)

Finally, the filter state in this case is obtained by

\hat{x} (k | k) = \hat{x} (k | k - 1) + K (k) (y (k) - \hat{y} (k)) .

(22)

Additionally, the filter covariance is recursively updated as

P (k | k) = P (k | k - 1) - K (k) P_{yy} K^{T} (k) .

(23)

3. Unscented Kalman Filter under MCC

To improve the robustness of the UKF, we present the idea to combine the MCC cost function with the UKF framework to derive a novel UKF, which may perform much better in non-Gaussian noise environments, since correntropy contains second and higher order moments of the error.

First, we consider a nonlinear model given by Equation (7) and Equation (8), combine expression Equations (13) and (14) and recast the nonlinear regression model as

[\begin{matrix} \hat{x} (k | k - 1) \\ y (k) \end{matrix}] = [\begin{matrix} x (k) \\ h (k, x (k)) \end{matrix}] + ψ (k),

(24)

where

ψ (k)

denotes by

ψ (k) = [\begin{matrix} η (x (k)) \\ r (k) \end{matrix}],

(25)

with

\begin{matrix} Ψ (k) \\ = & E [ψ (k) ψ^{T} (k)] \\ = & [\begin{matrix} P (k | k - 1) & 0 \\ 0 & R (k) \end{matrix}] \\ = & [\begin{matrix} T_{p} (k | k - 1) T_{p}^{T} (k | k - 1) & 0 \\ 0 & T_{r} (k) T_{r}^{T} (k) \end{matrix}] \\ = & T (k) T^{T} (k), \end{matrix}

(26)

where

T (k)

can be obtained by Cholesky decomposition of

Ψ (k)

. Left multiplying both sides of Equation (24) by

T^{- 1} (k)

, we have the following equation

D (k) = g (k, x (k)) + e (k),

(27)

where

D (k) = T^{- 1} (k) [\begin{matrix} \hat{x} (k | k - 1) \\ y (k) \end{matrix}]

,

g (k, x (k)) = T^{- 1} (k) [\begin{matrix} x (k) \\ h (k, x (k)) \end{matrix}]

,

e (k) = T^{- 1} (k) ψ (k)

and the covariance of

e (k)

is the identity matrix.

Then, we define the following cost function

J_{L} (x (k))

based MCC:

J_{L} (x (k)) = \sum_{i = 1}^{L} G_{σ} (d_{i} (k) - g_{i} (k, x (k))),

(28)

where

d_{i} (k)

is the i-th element of

D (k)

,

g_{i} (k, x (k))

is the i-th row of

g (k, x (k))

, and

L = n + m

is the dimension of

D (k)

.

Under the MCC, the optimal estimate of

x (k)

can be found by maximizing the cost function

\hat{x} (k) = \arg max_{x (k)} J_{L} (x (k)) = \arg max_{x (k)} \sum_{i = 1}^{L} G_{σ} (e_{i} (k)),

(29)

where

e_{i} (k)

is the i-th element of

e (k)

and

e_{i} (k) = d_{i} (k) - g_{i} (k, x (k)) .

(30)

Hence, the optimal solution for

x (k)

can be found from the following equation

\frac{\partial J_{L} (x (k))}{\partial x (k)} = 0,

(31)

which can also be expressed as

\sum_{i = 1}^{L} φ (e_{i} (k)) \frac{\partial e_{i} (k)}{\partial x_{i} (k)} = 0,

(32)

when

x_{i} (k)

is the i-th element of

x (k)

, and

φ (e_{i} (k)) = G_{σ} (e_{i} (k)) \cdot e_{i} (k)

. Then, we define

C (k, i) = φ (e_{i} (k)) / e_{i} (k)

and have

C (k, i) = G_{σ} (e_{i} (k)) .

(33)

Equation (32) further writes in matrix form as

{(\frac{\partial g (k, x (k))}{\partial x (k)})}^{T} C (k) [D (k) - g (k, x (k))] = 0,

(34)

where

C (k) = [\begin{matrix} C_{x} (k) & 0 \\ 0 & C_{y} (k) \end{matrix}],

with

C_{x} (k) = d i a g (G_{σ} (e_{1} (k)), . . ., G_{σ} (e_{n} (k))), C_{y} (k) = d i a g (G_{σ} (e_{n + 1} (k)), . . ., G_{n + m} (e_{n + m} (k))) .

Based on Equation (27) with only one iteration in [32], we could obtain similar results in the UKF framework by using

C (k)

to reformulate the measurement information. Similar to [16], there are two ways to do it. One is to re-weight the residual error covariance depending on the value of

e_{i} (k) = d_{i} (k) - g_{i} (k, x (k))

, the other is to reconstruct the ’pseudo-observation’. In general, the derived results based on two ways are the same. In this paper, we just describe the algorithm based on the first way for simplicity.

Defining

\tilde{Ψ} (k)

is the modified covariance, and could be given as

\tilde{Ψ} (k) = T (k) C^{- 1} (k) T^{T} (k) .

(35)

For our next discussion, we write

\tilde{Ψ} (k)

into two portions so that

\tilde{Ψ} (k) = [\begin{matrix} {\tilde{Ψ}}_{x} (k) & 0 \\ 0 & {\tilde{Ψ}}_{y} (k) \end{matrix}] .

(36)

In fact, since the true state

x (k)

is unknown, we set

η (x (k)) = \hat{x} (k | k - 1) - x (k) = 0

. In this case, we can see easily that

\begin{matrix} {\tilde{Ψ}}_{x} (k) = & T_{p} (k | k - 1) * I * T_{p}^{T} (k | k - 1) \\ = & P (k | k - 1) . \end{matrix}

(37)

The modified measurement covariance can be obtained

\tilde{R} (k) = {\tilde{Ψ}}_{y} (k) .

(38)

In the above derivation, the MCUKF algorithm can be summarized as follows:

Choose a proper kernel bandwidth σ; set an initial estimate $\hat{x} (0 | 0)$ and corresponding covariance matrix $P (0 | 0)$ ; and let $k = 1$ ;
Use Equations (10) and (12) to calculate the sigma points and the propagated sigma points with respect to function f;
Compute predicted estimate $\hat{x} (k | k - 1)$ and covariance matrix $P (k | k - 1)$ by Equations (13) and (14), and adopt Cholesky decomposition to get $T_{p} (k | k - 1)$ ;
Use Equations (16) and (17) to calculate the sigma points and the propagated sigma points with respect to function h;
Use Equations (24)–(38) to derive the modified $\tilde{R} (k)$ ; compute the predicted measurement mean by Equation (18); and replace $R (k)$ with $\tilde{R} (k)$ in Equation (19) and have the following covariance matrix:

$P_{yy} = \sum_{i = 0}^{2 n} w_{c}^{i} [γ^{i} (k) - \hat{y} (k)] {[γ^{i} (k) - \hat{y} (k)]}^{T} + \tilde{R} (k) .$

(39)
Use Equations (20)–(23) to compute the filter state mean and covariance matrix; go back to 2.

Remark 1.

As one can see, different from the UKF algorithm, the MCUKF uses a non-linear regression model combined MCC to update the measurement information. Note that the kernel bandwidth σ is a key parameter in MCUKF. In general, a smaller kernel bandwidth makes the algorithm more robust (with respect to outliers). However, when the kernel bandwidth is too small, it may result in the filter divergence or accuracy deterioration. The reason for this can be seen in [34]. On the other hand, when σ becomes larger and larger, the MCUKF will behave more and more like the UKF algorithm. In particular, the following theorem holds.

Theorem 1.

If the kernel bandwidth σ approaches infinity, the MCUKF will reduce to the traditional UKF.

Proof of Theorem 1.

The proof is simple, we give a short explanation. Note that as

σ \to \infty

, we have

C (k) \to I

. In this case, the MCUKF is equal to the traditional UKF. ☐

Thus, it is seen that the kernel bandwidth has significant influences on the behavior of MCUKF. In practical applications, the kernel bandwidth can be set manually or optimized by trial and error methods.

4. Illustrative Examples

This section demonstrates the performances of the proposed algorithm by two illustrative examples. In the paper, we mainly compare the estimation performance based on the following benchmarks:

{MSE}_{1} (m) = \frac{1}{K} \sum_{k = 1}^{K} {(x (k) - \hat{x} (k | k))}^{2}, m = 1, \dots, M,

(40)

MSE = \frac{1}{M} \sum_{m = 1}^{M} {MSE}_{1} (m),

(41)

{MSD}_{p} (k) = \frac{1}{M} \sum_{m = 1}^{M} ({(x (k) - \hat{x} (k | k))}^{2} + {(y (k) - \hat{y} (k | k))}^{2} + {(z (k) - \hat{z} (k | k))}^{2}), k = 1, \dots, K,

(42)

{MSD}_{v} (k) = \frac{1}{M} \sum_{m = 1}^{M} ({(\dot{x} (k) - \hat{\dot{x}} (k | k))}^{2} + {(\dot{y} (k) - \hat{\dot{y}} (k | k))}^{2} + {(\dot{z} (k) - \hat{\dot{z}} (k | k))}^{2}), k = 1, \dots, K,

(43)

{TAMSD}_{p} = \frac{1}{k_{2} - k_{1}} \sum_{k = k_{1} + 1}^{k_{2}} {MSD}_{p} (k),

(44)

{TAMSD}_{v} = \frac{1}{k_{2} - k_{1}} \sum_{k = k_{1} + 1}^{k_{2}} {MSD}_{v} (k),

(45)

where K is the entire time steps in every Monte Carlo run and M represents the total Monte Carlo runs.

4.1. Example 1

Now, a univariate nonstationary growth model (UNGM) that is often used as a benchmark example for nonlinear filtering is considered, whose state and measurement equations are given by

x (k) = α_{1} x (k - 1) + α_{2} \frac{x (k - 1)}{1 + x {(k - 1)}^{2}} + α_{3} \cos (1.2 (k - 1)) + q (k - 1),

(46)

y (k) = \frac{x {(k)}^{2}}{20} + r (k) .

(47)

The parameters are set to

α_{1} = 0.5, α_{2} = 25, α_{3} = 8

.

First, we consider the case in which all the noises are Gaussian

\begin{matrix} q (k - 1) \sim N (0, 1), \\ r (k) \sim N (0, 1) . \end{matrix}

In this example, the parameters are chosen as

K = 500, M = 100

. Table 1 illustrates the

MSE

s of x, defined in Equation (41), in Gaussian noise. Since all the noises are Gaussian, the UKF gives the smallest

MSE

in all filters. Moreover, it is noted that when the kernel bandwidth is small, the MCUKF may result in a worse estimation; in contrast, when the kernel bandwidth becomes larger, its performance will approach that of the UKF. Actually, it has been proved easily that when

σ \to \infty

, the MCUKF will reduce to the traditional UKF (see Theorem 1). Therefore, one should choose a larger kernel bandwidth in Gaussian noises.

Second, we change the observation noise into a heavy-tailed non-Gaussian noise, with a mixed-Gaussian distribution

\begin{matrix} q (k - 1) \sim N (0, 1), \\ r (k) \sim 0.8 N (0, 1) + 0.2 N (0, 500) . \end{matrix}

Table 2 shows the

MSE

s of x in non-Gaussian measurement noise. As one can see, in this case, when kernel bandwidth is too small or too large, the performance of MCUKF will be not good. However, with a proper kernel bandwidth (say

σ = 2.0

), the MCUKF can outperform the UKF, achieving the smallest

MSE

. Again, when σ is very large, MCUKF achieves almost the same performance as the UKF.

4.2. Example 2

Finally, we consider a practical example with respect to the relative motion of two spacecrafts, which is illustrated in Figure 1. One of the spacecrafts is called the chief spacecraft, which is moving on the reference orbit, and the other is the deputy spacecraft. They all revolve around the earth and thus the inertial orbital equations of two spacecraft are given as

{\ddot{r}}_{c} = - \frac{μ}{r_{c}^{3}} r_{c},

(48)

{\ddot{r}}_{d} = - \frac{μ}{r_{d}^{3}} r_{d},

(49)

where

r_{c}

and

r_{d}

are the position vectors of the chief spacecraft and the deputy spacecraft in ECI coordinate frame,

r_{c}

and

r_{d}

are the norms of

r_{c}

and

r_{d}

, respectively, and μ is the gravitational parameter of the earth.

The position vector of the deputy spacecraft relative to the chief spacecraft is

æ_{d c} = r_{d} - r_{c} .

(50)

Here, we use the Hill coordinate frame, which is a rectangular, Cartesian, dextral rotating frame centered on the chief spacecraft and refer to it as the local-vertical-local-horizontal (LVLH) frame. Then, the motion of the deputy spacecraft relative to the chief spacecraft in the Hill coordinate frame can be described in [35] as

\ddot{x} = 2 ω \dot{y} + \dot{ω} y + ω^{2} x + \frac{μ}{r_{c}^{2}} - \frac{μ (r_{c} + x)}{{[{(r_{c} + x)}^{2} + y^{2} + z^{2}]}^{3 / 2}} + q_{1},

(51)

\ddot{y} = - 2 ω \dot{x} - \dot{ω} x + ω^{2} y - \frac{μ y}{{[{(r_{c} + x)}^{2} + y^{2} + z^{2}]}^{3 / 2}} + q_{2},

(52)

\ddot{z} = - \frac{μ z}{{[{(r_{c} + x)}^{2} + y^{2} + z^{2}]}^{3 / 2}} + q_{3},

(53)

with

{\ddot{r}}_{c} = r_{c} ω^{2} - \frac{μ}{r_{c}^{2}},

(54)

\dot{ω} = - \frac{2 {\dot{r}}_{c} ω}{r_{c}},

(55)

where x, y and z are the radial, in-track and cross-track coordinates of the Hill frame, and ω denotes the orbital angular velocity of the chief spacecraft.

The radar is set on the chief spacecraft to obtain the measurements, and the measurement coordinate system is showed in Figure 2. The measurement equations are given as

ρ = \sqrt{(x^{2} + y^{2} + z^{2})} + r_{1},

(56)

θ = \tan^{- 1} \frac{y}{x} + r_{2},

(57)

ϕ = \tan^{- 1} \frac{z}{\sqrt{x^{2} + y^{2}}} + r_{3},

(58)

where ρ is the relative range between the chief spacecraft and deputy spacecraft, θ is the azimuth angle, and

ϕ

is the elevation angle.

The two spacecrafts are on the elliptic low Earth orbits. The initial six orbital elements of the chief spacecraft are showed in Table 3, and the trajectory of which in ECI coordinate frame is propagated by the fixed-step fourth-order Runge–Kutta algorithm. The prediction of filters is performed every 0.1 s and the measurements record from the radar every 1 s. The state vector

x (k) = {[\begin{matrix} x (k) & y (k) & z (k) & \dot{x} (k) & \dot{y} (k) & \dot{z} (k) \end{matrix}]}^{T}

contains the relative position and velocity components in Hill frame, respectively. The initial true state is

\begin{matrix} x (0) = 31.9262 km, \dot{x} (0) = - 0.005583 km / s, \\ y (0) = - 7.1384 km, \dot{y} (0) = - 0.071774 km / s, \\ z (0) = 33.4729 km, \dot{z} (0) = 0.026249 km / s, \end{matrix}

and the initial estimates and covariance matrix of the states are chosen as

\begin{matrix} \hat{x} (0 | 0) = 31.9262 km, \hat{\dot{x}} (0 | 0) = - 0.004416 km / s, \\ \hat{y} (0 | 0) = - 8.1384 km, \hat{\dot{y}} (0 | 0) = - 0.061774 km / s, \\ \hat{z} (0 | 0) = 32.4729 km, \hat{\dot{z}} (0 | 0) = 0.036249 km / s, \\ P (0 | 0) = d i a g ([1, 1, 1, {0.01}^{2}, {0.01}^{2}, {0.01}^{2}]) . \end{matrix}

In this example, 100 independent Monte Carlo runs have been conducted, and, in each case, an elapsed time of 7200 s is considered, that is

K = 7200, M = 100

. Since correntropy is a local similarity measure, the MCUKF may converge to the optimal solution slowly, especially when the initial values deviate greatly from the true values. Thus, we use the UKF during the first 100 s to make the process settle down and then switch to the MCUKF to continue. The performance of EKF, Huber-EKF (HEKF) [36], UKF and novel robust UKF (NRUKF) [16] are shown for comparison with that of the proposed algorithm.

First, we consider the case in which all the noises are Gaussian, that is,

\begin{matrix} q_{1} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), r_{1} \sim N (0, {(10^{- 3} km)}^{2}), \\ q_{2} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), r_{2} \sim N (0, {(0.05 π / 180)}^{2}), \\ q_{3} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), r_{3} \sim N (0, {(0.05 π / 180)}^{2}) . \end{matrix}

Figure 3 describes the relative motion of the deputy spacecraft in the Hill coordinate frame, Figure 4 and Figure 5 demonstrate the

{MSD}_{p}

and

{MSD}_{v}

, defined in Equations (42) and (43), with different filters in Gaussian noises, and Table 4 summarizes the corresponding

{TAMSD}_{p}

and

{TAMSD}_{v}

, defined in Equations (44) and (45) (the parameter is set to

k_{1} = 1000, k_{2} = 7200

). Those results illustrate that the UKF play the best performance in all filters in this case and the UKF type filters have the better performance than the EKF type counterparts. One can also observe that the robust filters do not perform as well as their non-robust counterparts in the Gaussian noises. Moreover, it is worth noting that when the kernel bandwidth is small, the MCUKF may achieve a worse performance; whereas, when the kernel bandwidth becomes larger, it has similar results to the UKF.

Second, we consider the case in which the measurement noises are heavy-tailed, with mixed-Gaussian distributions

\begin{matrix} q_{1} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), \\ q_{2} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), \\ q_{3} \sim N (0, {(10^{- 7} km / s^{2})}^{2}), \\ r_{1} \sim 0.9 N (0, {(10^{- 3} km)}^{2}) + 0.1 N (0, {(10^{- 2} km)}^{2}), \\ r_{2} \sim 0.9 N (0, {(0.05 π / 180)}^{2}) + 0.1 N (0, {(0.5 π / 180)}^{2}), \\ r_{3} \sim 0.9 N (0, {(0.05 π / 180)}^{2}) + 0.1 N (0, {(0.5 π / 180)}^{2}) . \end{matrix}

Figure 6 and Figure 7 reveal the

{MSD}_{p}

and

{MSD}_{v}

with different filters in non-Gaussian noises, and Table 5 lists the corresponding

{TAMSD}_{p}

and

{TAMSD}_{v}

. As one can observe again, the UKF type filters give a smaller

{TAMSD}_{p}

and

{TAMSD}_{v}

than the EKF type counterparts. All the robust filters are superior to their non-robust counterparts in impulsive noises. It is noted that when the kernel bandwidth is very large, MCUKF achieves almost the same performance as the UKF. However, with a proper kernel bandwidth, the MCUKF can outperform the UKF significantly. Particularly, when

σ = 2.0

, the MCUKF exhibits the smallest

{TAMSD}_{p}

and

{TAMSD}_{v}

.

Moreover, to compare the computational cost, the computation times of different filters in this example are shown in Table 6. We can see that the computation time of the MCUKF is moderate compared with the UKF, and is superior to that of the HEKF and NRUKF.

5. Conclusions

In this paper, we propose a new unscented Kalman filter (UKF), namely the maximum correntropy unscented Kalman filter (MCUKF), which shows strong robustness against heavy-tailed non-Gaussian noises. The proposed algorithm is recast in the form of a non-linear regression model and makes use of the MCC to obtain the filter estimates. The MCUKF is then applied to the spacecraft relative state estimation, compared with EKF, HEKF, UKF and NRUKF. Simulation results confirm that, with a large kernel bandwidth, the MCUKF will behave like the original UKF. With a proper kernel bandwidth, the new filter can achieve better performance than other filters, especially when the measurement system is disturbed by some impulsive non-Gaussian noises.

Acknowledgments

This work was supported by the 863 Program (No. 2015AA015702) and the National Natural Science Foundation of China (No. 61371087).

Author Contributions

Xi Liu proposed the main idea, wrote the draft manuscript and presented the experiments; Hua Qu and Jihong Zhao polished the language and were in charge of technical checking; Pengcheng Yue and Meng Wang analyzed the data and drew the figures and tables.

Conflicts of Interest

The authors declare no conflict of interest.

References

Marszalek, M.; Kurz, O.; Drentschew, M.; Schmidt, M.; Schilling, K. Intersatellite Links and Relative Navigation: Pre-conditions for Formation Flights with Pico- and Nanosatellites. In Proceedings of the 18th IFAC World Congress, Milan, Italy, 29 August–3 September 2011; pp. 3027–3032.
Liu, X.; Sun, Z.; Zhang, J.; Wu, X. Relative Navigation Method of Communication Supporting Spacecraft base-on Particle Filter. J. Comput. Inf. Syst. 2013, 9, 1–8. [Google Scholar]
Wu, F.; Sui, X.; Zhao, Y.; Zhang, Y. Relative navigation for formation flying spacecrafts using X-ray pulsars. In Proceedings of the IEEE/ION Position Location and Navigation Symposium (PLANS), Myrtle Beach, SC, USA, 23–26 April 2012; pp. 1289–1292.
Kelsey, J.; Byrne, J.; Cosgrove, M.; Seereeram, S.; Mehra, R. Vision-based relative pose estimation for autonomous rendezvous and docking. In Proceedings of the IEEE Aerospace Conference, Big Sky, MT, USA, 4–11 March 2006; pp. 1–20.
Kalman, R.E. A new approach to linear filtering and prediction problems. Trans. ASME-J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
Bryson, A.; Ho, Y. Applied Optimal Control: Optimization, Estimation and Control; CRC Press: Boca Raton, FL, USA, 1975. [Google Scholar]
Nahi, N.E. Estimation Theory and Applications; Wiley: New York, NY, USA, 1969. [Google Scholar]
Hablani, H.B.; Tapper, M.L.; Dana-Bashian, D.J. Guidance and Relative Navigation for Autonomous Rendezvous in a Circular Orbit. Trans. ASME-J. Basic Eng. 2002, 25, 553–562. [Google Scholar] [CrossRef]
Baek, K.; Bang, H. Adaptive sparse grid quadrature filter for spacecraft relative navigation. Acta Astronaut. 2013, 87, 96–106. [Google Scholar] [CrossRef]
Anderson, B.; Moore, J. Optimal Filtering; Prentice-Hall: Englewood Cliffs, NJ, USA, 1979. [Google Scholar]
Haykin, S. Kalman Filtering and Neural Networks; John Wiley & Sons: New York, NY, USA, 2001. [Google Scholar]
Simon, D. Optimal State Estimation: Kalman, H_∞ and Nonlinear Approaches; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control 2000, 45, 477–482. [Google Scholar] [CrossRef]
Schick, I.; Mitter, S. Robust Recursive Estimation in the Presence of Heavy-Tailed Observation Noise. Ann. Stat. 1994, 22, 1045–1080. [Google Scholar] [CrossRef]
Huber, P. Robust Statistics; Springer: Berlin, Germany, 2011. [Google Scholar]
Chang, L.; Hu, B.; Chang, G.; Li, A. Huber-based novel robust unscented Kalman filter. IET Sci. Meas. Technol. 2012, 6, 502–509. [Google Scholar] [CrossRef]
Principe, J.C. Information Theoretic Learning: Renyi’s Entropy and Kernel Perspectives; Springer: New York, NY, USA, 2010. [Google Scholar]
Chen, B.; Zhu, Y.; Hu, J.; Principe, J.C. System Parameter Identification: Information Criteria and Algorithms; Newnes: Oxford, UK, 2013. [Google Scholar]
Liu, W.; Pokharel, P.P.; Principe, J.C. Correntropy: Properties, and applications in non-Gaussian signal processing. IEEE Trans. Signal Process. 2007, 55, 5286–5298. [Google Scholar] [CrossRef]
Chen, B.; Principe, J.C. Maximum correntropy estimation is a smoothed MAP estimation. IEEE Signal Process. Lett. 2012, 19, 491–494. [Google Scholar] [CrossRef]
Ma, W.; Qu, H.; Zhao, J. Estimator with Forgetting Factor of Correntropy and Recursive Algorithm for Traffic Network Prediction. In Proceedings of the Chinese Control and Decision Conference (CCDC), Guiyang, China, 25–27 May 2013; pp. 490–494.
Chen, X.; Yang, J.; Liang, J.; Ye, Q. Recursive robust least squares support vector regression based on maximum correntropy criterion. Neurocomputing 2012, 97, 63–73. [Google Scholar] [CrossRef]
He, R.; Hu, B.; Yuan, X.; Wang, L. Robust Recognition via Information Theoretic Learning; Springer: Amsterdam, The Netherlands, 2014. [Google Scholar]
He, R.; Zheng, W.; Hu, B. Maximum correntropy criterion for robust face recognition. IEEE Trans. Pattern Anal. 2011, 33, 1561–1576. [Google Scholar]
He, R.; Hu, B.; Zheng, W.; Kong, X. Robust principal component analysis based on maximum correntropy criterion. IEEE Trans. Image Process. 2011, 20, 1485–1494. [Google Scholar] [PubMed]
Chen, L.; Qu, H.; Zhao, J.; Chen, B.; Principe, J.C. Efficient and robust deep learning with Correntropy-induced loss function. Neural Comput. Appl. 2016, 27, 1019–1031. [Google Scholar] [CrossRef]
Bessa, R.J.; Miranda, V.; Gama, J. Entropy, and correntropy against minimum square error in offline, and online three-day ahead wind power forecasting. IEEE Trans. Power Syst. 2009, 24, 1657–1666. [Google Scholar] [CrossRef]
Ma, W.; Qu, H.; Gui, G.; Xu, L.; Zhao, J.; Chen, B. Maximum correntropy criterion based sparse adaptive filtering algorithms for robust channel estimation under non-Gaussian environments. J. Frankl. Inst. 2015, 352, 2708–2727. [Google Scholar] [CrossRef]
Chen, B.; Xing, L.; Liang, J.; Zheng, N.; Principe, J.C. Steady-State Mean-Square Error Analysis for Adaptive Filtering under the Maximum Correntropy Criterion. IEEE Signal Process. Lett. 2014, 21, 880–884. [Google Scholar]
Shi, L.; Lin, Y. Convex Combination of Adaptive Filters under the Maximum Correntropy Criterion in Impulsive Interference. IEEE Signal Process. Lett. 2014, 21, 1385–1388. [Google Scholar] [CrossRef]
Chen, B.; Xing, L.; Zhao, H.; Zheng, N.; Principe, J.C. Generalized correntropy for robust adaptive filtering. IEEE Trans. Signal Process. 2016, 64, 3376–3387. [Google Scholar] [CrossRef]
Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum Correntropy Kalman Filter. Available online: http://arxiv.org/abs/1509.04580 (accessed on 18 September 2016).
Liu, X.; Chen, B.; Xu, B.; Wu, Z.; Honeine, P. Maximum Correntropy Unscented Filter. Available online: http://arxiv.org/abs/1608.07526 (accessed on 18 September 2016).
Chen, B.; Wang, J.; Zhao, H.; Zheng, N.; Principe, J.C. Convergence of a fixed-point algorithm under maximum correntropy criterion. IEEE Signal Process. Lett. 2015, 22, 1723–1727. [Google Scholar] [CrossRef]
Schaub, H.; Junkins, J.L. Analytical Mechanics of Space Systems; American Institute of Aeronautics and Astronautics Education Series: Reston, VA, USA, 2003. [Google Scholar]
El-Hawary, F.; Jing, Y. Robust Regression-Based EKF for Tracking Underwater Targets. IEEE J. Ocean. Eng. 1995, 20, 31–41. [Google Scholar] [CrossRef]

Figure 1. Illustration of example 2.

Figure 2. Measurement coordinate system.

Figure 3. Relative motion of the deputy spacecraft in the Hill frame.

Figure 4.

{MSD}_{p}

with different filters in Gaussian noises.

Figure 4.

{MSD}_{p}

with different filters in Gaussian noises.

Figure 5.

{MSD}_{v}

with different filters in Gaussian noises.

Figure 5.

{MSD}_{v}

with different filters in Gaussian noises.

Figure 6.

{MSD}_{p}

with different filters in non-Gaussian noises.

Figure 6.

{MSD}_{p}

with different filters in non-Gaussian noises.

Figure 7.

{MSD}_{v}

with different filters in non-Gaussian noises.

Figure 7.

{MSD}_{v}

with different filters in non-Gaussian noises.

Table 1.

MSE

s of x in Gaussian noises.

**Table 1.** $MSE$ s of x in Gaussian noises.
Filter	MSE of x
UKF	67.6974
MCUKF $(σ = 2.0)$	87.6836
MCUKF $(σ = 3.0)$	80.8406
MCUKF $(σ = 5.0)$	74.0286
MCUKF $(σ = 10)$	72.3362
MCUKF $(σ = 20)$	68.6795

Table 2.

MSE

s of x in non-Gaussian measurement noise.

**Table 2.** $MSE$ s of x in non-Gaussian measurement noise.
Filter	MSE of x
UKF	85.8439
MCUKF $(σ = 1.0)$	84.1944
MCUKF $(σ = 2.0)$	82.6933
MCUKF $(σ = 3.0)$	83.1098
MCUKF $(σ = 5.0)$	84.7173
MCUKF $(σ = 10)$	85.4411

Table 3. Initial orbital elements of chief spacecraft.

**Table 3.** Initial orbital elements of chief spacecraft.
Orbital Elements	Chief Spacecraft
Semi-major axis	8000 km
Eccentricity	0.150
Orbit inclination	$π / 6$ rad
Argument of perigee	$π / 6$ rad
Right ascension of the ascending node	$π / 18$ rad
True anomaly	0 rad

Table 4.

{TAMSD}_{p}

and

{TAMSD}_{v}

in Gaussian noises.

**Table 4.** ${TAMSD}_{p}$ and ${TAMSD}_{v}$ in Gaussian noises.
Filter	${TAMSD}_{p}$	${TAMSD}_{v}$
EKF	$9.6397 \times 10^{- 6}$	$3.7704 \times 10^{- 11}$
HEKF	$1.0742 \times 10^{- 5}$	$3.9391 \times 10^{- 11}$
UKF	$8.4386 \times 10^{- 6}$	$2.6552 \times 10^{- 11}$
NRUKF	$9.0277 \times 10^{- 6}$	$2.7508 \times 10^{- 11}$
MCUKF $(σ = 2.0)$	$9.3516 \times 10^{- 6}$	$2.8038 \times 10^{- 11}$
MCUKF $(σ = 5.0)$	$8.4909 \times 10^{- 6}$	$2.6625 \times 10^{- 11}$
MCUKF $(σ = 20)$	$8.4403 \times 10^{- 6}$	$2.6554 \times 10^{- 11}$

Table 5.

{TAMSD}_{p}

and

{TAMSD}_{v}

in non-Gaussian noises.

**Table 5.** ${TAMSD}_{p}$ and ${TAMSD}_{v}$ in non-Gaussian noises.
Filter	${TAMSD}_{p}$	${TAMSD}_{v}$
EKF	$1.0680 \times 10^{- 4}$	$2.2751 \times 10^{- 10}$
HEKF	$3.2302 \times 10^{- 5}$	$7.5898 \times 10^{- 11}$
UKF	$4.2884 \times 10^{- 5}$	$8.4705 \times 10^{- 11}$
NRUKF	$2.5510 \times 10^{- 5}$	$5.7120 \times 10^{- 11}$
MCUKF $(σ = 2.0)$	$2.4842 \times 10^{- 5}$	$5.6557 \times 10^{- 11}$
MCUKF $(σ = 5.0)$	$3.1545 \times 10^{- 5}$	$6.6535 \times 10^{- 11}$
MCUKF $(σ = 20)$	$4.1800 \times 10^{- 5}$	$8.3054 \times 10^{- 11}$

Table 6. Computation time comparison.

**Table 6.** Computation time comparison.
Filter	Computation Ratio
UKF	1
HEKF	$6.81$
NRUKF	$4.62$
MCUKF	$3.91$

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, X.; Qu, H.; Zhao, J.; Yue, P.; Wang, M. Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation. Sensors 2016, 16, 1530. https://doi.org/10.3390/s16091530

AMA Style

Liu X, Qu H, Zhao J, Yue P, Wang M. Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation. Sensors. 2016; 16(9):1530. https://doi.org/10.3390/s16091530

Chicago/Turabian Style

Liu, Xi, Hua Qu, Jihong Zhao, Pengcheng Yue, and Meng Wang. 2016. "Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation" Sensors 16, no. 9: 1530. https://doi.org/10.3390/s16091530

APA Style

Liu, X., Qu, H., Zhao, J., Yue, P., & Wang, M. (2016). Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation. Sensors, 16(9), 1530. https://doi.org/10.3390/s16091530

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Maximum Correntropy Unscented Kalman Filter for Spacecraft Relative State Estimation

Abstract

1. Introduction

2. Preliminaries

2.1. Maximum Correntropy Criterion

2.2. Unscented Kalman Filter

2.2.1. Predict

2.2.2. Update

3. Unscented Kalman Filter under MCC

4. Illustrative Examples

4.1. Example 1

4.2. Example 2

5. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI