Abstract
A new algorithm called maximum correntropy unscented Kalman filter (MCUKF) is proposed and applied to relative state estimation in space communication networks. As is well known, the unscented Kalman filter (UKF) provides an efficient tool to solve the non-linear state estimate problem. However, the UKF usually plays well in Gaussian noises. Its performance may deteriorate substantially in the presence of non-Gaussian noises, especially when the measurements are disturbed by some heavy-tailed impulsive noises. By making use of the maximum correntropy criterion (MCC), the proposed algorithm can enhance the robustness of UKF against impulsive noises. In the MCUKF, the unscented transformation (UT) is applied to obtain a predicted state estimation and covariance matrix, and a nonlinear regression method with the MCC cost is then used to reformulate the measurement information. Finally, the UT is adopted to the measurement equation to obtain the filter state and covariance matrix. Illustrative examples demonstrate the superior performance of the new algorithm.
1. Introduction
The relative state estimation problem is rather important in space communication networks, including inter-satellite link (ISL) establishment, communication support, multiple spacecraft formation flying, and spacecraft rendezvous and docking [1,2,3,4]. Many relative motion scenes make use of the radar or/and ladar to obtain the relative measurement information among spacecrafts, which is necessary to determine the relative motions.
As for the state estimation problem, the Kalman filter (KF) is often used for a linear dynamic system, which is, in essence, a recursive minimum -norm linear filter [5,6,7]. However, from [8], we know that the relative state estimation in a circular reference orbit involves a linear state Clohessy-Wiltshire (CW) equation and a nonlinear measurement equation. Especially, when the reference orbit is elliptic, the dynamic state equation is also nonlinear [9]. To handle the nonlinear problems, the widely used methods are the extended Kalman filter (EKF) [10,11,12] and unscented Kalman filter (UKF) [11,12,13]. The EKF is a popular nonlinear extension of KF, which makes use of the first order Taylor series expansions to approximate the nonlinear system and applies the KF to this approximation. Nevertheless, the approximation is crude and it may lead to filter divergence if the function exhibits highly non-linear characteristics. Furthermore, the derivation of the Jacobian matrices is cumbersome and often results in the implementation difficulties. In UKF, the probability distribution of the state is approximated by a set of deterministically selected sigma points and propagated though the non-linear process and measurement equations. UKF does not need to calculate the cumbersome Jacobian matrices and provides derivative-free higher-order approximations to achieve more accurate performances than the EKF. Unfortunately, the EKF and UKF may fail to perform well when the measurements are disturbed by some heavy-tailed non-Gaussian noises, which arises naturally in many real applications of engineering. The main reason for this is that both methods are based on the minimum -norm technique and thus exhibit sensitivity to heavy-tailed noises [14]. To address this problem, some robust methods have been proposed. The Huber’s generalized maximum likelihood methodology is very common [15,16], which is a combined minimum and -norm technique.
Besides Huber’s robust statistics, the optimization criteria in information theoretic learning (ITL) [17,18] provide an alternative effective way, using information theoretic quantities estimated directly from the data as the optimization costs instead of using the usual second-order statistical measures (such as variance and covariance). The ITL costs can capture higher-order statistics of data, and can be used as a robust adaptation cost to achieve excellent performance in a number of applications [19,20,21,22,23,24,25,26,27,28,29,30,31]. Particularly, an optimization criterion based on correntropy, called the maximum correntropy criterion (MCC) [17,18], has recently been successfully applied in robust adaptive filtering in the presence of heavy-tailed non-Gaussian noises [17,28,29,30,31,32].
In this paper, a non-linear Kalman type filter based on MCC has been derived, namely the maximum correntropy unscented Kalman filter (MCUKF). In MCUKF, the UT is applied to obtain a predicted state estimation and covariance matrix, and a non-linear regression model under MCC is used to reformulate the measurement information. Then, the UT is adopted to the measurement equation to obtain the filter state and covariance matrix. Similar to the original UKF, the new filter also has a recursive structure and is suitable for online implementation. It is worth noting that the proposed MCUKF is different from the algorithm in [33].
The organization of the paper is as follows. In Section 2, we provide a short review of the MCC and the traditional UKF. In Section 3, the MCUKF algorithm is derived. Two illustrative examples are then provided in Section 4 to show the excellent performance of the MCUKF. Finally, Section 5 gives the conclusion.
2. Preliminaries
2.1. Maximum Correntropy Criterion
Given two random variables with joint distribution function , the correntropy is a generalized similarity measure between X and Y defined by
where E is the expectation operator, and denotes a shift-invariant kernel following the Mercer condition. In this paper, the kernel function is chosen as the Gaussian kernel given by
where , and is the kernel bandwidth.
In many practical applications, we have only a limited amount of data and the joint distribution is unknown. In these cases, the correntropy can be estimated by the sample mean estimator:
where , with being N samples drawn from .
Taking the Taylor series expansion of the Gaussian kernel yields
It can be seen that the correntropy is a weighted sum of all even order moments of the error variable . In addition, the kernel bandwidth appears as a parameter weighting the second order and higher order moments. It is noted that the correntropy will be dominated by the second order moment when the kernel bandwidth is very large compared to the dynamic range of the data.
Given a sequence of error data , the cost function of MCC is given by
Assume is a parameter vector of an adaptive system to learn, and let and denote the model output and the desired response, respectively. The MCC based learning can be formulated as the following optimization problem:
where denotes the optimal solution, and Ω denotes a feasible set of the parameter.
2.2. Unscented Kalman Filter
The UKF provides a suitable alternative for EKF to deal with nonlinear systems. In this paper, we discuss a discrete-time nonlinear system described as
where denotes the n-dimensional state vector of the system at time step k, denotes an m-dimensional measurement vector. f represents a nonlinear system function, and h is a nonlinear measurement function and they are assumed to be continuously differentiable. The process noise and measurement noise are mutually uncorrelated, with zero mean values and covariance matrices
In general, similar to KF, the UKF includes two steps, namely predict and update:
2.2.1. Predict
First, a set of sigma points are computed by the estimated state and covariance matrix of last time step:
where is the i-th column of the matrix square root of , n denotes the dimension of the state vector, and λ represents a compound scaling factor defined by
where α determines the extent of the spread of the sigma points, which is usually chosen as , ε is a scaling factor and is usually set to .
The transformed set is given through the process model
Then, the predicted state mean and associated covariance matrix are calculated by
where the corresponding weights of the state and covariance matrix are defined as
with β being related to prior knowledge of the distribution of (for Gaussian noises, is optimal).
2.2.2. Update
Similarly, a set of sigma points are computed by the predicted state mean and covariance matrix
The transformed set is given through the measurement model
Then, the predicted measurement mean and covariance matrix are calculated by
In addition, the state-measurement cross-covariance matrix is computed by
Next, the Kalman gain is given as
Finally, the filter state in this case is obtained by
Additionally, the filter covariance is recursively updated as
3. Unscented Kalman Filter under MCC
To improve the robustness of the UKF, we present the idea to combine the MCC cost function with the UKF framework to derive a novel UKF, which may perform much better in non-Gaussian noise environments, since correntropy contains second and higher order moments of the error.
First, we consider a nonlinear model given by Equation (7) and Equation (8), combine expression Equations (13) and (14) and recast the nonlinear regression model as
where denotes by
with
where can be obtained by Cholesky decomposition of . Left multiplying both sides of Equation (24) by , we have the following equation
where , , and the covariance of is the identity matrix.
Then, we define the following cost function based MCC:
where is the i-th element of , is the i-th row of , and is the dimension of .
Under the MCC, the optimal estimate of can be found by maximizing the cost function
where is the i-th element of and
Hence, the optimal solution for can be found from the following equation
which can also be expressed as
when is the i-th element of , and . Then, we define and have
Based on Equation (27) with only one iteration in [32], we could obtain similar results in the UKF framework by using to reformulate the measurement information. Similar to [16], there are two ways to do it. One is to re-weight the residual error covariance depending on the value of , the other is to reconstruct the ’pseudo-observation’. In general, the derived results based on two ways are the same. In this paper, we just describe the algorithm based on the first way for simplicity.
Defining is the modified covariance, and could be given as
For our next discussion, we write into two portions so that
In fact, since the true state is unknown, we set . In this case, we can see easily that
The modified measurement covariance can be obtained
In the above derivation, the MCUKF algorithm can be summarized as follows:
- Choose a proper kernel bandwidth σ; set an initial estimate and corresponding covariance matrix ; and let ;
Remark 1.
As one can see, different from the UKF algorithm, the MCUKF uses a non-linear regression model combined MCC to update the measurement information. Note that the kernel bandwidth σ is a key parameter in MCUKF. In general, a smaller kernel bandwidth makes the algorithm more robust (with respect to outliers). However, when the kernel bandwidth is too small, it may result in the filter divergence or accuracy deterioration. The reason for this can be seen in [34]. On the other hand, when σ becomes larger and larger, the MCUKF will behave more and more like the UKF algorithm. In particular, the following theorem holds.
Theorem 1.
If the kernel bandwidth σ approaches infinity, the MCUKF will reduce to the traditional UKF.
Proof of Theorem 1.
The proof is simple, we give a short explanation. Note that as , we have . In this case, the MCUKF is equal to the traditional UKF. ☐
Thus, it is seen that the kernel bandwidth has significant influences on the behavior of MCUKF. In practical applications, the kernel bandwidth can be set manually or optimized by trial and error methods.
4. Illustrative Examples
This section demonstrates the performances of the proposed algorithm by two illustrative examples. In the paper, we mainly compare the estimation performance based on the following benchmarks:
where K is the entire time steps in every Monte Carlo run and M represents the total Monte Carlo runs.
4.1. Example 1
Now, a univariate nonstationary growth model (UNGM) that is often used as a benchmark example for nonlinear filtering is considered, whose state and measurement equations are given by
The parameters are set to .
First, we consider the case in which all the noises are Gaussian
In this example, the parameters are chosen as . Table 1 illustrates the s of x, defined in Equation (41), in Gaussian noise. Since all the noises are Gaussian, the UKF gives the smallest in all filters. Moreover, it is noted that when the kernel bandwidth is small, the MCUKF may result in a worse estimation; in contrast, when the kernel bandwidth becomes larger, its performance will approach that of the UKF. Actually, it has been proved easily that when , the MCUKF will reduce to the traditional UKF (see Theorem 1). Therefore, one should choose a larger kernel bandwidth in Gaussian noises.
Table 1.
s of x in Gaussian noises.
Second, we change the observation noise into a heavy-tailed non-Gaussian noise, with a mixed-Gaussian distribution
Table 2 shows the s of x in non-Gaussian measurement noise. As one can see, in this case, when kernel bandwidth is too small or too large, the performance of MCUKF will be not good. However, with a proper kernel bandwidth (say ), the MCUKF can outperform the UKF, achieving the smallest . Again, when σ is very large, MCUKF achieves almost the same performance as the UKF.
Table 2.
s of x in non-Gaussian measurement noise.
4.2. Example 2
Finally, we consider a practical example with respect to the relative motion of two spacecrafts, which is illustrated in Figure 1. One of the spacecrafts is called the chief spacecraft, which is moving on the reference orbit, and the other is the deputy spacecraft. They all revolve around the earth and thus the inertial orbital equations of two spacecraft are given as
where and are the position vectors of the chief spacecraft and the deputy spacecraft in ECI coordinate frame, and are the norms of and , respectively, and μ is the gravitational parameter of the earth.
Figure 1.
Illustration of example 2.
The position vector of the deputy spacecraft relative to the chief spacecraft is
Here, we use the Hill coordinate frame, which is a rectangular, Cartesian, dextral rotating frame centered on the chief spacecraft and refer to it as the local-vertical-local-horizontal (LVLH) frame. Then, the motion of the deputy spacecraft relative to the chief spacecraft in the Hill coordinate frame can be described in [35] as
with
where x, y and z are the radial, in-track and cross-track coordinates of the Hill frame, and ω denotes the orbital angular velocity of the chief spacecraft.
The radar is set on the chief spacecraft to obtain the measurements, and the measurement coordinate system is showed in Figure 2. The measurement equations are given as
where ρ is the relative range between the chief spacecraft and deputy spacecraft, θ is the azimuth angle, and is the elevation angle.
Figure 2.
Measurement coordinate system.
The two spacecrafts are on the elliptic low Earth orbits. The initial six orbital elements of the chief spacecraft are showed in Table 3, and the trajectory of which in ECI coordinate frame is propagated by the fixed-step fourth-order Runge–Kutta algorithm. The prediction of filters is performed every 0.1 s and the measurements record from the radar every 1 s. The state vector contains the relative position and velocity components in Hill frame, respectively. The initial true state is
and the initial estimates and covariance matrix of the states are chosen as
Table 3.
Initial orbital elements of chief spacecraft.
In this example, 100 independent Monte Carlo runs have been conducted, and, in each case, an elapsed time of 7200 s is considered, that is . Since correntropy is a local similarity measure, the MCUKF may converge to the optimal solution slowly, especially when the initial values deviate greatly from the true values. Thus, we use the UKF during the first 100 s to make the process settle down and then switch to the MCUKF to continue. The performance of EKF, Huber-EKF (HEKF) [36], UKF and novel robust UKF (NRUKF) [16] are shown for comparison with that of the proposed algorithm.
First, we consider the case in which all the noises are Gaussian, that is,
Figure 3 describes the relative motion of the deputy spacecraft in the Hill coordinate frame, Figure 4 and Figure 5 demonstrate the and , defined in Equations (42) and (43), with different filters in Gaussian noises, and Table 4 summarizes the corresponding and , defined in Equations (44) and (45) (the parameter is set to ). Those results illustrate that the UKF play the best performance in all filters in this case and the UKF type filters have the better performance than the EKF type counterparts. One can also observe that the robust filters do not perform as well as their non-robust counterparts in the Gaussian noises. Moreover, it is worth noting that when the kernel bandwidth is small, the MCUKF may achieve a worse performance; whereas, when the kernel bandwidth becomes larger, it has similar results to the UKF.
Figure 3.
Relative motion of the deputy spacecraft in the Hill frame.
Figure 4.
with different filters in Gaussian noises.
Figure 5.
with different filters in Gaussian noises.
Table 4.
and in Gaussian noises.
Second, we consider the case in which the measurement noises are heavy-tailed, with mixed-Gaussian distributions
Figure 6 and Figure 7 reveal the and with different filters in non-Gaussian noises, and Table 5 lists the corresponding and . As one can observe again, the UKF type filters give a smaller and than the EKF type counterparts. All the robust filters are superior to their non-robust counterparts in impulsive noises. It is noted that when the kernel bandwidth is very large, MCUKF achieves almost the same performance as the UKF. However, with a proper kernel bandwidth, the MCUKF can outperform the UKF significantly. Particularly, when , the MCUKF exhibits the smallest and .
Figure 6.
with different filters in non-Gaussian noises.
Figure 7.
with different filters in non-Gaussian noises.
Table 5.
and in non-Gaussian noises.
Moreover, to compare the computational cost, the computation times of different filters in this example are shown in Table 6. We can see that the computation time of the MCUKF is moderate compared with the UKF, and is superior to that of the HEKF and NRUKF.
Table 6.
Computation time comparison.
5. Conclusions
In this paper, we propose a new unscented Kalman filter (UKF), namely the maximum correntropy unscented Kalman filter (MCUKF), which shows strong robustness against heavy-tailed non-Gaussian noises. The proposed algorithm is recast in the form of a non-linear regression model and makes use of the MCC to obtain the filter estimates. The MCUKF is then applied to the spacecraft relative state estimation, compared with EKF, HEKF, UKF and NRUKF. Simulation results confirm that, with a large kernel bandwidth, the MCUKF will behave like the original UKF. With a proper kernel bandwidth, the new filter can achieve better performance than other filters, especially when the measurement system is disturbed by some impulsive non-Gaussian noises.
Acknowledgments
This work was supported by the 863 Program (No. 2015AA015702) and the National Natural Science Foundation of China (No. 61371087).
Author Contributions
Xi Liu proposed the main idea, wrote the draft manuscript and presented the experiments; Hua Qu and Jihong Zhao polished the language and were in charge of technical checking; Pengcheng Yue and Meng Wang analyzed the data and drew the figures and tables.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Marszalek, M.; Kurz, O.; Drentschew, M.; Schmidt, M.; Schilling, K. Intersatellite Links and Relative Navigation: Pre-conditions for Formation Flights with Pico- and Nanosatellites. In Proceedings of the 18th IFAC World Congress, Milan, Italy, 29 August–3 September 2011; pp. 3027–3032.
- Liu, X.; Sun, Z.; Zhang, J.; Wu, X. Relative Navigation Method of Communication Supporting Spacecraft base-on Particle Filter. J. Comput. Inf. Syst. 2013, 9, 1–8. [Google Scholar]
- Wu, F.; Sui, X.; Zhao, Y.; Zhang, Y. Relative navigation for formation flying spacecrafts using X-ray pulsars. In Proceedings of the IEEE/ION Position Location and Navigation Symposium (PLANS), Myrtle Beach, SC, USA, 23–26 April 2012; pp. 1289–1292.
- Kelsey, J.; Byrne, J.; Cosgrove, M.; Seereeram, S.; Mehra, R. Vision-based relative pose estimation for autonomous rendezvous and docking. In Proceedings of the IEEE Aerospace Conference, Big Sky, MT, USA, 4–11 March 2006; pp. 1–20.
- Kalman, R.E. A new approach to linear filtering and prediction problems. Trans. ASME-J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
- Bryson, A.; Ho, Y. Applied Optimal Control: Optimization, Estimation and Control; CRC Press: Boca Raton, FL, USA, 1975. [Google Scholar]
- Nahi, N.E. Estimation Theory and Applications; Wiley: New York, NY, USA, 1969. [Google Scholar]
- Hablani, H.B.; Tapper, M.L.; Dana-Bashian, D.J. Guidance and Relative Navigation for Autonomous Rendezvous in a Circular Orbit. Trans. ASME-J. Basic Eng. 2002, 25, 553–562. [Google Scholar] [CrossRef]
- Baek, K.; Bang, H. Adaptive sparse grid quadrature filter for spacecraft relative navigation. Acta Astronaut. 2013, 87, 96–106. [Google Scholar] [CrossRef]
- Anderson, B.; Moore, J. Optimal Filtering; Prentice-Hall: Englewood Cliffs, NJ, USA, 1979. [Google Scholar]
- Haykin, S. Kalman Filtering and Neural Networks; John Wiley & Sons: New York, NY, USA, 2001. [Google Scholar]
- Simon, D. Optimal State Estimation: Kalman, H∞ and Nonlinear Approaches; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
- Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control 2000, 45, 477–482. [Google Scholar] [CrossRef]
- Schick, I.; Mitter, S. Robust Recursive Estimation in the Presence of Heavy-Tailed Observation Noise. Ann. Stat. 1994, 22, 1045–1080. [Google Scholar] [CrossRef]
- Huber, P. Robust Statistics; Springer: Berlin, Germany, 2011. [Google Scholar]
- Chang, L.; Hu, B.; Chang, G.; Li, A. Huber-based novel robust unscented Kalman filter. IET Sci. Meas. Technol. 2012, 6, 502–509. [Google Scholar] [CrossRef]
- Principe, J.C. Information Theoretic Learning: Renyi’s Entropy and Kernel Perspectives; Springer: New York, NY, USA, 2010. [Google Scholar]
- Chen, B.; Zhu, Y.; Hu, J.; Principe, J.C. System Parameter Identification: Information Criteria and Algorithms; Newnes: Oxford, UK, 2013. [Google Scholar]
- Liu, W.; Pokharel, P.P.; Principe, J.C. Correntropy: Properties, and applications in non-Gaussian signal processing. IEEE Trans. Signal Process. 2007, 55, 5286–5298. [Google Scholar] [CrossRef]
- Chen, B.; Principe, J.C. Maximum correntropy estimation is a smoothed MAP estimation. IEEE Signal Process. Lett. 2012, 19, 491–494. [Google Scholar] [CrossRef]
- Ma, W.; Qu, H.; Zhao, J. Estimator with Forgetting Factor of Correntropy and Recursive Algorithm for Traffic Network Prediction. In Proceedings of the Chinese Control and Decision Conference (CCDC), Guiyang, China, 25–27 May 2013; pp. 490–494.
- Chen, X.; Yang, J.; Liang, J.; Ye, Q. Recursive robust least squares support vector regression based on maximum correntropy criterion. Neurocomputing 2012, 97, 63–73. [Google Scholar] [CrossRef]
- He, R.; Hu, B.; Yuan, X.; Wang, L. Robust Recognition via Information Theoretic Learning; Springer: Amsterdam, The Netherlands, 2014. [Google Scholar]
- He, R.; Zheng, W.; Hu, B. Maximum correntropy criterion for robust face recognition. IEEE Trans. Pattern Anal. 2011, 33, 1561–1576. [Google Scholar]
- He, R.; Hu, B.; Zheng, W.; Kong, X. Robust principal component analysis based on maximum correntropy criterion. IEEE Trans. Image Process. 2011, 20, 1485–1494. [Google Scholar] [PubMed]
- Chen, L.; Qu, H.; Zhao, J.; Chen, B.; Principe, J.C. Efficient and robust deep learning with Correntropy-induced loss function. Neural Comput. Appl. 2016, 27, 1019–1031. [Google Scholar] [CrossRef]
- Bessa, R.J.; Miranda, V.; Gama, J. Entropy, and correntropy against minimum square error in offline, and online three-day ahead wind power forecasting. IEEE Trans. Power Syst. 2009, 24, 1657–1666. [Google Scholar] [CrossRef]
- Ma, W.; Qu, H.; Gui, G.; Xu, L.; Zhao, J.; Chen, B. Maximum correntropy criterion based sparse adaptive filtering algorithms for robust channel estimation under non-Gaussian environments. J. Frankl. Inst. 2015, 352, 2708–2727. [Google Scholar] [CrossRef]
- Chen, B.; Xing, L.; Liang, J.; Zheng, N.; Principe, J.C. Steady-State Mean-Square Error Analysis for Adaptive Filtering under the Maximum Correntropy Criterion. IEEE Signal Process. Lett. 2014, 21, 880–884. [Google Scholar]
- Shi, L.; Lin, Y. Convex Combination of Adaptive Filters under the Maximum Correntropy Criterion in Impulsive Interference. IEEE Signal Process. Lett. 2014, 21, 1385–1388. [Google Scholar] [CrossRef]
- Chen, B.; Xing, L.; Zhao, H.; Zheng, N.; Principe, J.C. Generalized correntropy for robust adaptive filtering. IEEE Trans. Signal Process. 2016, 64, 3376–3387. [Google Scholar] [CrossRef]
- Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum Correntropy Kalman Filter. Available online: http://arxiv.org/abs/1509.04580 (accessed on 18 September 2016).
- Liu, X.; Chen, B.; Xu, B.; Wu, Z.; Honeine, P. Maximum Correntropy Unscented Filter. Available online: http://arxiv.org/abs/1608.07526 (accessed on 18 September 2016).
- Chen, B.; Wang, J.; Zhao, H.; Zheng, N.; Principe, J.C. Convergence of a fixed-point algorithm under maximum correntropy criterion. IEEE Signal Process. Lett. 2015, 22, 1723–1727. [Google Scholar] [CrossRef]
- Schaub, H.; Junkins, J.L. Analytical Mechanics of Space Systems; American Institute of Aeronautics and Astronautics Education Series: Reston, VA, USA, 2003. [Google Scholar]
- El-Hawary, F.; Jing, Y. Robust Regression-Based EKF for Tracking Underwater Targets. IEEE J. Ocean. Eng. 1995, 20, 31–41. [Google Scholar] [CrossRef]
© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).