Article

A Novel Perspective of the Kalman Filter from the Rényi Entropy

by Yarong Luo, Chi Guo, Shengyong You and Jingnan Liu
1 Global Navigation Satellite System Research Center, Wuhan University, Wuhan 430079, China
2 Artificial Intelligence Institute, Wuhan University, Wuhan 430079, China
* Author to whom correspondence should be addressed.
Entropy 2020, 22(9), 982; https://doi.org/10.3390/e22090982
Submission received: 21 July 2020 / Revised: 31 August 2020 / Accepted: 31 August 2020 / Published: 3 September 2020
(This article belongs to the Special Issue Data Science: Measuring Uncertainties)

Abstract
Rényi entropy, as a generalization of the Shannon entropy, allows for different averaging of probabilities through a control parameter α. This paper gives a new perspective on the Kalman filter from the Rényi entropy. Firstly, the Rényi entropy is employed to measure the uncertainty of the multivariate Gaussian probability density function. Then, we calculate the temporal derivative of the Rényi entropy of the Kalman filter's mean square error matrix, which is minimized to obtain the Kalman filter's gain. Moreover, the continuous Kalman filter approaches a steady state when the temporal derivative of the Rényi entropy is equal to zero, which means that the Rényi entropy remains stable. As the temporal derivative of the Rényi entropy is independent of the parameter α and coincides with the temporal derivative of the Shannon entropy, the same conclusion holds for the Shannon entropy. Finally, an experiment of falling-body tracking by radar using an unscented Kalman filter (UKF) in noisy conditions and a loosely coupled navigation experiment are performed to demonstrate the effectiveness of the conclusion.

1. Introduction

In the late 1940s, Shannon introduced a logarithmic measure of information [1] and a theory that included information entropy (which the literature relates to the Boltzmann entropy of statistical mechanics). The more stochastic and unpredictable a variable is, the larger its entropy. As a measure of information, entropy has been used in various fields, such as information theory, signal processing, and information-theoretic learning [2,3]. As a generalization of the Shannon entropy, the Rényi entropy, named after Alfréd Rényi [4], allows for different averaging of probabilities through a control parameter α and is usually used to quantify the diversity, uncertainty, or randomness of random variables. Liang [5] presented evolutionary entropy equations and uncertainty estimation for the Shannon entropy and the relative entropy, also called the Kullback–Leibler divergence [6], within the framework of dynamical systems. Moreover, in many cases the Rényi entropy has better properties than the Shannon entropy through a suitable choice of the control parameter α.
The Kalman filter [7] and its variants have been widely used in navigation, control, tracking, etc. Many works focus on combining different entropy and entropy-like quantities with the original Kalman filter to improve its performance. When the state-space equation is nonlinear, the Rényi entropy can be used to measure the nonlinearity [8,9]. The Shannon entropy was used to estimate the weight of each particle from the weights of different measurement models in the fusion algorithm of [10]. The quadratic Rényi entropy [11] of the innovation has been used as a minimum entropy criterion under nonlinear and non-Gaussian circumstances [12] in the unscented Kalman filter (UKF) [13] and for finite mixtures [14]. A generalized density evolution equation [15] and polynomial-based nonlinear compensation [16] were used to improve minimum entropy filtering [17]. Relative entropy has been used to measure the similarity between probability density functions during the recursive processes of the nonlinear filter [18,19]. For a nonlinear measurement equation with additive Gaussian noise, relative entropy can be used to measure the nonlinearity of the measurement [20], as well as the approximation error of the i-th measurement element in the partitioned update Kalman filter [21]. When the state variables and the measurement variables do not follow a strictly Gaussian distribution, as in the seamless indoor/outdoor multi-source fusion positioning problem [22], the estimation error can be measured by the relative entropy. Relative entropy can also be used to choose the number of particles in the unscented particle filter for mobile robot self-localization [23] and to choose the sample window size in the cubature Kalman filter (CKF) [24] for attitude estimation [25]. Moreover, it has been verified that the original Kalman filter can be derived by maximizing the relative entropy [26]. Meanwhile, the robust maximum correntropy criterion has been adopted as the optimality criterion to derive the maximum correntropy Kalman filter [27,28]. However, until now there has been no work on a direct connection between the Rényi entropy and Kalman filter theory.
In this paper, we propose for the first time a new perspective on the Kalman filter from the Rényi entropy, which bridges the gap between the Kalman filter and the Rényi entropy. We calculate the temporal derivative of the Rényi entropy of the Kalman filter's mean square error matrix, which provides the optimal recursive solution mathematically and is minimized to obtain the Kalman filter gain. Moreover, from the physical point of view, the continuous Kalman filter approaches a steady state when the temporal derivative of the Rényi entropy is equal to zero, which also means that the Rényi entropy remains stable. A numerical experiment of falling-body tracking in noisy conditions with radar using the UKF and a practical experiment of loosely coupled integration are provided to demonstrate the effectiveness of this conclusion.
The structure of this paper is as follows. In Section 2, the definitions and properties of the Shannon entropy and the Rényi entropy are presented, the Kalman filter is derived from the perspective of minimizing the temporal derivative of the Rényi entropy, and the connection between the Rényi entropy and the algebraic Riccati equation is explained. In Section 3, experimental results and analysis are given for the UKF simulation and the real integrated navigation data. We conclude the paper and provide an outlook on future work in Section 4.

2. The Connection between the Kalman Filter and the Temporal Derivative of the Rényi Entropy

2.1. Rényi Entropy

To calculate the Rényi entropy of a continuous probability density function (PDF), it is necessary to extend the definition of the Rényi entropy to the continuous form. The Rényi entropy of order α for a continuous random variable with a multivariate Gaussian PDF $p(x)$ is defined [4] and calculated [9] as:
$$H_R^{(\alpha)}(x) = \frac{1}{1-\alpha}\ln\int_S p^{\alpha}(x)\,dx = \frac{N}{2}\ln\!\left(2\pi\,\alpha^{\frac{1}{\alpha-1}}\right) + \frac{1}{2}\ln(\det\Sigma), \tag{1}$$
where $\alpha > 0$, $\alpha \neq 1$, and $\alpha$ is a parameter providing a family of entropy functions; $N$ is the dimension of the random variable $x$; $S$ is the support; and $\Sigma$ is the covariance matrix of $p(x)$.
It is straightforward to show that the temporal derivative of the Rényi entropy is given by [9]:
$$\dot{H}_R^{(\alpha)}(x) = \frac{1}{2}\,\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\}, \tag{2}$$
where $\dot{\Sigma}$ is the temporal derivative of the covariance matrix and $\mathrm{Tr}(\cdot)$ is the trace operator.
The Shannon entropy for the multivariate Gaussian PDF follows by taking the limit of Equation (1) as $\alpha$ approaches 1, giving $H(x) = \frac{N}{2}\ln(2\pi e) + \frac{1}{2}\ln(\det\Sigma)$, and its temporal derivative is $\dot{H}(x) = \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\}$. Evidently, the temporal derivative of the Shannon entropy is the same as that of the Rényi entropy, so the conclusion below could equally be derived from the temporal derivative of the Shannon entropy. However, in most cases the Rényi entropy itself, rather than its temporal derivative, is used for different uncertainty measurements by adjusting the free parameter $\alpha$; since the filtering problem has to account for nonlinearity and non-Gaussian noise, we adopt the Rényi entropy as the measure of uncertainty.
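As a quick numerical illustration (not part of the original derivation), the following Python sketch evaluates Equation (1) for a Gaussian covariance and checks that the finite-difference rate of change of the Rényi entropy matches Equation (2) and is independent of α; the covariance trajectory Σ(t) is an arbitrary choice made only for this demonstration.

```python
import numpy as np

def renyi_entropy(Sigma, alpha):
    """Renyi entropy of an N-dimensional Gaussian, Equation (1)."""
    N = Sigma.shape[0]
    _, logdet = np.linalg.slogdet(Sigma)
    return 0.5 * N * np.log(2.0 * np.pi * alpha ** (1.0 / (alpha - 1.0))) + 0.5 * logdet

# Arbitrary smooth covariance trajectory for illustration: Sigma(t) = A + t*I,
# so that Sigma_dot = I exactly.
A = np.array([[1.0, 0.3], [0.3, 2.0]])
Sigma = lambda t: A + t * np.eye(2)
t, dt = 1.0, 1e-6

dH_formula = 0.5 * np.trace(np.linalg.inv(Sigma(t)) @ np.eye(2))  # Equation (2)
for alpha in (0.5, 2.0, 5.0):
    dH_numeric = (renyi_entropy(Sigma(t + dt), alpha) - renyi_entropy(Sigma(t), alpha)) / dt
    print(alpha, dH_numeric, dH_formula)  # the rate is the same for every alpha
```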

2.2. Kalman Filter

Given the continuous-time linear system [29]:
$$\dot{X}(t) = F(t)X(t) + G(t)w(t) \tag{3}$$
$$Z(t) = H(t)X(t) + v(t), \tag{4}$$
where $X(t)$ is the state vector; $F(t)$ is the state transition matrix; $G(t)$ is the system noise driving matrix; $Z(t)$ is the measurement vector; $H(t)$ is the measurement matrix; and $w(t)$ and $v(t)$ are independent zero-mean white Gaussian noise processes whose covariance matrices are $Q(t)$ and $R(t)$, respectively:
$$E[w(t)] = 0,\quad E[w(t)w^T(\tau)] = Q(t)\,\delta(t-\tau) \tag{5}$$
$$E[v(t)] = 0,\quad E[v(t)v^T(\tau)] = R(t)\,\delta(t-\tau) \tag{6}$$
$$E[w(t)v^T(\tau)] = 0, \tag{7}$$
where $\delta(t)$ is the Dirac impulse function, $Q(t)$ is a symmetric non-negative definite matrix, and $R(t)$ is a symmetric positive definite matrix.
The continuous Kalman filter can be deduced by taking the limit of the discrete Kalman filter. The discrete-time state-space model is arranged as follows [29]:
$$X_k = \Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1} \tag{8}$$
$$Z_k = H_kX_k + V_k, \tag{9}$$
where $X_k$ is the n-dimensional state vector; $Z_k$ is the m-dimensional measurement vector; $\Phi_{k|k-1}$, $\Gamma_{k|k-1}$, and $H_k$ are the known system structure parameters, called the $n\times n$ one-step state update matrix, the $n\times l$ system noise distribution matrix, and the $m\times n$ measurement matrix, respectively; $W_{k-1}$ is the l-dimensional system noise vector, and $V_k$ is the m-dimensional measurement noise vector. Both are zero-mean Gaussian noise sequences and are independent of each other:
$$E[W_k] = 0,\quad E[W_kW_j^T] = Q_k\,\delta_{kj} \tag{10}$$
$$E[V_k] = 0,\quad E[V_kV_j^T] = R_k\,\delta_{kj} \tag{11}$$
$$E[W_kV_j^T] = 0. \tag{12}$$
The above equations are the basic noise assumptions of the Kalman filter state-space model, where $Q_k$ is a symmetric non-negative definite matrix, $R_k$ is a symmetric positive definite matrix, and $\delta_{kj}$ is the Kronecker delta.
The covariance parameters Q k and R k play roles similar to those of Q and R in the continuous filter, but they do not have the same numerical values. Next, the relationship between the corresponding continuous and discrete filter parameters will be derived.
To pass from the continuous form to the discrete form, the relations between $Q$ and $R$ and the corresponding $Q_k$ and $R_k$ for a small step size $T_s$ are needed. According to linear system theory, the relation between Equation (3) and Equation (8) is as follows:
$$\Phi_{k|k-1} = \Phi(t_k, t_{k-1}) = e^{\int_{t_{k-1}}^{t_k}F(\tau)\,d\tau} \tag{13}$$
$$\Gamma_{k|k-1}W_{k-1} = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)\,G(\tau)\,w(\tau)\,d\tau. \tag{14}$$
Denote the discrete-time interval as $T_s = t_k - t_{k-1}$, and assume that $F(t)$ does not change too dramatically within the short integration interval $[t_{k-1}, t_k]$. Taking the Taylor expansion of $e^{F(t_{k-1})T_s}$ with respect to $F(t_{k-1})T_s$ and assuming $F(t_{k-1})T_s \ll I$, so that the higher-order terms are negligible, the one-step transition matrix, Equation (13), can be approximated as:
$$\Phi_{k|k-1} \approx e^{F(t_{k-1})T_s} = I + F(t_{k-1})T_s + \frac{F^2(t_{k-1})T_s^2}{2!} + \frac{F^3(t_{k-1})T_s^3}{3!} + \cdots \approx I + F(t_{k-1})T_s. \tag{15}$$
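The quality of the first-order truncation in Equation (15) is easy to check numerically. In the sketch below, $F$ and $T_s$ are arbitrary illustrative values; the truncation error is of order $T_s^2$.

```python
import numpy as np
from scipy.linalg import expm

F = np.array([[0.0, 1.0], [-0.5, -0.2]])   # arbitrary system matrix
Ts = 0.01                                   # small step so that ||F*Ts|| << 1

Phi_exact = expm(F * Ts)                    # exact matrix exponential e^{F Ts}
Phi_approx = np.eye(2) + F * Ts             # first-order truncation, Equation (15)
print(np.max(np.abs(Phi_exact - Phi_approx)))   # discrepancy is O(Ts^2)
```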
Equation (14) shows that $\Gamma_{k|k-1}W_{k-1}$ is a linear transform of the Gaussian white noise $w(\tau)$, so it remains a normally distributed random vector. Therefore, its first- and second-order statistical characteristics fully describe it. Referring to Equation (5), the mean of $\Gamma_{k|k-1}W_{k-1}$ is given as follows:
$$E[\Gamma_{k|k-1}W_{k-1}] = E\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\,d\tau\right] = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)\,E[w(\tau)]\,d\tau = 0. \tag{16}$$
For the second-order statistics, when $k \neq j$, the noise over the intervals $[t_{k-1},t_k]$ and $[t_{j-1},t_j]$ is independent, so $\Gamma_{k|k-1}W_{k-1}$ and $\Gamma_{j|j-1}W_{j-1}$ are uncorrelated:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{j|j-1}W_{j-1})^T] = 0 \quad (k \neq j). \tag{17}$$
When $k = j$, we have:
$$\begin{aligned}
E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T]
&= E\left\{\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\,d\tau\right]\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,s)G(s)w(s)\,ds\right]^T\right\} \\
&= E\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\int_{t_{k-1}}^{t_k}w^T(s)G^T(s)\Phi^T(t_k,s)\,ds\,d\tau\right] \\
&= \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)\int_{t_{k-1}}^{t_k}E[w(\tau)w^T(s)]\,G^T(s)\Phi^T(t_k,s)\,ds\,d\tau.
\end{aligned} \tag{18}$$
Substituting Equation (5) into the above equation:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)\int_{t_{k-1}}^{t_k}Q(\tau)\,\delta(\tau-s)\,G^T(s)\Phi^T(t_k,s)\,ds\,d\tau = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)Q(\tau)G^T(\tau)\Phi^T(t_k,\tau)\,d\tau. \tag{19}$$
When the noise control matrix $G(\tau)$ changes slowly during the time interval $[t_{k-1},t_k]$, Equation (19) becomes:
$$\begin{aligned}
E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T]
&\approx \int_{t_{k-1}}^{t_k}\left[I + F(t_{k-1})(t_k-\tau)\right]G(t_{k-1})Q(\tau)G^T(t_{k-1})\left[I + F(t_{k-1})(t_k-\tau)\right]^T d\tau \\
&= \left[I + \tfrac{1}{2}F(t_{k-1})T_s\right]\left[G(t_{k-1})Q(t_{k-1})G^T(t_{k-1})T_s\right]\left[I + \tfrac{1}{2}F(t_{k-1})T_s\right]^T \\
&\quad + \tfrac{1}{12}F(t_{k-1})G(t_{k-1})Q(t_{k-1})G^T(t_{k-1})F^T(t_{k-1})\,T_s^3 \\
&\approx \left\{\left[I + F(t_{k-1})T_s\right]G(t_{k-1})\right\}\left[Q(t_{k-1})T_s\right]\left\{\left[I + F(t_{k-1})T_s\right]G(t_{k-1})\right\}^T.
\end{aligned} \tag{20}$$
When $F(t_{k-1})T_s \ll I$ is satisfied, the above equation can be further approximated as:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] \approx G(t_{k-1})\,[Q(t_{k-1})T_s]\,G^T(t_{k-1}). \tag{21}$$
Comparing the result with Equation (10):
$$\Gamma_{k|k-1} \approx [I + F(t_{k-1})T_s]\,G(t_{k-1}) \approx G(t_{k-1}) \tag{22}$$
$$E[W_kW_j^T] = Q_k\,\delta_{kj} = [Q(t_k)T_s]\,\delta_{kj}. \tag{23}$$
Notice that [29]:
$$Q_k = Q(t_k)\,T_s. \tag{24}$$
The derivation of the relation between $R_k$ and $R$ is more subtle. In the continuous model, $v(t)$ is white, so simply sampling $Z(t)$ would lead to measurement noise with infinite variance. Hence, in the sampling process, we imagine averaging the continuous measurement over the interval $T_s$ to obtain an equivalent discrete sample. This is justified because $x$ is not Gaussian white noise and is approximately constant within the interval:
$$Z_k = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}Z(t)\,dt = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}[H(t)x(t) + v(t)]\,dt \approx H(t_k)x_k + \frac{1}{T_s}\int_{t_{k-1}}^{t_k}v(t)\,dt. \tag{25}$$
Then, the discrete measurement noise equivalent to the averaged continuous noise is:
$$V_k = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}v(t)\,dt. \tag{26}$$
From Equation (12), we have:
$$E[V_kV_j^T] = R_k\,\delta_{kj} = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}\int_{t_{j-1}}^{t_j}E[v(\tau)v^T(s)]\,d\tau\,ds = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}\int_{t_{j-1}}^{t_j}R(\tau)\,\delta(s-\tau)\,d\tau\,ds = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}R(\tau)\,\delta_{kj}\,d\tau \approx \frac{R(t_k)}{T_s}\,\delta_{kj}. \tag{27}$$
Comparing it with Equation (6), we have [29]:
$$R_k = \frac{R(t_k)}{T_s}. \tag{28}$$
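The continuous-to-discrete conversion of Equations (21), (24), and (28) is a one-liner each; the noise intensities below are hypothetical values chosen only for illustration.

```python
import numpy as np

Ts = 0.01                                  # sampling interval (s)
G = np.array([[0.0], [1.0]])               # noise driving matrix
Q = np.array([[0.2]])                      # continuous process noise intensity
R = np.array([[0.5]])                      # continuous measurement noise intensity

Qk = Q * Ts                                # Equation (24): Q_k = Q(t_k) * Ts
Rk = R / Ts                                # Equation (28): R_k = R(t_k) / Ts
Qd = G @ Qk @ G.T                          # discrete process noise term, Equation (21)
print(Qd, Rk)
```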

2.3. Derivation of the Kalman Filter

Assume that the optimal state estimate at $t_{k-1}$ is $\hat{X}_{k-1}$, the state estimation error is $\tilde{X}_{k-1}$, and the state estimation covariance matrix is $\Sigma_{k-1}$:
$$\tilde{X}_{k-1} = X_{k-1} - \hat{X}_{k-1} \tag{29}$$
and
$$\Sigma_{k-1} = E[\tilde{X}_{k-1}\tilde{X}_{k-1}^T] = E[(X_{k-1}-\hat{X}_{k-1})(X_{k-1}-\hat{X}_{k-1})^T]. \tag{30}$$
Taking the expectation of both sides of Equation (8) gives the state one-step prediction; the state one-step prediction error is then defined accordingly:
$$X_{k|k-1} = E[X_k] = E[\Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1}] = \Phi_{k|k-1}E[X_{k-1}] = \Phi_{k|k-1}\hat{X}_{k-1}, \tag{31}$$
$$\tilde{X}_{k|k-1} = X_k - X_{k|k-1}. \tag{32}$$
Substituting Equations (8) and (31) into Equation (32) leads to:
$$\tilde{X}_{k|k-1} = (\Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1}) - \Phi_{k|k-1}\hat{X}_{k-1} = \Phi_{k|k-1}(X_{k-1}-\hat{X}_{k-1}) + \Gamma_{k|k-1}W_{k-1} = \Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1}. \tag{33}$$
Since $\tilde{X}_{k-1}$ is uncorrelated with $W_{k-1}$, the covariance of the state one-step prediction error $\tilde{X}_{k|k-1}$ follows as:
$$\begin{aligned}
\Sigma_{k|k-1} &= E[\tilde{X}_{k|k-1}\tilde{X}_{k|k-1}^T] = E[(\Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1})(\Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1})^T] \\
&= \Phi_{k|k-1}E[\tilde{X}_{k-1}\tilde{X}_{k-1}^T]\Phi_{k|k-1}^T + \Gamma_{k|k-1}E[W_{k-1}W_{k-1}^T]\Gamma_{k|k-1}^T \\
&= \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T.
\end{aligned} \tag{34}$$
In a similar way, the measurement at $t_k$ can be predicted from the state one-step prediction $X_{k|k-1}$ and the measurement equation, Equation (9), as follows:
$$Z_{k|k-1} = E[H_kX_{k|k-1} + V_k] = H_kX_{k|k-1}. \tag{35}$$
In fact, there is a difference between the measurement one-step prediction $Z_{k|k-1}$ and the actual measurement $Z_k$. This difference is denoted as the measurement one-step prediction error:
$$\tilde{Z}_{k|k-1} = Z_k - Z_{k|k-1}. \tag{36}$$
Substituting the measurement Equations (9) and (35) into Equation (36) yields:
$$\tilde{Z}_{k|k-1} = Z_k - H_kX_{k|k-1} = H_kX_k + V_k - H_kX_{k|k-1} = H_k\tilde{X}_{k|k-1} + V_k. \tag{37}$$
In general, the measurement one-step prediction error $\tilde{Z}_{k|k-1}$ is called the innovation in classical Kalman filter theory, as it carries the new information about the state brought by the latest measurement.
On the one hand, if the estimate of $X_k$ only used the state one-step prediction $X_{k|k-1}$ from the system state equation, the estimation accuracy would be low, as no information from the measurement equation would be used. On the other hand, according to Equation (37), the measurement one-step prediction error computed from the measurement equation contains information about the state one-step prediction $X_{k|k-1}$. Consequently, it is natural to combine the state information coming from both the state equation and the measurement equation, and to correct the state one-step prediction $X_{k|k-1}$ with the measurement one-step prediction error $\tilde{Z}_{k|k-1}$. Thereby, the optimal estimate of $X_k$ can be calculated as a combination of $X_{k|k-1}$ and $\tilde{Z}_{k|k-1}$ as follows:
$$\hat{X}_k = X_{k|k-1} + K_k\tilde{Z}_{k|k-1}, \tag{38}$$
where $K_k$ is the undetermined correction factor matrix.
Substituting Equations (31) and (37) into Equation (38) yields:
$$\hat{X}_k = X_{k|k-1} + K_k(Z_k - H_kX_{k|k-1}) = (I - K_kH_k)X_{k|k-1} + K_kZ_k = (I - K_kH_k)\Phi_{k|k-1}\hat{X}_{k-1} + K_kZ_k. \tag{39}$$
From Equation (39), the current state estimate $\hat{X}_k$ is a linear combination of the previous state estimate $\hat{X}_{k-1}$ and the current measurement $Z_k$, which accounts for the influence of the structural parameter $\Phi_{k|k-1}$ of the state equation and the structural parameter $H_k$ of the measurement equation.
The state estimation error at the current time t k is denoted as:
$$\tilde{X}_k = X_k - \hat{X}_k, \tag{40}$$
where $X_k$ is the true state and $\hat{X}_k$ is the posterior estimate of $X_k$.
Substituting Equation (39) into Equation (40) yields:
$$\tilde{X}_k = X_k - [X_{k|k-1} + K_k(Z_k - H_kX_{k|k-1})] = (X_k - X_{k|k-1}) - K_k(H_kX_k + V_k - H_kX_{k|k-1}) = \tilde{X}_{k|k-1} - K_k(H_k\tilde{X}_{k|k-1} + V_k) = (I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k. \tag{41}$$
Then, the mean square error matrix of the state estimate $\hat{X}_k$ is given by:
$$\begin{aligned}
\Sigma_k &= E[\tilde{X}_k\tilde{X}_k^T] = E\{[(I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k][(I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k]^T\} \\
&= (I - K_kH_k)E[\tilde{X}_{k|k-1}\tilde{X}_{k|k-1}^T](I - K_kH_k)^T + K_kE[V_kV_k^T]K_k^T \\
&= (I - K_kH_k)\Sigma_{k|k-1}(I - K_kH_k)^T + K_kR_kK_k^T.
\end{aligned} \tag{42}$$
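The prediction and update steps, Equations (34) and (42), translate directly into code. The sketch below uses arbitrary illustrative matrices; the Joseph form of Equation (42) is valid for any gain $K_k$, and the familiar innovation-form gain is used here only to produce a sensible numerical example.

```python
import numpy as np

def predict_cov(Sigma, Phi, Gamma, Q):
    """State one-step prediction covariance, Equation (34)."""
    return Phi @ Sigma @ Phi.T + Gamma @ Q @ Gamma.T

def update_cov(Sigma_pred, K, H, R):
    """Joseph-form posterior covariance, Equation (42); valid for any gain K."""
    IKH = np.eye(Sigma_pred.shape[0]) - K @ H
    return IKH @ Sigma_pred @ IKH.T + K @ R @ K.T

# Arbitrary illustrative matrices.
Phi = np.array([[1.0, 0.1], [0.0, 1.0]])
Gamma = np.eye(2)
Q = 0.01 * np.eye(2)
H = np.array([[1.0, 0.0]])
R = np.array([[0.5]])
Sigma = np.eye(2)

Sigma_pred = predict_cov(Sigma, Phi, Gamma, Q)
K = Sigma_pred @ H.T @ np.linalg.inv(H @ Sigma_pred @ H.T + R)  # innovation-form gain
print(update_cov(Sigma_pred, K, H, R))
```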
Substituting Equation (34) into Equation (42) yields:
$$\begin{aligned}
\Sigma_k &= (I - K_kH_k)\left[\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T\right](I - K_kH_k)^T + K_kR_kK_k^T \\
&= \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + K_kH_k\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^TH_k^TK_k^T - \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^TH_k^TK_k^T - K_kH_k\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T \\
&\quad + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T - K_kH_k\Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T - \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^TH_k^TK_k^T \\
&\quad + K_kH_k\Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^TH_k^TK_k^T + K_kR_kK_k^T.
\end{aligned} \tag{43}$$
We now use the approximation $\Phi_{k|k-1} \approx I + F(t_{k-1})T_s$ from Equation (15) and, from Equation (22), $\Gamma_{k|k-1} \approx G(t_{k-1})$; we then have:
$$\begin{aligned}
\Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T \\
&\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T - K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\
&\quad + G(t_{k-1})Q_{k-1}G^T(t_{k-1}) - K_kH_kG(t_{k-1})Q_{k-1}G^T(t_{k-1}) - G(t_{k-1})Q_{k-1}G^T(t_{k-1})H_k^TK_k^T \\
&\quad + K_kH_kG(t_{k-1})Q_{k-1}G^T(t_{k-1})H_k^TK_k^T + K_kR_kK_k^T.
\end{aligned} \tag{44}$$
Note from Equation (24) that $Q_k = Q(t_k)T_s$ is of the order of $T_s$ and from Equation (28) that $R_k = R(t_k)/T_s$; then, Equation (44) becomes:
$$\begin{aligned}
\Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T \\
&\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T - K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\
&\quad + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - K_kH_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - G(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK_k^T \\
&\quad + K_kH_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK_k^T + K_k\frac{R(t_k)}{T_s}K_k^T.
\end{aligned} \tag{45}$$

2.4. The Temporal Derivative of the Rényi Entropy and the Kalman Filter Gain

To obtain the continuous form of the covariance matrix $\Sigma$, the limit will be taken. However, the relation between the undetermined correction factor matrix $K_k$ and its continuous form remains unknown. Therefore, we make the following assumption.
Assumption 1.
$K_k$ is of the order of $T_s$; that is:
$$K(t_k) = \frac{K_k}{T_s}. \tag{46}$$
Conversely, this assumption can also be derived from the conclusion. We now state the conclusion as a theorem under this assumption:
Theorem 1.
The discrete form of the undetermined correction factor matrix has the same form as the continuous one when the temporal derivative of the Rényi entropy is minimized. This can be presented mathematically as follows:
$$\left\{\, K_k = \Sigma_kH_k^TR_k^{-1},\ K = \Sigma H^TR^{-1} \ \middle|\ K^* = \operatorname*{arg\,min}_K \dot{H}_R^{(\alpha)}(K) \,\right\}. \tag{47}$$
Proof of Theorem 1.
Substituting $K_k = K(t_k)\,T_s$ from Equation (46) into Equation (45) and neglecting the higher-order terms in $T_s$, Equation (45) becomes:
$$\begin{aligned}
\Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + T_sK(t_k)H_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK^T(t_k)T_s \\
&\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK^T(t_k)T_s - T_sK(t_k)H_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\
&\quad + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - T_sK(t_k)H_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - G(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK^T(t_k)T_s \\
&\quad + T_sK(t_k)H_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK^T(t_k)T_s + T_sK(t_k)\frac{R(t_k)}{T_s}K^T(t_k)T_s \\
&= \Sigma_{k-1} + T_sF(t_{k-1})\Sigma_{k-1} + T_s\Sigma_{k-1}F^T(t_{k-1}) - \Sigma_{k-1}H_k^TK^T(t_k)T_s - T_sK(t_k)H_k\Sigma_{k-1} \\
&\quad + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) + T_sK(t_k)R(t_k)K^T(t_k).
\end{aligned} \tag{48}$$
Moving $\Sigma_{k-1}$ from the right-hand side of Equation (48) to the left and dividing both sides by $T_s$ yields the finite-difference expression:
$$\frac{\Sigma_k - \Sigma_{k-1}}{T_s} = F(t_{k-1})\Sigma_{k-1} + \Sigma_{k-1}F^T(t_{k-1}) - \Sigma_{k-1}H_k^TK^T(t_k) - K(t_k)H_k\Sigma_{k-1} + G(t_{k-1})Q(t_k)G^T(t_{k-1}) + K(t_k)R(t_k)K^T(t_k). \tag{49}$$
Finally, passing to the limit as $T_s \to 0$ and dropping the subscripts leads to the matrix differential equation:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TK^T - KH\Sigma + GQG^T + KRK^T. \tag{50}$$
$\Sigma$ is invertible, as it is a positive definite matrix. Multiplying Equation (50) by $\Sigma^{-1}$ and applying Equation (2), the temporal derivative of the Rényi entropy of the mean square error matrix $\Sigma$ is:
$$\begin{aligned}
\dot{H}_R^{(\alpha)} &= \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\} = \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}F\Sigma + F^T - H^TK^T - \Sigma^{-1}KH\Sigma + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\} \\
&= \frac{1}{2}\mathrm{Tr}\{F + F^T - H^TK^T - KH + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\} \\
&= \frac{1}{2}\mathrm{Tr}\{2F - 2KH + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\},
\end{aligned} \tag{51}$$
where the invariance of the trace operator under cyclic permutations has been used to eliminate $\Sigma^{-1}$ and $\Sigma$, together with the fact that $\mathrm{Tr}(F) = \mathrm{Tr}(F^T)$.
Equation (51) is clearly a quadratic function of the undetermined correction factor matrix $K$, so $\dot{H}_R^{(\alpha)}(x)$ has a minimum. Taking the derivative of both sides of Equation (51) with respect to the matrix $K$ yields:
$$\frac{\partial\dot{H}_R^{(\alpha)}}{\partial K} = \frac{1}{2}\left[-2\,\frac{\partial\,\mathrm{Tr}(KH)}{\partial K} + \frac{\partial\,\mathrm{Tr}(\Sigma^{-1}KRK^T)}{\partial K}\right] = \frac{1}{2}\left[-2H^T + \Sigma^{-1}KR + (RK^T\Sigma^{-1})^T\right]. \tag{52}$$
In addition, since $\Sigma^{-1}$ and $R$ are symmetric matrices, the result is:
$$\frac{\partial\dot{H}_R^{(\alpha)}}{\partial K} = \frac{1}{2}\left(-2H^T + 2\Sigma^{-1}KR\right) = -H^T + \Sigma^{-1}KR. \tag{53}$$
$R$ is invertible, as it is a positive definite matrix. According to the extreme value principle, setting the above derivative equal to zero gives:
$$K = \Sigma H^TR^{-1}. \tag{54}$$
So far, we have found the analytic solution for the undetermined correction factor matrix $K$, which is exactly the continuous-time Kalman filter gain of the classical Kalman filter. The recursive formulation of the Kalman filter can then be established through the gain $K$. Most importantly, this establishes the connection between the temporal derivative of the Rényi entropy and the classical Kalman filter: the temporal derivative of the Rényi entropy is minimized when the Kalman filter gain satisfies Equation (54).
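To make the minimization concrete, the sketch below (with arbitrary illustrative matrices) evaluates the entropy rate of Equation (51), via Equations (50) and (2), at the gain of Equation (54) and at randomly perturbed gains; every perturbation increases the entropy rate, as the quadratic structure of Equation (51) predicts.

```python
import numpy as np

def entropy_rate(Sigma, K, F, G, Q, H, R):
    """Temporal derivative of the Renyi entropy, Equations (50) and (2)."""
    Sdot = (F @ Sigma + Sigma @ F.T - Sigma @ H.T @ K.T - K @ H @ Sigma
            + G @ Q @ G.T + K @ R @ K.T)
    return 0.5 * np.trace(np.linalg.solve(Sigma, Sdot))

rng = np.random.default_rng(0)
# Arbitrary illustrative system matrices.
F = np.array([[0.0, 1.0], [-1.0, -0.4]])
G = np.eye(2)
Q = 0.05 * np.eye(2)
H = np.array([[1.0, 0.0]])
R = np.array([[0.1]])
Sigma = np.array([[0.5, 0.1], [0.1, 0.3]])

K_star = Sigma @ H.T @ np.linalg.inv(R)    # Equation (54)
h_star = entropy_rate(Sigma, K_star, F, G, Q, H, R)
h_best_perturbed = min(
    entropy_rate(Sigma, K_star + 0.1 * rng.standard_normal(K_star.shape), F, G, Q, H, R)
    for _ in range(100))
print(h_star, h_best_perturbed)   # even the best perturbed gain has a larger rate
```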
Looking back to Assumption 1 and substituting Equation (28) into Equation (54), we obtain:
$$K(t_k) = \frac{K_k}{T_s} = K = \Sigma H^TR^{-1} = \Sigma_kH_k^T(R_kT_s)^{-1} = \frac{\Sigma_kH_k^TR_k^{-1}}{T_s}. \tag{55}$$
Therefore, the discrete-time Kalman filter gain can be expressed as follows:
$$K_k = \Sigma_kH_k^TR_k^{-1}. \tag{56}$$
Remark 1.
The discrete-time Kalman filter gain has the same form as the continuous-time filter gain shown in Equation (54). In principle, this is consistent with our intuition and, in turn, confirms the correctness and rationality of Assumption 1.
Remark 2.
Minimizing the temporal derivative of the Rényi entropy yields the same Kalman filter gain as the original Kalman filter, which is deduced under the minimum mean square error criterion.
Substituting Equation (54) into Equation (50), we have:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TK^T - \Sigma H^TR^{-1}H\Sigma + GQG^T + \Sigma H^TR^{-1}RK^T = F\Sigma + \Sigma F^T - \Sigma H^TR^{-1}H\Sigma + GQG^T. \tag{57}$$
This is a nonlinear matrix differential equation, quadratic in the mean square error matrix $\Sigma$, and it is commonly called the Riccati equation. This is the same result as that of the Kalman–Bucy filter [7].
If the system equation, Equation (3), and the measurement equation, Equation (4), form a linear time-invariant system with constant noise covariance, the mean square error matrix Σ may reach a steady-state value, and Σ ˙ may eventually reach zero. So, we have the continuous algebraic Riccati equation as follows:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TR^{-1}H\Sigma + GQG^T = 0. \tag{58}$$
As we can see, the time derivative of the covariance at the steady state is zero; then, the temporal derivative of the Rényi entropy is also zero:
$$\dot{H}_R^{(\alpha)} = 0. \tag{59}$$
This implies that when the system approaches a stable state, the Rényi entropy approaches a steady value, so its temporal derivative is zero. This is reasonable: a steady system has constant Rényi entropy, as its uncertainty is stable, which follows our intuitive understanding. Consequently, whether the Rényi entropy stays stable can serve as a valid indicator of whether the system is approaching the steady state.
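As a numerical illustration of Equations (58) and (59) on an arbitrary time-invariant system, SciPy's continuous algebraic Riccati solver returns the steady-state $\Sigma$, at which both $\dot{\Sigma}$ and the entropy rate vanish; with $A = F^T$ and $B = H^T$, scipy.linalg.solve_continuous_are solves exactly the filter-form Equation (58).

```python
import numpy as np
from scipy.linalg import solve_continuous_are

# Arbitrary illustrative time-invariant system.
F = np.array([[0.0, 1.0], [-1.0, -0.4]])
G = np.eye(2)
Q = 0.05 * np.eye(2)
H = np.array([[1.0, 0.0]])
R = np.array([[0.1]])

# solve_continuous_are(a, b, q, r) solves a^T X + X a - X b r^{-1} b^T X + q = 0;
# with a = F^T and b = H^T this is the filter Riccati equation, Equation (58).
Sigma_ss = solve_continuous_are(F.T, H.T, G @ Q @ G.T, R)

Sdot = (F @ Sigma_ss + Sigma_ss @ F.T
        - Sigma_ss @ H.T @ np.linalg.inv(R) @ H @ Sigma_ss + G @ Q @ G.T)
print(np.max(np.abs(Sdot)))                             # ~0: steady state reached
print(0.5 * np.trace(np.linalg.solve(Sigma_ss, Sdot)))  # entropy rate ~0, Equation (59)
```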

3. Simulations and Analysis

In this section, we present two experiments to show that when the nonlinear filter system approaches the steady state, the Rényi entropy of the system approaches stability. The first experiment is a numerical example of a falling body in noisy conditions, tracked by radar [30] using the UKF. The second experiment is a practical experiment of loosely coupled integration [29]. The simulations were carried out in MATLAB 2018a on a computer with an i5-5200U 2.20 GHz CPU, and the graphs were plotted with MATLAB.

3.1. Falling Body Tracking

In the example of a falling body tracked by radar, the body falls vertically. The radar is placed at a horizontal distance L from the body and measures the distance y from the radar to the body. The state-space equation of the body is given by:
$$\begin{aligned}
\dot{x}_1 &= x_2 \\
\dot{x}_2 &= d + g \\
\dot{x}_3 &= 0,
\end{aligned} \tag{60}$$
where $x_1$ is the height, $x_2$ is the velocity, $x_3$ is the ballistic coefficient, $g = 9.81\ \mathrm{m/s^2}$ is the gravity acceleration, and $d$ is the air drag, which can be approximated as:
$$d = \frac{\rho\,x_2^2}{2x_3} = \frac{\rho_0\exp(-x_1/k)\,x_2^2}{2x_3}, \tag{61}$$
where $\rho$ is the air density; $\rho_0 = 1.225$ and $k = 6705.6$ are constants.
The measurement equation is:
$$y = \sqrt{L^2 + x_1^2}. \tag{62}$$
It is worth noting that the drag and the square root introduce severe nonlinearity into the state-space function and the measurement function, respectively.
The discrete-time nonlinear system is obtained by the Euler discretization method. Adding Gaussian white process noise and measurement noise, we obtain:
$$\begin{aligned}
x_1(n+1) &= x_1(n) + x_2(n)\,T + w_1(n) \\
x_2(n+1) &= x_2(n) + (d+g)\,T + w_2(n) \\
x_3(n+1) &= x_3(n) + w_3(n)
\end{aligned} \tag{63}$$
$$y(n) = \sqrt{L^2 + x_1^2(n)} + v(n). \tag{64}$$
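A minimal Python sketch of the discretized dynamics and measurement, Equations (63) and (64), using the constants of Equation (61) and the parameter values listed in the next paragraph; the zero-noise propagation at the end is only a sanity check, not the UKF itself.

```python
import numpy as np

rho0, kc = 1.225, 6705.6      # air density constant and scale constant, Equation (61)
g = 9.81                      # gravity acceleration (m/s^2)
T, L = 0.4, 100.0             # sampling period (s) and radar offset (m)

def drag(x):
    """Air drag, Equation (61)."""
    return rho0 * np.exp(-x[0] / kc) * x[1] ** 2 / (2.0 * x[2])

def f_discrete(x, w):
    """Euler-discretized dynamics, Equation (63)."""
    return np.array([x[0] + x[1] * T + w[0],
                     x[1] + (drag(x) + g) * T + w[1],
                     x[2] + w[2]])

def h_measure(x, v):
    """Slant-range radar measurement, Equation (64)."""
    return np.sqrt(L ** 2 + x[0] ** 2) + v

x = np.array([1e5, 5000.0, 400.0])   # initial state from the text
print(f_discrete(x, np.zeros(3)), h_measure(x, 0.0))
```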
In the UKF numerical experiment, we set the sampling period to $T = 0.4$ s, the horizontal distance to $L = 100$ m, the maximum number of samples to $N = 100$, the process noise covariance to $S_w = \mathrm{diag}(10^5, 10^3, 10^2)$, the measurement noise covariance to $S_v = 10^6$, and the initial state to $x = [10^5;\ 5000;\ 400]$. The results are as follows.
Figure 1 shows the evolution of the covariance matrix $\Sigma$. Figure 2 and Figure 3 show the Rényi entropy of the covariance matrix $\Sigma$ and its change between adjacent time steps, respectively. Notice that the uncertainty increases near the middle of the plots, which coincides with the drag peak. Nevertheless, the Rényi entropy fluctuates around 15 even though the fourth element of $\Sigma$ changes dramatically. The entropy changes closely accompany the drag peak, which means that the change of the entropy of the covariance reflects the evolution of the matrix $\Sigma$. Consequently, the Rényi entropy can be viewed as an indicator of whether the system is approaching the steady state.
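The entropy curves of Figures 2 and 3 can be reproduced from any covariance history in a few lines; here a synthetic covariance sequence stands in for the UKF output, since the full filter run is not reproduced.

```python
import numpy as np

def renyi_entropy(Sigma, alpha=2.0):
    """Equation (1) for an N-dimensional Gaussian covariance."""
    N = Sigma.shape[0]
    _, logdet = np.linalg.slogdet(Sigma)
    return 0.5 * N * np.log(2.0 * np.pi * alpha ** (1.0 / (alpha - 1.0))) + 0.5 * logdet

# Synthetic stand-in for the filter's covariance history (one matrix per step).
history = [np.eye(3) * (1.0 + 0.5 * np.exp(-0.1 * n)) for n in range(100)]

H_trace = np.array([renyi_entropy(S) for S in history])
dH = np.diff(H_trace)                  # change between adjacent steps (Figure 3)
print(H_trace[-1], np.abs(dH[-5:]))    # the entropy settles; increments tend to zero
```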

3.2. Practical Integrated Navigation

In the loosely coupled integrated navigation system, the system state vector $x$ is composed of the inertial navigation system (INS) error states in the North–East–Down (NED) local-level navigation frame and can be expressed as follows:
$$x = \left[\,(\delta r^n)^T\ \ (\delta v^n)^T\ \ \psi^T\ \ b_g^T\ \ b_a^T\,\right]^T, \tag{65}$$
where $\delta r^n$, $\delta v^n$, and $\psi$ represent the position error, the velocity error, and the attitude error, respectively; $b_g$ and $b_a$ are modeled as first-order Gauss–Markov processes, representing the gyroscope bias and the accelerometer bias, respectively.
The discrete-time state update equation is used to update state parameters as follows:
$$x_k = \Phi_{k|k-1}\,x_{k-1} + G_{k|k-1}\,w_{k-1}, \tag{66}$$
where $G_{k|k-1}$ is the system noise matrix, $w_{k-1}$ is the system noise, and $\Phi_{k|k-1}$ is the state transition matrix from $t_{k-1}$ to $t_k$, determined by the dynamic model of the state parameters.
In the loosely coupled integration, the measurement equation can be simply expressed as:
$$\delta z_k = H_kx_k + v_k, \tag{67}$$
where $v_k$ is the measurement noise, $H_k$ is the measurement matrix, and $\delta z_k$ is the measurement vector, calculated by differencing the global navigation satellite system (GNSS) observation and the inertial navigation system (INS) mechanization output.
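For concreteness, a sketch of how the loosely coupled measurement vector might be formed; the NED coordinates are hypothetical, and lever-arm compensation is deliberately omitted, so this is a simplified assumption rather than the implementation used in the test.

```python
import numpy as np

def loosely_coupled_measurement(r_ins, r_gnss):
    """Position measurement vector for Equation (67):
    INS-mechanized position minus the GNSS fix (NED, meters; lever arm ignored)."""
    return r_ins - r_gnss

# Hypothetical values for illustration.
r_ins = np.array([12.30, -4.71, 1.02])
r_gnss = np.array([12.25, -4.69, 0.98])
print(loosely_coupled_measurement(r_ins, r_gnss))
```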
The experiments reported in this section were carried out by processing data from an unmanned ground vehicle test. The gyroscope angular random walk was 0.03 deg/√h and the velocity random walk was 0.05 m/s/√h. The sampling rates of the inertial measurement unit (IMU) and the GNSS receiver were 200 Hz and 1 Hz, respectively. The test lasted 48 min.
The position error curve, velocity error curve, and attitude error curve of the loosely coupled integration are shown in Figure 4, Figure 5 and Figure 6. The root mean squares (RMSs) of the position errors in the north, east, and down directions are 0.0057 m, 0.0024 m, and 0.0134 m, respectively. The RMSs of the velocity errors in the north, east, and down directions are 0.0023 m/s, 0.0021 m/s, and 0.0038 m/s, respectively. The RMSs of the attitude errors in the roll, pitch, and yaw directions are 0.0034 deg, 0.0030 deg, and 0.0178 deg, respectively.
The Rényi entropy of the covariance $\Sigma$ is shown in Figure 7. As we can see, the Rényi entropy fluctuates around 100 once the filter converges, which is consistent with the conclusion from the entropy perspective.
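Following the observation that a stable Rényi entropy indicates steady-state operation, a convergence flag could be implemented as below; the window length and tolerance are arbitrary choices, not values used in the experiment.

```python
import numpy as np

def entropy_is_stable(H, window=50, tol=1e-2):
    """Flag steady state when the Renyi entropy stops drifting: the largest
    step-to-step change over the last `window` samples stays below `tol`."""
    H = np.asarray(H)
    if H.size <= window:
        return False
    return np.max(np.abs(np.diff(H[-window:]))) < tol

# Example: an entropy trace that settles near a constant, as in Figure 7.
H = 100.0 + 5.0 * np.exp(-0.05 * np.arange(400))
print(entropy_is_stable(H))   # True once the transient has decayed
```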

4. Conclusions and Final Remarks

We have reconsidered the original Kalman filter through the minimization of the temporal derivative of the Rényi entropy. In particular, we showed that the temporal derivative of the Rényi entropy is equal to zero when the Kalman filter system approaches the steady state, which means that the Rényi entropy approaches a stable value. Finally, simulation and practical experiments showed that the Rényi entropy indeed stays stable when the system becomes steady.
Future work includes calculating the Rényi entropy of the innovation term when the measurements and the noise are non-Gaussian [14] in order to evaluate the effectiveness of measurements and adjust the noise covariance matrix. Meanwhile, we can also calculate the Rényi entropy of the nonlinear dynamical equation to measure the nonlinearity in the propagation step.

Author Contributions

Conceptualization, Y.L. and C.G.; Funding acquisition, C.G. and J.L.; Investigation, Y.L.; Methodology, Y.L., C.G., and S.Y.; Project administration, J.L.; Resources, C.G.; Software, Y.L. and S.Y.; Supervision, J.L.; Validation, S.Y.; Visualization, S.Y.; Writing—original draft, Y.L.; Writing—review and editing, C.G., S.Y., and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant from the National Key Research and Development Program of China (2018YFB1305001).


Conflicts of Interest

The authors declare no conflict of interest.

References

1. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
2. Principe, J.C. Information Theoretic Learning: Renyi's Entropy and Kernel Perspectives; Springer Science & Business Media: Berlin, Germany, 2010.
3. He, R.; Hu, B.; Yuan, X.; Wang, L. Robust Recognition via Information Theoretic Learning; Springer International Publishing: Berlin, Germany, 2014.
4. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20 June–30 July 1961.
5. Liang, X.S. Entropy evolution and uncertainty estimation with dynamical systems. Entropy 2014, 16, 3605–3634.
6. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
7. Kalman, R.E.; Bucy, R.S. New results in linear filtering and prediction theory. J. Basic Eng. 1961, 83, 95–108.
8. DeMars, K.J. Nonlinear Orbit Uncertainty Prediction and Rectification for Space Situational Awareness. Ph.D. Thesis, The University of Texas at Austin, Austin, TX, USA, 2010.
9. DeMars, K.J.; Bishop, R.H.; Jah, M.K. Entropy-based approach for uncertainty propagation of nonlinear dynamical systems. J. Guid. Control Dyn. 2013, 36, 1047–1057.
10. Kim, H.; Liu, B.; Goh, C.Y.; Lee, S.; Myung, H. Robust vehicle localization using entropy-weighted particle filter-based data fusion of vertical and road intensity information for a large scale urban area. IEEE Robot. Autom. Lett. 2017, 2, 1518–1524.
11. Zhang, J.; Du, L.; Ren, M.; Hou, G. Minimum error entropy filter for fault detection of networked control systems. Entropy 2012, 14, 505–516.
12. Liu, Y.; Wang, H.; Hou, C. UKF based nonlinear filtering using minimum entropy criterion. IEEE Trans. Signal Process. 2013, 61, 4988–4999.
13. Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control 2000, 45, 477–482.
14. Contreras-Reyes, J.E.; Cortés, D.D. Bounds on Rényi and Shannon entropies for finite mixtures of multivariate skew-normal distributions: Application to swordfish (Xiphias gladius Linnaeus). Entropy 2016, 18, 382.
15. Ren, M.; Zhang, J.; Fang, F.; Hou, G.; Xu, J. Improved minimum entropy filtering for continuous nonlinear non-Gaussian systems using a generalized density evolution equation. Entropy 2013, 15, 2510–2523.
16. Zhang, Q. Performance enhanced Kalman filter design for non-Gaussian stochastic systems with data-based minimum entropy optimisation. AIMS Electron. Electr. Eng. 2019, 3, 382.
17. Chen, B.; Dang, L.; Gu, Y.; Zheng, N.; Príncipe, J.C. Minimum error entropy Kalman filter. IEEE Trans. Syst. Man Cybern. Syst. 2019.
18. Gultekin, S.; Paisley, J. Nonlinear Kalman filtering with divergence minimization. IEEE Trans. Signal Process. 2017, 65, 6319–6331.
19. Darling, J.E.; DeMars, K.J. Minimization of the Kullback–Leibler divergence for nonlinear estimation. J. Guid. Control Dyn. 2017, 40, 1739–1748.
20. Morelande, M.R.; Garcia-Fernandez, A.F. Analysis of Kalman filter approximations for nonlinear measurements. IEEE Trans. Signal Process. 2013, 61, 5477–5484.
21. Raitoharju, M.; García-Fernández, Á.F.; Piché, R. Kullback–Leibler divergence approach to partitioned update Kalman filter. Signal Process. 2017, 130, 289–298.
22. Hu, E.; Deng, Z.; Xu, Q.; Yin, L.; Liu, W. Relative entropy-based Kalman filter for seamless indoor/outdoor multi-source fusion positioning with INS/TC-OFDM/GNSS. Clust. Comput. 2019, 22, 8351–8361.
23. Yu, W.; Peng, J.; Zhang, X.; Li, S.; Liu, W. An adaptive unscented particle filter algorithm through relative entropy for mobile robot self-localization. Math. Probl. Eng. 2013.
24. Arasaratnam, I.; Haykin, S. Cubature Kalman filters. IEEE Trans. Autom. Control 2009, 54, 1254–1269.
25. Kiani, M.; Barzegar, A.; Pourtakdoust, S.H. Entropy-based adaptive attitude estimation. Acta Astronaut. 2018, 144, 271–282.
26. Giffin, A.; Urniezius, R. The Kalman filter revisited using maximum relative entropy. Entropy 2014, 16, 1047–1069.
27. Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum correntropy Kalman filter. Automatica 2017, 76, 70–77.
28. Chen, B.; Xing, L.; Liang, J.; Zheng, N.; Principe, J.C. Steady-state mean-square error analysis for adaptive filtering under the maximum correntropy criterion. IEEE Signal Process. Lett. 2014, 21, 880–884.
29. Yan, G.; Weng, J. Lectures on Strapdown Inertial Navigation Algorithm and Integrated Navigation Principles; Northwestern Polytechnical University Press: Xi'an, China, 2019.
30. Kumari, L.; Padma Raju, K. Application of extended Kalman filter for a free falling body towards Earth. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 2011, 2, 4.
Figure 1. Evolution of matrix Σ.
Figure 2. Simulation results for the entropy.
Figure 3. Simulation results for the change of entropy.
Figure 4. Position error of the loosely coupled integration.
Figure 5. Velocity error of the loosely coupled integration.
Figure 6. Attitude error of the loosely coupled integration.
Figure 7. Rényi entropy of the covariance Σ.
