Design Method for a Higher Order Extended Kalman Filter Based on Maximum Correlation Entropy and a Taylor Network System

Wang, Qiupeng; Sun, Xiaohui; Wen, Chenglin

doi:10.3390/s21175864


by Qiupeng Wang ¹, Xiaohui Sun ² and Chenglin Wen ³,*

¹ School of HDU-ITMO, Joint Institute, Hangzhou Dianzi University, Hangzhou 310018, China

² School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China

³ School of Automation, Guangdong University of Petrochemical Technology, Maoming 525000, China

* Author to whom correspondence should be addressed.

Sensors 2021, 21(17), 5864; https://doi.org/10.3390/s21175864

Submission received: 11 June 2021 / Revised: 28 August 2021 / Accepted: 28 August 2021 / Published: 31 August 2021

(This article belongs to the Section Intelligent Sensors)


Abstract

This paper proposes a new design method for a higher order extended Kalman filter that combines maximum correlation entropy with a Taylor network system, for nonlinear random dynamic systems with modeling errors of unknown statistical properties. Firstly, the state transition function and the measurement function are transformed into a nonlinear random dynamic model in polynomial form via system identification with a multidimensional Taylor network. Secondly, the higher order polynomial terms in the transformed state model and measurement model are defined as hidden variables of the system, so that the state model and the measurement model become equivalent to a pseudolinear model in the original variables and the hidden variables. Thirdly, the higher order hidden variables are treated as additive parameters of the system; we then establish an extended-dimension linear state model and measurement model that combine states and parameters. Finally, because only a limited number of samples of the random modeling error are available, we combine the maximum correlation entropy estimator with the Kalman filter to establish a new higher order extended Kalman filter. The effectiveness of the new filter is verified by numerical simulation.

Keywords:

pseudolinearization; maximum entropy Kalman filter; multidimensional Taylor network

1. Introduction

The application of filters occupies an important position in various fields at the national and international levels. The progress and development of filters play important roles in national economic construction—especially national defense construction—such as in real-time estimation and target tracking. In 1960, Kalman proposed a method of filtering under the minimum mean squared error criterion for linear systems, and it soon began to be widely used [1]. In order to solve nonlinear problems, extended Kalman filters (EKFs) [2], unscented Kalman filters (UKFs) [3], and cubature Kalman filters (CKFs) have since emerged. However, the above-mentioned filtering methods require the modeling error to be Gaussian white noise. As such, their performances are likely to worsen when applied to non-Gaussian situations, especially when the systems are disturbed by impulsive noise. Impulsive noise arises from heavy-tailed distributions [4] (such as some mixed Gaussian distributions), and is common in many real scenarios of automatic control and target tracking (for instance, the measurement noise in the radar system is often not Gaussian noise, but heavy-tailed non-Gaussian noise [5]). In 1993, Gordon and Salmond proposed particle filtering when the density function is known [6]; this achieves an approximation of the distribution function by sampling a large number of particles therein. However, this method is very complicated; it requires a large number of particles, and it will cause particle degradation after re-sampling. In general, the density function is difficult to obtain. For this reason, for the linear system, Chen designed the corresponding Kalman filter under the maximum correlation entropy criterion based on the limited realization of random variables [7]; this is called the maximum correlation entropy Kalman filter (MCKF) [8]. 
On this basis, the maximum correntropy extended Kalman filter (MCEKF) and the maximum correntropy unscented Kalman filter (MCUKF), which can handle nonlinear non-Gaussian systems, have since emerged [9]. However, in MCEKFs, all higher order terms in the Taylor expansion are discarded. Therefore, a large truncation error is generated, and the filtering performance degrades or the filter even diverges as the nonlinearity of the system increases. In addition, the Taylor expansion coefficients must be recalculated at each step of the state estimation, which increases the computational complexity. MCUKFs use the unscented transformation (UT) with sigma point sampling [10], which is a deterministic sampling scheme. Only 2n + 1 sampling points are used for an n-dimensional system, so the method has no clear advantage for either low-dimensional or high-dimensional systems. A large number of experiments have shown that both EKFs and UKFs are equivalent to at most a second-order polynomial approximation [11], which produces a large truncation error. Hence, both eventually face degraded filtering performance and divergence as the nonlinearity of the system increases [12].

This paper proposes a higher order extended Kalman filter method based on maximum correlation entropy, under the assumption that both the state and measurement equations can be modeled by strongly nonlinear functions. The main contributions of this paper are as follows: (1) using multidimensional Taylor networks to convert general nonlinear functions into higher order polynomials; (2) defining the polynomial terms of each order as hidden variables of the corresponding order, and treating them as time-varying parameters; (3) establishing the dynamic relationship between the time-varying parameters and combining them with the original variables to establish the extended-dimension state model; (4) based on the extended linear state variables, equivalently rewriting the measurement model in the corresponding linear form; and (5) based on the linear state and measurement models of the new extended-dimension system, establishing a higher order extended Kalman filter method based on maximum correlation entropy.

The remainder of this paper is organized as follows: Section 2 gives the preliminaries and introduces the definition of correntropy; Section 3 presents a method for identifying nonlinear functions based on multidimensional Taylor networks; Section 4 presents a higher order extended Kalman filter method; Section 5 presents the detailed design process of the maximum correlation entropy higher order extended Kalman filter; Section 6 concerns simulation verification; and the final sections present a summary and outlook.

2. Description of Correntropy

Correntropy is a generalized similarity measure between two random variables [13]. Given two one-dimensional random variables

ϕ, ξ \in R^{1}

with joint distribution function

F_{ϕ ξ} (φ, ζ)

, the correntropy (correlation entropy) is defined as follows:

V (ϕ, ξ) = ε [α (φ, ζ)] = \int α (φ, ζ) d F_{ϕ ξ} (φ, ζ)

(1)

where

ε

is the expectation operator and

α (\cdot, \cdot)

is a translation-invariant Mercer kernel. Unless otherwise stated, the kernel function used in this article is the Gaussian kernel, which is defined as follows:

α (φ, ζ) = G_{τ} (e) = \exp (- \frac{e^{2}}{2 τ^{2}})

(2)

where

e = φ - ζ

,

τ > 0

represents the kernel’s bandwidth.

Expanding Equation (2) in a Taylor series gives the following:

α (φ, ζ) = G_{τ} (e) = \exp (- \frac{e^{2}}{2 τ^{2}}) = \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} {(φ - ζ)}^{2 k}

(3)

and then the correntropy of Equation (1) has the following expression:

\begin{array}{l} V (ϕ, ξ) = ε [α (φ, ζ)] = \int \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} {(φ - ζ)}^{2 k} d F_{ϕ ξ} (φ, ζ) \\ = \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} \int {(φ - ζ)}^{2 k} d F_{ϕ ξ} (φ, ζ) = \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} ε {{(ϕ - ξ)}^{2 k}} \end{array}

(4)

where

ε {{(ϕ - ξ)}^{2 k}} = \int^{} {(φ - ζ)}^{2 k} d F_{ϕ ξ} (φ, ζ)

is the

2 k

th order moment of the error

ϕ - ξ

.

However, in most practical cases, the joint distribution

F_{ϕ ξ}

is unknown, and only a finite number of realizations

(φ^{(j)}, ζ^{(j)}), j = 1, 2, \dots, N

of

(ϕ, ξ)

are available. In these cases, the sample mean estimator can be used to estimate the moments:

ε {{(ϕ - ξ)}^{2 k}} \approx \frac{1}{N} (\sum_{j = 1}^{N} {(φ^{(j)} - ζ^{(j)})}^{2 k})

(5)

Substituting Equation (5) into Equation (4) and writing

e^{(j)} = φ^{(j)} - ζ^{(j)}

, the correntropy of the random variable pair

(ϕ, ξ)

is estimated from the finite data as follows:

\begin{array}{l} \hat{V} (ϕ, ξ) = \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} \frac{1}{N} (\sum_{j = 1}^{N} {(φ^{(j)} - ζ^{(j)})}^{2 k}) \\ = \frac{1}{N} \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{2^{k} τ^{2 k} k!} (\sum_{j = 1}^{N} {(e^{(j)})}^{2 k}) \\ = \frac{1}{N} \sum_{j = 1}^{N} G_{τ} (e^{(j)}) \end{array}

(6)

When

ϕ, ξ \in R^{n}

and the components of the vector

e = ϕ - ξ

are independent of one another, the multidimensional correntropy can likewise be estimated from N samples.
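As an illustration of the sample estimator in Equation (6), the following minimal Python sketch computes the sample correntropy with a Gaussian kernel and compares it with the mean squared error when a few impulsive errors are present. The function names, the synthetic data, and the bandwidth value are assumptions chosen for this example and are not taken from the paper.

```python
import numpy as np

def gaussian_kernel(e, tau):
    """Gaussian kernel G_tau(e) = exp(-e^2 / (2 tau^2)), as in Equation (2)."""
    e = np.asarray(e, dtype=float)
    return np.exp(-e ** 2 / (2.0 * tau ** 2))

def sample_correntropy(phi, zeta, tau):
    """Sample-mean estimate (1/N) * sum_j G_tau(phi_j - zeta_j), as in Equation (6)."""
    e = np.asarray(phi, dtype=float) - np.asarray(zeta, dtype=float)
    return float(np.mean(gaussian_kernel(e, tau)))

# Hypothetical data: a signal, a slightly noisy copy, and a copy with impulsive errors.
rng = np.random.default_rng(0)
phi = rng.normal(size=1000)
clean = phi + 0.1 * rng.normal(size=1000)
outliers = clean.copy()
outliers[:10] += 50.0  # ten large impulsive errors

v_clean = sample_correntropy(phi, clean, tau=1.0)
v_outliers = sample_correntropy(phi, outliers, tau=1.0)
mse_clean = float(np.mean((phi - clean) ** 2))
mse_outliers = float(np.mean((phi - outliers) ** 2))
```

Because the kernel is bounded between 0 and 1, ten outliers among 1000 samples can lower the estimate by at most 0.01, whereas the mean squared error grows with the square of the outlier magnitude. This boundedness is the property that motivates using correntropy under impulsive noise.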

3. Non-Linear Model Identification Based on Multidimensional Taylor Networks

Consider a complex dynamic system whose state model and observation model are nonlinear [14]:

α (τ + 1) = σ (α (τ)) + γ (τ)

(7)

Γ (τ + 1) = δ (α (τ + 1)) + θ (τ + 1)

(8)

where

α (τ) \in R^{h}

is an h-dimensional state vector;

Γ (τ + 1) \in R^{d}

represents the d-dimensional measurement vector; and

σ_{i} (α (τ)), i = 1, 2, \dots, h

and

δ_{j} (α (τ + 1)), j = 1, 2, \dots, d

represent the state function and the measurement function, respectively. The non-Gaussian modeling errors of the system are

γ (τ)

and

θ (τ + 1)

, while

ϑ = \mathrm{diag} (ϑ_{1}, ϑ_{2}, \dots, ϑ_{h})

and

η = \mathrm{diag} (η_{1}, η_{2}, \dots, η_{d})

are the process noise variance and the measurement noise variance, respectively.

Lemma 1.

Any continuous function defined on a closed interval can be approximated to arbitrary accuracy by a polynomial function [15].

Lemma 2.

A continuous function

σ (α (τ))

defined on a closed interval can be approximated by the following [16]:

\sum_{i = 1}^{N (h, l)} ψ_{i} (τ) \prod_{t = 1}^{h} α_{t}^{λ_{i, t}} (τ)

(9)

where

N (h, l)

denotes the total number of terms in the expansion and

λ_{i, t}

denotes the power of the variable

α_{t}

in the ith product term.

3.1. Multidimensional Taylor Network Structure

Under certain conditions, the multidimensional Taylor network model can replace a traditional neural network for the dynamic modeling and control of a system; it is characterized by a nonlinear autoregressive moving-average model composed of polynomials. The multidimensional Taylor network (MTN) uses a feedforward structure with a single intermediate layer, comprising an input layer, an intermediate layer, and an output layer. Suppose that the input layer comprises h nodes,

α (τ) = {[\begin{matrix} α_{1} (τ) & α_{2} (τ) & \dots & α_{h} (τ) \end{matrix}]}^{T} \in R^{h}

, and that the output layer is

α (τ + 1)

. The middle layer is the network processing layer, in which the power product terms of the input variables are formed and summed with weights. The middle layer is composed of the various power product terms and the corresponding connection weight vector

ψ_{j} (τ)

:

ψ_{j} (τ) = {[ψ_{j, 1} (τ), ψ_{j, 2} (τ), \dots, ψ_{j, N (h, l)} (τ)]}^{T}

which represents the output weight vector connecting the intermediate layer and the output node of the network.

According to the multivariate Taylor formula, if a function is differentiable up to order

m + 1

at a certain point, then the function can be expanded about that point as a polynomial in the variables of order no greater than m, plus a remainder. The model can be expressed as a dynamic equation, as follows:

α_{j} (τ + 1) = σ_{j} (α (τ)) = \sum_{i = 1}^{N (h, l)} ψ_{j, i} (τ) \prod_{t = 1}^{h} α_{t}^{λ_{i, t}} (τ) + Δ σ_{j} (τ)

(10)

where

σ_{j} (\cdot)

is the nonlinear function described by the multidimensional Taylor network model,

ψ_{j, i}

represents the weight of the ith product term,

N (h, l)

denotes the total number of terms in the expansion,

λ_{i, t}

denotes the power of the variable

α_{t}

in the ith product term, and

Δ σ_{j} (τ)

is the error, also known as the remainder, produced when the function is identified by a multidimensional Taylor network.
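To make the structure of the middle layer concrete, the following minimal Python sketch enumerates the exponent tuples of the power product terms and evaluates them for a given input. It assumes that the expansion has no constant term (in line with the two-variable expansion of Equation (13)) and that the terms are ordered by total degree; the function names are hypothetical and not from the paper.

```python
from itertools import product

import numpy as np

def mtn_exponents(h, l):
    """Exponent tuples (lambda_1, ..., lambda_h) with total degree between 1 and l.

    Assumption: no constant term; terms are sorted by total degree, then by
    decreasing power of the first variables.
    """
    terms = [lam for lam in product(range(l + 1), repeat=h) if 1 <= sum(lam) <= l]
    return sorted(terms, key=lambda lam: (sum(lam), tuple(-p for p in lam)))

def mtn_features(alpha, exponents):
    """Evaluate every power product term prod_t alpha_t ** lambda_t for one input."""
    alpha = np.asarray(alpha, dtype=float)
    return np.array([np.prod(alpha ** np.array(lam)) for lam in exponents])

def mtn_output(alpha, weights, exponents):
    """Weighted sum of the product terms, i.e. the polynomial part of Equation (10)."""
    return float(np.dot(weights, mtn_features(alpha, exponents)))

exps = mtn_exponents(h=2, l=2)
feats = mtn_features([2.0, 3.0], exps)
```

Under these assumptions, the number of terms for two inputs and third order is 2 + 3 + 4 = 9, which is the value that N(h, l) would take in this setting.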

3.2. Parameter Identification Method Based on Kalman Filtering

Model Establishment of a Kalman Filter

A Kalman filter can be regarded as an optimal recursive data processing algorithm that describes the entire system through a state equation and an observation equation.

State equation:

ψ_{j, i} (τ + 1) = ψ_{j, i} (τ) + w_{j, i} (τ)

(11)

where

i = 1, 2, \dots, N (h, l)

,

j = 1, 2 \dots, h

.

Observation equation:

From Figure 1, it follows that:

\begin{array}{l} α_{j} (τ + 1) = \sum_{i = 1}^{N (h, l)} ψ_{j, i} (τ + 1) \prod_{t = 1}^{h} α_{t}^{λ_{i, t}} (τ + 1) + v_{j} (τ + 1) \\ = H_{j} (τ + 1) \cdot {[ψ_{j, 1} (τ + 1), ψ_{j, 2} (τ + 1), \dots, ψ_{j, N (h, l)} (τ + 1)]}^{T} + v_{j} (τ + 1) \\ = H_{j} (τ + 1) \cdot ψ_{j} (τ + 1) + v_{j} (τ + 1) \end{array}

Thus,

\begin{array}{l} α (τ + 1) = {[α_{1} (τ + 1), α_{2} (τ + 1), \dots, α_{j} (τ + 1), \dots, α_{h} (τ + 1)]}^{T} \\ = H (τ) \cdot ψ (τ + 1) + v (τ + 1) \end{array}

(12)

where

H (τ) = {[H_{1} (τ + 1), H_{2} (τ + 1), \dots, H_{h} (τ + 1)]}^{T}

;

ψ (τ + 1) = [ψ_{1} (τ + 1), ψ_{2} (τ + 1), \dots, ψ_{h} (τ + 1)]

;

ψ_{j, i} (τ + 1)

represents the system state at time

τ + 1

, that is, the parameter value at that moment; and

α (τ + 1)

represents the output value of the network. It is assumed in the analysis that both the process noise

w (τ)

and the measurement noise

v (τ + 1)

are Gaussian white noise, with

Q_{j} = \mathrm{diag} (\begin{matrix} Q_{j, 1} & Q_{j, 2} & \dots & Q_{j, N (h, l)} \end{matrix})

and

R_{j} = \mathrm{diag} (\begin{matrix} R_{j, 1} & R_{j, 2} & \dots & R_{j, N (h, l)} \end{matrix})

denoting the process noise variance and the measurement noise variance, respectively. Here, we use a Kalman filter to identify the parameters of the dynamic model. As the filtering principle of Kalman filters is described later in this article, please refer to Equations (20)–(24) for the detailed process.

Figure 1. Model of a multidimensional Taylor network.
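The random-walk weight model of Equation (11) together with a linear observation of the product terms can be sketched in Python as follows. This is a minimal illustration for a single output component with a scalar observation, not the authors' implementation: the test function, the noise levels `q` and `r`, the initial covariance, and the function name are all assumptions made for the example.

```python
import numpy as np

def identify_weights(H_rows, outputs, q=1e-8, r=1e-4):
    """Kalman filter for psi(t+1) = psi(t) + w(t), y(t) = H(t) psi(t) + v(t).

    H_rows[t] holds the product terms at step t; q and r are assumed scalar
    process and measurement noise variances.
    """
    n = H_rows.shape[1]
    psi = np.zeros(n)          # initial weight estimate (assumed zero)
    P = np.eye(n)              # initial error covariance (assumed identity)
    for H, y in zip(H_rows, outputs):
        P = P + q * np.eye(n)              # prediction: weights follow a random walk
        K = P @ H / (H @ P @ H + r)        # gain for a scalar observation
        psi = psi + K * (y - H @ psi)      # correct with the innovation
        P = P - np.outer(K, H @ P)         # covariance update (I - K H) P
    return psi

# Hypothetical data: one output generated by a second-order polynomial of two inputs.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=(500, 2))
H_rows = np.column_stack([x[:, 0], x[:, 1], x[:, 0] ** 2, x[:, 0] * x[:, 1], x[:, 1] ** 2])
psi_true = np.array([0.5, 0.0, 0.0, -0.2, 0.3])
outputs = H_rows @ psi_true + 0.01 * rng.normal(size=500)

psi_hat = identify_weights(H_rows, outputs)
```

With a very small `q`, the recursion behaves much like recursive least squares, which is why the estimated weights settle close to the values used to generate the data.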

3.3. Approximation Analysis

Given a class of nonlinear functions

σ (α (τ))

, assume that it is differentiable up to the rth order, where r is a relatively large number, which makes it difficult to approximate the function with a full rth order Taylor network. A practical approach is to choose

m, 1 \leq m \leq r

and use the Taylor network to expand the nonlinear function to the mth order, obtaining the result of Equation (10), while ensuring that the higher order error term satisfies

Δ σ \leq θ

, where

θ

is the acceptable error threshold. This not only makes the Taylor network fitting process easier, but also ensures the accuracy of the fit.
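The idea of choosing the smallest order m whose fitting error stays below a threshold can be illustrated with a one-dimensional stand-in. The sketch below uses an ordinary least-squares polynomial fit from NumPy rather than a multidimensional Taylor network, so it only illustrates the selection rule; the test function, interval, threshold, and function name are assumptions.

```python
import numpy as np
from numpy.polynomial import polynomial as npoly

def smallest_order(f, lo, hi, threshold, max_order=15, n_points=400):
    """Return the smallest order m (and its error) whose least-squares
    polynomial fit of f on [lo, hi] has maximum absolute error <= threshold."""
    x = np.linspace(lo, hi, n_points)
    y = f(x)
    for m in range(1, max_order + 1):
        coeffs = npoly.polyfit(x, y, m)                  # coefficients, lowest degree first
        err = float(np.max(np.abs(npoly.polyval(x, coeffs) - y)))
        if err <= threshold:
            return m, err
    raise ValueError("no order up to max_order meets the threshold")

# Hypothetical example: approximate sin on [-1, 1] to within 1e-2.
m_sel, err_sel = smallest_order(np.sin, -1.0, 1.0, threshold=1e-2)
```

For this odd function, first- and second-order fits leave an error of a few hundredths, so a third-order expansion is the first one that meets the threshold.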

4. Higher Order Extended Kalman Filter

4.1. Pseudolinearized Representation of Nonlinear Functions

For ease of description and understanding, take

h = d = 2

; Equation (7) can then be expanded through a multidimensional Taylor network to the mth order, as follows:

\begin{array}{l} σ_{i} (α (τ)) = (ω_{i, 1, 0} α_{1} (τ) + ω_{i, 0, 1} α_{2} (τ)) \\ + (ω_{i, 2, 0} α_{1}^{2} (τ) + ω_{i, 1, 1} α_{1} (τ) α_{2} (τ) + ω_{i, 0, 2} α_{2}^{2} (τ)) \\ + (ω_{i, 3, 0} α_{1}^{3} (τ) + ω_{i, 2, 1} α_{1}^{2} (τ) α_{2} (τ) + ω_{i, 1, 2} α_{1} (τ) α_{2}^{2} (τ) + ω_{i, 0, 3} α_{2}^{3} (τ)) \\ + \dots + \sum_{\begin{matrix} l_{1} + l_{2} = l \\ 0 \leq l_{1}, l_{2} \leq l \end{matrix}} ω_{i, l_{1}, l_{2}} α_{1}^{l_{1}} (τ) α_{2}^{l_{2}} (τ) + \dots + \sum_{\begin{matrix} m_{1} + m_{2} = m \\ 0 \leq m_{1}, m_{2} \leq m \end{matrix}} ω_{i, m_{1}, m_{2}} α_{1}^{m_{1}} (τ) α_{2}^{m_{2}} (τ) + Δ σ_{i} (τ) \end{array}

(13)

where

\sum_{\begin{matrix} l_{1} + l_{2} = l \\ 0 \leq l_{1}, l_{2} \leq l \end{matrix}} α_{1}^{l_{1}} (τ) α_{2}^{l_{2}} (τ)

is the sum of all product terms (tensors) of the

l

th order and

ω_{i, l_{1}, l_{2}}

represents the weight corresponding to each product term of that order.

Definition 1.

α^{(l)} (τ) = \{ α_{1}^{l_{1}} (τ) α_{2}^{l_{2}} (τ) \dots α_{h}^{l_{h}} (τ), \; l_{1} + l_{2} + \dots + l_{h} = l; \; 0 \leq l_{j} \leq l; \; l = 0, 1, \dots, m \}

is the set of implicit variables of the

l

th order.

Definition 2.

ω_{i}^{(l)} = [ω_{i; 1}^{(l)}, ω_{i; 2}^{(l)}, \dots, ω_{i; n_{l}}^{(l)}] = [ω_{i; l, 0}, ω_{i; l - 1, 1}, \dots, ω_{i; 0, l}], i = 1, 2, \dots, h

is the weight vector corresponding to the lth order implicit variables in the ith state equation.

In [17], there is a detailed pseudolinearization process, so we will not repeat it in this article. In order to make the model more accurate, we treat the remainder

Δ σ (τ)

of the equation of state as latent variables. According to Definition 1 and Definition 2, the pseudolinear extended dimension form using the remainder as a hidden variable is as follows:

α^{(1)} (τ + 1) = W^{(1)} (τ + 1, τ) α^{(1)} (τ) + \sum_{l = 2}^{m} W^{(l)} (τ + 1, τ) α^{(l)} (τ) + C \cdot Δ σ (τ) + γ^{(1)} (τ)

(14)

where

α^{(1)} (τ) = [\begin{matrix} α_{1}^{(1)} (τ) \\ α_{2}^{(1)} (τ) \end{matrix}]

,

W^{(l)} = [\begin{matrix} ω_{1}^{(l)} \\ ω_{2}^{(l)} \end{matrix}]

,

γ (τ) = γ^{(1)} (τ) = [\begin{matrix} γ_{1}^{(1)} (τ) \\ γ_{2}^{(1)} (τ) \end{matrix}]

,

C = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}]

.

Similarly, Equation (8) can be rewritten as follows:

Γ^{(1)} (τ + 1) = χ^{(1)} (τ + 1) α^{(1)} (τ + 1) + \sum_{l = 2}^{m} χ^{(l)} (τ + 1) α^{(l)} (τ + 1) + D \cdot Δ σ (τ + 1) + θ^{(1)} (τ + 1)

(15)

where

Γ^{(1)} (τ + 1) = [\begin{matrix} Γ_{1}^{(1)} (τ + 1) \\ Γ_{2}^{(1)} (τ + 1) \end{matrix}]

,

χ^{(l)} = [\begin{matrix} χ_{1}^{(l)} \\ χ_{2}^{(l)} \end{matrix}]

,

θ^{(1)} (τ + 1) = [\begin{matrix} θ_{1}^{(1)} (τ + 1) \\ θ_{2}^{(1)} (τ + 1) \end{matrix}]

,

D = [\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}]

.

4.2. Linearized Representation of Nonlinear Functions

In order to transform the pseudolinear model established in Section 4.1 into a true linear form, it is necessary to establish a dynamic relationship between the lth order hidden variables and the uth order hidden variables [18]:

α^{(l)} (τ + 1) = W_{l}^{(u)} (τ) α^{(u)} (τ), \quad l, u = 2, 3, \dots, m

(16)

where

W

can be identified based on the multidimensional Taylor network in its original state; without any prior information, it can be set as follows:

W_{l}^{(u)} (τ) = \begin{cases} I, & l = u \\ 0, & l \neq u \end{cases}

(17)

Combining Definition 1, Definition 2, and Equation (16), the state model Equation (7) can be written in linear matrix form.

Let

A (τ) = {[{(α^{(1)} (τ))}^{T}, {(α^{(2)} (τ))}^{T}, \dots, {(α^{(l)} (τ))}^{T}, \dots, {(α^{(m)} (τ))}^{T}, Δ σ (τ)]}^{T}

W (τ + 1, τ) = [\begin{matrix} W_{1}^{(1)} (τ) & W_{1}^{(2)} (τ) & \dots & W_{1}^{(u)} (τ) & \dots & W_{1}^{(m - 1)} (τ) & W_{1}^{(m)} (τ) & C \\ W_{2}^{(1)} (τ) & W_{2}^{(2)} (τ) & \dots & W_{2}^{(u)} (τ) & \dots & W_{2}^{(m - 1)} (τ) & W_{2}^{(m)} (τ) & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ W_{l}^{(1)} (τ) & W_{l}^{(2)} (τ) & \dots & W_{l}^{(u)} (τ) & \dots & W_{l}^{(m - 1)} (τ) & W_{l}^{(m)} (τ) & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ W_{m - 1}^{(1)} (τ) & W_{m - 1}^{(2)} (τ) & \dots & W_{m - 1}^{(u)} (τ) & \dots & W_{m - 1}^{(m - 1)} (τ) & W_{m - 1}^{(m)} (τ) & 0 \\ W_{m}^{(1)} (τ) & W_{m}^{(2)} (τ) & \dots & W_{m}^{(u)} (τ) & \dots & W_{m}^{(m - 1)} (τ) & W_{m}^{(m)} (τ) & 0 \\ 0 & 0 & \dots & 0 & \dots & 0 & 0 & C \end{matrix}], γ (τ) = [\begin{matrix} γ^{(1)} (τ) \\ γ^{(2)} (τ) \\ ⋮ \\ γ^{(l)} (τ) \\ ⋮ \\ γ^{(m - 1)} (τ) \\ γ^{(m)} (τ) \end{matrix}]

then, Equation (7) has the following linearized form:

A (τ + 1) = W (τ + 1, τ) A (τ) + γ (τ)

(18)

where

γ (τ)

is the modeling error.

In the same way, the linear matrix form of the measurement model can be obtained:

Γ (τ + 1) = χ (τ + 1, τ) A (τ + 1) + θ (τ + 1)

(19)

where

χ (τ + 1, τ) = [\begin{matrix} χ_{1}^{(1)} (τ + 1) & χ_{1}^{(2)} (τ + 1) & \dots & χ_{1}^{(u)} (τ + 1) & \dots & χ_{1}^{(m - 1)} (τ + 1) & χ_{1}^{(m)} (τ + 1) & 0 & 0 \\ χ_{2}^{(1)} (τ + 1) & χ_{2}^{(2)} (τ + 1) & \dots & χ_{2}^{(u)} (τ + 1) & \dots & χ_{2}^{(m - 1)} (τ + 1) & χ_{2}^{(m)} (τ + 1) & 0 & 0 \end{matrix}]

,

Γ (τ + 1) = [\begin{matrix} Γ_{1} (τ + 1) \\ Γ_{2} (τ + 1) \end{matrix}]

, and

θ (τ + 1) = [\begin{matrix} θ_{1} (τ + 1) \\ θ_{2} (τ + 1) \end{matrix}]

is the modeling error.
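For the two-dimensional case, the augmented state A(τ) and the block transition matrix W(τ + 1, τ) can be assembled as in the following minimal Python sketch. It assumes the default choice of Equation (17), in which each block of hidden variables and the remainder are carried over unchanged, and it orders the lth order terms as in Definition 2. The function names and the numerical weights are hypothetical.

```python
import numpy as np

def hidden_variables(alpha, m):
    """Blocks alpha^(1), ..., alpha^(m) for two state variables.

    Block l holds a1**(l-j) * a2**j for j = 0..l (the ordering of Definition 2).
    """
    a1, a2 = alpha
    return [np.array([a1 ** (l - j) * a2 ** j for j in range(l + 1)], dtype=float)
            for l in range(1, m + 1)]

def augmented_state(alpha, m, remainder=(0.0, 0.0)):
    """Stack the blocks and the remainder into the augmented state A."""
    return np.concatenate(hidden_variables(alpha, m) + [np.asarray(remainder, dtype=float)])

def transition_matrix(W_blocks):
    """First block row is [W^(1) ... W^(m) C] with C = I; all other rows are
    identity dynamics, which is the no-prior-information choice of Equation (17)."""
    n = sum(W.shape[1] for W in W_blocks) + 2
    W = np.eye(n)
    W[:2, :] = np.hstack(list(W_blocks) + [np.eye(2)])
    return W

# Hypothetical second-order example.
W1 = np.array([[0.5, 0.0], [0.0, 0.5]])
W2 = np.array([[0.1, 0.0, 0.0], [0.0, 0.0, 0.2]])
A = augmented_state((2.0, 3.0), m=2)
A_next = transition_matrix([W1, W2]) @ A
```

Only the first two entries of `A_next` are driven by the identified weights; under the identity assumption the hidden variables are propagated as slowly varying parameters and are corrected later by the filter.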

4.3. Design of Higher Order Extended Kalman Filter

For the linear model of Equations (18) and (19), a Kalman filter can be applied directly. Given the initial value

A (0)

, assume that

γ (τ)

and

θ (τ + 1)

are zero-mean Gaussian white noise whose variances are denoted

ϑ

and

η

, respectively.

A recursive filter can then be designed as follows:

\hat{A} (τ + 1 | τ) = W (τ + 1, τ) \hat{A} (τ | τ)

(20)

λ (τ + 1 | τ) = W (τ + 1, τ) λ (τ | τ) W^{T} (τ + 1, τ) + ϑ (τ)

(21)

Κ (τ + 1) = (λ (τ + 1 | τ) χ^{T} (τ + 1)) {(χ (τ + 1) λ (τ + 1 | τ) χ^{T} (τ + 1) + η (τ + 1))}^{- 1}

(22)

\hat{A} (τ + 1 | τ + 1) = \hat{A} (τ + 1 | τ) + Κ (τ + 1) (Γ (τ + 1) - χ (τ + 1) \hat{A} (τ + 1 | τ))

(23)

λ (τ + 1 | τ + 1) = (I - Κ (τ + 1) χ (τ + 1)) λ (τ + 1 | τ)

(24)
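One step of the recursion in Equations (20)–(24) can be written compactly in Python. The sketch below is a generic linear Kalman filter step using the symbols of this section; the function and argument names are assumptions made for the example.

```python
import numpy as np

def kf_step(A_hat, lam, W, chi, vartheta, eta, Gamma):
    """One predict/update cycle of Equations (20)-(24).

    A_hat, lam : previous estimate and its error covariance
    W, chi     : transition and measurement matrices
    vartheta   : process noise covariance; eta : measurement noise covariance
    Gamma      : current measurement vector
    """
    A_pred = W @ A_hat                                               # Equation (20)
    lam_pred = W @ lam @ W.T + vartheta                              # Equation (21)
    K = lam_pred @ chi.T @ np.linalg.inv(chi @ lam_pred @ chi.T + eta)  # Equation (22)
    A_new = A_pred + K @ (Gamma - chi @ A_pred)                      # Equation (23)
    lam_new = (np.eye(A_hat.size) - K @ chi) @ lam_pred              # Equation (24)
    return A_new, lam_new

# Scalar sanity check with hypothetical numbers: equal prior and measurement
# variances give a gain of 0.5, so the estimate moves halfway to the measurement.
A_demo, lam_demo = kf_step(np.array([0.0]), np.eye(1), np.eye(1), np.eye(1),
                           np.zeros((1, 1)), np.eye(1), np.array([2.0]))
```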

5. Higher Order Extended Kalman Filter Design Based on Maximum Correlation Entropy

5.1. Non-Gaussian Modeling of State Vector Based on Multivariate Information Observation

The estimate

\hat{A} (τ | τ)

of the system state

A (τ)

and the estimation error covariance

λ (τ | τ)

are obtained from the Kalman filter [19]. The prediction equations then give the one-step prediction

\hat{A} (τ + 1 | τ)

and the corresponding one-step prediction error covariance matrix

λ (τ + 1 | τ)

.

The one-step prediction error of the system state

A (τ + 1)

is as follows:

\begin{array}{l} \tilde{A} (τ + 1 | τ) = A (τ + 1) - \hat{A} (τ + 1 | τ) \\ = W (τ + 1, τ) \tilde{A} (τ | τ) + γ (τ) \end{array}

(25)

which can be rearranged into a measurement model for the system state

A (τ + 1)

as follows:

\hat{A} (τ + 1 | τ) = A (τ + 1) - \tilde{A} (τ + 1 | τ)

(26)

where

\hat{A} (τ + 1 | τ)

is treated as a measurement of the system state

A (τ + 1)

, while

\tilde{A} (τ + 1 | τ)

is the measurement error. Finally, the combined measurement model is as follows:

[\begin{matrix} \hat{A} (τ + 1 | τ) \\ Γ (τ + 1) \end{matrix}] = [\begin{matrix} I \\ χ (τ + 1) \end{matrix}] A (τ + 1) + ϖ^{(j)} (τ + 1)

(27)

where I is the identity matrix of the corresponding dimension,

ϖ^{(j)} (τ + 1) = [\begin{matrix} - \tilde{A} (τ + 1 | τ) \\ θ^{(j)} (τ + 1) \end{matrix}]

, and

E [ϖ^{(j)} (τ + 1) {(ϖ^{(j)})}^{T} (τ + 1)] = [\begin{matrix} \tilde{λ} (τ + 1 | τ) & 0 \\ 0 & \tilde{η} (τ + 1) \end{matrix}]

(28)

According to Equation (25), the one-step prediction error covariance of the system state

A (τ + 1)

is obtained as follows:

\tilde{λ} (τ + 1 | τ) = W (τ + 1, τ) λ (τ | τ) W^{T} (τ + 1, τ) + ϑ (τ)

(29)

where

ϑ (τ) = \mathrm{diag} \{ϑ^{(1)} (τ), ϑ^{(2)} (τ), \dots, ϑ^{(m)} (τ)\}

and

ϑ^{(2)} (τ), \dots, ϑ^{(m)} (τ)

are the covariance matrices of the random error vectors

γ^{(2)} (τ), \dots, γ^{(m)} (τ)

that arise when the higher order hidden variables

α^{(2)} (τ), \dots, α^{(m)} (τ)

are dynamically modeled.

ϑ^{(1)} (τ)

is the second-order statistic of the non-Gaussian modeling error

γ^{(1)} (τ)

in the original system state model (Equation (7)), calculated from a limited number of samples:

{\tilde{ϑ}}^{(1)} (τ) = \frac{1}{N} \sum_{j = 1}^{N} {[γ^{(1, j)} (τ) - \bar{γ} (τ)] {[γ^{(1, j)} (τ) - \bar{γ} (τ)]}^{T}}

(30)

In Equation (28),

\tilde{η} (τ + 1)

is the second-order statistic of the non-Gaussian modeling error

θ (τ + 1)

in the original system measurement model (Equation (8)), calculated from a limited number of samples:

\tilde{η} (τ + 1) = \frac{1}{N} \sum_{j = 1}^{N} {[θ^{(j)} (τ + 1) - \bar{θ} (τ + 1)] {[θ^{(j)} (τ + 1) - \bar{θ} (τ + 1)]}^{T}}

(31)

where

θ^{(j)} (τ + 1)

is the jth realization vector of the non-Gaussian random noise vector

θ (τ + 1)

.

5.2. The Statistical Independence Process of Each Component in the Non-Gaussian Modeling Error Vector $ϖ (τ + 1)$ in the Comprehensive Measurement Model

The vector

ϖ (τ + 1)

in the comprehensive measurement model, Equation (27), is a non-Gaussian modeling error vector, and its components are not statistically independent. In order to use the correntropy form for a vector with independent components described at the end of Section 2, the non-Gaussian vector

ϖ (τ + 1)

needs to be transformed so that its components are statistically independent.

Since

λ (τ + 1 | τ) = E {\tilde{A} (τ + 1 | τ) {\tilde{A}}^{T} (τ + 1 | τ)}

,

λ (τ + 1 | τ)

is a positive definite matrix. Similarly, in Equation (31),

\tilde{η} (τ + 1)

is also a positive definite matrix. For this reason, Equation (28) is further expressed as follows:

\begin{array}{l} E {ϖ^{(i)} (τ + 1) {(ϖ^{(i)})}^{T} (τ + 1)} = [\begin{matrix} Λ_{α} (τ + 1 | τ) Λ_{α}^{T} (τ + 1 | τ) & 0 \\ 0 & Λ_{Γ} (τ + 1) Λ_{Γ}^{T} (τ + 1) \end{matrix}] \\ = Λ (τ + 1) Λ^{T} (τ + 1) \end{array}

(32)

where

Λ_{α} (τ + 1 | τ)

and

Λ_{Γ} (τ + 1)

are the Cholesky factor matrices of

\tilde{λ} (τ + 1 | τ)

and

\tilde{η} (τ + 1)

, respectively.

Applying

Λ^{- 1} (τ + 1)

to both sides of Equation (27) yields:

Λ^{- 1} (τ + 1) [\begin{matrix} \hat{A} (τ + 1 | τ) \\ Γ (τ + 1) \end{matrix}] = Λ^{- 1} (τ + 1) [\begin{matrix} I \\ χ (τ + 1) \end{matrix}] A (τ + 1) + Λ^{- 1} (τ + 1) ϖ^{(j)} (τ + 1)

(33)

With the definitions

D (τ + 1) = Λ^{- 1} (τ + 1) [\begin{matrix} \hat{A} (τ + 1 | τ) \\ Γ (τ + 1) \end{matrix}], S (τ + 1) = Λ^{- 1} (τ + 1) [\begin{matrix} I \\ χ (τ + 1) \end{matrix}], e (τ + 1) = Λ^{- 1} (τ + 1) ϖ (τ + 1)

, the above equation can be written compactly as follows:

D (τ + 1) = S (τ + 1) A (τ + 1) + e (τ + 1)

(34)

Moreover,

\begin{array}{l} E {e (τ + 1) e^{T} (τ + 1)} = E {[Λ^{- 1} (τ + 1) ϖ (τ + 1)] {[Λ^{- 1} (τ + 1) ϖ (τ + 1)]}^{T}} \\ = Λ^{- 1} (τ + 1) E {ϖ (τ + 1) ϖ^{T} (τ + 1)} {(Λ^{- 1} (τ + 1))}^{T} \\ = Λ^{- 1} (τ + 1) Λ (τ + 1) Λ^{T} (τ + 1) {(Λ^{- 1} (τ + 1))}^{T} \\ = I \end{array}

(35)

Therefore, after the non-Gaussian modeling error vector

ϖ (τ + 1)

is transformed by the matrix

Λ^{- 1} (τ + 1)

, the components of the random vector

e (τ + 1)

are uncorrelated with unit variance and are treated as statistically independent.
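The whitening step of Equations (32)–(35) can be checked numerically. The following minimal Python sketch builds a block-diagonal covariance from two hypothetical matrices, takes its Cholesky factor with NumPy, and verifies that the transformed covariance is the identity; the numerical values are assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical one-step prediction covariance and measurement noise covariance.
lam_pred = np.array([[2.0, 0.6],
                     [0.6, 1.0]])
eta = np.array([[0.5]])

# Block-diagonal covariance of the combined error vector, as in Equation (28).
cov = np.block([[lam_pred, np.zeros((2, 1))],
                [np.zeros((1, 2)), eta]])

Lam = np.linalg.cholesky(cov)        # lower-triangular factor, cov = Lam @ Lam.T
Lam_inv = np.linalg.inv(Lam)

# Covariance of e = Lam^{-1} * varpi, following the steps of Equation (35).
whitened_cov = Lam_inv @ cov @ Lam_inv.T
```

Because the covariance is block diagonal, its Cholesky factor is block diagonal as well, so the two factors can be computed separately from the prediction covariance and the measurement noise covariance.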

5.3. Implementation Process of a Higher Order Extended Kalman Filter Based on Maximum Entropy

The filtering process of the higher order extended Kalman filter based on the maximum correlation entropy (H-MCKF) is as follows (see [20] for the specific derivation process):

(1) Initialize the filter: obtain the initial estimate $\hat{A} (0)$ and the covariance $λ (0)$ , and choose a suitable kernel bandwidth $ο$ and a small positive number $ε$ ;
(2) Use the Taylor network for system identification to obtain the parameters in the equations, take the expanded terms and the remainder as new hidden variables, and perform the pseudolinearization to obtain the pseudolinear form of the system;
(3) Use Equations (20) and (21) to obtain $\hat{A} (τ + 1 | τ)$ and $λ (τ + 1 | τ)$ , respectively, and use Cholesky decomposition to obtain $Λ_{α} (τ + 1 | τ)$ ;
(4) Set $t = 1$ and $\hat{A} {(τ + 1 | τ + 1)}_{0} = \hat{A} (τ + 1 | τ)$ , where $\hat{A} {(τ + 1 | τ + 1)}_{t}$ represents the estimated state at fixed-point iteration t;
(5) Perform one step of the fixed-point iteration as follows:

${\tilde{e}}_{i} (τ + 1) = d_{i} (τ + 1) - s_{i} (τ + 1) \hat{A} {(τ + 1 | τ + 1)}_{t - 1}$

(36)

where ${\tilde{e}}_{i}$ is the ith element of $\tilde{e}$ , $d_{i}$ is the ith element of $D$ , and $s_{i}$ is the ith row of $S$ :

${\tilde{C}}_{α} (τ + 1) = \mathrm{diag} (G_{ο} ({\tilde{e}}_{1} (τ + 1)), \dots, G_{ο} ({\tilde{e}}_{n_{A}} (τ + 1)))$

(37)

${\tilde{C}}_{Γ} (τ + 1) = \mathrm{diag} (G_{ο} ({\tilde{e}}_{n_{A} + 1} (τ + 1)), \dots, G_{ο} ({\tilde{e}}_{n_{A} + d} (τ + 1)))$

(38)

where $n_{A}$ denotes the dimension of $A (τ + 1)$ and $d$ the dimension of $Γ (τ + 1)$ ;

$\tilde{H} (τ + 1) = Λ_{Γ} (τ + 1) {\tilde{C}}_{Γ}^{- 1} (τ + 1) Λ_{Γ}^{T} (τ + 1)$

(39)

$\tilde{λ} (τ + 1 ∣ τ) = Λ_{α} (τ + 1 ∣ τ) {\tilde{C}}_{α}^{- 1} (τ + 1) Λ_{α}^{T} (τ + 1 ∣ τ)$

(40)

$\tilde{Κ} (τ + 1) = \tilde{λ} (τ + 1 ∣ τ) χ^{T} {(χ \tilde{λ} (τ + 1 ∣ τ) χ^{T} + \tilde{H} (τ + 1))}^{- 1}$

(41)

$\hat{A} {(τ + 1 ∣ τ + 1)}_{t} = \hat{A} (τ + 1 ∣ τ) + \tilde{Κ} (τ + 1) (Γ (τ + 1) - χ (τ + 1) \hat{A} (τ + 1 ∣ τ))$

(42)

(6) Compare the estimate of the current iteration step with that of the previous iteration. If

$\frac{‖ \hat{A} {(τ + 1 ∣ τ + 1)}_{t} - \hat{A} {(τ + 1 ∣ τ + 1)}_{t - 1} ‖}{‖ \hat{A} {(τ + 1 ∣ τ + 1)}_{t - 1} ‖} \leq ε$

(43)

is satisfied, then set $\hat{A} (τ + 1 ∣ τ + 1) = \hat{A} {(τ + 1 ∣ τ + 1)}_{t}, λ (τ + 1 ∣ τ + 1) = \tilde{λ} (τ + 1 ∣ τ)$ and update the values of the hidden variables; otherwise, set $t = t + 1$ and return to step (5);
(7) Set $τ = τ + 1$ and repeat steps (3)–(6) until the end of filtering.
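The fixed-point iteration of steps (4)–(6) can be sketched in Python as follows. This is a minimal illustration of Equations (36)–(43) under stated assumptions, not the authors' code: the function and argument names are hypothetical, a small floor is applied to the kernel values to avoid division by zero, the stopping test is written in product form so that a zero previous estimate does not cause a division by zero, and the covariance update is omitted.

```python
import numpy as np

def mc_fixed_point_update(A_pred, lam_pred, chi, eta, Gamma,
                          bandwidth=5.0, eps=1e-6, max_iter=50):
    """Maximum-correntropy measurement update by fixed-point iteration."""
    n, d = A_pred.size, Gamma.size
    Lam_a = np.linalg.cholesky(lam_pred)          # factor of the predicted covariance
    Lam_g = np.linalg.cholesky(eta)               # factor of the measurement covariance
    Lam = np.block([[Lam_a, np.zeros((n, d))],
                    [np.zeros((d, n)), Lam_g]])
    Lam_inv = np.linalg.inv(Lam)
    D = Lam_inv @ np.concatenate([A_pred, Gamma])         # whitened "measurement"
    S = Lam_inv @ np.vstack([np.eye(n), chi])             # whitened regressor
    A_prev = A_pred.copy()
    for _ in range(max_iter):
        e = D - S @ A_prev                                            # Equation (36)
        g = np.exp(-e ** 2 / (2.0 * bandwidth ** 2))
        g = np.maximum(g, np.finfo(float).tiny)           # assumed numerical guard
        lam_t = Lam_a @ np.diag(1.0 / g[:n]) @ Lam_a.T                # Equations (37), (40)
        H_t = Lam_g @ np.diag(1.0 / g[n:]) @ Lam_g.T                  # Equations (38), (39)
        K = lam_t @ chi.T @ np.linalg.inv(chi @ lam_t @ chi.T + H_t)  # Equation (41)
        A_new = A_pred + K @ (Gamma - chi @ A_pred)                   # Equation (42)
        converged = np.linalg.norm(A_new - A_prev) <= eps * np.linalg.norm(A_prev)  # (43)
        A_prev = A_new
        if converged:
            break
    return A_prev, K

one = np.eye(1)
# With a very large bandwidth, the update approaches the ordinary Kalman update.
A_wide, _ = mc_fixed_point_update(np.array([0.0]), one, one, one,
                                  np.array([2.0]), bandwidth=1e6)
# With a small bandwidth, an outlying measurement is strongly down-weighted.
A_robust, _ = mc_fixed_point_update(np.array([0.0]), one, one, one,
                                    np.array([20.0]), bandwidth=2.0)
```

The two calls show the role of the kernel bandwidth: a wide kernel reproduces the Kalman gain of 0.5 in this scalar setting, whereas a narrow kernel assigns the distant measurement a very large effective noise covariance and leaves the prediction almost unchanged.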

6. Simulated Cases

This section verifies the validity of the proposed method by providing two cases: one in which the state equation is a nonlinear equation and the measurement equation is a linear equation, and one in which the state and measurement equations are both nonlinear.

6.1. Case 1

Consider a nonlinear system in which the state equation is a nonlinear model and the measurement equation is a linear model:

\begin{array}{l} \left\{ \begin{array}{l} x_{1} (k + 1) = (0.8 - 0.5 e^{- x_{1}^{2} (k) (1 + e^{- 0.015 k})}) x_{1} (k) - (0.3 + 0.9 e^{- x_{1}^{2} (k) (1 + 0.5 \sin (\frac{π}{2} k))}) x_{2} (k) + w_{1} (k) \\ x_{2} (k + 1) = 1.2 (1 - e^{- 0.8 k}) x_{2} (k) + 0.11 x_{1} (k) + \cos (1 + x_{2}^{2} (k)) + e^{- 0.8 k} x_{1}^{4} (k) + w_{2} (k) \end{array} \right. \\ \left\{ \begin{array}{l} y_{1} (k + 1) = x_{1} (k + 1) + v_{1} (k + 1) \\ y_{2} (k + 1) = x_{2} (k + 1) + v_{2} (k + 1) \end{array} \right. \end{array}

where the initial value

x (0)

is a random value of

[0, 1]

, the initial estimation error covariance

P (0 | 0) = 0.1 \times \mathrm{diag} (1, 1)

, and the process noise and measurement noise have the following characteristics:

\begin{matrix} w_{1} (k) \sim 0.9 N (0, 0.01) + 0.1 N (0, 0.2), \quad w_{2} (k) \sim 0.9 N (0, 0.02) + 0.1 N (0, 0.2) \\ v_{1} (k) \sim 0.9 N (0, 0.01) + 0.1 N (0, 2), \quad v_{2} (k) \sim 0.9 N (0, 0.02) + 0.1 N (0, 2) \end{matrix}
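Noise of this mixed Gaussian type can be generated as in the following minimal Python sketch. It assumes that the second argument of N(·, ·) is a variance and that each sample is drawn from the wide component with probability 0.1; the function name and the sample size are hypothetical.

```python
import numpy as np

def mixture_noise(rng, size, var_main, var_outlier, p_outlier=0.1):
    """Samples from (1 - p) N(0, var_main) + p N(0, var_outlier)."""
    is_outlier = rng.random(size) < p_outlier
    std = np.where(is_outlier, np.sqrt(var_outlier), np.sqrt(var_main))
    return rng.normal(0.0, 1.0, size) * std

rng = np.random.default_rng(0)
v1 = mixture_noise(rng, 200_000, var_main=0.01, var_outlier=2.0)

sample_var = float(v1.var())
sample_kurtosis = float(np.mean(v1 ** 4) / sample_var ** 2)
```

Under the variance assumption, the theoretical variance of this measurement noise is 0.9 × 0.01 + 0.1 × 2 = 0.209, and its kurtosis is far above the Gaussian value of 3, which reflects the heavy tails that the proposed filter is designed to handle.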

Figure 2 shows a diagram of the MTN identification system, while Figure 3 shows the estimated values of state variables

x_{1}

and

x_{2}

under the three filtering methods. According to [21], the influence of ε on the filtering performance is not significant compared with that of the kernel bandwidth σ. The threshold is therefore set at

ε = 10^{- 6}

. Table 1 and Table 2 show the mean squared error and the mean relative error, respectively, of the estimated values under the three algorithms, which are computed as averages over 100 independent Monte Carlo runs, with each run containing 50 time steps. When

σ = 5

, the three algorithms all obtain better filtering results. Figure 4 and Figure 5 show the probability densities of the estimation errors when estimating the states

x_{1}

and

x_{2}

, respectively, when the parameters are

ε = 10^{- 6}

and

σ = 5

. All of the results confirm that the proposed H-MCKF (the higher order extended Kalman filter based on maximum correlation entropy and a Taylor network system) significantly outperforms the MCEKF (maximum correntropy extended Kalman filter) when the system is disturbed by non-Gaussian process and measurement noise, and that the H-MCKF_R (the H-MCKF with the remainder of the state equation as a hidden variable) further improves the filtering performance of the H-MCKF.

6.2. Case 2

Consider a nonlinear system in which the state equation and the measurement equation are both nonlinear models:

\begin{array}{l} x_{1}(k+1) = \cos\left(0.5 x_{1}(k) + \frac{2.5 x_{2}(k)}{1 + x_{1}^{2}(k) + 8 \cos(1.2 k)}\right) + w_{1}(k) \\ x_{2}(k+1) = \sin(x_{1}^{2}(k)) + w_{2}(k) \\ y_{1}(k) = \cos(x_{1}(k) + \sin(x_{1}^{3}(k))) + v_{1}(k+1) \\ y_{2}(k) = \sin(x_{2}(k) - \sin(x_{2}^{3}(k))) + v_{2}(k+1) \end{array}

where the initial value $x(0)$ is a random value in $[0, 1]$, the initial estimation error covariance is $P(0 | 0) = 0.1 \times \mathrm{diag}(1, 1)$, and the process noise and measurement noise have the following characteristics:

\begin{array}{l} w_{1}(k) \sim 0.9 N(0, 0.01) + 0.1 N(0, 0.2), \quad w_{2}(k) \sim 0.9 N(0, 0.02) + 0.1 N(0, 0.2) \\ v_{1}(k) \sim 0.9 N(0, 0.01) + 0.1 N(0, 2), \quad v_{2}(k) \sim 0.9 N(0, 0.02) + 0.1 N(0, 2) \end{array}
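The Case 2 model, in which both the state and the measurement equations are nonlinear, can be sketched in the same way. As before, this is an illustrative sketch under assumptions: the second argument of $N(\cdot, \cdot)$ is read as a variance, $x(0)$ is drawn uniformly from $[0, 1]$, and the function names are hypothetical. Because the measurement noise samples are independent, the time index attached to $v$ does not affect the simulation.

```python
import numpy as np

def mixture_noise(rng, var_nominal, var_outlier, p_outlier=0.1):
    """One sample from (1 - p) N(0, var_nominal) + p N(0, var_outlier)."""
    var = var_outlier if rng.random() < p_outlier else var_nominal
    return np.sqrt(var) * rng.standard_normal()

def f_case2(x, k):
    """Noise-free state transition of Case 2."""
    x1, x2 = x
    denom = 1.0 + x1**2 + 8.0 * np.cos(1.2 * k)
    return np.array([np.cos(0.5 * x1 + 2.5 * x2 / denom), np.sin(x1**2)])

def h_case2(x):
    """Noise-free measurement function of Case 2."""
    x1, x2 = x
    return np.array([np.cos(x1 + np.sin(x1**3)),
                     np.sin(x2 - np.sin(x2**3))])

def simulate_case2(n_steps=50, seed=0):
    rng = np.random.default_rng(seed)
    x = np.zeros((n_steps + 1, 2))
    y = np.zeros((n_steps + 1, 2))
    x[0] = rng.uniform(0.0, 1.0, size=2)   # assumption: uniform on [0, 1]
    for k in range(n_steps + 1):
        v = np.array([mixture_noise(rng, 0.01, 2.0),
                      mixture_noise(rng, 0.02, 2.0)])
        y[k] = h_case2(x[k]) + v
        if k < n_steps:
            w = np.array([mixture_noise(rng, 0.01, 0.2),
                          mixture_noise(rng, 0.02, 0.2)])
            x[k + 1] = f_case2(x[k], k) + w
    return x, y

x, y = simulate_case2()
print(x.shape, y.shape)
```

Both noise-free measurement components are bounded in $[-1, 1]$, while the outlier component of the measurement noise has variance 2, so an outlier can be much larger than the signal it corrupts.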

Figure 6 shows a diagram of the MTN identification system, while Figure 7 shows the estimated values of the state variables $x_{1}$ and $x_{2}$ under the three filtering methods. As in case 1, the parameter is fixed at $ε = 10^{-6}$. Table 3 and Table 4 show the mean squared error (MSE) and the mean relative error (MRE), respectively, of the estimated values under the three algorithms, computed as averages over 100 independent Monte Carlo runs of 50 time steps each. When $σ = 5$, the three algorithms all obtain good filtering results. Figure 8 and Figure 9 show the probability densities of the estimation errors for the states $x_{1}$ and $x_{2}$, respectively, with $ε = 10^{-6}$ and $σ = 5$. All of the results confirm that the proposed H-MCKF can significantly outperform the MCEKF when the system is disturbed by non-Gaussian process and measurement noise, and that the H-MCKF_R further improves the filtering performance of the H-MCKF when the state and measurement equations are both nonlinear.
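The Monte Carlo averaging behind the MSE and MRE tables can be sketched as follows. This section does not restate the exact MRE normalization, so the sketch assumes MRE is the mean of $|x - \hat{x}| / |x|$ over runs and time steps, with a small floor to avoid division by zero. The function name `monte_carlo_errors`, the `floor` parameter, and the placeholder data are hypothetical.

```python
import numpy as np

def monte_carlo_errors(x_true, x_est, floor=1e-12):
    """Average MSE and MRE per state over runs and time steps.

    x_true, x_est: arrays of shape (n_runs, n_steps, n_states).
    Assumption: MRE = mean(|x_est - x_true| / |x_true|).
    """
    err = x_est - x_true
    mse = np.mean(err**2, axis=(0, 1))
    mre = np.mean(np.abs(err) / np.maximum(np.abs(x_true), floor), axis=(0, 1))
    return mse, mre

# Placeholder data with the dimensions used in the text
# (100 runs, 50 steps, 2 states); these are not the paper's results.
rng = np.random.default_rng(0)
x_true = 1.0 + rng.random((100, 50, 2))
x_est = x_true + 0.1 * rng.standard_normal(x_true.shape)

mse, mre = monte_carlo_errors(x_true, x_est)
print("MSE per state:", mse)
print("MRE per state:", mre)
```

In an actual comparison, `x_est` would be replaced by the output of each filter on the same simulated trajectories, so that all three methods are scored against identical noise realizations.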

7. Conclusions

This paper considered a broad class of filter design problems for the state estimation of multivariable dynamic systems that consist of a strongly nonlinear dynamic model and a strongly nonlinear observation model. Firstly, we transformed these strongly nonlinear models into higher order polynomial series using a multidimensional Taylor network. Secondly, all higher order terms in the polynomial series were defined as hidden variables, and the series were rewritten in their pseudolinear equivalent forms. Thirdly, dynamic relationships between the hidden variables and the known variables were constructed using the multidimensional Taylor network. Combining the pseudolinearized original model with the dynamic model of the higher order hidden variables yielded linear dynamic models suited to a standard Kalman filter. Finally, considering that only a finite number of samples of the modeling error can be obtained, we built the higher order extended Kalman filter based on maximum correlation entropy and obtained better filtering performance than that offered by the existing MCEKF [22].
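As a toy illustration of the hidden-variable idea summarized above, consider a scalar model with a single quadratic term. This example is not one of the paper's cases; the coefficients $a$, $b$, $c_{1}$, $c_{2}$ and the error term $\xi$ are hypothetical symbols introduced only for illustration.

```latex
% Toy scalar model with one higher order term
x(k+1) = a\,x(k) + b\,x^{2}(k) + w(k)

% Define the hidden variable \alpha(k) := x^{2}(k); the model becomes pseudolinear
x(k+1) = a\,x(k) + b\,\alpha(k) + w(k)

% Suppose identification yields an approximate linear model for the hidden variable,
% with modeling error \xi(k)
\alpha(k+1) \approx c_{1}\,x(k) + c_{2}\,\alpha(k) + \xi(k)

% Stacking the original and hidden variables gives an extended linear state model
\begin{bmatrix} x(k+1) \\ \alpha(k+1) \end{bmatrix}
\approx
\begin{bmatrix} a & b \\ c_{1} & c_{2} \end{bmatrix}
\begin{bmatrix} x(k) \\ \alpha(k) \end{bmatrix}
+
\begin{bmatrix} w(k) \\ \xi(k) \end{bmatrix}
```

The modeling error $\xi(k)$ generally has unknown, non-Gaussian statistics, which is the motivation for replacing the least-squares criterion with a correntropy-based one in the extended filter.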

Outlook: Several challenges are worthy of further research. Firstly, the proposed higher order extended Kalman filter based on maximum correlation entropy obtains each state estimate through an online iteration process and, as such, loses one important property of the standard Kalman filter: the ability to operate in real time. Secondly, the parameters of the linearized model of the original nonlinear model and of the hidden variable dynamic model were identified from data in a local time period; thus, they need to be updated with data from new time periods in order to track the time-varying dynamics of the system. Thirdly, in this paper, after defining all of the hidden variables, we established a linear form of the strongly nonlinear model in an expanded state comprising the original variables and all hidden variables, and obtained better estimation performance than that of a standard EKF. If the measurements can be expanded in the same manner as the state, we believe that such a filter may offer better estimation performance than the one established in this paper.

Author Contributions

Conceptualization, Q.W., X.S. and C.W.; methodology, C.W.; software, Q.W.; writing – original draft preparation, Q.W.; writing – review and editing, Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by: [01] the key project of the National Natural Science Foundation of China (No. 61751304), extraction and deep learning of optimal decision rules in an uncertain small sample environment; [02] the key project of the National Natural Science Foundation of China and Zhejiang joint fund for the integration of industrialization and informatization (No. U1509203), life cycle fault prediction and intelligent health management of large ship power system operation; [03] the key project of the National Natural Science Foundation of China (No. 61933013), intelligent diagnosis, prediction and maintenance of abnormal working conditions of large petrochemical units.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Kalman, R.E. A new approach to linear filtering and prediction problems. Trans. ASME Ser. D J. Basic Eng. 1960, 82, 35–45.
2. Wen, C.; Wang, Z.; Liu, Q.; Alsaadi, F.E. Recursive distributed filtering for a class of state-saturated systems with fading measurements and quantization effects. IEEE Trans. Syst. Man Cybern. Syst. 2016, 48, 930–941.
3. Wen, C.; Wang, Z.; Hu, J.; Liu, Q.; Alsaadi, F.E. Recursive filtering for state-saturated systems with randomly occurring nonlinearities and missing measurements. Int. J. Robust Nonlinear Control 2018, 28, 1715–1727.
4. Ge, Q.; Shao, T.; Duan, Z.; Wen, C. Performance Analysis of the Kalman Filter with Mismatched Noise Covariances. IEEE Trans. Autom. Control 2016, 61, 4014–4019.
5. Wen, C.; Cheng, X.; Xu, D.; Wen, C. Filter design based on characteristic functions for one class of multi-dimensional nonlinear non-Gaussian systems. Automatica 2017, 82, 171–180.
6. Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 2002, 50, 174–188.
7. Feng, X.; Wen, C.; Park, J.H. Sequential fusion $H_{∞}$ filtering for multi-rate multi-sensor time-varying systems – A Krein-space approach. IET Control Theory Appl. 2017, 11, 369–381.
8. Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum correntropy Kalman filter. Automatica 2017, 76, 70–77.
9. Liu, X.; Qu, H.; Zhao, J.; Chen, B. Extended Kalman filter under maximum correntropy criterion. In Proceedings of the 2016 International Joint Conference on Neural Networks, Vancouver, BC, Canada, 24–29 July 2016; pp. 1733–1737.
10. Wang, G.; Li, N.; Zhang, Y. Maximum correntropy unscented Kalman and information filters for non-Gaussian measurement noise. J. Frankl. Inst. 2017, 354, 8659–8677.
11. Meinhold, R.J.; Singpurwalla, N.D. Robustification of Kalman filter models. J. Am. Stat. Assoc. 1989, 84, 479–486.
12. Wang, L.; Cheng, X.H.; Li, S.X. Gaussian Sum High Order Unscented Kalman Filtering Algorithm. Chin. J. Electron. 2017, 45, 424–430.
13. Zhang, C.; Yan, H.S. Identification of nonlinear time varying system with noise based on multi-dimensional Taylor network with optimal structure. J. Southeast Univ. 2017, 47, 1086–1093.
14. Wen, T.; Ge, Q.; Lyu, X.; Chen, L.; Constantinou, C.; Roberts, C.; Cai, B. A Cost-effective Wireless Network Migration Planning Method Supporting High-security Enabled Railway Data Communication Systems. J. Frankl. Inst. 2021, 358, 131–150.
15. Wen, T.; Wen, C.; Roberts, C.; Cai, B. Distributed Filtering for a Class of Discrete-time Systems Over Wireless Sensor Networks. J. Frankl. Inst. 2020, 357, 3038–3055.
16. Xiaohui, S.; Chenglin, W.; Tao, W. A Novel Step-by-Step High-Order Extended Kalman Filter Design for a Class of Complex Systems with Multiple Basic Multipliers. Chin. J. Electron. 2021, 30, 313–321.
17. Xiaohui, S.; Chenglin, W.; Tao, W. High-Order Extended Kalman Filter Design for a Class of Complex Dynamic Systems with Polynomial Nonlinearities. Chin. J. Electron. 2021, 30, 508–515.
18. Feng, X.; You, B. Random attractors for the two-dimensional stochastic g-Navier-Stokes equations. Stochastics 2019, 92, 613–626.
19. Liu, W.; Chi, Y.; Zhang, G. Multiple Resolvable Group Estimation Based on the GLMB Filter with Graph Structure. In Proceedings of the 2018 IEEE 8th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER), Tianjin, China, 19–23 July 2018; pp. 960–964.
20. Wu, Z.; Shi, J.; Zhang, X.; Ma, W.; Chen, B. Kernel recursive maximum correntropy. Signal Process. 2015, 117, 11–16.
21. Anderson, B.; Moore, J. Optimal Filtering; Prentice-Hall: New York, NY, USA, 1979.
22. Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control 2000, 45, 477–482.

Figure 2. Graph of the MTN identification system in case 1.

Figure 3. Estimated states in case 1 with (a,b) $σ = 5$, $ε = 10^{-6}$ and (c,d) $σ = 10$, $ε = 10^{-6}$.

Figure 4. Probability densities of the $x_{1}$ estimation errors with the three filters in case 1.

Figure 5. Probability densities of the $x_{2}$ estimation errors with the three filters in case 1.

Figure 6. (a) Graph of the MTN identification of the state equation in case 2. (b) Graph of the MTN identification of the measurement equation in case 2.

Figure 7. Estimated states in case 2 with (a,b) $σ = 5$, $ε = 10^{-6}$ and (c,d) $σ = 10$, $ε = 10^{-6}$.

Figure 8. Probability densities of the $x_{1}$ estimation errors with the three filters in case 2.

Figure 9. Probability densities of the $x_{2}$ estimation errors with the three filters in case 2.

Table 1. The mean squared error using the three methods in case 1.

		MSE of $x_{1}$			MSE of $x_{2}$
$σ$	$ε$	MCEKF	H-MCKF	H-MCKF_R	MCEKF	H-MCKF	H-MCKF_R
$σ = 2$	$ε = 10^{- 6}$	0.2073	0.1815	0.1694	0.1000	0.0774	0.0738
$σ = 5$	$ε = 10^{- 6}$	0.1974	0.1244	0.1225	0.1100	0.0964	0.0921
$σ = 10$	$ε = 10^{- 6}$	0.2282	0.1669	0.1636	0.1158	0.0925	0.0888
$σ = 20$	$ε = 10^{- 6}$	0.2244	0.1602	0.1572	0.1160	0.0916	0.0880

Table 2. The mean relative error using the three methods in case 1.

		MRE of $x_{1}$			MRE of $x_{2}$
$σ$	$ε$	MCEKF	H-MCKF	H-MCKF_R	MCEKF	H-MCKF	H-MCKF_R
$σ = 2$	$ε = 10^{- 6}$	0.3372	0.2405	0.2403	0.2354	0.2202	0.2084
$σ = 5$	$ε = 10^{- 6}$	0.3462	0.2953	0.2906	0.2679	0.2485	0.2448
$σ = 10$	$ε = 10^{- 6}$	0.3658	0.3052	0.2986	0.2745	0.2469	0.2426
$σ = 20$	$ε = 10^{- 6}$	0.3634	0.3009	0.2945	0.2753	0.2466	0.2419

Table 3. The mean squared error using the three methods in case 2.

		MSE of $x_{1}$			MSE of $x_{2}$
$σ$	$ε$	MCEKF	H-MCKF	H-MCKF_R	MCEKF	H-MCKF	H-MCKF_R
$σ = 2$	$ε = 10^{- 6}$	0.4017	0.1230	0.1219	0.2090	0.0907	0.0883
$σ = 5$	$ε = 10^{- 6}$	0.1148	0.1241	0.1233	0.2542	0.1200	0.1183
$σ = 10$	$ε = 10^{- 6}$	0.3221	0.1254	0.1248	0.2220	0.1207	0.1193
$σ = 20$	$ε = 10^{- 6}$	0.4040	0.1257	0.1251	0.2218	0.1208	0.1196
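The MSE values for $x_{1}$ in Table 1 and Table 3 can be converted into relative reductions with respect to the MCEKF baseline, which makes the size of the improvement easier to read. The sketch below only re-expresses the tabulated numbers; the helper name `reduction_percent` is hypothetical.

```python
# MSE of x1 copied from Table 1 (case 1) and Table 3 (case 2):
# sigma -> (MCEKF, H-MCKF, H-MCKF_R)
MSE_X1 = {
    "case 1": {2: (0.2073, 0.1815, 0.1694), 5: (0.1974, 0.1244, 0.1225),
               10: (0.2282, 0.1669, 0.1636), 20: (0.2244, 0.1602, 0.1572)},
    "case 2": {2: (0.4017, 0.1230, 0.1219), 5: (0.1148, 0.1241, 0.1233),
               10: (0.3221, 0.1254, 0.1248), 20: (0.4040, 0.1257, 0.1251)},
}

def reduction_percent(baseline, value):
    """Relative reduction of `value` with respect to `baseline`, in percent."""
    return 100.0 * (baseline - value) / baseline

for case, rows in MSE_X1.items():
    for sigma, (mcekf, h_mckf, h_mckf_r) in rows.items():
        print(f"{case}, sigma={sigma:>2}: "
              f"H-MCKF {reduction_percent(mcekf, h_mckf):6.1f}%, "
              f"H-MCKF_R {reduction_percent(mcekf, h_mckf_r):6.1f}%")
```

Taken at face value, the tabulated numbers give a reduction of roughly 37% for the H-MCKF in case 1 at $σ = 5$. The case 2 row at $σ = 5$ is the one entry where the reduction is negative: the reported MCEKF MSE for $x_{1}$ (0.1148) is about 8% lower than that of the H-MCKF (0.1241).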

Table 4. The mean relative error using the three methods in case 2.

		MRE of $x_{1}$			MRE of $x_{2}$
$σ$	$ε$	MCEKF	H-MCKF	H-MCKF_R	MCEKF	H-MCKF	H-MCKF_R
$σ = 2$	$ε = 10^{- 6}$	0.5106	0.2337	0.2306	0.3742	0.2355	0.2316
$σ = 5$	$ε = 10^{- 6}$	0.2147	0.2551	0.2530	0.3070	0.2652	0.2645
$σ = 10$	$ε = 10^{- 6}$	0.4527	0.2570	0.2553	0.3824	0.2661	0.2659
$σ = 20$	$ε = 10^{- 6}$	0.4764	0.2575	0.2558	0.3789	0.2663	0.2662

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

