Abstract
This paper proposes one new design method for a higher order extended Kalman filter based on combining maximum correlation entropy with a Taylor network system to create a nonlinear random dynamic system with modeling errors and unknown statistical properties. Firstly, the transfer function and measurement function are transformed into a nonlinear random dynamic model with a polynomial form via system identification through the multidimensional Taylor network. Secondly, the higher order polynomials in the transformed state model and measurement model are defined as implicit variables of the system. At the same time, the state model and the measurement model are equivalent to the pseudolinear model based on the combination of the original variable and the hidden variable. Thirdly, higher order hidden variables are treated as additive parameters of the system; then, we establish an extended dimensional linear state model and a measurement model combining state and parameters via the previously used random dynamic model. Finally, as we only know the results of the limited sampling of the random modeling error, we use the combination of the maximum correlation estimator and the Kalman filter to establish a new higher order extended Kalman filter. The effectiveness of the new filter is verified by digital simulation.
1. Introduction
The application of filters occupies an important position in various fields at the national and international levels. The progress and development of filters play important roles in national economic construction—especially national defense construction—such as in real-time estimation and target tracking. In 1960, Kalman proposed a method of filtering under the minimum mean squared error criterion for linear systems, and it soon began to be widely used [1]. In order to solve nonlinear problems, extended Kalman filters (EKFs) [2], unscented Kalman filters (UKFs) [3], and cubature Kalman filters (CKFs) have since emerged. However, the above-mentioned filtering methods require the modeling error to be Gaussian white noise. As such, their performances are likely to worsen when applied to non-Gaussian situations, especially when the systems are disturbed by impulsive noise. Impulsive noise arises from heavy-tailed distributions [4] (such as some mixed Gaussian distributions), and is common in many real scenarios of automatic control and target tracking (for instance, the measurement noise in the radar system is often not Gaussian noise, but heavy-tailed non-Gaussian noise [5]). In 1993, Gordon and Salmond proposed particle filtering when the density function is known [6]; this achieves an approximation of the distribution function by sampling a large number of particles therein. However, this method is very complicated; it requires a large number of particles, and it will cause particle degradation after re-sampling. In general, the density function is difficult to obtain. For this reason, for the linear system, Chen designed the corresponding Kalman filter under the maximum correlation entropy criterion based on the limited realization of random variables [7]; this is called the maximum correlation entropy Kalman filter (MCKF) [8]. On this basis, the maximum correntropy extended Kalman filter (MCEKF) and the maximum correntropy unscented Kalman filter (MCUKF), which can solve nonlinear non-Gaussian systems, have since emerged [9]. However, in MCEKFs, all higher order terms in the Taylor expansion are discarded. Therefore, a large truncation error will be generated, and the filtering performance will decrease or even diverge as the nonlinearity of the system increases. In addition, each step of the state estimation needs to recalculate the Taylor expansion coefficient, which will undoubtedly increase the complexity of the calculation. MCUKFs use UT transformation and sigma point sampling [10]; this is called deterministic sampling. There is only one sampling point for a dimensional system. Neither low-dimensional nor high-dimensional systems have a strong claim to superiority. A large number of experiments have shown that both EKFs and UKFs can be approximated by a second-order polynomial at most [11], which will produce a large rounding error. Hence, both will eventually face the problems of degraded filtering performance and divergence as their nonlinearity increases [12].
This project proposes a higher order extended Kalman filter method based on maximum correlation entropy, under the assumption that both state and measurement equations can be modeled and based on a strong nonlinear function. The main contributions of this paper are as follows: (1) using multidimensional Taylor nets to convert the general expression of nonlinear functions into higher order polynomials; (2) defining each order of polynomial in the system as hidden variables of the corresponding order, and treating them as time-variable parameters; (3) establishing the dynamic relationship between the time-variable parameters and combining them with the original variables to further establish the expanded dimension state model; (4) based on the expanded linear state variables, equivalently rewriting the measurement model into the corresponding linear form; and (5) according to the established linear state and measurement model of the new extended dimension system, establishing a higher order extended Kalman filter method based on maximum correlation entropy.
The remaining parts of this paper are organized as follows: the first chapter is the preface of our knowledge, which introduces the definition of “entropy”; the Section 2 presents a method for identifying nonlinear functions based on multidimensional Taylor networks; the Section 3 presents a higher order extended Kalman filter method; the Section 4 presents the detailed design process of the maximum correlation entropy higher order extended Kalman filter; the Section 5 concerns simulation verification; and the Section 6 and Section 7 presents a summary and outlook.
2. Description of Correntropy
Correntropy is a generalized similarity measure between two random variables [13]. Given two one-dimensional random variables , their joint distribution function is ; then, the correlation entropy is defined as follows:
where is the expectation operator and is the translation-invariant Mercer kernel. In this article, it is not particularly emphasized that this kernel function is a Gaussian kernel, which is defined as follows:
where , represents the kernel’s bandwidth.
By expanding Equation (2) with a Taylor series, we can obtain the following:
and then the correlation entropy of Equation (1) has the following expression:
where is the truncation statistic of the random variable .
However, in most practical cases, joint distribution is usually unknown, and there are often finite implementations of for random variables. In these cases, the sample mean estimator can be used to estimate the heterogeneity:
Then, the entropy expression of the random variable pair is driven by finite data:
When , and the components of vector are independent of one another, multidimensional correlation entropy is based on N sampling.
3. Non-Linear Model Identification Based on Multidimensional Taylor Networks
Given that the state model and observation model are complex dynamic systems with nonlinear characteristics [14]:
where is an h-dimensional state vector; represents the d-dimensional measure vector; and and represent the state function and the measurement function, respectively. The modeling errors for non-Gaussian systems are and , while and are the process noise variance and the measurement noise variance, respectively.
Lemma 1.
Any continuous function defined in a closed interval can be approximated accurately with a polynomial function [15].
Lemma 2.
For continuous functions, , defined in a closed interval, can be approximated by the following [16]:
where denotes the total number of terms in the expansion and denotes the power of the variable in the product of the ith variable.
3.1. Multidimensional Taylor Network Structure
The multidimensional Taylor network model can replace the traditional neural network with the dynamic model and control the system under certain conditions; it is characterized by a nonlinear autoregressive moving-average model composed of polynomials. The multidimensional Taylor network (MTN) uses a forward single intermediate layer structure, including an input layer, an intermediate layer, and an output layer. Supposing that the input layer comprises n nodes——the output layer is , the middle layer is the network processing layer, and each input variable realizes the weighted summation of each power product term in this layer. The middle layer is composed of various power product terms and the corresponding connection weight vector :
which represents the output weight vector connecting the intermediate layer and the output node of the network.
According to the multivariate Taylor equation, if a function is differentiable to the th order at a certain point, then the function expands to a form where the power series of the variable is not greater than m times. The model can be expressed as a dynamic equation, as follows:
where is a function of nonlinearity described by a multidimensional Taylor network model, represents the weight before the product item of the ith variable, denotes the total number of terms in the expansion, denotes the power of the variable in the product of the ith variable, and is the error—also known as the remainder—produced by the identification of a function by a multidimensional Taylor network.
3.2. Parameter Identification Method Based on Kalman Filtering
Model Establishment of a Kalman Filter
A Kalman filter can be regarded as an optimized autoregressive data processing method that describes the entire system through a state equation and an observation equation.
State equation:
where , .
Observation equation:
It is not difficult to draw from Figure 1:
Thus,
where ; ; represents the system state at -time, that is, the parameter status value of the kth moment; and represents the output value of the network. It is assumed that both process noise and are Gaussian white noise during the analysis, and and , which are the process noise variance and measurement noise variance, respectively. Here, we use a Kalman filter to approximate the dynamic model. As the filtering principle of Kalman filters is mentioned later in this article, please refer to Equations (20)–(24) for the detailed process.
Figure 1.
Model of a multidimensional Taylor network.
3.3. Approximation Analysis
Given a class of nonlinear functions , it can be assumed that it is derivative of the rth order, but r is a relatively large number, making it difficult for us to use Taylor nets to approximate its function. The optimal approach would be to set and use the Taylor network to expand the nonlinear function to the mth order, obtain the result of Equation (16), and simultaneously ensure the higher order error term , where is the acceptable error threshold. This not only makes the Taylor network fitting function process easier, but also ensures the accuracy of the fit.
4. Higher Order Extended Kalman Filter
4.1. Pseudolinearized Representation of Nonlinear Functions
For ease of description and understanding, if , we can expand Equation (7) through a multidimensional Taylor network to the mth order, as follows:
where is the sum of all tensors of the th order and represents the weight corresponding to each order of the tensor.
Definition 1.
is a set of implicit variables of theth order.
Definition 2.
is the weight vector corresponding to the ith order implicit variable.
In [17], there is a detailed pseudolinearization process, so we will not repeat it in this article. In order to make the model more accurate, we treat the remainder of the equation of state as latent variables. According to Definition 1 and Definition 2, the pseudolinear extended dimension form using the remainder as a hidden variable is as follows:
where , , , .
Similarly, Equation (8) can be rewritten as follows:
where , , , .
4.2. Linearized Representation of Nonlinear Functions
In order to transform the pseudolinear model established in Section 3.1 into a true linear form, it is necessary to establish a dynamic relationship between the lth order hidden variables and the uth order hidden variables [18]:
where can be identified based on the multidimensional Taylor network in its original state; without any prior information, it can be set as follows:
Combining Definition 1, Definition 2, and Equation (19), the state model Equation (7) has the following linear matrix form:
If
then, Equation (7) has the following linearized form:
where is the modeling error.
In the same way, the linear matrix form of the measurement model can be obtained:
where , , and is the modeling error.
4.3. Design of Higher Order Extended Kalman Filter
For linear models, KF-based filters are given. Given the initial value , when and are Gaussian white noise with zero mean, the variances are recorded as and , respectively.
A recursive filter can be designed as follows:
5. Higher Order Extended Kalman Filter Design Based on Maximum Correlation Entropy
5.1. Non-Gaussian Modeling of State Vector Based on Multivariate Information Observation
System status estimates and estimated error covariance are obtained based on [19]. The filtering equation predicts a step prediction estimation value and corresponding step prediction error covariance matrix .
The step prediction estimate error of system status is as follows:
and it can be modified into a measurement model for system status as follows:
where is a measurement of system status , while is the measurement error. Finally, the combined measurement model is as follows:
where I is a unit array for the corresponding dimension, , and
According to Equation (20), a step prediction error covariance of the system state is received as follows:
where and the is a covariance matrix of random error vectors when the higher order hidden variable is dynamically modeled. is the original system status model (Equation (16)) of the non-Gaussian model error , and calculates the second-order statistic after obtaining a limited number of samples:
In Equation (23), is the calculated second-order statistic calculated after the non-Gaussian model error in the original system measurement state model (Equation (8)), obtaining a limited sample:
where is the jth realization vector of the non-Gaussian random noise vector .
5.2. The Statistical Independence Process of Each Component in the Non-Gaussian Modeling Error Vector in the Comprehensive Measurement Model
The vector in the comprehensive measurement model Equation (22) is a non-Gaussian modeling error vector, and its components are not statistically independent. In order to use the correlation entropy form of the multidimensional independent vector shown in Equation (19), the one-dimensional non-Gaussian vector needs to be transformed into statistical independence.
From , is a positive definite matrix. Similarly, in Equation (26), is also a positive definite matrix. For this reason, Equation (23) is further expressed as follows:
where and are the Cholesky factor matrices of and , respectively.
Applying to both sides of Equation (22), respectively, yields:
where
The above equation can be further simplified as follows:
because
Therefore, after the non-Gaussian modeling error random variable undergoes the equivalent transformation of the matrix
, the components of the random are statistically independent.
5.3. Implementation Process of a Higher Order Extended Kalman Filter Based on Maximum Entropy
The filtering process of the extended Kalman filter (H-MCEKF) based on the maximum correlation entropy is as follows (see [20] for the specific derivation process):
- The filter initialization obtains the initial filter value and the covariance , choosing a suitable core bandwidth and a small positive number ;
- Taylor networks are used for system identification to obtain the parameters in the equations, using the expanded item and the remainder as the new hidden variables. A pseudolinearization process is performed to obtain the pseudolinear form of the system;
- Equations (20) and (21) are used to obtain and , respectively, while Cholesky decomposition is used to obtain ;
- and are taken, where represents the estimated state of the fixed-point iteration t;
- The starting fixed-point iterative algorithm is as follows:where is the ith element of :The estimates of the current iteration step are compared with those of the previous iteration and, if satisfied,then , and the value of the pseudovariable can be updated, or the iteration can be repeated;
- , and steps (3–5) are repeated until the end of filtering.
6. Simulated Cases
This section verifies the validity of the proposed method by providing two cases: one in which the state equation is a nonlinear equation and the measurement equation is a linear equation, and one in which the state and measurement equations are both nonlinear.
6.1. Case 1
Consider a nonlinear system in which the state equation is a nonlinear model and the measurement equation is a linear model:
where the initial value is a random value of , the initial estimation error covariance , and the process noise and measurement noise have the following characteristics:
Figure 2 shows a diagram of the MTN identification system, while Figure 3 shows the estimated values of state variables and under the three filtering methods. From [21], we know the influence of ε is not significant compared with the kernel bandwidth σ. The parameters are set at . Table 1 and Table 2 show the mean squared error and the mean relative error, respectively, of the estimated values under the three algorithms, which are computed as averages over 100 independent Monte Carlo runs, with each run containing 50 time steps. When , the three algorithms all obtain better filtering results. Figure 4 and Figure 5 show the probability densities of the estimation errors when estimating the states and , respectively, when the parameters are and . All of the results confirm that the proposed H-MCKF (design method for a higher order extended Kalman filter based on maximum correlation entropy and a Taylor network system) can outperform the MCEKF (maximum correntropy extended Kalman filter) significantly when the system is disturbed by non-Gaussian processes and measurement noise, and the H-MCKF_R (H-MCKF with the remainder of the state equation) further improves the filtering performance of the H-MCKF.
Figure 2.
Graph of the MTN identification system in case 1.
Figure 3.
(a,b) Set parameters: in case 1; (c,d) set parameters: in case 1.
Table 1.
The mean squared error using the three methods in case 1.
Table 2.
The mean relative error using the three methods in case 1.
Figure 4.
Probability densities of estimation errors with the three filters in case 1.
Figure 5.
Probability densities of estimation errors with the three filters in case 1.
6.2. Case 2
Consider a nonlinear system in which the state equation and the measurement equation are both nonlinear models:
where the initial value is a random value of [0, 1], the initial estimation error covariance , and the process noise and measurement noise have the following characteristics:
Figure 6 shows a diagram of the MTN identification system, while Figure 7 shows the estimated values of state variables and under the three filtering methods. Similar to case 1, the parameters are set at . Table 3 and Table 4 show the mean squared error and the mean relative error, respectively, of the estimated values under the three algorithms, which are computed as averages over 100 independent Monte Carlo runs, with each run containing 50 time steps. When , the three algorithms all obtain better filtering results. Figure 8 and Figure 9 show the probability densities of the estimation errors when estimating the states and , respectively, when the parameters are and . All of the results confirm that the proposed H-MCKF can outperform the MCKF significantly when the system is disturbed by non-Gaussian processes and measurement noise, and the H-MCKF_R further improves the filtering performance of the H-MCKF when the state and measurement equations are both nonlinear.
Figure 6.
(a) Graph of the MTN identify the state equation in case 2. (b) Graph of the MTN identify the measurement equation in case 2.
Figure 7.
(a,b) Set parameters: in case2; (c,d) set parameters: in case 2.
Table 3.
The mean squared error using the three methods in case 2.
Table 4.
The mean relative error using the three methods in case 2.
Figure 8.
Probability densities of estimation errors with the three filters in case 2.
Figure 9.
Probability densities of estimation errors with the three filters in case 2.
7. Conclusions
This paper considered a wide range of filter design problems for the state estimation of multivariable dynamic systems, which consist of a strong nonlinear dynamic model and a strong nonlinear observation model. Firstly, we transformed those strong nonlinear models into a higher order polynomial series using a multidimensional Taylor network. Secondly, all higher order items in the polynomial series were defined as hidden variables. Those higher order series were then rewritten as their pseudolinear equivalents. Thirdly, dynamic relationships between all hidden variables and known variables were constructed using the multidimensional Taylor network. Combining the original model of pseudolinearization with the higher order hidden variable dynamic model, linear dynamic models fitted to a standard Kalman filter were presented. Finally, considering that a finite number of samples from modeling error can be obtained, we built the higher order extended Kalman filter based on maximum correlation entropy, and acquired better filter performance than offered by the existing MCEKF [22].
Outlook: There exist several challenges worthy of further research. Firstly, the proposed higher order extended Kalman filter based on maximum correlation entropy is an online iteration process that obtains state estimation constantly, but, as such, it loses one important function possessed by the standard Kalman filter: the ability to operate in real time. Secondly, the linearized model parameters of the original nonlinear model and the hidden variable dynamic model were identified by local time period data; thus, they need to be updated with new time period data in order to fit the time dynamics of the system. Thirdly, in this paper, on the basis of defining all of the hidden variables, we established a linear form of the strong nonlinear model in an expanded state with the original variables and all hidden variables, and obtained better estimation performance than that of a standard EKF; if measurements can be expanded in the same manner as state, we believe that such a filter may offer better estimation performance than the one established by this paper.
Author Contributions
Conceptualization, Q.W., X.S. and C.W.; methodology, C.W.; software, Q.W.; writing—original draft preparation, Q.W. writing—review and editing, Q.W. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the institutions: [01] key project of National Natural Science Foundation of China (No. 61751304), extraction and deep learning of optimal decision rules in an uncertain small sample environment; [02] key project of National Natural Science Foundation of China and Zhejiang joint fund for integration of industrialization and industrialization (No. u1509203), life cycle fault prediction, and intelligent health management of large ship power system operation; [03] key project of National Natural Science Foundation of China (No. 61933013): intelligent diagnosis, prediction and maintenance of abnormal working conditions of large petrochemical units.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Kalman, R.E. A new approach to linear filtering and prediction problems. actions of the . Trans. ASME Ser. D J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef] [Green Version]
- Wen, C.; Wang, Z.; Liu, Q.; Alsaadi, F.E. Recursive distributed filtering for a class of state-saturated systems with fading measurements and quantization effects. IEEE Trans. Syst. Man Cybern. Syst. 2016, 48, 930–941. [Google Scholar] [CrossRef]
- Wen, C.; Wang, Z.; Hu, J.; Liu, Q.; Alsaadi, F.E. Rlsaadi. Recursive fifiltering for state-saturated systems with randomly occurring nonlinearities and missing measurements. Int. J. Robust Nonlinear Control. 2018, 28, 1715–1727. [Google Scholar] [CrossRef]
- Ge, Q.; Shao, T.; Duan, Z.; Wen, C. Performance Analysis of the Kalman Filter with Mismatched Noise Covariances. IEEE Trans. Autom. Control. 2016, 61, 4014–4019. [Google Scholar] [CrossRef]
- Wen, C.; Cheng, X.; Xu, D.; Wen, C. Filter design based on characteristic functions for one class of multi-dimensional nonlinear non-Gaussian systems. Automatica 2017, 82, 171–180. [Google Scholar] [CrossRef]
- Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef] [Green Version]
- Feng, X.; Wen, C.; Park, J.H. Sequential fusion ∞ filtering for multi-rate multi-sensor time-varying systems—A Krein-space approach. IET Control Theory Appl. 2017, 11, 369–381. [Google Scholar] [CrossRef]
- Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum correntropy Kalman filter. Automatica 2017, 76, 70–77. [Google Scholar] [CrossRef] [Green Version]
- Liu, X.; Qu, H.; Zhao, J.; Chen, B. Extended Kalman filter under maximum correntropy criterion. In Proceedings of the 2016 International Joint Conference on Neural Networks, Vancouver, BC, Canada, 24–29 July 2016; pp. 1733–1737. [Google Scholar]
- Wang, G.; Li, N.; Zhang, Y. Maximum correntropy unscented Kalman and information filters for non-Gaussian measurement noise. J. Frankl. Inst. 2017, 354, 8659–8677. [Google Scholar] [CrossRef]
- Meinhold, R.J.; Singpurwalla, N.D. Robustification of Kalman filter models. J. Am. Stat. Assoc. 1989, 84, 479–486. [Google Scholar] [CrossRef]
- Wang, L.; Cheng, X.H.; Li, S.X. Gaussian Sum High Order Unscented Kalman Filtering Algorithm. Chin. J. Electron. 2017, 45, 424–430. [Google Scholar]
- Zhang, C.; Yan, H.S. Identification of nonlinear time varying system with noise based on multi-dimensional Taylor network with optimal structure. J. Southeast Univ. 2017, 47, 1086–1093. [Google Scholar]
- Wen, T.; Ge, Q.; Lyu, X.; Chen, L.; Constantinou, C.; Roberts, C.; Cai, B. A Cost-effective Wireless Network Migration Planning Method Supporting High-security Enabled Railway Data Communication Systems. J. Frankl. Inst. 2021, 358, 131–150. [Google Scholar] [CrossRef]
- Wen, T.; Wen, C.; Roberts, C.; Cai, B. Distributed Filtering for a Class of Discrete-time Systems Over Wireless Sensor Networks. J. Frankl. Inst. 2020, 357, 3038–3055. [Google Scholar] [CrossRef]
- Xiaohui, S.; Chenglin, W.; Tao, W. A Novel Step-by-Step High-Order Extended Kalman Filter Design for a Class of Complex Systems with Multiple Basic Multipliers. Chin. J. Electron. 2021, 30, 313–321. [Google Scholar] [CrossRef]
- Xiaohui, S.; Chenglin, W.; Tao, W. High-Order Extended Kalman Filter Design for a Class of Complex Dynamic Systems with Polynomial Nonlinearities. Chin. J. Electron. 2021, 30, 508–515. [Google Scholar] [CrossRef]
- Feng, X.; You, B. Random attractors for the two-dimensional stochastic g-Navier-Stokes equations. Stochastics 2019, 92, 613–626. [Google Scholar] [CrossRef]
- Liu, W.; Chi, Y.; Zhang, G. Multiple Resolvable Group Estimation Based on the GLMB Filter with Graph Structure. In Proceedings of the 2018 IEEE 8th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER), Tianjin, China, 19–23 July 2018; pp. 960–964. [Google Scholar]
- Wu, Z.; Shi, J.; Zhang, X.; Ma, W.; Chen, B. Kernel recursive maximum correntropy. Signal Process. 2015, 117, 11–16. [Google Scholar] [CrossRef]
- Anderson, B.; Moore, J. Optimal Filtering; Prentice-Hall: New York, NY, USA, 1979. [Google Scholar]
- Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control. 2000, 45, 477–482. [Google Scholar] [CrossRef] [Green Version]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).