Open Access
This article is
 freely available
 reusable
Mathematics 2019, 7(12), 1168; https://doi.org/10.3390/math7121168
Article
Ellipsoidal and Gaussian Kalman Filter Model for DiscreteTime Nonlinear Systems
^{1}
Geodätisches Institut Hannover, Leibniz Universität Hannover, 30167 Hannover, Germany
^{2}
Institut für Geoinformation und Vermessung Dessau, Hochschule Anhalt, 06846 Dessau, Germany
^{3}
Department of Computer Science, University of Texas at El Paso, El Paso, TX 79968, USA
^{*}
Author to whom correspondence should be addressed.
Received: 9 September 2019 / Accepted: 20 November 2019 / Published: 3 December 2019
Abstract
:In this paper, we propose a new technique—called Ellipsoidal and Gaussian Kalman filter—for state estimation of discretetime nonlinear systems in situations when for some parts of uncertainty, we know the probability distributions, while for other parts of uncertainty, we only know the bounds (but we do not know the corresponding probabilities). Similarly to the usual Kalman filter, our algorithm is iterative: on each iteration, we first predict the state at the next moment of time, and then we use measurement results to correct the corresponding estimates. On each correction step, we solve a convex optimization problem to find the optimal estimate for the system’s state (and the optimal ellipsoid for describing the systems’s uncertainty). Testing our algorithm on several highly nonlinear problems has shown that the new algorithm performs the extended Kalman filter technique better—the state estimation technique usually applied to such nonlinear problems.
Keywords:
Ellipsoidal and Gaussian Kalman filter; state estimation; unknown but bounded uncertainty; nonlinear programming; convex optimization1. Introduction
State estimation is important for virtually all areas of engineering and science. Every discipline which uses the mathematical modeling of its systems needs state estimation; this includes electrical engineering, mechanical engineering, chemical engineering, aerospace engineering, robotics, dynamical systems’ control and many others.
We estimate the system’s state based on the measurement results. In this estimation, we need to take into account that measurements are never absolutely accurate, the measurement results contain inaccuracy (“noise”)—e.g., due to inevitable imperfection of the measuring instruments. Also, our understanding of the system’s dynamics is also usually approximate. In addition to the internal factors (which are described by the system’s state) and the known external factors, there are usually also many other factors that affect the system—and which, from the viewpoint of the model, constitute the noise. The presence of noise affects our ability to predict the system’s behavior and to control the system. It is therefore desirable to minimize the effect of the noise—in particular, to minimize the effect of noise on the state estimation. In engineering, traditionally, techniques for decreasing the effect of noise—i.e., for separating (“filtering”) signal from noise—are known as filtering; because of this, the state estimation problem can be viewed as an important particular case of filtering.
To formulate the state estimation problem in precise terms, we need to have a mathematical model of the actual system, and we need to have a model describing the system’s and measurement uncertainties (noise). To select the best filtering technique, we also need to select a numerical measure for describing the remaining inaccuracy of state estimation. Once this measure is selected, we need, given the measurement results, to find the state estimates for which the selected measure of inaccuracy attains the smallest possible value; see, e.g., [1]. In other words, from the mathematical viewpoint, estimating a state means solving the corresponding optimization problem.
The quality of the resulting state estimates depends on how well our models— i.e., our assumptions about the system and about the noise—describe the actual system and the actual noise. The most widely used state estimation techniques are stochastic techniques, i.e., techniques based on the assumption that we know the probability distribution of the noise (or, more generally, that we have some information about this distribution). In most techniques, it is assumed that the noise is Gaussian, but other distributions have also been considered. Stochastic techniques have been actively developed for dozens (if not hundreds) of years.
The most widely used stochastic state estimation techniques are the Kalman Filter (KF) techniques [2] and the Extended Kalman Filter (EKF) techniques [3,4]. In most practical applications, the traditional Kalman filters are used, which are based on the assumption that the system is linear (and that the noise is Gaussian). In practice, however, many reallife system are nonlinear (and the actual noise distribution is sometimes nonGaussian).
Because of the ubiquity of nonlinear systems, several filtering techniques have been developed for the nonlinear case. Many applications use the finite sum approximation techniques such as the Gaussian sum filter (see, e.g., [5]) or linearization techniques such as EKF (see, e.g., [6]). These techniques work well for many nonlinear systems. Their main limitation is that they assume that both the system noise and the measurement noise are normally distributed. As a result, sometimes, when the actual distributions are nonGaussian—e.g., when they are multimodal—these techniques do not perform well.
Another widely used filtering techniques applicable to nonlinear systems is the Unscented Kalman Filter (UKF), first proposed in [7]. The UKF technique is based on using a special nonlinear transformation of the data—called Unscented Tranform (UT)—that, crudely speaking. transforms the actual probability distributions into simpler ones. The main advantage of UKF is that, in contrast to EKF, UKF techniques do not assume that the required nonlinear transformations are smooth (differentiable). As a result, UKF does not involve timeconsuming computation of the corresponding derivatives—i.e., Jacobian and Hessian matrices. On the other hand, because of their more general nature, UKF algorithms are more complex and, in general, require more computation time.
Yet another widely filtering technique for nonlinear (and nonGaussian, and nonstationary) situations is the Particle Filter (PF) technique [8]. The main idea of this technique is that we simulate each probability distribution by selecting several sample points (called particles). We start with selecting several states distributed according to the known prior distribution of the system’s state. To each of the selected states, we apply the system’s dynamics—simulating the corresponding system’s noise and measurement noise. The resulting sample of states (and sample of expected measurement results) represent the distributions corresponding to the next moment of time. We can then use the actual observation results obtained at this moment of time to update the distributions. This method works very well in many practical applications. For example, in geodesy, the PF techniques have been successfully used to accurately determine the trajectory on a moving vehicle based on Laserscannerbased multisensor measurements [9,10].
While these techniques work well in many practical situations, they all have a common limitation: they assume that we have a reasonably detailed information about the corresponding probability distributions. In practice, often, we do not have that information. Sometimes, some approximate distributions are provided, but the actual frequencies of different noise values are very different from what these approximate distributions predict. This is a frequent situations, e.g., for measuring instruments, when the manufacturer provides us only with an upper bound on the systematic error component (or even on the overall measurement error) without providing any information on the probabilities of different values within the given bounds; see, e.g., [11].
Such situations are known as situations with Unknown But Bounded uncertainty (UBB). Techniques for state estimation under such uncertainty have been developed since the 1960s; see, e.g., [12,13,14].
In the case of UBB uncertainty, for each noise component, once we know the upper bound $\Delta $ on its absolute value, the only information that we have about the actual noise value is that this value is located in the interval $[\Delta ,\Delta ]$; we do not know the probabilities of different values from this interval. Once we know the bounds ${\Delta}_{i}$ on each noise value ${n}_{i}$ ($1\le i\le k$), we can therefore conclude that the tuple $n=({n}_{1},\dots ,{n}_{k})$ formed by these noise values belongs to the box
methods for processing such information are given, e.g., in [15,16,17].
$$[{\Delta}_{1},{\Delta}_{1}]\times \dots \times [{\Delta}_{n},{\Delta}_{n}],$$
We may also have some additional information about the relation between different noise values—e.g., the upper bound on the difference between noise values at two consequent moments of time. As a result, in addition to the interval containing each noise value, we may know a bounded close (hence compact) set containing the tuples of possible noise values.
There exist several techniques for dealing with such uncertainty. The most accurate techniques are the ones that use generic polytopes [18,19]—since by using polytopes, we can approximate any reasonable compact set with any given accuracy. However, to get an accurate approximation, we need to use a large number of parameters—especially in multidimensional case. As a result, in practice, these general methods are only efficient in lowdimensional situations.
Somewhat faster techniques emerge when we limit ourselves to a special class of polytopes called zonotopes; see, e.g., [1,20,21,22,23]. Mathematically, zonotopes are defined as Minkowski sums of intervals (for those who are not familiar with this notion, the definition of Minkowski sum is given in the next section). These methods are more efficient, but still mostly limited to lowdimensional cases.
The only efficient general techniques available in the higherdimensional cases are techniques based on using ellipsoids; see, e.g., [24,25,26]. If we use ellipsoids, then some problems becomes easy to solve: e.g., if the system is linear and we know the ellipsoid that contains its initial state, then the set of all possible states at the next moment of time is also an ellipsoid, and this ellipsoid can be easily computed. Other problems are not so easy. For example, if we have two different sources of noise and each of them is described by an ellipsoid, then the set of possible values of the overall noise is no longer an ellipsoid—using the term which is explained on the next section, it is the Minkowski sum of the two original ellipsoids. To apply the ellipsoid technique to this situation, we need to enclose this Minkowski sum into an ellipsoid. We want this enclosing ellipsoid to represent the sum as accurately as possible—so, to compute this ellipsoid, we need to solve the corresponding optimization problem (this will also be described in the next section).
The above techniques take care of the situations in which we either know all the corresponding probability distributions or we do not know the probabilities at all—we only know the bounds on the noises. In practice, often, we have the probability information about some noise components, and we only know bounds on other components of the noise. For such situations, we need state estimation techniques that would take both types of uncertainty into account. For the linear case, such techniques have been developed by Benjamin Noack in his PhD dissertation [27]. In this paper, we extend these techniques to the general nonlinear case.
Following Noack, we use ellipsoids to describe the UBB uncertainty. At this stage of our research, we limit ourselves to the situations when all the probability distributions are Gaussian. Thus, we call our new filtering techniques the Ellipsoidal and Gaussian Kalman Filter (EGKF, for short).
For random uncertainties, the newly proposed method keeps the Kalman filter’s recursive framework, and thus, with respect to this uncertainty, is as efficient as the original KF. Of course, since we also use ellipsoidal uncertainty, we need to solve an optimization problem on each step, as a result of which our method requires somewhat more computation time than the usual KF.
The structure of the paper is as follows. Section 2 describes the mathematical definitions and results that are used in the following text—including the general definitions and results about the corresponding dynamical systems and filters. In Section 3, we analyze the corresponding problem, derive the formulas of the resulting EGKF algorithm, and, finally, present this algorithm in a practical readytouse form. In Section 4, we show, on several test cases, how the new method works—and we show that, in most cases, it indeed performs better than EKF. The last section contains conclusions and future work.
2. Mathematical Model
This section contains the mathematical definitions and results that are used in the following text—including the general definitions and results about the corresponding dynamical systems and filters. It also includes the formulation of the problem in precise terms.
2.1. Mathematical Definitions and Results Used in This Paper
The following definitions, theorems and corollaries are required for the derivation of the new filter model EGKF. Proofs of these results can be found, e.g., in [26].
Definition 1.
Let n be a positive integer. By a bounded ellipsoid in ${\mathbb{R}}^{n}$ with nonempty interior (or simply ellipsoid, for short) we mean a set
where $\mathbf{c}\in {\mathbb{R}}^{n}$ and S is a positive definite matrix (this will be denoted by $S>0$). $\mathbf{c}$ is called the center of the ellipsoid $\mathcal{E}$, and S is called the shape matrix.
$$\mathcal{E}=\mathcal{E}(\mathbf{c},S)=\{\mathbf{x}\in {\mathbb{R}}^{n}\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}{(\mathbf{x}\mathbf{c})}^{T}{S}^{1}(\mathbf{x}\mathbf{c})\le 1\},$$
The shape matrix specifies the size and orientation of the ellipsoid.
Definition 2.
Let A and B be sets in the Euclidean space ${\mathbb{R}}^{n}$. By the Minkowski sum $A+B$ of A and B, we mean the set of all possible values obtained by adding elements of A and B:
$$A+B=\{a+b\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}a\in A,b\in B\}.$$
In general, for K ellipsoids in ${\mathbb{R}}^{n}$
their Minkowski sum
is not an ellipsoid anymore; however, it is still a convex set.
$${\mathcal{E}}_{k}=\mathcal{E}({\mathbf{c}}_{k},{S}_{k}),(k=1,2,\cdots ,K),$$
$${\mathcal{U}}_{K}=\sum _{k=1}^{K}{\mathcal{E}}_{k}$$
In this paper, we will look for the ellipsoid with the smallest trace $\mathrm{tr}\left(\mathrm{S}\right)$ that contains the Minkowski sum. We selected this criterion since it corresponds to minimizing the mean square error in the probabilistic case. In principle, we could instead minimize the volume of the ellipsoid—which corresponds to minimizing the determinant $\leftS\right$ of the shape matrix, or the largest eigenvalue of the shape matrix ${\lambda}_{M}\left(S\right)$. Minimizing the largest eigenvalue ${\lambda}_{M}\left(S\right)$ makes the ellipsoid more like a ball (in 2D case, more like a circle). Minimizing the volume sometimes leads to oblate ellipsoids, with large uncertainty in some directions.
For minimizing the trace, we will denote the corresponding minimization problem by ${\mathrm{T}}^{+}$:
$${\mathcal{E}}^{*}=\underset{{\mathcal{U}}_{K}\subseteq \mathcal{E}}{argmin}\phantom{\rule{0.166667em}{0ex}}\mathrm{tr}\left(\mathrm{S}\right)\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{1.em}{0ex}}\left(\mathrm{Problem}\phantom{\rule{0.166667em}{0ex}}{\mathrm{T}}^{+}\right).$$
It is known that the optimal ellipsoid ${\mathcal{E}}^{*}$ always exists and is unique [26].
Theorem 1.
The center of the optimal ellipsoid ${\mathcal{E}}^{*}$ for Problem ${T}^{+}$ is given by
$${\mathbf{c}}^{*}=\sum _{k=1}^{K}{\mathbf{c}}_{k}.$$
Theorem 2.
Let $\mathcal{D}$ be the convex set of all vectors $\alpha \in {\mathbb{R}}^{K}$ for which ${\alpha}_{k}>0$ for all k and ${\sum}_{k=1}^{K}{\alpha}_{k}=1$. Then, for any $\alpha \in \mathcal{D}$, the ellipsoid ${\mathcal{E}}_{\alpha}={\mathcal{E}}^{+}({\mathbf{c}}^{*},{S}_{\alpha})$, with ${\mathbf{c}}^{*}$ defined by (6) and
contains the Minkowski sum ${\mathcal{U}}_{K}$.
$${S}_{\alpha}=\sum _{k=1}^{K}{\alpha}_{k}^{1}{S}_{k}$$
Corollary 1.
(Special case of Theorem 2) When $K=2$, we have ${\alpha}_{1}+{\alpha}_{2}=1$, so the formula for ${S}_{\alpha}$ can be rewritten as
where β can be any positive real number.
$${S}_{\alpha}=\frac{1}{{\alpha}_{1}}{S}_{1}+\frac{1}{{\alpha}_{2}}{S}_{2}=\left(1+\frac{1}{\beta}\right){S}_{1}+(1+\beta ){S}_{2},$$
Proof of Corollary 1.
Indeed, for each $\beta >0$, we can take ${\alpha}_{1}=\frac{\beta}{1+\beta}$ and ${\alpha}_{2}=\frac{1}{1+\beta}$. ☐
Theorem 3.
In the family ${\mathcal{E}}_{\alpha}={\mathcal{E}}^{+}({c}^{*},{S}_{\alpha})$, the minimaltrace ellipsoid containing the Minkowski sum of the ellipsoids ${\mathcal{E}}_{k}={\mathcal{E}}^{+}({\mathbf{c}}_{k},{S}_{k}),k=1,2,\cdots ,K$ is obtained for
$${S}_{{\alpha}^{*}}=\left(\sum _{k=1}^{K}\sqrt{\mathrm{tr}\left({S}_{k}\right)}\right)\xb7\left(\sum _{k=1}^{K}\frac{{S}_{k}}{\sqrt{\mathrm{tr}\left({S}_{k}\right)}}\right).$$
Corollary 2.
(Special case of Theorem 3) When $K=2$, we have
$${S}_{{\alpha}^{*}}=\left(1+\frac{1}{{\beta}^{*}}\right){S}_{1}+(1+{\beta}^{*}){S}_{2},where{\beta}^{*}=\sqrt{\frac{\mathrm{tr}\left({S}_{1}\right)}{\mathrm{tr}\left({S}_{2}\right)}}.$$
2.2. Dynamical Systems and Linearization
In this paper, we consider the following general discretetime nonlinear system:
where:
$$\begin{array}{cc}\hfill {\mathbf{x}}_{k+1}& ={f}_{k}({\mathbf{x}}_{k},{\mathbf{u}}_{k},{\mathbf{w}}_{k},{\mathbf{a}}_{k}),\hfill \end{array}$$
$$\begin{array}{cc}\hfill {\mathbf{y}}_{k}& ={h}_{k}({\mathbf{x}}_{k},{\mathbf{v}}_{k},{\mathbf{b}}_{k}),\hfill \end{array}$$
 ${\mathbf{x}}_{k}$ is a ndimensional state vector at time ${t}_{k}$,
 ${\mathbf{u}}_{k}$ is the known input vector,
 ${\mathbf{w}}_{k}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{u})$ is a Gaussian system noise with covariance matrix ${C}_{k}^{u}$,
 ${\mathbf{a}}_{k}\in \mathcal{E}(\mathbf{0},{S}_{k}^{u})$ is an UBB perturbation with shape matrix ${S}_{k}^{u}$,
 ${\mathbf{v}}_{k}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{z})$ is a Gaussian measurement noise with covariance matrix ${C}_{k}^{z}$,
 ${\mathbf{b}}_{k}\in \mathcal{E}(\mathbf{0},{S}_{k}^{z})$ is an UBB perturbation with shape matrix ${S}_{k}^{z}$—all at the time ${t}_{k}$.
To make notations clearer, in this paper, parameters related to the first (system) equation will be denoted by u and parameters related to the second (measurement) equation will be denoted by z.
Similarly to EKF, we start with linearization: namely, we expand the righthand side of the above equations in Taylor series and keep only linear terms in the corresponding expansion. For the system Equation (11a), we perform Taylor expansion around the point $({\mathbf{x}}_{k}={\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k}={\mathbf{u}}_{k},{\mathbf{w}}_{k}=\mathbf{0},{\mathbf{a}}_{k}=\mathbf{0})$ and get the following result:
$$\begin{array}{cc}\hfill {\mathbf{x}}_{k+1}=& {f}_{k}({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0})+\frac{\partial {f}_{k}}{\partial {\mathbf{x}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0})}({\mathbf{x}}_{k}{\widehat{\mathbf{x}}}_{k}^{+})+\frac{\partial {f}_{k}}{\partial {\mathbf{w}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0})}{\mathbf{w}}_{k}\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +\frac{\partial {f}_{k}}{\partial {\mathbf{a}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0})}{\mathbf{a}}_{k}+\cdots \hfill \\ \hfill \approx & {f}_{k}({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0})+{F}_{x,k}({\mathbf{x}}_{k}{\widehat{\mathbf{x}}}_{k}^{+})+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}\hfill \\ \hfill =& {F}_{x,k}{\mathbf{x}}_{k}+[{f}_{k}({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0}){F}_{x,k}{\widehat{\mathbf{x}}}_{k}^{+}]+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}\hfill \\ \hfill =& {F}_{x,k}{\mathbf{x}}_{k}+{\tilde{\mathbf{u}}}_{k}+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}.\hfill \end{array}$$
Here, we denoted ${\tilde{\mathbf{u}}}_{k}={f}_{k}({\widehat{\mathbf{x}}}_{k}^{+},{\mathbf{u}}_{k},\mathbf{0},\mathbf{0}){F}_{x,k}{\widehat{\mathbf{x}}}_{k}^{+}$ and ${F}_{x,k}$, ${F}_{w,k}$, and ${F}_{a,k}$ denote the corresponding derivatives. Note that, in spite of the fact that our uncertainty model is different from the purely probabilistic model underlying the EKF technique, the resulting formulas are similar to the corresponding EKF formulas–since at this stage, we did not yet use any information about the uncertainty.
For the measurement Equation (11b), a similar Taylor expansion around the natural point $({\mathbf{x}}_{k}={\widehat{\mathbf{x}}}_{k}^{},{\mathbf{v}}_{k}=\mathbf{0},{\mathbf{b}}_{k}=\mathbf{0})$ leads to the following formula:
$$\begin{array}{cc}\hfill {\mathbf{y}}_{k}=& {h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})+\frac{\partial {h}_{k}}{\partial {\mathbf{x}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})}({\mathbf{x}}_{k}{\widehat{\mathbf{x}}}_{k}^{})+\frac{\partial {h}_{k}}{\partial {\mathbf{v}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})}{\mathbf{v}}_{k}\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +\frac{\partial {h}_{k}}{\partial {\mathbf{b}}_{k}}{}_{({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})}{\mathbf{b}}_{k}+\cdots \hfill \\ \hfill \approx & {h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})+{H}_{x,k}({\mathbf{x}}_{k}{\widehat{\mathbf{x}}}_{k}^{})+{H}_{v,k}{\mathbf{v}}_{k}+{H}_{b,k}{\mathbf{b}}_{k}\hfill \\ \hfill =& {H}_{x,k}{\mathbf{x}}_{k}+{\tilde{\mathbf{z}}}_{k}+{H}_{v,k}{\mathbf{v}}_{k}+{H}_{b,k}{\mathbf{b}}_{k}.\hfill \end{array}$$
Here, we denoted ${\tilde{\mathbf{z}}}_{k}={h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0}){H}_{x,k}{\widehat{\mathbf{x}}}_{k}^{}$. Notice that if the measurement equation is linear, then ${\tilde{\mathbf{z}}}_{k}=\mathbf{0}$.
Thus, from the original system (11), we get the following linearized system:
$$\begin{array}{c}\hfill {\mathbf{x}}_{k+1}={F}_{x,k}{\mathbf{x}}_{k}+{\tilde{\mathbf{u}}}_{k}+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k},\end{array}$$
$$\begin{array}{c}\hfill {\mathbf{y}}_{k}={H}_{x,k}{\mathbf{x}}_{k}+{\tilde{\mathbf{z}}}_{k}+{H}_{v,k}{\mathbf{v}}_{k}+{H}_{b,k}{\mathbf{b}}_{k}.\end{array}$$
These equations describe the change in the (unknown) true state ${\mathbf{x}}_{k}$. In practice, we only know the estimates for the state. Following the general idea of Kalman filter, at each moment of time ${t}_{k}$, we will consider:
 an a priori estimate of this state—i.e., what we predict based on the information available at the previous moment of time—and
 an a posterior estimate—what we get after taking into account the results of measurements performed at this moment of time.
In this paper, we will denote the a priori estimate by ${\widehat{\mathbf{x}}}_{k}^{}$ and the a posteriori estimate by ${\widehat{\mathbf{x}}}_{k}^{+}$.
Since we consider the case when the system has both the random and the unknownbutbounded error components (14), both a priori and a posteriori estimates should have the same form:
where ${\mathbf{c}}_{k}^{}$ and ${\mathbf{c}}_{k}^{+}$ are points, ${\mathbf{g}}_{k}^{}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{})$ and ${\mathbf{g}}_{k}^{+}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{+})$ are random variables with Gaussian distribution, ${\mathbf{e}}_{k}^{}\in \mathcal{E}(\mathbf{0},{S}_{k}^{})$ and ${\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{+})$ are two points within two ellipsoids with shape matrices ${S}_{k}^{}$ and ${S}_{k}^{+}$ respectively.
$$\begin{array}{c}{\widehat{\mathbf{x}}}_{k}^{}={\mathbf{c}}_{k}^{}+{\mathbf{g}}_{k}^{}+{\mathbf{e}}_{k}^{},\end{array}$$
$$\begin{array}{c}{\widehat{\mathbf{x}}}_{k}^{+}={\mathbf{c}}_{k}^{+}+{\mathbf{g}}_{k}^{+}+{\mathbf{e}}_{k}^{+},\end{array}$$
Figure 1 illustrates shows how a state ${\mathbf{x}}_{0}$ can be decomposed into the sum of the point estimate ${\mathbf{c}}_{0}$ and two perturbations: the UBB noise component ${\mathbf{e}}_{0}$ and the Gaussian noise component ${\mathbf{g}}_{0}$.
Our objective is to use the measurement results to generate the a posteriori estimate ${\widehat{\mathbf{x}}}_{0}^{+}$, i.e., in precise terms, to compute the point estimate ${\mathbf{c}}_{k}^{+}$ and the matrices ${C}_{k}^{+}$ and ${S}_{k}^{+}$.
3. Derivation of Ellipsoidal and Gaussian Kalman Filter
In this section we derive the EGKF model. The derivation is based on the results presented in Section 2.1. The derivation is summarized by the two theorems presented in the following subsection.
3.1. Prediction
As we have mentioned, our algorithm will describe, moment by moment, how our knowledge of the system’s state changes. At the first moment of time, we have information ${\widehat{\mathbf{x}}}_{0}^{+}$ about the initial state of the system. For each moment ${t}_{k}$, $k=0,1,\dots $, our algorithm will describe the transition from moment ${t}_{k}$ to the next moment ${t}_{k+1}$. So, at moment ${t}_{1}$, after applying our algorithm, we will know the a posteriori estimate ${\widehat{\mathbf{x}}}_{1}^{+}$ for the state of the system at this moment. Then, at moment ${t}_{2}$, we will get an a posteriori estimate ${\widehat{\mathbf{x}}}_{2}^{+}$ for the state of the system at moment ${t}_{2}$, etc.
Let us describe the transition from moment ${t}_{k}$ to the next moment ${t}_{k+1}$. At moment ${t}_{k}$, we know the a posteriori estimate of the state, i.e., we know that
where ${\mathbf{g}}_{k}^{+}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{+})$, ${\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{+})$ and ${\mathbf{c}}_{k}^{+},{C}_{k}^{+},{S}_{k}^{+}$ are given.
$${\widehat{\mathbf{x}}}_{k}^{+}={\mathbf{c}}_{k}^{+}+{\mathbf{g}}_{k}^{+}+{\mathbf{e}}_{k}^{+},$$
As we showed in the previous section, the a priori estimate at the next moment of time has the form
$${\widehat{\mathbf{x}}}_{k+1}^{}={F}_{x,k}{\widehat{\mathbf{x}}}_{k}^{+}+{\tilde{\mathbf{u}}}_{k}+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}.$$
The following theorem describes this a prior estimate in the desired form (15).
Theorem 4.
The a priori estimate ${\widehat{\mathbf{x}}}_{k+1}^{}$, defined by the Formula (18), is equal to: ${\widehat{\mathbf{x}}}_{k+1}^{}={\mathbf{c}}_{k+1}^{}+{\mathbf{g}}_{k+1}^{}+{\mathbf{e}}_{k+1}^{}$, where
and
$$\begin{array}{c}{\mathbf{c}}_{k+1}^{}={F}_{x,k}{\mathbf{c}}_{k}^{+}+{\tilde{\mathbf{u}}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{g}}_{k+1}^{}\sim \mathrm{N}(\mathbf{0},{C}_{k+1}^{}),\end{array}$$
$$\begin{array}{c}{\mathbf{e}}_{k+1}^{}\in \mathcal{E}(\mathbf{0},{S}_{k+1}^{}),\end{array}$$
$${C}_{k+1}^{}={F}_{x,k}{C}_{k}^{+}{F}_{x,k}^{T}+{F}_{w,k}{C}_{k}^{u}{F}_{w,k}^{T},$$
$${S}_{k+1}^{}=\left(\sqrt{\mathrm{tr}\left({F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}\right)}+\sqrt{\mathrm{tr}\left({F}_{a,k}{S}_{k}^{u}{F}_{a,k}^{T}\right)}\phantom{\rule{0.166667em}{0ex}}\right)\xb7\left(\frac{{F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}}{\sqrt{\mathrm{tr}\left({F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}\right)}}+\frac{{F}_{a,k}{S}_{k}^{u}{F}_{a,k}^{T}}{\sqrt{\mathrm{tr}\left({F}_{a,k}{S}_{k}^{u}{F}_{a,k}^{T}\right)}}\right).$$
Proof.
From (17) and (18), we get
where we denoted:
$$\begin{array}{cc}\hfill {\widehat{\mathbf{x}}}_{k+1}^{}=\phantom{\rule{0.166667em}{0ex}}& {F}_{x,k}({\mathbf{c}}_{k}^{+}+{\mathbf{g}}_{k}^{+}+{\mathbf{e}}_{k}^{+})+{\tilde{\mathbf{u}}}_{k}+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& {F}_{x,k}{\mathbf{c}}_{k}^{+}+{F}_{x,k}{\mathbf{g}}_{k}^{+}+{F}_{x,k}{\mathbf{e}}_{k}^{+}+{\tilde{\mathbf{u}}}_{k}+{F}_{w,k}{\mathbf{w}}_{k}+{F}_{a,k}{\mathbf{a}}_{k}\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& {\mathbf{c}}_{k+1}^{}+{\mathbf{g}}_{k+1}^{}+{\mathbf{e}}_{k+1}^{},\hfill \end{array}$$
$$\begin{array}{c}{\mathbf{c}}_{k+1}^{}={F}_{x,k}{\mathbf{c}}_{k}^{+}+{\tilde{\mathbf{u}}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{g}}_{k+1}^{}={F}_{x,k}{\mathbf{g}}_{k}^{+}+{F}_{w,k}{\mathbf{w}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{e}}_{k+1}^{}={F}_{x,k}{\mathbf{e}}_{k}^{+}+{F}_{a,k}{\mathbf{a}}_{k}.\end{array}$$
Here,
therefore according to the properties of Gaussian distributions and (26),
where
$${\mathbf{g}}_{k}^{+}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{+}),{\mathbf{w}}_{k}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{u}),$$
$${\mathbf{g}}_{k+1}^{}\sim \mathrm{N}(\mathbf{0},{C}_{k+1}^{}),$$
$${C}_{k+1}^{}={F}_{x,k}{C}_{k}^{+}{F}_{x,k}^{T}+{F}_{w,k}{C}_{k}^{u}{F}_{w,k}^{T}.$$
Also, here,
therefore according to Corollary 2:
where
$${\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{+}),{\mathbf{a}}_{k}\in \mathcal{E}(\mathbf{0},{S}_{k}^{u}),$$
$${\mathbf{e}}_{k+1}^{}\in \mathcal{E}(\mathbf{0},{S}_{k+1}^{}),$$
$${S}_{k+1}^{}=\left(\sqrt{\mathrm{tr}\left({F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}\right)}+\sqrt{\mathrm{tr}\left({F}_{a}{S}_{k}^{u}{F}_{a}^{T}\right)}\right)\xb7\left(\frac{{F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}}{\sqrt{\mathrm{tr}\left({F}_{x,k}{S}_{k}^{+}{F}_{x,k}^{T}\right)}}+\frac{{F}_{a}{S}_{k}^{u}{F}_{a}^{T}}{\sqrt{\mathrm{tr}\left({F}_{a}{S}_{k}^{u}{F}_{a}^{T}\right)}}\right).$$
The proof is complete. ☐
3.2. Filtering
In the previous section, we show how to generate the a priori estimate for the system’s state. As a result, at each moment ${t}_{k}$, we have the following estimate:
where ${\mathbf{g}}_{k}^{}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{})$, ${\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{})$ and ${\mathbf{c}}_{k}^{},{C}_{k}^{},{S}_{k}^{}$ are known.
$${\widehat{\mathbf{x}}}_{k}^{}={\mathbf{c}}_{k}^{}+{\mathbf{g}}_{k}^{}+{\mathbf{e}}_{k}^{},$$
Let us now describe how we take the observation results ${\mathbf{y}}_{k}$ into account, and thus, transform the a priori estimate into an a posteriori one. In line with the general KF idea, to compensate for the prediction error ${\mathbf{y}}_{k}{h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})$, we add, to the a priori estimate, a term proportional to this error, i.e., compute the following a posteriori estimate
for an appropriate matrix ${K}_{k}$. In this section, we consider what happens for a general selection of the gain factor ${K}_{k}$. In the next subsection, we will show how to select the optimal gain factor.
$${\widehat{\mathbf{x}}}_{k}^{+}={\widehat{\mathbf{x}}}_{k}^{}+{K}_{k}[{\mathbf{y}}_{k}{h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})],$$
Let us show how the resulting a posteriori estimate can be represented in the desired form (15). To come up with this form, we introduce a new parameter $\beta $; the optimal value of this parameter will also be described in the next subsection.
Theorem 5.
Let $\beta >0$. Then, an a posteriori estimate ${\widehat{\mathbf{x}}}_{k}^{+}$, as defined by the Formula (35), has the form ${\widehat{\mathbf{x}}}_{k}^{+}={\mathbf{c}}_{k}^{+}+{\mathbf{g}}_{k}^{+}+{\mathbf{e}}_{k}^{+}$, where:
and
$$\begin{array}{c}{\mathbf{c}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{c}}_{k}^{}+{K}_{k}({\mathbf{y}}_{k}{\tilde{\mathbf{z}}}_{k}),\end{array}$$
$$\begin{array}{c}{\mathbf{g}}_{k}^{+}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{+}),\end{array}$$
$$\begin{array}{c}{\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{+}),\end{array}$$
$${C}_{k}^{+}=(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}+{K}_{k}{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}{K}_{k}^{T},$$
$${S}_{k}^{+}\left(\beta \right)=\left(1+\frac{1}{\beta}\right)(I{K}_{k}{H}_{x,k}){S}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}+(1+\beta ){K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}{K}_{k}^{T}.$$
Proof.
From (34) and (35) we get
where we denoted
$$\begin{array}{cc}\hfill {\widehat{\mathbf{x}}}_{k}^{+}=\phantom{\rule{0.166667em}{0ex}}& {\widehat{\mathbf{x}}}_{k}^{}+{K}_{k}[{\mathbf{y}}_{k}{h}_{k}({\widehat{\mathbf{x}}}_{k}^{},\mathbf{0},\mathbf{0})]\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& {\widehat{\mathbf{x}}}_{k}^{}+{K}_{k}[{\mathbf{y}}_{k}{H}_{x,k}{\widehat{\mathbf{x}}}_{k}^{}{\tilde{\mathbf{z}}}_{k}]\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& (I{K}_{k}{H}_{x,k}){\widehat{\mathbf{x}}}_{k}^{}+{K}_{k}({\mathbf{y}}_{k}{\tilde{\mathbf{z}}}_{k})\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& (I{K}_{k}{H}_{x,k})({\mathbf{c}}_{k}^{}+{\mathbf{g}}_{k}^{}+{\mathbf{e}}_{k}^{})+{K}_{k}({H}_{x,k}{\mathbf{x}}_{k}+{H}_{v,k}{\mathbf{v}}_{k}+{H}_{b,k}{\mathbf{b}}_{k})\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& {\mathbf{c}}_{k}^{+}+{\mathbf{g}}_{k}^{+}+{\mathbf{e}}_{k}^{k},\hfill \end{array}$$
$$\begin{array}{c}{\mathbf{c}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{c}}_{k}^{}+{K}_{k}{H}_{x,k}{\mathbf{x}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{g}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{g}}_{k}^{}+{K}_{k}{H}_{v,k}{\mathbf{v}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{e}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{e}}_{k}^{}+{K}_{k}{H}_{b,k}{\mathbf{b}}_{k}.\end{array}$$
For the point estimate part, we have ${\mathbf{v}}_{k}=\mathbf{0}$ and ${\mathbf{b}}_{k}=\mathbf{0}$, therefore from (42) we get
$${\mathbf{c}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{c}}_{k}^{}+{K}_{k}({\mathbf{y}}_{k}{\tilde{\mathbf{z}}}_{k}).$$
Here,
therefore according to properties of Gaussian distributions and (37),
where
$${\mathbf{g}}_{k}^{}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{}),{\mathbf{v}}_{k}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{z}),$$
$${\mathbf{g}}_{k}^{+}\sim \mathrm{N}(\mathbf{0},{C}_{k}^{+}),$$
$${C}_{k}^{+}=(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}+{F}_{v,k}{C}_{k}^{z}{F}_{v,k}^{T}.$$
Also here,
then according to Corollary 1, for any $\beta >0$, we get
where
$${\mathbf{e}}_{k}^{}\in \mathcal{E}(\mathbf{0},{S}_{k}^{}),{\mathbf{b}}_{k}\in \mathcal{E}(\mathbf{0},{S}_{k}^{z}),$$
$${\mathbf{e}}_{k}^{+}\in \mathcal{E}(\mathbf{0},{S}_{k}^{+}),$$
$${S}_{k}^{+}\left(\beta \right)=\left(1+\frac{1}{\beta}\right)(I{K}_{k}{H}_{x,k}){S}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}+(1+\beta ){K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}{K}_{k}^{T}.$$
The proof is complete. ☐
3.3. Optimization Problem
To finalize our algorithm, the only thing left is to find the optimal gain factor ${K}_{k}$. Similar to KF and to EKF, we will find the gain factor that minimizes the mean square error of the posteriori estimation.
In our case, this error has two components: the statistical component and the UBB component. To combine these errors into a single objective function, we need to decide how much weight we give to each component. This relative weight will be described by a parameter $\eta \in [0,1]$, so that the resulting cost function will have the following form:
$$J\left(\beta \right)=(1\eta )\mathrm{tr}\left({C}_{k}^{+}\right)+\eta \mathrm{tr}\left[{S}_{k}^{+}\left(\beta \right)\right].$$
This expression represents the overall uncertainty of the posteriori estimation. We will find the gain factor ${K}_{k}$ and the value $\beta $ that minimize this cost function.
Plugging (48) and (51) into (52) we get:
where we denoted $M=\mathrm{tr}\left[(I{K}_{k}{H}_{x,k}){S}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\right]$ and $N=\mathrm{tr}\left({K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}{K}_{k}^{T}\right)$.
$$\begin{array}{cc}\hfill J\left(\beta \right)=\phantom{\rule{0.166667em}{0ex}}& (1\eta )\mathrm{tr}\left[(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\right]\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +(1\eta )\mathrm{tr}\left[{K}_{k}{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}{K}_{k}^{T}\right]\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +\eta \left(1+\frac{1}{\beta}\right)\mathrm{tr}\left[(I{K}_{k}{H}_{x,k}){S}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\right]\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +\eta (1+\beta )\mathrm{tr}\left({K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}{K}_{k}^{T}\right)\hfill \\ \hfill \triangleq & \phantom{\rule{0.166667em}{0ex}}(1\eta )\mathrm{tr}\left[(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\right]\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +(1\eta )\mathrm{tr}\left[{K}_{k}{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}{K}_{k}^{T}\right]+\eta \left(1+\frac{1}{\beta}\right)M+\eta (1+\beta )N,\hfill \end{array}$$
Let us first find the optimal value $\beta $. Since $M>0$ and $N>0$, taking into account that the arithmetic mean is always larger than or equal that the geometric mean, we conclude that
$$\left(1+\frac{1}{\beta}\right)M+(1+\beta )N=M+N+\frac{1}{\beta}M+\beta N\ge M+N+2\sqrt{MN}.$$
In particular, when $\frac{1}{\beta}M=\beta N$, i.e., when $M={\beta}^{2}N$ and $\beta =\sqrt{\frac{M}{N}}$, we have:
$$\left(1+\frac{1}{\beta}\right)M+(1+\beta )N=M+N+2\sqrt{MN}={\left(\sqrt{M}+\sqrt{N}\right)}^{2}.$$
Thus, the smallest possible value of the cost function is attained when $\beta =\sqrt{\frac{M}{N}}$. In this case, the cost function $J\left(\beta \right)$, as described by the Formula (52), has the following form:
$$\begin{array}{cc}\hfill J\left(\beta \right)=\phantom{\rule{0.166667em}{0ex}}& (1\eta )\mathrm{tr}\left[(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\right]\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +(1\eta )\mathrm{tr}\left[{K}_{k}{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}{K}_{k}^{T}\right]+\eta {\left(\sqrt{M}+\sqrt{N}\right)}^{2}.\hfill \end{array}$$
To find the optimal value of the gain factor ${K}_{k}$, let us differentiate the resulting expression by ${K}_{k}$ and equate the resulting derivative to 0.
Notice that
$$\begin{array}{cc}\hfill \frac{\partial \sqrt{M}}{\partial {K}_{k}}=\phantom{\rule{0.166667em}{0ex}}& \frac{1}{2}{M}^{\frac{1}{2}}\frac{\partial M}{\partial {K}_{k}}=\phantom{\rule{0.166667em}{0ex}}\frac{1}{2\sqrt{M}}(I{K}_{k}{H}_{x,k})\left({S}_{k}^{,T}+{S}_{k}^{}\right){H}_{x,k}^{T}\hfill \\ \hfill =& \frac{1}{\sqrt{M}}(I{K}_{k}{H}_{x,k}){S}_{k}^{}{H}_{x,k}^{T},\hfill \\ \hfill \frac{\partial \sqrt{N}}{\partial {K}_{k}}=\phantom{\rule{0.166667em}{0ex}}& \frac{1}{2}{N}^{\frac{1}{2}}\frac{\partial N}{\partial {K}_{k}}=\frac{1}{2\sqrt{N}}{K}_{k}{H}_{b,k}\left({S}_{k}^{z,T}+{S}_{k}^{z}\right){H}_{b,k}^{T}\hfill \\ \hfill =\phantom{\rule{0.166667em}{0ex}}& \frac{1}{\sqrt{N}}{K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}.\hfill \end{array}$$
Therefore,
$$\begin{array}{cc}\hfill \frac{\partial J}{\partial {K}_{k}}=\phantom{\rule{0.166667em}{0ex}}& 2(1\eta )({K}_{k}{H}_{x,k}I){C}_{k}^{}{H}_{x,k}^{T}+2(1\eta ){K}_{k}{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& +2\eta \left(1+\frac{1}{\beta}\right)({K}_{k}{H}_{x,k}I){S}_{k}^{}{H}_{x,k}^{T}+2\eta (1+\beta ){K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}.\hfill \end{array}$$
Equating the resulting derivative $\frac{\partial J}{\partial {K}_{k}}$ to 0, we get the following explicit formula for the optimal gain ${K}_{k}$:
where we denoted:
$${K}_{k}={A}^{1}\xb7\left[(1\eta ){C}_{k}^{}{H}_{x,k}^{T}+\eta \left(1+\frac{1}{\beta}\right){S}_{k}^{}{H}_{x,k}^{T}\right],$$
$$A=(1\eta ){H}_{x,k}{C}_{k}^{}{H}_{x,k}^{T}+(1\eta ){H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}+\eta \left(1+\frac{1}{\beta}\right){H}_{x,k}{S}_{k}^{}{H}_{x,k}^{T}+\eta (1+\beta ){H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}.$$
The only thing left to determine is the parameter $\beta $. For each $\beta $, substituting the expression (48), (51) and (59) into the formula for the cost function, we get the corresponding value of $J\left(\beta \right)$. We thus need to find the value ${\beta}^{*}$ that minimizes this cost function, i.e., that solves the following minimization problem:
$$\begin{array}{cc}\hfill \phantom{\rule{1.em}{0ex}}& \underset{\beta}{min}\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}J\left(\beta \right)\hfill \\ \hfill \phantom{\rule{1.em}{0ex}}& \mathrm{s}.\mathrm{t}.\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}\phantom{\rule{0.166667em}{0ex}}\beta \in (0,+\infty )\subset {\mathbb{R}}^{1}.\hfill \end{array}$$
This optimization problem is more complex than the quadratic optimization problems which usually occur in filtering. Indeed, for quadratic objective functions, derivatives are linear and thus, when we equate all the derivatives to 0, we get an easytosolve system of linear equations. In contrast, for nonquadratic objective functions, equating derivatives to 0 leads to a difficulttosolve system of nonlinear equations. Another reason why nonquadratic optimization problems are often difficult to solve is that traditional methods for solving these problems—such as gradient descent—often lead to a local minimum and not to the desired global minimum of the function. Good news is that in our case, similarly to [27], we can show that the cost function is convex. For convex objective functions, every local minimum is also a global minimum; see, e.g., [28]. Because of this, there exist many efficient techniques for solving convex optimization problems; for example, we can use the corresponding algorithm from the INTLAB toolbox [28].
Once we find the optimal value ${\beta}^{*}$, we can substitute it into the Formulas (48), (51) and (59), and get the desired a posteriori state estimate ${\widehat{\mathbf{x}}}_{k}^{+}$.
It should be mentioned that this estimate depends on the parameter $\eta $—which was introduced to balance the random uncertainty and UBB uncertainty. Specifically, we should mention three important cases—as described, e.g., in [26]:
 (1)
 When $\eta =\frac{1}{2}$, the stochastic uncertainty and UBB uncertainty have the same weight. and the optimal gain factor ${K}_{k}$ contains no $\eta $ in this case. This setting is recommended to users when there is no expertbased information about $\eta $ available.
 (2)
 When $\eta =0$, we get:$${K}_{k}\left(\beta \right)={C}_{k}^{}{H}_{x,k}^{T}\xb7{\left[{H}_{x,k}{C}_{k}^{}{H}_{x,k}^{T}+{H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}\right]}^{1}.$$In this case the filtering model contains no UBB uncertainty and the EGKF algorithm reduces to EKF [6].
 (3)
 When $\eta =1$, the model now only contains UBB uncertainty. In this case,$${K}_{k}\left(\beta \right)=\left(1+\frac{1}{\beta}\right){S}_{k}^{}{H}_{x,k}^{T}\xb7{\left[\left(1+\frac{1}{\beta}\right){H}_{x,k}{S}_{k}^{}{H}_{x,k}^{T}+(1+\beta ){H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}\right]}^{1}.$$
Based on Theorems 4 and 5, and Equations (59) and (61), we can summarize the EGKF model into the following Algorithm 1:
Algorithm 1 Ellipsoidal and Gaussian Kalman filter model.  
1:  Input:

2:  for k=1,2,…,K do 
3:  Calculation:
$${\mathbf{c}}_{k}^{}={F}_{x,k1}{c}_{k1}^{+}+{\tilde{\mathbf{u}}}_{k1}.$$
$${C}_{k}^{}={F}_{x,k1}{C}_{k1}^{+}{F}_{x,k1}^{T}+{F}_{w,k1}{C}_{k1}^{u}{F}_{w,k1}^{T}.$$
$$B=\left(\sqrt{\mathrm{tr}\left({F}_{x,k1}{S}_{k1}^{+}{F}_{x,k1}^{T}\right)}+\sqrt{\mathrm{tr}\left({F}_{a,k1}{S}_{k1}^{u}{F}_{a,k1}^{T}\right)}\right).$$
$${S}_{k}^{}=B\xb7\left(\frac{{F}_{x,k1}{S}_{k1}^{+}{F}_{x,k1}^{T}}{\sqrt{\mathrm{tr}\left({F}_{x,k1}{S}_{k1}^{+}{F}_{x,k1}^{T}\right)}}+\frac{{F}_{a,k1}{S}_{k1}^{u}{F}_{a,k1}^{T}}{\sqrt{\mathrm{tr}\left({F}_{a,k1}{S}_{k1}^{u}{F}_{a,k1}^{T}\right)}}\right).$$
$$A=(1\eta ){H}_{x,k}{C}_{k}^{}{H}_{x,k}^{T}+(1\eta ){H}_{v,k}{C}_{k}^{z}{H}_{v,k}^{T}+\eta \left(1+\frac{1}{\beta}\right){H}_{x,k}{S}_{k}^{}{H}_{x,k}^{T}+\eta (1+\beta ){H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}.$$
$${K}_{k}={A}^{1}\xb7\left[(1\eta ){C}_{k}^{}{H}_{x,k}^{T}+\eta \left(1+\frac{1}{\beta}\right){S}_{k}^{}{H}_{x,k}^{T}\right].$$
$${\mathbf{c}}_{k}^{+}=(I{K}_{k}{H}_{x,k}){\mathbf{c}}_{k}^{}+{K}_{k}({\mathbf{y}}_{k}{\tilde{\mathbf{z}}}_{k}).$$
$${C}_{k}^{+}=(I{K}_{k}{H}_{x,k}){C}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}+{F}_{v,k}{C}_{k}^{z}{F}_{v,k}^{T}.$$
$$\begin{array}{cc}{S}_{k}^{+}\left(\beta \right)=\hfill & \left(1+\frac{1}{\beta}\right)(I{K}_{k}{H}_{x,k}){S}_{k}^{}{(I{K}_{k}{H}_{x,k})}^{T}\phantom{\rule{1.em}{0ex}}\hfill \\ & +(1+\beta ){K}_{k}{H}_{b,k}{S}_{k}^{z}{H}_{b,k}^{T}{K}_{k}^{T}.\hfill \end{array}$$
$${\beta}^{*}=arg\underset{\beta >0}{min}\{(1\eta )\mathrm{tr}\left[{C}_{k}^{+}\left(\beta \right)\right]+\eta \mathrm{tr}\left[{S}_{k}^{+}\left(\beta \right)\right]\}.$$

4:  Output: 
${\mathbf{c}}_{k}^{+}$, ${C}_{k}^{+}$, and ${S}_{k}^{+}$.  
5:  end for 
4. Applications
This section contains the result of two test of applying EGKF to simulated data sets. We also applied EKF to both problems, and compared the rootmeansquare error (RMSE) of the two methods.
 In the first test, EGKF was applied on a highly nonlinear benchmark problem. To compare the result, we performed 100 simulations.
 In the second test, we applied EGKF on a twodimensional trajectory estimation problem. For this problem, we performed 1000 Monte Carlo runs.
To simulate the UBB uncertainty, we used uniform distributions on the corresponding ellipsoids.
4.1. Example 1: A Highly Nonlinear Benchmark Example
We consider the following example:
where:
$$\begin{array}{c}{\mathbf{x}}_{k+1}=\frac{1}{2}{\mathbf{x}}_{k}+\frac{25{\mathbf{x}}_{k}}{1+{\mathbf{x}}_{k}^{2}}+8cos\left[1.2(k1)\right]+{\mathbf{w}}_{k}+{\mathbf{a}}_{k},\end{array}$$
$$\begin{array}{c}{\mathbf{y}}_{k}=\frac{1}{20}{\mathbf{x}}_{k}^{2}+{\mathbf{v}}_{k}+{\mathbf{b}}_{k},\end{array}$$
 ${\mathbf{x}}_{k}$ is a 1dimensional scalar,
 ${\mathbf{u}}_{k}=8cos\left[1.2(k1)\right]$ is the input vector,
 ${\mathbf{w}}_{k}\sim \mathrm{N}(\mathbf{0},1)$ is Gaussian noise,
 ${\mathbf{w}}_{i,k}\in \mathcal{E}(\mathbf{0},9)$ is the unknown but bounded perturbation; in this 1D case the ellipsoid is simply the interval $[3,3]$,
 ${\mathbf{v}}_{k}\sim \mathrm{N}(\mathbf{0},1)$ is Gaussian measurement noise,
 ${\mathbf{b}}_{k}\in \mathcal{E}(\mathbf{0},4)$ is the unknown but bounded perturbation—in this case, the ellipsoid is the interval $[2,2]$.
Here:
 the initial true state is ${\mathbf{x}}_{0}=0.1$,
 the initial state estimate is ${\widehat{\mathbf{x}}}_{0}={\mathbf{x}}_{0}$,
 the initial estimate for covariance matrix is ${C}_{0}^{+}=2$,
 the initial estimate for the shape matrix is ${S}_{0}^{+}=1\times {10}^{3}$.
This discretetime dynamical system is a known benchmark in the nonlinear estimation theory; see, e.g., [6,8,29]. A high degree of nonlinearity in both process and measurement equations makes state estimation problem for this system very difficult.
We use this example to show that the new EGKF method behaves better than the traditional first order EKF when UBB uncertainties are taken into account.
We used a simulation length of 50 time steps. Following recommendations from the previous section, we selected the weight parameter $\eta =0.5$.
We repeated our simulation 100 times. Figure 2 shows the results of applying both EGKF and EKF for several of the 100 runs, namely, for runs 25, 50, 75, and 100.
In the above figure:
 the red stars denote the true state,
 the black crosses represent the EKF estimates of the state,
 the blue lines represent the UBB ellipsoids estimated by EGKF (in this 1D case they are intervals),
 the blue plus signs mark the centers of the output ellipsoids.
One can see that the centers of the ellipsoids are, in general, different given from the EKF estimates.
For each of the two methods and for each of the 100 iterations, at each moment of time, we can calculate the difference between the actual state and the corresponding estimate. (As estimates corresponding to EGKF, we took the centers of the corresponding a posterior ellipsoids.)
Based on 50 moments of time, we get a vector consisting of 50 such actualstatevs.estimatedstate differences. To compare the vectors corresponding to two different methods, we calculated the ${l}_{2}$ norms of these vectors—this is equivalent to comparing the rootmeansquare estimation errors. The resulting values are presented in Table 1 for simulations 1, 11, …, 91.
In most cases, the EGKF leads to a smaller mean squared estimation error. Over all 100 simulations:
 the average ${l}_{2}$ norm of the EGKF estimates is 148.70, while
 for the extended Kalman filter, the average ${l}_{2}$ norm is much higher: it is equal to 192.29.
Thus, we conclude that, on average, the new EGKF algorithm performs much better than EKF.
We got a similar conclusion when, instead of comparing the ${l}_{2}$ norms, we compared:
 the ${l}_{1}$ norms—which correspond to comparing mean absolute values of the estimation errors, and
 the ${l}_{\infty}$ norms—which correspond to comparing the largest estimation errors.
4.2. Example 2: TwoDimensional Trajectory Estimation
In the second test, we used 1000times Monte Carlo simulations to compare the performance of EGKF and EKF on a 2D trajectory estimation problem from [10].
In this problem, a vehicle moves on a plane, following a curved trajectory. The state vector $\mathbf{x}=(x,y,{v}_{x},{v}_{y})$ contains positions and velocities of the vehicle, in xdirection and ydirection, respectively. After linearization, we get the following equation:
$${\mathbf{x}}_{k+1}={F}_{k}{\mathbf{x}}_{k}+{\mathbf{w}}_{k}+{\mathbf{a}}_{k}.$$
Here:
 ${\mathbf{x}}_{k}=({x}_{k},{y}_{k},{v}_{x,k},{v}_{y,k})$ is the state vector at time ${t}_{k}$;
 the transition matrix ${F}_{k}$ has the form:$${F}_{k}=\left(\begin{array}{cccc}1& 0& \Delta t& 0\\ 0& 1& 0& \Delta t\\ 0& 0& 1& 0\\ 0& 0& 0& 1\end{array}\right);$$
 ${\mathbf{w}}_{\mathbf{k}}$ is Gaussian noise with covariance matrix ${C}_{k}^{u}$,
 ${\mathbf{a}}_{\mathbf{k}}$ is the unknown but bounded uncertainty, which is bounded by an ellipsoid with shape matrix ${S}_{k}^{u}$.
In total, in each of the 1000 simulations, we observed 500 epochs with time step $\Delta t=0.1$ s. In our formulas, as units of time, distance, and angle, we chose second, meter and degree, respectively.
In this experiment, two observation stations located at points ${S}_{1}=[{s}_{12},{s}_{12}]$ and ${S}_{2}=[{s}_{21},{s}_{22}]$ performed the measurements. Each station measured the distance to the vehicle and the direction to the vehicle, as described by the following equation:
$${\mathbf{y}}_{k}=\left(\begin{array}{c}{d}_{1}\\ {d}_{2}\\ {\theta}_{1}\\ {\theta}_{2}\end{array}\right)=\left(\begin{array}{c}\sqrt{{[x{s}_{11}]}^{2}+{[y{s}_{12}]}^{2}}\\ \sqrt{{[x{s}_{21}]}^{2}+{[y{s}_{22}]}^{2}}\\ arctan[(y{s}_{12})/(x{s}_{11})]\\ arctan[(y{s}_{22})/(x{s}_{21})]\end{array}\right)+{\mathbf{v}}_{k}+{\mathbf{b}}_{k}.$$
Here:
 ${\mathbf{v}}_{\mathbf{k}}$ represents random uncertainty; it is Gaussian with covariance matrix ${C}_{k}^{z}$,
 ${\mathbf{b}}_{k}$ represents the unknown but bounded uncertainty; it is bounded by an ellipsoid with shape matrix ${S}_{k}^{z}$.
The initial state estimate, and initial estimates for the covariance matrix and for the shape matrix are:
 ${\mathbf{x}}_{0}=(0,0,0,0)$,
 ${C}_{0}^{+}=\mathrm{diag}(0.01,0.01,0.01,0.01)$,
 ${S}_{0}^{+}=\mathrm{diag}({10}^{6},{10}^{6},{10}^{6},{10}^{6})$.
The initial covariance matrices in process equation and measurement equation are:
$${C}_{0}^{\mathbf{u}}=\left(\begin{array}{cccc}0.0033& 0& 0.005& 0\\ 0& 0.0033& 0& 0.005\\ 0.005& 0& 0.01& 0\\ 0& 0.005& 0& 0.01\end{array}\right),{C}_{0}^{z}=\left(\begin{array}{cccc}{0.05}^{2}& 0& 0& 0\\ 0& {0.05}^{2}& 0& 0\\ 0& 0& {0.01}^{2}& 0\\ 0& 0& 0& {0.01}^{2}\end{array}\right).$$
The initial shape matrices of UBB uncertainties in process equation and measurement equation are:
$${S}_{0}^{\mathbf{u}}=\left(\begin{array}{cccc}{1}^{2}& 0& 0& 0\\ 0& {1}^{2}& 0& 0\\ 0& 0& {0.5}^{2}& 0\\ 0& 0& 0& {0.5}^{2}\end{array}\right),{S}_{0}^{z}=\left(\begin{array}{cccc}{0.01}^{2}& 0& 0& 0\\ 0& {0.01}^{2}& 0& 0\\ 0& 0& {\left(\frac{\pi}{180}\right)}^{2}& 0\\ 0& 0& 0& {\left(\frac{\pi}{180}\right)}^{2}\end{array}\right).$$
As in the first example, we select the value $\eta =0.5$ of the weighting parameter.
Figure 3 shows an example of trajectory estimates by using EGKF and EKF obtained on one of the 100 iterations—namely, on the 5th iteration. To see it clearer, Figure 4 shows a more detailed information for several portions of the trajectory.
Here:
 the red stars mark the true positions,
 black crosses mark the EKF estimates,
 the blue plus signs are the centers of the ellipsoids computed by EGKF.
The two observation stations are marked by red triangles; they are connected by a straight line.
Figure 5 shows the comparison of RMSE estimation errors EGKF and EKF averaged over all 1000 MonteCarlo runs.
The peak occurred in the figure between [600, 800] is probably caused by the random uncertainty, just like what EKF behaves in nonlinear applications sometimes. In this experiment we introduced new UBB uncertainty but we still kept the influence of random uncertainty ($\eta =0.5$). The new EGKF can provide a better global estimation than EKF but a few exceptions may still happen. This unexpected jump also caused a larger standard deviation value in EGKF.
In almost all simulations (namely, in 997 cases out of 1000), the RMSE of EGKF was significantly smaller than for the EKF. We can therefore conclude that the new EGKF techniques provides better estimation for this nonlinear system than EKF. Detailed statistics of the comparison can be found in Table 2.
In Figure 6, we compared the average traces (over all 1000 MonteCarlo runs) of EGKF UBB shape matrix ${S}_{k}^{+}$, EGKF covariance matrix ${C}_{k}^{+}$ and EKF covariance matrix, in every epoch.
Notice that the trace of EGKF shape matrix is much larger than the traces of covariance matrices.
We can also see that the UBB uncertainty grows in the starting part of the trajectory and when the trajectory crosses the straight line connecting the observation stations—this corresponds to $k\in [210,220]$. This phenomenon is easy to explain: in this case, both angle measurements measure, in effect, the same quantity, so from the measurements, we get fewer information than in other parts of the trajectory.
The EGKF algorithm has been applied to a data set which is obtained from a real world experiment see, e.g., [30]. In that experiment, taken from the scope of georeferencing of terrestrial laser scanner, a multisensor system has collected the trajectory information of two GNSS antennas. EGKF was applied to estimate the positions and velocities of those two antennas and the results were compared with EKF.
5. Conclusions and Future Work
In this paper, we propose a new method for state estimation in situations when, in addition to the probabilistic uncertainty, we also have unknownbutbounded errors. Our testing shows that, on average, the new method leads to much more accurate estimates than the standard extended Kalman filter.
While our method has been shown to be successful, there is room for improvement.
First, the efficiency of our method depends on the initial selection of the parameters describing the system uncertainty and the measurement uncertainties. In some practical situations, we have prior information about these uncertainties, but in other practical cases, we do not have this information. Similarly to the usual Kalman filter, our method eventually converges to the correct uncertainty estimate, but this convergence may be somewhat slow. It would be nice to come up with recommendations on how to select the initial uncertainty parameters that would guarantee faster convergence.
Second, in our method, we assumed that measurements are reasonably frequent, so that during the time interval between the two measurements, we can safely ignore terms quadratic in terms of the corresponding changes—and thus consider a linearized system of equations. This is true in many practical situations—e.g., in the example of GNSSbased navigation that we considered in this paper. However, in some practical situations, measurements are rarer, and so, in the interval between the two measurements, we can no longer ignore terms which are quadratic in terms of changes. To cover such situations, it is desirable to extend our technique to secondorder Kalman filtering.
Third, in this paper, we followed the usual Kalman filter techniques in assuming that the corresponding probability distributions are Gaussian. As we mentioned in Section 1, in practice, sometimes distributions are not Gaussian. It is therefore desirable to extend our method to the general nonGaussian case – e.g., by using ideas of unscented Kalman filtering.
Fourth, in our method, we minimized the mean square estimation error—which, for our method, corresponds to selecting an ellipsoid with the smallest possible trace of the corresponding shape matrix. While in many practical problems, minimizing the mean square error is a reasonable idea, in some problems, it may be more reasonable to minimize, e.g., the largest possible estimation error. In this case, as we have mentioned, instead of minimizing the trace, we should be minimizing the largest eigenvalue of the shape matrix. It is this desirable to extend our method to this—and other—possible criteria.
Fifth, to make computations feasible, we approximated the set of possible states by an ellipsoid. In principle, we can get more accurate estimates if we use families of sets with more parameters for such approximation—e.g., zonotopes or, more generally, generic polytopes. From this viewpoint, it is desirable to come up with efficient algorithms for processing zonotope (and, more generally, polytope) uncertainty.
Finally, it is desirable to analyze (and, if necessary, improve) the stability of our method (and of other state estimation techniques). While stability is very important for practical problems, it is not even clear how to formulate this problem in precise mathematical terms, since the state estimation problems are usually illposed—as most inverse problems (see, e.g., [31])—and thus, strictly speaking, not stable.
Author Contributions
Conceptualization, I.N.; Data curation, L.S. and H.A.; Formal analysis, L.S. and V.K.; Funding acquisition, I.N.; Investigation, L.S. and H.A.; Methodology, L.S., B.K. and V.K.; Project administration, I.N.; Resources, H.A. and I.N.; Software, B.K.; Supervision, V.K. and I.N.; Validation, L.S.; Visualization, L.S.; Writing—original draft, L.S.; Writing—review & editing, H.A., B.K., V.K. and I.N.
Funding
This work was funded by the German Research Foundation (DFG) as a part of the Research Training Group i.c.sens (GRK2159).
Acknowledgments
The authors acknowledge the discussions with Steffen Schön, Franz Rottensteiner and Claus Brenner from Leibniz University Hannover. The authors also thank Sergey Grigorian from Department of Mathematics in University of Texas Rio Grande Valley for his comments and suggestions.
Conflicts of Interest
The authors declare no conflict of interest.
References
 Le, V.T.H.; Stoica, C.; Alamo, T.; Camacho, E.F.; Dumur, D. Zonotopic guaranteed state estimation for uncertain systems. Automatica 2013, 49, 3418–3424. [Google Scholar] [CrossRef]
 Kalman, R.E. A new approach to linear filtering and prediction problems. J. Basic Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
 Smith, G.L.; Schmidt, S.F.; McGee, L.A. Application of Statistical Filter Theory to the Optimal Estimation of Position and Velocity on Board a Circumlunar Vehicle; NASA: Washington, DC, USA, 1962.
 McElhoe, B.A. An assessment of the navigation and course corrections for a manned flyby of mars or venus. IEEE Trans. Aerosp. Electron. Syst. 1966, 4, 613–623. [Google Scholar] [CrossRef]
 Sorenson, H.W.; Alspach, D.L. Recursive Bayesian estimation using Gaussian sums. Automatica 1971, 7, 465–479. [Google Scholar] [CrossRef]
 Simon, D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches; JohnWiley & Sons, Inc.: Hoboken, NJ, USA, 2006. [Google Scholar]
 Julier, S.; Uhlmann, A. New extension of the Kalman filter to nonlinear systems. In Proceedings of the AeroSense, Orlando, FL, USA, 28 July 1997. [Google Scholar]
 Gordon, N.; Salmond, D.; Smith, A.F.M. Novel approach to nonlinear/nongaussian Bayesian state estimation. IEE Proc. F Radar Signal Process. 1993, 140, 107–113. [Google Scholar] [CrossRef]
 Alkhatib, H.; Paffenholz, J.A.; Kutterer, H. Sequential Monte Carlo filtering for nonlinear GNSS trajectories. In Proceedings of the Symposium on VII HotineMarussi Symposium on Mathematical Geodesy, Rome, Italy, 6–10 June 2009; Sneeuw, N., Novák, P., Crespi, M., Sansò, F., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; pp. 81–86. [Google Scholar]
 Alkhatib, H. Alternative Nonlinear Filtering Techniques in Geodesy for Dual State and Adaptive Parameter Estimation. In the 1st International Workshop on the Quality of Geodetic Observation and Monitoring Systems (QuGOMS’11); Kutterer, H., Seitz, F., Alkhatib, H., Schmidt, M., Eds.; Springer: Cham, Switzerland, 2015; Volume 140. [Google Scholar]
 Milanese, M.; Vicino, A. Optimal estimation theory for dynamic systems with set membership uncertainty: An overview. Automatica 1996, 27, 997–1009. [Google Scholar] [CrossRef]
 Witsenhausen, H.S. Sets of possible states of linear systems given perturbed observations. IEEE Trans. Autom. Control 1968, 13, 556–558. [Google Scholar] [CrossRef]
 Schweppe, F. Recursive state estimation: Unknown but bounded errors and system inputs. IEEE Trans. Autom. Control 1968, 13, 22–28. [Google Scholar] [CrossRef]
 Schweppe, F.C. Uncertain Dynamic Systems; Prentice Hall: Englewood Cliffs, NJ, USA, 1973. [Google Scholar]
 Kreinovich, V.; Lakeyev, A.; Rohn, J.; Kahl, P. Computational Complexity and Feasibility of Data Processing and Interval Computations; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013; Volume 10. [Google Scholar]
 Ferson, S.; Kreinovich, V.; Hajagos, J.; Oberkampf, W.; Ginzburg, L. Experimental Uncertainty Estimation and Statistics for Data Having Interval Uncertainty; Report SAND20070939,162; Sandia National Laboratories: Albuquerque, NM, USA, 2007.
 Kutterer, H.; Neumann, I. Recursive leastsquares estimation in case of interval observation data. Int. J. Reliab. Saf. 2011, 5, 229–249. [Google Scholar] [CrossRef]
 Vicino, A.A.; Zappa, G. Sequential approximation of feasible parameter sets for identification with set membership uncertainty. IEEE Trans. Autom. Control 1996, 41, 774–785. [Google Scholar] [CrossRef]
 Walter, E.; PietLahanier, H. Exact recursive polyhedral description of the feasible parameter set for boundederror models. IEEE Trans. Autom. Control 1989, 34, 911–915. [Google Scholar] [CrossRef]
 Combastel, C. A state bounding observer for uncertain nonlinear continuoustime systems based on zonotopes. In Proceedings of the 44th IEEE Conference on Decision and Control, Seville, Spain, 15 December 2005; pp. 7228–7234. [Google Scholar]
 Alamo, T.; Bravo, J.M.; Camacho, E.F. Guaranteed state estimation by zonotopes. Automatica 2005, 41, 1035–1043. [Google Scholar] [CrossRef]
 Althoff, M.; Stursberg, O.; Buss, M. Safety assessment for stochastic linear systems using enclosing hulls of probability density functions. In Proceedings of the 2009 European Control Conference, Budapest, Hungary, 23–26 August 2009; pp. 625–630. [Google Scholar]
 Schön, S.; Kutterer, H. Using zonotopes for overestimationfree interval leastsquares–some geodetic applications. Reliab. Comput. 2005, 11, 137–155. [Google Scholar] [CrossRef]
 Bertsekas, D.P.; Rhodes, I.B. On the minimax reachability of target sets and target tubes. Automatica 1971, 7, 233–247. [Google Scholar] [CrossRef]
 Polyak, B.T.; Nazin, S.A.; Durieu, C.; Walter, E. Ellipsoidal parameter or state estimation under model uncertainty. Automatica 2004, 40, 1171–1179. [Google Scholar] [CrossRef]
 Durieu, C.; Walter, E.; Polyak, B. Multiinput multioutput ellipsoidal state bounding. J. Optim. Theory Appl. 2001, 111, 273–303. [Google Scholar] [CrossRef]
 Noack, B. State Estimation for Distributed Systems with Stochastic and SetMembership Uncertainties; KIT Scientific Publishing: Karlsruhe, Germany, 2014; Volume 14. [Google Scholar]
 Rump, S.M. INTLAB—INTerval LABoratory. In Developments in Reliable Computing; Csendes, T., Ed.; Kluwer Academic Publishers: Amsterdam, The Netherlands, 1999; pp. 77–104. [Google Scholar]
 Kitagawa, G. Nongaussian state—Space modeling of nonstationary time series. J. Am. Stat. Assoc. 1987, 82, 1032–1041. [Google Scholar]
 Sun, L.; Alkhatib, H.; Paffenholz, J.A.; Neumann, I. GeoReferencing of a MultiSensor System Based on Setmembership Kalman Filter. In Proceedings of the 21st International Conference on Information Fusion (FUSION), Cambridge, UK, 10–13 July 2018; pp. 1–5. [Google Scholar]
 Luise, B. State estimation analysed as inverse problem. In Assessment and Future Directions of Nonlinear Model Predictive Control; Springer: New York, NY, USA, 2007; pp. 335–346. [Google Scholar]
Table 1.
Comparison of Ellipsoidal and Gaussian Kalman Filter (EGKF) and Extended Kalman Filter (EKF) in 10 simulations.
k  1  11  21  31  41  51  61  71  81  91 

EGKF  53.59  116.38  70.38  102.95  100.16  75.81  82.11  48.71  66.30  81.55 
EKF  72.49  116.64  230.08  234.69  80.73  31.32  109.53  79.69  8.63  64.95 
RMSE [m]  EGKF  EKF 

Min  0.187  0.241 
Max  0.501  0.278 
Mean  0.210  0.259 
Median  0.209  0.259 
Standard Deviation  0.019  0.006 
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).