Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance

Emanuele, Alessandro; Gasparotto, Francesco; Guerra, Giacomo; Zorzi, Mattia

doi:10.3390/s20113244

Open AccessArticle

Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance

Department of Information Engineering, University of Padova, Via Gradenigo 6/B, 35131 Padova, Italy

^*

Author to whom correspondence should be addressed.

Sensors 2020, 20(11), 3244; https://doi.org/10.3390/s20113244

Submission received: 23 April 2020 / Revised: 3 June 2020 / Accepted: 4 June 2020 / Published: 7 June 2020

(This article belongs to the Section Sensor Networks)

Download

Browse Figures

Versions Notes

Abstract

:

We propose a distributed Kalman filter for a sensor network under model uncertainty. The distributed scheme is characterized by two communication stages in each time step: in the first stage, the local units exchange their observations and then they can compute their local estimate; in the final stage, the local units exchange their local estimate and compute the final estimate using a diffusion scheme. Each local estimate is computed in order to be optimal according to the least favorable model belonging to a prescribed local ambiguity set. The latter is a ball, in the Kullback–Liebler topology, about the corresponding nominal local model. We propose a strategy to compute the radius, called local tolerance, for each local ambiguity set in the sensor network, rather than keep it constant across the network. Finally, some numerical examples show the effectiveness of the proposed scheme.

Keywords:

distributed robust Kalman filtering; least favorable analysis; sensor networks

1. Introduction

Modern problems involve a large number of sensors forming a sensor network and taking measurements from which we would like to infer quantities not accessible to observation possibly at each node location. These problems can be classified as filtering problems whose solution is given by the Kalman filter. On the other hand, its implementation is very expensive in terms of data transmission; indeed, we require that all sensors can exchange their measurements. Such a limitation disappears by considering distributed filtering [1,2,3,4,5,6,7,8,9]. The key idea is that the communication among the nodes is limited.

In the simplest distributed strategy, the state estimate of a node (i.e., local unit) is computed by using only the observations from its neighbors. Such a strategy, however, appears to be not very effective. A remarkable progress has been reached by distributed Kalman filtering with consensus [10,11,12,13] and diffusion strategies [14,15]. These distributed approaches are characterized by several communication stages during each time step. For instance, in the first stage, the local units exchange their observations and then they can compute their local estimate; in the final stage, the local units exchange their local estimate and compute the final estimate using a consensus or a diffusion scheme. Many important challenges have been addressed in distributed filtering. For instance, the issues about limited observability, network topologies that restrict allowable communications, and communication noises between sensors are considered in [16]; the case in which the sensor network is subject to transmission delays is considered in [17]; and the cases of missing measurements and absence of communication among the nodes are analyzed in [18,19], respectively. It is worth observing that distributed state estimation can be performed also through different principles [20,21,22,23,24,25]. For instance, each node can transmit its measurements to a fusion center and then the latter computes the state estimate.

An important aspect in filtering applications is that the nominal model does not correspond to the actual one. Risk sensitive Kalman filtering [26,27,28] addresses this problem by penalizing large errors. The severity of this penalization is tuned by the so-called risk sensitivity parameter: the larger the risk sensitivity parameter is, the more large errors are penalized. A refinement of these filters is given by robust Kalman filtering where the uncertainty is expressed incrementally [29,30,31,32]. More precisely, for each time step, the state estimator minimizes the prediction error according to the least favorable model belonging to a prescribed ambiguity set. The latter is a ball in the Kullback–Leibler (KL) topology whose center is the nominal model. The radius of this ball is called tolerance and it represents the discrepancy budget between the actual and the nominal model allowed for the corresponding time step. It is worth noting that the ambiguity set can be formed also by using different types of divergence (see, for instance, [33,34,35]).

The problem of distributed Kalman filtering under model uncertainty has been considered as well (see, e.g., [36,37,38]). In the present paper, we consider the distributed robust Kalman filter with diffusion step proposed in [39]. Here, the local estimate of each node is computed by using the robust Kalman filter in [29]. In this scenario, the least favorable model is the one used to compute the robust Kalman filter of the global model. Accordingly, we have one ambiguity set corresponding to the global model, which contains the actual model, and the local ambiguity sets corresponding to the nodes of the network. The centers of those balls are known because they are given by the nominal model. The local tolerances corresponding to the local ambiguity sets of the nodes are set equal to the one of the global ambiguity set. On the other hand, the local ambiguity set of a node corresponds to a local model which is just a part of the global model. Accordingly, taking all the local tolerances uniform across the network and equal to the global one may not be the best choice.

The main contribution of this paper is to propose a robust distributed Kalman filter as in [39] where the local tolerance for each node is customized. In this way, the local tolerance is non-uniform and time-varying across the network. We show through some simulation studies that the performance of the predictions is improved. Moreover, we show that, if the tolerance corresponding to the global ambiguity set is sufficiently small, then the local tolerances across the network are constant in the steady-state condition. Accordingly, it is also possible to simplify the distributed scheme by replacing the time-varying tolerances with the steady-state values.

The organization of this paper is as follows. In Section 2, we provide the background about robust Kalman filtering whose uncertainty is expressed incrementally. In Section 3, the distributed robust Kalman filter with local uniform tolerance is reviewed. In Section 4, we introduce the distributed robust Kalman filter with non-uniform local tolerance. In Section 5, we perform some numerical experiments to check the performance of the proposed distributed scheme. In Section 6, we propose an efficient approximation of the distributed robust Kalman filter with non-uniform local tolerance. Finally, in Section 7, we draw the conclusions.

Notation:

{[a_{i j}]}_{i j}

denotes the matrix having entry

a_{i j}

in position

(i, j)

;

A^{T}

is the transposition of matrix A; and

A > 0

(A \geq 0)

means that matrix A is positive (semi)-definite.

diag (A_{1} \dots A_{n})

denotes a block-diagonal matrix whose blocks in the main block diagonal are

A_{1} \dots A_{n}

. Given a squared matrix A,

tr (A)

and

| A |

denote the trace and the determinant of A, respectively. Given two matrices A and B,

A \otimes B

denotes their Kronecker product.

x \sim N (m, K)

means that x is a Gaussian random vector with mean m and covariance matrix K.

2. Background

In this section, we review the robust Kalman filter proposed in [29], which represents the “building block” used throughout the paper. Consider the nominal state-space model

\begin{matrix} x_{t + 1} & = A x_{t} + Γ_{B} u_{t} + r_{t} \\ y_{t} & = C x_{t} + Γ_{D} u_{t} \end{matrix}

(1)

where

A \in R^{n \times n}

,

Γ_{B} \in R^{n \times n + p N}

,

C \in R^{p N \times n}

,

Γ_{D} \in R^{p N \times n + p N}

,

x_{t}

is the state process,

y_{t}

is the observation process,

u_{t}

is normalized white Gaussian noise (WGN), and

r_{t}

is a deterministic signal. It is assumed that

u_{t}

is independent from the initial state

x_{0} \sim N ({\hat{x}}_{0}, V_{0})

. We also assume that the noise entering in the state process and the one entering in the observation process are independent, i.e., we assume that

Γ_{B} Γ_{D}^{T} = 0

. Finally, the state-space model in Equation (1) is considered to be reachable and observable. Let

ϕ_{t} (z_{t} | x_{t})

denote the nominal transition probability density of

z_{t} : = {[x_{t + 1}^{T} y_{t}^{T}]}^{T}

given

x_{t}

. Notice that

ϕ_{t} (z_{t} | x_{t})

is Gaussian by construction and it is straightforwardly given by Equation (1).

We assume that the (unknown) actual transition probability

{\tilde{ϕ}}_{t} (z_{t} | x_{t})

belongs to the ambiguity set which is a closed ball centered in

ϕ_{t} (z_{t} | x_{t})

in the KL topology:

B_{t} : = \{{\tilde{ϕ}}_{t} s . t . \tilde{E} [log (\frac{{\tilde{ϕ}}_{t}}{ϕ_{t}})| Y_{t - 1}] \leq c\}

(2)

with

\tilde{E} [log (\frac{{\tilde{ϕ}}_{t}}{ϕ_{t}})| Y_{t - 1}] : = \int \int {\tilde{ϕ}}_{t} (z_{t} | x_{t}) {\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) log (\frac{{\tilde{ϕ}}_{t} (z_{t} | x_{t})}{ϕ_{t} (z_{t} | x_{t})}) d z_{t} d x_{t},

Y_{t - 1} : = {y_{s}, s = 0 \dots t - 1}

, and

{\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) \sim N ({\hat{x}}_{t}, V_{t})

is defined as the actual conditional probability density of

x_{t}

given

Y_{t - 1}

. The mismatch modeling budget allowed for each time step is represented by the parameter

c > 0

, which is called tolerance. The robust estimator of

x_{t + 1}

given

Y_{t}

for the nominal model in Equation (1) is given by solving the following minimax problem:

{\hat{x}}_{t + 1} = \underset{g_{t} \in G_{t}}{argmin} max_{{\tilde{ϕ}}_{t} \in B_{t}} \tilde{E} [{∥x_{t + 1} - g_{t} (y_{t})∥}^{2}| Y_{t - 1}]

(3)

where

G_{t}

is the set of all estimators

g_{t}

whose variance is finite under any model in the ambiguity set

B_{t}

,

\begin{matrix} \tilde{E} [{∥x_{t + 1} - g_{t} (y_{t})∥}^{2}| Y_{t - 1}] : = \int \int {∥x_{t + 1} - g_{t} (y_{t})∥}^{2} {\tilde{ϕ}}_{t} (z_{t} | x_{t}) {\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) d z_{t} d x_{t} \end{matrix}

is the estimation error under the transition density

{\tilde{ϕ}}_{t} (z_{t} | x_{t})

. In [29], it is proved that the estimator solution to the problem in Equation (3) has the following Kalman-like structure:

\begin{matrix} G_{t} & = A V_{t} C^{T} {(C V_{t} C^{T} + Γ_{D} Γ_{D}^{T})}^{- 1} \\ {\hat{x}}_{t + 1} & = A {\hat{x}}_{t} + G_{t} (y_{t} - C {\hat{x}}_{t}) + r_{t} \\ P_{t + 1} & = A {(V_{t}^{- 1} + C^{T} {(Γ_{D} Γ_{D}^{T})}^{- 1} C)}^{- 1} A^{T} + Γ_{B} Γ_{B}^{T} \\ Find & θ_{t} s . t . γ (P_{t + 1}, θ_{t}) = c \\ V_{t + 1} & = {(P_{t + 1}^{- 1} - θ_{t} I)}^{- 1} \end{matrix}

(4)

where

γ (P, θ) : = log det (I - θ P) + tr ({(I - θ P)}^{- 1} - I) .

Parameter

θ_{t} > 0

is called risk sensitivity parameter. It is worth noting that, given

P > 0

and

c > 0

, the equation

γ (P, θ) = c

always admits a unique solution in

θ

and such that:

θ > 0

,

P^{- 1} - θ I > 0

. In the special case that

c = 0

, i.e., the nominal model coincides with the actual model, then

θ_{t} = 0

and thus Equation (4) degenerates in the usual Kalman filter.

Remark 1.

It is worth noting that the robust Kalman filter is well defined also in the case that the ambiguity set Equation (2) is defined by a time-varying tolerance, i.e.,

c_{t}

instead of c. However, we prefer to keep c constant in Equation (3) because in the following we assume that the actual (global) model is the solution to Equation (3) with constant tolerance c, in order to simplify the setup.

3. Distributed Robust Kalman Filtering with Uniform Local Tolerance

In this section, we review the distributed robust Kalman filter presented in [39]. Consider a network made by N sensors. The latter are connected if they can communicate with each other. Accordingly, every sensor k has a set of neighbors which is denoted by

N_{k}

. In particular,

k \in N_{k}

that is each node is connected with itself. The number of neighbors of node k is denoted by

n_{k}

. The corresponding

N \times N

adjacency matrix

J = {[j_{l k}]}_{l k}

is defined as

j_{l k} : = \{\begin{matrix} 1, & if l \in N_{k} \\ 0, & otherwise . \end{matrix}

We assume that every node collects a measurement

y_{k, t} \in R^{p}

at time t and the corresponding nominal state-space model is

\begin{matrix} x_{t + 1} & = A x_{t} + B w_{t} + r_{t} \\ y_{k, t} & = C_{k} x_{t} + D_{k} v_{k, t} k = 1 \dots N \end{matrix}

(5)

where

w_{t}

and

v_{k, t}

, with

k = 1 \dots N

, are independent normalized WGNs. It is worth noting that the actual state-space model for each node is unknown. By stacking Equation (5) for every k, it is possible to rewrite such sensor network as Equation (1) where:

\begin{matrix} y_{t} = [\begin{matrix} y_{1, t} \\ ⋮ \\ y_{N, t} \end{matrix}], u_{t} = [\begin{matrix} w_{t} \\ v_{t} \end{matrix}], v_{t} = [\begin{matrix} v_{1, t} \\ ⋮ \\ v_{N, t} \end{matrix}] \\ Γ_{B} = [\begin{matrix} B & 0 \end{matrix}], Γ_{D} = [\begin{matrix} 0 & D \end{matrix}] \\ C = [\begin{matrix} C_{1} \\ ⋮ \\ C_{N} \end{matrix}], D = diag (D_{1}, \dots, D_{N}) . \end{matrix}

(6)

Accordingly, Equation (4) represents the centralized robust Kalman filter. Defining

R : = D D^{T}

,

R_{l} : = D_{l} D_{l}^{T}

with

l = 1 \dots N

, and

S_{t o t} : = C^{T} R^{- 1} C = \sum_{l = 1}^{N} C_{l}^{T} R_{l}^{- 1} C_{l},

the Kalman gain for Equation (5) becomes, using the matrix inversion lemma,

G_{t} = A {(V_{t}^{- 1} + S_{t o t})}^{- 1} C^{T} R^{- 1} .

(7)

Since the nominal model in Equation (5) does not coincide with the actual one and each node k can only exploit information shared by its neighbors

l \in N_{k}

, the aim of distributed robust Kalman filtering is to compute a prediction

{\hat{x}}_{k, t}

of the state

x_{t}

for every node k by using only the local information, taking into account the model uncertainty. In the case that the node k has access to all measurements across all the nodes in the network, then

{\hat{x}}_{k, t}

coincides with Equation (4) which can be written, using the parameterization in Equations (6) and (7) as

\begin{matrix} {\hat{x}}_{k, t + 1} & = A {\hat{x}}_{k, t} + A {(V_{k, t}^{- 1} + S_{t o t})}^{- 1} \sum_{l = 1}^{N} C_{l}^{T} R_{l}^{- 1} (y_{l, t} - C_{l} {\hat{x}}_{k, t}) + r_{t} \\ P_{k, t + 1} & = A {(V_{k, t}^{- 1} + S_{t o t})}^{- 1} A^{T} + B B^{T} \\ Find & θ_{k, t} s . t . γ (P_{k, t + 1}, θ_{k, t}) = c \\ V_{k, t + 1} & = {(P_{k, t + 1}^{- 1} - θ_{k, t} I)}^{- 1} \end{matrix}

(8)

where

{\hat{x}}_{k, t} = {\hat{x}}_{t}

,

P_{k, t} = P_{t}

,

V_{k, t} = V_{t}

, and

θ_{k, t} = θ_{t}

. In the case that not all the measurements in the network are accessible to node k, then the target is to compute a state prediction

{\hat{x}}_{k, t}

of

x_{t}

which is as similar as possible to the global state prediction.

Assume that the node k can collect the measurements from its neighbors

N_{k}

. Then, the corresponding local nominal state-space model is

\begin{matrix} x_{t + 1} & = A x_{t} + B w_{t} + r_{t} \\ y_{l, t} & = C_{l} x_{t} + D_{l} v_{l, t}, l \in N_{k} . \end{matrix}

(9)

The latter can be rewritten in the compact form

\begin{matrix} x_{t + 1} & = A x_{t} + Γ_{B} u_{k, t}^{loc} + r_{t} \\ y_{k, t}^{loc} & = C_{k}^{loc} x_{t} + Γ_{D_{k}^{loc}} u_{k, t}^{loc} \end{matrix}

(10)

where

u_{k, t}^{loc} = {[w_{t}^{T} {(v_{k, t}^{loc})}^{T}]}^{T}

is the input noise and

y_{k, t}^{loc}

is the output;

v_{k, t}^{loc}

and

y_{k, t}^{loc}

are given by stacking

v_{l, t}

and

y_{l, t}

, with

l \in N_{k}

, respectively. Moreover,

C_{k}^{loc}

is given by stacking

C_{l}

with

l \in N_{k}

,

Γ_{D^{loc}} = [0 D_{k}^{loc}]

and

D_{k}^{loc}

is a block diagonal matrix whose main blocks are

D_{l}

with

l \in N_{k}

. In addition, defining

R_{k}^{loc} : = D_{k}^{loc} {(D_{k}^{loc})}^{T}

and

S_{k} : = {(C_{k}^{loc})}^{T} {(R_{k}^{loc})}^{- 1} C_{k}^{loc}

it results that

S_{k} = \sum_{l \in N_{k}} C_{l}^{T} R_{l}^{- 1} C_{l} .

We conclude that the one-step ahead predictor of

x_{t}

at node k is similar to the one in Equation (8) but now we need to discard the terms for which

l \notin N_{k}

. It is worth noting that the latter represents an intermediate local prediction of

x_{t + 1}

at node k, and it is denoted as

ψ_{k, t + 1}

. Allowing that the connected nodes can exchange their intermediate estimates, then each node can update the prediction at node k in terms of both

ψ_{k, t + 1}

and

ψ_{l, t + 1}

with

l \in N_{k}

. More precisely, consider a matrix

W = {[w_{l k}]}_{l k} \in R^{N \times N}

such that

\begin{matrix} w_{l k} \geq 0 and w_{l k} = 0 if l \notin N_{k} \\ \sum_{l \in N_{k}} w_{l k} = 1 . \end{matrix}

(11)

Therefore, the final predicted state at node k is given by means of the so-called diffusion step [14]:

{\hat{x}}_{k, t + 1} = \sum_{l \in N_{k}} w_{l k} ψ_{l, t + 1} .

To sum up, in the diffusion scheme, each local unit uses the measurements and the intermediate local predictions from its neighbors. The resulting scheme is explained through Algorithm 1.

Algorithm 1 Distributed robust Kalman filter with uniform local tolerance at time t.

Input: ${\hat{x}}_{k, t}, V_{k, t}, y_{k, t}, W = {[w_{l k}]}_{l k}$ with $k = 1 \dots N$
Output: ${\hat{x}}_{k, t + 1}, V_{k, t + 1}$ with $k = 1 \dots N$
Incremental step. Compute at every node k:

$\begin{matrix} ψ_{k, t + 1} & = A {\hat{x}}_{k, t} + A {(V_{k, t}^{- 1} + S_{k})}^{- 1} \sum_{l \in N_{k}} C_{l}^{T} R_{l}^{- 1} (y_{l, t} - C_{l} {\hat{x}}_{k, t}) + r_{t} \\ P_{k, t + 1} & = A {(V_{k, t}^{- 1} + S_{k})}^{- 1} A^{T} + B B^{T} \\ Find & θ_{k, t} s . t . γ (P_{k, t + 1}, θ_{k, t}) = c \\ V_{k, t + 1} & = {(P_{k, t + 1}^{- 1} - θ_{k, t} I)}^{- 1} \end{matrix}$

(12)
Diffusion step. Compute at every node k:

${\hat{x}}_{k, t + 1} = \sum_{l \in N_{k}} w_{l k} ψ_{l, t + 1}$

(13)

It is worth noting that

ψ_{k, t}

is computed by using the robust Kalman scheme in Equation (4) applied to the local model in Equation (10). In addition, c is the same for any node that is c takes a uniform value over the sensor network. In particular, the tolerance c is the same for both the centralized and the distributed Kalman filter. This strategy for the selection of the tolerance does not ensure that the least favorable model computed at node k is compatible with the one of the centralized filter. However, in the case of large deviations of the least favorable model corresponding to the centralized problem, it is very likely that the predictor at node k using Algorithm 1 is better than the one which assumes that the nominal and actual models coincide. Finally, in the case that

c = 0

, i.e., the nominal model coincides with the actual one, Algorithm 1 boils down to the distributed Kalman filter with diffusion step in [14].

4. Distributed Robust Kalman Filtering with Non-Uniform Local Tolerance

We investigate the possibility to assign a possibly different local tolerance to each node that is the local tolerance is not uniform across the sensor network. Recall that the least favorable model is given by the minimax problem in Equation (3), with constant tolerance c, and the corresponding optimal estimator is given by the centralized robust Kalman filter in Equation (4).

Consider the centralized problem in Equation (3). Let

\begin{matrix} {\bar{f}}_{t} (z_{t} | Y_{t - 1}) = \int ϕ_{t} (z_{t} | x_{t}) {\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) d x_{t} \end{matrix}

(14)

\begin{matrix} {\tilde{f}}_{t} (z_{t} | Y_{t - 1}) = \int {\tilde{ϕ}}_{t} (z_{t} | x_{t}) {\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) d x_{t} \end{matrix}

(15)

denote the pseudo-nominal and the least favorable conditional probability densities of

z_{t}

given the past observations

Y_{t - 1}

, respectively. Recall that

ϕ_{t} (z_{t} | x_{t})

is the nominal transition density of the state space model in Equation (1) and thus

\begin{matrix} ϕ_{t} (z_{t} | x_{t}) \sim N ([\begin{matrix} A x_{t} + r_{t} \\ C x_{t} \end{matrix}], [\begin{matrix} Γ_{B} \\ Γ_{D} \end{matrix}] [\begin{matrix} Γ_{B}^{T} & Γ_{D}^{T} \end{matrix}]) . \end{matrix}

(16)

Since

{\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) \sim N ({\hat{x}}_{t}, V_{t})

, and in view of Equations (14) and (16), we have

{\bar{f}}_{t} (z_{t} | Y_{t - 1}) \sim N (m_{z}, K_{z_{t}})

where

m_{z_{t}} = [\begin{matrix} A {\hat{x}}_{t} + r_{t} \\ C {\hat{x}}_{t} \end{matrix}], K_{z_{t}} = [\begin{matrix} A \\ C \end{matrix}] V_{t} [\begin{matrix} A^{T} & C^{T} \end{matrix}] + [\begin{matrix} Γ_{B} \\ Γ_{D} \end{matrix}] [\begin{matrix} Γ_{B}^{T} & Γ_{D}^{T} \end{matrix}] .

(17)

In [29], it has been shown that the optimal solution

{\tilde{ϕ}}_{t}^{0} (z_{t} | x_{t})

to Equation (3) is Gaussian. Accordingly, in view of Equation (15), the corresponding least favorable density of

z_{t}

given

Y_{t - 1}

is Gaussian:

{\tilde{f}}_{t} (z_{t} | Y_{t - 1}) \sim N ({\tilde{m}}_{z_{t}}, {\tilde{K}}_{z_{t}}) .

It is clear then that the minimax problem in Equation (3) can be written by replacing

ϕ_{t} (z_{t} | x_{t})

and

{\tilde{ϕ}}_{t} (z_{t} | x_{t})

with

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

and

{\tilde{f}}_{t} (z_{t} | Y_{t - 1})

, respectively. Then, the equivalent minimax problem is

{\hat{x}}_{t + 1} = \underset{g_{t} \in G_{t}}{argmin} max_{{\tilde{f}}_{t} \in {\bar{B}}_{t}} \int {∥x_{t + 1} - g_{t}∥}^{2} {\tilde{f}}_{t} (z_{t} | Y_{t - 1}) d z_{t}

(18)

where the ambiguity set is a ball about the pseudo-nominal density

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

\begin{matrix} {\bar{B}}_{t} & = \{{\tilde{f}}_{t} (z_{t} | Y_{t - 1}) \sim N ({\tilde{m}}_{z_{t}}, {\tilde{K}}_{z_{t}}) s . t . D_{K L} ({\tilde{f}}_{t} ∥ {\bar{f}}_{t}) \leq c\} \end{matrix}

formed by the KL divergence between

{\tilde{f}}_{t} (z_{t} | Y_{t - 1})

and

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

:

\begin{matrix} D_{K L} ({\tilde{f}}_{t} ∥ {\bar{f}}_{t}) & = \int {\tilde{f}}_{t} (z_{t} | Y_{t - 1}) log (\frac{{\tilde{f}}_{t} (z_{t} | Y_{t - 1})}{{\bar{f}}_{t} (z_{t} | Y_{t - 1})}) d z_{t} . \end{matrix}

Since

{\tilde{f}}_{t} (z_{t} | Y_{t - 1})

and

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

are Gaussian distributed, we have

D_{K L} ({\tilde{f}}_{t} ∥ {\bar{f}}_{t}) = \frac{1}{2} [∥ m_{z_{t}} - {\tilde{m}}_{z_{t}} ∥_{K_{z_{t}}^{- 1}}^{2} - log |{\tilde{K}}_{z_{t}}| + log |K_{z_{t}}| + tr ({\tilde{K}}_{z_{t}} K_{z_{t}}^{- 1}) - (n + p)] .

It is well known that

D_{K L} ({\tilde{f}}_{t} ∥ {\bar{f}}_{t})

also represents the negative log-likelihood of the model

{\bar{f}}_{t}

under the actual model

{\tilde{f}}_{t}

, [40,41,42]. Accordingly, c represents an upper bound of the negative log-likelihood and it can be found as follows. Fix the nominal state space model

(A, B, C, D)

and collect the data

(y^{N}, u^{N}, x^{N})

where

y^{N} = {y_{1} \dots y_{N}}

,

u^{N} = {u_{1} \dots u_{N}}

,

x^{N} = {x_{1} \dots x_{N}}

. Let

ℓ (A, B, C, D; y^{N}, u^{N}, x^{N})

be the negative log-likelihood of this nominal model using the collected data. Then, fix

c = ℓ (A, B, C, D; y^{N}, u^{N}, x^{N})

. Clearly, we need to assume that the state is accessible to observation (or its estimate is reasonably good) to compute c.

Theorem 1

(Levy & Nikoukhah [30]). Let

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

be the nominal density with mean

m_{z_{t}}

and covariance matrix

K_{z_{t}}

partitioned as

K_{z_{t}} = [\begin{matrix} K_{x_{t + 1}} & K_{x_{t + 1}, y_{t}} \\ K_{y_{t}, x_{t + 1}} & K_{y_{t}} \end{matrix}]

according to the dimension of

x_{t + 1}

and

y_{t}

, respectively. The least favorable density

{\tilde{f}}_{t}^{0} (z_{t} | Y_{t - 1})

solution to Equation (18) has mean and covariance matrix as follows:

{\tilde{m}}_{z_{t}} = m_{z_{t}}, {\tilde{K}}_{z_{t}} = [\begin{matrix} {\tilde{K}}_{x_{t + 1}} & K_{x_{t + 1}, y_{t}} \\ K_{y_{t}, x_{t + 1}} & K_{y_{t}} \end{matrix}] .

Let

\begin{matrix} P_{t + 1} & = K_{x_{t + 1}} - K_{x_{t + 1}, y_{t}} K_{y_{t}}^{- 1} K_{y_{t}, x_{t + 1}} \\ V_{t + 1} & = {\tilde{K}}_{x_{t + 1}} - K_{x_{t + 1}, y_{t}} K_{y_{t}}^{- 1} K_{y_{t}, x_{t + 1}} \end{matrix}

(19)

denote the nominal and least favorable error covariance matrices of

x_{t + 1}

given

Y_{t}

. Then,

\begin{matrix} V_{t + 1} = {(P_{t + 1}^{- 1} - θ_{t} I)}^{- 1} \end{matrix}

(20)

and

θ_{t} > 0

is the unique value for which

\begin{matrix} D_{K L} ({\tilde{f}}_{t}^{0} ∥ {\bar{f}}_{t}) = \frac{1}{2} [- log |{\tilde{K}}_{z_{t}}| + log |K_{z_{t}}| + tr ({\tilde{K}}_{z_{t}} K_{z_{t}}^{- 1}) - (n + p)] = c . \end{matrix}

(21)

The above result provides a way to compute

{\tilde{f}}_{t}^{0} (z_{t} | Y_{t - 1})

. Indeed, once the centralized robust Kalman filter in Equation (4) has been computed, the mean and the covariance matrix of

{\tilde{f}}_{t}^{0} (z_{t} | Y_{t - 1})

are given, in view of Equation (17), by

\begin{matrix} {\tilde{m}}_{z_{t}} = [\begin{matrix} A {\hat{x}}_{t} + r_{t} \\ C {\hat{x}}_{t} \end{matrix}], {\tilde{K}}_{z_{t}} = [\begin{matrix} V_{t + 1} + K_{x_{t + 1}, y_{t}} K_{y_{t}}^{- 1} K_{y_{t}, x_{t + 1}} & K_{x_{t + 1}, y_{t}} \\ K_{y_{t}, x_{t + 1}} & K_{y_{t}} \end{matrix}] \end{matrix}

(22)

where

\begin{matrix} K_{x_{t + 1}, y_{t}} & = A V_{t} C^{T} \\ K_{y_{t}} & = C V_{t} C^{T} + Γ_{D} Γ_{D}^{T} . \end{matrix}

From

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

and

{\tilde{f}}_{t}^{0} (z_{t} | Y_{t - 1})

, we can compute the nominal and least favorable density for each node. Consider the state-space model in Equation (10) for node k. Let

z_{k, t} : = {[x_{t + 1}^{T} {(y_{k, t}^{loc})}^{T}]}^{T}

. Then, the nominal transition probability at node k, in view of Equation (10), is

\begin{matrix} ϕ_{k, t} (z_{k, t} | x_{t}) \sim N ([\begin{matrix} A x_{t} + r_{t} \\ C_{k}^{loc} x_{t} \end{matrix}], [\begin{matrix} Γ_{B} \\ Γ_{D_{k}^{loc}} \end{matrix}] [\begin{matrix} Γ_{B}^{T} & Γ_{D_{k}^{loc}}^{T} \end{matrix}]) . \end{matrix}

(23)

Then,

\begin{matrix} {\bar{f}}_{k, t} (z_{k, t} | Y_{t - 1}) = \int ϕ_{k, t} (z_{k, t} | x_{t}) {\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) d x_{t} \end{matrix}

denotes the pseudo-nominal conditional probability density of

z_{k, t}

given the past observations

Y_{t - 1}

at node k. Since

{\overset{ˇ}{f}}_{t} (x_{t} | Y_{t - 1}) \sim N ({\hat{x}}_{t}, V_{t})

, and in view of Equation (23), we have

\begin{matrix} {\bar{f}}_{k, t} (z_{k, t} | Y_{t - 1}) & \sim N (m_{z_{k, t}}, K_{z_{k, t}}) \end{matrix}

where

\begin{matrix} m_{z_{k, t}} & = [\begin{matrix} A {\hat{x}}_{t} + r_{t} \\ C_{k}^{loc} {\hat{x}}_{t} \end{matrix}] \\ K_{z_{k, t}} & = [\begin{matrix} K_{x_{t + 1}} & K_{x_{t + 1}, y_{k, t}} \\ K_{y_{k, t}, x_{t + 1}} & K_{y_{k, t}} \end{matrix}] \end{matrix}

(24)

and

\begin{matrix} K_{x_{t + 1}, y_{k, t}} & = A V_{t} {(C_{k}^{loc})}^{T} \\ K_{y_{k, t}} & = C_{k}^{loc} V_{t} {(C_{k}^{loc})}^{T} + Γ_{D_{k}^{loc}} Γ_{D_{k}^{loc}}^{T} . \end{matrix}

(25)

Such a result is not surprising, indeed

{\bar{f}}_{k, t} (z_{k, t} | Y_{t - 1})

is given by marginalizing

{\bar{f}}_{t} (z_{t} | Y_{t - 1})

with respect to

y_{l, t}

with

l \notin N_{k}

. Roughly speaking, this means that

m_{z_{k, t}}

,

K_{x_{t + 1}, y_{k, t}}

and

K_{y_{k, t}}

are obtained from

m_{z_{t}}

,

K_{x_{t + 1}, y_{t}}

and

K_{y_{t}}

as follows:

$m_{z_{k, t}}$ is the vector obtained from $m_{z_{t}}$ by deleting the elements from $p l - p + 1$ to $p l$ for any $l \notin N_{k}$ .
$K_{x_{t + 1}, y_{k, t}}$ is the matrix obtained from $K_{x_{t + 1}, y_{t}}$ by deleting the columns from $p l - p + 1$ to $p l$ for any $l \notin N_{k}$ .
$K_{y_{k, t}}$ is the matrix obtained from $K_{y_{t}}$ by deleting the rows and the columns from $p l - p + 1$ to $p l$ for any $l \notin N_{k}$ .

Accordingly, we can compute the least favorable density at node k, say

{\tilde{f}}_{k, t}^{0} (z_{k, t} | Y_{t - 1})

, by marginalizing

{\tilde{f}}_{t}^{0} (z_{t} | Y_{t - 1})

with respect to

y_{l, t}

with

l \notin N_{k}

. Therefore, we have

\begin{matrix} {\tilde{f}}_{k, t}^{0} (z_{k, t} | Y_{t - 1}) & \sim N (m_{z_{k, t}}, {\tilde{K}}_{z_{k, t}}) \end{matrix}

with

{\tilde{K}}_{z_{k, t}} = [\begin{matrix} {\tilde{K}}_{x_{t + 1}} & K_{x_{t + 1}, y_{k, t}} \\ K_{y_{k, t}, x_{t + 1}} & K_{y_{k, t}} \end{matrix}] = [\begin{matrix} V_{t + 1} + K_{x_{t + 1}, y_{t}} K_{y_{t}}^{- 1} K_{y_{t}, x_{t + 1}} & K_{x_{t + 1}, y_{k, t}} \\ K_{y_{k, t}, x_{t + 1}} & K_{y_{k, t}} \end{matrix}]

where in the last equality we exploit Equation (22). It remains to design the robust filter to compute the intermediate prediction

ψ_{k, t + 1}

.

Remark 2.

At this point, it is worth doing a digression about Algorithm 1. The intermediate prediction at node k is the solution to the following minimax problem

ψ_{k, t + 1} = \underset{g_{k, t} \in G_{k, t}}{argmin} max_{{\tilde{f}}_{k, t} \in {\bar{B}}_{k, t}} \int {∥x_{t + 1} - g_{k, t} (y_{k, t}^{loc})∥}^{2} {\tilde{f}}_{k, t} (z_{k, t} | Y_{t - 1}) d z_{k, t}

(26)

where

\begin{matrix} {\bar{B}}_{k, t} : = & \{{\tilde{f}}_{k, t} s . t . D_{K L} ({\tilde{f}}_{k, t} ∥ {\bar{f}}_{k, t}) \leq c\} \end{matrix}

(27)

and

G_{k, t}

is the set of all estimators

g_{k, t}

whose variance is finite under any model in the ambiguity set

{\bar{B}}_{k, t}

. Moreover, in view of Theorem 1, the least favorable density

{\tilde{f}}_{k, t}^{★} (z_{k, t} | Y_{t - 1})

solution to Equation (26) is such that

D_{K L} ({\tilde{f}}_{k, t}^{★} ∥ {\bar{f}}_{k, t}) = c

. It is worth noting that the best estimator at node k would be the one constructed from

{\tilde{f}}_{k, t}^{0}

. On the other hand, the problem in Equation (26) implies neither

{\tilde{f}}_{t, k}^{0} = {\tilde{f}}_{t, k}^{★}

nor

D_{K L} ({\tilde{f}}_{k, t}^{★} ∥ {\bar{f}}_{k, t}) = D_{K L} ({\tilde{f}}_{k, t}^{0} ∥ {\bar{f}}_{k, t})

.

Clearly, one would design the intermediate estimator at node k by using

{\tilde{f}}_{k, t}^{0}

. However, the latter is not available at node k, and it is only known by a “central unit”, i.e., a unit knowing the global model, but neither collecting measurements nor computing predictions. Moreover, the transmission of the mean and the covariance matrix of

{\tilde{f}}_{k, t}^{0}

would be more expensive in terms of transmission costs. As alternative, we can consider a minimax problem whose least favorable model

{\tilde{f}}_{k, t}^{★}

is such that

D_{K L} ({\tilde{f}}_{k, t}^{★} ∥ {\bar{f}}_{k, t}) = D_{K L} ({\tilde{f}}_{k, t}^{0} ∥ {\bar{f}}_{k, t})

:

ψ_{k, t + 1} = \underset{g_{k, t} \in G_{k, t}}{argmin} max_{{\tilde{f}}_{k, t} \in {\bar{B}}_{k, t}} \int {∥x_{t + 1} - g_{k, t} (y_{k, t}^{loc})∥}^{2} {\tilde{f}}_{k, t} (z_{k, t} | Y_{t - 1}) d z_{k, t}

(28)

where

\begin{matrix} {\bar{B}}_{k, t} : = & \{{\tilde{f}}_{k, t} s . t . D_{K L} ({\tilde{f}}_{k, t} ∥ {\bar{f}}_{k, t}) \leq c_{k, t}\}, \end{matrix}

(29)

\begin{matrix} c_{k, t} : = & \frac{1}{2} [- log |{\tilde{K}}_{z_{k, t}}| + log |K_{z_{k, t}}| + tr ({\tilde{K}}_{z_{k, t}} K_{z_{k, t}}^{- 1}) - (n + p_{k})], \end{matrix}

(30)

and

p_{k}

coincides with the number of rows of

C_{k}^{loc}

. Under the above scheme, the central unit only transmits the local tolerance to each node in the network. The procedure which implements this optimized strategy of distributed robust Kalman filtering is outlined in Algorithm 2.

Algorithm 2 Distributed robust Kalman filter with non-uniform local tolerance at time t.

Input: ${\hat{x}}_{k, t}, V_{k, t}, y_{k, t}, W = {[w_{l k}]}_{l k}$ with $k = 1 \dots N$
Output: ${\hat{x}}_{k, t + 1}, V_{k, t + 1}$ with $k = 1 \dots N$
Tolerance update. Using the nominal global model, the central unit computes for every node k:

$c_{k, t} = \frac{1}{2} [- log |{\tilde{K}}_{z_{k, t}}| + log |K_{z_{k, t}}| + tr ({\tilde{K}}_{z_{k, t}} K_{z_{k, t}}^{- 1}) - (n + p_{k})]$

(31)
Incremental step. Compute at every node k:

$\begin{matrix} ψ_{k, t + 1} & = A {\hat{x}}_{k, t} + A {(V_{k, t}^{- 1} + S_{k})}^{- 1} \sum_{l \in N_{k}} C_{l}^{T} R_{l}^{- 1} (y_{l, t} - C_{l} {\hat{x}}_{k, t}) + r_{t} \\ P_{k, t + 1} & = A {(V_{k, t}^{- 1} + S_{k})}^{- 1} A^{T} + B B^{T} \\ Find & θ_{k, t} s . t . γ (P_{k, t + 1}, θ_{k, t}) = c_{k, t} \\ V_{k, t + 1} & = {(P_{k, t + 1}^{- 1} - θ_{k, t} I)}^{- 1} \end{matrix}$

(32)
Diffusion step. Compute at every node k:

${\hat{x}}_{k, t + 1} = \sum_{l \in N_{k}} w_{l k} ψ_{l, t + 1}$

(33)

Least Favorable Performance

We show how to evaluate the performance of the previously introduced distributed algorithm with non-uniform local tolerance and diffusion step with respect to the least favorable model solution of the centralized problem in Equation (3). More precisely, we show how to compute the mean and the variance of the prediction error for each node k in the network. In [29,34], it is shown that the least favorable model can be characterized through a state-space model over a finite interval

[0, T]

as follows. Let

ξ_{t} = {[x_{t}^{T} e_{t}^{T}]}^{T}

, where

x_{t}

is the least favorable state process. Then, the least favorable model takes the form

\begin{matrix} ξ_{t + 1} & = {\overset{ˇ}{A}}_{t} ξ_{t} + {\overset{ˇ}{B}}_{t} ε_{t} + {\overset{ˇ}{r}}_{t} \\ y_{t} & = {\overset{ˇ}{C}}_{t} ξ_{t} + {\overset{ˇ}{D}}_{t} ε_{t} \end{matrix}

(34)

where

ε_{t}

is normalized WGN, independent from

{\hat{x}}_{0}

, and

{\overset{ˇ}{r}}_{t} : = {[r_{t}^{T} 0]}^{T}

. Moreover,

\begin{matrix} {\overset{ˇ}{A}}_{t} : = & [\begin{matrix} A & Γ_{B} Γ_{H_{t}} \\ 0 & A - G_{t} C + (Γ_{B} - G_{t} Γ_{D}) Γ_{H_{t}} \end{matrix}], {\overset{ˇ}{B}}_{t} : = & [\begin{matrix} Γ_{B} Γ_{L_{t}} \\ (Γ_{B} - G_{t} Γ_{D}) Γ_{L_{t}} \end{matrix}] \\ {\overset{ˇ}{C}}_{t} : = & [\begin{matrix} C & Γ_{D} Γ_{H_{t}} \end{matrix}], {\overset{ˇ}{D}}_{t} Γ_{D} Γ_{L_{t}} \end{matrix}

where

Γ_{L_{t}}

is such that

K_{t} = Γ_{L_{t}} Γ_{L_{t}}^{T}

,

\begin{matrix} K_{t} : = & {(I - {(Γ_{B} - G_{t} Γ_{D})}^{T} (Ω_{t + 1}^{- 1} + θ_{t} I) (Γ_{B} - G_{t} Γ_{D}))}^{- 1} \\ Γ_{H_{t}} : = & K_{t} {(Γ_{B} - G_{t} Γ_{D})}^{T} (Ω_{t + 1}^{- 1} + θ_{t} I) (A - G_{t} C) . \end{matrix}

The matrix

Ω_{t + 1}^{- 1}

is computed from the backward recursion

\begin{matrix} Ω_{t}^{- 1} = {(A - G_{t} C)}^{T} (Ω_{t + 1}^{- 1} + θ_{t} I) (A - G_{t} C) & + Γ_{H_{t}}^{T} K_{t}^{- 1} Γ_{H_{t}} \end{matrix}

where the final point is initialized with

Ω_{T + 1}^{- 1} = 0

.

Let

{\tilde{x}}_{k, t} = x_{t} - {\hat{x}}_{t, k}

denote the least favorable state prediction error

{\tilde{x}}_{k, t}

of node k at time t using Algorithm 2 or Algorithm 1. Define the vector containing all the errors across the network

{\tilde{χ}}_{t} : = {[\begin{matrix} {\tilde{x}}_{1, t}^{T} & \dots & {\tilde{x}}_{N, t}^{T} \end{matrix}]}^{T} .

Using the same reasonings in [39], it is not difficult to prove that

{\tilde{χ}}_{t}

obeys the following dynamics

\begin{matrix} {\tilde{χ}}_{t + 1} & = A_{t} {\tilde{χ}}_{t} + B_{t} ε_{t} + C_{t} e_{t} \end{matrix}

(35)

where

\begin{matrix} A_{t} & : = (W^{T} \otimes I) (I \otimes A) {(V_{t}^{- 1} + S)}^{- 1} V_{t}^{- 1} \\ B_{t} & : = - (W^{T} \otimes I) (I \otimes A) {(V_{t}^{- 1} + S)}^{- 1} (J^{T} \otimes I) C^{T} R^{- 1} D L_{t} + 1 \otimes B N_{t} \\ C_{t} & : = - (W^{T} \otimes I) (I \otimes A) {(V_{t}^{- 1} + S)}^{- 1} (J^{T} \otimes I) C^{T} R^{- 1} D H_{t} + 1 \otimes B M_{t} \\ C & : = diag (C_{1}, \dots, C_{N}) \\ V_{t} & : = diag (V_{1, t}, \dots, V_{N, t}) \\ S & : = diag (S_{1}, \dots, S_{N}), \end{matrix}

M_{t} \in R^{n \times n}

,

H_{t} \in R^{p N \times n}

,

N_{t} \in R^{n \times (p N + n)}

and

L_{t} \in R^{p N \times (p N + n)}

are such that

Γ_{H_{t}} = {[M_{t}^{T} H_{t}^{T}]}^{T}

and

Γ_{L_{t}} = {[N_{t}^{T} L_{t}^{T}]}^{T}

. Finally,

1

denotes the vector of ones. Then, we combine Equation (35) with the model for

e_{t}

in Equation (34):

\begin{matrix} η_{t + 1} = F_{t} η_{t} + G_{t} ε_{t} \end{matrix}

(36)

where

η_{t} : = {[{\tilde{χ}}_{t}^{T} e_{t}^{T}]}^{T}

,

\begin{matrix} F_{t} & : = [\begin{matrix} A_{t} & C_{t} \\ 0 & (A - G_{t} C) + (Γ_{B} - G_{t} Γ_{D}) Γ_{H_{t}} \end{matrix}] \\ G_{t} & : = [\begin{matrix} B_{t} \\ (Γ_{B} - G_{t} Γ_{D}) Γ_{L_{t}} \end{matrix}] . \end{matrix}

(37)

Taking the expectation of Equation (36), we obtain

\begin{matrix} E [η_{t + 1}] = F_{t} E [η_{t}] . \end{matrix}

(38)

In view of the fact that

x_{0}

has mean equal to

{\hat{x}}_{0}

and

{\hat{x}}_{k, 0} = {\hat{x}}_{0}

for

k = 1 \dots N

, it is not difficult to see that

\tilde{E} [η_{0}] = 0

. This implies that

η_{t}

is a zero mean stochastic process or, equivalently, all the predictors are unbiased. Next, we show how to derive the variance of the prediction errors. Let

Q_{t} = E [η_{t} η_{t}^{T}]

. In view of the fact that

ε_{t}

is normalized WGN, by Equation (36), we have that

Q_{t}

is given by solving the following Lyapunov equation

\begin{matrix} Q_{t + 1} = F_{t} Q_{t} F_{t}^{T} + G_{t} G_{t}^{T} . \end{matrix}

(39)

We partition

Q_{t}

as follows:

\begin{matrix} Q_{t} = [\begin{matrix} P_{t} & H_{t} \\ H_{t}^{T} & R_{t} \end{matrix}] \end{matrix}

(40)

where

P_{t} \in R^{N n \times N n}

,

H_{t} \in R^{N n \times n}

and

R_{t} \in R^{n \times n}

. Notice that

P_{t}

contains in the main block diagonal the covariance matrices of the estimation error at each node. Accordingly, the least favorable mean square deviation is given by

{\bar{MSD}}_{t} : = \frac{1}{N} \sum_{k = 1}^{N} {MSD}_{k, t} = \frac{1}{N} tr (P_{t})

where

{MSD}_{k, t}

is the variance of the prediction error at node k. Finally, we have the following convergence result for the proposed distributed algorithm.

Proposition 1.

Let

(A, B)

be a reachable pair and

(A, C_{k}^{l o c})

be an observable pair for any k. Let W be an arbitrary diffusion matrix satisfying Equation (11). Then, there exists

c > 0

sufficiently small such that, for any arbitrary initial condition

V_{0} > 0

and

V_{k, 0} > 0

, the sequence

Q_{t}

,

t \geq 0

, generated by Equation (39) converges to

\bar{Q} > 0

over

[α T, β T]

as

T \to \infty

. Moreover, we have

F_{t} \to \bar{F}

,

G_{t} \to \bar{G}

, and

c_{k, t} \to {\bar{c}}_{k}

. In particular,

\bar{Q}

corresponds to the unique solution of the algebraic Lyapunov equation

\begin{matrix} \bar{Q} = \bar{F} \bar{Q} {\bar{F}}^{T} + \bar{G} {\bar{G}}^{T} \end{matrix}

(41)

with

\bar{F}

Schur stable. Accordingly,

{\bar{MSD}}_{t}

converges over

[α T, β T]

as

T \to \infty

.

Proof.

First, notice that the observability condition on the pairs

(A, C_{k}^{l o c})

implies the observability of

(A, C)

. Since the global model is reachable and observable, the robust centralized Kalman filter converges provided that c is sufficiently small (see [43,44]). As a consequence,

V_{t} \to \bar{V} > 0

as

t \to \infty

. Accordingly, in view of Equation (17),

K_{z_{t}} \to K_{z}

and, thus, in view of Equation (22),

{\tilde{K}}_{z_{t}} \to {\tilde{K}}_{z}

. Since

K_{z_{k, t}}

and

{\tilde{K}}_{z_{k, t}}

are submatrices of

K_{z_{t}}

and

{\tilde{K}}_{z_{t}}

, respectively, we have that

K_{z_{k, t}} \to K_{z_{k}}

and

{\tilde{K}}_{z_{k, t}} \to {\tilde{K}}_{z_{k}}

. Accordingly, in view of Equation (30), we have that

c_{k, t} \to {\bar{c}}_{k}

where

\begin{matrix} {\bar{c}}_{k} : = & \frac{1}{2} [- log |{\tilde{K}}_{z_{k}}| + log |K_{z_{k}}| + tr ({\tilde{K}}_{z_{k}} K_{z_{k}}^{- 1}) - (n + p_{k})] . \end{matrix}

(42)

In [30], it has been shown that

V_{t} \to P_{t}

as

c \to 0

, and thus

{\tilde{K}}_{z_{t}} \to K_{z_{t}}

. Since

K_{z_{k, t}}

and

{\tilde{K}}_{z_{k, t}}

are submatrices of

K_{z_{t}}

and

{\tilde{K}}_{z_{t}}

, respectively, we have that

{\tilde{K}}_{z_{k, t}} \to K_{z_{k, t}}

. Accordingly, in view of Equation (30), we have that

c_{k, t} \to 0

as

c \to 0

.

In view of ([43], Proposition 5.3), we conclude that the robust local Kalman filter at node k converges because: the local state-space model is reachable and observable;

{\bar{c}}_{k}

is sufficiently small provided that c is sufficiently small as well.

Finally, the remaining part of the proof follows the one in ([39], Section IV-A) (see also [45]). □

It is worth noting that Proposition 1 guarantees that

\bar{Q}

is bounded because

\bar{F}

is Schur stable. This means that the prediction errors over the network have finite variance, i.e., the Kalman gains of the local filters are stabilizing. The proof above also shows that, in the case

c = 0

, i.e., the nominal model coincides with the actual one, Algorithm 2 boils down to the distributed Kalman filter with diffusion step proposed in [14].

5. Numerical Examples

In this section, we test the performance of the distributed Kalman filters with uniform versus non-uniform local tolerance. More precisely, we consider the problem in [39] to track the position of a projectile from position observations corrupted by noise and coming from a network of

N = 20

sensors. The latter is shown in Figure 1.

The model for the projectile motion is

{\dot{x}}_{t}^{c} = Φ x_{t}^{c} + r_{t}^{c}

(43)

where

Φ = [\begin{matrix} 0 & 0 \\ I_{3} & 0 \end{matrix}],

r_{t}^{c} = {[\begin{matrix} 0 & 0 & - g & 0 & 0 & 0 \end{matrix}]}^{T}

, with

g = 10

, and

x_{t}^{c} = {[\begin{matrix} v_{x, t} & v_{y, t} & v_{z, t} & p_{x, t} & p_{y, t} & p_{z, t} \end{matrix}]}^{T}

, with v denoting the velocity and p the position in the three spatial components. We discretize Equation (43) by using a sampling time equal to

0.1 s

. In this way, the model becomes

x_{t + 1} = A x_{t} + r_{t}

where

x_{t}

is the sampled version of

x_{t}^{c}

,

A = I_{6} + 0.1 Φ

and

r_{t} = (0.1 I_{6} + {0.1}^{2} Φ / 2) u_{t}^{c}

. Assuming that every sensor can measure only along two spatial components, the output matrix of the kth node can be of the type

\begin{matrix} C_{k} = [\begin{matrix} 0 & 0 & 0 & diag (1, 1, 0) \end{matrix}], C_{k} = [\begin{matrix} 0 & 0 & 0 & diag (1, 0, 1) \end{matrix}], C_{k} = [\begin{matrix} 0 & 0 & 0 & diag (0, 1, 1) \end{matrix}] . \end{matrix}

Every output matrix is assigned randomly to each node, with the constraint that the local state-space model in Equation (10) associated to each node must be observable for every node to guarantee the convergence of the robust Kalman filters at every node. Thus, if any node violates the constraint, all the output matrices are discarded and reassigned. Then, we choose

B = \sqrt{0.001} I

,

R_{k} = D_{k} D_{k}^{T} = \sqrt{k} P R_{0} P^{T}

, where

R_{0} = 0.5 \cdot diag (1, 4, 7)

and P is a permutation matrix which is randomly generated for any node. The initial state

x_{0}

is Gaussian distributed and whose covariance matrix is

P_{0} = I

.

In the numerical simulations, the following Kalman filters are considered: the centralized Kalman filter (KFC); the centralized robust Kalman filter (RKFC); the distributed Kalman filter with diffusion step (KFD) proposed in [14]; the distributed robust Kalman with diffusion step and uniform local tolerance (RKFDU) proposed in [39] and reviewed in Section 3; and the distributed robust Kalman filter with diffusion step and non-uniform local tolerance (RKFDNU) proposed in Section 4. For the distributed filters, the diffusion matrix W is defined as

w_{l k} = \{\begin{matrix} α_{k} n_{l}, & if l \in N_{k} \\ 0, & otherwise \end{matrix}

where

n_{l}

represents the total number of neighbors of node l while

α_{k}

is chosen such that Equation (11) holds.

5.1. First Example

We assume that the actual model is contained in the ambiguity set Equation (2) with

c = 0.02

. Figure 2 shows the least favorable mean squared deviation across the network. We notice that

{\bar{MSD}}_{t}

converges at the steady-state for all the distributed versions of the Kalman filter. RKFDNU performs slightly better that RKFDU and both perform consistently better than KFD. Finally, all of them perform worse than the centralized versions, and RKFC results the best.

However, the situation is more salient if we consider the steady-state least favorable

{MSD}_{k, t}

for each node (see Figure 3a): RKFDNU performs slightly better than RKFDU for the majority of the nodes. However, there is a clear difference for nodes 18 and 19 which are more susceptible to model uncertainty: RKFDNU performs better than RKFDU.

Figure 3b shows the behavior of the local tolerances

c_{k, t}

over the time for RKFDNU. As expected, every

c_{k, t}

converges to a constant value. However, the latter is different from the tolerance c of the centralized minimax problem.

Finally, Figure 4a,b shows the risk sensitivity parameters

θ_{k, t}

at every node for RKFDU and RKFDNU. We can observe that the risk sensitivity parameters of RKFDU takes larger values than the ones of RKFDNU. Accordingly, the inferior performance of RKFDU is due by the fact that the robust local filters are too conservative.

5.2. Second Example

In the second experiment, we consider a larger deviation between the actual model and the nominal one, i.e., we choose

c = 0.06

.

Figure 5 and Figure 6a show the least favorable mean square deviation across the network and for each node in the steady-state. The situation is similar to the previous one, but the difference among RKFD, RKFDU, and RKFDNU is more evident. In particular, the steady-state value of

{\bar{MSD}}_{k, t}

for

k = 18, 19

using RKFDNU is clearly better than the ones corresponding to KFD and RKFDU.

In addition, Figure 6b shows the tolerances

c_{k, t}

at every node over the time. As expected, the latter are higher than the ones with

c = 0.02

. Indeed, the uncertainty now is greater than before and thus the robust local filters now must be more conservative than before.

Finally, we study how the least favorable MSD for each node correlates with the topology of the sensor network. Figure 7a,b shows two additional sensor networks obtained from the original network of Figure 1 by adding connections only to some nodes. More precisely, the density of the original network, i.e., number of connections over all possible connections, is

d_{1} = 0.39

; the density of the networks in Figure 7a,b is

d_{2} = 0.48

and

d_{3} = 0.72

, respectively.

Figure 8a,b shows the results obtained by RKFDNU with the three different sensor networks. As expected, the increase of the degrees of the nodes, and consequently of the connections in the network, reduces the least favorable MSD related to those nodes at steady-state and the total least favorable MSD across the network. In conclusion, by adding edges the performance of RKFDNU tends to the one obtained in the centralized case (RKFC), where the nodes are considered all connected to each other.

6. Efficient Algorithm

Proposition 1 suggests a simplified version of Algorithm 2. Indeed, if c is sufficiently small, then

c_{k, t}

converges to

{\bar{c}}_{k}

in the steady state for every node of the network. Accordingly, the central unit can compute

{\bar{c}}_{k}

and transmit it to any node once. In this way, the transmission costs are reduced. The resulting procedure is outlined in Algorithm 3.

Algorithm 3 Efficient distributed robust Kalman filter with non-uniform local tolerance at time t.

Input: ${\hat{x}}_{k, t}, V_{k, t}, y_{k, t}, W = {[w_{l k}]}_{l k}$ with $k = 1 \dots N$
Output: ${\hat{x}}_{k, t + 1}, V_{k, t + 1}$ with $k = 1 \dots N$
Incremental step. Compute at every node k:

$\begin{matrix} ψ_{k, t + 1} & = A {\hat{x}}_{k, t} + A {(V_{k, t}^{- 1} + S_{k})}^{- 1} \sum_{l \in N_{k}} C_{l}^{T} R_{l}^{- 1} (y_{l, t} - C_{l} {\hat{x}}_{k, t}) + r_{t} \\ P_{k, t + 1} & = A {(V_{k, t}^{- 1} + S_{k})}^{- 1} A^{T} + B B^{T} \\ Find & θ_{k, t} s . t . γ (P_{k, t + 1}, θ_{k, t}) = {\bar{c}}_{k} \\ V_{k, t + 1} & = {(P_{k, t + 1}^{- 1} - θ_{k, t} I)}^{- 1} \end{matrix}$

(44)
Diffusion step. Compute at every node k:

${\hat{x}}_{k, t + 1} = \sum_{l \in N_{k}} w_{l k} ψ_{l, t + 1}$

(45)

We compared this algorithm, hereafter called RKFDNU2, with RKFDNU: the performance in practice is the same. Figure 9 shows their least favorable mean square deviation across the network in the scenario of Section 5.2 in the first 50 time steps. Finally, Figure 10a,b shows the risk sensitivity parameters for RKDFNU and RKFDNU2, respectively: there is a slight difference. However, we saw that such a difference disappears after 20 time steps. We conclude that the efficient scheme RKFDNU2 represents a good approximation of RKFDNU.

Finally, Table 1 summarizes the performance of RKFC, RKFDU, RKFDNU, and RKFDNU2 obtained with tolerance

c = 0.02

. The considered values are the least favorable MSD across the network at steady-state, the average among every node of the tolerances at steady-state, the average among every node of the risk sensitivity parameter at steady-state, and the occurred communications between the central unit and the local nodes in the whole time span. In particular, concerning the communication:

in RKFDU, the central unit transmits the uniform tolerance to each node at once (at the beginning);
in RKFDNU, the central unit transmits the local tolerances to each node at every time step;
in RKFDNU2, the central unit transmits the steady-state local tolerances to each node at once (at the beginning).

7. Conclusions

In this article, the problem of distributed robust Kalman filtering for a sensor network is considered. More precisely, we consider a distributed scheme with diffusion step and the intermediate estimate is designed in order to be optimal according to the least favorable model belonging to a prescribed local ambiguity set. The latter is a ball about the local nominal model and the radius of this ball is the local tolerance. In this paper, we propose an algorithm in which the local tolerance of each node is different and suitably computed by the central unit. We also consider a more efficient implementation of the algorithm where the central unit computes and transmits the steady-state local tolerances for every node at once. In this way, the communication between the central unit and the local nodes is reduced. Through some numerical examples, we showed that the proposed algorithm performs better than the one with a uniform local tolerance across the network.

Author Contributions

Conceptualization, A.E., F.G., G.G. and M.Z.; methodology, A.E., F.G., G.G. and M.Z.; simulations, A.E., F.G., G.G. and M.Z.; supervision, M.Z.; All the authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

Part of this work has been supported by the MIUR (Italian Minister for Education) under the initiative “Departments of Excellence” (Law 232/2016).

Conflicts of Interest

The authors declare no conflict of interest.

References

Wu, H.N.; Wang, H.D. Distributed consensus observers-based H_∞ control of dissipative PDE systems using sensor networks. IEEE Trans. Control Netw. Syst. 2015, 2, 112–121. [Google Scholar] [CrossRef]
Orihuela, L.; Millán, P.; Vivas, C.; Rubio, F.R. Distributed control and estimation scheme with applications to process control. IEEE Trans. Control Syst. Technol. 2015, 23, 1563–1570. [Google Scholar] [CrossRef]
Sadamoto, T.; Ishizaki, T.; Imura, J. Average state observers for large-scale network systems. IEEE Trans. Control Netw. Syst. 2017, 4, 761–769. [Google Scholar] [CrossRef]
Wang, S.; Ren, W.; Chen, J. Fully distributed dynamic state estimation with uncertain process models. IEEE Trans. Control Netw. Syst. 2017, 5, 1841–1851. [Google Scholar] [CrossRef]
Dormann, K.; Noack, B.; Hanebeck, U. Optimally distributed Kalman filtering with data-driven communication. Sensors 2018, 18, 1034. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ruan, Y.; Luo, Y.; Zhu, Y. Globally optimal distributed Kalman filtering for multisensor systems with unknown inputs. Sensors 2018, 18, 2976. [Google Scholar] [CrossRef] [Green Version]
Gao, S.; Chen, P.; Huang, D.; Niu, Q. Stability analysis of multi-sensor Kalman filtering over lossy networks. Sensors 2016, 16, 566. [Google Scholar] [CrossRef] [Green Version]
Kordestani, M.; Samadi, M.; Saif, M. A distributed fault detection and isolation method for multifunctional spoiler system. In Proceedings of the 2018 IEEE 61st International Midwest Symposium on Circuits and Systems, Windsor, ON, Canada, 5–8 August 2018; pp. 380–383. [Google Scholar]
Scott, S.; Blocker, A.; Bonassi, F.; Chipman, H.; George, E.; McCulloch, R. Bayes and big data: The consensus Monte Carlo algorithm. EFaBBayes 250th Conf. 2013, 16. [Google Scholar] [CrossRef] [Green Version]
Spanos, D.P.; Olfati-Saber, R.; Murray, R.M. Approximate distributed Kalman filtering in sensor networks with quantifiable performance. In Proceedings of the Fourth International Symposium on Information Processing in Sensor Networks, Boise, ID, USA, 15 April 2005; pp. 133–139. [Google Scholar]
Olfati-Saber, R. Distributed Kalman filter with embedded consensus filters. In Proceedings of the 44th IEEE Conference on Decision and Control, Seville, Spain, 15 December 2005; pp. 8179–8184. [Google Scholar]
Olfati-Saber, R. Distributed Kalman filtering for sensor networks. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; pp. 5492–5498. [Google Scholar]
Carli, R.; Chiuso, A.; Schenato, L.; Zampieri, S. Distributed Kalman filtering based on consensus strategies. IEEE J. Sel. Areas Commun. 2008, 26, 622–633. [Google Scholar] [CrossRef] [Green Version]
Cattivelli, F.S.; Sayed, A.H. Diffusion strategies for distributed Kalman filtering and smoothing. IEEE Trans. Autom. Control 2010, 55, 2069–2084. [Google Scholar] [CrossRef]
Yang, S.; Huang, T.; Guan, J.; Xiong, Y.; Wang, M. Diffusion Strategies for Distributed Kalman Filter with Dynamic Topologies in Virtualized Sensor Networks. Mob. Inf. Syst. 2016, 2016, 8695102. [Google Scholar] [CrossRef] [Green Version]
Ji, H.; Lewis, F.L.; Hou, Z.; Mikulski, D. Distributed information weighted Kalman consensus filter for sensor networks. Automatica 2017, 77, 18–30. [Google Scholar] [CrossRef] [Green Version]
Yang, H.; Li, H.; Xia, Y.; Li, L. Distributed Kalman filtering over sensor networks with transmission delays. IEEE Trans. Cybern. 2020, 1–11. [Google Scholar] [CrossRef] [PubMed]
Kordestani, M.; Dehghani, M.; Moshiri, B.; Saif, M. A new fusion estimation method for multi-rate multi-sensor systems with missing measurements. IEEE Access 2020, 8, 47522–47532. [Google Scholar] [CrossRef]
Luengo, D.; Martino, L.; Elvira, V.; Bugallo, M.F. Efficient linear fusion of partial estimators. Digit. Signal Process. 2018, 78, 265–283. [Google Scholar] [CrossRef]
Song, E.; Zhu, Y.; Zhou, J.; You, Z. Optimal Kalman filtering fusion with cross-correlated sensor noises. Automatica 2007, 43, 1450–1456. [Google Scholar] [CrossRef]
Xu, J.; Song, E.; Luo, Y.; Zhu, Y. Optimal distributed Kalman filtering fusion algorithm without invertibility of estimation error and sensor noise covariances. IEEE Signal Process. Lett. 2012, 19, 55–58. [Google Scholar] [CrossRef]
Sun, S.L.; Deng, Z.L. Multi-sensor optimal information fusion Kalman filter. Automatica 2004, 40, 1017–1023. [Google Scholar] [CrossRef]
Feng, J.; Zeng, M. Optimal distributed Kalman filtering fusion for a linear dynamic system with cross-correlated noises. Int. J. Syst. Sci. 2012, 43, 385–398. [Google Scholar] [CrossRef]
Andrieu, C.; Doucet, A.; Holenstein, R. Particle Markov chain Monte Carlo methods. J. R. Statist. Soc. B 2010, 72, 269–342. [Google Scholar] [CrossRef] [Green Version]
Martino, L.; Elvira, V.; Camps-Valls, G. Distributed particle Metropolis-Hastings schemes. In Proceedings of the IEEE Statistical Signal Processing Workshop (SSP), Freiburg, Germany, 10–13 June 2018. [Google Scholar]
Boel, R.; James, M.; Petersen, I. Robustness and risk-sensitive filtering. IEEE Trans. Automat. Control 2002, 47, 451–461. [Google Scholar] [CrossRef]
Hansen, L.; Sargent, T. Robustness; Princeton University Press: Princeton, NJ, USA, 2008. [Google Scholar]
Levy, B.; Zorzi, M. A Contraction analysis of the convergence of risk-sensitive filters. SIAM J. Control Optim. 2016, 54, 2154–2173. [Google Scholar] [CrossRef] [Green Version]
Levy, B.; Nikoukhah, R. Robust state-space filtering under incremental model perturbations subject to a relative entropy tolerance. IEEE Trans. Automat. Control 2013, 58, 682–695. [Google Scholar] [CrossRef] [Green Version]
Levy, B.; Nikoukhah, R. Robust least-squares estimation with a relative entropy constraint. Inf. Theory IEEE Trans. 2004, 50, 89–104. [Google Scholar] [CrossRef]
Zenere, A.; Zorzi, M. On the coupling of model predictive control and robust Kalman filtering. IET Control Theory Appl. 2018, 12, 1873–1881. [Google Scholar] [CrossRef] [Green Version]
Zenere, A.; Zorzi, M. Model predictive control meets robust Kalman filtering. In Proceedings of the 20th IFAC World Congress, Toulouse, France, 9–14 July 2017. [Google Scholar]
Zorzi, M. On the robustness of the Bayes and Wiener estimators under model uncertainty. Automatica 2017, 83, 133–140. [Google Scholar] [CrossRef] [Green Version]
Zorzi, M. Robust Kalman filtering under model perturbations. IEEE Trans. Autom. Control 2017, 62, 2902–2907. [Google Scholar] [CrossRef] [Green Version]
Abadeh, S.; Nguyen, V.; Kuhn, D.; Esfahani, P. Wasserstein distributionally robust Kalman filtering. In Advances in Neural Information Processing Systems; MIT Press: Montreal, CA, USA, 2018; pp. 8474–8483. [Google Scholar]
Shen, B.; Wang, Z.; Hung, Y. Distributed H_∞-consensus filtering in sensor networks with multiple missing measurements: The finite-horizon case. Automatica 2010, 46, 1682–1688. [Google Scholar] [CrossRef] [Green Version]
Luo, Y.; Zhu, Y.; Luo, D.; Zhou, J.; Song, E.; Wang, D. Globally optimal multisensor distributed random parameter matrices Kalman filtering fusion with applications. Sensors 2008, 8, 8086–8103. [Google Scholar] [CrossRef]
Huang, J.; Shi, D.; Chen, T. Distributed robust state estimation for sensor networks: A risk-sensitive approach. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA, 17–19 December 2018; pp. 6378–6383. [Google Scholar]
Zorzi, M. Distributed Kalman filtering under model uncertainty. IEEE Trans. Control Netw. Syst. 2019. [Google Scholar] [CrossRef] [Green Version]
Cover, T.; Thomas, J. Information Theory; Wiley: New York, NY, USA, 1991. [Google Scholar]
Zorzi, M. An interpretation of the dual problem of the THREE-like approaches. Automatica 2015, 62, 87–92. [Google Scholar] [CrossRef] [Green Version]
Zorzi, M. A new family of high-resolution multivariate spectral estimators. IEEE Trans. Autom. Control 2014, 59, 892–904. [Google Scholar] [CrossRef]
Zorzi, M.; Levy, B.C. On the convergence of a risk sensitive like filter. In Proceedings of the 54th IEEE Conference on Decision and Control (CDC), Osaka, Japan, 15–18 December 2015; pp. 4990–4995. [Google Scholar]
Zorzi, M. Convergence analysis of a family of robust Kalman filters based on the contraction principle. SIAM J. Control Optim. 2017, 55, 3116–3131. [Google Scholar] [CrossRef]
Zorzi, M.; Levy, B. Robust Kalman filtering: Asymptotic analysis of the least favorable model. In Proceedings of the 57th IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA, 17–19 December 2018. [Google Scholar]

Figure 1. Network of 20 sensors for measuring the noisy positions of the projectile.

Figure 2. Least favorable mean square deviation across the network with tolerance

c = 0.02

.

Figure 2. Least favorable mean square deviation across the network with tolerance

c = 0.02

.

Figure 3. (a) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.02

. (b) Time-variant local tolerances for each node over time with

c = 0.02

.

Figure 3. (a) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.02

. (b) Time-variant local tolerances for each node over time with

c = 0.02

.

Figure 4. (a) Risk sensitivity parameter for each node using RKFDU with tolerance

c = 0.02

. (b) Risk sensitivity parameter for each node using RKFDNU with tolerance

c = 0.02

.

Figure 4. (a) Risk sensitivity parameter for each node using RKFDU with tolerance

c = 0.02

. (b) Risk sensitivity parameter for each node using RKFDNU with tolerance

c = 0.02

.

Figure 5. Least favorable mean square deviation across the network with tolerance

c = 0.06

.

Figure 5. Least favorable mean square deviation across the network with tolerance

c = 0.06

.

Figure 6. (a) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.06

. (b) Time-variant local tolerances for each node over time with

c = 0.06

.

Figure 6. (a) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.06

. (b) Time-variant local tolerances for each node over time with

c = 0.06

.

Figure 7. (a) Sensor network with density

d_{2} = 0.48

. (b) Sensor network with density

d_{3} = 0.72

.

Figure 7. (a) Sensor network with density

d_{2} = 0.48

. (b) Sensor network with density

d_{3} = 0.72

.

Figure 8. Performance of RKFDNU across the three networks compared with RKFC. (a) Least favorable mean square deviation with tolerance

c = 0.06

. (b) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.06

.

Figure 8. Performance of RKFDNU across the three networks compared with RKFC. (a) Least favorable mean square deviation with tolerance

c = 0.06

. (b) Least favorable mean square deviation for each node in the steady-state with tolerance

c = 0.06

.

Figure 9. Least favorable mean square deviation across the network with tolerance

c = 0.02

.

Figure 9. Least favorable mean square deviation across the network with tolerance

c = 0.02

.

Figure 10. (a) Risk sensitivity parameter for each node using RKFDNU with tolerance

c = 0.02

. (b) Risk sensitivity parameter for each node using RKFDNU2 with tolerance

c = 0.02

.

Figure 10. (a) Risk sensitivity parameter for each node using RKFDNU with tolerance

c = 0.02

. (b) Risk sensitivity parameter for each node using RKFDNU2 with tolerance

c = 0.02

.

Table 1. Summary of the performance of the different algorithms with tolerance

c = 0.02

.

Table 1. Summary of the performance of the different algorithms with tolerance

c = 0.02

.

	RKFC	RKFDU	RKFDNU	RKFDNU2
MSD [dB]	−2.9174	−1.5553	−1.6182	−1.6182
Average tolerance	0.02	0.02	0.0144	0.0144
Average risk sensitivity parameter	1.3366	0.3764	0.3576	0.3576
Communication requirement	N.A.	One at the beginning	Every time step	One at the beginning

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Emanuele, A.; Gasparotto, F.; Guerra, G.; Zorzi, M. Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance. Sensors 2020, 20, 3244. https://doi.org/10.3390/s20113244

AMA Style

Emanuele A, Gasparotto F, Guerra G, Zorzi M. Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance. Sensors. 2020; 20(11):3244. https://doi.org/10.3390/s20113244

Chicago/Turabian Style

Emanuele, Alessandro, Francesco Gasparotto, Giacomo Guerra, and Mattia Zorzi. 2020. "Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance" Sensors 20, no. 11: 3244. https://doi.org/10.3390/s20113244

APA Style

Emanuele, A., Gasparotto, F., Guerra, G., & Zorzi, M. (2020). Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance. Sensors, 20(11), 3244. https://doi.org/10.3390/s20113244

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Robust Distributed Kalman Filtering: On the Choice of the Local Tolerance

Abstract

1. Introduction

2. Background

3. Distributed Robust Kalman Filtering with Uniform Local Tolerance

4. Distributed Robust Kalman Filtering with Non-Uniform Local Tolerance

Least Favorable Performance

5. Numerical Examples

5.1. First Example

5.2. Second Example

6. Efficient Algorithm

7. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI