Open Access
This article is

- freely available
- re-usable

*Sensors*
**2019**,
*19*(14),
3112;
https://doi.org/10.3390/s19143112

Article

Covariance-Based Estimation for Clustered Sensor Networks Subject to Random Deception Attacks

^{1}

Dpto. de Estadística, Universidad de Jaén, Paraje Las Lagunillas, 23071 Jaén, Spain

^{2}

Dpto. de Estadística, Universidad de Granada, Avda. Fuentenueva, 18071 Granada, Spain

*

Correspondence: [email protected]; Tel.: +34-953-212-926

^{†}

These authors contributed equally to this work.

Received: 30 May 2019 / Accepted: 12 July 2019 / Published: 14 July 2019

## Abstract

**:**

In this paper, a cluster-based approach is used to address the distributed fusion estimation problem (filtering and fixed-point smoothing) for discrete-time stochastic signals in the presence of random deception attacks. At each sampling time, measured outputs of the signal are provided by a networked system, whose sensors are grouped into clusters. Each cluster is connected to a local processor which gathers the measured outputs of its sensors and, in turn, the local processors of all clusters are connected with a global fusion center. The proposed cluster-based fusion estimation structure involves two stages. First, every single sensor in a cluster transmits its observations to the corresponding local processor, where least-squares local estimators are designed by an innovation approach. During this transmission, deception attacks to the sensor measurements may be randomly launched by an adversary, with known probabilities of success that may be different at each sensor. In the second stage, the local estimators are sent to the fusion center, where they are combined to generate the proposed fusion estimators. The covariance-based design of the distributed fusion filtering and fixed-point smoothing algorithms does not require full knowledge of the signal evolution model, but only the first and second order moments of the processes involved in the observation model. Simulations are provided to illustrate the theoretical results and analyze the effect of the attack success probability on the estimation performance.

Keywords:

least-squares filtering; least-squares fixed-point smoothing; networked systems; cluster-based approach; stochastic deception attacks## 1. Introduction

Nowadays, communication networks are widely used to cope with signal estimation problems in engineering, economy, health or security, among others, since they generally provide more robust and precise estimators of the target signal than a single sensor. Conventional centralized and distributed fusion estimation architectures have been widely studied under a state-space approach (see e.g., [1,2,3,4] and references therein). In addition, assuming that the evolution model of the signal is not fully known and only covariance information is available, centralized and distributed fusion estimation algorithms have been proposed for sensor networks affected by different network-induced random uncertainties (see e.g., [5,6,7]). A comprehensive survey of recent developments in estimation and fusion for networked systems with randomly occurring phenomena can be found in [8]. The key theories and methodologies of distributed multisensor data fusion are comprehensively reviewed in [9] and a survey of distributed fusion estimation algorithms with applications in networked systems, including an interesting analysis of some network-induced uncertainties, is provided in [10].

These conventional centralized and distributed fusion architectures require connecting all sensor nodes to a central processor, which sometimes can involve a serious communication burden, especially when the number of sensors is large. Additionally, in the distributed fusion estimation architecture, equipping each single sensor with a local estimator may be sometimes unaffordable. To overcome these shortcomings, a usual practice in engineering consists of arranging the sensors in clusters and selecting one node per cluster (called cluster-head node) acting as a local processor. Each cluster head first collects measurements from the regular sensor nodes of its cluster to generate a local estimator and then all local estimators are collected in the central processor, where the distributed fusion estimator is generated. This structure, called hierarchical cluster-based estimation structure, provides an efficient way to analyze big data in a great variety of application fields [11]. Clustering is also widely used, for example, in underwater sensor networks established for military purposes, such as anti-submarine warfare, communications, positioning and guidance [12]. An essential issue when dealing with complex networked systems consisting of multiple clusters or individual subsystems is coordination control and the design of appropriate consensus protocols; relevant results concerning these problems in different kinds of networked systems are found in [13,14,15,16]. A state-of-the-art and comprehensive survey on clustering approaches can be found in [17] and a detailed outline of some modern energy-efficient clustering approaches to improve the lifetime of wireless sensor networks can be seen in [18]. Recently, a comprehensive review with comparisons and classifications of different optimized clustering approaches according to different metrics has been carried out in [19].

A remarkable disadvantage of using sensor networks for signal estimation is that their reliability can be compromised by possible cyber-attacks from adversaries. For this reason, cyber-security of networked systems is drawing considerable research attention. The most typical kinds of attacks are the denial-of-service attacks and the deception attacks [20]. While the first ones strike at data availability by obstructing the flow of information through the network, the second ones violate data integrity by injecting false information that modifies the real data packets. Such false information can include a wrong sensor measurement or control input, an incorrect time-stamp or a wrong identity of the sending device. The fusion estimation problem for stochastic signals from measured data coming from a sensor network subject to deception attacks is currently an important focus of research (see e.g., [20,21,22,23,24,25,26] and references therein). In [20], new insecurity conditions for the state estimation problem under false data injection attacks are proposed. In [21], the variance-constrained distributed filtering problem is studied for time-varying systems subject to multiplicative deception attacks with bounded attack noises. A distributed recursive filtering algorithm is designed in [22] for a class of discrete time-delayed systems subject to both uniform quantization and intermittent deception attacks. In [23], the security-guaranteed filtering problem is addressed for a class of nonlinear discrete time-delayed systems with both stochastic sensor saturations and deception attacks. The centralized security-guaranteed filtering problem for linear discrete time-invariant stochastic systems with multi-rate sensor nodes is addressed in [24], when deception attacks are launched during the transmission of information from the sensors to the centralized filter. In [25], the cluster-based covariance intersection fusion estimation problem under stochastic deception attacks is investigated. An integrated analysis of event-triggered fault detection and fault estimation is proposed in [26] for a class of discrete-time stochastic systems subject to unknown disturbances and stochastic deception attacks.

Research motivation. In this paper, distributed fusion filtering and fixed-point smoothing algorithms are designed for clustered sensor networks subject to stochastic linear deception attacks. This study is motivated by the following challenges:

(i) To find out fusion estimation structures for multi-sensor networked systems that reduce transmission burdens (with respect to both centralized and distributed fusion) and the cost of embedding a local processor in each sensor (with respect to distributed fusion).

(ii) To design recursive and easily implementable estimation algorithms with high estimation accuracy, fusing unreliable multi-sensor data, corrupted by possible deception attacks, under general assumptions on the target signal that do not require the full knowledge of the system state-space model.

Paper contributions. In light of our motivations, the main contributions of this paper are highlighted as follows:

(i) The fusion estimation problem in multi-sensor networked systems when the sensors are grouped into clusters is investigated assuming, on the one hand, that the measurements are subject to stochastic deception attacks and, on the other, that the signal evolution model is not necessarily known, but only information about its first and second order statistical properties (covariance information) is available. This covariance-based estimation approach is more general than the conventional one based on the full knowledge of the state-space model of the system since, in that case, the mean and autocovariance function of the signal process can be calculated. Hence, this estimation approach provides a comprehensive framework to deal with a great variety of stochastic signals.

(ii) A two-stage fusion estimation algorithm is designed for both filtering and fixed-point smoothing problems. In the first stage, each local processor collects measurements (subject to random deception attacks) from its cluster to generate local least-squares linear estimators; in this stage, the innovation approach is used to design the local estimation algorithms, which are recursive and computationally simple. In the second stage, in order to improve the estimation performance, all these local estimators are transmitted to the fusion center, where fusion estimators are obtained by matrix-weighted linear combinations of the local estimators; the weight matrices are computed by minimizing the mean squared estimation error, for which the cross-correlation matrices between any two local estimators need to be previously calculated.

In contrast with previous papers concerning the fusion estimation problem under a covariance-based approach, this is the first time that a cluster structure of the sensor network is used and stochastic deception attacks are considered. In comparison with the existing literature about the estimation problem in clustered sensor networks, either subject to random deception attacks or not, the main difference is the kind of information required for the derivation of the algorithms: full knowledge of the state-space model in the existing literature and only covariance information (which covers the conventional formulation based on state-space model and also more general situations) in the current paper.

Paper structure. The measurement model framework is presented in Section 2, where the equations of the clustering sensor measurements and the stochastic deception attacks, together with the hypotheses under which the distributed estimation problem will be addressed, are specified. In Section 3 and Section 4 the first and second stages of the proposed cluster-based fusion estimation structure are developed. The local least-squares linear filtering and fixed-point smoothing estimators, as well as formulas for the error covariance matrices, are obtained by recursive algorithms in Section 3. The proposed distributed fusion estimators are proposed in Section 4 by a matrix-weighted linear combination of the local estimators using the mean squared error as optimality criterion; also, to measure the estimation accuracy, recursive formulas for the error covariance matrices are derived. The performance of the proposed estimation algorithms is analyzed in a numerical simulation study, carried out in Section 5. Finally, some concluding remarks are provided in Section 6.

Notation. Unless otherwise stated, the notation used in the paper is fairly standard. ${\mathbb{R}}^{n}$ denotes the n-dimensional Euclidean space and ${\mathbb{R}}^{m\times n}$ the set of all $m\times n$ real matrices. When the dimensions of vectors or matrices are not specified, they are assumed to be compatible with algebraic operations. ${\mathbf{1}}_{n}$ denotes the n-dimensional column vector with all ones, while the identity and zero matrices are denoted by I and 0, respectively. For any function ${\Gamma}_{h,k}$, depending on h and k, and ${\Xi}^{\left(rs\right)}$, depending on r and s, we will write ${\Gamma}_{k}={\Gamma}_{k,k}$ and ${\Xi}^{\left(r\right)}={\Xi}^{\left(rr\right)}$, respectively, for simplicity. The Kronecker product of the matrices $A\in {\mathbb{R}}^{m\times n}$ and $B\in {\mathbb{R}}^{{m}^{\prime}\times {n}^{\prime}}$, which is a matrix in ${\mathbb{R}}^{m{m}^{\prime}\times n{n}^{\prime}}$, is denoted by $A\otimes B$. The Hadamard product of the matrices $C,D\in {\mathbb{R}}^{m\times n}$, which is also a matrix in ${\mathbb{R}}^{m\times n}$, is denoted by $C\circ D$ and it is defined by ${(C\circ D)}_{ij}={C}_{ij}{D}_{ij}$. ${\delta}_{k,s}$ is the Kronecker delta function, i.e., ${\delta}_{k,s}=1$ when $k=s$ and zero otherwise. The autocovariance function of a second-order process ${\left\{{\alpha}_{k}\right\}}_{k\ge 1}$ is defined as $E\left[({\alpha}_{k}-E\left[{\alpha}_{k}\right]){({\alpha}_{s}-E\left[{\alpha}_{s}\right])}^{T}\right],\phantom{\rule{4pt}{0ex}}k,s\ge 1,$ where $E[\xb7]$ stands for the mathematical expectation operator.

## 2. Estimation Problem Formulation and Measurement Model

In this paper, we consider the distributed fusion estimation problem of a discrete-time stochastic signal from a set of measurements provided by multiple sensors. A cluster-based estimation structure will be adopted: the sensors are grouped into different clusters, each of them connected with a local processor which, in turn, is connected with a global fusion center (FC). The distributed fusion estimation process operates in two stages. In the first one, the measured outputs provided by the sensors of each cluster are sent to the corresponding local processor, where local least-squares (LS) linear estimators are obtained; it is assumed that, during the transmission to the local processors, the clustering sensor measurements are subject to stochastic linear deception attacks. In the second stage, the local estimators received from all the local processors are gathered and fused in the FC, where the proposed distributed signal estimators are generated by a matrix-weighted linear combination of the local estimators using the mean squared error as optimality criterion.

The aforementioned estimation problem will be addressed without requiring full knowledge of the evolution model generating the signal process. Instead, it will be assumed that the signal mean function is zero and its covariance function is factorizable (covariance-based estimation approach). More precisely, the following assumption is required ([7]):

**(A1)**- The ${n}_{x}$-dimensional signal ${\left\{{x}_{k}\right\}}_{k\ge 1}$ is a zero-mean second-order process and its autocovariance function is expressed in a separable form; namely, $E\left[{x}_{k}{x}_{h}^{T}\right]={A}_{k}{B}_{h}^{T},\phantom{\rule{4pt}{0ex}}h\le k,$ where ${A}_{k},{B}_{h}\in {\mathbb{R}}^{{n}_{x}\times n}$ are known matrices.

The signal process of the most common signal evolution models (e.g., the signal of linear systems, or that of uncertain systems with a sum of multiple multiplicative noise terms) meets this assumption

**(A1)**and, hence, the covariance-based estimation approach provides a comprehensive framework to cope with different signal evolution models, thus overcoming the necessity of deriving specific algorithms for each situation.#### 2.1. Clustering Sensor Measurements and Stochastic Deception Attacks

Consider a sensor network and assume that the sensor nodes are grouped into L clusters to measure the stochastic signal of interest. Specifically, assume that each cluster $r=1,\dots ,L$ is made up of ${m}_{r}$ sensors that provide measurements of the signal according to the following model:
where ${C}_{k}^{\left(r\right(i\left)\right)}$ are known matrices and ${z}_{k}^{\left(r\right(i\left)\right)}\in {\mathbb{R}}^{{n}_{z}}$ is the signal measured output from the i-th sensor of the r-th cluster at time k, which is transmitted to the r-th local processor to obtain the local LS linear signal estimators. The following assumption is required on the measurement noises, ${\left\{{v}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L$:

$${z}_{k}^{\left(r\right(i\left)\right)}={C}_{k}^{\left(r\right(i\left)\right)}{x}_{k}+{v}_{k}^{\left(r\right(i\left)\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L,$$

**(A2)**- The measurement noises of different clusters are independent and, for each $r=1,\dots ,L$, the noises ${\left\{{v}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $i=1,\dots ,{m}_{r}$, are zero-mean second-order white processes with known covariance matrices$$E\left[{v}_{k}^{\left(r\right(i\left)\right)}{v}_{h}^{\left(r\right(j\left)\right)T}\right]={R}_{k}^{\left(r\right(ij\left)\right)}{\delta}_{k,h},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k,h\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i,j=1,\dots ,{m}_{r}.$$

For each cluster $r=1,\dots ,L$, the transmissions of the measured outputs ${z}_{k}^{\left(r\right(i\left)\right)},\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r},$ to the r-th local processor are affected by random linear deception attacks and the deceptive signal injected by the attackers, ${\xi}_{k}^{\left(r\right(i\left)\right)}$, is described by:

$${\xi}_{k}^{\left(r\right(i\left)\right)}=-{z}_{k}^{\left(r\right(i\left)\right)}+{w}_{k}^{\left(r\right(i\left)\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L.$$

This signal involves two parts: the first one neutralizes the true information and the second one is the blurred information (noise) added by the attackers. These noises, ${\left\{{w}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L$, are assumed to satisfy the following requirement:

**(A3)**- The attack noises of different clusters are independent and, for each $r=1,\dots ,L$, the noises ${\left\{{w}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $i=1,\dots ,{m}_{r}$, are zero-mean second-order white processes with known covariance matrices$$E\left[{w}_{k}^{\left(r\right(i\left)\right)}{w}_{h}^{\left(r\right(j\left)\right)T}\right]={S}_{k}^{\left(r\right(ij\left)\right)}{\delta}_{k,h},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k,h\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i,j=1,\dots ,{m}_{r}.$$

#### 2.2. Measurements Received by the Local Processors

Usually, in practice, the attacks may randomly succeed or not. So, taking into account this random nature of the attacks, for every $r=1,\dots ,L$, the measurements received by the r-th local processor are modelled by introducing different sequences of Bernoulli random variables, ${\left\{{\lambda}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $i=1,\dots ,{m}_{r}$. For each $r=1,\dots ,L$, and $i=1,\dots ,{m}_{r}$, the value ${\lambda}_{k}^{\left(r\right(i\left)\right)}=1$ models a successful attack into the i-th communication channel from the r-th cluster, meaning that only noise ${w}_{k}^{\left(r\right(i\left)\right)}$ arrives to the r-th local processor; conversely, the value ${\lambda}_{k}^{\left(r\right(i\left)\right)}=0$ models a failed attack, which means that the real measured output ${z}_{k}^{\left(r\right(i\left)\right)}$ is received by the r-th local processor. Taking these considerations into account, the following model for ${y}_{k}^{\left(r\right(i\left)\right)}$, the measurements received by the r-th local processor, is considered:
or, equivalently, by substituting (1) and (2) into (3), we have:

$${y}_{k}^{\left(r\right(i\left)\right)}={z}_{k}^{\left(r\right(i\left)\right)}+{\lambda}_{k}^{\left(r\right(i\left)\right)}{\xi}_{k}^{\left(r\right(i\left)\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L,$$

$${y}_{k}^{\left(r\right(i\left)\right)}=\left(1-{\lambda}_{k}^{\left(r\right(i\left)\right)}\right)\left({C}_{k}^{\left(r\right(i\left)\right)}{x}_{k}+{v}_{k}^{\left(r\right(i\left)\right)}\right)+{\lambda}_{k}^{\left(r\right(i\left)\right)}{w}_{k}^{\left(r\right(i\left)\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L.$$

The following assumption is imposed on the Bernoulli random variables describing the success or failure of attacks:

**(A4)**- ${\left\{{\lambda}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, $r=1,\dots ,L$, $i=1,\dots ,{m}_{r}$, are independent sequences of independent Bernoulli random variables with known probabilities $P\left({\lambda}_{k}^{\left(r\right(i\left)\right)}=1\right)={\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)}$.

From this assumption, if we denote ${\lambda}_{k}^{\left(r\right)}={\left({\lambda}_{k}^{\left(r\right(1\left)\right)},\dots ,{\lambda}_{k}^{\left(r\left({m}_{r}\right)\right)}\right)}^{T}\otimes {\mathbf{1}}_{{n}_{z}}$, for $r=1,\dots ,L$, then the correlation matrices ${K}_{k}^{{\lambda}^{\left(r\right)}}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}}\equiv E\left[{\lambda}_{k}^{\left(r\right)}{\lambda}_{k}^{\left(r\right)T}\right]$ and ${K}_{k}^{\mathbf{1}-{\lambda}^{\left(r\right)}}\equiv E\left[\left({\mathbf{1}}_{{m}_{r}{n}_{z}}-{\lambda}_{k}^{\left(r\right)}\right){\left({\mathbf{1}}_{{m}_{r}{n}_{z}}-{\lambda}_{k}^{\left(r\right)}\right)}^{T}\right]$ are known and their entries are easily calculated taking into account that

$$E\left[{\lambda}_{k}^{\left(r\right(i\left)\right)}{\lambda}_{k}^{\left(r\right(j\left)\right)}\right]=\left\{\begin{array}{c}{\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=j,\phantom{\rule{4pt}{0ex}}\hfill \\ {\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)}{\overline{\lambda}}_{k}^{\left(r\right(j\left)\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i\ne j.\hfill \end{array}\right.$$

Finally, the following independence hypothesis is also assumed:

**(A5)**- For $r=1,\dots ,L$ and $i=1,\dots ,{m}_{r}$, the signal process ${\left\{{x}_{k}\right\}}_{k\ge 1}$ and the processes ${\left\{{v}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$, ${\left\{{w}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$ and ${\left\{{\lambda}_{k}^{\left(r\right(i\left)\right)}\right\}}_{k\ge 1}$ are mutually independent.

## 3. First Stage: Local LS Linear Estimators

To start with, as indicated previously, our aim is to calculate in every local processor, $r=1,\dots ,L$, LS linear estimators of the signal based on the measurements received from all the sensors of the r-th cluster. Therefore, to estimate the signal ${x}_{k}$ in the local processor r at time $k+N$, we consider all the measurements received from all the sensors $i=1,\dots ,{m}_{r}$, of the r-th cluster, up to time $k+N$; that is, the measurement set $\left\{{y}_{h}^{\left(r\right(i\left)\right)}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}h\le k+N,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r}\right\}$. So defining the vectors ${y}_{h}^{\left(r\right)}={\left({y}_{h}^{\left(r\right(1\left)\right)T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\dots ,{y}_{h}^{\left(r\left({m}_{r}\right)\right)T}\right)}^{T}$, made up of all the measurements received by the r-th processor at each sampling time h, the problem at hand is formulated as that of determining local LS linear estimators of the signal ${x}_{k}$ based on the vectors $\left\{{y}_{h}^{\left(r\right)}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}h\le k+N\right\}$, for $r=1,\dots ,L$.

#### 3.1. Stacked Model for the Measurements Received by the Local Processors

Taking into account Equation (4) for the measurements received by the r-th local processor, the following model for the above-defined vectors ${y}_{k}^{\left(r\right)}$ is clearly deduced:
where

$${y}_{k}^{\left(r\right)}=\left(I-{\Lambda}_{k}^{\left(r\right)}\right)\left({C}_{k}^{\left(r\right)}{x}_{k}+{v}_{k}^{\left(r\right)}\right)+{\Lambda}_{k}^{\left(r\right)}{w}_{k}^{\left(r\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,\dots ,L,$$

$${C}_{k}^{\left(r\right)}\phantom{\rule{-2.84544pt}{0ex}}=\phantom{\rule{-2.84544pt}{0ex}}\left(\phantom{\rule{-2.84544pt}{0ex}}\begin{array}{c}{C}_{k}^{\left(r\right(1\left)\right)}\\ \vdots \\ {C}_{k}^{\left(r\left({m}_{r}\right)\right)}\end{array}\phantom{\rule{-5.69046pt}{0ex}}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{v}_{k}^{\left(r\right)}\phantom{\rule{-2.84544pt}{0ex}}=\phantom{\rule{-2.84544pt}{0ex}}\left(\phantom{\rule{-2.84544pt}{0ex}}\begin{array}{c}{v}_{k}^{\left(r\right(1\left)\right)}\\ \vdots \\ {v}_{k}^{\left(r\left({m}_{r}\right)\right)}\end{array}\phantom{\rule{-5.69046pt}{0ex}}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{w}_{k}^{\left(r\right)}\phantom{\rule{-2.84544pt}{0ex}}=\phantom{\rule{-2.84544pt}{0ex}}\left(\phantom{\rule{-2.84544pt}{0ex}}\begin{array}{c}{w}_{k}^{\left(r\right(1\left)\right)}\\ \vdots \\ {w}_{k}^{\left(r\left({m}_{r}\right)\right)}\end{array}\phantom{\rule{-5.69046pt}{0ex}}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{\Lambda}_{k}^{\left(r\right)}\phantom{\rule{-2.84544pt}{0ex}}=\phantom{\rule{-2.84544pt}{0ex}}\left(\phantom{\rule{-5.69046pt}{0ex}}\begin{array}{ccc}{\lambda}_{k}^{\left(r\right(1\left)\right)}& \cdots & 0\\ \vdots & \ddots & \vdots \\ 0& \cdots & {\lambda}_{k}^{\left(r\left({m}_{r}\right)\right)}\end{array}\phantom{\rule{-5.69046pt}{0ex}}\right)\otimes I.$$

The following statistical properties of the processes involved in model (6), which will be used to address the LS linear estimation problem, are easily inferred from the model assumptions

**(A1)–(A5)**stated in Section 2:**(P1)**- ${\left\{{v}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,\dots ,L$, are independent zero-mean noise processes with$$E\left[{v}_{k}^{\left(r\right)}{v}_{h}^{\left(s\right)}\right]={R}_{k}^{\left(r\right)}{\delta}_{k,h}{\delta}_{r,s},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\mathrm{where}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{R}_{k}^{\left(r\right)}={\left({R}_{k}^{\left(r\right(ij\left)\right)}\right)}_{i,j=1,\dots ,{m}_{r}}.$$
**(P2)**- ${\left\{{w}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,\dots ,L$, are independent zero-mean noise processes with$$E\left[{w}_{k}^{\left(r\right)}{w}_{h}^{\left(s\right)}\right]={S}_{k}^{\left(r\right)}{\delta}_{k,h}{\delta}_{r,s},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\mathrm{where}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{S}_{k}^{\left(r\right)}={\left({S}_{k}^{\left(r\right(ij\left)\right)}\right)}_{i,j=1,\dots ,{m}_{r}}.$$
**(P3)**- ${\left\{{\Lambda}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,\dots ,L$, are independent sequences of independent random matrices with known means, ${\overline{\Lambda}}_{k}^{\left(r\right)}=Diag\left({\overline{\lambda}}_{k}^{\left(r\right(1\left)\right)},\dots ,{\overline{\lambda}}_{k}^{\left(r\left({m}_{r}\right)\right)}\right)\otimes I.$ Moreover, the Hadamard product properties guarantee that, for any random matrix G independent of ${\Lambda}_{k}^{\left(r\right)}$, $E\left[{\Lambda}_{k}^{\left(r\right)}G{\Lambda}_{k}^{\left(r\right)}\right]={K}_{k}^{{\lambda}^{\left(r\right)}}\circ E\left[G\right]$, where ${K}_{k}^{{\lambda}^{\left(r\right)}}$ is calculated from (5).
**(P4)**- For $r=1,\dots ,L$, the signal process, ${\left\{{x}_{k}\right\}}_{k\ge 1}$, and the processes ${\left\{{v}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, ${\left\{{w}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$ and ${\left\{{\Lambda}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$ are mutually independent.
**(P5)**- ${\left\{{y}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,\dots ,L$, are zero-mean processes with covariance matrices ${\Sigma}_{k}^{{y}^{\left(rs\right)}}\equiv E\left[{y}_{k}^{\left(r\right)}{y}_{k}^{\left(s\right)T}\right]$ given by$${\Sigma}_{k}^{{y}^{\left(rs\right)}}=\left\{\begin{array}{cc}{K}_{k}^{\mathbf{1}-{\lambda}^{\left(r\right)}}\circ \left({C}_{k}^{\left(r\right)}{A}_{k}{B}_{k}^{T}{C}_{k}^{\left(r\right)T}+{R}_{k}^{\left(r\right)}\right)+{K}_{k}^{{\lambda}^{\left(r\right)}}\circ {S}_{k}^{\left(r\right)},\hfill & r=s,\hfill \\ \left(I-{\overline{\Lambda}}_{k}^{\left(r\right)}\right){C}_{k}^{\left(r\right)}{A}_{k}{B}_{k}^{T}{C}_{k}^{\left(s\right)T}\left(I-{\overline{\Lambda}}_{k}^{\left(s\right)}\right),\hfill & r\ne s,\hfill \end{array}\right.$$

#### 3.2. Recursive Local LS Linear Filtering and Fixed-Point Smoothing Algorithms

This subsection is devoted to the design of recursive algorithms, at each local processor $r=1,\dots ,L$, for the LS linear filtering and fixed-point smoothing estimators based on the measurements received from all the sensors of the r-th cluster. In other words, for the r-th cluster, $r=1,\dots ,L$, the aim in this subsection is the design of algorithms to obtain the local LS linear estimators, ${\widehat{x}}_{k/k+N}^{\left(r\right)}$, $N\ge 0$, of the signal ${x}_{k}$ based on the vectors ${y}_{h}^{\left(r\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}h\le k+N,$ given by (6); specifically, a recursive algorithm for the local LS filter ${\widehat{x}}_{k/k}^{\left(r\right)}$, $k\ge 1$, and a recursive algorithm for the local LS smoother ${\widehat{x}}_{k/k+N}^{\left(r\right)}$, for fixed $k\ge 1$ and $N=1,2,\dots $, will be derived.

For this purpose, we will use the measurement innovations rather than the raw measurements (innovation approach), where the innovation at time k is defined as ${\mu}_{k}^{\left(r\right)}\equiv {y}_{k}^{\left(r\right)}-{\widehat{y}}_{k/k-1}^{\left(r\right)}$, being ${\widehat{y}}_{k/k-1}^{\left(r\right)}$ the one-stage observation predictor (LS linear estimator of ${y}_{k}^{\left(r\right)}$ based on ${y}_{h}^{\left(r\right)}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}h\le k-1$). Taking into account the properties of the innovation process, the following expression for the LS linear estimators is derived:

$${\widehat{x}}_{k/H}^{\left(r\right)}=\sum _{h=1}^{H}E\left[{x}_{k}{\mu}_{h}^{\left(r\right)T}\right]{\left(E\left[{\mu}_{h}^{\left(r\right)}{\mu}_{h}^{\left(r\right)T}\right]\right)}^{-1}{\mu}_{h}^{\left(r\right)}.$$

Taking orthogonal projections in (6) and using properties

**(P2)–(P4)**, we get that the one-stage observation predictor is ${\widehat{y}}_{h/h-1}^{\left(r\right)}=\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right){C}_{h}^{\left(r\right)}{\widehat{x}}_{h/h-1}^{\left(r\right)}$ and, consequently, the innovation is given by
$${\mu}_{h}^{\left(r\right)}={y}_{h}^{\left(r\right)}-\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right){C}_{h}^{\left(r\right)}{\widehat{x}}_{h/h-1}^{\left(r\right)}.$$

Furthermore, from property

**(P5)**, it is clear that the innovation covariance matrix, ${\Pi}_{h}^{\left(r\right)}\equiv E\left[{\mu}_{h}^{\left(r\right)}{\mu}_{h}^{\left(r\right)T}\right]=E\left[{y}_{h}^{\left(r\right)}{y}_{h}^{\left(r\right)T}\right]-E\left[{\widehat{y}}_{h/h-1}^{\left(r\right)}{\widehat{y}}_{h/h-1}^{\left(r\right)T}\right]$, satisfies
$${\Pi}_{h}^{\left(r\right)}={\Sigma}_{h}^{{y}^{\left(r\right)}}-\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right){C}_{h}^{\left(r\right)}E\left[{\widehat{x}}_{h/h-1}^{\left(r\right)}{\widehat{x}}_{h/h-1}^{\left(r\right)T}\right]{C}_{h}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right).$$

Using now (6) for ${y}_{h}^{\left(r\right)}$ and (8) for ${\widehat{x}}_{h/h-1}^{\left(r\right)}$, we obtain that the coefficients ${\mathcal{X}}_{k,h}^{\left(r\right)}=E\left[{x}_{k}{\mu}_{h}^{\left(r\right)T}\right]$ satisfy ${\mathcal{X}}_{k,h}^{\left(r\right)}={A}_{k}{\mathcal{O}}_{h}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}h\le k$, where ${\mathcal{O}}_{h}^{\left(r\right)}$ is a matrix function such that

$${\mathcal{O}}_{h}^{\left(r\right)}=\left({\displaystyle {B}_{h}^{T}-\sum _{l=1}^{h-1}{\mathcal{O}}_{l}^{\left(r\right)}{\Pi}_{l}^{\left(r\right)-1}{\mathcal{O}}_{l}^{\left(r\right)T}{A}_{h}^{T}}\right){C}_{h}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right).$$

Then, by defining
we obtain that the estimators ${\widehat{x}}_{k/L}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}L\le k$, are given by ${\widehat{x}}_{k/L}^{\left(r\right)}={A}_{k}{o}_{L}^{\left(r\right)}$, and, by substitution in (9), we obtain that ${\mu}_{h}^{\left(r\right)}={y}_{h}^{\left(r\right)}-\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right){C}_{h}^{\left(r\right)}{A}_{h}{o}_{h-1}^{\left(r\right)}$.

$$\begin{array}{c}{\displaystyle {o}_{k}^{\left(r\right)}=(1-{\delta}_{k,0})\sum _{h=1}^{k}{\mathcal{O}}_{h}^{\left(r\right)}{\Pi}_{h}^{\left(r\right)-1}{\mu}_{h}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,}\\ {\displaystyle {\Sigma}_{k}^{{o}^{\left(r\right)}}=(1-{\delta}_{k,0})\sum _{h=1}^{k}{\mathcal{O}}_{h}^{\left(r\right)}{\Pi}_{h}^{\left(r\right)-1}{\mathcal{O}}_{h}^{\left(r\right)T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,}\end{array}$$

Finally, since (11) guarantees that $\sum _{l=1}^{h-1}{\mathcal{O}}_{l}^{\left(r\right)}{\Pi}_{l}^{\left(r\right)-1}{\mathcal{O}}_{l}^{\left(r\right)T}={\Sigma}_{h-1}^{{o}^{\left(r\right)}}$, by substitution in (10), it is deduced that ${\mathcal{O}}_{h}^{\left(r\right)}=\left({B}_{h}^{T}-{\Sigma}_{h-1}^{{o}^{\left(r\right)}}{A}_{h}^{T}\right){C}_{h}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{h}^{\left(r\right)}\right)$. Bearing in mind the above results, the following filtering algorithm is derived.

**Recursive Local LS Linear Filtering Algorithm.**For the r-th cluster, $r=1,\dots ,L$, the local filtering estimators, ${\widehat{x}}_{k/k}^{\left(r\right)}$, and the error covariance matrices, ${\widehat{\Sigma}}_{k/k}^{\left(r\right)}\equiv E\left[\left({x}_{k}-{\widehat{x}}_{k/k}^{\left(r\right)}\right){\left({x}_{k}-{\widehat{x}}_{k/k}^{\left(r\right)}\right)}^{T}\right]$, are recursively obtained by

$$\begin{array}{c}{\widehat{x}}_{k/k}^{\left(r\right)}={A}_{k}{o}_{k}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,\\ {\widehat{\Sigma}}_{k/k}^{\left(r\right)}={A}_{k}{\left({B}_{k}-{A}_{k}{\Sigma}_{k}^{{o}^{\left(r\right)}}\right)}^{T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1.\end{array}$$

$${o}_{k}^{\left(r\right)}={o}_{k-1}^{\left(r\right)}+{\mathcal{O}}_{k}^{\left(r\right)}{\Pi}_{k}^{\left(r\right)-1}{\mu}_{k}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{o}_{0}^{\left(r\right)}=0,$$

$${\Sigma}_{k}^{{o}^{\left(r\right)}}={\Sigma}_{k-1}^{{o}^{\left(r\right)}}+{\mathcal{O}}_{k}^{\left(r\right)}{\Pi}_{k}^{\left(r\right)-1}{\mathcal{O}}_{k}^{\left(r\right)T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{\Sigma}_{0}^{{o}^{\left(r\right)}}=0.$$

$${\mathcal{O}}_{k}^{\left(r\right)}={\left({B}_{k}-{A}_{k}{\Sigma}_{k-1}^{{o}^{\left(r\right)}}\right)}^{T}{C}_{k}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{k}^{\left(r\right)}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1.$$

The innovations, ${\mu}_{k}^{\left(r\right)}={y}_{k}^{\left(r\right)}-{\widehat{y}}_{k/k-1}^{\left(r\right)}$, are given by
and the innovation covariance matrices, ${\Pi}_{k}^{\left(r\right)}\equiv E\left[{\mu}_{k}^{\left(r\right)}{\mu}_{k}^{\left(r\right)T}\right]$, satisfy
where the matrices ${\Sigma}_{k}^{{y}^{\left(r\right)}}$ are given in (7).

$${\mu}_{k}^{\left(r\right)}={y}_{k}^{\left(r\right)}-\left(I-{\overline{\Lambda}}_{k}^{\left(r\right)}\right){C}_{k}^{\left(r\right)}{A}_{k}{o}_{k-1}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

$${\Pi}_{k}^{\left(r\right)}={\Sigma}_{k}^{{y}^{\left(r\right)}}-\left(I-{\overline{\Lambda}}_{k}^{\left(r\right)}\right){C}_{k}^{\left(r\right)}{A}_{k}{\Sigma}_{k-1}^{{o}^{\left(r\right)}}{A}_{k}^{T}{C}_{k}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{k}^{\left(r\right)}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

The general expression (8) for the LS linear estimators is also the starting point to derive the following covariance-based recursive fixed-point smoothing algorithm. The derivation is omitted, since this algorithm can easily be deduced by an analogous reasoning to that used in Theorem 2 of [6].

**Recursive Local LS Linear Fixed-point Smoothing Algorithm.**For the r-th cluster, $r=1,\dots ,L$, starting from the filter, ${\widehat{x}}_{k/k}^{\left(r\right)}$, the local LS linear fixed-point smoothers, ${\widehat{x}}_{k/k+N}^{\left(r\right)}$, $N\ge 1$, are calculated as

$${\widehat{x}}_{k/k+N}^{\left(r\right)}={\widehat{x}}_{k/k+N-1}^{\left(r\right)}+{\mathcal{X}}_{k,k+N}^{\left(r\right)}{\Pi}_{k+N}^{\left(r\right)-1}{\mu}_{k+N}^{\left(r\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

$${\mathcal{X}}_{k,k+N}^{\left(r\right)}=\left({B}_{k}-{M}_{k,k+N-1}^{\left(r\right)}\right){A}_{k+N}^{T}{C}_{k+N}^{\left(r\right)T}\left(I-{\overline{\Lambda}}_{k+N}^{\left(r\right)}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

$$\begin{array}{c}{M}_{k,k+N}^{\left(r\right)}={M}_{k,k+N-1}^{\left(r\right)}+{\mathcal{X}}_{k,k+N}^{\left(r\right)}{\Pi}_{k+N}^{\left(r\right)-1}{\mathcal{O}}_{k+N}^{\left(r\right)T},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\hfill \\ {M}_{k,k}^{\left(r\right)}={A}_{k}{K}_{k}^{{o}^{\left(r\right)}}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1.\hfill \end{array}$$

$${\widehat{\Sigma}}_{k/k+N}^{\left(r\right)}\equiv E\left[\left({x}_{k}-{\widehat{x}}_{k/k+N}^{\left(r\right)}\right){\left({x}_{k}-{\widehat{x}}_{k/k+N}^{\left(r\right)}\right)}^{T}\right]=E\left[{x}_{k}{x}_{k}^{T}\right]-E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{\widehat{x}}_{k/k+N}^{\left(r\right)T}\right],$$

$${\widehat{\Sigma}}_{k/k+N}^{\left(r\right)}={\widehat{\Sigma}}_{k/k+N-1}^{\left(r\right)}-{\mathcal{X}}_{k,k+N}^{\left(r\right)}{\Pi}_{k+N}^{\left(r\right)-1}{\mathcal{X}}_{k,k+N}^{\left(r\right)T},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1.$$

## 4. Second Stage: Distributed Signal Estimators

As we have already mentioned, once the local LS linear estimators, ${\widehat{x}}_{k/k+N}^{\left(r\right)}$, $r=1,\dots ,L$, have been obtained, they are sent to the FC and our goal is to fuse these local estimators to obtain distributed estimators, ${\widehat{x}}_{k/k+N}$, $N\ge 0$, as matrix-weighted linear combinations that minimize the mean squared estimation error.

Defining the stacked vectors by comprising all the local estimators, ${\widehat{X}}_{k/k+N}={\left({\widehat{x}}_{k/k+N}^{\left(1\right)T},\dots ,{\widehat{x}}_{k/k+N}^{\left(L\right)T}\right)}^{T}$, and applying the LS criterion, the proposed distributed fusion estimators are given by:
where $E\left[{\widehat{X}}_{k/k+N}{\widehat{X}}_{k/k+N}^{T}\right]={\left(E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{\widehat{x}}_{k/k+N}^{\left(s\right)T}\right]\right)}_{r,s=1,\dots ,L}$ and, from the Orthogonal Projection Lemma (OPL), $E\left[{x}_{k}{\widehat{X}}_{k/k+N}^{T}\right]=\left(E\left[{\widehat{x}}_{k/k+N}^{\left(1\right)}{\widehat{x}}_{k/k+N}^{\left(1\right)T}\right],\dots ,E\left[{\widehat{x}}_{k/k+N}^{\left(L\right)}{\widehat{x}}_{k/k+N}^{\left(L\right)T}\right]\right)$. Hence, the derivation of the distributed estimators in (16) requires to obtain the cross-covariance matrices between the local ones ${\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}}\equiv E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{\widehat{x}}_{k/k+N}^{\left(s\right)T}\right],\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 0$. Since the initial condition of the recursive Formula (15) for the local smoothing estimators is the local filter, the cross-covariance matrices, ${\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}},\phantom{\rule{4pt}{0ex}}N\ge 1,$ between the local smoothers will be recursively obtained by starting from the cross-covariance matrices, ${\Sigma}_{k/k}^{{\widehat{x}}^{\left(rs\right)}}$, between the local filters.

$${\widehat{x}}_{k/k+N}=E\left[{x}_{k}{\widehat{X}}_{k/k+N}^{T}\right]{\left(E\left[{\widehat{X}}_{k/k+N}{\widehat{X}}_{k/k+N}^{T}\right]\right)}^{-1}{\widehat{X}}_{k/k+N},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 0,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

#### 4.1. Cross-Covariance Matrices between Local Filtering Estimators ${\Sigma}_{k/k}^{{\widehat{x}}^{\left(rs\right)}}=E\left[{\widehat{x}}_{k/k}^{\left(r\right)}{\widehat{x}}_{k/k}^{\left(s\right)T}\right]$

From expression (12) for the filter ${\widehat{x}}_{k/k}^{\left(r\right)}$, denoting ${\Sigma}_{k}^{{o}^{\left(rs\right)}}\equiv E\left[{o}_{k}^{\left(r\right)}{o}_{k}^{\left(s\right)T}\right],$ it is clear that the cross-covariance matrices between any two local filtering estimators ${\widehat{x}}_{k/k}^{\left(r\right)}$ and ${\widehat{x}}_{k/k}^{\left(s\right)}$ satisfy:

$${\Sigma}_{k/k}^{{\widehat{x}}^{\left(rs\right)}}={A}_{k}{\Sigma}_{k}^{{o}^{\left(rs\right)}}{A}_{k}^{T},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.$$

- Using (13) for ${o}_{k}^{\left(r\right)}$ and denoting ${\mathcal{O}}_{h,k}^{\left(rs\right)}\equiv E\left[{o}_{h}^{\left(r\right)}{\mu}_{k}^{\left(s\right)T}\right]$, for $h=k-1,k$, we get that ${\Sigma}_{k}^{{o}^{\left(rs\right)}}$ is recursively obtained by:$${\Sigma}_{k}^{{o}^{\left(rs\right)}}={\Sigma}_{k-1}^{{o}^{\left(rs\right)}}+{\mathcal{O}}_{k-1,k}^{\left(rs\right)}{\Pi}_{k}^{\left(s\right)-1}{\mathcal{O}}_{k}^{\left(s\right)T}+{\mathcal{O}}_{k}^{\left(r\right)}{\Pi}_{k}^{\left(r\right)-1}{\mathcal{O}}_{k}^{\left(sr\right)T},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{\Sigma}_{0}^{{o}^{\left(rs\right)}}=0;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.$$
- Using again (13) and denoting ${\Pi}_{k}^{\left(rs\right)}\equiv E\left[{\mu}_{k}^{\left(r\right)}{\mu}_{k}^{\left(s\right)T}\right]$, the following expression for ${\mathcal{O}}_{k}^{\left(rs\right)}=E\left[{o}_{k}^{\left(r\right)}{\mu}_{k}^{\left(s\right)T}\right]$ is immediately obtained:$${\mathcal{O}}_{k}^{\left(rs\right)}={\mathcal{O}}_{k-1,k}^{\left(rs\right)}+{\mathcal{O}}_{k}^{\left(r\right)}{\Pi}_{k}^{\left(r\right)-1}{\Pi}_{k}^{\left(rs\right)},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.$$
- Next, we derive an expression for ${\mathcal{O}}_{k-1,k}^{\left(rs\right)}=E\left[{o}_{k-1}^{\left(r\right)}{\mu}_{k}^{\left(s\right)T}\right]$. Using (14) for ${\mu}_{k}^{\left(s\right)}$, with (6) for ${y}_{k}^{\left(s\right)}$, and taking into account that, from the OPL, $E\left[{o}_{k-1}^{\left(r\right)}{x}_{k}\right]=E\left[{o}_{k-1}^{\left(r\right)}{\widehat{x}}_{k/k-1}^{\left(r\right)T}\right]$, we obtain:$${\mathcal{O}}_{k-1,k}^{\left(rs\right)}=\left({\Sigma}_{k-1}^{{o}^{\left(r\right)}}-{\Sigma}_{k-1}^{{o}^{\left(rs\right)}}\right){A}_{k}^{T}{C}_{k}^{\left(s\right)T}\left(I-{\overline{\Lambda}}_{k}^{\left(s\right)}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L$$
- Finally, an expression for the innovation cross-covariance matrices, ${\Pi}_{k}^{\left(rs\right)}$, is derived. First we write$${\Pi}_{k}^{\left(rs\right)}=E\left[{y}_{k}^{\left(r\right)}{y}_{k}^{\left(s\right)T}\right]-E\left[{y}_{k}^{\left(r\right)}{\widehat{y}}_{k/k-1}^{\left(s\right)T}\right]-E\left[{\widehat{y}}_{k/k-1}^{\left(r\right)}{y}_{k}^{\left(s\right)T}\right]+E\left[{\widehat{y}}_{k/k-1}^{\left(r\right)}{\widehat{y}}_{k/k-1}^{\left(s\right)T}\right].$$Next, for $t=r,s,$ we use (6) for ${y}_{k}^{\left(t\right)}$ and (14) for ${\widehat{y}}_{k/k-1}^{\left(t\right)}$; then, using again that, from the OPL, $E\left[{x}_{k}{o}_{k-1}^{\left(t\right)T}\right]=E\left[{\widehat{x}}_{k/k-1}^{\left(t\right)}{o}_{k-1}^{\left(t\right)T}\right]$, the following expression is clear:$${\Pi}_{k}^{\left(rs\right)}={\Sigma}_{k}^{{y}^{\left(rs\right)}}+(I-{\overline{\Lambda}}_{k}^{\left(r\right)}){C}_{k}^{\left(r\right)}{A}_{k}\left({\Sigma}_{k-1}^{{o}^{\left(rs\right)}}-{\Sigma}_{k-1}^{{o}^{\left(r\right)}}-{\Sigma}_{k-1}^{{o}^{\left(s\right)}}\right){A}_{k}^{T}{C}_{k}^{\left(s\right)T}(I-{\overline{\Lambda}}_{k}^{\left(s\right)}),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L,$$

#### 4.2. Cross-Covariance Matrices between Local Smoothing Estimators ${\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}}=E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{\widehat{x}}_{k/k+N}^{\left(s\right)T}\right]$

From expression (15), denoting ${\Phi}_{k,k+N}^{\left(rs\right)}\equiv E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{\mu}_{k+N}^{\left(s\right)T}\right]$, it is clear that the cross-covariance matrices between any two local smoothing estimators ${\widehat{x}}_{k/k+N}^{\left(r\right)}$ and ${\widehat{x}}_{k/k+N}^{\left(s\right)}$ satisfy:

$$\begin{array}{c}{\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}}={\Sigma}_{k/k+N-1}^{{\widehat{x}}^{\left(rs\right)}}+{\Phi}_{k,k+N}^{\left(rs\right)}{\Pi}_{k+N}^{\left(s\right)-1}{\mathcal{X}}_{k,k+N}^{\left(s\right)T}\hfill \\ \phantom{\rule{45.52458pt}{0ex}}+{\mathcal{X}}_{k,k+N}^{\left(r\right)}{\Pi}_{k+N}^{\left(r\right)-1}{\left({\Phi}_{k,k+N}^{\left(sr\right)}+{\mathcal{X}}_{k,k+N}^{\left(s\right)}{\Pi}_{k+N}^{\left(s\right)-1}{\Pi}_{k+N}^{\left(rs\right)T}\right)}^{T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.\hfill \end{array}$$

- To obtain the expectations ${\Phi}_{k,k+N}^{\left(rs\right)}\equiv E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{\mu}_{k+N}^{\left(s\right)T}\right]$, we use (14) for ${\mu}_{k+N}^{\left(s\right)}$ and denote ${\Psi}_{k,k+N}^{\left(rs\right)}\equiv E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{o}_{k+N}^{\left(s\right)T}\right]$; then, we have:$${\Phi}_{k,k+N}^{\left(rs\right)}=E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{y}_{k+N}^{\left(s\right)T}\right]-{\Psi}_{k,k+N-1}^{\left(rs\right)}{A}_{k+N}^{T}{C}_{k+N}^{\left(s\right)T}(I-{\overline{\Lambda}}_{k+N}^{\left(s\right)}).$$Using now (6) for ${y}_{k+N}^{\left(s\right)}$ and the independence of ${\widehat{x}}_{k/k+N-1}^{\left(r\right)}$ with ${\Lambda}_{k+N}^{\left(s\right)}$, ${v}_{k+N}^{\left(s\right)}$ and ${w}_{k+N}^{\left(s\right)}$, we have$$E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{y}_{k+N}^{\left(s\right)T}\right]=E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{x}_{k+N}^{T}\right]{C}_{k+N}^{\left(s\right)T}(I-{\overline{\Lambda}}_{k+N}^{\left(s\right)}),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.$$Next, we use that, from the OPL, the estimator is orthogonal to the estimation error, to write $E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{x}_{k+N}^{T}\right]=E\left[{\widehat{x}}_{k/k+N-1}^{\left(r\right)}{\widehat{x}}_{k+N/k+N-1}^{\left(r\right)T}\right]$. So, since ${\widehat{x}}_{k+N/k+N-1}^{\left(r\right)}={A}_{k+N}{o}_{k+N-1}^{\left(r\right)}$, the following expression for ${\Phi}_{k,k+N}^{\left(rs\right)}$ is immediately derived:$${\Phi}_{k,k+N}^{\left(rs\right)}=\left({\Psi}_{k,k+N-1}^{\left(r\right)}-{\Psi}_{k,k+N-1}^{\left(rs\right)}\right){A}_{k+N}^{T}{C}_{k+N}^{\left(s\right)T}(I-{\overline{\Lambda}}_{k+N}^{\left(s\right)}),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.$$
- Finally, the expectations ${\Psi}_{k,k+N}^{\left(rs\right)}=E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{o}_{k+N}^{\left(s\right)T}\right]$ are calculated. Using (15) for ${\widehat{x}}_{k/k+N}^{\left(r\right)}$ and (13) for ${o}_{k+N}^{\left(s\right)}$, the following expression for ${\Psi}_{k,k+N}^{\left(rs\right)}$ is clear:$$\begin{array}{c}{\Psi}_{k,k+N}^{\left(rs\right)}={\Psi}_{k,k+N-1}^{\left(rs\right)}+{\Phi}_{k,k+N}^{\left(rs\right)}{\Pi}_{k+N}^{\left(s\right)-1}{\mathcal{O}}_{k+N}^{\left(s\right)T}\hfill \\ \phantom{\rule{42.67912pt}{0ex}}+{\mathcal{X}}_{k,k+N}^{\left(r\right)}{\Pi}_{k+N}^{\left(r\right)-1}{\left({\mathcal{O}}_{k+N-1,k+N}^{\left(sr\right)}+{\mathcal{O}}_{k+N}^{\left(s\right)}{\Pi}_{k+N}^{\left(s\right)-1}{\Pi}_{k+N}^{\left(rs\right)T}\right)}^{T}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 1,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.\hfill \\ {\Psi}_{k,k}^{\left(rs\right)}={A}_{k}{\Sigma}_{k}^{{o}^{\left(rs\right)}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L.\hfill \end{array}$$

#### 4.3. Distributed Filtering and Fixed-Point Smoothing Estimators

The distributed estimators, ${\widehat{x}}_{k/k+N}$, are calculated from (16), while a formula for the error covariance matrices, ${\widehat{\Sigma}}_{k/k+N}\equiv E\left[\left({x}_{k}-{\widehat{x}}_{k/k+N}\right){\left({x}_{k}-{\widehat{x}}_{k/k+N}\right)}^{T}\right]$, is easily derived from assumption

**(A1)**and (16). More specifically, we can state the following results:Let ${\widehat{X}}_{k/k+N}={\left({\widehat{x}}_{k/k+N}^{\left(1\right)T},\dots ,{\widehat{x}}_{k/k+N}^{\left(L\right)T}\right)}^{T}$ be the vectors constituted by the local estimators calculated from the recursive algorithms in Section 3.2; then, the distributed filtering and smoothing estimators are given by
with ${\Sigma}_{k/k+N}={\left({\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}}\right)}_{r,s=1,\dots ,L}$ and ${\Xi}_{k/k+N}=\left({\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(1\right)}},\dots ,{\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(L\right)}}\right),$ where ${\Sigma}_{k/k+N}^{{\widehat{x}}^{\left(rs\right)}}=E\left[{\widehat{x}}_{k/k+N}^{\left(r\right)}{\widehat{x}}_{k/k+N}^{\left(s\right)T}\right],\phantom{\rule{4pt}{0ex}}r,s=1,\dots ,L$, are obtained by (18)) for $N\ge 1$, with initial condition given in (17).

$${\widehat{x}}_{k/k+N}={\Xi}_{k/k+N}{\left({\Sigma}_{k/k+N}\right)}^{-1}{\widehat{X}}_{k/k+N},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 0,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1,$$

The error covariance matrices of the distributed estimators are computed by

$${\widehat{\Sigma}}_{k/k+N}={A}_{k}{B}_{k}^{T}-{\Xi}_{k/k+N}{\Sigma}_{k/k+N}^{-1}{\Xi}_{k/k+N}^{T},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}N\ge 0,\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 1.$$

## 5. Numerical Simulation Study

A numerical simulation example is presented to illustrate the application of the algorithms designed in the current paper to estimate a two-dimensional signal. Specifically, such algorithms have been implemented by using MATLAB software and both local and distributed filtering and fixed-point smoothing error variances of the signal components have been calculated, for 100 iterations, in order to examine the estimation accuracy. The effect of the stochastic deception attacks on the estimation performance has also been analyzed.

Consider a two-dimensional signal, ${x}_{k}$, whose evolution is described by the following model:
where

$${x}_{k+1}=\left({F}_{1}+{\epsilon}_{k}{F}_{2}\right){x}_{k}+G{\alpha}_{k},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}k\ge 0,$$

$${F}_{1}=\phantom{\rule{-0.166667em}{0ex}}\left(\begin{array}{cc}\phantom{\rule{-4.97931pt}{0ex}}0.95& 0.01\\ \phantom{\rule{-4.97931pt}{0ex}}0& 0.95\end{array}\phantom{\rule{-4.97931pt}{0ex}}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{F}_{2}=\phantom{\rule{-0.166667em}{0ex}}\left(\begin{array}{cc}\phantom{\rule{-4.97931pt}{0ex}}0.01& 0\\ \phantom{\rule{-4.97931pt}{0ex}}0& 0.01\end{array}\phantom{\rule{-4.97931pt}{0ex}}\right),\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}G=\phantom{\rule{-0.166667em}{0ex}}\left(\begin{array}{c}\phantom{\rule{-4.97931pt}{0ex}}0.8\\ \phantom{\rule{-4.97931pt}{0ex}}0.6\end{array}\phantom{\rule{-4.97931pt}{0ex}}\right).$$

The multiplicative noise, ${\left\{{\epsilon}_{k}\right\}}_{k\ge 1}$, and the additive noise, ${\left\{{\alpha}_{k}\right\}}_{k\ge 1}$, are standard white gaussian scalar noises. The initial signal ${x}_{0}$ is a gaussian two-dimensional random vector with zero mean and covariance matrix $E\left[{x}_{0}{x}_{0}^{T}\right]=I$. The noise sequences and initial signal vector are assumed to be mutually independent; then, it is easy to see that the signal covariance function is given by $E\left[{x}_{k}{x}_{h}^{T}\right]={F}_{1}^{k-h}E\left[{x}_{h}{x}_{h}^{T}\right],\phantom{\rule{4pt}{0ex}}h\le k,$ so assumption

**(A1)**is satisfied taking ${A}_{k}={F}_{1}^{k}$ and ${B}_{h}^{T}={F}_{1}^{-h}{\Sigma}_{h}^{x},$ where ${\Sigma}_{h}^{x}\equiv E\left[{x}_{h}{x}_{h}^{T}\right],\phantom{\rule{4pt}{0ex}}h\ge 1,$ is recursively obtained by:
$${\Sigma}_{h}^{x}={F}_{1}{\Sigma}_{h-1}^{x}{F}_{1}^{T}+{F}_{2}{\Sigma}_{h-1}^{x}{F}_{2}^{T}+G{G}^{T}\phantom{\rule{-0.166667em}{0ex}},\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}h\ge 1;\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{\Sigma}_{0}^{x}=I.$$

This example shows that assumption

**(A1)**on the signal autocovariance function is fulfilled by uncertain systems with state-dependent multiplicative noise.Suppose that a sensor network, comprising 12 sensors, is deployed to measure the stochastic signal ${x}_{k}$. These sensors are grouped into three clusters and the number of sensors in each cluster is ${m}_{1}=3$, ${m}_{2}=4$ and ${m}_{3}=5$, respectively. Scalar measurements are provided by the 12 sensors, according to model (1), with the following time-invariant observation matrices:

$${C}_{k}^{\left(r\right(1\left)\right)}=[0.8\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}0.9],\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{C}_{k}^{\left(r\right(2\left)\right)}=[0.6\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}0.7],\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{C}_{k}^{\left(r\right(3\left)\right)}=[0.7\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}0.8],\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}r=1,2,3;$$

$${C}_{k}^{\left(2\right(4\left)\right)}={C}_{k}^{\left(3\right(4\left)\right)}=[0.9\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}0.5],\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}{C}_{k}^{\left(3\right(5\left)\right)}=[0.5\phantom{\rule{4pt}{0ex}}\phantom{\rule{4pt}{0ex}}0.5].$$

The additive measurement noises are defined as ${v}_{k}^{\left(r\right(i\left)\right)}={v}^{\left(r\right)}{\eta}_{k}^{\left(r\right)}$, for all $i=1,\dots ,{m}_{r}$, where ${v}^{\left(1\right)}=0.4$, ${v}^{\left(2\right)}=0.7$, ${v}^{\left(3\right)}=1$, and ${\left\{{\eta}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,2,3,$ are independent zero-mean Gaussian white process with variance 10; so the measurement noises in each cluster $r=1,2,3,$ are correlated with ${R}_{k}^{r\left(ij\right)}=10{\left({v}^{\left(r\right)}\right)}^{2},\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}i,j=1,\dots ,{m}_{r}.$

Assume that the transmissions to the local processors are subject to linear deception attacks and the data injected by the attackers are described by (2), where the attack noises are defined as ${w}_{k}^{\left(r\right(i\left)\right)}={w}^{\left(r\right)}{\zeta}_{k}^{\left(r\right)}$, for all $i=1,\dots ,{m}_{r}$, with ${w}^{\left(1\right)}=0.1$, ${w}^{\left(2\right)}=0.25$, ${w}^{\left(3\right)}=1$, and ${\left\{{\zeta}_{k}^{\left(r\right)}\right\}}_{k\ge 1}$, $r=1,2,3,$ independent standard Gaussian white processes. Clearly, in each cluster $r=1,2,3,$ the attack noises are correlated with ${S}_{k}^{r\left(ij\right)}={\left({w}^{\left(r\right)}\right)}^{2},\phantom{\rule{4pt}{0ex}}k\ge 1;\phantom{\rule{4pt}{0ex}}i,j=1,\dots ,{m}_{r}.$

According to the theoretical study, for $r=1,2,3$, the measurements received at the r-th local processor are modelled by (3); that is, the attacks are considered to take place randomly and the statuses of the attacks are described by independent sequences of independent Bernoulli random variables with known probabilities $P({\lambda}_{k}^{\left(r\right(i\left)\right)}=1)={\overline{\lambda}}^{r\left(i\right)},\phantom{\rule{4pt}{0ex}}k\ge 1,\phantom{\rule{4pt}{0ex}}r=1,2,3,\phantom{\rule{4pt}{0ex}}i=1,\dots ,{m}_{r}$.

First, considering ${\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)}=0.1i$, for $r=1,2,3$, and $i=1,\dots ,{m}_{r}$, the performance of the local filtering estimators has been compared with that of the local fixed-point smoothing estimators; for $r=1,2,3$, these local estimators are computed in each r-th processor using only the measurements received from the sensors of the corresponding r-th cluster. For the first signal component, Figure 1 displays the error variances of the local filtering estimators, ${\left({\widehat{\Sigma}}_{k/k}^{\left(r\right)}\right)}_{11}$, and the local fixed-point smoothing estimators ${\left({\widehat{\Sigma}}_{k/k+N}^{\left(r\right)}\right)}_{11}$, for $N=1,3,5,7$, are displayed. For the three processors, Figure 1 shows that, as expected, the error variances corresponding to the local smoothers are less than those of the local filters. It is also observed that the accuracy of the smoothers at each fixed-point, k, becomes better as the number of available observations, $k+N$, increases, although this improvement is practically imperceptible for $N>3$ in processor 1, $N>5$ in processor 2 and $N>7$ in processor 3. Note that the best estimation accuracy is obtained in processor 1 and the worst one in processor 3; therefore, we can conclude that, when the filtering performance is better, the improvement of the smoothers is less significant and, actually, it becomes practically negligible for lower values of N. Similar results and conclusions are obtained for the second signal component.

Next, considering again ${\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)}=0.1i$, for $r=1,2,3$, and $i=1,\dots ,{m}_{r}$, the performance of the distributed filtering and fixed-point smoothing estimators have been compared with that of the local filtering estimators of the three processors. Figure 2 displays, for both first and second signal components ($a=1,2$), the error variances of the local filtering estimators ${\left({\widehat{\Sigma}}_{k/k}^{\left(r\right)}\right)}_{aa}$, $r=1,2,3$, and the distributed filtering and smoothing estimators ${\left({\widehat{\Sigma}}_{k/k+N}\right)}_{aa}$, $N=0,1,3$. On the one hand, this figure shows that the error variances of the distributed fusion filtering estimators are smaller than those of every local filter; consequently, the distributed fusion filtering estimators outperform all the local ones, agreeing with what theoretically is expected, since all the information of the three clusters is available to the distributed filters, while the local ones are based on the information of a single cluster. On the other hand, agreeing with the comments made about Figure 1, from Figure 2 it is also concluded that the error variances of the distributed fusion smoothers are less than those of the filters and, at each fixed-point, k, the performance of the smoothers is better as the number of available observations, $k+N$, increases (note that, in this case, the improvement is practically imperceptible for $N>1$).

Finally, assuming the same success probability of attack during the transmissions of the measured outputs of all the sensors in the three clusters, ${\overline{\lambda}}_{k}^{\left(r\right(i\left)\right)}=\overline{\lambda}$, $r=1,2,3$, $i=1,\dots ,{m}_{r}$, we analyze how this probability affects the error variances of the distributed filtering estimators. Taking into account that these variances stabilize after a sufficiently large number of iterations, only the results of the last iteration ($k=100$) are shown in Table 1. Specifically, the distributed filtering error variances of the first and second signal components, ${\left({\widehat{\Sigma}}_{100/100}\right)}_{aa}$, $a=1,2$, together with the corresponding percent variation rates are shown in Table 1, when $\overline{\lambda}$ varies from 0.1 to 0.9. From this table, we conclude that, for both signal components, as $\overline{\lambda}$ increases, the distributed filtering error variances become higher, meaning that, as expected, the lower the probability of successful attacks is, the better estimators are obtained. In addition, for both signal components, it is observed that the deterioration of the estimators is more significant (the percent variation rate of the error variance is larger) for high attack probabilities ($\overline{\lambda}\ge 0.7$).

## 6. Concluding Remark

This paper contributes to the literature on the distributed fusion filtering and fixed-point smoothing problems in clustering sensor networks subject to random deception attacks. Actually, the sensor nodes of the network are grouped into some clusters and each cluster is connected to a local processor, which gathers the measured outputs of all the sensors in the cluster. During this transmission, the measured outputs sent by the sensors to the local processor may be deceptively modified by malicious attackers, who launch random deception attacks with known probabilities of success that may be different at each sensor. Aggregating the information received at every local processor, and using an innovation approach, recursive and easily implementable local filtering and fixed-point smoothing algorithms have been designed without requiring full knowledge of the signal evolution model, but only the first and second order moments of the processes involved in the measurement equations. Once the local estimators are available, they are transmitted to the global fusion center where they are fused to obtain the optimal matrix-weighted fusion filter and fixed-point smoother, under the minimum mean-squared error criterion. The simulation results have illustrated the effectiveness of the proposed algorithms, showing that the distributed fusion estimators outperform the local ones. In addition, the influence of the probability of successful attacks on the estimation performance has been analyzed, concluding that the estimation error variance is larger (and, consequently, worse estimations are obtained) for higher attack probabilities of success. Some new research directions stemming from this work would be:

- -
- Considering other different attack strategies (not necessarily linear ones) to cover more general scenarios in practice. Another interesting generalization would be studying the case where the Bernoulli random variables modelling the success or failure of the attacks are not necessarily independent, but they are correlated at consecutive sampling times or they obey a Markovian dependence structure, for example.
- -
- Addressing the estimation problem in clustering sensor networks with a topology represented by a directed or undirected graph, thus generalizing the current study by allowing the exchange of information between sensors and between clusters.
- -
- Investigating other interesting issues related with clustering sensor networks (cluster consensus, sensor mobility, overlap of clusters, cluster position awareness, efficient and energy, uniform clustering or stability of the cluster, among others).

## Author Contributions

All the authors contributed equally to this work. R.C.-Á., A.H.-C. and J.L.-P. provided original ideas for the proposed model and collaborated in the derivation of the estimation algorithms; they participated equally in the design and analysis of the simulation results; and the paper was also written and reviewed cooperatively.

## Funding

This research is supported by Ministerio de Economía, Industria y Competitividad, Agencia Estatal de Investigación and Fondo Europeo de Desarrollo Regional FEDER (grant no. MTM2017-84199-P).

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Pang, C.; Sun, S. Fusion Predictors for Multisensor Stochastic Uncertain Systems With Missing Measurements and Unknown Measurement Disturbances. IEEE Sens. J.
**2015**, 15, 4346–4354. [Google Scholar] [CrossRef] - Caballero-Águila, R.; García-Garrido, I.; Linares-Pérez, J. Distributed fusion filtering for multi-sensor systems with correlated random transition and measurement matrices. Int. J. Comput. Math.
**2018**. [Google Scholar] [CrossRef] - Ruan, Y.; Luo, Y.; Zhu, Y. Globally Optimal Distributed Kalman Filtering for Multisensor Systems with Unknown Inputs. Sensors
**2018**, 18, 2976. [Google Scholar] [CrossRef] [PubMed] - Caballero-Águila, R.; Hermoso-Carazo, A.; Linares-Pérez, J.; Wang, Z. A new approach to distributed fusion filtering for networked systems with random parameter matrices and correlated noises. Inf. Fusion
**2019**, 45, 324–332. [Google Scholar] [CrossRef] - Caballero-Águila, R.; Hermoso-Carazo, A.; Linares-Pérez, J. Networked Fusion Filtering from Outputs with Stochastic Uncertainties and Correlated Random Transmission Delays. Sensors
**2016**, 16, 847. [Google Scholar] [CrossRef] - Caballero-Águila, R.; Hermoso-Carazo, A.; Linares-Pérez, J. Centralized Fusion Approach to the Estimation Problem with Multi-Packet Processing under Uncertainty in Outputs and Transmissions. Sensors
**2018**, 18, 2697. [Google Scholar] [CrossRef] - Caballero-Águila, R.; Hermoso-Carazo, A.; Linares-Pérez, J. Centralized filtering and smoothing algorithms from outputs with random parameter matrices transmitted through uncertain communication channels. Digit. Signal Process.
**2019**, 85, 77–85. [Google Scholar] [CrossRef] - Jun, H.; Wang, Z.; Chen, D.; Alsaadi, F.E. Estimation, filtering and fusion for networked systems with network-induced phenomena: New progress and prospects. Inf. Fusion
**2016**, 31, 65–75. [Google Scholar] - Bakr, M.A.; Lee, S. Distributed Multisensor Data Fusion under Unknown Correlation and Data Inconsistency. Sensors
**2017**, 17, 2472. [Google Scholar] [CrossRef] - Sun, S.; Lin, H.; Ma, J.; Li, X. Multi-sensor distributed fusion estimation with applications in networked systems: A review paper. Inf. Fusion
**2017**, 38, 122–134. [Google Scholar] [CrossRef] - Din, S.; Ahmad, A.; Paul, A.; Rathore, M.M.U.; Jeon, G. A Cluster-Based Data Fusion Technique to Analyze Big Data in Wireless Multi-Sensor System. IEEE Access
**2017**, 5, 5069–5083. [Google Scholar] [CrossRef] - Wang, F.; Wang, L.; Han, Y.; Liu, B.; Wang, J.; Su, X. A Study on the Clustering Technology of Underwater Isomorphic Sensor Networks Based on Energy Balance. Sensors
**2014**, 14, 12523–12532. [Google Scholar] [CrossRef] [PubMed] - Shang, Y.A. combinatorial necessary and sufficient condition for cluster consensus. Neurocomputing
**2016**, 216, 611–616. [Google Scholar] [CrossRef] - Shang, Y. Finite-time Cluster Average Consensus for Networks via Distributed Iterations. Int. J. Control. Autom. Syst.
**2017**, 15, 933–938. [Google Scholar] [CrossRef] - Shang, Y. Resilient Multiscale Coordination Control against Adversarial Nodes. Energies
**2018**, 11, 1844. [Google Scholar] [CrossRef] - Shang, Y. Resilient consensus of switched multi-agent systems. Syst. Control. Lett.
**2018**, 122, 12–18. [Google Scholar] [CrossRef] - Afsar, M.M.; Tayarani-N, M.-H. Clustering in sensor networks: A literature survey. J. Netw. Comput. Appl.
**2014**, 46, 198–226. [Google Scholar] [CrossRef] - Akila, I.S.; Manisekaran, S.V.; Venkatesan, S.V. Modern Clustering Techniques in Wireless Sensor Networks. In Wireless Sensor Networks—Insights and Innovations; Sallis, P.J., Ed.; InTech Open: London, UK, 2017; pp. 141–156. [Google Scholar]
- Sambo, D.W.; Yenke, B.O.; Förster, A.; Dayang, P. Optimized Clustering Algorithms for Large Wireless Sensor Networks: A Review. Sensors
**2019**, 19, 322. [Google Scholar] [CrossRef] - Hu, L.; Wang, Z.; Han, Q.-L.; Liu, X. State estimation under false data injection attacks: Security analysis and system protection. Automatica
**2018**, 87, 176–183. [Google Scholar] [CrossRef] - Ma, L.; Wang, Z.; Han, Q.L.; Lam, H.K. Variance constrained distributed filtering for time-varying systems with multiplicative noises and deception attacks over sensor networks. IEEE Sens. J.
**2017**, 17, 2279–2288. [Google Scholar] [CrossRef] - Ding, D.; Wang, Z.; Ho, D.W.C.; Wei, G. Distributed recursive filtering for stochastic systems under uniform quantizations and deception attacks through sensor networks. Automatica
**2017**, 78, 231–240. [Google Scholar] [CrossRef] - Wang, D.; Wang, Z.; Shen, B.; Alsaadi, F.E. Security guaranteed filtering for discrete-time stochastic delayed systems with randomly occurring sensor saturations and deception attacks. Int. J. Robust Nonlinear Control.
**2017**, 27, 1194–1208. [Google Scholar] [CrossRef] - Wang, Z.; Wang, D.; Shen, B.; Alsaadi, F.E. Centralized security-guaranteed filtering in multirate-sensor fusion under deception attacks. J. Frankl. Inst.
**2018**, 355, 406–420. [Google Scholar] [CrossRef] - Song, H.; Hong, Z.; Song, H.; Zhang, W.-A. Fusion estimation in clustering sensor networks under stochastic deception attacks. Int. J. Syst. Sci.
**2018**, 49, 2257–2266. [Google Scholar] [CrossRef] - Li, Y.; Wu, Q.; Peng, L. Simultaneous Event-Triggered Fault Detection and Estimation for Stochastic Systems Subject to Deception Attacks. Sensors
**2018**, 18, 321. [Google Scholar] [CrossRef] [PubMed]

**Figure 1.**First signal component error variance comparison of the local filters and fixed-point smoothers.

**Figure 2.**Local filtering and distributed filtering and fixed-point smoothing error variances of the first and second signal components.

**Table 1.**Distributed filtering error variances at $k=100$ and percent variation rates of the first and second signal components under different attack success probabilities $\overline{\lambda}$.

Attack Success Probability $\overline{\mathit{\lambda}}$ | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 |
---|---|---|---|---|---|---|---|---|---|

Error variance ${\left({\widehat{\Sigma}}_{\mathbf{100}/\mathbf{100}}\right)}_{\mathbf{11}}$ | 0.4743 | 0.5597 | 0.6428 | 0.7343 | 0.8427 | 0.9810 | 1.1758 | 1.4950 | 2.1877 |

Percent variation rate | 18.01 | 14.85 | 14.23 | 14.76 | 16.41 | 19.86 | 27.10 | 46.38 | |

Error variance ${\left({\widehat{\Sigma}}_{\mathbf{100}/\mathbf{100}}\right)}_{\mathbf{22}}$ | 0.2650 | 0.3122 | 0.3579 | 0.4082 | 0.4675 | 0.5427 | 0.6478 | 0.8180 | 1.1787 |

Percent variation rate | 17.81 | 14.64 | 14.05 | 14.53 | 16.09 | 19.37 | 26.23 | 44.15 |

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).