Open Access
This article is

- freely available
- re-usable

*Future Internet*
**2020**,
*12*(2),
26;
https://doi.org/10.3390/fi12020026

Article

A Novel Low-Complexity Fault Diagnosis Algorithm for Energy Internet in Smart Cities

^{1}

College of Computer and Control Engineering, Minjiang University, Fujian 350108, China

^{2}

Industrial Robot Application of Fujian University Engineering Research Center, Minjiang University, Fujian 350108, China

^{3}

Fujian Provincial Key Laboratory of Information Processing and Intelligent Control, Minjiang University, Fujian 350108, China

^{4}

Concord University College Fujian Normal University, Fujian 350117, China

^{5}

College of Mathematics and Informatics, Fujian Provincial Key Laboratory of Network Security and Cryptology, Fujian Normal University, Fujian 350117, China

^{*}

Authors to whom correspondence should be addressed.

Received: 20 December 2019 / Accepted: 20 January 2020 / Published: 6 February 2020

## Abstract

**:**

The smart energy system, viewed as an “Energy Internet”, consists of the intelligent integration of decentralized sustainable energy sources, efficient distribution, and optimized power consumption. That implies the fault diagnosis for a smart energy system should be of low complexity. In this paper, we propose a Strong Tracking Unscented Kalman Filter ($STUKF$) and modified Bayes’ classification-based Modified Three Sigma test ($MTS$), abbreviated as $SFBT$, for smart energy networks. The theoretical analysis and simulations indicate that $SFBT$ detects faults with a high accuracy and a low complexity of $O\left(n\right)$.

Keywords:

fault diagnosis; low-complexity; energy internet; smart city## 1. Introduction

The components a smart city include smart infrastructure, smart buildings, smart transportation, smart energy, smart health care, smart technology, smart governance, smart education, and smart citizens [1,2,3,4,5], among which the concept of smart energy can be viewed as an “Energy Internet” model [6,7,8,9,10,11]. The backbone of a smart energy system is the smart energy grid, the smart energy metering of which is an important component. For example, the smart meter records consumption of electric energy in certain time intervals and communicates that information to the utility for monitoring and billing. To ensure a green model for smart cities [12], minimal energy consumption is required. To serve such a purpose, fault diagnosis in green smart cities should be efficient in low complexity [13,14,15].

There exist many filter-based fault diagnosis algorithms specifically designed for nonlinear time-varying stochastic systems. In Reference [16], Ma et al. proposed a fault-related parameter estimation method based on a strong tracking filter for state assessment, as well as a logistic regression algorithm for root cause classification. In Reference [17], Daroogheh et al. designed a particle filter-based dual estimation strategy for both time-varying parameters and states. In Reference [18], Yin and Zhou proposed an intelligent particle filter to solve the problem of misleading state estimation due to the particle impoverishment. They designed a genetic-operators-based strategy to improve the particle diversity. Zhao et al. [19] developed an optimal unbiased finite impulse response filter, the estimator of which is used to minimize the mean square error subject to the unbiasedness constraint. In Reference [20], He et al. used the residual signal to detect sensor bias faults first and then employed residual analysis for faults isolation. Ge et al. [21] studied a performance comparison of the strong tracking filter and Kalman filter. In Reference [22], Zhao et al. designed an adaptive robust square-root cubature Kalman filter ($CKF$) with the noise statistic estimator to solve the decline or divergence problem of the accuracy of the $CKF$. Residual ${\chi}^{2}$ test is a system-level fault detection algorithm [23] that can judge whether a set of measurement information in the Kalman filter system is effective in real time, based on which interactive residual fault detection method was proposed to identify both hard and soft faults, which are similar to the jump type fault and the soft fault, respectively. Huang and He [24] utilized two Bernoulli random variables to model the dead-zone effect. In addition, a Tobit Kalman filter is brought forward to generate a residual that can indicate the occurrence of an intermittent fault. The time-domain uncensored Kalman filter state estimation methodology is applied to address short circuit faults and transient load conditions [25]. On the other hand, researchers have been working on non-filter-based fault diagnosis algorithms for decades. Lau et al. [26] proposed a centralized diagnosis algorithm based on end-to-end transmission time, and then they employed the Bayes’ classification to improve the diagnosis accuracy. In Reference [27], Nandi et al. used the Pearson test to find faulty nodes within a square grid based network. Gong et al. [28] proposed an optimal deducing model to locate faulty nodes based on the return message from the sink node. Jiang [29] developed a two-phase fault detection algorithm based on the majority voting. Panda and Khilar [30] applied a three Sigma test to locate faulty nodes. In Reference [31], Sharma et al. proposed a decentralized algorithm based on spatio-temporal correlation. In Reference [32], Swain et al. integrated the network cut set and majority voting to detect the faulty nodes. Recently, Samuel et al. [33] presented the design of a ${H}_{\infty}$ sliding mode and an unknown input observer for Takagi-Sugeno systems to deal with the problem of inexact measurements of the premise variables. In Reference [34], a data-driven system based on PCA was designed to detect and quantify fluid leaks in an experimental pipeline.

Although these fault diagnosis approaches are able to detect certain industrial system faults, it remains a challenge to design a fault diagnosis model that meets practical production requirements [35,36,37]. In addition, how to accurately detect faults with a low complexity is still a challenge.

#### Our Contribution

In this paper, we propose a novel low-complexity algorithm named $SFBT$ for fault detection in smart cities, which consists of a Strong Tracking Unscented Kalman Filter ($STUKF$), a Modified Bayes’ classification algorithm ($MB$), and a Modified Three Sigma test ($MTS$), aimed to detect both of attenuation type faults and jump type faults. Besides, as a special case of the hypothesis test, $SFBT$ can detect Byzantine faults [38]. Note that the message complexity of $SFBT$ is $O\left(n\right)$ such that the lifetime of a smart city network is extended. The details of our contributions are listed as follows:

- The $STUKF$, along with the $MB$, is applied to detect faulty cluster-heads in a centralized manner with a high accuracy; meanwhile, the messages exchanged are no more than $O\left(n\right)$ such that the corresponding energy cost is reduced.
- We develop $MTS$ to determine the faulty state of every cluster-member within each cluster of a faulty cluster-head, while the cluster with a fault-free cluster-head can easily make a decision by simply comparing the data between the cluster-head and cluster-members. By doing so, the diagnosis accuracy is increased significantly, and the message complexity, which is only $O\left(n\right)$, implies that the energy cost is further reduced.

The rest of the paper is organized as follows. The system model and notions are given in Section 2. The strategies are elaborated in Section 3. Section 4 gives the theoretical analysis on the performance of the $SFBT$. The validation results are presented in this section, as well. We conclude this paper in Section 5.

## 2. System Model

In this paper, a fault diagnosis model that meets practical production requirements for smart energy network of smart city is considered. The discrete state-space model of which is given as follows:
where $k\ge 0$ is a discrete time variable, $x\in {R}^{n}$ is a state vector, $z\in {R}^{m}$ is an output vector, $f:\phantom{\rule{4pt}{0ex}}{R}^{n}\to {R}^{n}$ and $h:\phantom{\rule{4pt}{0ex}}{R}^{n}\to {R}^{m}$ denote nonlinear transition functions, $\mathsf{\Gamma}\in {R}^{n\times n}$ and ${v}_{k}\in {R}^{n}$ and ${e}_{k}\in {R}^{m}$ are uncorrelated, zero-mean, Gaussian white noises with covariance ${V}_{k}$ and ${E}_{k}$, respectively. The meter of the smart energy network records consumption of electric energy in certain time intervals and communicates that information to the utility for monitoring and billing. Based on the normal and observed values of different meters, the data is modeled as a normal distribution with specific mean and standard deviation. Due to the harsh environment, each meter may malfunction with a probability p. In this paper, we aim to solve the fault detection problem with a high accuracy and a low message complexity in a nonlinear time-varying stochastic system. All symbols and descriptions throughout the paper are given in Table 1.

$$\left\{\begin{array}{c}x(k+1)=f\left(x\right(k\left)\right)+\mathsf{\Gamma}\left(k\right)v\left(k\right)\hfill \\ z(k+1)=h\left(x\right(k+1\left)\right)+e(k+1)\hfill \end{array},\right.$$

## 3. The Proposed Strategy

We consider a clustered smart energy network in this paper. Instead of testing each meter, we perform a centralized testing on cluster-heads only. Then, the fault state of cluster-members can be verified by corresponding fault-free cluster-heads in a decentralized manner. To be more specific, the $SFBT$ works in three phases. In the first phase, $SFBT$ finds the Minimal Dominating Set ($MDS$) of the network such that each dominator serves as a cluster-head while the corresponding dominatees become cluster-members. During the phase two, $SFBT$ applys both $STUKF$ and $MB$ to detect the faulty cluster-heads in a centralized manner. In the last phase, an $MTS$ is developed to determine the fault state of each cluster-member for decentralized fault diagnosis.

#### 3.1. $STUKF$- and $MB$-based Centralized Fault Diagnosis

In this section, we introduce the faulty cluster-heads detection algorithm, which is the second phase of the $SFBT$. Specifically, we apply an $STUKF$-based joint estimation method to predict the state and parameter of each cluster-head $C{H}_{i}$ first; then, the $MB$ is adopted to find the faulty ones.

#### 3.1.1. $STUKF$-based State Prediction

First, we introduce fault state related time-varying parameter $\theta \left(k\right)$. Then, joint estimation requires (1) to be rewritten as:

$$\left\{\begin{array}{c}x(k+1)=f\left(x\right(k\left)\right)+\mathsf{\Gamma}\left(k\right)v\left(k\right)\hfill \\ \theta (k+1)=\theta \left(k\right)\hfill \\ z(k+1)=h\left(x\right(k+1\left)\right)+\theta (k+1)+e(k+1)\hfill \end{array}.\right.$$

By applying joint estimation of states and parameters, faulty states can be detected. Then, we extent state, noise, and transition as follows:

$${x}_{e}(k+1)=\left[\begin{array}{c}x(k+1)\\ \theta (k+1)\end{array}\right],$$

$${\mathsf{\Gamma}}_{e}\left(k\right)=\left[\begin{array}{c}\mathsf{\Gamma}\left(k\right)\\ 0\end{array}\right],$$

$${f}_{e}\left({x}_{e}\left(k\right)\right)=\left[\begin{array}{c}f\left(x\right(k\left)\right)\\ \theta \left(k\right)\end{array}\right],$$

$${h}_{e}\left({x}_{e}(k+1)\right)=h\left({x}_{e}(k+1)\right)+\theta (k+1).$$

Substituting Equations (3)–(6) into Equation (2) yields:

$$\left\{\begin{array}{c}{x}_{e}(k+1)={f}_{e}\left({x}_{e}\left(k\right)\right)+{\mathsf{\Gamma}}_{e}\left(k\right)v\left(k\right)\hfill \\ \theta (k+1)=\theta \left(k\right)\hfill \\ z(k+1)={h}_{e}\left({x}_{e}(k+1)\right)+\theta (k+1)+e(k+1)\hfill \end{array}.\right.$$

#### 3.1.2. $MB$-based Faulty Cluster-Heads Identification

Once we obtain ${\widehat{x}}_{e}(k+1)$, the $MB$ is applied to find the faulty cluster-heads. First, we assume $\theta \left(k\right)$ obeys the normal distribution when the system is fault-free, which is ${\theta}_{i}\sim N({\theta}_{i}^{0},{\sigma}_{{\theta}_{i}^{0}}^{2})$. If the system suffers from the parameter exception, it can be determined by the following hypothesis test, where ${s}_{i}$ is fault-free if ${H}_{0}$ holds; otherwise, ${s}_{i}$ is faulty.
where

$$\begin{array}{c}{H}_{0}:{d}_{{\theta}_{i}}\left(k\right)\le {\beta}_{{\theta}_{i}},\end{array}$$

$$\begin{array}{c}{H}_{1}:{d}_{{\theta}_{i}}\left(k\right)>{\beta}_{{\theta}_{i}},\end{array}$$

$$\begin{array}{c}{\mu}_{{\theta}_{i}}\left(k\right)=\frac{1}{{N}_{1}}\sum _{j=1}^{{N}_{1}}{\widehat{\theta}}_{i}(k-j|k-j),\end{array}$$

$$\begin{array}{c}{\sigma}_{{\theta}_{i1}}^{2}\left(k\right)=\frac{1}{{N}_{1}-1}\sum _{j=1}^{{N}_{1}}{[{\widehat{\theta}}_{i}(k-j|k-j)-{\theta}_{i}^{0}]}^{2},\end{array}$$

$$\begin{array}{c}{\sigma}_{{\theta}_{i2}}^{2}\left(k\right)=\frac{1}{{N}_{1}-1}\sum _{j=1}^{{N}_{1}}{[{\widehat{\theta}}_{i}(k-j|k-j)-{\mu}_{{\theta}_{i}}\left(k\right)]}^{2},\end{array}$$

$$\begin{array}{c}{d}_{{\theta}_{i}}\left(k\right)=\frac{{\sigma}_{{\theta}_{i1}}^{2}\left(k\right)}{{\sigma}_{{\theta}_{i}^{0}}^{2}}-ln\frac{{\sigma}_{{\theta}_{i2}}^{2}\left(k\right)}{{\sigma}_{{\theta}_{i}^{0}}^{2}}-1.\end{array}$$

To be more specific, for each ${\widehat{\theta}}_{i}(k+1|k+1)$, the MB is applied to determine the faulty state of ${s}_{i}$. The meter ${s}_{i}$ suffers from the parameter bias fault at $k=\tau $, only if there exists a ${\theta}_{i}$ such that ${H}_{1}$ holds. For the jump type fault, we need to obtain an accurate fault amplitude as follows. If there exists an integer $M\in \{0,1,2,\cdots \}$ such that for each $T\in \{M{T}_{2}+1,\cdots ,M{T}_{2}+{T}_{2}\}$, $M{T}_{2}\ge \tau $, we have
then, at $k=M{T}_{2}+{T}_{2}$, the fault amplitude of ${s}_{i}$ is estimated to be

$$|{\widehat{\theta}}_{i}\left(T\right|T)-{\widehat{\theta}}_{i}(T-1|T-1)|<{\epsilon}_{i},$$

$$\Delta {\theta}_{i}=[\frac{1}{{T}_{2}}\sum _{T=M{T}_{2}+1}^{M{T}_{2}+{T}_{2}}{\widehat{\theta}}_{i}\left(T\right|T)]-{\widehat{\theta}}_{i}^{0}.$$

Note that if a smaller ${\epsilon}_{i}$ is chosen, then faults can be more accurately estimated, which requires much more time, and vice-versa.

#### 3.2. Modified Three Sigma Rule-based Fault Diagnosis

In this section, the modified three sigma rule is introduced first, based on which the corresponding decentralized fault diagnosis is given.

The data ${x}_{i}$ sensed by ${s}_{i}$ can be measured by taking both estimated mean ${\widehat{\mu}}_{i}$ and standard deviation ${\widehat{\sigma}}_{i}$ of the data collected from neighbors. Let ${t}_{i}$ represent the ratio between its deviation to the estimated mean and standard deviation as follows:

$${t}_{i}=\frac{{x}_{i}-{\widehat{\mu}}_{i}}{{\widehat{\sigma}}_{i}}.$$

According to the three sigma rule, if $|{t}_{i}|>3$, then ${s}_{i}$ is considered as faulty; otherwise, ${s}_{i}$ is fault-free. However, this rule is unable to detect faulty meters within a lower degree network. The reason is listed as follows. If there exist two suspicious meters ${s}_{i}$ and ${s}_{j}$ such that ${x}_{i}\gg {x}_{j}$, then ${s}_{j}$ may be considered fault-free, which is called the “masking” [30]. To solve such a problem, the Median ($Med$) of the data and the Median Absolute Deviation ($MAD$) are applied, instead of the ${\mu}_{i}$ and ${\sigma}_{i}$, respectively. This is because the actual data value deviate from the faulty meter readings such that the $Md$- and $MAD$-based fault detecting schemes can achieve better accuracy in results. Then, we give the modified three sigma rule as follows.

For a data set ${x}_{k}=\{{x}_{1},{x}_{2},\cdots ,{x}_{{n}_{k}}\}$ sorted in an ascending order, we have

$$M{d}_{i}=Med\left({x}_{i}\right)\left\{\begin{array}{cc}\frac{{x}_{m}+{x}_{m+1}}{2}\hfill & \hfill {n}_{i}=2m\\ {x}_{m}\hfill & \hfill otherwise\end{array}.\right.$$

The $MAD$ is defined as follows:

$$MAD({x}_{1},\cdots ,{x}_{{n}_{i}})=Med\left\{\right|{x}_{i}-M{d}_{i}\left|\right\}.$$

In fact, only the normalized $MAD$, which is
is suitable to replace Standard Deviation ($SD$) [30]. Then, the absolute error for the modified three sigma test is defined as follows:

$$MADN\left({x}_{i}\right)=\frac{Med\left\{\right|{x}_{i}-M{d}_{i}\left|\right\}}{0.675},$$

$${t}_{i}^{r}=\frac{{x}_{i}-M{d}_{i}}{MADN\left({x}_{i}\right)}.$$

It is apparent that the modified three sigma rule defers from the original one only in the choice of the absolute error.

Next, we give the corresponding decentralized fault diagnosis algorithm as follows:

Step 1: For each cluster ${C}_{i}$ with a fault-free cluster-head $C{H}_{i}$, if $|{x}_{j}-{x}_{i}|\le \eta $, then ${s}_{j}\in {C}_{i}$ is fault-free; otherwise, ${s}_{j}$ is faulty.

Step 2: For each cluster ${C}_{j}$ with a faulty cluster-head $C{H}_{j}$, the $MTS$ is applied to ${C}_{j}^{{}^{\prime}}={C}_{j}\backslash \left\{C{H}_{j}\right\}$ such that the fault state of each cluster-member within ${C}_{j}$ is detected.

By doing so, we can determine the fault states of at least $\left|MDS\right|\u25b3(1-{p}_{r})$ meters, where △ represents the maximum degree of graph G. The theoretical analysis on the diagnosis accuracy and message complexity is given in the next section.

## 4. Performance Analysis

#### 4.1. Theoretical Analysis

In this section, we analyze the performance of the $SFBT$ in terms of: (I) Detection Accuracy (abbreviated as $DA$, which is the ratio between number of faulty meters detected as faulty and the total number of faulty meters); (II) False Alarm Rate (abbreviated as $FAR$, which is the number of fault-free meters detected as faulty to the total number of fault-free meters); (III) False Positive Rate (abbreviated as $FPR$, which is the ratio of the number of faulty meters detected as fault-free to the total number of faulty meters); and (IV) message complexity.

Let ${p}_{r}$ be the probability of a meter being faulty, p be the probability that a faulty meter detected as fault-free, and ${p}^{{}^{\prime}}$ denote the accuracy of the $SFBT$, respectively. Then, the following three theorems hold.

**Theorem**

**1.**

The $DA$ of the proposed algorithm $SFBT$ is

$$D{A}_{SFBT}=\left\{\begin{array}{cc}\frac{\left|MDS\right|\left[\right(1-p)\u25b3+1]}{n},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{\left|MDS\right|\{\u25b3[2(1-{p}_{r})+p]+1\}}{n},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

**Proof.**

The $SFBT$ detects faulty meters in two phases. In the first phase, there are up to $\left|MDS\right|(1-{p}_{r})$ fault-free cluster-heads detected along with $\left|MDS\right|{p}_{r}$ faulty ones. That implies there are at most $\left|MDS\right|(1-{p}_{r})\u25b3{p}_{r}$ faulty cluster-members correctly diagnosed within corresponding clusters. According to the modified sigma rule, it is apparent that, if ${t}_{i}^{r}\le 3$, then the detection accuracy is according to the $DA$ of the three sigma test given by Panda [38], which is

$$DA=\left\{\begin{array}{cc}{p}_{r}-p,\hfill & \hfill {t}_{i}^{r}\le 3\\ (1-{p}_{r})+p,\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

That suggests we can correctly detect $\left|MDS\right|{p}_{r}\u25b3$ faulty cluster-members. Therefore, the $DA$ of $SFBT$ is deduced as
where $\left|MDS\right|\le \frac{1+ln(\delta +1)}{\delta +1}n$ [40], n denotes the number of meters, and the minimum and the maximum degree of graph G is denoted by $\delta $ and △, respectively. □

$$D{A}_{SFBT}=\left\{\begin{array}{cc}\frac{\left|MDS\right|\left[\right(1-p)\u25b3+1]}{n},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{\left|MDS\right|\{[2(1-{p}_{r})+p]\u25b3+1\}}{n},\hfill & \hfill {t}_{i}^{r}>3\end{array},\right.$$

**Theorem**

**2.**

The $FPR$ of the algorithm $SFBT$ is

$$FP{R}_{SFBT}=\left\{\begin{array}{cc}\frac{n({p}_{r}+p)-\left|MDS\right|[1+(1-{p}_{r})\u25b3]}{n},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{n(1-{p}_{r}-p)-\left|MDS\right|[1+(1-{p}_{r})\u25b3]}{n},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

**Proof.**

Similar to the proof of Theorem 5.1, the $SFBT$ can correctly detect up to $\left|MDS\right|{p}_{r}$ faulty cluster-heads. With the help of fault-free cluster-heads, there are at most $\left|MDS\right|{p}_{r}(1-{p}_{r})\u25b3$ faulty cluster-members detected as faulty ones. According to literature [38], the $FPR$ of the three sigma test is

$$FPR=\left\{\begin{array}{cc}{p}_{r}+p,\hfill & \hfill {t}_{i}^{r}\le 3\\ 1-({p}_{r}+p),\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

Therefore, we can deduce that the $FPR$ of the $SFBT$ is
$$FP{R}_{SFBT}=\left\{\begin{array}{cc}\frac{n({p}_{r}+p)-\left|MDS\right|[1+(1-{p}_{r})\u25b3]}{n},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{n(1-{p}_{r}-p)-\left|MDS\right|[1+(1-{p}_{r})\u25b3]}{n},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$
□

**Theorem**

**3.**

The $FAR$ for the algorithm $SFBT$ is

$$FA{R}_{SFBT}=\left\{\begin{array}{cc}\frac{n{p}_{r}-\left|MDS\right|(1-{p}_{r})[1+\u25b3(1-{p}_{r})]}{n(1-{p}_{r})},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{np-\left|MDS\right|(1-{p}_{r})[1+\u25b3(1-{p}_{r})]}{n(1-{p}_{r})},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

**Proof.**

Similar to the proofs of previous theorems, there are up to $\left|MDS\right|(1-{p}_{r})$ fault-free cluster-heads correctly detected by the $SFBT$. That implies the fault states of corresponding cluster-members, which are as many as $\left|MDS\right|(1-{p}_{r})\u25b3(1-{p}_{r})$ meters, are easily verified. The $FAR$ of the three sigma test is given in Reference [38] as follows:

$$FAR=\left\{\begin{array}{cc}\frac{{p}_{r}}{1-{p}_{r}},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{p}{1-{p}_{r}},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$

There is no doubt that the $FAR$ of the $SFBT$ is
$$FA{R}_{SFBT}=\left\{\begin{array}{cc}\frac{n{p}_{r}-\left|MDS\right|(1-{p}_{r})[1+\u25b3(1-{p}_{r})]}{n(1-{p}_{r})},\hfill & \hfill {t}_{i}^{r}\le 3\\ \frac{np-\left|MDS\right|(1-{p}_{r})[1+\u25b3(1-{p}_{r})]}{n(1-{p}_{r})},\hfill & \hfill {t}_{i}^{r}>3\end{array}.\right.$$
□

**Theorem**

**4.**

The message complexity of the algorithm $SFBT$ is $O\left(n\right)$.

**Proof.**

Since the $SFBT$ is a two-phase strategy, we analyze the complexity of each phase to prove this theorem. Since the centralized diagnosis part of the $SFBT$ requires sampling, the corresponding message complexity depends on the number of sample, which equals to the data window ${N}_{1}$. In fact, the data window can be chosen as a constant such that the number of messages exchanged is less than $\left|MDS\right|{N}_{1}$. That implies complexity of the centralized part of the $SFBT$ is $O\left(n\right)$. On the other hand, for an $MTS$-based strategy, each fault-free $C{H}_{i}$ obtains the broadcasting data of every cluster-member, while meters within each ${C}_{i}$ of a faulty $C{H}_{i}$ broadcast data to each other. That implies the complexity of the decentralized part of the $SFBT$ is $O\left(n\right)$. Therefore, the message complexity of the $SFBT$ is no more than $O\left(n\right)$. □

It is obvious that $SFBT$ is low-complexity when compared with contemporaneous fault diagnosis algorithms.

#### 4.2. Validation Experiment

The performance of the $SFBT$ is given in this section. First, the diagnosis accuracy for faulty cluster-header detection utilizing $STUKF$ and $MB$ only is validated through experiments. Then, we compare the $SFBT$ with the existing DFD [30], Jiang [28], and DSFD [38] in terms of detection accuracy ($DA$), false positive rate ($FPR$), and false alarm rate ($FAR$), respectively.

The resistor of a meter easily suffers from the jump type fault. Suppose the resistor gets faulty at $k=1000$. Let ${\theta}^{0}=10\phantom{\rule{3.33333pt}{0ex}}k\mathsf{\Omega}$, ${\sigma}_{{\theta}^{0}}^{2}=1000$, ${N}_{1}=20$, ${\beta}_{\theta}=4000$, $\epsilon =0.1$, and ${T}_{2}=50$.

Figure 1 and Figure 2 show that the fault is detected at $k=1030$ and it is isolated at $k=1250$ with the fault amplitude equal to $4.8\phantom{\rule{3.33333pt}{0ex}}k\mathsf{\Omega}$. That implies the detection accuracy is about $1-\frac{4.8}{4000}=99.88\%$.

The capacitor is subject to the attenuation type fault, which is a case of the incipient fault. Now, we show the proposed strategy can detect the attenuation type fault. Let ${\theta}^{0}=1000\phantom{\rule{3.33333pt}{0ex}}pF$, ${\sigma}_{{\theta}^{0}}^{2}=0.1$, ${N}_{1}=40$, ${\beta}_{\theta}=\mathrm{500,000}$. Suppose the degradation of the capacitor begins with $\theta (k+1)=\frac{\theta \left(k\right)-1}{2}$ at $k=600$.

Figure 3 and Figure 4 show that, the attenuation type fault has been detected at about $step=1110$ with the magnitude about $235\phantom{\rule{3.33333pt}{0ex}}pF$. That implies the detection accuracy is $1-\frac{235}{1000}=97.65\%$.

In Figure 5, it is clear that the detection accuracy of each strategy drops with the increase of the fault probability. However, the $SFBT$ achieves higher detection accuracy than the others.

Although the false alarm rate and the false positive rate increase with the fault probability (see Figure 6 and Figure 7), the $SFBT$ still outperforms baseline approaches.

To sum up, the $SFBT$ performs better than DFD [41], Jiang [29], and DSFD [30] in terms of detection accuracy, false positive rate, and false alarm rate, respectively. The reason for that is the $SFBT$ employs fault-free cluster-heads, which are located by the strong tracking UKF and the modified Bayes’ classification algorithm, to detect the fault states of cluster-members through the modified three sigma test.

## 5. Conclusions

The optimized power consumption for a smart energy system of a smart city is crucial. Although a variety of fault diagnosis algorithms have been developed, how to efficiently detect faults with low complexity poses a great challenge. To this end, we propose the $SFBT$ that works on a clustered network. It first applies a strong tracking $UKF$ ($STUKF$) and a modified Bayes’ classification algorithm ($MB$) to detect fault-free cluster-heads in a centralized manner. Then, a decentralized modified three Sigma test ($MTS$) is developed to identify faulty cluster-members to overcome the “masking” problem within the cluster with a faulty cluster-head. Besides, a faulty-free cluster-head can detect any faulty cluster-member by simply comparing the data of its own and the one of the cluster-member. The theoretical analysis and experiment results indicate that the $SFBT$ achieves higher diagnosis accuracy over some contemporary strategies with a complexity of $O\left(n\right)$.

In fact, each faulty free cluster-head detected by the $SFBT$ is only one hop away from corresponding cluster-members. That suggests data collected by the cluster-head do not differ much from that of cluster-members, which makes the three Sigma test work. However, by doing so, more cluster-heads are required. Thus, our future work includes how to detect faulty cluster-members at least two hops away with a complexity no more than $O\left(n\right)$.

## Author Contributions

Conceptualization, J.W.; methodology, H.Z. (Hua Zhang); software, D.L.; validation, H.F.; formal analysis, T.W.; investigation, H.Z. (Hongyan Zhang); writing—review and editing, X.W. All authors have read and agreed to the published version of the manuscript.

## Funding

This research was funded by National Nature Science Foundation of China under Nos.61702103, 61871204, 61702103, and 61703195, Industrial Robot Application of Fujian University Engineering Research Center (Minjiang University) under No. MJUKF-IRA1906, Guiding Project of Fujian Science and Technology Department under No.2019H0026, Fujian Science and Technology Department Project under Nos.2017J01768 and 2017J01506, Fujian Natural Science Foundation Grant No. 2019J01756, and Fuzhou Technology Planning Program Grant No.2018-G-96.

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Wang, K.; Gu, L.; He, X.; Guo, S.; Sun, Y.; Vinel, A.; Shen, J. Distributed energy management for vehicle-to-grid networks. IEEE Netw.
**2017**, 31, 22–28. [Google Scholar] [CrossRef] - Hu, X.; Wang, K.; Liu, X.; Sun, Y.; Li, P.; Guo, S. Energy management for EV charging in software-defined green vehicle-to-grid network. IEEE Commun. Mag.
**2018**, 56, 156–163. [Google Scholar] [CrossRef] - Xu, C.; Wang, K.; Li, P.; Xia, R.; Guo, S.; Guo, M. Renewable energy-aware big data analytics in geo-distributed data centers with reinforcement learning. IEEE Trans. Netw. Sci. Eng.
**2018**. [Google Scholar] [CrossRef] - Wang, K.; Ouyang, Z.; Krishnan, R.; Shu, L.; He, L. A game theory based energy management system using price elasticity for smart grids. IEEE Trans. Ind. Inform.
**2015**, 11, 1607–1616. [Google Scholar] [CrossRef] - Wang, K.; Wang, Y.; Sun, Y.; Guo, S.; Wu, J. Green industrial Internet of Things architecture: an energy-efficient perspective. IEEE Commun. Mag.
**2016**, 54, 48–54. [Google Scholar] [CrossRef] - Mohanty, S.P.; Uma, C.; Elias, K. Everything you wanted to know about smart cities: The internet of things is the backbone. IEEE Consum. Electron. Mag.
**2016**, 5, 60–70. [Google Scholar] [CrossRef] - Wang, K.; Yu, J.; Yu, Y.; Qian, Y.; Zeng, D.; Guo, S.; Xiang, Y.; Wu, J. A survey on Energy Internet: architecture, approach and emerging technologies. IEEE Syst. J.
**2018**, 12, 2403–2416. [Google Scholar] [CrossRef] - Wang, K.; Hu, X.; Li, H.; Li, P.; Zeng, D.; Guo, S. A survey on energy Internet communications for sustainability. IEEE Trans. Sustain. Comput.
**2017**, 2, 231–254. [Google Scholar] [CrossRef] - Wang, K.; Li, H.; Feng, Y.; Tian, G. Big data analytics for system stability evaluation strategy in the energy Internet. IEEE Trans. Ind. Inform.
**2017**, 13, 1969–1978. [Google Scholar] [CrossRef] - Gao, M.; Wang, K.; He, L. Probabilistic model checking and scheduling implementation of energy router system in energy internet for green cities. IEEE Trans. Ind. Inform.
**2018**, 14, 1501–1510. [Google Scholar] [CrossRef] - He, X.; Wang, K.; Huang, H.; Liu, B. QoE-driven big data architecture for smart city. IEEE Commun. Mag.
**2018**, 56, 88–93. [Google Scholar] [CrossRef] - Maksimovic, M. Greening the Future: Green Internet of Things (G-IoT) as a Key Technological Enabler of Sustainable Development. In Internet of Things and Big Data Analytics Toward Next-Generation Intelligence; Springer: Berlin, Germany, 2018; pp. 283–313. [Google Scholar]
- Wang, K.; Shao, Y.; Xie, L.; Wu, J.; Guo, S. Adaptive and fault-tolerant data processing in healthcare iot based on fog computing. IEEE Trans. Netw. Sci. Eng.
**2018**. [Google Scholar] [CrossRef] - Wang, K.; Shao, Y.; Shu, L.; Zhu, C.; Zhang, Y. Mobile big data fault-tolerant processing for eHealth networks. IEEE Netw.
**2016**, 30, 1–7. [Google Scholar] [CrossRef] - Liu, Y.; Wang, K.; Lin, Y.; Xu, W. LightChain: a lightweight blockchain system for industrial internet of things. IEEE Trans. Ind. Inform.
**2019**, 15, 3571–3581. [Google Scholar] [CrossRef] - Ma, J.; Lu, C.; Liu, H. Fault Diagnosis for the Heat Exchanger of the Aircraft Environmental Control System Based on the Strong Tracking Filter. PLoS ONE
**2015**, 10, e0122829. [Google Scholar] [CrossRef] - Daroogheh, N.; Meskin, N.; Khorasani, K. A Dual Particle Filter-Based Fault Diagnosis Scheme for Nonlinear Systems. IEEE Trans. Control Syst. Technol.
**2017**, 26, 1317–1334. [Google Scholar] [CrossRef] - Yin, S.; Zhu, X. Intelligent Particle Filter and Its Application on Fault Detection of Nonlinear System. IEEE Trans. Ind. Electron.
**2015**, 62, 3852–3861. [Google Scholar] [CrossRef] - Zhao, S.; Shmaliy, Y.; Liu, F. Fast Kalman-Like Optimal Unbiased FIR Filtering with Applications. IEEE Trans. Signal Process.
**2016**, 64, 2284–2297. [Google Scholar] [CrossRef] - He, X.; Wang, Z.; Liu, Y.; Qin, L.; Zhou, D. Fault Tolerant Control for an Internet-Based Three-Tank System: Accommodation to Sensor Bias Faults. IEEE Trans. Ind. Electron.
**2016**, 64, 2266–2275. [Google Scholar] [CrossRef] - Ge, Q.; Shao, T.; Chen, S.; Wen, C. Carrier Tracking Estimation Analysis by Using the Extended Strong Tracking Filtering. IEEE Trans. Ind. Electron.
**2017**, 64, 1415–1424. [Google Scholar] [CrossRef] - Zhao, L.; Wang, J.; Yu, T.; Jian, H.; Liu, T. Design of adaptive robust square-root cubature Kalman filter with noise statistic estimator. Appl. Math. Comput.
**2015**, 256, 352–367. [Google Scholar] [CrossRef] - Zhang, L.; Cui, Y.; Xiong, Z.; Liu, J.; Lai, J.; Lv, P. Research on Adaptive Multi-Source Information Fault-Tolerant Navigation Method Based on No-Reference System Diagnosis. Sensors
**2019**, 19, 2911. [Google Scholar] [CrossRef] - Huang, J.; He, X. Detection of Intermittent fault for discrete-time systems with output dead-zone: a variant Tobit Kalman filtering approach. J. Control Sci. Eng.
**2017**, 2017, 9. [Google Scholar] [CrossRef] - Cisneros-Magaña, R.; Medina, A.; Anaya-Lara, O. Time-domain voltage sag state estimation based on the unscented Kalman filter for power systems with nonlinear components. Energies
**2018**, 11, 1411. [Google Scholar] - Lau, B.C.; Ma, E.W.; Chow, T.W. Probabilistic fault detector for wireless sensor network. Expert Syst. Appl.
**2014**, 41, 3703–3711. [Google Scholar] [CrossRef] - Nandi, M.; Nayak, A.; Roy, B.; Sarkar, S. Hypothesis testing and decision theoretic approach for fault detection in wireless sensor networks. Int. J. Parallel Emerg. Distrib. Syst.
**2015**, 30, 262–285. [Google Scholar] [CrossRef] - Gong, W.; Liu, K.; Liu, Y. Directional diagnosis for wireless sensor networks. IEEE Trans. Parallel Distrib. Syst.
**2015**, 26, 1290–1300. [Google Scholar] [CrossRef] - Jiang, P. A new method for node fault detection in wireless sensor networks. Sensors
**2009**, 9, 1282–1294. [Google Scholar] [CrossRef] - Panda, M.; Khilar, P.M. Distributed self fault diagnosis algorithm for large scale wireless sensor networks using modified three sigma edit test. Ad Hoc Netw.
**2015**, 25, 170–184. [Google Scholar] [CrossRef] - Sharma, K.P.; Sharma, T.P. rDFD: reactive distributed fault detection in wireless sensor networks. Wirel. Netw.
**2016**, 23, 1145–1160. [Google Scholar] [CrossRef] - Swain, R.R.; Dash, T.; Khilar, P.M. An effective graph-theoretic approach towards simultaneous detection of fault(s) and cut(s) in wireless sensor networks. Int. J. Commun. Syst.
**2017**, 30, e3273. [Google Scholar] [CrossRef] - Gómez-Peñate, S.; Valencia-Palomo, G.; López-Estrada, F.R.; Astorga-Zaragoza, C.-M.; Osornio-Rios, R.A.; Santos-Ruiz, I. Sensor fault diagnosis based on a H
_{∞}sliding mode and unknown input observer for takagi-sugeno systems with uncertain premise variables. Asian J. Control**2019**, 21, 339–353. [Google Scholar] - Santos-Ruiz, I.D.L.; López-Estrada, F.R.; Puig, V.; Pérez-Pérez, E.J.; Mina-Antonio, J.D.; Valencia-Palomo, G. Diagnosis of Fluid Leaks in Pipelines Using Dynamic PCA. IFAC-PapersOnLine
**2018**, 51, 373–380. [Google Scholar] [CrossRef] - Yi, J.; Huang, D.; He, H.; Zhou, W.; Han, Q.; Li, T. A novel framework for fault diagnosis using kernel partial least squares based on an optimal preference matrix. IEEE Trans. Ind. Electron.
**2017**, 64, 4315–4324. [Google Scholar] [CrossRef] - Jiang, G.; He, H.; Xie, P.; Tang, Y. Stacked multilevel-denoising autoencoders: A new representation learning approach for wind turbine gearbox fault diagnosis. IEEE Transt. Instrum. Meas.
**2017**, 66, 2391–2402. [Google Scholar] [CrossRef] - Yi, J.; Huang, D.; Fu, S.; He, H.; Li, T. Optimized relative transformation matrix using bacterial foraging algorithm for process fault detection. IEEE Trans. Ind. Electron.
**2016**, 63, 2595–2605. [Google Scholar] [CrossRef] - Panda, M.; Khilar, P.M. Distributed Byzantine fault detection technique in wireless sensor networks based on hypothesis testing. Comput. Electr. Eng.
**2015**, 48, 270–285. [Google Scholar] [CrossRef] - Deng, F.; Yang, H.; Wang, L. Adaptive Unscented Kalman Filter Based Estimation and Filtering for Dynamic Positioning with Model Uncertainties. Int. J. Control Autom. Syst.
**2019**, 17, 667–678. [Google Scholar] [CrossRef] - Clark, W.E.; Shekhtman, B.; Suen, S.; Fisher, D.C. Upper bounds for the domination number of a graph. Congr. Numer.
**1998**, 132, 99–123. [Google Scholar] - Panda, M.; Khilar, P.M. Distributed soft fault detection algorithm in wireless sensor networks using statistical test. In Proceedings of the 2012 2nd IEEE International Conference on Parallel, Distributed and Grid Computing, Solan, India, 6–8 Decemebr 2012. [Google Scholar]

**Figure 1.**Real value vs. estimated value of Strong Tracking Unscented Kalman Filter ($STUKF$) for jump type fault.

**Figure 2.**$STUKF$ and Modified Bayes’ classification algorithm ($MB$)-based detection for jump type fault.

Symbols | Descriptions |
---|---|

${C}_{i}$ | i-th cluster |

$C{H}_{i}$ | i-th cluster-head of ${C}_{i}$ |

${s}_{i}$ | i-th meter |

$Ne{g}_{i}$ | Set of neighboring meters of ${s}_{i}$ |

$x\left(k\right)$ | State of a meter at time k |

${x}_{i}$ | data of ${s}_{i}$ |

${\widehat{x}}_{i}$ | Estimated value of ${x}_{i}$ |

${\mu}_{i}$ | Sample mean of ${s}_{i}$ |

${\sigma}_{i}$ | Sample variance of ${s}_{i}$ |

$Med$ | Median of the data |

$MAD$ | Median Absolute Deviation |

$SD$ | Standard Deviation |

$\eta $ | Threshold for identifying fault state |

${p}_{r}$ | Probability of a meter being faulty |

p | Probability of a faulty meter detected as fault-free |

$STUKF$ | Strong Tracking Unscented Kalman Filter |

$MB$ | Modified Bayes’ classification algorithm |

$MTS$ | Modified Three Sigma test |

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).