Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix

Zhang, Dapeng; Lin, Zhiling; Gao, Zhiwei

doi:10.3390/s22155559

Open AccessArticle

Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix

by

Dapeng Zhang

¹

,

Zhiling Lin

^2,* and

Zhiwei Gao

³

¹

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

²

School of Electrical Engineering, Tianjin University of Technology, Tianjin 300384, China

³

Faculty of Engineering and Environment, University of Northumbria, Newcastle upon Tyne NE2 8ST, UK

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(15), 5559; https://doi.org/10.3390/s22155559

Submission received: 25 May 2022 / Revised: 20 July 2022 / Accepted: 21 July 2022 / Published: 25 July 2022

(This article belongs to the Collection Recent Advances in Fault Diagnostics, Prognostics, and Intelligent Condition-Based Maintenance)

Download

Browse Figures

Versions Notes

Abstract

:

The board channel is a connection between a data acquisition system and the sensors of a plant. A flawed channel will bring poor-quality data or faulty data that may cause an incorrect strategy. In this paper, a data-driven approach is proposed to detect the status of the enclosed board channel based on an error time series obtained from multiple excitation signals and internal register values. The critical faulty data, contrary to the known healthy data, are constructed by using a null matrix with maximum projection and are labelled as training examples together with healthy data. Finally, the status of the enclosed board channel is validated by a well-trained probabilistic neural network. The experimental results demonstrate the effectiveness of the proposed method.

Keywords:

fault detection and diagnosis; board channel; probabilistic neural network

1. Introduction

Data acquisition systems play a vital role in the data collection of industry [1]. Among them, the board tunnel, which is usually classified as analog input (AI), analog output (AO), digital input (DI), and digital output (DO) modules, is a bridge between the processor and sensors, which ensures the data conversion at the physical level [2]. The tunnel board is made up of enclosed circuit boards that are convenient to be replaced immediately once they are found to have any faults occur due to security reasons. In order to detect the inertial faults of these circuit boards in time, most famous products, such as Siemens, Honeywell, etc., have provided error codes to help operators [3,4,5]. However, these codes are limited to meeting the requirements of board channel diagnosis in a practical complex application.

Different kinds of methods for fault detection and diagnosis (FDD) have been developed, which are classified as model-based approaches, signal-based approaches, and data-driven approaches [6,7]. In model-based approaches, the fault diagnosis algorithms are developed to monitor the consistency between the measured outputs of the practical systems and the model-predicted outputs, which are based on an appropriate model, whether a physical model or equivalent model. Reference [8] proposed a new method by combining the model-based FDD method and the support vector machine (SVM) method. In reference [9], the spindle modes are determined through a three-step procedure in order to overcome these issues of the low number of sensors and the presence of many harmonics in the measured signals and to extract the characteristics of the system. In reference [10], based on the information of fault-free data series, fault detection was promptly implemented by comparison with the model forecast and real-time process. Signal-based approaches include time-domain analysis, frequency-domain analysis, and both together. Reference [11] proposed a novel “frequency-domain damping design” using a high-pass filter for acceleration-based bilateral control (ABC) based on modal space analysis. In reference [12], a unified measurement model was utilized to simultaneously characterize both the phenomena of multiple communication delays and data missing, and a novel residual matching (RM) approach was developed to isolate and estimate the fault once it is detected. Reference [13] proposed a least squares support vector machine (LS-SVM) model optimized by cross validation to implement FDD on a 90-ton centrifugal chiller. Reference [14] investigated the achievable rates of frequency-division-duplex massive MIMO systems with spatially correlated channels. In fact, it is difficult for the board tunnel to build an appropriate model since different boards have different circuit structures. It is also a challenge to obtain the features of integer signals, especially for the fault cases, because the flawed board tunnel will be quickly replaced for safety reasons.

Board tunnels always work on a standard enclosed module, which prevents the circuit from being affected by external factors. This enclosed module is also suitable for quick disassembly or replacement. However, as a double-edged sword, this method introduces issues for fault detection and diagnosis because it loses the ability to directly observe internal states. The data-driven approach [15,16,17,18,19,20] provides a feasible way to solve this problem by external observation data. Reference [15] aimed to provide a state-of-the-art overview on the existing fault diagnosis, prognosis, and resilient control methods and techniques for wind turbine systems, with which great success has been achieved in fault detection and diagnosis. Reference [16] focused on data-driven techniques in the digital era and data analytics in all areas, including process industries. Reference [17] proposed a new data-driven FDD method, which was named probability-relevant PCA (PRPCA), for electrical drives in high-speed trains. In reference [18], a fault diagnosis method based on a deep convolutional neural network model consisting of convolutional layers, pooling layers, dropout, and fully connected layers was proposed for chemical process fault diagnosis. In reference [19], an extended deep belief network (EDBN) was proposed to fully exploit useful information in the raw data. Reference [20] presented a Special Issue on “data-driven approaches for complex industrial systems”. Using a data-driven approach to the board tunnel detection, two obstacles should be overcome: (1) A healthy board shows certain differences in response to the process conditions, working environment, and internal parameters. This dispersivity is difficult to be covered by limited sample data. (2) Generally, the stability of a data acquisition system is generally high, and there are few failures; even if a failure occurs, it will be replaced quickly in order to achieve safety. Therefore, there are almost no historical faulty data.

From the view of board performance, the healthy data will obey the law of health probability distribution, though the healthy data are dispersive in different working environments. Some excellent methods based on probability analysis, such as the conditional probability distributions, Bayesian network, etc., have been reported in chemical processes [20,21,22,23]. Motivated by the probability idea based on the concept that the acquired data signal is regarded as a realization of the distribution of the board, a probabilistic neural network (PNN) is proposed based on critical faulty data being artificially constructed to distinguish between healthy states and faulty states. Firstly, multiple data sources are applied to activate conditions on the board tunnel, and the internal register values are obtained by OPC technology. Then, the error time series are constructed to analyze the healthy state of the enclosed board channel. The critical faulty data are constructed based on the healthy data by using a null matrix with maximum projection. Finally, the healthy state of the enclosed board channel is judged by a probabilistic neural network. The advantages of the proposed approach are summarized as follows:

(1): Multiple input signals are proposed to activate the working state of the board tunnel, which extends the scope of exploration for the dispersivity of a healthy board concerning the working environment and internal parameters.
(2): The critical faulty data are successfully constructed by using the null matrix based on the health data, which overcomes the difficulty of lacking faulty data.
(3): The PNN is used to adapt to the law of probability hidden in the time series, and case studies verify the effectiveness.

The remainder of this article is organized as follows. In Section 2, the acquisition of error time series and the relationship between multiple input signals and overall performance of the board tunnel are given. Section 3 describes the proposed approach, including the probability neural network, the construction of critical faulty data, the structure, and the workflow. The case studies are illustrated in Section 4, followed by conclusions in Section 5.

2. The Error Time Series of Board Tunnel

The error between input signal and output (memory) mainly affected by internal factors of the board is regarded as a comprehensive index reflecting the performance of the board tunnel. A single sample is meaningless for evaluating the board performance because it is an instance and not enough to observe the law of probability. Thus, an error time series is taken as an analysis object of the enclosed board tunnel, and the error time series is obtained, as shown in Figure 1.

Let the input signal of the board tunnel be

{\{x (k)\}}_{k = 1}^{\infty}

and the value of the corresponding memory be

{\{y (k)\}}_{k = 1}^{\infty}

; thus, the error time series is

{\{z (k)\}}_{k = 1}^{\infty} = {\{y (k)\}}_{k = 1}^{\infty} - {\{x (k)\}}_{k = 1}^{\infty}

(1)

where is the sampling time. Formula (1) is abbreviated as Formula (2) by using x, y, z instead of

{\{x (k)\}}_{k = 1}^{\infty}

,

{\{y (k)\}}_{k = 1}^{\infty}

,

{\{z (k)\}}_{k = 1}^{\infty}

.

z = y - x

(2)

Notice that is the converted data of input signal x according to the physical meaning of the board channel, and z is regarded as a probability model of noisy influences that follows a normal distribution with a form of Formula (3):

z ∽ N (μ_{b o a r d}, σ_{b o a r d}^{2})

(3)

where

μ_{b o a r d}

and

σ_{b o a r d}

are an expectation and a variance for the board, respectively.

It is worth noting that if the board input

x

is enough to cover all the work conditions and influences of the environment, the expectation

μ_{b o a r d}

is equal to the mean, which ideally satisfies

\{\begin{matrix} \begin{matrix} μ_{b o a r d} = 0 & y = x \end{matrix} \\ \begin{matrix} μ_{b o a r d} \neq 0 & y \neq x \end{matrix} \end{matrix}

. Thus, thereafter, we use the mean instead of the expectation.

In fact, different input signals will cause some changes due to the influence of the environment and internal parameters. Figure 2 releases the error time series of a healthy board channel under three kinds of different input signals.

Each input signal that is long enough will produce its own probability distribution laws with a form of

z_{i} ∽ N (μ_{i}, σ_{i}^{2})

(4)

where μ_i and σ_i are the mean and the variance under the i-th input signal. It is inevitable for some deviations to occur between

μ_{b o a r d}

and

μ_{i}

. From the view of fault detection and diagnosis, the board tunnel is considered to be in a healthy state as long as

μ_{b o a r d}

is within the allowable range. However, these deviations between

μ_{b o a r d}

and

μ_{i}

will disturb the judgment of healthy states due to the limitation of the sampling data number. In order to establish the relation between sampling data and board performance, it is assumed that the mean

μ_{b o a r d}

is equal to the mean of different input signals, that is,

μ_{b o a r d} = \frac{1}{n} \sum_{i = 1}^{n} μ_{i}

(5)

Lemma 1.

The mean μ_board and variance

σ_{b o a r d}^{2}

of the sampling data series

z

satisfying normal distribution can be replaced by

m

sub-sampling data whose mean is

μ_{m} (i), i = 1, 2, \dots, m

and whose variance is

σ_{m}^{2} (i), i = 1, 2, \dots, m

. That is,

μ_{b o a r d} = \frac{1}{m} \sum_{i = 1}^{m} μ_{m} (i)

(6)

σ_{b o a r d}^{2} = \frac{1}{m - 1} \sum_{i = 1}^{m} σ_{m}^{2} (i)

(7)

Proof.

For the data series

z ~ N (μ_{b o a r d}, σ_{b o a r d}^{2})

that follows normal distribution with a mean

μ_{b o a r d}

and variance

σ_{b o a r d}^{2}

, suppose the data series z has enough data of

n

samples to reflect the statistical characteristics of a whole. The unbiased estimate of

μ_{b o a r d}

is

\bar{z}

, and the unbiased estimate of

σ_{b o a r d}^{2}

is obtained according to

σ_{b o a r d}^{2} = \frac{1}{n - 1} \sum_{i = 1}^{n} {(z_{i} - \bar{z})}^{2}

(8)

□

Consider the relation of the mean between the whole and sub-sampling data. Let the

n

samples be divided into

m

groups with the mean and the length of the k-th group being

{\bar{z}}_{m} (k)

and

L_{m} (k)

:

{\bar{z}}_{m} (k) = \frac{1}{L_{m} (k)} \sum_{i = 1}^{L_{m} (k)} z_{i}

(9)

Thus, the mean of a whole is

\frac{1}{m} \sum_{k = 1}^{m} {\bar{z}}_{m} (k) = \frac{1}{m} \sum_{k = 1}^{m} \frac{1}{L_{m} (k)} \sum_{i = 1}^{L_{m} (k)} z_{i} = \frac{1}{\sum_{k = 1}^{m} L_{m} (k)} \sum_{k = 1}^{m} \sum_{i = 1}^{L_{m} (k)} z_{i} = \frac{1}{n} \sum_{i = 1}^{n} z_{i} = \bar{z}

(10)

Formula (10) shows the unbiased estimate of

\bar{z}

. Therefore,

μ_{b o a r d}

can be estimated by the above formula.

For a variance, it is well known that the sample mean of normal distribution also obeys normal distribution according to the mathematical statistical theory. Thus, the mean

{\bar{z}}_{i}

of each group follows

{\bar{z}}_{i} ∽ N (μ_{i}, \frac{σ_{b o a r d}^{2}}{L_{m}})

(11)

Let

σ_{i}^{2} = \frac{σ_{b o a r d}^{2}}{L_{m}}

; thus,

{\bar{z}}_{i} ∽ N (μ_{i}, σ_{i}^{2})

.

For m groups, an unbiased estimate of

σ_{i}^{2}

is obtained by

σ_{i}^{2} = \frac{1}{m - 1} \sum_{k = 1}^{m} {({\bar{z}}_{i} (k) - {\bar{z}}_{b o a r d})}^{2} = \frac{1}{m - 1} \sum_{k = 1}^{m} {(\frac{1}{L_{m} (k)} \sum_{i = 1}^{L_{m} (k)} (z_{i} - {\bar{z}}_{b o a r d})}^{2})

(12)

Furthermore, the

σ_{b o a r d}^{2}

of a whole is obtained according to

σ_{b o a r d}^{2} = L_{m} σ_{i}^{2} = \frac{L_{m}}{(m - 1) L_{m}} \sum_{i = 1}^{m} σ_{m}^{2} (i) = \frac{1}{m - 1} \sum_{i = 1}^{m} σ_{m}^{2} (i)

(13)

As a result, the proof is completed.

The lemma shows that the performance of the board can be obtained through the combination of different groups. For a board tunnel, this means the total probability of healthy model can be combined with different input signals.

3. The Proposed Approach

3.1. Probability Neural Network

The probability neural network (PNN) that was proposed by D.F Specht in 1990 is a kind of statistical neural network model based on a Bayesian minimum risk criterion [24]. It consists of four layers, including the input layer, the pattern layer, the summation layer, and the output layer. The input layer is responsible for transmitting the feature vector into the network. The pattern layer takes full connection directly from the input layer through the connection weight. The pattern layer reflects the spatial distribution of the samples, in which each sample works in a limited local space, and the whole space constitutes a distributed probability distribution with a sample combination. This structure accurately reflects the probability distribution of the sample in the whole space. It is usually trained with supervised learning based on training samples and the responding patterns. The distance between the input eigenvector and the trained pattern is used to activate the Gaussian function of the pattern layer. The summation layer is responsible for connecting the outputs of the pattern layer and the schema units of each class through the score probability. Finally, the output layer outputs the category with the highest total score of schema units of each class in the summation layer. In PNN, the probability density

p (x | w_{i})

is expressed by a radial basis function:

p (x | w_{i}) = \frac{1}{N_{i}} \sum_{k = 1}^{N_{i}} \frac{1}{2 π^{\frac{l}{2}} σ^{l}} \exp (- \frac{{|x - x_{i k}|}^{2}}{2 σ^{2}})

(14)

where

x_{i k}

,

N_{i}

,

σ

, and

l

are the sample center, the smoothness factor, the hyper-parameters, and the coefficient, respectively. The discriminant function

g_{i} (x)

is

g_{i} (x) = \frac{p (w_{i})}{N_{i}} \sum_{k = 1}^{N_{i}} \exp (- \frac{{|x - x_{i k}|}^{2}}{2 σ^{2}}) = \frac{p (w_{i})}{N_{i}} \sum_{k = 1}^{N_{i}} \exp (- \frac{x^{T} x_{i k} - 1}{σ^{2}})

(15)

where

p (w_{i})

is the probability of

w_{i}

occurrence.

Additionally, the discrimination rule is

if g_{i} (x) > g_{j} (x) \forall i \neq j, then x \in w_{i}

(16)

3.2. The Construction of Critical Faulty Data

The PNN distinguishes the category of input data based on the established relationship of the train examples and the category belonged to. Different from the weights principle of direct mapping between input and output, the PNN adopts computing the proximity to the different sample data and judges the category according to a posterior probability. In principle, as long as there are faulty data samples and health data samples, the new data will be classified in healthy states and faulty states, except for an occurrence of posteriori probability of just 50%. However, the fault samples are in fact in a large range that affects the accuracy as a criterion. The schematic diagram of critical faulty data construction is shown in Figure 3.

In Figure 3, the square represents the entire set of healthy and faulty states, which is classed as section I (health), section II (vagueness), and section III (fault). The A and the B are the observed sets that build the health data samples. The F1 and the F2 are the faulty data samples. Additionally, the T1 and the T2 are the test sets. For a healthy dataset T1, it is prone to find an observed health set A that is close to T1. However, for a faulty dataset T2, if one randomly selects the faulty dataset F1 as faulty data samples, it will produce the incorrect result that the T2 is health because the distance from T2 to B is closer than that from T2 to F1. If the position of F1 moves to the position of F2 that belongs to section III but is close to section II, the previous mistakes will be avoided. Thus, the fault samples at the edge of vagueness and fault are called critical faulty data. Although the critical faulty data cannot distinguish the dataset of all sections (for example, the M of section II), they can solve the judgment problem for the most of the datasets.

However, the board channel has almost no historical faulty data to be used because the board channel is prohibited from working with faults. This makes it impossible to find the critical faulty data by analyzing historical data. To produce the examples of critical faulty data from the healthy data, the null matrix is introduced as a vertical cross mode of the healthy state and the critical faulty data. The null matrix N of a non-full rank matrix

X

is defined if there is a matrix

N

that satisfies

X N

= 0 and

N N

= I [25].

According to the definition of the null matrix, for

x_{i}

being a sampling vector of healthy data, there is

N_{i} x_{i} = 0

(17)

where

N_{i}

is the corresponding null matrix.

For another sampling vector

x_{j}

(

x_{j} \neq x_{i}

), there is

N_{i} x_{j} = b_{i j}

(18)

where

b_{i j}

is the deviation of

x_{j}

under the action of null matrix

N_{i}

.

Compute the deviation

b_{k l}

of all samples

x_{l} (l = 1 \dots n)

and null matrix

N_{k} (k = 1 \dots n)

according to

b_{k l} = N_{k} x_{l} (k = 1. n; L = 1 \dots N)

(19)

Take

b = m a x \{b_{k l}, k = 1.. n; l = 1 \dots n\}

and obtain the corresponding null matrix

N

for all healthy data, and inequality (20) is satisfied:

N x \leq b

(20)

The corresponding equation reflects the critical state of fault and health:

N x = b

(21)

Move the left of Formula (21) to the right and replace I with

N N^{- 1}

:

N x - N N^{- 1} b = 0

(22)

where

N^{- 1}

is a pseudo inverse of

N

.

We obtain

N (x - N^{- 1} b) = 0

(23)

Let

\overset{´}{x} = x - N^{- 1} b

(24)

and

\overset{´}{x}

is the critical faulty data.

3.3. The Structure and Workflow of Proposed Approach

The proposed method is made up with four parts, including the data acquisition, the data processing, the probability neural network, and the diagnostic output. The excitation source acts on the board channel with multiple groups of different kinds of signals in order to expand the detection scope as much as possible. The error time series is built from the excitation signal and the converted data by a technique of OLE for process control (OPC) [26]. Then, it is transformed to a Hankel matrix by a sliding window in order to adapt to the PNN training. The diagnostic result is output by the PNN. The structure is shown in Figure 4.

The workflow is described as follows:

Step 1: Record the signal generator and use OPC to obtain the internal memory data of the board tunnel. Thus, the error time series

{\{z_{k}\}}_{k = 1}^{j}

combined with different input signals is formed according to Formula (1).

Step 2: Suppose the length of the sliding window is

T

, and construct the Hankel matrix

H_{L}

with depth

L

(usually

L ≫ T

):

H_{L} = {[\begin{matrix} \begin{matrix} z_{l} & \begin{matrix} z_{l + 1} & \dots & z_{l + T - L} \end{matrix} \end{matrix} \\ \begin{matrix} z_{l + 1} & \begin{matrix} z_{l + 2} & \dots & z_{l + 1 + T - L} \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} ⋮ & \begin{matrix} ⋮ & ⋱ & ⋮ \end{matrix} \end{matrix} \\ \begin{matrix} z_{l + L - 1} & \begin{matrix} z_{l + T} & \dots & z_{l + T - 1} \end{matrix} \end{matrix} \end{matrix} \end{matrix}]}_{L \times T}

(25)

Step 3: The critical faulty dataset H_LN is constructed according to Formula (24) of 3.2:

H_{L N} = {[\begin{matrix} \begin{matrix} {\overset{´}{z}}_{l} & \begin{matrix} {\overset{´}{z}}_{l + 1} & \dots & {\overset{´}{z}}_{l + T - L} \end{matrix} \end{matrix} \\ \begin{matrix} {\overset{´}{z}}_{l + 1} & \begin{matrix} {\overset{´}{z}}_{l + 2} & \dots & {\overset{´}{z}}_{l + 1 + T - L} \end{matrix} \end{matrix} \\ \begin{matrix} \begin{matrix} ⋮ & \begin{matrix} ⋮ & ⋱ & ⋮ \end{matrix} \end{matrix} \\ \begin{matrix} {\overset{´}{z}}_{l + L - 1} & \begin{matrix} {\overset{´}{z}}_{l + T} & \dots & {\overset{´}{z}}_{l + T - 1} \end{matrix} \end{matrix} \end{matrix} \end{matrix}]}_{L \times T}

(26)

Step 4: Construct the sample matrix of the PNN by using input H:

H = {[H_{L} H_{L N}]}_{L \times 2 T} = {[\begin{matrix} z_{l} & z_{l + 1} & \dots & z_{l + T - L} & {\overset{´}{z}}_{l} & {\overset{´}{z}}_{l + 1} & \dots & {\overset{´}{z}}_{l + T - L} \\ z_{l + 1} & z_{l + 2} & \dots & z_{l + 1 + T - L} & {\overset{´}{z}}_{l + 1} & {\overset{´}{z}}_{l + 2} & \dots & {\overset{´}{z}}_{l + 1 + T - L} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ z_{l + L - 1} & z_{l + T} & \dots & z_{l + T - 1} & {\overset{´}{z}}_{l + L - 1} & {\overset{´}{z}}_{l + T} & \dots & {\overset{´}{z}}_{l + T - 1} \end{matrix}]}_{L \times 2 T}

(27)

Moreover, the corresponding category is [0 1], where 0 and 1 represent the healthy states and the faulty states, respectively.

Step 5: Build the PNN by following three rules: (1) the number of input layers is the length of the sliding window

(T)

; (2) the number of neurons in the mode layer is the number of input sample vectors

(L)

; and (3) the summation layer is of class 2, which represents health and fault.

Step 6: The test sequence

{\{T_{k}\}}_{k = 1}^{j}

is converted to the input sample matrix

D

by normalizing the Hankel matrix, denoted as

D = Norm {[\begin{matrix} T_{l} & T_{l + 1} & \dots & T_{l + T - L} & {\overset{´}{T}}_{l} & {\overset{´}{T}}_{l + 1} & \dots & {\overset{´}{T}}_{l + T - L} \\ T_{l + 1} & T_{l + 2} & \dots & T_{l + 1 + T - L} & {\overset{´}{T}}_{l + 1} & {\overset{´}{T}}_{l + 2} & \dots & {\overset{´}{T}}_{l + 1 + T - L} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ T_{l + L - 1} & T_{l + T} & \dots & T_{l + T - 1} & {\overset{´}{T}}_{l + L - 1} & {\overset{´}{T}}_{l + T} & \dots & {\overset{´}{T}}_{l + T - 1} \end{matrix}]}_{L \times 2 T} = {[\begin{matrix} D_{11} & D_{12} & \dots & D_{1, 2 T} \\ D_{21} & D_{22} & \dots & D_{2, 2 T} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ D_{q 1} & D_{q 2} & \dots & D_{q, 2 T} \end{matrix}]}_{q \times 2 T}

(28)

The sample reference C is obtained by row normalizing the train of input matrix H

C = Norm {[\begin{matrix} Z_{l} & Z_{l + 1} & \dots & Z_{l + T - L} & {\overset{´}{Z}}_{l} & {\overset{´}{Z}}_{l + 1} & \dots & {\overset{´}{Z}}_{l + T - L} \\ Z_{l + 1} & Z_{l + 2} & \dots & Z_{l + 1 + T - L} & {\overset{´}{Z}}_{l + 1} & {\overset{´}{Z}}_{l + 2} & \dots & {\overset{´}{Z}}_{l + 1 + T - L} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ Z_{l + L - 1} & Z_{l + T} & \dots & Z_{l + T - 1} & {\overset{´}{Z}}_{l + L - 1} & {\overset{´}{Z}}_{l + T} & \dots & {\overset{´}{Z}}_{l + T - 1} \end{matrix}]}_{L \times 2 T} = {[\begin{matrix} C_{11} & C_{12} & \dots & C_{1, 2 T} \\ C_{21} & C_{22} & \dots & C_{2, 2 T} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ C_{l 1} & C_{l 2} & \dots & C_{l, 2 T} \end{matrix}]}_{L \times 2 T}

(29)

where the

Norm [∙]

is an operator of matrix row normalizing.

Step 7: Calculate the Euclidean distance between the input matrix

D

and the sample reference matrix

C

according to

E = {[\begin{matrix} \sqrt{\sum_{j = 1}^{2 T} {|D_{1 j} - C_{1 j}|}^{2}} & \sqrt{\sum_{j = 1}^{2 T} {|D_{1 j} - C_{2 j}|}^{2}} & \dots & \sqrt{\sum_{j = 1}^{2 T} {|D_{1 j} - C_{l j}|}^{2}} \\ \sqrt{\sum_{j = 1}^{2 T} {|D_{2 j} - C_{1 j}|}^{2}} & \sqrt{\sum_{j = 1}^{2 T} {|D_{2 j} - C_{2 j}|}^{2}} & \dots & \sqrt{\sum_{j = 1}^{2 T} {|D_{2 j} - C_{T j}|}^{2}} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ \sqrt{\sum_{j = 1}^{2 T} {|D_{q j} - C_{1 j}|}^{2}} & \sqrt{\sum_{j = 1}^{2 T} {|D_{q j} - S_{2 j}|}^{2}} & \dots & \sqrt{\sum_{j = 1}^{2 T} {|D_{q j} - S_{T j}|}^{2}} \end{matrix}]}_{q \times 2 T} = {[\begin{matrix} E_{11} & E_{12} & \dots & E_{1, 2 T} \\ E_{21} & E_{22} & \dots & E_{2, 2 T} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ E_{q 1} & E_{q 2} & \dots & E_{q, 2 T} \end{matrix}]}_{q \times 2 T}

(30)

Step 8: The initial probability matrix P is obtained by activating the Gaussian function of the pattern layer:

P = {[\begin{matrix} e^{- \frac{E_{11}}{2 σ^{2}}} & e^{- \frac{E_{12}}{2 σ^{2}}} & \dots & e^{- \frac{E_{1, T}}{2 σ^{2}}} \\ e^{- \frac{E_{21}}{2 σ^{2}}} & e^{- \frac{E_{22}}{2 σ^{2}}} & \dots & e^{- \frac{E_{2, T}}{2 σ^{2}}} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ e^{- \frac{E_{q 1}}{2 σ^{2}}} & e^{- \frac{E_{q 2}}{2 σ^{2}}} & \dots & e^{- \frac{E_{q, T}}{2 σ^{2}}} \end{matrix}]}_{q \times 2 T} = {[\begin{matrix} P_{11} & P_{12} & \dots & P_{1, T} \\ P_{21} & P_{22} & \dots & P_{2, T} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ P_{q 1} & P_{q 2} & \dots & P_{q, T} \end{matrix}]}_{q \times 2 T}

(31)

Step 9: The probability

S

that

q

samples belong to two categories (health and fault) is obtained according to Formula (29):

S = {[\begin{matrix} \sum_{j = 1}^{T} P_{1 j} & \sum_{j = T + 1}^{2 T} P_{1 j} \\ \sum_{j = 1}^{T} P_{2 j} & \sum_{j = T + 1}^{2 T} P_{2 j} \\ \dots & \dots \\ \sum_{j = 1}^{T} P_{q j} & \sum_{j = T + 1}^{2 T} P_{q j} \end{matrix}]}_{q \times 2} = {[\begin{matrix} S_{11} & S_{21} \\ S_{21} & S_{22} \\ \dots & \dots \\ S_{q 1} & S_{q 2} \end{matrix}]}_{q \times 2}

(32)

Step 10: The maximum probability of each row is taken as the category according to Bayesian decision theory.

4. Case Studies

The experimental platform is a distributed control system with an engineer station. Our goal was to test the performance of the board without any destruction. The input signal of the board tunnel for the test was imposed directly by another board tunnel because the board channel of the laboratory is not loaded. If the board channel is connected to the sensor signal, it will reform the input signal by adding a small series signal source (usually not more than 15% of the normal signal amplitude). This small series signal is used only to detect the performance of the board tunnel and is easily eliminated by software. The central control platform of the laboratory is shown in Figure 5.

There were five groups of healthy data with input signals of 5 V with additional pulse voltage, piecewise linear voltage, exponential voltage, thermal noise, and chirp signal. However, it was a challenge to construct any faults of the board tunnel because the board was not to be disassembled or damaged. Losing faults by changing internal states, there is only one possibility that uses the calibration function of the control system to change the AD converted reference signal. Two groups of faulty data were simulated by changing the AD converted reference signal with the addition of the stochastic disturbance and the periodic voltage, respectively. The cases of seven groups are shown in Table 1.

4.1. Change the Number of Intermediate Layers of PNN

Eight numbers of sliding window length from 100 to 20,000 were selected to detect the state of case5 to case 7. It repeated 1000 times per sliding window length. The training samples were combined with case1, case2, case3, and case4. For each test, the starting of sliding window was randomly selected from the error time series, and the Hankel depth was always kept at 10,000, just for simplicity. The results are shown in Table 2.

It is seen from Table 2 that the lengths of the sliding window have an effect on the detection results. Short lengths of 100 samples and 150 samples succeeded in detecting the healthy states but failed to find the faulty states by mistaking them for the healthy states. With the lengths expanding to 200 samples and 500 samples, the accuracy of fault detection increased by more than 99% (case6 and case7 achieve 99.9% and 99.2%, respectively), although the accuracy of the healthy states was reduced to 89.6% and 89.7%. When the length of the sliding window reached 2000 samples, the detection of healthy states and two faulty states could reach 100%. However, it is not that the longer the sliding window length is, the better the result is. When the window length reached 20,000 samples, the detection results were all healthy states regardless of healthy states or fault status. In other words, the faulty states could not be determined at sliding window lengths that were too long.

4.2. Effects of Different Groups of Health Data Combination as Sample Input

The combination of four groups of health data was selected as the training samples to detect the fifth group of healthy states and the other groups of two faulty states. The length of the sliding window was 2000 samples, and the depth of the Hankel was 10,000. A test was done according to the proposed method, and the results are shown in Table 3, which indicates an accuracy of 100%, whether healthy or faulty.

A change of combination from four groups to three groups of healthy data was used to test the effects of training samples from different group combinations. The length of the sliding window and the depth of the Hankel were kept unchanged. The results are shown in Table 4.

Table 4 shows that the most health and faults can be detected by combining three groups of health data as training examples via the proposed method. However, a few health cases are in states of poor accuracy because the training examples can partly cover the information of other health cases. This will be further confirmed by reducing the number of groups for training examples. In the cases of taking two groups as a combination of training examples, the situation is similar to before. Most health and faults can be detected correctly, but there are some incorrect detection results for healthy states. For example, taking case1 and case5 as training examples, the results of case2 and case4 are correct, but the results of case3 are all wrong in 1000 tests. These results are not listed here due to space limitations. By analyzing the above situations, we found that incorrect detection is related to some kinds of healthy data. It is due to the reason that the training data do not completely cover the characteristics of the test samples. We also notice that the detection results for faulty states are correct, which shows that the null matrix plays an important role. A conclusion is drawn that the feature coverage of training samples is more important than the number of groups.

4.3. Comparison with LDM

The classical linear discriminative method (LDM) was used to detect the fault of board channel. The 10,000 groups combined from the time series were selected as training samples whose length of the sliding window was 2000 samples, and a random 1000 groups of each case were tested for imitating the situation with known historical data. The results are shown in Table 5.

The 1000 groups of data from case7 were used be tested as unknown faulty data, and the results are shown in Table 6.

It is seen from Table 5 that for the labeled data, the LDM has a high accuracy of more than 99.3%, and it can be divided into more detailed categories. However, Table 6 shows that the accuracy of the LDM for a new fault is 70.3%, which is low. Compared with the LDM, the proposed approach, which is shown in Table 3, can achieve good results only by using healthy data.

5. Conclusions

At present, there is no practical method to detect the enclosed board tunnel except for returning it to the factory or an error code display. Failure to find the abnormal board brings a great potential threat to the control system of the plant. This paper proposes an approach for fault detection of an enclosed board channel by using a PNN based on an error time series excited by various external signals. The critical faulty data, contrary to the known healthy data, are constructed by using a null matrix with maximum projection and are labelled as training examples together with healthy data. This provides the mode criteria of PNN training. Thus, the problem of PNN lacking faulty data examples has been solved to some extent. The proposed approach is a data-driven method that can detect the abnormal state or fault of an enclosed board channel without knowing any internal circuit of the board channel. It only needs a small number of additional hardware devices and does not need any mechanism knowledge on the board channel, which greatly reduces the costs and the professional knowledge for staff. In the future, cases where the output probabilities of the health mode and fault mode are similar will be studied, which should improve the accuracy of the proposed approach in some special scenarios.

Author Contributions

Methodology, D.Z. and Z.G.; investigation, Z.L.; writing—original draft preparation, D.Z.; writing—review and editing, Z.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to acknowledge the research support from the School of Electrical and Information Engineering at Tianjin University, School of Electrical Engineering at Tianjin University of Technology, and the Faculty of Engineering and Environment at the University of Northumbria.

Conflicts of Interest

The authors declare no conflict of interest.

References

Roh, Y.; Heo, G.; Whang, S.E. A Survey on Data Collection for Machine Learning: A Big Data—AI Integration Perspective. IEEE Trans. Knowl. Data Eng. 2019, 33, 1328–1347. [Google Scholar] [CrossRef] [Green Version]
Binu, D.; Kariyappa, B. A survey on fault diagnosis of analog circuits: Taxonomy and state of the art. AEU Int. J. Electron. Commun. 2017, 73, 68–83. [Google Scholar] [CrossRef]
Industrial Automation Systems SIMATIC. Available online: https://new.siemens.com/global/en/products/automation/systems/industrial.html (accessed on 20 May 2022).
Honeywell. Available online: https://hps.honeywell.com.cn/product-and-service/control-monitoring-safety-sytems/integrated-control-and-safety-systems/experion-pks/ (accessed on 20 May 2022).
ABB. Available online: http://www.cechina.cn/Company/46258_138835/messagedetail.aspx (accessed on 20 May 2022).
Gao, Z.; Cecati, C.; Ding, S.X. A survey of fault diagnosis and fault-tolerant techniques—Part I: Fault diagnosis with model-Based and signal-based approaches. IEEE T Ind. Electron. 2015, 62, 3757–3767. [Google Scholar] [CrossRef] [Green Version]
Gao, Z.; Cecati, C.; Ding, S.X. A survey of fault diagnosis and fault-tolerant techniques—Part II: Fault diagnosis with knowledge-based and hybrid/active approaches. IEEE T Ind. Electron. 2015, 62, 3768–3774. [Google Scholar] [CrossRef] [Green Version]
Liang, J.; Du, R. Model-based Fault Detection and Diagnosis of HVAC systems using Support Vector Machine method. Int. J. Refrig. 2007, 30, 1104–1114. [Google Scholar] [CrossRef]
Gagnol, V.; Le, T.-P.; Ray, P. Modal identification of spindle-tool unit in high-speed machining. Mech. Syst. Signal Process. 2011, 25, 2388–2398. [Google Scholar] [CrossRef]
Zhang, D.; Lin, Z.; Gao, Z. A Novel Fault Detection with Minimizing the Noise-Signal Ratio Using Reinforcement Learning. Sensors 2018, 18, 3087. [Google Scholar] [CrossRef] [Green Version]
Suzuki, A.; Ohnishi, K. Frequency-Domain Damping Design for Time-Delayed Bilateral Teleoperation System Based on Modal Space Analysis. IEEE Trans. Ind. Electron. 2012, 60, 177–190. [Google Scholar] [CrossRef]
He, X.; Wang, Z.; Liu, Y.; Zhou, D.H. Least-Squares Fault Detection and Diagnosis for Networked Sensing Systems Using A Direct State Estimation Approach. IEEE Trans. Ind. Inform. 2013, 9, 1670–1679. [Google Scholar] [CrossRef]
Han, H.; Cui, X.; Fan, Y.; Qing, H. Least squares support vector machine (LS-SVM)-based chiller fault diagnosis using fault indicative features. Appl. Therm. Eng. 2019, 154, 540–547. [Google Scholar] [CrossRef]
Jiang, Z.; Molisch, A.F.; Caire, G.; Niu, Z. Achievable Rates of FDD Massive MIMO Systems With Spatial Channel Correlation. IEEE Trans. Wirel. Commun. 2015, 14, 2868–2882. [Google Scholar] [CrossRef] [Green Version]
Gao, Z.; Liu, X. An Overview on Fault Diagnosis, Prognosis and Resilient Control for Wind Turbine Systems. Processes 2021, 9, 300. [Google Scholar] [CrossRef]
Alauddin, M.; Khan, F.; Imtiaz, S.A.; Ahmed, S. A Bibliometric Review and Analysis of Data-Driven Fault Detection and Diagnosis Methods for Process Systems. Ind. Eng. Chem. Res. 2018, 57, 10719–10735. [Google Scholar] [CrossRef]
Chen, H.; Jiang, B.; Chen, W.; Yi, H. Data-driven Detection and Diagnosis of Incipient Faults in Electrical Drives of High-Speed Trains. IEEE Trans. Ind. Electron. 2019, 66, 4716–4725. [Google Scholar] [CrossRef]
Wu, H.; Zhao, J. Deep convolutional neural network model based chemical process fault diagnosis. Comput. Chem. Eng. 2018, 115, 185–197. [Google Scholar] [CrossRef]
Wang, Y.; Pan, Z.; Yuan, X.; Yang, C.; Gui, W. A novel deep learning based fault diagnosis approach for chemical process with extended deep belief network. ISA Trans. 2019, 96, 457–467. [Google Scholar] [CrossRef] [PubMed]
Gao, Z.; Saxen, H.; Gao, C. Data-driven approaches for complex industrial systems. IEEE T Ind. Inform. 2013, 9, 2210–2212. [Google Scholar] [CrossRef]
Guo, L.; Zhang, Y.-M.; Wang, H.; Fang, J.-C. Observer-Based Optimal Fault Detection and Diagnosis Using Conditional Probability Distributions. IEEE Trans. Signal Process. 2006, 54, 3712–3719. [Google Scholar] [CrossRef]
Amin, T.; Khan, F.; Imtiaz, S. Fault detection and pathway analysis using a dynamic Bayesian network. Chem. Eng. Sci. 2018, 195, 777–790. [Google Scholar] [CrossRef]
Ma, J.; Zhang, S.; Li, H.; Gao, F.; Jin, S. Sparse Bayesian Learning for the Time-Varying Massive MIMO Channels: Acquisition and Tracking. IEEE Trans. Commun. 2018, 67, 1925–1938. [Google Scholar] [CrossRef]
Specht, D.F. Probabilistic Neural Networks. Neural Netw. 1990, 3, 109–118. [Google Scholar] [CrossRef]
MathWorks. Available online: https://www.mathworks.com/help/matlab/ref/null.html (accessed on 20 May 2022).
OPC. Available online: https://opcfoundation.org/forum/opc-ua-standard/ (accessed on 20 May 2022).

Figure 1. The acquisition principle of error time series with a single input signal.

Figure 2. The error time series of different input signals under healthy status.

Figure 3. The schematic diagram of critical faulty data construction.

Figure 4. The structure of proposed approach.

Figure 5. The central control platform.

Table 1. The cases of 7 groups of signals.

No.	Symbols	Description
1	Case1	Input signal with additional pulse voltage of duty cycle 50% and frequency 20 Hz
2	Case2	Input signal with additional piecewise linear voltage of slope 0.5; amplitude: 0 to −2 V
3	Case3	Input signal with additional exponential voltage from 0 to 2 V in 5 s
4	Case4	Input signal with additional thermal noise of 1 MHz bandwidth
5	Case5	Input signal with additional chirp signal: initial frequency—0 Hz; final frequency—500 Hz; amplitude—1 V; delay—0.05 s
6	Case6	The reference signal with additional random noise (Fault1)
7	Case7	The reference signal with periodic voltage signal (Fault2)

Table 2. Effect of sliding window length on detection results.

No.	Length of Sliding Window	Case5		Case6		Case7
No.	Length of Sliding Window	Correct/Wrong (Times)	Accuracy	Correct/Wrong (Times)	Accuracy	Correct/Wrong (Times)	Accuracy
1	100	1000/0	100%	0/1000	0%	0/1000	0%
2	150	1000/0	100%	0/1000	0%	0/1000	0%
3	200	896/104	89.6%	999/1	99.9%	992/8	99.2%
4	500	897/103	89.7%	998/2	99.8%	971/29	97.1%
5	1000	922/78	92.2%	1000/0	100%	1000/0	100%
6	1500	905/95	90.5%	1000/0	100%	1000/0	100%
7	2000	1000/0	100%	1000/0	100%	1000/0	100%
8	20,000	1000/0	100%	0/1000	0%	0/1000	0%

Table 3. The combination of four groups of health data.

No.	Training Examples	Test	Correct (Times)	Accuracy
1	Case1/Case2/Case3/Case4	Case5	1000	100%
		Case6	1000	100%
		Case7	1000	100%
2	Case1/Case2/Case3/Case5	Case4	1000	100%
		Case6	1000	100%
		Case7	1000	100%
3	Case1/Case2/Case4/Case5	Case3	1000	100%
		Case6	1000	100%
		Case7	1000	100%
4	Case1/Case3/Case4/Case5	Case2	1000	100%
		Case6	1000	100%
		Case7	1000	100%
5	Case2/Case3/Case4/Case5	Case1	1000	100%
		Case6	1000	100%
		Case7	1000	100%

Table 4. The combination of three groups of health data.

No.	Training Examples	Test	Correct (Times)	Wrong (Times)	Accuracy
1	Case1/Case2/Case3	Case4	1000	0	100%
		Case5	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
2	Case1/Case2/Case4	Case3	1000	0	100%
		Case5	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
3	Case1/Case2/Case5	Case3	1000	0	100%
		Case4	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
4	Case1/Case3/Case4	Case2	687	314	68.7%
		Case5	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
5	Case1/Case3/Case5	Case2	680	320	68%
		Case4	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
6	Case1/Case4/Case5	Case2	667	333	66.7%
		Case3	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
7	Case2/Case3/Case4	Case1	879	121	87.9%
		Case5	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
8	Case2/Case3/Case5	Case1	792	208	79.2%
		Case4	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
9	Case2/Case4/Case5	Case1	811	189	81.1%
		Case3	1000	0	100%
		Case6	1000	0	100%
		Case7	1000	0	100%
10	Case3/Case4/Case5	Case1	184	816	18.4%
		Case2	762	238	76.2%
		Case6	998	2	99.8%
		Case7	1000	0	100%

Table 5. The results of CNN for labeled data.

	Case1	Case2	Case3	Case4	Case5	Case6
Correct	993	1000	1000	1000	997	1000
Incorrect	7	0	0	0	13	0
Accuracy	99.3%	100%	100%	100%	99.7%	100%

Table 6. The results of CNN for unlabeled data.

	Health					Fault
	Case1	Case2	Case3	Case4	Case5	Case6
Case7	164	133	0	0	0	703
Case7	Total: 297

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, D.; Lin, Z.; Gao, Z. Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix. Sensors 2022, 22, 5559. https://doi.org/10.3390/s22155559

AMA Style

Zhang D, Lin Z, Gao Z. Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix. Sensors. 2022; 22(15):5559. https://doi.org/10.3390/s22155559

Chicago/Turabian Style

Zhang, Dapeng, Zhiling Lin, and Zhiwei Gao. 2022. "Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix" Sensors 22, no. 15: 5559. https://doi.org/10.3390/s22155559

APA Style

Zhang, D., Lin, Z., & Gao, Z. (2022). Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix. Sensors, 22(15), 5559. https://doi.org/10.3390/s22155559

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Detecting Enclosed Board Channel of Data Acquisition System Using Probabilistic Neural Network with Null Matrix

Abstract

1. Introduction

2. The Error Time Series of Board Tunnel

3. The Proposed Approach

3.1. Probability Neural Network

3.2. The Construction of Critical Faulty Data

3.3. The Structure and Workflow of Proposed Approach

4. Case Studies

4.1. Change the Number of Intermediate Layers of PNN

4.2. Effects of Different Groups of Health Data Combination as Sample Input

4.3. Comparison with LDM

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI