Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks

Liao, Run-Fa; Wen, Hong; Wu, Jinsong; Pan, Fei; Xu, Aidong; Jiang, Yixin; Xie, Feiyi; Cao, Minggui

doi:10.3390/s19112440

Open AccessArticle

Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks

by

Run-Fa Liao

¹,

Hong Wen

^2,*

,

Jinsong Wu

^3,*

,

Fei Pan

¹,

Aidong Xu

⁴,

Yixin Jiang

⁴,

Feiyi Xie

¹ and

Minggui Cao

¹

National Key Laboratory of Science and Technology on Communications, University of Electronic Science and Technology of China, Chengdu 611731, China

²

School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu 611731, China

³

Department of Electrical Engineering, Universidad de Chile, Santiago 8370451, Chile

⁴

EPRI, China Southern Power Grid Co., Ltd., Guangzhou 510080, China

^*

Authors to whom correspondence should be addressed.

Sensors 2019, 19(11), 2440; https://doi.org/10.3390/s19112440

Submission received: 18 April 2019 / Revised: 18 May 2019 / Accepted: 27 May 2019 / Published: 28 May 2019

(This article belongs to the Special Issue Green, Energy-Efficient and Sustainable Networks)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, a deep learning (DL)-based physical (PHY) layer authentication framework is proposed to enhance the security of industrial wireless sensor networks (IWSNs). Three algorithms, the deep neural network (DNN)-based sensor nodes’ authentication method, the convolutional neural network (CNN)-based sensor nodes’ authentication method, and the convolution preprocessing neural network (CPNN)-based sensor nodes’ authentication method, have been adopted to implement the PHY-layer authentication in IWSNs. Among them, the improved CPNN-based algorithm requires few computing resources and has extremely low latency, which enable a lightweight multi-node PHY-layer authentication. The adaptive moment estimation (Adam) accelerated gradient algorithm and minibatch skill are used to accelerate the training of the neural networks. Simulations are performed to evaluate the performance of each algorithm and a brief analysis of the application scenarios for each algorithm is discussed. Moreover, the experiments have been performed with universal software radio peripherals (USRPs) to evaluate the authentication performance of the proposed algorithms. Due to the trainings being performed on the edge sides, the proposed method can implement a lightweight authentication for the sensor nodes under the edge computing (EC) system in IWSNs.

Keywords:

PHY-layer; light-weight authentication; neural network; WSN; industrial

Graphical Abstract

1. Introduction

With the development of Industry 4.0, wireless sensor networks (WSNs) have great application prospects for industrial scenarios due to their advantages over traditional wired networks [1,2,3,4]. However, fully-automated mechanized operations and the wireless communication environments make the industrial wireless sensor networks (IWSNs) have stronger requirements for high security and low latency [5]. M.Luvisotto et al. [6] mentioned that the response delay in IWSNs should be in milliseconds. Moreover, under the edge computing (EC) system in IWSNs, some sensor nodes are in some completely security-free environments because there are no redundant computing resources and transmission resources. Therefore, lightweight authentication is urgently needed to enhance the security of IWSNs while ensuring low latency. The encrypted methods [7,8] are too heavy to support the nodes due to complex computing. I. Bhardwaj et al. [9] did some lightweight processing on the password, but their method still cannot meet some specific requirements. Some other researchers proposed a fast cross-authentication scheme that combines non-cryptographic and cryptographic algorithms to solve the security and latency issues [10]. In addition, the heterogeneous nature of the IWSNs makes traditional encryption-based authentication methods more complex to implement or manage. However, physical (PHY) layer methods provide some new approaches to protect the lightweightIWSNs. The high authentication rate and low cost are especially valued for such applications. By introducing deep learning (DL) into the PHY-layer authentication method, under the EC system, the training is performed under the edge devices, and the sensor nodes almost do not bear any extra costs.

D. Christin et al. [1] surveyed related WSN technologies dedicated to industrial automation from the aspects of security and quality of service (QoS). The work in [4] presented a QoS framework for IWSNs guaranteeing the delay bound and the target reliability. N. Neshenko et al. [11] surveyed the challenges and research problems in the Internet of Things (IoT) including intrusion detection systems, threat modeling, and emerging technologies. However, the papers mentioned above only address the security and reliability issues from the perspective of the system architecture or simply give a direction for future research. L. Xiao et al. [12] proposed a method to enhance the security of underwater sensor networks exploiting the power delay profile of the underwater acoustic channel to discriminate the sensors. The article [13] presented a two-factor user authentication protocol using the hash function that protects against other attacks in wireless sensor networks, with the exception of denial of service (DoS) and node attacks. However, the traditional security methods have relatively large requirements on computing resources and communication resources, which cannot meet the requirement of low latency.

PHY-layer authentication can achieve lightweight authentication and effectively address the tradeoff between the security and low latency requirement of the wireless sensor networks in industrial scenarios. The PHY-layer authentication methods can distinguish the legitimate sensor nodes and illegal ones by physical layer channel information, such as channel state information (CSI) [14,15,16,17], received signal strength indicator (RSSI) [18,19,20], received signal strength (RSS) [21], and the radio frequency (RF) fingerprint [22,23]. However, the PHY-layer authentication methods mentioned above based on the hypothesis test are mostly compared with a threshold to distinguish users, which makes it difficult to discriminate multi-nodes at the same time. Authenticating multi-nodes simultaneously is a multi-classification problem, which needs to be solved urgently.

Deep learning has a large number of applications, such as computer vision, image classification, pattern recognition [24,25,26], and so on. There are considerable research works using deep learning in wireless communications, such as in channel estimation and channel prediction. P. Illy et al. used machine learning to enhance the security of edge computing by implementing intrusion detection [27]. The paper [28] used the deep neural network to estimate the CSIs in orthogonal frequency division multiplexing (OFDM) systems. The work in [29] proposed a Raleigh fading channel prediction scheme with a deep learning method. N. Wang et al. [30] proposed a physical-layer authentication scheme based on extreme learning machine to detect spoofing attack. The DL-based PHY-layer authentication methods proposed in this paper can achieve multi-user authentication in a short time.

Unlike the traditional test-threshold-based PHY-layer authentication, the DL-based PHY-layer authentication methods can distinguish multiple sensor nodes simultaneously and maintain excellent performance. In the EC system, multi-sensor nodes need to be authenticated simultaneously, which is suitable for using the DL-based methods. The DL-based authentication methods are usually divided into the offline training phase and online authentication phase. The PHY-layer authentication framework we proposed in this paper also includes an online retraining process. In summary, the DL-based sensor nodes’ authentication algorithms proposed in this paper, utilizing the spatial diversity of wireless channels, can discriminate the sensor nodes without the test thresholds and have more practical application values. The main contributions of our work can be summarized as follows:

We propose a DL-based PHY-layer authentication framework to enhance the security of industrial sensor networks. We also briefly explore the applications of the framework for practical industrial scenarios.
Three different algorithms are adopted to implement the PHY-layer authentication in IWSNs, including the deep neural network (DNN)-based sensor nodes’ authentication method, the convolutional neural network (CNN)-based sensor nodes’ authentication method, and the convolution preprocessing neural network (CPNN)-based sensor nodes’ authentication method.
Simulation results show that the proposed algorithms can achieve better performance. In addition, the experiments in the engineering center with USRPs validate their utility in practical industrial environments.

The rest of this paper is organized as follows. We present the preliminaries and system model in Section 2 and Section 3, respectively. The DL-based PHY-layer authentication method in industrial wireless sensor networks is proposed in Section 4. We provide numerical experiments in Section 5. The experiment in a practical environment and conclusions are presented in Section 6 and Section 7, respectively.

The symbols used in this article are briefly described as follows. Uppercase bold letters are used for the matrix (e.g.,

H

,

W

) and lowercase bold letters for vectors (e.g.,

x

,

y

). The elements are represented by the letters with subscripts and not bold (e.g.,

x_{i}

,

ω_{1 i}

).

2. Preliminaries

2.1. Channel State Information

Due to the inherent characteristics of the wireless channels, the transmitted signals may experience a series of attenuations, such as, multipath effects, fading, shadowing, and delay distortion. The channel state information (CSI) provides us the channel variations experienced during propagations. In wireless communications, CSI represents the channel properties of a communication link. The CSI needs to be estimated by the receiver to detect the transmitted signals.

In the wireless fading channel, the system is modeled as:

y = H x + n,

(1)

where

y

and

x

represents the receive and transmit signal, respectively.

H

denotes the channel matrix, which is the CSI we mentioned above.

n

denotes the additive white Gaussian noise vector, which follows a complex standard normal distribution.

n \sim C N (0, σ_{0})

, where the mean value is zero and the noise covariance matrix

σ_{0}

is known.

H

represents the channel’s frequency response, which can be estimated by

y

and

x

in the receiving end.

2.2. Deep Neural Network

Generally speaking, DNN is a deeper version of the artificial neural network (ANN) through increasing the number of hidden layers in order to enhance the ability in representation or classification. As shown in Figure 1, it is a typical deep neural network with an input layer, multiple hidden layers, and an output layer. Each layer has a large number of neurons. The input of each neuron is the output of the upper neuron multiplied by the corresponding coefficient, and the output of each neuron is the input activated by activation functions. For example, the output of the first neuron in the first hidden layer is:

z_{1}^{1} = f_{a} (\sum_{i} ω_{1 i} x_{i} + ξ_{1}),

(2)

where

ω_{1 i}

denotes the weight coefficient of links

z_{1}^{1}

and

x_{i}

.

ξ_{1}

denotes the threshold coefficient of

z_{1}^{1}

.

f_{a} (\cdot)

represents the activation function. Common activation functions are the sigmoid function, the rectified linear unit (ReLU) function, and the soft-max function, defined as

f_{s i g m o i d} (x) = \frac{1}{1 - e^{- x}}

,

f_{R e L U} (x) = max (0, x)

,

f_{s o f t m a x} (x) = \frac{e^{x}}{{∥e^{x}∥}_{1}}

, respectively, where

x

is a vector and

{∥\cdot∥}_{1}

denotes the

ℓ_{1}

-norm. Usually, the hidden layer and output layer use the ReLU function and the soft-max function, respectively. The output of

l^{th}

layer is given by:

z^{l} = f_{a}^{l} (W^{l} \cdot z^{l} + ξ^{l}) .

(3)

We use the

Ψ^{l} (\cdot)

to represent the operation of each layer of neurons. Then, we have the output of the deep neural network,

\hat{y} = Ψ (W, Ξ) = Ψ^{L} (Ψ^{L - 1} (\dots Ψ^{1} (x))) .

(4)

The application of the neural network is executed in two steps, a training phase and an identification phase. When in the training phase, the input data (i.e., CSI) of the input layer and the corresponding label

y

are known. Then, we train the parameters

W

and

ξ

by minimizing the cost function

L

by the gradient descent method, which is formulated as:

\hat{W}, \hat{ξ} = arg min_{W, ξ} (L),

(5)

where

L

represents the value of the loss function. The loss function usually uses a mean squared error function or a cross entropy function, which is given by:

L_{m e a n - s q u a r e} = {∥y - \hat{y}∥}_{2}^{2},

(6)

or:

L_{c r o s s - e n t r o p y} = y^{T} \cdot log (\hat{y}),

(7)

where

{(\cdot)}^{T}

denotes the transpose of the matrix or vector.

In the identification phase, the label of the input data (i.e., CSI) is unknown. By inputting CSI to the neural network, its corresponding output

\hat{y}

will be used to identify and classify the input CSI.

2.3. Convolutional Neural Network

The convolutional neural network (CNN) is part of the feedforward neural network with convolutional computation and a deep structure [11]. CNN includes convolutional layers, pooling layers, and fully-connected layers compared with ordinary neural network. The convolutional layer computes multiple convolutions in parallel to produce a set of linear activation responses. Further, the convolution operation can effectively extract features form the original signal (e.g., CSI). The output of the convolutional layer is given by:

Z^{l} = f_{a}^{l} (Z^{l - 1} \otimes W^{l} + ξ^{l}),

(8)

where

Z^{l}

denotes the output of the

l^{th}

layer and

W^{l}

and

ξ^{l}

denote the convolution kernel and threshold in the

l^{th}

layer, respectively. ⊗ represents the convolution operation.

f_{a}^{l} (\cdot)

denotes the activation function in CNN, often using the ReLU function.

Following the convolutional layer is the pooling layer, which effectively reduces the data dimension without losing valid information. The pooling function replaces the output of the network at that location using the overall statistical characteristics of the adjacent outputs at a location; for example, the maximum value in the adjacent rectangular region. Other commonly-used pooling functions include the average value in an adjacent rectangular region, the

ℓ_{2}

norm, and the weighted average in adjacent regions. The main goal is to reduce the dimension or the resolution of feature maps. The pooling operation, which is a subsampling, can facilitate the extraction of high-level features.

The fully-connected layer of CNN is more like a hidden layer in DNN. There can be one fully-connected layer or multiple in CNN. We convert CSI into a matrix and use different colors to represent different values. As shown in Figure 2, it is a typical convolutional neural network with two convolutional layers, two pooled layers, and one fully-connected layer. We can see that the CSI converts to a matrix of 32 by 32 in size. The size of the convolution kernel in the first convolutional layer is four by four. After the convolution and activation, the average pooling operation is performed with a kernel of four by four in size. Then, there is another convolution, activation, and pooling operation. The final two layers are the fully-connected layer and the output layer activated with soft-max. The output of CNN can be formulated as:

\hat{y} = Υ (w, ξ) = Υ^{L} (Υ^{L - 1} (\dots Υ^{1} (X))) .

(9)

Like DNN, CNN is also executed in two steps, a training phase and an identification phase. During the training phase, the input data (i.e., CSI) and corresponding labels

y

will be used to train the parameters

w

and

ξ

in CNN, which is formulated as:

\hat{w}, \hat{ξ} = arg min_{w, ξ} (L),

(10)

where

L

denotes the value of the loss function in CNN. In the identification phase, the well-trained CNN will be used to perform the PHY-layer authentication.

3. System Model

We propose a DL-based PHY-layer authentication for an industrial wireless sensor network that can resist the spoofing attack. The methods we propose can enhance the security of the industrial wireless network without sacrificing communication resources. As shown in Figure 3, we placed many sensor nodes in the different locations of the industrial scene. The wireless sensor nodes send the pilot to the base station (BS) with time division duplexing (TDD) mode. First of all, each node needs to be identified by the upper layer authentication to facilitate labeling the corresponding CSI. In the initialization phase, we trained our neural networks through the training data (i.e., CSIs) and corresponding labels. Then, we authenticated the legitimate and illegal sensor nodes with newly-estimated CSI in the authentication phase. In the retraining phase, we updated the CSIs’ training set with the new channel information of certified sensor nodes and retrained the neural network for the next authentication. The authentication processing of the industrial wireless sensor network is shown in Figure 4.

The DL-based PHY-layer authentication we propose can dynamically adjust system parameters over time. It can further improve the accuracy of authentication and has higher practicality.

4. Deep Learning-Based Sensor Nodes’ Authentication Algorithms

In our previous work, we briefly introduced the physical layer channel authentication based on CNN [31]. This paper will further improve the CNN algorithm and propose a rapid-DNN-based PHY-layer authentication algorithm to meet the low latency requirements of industrial wireless sensor networks.

4.1. DNN-Based Sensor Nodes’ Authentication

The DNN-based PHY-layer authentication in industrial wireless sensor networks uses the DNN to implement sensor nodes’ authentication. In the initialization phase, the base station collects channel state information of each sensor node and performs the corresponding labeling according to the upper layer protocol authentication (e.g., EAP, AKA). The DNN was trained by the collected information to obtain the initial neural network parameters. In the authentication phase, the CSI of the unknown sensor node will be authenticated by the well-trained DNN in the initialization phase. After the new CSI has been authenticated, the dataset will be trained again for the next authentication.

Algorithm 1 DNN-based sensor nodes’ authentication.

Input: The

i^{th}

CSI to authenticate

x^{(i)}

Output: The label of unknown CSI

{\hat{y}}^{(i)}

, the new weight matrix

W^{†}

, and threshold vector

ξ^{†}

of DNN

1: Initialize all connection weights

W_{0}^{†}

, and thresholds

ξ_{0}^{†}

in the network will be obtained through the training of DNN, using the pre-acquired dataset

D^{†} = {\{(x_{k}, y_{k})\}}_{k = 1}^{m}

;

2: Compute

{\hat{y}}^{(i)}

by well-trained DNN;

3: Update the training set

D^{†} = {\{(x_{k}, y_{k})\}}_{k = 1}^{m}

by

(x^{(i)}, {\hat{y}}^{(i)})

;

4: Retrain the DNN by the new dataset and get new weight matrix

W^{†}

and threshold vector

ξ^{†}

;

5: Return

{\hat{y}}^{(i)}

,

W^{†}

,

ξ^{†}

.

4.2. CNN-Based Sensor Nodes’ Authentication

The CNN-based sensor nodes’ authentication method is more like the DNN-based sensor nodes’ authentication. They have the same steps except that the authenticated neural network changes from DNN to CNN. In the initialization phase, the CNN will be trained by the pre-acquired dataset of different sensor nodes. Then, the

i^{th}

CSI will be authenticated by the well-trained CNN. At last, the CNN will be retrained after the dataset is updated.

Algorithm 2 CNN-based sensor nodes’ authentication.

Input: The

i^{th}

CSI to authenticate

x^{(i)}

Output: The label of unknown CSI

{\hat{y}}^{(i)}

, the new weight matrix

W^{◊}

, and threshold vector

ξ^{◊}

of CNN

1: Initialize all connection weights

W_{0}^{◊}

, and thresholds

ξ_{0}^{◊}

in the network will be obtained through the training of CNN, using the pre-acquired dataset

D^{◊} = {\{(x_{k}, y_{k})\}}_{k = 1}^{m}

;

2: Compute

{\hat{y}}^{(i)}

by the well-trained CNN;

3: Update the training set

D^{◊} = {\{(x_{k}, y_{k})\}}_{k = 1}^{m}

by

(x^{(i)}, {\hat{y}}^{(i)})

;

4: Retrain the CNN by the new dataset and get new weight matrix

W^{◊}

and threshold vector

ξ^{◊}

;

5: Return

{\hat{y}}^{(i)}

,

W^{◊}

,

ξ^{◊}

.

4.3. Convolution Pre-Processing Neural Network-based Sensor Nodes’ Authentication

The convolution pre-processing neural network-based sensor nodes’ authentication method we propose in this paper has shorter training time and higher authentication accuracy. The core idea is to perform offline convolution preprocessing on the CSIs before training the neural network. The convolution preprocessing can effectively reduce the data dimension and extract the feature information of the CSIs, while the convolution kernels are trained by pre-obtained CSIs and corresponding labels. After convolution, activation, and pooling by the convolution kernels, the CSIs

x_{k}

become

{\bar{x}}_{k}

. The latter’s dimensions are much smaller than the former’s. For the CPNN-based sensor node authentication method, the convolution kernels

V^{⊥}

need to be calculated by the pre-obtained CSIs. Then, the neural network is trained by the new dataset

D^{⊥} = {\{({\bar{x}}_{k}, y_{k})\}}_{k = 1}^{m}

in the initialization phase to get the weight matrix

W_{0}^{⊥}

and threshold vector

ξ_{0}^{⊥}

.

Algorithm 3 CPNN-based sensor nodes’ authentication.

Input: The

i^{th}

CSI to authenticate

x^{(i)}

Output: The label of unknown CSI

{\hat{y}}^{(i)}

, the new weight matrix

W^{⊥}

, and threshold vector

ξ^{⊥}

of CPNN

1: Initialize: training the CNN by the pre-acquired CSIs to obtain the convolution kernels

V^{⊥}

; the dataset

D^{⊥} = {\{({\bar{x}}_{k}, y_{k})\}}_{k = 1}^{m}

obtained by convolution; the weights

W_{0}^{⊥}

and thresholds

ξ_{0}^{⊥}

in the neural network will be obtained through the training of CPNN, using dataset

D^{⊥} = {\{({\bar{x}}_{k}, y_{k})\}}_{k = 1}^{m}

;

2: Convolution pre-processing of the CSI

x^{(i)}

into

{\bar{x}}^{(i)}

;

3: Compute

{\hat{y}}^{(i)}

by the well-trained CPNN;

4: Update the training set

D^{⊥} = {\{({\bar{x}}_{k}, y_{k})\}}_{k = 1}^{m}

by

(x^{(i)}, {\hat{y}}^{(i)})

;

5: Retrain the CPNN by the new dataset, and get new weight matrix

W^{⊥}

and threshold vector

ξ^{⊥}

;

6: Return

{\hat{y}}^{(i)}

,

W^{⊥}

,

ξ^{⊥}

.

4.4. Complexity Analysis

We compare the computational complexity of each sensor nodes’ authentication methods in this section. The initialization phase was performed offline, and we will not discuss its computational resources and latency issues. In the authentication phase, the DNN-based sensor nodes’ authentication method needs to perform:

b^{l} = W^{l} \cdot z^{l - 1} + ξ^{l} .

(11)

As shown in Table 1, the computational complexity of the mathematical operation of DNN-based method is almost

O (max (n^{1} \times n^{2}, n^{2} \times n^{3}, \dots, n^{L - 1} \times n^{L}))

, where

n^{l}

denotes the number of neurons in the

l^{th}

layer in DNN. In our numerical experiments, we used a five-layer DNN in which the number of neurons in each layer was 1024, 120, 60, 25, 4. Therefore, the computational complexity is almost

1 \times 10^{5}

. The CNN-based sensor nodes’ authentication method needs to perform:

B^{l} = Z^{l - 1} \otimes W^{l} + ξ^{l} .

(12)

The computational complexity of the mathematical operation of the CNN-based method is almost

O (max (n^{1} \times n_{k e r}^{1} \times n_{n u m}^{1}, n^{2} \times n_{k e r}^{2} \times n_{n u m}^{2}, \dots, n_{f u l l}^{L - 1} \times n^{L}))

, where

n^{l}

indicates the number of convolution operations in the

l^{th}

layer.

n_{k e r}^{l}

and

n_{n u m}^{l}

denote the dimensions and the number of convolution kernels in the

l^{th}

layer.

n_{f u l l}^{L - 1}

and

n^{L}

represent the number of neurons in the fully-connected and output layers. In our numerical experiments, we used eight convolution kernels with dimensions of

4 \times 4 \times 1

and 16 convolution kernels with dimensions of

2 \times 2 \times 8

. The dimensions of the input layer and fully-connected layer were

32 \times 32 \times 1

and

1 \times 1 \times 256

, respectively. Therefore, the computational complexity of the CNN-based method was almost

5 \times 10^{5}

. The CPNN-based sensor nodes’ authentication method needs to perform convolution pre-processing on CSI, and the computation complexity of pre-processing was relatively small. The computational complexity of the CPNN-based method is

O (max (n^{0} \times n_{k e r}^{0} \times n_{n u m}^{0}, n^{1} \times n^{2}, \dots, n^{L - 1} \times n^{L}))

, which is almost the same as that of the DNN-based method, where

n^{0}

denotes the number of convolution operations in pre-processing and

n^{l}

denotes the dimensions of the CSI after being processed in the

l^{th}

layer.

n_{k e r}^{0}

and

n_{n u m}^{0}

denote the dimension and number of convolution kernels in pre-processing, respectively. There were 16 convolution kernels of size

4 \times 4 \times 1

in the pre-processing of the CPNN-based method. There were four convolution steps. The computational complexity of the CPNN-based method was almost

2 \times 10^{4}

.

During the retraining phase, the number of parameters that needed to be trained is shown in Table 2. The DNN-based sensor nodes’ authentication method needs to train weight matrix

W^{†}

and threshold vector

ξ^{†}

, in which it needs to train

(n^{1} \times n^{2} + n^{2} \times n^{3} + \dots + n^{L - 1} \times n^{L})

parameters. There were almost

1 \times 10^{5}

parameters for the DNN-based sensor nodes’ authentication method in our numerical experiments. The CNN-based sensor nodes’ authentication method needs to train convolution kernels

W^{◊}

and threshold vector

ξ^{◊}

, which needed to train

(n_{k e r n e l}^{1} \times n_{n u m}^{1} + n_{k e r n e l}^{2} \times n_{n u m}^{2} + n_{f u l l}^{L - 1} \times n^{L})

parameters. In our numerical experiments, only

1 \times 10^{3}

parameters needed to be trained. The CPNN-based authentication method needed to train weight matrix

W^{⊥}

and threshold vector

ξ^{⊥}

. Like the DNN-based method, the parameters of CPNN-based method depended on the number of neurons in each layer. However, the dimension of the input in the CPNN-based method was much smaller than the DNN-based method. The number of neurons in each layer of CPNN was 256, 50, 25, 12, and 4. There were almost

1 \times 10^{4}

parameters that needed to be trained in the retraining phase.

5. Numerical Experiments

Simulations have been performed to evaluate the performance of DL-based PHY-layer authentication for industrial wireless sensor networks. We performed the simulations under different nodes and analyzed the impact of the number of sensor nodes on the authentication success rate. We also compared the performance of different algorithms under different numbers of sensor nodes. Cost J denotes the value of the loss function, which is calculated by (6) or (7). The authentication rate

P_{a}

is defined as the probability of discriminating the wireless sensor nodes.

We considered the tapped delay line (TDL) model to simulate Raleigh fading channels with multipath delays [32]. The TDL model uses a set of non-frequency selective fading generators (such as the FWGN model or the Jakes model), where each generator is independent of each other and has an average power of one. The channel state information of different transmitters can be generated by:

y (n) = \sum_{d = 0}^{N_{D} - 1} h_{d} (n) x (n - d) .

(13)

where

N_{D}

denotes the number of taps of the filters. We set the normalized Doppler shift

f_{d} = 0.125

, and six paths with different power delays were selected to synthesize the channels of different wireless sensor nodes. For more realistic consideration, the time delay of the first five paths of the sensor nodes was the same, which was 0 second (s),

5 \times 1 0^{- 6}

s,

1 \times 1 0^{- 5}

s,

1 . 5 \times 1 0^{- 5}

s, and

2 \times 1 0^{- 5}

s, respectively. When there were twelve sensor nodes, the time delay of the sixth path of each sensor node was as shown in Table 3.

When there were four sensor nodes, the sixth paths of each sensor node were

6 . 6 \times 1 0^{- 5}

s,

4 . 6 \times 1 0^{- 5}

s,

3 . 4 \times 1 0^{- 5}

s,

2 . 2 \times 1 0^{- 5}

s, respectively. Sampling interval

t_{s a m p l i n g} = 5 \times 1 0^{- 6}

s; the signal to noise ratio (SNR) of the simulation channel was 4 dB; the number of subcarriers was

n_{s u b_c a r r i e r} = 256

; the number of pilot intervals and of the cyclic prefix length were

n_{p i l o t_i n t e v a l} = 256

and

l_{c p_l e n g t h} = 30

, respectively.

We used a five-layer neural network for the DNN-based sensor nodes’ authentication method, where the numbers of neurons in the hidden layer were 120, 60, and 25. The size of the input layer was determined by the CSI dimension, and the size of the output layer was determined by the number of sensor nodes. The convolutional neural network used in the CNN-based algorithm had seven layers, which were an input layer, two convolution layers, two pooling layers, one fully-connected layer, and an output layer. The two convolutional layers respectively used eight convolution kernels of size

4 \times 4 \times 1

and 16 convolution kernels of size

2 \times 2 \times 8

, respectively. For the CPNN-based algorithm, it had 16 convolution kernels of size

4 \times 4 \times 1

for the convolution pre-processing. In the authentication phase and retraining phase, we used a five-layer neural network for the CPNN-based algorithm, where the numbers of neurons in the hidden layer were 50, 25, and 12. Moreover, the adaptive moment estimation (Adam) accelerated gradient algorithm and minibatch skill was used for the accelerating of the neural networks’ training.

As shown in Figure 5a, the x-axis is the number of neural network iterations and the y-axis is the cost function value. As the number of iterations increased, the cost function value decreased. In addition, the fewer the sensor nodes, the faster the cost function value dropped. We can visually see the authentication rate under different sensor nodes from Figure 5b. After 30 iterations, the authenticate rates tended to be stable. However, the authentication rate of four sensor nodes was higher than that of six sensor nodes, and the authentication rate of 12 sensor nodes was the lowest. Specifically, after 30 iterations, the authentication rates of 4 sensor nodes, 6 sensor nodes, 8 sensor nodes, and 12 sensor nodes was 95.5%, 80.83%, 77.25%, and 66.5%, respectively.

By discussing the authentication success rate under different numbers of hidden layers, we researched the robustness of the DL-based authentication method. The DNN-based algorithm had the most excellent performance. Therefore, we considered the influence of different hidden layer numbers on the authentication rate under the DNN-based method. As shown in Figure 6a, the authentication rate of the DNN-based method with different numbers of hidden layers increased as the iterations increased. The greater the number of hidden layers, the faster the convergence of the neural network’s performance. The authentication success rate of the DNN-based method with different hidden layers after the training was stabilized are shown in Figure 6b. As the number of hidden layers increased, the authentication success rate increased. However, due to the inherent characteristics of the specific wireless channels, the performance of the DNN-based method did not continue to grow and tended to be stable, after the number of hidden layers was increased to a certain number.

In addition, we performed simulation analysis on the authentication performance of different algorithms under different numbers of sensor nodes. As shown in Figure 7a, the DNN-based method had the best performance, because it had many parameters. For example, the authentication rates of the DNN-based method were 95.5% and 77.25% under four sensor nodes and eight sensor nodes, respectively. The CNN-based algorithm had the worst performance, because of the convolution and pooling and more or less lost some information of CSIs. For example, the authentication rates of the CNN-based method were 86.25% and 67.87% under four sensor nodes and eight sensor nodes. Another CPNN-based method we proposed in this paper was similar in performance to the CNN-based method. The authentication rates of CPNN-based algorithm were 85.25% and 66.75% under four sensor nodes and eight sensor nodes. However, the CPNN-based method had the shortest training time compared to the DNN-based algorithm and CNN-based algorithm, as shown in Figure 7b. Therefore, it has a better application prospect in the actual industrial wireless sensor network. We can see that the CNN-based method had the longest training time, followed by the DNN-based method.

In summary, the DNN-based sensor nodes’ authentication had the best authenticate performance and a relatively limited training time. However, its training parameters will grow exponentially as the dimensions of CSI become larger. Therefore, the DNN-based algorithm is suitable for a shorter CSI authentication scheme. The CNN-based sensor nodes’ authentication method effectively reduced the parameters that the neural network needed to train. However, due to the convolution operation and the pooling operation, it did not meet the requirements of saving training time, especially when the dimension of CSI was relatively small. At last, the CPNN-based sensor nodes’ authentication method can effectively solve the problem of training time and authentication performance. It has an unparalleled advantage in practical industrial wireless sensor network applications.

6. Experiments In Practical Environment

Experiments have been performed with universal software radio peripherals (USRPs) to evaluate the authentication performance of the proposed DL-based PHY-layer authentication algorithms in industrial wireless sensor networks. The experimental simulations were performed at the school’s engineering center, which has a large number of industrial facilities, such as computer numerical control (CNC) engraving and milling, CNC lathe, and so on. As shown in Figure 8, five radio sensor nodes equipped with industrial computer and USRPs were placed in a

43.56 \times 38.84 \times 6.5 m^{3}

factory. The base station was equipped with 8 antennas in Position 2, and the other sensor nodes were equipped with 2, 4, or 8 antennas in Positions 1, 3, 4, and 5. The distances between sensor nodes and the base station varied from 5–25 meters (m). In this experiment, we set the carrier frequency

f_{c} = 3

gigahertz (GHz), the interval of subcarriers

f_{i n t e r v a l_s u b c a r r i e r} = 15

kilohertz (kHz), and the number of subcarriers

n_{s u b c a r r i e r} = 128

. The transmitting power of USRPs was 15 dBm, and the transmission gain was 20 dB. The practical view of the engineering center is shown in Figure 9.

We tested the authentication rates of sensor nodes with different antennas in different locations. As shown in Figure 10, as the number of antennas increased, the authentication success rate increased correspondingly. For example, the authentication rate of the DNN-based algorithm with 2 antennas was 92%, while the authentication rate of the DNN-based algorithm with 4 antennas and 8 antennas was 99.5% and 99.5%, respectively. From the histograms of different colors, we can see that the DNN-based sensor nodes’ authentication method had the best performance. For example, the authentication rate of DNN-based algorithm with 8 antennas was 99.5%, while the authentication rate of the CNN-based algorithm with 8 antennas was 85%. In addition, the CPNN-based algorithm had the same performance as the CNN-based algorithm. However, the retraining time of the CPNN-based method was much shorter than that of the CNN-based algorithm.

7. Conclusions

The DL-based PHY-layer authentication method in industrial wireless sensor networks we proposed in this paper has a strong practical significance. It can both achieve lightweight authentication and authenticate multiple nodes simultaneously. Especially for the CPNN-based sensor nodes’ authentication algorithm, it had good authentication performance and an ultra-short retraining time. The DNN-based sensor nodes’ authentication had the best authenticate performance and a relatively limited training time. However, its training parameters will grow exponentially as the dimensions of CSI become larger. Therefore, the DNN-based algorithm is suitable for a shorter CSI authentication scheme. As shown in Table 2, the CNN-based algorithm and CPNN-based algorithm effectively reduced the parameters that the neural network needed to train. However, due to the convolution operation and the pooling operation, the CNN-based algorithm did not meet the requirements of saving training time, especially when the dimension of CSI was relatively small. At last, the CPNN-based sensor nodes’ authentication method can effectively solve the problem of training time and authentication performance. It has an unparalleled advantage in practical industrial wireless sensor network applications.

Author Contributions

The work was realized with the collaboration of all the authors. R.-F.L. and M.C. contributed to the main results and code implementation. H.W. and J.W. organized the work, provided the funding, and revised the draft of the paper. R.-F.L. and H.W. designed the experiments. R.-F.L., F.P., and F.X. performed the experiments. R.-F.L. and H.W. analyzed the experimental results. R.-F.L., H.W., F.P., A.X., Y.J., F.X., and M.C. discussed the results. R.-F.L. wrote the original manuscript.

Funding

This work was supported by NSFC (No. 61572114), the National Major R & D Program (2018YFB0904900, 2018YFB0904905), the Sichuan Sci & Tech Basic Research Condition Platform Project (No.2018TJPT0041); and the Sichuan Sci & Tech Service Development Project (No.18KJFWSF0368). This work was also supported in part by the Hunan Provincial Nature Science Foundation Project 2018JJ2535 and the Chile CONICYT FONDECYT Regular Project 1181809.

Acknowledgments

Special thanks to the Engineering Center of UESTCfor the experimental scene they provided.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AKA	Authentication and key agreement
ANN	Artificial neural network
CNN	Convolutional neural network
CPNN	Convolution preprocessing neural network
CSI	Channel state information
DL	Deep learning
DNN	Deep neural network
EAP	Extensible authentication protocol
IWSNs	Industrial wireless sensor networks
MHz	Megahertz
OFDM	Orthogonal frequency division multiplexing
PHY	Physical
QoS	Quality of service
ReLU	Rectified linear unit
RSS	Received signal strength
RSSI	Received signal strength indicator
TDD	Time division duplexing
USRPs	Universal software radio peripherals
WSNs	Wireless sensor networks

References

Christin, D.; Mogre, P.S.; Hollick, M. Survey on wireless sensor network technologies for industrial automation: The security and quality of service perspectives. Future Internet 2010, 2, 96–125. [Google Scholar] [CrossRef]
Low, K.S.; Win, W.N.N.; Er, M.J. Wireless sensor networks for industrial environments. In Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’06), Vienna, Austria, 28–30 November 2005; pp. 271–276. [Google Scholar]
Bal, M. Industrial applications of collaborative wireless sensor networks: A survey. In Proceedings of the 2014 IEEE 23rd International Symposium on Industrial Electronics (ISIE), Istanbul, Turkey, 1–4 June 2014; pp. 1463–1468. [Google Scholar]
Zoppi, S.; Van Bemten, A.; Gürsu, H.M.; Vilgelm, M.; Guck, J.; Kellerer, W. Achieving Hybrid Wired/Wireless Industrial Networks with WDetServ: Reliability-Based Scheduling for Delay Guarantees. IEEE Trans. Ind. Informat. 2018, 14, 2307–2319. [Google Scholar] [CrossRef]
Pan, F.; Pang, Z.; Luvisotto, M.; Xiao, M.; Wen, H. Physical-Layer Security for Industrial Wireless Control Systems: Basics and Future Directions. IEEE Ind. Electron. Mag. 2018, 12, 18–27. [Google Scholar] [CrossRef]
Luvisotto, M.; Pang, Z.; Dzung, D. Ultra high performance wireless control for critical applications: Challenges and directions. IEEE Trans. Ind. Informat. 2017, 13, 1448–1459. [Google Scholar] [CrossRef]
Zhao, G.; Yang, X.; Zhou, B.; Wei, W. RSA-based digital image encryption algorithm in wireless sensor networks. In Proceedings of the 2nd International Conference on Signal Processing Systems, Dalian, China, 5–7 July 2010. [Google Scholar]
Aysal, T.C.; Barner, K.E. Sensor data cryptography in wireless sensor networks. IEEE Trans. Inf. Forensics Secur. 2008, 3, 273–289. [Google Scholar] [CrossRef]
Bhardwaj, I.; Kumar, A.; Bansal, M. A review on lightweight cryptography algorithm for data security and authentication in IoTs. In Proceedings of the 4th International Conference on Signal Processing, Computing and Control (ISPCC), Solan, India, 21–23 September 2017; pp. 504–509. [Google Scholar]
Moreira, C.M.; Kaddoum, G.; Bou-Harb, E. Cross-layer authentication protocol design for ultra-dense 5g hetnets. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City, MO, USA, 20–24 May 2018. [Google Scholar]
Neshenko, N.; Bou-Harb, E.; Crichigno, J.; Kaddoum, G.; Ghani, N. Demystifying IoT Security: An Exhaustive Survey on IoT Vulnerabilities and a First Empirical Look on Internet-scale IoT Exploitations. IEEE Commun. Surveys Tut. 2019. [Google Scholar] [CrossRef]
Xiao, L.; Sheng, G.; Wan, X.; Su, W.; Cheng, P. Learning-Based PHY-Layer Authentication for Underwater Sensor Networks. IEEE Commun. Lett. 2019, 23, 60–63. [Google Scholar] [CrossRef]
Das, M.L. Two-factor user authentication in wireless sensor networks. IEEE Trans. Wireless Commun. 2009, 8, 1086–1090. [Google Scholar] [CrossRef]
Liu, H.; Wang, Y.; Liu, J.; Yang, J.; Chen, Y.; Poor, H.V. Authenticating users through fine-grained channel information. IEEE Trans. Mobile Comput. 2018, 17, 251–264. [Google Scholar] [CrossRef]
Liu, T.Y.; Lin, P.H.; Lin, S.C.; Hong, Y.W.P.; Jorswieck, E.A. To avoid or not to avoid CSI leakage in physical layer secret communication systems. IEEE Commun. Mag. 2015, 53, 19–25. [Google Scholar] [CrossRef]
Zou, Y.; Wang, X.; Shen, W. Optimal relay selection for physical-layer security in cooperative wireless networks. J. Sel. Areas Commun. 2013, 31, 2099–2111. [Google Scholar] [CrossRef]
Tugnait, J.K. Wireless user authentication via comparison of power spectral densities. IEEE J. Sel. Areas Commun. 2013, 31, 1791–1802. [Google Scholar] [CrossRef]
Pei, C.; Zhang, N.; Shen, X.S.; Mark, J.W. Channel-based physical layer authentication. In Proceedings of the 2014 IEEE Global Communications Conference, Austin, TX, USA, 8–12 December 2014; pp. 4114–4119. [Google Scholar]
Bhargava, V.; Sichitiu, M.L. Physical Security Perimeters for Wireless Local Area Networks. Int J. Netw. Secur. 2006, 3, 73–84. [Google Scholar]
Chen, Y.; Yang, J.; Trappe, W.; Martin, R.P. Detecting and localizing identity-based attacks in wireless and sensor networks. IEEE Trans. Veh. Technol. 2010, 59, 2418–2434. [Google Scholar] [CrossRef]
Yang, J.; Chen, Y.; Trappe, W.; Cheng, J. Detection and localization of multiple spoofing attackers in wireless networks. IEEE Trans. Parallel Distrib. Syst. 2013, 24, 44–58. [Google Scholar] [CrossRef]
Xie, F.; Wen, H.; Li, Y.; Chen, S.; Hu, L.; Chen, Y.; Song, H. Optimized Coherent Integration-Based Radio Frequency Fingerprinting in Internet of Things. IEEE Internet Things J. 2018, 5, 3967–3977. [Google Scholar] [CrossRef]
Chen, Y.; Wen, H.; Song, H.; Chen, S.; Xie, F.; Yang, Q.; Hu, L. Lightweight one-time password authentication scheme based on radio-frequency fingerprinting. IET Commun. 2018, 12, 1477–1484. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Arulampalam, G.; Bouzerdoum, A. A generalized feedforward neural network architecture for classification and regression. Neural Netw. 2003, 16, 561–568. [Google Scholar] [CrossRef]
Carpenter, G.A. Neural network models for pattern recognition and associative memory. Neural Netw. 1989, 2, 243–257. [Google Scholar] [CrossRef]
Illy, P.; Kaddoum, G.; Moreira, C.M.; Kaur, K.; Garg, S. Securing Fog-to-Things Environment Using Intrusion Detection System Based On Ensemble Learning. arXiv 2019, arXiv:1901.10933. [Google Scholar]
Ye, H.; Li, G.Y.; Juang, B.H. Power of deep learning for channel estimation and signal detection in OFDM systems. IEEE Wireless Commun. Lett. 2018, 7, 114–117. [Google Scholar] [CrossRef]
Liao, R.F.; Wen, H.; Wu, J.; Song, H.; Pan, F.; Dong, L. The Rayleigh Fading Channel Prediction via Deep Learning. Wireless Commun. Mobile Comput. 2018, 2018, 1–11. [Google Scholar] [CrossRef]
Wang, N.; Jiang, T.; Lv, S.; Xiao, L. Physical-layer authentication based on extreme learning machine. IEEE Commun. Lett. 2017, 21, 1557–1560. [Google Scholar] [CrossRef]
Liao, R.F.; Wen, H.; Pan, F.; Song, H.; Xu, A.; Jiang, Y. A Novel Physical Layer Authentication Method with Convolutional Neural Network. IEEE ICAICA 2019. accepted. [Google Scholar]
Clarke, R. A statistical theory of mobile-radio reception. Bell Syst. Tech. J. 1968, 47, 957–1000. [Google Scholar] [CrossRef]

Figure 1. The deep neural network.

Figure 2. The convolutional neural network. CSI, channel state information.

Figure 3. The system model of DL-based PHY-layer authentication in IWSNs.

Figure 4. DL-based PHY-layer authentication flow chart.

Figure 5. The authentication performance with different sensor nodes. (a) The cost value under different numbers of sensor nodes with the DNN-based method; (b) The authentication rate under different numbers of sensor nodes with the DNN-based method.

Figure 6. The authentication performance with different numbers of hidden layers. (a) The authentication rate of different numbers of hidden layers; (b) The authentication rate of different numbers of hidden layers after training was stabilized.

Figure 7. The authentication performance with different algorithms. (a) The authentication rate of different algorithms under different numbers of sensor nodes; (b) The time in the training phase of different algorithms under different numbers of sensor nodes.

Figure 8. The network topology.

Figure 9. The location of the wireless sensor nodes in the practical industrial scenario.

Figure 10. The authentication rate with USRPs.

Table 1. The computational complexity in the authentication phase.

Algorithms	Computational Complexity	Simulation
DNN-based	$O (max (n^{1} \times n^{2}, n^{2} \times n^{3}, \dots, n^{L - 1} \times n^{L}))$	$1 \times 10^{5}$
CNN-based	$O (max (n^{1} \times n_{k e r}^{1} \times n_{n u m}^{1}, n^{2} \times n_{k e r}^{2} \times n_{n u m}^{2}, \dots, n_{f u l l}^{L - 1} \times n^{L}))$	$5 \times 10^{5}$
CPNN-based	$O (max (n^{0} \times n_{k e r}^{0} \times n_{n u m}^{0}, n^{1} \times n^{2}, \dots, n^{L - 1} \times n^{L}))$	$2 \times 10^{4}$

Table 2. The number of parameters in the retraining phase.

Algorithms	Number of Parameters	Simulation
DNN-based	$(n^{1} \times n^{2} + n^{2} \times n^{3} + \dots + n^{L - 1} \times n^{L})$	$1 \times 10^{5}$
CNN-based	$(n_{k e r n e l}^{1} \times n_{n u m}^{1} + n_{k e r n e l}^{2} \times n_{n u m}^{2} + n_{f u l l}^{L - 1} \times n^{L})$	$1 \times 10^{3}$
CPNN-based	$(n^{1} \times n^{2} + n^{2} \times n^{3} + \dots + n^{L - 1} \times n^{L})$	$1 \times 10^{4}$

Table 3. The time delay of the sixth path of 12 sensor nodes.

Sensor Node 1	Sensor Node 2	Sensor Node 3	Sensor Node 4	Sensor Node 5	Sensor Node 6
$6.6 \times 1 0^{- 5}$ s	$6.2 \times 1 0^{- 5}$ s	$5.8 \times 1 0^{- 5}$ s	$5.4 \times 1 0^{- 5}$ s	$5.0 \times 1 0^{- 5}$ s	$4.6 \times 1 0^{- 5}$ s
Sensor Node 7	Sensor Node 8	Sensor Node 9	Sensor Node 10	Sensor Node 11	Sensor Node 12
$4.2 \times 1 0^{- 5}$ s	$3.8 \times 1 0^{- 5}$ s	$3.4 \times 1 0^{- 5}$ s	$3.0 \times 1 0^{- 5}$ s	$2.6 \times 1 0^{- 5}$ s	$2.2 \times 1 0^{- 5}$ s

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liao, R.-F.; Wen, H.; Wu, J.; Pan, F.; Xu, A.; Jiang, Y.; Xie, F.; Cao, M. Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks. Sensors 2019, 19, 2440. https://doi.org/10.3390/s19112440

AMA Style

Liao R-F, Wen H, Wu J, Pan F, Xu A, Jiang Y, Xie F, Cao M. Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks. Sensors. 2019; 19(11):2440. https://doi.org/10.3390/s19112440

Chicago/Turabian Style

Liao, Run-Fa, Hong Wen, Jinsong Wu, Fei Pan, Aidong Xu, Yixin Jiang, Feiyi Xie, and Minggui Cao. 2019. "Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks" Sensors 19, no. 11: 2440. https://doi.org/10.3390/s19112440

APA Style

Liao, R.-F., Wen, H., Wu, J., Pan, F., Xu, A., Jiang, Y., Xie, F., & Cao, M. (2019). Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks. Sensors, 19(11), 2440. https://doi.org/10.3390/s19112440

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep-Learning-Based Physical Layer Authentication for Industrial Wireless Sensor Networks

Abstract

1. Introduction

2. Preliminaries

2.1. Channel State Information

2.2. Deep Neural Network

2.3. Convolutional Neural Network

3. System Model

4. Deep Learning-Based Sensor Nodes’ Authentication Algorithms

4.1. DNN-Based Sensor Nodes’ Authentication

4.2. CNN-Based Sensor Nodes’ Authentication

4.3. Convolution Pre-Processing Neural Network-based Sensor Nodes’ Authentication

4.4. Complexity Analysis

5. Numerical Experiments

6. Experiments In Practical Environment

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI