Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain

Hakim, Mohammed; Omran, Abdoulhadi A. Borhana; Inayat-Hussain, Jawaid I.; Ahmed, Ali Najah; Abdellatef, Hamdan; Abdellatif, Abdallah; Gheni, Hassan Muwafaq

doi:10.3390/s22155793

Open AccessArticle

Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain

by

Mohammed Hakim

^1,*,

Abdoulhadi A. Borhana Omran

²

,

Jawaid I. Inayat-Hussain

³,

Ali Najah Ahmed

⁴

,

Hamdan Abdellatef

⁵

,

Abdallah Abdellatif

⁶ and

Hassan Muwafaq Gheni

⁷

¹

Department of Mechanical Engineering, College of Engineering, Universiti Tenaga Nasional, Jalan IKRAM-UNITEN, Kajang 43000, Selangor, Malaysia

²

Department of Mechanical and Mechatronic Engineering, Faculty of Engineering, Sohar University, Sohar P.C 311, Oman

³

College of Graduate Studies, Universiti Tenaga Nasional, Jalan IKRAM-UNITEN, Kajang 43000, Selangor, Malaysia

⁴

Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional, Jalan IKRAM-UNITEN, Kajang 43000, Selangor, Malaysia

⁵

School of Engineering-Electrical & Computer Engineering Department, Lebanese American University, Byblos 1102, Lebanon

⁶

Expert System and Optimization Laboratory, Department of Electrical Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur 50603, Selangor, Malaysia

⁷

Computer Techniques Engineering Department, Al-Mustaqbal University College, Hillah 51001, Iraq

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(15), 5793; https://doi.org/10.3390/s22155793

Submission received: 30 June 2022 / Revised: 26 July 2022 / Accepted: 30 July 2022 / Published: 3 August 2022

(This article belongs to the Special Issue Machinery Testing and Intelligent Fault Diagnosis)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The massive environmental noise interference and insufficient effective sample degradation data of the intelligent fault diagnosis performance methods pose an extremely concerning issue. Realising the challenge of developing a facile and straightforward model that resolves these problems, this study proposed the One-Dimensional Convolutional Neural Network (1D-CNN) based on frequency-domain signal processing. The Fast Fourier Transform (FFT) analysis is initially utilised to transform the signals from the time domain to the frequency domain; the data was represented using a phasor notation, which separates magnitude and phase and then fed to the 1D-CNN. Subsequently, the model is trained with White Gaussian Noise (WGN) to improve its robustness and resilience to noise. Based on the findings, the proposed model successfully achieved 100% classification accuracy from clean signals and simultaneously achieved considerable robustness to noise and exceptional domain adaptation ability. The diagnosis accuracy retained up to 97.37%, which was higher than the accuracy of the CNN without training under noisy conditions at only 43.75%. Furthermore, the model achieved an accuracy of up to 98.1% under different working conditions, which was superior to other reported models. In addition, the proposed model outperformed the state-of-art methods as the Signal-to-Noise Ratio (SNR) was lowered to −10 dB achieving 97.37% accuracy. In short, the proposed 1D-CNN model is a promising effective rolling bearing fault diagnosis.

Keywords:

deep learning; one-dimensional convolutional neural network; signal-to-noise ratio; fault diagnosis; fast Fourier transform; bearing

1. Introduction

The incremental expansion of the modern rotating machinery industry has necessitated the present system to perform with greater reliability and safety and, at the same time, minimal production costs and maintenance expenditures. In view of this, fault diagnosis is a crucial element that can improve product efficacy and reduce the risk of accidental hazards in sophisticated mechanical systems [1]. The rolling element bearing is a vital component in rotating machinery fault diagnosis, where bearing failures contribute roughly 45–55% of the total mechanical equipment failures [2,3,4,5], while bearing faults account for 90% of small rotary machine failures [6,7]. The early detection of motor faults and correct diagnosis performance, particularly during complex and changeable load and working conditions, is therefore essential to avoid heavy financial loss and prevent catastrophic consequences [8,9,10].

With the tremendous development in the fields of artificial intelligence, machine learning has become an essential technique and approach utilised in many areas, including fault diagnosis [11,12], computer vision [13], structural engineering [14], machine health monitoring [15], electricity production [16], etc.

Over the years, three main steps have been considered for Machine Learning (ML)-based fault detection methods of rolling bearings, which are: (1) obtaining vibrational signals of the equipment; (2) pre-processing and extracting important feature information from the acquired signals; (3) fault diagnosis [17,18]. Vibrational signals are widely applied in signal acquisition primarily due to their ability to transmit a massive amount of information and can be easily measured [19]. Meanwhile, various rotating machinery data-driven signal processing methods, such as Fast Fourier Transform (FFT) [20], Wavelet Transform (WT) [21], Empirical Mode Decomposition (EMD) [22], and Ensemble Empirical Mode Decomposition (EEMD) [23], are utilised to evaluate the acquired signals and extract useful fault properties through the time-domain, frequency-domain, or time-frequency-domain methodologies [24]. Eventually, the retrieved signal features are applied to train machines to express the extracted features. Previously, a novel hybrid approach that integrated the Variational Mode Decomposition and Support Vector Machines (SVMs) was proposed to identify the type of rolling bearing faults [25]. In another study, an intelligent defect diagnostic approach was suggested based on the combined Singular Value Decomposition (SVD) and Local Mean Decomposition (LMD), which analysed non-linear and non-stationary vibration data, as well as the Extreme Learning Machine (ELM) [26]. Furthermore, an optimised k-nearest neighbour model was proposed according to the kernel principal component analysis and Particle Swarm Optimisation (PSO) [27], apart from other ML approaches, as described in [10,28,29].

While conventional intelligent methods functioned properly and produced accurate diagnosis outcomes, they still have two drawbacks: (1) they exhibit poor fault classification accuracy features since the signals are normally extracted through manual approaches based on prior information and diagnostic techniques, which are developed for certain fault types only and are unsuitable for other faults [30]; (2) they suffer a significant drop in model accuracy given that signals acquired in real-world industries are often subjected to environmental noise [31]. Consequently, it is imperative to construct new intelligent fault diagnosis methods that can automate fault diagnosis processes.

Besides extracting representative features from the acquired signals, Deep Learning (DL) methods are valuable for evaluating complicated and non-stationary signals, which offers the potential to address the constraints of conventional intelligent diagnosis methods [32,33]. Numerous DL models have been developed and used in rolling element-bearing fault diagnostics applications, such as Autoencoder (AE) [34], Recurrent Neural Network (RNN) [35], Deep Belief Network (DBN) [36], and Generative Adversarial Network (GAN) [37]. Among the popular DL methods is the Convolution Neural Network (CNN) [3], which assists in minimising the computational load on the network while reducing the risk of overfitting. These properties relatively enhance the accuracy and efficiency of the network, which is highlighted through its capacity to automatically extract features from images or signals [38]. Recently, Shaobo et al. [39] proposed an ensemble convolutional neural network model with improved D-S evidence fusion to study benchmark-bearing fault diagnosis. The structure of the CNN model is composed of three convolutional layers plus a full connection layer, the model was trained with two sensor signals, whose outputs are fused using the improved D-S fusion algorithm, and it showed good adaptability on the bearing fault datasets under different load conditions with multi-sensors signals. On the other hand, wanlu et al. [40] solved the problem of an imbalanced sample of the original hydraulic pump vibration signal, and integrate a one-dimensional convolutional neural network (1D-CNN) with generative adversarial networks to produce small sample size fault samples. The model showed a high-quality solution to the imbalanced sample issue during model training. Moreover, the Multi-Scale Deep CNN (MS-DCNN) was utilised to decrease the training period and network variables needed by the CNN algorithm. The model, which contained nine network layers, used a 1*1 convolution to reduce the dimensionality as well as improve the network depth and width, and greatly reduced the network parameters and the training time [41].

Based on the above-mentioned literature, DL approaches demonstrate superior performance over conventional ML techniques and offer a comprehensive alternative to fault classification. Nevertheless, the major downsides of DL approaches are as follows:

Most DL and ML models perform poorly when subjected to noisy environments, where the decrease in model accuracy corresponds with the growing noise levels.
Although the accuracy of the models can be increased, the structure of the models also becomes more intricate, affecting the interpretability of the real-world implementation of the models.

Thus, the proposed 1D-CNN model in this study was aimed to address the above challenges with potential contributions, which are summarised as follows:

Unlike previous studies that applied only magnitude as input and discarded the phase that includes important information about the signal, this study utilised the magnitude and phase components as two separate inputs in the proposed 1D-CNN, which was trained and operated in the frequency domain. The frequency-domain representation allows a better understanding of the signal and enhances the performance in terms of accuracy and computational complexity.
A lightweight four-layer 1D-CNN model was proposed with 9220 parameters, and only 2.6 M Floating-Point Operations (FLOP) were used. The model used to process the benchmarking data of Case Western Reserve University (CWRU) could achieve 100% and 99.3% accuracy with and without additive noise, respectively.
The model is trained with additive noise to improve its resilience to noise. To demonstrate robustness, we show that our model, when trained with signals that have additive noise with SNR (−4~2) dB, achieves 99.3%, 98.8% and 97.3% accuracy for SNRs −6, −8 and −10, respectively.
The proposed model outperforms the previous state-of-the-art works on fault-bearing detection.

Following the introduction section, Section 2 of this paper describes a brief review of the related work. In Section 3, the principles of FFT analysis and CNN are presented, while in Section 4, the proposed model is thoroughly described. Section 5 provides the analyses and evaluation of the experimental results. Finally, Section 6 presents the conclusion and recommendations of this study for future research.

2. Background and Related Studies

The first part of this section presents a brief literature review of past studies related to intelligent bearing fault diagnosis. The discussion highlights two fundamental components techniques that were adopted in the present study, which include the frequency-domain-based fault diagnosis method and noisy environment in rotating machinery fault diagnosis. The second part of this section provides a short introduction to the proposed method that involves FFT and several key features of the efficient 1D-CNN design and its advantages.

2.1. Related Studies

The problem of interference from noise signals is causing a significant issue for researchers. For instance, a study on the impact of radial internal clearance on the dynamical response of ball bearings indicates that the parameters like entropy and recurrence period density entropy refer to the complexity of the experimental signals because of the affected by noise interference [42].

In general, the time domain or the frequency domain can be used to analyse the signals. The signal properties are more distinct when time-domain signals are converted to frequency-domain signals since the latter signals are less influenced by noise compared to the former signals [43]. The presence of a fault characteristic frequency would amplify the signal component amplitudes that are associated with the characteristic component amplitudes that are associated with the characteristic fault frequency, which allows the detection of the failed bearing location through the initial vibration signal of the frequency components, which corresponds to the nature of the bearings [38,44,45]. Considering the significant noise interference at industrial sites, it is, therefore, necessary to examine the anti-noise capacity of a particular model and address the strong noise hindrance.

Numerous studies have been carried out in search of a solution to noise interference. For instance, the periodic potential to generate the adaptive stochastic resonance (PSR) is utilised in this study and aims to identify weak character signals for the diagnosis of rolling element-bearing faults. The concept of this study is that a phenomenon occurs in which the signal is enhanced, and the noise is weakened. The findings of the analysis reveal that the model is effective and is able to deal with a variety of different working scenarios. The limitation of this study is that it focuses on a specific problem for which the theoretical frequency value can be determined prior to detection. Because of this, the technique needs to be applied multiple times if the type of fault is unknown [46].

A five-layer Deep Convolutional Neural Network with a wide input kernel and small following kernels that were demonstrated to achieve high accuracy (WDCNNs) was proposed to diagnose bearing defects by utilising raw vibration signals as the input data and processing them through the WDCNN model [47]. The WDCNN performance decreased drastically when the Adaptive Batch Normalisation (AdaBN), which is a statistical algorithm that requires statistical knowledge of the entire test data, was not applied. Meanwhile, the use of Training Interference in six-layer Convolutional Neural Networks (TICNN) [48] revealed two main interferences, which include the first-layer kernel dropout with a constantly fluctuating rate and an exceptionally small training batch. Additionally, ensemble learning has been used to improve their model’s performance. Thus, the use of data augmentation facilitates the enhanced performance of the TICNN model as the working load increases.

The development of the Multi-Scale Cascade Convolutional Neural Network (MC-CNN) involved the addition of convolutional layers before the multi-scale cascade layer to generate a new signal with more recognisable information, as well as the addition of a convolutional layer with kernels of modest size and a pooling layer after the multi-scale cascade layer to restrict the abundance of neurons generation by the multi-scale signal [49]. Furthermore, the Radial Basis Function Neural Network, combined with the power spectrum of the Welch method (W-RBFNN), was developed to convert time-domain signals into power spectrum [50]. The method could also simultaneously remove the effects of the initial phase difference and minimise the noise effect.

While these approaches have achieved significant progress, the accuracy of these models is reduced when the noise level exceeds −4 dB. Zhou et al. [43] proposed an integrated framework via the Convolutional Neural Network and Frequency-Domain Feature Matching (CNN-FDFM) algorithm to assist in sustaining the excessive noise levels. Essential frequency features from the frequency-domain signals are captured by FDFM and retained at high accuracy with limited samples under high noise conditions. Additionally, the use of the dropout method with larger kernels in the first convolutional layer could simulate the noise input during CNN training and enhance the anti-noise ability of CNN. Nevertheless, the CNN-FDFM is unsuitable for applications under varying operating conditions as the accuracy of the model decreased by around 7% when the noise level exceeded −10 dB.

Other DL methods have been developed, such as deep neural network-based few-shot learning approach [51]. And the Stacked Inverted Residual Convolution Neural Network (SIRCNN), to avoid noise resistance [18]. The suggested neural network comprises one typical convolutional layer with a convolution kernel of 3 × 3, six inverted residual block structures, and depthwise separable convolution. Vibration spectra were created from the one-dimensional data and used as model input. Most of these methods are, however, unable to apprehend high-noise levels, limiting their possible real-world application. Hence, this study attempts to reduce the gap between theoretical findings and practical implementation.

2.2. Convolutional Neural Network (CNN)

CNN is one of the most prevalent DL algorithms that can reduce the number of parameters using spatial convolution relations. The model is derived from multiple layers that are fully connected to the neural network with various filter processes and a single classification process. In comparison to a fully connected network, the superior performance of CNN in numerous engineering applications is contributed by its shared weights, local connects, and pooling operators [52]. A conventional CNN framework comprises a convolutional layer, activation layer, pooling layer, and Fully Connected (FC) layer:

Convolutional layer: This layer, which utilises a class of learnable Gaussian kernel filters to convolve with the input data, generates the feature maps and can be expressed as:

$x_{s}^{k} = f (\sum_{j = 1}^{J} w_{j s}^{k} * x_{j}^{k - 1} + B_{s}^{k}),$

(1)

where $X_{s}^{k}$ refers to the $j$ th feature map at ( $k$ − 1)-th operation, $W_{j s}^{k}$ is the $k$ -th operation’s kernel weight parameter between the $j$ -th input and s-th output, while $B_{s}^{k}$ represents the corresponding bias. Moreover, f (.) denotes a non-linear activation function. In addition, the Rectified Linear Unit (ReLU) is often used to perform the activation process due to its outstanding gradient efficiency, which can be termed as:

$y_{i j k}^{l - 1} = m a x {0, x_{i j k}^{l - 1}},$

(2)

where $y_{i j k}^{l - 1}$ is defined as the coordinate ( $i, j$ ) value in the $k$ th feature map of the ( $l$ − 1)th layer.
Activation layer: Following the convolution operation, the activation layer function is crucial for the network to obtain a non-linear expression of the input signal so that the representation ability is enhanced and permits the learned features to be further dividable. Recently, ReLU has been extensively applied as an activation unit to speed up the CNN convergence by forming more trainable weights in the shallow layer when the back-propagation learning approach is used to modify the variables. The ReLU formula is expressed as:

$a^{l (i, j)} = f (b^{l (i, j)}) = \max {0, b^{l (i, j)}}$

(3)

where $a^{l (i, j)}$ refers to the activation value of the output $b^{l (i, j)}$ of the convolution layer.
Pooling layer: The objective of the pooling layer is to preserve spatial invariance and minimise the middle function map dimensions via the computational statistics method. The service area is first assigned by sliding a personalised pooling operation window onto the input function diagram, followed by the use of a numerical statistical approach to represent these values and minimise the resolution of the selected area. It is also crucial to select the stride parameter of the pooling layer, given its substantial impact on reducing the resolution and numerical information preservation. The maximum pooling (the maximum value in the local acceptance domain) and average pooling (average of all values in the local acceptance domain) are the frequently used pooling methods, which are expressed as follows:

$y_{i j k}^{l - 1} = m a x {0, x_{i j k}^{l - 1}}$

(4)

where 0 $< i^{'} \leq n; 0 < j^{'} \leq m; i^{'} a n d j^{'} \in Z^{+}; n$ denotes the length of the pooling window, m refers to the width, while $X_{i^{'}, j^{'}}^{l - 1}$ denotes the covered data pooling window.

$y_{i}^{l} +^{1} (j) = \underset{(j - 1) W + 1 \leq t \leq j W}{average} \cdot {U_{i}^{l} (t)}$

(5)

where $W$ represents the pool area width; $U_{i}^{l} (t)$ denotes the $t$ th neuron value in the $i$ th eigenvector of the $l$ th layer, and the $t \in [(j - 1) w + 1, j w]; y_{i}^{l} +$ ¹ $(j)$ corresponds to the $l + 1$ neuron value.
FC layer: The final layer is designed to complement its non-linear input. The completely connected layer fitting operation is expressed as follows:

$Y = f_{F} (W X_{i}^{l} + b)$

(6)

where $Y$ refers to the output, W represents the full connection matrix, and $X_{i}^{l}$ defines the output of the upper layer. Additionally, $f$ _F denotes the activate function, while the number of formed categories is nearly equal to the output channel of the final FC layer. Figure 1 illustrates the flow process of CNN.

2.3. Fast Fourier Transform (FFT)

FFT is a useful analytical tool to remodel periodical waveforms via series harmonics in which the harmonic frequency is denoted as a multiple of fundamental. The FFT and its inverse formula are expressed as follows:

F (ω) = \int_{- \infty}^{\infty} f (t) e^{i ω t} d t

(7)

f (t) = \frac{1}{2 π} \int_{- \infty}^{\infty} F (ω) e^{i ω t} d ω

(8)

where f(t) refers to a given time-domain signal and F(ω) represents the FFT of the

f (t)

in the frequency domain. Despite that FFT is an ideal instrument to determine the natural structural frequencies, the method is unable to indicate the time information when a particular frequency component takes place. Naturally, the rolling element-bearing signals are non-linear and non-stationary [53]. As such, the signal structure contains hidden periodicities, which convey supplementary information.

X^{(N)} (P) = \sum_{n = 0}^{N - 1} x_{n} e^{\frac{- i 2 π p n}{N}}

(9)

= \sum_{g}^{\frac{N}{2} - 1} x_{(2 g)} e^{\frac{- i 2 π p g}{\frac{N}{2}}} + e^{\frac{- i 2 π p}{N}} \sum_{h}^{\frac{N}{2} - 1} x_{(2 h + 1)} e^{\frac{- i 2 π k h}{\frac{N}{2}}}

(10)

= X_{0}^{(\frac{N}{2})} (P) + e^{\frac{- i 2 π p}{N}} X_{1}^{(\frac{N}{2})} (P)

(11)

where p = 0, 1, 2, …,

N

− 1, and g,

h

= 0, 1, 2, …,

\frac{N}{2}

− 1. Furthermore,

X_{0}^{(\frac{N}{2})}

(P) represents the

\frac{N}{2}

point of the Discrete Fourier Transform (DFT) of X(N), which is regarded as even-numbered, while

X_{1}^{(\frac{N}{2})} (P)

denotes the

\frac{N}{2}

point of DFT of X(N) and is regarded as odd-numbered. Both functions are periodic and discrete. Apart from that, consider the following:

W_{N} = e^{\frac{- i 2 π}{N}}

(12)

Then, W_{N}^{P + \frac{N}{2}} = - W_{N}^{P}

(13)

Here,

W_{N}^{P}

for p = 0, 1, 2, …, N − 1 refers to the Nth root of unity. Equations (12) and (13) are combined to derive the following:

X^{(N)} (P + \frac{N}{2}) = X_{0}^{(\frac{N}{2})} (P + \frac{N}{2}) - W_{N}^{P} X_{1} (P)

(14)

where p = 0, 1, 2, …,

\frac{N}{2}

− 1. The frequency-domain information is also derived from the signal with

\frac{N}{2}

multiplications as opposed to the N complex multiplication. Consequently, the computational complexity becomes O(N log N). These vibration signals can be processed through an FFT can process by maintaining the original amplitude and phase information, splitting them into individual sinusoidal oscillations at specific frequencies [54].

3. Methodology

In conventional CNN methods, the network inputs are composed of non-pre-processed raw signals, which would result in low prediction accuracy irrespective of the change in the hyper-parameters. This is due to the insufficient information in the input that is useful to achieve precise classification. Therefore, it is essential to improve the prediction performance of the model by implementing information improvement techniques into the input dataset. Taking into account the extreme and rough conditions of the industrial environment with a myriad of interference, the acquired data by the sensor are severely riddled with noise.

Thus, the fault diagnosis model framework in this study was developed via a three-step approach, as shown in Figure 2. The primary step involved the vibration signal acquisition from the equipment. The second step consists of the pre-processing approach to convert the raw signals into training samples that are made up of a set of input data with associated class labels. In addition, the pre-processing step comprises two main parts, namely noise injection and FFT representation, which are further discussed in Section 3.1 and Section 3.2. Finally, a four-layer lightweight 1D-CNN for fault classification was developed.

3.1. Robustness Improvement with Noise Injection

Noise injection was performed on the training data to generate noisy samples with varying Signal-to-Noise Ratios (SNR) in order to improve the robustness and performance of the model to noise on the hidden noisy data. The additive Gaussian noise was added to form composite signals with varying SNR, as follows:

S N R_{d B} = 10 \log_{10} (\frac{P_{s i g n a l}}{P_{n o i s e}})

(15)

where

p_{s i g n a l}

and

p_{n o i s e}

represent the signal power and noise power, respectively. The additive White Gaussian Noise (WGN) was employed to amplify the original signal fault, as shown in Figure 3. This was in line with past studies that stated the use of White Gaussian noise to train the network as an effective method to carry out model regularisation. Thus, enhancing the model’s robustness against input variation [55]. Moreover, the SNR of the composite noisy signal was set to 0 dB, indicating the equal value of the noise power to the original signal power, as depicted in Figure 4. The proposed CNN model was then evaluated using the noisy signals with an amplitude range of −10 to 10 dB.

3.2. Frequency-Domain

The signal representation using the frequency-domain exhibit several favourable characteristics over its time-domain counterpart, especially its efficient ability to identify the necessary specific frequency components [56]. Hence, FFT is used to convert time-domain vibration signals into discrete frequency components, while both FFT and DFT algorithms are applied to examine the raw frequency-domain vibration signals.

Although vibration signals are based on time-domain signals, which is the case with most DL methods, they are composed of multiple basis signals with varying phases, frequencies, and amplitudes that are frequency-domain representations of the vibration signals. Figure 5 illustrates that each fault type produces distinct vibration signals with different FFT spectrums. Moreover, signals of varying fault types possess differing dominant frequency bands. In other words, valuable information is confined to various frequency bands. Hence, FFT was employed as a simple signal pre-processing step, where the frequency representation allows an in-depth understanding of the signal, as opposed to the complex interpretation of the time domain.

Given that the FFT domain is complex-valued, the data was represented using a phaser notation, which separates the magnitude and phase into two groups of inputs for the 1D-CNN to exclude complex computations, as depicted in Figure 6. Both components provide essential features to classify the bearing fault type, where the phase ensures similar signals’ correctness and diversity, while the magnitude peaks highlight interesting features. In comparison, previous work that employed FFT only considered the magnitude and overlooked the critical information in the phase for the signal representation. The present study applied FFT to convert a fixed window size of 4096 from the time-domain input signal to the desired window size and sampling size, which is explained further in Section 4.1.

3.3. Development of the 1D-CNN Model

As a major DL approach, CNN offers exceptional performance in detection and classification problems. The development of CNNs involves three levels, namely the convolutional layer, pooling layer, and FC layer. Valuable features from the input data are extracted using the convolutional and pooling layers, while the FC layer is predominantly accountable for the classification process. The convolutional layer convolves the input signal using a sequence of kernels in a new receptive field. Each kernel was utilised to extract features from the input signal at specific locations by sliding the kernel with a fixed stride. The weight of the kernel was shared during the convolution procedure.

In addition, a four-layer 1D convolution network (1Conv- 1Conv- 1Pool-1FC) was employed in all experiments, as presented in Table 1. The first convolution layer filter was a 16*1 sliding window with four strides and no padding, while the second convolution layer filter was an 8*1 sliding window with four strides and no padding. Next, an adaptive average pooling layer was used to aggregate each channel into a single element by calculating the mean. Finally, the FC layer was fixed with a total of 10 neurons and a SoftMax function, based on the proposed CNN model structure shown in Figure 7.

The 1D-CNN was utilised to adaptively learn the characteristics from the raw time-domain vibration signals without prior information. The CNN input consists of a fraction of a normalised bearing fault vibration temporal signal presented in frequency, as described in Section 2. The total number of parameters and FLOPs were 9220 and 2.6 M, respectively, which was considered a lightweight CNN. Table 1 also includes the number of parameters and FLOPs applied in the proposed model.

4. Experimental Setup

4.1. Dataset Preparation and Partitioning

The CWRU [57] benchmark dataset was retrieved from the drive end of the motor at a 12 k sampling rate and was employed in this study. The data contains four different subsets, with each subset representing particular working load conditions from 0 to 3 hp. In addition, each subset consists of four different fault class labels, namely the normal class and three fault classes comprising the bearing-race (BF), inner-race (IF), and outer-race (OF) at the @6:00 centred position relative to the loading zone. The three fault classes also exhibit specific fault sizes of 0.014, 0.007, and 0.021 inches, respectively. The fault sizes were created using electro-discharge machining (EDM), which generated 10 types of classes (one normal class and nine fault classes), as detailed in Table 2, Table 3 and Table 4. Moreover, sliding windows with time-series data overlaps were applied for data augmentation to amplify the number of samples. The corresponding width of the window and the shifting step were 4096 and 290, respectively. Ultimately, each working condition contains 4000 samples, and each sample was denoted as a 4096-D vector. Figure 8 portrays the overall dataset preparation and partitioning. Table 5 shows the number of training and testing datasets for each class.

4.2. Training Methodology and Implementation Details

The proposed CNN model was trained for 10 epochs for each repetition. The input samples were randomly shuffled at the beginning of each learning epoch to ensure efficient network learning performance Subsequently, the standard backpropagation algorithm and the Stochastic Gradient Descent (SGD) optimisation were applied with a momentum and learning rate of 0.9 and 0.01, respectively, to train the proposed CNN model throughout the experiment. Unless stated, the default settings of the open source Pytorch Framework were applied for the network training and were conducted offline. All experiments in this study were carried out using Python version 3.8 on a machine with six Core i7-9750H microprocessors, 16 GB of RAM, and NVIDIA GeForce GTX 1660 Ti with Max-Q design.

5. Results and Discussion

5.1. Performance Evaluation of Different Sampling Points

The frequency spectrum resolution may be enhanced by increasing the total number of sampling points in FFT, where the abscissa is always an integer. Normally, FFT can convert the time-domain signal with a length of N into the frequency-domain signal with a length of N/2 since it is symmetric. For instance, a sample with 4096 points would be sliced to produce a symmetric frequency spectrum with 2048 points. The shift function of the FFT was applied to shift the zero-frequency component of a Fourier transform X to the centre of the array. Figure 9a–c depicts the frequency spectrums of the inner race at 0.007 inches with varying sampling lengths.

The entire 4096 lengths of the frequency spectrum were analysed in this experiment. The sampling lengths of (a) 1024, (b) 2048, and (c) 4096 correspond to their frequency spectrums, which were composed of 1024, 2048, and 4096 points, respectively. The frequency of the k-th point is k × (fs.max/N) Hz, and these points represent various frequency features. It was observed that longer signals could produce high-resolution frequency spectrums using more points, allowing a more complete and accurate expression of the frequency-domain information. Additionally, the increase in sampling length leads to a more precise measure of the frequency features as well as more prominent discrimination between the adjacent points. Figure 10 illustrates the diagnosis results of the 1D-CNN model under varying SNRs, while Figure 11 displays the advantages of the impact of phase on fault detection at varying sampling points.

According to the findings, the accuracy of the developed 1D-CNN model was enhanced when the number of sampling points from the test samples, which were composed of original signals, was increased. When the sampling length was set at 4096, only a small shift in accuracy was recorded as the SNR of the noisy test samples decreased. In addition, the results revealed that feeding the 1D-CNN model with the phase component and magnitude component enhanced the accuracy of the fault detection, as shown in Figure 12. For instance, the use of both magnitude and phase components significantly improved the accuracy of 2048 sampling points compared to that of the 4096 sampling points for fault detection under different noise levels.

Furthermore, the conversion of time-domain signals into frequency-domain was symmetric. Following the FFT, the symmetric frequency spectrum with the length of 2048 was obtained from the sliced sample with 4096 points. Figure 13 depicts the effect of using the whole signal (4096) or symmetrical signal (2048) under different SNRs. Based on the results, the use of the entire signal achieved a greater fault detection compared to taking half of the signal (symmetric). Despite that the fault detection accuracy decreased as the noise increased, the use of the whole signal produced a more stable and less affected performance compared to the use of the half signal. The CNN would be more resilient to noise as a result of the symmetric nature of the input signal.

5.2. Performance Evaluation under Different Working Environments

The extremely complex working environment of mechanical systems in real-world settings can be grouped into two notable variations. First, the vibration signals are affected by noise effortlessly since noise is inevitable in industrial settings. It is also challenging and impractical to diagnose the faults in noisy environments. Second, the working load may occasionally vary relative to the required production, making it unfeasible to gather and label adequate training samples and apply a compatible robust classifier for all types of working loads. Therefore, it is crucial for feature extractors and classifiers that are trained by samples collected from one working load to adapt, learn, and classify invariant domain features. The performance evaluation of the 1D-CNN model under these two scenarios is discussed further in the following section.

5.3. Performance Evaluation under Noisy Environments

For the diagnosis accuracy of the proposed 1D-CNN model, the model was trained using the original data in the presence of the additive WGN to achieve a fixed SNR value (2 or −4) and randomly added to the training set. The model was also trained in the presence of additive WGN at a fixed SNR range (−4 to 2) and randomly added to the training set, as prepared through the CWRU, before being tested with the noisy data. This setting closely resembles the real-world industrial production conditions given the broad range of the noise environment, which makes it impossible to label all the training samples. The original signal of IF was added with the additive WGN, as shown in Figure 14.

The proposed 1D-CNN model was subjected to a range of noise signals from −10 dB to 10 dB. The random addition of WGN to the training set with varying SNRs (2 or −4, or 2 to −4) was performed at a fixed network structure to verify the effect of network training under different noise levels. Table 6 describes the findings of the proposed 1D-CNN model with and without the added WGN to the training set of the diagnosed noise signal.

The results indicate that the increased accuracy corresponded to the increased noise in the training set. For instance, the test accuracy was only 53.625% when the network was trained without noise addition in the training set at −10 dB. However, the test accuracy rose to 72.75% and 96.875% when the network was trained with the randomly added WGN in the training set at −10 dB under SNR of 2 and −4, respectively. Eventually, the test accuracy achieved 97.375% when the network was trained with the randomly added WGN in the training set at −10 dB under an SNR range of 2 to −4. Additionally, the proposed 1D-CNN recorded excellent performance with weak noise levels, where the model could easily obtain over 99% accuracy with an SNR value of greater than −4 dB. Interestingly, the proposed model was able to attain over 97% accuracy even when the SNR value was −10 dB after the network was trained with added WGN. Hence, training the network with WGN was an effective approach to achieving model regularisation, resulting in enhanced model robustness against varying input data.

5.4. Performance Evaluation under Different Load Domains

The adaptation performance of 1D-CNN under different load domains was also evaluated. Table 7 presents the scenario settings for the domain adaptation analysis. The whole signal that included both magnitude and phase components after the FFT conversion was utilised to enhance the accuracy of the proposed model. Figure 15 illustrates the performance comparison between the model using half signals (magnitude only) and the model using whole signal (magnitude and phase).

5.5. Performance Comparison

The rolling element bearing fault diagnosis of the proposed 1D-CNN model in this study was compared to the performance of other models from past studies, which include conventional SVM, MLP, and DNN algorithms, as well as other frequency-domain models, such as WDCNN and TICNN. Since the models were under an increased noise environment, the diagnostic accuracy declined drastically when the SNR increased, most of the previous models were ineffective when the SNR reached −4 dB. Remarkably, the accuracy of the present model was still above 97%, even at an SNR value of −10 dB. [while most of the previous models were ineffective when the SNR reached −4 dB. Remarkably, the accuracy of the present model was still above 97%, even at an SNR value of −10 dB]. Table 8 provides the performance comparison between the proposed model in this study and past models, which were under noisy conditions at an SNR of −10 dB and tested with the CWRU-bearing dataset. Figure 16 visualises the overall comparison performance of each experimental finding.

Figure 16 shows the poor performance of SVM, MLP, and DNN in terms of the domain adaptation analysis in the six scenarios with an average accuracy of 67%, 80% and 78%, respectively. In fact, the WDCNN and TICNN performed worse and never reached 90% and 95.5% accuracy, respectively. Oppositely, the proposed model in this study recorded greater precision than the other algorithms with up to 98.1%. Thus, demonstrating that the model learned from the frequency domain was more domain invariant compared to the conventional frequency features. In addition, the accuracy in each scenario improved by over 97% with high stability. As depicted in Figure 15, the proposed model exhibited a substantially greater classification accuracy when adapting from Domain A to B and B to A as well as from Domain B to C and C to B compared to from Domain A to C and C to A. Furthermore, the results showed that the model experienced greater difficulty to adapt to the new environment when the difference between the target domain and its source domain increased. The proposed model also recorded a high diagnostic accuracy of 98.8% when adapting from Domain B to C, while the accuracy was reduced by 1.5% only when adapting from Domain C to A. Besides the improved diagnosis accuracy, the proposed model achieved better stability compared to other reported models.

Referring to Figure 16 and Table 8, the WDCNN and TICNN models reported a considerable domain adaptation ability although both lacked robustness against noise interference. In contrast, the FDFM and CNN-FDFM models demonstrated robust performance on noise but were considered inappropriate for multiple working conditions. It was suggested that the robustness against noise does not easily complement the domain adaptation ability. Contrary to other models, the proposed model in this study achieved exceptional robustness to noise and concurrently excellent domain adaptation ability. Furthermore, the accuracy values in the comparative performance table demonstrated that the proposed model outperformed most of the previously published rolling element-bearing fault diagnosis models under similar and different noise environments and appears to be on par with best-performing models. Based on the overall findings, the proposed 1D-CNN model may be recommended as an effective rolling element bearing fault diagnosis.

6. Conclusions and Recommendations

This paper investigated the performance of an effective 1D-CNN model for bearing fault classification to address the concern over strong noise interference and diverse working conditions in industrial settings. The two-step algorithm framework was proposed, which involved the frequency-domain signal processing analysis to achieve an in-depth understanding of the signal, followed by the utilisation of two-channel input signals (magnitude and phase) in the 1D-CNN model with injected noise for the training data to generate noisy samples with high SNR under varying working conditions. The CWRU dataset was also utilised to examine the diagnosis performance of the proposed model. Based on the results, the proposed 1D-CNN model demonstrated a superior fault diagnosis capability with 100% accuracy using the normal dataset and 97.3% using the high noisy dataset. As such, the diagnosis accuracy of the proposed model was 43.75% higher compared to CNN without training under noisy conditions with SNR at −10 dB. Given that this study aimed to propose a simple model that can be easily implemented in real-time settings, this study only retrieved the public dataset with the addition of noise. Hence, future studies should consider evaluating the performance of this model using real-time data. Moreover, the extracted features from the time-domain, the frequency-domain, and the time-frequency-domain contained redundant information, which may aggravate the computation cost and lead to the curse of dimensionality. Thus, the development of a robust basic concept in signal processing is highly considered to address this issue and explain the robustness of the suggested model.

Author Contributions

Conceptualisation, M.H., J.I.I.-H., A.N.A. and A.A.B.O.; Methodology, M.H., H.A. and A.A.; Software, M.H., H.A. and A.A.; Formal Analysis, M.H., J.I.I.-H. and A.N.A.; validation, M.H., J.I.I.-H. and A.N.A.; Data Curation, M.H., A.A.B.O. and H.M.G.; Writing—Original Draft Preparation, M.H.; Writing—Review & Editing, M.H. and H.M.G.; Supervision, J.I.I.-H., A.N.A. and A.A.B.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Amar, M.; Gondal, I.; Wilson, C. Vibration spectrum imaging: A novel bearing fault classification approach. IEEE Trans. Ind. Electron. 2015, 62, 494–502. [Google Scholar] [CrossRef]
Gao, S.; Xu, L.; Zhang, Y.; Pei, Z. Rolling bearing fault diagnosis based on SSA optimized self-adaptive DBN. ISA Trans. 2022, in press. [CrossRef]
Amarouayache, I.I.E.; Saadi, M.N.; Guersi, N.; Boutasseta, N. Bearing fault diagnostics using EEMD processing and convolutional neural network methods. Int. J. Adv. Manuf. Technol. 2020, 107, 4077–4095. [Google Scholar] [CrossRef]
Appana, D.K.; Prosvirin, A.; Kim, J.M. Reliable fault diagnosis of bearings with varying rotational speeds using envelope spectrum and convolution neural networks. Soft Comput. 2018, 22, 6719–6729. [Google Scholar] [CrossRef]
Che, C.; Wang, H.; Fu, Q.; Ni, X. Deep transfer learning for rolling bearing fault diagnosis under variable operating conditions. Adv. Mech. Eng. 2019, 11, 1687814019897212. [Google Scholar] [CrossRef] [Green Version]
Shahriar, M.R.; Borghesani, P.; Tan, A.C.C. Electrical Signature Analysis-Based Detection of External Bearing Faults in Electromechanical Drivetrains. IEEE Trans. Ind. Electron. 2018, 65, 5941–5950. [Google Scholar] [CrossRef]
Nguyen, V.C.; Hoang, D.T.; Tran, X.T.; Van, M.; Kang, H.J. A bearing fault diagnosis method using multi-branch deep neural network. Machines 2021, 9, 345. [Google Scholar] [CrossRef]
An, Z.; Li, S.; Wang, J.; Jiang, X. A novel bearing intelligent fault diagnosis framework under time-varying working conditions using recurrent neural network. ISA Trans. 2020, 100, 155–170. [Google Scholar] [CrossRef]
An, J.; Ai, P.; Liu, D. Deep Domain Adaptation Model for Bearing Fault Diagnosis with Riemann Metric Correlation Alignment. Math. Probl. Eng. 2020, 2020, 4302184. [Google Scholar] [CrossRef] [Green Version]
An, Z.; Li, S.; Wang, J.; Xin, Y.; Xu, K. Generalization of deep neural network for bearing fault diagnosis under different working conditions using multiple kernel method. Neurocomputing 2019, 352, 42–53. [Google Scholar] [CrossRef]
Tang, H.; Gao, S.; Wang, L.; Li, X.; Li, B.; Pang, S. A novel intelligent fault diagnosis method for rolling bearings based on wasserstein generative adversarial network and convolutional neural network under unbalanced dataset. Sensors 2021, 21, 6754. [Google Scholar] [CrossRef]
Cheng, Y.; Lin, M.; Wu, J.; Zhu, H.; Shao, X. Intelligent fault diagnosis of rotating machinery based on continuous wavelet transform-local binary convolutional neural network. Knowl.-Based Syst. 2021, 216, 106796. [Google Scholar] [CrossRef]
Bao, Y.; Tang, Z.; Li, H.; Zhang, Y. Computer vision and deep learning–based data anomaly detection method for structural health monitoring. Struct. Heal. Monit. 2019, 18, 401–421. [Google Scholar] [CrossRef]
Wang, X.; Zhang, X.; Shahzad, M.M. A novel structural damage identification scheme based on deep learning framework. Structures 2021, 29, 1537–1549. [Google Scholar] [CrossRef]
Liu, T.; Xu, H.; Ragulskis, M.; Cao, M.; Ostachowicz, W. A data-driven damage identification framework based on transmissibility function datasets and one-dimensional convolutional neural networks: Verification on a structural health monitoring benchmark structure. Sensors 2020, 20, 1059. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zhao, Y.; Kok Foong, L. Predicting electrical power output of combined cycle power plants using a novel artificial neural network optimized by electrostatic discharge algorithm. Meas. J. Int. Meas. Confed. 2022, 198, 111405. [Google Scholar] [CrossRef]
Du, Y.; Wang, A.; Wang, S.; He, B.; Meng, G. Fault Diagnosis under Variable Working Conditions Based on STFT and Transfer Deep Residual Network. Shock Vib. 2020, 2020, 1274380. [Google Scholar] [CrossRef]
Yao, D.; Liu, H.; Yang, J.; Li, X. A lightweight neural network with strong robustness for bearing fault diagnosis. Meas. J. Int. Meas. Confed. 2020, 159, 107756. [Google Scholar] [CrossRef]
Gao, Y.; Liu, X.; Xiang, J. FEM Simulation-Based Generative Adversarial Networks to Detect Bearing Faults. IEEE Trans. Ind. Inform. 2020, 16, 4961–4971. [Google Scholar] [CrossRef]
Mao, W.; Feng, W.; Liang, X. A novel deep output kernel learning method for bearing fault structural diagnosis. Mech. Syst. Signal Process. 2019, 117, 293–318. [Google Scholar] [CrossRef]
Wang, Z.; Zhang, Q.; Xiong, J.; Xiao, M.; Sun, G.; He, J. Fault Diagnosis of a Rolling Bearing Using Wavelet Packet Denoising and Random Forests. IEEE Sens. J. 2017, 17, 5581–5588. [Google Scholar] [CrossRef]
Li, Y.; Xu, M.; Huang, W.; Zuo, M.J.; Liu, L. An improved EMD method for fault diagnosis of rolling bearing. In Proceedings of the 2016 Prognostics and System Health Management Conference (PHM-Chengdu), Chengdu, China, 19–21 October 2016; pp. 1–5. [Google Scholar]
Guo, T.; Deng, Z. An improved EMD method based on the multi-objective optimization and its application to fault feature extraction of rolling bearing. Appl. Acoust. 2017, 127, 46–62. [Google Scholar] [CrossRef]
Neupane, D.; Seok, J. Bearing fault detection and diagnosis using case western reserve university dataset with deep learning approaches: A review. IEEE Access 2020, 8, 93155–93178. [Google Scholar] [CrossRef]
Fu, W.; Tan, J.; Xu, Y.; Wang, K. Fine-Sorted Dispersion Entropy and SVM Optimized with Mutation SCA-PSO. Entropy 2019, 21, 404. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Tian, Y.; Ma, J.; Lu, C.; Wang, Z. Rolling bearing fault diagnosis under variable conditions using LMD-SVD and extreme learning machine. Mech. Mach. Theory 2015, 90, 175–186. [Google Scholar] [CrossRef]
Dong, S.; Luo, T.; Zhong, L.; Chen, L.; Xu, X. Fault diagnosis of bearing based on the kernel principal component analysis and optimized k-nearest neighbour model. J. Low Freq. Noise Vib. Act. Control 2017, 36, 354–365. [Google Scholar] [CrossRef] [Green Version]
Yuan, H.; Wang, X.; Sun, X.; Ju, Z. Compressive sensing-based feature extraction for bearing fault diagnosis using a heuristic neural network. Meas. Sci. Technol. 2017, 28, 065018. [Google Scholar] [CrossRef]
Goyal, D.; Dhami, S.S.; Pabla, B.S. Non-Contact Fault Diagnosis of Bearings in Machine Learning Environment. IEEE Sens. J. 2020, 20, 4816–4823. [Google Scholar] [CrossRef]
Zhang, T.; Liu, S.; Wei, Y.; Zhang, H. A novel feature adaptive extraction method based on deep learning for bearing fault diagnosis. Meas. J. Int. Meas. Confed. 2021, 185, 110030. [Google Scholar] [CrossRef]
He, J.; Ouyang, M.; Yong, C.; Chen, D.; Guo, J.; Zhou, Y. A Novel Intelligent Fault Diagnosis Method for Rolling Bearing Based on Integrated Weight Strategy Features Learning. Sensors 2020, 20, 1774. [Google Scholar] [CrossRef] [Green Version]
Chen, H.Y.; Lee, C.H. Vibration Signals Analysis by Explainable Artificial Intelligence (XAI) Approach: Application on Bearing Faults Diagnosis. IEEE Access 2020, 8, 134246–134256. [Google Scholar] [CrossRef]
Che, C.; Wang, H.; Ni, X.; Fu, Q. Intelligent fault diagnosis method of rolling bearing based on stacked denoising autoencoder and convolutional neural network. Ind. Lubr. Tribol. 2020, 72, 947–953. [Google Scholar] [CrossRef]
Jia, F.; Lei, Y.; Lin, J.; Zhou, X.; Lu, N. Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mech. Syst. Signal Process. 2016, 72–73, 303–315. [Google Scholar] [CrossRef]
Guo, L.; Li, N.; Jia, F.; Lei, Y.; Lin, J. A recurrent neural network based health indicator for remaining useful life prediction of bearings. Neurocomputing 2017, 240, 98–109. [Google Scholar] [CrossRef]
Shao, H.; Jiang, H.; Li, X.; Liang, T. Rolling bearing fault detection using continuous deep belief network with locally linear embedding. Comput. Ind. 2018, 96, 27–39. [Google Scholar] [CrossRef]
Gao, Y.; Liu, X.; Huang, H.; Xiang, J. A hybrid of FEM simulations and generative adversarial networks to classify faults in rotor-bearing systems. ISA Trans. 2021, 108, 356–366. [Google Scholar] [CrossRef]
Chen, C.C.; Liu, Z.; Yang, G.; Wu, C.C.; Ye, Q. An improved fault diagnosis using 1d-convolutional neural network model. Electronics 2021, 10, 59. [Google Scholar] [CrossRef]
Li, S.; Liu, G.; Tang, X.; Lu, J.; Hu, J. An ensemble deep convolutional neural network model with improved D-S evidence fusion for bearing fault diagnosis. Sensors 2017, 17, 1729. [Google Scholar] [CrossRef] [Green Version]
Wanlu, J.; Chenyang, W.; Jiayun, Z.; Shuqing, Z. Application of Deep Learning in Fault Diagnosis of Rotating Machinery. Processes 2021, 9, 919. [Google Scholar]
Zilong, Z.; Wei, Q. Intelligent fault diagnosis of rolling bearing using one-dimensional multi-scale deep convolutional neural network based health state classification. In Proceedings of the 2018 IEEE 15th International Conference on Networking, Sensing and Control (ICNSC), Zhuhai, China, 27–29 March 2018; pp. 1–6. [Google Scholar]
Ambrożkiewicz, B.; Syta, A.; Gassner, A.; Georgiadis, A.; Litak, G.; Meier, N. The influence of the radial internal clearance on the dynamic response of self-aligning ball bearings. Mech. Syst. Signal Process. 2022, 171, 108954. [Google Scholar] [CrossRef]
Zhou, X.; Mao, S.; Li, M. A novel anti-noise fault diagnosis approach for rolling bearings based on convolutional neural network fusing frequency domain feature matching algorithm. Sensors 2021, 21, 5532. [Google Scholar] [CrossRef] [PubMed]
Ji, M.; Peng, G.; He, J.; Liu, S.; Chen, Z.; Li, S. A Two-Stage, Intelligent Bearing-Fault-Diagnosis Method Using Order-Tracking and a One-Dimensional Convolutional Neural Network with Variable Speeds. Sensors 2021, 21, 675. [Google Scholar] [CrossRef]
Pham, M.T.; Kim, J.M.; Kim, C.H. Efficient fault diagnosis of rolling bearings using neural network architecture search and sharing weights. IEEE Access 2021, 9, 98800–98811. [Google Scholar] [CrossRef]
Liu, X.; Liu, H.; Yang, J.; Litak, G.; Cheng, G.; Han, S. Improving the bearing fault diagnosis efficiency by the adaptive stochastic resonance in a new nonlinear system. Mech. Syst. Signal Process. 2017, 96, 58–76. [Google Scholar] [CrossRef]
Zhang, W.; Li, C.; Peng, G.; Chen, Y.; Zhang, Z. A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load. Mech. Syst. Signal Process. 2018, 100, 439–453. [Google Scholar] [CrossRef]
Zhang, W.; Peng, G.; Li, C.; Chen, Y.; Zhang, Z. A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors 2017, 17, 425. [Google Scholar] [CrossRef] [PubMed]
Huang, W.; Cheng, J.; Yang, Y.; Guo, G. An improved deep convolutional neural network with multi-scale information for bearing fault diagnosis. Neurocomputing 2019, 359, 77–92. [Google Scholar] [CrossRef]
Jin, Z.; Han, Q.; Zhang, K.; Zhang, Y. An intelligent fault diagnosis method of rolling bearings based on Welch power spectrum transformation with radial basis function neural network. JVC/J. Vib. Control 2020, 26, 629–642. [Google Scholar] [CrossRef]
Zhang, A.; Li, S.; Cui, Y.; Yang, W.; Dong, R.; Hu, J. Limited data rolling bearing fault diagnosis with few-shot learning. IEEE Access 2019, 7, 110895–110904. [Google Scholar] [CrossRef]
Li, M.; Wei, Q.; Wang, H.; Zhang, X. Research on fault diagnosis of time-domain vibration signal based on convolutional neural networks. Syst. Sci. Control Eng. 2019, 7, 73–81. [Google Scholar] [CrossRef] [Green Version]
Kim, B.S.; Lee, S.H.; Lee, M.G.; Ni, J.; Song, J.Y.; Lee, C.W. A comparative study on damage detection in speed-up and coast-down process of grinding spindle-typed rotor-bearing system. J. Mater. Process. Technol. 2007, 187–188, 30–36. [Google Scholar] [CrossRef]
Hasan, M.J.; Manjurul Islam, M.M.; Kim, J.M. Bearing fault diagnosis using multidomain fusion-based vibration imaging and multitask learning. Sensors 2022, 22, 56. [Google Scholar] [CrossRef]
He, Z.; Rakin, A.S.; Fan, D. Parametric noise injection: Trainable randomness to improve deep neural network robustness against adversarial attack. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 588–597. [Google Scholar]
Gupta, P.; Pradhan, M.K. Fault detection analysis in rolling element bearing: A review. Mater. Today Proc. 2017, 4, 2085–2094. [Google Scholar] [CrossRef]
Mushtaq, S.; Manjurul Islam, M.M.; Sohaib, M. Deep learning aided data-driven fault diagnosis of rotatory machine: A comprehensive review. Energies 2021, 14, 5150. [Google Scholar] [CrossRef]

Figure 1. Overall design of the One-Dimensional Convolutional Neural Network (1D-CNN).

Figure 2. Schematic flowchart of the proposed methodology in this study.

Figure 3. The procedure for the additive WGN injection.

Figure 4. The composite noisy signal at SNR = 0 dB from the original signal of the inner race fault and the additive WGN.

Figure 5. The 10 fault types of the original and noisy vibration signals based on the respective domain: (a) Orginal signal in time-domain, (b) Noise signal in time-domain, (c) Orginal signal in frequency-domain, (d) Noise signal in frequency-domain, (e) Combined signal (noisy signal) in time-domain, and (f) Combined signal (noisy signal) in frequency-domain.

Figure 6. Phasor notation separating the magnitude and phase in the frequency domain.

Figure 7. The proposed 1D-CNN structure.

Figure 8. (a) CWRU bearing experimental platform, (b) Data augment with overlap, and (c) Vibration signal expansion mode.

Figure 9. The frequency spectrums of IF at 0.007 inches with varying sampling lengths: (a) 1024 points, (b) 2048 points, and (c) 4096 points.

Figure 10. Effect of applying the phase component at various sampling points under varying SNRs.

Figure 11. Advantage of applying the phase component on test accuracy at (a) 1024 points, (b) 2048 points, and (c) 4096 points.

Figure 12. The impact of magnitude and phase components on the improved accuracy under different SNRs with a reduced number of samples.

Figure 13. Impact of utilising whole and symmetrical signals on the fault detection under varying SNRs.

Figure 14. Effect of training 1D-CNN under different SNR ratios.

Figure 15. Effect of model performance using half signal and whole signal on the domain adaptation accuracy.

Figure 16. Visual comparison of the proposed 1D-CNN model of six domain shifts on Dataset A, B and C compared to recently published SVM, MLP, DNN, WDCNN, and TICNN models.

Table 1. The description of the proposed 1D-CNN.

Layer	Type	Kernel Size	In/Out Channels	Stride	Padding
I0	Input	-	-	-	-
C1	Conv	16	2/20	4	No
C2	Conv	8	20/50	4	No
P1	AdaptiveAvgPool1d	Adaptive	50/50	-	-
FC	Fully connected	1	50/10	-	-

Table 2. Dataset class description.

Motor Load (Hp)	Shaft Speed (RPM)	Bearing Fault (inch)			Inner Fault (inch)			Outer Fault (inch)
0	1797	0.007	0.014	0.021	0.007	0.014	0.021	0.007	0.014	0.021
1	1772
2	1750
3	1720

Table 3. The information on the rolling bearing’s structure.

No. of Rolling Elements	Ball Diameter	Outside Diameter	Inside Diameter	Thickness	Contact Angle	Pitch Diameter
9	0.3126 in.	2.0472 in.	0.9843 in.	0.5906 in.	0°	1.537 in.

Table 4. Characteristics of defect frequencies for bearing (6205-2RS JEM SKF).

The Frequencies Characteristic	Formula	Fault Frequencies [Hz]
Outer-race ball pass frequency (BPFO)	$BPFO = \frac{n f_{r}}{2}$ $(1 - \frac{d}{D} c o s α)$	3.5848
Inner-race ball pass frequency (BPFI)	$BPFI = \frac{n f_{r}}{2}$ $(1 + \frac{d}{D} c o s α)$	5.4152
Ball (roller) spin frequency(BSF)	$BSF = \frac{D f_{r}}{2 d}$ $(1 - {[\frac{d}{D} c o s α]}^{2})$	4.7135
Fundamental train frequency(FTF)	$FTF = \frac{f_{r}}{2}$ $(1 - \frac{d}{D} c o s α)$	0.39828

Here, n, D, d,

f_{r}

, α indicate the number of rolling elements, the bearing pitch diameter, the rolling element diameter, running speed in Hz and the angle of the load from the radial plane, respectively.

Table 5. Numbers of training and testing datasets.

Fault Location	Normal		RF			IF			OF
Category labels	0	1	2	3	4	5	6	7	8	9
Fault diameter (inch)	0	0.007	0.014	0.021	0.007	0.014	0.021	0.007	0.014	0.021
Working condition Train	320	320	320	320	320	320	320	320	320	320
(0 HP) Test	80	80	80	80	80	80	80	80	80	80
Working condition Train	320	320	320	320	320	320	320	320	320	320
(1 HP) Test	80	80	80	80	80	80	80	80	80	80
Working condition Train	320	320	320	320	320	320	320	320	320	320
(2 HP) Test	80	80	80	80	80	80	80	80	80	80
Working condition Train	320	320	320	320	320	320	320	320	320	320
(3 HP) Test	80	80	80	80	80	80	80	80	80	80

Table 6. CNN training with different SNR ratios.

SNR	CNN	CNN with Fixed SNR (2)	CNN with Fixed SNR (−4)	CNN with Random SNR (−4~2)
−10	53.62	72.75	96.87	97.37
−8	76.37	80.12	98.62	98.87
−6	84.87	86.62	99.37	99.37
−4	92.62	94.12	99.62	99.37
−2	95.37	96.87	99.50	99.75
0	97.12	98.12	99.37	99.87
2	97.87	99.37	99.37	99.75
4	98.37	99.00	99.62	100
6	98.12	99.12	99.87	100
8	98.75	99.12	100	100
10	98.75	99.37	100	100

Table 7. Scenario settings for the domain adaptation analysis.

Domain Type	Source Domain	Target Domain
Description	Labelled signals under one single load	Unlabelled signals under other loads
Domain details	Training set:	Test set:
	A	B	C
	B	C	A
	C	A	B
Target	Diagnose unlabelled vibration signals in the target domain

Table 8. Comparison of accuracy between the proposed 1D-CNN model and past studies under varying SNR values.

Accuracy (%)	SNR											Ref.
Accuracy (%)	−10	−8	−6	−4	−2	0	2	4	6	8	10
WDCNN	-	-	-	66.95	80.81	90.51	97.52	99.23	99.77	99.83	99.87	[40]
WDCNN(AdaBN)	-	-	-	92.65	97.04	98.77	99.57	99.70	99.83	99.89	99.93	[40]
TICNN	-	-	-	82.05	96.47	98.22	99.27	99.61	99.59	99.75	99.63	[39]
W-RBFNN	-	-	-	79.50	88.48	94.25	96.72	98.35	99.45	99.40	99.76	[42]
SIRCNN	-	-	96.2	99.1	99.7	100	100	100	100	100	100	[12]
FDFM	87.77	92.57	93.9	94.57	95.57	96.33	96	96.13	96.4	96.1	96.87	[36]
CNN-FDFM	93.33	96.73	99.2	99.3	99.6	99.33	99.77	99.7	9987	99.93	99.6	[36]
This study	97.37	98.87	99.37	99.37	99.75	99.87	99.75	100	100	100	100

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hakim, M.; Omran, A.A.B.; Inayat-Hussain, J.I.; Ahmed, A.N.; Abdellatef, H.; Abdellatif, A.; Gheni, H.M. Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain. Sensors 2022, 22, 5793. https://doi.org/10.3390/s22155793

AMA Style

Hakim M, Omran AAB, Inayat-Hussain JI, Ahmed AN, Abdellatef H, Abdellatif A, Gheni HM. Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain. Sensors. 2022; 22(15):5793. https://doi.org/10.3390/s22155793

Chicago/Turabian Style

Hakim, Mohammed, Abdoulhadi A. Borhana Omran, Jawaid I. Inayat-Hussain, Ali Najah Ahmed, Hamdan Abdellatef, Abdallah Abdellatif, and Hassan Muwafaq Gheni. 2022. "Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain" Sensors 22, no. 15: 5793. https://doi.org/10.3390/s22155793

APA Style

Hakim, M., Omran, A. A. B., Inayat-Hussain, J. I., Ahmed, A. N., Abdellatef, H., Abdellatif, A., & Gheni, H. M. (2022). Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain. Sensors, 22(15), 5793. https://doi.org/10.3390/s22155793

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bearing Fault Diagnosis Using Lightweight and Robust One-Dimensional Convolution Neural Network in the Frequency Domain

Abstract

1. Introduction

2. Background and Related Studies

2.1. Related Studies

2.2. Convolutional Neural Network (CNN)

2.3. Fast Fourier Transform (FFT)

3. Methodology

3.1. Robustness Improvement with Noise Injection

3.2. Frequency-Domain

3.3. Development of the 1D-CNN Model

4. Experimental Setup

4.1. Dataset Preparation and Partitioning

4.2. Training Methodology and Implementation Details

5. Results and Discussion

5.1. Performance Evaluation of Different Sampling Points

5.2. Performance Evaluation under Different Working Environments

5.3. Performance Evaluation under Noisy Environments

5.4. Performance Evaluation under Different Load Domains

5.5. Performance Comparison

6. Conclusions and Recommendations

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI