Next Article in Journal
A Practical Method for Blind Pixel Detection for the Push-Broom Thermal-Infrared Hyperspectral Imager
Previous Article in Journal
Towards a Policy Development Methodology for Human-Centred IoT Collectives
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Communication

Deep Learning-Based Remaining Useful Life Estimation of Bearings with Time-Frequency Information

1
School of Instrumentation Science and Engineering, Harbin Institute of Techonoloy, Harbin 150001, China
2
Shanghai Spaceflight Precision Machinery Institute, Shanghai 201600, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(19), 7402; https://doi.org/10.3390/s22197402
Submission received: 1 September 2022 / Revised: 17 September 2022 / Accepted: 22 September 2022 / Published: 29 September 2022
(This article belongs to the Section Industrial Sensors)

Abstract

:
In modern industrial production, the prediction ability of remaining useful life of bearings directly affects the safety and stability of the system. Traditional methods require rigorous physical modeling and perform poorly for complex systems. In this paper, an end-to-end remaining useful life prediction method is proposed, which uses short-time Fourier transform (STFT) as preprocessing. Considering the time correlation of signal sequences, a long and short-term memory network is designed in CNN, incorporating the convolutional block attention module, and understanding the decision-making process of the network from the interpretability level. Experiments were carried out on the 2012PHM dataset and compared with other methods, and the results proved the effectiveness of the method.

1. Introduction

In the operation of equipment, a large number of faults are caused by bearing failure. The prediction of remaining useful life (RUL) of bearings has become a key technology to ensure mechanical work safety. Existing bearing RUL prediction methods include two types, statistical life model prediction and data-driven prediction. Model-based methods include particle filtering [1], Eyring model [2], Weibull distribution [3], etc. These methods need a large number of statistical data as the basis to have certain reliability, but it is difficult to establish an accurate and general mathematical model for complex equipment.
The data-driven method uses the end-to-end training strategy and uses the state monitoring data to predict. A. Soualhi et al. proposed a rolling bearing condition monitoring method combining Hilbert–Huang transform, support vector machine, and support vector regression [4]. S.A. Aye et al. proposed an optimal Gaussian process regression (GPR) for low-speed bearing RUL [5]. Zhang Z. et al. converted these signals into the frequency domain using wavelet packet decomposition and fast Fourier transform and trained artificial neural network (ANN) [6].
In the field of CNN, G.S. Babu et al. and L. Ren et al., respectively, used fast Fourier transform and wavelet transform as pretreatments and used CNN to predict RUL [7,8]. Zhu et al. used the original vibration signal as the input of CNN for training and testing [9]. A.Z. Hinchi et al. used CNN and LSTM to predict RUL [10]. Biao Wang et al. made use of information by feedback connection that feeds back the output of the convolutional layer as input [11]. Xiang Li Wei et al. adopted the convolutional neural network of multi-scale feature fusion to gradually extract high-level features and discard low-level features at the same time. Features of different levels are fused, which improves the characteristic of one-way step-by-step transmission of feature information in the traditional convolutional network [12]. The difference between this paper and other methods is that the time-frequency diagram of vibration signals is taken as the network input, and CNN integrates LSTM and CBAM.
Focusing on RUL prediction, STFT pretreatment attention and memory-based CNN (SAL-CNN) is proposed in this paper. Based on CNN, the short-time Fourier transform (STFT) is used as the vibration signal preprocessing, and long short-term memory (LSTM) is integrated to make up for the fact that traditional CNN cannot consider the time correlation between vibration signal sequences. Moreover, convolutional block attention module (CBAM) is used to achieve the accurate acquisition of fault features and improve the accuracy of network life prediction.
The main contributions of this paper are as follows: Presenting a CNN framework (SAL-CNN), which takes STFT transformation as pretreatment and integrates CBAM and LSTM; visualizing the output of CBAM to explain the framework; discussing the relationship between RUL and frequency; and enhancing experimental credibility.
This paper introduces the basic theory about LSTM and CBAM in Section 2, the preprocessing algorithm and network SAL-CNN are detailed in Section 3, the experimental situation and comparative analysis are introduced in Section 4, and the interpretability of the network is subsequently studied. Finally, the summary is presented in Section 5.

2. Related Work

2.1. Long Short-Term Memory

The recurrent neural network (RNN) [13] takes both the output characteristics of the last moment and data of the current moment as input. However, gradient disappearance and gradient explosion will occur with the increase of cycle layers. LSTM [14] is a special RNN, it introduces the concepts of gating unit and cell state. The input gate controls the output information from the previous layer, and the memory information from the previous moment is controlled by the forgetting gate. The most common LSTM structures are shown in Figure 1.
The expression of the forgetting gate is:
f t = σ W f · h t 1 , x t + b f
The expression of the input gate is:
i t = σ W i · h t 1 , x t + b i
C t ˜ = t a n h W C · h t 1 , x t + b C
The expression of the output gate is:
o t = σ W o · h t 1 , x t + b o
h t = o t · t a n h C t . ,
where W f , W i , W c , and W o represent the weight matrix, respectively, σ represents the sigmoid activation function, and acts on after passing through the forgetting gate and the input gate. The expression is:
C t = f t · C t 1 + i t · C ˜ t 1 .

2.2. Convolutional Block Attention Module

The convolutional block attention module (CBAM) [15] is a simple and effective injection module for feedforward convolutional neural network; its structure is shown in Figure 2. As a lightweight module, CBAM can be placed behind any feature graph as required. For feature map F R c · h · w of an intermediate layer, CBAM will deduce the 1-dimensional channel attention map M c R c · 1 · 1 and 2-dimensional spatial attention map M s R 1 · H · W , as shown below:
F = M c ( F ) F
F = M s ( F ) F ,
where ⊗ is element-wise multiplication, and the expressions of the channel attention module Mc(F) and space attention module Ms(F) are as follows:
M C F = σ MLP AvgPool F + MLP MaxPool F
M S F = σ f AvgPool F , MaxPool F ,
where σ is Sigmoid activation function, and f is convolution operation.

3. Framework

Aiming at the problems that existing models cannot accurately predict RUL and work in real-time, an end-to-end detection framework, SAL-CNN, is proposed. The flow diagram is shown in Figure 3. The method includes two parts: vibration signal preprocessing and convolution neural network. CNN is designed to integrate the convolutional block attention module and long and short term memory network.

3.1. Vibration Signal Preprocessing

Since vibration signals have many spectral components and are non-stationary signals, the short-time Fourier transform [16] is used to preprocess the input data to obtain the time-frequency graph diagram. The short-time Fourier transform is a kind of joint time-frequency analysis method. Based on the Fourier transform, the whole segment of signal is processed by window segmentation, and each small segment is considered as a stationary signal. The window function slips along the time axis, and the Fourier transform is carried out in the neighborhood at any time.
For non-stationary signals x t L 2 R , the short-time Fourier transform of x t is expressed as:
S T F T x t , f = x τ h τ t e j 2 π f t d τ ,
where h τ t is the window function.
Take the Fourier transform of h t : the energy is concentrated at the low frequency range, so it is usually thought of as a low-pass filter. As can be seen from Equation (11), the window function h t moves along the time axis and conducts segmented processing on h t in STFT transformation, the expression is:
x t τ = x τ h τ t
Later, take the Fourier transform of x t τ .
After windowing the signal, all the features of the signal covered by the window function will be displayed. The selection of the window function directly affects the results after STFT transformation. The frequency resolution is mainly determined by the main lobe width, so the window function with the narrowest main lobe and the smallest sidelobe peak value should be selected. The common window functions are rectangular window, triangular window, Hamming window, Blackman window, etc. and hamming window is selected as the window function comprehensively.
Figure 4 shows the vibration signal of bearing 1-1 in the whole life cycle and the time-frequency diagram after STFT transformation in the initial, middle, and final stages. The first row is the vibration signal diagram and the second row is the time-frequency diagram after STFT transformation. The X-axis represents the time axis, and the Y-axis represents the vibration frequency. The more brightly colored part represents the more concentrated energy of the vibration frequency in the same time series. The third row is a 3D display of the time-frequency graph, where the z-axis represents the frequency occurrence times. As can be seen from the figure, with the degradation of bearings and the increase of vibration signal amplitude, signals gradually concentrated in the low-frequency range. By this time, the high-frequency part was mainly an external noise signal, which made little contribution to life prediction. Therefore, the frequency of the upper part of the vibration signal was filtered out, and the remaining time-frequency information was used to predict RUL.

3.2. Network Model Structure

Previous methods make use of only time-domain signals, which can increase the learning difficulty of deep learning models. Inspired by this, we propose to use the short-time Fourier transform to preprocess the input data, due to the fact that the short-time Fourier transform can provide information in both the time and frequency domains. It is worth mentioning that the fault information is often more obvious in the frequency domain. To be able to better extract temporal and spatial information, we add an attention mechanism module and an LSTM module to the ordinary convolutional neural network, which will enable the network to better extract these two types of information. Figure 5 shows the specific structure of our method.
Firstly, the vibration signal is preprocessed by STFT transformation (the size of the time-frequency diagram is 11 × 129 after preprocessing), enters into a six-layer convolution operation, and then into CBAM. After that, a one-layer convolution operation with a convolution kernel is used to compress the information, followed by LSTM. The last layer is a full connection layer and outputs a predicted life value; Table 1 describes specific network parameters.

4. Experiments and Discussions

4.1. Data Sets and Evaluation Indicators

In order to verify the effectiveness of the method, the data set of the 2012PHM Challenge [17] was used as the validation data set. The data set was provided by the FEMTO-ST research institute in France and was obtained by building an experimental platform in a laboratory environment [18]. Three working conditions were used for the experiments: (1) rotation speed:1800 r/min, radial force: 4000 N; (2) rotation speed: 1650 r/min, radial force: 4200 N; (3) rotation speed: 1500 r/min, radial force: 5000 N. The experimental platform collects vibration signals of the whole life cycle of bearings, and a total of 17 bearings are tested and their vibration signals collected. Table 2 shows the distribution of bearings.
In order to quantitatively evaluate the network structure, MAE is used as the evaluation index. As shown by expression 13, the more accurate the prediction results are, the lower MAE is:
M A E = 1 s i = 1 s ActRUL i PreRUL i ,
where S is the number of testing samples, ActRUL i is the actual RUL values corresponding to the i-th testing sample, and PreRUL i is the predictive RUL values corresponding to it.

4.2. Experimental Setup

All bearings were used as prediction bearings in turn, and the remaining 16 bearings were used as training sets in each experiment to obtain cross-validation results.
The training set is input into SAL-CNN for training, SAL-CNN is implemented by PyTorch with Nvidia GeForce GTX 2080ti GPU, the iteration training is terminated after 150 epochs. The loss function is L1 loss and the optimizer is Adam, the learning rate is set to 0.001 and batch size is 32. The dropout rate is 0.1. The number of LSTM cycle layers is 1.

4.3. Analysis

Figure 6 shows the prediction results of selected bearings, which is under different work conditions. The red curve is the real RUL and the yellow curve is the predicted RUL. It can be seen that the yellow curve basically fits the true value.
This paper conducted comparative experiments with four methods on the 2012PHM Challenge data set. Table 3 shows MAE indexes compared with other methods. DNN is also known as multi-layer perceptron (MLP), single scale-low (SSL), and single scale-high (SSH) using low-scale features and high-level features, respectively, for RUL estimation without feature concatenation, and a multi-scale method integrating an all scale feature [12].
The comparison result indicates that the MAE obtained by the proposed algorithm is higher than others. Among the 17 bearings, the prediction results of 11 bearings are the best, and 2/3 of the results are better and evenly distributed under each working condition, indicating that this method is effective for bearings under three working conditions. According to the average value of the final MAE, it can be seen that the results in this paper are improved by 2.8 compared with the multi-scale method.

4.4. Feature Information Visualization

Feature visualization is a post-hoc interpretation of the pre-training model. After training, the decision-making process of CNN is explained in the form of images, which can intuitively reflect the key areas of input features that the network focuses on.
Table 4 shows the time-frequency graph and heatmap of bearings 1-1, 1-3, 1-5 in the early, middle, and late stages. In time-frequency graph, the ordinate is the frequency and the value range is [0, 6400], and the abscissa is time and the value range is [0, 1]. In the trained network it is possible to visualize the output of the CBAM layer and obtain aheatmap where the highlighted parts represent the areas that the network is more concerned with. As time goes by, the frequency concentrated range in the time-frequency graph is from 3000–4000 Hz to 0–2000 Hz, and the high-frequency part is basically no energy. It can also be seen from the heatmap that the network pays more attention to the information of the low-frequency part in the late stage, it means low-frequency signal may become more import.

5. Summary

This paper proposes a long and short-term memory convolutional neural network that introduces a convolutional block attention module. The improved algorithm makes up for the lack of traditional convolutional networks that cannot consider the time correlation of vibration signal sequences. At the same time, the convolutional block attention module further improves the prediction performance of the network. The test on the PHM2012 data set proves that the average MAE is slightly higher than the best recent results.
In addition, we found that for predicting the RUL task, low-frequency vibration signals may be more worthy of attention at a later stage; the time-frequency graph directly shows that most of the energy is distributed in the low-frequency range at a later stage. Using heatmap to visualize the output of CBAM, we also found that the network at the later stage pays more attention to the low-frequency range, and interpretability confirms this conjecture.
In the future, we will explore the use of multimodal information to leverage both time and frequency domain information to further improve the prediction of bearing life. In addition, we believe that the study of deep learning interpretability to obtain human-understandable knowledge may provide more insight into the field.

Author Contributions

Conceptualization, B.L. (Bingguo Liu) and B.L. (Binghui Lu); methodology B.L. (Binghui Lu) and H.D.; software, Z.G.; validation, B.L. (Bingguo Liu), Z.G. and Z.A.; formal analysis, H.D.; investigation, Z.A.; resources, B.L. (Bingguo Liu); data curation, Z.G.; writing—original draft preparation, B.L. (Bingguo Liu) and Z.G.; writing—review and editing, B.L. (Binghui Lu) and H.D.; visualization, Z.G.; supervision, B.L. (Bingguo Liu); project administration, B.L. (Binghui Lu). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jouin, M.; Gouriveau, R.; Hissel, D.; Péra, M.C.; Zerhouni, N. Particle filter-based prognostics: Review, discussion and perspectives. Mech. Syst. Signal Process. 2016, 72, 2–31. [Google Scholar] [CrossRef]
  2. Jouin, M.; Gouriveau, R.; Hissel, D.; Pera, M.C.; Zerhouni, N. Degradations analysis and aging modeling for health assessment and prognostics of PEMFC. Reliab. Eng. Syst. Saf. 2016, 148, 78–95. [Google Scholar] [CrossRef]
  3. Ali, J.B.; Chebel-Morello, B.; Saidi, L.; Malinowski, S.; Fnaiech, F. Accurate bearing remaining useful life prediction based on Weibull distribution and artificial neural network. Mech. Syst. Signal Process. 2015, 56–57, 150–172. [Google Scholar]
  4. Soualhi, A.; Medjaher, K.; Zerhouni, N. Bearing Health monitoring based on Hilbert-Huang Transform, Support Vector Machine and Regression. IEEE Trans. Instrum. Meas. 2014, 64, 52–62. [Google Scholar] [CrossRef]
  5. Aye, S.A.; Heyns, P.S. An integrated Gaussian process regression for prediction of remaining useful life of slow speed bearings based on acoustic emission. Mech. Syst. Signal Process. 2017, 84, 485–498. [Google Scholar] [CrossRef]
  6. Zhang, Z.; Wang, Y.; Wang, K. Fault diagnosis and prognosis using wavelet packet decomposition, Fourier transform and artificial neural network. J. Intell. Manuf. 2013, 24, 1213–1227. [Google Scholar] [CrossRef]
  7. Babu, G.S.; Zhao, P.; Li, X.L. Deep Convolutional Neural Network Based Regression Approach for Estimation of Remaining Useful Life. In Proceedings of the International Conference on Database Systems for Advanced Applications, Dallas, TX, USA, 16–19 April 2016. [Google Scholar]
  8. Ren, L.; Sun, Y.; Wang, H.; Zhang, L. Prediction of Bearing Remaining Useful Life with Deep Convolution Neural Network. IEEE Access 2018, 6, 13041–13049. [Google Scholar] [CrossRef]
  9. Zhu, J.; Nan, C.; Peng, W. Estimation of Bearing Remaining Useful Life based on Multiscale Convolutional Neural Network. IEEE Trans. Ind. Electron. 2018, 66, 3208–3216. [Google Scholar] [CrossRef]
  10. Hinchi, A.Z.; Tkiouat, M. Rolling element bearing remaining useful life estimation based on a convolutional long-short-term memory network. Procedia Comput. Sci. 2018, 127, 123–132. [Google Scholar] [CrossRef]
  11. Wang, B.; Lei, Y.; Yan, T.; Li, N.; Guo, L. Recurrent convolutional neural network: A new framework for remaining useful life prediction of machinery. Neurocomputing 2020, 379, 117–129. [Google Scholar] [CrossRef]
  12. Li, X.; Zhang, W.; Ding, Q. Deep learning-based remaining useful life estimation of bearings using multi-scale feature extraction. Reliab. Eng. Syst. Saf. 2019, 182, 208–218. [Google Scholar] [CrossRef]
  13. Elman, J.L. Finding Structure in Time. Cogn. Sci. 1990, 14, 179–211. [Google Scholar] [CrossRef]
  14. Zaremba, W.; Sutskever, I.; Vinyals, O. Recurrent Neural Network Regularization. arXiv 2014, arXiv:1409.2329. [Google Scholar]
  15. Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8–14 September 2018. [Google Scholar]
  16. Chui, C.K.; Jiang, Q.; Li, L.; Lu, J. Analysis of an adaptive short-time Fourier transform-based multicomponent signal separation method derived from linear chirp local approximation. J. Comput. Appl. Math. 2021, 396, 113607. [Google Scholar] [CrossRef]
  17. Sutrisno, E.; Oh, H.; Vasan, A.S.S.; Pecht, M. Estimation of remaining useful life of ball bearings using data driven methodologies. In Proceedings of the 2012 IEEE Conference on Prognostics and Health Management, Denver, CO, USA, 18–21 June 2012; pp. 1–7. [Google Scholar]
  18. Nectoux, P.; Gouriveau, R.; Medjaher, K.; Ramasso, E.; Chebel-Morello, B.; Zerhouni, N.; Varnier, C. PRONOSTIA: An experimental platform for bearings accelerated degradation tests. In Proceedings of the IEEE International Conference on Prognostics and Health Management, PHM’12, Denver, CO, USA, 18 June 2012; pp. 1–8. [Google Scholar]
Figure 1. LSTM structures.
Figure 1. LSTM structures.
Sensors 22 07402 g001
Figure 2. CBAM structures.
Figure 2. CBAM structures.
Sensors 22 07402 g002
Figure 3. The flow diagram of our method.
Figure 3. The flow diagram of our method.
Sensors 22 07402 g003
Figure 4. Three stages of vibration signal.
Figure 4. Three stages of vibration signal.
Sensors 22 07402 g004
Figure 5. Structure of SAL-CNN.
Figure 5. Structure of SAL-CNN.
Sensors 22 07402 g005
Figure 6. The RUL prediction results of selected bearings.
Figure 6. The RUL prediction results of selected bearings.
Sensors 22 07402 g006
Table 1. Architecture of SAL-CNN.
Table 1. Architecture of SAL-CNN.
LayerParameterValue
STFTWindowHamming
fs25,600 Hz
Nperseg512
Conv1In channels1
Kernel size3 × 3
Out channels5
Conv2In channels5
Kernel size3 × 3
Out channels5
Conv3In channels5
Kernel size3 × 3
Out channels1
LSTMInput size1408
Hidden size1408
Num layers2
Table 2. Distribution of bearings in 2012PHM Challenge.
Table 2. Distribution of bearings in 2012PHM Challenge.
Operating ModeRadial Force (N)Rotate Speed (rpm)Bearing
140001800Bearing1-1, Bearing1-2
Bearing1-3, Bearing1-4
Bearing1-5, Bearing1-6, Bearing1-7
242001650Bearing2-1, Bearing2-2
Bearing2-3,Bearing2-4
Bearing2-5, Bearing2-6, Bearing2-7
350001500Bearing3-1, Bearing3-2, Bearing3-3
Table 3. Comparison of life prediction results based on MAE index. Lower MAE means better performance.
Table 3. Comparison of life prediction results based on MAE index. Lower MAE means better performance.
BearingDNNSSLSSHMulti-Scale [12]SAL-CNN
Bearing1-130.424.324.421.416.9
Bearing1-228.517.422.414.713.0
Bearing1-316.39.59.37.810.3
Bearing1-432.121.324.521.823.0
Bearing1-528.721.419.518.511.8
Bearing1-632.424.520.420.321.4
Bearing1-716.69.611.78.311.9
Bearing2-128.624.225.221.518.5
Bearing2-226.415.316.516.28.8
Bearing2-340.535.231.532.814.8
Bearing2-422.311.69.96.124.2
Bearing2-531.427.325.223.623.3
Bearing2-629,423.219.820.78.6
Bearing2-738.028.228.427.427.1
Bearing3-128.328.526.319.625.2
Bcaring3-230.425.223.723.118.1
Bearing3-338.535.433.530.411.1
Table 4. Spectrogram and heatmap of bearings in three stages.
Table 4. Spectrogram and heatmap of bearings in three stages.
BearingImage TypeEarly StageMiddle StageLate Stage
1-1Time-frequency
graph
Sensors 22 07402 i001 Sensors 22 07402 i002 Sensors 22 07402 i003
Heatmap
1-3Time-frequency
graph
Sensors 22 07402 i004 Sensors 22 07402 i005 Sensors 22 07402 i006
Heatmap
1-5Time-frequency
graph
Sensors 22 07402 i007 Sensors 22 07402 i008 Sensors 22 07402 i009
Heatmap
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Liu, B.; Gao, Z.; Lu, B.; Dong, H.; An, Z. Deep Learning-Based Remaining Useful Life Estimation of Bearings with Time-Frequency Information. Sensors 2022, 22, 7402. https://doi.org/10.3390/s22197402

AMA Style

Liu B, Gao Z, Lu B, Dong H, An Z. Deep Learning-Based Remaining Useful Life Estimation of Bearings with Time-Frequency Information. Sensors. 2022; 22(19):7402. https://doi.org/10.3390/s22197402

Chicago/Turabian Style

Liu, Bingguo, Zhuo Gao, Binghui Lu, Hangcheng Dong, and Zeru An. 2022. "Deep Learning-Based Remaining Useful Life Estimation of Bearings with Time-Frequency Information" Sensors 22, no. 19: 7402. https://doi.org/10.3390/s22197402

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop