An Intelligent Multi-Local Model Bearing Fault Diagnosis Method Using Small Sample Fusion
Abstract
:1. Introduction
- To improve the accuracy of fault diagnosis under the condition where only a small amount of monitoring data can be collected due to the relatively short signal duration and the harsh conditions endured by some special industrial motor bearings such as centrifugal and cryogenic pumps, a multi-local model fault diagnosis algorithm of bearing faults using small sample fusion (MLMF-SS) is proposed in this paper. In the proposed algorithm, we consider the similarity and migration of shallow features of industrial motor bearing data by introducing migration learning strategies for solving the low diagnostic accuracy problem with small amounts of data in industry. Specifically, the Bi-LSTM-based multi-local feature fusion model is used as the fault diagnosis model of the source domain. Then, the transfer of a small sample bearing fault diagnosis model based on similarity transfer measurement is optimized. The maximum mean difference algorithm is used to evaluate the data distribution differences between the source domain dataset and the target domain dataset. The network structure and depth of the source domain fault diagnosis model are adjusted based on the evaluation results.
- The performance of the proposed algorithm is evaluated using an actual experiment platform. Firstly, the effectiveness of optimizing the migration of fault diagnosis models was evaluated. To verify the effectiveness of the proposed fault diagnosis model migration optimization strategy, this paper selects three small-sample processing strategies based on the multi-local feature fusion model for comparison. These are (1) the source domain fault diagnosis model is directly trained using target domain bearing data, termed a conventional strategy (CS); (2) the source domain fault diagnosis model is trained using source and target domain bearing data, termed a mixed strategy (MS); (3) the source domain fault diagnosis model is trained using both the source domain and target domain bearing datasets and all of the network parameters of the source domain fault diagnosis model are fine-tuned, termed the transfer strategy (TS). The experimental results show that the algorithm proposed in this paper achieves higher accuracy compared to the comparison strategy. Then, the proposed MLMF-SS algorithm was evaluated and compared with several advanced algorithms, i.e., long short-term memory (LSTM)-based, transfer component analysis and deep belief network (TCA-DBN)-based, and deep transfer convolutional neural network (TCNN)-based algorithms, under small sample conditions. It is demonstrated that high accuracy and F1 score can be obtained by our proposed algorithm compared with other algorithms.
2. Difference and Migration of Shallow Features of Similar Bearings
2.1. Difference of Shallow Features of Similar Bearings
2.2. Migration of Shallow Features of Similar Bearings
2.3. Migration Selection Based on Sample Dataset Similarity Measure
3. Shallow Feature Migration for Bearing Faults Constrained by Small Sample Constraints
Similarity Migration Metric of Bearing Shallow Features Based on MMD
- (1)
- When the average value between the target and source domains is lower than or equal to the set threshold , the difference between the two domains is minimal, the shallow and deep features between the bearings are highly similar. Therefore, the shallow and the deep feature extraction layers of the fault diagnosis model of the source domain are migrated to generate the fault diagnosis model of the target domain.
- (2)
- When the average value of the target and source domains is higher than the set threshold and lower than the set threshold , it means that the difference between the distributions of the two domains is small while the shallow features between the bearings are highly similar, and the deep features are less similar. Therefore, the shallow feature extraction layers of the fault diagnosis model of the source domain are migrated to generate the fault diagnosis model of the target domain. In addition, a new attention mechanism layer is added in front of the fusion feature layer in the model of the target domain to highlight the contribution of important features to fault diagnosis.
- (3)
- When the average value between the target and the source domains is higher than or equal to the set threshold , it means that there is a large difference between the two domains while the shallow features and deep features between the bearings have large differences. At this time, it is necessary to re-analyze and process the source domain data, and then perform corresponding processing after determining the applicability of migration optimization. The thresholds and are set according to expert experience and multiple experimental analysis.
4. Bearing Fault Diagnosis Based on Small Sample Fusion
4.1. Bi-LSTM-Based Multi-Local Feature Fusion Model Construction
- (1)
- Selection of the original signal source of the bearing. Considering that there are a lot of uncertainties in fault diagnosis and the complementarity between the current and bearing vibration signals, the current and vibration sensor signals are selected as the original signal source, and used as input information for training the fault diagnosis model in parallel.
- (2)
- Feature adaptive extraction and fusion. Two Bi-LSTM deep learning sub-networks adaptively extract features from the original vibration and current signal sources to deeply mine the deep-level mapping relationship and feature information of the industrial motor bearing vibration sensor signals and current sensor signals in various fault states. After that, the feature fusion layer fuses the vibration signal’s feature information and current signal’s feature information extracted by the Bi-LSTM sub-network. Moreover, in order to prevent overfitting, a dropout operation is added between the feature fusion and fully connected layers. The feature fusion part of the fault diagnosis model considered here does not use more complex feature fusion technology, but chooses direct serial fusion. This is because direct serial fusion has stronger versatility. In addition, no matter whether the feature average summation method or the feature screening fusion method are chosen, it will cause a certain amount of information loss. Direct serial fusion has a lower information loss rate and contains more comprehensive bearing status information. It is obvious to say that fault diagnosis is more suitable under small sample conditions.
- (3)
- Fault identification and classification. The classification and fully connected layers of the fault diagnosis model are mainly used for the recognition and classification stages. In this paper, the features processed by the fusion feature layer are output to the fully connected layer. After performing the conversion of the feature space on the fused features using the fully connected layer, the classification layer is used to identify and classify the bearing status.
4.2. Migration Optimization of Bearing Fault Diagnosis Model Based on Similarity Migration Metric
- (1)
- After filtering out any data sample w in the bearing dataset in the source domain, calculate the difference of the two domains using the formula below.
- (2)
- After that, calculate the average value of the a , with the calculation formula as follows:
- (3)
- Sort all values in ascending (or descending) order and compare with values to determine the sample data that needs to be screened out, and generate a new source domain bearing dataset. The judgment formula for dataset sample screening is as follows:
Algorithm 1 Migration Optimization Process |
|
5. Performance Evaluation
5.1. Experimental Settings
5.2. Effectiveness Evaluation of Fault Diagnosis Model Migration Optimization
5.3. Performance Evaluation of Fault Diagnosis Algorithms under Small Sample Conditions
6. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Tang, H.; Lu, S.; Qian, G.; Ding, J.; Liu, Y.; Wang, Q. IoT-based signal enhancement and compression method for efficient motor bearing fault diagnosis. IEEE Sens. J. 2021, 21, 1820–1828. [Google Scholar] [CrossRef]
- Xu, Y.; Yan, X.; Sun, B.; Liu, Z. Deep Coupled Visual Perceptual Networks for Motor Fault Diagnosis Under Nonstationary Conditions. IEEE Trans. Mechatronics 2022, 27, 4840–4850. [Google Scholar] [CrossRef]
- Luo, P.; Yin, Z.; Yuan, D.; Gao, F.; Liu, J. An intelligent method for early motor bearing fault diagnosis based on Wasserstein distance generative adversarial networks meta learning. IEEE Trans. Instrum. Meas. 2023, 72, 3517611. [Google Scholar] [CrossRef]
- Wang, X.; Lu, S.; Chen, K.; Wang, Q.; Zhang, S. Bearing Fault Diagnosis of Switched Reluctance Motor in Electric Vehicle Powertrain via Multisensor Data Fusion. IEEE Trans. Ind. Inform. 2022, 18, 2452–2464. [Google Scholar] [CrossRef]
- Xing, Z.; Yi, C.; Lin, J.; Zhou, Q. A novel periodic cyclic sparse network with entire domain adaptation for deep transfer fault diagnosis of rolling bearing. IEEE Sens. J. 2023, 23, 13452–13468. [Google Scholar] [CrossRef]
- Zhou, Y.; Dong, Y.; Zhou, H.; Tang, G. Deep dynamic adaptive transfer network for rolling bearing fault diagnosis with considering cross-machine instance. IEEE Trans. Instrum. Meas. 2021, 70, 1–11. [Google Scholar] [CrossRef]
- Riera-Guasp, M.; Antonino-Daviu, J.A.; Capolino, G.A. Advances in Electrical Machine, Power Electronic, and Drive Condition Monitoring and Fault Detection: State of the Art. IEEE Trans. Ind. Electron 2015, 62, 1746–1759. [Google Scholar] [CrossRef]
- Wang, B.; Wen, L.; Li, X.; Gao, L. Adaptive class center generalization network: A sparse domain-regressive framework for bearing fault diagnosis under unknown working conditions. IEEE Trans. Instrum. Meas. 2023, 72, 3516511. [Google Scholar] [CrossRef]
- Liu, X.; Xia, L.; Shi, J.; Zhang, L.; Bai, L.; Wang, S. A fault diagnosis method of rolling bearing based on improved recurrence plot and convolutional neural network. IEEE Sens. J. 2023, 23, 10767–10775. [Google Scholar] [CrossRef]
- Kong, X.; Cai, B.; Liu, Y.; Zhu, H.; Liu, Y.; Shao, H.; Yang, C.; Li, H.; Mo, T. Optimal sensor placement methodology of hydraulic control system for fault diagnosis. Mech. Syst. Signal Process. 2022, 174, 109069. [Google Scholar] [CrossRef]
- Meng, Z.; Luo, C.; Li, J.; Cao, L.; Fan, F. Research on fault diagnosis of rolling bearing based on lightweight model with multiscale features. IEEE Sens. J. 2023, 23, 13236–13247. [Google Scholar] [CrossRef]
- Zhao, D.F.; Liu, S.L.; Du, H.Y.; Wang, L.; Miao, Z.H. Deep branch attention network and extreme multi scale entropy based signal driven variab le speed fault diagnosis scheme for rolling bearing. Adv. Eng. Inform. 2023, 55, 101844. [Google Scholar] [CrossRef]
- Wang, X.; Shen, C.Q.; Xia, M.; Wang, D.; Zhu, J.; Zhu, Z.K. Multi-scale deep intra-class transfer learning for bearing fault diagnosis. Reliab. Eng. Syst. Saf. 2020, 202, 107050. [Google Scholar] [CrossRef]
- Meng, Z.; Zhan, Y.; Li, J.; Pan, Z. An enhancement denoising autoencoder for rolling bearing fault diagnosis. Measurement 2018, 130, 448–454. [Google Scholar] [CrossRef]
- Chao, Q.; Xu, Z.; Shao, Y.; Tao, J.; Liu, C.; Ding, S. Hybrid model-driven and data-driven approach for the health assessment of axial piston pumps. Int. J. Hydromechatronics 2023, 6, 76–92. [Google Scholar] [CrossRef]
- Fang, H.; An, J.; Liu, H.; Xiang, J.; Dunkin, F. A lightweight transformer with strong robustness application in portable bearing fault diagnosis. IEEE Sens. J. 2023, 23, 9649–9657. [Google Scholar] [CrossRef]
- Kong, X.; Cai, B.; Liu, Y.; Zhu, H.; Yang, C.; Gao, C.; Liu, Y.; Liu, Z.; Ji, R. Fault Diagnosis Methodology of Redundant Closed-Loop Feedback Control Systems: Subsea Blowout Preventer System as a Case Study. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 1618–1619. [Google Scholar] [CrossRef]
- Ragab, M.; Chen, Z.; Zhang, W.; Eldele, E.; Wu, M.; Kwoh, C.-K.; Li, X. Conditional constrictive domain generalization for fault diagnosis. IEEE Trans. Instrum. Meas. 2022, 71, 3506912. [Google Scholar] [CrossRef]
- Chen, X.; Ma, M.; Zhao, Z.; Zhai, Z.; Mao, Z. Physics-informed deep neural network for bearing prognosis with multistory signals. J. Dyn. Monitor Diag. 2022, 4, 200–207. [Google Scholar]
- Wang, H.; Bai, X.; Wang, S.; Tan, J.; Liu, C. Generalization on unseen domains via model-agnostic learning for intelligent fault diagnosis. IEEE Trans. Instrum. Meas. 2022, 71, 3506411. [Google Scholar] [CrossRef]
- Feng, K.; Ji, J.C.; Zhang, Y.; Ni, Q.; Liu, Z.; Beer, M. Digital twin-driven intelligent assessment of gear surface degradation. Mech. Syst. Signal Process 2023, 186, 109896. [Google Scholar] [CrossRef]
- Islam, M.M.M.; Prosvirin, A.E.; Kim, J.M. Data-driven prognostic scheme for rolling-element bearings using a new health index and variants of least-square support vector machines. Mech. Syst. Signal Process. 2021, 160, 107853. [Google Scholar] [CrossRef]
- Li, Y.; Chen, Y.; Zhu, K.; Bai, C.; Zhang, J. An Effective Federated Learning Verification Strategy and Its Applications for Fault Diagnosis in Industrial IoT Systems. IEEE Internet Things J. 2022, 9, 16835–16849. [Google Scholar] [CrossRef]
- Yang, C.; Cai, B.; Wu, Q.; Wang, C.; Ge, W.; Hu, Z.; Zhu, W.; Zhang, L.; Wang, L. Digital twin-driven fault diagnosis method for composite faults by combining virtual and real data. J. Ind. Inf. Integr. 2023, 33, 100469. [Google Scholar] [CrossRef]
- Wang, X.; Li, A.; Han, G. A Deep-Learning-Based Fault Diagnosis Method of Industrial Bearings Using Multi-Source Information. Appl. Sci. 2023, 13, 933. [Google Scholar] [CrossRef]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipon, H. How transferable are features in deep neural network? In Proceedings of the NIPS’14: 27th International Conference on Neural Information Processing Systems, Montreal, QC, Canada, 8–14 December 2014; pp. 3320–3328. [Google Scholar]
- Miao, X.; Li, S.; Zhu, Y.; An, Z. A novel real-time fault diagnosis method for planetary gearbox using transferable hidden. IEEE Sens. J. 2020, 20, 8403–8412. [Google Scholar] [CrossRef]
- Banerjee, A.; Merugu, S.; Dhilon, I.S.; Glosh, J. Clustering with Bregman Divergences. J. Mach. Learn. Res. 2005, 6, 1705–1749. [Google Scholar]
- Kulback, S.; Leibler, R.A. On Information and Sufficiency. Ann. Math. Statist 1951, 22, 79–86. [Google Scholar] [CrossRef]
- Lin, J. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 1991, 37, 145–151. [Google Scholar] [CrossRef]
- Borgwardt, K.M.; Gretton, A.; Rasch, M.J.; Kriegel, H.-P.; Scholkopf, B.; Smola, A.J. Integrating structured biological data by Kernel maximum mean discrepancy. Bioinformatics 2006, 22, e49–e57. [Google Scholar] [CrossRef]
- Si, S.; Tao, D.; Geng, B. Bregman divergence-based regularization for transfer subspace learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 929–942. [Google Scholar] [CrossRef]
- Gretton, A.; Borgwardt, K.M.; Rasch, M.J.; Scholkopf, B.; Smola, A. A kernel two-sample test. J. Mach. Learn. Res. 2012, 13, 723–773. [Google Scholar]
- Yan, H.; Ding, Y.; Li, P.; Wang, Q.; Xu, Y.; Zuo, W. Mind the lass weight bias: Weighted maximum mean discrepancy for unsupervised domain adaptation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Dziugaite, G.; Roy, D.; Ghahramani, Z. Training generative neural networks via maximum mean discrepancy optimization. arXiv 2015, arXiv:1505.03906. [Google Scholar]
Dataset Name | Rotating Speed (rmp) | Load (hp) | Damage Diameter (mm) | Operating Status | Sample Size |
---|---|---|---|---|---|
Dataset of source domain | 1500 | 0 | 0.03 | Normal status, internal fault, outer fault | 250, 250, 250 |
Dataset 1 of target domain | 1530 | 0 | 0.03 | Normal status, internal fault, outer fault | 100, 100, 100 |
Dataset 2 of target domain | 1550 | 1 | 0.03 | Normal status, internal fault, outer fault | 100, 100, 100 |
Dataset 3 of target domain | 1700 | 2 | 1 | Normal status, internal fault, outer fault | 100, 100, 100 |
Network Layer Name | Activation Function | Number of Units |
---|---|---|
Bi-LSTM layer a | Tanh | 32 |
Bi-LSTM layer b | Tanh | 16 |
Attention mechanism layer | / | 16 |
Fully connected layer | ReLU | 32 |
Classification layer | Softmax | 3 |
Name of Algorithm | MLMF-TS | MLMF-SS |
---|---|---|
Training time reduction ratio | 15.3 | 29.0 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, X.; Li, A.; Han, G. An Intelligent Multi-Local Model Bearing Fault Diagnosis Method Using Small Sample Fusion. Sensors 2023, 23, 7567. https://doi.org/10.3390/s23177567
Zhou X, Li A, Han G. An Intelligent Multi-Local Model Bearing Fault Diagnosis Method Using Small Sample Fusion. Sensors. 2023; 23(17):7567. https://doi.org/10.3390/s23177567
Chicago/Turabian StyleZhou, Xianzhang, Aohan Li, and Guangjie Han. 2023. "An Intelligent Multi-Local Model Bearing Fault Diagnosis Method Using Small Sample Fusion" Sensors 23, no. 17: 7567. https://doi.org/10.3390/s23177567
APA StyleZhou, X., Li, A., & Han, G. (2023). An Intelligent Multi-Local Model Bearing Fault Diagnosis Method Using Small Sample Fusion. Sensors, 23(17), 7567. https://doi.org/10.3390/s23177567