Transformer-Based Unsupervised Cross-Sensor Domain Adaptation for Electromechanical Actuator Fault Diagnosis
Abstract
:1. Introduction
- An end-to-end cross-sensor domain fault diagnosis model is proposed. The domain adaptation is based on a source-only supervised method.
- The proposed method takes advantage of a new input embedding technique, which is investigated to incorporate the local features into the attention mechanism. An enhanced Transformer is introduced as the backbone to extract effective information from local features with an attention mechanism.
- A benchmark dataset is grouped into twenty-four transfer tasks to validate the effectiveness of the proposed method. Experimental results on EMA fault diagnosis show the excellent performance of the proposed method in terms of fault diagnosis accuracy and sensor generalization capability under different working conditions.
2. Problem Formulation
3. Proposed Method
3.1. Local Feature Extraction
3.2. Enhanced Transformer Based Feature Extraction
3.3. Feature Transfer and Classification
4. Experiment Study
4.1. Dataset Description
- Actuator A—Fault-injected test actuator;
- Actuator B—The nominal test actuator;
- Actuator C—Load actuator.
4.2. Transfer Task Description
4.3. Compared Approaches
4.4. Model Parameters
4.5. Training Strategies
5. Results and Discussion
5.1. Overall Results
5.2. Visualization
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Mazzoleni, M.; Di Rito, G.; Previdi, F. Electro-Mechanical Actuators for the More Electric Aircraft; Springer: Cham, Switzerland, 2021; ISBN 3-030-61798-X. [Google Scholar]
- Garza, P.; Perinpanayagam, S.; Aslam, S.; Wileman, A. Qualitative Validation Approach Using Digital Model for the Health Management of Electromechanical Actuators. Appl. Sci. 2020, 10, 7809. [Google Scholar] [CrossRef]
- Quattrocchi, G.; Berri, P.C.; Dalla Vedova, M.D.L.; Maggiore, P. An Improved Fault Identification Method for Electromechanical Actuators. Aerospace 2022, 9, 341. [Google Scholar] [CrossRef]
- Arriola, D.; Thielecke, F. Model-Based Design and Experimental Verification of a Monitoring Concept for an Active-Active Electromechanical Aileron Actuation System. Mech. Syst. Signal Process. 2017, 94, 322–345. [Google Scholar] [CrossRef]
- Ossmann, D.; van der Linden, F.L.J. Advanced Sensor Fault Detection and Isolation for Electro-Mechanical Flight Actuators. In Proceedings of the 2015 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), Montreal, QC, Canada, 15–18 June 2015; pp. 1–8. [Google Scholar]
- Chirico, A.J., III; Kolodziej, J.R. A Data-Driven Methodology for Fault Detection in Electromechanical Actuators. J. Dyn. Syst. Meas. Control 2014, 136, 041025. [Google Scholar] [CrossRef]
- Rezaeianjouybari, B.; Shang, Y. Deep Learning for Prognostics and Health Management: State of the Art, Challenges, and Opportunities. Measurement 2020, 163, 107929. [Google Scholar] [CrossRef]
- Reddy, K.K.; Sarkar, S.; Venugopalan, V.; Giering, M. Anomaly Detection and Fault Disambiguation in Large Flight Data: A Multi-Modal Deep Auto-Encoder Approach. In Proceedings of the Annual Conference of the PHM Society, Denver, CO, USA, 3–6 October 2016; Volume 8. [Google Scholar] [CrossRef]
- Yang, J.; Guo, Y.; Zhao, W. Long Short-Term Memory Neural Network Based Fault Detection and Isolation for Electro-Mechanical Actuators. Neurocomputing 2019, 360, 85–96. [Google Scholar] [CrossRef]
- Riaz, N.; Shah, S.I.A.; Rehman, F.; Gilani, S.O.; Udin, E. A Novel 2-D Current Signal-Based Residual Learning with Optimized Softmax to Identify Faults in Ball Screw Actuators. IEEE Access 2020, 8, 115299–115313. [Google Scholar] [CrossRef]
- Li, W.; Huang, R.; Li, J.; Liao, Y.; Chen, Z.; He, G.; Yan, R.; Gryllias, K. A Perspective Survey on Deep Transfer Learning for Fault Diagnosis in Industrial Scenarios: Theories, Applications and Challenges. Mech. Syst. Signal Process. 2022, 167, 108487. [Google Scholar] [CrossRef]
- Zhao, Z.; Zhang, Q.; Yu, X.; Sun, C.; Wang, S.; Yan, R.; Chen, X. Applications of Unsupervised Deep Transfer Learning to Intelligent Fault Diagnosis: A Survey and Comparative Study. IEEE Trans. Instrum. Meas. 2021, 70, 3525828. [Google Scholar] [CrossRef]
- Wang, J.; Zhang, Y.; Luo, C.; Miao, Q. Deep Learning Domain Adaptation for Electro-Mechanical Actuator Fault Diagnosis under Variable Driving Waveforms. IEEE Sens. J. 2022, 22, 10783–10793. [Google Scholar] [CrossRef]
- Zhang, Z.; Chen, H.; Li, S.; An, Z. Unsupervised Domain Adaptation via Enhanced Transfer Joint Matching for Bearing Fault Diagnosis. Measurement 2020, 165, 108071. [Google Scholar] [CrossRef]
- Shao, S.; McAleer, S.; Yan, R.; Baldi, P. Highly Accurate Machine Fault Diagnosis Using Deep Transfer Learning. IEEE Trans. Ind. Inform. 2019, 15, 2446–2455. [Google Scholar] [CrossRef]
- Zhou, Y.; Kumar, A.; Parkash, C.; Vashishtha, G.; Tang, H.; Xiang, J. A Novel Entropy-Based Sparsity Measure for Prognosis of Bearing Defects and Development of a Sparsogram to Select Sensitive Filtering Band of an Axial Piston Pump. Measurement 2022, 203, 111997. [Google Scholar] [CrossRef]
- Yang, B.; Lei, Y.; Xu, S.; Lee, C.-G. An Optimal Transport-Embedded Similarity Measure for Diagnostic Knowledge Transferability Analytics across Machines. IEEE Trans. Ind. Electron. 2022, 69, 7372–7382. [Google Scholar] [CrossRef]
- Zhiyi, H.; Haidong, S.; Lin, J.; Junsheng, C.; Yu, Y. Transfer Fault Diagnosis of Bearing Installed in Different Machines Using Enhanced Deep Auto-Encoder. Measurement 2020, 152, 107393. [Google Scholar] [CrossRef]
- Li, X.; Jiang, H.; Wang, R.; Niu, M. Rolling Bearing Fault Diagnosis Using Optimal Ensemble Deep Transfer Network. Knowl. Based Syst. 2021, 213, 106695. [Google Scholar] [CrossRef]
- Zareapoor, M.; Shamsolmoali, P.; Yang, J. Oversampling Adversarial Network for Class-Imbalanced Fault Diagnosis. Mech. Syst. Signal Process. 2021, 149, 107175. [Google Scholar] [CrossRef]
- Zou, L.; Li, Y.; Xu, F. An Adversarial Denoising Convolutional Neural Network for Fault Diagnosis of Rotating Machinery under Noisy Environment and Limited Sample Size Case. Neurocomputing 2020, 407, 105–120. [Google Scholar] [CrossRef]
- Li, X.; Zhang, W.; Xu, N.-X.; Ding, Q. Deep Learning-Based Machinery Fault Diagnostics with Domain Adaptation across Sensors at Different Places. IEEE Trans. Ind. Electron. 2020, 67, 6785–6794. [Google Scholar] [CrossRef]
- Tzeng, E.; Hoffman, J.; Saenko, K.; Darrell, T. Adversarial Discriminative Domain Adaptation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2962–2971. [Google Scholar]
- Long, M.; Cao, Y.; Wang, J.; Jordan, M. Learning Transferable Features with Deep Adaptation Networks. In Proceedings of the 32nd International Conference on Machine Learning, ICML’15, Lille, France, 6–11 July 2015; Bach, F., Blei, D., Eds.; Volume 37, pp. 97–105. [Google Scholar]
- Pandhare, V.; Li, X.; Miller, M.; Jia, X.; Lee, J. Intelligent Diagnostics for Ball Screw Fault through Indirect Sensing Using Deep Domain Adaptation. IEEE Trans. Instrum. Meas. 2021, 70, 2504211. [Google Scholar] [CrossRef]
- Wu, B.; Xu, C.; Dai, X.; Wan, A.; Zhang, P.; Tomizuka, M.; Keutzer, K.; Vajda, P. Visual Transformers: Token-Based Image Representation and Processing for Computer Vision. CoRR 2020, arXiv:2006.03677v4. [Google Scholar]
- Gulati, A.; Qin, J.; Chiu, C.-C.; Parmar, N.; Zhang, Y.; Yu, J.; Han, W.; Wang, S.; Zhang, Z.; Wu, Y.; et al. Conformer: Convolution-Augmented Transformer for Speech Recognition. In Proceedings of the Interspeech 2020, Shanghai, China, 25–29 October 2020; pp. 5036–5040. [Google Scholar]
- Xu, Z.; Li, C.; Yang, Y. Fault Diagnosis of Rolling Bearings Using an Improved Multi-Scale Convolutional Neural Network with Feature Attention Mechanism. ISA Trans. 2021, 110, 379–393. [Google Scholar] [CrossRef] [PubMed]
- Li, X.; Zhang, W.; Ding, Q. Understanding and Improving Deep Learning-Based Rolling Bearing Fault Diagnosis with Attention Mechanism. Signal Process. 2019, 161, 136–154. [Google Scholar] [CrossRef]
- Zhang, X.; He, C.; Lu, Y.; Chen, B.; Zhu, L.; Zhang, L. Fault Diagnosis for Small Samples Based on Attention Mechanism. Measurement 2022, 187, 110242. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. In Advances in Neural Information Processing Systems 30; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 5998–6008. [Google Scholar]
- Ding, Y.; Jia, M.; Miao, Q.; Cao, Y. A Novel Time-Frequency Transformer Based on Self-Attention Mechanism and Its Application in Fault Diagnosis of Rolling Bearings. Mech. Syst. Signal Process. 2022, 168, 108616. [Google Scholar] [CrossRef]
- Li, S.; Jin, X.; Xuan, Y.; Zhou, X.; Chen, W.; Wang, Y.-X.; Yan, X. Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting. In Advances in Neural Information Processing Systems; Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2019; Volume 32. [Google Scholar]
- Long, M.; Zhu, H.; Wang, J.; Jordan, M.I. Deep Transfer Learning with Joint Adaptation Networks. In Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; Precup, D., Teh, Y.W., Eds.; Volume 70, pp. 2208–2217. [Google Scholar]
- Wright, D.; Augenstein, I. Transformer Based Multi-Source Domain Adaptation. arXiv 2020. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Ba, J.L.; Kiros, J.R.; Hinton, G.E. Layer Normalization. arXiv 2016. [Google Scholar] [CrossRef]
- Xiong, R.; Yang, Y.; He, D.; Zheng, K.; Zheng, S.; Xing, C.; Zhang, H.; Lan, Y.; Wang, L.; Liu, T. On Layer Normalization in the Transformer Architecture. In Proceedings of the 37th International Conference on Machine Learning, ICML’20, Vienna, Austria, 12–18 July 2020; Daumé, H., III, Singh, A., Eds.; Volume 119, pp. 10524–10533. [Google Scholar]
- Nair, V.; Hinton, G.E. Rectified Linear Units Improve Restricted Boltzmann Machines. In ICML’20, Proceedings of the 27th International Conference on International Conference on Machine Learning, Haifa, Israel, 21–24 June 2010; Omnipress: Madison, WI, USA, 2010; pp. 807–814. [Google Scholar]
- Hendrycks, D.; Gimpel, K. Gaussian Error Linear Units (Gelus). arXiv 2016, arXiv:1606.08415. [Google Scholar]
- Balaban, E.; Saxena, A.; Narasimhan, S.; Roychoudhury, I.; Koopmans, M.; Ott, C.; Goebel, K. Prognostic Health-Management System Development for Electromechanical Actuators. J. Aerosp. Inf. Syst. 2015, 12, 329–344. [Google Scholar] [CrossRef] [Green Version]
- Long, M.; Cao, Z.; Wang, J.; Jordan, M.I. Conditional Adversarial Domain Adaptation. In Advances in Neural Information Processing Systems 31; Curran Associates Inc.: Red Hook, NY, USA, 2018. [Google Scholar] [CrossRef]
Experiment | Driving Waveform | Waveform Parameters | Load Profile (lbs) | Max. Velocity (m/s) |
---|---|---|---|---|
1 | Trapezoidal | 40 mm, 22 s (1 + 1 s motion, 10 + 10 s hold) 1 | Const at −10 | 0.04 |
2 | 40 mm, 21 s (0.5 + 0.5 s motion, 10 + 10 s hold) | Const at 10 | 0.08 | |
3 | Triangular | 80 mm, 4 s 2 | Const at −10 | 0.04 |
4 | 40 mm, 2 s | Const at 10 | 0.08 |
Task | Experiment | Source Sensor | Target Sensor |
---|---|---|---|
T1a | 1 | Location X | Location Y |
T2a | Location X | Location Z | |
T3a | Location Y | Location X | |
T4a | Location Z | Location X | |
T5a | Location Y | Location Z | |
T6a | Location Z | Location Y | |
T1b | 2 | Location X | Location Y |
T2b | Location X | Location Z | |
T3b | Location Y | Location X | |
T4b | Location Z | Location X | |
T5b | Location Y | Location Z | |
T6b | Location Z | Location Y | |
T1c | 3 | Location X | Location Y |
T2c | Location X | Location Z | |
T3c | Location Y | Location X | |
T4c | Location Z | Location X | |
T5c | Location Y | Location Z | |
T6c | Location Z | Location Y | |
T1d | 4 | Location X | Location Y |
T2d | Location X | Location Z | |
T3d | Location Y | Location X | |
T4d | Location Z | Location X | |
T5d | Location Y | Location Z | |
T6d | Location Z | Location Y |
Architecture | Networks | Parameters | Values |
---|---|---|---|
Local Feature Network | Convolution | 3 | |
1 | |||
Linear Mapping | 1 | ||
16 | |||
0.5 | |||
Position Encoding | 128 | ||
Backbone Network | Encoder | 4 | |
32 | |||
0.5 | |||
2 | |||
Flatten | - | - | |
Fully Connected | 256 | ||
Bottleneck Network | Fully Connected | 2 | |
256 | |||
0.5 | |||
1 | |||
Classification network | Fully Connected | 5 | |
0.5 |
Architecture | Layer | Parameters | Values |
---|---|---|---|
Backbone network | Convolution1 | out_channels | 8 |
kernel_size | 16 | ||
stride | 1 | ||
padding | 0 | ||
dilation | 1 | ||
batchnorm_size | 4 | ||
Convolution2 | out_channels | 16 | |
kernel_size | 3 | ||
stride | 1 | ||
padding | 0 | ||
dilation | 1 | ||
batchnorm_size | 16 | ||
Adaptive Max Polling | output_size | 4 | |
Flatten | - | - | |
Fully Connected | output_features | 256 | |
0.5 |
Task | CNN | BT | ET | CJ | ETJ |
---|---|---|---|---|---|
T1a | 67.37% | 75.23% | 83.88% | 91.49% | 97.45% |
T2a | 73.21% | 82.42% | 85.51% | 92.53% | 97.75% |
T3a | 76.86% | 84.66% | 88.79% | 93.74% | 99.75% |
T4a | 78.34% | 84.38% | 89.35% | 94.41% | 99.75% |
T5a | 77.53% | 82.25% | 87.92% | 93.03% | 98.32% |
T6a | 74.58% | 80.82% | 86.43% | 93.12% | 98.60% |
T1b | 57.96% | 78.61% | 85.81% | 86.13% | 97.95% |
T2b | 63.79% | 82.24% | 87.72% | 91.20% | 98.17% |
T3b | 69.42% | 84.38% | 89.35% | 93.95% | 99.58% |
T4b | 67.50% | 83.90% | 88.65% | 93.40% | 99.32% |
T5b | 76.65% | 82.66% | 87.66% | 91.90% | 98.34% |
T6b | 71.05% | 81.85% | 87.97% | 86.08% | 98.17% |
T1c | 48.31% | 67.41% | 71.59% | 77.64% | 92.41% |
T2c | 52.07% | 64.97% | 69.88% | 78.20% | 88.95% |
T3c | 74.20% | 83.40% | 87.78% | 92.44% | 98.98% |
T4c | 69.40% | 84.12% | 88.29% | 93.52% | 99.20% |
T5c | 67.27% | 72.78% | 78.38% | 85.65% | 90.96% |
T6c | 64.03% | 77.98% | 80.45% | 88.86% | 94.49% |
T1d | 62.89% | 78.91% | 85.43% | 89.05% | 96.89% |
T2d | 64.18% | 77.37% | 83.19% | 89.45% | 94.81% |
T3d | 74.43% | 84.55% | 89.75% | 93.80% | 99.89% |
T4d | 75.54% | 84.58% | 89.52% | 93.99% | 99.83% |
T5d | 73.30% | 80.96% | 84.84% | 90.93% | 96.97% |
T6d | 75.46% | 82.53% | 87.73% | 91.94% | 98.26% |
Mean | 68.97% | 80.12% | 85.24% | 90.27% | 97.28% |
St. Dev. | 0.07940 | 0.05328 | 0.05298 | 0.04609 | 0.02919 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, Z.; He, C. Transformer-Based Unsupervised Cross-Sensor Domain Adaptation for Electromechanical Actuator Fault Diagnosis. Machines 2023, 11, 102. https://doi.org/10.3390/machines11010102
Chen Z, He C. Transformer-Based Unsupervised Cross-Sensor Domain Adaptation for Electromechanical Actuator Fault Diagnosis. Machines. 2023; 11(1):102. https://doi.org/10.3390/machines11010102
Chicago/Turabian StyleChen, Zihan, and Chao He. 2023. "Transformer-Based Unsupervised Cross-Sensor Domain Adaptation for Electromechanical Actuator Fault Diagnosis" Machines 11, no. 1: 102. https://doi.org/10.3390/machines11010102
APA StyleChen, Z., & He, C. (2023). Transformer-Based Unsupervised Cross-Sensor Domain Adaptation for Electromechanical Actuator Fault Diagnosis. Machines, 11(1), 102. https://doi.org/10.3390/machines11010102