Adaptive Label Refinement Network for Domain Generalization in Compound Fault Diagnosis
Highlights
- A novel label refinement framework is proposed for compound fault diagnosis under source-scarce conditions, which iteratively evolves soft labels to enhance domain generalization.
- A KL-divergence-based stability coefficient autonomously guides the iterative label refinement process.
- ALRN achieves over 22% accuracy gain, setting a new state-of-the-art under source-scarce conditions
- It offers a practical solution for industrial diagnosis where collecting diverse labeled data is prohibitive.
Abstract
1. Introduction
- This study reveals imperfect label supervision as a critical factor undermining cross-domain generalization performance under the challenging yet practical conditions of scarce source domains and prevalent compound faults.
- A novel adaptive label refinement algorithm is proposed, through which soft labels are dynamically calibrated by an intra-class weighting mechanism. This process is autonomously guided by a KL-divergence-based stability coefficient, which is utilized to quantitatively monitor the refinement process and determine its convergence, thereby eliminating the need for a pre-defined iteration count.
- Extensive experiments on a planetary gearbox compound fault dataset demonstrate that the proposed ALRN framework establishes a new state-of-the-art for cross-domain fault diagnosis, achieving a significant accuracy improvement against conventional supervised baselines in both single-source and dual-source settings.
2. Preliminaries
2.1. Domain Generalization Problem Statement
2.2. Limitation of Hard Labels and Label Smoothing
3. Methodology
3.1. Adaptive Label Refinement Network (ALRN)
3.1.1. Feature Extraction
3.1.2. Label Refinement Algorithm
3.1.3. Label Refinement Stability Coefficient
| Algorithm 1 Training and test procedures for ALRN. |
| ① Training: |
| Input: Labeled source domain dataset . |
| Set the hyperparameters, including the learning rate, convolutional layers, pooling layers, activation functions, epochs, batch size. Set the labels as the identity matrix and threshold . |
| 1: For -th label refinement iteration do |
| 2: Let . |
| 3: For each epoch do |
| 4: Calculate the output of the model. |
| 5: Solve loss based on Equation (3). |
| 6: Calculate the gradients and update the model parameters. |
| 7: End for |
| 8: Calculate weights for the probability distribution outputs based on Equation (6). |
| 9: Slove based on Equation (7). |
| 10: If do |
| 11: Calculate based on Equation (9). |
| 12: Else do |
| 13: Calculate based on Equation (9). |
| 14: If do |
| 15: Break out |
| 16: End if |
| 17: End if |
| 18: End for |
| 19: Save trained model. |
| Output: The trained ALRN diagnostic model. |
| ② Testing: |
| Feed the target domain samples into model for fault diagnosis. |
3.2. Overall Structure
4. Experiments and Results
4.1. Data Collection and Description
4.2. Performance of Adaptive Label Refinement Network
4.3. Ablation Studies
4.4. Comparative Experiments
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Gao, Z.; Cecati, C.; Ding, S. A survey of fault diagnosis and fault-tolerant techniques part II: Fault diagnosis with knowledge-based and hybrid/active approaches. IEEE Trans. Ind. Electron. 2015, 62, 3768–3774. [Google Scholar] [CrossRef]
- Cai, B.; Huang, L.; Xie, M. Bayesian networks in fault diagnosis. IEEE Trans. Ind. Inf. 2017, 13, 2227–2240. [Google Scholar] [CrossRef]
- Xiao, Y.; Shao, H.; Wang, J.; Yan, S.; Liu, B. Bayesian variational transformer: A generalizable model for rotating machinery fault diagnosis. Mech. Syst. Signal Process. 2024, 207, 110936. [Google Scholar] [CrossRef]
- Spirto, M.; Nicolella, A.; Melluso, F.; Malfi, P.; Cosenza, C.; Savino, S.; Niola, V. Enhancing SDP-CNN for Gear Fault Detection Under Variable Working Conditions via Multi-Order Tracking Filtering. J. Dyn. Monit. Diagn. 2025. early access. [Google Scholar] [CrossRef]
- Hou, B.; Wang, Y.; Wang, D. Investigations on Multiclass Classification Model-Based Optimized Weights Spectrum for Rotating Machinery Condition Monitoring. J. Dyn. Monit. Diagn. 2025, 4, 194–202. [Google Scholar]
- Zuo, J.; Lin, J.; Miao, Y. Planetary Gearbox Fault Detection under Speed-varying Conditions with Angular Domain Cyclostationary Feature Mode Decomposition. IEEE Sens. J. 2025, 25, 36725–36734. [Google Scholar] [CrossRef]
- Liu, T.; Mao, Y.; Dou, H.; Zhang, W.; Yang, J.; Wu, P.; Li, D.; Mu, X. Emerging wearable acoustic sensing technologies. Adv. Sci. 2025, 12, 2408653. [Google Scholar] [CrossRef] [PubMed]
- Choudhary, A.; Mishra, R.K.; Fatima, S.; Panigrahi, B.K. Multi-input CNN-based vibro-acoustic fusion for accurate fault diagnosis of induction motor. Eng. Appl. Artif. Intell. 2023, 120, 105872. [Google Scholar] [CrossRef]
- Yang, H.; Lai, M.; Li, G. Novel underwater acoustic signal denoising: Combined optimization secondary decomposition coupled with original component processing algorithms. Chaos Solitons Fractals 2025, 193, 116098. [Google Scholar] [CrossRef]
- Kundu, P. Review of rotating machinery elements condition monitoring using acoustic emission signal. Expert Syst. Appl. 2024, 252, 124169. [Google Scholar] [CrossRef]
- Tang, L.; Wu, X.; Wang, C.; Liu, X. Single-sensor defect localization for low-speed bearing based on acoustic emission signal dispersion characteristics. IEEE Trans. Instrum. Meas. 2025, 74, 3521111. [Google Scholar] [CrossRef]
- Zhu, Y.; Li, G.; Tang, S.; Wang, R.; Su, H.; Wang, C. Acoustic signal-based fault detection of hydraulic piston pump using a particle swarm optimization enhancement CNN. Appl. Acoust. 2022, 192, 108718. [Google Scholar] [CrossRef]
- Zhang, X.; Liu, J.; Zhang, X.; Lu, Y. Self-supervised graph feature enhancement and scale attention for mechanical signal node-level representation and diagnosis. Adv. Eng. Inform. 2025, 65, 103197. [Google Scholar] [CrossRef]
- Zhang, X.; Zhang, X.; Liu, J.; Wu, B.; Hu, Y. Graph features dynamic fusion learning driven by multi-head attention for large rotating machinery fault diagnosis with multi-sensor data. Eng. Appl. Artif. Intell. 2023, 125, 106601. [Google Scholar] [CrossRef]
- Zhang, X.; Liu, J.; Zhang, X.; Lu, Y. Multiscale channel attention-driven graph dynamic fusion learning method for robust fault diagnosis. IEEE Trans. Ind. Inform. 2024, 20, 11002–11013. [Google Scholar] [CrossRef]
- Jia, L.; Chow, T.W.S.; Yuan, Y. GTFE-Net: A Gramian time frequency enhancement CNN for bearing fault diagnosis. Eng. Appl. Artif. Intell. 2023, 119, 105794. [Google Scholar] [CrossRef]
- Peng, D.; Wang, H.; Liu, Z.; Zhang, W.; Zuo, M.J.; Chen, J. Multibranch and multiscale CNN for fault diagnosis of wheelset bearings under strong noise and variable load condition. IEEE Trans. Ind. Inf. 2020, 16, 4949–4960. [Google Scholar] [CrossRef]
- Tang, S.; Zhu, Y.; Yuan, S. An improved convolutional neural network with an adaptable learning rate towards multi-signal fault diagnosis of hydraulic piston pump. Adv. Eng. Inform. 2021, 50, 101406. [Google Scholar] [CrossRef]
- Yang, J.; Duan, H.; Li, L.; Stewart, E.; Huang, J.; Dixon, R. 1D CNN based detection and localisation of defective droppers in railway catenary. Appl. Sci. 2023, 13, 6819. [Google Scholar] [CrossRef]
- Siddique, M.F.; Saleem, F.; Umar, M.; Kim, C.H.; Kim, J.M. A hybrid deep learning approach for bearing fault diagnosis using continuous wavelet transform and attention-enhanced spatiotemporal feature extraction. Sensors 2025, 25, 2712. [Google Scholar] [CrossRef] [PubMed]
- Dong, Z.; Jiang, Y.; Jiao, W.; Zhang, F.; Wang, Z.; Huang, J.; Wang, X.; Zhang, K. Double attention-guided tree-inspired grade decision network: A method for bearing fault diagnosis of unbalanced samples under strong noise conditions. Adv. Eng. Inform. 2025, 64, 103004. [Google Scholar] [CrossRef]
- Lu, B.; Zhang, Y.; Sun, Q.; Li, M.; Li, P. A novel multidomain contrastive-coding-based open-set domain generalization framework for machinery fault diagnosis. IEEE Trans. Ind. Inf. 2024, 20, 6369–6381. [Google Scholar] [CrossRef]
- Qian, Q.; Qin, Y.; Luo, J.; Wang, Y.; Wu, F. Deep discriminative transfer learning network for cross-machine fault diagnosis. Mech. Syst. Signal Process. 2023, 186, 109884. [Google Scholar] [CrossRef]
- Gholami, B.; Sahu, P.; Rudovic, O.; Bousmalis, K.; Pavlovic, V. Unsupervised multi-target domain adaptation: An information theoretic approach. IEEE Trans. Image Process. 2020, 29, 3993–4002. [Google Scholar] [CrossRef]
- Shermin, T.; Lu, G.; Teng, S.W.; Murshed, M.; Sohel, F. Adversarial network with multiple classifiers for open set domain adaptation. IEEE Trans. Multimed. 2021, 23, 2732–2744. [Google Scholar] [CrossRef]
- Wang, R.; Huang, W.; Wang, J.; Shen, C.; Zhu, Z. Multisource domain feature adaptation network for bearing fault diagnosis under time-varying working conditions. IEEE Trans. Instrum. Meas. 2022, 71, 3511010. [Google Scholar] [CrossRef]
- An, Y.; Zhang, K.; Chai, Y.; Zhu, Z.; Liu, Q. Gaussian mixture variational-based transformer domain adaptation fault diagnosis method and its application in bearing fault diagnosis. IEEE Trans. Ind. Inf. 2024, 20, 615–625. [Google Scholar] [CrossRef]
- Wang, Q.; Gao, J.; Li, X. Weakly supervised adversarial domain adaptation for semantic segmentation in urban scenes. IEEE Trans. Image Process. 2019, 28, 4376–4386. [Google Scholar] [CrossRef]
- Borgwardt, K.M.; Gretton, A.; Rasch, M.J.; Kriegel, H.; Schölkopf, B.; Smola, A.J. Integrating structured biological data by kernel maximum mean discrepancy. Bioinformatics 2006, 22, e49–e57. [Google Scholar] [CrossRef]
- Qian, Q.; Wang, Y.; Zhang, T.; Qin, Y. Maximum mean square discrepancy: A new discrepancy representation metric for mechanical fault transfer diagnosis. Knowl.-Based Syst. 2023, 276, 110748. [Google Scholar] [CrossRef]
- Song, Q.; Jiang, X.; Liu, J.; Shi, J.; Zhu, Z. Contrast-assisted domain-specificity-removal network for semi-supervised generalization fault diagnosis. IEEE Trans. Neural Netw. Learn. Syst. 2025, 36, 5403–5416. [Google Scholar] [CrossRef]
- Korevaar, S.; Tennakoon, R.; Bab-Hadiashar, A. Failure to achieve domain invariance with domain generalization algorithms: An analysis in medical imaging. IEEE Access 2023, 11, 39351–39372. [Google Scholar] [CrossRef]
- Li, S.; Zhao, Q.; Kivaisi, A.R.; Wang, L.; Liu, W.; Wang, X. Causality-aware single-source domain generalization for transferring knowledge to multiple, unknown target domain. Tsinghua Sci. Technol. 2026, 31, 518–529. [Google Scholar] [CrossRef]
- Mai, J.; Gao, C.; Bao, J. Domain generalization through data augmentation: A survey of methods, applications, and challenges. Mathematics 2025, 13, 824. [Google Scholar] [CrossRef]
- Lei, X.; Shao, H.; Tang, Z.; Xu, S.; Zhong, D. Cross-domain remaining useful life prediction under unseen condition via mixed data and domain generalization. Measurement 2025, 244, 116451. [Google Scholar] [CrossRef]
- Gao, Z.; Guo, S.; Xu, C.; Zhang, J.; Gong, M.; Del Ser, J.; Li, S. Multi-domain adversarial variational Bayesian inference for domain generalization. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 3081–3093. [Google Scholar] [CrossRef]
- Xiao, Y.; Shao, H.; Yan, S.; Wang, J.; Peng, Y.; Liu, B. Domain generalization for rotating machinery fault diagnosis: A survey. Adv. Eng. Inform. 2025, 64, 103063. [Google Scholar] [CrossRef]
- Guan, W.; Wang, S.; Chen, Z.; Wang, G.; Liu, Z.; Cui, D.; Mao, Y. Domain generalization network based on inter-domain multivariate linearization for intelligent fault diagnosis. Reliab. Eng. Syst. Saf. 2025, 261, 111055. [Google Scholar] [CrossRef]
- Pu, H.; Teng, S.; Xiao, D.; Xu, L.; Luo, J.; Qin, Y. Domain generalization for machine compound fault diagnosis by domain-relevant joint distribution alignment. Adv. Eng. Inform. 2024, 62, 102771. [Google Scholar] [CrossRef]
- Fan, C.; Wang, P.; Ma, H.; Zhang, Y.; Ma, Z.; Yin, X.; Zhang, X.; Zhao, S. Performance degradation assessment of rolling bearing cage failure based on enhanced CycleGAN. Expert Syst. Appl. 2024, 255, 124697. [Google Scholar] [CrossRef]
- Guo, Y.; Li, X.; Zhang, J.; Cheng, Z. SDCGAN: A CycleGAN-based single-domain generalization method for mechanical fault diagnosis. Reliab. Eng. Syst. Saf. 2025, 258, 110854. [Google Scholar] [CrossRef]
- Sandfort, V.; Yan, K.; Pickhardt, P.J.; Summers, R.M. Data augmentation using generative adversarial networks (CycleGAN) to improve generalizability in CT segmentation tasks. Sci. Rep. 2019, 9, 16884. [Google Scholar] [CrossRef]
- Chao, K.; Shih, Y.; Lee, C. A novel sensor-based label-smoothing technique for machine state degradation. IEEE Sens. J. 2023, 23, 10879–10888. [Google Scholar] [CrossRef]
- Wang, P.; Song, Y.; Wang, X.; Xiang, Q. MD-BiMamba: An aero-engine inter-shaft bearing fault diagnosis method based on Mamba with modal decomposition and bidirectional features fusion strategy. Measurement 2025, 242, 115870. [Google Scholar] [CrossRef]
- Guo, Z.; Yu, K.; Jolfaei, A.; Ding, F.; Zhang, N. Fuz-Spam: Label smoothing-based fuzzy detection of spammers in Internet of Things. IEEE Trans. Fuzzy Syst. 2022, 30, 4543–4554. [Google Scholar] [CrossRef]
- Ren, H.; Zhao, Y.; Zhang, Y.; Sun, W. Learning label smoothing for text classification. PeerJ Comput. Sci. 2024, 10, e2005. [Google Scholar] [CrossRef]
- Liu, Z.; Xiong, X.; Li, Y.; Yu, Y.; Lu, J.; Zhang, S.; Xiong, F. HyGloadAttack: Hard-label black-box textual adversarial attacks via hybrid optimization. Neural Netw. 2024, 178, 106461. [Google Scholar] [CrossRef]
- Tu, F.; Zhang, T.; Liu, T.; Zhang, D.; Yang, S. A novel acoustic-based framework for compound fault diagnosis in rotating machinery with limited samples. IEEE Trans. Instrum. Meas. 2025, 74, 3521415. [Google Scholar] [CrossRef]
- Tu, F.; Yang, S.; Yang, J. Fault diagnosis for rotating machine based on Mel spectrogram and residual neural network. In Proceedings of the International Conference on Condition Monitoring and Asset Management, Oxford, UK, 17–20 June 2024; pp. 1–12. [Google Scholar]
- Wang, D.; Zhang, M.; Xu, Y.; Lu, W.; Yang, J.; Zhang, T. Metric-based meta-learning model for few-shot fault diagnosis under multiple limited data conditions. Mech. Syst. Signal Process. 2021, 155, 107510. [Google Scholar] [CrossRef]
- Zhong, X.; Wang, Q.; Liu, D.; Liao, J.; Yang, R.; Duan, S.; Ding, G.; Sun, J. A deep domain adaptation framework with correlation alignment for EEG-based motor imagery classification. Comput. Biol. Med. 2023, 163, 107235. [Google Scholar] [CrossRef]
- Liu, M.; Cheng, Z.; Yang, Y.; Hu, N.; Yang, Y. Multi-target domain adaptation intelligent diagnosis method for rotating machinery based on multi-source attention mechanism and mixup feature augmentation. Reliab. Eng. Syst. Saf. 2024, 250, 110298. [Google Scholar] [CrossRef]
- Song, Y.; Zhuang, Y.; Wang, D.; Li, Y.; Zhang, Y. Fault diagnosis in rolling bearings using multi-gaussian attention and covariance loss for single domain generalization. IEEE Trans. Instrum. Meas. 2025, 74, 3516910. [Google Scholar] [CrossRef]












| Layer Type | Output Size | Kernel Size | Padding Type/Stride | Activation Type | Pool Type/BN |
|---|---|---|---|---|---|
| Input | 1 × 4096 × 1 | ||||
| Conv1D_1 | 1 × 4096 × 32 | 1 × 16 | same | ReLU | BN |
| Pool1D_1 | 1 × 1024 × 32 | 1 × 4 | 4 | Maxpool | |
| Conv1D_2 | 1 × 1024 × 64 | 1 × 8 | same | ReLU | BN |
| Pool1D_2 | 1 × 256 × 64 | 1 × 4 | 4 | Maxpool | |
| Conv1D_3 | 1 × 256 × 128 | 1 × 4 | same | ReLU | BN |
| Pool1D_3 | 1 × 1 × 128 | 1 × 256 | Avgpool | ||
| Output | 8 × 1 | Softmax |
| Domain | Rotational Speed | Load Torque | Number of Samples Per Class | Total Number of Samples |
|---|---|---|---|---|
| A0 | 900 r/min | 1 Nm | 400 | 3200 |
| A1 | 900 r/min | 1 Nm | 400 | 3200 |
| B0 | 1800 r/min | 2 Nm | 400 | 3200 |
| B1 | 1800 r/min | 2 Nm | 400 | 3200 |
| C0 | 2700 r/min | 3 Nm | 400 | 3200 |
| C1 | 2700 r/min | 3 Nm | 400 | 3200 |
| Scenario | Tasks | Source Domain | Sample Size | Target Domain | Sample Size |
|---|---|---|---|---|---|
| Single-source domain | G1 | B0 | 3200 | A0 | 3200 |
| G2 | B0 | 3200 | C0 | 3200 | |
| G3 | B1 | 3200 | A1 | 3200 | |
| G4 | B1 | 3200 | C1 | 3200 | |
| Dual-source domain | Q1 | A0, A1 | 6400 | B0, B1 | 6400 |
| Q2 | A0, A1 | 6400 | C0, C1 | 6400 | |
| Q3 | B0, B1 | 6400 | A0, A1 | 6400 | |
| Q4 | B0, B1 | 6400 | C0.C1 | 6400 | |
| Q5 | C0, C1 | 6400 | A0, A1 | 6400 | |
| Q6 | C0, C1 | 6400 | B0, B1 | 6400 |
| Model | Q1 | Q2 | Q3 | Q4 | Q5 | Q6 | Average | Avg-F1 |
|---|---|---|---|---|---|---|---|---|
| Baseline | 66.07 ± 2.91 | 53.17 ± 2.41 | 40.14 ± 2.58 | 86.02 ± 2.42 | 43.34 ± 2.96 | 79.03 ± 1.75 | 61.30 ± 2.51 | 54.27 ± 3.67 |
| DANN | 70.90 ± 2.46 | 53.66 ± 4.76 | 42.23 ± 4.53 | 86.46 ± 2.25 | 50.39 ± 3.28 | 86.36 ± 1.27 | 65.00 ± 3.10 | 61.26 ± 2.22 |
| CORAL | 74.28 ± 2.37 | 62.80 ± 2.57 | 52.28 ± 1.91 | 87.41 ± 2.41 | 53.84 ± 3.79 | 86.10 ± 1.33 | 69.45 ± 2.40 | 67.45 ± 3.68 |
| MMD | 77.29 ± 2.64 | 60.75 ± 3.59 | 50.52 ± 1.68 | 88.21 ± 2.88 | 46.63 ± 3.98 | 84.96 ± 1.09 | 68.06 ± 3.03 | 65.05 ± 1.88 |
| Mixup | 69.05 ± 4.36 | 52.04 ± 4.50 | 55.21 ± 5.22 | 88.40 ± 2.30 | 51.80 ± 3.65 | 87.29 ± 2.38 | 67.30 ± 3.74 | 64.37 ± 2.67 |
| SDCGAN | 76.54 ± 3.87 | 58.27 ± 5.65 | 61.21 ± 2.78 | 89.78 ± 3.45 | 47.29 ± 4.26 | 88.89 ± 3.38 | 70.66 ± 3.90 | 67.88 ± 3.67 |
| MGA-SDG | 81.67 ± 2.28 | 53.29 ± 2.87 | 72.14 ± 2.32 | 91.54 ± 1.49 | 45.87 ± 2.66 | 83.52 ± 2.89 | 71.34 ± 2.42 | 68.27 ± 2.69 |
| ALRN | 90.11 ± 3.32 | 80.43 ± 2.79 | 92.64 ± 2.11 | 98.47 ± 0.55 | 65.65 ± 1.27 | 98.85 ± 0.31 | 87.69 ± 1.73 | 86.57 ± 1.27 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Du, Q.; Yao, J.; Yang, J.; Tu, F.; Yang, S. Adaptive Label Refinement Network for Domain Generalization in Compound Fault Diagnosis. Sensors 2025, 25, 6939. https://doi.org/10.3390/s25226939
Du Q, Yao J, Yang J, Tu F, Yang S. Adaptive Label Refinement Network for Domain Generalization in Compound Fault Diagnosis. Sensors. 2025; 25(22):6939. https://doi.org/10.3390/s25226939
Chicago/Turabian StyleDu, Qiyan, Jiajia Yao, Jingyuan Yang, Fengmiao Tu, and Suixian Yang. 2025. "Adaptive Label Refinement Network for Domain Generalization in Compound Fault Diagnosis" Sensors 25, no. 22: 6939. https://doi.org/10.3390/s25226939
APA StyleDu, Q., Yao, J., Yang, J., Tu, F., & Yang, S. (2025). Adaptive Label Refinement Network for Domain Generalization in Compound Fault Diagnosis. Sensors, 25(22), 6939. https://doi.org/10.3390/s25226939

