Bearing Fault Diagnosis Based on a Depthwise Separable Atrous Convolution and ASPP Hybrid Network
Abstract
1. Introduction
2. Theoretical Background
2.1. Depthwise Separable Atrous Convolution (DSAC)
2.2. Dual-Channel Fault Diagnosis
2.2.1. Atrous Spatial Pyramid Pooling (ASPP)
2.2.2. Deep Residual Shrinkage Network (DRSN)
2.2.3. Attention Mechanisms
2.2.4. Integrated Architecture of the DSAC-ASPP Hybrid Network
3. Experimental Results
3.1. Experimental Setup
3.2. Data Preprocessing
3.3. Experimental Method
3.4. Results and Evaluation
3.4.1. Comparison of Training Convergence and Classification Performance
3.4.2. Validation of Dual-Channel Design and Complementary Information
3.4.3. Confusion Matrix Analysis
3.4.4. t-SNE Visualization Analysis
3.4.5. Generalization Analysis
4. Conclusions and Future Work
4.1. Conclusions
- Atrous spatial pyramid pooling (ASPP) effectively captures multi-scale features across varying receptive fields by employing dilated convolutions with multiple dilation rates. Combined with global pooling and 1 × 1 convolutions, it achieves comprehensive global information integration and feature fusion, significantly enhancing the network’s representational capability for diverse bearing fault patterns.
- Dual-Channel Fault Diagnosis, integrating deep separable convolution, channel–spatial attention, and SimAM mechanisms, enhances the representation of input features while improving training stability and mitigating issues such as gradient explosion.
- On laboratory-collected bearing vibration and acoustic signal datasets, the DSAC–ASPP model achieves superior diagnostic performance. It not only outperforms conventional CNN backbones but also surpasses several state-of-the-art methods, including ViT-FDM, LightNAS, and MFACNN, in terms of overall accuracy and robustness. Furthermore, under varying intensities of superimposed Gaussian white noise, the proposed model maintains the highest classification accuracy with minimal performance degradation, demonstrating its strong generalization capability and noise robustness. This makes it suitable for reliable multi-source signal fault diagnosis under complex operating conditions. Moreover, this performance is achieved with high efficiency and low inference latency. This presents a favorable trade-off compared to computationally heavy foundation models, which typically require orders of magnitude more parameters and longer inference times, making our approach more amenable to real-world, resource-constrained and real-time deployment.
4.2. Future Work
- Real-world data validation and collection. Vibration and acoustic signals will be collected from operational rotating machinery under real industrial conditions. The method will be evaluated using naturally progressing faults and diverse operational profiles to better assess its practicality.
- Adaptation to variable operating conditions. Domain adaptation and transfer learning techniques will be investigated to improve the model’s robustness under non-stationary conditions, such as changing rotational speeds, fluctuating loads, and complex noise environments.
- Lightweight and efficient deployment. Model compression techniques (e.g., pruning, quantization) and optimized time–frequency transformation pipelines will be explored to reduce computational overhead and memory footprint, facilitating real-time or edge-device deployment without compromising diagnostic accuracy.
- Extension to complex fault scenarios. The fault label space will be expanded to include compound faults and early-stage degradation patterns. Fine-grained health assessment beyond discrete fault classification will also be studied to enable more predictive maintenance capabilities.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Chafic, C.E.; Hocine, K.; Salima, B.; Pont, M.; Hay, M. Prediction of fatigue damage and spalling in a multilayered journal bearing shell. Tribol. Int. 2022, 175, 107850. [Google Scholar] [CrossRef]
- Mahendra, R.S.; Shivdayal, P. Crack propagation and fatigue life estimation of spur gear with and without spalling failure. Theor. Appl. Fract. Mech. 2023, 127, 104020. [Google Scholar] [CrossRef]
- Gu, J.; Congalton, G.R. Assessing the impact of mixed pixel proportion training data on SVM-based remote sensing classification: A simulated study. Remote Sens. 2025, 17, 1274. [Google Scholar] [CrossRef]
- Li, X.; Wu, W.; Zhu, F.; Guan, S.; Zhang, W.; Li, Z. FA-Unet: A deep learning method with fusion of frequency domain features for fruit leaf disease identification. Horticulturae 2025, 11, 783. [Google Scholar] [CrossRef]
- Wang, Z.; Wang, S.; Cheng, Y. Fault feature extraction of parallel-axis gearbox based on IDBO-VMD and t-SNE. Appl. Sci. 2024, 14, 289. [Google Scholar] [CrossRef]
- Li, W.; Cai, H.; Yang, X.; Xue, Y.; Ye, J.; Hu, X. Dual-channel parallel multimodal feature fusion for bearing fault diagnosis. Machines 2025, 13, 950. [Google Scholar] [CrossRef]
- Spirto, M.; Melluso, F.; Nicolella, A.; Malfi, P.; Cosenza, C.; Savino, S.; Niola, V. A Comparative Study between SDP-CNN and Time–Frequency-CNN based Approaches for Fault Detection. J. Dyn. Monit. Diagn. 2025, early access. [Google Scholar] [CrossRef]
- Chen, X.; Zhang, B.; Gao, D. Bearing fault diagnosis based on multi-scale CNN and LSTM model. J. Intell. Manuf. 2021, 32, 597–613. [Google Scholar] [CrossRef]
- Zhou, Z.; Ai, Q.; Lou, P.; Hu, J.; Yan, J. A novel method for rolling bearing fault diagnosis based on Gramian angular field and CNN-ViT. Sensors 2024, 24, 3967. [Google Scholar] [CrossRef]
- Cui, K.; Liu, M.; Meng, Y. A new fault diagnosis of rolling bearing on FFT image coding and L-CNN. Meas. Sci. Technol. 2024, 35, 076108. [Google Scholar] [CrossRef]
- Jiang, G.; Li, D.; Feng, K.; Li, Y.; Zheng, J.; Ni, Q.; Li, H. Rolling Bearing Fault Diagnosis Based On Convolutional Capsule Network. J. Dyn. Monit. Diagn. 2023, 2, 275–289. [Google Scholar] [CrossRef]
- Song, B.; Liu, Y.; Fang, J.; Liu, W.; Zhong, M.; Liu, X. An optimized CNN–BiLSTM network for bearing fault diagnosis under multiple working conditions with limited training samples. Neurocomputing 2024, 574, 127284. [Google Scholar] [CrossRef]
- Jia, L.; Chow, T.W.S.; Yuan, Y. GTFE-Net: A Gramian time frequency enhancement CNN for bearing fault diagnosis. Eng. Appl. Artif. Intell. 2023, 119, 105794. [Google Scholar] [CrossRef]
- Ruan, D.; Jin, W.; Yang, J.; Gühmann, C. CNN parameter design based on fault signal analysis and its application in bearing fault diagnosis. Adv. Eng. Inform. 2023, 55, 101877. [Google Scholar] [CrossRef]
- Gao, S.; Zhao, K.; Wen, T. Learning criteria of normalized regressor-based adaptive observer for actuator fault diagnosis of disturbed systems. Nonlinear Dyn. 2024, 113, 9551–9576. [Google Scholar] [CrossRef]
- Xu, Z.; Chen, X.; Li, Y.; Xu, J. Hybrid Multimodal Feature Fusion with Multi-Sensor for Bearing Fault Diagnosis. Sensors 2024, 24, 1792. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Chen, J.; Wang, C.; Peng, C.; Xuan, J.; Shi, T.; Zuo, M. CNC-VLM: An RLHF-optimized industrial large vision-language model with multimodal learning for imbalanced CNC fault detection. Mech. Syst. Signal Process. 2026, 245, 113838. [Google Scholar] [CrossRef]
- Wei, C.; Fan, H. Attention mechanism-enhanced multi-scale separable atrous convolution method for motor bearing fault diagnosis. In Proceedings of the 2025 International Conference on Electrical Automation and Artificial Intelligence (ICEAAI), Guangzhou, China, 10–12 January 2025; IEEE: New York, NY, USA, 2025; pp. 969–973. [Google Scholar] [CrossRef]
- Mbiethieu, C.; Tsopze, N.; Mephu Nguifo, E. XLITE-Unet: Extremely light and efficient deep learning architecture with selective atrous and axial depthwise convolution for image segmentation. Comput. Vis. Image Underst. 2025, 262, 104543. [Google Scholar] [CrossRef]
- Deng, W.; Zhang, Y.; Yu, H.; Li, H. Knowledge graph embedding based on dynamic adaptive atrous convolution and attention mechanism for link prediction. Inf. Process. Manag. 2024, 61, 103642. [Google Scholar] [CrossRef]
- Wang, Y.; Yang, M.; Zhang, Y.; Xu, Z.; Zhou, Z.; Li, H. A bearing fault diagnosis model based on deformable atrous convolution and squeeze-and-excitation aggregation. IEEE Trans. Instrum. Meas. 2021, 70, pp,1–10. [Google Scholar] [CrossRef]
- Li, S.; Wang, H.; Song, L.; Cui, L.; Li, X. An adaptive data fusion strategy for fault diagnosis based on the convolutional neural network. Measurement 2020, 165, 108122. [Google Scholar] [CrossRef]
- Chen, J.; Zhang, Q.; Zhang, S.; Peng, L.; Wen, J. A lightweight model for bearing fault diagnosis based on Gramian angular field and coordinate attention. Machines 2022, 10, 282. [Google Scholar] [CrossRef]
- Cao, Z.; Xu, X.; Hu, B.; Zhou, M. Rapid detection of blind roads and crosswalks by using a lightweight semantic segmentation network. IEEE Trans. Intell. Transp. Syst. 2021, 22, 6188–6197. [Google Scholar] [CrossRef]
- Xie, S.; Wang, J.; Li, Y.; Yang, L. Bearing fault diagnosis method based on improved meta-ResNet and sample weighting under noisy labels. Struct. Health Monit. 2025, 24, 3707–3727. [Google Scholar] [CrossRef]
- Li, P.; Wang, W.; Yang, X.; Liu, S.; Zhang, L.; Cheng, Y. Intelligent fault diagnosis of rolling bearings based on wavelet transform and improved ResNet under noisy labels and environments. Eng. Appl. Artif. Intell. 2022, 115, 105269. [Google Scholar] [CrossRef]
- Qiu, G.; Nie, Y.; Peng, Y.; Huang, P.; Chen, J.; Gu, Y. A variable-speed-condition fault diagnosis method for crankshaft bearing in the RV reducer with WSO-VMD and ResNet-SWIN. Qual. Reliab. Eng. Int. 2024, 40, 2321–2347. [Google Scholar] [CrossRef]
- Dong, H.; Zhang, R.; Mi, Y. Fault diagnosis on bearing of electric motor based on DRSN-BIGRU using stator current signals. J. Phys. Conf. Ser. 2025, 3079, 012050. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhang, C.; Zhang, X.; Chen, L.; Shi, H.; Li, H. A self-adaptive DRSN-GPReLU for bearing fault diagnosis under variable working conditions. Meas. Sci. Technol. 2022, 33, 124005. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhang, C.; Chen, L.; Li, H. LGMA-DRSN: A lightweight convex global multi-attention deep residual shrinkage network for fault diagnosis. Meas. Sci. Technol. 2023, 34, 115011. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhang, C.; Chen, L.; Li, H.; Shi, H. GMA-DRSNs: A novel fault diagnosis method with global multi-attention deep residual shrinkage networks. Measurement 2022, 196, 111203. [Google Scholar] [CrossRef]
- Hu, H.; Jiang, A.; Wu, X.; An, Z.; Zhang, S. Multi-condition fault diagnosis method for rotating machinery based on whale optimization variational mode decomposition algorithm and deep residual network. Meas. Sci. Technol. 2025, 36, 076118. [Google Scholar] [CrossRef]
- Luo, L.; Liu, Y. Fault diagnosis of planetary gear train crack based on DC-DRSN. Appl. Sci. 2024, 14, 6873. [Google Scholar] [CrossRef]
- Guo, S.; Yang, T.; Gao, W.; Zhang, C.; Zhang, Y. An intelligent fault diagnosis method for bearings with variable rotating speed based on Pythagorean spatial pyramid pooling CNN. Sensors 2018, 18, 3857. [Google Scholar] [CrossRef]
- Yan, X.; Zhang, Y.; Jin, Q. Chemical process fault diagnosis based on improved ResNet fusing CBAM and SPP. IEEE Access 2023, 11, 46678–46690. [Google Scholar] [CrossRef]
- Yang, X.; Yunlei, F.; Hui, L. Lightweight semantic segmentation of complex structural damage recognition for actual bridges. Struct. Health Monit. 2023, 22, 3250–3269. [Google Scholar] [CrossRef]
- Chen, J.; Wang, H.; Su, B.; Li, Z. Rolling bearing fault diagnosis based on DRS frequency spectrum image and improved DQN. Trans. Can. Soc. Mech. Eng. 2024, 48, 437–446. [Google Scholar] [CrossRef]
- Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of Machine Learning to Machine Fault Diagnosis: A Review and Roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
- Yao, D.; Zhou, T.; Yang, J.; Meng, C.; Huan, B. Fault diagnosis of rolling bearings based on dynamic convolution and dual-channel feature fusion under variable working conditions. Meas. Sci. Technol. 2024, 35, 066110. [Google Scholar] [CrossRef]
- Hua, C.; Luo, K.; Wu, Y.; Shi, R. YOLO-ABD: A Multi-Scale Detection Model for Pedestrian Anomaly Behavior Detection. Symmetry 2024, 16, 1003. [Google Scholar] [CrossRef]

















| Method | Core Feature Extraction | Multi-Modal Signal Processing | Noise Robustness Strategy | Model Efficiency/Lightweight Design | Primary Focus |
|---|---|---|---|---|---|
| Traditional CNNs | Stacked standard convolutions | Typically single-modal | Relies on data augmentation & deep architecture | High parameter count, computationally intensive | Generic feature learning |
| Lightweight CNNs | Depthwise separable convolutions | Typically single-modal | Limited inherent robustness | High | Deployment efficiency |
| Advanced Diagnostic Networks | Time–frequency enhancement | Single-modal, focused on pre-processing | Pre-processing denoising via GNR strategy | Medium | Feature enhancement in noisy environments |
| Attention-Based Networks | Residual shrinkage + multi-scale attention | Single-modal | High | Lower | Strong noise suppression |
| Multi-Modal Fusion Networks | Multi-input fusion | Multi-Modal Signal Processing | Not explicitly emphasized | Lower | Multi-source information fusion |
| DSAC-ASPP | Depthwise separable atrous convolution (DSAC) + ASPP | Multi-Modal Sig-nal Processing | High | High | Integrated multi-modality, strong robustness, and high computational efficiency |
| Condition | Training Samples | Validation Samples | Test Samples | Label |
|---|---|---|---|---|
| Normal | 1200 | 400 | 400 | N |
| Inner ring fault | 1200 | 400 | 400 | IF |
| Outer ring fault | 1200 | 400 | 400 | OF |
| Rolling element fault | 1200 | 400 | 400 | BF |
| Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) | Parameters (M) | GFLOPs |
|---|---|---|---|---|---|---|
| DSAC-ASPP | 98.21 ± 0.28 | 98.18 ± 0.31 | 99.23 ± 0.25 | 99.20 ± 0.27 | 1.2 | 0.45 |
| Inseption_v3 | 97.80 ± 0.55 | 97.65 ± 0.60 | 97.90 ± 0.58 | 97.77 ± 0.59 | 23.8 | 5.2 |
| VGG_16 | 96.50 ± 0.85 | 96.40 ± 0.90 | 96.55 ± 0.88 | 96.47 ± 0.89 | 138.4 | 15.5 |
| Resnet_50 | 95.80 ± 0.95 | 95.70 ± 1.00 | 95.85 ± 0.98 | 95.77 ± 0.99 | 25.6 | 4.1 |
| ViT-Based Model | 96.75 ± 0.45 | 97.60 ± 0.50 | 98.85 ± 0.48 | 98.72 ± 0.49 | 85.7 | 16.3 |
| LightNAS Model | 96.95 ± 0.35 | 97.88 ± 0.40 | 98.99 ± 0.38 | 98.93 ± 0.39 | 3.5 | 1.8 |
| MFACNN | 96.50 ± 0.60 | 96.45 ± 0.65 | 98.55 ± 0.62 | 98.50 ± 0.63 | 5.8 | 2.5 |
| Model Configuration | Accuracy (%) | vs. Full Dual-Channel Mode |
|---|---|---|
| full dual-channel model | 94.31 ± 0.91 | — |
| Vibration-Only | 90.15 ± 1.35 | −4.16 |
| Acoustic-Only (with DRSN) | 92.80 ± 1.28 | −1.51 |
| Acoustic-Only (with standard Conv) | 85.22 ± 1.52 | −9.09 |
| Model | Accuracy Under Gaussian White Noise (%) | |||
|---|---|---|---|---|
| −8 dB | −4 dB | −2 dB | 0 dB | |
| DSAC-ASPP | 79.64 ± 1.06 | 85.21 ± 0.73 | 89.38 ± 0.46 | 94.31 ± 0.21 |
| Inception_v3 | 75.13 ± 0.89 | 77.64 ± 0.65 | 81.12 ± 0.52 | 91.86 ± 0.31 |
| mobileNet_v2 | 66.37 ± 1.52 | 71.43 ± 1.15 | 76.94 ± 0.83 | 88.46 ± 0.46 |
| VGG-16 | 73.11 ± 1.64 | 75.48 ± 1.03 | 79.32 ± 0.75 | 90.17 ± 0.19 |
| ResNet50 | 54.62 ± 1.86 | 60.15 ± 1.24 | 66.75 ± 0.84 | 75.37 ± 0.26 |
| ViT-FDM | 77.50 ± 1.20 | 83.15 ± 0.95 | 87.20 ± 0.70 | 93.05 ± 0.40 |
| LightNAS | 75.80 ± 1.35 | 80.92 ± 1.10 | 85.41 ± 0.85 | 91.88 ± 0.55 |
| MFACNN | 74.25 ± 1.50 | 79.36 ± 1.25 | 83.97 ± 1.00 | 90.75 ± 0.60 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Gu, X.; Liu, C.; Li, J.; Yu, X.; Tian, Y. Bearing Fault Diagnosis Based on a Depthwise Separable Atrous Convolution and ASPP Hybrid Network. Machines 2026, 14, 93. https://doi.org/10.3390/machines14010093
Gu X, Liu C, Li J, Yu X, Tian Y. Bearing Fault Diagnosis Based on a Depthwise Separable Atrous Convolution and ASPP Hybrid Network. Machines. 2026; 14(1):93. https://doi.org/10.3390/machines14010093
Chicago/Turabian StyleGu, Xiaojiao, Chuanyu Liu, Jinghua Li, Xiaolin Yu, and Yang Tian. 2026. "Bearing Fault Diagnosis Based on a Depthwise Separable Atrous Convolution and ASPP Hybrid Network" Machines 14, no. 1: 93. https://doi.org/10.3390/machines14010093
APA StyleGu, X., Liu, C., Li, J., Yu, X., & Tian, Y. (2026). Bearing Fault Diagnosis Based on a Depthwise Separable Atrous Convolution and ASPP Hybrid Network. Machines, 14(1), 93. https://doi.org/10.3390/machines14010093

