Figure 1.
Convergence curves of the fitted values for the selected functions under different optimization methods. (a) Convergence curve of the F1 test function; (b) convergence curve of the F2 test function; (c) convergence curve of the F3 test function; (d) convergence curve of the F4 test function.
Figure 1.
Convergence curves of the fitted values for the selected functions under different optimization methods. (a) Convergence curve of the F1 test function; (b) convergence curve of the F2 test function; (c) convergence curve of the F3 test function; (d) convergence curve of the F4 test function.
Figure 2.
Flow chart of WAA-optimized FMD.
Figure 2.
Flow chart of WAA-optimized FMD.
Figure 3.
Model structure diagram of Swin Transformer.
Figure 3.
Model structure diagram of Swin Transformer.
Figure 4.
Structural diagram of LGAF-Swin Transformer model.
Figure 4.
Structural diagram of LGAF-Swin Transformer model.
Figure 5.
Structure diagram of depthwise convolution.
Figure 5.
Structure diagram of depthwise convolution.
Figure 6.
Structure diagram of pointwise convolution.
Figure 6.
Structure diagram of pointwise convolution.
Figure 7.
Structure diagram of local–global attention.
Figure 7.
Structure diagram of local–global attention.
Figure 8.
Fault Diagnosis Model of LGAF-Swin Transformer.
Figure 8.
Fault Diagnosis Model of LGAF-Swin Transformer.
Figure 9.
Convergence curves of the algorithm optimizing FMD under different SNRs. (a) Optimization convergence curves at −5 dB; (b) optimization convergence curves at −7 dB; (c) optimization convergence curves at −9 dB; (d) optimization convergence curves at −11 dB; (e) optimization convergence curves at −13 dB.
Figure 9.
Convergence curves of the algorithm optimizing FMD under different SNRs. (a) Optimization convergence curves at −5 dB; (b) optimization convergence curves at −7 dB; (c) optimization convergence curves at −9 dB; (d) optimization convergence curves at −11 dB; (e) optimization convergence curves at −13 dB.
Figure 10.
Time domain diagrams of simulated inner raceway faults under different SNRs. (a) Waveform diagram of signal impact; (b) time domain plot with −5 dB noise added; (c) time domain plot with −7 dB noise added; (d) time domain plot with −9 dB noise added; (e) time domain plot with −11 dB noise added; (f) time domain plot with −13 dB noise added.
Figure 10.
Time domain diagrams of simulated inner raceway faults under different SNRs. (a) Waveform diagram of signal impact; (b) time domain plot with −5 dB noise added; (c) time domain plot with −7 dB noise added; (d) time domain plot with −9 dB noise added; (e) time domain plot with −11 dB noise added; (f) time domain plot with −13 dB noise added.
Figure 11.
Noise reduction indicators of various methods under different SNRs. (a) SNR of different methods under various SNRs; (b) MSE of different methods under various SNRs; (c) NCC of different methods under various SNRs.
Figure 11.
Noise reduction indicators of various methods under different SNRs. (a) SNR of different methods under various SNRs; (b) MSE of different methods under various SNRs; (c) NCC of different methods under various SNRs.
Figure 12.
Envelope spectra of optimal components of each noise reduction method under different SNRs. (a) CEEMDAN (−5 dB); (b) CEEMDAN (−9 dB); (c) CEEMDAN (−13 dB); (d) fixed-parameter FMD (−5 dB); (e) fixed-parameter FMD (−9 dB); (f) fixed-parameter FMD (−13 dB); (g) DA-FMD (−5 dB); (h) DA-FMD (−9 dB); (i) DA-FMD (−13 dB); (j) GWO-FMD (−5 dB); (k) GWO-FMD (−9 dB); (l) GWO-FMD (−13 dB); (m) proposed method (−5 dB); (n) proposed method (−9 dB); (o) proposed method (−13 dB).
Figure 12.
Envelope spectra of optimal components of each noise reduction method under different SNRs. (a) CEEMDAN (−5 dB); (b) CEEMDAN (−9 dB); (c) CEEMDAN (−13 dB); (d) fixed-parameter FMD (−5 dB); (e) fixed-parameter FMD (−9 dB); (f) fixed-parameter FMD (−13 dB); (g) DA-FMD (−5 dB); (h) DA-FMD (−9 dB); (i) DA-FMD (−13 dB); (j) GWO-FMD (−5 dB); (k) GWO-FMD (−9 dB); (l) GWO-FMD (−13 dB); (m) proposed method (−5 dB); (n) proposed method (−9 dB); (o) proposed method (−13 dB).
Figure 13.
Structural diagram of the test bench.
Figure 13.
Structural diagram of the test bench.
Figure 14.
Convergence curves of different algorithms optimizing FMD. (a) Inner raceway (0.3 mm); (b) inner raceway (1.0 mm); (c) outer raceway (0.3 mm); (d) outer raceway (1.0 mm).
Figure 14.
Convergence curves of different algorithms optimizing FMD. (a) Inner raceway (0.3 mm); (b) inner raceway (1.0 mm); (c) outer raceway (0.3 mm); (d) outer raceway (1.0 mm).
Figure 15.
Envelope spectrum of the optimal components decomposed by WAA-FMD and GWO-FMD. (a) WAA-FMD (inner raceway fault with a diameter of 0.3 mm); (b) GWO-FMD (inner raceway fault with a diameter of 0.3 mm); (c) WAA-FMD (inner raceway fault with a diameter of 1.0 mm); (d) GWO-FMD (inner raceway fault with a diameter of 1.0 mm); (e) WAA-FMD (outer raceway fault with a diameter of 0.3 mm); (f) GWO-FMD (outer raceway fault with a diameter of 0.3 mm); (g) WAA-FMD (outer raceway fault with a diameter of 1.0 mm); (h) GWO-FMD (outer raceway fault with a diameter of 1.0 mm).
Figure 15.
Envelope spectrum of the optimal components decomposed by WAA-FMD and GWO-FMD. (a) WAA-FMD (inner raceway fault with a diameter of 0.3 mm); (b) GWO-FMD (inner raceway fault with a diameter of 0.3 mm); (c) WAA-FMD (inner raceway fault with a diameter of 1.0 mm); (d) GWO-FMD (inner raceway fault with a diameter of 1.0 mm); (e) WAA-FMD (outer raceway fault with a diameter of 0.3 mm); (f) GWO-FMD (outer raceway fault with a diameter of 0.3 mm); (g) WAA-FMD (outer raceway fault with a diameter of 1.0 mm); (h) GWO-FMD (outer raceway fault with a diameter of 1.0 mm).
Figure 16.
Data graph of sample segmentation.
Figure 16.
Data graph of sample segmentation.
Figure 17.
Transformation of time–frequency graph. (a) Time domain waveform of inner raceway fault (diameter: 0.3 mm); (b) time domain waveform of inner raceway fault (diameter: 1.0 mm); (c) time–frequency representation of inner raceway fault (diameter: 0.3 mm); (d) time–frequency representation of inner raceway fault (diameter: 1.0 mm); (e) time domain waveform of outer raceway fault (diameter: 0.3 mm); (f) time domain waveform of outer raceway fault (diameter: 1.0 mm); (g) time–frequency representation of outer raceway fault (diameter: 0.3 mm); (h) time–frequency representation of outer raceway fault (diameter: 1.0 mm); (i) time domain waveform of the normal state; (j) time–frequency representation of the normal state.
Figure 17.
Transformation of time–frequency graph. (a) Time domain waveform of inner raceway fault (diameter: 0.3 mm); (b) time domain waveform of inner raceway fault (diameter: 1.0 mm); (c) time–frequency representation of inner raceway fault (diameter: 0.3 mm); (d) time–frequency representation of inner raceway fault (diameter: 1.0 mm); (e) time domain waveform of outer raceway fault (diameter: 0.3 mm); (f) time domain waveform of outer raceway fault (diameter: 1.0 mm); (g) time–frequency representation of outer raceway fault (diameter: 0.3 mm); (h) time–frequency representation of outer raceway fault (diameter: 1.0 mm); (i) time domain waveform of the normal state; (j) time–frequency representation of the normal state.
Figure 18.
Comparison of different models. (a) Iteration curve of training loss function; (b) accuracy curve for training set; (c) iteration curve of validation loss function; (d) accuracy curve for validation set.
Figure 18.
Comparison of different models. (a) Iteration curve of training loss function; (b) accuracy curve for training set; (c) iteration curve of validation loss function; (d) accuracy curve for validation set.
Figure 19.
Experimental flowchart.
Figure 19.
Experimental flowchart.
Figure 20.
Images of bearings with different fault types. (a) Image of inner raceway fault; (b) image of outer raceway fault; (c) image of rolling element fault.
Figure 20.
Images of bearings with different fault types. (a) Image of inner raceway fault; (b) image of outer raceway fault; (c) image of rolling element fault.
Figure 21.
Time domain graphs under different fault types. (a) Time domain graph of the inner raceway fault; (b) time domain graph of the outer raceway fault; (c) time domain graph of the rolling element fault.
Figure 21.
Time domain graphs under different fault types. (a) Time domain graph of the inner raceway fault; (b) time domain graph of the outer raceway fault; (c) time domain graph of the rolling element fault.
Figure 22.
Convergence curves of different algorithms optimizing FMD. (a) Inner raceway; (b) outer raceway; (c) rolling element.
Figure 22.
Convergence curves of different algorithms optimizing FMD. (a) Inner raceway; (b) outer raceway; (c) rolling element.
Figure 23.
Envelope spectra of acoustic signals for different fault types after denoising with different methods. (a) Envelope spectrum of the inner raceway fault after denoising using the proposed method; (b) envelope spectrum of the inner raceway fault after denoising using GWO-FMD; (c) envelope spectrum of the outer raceway fault after denoising using the proposed method; (d) envelope spectrum of the outer raceway fault after denoising using GWO-FMD; (e) envelope spectrum of the rolling element fault after denoising using the proposed method; (f) envelope spectrum of the rolling element fault after denoising using GWO-FMD.
Figure 23.
Envelope spectra of acoustic signals for different fault types after denoising with different methods. (a) Envelope spectrum of the inner raceway fault after denoising using the proposed method; (b) envelope spectrum of the inner raceway fault after denoising using GWO-FMD; (c) envelope spectrum of the outer raceway fault after denoising using the proposed method; (d) envelope spectrum of the outer raceway fault after denoising using GWO-FMD; (e) envelope spectrum of the rolling element fault after denoising using the proposed method; (f) envelope spectrum of the rolling element fault after denoising using GWO-FMD.
Figure 24.
HSN and ISEI values of optimal components under different noise reduction methods. (a) HSN and ISEI under the inner raceway fault; (b) HSN and ISEI under the outer raceway fault; (c) HSN and ISEI under the rolling element fault.
Figure 24.
HSN and ISEI values of optimal components under different noise reduction methods. (a) HSN and ISEI under the inner raceway fault; (b) HSN and ISEI under the outer raceway fault; (c) HSN and ISEI under the rolling element fault.
Figure 25.
Comparison of different models under experimental datasets. (a) Iteration curve of training set loss function; (b) accuracy curve for training set; (c) iteration curve of training set loss function; (d) accuracy curve for training set.
Figure 25.
Comparison of different models under experimental datasets. (a) Iteration curve of training set loss function; (b) accuracy curve for training set; (c) iteration curve of training set loss function; (d) accuracy curve for training set.
Figure 26.
Confusion matrices for different models under test sets. (a) Confusion matrix of LGAF-Swin Transformer; (b) confusion matrix of Swin Transformer; (c) confusion matrix of ResNet; (d) confusion matrix of Lenet; (e) confusion matrix of CNN.
Figure 26.
Confusion matrices for different models under test sets. (a) Confusion matrix of LGAF-Swin Transformer; (b) confusion matrix of Swin Transformer; (c) confusion matrix of ResNet; (d) confusion matrix of Lenet; (e) confusion matrix of CNN.
Figure 27.
Ten tests using different models on the test set.
Figure 27.
Ten tests using different models on the test set.
Table 1.
Summary of the research methods in this article.
Table 1.
Summary of the research methods in this article.
Algorithm Type | Algorithm Name | Advantages | Disadvantages |
---|
Parameter Optimization | DA | Few parameters and easy to adjust. | It is prone to premature convergence, and its efficiency severely decreases in high-dimensional problems. |
GWO | Strong global search capability. | Prone to falling into local optima, with suboptimal search precision. |
WAA | It can demonstrate relatively strong optimization efficiency and robustness, and possesses adaptive capabilities. | The algorithm’s weight settings are sensitive (this paper sets its weights based on the literature [25] and experimental results). |
Signal Processing | CEEMDAN | Can suppress modal aliasing issues with minimal noise residue. | The algorithm has a high computational load, is sensitive to noise, and suffers from endpoint effects. |
FMD | The computational efficiency is high, enabling the rapid decomposition and processing of bearing signals. The complexity of the signals and noise interference have a minimal impact on the results of the algorithm’s decomposition. | Lack of parameter self-adaptation capability (WAA is employed in this paper to optimize the key parameters of FMD). |
Deep Learning | CNN | The model structure is relatively flexible. | Sensitive to noise; requires a high volume of data. |
LeNet | The structure is relatively simple, making it easy to understand and implement. | The feature extraction capability is relatively weak, and the model capacity is limited. |
ResNet | Strong feature representation and adaptability capabilities. | High computational complexity and large model parameter size. |
Deep Learning | Swin Transformer | Strong capability in capturing global features, high flexibility, and excellent adaptability. | The insufficient ability to capture long-range dependencies, along with high computational resource and time costs, is improved in this paper through the use of LGAF. |
Table 2.
Parameter table of LGAF-Swin Transformer model.
Table 2.
Parameter table of LGAF-Swin Transformer model.
Layer Name | Output Size | Kernel Size | Stride | Number of Layers | Output Channels |
---|
Image | 256 × 256 | — | — | — | 3 |
BA Layer | 512 × 512 | 3 × 3 | 1 | 1 | 64 |
DCS block | 128 × 128 | 3 × 3 | 2 | 1 | 256 |
AP Layer | 128 × 128 | 3 × 3 | 1 | 1 | 256 |
LGAM block | 128 × 128 | 3 × 3 | 1 | 2 | 256 |
LGA block | 64 × 64 | 3 × 3 | 2 | 1 | 256 |
AFSM block | 64 × 64 | 3 × 3 | 1 | 1 | 256 |
DC Layer | 32 × 32 | — | — | 1 | 256 |
Global Pool | 32 × 32 | 4 × 4 | 2 | 1 | 512 |
HAL | 32 × 32 | 1 × 1 | — | 1 | — |
FC Layer | 1 × 4 | — | — | — | 4 |
Table 3.
Improvements in WAA-FMD.
Table 3.
Improvements in WAA-FMD.
Method | Characteristics | Improvements in WAA-FMD |
---|
CEEMDAN | A variant of empirical mode decomposition based on additive noise, suitable for non-stationary signals. | CEEMDAN is sensitive to noise and lacks an adaptive mechanism for parameter tuning. The WAA-FMD method introduces the ISEI to dynamically select the optimal IMF, enhancing the objectivity and accuracy of modal selection. |
Fixed-parameter FMD | The original FMD, with fixed parameters, cannot adapt to different operating conditions. | WAA-FMD can adaptively adjust its FMD parameters under varying noise intensities and signal characteristics, thereby avoiding issues of under-decomposition or over-decomposition. |
DA-FMD | Using the DA for FMD parameter optimization, with certain search capabilities. | DA has the issue of a strong exploratory capability but slow convergence, while WAA integrates multi-solution information using a weighted strategy, resulting in faster convergence and better stability. |
GWO-FMD | Using GWO to optimize FMD demonstrates a good convergence capability but is prone to falling into local optima. | WAA comprehensively considers the quality of multiple solutions by enhancing population diversity through a balancing strategy, thereby avoiding local optima and improving global search capability. |
Table 4.
Fault types of bearing 6205.
Table 4.
Fault types of bearing 6205.
Fault Type | Fault Diameter (mm) | Load (Nm) | Speed (RPM) |
---|
Normal | - | 0 | 3010 |
Inner raceway fault | 0.3 | 0 | 3010 |
1.0 | 0 | 3010 |
Outer raceway fault | 0.3 | 0 | 3010 |
1.0 | 0 | 3010 |
Table 5.
Hyperparameter settings.
Table 5.
Hyperparameter settings.
Parameter Name | Numerical/Configuration | Explanation |
---|
Batch Size | 16 | GPU memory constraints (RTX 4060, 16 GB VRAM) introduce moderate gradient noise with small batch sizes to prevent overfitting. |
Maximum Epochs | 50/100 | Maximum training epochs. |
Early Stopping (patience) | 10 | Early termination if validation loss shows no improvement for 10 consecutive rounds to prevent overfitting. |
Optimizer | SGD (lr = 0.001; momentum = 0.9; weight_decay = 1 × 10−4) | Enhances generalization capability. |
Learning Rate Scheduler | StepLR (γ = 0.1 at epochs = [30,40]) | Grid search validation (range 0.0001–0.01), balancing convergence speed and stability. |
Loss Function | CrossEntropyLoss | Applicable to multi-class classification tasks. |
Input dimensions | 256 × 256 | CWT outputs time–frequency map dimensions. |
Patch Size | 4 × 4 | Consistent with the original Swin text. |
Embed Dim | 100 | Embedding dimension. |
Window Size | 7 | Local attention window size. |
MLP Ratio | 4.0 | FFN hidden layer multiplier. |
Table 6.
Overview of optimization strategies.
Table 6.
Overview of optimization strategies.
Variant | Module | Key Parameters | Explanation |
---|
ST-BA | BA | Kernel size: 3 × 3; Channels: 256→512. | Enhance the density of the time–frequency diagram; strengthen the input. |
ST-DSC | DSC | Depthwise 3 × 3 + Pointwise 1 × 1; Channel: 256→256; BatchNorm + ReLU. | Reduce the number of parameters; accelerate shallow feature extraction. |
ST-LGAM | LGAM | Local convolution 3 × 3; global convolution 7 × 7. | Integrate multi-scale features; enhance cross-region modeling. |
ST-AFSM | AFSM | HAL activation β = 0.5; dynamic threshold θ = 0.7. | Select high-discriminative channels and suppress noise redundancy. |
Table 7.
Accuracy of test set.
Table 7.
Accuracy of test set.
Model | KAIST Accuracy (%) |
---|
ST | 93.67 |
ST-BA | 98.17 |
ST-DSC | 96.83 |
ST-LGAM | 98.33 |
ST-AFSM | 97.33 |
LGAF-Swin Transformer | 100 |
Table 8.
Diagnostic results of different models.
Table 8.
Diagnostic results of different models.
Model | TPR | FNR | F1 Score | Kappa Value | AUC Value |
---|
LGAF-Swin Transformer | 100% | 0 | 100% | 100% | 100% |
Swin Transformer | 93.65% | 6.35% | 93.67% | 92.08% | 96.04% |
ResNet | 92.50% | 7.50% | 92.90% | 90.63% | 95.32% |
LeNet | 93.33% | 6.67% | 93.34% | 91.67% | 96.23% |
CNN | 88.67% | 11.33% | 88.68% | 85.83% | 93.98% |
Table 9.
Accuracy of different models under test sets.
Table 9.
Accuracy of different models under test sets.
Model | KAIST Accuracy (%) |
---|
CNN | 88.67 |
Resnet | 93.33 |
Lenet | 92.50 |
Swin Transformer | 93.67 |
LGAF-Swin Transformer | 100.00 |
Table 10.
Parameters of the acoustic sensor.
Table 10.
Parameters of the acoustic sensor.
Parameter | Value |
---|
Sensitivity, mV/Pa | 50 |
Dynamic range, dB | 20~142 |
Frequency range, Hz | 10~20,000 |
Size, φ (mm) | 12.7 |
Frequency response characteristics | free field |
Table 11.
Fault frequencies of bearing 30205.
Table 11.
Fault frequencies of bearing 30205.
Fault Characteristic Frequency | Value |
---|
Inner raceway fault frequency , Hz | 775 |
Outer raceway fault frequency , Hz | 557 |
Rolling element fault frequency , Hz | 241 |
Table 12.
Explanation of test data samples.
Table 12.
Explanation of test data samples.
Fault Types | Training Set | Validation Set | Test Set | Label Value |
---|
Normal Condition | 360 | 120 | 120 | 0 |
Inner raceway fault | 360 | 120 | 120 | 1 |
Outer raceway fault | 360 | 120 | 120 | 2 |
Rolling element fault | 360 | 120 | 120 | 3 |
Total | 1440 | 480 | 480 | |
Table 13.
Diagnostic results of different models.
Table 13.
Diagnostic results of different models.
Model | TPR | FNR | F1 Score | Kappa Value | AUC Value |
---|
LGAF-Swin Transformer | 98.13% | 1.87% | 98.23% | 97.50% | 98.78% |
Swin Transformer | 89.79% | 10.21% | 89.80% | 86.39% | 93.19% |
ResNet | 83.33% | 16.67% | 83.40% | 77.78% | 88.89% |
LeNet | 80.42% | 19.58% | 80.15% | 73.89% | 86.94% |
CNN | 76.46% | 23.54% | 76.43% | 68.61% | 84.31% |
Table 14.
Accuracy of different models under test sets.
Table 14.
Accuracy of different models under test sets.
Model | KAIST Accuracy (%) |
---|
CNN | 75.82 ± 2.49 |
Resnet | 83.12 ± 3.33 |
Lenet | 80.75 ± 3.25 |
Swin Transformer | 90.88 ± 1.50 |
LGAF-Swin Transformer | 98.62 ± 0.29 |
Table 15.
Metrics of various models on the NU218 test set.
Table 15.
Metrics of various models on the NU218 test set.
Model | TPR | FNR | F1 Score | Kappa Value | AUC Value |
---|
LGAF-Swin Transformer | 98.54% | 1.46% | 98.64% | 98.05% | 99.03% |
Swin Transformer | 91.67% | 8.33% | 91.60% | 88.89% | 95.00% |
ResNet | 84.58% | 15.42% | 84.56% | 79.44% | 90.90% |
Lenet | 79.17% | 20.83% | 79.86% | 82.22% | 87.50% |
CNN | 77.92% | 22.08% | 77.75% | 70.56% | 86.67% |
Table 16.
Metrics of each model on the 7205 test set.
Table 16.
Metrics of each model on the 7205 test set.
Model | TPR | FNR | F1 Score | Kappa Value | AUC Value |
---|
LGAF-Swin Transformer | 98.19% | 1.81% | 98.40% | 97.50% | 98.75% |
Swin Transformer | 90.52% | 9.48% | 90.20% | 88.05% | 94.03% |
ResNet | 82.92% | 17.08% | 82.80% | 77.23% | 88.61% |
Lenet | 76.88% | 23.12% | 76.90% | 69.17% | 84.58% |
CNN | 74.79% | 25.21% | 0.750% | 66.39% | 83.19% |
Table 17.
Detection results of generalization test.
Table 17.
Detection results of generalization test.
Bearing Model Number | Accuracy (%) |
---|
NU 218 | 98.75 ± 0.21 |
7205 | 98.44 ± 0.52 |