Robust Sparse Non-Negative Matrix Factorization for Identifying Signals of Interest in Bearing Fault Detection
Abstract
1. Introduction
- 1.
- Robust cost function: In the literature, various NMF frameworks and robust cost functions have been explored for bearing condition monitoring. In this work, we further adapt the NMF framework to this purpose by introducing the Maximum Correntropy Criterion (MCC), which significantly improves robustness against heavy-tailed and impulsive noise compared with classical least-squares- and -divergence-based NMF methods.
- 2.
- Sparsity constraint: An -norm sparsity term is incorporated into the NMF formulation to emphasize localized impulsive components that characterize bearing defects, enhancing the clarity and separability of cyclic features.
- 3.
- Stable initialization: The NNDSVD initialization is applied to both W and H matrices to ensure stable and reproducible decompositions and to accelerate convergence of the algorithm.
- 4.
- Automatic filter selection: The Envelope Spectrum Indicator (ENVSI) is employed to automatically identify the most informative NMF component corresponding to the fault-related frequency band, ensuring objective and repeatable filter selection.
- 5.
- Comparative validation: The proposed Sparse NMF-MCC is benchmarked against the classic NMF and -HALS NMF algorithms, demonstrating that the MCC-based approach consistently achieves higher ENVSI values and clearer fault-frequency localization across both simulated and real-world datasets (belt conveyor, ore crusher, and test rig).
2. Methodology and Theory
2.1. Short-Time Fourier Transform (STFT)
2.2. NMF Model
2.3. Maximum Correntropy Criterion (MCC) with Gaussian Kernel
- Robustness: The Gaussian kernel suppresses large deviations by assigning lower values to them. For large deviations, the value of approaches zero, making corentropy less sensitive to outlier points than traditional measures such as mean squared error, which enhances the robustness of MCC against outliers.
- Bandwidth Control: The parameter dictates the width of the Gaussian kernel, balancing between sensitivity to small errors and robustness against large ones.
2.4. Sparse NMF with MCC Objective
2.5. Algorithm of Sparse NMF MCC
- Initialization: Initialize the matrices and with the non-negative values. Set the parameters and . In presented case both matrices, i.e., and were initialize using NNDSVD.
- Update Rules: The update for incorporates the MCC weights:where is the transposition of matrix , is the matrix of MCC weights, ⊙ is the Hadamard product, ⊘ is the Hadamard division and is a small constant to prevent division by zero. The update for is:Finally, the normalization of W is done by the columns of matrix W:
- Stopping Condition: The algorithm continues until a maximum number of iterations is reached or the optimality gap becomes sufficiently small.
2.6. Selecting the Parameters
| Algorithm 1: Sparse NMF MCC: Sparse Non-negative Matrix Factorization with Maximum Correntropy Criterion |
![]() |
2.7. Envelope Spectrum Indicator
3. Simulation
3.1. Simulated Signal
3.2. Signal in the Presence of Gaussian Noise
3.3. Signal in the Presence of Heavy-Tailed Noise
3.4. Monte-Carlo Analysis for Simulated Signal
3.5. Gaussian Case
3.6. Non-Gaussian Case
3.7. Sensitivity Analysis
4. Real Data Analysis
4.1. Case 1: Belt Conveyor Acoustic Signal
4.2. Case 2: Copper Ore Crusher
4.3. Case 3: Test Rig
4.4. Results for Real Dataset
4.5. Discussion, Limitations, and Future Work
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A



Appendix B
| Symbol | Description |
|---|---|
| One-dimensional discrete data (raw input signal) | |
| Short-Time Fourier Transform at time t and frequency f | |
| w | Window function |
| L | Number of samples in the analyzed segment (window length) |
| N | Number of points used for Discrete Fourier Transform (DFT) |
| j | Imaginary unit |
| Sampling Frequency | |
| Spectrogram (absolute value of ), the non-negative input matrix, | |
| Dimensions (rows, columns) of the input matrix S | |
| r | Rank of factorization |
| Sparsity regularization parameter | |
| Maximum number of iterations in the Sparse NMF MCC algorithm | |
| Basis matrix | |
| Coefficient matrix | |
| Error matrix | |
| V | Correntropy |
| Gaussian kernel function in Maximum Correntropy Criterion (MCC) | |
| Kernel bandwidth parameter () in MCC | |
| norm | |
| MCC weight matrix | |
| ⊙ | Hadamard product (element-wise multiplication) |
| ⊘ | Hadamard division (element-wise division) |
| Small constant introduced to prevent division by zero | |
| Envelope Spectrum Indicator | |
| Squared Envelope Spectrum (SES) | |
| Signal of Interest (SOI) component | |
| Noise component | |
| Degrees of freedom (for Student’s t-distribution, modeling heavy-tailed noise) | |
| A | Magnitude/Amplitude of the sinusoidal component in SOI |
| Frequency of the sinusoidal component in SOI | |
| Damping factor | |
| Dirac delta function | |
| Nominal period of the signal | |
| Slow variations in the period due to jitter effects | |
| Initial phase shift | |
| * | Convolution operation |
| standard deviation in Gaussian noise |
References
- Antoni, J. The spectral kurtosis: A useful tool for characterising non-stationary signals. Mech. Syst. Signal Process. 2006, 20, 282–307. [Google Scholar] [CrossRef]
- Antoni, J.; Randall, R.B. The spectral kurtosis: Application to the vibratory surveillance and diagnostics of rotating machines. Mech. Syst. Signal Process. 2006, 20, 308–331. [Google Scholar] [CrossRef]
- Barszcz, T.; Randall, R.B. Application of spectral kurtosis for detection of a tooth crack in the planetary gear of a wind turbine. Mech. Syst. Signal Process. 2009, 23, 1352–1365. [Google Scholar] [CrossRef]
- Wang, Y.; Xiang, J.; Markert, R.; Liang, M. Spectral kurtosis for fault detection, diagnosis and prognostics of rotating machines: A review with applications. Mech. Syst. Signal Process. 2016, 66, 679–698. [Google Scholar] [CrossRef]
- Liu, S.; Hou, S.; He, K.; Yang, W. L-Kurtosis and its application for fault detection of rolling element bearings. Measurement 2018, 116, 523–532. [Google Scholar] [CrossRef]
- Zhong, J.; Wang, D.; Li, C. A nonparametric health index and its statistical threshold for machine condition monitoring. Measurement 2021, 167, 108290. [Google Scholar] [CrossRef]
- Jaworski, P.; Pitera, M. The 20-60-20 rule. Discret. Contin. Dyn. Syst. Ser. B 2016, 21, 1149–1166. [Google Scholar] [CrossRef]
- Hebda-Sobkowicz, J.; Zimroz, R.; Pitera, M.; Wyłomańska, A. Informative frequency band selection in the presence of non-Gaussian noise—A novel approach based on the conditional variance statistic with application to bearing fault diagnosis. Mech. Syst. Signal Process. 2020, 145, 106971. [Google Scholar] [CrossRef]
- Nowicki, J.; Hebda-Sobkowicz, J.; Zimroz, R.; Wyłomańska, A. Dependency measures for the diagnosis of local faults in application to the heavy-tailed vibration signal. Appl. Acoust. 2021, 178, 107974. [Google Scholar] [CrossRef]
- Żak, G.; Wyłomańska, A.; Zimroz, R. Application of alpha-stable distribution approach for local damage detection in rotating machines. J. Vibroengineering 2015, 17, 2987–3002. [Google Scholar]
- Randall, R.B.; Antoni, J.; Chobsaard, S. The relationship between spectral correlation and envelope analysis in the diagnostics of bearing faults and other cyclostationary machine signals. Mech. Syst. Signal Process. 2001, 15, 945–962. [Google Scholar] [CrossRef]
- Borghesani, P.; Shahriar, M.R. Cyclostationary analysis with logarithmic variance stabilisation. Mech. Syst. Signal Process. 2016, 70, 51–72. [Google Scholar] [CrossRef]
- Bonnardot, F.; Randall, R.; Guillet, F. Extraction of second-order cyclostationary sources—Application to vibration analysis. Mech. Syst. Signal Process. 2005, 19, 1230–1244. [Google Scholar] [CrossRef]
- Marsick, A.; André, H.; Khelf, I.; Leclère, Q.; Antoni, J. Restoring cyclostationarity of rolling element bearing signals from the instantaneous phase of their envelope. Mech. Syst. Signal Process. 2023, 193, 110264. [Google Scholar] [CrossRef]
- Borghesani, P.; Antoni, J. A faster algorithm for the calculation of the fast spectral correlation. Mech. Syst. Signal Process. 2018, 111, 113–118. [Google Scholar] [CrossRef]
- Wodecki, J.; Michalak, A.; Wyłomańska, A.; Zimroz, R. Influence of non-Gaussian noise on the effectiveness of cyclostationary analysis–Simulations and real data analysis. Measurement 2021, 171, 108814. [Google Scholar] [CrossRef]
- Flandrin, P.; Rilling, G.; Goncalves, P. Empirical mode decomposition as a filter bank. IEEE Signal Process. Lett. 2004, 11, 112–114. [Google Scholar] [CrossRef]
- Arifin, M.S.; Wang, W.; Uddin, M.N. A modified EMD technique for broken rotor bar fault detection in induction machines. Sensors 2024, 24, 5186. [Google Scholar] [CrossRef]
- Wang, W.; Yuan, H. Bearing Fault Feature Extraction Method Based on Adaptive Time-Varying Filtering Empirical Mode Decomposition and Singular Value Decomposition Denoising. Machines 2025, 13, 50. [Google Scholar] [CrossRef]
- Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Process. 2013, 62, 531–544. [Google Scholar] [CrossRef]
- Jiang, W.; Qi, Z.; Jiang, A.; Chang, S.; Xia, X. Lightweight network bearing intelligent fault diagnosis based on VMD-FK-ShuffleNetV2. Machines 2024, 12, 608. [Google Scholar] [CrossRef]
- Liu, G.; Ma, Y.; Wang, N. Rolling Bearing Fault Diagnosis Based on SABO–VMD and WMH–KNN. Sensors 2024, 24, 5003. [Google Scholar] [CrossRef]
- Shiri, H.; Wodecki, J. Analysis of the sound signal to fault detection of bearings based on Variational Mode Decomposition. IOP Conf. Ser. Earth Environ. Sci. 2021, 942, 012020. [Google Scholar] [CrossRef]
- Smith, J.S. The local mean decomposition and its application to EEG perception data. J. R. Soc. Interface 2005, 2, 443–454. [Google Scholar] [CrossRef]
- Shiri, H.; Wodecki, J.; Ziętek, B.; Zimroz, R. Inspection robotic UGV platform and the procedure for an acoustic signal-based fault detection in belt conveyor idler. Energies 2021, 14, 7646. [Google Scholar] [CrossRef]
- Lei, Y.; Lin, J.; He, Z.; Zuo, M.J. A review on empirical mode decomposition in fault diagnosis of rotating machinery. Mech. Syst. Signal Process. 2013, 35, 108–126. [Google Scholar] [CrossRef]
- Wang, Y.; Liu, F.; Jiang, Z.; He, S.; Mo, Q. Complex variational mode decomposition for signal processing applications. Mech. Syst. Signal Process. 2017, 86, 75–85. [Google Scholar] [CrossRef]
- Li, Y.; Si, S.; Liu, Z.; Liang, X. Review of local mean decomposition and its application in fault diagnosis of rotating machinery. J. Syst. Eng. Electron. 2019, 30, 799–814. [Google Scholar] [CrossRef]
- Randall, R.B.; Antoni, J. Why EMD and similar decompositions are of little benefit for bearing diagnostics. Mech. Syst. Signal Process. 2023, 192, 110207. [Google Scholar] [CrossRef]
- Zhang, F.; Li, Z.; Cheng, Y.; Yang, Y.; Han, Y.; Yi, C.; Li, T.; Yan, J.; Dong, L.; Zhang, W. MHSNet: A Multi-Scale Hidden State Interaction Network for Fault Diagnosis of Rotating Machinery. Tsinghua Sci. Technol. 2025. [Google Scholar] [CrossRef]
- Wang, H.; Song, Y.; Yang, H.; Liu, Z. Generalized Koopman Neural Operator for Data-driven Modelling of Electric Railway Pantograph-catenary Systems. IEEE Trans. Transp. Electrif. 2025. [Google Scholar] [CrossRef]
- Yan, J.; Cheng, Y.; Zhang, F.; Li, M.; Zhou, N.; Jin, B.; Wang, H.; Yang, H.; Zhang, W. Research on multimodal techniques for arc detection in railway systems with limited data. Struct. Health Monit. 2025. [Google Scholar] [CrossRef]
- Xu, H.; Pan, H.; Zheng, J.; Tong, J.; Zhang, F.; Chu, F. Intelligent fault identification in sample imbalance scenarios using robust low-rank matrix classifier with fuzzy weighting factor. Appl. Soft Comput. 2024, 152, 111229. [Google Scholar] [CrossRef]
- Wodecki, J.; Kruczek, P.; Bartkowiak, A.; Zimroz, R.; Wyłomańska, A. Novel method of informative frequency band selection for vibration signal using Nonnegative Matrix Factorization of spectrogram matrix. Mech. Syst. Signal Process. 2019, 130, 585–596. [Google Scholar] [CrossRef]
- Liang, L.; Ding, X.; Liu, F.; Chen, Y.; Wen, H. Feature extraction using sparse kernel non-negative matrix factorization for rolling element bearing diagnosis. Sensors 2021, 21, 3680. [Google Scholar] [CrossRef]
- Hou, W.; Liu, X.; Wang, J.; Chen, C.; Xu, X. Multispectral Land Surface Reflectance Reconstruction Based on Non-Negative Matrix Factorization: Bridging Spectral Resolution Gaps for GRASP TROPOMI BRDF Product in Visible. Remote Sens. 2025, 17, 1053. [Google Scholar] [CrossRef]
- Wang, M.; Zhang, W.; Shao, M.; Wang, G. Separation and Extraction of Compound-Fault Signal Based on Multi-Constraint Non-Negative Matrix Factorization. Entropy 2024, 26, 583. [Google Scholar] [CrossRef]
- Gabor, M.; Zdunek, R.; Zimroz, R.; Wylomanska, A. Bearing Damage Detection With Orthogonal and Nonnegative Low-Rank Feature Extraction. IEEE Trans. Ind. Inform. 2023, 20, 2944–2955. [Google Scholar] [CrossRef]
- Michalak, A.; Zdunek, R.; Zimroz, R.; Wyłomańska, A. Influence of α-Stable Noise on the Effectiveness of Non-Negative Matrix Factorization—Simulations and Real Data Analysis. Electronics 2024, 13, 829. [Google Scholar] [CrossRef]
- Boutsidis, C.; Gallopoulos, E. SVD based initialization: A head start for nonnegative matrix factorization. Pattern Recognit. 2008, 41, 1350–1362. [Google Scholar] [CrossRef]
- Allen, J. Short term spectral analysis, synthesis, and modification by discrete Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1977, 25, 235–238. [Google Scholar] [CrossRef]
- Lee, D.; Seung, S. Algorithms for non-negative matrix factorization. In Proceedings of the Advances in Neural Information Processing Systems, Denver, CO, USA, 11 May 2001; MIT Press: Cambridge, MA, USA, 2001; pp. 556–562. [Google Scholar]
- Cichocki, A.; Zdunek, R.; Phan, A.H.; Amari, S.i. Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation; John Wiley & Sons: Hoboken, NJ, USA, 2009. [Google Scholar]
- Cichocki, A.; Phan, A.H.; Caiafa, C. Flexible HALS algorithms for sparse non-negative matrix/tensor factorization. In Proceedings of the 2008 IEEE Workshop on Machine Learning for Signal Processing, Cancun, Mexico, 16–19 October 2008; pp. 73–78. [Google Scholar]
- Cichocki, A.; Phan, A.H. Fast local algorithms for large scale nonnegative matrix and tensor factorizations. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2009, 92, 708–721. [Google Scholar] [CrossRef]
- Chen, B.; Zhu, Y.; Hu, J.; Principe, J.C. System Parameter Identification: Information Criteria and Algorithms; Elsevier: London, UK, 2013. [Google Scholar]
- Peng, S.; Chen, B.; Sun, L.; Ser, W.; Lin, Z. Constrained maximum correntropy adaptive filtering. Signal Process. 2017, 140, 116–126. [Google Scholar] [CrossRef]
- Wang, J.J.Y.; Wang, X.; Gao, X. Non-negative matrix factorization by maximizing correntropy for cancer clustering. BMC Bioinform. 2013, 14, 107. [Google Scholar] [CrossRef]
- Peng, S.; Ser, W.; Lin, Z.; Chen, B. Robust sparse nonnegative matrix factorization based on maximum correntropy criterion. In Proceedings of the 2018 IEEE International Symposium on Circuits and Systems (ISCAS), Florence, Italy, 27–30 May 2018; IEEE eXpress Conference Publishing: Red Hook, NY, USA, 2018; pp. 1–5. [Google Scholar]
- Du, L.; Li, X.; Shen, Y.D. Robust nonnegative matrix factorization via half-quadratic minimization. In Proceedings of the 2012 IEEE 12th International Conference on Data Mining, Brussels, Belgium, 10–13 December 2012; IEEE Computer Society Conference Publishing Services (CPS): Danvers, MA, USA, 2012; pp. 201–210. [Google Scholar]
- Shiri, H.; Zimroz, P.; Wodecki, J.; Wyłomańska, A.; Zimroz, R.; Szabat, K. Using long-term condition monitoring data with non-Gaussian noise for online diagnostics. Mech. Syst. Signal Process. 2023, 200, 110472. [Google Scholar] [CrossRef]
- Shiri, H.; Zimroz, P.; Wyłomańska, A.; Zimroz, R. Estimation of machinery’s remaining useful life in the presence of non-Gaussian noise by using a robust extended Kalman filter. Measurement 2024, 235, 114882. [Google Scholar] [CrossRef]
- Berry, M.W.; Browne, M.; Langville, A.N.; Pauca, V.P.; Plemmons, R.J. Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 2007, 52, 155–173. [Google Scholar] [CrossRef]























| Parameter | Value |
|---|---|
| Window | hamming(256) |
| Overlap | 180 |
| FFT Points | 512 |
| Sampling Frequency (Fs) | 25,000 |
| Description | Value |
|---|---|
| Shaft speed frequency | 3 Hz |
| Cage defect frequency (FTF) | 1.3 Hz |
| Ball spin frequency (BSF) | 10.6 Hz |
| Inner race defect frequency (BPFI) | 30.7 Hz |
| Outer race defect frequency (BPFO) | 23.3 Hz |
| Rolling element defect frequency | 21.1 Hz |
| Description | Value |
|---|---|
| Shaft speed frequency | 17.35 Hz |
| Cage defect frequency (FTF) | 7 Hz |
| Ball spin frequency (BSF) | 42.75 Hz |
| Inner race defect frequency (BPFI) | 134.44 Hz |
| Outer race defect frequency (BPFO) | 91.11 Hz |
| Rolling element defect frequency | 85.5 Hz |
| Method | Type | Case 1 | Case 2 | Case 3 |
|---|---|---|---|---|
| NMF Sparse MCC | Random | 0.2803 | 0.0196 | 0.4650 |
| NNDSVD | 0.3111 | 0.1258 | 0.4737 | |
| Sparse NMF MCC Modified Bandwidth | Random | 0.3011 | 0.0228 | 0.3188 |
| NNDSVD | 0.2887 | 0.0201 | 0.3636 | |
| Classic NMF | Default | 0.3051 | 0.0197 | 0.0136 |
| NNDSVD | 0.3050 | 0.0210 | 0.0118 | |
| -HALS NMF | Random | 0.2927 | 0.0208 | 0.0088 |
| NNDSVD | 0.2899 | 0.0207 | 0.0087 | |
| Spectral Kurtosis | – | 0.2437 | 0.0192 | 0.0045 |
| Alpha Selector | – | 0.2991 | 0.0197 | 0.0067 |
| CV Selector | – | 0.2021 | 0.0198 | 0.0050 |
| Pearson Selector | – | 0.2910 | 0.0187 | 0.3973 |
| Parameter | Symbol | Case 1 | Case 2 | Case 3 |
|---|---|---|---|---|
| (Conveyor) | (Ore Crusher) | (Bearing Rig) | ||
| Sparsity coefficient | 1 | 1 | 1 | |
| MCC kernel bandwidth | 0.3 | 0.3 | 0.3 | |
| Factorization rank | r | 4 | 4 | 4 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shiri, H.; Michalak, A. Robust Sparse Non-Negative Matrix Factorization for Identifying Signals of Interest in Bearing Fault Detection. Sensors 2025, 25, 7041. https://doi.org/10.3390/s25227041
Shiri H, Michalak A. Robust Sparse Non-Negative Matrix Factorization for Identifying Signals of Interest in Bearing Fault Detection. Sensors. 2025; 25(22):7041. https://doi.org/10.3390/s25227041
Chicago/Turabian StyleShiri, Hamid, and Anna Michalak. 2025. "Robust Sparse Non-Negative Matrix Factorization for Identifying Signals of Interest in Bearing Fault Detection" Sensors 25, no. 22: 7041. https://doi.org/10.3390/s25227041
APA StyleShiri, H., & Michalak, A. (2025). Robust Sparse Non-Negative Matrix Factorization for Identifying Signals of Interest in Bearing Fault Detection. Sensors, 25(22), 7041. https://doi.org/10.3390/s25227041


