Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification
Abstract
1. Introduction
2. Related Works
- Deep Learning-Based Brain Tumor Classification (Recent Advances)Almuhaimeed et al. [9] proposed a deep learning framework for brain tumor classification using MRI images, integrating convolutional neural networks with data augmentation techniques. Their model achieved high classification accuracy exceeding 97%, demonstrating the effectiveness of deep learning approaches in medical imaging. However, their evaluation was conducted under standard conditions without considering adversarial robustness. In contrast, the current study evaluates both clean and adversarial performance using AutoAttack.
- Hybrid CNN and Transformer ModelsGómez et al. [10] proposed a hybrid CNN–Transformer architecture for multi-class brain tumor classification. Their model combines local feature extraction with global attention mechanisms, achieving improved classification performance. However, this approach increases model complexity and reduces interpretability. In contrast, the proposed framework employs NNMF to generate interpretable feature representations.
- Diffusion and Generative Models in Medical ImagingDeem et al. [11] analyzed robustness in brain tumor classification using modern deep learning models and highlighted the trade-off between classification accuracy and adversarial robustness. Similarly, recent studies have explored generative approaches such as GANs and diffusion models to improve performance and robustness. However, most of these methods operate in pixel or latent space.
- Brain Tumor Detection Using CNNHossain et al. [12] suggest a hybrid brain tumor detection framework that joins classical machine learning classifiers with a computational neural network (CNN). The research used the BRATS benchmark dataset and applied Fuzzy C-Means (FCM) clustering for tumor segmentation, followed by feature extraction using texture and statistical descriptors.In the classical machine learning phases, several classifiers were evaluated, including Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Logistic Regression, Naïve Bayes, Random Forest, and Multilayer Perceptron (MLP). Among them, SVM achieved the best performance with an accuracy of 92.42%.For deep learning-based classification, the researcher designed a five-layer CNN structure based on convolution, max-pooling, flattening, and fully linked layers. The suggested CNN achieved 97.87% accuracy in an 80:20 training–testing split, outperforming traditional classifiers.However, the performance reported increases; the approach relies on pixel-level segmentation and explicit image-based CNN classification. In contrast, our work achieves interpretable low-rank NNMF representations combined with a lightweight CNN approach and diffusion-based robustness boost, aiming not only at high classification accuracy but also improved adversarial robustness.
- NMF-CNN for the enhancement of featuresChan et al. [13] suggest a hybrid NMR–convolutional neural network (NMF–CNN) framework for the detection of sound events in the DCASE 2019 challenge. In their work, NMF was used as a preprocessing and feature enhancement step to approximate powerful labels from weakly labeled data by resolving the analysis matrix H. The extracted representations were then fed into a CNN approach for event classification. The results of their study showed that the integration of NMF with a shallow CNN enhances the event-based F1-score (30.39%) compared to the baseline system (23.7%), explaining that NMF can provide a meaningful structural decay that benefits deep learning architectures. Unlike their system in voiced scene analysis, the current study adopts NNMF for medical image feature extraction, where it is applied to generate interpretable low-rank representations of brain MRI images. Subsequently, these representations are utilized for classification and robustness evaluation under autoattacks.
- Semi-NMF Network for Image ClassificationHuang et al. [14] suggest a Semi-NMF-based convolutional network (SNnet) for image classification, where convolutional filters are not learned through backpropagation but instead are built using semi-non-negative matrix factorization used to image patches. Unlike traditional CNNs that depend on slope-based optimization, their process learns filter banks by matrix factorization, reduces computational cost, and avoids global parameter tuning. In addition, a weakly supervised extension (S-SNnet) was introduced via merge graph regularization into the Semi-NMF framework to enhance discriminative capability. Experimental results on the MNIST dataset demonstrated that the suggested style achieves competitive performance compared to state-of-the-art shallow and deep learning architectures such as PCANet. This work highlights the feasibility of merging the matrix factorization mechanism within convolutional frameworks for active feature learning.
- Classification–Denoising Joint ModelsThiry and Guth [15] suggest classification–denoising networks, which simultaneously model image classification and denoising by learning a single network that holds the common distribution of noisy image information and their labels. Their method combines loss of cross-entropy classification with the proper denoising outcome at multiple noise levels, using the Tweedie–Miyasawa formula to estimate the denoised product. Experimental results in CIFAR-10 and ImageNet demonstrate competitive performance and improved adversarial robustness compared to standard discriminative classifiers. This study provides a theoretical link between denoising objectives and adversarial gradients, offering a new view of robustness that complements conventional defense mechanisms [15].
- Reliable Robustness EvaluationCroce and Hein [3] propose a robust evaluation framework for adversarial robustness based on a crew of various parameter-free attacks. Unlike earlier benchmarks that often build on individual attack procedures, their ensemble joins complementary attacks such as APGD and Square attacks to provide a parameter-independent and safe robustness rating. The evaluation of the study involved testing more than fifty models and explained that many already considered robust defenses could be broken when evaluated with the suggested attack ensemble. This study highlighted the need for a strict and united robustness estimate in adversarial machine learning and directly motivated the use of AutoAttack as a reliable benchmark in the current work.
- Recent studies (2024–2026) on brain tumor classification have reported high precision using the deep CNN model and Transformer-based models, particularly in publicly available datasets. Many of these approaches focus mainly on maximizing clean accuracy, often exceeding 95%. More three reviews [16,17,18] treated brain tumor segmentation, potential of hybrid models and analysis of model interpretability, resp.However, these methods are typically evaluated under classical conditions and do not consider adversarial robustness or interpretability. In contrast, the suggested work confirms robustness against adversarial perturbations while preserving interpretable NNMF-based feature representations.Thus, this study provides a complementary perspective by addressing reliability and robustness, which are critical in safety-sensitive medical systems. In addition to classification-based approaches, recent studies have discussed anomaly detection and the mechanism of medical image segmentation for brain tumor testing, with the aim of improving localization accuracy and robustness. The suggested work differs by focusing on interpretable feature extraction combined with adversarial robustness over diffusion-based feature purification, providing a complementary perspective to existing methods.
3. Materials and Methods
3.1. Preprocessing Dataset
3.2. Extraction Feature Phase Using NNMF
3.3. Feature Selection and Statistical Feature Analysis
3.4. CNN-Based Classification on Selected NNMF Features
3.5. Feature-Space Diffusion Data Generation
3.6. Feature-Space Denoiser Training
3.7. Evaluation of Feature-Space Diffusion Refine
3.8. Robustness Evaluation Under AutoAttack
3.9. Comprehensive Performance Evaluation
3.9.1. ROC-AUC
3.9.2. Brier Score
3.9.3. Log-Loss
3.9.4. Matthews Correlation Coefficient (MCC)
3.9.5. Results and Discussion
3.10. Comparison with Recent Works
4. Implementation and Computational Performance Analysis
4.1. Computational Time Comparison
4.2. Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Litjens, G.; Kooi, T.; Bejnordi, B.E.; Setio, A.A.A.; Ciomp, F.; Ghafoorian, M.; van der Laak, J.A.; van Ginneke, B.; Sánchez, C.I. A survey on deep learning in medical image analysis. Med. Image Anal. 2017, 42, 60–88. [Google Scholar] [CrossRef] [PubMed]
- Szegedy, C.; Zaremba, W.; Sutskever, I.; Bruna, J.; Erhan, D.; Goodfellow, I.; Fergus, R. Intriguing properties of neural networks. arXiv 2013, arXiv:1312.6199. [Google Scholar] [CrossRef]
- Croce, F.; Hein, M. Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. arXiv 2020, arXiv:2003.01690. [Google Scholar] [CrossRef]
- Guyon, I.; Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 2003, 3, 1157–1182. [Google Scholar]
- Rajkó, R.; Siket, I.; Hegedűs, P.; Ferenc, R. Development of partial least squares regression with discriminant analysis for software bug prediction. Heliyon 2024, 10, e35045. [Google Scholar] [CrossRef] [PubMed]
- Lee, D.D.; Seung, H.S. Learning the parts of objects by non-negative matrix factorization. Nature 1999, 401, 788–791. [Google Scholar] [CrossRef] [PubMed]
- Gillis, N. Nonnegative Matrix Factorization; SIAM: Philadelphia, PA, USA, 2020. [Google Scholar] [CrossRef]
- Croce, F.; Hein, M. AutoAttack: Reliable Evaluation of Adversarial Robustness. 2020. Available online: https://github.com/fra31/auto-attack (accessed on 10 March 2026).
- Almuhaimeed, A.; Alenezi, F.; Alotaibi, A. Brain Tumor Classification Using Deep Learning and Data Augmentation Techniques. Front. Med. 2025, 12, 1635796. [Google Scholar] [CrossRef] [PubMed]
- Gómez, J.; Martínez, C.; Fernández, L. Enhanced Multi-Class Brain Tumor Classification Using Hybrid CNN and Transformer Models. Technologies 2025, 13, 379. [Google Scholar] [CrossRef]
- Deem, M.; Johnson, S.; Kim, D. Robustness Analysis of Brain Tumor Classification Models Under Adversarial Attacks. arXiv 2026, arXiv:2602.11646. [Google Scholar] [CrossRef]
- Hossain, T.; Shishir, F.S.; Ashraf, M.; Al Nasim, M.A.; Shah, F.M. Brain tumor detection using Convolutional Neural Network. In Proceedings of the 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT-2019), Dhaka, Bangladesh, 3–5 May 2019. [Google Scholar] [CrossRef]
- Chan, T.K.; Chin, C.S.; Li, Y. Non-Negative Matrix Factorization-Convolutional Neural Network (NMF-CNN) for sound event detection. arXiv 2020, arXiv:2001.07874. [Google Scholar] [CrossRef]
- Huang, H.; Yang, Z.; Liang, N.; Li, Z. Semi-NMF network for image classification. IEEE Access 2019, 7, 8899–8903. [Google Scholar] [CrossRef]
- Thiry, L.; Guth, F. Classification-denoising networks. arXiv 2024, arXiv:2410.03505. [Google Scholar] [CrossRef]
- Ahamed, M.F.; Hossain, M.M.; Nahiduzzaman, M.; Islam, M.R.; Islam, M.R.; Ahsan, M.; Haider, J. A review on brain tumor segmentation based on deep learning methods with federated learning techniques. Comput. Med. Imaging Graph. 2023, 110, 102313. [Google Scholar] [CrossRef] [PubMed]
- Netshamutshedzi, N.; Netshikweta, R.; Ndogmo, J.C.; Obagbuwa, I.C. A systematic review of the hybrid machine learning models for brain tumour segmentation and detection in medical images. Front. Artif. Intell. 2025, 8, 1615550. [Google Scholar] [CrossRef]
- Gomes, E.F.; Barbosa, R.S. Deep Learning Approaches for Brain Tumor Classification in MRI Scans: An Analysis of Model Interpretability. Appl. Sci. 2026, 8, 831. [Google Scholar] [CrossRef]
- Darabi, P.K. Brain Tumor Image Dataset: Semantic Segmentation. 2023. Available online: https://www.kaggle.com/datasets/pkdarabi/brain-tumor-image-dataset-semantic-segmentation (accessed on 25 January 2026).
- Yagis, E.; Atnafu, S.W.; García Seco de Herrera, A.; Marzi, C.; Scheda, R.; Giannelli, M.; Tessa, C.; Citi, L.; Diciotti, S. Effect of data leakage in brain MRI classification using 2D convolutional neural networks. Sci. Rep. 2021, 11, 22544. [Google Scholar] [CrossRef] [PubMed]
- Lee, S.; Pang, H.S. Feature extraction based on the Non-Negative Matrix Factorization of Convolutional Neural Networks for monitoring domestic activity with acoustic signals. IEEE Access 2020, 8, 122384–122395. [Google Scholar] [CrossRef]
- Lee, D.D.; Seung, H.S. Algorithms for Non-negative Matrix Factorization. In Proceedings of the Advances in Neural Information Processing Systems, Denver, CO, USA, 27 November–2 December 2000; MIT Press: Cambridge, MA, USA, 2000; Volume 13. Available online: https://proceedings.neurips.cc/paper_files/paper/2000/file/f9d1152547c0bde01830b7e8bd60024c-Paper.pdf (accessed on 11 March 2026).
- Mazzanti, S. It Took Me 6 Years to Find the Best Metric for Classification Models. 2023. Available online: https://medium.com/data-science-collective/it-took-me-6-years-to-find-the-best-metric-for-classification-models-0f5aa21a2b85 (accessed on 10 March 2026).
- Héder, M.; Rigó, E.; Medgyesi, D.; Lovas, R.; Tenczer, S.; Török, F.; Farkas, A.; Emődi, M.; Kadlecsik, J.; Mező, G.; et al. The Past, Present and Future of the ELKH Cloud. Inf. Társadalom 2022, 22, 128–137. [Google Scholar] [CrossRef]




























| Model | Acc | Prec | Rec | F1 | MCC | BalAcc | ROC-AUC | Brier | LogLoss |
|---|---|---|---|---|---|---|---|---|---|
| Clean_Baseline | 0.8605 | 0.8548 | 0.8983 | 0.8760 | 0.7178 | 0.8564 | 0.9105 | 0.1461 | 0.4751 |
| Clean_Defended | 0.8512 | 0.8525 | 0.8814 | 0.8667 | 0.6988 | 0.8479 | 0.8967 | 0.1555 | 0.4963 |
| Robust_Baseline | 0.0047 | 0.0000 | 0.0000 | 0.0000 | −0.9906 | 0.0052 | 0.0075 | 0.4702 | 1.1629 |
| Robust_Defended | 0.5953 | 0.6115 | 0.7203 | 0.6615 | 0.1703 | 0.5818 | 0.7485 | 0.2150 | 0.6182 |
| Stage | CPU (sec) | GPU (sec) |
|---|---|---|
| NNMF Feature Extraction | 17.03 | 9.11 |
| CNN Training (NNMF Features) | 12.60 | 17.22 |
| Diffusion Denoiser Training | 8.42 | 9.24 |
| AutoAttack Baseline (APGD-CE + Square) | 8.42 | 10.73 |
| AutoAttack Defended (APGD-CE + Square) | 155.00 | 70.30 |
| Total Runtime | 201.47 | 116.60 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Al-kharsan, H.A.; Rajkó, R. Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification. Mach. Learn. Knowl. Extr. 2026, 8, 105. https://doi.org/10.3390/make8040105
Al-kharsan HA, Rajkó R. Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification. Machine Learning and Knowledge Extraction. 2026; 8(4):105. https://doi.org/10.3390/make8040105
Chicago/Turabian StyleAl-kharsan, Hiba Adil, and Róbert Rajkó. 2026. "Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification" Machine Learning and Knowledge Extraction 8, no. 4: 105. https://doi.org/10.3390/make8040105
APA StyleAl-kharsan, H. A., & Rajkó, R. (2026). Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification. Machine Learning and Knowledge Extraction, 8(4), 105. https://doi.org/10.3390/make8040105

