1. Introduction
It can be anxiety-inducing when patients must wait for a medical diagnosis, especially regarding intracranial neoplasms (or brain tumors). Brain tumors are uncontrolled and abnormal growths of cells in the brain, classified into primary tumors, which originate in brain tissue, and secondary tumors, which spread from other parts of the body to the brain tissue via the bloodstream [
1]. Given the intricate nature of the brain—an enormous and complex organ that controls the nervous system and contains around 100 billion nerve cells—the uncertainty surrounding a potential brain tumor diagnosis intensifies the anxiety experienced by patients awaiting medical assessments [
2]. Patients are concerned about the impact of brain tumors on their cognitive functions, treatment options, and overall quality of life, amplifying the emotional strain they face in this situation.
Among brain tumors, glioma and meningioma stand out as lethal primary tumor types, with glioma ranking as the most prevalent brain tumor in humans [
3]. The World Health Organization (WHO) classifies brain tumors into four grades: grades 1 and 2 represent less severe tumors like meningioma, and grades 3 and 4 indicate more serious types such as glioma. In clinical practice, meningioma, pituitary, and glioma tumors account for approximately 15%, 15%, and 45% of cases, respectively [
4]. Understanding the differences between these tumor types and their grades is important for accurate diagnosis and effective treatment.
The median medical wait time for specialists across hospital providers in the US state of Vermont is 41 days, but it varies significantly by location. The wait time range for radiologists in Vermont is between 7 and 112 days. Additionally, the average wait time for a primary physician in the United States overall is 20.6 days [
5]. However, the challenges do not end there; after completing the MRI scan, patients often must wait several weeks or months for their next appointment to receive their MRI results [
6]. This prolonged wait time for diagnoses causes stress and anxiety for many patients. Integrating artificial intelligence into the medical diagnosis process can reduce the wait time for a brain tumor diagnosis [
7].
Convolutional neural networks (CNNs) are popular deep learning models designed for image classification tasks [
8], making them particularly adept at determining intricate patterns within medical images, such as those obtained from MRI scans [
9]. Many CNN models are already trained and available for image classification purposes; such models are known as pre-trained models. Training models from scratch can be time-consuming and computationally intensive, often requiring significant resources like GPUs. This can be mitigated using pre-trained models, which significantly reduces the time and resources needed for training. Transfer learning, a technique that involves utilizing knowledge from pre-trained models on large datasets, enhances the efficiency and effectiveness of the classification process by fine-tuning these models to adapt to new tasks or datasets [
10]. This approach allows the model to build upon previously learned features and patterns, accelerating the learning process and improving performance on new tasks.
This research presents a comparative analysis of five widely used pre-trained deep learning models—ResNet50, Xception, EfficientNetV2-S, ResNet152V2, and VGG16—on the task of brain tumor classification using MRI images. While extensive research exists on individual models for MRI-based brain tumor classification, our study stands out by providing a direct comparison of these five specific models on a single dataset. The key contributions of this study include: (1) a systematic evaluation of these models’ performance on a publicly available MRI dataset using metrics such as accuracy, F1 score, and precision; and (2) the identification of the most effective pre-trained model for brain tumor classification. We hope that this study provides a robust framework for future research in the medical diagnostics field.
The rest of the paper is structured as follows:
Section 2 describes the methodology, including the dataset and image augmentation, pre-trained model descriptions, model architecture, and model fine-tuning.
Section 3 presents the results of the performance analysis of individual models and a comparison of model performance. The paper concludes with
Section 4, which discusses the conclusions and future work.
Author Contributions
Conceptualization, A.A.; methodology, A.A.; software, A.A.; validation, A.A. and A.A.B.; formal analysis, A.A.; investigation, A.A.; intramural resources, A.A.B. and J.A.H.; data curation, A.A.; writing—original draft preparation, A.A.; writing—review and editing, A.A., A.A.B. and J.A.H.; visualization, A.A.; supervision, A.A.B. and J.A.H.; project administration, A.A.B. and J.A.H.; intramural funding acquisition, A.A.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Acknowledgments
A.A. would like to extend heartfelt gratitude to Amrutaa Vibho for her help in starting him in AI research, to his sister Archana M. for her informal review of the work in progress, and to his coauthors for their mentorship, guidance, patience, and encouragement. A.A., A.A.B., and J.H. would like to thank Norwich University for the institutional resources necessary for conducting this research.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Tandel, G.S.; Biswas, M.; Kakde, O.G.; Tiwari, A.; Suri, H.S.; Turk, M.; Laird, J.; Asare, C.; Ankrah, A.A.; Khanna, N.N.; et al. A Review on a Deep Learning Perspective in Brain Cancer Classification. Cancers 2019, 11, 111. [Google Scholar] [CrossRef] [PubMed]
- van der Meer, P.B.; Dirven, L.; Hertler, C.; Boele, F.W.; Batalla, A.; Walbert, T.; Rooney, A.G.; Koekkoek, J.A. Depression and anxiety in glioma patients. Neuro-Oncol. Pract. 2023, 10, 335–343. [Google Scholar] [CrossRef] [PubMed]
- Liu, J.; Pan, Y.; Li, M.; Chen, Z.; Tang, L.; Lu, C.; Wang, J. Applications of deep learning to MRI images: A survey. Big Data Min. Anal. 2018, 1, 1–18. [Google Scholar] [CrossRef]
- Louis, D.N.; Perry, A.; Reifenberger, G.; von Deimling, A.; Figarella-Branger, D.; Cavenee, W.K.; Ohgaki, H.; Wiestler, O.D.; Kleihues, P.; Ellison, D.W. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: A Summary. Acta Neuropathol. 2016, 131, 803–820. [Google Scholar] [CrossRef] [PubMed]
- Department of Financial Regulation. State of Vermont Wait Times Report|Health Services Wait Times Report Findings. State of Vermont. 2022. Available online: https://dfr.vermont.gov/about-us/councils-and-commissions/health-services-wait-times (accessed on 22 May 2024).
- Nall, R. How Long Does It Take to Get MRI Results? 2021. Available online: https://www.healthline.com/health/how-long-does-it-take-to-get-results-from-mri#1 (accessed on 22 May 2024).
- Vamsi, B.; Al Bataineh, A.; Doppala, B.P. Prediction of Micro Vascular and Macro Vascular Complications in Type-2 Diabetic Patients using Machine Learning Techniques. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 19–32. [Google Scholar] [CrossRef]
- Al Bataineh, A.; Kaur, D.; Al-khassaweneh, M.; Al-sharoa, E. Automated CNN architectural design: A simple and efficient methodology for computer vision tasks. Mathematics 2023, 11, 1141. [Google Scholar] [CrossRef]
- Sarvamangala, D.R.; Kulkarni, R.V. Convolutional neural networks in medical image understanding: A survey. Evol. Intell. 2021, 15, 1–22. [Google Scholar] [CrossRef] [PubMed]
- Stančić, A.; Vyroubal, V.; Slijepčević, V. Classification Efficiency of Pre-Trained Deep CNN Models on Camera Trap Images. J. Imaging 2022, 8, 20. [Google Scholar] [CrossRef] [PubMed]
- Chaki, J.; Wozniak, M. Brain Tumor MRI Dataset. IEEE Dataport. 2023. Available online: https://ieee-dataport.org/documents/brain-tumor-mri-dataset (accessed on 22 May 2024).
- Xu, M.; Yoon, S.; Fuentes, A.; Park, D.S. A Comprehensive Survey of Image Augmentation Techniques for Deep Learning. Pattern Recognit. 2023, 137, 109347. [Google Scholar] [CrossRef]
- Alomar, K.; Aysel, H.I.; Cai, X. Data Augmentation in Classification and Segmentation: A Survey and New Strategies. J. Imaging 2023, 9, 46. [Google Scholar] [CrossRef] [PubMed]
- Tanwar, S.; Singh, J. ResNext50 based convolution neural network-long short term memory model for plant disease classification. Multimed. Tools Appl. 2023, 82, 29527–29545. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015. [Google Scholar]
- Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Fan, Y.J.; Tzeng, I.S.; Huang, Y.S.; Hsu, Y.Y.; Wei, B.C.; Hung, S.T.; Cheng, Y.L. Machine Learning: Using Xception, a Deep Convolutional Neural Network Architecture, to Implement Pectus Excavatum Diagnostic Tool from Frontal-View Chest X-rays. Biomedicines 2023, 11, 760. [Google Scholar] [CrossRef] [PubMed]
- Li, N.; Xue, J.; Wu, S.; Qin, K.; Liu, N. Research on Coal and Gangue Recognition Model Based on CAM-Hardswish with EfficientNetV2. Appl. Sci. 2023, 13, 8887. [Google Scholar] [CrossRef]
- Elshennawy, N.M.; Ibrahim, D.M. Deep-Pneumonia Framework Using Deep Learning Models Based on Chest X-ray Images. Diagnostics 2020, 10, 649. [Google Scholar] [CrossRef] [PubMed]
- Qassim, H.; Verma, A.; Feinzimer, D. Compressed residual-VGG16 CNN model for big data places image recognition. In Proceedings of the 2018 IEEE 8th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 8–10 January 2018. [Google Scholar] [CrossRef]
- Baldi, P.; Sadowski, P. The dropout learning algorithm. Artif. Intell. 2014, 210, 78–122. [Google Scholar] [CrossRef] [PubMed]
- Nuti, G.; Cross, A.I.; Rindler, P. Evidence-Based Regularization for Neural Networks. Mach. Learn. Knowl. Extr. 2022, 4, 1011–1023. [Google Scholar] [CrossRef]
- Shen, K.; Guo, J.; Xu, T.; Tang, S.; Wang, R.; Bian, J. A Study on ReLU and Softmax in Transformer. arXiv 2023, arXiv:2302.06461. [Google Scholar] [CrossRef]
- Bjorck, N.; Gomes, C.P.; Selman, B.; Weinberger, K.Q. Understanding Batch Normalization. arXiv 2018, arXiv:1806.02375. [Google Scholar]
- Yong, H.; Huang, J.; Meng, D.; Hua, X.; Zhang, L. Momentum Batch Normalization for Deep Learning with Small Batch Size. In Computer Vision—ECCV 2020, Proceedings of 16th European Conference, Glasgow, UK, 23–28 August 2020; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2020; pp. 224–240. [Google Scholar] [CrossRef]
- Balestriero, R.; Baraniuk, R.G. Batch Normalization Explained. arXiv 2022, arXiv:2209.14778. [Google Scholar] [CrossRef]
- Deng, Y.; Song, Z.; Zhou, T. Superiority of Softmax: Unveiling the Performance Edge Over Linear Attention. arXiv 2023, arXiv:2310.11685. [Google Scholar] [CrossRef]
- Hassan, E.; Shams, M.Y.; Hikal, N.A.; Elmougy, S. The effect of choosing optimizer algorithms to improve computer vision tasks: A comparative study. Multimed. Tools Appl. 2022, 82, 16591–16633. [Google Scholar] [CrossRef] [PubMed]
- Chong, K.L.; Huang, Y.F.; Koo, C.H.; Sherif, M.; Ahmed, A.N.; El-Shafie, A. Investigation of cross-entropy-based streamflow forecasting through an efficient interpretable automated search process. Appl. Water Sci. 2022, 13, 6. [Google Scholar] [CrossRef]
- Patel, N.; Shah, H.; Mewada, K. Enhancing Financial Data Visualization for Investment Decision-Making. arXiv 2023, arXiv:2403.18822. [Google Scholar] [CrossRef]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).