Stroke Prediction Using Machine Learning Algorithms †
Abstract
1. Introduction
2. Literature Review
3. Methodology
3.1. Random Forest (RF)
3.2. Gradient Boosting Machines (GBMs)
3.3. Support Vector Machines (SVMs)
3.4. Convolutional Neural Networks (CNNs)
3.5. SMOTE (Synthetic Minority Oversampling Technique)
3.6. Ensemble Methods
3.7. Framework
3.8. Dataset Preparation
3.9. Data Preprocessing
3.10. Model Selection and Training
- Random Forest (RF): An ensemble method that builds multiple decision trees to improve prediction accuracy and reduce overfitting. It handles high-dimensional data well and can capture complex relationships in clinical features.
- Gradient Boosting Machines (GBMs): A sequential ensemble technique that builds models iteratively, with each model correcting errors made by the previous one. A GBM will enhance prediction by focusing on residual errors and handling imbalanced data effectively.
- Support Vector Machines (SVMs): A classification algorithm that maximizes the margin between classes, suitable for high-dimensional spaces and offering high accuracy for stroke prediction.
- Convolutional Neural Networks (CNNs): Applied to MRI scan data, CNNs excel at extracting features from images and identifying stroke-related abnormalities. These networks capture spatial hierarchies in imaging data, aiding stroke detection.
- Each model is trained using the pre-processed data, and hyperparameters are optimized through techniques like grid search or random search to improve predictive performance.
3.11. Model Evaluation
3.12. Model Integration and Deployment
3.13. Performance Optimization
4. Results
5. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Ben Kahla, M.; Kanzari, D.; Ben Amor, S.; Ayachi Ghannouchi, S.; Martinho, R. Enhanced Fuzzy Score-Based Decision Support System for Early Stroke Prediction. ACM Trans. Comput. Healthc. 2025, 6, 1–23. [Google Scholar] [CrossRef]
- Byna, A.; Lakulu, M.M.; Panessai, I.Y. Machine Learning-Based Stroke Prediction: A Critical Analysis. Int. J. Adv. Sci. Eng. Inf. Technol. 2024, 14, 1609–1618. [Google Scholar] [CrossRef]
- Sitanaboina, S.L.P.; Aruna Devi, B.; Kulkarni, G.L.; Murugan, S.; Vijayammal, B.K.P.; Neha. Exploring feature relationships in brain stroke data using polynomial feature transformation and linear regression modeling. J. Mach. Comput. 2024, 4, 1158–1169. [Google Scholar] [CrossRef]
- Saleem, M.A.; Javeed, A.; Akarathanawat, W.; Chutinet, A.; Suwanwela, N.C.; Kaewplung, P.; Benjapolakul, W. An intelligent learning system based on electronic health records for unbiased stroke prediction. Sci. Rep. 2024, 14, 23052. [Google Scholar] [CrossRef] [PubMed]
- Liu, T.; Fan, W.; Wu, C. A hybrid machine learning approach to cerebral stroke prediction based on imbalanced medical dataset. Artif. Intell. Med. 2019, 101, 101723. [Google Scholar] [CrossRef] [PubMed]
- Matsulevits, A.; Alvez, P.; Atzori, M.; Beyh, A.; Corbetta, M.; Del Pup, F.; de Schotten, M.T. Benchmarking Stroke Outcome Prediction through Comprehensive Data Analysis–NeuralCup 2023. bioRxiv 2024, 2024-10. [Google Scholar]
- Khatri, P.; Sharma, A. An Optimized Machine Learning-Based Stroke Prediction: Enhancing Precision Medicine and Public Health. In Proceedings of the 2024 International Conference on Data Science and Network Security (ICDSNS), Tiptur, India, 26–27 July 2024; pp. 1–6. [Google Scholar]
- Asan Nainar, M. Predictive modeling for brain stroke detection using machine learning. Int. J. Sci. Res. Eng. Manag. 2024, 8, 1–5. [Google Scholar] [CrossRef]
- Byna, A.; Lakulu, M.M.; Panessai, I.Y. Current critical review on prediction stroke using machine learning. Bull. Electr. Eng. Inform. 2024, 13, 3470–3480. [Google Scholar] [CrossRef]
- Wu, D.; Zhang, X.; Zhu, X. A machine learning-based model for stroke prediction. J. Biomed. Eng. Res. 2024, 67, 20240645. [Google Scholar] [CrossRef]
- Sahriar, S.; Akther, S.; Mauya, J.; Amin, R.; Mia, M.S.; Ruhi, S.; Reza, M.S. Unlocking stroke prediction: Harnessing projection-based statistical feature extraction with ML algorithms. Heliyon 2024, 10, e27411. [Google Scholar] [CrossRef]
- Aldughayfiq, B.; Ashfaq, F.; Jhanjhi, N.; Humayun, M. Capturing semantic relationships in electronic health records using knowledge graphs: An implementation using mimic iii dataset and graphdb. Healthcare 2023, 11, 1762. [Google Scholar] [CrossRef] [PubMed]
- Aldughayfiq, B.; Ashfaq, F.; Jhanjhi, N.; Humayun, M. YOLOv5-FPN: A robust framework for multi-sized cell counting in fluorescence images. Diagnostics 2023, 13, 2280. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kanwal, N.; Javaid, S.; Dewi, D.D. Stroke Prediction Using Machine Learning Algorithms. Eng. Proc. 2025, 107, 36. https://doi.org/10.3390/engproc2025107032
Kanwal N, Javaid S, Dewi DD. Stroke Prediction Using Machine Learning Algorithms. Engineering Proceedings. 2025; 107(1):36. https://doi.org/10.3390/engproc2025107032
Chicago/Turabian StyleKanwal, Nayab, Sabeen Javaid, and Dhita Diana Dewi. 2025. "Stroke Prediction Using Machine Learning Algorithms" Engineering Proceedings 107, no. 1: 36. https://doi.org/10.3390/engproc2025107032
APA StyleKanwal, N., Javaid, S., & Dewi, D. D. (2025). Stroke Prediction Using Machine Learning Algorithms. Engineering Proceedings, 107(1), 36. https://doi.org/10.3390/engproc2025107032