Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis †
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Preprocessing
2.2. Feature Importance
2.3. Train–Test Split
2.4. Machine Learning Models and Performance Metrics for Breast Cancer Prediction
2.5. Comparative Statistical Analysis
2.6. Optimal Model Selection
3. Results and Discussion
4. Statistical Test for Breast Cancer Detection
4.1. Friedman Test
4.2. Wilcoxon Signed-Rank Tests
- Wilcoxon Signed-Rank Tests (Best vs. Others)
4.3. Top Three Models’ Comparison
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Cakmak, Y.; Pacal, I. Enhancing Breast Cancer Diagnosis: A Comparative Evaluation of Machine Learning Algorithms Using the Wisconsin Dataset. J. Oper. Intell. 2025, 3, 175–196. [Google Scholar] [CrossRef]
- Kumar, A.; Saini, R.; Kumar, R. A Comparative Analysis of Machine Learning Algorithms for Breast Cancer Detection and Identification of Key Predictive Features. Trait. Signal 2024, 41, 127–140. [Google Scholar] [CrossRef]
- Jain, P.; Aggarwal, S.; Adam, S.; Imam, M. Parametric optimization and comparative study of machine learning and deep learning algorithms for breast cancer diagnosis. Breast Dis. 2024, 43, 257–270. [Google Scholar] [CrossRef] [PubMed]
- Fatima, A.; Shabbir, A.; Janjua, J.I.; Ramay, S.A.; Bhatty, R.A.; Irfan, M.; Abbas, T. Analyzing breast cancer detection using machine learning & deep learning techniques. J. Comput. Biomed. Inform. 2024, 7. [Google Scholar]
- Rafiepoor, H.; Ghorbankhanloo, A.; Zendehdel, K.; Madar, Z.Z.; Hajivalizadeh, S.; Hasani, Z.; Amanpour, S. Comparison of Machine Learning Models for Classification of Breast Cancer Risk Based on Clinical Data. Cancer Rep. 2025, 8, e70175. [Google Scholar] [CrossRef] [PubMed]
- Al-Imran, M.; Akter, S.; Mozumder, M.A.S.; Bhuiyan, R.J.; Rahman, T.; Ahmmed, M.J.; Hossen, M.E. Evaluating Machine Learning Algorithms for Breast Cancer Detection: A Study on Accuracy and Predictive Performance. Am. J. Eng. Technol. 2024, 6, 22–33. [Google Scholar] [CrossRef]
- Darwich, M.; Bayoumi, M. An evaluation of the effectiveness of machine learning prediction models in assessing breast cancer risk. Inform. Med. Unlocked 2024, 49, 101550. [Google Scholar] [CrossRef]
- Ashika, T.; Grace, H.; Martin, N.; Smarandache, F. Enhanced Neutrosophic Set and Machine Learning Approach for Breast Cancer Prediction; Infinite Study: Paris, France, 2024. [Google Scholar]


















| Models | Accuracy | Precision | Recall | F1-Score | AUC-ROC |
|---|---|---|---|---|---|
| Logistic Regression | 0.97 | 0.97 | 0.97 | 0.97 | 0.99 |
| K-Nearest Neighbor | 0.95 | 0.96 | 0.96 | 0.96 | 0.98 |
| Support Vector Machine | 0.95 | 0.96 | 0.96 | 0.96 | 0.99 |
| Decision Tree | 0.96 | 0.97 | 0.96 | 0.96 | 0.99 |
| Random Forest | 0.96 | 0.97 | 0.96 | 0.96 | 0.99 |
| Gradient Boosting | 0.95 | 0.96 | 0.96 | 0.96 | 0.99 |
| XGBoost | 0.95 | 0.96 | 0.96 | 0.96 | 0.99 |
| Naïve Bayes | 0.96 | 0.97 | 0.96 | 0.96 | 0.99 |
| AdaBoosting | 0.97 | 0.97 | 0.97 | 0.97 | 0.99 |
| Light GBM | 0.97 | 0.97 | 0.97 | 0.97 | 0.99 |
| CatBoost | 0.97 | 0.97 | 0.97 | 0.97 | 0.99 |
| Artificial Neural Network | 0.96 | 0.96 | 0.96 | 0.96 | 0.99 |
| Histogram Gradient Boosting | 0.97 | 0.97 | 0.95 | 0.96 | 0.99 |
| Gaussian Process Classifier | 0.96 | 0.97 | 0.96 | 0.96 | 0.99 |
| Ridge Classifier | 0.95 | 0.96 | 0.95 | 0.95 | 0.99 |
| Model | Library (Version) | Key Hyperparameters | Final Values |
|---|---|---|---|
| Decision Tree | scikit-learn (v1.3) | max_depth, min_samples_split | max_depth = 5, min_samples_split = 4 |
| Random Forest | scikit-learn (v1.3) | n_estimators, max_depth | n_estimators = 300, max_depth = 10 |
| Gradient Boosting | scikit-learn (v1.3) | n_estimators, learning_rate, max_depth | n_estimators = 300, learning_rate = 0.05, max_depth = 3 |
| AdaBoost | scikit-learn (v1.3) | n_estimators, learning_rate | n_estimators = 200, learning_rate = 0.1 |
| XGBoost | XGBoost (v1.7) | n_estimators, learning_rate, max_depth, subsample | n_estimators = 400, learning_rate = 0.05, max_depth = 5, subsample = 0.8 |
| LightGBM | LightGBM (v4.0) | n_estimators, learning_rate, max_depth, num_leaves | n_estimators = 500, learning_rate = 0.05, max_depth = 6, num_leaves = 31 |
| CatBoost | CatBoost (v1.2) | iterations, depth, learning_rate | iterations = 500, depth = 6, learning_rate = 0.1 |
| ANN | scikit-learn (MLP) | hidden layers, optimizer, batch size, epochs | layers = (30, 15), optimizer = Adam, batch size = 32, epochs = 100 |
| Friedman χ2 = 79.470, p = 0.000000 |
| Significant differences exist among models (p < 0.05) |
| Model | W-Statistic | p-Value | Significant |
|---|---|---|---|
| KNN | 0.0 | 0.0020 | True |
| SVM | 0.0 | 0.0020 | True |
| RF | 0.0 | 0.0020 | True |
| Gradient Boosting | 0.0 | 0.0020 | True |
| NB | 0.0 | 0.0020 | True |
| XGBoost | 0.0 | 0.0020 | True |
| Ridge Classifier | 0.0 | 0.0020 | True |
| Histogram GB | 0.0 | 0.0020 | True |
| GP classifier | 1.0 | 0.0039 | True |
| DT | 2.0 | 0.0059 | True |
| Model | p-Value |
|---|---|
| Logistic Regression vs. Light GBM | 0.0020 |
| Logistic Regression vs. AdaBoosting | 0.1309 |
| Light GBM vs. AdaBoosting | 0.0195 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Das, S.S.; Mahaprasad, A.; Padhy, N.; Misra, S.; Panigrahi, R.; Mahapatro, P.K.; Arangi, D. Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis. Eng. Proc. 2026, 124, 35. https://doi.org/10.3390/engproc2026124035
Das SS, Mahaprasad A, Padhy N, Misra S, Panigrahi R, Mahapatro PK, Arangi D. Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis. Engineering Proceedings. 2026; 124(1):35. https://doi.org/10.3390/engproc2026124035
Chicago/Turabian StyleDas, Sambit Subhankar, Atal Mahaprasad, Neelamadhab Padhy, Srikant Misra, Rasmita Panigrahi, Pradeep Kumar Mahapatro, and Dasaradha Arangi. 2026. "Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis" Engineering Proceedings 124, no. 1: 35. https://doi.org/10.3390/engproc2026124035
APA StyleDas, S. S., Mahaprasad, A., Padhy, N., Misra, S., Panigrahi, R., Mahapatro, P. K., & Arangi, D. (2026). Comparative Evaluation of Machine Learning Classifiers for Breast Cancer Diagnosis: A Comprehensive Statistical Analysis. Engineering Proceedings, 124(1), 35. https://doi.org/10.3390/engproc2026124035

