Federated Learning for Breast Cancer Classification: A Comparative Study of Aggregation Methods
Abstract
1. Introduction
- 1.
- Privacy and Security Concerns: Patient data is highly sensitive and subject to strict privacy regulations, such as the Health Insurance Portability and Accountability Act (HIPAA) and the General Data Protection Regulation (GDPR) [8,9]. Hospitals and medical institutions are often reluctant to share data due to ethical concerns and potential legal restrictions, limiting access to diverse and high-quality datasets.
- 2.
- 3.
- Computational and Storage Constraints: Transferring large-scale medical image datasets incurs high computational and storage costs, further hindering the feasibility of centralized learning approaches [12].
- How does FL perform in a binary breast cancer classification task?
- How do different FL aggregation methods (FedAvg, FedProx, FedNova, FedDyn, and SCAFFOLD) impact classification accuracy and convergence?
- How does data heterogeneity (balanced, imbalanced, non-IID, and non-homogeneous distributions) affect FL model performance?
- 1.
- We conduct a comprehensive comparative evaluation of five federated learning aggregation strategies (FedAvg, FedProx, FedNova, FedDyn, and SCAFFOLD) for breast cancer classification using a unified MobileNetV2 architecture.
- 2.
- We design five progressively challenging federated configurations combining IID, non-IID, class imbalance, multi-source heterogeneity, and client-specific image distortions to better simulate realistic cross-silo hospital environments.
- 3.
- We provide an in-depth analysis of aggregation method behavior under heterogeneous federated conditions, including client drift, distribution shift, and minority-class sensitivity, with particular emphasis on clinically relevant metrics such as recall and false-negative detection.
- 4.
- We demonstrate that aggregation strategies specifically designed to mitigate client heterogeneity provide improved robustness and stability compared to conventional averaging-based methods in decentralized medical imaging applications.
- 5.
- We highlight the importance of evaluating clinical reliability beyond overall accuracy in federated healthcare systems, showing that high accuracy may still correspond to poor malignant-case detection under highly imbalanced distributions.
2. Related Work
2.1. Deep Learning and the Limitations of Centralized Medical Imaging Approaches
2.2. Federated Learning in Breast Cancer Prediction and Medical Imaging
2.3. Key Federated Learning Algorithms for Medical Imaging
3. Methodology
3.1. Dataset Description
3.1.1. Data Sources
3.1.2. Preprocessing Steps
- Resizing: All images were resized to 224 × 224 pixels to match the input size required by CNN-based architectures like MobileNetV2.
- Channel conversion: Grayscale images were converted to RGB to match pretrained model expectations.
- Normalization: Pixel intensities were scaled to a [0, 1] range.
- Augmentation: Techniques such as rotation, flipping, zooming, and contrast adjustments were used to increase dataset diversity and model robustness.
- CLAHE (for contrast enhancement): Applied on selected datasets to enhance visibility of tissue structures [42].
3.2. Data and Client Distribution Configurations
3.2.1. Configuration 1: IID and Class-Balanced Data Across Clients
3.2.2. Configuration 2: Imbalanced Data Skewed Toward the Benign Class
3.2.3. Configuration 3: Non-IID Multi-Source Data Across Clients with Varying Class Imbalances
3.2.4. Configuration 4: Highly Imbalanced, Non-Homogeneous, and Non-IID Multi-Source Data Skewed Toward Benign Class
3.2.5. Configuration 5: Non-IID Multi-Source Data with Client Specific Image Distortions
- 1.
- Client 1:
- RandomRotation (15°): Rotates the image randomly within ±15 degrees.
- ColorJitter (brightness & contrast adjustment): Slightly changes brightness and contrast to mimic different lighting conditions.
- 2.
- Client 2:
- Resize (200, 200): Resizes images to a slightly smaller dimension than default (224, 224).
- RandomHorizontalFlip: Flips images horizontally with a 50% probability.
- Random Gaussian Blur: Applies a Gaussian blur with a probability of 50%.
- 3.
- Client 3:
- Resize (256, 256): Increases image size before training.
- RandomVerticalFlip: Flips images vertically with a 50% probability.
- RandomAffine (Shear transformation): Applies random shearing to distort the image.
- 4.
- Client 4:
- RandomResizedCrop (224, 224, scale = (0.8, 1.0)): Crops a portion of the image randomly while maintaining aspect ratio.
- AdjustSharpness (Factor = 2): Enhances sharpness to create variations in edge clarity.
- 5.
- Client 5:
- ColorJitter (Saturation boost): Increases saturation in colors randomly with a 50% probability.
- RandomGrayscale: Converts some images to grayscale with a probability of 30%.
3.3. Federated Aggregation Methods
4. Experimental Protocol
4.1. Federated Learning Setup
Model Architecture
4.2. Training Strategy
4.2.1. Training Parameters
4.2.2. Evaluation Metrics
5. Results and Discussion
5.1. Aggregation Methods Comparison
5.1.1. Configuration 1: Results of IID and Class-Balanced Setting
5.1.2. Configuration 2: Results of Imbalanced Data Setting
5.1.3. Configuration 3: Results of Non-IID Multi-Source, Imbalanced Data Setting
5.1.4. Configuration 4: Results of Highly Imbalanced, Non-Homogeneous, and Non-IID Multi-Source Data Setting
5.1.5. Configuration 5: Results of Non-IID Multi-Source Data with Client Specific Image Distortions
5.2. Performance Summary Across All Configurations
5.3. Best Performing Aggregation Methods
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| FL | Federated Learning |
| CNN | Convolutional Neural Network |
| AI | Artificial Intelligence |
| XAI | Explainable Artificial Intelligence |
| AUC | Area Under the Curve |
| HIPAA | Health Insurance Portability and Accountability Act |
| GDPR | General Data Protection Regulation |
| CLAHE | Contrast-Limited Adaptive Histogram Equalization |
| RSNA | Radiological Society of North America |
| DDSM | Digital Database for Screening Mammography |
| MIAS | Mammographic Image Analysis Society |
| INbreast | Full-field Digital Mammography Database |
| FedAvg | Federated Averaging |
| FedProx | Federated Proximal |
| FedNova | Federated Normalized Averaging |
| FedDyn | Federated Dynamics |
| SCAFFOLD | Stochastic Controlled Averaging for Federated Learning |
| IID | Independent and Identically Distributed |
| Non-IID | Non-Independent and Identically Distributed |
References
- World Health Organization. Breast Cancer: Prevention and Control. 2023. Available online: https://www.who.int/ (accessed on 30 June 2025).
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Litjens, G.; Kooi, T.; Bejnordi, B.E.; Setio, A.A.A.; Ciompi, F.; Ghafoorian, M.; van der Laak, J.A.W.M.; van Ginneken, B.; Sánchez, C.I. A survey on deep learning in medical image analysis. Med. Image Anal. 2017, 42, 60–88. [Google Scholar] [CrossRef] [PubMed]
- Esteva, A.; Kuprel, B.; Novoa, R.A.; Ko, J.; Swetter, S.M.; Blau, H.M.; Thrun, S. Dermatologist-level classification of skin cancer with deep neural networks. Nature 2017, 542, 115–118. [Google Scholar] [CrossRef] [PubMed]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 4510–4520. [Google Scholar] [CrossRef]
- Rieke, N.; Hancox, J.; Li, W.; Milletarì, F.; Roth, H.R.; Albarqouni, S.; Bakas, S.; Galtier, M.N.; Landman, B.A.; Maier-Hein, K.; et al. The future of digital health with federated learning. npj Digit. Med. 2020, 3, 119. [Google Scholar] [CrossRef]
- Health Insurance Portability and Accountability Act (HIPAA). U.S. Federal Law 1996. 1996. Available online: https://www.hhs.gov/hipaa/index.html (accessed on 30 June 2025).
- General Data Protection Regulation (GDPR). EU Regulation 2016/679. 2016. Available online: https://eur-lex.europa.eu/eli/reg/2016/679/oj (accessed on 30 June 2025).
- Lundervold, A.S.; Lundervold, A. An overview of deep learning in medical imaging focusing on MRI. Z. Med. Phys. 2019, 29, 102–127. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2015; Volume 9351, pp. 234–241. [Google Scholar] [CrossRef]
- Sheller, M.J.; Edwards, B.; Reina, G.A.; Martin, J.; Bakas, S. Federated learning in medicine: Facilitating multi-institutional collaborations without sharing patient data. Sci. Rep. 2020, 10, 12598. [Google Scholar] [CrossRef]
- McMahan, H.B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B.A. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), Lauderdale, FL, USA, 20–22 April 2017. [Google Scholar]
- Yang, Q.; Liu, Y.; Chen, T.; Tong, Y. Federated Machine Learning: Concept and Applications. ACM Trans. Intell. Syst. Technol. 2019, 10, 12. [Google Scholar] [CrossRef]
- Kaissis, G.A.; Makowski, M.R.; Rückert, D.; Braren, R.F. Secure, privacy-preserving and federated machine learning in medical imaging. Nat. Mach. Intell. 2020, 2, 305–311. [Google Scholar] [CrossRef]
- Almufareh, M.F.; Tariq, N.; Humayun, M.; Almas, B. A Federated Learning Approach to Breast Cancer Prediction in a Collaborative Learning Framework. Healthcare 2023, 11, 3185. [Google Scholar] [CrossRef]
- Gupta, C.; Gill, N.S.; Gulia, P.; Alduaiji, N.; Shreyas, J.; Shukla, P.K. Applying YOLOv6 as an ensemble federated learning framework to classify breast cancer pathology images. Sci. Rep. 2025, 15, 12345. [Google Scholar] [CrossRef]
- Kairouz, P.; McMahan, H.B.; Avent, B.; Bellet, A.; Bennis, M.; Nitin Bhagoji, A.; Bonawitz, K.; Charles, Z.; Cormode, G.; Cummings, R.; et al. Advances and Open Problems in Federated Learning. Found. Trends. Mach. Learn. 2021, 14, 1–210. [Google Scholar] [CrossRef]
- Li, T.; Sahu, A.K.; Talwalkar, A.; Smith, V. Federated Optimization in Heterogeneous Networks. Proc. Mach. Learn. Syst. 2020, 2, 429–450. [Google Scholar]
- Wang, J.; Liu, Q.; Liang, H.; Joshi, G.; Poor, H.V. Tackling the objective inconsistency problem in heterogeneous federated optimization. In Proceedings of the 34th International Conference on Neural Information Processing Systems (NIPS ’20), Red Hook, NY, USA, 6–12 December 2020. [Google Scholar]
- Acar, D.A.E.; Zhao, Y.; Matas, R.; Mattina, M.; Whatmough, P.N.; Saligrama, V. Federated Learning Based on Dynamic Regularization. In Proceedings of the International Conference on Learning Representations (ICLR), Vienna, Austria, 3–7 May 2021. [Google Scholar]
- Karimireddy, S.P.; Kale, S.; Mohri, M.; Reddi, S.J.; Stich, S.U.; Suresh, A.T. SCAFFOLD: Stochastic controlled averaging for federated learning. In Proceedings of the 37th International Conference on Machine Learning (ICML ’20), Virtual Event, 13–18 July 2020. [Google Scholar]
- Spanhol, F.A.; Oliveira, L.S.; Petitjean, C.; Heutte, L. A Dataset for Breast Cancer Histopathological Image Classification. IEEE Trans. Biomed. Eng. 2016, 63, 1455–1462. [Google Scholar] [CrossRef]
- Ragab, D.A.; Attallah, O.; Sharkas, M.; Ren, J.; Marshall, S. A framework for breast cancer classification using Multi-DCNNs. Comput. Biol. Med. 2021, 131, 104245. [Google Scholar] [CrossRef]
- Arevalo, J.; González, F.A.; Ramos-Pollán, R.; Oliveira, J.L.; Guevara Lopez, M.A. Representation learning for mammography mass lesion classification with convolutional neural networks. Comput. Methods Programs Biomed. 2016, 127, 248–257. [Google Scholar] [CrossRef]
- Castro, D.; Walker, I.; Glocker, B. Causality matters in medical imaging. Nat. Commun. 2020, 11, 3673. [Google Scholar] [CrossRef] [PubMed]
- Sheller, M.J.; Reina, G.A.; Edwards, B.; Martin, J.; Bakas, S. Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation. In International MICCAI Brainlesion Workshop; Springer: Cham, Switzerland, 2019; Volume 11383, pp. 92–104. [Google Scholar] [CrossRef]
- Li, X.; Gu, Y.; Dvornek, N.; Staib, L.H.; Ventola, P.; Duncan, J.S. Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results. Med. Image Anal. 2020, 65, 101765. [Google Scholar] [CrossRef]
- Li, L.; Xie, N.; Yuan, S. A Federated Learning Framework for Breast Cancer Histopathological Image Classification. Electronics 2022, 11, 3767. [Google Scholar] [CrossRef]
- Al-Hejri, A.M.; Sable, A.H.; Al-Tam, R.M.; Al-antari, M.A.; Alshamrani, S.S.; Alshmrany, K.M.; Alatebi, W. A hybrid explainable federated-based vision transformer framework for breast cancer prediction via risk factors. Sci. Rep. 2025, 15, 18453. [Google Scholar] [CrossRef] [PubMed]
- Alhussan, A.A.; Nhidi, W.; Filali, I.; Benhmida, F.; Ejbali, R. Federated Learning Architecture for 3D Breast Cancer Image Classification. Cancers 2025, 17, 3450. [Google Scholar] [CrossRef]
- Jiménez-Sánchez, A.; Tardy, M.; González Ballester, M.A.; Mateus, D.; Piella, G. Memory-aware curriculum federated learning for breast cancer classification. Comput. Methods Programs Biomed. 2022, 226, 107318. [Google Scholar] [CrossRef]
- Guan, H.; Yap, P.-T.; Bozoki, A.; Liu, M. Federated Learning for Medical Image Analysis: A Survey. Pattern Recognit. 2024, 151, 110424. [Google Scholar] [CrossRef] [PubMed]
- Briola, E.; Nikolaidis, C.C.; Perifanis, V.; Pavlidis, N.; Efraimidis, P. A Federated Explainable AI Model for Breast Cancer Classification. In Proceedings of the 2024 European Interdisciplinary Cybersecurity Conference (EICC ’24), New York, NY, USA, 5–6 June 2024; pp. 194–201. [Google Scholar] [CrossRef]
- Rai, A.; Chen, X.; Mou, S. Control-Inspired Federated Learning: A Projection-Based Approach. IFAC-PapersOnLine 2025, 59, 109–114. [Google Scholar] [CrossRef]
- Yuan, X.; Li, P. On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond. In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2022; Volume 35, pp. 10752–10765. [Google Scholar]
- Kang, M.; Kim, S.; Jin, K.H.; Adeli, E.; Pohl, K.M.; Park, S.H. FedNN: Federated Learning on Concept Drift Data Using Weight and Adaptive Group Normalizations. Pattern Recognit. 2024, 149, 110230. [Google Scholar] [CrossRef]
- Yao, T.; Li, J.; Liu, J. FedAWR: Aggregation Optimization in Federated Learning with Adaptive Weights and Learning Rates. Future Internet 2026, 18, 106. [Google Scholar] [CrossRef]
- Moreira, I.C.; Amaral, I.; Domingues, I.; Cardoso, A.; Cardoso, M.J.; Cardoso, J.S. INbreast: Toward a Full-Field Digital Mammographic Database. Acad. Radiol. 2012, 19, 236–248. [Google Scholar] [CrossRef] [PubMed]
- Suckling, J.; Parker, J.; Dance, D.; Astley, S.; Hutt, I.; Boggis, C.; Ricketts, I.; Stamatakis, E.; Cerneaz, N.; Kok, S.; et al. Mammographic Image Analysis Society (MIAS) Database v1.21; Apollo—University of Cambridge Repository: Cambridge, UK, 2015. [Google Scholar] [CrossRef]
- Heath, M.; Bowyer, K.; Kopans, D.; Moore, R.; Kegelmeyer, W.P. The digital database for screening mammography. In Proceedings of the 5th International Workshop on Digital Mammography, Toronto, ON, Canada, 11–14 June 2001. [Google Scholar]
- Pisano, E.D.; Zong, S.; Hemminger, B.M.; DeLuca, M.; Johnston, R.E.; Muller, K.; Braeuning, M.P.; Pizer, S.M. Contrast limited adaptive histogram equalization image processing to improve the detection of simulated spiculations in dense mammograms. J. Digit. Imaging 1998, 11, 193–200. [Google Scholar] [CrossRef]
- The Radiological Society of North America. RSNA Screening Mammography Breast Cancer Detection Dataset. 2022. Available online: https://www.kaggle.com/competitions/rsna-breast-cancer-detection (accessed on 2 July 2025).





| Study | Year | Application | FL Method | Heterogeneity Setting | Main Limitation |
|---|---|---|---|---|---|
| Sheller et al. [27] | 2020 | Brain tumor segmentation | FedAvg | Multi-institutional | Limited aggregation comparison and imbalance analysis |
| Li et al. [29] | 2022 | Histopathological breast cancer classification | FedAvg | Non-IID datasets | Limited heterogeneity scenarios |
| Almufareh et al. [16] | 2023 | Breast cancer prediction | Federated framework | Multi-hospital setup | Limited robustness analysis under severe non-IID conditions |
| Briola et al. [34] | 2024 | Explainable breast cancer FL | XAI-based FL | Moderate heterogeneity | Focused mainly on interpretability rather than aggregation robustness |
| Gupta et al. [17] | 2025 | Breast cancer pathology classification | FedAvg ensemble | Limited non-IID | No analysis of client-specific distortions or severe imbalance |
| Al-Hejri et al. [30] | 2025 | Mammogram classification | Hybrid ViT-CNN FL | Multi-class FL setting | Primarily architecture-focused with limited aggregation comparison |
| Proposed Work | 2026 | Breast cancer mammogram classification | FedAvg, FedProx, FedNova, FedDyn, SCAFFOLD | Severe imbalance, non-IID multi-source data, and client-specific distortions | Comprehensive comparative robustness analysis under realistic federated healthcare conditions |
| Client | Total Cases | Benign | Malignant |
|---|---|---|---|
| Client1 | 1000 | 500 | 500 |
| Client2 | 1000 | 500 | 500 |
| Client3 | 1000 | 500 | 500 |
| Client4 | 1000 | 500 | 500 |
| Client5 | 1000 | 500 | 500 |
| Client | Total Cases | Benign | Malignant |
|---|---|---|---|
| Client1 | 1350 | 1350 | 0 |
| Client2 | 1500 | 1200 | 300 |
| Client3 | 1500 | 1050 | 450 |
| Client4 | 1500 | 900 | 600 |
| Client5 | 1500 | 750 | 750 |
| Dataset/Client | Total Cases | Benign | Malignant |
|---|---|---|---|
| Client1 (DDSM) | 4000 | 3400 | 600 |
| Client2 (INbreast) | 3000 | 300 | 2700 |
| Client3 (MIAS) | 2400 | 1500 | 900 |
| Client4 (DDSM) | 1050 | 900 | 150 |
| Client5 (CLAHE) | 3150 | 2700 | 450 |
| Dataset/Client | Total Cases | Benign | Malignant (%) |
|---|---|---|---|
| Client1 (DDSM) | 378 | 336 | 42 (11.1%) |
| Client2 (CLAHE) | 1180 | 1086 | 94 (8.0%) |
| Client3 (RSNA) | 5470 | 5355 | 115 (2.1%) |
| Client4 (MIAS) | 247 | 238 | 9 (3.6%) |
| Client5 (INbreast) | 285 | 252 | 33 (11.6%) |
| Aggregation Method | Loss | Accuracy (%) | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| FedAvg | 0.019 | 99.32 | 0.994 | 0.992 | 0.992 | 0.993 |
| FedProx | 0.052 | 98.00 | 0.968 | 0.992 | 0.991 | 0.980 |
| FedNova | 0.043 | 98.36 | 0.992 | 0.975 | 0.975 | 0.984 |
| FedDyn | 0.042 | 98.44 | 0.992 | 0.977 | 0.977 | 0.985 |
| SCAFFOLD | 0.042 | 99.28 | 0.990 | 0.996 | 0.996 | 0.993 |
| Label | Predicted Label | Aggregation Method | ||
|---|---|---|---|---|
| Benign | Malignant | |||
| True Label | Benign | 2481 | 19 | FedAvg |
| 2479 | 21 | FedProx | ||
| 2437 | 63 | FedNova | ||
| 2442 | 58 | FedDyn | ||
| 2490 | 10 | SCAFFOLD | ||
| Malignant | 15 | 2485 | FedAvg | |
| 79 | 2421 | FedProx | ||
| 19 | 2481 | FedNova | ||
| 20 | 2480 | FedDyn | ||
| 26 | 2474 | SCAFFOLD | ||
| Aggregation Method | Loss | Accuracy (%) | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| FedAvg | 0.024 | 99.224 | 0.9986 | 0.9897 | 0.9750 | 0.9866 |
| FedProx | 0.454 | 91.156 | 1.0000 | 0.8762 | 0.7636 | 0.8660 |
| FedNova | 0.331 | 92.871 | 0.9995 | 0.9004 | 0.8005 | 0.8890 |
| FedDyn | 0.407 | 89.755 | 0.9995 | 0.8568 | 0.7365 | 0.8481 |
| SCAFFOLD | 0.181 | 91.306 | 1.0000 | 0.8783 | 0.7666 | 0.8681 |
| Label | Predicted Label | Aggregation Method | ||
|---|---|---|---|---|
| Benign | Malignant | |||
| True Label | Benign | 5196 | 54 | FedAvg |
| 4600 | 650 | FedProx | ||
| 4727 | 523 | FedNova | ||
| 4498 | 752 | FedDyn | ||
| 4611 | 639 | SCAFFOLD | ||
| Malignant | 3 | 2097 | FedAvg | |
| 0 | 2100 | FedProx | ||
| 1 | 2099 | FedNova | ||
| 1 | 2099 | FedDyn | ||
| 0 | 2100 | SCAFFOLD | ||
| Aggregation Method | Loss | Accuracy (%) | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| FedAvg | 2.384 | 65.022 | 0.009 | 1.000 | 1.000 | 0.017 |
| FedProx | 0.298 | 86.618 | 0.773 | 0.917 | 0.836 | 0.803 |
| FedNova | 2.189 | 65.764 | 0.030 | 1.000 | 1.000 | 0.058 |
| FedDyn | 0.309 | 84.846 | 0.591 | 0.989 | 0.966 | 0.733 |
| SCAFFOLD | 0.480 | 75.434 | 0.623 | 0.827 | 0.661 | 0.642 |
| Label | Predicted Label | Aggregation Method | ||
|---|---|---|---|---|
| Benign | Malignant | |||
| True Label | Benign | 8800 | 0 | FedAvg |
| 8071 | 729 | FedProx | ||
| 8800 | 0 | FedNova | ||
| 8701 | 99 | FedDyn | ||
| 7267 | 1533 | SCAFFOLD | ||
| Malignant | 4757 | 43 | FedAvg | |
| 1091 | 3709 | FedProx | ||
| 4656 | 144 | FedNova | ||
| 1962 | 2838 | FedDyn | ||
| 1808 | 2992 | SCAFFOLD | ||
| Aggregation Method | Loss | Accuracy (%) | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| FedAvg | 0.324 | 97.593 | 0.392 | 0.9990 | 0.966 | 0.558 |
| FedProx | 0.078 | 98.228 | 0.567 | 0.9990 | 0.960 | 0.712 |
| FedNova | 0.298 | 97.646 | 0.392 | 0.9990 | 0.966 | 0.558 |
| FedDyn | 0.049 | 99.286 | 0.823 | 0.9997 | 0.992 | 0.900 |
| SCAFFOLD | 0.053 | 98.836 | 0.700 | 1.0000 | 1.000 | 0.823 |
| Label | Predicted Label | Aggregation Method | ||
|---|---|---|---|---|
| Benign | Malignant | |||
| True Label | Benign | 7263 | 4 | FedAvg |
| 7260 | 7 | FedProx | ||
| 7263 | 4 | FedNova | ||
| 7265 | 2 | FedDyn | ||
| 7267 | 0 | SCAFFOLD | ||
| Malignant | 178 | 115 | FedAvg | |
| 127 | 166 | FedProx | ||
| 178 | 115 | FedNova | ||
| 52 | 241 | FedDyn | ||
| 88 | 205 | SCAFFOLD | ||
| Aggregation Method | Loss | Accuracy (%) | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| FedAvg | 0.166 | 96.124 | 0.000 | 1.000 | 0.000 | 0.000 |
| FedProx | 0.098 | 98.466 | 0.604 | 1.000 | 1.000 | 0.753 |
| FedNova | 0.172 | 96.124 | 0.000 | 1.000 | 0.000 | 0.000 |
| FedDyn | 0.118 | 96.217 | 0.024 | 1.000 | 1.000 | 0.047 |
| SCAFFOLD | 0.151 | 96.124 | 0.000 | 1.000 | 0.000 | 0.000 |
| Label | Predicted Label | Aggregation Method | ||
|---|---|---|---|---|
| Benign | Malignant | |||
| True Label | Benign | 7267 | 0 | FedAvg |
| 7267 | 0 | FedProx | ||
| 7267 | 0 | FedNova | ||
| 7267 | 0 | FedDyn | ||
| 7267 | 0 | SCAFFOLD | ||
| Malignant | 293 | 0 | FedAvg | |
| 116 | 177 | FedProx | ||
| 293 | 0 | FedNova | ||
| 286 | 7 | FedDyn | ||
| 293 | 0 | SCAFFOLD | ||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Lachemi, N.S.; Merati, M.; Mahmoudi, S. Federated Learning for Breast Cancer Classification: A Comparative Study of Aggregation Methods. Information 2026, 17, 545. https://doi.org/10.3390/info17060545
Lachemi NS, Merati M, Mahmoudi S. Federated Learning for Breast Cancer Classification: A Comparative Study of Aggregation Methods. Information. 2026; 17(6):545. https://doi.org/10.3390/info17060545
Chicago/Turabian StyleLachemi, Nadjat Saàdia, Medjeded Merati, and Saïd Mahmoudi. 2026. "Federated Learning for Breast Cancer Classification: A Comparative Study of Aggregation Methods" Information 17, no. 6: 545. https://doi.org/10.3390/info17060545
APA StyleLachemi, N. S., Merati, M., & Mahmoudi, S. (2026). Federated Learning for Breast Cancer Classification: A Comparative Study of Aggregation Methods. Information, 17(6), 545. https://doi.org/10.3390/info17060545

