This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
MFedBN: Tackling Data Heterogeneity with Gradient-Based Aggregation and Advanced Distribution Skew Modeling
1
Faculty of Computer Science and Technology, Saint Petersburg Electrotechnical University “LETI”, Saint Petersburg 197376, Russia
2
Faculty of Electrical and Electronic Engineering, Aleppo University, Aleppo 12212, Syria
3
Faculty of Electrical Engineering and Automation, Saint Petersburg Electrotechnical University “LETI”, Saint Petersburg 197376, Russia
*
Authors to whom correspondence should be addressed.
Sensors 2025, 25(23), 7314; https://doi.org/10.3390/s25237314 (registering DOI)
Submission received: 21 October 2025
/
Revised: 21 November 2025
/
Accepted: 29 November 2025
/
Published: 1 December 2025
Abstract
Federated Learning (FL) enables collaborative model training on smart edge devices while preserving data privacy, but it suffers from decreased performance when faced with non-Independent and Identically Distributed (non-IID) data. This paper addresses the problem of the evaluation of aggregation strategies in non-IID FL environments, and it proposes an approach to generation of the skewed datasets with different types of non-IIDness from one dataset: with Feature Distribution Skew; with Label Distribution Skew; with Same Label, Different Features skew; and with Same Features, Different Label skew. The authors also introduce a Modified Federated via Local Batch Normalization (MFedBN), which improves model convergence and robustness across various non-IID data skews by implementing a server-side gradient-style update with several Learning Rate values tested within the aggregated function. Experimental evaluation of the MFedBN strategy was conducted on two heterogeneous datasets, namely, the Commercial Vehicles Sensor dataset designed for monitoring vehicle behavior and the NF-UNSW-NB15 dataset for cybersecurity threat detection. In the majority of cases, the MFedBN algorithm outperformed the baseline FedBN, with test accuracies of up to 85% on the Commercial Vehicles Sensor dataset and 99.98% on the NF-UNSW-NB15 dataset. The model trained with MFedBN showed convergence stability and improved generalization in highly heterogeneous federated environments. The proposed algorithm and data generation methods establish a viable platform for privacy-preserving applications in IoT-based monitoring and network intrusion detection, advancing the validity of Federated Learning in real-world, non-IID conditions.
Share and Cite
MDPI and ACS Style
Mreish, K.; Novikova, E.; Chaplygin, M.; Kholod, I.; Alnajar, T.
MFedBN: Tackling Data Heterogeneity with Gradient-Based Aggregation and Advanced Distribution Skew Modeling. Sensors 2025, 25, 7314.
https://doi.org/10.3390/s25237314
AMA Style
Mreish K, Novikova E, Chaplygin M, Kholod I, Alnajar T.
MFedBN: Tackling Data Heterogeneity with Gradient-Based Aggregation and Advanced Distribution Skew Modeling. Sensors. 2025; 25(23):7314.
https://doi.org/10.3390/s25237314
Chicago/Turabian Style
Mreish, Kinda, Evgenia Novikova, Mikhail Chaplygin, Ivan Kholod, and Tarek Alnajar.
2025. "MFedBN: Tackling Data Heterogeneity with Gradient-Based Aggregation and Advanced Distribution Skew Modeling" Sensors 25, no. 23: 7314.
https://doi.org/10.3390/s25237314
APA Style
Mreish, K., Novikova, E., Chaplygin, M., Kholod, I., & Alnajar, T.
(2025). MFedBN: Tackling Data Heterogeneity with Gradient-Based Aggregation and Advanced Distribution Skew Modeling. Sensors, 25(23), 7314.
https://doi.org/10.3390/s25237314
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.