Personalized Federated Learning Incorporating Adaptive Model Pruning at the Edge
Abstract
1. Introduction
- Proposing and implementing a federated learning framework based on personalized pruning. Existing research combining federated learning and model pruning tends to focus on pruning the global model on the server, with insufficient discussion on the heterogeneity of client-side models. This framework uses heterogeneous models to address Non-IID problems while achieving good generalization effects;
- Introducing a mask decision algorithm based on internal layer characteristics, enabling the fastest discovery of winning lottery tickets and reducing the time cost of one-shot pruning in the initialization stage to one-tenth of the baseline method [16];
- Utilizing sparse models for effective parameter compression, significantly reducing the computation and communication overhead of edge nodes. With almost unchanged model accuracy, our method achieves a model compression ratio far below the baseline method (3.8%) and a 46.4% improvement in training speed compared to the baseline method [16].
2. Related Works
2.1. Personalized Federated Learning
2.2. Model Pruning
- Pruning methods designed to accelerate training speed often sacrifice significant accuracy, failing to strike a balance between speed and precision;
- Simulations based on a single-machine approach to federated learning may not accurately reflect the actual performance, disregarding communication costs in distributed execution environments. This discrepancy between simulated results and real-world performance could be substantial;
- Conventional pruning methods find it challenging to protect crucial weights (winning ticket), while LTH-based pruning methods incur significant computational costs, failing to strike a balance between computational costs and pruning effectiveness.
3. Personalized Federated Learning Algorithms with Edge-Side Pruning
3.1. Preliminaries
3.2. Adaptive Pruning for Local Training
| Algorithm 1 Layerwise Pruning | 
| 
 | 
| Algorithm 2 Local Pruning | 
| 
 | 
3.3. Federated Learning Method based on Adaptive Pruning
| Algorithm 3 Global Merge Algorithm | 
| 
 | 
4. Experimental Results
4.1. Pruning Structural Evaluation
4.2. Pruning Performance Evaluation
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wan, C.; Wang, Y.; Xu, J.; Wu, J.; Zhang, T.; Wang, Y. Research on Privacy Protection in Federated Learning Combining Distillation Defense and Blockchain. Electronics 2024, 13, 679. [Google Scholar] [CrossRef]
- Ma, X.; Zhu, J.; Lin, Z.; Chen, S.; Qin, Y. A state-of-the-art survey on solving non-IID data in Federated Learning. Future Gener. Comput. Syst. 2022, 135, 244–258. [Google Scholar] [CrossRef]
- Skianis, K.; Giannopoulos, A.; Gkonis, P.; Trakadas, P. Data Aging Matters: Federated Learning-Based Consumption Prediction in Smart Homes via Age-Based Model Weighting. Electronics 2023, 12, 3054. [Google Scholar] [CrossRef]
- Zheng, M.; Yang, Y. Personalized federated learning algorithm based on mutual information and soft clustering. Comput. Eng. 2023, 49, 20–28. [Google Scholar]
- Qiu, T.; Zheng, X.; Zhu, Y.; Feng, S. Federated learning architecture for non-IID data. Comput. Eng. 2023, 49, 110–117. [Google Scholar]
- Yu, M.; Zheng, Z.; Li, Q.; Wu, F.; Zheng, J. A Comprehensive Study on Personalized Federated Learning with Non-IID Data. In Proceedings of the2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/ SocialCom/SustainCom), Melbourne, Australia, 17–19 December 2022; pp. 40–49. [Google Scholar]
- Zhu, Z.; Hong, J.; Zhou, J. Data-free knowledge distillation for heterogeneous federated learning. In Proceedings of the 38th International Conference on Machine Learning, Virtual Event, 18–24 July 2021. [Google Scholar]
- Zhang, L.; Shen, L.; Ding, L.; Tao, D.; Duan, L.Y. Fine-tuning global model via data-free knowledge distillation for non-iid federated learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022. [Google Scholar]
- Zhang, J.; Shi, Y. A Personalized Federated Learning Method Based on Clustering and Knowledge Distillation. Electronics 2024, 13, 857. [Google Scholar] [CrossRef]
- Zhan, Z.; Zhang, X. Computation-Effective Personalized Federated Learning: A Meta Learning Approach. In Proceedings of the 2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS), Hong Kong, China, 18–21 July 2023; pp. 957–958. [Google Scholar] [CrossRef]
- Han, S.; Liu, X.; Mao, H.; Pu, J.; Pedram, A.; Horowitz, M.A.; Dally, W.J. Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network. arXiv 2023, arXiv:2306.09552. [Google Scholar]
- Zhang, Z.; Jin, J.; Zhang, Z.; Zhou, Y.; Zhao, X.; Ren, J.; Liu, J.; Wu, L.; Jin, R.; Dou, D. Validating the lottery ticket hypothesis with inertial manifold theory. Adv. Neural Inf. Process. Syst. (Neurips) 2021, 34, 30196–30210. [Google Scholar]
- Lin, M.; Ji, R.; Wang, Y.; Zhang, Y.; Zhang, B.; Tian, Y.; Shao, L. Hrank: Filter pruning using high-rank feature map. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 1529–1538. [Google Scholar]
- Hayou, S.; Ton, J.-F.; Doucet, A.; Teh, Y.W. Robust pruning at initialization. In Proceedings of the 9th International Conference on Learning Representations, ICLR 2021, Online, 4–7 May 2021. Conference Track Proceedings, 2021. [Google Scholar]
- Frankle, J.; Carbin, M. The lottery ticket hypothesis: Finding sparse, trainable neural networks. arXiv 2018, arXiv:1803.03635. [Google Scholar]
- Jiang, Y.; Wang, S.; Valls, V.; Ko, B.J.; Lee, W.H.; Leung, K.K.; Tassiulas, L. Model pruning enables efficient federated learning on edge devices. IEEE Trans. Neural Netw. Learn. Syst. 2022, 34, 10374–10386. [Google Scholar] [CrossRef] [PubMed]
- McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B.A. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA, 20–22 April 2017; pp. 1273–1282. [Google Scholar]
- Zhao, Y.; Li, M.; Lai, L.; Suda, N.; Civin, D.; Chandra, V. Federated learning with non-iid data. arXiv 2018, arXiv:1806.00582. [Google Scholar] [CrossRef]
- Ye, M.; Fang, X.; Du, B.; Yuen, P.C.; Tao, D. Heterogeneous federated learning: State-of-the-art and research challenges. Acm Comput. Surv. 2023, 56, 1–44. [Google Scholar] [CrossRef]
- Chen, H.; Frikha, A.; Krompass, D.; Gu, J.; Tresp, V. FRAug: Tackling federated learning with Non-IID features via representation augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision 2023, Paris, France, 2–6 October 2023; pp. 4849–4859. [Google Scholar]
- Zhang, H.; Liu, J.; Jia, J.; Zhou, Y.; Dai, H.; Dou, D. Fedduap: Federated learning with dynamic update and adaptive pruning using shared data on the server. arXiv 2022, arXiv:2204.11536. [Google Scholar]
- Jiang, Z.; Xu, Y.; Xu, H.; Wang, Z.; Qiao, C.; Zhao, Y. Fedmp: Federated learning through adaptive model pruning in heterogeneous edge computing. In Proceedings of the 2022 IEEE 38th International Conference on Data Engineering (ICDE), Kuala Lumpur, Malaysia, 9–12 May 2022; pp. 767–779. [Google Scholar]
- Vahidian, S.; Morafah, M.; Lin, B. Personalized federated learning by structured and unstructured pruning under data heterogeneity. In Proceedings of the 2021 IEEE 41st international conference on distributed computing systems workshops (ICDCSW), Washington, DC, USA, 7–10 July 2021; pp. 27–34. [Google Scholar]
- M Ghari, P.; Shen, Y. Personalized Online Federated Learning with Multiple Kernels. Adv. Neural Inf. Process. Syst. (Neurips) 2022, 35, 33316–33329. [Google Scholar]
- Liu, B.; Zhang, Z.; He, P.; Wang, Z.; Xiao, Y.; Ye, R.; Zhou, Y.; Ku, W.S.; Hui, B. A survey of lottery ticket hypothesis. arXiv 2024, arXiv:2403.04861. [Google Scholar]
- You, H.; Li, C.; Xu, P.; Fu, Y.; Wang, Y.; Chen, X.; Baraniuk, R.G.; Wang, Z.; Lin, Y. Drawing early-bird tickets: Towards more efficient training of deep networks. arXiv 2019, arXiv:1909.11957. [Google Scholar]
- Kim, Y.; Li, Y.; Park, H.; Venkatesha, Y.; Yin, R.; Panda, P. Exploring lottery ticket hypothesis in spiking neural networks. In European Conference on Computer Vision; Springer Nature: Cham, Switzerland, 2022; pp. 102–120. [Google Scholar]
- Lee, N.; Ajanthan, T.; Torr, P.H. Snip: Single-shot network pruning based on connection sensitivity. arXiv 2018, arXiv:1810.02340. [Google Scholar]
- Huang, H.; Zhang, L.; Sun, C.; Fang, R.; Yuan, X.; Wu, D. Distributed pruning towards tiny neural networks in federated learning. In Proceedings of the 2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS), Hong Kong, China, 18–21 July 2023; pp. 190–201. [Google Scholar]
- Jiang, X.; Borcea, C. Complement sparsification: Low-overhead model pruning for federated learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 8087–8095. [Google Scholar]
- Deng, Y.; Chen, W.; Ren, J.; Lyu, F.; Liu, Y.; Liu, Y.; Zhang, Y. Tailorfl: Dual-personalized federated learning under system and data heterogeneity. In Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, Boston, MA, USA, 6–9 November 2022; pp. 592–606. [Google Scholar]













| Notation | Definition | 
|---|---|
| ⊙ | Hadamard product | 
| ⊕ | binary OR operation | 
| client index, total number of clients | |
| global training round, global epochs | |
| local model for client n | |
| server model to client in k-th iteration | |
| local dataset for client n | |
| param. weight for client n in k-th iteration | |
| param. mask for client n in k-th iteration | |
| proportion of each client in the aggregation | |
| mask/weight filtering threshold | |
| param. gradient for client n in k-th iteration | |
| index of a particular gradient | |
| function to calculate sparse model density | |
| loss function | 
| Method | Rounds | Retain Pct. | Time Cost | Final Pct. | 
|---|---|---|---|---|
| PruneFL [16] | 16 | 8.6% | 306.2 s | 13.4% | 
| Threshold Pruning (ours) | 1 | 71.7% | 39.7 s | 19.1% | 
| Layerwise Pruning (ours) | 2 | 15.7% | 72.3 s | 3.7% | 
| Methods | acc | Time for 70% acc/s | Time for 80% acc/s | Convergence Time/s | Time for 10,000 r/s | Model Retention Ratio | Model Size | Client Memory Usage | GPU Memory Usage | 
|---|---|---|---|---|---|---|---|---|---|
| Layerwise | 83.0% | 549.97 | 2210.78 | 5161.72 | 13,816.69 | 3.7% | 1760 KB | 3.083 GB | 1117 MB | 
| PruneFL [16] | 82.5% | 747.79 | 3396.00 | 9277.19 | 23,766.16 | 8.6% | 3024 KB | 3.121 GB | 1323 MB | 
| SNIP [28] | 84.0% | 1419.31 | 4468.36 | 9654.30 | 15,099.10 | 3.7% | 1760 KB | \ | 7619 MB | 
| Iterative | 84.3% | 617.71 | 2036.17 | 9171.22 | 15,332.01 | 3.7% | 1760 KB | \ | 7075 MB | 
| Conventional | 82.8% | 909.75 | 3521.44 | 4586.73 | 12,959.22 | 8.6% | 3024 KB | 2.936 GB | 1117 MB | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, Y.; Duan, G.; Qiu, T.; Zhang, L.; Tian, L.; Zheng, X.; Zhu, Y. Personalized Federated Learning Incorporating Adaptive Model Pruning at the Edge. Electronics 2024, 13, 1738. https://doi.org/10.3390/electronics13091738
Zhou Y, Duan G, Qiu T, Zhang L, Tian L, Zheng X, Zhu Y. Personalized Federated Learning Incorporating Adaptive Model Pruning at the Edge. Electronics. 2024; 13(9):1738. https://doi.org/10.3390/electronics13091738
Chicago/Turabian StyleZhou, Yueying, Gaoxiang Duan, Tianchen Qiu, Lin Zhang, Li Tian, Xiaoying Zheng, and Yongxin Zhu. 2024. "Personalized Federated Learning Incorporating Adaptive Model Pruning at the Edge" Electronics 13, no. 9: 1738. https://doi.org/10.3390/electronics13091738
APA StyleZhou, Y., Duan, G., Qiu, T., Zhang, L., Tian, L., Zheng, X., & Zhu, Y. (2024). Personalized Federated Learning Incorporating Adaptive Model Pruning at the Edge. Electronics, 13(9), 1738. https://doi.org/10.3390/electronics13091738
 
        


 
       