Article

Utility–Leakage Trade-Off for Federated Representation Learning

1 School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
2 Lehrstuhl für Nachrichtentechnik, Technical University Dortmund, 44227 Dortmund, Germany
3 Information Theory and Security Laboratory (ITSL), Linköping University, 581 83 Linköping, Sweden
* Author to whom correspondence should be addressed.
Entropy 2025, 27(11), 1163; https://doi.org/10.3390/e27111163
Submission received: 12 October 2025 / Revised: 13 November 2025 / Accepted: 13 November 2025 / Published: 15 November 2025
(This article belongs to the Special Issue Information-Theoretic Approaches for Machine Learning and AI)

Abstract

Federated representation learning (FRL) is a promising technique for learning shared data representations that capture general features across decentralized clients without sharing raw data. However, there is a risk of sensitive information leakage from the learned representations. The conventional differential privacy (DP) mechanism protects the whole data by randomization (adding noise or randomized response) at the cost of deteriorated learning performance. Inspired by the fact that some data information may be public or non-private and only sensitive information (e.g., race) should be protected, we investigate information-theoretic protection of specific sensitive information for FRL. To characterize the trade-off between utility and sensitive information leakage, we adopt mutual information-based metrics to measure utility and sensitive information leakage, and propose a method that maximizes utility performance while restricting sensitive information leakage to below any positive value ϵ via a local DP mechanism. Simulations demonstrate that our scheme achieves the best utility–leakage trade-off among baseline schemes and, more importantly, can adjust the trade-off between leakage and utility by controlling the noise level in local DP.

1. Introduction

The rapid advancement of machine learning has exposed significant practical challenges in traditional approaches, particularly as data volumes generated by edge devices—such as smartphones and IoT sensors—continue to grow exponentially. Conventional machine learning methods, which rely on the centralized aggregation of raw datasets, increasingly struggle with scalability, communication bottlenecks, and operational inefficiency. In response to these limitations, federated learning (FL) [1] has emerged as a pivotal framework addressing inherent weaknesses in traditional machine learning. By avoiding the need to collect raw data in a central location, FL not only mitigates critical privacy concerns and reduces communication overhead but also helps comply with stringent regulatory constraints.
In FL, multiple edge clients jointly train models by computing local model parameters or gradients and sharing them with a central server for aggregation. To improve the generalization capabilities and support various machine learning tasks (like classification or recognition), federated representation learning (FRL) combines the principles of FL and representation learning. It captures the underlying structure of the data and learns robust representations.
Unfortunately, the extracted representations may pose a potential risk of sensitive information leakage. Specifically, if these representations inadvertently encode sensitive attributes—such as demographic or health information—they can become a proxy for reconstructing them, thereby serving as a primary vector for privacy violations. As a fundamental principle of machine learning, privacy seeks to safeguard private and sensitive information—such as patient healthcare records, social network addresses, and political affiliations—throughout the entire life cycle of a model, from training to deployment. This work addresses privacy risks arising from model outputs during the deployment phase, which stem from the leakage of sensitive information through the released data. To protect privacy, differential privacy (DP) [2,3,4,5,6,7] is the most popular context-free notion of privacy; its key idea is to add noise to the output or randomize the data to conceal private information before sharing. However, DP can negatively impact model performance and does not provide any guarantee on the average or maximum information leakage [8]. Moreover, in many scenarios, some data attributes are public or non-private, and only sensitive information (e.g., race and gender) needs to be protected. For example, in AI recommendation applications, some users might share their favorite restaurants to receive tailored recommendations, while others prioritize keeping such information private and avoid sharing it. Information-theoretic (IT) privacy focuses on designing mechanisms and metrics that use information-theoretic quantities (such as f-divergences and mutual information) to quantify the trade-off between privacy and utility, i.e., how much information an adversary can infer about private features from released data. Building upon this theoretical foundation, subsequent research has focused on adapting and applying these principles to privately disclose useful information. As an early exploration, ref. [9] proposed a mutual information-based privacy metric specifically designed for testing the effectiveness of privacy-preserving data mining algorithms. From a different perspective, the seminal work [10] addresses the fundamental problem of disclosing useful information under a perfect privacy constraint for a variable S. It establishes a necessary and sufficient condition: non-trivial disclosure is possible if and only if the smallest principal inertia component of the joint distribution (S, X) is zero. Furthermore, the work derives tight bounds for this trade-off and provides explicit constructions of privacy-assuring mappings that achieve these bounds. From a more practical perspective, ref. [11] pursued a data-driven framework for optimizing privacy-preserving data release mechanisms to attain the information-theoretically optimal privacy–utility trade-off; an adversarially trained neural network is introduced to implement randomized mechanisms and to perform a variational approximation of mutual information privacy. Moreover, the privacy–utility trade-off in data release under a rate constraint is investigated in [12], which can be considered a generalization of both the information bottleneck and privacy funnel problems; that work also establishes a necessary and sufficient condition for the existence of positive utility under perfect privacy.
Furthermore, a general family of optimization problems, termed complexity-leakage-utility bottleneck (CLUB), is introduced in [13], which provides a unified theoretical framework that generalizes and unifies a wide spectrum of information-theoretic privacy models.
In this paper, we consider a statistical framework where mutual information serves as the metric for both utility and leakage [10,13,14,15]. Consider a utility–leakage problem with the Markov chain $(Y,S) - X - Z$, where X denotes the random variable of the input data, Y represents the target objective (e.g., the label), S is the sensitive information to be protected, and Z denotes the extracted representation. Information utility and leakage are measured by $I(Z;Y)$ and $I(S;Z)$, respectively. It is worth mentioning that all the aforementioned works focused on classical representation learning rather than federated representation learning. It is unclear whether their schemes still guarantee sensitive information protection once local model updates, global aggregation, and the complicating factor of data heterogeneity are taken into account.
In this paper, we propose a leakage-restrained federated learning framework that theoretically guarantees the protection of sensitive information by applying an ϵ-local DP (LDP) mechanism to the extracted representation. This upper bounds the sensitive information leakage as $I(S;Z) \le \epsilon$, regardless of local model updates, global aggregation, or data heterogeneity. The proof follows by using the data processing inequality and leveraging the connection between ϵ-LDP and mutual information. Furthermore, leveraging the upper bound on the maximum information leakage, we propose a simple yet efficient training loss function that minimizes the conditional uncertainty $H(X|Z,S)$, offering a simpler alternative to directly minimizing $I(S;Z)$ (which is intractable for most high-dimensional data sources). Simulation results demonstrate that our scheme achieves the best utility–leakage trade-off among baseline schemes and, more importantly, can tune the trade-off between leakage and utility by controlling the noise level in local DP.
Notations: We denote random variables by capital letters and their realizations by lowercase letters. The probability distribution of a random variable X is denoted by $P_X$ and its probability density function by $p_X(x)$. Given a finite set $\mathcal{S}$, $|\mathcal{S}|$ denotes its cardinality. We may drop the capital-letter subscript when it is clear from the context (e.g., $p_X(x) = p(x)$), and use a subscript to emphasize the dependence of a measure on the choice of distribution parameterization (e.g., $p_\phi(\hat{z}|x)$). The expectation is denoted by $\mathbb{E}[\cdot]$. The Shannon entropy and mutual information are denoted by $H(X)$ and $I(X;Y)$, respectively.

2. Problem Formulation

2.1. FRL with Sensitive Attribute

Consider an FRL system consisting of one central server and K devices indexed by $\mathcal{K} = \{1, 2, \ldots, K\}$, as shown in Figure 1.
Each device $k \in \mathcal{K}$ has a local training dataset $\mathcal{D}_k$ with $|\mathcal{D}_k|$ data samples $(x, y, s) \in \mathcal{D}_k$, where x is the observed data sample, y is the corresponding utility attribute, and s is the sensitive attribute to be protected (e.g., gender). The entire dataset is denoted by $\mathcal{D} = \cup_{k \in \mathcal{K}} \mathcal{D}_k$.
Let $l(f(x;\omega), s, y)$ denote the sample-wise loss function on data sample $(x, y, s)$, where $f(\cdot;\omega)$ is the model f parameterized by $\omega$. The model f can be decomposed as $f(x;\omega) = \phi(\psi(x)) = \phi(z)$, where $\psi$ is the encoder, $z = \psi(x)$ is the representation vector, and $\phi$ is the decoder.
The local loss function of device k is given by $L_k(\omega) = \frac{1}{|\mathcal{D}_k|} \sum_{(x,y,s) \in \mathcal{D}_k} l(f(x;\omega), s, y)$. Accordingly, the global loss function is given by $L(\omega) = \sum_{k=1}^{K} \frac{|\mathcal{D}_k|}{|\mathcal{D}|} L_k(\omega)$. The objective of the federated learning system is to train a global model $f(\cdot;\omega)$ that minimizes the global loss, i.e., $\min_{\omega} L(\omega)$.
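For concreteness, a minimal Python sketch of this weighted global loss is shown below; `client_datasets` and `local_loss` are illustrative placeholder names, not from the paper.

```python
# Minimal sketch of the global loss L(omega) = sum_k (|D_k| / |D|) * L_k(omega).
# `client_datasets` (list of local datasets) and `local_loss` (a callable
# computing L_k) are illustrative placeholders.

def global_loss(omega, client_datasets, local_loss):
    """Weighted average of the per-client empirical losses."""
    total = sum(len(d) for d in client_datasets)      # |D|
    loss = 0.0
    for d in client_datasets:
        weight = len(d) / total                       # |D_k| / |D|
        loss += weight * local_loss(omega, d)         # L_k(omega)
    return loss
```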

2.2. Sensitive Information Leakage–Utility Model

Given input X, its representation $Z = \psi(X)$, and its ground-truth label Y, we define the utility metric as the mutual information between Z and Y:
Utility: $I(Z;Y) = H(Y) - H(Y|Z)$.  (1)
A higher value of $I(Z;Y)$ indicates that the representation contains more useful information about the target label Y. When $H(Y|Z) = 0$, the representation can perfectly estimate Y.
Given sensitive information S and extracted representation Z, we define the sensitive information leakage as
Sensitive information leakage: $I(S;Z)$.  (2)
The definition in (2) characterizes the amount of information contained in the representation about the sensitive attribute S. Additionally, by the data processing inequality, any operation on Z leaks no more sensitive information than $I(S;Z)$.
With the definitions above, we formally define the theoretical guarantee of sensitive information leakage as follows.
Definition 1. 
Given any positive value ϵ, an FRL system with sensitive attribute S and representation Z is said to satisfy ϵ-sensitive information leakage if $I(S;Z) \le \epsilon$.
Our goal is to design an FRL method that maximizes the information useful for utility while keeping the system within the sensitive information leakage restriction, i.e., ensuring the ϵ-sensitive information leakage guarantee. The optimization problem can be formulated as follows:
$\max_{p(z|x)} I(Y;Z) \quad \mathrm{s.t.} \quad I(S;Z) \le \epsilon$.  (3)
Remark 1. 
When the sensitive attribute S is associated with fairness attributes such as gender, race, etc., problem (3) can be viewed as a fairness problem in which the predictions should be unbiased across different groups. This formulation aligns with the established works [16,17,18,19].

3. Leakage-Restrained Federated Representation Learning

3.1. Proposed FRL Framework

Directly solving the optimization problem in (3) is infeasible, mainly because simultaneously maximizing $I(Y;Z)$ and minimizing $I(S;Z)$ via one encoder is challenging, and also because the joint distribution $p(s,y,z)$ is hard to obtain. To address this issue, we instead minimize an upper bound on $I(S;Z)$ whose computation does not explicitly require $p(s,z)$.
Rewrite the mutual information $I(S;Z)$ as follows:
$I(X,S;Z) = I(X;Z) + I(S;Z|X) \overset{(a)}{=} I(X;Z)$,  (4)
$I(X,S;Z) = I(S;Z) + I(X;Z|S)$,  (5)
where (a) holds due to the Markov chain $(Y,S) - X - Z$. From (4) and (5), we have
$I(S;Z) = I(X;Z) - I(X;Z|S)$.  (6)
By the data processing inequality and the Markov chain $(Y,S) - X - Z$, we have $I(S;Z) \le I(X;Z)$. With the observation (6), we aim to upper bound $I(X;Z)$ by ϵ,
$I(X;Z) \le \epsilon$,  (7)
which brings us two important advantages.
  • According to Definition 1, $I(S;Z) \le I(X;Z) \le \epsilon$ ensures that the system satisfies the ϵ-sensitive information leakage guarantee.
  • If $I(X;Z) \le \epsilon$, then
    $I(S;Z) = I(X;Z) - I(X;Z|S) \le \epsilon - I(X;Z|S)$.  (8)
    This enables us to minimize the upper bound $\epsilon - I(X;Z|S)$ on $I(S;Z)$ as follows:
    $\min\ \epsilon - I(X;Z|S) \;\Leftrightarrow\; \min\ \epsilon - \big(H(X|S) - H(X|Z,S)\big) \;\overset{(a)}{\Leftrightarrow}\; \min\ H(X|Z,S)$,  (9)
where (a) holds since $H(X|S)$ is a constant given the dataset. Minimizing $H(X|Z,S)$ can be easily achieved by constructing a decoder that recovers X from the representation Z and S.
To achieve $I(X;Z) \le \epsilon$, we adopt the ϵ-LDP mechanism [20], denoted by $M_\epsilon(\cdot)$, which is defined as follows:
Definition 2. 
For any $\epsilon > 0$, a randomized mechanism $M_\epsilon(x)$ satisfies ϵ-LDP if and only if, for every $x, x' \in \mathcal{X}$ and any measurable set $C \subseteq \mathcal{W}$, where $\mathcal{W}$ denotes the output space of the mechanism, it holds that
$\frac{\Pr[M_\epsilon(X) \in C \mid X = x]}{\Pr[M_\epsilon(X) \in C \mid X = x']} \le e^{\epsilon}$.  (10)
Unlike conventional private FL schemes that add random noise to the local parameters $\omega_k$, we place the ϵ-LDP mechanism after the output of the parameterized feature extractor, denoted by $\tilde{\psi}(\cdot)$. Thus, the processing in the model can be represented as a Markov chain
$(Y,S) - X \xrightarrow{\tilde{\psi}(\cdot)} V \xrightarrow{M_\epsilon(\cdot)} Z$,
where $V = \tilde{\psi}(X)$ and $Z = \psi(X) = M_\epsilon(\tilde{\psi}(X))$. With the ϵ-LDP mechanism applied to the extracted feature V, we can guarantee ϵ-sensitive information leakage and achieve $I(X;Z) \le I(V;Z) \le \epsilon$; the detailed proof is provided in Section 3.2.
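Table 1 later lists a Laplacian mechanism as the ϵ-LDP mechanism. A minimal sketch of such a mechanism on a feature vector is given below; the per-coordinate clipping bound and the resulting L1 sensitivity are assumptions of this sketch, as the paper does not specify them.

```python
import numpy as np

def laplace_ldp(v, epsilon, clip=1.0, rng=None):
    """Obfuscate a feature vector v with a Laplace mechanism satisfying eps-LDP.

    Assumptions (not specified in the paper): each coordinate is clipped to
    [-clip, clip], so the L1 sensitivity is 2 * clip * dim, and i.i.d. Laplace
    noise with scale sensitivity / epsilon is added to every coordinate.
    """
    rng = rng or np.random.default_rng()
    v = np.clip(np.asarray(v, dtype=float), -clip, clip)
    sensitivity = 2.0 * clip * v.size
    return v + rng.laplace(loc=0.0, scale=sensitivity / epsilon, size=v.shape)

# Example: obfuscate a 2-dimensional representation with epsilon = 4.
z = laplace_ldp(np.array([0.3, -0.7]), epsilon=4.0)
```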
From (7), (8), and the optimization problem in (3), we parameterize the encoding distribution $p(z|x)$ and introduce the following objective function $\mathcal{L}(p_\theta(z|x), \beta)$:
$\min_{p_\theta(z|x)} \; -I(Z;Y) + \beta H(X|S,Z)$.  (11)
Let $q_{\theta_1}(y|z)$ be a parameterized variational approximation of $p(y|z)$, and $q_{\theta_2}(x|z,s)$ be a parameterized variational approximation of $p(x|z,s)$. A variational upper bound of (11) can be obtained as follows:
$-I(Z;Y) + \beta H(X|S,Z) = -H(Y) + H(Y|Z) + \beta H(X|S,Z) \le \mathbb{E}_{p_\theta(x,y,s,z)}\big[-\log p(y|z) - \beta \log p(x|s,z)\big] \overset{(a)}{=} \mathbb{E}_{p(x,y,s)p_\theta(z|x)}\big[-\log p(y|z) - \beta \log p(x|s,z)\big] \overset{(b)}{\le} \mathbb{E}_{p(x,y,s)p_\theta(z|x)}\big[-\log q_{\theta_1}(y|z) - \beta \log p(x|s,z)\big] \overset{(c)}{\le} \mathbb{E}_{p(x,y,s)p_\theta(z|x)}\big[-\log q_{\theta_1}(y|z) - \beta \log q_{\theta_2}(x|s,z)\big]$,  (12)
where (a) follows from the Markov chain $(Y,S) - X - Z$, (b) holds since $D_{\mathrm{KL}}\big(p(y|z)\,\|\,q_{\theta_1}(y|z)\big) \ge 0$, and (c) holds since $D_{\mathrm{KL}}\big(p(x|s,z)\,\|\,q_{\theta_2}(x|s,z)\big) \ge 0$.
Given N data points $\{x^{(i)}\}_{i=1}^{N}$, as well as the corresponding samples of the utility and sensitive variables $\{y^{(i)}, s^{(i)}\}_{i=1}^{N}$, we form the Monte Carlo estimate of (12) by sampling M realizations $\{z^{(i,j)}\}_{j=1}^{M}$ of the representation z from $p_\theta(z|x)$ for each data point $x^{(i)}$. The Monte Carlo approximation of (12), denoted $\mathcal{L}(p_\theta(z|x), \beta, q_{\theta_1}(y|z), q_{\theta_2}(x|s,z))$, is
$\mathcal{L}\big(p_\theta(z|x), \beta, q_{\theta_1}(y|z), q_{\theta_2}(x|s,z)\big) = \frac{1}{N} \sum_{i=1}^{N} \frac{1}{M} \sum_{j=1}^{M} \big[-\log q_{\theta_1}(y^{(i)}|z^{(i,j)}) - \beta \log q_{\theta_2}(x^{(i)}|z^{(i,j)}, s^{(i)})\big]$.  (13)
The loss function $L_k$ for client k with local dataset $\mathcal{D}_k$ is
$L_k = \frac{1}{|\mathcal{D}_k|} \sum_{(x,y,s) \in \mathcal{D}_k} \big[-\log q_{\theta_1^k}(y|z) - \beta \log q_{\theta_2^k}(x|z,s)\big]$,  (14)
where $\theta_1^k$ and $\theta_2^k$ are the parameters of the utility decoder and the side decoder, respectively.
Based on our proposed loss function (14), we have designed the local learning network at client k, as shown in Figure 2. The proposed framework consists of four modules:
  • The feature extractor $\tilde{\psi}_k(x) = p_{\theta^k}(v|x)$, parameterized by $\theta^k$, encodes the original data x into the feature vector v.
  • The ϵ-LDP mechanism $M_\epsilon(\cdot)$ maps the feature v to an obfuscated representation z.
  • The utility decoder takes the representation z as input and predicts the utility variable as $\hat{y}$.
  • The side decoder takes both the representation z and the sensitive attribute s as inputs to reconstruct the input data as $\hat{x}$.
We then instantiate the proposed loss function (14) by choosing the following:
  • $q_{\theta_1^k}(y|z) \sim \mathcal{B}(\hat{y})$ (Bernoulli)
  • $q_{\theta_2^k}(x|z,s) \sim \mathcal{N}(\hat{x}, 1)$ (Gaussian)
The resulting loss function of client k in our optimization becomes
$L_k = \frac{1}{|\mathcal{D}_k|} \sum_{(x,y,s) \in \mathcal{D}_k} \big[l_e(y, \hat{y}) + \beta\, l_m(x, \hat{x})\big]$,  (15)
where $l_e$ denotes the cross-entropy loss and $l_m$ denotes the mean squared error (MSE) loss. The complete training framework is outlined in Algorithm 1.
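Before turning to Algorithm 1, the following minimal PyTorch sketch illustrates how the four modules and loss (15) fit together at a single client. `ClientModel` and `client_loss` are illustrative names, not from the paper, and the clipping bound and Laplace scale inside forward() follow the same assumptions as the LDP sketch above.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ClientModel(nn.Module):
    """Sketch of the client modules in Figure 2; layer sizes follow Table 1."""

    def __init__(self, d_in, d_z, d_s, epsilon, clip=1.0):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(d_in, 100), nn.ReLU(), nn.Linear(100, d_z))
        self.utility_decoder = nn.Sequential(nn.Linear(d_z, 100), nn.ReLU(), nn.Linear(100, 2))
        self.side_decoder = nn.Sequential(nn.Linear(d_z + d_s, 100), nn.ReLU(), nn.Linear(100, d_in))
        self.epsilon, self.clip, self.d_z = epsilon, clip, d_z

    def forward(self, x, s):
        v = self.encoder(x)
        v = torch.clamp(v, -self.clip, self.clip)            # bound the sensitivity (assumption)
        scale = 2.0 * self.clip * self.d_z / self.epsilon    # Laplace scale for eps-LDP (assumption)
        z = v + torch.distributions.Laplace(0.0, scale).sample(v.shape)
        y_logits = self.utility_decoder(z)                   # predicts y_hat from z
        x_hat = self.side_decoder(torch.cat([z, s], dim=1))  # reconstructs x from (z, s)
        return y_logits, x_hat

def client_loss(model, x, y, s, beta):
    """Loss (15): cross-entropy on the utility decoder plus beta * MSE on the side decoder.
    y is expected as a tensor of integer class labels."""
    y_logits, x_hat = model(x, s)
    return F.cross_entropy(y_logits, y) + beta * F.mse_loss(x_hat, x)
```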
Algorithm 1 FRL with sensitive information protection.
Input: Global update rounds T, K clients, client datasets $\{\mathcal{D}_1, \mathcal{D}_2, \ldots, \mathcal{D}_K\}$, learning rate η, initial model parameters $\omega^0 = \{\theta^0, \theta_1^0, \theta_2^0\}$
Output: Aggregated model parameters $\omega^T = \{\theta^T, \theta_1^T, \theta_2^T\}$
for $t = 0, 1, 2, \ldots, T-1$ do
    Server executes:
        Broadcast parameters $\omega^t$
        Receive $\{\omega_k^{t+1}\}_{k=1}^{K}$ from the K clients
        $\omega^{t+1} \leftarrow \frac{1}{K} \sum_{k=1}^{K} \omega_k^{t+1}$
    Client k executes:   // update local $\omega_k^{t+1} = \{\theta_k^{t+1}, \theta_{1k}^{t+1}, \theta_{2k}^{t+1}\}$
        Receive global model parameters $\omega^t$
        Initialize local parameters $\omega_k^{t+1} \leftarrow \omega^t$
        for each $(x, y, s) \in \mathcal{D}_k$ do
            $v \leftarrow$ FeatureExtractor $p_{\theta_k^{t+1}}(v|x)$
            $z \leftarrow$ ϵ-LDP mechanism $M_\epsilon(v)$
            $\hat{x} \leftarrow$ SideDecoder$(z, s; \theta_{2k}^{t+1})$
            $\hat{y} \leftarrow$ UtilityDecoder$(z; \theta_{1k}^{t+1})$
            Compute $L_k$ from (15); $\omega_k^{t+1} \leftarrow \omega_k^{t+1} - \eta \nabla L_k$
        end for
        Return $\omega_k^{t+1}$ to the server
end for
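Under the same assumptions, one communication round of Algorithm 1 can be sketched as follows, reusing `client_loss` from the sketch above; simple averaging matches the server step $\omega^{t+1} \leftarrow \frac{1}{K}\sum_{k} \omega_k^{t+1}$.

```python
import copy
import torch

def federated_round(global_model, client_loaders, beta, lr):
    """One communication round in the spirit of Algorithm 1: broadcast the global
    parameters, run local updates on each client, and average the returned models.
    `client_loaders` is a list of iterables yielding (x, y, s) batches, and the
    ClientModel / client_loss sketches above are assumed."""
    client_states = []
    for loader in client_loaders:
        local = copy.deepcopy(global_model)                  # client receives omega^t
        opt = torch.optim.SGD(local.parameters(), lr=lr)
        for x, y, s in loader:                               # local gradient steps
            opt.zero_grad()
            client_loss(local, x, y, s, beta).backward()
            opt.step()
        client_states.append(local.state_dict())             # return omega_k^{t+1}

    avg = copy.deepcopy(client_states[0])                    # server aggregation
    for key in avg:
        avg[key] = torch.stack([st[key].float() for st in client_states]).mean(dim=0)
    global_model.load_state_dict(avg)
    return global_model
```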

3.2. Guarantee of Sensitive Information Protection

The following theorem shows that our framework with the ϵ-LDP mechanism on the extracted feature V guarantees ϵ-sensitive information leakage and achieves $I(X;Z) \le \epsilon$.
Theorem 1. 
Consider the FRL framework in Section 3 with sensitive attribute S, feature extractor $\tilde{\psi}: \mathcal{X} \to \mathcal{V}$, and ϵ-LDP mechanism $M_\epsilon(\cdot): \mathcal{V} \to \mathcal{Z}$. The representation $Z = \psi(X) = M_\epsilon(V)$, where $V = \tilde{\psi}(X)$, satisfies
$I(X;Z) \le I(Z;V) \le \epsilon$,  (16)
$I(S;Z) \le \epsilon - I(X;Z|S) \le \epsilon$.  (17)
Proof. 
From the definition of ϵ-LDP, we obtain that, for all $v, v' \in \mathcal{V}$,
$\frac{p(M_\epsilon(V) = z \mid V = v)}{p(M_\epsilon(V) = z \mid V = v')} \le e^{\epsilon}$.  (18)
Then
$p(z) = \mathbb{E}_{V' \sim p(v')}\big[p(z|v')\big] = \mathbb{E}_{V' \sim p(v')}\big[p(M_\epsilon(V) = z \mid V = v')\big] \ge \mathbb{E}_{V' \sim p(v')}\big[p(M_\epsilon(V) = z \mid V = v)\, e^{-\epsilon}\big] = p(z|v)\, e^{-\epsilon}, \quad \forall z \in \mathcal{Z},\ v, v' \in \mathcal{V}$.  (19)
Thus, the mutual information $I(Z;V)$ can be bounded as
$I(Z;V) = \mathbb{E}_{p(z,v)}\left[\log \frac{p(z,v)}{p(z)p(v)}\right]$  (20)
$\le \mathbb{E}_{p(z,v)}\left[\log \frac{p(z|v)}{p(z|v)\, e^{-\epsilon}}\right] = \epsilon$,  (21)
where the inequality follows from (19). With the Markov chain $(Y,S) - X - V - Z$ and (21), the data processing inequality gives $I(X;Z) \le I(V;Z) \le \epsilon$.
From (6) and (16), we have $I(S;Z) \le \epsilon - I(X;Z|S)$. This completes the proof of Theorem 1. □
Remark 2. 
The proof requires no assumptions on the data distributions or the model update method, indicating that Theorem 1 always holds regardless of data heterogeneity. Since the ϵ-LDP mechanism $M_\epsilon(\cdot): \mathcal{V} \to \mathcal{Z}$ is always present in both the local and global models, the theoretical guarantee $I(S;Z) \le \epsilon$ holds for both local and global updates.

4. Simulation Results

In this section, we first present the simulation setting for the FL environment, datasets, simulation metrics, and baselines, and then evaluate the performance of the proposed framework.
Datasets: We perform simulations on two real-world datasets: the income-prediction Adult dataset from the UCI Machine Learning Repository [21] and the ProPublica COMPAS dataset [22]. The COMPAS dataset comprises 4320 training samples and 1852 testing samples, each with 11 variables indicating race, age, sex, among others. The Adult dataset comprises 32,561 training samples and 16,281 testing samples, each with 14 variables indicating age, workclass, sex, and more.
FL environment: We consider an FRL system with 20 clients and one server. For the COMPAS dataset, each client samples 1500 data samples, and we select race as the sensitive attribute S and the recidivism outcome as the utility attribute Y. For the Adult dataset, each client samples 6000 data samples, and we choose gender as the sensitive attribute S and income as the utility attribute Y (predicting whether an individual earns more than 50K per year).
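One plausible way to set up these per-client datasets is sketched below; the sampling strategy (independent uniform draws per client) is an assumption, as the paper does not describe it.

```python
import numpy as np

def sample_client_datasets(X, Y, S, num_clients=20, samples_per_client=1500, seed=0):
    """Sketch of the FL data setup: each client draws its own random subset of the
    training pool (X, Y, S as aligned numpy arrays). Independent uniform sampling
    without replacement is an assumption of this sketch."""
    rng = np.random.default_rng(seed)
    clients = []
    for _ in range(num_clients):
        idx = rng.choice(len(X), size=samples_per_client, replace=False)
        clients.append((X[idx], Y[idx], S[idx]))
    return clients
```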
Simulation metrics: Utility performance is measured by the mutual information $I(Z;Y)$ and the inference accuracy of Y, while leakage performance is measured by the mutual information $I(Z;S)$ and the inference accuracy of S. $I(Y;Z)$ and $I(S;Z)$ directly quantify the information contained in the representation Z about the utility label Y and the sensitive attribute S, which can be regarded as utility information and sensitive information leakage, respectively. Consequently, the trade-off between utility and leakage can be quantified by $I(Y;Z) - I(S;Z)$. Mutual information has an intrinsic drawback as a metric: the marginal and joint distributions are typically unknown, making the direct computation of mutual information between two variables intractable. To address this, the Mutual Information Neural Estimator (MINE) maximizes a lower bound on mutual information as an alternative estimate (a minimal sketch is given after this discussion). However, this approach introduces two key issues:
  • The estimated lower bound may fail to closely approximate the true mutual information, particularly when its actual value is small.
  • Neural network-based estimation can suffer from high variance. This problem is amplified when dealing with high-dimensional data.
To mitigate potential estimation inaccuracies, we employ the inference accuracy of predicting Y and S from Z as another metric. Higher inference accuracy for these variables validates a stronger dependency and indicates that Z encapsulates more relevant information about them. In addition, the difference in inference accuracy between Y and S serves as a complementary metric for evaluating the trade-off.
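For reference, a minimal PyTorch sketch of such a MINE-style (Donsker–Varadhan) lower-bound estimator for $I(S;Z)$ is given below; the critic architecture and training schedule are assumptions rather than the exact estimator used in the simulations.

```python
import math
import torch
import torch.nn as nn

class MINE(nn.Module):
    """Donsker-Varadhan (MINE-style) lower bound on I(Z; S) with a small critic
    network. A minimal sketch; architecture and training details are assumptions."""

    def __init__(self, d_z, d_s, hidden=64):
        super().__init__()
        self.critic = nn.Sequential(nn.Linear(d_z + d_s, hidden), nn.ReLU(), nn.Linear(hidden, 1))

    def lower_bound(self, z, s):
        t_joint = self.critic(torch.cat([z, s], dim=1)).mean()    # E_{p(z,s)}[T]
        s_perm = s[torch.randperm(s.shape[0])]                    # shuffle s to mimic p(z)p(s)
        t_marg = self.critic(torch.cat([z, s_perm], dim=1))
        log_mean_exp = torch.logsumexp(t_marg, dim=0) - math.log(t_marg.shape[0])
        return (t_joint - log_mean_exp).squeeze()                 # lower bound on I(Z;S)

# Training maximizes the bound, e.g.: loss = -mine.lower_bound(z_batch, s_batch)
```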
Baselines: While existing private FL methods defend against privacy attacks during training by addressing privacy risks in gradients and parameters, our approach focuses on reducing the leakage of sensitive attributes from the deployed model's outputs. Since our method is compatible with most private federated learning approaches, we select baselines built on representation learning and federated learning to demonstrate its effectiveness. We compare our scheme with baselines that combine FedAvg [1] with centralized representation learning offering sensitive attribute protection, including the privacy funnel optimization-based PPVAE [23], the disentanglement-focused FFVAE [24], the variational approach VFAE [25], and the latent distribution learning-based FSNS [26], as well as the raw data without sensitive information protection.
Implementation: We implement the neural network models for the Adult and COMPAS datasets using fully connected layers; both models share the architecture presented in Table 1, with input dimension $d_{in} = 10$ for COMPAS and $d_{in} = 13$ for Adult. The representation dimension is uniformly set to $d_z = 2$, and the sensitive attribute dimension to $d_s = 1$. A one-layer fully connected network is adopted as the inference model for the utility attribute Y, while a random forest classifier serves as the inference model for the sensitive attribute S, yielding inference accuracies for both the utility label Y and the sensitive attribute S.
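These inference accuracies can be computed from frozen representations roughly as follows; a logistic-regression stand-in replaces the one-layer fully connected network, and the classifier hyperparameters are assumptions.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression

def evaluate_representation(z_train, y_train, s_train, z_test, y_test, s_test):
    """Sketch of the inference-accuracy metrics: fit predictors on the released
    representations and report test accuracy for Y (utility) and S (leakage)."""
    util = LogisticRegression(max_iter=1000).fit(z_train, y_train)
    leak = RandomForestClassifier(n_estimators=100, random_state=0).fit(z_train, s_train)
    return {"accuracy_Y": util.score(z_test, y_test),
            "accuracy_S": leak.score(z_test, s_test)}
```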
Figure 3 and Figure 4 depict the trade-off between leakage and utility across β as well as $\epsilon \in \{2, 4, 6, 8, 100, 1000\}$. Under a higher ϵ, due to the smaller noise scale, β has a greater impact on the model's performance. As β increases, the protection of sensitive information improves while utility performance decreases, which is consistent with the structure of our loss function. Reducing ϵ leads to a lower upper bound on leakage and a larger noise variance; the increasing noise scale reduces the leakage of sensitive information but degrades utility. As can be observed, utility varies more drastically with changes in β, indicating that, compared to leakage, utility is more susceptible to the influence of β.
Figure 5 and Figure 6 depict the trade-off between sensitive information leakage and information utility across a range of ϵ for $\beta \in \{10^{-3}, 10^{-2}, 10^{-1}, 1, 10\}$ on the COMPAS and Adult datasets. As the inference accuracy of the utility attribute increases, the inference accuracy of the sensitive attribute also increases, indicating the inherent trade-off between utility and leakage. For a given ϵ, diverse operating points can be attained by adjusting β. With a low ϵ, the proposed scheme achieves low sensitive information leakage at the expense of utility, indicating that less information leakage can be obtained by adding noise at the cost of utility performance. As can be observed, the noise substantially affects the utility–leakage trade-off of the overall model, while the effective range of β is noticeably affected by the magnitude of the noise.
To ensure a fair comparison, we tune the hyperparameters of all compared methods and select hyperparameters yielding a similar utility level. Owing to the low dimensionality of the datasets, we choose a representation dimension of $d_z = 2$ for our experiments. As shown in Table 2, our proposed scheme not only mitigates sensitive information leakage but also achieves a favorable trade-off, as validated by both inference accuracy and mutual information. The failure of FFVAE's representation learning is evident in the label classification accuracy, where classifiers using its extracted representations collapse to predicting only the majority class (all 0s or 1s). Consequently, we exclude its performance on the other metrics and mark them with "-".
As can be seen from Table 2, when the representation dimension $d_z$ is 2, the utility performance of the compared methods on the COMPAS dataset is already close to that of using the raw data directly for inference. However, disentanglement methods such as FFVAE often require higher dimensions, and the compared methods have not fully realized their potential on the Adult dataset, leaving room for improvement. Therefore, based on the input dimensions of the two datasets, we conducted further experiments with the representation dimension $d_z$ set to 4.
Based on Table 3, it can be observed that FFVAE still performs poorly, while all other methods show improvements in utility. On the COMPAS dataset, the improvement in utility is relatively limited, as the performance of all methods was already strong at $d_z = 2$. In contrast, the enhancement is more pronounced on the Adult dataset. From the perspective of sensitive information leakage, the inference accuracy of S exhibits different trends across the two datasets, with slight increases in some cases and significant decreases in others. On the other hand, the mutual information $I(S;Z)$ shows a clear increase in sensitive information leakage, which is reasonable, as a larger feature vector dimension can potentially incur more information leakage.
We further evaluate the scalability of our method by extending it to accommodate multiple sensitive attributes. This is achieved by generalizing the objective in (11) to include an additional term $H(X|S_2,Z)$ for a second sensitive attribute, coupled with an additional side decoder dedicated to it. To assess the impact of this extension, we conduct a comparative analysis between configurations with one and two sensitive attributes. We select two distinct sensitive attributes for our evaluation: on the COMPAS dataset we designate Prior as the second attribute ($S_2$), and on the Adult dataset we use relationship as $S_2$. The first sensitive attribute ($S_1$) for each dataset retains the original setting described in the FL environment. The performance is presented in Table 4.
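As a rough illustration of this extension (how the second term is weighted relative to the first is not specified in the paper, so separate weights are an assumption), the client loss gains one additional reconstruction term for the second side decoder:

```python
import torch.nn.functional as F

def loss_two_sensitive_attributes(y_logits, y, x_hat1, x_hat2, x, beta1, beta2):
    """Two-attribute extension of loss (15): one additional reconstruction term for a
    second side decoder conditioned on (z, s2). Separate weights beta1/beta2 are an
    assumption of this sketch."""
    return (F.cross_entropy(y_logits, y)
            + beta1 * F.mse_loss(x_hat1, x)     # side decoder conditioned on (z, s1)
            + beta2 * F.mse_loss(x_hat2, x))    # side decoder conditioned on (z, s2)
```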
With the introduction of a second sensitive attribute to protect, our method experiences a decline in both the utility of the data and the level of protection for the first sensitive attribute. This is because we must now strip more sensitive information from the learned representations, and the optimization process must balance preventing leakage from both attributes, which weakens its focus on removing correlations with the original one. Nevertheless, our method successfully strikes a balance, maintaining effective protection against leakage for both sensitive attributes while ensuring an acceptable level of utility preservation.

5. Conclusions

In this paper, we focus on FRL that specifically protects sensitive information, such as race or gender. We propose a method that simultaneously maximizes utility information in representations while restricting sensitive information leakage by the LDP mechanism and minimizing the upper bound of sensitive information leakage. We prove that our method theoretically guarantees sensitive information leakage below a predefined positive threshold and empirically demonstrate that it can achieve a better trade-off than baselines.
As further research directions, one could derive a tighter bound on the information leakage. A more refined leakage bound would enable the design of more effective schemes for optimizing the balance between data leakage and model utility. In addition, enhancing the robustness of extracted features against noise (perturbation) is also a highly valuable research direction: by augmenting feature robustness, a simultaneous reduction in information leakage and an improvement in utility performance can be attained when employing noise-based protection mechanisms. Furthermore, the leakage–utility trade-off can be further improved by selectively strengthening the robustness of those features within the feature vector that exhibit lower correlation with the sensitive attribute. Finally, extending the current framework to accommodate scenarios with multiple sensitive attributes constitutes an important and compelling avenue for future research.

Author Contributions

Conceptualization, Y.L., O.G. and Y.W.; Methodology, Y.L., O.G. and Y.W.; Software, Y.L.; Validation, Y.L., O.G., Y.S. and Y.W.; Formal analysis, Y.L., O.G., Y.S. and Y.W.; Investigation, Y.L., O.G., Y.S. and Y.W.; Resources, Y.W.; Writing—original draft, Y.L.; Writing—review & editing, Y.L., O.G., Y.S. and Y.W.; Supervision, Y.S. and Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

O. Günlü was partially supported by the ZENITH Research and Leadership Career Development Fund under Grant ID23.01 and the Swedish Foundation for Strategic Research (SSF) under Grant ID24-0087.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; Arcas, B.A.Y. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA, 20–22 April 2017; pp. 1273–1282. [Google Scholar]
  2. Dwork, C.; Roth, A. The algorithmic foundations of differential privacy. Found. Trends® Theor. Comput. Sci. 2014, 9, 211–407. [Google Scholar] [CrossRef]
  3. Song, S.; Chaudhuri, K.; Sarwate, A.D. Stochastic gradient descent with differentially private updates. In Proceedings of the 2013 IEEE Global Conference on Signal and Information Processing, Austin, TX, USA, 3–5 December 2013; pp. 245–248. [Google Scholar]
  4. Bassily, R.; Smith, A.; Thakurta, A. Private empirical risk minimization: Efficient algorithms and tight error bounds. In Proceedings of the 2014 IEEE 55th Annual Symposium on Foundations of Computer Science, Philadelphia, PA, USA, 18–21 October 2014; pp. 464–473. [Google Scholar]
  5. Geyer, R.C.; Klein, T.; Nabi, M. Differentially private federated learning: A client level perspective. arXiv 2017, arXiv:1712.07557. [Google Scholar]
  6. McMahan, H.B.; Ramage, D.; Talwar, K.; Zhang, L. Learning differentially private recurrent language models. arXiv 2017, arXiv:1710.06963. [Google Scholar]
  7. Wei, K.; Li, J.; Ding, M.; Ma, C.; Yang, H.H.; Farokhi, F.; Jin, S.; Quek, T.Q.; Poor, H.V. Federated learning with differential privacy: Algorithms and performance analysis. IEEE Trans. Inf. Forensics Secur. 2020, 15, 3454–3469. [Google Scholar] [CrossRef]
  8. Abadi, M.; Chu, A.; Goodfellow, I.; McMahan, H.B.; Mironov, I.; Talwar, K.; Zhang, L. Deep learning with differential privacy. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, 24–28 October 2016; pp. 308–318. [Google Scholar]
  9. Agrawal, D.; Aggarwal, C.C. On the design and quantification of privacy preserving data mining algorithms. In Proceedings of the Twentieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Santa Barbara, CA, USA, 21–23 May 2001; pp. 247–255. [Google Scholar]
  10. Calmon, F.P.; Makhdoumi, A.; Médard, M. Fundamental limits of perfect privacy. In Proceedings of the 2015 IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, 14–19 June 2015; pp. 1796–1800. [Google Scholar] [CrossRef]
  11. Tripathy, A.; Wang, Y.; Ishwar, P. Privacy-Preserving Adversarial Networks. arXiv 2017, arXiv:1712.07008. [Google Scholar]
  12. Sreekumar, S.; Gündüz, D. Optimal Privacy-Utility Trade-off under a Rate Constraint. In Proceedings of the 2019 IEEE International Symposium on Information Theory (ISIT), Paris, France, 7–12 July 2019; pp. 2159–2163. [Google Scholar] [CrossRef]
  13. Razeghi, B.; Calmon, F.P.; Gunduz, D.; Voloshynovskiy, S. Bottlenecks CLUB: Unifying Information-Theoretic Trade-Offs Among Complexity, Leakage, and Utility. IEEE Trans. Inf. Forensics Secur. 2023, 18, 2060–2075. [Google Scholar] [CrossRef]
  14. du Pin Calmon, F.; Fawaz, N. Privacy against statistical inference. In Proceedings of the 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 1–5 October 2012; pp. 1401–1408. [Google Scholar] [CrossRef]
  15. Gündüz, D.; Gomez-Vilardebo, J.; Tan, O.; Poor, H.V. Information theoretic privacy for smart meters. In Proceedings of the 2013 Information Theory and Applications Workshop (ITA), San Diego, CA, USA, 10–15 February 2013; pp. 1–7. [Google Scholar]
  16. Rodríguez-Gálvez, B.; Thobaben, R.; Skoglund, M. A variational approach to privacy and fairness. In Proceedings of the 2021 IEEE Information Theory Workshop (ITW), Kanazawa, Japan, 17–21 October 2021; pp. 1–6. [Google Scholar]
  17. Hamman, F.; Dutta, S. Demystifying local and global fairness trade-offs in federated learning using information theory. In Proceedings of the International Conference on Machine Learning 2023, Honolulu, HI, USA, 28 July 2023. [Google Scholar]
  18. Kang, J.; Xie, T.; Wu, X.; Maciejewski, R.; Tong, H. Infofair: Information-theoretic intersectional fairness. In Proceedings of the 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 17–20 December 2022; pp. 1455–1464. [Google Scholar]
  19. Ghassami, A.; Khodadadian, S.; Kiyavash, N. Fairness in supervised learning: An information theoretic approach. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018; pp. 176–180. [Google Scholar]
  20. Kasiviswanathan, S.P.; Lee, H.K.; Nissim, K.; Raskhodnikova, S.; Smith, A. What can we learn privately? SIAM J. Comput. 2011, 40, 793–826. [Google Scholar] [CrossRef]
  21. Asuncion, A.; Newman, D. UCI Machine Learning Repository; University of California: Irvine, CA, USA, 2007. [Google Scholar]
  22. Dieterich, W.; Mendoza, C.; Brennan, T. COMPAS Risk Scales: Demonstrating Accuracy Equity and Predictive Parity; Northpointe Inc.: Traverse City, MI, USA, 2016; Volume 7, pp. 1–36. [Google Scholar]
  23. Nan, L.; Tao, D. Variational approach for privacy funnel optimization on continuous data. J. Parallel Distrib. Comput. 2020, 137, 17–25. [Google Scholar] [CrossRef]
  24. Creager, E.; Madras, D.; Jacobsen, J.H.; Weis, M.; Swersky, K.; Pitassi, T.; Zemel, R. Flexibly fair representation learning by disentanglement. In Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA, 9–15 June 2019; pp. 1436–1445. [Google Scholar]
  25. Louizos, C.; Swersky, K.; Li, Y.; Welling, M.; Zemel, R. The variational fair autoencoder. arXiv 2015, arXiv:1511.00830. [Google Scholar]
  26. Jang, T.; Gao, H.; Shi, P.; Wang, X. Achieving Fairness through Separability: A Unified Framework for Fair Representation Learning. In Proceedings of the 27th International Conference on Artificial Intelligence and Statistics, Valencia, Spain, 2–4 May 2024; Dasgupta, S., Mandt, S., Li, Y., Eds.; Proceedings of Machine Learning Research (PMLR): Cambridge, MA, USA, 2024; Volume 238, pp. 28–36. [Google Scholar]
Figure 1. Federated representation learning framework.
Figure 2. Representation learning model at client k.
Figure 3. Leakage and utility performance on the Adult dataset with $\beta \in \{10^{-3}, 10^{-2}, 10^{-1}, 1, 10\}$: (a,b) leakage measured via the inference accuracy of the sensitive attribute S (gender) and the mutual information $I(S;Z)$; (c,d) utility measured via the inference accuracy of the label Y (income) and the mutual information $I(Y;Z)$.
Figure 4. Leakage and utility performance on the COMPAS dataset with $\beta \in \{10^{-3}, 10^{-2}, 10^{-1}, 1, 10\}$: (a,b) leakage measured via the inference accuracy of the sensitive attribute S (race) and the mutual information $I(S;Z)$; (c,d) utility measured via the inference accuracy of the label Y (recidivism outcome) and the mutual information $I(Y;Z)$.
Figure 5. Utility–leakage trade-offs on COMPAS dataset: (a) trade-off measured by inference accuracy; (b) trade-off measured by mutual information.
Figure 6. Utility–leakage trade-offs on Adult dataset: (a) trade-off measured by inference accuracy; (b) trade-off measured by mutual information.
Table 1. The neural network architecture for the Adult and COMPAS datasets.

Module          | Layer               | Input     | Output
Encoder         | Dense + ReLU        | d_in      | 100
                | Dense               | 100       | d_z
Utility Decoder | Dense + ReLU        | d_z       | 100
                | Dense + Sigmoid     | 100       | 2
Side Decoder    | Dense + ReLU        | d_z + d_s | 100
                | Dense               | 100       | d_in
LDP mechanism   | Laplacian mechanism | d_z       | d_z
Table 2. Utility–leakage trade-off on COMPAS and Adult datasets, d_z = 2.

Dataset | Method   | Accuracy (Y) | I(Y;Z) | Accuracy (S) | I(S;Z) | I(Y;Z) − I(S;Z) | Accuracy (Y) − Accuracy (S)
COMPAS  | ours     | 0.6691 | 0.1192 | 0.5983 | 0.0098 | 0.1094  | 0.0708
        | FFVAE    | 0.5377 | 0.011  | -      | -      | -       | -
        | PPVAE    | 0.6632 | 0.0897 | 0.9415 | 0.1485 | −0.0587 | −0.2783
        | VFAE     | 0.6546 | 0.0567 | 0.9834 | 0.2865 | −0.2118 | −0.3288
        | FSNS     | 0.6701 | 0.0760 | 0.6246 | 0.0207 | 0.0553  | 0.0455
        | Raw data | 0.6776 | 0.1776 | 0.6884 | 0.2506 | −0.0729 | −0.0108
Adult   | ours     | 0.8389 | 0.1938 | 0.6142 | 0.0325 | 0.1613  | 0.2247
        | FFVAE    | 0.7637 | 0.0    | -      | -      | -       | -
        | PPVAE    | 0.7879 | 0.1633 | 0.7479 | 0.0769 | 0.0864  | 0.040
        | VFAE     | 0.7865 | 0.1555 | 0.6672 | 0.0696 | 0.0859  | 0.1193
        | FSNS     | 0.8126 | 0.2423 | 0.6689 | 0.0850 | 0.1573  | 0.1437
        | Raw data | 0.8527 | 0.3374 | 0.8391 | 0.4150 | −0.0776 | 0.0136
Note: Bold values denote the best results for each metric.
Table 3. Utility–leakage trade-off on COMPAS and Adult datasets, d_z = 4.

Dataset | Method   | Accuracy (Y) | I(Y;Z) | Accuracy (S) | I(S;Z) | I(Y;Z) − I(S;Z) | Accuracy (Y) − Accuracy (S)
COMPAS  | ours     | 0.6717 | 0.1103 | 0.6187 | 0.0758 | 0.0344  | 0.0530
        | FFVAE    | 0.5377 | 0.0797 | -      | -      | -       | -
        | PPVAE    | 0.6627 | 0.1384 | 0.6639 | 0.5099 | −0.3715 | −0.0012
        | VFAE     | 0.6659 | 0.1154 | 0.5910 | 0.6574 | −0.5420 | 0.0749
        | FSNS     | 0.6722 | 0.1226 | 0.6293 | 0.1778 | −0.0552 | 0.0429
        | Raw data | 0.6776 | 0.1776 | 0.6884 | 0.2506 | −0.0729 | −0.0108
Adult   | ours     | 0.8364 | 0.2136 | 0.6595 | 0.0990 | 0.1146  | 0.1769
        | FFVAE    | 0.7637 | 0.0    | -      | -      | -       | -
        | PPVAE    | 0.8118 | 0.2262 | 0.6995 | 0.2593 | −0.0331 | 0.1123
        | VFAE     | 0.8073 | 0.2049 | 0.6684 | 0.1330 | 0.0719  | 0.1389
        | FSNS     | 0.8335 | 0.2701 | 0.7071 | 0.2290 | 0.0411  | 0.1264
        | Raw data | 0.8527 | 0.3374 | 0.8391 | 0.4150 | −0.0776 | 0.0136
Note: Bold values denote the best results for each metric.
Table 4. Performance comparison of our method with different numbers of sensitive attributes.

Dataset | Method             | Accuracy (Y) | I(Y;Z) | Accuracy (S_1) | I(S_1;Z) | Accuracy (S_2) | I(S_2;Z)
COMPAS  | ours (S_1 only)    | 0.6691 | 0.1192 | 0.5983 | 0.0098 | -      | -
        | ours (S_1 and S_2) | 0.6533 | 0.0764 | 0.5890 | 0.0501 | 0.6295 | 0.0178
Adult   | ours (S_1 only)    | 0.8389 | 0.1938 | 0.6142 | 0.0325 | -      | -
        | ours (S_1 and S_2) | 0.8125 | 0.1485 | 0.6622 | 0.0363 | 0.6986 | 0.1173
