Article

Federated Learning Incentive Mechanism Design via Shapley Value and Pareto Optimality

1 School of Mathematics and Statistics, Guizhou University, Guiyang 550025, China
2 State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
3 Guizhou Big Data Academy, Guizhou University, Guiyang 550025, China
4 Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, China
5 College of Big Data and Information Engineering, Guizhou University, Guiyang 550025, China
6 School of Information, Guizhou University of Finance and Economics, Guiyang 550025, China
* Author to whom correspondence should be addressed.
Axioms 2023, 12(7), 636; https://doi.org/10.3390/axioms12070636
Submission received: 7 May 2023 / Revised: 16 June 2023 / Accepted: 25 June 2023 / Published: 27 June 2023
(This article belongs to the Special Issue Advances in Logic and Game Theory)

Abstract

Federated learning (FL) is a distributed machine learning framework that helps multiple players use their data to train federated models while preserving data privacy and complying with data security requirements and government regulations. To train an accurate federated model, all federated players must participate actively. It is therefore crucial to design an incentive mechanism; however, fairness and Pareto efficiency conflict within such a mechanism. In this paper, we propose an incentive mechanism combining the Shapley value with Pareto efficiency optimization, in which a third party is introduced to supervise the federated payoff allocation. If the payoff reaches Pareto optimality, the federated payoff is allocated by the Shapley value method; otherwise, the relevant federated players are punished. Numerical and simulation experiments show that the mechanism achieves fair and Pareto-optimal payoff allocation. The Nash equilibrium of the mechanism is formed when the Pareto-optimal payoff allocation is achieved.
MSC:
91A12; 91A06; 58E17

1. Introduction

As artificial intelligence (AI) continues to develop at a rapid pace, massive and diverse data are being generated. Data from all walks of life carry high utility value and private information, which gives rise to the problems of data islands and data privacy. To solve these problems, McMahan et al. [1,2,3] proposed a distributed machine learning method called federated learning (FL) in 2016. FL trains the model at each node without the node's sensitive data ever leaving it, which both makes full use of the data to train models and protects sensitive data privacy [4].
However, to make each data owner willing to contribute its data to model training and to improve the accuracy of the model, designing an incentive mechanism for the FL system is a valuable research topic. If free riding arises among federated players, for example through a lack of real-time training data, the accuracy of federated model training is seriously affected. Establishing an attractive incentive mechanism is therefore essential for the FL system.
The FL incentive mechanism has attracted much attention from academics [5,6]. A great deal of research has been conducted in [7,8,9], and although fairness was considered [10,11], Pareto efficiency was not. We aim to establish an incentive model in a federated system that achieves both fair payoff allocation and Pareto optimality. For this purpose, we propose a method combining the Shapley value and Pareto optimality. The Shapley value is consistent with the principle of budget balance; however, according to Holmstrom's team production theory [12], budget balance and Pareto optimality cannot be reached simultaneously. Therefore, although the Shapley value method can deliver fair payoff allocation after the completion of FL, it cannot induce the optimal incentive input from each federated player before FL, i.e., it cannot achieve Pareto efficiency optimization before FL. We therefore design a mechanism that makes the payoff allocation of FL both fair and efficient: we introduce a supervisor and set penalty conditions. If the federated payoff reaches Pareto efficiency optimality, the payoffs are allocated by the Shapley value formula; otherwise, the relevant federated players are penalized. Finally, numerical experiments confirm our theoretical analysis. The main contributions of the paper can be summarized as follows:
(1)
Discussing the conditions that the fines paid to the supervisor by the limited-liability federated players must satisfy if Pareto optimality is to be achieved;
(2)
Demonstrating that the federated players’ inputs constitute the mechanism’s Nash equilibrium when Pareto optimality is satisfied;
(3)
Performing numerical examples to verify the rationality of the designed mechanism for both equal and unequal statuses of the federated players.
The remaining work is organized as follows: related work is reviewed in Section 2; preliminaries are introduced in Section 3; the FL incentive mechanism is established in Section 4; the rationality of this incentive mechanism is verified by numerical and simulation experiments in Section 5; the conclusion and future work are presented in Section 6; a discussion is given in Section 7; and the proofs of the theorems are given in Appendix A.

2. Related Work

The main theoretical approaches currently used for FL incentive mechanisms include the Stackelberg game [13], contract theory [14], the auction mechanism [15], and the Shapley value [16]. In this paper, we review the research related to FL incentive mechanism design via the Shapley value. In [17], Song et al. proposed a new Shapley-value-based contribution metric to evaluate the contribution of each data-owning player to the training of the FL model. In [18], the authors proposed a new expression of the Shapley value for FL that can be computed without consuming additional communication cost, can value the FL data, and can have an incentive effect on the players. Wang et al. [19] used the Shapley value to fairly calculate each federated player's contribution. To properly incentivize data owners to contribute their data to federated model training, the authors of [20] proposed a blockchain-based peer-to-peer payment system for FL that achieves a feasible fair payoff allocation mechanism based on the Shapley value. In [21], a fair incentive mechanism based on the Shapley value was proposed, which can motivate more players to share their data in exchange for a certain fee. In [22], the authors proposed the bootstrap truncated gradient Shapley approach for the fair valuation of FL players' contributions; this approach reconstructs the FL model from gradient updates for Shapley value calculation. Nagalapatti et al. [23] proposed a cooperative game in which players share gradients and players' Shapley values are computed to filter those with relevant data. To address remaining inequities in calculating the federated Shapley value, Fan et al. [24] proposed a new complete federated Shapley value mechanism to improve its fairness. In addition, because calculating the Shapley value in FL incurs a certain communication cost, the authors of [25] proposed a Shapley value based on a contribution evaluation metric called the vertical federated Shapley value (VerFedSV) and verified the fairness of VerFedSV through experiments. In [26], the authors considered several factors affecting FL and proposed an FL incentive mechanism based on an enhanced Shapley value method; numerical experiments verified that the payoffs allocated among all participants are fairer under the enhanced Shapley value method.

3. Preliminaries

3.1. Federated Learning Framework

Federated learning (FL) is a distributed machine learning technique or framework. Its goal is to technically break down data silos and enable AI collaboration: participants' data do not leave the local node during model training, and common modeling is achieved while ensuring data privacy, security, and legal compliance [1].
Let $N = \{1, 2, \ldots, n\}$ denote the $n$ data players participating in the training of model $M$, with local datasets $D = \{D_1, D_2, \ldots, D_n\}$. $M_{FED}$ denotes the shared model that FL requires the players to train together, and $M_{SUM}$ denotes the traditional machine learning model trained by pooling all data together. Let $V_{FED}$ and $V_{SUM}$ be the model accuracies of $M_{FED}$ and $M_{SUM}$, respectively. If there exists a positive number $\delta > 0$ that satisfies

$$|V_{FED} - V_{SUM}| < \delta, \tag{1}$$

we say that the FL algorithm has $\delta$-accuracy loss [4].
The framework diagram of FL is shown in Figure 1, and the training steps in the FL model are as follows:
Step 1: Local federated players download the initialized global model from the aggregation server;
Step 2: Each federated player trains the local model with the initializing global model;
Step 3: After training the local model, the updated model and parameters are uploaded to the aggregation server;
Step 4: The aggregation server aggregates the models and parameters uploaded by each federated player for the next update round.
The commonly used aggregation method is the federated averaging (FedAvg) algorithm [27]. Steps 2 and 3 are repeated until the local model converges.
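For concreteness, a minimal sketch of the Step 4 aggregation follows, assuming each player's update is a flat NumPy parameter vector and FedAvg weights proportional to local dataset sizes; the function name and shapes are illustrative, not taken from a specific FL library:

```python
import numpy as np

def fedavg(local_params, dataset_sizes):
    """Weighted average of local parameter vectors, as in FedAvg.

    local_params  : list of 1-D NumPy arrays, one per federated player
    dataset_sizes : local dataset sizes |D_i|, used as aggregation weights
    """
    w = np.asarray(dataset_sizes, dtype=float)
    w /= w.sum()                             # w_i = |D_i| / sum_k |D_k|
    return w @ np.stack(local_params)        # weighted average, shape (n_params,)

# One communication round (Steps 1-4) with three hypothetical players.
rng = np.random.default_rng(0)
global_model = np.zeros(4)
updates = [global_model + 0.1 * rng.standard_normal(4) for _ in range(3)]
global_model = fedavg(updates, dataset_sizes=[100, 250, 150])
```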

3.2. Cooperative Games

Let $G(N, v)$ be a cooperative game satisfying the following conditions [28]:

$$v(S_1) + v(S_2) \leq v(S_1 \cup S_2), \tag{2}$$
$$S_1 \cap S_2 = \emptyset, \quad v(\emptyset) = 0, \tag{3}$$

where $N$ is a finite set of players, $S_1, S_2 \in 2^N$, $v: 2^N \to \mathbb{R}$ is the game-characteristic function, and $2^N$ is the set of all subsets of $N$. Let $v(S)$ be the players' payoff function, let $v(N)$ indicate the coalition payoff, and let $\varphi_i(v)$ be the payoff of player $i$ in $v(N)$; these satisfy two constraints:

$$v(N) = \sum_{i=1}^{n} \varphi_i(v) \quad \text{and} \quad v(i) \leq \varphi_i(v), \quad i \in N, \; i = 1, 2, \ldots, n, \tag{4}$$
$$v(S) \leq \sum_{i \in S} \varphi_i(v), \quad \forall S \subseteq N, \; S \neq \emptyset. \tag{5}$$
Formulas (4) and (5) are called individual rationality and coalition rationality, respectively.

3.3. Shapley Value

The Shapley value was proposed in cooperative game theory [28]; it effectively solves the problem of cooperative payoff allocation and is defined as

$$\varphi_i(v) = \sum_{i \in S, \, S \subseteq N} w(|S|)\left[ v(S) - v(S \setminus i) \right], \tag{6}$$
$$w(|S|) = \frac{(n - |S|)! \, (|S| - 1)!}{n!}, \tag{7}$$

where $S \subseteq N$, $i \in S$, $i = 1, 2, \ldots, n$, $|S|$ is the number of players in subset $S$, $w(|S|)$ is the weight coefficient, $v(S)$ is the payoff of coalition $S$ and satisfies conditions (2) and (3), $v(S) - v(S \setminus i)$ assesses the marginal contribution of $i$ to coalition $S$, and $v(S \setminus i)$ indicates the payoff of the players in $S$ other than $i$.
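Formulas (6) and (7) can be implemented directly by enumerating all coalitions. The sketch below is a simple (exponential-time) implementation; the dictionary-based encoding of the characteristic function is an illustrative choice, and the usage example takes the coalition values of Numerical Example 2 (Section 5.2) evaluated at the Pareto-optimal inputs $x^* = (8, 6, 12)$:

```python
from itertools import combinations
from math import factorial

def shapley_values(players, v):
    """Compute phi_i(v) from Formulas (6) and (7) by enumerating all coalitions.

    players : list of player labels
    v       : dict mapping frozenset (coalition) -> payoff, with v[frozenset()] = 0
    """
    n = len(players)
    phi = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for r in range(len(others) + 1):
            for rest in combinations(others, r):
                S = frozenset(rest) | {i}                     # coalition containing i
                w = factorial(n - len(S)) * factorial(len(S) - 1) / factorial(n)
                total += w * (v[S] - v[frozenset(rest)])      # weighted marginal contribution
        phi[i] = total
    return phi

# Coalition values of Example 2 at x* = (8, 6, 12): v({1}) = 2*8, v({2}) = 4*6,
# v({3}) = 6*12, v({1,2}) = 8*6, v({1,3}) = v({2,3}) = 0, v(N) = 160.
f = frozenset
v = {f(): 0, f({1}): 16, f({2}): 24, f({3}): 72, f({1, 2}): 48,
     f({1, 3}): 0, f({2, 3}): 0, f({1, 2, 3}): 160}
print(shapley_values([1, 2, 3], v))  # {1: 50.67, 2: 54.67, 3: 54.67} (rounded)
```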

3.4. Pareto Optimality

Let $\pi = (\pi_1, \pi_2, \ldots, \pi_n): \mathcal{X} \to \mathbb{R}^n$ be the vector of the players' payoff functions, and let $\mathcal{X}$ be the feasible action space. For two actions $x_1, x_2 \in \mathcal{X}$ [29]:
(1)
If $\forall i \in [n]: \pi_i(x_1) \geq \pi_i(x_2)$, then $x_1$ weakly dominates $x_2$, written $x_1 \succeq x_2$;
(2)
If $x_1 \succeq x_2$ and $\exists i \in [n]: \pi_i(x_1) > \pi_i(x_2)$, then $x_1$ dominates $x_2$, written $x_1 \succ x_2$.
An action is called Pareto optimal if no other action in $\mathcal{X}$ dominates it, and the collection of the payoff vectors of all Pareto-optimal actions is the Pareto front [29]. In other words, an allocation is Pareto optimal if no alternative allocation could make someone better off without making someone else worse off [30].
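Definitions (1) and (2) translate directly into a dominance check over a finite set of candidate actions. A minimal sketch follows; the payoff matrix in the usage example is an illustrative assumption:

```python
import numpy as np

def pareto_front(payoffs):
    """Return indices of non-dominated rows of a payoff matrix.

    payoffs : shape (n_actions, n_players); row k holds (pi_1(x_k), ..., pi_n(x_k)).
    A row is Pareto optimal if no other row weakly dominates it with at least
    one strict improvement (definitions (1) and (2) above).
    """
    P = np.asarray(payoffs, dtype=float)
    front = []
    for k in range(len(P)):
        dominated = any(np.all(P[j] >= P[k]) and np.any(P[j] > P[k])
                        for j in range(len(P)) if j != k)
        if not dominated:
            front.append(k)
    return front

# Three candidate allocations for two players: the last is dominated by the first.
print(pareto_front([[3.0, 1.0], [2.0, 2.0], [2.5, 0.5]]))  # [0, 1]
```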

3.5. Nash Equilibrium

We consider a game with $n$ players, with the player set denoted $N = \{1, 2, \ldots, n\}$. Player $i$'s payoff function is $\pi_i(x)$, where $x = (x_1, x_2, \ldots, x_n)^{T} \in \mathbb{R}^n$ is the vector of the players' actions and $x_i \in \mathbb{R}$ is the action of player $i$. If player $j$ is not a neighbor of player $i$, then player $i$ has no direct access to player $j$'s action.
A Nash equilibrium is an action profile at which no player can gain more payoff by unilaterally changing its action; i.e., an action profile $x^* = (x_i^*, x_{-i}^*)$ is a Nash equilibrium [31,32] if

$$\pi_i(x_i^*, x_{-i}^*) \geq \pi_i(x_i, x_{-i}^*), \quad \forall i \in N,$$

where $x_{-i} = (x_1, x_2, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n)^{T}$. Note that $\pi_i(x)$ and $x$ might alternatively be written as $\pi_i(x_i, x_{-i})$ and $(x_i, x_{-i})$, respectively, in this paper.
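The Nash condition can be checked numerically for a candidate profile by scanning unilateral deviations. A minimal sketch, where the two-player payoff functions and the deviation grid in the usage example are illustrative assumptions:

```python
import numpy as np

def is_nash(x_star, payoff_fns, deviations, tol=1e-9):
    """Check the Nash condition: no player gains by unilaterally deviating.

    x_star     : candidate equilibrium profile, array of n actions
    payoff_fns : list of n functions, payoff_fns[i](x) = pi_i of the full profile x
    deviations : candidate alternative actions x_i to test for each player
    """
    x_star = np.asarray(x_star, dtype=float)
    for i, pi in enumerate(payoff_fns):
        base = pi(x_star)
        for d in deviations:
            x = x_star.copy()
            x[i] = d                      # unilateral deviation by player i
            if pi(x) > base + tol:        # profitable deviation -> not a Nash equilibrium
                return False
    return True

# Two-player example: pi_i(x) = x_i * (2 - x_1 - x_2); the best responses intersect
# at x = (2/3, 2/3), which the scan confirms is a Nash equilibrium.
fns = [lambda x: x[0] * (2 - x[0] - x[1]), lambda x: x[1] * (2 - x[0] - x[1])]
print(is_nash([2/3, 2/3], fns, deviations=np.linspace(0, 2, 201)))  # True
```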

4. Federated Learning Incentive Mechanism

4.1. The FL Incentive Model

We make the following assumptions before setting up the FL incentive model:
(1)
All players can pay for FL, and in the payoff distribution process, they adopt the best payoff distribution scheme.
(2)
All players are satisfied with the final distribution of payoffs, as all players were willing to join the coalition.
(3)
All players are entirely trustworthy and do not cheat during the FL process.
(4)
To ensure the smooth implementation of the strategy, the FL should adopt a multi-party agreement to accept the payoff distribution plan.
According to the idea of FL, we establish the FL incentive model in Figure 2, and the main steps of the FL incentive mechanism are as follows:
Step 1: Assume that there are n players for federated model training, and each player has its local dataset D i .
Step 2: Each player downloads the initialized model from the aggregation server, trains the model using its local dataset D i , and uploads the trained model m i to the federated aggregation server.
Step 3: The federated aggregation server collects the model parameters m i uploaded by all players and uses the federated aggregation algorithm (FedAvg) to aggregate these parameters to obtain a new global model.
Step 4: The contribution of each player to the global model is calculated using Shapley values or other methods. These contribution values are used to determine the distribution of the player’s payoffs.
Step 5: The supervising organization determines whether the federated payoff reaches Pareto optimality; if it does, the federated payoff is distributed using the Shapley value formula; otherwise, the relevant players receive a penalty from the supervising organization.
Step 6: According to the gain allocation formula, rewards are issued to each player who achieves Pareto optimality.

4.2. The Conflict between Fairness and Pareto Optimality

To guarantee that each player is satisfied with the federated payoff allocation method and to make the allocation process motivating, all players should be induced to contribute actively to the FL. However, the Shapley value method assigns total weight $1/n$ to the marginal contributions at each coalition size, and it ignores the conflict between the fairness of the Shapley value and Pareto optimality.
Assume there are $n$ players, the coalition input of player $i$ is $x_i \in (0, \infty)$, $i = 1, 2, \ldots, n$, and the coalition inputs of all players form an $n$-dimensional vector $x = (x_1, x_2, \ldots, x_n)$. The coalition cost $c_i(x_i)$ of player $i$ is a strictly monotonically increasing differentiable convex function satisfying $\partial c_i / \partial x_i > 0$, $\partial^2 c_i / \partial x_i^2 > 0$, and $c_i(0) = 0$. The federated payoff $v(x_1, x_2, \ldots, x_n)$ determined by the federated inputs of the $n$ players is a strictly monotonically increasing differentiable concave function satisfying $\partial v / \partial x_i > 0$, $\partial^2 v / \partial x_i^2 < 0$, and $v(0, 0, \ldots, 0) = 0$. The federated payoffs of the $n$ players are distributed according to Formulas (6) and (7).
Although the Shapley value method is fair for the federated payoff allocation, it cannot prevent the free riding of federated players; that is, Pareto optimization before payoff allocation is not satisfied. Therefore, we obtain the following theorem:
Theorem 1.
The Shapley value method satisfies fair payoff allocation after FL but does not provide the optimal incentive for federated players' inputs before FL, i.e., it does not achieve Pareto efficiency optimization before FL.
Proof. 
The proof is given in Appendix A.1 of the Appendix A. □

4.3. FL Incentive Mechanism via Introducing Supervisory Organization

4.3.1. The Establishment of Supervisory Organization Mechanism

In [33], Alchian and Demsetz argued that introducing a supervisory organization can address free riding, which applies equally to the FL process. To encourage supervisor initiative, the federated members must pay a certain fee to the supervisor. In [12], Holmstrom pointed out that the free-riding phenomenon can be addressed by an incentive mechanism; the supervisor's main task is to break the inefficient equilibrium and create incentives.
If the supervisor observes that the federated payoff is greater than or equal to the Pareto-optimal payoff, the supervisor distributes this payoff to the players according to Formulas (6) and (7); if the federated payoff is less than the Pareto-optimal value, federated player $i$ must pay a fee $k_i$, as follows:

$$r_i(x) = \begin{cases} \varphi_i(v), & \text{if } v \geq v(x^*) \\ \varphi_i(v) - k_i, & \text{if } v < v(x^*), \end{cases} \tag{8}$$

where $x^* = (x_1^*, x_2^*, \ldots, x_n^*)$ is the federated input vector satisfying the Pareto optimality condition (Formula (A6)).
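A compact sketch of the payoff rule in Formula (8) follows; the function signature and the list-based encoding of shares and fines are illustrative assumptions:

```python
def supervised_allocation(v, v_pareto, shapley_shares, penalties):
    """Payoff rule r_i(x) of Formula (8).

    v              : realized federated payoff v(x)
    v_pareto       : Pareto-optimal payoff v(x*)
    shapley_shares : list of Shapley shares phi_i(v), from Formulas (6) and (7)
    penalties      : list of fines k_i applied when v falls short of v(x*)
    """
    if v >= v_pareto:
        return list(shapley_shares)                        # allocate by Shapley value
    return [phi - k for phi, k in zip(shapley_shares, penalties)]

# Example 1 numbers: at the Pareto payoff v = 6, each player receives 2;
# below it, each player would keep phi_i - k_i instead.
print(supervised_allocation(6.0, 6.0, [2.0, 2.0, 2.0], [1.5, 1.5, 1.5]))  # [2.0, 2.0, 2.0]
```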

4.3.2. Penalty Conditions

Theorem 2.
When the federated input $x^* = (x_1^*, x_2^*, \ldots, x_n^*)$ satisfying Pareto optimality is a Nash equilibrium of the mechanism, the penalty $k_i$ must satisfy the following two conditions:
(1)
If the independent input $x_i$ of player $i$ is less than the Pareto-optimal federated input $x_i^*$, i.e., $x_i < x_i^*$, then, since $v(x)$ is monotonically increasing, $v(x_i, x_{n-i}^*) < v(x_i^*, x_{n-i}^*)$; player $i$ is fined, the payoff remaining after the fine is $r_i[v(x_i, x_{n-i}^*)] = \varphi_i(v) - k_i$, and the final profit of player $i$ is

$$\pi_i[v(x_i, x_{n-i}^*)] = \varphi_i(v) - k_i - c_i(x_i), \quad i = 1, 2, \ldots, n. \tag{9}$$

(2)
If the independent input $x_i$ of player $i$ equals the Pareto-optimal federated input $x_i^*$, i.e., $x_i = x_i^*$, then $v = v(x^*)$, no fine is imposed, $r_i[v(x_i^*, x_{n-i}^*)] = \varphi_i(v^*)$, and the final profit of player $i$ is

$$\pi_i[v(x_i^*, x_{n-i}^*)] = \varphi_i(v^*) - c_i(x_i^*), \quad i = 1, 2, \ldots, n, \tag{10}$$

where $x_{n-i}^* = (x_1^*, \ldots, x_{i-1}^*, x_{i+1}^*, \ldots, x_n^*)$ denotes the Pareto-optimal federated input vector of the remaining $n - 1$ players.
Proof. 
The proof is given in Appendix A.2 of the Appendix A. □

5. Numerical Examples and Simulation Experiments

In this section, we use two examples to verify the rationality of the above discussion. In Example 1, the status of the federated players is equal; in Example 2, it is unequal.

5.1. Numerical Example 1: Equal Status of Federated Players

Assume there are three players in the FL system with federated payoff function $v(x_1, x_2, x_3) = x_1 + x_2 + x_3 + x_1 x_2 + x_1 x_3 + x_2 x_3$, a strictly monotonically increasing concave function, and cost functions $c_1(x_1) = \frac{3}{2}x_1^2$, $c_2(x_2) = \frac{3}{2}x_2^2$, and $c_3(x_3) = \frac{3}{2}x_3^2$; each cost function $c_i(x_i)$ is a strictly increasing convex function. At the federated input $x^* = (x_1^*, x_2^*, x_3^*)$, the federated profit

$$\max R = v(x_1, x_2, x_3) - \sum_{i=1}^{3} c_i(x_i)$$

is maximized and Pareto optimality is satisfied; the first-order conditions are

$$\begin{cases} 1 + x_2^* + x_3^* - 3x_1^* = 0 \\ 1 + x_1^* + x_3^* - 3x_2^* = 0 \\ 1 + x_1^* + x_2^* - 3x_3^* = 0. \end{cases}$$

We find that the federated inputs satisfying the Pareto optimality conditions are $x_1^* = 1$, $x_2^* = 1$, and $x_3^* = 1$, the federated payoff is $v(x^*) = 6$, and the maximum profit is $\max R^* = 1.5$. Because the three players have equal status, the anonymity and efficiency of Formulas (6) and (7) give

$$\varphi_1(v(x)) = \varphi_2(v(x)) = \varphi_3(v(x)), \quad \sum_{i=1}^{3} \varphi_i(v(x)) = v(x) \;\Longrightarrow\; \varphi_i(v(x)) = \frac{1}{3}v(x).$$
Therefore, the profit functions of the three players are

$$\pi_i\!\left(\tfrac{1}{3}v, x_i\right) = \frac{1}{3}v(x) - \frac{3}{2}x_i^2, \quad i = 1, 2, 3.$$
At a Nash equilibrium, each player takes the other players' investments in the FL as given and chooses its own investment to maximize its own profit. The first-order conditions of the Nash equilibrium are therefore

$$\begin{cases} 1 + x_2 + x_3 - 9x_1 = 0 \\ 1 + x_1 + x_3 - 9x_2 = 0 \\ 1 + x_1 + x_2 - 9x_3 = 0. \end{cases}$$

We find that the federated inputs satisfying the Nash equilibrium conditions are $\tilde{x}_1 = 0.14$, $\tilde{x}_2 = 0.14$, and $\tilde{x}_3 = 0.14$, the federated payoff is $v(\tilde{x}) = 0.49$, and the maximum profit is $\max \tilde{R} = 0.4$.
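Since both first-order systems above are linear, they can be verified directly; the following NumPy check (an illustrative sketch) reproduces the values reported in Table 1:

```python
import numpy as np

# Example 1 first-order conditions written as linear systems A x = 1.
A_pareto = np.array([[3., -1., -1.], [-1., 3., -1.], [-1., -1., 3.]])  # Pareto FOC
A_nash   = np.array([[9., -1., -1.], [-1., 9., -1.], [-1., -1., 9.]])  # Nash FOC
b = np.ones(3)

v = lambda x: x.sum() + x[0]*x[1] + x[0]*x[2] + x[1]*x[2]   # federated payoff
R = lambda x: v(x) - 1.5 * (x**2).sum()                     # federated profit

for name, A in (("Pareto", A_pareto), ("Nash", A_nash)):
    x = np.linalg.solve(A, b)
    print(f"{name}: x = {x.round(2)}, v = {v(x):.2f}, R = {R(x):.2f}")
# Pareto: x = [1. 1. 1.], v = 6.00, R = 1.50
# Nash:   x = [0.14 0.14 0.14], v = 0.49, R = 0.40
```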
By comparing the Pareto optimality solution and the Nash equilibrium solution in Table 1, we see that the Shapley value method achieves fairness after FL, but the optimal incentive is not reached before FL. Under the Nash equilibrium, each player's input is lower than the Pareto-optimal level, and the federated profit cannot reach its maximum.
Next, we introduce the supervisory authority. If the supervisory authority observes that the federated payoff is greater than or equal to the Pareto-optimal payoff 6, the payoff is allocated to the federated players by Formulas (6) and (7); if it observes that the federated payoff is lower than 6, the payoff of each federated player is 0. The specific expression is as follows:

$$\varphi_i(v(x)) = \begin{cases} \frac{1}{3}v(x), & \text{if } v(x) \geq 6, \\ 0, & \text{if } v(x) < 6. \end{cases}$$
Here, we prove that the Nash equilibrium of the supervision mechanism satisfies the Pareto optimality condition $x^* = (x_1^*, x_2^*, x_3^*) = (1, 1, 1)$. Suppose players 1 and 2 invest their Pareto-optimal inputs $x_1^* = 1$ and $x_2^* = 1$, respectively, so that the federated payoff is $v(x) = 3 + 3x_3$. If the input of player 3 is $x_3 < 1$, then $v(x) < 6$, and at this time

$$\varphi_3(v(x)) = 0, \quad \pi_3(0, x_3) = 0 - \frac{3}{2}x_3^2 \leq 0,$$

so rational player 3 will not input $x_3 < 1$. If the federated input of player 3 is $x_3 > 1$, then $v(x) > 6$ and

$$\varphi_3(v(x)) = \frac{1}{3}v(x) = 1 + x_3, \quad \pi_3(x_3) = 1 + x_3 - \frac{3}{2}x_3^2.$$

When $x_3 \geq 1$, $\frac{\partial \pi_3(x_3)}{\partial x_3} = 1 - 3x_3 < 0$; thus, $\pi_3(\varphi_3, x_3)$ is monotonically decreasing in $x_3$ on $[1, \infty)$, and the profit of player 3 reaches its maximum at $x_3 = 1$. Therefore, the Nash equilibrium of the supervision mechanism satisfies the Pareto optimality condition $x^* = (x_1^*, x_2^*, x_3^*) = (1, 1, 1)$.
Next, we consider the minimum value of the penalty, $k_i = \varphi_i(v) - \pi_i(x^*)$. If the supervisory authority observes that the federated payoff is greater than or equal to the Pareto-optimal payoff 6, the payoff is allocated to the federated players by Formulas (6) and (7); if it observes that the federated payoff is lower than 6, the payoff of each federated player is 0.5. The specific expression is as follows:

$$\varphi_i(v(x)) = \begin{cases} \frac{1}{3}v(x), & \text{if } v(x) \geq 6, \\ 0.5, & \text{if } v(x) < 6. \end{cases}$$
Again, we prove that the Nash equilibrium of the supervision mechanism satisfies the Pareto optimality condition $x^* = (x_1^*, x_2^*, x_3^*) = (1, 1, 1)$. Suppose players 1 and 2 invest their Pareto-optimal inputs $x_1^* = 1$ and $x_2^* = 1$, respectively, so that the federated payoff is $v(x) = 3 + 3x_3$. If the input of player 3 is $x_3 < 1$, then $v(x) < 6$. At this time,

$$\varphi_3(v(x)) = 0.5, \quad \pi_3(0.5, x_3) = 0.5 - \frac{3}{2}x_3^2 \leq 0.5,$$

so rational player 3 will not input $x_3 < 1$. If the federated input of player 3 is $x_3 \geq 1$, then $v(x) \geq 6$ and

$$\varphi_3(v(x)) = \frac{1}{3}v(x) = 1 + x_3, \quad \pi_3(x_3) = 1 + x_3 - \frac{3}{2}x_3^2.$$
As proved above, $\pi_3(\varphi_3, x_3)$ is monotonically decreasing in $x_3$ on $[1, \infty)$, so the profit of player 3 reaches its maximum at $x_3 = 1$.
Accordingly, when $x_1^* = 1$, $x_2^* = 1$, and player 3 inputs $x_3^* = 1$, the Pareto-optimal value is just reached, and at this point $\pi_3(x_3^*) = 0.5$. Therefore, the Nash equilibrium of the mechanism is reached at the Pareto-optimal input $x^* = (x_1^*, x_2^*, x_3^*) = (1, 1, 1)$.
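As a numeric sanity check of the two supervision variants above (fallback payoffs 0 and 0.5 when $v(x) < 6$), the following sketch scans player 3's profit with $x_1^* = x_2^* = 1$ held fixed; the grid is an illustrative assumption:

```python
import numpy as np

# With x1* = x2* = 1, v(x) = 3 + 3*x3, so the threshold v >= 6 means x3 >= 1.
def profit3(x3, fallback):
    share = (3 + 3 * x3) / 3 if 3 + 3 * x3 >= 6 else fallback   # phi_3 under the rule
    return share - 1.5 * x3 ** 2                                # minus cost c_3(x3)

grid = np.linspace(0, 2, 2001)
for fallback in (0.0, 0.5):
    below = max(profit3(t, fallback) for t in grid if t < 1)
    above = max(profit3(t, fallback) for t in grid if t >= 1)
    print(f"fallback {fallback}: max profit {below:.2f} below x3 = 1, {above:.2f} at x3 = 1")
# fallback 0.0: 0.00 vs 0.50 -> strict incentive to invest x3 = 1
# fallback 0.5: 0.50 vs 0.50 -> weak incentive; x3 = 1 remains a best response
```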

5.2. Numerical Example 2: Unequal Status of Federated Players

Assume that three federated players form a coalition with federated payoff function

$$v(x_1, x_2, x_3) = 2x_1 + 4x_2 + 6x_3 + x_1 x_2.$$

The payoff function $v(x_1, x_2, x_3)$ is strictly monotonically increasing, and the cost functions $c_1(x_1) = \frac{1}{2}x_1^2$, $c_2(x_2) = x_2^2$, and $c_3(x_3) = \frac{1}{4}x_3^2$ of the three players are strictly monotonically increasing convex functions. At the federated input $x^* = (x_1^*, x_2^*, x_3^*)$, the federated profit

$$\max R = v(x_1, x_2, x_3) - \sum_{i=1}^{3} c_i(x_i) = 2x_1 + 4x_2 + 6x_3 + x_1 x_2 - \frac{1}{2}x_1^2 - x_2^2 - \frac{1}{4}x_3^2$$

is maximized and Pareto optimality is satisfied; the first-order conditions are

$$\begin{cases} 2 - x_1^* + x_2^* = 0 \\ 4 + x_1^* - 2x_2^* = 0 \\ 6 - \frac{1}{2}x_3^* = 0. \end{cases}$$

We find that the federated inputs satisfying the Pareto optimality conditions are $x_1^* = 8$, $x_2^* = 6$, and $x_3^* = 12$, the federated payoff is $v(x^*) = 160$, and the maximum profit is $\max R^* = 56$. According to Formulas (6) and (7), since $v(\emptyset) = 0$, $v(\{1\}) = 2x_1$, $v(\{2\}) = 4x_2$, $v(\{3\}) = 6x_3$, $v(\{1,2\}) = x_1 x_2$, $v(\{1,3\}) = v(\{2,3\}) = 0$, and $v(\{1,2,3\}) = 2x_1 + 4x_2 + 6x_3 + x_1 x_2$, we obtain

$$\begin{cases} \varphi_1(v(x)) = \frac{4}{3}x_1 + \frac{2}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2 \\ \varphi_2(v(x)) = \frac{1}{3}x_1 + \frac{8}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2 \\ \varphi_3(v(x)) = \frac{1}{3}x_1 + \frac{2}{3}x_2 + 4x_3. \end{cases}$$
Therefore, the profit functions of the three players are

$$\begin{cases} \pi_1(v, x_1) = \frac{4}{3}x_1 + \frac{2}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2 - \frac{1}{2}x_1^2 \\ \pi_2(v, x_2) = \frac{1}{3}x_1 + \frac{8}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2 - x_2^2 \\ \pi_3(v, x_3) = \frac{1}{3}x_1 + \frac{2}{3}x_2 + 4x_3 - \frac{1}{4}x_3^2. \end{cases}$$
At a Nash equilibrium, each player takes the other players' investments in the FL as given and chooses its own investment to maximize its own profit. The first-order conditions of the Nash equilibrium are therefore

$$\begin{cases} \frac{4}{3} - x_1 + \frac{1}{2}x_2 = 0 \\ \frac{8}{3} + \frac{1}{2}x_1 - 2x_2 = 0 \\ 4 - \frac{1}{2}x_3 = 0. \end{cases}$$

We find that the federated inputs satisfying the Nash equilibrium conditions are $x_1 = 2.29$, $x_2 = 1.90$, and $x_3 = 8$, the federated payoff is $v(x) = 64.53$, and the maximum profit is $\max R = 42.30$.
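As in Example 1, both first-order systems are linear; the following NumPy sketch reproduces the values above and in Table 2 up to rounding (the code prints $v \approx 64.54$, reported as 64.53 in the text):

```python
import numpy as np

v = lambda x: 2*x[0] + 4*x[1] + 6*x[2] + x[0]*x[1]             # federated payoff
R = lambda x: v(x) - (0.5*x[0]**2 + x[1]**2 + 0.25*x[2]**2)    # federated profit

# Pareto FOC: x1 - x2 = 2, -x1 + 2 x2 = 4, 0.5 x3 = 6
x_p = np.linalg.solve([[1., -1., 0.], [-1., 2., 0.], [0., 0., .5]], [2., 4., 6.])
# Nash FOC:   x1 - 0.5 x2 = 4/3, -0.5 x1 + 2 x2 = 8/3, 0.5 x3 = 4
x_n = np.linalg.solve([[1., -.5, 0.], [-.5, 2., 0.], [0., 0., .5]], [4/3, 8/3, 4.])

for name, x in (("Pareto", x_p), ("Nash", x_n)):
    print(f"{name}: x = {np.round(x, 2)}, v = {v(x):.2f}, R = {R(x):.2f}")
# Pareto: x = [ 8.  6. 12.], v = 160.00, R = 56.00
# Nash:   x = [2.29 1.9  8. ], v = 64.54, R = 42.30
```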
Comparing the Pareto optimality and Nash equilibrium solutions in Table 2, we see that the Shapley value method achieves fairness after FL, but there is no prior optimal incentive. Under the Nash equilibrium, each player's input is lower than the Pareto-optimal level, and the federated profit does not reach its maximum.
Next, we introduce the supervisory authority. If the supervisory authority observes that the federated payoff is greater than or equal to the Pareto-optimal payoff 160, the payoff is allocated to the federated players by Formulas (6) and (7); if it observes that the federated payoff is lower than 160, the payoff of federated player $i$ is $\tau_i$. When the inputs of the players reach the Pareto-optimal values, i.e., $x_1^* = 8$, $x_2^* = 6$, and $x_3^* = 12$, then

$$\begin{cases} \varphi_1(v(x^*)) = \frac{4}{3}x_1^* + \frac{2}{3}x_2^* + x_3^* + \frac{1}{2}x_1^* x_2^* = 50.67 \\ \varphi_2(v(x^*)) = \frac{1}{3}x_1^* + \frac{8}{3}x_2^* + x_3^* + \frac{1}{2}x_1^* x_2^* = 54.67 \\ \varphi_3(v(x^*)) = \frac{1}{3}x_1^* + \frac{2}{3}x_2^* + 4x_3^* = 54.67 \end{cases} \qquad \pi_1(x^*) = \pi_2(x^*) = \pi_3(x^*) = 18.67.$$

Therefore, the value ranges of $\tau_1$, $\tau_2$, and $\tau_3$ are $0 \leq \tau_1 \leq 18.67$, $0 \leq \tau_2 \leq 18.67$, and $0 \leq \tau_3 \leq 18.67$, respectively. The payoffs $\varphi_1(v(x))$, $\varphi_2(v(x))$, and $\varphi_3(v(x))$ of players 1, 2, and 3 are as follows:
$$\varphi_1(v(x)) = \begin{cases} \frac{4}{3}x_1 + \frac{2}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2, & \text{if } v(x) \geq 160 \\ \tau_1, & \text{if } v(x) < 160, \end{cases}$$
$$\varphi_2(v(x)) = \begin{cases} \frac{1}{3}x_1 + \frac{8}{3}x_2 + x_3 + \frac{1}{2}x_1 x_2, & \text{if } v(x) \geq 160 \\ \tau_2, & \text{if } v(x) < 160, \end{cases}$$
$$\varphi_3(v(x)) = \begin{cases} \frac{1}{3}x_1 + \frac{2}{3}x_2 + 4x_3, & \text{if } v(x) \geq 160 \\ \tau_3, & \text{if } v(x) < 160. \end{cases}$$
Next, we prove that the Nash equilibrium of the supervision mechanism satisfies the Pareto optimality condition $x^* = (x_1^*, x_2^*, x_3^*) = (8, 6, 12)$.
(1) We consider player 1. When $x_2^* = 6$ and $x_3^* = 12$, then $v(x) = 8x_1 + 96$, so the threshold $v(x) \geq 160$ is equivalent to $x_1 \geq 8$, and

$$\varphi_1(v(x)) = \begin{cases} \frac{13}{3}x_1 + 16, & \text{if } x_1 \geq 8 \\ \tau_1, & \text{if } x_1 < 8, \end{cases}$$

where $0 \leq \tau_1 \leq 18.67$. If the input of player 1 is $x_1 \geq 8$, then $\varphi_1(v(x)) = \frac{13}{3}x_1 + 16$ and $\pi_1(x) = \frac{13}{3}x_1 + 16 - \frac{1}{2}x_1^2$, so $\frac{\partial \pi_1(x_1)}{\partial x_1} = \frac{13}{3} - x_1 < 0$. This indicates that the profit function $\pi_1(x)$ of player 1 is monotonically decreasing on $[8, \infty)$; therefore, by investing $x_1 = 8$, player 1 obtains the maximum profit $\pi_1(x) = 18.67$. If player 1 invests $x_1 < 8$, its payoff is $\tau_1$ and its profit is $\pi_1(x) = \tau_1 - \frac{1}{2}x_1^2 \leq 18.67$. Thus, when $x_2^* = 6$ and $x_3^* = 12$, player 1 obtains the maximum profit $\pi_1(x) = 18.67$ by investing $x_1^* = 8$.
(2) We consider player 2. When $x_1^* = 8$ and $x_3^* = 12$, then $v(x) = 12x_2 + 88$, so the threshold $v(x) \geq 160$ is equivalent to $x_2 \geq 6$, and

$$\varphi_2(v(x)) = \begin{cases} \frac{20}{3}x_2 + \frac{44}{3}, & \text{if } x_2 \geq 6 \\ \tau_2, & \text{if } x_2 < 6, \end{cases}$$

where $0 \leq \tau_2 \leq 18.67$. If the input of player 2 is $x_2 \geq 6$, then $\varphi_2(v(x)) = \frac{20}{3}x_2 + \frac{44}{3}$ and $\pi_2(x) = \frac{20}{3}x_2 + \frac{44}{3} - x_2^2$, so $\frac{\partial \pi_2(x_2)}{\partial x_2} = \frac{20}{3} - 2x_2 < 0$. This indicates that the profit function $\pi_2(x)$ of player 2 is monotonically decreasing on $[6, \infty)$; therefore, by investing $x_2 = 6$, player 2 obtains the maximum profit $\pi_2(x) = 18.67$. If player 2 invests $x_2 < 6$, its payoff is $\tau_2$ and its profit is $\pi_2(x) = \tau_2 - x_2^2 \leq 18.67$. Thus, when $x_1^* = 8$ and $x_3^* = 12$, player 2 obtains the maximum profit $\pi_2(x) = 18.67$ by investing $x_2 = 6$.
(3) We consider player 3. When $x_1^* = 8$ and $x_2^* = 6$, then $v(x) = 6x_3 + 88$, so the threshold $v(x) \geq 160$ is equivalent to $x_3 \geq 12$, and

$$\varphi_3(v(x)) = \begin{cases} 4x_3 + \frac{20}{3}, & \text{if } x_3 \geq 12 \\ \tau_3, & \text{if } x_3 < 12, \end{cases}$$

where $0 \leq \tau_3 \leq 18.67$. If the input of player 3 is $x_3 \geq 12$, then $\varphi_3(v(x)) = 4x_3 + \frac{20}{3}$ and $\pi_3(x) = 4x_3 + \frac{20}{3} - \frac{1}{4}x_3^2$, so $\frac{\partial \pi_3(x_3)}{\partial x_3} = 4 - \frac{1}{2}x_3 < 0$. This indicates that the profit function $\pi_3(x)$ of player 3 is monotonically decreasing on $[12, \infty)$; therefore, by investing $x_3 = 12$, player 3 obtains the maximum profit $\pi_3(x) = 18.67$. If player 3 invests $x_3 < 12$, its payoff is $\tau_3$ and its profit is $\pi_3(x) = \tau_3 - \frac{1}{4}x_3^2 \leq 18.67$. Thus, when $x_1^* = 8$ and $x_2^* = 6$, player 3 obtains the maximum profit $\pi_3(x) = 18.67$ by investing $x_3 = 12$.
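The three threshold arguments above can be checked numerically by holding the other two players at their Pareto inputs; in the sketch below, the fallback $\tau_i = 10$ is an illustrative value chosen strictly inside the admissible range $[0, 18.67]$:

```python
import numpy as np

# (threshold, Shapley share above threshold, cost), one tuple per player.
cases = [
    (8.0,  lambda t: 13/3 * t + 16,   lambda t: 0.5 * t**2),   # player 1
    (6.0,  lambda t: 20/3 * t + 44/3, lambda t: t**2),         # player 2
    (12.0, lambda t: 4 * t + 20/3,    lambda t: 0.25 * t**2),  # player 3
]
tau = 10.0                         # fallback payoff when v(x) < 160
for i, (thr, share, cost) in enumerate(cases, start=1):
    grid = np.linspace(0, 2 * thr, 4001)
    profit = [(share(t) if t >= thr else tau) - cost(t) for t in grid]
    print(f"player {i}: best response = {grid[np.argmax(profit)]:.2f}")
# player 1: best response = 8.00; player 2: 6.00; player 3: 12.00
```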

5.3. Numerical Simulation Experiments

Figure 3 shows the simulations of Numerical Examples 1 and 2, respectively. Subplots (a) and (c) of Figure 3 show that the optimal inputs of the federated players reach Pareto optimality, whereas the inputs satisfying the Nash equilibrium do not. Subplots (b) and (d) show that the payoffs under the Pareto-optimal inputs exceed the federated payoffs under the Nash equilibrium inputs.
As in the literature [17,19], although the Shapley value method can satisfy the fair distribution of payoffs after FL, it cannot achieve Pareto efficiency optimality before FL; i.e., it cannot provide the optimal incentives for federated players' inputs before FL, and neither the individual nor the collective maximum payoff is achieved.
Therefore, under the supervisory organization introduced in this paper, whenever the observed federated payoff is lower than the Pareto-optimal level, the relevant federated players are fined, and the penalty goes to the supervisory organization. In this way, the supervisory organization supervises and disciplines all federated players.
By proving that the inputs satisfying Pareto efficiency optimality constitute a Nash equilibrium of FL under the introduced supervisory organization, the conflict between fairness and efficiency in the payoff allocation of FL can be resolved.

6. Conclusions and Future Work

In the model training of FL, to obtain an accurate federated model, this paper designs an incentive mechanism that encourages all federated players to contribute their data to model training. Under the condition of payoff determination, and combined with the Shapley value method, a federated payoff allocation mechanism with third-party supervision is introduced. Under this mechanism, the federated payoff can reach Pareto optimality, and the federated payoff is then allocated by the Shapley value method. This mechanism resolves the conflict between the fairness and the efficiency of payoff allocation in the FL system. Numerical and simulation experiments verify that when the Pareto-optimal payoff allocation is achieved, the Nash equilibrium of the mechanism is formed. Therefore, the incentive mechanism will serve federated players well.
In future research, we will apply the incentive mechanism proposed in this paper to the problem of payoff allocation among players in FL model training scenarios. In particular, the mechanism can be applied to banks, hospitals, insurance companies, etc., providing important theoretical support for training accurate federated models and improving economic efficiency in practice.

7. Discussion

The Shapley value method measures how much each player contributes to FL and provides a fair rule for allocating resources. Pareto efficiency is an ideal state of resource allocation, in which all resources are fully used and nothing is wasted. Although the Shapley value method can achieve fair payoff allocation after the completion of FL, it cannot guarantee that the inputs of each federated player are optimal before FL.
Therefore, the combination of the Shapley value and Pareto optimality provides a solution for federated payoff distribution that is both fair and efficient, and the solution can help ensure the stability and dynamic equilibrium of the payoff distribution.
However, this combination may have some shortcomings. For example, calculating Shapley values in federated model training can be very complicated when the number of players is large. Furthermore, to determine a Pareto-efficient distribution, the utility functions of all federated players must be available, which may be difficult to implement in practical applications.

Author Contributions

Conceptualization, X.Y. and S.X.; methodology, X.Y., S.X. and C.P.; software, X.Y. and S.X.; validation, X.Y., S.X., C.P. and Z.L.; writing—original draft preparation, X.Y.; writing—review and editing, X.Y., W.T. and Z.L.; visualization, X.Y., C.P. and W.T.; supervision, X.Y., S.X., C.P. and N.W.; project administration, N.W. and Y.Z.; funding acquisition, Z.L. and Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (No. 62272124), the National Key Research and Development Program of China (No. 2022YFB2701401), the Guizhou Science Contract Plat Talent (No. [2020]5017), the Open Fund of Key Laboratory of Advanced Manufacturing Technology, Ministry of Education (grant No. GZUAMT2021KF[01]), the Research Project of Guizhou University for Talent Introduction (No. [2020]61), the Cultivation Project of Guizhou University (No. [2019]56), the Research Project of Guizhou University for Talent Introduction (GDRJHZ[2022]14), and Guizhou Provincial Science and Technology Projects (ZK[2023]YB053).

Data Availability Statement

The datasets used and/or analyzed in the current study are available from the corresponding author on reasonable request.

Acknowledgments

The authors are grateful to the referees for their careful reading of the manuscript and valuable comments. The authors also thank the editor.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

N: The set of n players
D: The set of players' local datasets
M: The model trained jointly by all players
$M_{FED}$: The FL shared model
$M_{SUM}$: The traditional machine learning model
$V_{FED}$: The model accuracy of $M_{FED}$
$V_{SUM}$: The model accuracy of $M_{SUM}$
S: A coalition subset of players, $S \subseteq N$
$v(\cdot)$: The characteristic function
$v(S)$: The players' payoff through coalition S
$v(N)$: The overall federated payoff
$\varphi_i(v)$: The payoff allocated to player i
$|S|$: The number of players in subset S
$(|S|-1)!$: The number of orderings in which player i joins coalition S
$(n-|S|)!$: The number of orderings of the remaining $n-|S|$ players
$S \setminus i$: The coalition after removing player i from coalition S
$v(S) - v(S \setminus i)$: The marginal contribution of i to coalition S
$w(|S|)$: The weight coefficient
$x$: A feasible action space
$\pi(x)$: The federated player's payoff function
$c(x)$: The coalition cost input of a player
$k_i$: The penalty condition for the supervisor to achieve Pareto optimality
$r_i(x)$: The payoff rule applied by the supervisor (with fines)
$R$: The federated profit

Appendix A

Appendix A.1. Proof of Theorem 1

Proof. 
According to the efficiency of the Shapley value, we have

$$\sum_{i=1}^{n} \varphi_i(v) = v. \tag{A1}$$

Differentiating Formula (A1) with respect to $v$, we obtain

$$\sum_{i=1}^{n} \varphi_i'(v) = 1, \tag{A2}$$

where $\varphi_i'(v) = \partial \varphi_i / \partial v$. Turning to the Nash equilibrium, assume the input of each federated player $i$ is $x_i$ and its profit is $\pi_i(\varphi_i, x_i) = \varphi_i(v) - c_i(x_i)$. Player $i$ maximizes its profit:

$$\max \pi_i(\varphi_i, x_i) = \varphi_i(v) - c_i(x_i), \quad i = 1, 2, \ldots, n. \tag{A3}$$

Therefore, the first-order condition of the Nash equilibrium is

$$\varphi_i'(v)\, v_{x_i} = c_i', \quad i = 1, 2, \ldots, n, \tag{A4}$$

where $v_{x_i} = \partial v / \partial x_i$ and $c_i' = \partial c_i / \partial x_i$. To maximize the federated profit, the federated input must satisfy Pareto optimality:

$$x^* = \arg\max_{x}\left[ v(x) - \sum_{i=1}^{n} c_i(x_i) \right], \tag{A5}$$

whose first-order condition is

$$v_{x_i} = c_i', \quad i = 1, 2, \ldots, n. \tag{A6}$$

Combining Formulas (A4) and (A6), the Nash equilibrium achieves Pareto optimality only if

$$\varphi_i'(v) = 1, \quad i = 1, 2, \ldots, n. \tag{A7}$$

However, since Formula (A7) implies $\sum_{i=1}^{n} \varphi_i'(v) = n > 1$ for $n \geq 2$, this contradicts the Shapley value condition $\sum_{i=1}^{n} \varphi_i'(v) = 1$. □

Appendix A.2. Proof of Theorem 2

Proof. 
To make the proof meaningful, we assume that when the federated players achieve Pareto optimality, the federated payoff allocated to each player $i$ by the Shapley value method exceeds its input cost, i.e., $\varphi_i(v^*) > c_i(x_i^*)$. If $x^* = (x_1^*, x_2^*, \ldots, x_n^*)$ is the Nash equilibrium of this mechanism, it must satisfy $\pi_i[v(x_i^*, x_{n-i}^*)] \geq \pi_i[v(x_i, x_{n-i}^*)]$, i.e.,

$$\varphi_i(v^*) - c_i(x_i^*) \geq \varphi_i(v) - k_i - c_i(x_i) \iff k_i \geq [\varphi_i(v) - c_i(x_i)] - [\varphi_i(v^*) - c_i(x_i^*)]. \tag{A8}$$

During the FL process, the coalition input cost $c_i(x_i)$ is not observable and its value is not unique, so Formula (A8) cannot be used directly as a basis for formulating fines. More importantly, $c_i(x_i) \geq 0$, so $\varphi_i(v) - [\varphi_i(v^*) - c_i(x_i^*)] \geq [\varphi_i(v) - c_i(x_i)] - [\varphi_i(v^*) - c_i(x_i^*)]$. Therefore, Formula (A8) is satisfied whenever $k_i \geq \varphi_i(v) - [\varphi_i(v^*) - c_i(x_i^*)]$, so that the Pareto-optimal value is achieved. The penalty condition for the supervisor to achieve Pareto optimality is thus

$$k_i \geq \varphi_i(v) - [\varphi_i(v^*) - c_i(x_i^*)]. \tag{A9}$$

However, to preserve the players' enthusiasm, the fines should not be too high, following the principles of limited participation and limited liability. Here, we assume that all players in federated learning bear only limited liability: the penalty cannot exceed the player's payoff, so a player with zero payoff incurs no penalty. Therefore, from Formula (8), $\varphi_i(v) - k_i \geq 0$, i.e.,

$$k_i \leq \varphi_i(v). \tag{A10}$$

Combining Formulas (A9) and (A10), we obtain

$$\varphi_i(v) - [\varphi_i(v^*) - c_i(x_i^*)] \leq k_i \leq \varphi_i(v). \tag{A11}$$

Let $\delta_i = \varphi_i(v) - k_i$ denote the net payoff of player $i$ after being fined; then, from Formula (A11),

$$0 \leq \delta_i \leq \varphi_i(v^*) - c_i(x_i^*). \tag{A12}$$

According to the above, under the limited-liability constraint, the Pareto value is optimized. When the supervisor's penalty satisfies Formula (A11), the optimal mechanism is

$$r_i(x) = \begin{cases} \varphi_i(v), & \text{if } v \geq v(x^*) \\ \delta_i, & \text{if } v < v(x^*), \end{cases} \tag{A13}$$

where the value of $\delta_i$ satisfies Formula (A12), $i = 1, 2, \ldots, n$. Next, we explain the mechanism at the boundary value of the penalty in Formula (A12).

When $k_i = \varphi_i(v)$, i.e., $\delta_i = 0$, the supervisor distributes the federated payoff to all players according to the Shapley value formula whenever the federated payoff is greater than or equal to the Pareto-optimal payoff; if the federated payoff is less than the Pareto-optimal payoff, all federated payoffs belong to the supervisor. The expression is

$$r_i(x) = \begin{cases} \varphi_i(v), & \text{if } v \geq v(x^*) \\ 0, & \text{if } v < v(x^*). \end{cases} \tag{A14}$$

Furthermore, we prove that the federated input $x^* = (x_1^*, x_2^*, \ldots, x_n^*)$ satisfying Pareto optimality is the Nash equilibrium of this mechanism.

Assume the federated input of player $i$ is $x_i < x_i^*$ and the federated input of the other players is $x_{n-i}^*$. Because $v(x)$ is monotonically increasing, $v(x_i, x_{n-i}^*) < v(x_i^*, x_{n-i}^*)$, so $r_i(x_i, x_{n-i}^*) = 0$, and the profit of player $i$ is $\pi_i(x_i) = -c_i(x_i) \leq 0$; therefore, rational player $i$ will not invest $x_i < x_i^*$. If the federated input of player $i$ is $x_i \geq x_i^*$, then, because $v(x)$ is monotonically increasing, the profit of player $i$ is $\pi_i(x_i) = \varphi_i(x_i) - c_i(x_i) > 0$; therefore, rational player $i$ will invest $x_i \geq x_i^*$. □

References

1. Konečný, J.; McMahan, H.B.; Ramage, D.; Richtárik, P. Federated optimization: Distributed machine learning for on-device intelligence. arXiv 2016, arXiv:1610.02527.
2. McMahan, H.B.; Moore, E.; Ramage, D.; y Arcas, B.A. Federated learning of deep networks using model averaging. arXiv 2016, arXiv:1602.05629.
3. Konečný, J.; McMahan, H.B.; Yu, F.X.; Richtárik, P.; Suresh, A.T.; Bacon, D. Federated learning: Strategies for improving communication efficiency. arXiv 2016, arXiv:1610.05492.
4. Yang, Q.; Liu, Y.; Chen, T.; Tong, Y. Federated machine learning: Concept and applications. ACM Trans. Intell. Syst. Technol. (TIST) 2019, 10, 1–19.
5. Kairouz, P.; McMahan, H.B.; Avent, B.; Bellet, A.; Bennis, M.; Bhagoji, A.N.; Bonawitz, K.; Charles, Z.; Cormode, G.; Cummings, R.; et al. Advances and open problems in federated learning. Found. Trends Mach. Learn. 2021, 14, 1–210.
6. Sarikaya, Y.; Ercetin, O. Motivating workers in federated learning: A Stackelberg game perspective. IEEE Netw. Lett. 2019, 2, 23–27.
7. Zhan, Y.; Li, P.; Qu, Z.; Zeng, D.; Guo, S. A learning-based incentive mechanism for federated learning. IEEE Internet Things J. 2020, 7, 6360–6368.
8. Kang, J.; Xiong, Z.; Niyato, D.; Xie, S.; Zhang, J. Incentive mechanism for reliable federated learning: A joint optimization approach to combining reputation and contract theory. IEEE Internet Things J. 2019, 6, 10700–10714.
9. Tran, N.H.; Bao, W.; Zomaya, A.; Nguyen, M.N.; Hong, C.S. Federated learning over wireless networks: Optimization model design and analysis. In Proceedings of the IEEE INFOCOM 2019-IEEE Conference on Computer Communications, Paris, France, 29 April–2 May 2019; pp. 1387–1395.
10. Yu, H.; Liu, Z.; Liu, Y.; Chen, T.; Cong, M.; Weng, X.; Niyato, D.; Yang, Q. A fairness-aware incentive scheme for federated learning. In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, New York, NY, USA, 7–8 February 2020; pp. 393–399.
11. Li, T.; Sanjabi, M.; Beirami, A.; Smith, V. Fair resource allocation in federated learning. arXiv 2019, arXiv:1905.10497.
12. Holmstrom, B. Moral hazard in teams. Bell J. Econ. 1982, 13, 324–340.
13. Kim, H.; Park, J.; Bennis, M.; Kim, S.L. Blockchained on-device federated learning. IEEE Commun. Lett. 2019, 24, 1279–1283.
14. Feng, S.; Niyato, D.; Wang, P.; Kim, D.I.; Liang, Y.C. Joint service pricing and cooperative relay communication for federated learning. In Proceedings of the 2019 International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), Atlanta, GA, USA, 14–17 July 2019; pp. 815–820.
15. Yang, X. An exterior point method for computing points that satisfy second-order necessary conditions for a C1,1 optimization problem. J. Math. Anal. Appl. 1994, 187, 118–133.
16. Wang, Z.; Hu, Q.; Li, R.; Xu, M.; Xiong, Z. Incentive mechanism design for joint resource allocation in blockchain-based federated learning. IEEE Trans. Parallel Distrib. Syst. 2023, 34, 1536–1547.
17. Song, T.; Tong, Y.; Wei, S. Profit allocation for federated learning. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 2577–2586.
18. Wang, T.; Rausch, J.; Zhang, C.; Jia, R.; Song, D. A principled approach to data valuation for federated learning. In Federated Learning: Privacy and Incentive; Springer: Cham, Switzerland, 2020; pp. 153–167.
19. Wang, G.; Dang, C.X.; Zhou, Z. Measure contribution of participants in federated learning. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 2597–2604.
20. Liu, Y.; Ai, Z.; Sun, S.; Zhang, S.; Liu, Z.; Yu, H. FedCoin: A peer-to-peer payment system for federated learning. In Federated Learning: Privacy and Incentive; Springer: Berlin/Heidelberg, Germany, 2020; pp. 125–138.
21. Zeng, R.; Zeng, C.; Wang, X.; Li, B.; Chu, X. A comprehensive survey of incentive mechanism for federated learning. arXiv 2021, arXiv:2106.15406.
22. Liu, Z.; Chen, Y.; Yu, H.; Liu, Y.; Cui, L. GTG-Shapley: Efficient and accurate participant contribution evaluation in federated learning. ACM Trans. Intell. Syst. Technol. (TIST) 2022, 13, 1–21.
23. Nagalapatti, L.; Narayanam, R. Game of gradients: Mitigating irrelevant clients in federated learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; Volume 35, pp. 9046–9054.
24. Fan, Z.; Fang, H.; Zhou, Z.; Pei, J.; Friedlander, M.P.; Liu, C.; Zhang, Y. Improving fairness for data valuation in horizontal federated learning. In Proceedings of the 2022 IEEE 38th International Conference on Data Engineering (ICDE), Kuala Lumpur, Malaysia, 9–12 May 2022; pp. 2440–2453.
25. Fan, Z.; Fang, H.; Zhou, Z.; Pei, J.; Friedlander, M.P.; Zhang, Y. Fair and efficient contribution valuation for vertical federated learning. arXiv 2022, arXiv:2201.02658.
26. Yang, X.; Tan, W.; Peng, C.; Xiang, S.; Niu, K. Federated learning incentive mechanism design via enhanced Shapley value method. Wirel. Commun. Mob. Comput. 2022, 2022, 9690657.
27. McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B.A. Communication-efficient learning of deep networks from decentralized data. In Proceedings of Artificial Intelligence and Statistics (AISTATS), PMLR, Fort Lauderdale, FL, USA, 20–22 April 2017; pp. 1273–1282.
28. Shapley, L.S. A value for n-person games. Contrib. Theory Games 1953, 2, 307–317.
29. Zhou, Z.H.; Yu, Y.; Qian, C. Evolutionary Learning: Advances in Theories and Algorithms; Springer: Berlin/Heidelberg, Germany, 2019.
30. Pardalos, P.M.; Migdalas, A.; Pitsoulis, L. Pareto Optimality, Game Theory and Equilibria; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2008; Volume 17.
31. Nash, J. Non-cooperative games. Ann. Math. 1951, 54, 286–295.
32. Ye, M.; Hu, G. Distributed Nash equilibrium seeking by a consensus based approach. IEEE Trans. Autom. Control 2017, 62, 4811–4818.
33. Alchian, A.A.; Demsetz, H. Production, information costs, and economic organization. Am. Econ. Rev. 1972, 62, 777–795.
Figure 1. Federated learning framework.
Figure 2. Federated learning incentive model.
Figure 3. Federated player's input and payoff comparison. (a) Example 1: federated player's input. (b) Example 1: federated player's payoff. (c) Example 2: federated player's input. (d) Example 2: federated player's payoff.
Table 1. Example 1: Federated input and profit comparison.

                      Input x1   Input x2   Input x3   Federated Payoff   Maximum Profit
Pareto optimality     1          1          1          6                  1.5
Nash equilibrium      0.14       0.14       0.14       0.49               0.4
Table 2. Example 2: Federated input and profit comparison.

                      Input x1   Input x2   Input x3   Federated Payoff   Maximum Profit
Pareto optimality     8          6          12         160                56
Nash equilibrium      2.29       1.90       8          64.53              42.30