1. Introduction
The communication for omniscience (CO) problem is formulated in [
1]. It is assumed that there are a finite number of users in a system that are indexed by the set 
V. Each user 
 observes a distinct component 
 of a discrete multiple random source 
 in private. The users are allowed to exchange their observations over public authenticated broadcast channels so as to attain omniscience, the state where each user recovers the observation sequence of the entire source 
. Originally, the CO problem was studied in [
1] due to its dual relationship with the multi-terminal secret capacity (Theorem 1 in [
1]). The interactive data exchange process was also studied in other source coding scenarios, e.g., the interactive function computation problem [
2,
3,
4] (Refs. [
2,
3,
4] studied the two-terminal 
 lossy source coding problem, where each user communicates with the other to compute a function. The aim is to characterize the sum-rate-distortion region. This paper considers lossless source coding, which, for 
, corresponds to zero-distortion case (see Theorem 2 in [
3], Section IV-B in [
4]), but the focus is to find how to distribute the sum-rate evenly among more than two users). More recently, CO was also cast into the 
coded cooperative data exchange (CCDE) problem [
5,
6,
7], in which the users are mobile clients broadcasting linear combinations of packets over noiseless peer-to-peer (P2P) wireless channels and the communication rates are restricted to being integral.
One main optimization problem that arises in CO is how to minimize the overall source coding rate to attain omniscience. We call it the 
minimum sum-rate problem and denote the value of the minimum sum-rate by 
. By utilizing submodular function minimization (SFM) techniques, the value of 
, along with an optimal rate vector, are determined in 
 time in [
8] for the asymptotic model, where the communication rates are real-valued (In an asymptotic model, the observation sequence is assumed to be infinitely long. The CCDE corresponds to the finite linear source model, an example of the non-asymptotic model. In the non-asymptotic model, each user only obtains a finite length of observations, and the broadcasts are integer numbers of linear combinations of observations (Section II in [
8])), and in [
9,
10] for CCDE. Here, 
 is the complexity of a SFM algorithm and is polynomial (Chapter VI in [
11]). A more efficient algorithm can be found in [
12] by simulating the communications based on the random linear network coding scheme [
13,
14]. The complexity was further reduced to 
 in [
15].
While [
8,
9,
10,
12] only determined one optimal rate vector, it is shown in (Section III-B in [
8]) that the optimal rate region is not a singleton in general. Thus, it is natural to consider how to choose an optimal rate vector that also attains fairness, particularly when the intention is to promote the mobile clients’ cooperation in CCDE or even out the battery usage in a wireless sensor network (WSN). The problem of how to attain fairness has been previously considered in [
16,
17] for CCDE. In [
17], a multi-layer acyclic graph is proposed, based on which, a constrained quadratic programming was formulated to determine the Jain’s fairness solution [
18]. The algorithm proposed in [
16] is a greedy approach, where, in each iteration, a unit rate is assigned to the user that optimizes a fairness measure, so that the resulting solution converges on a fair and integer-valued optimal rate vector (The fair solutions in [
16,
17] coincide with the egalitarian solution [
19] in coalitional game theory due to the equivalence between the submodular base polyhedron and the optimal rate region (Section III-B in [
8]), both of which, as will be shown in 
Section 3 in this paper, coincide with the core of a coalitional game).
However, neither of them applies to systems where the communication rates are non-integral, e.g., the asymptotic model, or where packet splitting (and hence fractional transmission rates) is allowed in CCDE. The main purpose of this paper is to study how to attain fairness in the optimal rate region for the CO problem, where the broadcast rates are not constrained to be integer-valued. We start the study by showing the equivalence between the optimal rate region and the core (the solution set) of a coalitional game. We consider two fair solutions proposed in coalitional game theory: the Shapley value [
20] and the egalitarian solution [
19]. We propose a steepest descent algorithm (SDA) for searching a fractional egalitarian solution that can be implemented by packet splitting in CCDE. Finally, we show that the game can be decomposed by the fundamental partition 
 into subgames (The fundamental partition 
 is an optimizer that determines the minimum sum-rate 
 [
8]. See also 
Section 2.1), each of which can attain fairness, either being the Shapley value or the egalitarian solution, on its own. This decomposition leads to a distributed computation method for fairness and reduces the complexity.
  1.1. Summary of Main Results
Our main results are summarized as follows:
(1) We formulate the problem of attaining omniscience with the minimum sum-rate 
 by a coalitional game model, where the characteristic cost function, denoted by 
 for all user groups 
, quantifies the remaining randomness in 
 given the common randomness 
 shared by all users in 
V (The game model is closely related to the dual relationship (Theorem 1 in [
1,
21]): 
, where 
 is the entropy of 
 and 
 is the common randomness that is shared by all the users in 
V [
22,
23]. The interpretation is that attaining omniscience by the minimum sum-rate 
 is equivalent to determining how to let the users encode the remaining randomness in 
 given the common randomness 
). That is, 
 specifies the source coding rate/cost upper bound to each user subset 
. We show that (a) the core of the game coincides with the optimal rate region containing all the solutions to the minimum sum-rate problem, and (b) the game describes the users’ cooperation when they jointly encode the remaining randomness in 
 to reach omniscience with the sum-rate exactly equal to 
.
(2) We show that the Shapley value [
20] assigns each user the expected marginal cost 
 over all 
. This value is fair in that it penalizes each user according to the source coding rate he/she incurred in CO. While the complexity of obtaining the exact Shapley value is exponentially growing in the number of users 
, we show that the Shapley value is the mean over all extreme points in the core. By randomly generating an extreme point set of a desired size, an approximation of the Shapley value can be obtained in polynomial time.
(3) The egalitarian solution [
19] aims to equalize the rate/cost allocation in the optimal rate region regardless of the marginal costs. We show that this solution is more suitable for those systems with equally privileged users, e.g., CCDE and WSN. We propose a steepest descent algorithm (SDA) for searching a fractional egalitarian solution that can be implemented in CCDE by splitting each packet into 
 chunks, where 
 denotes the number of user subsets in the fundamental partition 
. Based on an optimality criterion for the egalitarian solution stating that the local optimum implies the global optimum, we show that the estimation sequence generated by the SDA converges in the fractional egalitarian solution in 
 time, where 
 is the maximum 
-norm over all pairs of points in the optimal rate region. In addition, the steepest direction in each iteration of SDA can be computed in a distributed manner.
(4) The game is decomposable such that the users in each subset  form a subgame with the characteristic cost function  for all . This is due to the mutual independence between  and  for any two distinct subsets  given the common randomness . To attain fairness, it suffices to let the users within each subgame  decide how to allocate the source coding rates fairly, which not only reduces complexity, but also allows parallel computation.
  1.2. Organization
The rest of the paper is organized as follows. The system model is described in 
Section 2, where we also review existing results on the minimum sum-rate problem. In 
Section 3, we formulate the coalitional game model and show that it can be decomposed by the fundamental partition 
. In 
Section 4, we show how to attain fairness in the optimal rate region by the Shapley value and discuss how to approximate it to avoid the exponentially growing complexity. In 
Section 5, we propose the SDA algorithm for searching the fractional egalitarian solution. In both 
Section 4 and 
Section 5, we also present methods to obtain the Shapley value and egalitarian solution by the decomposition method.
  3. Decomposable Coalitional Game
We formulate a coalitional game model (The coalitional game was first formulated to propose the Shapley value as a fair rate allocation for CO. This paper introduces the decomposition property and focuses on the mutual dependence, cooperation among the users and distributed computation of fair solutions) in this section and show the equivalence of the optimal rate region 
 and the core of this game. The purpose is to introduce two game-theoretic solutions, the Shapley value and egalitarian solution in 
Section 4 and 
Section 5, respectively, for attaining fairness in 
. We also show the decomposition of this game model, a property that will be utilized in 
Section 4 and 
Section 5 to propose a decomposition method for obtaining the Shapley value and egalitarian solution, respectively.
  3.1. Coalition Game Model
Let the users in 
V be self-autonomous decision makers that take part in the CO, and assume that, instead of being selfish, they may cooperate with others to form groups. We call 
 a 
coalition and 
V the 
grand coalition. Consider the function 
. Here, 
 equals the common randomness 
 in 
 that is shared by all users in 
V due to the dual relationship (Theorem 1 in [
1]) [
21]
        
Here, 
 is called the multivariate mutual information in [
23], or shared information in [
22]. Assume that 
 is obtained by a random variable 
U, which does not need to be broadcast over the public channels. Then, the problem is how to encode the remaining randomness in 
 given 
U for all 
 that is measured by the Dilworth truncation [
23]
        
We call  the characteristic cost function in that  specifies the upper bound on the (source) coding cost when the users in X form a coalition so as to jointly encode the randomness in  given U. The coalitional game model is characterized by the user set V and the characteristic cost function . We denote it by . In this sense, the game  formulates a multi-terminal data compression problem where the users jointly encode the remaining randomness in  that is specified by the set function .
Example 2. For the 5-user system in Example 1, the common randomness  is obtained by the random variable U. For users 1 and 2, we havebeing the remaining randomness in  given U. The interpretation is that, in order to attain omniscience with sum-rate , the rate for users 1 and 2 to jointly encode their observations is no more than 2 bits. Alternatively, the maximum cost incurred by users 1 and 2 cooperating with each other is 2 bits of coding rate. One can show that (7) holds for all  (An explanation of (7) can be found in (Section IV-B in [23])).    3.2. Core
While 
 quantifies the maximum coding cost in each coalition, each 
 denotes a cost allocation method, with each 
 being the source coding rate assigned to user 
. The solution set of the game 
 is called the 
core [
30,
31] which contains all 
s distributing exactly the total cost 
 to individual users such that 
 holds for all coalitions 
. It is not difficult to see from (
4) that the core coincides with the optimal rate region 
, which is nonempty (Theorem 4 in [
8]) (The nonemptiness of the core 
 can also be explained by the submodularity of 
. See 
Appendix B). In the rest of the paper, we will refer to 
 as the core or the optimal rate region interchangeably.
The inequality 
 in the core 
 also has an interpretation in coalitional game theory. If a cost allocation method 
 results in 
 for some 
X, the users in 
X may break the coalition 
X and seek another 
 such that 
. This means the coalition 
X is not stable (This can also be explained by the definition of stability (Section 4.3 in [
32]) and the fact that the core is a stable set in (Theorem 8 in [
32])). On the other hand, if 
 holds for all 
, then no user has the incentive to break the coalition 
V and form a smaller one, i.e., the grand coalition 
V forms. In this sense, the core contains all cost allocation methods 
 that exactly distribute the sum-cost 
 to all users in a way such that all of them would like to cooperate with others for the purpose of attaining omniscience (Chapter 12 in [
30]).
  3.3. Decomposition
For any  such that , let ⊔ denote the disjoint union and  be the direct sum of  and . For example, for  and , . For , let  be the characteristic vector of the subset X such that  if  and  if .
For the fundamental partition 
, each 
 defines a subgame 
 with the characteristic cost function 
 for all 
. The core of the subgame 
 is
        
        where the polyhedron 
 is a 
reduction/projection of 
 on to 
C. The following lemma shows the decomposition property of the game 
.
Lemma 1 (Theorem 38 and Lemma 39 in [
8])
. The game  can be decomposed by the fundamental partition  so that- (a) 
- the dimension of  is  and 
- (b) 
- The following holds for any : - (i) 
- For any  such that , , for all ,  and ; 
- (ii) 
- For all ,  for some  and . 
 
 The decomposition of the core 
 in Lemma 1(a) interprets the decomposition of the solution set of 
 and the fact that it makes no difference for the users to cooperate in the grand coalition 
V or in subgames 
 (This fact can be seen more clearly via the definition of the decomposable game in 
Appendix B). Lemma 1(b) states that the costs, or source coding rates, can be exchanged within a subgame, but not between subgames, which can be explained by the dependence relationship in the remaining randomness as follows.
  Interpretation
Recall that 
. Due to the fact that 
 is the finest minimizer of (
5), we have ((
8a) holds because 
 is the minimizer of the Dilworth truncation; the strict inequality (
8a) holds because otherwise 
 is not the finest minimizer)
          
Here, (
8a) means that given the common randomness, any two distinct coalitions 
C and 
 in 
 have 
 and 
 mutually independent. That is, to attain omniscience with the minimum sum-rate 
, the users in 
C and 
 must encode the exact randomness 
 and 
, respectively. In other words, the costs or the source coding rates cannot transfer between any two users 
 and 
. This is the interpretation of Lemma 1(b)-(i) and we call it 
zero exchange rate between 
i and 
j. On the other hand, (8b) states that, given the common randomness, any two users 
i and 
j in the same coalition 
C are mutually dependent. In this case, the information amount 
 that is mutual to 
X and 
 can be encoded by either 
 or 
, i.e., the costs or source coding rates can be transferred between users 
i and 
j: they have 
nonzero exchange rate.
Example 3. For the 5-user system in Example 1, we have the fundamental partition . The core  has the dimension of  and is decomposed aswhere , as shown in Figure 2, is a 2-dimensional plane and  and  are singletons containing single points  and , respectively. Given the common randomness  that is obtained by U, any two distinct  are independent, e.g.,for any , any two disjoint  such that  are mutually dependent; e.g.,i.e., in the fundamental partition , we have zero exchange rate between coalitions and nonzero exchange rate within a coalition.  The decomposition property in Lemma 1 is useful when considering the fairness. Since there is no freedom for the users who belong to distinct coalitions in 
 to negotiate how to allocate coding costs fairly, it suffices to just study how to attain fairness within each 
. This will be further summarized in Theorem 1 in 
Section 4 and Theorem 3 in 
Section 5 that allow distributed computation for attaining the two fair solutions, the Shapley value and egalitarian solution, in the optimal rate region 
.
  4. Shapley Value
For an omniscience-achievable rate vector , it is worth discussing how fairly it can distribute the source coding rates. In the game model , fairness is also an important performance metric of a cost allocation method  in that it promotes the users incentives to cooperate with each other. In this section, we discuss how to attain fairness by searching the Shapley value in the optimal rate region .
The Shapley value 
 is defined in (Theorem 7 in [
20]) as a unique solution in the core 
, with each dimension being
      
Here,  is the remaining uniqueness in  given the  and the common randomness in U. The interpretation is that, to attain the omniscience by the minimum sum-rate , if the users in X encode at the rate  first, user i needs to encode at the rate .
In the game model 
, 
 is the marginal coding cost incurred by user 
i when he/she joins the coalition 
X. Let 
 such that 
 and 
 for all 
 be a 
permutation of 
V. Here, each 
 denotes the order that the users join the grand coalition 
V, for which, the total cost 
 can be assigned to individual users by the Edmond greedy algorithm [
33]: For 
i increasing from 1 to 
, we assign each user the marginal cost
      
      where 
 and 
 for all 
. The resulting 
 satisfies 
. The Shapley value 
 is based on the assumption that all the permutations are equiprobable. For each 
, user 
i will be assigned the marginal coding cost 
 for 
 out of 
 times. Then, 
 assigns each user the expected marginal coding cost he/she incurs over all permutations.
  4.1. Decomposition
The fairness of 
 can also be explained by its relationship with the extreme points in the core 
. Let 
 be the 
extreme point set containing all vertices of the core 
. For a particular permutation 
, the optimal rate vector returned by the Edmond greedy algorithm is an extreme point of 
 and 
 can be constructed by applying the Edmond greedy algorithm for all 
 permutations of 
V (Section 3.2 in [
11]). Based on the definition (
9), the Shapley value is the mean value of 
 [
20] (In this sense, the Shapley value is the gravity center of 
 [
20]):
Since the core  is decomposed by the fundamental partition  (Lemma 1(a)), we have the extreme point set also decomposed as , which leads to the decomposition of the Shapley value in Theorem 1 below (Theorem 1 is a special case of Theorem 1 when the minimum sum-rate ).
Theorem 1. For the Shapley value  in the core , we have  where  is the Shapley value in the core  of the subgame .
 Proof.  For the fundamental partition 
, since 
, we have
          
Theorem holds.    □
 Example 4. In the core  of the 5-user system in Example 1, the Shapley value by the definition (9) is . We have four extreme points insuch that . Recall that we have the fundamental partition  that decomposes the game  as in Example 2. According to Theorem 1, we havewhere  is the Shapley value of the subgame  as shown in Figure 3,  and .    4.2. Complexity and Approximation
The complexity of computing the Shapley value is exponentially large in the problem size 
, since the values of 
 for all 
 are required to be calculated to get 
 in (
9). What makes the situation worse is that determining the value of the Dilworth truncation 
 for a given 
X requires calling SFM algorithms and their complexity is 
. Therefore, it is impractical to obtain the exact value of 
 in large systems.
One alternative approach is to utilize the decomposition property in Theorem 1 to allow distributed and parallel computation. For each coalition C in the fundamental partition , let the users in C obtain the Shapley value  in the subgame  by themselves; All  are combined to form the Shapley value  of the entire game . By doing so, the complexity is determined by the subgame of maximum size . However, the complexity to obtain the Shapley value  in the subgame  is again exponentially growing in .
While the high computational complexity is an intrinsic problem of the Shapley value, there are various approximation algorithms proposed in the literature to alleviate this complexity problem. For example, the random permutation method in [
34] utilizes the fact that the Shapley value is the mean value over the extreme point set in (
10). The idea is to randomly generate a set of permutations of 
V of a desired size, e.g., 
 or 
 permutations, and apply the Edmond greedy algorithm to determine the corresponding extreme points, the mean of which is an approximation of the Shapley value 
. This approximation method can also be used in combination with the decomposition method in Theorem 1.
Example 5. For the 5-user system in Example 1, we first decompose the game into subgames ,  and . For the subgame , we randomly select  permutations. For example, for ,  and , we can generate three extreme points:respectively, so that the mean value  is an approximation of the Shapley value  in . Note, different permutations might result in different approximations. For example, if we choose three permutations ,  and , we would have the approximation . See the two approximations in Figure 3. By combining the approximation of  with the ones obtained in other subgames, we have the approximation of the Shapley value  of the game . For example, the above two approximations generate  and , which are the two approximations to .
 In Example 5, we chose no more than 
 permutations for each subgame 
, where the extreme point corresponding to each permutation can be determined by Algorithm 3 in [
8] (The algorithm (Algorithm 3 in [
8]) can be considered as a modified Edmond greedy algorithm. See Appendix B in [
8] for the explanation. In 
Figure 3, the path towards the extreme point 
 is generated by Algorithm 3 in [
8] for the permutation 
) in 
 time. Therefore, the overall complexity for approximating the Shapley value 
 is determined by the subgame 
 of maximum size as polynomial time 
. Accordingly, if we choose 
 permutations for each subgame 
, the complexity would be 
. We also remark that the approximation algorithm is not unique. In fact, there are many other existing methods, e.g., [
34,
35,
36], that can be implemented to approximate the Shapley value 
.
  5. Egalitarian Solution
The Shapely value 
 is fair in that it penalizes each user based on the expected marginal cost he/she incurs in game 
. For example, in the 5-user system in 
Figure 1, user 4 incurs the most expected marginal cost 
 over all 
, the Shapley value 
 in Example 4 assigns him/her the most coding cost.
However, this fairness suggested by the Shapley value might not be the desired one in some practical systems. For example, in CCDE where mobile clients are considered as equally privileged peers, it is desirable to find a 
 that allocates the source coding rate as evenly as possible without considering users’ prior knowledge of the source 
. Another example is a WSN containing a large number of battery-powered sensors with equal initial energy budget, where the even allocation of the source coding rate prolongs the overall lifetime of the WSN. In these cases, it might be more suitable to consider the 
egalitarian solution  [
19,
28], the minimizer of 
. In this section, we consider a more general quadratic programming [
37,
38]:
      where 
 and 
 is a positive weight vector which could have some interpretations in practical scenarios. For example, 
 could denote the quality of the wireless transmission of user 
i in CCDE or the remaining battery energy of sensor node 
i in a WSN.
It is shown [
39,
40] that, if the function value of 
 can be obtained directly, problem (
11) can be solved in 
 time (In the case when 
, the minimizer of (
11) is also called the minimum-norm point in 
, which can be searched by the algorithm in [
41] by polynomial time calls of the Dilworth truncation 
). However, determining the Dilworth truncation 
 for a given 
X has the complexity 
. In addition, the minimizer of (
11) may not be fractional or, if it is fractional, may require splitting each packet into more than 
 chunks in CCDE. Since 
 and it is shown in (Corollary 28 in [
8]) that there exists an optimal rate vector in 
 with an LCM 
, it would be of interest to see if we can find a fair optimal rate vector in 
 still with LCM 
.
Example 6. Consider the minimizer  of (11) for the 5-user system in Example 1, we have  for  and  for . While the former can be implemented by 2-packet-splitting, the latter requires dividing each packets into 10 chunks.  In fact, not only the minimizer of (
11), but also the Shapley value have the problem of incurring more than 
-packet-splitting. For example, the Shapley value 
 in Example 4 requires 4-packet-splitting, where 
, and its approximation 
 in Example 5 even requires 30-packet-splitting. Such dividing and reconstructing of packets could be cumbersome or even very impractical. In the next subsection, we consider how to search for an egalitarian solution in 
 that can be implemented by 
-packet-splitting.
  5.1. Steepest Descent Algorithm
For 
, let 
 be the set containing all rational numbers that are divisible by 
K. Consider the problem
        
The purpose is to search for a fractional egalitarian solution 
 with an LCM 
. The objective function in (
12) is a separable convex function, for which local optimality w.r.t. the 
elementary exchange  implies the global optimality. See Lemma 2 below. Here, 
 denotes the cost/rate exchange between users 
i and 
j in the game 
 (The optimization criterion in Lemma 2 is related to the discrete convexity: The problem in (
11) exhibits 
M-convexity on the real number set (Section 1.4.2 in [
42]), which also leads to the 
M-convexity on the fractional number set of (
12). This is essentially due to the 
M-convexity of a submodular base polyhedron (Theorem 4.12 and Proposition 4.13 in [
42]). See also 
Appendix A for the definition of the submodular base polyhedron).
Lemma 2. In CCDE,  is the minimizer of (12) if and only if, for all  and positive integer  such that ,where .  Proof.  The proof is based on a necessary and sufficient condition for the minimizer of (
11) for any convex function 
g in (Theorem 20.3 in [
11]): 
 is the minimizer of (
11) if and only if, for all 
 and positive integer 
 such that 
, 
. In CCDE, the entropy function 
H is integer-valued and 
 is fractional with denominator 
 so that the value of 
 has the denominator 
 for all 
. Furthermore, all extreme points in 
 have the LCM 
 (Corollary 10 in [
8]). Therefore, for any 
, if 
, then 
. So, Lemma 2 is the result of Theorem 20.3 in [
11] on the set 
.    □
 | Algorithm 1: Steepest descent algorithm (SDA). | 
| input:  a positive integer  and an initial point output:  , the minimizer of (12 ) | 
| ![Entropy 24 00109 i001 Entropy 24 00109 i001]() | 
Lemma 2 directly suggests the steepest descent algorithm (SDA) in Algorithm 1 (The SDA algorithm is also based on a discrete convex minimization algorithm in (Section 10.1.1 in [
42]), which has been adopted in (Algorithm 1) for determining an integer-valued egalitarian solution for CCDE. The difference is that we use a dependence function 
 to search the steepest descent direction, which is more efficient than the brute-force search in (Algorithm 1). Furthermore, note that (Algorithm 1) only determines a real-valued egalitarian solution, which may be unable to be implemented in some practical systems, e.g., CCDE).
Furthermore, note that, as an input to the SDA, the initial point 
 can be searched by the MDA algorithm at the same time when the minimum sum-rate problem is solved (Corollary 28(a) in [
8]). The optimality of the SDA algorithm is stated below.
Theorem 2. For CCDE, the SDA algorithm generates an estimation sequence  that converges on the minimizer  of (12).  Proof.  Consider the recursive process
          
          where 
. This is a steepest descent approach: in each iteration 
n, we move from the current estimation 
 in the steepest elementary exchange 
 by a constant step size 
. Based on Lemma 2, starting with any initial 
, the minimum of (
12) is reached when this recursion converges, i.e., when 
.
For 
, consider the 
dependence function (Sections 2.2 and 2.3, Equations (2.14), (2.15), (2.18) and (2.19) in [
11])
          
The last equality (
14) states that 
 is the minimal minimizer of 
 (The last equality (
14) is shown in (Equations (2.14) and (2.15) in [
11]) due to the min-max theorem (Corollary 3.4 in [
11]). The minimizers of 
 form a set lattice and the smallest/minimal is the intersection of all minimizers. See  Sections 2.2  and 2.3 in [
11] for details). A trivial case is that 
. Based on (
14), we have 
 for all 
. So, for all iterations 
n of the recursion above, 
 and
          
Therefore, theorem holds.    □
 Remark 1. According to the proofs of Lemma 2 and Theorem 2, if , we could have  for some iteration n in the SDA algorithm, or the estimation sequence converges on, but may not reach exactly, the minimizer of (12), i.e., the output vector  can be a suboptimal solution of (12).  Example 7. For the 5-user system in Example 1, we first apply the MDA algorithm in [8] and get the minimum sum-rate , the fundamental partition  and an extreme point  in the core . By setting  and , we start the SDA algorithm with the initial point . At the first iteration , we have Then, . For  and , we have  and, therefore, . Since , we assign  and continue the iteration.
By repeating the same procedure in each iteration, we get the estimation sequence  that results in the update path The recursion converges at , where we have , which is the minimizer  of (12) for  and . Here,  is a fractional egalitarian solution, a fair optimal rate vector in , that can be implemented by 2-packet-splitting in CCDE.    5.2. Dependence Function
Based on (
14), Lemma 1(b) and the discussion in 
Section 3.3, it is not difficult to see that, for all 
, if 
 for any 
, then 
 and 
 are mutually dependent given the common randomness 
 in 
U, i.e., 
, hence the name dependence function. Moreover, due to the fact that 
, we can transfer arbitrarily small, but nonzero, coding cost from user 
j to user 
i for encoding the mutually shared information between users 
i and 
j, which is consistent with the nonzero exchange rate in 
Section 3.3.
In addition, we must have 
 for the coalition 
 such that 
, e.g., (
15). This is because 
 for all 
 such that 
 and 
 for all 
, i.e., given the common randomness in 
U, any 
 is only mutually dependent on any other 
 in the same coalition 
. This will be formally stated as the decomposition of 
 in Theorem 3.
  5.3. Complexity and Distributed Implementation
The SDA algorithm requires oracle calls of 
, instead of 
, which is equivalent to the entry of the entropy function 
H and avoids the complexity of calculating the Dilworth truncation. We derive the worst-case complexity of SDA as follows. For any initial point 
, the total number of iterations of the SDA algorithm is 
. Let
        
        denote the 
-size of the core 
. The maximum number of iterations of the SDA algorithm is 
. The minimization problem (13) in step 5 in the SDA algorithm is a SFM due to the intersecting submodularity of 
 (Lemma 3 in [
8]). Thus, each iteration of the SDA algorithm completes in 
 time and the overall complexity is 
 (The reason that the 
-size determines the upper bound on the number of iterations is explained in detail in (Section 10.1.1 in [
42])).
Example 8. For the estimation sequence  generated in Example 7 by the SDA algorithm, we show the error of the estimation  in terms of the -norm  in Figure 4. Since in each iteration of the SDA algorithm, the estimation  is updated along the steepest elementary exchange  by step size  toward the optimizer , we necessarily have  decreased by  each time. As in Figure 4, we have the error , a linearly decreasing curve. In this case, there are  iterations in the SDA algorithm so that we incur  calls of . In general, since the -size of  is , the worst-case complexity of the SDA algorithm when applied to the 5-user system in Figure 1, is  calls of .  The SDA algorithm can also be implemented in a decentralized manner: let each user i obtain the dependence function , a set of mutually dependent users given the common randomness in U, by him/herself in steps 4 to 6; the steps 7 to 13 can be completed by users’ communications over the broadcast channels. By doing so, the computational complexity incurred at each user is .
  5.4. Decomposition
Similar to the decomposition of the Shapley value in Theorem 1, we also have the decomposition property of the egalitarian solution in Theorem 3. We omit the proof since it is a direct result of Corollary 42 in [
8], Lemma 1 and Lemma 2(b).
Theorem 3. With  as the egalitarian solution, the minimizer of (11), or the fractional egalitarian solution, the minimizer of (12),  where  is the egalitarian solution or fractional egalitarian solution, respectively, in the core  of the subgame .  Theorem 3 states that the egalitarian solution 
 can be determined by allowing the subgames 
 for all 
 to obtain their own 
. This decomposition method can be used in combination with the SDA algorithm so that the complexity is reduced to 
, where 
 is the 
-size of the core 
 of the subgame 
 of maximum size. In addition, the users in each subgame can run the SDA algorithm in a distributed manner as discussed in 
Section 5.3 and therefore the complexity incurred at each user is 
.
Remark 2. Theorems 1 and 3 justify the exchange rate resulted from the mutual dependence in Section 3.3 when the game  is decomposed by the fundamental partition  into the subgames  for all : since the exchange rate, or mutual dependence, is only nonzero inside each subgame , we just need to let the users cooperating in the same  decide how to attain fairness.  Example 9. For the 5-user system in Example 1, consider searching the fractional egalitarian solution w.r.t.  in Example 7 by the decomposition method in Theorem 3. We first decompose  into subgames ,  and . For the subgames  and , we can directly assign  and , respectively. For the subgame , we apply the SDA algorithm and get the following update path to the fractional egalitarian solution : See Figure 5. Then, we get , the fractional egalitarian solution w.r.t.  in . In this case, we still have 5 iterations in the SDA algorithm and the convergence performance is exactly the same as in Figure 4. However, the complexity reduces to  calls of . In general, since , the complexity of the SDA algorithm when applied to the subgame , is  calls of .    6. Conclusions
We established the equivalence between the optimal rate region of CO and the core of a coalitional game with the characteristic cost function being the Dilworth truncation  measuring the remaining information  in  for all subsets  given the common randomness in U. For attaining fairness in the optimal rate region, we considered the Shapley value and the egalitarian solution. The Shapley value differs from the egalitarian solution in that the fairness is attained if each user i is penalized by the expected marginal cost or source coding rate  he/she incurs if in coalition X. By utilizing the fact that the Shapley value is the average over all extreme points in the core, we showed that an approximation, instead of the exact Shapley value, can be obtained by taking the mean over a desired number of randomly generated extreme points. We also proposed the SDA algorithm for obtaining the egalitarian solution in the core that can be implemented in CCDE by -packet-splitting. We showed that the game is itself decomposable by the fundamental partition  so that, given the common randomness,  and  for any two distinct  are mutually independent, while  and  for all  are mutually dependent. This dependence relationship leads to a decomposition method for obtaining the fair solutions: the Shapley value and the egalitarian solution can be obtained independently within each subgame.
The methods for searching the Shapley value and the egalitarian solution in this paper require the solutions to the minimum sum-rate problem, the value of  and  and also an optimal rate vector in  to initiate the SDA. To further improve the efficiency of attaining fairness in CO, it is worth studying whether we can directly attain the fairness in the optimal rate region without solving the minimum sum-rate problem first. On the other hand, apart from the fact that the egalitarian solution is more suitable to CCDE and WSN, it is worth understanding to which scenarios the fairness suggested by the Shapley value applies. Finally, the fractional egalitarian solution only determines a fair rate assigned to each user in CCDE. We still need a complete network coding scheme that also specifies the coefficients in the linear combination of chunks in each transmission.