Article

An Umbrella Converse for Data Exchange: Applied to Caching, Computing, and Shuffling †

1 Signal Processing & Communications Research Center, International Institute of Information Technology Hyderabad, Hyderabad 500032, India
2 Department of Electrical Engineering, Indian Institute of Technology Hyderabad, Kandi 502205, India
* Author to whom correspondence should be addressed.
† Part of this work was presented at the IEEE Information Theory Workshop 2020, held virtually from 11–15 April 2021.
Entropy 2021, 23(8), 985; https://doi.org/10.3390/e23080985
Submission received: 23 May 2021 / Revised: 18 July 2021 / Accepted: 24 July 2021 / Published: 30 July 2021
(This article belongs to the Special Issue Coding and Information Theory for Distributed Storage Systems)

Abstract:
The problem of data exchange between multiple nodes with storage and communication capabilities models several current multi-user communication problems like Coded Caching, Data Shuffling, Coded Computing, etc. The goal in such problems is to design communication schemes which accomplish the desired data exchange between the nodes with the optimal (minimum) amount of communication load. In this work, we present a converse to such a general data exchange problem. The expression of the converse depends only on the number of bits to be moved between different subsets of nodes, and does not assume anything further specific about the parameters in the problem. Specific problem formulations, such as those in Coded Caching, Coded Data Shuffling, and Coded Distributed Computing, can be seen as instances of this generic data exchange problem. Applying our generic converse, we can efficiently recover known important converses in these formulations. Further, for a generic coded caching problem with heterogeneous cache sizes at the clients with or without a central server, we obtain a new general converse, which subsumes some existing results. Finally we relate a “centralized” version of our bound to the known generalized independence number bound in index coding and discuss our bound’s tightness in this context.

1. Introduction and Main Result

Consider a system of $K$ nodes, denoted by $[K] \triangleq \{1, \ldots, K\}$, each of which has (not necessarily uniform) storage. The nodes can communicate with each other through a noiseless bus link, in which the transmissions of any node are received by all others. Each node possesses a collection of data symbols (represented in bits) in its local storage and demands another set of symbols present in other nodes. We formalize this as a data exchange problem.
Definition 1.
A data exchange problem on a set of K nodes involving a collection B of information bits is given by the following:
  • a collection $\{C_i : i \in [K]\}$, where $C_i \subseteq B$ denotes the subset of data present in node $i$,
  • a collection $\{D_i : i \in [K]\}$, where $D_i \subseteq \left(\bigcup_{j \neq i} C_j\right) \setminus C_i$ denotes the set of bits demanded by node $i$.
The above data exchange problem models a number of cache-enabled multi-receiver communication problems studied recently in the coding theory community, including Coded Caching [1], Coded Distributed Computing [2,3], Coded Data Shuffling [4,5,6], and Coded Data Rebalancing [7]. In [8], a special case of our general problem here was considered under the name of cooperative data exchange, where the goal was to reach a state in which all nodes have all the data in the system.
A solution to a given data exchange problem involves communication between the nodes. Each node $i$ encodes the symbols in $C_i$ into a codeword of length $l_i$ and sends it to all other nodes. The respective demanded symbols at any node are then to be decoded using the received transmissions from all the other nodes and the node's own content.
Formally, a communication scheme for the given data exchange problem consists of a set of encoding functions $\Phi \triangleq \{\phi_i : i \in [K]\}$ and decoding functions $\Psi \triangleq \{\psi_i : i \in [K]\}$, defined as follows.
$$\phi_i : \{0,1\}^{|C_i|} \to \{0,1\}^{l_i} \quad (\text{for some non-negative integer } l_i), \qquad \psi_i : \{0,1\}^{|C_i|} \times \{0,1\}^{\sum_{j \neq i} l_j} \to \{0,1\}^{|D_i|},$$
such that
$$\psi_i\left(C_i, \{\phi_j(C_j) : j \neq i\}\right) = D_i.$$
The communication load of the above scheme is defined as the total number of bits communicated, i.e.,
$$L(\Phi, \Psi) \triangleq \sum_{i \in [K]} l_i.$$
The optimal communication load is then denoted by
$$L^* \triangleq \min_{\Phi, \Psi} L(\Phi, \Psi).$$
The central result in this work is Theorem 1 in Section 1.1, which is a lower bound on the optimal communication load L * . Using this lower bound, we recover several important converse results of cache-enabled communication problems studied in the literature, including Coded Caching (Section 2), Data Shuffling (Section 3), and Distributed Computing (Section 4). In each of these sections, we briefly review each setting and then apply Theorem 1 to recover the respective converses. As a result, the proofs of these existing converses are also made simpler than what is already available in the literature for the respective settings. The generic structure of the converse proofs obtained using our data exchange bound is presented in Section 1.2. This structure includes three steps, which we also highlight at the appropriate junctures within the proofs themselves. The close relationship between these problems is quite widely known. This work gives a further formal grounding to this connection, by abstracting the common structure of these converses into a general form, which can potentially be applied to other new data exchange problems as well.
Apart from recovering existing results, more importantly we also use our data exchange lower bound to obtain new tight converse results for some settings, while improving tightness results of some known bounds. Specifically, we present a new converse for a generic coded caching setting with multi-level cache sizes. Using this, we are able to close the gap to optimality for some known special cases of this generic setting (Section 2.1). In Section 5, we show the relationship between a “centralized” version of our data exchange lower bound and an existing bound for index coding known as the α -bound or the generalized independence number bound [9]. In general, we find that our bound is weaker than the α -bound. However, for unicast index coding problems, we identify the precise conditions under which our data exchange bound is equal to the α -bound. In Section 6, we discuss the application of our data exchange lower bound to more generalized index coding settings, specifically distributed index coding [10,11] and embedded index coding [12].
Notation: For a positive integer $a$, let $[a] \triangleq \{1, \ldots, a\}$. For a set $S$, we denote by $S \setminus k$ the set of items in $S$ except for the item $k$, and represent the union $S \cup \{k\}$ as $S \cup k$. The binomial coefficient is denoted by $\binom{n}{k}$, which is zero if $k > n$. The set of all $t$-sized subsets of a set $A$ is denoted by $\binom{A}{t}$.

1.1. A Converse for the Data Exchange Problem

In this subsection, we will obtain a lower bound on the optimal communication load of the general data exchange problem defined in Section 1. This is the central result of this work. The predecessor to the proof technique of our data exchange lower bound is in [3], which first presented an induction based approach for the converse of the coded distributed computing setting. Our proof uses a similar induction technique.
Given a data exchange problem and for $P, Q \subseteq [K]$ such that $P \neq \emptyset$, let $a_P^Q$ denote the number of bits which are stored in every node in the subset of nodes $Q$ and stored in no other node, and demanded by every node in the subset $P$ and demanded by no other node, i.e.,
$$a_P^Q \triangleq \left| \Big(\bigcap_{i \in P} D_i\Big) \cap \Big(\bigcap_{j \in Q} C_j\Big) \setminus \Big( \Big(\bigcup_{j \notin Q} C_j\Big) \cup \Big(\bigcup_{i \notin P} D_i\Big) \Big) \right|.$$
Note that, by definition, $a_P^Q = 0$ under the following conditions.
  • If $P \cap Q \neq \emptyset$, as the bits demanded by any node are absent in the same node.
  • If $Q = \emptyset$, by Definition 1.
Theorem 1 gives a lower bound on the optimal communication load of a given data exchange problem. The proof of the theorem is relegated to Appendix A. The idea of the proof is as follows. If we consider only two nodes in the system, say $[K] = \{1, 2\}$, then each of the two nodes has to transmit whatever bits it has which are demanded by the other node, i.e., $L^* \geq a_{\{2\}}^{\{1\}} + a_{\{1\}}^{\{2\}}$. The proof of the theorem uses this as a base case and employs an induction technique to obtain a sequence of cut-set bounds leading to the final expression.
Theorem 1.
$$L^* \geq \sum_{P \subseteq [K]} \; \sum_{Q \subseteq [K] \setminus P} \frac{|P|}{|P| + |Q| - 1} \, a_P^Q.$$
Theorem 1, along with the observation that $a_{\emptyset}^{Q} = 0 = a_{P}^{\emptyset}$, gives us the following corollary, which is a restatement of Theorem 1.
Corollary 1.
Let
$$n(p, q) \triangleq \sum_{\substack{P, Q \subseteq [K] \,:\, |P| = p, \, |Q| = q, \, P \cap Q = \emptyset}} a_P^Q$$
denote the total number of bits present exactly in q nodes and demanded exactly by p (other) nodes. Then,
$$L^* \geq \sum_{p=1}^{K-1} \sum_{q=1}^{K-p} \frac{p}{p + q - 1} \, n(p, q).$$
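As a small illustration of how the bound in Corollary 1 can be evaluated once the counts $n(p,q)$ are known, the following Python sketch (the function name and the dictionary input format are our own choices for illustration, not part of the paper) computes the right-hand side of the corollary.

from fractions import Fraction

def data_exchange_lower_bound(K, n):
    # n maps (p, q) to the number of bits stored at exactly q nodes
    # and demanded by exactly p (other) nodes, as in Corollary 1.
    bound = Fraction(0)
    for p in range(1, K):               # p = 1, ..., K - 1
        for q in range(1, K - p + 1):   # q = 1, ..., K - p
            bound += Fraction(p, p + q - 1) * n.get((p, q), 0)
    return bound

# Two-node base case from the proof sketch of Theorem 1: each node must send
# the bits that only it holds and that the other node demands.
print(data_exchange_lower_bound(2, {(1, 1): 5}))   # 5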
Remark 1.
In [13], the authors presented an essentially identical bound (Lemma 1, [13]) as Corollary 1 in the setting of coded distributed computing. The proof given in [13] for this lemma also generalizes the arguments presented in [3], as does this work. Our present work considers a general data exchange problem and derives the lower bound in Theorem 1 for the communication load in such a setting. We had derived this lower bound independently in the conference version of this paper [14], and only recently came to know about the bound in [13]. In subsequent sections, we show how to use this bound to recover converses for various multi-terminal communication problems considered in the literature in recent years, and also obtain new converses for some settings. We also discuss, in Section 5, the looseness of Theorem 1 by considering a centralized version of the data exchange problem and comparing our bound with the generalized independence number bound in index coding. In Section 6, we discuss the application of our data exchange bound to more generalized index coding settings. These are the novel features of our present work, compared to the bound in Lemma 1 of [13].

1.2. A Generic Outline of the Converse Proofs Presented in This Paper

In this work, we derive converse bounds for various settings in coded caching, coded distributed computing, and coded data shuffling using the bound in Theorem 1. Some of these converse bounds are already available in the literature, while others are novel. Each setting imposes some constraints on the size of the demands and the size of the pre-stored content at each node. The bound in Theorem 1 applies for the setting in which the nodes have some predetermined local storage and some specific demanded bits. However, the settings of coded caching, coded distributed computing, and coded data shuffling permit the design of the initial storage so that the communication load is minimized. Further, the optimal communication load as defined in the literature for some of these settings involves maximization over all possible demand configurations, keeping only the size of the demands fixed. In keeping with these specifics, our bound in Theorem 1 must be tuned for each setting to obtain the respective converse, as captured by the following three steps, which describe the generic structure behind our converse proofs.
  • Applying Theorem 1 to the present setting, we obtain a lower bound expression on the communication load, assuming an arbitrary choice of demands across the nodes and some arbitrary but fixed storage across the nodes.
  • “Symmetrization” step: In this step, the lower bound expression obtained in the previous step is averaged over some carefully chosen configurations of demanded bits at the nodes. This step helps to remove the dependency of the lower bound on the specific choice of demands.
  • Refine the averaged bound by imposing the constraints on the size of the initial storage at the nodes, and using convexity of terms inside the averaged bound to obtain the final expression of the bound. This step helps to remove the dependency of the converse on the specific initial storage configuration at the nodes.
These three steps enable us to give simpler proofs than those in the literature for known converses, and also to obtain novel converses for some variants of the same problems. Further, this also illustrates the generic nature of the data exchange bound of Theorem 1. In the converse proofs that are to follow in this paper, we will highlight these steps at the appropriate junctures.

2. Coded Caching

In this section, we apply Theorem 1 to recover the lower bound obtained in [15] for the problem of coded caching introduced in [1]. Further, using Theorem 1, we prove in Section 2.1 a new converse for a generic coded caching problem under multiple cache size settings. This provides new converses for some existing settings in literature, and also tightens bounds in some others. In Section 2.2, we recover a converse for coded caching with multiple file requests. In Section 2.3, we recover the converse for coded caching with decentralized cache placement.
We now describe the main setting of this section. In the coded caching system introduced in [1], there is one server connected via a noiseless broadcast channel to $K$ clients indexed as $[K]$. The server possesses $N$ files, each of size $F$ bits, where the files are indexed as $\{W_i : i \in [N]\}$. Each client contains local storage, or a cache, of size $MF$ bits, for some $M \leq N$. We call this a $(K, M, N, F)$ coded caching system. Figure 1 illustrates this system model.
The coded caching system operates in two phases: in the caching phase, which occurs during the low-traffic periods, the caches of the clients are populated by the server with some (uncoded) bits of the file library. This is known as uncoded prefetching. In this phase, the demands of the clients are not known. We denote the caching function for node $k$ as $\zeta_k$, and thus the cache content at client $k$ at the end of the caching phase is denoted as $Z_k \triangleq \zeta_k(\{W_i : i \in [N]\})$.
In the delivery phase, which occurs during the high-traffic periods, each client demands one file from the server, and the server makes transmissions via the broadcast channel to satisfy the client demands. Let the demanded file at client $k$ be $W_{d_k}$, where $d_k \in [N]$. The server uses an encoding function $\phi$ to obtain coded transmissions $X = \phi(\{W_{d_k} : k \in [K]\})$ such that each client $k \in [K]$ can employ a decoding function $\psi_k$ to decode its demanded file using the coded transmissions and its cache content, i.e., $\psi_k(X, Z_k) = W_{d_k}$.
The communication load $L_c(\{\zeta_k : k \in [K]\}, \phi, \{\psi_k : k \in [K]\})$ of the above coded caching scheme is the number of bits transmitted in the delivery phase (i.e., the length of $X$) in the worst case (where “worst case” denotes maximization across all possible demands). The optimal communication load, denoted by $L_c^*$, is then defined as
$$L_c^* \triangleq \min_{\{\zeta_k : k \in [K]\}, \, \phi, \, \{\psi_k : k \in [K]\}} L_c(\{\zeta_k : k \in [K]\}, \phi, \{\psi_k : k \in [K]\}).$$
For this system model, when $\frac{MK}{N} \in \mathbb{Z}$, the work in [1] proposed a caching and delivery scheme which achieves a communication load (normalized by the size of the file $F$) given by $K\left(1 - \frac{M}{N}\right) \min\left\{\frac{1}{1 + \frac{MK}{N}}, \frac{N}{K}\right\}$. In [15], it was shown that, for any coded caching scheme with uncoded cache placement, the optimal communication load is lower bounded by $L_c^* \geq \frac{K(1 - M/N)F}{1 + MK/N}$. Therefore, it was shown that, when $K \leq N$ and $\frac{MK}{N} \in \mathbb{Z}$, the scheme in [1] is optimal.
In the present section, we give another proof of the lower bound for coded caching derived in [15]. We later discuss the case of arbitrary K , N in Remark 2.
We now proceed with restating the lower bound from [15]. Note that these converses are typically normalized by the file size in the literature; however, we recall them in their non-normalized form in order to relate them to our data exchange problem setting.
Theorem 2
([15]). Consider a $(K, M, N, F)$ coded caching system with $K \leq N$. The optimal communication load $L_c^*$ in the delivery phase satisfies
$$L_c^* \geq \frac{K(1 - M/N)}{1 + MK/N} \, F.$$
Proof based on Theorem 1.
We assume that the caching scheme and delivery scheme of the coded caching scheme are designed such that the communication load $L_c$ is exactly equal to the optimal load $L_c^*$. Let the $K$ client demands in the delivery phase be represented by a demand vector $\mathbf{d} = (d_1, \ldots, d_K)$, where $d_k \in [N]$ denotes the index of the demanded file of the client $k$. We are interested in the worst case demands scenario; this means we can assume that all the demanded files are distinct, i.e., $d_k \neq d_{k'}$ for all $k \neq k'$, to bound $L_c^*$ from below, without loss of generality.
We observe that a $(K, M, N, F)$ coded caching problem during the delivery phase satisfies Definition 1 of a data exchange problem on $K+1$ nodes indexed as $\{0, 1, \ldots, K\}$, where we give the index 0 to the server node and include this in the data exchange system. Before proceeding, we remark that the proof below gives a lower bound where all $K+1$ nodes in the system may transmit, whereas in the coded caching system of [1] only the server can transmit. Thus, any lower bound that we obtain in this proof applies to the setting in [1] also.
Clearly in the equivalent data exchange problem, the node 0 (the server) does not demand anything, but has a copy of all the bits in the entire system. With these observations, we have, by the definition of $a_P^Q$ in (1),
$$a_P^Q = 0, \quad \text{if } 0 \notin Q \text{ or if } P \notin \binom{[K]}{1},$$
where the quantities $a_P^Q$ clearly depend on the demand vector $\mathbf{d}$.
We thus use a new set of variables: for each $k \in [K]$, $Q \subseteq [K]$, and given demands $\mathbf{d} = (d_1, \ldots, d_K)$, let $c_k^Q(\mathbf{d})$ denote the number of bits demanded by receiver node $k$ that are available only at the nodes $Q \cup \{0\}$, i.e.,
$$c_k^Q(\mathbf{d}) \triangleq a_{\{k\}}^{Q \cup 0}.$$
Using these definitions, we proceed following the three steps given in Section 1.2.
Applying Theorem 1: By Theorem 1, we have the following lower bound for demand vector d
$$L_c^* = L_c \geq \sum_{P \subseteq [K] \cup \{0\}} \; \sum_{Q \subseteq ([K] \cup \{0\}) \setminus P} \frac{|P|}{|P| + |Q| - 1} \, a_P^Q = \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d}),$$
where (6) is obtained from (4) and (5).
“Symmetrizing” (6) over carefully chosen demand vectors: We now consider the averaging of bounds of type (6) over a chosen collection of N demand vectors, given by
$$\mathcal{D} \triangleq \left\{ \left( j \oplus_N 0, \; j \oplus_N 1, \; \ldots, \; j \oplus_N (K-1) \right) : j = 0, \ldots, N-1 \right\},$$
where $j \oplus_N i \triangleq ((j + i) \bmod N) + 1$.
That is, $\mathcal{D}$ contains the demand vectors consisting of $K$ consecutive files, starting with each of the $N$ files as the demand of the first client.
Averaging (6) over the set of $N$ demand vectors in $\mathcal{D}$, the lower bound we obtain is
$$L_c^* \geq \frac{1}{N} \sum_{\mathbf{d} \in \mathcal{D}} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d}).$$
Let $b_n^Q$ denote the number of bits of file $n$ stored only in $Q \cup \{0\}$. Then, in the above sum, $b_n^Q = c_k^Q(\mathbf{d})$ if and only if $d_k = n$. This happens precisely once in the collection of $N$ demand vectors in $\mathcal{D}$. Thus, we have
$$L_c^* \geq \frac{1}{N} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \sum_{\mathbf{d} \in \mathcal{D}} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d})$$
$$= \frac{1}{N} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \sum_{n=1}^{N} \frac{1}{|Q| + 1} \, b_n^Q$$
$$= F \sum_{Q \subseteq [K]} \sum_{n=1}^{N} \frac{K - |Q|}{|Q| + 1} \cdot \frac{b_n^Q}{NF},$$
where (10) follows since, for a fixed $n$ and $Q$, the index $k$ ranges over $[K] \setminus Q$ in (9), and by multiplying and dividing by $F$.
Refining the bound (10) by using the constraints of the setting: Now, by definition, $\sum_{n} \sum_{Q \subseteq [K]} b_n^Q = NF$, and thus $\left\{ b_n^Q / NF : n \in [N], Q \subseteq [K] \right\}$ denotes a probability mass function. Furthermore, $\sum_{n} \sum_{Q \subseteq [K]} |Q| \, b_n^Q \leq KMF$. As $(K - x)/(1 + x)$ is a convex decreasing function for $x \geq 0$, using Jensen's inequality, we have $L_c^* \geq \frac{K - x}{1 + x} F$, where
$$x = \sum_{n} \sum_{Q \subseteq [K]} |Q| \, \frac{b_n^Q}{NF} \leq \frac{KMF}{NF} = \frac{KM}{N}.$$
Thus, we get $L_c^* \geq \frac{K(1 - M/N)}{1 + MK/N} F$, which completes the proof. □
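The closed-form expressions above are easy to check numerically. The following Python sketch (with illustrative parameter values of our own choosing) evaluates the achievable load of the scheme in [1] and the lower bound of Theorem 2, in units of $F$ bits, and confirms that they coincide when $MK/N$ is an integer and $K \leq N$.

from fractions import Fraction

def achievable_load(K, M, N):
    # Load of the scheme in [1] (in units of F bits), valid when M*K/N is an integer.
    return K * (1 - Fraction(M, N)) * min(Fraction(1, 1 + M * K // N), Fraction(N, K))

def theorem2_bound(K, M, N):
    # Lower bound of Theorem 2 (in units of F bits), for uncoded placement and K <= N.
    return K * (1 - Fraction(M, N)) / (1 + Fraction(M * K, N))

K, N, M = 4, 4, 2   # illustrative values with M*K/N = 2 an integer and K <= N
print(achievable_load(K, M, N), theorem2_bound(K, M, N))   # 2/3 2/3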
Remark 2.
In the previous part of this section, we have shown the converse for the worst case communication load $L_c^*$ for coded caching in the regime of $K \leq N$. We now consider a general coded caching setup with arbitrary $K, N$ values and cache size $M$. Consider a positive integer $N_u \leq \min\{N, K\}$. For a fixed caching scheme denoted by $\zeta = \{\zeta_k : k \in [K]\}$, let the minimum communication load for satisfying the clients, maximized across all possible demand vectors with exactly $N_u$ distinct files in each of the demand vectors, be denoted as $L_c^*(N_u, \zeta)$.
In the work [16], it was shown that for $t \triangleq \frac{MK}{N}$,
$$L_c^*(N_u, \zeta) \geq g_{N_u}(t),$$
where $g_{N_u}(x)$ is defined as the lower convex envelope of the points
$$\mathcal{P}(N_u) = \left\{ \left( x, \; \frac{\binom{K}{x+1} - \binom{K - N_u}{x+1}}{\binom{K}{x}} \, F \right) : x \in \{0, \ldots, K\} \right\}.$$
Note that $g_{N_u}(t)$ is independent of $\zeta$. For this general setting, the optimal worst case load $L_c^*$, as defined in (3), satisfies
$$L_c^* = \min_{\zeta} L_c^*(\min\{N, K\}, \zeta).$$
Thus, from (11), we get
$$L_c^* \geq g_{\min\{N, K\}}(t),$$
which is the converse bound on the worst case communication load proved in [16] for this general scenario. In Appendix B, we use our data exchange bound in Theorem 1 to recover (11), which therefore shows (12).
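The function $g_{N_u}(t)$ can be computed directly from its definition. The Python sketch below (the helper names and parameter values are ours, chosen only for illustration) builds the point set $\mathcal{P}(N_u)$, normalized by $F$, and evaluates its lower convex envelope at $t$.

from fractions import Fraction
from math import comb

def lower_convex_envelope(points, t):
    # Lower convex envelope of a finite set of (x, y) points with increasing x,
    # evaluated at t; the lower hull is built with a monotone-chain pass.
    hull = []
    for p in points:
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # drop hull[-1] if it lies on or above the segment hull[-2] -> p
            if (y2 - y1) * (p[0] - x1) >= (p[1] - y1) * (x2 - x1):
                hull.pop()
            else:
                break
        hull.append(p)
    for (x1, y1), (x2, y2) in zip(hull, hull[1:]):
        if x1 <= t <= x2:
            return y1 + (y2 - y1) * (t - x1) / (x2 - x1)
    raise ValueError("t lies outside the range of the given points")

def g(N_u, K, t):
    # g_{N_u}(t) from Remark 2: lower convex envelope of the points P(N_u),
    # with loads expressed in units of F bits.
    pts = [(x, Fraction(comb(K, x + 1) - comb(K - N_u, x + 1), comb(K, x)))
           for x in range(K + 1)]
    return lower_convex_envelope(pts, t)

# Hypothetical parameters: K = 4, N_u = min{N, K} = 4, t = MK/N = 2.
print(g(4, 4, Fraction(2)))   # 2/3, agreeing with Theorem 2 since K <= N here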

2.1. Server-Based and Server-Free Coded Caching with Heterogeneous Cache Sizes at Clients

So far we have discussed the coded caching scenario where there is a central server containing the entire file library and the client cache sizes are homogeneous, i.e., the same at all clients. We now describe a generalization of the result in Theorem 2 to the case of systems in which the clients have heterogeneous cache sizes, with either a centralized server present or absent. The proof of this is easily obtained from our data exchange bound in Theorem 1. To the best of our knowledge, a converse for this general setting is not known in the literature. Using this converse, we can derive new converses and tighten existing converses for various special cases of this setting, which include widely studied coded caching settings, such as device-to-device coded caching [17].
Consider a coded caching system with $N$ files (each of size $F$) with $K$ client nodes denoted by a set $\mathcal{K}_T$. We shall indicate by the value $\gamma$ the presence ($\gamma = 1$) or absence ($\gamma = 0$) of a centralized server in the system containing the file library. For the purpose of utilizing our data exchange bound, we assume that all the nodes in the system are capable of transmissions; thereby, any converse for this scenario is also valid for the usual coded caching scenario in which only the server (if it is present) does transmissions in the delivery phase. The set of clients $\mathcal{K}_T$ is partitioned into subsets $\{\mathcal{K}_{T_i} : i = 1, \ldots, t\}$, where the nodes in subset $\mathcal{K}_{T_i}$ can store a fraction $\gamma_{T_i}$ of the file library. Let $|\mathcal{K}_{T_i}| = K_{T_i}$. We now give our converse for this setting. The caching and the delivery scheme, as well as the optimal communication load $L_c^*$, are defined as in the case of coded caching with homogeneous cache sizes.
Proposition 1.
For the above heterogeneous cache sizes setting, assuming $K \leq N$, the optimal communication load $L_c^*$ for uncoded cache placement is lower bounded as follows.
$$L_c^* \geq \frac{K - \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}}{\gamma + \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}} \, F.$$
Before giving the proof of Proposition 1, we give the following remarks regarding the generality of Proposition 1, the new results which arise by applying Proposition 1 and various results from existing literature that are subsumed or improved by it.
  • Heterogeneous Cache Sizes: There exist a number of works discussing distinct or heterogeneous client cache sizes, for instance, in [18,19]. However, closed form expressions for the lower bound on the load seem to be lacking for such scenarios, to the best of our knowledge. Proposition 1 gives a lower bound for all such settings.
  • Device-to-Device Coded Caching: Suppose there is no designated server in a coded caching setup, but the client nodes themselves are responsible for exchanging the information to satisfy their demands. This corresponds to the case of Device-to-Device (D2D) coded caching, first explored in [17]. In [17], an achievable scheme was presented for the case when each (client) node has equal cache fraction $\frac{M}{N}$, and this scheme achieves a communication load of $\left(\frac{N}{M} - 1\right)F$ bits. In the work [20], it was shown that this communication load is optimal (for the regime of $K \leq N$) over all possible “one shot” schemes (where “one shot” refers to those schemes in which each demanded bit is decoded using the transmission from only one other node), and further it was shown that the load is within a multiplicative factor of 2 of the optimal communication load under the constraint of uncoded cache placement. We remark that the D2D setting of [17] corresponds to the special case of our current setting, with $\gamma = 0$, $t = 1$, $K_{T_1} = K$, and $\gamma_{T_1} = M/N$. By this correspondence, by applying Proposition 1, we see that the load in this case is lower bounded as $\left(\frac{N}{M} - 1\right)F$, thus showing that the achievable scheme in [17] is exactly optimal under uncoded cache placement. The D2D scenario with heterogeneous cache sizes was explored in [21], in which the optimal communication load was characterized as the solution of an optimization problem. However, no closed form expression of the load for such a scenario is mentioned. Clearly, our Proposition 1 gives such a lower bound, when we fix $\gamma = 0$, for any number of levels $t$ of the client-side cache sizes.
Further, the result for coded caching with a server and equal cache sizes at receivers, as in Theorem 2, is clearly obtained as a special case of Proposition 1 with $\gamma = 1$, $t = 1$, $K_{T_1} = K$, and $\gamma_{T_1} = \frac{M}{N}$.
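As a quick sanity check of these specializations, the following Python sketch (parameter values and function names chosen by us purely for illustration) evaluates the bound of Proposition 1 in units of $F$ bits and confirms that it reduces to the D2D bound $\left(\frac{N}{M} - 1\right)F$ when $\gamma = 0$ and to the bound of Theorem 2 when $\gamma = 1$, in the single-cache-level case.

from fractions import Fraction

def hetero_lower_bound(gamma, groups):
    # Proposition 1 (in units of F bits), assuming K <= N and uncoded placement.
    # gamma = 1 if a central server is present, 0 otherwise;
    # groups is a list of (K_Ti, gamma_Ti) pairs describing the cache-size levels.
    K = sum(k for k, _ in groups)
    cached = sum(k * Fraction(g) for k, g in groups)   # sum_i K_Ti * gamma_Ti
    return (K - cached) / (gamma + cached)

# Hypothetical single-level example with K = 4 clients, each storing M/N = 1/2:
print(hetero_lower_bound(0, [(4, Fraction(1, 2))]))   # 1 = N/M - 1 (D2D case)
print(hetero_lower_bound(1, [(4, Fraction(1, 2))]))   # 2/3, matching Theorem 2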
We now proceed to prove Proposition 1. The proof is similar to that of Theorem 2.
Proof of Proposition 1.
As in the proof of Theorem 2, we will denote the server node as the node 0 and assume a caching and delivery scheme which achieves the optimal load L c * for worst case client demands.
Applying Theorem 1, for our setting, we have
$$L_c^* \geq \begin{cases} \displaystyle\sum_{P \subseteq \mathcal{K}_T} \; \sum_{Q \subseteq \mathcal{K}_T \setminus P} \frac{|P|}{|P| + |Q| - 1} \, a_P^Q & \text{if } \gamma = 0, \\[2mm] \displaystyle\sum_{P \subseteq \mathcal{K}_T} \; \sum_{Q \subseteq (\mathcal{K}_T \cup 0) \setminus P} \frac{|P|}{|P| + |Q| - 1} \, a_P^Q & \text{if } \gamma = 1. \end{cases}$$
Note that if $\gamma = 1$ (i.e., the server is present), then $a_P^Q = 0$ whenever $0 \notin Q$.
For a specific demand vector $\mathbf{d} = (d_1, \ldots, d_K)$ consisting of distinct demands and for some $Q \subseteq \mathcal{K}_T$, we define the quantity $c_k^Q(\mathbf{d})$ as follows.
$$c_k^Q(\mathbf{d}) = \begin{cases} a_{\{k\}}^{Q} = \text{number of bits demanded by } k \text{, available exclusively in } Q, & \text{if } \gamma = 0, \\ a_{\{k\}}^{Q \cup 0} = \text{number of bits demanded by } k \text{, available exclusively in } Q \cup 0, & \text{if } \gamma = 1. \end{cases}$$
Symmetrization over appropriately chosen demand vectors: Choosing the same special set of demand vectors D as in (7) and averaging the above lower bound over the demand vectors in D similar to the proof of Theorem 2, we obtain a bound similar to (8):
$$L_c^* \geq \begin{cases} \displaystyle\frac{1}{N} \sum_{\mathbf{d} \in \mathcal{D}} \sum_{k \in \mathcal{K}_T} \sum_{Q \subseteq \mathcal{K}_T \setminus k} \frac{c_k^Q(\mathbf{d})}{|Q|} & \text{if } \gamma = 0, \\[2mm] \displaystyle\frac{1}{N} \sum_{\mathbf{d} \in \mathcal{D}} \sum_{k \in \mathcal{K}_T} \sum_{Q \subseteq \mathcal{K}_T \setminus k} \frac{c_k^Q(\mathbf{d})}{|Q \cup 0|} & \text{if } \gamma = 1. \end{cases}$$
Combining the two expressions in (14), we can write a single equation which holds for $\gamma \in \{0, 1\}$,
$$L_c^* \geq \frac{1}{N} \sum_{\mathbf{d} \in \mathcal{D}} \sum_{k \in \mathcal{K}_T} \sum_{Q \subseteq \mathcal{K}_T \setminus k} \frac{c_k^Q(\mathbf{d})}{\gamma + |Q|}.$$
We now define the term $b_n^Q$ as follows.
$$b_n^Q = \begin{cases} \text{number of bits of file } n \text{ available exclusively in } Q, & \text{if } \gamma = 0, \\ \text{number of bits of file } n \text{ available exclusively in } Q \cup 0, & \text{if } \gamma = 1. \end{cases}$$
Using the above definition of $b_n^Q$ and observing that each demand vector in $\mathcal{D}$ has distinct components, Equation (15) can be written as
$$L_c^* \geq \frac{1}{N} \sum_{k \in \mathcal{K}_T} \sum_{Q \subseteq \mathcal{K}_T \setminus k} \sum_{\mathbf{d} \in \mathcal{D}} \frac{c_k^Q(\mathbf{d})}{\gamma + |Q|}$$
$$= \frac{1}{N} \sum_{k \in \mathcal{K}_T} \sum_{Q \subseteq \mathcal{K}_T \setminus k} \sum_{n=1}^{N} \frac{b_n^Q}{\gamma + |Q|}$$
$$= \frac{1}{N} \sum_{Q \subseteq \mathcal{K}_T} \sum_{n=1}^{N} \frac{(K - |Q|) \, b_n^Q}{\gamma + |Q|}.$$
Refining the bound in (19) using setting constraints and convexity: By the definition of $b_n^Q$ in (16), we have $\sum_{n} \sum_{Q \subseteq \mathcal{K}_T} b_n^Q = NF$. Further, $\sum_{n} \sum_{Q \subseteq \mathcal{K}_T} |Q| \, b_n^Q \leq \sum_{i=1}^{t} K_{T_i} \gamma_{T_i} NF$. Furthermore, for $\gamma \geq 0$, the function $\frac{K - x}{\gamma + x}$ is a convex decreasing function in $x$ for $x > 0$. Thus, using Jensen's inequality, we have $L_c^* \geq \frac{K - x}{\gamma + x} F$, where
$$x = \sum_{n} \sum_{Q \subseteq \mathcal{K}_T} |Q| \, \frac{b_n^Q}{NF} \leq \frac{\sum_{i=1}^{t} K_{T_i} \gamma_{T_i} NF}{NF} = \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}.$$
This completes the proof. □
Remark 3.
Proposition 1 holds when $N \geq K$. This scenario is the most studied case in the literature and is practically more relevant than the case $K > N$. We now provide lower bounds for the heterogeneous cache sizes setting for general values of $K, N$, which includes the case $K > N$. As before, we consider two cases: $\gamma = 1$ indicates the presence of a centralized server in the system and $\gamma = 0$ indicates its absence.
Case 1, γ = 1 : For the case where a centralized server is present, i.e., γ = 1 , we have
$$L_c^* \geq g_{\min\{N, K\}}\left( \sum_{i=1}^{t} K_{T_i} \gamma_{T_i} \right),$$
where the function $g_{\min\{N, K\}}$ is defined in Remark 2. The derivation of this lower bound follows the steps in Appendix B until (A21), where we choose $N_u = \min\{N, K\}$. Without loss of generality, we assume that all caches are fully populated with uncoded bits from the library; thus, the total memory occupied by the cached bits, $\sum_{Q \subseteq [K]} |Q| \, a^Q$, is equal to the sum of all the cache memory available in the system, $\sum_{i=1}^{t} K_{T_i} \gamma_{T_i} NF$. Applying Jensen's inequality on (A21) and using the fact $\frac{\sum_{Q \subseteq [K]} |Q| \, a^Q}{NF} = \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}$, we immediately arrive at the lower bound (20).
Case 2, γ = 0 : In this case, the optimal worst-case communication load can be lower bounded as follows:
$$L_c^* \geq \frac{\min\{N, K\}}{K} \cdot \frac{K - \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}}{\sum_{i=1}^{t} K_{T_i} \gamma_{T_i}} \, F.$$
The proof of this lower bound follows a similar approach as Appendix B and is outlined in Appendix C. Note that when $N \geq K$, both (20) and (21) become identical to the inequality in Proposition 1.

2.2. Coded Caching with Multiple File Requests

In [22], coded caching with multiple file requests was considered, in which each client requests any $\Delta$ files out of the $N$ files in the delivery phase. It was shown in [22] (Section V.A) that if $\Delta K \leq N$, then the optimal worst case communication load can be lower bounded as
$$L_c^* \geq \frac{K \Delta \, (1 - M/N)}{1 + MK/N} \, F.$$
The work in [22] also gives an achievable scheme based on the scheme in [1] which meets the above bound. The same lower bound can be derived using Theorem 1 also, by following a similar procedure as that of the proof of Theorem 2.
Applying Theorem 1, we give the proof in brief. The demand vector assumed in the proof of Theorem 2 becomes a $K\Delta$-length vector in this case, consisting of $K$ subvectors, each of length $\Delta$, capturing $\Delta$ distinct demands for each client. The proof proceeds as is until (6).
Symmetrization: The set $\mathcal{D}$ in (7) now contains the $K\Delta$-length vectors of consecutive file indices, cyclically constructed, starting from $(1, \ldots, K\Delta)$, i.e.,
$$\mathcal{D} \triangleq \left\{ \left( j \oplus_N 0, \; j \oplus_N 1, \; \ldots, \; j \oplus_N (K\Delta - 1) \right) : j = 0, \ldots, N-1 \right\}.$$
Thus, if the demand vector considered is $\mathbf{d}(j) \triangleq \left( j \oplus_N 0, \; j \oplus_N 1, \; \ldots, \; j \oplus_N (K\Delta - 1) \right) \in \mathcal{D}$, then the indices of the demanded files at client $k \in [K]$, denoted by $d_k(j)$, are given by
$$d_k(j) = \left\{ j \oplus_N (k-1)\Delta, \; j \oplus_N \left( (k-1)\Delta + 1 \right), \; \ldots, \; j \oplus_N (k\Delta - 1) \right\}.$$
The averaged lower bound expression similar to (8) is then obtained as
$$L_c^* \geq \frac{1}{N} \sum_{j=0}^{N-1} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d}(j)).$$
In this expression, we have $c_k^Q(\mathbf{d}(j))$, which now indicates the number of bits of the $\Delta$ distinct and consecutive files indexed by $d_k(j)$ and available exclusively at the nodes in $Q \cup 0$ (0 denoting the server).
Observation: $c_k^Q(\mathbf{d}(j)) = \sum_{n \in d_k(j)} b_n^Q$, where $b_n^Q$ denotes the number of bits of file $n$ available exclusively in the nodes $Q \cup 0$, as in the proof of Theorem 2.
Now, $n \in d_k(j)$ if and only if the file $n$ is demanded by client $k$. By definition of $\mathcal{D}$, the event $n \in d_k(j)$ happens for precisely $\Delta$ values of the index $j$. From (24), applying the above observation, we have the following.
$$L_c^* \geq \frac{1}{N} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \sum_{j=0}^{N-1} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d}(j)) = \frac{1}{N} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q| + 1} \sum_{j=0}^{N-1} \sum_{n \in d_k(j)} b_n^Q = \frac{1}{N} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \sum_{n=1}^{N} \frac{\Delta}{|Q| + 1} \, b_n^Q.$$
Refining the bound in (25) using the setting constraints: We use the constraints of the setting and the convexity of the resultant expression to refine (25). This refinement essentially follows similar subsequent steps as in the proof of Theorem 2 following (9), and leads finally to (22).
Remark 4.
The work in [23] considers a coded caching setup in which $\Lambda$ caches ($\Lambda \leq K$) are shared between the $K$ clients. The special case when $\Lambda$ divides $K$ and each cache is serving exactly $\frac{K}{\Lambda}$ clients is equivalent to the scenario of the multiple file requests in [22] with $\Lambda$ clients, each demanding $\frac{K}{\Lambda}$ files. The above proof then recovers the converse for this setting, which is obtained in [23] (Section III.A of [23]).

2.3. Coded Caching with Decentralized Caching

Theorem 2 and the subsequent results discussed above hold for the centralized caching framework, in which the caching phase is designed carefully in a predetermined fashion. In [24], the idea of decentralized placement was introduced, in which the caching phase is not coordinated centrally (this was called “decentralized coded caching” in [24]). In this scenario, each client, independently of others, caches a fraction $\gamma = \frac{M}{N}$ of the bits in each of the $N$ files in the file library, chosen uniformly at random. For this scenario, the server (which has the file library) is responsible for the delivery phase. The optimal communication load $L_c^*$ is defined as the minimum worst case communication load over all possible delivery schemes for a given caching configuration, randomly constructed as given above. For the case of $K \leq N$, the authors of [24] show a scheme which achieves the worst case communication load $L_c = \frac{KF(1 - M/N)\left(1 - (1 - M/N)^K\right)}{MK/N}$. This was shown to be optimal for large $F$ in [16] and also in [25] via a connection to index coding. In the following, we show that the same optimality follows easily via our Theorem 1.
Assume that we have distinct demands at the $K$ clients, as in the proof of Theorem 2, given by the demand vector $\mathbf{d}$. We first note that by the law of large numbers, as $F$ increases, for the decentralized cache placement, for any $k \in [K]$, $Q \subseteq [K] \setminus k$, we have
$$c_k^Q(\mathbf{d}) = F \left( \frac{M}{N} \right)^{|Q|} \left( 1 - \frac{M}{N} \right)^{K - |Q|},$$
with probability close to 1, where $c_k^Q(\mathbf{d})$ is as defined in (5). This observation enables us to avoid the steps 2 and 3 mentioned in Section 1.2, as the value of $c_k^Q(\mathbf{d})$ is independent of the specific random cache placement or the demands chosen (as long as they are distinct). Using this in (6), we get
$$L_c^* \geq \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q| + 1} \, c_k^Q(\mathbf{d}) = \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus k} \frac{F}{|Q| + 1} \left( \frac{M}{N} \right)^{|Q|} \left( 1 - \frac{M}{N} \right)^{K - |Q|}$$
$$= F \sum_{Q \subseteq [K]} \frac{K - |Q|}{|Q| + 1} \left( \frac{M}{N} \right)^{|Q|} \left( 1 - \frac{M}{N} \right)^{K - |Q|} = F \sum_{i=0}^{K} \binom{K}{i} \frac{K - i}{i + 1} \left( \frac{M}{N} \right)^{i} \left( 1 - \frac{M}{N} \right)^{K - i}$$
$$= F \sum_{i=0}^{K} \binom{K}{i+1} \left( \frac{M}{N} \right)^{i} \left( 1 - \frac{M}{N} \right)^{K - i} = F \sum_{j=1}^{K} \binom{K}{j} \left( \frac{M}{N} \right)^{j-1} \left( 1 - \frac{M}{N} \right)^{K - j + 1}$$
$$= F \, \frac{N}{M} \left( 1 - \frac{M}{N} \right) \sum_{j=1}^{K} \binom{K}{j} \left( \frac{M}{N} \right)^{j} \left( 1 - \frac{M}{N} \right)^{K - j} = \frac{NF}{M} \left( 1 - \frac{M}{N} \right) \left( 1 - \left( 1 - \frac{M}{N} \right)^{K} \right),$$
where the last step follows as $\sum_{j=0}^{K} \binom{K}{j} \left( \frac{M}{N} \right)^{j} \left( 1 - \frac{M}{N} \right)^{K - j} = \left( \frac{M}{N} + 1 - \frac{M}{N} \right)^{K} = 1$. Thus, we have given an alternate proof of the optimality of the decentralized scheme in [24].
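The chain of equalities above is straightforward to verify numerically. The short Python sketch below (with hypothetical $K$ and $M/N$, and loads normalized by $F$) evaluates the sum over subsets $Q$, grouped by size, and checks it against the closed form $\frac{N}{M}\left(1 - \frac{M}{N}\right)\left(1 - \left(1 - \frac{M}{N}\right)^K\right)$.

from math import comb, isclose

def bound_by_subset_size(K, mu):
    # Sum over Q of (K - |Q|)/(|Q| + 1) * mu^|Q| * (1 - mu)^(K - |Q|), normalized by F,
    # grouping the subsets Q of [K] by their size i (there are C(K, i) of each size).
    return sum(comb(K, i) * (K - i) / (i + 1) * mu**i * (1 - mu)**(K - i)
               for i in range(K + 1))

def closed_form(K, mu):
    # (N/M)(1 - M/N)(1 - (1 - M/N)^K), written in terms of mu = M/N.
    return (1 / mu) * (1 - mu) * (1 - (1 - mu)**K)

K, mu = 5, 0.4   # hypothetical: 5 clients, each caching a fraction M/N = 0.4 of every file
print(isclose(bound_by_subset_size(K, mu), closed_form(K, mu)))   # True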

3. Decentralized Coded Data Shuffling

In distributed machine learning systems consisting of a master and multiple worker nodes, data are distributed to the workers by the master in order to perform training of the machine learning model in a distributed manner. In general, this training process takes multiple iterations, with the workers doing some processing (like computing gradients) on their respective training data subsets. In order to ensure that the training data subset at each node is sufficiently representative of the data, and to improve the statistical performance of machine learning algorithms, shuffling of the training data between the worker nodes is implemented after every training iteration. This is known as data shuffling.
A coding theoretic approach to data shuffling, which involves the master communicating coded data to the workers was presented in [4]. The setting in [4] was centralized, which meant that there is a master node communicating to the servers to perform the data shuffling.
The work in [5] considered the data shuffling problem in which there is no master node, but the worker nodes exchange the training data among themselves, without involving the master node, to create a new desired partition in the next iteration. This was termed as decentralized data shuffling in [5]. Note that these notions of “centralized” and “decentralized” in the data shuffling problem are different from those in the coded caching [24], in which these terms were used to define the deterministic and random design of the caching phase, respectively. In this section, we look at the work in [5] and give a new simpler proof of the lower bound on the communication load for decentralized data shuffling.
We first review the setting in [5]. Consider $K$ workers in the system, where each worker node is required to process $q$ data units at any given time. The total dataset $F_1 \cup \cdots \cup F_N$ consists of $N = Kq$ data units $F_1, \ldots, F_N$, with a size of $B$ bits per data unit. The collection of data units to be processed by worker node $k$ at time $t$ is denoted as $A_{k,t}$. The collection of data units $A_{1,t}, \ldots, A_{K,t}$ must form a partition of the dataset $F_1 \cup \cdots \cup F_N$ for every time instant $t$, i.e., for any time $t$ and any choice of $k, k' \in [K]$ with $k \neq k'$ we have
$$A_{k,t} \subseteq F_1 \cup \cdots \cup F_N, \quad |A_{k,t}| = qB, \quad \text{and} \quad A_{k,t} \cap A_{k',t} = \emptyset.$$
Each node $k$ has a local cache of size $MB$ bits (such that $q \leq M \leq Kq$) that can hold $M$ data units. Out of these $M$ units, $q$ units are the current “active” data $A_{k,t}$ at any time step, which are required to be processed by the node $k$. The contents of the cache of node $k$ at time $t$ are denoted as $Z_{k,t}$. Therefore, for each choice of $k \in [K]$ and any time $t$, we have
$$|Z_{k,t}| = MB \quad \text{and} \quad A_{k,t} \subseteq Z_{k,t}.$$
At each time instance $t$, a new partition $\{A_{k,t} : k \in [K]\}$ is to be made active at the nodes $[K]$, where this new partition is made known to the workers only at time step $t$. Note that the contents of the nodes at time $t-1$ are $Z_{1,t-1}, \ldots, Z_{K,t-1}$, and the active partition at time $t-1$ is $A_{1,t-1}, \ldots, A_{K,t-1}$. The worker nodes communicate with each other over a common broadcast link, as shown in Figure 2, to achieve the new partition. The decentralized data shuffling problem is to find a delivery scheme (between workers) to shuffle the collection of active data units $\{A_{k,t-1} : k \in [K]\}$ to a new partition $\{A_{k,t} : k \in [K]\}$. Each worker $k$ computes a function $\phi_k(Z_{k,t-1})$ of its cache contents and broadcasts it to the other workers. Using these transmissions and the locally available cache content $Z_{k,t-1}$, each node $k$ is required to decode $A_{k,t}$. As in the case of coded caching, one seeks to reduce the worst-case communication load by designing the initial storage and coded transmissions carefully. The communication load of this data shuffling scheme, denoted by $L_{ds}$, is the sum of the number of bits broadcast by all the $K$ nodes in the system, i.e., $L_{ds} = \sum_{k \in [K]} |\phi_k(Z_{k,t-1})|$. The optimal communication load of data shuffling $L_{ds}^*$ (for the worst case data shuffle) is defined as
$$L_{ds}^* = \min \; \max \; L_{ds},$$
where the maximization is over all possible choices for $\{A_{k,t-1} : k \in [K]\}$ and $\{A_{k,t} : k \in [K]\}$, and the minimization is over all possible choices for the cache placement $\{Z_{k,t-1} : k \in [K]\}$ and the delivery scheme $\{\phi_k : k \in [K]\}$.
For the above setting, the following bound on the communication load $L_{ds}^*$ was shown in [5].
$$L_{ds}^* \geq \frac{KqB}{K-1} \cdot \frac{K - M/q}{M/q}.$$
The above bound was shown to be optimal for some special cases of the parameters, and order-optimal otherwise.

Proof of the Decentralized Data Shuffling Converse

We now recover the bound (26) by a simple proof using our generic lower bound in Theorem 1. We assume that the cache placement and delivery scheme of the data shuffling scheme are designed such that the communication load of the data shuffling scheme is exactly equal to L d s * . We proceed as per the three steps in Section 1.2.
Applying Theorem 1: For $k \in [K]$ and $Q \subseteq [K]$, let $A_{k,t}^Q$ denote the subset of bits of $A_{k,t}$ available exactly at the nodes in $Q$ and not anywhere else. Note that $|A_{k,t}^Q| = 0$ if $Q = \emptyset$, as each bit is necessarily present in at least one of the $K$ nodes.
As per our bound in Theorem 1, we have
$$L_{ds}^* = L_{ds} \geq \sum_{k \in [K]} \sum_{Q \subseteq [K] \setminus k} \frac{|A_{k,t}^Q|}{|Q|}.$$
Symmetrization by averaging over appropriately chosen set of shuffles: Let the set of circular permutations of $(1, 2, \ldots, K)$, apart from the identity permutation, be denoted by $\Gamma$. There are $K - 1$ of them clearly. We denote an arbitrary permutation in $\Gamma$ by $\gamma$, and by $\gamma_k$ we denote the $k$-th coordinate of $\gamma$.
Now, consider the shuffle given by $\gamma \in \Gamma$, i.e., for each $k$, $A_{k,t} = A_{\gamma_k, t-1}$. For this shuffle, we have by the above equation that
$$L_{ds}^* \geq \sum_{k \in [K]} \sum_{Q \subseteq [K] \setminus k} \frac{|A_{\gamma_k, t-1}^Q|}{|Q|}$$
$$= \sum_{Q \subseteq [K]} \sum_{k \in [K] \setminus Q} \frac{|A_{\gamma_k, t-1}^Q|}{|Q|}.$$
Now, averaging (28) over all permutations in Γ , we get
$$L_{ds}^* \geq \frac{1}{K-1} \sum_{\gamma \in \Gamma} \sum_{Q \subseteq [K]} \sum_{k \in [K] \setminus Q} \frac{|A_{\gamma_k, t-1}^Q|}{|Q|}$$
$$= \frac{1}{K-1} \sum_{Q \subseteq [K]} \sum_{k \in [K] \setminus Q} \sum_{\gamma \in \Gamma} \frac{|A_{\gamma_k, t-1}^Q|}{|Q|}.$$
As we go through all choices of $\gamma \in \Gamma$, we see that $\gamma_k$ takes every value except $k$, i.e., $\gamma_k$ assumes each value in $[K] \setminus k$ exactly once. Moreover, $A_{k,t-1}^Q$ is the collection of bits of $A_{k,t-1}$ present only in $Q$. However, the bits $A_{k,t-1}$ are already present at node $k$. Hence, $|A_{k,t-1}^Q| = 0$ if $k \notin Q$. Therefore, we have
$$L_{ds}^* \geq \frac{1}{K-1} \sum_{Q \subseteq [K]} \sum_{k \in [K] \setminus Q} \sum_{k' \in Q} \frac{|A_{k', t-1}^Q|}{|Q|}$$
$$= \frac{1}{K-1} \sum_{Q \subseteq [K]} \sum_{k' \in Q} \frac{|A_{k', t-1}^Q| \, (K - |Q|)}{|Q|}$$
$$= \frac{1}{K-1} \sum_{k \in [K]} \sum_{Q \subseteq [K] : k \in Q} \frac{|A_{k, t-1}^Q| \, (K - |Q|)}{|Q|}.$$
Refining the bound using setting constraints and convexity: Now, we have the following observations, as $\left\{ A_{k,t-1}^Q : k \in [K], \; Q \subseteq [K] \text{ with } k \in Q \right\}$ forms a partition of all the $NB$ bits.
$$\sum_{Q \subseteq [K]} \sum_{k \in Q} |A_{k,t-1}^Q| = NB = KqB, \qquad \sum_{Q \subseteq [K]} \sum_{k \in Q} |A_{k,t-1}^Q| \, |Q| \leq KMB.$$
Utilizing the above, and the fact that $\frac{K - |Q|}{|Q|}$ is a convex decreasing function in $|Q|$ (for $|Q| > 0$), we have
$$L_{ds}^* \geq \frac{KqB}{K-1} \cdot \frac{K - \sum_{Q \subseteq [K]} \sum_{k \in Q} \frac{|A_{k,t-1}^Q|}{NB} |Q|}{\sum_{Q \subseteq [K]} \sum_{k \in Q} \frac{|A_{k,t-1}^Q|}{NB} |Q|}$$
$$\geq \frac{KqB}{K-1} \cdot \frac{K - KM/N}{KM/N}$$
$$= \frac{KqB}{K-1} \cdot \frac{K - M/q}{M/q}.$$
Thus, we have recovered (26).
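For concreteness, the following Python sketch (with hypothetical parameter values; the function name is ours) evaluates the recovered bound (26). In the minimum-memory case $M = q$ it equals $KqB = NB$, i.e., the load is at least the total size of the dataset.

from fractions import Fraction

def shuffling_lower_bound(K, q, M, B):
    # Bound (26): (K q B / (K - 1)) * (K - M/q) / (M/q), valid for q <= M <= K q.
    m = Fraction(M, q)
    return Fraction(K * q * B, K - 1) * (K - m) / m

# Hypothetical values: with minimum memory M = q, the bound equals K q B = N B.
print(shuffling_lower_bound(K=4, q=1, M=1, B=100))   # 400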
Remark 5.
We have considered the decentralized version of the coded data shuffling problem in this subsection. The centralized version of the data shuffling problem was introduced in [4] and its information theoretic limits were studied elaborately in [6]. Our data exchange bound, when applied to the setting in [6], results in a looser converse result than that in [6]. The reasons for this are explored in Section 5, using the connection between our data exchange bound and a known lower bound for index coding.

4. Coded Distributed Computing

In a distributed computing setting, there are N files on which the distributed computing task has to be performed by K nodes. The job at hand is divided into three phases: Map, Shuffle, and Reduce. In the shuffle phase, the nodes that are assigned to perform the distributed computing task exchange data. In [3], the authors proposed coded communication during the shuffle phase to reduce the communication load. We recollect the setting and the main converse result from [3], which we recover using our data exchange bound.
A subset $\mathcal{M}_i$ of the $N$ files is assigned to the $i$-th node, and the $i$-th node computes the map functions on this subset in the map phase (see Figure 3). We assume that the total number of map functions computed at the $K$ nodes is $rN$, where $r$ is referred to as the computation load. In the reduce phase, a total of $W$ reduce functions is to be computed across the $K$ nodes corresponding to the $N$ files. Each node is assigned the same number of functions. Obtaining the output of the reduce functions at all the nodes will complete the distributed computing task. In this work, as in [3], we consider two scenarios: in the first one, each reduce function is computed exactly at one node and in the second, each reduce function is computed at $s$ nodes, where $s \geq 2$.
Each map function output (also referred to as intermediate output) corresponds to a particular file and a particular reduce function. For each file and each reduce function, an intermediate output of T bits is obtained. To compute an assigned reduce function, each node requires the intermediate outputs of all the files corresponding to the assigned reduce function. This means each node is missing the intermediate outputs (corresponding to the assigned reduce functions) of those files that are not assigned to it in the map phase.
The intermediate outputs of each file assigned to node $i$, corresponding to all the $W$ reduce functions, are available at node $i$ at the end of the map phase and are denoted by $v_{1:W, \mathcal{M}_i}$. These intermediate outputs at the end of the map phase are encoded as follows: $X_i = \phi_i(v_{1:W, \mathcal{M}_i})$, and broadcast to the remaining nodes in the shuffle phase (in order to deliver the missing intermediate outputs at the nodes). Let $L_{dc}^*$ be the total number of bits broadcast by the $K$ nodes in the shuffle phase, minimized over all possible map function assignments, reduce function assignments, and shuffling schemes, with a computation load $r$. We refer to $L_{dc}^*$ as the minimum communication load.
To obtain similar expressions for the communication load as in [3], we normalize the communication load by the total number of intermediate output bits (= W N T ). We consider the first scenario now, where each reduce function is computed exactly at one node.
Theorem 3
([3]). The minimum communication load $L_{dc}^*$ incurred by a distributed computing system of $K$ nodes for a given computation load $r$, where every reduce function is computed at exactly one node and each node computes $\frac{W}{K}$ reduce functions, is bounded as
$$\frac{L_{dc}^*}{WNT} \geq \frac{1}{r} \left( 1 - \frac{r}{K} \right).$$
Proof. 
We resort to two of the three steps of Section 1.2 to complete this proof. The symmetrization step, which involves averaging over demand configurations, is not applicable in the present setting because the definition of L d c * involves minimization over the reduce function assignment as well.
Applying Theorem 1: Let $\mathcal{M} = (\mathcal{M}_1, \ldots, \mathcal{M}_K)$ denote a given map function assignment to the nodes, where $\mathcal{M}_i \subseteq [N]$. Let $L_{\mathcal{M}}$ denote the communication load associated with the map function assignment $\mathcal{M}$. We will prove that
$$\frac{L_{\mathcal{M}}}{WNT} \geq \sum_{j=1}^{K} \frac{\tilde{a}_j^{\mathcal{M}}}{N} \cdot \frac{K - j}{Kj},$$
where $\tilde{a}_j^{\mathcal{M}}$ denotes the number of files which are mapped at exactly $j$ nodes in $[K]$. It is easy to see that $\sum_{j=1}^{K} \tilde{a}_j^{\mathcal{M}} = N$ and $\sum_{j=1}^{K} j \, \tilde{a}_j^{\mathcal{M}} = rN$. We will apply Theorem 1 to this setting. Recall that each reduce function is computed exactly at one node in our present setup. To apply Theorem 1, we need to ascertain the quantities $a_P^Q$ for $P, Q$ being disjoint subsets of $[K]$. To do this, we first denote by $\tilde{a}^Q$ the number of files whose intermediate outputs are demanded by some node $k$ and available exclusively in the nodes of $Q$. Note that $\tilde{a}^Q$ is the same for any $k \in [K] \setminus Q$, as each node demands intermediate outputs of all the files that are not mapped at the node itself.
As the number of reduce functions assigned to node $k$ is $\frac{W}{K}$ (as each reduce function is computed at exactly one node) and each intermediate output is $T$ bits, the number of intermediate output bits which are demanded by any node $k$ and available exclusively in the nodes of $Q$ is $\frac{WT}{K} \tilde{a}^Q$. Thus, for any $Q \subseteq [K]$, the quantities $a_P^Q$ in Theorem 1 are given as follows.
$$a_P^Q = \begin{cases} \frac{WT}{K} \, \tilde{a}^Q & \text{if } P = \{k\} \text{ for some } k \in [K] \text{ such that } k \notin Q, \\ 0 & \text{otherwise.} \end{cases}$$
Further note that $\sum_{Q \subseteq [K] : |Q| = j} \tilde{a}^Q = \tilde{a}_j^{\mathcal{M}}$ by definition of $\tilde{a}_j^{\mathcal{M}}$. Using these and applying Theorem 1 with the normalization factor $WNT$, we have the following inequalities.
$$\frac{L_{\mathcal{M}}}{WNT} \geq \frac{1}{WNT} \sum_{k=1}^{K} \sum_{Q \subseteq [K] \setminus \{k\}} \frac{1}{|Q|} \, \tilde{a}^Q \, \frac{WT}{K} = \frac{1}{KN} \sum_{j=1}^{K} \sum_{Q \subseteq [K] : |Q| = j} \sum_{k \in [K] \setminus Q} \frac{1}{j} \, \tilde{a}^Q = \frac{1}{KN} \sum_{j=1}^{K} \sum_{Q \subseteq [K] : |Q| = j} \frac{K - j}{j} \, \tilde{a}^Q = \frac{1}{KN} \sum_{j=1}^{K} \frac{K - j}{j} \sum_{Q \subseteq [K] : |Q| = j} \tilde{a}^Q = \frac{1}{KN} \sum_{j=1}^{K} \frac{K - j}{j} \, \tilde{a}_j^{\mathcal{M}}.$$
Refining the bound using convexity and setting constraints: Using the definition of $L_{dc}^*$, noting that $\frac{K - j}{j}$ is a convex decreasing function of $j$ and that $\sum_{j=1}^{K} \frac{\tilde{a}_j^{\mathcal{M}}}{N} = 1$, we have that
$$\frac{L_{dc}^*}{WNT} \geq \frac{1}{K} \cdot \frac{K - \sum_{j=1}^{K} j \, \frac{\tilde{a}_j^{\mathcal{M}}}{N}}{\sum_{j=1}^{K} j \, \frac{\tilde{a}_j^{\mathcal{M}}}{N}} = \frac{1}{K} \cdot \frac{K - r}{r} = \frac{1}{r} \left( 1 - \frac{r}{K} \right). \quad \square$$
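The per-assignment bound and the final expression can be checked against each other in code. The Python sketch below (with illustrative values; the function names are ours) evaluates the normalized bound $\frac{1}{KN}\sum_j \frac{K-j}{j}\tilde{a}_j^{\mathcal{M}}$ for a map assignment in which every file is mapped at exactly $r$ nodes and confirms that it matches $\frac{1}{r}\left(1 - \frac{r}{K}\right)$.

from fractions import Fraction

def per_assignment_bound(K, a_tilde):
    # (1/(K N)) * sum_j ((K - j)/j) * a_tilde[j], where a_tilde[j] is the number of
    # files mapped at exactly j of the K nodes; this bounds L_M / (W N T) from below.
    N = sum(a_tilde.values())
    return sum(Fraction((K - j) * cnt, K * N * j) for j, cnt in a_tilde.items())

def theorem3_bound(K, r):
    # (1/r)(1 - r/K), the expression obtained after applying Jensen's inequality.
    return Fraction(1, r) * (1 - Fraction(r, K))

# Hypothetical symmetric assignment: every one of the N files is mapped at exactly r nodes.
K, N, r = 6, 20, 3
print(per_assignment_bound(K, {r: N}), theorem3_bound(K, r))   # 1/6 1/6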
Now, we consider the case in which each reduce function has to be computed at $s$ nodes. The total number of reduce functions is assumed to be $W$. In addition, the following assumption is made to keep the problem formulation symmetric with respect to reduce functions: every possible $s$-sized subset of the $K$ nodes is assigned $\frac{W}{\binom{K}{s}}$ reduce functions (we assume $\binom{K}{s}$ divides $W$). As in the previous case, we will denote the communication load for a given map function assignment by $L_{\mathcal{M}}(s)$ and the optimal communication load with computation load $r$ by $L_{dc}^*(s)$. We will prove the following result which gives a lower bound on $L_{\mathcal{M}}(s)$.
Proposition 2
([3]). The communication load corresponding to a map function assignment M when each reduce function has to be computed at s nodes is lower bounded as
$$\frac{L_{\mathcal{M}}(s)}{WNT} \geq \sum_{j=1}^{K} \frac{\tilde{a}_j^{\mathcal{M}}}{N} \sum_{l = \max(0, s-j)}^{\min(K-j, s)} \frac{\binom{K-j}{l} \binom{j}{s-l}}{\binom{K}{s}} \cdot \frac{l}{l + j - 1}.$$
Proof. 
As before, we will denote by $\tilde{a}^Q$ the number of files whose map function outputs are available exclusively in the nodes of $Q$. Furthermore, we will denote the number of intermediate output bits which are demanded exclusively by the nodes in $P$ and available exclusively in the nodes of $Q$ by $b_P^Q$. Then, applying Theorem 1, the lower bound on the communication load in terms of $\{b_P^Q\}$ is given by
$$\frac{L_{\mathcal{M}}(s)}{WNT} \geq \frac{1}{WNT} \sum_{P \subseteq [K]} \sum_{Q \subseteq [K] \setminus P} \frac{|P|}{|P| + |Q| - 1} \, b_P^Q.$$
We first interchange the above summation order and consider all sets $Q$ with $|Q| = j$ and all sets $P$ such that $|P| = l$. For $|Q| = j$, we need to count the subsets of size $s$ which form a subset of $P \cup Q$. Thus, for a fixed $j$, we can see that the range of $l$ can vary from $\max(0, s-j)$ to $\min(K-j, s)$. For a given subset $P$ of size $l$, the number of $s$-sized subsets which are contained within $P \cup Q$ and contain $P$ is $\binom{j}{s-l}$. Therefore, the number of intermediate output bits demanded exclusively by the nodes in $P$ and available exclusively in $Q$, $b_P^Q$, is given by $b_P^Q = \tilde{a}^Q \, \frac{WT}{\binom{K}{s}} \binom{j}{s-l}$. This is because each such $s$-sized subset has to reduce $\frac{W}{\binom{K}{s}}$ functions, for each of the $\tilde{a}^Q$ files. Using this relation, the above inequality can be rewritten as follows.
$$\frac{L_{\mathcal{M}}(s)}{WNT} \geq \sum_{j=1}^{K} \frac{1}{N} \sum_{Q \subseteq [K] : |Q| = j} \tilde{a}^Q \sum_{l = \max(0, s-j)}^{\min(K-j, s)} \frac{l}{l + j - 1} \sum_{P \subseteq [K] \setminus Q : |P| = l} \frac{\binom{j}{s-l}}{\binom{K}{s}}$$
$$= \sum_{j=1}^{K} \frac{\tilde{a}_j^{\mathcal{M}}}{N} \sum_{l = \max(0, s-j)}^{\min(K-j, s)} \frac{l}{l + j - 1} \cdot \frac{\binom{K-j}{l} \binom{j}{s-l}}{\binom{K}{s}},$$
where (40) follows as $\tilde{a}_j^{\mathcal{M}} = \sum_{Q \subseteq [K] : |Q| = j} \tilde{a}^Q$. This completes the proof. □
The above proposition, along with certain convexity arguments resulting from the constraints imposed by the computation load, can be used to prove the lower bound on $L_{dc}^*(s)$. The interested reader is referred to the converse proof of Theorem 2 in [3] for the same.

5. Relation to Index Coding Lower Bound

We now consider the “centralized” version of the data exchange problem, where one of the nodes has a copy of all the information bits and is the lone transmitter in the system. We will use the index 0 for this server node, and assume that there are $K$ other nodes in the system, with index set $[K]$, acting as clients. In terms of Definition 1, this system is composed of $K+1$ nodes $\{0\} \cup [K]$, the demand $D_0$ of the server is empty, while the demands $D_i$ and the contents $C_i$ of all the clients are subsets of the contents of the server, i.e., $C_i, D_i \subseteq C_0$ for all $i \in [K]$. Without loss of generality, we assume that only the server performs all the transmissions, as any coded bit that can be generated by any of the client nodes can be generated at the server itself. Clearly, this is an index coding problem [26] with $K$ clients or receivers, the demand of the $i$-th receiver is $D_i$, and its side information is $C_i$. When applied to this scenario, our main result Theorem 1 therefore provides a lower bound on the index coding communication cost.
The maximum acyclic induced subgraph (MAIS) and its generalization, which is known as the generalized independence number or the α -bound, are well-known lower bounds in index coding [9,26]. In this section, we describe the relation between the α -bound of index coding and the centralized version of Theorem 1. We show that the latter is in general weaker, and identify the scenarios when these two bounds are identical. We then use these observations to explain why Theorem 1 cannot provide a tight lower bound for the centralized data shuffling problem [6].
Let us first apply Theorem 1 to the centralized data exchange problem. As node 0 contains all the information bits and its demand is empty, we have $a_P^{Q'} = 0$ if $0 \notin Q'$ or $0 \in P$. Writing $Q' = Q \cup \{0\}$ for $Q \subseteq [K]$ and defining the variable $c_P^Q \triangleq a_P^{Q \cup \{0\}} = a_P^{Q'}$, we obtain
Theorem 4.
The centralized version of our main result Theorem 1 is
$$L^* \geq \sum_{P \subseteq [K]} \; \sum_{\substack{Q' \subseteq \{0\} \cup [K] \\ 0 \in Q', \; P \cap Q' = \emptyset}} \frac{|P|}{|P| + |Q'| - 1} \, a_P^{Q'} = \sum_{P \subseteq [K]} \sum_{Q \subseteq [K] \setminus P} \frac{|P|}{|P| + |Q|} \, c_P^Q.$$
Note that it is possible to have $c_P^{\emptyset} = a_P^{\{0\}} > 0$ when $Q = \emptyset$.
In Section 5.1, we express the generalized independence number $\alpha$ in terms of the parameters $c_P^Q$, and in Section 5.2, we identify the relation between our lower bound Theorem 4 and the index coding lower bound $\alpha$.

5.1. The Generalized Independence Number Bound

Let $\gamma = (\gamma_1, \ldots, \gamma_K)$ be any permutation of $[K]$, where $\gamma_i$ is the $i$-th coordinate of the permutation. Applying similar ideas as in the proof of Theorem 1 to the centralized scenario, we obtain the following lower bound on $L^*$. This lower bound considers the nodes in the order $\gamma_1, \ldots, \gamma_K$, and for each node in this sequence it counts the number of bits that are demanded by this node which are neither demanded by nor available as side information in any of the earlier nodes.
Proposition 3.
For any permutation $\gamma$ of $[K]$,
$$L^* \geq \sum_{i=1}^{K} \; \sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \; \sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q.$$
Proof. 
See Appendix D. □
A direct consequence of Proposition 3 is
$$L^* \geq \max_{\gamma} \sum_{i=1}^{K} \; \sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \; \sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q,$$
where the maximization is over all possible permutations on [ K ] .
We now recall the definition of the generalized independence number [9]. Denote the collection of the $c_P^Q$ information bits available exclusively at the nodes $Q \cup \{0\}$ and demanded exclusively by the nodes $P$ as $\{w_{P,m}^Q : m = 1, \ldots, c_P^Q\}$. Therefore, the set of all the information bits present in the system is
$$B = \bigcup_{P \subseteq [K]} \; \bigcup_{Q \subseteq [K] \setminus P} \left\{ w_{P,m}^Q : m = 1, \ldots, c_P^Q \right\}.$$
Note that each bit is identified by a triple ( P , Q , m ) .
Definition 2.
A subset $H$ of $B$ is a generalized independent set if and only if every nonempty subset $I \subseteq H$ satisfies the following:
  • there exists a node k [ K ] and an information bit in I such that this information bit is demanded by k (and possibly some other nodes), and none of the other bits in I are available as side information at k.
The generalized independence number α is the size of the largest generalized independent set.
We next show that the lower bound in (42) is in fact equal to the generalized independence number α of this index coding problem.
Theorem 5.
The generalized independence number α satisfies
$$\alpha = \max_{\gamma} \sum_{i=1}^{K} \; \sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \; \sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q,$$
where the maximization is over all K ! permutations of [ K ] .
Proof. 
See Appendix E. □

5.2. Relation to the Index Coding Lower Bound

Proposition 3 serves as the platform for comparing Theorem 4 and the α -bound. While α equals the maximum value of the bound in Proposition 3 over all permutations on [ K ] , our bound in Theorem 4 equals the average value of the lower bound given in Proposition 3 over all permutations on [ K ] . We will show this relation between Theorem 4 and Proposition 3 now.
Taking the average of the right hand side of (41) with respect to all γ , we obtain
$$\frac{1}{K!} \sum_{\gamma} \sum_{i=1}^{K} \; \sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \; \sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q.$$
For each choice of $P, Q \subseteq [K]$ with $P \cap Q = \emptyset$, we now count the number of times $c_P^Q$ appears in this sum. For a given $\gamma$, the inner summations include the term $c_P^Q$ if and only if the following holds:
$$\gamma_i \in P, \quad \text{where } i = \min\{ j \in [K] : \gamma_j \in P \cup Q \},$$
i.e., if we consider the elements $\gamma_1, \ldots, \gamma_K$ in that order, the first element from $P \cup Q$ to be observed in this sequence belongs to $P$. Thus, for a given pair $P, Q$, the probability that a permutation $\gamma$ chosen uniformly at random includes the term $c_P^Q$ in the inner summation is $|P| / (|P| + |Q|)$. Therefore, the average of the lower bound in Proposition 3 over all possible $\gamma$ is
$$\sum_{P \subseteq [K]} \; \sum_{\substack{Q \subseteq [K] \\ P \cap Q = \emptyset}} \frac{|P|}{|P| + |Q|} \, c_P^Q,$$
which is exactly the bound in Theorem 4.
As the bound in Theorem 4 is obtained by averaging over all γ , instead of maximizing over all γ , we conclude that this is in general weaker than the α -bound of index coding. The two bounds are equal if and only if the bound in Proposition 3 has the same value for every permutation γ .
Although weaker in general, we note that the bound of Theorem 4 is easier to use than the α -bound. As demonstrated by (2), in order to use Theorem 4, we only need to know, for each information bit, the number of nodes that contain this bit and the number of nodes that demand this bit. In comparison, this information is insufficient to evaluate the α -bound, which also requires the identities of these nodes.

5.3. On the Tightness of Theorem 4

We now consider the class of unicast problems, i.e., problems where each bit is demanded by exactly one of the nodes. For this class of problems, we characterize when Theorem 4 yields a tight bound.
Theorem 6.
For unicast problems, the bound in Theorem 4 equals $L^*$ if and only if every $S \subseteq [K]$ with $|S| \geq 2$ satisfies $c_{\{k\}}^{S \setminus k} = c_{\{k'\}}^{S \setminus k'}$ for every $k, k' \in S$.
Proof. 
See Appendix F. When the lower bound of Theorem 4 is tight, the clique-covering based index coding scheme (see [26,27]) yields the optimal communication cost. □
Our main result in Theorem 1, or equivalently, Theorem 4, does not provide a tight lower bound for the centralized data shuffling problem [6], because this problem involves scenarios that do not satisfy the tightness condition of Theorem 6. For instance, consider the simple canonical data shuffling setting, where the system has exactly $K$ files, all of equal size $F$ bits, and each node stores exactly one of these files, i.e., the entire contents $C_k$ of the $k$th node is the $k$th file. Here, $|C_k| = F$ for all $k \in [K]$, and $C_i \cap C_j = \emptyset$ for all $i \neq j$. Assume that the shuffling problem is to move the file $C_{k+1}$ to node $k$, i.e., $D_k = C_{k+1}$, where the index $K+1$ is identified with $1$. This is a worst-case demand for data shuffling, incurring the largest possible communication cost. For this set of demands, we have $c_{\{k\}}^{\{k+1\}} = F$ for all $k \in [K]$, and $c_{\{k\}}^{Q} = 0$ for all other choices of $k, Q$. In particular, $c_{\{k+1\}}^{\{k\}} = 0 \neq c_{\{k\}}^{\{k+1\}}$. Clearly, the condition in Theorem 6 does not hold for $S = \{k, k+1\}$. Therefore, our lower bound is strictly less than $L^*$ for this data shuffling problem, and hence is not tight.
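As a concrete check (our own worked computation for this example): for $K = 3$, Theorem 4 evaluates to $\sum_{k} \tfrac{1}{2} c_{\{k\}}^{\{k+1\}} = 3F/2$. On the other hand, the two server transmissions $C_1 \oplus C_2$ and $C_2 \oplus C_3$ already satisfy all demands (node 1 recovers $C_2$, node 2 recovers $C_3$, and node 3 recovers $C_2$ and then $C_1$), and for this directed-cycle side-information structure the optimal load is $(K-1)F = 2F$: the largest acyclic induced subgraph has $K-1$ vertices, and the cyclic XOR scheme above achieves this. More generally, the bound $KF/2$ falls short of $(K-1)F$ by a factor approaching $2$ as $K$ grows.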

6. Relationship to Other Index Coding Settings

We now comment on the application of our data exchange bound to two other important index coding settings known in the literature: (a) distributed index coding, studied in [10] (which is equivalent to the cooperative multi-sender index coding setting considered in [11]), and (b) embedded index coding, presented in [12].

6.1. Distributed Index Coding

In [10], the authors consider a generalization of the single-server index coding problem (which we studied in Section 5) called distributed index coding. The specific setting in [10] is as follows. There are $n$ messages denoted by $x_j : j \in [n]$, where $x_j \in \{0,1\}^{t_j}$ (for some positive integer $t_j$). There is a corresponding set of $n$ receivers indexed by $[n]$. Receiver $j \in [n]$ contains as side information the subset of messages indexed by $A_j \subseteq [n]$ (i.e., receiver $j$ knows $\{x_i : i \in A_j\}$) and demands the message $x_j$. There are $2^n - 1$ servers in the system, indexed by the sets $\mathcal{J} = \{J : J \subseteq [n], J \neq \emptyset\}$. The server $J$ contains the messages $\{x_i : i \in J\}$. The servers do not demand any messages and are responsible only for transmissions that satisfy the receivers. The server $J$ is connected to the $n$ receivers via a broadcast link with capacity $C_J$ bits. In order to satisfy the demands, each server $J$ sends a message $y_J \in \{0,1\}^{s_J}$ to all the receivers, where $s_J$ is some positive integer.
Definition 3
([10]). The rate-capacity tuple $((R_j : j \in [n]), (C_J : J \in \mathcal{J}))$ is said to be achievable if there exists some positive integer $r$ such that $t_j \geq r R_j$ for all $j$ and $s_J \leq r C_J$ for all $J$, and there exist valid encoding functions (encoding the messages of lengths $(t_j : j \in [n])$ into codewords of lengths $(s_J : J \in \mathcal{J})$) and decoding functions such that all receivers can decode their respective demands.
Slightly abusing Definition 2, for some $T \subseteq [n]$, we call a set $S \subseteq T$ of message indices a generalized independent set of $T$ if, for every subset $S' \subseteq S$, there is some $j \in S'$ such that $A_j \cap (S' \setminus j) = \emptyset$.
Let $((R_j : j \in [n]), (C_J : J \in \mathcal{J}))$ be an achievable rate-capacity tuple. For any non-empty subset $T \subseteq [n]$, let $S_T$ be a generalized independent set of $T$. In Corollary 2 of [10], it is shown that
$$\sum_{j \in S_T} R_j \;\leq\; \sum_{J : J \cap T \neq \emptyset} C_J. \tag{44}$$
Remark 6.
The above bound in (44) is given in [10] using the terminology of the side-information graph defining the index coding problem and its acyclic induced subgraphs; here we have used generalized independent sets to state the same bound. The reader can easily confirm that an acyclic induced subgraph of the side-information graph, as defined in [10], is the same as a generalized independent set as used in this work. Therefore, (44) is the same as the bound in Corollary 2 of [10].
Let
$$S_{\max} = \arg\max_{S} \sum_{j \in S} t_j,$$
where the maximization is over all generalized independent sets $S$ of $[n]$.
Then, by (44), we have
$$\sum_{j \in S_{\max}} R_j \;\leq\; \sum_{J \in \mathcal{J}} C_J. \tag{45}$$
In order to relate the bound in (44) with our data exchange bound, we fix $R_j = t_j$ for all $j \in [n]$. This means that we should have $r = 1$ in Definition 3. For these parameters, let $(s_J^* : J \in \mathcal{J})$ be a choice of integers $(s_J : J \in \mathcal{J})$ such that the rate-capacity tuple $((R_j = t_j : j \in [n]), (s_J : J \in \mathcal{J}))$ is achievable and $\sum_{J \in \mathcal{J}} s_J$ is minimized. Note that such integers $(s_J^* : J \in \mathcal{J})$ exist, as each index coding problem has at least one solution, namely the trivial solution consisting of uncoded transmissions of $x_j : j \in [n]$.
Then, applying (45), we have
$$\sum_{j \in S_{\max}} t_j \;\leq\; \sum_{J \in \mathcal{J}} s_J^*. \tag{46}$$
Note that $\sum_{J \in \mathcal{J}} s_J^*$ is exactly the minimum total number of bits the servers must communicate to satisfy the receiver demands.
For $Q \subseteq [n]$, define
$$f_j^Q \;\triangleq\; \begin{cases} 1 & \text{if } j \in \left( \bigcap_{k \in Q} A_k \right) \setminus \left( \bigcup_{k \in [n] \setminus Q} A_k \right), \\ 0 & \text{otherwise}. \end{cases}$$
By arguments similar to those in the proof of Theorem 5, we can verify that
$$\sum_{j \in S_{\max}} t_j = \max_{\gamma} \sum_{j=1}^{n} \;\sum_{Q \subseteq \{\gamma_{j+1}, \ldots, \gamma_n\}} f_{\gamma_j}^Q \, t_{\gamma_j}, \tag{47}$$
where the maximization is over all possible permutations $\gamma = (\gamma_1, \ldots, \gamma_n)$ of $(1, \ldots, n)$. We thus have, by (46) and (47),
$$\sum_{J \in \mathcal{J}} s_J^* \;\geq\; \max_{\gamma} \sum_{j=1}^{n} \;\sum_{Q \subseteq \{\gamma_{j+1}, \ldots, \gamma_n\}} f_{\gamma_j}^Q \, t_{\gamma_j}. \tag{48}$$
Finally, we apply our data exchange bound in Theorem 1 to the distributed index coding setting. To do this, we first observe that if we replaced all servers by a single "virtual" central server containing all the messages $x_j : j \in [n]$, then $\sum_{J \in \mathcal{J}} s_J^*$ is at least the minimum number of bits that this virtual central server must transmit to satisfy the receiver demands. Any lower bound on the communication cost for this transformed setting with the virtual server therefore continues to apply to the original distributed setting with messages of lengths $t_j : j \in [n]$. Now, utilizing the centralized version of Theorem 1 shown in Theorem 4, and by the discussion in Section 5.2, we get
$$\sum_{J \in \mathcal{J}} s_J^* \;\geq\; \frac{1}{n!} \sum_{\gamma} \sum_{j=1}^{n} \;\sum_{Q \subseteq \{\gamma_{j+1}, \ldots, \gamma_n\}} f_{\gamma_j}^Q \, t_{\gamma_j}. \tag{49}$$
Therefore, we see that the generalized independent set based bound in (48) is in general better than (49), as (48) involves a maximization over all permutations γ , while (49) involves the average.
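To make the comparison concrete, the following sketch is our own toy instance of this setting; the side-information sets, message lengths, and the instance itself are made up for illustration and are not taken from [10].

```python
from itertools import permutations

# Toy distributed index coding instance: n = 3 receivers, receiver j demands
# x_j of length t[j] bits and has side information indexed by A[j].
n = 3
t = {1: 2, 2: 1, 3: 1}
A = {1: {2}, 2: {3}, 3: set()}

def f(j, Q):
    """Indicator that message x_j is side information exactly at the receivers in Q."""
    return (all(j in A[k] for k in Q)
            and all(j not in A[k] for k in set(range(1, n + 1)) - Q))

def inner(gamma):
    """Inner sum of (48)/(49) for one permutation gamma."""
    total = 0
    for i, g in enumerate(gamma):
        tail = list(set(gamma[i + 1:]))
        for mask in range(1 << len(tail)):          # all subsets Q of the strict tail
            Q = {tail[b] for b in range(len(tail)) if (mask >> b) & 1}
            if f(g, Q):
                total += t[g]
    return total

perms = list(permutations(range(1, n + 1)))
print(max(inner(g) for g in perms))                  # right-hand side of (48)
print(sum(inner(g) for g in perms) / len(perms))     # right-hand side of (49)
```

On such small instances one can see directly how much is lost by averaging over permutations rather than maximizing.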

6.2. Embedded Index Coding

We now consider the embedded index coding problem, introduced in [12] and motivated by device-to-device communications. The embedded index coding setting consists of a set of m data blocks (each a binary vector of length t) distributed across a set of n nodes. Each node stores (as side information) a subset of the data blocks and demands another subset that it does not already have. This setting differs from [26,27] and from distributed index coding [10], as there are no dedicated servers here by default. Each node transmits a codeword obtained by encoding its data blocks, and each demanded data block at any node is decoded from the codewords received from other nodes together with the side information at the node itself. An embedded index code consists of a collection of such encoding and decoding functions at the nodes, such that all demanded blocks are decoded at the respective nodes. The communication cost of embedded index coding is the total number of bits transmitted between the nodes to satisfy the node demands. The work [12] generalizes the notion of minrank [26] of single-server index codes to define the optimal length of linear embedded index codes. Further, the authors also present heuristic constructions for general and specialized linear embedded index codes with desirable properties.
As the embedded index coding problem has a direct mapping to the data exchange problem considered in the present work, we can apply our data exchange bound directly to obtain a new lower bound on the communication cost of embedded index coding. The expression of this bound is of the same form (up to a change in notation) as Theorem 1 itself. As our bound holds in the information-theoretic sense, it applies not just to the linear codes considered in [12] but to nonlinear embedded index codes as well.

7. Conclusions

We have presented an information theoretic converse result for a generic data exchange problem, where the terminals contain some data in their local storage and want other data available at the local storage of other nodes. As a number of recently studied multi-terminal communication problems fall under this setting, we have used our general converse to obtain converses in many such settings, thus recovering many existing results and presenting some new results as well. Using a connection with index coding, we also presented some ideas on why and when our data exchange based converse can be loose in the index coding setting. It would be quite interesting to see if our converse result can be tightened further while still retaining a closed form expression, so as to cover all known bounds for any existing setting that can be modeled in the data exchange framework. A lower bound for the communication load in a generic data exchange setting in the presence of coded storage bits would also be a prospective direction for future research in this area.

Author Contributions

Conceptualization, P.K. and L.N.; formal analysis, P.K., L.N., V.L.; writing—original draft preparation, P.K., L.N., V.L.; writing—review and editing, P.K., L.N., V.L.; funding acquisition, P.K., L.N., V.L. All authors have read and agreed to the published version of the manuscript.

Funding

Prasad Krishnan was supported by Science and Engineering Research Board (SERB), Government of India via grants CRG/2019/005572 and MTR/2017/000827. Lakshmi Natarajan was supported by Science and Engineering Research Board (SERB-DST), Government of India (via projects MTR/2019/001454 and DST/INSPIRE/04/2015/002094).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are thankful to the anonymous reviewers and the academic editor for their careful reading of the manuscript and for comments that helped improve its quality.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

We assume that all the bits in the collection $B$ from Definition 1 are i.i.d. and uniformly distributed on $\{0,1\}$. For a given communication scheme for the given data exchange problem, let $X_i \triangleq \phi_i(C_i)$ denote the codeword transmitted by node $i$. For a subset $S \subseteq [K]$, let $X_S \triangleq \{X_i : i \in S\}$. Furthermore, let $Y_S = \bigcup_{i \in S} (D_i \cup C_i)$. We first prove the following claim.
Claim A1.
For any $S \subseteq [K]$,
$$H(X_S \,|\, Y_{\bar S}) \;\geq\; \sum_{P \subseteq S} \;\sum_{Q \subseteq S \setminus P} \frac{|P|}{|P| + |Q| - 1}\, a_P^Q, \tag{A1}$$
where $\bar{S} = [K] \setminus S$.
Applying Claim A1 with $S = [K]$ then gives Theorem 1, as $L^* \geq H(X_{[K]})$.
We now prove Claim A1 by induction on $|S|$.
We take the base case to be $|S| = 2$, since for $|S| = 1$ the data exchange problem is not well defined. Let $S = \{1, 2\}$ without loss of generality. Then, the LHS of (A1) satisfies
$$H(X_1, X_2 \,|\, Y_{\bar S}) = H(X_1 \,|\, Y_{\bar S}) + H(X_2 \,|\, Y_{\bar S}, X_1)$$
$$\geq\; H(X_1 \,|\, Y_{\bar S}, C_2) + H(X_2 \,|\, Y_{\bar S}, C_1) \tag{A2}$$
$$=\; H(X_1, D_2 \,|\, Y_{\bar S}, C_2) + H(X_2, D_1 \,|\, Y_{\bar S}, C_1) \tag{A3}$$
$$\geq\; H(D_2 \,|\, Y_{\bar S}, C_2) + H(D_1 \,|\, Y_{\bar S}, C_1) \tag{A4}$$
$$\geq\; a_{\{2\}}^{\{1\}} + a_{\{1\}}^{\{2\}}, \tag{A5}$$
where (A2) follows since conditioning reduces entropy and $H(X_1 \,|\, C_1) = 0$, and (A3) holds since $H(D_2 \,|\, Y_{\bar S}, C_2, X_1) = 0$ and $H(D_1 \,|\, Y_{\bar S}, C_1, X_2) = 0$. This proves the base case.
We now assume that the statement is true for $|S| = t - 1$, and prove that it holds for $|S| = t$. For $|S| = t$, the LHS of (A1) satisfies
$$H(X_S \,|\, Y_{\bar S}) = \frac{1}{t} \sum_{k \in S} \Big[ H(X_{S \setminus k} \,|\, Y_{\bar S}, X_k) + H(X_k \,|\, Y_{\bar S}) \Big]$$
$$\geq\; \frac{1}{t} \sum_{k \in S} H(X_{S \setminus k} \,|\, Y_{\bar S}, C_k) + \frac{1}{t} H(X_S \,|\, Y_{\bar S}) \tag{A6}$$
$$\Longrightarrow \quad H(X_S \,|\, Y_{\bar S}) \;\geq\; \frac{1}{t-1} \sum_{k \in S} H(X_{S \setminus k} \,|\, Y_{\bar S}, C_k)$$
$$=\; \frac{1}{t-1} \sum_{k \in S} H(X_{S \setminus k}, D_k \,|\, Y_{\bar S}, C_k) \tag{A7}$$
$$=\; \frac{1}{t-1} \sum_{k \in S} \Big[ H(D_k \,|\, C_k, Y_{\bar S}) + H(X_{S \setminus k} \,|\, Y_{\overline{S \setminus k}}) \Big], \tag{A8}$$
where (A6) follows because $H(X_k \,|\, C_k) = 0$, together with the subadditivity bound $\sum_{k \in S} H(X_k \,|\, Y_{\bar S}) \geq H(X_S \,|\, Y_{\bar S})$. In (A7), we introduce $D_k$ freely because
$$H(D_k \,|\, X_{S \setminus k}, Y_{\bar S}, C_k) \;\leq\; H(D_k \,|\, X_{S \setminus k}, C_{\bar S}, C_k) \;\leq\; H(D_k \,|\, X_{S \setminus k}, X_{\bar S}, C_k) = 0,$$
where the last two relations follow from $H(X_{\bar S} \,|\, C_{\bar S}) = 0$ and from the decoding condition, respectively. We now interpret the two terms of (A8). For the first term, we have
$$\sum_{k \in S} H(D_k \,|\, C_k, Y_{\bar S}) = \sum_{k \in S} \;\sum_{P' \subseteq S \setminus k} \;\sum_{Q \subseteq S \setminus (P' \cup k)} a_{P' \cup k}^Q = \sum_{P \subseteq S} \;\sum_{Q \subseteq S \setminus P} |P|\, a_P^Q, \tag{A9}$$
where the last equality follows by noting that, for a fixed choice of $P, Q$, there are $|P|$ choices of $(k, P')$ such that $P' \cup k = P$.
Now, using the induction hypothesis for the last term of (A8),
$$\sum_{k \in S} H(X_{S \setminus k} \,|\, Y_{\overline{S \setminus k}}) \;\geq\; \sum_{k \in S} \;\sum_{P \subseteq S \setminus k} \;\sum_{Q \subseteq S \setminus (P \cup k)} \frac{|P|}{|P| + |Q| - 1}\, a_P^Q \tag{A10}$$
$$= \sum_{P \subseteq S} \;\sum_{Q \subseteq S \setminus P} (t - |P| - |Q|)\, \frac{|P|}{|P| + |Q| - 1}\, a_P^Q, \tag{A11}$$
where the last equality follows by noting that, for a fixed choice of disjoint subsets $P, Q$ of $S$, there are $|S| - |P| - |Q|$ choices of $k$ such that $P \subseteq S \setminus k$ and $Q \subseteq S \setminus (P \cup k)$.
Using (A9) and (A11), we have
$$\text{RHS of (A8)} \;\geq\; \frac{1}{t-1} \sum_{P \subseteq S} \;\sum_{Q \subseteq S \setminus P} |P| \left( 1 + \frac{t - |P| - |Q|}{|P| + |Q| - 1} \right) a_P^Q = \sum_{P \subseteq S} \;\sum_{Q \subseteq S \setminus P} \frac{|P|}{|P| + |Q| - 1}\, a_P^Q,$$
thus proving Claim A1, which also concludes the proof of the theorem.
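As a small illustration of the expression just established (Claim A1 with $S = [K]$, i.e., the bound of Theorem 1), the following sketch is our own; the dictionary of counts $a_P^Q$ is a made-up toy instance, not data from the paper.

```python
from fractions import Fraction

# Made-up counts a[(P, Q)]: bits demanded exactly by the nodes in P and stored
# exactly at the nodes in Q (no central server in this setting).
a = {
    (frozenset({1}), frozenset({2})):    4,
    (frozenset({2}), frozenset({1, 3})): 2,
    (frozenset({1, 3}), frozenset({2})): 3,
}

def umbrella_bound(a):
    """Right-hand side of Claim A1 with S = [K], i.e., the bound of Theorem 1."""
    return sum(Fraction(len(P), len(P) + len(Q) - 1) * n for (P, Q), n in a.items())

print(umbrella_bound(a))   # 4 + 1 + 3 = 8 for this toy instance
```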

Appendix B. Proof of (11)

We proceed according to the three steps in Section 1.2, but with some important variations required to prove (11).
Applying Theorem 1 and symmetrizing: As in the proof of Theorem 2, we use the index $0$ to represent the server. Consider that, for the given placement scheme $\zeta$, the delivery scheme is designed so that the optimal communication load $L_c^*(N_u, \zeta)$ is achieved. Let $\mathcal{A} = \binom{[K]}{N_u}$ be the set of all $N_u$-sized subsets of clients. Consider the coded caching subproblem induced by the server and a set $A \in \mathcal{A}$ of clients, and consider some demand vector $\mathbf{d} = (d_1, \ldots, d_K)$ such that the demands of the clients in $A$ are distinct, i.e., $d_i \neq d_j$ for $i, j \in A$, $i \neq j$.
Let $\Omega$ consist of the $N - 1$ cyclic permutations of $(1, \ldots, N)$ that are $N$-cycles, together with the identity permutation. For $\sigma \in \Omega$, let $\sigma(\mathbf{d})$ denote the demand vector whose $i$th component is obtained by applying the permutation $\sigma$ to the $i$th component of $\mathbf{d}$, i.e., $\sigma(\mathbf{d})_i = \sigma(d_i)$.
Clearly, for each $\sigma \in \Omega$, we have $L_c^*(N_u, \zeta) \geq L_A^*(\sigma(\mathbf{d}))$, where $L_A^*(\sigma(\mathbf{d}))$ is the optimal communication load for this subproblem with demands $\sigma(\mathbf{d})$ under the placement $\zeta$.
Now, for each $\sigma \in \Omega$, following steps similar to those in the proof of Theorem 2, we can use our data exchange bound in Theorem 1 to obtain
$$L_c^*(N_u, \zeta) \;\geq\; L_A^*(\sigma(\mathbf{d})) \;\geq\; \sum_{k \in A} \;\sum_{Q_1 \subseteq A \setminus k} \frac{1}{1 + |Q_1|}\, f_k^{Q_1}(\sigma(\mathbf{d})), \tag{A12}$$
where $f_k^{Q_1}(\sigma(\mathbf{d}))$ is the number of bits of the demanded file $W_{\sigma(\mathbf{d})_k}$ of client $k$ that are available at the nodes in $Q_1 \subseteq A$ and at no other node in $A$. Let $a_Q$ be the number of bits (across all $N$ files) stored exclusively at the clients in $Q \cup \{0\} \subseteq [K] \cup \{0\}$. Then, by the structure of the demand vectors in the set $\mathcal{D} = \{\sigma(\mathbf{d}) : \sigma \in \Omega\}$, we see that
$$\sum_{\sigma \in \Omega} f_k^{Q_1}(\sigma(\mathbf{d})) = \sum_{Q_2 \subseteq [K] \setminus A} a_{Q_1 \cup Q_2}.$$
By averaging (A12) across the $N$ demand vectors in $\{\sigma(\mathbf{d}) : \sigma \in \Omega\}$, we get
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N} \sum_{k \in A} \;\sum_{Q_1 \subseteq A \setminus k} \;\sum_{Q_2 \subseteq [K] \setminus A} \frac{1}{1 + |Q_1|}\, a_{Q_1 \cup Q_2}. \tag{A13}$$
The bound (A13) holds for each $A \in \mathcal{A}$. Averaging these bounds over all $A \in \mathcal{A}$, we obtain
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{A \in \mathcal{A}} \;\sum_{k \in A} \;\sum_{Q_1 \subseteq A \setminus k} \;\sum_{Q_2 \subseteq [K] \setminus A} \frac{1}{1 + |Q_1|}\, a_{Q_1 \cup Q_2}. \tag{A14}$$
In the above summation, for any given $A$ and for fixed $Q_1 \subseteq A$, $Q_2 \subseteq [K] \setminus A$, the variable $k$ takes values from $A \setminus Q_1$. Thus, we have
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{A \in \mathcal{A}} \;\sum_{Q_1 \subseteq A} \;\sum_{Q_2 \subseteq [K] \setminus A} \frac{N_u - |Q_1|}{1 + |Q_1|}\, a_{Q_1 \cup Q_2} \tag{A15}$$
$$= \frac{1}{N \binom{K}{N_u}} \sum_{q=0}^{N_u} \;\sum_{Q \subseteq [K]} \;\sum_{A \in \mathcal{A} : |Q \cap A| = q} \frac{N_u - q}{1 + q}\, a_Q. \tag{A16}$$
For some $Q \subseteq [K]$, to obtain an $A \in \mathcal{A}$ such that $|A \cap Q| = q$, we have to choose $q$ elements from $Q$ and $N_u - q$ elements from outside $Q$. Thus, $|\{A \in \mathcal{A} : |Q \cap A| = q\}| = \binom{|Q|}{q} \binom{K - |Q|}{N_u - q}$, and hence
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{q=0}^{N_u} \;\sum_{Q \subseteq [K]} \binom{|Q|}{q} \binom{K - |Q|}{N_u - q} \frac{N_u - q}{1 + q}\, a_Q. \tag{A17}$$
Now,
$$\binom{|Q|}{q} \binom{K - |Q|}{N_u - q} \frac{N_u - q}{1 + q} = \frac{|Q|!\,(K - |Q|)!\,(N_u - q)}{q!\,(|Q| - q)!\,(N_u - q)!\,(K - |Q| - N_u + q)!\,(1 + q)} \tag{A18}$$
$$= \binom{K}{N_u} \frac{\binom{N_u}{q+1} \binom{K - N_u}{|Q| - q}}{\binom{K}{|Q|}}. \tag{A19}$$
Further, we have
$$\sum_{q=0}^{N_u} \binom{N_u}{q+1} \binom{K - N_u}{|Q| - q} = \sum_{q=1}^{N_u} \binom{N_u}{q} \binom{K - N_u}{|Q| - q + 1} = \binom{K}{|Q| + 1} - \binom{K - N_u}{|Q| + 1}. \tag{A20}$$
Using (A20) and (A19) in (A17), we get
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N} \sum_{Q \subseteq [K]} \frac{\binom{K}{|Q| + 1} - \binom{K - N_u}{|Q| + 1}}{\binom{K}{|Q|}}\, a_Q \;\geq\; \frac{1}{N F} \sum_{Q \subseteq [K]} g_{N_u}(|Q|)\, a_Q, \tag{A21}$$
where $g_{N_u}$ is the lower convex envelope defined in Remark 2.
Using the constraints to revise (A21):
Observe that $\sum_{Q \subseteq [K]} a_Q = N F$. We assume, without loss of generality, that all caches are completely populated with bits of the file library (as we are bounding the optimal load), and thus $\sum_{Q \subseteq [K]} |Q|\, a_Q = t N F$. Using these constraints and applying Jensen's inequality to (A21), we obtain (11). This completes the proof.
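The binomial manipulations in (A18)–(A20) above are easy to spot-check numerically; the short script below is our own sanity check, with arbitrarily chosen parameter values, and is not part of the proof.

```python
from math import comb

def C(n, k):
    """Binomial coefficient that is zero outside the valid range."""
    return comb(n, k) if 0 <= k <= n else 0

# Spot-check (A19) and (A20) for a few arbitrary (K, N_u, |Q|) choices.
for (K, Nu, sizeQ) in [(6, 3, 2), (8, 4, 5), (10, 2, 7)]:
    for q in range(Nu + 1):
        lhs = C(sizeQ, q) * C(K - sizeQ, Nu - q) * (Nu - q) / (1 + q)
        rhs = C(K, Nu) * C(Nu, q + 1) * C(K - Nu, sizeQ - q) / C(K, sizeQ)
        assert abs(lhs - rhs) < 1e-9                       # identity (A19)
    total = sum(C(Nu, q + 1) * C(K - Nu, sizeQ - q) for q in range(Nu + 1))
    assert total == C(K, sizeQ + 1) - C(K - Nu, sizeQ + 1)  # identity (A20)
print("identities verified for the sampled parameters")
```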

Appendix C. Proof of (21)

The proof uses the same approach as in Appendix B, but takes into account the fact that there is no central server and the cache sizes are heterogeneous. The choice of the set of demand vectors used for symmetrization is the same as in Appendix B.
Applying Theorem 1 and symmetrizing: Let the sets $A$ and $\mathcal{A}$, the set of cyclic permutations $\Omega$, the demand vector $\mathbf{d}$, and the optimal communication loads $L_c^*(N_u, \zeta)$ and $L_A^*(\sigma(\mathbf{d}))$ be defined as in Appendix B. Throughout this proof we assume $N_u = \min\{N, K\}$. Applying our main result, Theorem 1, to the subproblem induced by the demands of the subset of nodes $A$, we obtain
$$L_c^*(N_u, \zeta) \;\geq\; L_A^*(\sigma(\mathbf{d})) \;\geq\; \sum_{k \in A} \;\sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q|}\, c_k^Q(\sigma(\mathbf{d})),$$
where $c_k^Q(\sigma(\mathbf{d}))$ is the number of bits of the file $W_{(\sigma(\mathbf{d}))_k}$ demanded by node $k$ that are available exclusively at the nodes in $Q$ and not available at the nodes in $[K] \setminus Q$. Let $a_Q$ be the total number of bits stored exclusively at the nodes in $Q$. Then, considering the set of $N$ demand vectors $\{\sigma(\mathbf{d}) : \sigma \in \Omega\}$, we have
$$\sum_{\sigma \in \Omega} c_k^Q(\sigma(\mathbf{d})) = a_Q.$$
Averaging over all possible $\sigma$, we obtain
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N} \sum_{\sigma \in \Omega} L_A^*(\sigma(\mathbf{d})) \;\geq\; \frac{1}{N} \sum_{k \in A} \;\sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q|}\, a_Q.$$
Again, averaging this inequality over all possible choices of $A \in \mathcal{A} = \binom{[K]}{N_u}$, we obtain
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{A \in \mathcal{A}} \;\sum_{\sigma \in \Omega} L_A^*(\sigma(\mathbf{d})) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{A \in \mathcal{A}} \;\sum_{k \in A} \;\sum_{Q \subseteq [K] \setminus k} \frac{1}{|Q|}\, a_Q. \tag{A22}$$
In (A22), for a given choice of $Q \subseteq [K]$, the term $\frac{1}{|Q|} a_Q$ appears in the summation once for every choice of $(A, k)$ such that $k \notin Q$, $|A| = N_u$, $k \in A$. The number of such choices of $(A, k)$ is $(K - |Q|) \binom{K-1}{N_u - 1}$. Hence, we arrive at
$$L_c^*(N_u, \zeta) \;\geq\; \frac{1}{N \binom{K}{N_u}} \sum_{Q \subseteq [K]} \binom{K-1}{N_u - 1} \frac{(K - |Q|)}{|Q|}\, a_Q = \frac{F N_u}{K} \sum_{Q \subseteq [K]} \frac{(K - |Q|)}{|Q|} \frac{a_Q}{N F}. \tag{A23}$$
Using the constraints to revise (A23): We use the observations that $\frac{K - x}{x}$ is convex and decreasing in $x > 0$, that $\sum_{Q \subseteq [K]} \frac{a_Q}{N F} = 1$, and that $\sum_{Q \subseteq [K]} |Q|\, a_Q \leq \sum_{i=1}^{t} K_{T_i} \gamma_{T_i} N F$, since the right-hand side is the total available cache memory across all nodes. Applying Jensen's inequality to (A23) using these constraints, we obtain
$$L_c^*(N_u, \zeta) \;\geq\; \frac{F N_u}{K} \cdot \frac{K - \sum_{i=1}^{t} K_{T_i} \gamma_{T_i}}{\sum_{i=1}^{t} K_{T_i} \gamma_{T_i}}.$$
This completes the proof.

Appendix D. Proof of Proposition 3

We continue to use the notation from the proof of Theorem 1; here, $X_0$ denotes the codeword transmitted by the central server. Let $k \in [K]$ and $S = \{\gamma_k, \ldots, \gamma_K\}$; note that $\bar{S} = \{\gamma_1, \ldots, \gamma_{k-1}\}$. We prove by induction on $|S| = K - k + 1$ that
$$H(X_0 \,|\, Y_{\bar S}) \;\geq\; \sum_{i=k}^{K} \;\sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \;\sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q. \tag{A24}$$
Then, the result claimed in the proposition follows by taking $S = \{\gamma_1, \ldots, \gamma_K\} = [K]$, i.e., $k = 1$. When $|S| = 1$, i.e., $k = K$ and $S = \{\gamma_K\}$, clearly (A24) is true, as
$$H(X_0 \,|\, Y_{\{\gamma_1, \ldots, \gamma_{K-1}\}}) \;\geq\; a_{\{\gamma_K\}}^{\{0\}} = c_{\{\gamma_K\}}^{\emptyset}.$$
Now, consider $S = \{\gamma_k, \ldots, \gamma_K\}$. The induction hypothesis is
$$H(X_0 \,|\, Y_{\overline{S \setminus \gamma_k}}) \;\geq\; \sum_{i=k+1}^{K} \;\sum_{\substack{P \subseteq \{\gamma_i, \ldots, \gamma_K\} \\ \gamma_i \in P}} \;\sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_P^Q. \tag{A25}$$
Using the fact that $H(D_{\gamma_k} \,|\, Y_{\bar S}, C_{\gamma_k}, X_0) \leq H(D_{\gamma_k} \,|\, C_{\gamma_k}, X_0) = 0$, we have
$$H(X_0 \,|\, Y_{\bar S}) \;\geq\; H(X_0 \,|\, Y_{\bar S}, C_{\gamma_k}) = H(X_0 \,|\, Y_{\bar S}, C_{\gamma_k}) + H(D_{\gamma_k} \,|\, Y_{\bar S}, C_{\gamma_k}, X_0) = H(X_0, D_{\gamma_k} \,|\, Y_{\bar S}, C_{\gamma_k})$$
$$= H(D_{\gamma_k} \,|\, Y_{\bar S}, C_{\gamma_k}) + H(X_0 \,|\, C_{\gamma_k}, D_{\gamma_k}, Y_{\bar S}) = \sum_{\substack{P \subseteq \{\gamma_k, \ldots, \gamma_K\} \\ \gamma_k \in P}} \;\sum_{Q \subseteq \{\gamma_{k+1}, \ldots, \gamma_K\}} c_P^Q + H(X_0 \,|\, Y_{\overline{S \setminus \gamma_k}}). \tag{A26}$$
We observe that (A24) follows from (A25) and (A26).

Appendix E. Proof of Theorem 5

We prove this theorem by showing that α is both upper and lower bounded by the right hand side of (43).
Upper Bound: Assume that $\mathcal{H}$ is a largest generalized independent set. We now identify a permutation $\pi = (\pi_1, \ldots, \pi_K)$ corresponding to $\mathcal{H}$. Let $\mathcal{I}_1 = \mathcal{H}$, and observe that, as $\mathcal{I}_1$ is itself a subset of $\mathcal{H}$, it must contain an information bit, say $w_{P,m}^Q$, that is demanded by a node, say $\pi_1$, such that none of the bits in $\mathcal{I}_1 \setminus \{w_{P,m}^Q\}$ is available as side information at $\pi_1$. For $k = 2, \ldots, K$, we sequentially identify $\pi_k$ as follows. We first define
$$\mathcal{I}_k = \mathcal{H} \;\setminus\; \bigcup_{i < k} \;\bigcup_{P : \pi_i \in P} \;\bigcup_{Q \subseteq [K] \setminus P} \left\{ w_{P,m}^Q : m = 1, \ldots, c_P^Q \right\},$$
which is $\mathcal{H}$ minus all the bits demanded by any of $\pi_1, \ldots, \pi_{k-1}$. Thus, any bit in $\mathcal{I}_k$ is demanded by one or more of the nodes in $[K] \setminus \{\pi_1, \ldots, \pi_{k-1}\}$. As $\mathcal{I}_k \subseteq \mathcal{H}$, it contains an information bit demanded by a node, say $\pi_k \in [K] \setminus \{\pi_1, \ldots, \pi_{k-1}\}$, such that none of the remaining bits of $\mathcal{I}_k$ is available as side information at $\pi_k$.
Observe that $\mathcal{H} = \mathcal{I}_1 \supseteq \mathcal{I}_2 \supseteq \cdots \supseteq \mathcal{I}_K$, and $\mathcal{I}_k \setminus \mathcal{I}_{k+1}$ is the set of bits in $\mathcal{H}$ that are demanded by $\pi_k$ but not by any of the nodes $\pi_1, \ldots, \pi_{k-1}$. Thus,
$$\mathcal{I}_1 \setminus \mathcal{I}_2, \;\; \mathcal{I}_2 \setminus \mathcal{I}_3, \;\; \ldots, \;\; \mathcal{I}_{K-1} \setminus \mathcal{I}_K, \;\; \mathcal{I}_K$$
form a partition of $\mathcal{H}$. Here, we abuse notation and denote $\mathcal{I}_K$ by $\mathcal{I}_K \setminus \mathcal{I}_{K+1}$. We also observe that, for any choice of $k$, none of the bits of $\mathcal{I}_k$ is available as side information at $\pi_k$. If $k' > k$, then, as $\mathcal{I}_{k'} \subseteq \mathcal{I}_k$, we deduce that none of the bits in $\mathcal{I}_{k'}$ is available as side information at $\pi_k$. Thus, we conclude that each bit in $\mathcal{I}_k \setminus \mathcal{I}_{k+1}$ is demanded by $\pi_k$ and is neither demanded by nor available as side information at any of $\pi_1, \ldots, \pi_{k-1}$. Therefore, $|\mathcal{I}_k \setminus \mathcal{I}_{k+1}|$ is upper bounded by the number of bits demanded exclusively by $\pi_k$ and possibly some subset of $\{\pi_{k+1}, \ldots, \pi_K\}$, and available exclusively at some subset of $\{\pi_{k+1}, \ldots, \pi_K\}$, i.e.,
$$|\mathcal{I}_k \setminus \mathcal{I}_{k+1}| \;\leq\; \sum_{\substack{P \subseteq \{\pi_k, \ldots, \pi_K\} \\ \pi_k \in P}} \;\sum_{Q \subseteq \{\pi_{k+1}, \ldots, \pi_K\}} c_P^Q.$$
This provides us the following upper bound:
$$\alpha = |\mathcal{H}| = \sum_{k=1}^{K} |\mathcal{I}_k \setminus \mathcal{I}_{k+1}| \;\leq\; \sum_{k=1}^{K} \;\sum_{\substack{P \subseteq \{\pi_k, \ldots, \pi_K\} \\ \pi_k \in P}} \;\sum_{Q \subseteq \{\pi_{k+1}, \ldots, \pi_K\}} c_P^Q \;\leq\; \max_{\gamma} \sum_{k=1}^{K} \;\sum_{\substack{P \subseteq \{\gamma_k, \ldots, \gamma_K\} \\ \gamma_k \in P}} \;\sum_{Q \subseteq \{\gamma_{k+1}, \ldots, \gamma_K\}} c_P^Q,$$
where the maximization is over all permutations $\gamma$ of $[K]$.
Lower bound: We derive the lower bound by showing that, for any permutation $\gamma$, the set
$$\mathcal{H} = \bigcup_{k=1}^{K} \;\bigcup_{\substack{P \subseteq \{\gamma_k, \ldots, \gamma_K\} \\ \gamma_k \in P}} \;\bigcup_{Q \subseteq \{\gamma_{k+1}, \ldots, \gamma_K\}} \left\{ w_{P,m}^Q : m = 1, \ldots, c_P^Q \right\}$$
is a generalized independent set. Then,
$$\alpha \;\geq\; \max_{\gamma} |\mathcal{H}| = \max_{\gamma} \sum_{k=1}^{K} \;\sum_{\substack{P \subseteq \{\gamma_k, \ldots, \gamma_K\} \\ \gamma_k \in P}} \;\sum_{Q \subseteq \{\gamma_{k+1}, \ldots, \gamma_K\}} c_P^Q.$$
To show that $\mathcal{H}$ is a generalized independent set, consider any subset $\mathcal{I} \subseteq \mathcal{H}$. Let $k$ be the smallest integer such that $\mathcal{I}$ contains an information bit $w_{P,m}^Q$ with $\gamma_k \in P$, i.e., $k$ is the smallest integer such that $\gamma_k$ demands some information bit in $\mathcal{I}$. Therefore, any other bit $w_{P',m'}^{Q'}$ in $\mathcal{I}$ must satisfy
$$P' \subseteq \{\gamma_{k'}, \ldots, \gamma_K\}, \quad Q' \subseteq \{\gamma_{k'+1}, \ldots, \gamma_K\} \quad \text{for some } k' \geq k.$$
Clearly, such a bit is not available as side information at $\gamma_k$. Thus, $\mathcal{H}$ is a generalized independent set.

Appendix F. Proof of Theorem 6

For unicast problems, $c_P^Q > 0$ only if $|P| = 1$. We abuse the notation mildly and use $c_k^Q$ to denote $c_{\{k\}}^Q$.
The Necessity Part: For unicast problems, the lower bound of Proposition 3 is $\sum_{i=1}^{K} \sum_{Q \subseteq \{\gamma_{i+1}, \ldots, \gamma_K\}} c_{\gamma_i}^Q$. For brevity, we denote this sum by $f(\gamma)$. For Theorem 4 to be tight, it is necessary that the bound of this theorem equal $\alpha$, i.e., that $f$ take the same value for all permutations $\gamma$.
We first show that $c_i^{\{j\}} = c_j^{\{i\}}$ for any $i \neq j$. Consider two permutations $\gamma$ and $\pi$ that differ only in the two coordinates $K-1$ and $K$, given by $\gamma_{K-1} = i$, $\gamma_K = j$ and $\pi_{K-1} = j$, $\pi_K = i$. Then,
$$0 = f(\gamma) - f(\pi) = c_i^{\{j\}} + c_i^{\emptyset} + c_j^{\emptyset} - c_j^{\{i\}} - c_j^{\emptyset} - c_i^{\emptyset} = c_i^{\{j\}} - c_j^{\{i\}}.$$
This proves the result for $|S| = 2$. Next, we assume that the necessity claim is true for any $S \subseteq [K]$ of size at most $t$, and use induction to prove the result for $|S| = t + 1$.
Given any $(t+1)$-set $S \subseteq [K]$ and any $k, k' \in S$, we now show that $c_k^{S \setminus k} = c_{k'}^{S \setminus k'}$. Consider two permutations $\gamma, \pi$ that differ only in the two coordinates $K - t$ and $K - t + 1$, with
$$\gamma_{K-t}, \ldots, \gamma_K, \; \pi_{K-t}, \ldots, \pi_K \in S, \quad \gamma_{K-t} = k, \;\; \gamma_{K-t+1} = k', \quad \text{and} \quad \pi_{K-t} = k', \;\; \pi_{K-t+1} = k.$$
We observe that $S = \{\gamma_{K-t}, \ldots, \gamma_K\} = \{\pi_{K-t}, \ldots, \pi_K\}$, and
$$0 = f(\gamma) - f(\pi) = \sum_{Q \subseteq \{\gamma_{K-t+1}, \ldots, \gamma_K\}} c_k^Q + \sum_{Q \subseteq \{\gamma_{K-t+2}, \ldots, \gamma_K\}} c_{k'}^Q - \sum_{Q \subseteq \{\pi_{K-t+1}, \ldots, \pi_K\}} c_{k'}^Q - \sum_{Q \subseteq \{\pi_{K-t+2}, \ldots, \pi_K\}} c_k^Q$$
$$= \sum_{Q \subseteq S \setminus k} c_k^Q + \sum_{Q \subseteq S \setminus \{k, k'\}} c_{k'}^Q - \sum_{Q \subseteq S \setminus k'} c_{k'}^Q - \sum_{Q \subseteq S \setminus \{k, k'\}} c_k^Q. \tag{A27}$$
We now argue that, except for the two terms $c_k^{S \setminus k}$ and $c_{k'}^{S \setminus k'}$, all other terms in (A27) cancel out. Consider any term $c_k^Q$ in the first summation of (A27) with $|Q| \leq t - 1$. If $k' \in Q$, then, by the induction hypothesis applied to the set $Q \cup \{k\}$, the term $c_{k'}^{(Q \cup k) \setminus k'}$ present in the third summation cancels $c_k^Q$. If $k' \notin Q$, then $k, k' \notin Q$, and the term $c_k^Q$ in the fourth summation cancels it. Similarly, every term $c_{k'}^Q$ in the second summation cancels the corresponding term $c_{k'}^Q$ in the third summation. It is straightforward to verify that these correspondences between the positive and negative terms are one-to-one, and thus we are left with $0 = c_k^{S \setminus k} - c_{k'}^{S \setminus k'}$.
The Sufficiency Part: The lower bound in Theorem 4 is
$$L^* \;\geq\; \sum_{k=1}^{K} \;\sum_{Q \subseteq [K] \setminus k} \frac{1}{1 + |Q|}\, c_k^Q = \sum_{k=1}^{K} c_k^{\emptyset} + \sum_{\substack{S \subseteq [K] \\ |S| \geq 2}} \;\sum_{k \in S} \frac{c_k^{S \setminus k}}{|S|}.$$
This lower bound can be met by a scheme that combines uncoded transmission with clique covering. All the bits that are not available at any of the $K$ clients are transmitted uncoded, incurring the cost $\sum_{k=1}^{K} c_k^{\emptyset}$. For every $S \subseteq [K]$ with $|S| \geq 2$, the encoder constructs $|S|$ vectors, one corresponding to each $k \in S$, and broadcasts the XOR of these vectors to the clients. The vector for $k \in S$ consists of the $c_k^{S \setminus k}$ bits demanded by node $k$ and available at the nodes in $S \setminus k$; under the condition of the theorem, all these $|S|$ vectors have the same length $c_k^{S \setminus k}$ and can therefore be XORed. These coded transmissions incur an additional cost of $\sum_{S \subseteq [K], |S| \geq 2} \sum_{k \in S} \frac{c_k^{S \setminus k}}{|S|}$, thereby achieving the lower bound. This is the well-known clique-covering index coding scheme (see [27,28]), and these transmissions allow the clients to decode their demands.
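For concreteness, the following sketch is our own implementation of the uncoded-plus-clique-covering scheme just described, on a made-up toy instance chosen to satisfy the condition of Theorem 6; the data-structure choices are assumptions for illustration only.

```python
import numpy as np

# bits[k][Q] holds the bits demanded by client k that are cached exactly at the
# clients in Q (the server always has everything). Toy instance, K = 3 clients.
rng = np.random.default_rng(0)
K = 3
bits = {
    1: {frozenset(): rng.integers(0, 2, 2),        # c_1^{} = 2 (server-only bits)
        frozenset({2, 3}): rng.integers(0, 2, 3)},  # c_1^{2,3} = 3
    2: {frozenset({1, 3}): rng.integers(0, 2, 3)},  # c_2^{1,3} = 3
    3: {frozenset({1, 2}): rng.integers(0, 2, 3)},  # c_3^{1,2} = 3
}

transmissions = []
# Uncoded part: bits available only at the server are sent as-is.
for k in range(1, K + 1):
    x = bits[k].get(frozenset(), np.array([], dtype=int))
    if x.size:
        transmissions.append(x)
# Clique-covering part: for S = {1,2,3}, XOR the three equal-length vectors.
S = frozenset({1, 2, 3})
vectors = [bits[k][S - {k}] for k in sorted(S)]
transmissions.append(np.bitwise_xor.reduce(vectors))

load = sum(x.size for x in transmissions)
print(load)   # 2 uncoded + 3 coded bits = 5, matching the bound for this instance
```

Each client strips the vectors it already caches from the XOR and recovers its own demanded bits, exactly as in the decoding argument above.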

References

  1. Maddah-Ali, M.A.; Niesen, U. Fundamental Limits of Caching. IEEE Trans. Inf. Theory 2014, 60, 2856–2867.
  2. Li, S.; Maddah-Ali, M.A.; Avestimehr, A.S. Coded MapReduce. In Proceedings of the 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 29 September–2 October 2015; pp. 964–971.
  3. Li, S.; Maddah-Ali, M.A.; Yu, Q.; Avestimehr, A.S. A Fundamental Tradeoff Between Computation and Communication in Distributed Computing. IEEE Trans. Inf. Theory 2018, 64, 109–128.
  4. Lee, K.; Lam, M.; Pedarsani, R.; Papailiopoulos, D.; Ramchandran, K. Speeding Up Distributed Machine Learning Using Codes. IEEE Trans. Inf. Theory 2018, 64, 1514–1529.
  5. Wan, K.; Tuninetti, D.; Ji, M.; Caire, G.; Piantanida, P. Fundamental Limits of Decentralized Data Shuffling. IEEE Trans. Inf. Theory 2020, 66, 3616–3637.
  6. Elmahdy, A.; Mohajer, S. On the Fundamental Limits of Coded Data Shuffling for Distributed Machine Learning. IEEE Trans. Inf. Theory 2020, 66, 3098–3131.
  7. Krishnan, P.; Lalitha, V.; Natarajan, L. Coded Data Rebalancing: Fundamental Limits and Constructions. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020; pp. 640–645.
  8. El Rouayheb, S.; Sprintson, A.; Sadeghi, P. On coding for cooperative data exchange. In Proceedings of the 2010 IEEE Information Theory Workshop on Information Theory (ITW 2010, Cairo), Cairo, Egypt, 6–8 January 2010; pp. 1–5.
  9. Dau, S.H.; Skachek, V.; Chee, Y.M. Error Correction for Index Coding with Side Information. IEEE Trans. Inf. Theory 2013, 59, 1517–1531.
  10. Sadeghi, P.; Arbabjolfaei, F.; Kim, Y.H. Distributed index coding. In Proceedings of the 2016 IEEE Information Theory Workshop (ITW), Cambridge, UK, 11–14 September 2016; pp. 330–334.
  11. Li, M.; Ong, L.; Johnson, S.J. Cooperative Multi-Sender Index Coding. IEEE Trans. Inf. Theory 2019, 65, 1725–1739.
  12. Porter, A.; Wootters, M. Embedded Index Coding. IEEE Trans. Inf. Theory 2021, 67, 1461–1477.
  13. Yu, Q.; Li, S.; Maddah-Ali, M.A.; Avestimehr, A.S. How to optimally allocate resources for coded distributed computing? In Proceedings of the 2017 IEEE International Conference on Communications (ICC), Paris, France, 21–25 May 2017; pp. 1–7.
  14. Krishnan, P.; Natarajan, L.; Lalitha, V. An Umbrella Converse for Data Exchange: Applied to Caching, Computing, Shuffling & Rebalancing. In Proceedings of the 2020 IEEE Information Theory Workshop, Riva del Garda, Italy, 11–15 April 2020.
  15. Wan, K.; Tuninetti, D.; Piantanida, P. On the optimality of uncoded cache placement. In Proceedings of the 2016 IEEE Information Theory Workshop (ITW), Cambridge, UK, 11–14 September 2016; pp. 161–165.
  16. Yu, Q.; Maddah-Ali, M.A.; Avestimehr, A.S. The Exact Rate-Memory Tradeoff for Caching with Uncoded Prefetching. IEEE Trans. Inf. Theory 2018, 64, 1281–1296.
  17. Ji, M.; Caire, G.; Molisch, A.F. Fundamental Limits of Caching in Wireless D2D Networks. IEEE Trans. Inf. Theory 2016, 62, 849–869.
  18. Asadi, B.; Ong, L.; Johnson, S.J. Centralized Caching with Unequal Cache Sizes. In Proceedings of the 2018 IEEE Information Theory Workshop (ITW), Guangzhou, China, 25–29 November 2018; pp. 1–5.
  19. Ibrahim, A.M.; Zewail, A.A.; Yener, A. Centralized Coded Caching with Heterogeneous Cache Sizes. In Proceedings of the 2017 IEEE Wireless Communications and Networking Conference (WCNC), San Francisco, CA, USA, 19–22 March 2017; pp. 1–6.
  20. Yapar, C.; Wan, K.; Schaefer, R.F.; Caire, G. On the Optimality of D2D Coded Caching With Uncoded Cache Placement and One-Shot Delivery. IEEE Trans. Commun. 2019, 67, 8179–8192.
  21. Ibrahim, A.M.; Zewail, A.A.; Yener, A. Device-to-Device Coded Caching with Heterogeneous Cache Sizes. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City, MO, USA, 20–24 May 2018; pp. 1–6.
  22. Wei, Y.; Ulukus, S. Coded caching with multiple file requests. In Proceedings of the 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 3–6 October 2017; pp. 437–442.
  23. Parrinello, E.; Ünsal, A.; Elia, P. Fundamental Limits of Coded Caching With Multiple Antennas, Shared Caches and Uncoded Prefetching. IEEE Trans. Inf. Theory 2020, 66, 2252–2268.
  24. Maddah-Ali, M.A.; Niesen, U. Decentralized Coded Caching Attains Order-Optimal Memory-Rate Tradeoff. IEEE/ACM Trans. Netw. 2015, 23, 1029–1040.
  25. Karat, N.S.; Bhargav, K.L.V.; Rajan, B.S. On the Optimality of Two Decentralized Coded Caching Schemes with and without Error Correction. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020; pp. 1664–1669.
  26. Bar-Yossef, Z.; Birk, Y.; Jayram, T.S.; Kol, T. Index Coding with Side Information. IEEE Trans. Inf. Theory 2011, 57, 1479–1494.
  27. Birk, Y.; Kol, T. Informed-source coding-on-demand (ISCOD) over broadcast channels. In Proceedings of the IEEE INFOCOM '98, the Conference on Computer Communications, Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies, San Francisco, CA, USA, 29 March–2 April 1998; Volume 3, pp. 1257–1264.
  28. Tehrani, A.S.; Dimakis, A.G.; Neely, M.J. Bipartite index coding. In Proceedings of the 2012 IEEE International Symposium on Information Theory Proceedings, Cambridge, MA, USA, 1–6 July 2012; pp. 2246–2250.
Figure 1. A single server is connected to K clients via a broadcast channel. Each user has a cache capable of storing M F of the N F bits in the file-library available at the server.
Figure 2. The decentralized data shuffling problem. The contents of the local cache of each node k at time t − 1 is Z k , t − 1 , out of which a subset A k , t − 1 is the active data currently processed by the node. The worker nodes must communicate via a broadcast link to shuffle the active data among each other and create a new partition A 1 , t , … , A K , t of the data units at the next time instance t.
Figure 3. There are N files and a subset M i of them are assigned to node i in the map phase. The output of the map phase at each node is v 1 : Q , M i . Each node computes X i = ϕ i ( v 1 : Q , M i ), which it broadcasts to the other nodes. The nodes compute the reduce outputs based on their own map outputs and the broadcasts they receive.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
