Network Coding for Line Networks with Broadcast Channels

Kramer, Gerhard; Yazdi, Seyed Mohammadsadegh Tabatabaei

doi:10.3390/e14101813

Open AccessArticle

Network Coding for Line Networks with Broadcast Channels

by

Gerhard Kramer

¹

and

Seyed Mohammadsadegh Tabatabaei Yazdi

²

¹

Institute for Communications Engineering, Technische Universität München, 80333 Munich, Germany

²

Corporate R&D, Qualcomm, San Diego, CA 92121, USA

Entropy 2012, 14(10), 1813-1828; https://doi.org/10.3390/e14101813

Submission received: 26 July 2012 / Revised: 7 September 2012 / Accepted: 18 September 2012 / Published: 28 September 2012

(This article belongs to the Special Issue Information Theory Applied to Communications and Networking)

Download

Browse Figures

Versions Notes

Abstract

:

An achievable rate region for line networks with edge and node capacity constraints and broadcast channels (BCs) is derived. The region is shown to be the capacity region if the BCs are orthogonal, deterministic, physically degraded, or packet erasure with one-bit feedback. If the BCs are physically degraded with additive Gaussian noise then independent Gaussian inputs achieve capacity.

Keywords:

broadcast channels; capacity; network coding; relay channels

Graphical Abstract

1. Introduction

Consider a line network with edge and node capacity constraints as shown in Figure 1. “Supernode” u,

u = 1, 2, 3, 4

, consists of two nodes

u i, u o

where the “i” represents “input” and “o” represents “output”. More generally, for N supernodes

V = {1, 2, \dots, N}

we have

2 N

nodes

\begin{matrix} V_{i o} = {1 i, 1 o, 2 i, 2 o, \dots, N i, N o} \end{matrix}

(1)

and

2 (N - 1) + N

directed edges

\begin{matrix} E_{i o} & = {(1 o, 2 i), (2 o, 3 i), \dots, ((N - 1) o, N i} \end{matrix}

\begin{matrix} \cup {(N o, (N - 1) i), \dots, (3 o, 2 i), (2 o, 1 i} \end{matrix}

\begin{matrix} \cup {(1 i, 1 o), (2 i, 2 o), \dots, (N i, N o)} \end{matrix}

(2)

Every edge

(a, b)

is labeled with a capacity constraint

C_{(a, b)}

and for simplicity we write

C_{(u i, u o)}

as

C_{u}

.

Figure 1. A line network with edge and node capacity constraints.

Let

\begin{matrix} u \to D (u) = {v (1), v (2), \dots, v (L)} \end{matrix}

(3)

denote a multicast traffic session, where

u, v (1), \dots, v (L)

are supernodes. The meaning is that a source message is available at supernode u and is destined for supernodes in the set

D (u)

. Since u takes on N values and

D (u)

can take on

2^{N - 1} - 1

values, there are up to

N (2^{N - 1} - 1)

multicast sessions. We associate sources with input nodes

u i

and sinks with output nodes

u o

. Such line networks are special cases of discrete memoryless networks (DMNs) and we use the capacity definition from [1] (Section III.D). The capacity region was recently established in [2]. A binary linear network code achieves capacity and progressive d-separating edge-cut (PdE) bounds [3] provide the converse.

The goal of this work is to extend results from [2] to wireless line networks by using insights from two-way relaying [4], broadcasting with cooperation [5], and broadcasting with side-information [6]. The model is shown in Figure 2 where the difference to Figure 1 is that node

u o

transmits over a two-receiver broadcast channel (BC)

P (y_{u - 1}, y_{u + 1} | x_{u})

to nodes

(u - 1) i

and

(u + 1) i

(see [7]). The channel outputs at node

u i

are

\begin{matrix} Y_{u - 1, u} = f_{u - 1, u} (X_{u - 1}, Z_{u - 1}) \end{matrix}

(4)

\begin{matrix} Y_{u + 1, u} = f_{u + 1, u} (X_{u + 1}, Z_{u + 1}) \end{matrix}

(5)

for some functions

f_{u - 1, u} (\cdot)

and

f_{u + 1, u} (\cdot)

, and where the

Z_{u}

,

u = 1, 2, \dots, N

, are statistically independent. We permit the noise random variables

Z_{u}

to be common to

Y_{u, u - 1}

and

Y_{u, u + 1}

for generality. The edges

(u i, u o)

are the usual links with capacity

C_{u}

. Such line networks are again special cases DMNs and we use the capacity definition from [1] (Section III.D).

Figure 2. A line network with broadcasting and node capacity constraints.

The paper is organized as follows. Section 2 reviews the capacity region for line networks derived in [2]. Section 3 gives our main result: an achievable rate region for line networks with BCs. Section 4 shows that this region is the capacity region for orthogonal, deterministic, and physically degraded BCs, and packet erasure BCs with feedback. We further show that for physically degraded Gaussian BCs the best input distributions are Gaussian. Section 5 relates our work to recent work on relaying and concludes the paper.

2. Review of Wireline Capacity

We review the main result from [2]. Let

m (u \to D (u))

and

R (u \to D (u))

denote the message bits and rate, respectively, of traffic session

u \to D (u)

. We collect the bits going through supernode u into the following 8 sets:

\begin{matrix} m_{L R}^{(u)} & = {m (i \to D (i)) : 1 \leq i \leq u - 1, D (i) \cap {u + 1, \dots, N} \neq \emptyset, u \notin D (i)} \end{matrix}

(6)

\begin{matrix} m_{R L}^{(u)} & = {m (i \to D (i)) : u + 1 \leq i \leq N, D (i) \cap {1, \dots, u - 1} \neq \emptyset, u \notin D (i)} \end{matrix}

(7)

\begin{matrix} m_{L R u} & = {m (i \to D (i)) : 1 \leq i \leq u - 1, D (i) \cap {u + 1, \dots, N} \neq \emptyset, u \in D (i)} \end{matrix}

(8)

\begin{matrix} m_{R L u} & = {m (i \to D (i)) : u + 1 \leq i \leq N, D (i) \cap {1, \dots, u - 1} \neq \emptyset, u \in D (i)} \end{matrix}

(9)

\begin{matrix} m_{u} & = {m (i \to D (i)) : 1 \leq i \leq u - 1, D (i) \cap {u + 1, \dots, N} = \emptyset, u \in D (i)} \end{matrix}

\begin{matrix} ⋃ {m (i \to D (i)) : u + 1 \leq i \leq N, D (i) \cap {1, \dots, u - 1} = \emptyset, u \in D (i)} \end{matrix}

(10)

\begin{matrix} m_{u, L R} & = {m (u \to D (u)) : D (u) \cap {1, \dots, u - 1} \neq \emptyset}, D (u) \cap {u + 1, \dots, n} \neq \emptyset}} \end{matrix}

(11)

\begin{matrix} m_{u, R} & = {m (u \to D (u)) : D (u) \cap {1, \dots, u - 1} = \emptyset}, D (u) \cap {u + 1, \dots, n} \neq \emptyset}} \end{matrix}

(12)

\begin{matrix} m_{u, L} & = {m (u \to D (u)) : D (u) \cap {1, \dots, u - 1} \neq \emptyset}, D (u) \cap {u + 1, \dots, n} = \emptyset}} \end{matrix}

(13)

The idea is that

$m_{L R}^{(u)}$ and $m_{R L}^{(u)}$ represent traffic flowing from left-to-right and right-to-left, respectively, through supernode u without being required at supernode u;
$m_{L R u}$ , $m_{R L u}$ represent traffic flowing from left-to-right and right-to-left, respectively, through supernode u but required at supernode u also;
$m_{u}$ represents traffic from the left and right, respectively, and destined for supernode u but not destined for any nodes on the right and left (so this traffic “stops" at supernode u on its way from the left or right);
$m_{u, L R}$ , $m_{u, R}$ , and $m_{u, L}$ represent traffic originating at supernode u and destined for nodes on both the left and right, right only, and left only, respectively.

The non-negative message rates are denoted

R_{L R}^{(u)}

,

R_{R L}^{(u)}

,

R_{L R u}

,

R_{R L u}

,

R_{u}

,

R_{u, L R}

,

R_{u, R}

, and

R_{u, L}

.

Theorem 1 (Theorem 1 in [2])

The capacity region of a line network with supernodes

u = 1, 2, \dots, N

is specified by the inequalities

\begin{matrix} max (R_{L R}^{(u)}, R_{R L}^{(u)}) + R_{L R u} + R_{R L u} + R_{u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq C_{u} \end{matrix}

(14)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, L} \leq C_{u, u - 1} \end{matrix}

(15)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} \leq C_{u, u + 1} \end{matrix}

(16)

Remark 1 The converse in [2] follows by PdE arguments [3] and achievability follows by using rate-splitting, routing, copying, and “butterfly” binary linear network coding. We review both the PdE bound and the coding method after Examples 1 and 2 below.

Remark 2 Inequalities (15) and (16) are classic cut bounds [8] (Section 14.10). If we have no node constraints (

C_{u} = \infty

) then (15) and (16) are routing bounds, so routing is optimal for this case (see [9]).

Example 1 Consider

N = 3

for which we have 9 possible multicast sessions. The network is as in Figure 1 but where the nodes

4 i

and

4 o

are removed, as well as the edges touching them. For supernode

u = 1

we collect 7 of these sessions into 2 sets as follows. (We abuse notation and write

u \to {v}

as

u \to v

.)

\begin{matrix} m_{1} & : 2 \to 1, 2 \to {1, 3}, 3 \to 1, 3 \to {1, 2} \end{matrix}

(17)

\begin{matrix} m_{1, R} & : 1 \to 2, 1 \to 3, 1 \to {2, 3} \end{matrix}

(18)

Sessions

2 \to 3

and

3 \to 2

are missing from (17) and (18) because they do not involve supernode 1. Similarly, for supernode 2 we collect the 9 sessions into 8 sets:

\begin{matrix} m_{L R}^{(2)} : 1 \to 3, m_{R L}^{(2)} : 3 \to 1, m_{L R 2} : 1 \to {2, 3}, m_{R L 2} : 3 \to {1, 2} \end{matrix}

(19)

\begin{matrix} m_{2} : 1 \to 2, 3 \to 2, m_{2, L R} : 2 \to {1, 3}, m_{2, R} : 2 \to 3, m_{2, L} : 2 \to 1 \end{matrix}

(20)

Finally, for supernode 3 we have the 2 sets

\begin{matrix} m_{3} & : 1 \to 3, 1 \to {2, 3}, 2 \to 3, 2 \to {1, 3} \end{matrix}

(21)

\begin{matrix} m_{3, L} & : 3 \to 1, 3 \to 2, 3 \to {1, 2} \end{matrix}

(22)

The inequalities of Theorem 1 are

\begin{matrix} u = 1 : \{\begin{matrix} R_{1} + R_{1, R} \leq C_{1} \\ R_{1, R} \leq C_{1, 2} \end{matrix} \end{matrix}

(23)

\begin{matrix} u = 2 : \{\begin{matrix} max (R_{L R}^{(2)}, R_{R L}^{(2)}) + R_{L R 2} + R_{R L 2} + R_{2} + R_{2, L R} + R_{2, R} + R_{2, L} \leq C_{2} \\ R_{R L}^{(2)} + R_{R L 2} + R_{2, L R} + R_{2, L} \leq C_{2, 1} \\ R_{L R}^{(2)} + R_{L R 2} + R_{2, L R} + R_{2, R} \leq C_{2, 3} \end{matrix} \end{matrix}

(24)

\begin{matrix} u = 3 : \{\begin{matrix} R_{3} + R_{3, L} \leq C_{3} \\ R_{3, L} \leq C_{3, 2} \end{matrix} \end{matrix}

(25)

We discuss the 7 inequalities (23)–(25) in more detail. Consider first the converse. We write a classic cut as

(S, S^{c})

, where

S

is the set of nodes on one side of the cut and

S^{c}

is the set of nodes on the other side of the cut. The inequalities with the edge capacities

C_{1, 2}

,

C_{2, 1}

,

C_{2, 3}

,

C_{3, 2}

are classic cut bounds. For example, the cut

(S, S^{c}) = ({1 i, 1 o}, {2 i, 2 o, 3 i, 3 o})

gives the bound

R_{1, R} \leq C_{1, 2}

.

The inequalities with the “node" capacities

C_{1}

,

C_{2}

,

C_{3}

in (23)–(25) are not classic cut bounds. To see this, consider the bound

R_{1} + R_{1, R} \leq C_{1}

. A classic cut bound would require us to choose

{1 o, 2 o, 3 o} \subseteq S^{c}

because

m_{1}

and

m_{1, R}

generally include messages with positive rates for all supernodes. But then the only way to isolate the edge

(1 i, 1 o)

is to choose

S = {1 i}

which gives the too-weak bound

R_{1, R} \leq C_{1}

.

We require a stronger method and use PdE bounds. We use the notation in [3]:

E_{d}

is the edge cut,

S_{d}

is the set of sources whose sum-rate we bound,

π (\cdot)

is the permutation that defines the order in which we test the sources. Consider the edge cut

E_{d} = {(1 i, 1 o)}

, the source set

S_{d}

with the traffic sessions (17) and (18), and any permutation

π (\cdot)

for which the sessions (17) appear before the sessions (18). After removing edge

(1 i, 1 o)

the PdE algorithm removes edge

(1 o, 2 i)

because node

1 o

has no incoming edges. We next test if the sessions (17) are disconnected from one of their destinations; indeed they are because one of these destinations is node

1 o

. The PdE algorithm now removes the remaining edges in the graph because the nodes

2 i, 2 o, 3 i, 3 o

are not the sources of messages in (18). As a result, the remaining sessions (18) are disconnected from their destinations and the PdE bound gives

R_{1} + R_{1, R} \leq C_{1}

. The bound on

C_{3}

follows similarly. The bound on

C_{2}

is more subtle and we develop it in a more general context below (see the text after Example 2).

For achievability, note that all 7 inequalities are routing bounds except for the bound on

C_{2}

in (24). To approach this bound, we use a classic method and XOR the bits in sessions

1 \to 3

and

3 \to 1

before sending them through edge

(2 i, 2 o)

. More precisely, we combine

m (1 \to 3)

and

m (3 \to 1)

to form

\begin{matrix} m (1 \to 3) \oplus m (3 \to 1) \end{matrix}

(26)

by which we mean the bits formed when the smaller-rate message bits are XORed with a corresponding number of bits of the larger-rate message. The remaining larger-rate message bits are appended so that

m (1 \to 3) \oplus m (3 \to 1)

has rate

max (R (1 \to 3), R (3 \to 1))

. The message (26) is sent to node

2 o

together with the remaining messages received at node

2 i

. We must thus satisfy the bound on

C_{2}

in (24).

For the routing bounds there are two subtleties. First, node

2 o

forwards to the left the uncoded bits

m (3 \to {1, 2})

,

m (2 \to {1, 3})

, and

m (2 \to 1)

. However, it must treat

m (1 \to 3) \oplus m (3 \to 1)

specially because it cannot necessarily determine

m (3 \to 1)

and

m (1 \to 3)

. But if

R (1 \to 3) > R (3 \to 1)

then node

2 o

can remove the appended bits in (26) and communicate

m (3 \to 1)

to node

1 i

at rate

R (3 \to 1)

, rather than

max (R (1 \to 3), R (3 \to 1))

. If

R (1 \to 3) \leq R (3 \to 1)

then no bits should be removed and node

2 o

again communicates

m (3 \to 1)

to node

1 i

at rate

R (3 \to 1)

. The bits node

2 o

forwards to the right are treated similarly. In summary, the rates for messages

m (3 \to 1)

and

m (1 \to 3)

on edges

(2 o, 1 i)

and

(2 o, 3 i)

, respectively, are simply the classic routing rates.

The second routing subtlety is more straightforward: after node

1 i

receives the XORed bits, it can recover

m (3 \to 1)

by subtracting the bits

m (1 \to 3)

that it knows. Finally, node

1 i

transmits

m (3 \to 1)

to node

1 o

. Node

3 i

operates similarly.

Example 2 Consider Figure 1 with

N = 4

for which there are 28 possible multicast sessions. For supernode 1 we collect 19 of these sessions into 2 sets as follows.

\begin{matrix} m_{1} & : 2 \to 1, 2 \to {1, 3}, 2 \to {1, 4}, 2 \to {1, 3, 4}, \end{matrix}

\begin{matrix} 3 \to 1, 3 \to {1, 2}, 3 \to {1, 4}, 3 \to {1, 2, 4}, \end{matrix}

\begin{matrix} 4 \to 1, 4 \to {1, 2}, 4 \to {1, 3}, 4 \to {1, 2, 3} \end{matrix}

(27)

\begin{matrix} m_{1, R} & : 1 \to 2, 1 \to 3, 1 \to 4, \end{matrix}

\begin{matrix} 1 \to {2, 3}, 1 \to {2, 4}, 1 \to {3, 4}, \end{matrix}

\begin{matrix} 1 \to {2, 3, 4} \end{matrix}

(28)

The 9 sessions not involving supernode 1 are missing. The rate bounds for supernode 1 are given by (14)–(16) with

u = 1

. The messages and rate bounds for supernode 4 are similar.

Similarly, for supernode 2 we collect 26 of 28 sessions into 8 sets as follows.

\begin{matrix} m_{L R}^{(2)} & : 1 \to 3, 1 \to 4, 1 \to {3, 4} \end{matrix}

(29)

\begin{matrix} m_{R L}^{(2)} & : 3 \to 1, 3 \to {1, 4}, 4 \to 1, 4 \to {1, 3} \end{matrix}

(30)

\begin{matrix} m_{L R 2} & : 1 \to {2, 3}, 1 \to {2, 4}, 1 \to {2, 3, 4} \end{matrix}

(31)

\begin{matrix} m_{R L 2} & : 3 \to {1, 2}, 3 \to {1, 2, 4}, 4 \to {1, 2}, 4 \to {1, 2, 3} \end{matrix}

(32)

\begin{matrix} m_{2} & : 1 \to 2, 3 \to 2, 3 \to {2, 4}, 4 \to 2, 4 \to {2, 3} \end{matrix}

(33)

\begin{matrix} m_{2, L R} & : 2 \to {1, 3}, 2 \to {1, 4}, 2 \to {1, 3, 4} \end{matrix}

(34)

\begin{matrix} m_{2, R} & : 2 \to 3, 2 \to 4, 2 \to {3, 4} \end{matrix}

(35)

\begin{matrix} m_{2, L} & : 2 \to 1 \end{matrix}

(36)

Sessions

3 \to 4

and

4 \to 3

are missing. The rate bounds for supernode 2 are given by (14)–(16) with

u = 2

. The messages and rate bounds for supernode 3 are similar.

The converse and coding method for

N > 3

are entirely similar to the case

N = 3

. However, we have not yet developed the PdE bound for

N = 3

and edge cut

E_{d} = {(2 i, 2 o)}

. We do this now but in the more general context of

N \geq 2

and

E_{d} = {(u i, u o)}

for any u.

So consider the PdE bound with

E_{d} = {(u i, u o)}

and

S_{d}

having all the traffic sessions (6)–(13) except for (7). We choose

π (\cdot)

so that the sessions (8)–(10) appear first, the sessions (6) and (11)–(12) appear second, and the sessions (13) appear last. The PdE algorithm performs the following steps.

Remove $(u i, u o)$ and then remove $(u o, (u - 1) i)$ and $(u o, (u + 1) i)$ because node $u o$ has no incoming edges. The resulting graph at supernode u is shown in Figure 3.
Test if the sessions (8)–(10) (sessions $m_{L R u}$ , $m_{R L u}$ , $m_{u}$ ) are disconnected from one of their destinations, which they are because one of these destinations is node $u o$ .
Remove all edges to the right of supernode u because the nodes to the right are not the sources of the remaining sessions (6), (11)–(13) (sessions $m_{L R}^{(u)}$ , $m_{u, L R}$ , $m_{u, R}$ , and $m_{u, L}$ ).
Test if the sessions (6), (11) and (12) (sessions $m_{L R}^{(u)}$ , $m_{u, L R}$ , $m_{u, R}$ ) are disconnected from one of their destinations, which they are because one of these destinations is to the right of supernode u.
Remove all edges to the left of supernode u because the nodes to the left are not the sources of the sessions (13) (sessions $m_{u, L}$ ).
Test if the sessions (13) are disconnected from one of their destinations, which they are.

Since the algorithm completes successfully, the PdE bound (almost) gives inequality (14), but with

R_{L R}^{(u)}

replacing

max (R_{L R}^{(u)}, R_{R L}^{(u)})

. The other inequality, i.e., the one with

R_{R L}^{(u)}

replacing

max (R_{L R}^{(u)}, R_{R L}^{(u)})

follows by choosing

S_{d}

with all the traffic sessions (6)–(13) except for (6), and by modifying

π (\cdot)

so that the edges to the left of supernode u are removed first, and then the edges to the right.

Figure 3. Network at supernode u after the PdE bound has removed the edges

(u i, u o)

,

(u o, (u - 1) i)

, and

(u o, (u + 1) i)

. The session messages are tested in the order:

m_{L R u}

,

m_{R L u}

,

m_{u}

, then

m_{L R}^{(u)}

,

m_{u, L R}

,

m_{u, R}

, and finally

m_{u, L}

.

Figure 3. Network at supernode u after the PdE bound has removed the edges

(u i, u o)

,

(u o, (u - 1) i)

, and

(u o, (u + 1) i)

. The session messages are tested in the order:

m_{L R u}

,

m_{R L u}

,

m_{u}

, then

m_{L R}^{(u)}

,

m_{u, L R}

,

m_{u, R}

, and finally

m_{u, L}

.

3. Achievable Rates with Broadcast

We separate channel and network coding, which sounds simple enough. However, every BC receiver has side information about some of the messages being transmitted, so we will need the methods of [6]. We further use the theory in [5] to describe our achievable rate region.

We begin by having each node

u i

combine

m_{L R}^{(u)}

and

m_{R L}^{(u)}

into the message

\begin{matrix} m_{L R}^{(u)} \oplus m_{R L}^{(u)} \end{matrix}

(37)

by which we mean the same operation as in (26): the smaller-rate message bits are XORed with a corresponding number of bits of the larger-rate message. The remaining larger-rate message bits are appended so that

m_{L R}^{(u)} \oplus m_{R L}^{(u)}

has rate

max (R_{L R}^{(u)}, R_{R L}^{(u)})

. The message (37) is sent to node

u o

together with the remaining messages received at node

u i

. As a result, we must satisfy the bound (14).

The bits arriving at node

u o

are (37) and (8)–(13). Bits

m_{u}

are removed at node

u o

since this node is their final destination. The bits (37) and (8)–(9) and (11) must be broadcast to both nodes

(u - 1) i

and

(u + 1) i

. The remaining bits

m_{u, R}

and

m_{u, L}

are destined (or dedicated) for the right and left only, respectively. However, we know from information theory for broadcast channels [7] that it can help to broadcast parts of these dedicated messages to both receivers. So we split

m_{u, R}

and

m_{u, L}

into two parts each, namely the respective

(m_{u, R}^{'}, m_{u, R}^{''})

and

(m_{u, L}^{'}, m_{u, L}^{''})

where

m_{u, R}^{'}

and

m_{u, L}^{'}

are broadcast to both nodes

(u - 1) i

and

(u + 1) i

. The rates of

m_{u, R}^{'}

and

m_{u, R}^{''}

are the respective

R_{u, R}^{'}

and

R_{u, R}^{''}

, and similarly for

R_{u, L}^{'}

and

R_{u, L}^{''}

. We choose a joint distribution

P_{S_{u} T_{u} W_{u} X_{u}} (\cdot)

and generate a codebook of size

2^{n (max (R_{L R}^{(u)}, R_{R L}^{(u)}) + R_{L R u} + R_{R L u} + R_{u, L R} + R_{u, R}^{'} + R_{u, L}^{'})}

with length-n codewords

{\underset{̲}{w}}_{u} (m_{L R}^{(u)} \oplus m_{R L}^{(u)}, m_{L R u}, m_{R L u}, m_{u, L R}, m_{u, R}^{'}, m_{u, L}^{'})

by choosing every letter of every codeword independently using

P_{W_{u}} (\cdot)

.

We next choose “binning" rates

R_{T_{u}}

and

R_{S_{u}}

. For every

{\underset{̲}{w}}_{u}

, we choose

2^{n (R_{u, R}^{''} + R_{T_{u}})}

length-n codewords

{\underset{̲}{t}}_{u}

by choosing the ith letter

t_{u, i}

of

{\underset{̲}{t}}_{u}

via the distribution

P_{T_{u} | W_{u}} (\cdot | w_{u, i})

where

w_{u, i}

is the ith letter of

{\underset{̲}{w}}_{u}

. We label

{\underset{̲}{t}}_{u}

with the arguments of

{\underset{̲}{w}}_{u}

,

m_{u, R}^{''}

, and a “bin” index from

{1, 2, \dots, 2^{n R_{T_{u}}}}

. Similarly, for every

{\underset{̲}{w}}_{u}

we generate

2^{n (R_{u, L}^{''} + R_{S_{u}})}

length-n codewords

{\underset{̲}{s}}_{u}

generated via

P_{S_{u} | W_{u}} (\cdot)

and label

{\underset{̲}{s}}_{u}

with the arguments of

{\underset{̲}{w}}_{u}

,

m_{u, L}^{''}

, and a “bin” index from

{1, 2, \dots, 2^{n R_{S_{u}}}}

.

Next, the encoder tries to find a pair of bin indices such that

({\underset{̲}{w}}_{u}, {\underset{̲}{t}}_{u}, {\underset{̲}{s}}_{u})

is jointly typical according to one’s favorite flavor of typicality. Using standard typicality arguments (see, e.g., [5]) a typical triple exists with high probability if n is large and

\begin{matrix} R_{S_{u}} + R_{T_{u}} > I (S_{u}; T_{u} | W_{u}) \end{matrix}

(38)

Once this triple is found, we transmit a length-n signal

{\underset{̲}{x}}_{u}

that is generated via

P_{X_{u} | S_{u} T_{u} W_{u}} (\cdot | s_{u, i}, t_{u, i}, w_{u, i})

for

i = 1, 2, \dots, n

.

The receivers use joint typicality decoders to recover their messages. They further use their knowledge (or side-information) about some of the messages. The result is that decoding is reliable if n is large and if the following rate constraints are satisfied (see [5,6]):

\begin{matrix} R_{u, L}^{''} + R_{S_{u}} < I (S_{u}; Y_{u, u - 1} | W_{u}) \end{matrix}

(39)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, R}^{'} + R_{u, L} + R_{S_{u}} < I (S_{u} W_{u}; Y_{u, u - 1}) \end{matrix}

(40)

\begin{matrix} R_{u, R}^{''} + R_{T_{u}} < I (T_{u}; Y_{u, u + 1} | W_{u}) \end{matrix}

(41)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} + R_{u, L}^{'} + R_{T_{u}} < I (T_{u} W_{u}; Y_{u, u + 1}) \end{matrix}

(42)

Finally, we use Fourier–Motzkin elimination (see [5]) to remove

R_{S_{u}}

,

R_{T_{u}}

,

R_{u, L}^{'}

,

R_{u, R}^{'}

,

R_{u, L}^{''}

, and

R_{u, R}^{''}

from the above expressions and obtain the following result.

Theorem 2

An achievable rate region for a line network with broadcast channels is given by the bounds

\begin{matrix} max (R_{L R}^{(u)}, R_{R L}^{(u)}) + R_{L R u} + R_{R L u} + R_{u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq C_{u} \end{matrix}

(43)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, L} \leq I (S_{u} W_{u}; Y_{u, u - 1}) \end{matrix}

(44)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} \leq I (T_{u} W_{u}; Y_{u, u + 1}) \end{matrix}

(45)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (S_{u} W_{u}; Y_{u, u - 1}) + I (T_{u}; Y_{u, u + 1} | W_{u}) - I (S_{u}; T_{u} | W_{u}) \end{matrix}

(46)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (T_{u} W_{u}; Y_{u, u + 1}) + I (S_{u}; Y_{u, u - 1} | W_{u}) - I (S_{u}; T_{u} | W_{u}) \end{matrix}

(47)

\begin{matrix} R_{L R}^{(u)} + R_{R L}^{(u)} + R_{L R u} + R_{R L u} + 2 R_{u, L R} + R_{u, R} + R_{u, L} \end{matrix}

\begin{matrix} \leq I (S_{u} W_{u}; Y_{u, u - 1}) + I (T_{u} W_{u}; Y_{u, u + 1}) - I (S_{u}; T_{u} | W_{u}) \end{matrix}

(48)

for any choice of

P (s_{u}, t_{u}, w_{u}, x_{u})

and for all u, and where

S_{u} T_{u} W_{u} - X_{u} - Y_{u, u - 1} Y_{u, u + 1}

forms a Markov chain for all u.

Remark 3 The bound (43) is the same as (14).

Remark 4 The bounds (44)–(48) are similar to the bounds of [5, Theorem 5]. A few rates are “missing” because nodes

(u - 1) i

and

(u + 1) i

know

(m_{L R}^{(u)}, m_{L R u})

and

(m_{R L}^{(u)}, m_{R L u})

, respectively, when decoding.

Example 3 Consider

N = 3

for which we have the sessions (17)–(22). The inequalities of Theorem 2 are

\begin{matrix} u = 1 : \{\begin{matrix} R_{1} + R_{1, R} \leq C_{1} \\ R_{1, R} \leq I (T_{1}; Y_{1, 2} | W_{1}) - I (S_{1}; T_{1} | W_{1}) \end{matrix} \end{matrix}

(49)

\begin{matrix} u = 2 : \{\begin{matrix} max (R_{L R}^{(2)}, R_{R L}^{(2)}) + R_{L R 2} + R_{R L 2} + R_{2} + R_{2, L R} + R_{2, R} + R_{2, L} \leq C_{2} \\ T h e f i v e i n e q u a l i t i e s (44)–(48) w i t h u = 2 \end{matrix} \end{matrix}

(50)

\begin{matrix} u = 3 : \{\begin{matrix} R_{3} + R_{3, L} \leq C_{3} \\ R_{3, L} \leq I (S_{3}; Y_{3, 2} | W_{3}) - I (S_{3}; T_{3} | W_{3}) \end{matrix} \end{matrix}

(51)

Observe that the channels from node

1 o

to node

2 i

, and node

3 o

to node

2 i

, are memoryless channels with capacities

C_{1, 2}

and

C_{3, 2}

, respectively. In fact, from (49) and (51) it is easy to see that we may as well choose

W_{1}

,

S_{1}

,

W_{3}

, and

T_{3}

as constants. Moreover, we should choose

T_{1} = X_{1}

and

S_{3} = X_{3}

, and then choose the input distributions so that

I (X_{1}; Y_{1, 2}) = C_{1, 2}

and

I (X_{3}; Y_{3, 2}) = C_{3, 2}

. The inequalities (44)–(48) at node

u = 2

correspond to Marton’s region [10] (Section 7.8) for broadcast channels including a common rate. We will see in the next section that if we specialize to the model of [2] then only the bounds (43)–(45) remain at node 2 because the bounds (46)–(48) are redundant.

4. Special Channels

4.1. Orthogonal Channels

A BC

P_{Y_{1} Y_{2} | X}

is orthogonal if

X = (X_{1}, X_{2})

and

P_{Y_{1} Y_{2} | X} = P_{Y_{1} | X_{1}} P_{Y_{2} | X_{2}}

(see [8] (p. 419)). In fact, if all BCs in Figure 2 are orthogonal then the model reduces to that of Figure 1 so hopefully we recover Theorem 1 from Theorem 2.

Let

X_{u} = (X_{u, u - 1}, X_{u, u + 1})

and

Y_{u} = (Y_{u - 1, u}, Y_{u + 1, u})

. Suppose

C_{u, u - 1}

and

C_{u, u + 1}

are the respective capacities of the memoryless channels

P_{Y_{u, u - 1} | X_{u, u - 1}}

and

P_{Y_{u, u + 1} | X_{u, u + 1}}

. We choose

S_{u} = X_{u, u - 1}

,

T_{u} = X_{u, u + 1}

,

W_{u} = 0

, and

X_{u, u - 1}, X_{u, u + 1}

to be independent and capacity-achieving. Inequalities (44)–(48) reduce to

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, L} \leq C_{u, u - 1} \end{matrix}

(52)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} \leq C_{u, u + 1} \end{matrix}

(53)

The region of Theorem 1 is therefore achievable. The converse follows by using the same steps as in the converse of Theorem 1.

4.2. Deterministic Channels

A BC

P_{Y_{1} Y_{2} | X}

is deterministic if

Y_{1} = f_{1} (X)

and

Y_{2} = f_{2} (X)

for some functions

f_{1} (\cdot)

and

f_{2} (\cdot)

. We show that Theorem 2 gives the capacity region if all BCs in Figure 2 are deterministic.

Theorem 3

The capacity region of a line network with deterministic BCs is the union over all

P (w_{u}, x_{u})

,

u = 1, 2, \dots, N

, of the (non-negative) rates satisfying

\begin{matrix} max (R_{L R}^{(u)}, R_{R L}^{(u)}) + R_{L R u} + R_{R L u} + R_{u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq C_{u} \end{matrix}

(54)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, L} \leq H (Y_{u, u - 1}) \end{matrix}

(55)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} \leq H (Y_{u, u + 1}) \end{matrix}

(56)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (W_{u}; Y_{u, u - 1}) + H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(57)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (W_{u}; Y_{u, u + 1}) + H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(58)

\begin{matrix} R_{L R}^{(u)} + R_{R L}^{(u)} + R_{L R u} + R_{R L u} + 2 R_{u, L R} + R_{u, R} + R_{u, L} \end{matrix}

\begin{matrix} \leq I (W_{u}; Y_{u, u - 1}) + I (W_{u}; Y_{u, u + 1}) + H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(59)

Proof.

Achievability follows by Theorem 2 with

S_{u} = Y_{u, u - 1}

and

T_{u} = Y_{u, u + 1}

. For the converse, the constraint (54) is the PdE bound of [11] (Section III.A). The bounds (55) and (56) are cut bounds. For the remaining steps, let

S^{c}

be the complement of

S

in

V

. We define

\begin{matrix} Y_{S, T} = {Y_{u, v} : u \in S, v \in T} \end{matrix}

(60)

Let

M_{u, L}

be the random message corresponding to

m_{u, L}

, and similarly for the other messages. The messages are independent and have entropy equal to n times their rate, where n is the number of times we use each BC. Let

M (S)

be the set of messages originating at supernodes in

S

. Let

M_{u, L}^{c}

to be the set of all network messages except for

M_{u, L}

, and similarly for other messages. We use the notation

\begin{matrix} Y_{u, u - 1}^{i - 1} = Y_{u, u - 1, 1}, Y_{u, u - 1, 2}, \dots, Y_{u, u - 1, i - 1} \\ {\tilde{W}}_{u, i} = {(M_{u, L} M_{u, R})}^{c} Y_{u, u - 1}^{i - 1} Y_{u, u + 1}^{i - 1} \end{matrix}

For the following, let

S = {u, u + 1, \dots, N}

and

\tilde{S} = {1, 2, \dots, u}

. We bound

\begin{matrix} I (M_{R L}^{(u)} M_{R L u} M_{u, L R}; Y_{S, S^{c}}^{n} M (S^{c})) & = I (M_{R L}^{(u)} M_{R L u} M_{u, L R}; Y_{u, u - 1}^{n} | M (S^{c})) \end{matrix}

\begin{matrix} \overset{(a)}{\leq} I ({(M_{u, L} M_{u, R})}^{c}; Y_{u, u - 1}^{n}) \end{matrix}

\begin{matrix} = \sum_{i = 1}^{n} I ({(M_{u, L} M_{u, R})}^{c}; Y_{u, u - 1, i} | Y_{u, u - 1}^{i - 1}) \end{matrix}

\begin{matrix} \overset{(a)}{\leq} \sum_{i = 1}^{n} I ({\tilde{W}}_{u, i}; Y_{u, u - 1, i}) \end{matrix}

\begin{matrix} \overset{(b)}{=} n I ({\tilde{W}}_{u, Q}; Y_{u, u - 1, Q} | Q) \end{matrix}

\begin{matrix} \overset{(c)}{\leq} n I (W_{u}; Y_{u, u - 1}) \end{matrix}

(61)

where steps

(a)

follow by

I (A; B | C) \leq I (A C D; B)

, step

(b)

follows by defining Q to be a time-sharing random variable that is uniform over

1, 2, \dots, n

, and

(c)

follows by defining

Y_{u, u - 1} = Y_{u, u - 1, Q}

and

W_{u} = {\tilde{W}}_{u, Q} Q

. We similarly have

\begin{matrix} I (M_{L R}^{(u)} M_{L R u} M_{u, L R}; Y_{\tilde{S}, {\tilde{S}}^{c}}^{n} M ({\tilde{S}}^{c})) & \leq n I (W_{u}; Y_{u, u + 1}) \end{matrix}

(62)

where

Y_{u, u + 1} = Y_{u, u + 1, Q}

. Note that our choices for

Y_{u, u - 1}

and

Y_{u, u + 1}

are appropriate for the cut bounds (55) and (56). Finally, we have

\begin{matrix} I (M_{u, L} M_{u, R}; Y_{{u}, V}^{n} {(M_{u, L} M_{u, R})}^{c}) & = I (M_{u, L} M_{u, R}; Y_{u, u - 1}^{n} Y_{u, u + 1}^{n} | {(M_{u, L} M_{u, R})}^{c}) \end{matrix}

\begin{matrix} = \sum_{i = 1}^{n} H (Y_{u, u - 1, i} Y_{u, u + 1, i} | {\tilde{W}}_{u, i}) \end{matrix}

\begin{matrix} = n H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(63)

Consider the bound (57). We have

\begin{matrix} n (R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, R} + R_{u, L}) \end{matrix}

\begin{matrix} \overset{(a)}{\leq} I (M_{R L}^{(u)} M_{R L u} M_{u, L R}; Y_{S, S^{c}}^{n} M (S^{c})) + I (M_{u, L} M_{u, R}; Y_{{u}, V}^{n} {(M_{u, L} M_{u, R})}^{c}) \end{matrix}

\begin{matrix} \overset{(b)}{\leq} n I (W_{u}; Y_{u, u - 1}) + n H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(64)

where

(a)

follows by Fano’s inequality [8] (p. 38) when the block error probability tends to zero, and

(b)

follows by (61) and (63). This proves (57), and (58) follows in the same way.

Finally, for (59) we use Fano’s inequality to bound

\begin{matrix} n (R_{L R}^{(u)} + R_{R L}^{(u)} + R_{L R u} + R_{R L u} + 2 R_{u, L R} + R_{u, R} + R_{u, L}) \end{matrix}

\begin{matrix} \leq I (M_{R L}^{(u)} M_{R L u} M_{u, L R}; Y_{S, S^{c}}^{n} M (S^{c})) + I (M_{L R}^{(u)} M_{L R u} M_{u, L R}; Y_{\tilde{S}, {\tilde{S}}^{c}}^{n} M ({\tilde{S}}^{c})) \end{matrix}

\begin{matrix} + I (M_{u, L} M_{u, R}; Y_{{u}, V}^{n} {(M_{u, L} M_{u, R})}^{c}) \end{matrix}

\begin{matrix} \leq n I (W_{u}; Y_{u, u - 1}) + n I (W_{u}; Y_{u, u + 1}) + n H (Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

(65)

This proves Theorem 3. ■

4.3. Physically Degraded Channels

A BC

P_{Y_{1} Y_{2} | X}

is said to be physically degraded if either

X - Y_{1} - Y_{2} or X - Y_{2} - Y_{1}

forms Markov chains (see [8] (p. 422)). For the following theorem, we suppose that

X - Y_{u, u - 1} - Y_{u, u + 1}

forms a Markov chain for all u. However, the direction of degradation can be adjusted either way for any supernode u.

Theorem 4

The capacity region of a line network with physically degraded BCs is the union over all

P (w_{u}, x_{u})

,

u = 1, 2, \dots, N

, of the (non-negative) rates satisfying

\begin{matrix} max (R_{L R}^{(u)}, R_{R L}^{(u)}) + R_{L R u} + R_{R L u} + R_{u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq C_{u} \end{matrix}

(66)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} \leq I (W_{u}; Y_{u, u + 1}) \end{matrix}

(67)

\begin{matrix} R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (X_{u}; Y_{u, u - 1}) \end{matrix}

(68)

\begin{matrix} R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} + R_{u, L} \leq I (W_{u}; Y_{u, u + 1}) + I (X_{u}; Y_{u, u - 1} | W_{u}) \end{matrix}

(69)

and where

W_{u} - X_{u} - Y_{u, u - 1} - Y_{u, u + 1}

forms a Markov chain.

Proof.

For achievability, Theorem 2 with

S_{u} = X_{u}

and

T_{u} = 0

gives the region specified by (66)–(69). For the converse, the bound (66) is based on an extension of PdE bounds to mixed wireline/wireless networks (see [11]). The bound (68) is a cut bound. The other two bounds follow by modifying the steps of [12] as follows.

Consider

{\tilde{W}}_{u, i} = M_{u, L}^{c} Y_{u, u - 1}^{i - 1} Y_{u, u + 1}^{i - 1}

and let

\tilde{S} = {1, 2, \dots, u}

. We then have

\begin{matrix} n (R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R}) & \overset{(a)}{\leq} I (M_{L R}^{(u)} M_{L R u} M_{u, L R} M_{u, R}; Y_{\tilde{S}, {\tilde{S}}^{c}}^{n} M ({\tilde{S}}^{c})) \end{matrix}

\begin{matrix} = I (M_{L R}^{(u)} M_{L R u} M_{u, L R} M_{u, R}; Y_{u, u + 1}^{n} | M ({\tilde{S}}^{c})) \end{matrix}

\begin{matrix} = \sum_{i = 1}^{n} H (Y_{u, u + 1, i} | M ({\tilde{S}}^{c}) Y_{u, u + 1}^{i - 1}) \end{matrix}

\begin{matrix} - H (Y_{u, u + 1, i} | M_{L R}^{(u)} M_{L R u} M_{u, L R} M_{u, R} M ({\tilde{S}}^{c}) Y_{u, u + 1}^{i - 1}) \end{matrix}

\begin{matrix} \leq \sum_{i = 1}^{n} H (Y_{u, u + 1, i}) - H (Y_{u, u + 1, i} | {\tilde{W}}_{u, i}) \end{matrix}

\begin{matrix} = n I ({\tilde{W}}_{u, Q}; Y_{u, u + 1, Q} | Q) \end{matrix}

\begin{matrix} \overset{(b)}{\leq} n I (W_{u}; Y_{u, u + 1}) \end{matrix}

(70)

where

(a)

follows by Fano’s inequality and

(b)

follows by defining

W_{u} = {\tilde{W}}_{u, Q} Q

and

Y_{u, u + 1} = Y_{u, u + 1, Q}

. We similarly define

X_{u} = X_{u, Q}

and

Y_{u, u - 1} = Y_{u, u - 1, Q}

. Note that our choices for

X_{u}

and

Y_{u, u - 1}

are appropriate for the cut bound (68).

Finally, for (69) we use (70) to bound

\begin{matrix} n (R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R} + R_{u, L}) & \leq n I (W_{u}; Y_{u, u + 1}) + n I (M_{u, L}; Y_{{u}, V}^{n} M_{u, L}^{c}) \end{matrix}

\begin{matrix} = n I (W_{u}; Y_{u, u + 1}) + I (M_{u, L}; Y_{u, u - 1}^{n} Y_{u, u + 1}^{n} | M_{u, L}^{c}) \end{matrix}

(71)

and

\begin{matrix} I (M_{u, L}; Y_{u, u - 1}^{n} Y_{u, u + 1}^{n} | M_{u, L}^{c}) & \overset{(a)}{=} \sum_{i = 1}^{n} I (M_{u, L} X_{u, i}; Y_{u, u - 1, i} Y_{u, u + 1, i} | {\tilde{W}}_{u, i}) \end{matrix}

\begin{matrix} \overset{(b)}{=} n I (X_{u, Q}; Y_{u, u - 1, Q} Y_{u, u + 1, Q} | {\tilde{W}}_{u, Q} Q) \end{matrix}

\begin{matrix} = n I (X_{u}; Y_{u, u - 1} Y_{u, u + 1} | W_{u}) \end{matrix}

\begin{matrix} \overset{(b)}{=} n I (X_{u}; Y_{u, u - 1} | W_{u}) \end{matrix}

(72)

where

(a)

follows because

X_{u, i}

is defined by the messages at supernode u and the past channel outputs at supernode u, and steps

(b)

follow because

Y_{u, u - 1}^{i - 1} Y_{u, u + 1}^{i - 1} M (V) - X_{u, i} - Y_{u, u - 1, i} - Y_{u, u + 1, i}

forms a (long) Markov chain. Collecting the bounds (70)–(72) proves Theorem 4. ■

4.4. Physically Degraded Gaussian Channels

The additive white Gaussian noise (AWGN) and physically degraded BC has (see [13])

\begin{matrix} Y_{u, u - 1} & = X_{u} + Z_{u, u - 1} \end{matrix}

(73)

\begin{matrix} Y_{u, u + 1} & = Y_{u, u - 1} + Z_{u, u + 1}^{'} \end{matrix}

(74)

where

X_{u}

is real with power constraint

\sum_{i = 1}^{n} X_{u, i}^{2} \leq n P_{u}

for all u, and

Z_{u, u - 1}

and

Z_{u, u + 1}^{'}

are independent Gaussian random variables with variances

N_{u, u - 1}

and

N_{u, u + 1}^{'}

, respectively (again, the direction of degradation can be swapped for any u without changing the results conceptually).

The capacity region is given by Theorem 4 and it remains to optimize

P (w_{u}, x_{u})

. The variances of

Y_{u, u - 1}

and

Y_{u, u + 1}

are at most

P_{u} + N_{u, u - 1}

and

P_{u} + N_{u, u - 1} + N_{u, u + 1}^{'}

, respectively, so the maximum entropy theorem (see [8] (p. 234)) gives

\begin{matrix} I (W_{u}; Y_{u, u + 1}) \leq \frac{1}{2} log (2 π e (P_{u} + N_{u, u + 1})) - h (Y_{u, u + 1} | W_{u}) \end{matrix}

(75)

\begin{matrix} I (X_{u}; Y_{u, u - 1}) \leq \frac{1}{2} log (1 + P_{u} / N_{u, u - 1}) \end{matrix}

(76)

\begin{matrix} I (X_{u}; Y_{u, u - 1} | W_{u}) \leq h (Y_{u, u - 1} | W_{u}) - \frac{1}{2} log (2 π e N_{u, u - 1}) \end{matrix}

(77)

where

N_{u, u + 1} = N_{u, u - 1} + N_{u, u + 1}^{'}

and

h (Y | W)

is the differential entropy of Y conditioned on W. Observe that

\begin{matrix} \frac{1}{2} log (2 π e N_{u, u - 1}) \leq h (Y_{u, u - 1} | W_{u}) \leq \frac{1}{2} log (2 π e (P_{u} + N_{u, u - 1})) \end{matrix}

(78)

so there is an

α_{u}

,

0 \leq α_{u} \leq 1

, such that

\begin{matrix} \frac{1}{2 π e} e^{2 h (Y_{u, u - 1} | W_{u})} = α_{u} P_{u} + N_{u, u - 1} \end{matrix}

(79)

Furthermore, a conditional version of the entropy power inequality (see [8] (p. 496)) gives

\begin{matrix} h (Y_{u, u + 1} | W_{u}) = h (Y_{u, u - 1} + Z_{u, u + 1}^{'} | W_{u}) \geq \frac{1}{2} log (e^{2 h (Y_{u, u - 1} | W_{u})} + 2 π e N_{u, u + 1}^{'}) \end{matrix}

(80)

Collecting the bounds, and inserting (79) and (80) into (77), we have

\begin{matrix} I (W_{u}; Y_{u, u + 1}) \leq \frac{1}{2} log (1 + \frac{(1 - α_{u}) P_{u}}{α_{u} P_{u} + N_{u, u + 1}}) \end{matrix}

(81)

\begin{matrix} I (X_{u}; Y_{u, u - 1}) \leq \frac{1}{2} log (1 + P_{u} / N_{u, u - 1}) \end{matrix}

(82)

\begin{matrix} I (X_{u}; Y_{u, u - 1} | W_{u}) \leq \frac{1}{2} log (1 + α_{u} P_{u} / N_{u, u - 1}) . \end{matrix}

(83)

But we achieve equality in (81)–(83) by choosing

\begin{matrix} X_{u} = V_{u} + W_{u} \end{matrix}

(84)

where

V_{u}

and

W_{u}

are independent Gaussian random variables with zero-mean and variances

α_{u} P_{u}

and

(1 - α_{u}) P_{u}

, respectively. The optimal

P (w_{u}, x_{u})

is therefore zero-mean Gaussian, and the capacity region is given by inserting (81)–(83) with equality into (67)–(69), and taking the union over the rates permitted by varying

α_{u}

.

4.5. Packet Erasure Channels with Feedback

A BC

P_{Y_{1} Y_{2} | X}

is called packet erasure with feedback if X is an L-bit vector and

\begin{matrix} P (y_{1}, y_{2} | x) & = \{\begin{matrix} (1 - p_{1}) \cdot (1 - p_{2}), & y_{1} = y_{2} = x \\ p_{1} \cdot (1 - p_{2}), & y_{1} = Δ, y_{2} = x \\ (1 - p_{1}) \cdot p_{2}, & y_{1} = x, y_{2} = Δ \\ p_{1} \cdot p_{2}, & y_{1} = y_{2} = Δ \end{matrix} \end{matrix}

(85)

and all supernodes receive one bit of feedback from each receiver telling them whether the receiver has seen an erasure or not.

Suppose we give receiver 1 both

Y_{1}

and

Y_{2}

, which means that the channel is physically degraded. Let

R_{1}

be the resulting capacity region. Similarly, let

R_{2}

be the capacity region if we (instead) give receiver 2 both

Y_{1}

and

Y_{2}

. The authors of [14] (see also [15]) showed that the capacity region of the original BC is

R_{1} \cap R_{2}

. The following theorem slightly generalizes the main result of [14] and gives the capacity of line networks with broadcast erasure channels and feedback. The input

X_{u}

has

L_{u}

bits and we denote the erasure probabilities for

Y_{u, u - 1}

and

Y_{u, u + 1}

as

p_{u, u - 1}

and

p_{u, u + 1}

, respectively.

Theorem 5

The capacity region of a line network with broadcast erasure channels and feedback is the union of the (non-negative) rates satisfying (14) and

\begin{matrix} \frac{R_{R L}^{(u)} + R_{R L u} + R_{u, L R} + R_{u, L}}{1 - p_{u, u - 1}} + \frac{R_{u, R}}{1 - p_{u, u - 1} p_{u, u + 1}} \leq L_{u} \end{matrix}

(86)

\begin{matrix} \frac{R_{L R}^{(u)} + R_{L R u} + R_{u, L R} + R_{u, R}}{1 - p_{u, u + 1}} + \frac{R_{u, L}}{1 - p_{u, u - 1} p_{u, u + 1}} \leq L_{u} \end{matrix}

(87)

Proof.

(Sketch) Achievability follows by using the network codes of [14] and [2]. For the converse, the constraint (14) again follows from PdE bounds. For the constraints (86) and (87), we make every BC physically degraded by giving one of the receivers both channel outputs (see [14,16]). Theorem 4 gives a collection of outer bounds for each degradation choice. Finally, we optimize the coding to obtain (86) and (87). ■

5. Discussion

The capacity results in Section 4.1, Section 4.2, Section 4.3, Section 4.4, Section 4.5 imply that decode-forward (DF) relaying suffices, i.e., amplify-forward (AF) and compress-forward (CF) do not improve rates (see also [19] ([Chapter 4])). Quantize-map-forward [17] and noisy network coding [18] also do not improve on DF. In fact, the non-DF methods are suboptimal in general because they do not use superposition coding or binning to treat broadcasting. However, we have found capacity only for BCs that are orthogonal, deterministic, physically degraded, or packet erasure with one-bit feedback. AF and CF strategies are useful for other classes of BCs, as shown in [20] and many further papers.

Finally, our model applies to wireless problems where every node has a dedicated tone and/or time slot for transmission. If nodes use the same tone at the same time, then one must consider the effects of interference. For example, scheduling transmissions with half-duplex protocols is an interesting problem for further study.

Acknowledgments

G. Kramer was supported by an Alexander von Humboldt Professorship endowed by the German Federal Ministry of Education and Research. He was also supported by the Board of Trustees of the University of Illinois Subaward No. 04-217 under NSF Grant No. CCR-0325673, by ARO Grant W911NF-06-1-0182, and by NSF Grant CCF-0905235. S. M. Sadegh Tabatabaei Yazdi was supported by NSF Grant No. CCF-0430201. The material in this paper was presented in part at the 2008 Conference on Information Sciences and Systems, Princeton, NJ, USA, and at the 2009 IEEE Information Theory Workshop, Taormina, Italy. The former paper was co-authored with Serap Savari from Texas A&M University. She declined to be included as a co-author and we are grateful for her contributions.

References

Kramer, G. Capacity results for the discrete memoryless network. IEEE Trans. Inform. Theor. 2003, 49, 4–21. [Google Scholar] [CrossRef]
Tabatabaei Yazdi, S.M.S.; Savari, S.A.; Kramer, G. Network coding in node-constrained line and star networks. IEEE Trans. Inform. Theor. 2011, 57, 4452–4468. [Google Scholar] [CrossRef]
Kramer, G.; Savari, S.A. Edge-cut bounds on network coding rates. J. Netw. Syst. Manag. 2006, 14, 49–67. [Google Scholar] [CrossRef]
Rankov, B.; Wittneben, A. Spectral efficient protocols for half-duplex fading relay channels. IEEE J. Sel. Area. Comm. 2007, 25, 379–389. [Google Scholar] [CrossRef]
Liang, Y.; Kramer, G. Rate regions for relay broadcast channels. IEEE Trans. Inform. Theor. 2007, 53, 3517–3535. [Google Scholar] [CrossRef]
Kramer, G.; Shamai, S. Capacity for classes of broadcast channels with receiver side information. In Proceedings of the IEEE Information Theory Workshop, Tahoe City, CA, USA, 2–6 September 2007; pp. 313–318.
Cover, T.M. Broadcast channels. IEEE Trans. Inform. Theor. 1972, 18, 2–14. [Google Scholar] [CrossRef]
Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley & Sons: New York, NY, USA, 1991. [Google Scholar]
Bakshi, M.; Effros, M.; Gu, W.; Koetter, R. On network coding of independent and dependent sources in line networks. In Proceedings of the IEEE International Symposium on Information Theory, Nice, France, 24–29 June 2007.
Kramer, G. Topics in multi-user information theory. Found. Trends Commun. Inf. Theory 2007, 4, 265–444. [Google Scholar] [CrossRef]
Kramer, G.; Savari, S.A. Capacity bounds for relay networks. In Presented at the Workshop on Information Theory and Applications, UCSD Campus, La Holla, CA, USA, 6–10 February 2006.
Gamal, A.E. The feedback capacity of degraded broadcast channels. IEEE Trans. Inform. Theor. 1978, 24, 379–381. [Google Scholar] [CrossRef]
Gamal, A.E.; van der Meulen, E.C. A proof of Marton’s coding theorem for the discrete memoryless broadcast channel. IEEE Trans. Inform. Theor. 1981, 27, 120–122. [Google Scholar] [CrossRef]
Georgiadis, L.; Tassiulas, L. Broadcast erasure channel with feedback—Capacity and algorithms. In Proceedings of the Workshop on Network Coding, Theory, and Applications, Lausanne, Switzerland, 15–16 June 2009.
Wang, C.C. On the capacity of 1-to-K broadcast packet erasure channels with channel output feedback. IEEE Trans. Inform. Theor. 2012, 58, 931–956. [Google Scholar] [CrossRef]
Ozarow, L. The capacity of the white Gaussian multiple access channel with feedback. IEEE Trans. Inform. Theor. 1984, 30, 623–629. [Google Scholar] [CrossRef]
Avestimehr, A.S.; Diggavi, S.N.; Tse, D.N.C. Wireless network information flow: A deterministic approach. IEEE Trans. Inform. Theor. 2011, 57, 1872–1905. [Google Scholar] [CrossRef]
Lim, S.H.; Kim, Y.H.; Gamal, A.E.; Chung, S.Y. Noisy network coding. IEEE Trans. Inform. Theor. 2011, 57, 3132–3152. [Google Scholar] [CrossRef]
Kramer, G.; Marić, I.; Yates, R.D. Cooperative communications. Found. Trends Netw. 2006, 1, 271–425. [Google Scholar] [CrossRef]
Knopp, R. Two-way radio networks with a star topology. In Proceedings of the 2006 International Zurich Seminar on Communications, Zurich, Switzerland, February 2006; pp. 154–157.

© 2012 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Kramer, G.; Yazdi, S.M.T. Network Coding for Line Networks with Broadcast Channels. Entropy 2012, 14, 1813-1828. https://doi.org/10.3390/e14101813

AMA Style

Kramer G, Yazdi SMT. Network Coding for Line Networks with Broadcast Channels. Entropy. 2012; 14(10):1813-1828. https://doi.org/10.3390/e14101813

Chicago/Turabian Style

Kramer, Gerhard, and Seyed Mohammadsadegh Tabatabaei Yazdi. 2012. "Network Coding for Line Networks with Broadcast Channels" Entropy 14, no. 10: 1813-1828. https://doi.org/10.3390/e14101813

APA Style

Kramer, G., & Yazdi, S. M. T. (2012). Network Coding for Line Networks with Broadcast Channels. Entropy, 14(10), 1813-1828. https://doi.org/10.3390/e14101813

Article Menu

Network Coding for Line Networks with Broadcast Channels

Abstract

1. Introduction

2. Review of Wireline Capacity

3. Achievable Rates with Broadcast

4. Special Channels

4.1. Orthogonal Channels

4.2. Deterministic Channels

4.3. Physically Degraded Channels

4.4. Physically Degraded Gaussian Channels

4.5. Packet Erasure Channels with Feedback

5. Discussion

Acknowledgments

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI