
Capacity Bounds and Mapping Design for Binary Symmetric Relay Channels

School of Electrical Engineering and ACCESS Linnaeus Center, Royal Institute of Technology (KTH), Stockholm, 100 44, Sweden
* Authors to whom correspondence should be addressed.
Entropy 2012, 14(12), 2589-2610; https://doi.org/10.3390/e14122589
Submission received: 1 September 2012 / Revised: 4 December 2012 / Accepted: 6 December 2012 / Published: 17 December 2012
(This article belongs to the Special Issue Information Theory Applied to Animal Communication)

Abstract:
Capacity bounds for a three-node binary symmetric relay channel with orthogonal components at the destination are studied. The cut-set upper bound and the rates achievable using decode-and-forward (DF), partial DF and compress-and-forward (CF) relaying are first evaluated. Then relaying strategies with finite memory-length are considered. An efficient algorithm for optimizing the relay functions is presented. The Boolean Fourier transform is then employed to unveil the structure of the optimized mappings. Interestingly, the optimized relay functions exhibit a simple structure. Numerical results illustrate that the rates achieved using the optimized low-dimensional functions are either comparable to those achieved by CF or superior to those achieved by DF relaying. In particular, the optimized low-dimensional relaying scheme can improve on DF relaying when the quality of the source-relay link is worse than or comparable to that of other links.

1. Introduction

In this paper we consider a relay network consisting of a source, a relay, and a destination, as illustrated in Figure 1. This channel is studied in [1,2]. The communication task is to reproduce the transmitted message M, chosen uniformly from the set $\mathcal{M} = \{1, 2, \ldots, 2^{nR}\}$, at the destination such that the probability of error is arbitrarily small. The transmission of a message consumes n channel uses. We wish to quantify the supremum of the set of rates R for which the average message error probability at the destination can be made to approach zero as the number of channel uses n goes to infinity. This supremum is the capacity C of the communication between the source and the destination. The capacity of the general relay channel is still an open problem.
Figure 1. Three-node relay network.
One special case of the general relay channel is the Gaussian relay channel, which is studied, e.g., in [2,3,4,5,6,7,8,9]. In this paper, however, we focus on a relay network in which each link is a binary symmetric channel. Some instances of the binary symmetric relay channel (BSRC) are considered in [10,11,12,13,14]. The work in [10,11] considers BSRCs with correlated noises, while [12,13] focus on the detect-and-forward protocol for multi-hop relaying and the relay channel, respectively. In this paper, we consider a model that is also studied in an independent work [14]. We study decode-and-forward (DF), partial decode-and-forward (PDF), compress-and-forward (CF) and general finite-memory relay mappings. One of our main contributions is to propose a general structure for finite-dimensional relaying. We show that one can improve on the result presented in [14], and we illustrate that it is possible to approach the capacity upper bound in some cases by using the proposed low-dimensional mappings. Interestingly, we recover the lower bound in [14] by a simple one-dimensional mapping at the relay. We also illustrate that one can obtain higher achievable rates by employing optimized relaying functions with larger memories. The rates obtained via optimized finite-length mappings can be superior to those achieved by the DF protocol in some cases, and are comparable to those achieved by the CF protocol in general.
The remainder of the paper is organized as follows. In Section 2, we present the three-node BSRC with orthogonal components at the destination. Section 3 derives capacity bounds for the BSRC when the relay is assumed to have infinite memory. Section 4 investigates capacity bounds for the BSRC when the relay has a finite memory, and presents an algorithm for optimizing the relay mapping. In Section 5 we compare the achievable rates as a function of the memory length. Finally, Section 6 concludes the paper.
Notation: We use the following notation for brevity.
  • $\mathrm{GF}_b$ denotes the binary Galois field, i.e., $\{0, 1\}$.
  • $\mathrm{GF}_b^k$ denotes the k-dimensional binary Galois field, i.e., $\{0, 1\}^k$.
  • $H_b(\epsilon) := -\epsilon \log(\epsilon) - (1 - \epsilon) \log(1 - \epsilon)$ denotes the binary entropy function, where $\epsilon \in [0, 1]$.
  • $D_H(x_1^k, y_1^k)$ denotes the Hamming distance between two binary sequences of length k.
  • The operation $*$ is defined as $\epsilon_1 * \epsilon_2 := \epsilon_1 + \epsilon_2 - 2\epsilon_1\epsilon_2$.
  • We denote that the binary random variable Z has a Bernoulli distribution by $Z \sim \mathrm{Ber}(\epsilon)$, where $\Pr(Z = 1) = \epsilon$ and $\Pr(Z = 0) = 1 - \epsilon$.
  • We denote the binary Kronecker delta function by $\delta_b(x)$, where $\delta_b(0) = 1$ and $\delta_b(1) = 0$, for $x \in \mathrm{GF}_b$.
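As an illustration, the two quantities used most often below, the binary entropy function $H_b$ and the binary convolution $*$, translate into a few lines of Python (a minimal sketch of our own; the function names Hb and star are not from the paper). Later sketches in this paper reuse these helpers.

```python
import numpy as np

def Hb(eps):
    """Binary entropy H_b(eps) in bits, with H_b(0) = H_b(1) = 0 by convention."""
    if eps in (0.0, 1.0):
        return 0.0
    return -eps * np.log2(eps) - (1 - eps) * np.log2(1 - eps)

def star(e1, e2):
    """Binary convolution e1 * e2 = e1 + e2 - 2 e1 e2 (crossover of two cascaded BSCs)."""
    return e1 + e2 - 2 * e1 * e2
```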

2. Binary Symmetric Relay Channel

In this section, we introduce the channel model considered in the paper. Figure 2 shows a BSRC consisting of three nodes: a source, a relay and a destination. In this model, we assume that the received signals at the destination are orthogonal; that is, the signals transmitted from the source and the relay do not interfere with each other. We further assume that all links are corrupted by modulo-sum noise with Bernoulli distributions, and that all quantities are binary, i.e., $X, X_r, Y_1, Y_2, Y_r \in \{0, 1\}$.
The received signal at the relay, $Y_r$, is given by
$$Y_r = X \oplus Z_r \qquad (1)$$
where X is the symbol transmitted from the source and $Z_r \sim \mathrm{Ber}(\epsilon_r)$ is the additive Bernoulli noise. (The Bernoulli noise models, for example, the conventional communication set-up using BPSK modulation followed by a hard decision; see also [14].) The received signal from the source at the destination, $Y_1$, is given by
$$Y_1 = X \oplus Z_1 \qquad (2)$$
where $Z_1 \sim \mathrm{Ber}(\epsilon_1)$ is the additive Bernoulli noise. Similarly, the received signal from the relay at the destination, $Y_2$, is given by
$$Y_2 = X_r \oplus Z_2 \qquad (3)$$
where $X_r$ is the symbol transmitted from the relay and $Z_2 \sim \mathrm{Ber}(\epsilon_2)$ is the additive Bernoulli noise. We assume that the random variables $Z_r$, $Z_1$ and $Z_2$ are mutually independent. Note that the addition in Equations (1)–(3) is done in $\mathrm{GF}(2)$.
Remark 1. Figure 3 shows a BSRC with non-orthogonal reception at the destination. The received signal at the destination is given by
$$Y = X \oplus X_r \oplus Z \qquad (4)$$
where X and $X_r$ respectively denote the symbols transmitted by the source and the relay. The random variable $Z \sim \mathrm{Ber}(\epsilon)$ is the additive Bernoulli noise and is independent of $Z_r$. The capacity of this channel is
$$C_{no} = 1 - H_b(\epsilon) \qquad (5)$$
By setting $X_r = 0$, we have $Y = X \oplus Z$. The achievability then follows since $\max_{p(x)} I(X; Y | X_r = 0) = C_{no}$. The converse follows from the cut-set bound: invoking the multiple-access bound, we have
$$C \le I(X, X_r; Y) = H(Y) - H(Y | X, X_r) = H(Y) - H(Z) \le 1 - H_b(\epsilon) \qquad (6)$$
In the sequel, we present various capacity bounds for the orthogonal BSRC.
Figure 2. Orthogonal Binary Symmetric Relay Channel.
Figure 3. Non-Orthogonal Binary Symmetric Relay Channel.

3. Capacity Bounds for the Orthogonal BSRC: Infinite Memory Relay Case

In this section, we consider the cut-set upper bound and three lower bounds on the capacity: DF, PDF and CF. These three bounds are evaluated based on the results in [2] for the general relay channel where the relay has infinite memory and unlimited processing capability.
Proposition 1 (cutset bound). For the relay channel in Figure 2, the capacity is upper bounded by
$$R_{UB} = \min\left\{1 + H_b(\epsilon_1 * \epsilon_r) - H_b(\epsilon_1) - H_b(\epsilon_r),\; 2 - H_b(\epsilon_1) - H_b(\epsilon_2)\right\} \qquad (7)$$
Proof. See Appendix A and also [14].  ☐
Proposition 2 (DF lower bound). For the relay channel in Figure 2, the capacity is lower bounded by
$$R_{DF} = \max\left\{1 - H_b(\epsilon_1),\; \min\left\{1 - H_b(\epsilon_r),\; 2 - H_b(\epsilon_1) - H_b(\epsilon_2)\right\}\right\} \qquad (8)$$
where the bound is achieved using the block Markov encoding scheme in [2] (known as DF relaying) or direct transmission only (i.e., with the relay turned off).
Proof. See Appendix B. ☐
Note that in Equation (8) we take the maximum of the rates achieved using conventional DF [2] and direct transmission over the source–destination link only. In some cases it is possible to improve on DF by using PDF at the relay; that is, the relay decodes only a part of the transmitted message. The achievable rate of PDF is given by
$$R_{PDF} = \max_{p(u, x, x_r)} \min\left\{I(X, X_r; Y_1, Y_2),\; I(U; Y_r | X_r) + I(X; Y_1, Y_2 | X_r, U)\right\} \qquad (9)$$
where U denotes the part of the transmitted message that the relay decodes. (See Theorem 7 in [2] and also [15].)
Proposition 3 (PDF lower bound). For the relay channel in Figure 2, generalized block Markov encoding attains the same rate as the modified DF given in Equation (8).
Proof. See Appendix C.  ☐
Corollary 1. DF is an optimal relaying strategy if $H_b(\epsilon_1) + H_b(\epsilon_2) - H_b(\epsilon_r) \ge 1$.
Proof. The proof follows from Propositions 2 and 1.  ☐
Proposition 4 (CF lower bound). For the relay channel in Figure 2, the capacity is lower bounded by
$$R_{CF} = \begin{cases} 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_r * \epsilon_q) - H_b(\epsilon_1), & \text{if } 1 < H_b(\epsilon_2) + H_b(\epsilon_1 * \epsilon_r) \\ 1 + H_b(\epsilon_1 * \epsilon_r) - H_b(\epsilon_r) - H_b(\epsilon_1), & \text{if } 1 \ge H_b(\epsilon_2) + H_b(\epsilon_1 * \epsilon_r) \end{cases} \qquad (10)$$
where ϵ q satisfies
$$H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_q) + H_b(\epsilon_2) = 1 \qquad (11)$$
Proof. See Appendix D.  ☐
The bound in Equation (10) is achieved using the side-information encoding scheme in [2], known as CF relaying.
Corollary 2. CF is an optimal relaying strategy if $H_b(\epsilon_1 * \epsilon_r) + H_b(\epsilon_2) \le 1$.
Proof. The proof follows from Propositions 4 and 1.  ☐
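For concreteness, the bounds above can be evaluated numerically. The following sketch (our own illustration, reusing the Hb and star helpers from the Notation section) computes $R_{UB}$, $R_{DF}$ and $R_{CF}$; the parameter $\epsilon_q$ in Equation (11) is found by bisection, which is justified because the left-hand side of (11) can be checked to be decreasing in $\epsilon_q$ on $[0, 1/2]$.

```python
def R_UB(er, e1, e2):
    """Cut-set upper bound, Equation (7)."""
    return min(1 + Hb(star(e1, er)) - Hb(e1) - Hb(er), 2 - Hb(e1) - Hb(e2))

def R_DF(er, e1, e2):
    """DF lower bound, Equation (8): best of block-Markov DF and direct transmission."""
    return max(1 - Hb(e1), min(1 - Hb(er), 2 - Hb(e1) - Hb(e2)))

def R_CF(er, e1, e2, tol=1e-12):
    """CF lower bound, Equation (10); eps_q solves Equation (11) by bisection."""
    if 1 >= Hb(e2) + Hb(star(e1, er)):
        return 1 + Hb(star(e1, er)) - Hb(er) - Hb(e1)
    lhs = lambda eq: Hb(star(star(e1, er), eq)) - Hb(eq) + Hb(e2)  # decreasing in eq
    lo, hi = 0.0, 0.5
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if lhs(mid) < 1:
            hi = mid   # root lies to the left
        else:
            lo = mid   # root lies to the right
    eq = 0.5 * (lo + hi)
    return 1 + Hb(star(star(e1, er), eq)) - Hb(star(er, eq)) - Hb(e1)

print(R_UB(0.05, 0.05, 0.05), R_DF(0.05, 0.05, 0.05), R_CF(0.05, 0.05, 0.05))
```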
Figure 4. Optimal regions for $\epsilon_2 = 0.05$.
Figure 4 and Figure 5 show the optimal regions in the $(\epsilon_r, \epsilon_1)$ plane $[0, 1] \times [0, 1]$ for $\epsilon_2$ set to 0.05 and 0.2, respectively. One can see that for some parameters $\epsilon_r$ and $\epsilon_1$, CF or DF is optimal.
From Figure 4 and Figure 5, we see that DF is optimal when $\epsilon_r$ is small and $\epsilon_1$ is close to 0.5. When $\epsilon_1 = 0.5$, the BSRC simplifies to a two-hop channel, for which DF is capacity-achieving. However, DF is also optimal whenever the source–relay link is relatively strong compared with the direct link.
From Figure 4 and Figure 5, we similarly see that CF is optimal at the corner of the plane, i.e., when $\epsilon_1$ and $\epsilon_r$ are small. As $\epsilon_2$ increases, the area of the region in which CF is optimal shrinks. This is because a small value of $\epsilon_2$ allows the relay to use a higher rate to describe the received signal to the destination; a small value of $\epsilon_1$ means that the destination has better side information with which to decode the reproduction sequence generated at the relay; and a small value of $\epsilon_r$ means that the relay receives a less noisy sequence on average, which makes compression of the “true” signal easier.
Figure 5. Optimal regions for $\epsilon_2 = 0.2$.

4. Capacity Bounds for the Orthogonal BSRC: Finite Memory Relay Case

We next consider the case where the relay has a finite memory length. For the Gaussian relay channel, optimized memoryless relaying is investigated in [16,17,18,19]. Here we consider higher-dimensional mappings for the BSRC.
If the relay has a storage memory of k bits, it can process the k − 1 previously received symbols together with the presently received symbol to generate k new symbols, using k possibly different k-dimensional functions. This results in a low-complexity relaying protocol suitable for delay-sensitive or inexpensive applications. In the following, we denote the relay functions by
$$f_i : \mathrm{GF}_b^k \to \mathrm{GF}_b, \qquad x_{r,i} = f_i(y_{r,1}, \ldots, y_{r,k}) \qquad (12)$$
for $i \in \{1, 2, \ldots, k\}$. Note that we here consider the whole sub-block of k received symbols to generate the k new symbols to be transmitted to the destination. This differs from the classical definition with strictly causal relaying. We are allowed to do this, without any particular condition on signal reception at the relay, since the relay has an orthogonal link to the destination.

4.1. Achievable Rate

For a given set of relay functions $\{f_i\}_{i=1}^k$, the channel is parameterized by the pmf $p(y|x)$, defining $y \triangleq (y_{1,1}^k, y_{2,1}^k)$ and $x \triangleq x_1^k$, where
$$\begin{aligned}
p(y|x) := p(y_{1,1}^k, y_{2,1}^k | x_1^k) &= \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} p(y_{1,1}^k, y_{2,1}^k | y_{r,1}^k, x_1^k)\, p(y_{r,1}^k | x_1^k) \\
&= \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} p(y_{1,1}^k | x_1^k)\, p(y_{2,1}^k | y_{r,1}^k)\, p(y_{r,1}^k | x_1^k) \\
&= \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} \prod_{i=1}^k p(y_{1,i} | x_i)\, p(y_{r,i} | x_i)\, p(y_{2,i} | f_i(y_{r,1}^k)) \\
&= \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} \prod_{i=1}^k \left[(1 - \epsilon_1)\delta_b(y_{1,i} \oplus x_i) + \epsilon_1 \delta_b(y_{1,i} \oplus x_i \oplus 1)\right] \\
&\qquad\qquad \times \left[(1 - \epsilon_r)\delta_b(y_{r,i} \oplus x_i) + \epsilon_r \delta_b(y_{r,i} \oplus x_i \oplus 1)\right] \\
&\qquad\qquad \times \left[(1 - \epsilon_2)\delta_b(y_{2,i} \oplus f_i(y_{r,1}^k)) + \epsilon_2 \delta_b(y_{2,i} \oplus f_i(y_{r,1}^k) \oplus 1)\right] \qquad (13)
\end{aligned}$$
One can now apply the standard random coding argument for the equivalent discrete memoryless point-to-point communication link with input x and output y, whose relation is governed by the pmf $p(y|x)$, as follows. Generate $2^{nkC_k}$ i.i.d. codewords, each of length $kn$, where every block of k consecutive symbols in each codeword is distributed according to $p(x_1^k)$ (i.e., $p(x_1^{kn}) = \prod_{i=1}^n p(x_1^k)$). Thus, the rate achievable using the finite-memory relay is given by
$$C_k = \sup_{\{f_i\}_{i=1}^k,\; p(x_1^k)} \frac{1}{k} I(X_1^k; Y_{1,1}^k, Y_{2,1}^k) \qquad (14)$$
where the supremum is taken over the set of Boolean functions $\{f_i\}_{i=1}^k$ and the joint pmf $p(x_1^k)$ of k symbols at the source. Since the channel is used k times, the mutual information in Equation (14) is divided by k (see also [1,4]).
Achievable Rate for k = 1: The simplest case is the memoryless relay, in which the relay simply transmits the received noisy bit to the destination without any further processing; that is, $x_{r,i} = y_{r,i}$ for $1 \le i \le n$. For this relay function, the optimal input distribution is $X \sim \mathrm{Ber}(1/2)$.
Proposition 5. For the relay channel in Figure 2, the rate
$$C_1 = 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_2) - H_b(\epsilon_r * \epsilon_2) - H_b(\epsilon_1) \qquad (15)$$
is achievable.
Proof. See Appendix E.  ☐
We note that the rate $C_1$ is also derived in [14] via a suboptimal evaluation of the CF lower bound. Here, however, we arrive at this rate using a one-dimensional mapping, without any need for a compression codebook at the relay.
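As a sanity check (our own illustration, not part of the original analysis), Equation (15) can be verified by computing $I(X; Y_1, Y_2)$ directly from the joint distribution of the k = 1 channel with the identity relay mapping; this reuses the Hb and star helpers defined earlier.

```python
import numpy as np
from itertools import product

def C1_direct(er, e1, e2):
    """I(X; Y1, Y2) for X ~ Ber(1/2) with the identity relay mapping x_r = y_r."""
    # Joint p(x, y1, y2): Y1 = X + Z1 and Y2 = X + Zr + Z2, all additions mod 2.
    p = {}
    for x, z1, zr, z2 in product((0, 1), repeat=4):
        pr = 0.5 * (e1 if z1 else 1 - e1) * (er if zr else 1 - er) * (e2 if z2 else 1 - e2)
        key = (x, x ^ z1, x ^ zr ^ z2)
        p[key] = p.get(key, 0.0) + pr
    py = {}
    for (x, y1, y2), pr in p.items():
        py[(y1, y2)] = py.get((y1, y2), 0.0) + pr
    return sum(pr * np.log2(pr / (0.5 * py[(y1, y2)]))
               for (x, y1, y2), pr in p.items() if pr > 0)

er, e1, e2 = 0.05, 0.1, 0.02
print(C1_direct(er, e1, e2))                                       # direct computation
print(1 + Hb(star(star(e1, er), e2)) - Hb(star(er, e2)) - Hb(e1))  # Equation (15)
```

The two printed values should agree, as Proposition 5 predicts.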
Computation of $C_k$ for $k \ge 2$ is a cumbersome task. Moreover, there is no unique set of relay functions that is optimal for all channel parameters $\epsilon_r, \epsilon_1, \epsilon_2$. To see this, consider the case with $\epsilon_2 = 0$. For this case, the simple strategy with k = 1 used in Proposition 5 is optimal. However, this relay function is not necessarily optimal for cases with $\epsilon_2 \neq 0$, since one can potentially provide error protection on the relay–destination link by utilizing functions with higher dimensions.

4.2. Mapping Optimization for an Arbitrary k

In the following, we confine the pmf at the source to $p(x_1^k) = \prod_{i=1}^k p(x)$ with $p(x) = \frac{1}{2}\delta_b(x) + \frac{1}{2}\delta_b(x \oplus 1)$, i.e., $X \sim \mathrm{Ber}(1/2)$.
Lemma 1. The mutual information given in Equation (14) can be written as
$$\frac{1}{k} I(X_1^k; Y_{1,1}^k, Y_{2,1}^k) = 1 - H_b(\epsilon_1) - \frac{1}{k} E\left[\log p(y_{2,1}^k | y_{1,1}^k)\right] + \frac{1}{k} E\left[\log p(y_{2,1}^k | x_1^k)\right] \qquad (16)$$
where
$$p(y_{2,1}^k | x_1^k) = (1 - \epsilon_2)^k (1 - \epsilon_r)^k \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} \left(\frac{\epsilon_2}{1 - \epsilon_2}\right)^{D_H(y_{2,1}^k,\, f(y_{r,1}^k))} \left(\frac{\epsilon_r}{1 - \epsilon_r}\right)^{D_H(y_{r,1}^k,\, x_1^k)} \qquad (17)$$
and
$$p(y_{2,1}^k | y_{1,1}^k) = (1 - \epsilon_1)^k (1 - \epsilon_2)^k (1 - \epsilon_r)^k \sum_{y_{r,1}^k \in \mathrm{GF}_b^k} \left(\frac{\epsilon_2}{1 - \epsilon_2}\right)^{D_H(y_{2,1}^k,\, f(y_{r,1}^k))} \sum_{x_1^k \in \mathrm{GF}_b^k} \left(\frac{\epsilon_r}{1 - \epsilon_r}\right)^{D_H(y_{r,1}^k,\, x_1^k)} \left(\frac{\epsilon_1}{1 - \epsilon_1}\right)^{D_H(y_{1,1}^k,\, x_1^k)} \qquad (18)$$
Proof. See Appendix F. ☐
To compute the rate in Equation (14), one needs to select the best functions among $2^{k 2^k}$ possible choices, which has exponential complexity. (For k = 4, there are approximately $1.8 \times 10^{19}$ possible functions.) In order to cope with this complexity, we implement an efficient hill-climbing search algorithm as follows. For a given k, we first initialize the relay functions with a random mapping and compute the rate using Lemma 1. We then randomly select one function and one corresponding dimension, flip the mapping at that point, and recompute the rate. If the new mapping provides a higher rate, we accept the change; otherwise we revert it. This process is repeated until the mapping converges. Since the algorithm by construction may terminate in a local optimum, we repeat the whole algorithm with different initializations and pick the mapping that attains the highest rate.
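A compact sketch of this search is given below (our own illustration; the mutual information is evaluated by direct enumeration of the joint distribution, which is equivalent to using Lemma 1 for these small values of k, and the helper names are ours).

```python
import random
import numpy as np
from itertools import product

def rate(F, k, er, e1, e2):
    """(1/k) I(X_1^k; Y_1^k, Y_2^k) for a relay truth table F (k rows of 2^k bits)."""
    p = {}  # joint p(x, y1, y2) with X_1^k uniform i.i.d.
    for x in product((0, 1), repeat=k):
        for yr in product((0, 1), repeat=k):
            p_yr = 1.0
            for xi, yi in zip(x, yr):
                p_yr *= er if xi != yi else 1 - er
            j = int(''.join(map(str, yr)), 2)          # input configuration index
            xr = [F[i][j] for i in range(k)]           # relay output block
            for y1 in product((0, 1), repeat=k):
                p_y1 = 1.0
                for xi, yi in zip(x, y1):
                    p_y1 *= e1 if xi != yi else 1 - e1
                for y2 in product((0, 1), repeat=k):
                    p_y2 = 1.0
                    for xi, yi in zip(xr, y2):
                        p_y2 *= e2 if xi != yi else 1 - e2
                    key = (x, y1, y2)
                    p[key] = p.get(key, 0.0) + (0.5 ** k) * p_yr * p_y1 * p_y2
    py = {}
    for (x, y1, y2), pr in p.items():
        py[(y1, y2)] = py.get((y1, y2), 0.0) + pr
    mi = sum(pr * np.log2(pr / ((0.5 ** k) * py[(y1, y2)]))
             for (x, y1, y2), pr in p.items() if pr > 0)
    return mi / k

def optimize(k, er, e1, e2, iters=2000):
    """Bit-flip hill climbing over the k relay truth tables (one random start)."""
    F = [[random.randint(0, 1) for _ in range(2 ** k)] for _ in range(k)]
    best = rate(F, k, er, e1, e2)
    for _ in range(iters):
        i, j = random.randrange(k), random.randrange(2 ** k)
        F[i][j] ^= 1                                   # flip one truth-table entry
        r = rate(F, k, er, e1, e2)
        if r > best:
            best = r                                   # keep the improving flip
        else:
            F[i][j] ^= 1                               # revert the flip
    return best, F

print(optimize(2, 0.01, 0.01, 0.01, iters=300)[0])
```

In practice one reruns `optimize` from several random initializations and keeps the best mapping, as described above.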
One example of an optimized mapping for k = 4 when $\epsilon_1 = \epsilon_2 = \epsilon_r = 0.01$ is
$$F = \begin{bmatrix}
0 & 0 & 1 & 1 & 1 & 1 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 0 & 1 & 1 \\
1 & 0 & 0 & 1 & 0 & 1 & 1 & 0 & 1 & 0 & 0 & 1 & 0 & 1 & 1 & 0 \\
1 & 0 & 0 & 1 & 1 & 0 & 0 & 1 & 0 & 1 & 1 & 0 & 0 & 1 & 1 & 0 \\
1 & 0 & 1 & 0 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 1 & 0 & 1 & 0
\end{bmatrix}$$
Here $F_{ij}$ denotes the output of the relay along the i-th dimension for the j-th input configuration ($1 \le i \le 4$ and $1 \le j \le 2^4$). For a given combination of received bits, the relay computes the decimal representation of the bits and transmits the bits in the column indexed by that decimal representation plus one. For example, for the received string 0000, the relay transmits the bits in the first column, i.e., 0111. By studying the matrix F, we can gain some insight into the underlying structure of the mapping. Finding an efficient structure at the relay simplifies the design and implementation of relaying. We employ the Fourier transform to accomplish this task. Our use of the binary Fourier (or Hadamard) transform is related to how it was used in, e.g., [20,21], to analyze the performance of quantizers over noisy channels.

4.3. Fourier Spectrum of the Optimized Mappings

In order to define the Fourier transform, we need an orthonormal basis [22]. Consider the following set of functions
$$\chi_S : \{-1, +1\}^k \to \{-1, +1\}, \qquad \chi_S(x) = \prod_{i \in S} x_i \qquad (19)$$
where $S \subseteq \{1, 2, \ldots, k\}$. Then any function $f : \{0, 1\}^k \to \{0, 1\}$ can be uniquely represented as (we use the one-to-one mapping $0 \leftrightarrow +1$ and $1 \leftrightarrow -1$)
$$f = \sum_{S \subseteq \{1, 2, \ldots, k\}} \hat{f}(S)\, \chi_S \qquad (20)$$
where $\hat{f}(S)$ is the Fourier coefficient of f, given by
$$\hat{f}(S) = \langle f, \chi_S \rangle = \frac{1}{2^k} \sum_{x \in \{+1, -1\}^k} f(x)\, \chi_S(x) = E\left[f \cdot \chi_S\right] \qquad (21)$$
The expectation in Equation (21) is taken uniformly over $x \in \{+1, -1\}^k$. Note that the Fourier expansion of f can potentially have up to $2^k$ terms. As an example, consider the following randomly chosen function.
x3 | x2 | x1 | f
+1 | +1 | +1 | +1
+1 | +1 | −1 | +1
+1 | −1 | +1 | −1
+1 | −1 | −1 | −1
−1 | +1 | +1 | −1
−1 | +1 | −1 | +1
−1 | −1 | +1 | +1
−1 | −1 | −1 | +1
This function can be expanded as
$$f(x_1, x_2, x_3) = \frac{1}{4} - \frac{1}{4} x_1 + \frac{1}{4} x_2 - \frac{1}{4} x_3 - \frac{1}{4} x_1 x_2 + \frac{1}{4} x_1 x_3 + \frac{3}{4} x_2 x_3 + \frac{1}{4} x_1 x_2 x_3 \qquad (22)$$
That is, the Fourier spectrum has eight terms and the function is not linear. In general, we would like a sparse Fourier spectrum for efficient implementation. (Sparsity of the mappings allows one to realize them with far fewer multiplications and additions. This can also be compared to codes with low-density generator/parity matrices, which in general allow simpler encoding and decoding.)
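The coefficients in Equation (22) can be reproduced in a few lines (our own illustration; variable names are ours):

```python
from itertools import product

# Truth table of the example function f, keyed by (x1, x2, x3) in {+1, -1}^3.
f = {(+1, +1, +1): +1, (-1, +1, +1): +1, (+1, -1, +1): -1, (-1, -1, +1): -1,
     (+1, +1, -1): -1, (-1, +1, -1): +1, (+1, -1, -1): +1, (-1, -1, -1): +1}

# Fourier coefficient f_hat(S) = E[f(x) * prod_{i in S} x_i], x uniform on {+1,-1}^3.
for S in product((0, 1), repeat=3):        # S as an indicator vector over {1, 2, 3}
    coeff = sum(fx * (x[0] ** S[0]) * (x[1] ** S[1]) * (x[2] ** S[2])
                for x, fx in f.items()) / 8
    print(S, coeff)
```

Running this prints 1/4 for the empty set, −1/4 for {x1}, 3/4 for {x2, x3}, and so on, matching Equation (22).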
Table 1 presents the Fourier expansions of the optimized relay functions for k = 2, 3, 4, 6 when $\epsilon_1 = \epsilon_2 = \epsilon_r = 0.01$. Interestingly, we see that the Fourier expansion of the optimized functions is indeed sparse. Using the results in Table 1, we can rewrite the functions in the following form
$$f : \{0, 1\}^k \to \{0, 1\}^k, \qquad x_r = A_k y_r + b_k \qquad (23)$$
where $x_r = [x_{r,1}, \ldots, x_{r,k}]^T$, $y_r = [y_{r,1}, \ldots, y_{r,k}]^T$, $A_k \in \{0, 1\}^{k \times k}$ and $b_k \in \{0, 1\}^{k \times 1}$, with arithmetic over $\mathrm{GF}(2)$. For example, for k = 6 we have
$$A_6 = \begin{bmatrix}
0 & 1 & 1 & 0 & 1 & 0 \\
1 & 0 & 1 & 1 & 0 & 1 \\
0 & 1 & 0 & 1 & 1 & 0 \\
1 & 1 & 1 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 1 & 1 \\
0 & 1 & 1 & 1 & 0 & 1
\end{bmatrix}, \qquad b_6 = \begin{bmatrix} 0 \\ 1 \\ 1 \\ 1 \\ 0 \\ 1 \end{bmatrix} \qquad (24)$$
Note that the mapping given in Equation (23) is not linear over the binary field when $b_k \neq 0$. However, the linear mapping $x_r = A_k y_r$ gives the same performance as $x_r = A_k y_r + b_k$; in other words, the bias term $b_k$ does not improve the rate. This essentially follows from the data processing inequality. Therefore, the underlying relay functions define a linear code of rate one on the noisy bits received at the relay. Additionally, since the code used at the relay performs joint source–channel coding, it should be good for both source and channel coding.
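In code, applying such a mapping amounts to a single matrix–vector product over $\mathrm{GF}(2)$; the sketch below (ours) uses $A_6$ and $b_6$ from Equation (24).

```python
import numpy as np

A6 = np.array([[0, 1, 1, 0, 1, 0],
               [1, 0, 1, 1, 0, 1],
               [0, 1, 0, 1, 1, 0],
               [1, 1, 1, 1, 0, 0],
               [1, 0, 0, 0, 1, 1],
               [0, 1, 1, 1, 0, 1]])
b6 = np.array([0, 1, 1, 1, 0, 1])

def relay_map(yr, A=A6, b=b6):
    """x_r = A yr + b over GF(2); dropping b leaves the achievable rate unchanged."""
    return (A @ yr + b) % 2

print(relay_map(np.array([1, 0, 1, 1, 0, 0])))
```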
Table 1. Fourier expansion of the optimized relay functions for the orthogonal BSRC with $\epsilon_1 = \epsilon_2 = \epsilon_r = 0.01$.
k = 2: $f_1(y_{r,1}^k) = y_{r,1}$; $f_2(y_{r,1}^k) = y_{r,2} y_{r,1}$
k = 3: $f_1(y_{r,1}^k) = y_{r,3} y_{r,2} y_{r,1}$; $f_2(y_{r,1}^k) = -y_{r,3} y_{r,1}$; $f_3(y_{r,1}^k) = y_{r,3} y_{r,2}$
k = 4: $f_1(y_{r,1}^k) = -y_{r,4} y_{r,3} y_{r,1}$; $f_2(y_{r,1}^k) = -y_{r,4} y_{r,2} y_{r,1}$; $f_3(y_{r,1}^k) = -y_{r,3} y_{r,2} y_{r,1}$; $f_4(y_{r,1}^k) = y_{r,4} y_{r,3} y_{r,2}$
k = 6: $f_1(y_{r,1}^k) = y_{r,5} y_{r,3} y_{r,2}$; $f_2(y_{r,1}^k) = -y_{r,6} y_{r,4} y_{r,3} y_{r,1}$; $f_3(y_{r,1}^k) = -y_{r,5} y_{r,4} y_{r,2}$; $f_4(y_{r,1}^k) = -y_{r,4} y_{r,3} y_{r,2} y_{r,1}$; $f_5(y_{r,1}^k) = y_{r,6} y_{r,5} y_{r,1}$; $f_6(y_{r,1}^k) = -y_{r,6} y_{r,4} y_{r,3} y_{r,2}$

4.4. Effect of Channel Parameters on the Structure of the Optimized Mappings

In this section, we investigate the structure of the optimized mappings for different channel parameters. Our numerical search indicates that the linear mapping $x_r = A_k y_r$ is an efficient strategy among all classes of mappings for low-dimensional relaying. That is, the relay employs the binary matrix $A_k$ to generate the relay outputs from the k received bits.
Table 2 shows the optimized generator matrices $A_6$ for various values of $\epsilon_r$ when k = 6 and $\epsilon_1 = \epsilon_2 = 0.05$. In particular, for $\epsilon_r = 0.25$ the optimized generator matrix is the identity matrix, i.e., $A_6 = I_6$. For this case, the relay is better off transmitting the received noisy bits without any further processing. However, as $\epsilon_r$ decreases, the relay starts combining the received bits before transmitting. The number of ones in a row of the generator matrix indicates the number of inputs that the relay combines. The density of ones in the generator matrices, $\rho := \frac{\#\text{ of ones}}{k^2}$, is also shown in Table 2. We see that as $\epsilon_r$ decreases, $\rho$ increases; that is, the relay starts to transmit combinations of more bits in one single channel use. This occurs for two main reasons: first, when $\epsilon_r$ decreases, the relay receives less noisy bits on average; second, the destination has some partial knowledge of the individual bits via the received signal from the source.
Table 3 shows the optimized generator matrices for various values of $\epsilon_1$ when k = 6, $\epsilon_r = 0.01$ and $\epsilon_2 = 0.1$. We similarly see that as $\epsilon_1$ decreases, $\rho$ increases. This is due to the fact that when $\epsilon_1$ decreases, the destination receives better descriptions of the transmitted bits via the source–destination link. The relay then forwards combinations of several incoming bits, since the destination has access to more reliable side information.
Table 2. Optimized generator matrices as a function of $\epsilon_r$ for k = 6 and $\epsilon_1 = \epsilon_2 = 0.05$ (each $A_6$ is listed row by row; $\rho$ denotes the density of ones).
$\epsilon_r = 0.25$: $A_6 = I_6$; $\rho = 0.1667$
$\epsilon_r = 0.1$: rows 001000, 010000, 000001, 000110, 100010, 111111; $\rho = 0.3611$
$\epsilon_r = 0.05$: rows 100010, 111100, 101000, 100111, 010010, 000011; $\rho = 0.4444$
$\epsilon_r = 0.01$: rows 110001, 010110, 001110, 101100, 101011, 011011; $\rho = 0.5556$
$\epsilon_r = 0.001$: rows 111001, 011100, 011011, 110110, 100011, 101110; $\rho = 0.6111$
$\epsilon_r = 0.0001$: rows 100101, 010101, 111001, 111110, 110011, 001111; $\rho = 0.6389$
Table 3. Optimized generator matrices as a function of $\epsilon_1$ for k = 6, $\epsilon_r = 0.01$ and $\epsilon_2 = 0.1$ (the six matrices $A_6$ below correspond, left to right, to $\epsilon_1$ = 0.4, 0.25, 0.1, 0.05, 0.01, 0.001; entries are listed in row-major order).
A 6 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 1 1 0 0 0 1 0 0 1 0 0 0 0 1 0 0 1 1 0 0 0 0 1 0 1 0 0 1 0 0 0 0 1 1 0 1 1 0 0 1 1 0 1 0 1 0 1 0 1 1 0 1 0 1 0 0 1 1 1 1 0 1 0 1 0 1 0 1 1 0 0 1 1 1 0 1 0 1 0 1 1 0 0 1 0 1 0 1 1 0 1 1 1 1 1 1 0 0 1 1 0 1 1 0 1 0 1 1 0 1 0 1 1 1 1 0 1 0 1 0 1 0 1 1 1 1 0 1 1 1 0 1 1 1 0 1 0 0 1 1 1 0 1 1 1 1 1 0 0 0 1 1 1 1 1 1 1 1 0 1 0 1 0 1 1 1 1 1 1 1 0 0 1 1 1 0 1 1 1
$\rho$ = 0.1667, 0.3333, 0.5556, 0.6389, 0.6667, 0.7500 (for $\epsilon_1$ = 0.4, 0.25, 0.1, 0.05, 0.01, 0.001, respectively)
Table 4. Optimized generator matrices as a function of $\epsilon_2$ for k = 6, $\epsilon_r = 0.01$ and $\epsilon_1 = 0.1$ (each $A_6$ is listed row by row; $\rho$ denotes the density of ones).
$\epsilon_2 = 0.4$: rows 011111, 111101, 101111, 111011, 111110, 110111; $\rho = 0.8333$
$\epsilon_2 = 0.25$: rows 110110, 011011, 111100, 110101, 101011, 001111; $\rho = 0.6667$
$\epsilon_2 = 0.05$: rows 011111, 101101, 010101, 111011, 010011, 100011; $\rho = 0.6389$
$\epsilon_2 = 0.01$: rows 011010, 010110, 000111, 111110, 101011, 100101; $\rho = 0.5833$
$\epsilon_2 = 0.001$: rows 001001, 010101, 110001, 011010, 100000, 010110; $\rho = 0.4167$
$\epsilon_2 = 0.0001$: $A_6 = I_6$; $\rho = 0.1667$
Table 4 shows the optimized generator matrices for various values of $\epsilon_2$ when k = 6, $\epsilon_r = 0.01$ and $\epsilon_1 = 0.1$. We can see that as $\epsilon_2$ decreases, the density of ones $\rho$ decreases as well. For $\epsilon_2 = 0.0001$, we have $A_6 = I_6$. This is because for small values of $\epsilon_2$ the relay can reliably transmit the received bits to the destination. (Note that this strategy is optimal if $\epsilon_2 = 0$.) For high values of $\epsilon_2$, however, the relay combines several bits prior to transmission, which helps the destination combat the noise on the relay–destination link.

5. Numerical Examples

Figure 6 shows the capacity results for the orthogonal BSRC of Figure 2 as a function of $\epsilon_r$ when $\epsilon_1 = \epsilon_2 = 0.01$. In this figure, we plot the cut-set upper bound (UB) (Equation (7)), the rates achieved using decode-and-forward (DF) (Equation (8)) and compress-and-forward (CF) (Equation (10)), and the rates of the optimized finite-memory relay (Equation (14)) for different memory sizes. The relay functions are optimized for the channel parameters $\epsilon_1 = \epsilon_2 = \epsilon_r = 0.01$ and are given in Table 1.
From Figure 6, we see that the achievable rate of DF decreases as $\epsilon_r$ increases up to 0.01. For $\epsilon_r \ge 0.01$, the achievable rate of DF coincides with that of direct transmission (i.e., with the relay off). The rates achieved by CF, on the other hand, coincide with the upper bound for the chosen channel parameters, since the condition in Corollary 2 is satisfied. More interestingly, optimized low-dimensional relaying with k = 6 achieves rates close to those achieved by CF and operates close to the capacity.
Figure 6. Capacity results for the binary symmetric relay channel as a function of $\epsilon_r$ when $\epsilon_1 = \epsilon_2 = 0.01$.

6. Summary and Concluding Remarks

We introduced a binary symmetric relay channel with orthogonal components at the destination, and investigated three main relaying strategies: decode-and-forward (DF) relaying, compress-and-forward (CF) relaying, and optimized low-dimensional relaying. We used a bit-switching numerical algorithm to find optimized mappings. We initialized the algorithm with arbitrary random nonlinear mappings, and by analyzing the optimized mappings via the Fourier transform, we observed that all the optimized mappings we found were linear. We also illustrated that one can obtain rates very close to the upper bound by using the proposed optimized low-dimensional relaying scheme. It is worth noting that DF and CF require codebooks with infinite-blocklength codewords at the relay; this stands in sharp contrast to the proposed low-dimensional relaying scheme. Additionally, the suggested relaying protocol has low processing delay and paves the way for the implementation of inexpensive relaying protocols. We finally note that the question of the sufficiency of linear mappings for optimal relaying remains open.

Acknowledgements

The authors wish to acknowledge Abbas El Gamal for helpful input, and the Swedish Research Council for funding the work in part.

Claims

The material in this paper was presented in part at the IEEE ITW, Cairo, Egypt, 6–8 January 2010 [23].

Appendix

A. Proof of Proposition 1

In order to proceed, we first recite a lemma given in [13] (we thank one of the anonymous reviewers for bringing this result to our attention) that we occasionally use in the sequel.
Lemma 2. Consider a binary symmetric channel with input X and outputs Y 1 and Y 2 , where
$$Y_1 = X \oplus Z_1, \qquad Y_2 = X \oplus Z_2$$
The random variable $Z_1 \sim \mathrm{Ber}(\epsilon_1)$ is independent of $Z_2 \sim \mathrm{Ber}(\epsilon_2)$. The capacity of this channel is given by
$$C = 1 + H_b(\epsilon_1 * \epsilon_2) - H_b(\epsilon_1) - H_b(\epsilon_2)$$
Proof. A proof of this lemma can be found in [24], but for completeness we present a slightly different proof in the following. The channel is a standard 1 × 2 SIMO (single-input multiple-output) link and its capacity is given by
$$C = \max_{p(x)} I(X; Y_1, Y_2)$$
Next consider
$$\begin{aligned}
C &= H(Y_1, Y_2) - H(Y_1, Y_2 | X) = H(Y_1, Y_2) - H(Z_1, Z_2 | X) \\
&= H(Y_1, Y_2) - H(Z_1) - H(Z_2) \\
&= H(Y_1) + H(Y_2 | Y_1) - H_b(\epsilon_1) - H_b(\epsilon_2) \\
&= H(Y_1) + H(Y_2 \oplus Y_1 | Y_1) - H_b(\epsilon_1) - H_b(\epsilon_2) \\
&\le H(Y_1) + H(Y_2 \oplus Y_1) - H_b(\epsilon_1) - H_b(\epsilon_2) \\
&= H(Y_1) + H(Z_2 \oplus Z_1) - H_b(\epsilon_1) - H_b(\epsilon_2) \\
&\le 1 + H_b(\epsilon_1 * \epsilon_2) - H_b(\epsilon_1) - H_b(\epsilon_2)
\end{aligned}$$
We finally note that the upper bound can be achieved by choosing $X \sim \mathrm{Ber}(0.5)$.  ☐
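(Our own illustration.) The capacity expression of Lemma 2 can be checked numerically against a direct computation of $I(X; Y_1, Y_2)$ for $X \sim \mathrm{Ber}(0.5)$, again reusing the Hb and star helpers from the Notation section:

```python
import numpy as np
from itertools import product

def simo_mi(e1, e2):
    """I(X; Y1, Y2) with X ~ Ber(1/2), Y_i = X + Z_i mod 2, Z_i ~ Ber(e_i)."""
    p = {}
    for x, z1, z2 in product((0, 1), repeat=3):
        pr = 0.5 * (e1 if z1 else 1 - e1) * (e2 if z2 else 1 - e2)
        key = (x, x ^ z1, x ^ z2)
        p[key] = p.get(key, 0.0) + pr
    py = {}
    for (x, y1, y2), pr in p.items():
        py[(y1, y2)] = py.get((y1, y2), 0.0) + pr
    return sum(pr * np.log2(pr / (0.5 * py[(y1, y2)]))
               for (x, y1, y2), pr in p.items() if pr > 0)

e1, e2 = 0.1, 0.2
print(simo_mi(e1, e2), 1 + Hb(star(e1, e2)) - Hb(e1) - Hb(e2))  # the two values agree
```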
Proof of Proposition 1: Using the cut-set bound [2], we have
$$C \le \max_{p(x, x_r)} \min\left\{I(X, X_r; Y_1, Y_2),\; I(X; Y_1, Y_2, Y_r | X_r)\right\}$$
Now we bound each term.
$$\begin{aligned}
I(X, X_r; Y_1, Y_2) &= H(Y_1, Y_2) - H(Y_1, Y_2 | X, X_r) = H(Y_1, Y_2) - H(Z_1, Z_2 | X, X_r) \\
&= H(Y_1, Y_2) - H(Z_1) - H(Z_2) \\
&\le H(Y_1) + H(Y_2) - H(Z_1) - H(Z_2) \\
&\le 2 - H_b(\epsilon_1) - H_b(\epsilon_2)
\end{aligned} \qquad (27)$$
Similarly, we have
$$\begin{aligned}
I(X; Y_r, Y_1, Y_2 | X_r) &= I(X; Y_1, Y_r | X_r) + I(X; Y_2 | Y_1, Y_r, X_r) \\
&= I(X; Y_1, Y_r | X_r) + H(Y_2 | Y_1, Y_r, X_r) - H(Y_2 | Y_1, Y_r, X_r, X) \\
&= I(X; Y_1, Y_r | X_r) + H(Z_2 | Y_1, Y_r, X_r) - H(Z_2 | Y_1, Y_r, X_r, X) \\
&= I(X; Y_1, Y_r | X_r) \\
&= H(Y_1, Y_r | X_r) - H(Y_1, Y_r | X, X_r) = H(Y_1, Y_r | X_r) - H(Z_1, Z_r | X, X_r) \\
&= H(Y_1, Y_r | X_r) - H(Z_1, Z_r) \\
&\le H(Y_1, Y_r) - H(Z_1, Z_r) = I(X; Y_1, Y_r) \\
&\le 1 + H_b(\epsilon_1 * \epsilon_r) - H_b(\epsilon_1) - H_b(\epsilon_r)
\end{aligned} \qquad (28)$$
where the last inequality follows from Lemma 2. Combining (27) and (28), and noting that (27) and (28) are maximized when the input distribution is chosen as
$$p(x, x_r) = p(x)\, p(x_r) = \left[0.5\, \delta_b(x) + 0.5\, \delta_b(x \oplus 1)\right] \left[0.5\, \delta_b(x_r) + 0.5\, \delta_b(x_r \oplus 1)\right]$$
yields the result.  ☐

B. Proof of Proposition 2

Using Theorem 1 in [2], the rate
$$R_{DF} = \max_{p(x, x_r)} \min\left\{I(X, X_r; Y_1, Y_2),\; I(X; Y_r | X_r)\right\}$$
is achievable. The first term is evaluated in Appendix A and is maximized when X and $X_r$ are independent and uniformly distributed. One can show that the same distribution maximizes the second term. This yields
$$R_{DF} = \min\left\{1 - H_b(\epsilon_r),\; 2 - H_b(\epsilon_1) - H_b(\epsilon_2)\right\}.$$

C. Proof of Proposition 3

Using Theorem 7 in [2] , the rate
$$R_{PDF} = \max_{p(u, x, x_r)} \min\left\{I(X, X_r; Y_1, Y_2),\; I(U; Y_r | X_r) + I(X; Y_1, Y_2 | X_r, U)\right\} \qquad (30)$$
is achievable. (See also [15].) The first term is evaluated in Appendix A and is bounded as
$$I(X, X_r; Y_1, Y_2) \le 2 - H_b(\epsilon_1) - H_b(\epsilon_2) \qquad (31)$$
Next consider
$$\begin{aligned}
I(U; Y_r | X_r) + I(X; Y_1, Y_2 | X_r, U) &= I(U; Y_r | X_r) + H(Y_1, Y_2 | X_r, U) - H(Y_1, Y_2 | X, X_r, U) \\
&= I(U; Y_r | X_r) + H(Y_1, Y_2 | X_r, U) - H(Z_1, Z_2 | X, X_r, U) \\
&= I(U; Y_r | X_r) + H(Y_1, Y_2 | X_r, U) - H(Z_1) - H(Z_2) \\
&= I(U; Y_r | X_r) + H(Y_1 | X_r, U) + H(Y_2 | X_r, U, Y_1) - H(Z_1) - H(Z_2) \\
&= I(U; Y_r | X_r) + H(Y_1 | X_r, U) + H(Z_2 | X_r, U, Y_1) - H(Z_1) - H(Z_2) \\
&= H(Y_r | X_r) - H(Y_r | X_r, U) + H(Y_1 | X_r, U) - H(Z_1) \\
&\le H(Y_r) - H(Y_r | X_r, U) + H(Y_1 | X_r, U) - H(Z_1) \\
&\le 1 - H_b(\epsilon_1) + H(Y_1 | X_r, U) - H(Y_r | X_r, U)
\end{aligned}$$
Now we bound $H(Y_1 | X_r, U) - H(Y_r | X_r, U)$. First, define $V := (X_r, U)$, where $p(V = v_i) = p_i$ and $\sum_i p_i = 1$. Further assume that $p(X = 0 | V = v_i) = \delta_i$ and $p(X = 1 | V = v_i) = 1 - \delta_i$. Next consider
$$\begin{aligned}
H(Y_1 | X_r, U) - H(Y_r | X_r, U) &= H(X \oplus Z_1 | V) - H(X \oplus Z_r | V) \\
&= \sum_i \left[H(X \oplus Z_1 | V = v_i) - H(X \oplus Z_r | V = v_i)\right] p_i \\
&\le \sum_i \max_{\delta_i} \left[H(X \oplus Z_1 | V = v_i) - H(X \oplus Z_r | V = v_i)\right] p_i \\
&= \max_{\delta_i} \left[H(X \oplus Z_1 | V = v_i) - H(X \oplus Z_r | V = v_i)\right] \sum_i p_i \\
&= \max_{\delta_i} \left[H(X \oplus Z_1 | V = v_i) - H(X \oplus Z_r | V = v_i)\right] \\
&= \max_{\delta_i} \left[H_b(\delta_i * \epsilon_1) - H_b(\delta_i * \epsilon_r)\right]
\end{aligned}$$
In the following, let $h(\delta) := H_b(\delta * \epsilon_1) - H_b(\delta * \epsilon_r)$ and assume $\max\{\epsilon_1, \epsilon_r\} \le 0.5$. We then obtain
$$\frac{\partial h}{\partial \delta} = (1 - 2\epsilon_1) \log \frac{1 - (\delta * \epsilon_1)}{\delta * \epsilon_1} - (1 - 2\epsilon_r) \log \frac{1 - (\delta * \epsilon_r)}{\delta * \epsilon_r}$$
$$\frac{\partial^2 h}{\partial \delta^2} = \frac{(1 - 2\epsilon_r)^2}{(\delta * \epsilon_r)(1 - (\delta * \epsilon_r))} - \frac{(1 - 2\epsilon_1)^2}{(\delta * \epsilon_1)(1 - (\delta * \epsilon_1))}$$
Now let $g(\epsilon) := \frac{(1 - 2\epsilon)^2}{(\delta * \epsilon)(1 - (\delta * \epsilon))}$. One can show that $\frac{\partial g}{\partial \epsilon} \le 0$ if $\epsilon \le 0.5$, and hence $g(\epsilon)$ is a non-increasing function. Thus we conclude that
$$\epsilon_r \le \epsilon_1 \;\Rightarrow\; g(\epsilon_r) \ge g(\epsilon_1) \;\Rightarrow\; \frac{\partial^2 h}{\partial \delta^2} \ge 0 \;\Rightarrow\; h(\delta) \text{ is convex}$$
$$\epsilon_r \ge \epsilon_1 \;\Rightarrow\; g(\epsilon_r) \le g(\epsilon_1) \;\Rightarrow\; \frac{\partial^2 h}{\partial \delta^2} \le 0 \;\Rightarrow\; h(\delta) \text{ is concave}$$
Therefore
$$h(\delta) \le \max\{h(0), h(1)\} \quad \text{if } \epsilon_r \le \epsilon_1, \qquad h(\delta) \le h(\delta^\star) \quad \text{if } \epsilon_r \ge \epsilon_1$$
where $\delta^\star$ is the solution of $\frac{\partial h}{\partial \delta} = 0$. Finally, we obtain the following bound:
$$H(Y_1 | X_r, U) - H(Y_r | X_r, U) \le \begin{cases} H_b(\epsilon_1) - H_b(\epsilon_r), & \text{if } \epsilon_r \le \epsilon_1 \\ 0, & \text{otherwise} \end{cases} \qquad (36)$$
Combining Equations (30), (31) and (36) proves that partial DF does not improve on DF for the BSRC.

D. Proof of Proposition 4

We use the equivalent formulation of the original CF scheme (Theorem 6 in [2]) given in [4]. CF achieves the rate
$$R_{CF} = \max_{p(q)\, p(x|q)\, p(x_r|q)\, p(\hat{y}_r | x_r, y_r, q)} \min\left\{I(X, X_r; Y_1, Y_2 | Q) - I(Y_r; \hat{Y}_r | X, X_r, Y_1, Y_2, Q),\; I(X; Y_1, Y_2, \hat{Y}_r | X_r, Q)\right\}$$
First consider
$$\begin{aligned}
I(X, X_r; Y_1, Y_2 | Q) &= I(X; Y_1, Y_2 | Q) + I(X_r; Y_1, Y_2 | X, Q) \\
&= I(X; Y_1 | Q) + I(X; Y_2 | Y_1, Q) + I(X_r; Y_2 | X, Q) + I(X_r; Y_1 | Y_2, X, Q) \\
&= I(X; Y_1 | Q) + I(X_r; Y_2 | X, Q) \\
&= I(X; Y_1 | Q) + I(X_r; Y_2 | Q) \\
&\le 2 - H_b(\epsilon_1) - H_b(\epsilon_2)
\end{aligned}$$
where the upper bound can be achieved by choosing $X \sim \mathrm{Ber}(0.5)$ and $X_r \sim \mathrm{Ber}(0.5)$.
In order to proceed we choose the following binary test channel:
$$\hat{Y}_r = Y_r \oplus Z_q$$
where $Z_q \sim \mathrm{Ber}(\epsilon_q)$ is independent of the other random variables.
This yields
$$\begin{aligned}
I(Y_r; \hat{Y}_r | X, X_r, Y_1, Y_2, Q) &= H(\hat{Y}_r | X, X_r, Y_1, Y_2, Q) - H(\hat{Y}_r | Y_r, X, X_r, Y_1, Y_2, Q) \\
&= H(Z_r \oplus Z_q) - H(Z_q) = H_b(\epsilon_r * \epsilon_q) - H_b(\epsilon_q)
\end{aligned}$$
$$\begin{aligned}
I(X; Y_1, Y_2, \hat{Y}_r | X_r, Q) &= I(X; Y_1, \hat{Y}_r | X_r, Q) + I(X; Y_2 | Y_1, \hat{Y}_r, X_r, Q) \\
&= I(X; Y_1, \hat{Y}_r | Q) \\
&\le 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_r * \epsilon_q) - H_b(\epsilon_1)
\end{aligned}$$
where the last inequality follows from Lemma 2, and equality is achieved by choosing $X \sim \mathrm{Ber}(0.5)$.
Putting everything together, the following rate is achievable:
$$R_{CF} = \max_{\epsilon_q \in [0, 1]} \min\left\{2 - H_b(\epsilon_1) - H_b(\epsilon_2) - H_b(\epsilon_r * \epsilon_q) + H_b(\epsilon_q),\; 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_r * \epsilon_q) - H_b(\epsilon_1)\right\}$$
Now define
R 1 ( ϵ q ) : = 2 - H b ( ϵ 1 ) - H b ( ϵ 2 ) - H b ( ϵ r * ϵ q ) + H b ( ϵ q ) R 2 ( ϵ q ) : = 1 + H b ( ϵ 1 * ϵ r * ϵ q ) - H b ( ϵ r * ϵ q ) - H b ( ϵ 1 )
Using the fact that $f(\epsilon) := H_b(\epsilon * \delta) - H_b(\epsilon)$ is convex for all $\delta \in [0, 1]$, we conclude that $R_1(\epsilon_q)$ is concave and $R_2(\epsilon_q)$ is convex in $\epsilon_q$. We next note that
$$\max_{\epsilon_q} R_1 = R_1(0.5) = 2 - H_b(\epsilon_1) - H_b(\epsilon_2), \qquad \min_{\epsilon_q} R_2 = R_2(0.5) = 1 - H_b(\epsilon_1)$$
Since $R_2(0.5) \le R_1(0.5)$, we only need to consider the two following cases:
  • Case 1: $R_1(0) < R_2(0)$
    If $R_1(0) < R_2(0)$, we have $1 - H_b(\epsilon_2) < H_b(\epsilon_1 * \epsilon_r)$ and there exists $\epsilon_q$ such that $R_1(\epsilon_q) = R_2(\epsilon_q)$. Thus
    $$R_{CF} = 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_r * \epsilon_q) - H_b(\epsilon_1)$$
    is achievable, where $\epsilon_q$ satisfies
    $$H_b(\epsilon_1 * \epsilon_r * \epsilon_q) - H_b(\epsilon_q) + H_b(\epsilon_2) = 1$$
  • Case 2: $R_1(0) \ge R_2(0)$
    If $R_1(0) \ge R_2(0)$, we have $1 - H_b(\epsilon_2) \ge H_b(\epsilon_1 * \epsilon_r)$. Thus
    $$R_{CF} = \max_{\epsilon_q} \min\{R_1(\epsilon_q), R_2(\epsilon_q)\} = \max_{\epsilon_q} R_2(\epsilon_q) = R_2(0) = 1 + H_b(\epsilon_1 * \epsilon_r) - H_b(\epsilon_r) - H_b(\epsilon_1)$$
    is achievable.

E. Proof of Proposition 5

For the memoryless relay, we have x r = y r and hence
$$Y_1 = X \oplus Z_1, \qquad Y_2 = X_r \oplus Z_2 = X \oplus Z_r \oplus Z_2 = X \oplus Z_{eq}$$
where Z e q : = Z r Z 2 Ber ( ϵ r * ϵ 2 ) . Then the achievable rate is given by C 1 = max p ( x ) I ( X ; Y 1 , Y 2 ) . Now using Lemma 2, we obtain
$$C_1 = 1 + H_b(\epsilon_1 * \epsilon_r * \epsilon_2) - H_b(\epsilon_r * \epsilon_2) - H_b(\epsilon_1)$$

F. Proof of Lemma 1

Consider
$$\begin{aligned}
I(X_1^k; Y_{1,1}^k, Y_{2,1}^k) &= I(X_1^k; Y_{1,1}^k) + I(X_1^k; Y_{2,1}^k | Y_{1,1}^k) \\
&\stackrel{(a)}{=} k\, I(X_1; Y_1) + I(X_1^k; Y_{2,1}^k | Y_{1,1}^k) \\
&= k\, I(X_1; Y_1) + H(Y_{2,1}^k | Y_{1,1}^k) - H(Y_{2,1}^k | Y_{1,1}^k, X_1^k) \\
&\stackrel{(b)}{=} k\, I(X_1; Y_1) + H(Y_{2,1}^k | Y_{1,1}^k) - H(Y_{2,1}^k | X_1^k) \\
&= k\left(1 - H_b(\epsilon_1)\right) - E\left[\log_2 p(y_{2,1}^k | y_{1,1}^k)\right] + E\left[\log_2 p(y_{2,1}^k | x_1^k)\right]
\end{aligned}$$
where (a) holds since $\{X_i\}_{i=1}^k$ are i.i.d. and the channel is memoryless, and (b) holds since $Y_{1,1}^k = X_1^k \oplus Z_{1,1}^k$ and $Z_{1,1}^k$ is independent of the other random sequences. The conditional probabilities can be computed as follows:
$$\begin{aligned}
p(y_{2,1}^k | x_1^k) &= \sum_{y_{r,1}^k} p(y_{2,1}^k | x_1^k, y_{r,1}^k)\, p(y_{r,1}^k | x_1^k) = \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{r,1}^k)\, p(y_{r,1}^k | x_1^k) \\
&= \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{r,1}^k) \prod_{i=1}^k p(y_{r,i} | x_i) = \sum_{y_{r,1}^k} \prod_{i=1}^k p(y_{2,i} | f_i(y_{r,1}^k)) \prod_{i=1}^k p(y_{r,i} | x_i) \\
&= \sum_{y_{r,1}^k} \epsilon_2^{D_H(y_{2,1}^k,\, f(y_{r,1}^k))} (1 - \epsilon_2)^{k - D_H(y_{2,1}^k,\, f(y_{r,1}^k))}\, \epsilon_r^{D_H(y_{r,1}^k,\, x_1^k)} (1 - \epsilon_r)^{k - D_H(y_{r,1}^k,\, x_1^k)} \\
&= (1 - \epsilon_2)^k (1 - \epsilon_r)^k \sum_{y_{r,1}^k} \left(\frac{\epsilon_2}{1 - \epsilon_2}\right)^{D_H(y_{2,1}^k,\, f(y_{r,1}^k))} \left(\frac{\epsilon_r}{1 - \epsilon_r}\right)^{D_H(y_{r,1}^k,\, x_1^k)}
\end{aligned}$$
We similarly obtain
$$\begin{aligned}
p(y_{2,1}^k | y_{1,1}^k) &= \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{1,1}^k, y_{r,1}^k)\, p(y_{r,1}^k | y_{1,1}^k) = \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{r,1}^k)\, p(y_{r,1}^k | y_{1,1}^k) \\
&= \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{r,1}^k) \sum_{x_1^k} p(y_{r,1}^k | y_{1,1}^k, x_1^k)\, p(x_1^k | y_{1,1}^k) \\
&= \sum_{y_{r,1}^k} p(y_{2,1}^k | y_{r,1}^k) \sum_{x_1^k} p(y_{r,1}^k | x_1^k)\, \frac{p(y_{1,1}^k | x_1^k)\, p(x_1^k)}{p(y_{1,1}^k)} \\
&= \sum_{y_{r,1}^k} \prod_{i=1}^k p(y_{2,i} | f_i(y_{r,1}^k)) \sum_{x_1^k} \prod_{i=1}^k p(y_{r,i} | x_i) \prod_{i=1}^k p(y_{1,i} | x_i) \\
&= (1 - \epsilon_1)^k (1 - \epsilon_2)^k (1 - \epsilon_r)^k \sum_{y_{r,1}^k} \left(\frac{\epsilon_2}{1 - \epsilon_2}\right)^{D_H(y_{2,1}^k,\, f(y_{r,1}^k))} \sum_{x_1^k} \left(\frac{\epsilon_r}{1 - \epsilon_r}\right)^{D_H(y_{r,1}^k,\, x_1^k)} \left(\frac{\epsilon_1}{1 - \epsilon_1}\right)^{D_H(y_{1,1}^k,\, x_1^k)}
\end{aligned}$$

References

  1. van der Meulen, E.C. Three-terminal communication channels. Adv. Appl. Probab. 1971, 3, 120–154.
  2. Cover, T.M.; El Gamal, A. Capacity theorems for the relay channel. IEEE Trans. Inform. Theory 1979, 25, 572–584.
  3. Høst-Madsen, A.; Zhang, J. Capacity bounds and power allocation for wireless relay channels. IEEE Trans. Inform. Theory 2005, 51, 2020–2040.
  4. El Gamal, A.; Mohseni, M.; Zahedi, S. Bounds on capacity and minimum energy-per-bit for AWGN relay channels. IEEE Trans. Inform. Theory 2006, 52, 1545–1561.
  5. Kramer, G.; Gastpar, M.; Gupta, P. Cooperative strategies and capacity theorems for relay networks. IEEE Trans. Inform. Theory 2005, 51, 3037–3063.
  6. Dabora, R.; Servetto, S.D. On the role of estimate-and-forward with time sharing in cooperative communication. IEEE Trans. Inform. Theory 2008, 54, 4409–4431.
  7. Laneman, J.N.; Wornell, G.W.; Tse, D.N.C. Cooperative diversity in wireless networks: Efficient protocols and outage behavior. IEEE Trans. Inform. Theory 2004, 50, 3062–3080.
  8. Sendonaris, A.; Erkip, E.; Aazhang, B. User cooperation diversity—Part I: System description. IEEE Trans. Commun. 2003, 51, 1927–1938.
  9. Sendonaris, A.; Erkip, E.; Aazhang, B. User cooperation diversity—Part II: Implementation aspects and performance analysis. IEEE Trans. Commun. 2003, 51, 1939–1948.
  10. Aleksic, M.; Razaghi, P.; Yu, W. Capacity of a class of modulo-sum relay channels. IEEE Trans. Inform. Theory 2009, 55, 921–930.
  11. Kim, Y.H. Coding Techniques for Primitive Relay Channels. In Proceedings of the Forty-Fifth Annual Allerton Conference, Allerton House, UIUC, IL, USA, 26–28 September 2007.
  12. Lau, A.P.T.; Cui, S. Joint power minimization in wireless relay channels. IEEE Trans. Wirel. Commun. 2007, 6, 2820–2824.
  13. Karystinos, G.N.; Liavas, A.P. Outage Capacity of a Cooperative Scheme with Binary Input and a Simple Relay. In Proceedings of the IEEE ICASSP 2008—International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 31 March–4 April 2008.
  14. Sagar, Y.; Kwon, H.M.; Ding, Y. Capacity of Modulo-Sum Simple Relay Network. In Proceedings of the International Zurich Seminar on Communications, Zurich, Switzerland, 3–5 March 2010.
  15. El Gamal, A.; Aref, M. The capacity of the semi-deterministic relay channel. IEEE Trans. Inform. Theory 1982, 28, 536.
  16. Khormuji, M.N.; Larsson, E.G. Rate-optimized constellation rearrangement for the relay channel. IEEE Commun. Lett. 2008, 12, 618–620.
  17. Khormuji, M.N.; Skoglund, M. Piecewise linear relaying: Low complexity parametric relaying. In Proceedings of the IEEE SPAWC, Perugia, Italy, 21–24 June 2009.
  18. Khormuji, M.N.; Skoglund, M. On instantaneous relaying. IEEE Trans. Inform. Theory 2010, 56, 3378–3394.
  19. Zaidi, A.; Khormuji, M.N.; Yao, S.; Skoglund, M. Rate-maximizing Mappings for Memoryless Relaying. In Proceedings of the IEEE ISIT, Seoul, Korea, 28 June–3 July 2009.
  20. Skoglund, M. On channel-constrained vector quantization and index assignment for discrete memoryless channels. IEEE Trans. Inform. Theory 1999, 45, 2615–2622.
  21. Mehes, A.; Zeger, K. Performance of quantizers on noisy channels using structured families of codes. IEEE Trans. Inform. Theory 2000, 46, 2468–2476.
  22. Rudin, W. Fourier Analysis on Groups; John Wiley & Sons: Hoboken, NJ, USA, 1990.
  23. Khormuji, M.N.; Skoglund, M. On the Capacity of the Binary Symmetric Relay Channel with a Finite Memory Relay. In Proceedings of the IEEE ITW, Cairo, Egypt, 6–8 January 2010.
  24. Liavas, A. Outage Capacity of a Cooperative Scheme with Binary Input and a Simple Relay. Year-1 Work-Package-1 report of European Commission FET Project FP6-033533-COOPCOM, November 2007.
