# A Decentralized Receiver in Gaussian Interference

^{1}

^{2}

^{3}

^{*}

Next Article in Journal

Previous Article in Journal

School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, AZ 85281, USA

School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ 85281, USA

MIT Lincoln Laboratory, Lexington, MA 02421, USA

Author to whom correspondence should be addressed.

Received: 1 February 2018
/
Revised: 6 April 2018
/
Accepted: 9 April 2018
/
Published: 11 April 2018

(This article belongs to the Section Information Theory, Probability and Statistics)

Bounds are developed on the maximum communications rate between a transmitter and a fusion node aided by a cluster of distributed receivers with limited resources for cooperation, all in the presence of an additive Gaussian interferer. The receivers cannot communicate with one another and can only convey processed versions of their observations to the fusion center through a Local Array Network (LAN) with limited total throughput. The effectiveness of each bound’s approach for mitigating a strong interferer is assessed over a wide range of channels. It is seen that, if resources are shared effectively, even a simple quantize-and-forward strategy can mitigate an interferer 20 dB stronger than the signal in a diverse range of spatially Ricean channels. Monte-Carlo experiments for the bounds reveal that, while achievable rates are stable when varying the receiver’s observed scattered-path to line-of-sight signal power, the receivers must adapt how they share resources in response to this change. The bounds analyzed are proven to be achievable and are seen to be tight with capacity when LAN resources are either ample or limited.

We are at a remarkable time for radio systems. With the significant reductions in cost and improvements in performance, there has been an explosion in the number of radio systems with a wide range of applications including personal, machine-to-machine, vehicle-to-vehicle, and Internet-of-Things (IoT) communications. In many situations, radios cluster in physically close groups. This has two effects. First, it increases the likelihood of interference between systems and necessitates that systems be designed to maintain good performance in these conditions. Second, it creates an opportunity for groups of independent radios to be used as a distributed array to improve performance. Considering this new setting, we study the problem of receiving a signal in the presence of interference with the help of a distributed array, as illustrated in Figure 1.

We develop bounds on the maximum communications rate from a single transmitter to a base node that is provided side information from a distributed receive array in the presence of interference. The nodes that provide side information to the base are identified as helpers, and we assert they do not communicate with one another, but only forward information to the base through a previously established reliable Local Array Network (LAN) link, and that this link only supports a maximum total throughput which all the helpers must share.

This receive system can mitigate a strong interferer in a wide variety of environments. We consider the effects of an overall LAN capability that is parameterized by a total number of bits that helpers can share with the base for each source channel usage. Furthermore, to investigate algorithmic needs as a function of environmental conditions, we consider a spatially Ricean channel and a parametrically defined interference. Finally, we consider both the conditions in which the base node does and does not have its own noisy observation of the source signal. Our bound development in this paper provides significant extensions to and clarifications of the results found in References [1,2], and our preliminary efforts in Reference [3].

We perform an analysis of the achievable rates of the system in the presence of an interferer. The following contributions are provided:

- Upper and lower bounds on the system’s achievable communication rate in correlated Gaussian noise and regimes where these bounds are tight (Section 3). The strongest lower bound is given in Theorem 7.
- Performance characteristics of these rates in the presence of an interferer (Section 4.1).
- Behavior of the strategies in various scattering environments (Section 4.2). The scattering environment is seen to not affect average performance if LAN resource sharing adapts to the channel.
- A strengthening of an existing achievability proof for the system, where the same rate is achieved using less cooperation between nodes (Remark 7).

The problem of communicating with the help of an array of distributed receivers has been studied in a variety of contexts, most generally as an instance of communications over a single-input multiple-output (SIMO) channel. A variety of SIMO channels have been extensively analyzed, but most analyses are subtly different than the problem considered here. Our work is done in the context of information theory and bounds achievable communications rates, while in contrast most existing studies work towards minimizing distortion metrics or bit error rates.

Results presented here are a significant extension and generalization of the work in [3], where there is no treatment of the system’s performance in the presence of an interferer. A tighter inner bound than all those in [3] is included in Theorem 7, and full proofs are provided for all the results.

Achievable communications rates for this network topology were studied in References [1,2], although their bounds do not directly apply to channels with a Gaussian interferer. The studies provide an example demonstrating the sub-optimality of a Gaussian broadcaster when no interference is present, and suggest that non-Gaussian signaling techniques are needed for this sort of network. In contrast, we demonstrate in Section 4 that an achievable rate using Gaussian signaling typically comes quite close to the system’s upper bound in many practical regimes.

The helpers in the system we have described seem like relay nodes but there is an important distinction. Here, the ensemble of links from each helper to the base are like those in a graphical network, available for design within distributional and rate constraints. This is not usual in the context of relay networks, including those studied in very general setting such as Reference [4]. Studies such as References [5,6,7] detailed using a collection of receivers as beam forming relays to a destination node in a network with structure similar to the system considered here. In our situation, each helper-to-base link is orthogonal to the others, so beam forming studies are not directly applicable.

Performance of specific coding schemes for this system were studied in References [8,9,10]. In particular, [10] was able to identify a scheme that can perform to within 1.5 dB of allowing the base and receivers to communicate without constraints. Results in our study further work towards characterizing achievable rates of the system rather than designing and analyzing the performance of specific modulation schemes.

The topology we consider is similar to many-help-one problems such as the “Chief Executive Officer (CEO) problem” posed by Berger in Reference [11]. In this scenario, some node called the CEO seeks to estimate a source by listening to “agent” nodes which communicate to the CEO at fixed total rates. Many variations of this problem have been studied, for instance in Reference [12] in limit with number of agents. The focus of the CEO problem and most of its derivatives are to estimate a source to within some distortion (often mean squared error), whereas the focus here is on finding achievable rates for lossless communications.

The noisy Slepian–Wolf problem [13] may be interpreted as communications in the opposite direction of our system: distributed, cooperative helpers have a message for a base, but their cooperation ability is limited and their modulations must be sent over a noisy channel.

Throughout the paper, we use the notation in Table 1. A broadcaster seeks to convey a message to a base at some rate $R>0$ bits per time period. The message M is uniformly distributed along $[1:{2}^{TR}]$ over some $T\in \mathbb{N}$ time periods. Our goal is to determine the greatest rate R at which the base can recover M with low probability of error.

Before communicating, the broadcaster and base agree on a modulation scheme $\varphi :[1:{2}^{TR}]\to {\mathbb{C}}^{T}$, where each possible message $m\in [1:{2}^{TR}]$ is mapped to a T-length signal through $\varphi $. Each of these signals is denoted ${\mathit{x}}_{m}=({x}_{m,1},\dots ,{x}_{m,T})=\varphi \left(m\right)$, and it is chosen within the broadcaster’s power constraint: $\frac{1}{T}{\sum}_{\ell =1}^{T}{\left|{x}_{m,\ell}\right|}^{2}\le 1$. To send message M, the broadcaster transmits ${\mathit{x}}_{M}$, which we denote $\mathit{X}=({X}_{1},\dots ,{X}_{T})={\mathit{x}}_{M}$.

The signal is observed by $N+1$ single-antenna receive nodes through a static flat fading channel and additive white Gaussian noise. Enumerating these receivers 0–N, we identify the base as the “$0\mathrm{th}$” receiver, and call the rest helpers. For $n\in [0:N]$, at each time t, the $n\mathrm{th}$ receiver observes
where

$${Y}_{n,t}={h}_{n}{X}_{t}+{W}_{n,t},$$

- the channel $\mathit{h}\triangleq \left[\begin{array}{c}{h}_{0}\\ \vdots \\ {h}_{N}\end{array}\right]\in {\mathbb{C}}^{(N+1)\times 1}$ is constant over time t, and
- the noise $({W}_{0,t},\dots ,{W}_{N,t})\sim \mathcal{CN}\text{}(0,\Sigma )$ has covariance $\Sigma $ across receivers, and is independent over t.

The receiver observations are written as vectors:

$$\begin{array}{cc}\hfill {\mathit{Y}}_{n}& \triangleq ({Y}_{n,1},\dots ,{Y}_{n,T})\in {\mathbb{C}}^{T},\hfill \end{array}$$

$$\begin{array}{cc}\hfill \phantom{\rule{1.4cm}{0ex}}\mathit{Y}& \triangleq ({\mathit{Y}}_{0},{\mathit{Y}}_{1},\dots ,{\mathit{Y}}_{N})\in {\left({\mathbb{C}}^{T}\right)}^{N+1}.\hfill \end{array}$$

Helper n for $n\in [1:N]$ employs a (possibly randomly chosen) vector quantizer ${Q}_{n}$ on its received sequence ${\mathit{Y}}_{n}$ to produce a coarse ${r}_{n}$-bitrate summary of its observations, ${U}_{n}\triangleq {Q}_{n}\left({\mathit{Y}}_{n}\right)$. The vector of helper messages is denoted:
and the quantizers $({Q}_{1},\dots ,{Q}_{N})=\mathit{Q}$ satisfy two properties:

$$\begin{array}{cc}\hfill \mathit{U}& \triangleq ({U}_{1},\dots ,{U}_{N})\hfill \end{array}$$

$$\begin{array}{c}\phantom{\rule{2.3cm}{0ex}}=({Q}_{1}\left({\mathit{Y}}_{1}\right),\dots ,{Q}_{N}\left({\mathit{Y}}_{N}\right))\hfill \end{array}$$

- Every node in the system is informed of each quantizer’s behavior, and quantizers produce their compression using only local information. Formally, the probability distribution of $(\mathit{Q},\mathit{X},\mathit{Y},\mathit{U})$ factors:$$\begin{array}{c}\hfill {P}_{(\mathit{Q},\mathit{X},\mathit{Y},\mathit{U})}={P}_{\mathit{Q}}\xb7{P}_{\mathit{X}|\mathit{Q}}\xb7{P}_{\mathit{Y}|\mathit{X}}\xb7\prod _{n=1}^{N}{P}_{{U}_{n}|{\mathit{Y}}_{n},\mathit{Q}}.\end{array}$$
- Quantizations are within the LAN constraint:$$\frac{1}{T}H\left({U}_{n}\right|\mathit{Q})\le {r}_{n},\phantom{\rule{4pt}{0ex}}n\in [1:N].$$

The vector of the helper’s quantization rates is denoted:

$$\mathit{r}\triangleq ({r}_{1},\dots ,{r}_{N})\in {\mathbb{C}}^{(N+1)\times 1}.$$

Each ${U}_{n}$ is conveyed precisely to the base over a Local Array Network (LAN). We assume the LAN only supports a limited amount of communication between helpers and the base. For instance, the helpers must all share a small frequency band. We model this by asserting that feasible rate vectors $\mathit{r}$ belong to a bounded set ${\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$ with a sum-capacity $L>0$:

$${\mathcal{R}}_{\mathrm{LAN}}\left(L\right)\triangleq \left\{\mathit{r}\left|\sum _{n=1}^{N}{r}_{n}\le L,\phantom{\rule{4pt}{0ex}}{r}_{n}\ge 0\right.\right\}.$$

We refer to the condition that $\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$ as the LAN constraint. The base reconstructs an estimate $\widehat{\mathit{X}}$ of $\mathit{X}$ using its own full-precision observation ${\mathbf{Y}}_{0}$ and side information $\mathit{U}$ from the helpers, then recovers an estimate $\widehat{M}$ of message M from $\widehat{\mathit{X}}$. A block diagram of the system is shown in Figure 2.

Channel $\mathit{h}$ and noise covariance $\Sigma $ are assumed to be static throughout the transmission of each message and are known to all the receive nodes. Both the transmitter and the receivers are assumed to have knowledge of the set of feasible rates, ${\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$. The environment determines:

- channel fades $\mathit{h}$,
- noise covariance $\Sigma $ and
- maximum LAN throughput L.

Available for design are:

- quantizers $\mathit{Q}$,
- message modulation ${\left\{{\mathit{x}}_{m}\right\}}_{m\in [1:{2}^{TR}]}$,
- rates each helper should send to the base $\mathit{r}$ and
- fusion and decoding methods at the base to produce $\widehat{M}$.

We say a communication rate R is achievable if for any $\epsilon >0$ then for large enough T there is some distribution of quantizers $\mathit{Q}$ satisfying $\frac{1}{T}H\left({Q}_{k}\left({Y}_{k}\right)\right|\mathit{Q})\le {r}_{k}$ for each k, encoders ${\varphi}_{\mathit{Q}}:[1:{2}^{TR}]\to {\mathcal{X}}^{T}$ and decoders ${f}_{\mathit{Q}}$ where:

$$P({f}_{\mathit{Q}}\left(\mathit{U}\right)\ne M)<\epsilon .$$

The maximum communications rate achievable with arbitrarily low probability of error is:
where ${\mathcal{S}}_{T}$ is the collection of joint distributions with block length T that satisfy conditions in Equations (6) and (7) for some $\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$.

$$C=\underset{T}{sup}\underset{\begin{array}{c}(\mathit{X},\mathit{Y},\mathit{Q},\mathit{U})\sim P\in {\mathcal{S}}_{T}:\\ {\mathbb{E}\left|X\right|}^{2}\le 1\end{array}}{max}\frac{1}{T}I(\mathit{X};{\mathit{Y}}_{0},\mathit{U}|\mathit{Q})$$

C is achievable through the usual noisy channel coding argument [14]. For any achievable rate R, applying Fano’s inequality to $H\left(M\right|\mathit{Q})$ gives $R\le C$. □

In the sequel, we bound C from Theorem 1 with computable expressions.

The system’s capacity C can be upper bounded by considering a stronger channel where all helpers are informed of each other’s observations. Compute:
where the first inequality follows since $\mathit{Q}\to \mathit{X}\to {\mathit{Y}}_{0}$ is a Markov chain (Equation (6)). If the LAN constraint were not imposed and the base had access to the helpers’ receptions in full precision, the receive side would be equivalent to a multi-antenna Gaussian receiver. By noisy channel coding [15], the capacity of this stronger channel is ${max}_{{P}_{X}{:E\left[\right|X|}^{2}]<1}I(X;\mathit{Y})$ (here, $(X,\mathit{Y})$ are variables at a single timepoint) which, by a derivation given in [16], simplifies to the familiar upper bound:

$$\begin{array}{cc}\hfill I(\mathit{X};{\mathit{Y}}_{0},\mathit{U}|\mathit{Q})& =I(\mathit{X};\mathit{U}|{\mathit{Y}}_{0},\mathit{Q})+I(\mathit{X};{\mathit{Y}}_{0}|\mathit{Q})\hfill \end{array}$$

$$\begin{array}{c}\phantom{\rule{1.3cm}{0ex}}\le H\left(\mathit{U}\right|\mathit{Q})+I(\mathit{X};{\mathit{Y}}_{0})\hfill \end{array}$$

$$\begin{array}{c}\phantom{\rule{3.3cm}{0ex}}\le T\xb7\left(L+{log}_{2}\left[1+\parallel {h}_{0}{\parallel}^{2}/{\Sigma}_{0,0}\right]\right),\hfill \end{array}$$

$${log}_{2}\phantom{\rule{-0.166667em}{0ex}}\left[\frac{|\Sigma +{\mathit{hh}}^{\u2020}|}{|\Sigma |}\right].$$

Taking the minimum of this bound and Equation (14) yields an upper bound for the original system. This is an instance of a general cut-set upper bound from [17].

Any achievable rate R must satisfy

$$\begin{array}{c}\hfill R\le min\phantom{\rule{-0.166667em}{0ex}}\left\{{log}_{2}\phantom{\rule{-0.166667em}{0ex}}\left[\frac{|\Sigma +{\mathit{hh}}^{\u2020}|}{|\Sigma |}\right]\phantom{\rule{-0.166667em}{0ex}},L+{log}_{2}\left[1+\parallel {h}_{0}{\parallel}^{2}/{\Sigma}_{1,1}\right]\right\}\phantom{\rule{-0.166667em}{0ex}}.\end{array}$$

Justified by the preceding discussion. □

The rest of the section is dedicated to four communication strategies for the system in order of increasing complexity and diversity utilization. The strongest bound is presented in Theorem 7. It is described using the context of the preceding two bounds.

Treating each helper node as a user seeking to receive its own message, the link from the broadcaster to helpers is a scalar Gaussian broadcast channel. The capacity region of this channel was characterized in Reference [18], and in particular its sum-rate was given in Reference [19]. In the scalar case, this sum rate reduces to:
where only the point-to-point channel between transmitter and the best receiver is used.

$$\underset{n}{max}{log}_{2}\left[\frac{{\Sigma}_{n,n}+{\left|{h}_{n}\right|}^{2}}{{\Sigma}_{n,n}}\right],$$

The following rate is achievable

$$\begin{array}{c}\hfill {R}_{\mathrm{BC}}\left(L\right)=max\left\{{log}_{2}\left[\frac{{\Sigma}_{0,0}+{\left|{h}_{0}\right|}^{2}}{{\Sigma}_{0,0}}\right],min\left\{L,\underset{n}{max}{log}_{2}\left[\frac{{\Sigma}_{n,n}+{\left|{h}_{n}\right|}^{2}}{{\Sigma}_{n,n}}\right]\right\}\right\}.\end{array}$$

The rate is achievable with the following facilities:

- Helpers are informed of which helper is the best (best is derived from $\mathit{h},\Sigma ,L$).
- The transmitter and best helper coordinate a codebook of appropriate rate and distribution.

By having the broadcaster send a message to the nth receiver, a rate
is achievable from transmitter-to-receiver. If $n=0$ (i.e., the transmitter broadcasts to the base), then Equation (19) is achievable from transmitter-to-base. Otherwise, Equation (19) is achievable from transmitter-to-base if it is less than L by forwarding the receiver’s decoding over the LAN. □

$${log}_{2}\left[\frac{{\Sigma}_{n,n}+{\left|{h}_{n}\right|}^{2}}{{\Sigma}_{n,n}}\right]$$

When the base does not have its own reception (i.e., $I(X;{Y}_{0})=0$) and
then the system’s capacity is L.

$$L\le \underset{n}{max}{log}_{2}\left[\frac{{\Sigma}_{n,n}+{\left|{h}_{n}\right|}^{2}}{{\Sigma}_{n,n}}\right]$$

By assumption, the upper bound in Equation (16) equals $L$. This is achievable by Remark 2. □

The strategy to achieve this rate requires a strong amount of cooperation from the broadcaster, since the decoding receivers must all share codebooks with the transmitter.

Rate-distortion theory shows that to encode a Gaussian source $Y\sim \mathcal{N}(0,{\sigma}^{2})$ with minimum rate such that distortion does not exceed some maximum allowable mean squared error $\mathcal{D}>0$, the rate must be at least

$$\mathcal{R}\left(\mathcal{D}\right)={log}_{2}\left({\sigma}^{2}\right)-{log}_{2}\left(\mathcal{D}\right).$$

To achieve this rate, the encoding operation must emulate the following test channel (Reference [14]):
where $\alpha =\sqrt{1-\mathcal{D}/{\sigma}^{2}}$ and $\beta =\sqrt{\mathcal{D}}$. Dithered lattice quantization [20] is one method that can be used to realize such a quantizer in limit with block length in addition to having other nice analytic properties. Using this method, in limit with block length, the base’s estimates for a helper’s observation at any single timepoint can be made to approximate
where ${W}_{Q,n}\left({r}_{n}\right)$ is independent Gaussian distortion with variance some function of the helper’s rate ${r}_{n}$. We will now derive this function. Define
and the vector of ${Z}_{n}$s corresponding to this choice of ${W}_{Q,n}$ as:

$$\underset{W\sim \mathcal{CN}\text{\hspace{0.17em}}(0,1)}{Y\stackrel{\alpha /\beta}{\to}\underset{\uparrow}{\oplus}\stackrel{\beta}{\to}Z}=\alpha Y+\beta W$$

$${Z}_{n}\left({r}_{n}\right)\triangleq {Y}_{n}+{W}_{Q,n}\left({r}_{n}\right)$$

$${\mathit{W}}_{Q}\left(\mathit{r}\right)\triangleq ({W}_{Q,1}\left({r}_{1}\right),\dots ,{W}_{Q,N}\left({r}_{n}\right))\in {\mathbb{C}}^{N\times 1},$$

$$\mathit{Z}\left(\mathit{r}\right)\triangleq \mathit{Y}+{\mathit{W}}_{Q}\left(\mathit{r}\right).$$

The subscript Q here is a decoration to distinguish quantization distortion terms ${\mathit{W}}_{Q},{W}_{Q,n}$ from environment noise, $\mathit{W},{W}_{n}.$

If the nth helper is to forward information to the base at rate ${r}_{n},$ then the amount of distortion in the helper’s encoding under this strategy can be determined by setting Equation (21) equal to ${r}_{n}$ and solving for $\mathcal{D}$, with ${\sigma}^{2}$ equal to the helper observation’s variance, $\parallel {h}_{n}{\parallel}^{2}+{\Sigma}_{n,n}$. Scaling the output of the test channel the helper emulates (Equation (22)) by $1/\alpha $ causes it to equal the helper’s observation plus independent Gaussian noise with variance ${(\beta /\alpha )}^{2}=\mathcal{D}/(1-\mathcal{D}/{\sigma}^{2})$. This means that, in such a system, the distortion from the helpers is equivalent to adding independent additive Gaussian noise with variance:

$$\mathrm{Var}\left({W}_{Q,n}\left({r}_{n}\right)\right)=\frac{\parallel {h}_{n}{\parallel}^{2}+{\Sigma}_{n,n}}{{2}^{{r}_{n}}-1}.$$

By Equations (1) and (26), all the noise and distortion on the signal present in the helpers’ messages to the base are summarized by the vector $\mathit{W}+{\mathit{W}}_{Q}$ with covariance matrix

$$\mathbf{D}\left(\mathit{r}\right)\triangleq \Sigma +\mathrm{diag}\left({\displaystyle 0,\frac{\parallel {h}_{1}{\parallel}^{2}+{\Sigma}_{1,1}}{{2}^{{r}_{1}}-1},\dots ,\frac{\parallel {h}_{N}{\parallel}^{2}+{\Sigma}_{N,N}}{{2}^{{r}_{N}}-1}}\right).$$

We refer to this system as a Gaussian distortion system. A block diagram of this system is shown in Figure 3. This system is equivalent to a multi-antenna Gaussian receiver and its achievable rate has a closed form:

$${R}_{G}\left(\mathit{r}\right)\triangleq I(X;\mathit{Z}\left(\mathit{r}\right))={log}_{2}\left[\frac{|\mathbf{D}\left(\mathit{r}\right)+{\mathit{hh}}^{\u2020}|}{\left|\mathbf{D}\right(\mathit{r}\left)\right|}\right].$$

For a distributed receive system as described in Section 2 with noise covariance matrix Σ, LAN constraint L and fixed average helper quantization rates $\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$, then a rate ${R}_{G}\left(\mathit{r}\right)$ is achievable with the following facilities:

- The transmitter and base coordinate a codebook of appropriate rate and distribution.
- Each n-th helper is informed of an encoding rate ${r}_{n}$ (optimal choice derived from $\mathit{h},\Sigma ,L$).
- Each helper coordinates a random dither with the base.
- The base is informed of $\mathit{r},\mathit{h},\Sigma ,L$.

All the details of this derivation are formalized in Appendix B.

Maximizing ${R}_{G}$ over ${\mathcal{R}}_{LAN}\left(L\right)$ gives a lower bound on the system capacity:

$${R}_{G,\mathrm{max}}\left(L\right)\triangleq \underset{\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)}{max}{R}_{G}\left(\mathit{r}\right).$$

Unfortunately, this expression cannot be simplified much in general. Maximization of ${R}_{G}$ over ${\mathcal{R}}_{\mathrm{LAN}}(\xb7)$ can be performed efficiently using quasi-convex optimization algorithms:

The max of ${R}_{G}\left(\mathit{r}\right)$ follows the maximum of ${\mathit{h}}^{\u2020}\mathbf{D}{\left(\mathit{r}\right)}^{-1}\mathit{h},$ which is quasi-concave in $\mathit{r}$.

The maximum of ${R}_{G}$ follows the maximum of $-{\mathit{h}}^{\u2020}\mathbf{D}{\left(\mathit{r}\right)}^{-1}\mathit{h}$ by the matrix determinant lemma.

It suffices to show that the restriction of this functional on the intersection of any line with ${\mathbb{R}}_{+}^{N\times 1}$ is quasi-concave (see Reference [21]). Further, a one-dimensional function is quasi-concave if anywhere its derivative is 0 and its second derivative is below 0.

Fix any $\mathit{a},\mathit{b}\in {\mathbb{R}}^{N\times 1},$ denote ${\mathbf{D}}_{t}{\triangleq \mathbf{D}(\xb7)|}_{t\mathit{a}+\mathit{b}}$ for any t where $t\mathit{a}+\mathit{b}\in {\mathbb{R}}_{+}^{N\times 1}$. Define ${\mathbf{A}}_{t}\triangleq \frac{d}{dt}{\mathbf{D}}_{t},$ and $f\left(t\right)\triangleq -{\mathit{h}}^{\u2020}{\mathbf{D}}_{t}^{-1}\mathit{h}$. Then,

$$\frac{d}{dt}f\left(t\right)={\mathbf{A}}_{t}{\mathbf{D}}_{t}^{-1}\mathit{h}$$

$$\frac{{d}^{2}}{d{t}^{2}}f\left(t\right)={\left({\mathbf{D}}_{t}^{-1}\mathit{h}\right)}^{\u2020}\left[\frac{d}{dt}{\mathbf{A}}_{t}-2{\mathbf{A}}_{t}{\mathbf{D}}_{t}^{-1}{\mathbf{A}}_{t}\right]\left({\mathbf{D}}_{t}^{-1}\mathit{h}\right).$$

If $\frac{d}{dt}f\left(t\right)=0,$ then ${\mathbf{D}}_{t}^{-1}\mathit{h}$ is in the null space of ${\mathbf{A}}_{t}$ so the $-2{\mathbf{A}}_{t}{\mathbf{D}}_{t}^{-1}\mathbf{A}$ term in Equation (31) vanishes. $\frac{d}{dt}{\mathbf{A}}_{t}$ is negative definite in ${\mathbb{R}}_{+}^{N\times 1}$ so $\frac{{d}^{2}}{d{t}^{2}}f\left(t\right)<0$ there, establishing f’s quasi-concavity and thus that of $-{\mathit{h}}^{\u2020}{\mathbf{D}}_{t}^{-1}{\mathit{h}}^{\u2020}$. Since $-{\mathit{h}}^{\u2020}{\mathbf{D}}_{t}^{-1}{\mathit{h}}^{\u2020}$ is quasi-concave along all of ${\mathbb{R}}_{+}^{N\times 1}$, then it is also quasi-concave in any convex restriction of that domain. ${\mathcal{R}}_{LAN}\left(L\right)$ is a simplex, which is convex. □

${R}_{G,\mathrm{max}}\left(L\right)$ is tight with the upper bound in Equation (16), in limit with LAN throughput L.

${R}_{G,\mathrm{max}}\left(L\right)\to C$ as $L\to \infty .$

${R}_{G\mathrm{max}}\left(L\right)\ge {R}_{G}(L/N\xb7{\mathbf{1}}_{N}),$ and if $L\to \infty $ then
by continuity of ${log}_{2}$, sums, and the matrix determinant (in terms of matrix coordinates). The limit in Equation (32) is greater than or equal to Equation (16), establishing the remark. □

$${R}_{G}(\phantom{\rule{4pt}{0ex}}L/N\xb7{\mathbf{1}}_{N}\phantom{\rule{4pt}{0ex}})\to {log}_{2}\left[\frac{|\Sigma +{\mathit{hh}}^{\u2020}|}{|\Sigma |}\right]$$

This strategy has the benefit of not needing much coordination between receive nodes. The transmitter does not have to share codes with helpers because here the helpers do not perform any decoding. Each helper needs only its encoding rate and its own channel state to form its messages properly.

Helper n for $n\in [1:N]$ in a realization of the Gaussian distortion system described in Section 3.2, on average and with long codes, will produce an ${r}_{n}$-bit encoding of its observation that can be decoded to produce the helper observation with additional approximately Gaussian noise. Since all the helper’s quantizations contain the same signal component (and possibly the same interference), they are correlated and can be compressed before forwarding to allow for less noise to be introduced in quantization.

The Slepian–Wolf theorem [22] shows that if the LAN is such that helpers can encode at rates $\mathit{\rho}\triangleq ({\rho}_{1},\dots ,{\rho}_{N})$, then the helpers can losslessly convey the encodings to the base at lower rates $\mathit{r}\u2aaf\mathit{\rho}$, as long as $\mathit{r}$ and the encodings $\mathit{U}$ satisfy the conditions that for all subsets $S\subseteq [1:N]$, then:

$$H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}},{Y}_{0},\mathit{Q})<\sum _{n\in S}{r}_{n}.$$

Note that ${Y}_{0}$ is always included in the conditioning because ${Y}_{0}$ is available at the base node in full precision.

In the Gaussian distortion setting described right before Equation (23), $H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}},{Y}_{0},\mathit{Q})$ can be made close to $I({\mathit{Y}}_{S};{\mathit{Z}}_{S}\left(\mathit{\rho}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{\rho}\right),{Y}_{0})$ with time expansion. The details of this are given in Lemma 2. Maximizing over the LAN constraint in Equation (9), the following rate is achievable:
where
$\mathit{Z}(\xb7)$ in Equation (35) is defined as in Equation (25). We can then state the following:

$$\begin{array}{c}\hfill {R}_{DC}\left(L\right)\triangleq \underset{\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)}{max}\phantom{\rule{4pt}{0ex}}\underset{\mathit{\rho}\in {\mathcal{R}}_{DC}\left(\mathit{r}\right)}{max}{R}_{G}\left(\mathit{\rho}\right)\end{array}$$

$$\begin{array}{c}\hfill {\mathcal{R}}_{DC}\left(\mathit{r}\right)=\left\{\mathit{\rho}:\forall S\subseteq [1:N],I\left({\mathit{Y}}_{S};{\mathit{Z}}_{S}\left(\mathit{\rho}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{\rho}\right),{Y}_{0}\right)<\sum _{n\in S}{r}_{n}\right\}.\end{array}$$

For a distributed receive system as described in Section 2 with noise covariance matrix Σ and LAN constraint L, ${R}_{DC}\left(L\right)$ is achievable with the following facilities:

- The transmitter and base coordinate a codebook of appropriate rate and distribution.
- Each n-th helper is informed of a quantization rate ${\rho}_{n}$ and binning rate ${r}_{n}$, (optimal choice derived from $\mathit{h},\Sigma ,L$).
- Each helper coordinates a random dither with the base.
- The base is informed of $\mathit{\rho},\mathit{r},\mathit{h},\Sigma ,L$.

The base can unambiguously decompress the helper’s compressed encodings with low probability of error if and only if $\mathit{\rho}$ is chosen such that Equation (33) is satisfied. However, in analog to Corollary 1 from Reference [1], the rate in 6 can be improved by expanding ${\mathcal{R}}_{DC}\left(\mathit{r}\right)$ to include some of the $\mathit{\rho}$ where the base cannot perform unambiguous decompression. This helps because even if the encoding rates in $\mathit{\rho}$ are chosen outside of ${\mathcal{R}}_{DC}\left(\mathit{r}\right)$ so that helpers cannot convey $\mathit{U}$ to the base unambiguously, some extra correlation with X is retained through this distortion.

For a distributed receive system as described in Section 2 with noise covariance matrix Σ and LAN constraint L, the following rate is achievable:
where

$${R}_{\overline{DC}}\left(L\right)\triangleq \underset{\begin{array}{c}\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right),\\ \lambda >0\end{array}}{max}\phantom{\rule{4pt}{0ex}}\underset{\mathit{\rho}\in {\mathcal{R}}_{DC}^{\lambda}\left(\mathit{r}\right)}{max}{R}_{G}\left(\mathit{\rho}\right)-\lambda $$

$$\begin{array}{c}\hfill {\mathcal{R}}_{DC}^{\lambda}\left(\mathit{r}\right)\triangleq \left\{\mathit{\rho}:\forall S\subseteq [1:N],I\left({\mathit{Y}}_{S};{\mathit{Z}}_{S}\left(\mathit{\rho}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{\rho}\right),{Y}_{0}\right)<\sum _{n\in S}{r}_{n}+\lambda \right\}.\end{array}$$

The rate requires the following facilities:

- The transmitter and base coordinate a codebook of appropriate rate and distribution.
- Each n-th helper is informed of a quantization rate ${\rho}_{n}$, binning rate ${r}_{n}$ and hull parameter λ (optimal choice derived from $\mathit{h},\Sigma ,L$).
- Each helper coordinates a random dither with the base.
- The base is informed of $\mathit{\rho},\mathit{r},\lambda ,\mathit{h},\Sigma ,L$.

All the details of Theorems 6 and 7 are shown in Appendix B.

The capacity of the system is ${R}_{\overline{DC}}\left(L\right)$ under the following restrictions:

- Σ is diagonal (no interference).
- The base does not have its own full-precision observation of the broadcast (${h}_{0}=0$).
- The broadcaster must transmit a Gaussian signal.
- Construction of helper messages is independent of the transmitter’s codebook $\mathcal{X}$.

This is demonstrated in Appendix C. The last three assumptions are necessary:

- Since the base has codebook knowledge, it is possible for the transmitter to send a direct message to the base, which is not accounted for in the compress-and-forward strategy used for ${R}_{\overline{DC}}\left(L\right)$.
- The Gaussian broadcast assumption is needed because of a counterexample given in Reference [1].
- Codebook independence is necessary because ${R}_{\overline{DC}}\left(L\right)$ is strictly less than the upper bound in Equation (16), but, by Remark 3, this upper bound is achieved in some regimes.

Theorem 7 and Remark 7 strengthen Corollary 1 and Theorem 5 from Reference [1] since we show that the same rate can be achieved with less cooperation between transmitter and receivers. Sanderovich et al. [1] used a “nomadicity” assumption which asserts that the mapping from transmitter messages to codewords is not present at the helpers, whereas Theorem 7 shows (the same argument applying to the general discrete case) that indeed the same rate is achievable when helpers have no knowledge at all of the codewords the transmitter is using.

Similar to the Gaussian distortion achievable rate, the distributed compression technique does not require cooperation between the transmitter and helpers. It does, however, require a priori sharing of codes from the helpers to the base so that the helpers can perform distributed compression.

In this section, the bounds are averaged over random channels in various regimes. Each bound tested is a deterministic function of:

- number of helpers N,
- LAN constraint L,
- noise covariance matrix $\Sigma $, and
- channel $\mathit{h}$ (assumed to be static and precisely estimated a priori per each channel use).

In all graphs, the rate from Equation (16) is called “Upper Bound”, the rate from Equation (28) is “Gaussian Distortion”, the rate from Theorem 7 is “Distributed Compression” and the rate from Remark 2 is “Broadcast”. A discussion of the optimizations performed for the Gaussian Distortion and Distributed Compression bounds is in Appendix A.

One motivation for increasing receive diversity is to better mitigate the effect of interference, so it is important to study the extent to which the strategies from Section 3 can do so.

We model interference as a zero-mean Gaussian broadcast independent of our system. Our system’s nodes are informed of nothing more than the scales at which the interferer appears in our receivers’ observations. This assumption prohibits using dirty paper coding [23] and other transmit-side interference mitigation strategies. With this model, the covariance matrix of noise terms associated with a particular interferer seen by the receivers is a matrix $\mathbf{A}=\mathit{a}{\mathit{a}}^{\u2020},$ where $\mathit{a}\in {\mathbb{C}}^{N+1}.$ In all trials run, a single interferer was assumed to be present at all nodes with power $\alpha $ and random phase so that
with ${\mathit{a}}_{n}={e}^{j2\pi \xb7{\theta}_{n}},\phantom{\rule{4pt}{0ex}}{\theta}_{n}\sim \mathsf{Unif}[0,1]$ where $\mathsf{Unif}$ is the uniform distribution.

$$\Sigma ={\mathbf{I}}_{N+1\times N+1}+\alpha \xb7\mathit{a}{\mathit{a}}^{\u2020}.$$

Figure 4 shows the difference in achievable rates with and without interference for different receive-side strategies. Even when a strong rank-one interferer is present at all the nodes, achievable communications rates are comparable to the case without an interferer.

Sanderovich et al. [1] provided an example of a system such as the one in the present work where a Gaussian transmitter is significantly sub-optimal. In contrast, we see in Figure 4 that, over an average of many random channels in a range of settings, achievable rates using Gaussian signaling come quite close to an upper bound on the system. Even the relatively simple Gaussian distortion bound is close in performance to the optimum in all regimes but one with strong interference and little cumulative helper-to-base information. In this regime the LAN constraint limits the helper’s ability to provide the base with the diversity of observations necessary to perform good interference mitigation.

It is reasonable to expect that the best strategy for a distributed receiver to use will change depending on its environment. If the receivers observe the signal through a single line-of-sight path, all else being equal, the signal and noise at each receiver will have similar statistics. In contrast, in environments with many scatterers, the channel statistics will vary more across spatially distributed receivers.

The level of attenuation at a receiver can be modeled with a Rician distribution [24]. The Rician distribution follows the magnitude of a circularly-symmetric complex Gaussian with nonzero mean and can be parameterized by two nonzero values: a scale $\mathsf{\Omega}>0$ representing the average receive SNR and a shape $K>0$ (called a K-factor in other literature) denoting the ratio of signal power received from direct paths to the amount of power received from scattered paths. By construction, if $K=0$, then a Rician distribution is equivalent to a Rayleigh distribution with mean $\mathsf{\Omega}$. In contrast, if $K\to \infty $, then the distribution approaches a point mass at $\mathsf{\Omega}$ (Reference [25]).

The type of scattering environment does not greatly affect the average rate achievable by any bound other than the Broadcast bound (Figure A1). This is because the Broadcast bound’s only utilization of receive diversity is in channel variances which is small when the channel is dominated by line-of-sight receptions. Despite the invariance of most bounds to K, the profile of helper-to-base bits does change. Figure 5 shows that, no matter the other parameters, in high scattering, it is most helpful for the base to draw most of its information from the highest SNR helper while mostly ignoring low-SNR helpers’ observations. The imbalance is less pronounced in interference and when SNR is less varied across receivers (high K), since in these regimes, gaining diverse observations for combining is more beneficial than using the strongest helper’s observation.

When the base collects its own full-precision observation, the helper rate profile is unaffected by interference and scattering, where most helper information is provided by the highest SNR helper. In practice, in this situation, the base might use its observation as an estimate of the interferer and subtract it from a compression of the strongest helper’s observation.

One upper bound and three lower bounds on communication rate were developed. A simple upper bound (Remark 1) was derived, achievable when the LAN constraint is stringent (small L), and achievable in limit as the LAN constraint is relaxed ($L\to \infty $). An achievable rate (Theorem 5) considering quantizers which add Gaussian distortion was seen to provide large gains in interference over a rate that does not fully use the distributed receive array (Remark 2). The rate (Theorem 7) achieved when a distributed compression stage is added to the previous technique provides gains over Theorem 5 when interference is strong and helper-to-base communication is limited, but may be more difficult to implement.

Leaving SNR fixed, performance is mostly unchanged in high-scattering versus low-scattering environments, although the profile of helper-to-base communication changes: In high-scattering environments, some helper observations are ignored by the base, while, in line-of-sight environments, each helper informs the base equally. The presence of a strong interferer dampens this effect, since in this regime the base needs more spatial diversity to mitigate the interferer.

An immediate goal for future work is to devise practical implementations of the presented strategies. This may include refining the models of the LAN and helpers to more precisely reflect system capabilities.

The following are available online at https://www.mdpi.com/1099-4300/20/4/269/s1.

We would like to thank Shankar Ragi for aid in producing datasets for this paper. We would also like to thank Bryan Paul for his comments and corrections. Support by Hans Mittelmann offered by AFOSR under grant FA 9550-15-1-0351. DISTRIBUTION STATEMENT A: Approved for Public Release, Distribution Unlimited. This work was sponsored in part by Defense Advanced Research Projects Agency under Air Force Contract FA8721-05-C-0002. Opinions, interpretations, conclusions, and recommendations are those of the authors and are not necessarily endorsed by the United States Government.

Daneil W. Bliss, Adam Margetts and Christian Chapman developed the theoretical work in the paper. Christian Chapman and Hans Mittelmann performed the simulations. Christian Chapman wrote the paper.

The authors declare no conflict of interest.

Evaluation of the Gaussian distortion and distributed compression bounds in each environment requires optimization of a quasi-convex objective (Remark 4). In the case of distributed compression, the space of acceptable parameters is not convex: consider compressing the encodings for two highly-correlated sources. By the Slepian–Wolf theorem, if the encodings are high-bitrate, then perturbing the encodings’ bitrate does not greatly affect the region of feasible compression rates, since the encodings have high redundancy. On the other hand, perturbing the bitrate of low-bitrate encodings greatly affects the region of feasible compression rates, since uncorrelated distortion from encoding dominates source redundancies. Then, a compression to an average between minimum feasible compression rates for low-bitrate encodings and ones for high-bitrate encodings is infeasible, and the distance from the feasible set becomes large as the difference between high and low bitrates is made large. This is the mechanism that governs ${\mathcal{R}}_{\overline{DC}}\left(\mathit{r}\right)\subseteq {\mathbb{R}}_{+}^{K},$ hence the domain’s non-convexity.

To overcome this, an iterative interior point method was used to find the maximum: each constraint $f\left(x\right)<0$ is replaced with a stricter constraint, $f\left(x\right)+\beta <0$ for some $\beta >0$ and a minimization of the objective is performed under the new constraint. The optimal point is passed as the initial guess for the next iteration, where the objective function is minimized with updated constraints $f\left(x\right)+{\beta}^{\prime}<0$ with $0<{\beta}^{\prime}<\beta $. The method is iterated until the constraint is practically equivalent to $f\left(x\right)<0$ or the optimal point converged. Each individual minimization was performed using sequential least-squares programming (SLSQP) through SciPy [26]. The code for this is included in the Supplementary Materials.

Several lemmas are needed to prove bound achievability. The following lemma demonstrates that lattice encodings become close in joint information content to the equivalent random variables in a Gaussian distortion system from Section 3.2. Lemma 1 demonstrates the joint differential entropies between source estimates from the encodings are close to the equivalent differential entropies in the Gaussian distortion system. Lemma 2 demonstrates the encodings themselves have joint Shannon entropies that are close to mutual informations in the Gaussian distortion system.

Take helper observations $\mathit{Y}$ and independent dithers ${\left({\mathit{W}}_{Q,n}\right)}_{n\in [0:N]}$ where, for each n, the dither vector ${\mathit{W}}_{Q,n}$ is distributed along the base cell of a regular lattice in ${\mathbb{C}}^{t}$ whose normalized second moment is ${\sigma}_{Q,n}^{2}$. For any $A\subset [0:N]$, $\epsilon >0$ and t large enough,

$$\left|\frac{1}{t}h({\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})-h({Y}_{A}+{W}_{A})\right|<\epsilon $$

The proof is similar to Theorem 6 from [20]. Notice that, for any additive noise channel, $x\to x+n$ then $I(x;x+n)=h(x+n)-h\left(n\right)$ so that $h(x+n)=h\left(n\right)+I(x;x+n)$. Then,
where Equation (A4) follows since the terms are independent, Equation (A5) follows by the worst additive noise lemma [27], and the substitution and convergence in Equation (A7) follow by Section III of [20]. In addition:
where Equation (A4) follows by Lemma 1 from [28] and Equations (A10) and (A11) follow since the terms in ${\left({\mathit{W}}_{Q,n}\right)}_{n\in A}$ are independent and by Section III of [20]. □

$$\begin{array}{}\mathrm{(A2)}& \hfill \frac{1}{t}h({\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})& =\frac{1}{t}[h\left({\left({\mathit{W}}_{Q,n}\right)}_{n\in A}\right)+I({\mathit{Y}}_{A};{\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})]\hfill \mathrm{(A3)}& & =\frac{1}{t}[h\left({\left({\mathit{W}}_{Q,n}\right)}_{n\in A}\right)-th\left({W}_{A}\right)+th\left({W}_{A}\right)+I({\mathit{Y}}_{A};{\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})]\hfill \mathrm{(A4)}& & =\frac{1}{t}[\sum _{n\in A}h\left({\mathit{W}}_{Q,n}\right)-th\left({W}_{n}\right)+th\left({W}_{A}\right)+I({\mathit{Y}}_{A};{\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})]\hfill \mathrm{(A5)}& & \ge \frac{1}{t}[\sum _{n\in A}h\left({\mathit{W}}_{Q,n}\right)-th\left({W}_{n}\right)+th\left({W}_{A}\right)+tI({Y}_{A};{Y}_{A}+{W}_{A})]\hfill \mathrm{(A6)}& & =\frac{1}{t}[\sum _{n\in A}h\left({\mathit{W}}_{Q,n}\right)-th\left({W}_{n}\right)+th({Y}_{A}+{W}_{A})]\hfill \mathrm{(A7)}& & {\textstyle =h({Y}_{A}+{W}_{A})+\frac{1}{t}{\displaystyle \sum _{n\in A}}D({P}_{{\mathit{W}}_{Q,n}}\parallel {\prod}_{\ell =1}^{t}{P}_{{W}_{n}})\stackrel{t\uparrow \infty}{\to}h({Y}_{A}+{W}_{A}).}\hfill \end{array}$$

$$\begin{array}{}\mathrm{(A8)}& \hfill \frac{1}{t}h({\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})& =\frac{1}{t}[h\left({\left({\mathit{W}}_{Q,n}\right)}_{n\in A}\right)+I({\mathit{Y}}_{A};{\mathit{Y}}_{A}-{\left({\mathit{W}}_{Q,n}\right)}_{n\in A})]\hfill \mathrm{(A9)}& & {\textstyle \le \frac{1}{t}h\left({\left({\mathit{W}}_{Q,n}\right)}_{n\in A}\right)+I({Y}_{A};{Y}_{A}+{W}_{A})+{\displaystyle \sum _{n\in A}}D({P}_{{\mathit{W}}_{Q,n}}\parallel {\prod}_{\ell =1}^{t}{P}_{{W}_{n}})}\hfill \mathrm{(A10)}& & <I({Y}_{A};{Y}_{A}+{W}_{A})+\sum _{n\in A}\frac{1}{t}h\left({\mathit{W}}_{Q,n}\right)+\epsilon /2\hfill \mathrm{(A11)}& & \le I({Y}_{A};{Y}_{A}+{W}_{A})+\sum _{n\in A}h\left({W}_{n}\right)+\epsilon \hfill \mathrm{(A12)}& & =I({Y}_{A};{Y}_{A}+{W}_{A})+h\left(W\right)+\epsilon =h({Y}_{A}+{W}_{A})+\epsilon \hfill \end{array}$$

For length-t helper observations $\mathit{Y}$, dithered lattice encodings $\mathit{U}$ of $\mathit{Y}$ using dithers ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in [0:N]}$ where each ${\mathit{U}}_{n}$ encodes ${\mathit{Y}}_{n}$ onto a regular lattice in ${\mathbb{C}}^{t}$ whose cells have normalized second moment ${\sigma}_{Q,n}^{2}$. Then, for any $\epsilon >0$ and if t is large enough, any $S\subseteq [0:N]$ has

$$\left|\frac{1}{t}H\left({\mathit{U}}_{S}\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in S})-I({Y}_{S};{\mathit{Z}}_{S}\left(\mathit{r}\right))\right|<\epsilon .$$

In Equations (A13), $\mathit{Z}\left(\mathit{r}\right)=Y+W,$ with $Y=({Y}_{0},\dots ,{Y}_{N})\in {\mathbb{C}}^{N+1}$ distributed the same as helper observations at a single time point, and

$$\begin{array}{c}\hfill W=({W}_{0},\dots ,{W}_{N})\sim \mathcal{CN}\left(0,\mathrm{diag}\left({\sigma}_{Q,1}^{2},\dots ,{\sigma}_{Q,N}^{2}\right)\right).\end{array}$$

We prove this by induction. By Theorem 1 in Reference [29], for any $n\in [0:N]$

$$H\left({\mathit{U}}_{n}\right|{\tilde{\mathit{W}}}_{Q,n})=I({\mathit{Y}}_{n};{\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}).$$

Theorem 6 in Reference [20] has that

$$\frac{1}{t}I\left({\mathit{Y}}_{n};{\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}\right)\underset{t\uparrow \infty}{\to}I({Y}_{n};\phantom{\rule{4pt}{0ex}}{Y}_{n}+{W}_{Q,n})=I({Y}_{n};{\mathit{Z}}_{n}\left({r}_{n}\right)).$$

Thus, the statement holds for any singleton $\left\{n\right\}$. Take any disjoint $A,B\subset [0:N]$ for which the claim holds for ${\epsilon}^{\prime}>0$. By construction, given ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B}$, there is a function ${f}_{A\cup B}$ where ${f}_{A\cup B}\left({\mathit{U}}_{A\cup B}\right)={\mathit{Y}}_{A\cup B}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B}^{*}$ with ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B}^{*}$ independent of but distributed like ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B}$ [20]. Compute:

$$\begin{array}{}\mathrm{(A17)}& \hfill \frac{1}{t}H\left({\mathit{U}}_{A\cup B}\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})& \ge \frac{1}{t}H\left({f}_{A\cup B}\left({\mathit{U}}_{A\cup B}\right)\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})\hfill \mathrm{(A18)}& & \ge \frac{1}{t}I({f}_{A\cup B}\left({\mathit{U}}_{A\cup B}\right);{\mathit{Y}}_{A\cup B}|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})\hfill \mathrm{(A19)}& & =\frac{1}{t}I({f}_{A\cup B}\left({\mathit{U}}_{A\cup B}\right);{\mathit{Y}}_{A\cup B})\hfill \mathrm{(A20)}& & \ge I({Y}_{A\cup B};{\mathit{Z}}_{A\cup B}\left(\mathit{r}\right)).\hfill \end{array}$$

The first inequality follows by the data processing inequality. Equation (A19) follows since ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B}$ is independent of the other variables in the term. The last inequality follows by the worst additive noise lemma [27]. For t large enough,

$$\begin{array}{}\mathrm{(A21)}& \hfill \frac{1}{t}H\left({\mathit{U}}_{A\cup B}\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})=& \frac{1}{t}[H\left({\mathit{U}}_{A}\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})+H\left({\mathit{U}}_{B}\right|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})-I({\mathit{U}}_{A};{\mathit{U}}_{B}|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})]\hfill \mathrm{(A22)}& & \le I({Y}_{A};{\mathit{Z}}_{A}\left(\mathit{r}\right))+I({Y}_{B};{\mathit{Z}}_{B}\left(\mathit{r}\right))-\frac{1}{t}I({\mathit{U}}_{A};{\mathit{U}}_{B}|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})+2\epsilon .\hfill \end{array}$$

Similar to before, given ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in S},S=A,B$ there are deterministic functions ${f}_{A},{f}_{B}$ where ${f}_{A}\left({\mathit{U}}_{A}\right)={\mathit{Y}}_{A}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A}^{*},$ and ${f}_{B}\left({\mathit{U}}_{B}\right)={\mathit{Y}}_{B}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in B}^{*}$ with ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in S}^{*}$ iid to ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in S}$, $S=A,B$. Compute:
where, again, the first line follows by the data processing inequality and the second line since the conditional is independent of the rest of the mutual information’s terms. For large enough t, the second inequality follows by Lemma 1. Combining Equations (A20), (A22) and (A27),

$$\begin{array}{}\mathrm{(A23)}& \hfill \frac{1}{t}I({\mathit{U}}_{A};{\mathit{U}}_{B}|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})& \ge \frac{1}{t}I({f}_{A}\left({\mathit{U}}_{A}\right);{f}_{B}\left({\mathit{U}}_{B}\right)|{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A\cup B})\hfill \mathrm{(A24)}& & =\frac{1}{t}I({f}_{A}\left({\mathit{U}}_{A}\right);{f}_{B}\left({\mathit{U}}_{B}\right))\hfill \mathrm{(A25)}& & =\frac{1}{t}[h({\mathit{Y}}_{A}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A}^{*})+h({\mathit{Y}}_{B}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in B}^{*})-\dots h({\mathit{Y}}_{A}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in A}^{*},{\mathit{Y}}_{B}-{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in B}^{*})]\hfill \mathrm{(A26)}& & \ge h({Y}_{A}+{W}_{A})+h({Y}_{B}+{W}_{B})-h({Y}_{A}+{W}_{A},{Y}_{B}+{W}_{B})-3{\epsilon}^{\prime}\hfill \mathrm{(A27)}& & =I({\mathit{Z}}_{A}\left(\mathit{r}\right);{\mathit{Z}}_{B}\left(\mathit{r}\right))-3{\epsilon}^{\prime}\hfill \end{array}$$

$$0\le H\left({\mathit{U}}_{A\cup B}^{t}\right)-I({\mathit{Y}}_{A\cup B}^{t};{\mathit{Z}}_{A\cup B}^{t}\left(\mathit{r}\right))\le 5{\epsilon}^{\prime}.$$

Since $S\subseteq [1:N]$, the inductive step will only need to be used up to N times. Letting $0<{\epsilon}^{\prime}<\epsilon \xb7{5}^{-N}$, the initial statement holds. □

A small result is also needed to show that the base having its own full precision observation is approximately equivalent to receiving a high-bitrate helper message. Recall that the base is associated with helper index 0.

If $\forall S\subseteq [1:N]$, then
then, some value ${r}_{0}$ can be chosen sufficiently large so that for any $S\subseteq [0:N]$

$$I({\mathit{Y}}_{S},{\mathit{Z}}_{S}\left(\mathit{r}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{r}\right),{Y}_{0})<\sum _{m\in S}{r}_{m},$$

$$I({\mathit{Y}}_{S},{\mathit{Z}}_{S}\left(\mathit{r}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{r}\right))<\sum _{m\in S}{r}_{m}.$$

Since all the conditional mutual information terms in the statement are continuous in $\mathit{r}$, it is enough to show the statement for $\mathit{r}\succ 0$. In this case, with no loss of generality, we can say there is some $\delta >0$ where:

$$I({\mathit{Y}}_{S},{\mathit{Z}}_{S}\left(\mathit{r}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{r}\right),{Y}_{0})<\sum _{m\in S}{r}_{m}-\delta .$$

Fix ${r}_{0}$ large enough so that for any $S\subseteq [1:N]$,

$$\begin{array}{c}\hfill \u2225I({\mathit{Y}}_{S};{\mathit{Z}}_{S}\left(\mathit{r}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{r}\right),{Y}_{0})-I({\mathit{Y}}_{S};{\mathit{Z}}_{S}\left(\mathit{r}\right)|{\mathit{Z}}_{{S}^{C}}\left(\mathit{r}\right),{Z}_{0}\left({r}_{0}\right))\u2225\le \delta .\end{array}$$

There is guaranteed to be such an ${r}_{0}$ because the above expression is a continuous function of the covariance matrix of $({Y}_{0},\mathit{Y},{Z}_{0}\left({r}_{0}\right),\mathit{Z}\left(\mathit{r}\right))$ component-wise, and as ${r}_{0}$ is made large, the covariance matrices involved in the second term converge component-wise to those of the first since $\frac{\parallel {h}_{0}{\parallel}^{2}+{\Sigma}_{1,1}}{{2}^{{r}_{0}}-1}\to 0$ as ${r}_{0}\to \infty $. By Equations (A31) and (A32), the statement holds for any $S\subseteq [1:N],$ and it remains to show that it also holds for each $S\cup \left\{0\right\}.$

Note that, for any $S\subseteq [1:N]$,
where Equation (A35) follows because conditioning reduces mutual information, and Equation (A36) follows from what was shown at the start. □

$$\begin{array}{}\mathrm{(A33)}& \hfill \phantom{\le}& I({\mathit{Y}}_{S}^{t},{\mathit{Y}}_{0}^{t};{\mathit{Z}}_{S}^{t}\left(\mathit{r}\right),{\mathit{Z}}_{0}^{t}\left(\mathit{r}\right)|{\mathit{Z}}_{{(S\cup \left\{0\right\})}^{C}}^{t}\left(\mathit{r}\right))\hfill \mathrm{(A34)}& \hfill =& I({\mathit{Y}}_{S}^{t};{\mathit{Z}}_{S}^{t}\left(\mathit{r}\right)|{\mathit{Z}}_{{(S\cup \left\{0\right\})}^{C}}^{t}\left(\mathit{r}\right))+I({\mathit{Y}}_{0}^{t};{\mathit{Z}}_{0}^{t}\left(\mathit{r}\right)|{\mathit{Z}}_{[1:N]}^{t}\left(\mathit{r}\right))\hfill \mathrm{(A35)}& \hfill \le & I({\mathit{Y}}_{S}^{t};{\mathit{Z}}_{S}^{t}\left(\mathit{r}\right)|{\mathit{Z}}_{{(S\cup \left\{0\right\})}^{C}}^{t}\left(\mathit{r}\right))+{r}_{0}\hfill \mathrm{(A36)}& \hfill \le & \sum _{m\in S}{r}_{m}+{r}_{0}\hfill \end{array}$$

For a distributed receive system as described in Section 2 with noise covariance matrix Σ, LAN constraint L and fixed average helper quantization rates $\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$, a rate ${R}_{G}\left(\mathit{r}\right)$ is achievable.

If some $n\in [0:N]$ has ${r}_{n}=0$, the system is equivalent to the case where the nth receiver’s observation is not present, so, without loss of generality, we assert that ${r}_{n}>\epsilon $. Fix some rate $R<{R}_{G}\left(\mathit{r}\right)$ and a block length $T={t}^{2}\in \mathbb{N}$. Form a message M uniform on $[1:{2}^{tR}].$

**Outline**

Form a rate-R codebook with length-T codewords. At receiver $n\in [0:N]$, form a length-t regular lattice encoder with lattice chosen coarse enough that its encodings have rate ${r}_{n}$. Combine encodings at base and find the typical codeword. Observe that probability of error is low by recognizing that each t-length encoding from a helper approximately contains the information content of the helper’s observation plus Gaussian distortion.

**Transmitter setup**

Generate a codebook mapping $\varphi :[1:{2}^{TR}]\to {\mathbb{C}}^{T\times 1}$ where:
with each component ${x}_{m,\ell}$ drawn iid from $\mathcal{CN}\text{\hspace{0.17em}}(0,1)$. Reveal $\varphi $ to the transmitter and base.

$$\varphi \left(m\right)=({x}_{m,1},\dots ,{x}_{m,T})\in {\mathbb{C}}^{T\times 1}$$

**Helper encoder setup**

For each helper $n,\phantom{\rule{4pt}{0ex}}n\in [1:N]$, generate a dither vector ${\tilde{\mathit{W}}}_{Q,n}={\left({\tilde{\mathit{W}}}_{Q,n}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$, where each successive t-segment is uniform in the base region of the Voroni partition ${\mathcal{P}}_{n}^{t}$ of a regular, white t-dimensional lattice ${\mathcal{L}}_{n}^{t}$ which has the following normalized-second-moment:

$${G}^{t}({\mathcal{L}}_{n}^{t},{\mathcal{P}}_{n}^{t})=\frac{\parallel {h}_{n}{\parallel}^{2}+{\Sigma}_{n,n}}{{2}^{{r}_{n}-\epsilon}-1}.$$

These terms are detailed at the beginning of Reference [20]. Reveal ${\tilde{\mathit{W}}}_{Q,n}$ to the base and helper n. Similarly, at the base, generate a dither vector ${\tilde{\mathit{W}}}_{Q,0}={\left({\tilde{\mathit{W}}}_{Q,0}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$ with ${r}_{0}$ chosen large enough for Lemma 3 to hold.

**Transmission**

To send message $M\in [1:{2}^{TR}],$ have the transmitter broadcast $\mathit{X}=({X}_{1},\dots ,{X}_{T})=\varphi \left(M\right).$

**Helper encoding and forwarding**

Helper n ($n\in [0:N])$ observes a sequence of length T, ${\mathit{Y}}_{n}={\left({\mathit{Y}}_{n}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$. Now ${\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}={({\mathit{Y}}_{n}^{\ell}-{\tilde{\mathit{W}}}_{Q,n}^{\ell})}_{\ell}\in {\mathbb{C}}^{T}={\left({\mathbb{C}}^{t}\right)}^{t}$ is composed of t consecutive length-t segments. For the ℓ-th length-t segment ($\ell \in [1:t]$), form a quantization ${U}_{n}^{\ell}\in {\mathcal{L}}_{n}^{t}$ by finding the point in ${\mathcal{L}}_{n}^{t}$ associated with the region in ${\mathcal{P}}_{n}^{t}$ in which that segment resides. The properties of such ${U}_{n}$ are the subject of References [20,29]. By the extension of Theorem 1 in Appendix A of Reference [29],

$$H\left({U}_{n}^{\ell}\right|{\tilde{\mathit{W}}}_{Q,n})=\frac{1}{t}I({\mathit{Y}}_{n};{\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}).$$

Further, by Theorem 6 in Reference [20],
where ${Y}_{n}$ has the distribution of any individual component of ${\mathit{Y}}_{n}$ and ${W}_{Q,n}\sim \mathcal{CN}(0,{G}^{t}({\mathcal{L}}_{n}^{t},{\mathcal{P}}_{n}^{t}))$ (in agreement with notation in Section 3.2). Thus, the t encoded messages produced by helper $n,$${\left({U}_{n}^{\ell}\right)}_{\ell}$, are within the LAN constraint for large enough block length T. Helper $n\in [1:N]$ forwards ${\left({U}_{n}^{\ell}\right)}_{\ell}$ to the base.

$$\frac{1}{t}I\left({\mathit{Y}}_{n};{\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}\right)\underset{t\uparrow \infty}{\to}I({Y}_{n};\phantom{\rule{4pt}{0ex}}{Y}_{n}+{W}_{Q,n})={r}_{n}-\epsilon $$

**Decoding**

At the base, receive all of the lattice quantizations,

$$\mathit{U}={\left({\left({U}_{n}^{\ell}\right)}_{\ell \in [1:t]}\right)}_{n\in [0:N]}.$$

Take ${A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$ to be the collection of
which are jointly $\epsilon $-weakly-typical with respect to the joint distribution of $(\mathit{X},\mathit{U}),$ conditional on all the dithers ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in [0:N]}$. Weak- and joint-typicality are defined in Reference [14].

$$\left({\mathit{x}}_{m},\mathit{u}\right)\in \mathrm{Range}\left(\varphi \right)\times {\left(\prod _{n=0}^{N}{\mathcal{L}}_{n}^{t}\right)}^{t}$$

At the base, find a message estimate $\widehat{m}\in [1:{2}^{TR}]$ where $(\varphi \left(\widehat{m}\right),\mathit{U})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$. Declare error events ${\mathcal{E}}_{0}$ if M is not found to be typical with $\mathit{U}$, and ${\mathcal{E}}_{1}$ if there is some $\widehat{m}\ne M$ where $(\varphi \left(\widehat{m}\right),\mathit{U})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$.

**Error analysis**

By typicality and the law of large numbers, $P\left({\mathcal{E}}_{0}\right)\to 0$ as $t\to \infty $. In addition,
$\mathit{Z}(\xb7)$ is defined as in Equation (25). Equation (A47) follows by the data processing inequality. Equation (A48) follows by combining the entropies into a mutual information, then using the worst additive noise lemma [27]. Thus, if R is chosen less than $I(X;\mathit{Z}(\mathit{r}-\mathbf{1}\xb7\epsilon \left)\right)-4\epsilon $ then $P({\mathcal{E}}_{0}\cup {\mathcal{E}}_{1})\to 0$ as $T\to \infty $. Mutual information is lower semi-continuous so $I(X;\mathit{Z}(\mathit{r}-\mathbf{1}\xb7\epsilon \left)\right)-4\epsilon $ can be made arbitrarily close to ${R}_{G}\left(\mathit{r}\right)$. □

$$\begin{array}{}\mathrm{(A43)}& \hfill P\left({\mathcal{E}}_{1}\right)& \le \sum _{\widehat{m}\ne M}P\left(\{\varphi \left(\widehat{m}\right):(\varphi \left(\widehat{m}\right),\mathit{U})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})\}\right)\hfill \mathrm{(A44)}& & \le \sum _{\widehat{m}\ne M}{2}^{-t\xb7(I({X}_{1},\dots ,{X}_{t};{\left({\mathit{U}}_{n}^{1}\right)}_{n\in [0:N]}|{\left({\tilde{\mathit{W}}}_{Q,n}^{1}\right)}_{n\in [0:N]})-3\epsilon )}\hfill \mathrm{(A45)}& & <{2}^{-t\xb7(I({X}_{1},\dots ,{X}_{t};{\left({\mathit{U}}_{n}^{1}\right)}_{n\in [0:N]}|{\left({\tilde{\mathit{W}}}_{Q,n}^{1}\right)}_{n\in [0:N]})-3\epsilon )+TR}\hfill \mathrm{(A46)}& & ={2}^{-t\xb7(h({X}_{1},\dots ,{X}_{t})-h({X}_{1},\dots ,{X}_{t}|{\left({\mathit{U}}_{n}^{1}\right)}_{n\in [0:N]},{\left({\tilde{\mathit{W}}}_{Q,n}^{1}\right)}_{n,\ell})-3\epsilon -tR)}\hfill \mathrm{(A47)}& & \le {2}^{-t\xb7(h({X}_{1},\dots ,{X}_{t})-h({X}_{1},\dots ,{X}_{t}|{({\mathit{Y}}_{n}^{\ell}-{\tilde{\mathit{W}}}_{Q,n}^{1})}_{n})-3\epsilon -tR)}\hfill \mathrm{(A48)}& & \le {2}^{-t\xb7(t\xb7I(X;\mathit{Z}(\mathit{r}-\mathbf{1}\xb7\epsilon ))-4\epsilon -tR)}\hfill \end{array}$$

For a distributed receive system as described in Section 2 with noise covariance matrix Σ and LAN constraint L, a rate ${R}_{DC}\left(L\right)$ is achievable.

Apply Theorem 7 with $\lambda =0$ (proof shown below). □

For a distributed receive system as described in Section 2 with noise covariance matrix Σ and LAN constraint L then ${R}_{\overline{DC}}\left(L\right)$ is achievable.

It is enough to show that any component in the maximization from Equation (36) is achievable. Fix $\lambda >0,$ a helper rate vector $\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right)$, and a compression rate vector $\mathit{\rho}\in {\mathcal{R}}_{DC}^{\lambda}\left(\mathit{r}\right)$. If some $n\in [0:N]$ has ${r}_{n}=0$ or ${\rho}_{n}=0$, the system is equivalent to the case where the nth helper is not present, so, without loss of generality, we assert that each ${r}_{n},{\rho}_{n}>\epsilon $. Fix some rate $R<{R}_{G}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )-\lambda $ and a block length $T={t}^{2}\in \mathbb{N}$. Form a message M uniform on $[1:{2}^{tR}]$.

**Outline**

Form a rate-R codebook with length-T codewords. At receiver $n\in [0:N]$, form a length-t regular lattice encoder with lattice chosen coarse enough that its encodings have rate ${r}_{n}$. Randomly bin the encodings down to rate ${\rho}_{n}$. At the base find the codeword jointly typical with the binned encodings. Observe that probability of error is low by recognizing that the unbinned t-length encodings from helpers approximately contain the joint-information content of the helper’s observations plus Gaussian distortion.

**Transmitter setup**

Generate a codebook mapping $\varphi :[1:{2}^{TR}]\to {\mathbb{C}}^{T\times 1}$ where
with each component ${x}_{m,\ell}$ drawn iid from $\mathcal{CN}\text{\hspace{0.17em}}(0,1)$. Reveal $\varphi $ to the transmitter and base.

$$\varphi \left(m\right)=({x}_{m,1},\dots ,{x}_{m,T})\in {\mathbb{C}}^{T\times 1}$$

**Helper encoder setup**

For each helper $n,\phantom{\rule{4pt}{0ex}}n\in [1:N]$, generate a dither vector ${\tilde{\mathit{W}}}_{Q,n}={\left({\tilde{\mathit{W}}}_{Q,n}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$, where each successive t-segment is uniform in the base region of the Voroni partition ${\mathcal{P}}_{n}^{t}$ of a regular, white $t-$dimensional lattice ${\mathcal{L}}_{n}^{t}$ which has the following normalized-second-moment:

$${G}^{t}({\mathcal{L}}_{n}^{t},{\mathcal{P}}_{n}^{t})=\frac{\parallel {h}_{n}{\parallel}^{2}+{\Sigma}_{n,n}}{{2}^{{\rho}_{n}-\epsilon}-1}.$$

These terms are detailed at the beginning of Reference [20]. Reveal ${\tilde{\mathit{W}}}_{Q,n}$ to the base and helper n. Similarly at the base generate a dither vector ${\tilde{\mathit{W}}}_{Q,0}={\left({\tilde{\mathit{W}}}_{Q,0}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$ with ${\rho}_{0}$ chosen large enough for Lemma 3 to hold.

For each receiver $n\in [0:N],$ form a random mapping ${\mathsf{Index}}_{n}:{\mathcal{L}}_{n}^{t}\to [1:{2}^{t{r}_{n}}]$ which takes on iid values at each ${u}_{n}\in {\mathcal{L}}_{n}^{t}$:

$${\mathsf{Index}}_{n}\left({u}_{n}\right)\sim \mathsf{Uniform}\left([1:{2}^{t{r}_{n}}]\right)$$

${\mathsf{Index}}_{n}$ is the binning scheme used by receiver $n$. Distribute each ${\mathsf{Index}}_{n}$ to helper n and the base.

**Transmission**

To send a message $M\in [1:{2}^{TR}],$ have the transmitter broadcast $\mathit{X}=({X}_{1},\dots ,{X}_{T})=\varphi \left(M\right).$

**Helper encoding**

Helper n ($n\in [0:N])$ observes a sequence of length T, ${\mathit{Y}}_{n}={\left({\mathit{Y}}_{n}^{\ell}\right)}_{\ell}\in {\left({\mathbb{C}}^{t\times 1}\right)}^{t}$. Now ${\mathit{Y}}_{n}-{\tilde{\mathit{W}}}_{Q,n}={({\mathit{Y}}_{n}^{\ell}-{\tilde{\mathit{W}}}_{Q,n}^{\ell})}_{\ell}\in {\mathbb{C}}^{T}={\left({\mathbb{C}}^{t}\right)}^{t}$ is composed of t consecutive length-t segments. For the ℓ-th length-t segment, $\ell \in [1:t]$, form a quantization ${U}_{n}^{\ell}\in {\mathcal{L}}_{n}^{t}$ by finding the point in ${\mathcal{L}}_{n}^{t}$ associated with the region in ${\mathcal{P}}_{n}^{t}$ in which that segment resides. The properties of such ${U}_{n}$ are the subject of References [20,29].

Applying Lemma 2, then for large enough t, any $S\subseteq [0:N]$ has
where the right-hand inequality comes from choice of $\mathit{\rho}.$

$$\frac{1}{t}H({U}_{S}^{\ell}|{U}_{{S}^{C}}^{\ell},{\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in [0:N]})\underset{t\uparrow \infty}{\to}I({\mathit{Z}}_{S}\left(\mathit{\rho}\right);{\mathit{Y}}_{S}|{\mathit{Z}}_{{S}^{C}}\left(\mathit{\rho}\right))<\sum _{n\in S}{r}_{n}$$

**Helper joint-compression and forwarding**

At receiver $n\in [0:N]$, form binned encodings ${\left({V}_{n}^{\ell}\right)}_{\ell \in [1:t]}$ where for each ℓ, ${V}_{n}^{\ell}={\mathsf{Index}}_{n}\left({U}_{n}^{\ell}\right)$. Note that $|\mathsf{Range}\left({\mathsf{Index}}_{n}\right)|\le {2}^{t\xb7{r}_{n}}$ so $H\left({V}_{n}^{\ell}\right)\le t\xb7{r}_{n}$ and $H\left({\left({V}_{n}^{\ell}\right)}_{\ell}\right)\le T\xb7{r}_{n}$ so that the LAN constraint is satisfied.

Each receiver forwards ${\left({V}_{n}^{\ell}\right)}_{\ell}$ to the base.

**Decoding**

At the base, receive all all the binned helper encodings $\mathit{V}={\left({\left({V}_{n}^{\ell}\right)}_{\ell}\right)}_{n}.$

Take ${A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$ to be the set of
which are jointly $\epsilon $-weakly-typical with respect to the joint distribution of $(\mathit{X},\mathit{U})$, conditional on all the dithers ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n\in [0:N]}$. Weak- and joint-typicality are defined in Reference [14].

$$\left({\mathit{x}}_{m},({\mathit{u}}^{1},\dots ,{\mathit{u}}^{t})\right)\in \mathsf{Range}\left(\varphi \right)\times {\left(\prod _{n=0}^{N}{\mathcal{L}}_{n}^{t}\right)}^{t}$$

For an ensemble of bin indices from all the receivers, $\mathit{v}={\left({\left({v}_{n}^{\ell}\right)}_{\ell}\right)}_{n}\in {\prod}_{n=0}^{N}{[1:{2}^{t\xb7{r}_{n}}]}^{t}$, define:

$$\begin{array}{cc}\hfill {B}_{\mathit{v}}& \triangleq \left\{\mathit{u}\in {\left(\prod _{n=0}^{N}{\mathcal{L}}_{n}^{t}\right)}^{t}|{\mathsf{Index}}_{k}\left({u}_{k}^{\ell}\right)={v}_{k}^{\ell},k\in [0:N],\ell \in [1:t]\right\}.\hfill \end{array}$$

Each ${B}_{\mathit{v}}$ represents the set of helper lattice quantizations represented by the ensemble of helper bin indices $\mathit{v}$. At the base, find $\widehat{m}$ for which there is some $\widehat{\mathit{u}}\in {B}_{\mathit{V}}$ where $(\varphi \left(\widehat{m}\right),\widehat{\mathit{u}})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$. Declare the message associated with $\widehat{x}$ to be the broadcast.

**Error analysis**

We have the following error events:

- ${\mathcal{E}}_{JT}$: $\mathit{X}$ is not typical with anything in ${B}_{\mathit{V},[0:N]}$.
- ${\mathcal{E}}_{\widehat{m},S}$: For $S\subseteq [0:N],$ there is some $\widehat{m}\ne M$ and $\widehat{\mathit{u}}=({\widehat{\mathit{u}}}_{S},{\widehat{\mathit{u}}}_{{S}^{C}})\in {B}_{\mathit{V}}$ where $(\varphi \left(\widehat{m}\right),\widehat{\mathit{u}})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})$ and ${\widehat{\mathit{u}}}_{{S}^{C}}={\mathit{U}}_{{S}^{C}}$, but any $n\in S$ has ${\widehat{\mathit{u}}}_{n}\ne {\mathit{U}}_{n}$. ${\mathcal{E}}_{\widehat{m},S}$ denotes the situation where the base identifies the wrong quantizations for all the receivers in S.

By the law of large numbers and properties of typical sets, as t becomes large, eventually $P\left({\mathcal{E}}_{JT}\right)<\epsilon $. For each $S\subseteq [0:N]$, take ${A}_{\epsilon}^{T}\left(\mathit{U}\right|{\mathit{W}}_{Q})$ to be the collection of jointly-typical sequences in ${\left({\prod}_{n}{\mathcal{L}}_{n}^{t}\right)}^{t}$ up to the joint distribution on ${\left({\mathit{U}}_{n}^{\ell}\right)}_{n\in S}$ (any $\ell \in [1:t]$), conditioned on ${\left({\tilde{\mathit{W}}}_{Q,n}\right)}_{n}$. Now compute:
where $\tilde{\mathit{X}}$ is a random variable independent of and distributed identically to $\mathit{X}$. Then, with high probability for large enough t,
where Equation (A58) follows by construction of ${B}_{\mathit{V}}$. Taking the $log,$

$$\begin{array}{cc}\hfill \sum _{\widehat{m}\ne M}P\left({\mathcal{E}}_{\widehat{m},S}\right)& \le {2}^{TR}\xb7\sum _{\begin{array}{c}\tilde{\mathit{u}}\in {A}_{\epsilon}^{T}\left(\mathit{U}\right|{\mathit{W}}_{Q}):\\ {\tilde{\mathit{u}}}_{{S}^{C}}={\mathit{U}}_{{S}^{C}}\end{array}}{\mathbf{1}}_{{B}_{\mathit{V}}}\left(\tilde{\mathit{u}}\right)\xb7P\left((\tilde{\mathit{X}},\tilde{\mathit{u}})\in {A}_{\epsilon}^{T}(\mathit{X},\mathit{U}|{\mathit{W}}_{Q})\right)\hfill \end{array}$$

$$\begin{array}{}\mathrm{(A56)}& \hfill \sum _{\widehat{m}\ne M}P\left({\mathcal{E}}_{\widehat{m},S}\right)& \le {2}^{TR}\xb7\sum _{\begin{array}{c}\tilde{\mathit{u}}\in {A}_{\epsilon}^{T}\left(\mathit{U}\right|{\mathit{W}}_{Q}):\\ {\tilde{\mathit{u}}}_{{S}^{C}}={\mathit{U}}_{{S}^{C}}\end{array}}{\mathbf{1}}_{{B}_{\mathit{V}}}\left(\tilde{\mathit{u}}\right)\xb7{2}^{-t\xb7(I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}\right)}_{n})-3\epsilon )}\hfill \mathrm{(A57)}& & \le {2}^{TR}\xb7\left|\{\tilde{\mathit{u}}\in {A}_{\epsilon}^{T}\left(\mathit{U}\right|{\mathit{W}}_{Q})|{\tilde{\mathit{u}}}_{{S}^{C}}={\mathit{U}}_{{S}^{C}}\}\right|\xb7(1+\epsilon )\xb7P(\mathit{U}\in {B}_{\mathit{V}})\xb7{2}^{-t\xb7(I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}\right)}_{n})-3\epsilon )}\hfill \mathrm{(A58)}& & \le {2}^{TR}\xb7{2}^{t\xb7(H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}},{\left({\mathit{W}}_{Q,n}\right)}_{n})+\epsilon )}\xb7{2}^{-t\xb7(t\xb7{\sum}_{n\in S}{r}_{n}-\epsilon )}\xb7{2}^{-t\xb7(I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}^{\ell}\right)}_{n})-3\epsilon )}\hfill \end{array}$$

$$\begin{array}{}\mathrm{(A59)}& & log\left(\sum _{\widehat{m}\ne M}P\left({\mathcal{E}}_{\widehat{m},S}\right)\right)\hfill \mathrm{(A60)}& \hfill \le & TR+t\xb7(H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}},{\left({\mathit{W}}_{Q,n}\right)}_{n})+\epsilon )-t\xb7(t\xb7\sum _{n\in S}{r}_{n}-\epsilon )-t\xb7\left(I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}\right)}_{n})-3\epsilon \right)+\epsilon \hfill \mathrm{(A61)}& \hfill \le & {\textstyle t\xb7(H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}},{\left({\mathit{W}}_{Q,n}\right)}_{n})-t\xb7{\sum}_{n\in S}{r}_{n}+tR-I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}\right)}_{n})+6\epsilon )}\hfill \mathrm{(A62)}& \hfill \le & {\textstyle {t}^{2}\xb7(I({\mathit{Y}}_{S};{\mathit{Z}}_{S}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )|{\mathit{Z}}_{{S}^{C}}(\mathit{\rho}-\mathbf{1}\xb7\epsilon ))-{\sum}_{n\in S}{r}_{n}+R-I(X;\mathit{Z}(\mathit{\rho}-\mathbf{1}\xb7\epsilon ))+8\epsilon /t).}\hfill \end{array}$$

Equation (A62) follows by using Lemma 2 on $H\left({\mathit{U}}_{S}\right|{\mathit{U}}_{{S}^{C}})/t$ and using reasoning identical to Equations (A45)–(A48) on $I(\mathit{X};\mathit{U}|{\left({\mathit{W}}_{Q,n}\right)}_{n})$. Recall that R was chosen so that $R\le {R}_{G}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )-\lambda =I(X;\mathit{Z}(\mathit{\rho}-\mathbf{1}\xb7\epsilon ))-\lambda $. Then, Equation (A62) will approach $-\infty $ as $t\to \infty $ (thereby $P\left({\cup}_{\widehat{m},S}{\mathcal{E}}_{\widehat{m},S}\right)$ approaches 0) if all $S\subseteq [0:N]$ satisfy:

$$I({\mathit{Y}}_{S};{\mathit{Z}}_{S}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )|{\mathit{Z}}_{{S}^{C}}(\mathit{\rho}-\mathbf{1}\xb7\epsilon ))<\lambda +\sum _{m\in S}{r}_{m}-8\epsilon .$$

By assumption that $\mathit{\rho}\in {\mathcal{R}}_{DC}^{\lambda}\left(\mathit{r}\right)$ and for small enough $\epsilon ,$ then Equation (A63) holds for each S. Since all error events approach 0, then a rate of ${R}_{G}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )-\lambda $ is achievable. For small enough $\epsilon ,$ by lower semi-continuity of mutual information ${R}_{G}(\mathit{\rho}-\mathbf{1}\xb7\epsilon )-\lambda $ can be made arbitrarily close to ${R}_{G}\left(\mathit{\rho}\right)-\lambda $. □

The capacity of the system is ${R}_{\overline{DC}}\left(L\right)$ under the following restrictions:

- Σ is diagonal (no interference).
- The base does not have its own full-precision observation of the broadcast (${h}_{0}=0$).
- The broadcaster must transmit a Gaussian signal.
- Construction of helper messages is independent of the transmitter’s codebook $\mathcal{X}$.

By Theorem 5 in Reference [1] (variable names altered, constants adapted for complex variables), the capacity of the system under the assumed restrictions is

$$\underset{\begin{array}{c}\mathit{r}\in {\mathcal{R}}_{\mathrm{LAN}}\left(L\right),\\ \mathit{V}\in \mathcal{V}\end{array}}{max}{C}_{\mathit{r},\mathit{V}}$$

$${C}_{\mathit{r},\mathit{V}}\triangleq \underset{S\subseteq [1:N]}{min}\left\{\sum _{m\in S}[{r}_{m}-I({V}_{m};{Y}_{m}|X)]+I({\mathit{V}}_{{S}^{C}};X)\right\}.$$

In Equation (A65), $\mathcal{V}$ is the collection of random vectors $\mathit{V}=({V}_{1},\dots ,{V}_{N})$ whose components are of the form ${V}_{n}={Y}_{n}+{\tilde{W}}_{n}$ with independently distributed ${\tilde{W}}_{n}$:
for any ${v}_{n}\ge 0$. Since both the variance in Equation (A66) as a function of ${v}_{n}$ and Equation (26) as a function of ${r}_{n}$ are injective on $(0,\infty )$, then for any $\mathit{V}\in \mathcal{V}$ there is some $\mathit{\rho}$ (in the context of Equation (37); either inside or outside ${\cup}_{\lambda \in \mathbf{R}}{\mathcal{R}}_{DC}^{\lambda}\left(L\right)$) which will yield a variable $\mathit{Z}\left(\mathit{\rho}\right)$ with identical distribution to $\mathit{V}$.

$${\tilde{W}}_{n}\sim \mathcal{CN}\left(0,{\sigma}_{n}^{2}\xb7\frac{{2}^{-2{v}_{n}}}{1-{2}^{-2{v}_{n}}}\right)$$

Fixing helper rates $\mathit{r},$ for any $S\subseteq [1:N]$, we can instead write ${C}_{\mathit{r},\mathit{V}}$ as:
where $I({\mathit{V}}_{S};{\mathit{Y}}_{S}|X)\le {\sum}_{m\in S}I({V}_{m};{Y}_{m}|X)$ follows because, for each n, ${V}_{n}\leftrightarrow (X,{Y}_{n})\leftrightarrow ({\mathit{V}}_{\{j\ne n\}},{\mathit{Y}}_{\{j\ne n\}})$ is a Markov chain. Maximizing over $\mathit{r}$, Equation (A70) is identical to the set of rates shown in Theorem 7. □

$$\begin{array}{}\mathrm{(A67)}& \hfill {C}_{\mathit{r},\mathit{V}}\phantom{\rule{-2cm}{0ex}}& \hfill =max\left\{c:\forall S\subseteq [1:N],\phantom{\le}c\le I({\mathit{V}}_{{S}^{C}};X)+\sum _{m\in S}{r}_{m}-I({V}_{m};{Y}_{m}|X)\right\}\mathrm{(A68)}& & \hfill \le max\left\{c:\forall S\subseteq [1:N],\phantom{\le}c-I({\mathit{V}}_{S};X)+I({\mathit{V}}_{S};{\mathit{Y}}_{S}|X)\le \sum _{m\in S}{r}_{m}\right\}\mathrm{(A69)}& & \hfill \le max\left\{c:\forall S\subseteq [1:N],\phantom{\le}[c-I(X;\mathit{V})]+I({\mathit{Y}}_{S};{\mathit{V}}_{S}|{\mathit{V}}_{{S}^{C}})\le \sum _{m\in S}{r}_{m}\right\}\mathrm{(A70)}& & \hfill =\underset{\lambda \in \mathbb{R}}{max}max\left\{I(X;\mathit{V})-\lambda :\forall S\subseteq [1:N]I({\mathit{Y}}_{S};{\mathit{V}}_{S}|{\mathit{V}}_{{S}^{C}})\le \sum _{m\in S}{r}_{m}+\lambda \right\}\end{array}$$

- Sanderovich, A.; Shamai, S.; Steinberg, Y.; Kramer, G. Communication via Decentralized Processing. IEEE Trans. Inf. Theory
**2008**, 54, 3008–3023. [Google Scholar] [CrossRef] - Sanderovich, A.; Shamai, S.; Steinberg, Y. On Upper bounds for Decentralized MIMO Receiver. In Proceedings of the 2007 IEEE International Symposium on Information Theory, Nice, France, 24–29 June 2007; pp. 1056–1060. [Google Scholar]
- Chapman, C.; Margetts, A.R.; Bliss, D.W. Bounds on the Information Capacity of a Broadcast Channel with Quantizing Receivers. In Proceedings of the IEEE 2015 49th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 8–11 November 2015; pp. 281–285. [Google Scholar]
- Kramer, G.; Gastpar, M.; Gupta, P. Cooperative Strategies and Capacity Theorems for Relay Networks. IEEE Trans. Inf. Theory
**2005**, 51, 3037–3063. [Google Scholar] [CrossRef] - Bliss, D.W.; Kraut, S.; Agaskar, A. Transmit and Receive Space-Time-Frequency Adaptive Processing for Cooperative Distributed MIMO Communications. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, Japan, 25–30 March 2012; pp. 5221–5224. [Google Scholar]
- Quitin, F.; Irish, A.; Madhow, U. Distributed Receive Beamforming: A Scalable Architecture and Its Proof of Concept. In Proceedings of the IEEE 77th Vehicular Technology Conference (VTC Spring), Dresden, Germany, 2–5 June 2013; pp. 1–5. [Google Scholar]
- Vagenas, E.D.; Karadimas, P.; Kotsopoulos, S.A. Ergodic Capacity for the SIMO Nakagami-M Channel. EURASIP J. Wirel. Commun. Netw.
**2009**, 2009, 802067. [Google Scholar] [CrossRef] - Love, D.J.; Choi, J.; Bidigare, P. Receive Spatial Coding for Distributed Diversity. In Proceedings of the 2013 Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 3–6 November 2013; pp. 602–606. [Google Scholar]
- Choi, J.; Love, D.J.; Bidigare, P. Coded Distributed Diversity: A Novel Distributed Reception Technique for Wireless Communication Systems. IEEE Trans. Signal Process.
**2015**, 63, 1310–1321. [Google Scholar] [CrossRef] - Brown, D.R.; Ni, M.; Madhow, U.; Bidigare, P. Distributed Reception with Coarsely-Quantized Observation Exchanges. In Proceedings of the 2013 47th Annual Conference on Information Sciences and Systems (CISS), Altimore, MD, USA, 20–22 March 2013; pp. 1–6. [Google Scholar]
- Berger, T.; Zhang, Z.; Viswanathan, H. The CEO Problem [Multiterminal Source Coding]. IEEE Trans. Inf. Theory
**1996**, 42, 887–902. [Google Scholar] [CrossRef] - Tavildar, S.; Viswanath, P.; Wagner, A.B. The Gaussian Many-Help-One Distributed Source Coding Problem. IEEE Trans. Inf. Theory
**2010**, 56, 564–581. [Google Scholar] [CrossRef] - Yedla, A.; Pfister, H.D.; Narayanan, K.R. Code Design for the Noisy Slepian-Wolf Problem. IEEE Trans. Commun.
**2013**, 61, 2535–2545. [Google Scholar] [CrossRef] - Cover, T.M.; Thomas, J.A. Elements Informations Theory; John Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar]
- Shannon, C.E. A Mathematical Theory of Communication. ACM SIGMOBILE Mob. Comput. Commun. Rev.
**2001**, 5, 3–55. [Google Scholar] [CrossRef] - Telatar, I.E. Capacity of Multi-Antenna Gaussian Channels. Eur. Trans. Telecommun.
**1999**, 10, 585–595. [Google Scholar] [CrossRef] - Gamal, A.E.; Kim, Y. Network Information Theory; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
- Weingarten, H.; Steinberg, S.; Shamai, S. The Capacity Region of the Gaussian Multiple-Input Multiple-Output Broadcast Channel. IEEE Trans. Inf. Theory
**2006**, 52, 3936–3964. [Google Scholar] [CrossRef] - Wei, Y.; Cioffi, J.M. Sum Capacity of Gaussian Vector Broadcast Channels. IEEE Trans. Inf. Theory
**2004**, 50, 1875–1892. [Google Scholar] - Zamir, R.; Feder, M. On lattice Quantization Noise. IEEE Trans. Inf. Theory
**1996**, 42, 1152–1159. [Google Scholar] [CrossRef] - Boyd, S.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
- Slepian, D.; Wolf, J.K. Noiseless Coding of Correlated Information Sources. IEEE Trans. Inf. Theory
**1973**, 19, 471–480. [Google Scholar] [CrossRef] - Costa, M. Writing on Dirty Paper (corresp.). IEEE Trans. Inf. Theory
**1983**, 29, 439–441. [Google Scholar] [CrossRef] - Goldsmith, A. Wireless Communications; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
- Bliss, D.W.; Govindasamy, S. Adaptive Wireless Communications: MIMO Channels and Networks; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
- Jones, E.; Oliphant, T.; Peterson, P. SciPy: Open Source Scientific Tools for Python. 2001. Available online: http://www.scipy.org/ (accessed on 10 September 2017).
- Diggavi, S.N.; Cover, T.M. The Worst Additive Noise Under a Covariance Constraint. IEEE Trans. Inf. Theory
**2001**, 47, 3072–3081. [Google Scholar] [CrossRef] - Ihara, S. On the Capacity of Channels with Additive Non-Gaussian Noise. Inf. Control
**1978**, 37, 34–39. [Google Scholar] [CrossRef] - Zamir, R.; Feder, M. On Universal Quantization by Randomized Uniform/Lattice Quantizers. IEEE Trans. Inf. Theory
**1992**, 38, 428–436. [Google Scholar] [CrossRef]

$\mathcal{CN}\text{}(0,\Sigma )$ | Complex circularly symmetric normal distribution with zero mean and Hermitian covariance matrix $\Sigma $ |

$I(\xb7;\xb7),H(\xb7),h(\xb7)$ | Mutual information, Shannon entropy and differential entropy |

${\mathbf{A}}^{\u2020},{\mathit{a}}^{\u2020}$ | Conjugate transpose of matrix $\mathbf{A}$, vector $\mathit{a}$ |

${\mathit{a}}_{S};S\subseteq T$ | Vector with components indexed by S |

$\left|\mathbf{A}\right|$ | Determinant of a matrix $\mathbf{A}$ |

$[a:b]$ | Integers from a to b, inclusive |

$\mathit{a}\u2ab00$ | Vector $\mathit{a}$ with non-negative elements |

$\mathbf{A}\u2ab00$ | Positive semidefinite matrix $\mathbf{A}$ |

${\mathbf{1}}_{N}$ | N-row vector of 1 s |

${\mathbf{A}}_{j,k}$ | Element on row j, column k of a matrix $\mathbf{A}$ |

$\mathrm{diag}\left(\mathit{a}\right)$ | Diagonal matrix where the $(i,i)\mathrm{th}$ diagonal element is the $i\mathrm{th}$ element of vector $\mathit{a}$ |

${\mathbf{I}}_{n\times n}$ | $n\times n$ identity matrix |

$\parallel x\parallel $ | Vector magnitude/2-norm |

${\mathbb{R}}_{+}$ | Nonnegative real numbers |

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).