Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels

Lee, Sungmin; Min, Moonsik

doi:10.3390/math10224343

Open AccessArticle

Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels

by

Sungmin Lee

¹ and

Moonsik Min

^1,2,*

¹

School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Republic of Korea

²

School of Electronics Engineering, Kyungpook National University, Daegu 41566, Republic of Korea

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(22), 4343; https://doi.org/10.3390/math10224343

Submission received: 20 October 2022 / Revised: 15 November 2022 / Accepted: 17 November 2022 / Published: 18 November 2022

Download

Browse Figures

Versions Notes

Abstract

In this study, the maximum error-free transmission rate of an additive white Gaussian noise channel with a symmetric analog-to-digital converter (ADC) was derived as a composite function of the binary entropy function, Gaussian Q-function, and the square root function, assuming that the composite function was convex on the set of all non-negative real numbers. However, because mathematically proving this convexity near zero is difficult, studies in this field have only presented numerical results for small values in the domain. Because the low-signal-to-noise (SNR) regime is considered to be a major application area for one-bit ADCs in wireless communication, deriving a concrete proof of the convexity of the composite function on small SNR values (non-negative values near zero) is important. Therefore, this study proposes a novel proof for convexity, which is satisfied for all non-negative values, based on the continuity of the involved functions.

Keywords:

convexity; entropy; mutual information; channel capacity

MSC:

94A05

1. Introduction

The capacity of a communication channel is defined as the maximum data rate that can be transmitted over the channel with an arbitrarily small error probability. In information theory, the channel capacity is given by the supremum of the mutual information between the input and output of the channel [1]. For example, consider the channel

Y = X + N

, where

X \in R

and

Y \in R

are input and output random variables, respectively;

N \in R

is an additive noise independent of X and follows a zero-mean Gaussian distribution; and

R

denotes the set of real numbers. This type of channel is referred to as an additive white Gaussian noise (AWGN) channel, and its capacity is given by

\frac{1}{2} {log}_{2} (1 + γ)

, where

γ

is the signal-to-noise ratio (SNR) defined as

γ ≜ \frac{E [| X |^{2}]}{E [| N |^{2}]}

. In recent decades, extensive research on wireless communications has been conducted based on the aforementioned channel capacity and its convexity [2,3].

To satisfy the consistently increasing demand for data rates, recent standards for wireless communication have focused on a large amount of unused bandwidth in the millimeter-wave (mmWave) and terahertz frequency bands [4,5,6,7]. This is because using a significantly wider bandwidth available in a higher-frequency band, such as the mmWave band, is a good solution to satisfy the fast-growing demand for higher data rates. However, such an increase in the bandwidth can cause dissipation of an extensive amount of power when using conventional high-resolution analog-to-digital converters (ADCs) because the power consumption of ADCs linearly increases with increasing sampling rates [8]. This power consumption issue is critical for mobile devices because it is directly related to the battery lifetime as well as the available power resources for uplink communications.

Receiver structures based on low-resolution ADCs have attracted considerable attention as a useful solution for exploiting the abundant resources available in extremely high-frequency bands while achieving low power consumption [9]. In particular, an extreme case that uses one-bit ADCs at the receivers has been widely studied because it is the most power-efficient solution. Extensive studies have been devoted to the evaluation of the performance of communication channels with one-bit ADCs [9,10,11,12,13,14,15,16,17,18,19,20,21,22]. The capacity of a single-input and single-output (SISO) real-valued additive white Gaussian noise (AWGN) channel with one-bit ADCs was derived in [10]. In [10], a symmetric one-bit ADC was assumed, such that the channel output was given by

Y_{Q} = f_{Q} (X + N)

, where

f_{Q} (\cdot)

denotes the quantization function defined as

\begin{matrix} f_{Q} (x) = \{\begin{matrix} 1, & x \geq 0 \\ - 1, & x < 0 \end{matrix} . \end{matrix}

(1)

The channel capacity of this one-bit quantized AWGN channel was derived as

1 - H_{b} (Q (\sqrt{γ}))

[10], where

H_{b}

and Q denote the binary entropy and Gaussian Q-functions, respectively. Further extending this result, the capacity of complex fading channels with multiple transmit and receive antennas was considered in [11]. For example, assuming perfect channel state information at the transmitter, the capacity of a multiple-input and single-output (MISO) complex fading channel was derived for the given channel components. Moreover, certain upper and lower bounds were presented for multiple-input and multiple-output (MIMO) complex channels. Furthermore, numerous important studies have derived the capacities of various wireless channels with one-bit ADCs [12,13,14,15,16]. In [12], the tradeoffs between the achievable rates and energy rates were considered when one-bit ADC receivers were used. The authors of [13] used a one-bit ADC for an uplink massive MIMO system, and the corresponding throughput was analyzed. The performance of a one-bit quantized channel with limited channel state information at the transmitter was considered in [14]. In addition to using ADCs in receivers, the application of one-bit digital-to-analog converters to transmitters was considered in [15]. The secrecy capacity of a Gaussian wiretap channel with one-bit ADCs was analyzed in [16]. The detection performance of fading channels with one-bit ADCs was also studied [17,18,19,20,21,22].

Several studies so far have been based on the capacity

1 - H_{b} (Q (\sqrt{γ}))

of the one-bit quantized SISO AWGN channel, which was first derived in [10]. However, when it was derived in [10], the authors assumed that the composite function

H_{b} (Q (\sqrt{x}))

was convex for

x \geq 0

, although the convexity was proved by restricting the domain to

x > c

for a constant c. Thus, the authors did not provide a concrete mathematical proof when x was close to zero; only a numerical result was presented to support their claim of convexity when

0 \leq x \leq c

[10,23]. Because a low-SNR regime is an important application of a one-bit ADC [11], we need to prove the convexity on

0 \leq x \leq c

for the exact derivation of the capacity for all SNR regions. Systems with one-bit ADCs have been extensively studied for the further evolution of 5G and 6G communication. Thus, providing a concrete proof for the convexity of

H_{b} (Q (\sqrt{x}))

, for all

x \geq 0

, will be an important supplement to the theoretical completeness of previous results derived assuming this convexity. Furthermore, the corresponding results can be used to derive unknown channel capacities with one-bit ADCs for various important communication applications, such as multiple-input and multiple-output channels. These results can also be used to solve various optimization problems associated with the design of appropriate resource allocation strategies as the convexity of the capacity function can guarantee the existence of a unique solution depending on the system parameters [24,25].

In this regard, this study proves that the composite function

H_{b} (Q (\sqrt{x}))

is convex for all possible values of the input SNR, i.e., for all

x \geq 0

.

2. Notations and Preliminaries

The function

log x

denotes a logarithmic function with base e. In addition, we use the following notations throughout this paper for simplicity.

Notation 1.

D is defined as the set of nonnegative real numbers:

D = {x \in R : x \geq 0}

. The function

Q : D \to R

denotes the Gaussian Q-function defined as follows, and

Q_{n} (x)

is defined as the n-th order derivative of

Q (x)

:

\begin{matrix} Q (x) ≜ \int_{x}^{\infty} \frac{1}{\sqrt{2 π}} e^{- \frac{r^{2}}{2}} d r, \\ Q_{n} (x) ≜ \frac{d^{n}}{d x^{n}} Q (x) . \end{matrix}

The function

H_{b} : [0, 1] \to R

denotes the binary entropy function defined as

\begin{matrix} H_{b} (x) = - x {log}_{2} x - (1 - x) {log}_{2} (1 - x) . \end{matrix}

Notation 2.

The function

g : D \to R

is defined as follows, and

g_{n} (x)

is defined as the n-th order derivative of

g (x)

on D:

\begin{matrix} g (x) ≜ log (\frac{1 - Q (x)}{Q (x)}), \\ g_{n} (x) ≜ \frac{d^{n}}{d x^{n}} log (\frac{1 - Q (x)}{Q (x)}) . \end{matrix}

Notation 3.

The functions

t : D \to R

and

s : D \to R

are defined as follows, and

t_{n} (x)

and

s_{n} (x)

are defined as the n-th order derivatives of

t (x)

and

s (x)

on D, respectively:

\begin{matrix} t (x) ≜ \frac{Q_{1} (x)}{Q (x)}, s (x) ≜ \frac{Q_{1} (x)}{Q (- x)}, \\ t_{n} (x) ≜ \frac{d^{n} t (x)}{d x^{n}}, s_{n} (x) ≜ \frac{d^{n} s (x)}{d x^{n}} . \end{matrix}

Based on direct differentiation using the chain rule, we can easily derive the followings (see Appendix A for the derivation):

\begin{matrix} Q_{1} (x) = Q_{1} (- x) = - \frac{1}{\sqrt{2 π}} e^{- \frac{x^{2}}{2}}, Q_{2} (x) = - x Q_{1} (x), \end{matrix}

(2)

\begin{matrix} \frac{d}{d x} H_{b} (x) = {log}_{2} (1 - x) - {log}_{2} x, \end{matrix}

(3)

\begin{matrix} t_{1} (x) = - t (x) (x + t (x)), s_{1} (x) = - s (x) (x - s (x)), \end{matrix}

(4)

\begin{matrix} t_{2} (x) = - t_{1} (x) (x + t (x)) - t (x) (1 + t_{1} (x)), \end{matrix}

(5)

\begin{matrix} s_{2} (x) = - s_{1} (x) (x - s (x)) - s (x) (1 - s_{1} (x)), \end{matrix}

(6)

\begin{matrix} g_{1} (x) = - t (x) - s (x), g_{2} = (t (x) + s (x)) (x + t (x) - s (x)) . \end{matrix}

(7)

Moreover, the limiting value of

t_{1} (x)

and

s_{1} (x)

can be obtained as follows.

Lemma 1.

The functions

t_{1} (x)

and

s_{1} (x)

have the following limits:

\begin{matrix} lim_{x \to \infty} t_{1} (x) = - 1, lim_{x \to 0} s_{1} (x) = \frac{4}{2 π} . \end{matrix}

(8)

Moreover, for all

x \in D

, the following inequalities are satisfied:

\begin{matrix} g (x) \geq 0, g_{1} (x) \geq 0, \end{matrix}

(9)

\begin{matrix} t (x) \leq 0, s (x) \leq 0, \end{matrix}

(10)

\begin{matrix} x + t (x) \leq 0, \end{matrix}

(11)

\begin{matrix} t_{1} (x) \leq 0, s_{1} (x) \geq 0 . \end{matrix}

(12)

Proof.

See Appendix A. □

Lemma 1 includes a summary of inequalities that are essential for deriving the main results presented in the following section. Each result in this lemma does not have a specific physical meaning. Nevertheless, the results play key roles in proving the convexity of

H_{b} (Q (\sqrt{x})

as described in the following section.

3. Proof of Convexity

Based on Lemma 1, we can derive the following results.

Lemma 2.

For all

x \in D

, we have

\begin{matrix} - 1 \leq t_{1} (x) \leq 0, 0 \leq s_{1} (x) \leq 1 . \end{matrix}

(13)

Proof.

By (8) and (12), it suffices to prove that

t_{1} (x) \geq - 1

and

s_{1} (x) \leq 1

for all

x > 0

.

First, a proof by contradiction is used to prove that

t_{1} (x) \geq - 1

for all

x > 0

. To this end, suppose that there exists a positive real value a that satisfies

t_{1} (a) < - 1

, or equivalently,

t_{1} (a) = - 1 - δ_{0}

for some

δ_{0} > 0

. Because

t_{1} (x)

is continuous for

x \geq 0

, there must be an open interval

(a, a + ϵ)

inside domain D with some

ϵ > 0

on which

t_{1} (x) < - 1

. Let

ϵ^{*}

be the supremum of all possible values of

ϵ

. If

ϵ^{*}

is finite, there exists a

δ_{1} > 0

such that

t_{1} (x) < - 1

on

(a, a + ϵ^{*})

and

t_{1} (x) \geq - 1

on

[a + ϵ^{*}, a + ϵ^{*} + δ_{1})

, by the definition of

ϵ^{*}

. However, by (5), with the inequalities in Lemma 1,

t_{1} (x) < - 1

implies that

t_{2} (x) < 0

on

(a, a + ϵ^{*})

, which further implies that

t_{1} (x)

is strictly decreasing on

(a, a + ϵ^{*})

. As

t_{1} (x)

is strictly decreasing on

(a, a + ϵ^{*})

, we must have

{lim}_{x \to a + ϵ^{*}} t_{1} (x) \leq t_{1} (a) = - 1 - δ_{0}

. Because

t_{1} (x)

is continuous, this contradicts the assumption that

t_{1} (x) \geq - 1

on

[a + ϵ^{*}, a + ϵ^{*} + δ_{1})

, which was induced by assuming a finite value of

ϵ^{*}

. Similarly, if

ϵ^{*}

diverges to infinity, then

t_{1} (x)

must be strictly decreasing on

(a, \infty)

. As

t_{1} (a) = - 1 - δ_{0}

, this implies that

{lim}_{x \to \infty} t_{1} (x) \leq - 1 - δ_{0}

, but this contradicts (8). Therefore, we conclude that

t_{1} (x) \geq - 1

for all

x > 0

.

Similarly, a proof by contradiction is used to prove that

s_{1} (x) \leq 1

for all

x > 0

. For the sake of contradiction, suppose that there exists a positive real value b that satisfies

s_{1} (b) > 1

, or equivalently,

s_{1} (b) = 1 + ρ_{0}

, for some

ρ_{0} > 0

. Because

s_{1} (x)

is continuous for

x \geq 0

, there must be an open interval

(b - τ, b)

inside domain D with some

τ > 0

on which

s_{1} (b) > 1

. Let

τ^{*}

be the supremum of all possible values of

τ

. If

τ^{*} < b

, there exists a

ρ_{1} > 0

such that

s_{1} (x) > 1

on

(b - τ^{*}, b)

and

s_{1} (x) \leq 1

on

(b - τ^{*} - ρ_{1}, b - τ^{*}]

, by the definition of

τ^{*}

. However, by (6), with the inequalities in Lemma 1,

s_{1} (x) > 1

implies that

s_{2} (x) < 0

on

(b - τ^{*}, b)

, which further implies that

s_{1} (x)

is strictly decreasing on

(b - τ^{*}, b)

. As

s_{1} (x)

is strictly decreasing on

(b - τ^{*}, b)

, we must have

{lim}_{x \to b - τ^{*}} s_{1} (x) \geq s_{1} (b) = 1 + ρ_{0}

. Because

s_{1} (x)

is continuous, this contradicts the assumption that

s_{1} (x) \leq 1

on

(b - τ^{*} - ρ_{1}, b - τ^{*}]

. Similarly, if

τ^{*} = b

,

s_{1} (x)

must be strictly decreasing on

[0, b)

. As

s_{1} (b) > 1

, this implies that

{lim}_{x \to 0} s_{1} (x) > 1

, but this contradicts (8). Therefore, we conclude that

s_{1} (x) \leq 1

for all

x > 0

. □

Figure 1 verifies Lemma 2, which can be used to derive the following result.

Lemma 3.

For all

x \in D

, the following inequality is true.

\begin{matrix} g (x) + x g_{1} (x) - g_{2} (x) \geq 0 . \end{matrix}

(14)

Proof.

From (7), we have

g (x) + x g_{1} (x) - g_{2} (x) = g (x) + (2 x + t (x) - s (x)) g_{1} (x)

. Because both

g (x)

and

g_{1} (x)

are nonnegative (9), it suffices to prove that

2 x + t (x) - s (x) \geq 0

. By definition,

{lim}_{x \to 0} [t (x) - s (x)] = 0

, such that

{lim}_{x \to 0} [2 x + t (x) - s (x)] = 0

. Thus, it suffices to prove that

\frac{d (2 x + t (x) - s (x))}{d x} = 2 + t_{1} (x) - s_{1} (x) \geq 0

, or equivalently,

s_{1} (x) - t_{1} (x) \leq 2

. Because

s_{1} (x) \geq 0

(by (12)) and

- t_{1} (x) \geq 0

(by (10)), the triangle inequality implies that

s_{1} (x) - t_{1} (x) \leq | t_{1} (x) | + | s_{1} (x) |

, and Lemma 2 further implies that

s_{1} (x) - t_{1} (x) \leq | t_{1} (x) | + | s_{1} (x) | \leq 2

. □

In Figure 2, function

g (x) + x g_{1} (x) - g_{2} (x)

is depicted to demonstrate the result in Lemma 3.

Lemma 4.

The composite function

H_{b} (Q (\sqrt{x}))

is convex on D if and only if

\begin{matrix} (x + \frac{1}{x}) g (x) \geq g_{1} (x) \end{matrix}

(15)

on D.

Proof.

A real-valued twice differentiable function of one variable is convex if and only if its derivative is nondecreasing (i.e., the second-order derivative is nonnegative). Thus, we investigate the derivative of

H_{b} (Q (\sqrt{x}))

, which can be calculated using (2) and (3), as

\begin{matrix} \frac{d}{d x} H_{b} (Q (\sqrt{x})) & = {log}_{2} (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})}) (- \frac{1}{\sqrt{2 π}} e^{- \frac{x}{2}}) (\frac{1}{2 \sqrt{x}}) \\ = \frac{- e^{- \frac{x}{2}}}{2 (log 2) \sqrt{2 π x}} log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})}) . \end{matrix}

Thus,

\frac{d}{d x} H_{b} (Q (\sqrt{x}))

is nondecreasing if and only if

\frac{d}{d x} v (x) \leq 0

, where

v (x) = x^{- \frac{1}{2}} e^{- \frac{x}{2}} log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})})

. Then,

\frac{d}{d x} v (x)

can be calculated as

\begin{matrix} \frac{d}{d x} v (x) \\ = - \frac{1}{2} e^{- \frac{x}{2}} x^{- \frac{3}{2}} [(x + 1) log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})}) - 2 x \frac{d}{d x} \{log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})})\}] . \end{matrix}

Hence,

\frac{d}{d x} v (x) \leq 0

if and only if

\begin{matrix} \frac{1}{2} (1 + \frac{1}{x}) log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})}) & = \frac{1}{2} (1 + \frac{1}{x}) g (\sqrt{x}) \\ \geq \frac{d}{d x} \{log (\frac{1 - Q (\sqrt{x})}{Q (\sqrt{x})})\} = \frac{g_{1} (\sqrt{x})}{2 \sqrt{x}}, \end{matrix}

or, by multiplying both sides by

2 \sqrt{x}

, it follows that

\frac{d}{d x} v (x) \leq 0

if and only if

\begin{matrix} (\sqrt{x} + \frac{1}{\sqrt{x}}) g (\sqrt{x}) \geq g_{1} (\sqrt{x}) . \end{matrix}

Therefore, we conclude that

H_{b} (Q (\sqrt{x}))

is convex on D if and only if

(\sqrt{x} + \frac{1}{\sqrt{x}}) g (\sqrt{x}) \geq g_{1} (\sqrt{x})

on D. Because the function

y = \sqrt{x}

has a one-to-one correspondence from D to D, proving that

(y + \frac{1}{y}) g (y) \geq g_{1} (y)

for all

y \in D

is equivalent to proving that

(\sqrt{x} + \frac{1}{\sqrt{x}}) g (\sqrt{x}) \geq g_{1} (\sqrt{x})

for all

x \in D

; thus, the proof is complete. □

Lemma 5.

If

\frac{g (x)}{x} \leq g_{1} (x)

on a subset of D, then

\begin{matrix} \frac{d}{d x} [(x + \frac{1}{x}) g (x)] - g_{2} (x) \geq 0 \end{matrix}

(16)

on the subset.

Proof.

The left-hand side of (16) can be calculated as

\begin{matrix} \frac{d}{d x} [(x + \frac{1}{x}) g (x)] - g_{2} (x) & = (1 - \frac{1}{x^{2}}) g (x) + (x + \frac{1}{x}) g_{1} (x) - g_{2} (x) \\ = \frac{1}{x} (g_{1} (x) - \frac{g (x)}{x}) + g (x) + x g_{1} (x) - g_{2} (x) . \end{matrix}

As we know that

g (x) + x g_{1} (x) - g_{2} (x) \geq 0

on D by Lemma 3,

g_{1} (x) - \frac{g (x)}{x} \geq 0

implies that

\frac{d}{d x} [(x + \frac{1}{x}) g (x)] - g_{2} (x) \geq 0

. □

Theorem 1.

The composite function

H_{b} (Q (\sqrt{x}))

is convex on D.

Proof.

For simplicity, let

f (x) = (x + \frac{1}{x}) g (x)

. First, by applying L’Hôpital’s rule, we have

lim_{x \to 0} f (x) = lim_{x \to 0} [\frac{g (x)}{x}] = lim_{x \to 0} g_{1} (x) .

Thus, by Lemma 4, the proof is complete if

f (x) \geq g_{1} (x)

is true for all

x > 0

.

For the sake of contradiction, suppose that there exists a real value

a > 0

on which

f (a) < g_{1} (a)

. Because both

f (x)

and

g_{1} (x)

are continuous, this implies that there exists an open interval

(a - ϵ, a)

inside D on which

f (x) < g_{1} (x)

. Moreover, because

{lim}_{x \to 0} f (x) = {lim}_{x \to 0} g_{1} (x)

, there must exist an

ϵ_{0} > 0

and a corresponding open interval

(a - ϵ_{0}, a)

that satisfies

\begin{matrix} f (x) < g_{1} (x) on (a - ϵ_{0}, a) \end{matrix}

(17)

and

\begin{matrix} lim_{x \to a - ϵ_{0}} f (x) = lim_{x \to a - ϵ_{0}} g_{1} (x) . \end{matrix}

(18)

For example,

ϵ_{0} = a

if

f (x)

is strictly less than

g_{1} (x)

for all x in

\in (0, a)

. Combining (17) and (18), we obtain

f (x) \leq g_{1} (x)

on

[a - ϵ_{0}, a)

. Because

f (x) = (x + \frac{1}{x}) g (x)

and

g (x) \geq 0

, this implies that

\frac{g (x)}{x} \leq g_{1} (x)

on

[a - ϵ_{0}, a)

. By Lemma 5,

\frac{g (x)}{x} \leq g_{1} (x)

further implies that

\begin{matrix} \frac{d}{d x} [f (x) - g_{1} (x)] \geq 0 on [a - ϵ_{0}, a) . \end{matrix}

(19)

Subsequently, (18) and (19) imply that

f (x)

must be greater than or equal to

g_{1} (x)

on

(a - ϵ_{0}, a)

. However, this contradicts (17) and thus, the proof is completed. □

4. Discussion and Conclusions

This letter provides a mathematical proof for the convexity of the capacity function of one-bit quantized AWGN channel. Specifically, the continuity of involved functions are used to provide a proof by contradiction. The corresponding result can be an important supplement for the theoretical completeness of previous studies as numerous important studies have derived the capacities of various wireless channels with one-bit ADCs assuming that the convexity is true without providing a mathematical proof when SNR is near zero.

For example, in the proof of Theorem 2 provided in [10], Jensen’s inequality was applied to the conditional entropy

H (Y_{Q} | X)

, which is derived as

\begin{matrix} H (Y_{Q} | X) = E [H_{b} (Q (\sqrt{\frac{{| X |}^{2}}{σ_{N}^{2}}}))], \end{matrix}

if

Y_{Q}

is the one-bit quantized output of an AWGN channel such that

Y_{Q} = f_{Q} (X + N)

, where X is the transmit signal, N is the zero-mean Gaussian noise with variance

σ_{N}^{2}

, and

f_{Q}

is the quantization function defined in (1). The convexity of

H_{b} (Q (\sqrt{x}))

guarantees the following result based on Jensen’s inequality:

\begin{matrix} H (Y_{Q} | X) \geq H_{b} (Q (\sqrt{E [\frac{{| X |}^{2}}{σ_{N}^{2}}]})) . \end{matrix}

(20)

Because there exists a transmit signal distribution that satisfies the equality in (20) (e.g., binary phase shift keying (BPSK)), this result implies that the capacity of a one-bit quantized AWGN channel is given by

1 - H_{b} (Q (\sqrt{γ}))

, where

γ = E [\frac{{| X |}^{2}}{σ_{N}^{2}}]

is the SNR. Although this derivation of the capacity assumes the convexity of

H_{b} (Q (\sqrt{x}))

, an analytical proof was presented assuming that

x \geq 2

, and only numerical results were provided for the case of

0 \leq x \leq 2

. Thus, our complete proof of convexity can be a theoretical supplement to this result. In short, the proof guarantees that the capacity cannot be larger than

1 - H_{b} (Q (\sqrt{γ}))

and that BPSK signaling achieves this upper bound. In Figure 3, the mutual information between X and

Y_{Q}

, denoted as

I (X, Y_{Q})

, is compared with the capacity

1 - H_{b} (Q (\sqrt{γ}))

. BPSK signaling with equal probabilities is used as the probability distribution of X. The numerical results are consistent with the analysis results. In [11], authors extended this result to the multi-antenna complex fading channels. For MISO cases, the capacity was derived based on the convexity of

H_{b} (Q (\sqrt{x}))

on

x \geq 0

by directly extending the capacity of the SISO channel with a one-bit ADC. Moreover, an upper bound for the capacity of the MIMO channels was derived, and this derivation was also based on the convexity of

H_{b} (Q (\sqrt{x}))

. Thus, our results could supplement these results.

The convexity of the capacity function can also be used for optimization problems to provide appropriate resource allocation. In such cases, the convexity of the objective function can be used to clarify the existence of an optimal solution and identify it. For example, in [25], an optimal power allocation problem was formulated and solved based on the convexity of

H_{b} (Q (\sqrt{x}))

; however, the convexity was assumed with only some numerical results.

With one-bit quantization, deriving the exact capacity is difficult because the quantization function is difficult to analyze. Thus, although analyzing the conventional capacity with infinite-resolution ADCs is feasible, obtaining the capacity for channels with one-bit ADCs may be extremely difficult. For example, the capacity of MIMO channels with one-bit ADCs is generally unknown, although the exact capacity of the MIMO channels with infinite-resolution ADCs is already known. Thus, our results can provide a valid mathematical basis for further derivations of unknown capacities and various optimization problems that use the capacity function of a one-bit quantized AWGN channel.

Author Contributions

Conceptualization, S.L. and M.M.; methodology, S.L.; software, S.L. and M.M.; validation, S.L. and M.M.; formal analysis, S.L. and M.M.; investigation, S.L. and M.M.; resources, M.M.; data curation, S.L.; writing—original draft preparation, S.L.; writing—review and editing, S.L. and M.M.; visualization, S.L.; supervision, M.M.; project administration, S.L. and M.M.; and funding acquisition, S.L. and M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Research Foundation of Korea (NRF), funded by the Korean government (MSIT) (Grant No. 2020R1F1A1071649), and in part by the BK21 FOUR Project, funded by the Ministry of Education, Korea (4199990113966).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare that there is no conflict of interest.

Appendix A. Proof of Lemma 1

Appendix A.1. Derivations of (2) and (3)

The Q-function is defined as

\begin{matrix} Q (x) = lim_{B \to \infty} \frac{1}{\sqrt{2 π}} \int_{x}^{x + B} e^{- \frac{r^{2}}{2}} d r . \end{matrix}

Thus, the derivative is

\begin{matrix} Q_{1} (x) = \frac{d}{d x} Q (x) & = lim_{B \to \infty} \frac{1}{\sqrt{2 π}} (\frac{d}{d x} \int_{x}^{x + B} e^{- \frac{r^{2}}{2}} d r) \\ \overset{(a)}{=} lim_{B \to \infty} \frac{1}{\sqrt{2 π}} (e^{- \frac{{(x + B)}^{2}}{2}} - e^{- \frac{{(x)}^{2}}{2}}) = - \frac{1}{\sqrt{2 π}} e^{- \frac{x^{2}}{2}}, \end{matrix}

where (a) follows from Leibniz’s rule. Hence, the second-order derivative is

Q_{2} (x) = - \frac{d}{d x} [\frac{1}{\sqrt{2 π}} e^{- \frac{x^{2}}{2}}] = \frac{x}{\sqrt{2 π}} e^{- \frac{x^{2}}{2}} = - x Q_{1} (x) .

Similarly, (3) can be proved through direct differentiation using

\frac{d log x}{d x} = \frac{1}{x}

.

Appendix A.2. Derivations of (4)–(8)

By definition,

\begin{matrix} t_{1} (x) = \frac{d}{d x} \frac{Q_{1} (x)}{Q (x)} = \frac{d}{d x} \frac{Q_{2} (x) Q (x) - {(Q_{1} (x))}^{2}}{{(Q (x))}^{2}} \overset{(a)}{=} - t (x) (x + t (x)), \\ t_{2} (x) = \frac{d}{d x} t_{1} (x) = - t_{1} (x) (x + t (x)) - t (x) (1 + t_{1} (x)), \\ s_{1} (x) = \frac{d}{d x} \frac{Q_{1} (x)}{Q (- x)} = \frac{d}{d x} \frac{Q_{2} (x) Q (- x) + Q_{1} (x) Q_{1} (- x)}{{(Q (- x))}^{2}} \overset{(b)}{=} - s (x) (x - s (x)), \\ s_{2} (x) = \frac{d}{d x} s_{1} (x) = - s_{1} (x) (x - s (x)) - s (x) (1 - s_{1} (x)), \end{matrix}

where

Q_{2} (x) = - x Q_{1} (x)

is used for (a), and

Q_{2} (x) = - x Q_{1} (x)

and

Q_{1} (x) = Q_{1} (- x)

are used for (b). As

Q (x) - \frac{1}{2}

is symmetric with respect to the origin, so it follows that

1 - Q (x) = Q (- x)

. Thus,

g_{1} (x)

and

g_{2} (x)

can be calculated as follows:

\begin{matrix} g_{1} (x) & = \frac{d}{d x} log (\frac{Q (- x)}{Q (x)}) = \frac{Q (x)}{Q (- x)} \frac{d}{d x} (\frac{Q (- x)}{Q (x)}) \\ = (\frac{Q (x)}{Q (- x)}) (\frac{- Q_{1} (- x) Q (x) - Q (- x) Q_{1} (x)}{{(Q (x))}^{2}}) \\ = - \frac{Q_{1} (- x)}{Q (- x)} - \frac{Q_{1} (x)}{Q (x)} \overset{(a)}{=} - t (x) - s (x), \end{matrix}

where

Q_{1} (x) = Q_{1} (- x)

is used for (a). By differentiating this,

\begin{matrix} g_{2} (x) & = - t_{1} (x) - s_{1} (x) \overset{(a)}{=} t (x) (x + t (x)) + s (x) (x - s (x)) \\ = t (x) x + {[t (x)]}^{2} + s (x) x - {[s (x)]}^{2} = (t (x) + s (x)) (x + t (x) - s (x)), \end{matrix}

where (4) is used for (a).

Finally, the limiting values in (8) are obtained as follows:

\begin{matrix} lim_{x \to \infty} t_{1} (x) = lim_{x \to \infty} [- t (x) (x + t (x))] = lim_{x \to \infty} \frac{x Q (x) Q_{1} (x) + {[Q_{1} (x)]}^{2}}{- {[Q (x)]}^{2}} \\ \overset{(a)}{=} lim_{x \to \infty} \frac{Q (x) Q_{1} (x) + x {[Q_{1} (x)]}^{2} + x Q (x) Q_{2} (x) + 2 Q_{1} (x) Q_{2} (x)}{- 2 Q (x) Q_{1} (x)} \\ \overset{(b)}{=} lim_{x \to \infty} \frac{(1 - x^{2}) Q (x) - x Q_{1} (x)}{- 2 Q (x)} \\ \overset{(c)}{=} lim_{x \to \infty} \frac{- 2 x Q (x) + (1 - x^{2}) Q_{1} (x) - Q_{1} (x) - x Q_{2} (x)}{- 2 Q_{1} (x)} \overset{(d)}{=} lim_{x \to \infty} \frac{x Q (x)}{Q_{1} (x)}, \end{matrix}

(A1)

where L’Hôpital’s rule is used for (a) and (c), and

Q_{2} (x) = - x Q_{1} (x)

is used for (b) and (d). The authors in [26] showed that

- \frac{x}{1 + x^{2}} Q_{1} (x) < Q (x) < - \frac{Q_{1} (x)}{x}

, which is equivalent to

- \frac{x^{2}}{1 + x^{2}} > \frac{x Q (x)}{Q_{1} (x)} > - 1

as

\frac{Q_{1} (x)}{x} < 0

. Because

{lim}_{x \to \infty} - \frac{x^{2}}{1 + x^{2}} = - 1

, we have

{lim}_{x \to \infty} \frac{x Q (x)}{Q_{1} (x)} = - 1

; thus, from (A1), we finally have

{lim}_{x \to \infty} t_{1} (x) = - 1

.

From (2),

{lim}_{x \to 0} Q_{1} (x) = - \frac{1}{\sqrt{2 π}}

, and by definition,

{lim}_{x \to 0} Q (- x) = \frac{1}{2}

. Thus,

\begin{matrix} lim_{x \to 0} s_{1} (x) & = - lim_{x \to 0} [s (x) (x - s (x))] \\ = - lim_{x \to 0} [\frac{x Q_{1} (x)}{Q (- x)} - \frac{{[Q_{1} (x)]}^{2}}{{[Q (- x)]}^{2}}] \\ = lim_{x \to 0} \frac{{[Q_{1} (x)]}^{2}}{{[Q (- x)]}^{2}} = \frac{4}{2 π} . \end{matrix}

Appendix A.3. Proofs of (9)–(11)

By definition,

Q (x) \geq 0

for

x \in R

. Moreover,

Q_{1} (x) < 0

and

\frac{1}{Q (x)} > 1

for

x \in D

. Thus, for all

x \in D

, it is clear that

t (x) = \frac{Q_{1} (x)}{Q (x)} \leq 0

,

s (x) = \frac{Q_{1} (x)}{Q (- x)} \leq 0

,

g (x) = log (\frac{1}{Q (x)} - 1) \geq 0

, and

g_{1} (x) = - t (x) - s (x) \geq 0

. For (11), we use

Q (x) < - \frac{Q_{1} (x)}{x}

, which was derived in [26]. That is,

\begin{matrix} Q (x) < - \frac{Q_{1} (x)}{x} & ⟺ \frac{Q (x)}{- Q_{1} (x)} < \frac{1}{x} (∵) - Q_{1} (x) > 0 \\ ⟺ \frac{- Q_{1} (x)}{Q (x)} > x ⟺ 0 > x + \frac{Q_{1} (x)}{Q (x)} = x + t (x) . \end{matrix}

Then, because

t_{1} (x) = - t (x) (x + t (x))

and

s_{1} (x) = - s (x) (x - s (x))

, the inequalities from (9) to (11) imply (12).

References

Thomas, J.A.; Cover, T.M. Elements of Information Theory; Wiley: Hoboken, NJ, USA, 2001. [Google Scholar]
Tse, D.; Viswanath, P. Fundamentals of Wireless Communication; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
Heath, R.W., Jr.; Lozano, A. Foundations of MIMO Communication; Cambridge University Press: Cambridge, UK, 2018. [Google Scholar]
Swindlehurst, A.L.; Ayanoglu, E.; Heydari, P.; Capolino, F. Millimeter-wave massive MIMO: The next wireless revolution? IEEE Commun. Mag. 2014, 52, 56–62. [Google Scholar] [CrossRef]
Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next generation wireless systems. IEEE Commun. Mag. 2014, 52, 186–195. [Google Scholar] [CrossRef]
Busari, S.A.; Huq, K.M.S.; Mumtaz, S.; Dai, L.; Rodriguez, J. Millimeter-Wave Massive MIMO Communication for Future Wireless Systems: A Survey. IEEE Commun. Surv. Tut. 2017, 20, 836–869. [Google Scholar] [CrossRef]
Saleem, A.; Cui, H.; He, Y.; Boag, A. Channel propagation characteristics for massive multiple-input/multiple-output systems in a tunnel environment. IEEE Antennas Propag. Mag. 2022, 126–142. [Google Scholar] [CrossRef]
Walden, R. Analog-to-Digital Converter Survey and Analysis. IEEE J. Select. Areas Commun. 1999, 17, 539–550. [Google Scholar] [CrossRef]
Zhang, J.; Dai, L.; Li, X.; Liu, Y.; Hanzo, L. On low-resolution ADCs in practical 5G millimeter-wave massive MIMO systems. IEEE Commun. Mag. 2018, 56, 205–211. [Google Scholar] [CrossRef]
Singh, J.; Dabeer, O.; Madhow, U. On the limits of communication with low-precision analog-to-digital conversion at the receiver. IEEE Trans. Commun. 2009, 57, 3629–3639. [Google Scholar] [CrossRef]
Mo, J.; Heath, R.W., Jr. Capacity Analysis of One-Bit Quantized MIMO Systems with Transmitter Channel State Information. IEEE Trans. Signal Process. 2015, 63, 5498–5512. [Google Scholar] [CrossRef]
Mo, J.; Alkhateeb, A.; Abu-Surra, S.; Heath, R.W., Jr. Hybrid architectures with few-bit ADC receivers: Achievable rates and energy-rate tradeoffs. IEEE Trans. Wirel. Commun. 2017, 16, 2274–2287. [Google Scholar] [CrossRef]
Jacobsson, S.; Durisi, G.; Coldrey, M.; Gustavsson, U.; Studer, C. Throughput analysis of massive MIMO uplink with low-resolution ADCs. IEEE Trans. Wirel. Commun. 2017, 16, 4038–4051. [Google Scholar] [CrossRef]
Mo, J.; Heath, R.W., Jr. Limited feedback in single and multi-user MIMO systems with finite-bit ADCs. IEEE Trans. Wirel. Commun. 2018, 17, 3284–3297. [Google Scholar] [CrossRef]
Nam, Y.; Do, H.; Jeon, Y.-S.; Lee, N. On the Capacity of MISO Channels with One-Bit ADCs and DACs. IEEE J. Sel. Areas Commun. 2019, 37, 2132–2145. [Google Scholar] [CrossRef]
Nam, S.H.; Lee, S.H. Secrecy Capacity of a Gaussian Wiretap Channel With ADCs is Always Positive. IEEE Trans. Inf. Theory 2022, 68, 1186–1196. [Google Scholar] [CrossRef]
Choi, J.; Mo, J.; Heath, R.W., Jr. Near Maximum-Likelihood Detector and Channel Estimator for Uplink Multiuser Massive MIMO Systems with One-Bit ADCs. IEEE Trans. Commun. 2016, 64, 2005–2018. [Google Scholar] [CrossRef]
Mollén, C.; Choi, J.; Larsson, E.G.; Heath, R.W., Jr. Uplink Performance of Wideband Massive MIMO with One-Bit ADCs. IEEE Trans. Wirel. Commun. 2017, 16, 87–100. [Google Scholar] [CrossRef]
Hong, S.-N.; Kim, S.; Lee, N. A Weighted Minimum Distance Decoding for Uplink Multiuser MIMO Systems with Low-Resolution ADCs. IEEE Trans. Commun. 2018, 66, 1912–1924. [Google Scholar] [CrossRef]
Jeon, Y.-S.; Lee, N.; Hong, S.-N.; Heath, R.W., Jr. One-Bit Sphere Decoding for Uplink Massive MIMO Systems with One-Bit ADCs. IEEE Trans. Wirel. Commun. 2018, 17, 4509–4521. [Google Scholar] [CrossRef]
Jeon, Y.-S.; Do, H.; Hong, S.-N.; Lee, N. Soft-Output Detection Methods for Sparse Millimeter Wave MIMO Systems with Low-Precision ADCs. IEEE Trans. Commun. 2019, 67, 2822–2836. [Google Scholar] [CrossRef]
Li, Y.; Tao, C.; Seco-Granados, G.; Mezghani, A.; Swindlehurst, A.L.; Liu, L. Channel Estimation and Performance Analysis of One-Bit Massive MIMO Systems. IEEE Trans. Signal Process. 2017, 65, 4075–4089. [Google Scholar] [CrossRef]
Dabeer, O.; Singh, J.; Madhow, U. On the limits of communication performance with one-bit analog-to-digital conversion. In Proceedings of the 2006 IEEE 7th Workshop on Signal Processing Advances in Wireless Communications, Cannes, France, 2–5 July 2006; pp. 1–5. [Google Scholar]
Boyd, S.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Min, M. Optimal Power Allocation for Multiple Channels With One-Bit Analog-to-Digital Converters. IEEE Trans. Vehi. Tech. 2022, 71, 4438–4443. [Google Scholar] [CrossRef]
Borjesson, P.; Sundberg, C.-E. Simple Approximations of the Error Function Q(x) for Communications Applications. IEEE Trans. Commun. 1979, 27, 39–643. [Google Scholar] [CrossRef]

Figure 1. Verification for Lemma 2.

Figure 2. Verification for Lemma 3.

Figure 3. Verification for the capacity function.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, S.; Min, M. Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels. Mathematics 2022, 10, 4343. https://doi.org/10.3390/math10224343

AMA Style

Lee S, Min M. Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels. Mathematics. 2022; 10(22):4343. https://doi.org/10.3390/math10224343

Chicago/Turabian Style

Lee, Sungmin, and Moonsik Min. 2022. "Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels" Mathematics 10, no. 22: 4343. https://doi.org/10.3390/math10224343

APA Style

Lee, S., & Min, M. (2022). Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels. Mathematics, 10(22), 4343. https://doi.org/10.3390/math10224343

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Convexity of the Capacity of One-Bit Quantized Additive White Gaussian Noise Channels

Abstract

1. Introduction

2. Notations and Preliminaries

3. Proof of Convexity

4. Discussion and Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Proof of Lemma 1

Appendix A.1. Derivations of (2) and (3)

Appendix A.2. Derivations of (4)–(8)

Appendix A.3. Proofs of (9)–(11)

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI