1. Introduction
The capacity of a communication channel is defined as the maximum data rate that can be transmitted over the channel with an arbitrarily small error probability. In information theory, the channel capacity is given by the supremum of the mutual information between the input and output of the channel [
1]. For example, consider the channel 
, where 
 and 
 are input and output random variables, respectively; 
 is an additive noise independent of 
X and follows a zero-mean Gaussian distribution; and 
 denotes the set of real numbers. This type of channel is referred to as an additive white Gaussian noise (AWGN) channel, and its capacity is given by 
, where 
 is the signal-to-noise ratio (SNR) defined as 
. In recent decades, extensive research on wireless communications has been conducted based on the aforementioned channel capacity and its convexity [
2,
3].
To satisfy the consistently increasing demand for data rates, recent standards for wireless communication have focused on a large amount of unused bandwidth in the millimeter-wave (mmWave) and terahertz frequency bands [
4,
5,
6,
7]. This is because using a significantly wider bandwidth available in a higher-frequency band, such as the mmWave band, is a good solution to satisfy the fast-growing demand for higher data rates. However, such an increase in the bandwidth can cause dissipation of an extensive amount of power when using conventional high-resolution analog-to-digital converters (ADCs) because the power consumption of ADCs linearly increases with increasing sampling rates [
8]. This power consumption issue is critical for mobile devices because it is directly related to the battery lifetime as well as the available power resources for uplink communications.
Receiver structures based on low-resolution ADCs have attracted considerable attention as a useful solution for exploiting the abundant resources available in extremely high-frequency bands while achieving low power consumption [
9]. In particular, an extreme case that uses one-bit ADCs at the receivers has been widely studied because it is the most power-efficient solution. Extensive studies have been devoted to the evaluation of the performance of communication channels with one-bit ADCs [
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22]. The capacity of a single-input and single-output (SISO) real-valued additive white Gaussian noise (AWGN) channel with one-bit ADCs was derived in [
10]. In [
10], a symmetric one-bit ADC was assumed, such that the channel output was given by 
, where 
 denotes the quantization function defined as
      
The channel capacity of this one-bit quantized AWGN channel was derived as 
 [
10], where 
 and 
Q denote the binary entropy and Gaussian Q-functions, respectively. Further extending this result, the capacity of complex fading channels with multiple transmit and receive antennas was considered in [
11]. For example, assuming perfect channel state information at the transmitter, the capacity of a multiple-input and single-output (MISO) complex fading channel was derived for the given channel components. Moreover, certain upper and lower bounds were presented for multiple-input and multiple-output (MIMO) complex channels. Furthermore, numerous important studies have derived the capacities of various wireless channels with one-bit ADCs [
12,
13,
14,
15,
16]. In [
12], the tradeoffs between the achievable rates and energy rates were considered when one-bit ADC receivers were used. The authors of [
13] used a one-bit ADC for an uplink massive MIMO system, and the corresponding throughput was analyzed. The performance of a one-bit quantized channel with limited channel state information at the transmitter was considered in [
14]. In addition to using ADCs in receivers, the application of one-bit digital-to-analog converters to transmitters was considered in [
15]. The secrecy capacity of a Gaussian wiretap channel with one-bit ADCs was analyzed in [
16]. The detection performance of fading channels with one-bit ADCs was also studied [
17,
18,
19,
20,
21,
22].
Several studies so far have been based on the capacity 
 of the one-bit quantized SISO AWGN channel, which was first derived in [
10]. However, when it was derived in [
10], the authors assumed that the composite function 
 was convex for 
, although the convexity was proved by restricting the domain to 
 for a constant 
c. Thus, the authors did not provide a concrete mathematical proof when 
x was close to zero; only a numerical result was presented to support their claim of convexity when 
 [
10,
23]. Because a low-SNR regime is an important application of a one-bit ADC [
11], we need to prove the convexity on 
 for the exact derivation of the capacity for all SNR regions. Systems with one-bit ADCs have been extensively studied for the further evolution of 5G and 6G communication. Thus, providing a concrete proof for the convexity of 
, for all 
, will be an important supplement to the theoretical completeness of previous results derived assuming this convexity. Furthermore, the corresponding results can be used to derive unknown channel capacities with one-bit ADCs for various important communication applications, such as multiple-input and multiple-output channels. These results can also be used to solve various optimization problems associated with the design of appropriate resource allocation strategies as the convexity of the capacity function can guarantee the existence of a unique solution depending on the system parameters [
24,
25].
In this regard, this study proves that the composite function  is convex for all possible values of the input SNR, i.e., for all .
  2. Notations and Preliminaries
The function  denotes a logarithmic function with base e. In addition, we use the following notations throughout this paper for simplicity.
Notation 1.  D is defined as the set of nonnegative real numbers: . The function  denotes the Gaussian Q-function defined as follows, and  is defined as the n-th order derivative of : The function  denotes the binary entropy function defined as  Notation 2.  The function  is defined as follows, and  is defined as the n-th order derivative of  on D:  Notation 3.  The functions  and  are defined as follows, and  and  are defined as the n-th order derivatives of  and  on D, respectively:  Based on direct differentiation using the chain rule, we can easily derive the followings (see 
Appendix A for the derivation): 
Moreover, the limiting value of  and  can be obtained as follows.
Lemma 1.  The functions  and  have the following limits: Moreover, for all , the following inequalities are satisfied:  Lemma 1 includes a summary of inequalities that are essential for deriving the main results presented in the following section. Each result in this lemma does not have a specific physical meaning. Nevertheless, the results play key roles in proving the convexity of  as described in the following section.
  3. Proof of Convexity
Based on Lemma 1, we can derive the following results.
Proof.  By (
8) and (
12), it suffices to prove that 
 and 
 for all 
.
First, a proof by contradiction is used to prove that 
 for all 
. To this end, suppose that there exists a positive real value 
a that satisfies 
, or equivalently, 
 for some 
. Because 
 is continuous for 
, there must be an open interval 
 inside domain 
D with some 
 on which 
. Let 
 be the supremum of all possible values of 
. If 
 is finite, there exists a 
 such that 
 on 
 and 
 on 
, by the definition of 
. However, by (
5), with the inequalities in Lemma 1, 
 implies that 
 on 
, which further implies that 
 is strictly decreasing on 
. As 
 is strictly decreasing on 
, we must have 
. Because 
 is continuous, this contradicts the assumption that 
 on 
, which was induced by assuming a finite value of 
. Similarly, if 
 diverges to infinity, then 
 must be strictly decreasing on 
. As 
, this implies that 
, but this contradicts (
8). Therefore, we conclude that 
 for all 
.
Similarly, a proof by contradiction is used to prove that 
 for all 
. For the sake of contradiction, suppose that there exists a positive real value 
b that satisfies 
, or equivalently, 
, for some 
. Because 
 is continuous for 
, there must be an open interval 
 inside domain 
D with some 
 on which 
. Let 
 be the supremum of all possible values of 
. If 
, there exists a 
 such that 
 on 
 and 
 on 
, by the definition of 
. However, by (
6), with the inequalities in Lemma 1, 
 implies that 
 on 
, which further implies that 
 is strictly decreasing on 
. As 
 is strictly decreasing on 
, we must have 
. Because 
 is continuous, this contradicts the assumption that 
 on 
. Similarly, if 
, 
 must be strictly decreasing on 
. As 
, this implies that 
, but this contradicts (
8). Therefore, we conclude that 
 for all 
.    □
 Figure 1 verifies Lemma 2, which can be used to derive the following result.
 Lemma 3.  For all , the following inequality is true.  Proof.  From (
7), we have 
. Because both 
 and 
 are nonnegative (
9), it suffices to prove that 
. By definition, 
, such that 
. Thus, it suffices to prove that 
, or equivalently, 
. Because 
 (by (
12)) and 
 (by (
10)), the triangle inequality implies that 
, and Lemma 2 further implies that 
.    □
 In 
Figure 2, function 
 is depicted to demonstrate the result in Lemma 3.
Lemma 4.  The composite function  is convex on D if and only ifon D.  Proof.  A real-valued twice differentiable function of one variable is convex if and only if its derivative is nondecreasing (i.e., the second-order derivative is nonnegative). Thus, we investigate the derivative of 
, which can be calculated using (
2) and (
3), as
        
Thus, 
 is nondecreasing if and only if 
, where 
. Then, 
 can be calculated as
        
Hence, 
 if and only if
        
        or, by multiplying both sides by 
, it follows that 
 if and only if
        
Therefore, we conclude that  is convex on D if and only if  on D. Because the function  has a one-to-one correspondence from D to D, proving that  for all  is equivalent to proving that  for all ; thus, the proof is complete.    □
 Lemma 5.  If  on a subset of D, thenon the subset.  Proof.  The left-hand side of (
16) can be calculated as
        
As we know that  on D by Lemma 3,  implies that .    □
 Theorem 1.  The composite function  is convex on D.
 Proof.  For simplicity, let 
. First, by applying L’Hôpital’s rule, we have
        
Thus, by Lemma 4, the proof is complete if  is true for all .
For the sake of contradiction, suppose that there exists a real value 
 on which 
. Because both 
 and 
 are continuous, this implies that there exists an open interval 
 inside 
D on which 
. Moreover, because 
, there must exist an 
 and a corresponding open interval 
 that satisfies
        
        and
        
For example, 
 if 
 is strictly less than 
 for all 
x in 
. Combining (
17) and (
18), we obtain 
 on 
. Because 
 and 
, this implies that 
 on 
. By Lemma 5, 
 further implies that
        
Subsequently, (
18) and (
19) imply that 
 must be greater than or equal to 
 on 
. However, this contradicts (
17) and thus, the proof is completed.    □
   4. Discussion and Conclusions
This letter provides a mathematical proof for the convexity of the capacity function of one-bit quantized AWGN channel. Specifically, the continuity of involved functions are used to provide a proof by contradiction. The corresponding result can be an important supplement for the theoretical completeness of previous studies as numerous important studies have derived the capacities of various wireless channels with one-bit ADCs assuming that the convexity is true without providing a mathematical proof when SNR is near zero.
For example, in the proof of Theorem 2 provided in [
10], Jensen’s inequality was applied to the conditional entropy 
, which is derived as
      
      if 
 is the one-bit quantized output of an AWGN channel such that 
, where 
X is the transmit signal, 
N is the zero-mean Gaussian noise with variance 
, and 
 is the quantization function defined in (
1). The convexity of 
 guarantees the following result based on Jensen’s inequality:
Because there exists a transmit signal distribution that satisfies the equality in (
20) (e.g., binary phase shift keying (BPSK)), this result implies that the capacity of a one-bit quantized AWGN channel is given by 
, where 
 is the SNR. Although this derivation of the capacity assumes the convexity of 
, an analytical proof was presented assuming that 
, and only numerical results were provided for the case of 
. Thus, our complete proof of convexity can be a theoretical supplement to this result. In short, the proof guarantees that the capacity cannot be larger than 
 and that BPSK signaling achieves this upper bound. In 
Figure 3, the mutual information between 
X and 
, denoted as 
, is compared with the capacity 
. BPSK signaling with equal probabilities is used as the probability distribution of 
X. The numerical results are consistent with the analysis results. In [
11], authors extended this result to the multi-antenna complex fading channels. For MISO cases, the capacity was derived based on the convexity of 
 on 
 by directly extending the capacity of the SISO channel with a one-bit ADC. Moreover, an upper bound for the capacity of the MIMO channels was derived, and this derivation was also based on the convexity of 
. Thus, our results could supplement these results.
The convexity of the capacity function can also be used for optimization problems to provide appropriate resource allocation. In such cases, the convexity of the objective function can be used to clarify the existence of an optimal solution and identify it. For example, in [
25], an optimal power allocation problem was formulated and solved based on the convexity of 
; however, the convexity was assumed with only some numerical results.
With one-bit quantization, deriving the exact capacity is difficult because the quantization function is difficult to analyze. Thus, although analyzing the conventional capacity with infinite-resolution ADCs is feasible, obtaining the capacity for channels with one-bit ADCs may be extremely difficult. For example, the capacity of MIMO channels with one-bit ADCs is generally unknown, although the exact capacity of the MIMO channels with infinite-resolution ADCs is already known. Thus, our results can provide a valid mathematical basis for further derivations of unknown capacities and various optimization problems that use the capacity function of a one-bit quantized AWGN channel.