Abstract
Given the mutual information of finite-alphabet inputs cannot be calculated concisely and accurately over fading channels, this paper proposes a new method to calculate the mutual information. First, the applicability of the saddle point method is studied, and then the mutual information is estimated by the saddle point approximation method with known channel state information. Furthermore, we induce the expectation of mutual information over doubly correlated multiple-input multiple-output (MIMO) Rayleigh fading channels. The validity of the saddle point approximation method is verified by comparing the numerical results of the Monte Carlo method and the saddle point approximation method under different doubly correlated MIMO fading channel scenarios.
1. Introduction
Mutual information plays an irreplaceable role in the theoretical analysis of communication system performance, including analysis, evaluation and optimization of transceiver structure [1], encoding and decoding schemes [2], and communication system bit error rate (BER) performance [3], etc., so it attracts increasing research interest. Channel capacity, defined as the upper bound of mutual information, is realized under Gaussian inputs over additive white Gaussian noise (AWGN) channels [4]. A large number of theoretical analyses and research are also based on this concept [5,6,7]. By means of Minkowski’s inequality [1], two lower bounds of system capacity are obtained and are used as selection indexes to discuss the selection of antenna subsets in spatial multiplexing systems. Under the condition of Gaussian inputs, the hybrid encoder and combiners are designed by maximizing the achievable SE [2].
However, Gaussian inputs are rarely realized in practice because the unbounded amplitude of Gaussian distribution may lead to infinite transmitting power, and the continuity of Gaussian distribution will make it difficult to detect and decode the signal at the receiver. In practical communication systems, inputs are usually taken from finite-alphabet constellation sets with average distribution, rather than Gaussian inputs [8]. Considerable gaps in terms of transmitting performance exist [9,10,11] due to the differences between Gaussian and finite-alphabet inputs and then lead to deviations from optimal strategies. For example, it is believed that the traditionally optimal strategy to achieve capacity for Gaussian inputs is to allocate higher power to the sub-channels with a larger signal-to-noise ratio (SNR). In [10], it is demonstrated that such strategy may be quite suboptimal for the reason that the mutual information with finite-alphabet inputs is upper bounded, and there is little incentive to allocate more power to the sub-channels already close to saturation. At the same time, channel capacity reflects the upper bound of communication system performance, so the performance of digital communication systems can be accurately evaluated by mutual information under the condition of finite alphabet sets inputs and actual transmission environment [12,13].
Due to the great complexity of the direct calculation of mutual information, it is almost impossible to obtain a closed-form solution. Monte Carlo trials are usually used for direct and accurate calculation [14]. In order to reduce the computational complexity, a bit-level algorithm using PDF of a log-likelihood ratio (LLR) to calculate mutual information is proposed [15]. However, each modulation mode of the algorithm requires a lot of prior simulations, and it is only suitable for specific scenarios without universality. To optimize linear precoding with finite-alphabet inputs, the authors in [16,17] deduced the closed-form lower and upper bounds of mutual information as alternatives, which reduce the computational effort by several orders of magnitude compared to calculating the average mutual information directly. Then the bounds of mutual information are applied to multiple antennas, secure cognitive radio networks [18], and relay networks [19]. The study in [20] utilized the cutoff rate (CR) as the alternative of mutual information (MI) to design the linear precoders. Mutual information was also used to develop a two-step algorithm to enhance the achievable secrecy rate of cooperative jamming for secure communication with finite-alphabet inputs in [11]. However, the gaps between approximation and accurate mutual information are still ambiguous, which limits the range where mutual information can be applied. Recently, the authors of [21] approximated ergodic mutual information based on multi-exponential decay curve fitting under M-ary quadrature amplitude modulation (M-QAM) signaling, but other modulation modes were neglected.
This study takes a step toward evaluating accurate mutual information for finite-alphabet-based transmissions over doubly correlated MIMO fading channels. After discussing the applicability of the saddle point method, we obtain the approximate solution of mutual information for any known CSI and modulation mode by using this method, which is universal. On this basis, the mutual information expectation of doubly-correlated MIMO Rayleigh fading channels is further derived. This proposition highlights the considerable accuracy with radically reduced complexity.
The outline of this paper is as follows. The second section introduces the MIMO transmission model and its preliminary research. In the third section, the saddle point approximation method is used to estimate the mutual information, and then we calculate the mutual information expectation over doubly correlated MIMO Rayleigh fading channels. Then the validity and accuracy of the proposed method are verified under different doubly correlated MIMO fading channels scenarios in the fourth section. Finally, the fifth section gives conclusions.
2. Problem Formulation
2.1. Model of MIMO Transmission
Consider MIMO system with transmitting antennas and receiving antennas. Let ( denotes the complex spaces) be a transmitting signal vector, satisfying and , where stands for the statistical expectation of random with respect to its variable ; and denote an unit and zero matrix of appropriate dimensions, respectively. is the conjugate transpose of matrix . MIMO transmission is generally modeled by
where preliminaries are made as below.
- 1.
- is a complex fading channel matrix between transmitting antenna and receiving antenna arrays. The doubly correlated MIMO Rayleigh fading channel is modeled by [1], where is a matrix consisted of independent and identically distributed complex Gaussian entries; and are transmitting and receiving correlation matrices, respectively. and can be expressed aswhere and are unitary matrices whose columns are eigenvectors of and ; and represent diagonal matrices whose diagonal entries are the eigenvalues of and , respectively.
- 2.
- stands for AWGN corresponding to NR receiving antennas, where each element is independent and identically complex Gaussian distributed, satisfying and .
2.2. Mutual Information for Finite-Alphabet Inputs
When a linear unitary transform is applied on the receiving signal , the MIMO model in (1) is equivalent to a model with channel matrix and noise [16], which is written as
where and . is selected equiprobably from -dimension constellation consisted of vectors, where denotes the number of symbols in the i-th discrete constellation . The mutual Information for Finite-Alphabet inputs between and is given by [22]
where and . and are the m-th and k-th points in the constellation of , and stands for the Euclidean norm of the variable .
Since the statistical Channel State Information (CSI) is varying much slower than instantaneous , and can be obtained by channel estimation, this work assumed the statistical CSI was a perfectly-known constant. Consequently, the problem was to calculate the average mutual information with given statistical CSI.
3. Saddle Point Approximation for Mutual Information
Mutual information by (4) needs multiple integrals to compute expectation over and . As N increases, it leads to prohibitive complexity and becomes the most significant obstacle in achieving accurate mutual information. Therefore, we used the idea of the mean value theorem of integrals to simplify multiple integrals by finding an appropriate point. In this section, we explore the saddle point approximation method and highlight the convenient calculation with a weighted mean over constellation set of , instead of expectation over all possible samples of random and .
3.1. Saddle Point Approximation
We first considered the expectation over the AWGN vector . The Taylor series of (4) is expanded to
where
Before proceeding to the saddle point approximation, we needed to establish the following lemma, which guarantees the existence of the saddle point.
Lemma 1.
For non-zero natural number, the maximum ofexists and is achieved at, whereis the weighted average vector of,, …, and.
whereis a positive real number over an open interval (0, 1) and satisfies.
The proof of Theorem 1 is shown in Appendix A.
We are now at to perform saddle point approximation.
Proposition 1.
For non-zero natural number, integral over complex AWGN vectoris approximated by
where
The proof of Proposition 1 is shown in Appendix B.
Mutual information is approximated by (5) and (8) as
Generally speaking, the close-form solution is hardly obtained. That is to say, we cannot write down the exact expression of . Optionally, we adopt a numerical method to obtain approximated ,
where is computed by Monte Carlo method by taking BPSK over single-input and single-output (SISO) over AWGN channel (that is ) as an example, and is fixed at each signal to noise ratio (SNR). Thus, (10) can be written as
3.2. Average Mutual Information over Doubly Correlated MIMO Rayleigh Fading Channels
The average mutual information over doubly correlated MIMO Rayleigh fading channels is computed as below,
Since , it is still quite hard to compute the expectation of . Consequently, (13) remains unsuitable for theoretical applications. Obviously, when SNR varies from to , satisfies by (11), so average mutual information is approximated by
For the simplified calculation, the following proposition provides approximate solution:
Proposition 2.
Average mutual information integral overis lower bounded by
The proof of Proposition 2 is shown in Appendix C.
4. Simulation Verification and Result Analysis
This section presents examples to illustrate that the saddle point approximation method is very accurate. We considered an exponential correlation model. According to [23], the correlation matrix elements of transmitting and receiving antennas can be expressed as:
where .
4.1. Accuracy of Saddle Point Approximation
In Figure 1 and Figure 2, doubly correlated Rayleigh fading and Rice fading channel models were considered, respectively. We compared the average mutual information by the Monte Carlo method and saddle point approximation method by (12). Different input types (BPSK, QPSK, QAM, 8PSK, and 16QAM) were assigned to transmitting antennas to ensure generality. Obviously, with the increase in SNR, mutual information presented an upward trend. When the SNR was greater than 15dB, mutual information tended to be stable. In these cases, (12) offered a very good approximation to the average mutual information for known channel state information.
Figure 1.
Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters () over doubly correlated Rayleigh fading channel model.
Figure 2.
Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters () over doubly correlated Rice fading channel model.
Figure 3 compares normalized MI calculated by Monte Carlo and saddle point approximation according to upper and lower bounds of by (14) over doubly correlated Rayleigh fading channels. At low SNR, the normalized average mutual information of different modulation signals had little difference. With the increase in SNR, the normalized average mutual information of different input types tended to 1, and the growth rate of BPSK was the fastest. All simulation values were very close to the approximation values.
Figure 3.
Comparison on normalized MI calculated by Monte Carlo and saddle point approximation according to upper and lower bounds of by (14) under different input types, correction parameters () over doubly correlated Rayleigh fading channel model (NT = NR = 2).
Comparison of MI calculated by Monte Carlo and the lower bound of saddle point approximation methods by (15) over doubly correlated Rayleigh fading channel are shown in Figure 4. With the increase in SNR, the mutual information increased, and the accuracy of saddle point approximation became higher. It was also clear that the lower bound of saddle point approximation achieved considerable accuracy for different doubly correlated MIMO fading channel scenarios.
Figure 4.
Comparison on MI calculated by Monte Carlo and the lower bound of saddle point approximation method under different input types and correction parameters () over doubly correlated Rayleigh fading channel model.
4.2. Conciseness of Saddle Point Approximation
The average mutual information has no closed-form expression, so it is usually calculated by the Monte Carlo method. The more sample points, the more accurate the calculation result is. We denoted the sample points as NW. In order to obtain a relatively accurate value of mutual information, NW was at least 104. Table 1 and Table 2 compare the computational complexity of the Monte Carlo method and the saddle point approximation method according to the number of operations and CPU time under the condition that NW was 104. The codes of mutual information calculation based on the Monte Carlo method and the saddle point approximation method were executed on an Intel Core i5-5200U 2.20 GHz processor. The results showed that the computational complexity of the proposed saddle point approximation method was much lower than that of the traditional Monte Carlo method. For example, as shown in Table 2, when NT and NR were equal to 2 and the input type of the two transmitting antennas was 16QAM, the CPU time of saddle point approximation methods by (15) was several orders of magnitude less than that of Monte Carlo method.
Table 1.
Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to the number of operations.
Table 2.
Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to CPU time (seconds) under different input types and correction parameters (). The symbol/indicates the CPU time is more than half an hour.
5. Conclusions
This paper studied the numerical calculation of mutual information for finite-alphabet-based transmissions over doubly correlated MIMO fading channels. The average mutual information was dominated by statistical CSI, and the obstacle of computation was complexity. We examined the appropriateness of the saddle point method first. Then mutual information over any known channel model was calculated by saddle point approximation. Furthermore, we induced the expectation of mutual information over doubly correlated MIMO Rayleigh-fading channels. Numerical results for various MIMO scenarios showed the efficacy of the proposed method. Compared to existing conclusions, the proposed approximation is of considerable accuracy in estimating the average mutual information with radically reduced complexity. It is promising that its accuracy and convenience will facilitate the practical application of mutual information.
Author Contributions
Conceptualization, J.Z., D.Z., and Y.L.; methodology, J.Z. and Y.L; validation, J.Z. and Y.L.; investigation, J.Z. and D.Z.; writing—original draft preparation, Y.L.; writing—review and editing, J.Z. and D.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Basic Research Business Expenses of Beijing Jiaotong University—Special Project of Frontier Science Center of Smart High Speed Railway System (number 2020JBZD004, 2020JBZD010); and Enterprise project (number AIPEC/BJJT/20BW1219).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The authors wish to thank the reviewers for their valuable comments and suggestions concerning this manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Define and for computational simplicity,
where and stand for the real and imaginary components of a complex number . and are dimensional real vectors ( and ) that satisfy and .
By (6) and (A1), we have
Since is positive integers, it is easy to verify that
Therefore, there is a maximum value of , which satisfies the conditions of saddle point approximation calculation. Assuming achieves the maximum at , satisfies
where is the Hessian matrix of at . By the fact of and , for , is equivalent to
and then the Hessian matrix is rewritten as
Note that (A5) is equivalent to an implicit function of , , …, and as below
where
for . Then the Jacobi matrix of is computed as below
Recalling (A6), is a positive definite matrix at , so it is invertible. Consequently, can, in principle, express in terms of , , …, by implicit function theorem. Namely, the maximum of is achieved on the condition that satisfies (A5). Note that a complex number is zero when and only when both its real and imaginary parts are zero vectors, so we have,
where is a positive real number over an open interval (0, 1) and satisfies . So and are both weighted average vectors of , , …, and .
Appendix B
Lemma 1 denotes that is maximized at . By (A5), we have,
By (A5), (A6), and (A10), the Taylor series of is expanded to
Note that a positive definite matrix is invertible, and the determinant of can be computed by , where stands for the trace of . Recalling (A1) and (A6), the saddle point approximation can be computed by Gaussian integral
According to (A5), we can induce
Therefore, (A13) and (A14) demonstrate that Gaussian integral is dominated by in terms of exponential. Define a multiplier dominated by ,
So
Appendix C
Obviously, when SNR varies from to , satisfies by (11), so the average mutual information over doubly correlated MIMO Rayleigh-fading channels is approximated by
Because is a concave function, by Jensen’s inequality, we have
where , so expectation in (A18) is rewritten as
where
stands for the row of .
Since is an independent and identically distributed complex AWGN matrix, is an independent and identically distributed complex AWGN vector. So
According to [24],
where is iid
. (A21) can be written as
Note that for column vector ,
we have
recall (A17), we have
References
- Jin, S.; Gao, X. Statistical antenna selection for MIMO systems in double-sided correlated rayleigh fading channels. In Proceedings of the IEEE Wireless Communications and Networking Conference, Las Vegas, NV, USA, 3–6 April 2006; pp. 729–733. [Google Scholar]
- Zhang, Y.; Huo, Y.; Wang, D.; Dong, X.; You, X. Channel Estimation and Hybrid Precoding for Distributed Phased Arrays Based MIMO Wireless Communications. IEEE Trans. Veh. Technol. 2020, 69, 12921–12937. [Google Scholar] [CrossRef]
- Jin, X.L.; Yang, J.D.; Song, K.Y.; No, J.S.; Shin, D.J. On the relationship between mutual information and bit error probability for some linear dispersion codes. IEEE Tran. Wirel. Commun. 2009, 8, 90–94. [Google Scholar] [CrossRef]
- Shannon, C.E. A mathematical theory of communication. Bell Syst. Technol. J. 1948, 27, 379–423, 623–656. [Google Scholar] [CrossRef]
- Smith, P.J.; Shafi, M. On a Gaussian approximation to the capacity of wireless MIMO systems. In Proceedings of the 2002 IEEE International Conference on Communications. Conference Proceedings, ICC 2002, New York, NY, USA, 28 April–2 May 2002; pp. 406–410. [Google Scholar]
- Chuah, D.N.; Tse, D.N.C.; Kahn, J.M.; Valenzuela, R.A. Capacity scaling in MIMO wireless systems under correlated fading. IEEE Trans. Inf. Theory 2002, 48, 637–650. [Google Scholar] [CrossRef]
- Levin, G.; Loyka, S. On the Outage Capacity Distribution of Correlated Keyhole MIMO Channels. IEEE Trans. Inf. Theory 2008, 54, 3232–3245. [Google Scholar] [CrossRef][Green Version]
- Hsu, H. Digital communications over fading channels. IEEE Circuits Devices Mag. 2001, 17, 57. [Google Scholar]
- Lozona, A.; Tulino, A.M.; Verdu, S. Optimum power allocation for parallel Gaussian channels with arbitrary input distributions. IEEE Trans. Inf. Theory 2006, 52, 3033–3051. [Google Scholar] [CrossRef]
- Xiao, C.; Zheng, Y.R.; Ding, Z. Globally Optimal Linear Precoders for Finite Alphabet Signals Over Complex Vector Gaussian Channels. IEEE Trans. Signal Process. 2011, 59, 3301–3314. [Google Scholar] [CrossRef]
- Cao, K.; Cai, Y.; Wu, Y.; Yang, W. Cooperative Jamming for Secure Communication with Finite Alphabet Inputs. IEEE Commun. Lett. 2017, 21, 2025–2028. [Google Scholar] [CrossRef]
- Sadeghi, P.; Vontobel, P.O.; Shams, R. Optimization of information rate upper and lower bounds for channels with memory. IEEE Trans. Inf. Theory 2009, 55, 663–688. [Google Scholar] [CrossRef]
- Güney, N.; Delic, H.; Alagöz, F. Achievable information rates of PPM impulse radio for UWB channels and rake reception. IEEE Trans. Commun. 2010, 58, 1524–1535. [Google Scholar] [CrossRef]
- Owen, A.B. Monte Carlo extension of quasi-Monte Carlo. In Proceedings of the 1998 Winter Simulation Conference, Washington, DC, USA, 13–16 December 1998; pp. 571–577. [Google Scholar]
- Sayana, K.; Zhuang, J.; Stewart, K. Short Term Link Performance Modeling for ML Receivers with Mutual Information per Bit Metrics. In Proceedings of the IEEE GLOBECOM 2008–2008 IEEE Global Telecommunications Conference, New Orleans, LA, USA, 30 November–4 December 2008; pp. 1–6. [Google Scholar]
- Zeng, W.; Xiao, C.; Wang, M.; Lu, J. Linear precoding for finite alphabet inputs over MIMO fading channels with statistical CSI. IEEE Trans. Signal Process. 2012, 60, 3134–3148. [Google Scholar] [CrossRef]
- Yang, P.; Yang, H. A Low-Complexity Linear Precoding for Secure Transmission over MIMOME Wiretap Channels with Finite-Alphabet Inputs. IEEE Trans. Veh. Technol. 2019, 68, 9896–9907. [Google Scholar] [CrossRef]
- Zeng, W.; Zheng, Y.R.; Xiao, C. Multiantenna secure cognitive radio networks with finite-alphabet inputs: A global optimization approach for precoder design. IEEE Trans. Wirel. Commun. 2016, 15, 3044–3057. [Google Scholar] [CrossRef]
- Zeng, W.; Zheng, Y.R.; Wang, M.; Lu, J. Linear precoding for relay networks: A perspective on finite-alphabet inputs. IEEE Trans. Wireless Commun. 2012, 11, 1146–1157. [Google Scholar] [CrossRef]
- Yadav, A.; Juntti, M.; Lilleberg, J. Linear Precoder Design for Doubly Correlated Partially Coherent Fading MIMO Channels. IEEE Trans. Wirel. Commun. 2014, 13, 3621–3635. [Google Scholar] [CrossRef]
- Ouyang, C.; Wu, S.; Jiang, C.; Cheng, J.; Yang, H. Approximating Ergodic Mutual Information for Mixture Gamma Fading Channels with Discrete Inputs. IEEE Commun. Lett. 2020, 24, 734–738. [Google Scholar] [CrossRef]
- Xiao, C.; Zheng, Y.R. On the mutual information and power allocation for vector Gaussian channels with finite discrete inputs. In Proceedings of the IEEE GLOBECOM 2008–2008 IEEE Global Telecommunications Conference, New Orleans, LA, USA, 30 November–4 December 2008; pp. 1–5. [Google Scholar]
- Wang, W.; Guyet, T.; Guiniou, R. Autonomic intrusion detection: Adaptively detecting anomalies over unlabeled audit data streams in computer networks. Knowl. Based Syst. 2014, 70, 103–117. [Google Scholar] [CrossRef]
- Hassibi, B.; Marzetta, T. Multiple-antennas and isotropically random unitary inputs: The received signal density in closed form. IEEE Trans. Inf. Theory 2002, 48, 1473–1484. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).