Saddle Point Approximation of Mutual Information for Finite-Alphabet Inputs over Doubly Correlated MIMO Rayleigh Fading Channels

Yuyu Liu; Jinbao Zhang; Dan Zhang

doi:10.3390/app11104700

,

and

¹

Institute of Electromagnetic Compatibility, Beijing Jiaotong University, Beijing 100044, China

²

Frontiers Science Center for Smart High-speed Railway System, Beijing 100044, China

³

Beijing Engineering Research Center of EMC and GNSS Technology for Rail Transportation, Beijing 100044, China

^*

Author to whom correspondence should be addressed.

Appl. Sci.2021, 11(10), 4700;https://doi.org/10.3390/app11104700

This article belongs to the Special Issue Wireless Communication: Applications, Security and Reliability

Version Notes

Order Reprints

Review Reports

Abstract

Given the mutual information of finite-alphabet inputs cannot be calculated concisely and accurately over fading channels, this paper proposes a new method to calculate the mutual information. First, the applicability of the saddle point method is studied, and then the mutual information is estimated by the saddle point approximation method with known channel state information. Furthermore, we induce the expectation of mutual information over doubly correlated multiple-input multiple-output (MIMO) Rayleigh fading channels. The validity of the saddle point approximation method is verified by comparing the numerical results of the Monte Carlo method and the saddle point approximation method under different doubly correlated MIMO fading channel scenarios.

Keywords:

mutual information; finite-alphabet inputs; doubly correlated MIMO Rayleigh fading channels; saddle point approximation

1. Introduction

Mutual information plays an irreplaceable role in the theoretical analysis of communication system performance, including analysis, evaluation and optimization of transceiver structure [1], encoding and decoding schemes [2], and communication system bit error rate (BER) performance [3], etc., so it attracts increasing research interest. Channel capacity, defined as the upper bound of mutual information, is realized under Gaussian inputs over additive white Gaussian noise (AWGN) channels [4]. A large number of theoretical analyses and research are also based on this concept [5,6,7]. By means of Minkowski’s inequality [1], two lower bounds of system capacity are obtained and are used as selection indexes to discuss the selection of antenna subsets in spatial multiplexing systems. Under the condition of Gaussian inputs, the hybrid encoder and combiners are designed by maximizing the achievable SE [2].

However, Gaussian inputs are rarely realized in practice because the unbounded amplitude of Gaussian distribution may lead to infinite transmitting power, and the continuity of Gaussian distribution will make it difficult to detect and decode the signal at the receiver. In practical communication systems, inputs are usually taken from finite-alphabet constellation sets with average distribution, rather than Gaussian inputs [8]. Considerable gaps in terms of transmitting performance exist [9,10,11] due to the differences between Gaussian and finite-alphabet inputs and then lead to deviations from optimal strategies. For example, it is believed that the traditionally optimal strategy to achieve capacity for Gaussian inputs is to allocate higher power to the sub-channels with a larger signal-to-noise ratio (SNR). In [10], it is demonstrated that such strategy may be quite suboptimal for the reason that the mutual information with finite-alphabet inputs is upper bounded, and there is little incentive to allocate more power to the sub-channels already close to saturation. At the same time, channel capacity reflects the upper bound of communication system performance, so the performance of digital communication systems can be accurately evaluated by mutual information under the condition of finite alphabet sets inputs and actual transmission environment [12,13].

Due to the great complexity of the direct calculation of mutual information, it is almost impossible to obtain a closed-form solution. Monte Carlo trials are usually used for direct and accurate calculation [14]. In order to reduce the computational complexity, a bit-level algorithm using PDF of a log-likelihood ratio (LLR) to calculate mutual information is proposed [15]. However, each modulation mode of the algorithm requires a lot of prior simulations, and it is only suitable for specific scenarios without universality. To optimize linear precoding with finite-alphabet inputs, the authors in [16,17] deduced the closed-form lower and upper bounds of mutual information as alternatives, which reduce the computational effort by several orders of magnitude compared to calculating the average mutual information directly. Then the bounds of mutual information are applied to multiple antennas, secure cognitive radio networks [18], and relay networks [19]. The study in [20] utilized the cutoff rate (CR) as the alternative of mutual information (MI) to design the linear precoders. Mutual information was also used to develop a two-step algorithm to enhance the achievable secrecy rate of cooperative jamming for secure communication with finite-alphabet inputs in [11]. However, the gaps between approximation and accurate mutual information are still ambiguous, which limits the range where mutual information can be applied. Recently, the authors of [21] approximated ergodic mutual information based on multi-exponential decay curve fitting under M-ary quadrature amplitude modulation (M-QAM) signaling, but other modulation modes were neglected.

This study takes a step toward evaluating accurate mutual information for finite-alphabet-based transmissions over doubly correlated MIMO fading channels. After discussing the applicability of the saddle point method, we obtain the approximate solution of mutual information for any known CSI and modulation mode by using this method, which is universal. On this basis, the mutual information expectation of doubly-correlated MIMO Rayleigh fading channels is further derived. This proposition highlights the considerable accuracy with radically reduced complexity.

The outline of this paper is as follows. The second section introduces the MIMO transmission model and its preliminary research. In the third section, the saddle point approximation method is used to estimate the mutual information, and then we calculate the mutual information expectation over doubly correlated MIMO Rayleigh fading channels. Then the validity and accuracy of the proposed method are verified under different doubly correlated MIMO fading channels scenarios in the fourth section. Finally, the fifth section gives conclusions.

2. Problem Formulation

2.1. Model of MIMO Transmission

Consider MIMO system with

N_{T}

transmitting antennas and

N_{R}

receiving antennas. Let

\tilde{\tilde{x}} \in ℂ^{N_{T} \times 1}

(

ℂ^{N \times m}

denotes the

N \times m

complex spaces) be a transmitting signal vector, satisfying

E_{\tilde{\tilde{x}}} {\tilde{\tilde{x}}} = 0

and

E_{\tilde{\tilde{x}}} {\tilde{\tilde{x}} {\tilde{\tilde{w}}}^{H}} = I

, where

E_{(•)} {*}

stands for the statistical expectation of random

*

with respect to its variable

\cdot

;

I

and

0

denote an unit and zero matrix of appropriate dimensions, respectively.

{(•)}^{H}

is the conjugate transpose of matrix

\cdot

. MIMO transmission is generally modeled by

\tilde{\tilde{y}} = \tilde{\tilde{H}} \tilde{\tilde{x}} + \tilde{\tilde{w}}

(1)

where preliminaries are made as below.

1.: $\tilde{\tilde{H}} \in ℂ^{N_{R} \times N_{T}}$ is a complex fading channel matrix between transmitting antenna and receiving antenna arrays. The doubly correlated MIMO Rayleigh fading channel is modeled by $Ψ_{R}^{1 / 2} {\tilde{\tilde{H}}}_{WG} Ψ_{T}^{1 / 2} \in ℂ^{N_{R} \times N_{T}}$ [1], where ${\tilde{\tilde{H}}}_{WG} \in ℂ^{N_{R} \times N_{T}}$ is a matrix consisted of independent and identically distributed $C N (0, 1)$ complex Gaussian entries; $Ψ_{T}$ and $Ψ_{R}$ are transmitting and receiving correlation matrices, respectively. $Ψ_{T}$ and $Ψ_{R}$ can be expressed as

$Ψ_{T} = U_{T} Σ_{T} U_{T}^{H} and Ψ_{R} = U_{R} Σ_{R} U_{R}^{H}$

(2)

where $U_{T}$ and $U_{R}$ are unitary matrices whose columns are eigenvectors of $Ψ_{T}$ and $Ψ_{R}$ ; $Σ_{T}$ and $Σ_{R}$ represent diagonal matrices whose diagonal entries are the eigenvalues of $Ψ_{T}$ and $Ψ_{R}$ , respectively.
2.: $\tilde{\tilde{w}} \in ℂ^{N_{R} \times 1}$ stands for AWGN corresponding to N_R receiving antennas, where each element is independent and identically complex Gaussian distributed, satisfying $E_{\tilde{\tilde{w}}} {\tilde{\tilde{w}}} = 0$ and $E_{\tilde{\tilde{w}}} {\tilde{\tilde{w}} {\tilde{\tilde{w}}}^{H}} = σ^{2} I$ .

2.2. Mutual Information for Finite-Alphabet Inputs

When a linear unitary transform

U_{R}^{H}

is applied on the receiving signal

\tilde{\tilde{y}}

, the MIMO model in (1) is equivalent to a model with channel matrix

U_{R}^{H} \tilde{\tilde{H}}

and noise

U_{R}^{H} \tilde{\tilde{w}}

[16], which is written as

y = U_{R}^{H} \tilde{\tilde{y}} = Σ_{R}^{1 / 2} H_{WG} Σ_{T}^{1 / 2} U_{T}^{H} \tilde{\tilde{x}} + w

(3)

where

H_{WG} = U_{R}^{H} {\tilde{\tilde{H}}}_{WG} U_{T}

and

w = U_{R}^{H} \tilde{\tilde{w}}

.

\tilde{\tilde{x}}

is selected equiprobably from

N_{T}

-dimension constellation consisted of

N = \prod_{I = 1}^{N_{T}} N_{I}

vectors, where

N_{I}

denotes the number of symbols in the i-th discrete constellation

Ω_{I}

. The mutual Information for Finite-Alphabet inputs between

x

and

y

is given by [22]

\begin{array}{l} I (\tilde{\tilde{x}}; \tilde{\tilde{y}}) = I (U_{T}^{H} \tilde{\tilde{x}}; y) \\ = \log_{2} N + \frac{1}{N} \sum_{m = 1}^{N} E_{H_{WG}, w} \{\log_{2} [\exp (- \frac{| | w | |^{2}}{σ^{2}}) / \sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} + w | |^{2}}{σ^{2}})]\} \end{array}

(4)

where

c_{m, k} = Σ_{R}^{1 / 2} H_{WG} Σ_{T}^{1 / 2} U_{T}^{H} d_{m, k}

and

d_{m, k} = q_{m} - q_{k}

.

q_{m}

and

q_{k}

are the m-th and k-th points in the constellation of

\tilde{\tilde{x}}

, and

| | • | |

stands for the Euclidean norm of the variable

•

.

Since the statistical Channel State Information (CSI) is varying much slower than instantaneous

\tilde{\tilde{H}}

, and can be obtained by channel estimation, this work assumed the statistical CSI was a perfectly-known constant. Consequently, the problem was to calculate the average mutual information with given statistical CSI.

3. Saddle Point Approximation for Mutual Information

Mutual information by (4) needs multiple integrals to compute expectation over

H_{WG}

and

w

. As N increases, it leads to prohibitive complexity and becomes the most significant obstacle in achieving accurate mutual information. Therefore, we used the idea of the mean value theorem of integrals to simplify multiple integrals by finding an appropriate point. In this section, we explore the saddle point approximation method and highlight the convenient calculation with a weighted mean over constellation set of

x

, instead of expectation over all possible samples of random

H_{WG}

and

w

.

3.1. Saddle Point Approximation

We first considered the expectation over the AWGN vector

w

. The Taylor series of (4) is expanded to

ℐ (x; y | H) = \log_{2} N - \frac{1}{N \ln 2} \sum_{m = 1}^{N} \sum_{σ = 1}^{+ \infty} \frac{1}{σ} \sum_{q = 0}^{σ} \frac{σ! {(- 1)}^{q}}{q! (σ - q)!} \frac{1}{π^{N_{R}} σ^{2 N_{R}}} \int_{w} {(\sum_{k = 1}^{N} σ_{m, k} (w))}^{- q} d w

(5)

where

p_{m, k} (w) = \exp (\frac{| | w - q c_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | c_{m, k} | |^{2}}{σ^{2}}) and H = Σ_{R}^{1 / 2} H_{WG} Σ_{T}^{1 / 2}

(6)

Before proceeding to the saddle point approximation, we needed to establish the following lemma, which guarantees the existence of the saddle point.

Lemma 1.

For non-zero natural number

q

, the maximum of

{(\sum_{k = 1}^{N} σ_{m, k} (w))}^{- q}

exists and is achieved at

w = w_{0}

, where

w_{0}

is the weighted average vector of

c_{m, 1}

,

c_{m, 2}

, …, and

c_{m, N}

.

w_{0} = \sum_{k = 1}^{N} q ρ_{m, k} c_{m, k} ≜ q {\bar{c}}_{m}

(7)

where

ρ_{m, k} = σ_{m, k} (w_{0}) / \sum_{k = 1}^{N} σ_{m, k} (w_{0}) = σ_{m, k} (w_{0}) / σ_{m} (w_{0})

is a positive real number over an open interval (0, 1) and satisfies

\sum_{k = 1}^{N} ρ_{m, k} = 1

.

The proof of Theorem 1 is shown in Appendix A.

We are now at

w = w_{0}

to perform saddle point approximation.

Proposition 1.

For non-zero natural number

q

, integral over complex AWGN vector

w

is approximated by

\int_{w} \frac{1}{π^{N_{R}} σ^{2 N_{R}}} {(\sum_{k = 1}^{N} σ_{m, k} (w))}^{- q} d w \approx {[\sum_{k = 1}^{N} \exp (- \frac{α_{m, k} | | c_{m, k} | |^{2}}{σ^{2}})]}^{- q}

(8)

where

α_{m, k} = [1 - q \frac{| | {\bar{c}}_{m} | |^{2}}{| | c_{m, k} | |^{2}} + q \frac{c_{m, k}^{H} {\bar{c}}_{m} + {\bar{c}}_{m}^{H} c_{m, k}}{| | c_{m, k} | |^{2}} - {(\frac{| | c_{m, k} | |^{2}}{σ^{2}})}^{- 1} \ln (\sum_{k = 1}^{N} \frac{ρ_{m,}_{k} | | c_{m, k} | |^{2}}{σ^{2}} - \frac{| | {\bar{c}}_{m} | |^{2}}{σ^{2}})]

(9)

The proof of Proposition 1 is shown in Appendix B.

Mutual information is approximated by (5) and (8) as

ℐ (x; y | H) \approx ℐ (α_{m, k}, H) = \log_{2} N - \frac{1}{N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \exp (- \frac{α_{m, k} | | c_{m, k} | |^{2}}{σ^{2}})]

(10)

Generally speaking, the close-form solution is hardly obtained. That is to say, we cannot write down the exact expression of

α_{m, k}

. Optionally, we adopt a numerical method to obtain approximated

α_{m, k}

,

\underset{α_{m, k} : k = 1, 2}{\arg \min} {| ℐ (x; y | 1) - ℐ (α_{m, k}, 1)} \approx {3 - \exp [- | | c_{m, k} | |^{2} / (4 σ^{2})]}^{- 1}

(11)

where

ℐ (x; y | 1)

is computed by Monte Carlo method by taking BPSK over single-input and single-output (SISO) over AWGN channel (that is

H = 1

) as an example, and

α_{m, k}

is fixed at each signal to noise ratio (SNR). Thus, (10) can be written as

\begin{array}{l} ℐ (x; y | H) \approx \log_{2} N - \frac{1}{N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{σ^{2}} \frac{1}{3 - \exp [- | | c_{m, k} | |^{2} / (4 σ^{2})]})] \\ = \log_{2} N - \frac{1}{N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | H d_{m, k} | |^{2}}{σ^{2}} \frac{1}{3 - \exp [- | | H d_{m, k} | |^{2} / (4 σ^{2})]})] \end{array}

(12)

3.2. Average Mutual Information over Doubly Correlated MIMO Rayleigh Fading Channels

The average mutual information over doubly correlated MIMO Rayleigh fading channels is computed as below,

I (x; y) \approx \log_{2} N - \frac{1}{N} \sum_{m = 1}^{N} E_{H_{WG}} \{\log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{σ^{2}} α_{m, k})]\}

(13)

Since

c_{m, k} = Σ_{R}^{1 / 2} H_{WG} Σ_{T}^{1 / 2} U_{T}^{H} d_{m, k}

, it is still quite hard to compute the expectation of

H_{WG}

. Consequently, (13) remains unsuitable for theoretical applications. Obviously, when SNR varies from

- \infty

to

+ \infty

,

α_{m, k}

satisfies

1 / 3 < α_{m, k} < 1 / 2

by (11), so average mutual information is approximated by

\begin{array}{l} I (x; y) \approx \log_{2} N - \frac{1}{2 N} \sum_{m = 1}^{N} E_{H_{WG}} \{\log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{2 σ^{2}})]\} \\ - \frac{1}{2 N} \sum_{m = 1}^{N} E_{H_{WG}} \{\log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{3 σ^{2}})]\} \end{array}

(14)

For the simplified calculation, the following proposition provides approximate solution:

Proposition 2.

Average mutual information integral over

H_{WG}

is lower bounded by

\begin{array}{l} ℐ (x; y) \geq \log_{2} N - \frac{1}{2 N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \prod_{l = 1}^{N_{R}} {(1 + \frac{{[Σ_{R}]}_{l, l}}{2 σ^{2}} d_{m, k}^{H} σ^{H} Σ_{T} σ d_{m, k})}^{- 1}] \\ - \frac{1}{2 N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \prod_{l = 1}^{N_{R}} {(1 + \frac{{[Σ_{R}]}_{l, l}}{3 σ^{2}} d_{m, k}^{H} σ^{H} Σ_{T} σ d_{m, k})}^{- 1}] \end{array}

(15)

where

P = U_{T}^{H}

.

The proof of Proposition 2 is shown in Appendix C.

4. Simulation Verification and Result Analysis

This section presents examples to illustrate that the saddle point approximation method is very accurate. We considered an exponential correlation model. According to [23], the correlation matrix elements of transmitting and receiving antennas can be expressed as:

\{\begin{cases} [Ψ_{T} (ρ_{T})] I_{T}, j_{T} = ρ_{T}^{| I_{T} - j_{T} |} \\ [Ψ_{R} (ρ_{R})] I_{R}, j_{R} = ρ_{R}^{| I_{R} - j_{R} |} \end{cases} and \{\begin{cases} I_{T}, j_{T} = 1, 2, \dots, N_{T} \\ I_{R}, j_{R} = 1, 2, \dots, N_{R} \end{cases}

(16)

where

ρ_{T}, ρ_{R} \in [0, 1)

.

4.1. Accuracy of Saddle Point Approximation

In Figure 1 and Figure 2, doubly correlated Rayleigh fading and Rice fading channel models were considered, respectively. We compared the average mutual information by the Monte Carlo method and saddle point approximation method by (12). Different input types (BPSK, QPSK, QAM, 8PSK, and 16QAM) were assigned to transmitting antennas to ensure generality. Obviously, with the increase in SNR, mutual information presented an upward trend. When the SNR was greater than 15dB, mutual information tended to be stable. In these cases, (12) offered a very good approximation to the average mutual information for known channel state information.

Figure 1. Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters (

ρ_{T} = ρ_{R} = ρ

) over doubly correlated Rayleigh fading channel model.

Figure 2. Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters (

ρ_{T} = ρ_{R} = ρ

) over doubly correlated Rice fading channel model.

Figure 3 compares normalized MI calculated by Monte Carlo and saddle point approximation according to upper and lower bounds of

α_{m, k}

by (14) over doubly correlated Rayleigh fading channels. At low SNR, the normalized average mutual information of different modulation signals had little difference. With the increase in SNR, the normalized average mutual information of different input types tended to 1, and the growth rate of BPSK was the fastest. All simulation values were very close to the approximation values.

Figure 3. Comparison on normalized MI calculated by Monte Carlo and saddle point approximation according to upper and lower bounds of

α_{m, k}

by (14) under different input types, correction parameters (

ρ_{T} = ρ_{R} = ρ = 0.5

) over doubly correlated Rayleigh fading channel model (N_T = N_R = 2).

Comparison of MI calculated by Monte Carlo and the lower bound of saddle point approximation methods by (15) over doubly correlated Rayleigh fading channel are shown in Figure 4. With the increase in SNR, the mutual information increased, and the accuracy of saddle point approximation became higher. It was also clear that the lower bound of saddle point approximation achieved considerable accuracy for different doubly correlated MIMO fading channel scenarios.

Figure 4. Comparison on MI calculated by Monte Carlo and the lower bound of saddle point approximation method under different input types and correction parameters (

ρ_{T} = ρ_{R} = 0.5

) over doubly correlated Rayleigh fading channel model.

4.2. Conciseness of Saddle Point Approximation

The average mutual information has no closed-form expression, so it is usually calculated by the Monte Carlo method. The more sample points, the more accurate the calculation result is. We denoted the sample points as N_W. In order to obtain a relatively accurate value of mutual information, N_W was at least 10⁴. Table 1 and Table 2 compare the computational complexity of the Monte Carlo method and the saddle point approximation method according to the number of operations and CPU time under the condition that N_W was 10⁴. The codes of mutual information calculation based on the Monte Carlo method and the saddle point approximation method were executed on an Intel Core i5-5200U 2.20 GHz processor. The results showed that the computational complexity of the proposed saddle point approximation method was much lower than that of the traditional Monte Carlo method. For example, as shown in Table 2, when N_T and N_R were equal to 2 and the input type of the two transmitting antennas was 16QAM, the CPU time of saddle point approximation methods by (15) was several orders of magnitude less than that of Monte Carlo method.

Table 1. Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to the number of operations.

Table 2. Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to CPU time (seconds) under different input types and correction parameters (

ρ_{T} = ρ_{R} = 0.4

). The symbol/indicates the CPU time is more than half an hour.

5. Conclusions

This paper studied the numerical calculation of mutual information for finite-alphabet-based transmissions over doubly correlated MIMO fading channels. The average mutual information was dominated by statistical CSI, and the obstacle of computation was complexity. We examined the appropriateness of the saddle point method first. Then mutual information over any known channel model was calculated by saddle point approximation. Furthermore, we induced the expectation of mutual information over doubly correlated MIMO Rayleigh-fading channels. Numerical results for various MIMO scenarios showed the efficacy of the proposed method. Compared to existing conclusions, the proposed approximation is of considerable accuracy in estimating the average mutual information with radically reduced complexity. It is promising that its accuracy and convenience will facilitate the practical application of mutual information.

Author Contributions

Conceptualization, J.Z., D.Z., and Y.L.; methodology, J.Z. and Y.L; validation, J.Z. and Y.L.; investigation, J.Z. and D.Z.; writing—original draft preparation, Y.L.; writing—review and editing, J.Z. and D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Basic Research Business Expenses of Beijing Jiaotong University—Special Project of Frontier Science Center of Smart High Speed Railway System (number 2020JBZD004, 2020JBZD010); and Enterprise project (number AIPEC/BJJT/20BW1219).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors wish to thank the reviewers for their valuable comments and suggestions concerning this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Define

{\tilde{c}}_{m, k}

and

\tilde{w}

for computational simplicity,

{\tilde{c}}_{m, k} = [\begin{array}{l} Re {c_{m, k}} \\ Im {c_{m, k}} \end{array}] and \tilde{w} = [\begin{array}{l} Re {w} \\ Im {w} \end{array}]

(A1)

where

Re {•}

and

Im {•}

stand for the real and imaginary components of a complex number

•

.

{\tilde{c}}_{m, k}

and

\tilde{w}

are

2 N_{R} \times 1

dimensional real vectors (

{\tilde{c}}_{m, k} \in ℝ^{2 N_{R} \times 1}

and

\tilde{w} \in ℝ^{2 N_{R} \times 1}

) that satisfy

| | {\tilde{c}}_{m, k} | |^{2} = | | c_{m, k} | |^{2}

and

| | \tilde{w} | |^{2} = | | w | |^{2}

.

By (6) and (A1), we have

{(\sum_{k = 1}^{N} σ_{m, k} (\tilde{w}))}^{- q} = σ_{m}^{- q} (\tilde{w}) = {[\sum_{k = 1}^{N} \exp (\frac{| | \tilde{w} - q {\tilde{c}}_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | {\tilde{c}}_{m, k} | |^{2}}{σ^{2}})]}^{- q}

(A2)

Since

q

is positive integers, it is easy to verify that

{\begin{cases} σ_{m}^{- q} (\tilde{w}) = {[\sum_{k = 1}^{N} \exp (\frac{| | \tilde{w} - q {\tilde{c}}_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | {\tilde{c}}_{m, k} | |^{2}}{σ^{2}})]}^{- q} > 0 \\ \lim_{{[\tilde{w}]}_{l} \to + \infty} {σ_{m}^{- q} (\tilde{w})} = \lim_{{[\tilde{w}]}_{l} \to + \infty} {[\sum_{k = 1}^{N} \exp (\frac{| | \tilde{w} - q {\tilde{c}}_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | {\tilde{c}}_{m, k} | |^{2}}{σ^{2}})]}^{- q} \to 0 \\ \lim_{{[\tilde{w}]}_{l} \to - \infty} {σ_{m}^{- q} (\tilde{w})} = \lim_{{[\tilde{w}]}_{l} \to - \infty} {[\sum_{k = 1}^{N} \exp (\frac{| | \tilde{w} - q {\tilde{c}}_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | {\tilde{c}}_{m, k} | |^{2}}{σ^{2}})]}^{- q} \to 0 \end{cases}

(A3)

Therefore, there is a maximum value of

σ_{m}^{- q} (\tilde{w})

, which satisfies the conditions of saddle point approximation calculation. Assuming

σ_{m}^{- q} (\tilde{w})

achieves the maximum at

\tilde{w} = {\tilde{w}}_{0}

,

{\tilde{w}}_{0}

satisfies

\{\begin{cases} grad {\ln [σ_{m}^{- q} (\tilde{w})]} |_{{\tilde{w}}_{0}} = 0 \\ H {\ln [σ_{m}^{- q} (\tilde{w})]} |_{{\tilde{w}}_{0}} ≺ 0 \end{cases}

(A4)

where

H {\ln [σ_{m}^{- q} (\tilde{w})]} |_{{\tilde{w}}_{0}}

is the Hessian matrix of

\ln [σ_{m}^{- q} (\tilde{w})]

at

\tilde{w} = {\tilde{w}}_{0}

. By the fact of

σ_{m} (\tilde{w}) > 0

and

q > 0

, for

I = 1, 2, \dots, 2 N_{R}

,

grad {\ln [σ_{m}^{- q} (\tilde{w})]} |_{{\tilde{w}}_{0}} = 0

is equivalent to

{\sum_{k = 1}^{N} \frac{2 (\tilde{w} - q {\tilde{c}}_{m, k})}{q σ^{2}} \exp (\frac{| | \tilde{w} - q {\tilde{c}}_{m, k} | |^{2}}{q σ^{2}} - \frac{(q + 1) | | {\tilde{c}}_{m, k} | |^{2}}{σ^{2}})|}_{\tilde{w} = {\tilde{w}}_{0}} = 0

(A5)

and then the Hessian matrix

H {\ln [σ_{m}^{- q} (\tilde{w})]}_{I, j} (\tilde{w}) |_{{\tilde{w}}_{0}}

is rewritten as

\begin{array}{l} H {\ln [σ_{m}^{- q} (\tilde{w})]} I, j (\tilde{w}) |_{{\tilde{w}}_{0}} & = - q H {\ln [σ_{m} (\tilde{w})]} I, j (\tilde{w}) |_{{\tilde{w}}_{0}} \\ = - \frac{q}{σ_{m} (\tilde{w})} {[{\frac{\partial^{2}}{\partial {[\tilde{w}]}_{I} \partial {[\tilde{w}]}_{j}} σ_{m} (\tilde{w})}_{I, j = 1, 2, \dots, 2 N_{R}}] |}_{\tilde{w} = {\tilde{w}}_{0}} \\ = - \sum_{k = 1}^{N} \frac{4 ({\tilde{w}}_{0} - q {\tilde{c}}_{m, k}) {({\tilde{w}}_{0} - q {\tilde{c}}_{m, k})}^{T} {\tilde{σ}}_{m, k} ({\tilde{w}}_{0})}{q σ^{4} σ_{m} ({\tilde{w}}_{0})} - \frac{2}{σ^{2}} I_{2 N_{R}} ≺ 0 \end{array}

(A6)

Note that (A5) is equivalent to an implicit function of

{\tilde{c}}_{m, 1}

,

{\tilde{c}}_{m, 2}

, …,

{\tilde{c}}_{m, N}

and

{\tilde{w}}_{0}

as below

ℱ ({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, {\tilde{w}}_{0}) = [{\{f_{I} ({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, {\tilde{w}}_{0})\}|}_{I = 1, 2, \dots, 2 N_{R}}] = 0

(A7)

where

\begin{array}{l} f_{I} ({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, \tilde{w}) \\ = \sum_{k = 1}^{N} ({[\tilde{w}]}_{j} - q {[{\tilde{c}}_{m, k}]}_{j}) \exp (\frac{1}{q σ^{2}} \sum_{l = 1}^{2 N_{R}} {({[\tilde{w}]}_{l} - q {[{\tilde{c}}_{m, k}]}_{l})}^{2} - \frac{q + 1}{σ^{2}} \sum_{l = 1}^{2 N_{R}} {[{\tilde{c}}_{m, k}]}_{l}^{2}) \end{array}

(A8)

for

i = 1, 2, \dots, 2 N_{R}

. Then the Jacobi matrix of

ℱ ({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, \tilde{w})

is computed as below

\begin{array}{l} J_{ℱ} |_{({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, \tilde{w})} = [{\{\frac{\partial}{\partial {[\tilde{w}]}_{j}} f_{I} ({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, \tilde{w})\}}_{I, j = 1, 2, \dots, 2 N_{R}}] \\ = σ_{m} (\tilde{w}) (I_{2 N_{R}} + \frac{2}{q σ^{2} σ_{m} (\tilde{w})} \sum_{k = 1}^{N} (\tilde{w} - q {\tilde{c}}_{m, k}) {(\tilde{w} - q {\tilde{c}}_{m, k})}^{T} σ_{m, k} (\tilde{w})) \end{array}

(A9)

Recalling (A6),

J_{ℱ} |_{({\tilde{c}}_{m, 1}, \dots, {\tilde{c}}_{m, N}, \tilde{w})}

is a positive definite matrix at

{\tilde{w}}_{0}

, so it is invertible. Consequently,

{\tilde{w}}_{0}

can, in principle, express in terms of

{\tilde{c}}_{m, 1}

,

{\tilde{c}}_{m, 2}

, …,

{\tilde{c}}_{m, N}

by implicit function theorem. Namely, the maximum of

σ_{m}^{- q} (\tilde{w})

is achieved on the condition that

{\tilde{w}}_{0}

satisfies (A5). Note that a complex number is zero when and only when both its real and imaginary parts are zero vectors, so we have,

w_{0} = \sum_{k = 1}^{N} q ρ_{m, k} c_{m, k} ≜ q {\bar{c}}_{m}

(A10)

where

ρ_{m, k} = σ_{m, k} (w_{0}) / σ_{m} (w_{0})

is a positive real number over an open interval (0, 1) and satisfies

\sum_{k = 1}^{N} ρ_{m, k} = 1

. So

w_{0}

and

{\bar{c}}_{m}

are both weighted average vectors of

c_{m, 1}

,

c_{m, 2}

, …, and

c_{m, N}

.

Appendix B

Lemma 1 denotes that

σ_{m}^{- q} (\tilde{w})

is maximized at

{\tilde{w}}_{0}

. By (A5), we have,

grad {\ln [p_{m}^{- q} (\tilde{w})]} |_{{\tilde{w}}_{0}} = - \frac{q}{p_{m} (\tilde{w})} {[{\{\frac{\partial σ_{m} (\tilde{w})}{\partial {[\tilde{w}]}_{j}}\}|}_{I = 1, 2, \dots, 2 N_{R}}]|}_{\tilde{w} = {\tilde{w}}_{0}} = 0

(A11)

By (A5), (A6), and (A10), the Taylor series of

\ln [σ_{m}^{- q} (\tilde{w})]

is expanded to

\ln [σ_{m}^{- q} (\tilde{w})] \approx \ln [σ_{m}^{- q} ({\tilde{w}}_{0})] + \frac{1}{2} {(\tilde{w} - {\tilde{w}}_{0})}^{T} H_{{\tilde{w}}_{0}} (\tilde{w} - {\tilde{w}}_{0})

(A12)

Note that a positive definite matrix

A

is invertible, and the determinant of

A

can be computed by

\exp [Tr \ln (A)]

, where

Tr A

stands for the trace of

A

. Recalling (A1) and (A6), the saddle point approximation can be computed by Gaussian integral

\begin{array}{l} \int_{w} \frac{p_{m}^{- q} (w)}{π^{N_{R}} σ^{2 N_{R}}} d w & \approx \frac{p_{m}^{- q} ({\tilde{w}}_{0})}{π^{N_{R}} p^{2 N_{R}}} \int_{\tilde{w}} \exp (\frac{1}{2} {(\tilde{w} - {\tilde{w}}_{0})}^{T} H_{{\tilde{w}}_{0}} (\tilde{w} - {\tilde{w}}_{0})) d \tilde{w} \\ = {\{p_{m} ({\tilde{w}}_{0}) \det^{1 / 2 q} (- \frac{σ^{2}}{2} H_{{\tilde{w}}_{0}})\}}^{- q} \geq {\{p_{m} ({\tilde{w}}_{0}) * \frac{1}{2 q} Tr (- \frac{σ^{2}}{2} H_{{\tilde{w}}_{0}})\}}^{- q} \\ \geq \sum_{k = 1}^{N} \exp {\{\begin{array}{l} - \frac{| | c_{m, k} | |^{2}}{σ^{2}} \\ [\begin{array}{l} 1 - q \frac{| | {\bar{c}}_{m} | |^{2}}{| | c_{m, k} | |^{2}} + q \frac{c_{m, k}^{H} {\bar{c}}_{m} + {\bar{c}}_{m}^{H} c_{m, k}}{| | c_{m, k} | |^{2}} - {(\frac{| | c_{m, k} | |^{2}}{σ^{2}})}^{- 1} \\ \ln (\sum_{k = 1}^{N} \frac{{[ρ_{m}]}_{k} | | c_{m, k} | |^{2}}{σ^{2}} - \frac{| | {\bar{c}}_{m} | |^{2}}{σ^{2}}) \end{array}] \end{array}\}}^{- q} \end{array}

(A13)

According to (A5), we can induce

\sum_{k = 1}^{N} ({\bar{c}}_{m} - c_{m, k}) \exp [- \frac{| | c_{m, k} | |^{2}}{σ^{2}} (1 + q \frac{c_{m, k}^{H} {\bar{c}}_{m} + {\bar{c}}_{m}^{H} c_{m, k}}{σ^{2}} - q \frac{| | {\bar{c}}_{m} | |^{2}}{σ^{2}})] = 0

(A14)

Therefore, (A13) and (A14) demonstrate that Gaussian integral is dominated by

| | c_{m, k} | |^{2} / σ^{2}

in terms of exponential. Define a multiplier

α_{m, k}

dominated by

| | c_{m, k} | |^{2} / σ^{2}

,

α_{m, k} = [\begin{array}{l} 1 - q \frac{| | {\bar{c}}_{m} | |^{2}}{| | c_{m, k} | |^{2}} + q \frac{c_{m, k}^{H} {\bar{c}}_{m} + {\bar{c}}_{m}^{H} c_{m, k}}{| | c_{m, k} | |^{2}} - {(\frac{| | c_{m, k} | |^{2}}{σ^{2}})}^{- 1} \\ \ln (\sum_{k = 1}^{N} \frac{{[ρ_{m}]}_{k} | | c_{m, k} | |^{2}}{σ^{2}} - \frac{| | {\bar{c}}_{m} | |^{2}}{σ^{2}}) \end{array}]

(A15)

So

\int_{w} \frac{1}{π^{N_{R}} σ^{2 N_{R}}} {(\sum_{k = 1}^{N} p_{m, k} (w))}^{- q} d w \approx {[\sum_{k = 1}^{N} \exp (- \frac{α_{m, k} | | c_{m, k} | |^{2}}{σ^{2}})]}^{- q}

(A16)

Appendix C

Obviously, when SNR varies from

- \infty

to

+ \infty

,

α_{m, k}

satisfies

1 / 3 < α_{m, k} < 1 / 2

by (11), so the average mutual information over doubly correlated MIMO Rayleigh-fading channels is approximated by

\begin{array}{l} I (x; y) \approx & - \frac{1}{2 N} \sum_{m = 1}^{N} E_{H_{WG}} \{\log_{2} [\frac{1}{N} \sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{2 σ^{2}})]\} \\ - \frac{1}{2 N} \sum_{m = 1}^{N} E_{H_{WG}} \{\log_{2} [\frac{1}{N} \sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{3 σ^{2}})]\} \end{array}

(A17)

Because

\log_{2} (x)

is a concave function, by Jensen’s inequality, we have

E_{H_{WG}} \{\log_{2} [\sum_{k = 1}^{N} \exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})]\} \leq \log_{2} [\sum_{k = 1}^{N} E_{H_{WG}} \{\exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})\}]

(A18)

where

c_{m, k} = Σ_{R}^{1 / 2} H_{WG} Σ_{T}^{1 / 2} U_{T}^{H} d_{m, k}

, so expectation in (A18) is rewritten as

E_{H_{WG}} \{\exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})\} = E_{H_{WG}} \{\exp (- \sum_{l = 1}^{N_{R}} \frac{{[H_{WG}]}_{l} ({[Σ_{R}]}_{l, l} q_{m, k}) {[H_{WG}]}_{l}^{H}}{α_{m, k} σ^{2}})\}

(A19)

where

q_{m, k} = Σ_{T}^{1 / 2} σ d_{m, k} d_{m, k}^{H} σ^{H} Σ_{T}^{1 / 2} and σ = U_{T}^{H}

(A20)

{[H_{WG}]}_{l}

stands for the

l^{th}

row of

H_{WG}

.

Since

H_{WG}

is an independent and identically distributed complex AWGN matrix,

{[H_{WG}]}_{l}

is an independent and identically distributed complex AWGN vector. So

\begin{array}{l} E_{H_{WG}} \{\exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})\} & = E_{H_{WG}} \{\exp (- \frac{1}{α_{m, k} σ^{2}} \sum_{l = 1}^{N_{R}} {[H_{WG}]}_{l} ({[Σ_{R}]}_{l, l} Q_{m, k}) {[H_{WG}]}_{l}^{H})\} \\ = \prod_{l = 1}^{N_{R}} E_{{[H_{WG}]}_{l}} \{\exp (- \frac{{[H_{WG}]}_{l} ({[Σ_{R}]}_{l, l} q_{m, k}) {[H_{WG}]}_{l}^{H}}{α_{m, k} σ^{2}})\} \\ = \prod_{l = 1}^{N_{R}} \int_{{[H_{WG}]}_{l}} p ({[H_{WG}]}_{l}) \exp (\begin{array}{l} - \frac{1}{α_{m, k} σ^{2}} \\ [H_{WG}] l ([Σ_{R}] l, l q_{m, k}) \\ [H_{WG}] l^{H} \end{array}) d {[H_{WG}]}_{l} \\ = \prod_{l = 1}^{N_{R}} \int_{{[H_{WG}]}_{l}} \frac{1}{π^{N_{T}}} \exp (\begin{array}{l} - [H_{WG}] l \\ (I_{N_{T}} + \frac{{[Σ_{R}]}_{l, l}}{α_{m, k} σ^{2}} q_{m, k}) \\ [H_{WG}] l^{H} \end{array}) d {[H_{WG}]}_{l} \end{array}

(A21)

According to [24],

\int_{h \in ℂ^{N \times 1}} \exp (- h^{h} (A + j B) h) d h = \frac{π^{N}}{\det (A + j B)}

(A22)

where

[h_{1}]

is iid

C N (0, σ)

. (A21) can be written as

\begin{array}{l} E_{H_{WG}} \{\exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})\} & = \prod_{l = 1}^{N_{R}} {[\det (I_{N_{T}} + \frac{{[Σ_{R}]}_{l, l}}{α_{m, k} σ^{2}} Q_{m, k})]}^{- 1} \\ = \prod_{l = 1}^{N_{R}} {[\det (\begin{array}{l} I_{N_{T}} + \\ \frac{{[Σ_{R}]}_{l, l}}{α_{m, k} σ^{2}} Σ_{T}^{1 / 2} P d_{m, k} d_{m, k}^{H} P^{H} Σ_{T}^{1 / 2} \end{array})]}^{- 1} \end{array}

(A23)

Note that for column vector

α

,

\det (I_{N} + α α^{H}) = 1 + {[Σ_{α}]}_{1} = 1 + tr (α α^{H}) = 1 + α^{H} α

(A24)

we have

E_{H_{WG}} \{\exp (- \frac{| | c_{m, k} | |^{2}}{α_{m, k} σ^{2}})\} = \prod_{l = 1}^{N_{R}} {(1 + \frac{{[Σ_{R}]}_{l, l}}{α_{m, k} σ^{2}} d_{m, k}^{H} P^{H} Σ_{T} P d_{m, k})}^{- 1}

(A25)

recall (A17), we have

\begin{array}{l} ℐ (x; y) & \geq \log_{2} N - \frac{1}{2 N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \prod_{l = 1}^{N_{R}} {(1 + \frac{{[Σ_{R}]}_{l, l}}{2 σ^{2}} d_{m, k}^{H} P^{H} Σ_{T} P d_{m, k})}^{- 1}] \\ - \frac{1}{2 N} \sum_{m = 1}^{N} \log_{2} [\sum_{k = 1}^{N} \prod_{l = 1}^{N_{R}} {(1 + \frac{{[Σ_{R}]}_{l, l}}{3 σ^{2}} d_{m, k}^{H} P^{H} Σ_{T} P d_{m, k})}^{- 1}] \end{array}

(A26)

References

Jin, S.; Gao, X. Statistical antenna selection for MIMO systems in double-sided correlated rayleigh fading channels. In Proceedings of the IEEE Wireless Communications and Networking Conference, Las Vegas, NV, USA, 3–6 April 2006; pp. 729–733. [Google Scholar]
Zhang, Y.; Huo, Y.; Wang, D.; Dong, X.; You, X. Channel Estimation and Hybrid Precoding for Distributed Phased Arrays Based MIMO Wireless Communications. IEEE Trans. Veh. Technol. 2020, 69, 12921–12937. [Google Scholar] [CrossRef]
Jin, X.L.; Yang, J.D.; Song, K.Y.; No, J.S.; Shin, D.J. On the relationship between mutual information and bit error probability for some linear dispersion codes. IEEE Tran. Wirel. Commun. 2009, 8, 90–94. [Google Scholar] [CrossRef]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Technol. J. 1948, 27, 379–423, 623–656. [Google Scholar] [CrossRef]
Smith, P.J.; Shafi, M. On a Gaussian approximation to the capacity of wireless MIMO systems. In Proceedings of the 2002 IEEE International Conference on Communications. Conference Proceedings, ICC 2002, New York, NY, USA, 28 April–2 May 2002; pp. 406–410. [Google Scholar]
Chuah, D.N.; Tse, D.N.C.; Kahn, J.M.; Valenzuela, R.A. Capacity scaling in MIMO wireless systems under correlated fading. IEEE Trans. Inf. Theory 2002, 48, 637–650. [Google Scholar] [CrossRef]
Levin, G.; Loyka, S. On the Outage Capacity Distribution of Correlated Keyhole MIMO Channels. IEEE Trans. Inf. Theory 2008, 54, 3232–3245. [Google Scholar] [CrossRef][Green Version]
Hsu, H. Digital communications over fading channels. IEEE Circuits Devices Mag. 2001, 17, 57. [Google Scholar]
Lozona, A.; Tulino, A.M.; Verdu, S. Optimum power allocation for parallel Gaussian channels with arbitrary input distributions. IEEE Trans. Inf. Theory 2006, 52, 3033–3051. [Google Scholar] [CrossRef]
Xiao, C.; Zheng, Y.R.; Ding, Z. Globally Optimal Linear Precoders for Finite Alphabet Signals Over Complex Vector Gaussian Channels. IEEE Trans. Signal Process. 2011, 59, 3301–3314. [Google Scholar] [CrossRef]
Cao, K.; Cai, Y.; Wu, Y.; Yang, W. Cooperative Jamming for Secure Communication with Finite Alphabet Inputs. IEEE Commun. Lett. 2017, 21, 2025–2028. [Google Scholar] [CrossRef]
Sadeghi, P.; Vontobel, P.O.; Shams, R. Optimization of information rate upper and lower bounds for channels with memory. IEEE Trans. Inf. Theory 2009, 55, 663–688. [Google Scholar] [CrossRef]
Güney, N.; Delic, H.; Alagöz, F. Achievable information rates of PPM impulse radio for UWB channels and rake reception. IEEE Trans. Commun. 2010, 58, 1524–1535. [Google Scholar] [CrossRef]
Owen, A.B. Monte Carlo extension of quasi-Monte Carlo. In Proceedings of the 1998 Winter Simulation Conference, Washington, DC, USA, 13–16 December 1998; pp. 571–577. [Google Scholar]
Sayana, K.; Zhuang, J.; Stewart, K. Short Term Link Performance Modeling for ML Receivers with Mutual Information per Bit Metrics. In Proceedings of the IEEE GLOBECOM 2008–2008 IEEE Global Telecommunications Conference, New Orleans, LA, USA, 30 November–4 December 2008; pp. 1–6. [Google Scholar]
Zeng, W.; Xiao, C.; Wang, M.; Lu, J. Linear precoding for finite alphabet inputs over MIMO fading channels with statistical CSI. IEEE Trans. Signal Process. 2012, 60, 3134–3148. [Google Scholar] [CrossRef]
Yang, P.; Yang, H. A Low-Complexity Linear Precoding for Secure Transmission over MIMOME Wiretap Channels with Finite-Alphabet Inputs. IEEE Trans. Veh. Technol. 2019, 68, 9896–9907. [Google Scholar] [CrossRef]
Zeng, W.; Zheng, Y.R.; Xiao, C. Multiantenna secure cognitive radio networks with finite-alphabet inputs: A global optimization approach for precoder design. IEEE Trans. Wirel. Commun. 2016, 15, 3044–3057. [Google Scholar] [CrossRef]
Zeng, W.; Zheng, Y.R.; Wang, M.; Lu, J. Linear precoding for relay networks: A perspective on finite-alphabet inputs. IEEE Trans. Wireless Commun. 2012, 11, 1146–1157. [Google Scholar] [CrossRef]
Yadav, A.; Juntti, M.; Lilleberg, J. Linear Precoder Design for Doubly Correlated Partially Coherent Fading MIMO Channels. IEEE Trans. Wirel. Commun. 2014, 13, 3621–3635. [Google Scholar] [CrossRef]
Ouyang, C.; Wu, S.; Jiang, C.; Cheng, J.; Yang, H. Approximating Ergodic Mutual Information for Mixture Gamma Fading Channels with Discrete Inputs. IEEE Commun. Lett. 2020, 24, 734–738. [Google Scholar] [CrossRef]
Xiao, C.; Zheng, Y.R. On the mutual information and power allocation for vector Gaussian channels with finite discrete inputs. In Proceedings of the IEEE GLOBECOM 2008–2008 IEEE Global Telecommunications Conference, New Orleans, LA, USA, 30 November–4 December 2008; pp. 1–5. [Google Scholar]
Wang, W.; Guyet, T.; Guiniou, R. Autonomic intrusion detection: Adaptively detecting anomalies over unlabeled audit data streams in computer networks. Knowl. Based Syst. 2014, 70, 103–117. [Google Scholar] [CrossRef]
Hassibi, B.; Marzetta, T. Multiple-antennas and isotropically random unitary inputs: The received signal density in closed form. IEEE Trans. Inf. Theory 2002, 48, 1473–1484. [Google Scholar] [CrossRef]

Figure 1. Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters (

ρ_{T} = ρ_{R} = ρ

) over doubly correlated Rayleigh fading channel model.

Figure 2. Comparison on MI calculated by Monte Carlo and saddle point approximation under different input types and correction parameters (

ρ_{T} = ρ_{R} = ρ

) over doubly correlated Rice fading channel model.

Figure 3. Comparison on normalized MI calculated by Monte Carlo and saddle point approximation according to upper and lower bounds of

α_{m, k}

by (14) under different input types, correction parameters (

ρ_{T} = ρ_{R} = ρ = 0.5

) over doubly correlated Rayleigh fading channel model (N_T = N_R = 2).

Figure 4. Comparison on MI calculated by Monte Carlo and the lower bound of saddle point approximation method under different input types and correction parameters (

ρ_{T} = ρ_{R} = 0.5

) over doubly correlated Rayleigh fading channel model.

Table 1. Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to the number of operations.

Number of Operations	Monte Carlo Method	Formula (15)
Exponential operation	N_W * (N² + 1)	0
Logarithm operation	N_W * N	2N + 1

Table 2. Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to CPU time (seconds) under different input types and correction parameters (

ρ_{T} = ρ_{R} = 0.4

). The symbol/indicates the CPU time is more than half an hour.

Table 2. Comparison of computational complexity between Monte Carlo method and saddle point approximation method according to CPU time (seconds) under different input types and correction parameters (

ρ_{T} = ρ_{R} = 0.4

). The symbol/indicates the CPU time is more than half an hour.

Cases	Input Type	Monte Carlo Method	Formula (15)
N_T = N_R = 2	BPSK	3.043959	0.001049
N_T = N_R = 3	BPSK	4.444435	0.019326
N_T = N_R = 4	BPSK	8.427808	0.029586
N_T = N_R = 2	QPSK	7.457080	0.027120
N_T = N_R = 3	QPSK	69.558626	0.081062
N_T = N_R = 4	QPSK	/	0.558082
N_T = N_R = 2	8PSK	58.627341	0.070549
N_T = N_R = 3	8PSK	/	1.573137
N_T = N_R = 4	8PSK	/	/
N_T = N_R = 2	16QAM	1281.853	0.612448
N_T = N_R = 3	16QAM	/	255.736633
N_T = N_R = 4	16QAM	/	/

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Saddle Point Approximation of Mutual Information for Finite-Alphabet Inputs over Doubly Correlated MIMO Rayleigh Fading Channels

Abstract

1. Introduction

2. Problem Formulation

2.1. Model of MIMO Transmission

2.2. Mutual Information for Finite-Alphabet Inputs

3. Saddle Point Approximation for Mutual Information

3.1. Saddle Point Approximation

3.2. Average Mutual Information over Doubly Correlated MIMO Rayleigh Fading Channels

4. Simulation Verification and Result Analysis

4.1. Accuracy of Saddle Point Approximation

4.2. Conciseness of Saddle Point Approximation

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix B

Appendix C

References

Article Metrics

Citations

Article Access Statistics