A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems

Alsabah, Muntadher; Naser, Marwah Abdulrazzaq; Mahmmod, Basheera M.; Abdulhussain, Sadiq H.

doi:10.3390/network2020021

Open AccessArticle

A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems

by

Muntadher Alsabah

^1,*,†

,

Marwah Abdulrazzaq Naser

^2,†

,

Basheera M. Mahmmod

^3,†

and

Sadiq H. Abdulhussain

^3,†

¹

Department of Electronic and Electrical Engineering, University of Sheffield, Sheffield S1 4ET, UK

²

Continuous Education Center, University of Baghdad, Baghdad 10001, Iraq

³

Department of Computer Engineering, University of Baghdad, Al-Jadriya, Baghdad 10071, Iraq

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Network 2022, 2(2), 329-349; https://doi.org/10.3390/network2020021

Submission received: 12 March 2022 / Revised: 24 May 2022 / Accepted: 30 May 2022 / Published: 5 June 2022

Download

Browse Figures

Versions Notes

Abstract

:

Future wireless networks will require advance physical-layer techniques to meet the requirements of Internet of Everything (IoE) applications and massive communication systems. To this end, a massive MIMO (m-MIMO) system is to date considered one of the key technologies for future wireless networks. This is due to the capability of m-MIMO to bring a significant improvement in the spectral efficiency and energy efficiency. However, designing an efficient downlink (DL) training sequence for fast channel state information (CSI) estimation, i.e., with limited coherence time, in a frequency division duplex (FDD) m-MIMO system when users exhibit different correlation patterns, i.e., span distinct channel covariance matrices, is to date very challenging. Although advanced iterative algorithms have been developed to address this challenge, they exhibit slow convergence speed and thus deliver high latency and computational complexity. To overcome this challenge, we propose a computationally efficient conjugate gradient-descent (CGD) algorithm based on the Riemannian manifold in order to optimize the DL training sequence at base station (BS), while improving the convergence rate to provide a fast CSI estimation for an FDD m-MIMO system. To this end, the sum rate and the computational complexity performances of the proposed training solution are compared with the state-of-the-art iterative algorithms. The results show that the proposed training solution maximizes the achievable sum rate performance, while delivering a lower overall computational complexity owing to a faster convergence rate in comparison to the state-of-the-art iterative algorithms.

Keywords:

massive MIMO systems; CSI estimation; limited coherence time; sum rate maximization; spatial correlation; training sequence design; FDD; TDD; optimization on manifolds; iterative algorithm; gradient algorithm

1. Introduction

The rapid growth in the number of smart devices and the emergence of the Internet of Everything (IoE) applications, which require an ultra-reliable and low-latency communication, will result in a substantial burden on the current wireless networks [1]. As such, the data rate that could be supplied by current wireless networks will be unlikely to sustain the enormous ongoing data traffic explosion. In addition, the latest statistics revealed that by 2023 nearly two-thirds of the global population will have Internet access with around 5.3 billion Internet users (66 percent of the global population), and by 2025 more than 80 billion devices will require communication connectivity [2]. Therefore, this will create the requirements for massive communication systems and networks. These requirements and demands for data traffic have motivated research into continuing to advance the existing networks in addition to the innovation of new physical-layer techniques for the future generation of cellular systems. Massive MIMO (m-MIMO) is introduced as a key physical layer technique for the next generation wireless networks that is required to support the huge increasing demand for data traffic [3,4,5], whilst improving the energy efficiency [6]. In particular, the m-MIMO system is considered a potential solution for fifth generation (5G) wireless communications and beyond [7]. However, from an information theoretic point of view, the performance of the m-MIMO system depends on the accuracy and availability of the channel state information (CSI) estimation. Obtaining the CSI at the BS depends on the duplexing operation mode, i.e., either frequency division duplex (FDD) or time division duplex (TDD) mode. Despite the promising results of TDD operation with m-MIMO when it comes to the CSI estimation, the vast majority of the currently deployed cellular networks operates in FDD mode. For example, over 85% of the current commercial long term evolution (LTE) wireless mobile networks operate in FDD mode [8]. Therefore, this paper focuses on FDD systems, as the CSI can only be obtained by a dedicated training sequence in the downlink (DL). In particular, to obtain the CSI in FDD systems, the BS needs to optimize the DL training sequence and transmit it to the users [9,10]. However, the expansion of a number of antenna elements at the BS in the m-MIMO system makes the DL training sequence optimization for fast CSI estimation, i.e., with limited coherence time, in FDD systems very challenging.

To overcome the CSI estimation challenge in FDD m-MIMO systems, several studies have investigated the optimization of DL training sequences by considering the scenario where the users exhibit common spatial correlation at the BS under different system model assumptions; namely, refs. [11,12,13,14,15,16,17,18] exploit correlations in the spatial domain and correlations both in the time and spatial domains, respectively. However, in practice, users could exhibit distinct spatial correlation patterns, which arise due to independent propagation conditions and scattering geometries; hence, the optimization framework of the training sequences developed in [11,12,13,14,15,16,17,18] does not hold in general multiuser scenarios with K distinct spatial correlations. Another line of research has focused on the channel estimation by using compressed sensing (CS) based techniques [19,20,21,22,23,24]. However, due to the unknown sparse nature of the channels, CS based approaches cannot be applied in practice to predict the DL achievable sum rate of an FDD m-MIMO system. Another development for CSI estimation is to use a hybrid two-stage precoding approaches, see e.g., [25,26,27,28]. However, the aforementioned research works do not address the single-stage precoding scenario with K independent user covariance matrices.

To address the challenge of having distinct correlated channels and to find a more realistic solution with single-stage precoding scheme, several studies have investigated the design of training sequences by utilizing different iterative algorithms. For example, in [29] the DL training sequences are optimized iteratively based on the average sum rate loss due to CSI estimation with zero forcing (ZF) precoding. In refs. [30,31] the DL training sequences are optimized iteratively as a solution to a sum conditional mutual information (SCMI) maximization problem and a sum mean square error (SMSE) minimization problem, respectively. While the investigations in refs. [30,31] have optimized the DL training sequences iteratively, the algorithms used in the optimization exhibit slow convergence speed, and thus, deliver high latency and computational complexity. However, in real-time systems, the CSI estimation must be obtained more frequently with an acceptable latency from channel estimation to precoding, especially when the channel exhibits a shorter coherence time. Therefore, there is an essential requirement for developing a new efficient iterative algorithm for the FDD m-MIMO systems that optimizes the DL training sequence, i.e., maximizes the sum rate, while at the same time exhibits the fast convergence rate so as to obtain the CSI estimation more quickly over a limited coherence time. Note that the design principles in refs. [30,31] are more general compared with [29] since they do not enforce a specific precoding structure and do not consider a heuristic approximate upper bound of the sum rate loss for CSI estimation. This makes the approaches in refs. [30,31] more rigorous than the approach in [29] for DL sequences optimization.

This paper addresses the challenge of CSI DL channel estimation in the FDD m-MIMO systems with limited coherence time considering the users have different correlations. We propose a conjugate gradient descent (CGD) optimization algorithm over the Riemannian manifold for optimizing the DL training sequences. The proposed algorithm is essentially required to speed up the convergence rate by lessening the required number of iterations for achieving fast CSI estimation and hence reduces the overall computational complexity. This proposed algorithm is also very useful for massive communication systems and networks. In this paper, a random matrix theory (RMT) method [32,33] is used to validate our Monte-Carlo simulations. This allows the achievement of a straightforward methodology without resorting to a computationally demanding exhaustive search. In addition, the computational complexity analyses of the proposed CGD algorithm and the state-of-the-art SCMI [30] and SMSE [31] iterative algorithms are provided. Comparisons are presented for the sum rate and overall complexity performances between the proposed CGD algorithm and existing SCMI [30] and SMSE [31] iterative algorithms for DL sequence optimization in both eigenbeamforming (BF) and the regularized ZF (RZF) precoding. The aforementioned algorithms are considered the best known state-of-the-art iterative algorithms. The results demonstrate that the RZF precoder under correlated channels achieves a significant gain in the DL sum rate in comparison to the BF precoder. The results show that the proposed CGD algorithm achieves more or less the same rate performances as the state-of-the-art iterative algorithms for training designs, while reducing the computational complexity in both BF and RZF precoding. This computational complexity reduction is due to the use of matrix exponential search over the Riemannian manifold, which results in an optimized DL training sequence for fast CSI estimation in the FDD m-MIMO systems. These findings create a pathway for realizing FDD m-MIMO with massive communication systems. Finally, the results demonstrate that the analytical solution using the RMT method tightly agrees with the simulation, which underpins the contributions of this paper.

The present paper is organized as follows: in Section 2, the system model is described. In Section 3, the training sequence design in an FDD m-MIMO system is investigated. In Section 4, the CGD iterative algorithm is developed. In Section 5, the expressions that accurately approximate the SINR and the DL achievable sum rate are introduced. In Section 6, the results are presented and discussed. Finally, in Section 7, the conclusions are drawn.

Notation: In this paper, an upper boldface symbol denotes a matrix whereas a lower boldface symbol denotes a vector. The term

E [\cdot]

refers to the expectation operator. The operators trace, transpose, Hermitian transpose, inverse and absolute value are denoted by

tr (\cdot)

,

{(\cdot)}^{T}

,

{(\cdot)}^{H}

,

{(\cdot)}^{- 1}

, and

| \cdot |

, respectively.

2. System Model Description

This paper considers a single-cell wireless communications system, where the BS is equipped with an N transmit antenna that serves K single antenna users. We consider a non-line-of-sight (NLOS) Rayleigh fading channels over a single-frequency band with an overall coherence time denoted by

T_{} \in Z^{+}

and enumerated in symbols per transmission block. Figure 1 demonstrates a BS with m-MIMO system and illustrates the DL and UL transmissions within each coherence block length. The received signal during the data transmission phase at the k-th user is given as:

\begin{matrix} y_{k}^{} = \sqrt{ρ_{d}^{} λ} h_{k}^{H} V s + z_{k}^{} \end{matrix}

(1)

where

ρ_{d}

denotes the per-user signal-to-noise-ratio (SNR) during the data transmission phase and

λ

refers to the normalization constant, which is defined as

λ = K / E [tr ({VV}^{H})]

, and

V = [v_{1}, \dots, v_{K}] \in C^{N \times K}

is the precoding matrix. The term

s = {[s_{1}, \dots, s_{K}]}^{T} \in C^{K}

is independently and identically distributed (IID) with a zero mean circularly symmetric complex Gaussian (CSCG) vector of data symbols, which satisfies

E [s s^{H}] = I_{K}

, and

z_{k}

denotes the additive noise, which is modeled as a zero mean unit variance CSCG random variable. The DL instantaneous channel vector is given by

h_{k} = R_{k}^{1 / 2} {\tilde{h}}_{k}^{}

, where the elements of

{\tilde{h}}_{k} \in C^{N}

are IID with zero mean and unit variance and k-th user’s correlation matrix,

R_{k}

satisfies

R_{k} = E [h_{k}^{} h_{k}^{H}] \in C^{N \times N}

. It is worth noting that

R_{k}

depends on large-scale statistics, i.e., angles of arrival and departure or spatial/temporal correlation, which are considered to be frequency-invariant, and thus, can be efficiently obtained in the FDD or TDD systems [34]. To this end, the received signal-to-interference-plus-noise ratio (

SINR

), denoted

γ_{k}

, is given as:

γ_{k} = \frac{λ ∣ E [h_{k}^{H} v_{k}^{}] ∣^{2}}{\frac{1}{ρ_{d}^{}} + λ E [| h_{k}^{H} v_{k}^{} - E [h_{k}^{H} v_{k}^{}] |^{2}] + λ \sum_{l \neq k}^{K} E [| h_{k}^{H} v_{l}^{} |^{2}]}

(2)

Accordingly, the DL achievable sum rate,

C_{}^{}

, for an FDD m-MIMO system under consideration can be expressed as:

C_{}^{} = (1 - \frac{T_{tr}^{}}{T_{}^{}}) \sum_{k = 1}^{K} \log_{2}^{} (1 + γ_{k}^{}) [bit / s / Hz]

(3)

where parameter

T_{tr}

represents the training sequence length used during the training-phase. This paper uses

T_{tr}

that maximizes the achievable sum rate. The received SINR term in (2) depends on the channel statistics, the channel estimates, and the linear precoding technique used at the BS. We consider two commonly prevailing types of linear precoders, named the BF and the RZF, as defined in [33]. The expectation (i.e., the averaging process) in (2) is taken with respect to different channel realizations, which are obtained separately by means of Monte Carlo simulation. As the purpose of the present paper is to concentrate on the DL training sequence design, the UL feedback time and associated error rate are assumed to be zero. Efficient feedback schemes can be considered in the future, see, e.g., [35,36,37,38]. Besides, advanced signal processing techniques can also be considered for feedback, see e.g., [39,40]. A large system limit based on random matrix theory methods is used to provide asymptotic expressions that accurately approximate the SINR and achievable sum rate for the BF and RZF precoders.

3. CSI Estimation Using DL Training Sequences

To gain further insight into the problem of DL CSI estimation, this section investigates the training sequence design in an FDD m-MIMO system. To estimate the DL channel, the BS transmits a predetermined training sequence of length

T_{tr}

during the training-phase. The DL pilot sequences, collected by each user and returned to the BS, provide instantaneous channel realizations at the BS, which in turn implement regular BF or RZF precoding based on channel estimates. As such, the training sequences for channel estimation [41] are updated according to the dynamics of each

R_{k}

, which in turn is updated at every coherence time interval. To this end, the received signal during training-phase,

y_{k}^{} \in C^{T_{tr}}

, at the k-th user is given by:

y_{k}^{} = \sqrt{ρ_{tr}^{}} S_{tr}^{H} h_{k}^{} + z_{k}^{}

(4)

where

S_{tr}^{} \in C^{N \times T_{tr}^{}}

is the training matrix. The receiver noise

z_{k} \in C^{T_{tr}}

exhibits a CSCG distribution

CN (0, I_{T_{tr}})

.

Extensive research has been carried out to address the challenge of CSI estimation, see e.g., [11,12,13,14,15,16,17,18]. In these investigations, a special scenario when the users exhibit common spatial correlation at the BS is considered. However, a more general scenario when the users exhibit distinct spatial correlation at the BS is needed. Besides, compressed sensing based approaches for CSI estimation have been used in [19,20,21,22,23,24]. In these works, the sparsity structure on the virtual angular domain is used to reduce the training length required for the CSI estimation. However, the compressed sensing based approaches cannot be applied in practice due to the unknown sparsity nature of the channels that cannot be predicted in these approaches. Another area of research on CSI estimation is to exploit the hybrid two-stage precoding based approaches, see e.g., [25,26,27,28]. In particular, the hybrid based approaches exploit correlation in the spatial domain, where the users within each group exhibit the same spatial correlation, and a linear superposition of each group correlation matrix is utilized to perform the first of two stages of precoding, thus forming a beam for each group. However, sophisticated scheduling and clustering algorithms of the user groups, and of the users inside each group, are required in these hybrid approaches. Hence, these works cannot be applied to predict the achievable sum rate performance in a more general single-stage scenario with K independent user covariance matrices. To address the challenge of CSI estimation in the more general single-stage precoding scenario when the users exhibit distinct correlated channels, several studies have been carried out, see, e.g., [30,31,42,43]. Specifically, the training sequences in [30,31,42,43] are designed by exploiting different iterative algorithms as a solution to an SCMI maximization criterion and an SMSE minimization criterion, respectively. However, the limited coherence time interval in practice implies that the CSI estimation should be determined more frequently, and thus, iterative-based solutions for the DL training sequence design may be infeasible. In addition, the algorithms used in the aforementioned optimizations exhibit slow convergence speed, and thus, deliver high latency and computational complexity. Hence, there is a crucial need to develop a lower complexity iterative algorithm solution for the FDD m-MIMO systems, which has the ability to optimize the DL training sequence, i.e., maximizes the achievable sum rate, while at the same time exhibits a fast convergence rate so that obtaining a fast CSI estimation when a limited coherence time is considered.

This paper proposes a computationally efficient CGD iterative algorithm, described in detail in Section 4, to optimize

S_{tr}

for pre-selected training length

T_{tr}

, and training power

ρ_{tr}^{}

. The proposed algorithm aims to achieve a robust sum rate performance when the users exhibit different correlations while at the same time reducing the computational complexity. Let

f (S_{tr}) \in R

denote the sum MSE of the channel estimate cost function, which needs to be minimized with respect to the DL training matrix

S_{tr}

, as

f (S_{tr}) = \sum_{k = 1}^{K} tr (R_{k}) - \sum_{k = 1}^{K} tr (Φ_{k})

(5)

where

Φ_{k} = R_{k}^{} S_{tr}^{} {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + \frac{1}{ρ_{tr}^{}} I_{T_{tr}^{}}^{})}^{- 1} S_{tr}^{H} R_{k}^{}

, which denotes the k-th user’s covariance matrix with minimum mean square error (MMSE) channel estimation. To this end, the formulation of the CSI estimation optimization problem is provided. Minimizing the sum MSE over the training sequence

S_{tr}^{}

for a given training-phase duration

T_{tr}

in the FDD m-MIMO system equates to the optimization problem defined in (6).

\begin{matrix} \underset{S_{tr}}{minimize} f (S_{tr}) \\ subject to S_{tr}^{H} S_{tr}^{} = I_{T_{tr}} \end{matrix}

(6)

The sum MSE cost function, which corresponds to a function of the subspace that is spanned by the pilot matrix

S_{tr}

, is invariant to the unitary rotation, i.e.,

f (S_{tr} U) = f (S_{tr})

.

This property allows the CGD algorithm over the Riemannian manifold to be effectively used to solve the optimization problem in (6). To apply the CGD optimization algorithm on a Riemannian manifold, the partial derivative of the sum MSE cost function in (5) needs to be determined with respect to

S_{tr}

. To this end, the partial derivative

Γ \in C^{N \times T_{tr}}

is given by [31]:

\begin{matrix} Γ = 2 \sum_{k = 1}^{K} (- R_{k}^{2} S_{tr}^{} {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + \frac{1}{ρ_{tr}^{}} I_{T_{tr}^{}}^{})}^{- 1} + R_{k}^{} S_{tr}^{} \times \\ {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + \frac{1}{ρ_{tr}^{}} I_{T_{tr}^{}}^{})}^{- 1} S_{tr}^{H} R_{k}^{2} S_{tr}^{} {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + \frac{1}{ρ_{tr}^{}} I_{T_{tr}^{}}^{})}^{- 1}) \end{matrix}

(7)

where the right hand side in (7) is obtained by differentiating the cost function in (6) with respect to pilot matrix, (i.e.,

\partial f (S_{tr}) / \partial S_{tr}^{*}

).

4. Training Sequence Optimization Based on CGD Algorithm over the Riemannian Manifold

Optimization on manifolds means finding an optimum solution of a desired function using a smooth finite-dimensional Riemannian manifold. The Riemannian gradient is simply the orthogonal projection of the classical gradient. Conceptually, the key point is to perform as unconstrained nonlinear optimization. Exploiting the Riemannian makes it easy to deal with various types of constraints, which arise in low rank matrices. In particular, optimization on manifolds is a powerful scheme to address the nonlinear optimization problems such as the problem of finding the training sequence matrix that maximizes the achievable sum rate of the FDD massive MIMO systems, which is considered in this present paper. In wireless communication systems and array signal processing, several studies have considered various optimization problems under a unitary matrix constraint [44,45,46,47]. Consequently, advanced gradient-based iterative algorithms have been developed to find efficient solutions with fast convergence rate to such optimization problems [48,49,50,51]. Some of those efficient solutions have been obtained using a geometrical approach based on the smooth parameter space known as the Riemannian manifold. The potential of the Riemannian manifold has motivated the author of this paper to explore the application of such a geometrical approach for optimising the training sequence in an FDD massive MIMO system when the users exhibit distinct spatial correlations. In this section, the CGD iterative algorithm based on the Riemannian manifold is explored to optimize the DL pilot matrix

S_{tr}

iteratively across multiple users with independent channel covariance matrices. In optimizing (6) for

S_{tr}

, the CGD method uses the almost periodic property of the matrix exponential search [52,53] over the Riemannian manifold to increase the algorithm convergence speed by lessening the required number of iterations. In particular, the introduction of matrix exponential on the Riemannian manifold aids the proposed CGD algorithm to operate over a smaller dimensional search space and thus provide a fast convergence behaviour. In addition, the existence of matrix exponential parametrization in the proposed CGD method is a sufficient condition to satisfy the semi-unitary matrices

S_{tr}^{H} S_{tr}^{} = I_{T_{tr}}

. Further details on the matrix exponential property on the Riemannian manifold can be found in [52,53,54]. This algorithm can also be used to increase the capacity with interference channel [55]. Algorithm 1 summarizes the proposed CGD iterative algorithm. The following steps explain Algorithm 1 in further detail.

Algorithm 1 Iterative CGD optimization algorithm.

1:: $Initialization : t = 0 and μ = 1, S_{t} = I_{} \in C^{N \times T_{tr}}$
2:: Determine the gradient using (8), and set $G_{t} = D_{t} .$
3:: $If 〈 D_{t}, D_{t} 〉 < ϵ, then break .$
4:: $Determine the matrix exponential : P_{t} = \exp (- μ D_{t}) .$
5:: $If f (S_{t}) - f (P_{t} S_{t}) \geq μ 〈 D_{t}, D_{t} 〉, double P_{t}, μ : = 2 μ .$
6:: $If f (S_{t}) - f (P_{t} S_{t}) < 0.5 μ 〈 D_{t}, D_{t} 〉, halve P_{t}, μ : = 0.5 μ .$
7:: $Update S_{t + 1} : = P_{t} S_{t}$ .
8:: $Update G_{t + 1} = D_{t + 1} + υ_{t} G_{t}, t : = t + 1, and go to step 2$ .

Step 1—Initial step: The proposed iterative gradient algorithm starts by selecting an initial training sequence matrix $S_{tr} \in C^{N \times T_{tr}}$ . For the sake of notational simplicity, in the algorithm, the training sequence matrix notation $S_{t}$ is used instead, where the subscript t corresponds to the number of iterations.
Step 2—Gradient evaluation and projection on the Riemannian parameter space: In this step, the gradient of the sum MSE cost function defined in (5) is determined, and the projection on the Riemannian manifold is computed to find the descent direction and set $G_{t} = D_{t}$ .

$D_{t} = Γ_{t}^{} S_{t}^{H} - S_{t}^{} Γ_{t}^{H}$

(8)

where $D_{t}$ is the descent direction for iteration t and $Γ_{t}$ corresponds to the Euclidean gradient with respect to the training sequence matrix as given in (7).
Step 3—Stopping criteria: Check the gradient on the Riemannian parameter space (i.e., $D_{t}$ ) whether it reaches convergence or not according to:

$〈 D_{t}, D_{t} 〉 = \frac{1}{2} Re \{tr (D_{t}^{H} D_{t}^{})\}$

(9)

where Re stands for the real value and $ϵ$ in step 3 of Algorithm 1 denotes the error tolerance. If the steepest direction is sufficiently small, which implies a close to local minimum value of the cost function the algorithm stops.
Step 4—Matrix exponential computation: Determine the local parametrisation based on the matrix exponential on the Riemannian manifold, i.e., $P_{t} = \exp (- μ D_{t})$ , where $μ$ corresponds to the step size that controls the steepest direction. The complex matrix exponential is determined by the convergent power series [52,53], where a Matlab function expm is used for this purpose.
Steps 5 and 6—Step size evaluation: The step size $μ$ is required to be tuned in order to ensure an appropriate steepest direction movement towards the training solution. In particular, in steps 5 and 6, the sum MSE cost function of different sequences are evaluated based on different possibilities of $μ$ . If $μ$ is too small and the condition of the cost function is not met, then $μ$ is doubled, as given in step 5. If $μ$ is too large and the condition of the cost function is not met, then $μ$ is halved, as provided in step 6.
Step 7—Updating the training sequence and the gradient direction: In this step, the pilot matrix and the gradient direction are updated according to the obtained matrix exponential $P_{t}$ . Due to the unitary invariant rotation of the cost function, the obtained training sequence has unitary columns.
Step 8—The new conjugate gradient direction is obtained by updating $G_{t + 1} = D_{t + 1} + υ_{t} G_{t}$ , where $D_{t + 1}$ is determined by substituting the obtained sequence matrix into (8), and the parameter $υ_{t}$ is given as [53]:

$υ_{t} = \frac{〈 D_{t + 1} - D_{t}, D_{t + 1} 〉}{〈 D_{t}, D_{t} 〉}$

(10)

where $υ_{t}$ denotes the Polak–Ribièrre formula, as defined in [53]. If the stopping criteria is met, the algorithm is executed and the sequence is optimized. In contrast, if the stopping criteria is not met, a new iteration $(t + 1)$ is started.

We provide the flops calculation (obtained by counting the number of multiplications and additions per iteration) for the proposed CSD algorithm and the existing SCMI and SMSE algorithms. Table 1 summarises the complexity analysis in flops [56] per iteration for the three training designs considered. Parameters (

t_{d g}

,

t_{h g}

) and (

t_{d}

,

t_{h}

) represent the number of iterations required for doubling and halving the step size in the proposed CGD method and the SMSE iterative algorithm, respectively. In this paper, the gradient descent based approach is applied, which can be defined as a first-order iterative optimization solution. This approach is useful for finding a local minimum of the MSE differentiable function with respect to the pilot matrix. In particular, the proposed solution exploits the doubling or halving of the gradient direction to direct steps proportional to the negative of the gradient in order to find the local minimum of the MSE function using a gradient descent based approach. The subscript

d g

denotes doubling the step size in the proposed CGD method, whereas subscript

h g

refers to halving the step size in the proposed CGD, where the letter g stands for the gradient. The variable X is given as

X = (5 T_{tr}^{3} + 2 T_{tr}^{2} (7 N - 1))

for SMSE and the variable B is given as

B = (5 T_{tr}^{3} + 16 N^{2} T_{tr} + 14 N T_{tr}^{2} - 2 T_{tr}^{2} - 10 N T_{tr})

for CGD. The flops calculations show that all three gradient algorithms grow with the number of training sequences as

O (T_{tr}^{3})

, with the number of antennas as

O (N^{2})

, and with the number of users as

O (K)

.

The complexity of calculating the gradient in step 2 of Algorithm 1, so that the term $\sum_{k = 1}^{K} R_{k}^{2} S_{tr}^{} {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + I_{T_{tr}^{}}^{})}^{- 1} + R_{k}^{} S_{tr}^{} \times {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + I_{T_{tr}^{}}^{})}^{- 1} S_{tr}^{H} R_{k}^{2} S_{tr}^{} {(S_{tr}^{H} R_{k}^{} S_{tr}^{} + I_{T_{tr}}^{})}^{- 1}$ requires $(K - 1) (5 T_{tr}^{3} + 16 N^{2} T_{tr} + 14 N T_{tr}^{2} - 2 T_{tr}^{2} - 10 N T_{tr})$ flops.
Calculating the matrix exponential requires $2 (2 / 3) N^{2} T_{tr}$ flops [52,53].
Checking the gradient convergence in step 3 using the squared Frobenius norm of an $N \times T_{tr}$ matrix requires $2 N T_{tr}^{} - 1$ flops [56].
Calculating the term step size adaptation in step 5 and step 6 of Algorithm 1 requires $(2 (2 / 3) N^{2} T_{tr} + (t_{d} + 3) N^{2} T_{tr} + t_{h} (2 (2 / 3) N^{2} T_{tr}) + 3 N^{2} T_{tr})$ flops [52].
Multiplying an $N \times N$ matrix with an $N \times T_{tr}$ matrix, entails $2 N^{2} T_{tr} - N T_{tr}$ flops.
Multiplying a $T_{tr} \times N$ matrix with an $N \times T_{tr}$ matrix, entails $2 N T_{tr}^{2} - T_{tr}^{2}$ flops.
Multiplying a $T_{tr} \times N$ matrix with an $N \times N$ matrix, entails $2 N^{2} T_{tr} - N T_{tr}$ flops.
Multiplying a $T_{tr} \times T_{tr}$ matrix with a $T_{tr} \times T_{tr}$ matrix requires $2 T_{tr}^{3} - T_{tr}^{2}$ flops.
Multiplying an $N \times N$ matrix with an $N \times N$ matrix, entails $2 N^{3} - N^{2}$ flops.
Multiplying an $N \times T_{tr}$ matrix with a $T_{tr} \times T_{tr}$ matrix, entails $2 N T_{tr}^{2} - N T_{tr}$ flops.
Multiplying an $N \times T_{tr}$ matrix with a $T_{tr} \times N$ matrix, entails $2 N^{2} T_{tr} - N^{2}$ flops.
The scalar matrix multiplication with an $N \times T_{tr}$ matrix needs $N T_{tr}$ flops.
Adding an $N \times T_{tr}$ matrix to an $N \times T_{tr}$ matrix requires $N T_{tr}$ flops.
Inverting of a $T_{tr} \times T_{tr}$ matrix requires $T_{tr}^{3}$ flops.

5. Achievable Sum Rate Analysis Using RMT Method

This section provides the expressions that accurately approximate the SINR and DL achievable sum rate based on the asymptotic RMT approach in [32,33]. In particular, asymptotically tight approximations of

γ_{k}

for the BF and RZF precoders are obtained when N and K grow without bounds while the ratio

K / N > 0

is kept constant. To this end, the asymptotic expression for BF precoding is given as:

{\bar{γ}}_{k}^{BF} = \frac{\bar{ξ} {(tr (Φ_{k}))}^{2}}{\frac{1}{ρ_{d}} + \sum_{i = 1}^{K} \bar{ξ} tr (R_{k} Φ_{i})}

(11)

where

\bar{ξ} = {(\frac{1}{K} \sum_{k = 1}^{K} tr (Φ_{k}))}^{- 1}

. The SINR approximation for the RZF precoder is given as:

{\bar{γ}}_{k}^{RZF} = \frac{N \bar{ξ} δ_{k}^{2}}{\frac{{(1 + δ_{k})}^{2}}{ρ_{d} K} + \bar{ξ} \sum_{i = 1}^{K} {(\frac{1 + δ_{k}}{1 + δ_{i}})}^{2} {\bar{μ}}_{k, i}}

(12)

where the term

\bar{ξ} \in R

is determined later in (20). Defining a recursion on integer t, where

t = 1, 2, \dots,

δ_{k}^{(t)} = \frac{1}{N} tr (Φ_{k} {(\frac{1}{N} \sum_{i = 1}^{K} \frac{Φ_{i}}{1 + δ_{i}^{(t - 1)}} + ζ I_{N}^{})}^{- 1})

(13)

with an initial value

δ_{k}^{(0)} = 1 / ζ

for all k with

ζ = 1 / ρ_{d}

, the variable

δ_{k} \in R

is found numerically by the standard fixed-point algorithm as

δ_{k}^{} = lim_{t \to \infty} δ_{k}^{(t)}

(14)

After the solution of the fixed-point equations in (13) and (14) is numerically obtained, it is substituted into:

T = {(ζ I_{N}^{} + \frac{1}{N} \sum_{k = 1}^{K} \frac{Φ_{k}}{1 + δ_{k}})}^{- 1}

(15)

to obtain random matrix

T \in C^{N \times N}

. Auxiliary matrix

\bar{T} \in C^{N \times N}

is given by

\bar{T} = T (I_{N}^{} + \frac{1}{N} \sum_{k = 1}^{K} \frac{Φ_{k} {\bar{δ}}_{k}}{{(1 + δ_{k})}_{}^{2}}) T

(16)

and

\bar{δ} \overset{Δ}{=} {[{\bar{δ}}_{1} \dots {\bar{δ}}_{K}]}^{T}

is given as:

\bar{δ} = {(I_{K} - J)}^{- 1} \bar{v}

(17)

where

J \in C^{K \times K}

and

\bar{v} \in C^{K}

are obtained from the expressions given in (18) and (19).

{[J]}_{k, l}^{} = \frac{tr (Φ_{k} T Φ_{l} T)}{{(N (1 + δ_{k}))}^{2}}, 1 \leq k, l \leq K,

(18)

{[\bar{v}]}_{k}^{} = \frac{1}{N} tr (T Φ_{k} T), 1 \leq k \leq K .

(19)

Parameter

\bar{ξ} \in R

in (12) is obtained by substituting the matrices

T

and

\bar{T}

into:

\bar{ξ} = {(tr (T) - ζ tr ({\bar{T}}^{}))}^{- 1}

(20)

The auxiliary variable

{\bar{μ}}_{k, i} \in R

in (12) is obtained from the expressions given in (21)–(23),

\begin{matrix} {\bar{μ}}_{k, i} & = \frac{1}{N} tr (R_{k}^{} T_{i}^{'}) \\ - \frac{2 Re (tr (Φ_{k}^{} T) tr (Φ_{k}^{} T_{i}^{'})) (1 + δ_{k}) - tr {(Φ_{k}^{} T)}^{2} δ_{i}^{'}}{{(N (1 + δ_{k}))}^{2}} \end{matrix}

(21)

\begin{matrix} T_{i}^{'} & = T (Φ_{i}^{} + \frac{1}{N} \sum_{k = 1}^{K} \frac{Φ_{k}^{} δ_{k}^{'}}{{(1 + δ_{k})}_{}^{2}}) T \end{matrix}

(22)

\begin{matrix} δ^{'} & = {(I_{K} - J)}^{- 1} v^{'} \end{matrix}

(23)

where

v^{'} \in C^{K}

denotes

{[v^{'}]}_{k}^{} = \frac{1}{N} tr (T Φ_{k} T Φ_{k}), 1 \leq k \leq K .

(24)

The aforementioned analyses of BF and RZF precoding with the RMT method can be very useful for validating our work.

6. Results and Discussion

In the first part of this section, we provide comparisons between the sum rates of the CGD training design and the sequences designed based on the SCMI/SMSE iterative algorithms. In the second part of this section, results that characterise the overall computational complexity of the proposed CGD training design and the state-of-the-art SCMI/SMSE designs are presented. For a fair comparison, the same tolerance

ϵ = 0.001

is used for all the training designs. This comparison is carried out based on the scattering one ring (OR) channel model [30], which is frequently encountered in the open literature on MIMO evaluation. The system parameters in the OR model are determined by the angular spread

ω

, angles of arrival

θ_{k}

, and antenna spacing D as described in [30]. Table 2 is provided to summarize the simulation parameters that are used in the performance evaluation.

6.1. Achievable Sum Rate Performance Evaluation

In this subsection, the curves for the BF and RZF precoding are obtained numerically based on the random matrix theory method as in [10], while simulated curves are plotted based on Equation (3). The number of incidences of the channel realizations used in the Monte Carlo simulations is given as

10^{4}

. The results are presented with

θ_{k} = {- 57 . 5^{\circ}, - 45^{\circ}, - 41 . 5^{\circ}, - 23^{\circ}, - 7 . 5^{\circ}, 7 . 5^{\circ}, 23 . 5^{\circ}, 41 . 5^{\circ}, 45^{\circ}, 57 . 5^{\circ}}

, [30,57]. The other salient system parameters are

T_{} = 100

symbols,

ρ_{d}

= 10 dB and

K = 10

users. The results in Figure 2 demonstrate that the proposed CGD training design achieves the best rate performances. The results in Figure 2 confirm that significant improvement in the rate performances are obtained for both the BF and RZF precoders when the channels are strongly correlated, i.e.,

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

.

Figure 3 examines the achievable sum rate comparing different training sequence designs under a weak channel correlation, i.e.,

ω_{} = 25^{\circ}

,

D = 1

. Figure 3 shows that, with weak channel correlation, some loss in the rate performances is obtained for both the BF and RZF precoders with the SCMI training sequence design, which uses a Lagrangian multiplier iterative algorithm, in comparison with the SMSE and the proposed CGD training designs. The degradation in the achievable sum rate performance with the SCMI training design can be justified as the Lagrangian-based iterative algorithm does not consider the spatial properties of the parameter space such as a Grassmannian space or Riemannian manifold in the training sequences optimization. In particular, using the smooth parameter spaces in the sequences optimization allows for an efficient precoding design to be achieved, hence maximizing the sum rate performance of the FDD m-MIMO systems.

6.2. Computational Complexity Evaluation

The results from Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 show plots of the overall computational complexity versus the number of BS antennas N, comparing the proposed CGD training sequence design with the SMSE and SCMI training designs using the pilot length for BF and RZF precoding. The number of iterations each algorithm required to converge are provided in Table 3, Table 4, Table 5, Table 6 and Table 7, which are used to obtain the simulated results from Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8, respectively. The results are presented based on the scattering OR channel model, when the system parameters K, N,

T_{c}

, SNR, angle of arrivals (AoAs), normalised antenna spacing D, and angular spread

ω_{}

are used. The AoAs are distributed in the range

[- 60^{\circ}, 60^{\circ}]

, i.e.,

θ_{k} = {- 57 . 5^{\circ}, - 45^{\circ}, - 41 . 5^{\circ}, - 23^{\circ}, - 7 . 5^{\circ}, 7 . 5^{\circ}, 23 . 5^{\circ}, 41 . 5^{\circ}, 45^{\circ}, 57 . 5^{\circ}}

, [30,57]. The overall computational complexity of the proposed training optimization algorithm and the state-of-the-art SCMI [30] and SMSE [31] algorithms are obtained based on multiplying the number of iterations each algorithm needs to converge by the number of flops involved per iteration.

The results show that the proposed CGD iterative algorithm for the DL training sequence optimization achieves a faster convergence rate in comparison with the state-of-the-art iterative algorithms. This is clearly indicated by the number of iterations given in Table 3, Table 4, Table 5, Table 6 and Table 7. The fast convergence speed provided by the proposed CGD algorithm arises from the use of matrix exponential search over the Riemannian manifold. Specifically, the proposed CGD iterative algorithm exhibits fewer iterations to converge due to the significant advantages of the almost periodic property of the matrix exponential search over the smooth Riemannian manifold. This signifies the feasibility of the proposed CGD algorithm to optimize the achievable sum rate of the FDD m-MIMO systems with limited coherence time in comparison with state-of-the-art iterative algorithms. Although the proposed CGD provides fast convergence speed, the results demonstrate that as the number of BS antennas N increases, the SCMI and proposed CGD iterative algorithms exhibit almost the same overall computational complexity. For example, the results in Figure 4 show that at

N = 350

, the proposed CGD iterative algorithm offers a comparable overall computational complexity to the SCMI algorithm, which implies that the SCMI iterative algorithm requires fewer flops at this BS array size. Nonetheless, the proposed CGD iterative algorithm still provides a larger achievable sum rate performance than the SCMI algorithm under a relatively weak correlated channels, see e.g., Figure 3. In addition, the SMSE algorithm grows asymptotically at about the same rate at large N, since it exhibits the same order of complexity

O (N^{2})

as other algorithms. However, the

N^{2}

term for SMSE has a much higher multiplier, which gives the large gap above the SCMI and the proposed CGD algorithms. The results provided in this subsection indicate that the relative ratio between the three iterative algorithms for the training sequence optimization depends on the number of BS antennas N and the levels of spatial correlation. Noting that the non smooth curves associated with different iterative algorithms occur because the number of iterations each algorithm requires to converge are different for different values of N. In addition, the results show that the performance of the SMSE based approach is decreased when N is increased as indicated by Figure 6b and Figure 8a. This is attributed to the fact that the convergence rate of the SMSE algorithm is not propositional to the number of BS antenna N. These results are linked to Table 5 and Table 7, which clearly show that the convergence rates are not propositional to the number of BS antenna (N). Overall, the results show that the proposed training algorithm maximizes the achievable sum rate performance, while delivering a lower overall computational complexity owing to a faster convergence rate in comparison to the state-of-the-art iterative algorithms. Table 8 presents a comparison of the fundamental characteristics of the proposed method with the state-of-the-art-methods.

7. Conclusions

This paper explained that optimizing the DL training sequences for CSI estimation in a general scenario with single-stage precoding and K independent channel correlation matrices is very challenging. To overcome this issue, this paper investigated a geometrical optimization approach, which utilizes a computationally efficient Riemannian manifold, with an aim to maximize the achievable sum rate performance, while improving the convergence rate to achieve a fast CSI estimation for an FDD m-MIMO system. To this end, a new computationally efficient gradient-based iterative algorithm, which uses the matrix exponential search on the Riemannian manifold, is proposed to optimize the DL training sequence in an FDD m-MIMO system, while delivering a lower overall computational complexity owing to a faster convergence rate. The sum rate performance of the proposed CGD algorithm is also compared with the rates achieved by the best known state-of-the-art iterative algorithms. Additionally, the computational complexity is analyzed and compared with the state-of-the-art SCMI and SMSE iterative algorithms. The number of convergence iterations of the proposed CGD training design is also provided. Consequently, the overall computational complexity of the proposed CGD training design is compared with the state-of-the-art iterative algorithms. Importantly, the results demonstrated that, under relatively weakly correlated channels, over a 5 bit/s/Hz improvement in the achievable sum rate is obtained using the proposed CGD iterative algorithm in comparison to the SCMI algorithm. This achievement signifies the advantages of the proposed approach for the training sequence optimization of FDD m-MIMO systems compared with the SCMI approach. The results indicated that the proposed CGD algorithm is able to achieve the best sum rate performance owing to a faster convergence rate. This is attributed to the efficient matrix exponential search on the Riemannian manifold, which allows the algorithm to converge faster with fewer iterations. The results showed that the proposed CGD algorithm maximized the achieved sum rate of FDD m-MIMO systems with a reduced overall computational complexity.

Author Contributions

Conceptualization, M.A.; methodology, M.A. and M.A.N.; software, S.H.A. and M.A.; validation, M.A.N., M.A. and B.M.M.; investigation, M.A.N. and B.M.M.; writing—original draft preparation, M.A. and B.M.M.; writing—review and editing, S.H.A. and M.A.; visualization, M.A.N. and B.M.M.; supervision, M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All data are available within the manuscript.

Acknowledgments

The authors would like to thank the University of Baghdad for general support.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AoA	angle of arrivals
BS	Base Station
IoE	Internet of Everything
m-MIMO	Massive MIMO
CSI	Channel State Information
CGD	Conjugate gradient-descent
SCMI	Sum conditional mutual information
SMSE	Sum mean square error
FDD	Frequency Division Duplex
DL	Downlink
UL	Uplink
TDD	Time division duplex
Massive MIMO	m-MIMO
5G	Fifth generation
6G	Sixth generation
iid	Independently and identically distributed
SINR	Signal-to-interference-plus-noise ratio
OR	One ring
BF	Eigenbeamforming
RZF	Regularized zero forcing

References

Alsabah, M.; Naser, M.A.; Mahmmod, B.M.; Abdulhussain, S.H.; Eissa, M.R.; Al-Baidhani, A.; Noordin, N.K.; Sait, S.M.; Al-Utaibi, K.A.; Hashim, F. 6G wireless communications networks: A comprehensive survey. IEEE Access 2021, 9, 148191–148243. [Google Scholar] [CrossRef]
Cisco. Cisco Annual Internet Report (2018–2023) White Paper; Cisco: San Jose, CA, USA, 2020; Available online: https://www.cisco.com/c/en/us/solutions/collateral/executive-perspectives/annual-internet-report/white-paper-c11-741490.html (accessed on 23 February 2022).
Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas. IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [Google Scholar] [CrossRef]
Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling Up MIMO: Opportunities and Challenges with Very Large Arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [Google Scholar] [CrossRef] [Green Version]
Omer, D.S.; Hussein, M.A.; Mina, L.M. Ergodic capacity for evaluation of mobile system performance. J. Eng. 2020, 26, 135–148. [Google Scholar] [CrossRef]
Guo, W.; Turyagyenda, C.; Hamdoun, H.; Wang, S.; Loskot, P.; O’Farrell, T. Towards a low energy LTE cellular network: Architectures. In Proceedings of the 2011 19th European Signal Processing Conference, Barcelona, Spain, 29 August–2 September 2011; pp. 879–883. [Google Scholar]
Gkonis, P.K.; Trakadas, P.T.; Kaklamani, D.I. A comprehensive study on simulation techniques for 5g networks: State of the art results, analysis, and future challenges. Electronics 2020, 9, 468. [Google Scholar] [CrossRef] [Green Version]
Ericsson. Ericsson Mobility Report: On the Pulse of the Network Society; Technical Report; Ericsson: Stockholm, Sweden, 2016; Available online: https://www.ericsson.com/assets/local/mobility-report/documents/2016/ericsson-mobility-report-november-2016.pdf (accessed on 12 March 2022).
Alsabah, M.; Naser, M.A.; Mahmmod, B.M.; Noordin, N.K.; Abdulhussain, S.H. Sum rate maximization versus MSE minimization in FDD massive MIMO systems with short coherence time. IEEE Access 2021, 9, 108793–108808. [Google Scholar] [CrossRef]
Alsabah, M.; Vehkapera, M.; O’Farrell, T. Non-Iterative Downlink Training Sequence Design Based on Sum Rate Maximization in FDD Massive MIMO Systems. IEEE Access 2020, 8, 108731–108747. [Google Scholar] [CrossRef]
Kotecha, J.H.; Sayeed, A.M. Transmit signal design for optimal estimation of correlated MIMO channels. IEEE Trans. Signal Process. 2004, 52, 546–557. [Google Scholar] [CrossRef] [Green Version]
Björnson, E.; Ottersten, B. A Framework for Training-Based Estimation in Arbitrarily Correlated Rician MIMO Channels With Rician Disturbance. IEEE Trans. Signal Process. 2010, 58, 1807–1820. [Google Scholar] [CrossRef] [Green Version]
Noh, S.; Zoltowski, M.D.; Sung, Y.; Love, D.J. Pilot Beam Pattern Design for Channel Estimation in Massive MIMO Systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 787–801. [Google Scholar] [CrossRef] [Green Version]
Choi, J.; Love, D.J.; Bidigare, P. Downlink Training Techniques for FDD Massive MIMO Systems: Open-Loop and Closed-Loop Training With Memory. IEEE J. Sel. Top. Signal Process. 2014, 8, 802–814. [Google Scholar] [CrossRef] [Green Version]
So, J.; Kim, D.; Lee, Y.; Sung, Y. Pilot Signal Design for Massive MIMO Systems: A Received Signal-To-Noise-Ratio-Based Approach. IEEE Signal Process. Lett. 2015, 22, 549–553. [Google Scholar] [CrossRef] [Green Version]
Naser, M.A.; Alsabah, M.; Mahmmod, B.M.; Noordin, N.K.; Abdulhussain, S.H.; Baker, T. Downlink training design for FDD massive MIMO systems in the presence of colored noise. Electronics 2020, 9, 2155. [Google Scholar] [CrossRef]
Naser, M.A.; Salman, M.I.; Alsabah, M. The role of correlation in the performance of massive MIMO systems. Appl. Syst. Innov. 2021, 4, 54. [Google Scholar] [CrossRef]
Naser, M.A.; Alsabah, M.Q.; Taher, M.A. A partial CSI estimation approach for downlink FDD massive-MIMO system with different base transceiver station topologies. Wirel. Pers. Commun. 2021, 119, 3609–3630. [Google Scholar] [CrossRef]
Gao, Z.; Dai, L.; Dai, W.; Shim, B.; Wang, Z. Structured Compressive Sensing-Based Spatio-Temporal Joint Channel Estimation for FDD Massive MIMO. IEEE Trans. Commun. 2016, 64, 601–617. [Google Scholar] [CrossRef] [Green Version]
Han, Y.; Lee, J.; Love, D.J. Compressed Sensing-Aided Downlink Channel Training for FDD Massive MIMO Systems. IEEE Trans. Commun. 2017, 65, 2852–2862. [Google Scholar] [CrossRef]
Nouri, N.; Azizipour, M.J.; Mohamed-Pour, K. A Compressed CSI Estimation Approach for FDD Massive MIMO Systems. In Proceedings of the 2020 28th Iranian Conference on Electrical Engineering (ICEE), Tabriz, Iran, 4–6 August 2020; pp. 1–6. [Google Scholar]
Shi, Y.; Jiang, Z.; Liu, Y.; Wang, Y.; Xu, S. A Compressive Sensing Based Channel Prediction Scheme with Uneven Pilot Design in Mobile Massive MIMO Systems. In Proceedings of the 2021 13th International Conference on Wireless Communications and Signal Processing (WCSP), Changsha, China, 20–22 October 2021; pp. 1–6. [Google Scholar]
Han, T.; Zhao, D. On the Performance of FDD Cell-Free Massive MIMO with Compressed Sensing Channel Estimation. In Proceedings of the 2021 IEEE 21st International Conference on Communication Technology (ICCT), Tianjin, China, 13–16 October 2021; pp. 238–242. [Google Scholar]
Mei, Y.; Gao, Z. CS-Based CSIT Estimation for Downlink Pilot Decontamination in Multi-Cell FDD Massive MIMO. In Proceedings of the 2021 IEEE/CIC International Conference on Communications in China (ICCC), Xiamen, China, 28–30 July 2021; pp. 1–5. [Google Scholar]
Adhikary, A.; Nam, J.; Ahn, J.Y.; Caire, G. Joint Spatial Division and Multiplexing: The Large-Scale Array Regime. IEEE Trans. Inf. Theory 2013, 59, 6441–6463. [Google Scholar] [CrossRef]
Nam, J.; Caire, G.; Ha, J. On the role of transmit correlation diversity in multiuser MIMO systems. IEEE Trans. Inf. Theory 2017, 63, 336–354. [Google Scholar] [CrossRef]
Wu, X.; Yang, X.; Ma, S.; Zhou, B.; Yang, G. Hybrid Channel Estimation for UPA-Assisted Millimeter-Wave Massive MIMO IoT Systems. IEEE Internet Things J. 2021. [Google Scholar] [CrossRef]
Mirzaei, J.; ShahbazPanahi, S.; Sohrabi, F.; Adve, R. Hybrid Analog and Digital Beamforming Design for Channel Estimation in Correlated Massive MIMO Systems. IEEE Trans. Signal Process. 2021, 69, 5784–5800. [Google Scholar] [CrossRef]
Bazzi, S.; Xu, W. On the Amount of Downlink Training in Correlated Massive MIMO Channels. IEEE Trans. Signal Process. 2018, 66, 2286–2299. [Google Scholar] [CrossRef]
Jiang, Z.; Molisch, A.F.; Caire, G.; Niu, Z. Achievable Rates of FDD Massive MIMO Systems With Spatial Channel Correlation. IEEE Trans. Wirel. Commun. 2015, 14, 2868–2882. [Google Scholar] [CrossRef] [Green Version]
Bazzi, S.; Xu, W. Downlink Training Sequence Design for FDD Multiuser Massive MIMO Systems. IEEE Trans. Signal Process. 2017, 65, 4732–4744. [Google Scholar] [CrossRef]
Couillet, R.; Debbah, M. Random Matrix Methods for Wireless Communications; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Hoydis, J.; Ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of Cellular Networks: How Many Antennas Do We Need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [Google Scholar] [CrossRef] [Green Version]
Xie, H.; Gao, F.; Jin, S.; Fang, J.; Liang, Y.C. Channel estimation for TDD/FDD massive MIMO systems with channel covariance computing. IEEE Trans. Wirel. Commun. 2018, 17, 4206–4218. [Google Scholar] [CrossRef] [Green Version]
Abdulhasan, M.Q.; Salman, M.I.; Ng, C.K.; Noordin, N.K.; Hashim, S.J.; Hashim, F. Review of channel quality indicator estimation schemes for multi-user MIMO in 3GPP LTE/LTE-A systems. KSII Trans. Internet Inf. Syst. (TIIS) 2014, 8, 1848–1868. [Google Scholar]
Salman, M.I.; Abdulhasan, M.Q.; Ng, C.K.; Noordin, N.K.; Ali, B.M.; Sali, A. A partial feedback reporting scheme for LTE mobile video transmission with QoS provisioning. Comput. Netw. 2017, 112, 108–121. [Google Scholar] [CrossRef]
Abdulhasan, M.Q.; Salman, M.I.; Ng, C.K.; Noordin, N.K.; Hashim, S.J.; Hashim, F.B. Approximate linear minimum mean square error estimation based on channel quality indicator feedback in LTE systems. In Proceedings of the 2013 IEEE 11th Malaysia international conference on communications (MICC), Kuala Lumpur, Malaysia, 26–28 November 2013; pp. 446–451. [Google Scholar]
Abdulhasan, M.Q.; Salman, M.I.; Ng, C.K.; Noordin, N.K.; Hashim, S.J.; Hashim, F. An adaptive threshold feedback compression scheme based on channel quality indicator (CQI) in long term evolution (LTE) system. Wirel. Pers. Commun. 2015, 82, 2323–2349. [Google Scholar] [CrossRef]
Al-Utaibi, K.A.; Abdulhussain, S.H.; Mahmmod, B.M.; Naser, M.A.; Alsabah, M.; Sait, S.M. Reliable recurrence algorithm for high-order Krawtchouk polynomials. Entropy 2021, 23, 1162. [Google Scholar] [CrossRef]
Abdulhussain, S.H.; Mahmmod, B.M.; Naser, M.A.; Alsabah, M.Q.; Ali, R.; Al-Haddad, S. A robust handwritten numeral recognition using hybrid orthogonal polynomials and moments. Sensors 2021, 21, 1999. [Google Scholar] [CrossRef]
Abdulmajeed, M.M.S.; Omran, B.M. Pilot Based Channel Estimation and Synchronization in OFDM System. J. Eng. 2020, 26, 50–59. [Google Scholar] [CrossRef]
Gu, Y.; Zhang, Y.D. Information-Theoretic Pilot Design for Downlink Channel Estimation in FDD Massive MIMO Systems. IEEE Trans. Signal Process. 2019, 67, 2334–2346. [Google Scholar] [CrossRef]
Tomasi, B.; Decurninge, A.; Guillaud, M. SNOPS: Short non-orthogonal pilot sequences for downlink channel state estimation in FDD massive MIMO. In Proceedings of the IEEE Globecom Workshops (GC Wkshps), Washington, DC, USA, 4–8 December 2016; pp. 1–6. [Google Scholar]
Stoica, P.; Linebarger, D.A. Optimization result for constrained beamformer design. IEEE Signal Process. Lett. 1995, 2, 66–67. [Google Scholar] [CrossRef]
Fiori, S.; Uncini, A.; Piazza, F. Application of the MEC network to principal component analysis and source separation. In Proceedings of the International Conference on Artificial Neural Networks, Lausanne, Switzerland, 8–10 October 1997; pp. 571–576. [Google Scholar]
Yang, J.; Williams, D.B. MIMO transmission subspace tracking with low rate feedback. In Proceedings of the (ICASSP’05) IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, USA, 23 March 2005; Volume 3, pp. iii/405–iii/408. [Google Scholar]
Yu, X.; Shen, J.C.; Zhang, J.; Letaief, K.B. Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems. IEEE J. Sel. Top. Signal Process. 2016, 10, 485–500. [Google Scholar] [CrossRef] [Green Version]
Cichocki, A.; Amari, S.i. Adaptive Blind Signal and Image Processing: Learning Algorithms and Applications; John Wiley & Sons: Hoboken, NJ, USA, 2002. [Google Scholar]
Nikpour, M.; Manton, J.H.; Hori, G. Algorithms on the Stiefel manifold for joint diagonalisation. In Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA, 13–17 May 2002; Volume 2, pp. 1481–1484. [Google Scholar]
Absil, P.A.; Mahony, R.; Sepulchre, R. Optimization Algorithms on Matrix Manifolds; Princeton University Press: Princeton, NJ, USA, 2009. [Google Scholar]
Liu, F.; Masouros, C.; Amadori, P.V.; Sun, H. An efficient manifold algorithm for constructive interference based constant envelope precoding. IEEE Signal Process. Lett. 2017, 24, 1542–1546. [Google Scholar] [CrossRef] [Green Version]
Abrudan, T.E.; Eriksson, J.; Koivunen, V. Steepest descent algorithms for optimization under unitary matrix constraint. IEEE Trans. Signal Process. 2008, 56, 1134–1147. [Google Scholar] [CrossRef]
Abrudan, T.; Eriksson, J.; Koivunen, V. Conjugate gradient algorithm for optimization under unitary matrix constraint. Signal Process. 2009, 89, 1704–1714. [Google Scholar] [CrossRef]
Palka, T.A. Bounds and Algorithms for Subspace Estimation on Riemannian Quotient Submanifolds; University of Rhode Island: Kingston, RI, USA, 2016. [Google Scholar]
Naser, A.A.; Abdulameer, L.F.; Bayat, O. SINR and Capacity Analysis for Multiuser MIMO Interference Channels. In IOP Conference Series: Materials Science and Engineering, Proceedings of the 2nd International Scientific Conference of Engineering Sciences (ISCES 2020), Diyala, Iraq, 16–17 December 2020; IOP Publishing: Bristol, UK, 2021; Volume 1076, p. 012056. [Google Scholar]
Hunger, R. Floating Point Operations in Matrix-Vector Calculus; Tech. Rep. TUM-LNS-TR-05-05; Technische Universität München, Associate Institute for Signal Processing: München, Germany, 2005. [Google Scholar]
Nam, J.; Adhikary, A.; Ahn, J.Y.; Caire, G. Joint spatial division and multiplexing: Opportunistic beamforming, user grouping and simplified downlink scheduling. IEEE J. Sel. Top. Signal Process. 2014, 8, 876–890. [Google Scholar] [CrossRef]

Figure 1. A base station with N m-MIMO that serves K users in FDD mode.

Figure 2. Achievable sum rate versus N, comparing different training designs with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

.

Figure 2. Achievable sum rate versus N, comparing different training designs with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

.

Figure 3. Achievable sum rate versus N, comparing different training designs with

ω_{} = 25^{\circ}

,

D = 1

.

Figure 3. Achievable sum rate versus N, comparing different training designs with

ω_{} = 25^{\circ}

,

D = 1

.

Figure 4. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Figure 4. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Figure 5. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 25^{\circ}

,

D = 1

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Figure 5. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 25^{\circ}

,

D = 1

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Figure 6. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 5

dB and

K = 10

users.

Figure 6. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 5

dB and

K = 10

users.

Figure 7. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 6

users.

Figure 7. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 6

users.

Figure 8. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 12

users.

Figure 8. Overall computational complexity versus the number of BS antennas N comparing different training sequence methods corresponding to the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 12

users.

Table 1. Computational complexity analysis.

Algorithms	Complexity in Flops per Iteration
SCMI [30]	$T_{tr}^{3} K + T_{tr}^{2} (N (4 K - 2) - 1)) + 4 N^{2} T_{tr} (K - 1) + N T_{tr} (7 - 3 K) - 1$
SMSE [31]	$N T_{tr}^{} (2 K (8 N - 5) + 23 N (t_{d} + t_{h}) + 80 N + 13) - 1 + (K - 1) X$
Proposed CGD	$N T_{tr} + 1 / 3 N^{2} (T_{tr} (3 t_{d g} + 4 t_{h g} + 44) - 3) - 1 + (K - 1) B$

Table 2. Simulation parameters.

Parameters	Symbol	Value
Number of BS antennas	N	1–500
Number of users	K	10
Azimuth standard deviation	$ω$	$2 . 5^{\circ}, 25^{\circ}$
Antenna distance spacing	D	$λ / 2$ , 1
Coherence time	T	100 symbols

Table 3. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Table 3. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω_{} = 2 . 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

N	SCMI	SMSE	CGD	N	SCMI	SMSE	CGD
(a) BF precoding				(b) RZF precoding
10	85	120	66	10	151	40	66
50	3151	767	201	50	3151	413	54
100	4593	786	226	100	5660	166	84
150	4210	669	190	150	3376	669	190
200	4120	1276	182	200	4120	1276	182
250	2816	2172	223	250	2816	2172	223
300	2915	1909	315	300	2915	1909	315
350	2655	2341	316	350	2849	1562	316
400	5557	1419	343	400	2851	2261	309
450	5727	3766	420	450	5727	2500	326
500	5245	2637	379	500	4893	2022	336

Table 4. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 25^{\circ}

,

D = 1

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

Table 4. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 25^{\circ}

,

D = 1

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 10

users.

N	SCMI	SMSE	CGD	N	SCMI	SMSE	CGD
(a) BF precoding				(b) RZF precoding
10	316	25	23	10	163	3	3
50	13742	394	129	50	5865	136	64
100	15806	1278	162	100	11823	870	270
150	8449	1606	217	150	10546	1416	238
200	10297	1392	311	200	7824	844	309
250	7069	2298	310	250	7069	2127	267
300	6401	2437	332	300	7568	2833	271
350	7360	1958	313	350	6618	2563	287
400	7897	2294	317	400	6959	2655	273
450	6484	2264	278	450	6448	2483	296
500	9592	2106	358	500	9920	2483	292

Table 5. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 5

dB and

K = 10

users.

Table 5. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 5

dB and

K = 10

users.

N	SCMI	SMSE	CGD	N	SCMI	SMSE	CGD
(a) BF precoding				(b) RZF precoding
10	62	119	100	10	253	56	71
50	2589	866	258	50	2465	337	106
100	3189	602	165	100	2044	602	165
150	2417	836	272	150	1979	836	272
200	3485	822	354	200	3485	1583	268
250	4161	3028	269	250	3711	3028	269
300	4787	3078	342	300	4787	1298	338
350	4455	3110	473	350	4833	884	358
400	5933	2384	544	400	4845	2384	470
450	6808	2800	583	450	6451	1338	494
500	6156	1803	508	500	8930	3277	553

Table 6. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 6

users.

Table 6. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 6

users.

N	SCMI	SMSE	CGD	N	SCMI	SMSE	CGD
(a) BF precoding				(b) RZF precoding
10	49	55	50	10	209	60	24
50	1763	288	117	50	1763	155	73
100	4545	318	140	100	2177	188	102
150	2689	747	155	150	2281	747	155
200	2391	1091	165	200	2679	853	161
250	5800	1041	200	250	4789	1225	264
300	5831	1355	280	300	4776	1126	263
350	5627	1428	298	350	6494	1580	263
400	7380	1763	303	400	8822	1490	346
450	9942	1739	403	450	9942	1739	403
500	6266	1633	347	500	7171	1917	400

Table 7. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 12

users.

Table 7. The number of iterations per each training sequence algorithm required to converge with respect to the number of BS antennas N for the pilot length in BF and RZF precoding in the OR model with

ω = 5^{\circ}

,

D = 1 / 2

,

T_{} = 100

symbols,

ρ_{d} = 10

dB and

K = 12

users.

N	SCMI	SMSE	CGD	N	SCMI	SMSE	CGD
(a) BF precoding				(b) RZF precoding
10	74	49	47	10	348	49	26
50	7154	469	163	50	7154	220	88
100	4979	1574	234	100	5385	348	135
150	4882	2149	237	150	4291	836	238
200	4139	3654	385	200	4321	1001	250
250	5099	7357	533	250	3796	3087	329
300	5539	7101	616	300	5686	7113	537
350	5508	6428	785	350	6008	6428	584
400	5409	6772	827	400	5850	7133	761
450	8468	3089	886	450	10823	6915	588
500	9564	2901	974	500	7536	7736	703

Table 8. Comparison between the methods to compare the fundamental characteristics of the proposed method with the state-of-the-art-methods.

Algorithms	Fundamental Characteristics
SCMI [30]	Training sequences optimization based on a sum conditional mutual information (SCMI) maximization problem. Uses gradient with Karush-Kuhn-Tucker (KKT) conditions of the Lagrangian dual problem based approach. Requires high number of iterations to converges. The complexity is still high in comparison to the proposed CGD.
SMSE [31]	Training sequences optimization based on a sum conditional mutual information (SCMI) maximization problem. Uses a Grassmannian manifold based approach. Requires a highest number of iterations to converges. Delivers a highest overall computational complexity.
Proposed CGD	Training sequences optimization based on a sum achievable sum rate maximization problem. Uses a Riemannian manifold based approach. Requires a lowest number of iterations to converges. Delivers a lower overall computational complexity.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alsabah, M.; Naser, M.A.; Mahmmod, B.M.; Abdulhussain, S.H. A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems. Network 2022, 2, 329-349. https://doi.org/10.3390/network2020021

AMA Style

Alsabah M, Naser MA, Mahmmod BM, Abdulhussain SH. A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems. Network. 2022; 2(2):329-349. https://doi.org/10.3390/network2020021

Chicago/Turabian Style

Alsabah, Muntadher, Marwah Abdulrazzaq Naser, Basheera M. Mahmmod, and Sadiq H. Abdulhussain. 2022. "A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems" Network 2, no. 2: 329-349. https://doi.org/10.3390/network2020021

APA Style

Alsabah, M., Naser, M. A., Mahmmod, B. M., & Abdulhussain, S. H. (2022). A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems. Network, 2(2), 329-349. https://doi.org/10.3390/network2020021

Article Menu

A Computationally Efficient Gradient Algorithm for Downlink Training Sequence Optimization in FDD Massive MIMO Systems

Abstract

1. Introduction

2. System Model Description

3. CSI Estimation Using DL Training Sequences

4. Training Sequence Optimization Based on CGD Algorithm over the Riemannian Manifold

5. Achievable Sum Rate Analysis Using RMT Method

6. Results and Discussion

6.1. Achievable Sum Rate Performance Evaluation

6.2. Computational Complexity Evaluation

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI