1. Introduction
Cooperative multiple-input multiple-output (MIMO) communication techniques, wherein data exchange between MIMO terminal nodes is assisted by one or multiple MIMO relays, have been studied for Long Term Evolution Advanced (LTE-A) cellular systems [
1,
2,
3], since they assure significant performance gains in terms of coverage, reliability, and capacity. Relay technology has been also considered for Internet of Things (IoT) applications, by allowing in particular the support of the massive access for fog and social networking services [
4,
5,
6]. One of the main changes when going from LTE-A to 5th generation (5G) systems is the spectrum use at radically higher frequencies in the millimeter-wave (mmWave) range [
7]. However, mmWave signals are highly susceptible not only to blockages from large-size structures, for example, buildings, but they are also severely attenuated by the presence of small-size objects, for example, human bodies and foliage [
8]. In this regard, cooperative MIMO technology additionally represents a possible approach for circumventing the unreliability of mmWave channels [
9] in 5G networks.
In addition, 5G systems have stringent requirements in terms of spectral efficiency. Many relaying protocols operate in 
half-duplex mode [
10,
11,
12,
13], where two time slots are required to perform a single transmission, due to the inability of the relays to receive and transmit at the same time. To overcome the inherent halving of spectral efficiency, a possible remedy for 5G applications is to adopt 
two-way relaying [
14] (see 
Figure 1), which works as follows: (i) in the first slot, the two terminal nodes simultaneously transmit their precoded signals to the relays; (ii) in the second slot, the relays precode and forward the received signals to the terminals. Since each terminal knows its own transmitted signal, the effects of self-interference can be subtracted from the received signal at the terminals, and the data of interest can be decoded. On the other side of the coin, with respect to the one-way relaying setting, the optimization of two-way cooperative networks is complicated by fact that terminal precoders/decoders and relay forwarding matrices are coupled among themselves.
Design and performance analysis of two-way cooperative MIMO networks encompassing multiple 
amplify-and-forward (AF) or 
non-regenerative relays has been considered in References [
15,
16,
17,
18,
19]. Compared with the single-relay case [
20], the multiple-relay scenario generally leads to more challenging 
nonconvex constrained optimization problems, which are usually solved by burdensome iterative procedures. In Reference [
15], by adopting a weighted sum-mean-square-error (MSE) or a sum-rate cost function, iterative gradient descent optimization algorithms are proposed, with transmit-power constraints imposed at both the terminals and the relays. A similar scenario is considered in Reference [
16] and Reference [
17]. In Reference [
16], the original constrained minimum sum-MSE nonconvex optimization problem is iteratively solved. Specifically, the algorithm of Reference [
16] starts by randomly choosing the terminal precoders and the relay forwarding matrices satisfying the transmission power constraints at the source terminals and the relay nodes. In each iteration, the terminal precoders, the relay forwarding matrices, and the decoders are alternatingly updated in Reference [
16] through solving convex subproblems: first, with given precoders and relay forwarding matrices, the optimal decoders are obtained in closed-form by solving an unconstrained convex problem; second, with fixed precoders and decoders, the relaying matrix of all the relays are updated in closed-form one-by-one by freezing the relaying matrices of the other relays; finally, given the decoders and relaying matrices, the precoders are updated by solving a convex quadratically constrained quadratic programming problem. A different iterative optimization procedure is proposed in Reference [
17], based on the matrix conjugate gradient algorithm, which is shown to converge faster than conventional gradient descent methods. Finally, some recent papers [
18,
19] propose architectures for two-way relaying based on relay/antenna selection strategies.
In this paper, we propose an optimization algorithm for two-way AF MIMO relaying 5G networks, where terminal precoders/decoders and relay forwarding matrices are jointly derived under power constraints on the transmitted/received power at the terminals. Rather than attempting to solve it iteratively, we derive a relaxed version of the original minimum sum-MSE nonconvex optimization, which allows one to decompose it in two separate problems that admit a closed-form, albeit suboptimal, solution. We show by Monte Carlo trials that our closed-form approach performs comparably or better than representative iterative approaches proposed in the literature for the same scenario with a reduced computational complexity, especially for increasing values of the number of relays.
  2. Network Model and Basic Assumptions
We consider the two-way MIMO 5G network configuration of 
Figure 1, where bidirectional communication between two terminals, equipped with 
 and 
 antennas, respectively, is assisted by 
 half-duplex relays, each equipped with 
 antennas. We assume that there is no direct link between the two terminals, due to high path loss values or obstructions. Even though our approach can be generalized, for simplicity, the considered physical layer is that of a single-carrier cooperative system where all the channel links are quasi static and experience flat fading.
Let 
 and 
 denote the symbol vectors to be transmitted by terminal 1 and 2, respectively. In the first time slot, each terminal precodes its symbols with matrix 
, for 
, before transmitting it to the relays, which thus receive 
, for 
, where 
 is the 
first-hop channel matrix (from terminal 
i to relay 
k), and 
 models additive noise at 
kth relay. By defining 
, the overall signal received by the relays can be compactly written as
      
      where 
 gathers all first-hop channels and the vector 
 gathers all the noise samples.
In the second time slot, the 
kth relay forwards its received signal 
, by using the relaying matrix 
, thus transmitting 
. The received signal at each terminal can be written, for 
, as 
      
      where 
 is the 
second-hop channel matrix (from relay 
k to terminal 
i), and the vector 
 is additive noise at terminal 
i. Additionally, we have defined in (
2) the extended matrices 
 and 
. Moreover, by taking into account (
1), the vector 
 can also be directly written in terms of 
 and 
 as
      
      where 
 is the 
dual-hop matrix from terminal 
j to 
i, for 
, and  vector 
 is the overall noise.
We assume customarily [
14,
18] that each terminal can estimate and subtract the self-interference deriving from its own symbols. To do this, terminal 
i has to first acquire the matrix 
, which can be obtained by resorting to standard training-based identification methods. Specifically, each data transmission can be preceded by a training period, wherein the two terminals transmit orthogonal pilot sequences to the relays. In this case, by redefining 
 with a slight abuse of notation as 
, for 
, we write explicitly
      
      where 
 when 
, whereas 
 when 
.
At terminal i, vector  is subject to linear equalization through matrix , thus yielding a soft estimate  of the symbols  transmitted by terminal , whose entries are then subject to minimum-distance hard decision.
In the sequel, we consider the common assumptions: (a1)  and  are mutually independent zero-mean circularly symmetric complex (ZMCSC) random vectors, with , for ; (a2) the entries of  and  are independent identically distributed ZMCSC Gaussian unit-variance random variables, for ; (a3) the noise vectors ,  and  are mutually independent ZMCSC Gaussian random vectors, statistically independent of , with  and , for .
Full channel-state information (CSI) is assumed to be available at both the terminals and the relays. Particularly, we assume that: (i) 
 are known at the terminals and at the relays; (ii) the 
kth second-hop channel matrices 
 and 
 are known only to the 
kth relay, for 
; (iii) the dual-hop channel matrix 
 and the covariance matrix
      
      of 
 are known at the 
ith terminal, for 
. It should be noted that, hereinafter, all the ensemble averages are evaluated for fixed values of the first- and second-hop channel matrices.
  3. The Proposed Closed-Form Design
With reference to model (
4), the problem at hand is to find optimal values of 
, 
, and 
 for recovering 
 and 
 according to a certain cost function and subject to suitable power constraints at the terminals and relays.
A common performance measure of the accuracy in recovering the symbol vector 
 at terminal 
 is the mean-square value of the error 
: 
, where 
 is the error covariance matrix, which depends on 
. As a global cost function for the overall two-way transmission, we consider as in References [
15,
16,
17,
18] the 
sum-MSE, defined as 
. It is well-known that, for fixed values of 
 and 
, the matrices 
 minimizing the sum-MSE are the Wiener filters
      
      for 
, thus yielding
      
It is noteworthy that the variables 
, 
, and 
 are coupled in (
7) and, hence, the two terms in (
7) cannot be minimized independently. Herein, we relax the original problem so as to 
separate the minimization of the two terms in (
7).
As a first step, we observe that minimizing (
7) is complicated by the presence of 
, which depends non-trivially on 
. For such a reason, we consider instead minimization of the following high signal-to-noise ratio (SNR) approximation: 
      which turns out to be accurate when 
, where 
 is the smallest eigenvalue of 
. Suitable constraints must be set to avoid trivial solutions in minimizing (
8). It is customary to impose power constraints to limit the average transmit power at the terminals:
      for 
. In order to limit 
, we impose a constraint on the average power received at the terminals in the second time slot, that is, with reference to (
2), we attempt to limit, for 
, the following quantities: 
      where 
 is the covariance matrix of 
. It is noteworthy that (
10) is typically limited in those scenarios where a target performance has to be achieved and per-node fairness is not of concern [
10,
12]. Moreover, the average power received at the terminals is an important metric measuring the human exposure to radio frequency (RF) fields generated by transmitters operating at mmWave frequencies [
21] and, with respect to traditional per-relay transmit power constraints, it is more easily related to regulatory specifications [
22]. To simplify (
10), we exploit the following chain of inequalities: 
      where the last approximate inequality holds noting that, for fixed values of 
, by the law of large numbers one has 
 almost surely as 
 gets large. Therefore, if we impose 
, we get the upper bound: 
Such a choice allows one to considerably simplify the system design. In summary, the optimization problem to be solved can be expressed as
      
In order to find a closed-form solution of (
13), we introduce the matrix 
, with 
, and rewrite (
13) as follows
      
Remarkably, the cost function is the sum of two terms: the former one depends only on the variables 
, whereas the latter one involves only the variables 
. Therefore, (
14) can be decomposed in two problems involving 
 and 
 separately, which can be solved in parallel in a closed-form manner. Indeed, capitalizing on such a decomposition, the solution of (
14) can be characterized by the following theorem.
Theorem 1. Assume that: (a4)  is full-column rank, that is, , ; (a5)  is full-column rank, that is, , for . Moreover, let  denote the singular value decomposition (SVD) of , where  and  are the unitary matrices of left/right singular vectors, and  is the rectangular diagonal matrix of the corresponding singular values arranged in increasing order. Then, the solution of (14) has the following general form:where  contains the  rightmost columns of ,  contains the  rightmost columns of , the diagonal matrices  and  will be specified soon after, and  is an arbitrary semi-unitary matrix, that is, .  Remark 1.(a4) implies that , .
Remark 2.(a5) implies that  is full-column rank too, that is,  and . Hence, in the following we set .
Under (a4) and (a5), the dual-hop channel matrices  are full-column rank, that is, , for : this ensures perfect recovery of the source symbol vectors  at the terminals in the absence of noise by means of linear equalizers. Although Theorem 1 holds for any value of  and , we will assume herein that , which allows the terminals to transmit as many symbols as possible with an acceptable performance in practice.
Theorem 1 allows one to rewrite the optimization problem (
14) in a simpler scalar form:
      with 
 and 
 representing the 
ℓth squared diagonal entry of 
 and 
, respectively, whereas 
 denotes the 
ℓth nonzero singular value of 
, for 
. Similarly to (
14), problem (
17) can be decomposed into two separate problems involving disjoint subsets of variables.
It can be shown, with straightforward manipulations, that the objective function in (
17) is convex if and only if
      
, with 
. It is also seen that, based on (a2), one has 
 in the large 
 limit, with 
. Thus, condition (
18) boils down to 
, for all 
, with 
, which is already included in the constraints of (
17). Therefore, convex programming can be used to find a global minimum of (
17).
To calculate the relaying matrices, let us partition solution (
16) as 
, with 
, 
. Defining 
 and 
, and assuming that 
 is full-row rank, that is, 
, with 
, the 
kth relay can construct its own relaying matrix by solving the matrix equation 
, whose minimum-norm solution is given by
      
      where the superscript † denotes the Moore-Penrose inverse.    
      
| Algorithm 1: The proposed design algorithm. | 
| Input quantities: Output quantities:  and
 
   1.Choose arbitrary  such that .  2.Perform the SVD of . and collect the  largest singular values and the corresponding left/right singular vectors.  3.Solve the convex problem (17 ) in the disjoint subsets of variables   and   separately.  4.From the solution of step 3, build the matrices .  5.Build the matrices   according to (15 ) and (16 ).  6.Calculate   according to (19 ).  7.Calculate   according to (6 ).
 | 
With reference to the step-by-step description of the proposed design algorithm reported at the top of this page, the following comments are in order. The convex optimization in step 3 can be efficiently carried out using standard techniques, such as the interior-point method. We observe that the worst-case theoretical complexity of the interior-point method is proportional to 
. Hence, for a realistic setting of the system parameters, the computational complexity of the proposed algorithm, is dominated by the SVD computation (in step 2), which is of order 
 and, thus, it 
linearly grows with the number 
 of relays. It is noteworthy that, even though the alternating algorithm proposed in Reference [
16] allows to solve a nonconvex problem by solving convex subproblems, it is more complex than calculating the solution of (
17); moreover, it requires proper initialization to monotonically converge to (at least) a local optimum.
  4. Simulation Results
In this section, to assess the performance of the considered design, we present the results of Monte Carlo computer simulations, aimed at evaluating the average (with respect to channel realizations) bit-error-rate (BER) of the proposed cooperative two-way MIMO system. We consider a network encompassing two terminals equipped with  antennas, and transmitting QPSK symbols with . The  relays are equipped with  antennas. We also assume that , for all , where  represents the average transmitted power at the kth relay, and set . Consequently, the energy per bit to noise power spectral density ratio  is a measure of the per-antenna link quality of both the first- and second-hop transmissions. The BER is evaluated by carrying out  independent Monte Carlo trials, with each run using independent sets of channel realizations and noise, and an independent record of  source symbols.
We compare the performances of our design (labeled as “Proposed”) to those of the iterative technique proposed in Reference [
16], which has been shown [
16] in its turn to outperform other iterative techniques, such as the gradient-descent technique of Reference [
15]. It is worthwhile to note that both the strategies under comparison require the same amount of CSI. Furthermore, since the method of Reference [
16] imposes different power constraints on the design of the relaying matrices, our solutions for 
 are properly scaled so as to ensure that the average power transmitted by each relay is the same for both methods.
In 
Figure 2, 
Figure 3 and 
Figure 4, we report the BER for different values of the number 
 of relays. Results in 
Figure 2 for 
 show that the proposed closed-form design, based on the solution of the relaxed problem (
14), exhibits performances comparable with the iterative solution of Reference [
16] in the considered range of 
 values only when the latter employs more than 5 iterations. Specifically, when the method of Reference [
16] employs 10 iterations, a crossover can be observed in 
Figure 2 between the BER curve of the proposed algorithm and that of Reference [
16]. This behavior is due to the fact that the rate of convergence of Reference [
16] strongly depends on the SNR.
Figure 3 and 
Figure 4 show that, as the number of relays increases, the proposed method clearly outperforms the method of Reference [
16] even when the latter employs 10 iterations. Performance improvement of Reference [
16] is negligible after 10 iterations.
 In a nutshell, although the alternating iterative procedure [
16] attempts to solve the nonconvex original two-way constrained minimum sum-MSE problem, its convergence behaviors are affected in practice by both the operative SNR and number of relays: in the low-SNR region and/or when the number of relays is sufficiently large, convergence to a local minimizer is not guaranteed in a reasonable number of iterations for all possible initializations. This is the price to pay for swapping a difficult joint optimization with a sequence of easier problems involving subsets of the variables. On the other hand, the proposed optimization strategy gives up the idea of solving the original nonconvex problem, by resorting to suitable relaxations of both the cost function and the relaying power constraint. This allows us to jointly optimize all the variables, without  using burdensome iterative algorithms.