Rate Optimization of Two-Way Relaying with Wireless Information and Power Transfer

We consider the simultaneous wireless information and power transfer in two-phase decode-and-forward two-way relaying networks, where a relay harvests the energy from the signal to be relayed through either power splitting or time splitting. Here, we formulate the resource allocation problems optimizing the time-phase and signal splitting ratios to maximize the sum rate of the two communicating devices. The joint optimization problems are shown to be convex for both the power splitting and time splitting approaches after some transformation if required to be solvable with an existing solver. To lower the computational complexity, we also present the suboptimal methods optimizing the splitting ratio for the fixed time-phase and derive a closed-form solution for the suboptimal method based on the power splitting. The results demonstrate that the power splitting approaches outperform their time splitting counterparts and the suboptimal power splitting approach provides a performance close to the optimal one while reducing the complexity significantly.


Introduction
The Internet-of-Things (IoT) is envisioned to make operation of our today life more convenient and productive by connecting things to collect data, make intelligent decisions, and perform autonomous controls without human intervention [1,2].One of the major concerns in developing IoT systems is the energy consumption, not only in the data centers processing big data [3,4], but also in the end-devices sensing and exchanging data [1,5].The devices should be even sustainable without a battery or with a small battery when they are deployed in either hazardous or vast areas due to the high cost of battery replacements.In this context, wireless power transfer through radio frequency signals has received huge attention as a way of charging the devices without battery replacements [5,6] and traditional wireless and mobile communication networks [7][8][9][10] are now being reformed by incorporating the energy harvesting capability at the devices.
In particular, various relaying protocols have been rebuilt to support energy harvesting devices in cooperative networks [11][12][13][14][15][16][17][18][19][20][21][22][23][24][25][26], where relays can be categorized into either energy harvesting nodes or energy transmission nodes.Energy harvesting relays are as low in complexity as communicating devices and harvest the energy in helping the data exchange of the devices in general [11][12][13][14][15][16][17][18][19][23][24][25][26].On the other hand, energy transmission relays are as highly complex as base stations so that the relays transfer the power and information to the communicating devices [20][21][22]; the relays even employ massive antennas to leverage the efficiency in the power and information transfer at a high cost in implementation [27,28].This paper focuses on the former case in which there is no infrastructure relay and one of neighboring peer devices takes the role of signal relaying by spending the energy harvested by means of simultaneous wireless power and information transfer (SWIPT).

Related Works
This subsection discusses the relevant literature on SWIPT in cooperative networks, where the relays harvest energy and then transmit a signal.To accomplish SWIPT practically, the received signal should be split into two branches for energy harvesting and information decoding.This is performed either by power splitting (PS) or by time splitting (TS) at a single antenna receiver, although their variants exist [29].These splitting methods are investigated in cooperative networks with one-way relaying (OWR) protocols [11,[16][17][18][19]24,26] and with two-way relaying (TWR) protocols [12,13,15,23,25].Here, amplify-and-forward (AF) relays are adopted in [11][12][13]15,24] for a lower complexity with the simple relay operation amplifying the received signal without information decoding, while decode-and-forward (DF) relays are adopted in [16][17][18][19]23,25] for a better performance with the complicated relay operation regenerating the relay signal after decoding the received signal.
To be more specific, for the OWR protocol, the performance of the PS-SWIPT and TS-SWIPT methods is studied with an AF relay in terms of the outage probability and ergodic capacity [11]; with a DF relay in terms of the ergodic capacity, outage capacity [16], maximum achievable rate [17], and outage probability [26].The performance of the PS-SWIPT for the OWR protocol is also studied with a full-duplex DF relay [19] and with the AF relay when the direct link is available [24].In addition, the optimization of the SWIPT parameters is also studied for the OWR with a DF relay by using statistical information [18].
For the TWR protocol, the SWIPT methods are combined with the spectrally efficient two-phase protocols composed of multiple access (MAC) and broadcasting (BC) phases [12,13,23,25]; an exception is observed in [15], which incorporates the PS-SWIPT and the TS-SWIPT into the four-phase TWR for an easier modification in spite of a large loss in the spectral efficiency.As in the OWR protocol, the outage probability and ergodic capacity are analyzed for the AF-TWR protocol employing the PS-SWIPT [12].A source node selection method is devised for wireless power transfer in the AF-TWR protocol employing the TS-SWIPT [13].In addition, the outage probability of the DF-TWR protocol with the PS-SWIPT and the TS-SWIPT is studied without resource allocation in cognitive radio environments [23] and with outage-optimal resource allocation in non-cognitive radio environment [25].Some studies on the TWR with energy harvesting nodes are also observed in [14,20,22], but they are irrelevant since the SWIPT is not considered due to a different energy harvesting scenario [14] and an infrastructure relay transmitting the power is considered for the AF-TWR in [20,22].

Motivation and Contribution
The SWIPT methods are less studied for the TWR than for the OWR, and their initial studies for the TWR focus on the AF relays for low-complexity implementations [12,13].In addition, the studies focus on the achievable performance when the SWIPT methods are introduced to the TWR protocols [23], although the traditional TWR protocols without energy harvesting are studied in various aspects from performance analysis to the resource allocation [30][31][32][33].To leverage the performance at the given resources, resource allocation methods are essential.Therefore, they should be studied also for the TWR protocols employing the SWIPT methods.In such a case, the DF-TWR is preferable to the AF-TWR since the former not only outperforms the latter without resource allocation but also provides an additional degree of freedom in resource allocation by allowing unequal time durations for the MAC and BC phases; the time durations for two phases should be kept equal for the AF-TWR to relay the non-regenerative signal.
In this context, the PS-SWIPT and the TS-SWIPT have been introduced to the DF-TWR protocol recently and several resource allocation problems are addressed to optimize the outage probability by adapting the time-phase ratio and the SWIPT parameters [25].The study reveals that the PS-SWIPT approach outperforms the TS-SWIPT in the outage probability and an appropriate choice of the time-phase ratio leverages the outage performance significantly by balancing the performance in two directions.This paper revisits the DF-TWR network, employing two SWIPT methods in [25], but aims at maximizing the sum rate of the network instead of minimizing the outage probability.We formulate the problem optimizing the time-phase ratio of the DF-TWR and the splitting ratio of the SWIPT method for each of the PS-SWIPT and TS-SWIPT methods.The joint optimization problems are transformed to convex optimization problems after appropriate transformation if required so that they can be solved with an existing solver at some reasonable complexity.To reduce the computational complexity further, we also present the suboptimal methods adapting the splitting ratio for the fixed time-phase ratio and derive a closed-form solution readily available for the PS-SWIPT method.
The remainder of the paper is organized as follows: Section 2 describes the system model of the DF-TWR with the PS-SWIPT and the TS-SWIPT and provides the rate performance of the DF-TWR achievable with each SWIPT method.Section 3 formulates the resource allocation problems for the two SWIPT methods and provides the methods of finding the optimal and suboptimal solutions.Numerical results are provided in Section 4 for the two SWIPT methods with different resource allocation methods.Section 5 draws concluding remarks.
Notation: Vectors and matrices are expressed by boldface lowercase and uppercase letters, respectively, where 0 n denotes the length-n column vector with all zero entries and I n denotes the n × n identity matrix.Functions max(x, y) and min(x, y) denote the maximum and minimum of x and y, respectively, while C(x) = log 2 (1 + x) denotes the capacity at signal-to-noise ratio x.Distribution CN(µ, σ 2 ) denotes the complex Gaussian with mean µ and variance σ 2 .An optimization problem is stated as max x f (x) s.t.x ∈ Ω, where f (x) is a scalar-valued function called the objective function, x is the decision variable, Ω is the constraint set (feasible set), and max and s.t. are abbreviations for maximize and subject to, respectively.An optimal solution of the optimization problem is expressed as

System Model
We consider a three-device TWR network in Figure 1a, where a relay device Q cooperates in the information exchange of two communicating devices S 1 and S 2 .All devices operate in half-duplex and are equipped with a single antenna.The channel between S i and Q is denoted by h i for i = 1, 2 under the assumption of flat fading and channel reciprocity.
Figure 1b,c illustrate the SWIPT methods for the DF-TWR protocol when devices S 1 and S 2 exchange their messages w 1 and w 2 with each other.For the DF-TWR, a data frame is divided into MAC and BC phases of δN and (1 − δ)N durations, respectively, where N is the number of symbols comprising the data frame and 0 < δ < 1 is the time-phase (TP) ratio of the MAC phase in the data frame.In the MAC phase, S 1 and S 2 transmit their symbol vectors x 1 and x 2 containing the messages, as in [25], to the relay simultaneously.The relay harvests the energy as well as decodes the messages w 1 and w 2 by splitting the received signal in the MAC phase.The signal splitting is accomplished either in the power domain with PS ratio θ, as in Figure 1b, or in the time domain with TS ratio τ, as shown in Figure 1c, where 0 < θ < 1 and 0 < τ < δ.The harvested energy is then given by where η is the energy harvesting efficiency, P i is the transmit power of S i for i = 1, 2, and the background noise is ignored.On the other hand, the received signal vector for the information decoding in the MAC phase can be stated as where n Q ∼ CN(0 δN , σ 2 I δN ) is the vector of the background noise composed of the baseband equivalent antenna noise and the noise introduced by the conversion from the RF signal to the baseband signal [25].
In the BC phase, the relay transmits the network coded symbol vector x bc [30,33] of the decoded messages ŵ1 and ŵ2 by consuming the energy harvested in the MAC phase.This paper does not take into account the energy consumption in the decoding process so that the transmit power of the relay is given by The signal received at S i in the BC phase is then given by where ) is the background noise vector at S i for i = 1, 2.
Version October 10, 2017 submitted to Information 4 of 14 (a) Energy Information where η is the energy harvesting efficiency, 2 i is the transmit power of S i for i = 1, 2, and the background noise is ignored.On the other hand, the signal for the information decoding in the MAC !phase can be stated as where n Q ∼ CN(0, σ 2 ) is the background noise composed of the baseband equivalent antenna noise # and the noise introduced by the conversion from the RF signal to the baseband signal [23].

$
In the BC phase, the relay transmits the network coding x bc [27,30] of the decoded messages Ŵ1

Achievable Rate Region
Let R i be the rate delivered for the ith data flow from S i to S 3−i , i = 1, 2. The achievable rate region D of the rates (R 1 , R 2 ) for the DF-TWR is given by [14,25] where I mac i and I bc i are the maximum achievable rates in the MAC and BC phases, respectively, of the ith data flow for i = 1, 2 whilst I mac Σ is the maximum sum rate achievable in the MAC phase.
For the DF-TWR, the maximum achievable rate I mac i from S i to Q is derived from (2) as where σ 2 is the SNR from S i to Q, and the factors δ and (δ − τ) represent the MAC-phase data fraction in a data frame for the PS-SWIPT and the TS-SWIPT, respectively.The maximum sum rate in the MAC phase is also obtained as which is achievable by applying the successive interference cancellation to (2).The achievable sum rate for the MAC phase is explained in detail in [34] and is also summarized for the DF-TWR in [14,33].
On the other hand, the maximum achievable rate in the BC phase for the ith data flow is given by for the PS-SWIPT and for the TS-SWIPT from (1), we have where α i = ησ 2 γ i (γ 1 +γ 2 )

Rate-Optimal Resource Allocation
We aim at maximizing the sum rate R 1 + R 2 by optimizing the parameters for the SWIPT and the DF-TWR based on the instantaneous channel state information (CSI) (γ 1 , γ 2 ).We provide optimal and suboptimal resource allocation methods for the PS-SWIPT and their counterparts for the TS-SWIPT.

Resource Allocation for the PS-SWIPT
For the PS-SWIPT, we optimize the PS ratio θ and TP ratio δ to maximize the sum rate.The problem is formulated with (4) as max (δ,θ):0≤δ,θ≤1 by applying I mac i , I mac Σ , and I bc i , for the PS-SWIPT given in ( 5), (6), and (8), respectively.The problem has the linear objective function, the linear constraints 0 ≤ θ, δ ≤ 1 and the convex constraints (10) and (12) with linear functions in the left hand side (LHS) and a concave function C(x) in the right hand side (RHS).However, the constraints (11) are not convex due to the factor θδ in C(x).
By introducing a new variable z = θδ, we transform ( 9)-( 12) into max (δ,z):0≤δ,z≤1 which is convex with the convex constraints ( 14)-( 16); the RHS of the constraints is the perspective x f (y/x) of a concave function f (x) and is thus concave [35].Therefore, the problem ( 13)-( 16) can be solved with an existing convex optimization solver.As a suboptimal approach, we fix the TP ratio as δ = 1 2 and find the optimal PS ratio θ † to maximize the sum rate.The problem ( 9)-( 12) with the fixed TP ratio becomes max θ:0≤θ≤1 The RHS of ( 18) is concave with respect to θ since the minimum operation preserves the concavity [35] and C(x) is a concave function while the RHS of ( 19) is also concave with concave function C(x).Therefore, the constraints ( 18) and ( 19) are convex so that there exists a unique θ † which maximizes R 1 + R 2 .
Remark 1.The original problem (9)-( 12) for the PS-SWIPT can be solved by an exhaustive search at computational complexity O(( 1) 2 ) with a searching step for two decision variables.After transformation, the problem ( 13)-( 16) can be solved at complexity O 2 4 log 2 1 by applying interior-point methods in solving the convex optimization problem [36].The suboptimal method with δ = 1 2 and θ † has complexity O(1) since the closed-form solution is readily available.

Resource Allocation for the TS-SWIPT
For the TS-SWIPT, we optimize the TS ratio τ and the TP ratio δ to maximize the sum rate of the DF-TWR.The optimization problem is formulated as max (τ,δ):0≤τ≤δ≤1 from (4) by using I mac i , I mac Σ , and I bc i given in ( 5), (6), and ( 8), respectively, for the TS-SWIPT.The problem ( 28)-( 31) has the linear objective function (28), linear constraints ( 29) and (31), and convex constraint (30).The convexity of ( 30) is guaranteed since the LHS is linear and the RHS is concave with the perspective of a concave function [35].Therefore, the problem ( 28)-( 31) is a convex optimization problem that can be solved with an existing convex optimization tool.
As in the PS-SWIPT, a suboptimal solution for the TS-SWIPT can be obtained by fixing δ = δ s for an appropriately choosing constant δ s and finding the optimal TS ratio τ in ( 28)- (31).As another suboptimal approach, we may set δ − τ = 1 − δ (or equivalently, δ = 1−τ 2 ) in ( 28)- (31) for equal time duration in data reception and transmission.Although the closed-form solution for the aforementioned suboptimal problems is not tractable, the solution can be obtained by an existing convex optimization tool at less computational time than that for the original problem for the TS-SWIPT.
Remark 2. The problem (28)- (31) for the TS-SWIPT can be solved at computational complexity O 2 4 log 2 1 by applying interior-point methods for two decision variables [36].The suboptimal method with a fixed δ can be solved at complexity O log 2 1 for a single decision variable.

Numerical Results
We evaluate the performance of the SWIPT methods in the DF-TWR network when the relay is positioned on a straight line, connecting S 1 and S 2 of distance D. The distance between S i and the relay is denoted by d i for i = 1, 2, which satisfies d 1 + d 2 = D with D = 10 m.The transmission power of the sources is equal as P i = P and the noise power is set to σ 2 = −70 dBm.The path loss and Rayleigh fading are included in the channels as h i ∼ CN(0, 0.01d −ν i ) for i = 1, 2, where ν represents the path loss exponent.The energy harvesting efficiency is assumed to be η = 0.5 in all the figures.
Figure 2 provides the constellation of the optimized TP and PS ratios (δ, θ) obtained by solving the problem ( 9)-( 12) for 500 channel realizations when ν = 3 and d 1 /D = 0.5.The optimized values are shown in the subfigures with different transmit power as P = 10, 20, and 30 dBm. Figure 3 provides the constellation of the optimized TP and TS ratios (δ, τ/δ) obtained by solving the problem ( 28)-( 31) under the same configuration with Figure 2. The parameter values optimizing the sum rate are observed to have a different tendency from those optimizing the outage probability [25] in that the TP ratio tends to have values around δ = 0.5 for both the PS-SWIPT and TS-SWIPT.When the transmit power is as small as P = 10 dBm, the splitting ratio (PS ratio for the PS-SWIPT and TS ratio for the TS-SWIPT) tends to have a value close to 1 since the energy harvested in the MAC phase dominates the performance.As the transmit power increases, the splitting ratio varies in a large degree to balance the harvested energy and the reliably deliverable information rate for each channel realization.
Figure 4 compares the sum rate of the PS-SWIPT with different resource allocation methods when P = 20 dBm and ν = 3.The abscissa is the normalized distance d 1 /D between S 1 and Q.The results of "Opt" and "Sub (Tool)" are obtained by employing the convex optimization solver YALMIP [37] to find the solution of the joint optimization problem ( 13)-( 16) and the suboptimal problem ( 17)- (19), respectively, the results of "Sub (Anal)" are obtained with the closed-form solution (24) for the suboptimal problem ( 17)- (19), and the results of "Equal" are obtained with (θ, δ) = (0.5, 0.5).It should be mentioned that the performance obtained with the analytically derived solution is indistinguishable from that obtained with the convex optimization solver for the suboptimal problem.The joint resource allocation method shows the best performance, but the suboptimal method with the closed-form solution provides a performance close to the optimal one.Therefore, the suboptimal method is favorable to avoid the time-consuming computation in solving the joint optimization problems with the convex optimization solver.The sum rate decreases steeply as the relay moves toward the center location due to the energy harvesting relay which suffers from a large loss in the harvested energy as the distance increases.This observation is the opposite of that in the traditional DF-TWR without the SWIPT which provides the best performance at the center location of the relay.Figure 5 compares the sum rate of the TS-SWIPT with different resource allocation methods under the same conditions with Figure 4.In the figure, "Opt" denotes the performance obtained by solving the joint optimization problem ( 28)- (31) and "Equal" denotes the performance with (τ, δ) = (1/3, 1/3).The suboptimal methods "Sub1" and "Sub2" denote the performance obtained by optimizing τ in ( 28)- (31), after fixing δ = 0.5 from the observation in Figure 3 and δ = (1 − τ)/2 for the equal data portions in the MAC and BC, respectively.The solutions for "Opt", "Sub1", and "Sub2" are obtained by employing the convex optimization solver YALMIP [37].The best performance is achieved by the joint optimization method The suboptimal method "Sub1" with δ = 0.5 provides a performance closer to than the suboptimal method "Sub1" with δ = (1 − τ)/2.This observation results from the fact that the BC phase dominates the performance by using the harvested energy so that the suboptimal method "Sub1" with asymmetric time allocation as 0.5 − τ and 0.5 for the MAC and BC phases is more desirable than the suboptimal method 'Sub2' with symmetric time allocation as (1 − τ)/2 and (1 − τ)/2.Figure 6 compares the sum rate of the two SWIPT methods as a function of the normalized distance d 1 /D when P = 20 dBm.The optimal and suboptimal resource allocation methods are compared in the channels with different path exponents of ν = 2.5 and ν = 3, where the suboptimal methods optimize the splitting ratio after fixing δ = 0.5 for both SWIPT methods.The results show that the sum rate performance is improved significantly as the path loss exponent decreases from 3 to 2.5 since the harvested energy increases significantly.The PS-SWIPT methods outperform the TS-SWIPT methods in the sum rate as observed in the outage probability in [25].In addition, the optimal and suboptimal methods exhibit very close performances for each SWIPT method.

Conclusions
The SWIPT performed by power splitting (PS) and time splitting (TS) is studied for the DF-TWR network to maximize the sum rate by optimizing the splitting ratio and the time-phase ratio of the DF-TWR.The joint optimization problem for each SWIPT method is shown to be convex with an appropriate transformation to that it can be solved by an existing convex optimization solver.To lower the computational complexity further, we also propose the suboptimal methods optimizing the splitting ratio for the given time-phase and derive a closed-form solution for the PS-SWIPT.It is observed that the PS-SWIPT outperforms the TS-SWIPT in both optimal and suboptimal methods and the suboptimal method provides a performance close to the optimal one for each SWIPT method.Therefore, the PS-SWIPT with the suboptimal closed-form solution is favorable to the DF-TWR in the aspects of complexity and performance.
As a future work, it would be interesting to extend the sum rate optimization problem to the problem of adapting not only the SWIPT parameter but also the transmit power of the devices under the constraints of the total power consumption and the battery capacity limit at the relay.In addition, the optimization problem in this paper can be extended to a scenario of supporting more than two devices, where the devices share all the data via multi-way relay communications [38].Multi-way relay communications with K devices can be implemented efficiently with one MAC phase and K − 1 BC phases for transmission of the network coded symbols similar to the TWR, for which the SWIPT parameters are optimized to maximize the overall performance.

Figure 4 .Figure 5 .Figure 6 .
Figure 4. Sum rate of the PS-SWIPT with different resource allocation methods as a function of d 1 /D when P = 20 dBm and ν = 3.

Figure 7
Figure7compares the sum rate of the SWIPT methods as the transmit power P varies when ν = 3 and d 1 = d 2 = 5 m.Again, the resource allocation methods for the PS-SWIPT outperform their counterparts for the TS-SWIPT.As the transmit power increases, the gain achieved with the resource allocation becomes more prominent for both the SWIPT methods.

Figure 7 .
Figure 7. Sum rate of the PS-SWIPT and the TS-SWIPT as a function of P dBm when ν = 3 and d 1 /D = 0.5.