Abstract
We investigate the downlink of a cell-free massive multiple-in multiple-out system in which all access points (APs) are connected in a linear-topolpgy fronthaul with constrained capacity and send a common message to a single receiver. By modeling the system as an extension of the multiple-access channel with partially cooperating encoders, we derive the channel capacity of the two-AP setting and then extend the results to arbitrary N-AP scenarios. By developing a cooperating mode concept, we investigate the optimal cooperation among the encoders (APs) when we limit the total fronthaul capacity, and the total transmit power is constrained as well. It is demonstrated that achieving capacity requires a water-pouring distribution of the total available fronthaul capacity over the fronthaul links. Our study reveals that a linear growth of total fronthaul capacity results in a logarithmic growth of the beamforming capacity. Moreover, even if the number of APs would be unlimited, only a finite number of them need to be activated. We found an expression for this number.
1. Introduction
Recently, cell-free massive multiple-input multiple-output (mMIMO) has been considered as a key technology for beyond-5G networks. In such user-centric transmission systems, a large number of distributed access points (APs) are connected to one central processing unit (CPU) via fronthaul links and phase coherently cooperate to cover a wide area for a small number of users in the same time-frequency resource using time-division operation. Compared to cell-based collocated mMIMO solutions, such technology improves energy-spectral efficiency and enhances immunity to shadow fading without extra signal processing burdens. We refer to [,,] and the references therein for a general overview of current developments of cell-free mMIMO.
Effectively utilizing fronthaul resources is of critical importance for deploying a scalable cell-free mMIMO system. Considering the downlink for instance, simple distributed conjugate beamforming is optimal, as shown in []. However, it can already be seen that a large amount of information exchange over fronthaul links is required since all the APs need to know the message that is to be transmitted. A star-topology fronthaul where each APs are individually connected to a CPU was originally modeled and has been widely studied, see, e.g., [,,] and the references therein. Currently, a serial fronthaul connecting APs in a linear topology is considered for achieving a cost-efficient architecture, both in deployment and maintenance []. A novel and promising technique relying on a linear topology is the radio stripe system, where multiple APs are embedded in a cable/strip, see [,] in detail. Such radio stripes can be easily and invisibly deployed indoor or outdoor in existing constructions to enable numerous new applications [].
The focus of prior work in cell-free mMIMO study was on developing wireless signaling techniques. In this paper, we study from an information-theoretic perspective the downlink of a cell-free mMIMO system shown by Figure 1, where single-antenna APs are connected in a linear topology with constrained fronthaul capacities to communicate to one single-antenna terminal receiver (Rx).
Figure 1.
Cell-free transmit beamforming where access points (APs) are connected via serial fronthaul.
The considered multiple-in single-out (MISO) setup forms a distributed massive beamforming system and can be formulated as a multiple-access-channel (MAC) with limited fronthaul capacity, which is defined as the maximal amount of information that can be reliably sent per MAC channel use []. By investigating the channel capacity of such a MAC, we reveal essential relations between the three the most fundamental resources of the system, i.e., the total available number of APs (N), the total transmit power (P), and total available fronthaul capacity (). Specifically, in the current cell-free mMIMO literature, the only configuration of APs that is considered is where full cooperation (full beamforming) is realized and where the same information is shared at all involved APs. Therefore, for a real-valued Gaussian MISO channel with N APs and unity channel gains, the maximum downlink rate is given by the channel capacity
where is the received signal-to-noise ratio (SNR) if only one AP is active with all available transmit power assigned to it. It requires fronthaul capacity among N APs.
In this work, we focus on the case where the available fronthaul capacity is not large enough to support full cooperation of the APs. We were motivated to investigate the achievable downlink rates given that fronthaul resources for communication between the APs is constrained. We call this setting partial beamforming, since . We could derive the channel capacity and the optimal cooperation strategies among APs for given total available P and N.
1.1. Related Work
We can model the studied system as a special extension of the multiple-access channel (MAC) with partially cooperating encoders studied by Willems []. In particular, we can generalize the system setup in [] to a network of encoders by considering only one source but employing an arbitrary number of encoders, namely APs, via unidirectional conferences. Since fronthaul links can be treated as separate channels that are orthogonal to the beamforming MAC, our setup might also be viewed as an extension of a special case of the orthogonal-component relay channel due to El Gamal and Zahedi [], which is generalized to relay networks by Ghabeli and Aref in []. In addition, if only two APs are considered, our study is also strongly related to the multiple access diamond channel as studied in [,]. Moreover, the two APs setup looks very similar to the semi-deterministic relay channels []. Furthermore, it is also worth to note that in our system, all APs cooperatively send one message to a receiver at a same time. In this sense, our channel setting is “noncausal”, which is related to the relay-with-delay channel studied in [] in general.
1.2. Contributions and Organization
By investigating the MAC with limited fronthaul capacity in the discrete channel case and in the Gaussian channel cases, the main findings of our research work include
- The channel capacity is found for an arbitrary number of APs for both discrete channel and the Gaussian channel with constrained transit power, where the total fronthaul capacity and the total number of APs are limited.
- When numerous APs are engaged, a linear growth of total fronthaul capacity results in a logarithmic growing of the channel (beamforming) capacity.
- A concept of cooperating modes is developed to demonstrate the optimal cooperation among APs to achieve capacity based on superposition coding.
- When the channel capacity is only limited by the fronthaul capacity, the number of required APs is quasi-linear to the available fronthaul capacity even if the number of APs would be unlimited.
- A new and sharp lower bound of the Lambert-W function is derived for computing the number of required APs given by the total fronthaul constraint.
In the rest of this paper, the system model is first presented in Section 2. In Section 3, we start with investigating a two-APs setting consisting of one fronthaul link. This setting serves as a baseline system where the cooperating mode concept is developed. In Section 4, the study is extended to the case where an arbitrary number of APs is engaged and the behavior and exact solution of the channel capacity is derived. In Section 5, the number of required APs is derived to leverage limited fronthaul resources if the number of available APs is unlimited. Finally, the conclusion and final remarks can be found in Section 6. Detailed proofs and derivations of the presented results are collected in the Appendix A. Partial material in this paper was presented in [].
2. Problem Setup
2.1. Notation
Although all the paper, capital letters, e.g., X, denote random variables, and their realizations are denoted by small letters, e.g., x. The probability mass or density function according to X is denoted by or simply . The expectation of X is denoted by . The entropy of X is denoted by and the differential entropy is denoted by . The mutual information between X and Y is denoted by . The consecutive integer range from i to j with is denoted by . In addition, a set of elements with index m in range of i to j is denoted as .
2.2. System Model
The investigated system is modeled as Figure 2, where we denote the CPU as the source, the APs as encoders, while for the destination, the receiver is denoted as the decoder. As plotted, one-directional fronthaul links connect N adjacent encoders that simultaneously send a uniformly distributed message to a decoder (receiver). We focus on the study of the fronthaul resource usage among all encoders. The discrete memoryless MAC denoted by consists of input alphabets , output alphabet , a transition probability distribution , and a set of fronthaul capacity constraints between N encoders.
Figure 2.
N encoders that cooperate in sending a message W to a decoder with limited fronthaul communication.
Before the beginning of each n channel uses, (partial) information about the generated message W is first shared among N encoders. Let for be the message sent over the fronthaul link between encoder m to encoder . Then, the encoders map the messages W and into codewords as follows
where are the corresponding encoding functions. Meanwhile, the generated fronthaul messages should satisfy
As presented, the corresponding fronthaul link capacity is defined as the maximal amount of information that can be reliably sent per channel use of the MAC channel over the link from encoder m to encoder .
At the decoder, a deterministic decoding function is applied to obtain the message-estimate based on the channel output . We define the average probability of error at the decoder as
Now we say that a rate R is achievable with given fronthaul capacities if there exists N encoders and a corresponding decoder, such that
for all and large enough n. The channel capacity C (of MAC) as a function of the fronthaul capacities is defined as the supremum of all achievable rates given by all the fronthaul constraints. Eventually, we will be interested only in a constraint on the sum of the fronthaul capacities that is defined as
To interpret the capacity results for the partial beamforming, we focus on MACs with additive white Gaussian noise. At the output of the Gaussian MAC, the decoder receives
at time i, where is the transmitted symbol by encoder m and is modeled as independent and identically distributed (i.i.d.) Gaussian noise at the decoder for all . For individual encoder m, , the transmit power constraint is
for . Then, the total transmit power is limited as
Without loss of generality, we assume that . Therefore, the transmit SNR can be directly represented by the total constrained transmit power P.
3. Two-Encoder Result
We first investigate the simplest system setting where only two encoders are involved. The MAC is now denoted by . The fronthaul message must satisfy the constraint
which is same to the total fronthaul capacity in this case. The underlying Gaussian MAC is given by
Although this two-encoder setting can be considered as a special case of related work, see discussion later, we provide here the capacity proofs for both discrete and Gaussian MACs. The applied approach carries over to the N-encoder setting that is investigated in Section 2.
In the following, the channel capacity as a function of the fronthaul capacity is first obtained for the discrete memoryless MAC. Then, we derive capacity results for the Gaussian case with total transmit power constraint. Within this study, a so-called cooperating mode concept is developed that will be very useful to provide cooperation insights among encoders when more of them are engaged.
3.1. Discrete Channel
First, consider the discrete channel setup.
Theorem 1.
For the discrete memoryless channel , the channel capacity C as a function of the fronthaul capacity is given by
where distribution is determined by the input distribution .
The detailed proof is provided in Section Appendix A.1, where the converse is based on the Markovities of and , and the achievability is based on applying superposition coding. For the achievability, the source splits the message W into two parts and delivers the index of over the fronthaul link to encoder 2 that maps into the inner code while encoder 1 of the source encodes into an outer code-word which is super-imposed on the inner code-word. Although this coding scheme is simple, the cooperating mode concept that is important for studying the multi-encoder setup will be developed based on the superposition scheme as discussed later.
Remark 1.
By viewing the two-encoder setting as a special setup of ([], Figure 1), where only one source and one conference link are deployed, we can have Theorem 1 by letting the common message and the conference capacity in ([], Thm.). Note that the achievability in [] which is based on binning becomes superposition coding.
Remark 2.
By viewing the two-encoder setting as a special setup of the multiple access diamond channel where one source connects to two encoders (relays) by using two separate noiseless links, see [,], Theorem 1 can also be obtained if letting , the common message , and the common message rate (or , , ) in ([], Thm. 2). Note that the achievability based on superposition and Marton-coding in [,] becomes superposition coding only.
3.2. Gaussian Channel
Now we consider the Gaussian MAC of the two-encoder channel setting given by (10) with total power constraint P, i.e., , where
This first leads to the following result.
Theorem 2.
The channel capacity of the two-encoder Gaussian MAC is
The proof is the adaptation of the discrete channel version given in Section Appendix A.1 by considering the transmit power constraints and Gaussian channel noise.
Proof.
(i) Converse. First note that without loss of generality (and without violating the power constraints) we may assume that all for all . If we define being the random triple with density then converse in Section Appendix A.1 shows that
where the random variables , satisfy and
First consider the random pair . By applying the Cholesky factorization ([], Thm. 4.2.7) to the covariance matrix of , the assignment of
can be obtained, where and are uncorrelated with zero means and unit variances.
Next, observe that if we take this choice does not affect and , but minimizes the total transmit power for fixed , since
Therefore we only need to consider assignment
Now we take , , and . By denoting
in , we further have
Taking the signal assignment (19) and the power assignment (21) gives that
where and follow by the maximum differential entropy theorem, see ([], Thm. 8.6.5).
(ii) Achievability. Taking the assignment (19) by letting and . Using the power assignment (21) directly gives
The rest of the proof follows by first establishing a coding theorem for the discrete memoryless channel with input cost (power constraint). The step from discrete to Gaussian channels is justified by the relation between differential entropy and discrete entropy, see, e.g., ([], Thm. 9.3.1). □
Now, by optimizing over in (13), we can further express C as a function only in total transmit power P and total fronthaul capacity , which is for this two-encoder setup.
Corollary 1.
The channel capacity of the total transmit power constrained two-encoder Gaussian MAC can be expressed as
Proof.
The two logarithms on the RHS of (13) are monotonically increasing and decreasing in respectively and equal to each other at . Hence, we can set
to obtain the that maximizes C for as
This results in the second capacity expression in (26). Then, if , the second term is always larger than the first term for any in (13). This corresponds to the situation where is large enough and the transmission over the MAC is the bottleneck of the network. In this case, C remains at its global maximum. □
Note that, for , the first term of the capacity result (26) is the channel capacity with no beamforming and the second term directly represents the partial beamforming gain that is independent of transmit power P and only grows as the fronthaul capacity increases. As revealed, the partial beamforming gain increases with a same rate regardless of the transmit power P.
3.3. Cooperating Modes
Based on assignment (19) that possesses a superposition structure, we can naturally denote two cooperating modes as what follows to describe the optimal cooperation between the encoders for the capacity achieving.
- mode 1: Sending a private message given by from encoder 1;
- mode 2: Coherently sending a common message given by from encoder 2 and encoder 1.
According to (20), the parameter represents the fraction of the total transmit power assigned to mode 2 while represents the remaining fraction assigned to mode 1. Note that given by (28) should be taken for achieving the capacity.
Now consider the cooperation scenarios of the two encoders based on the availability of . If , the transmission reduces to the point-to-point communication case. This is represented by having only 1 active and encoder 2 is inactive. If , full cooperation can be achieved by activating 2 only. For , two encoders cooperate to achieve partial beamforming capacity by activating both cooperating modes. Figure 3 illustrates the cooperating modes activating and deactivating at encoders depending on increasing from 0 to .
Figure 3.
Modes activation for the two-encoder Gaussian setting.
For the two-encoder setting, the modes evolution due to available amount of looks straightforward. Nevertheless, it will be shown that this cooperating modes interpretation provides a clear insight of leveraging available encoders for given certain total fronthaul and transmit power constraints, where the optimal cooperation is not trivial as the number of encoders goes largely.
4. N-Encoder Result
Based on the investigations of the two-encoder setting, we extend the study to the system model with arbitrarily N encoders, where . The parameter N in principle can be any large integer so that a distributed massive beamforming is obtained. The investigation is focused on the Gaussian MAC under the constraints of the total fronthaul capacity and the total transmit power P, which are defined by (5) and (8), respectively. Before addressing the exact capacity solution for arbitrary N encoders, we first derive capacity bounds of to provide a general behavior of channel capacity C in total fronthaul capacity . The obtained result indicates that the growth of C requires an exponential growth of . By using the compound mode, the exact capacity solution with the optimal cooperation among encoders are derived. The results show that the distributed beamforming system works most efficiently when it is working in its fronthaul-capacity-limited regime. As a result, we consider the case where encoders are always available to be activated as needed to leverage the entire fronthaul resource.
4.1. Discrete Channel
For simplicity, let the tuple be the collection of ordered transmitted random variables that are generated at encoder l to encoder m with for one channel use. In addition, let be the collection of the corresponding fronthaul capacities.
Theorem 3.
For the discrete memoryless N-encoder setting, channel capacity C of the channel as a function of fronthaul capacities is
with .
A sketch of the proof is given in Section Appendix A.2. As shown in the achievability, the capacity is achieved by applying an N-layer superposition coding among the encoders, which naturally agrees with the studied linear topology.
4.2. Gaussian Channel under Total fronthaul Constraint
By considering on the total power and separate fronthaul constraints, we first have the following result.
Theorem 4.
The the N-encoder Gaussian setting with the total transmit power constraint of P, the channel capacity C as a function of the fronthaul capacities is
where is a probability vector.
Proof.
Similar to proof of the two-encoder setting, the generic signal assignment
can be used at each encoder for , where are uncorrelated and have zero mean and unit variance. Again, we can further apply the special signal assignment
to minimize the total transmit power without affecting dependency of the different signals at the decoder that determines the beamforming capacity. In this way, the transmit power allocated for signal can be expressed by
such that .
The proof shows that all the transmitted signals at encoders should form a Markov chain . Again, since signal represents the common messages used at first l encoders, we say that cooperating mode l is active if the signal is generated and sent and there can be N cooperating modes in total for this N-encoder setting.
Now, we can solve the optimization problem
where is given by (30), to investigate the total power limited capacity C under the constraint of total fronthaul capacity for a given P. To do so, we first prove the following lemma. Note that the full-cooperation capacity is now when N encoders are used. For simplicity, we denote the mutual informations as
for any .
Lemma 1.
For the N-encoder setting with any given , power distribution can only be optimal if equality of all the terms on the RHS of (30) is achieved.
The proof is given in Appendix A.3.
Remark 3.
Lemma 1 indicates that asymmetric distribution of over fronthaul link is optimal. This result will be further demonstrated after the capacity result is derived.
Based on the reduced set given by Lemma 1, we make the terms on the RHS of (30) equal and have
Thus, the channel capacity and the required total fronthaul capacity in the power allocation vector can now be represented as
and
respectively for a fixed P if . Based on (38) and (39), we can have the following theorem.
Theorem 5.
For the N-encoder Gaussian channel under the total power constraint P and total fronthaul constraint , the channel capacity is given by
To evaluate the channel capacity, we only need to maximize the function by introducing a Lagrange multiplier as
under the constraint that is a probability vector to derive the solution of for the general N-encoder case. Note that the parameter is the slope of . However, before working out the exact solution of this optimization problem, we first derive general bounds of to reveal the capacity behavior of the studied distributed beamforming.
4.3. Capacity Behavior Bounds
To obtain a simple but meaningful insight of the relation between C and the constrained and P for an arbitrary N, we propose an upper bound and a lower bound of the channel capacity to draw the following conclusion.
Proposition 1.
For any fixed total transmit power P and number of encoders N, a linear growth of total fronthaul capacity results in a logarithmical growing of the channel capacity as C can be bounded as
and
for .
Proof.
(1) Upper bound. By considering as a random variable with distribution , the capacity (38) can be expressed as
where . By applying Jensen’s inequality, the corresponding fronthaul capacity in (39) can be lower bounded as
where follows and follows for . Since , we can have that results in and thus (43).
(2) Lower bound. Consider time-sharing of the rates given by only using one cooperating mode. Hence, the channel capacity should be larger than or equal to an achievable rate R as
where k is the number of the activated encoders corresponding to the required total fronthaul capacity
that achieves R. By applying for , see, e.g., ([], Section 3.6.15), we have
that gives an upper bound of as
Therefore, we can have that directly gives (44). □
The upper bound (43) and lower bound (44) thus indicate the logarithmical behavior of C in . Figure 4 gives an illustration of these two bounds for 10-encoder Gaussian setting where total transmit power is set at .
Figure 4.
Upper and lower bounds on at for 10-encoder Gaussian setting.
The exact capacity solution derived shortly is plotted as well as a comparison, showing that the bounds describes the capacity behavior.
4.4. Compound Mode and Exact Solution
In what follows, we perform evaluation of the channel capacity given in Theorem 5 by defining a compound mode as a collection of all consecutive cooperating modes between and including modes with . A compound mode is referred to as active if all are nonzero and the other elements in are zeros. Note that using a single mode is a special case of compound mode. By denoting
we have the following results.
Corollary 2.
For an N-encoder Gaussian setting where with a fixed transmit power P, if there is a compound mode such that , where
the channel capacity corresponding to the slope is achieved and only achieved by using that compound mode which gives
and
The proof is given in Appendix A.4.
Remark 4.
The proof in Appendix A.4 shows that if compound mode achieves the capacity and , the modes in should be assigned with same power amount as the optimal setting.
4.5. Modes Selection for Capacity Achieving
The results in Corollary 2 state how the capacity is achieved and expressed over a certain range. To further elaborate how to exactly use cooperating modes from no cooperation to full cooperation, a procedure efficiently activating modes is developed based on applying the following result, where an identification of valid compound modes that are the ones resulting in capacity is provided in terms of using a power penalty.
Corollary 3.
For any fixed P, a compound mode achieves the capacity if and only if
where is the ceiling function. If (57) is satisfied, compound modes with and do not achieve the capacity.
The proof is given in Appendix A.5. The power condition (57) indicates that a compound mode needs certain transmit power to be supported to be optimal. On the other hand, some compound modes can never be optimal if the transmit power is too large.
Now, note that C is monotonically increasing in owing to nonnegative slope and monotonically decreasing in according to (54) when j and k are fixed. It shows that to achieve the capacity, compound modes should be activated in a way such that the corresponding slope range varies from large to small as increases. Therefore, based on the results in Corollary 3, an algorithm is resulted for computing C and over by activating valid compound modes sequentially.
Algorithm 1 represents the cooperating strategy among encoders. It reveals that cooperating modes should be activated one-by-one to form new compound modes with the increase of . At certain point of the growth of , the first mode dies, i.e., deactivated owing to the limited P or N. With the further increasing of , lower modes die in a one-by-one fashion till the full cooperation is obtained.
Figure 5 plots the results for and by applying Algorithm 1, over the full range of . Different number of the available encoders are considered. In the plot, each color segment represents the corresponding activated compound mode. In addition, the pentagram markers label the points where a lower mode has to be deactivated (dead) because of the power penalty (57) or because all N encoders are all used up, namely operations in line 13 and line 9 of the algorithm, respectively. It is shown that for low SNR, i.e., , the modes die fast due to the small power. On the other hand, for large SNR, i.e., , the larger available encoder number the slower the modes die such that higher capacity can be achieved (consider curves of using 2-encoder, 3-encoder, 4-encoder, and 5-encoder).
Figure 5.
at and for different N-encoder settings.
| Algorithm 1 Compute C and from no cooperation to full cooperation |
| Initialize: and Ensure:
|
Proposition 2.
As probably the most natural strategy, the way of applying modes in the lower bound proof of Proposition 1, i.e., time-sharing full cooperation of small number of encoders, is not optimal in general. However, it is sub-optimal when SNR is small as the compound modes that achieve capacity reduce to single modes.
To visualize each mode evolution from no beamforming to total beamforming, we can illustrate the power allocation for each cooperating mode as increases. By incorporating calculations of (given in Appendix A.4) and (37) into Algorithm 1, Figure 6 and Figure 7 show the modes’ power evolution of the 10-encoder setting for and , respectively. It is shown that the first mode dies faster when P is relatively small. They also interestingly show that once is large enough to approach the total beamforming, the last mode dominates as other modes all vanish.
Figure 6.
Optimal power allocation for modes distribution for 10-encoder Gaussian setting at .
Figure 7.
Optimal power allocation for modes distribution for 10-encoder Gaussian setting at .
Moreover, we can also elaborate the cooperating of encoders in terms of showing optimal distribution of over fronthaul links. Figure 8 illustrates the distribution of the 10-encoder setting where the bolder curves are for while the lighter curves are for . In each case, the fronthaul capacity curves for for to are located from left to right in the plot. This result further demonstrates the asymmetric water-pouring assignment of over fronthaul links, see Remark 3.
Figure 8.
Optimal distribution of at and for 10-encoder Gaussian setting.
4.6. Mode and Capacity Regimes
Consider the case where mode achieves the capacity for . In this case, the growth rate of is independent of P and N, see the expression of in (56) with . Therefore, we call that the system works in a fronthaul-capacity-limited regime when a mode is used. The reason why we are interested in the fronthaul-capacity-limited regime is that achieves the fast growth rate regardless of P and N. As a further increase of , the first mode dies due to either limited P or limited N. We then call the system works in a power-limited regime or encoder-limited regime, respectively. When the system is in either power-limited regime or encoder-limited regime, growth is slowed down compared to when the system works in the fronthaul-capacity-limited regime. This is due to the discontinuities of the slope , see the derived optimal upper and lower bounds of . The following result shows how to determine which regime the system works in for given , P, and N.
Proposition 3.
For given P and N, if
the capacity growth is limited by P and the system works in a fronthaul-capacity-limited regime if
where . Otherwise it works in a power-limited regime.
On the other hand, if , the capacity growth is limited by N and the system works in a fronthaul-capacity-limited regime if
Otherwise it works in a encoder-number-limited regime.
Proof.
Modes dying hampers the growth of C in . Consider that the first mode of the compound mode dies because of constrained P not N. In this case, P must satisfy (58) which is given by the lower bound of (57). Consequently, at the moment after the first mode dies, i.e., compound mode is active, we have and given by taking the upper bound of (57). This setting results in so that (59) is obtained by evaluating (55).
Similarly, considering the compound mode can be supported by P, the first mode dies because that no new encoders can be used. At the moment of first mode dying, i.e., compound mode is still active, we thus have and , which also result in . Hence, (60) is resulted. □
Figure 9 plots of 10-encoder setting for and , respectively, where the regime separations are indicated at the first mode dies for both powers. It is illustrated that in the fronthaul-capacity-limited regime, C has the highest growth rate no matter what its initial value is (point-to-point communication). In the next subsection, we focus on a system working at the fronthaul-capacity-limited regime.
Figure 9.
at and for 10-encoder Gaussian setting.
5. Infinitely Many Encoders
Consider designing a system in practice when and P are critical resources while available encoders could be many, for instance, the radio stripe system. Based on the previous study, we should always try to let the system work in its fronthaul-capacity-limited regime where the fronthaul capacity is maximally utilized. Hence, we are motivated to determine the number of encoders that are required to be activated for a given by considering infinitely many of them are available when the system is purely fronthaul constrained.
To directly solve k, the highest active mode that is the number of required encoders, from (55) or (56) is not trivial. To achieve an accurate approximate result, we first need the following lemma, of which the proof follows the outline in [] and is given in Appendix A.6.
Lemma 2.
The non-principle branch of Lambert W function defined in the interval , see [], can be bounded as follows
for .
Remark 6.
Proposition 4.
When the system works in the fornthaul limited regime, the number of required encoders is quasi-linear to the available fronthaul capacity as
where in nats per channel use and is the floor function.
Proof.
As the system works in the fronthaul-limited regime, compound mode exists. Thus, according to (56),
in nats. To upper bound k, we lower bound by taking , see the bound (52), which gives
where follows by applying derived based on ([], 6.1.38), and follows by taking the fact that is monotonically decreasing in k and goes to e as . Therefore, we have
Now, solving k and applying (61) give
Finally, since the number of encoders is a integer, (62) is resulted. □
In Figure 10, the bound of k given by (66) and the actual number encoders required to be activated are plotted as a function of . It is revealed that the derived result is accurate enough.
Figure 10.
Number of required encoders in relation to for fronthaul resource maximum usage.
6. Concluding Remarks
In this paper, the downlink of a cell-free mMIMO in which multiple APs connected in a linear fronthaul topology serve as a single receiver was studied to reveal relations between the three fundamental network resources, namely the total fronthaul capacity , the total transmit number P, and the number of available APs N. Specifically, we focused on partial distributed beamforming where the total available fronthaul capacity is not enough to support full cooperation between all APs, i.e., beamforming. By formulating the problem as a MAC channel with multiple encoders linked in a feed-and-forward setting, we derived the channel capacity as a function of the total fronthaul capacity for both discrete and Gaussian channels. The derivation was started by considering two encoders and then we extended the analysis multiple encoders. It was demonstrated that capacity is achieved by multi-layer superposition coding from which the concept of cooperating mode was developed for the Gaussian channel. This cooperating mode technique leads to optimal cooperation among encoders. Bounds on the capacity for N-encoder setting demonstrated that this channel capacity grows logarithmically in for a fixed P. The exact capacity solution shows that the capacity is achieved if and only if by certain compound modes are used. An algorithm was derived for computing which compound modes should be activated as as function of , which grows from zero to the value obtaining full beamforming. We demonstrated that should be water-poured over the fronthaul links to obtain optimality. Finally, by considering the case where infinitely many encoders are available, we showed that the number of required encoders is quasi-linear to the available total fronthaul capacity when the system is purely constrained by fronthaul resources.
Future directions include extending the results to channels with links which do not have unit gain as is the case here, and considering multiple receivers. Another interesting direction would be the equivalent uplink case.
Author Contributions
All authors conceived the problem and solution. P.Z. wrote the paper. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proofs and Derivations
Appendix A.1. Proof of Theorem 1
(i) Converse. Consider Fano’s inequality for . Since forms a Markov chain, we have that
Moreover, from the Markovity of , we obtain that
In the above derivations, the random variable Q is uniformly distributed on and for . If now both and we obtain that
for some distribution , for all achievable rate R. This concludes the converse for the discrete memoryless two-encoder case.
(ii) Achievability. We prove that if the message rate for a given fronthaul capacity , the message error probability approaches zero if the codeword length n increases. Our coding method is based on superposition.
Codebook Generation: First fix a joint probability distribution . This distribution determines and for with . Now generate at random i.i.d. sequences of length n, each drawn according to and index these sequences as as an inner code, where . Then, for each such , generate sequences drawn according to in an i.i.d. fashion as an outer code, where . The resulting codebook is revealed to both encoders and to the decoder.
Encoding: Split the message W that is uniformly distributed on into with , where the first part , which is uniformly distributed on , is transmitted by encoder 1 and the second part , which is uniformly distributed on and is conveyed to encoder 2 by , is transmitted by two encoders cooperatively. Hence, when , encoder 2 sends while encoder 1 inputs into the MAC.
Decoding: Let . Based on the observed channel output sequence , the decoder finds the message pair such that
where set is the set of jointly -typical sequences, see Cover and Thomas []. If such a pair cannot be found, or if there are more than one such pairs, an error is declared.
Probability of Error: Due to symmetry, the average probability of error is equivalent to the probability of error for an arbitrary message . Hence, without loss of generality, we assume . Thus, we have
where
Due to the Asymptotic Equipartition Property (AEP), it can be shown that
for all n large enough. Moreover
and
Now as long as
for all n large enough. Therefore we take
then both (9) and (A10) are satisfied. Note that this implies that
If we now let , the achievability part of the Theorem 1 is thus established.
Appendix A.2. Proof of Theorem 3
The proof is a generalization of the proof of the two-encoder settings. Consider a simplified block diagram of the N-encoder setting as shown in Figure A1. Now, consider a cut of the fronthaul link between and for any given such that the nodes in the network are separated in two sets of and .
Figure A1.
Simplified illustration of the MAC for the N-encoder setting.
(i) Converse. Consider the Markovity of . By applying Fano’s inequality, we first have
Then, considering the cut between and , we have that
Note that the above result is valid for any m in . Thus, by letting the converse follows.
(ii) Achievability. First consider the message W that can be represented by N independent messages as , where each is uniformly distributed on with . Then, given by the linear topology of encoders, we distribute into the network in the manner illustrated by Figure A2, i.e., for the link between any and , the fronthaul message conveys corresponding messages . Therefore, for a fixed distribution and corresponding marginals, we can first generate i.i.d. n-sequences with according to and then for each generate i.i.d. n-sequences with according to and so on. In this way, an N-layer superposition codebook is generated and revealed at both encoders and decoder.
Figure A2.
N-layer superposition coding message structure.
Thus, for sending a message w, encoders transmit sequences over the MAC channel. At the decoder, a unique message tuple is found by using simultaneous typicality decoding as performed for the two-encoder case. By taking the similar probability of error analysis, it gives that, as long as
we can have for all sufficiently large n and any . By further considering , , …, and , we can subsequently take
Finally, observe that (again subsequently)
which establishes the achievability for and .
Appendix A.3. Proof of Lemma 1
For any fixed , we first consider a realization of . Since for any , by distributing on top of in a water-filling fashion, we can always have two possible cases if the equality of all terms on the RHS of (30) can not be achieved, i.e.,
where j is the smallest index such that for all , and for any m such that , namely the ‘water level’. Once the water-filling is performed, we fix the corresponding distribution of .
For Case (a) where , we now can decrease to increase such that is increased. For Case (b) where , we can look for an with such that by decreasing , increases and increases as well. Therefore, the satisfying Case (a) and Case (b) are not optimal. The equality of the terms on the RHS of (30) is thus necessary.
Appendix A.4. Proof of Proposition 2
Three steps are taken in the proof. In step (1) we show that an active compound mode achieves the capacity if is satisfied. In step (2) we show that using any two separated active modes (all other modes are inactive) does not achieve the capacity. In step (3) we show that exact solutions of (54) and (55) are resulted.
Step (1) Note that function g given in (42) is convex-∩ when if the largest activated mode is k. So, we set the partial derivatives of g with respect to according to the Kuhn–Tucker conditions, see ([], eqn.4.4.10 and eqn.4.4.11), when the active compound mode achieves the capacity. By considering (42) in nats, the partial derivative of function g with respect to is
where .
Step (1.1) Firstly, by only considering that compound mode is active, i.e., all are nonzero, while the other and are zeros, the partial derivative can be expressed as
For simplicity, we denote that
Now, consider that the partial derivatives corresponding to should be all identical to some value , i.e.,
to find the capacity solution in terms of optimal distribution of . Note that
For the case of , we can recursively evaluate the equalities in (A23) as by taking i from k to in a descending order with the use of (A21). In such a way, it is obtained that
Now, based on (A24) and (A25), we can derive expressions of by considering two scenarios.
Scenario 1: Consider , i.e., at least three consecutive modes are active. By taking and , (A25) can be used twice to obtain the equality that gives the relation
For , expression (A25) allows us to further obtain
by taking i in the order of to , which results in an interesting and important relation
Therefore, by applying relation (A26), we can express as
which is valid for the case of as well. So, by setting (A25) equal to (A24) with as
and substituting in (A29), we can first derive the power of modes from to as
According to (A26), we can then obtain the power of the first mode as
Furthermore, owing to , we can finally represent the power of the last mode as
Now, applying the total power constraint, we should have
By substituting (A33), (A31), and (A32), the corresponding slope should simultaneously satisfy
Since , it is easy to see that the lower bound of is . For the upper bound, if , it leads to . Due to , such P results in
which contradicts the lower bound. Therefore, to make the compound mode exist, the slope should be in the range
Scenario 2: Consider , i.e., the compound mode only consists of two modes. So, letting (A24) equal to (A25) gives the relation
By considering now, it is obtained
and a same expression as in (A32). If such compound mode gives optimal solution, the condition should be also satisfied, which gives that is consistent with the result of (A37).
Step (1.2) Secondly, consider that the derivative for with should be less than as denoted in Step (1.1). Since , we have
which directly gives
Step (1.3) Finally, consider that the derivative for with should be less than as well. Since , we have
Similarly, by upper bounding this derivative by given in (A24), we can have
where the summation can be computed by using relation of (A25) as
By further incorporating given by (A24) and given by (A29) into (A44), the inequality becomes to
By substituting (A31) and (A33) for and , it can be easily shown that
where is applied in the bounding for the last step.
Note that, Step (1.2) and (1.3) and resulted bounds of also cover the the case of , i.e., only one mode is active and achieves the capacity. Now, by considering the ranges given by (A37), (A42), and (A47), the slope bounds in (52) and (53) are resulted.
Step (2) Assume that the capacity can also be achieved by only activating any two separated modes and , where and . Then, the Kuhn–Tucker condition requires , which results in the relation of
Note that in only and are nonzero. Moreover, for , we have
By substituting the relation (A48) into above derivative, it can be shown that
where the last step is due to . This result contradicts to the Kuhn–Tucker condition. This demonstrates the only compound modes achieves the capacity.
Appendix A.5. Proof of Corollary 3
Consider four possible settings of and based on (52) and (53) as
where inequalities given by are only valid for . By evaluating the inequalities, the power condition (57) is resulted. Since the and are derived from the Kuhn–Tucker conditions, the derived power condition is a sufficient and necessary condition. This result directly gives that using compound mode with and is not a capacity solution.
Appendix A.6. Proof of Lemma 2
By following the method in [], we define and prove
for . Once this is done, one can take the procedure applied in ([], Thm. 1) and the lower bound (61) follows.
Now, we start converting (A51) to an equivalent problem. First, by substituting in (A51), we need to prove
By denoting with and considering and are monotonically increasing in x for , showing (A52) is equivalent to show
let , we can focus on showing
since . Considering , we finally convert proving (A51) to demonstrating
To do so, we take the first derivative of and set it to zero to have
By solving (A56), we can evaluate the local maxima and/or local minima of . It can be seen that (A56) has only two real roots for , which are
where is the principle branch of the Lambert W function defined over . Based on the property of the Lambert W function, . Therefore, since , we can have two scenarios as
- Scenario 1: if , we must have , i.e., gives a local maxima and gives a minima;
- Scenario 2: if , we must have , i.e., gives a global minima and gives a maxima.
Hence, if we can prove that and , both, we can conclude (A55) and thus (A51). We show this in what follows.
By setting , can be expressed as
If , it is equivalent to have , which results in the range of
must be satisfied. For , by applying the bounds given in ([], Thm. 1) which is , it is easily obtained that , which is in the range (A59). So, follows. For , we apply an upper bound on given in ([], Thm. 2.3), which is
for and . By taking , can be bounded as . By also considering , must in the range (A59) as well, which completes showing .
Numerical evaluation of the bound (61) verifies the proof as illustrated in Figure A3, where the bound in [] is plotted as a reference as well.
Figure A3.
The bounds on the Lambert function .
References
- Ngo, H.Q.; Ashikhmin, A.; Yang, H.; Larsson, E.G.; Marzetta, T.L. Cell-free massive MIMO versus small cells. IEEE Trans. Wirel. Commun. 2017, 16, 1834–1850. [Google Scholar] [CrossRef]
- Zhang, J.; Chen, S.; Lin, Y.; Zheng, J.; Ai, B.; Hanzo, L. Cell-free massive MIMO: A new next-generation paradigm. IEEE Access 2019, 7, 99878–99888. [Google Scholar] [CrossRef]
- Interdonato, G.; Björnson, E.; Ngo, H.G.; Frenger, P.; Larsson, E.G. Ubiquitous cell-free massive MIMO communications. EURASIP J. Wirel. Commun. Netw. 2019, 197. [Google Scholar] [CrossRef]
- Bashar, M.; Cumanan, K.; Burr, A.G.; Ngo, H.Q.; Debbah, M. Cell-Free massive MIMO with limited fronthaul. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas, MO, USA, 20–24 May 2018; pp. 1–7. [Google Scholar]
- Bashar, M.; Cumanan, K.; Burr, A.G.; Ngo, H.Q.; Larsson, E.G.; Xiao, P. On the energy efficiency of limited-fronthaul cell-free massive MIMO. In Proceedings of the 2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–7. [Google Scholar]
- Femenias, G.; Riera-Palou, F. Cell-free millimeter-wave massive MIMO systems with limited fronthaul capacity. IEEE Access 2019, 7, 44596–44612. [Google Scholar] [CrossRef]
- Frenger, P.; Hederen, J.; Hessler, M.; Interdonato, G. Improved Antenna Arrangement for Distributed Massive MIMO (2017). Patent Application WO2018103897. Available online: patentscope.wipo.int/search/en/WO2018103897 (accessed on 21 January 2020).
- Radio Stripes: Re-Thinking Mobile Networks. Available online: https://www.ericsson.com/en/blog/2019/2/radio-stripes (accessed on 21 January 2020).
- Willems, F. The discrete memoryless multiple access channel with partially cooperating encoders (corresp). IEEE Trans. Inf. Theory 1983, 29, 441–445. [Google Scholar] [CrossRef]
- El Gamal, A.; Zahedi, S. Capacity of a class of relay channels with orthogonal components. IEEE Trans. Inf. Theory 2005, 51, 1815–1817. [Google Scholar] [CrossRef]
- Ghabeli, L.; Aref, M.R. A new achievable rate and the capacity of some classes of multilevel relay network. EURASIP J. Wirel. Commun. Netw. 2008, 2008, 135857. [Google Scholar] [CrossRef][Green Version]
- Kang, W.; Liu, N.; Chong, W. The Gaussian multiple access diamond channel. IEEE Trans. Inf. Theory 2015, 61, 6049–6059. [Google Scholar] [CrossRef]
- Saeedi Bidokhti, S.; Kramer, G. Capacity Bounds for Diamond Networks With an Orthogonal Broadcast Channel. IEEE Trans. Inf. Theory 2016, 62, 7103–7122. [Google Scholar] [CrossRef]
- El Gamal, A.; Aref, M. The capacity of the semideterministic relay channel (corresp.). IEEE Trans. Inf. Theory 1982, 28, 536. [Google Scholar] [CrossRef]
- EL Gamal, A.; Hassanpour, N.; Mammen, J. Relay networks with delays. IEEE Trans. Inf. Theory 2007, 53, 3413–3431. [Google Scholar] [CrossRef]
- Zhang, P.; Willems, F.; Huang, L. Capacity study of distributed beamforming in relation to constrained backbone communication. In Proceedings of the 2014 International Symposium on Information Theory and its Applications, Melbourne, VIC, Australia, 26–29 October 2014; pp. 473–477. [Google Scholar]
- Golub, G.H.; Van Loan, C.F. Matirx Computations; The Johns Hopkins University Press: Baltimore, MD, USA, 2013. [Google Scholar]
- Cover, T.; Thomas, J. Elements of Information Theory; Wiley: New York, NY, USA, 2006. [Google Scholar]
- Mitrinović, D.; Vasić, P. Analytic Inequalities, ser. Grundlehren der Mathematischen Wissenschaften; Springer: Berlin, Germany, 1970. [Google Scholar]
- Chatzigeorgiou, I. Bounds on the Lambert function and their application to the outage analysis of user cooperation. IEEE Commun. Lett. 2013, 17, 1505–1508. [Google Scholar] [CrossRef]
- Corless, R.M.; Gonnet, G.H.; Hare, D.; Jeffrey, D.J.; Knuth, D.E. On the Lambert W function. Adv. Comput. Math. 1996, 5, 329. [Google Scholar] [CrossRef]
- Abramowitz, M.; Stegun, I. Handbook of Mathematical Functions, with Formulas, Graphs, and Mathematical Tables; Dover Publications, Incorporated: St. Mineola, NY, USA, 1964. [Google Scholar]
- Gallager, G. Information Theory and Reliable Communication; John Wiley & Sons, Inc.: New York, NY, USA, 1968. [Google Scholar]
- Hoorfar, A.; Hassani, M. Inequalities on the Lambert W function and hyperpower function. J. Inequal. Pure Appl. Math. 2008, 9, 51. [Google Scholar]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).