Next Article in Journal
IViT: An Incremental Learning Method for Object Detection of Hidden Hazards in Transmission Line Corridors
Previous Article in Journal
Radar Interferometry Using gNB Base Stations: Estimation and Compensation of Mast Motion and Atmospheric Effects
Previous Article in Special Issue
A Hybrid Low-Complexity WMMSE Precoder with Adaptive Damping for Massive Multi-User Multiple-Input Multiple- Output Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Robust Low-Complexity WMMSE Precoding Under Imperfect CSI with Per-Antenna Power Constraints

School of Electronic Information, Central South University, Changsha 410004, China
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(1), 159; https://doi.org/10.3390/s26010159
Submission received: 26 November 2025 / Revised: 17 December 2025 / Accepted: 24 December 2025 / Published: 25 December 2025

Abstract

Weighted sum-rate (WSR) maximization in downlink massive multi-user multiple-input (MU-MIMO) with per-antenna power constraints (PAPCs) and imperfect channel state information (CSI) is computationally challenging. Classical weighted minimum mean-square error (WMMSE) algorithms, in particular, have per-iteration costs that scale cubically with the number of base-station antennas. This article proposes a robust low-complexity WMMSE-based precoding framework (RLC-WMMSE) tailored for massive MU-MIMO downlink under PAPCs and stochastic CSI mismatch. The algorithm retains the standard WMMSE structure but incorporates three key enhancements: a diagonal dual-regularization scheme that enforces PAPCs via a lightweight projected dual ascent with row-wise safety projection; a Woodbury-based transmit update that replaces the dominant M × M inversion with an ( N K ) × ( N K ) symmetric positive-definite solve, greatly reducing the per-iteration complexity; and a hybrid switching mechanism with adaptive damping that blends classical and low-complexity updates to improve robustness and convergence under channel estimation errors. We also analyze computational complexity and signaling overhead for both TDD and FDD deployments. Simulation results over i.i.d. and spatially correlated channels show that the proposed RLC-WMMSE scheme achieves WSR performance close to benchmark WMMSE-PAPCs designs while providing substantial runtime savings and strictly satisfying the per-antenna power limits. These properties make RLC-WMMSE a practical and scalable precoding solution for large-scale MU-MIMO systems in future wireless sensor and communication networks.

1. Introduction

Massive MU-MIMO systems are extensively recognized as a fundamental technology for fifth-generation (5G) and subsequent wireless networks. This is due to their ability to significantly spectral efficiency, reliability, and overall network capacity [1,2,3,4,5]. To fully realize these benefits in the downlink, effective precoding is essential to manage multi-user interference. Among linear precoding approaches, the weighted minimum mean-square error (WMMSE) method has gained prominence for achieving near-optimal weighted sum-rate performance in real-world implementations [6,7,8]. Nevertheless, the conventional WMMSE algorithm relies on repeated high-dimensional matrix inversions each iteration, resulting in computational complexity that scales cubically with the number of base station antennas [9,10]. This substantial computational burden challenges its practical deployment in large-scale massive MU-MIMO systems.
In practice, limited pilot resources and severe channel aging in massive MU-MIMO systems, particularly in high-mobility scenarios, make it impractical for the transmitter to obtain accurate channel state information (CSI) for precoding [11], thereby motivating research on robust precoding under imperfect CSI. The imperfect CSI is commonly modeled using a Gaussian-distributed posterior channel model [11,12,13]. Based on this model, robust precoding design can be formulated as an ergodic weighted sum-rate (EWSR) maximization problem subject to per-antenna power constraints (PAPCs). Several iterative robust precoding algorithms have been proposed to address this problem and have demonstrated excellent performance. Furthermore, in many practical BS implementations, each antenna is driven by a separate power amplifier with its own maximum rating [14], so per-antenna power constraints (PAPCs) are often more realistic than a single sum-power constraint (SPC). A simple engineering workaround is to design the precoder under an SPC and then heuristically rescale the columns or rows to satisfy the PAPCs [15], which may cause noticeable WSR loss. To address this, several works have studied WSR maximization directly under PAPCs. For example, zero-forcing (ZF) precoders with per-antenna constraints were analyzed in [16,17,18], but such schemes do not fully exploit the WSR objective because they do not jointly optimize receive filters and user weights. However, many existing approaches rely on fixed approximations or simplified update rules, which can induce non-negligible rate loss and poorer convergence behavior in large-scale regimes [19].
The weighted sum-rate (WSR) maximization problem for the downlink under a sum-power constraint (SPC) is nonconvex and NP-hard [20,21,22]. To address this challenge, the classical weighted minimum mean-square error (WMMSE) algorithm [23] provides an iterative solution by exploiting the fundamental relationship between the mean-square error (MSE) and the signal-to-interference-plus-noise ratio (SINR). This method reformulates WSR maximization as a weighted sum-MSE minimization problem and applies block coordinate descent (BCD) [24], resulting in three closed-form updates for the receive filters, weight matrices, and precoders that converge to a stationary point of the original WSR problem. To alleviate the computational burden, recent research has developed low-complexity variants that approximate WMMSE performance with reduced cost. For example, the rethinking WMMSE (R-WMMSE) algorithm [20] reduces complexity via randomized sketching (data dimensionality reduction). In contrast, our previously proposed LC-WMMSE scheme [25] leverages the structure of the WMMSE update through a Woodbury reformulation combined with a diagonal weight surrogate. While R-WMMSE introduces a probabilistic approximation error due to sketching, LC-WMMSE is deterministic and exhibits per-iteration complexity that scales primarily with the number of data streams rather than the number of BS antennas. Additionally, LC-WMMSE incorporates a hybrid switching mechanism that adaptively blends classical WMMSE updates with lightweight approximations, alongside an adaptive damping strategy that stabilizes the precoder trajectory and accelerates convergence.
Most existing robust and PAPC-aware designs still inherit the high-order matrix inversions of classical WMMSE or rely on generic convex solvers, which become prohibitive for massive MU-MIMO, especially under stochastic CSI mismatch. In contrast, this article proposes a robust low-complexity WMMSE-based framework (RLC-WMMSE) that simultaneously (i) handles imperfect CSI, (ii) enforces PAPCs, and (iii) maintains a low per-iteration complexity. The proposed RLC-WMMSE enforces PAPCs through a lightweight diagonal dual-regularization scheme while keeping the main transmit update in a ( N K ) × ( N K ) symmetric positive-definite (SPD) form via a Woodbury identity, so that the dominant cost scales with N K rather than M. This makes the method particularly suitable for large-scale MU-MIMO with per-antenna power limits and imperfect CSI. In summary, the main contributions of this paper are as follows:
  • We extend the classical WMMSE framework to handle per-antenna power constraints (PAPCs) by introducing a diagonal dual regularization. A diminishing-step projected dual ascent is used to update the per-antenna Lagrange multipliers, and a final row-wise feasibility projection is applied to eliminate any residual PAPC violations. This design keeps all additional variables local to the BS and yields a PAPC-feasible precoder.
  • We develop a Woodbury-based reformulation of the transmit update that trades the classical M × M inversion for an ( N K ) × ( N K ) symmetric positive-definite (SPD) solve built from a block-diagonal surrogate weight. This reduces the dominant per-iteration complexity in the massive-MIMO regime M N K , while retaining a numerically stable Cholesky/LDL implementation.
  • We propose a robust low-complexity WMMSE (RLC-WMMSE) update that blends the classical WMMSE precoder with its low-complexity surrogate through an adaptive mixing factor driven by the per-iteration MSE variation. Together with an Armijo-type adaptive damping rule, this hybrid scheme stabilizes the iterations when updates are computed on H ^ but performance is evaluated on the true channels H , and enforces sufficient ascent of the monitored WSR in practice.
  • Simulation results indicate that the proposed RLC-WMMSE becomes cheaper per iteration than classical WMMSE once M c N K for typical system dimensions. In addition, for the PAPC dual loop, we establish a feasibility bound of the form E max m ( [ P P H ] m , m ρ m ) + = O ( log T / T ) , quantifying how the expected PAPC violation decays with the number of outer iterations T.
  • Through Monte Carlo simulations with both Kronecker-correlated channels and i.i.d. Rayleigh fading, and under various CSI mismatch levels, we show that the proposed RLC-WMMSE achieves near-identical WSR to the WMMSE-PAPCs benchmark while maintaining negligible PAPC violations. At the same time, it exhibits favorable runtime scaling with M and clear speedups over classical WMMSE in the large-array regime.
Table 1 summarizes the main design principles of three WMMSE-based precoders discussed in this work. The R-WMMSE approach [20] reduces cost via randomized sketching, solving compressed normal equations. Conversely, the LC-WMMSE method [25] instead exploits problem structure using a Woodbury reformulation and a diagonal-weight surrogate in the transmit step. Our RLC-WMMSE extends LC-WMMSE by adding diagonal dual regularization to handle PAPCs under imperfect CSI, while maintaining an ( N K ) × ( N K ) symmetric positive-definite (SPD) solve for the transmit update. The three methods thus differ in update dimension, dominant cost, constraint handling, and approximation source.
The remainder of this article is summarized as follows. Section 2 introduces the system model and problem formulation. Section 3 presents the proposed robust low-complexity WMMSE (RLC-WMMSE) algorithm, including detailed derivations and a complexity analysis. Simulation results are provided and discussed in Section 4. Finally, Section 5 concludes the paper.

2. System Model and Problem Formulation

2.1. Downlink System Model

We consider a single-cell downlink MU-MIMO system in which a BS with M transmit antennas serves K users, each equipped with N receive antennas. The downlink channel to user k is denoted by H k C M × N , k = 1 , , K , and we model its entries as i.i.d. circularly symmetric complex Gaussian random variables (Rayleigh fading). The transmitted signal is
x = k = 1 K P k s k C M × 1 ,
where P k C M × d k is the linear precoder for user k and s k C d k × 1 is the data symbol vector satisfying E [ s k s k H ] = I N .
Under flat fading, the received signal at user k is
y k = H k x + n k = H k j = 1 K P j s j + n k ,
where n k CN ( 0 , σ 2 I N ) represents additive white Gaussian noise (AWGN). The data vectors { s k } are mutually independent and also independent of the noise vectors { n k } .
Remark 1
(Scalability in Massive MIMO). Practical massive MU-MIMO deployments typically satisfy M K N , meaning the base station is equipped with far more antennas than each user [26]. In such configurations, classical algorithms such as WMMSE require large matrix inversions whose computational cost grows rapidly with M. To alleviate this burden, our earlier LC-WMMSE scheme [25] integrates hybrid switching and adaptive damping mechanisms, significantly lowering the complexity. These enhancements enable the algorithm to scale efficiently with the BS antenna count, attaining approximately sub-cubic complexity in large-scale regimes.

2.2. Problem Formulation with Imperfect CSI and PAPCs

In downlink MU-MIMO systems, a core design goal is to determine the set of precoders { P k } k = 1 K that maximize the weighted sum-rate (WSR) under a transmit power constraint. Here, μ k 0 represents the weight assigned to user k. The WSR is defined as
R = k = 1 K μ k R k ,
where the achievable rate of user k is given by
R k = log 2 det I N + Σ k 1 H k P k P k H H k H ,
where the interference-plus-noise covariance matrix for user k is expressed as
Σ k = σ 2 I N + j = 1 j k K H k P j P j H H k H .
We maximize the WSR over all feasible precoders under per-antenna power constraints (PAPCs), which introduce a different feasibility set and a distinct performance–complexity trade-off compared with the conventional sum-power constraint.
The WSR maximization problem under PAPCs can be expressed as
max { P k } k = 1 K μ k R k , s . t . k = 1 K P k P k H m , m ρ m ,
where [ A ] m , m donates the m-th diagonal element of matrix A . The constraints in Equation (6) ensure that the transmit power at the m-th antenna of the BS does not exceed ρ m . Solving the WSR maximization problem in Equation (6) is challenging due to the objective function being highly nonlinear and nonconvex. Furthermore, as shown in [22], this problem is NP-hard, as summarized in the following proposition.
Proposition 1
(NP-hardness of WSR Maximization). The downlink weighted sum-rate maximization problem under a sum-power constraint is known to be NP-hard [21,22]. Since PAPCs are more restrictive than an SPC, the PAPC formulation (6) is also NP-hard.
We adopt the standard imperfect-CSI model H k = H ^ k + Δ k , Δ k CN 0 , ε I M × N , k = 1 , , K , where H ^ k is the channel estimate and ε sets the estimation NMSE; we use E Δ k F 2 / E H k F 2 = ε . Given nonnegative user weights { μ k } , the weighted sum-rate (WSR) for a channel realization is R ( { P k } ; H ) = k = 1 K μ k R k ( { P j } ; H ) , with R k and Σ k defined in Equations (4) and (5).
Imperfect CSI model: The additive Gaussian CSIT error model above is widely used in robust precoding/beamforming because it is tractable and captures the aggregate impact of estimation noise, quantization, and residual uncertainty; see, e.g., [11,12,13].
Remark 2
(CSI outdatedness). In practice, CSIT can also be outdated due to feedback/processing delay, which is often modeled by a first-order Gauss–Markov evolution H k [ n ] = α H k [ n 1 ] + 1 α 2 W k [ n ] [2,27]. In this work, we focus on the additive error model for algorithm design and analysis, and in simulations, we sweep ϵ to represent different CSI qualities.
Under per-antenna power constraints (PAPCs), the feasible precoder set is
P PAPC { P k } k = 1 K : k = 1 K P k P k H m , m ρ m , m = 1 , , M ,
where ρ = [ ρ 1 , , ρ M ] collects the per-antenna budgets. For apples-to-apples comparisons with the SPC case, we set m = 1 M ρ m = P max .
A practical “plug-in” (mismatch) design maximizes the WSR built on the channel estimates H ^ k while enforcing PAPCs:
maximize { P k } P PAPC R ˜ ( { P k } ; H ^ ) k = 1 K μ k log 2 det I N + Σ ˜ k 1 H ^ k P k P k H H ^ k H where Σ ˜ k = σ 2 I N + j k H ^ k P j P j H H ^ k H ,
with I N the N × N identity. Problem Equation (8) is nonconvex and NP-hard. We solve it via alternating WMMSE updates on H ^ combined with a dual-based enforcement of Equation (7) and a Woodbury low-complexity step; the final performance is reported by evaluating R ( { P k } ; H ) on the true channel. All symbols used in this paper are summarized in Table 2.

3. Proposed Robust LC-WMMSE Algorithm

3.1. The Classical WMMSE Algorithm

The WMMSE framework is widely used for WSR optimization [20]. In this section, we briefly revisit the classical WMMSE approach [23,28] from an optimization perspective, where the MSE serves as an auxiliary variable rather than a physical metric. Since the WSR problem in (6) is nonconvex, we leverage the well-known rate–MSE equivalence to rewrite it as a weighted sum-MSE minimization, which admits an efficient BCD procedure [24]; the problem can be reformulated as
min { W k , U k , P k } k = 1 K μ k Tr ( W k E k ) log det ( W k ) s . t . k = 1 K P k F 2 P max
subject to the same transmit power constraint specified in Equation (9). Here, μ k 0 denotes the priority weight associated with user k, and the mean-square error (MSE) matrix E k for user k is defined as
E k = E ( U k H y k s k ) ( U k H y k s k ) H ,
where E k denotes the user-k mean-square error (MSE) matrix. Here, U k C N × d k is the linear receive filter and W k C d k × d k is the MSE weight matrix. Both variables are updated jointly with the precoders { P k } . By expanding E k , we obtain
E k = I N U k H H ^ k P k U k H H ^ k P k H + j = 1 K U k H H ^ k P j P j H H ^ k H U k + σ 2 U k H U k .
As shown in [23], the reformulated problem in Equation (9) is nonconvex in the joint variable set { P k , U k , W k } . Nonetheless, it is convex in each variable block with the others fixed, which enables an efficient alternating optimization scheme summarized below:
U k = j = 1 K H ^ k P j P j H H ^ k H + σ 2 I N 1 H ^ k P k , k .
Keeping the remaining two variable blocks unchanged, the weight matrix W k admits the following closed-form update:
W k = μ k E k + ε I d k 1 , E k C d k × d k , k .
For given receive filters { U k } and weight matrices { W k } , the precoders { P k } are updated by the solution of a convex quadratic problem:
P = arg min P 1 2 P , A P { B , P } ,
with the corresponding normal equation A P = B .
A σ 2 I M + k = 1 K H ^ k U k W k U k H H ^ k H C M × M ,
B H ^ 1 U 1 W 1 H ^ K U K W K C M × D , D = k = 1 K d k .
Here, A C M × M , B C M × D , P C M × D , and  P k C M × d k . Using the definitions of A and B in (15) and (16), the optimality condition A P = B gives the unique solution P = A 1 B , which can be written per user as
P = A 1 B P k = A 1 H ^ k U k W k , k = 1 , , K .
While A is shared by all users, the blocks on the right-hand side, H ^ k U k W k , depend on k; consequently, the precoder blocks P k are user-specific. Since σ 2 > 0 and ε > 0 in (13), A 0 (Hermitian), and the associated system admits a unique solution. Now, we replace the SPC update in Equation (17) by a per-antenna constrained step; for a dual vector λ 0 , define
P ( λ ) = A + diag ( λ ) 1 B ,
and choose λ so that each antenna power satisfies
P ( λ ) P ( λ ) H m , m ρ m , m = 1 , , M .
A projected dual update is
λ ( t + 1 ) = λ ( t ) + η t diag ( P P H ) ρ + , η t = η 0 / t .
If a tiny residual remains, apply the row-wise safety projection
P S P , S = diag min 1 , ρ m [ P P H ] m , m m = 1 M .
Remark 3.
If m = 1 M ρ m = P max and λ = λ 1 , Equation (18) reduces to the SPC form in Equation (17).
The classical WMMSE algorithm requires solving an M × M linear system in every outer iteration, which entails cubic work O ( M 3 ) and becomes the dominant bottleneck when M is large. With per-antenna constraints the transmit update must also be regularized by a diagonal dual term, but the cost is still governed by the M × M solve. This motivates a low-complexity variant that avoids the big inversion. In the next subsection, we construct a Woodbury update that shifts the solve to an ( N K ) × ( N K ) system and enforces PAPCs with a light projected dual loop, yielding a favorable crossover when M c N K while preserving the WSR performance under CSI mismatch.

3.2. Proposed RLC-WMMSE

In this subsection, we develop an efficient iterative solution to the PAPC-constrained WSR maximization problem in Equation (8) using the WMMSE framework. The proposed robust low-complexity WMMSE (RLC-WMMSE) alternates standard WMMSE updates (on H ^ ) with a light dual update to enforce PAPCs and a Woodbury-based transmit step; hybrid switching and adaptive damping are employed for stability under CSI mismatch.

The RLC-WMMSE Algorithm Problem Reformulation

In the proposed robust LC-WMMSE, the  M × M matrix inversion in (15)–(17) is avoided by applying the Woodbury identity [29], which reduces the update to solving an ( N K ) × ( N K ) system. As a result, the dominant per-iteration complexity decreases from O ( M 3 ) to O M ( N K ) 2 + ( N K ) 3 in the massive-MIMO setting M N K .
  • Update the hybrid transmit precoder: At iteration t, we construct P ( t ) by convexly combining the classical WMMSE precoder P WMMSE ( t ) and the LC precoder P LC ( t ) as follows:
    P ( t ) = ω ( t ) P WMMSE ( t ) + 1 ω ( t ) P LC ( t ) ,
    where ω ( t ) [ 0 , 1 ] is an adaptive switch:
    ω ( t ) = Δ E ( t ) F Δ E ( t ) F + κ , Δ E ( t ) = k = 1 K E k ( t ) E k ( t 1 ) ,
    with a small smoothing constant κ > 0 (we use κ = 10 3 unless stated otherwise). When Δ E ( t ) F is large (far from a fixed point), ω ( t ) 1 favors P WMMSE ( t ) ; near convergence, it gradually shifts to the LC update to save time.
    Diagonal weight and Woodbury build.
    We approximate the full WMMSE weight by its diagonal,
    D k ( t ) = diag diag ( W k ( t ) ) 0 , D k ( t ) C d k × d k , k = 1 , , K ,
    which preserves positive-definiteness while removing inter-stream couplings. Using U k ( t ) and D k ( t ) , we form the block diagonal matrix
    S ( t ) = blkdiag U 1 ( t ) D 1 ( t ) U 1 ( t ) H , , U K ( t ) D K ( t ) U K ( t ) H C ( N K ) × ( N K ) , S ( t ) 0 ,
    and stack the channels horizontally,
    H ^ = [ H ^ 1 , , H ^ K ] C M × ( N K ) .
    PAPC enforcement via diagonal dual regularization.
    Under per-antenna power constraints, we replace the global Frobenius normalization by a diagonal dual regularization. At iteration t, construct A ( t ) and B ( t ) from the current ( U k ( t ) , D k ( t ) , w k ( t ) ) as in (27) and (28). Let
    A ( t ) = σ 2 I M + k = 1 K H ^ k U k ( t ) W k ( t ) U k ( t ) H H ^ k H ,
    B ( t ) = H ^ 1 U 1 ( t ) D 1 ( t ) , , H ^ K U K ( t ) D K ( t ) C M × D .
    With these, the LC step inverts an ( N K ) × ( N K ) SPD matrix (via Cholesky) instead of an M × M system, yielding the usual Woodbury speedup when M N K . For a dual vector λ 0 , define
    P ( t ) ( λ ) = A ( t ) + diag ( λ ) 1 B ( t ) ,
    and choose λ so that each antenna satisfies the PAPC P ( t ) P ( t ) H m , m ρ m , m = 1 , , M . A lightweight projected dual ascent that works well in practice is
    λ ( i + 1 ) = λ ( i ) + η i diag ( P ( t ) P ( t ) H ) ρ + , η i = η 0 i ,
    with [ · ] + denoting projection onto R 0 M and ρ = [ ρ 1 , , ρ M ] T . If a tiny residual remains after the inner loop, apply the row-wise safety projection
    P ( t ) S P ( t ) , S = diag min 1 , ρ m [ P ( t ) P ( t ) H ] m , m m = 1 M .
    The row scaling in (31) is not an ad-hoc post-processing; it is the exact row-wise Euclidean projection onto the PAPC-feasible set P { P : P m , : 2 2 ρ m , m } , since the constraints decouple across rows. Concretely, for any P ^ , Π P ( P ^ ) is obtained by P m , : = min 1 , ρ m / P ^ m , : 2 P ^ m , : . Thus, every iterate is feasible (up to numerical tolerance), and Armijo acceptance is evaluated on the projected (feasible) iterate.
    Notes. (i) Equations (29)–(31) enforce PAPCs without any global Frobenius rescaling, so this section supersedes the SPC normalization previously used after Equation (28). (ii) If m ρ m = P max and λ = λ 1 , Equation (29) reduces to the SPC form (recovering the classical update).
  • Adaptive Damping with Robust PAPCs: To stabilize the outer iterations under CSI mismatch, we adopt the adaptive damping mechanism originally proposed for the LC-WMMSE algorithm in [25] and extend it to the robust PAPC setting. Let R ˜ ( { P k } ; H ^ ) denote the estimated WSR evaluated on the imperfect CSI H ^ . At iteration t, we measure the change in the estimated WSR as
    Δ R ( t ) R ˜ ( { P k ( t ) } ; H ^ ) R ˜ ( { P k ( t 1 ) } ; H ^ ) , Δ ¯ ( t ) Δ R ( t ) max { 1 , | R ˜ ( { P k ( t ) } ; H ^ ) | } .
    We then choose a mixing factor
    α ( t ) = clip 1 Δ ¯ ( t ) η t , α min , α max , η t = η 0 t ,
    where clip ( x , a , b ) min { max { x , a } , b } and typical values are α min [ 0.1 , 0.3 ] , α max [ 0.9 , 0.98 ] , and  η 0 [ 10 2 , 10 1 ] . Let P ^ ( t ) denote the undamped precoder returned by the robust LC-WMMSE (RLC) update at iteration t. The damped update is
    P ( t ) α ( t ) P ( t 1 ) + 1 α ( t ) P ^ ( t ) .
    Optionally, we apply a short Armijo backtracking (up to a few trials) on α ( t ) to enforce monotone ascent of the estimated objective, i.e., R ^ ( { P k ( t ) } ; H ^ ) R ^ ( { P k ( t 1 ) } ; H ^ ) . This schedule reduces the step size when the iterate is far from stationarity (large Δ ¯ ( t ) ) and allows larger steps near convergence. Final performance is always reported as R ( { P k ( t ) } ; H ) on the true channel H for fairness.

3.3. Proposed RLC-WMMSE Updates Precoder with PAPCs and Imperfect CSI

The proposed robust low-complexity WMMSE (RLC-WMMSE) precoding algorithm follows the standard three-block WMMSE structure, but computes all updates with the channel estimates H ^ k and enforces per-antenna power constraints (PAPCs) via the diagonal dual map described in Section 3.2.
  • Receive Filter Update U k ( t ) : At iteration t, the receive filter for user k is
    U k ( t ) = H ^ k H S x ( t 1 ) H ^ k + σ 2 I N 1 H ^ k H P k ( t 1 ) , k ,
    where the BS transmit covariance from the previous iterate is S x ( t 1 ) = j = 1 K P j ( t 1 ) P j ( t 1 ) H C M × M . The matrix inside the inverse is Hermitian positive-definite, so Equation (35) is well posed.
  • Weight Matrix Update W k ( t ) : With the MSE matrix E k ( t ) (defined earlier), we set
    W k ( t ) = μ k E k ( t ) + ε I N 1 , ε > 0 k ,
    which yields a positive-definite weight; in the LC branch, we use its diagonal surrogate as shown in Equations (24)–(28); we form S ( t ) and stack H ^ to obtain the Woodbury update.
  • Transmit Precoder Update P ( t ) : Using the block-diagonal matrix S ( t ) and the stacked channel H ^ = [ H ^ 1 , , H ^ K ] (both defined earlier), the Woodbury step gives the LC precoder in closed form as
    P ( t ) = 1 σ 2 B ( t ) H ^ S ( t ) 1 + 1 σ 2 H ^ H H ^ 1 1 σ 2 H ^ H B ( t ) C M × D ,
    with B ( t ) = [ H ^ 1 U 1 ( t ) D 1 ( t ) , , H ^ K U K ( t ) D K ( t ) ] and S ( t ) = blkdiag U k ( t ) D k ( t ) U k ( t ) H . This moves the inversion from size M to size N K , yielding per-iteration cost O M ( N K ) 2 + ( N K ) 3 instead of O ( M 3 ) when M N K .

PAPC Enforcement (Replaces SPC Rescaling)

Rather than the global Frobenius normalization, we enforce PAPCs via the diagonal dual regularization described in Section 3.2: the classical and LC right-hand sides are evaluated through P ( t ) ( λ ) = A ( t ) + diag ( λ ) 1 B ( t ) Equation (18), where λ is updated by the projected dual step in Equation (20); if a tiny residual remains, we apply the row-wise safety projection in Equation (21). This entirely replaces the SPC normalization.

3.4. Convergence and Feasibility Analysis

The classical WMMSE alternates minimization of a convex quadratic surrogate and achieves monotone ascent of the WSR [23]. In our robust low-complexity variant, at iteration t, we form the transmit quadratic Q ( t ) ( P ) = 1 2 P , A ( t ) P { B ( t ) , P } , from the pairs { ( U k ( t ) , W k ( t ) ) } k = 1 K computed on the estimates H ^ , and obtain a candidate update P ^ ( t ) { P WMMSE ( t ) , P LC ( t ) } , the LC candidate uses the diagonal surrogate with D k ( t ) = diag ( W k ( t ) ) . We then apply the damped step
P ( t + 1 ) = ( 1 α ( t ) ) P ( t ) + α ( t ) P ^ ( t ) , α ( t ) ( 0 , 1 ] ,
where α ( t ) is chosen by Armijo backtracking on the true-channel objective WSR ( P ; H ) evaluated at the feasible iterate P ( t + 1 ) .
Proposition 2
(Monotone WSR and limit-point stationarity). With Armijo acceptance evaluated on the feasible iterate P ( t + 1 ) , the objective sequence { WSR ( P ( t ) ; H ) } is nondecreasing and bounded above; hence, it converges. Moreover, the overall procedure is an inexact block-coordinate method: { U k ( t ) } and { W k ( t ) } are updated in closed form, while the transmit update may be inexact due to (i) the diagonal surrogate and (ii) the PAPC dual loop. If the transmit inexactness vanishes asymptotically (e.g., W k ( t ) D k ( t ) 0 , or the hybrid rule selects P WMMSE ( t ) increasingly often), then every accumulation point of { ( U ( t ) , W ( t ) , P ( t ) ) } is a stationary point of the classical WMMSE objective. Otherwise, accumulation points are stationary for the corresponding surrogate objective using D k = diag ( diag ( W k ) ) .
Sketch. Armijo yields sufficient ascent of WSR ( · ; H ) for accepted feasible iterates, and boundedness follows from PAPCs. Standard inexact-BCD arguments imply convergence of the objective values and stationarity of limit points under vanishing inexactness.
Remark 4
(Practical convergence rate and hybrid impact). To quantify practical convergence speed and the impact of hybrid switching, we report (i) WSR versus outer iteration and (ii) an iterations-to-target metric (e.g., the number of outer iterations required to reach 99 % of the final WSR) under mild and strong spatial correlation. These results complement Proposition 2 by providing an empirical convergence characterization under CSI mismatch.
To quantify the impact of the proposed hybrid switching mechanism on convergence under PAPCs and CSI mismatch, we run the same robust algorithm in three modes: hybrid switching, LC-only by fixing ω ( t ) 0 , and WMMSE-only by fixing ω ( t ) 1 . Figure 1 reports the averaged WSR trajectories versus iteration at SNR = 20  dB over spatially correlated channels ( r tx = 0.7 ) with CSI error variance ϵ = 10 3 , averaged over 100 Monte Carlo trials. All three variants converge to essentially the same WSR plateau, indicating that hybrid switching does not compromise the achieved performance in the considered robust setting. The numerical summary is provided in Table 3. The final WSR values are 351.25 , 350.95 , and  351.29  bps/Hz for hybrid, LC-only, and WMMSE-only, respectively. Moreover, the hybrid mode reaches 99 % of its final WSR within 12 iterations, whereas LC-only requires 15 iterations, demonstrating faster practical convergence (relative to the pure LC branch) while preserving stability under imperfect CSI and PAPCs.
Lemma 1
(Diagonal-surrogate perturbation bound). Let D k = diag ( diag ( W k ) ) and E k = W k D k . The diagonal surrogate perturbs the transmit normal matrix by Δ A ( t ) = k = 1 K H ^ k D k ( t ) E k ( t ) D k ( t ) H H ^ k H , which satisfies Δ A ( t ) 2   k = 1 K H ^ k 2 2 D k ( t ) 2 2 E k ( t ) 2 . Hence, the diagonal approximation is accurate when the off-diagonal energy W k ( t ) D k ( t ) is small (diagonal dominance), and it can degrade in strongly coupled regimes (e.g., high SNR / strong correlation), motivating the hybrid safeguard.
Implication: Since Δ A ( t ) 2 scales with W k W ˜ k F , the diagonal surrogate may be less reliable under strong correlation/high SNR, and the hybrid switching mitigates this by favoring the classical update when needed.
Diagonal approximation error: To quantify the error introduced by replacing W k with W ˜ k diag ( diag ( W k ) ) , we use the relative off-diagonal energy
η k W k W ˜ k F W k F ,
which measures the departure of W k from a diagonal structure. Empirically, η k increases under stronger spatial correlation, indicating a less accurate diagonal surrogate.
Moreover, η k admits an explicit perturbation bound. Let E k W k W ˜ k and Δ A ( t ) A ( t ) A ˜ ( t ) = k = 1 K H ^ k D k ( t ) E k ( t ) D k ( t ) H H ^ k H . Then,
Δ A ( t ) 2 k = 1 K H ^ k D k ( t ) 2 2 E k ( t ) F = k = 1 K H ^ k D k ( t ) 2 2 η k ( t ) W k ( t ) F ,
so smaller η k implies a smaller surrogate perturbation. The hybrid rule further improves robustness by favoring the classical update when the surrogate is less reliable.
PAPC feasibility (every outer iteration): Let P ( λ ) = ( A ( t ) + diag λ ) 1 B ( t ) and update λ by the projected dual step Equations (29) and (30) with the final row-wise safety projection Equation (31). Then, each iterate P ( t ) satisfies n | P m n ( t ) | 2 ρ m for all m (to numerical tolerance), so PAPCs hold throughout.

3.5. Computational Complexity Analysis

In massive MU-MIMO, the computational burden of iterative precoding is often the limiting factor. In particular, the classical WMMSE [23] is dominated per iteration by solving the M × M precoder system in (27), i.e., factorizing σ 2 I M + k = 1 K H k U k W k U k H H k H , which costs O ( M 3 ) . Receiver and weight updates each cost O ( K N 3 ) . Rethinking WMMSE (R-WMMSE) reduces the dominant cost via sketching and matrix-free linear solves and exhibits near-linear scaling in M empirically [20].
LC step (Woodbury): Using the stacked channel H ^ = H ^ 1 , , H ^ K ]   C M × N K and S = blkdiag ( S 1 , , S K ) with S k = U k diag ( W k ) U k H C N × N , the LC step replaces the M × M inverse by ( N K ) × ( N K ) SPD solve via Woodbury:
A 1 = σ 2 I M + H ^ S H ^ H 1 = 1 σ 2 I M H ^ S 1 + 1 σ 2 H ^ H H ^ 1 H ^ H 1 σ 2 .
Algorithm 1 dominant costs per iteration are as follows: (i) Cholesky/solve of T = S 1 + 1 σ 2 H ^ H H ^ C ( N K ) × ( N K ) : O ( N K ) 3 ; (ii) Gram products with H ^ (e.g., H ^ H H ^ , H ^ H B ): O M ( N K ) 2 ; and (iii) per-user N × N factorizations (for S k 1 ): O ( K N 3 ) . Thus, Table 4 shows that the RLC-WMMSE algorithm has dominant cost O M ( N K ) 2 + O ( N K ) 3 + O ( K N 3 ) , which is much smaller than the classical WMMSE algorithm complexity O ( M 3 ) when M N K .
Algorithm 1 Robust low-complexity WMMSE (RLC-WMMSE) precoding.
Require: 
Channel estimates { H ^ k } k = 1 K , true channels { H k } k = 1 K (for evaluation), weights { μ k } k = 1 K , noise σ 2 , PAPC budgets ρ , max iters T, tol ϵ
  1:
Initialize stacked precoder P ( 0 ) (feasible or normalized)
  2:
for  t = 1  to T do
  3:
    for  k = 1  toK do
  4:
        Receive update: U k ( t ) by (35),
  5:
        MSE build: E k ( t ) from U k ( t ) and P ( t 1 )
  6:
        Weight update: W k ( t ) by (36),
  7:
    end for
  8:
    Hybrid switch: compute ω ( t ) by (23),
  9:
   Classical candidate (with PAPCs): build A ( t ) , B ( t ) by (27); compute P WMMSE ( t ) ( λ ) via P ( λ ) = ( A ( t ) + diag λ ) 1 B ( t ) with λ updated by the projected dual step (29) and (30) and the safety projection (31),
10:
  Low-complexity candidate (Woodbury + PAPCs): form S ( t ) and stacked H ^ by (25) and (26), compute P LC ( t ) ( λ ) using the Woodbury closed form by (37) while enforcing PAPCs by (29)–(31) on the LC branch.
11:
  Hybrid precoder update by (22),
12:
  Adaptive damping: compute α ( t ) by (33), then set P ( t ) = α ( t ) P ( t 1 ) + ( 1 α ( t ) ) P ^ ( t )
13:
  Armijo acceptance is evaluated on the PAPC-feasible trial iterate.
14:
  Evaluate WSR ( t ) on true channels { H k } (monotone track); if  | WSR ( t ) WSR ( t 1 ) | < ϵ  then break
15:
end for
Ensure: 
Per-user blocks { P k ( t ) } k = 1 K from the stacked P ( t )

3.5.1. PAPC (RLC-WMMSE) Effect

With PAPCs, the classical branch evaluates P ( λ ) = ( A ( t ) + diag λ ) 1 B ( t ) using a short projected dual loop in Equations (29)–(31). In the LC branch, we replace σ 2 I M by the diagonal matrix A λ = diag ( σ 2 + λ m ) and apply Woodbury with S ( t ) = blkdiag ( U k ( t ) D k ( t ) U k ( t ) H ) and H ^ = [ H ^ 1 , , H ^ K ] :
( A λ + H ^ S ( t ) H ^ H ) 1 = A λ 1 A λ 1 H ^ ( S ( t ) ) 1 + H ^ H A λ 1 H ^ 1 H ^ H A λ 1 .
The per-iteration asymptotic costs remain O M ( N K ) 2 + O ( N K ) 3 + O ( K N 3 ) ; the dual loop adds J cheap inner steps (typically J 5 ), each requiring one ( M × N K ) triangular solve and power check, i.e., an extra O J M ( N K ) that is lower-order when M N K .
Takeaway. Because N K M in massive MU-MIMO, LC/RLC replace the cubic O ( M 3 ) term with operations that scale with N K . Empirically, we observe a crossover where LC/RLC become faster than classical WMMSE when M c ( N K ) (with c 3 in our setup), and the PAPC dual loop incurs only a small constant overhead. Imperfect CSI (updates on H ^ ) does not change orders of complexity.

3.5.2. Practical Implementation and Latency Considerations

We emphasize that RLC-WMMSE remains an iterative optimizer and thus is not primarily intended to replace one-shot closed-form precoders in extremely stringent URLLC pipelines. Rather, it targets regimes in which a small iteration budget is feasible within the channel coherence time and the dominant computations can be efficiently parallelized. In practice, the latency can be reduced by warm-starting from the previous slot/TTI solution, adopting early stopping with a fixed iteration cap or an iterations-to-target criterion (e.g., t 99 ), and implementing the dominant ( N K ) × ( N K ) SPD solves using highly optimized batched Cholesky/LDL routines on GPU/FPGA/ASIC. From this perspective, the key advantage of the proposed reformulation is that it shifts the dominant per-iteration cost from an M × M inversion to an ( N K ) × ( N K ) SPD solve, which is more favorable when M N K .

3.6. Implementation Considerations and Signaling Overhead Analysis

At iteration t, the BS needs downlink (DL) CSI and the current receive filters and weights for each user. In a TDD deployment, the BS estimates the downlink channels { H ^ k } from uplink pilots by reciprocity and can update U k ( t ) and W k ( t ) locally. Hence, no per-iteration DL feedback is required beyond the usual pilot overhead, and the additional PAPC dual variables λ are kept entirely at the BS. In an FDD system, the user equipments (UEs) estimate the channels from DL pilots and must feed back effective CSI based on H ^ k for the RLC-WMMSE updates. A full WMMSE mode would return the receive filters U k ( t ) C N × d k and the Hermitian weight matrices W k ( t ) C d k × d k . Using b c bits per complex number and b r bits per real number, the per-user payload per iteration is on the order of
B full , k N d k b c + d k ( d k + 1 ) 2 b r ,
where d k ( d k + 1 ) 2 real entries describe a d k × d k Hermitian matrix. Our LC branch suggests a lighter diagonal feedback mode. Since the algorithm only uses the diagonal surrogate D k ( t ) = diag ( diag ( W k ( t ) ) ) , the UE can quantize and feed back diag ( W k ( t ) ) R d k together with a compressed representation of U k ( t ) (e.g., a codebook index or a few dominant singular vectors). The overhead reduces to roughly
B LC , k d k b r + B codebook ,
which can be significantly smaller than B full , k when d k N . In both modes, the additional PAPC dual variables are updated at the BS and incur no extra DL/UL signaling.
Relation to imperfect CSI model: In our stochastic CSI model H k = H ^ k + Δ k , the mismatch variance ε captures the combined effect of channel estimation noise and, in FDD, feedback quantization/compression and delay. Longer UL/DL pilot sequences or finer feedback quantization reduce ε, while aggressive compression in the LC feedback mode typically increases it. The proposed RLC-WMMSE algorithm is designed to be robust to such mismatches, as reflected in the simulations of Section 4.

4. Simulations and Results

4.1. Simulation Setup

We examine a single-cell massive MU-MIMO downlink where a BS with M transmit antennas serves K users. Each user is equipped with N receive antennas and is allocated d k = N data streams. Under the sum-power constraint (SPC), the BS transmit-power budget is set to P max = 10 W . For per-antenna power constraints, the power limits are collected in ρ = [ ρ 1 , , ρ M ] T with ρ m = P max / M . The channel matrices { H k } are generated as circularly symmetric complex Gaussian fading with large-scale pathloss 128.1 + 37.6 log 10 ( d ) dB [30], where d [ 0.1 , 0.3 ] km denotes the user–BS distance. The noise variance is identical for all users and is set to σ 2 = P max 10 SNR / 10 , so that SNR (in dB) corresponds to the average received SNR in the absence of precoding.
Unless stated otherwise, results are reported under perfect CSI at the BS and are averaged over 100 independent channel realizations. For robustness evaluations, we explicitly model imperfect CSI via H ^ k = H k + Δ k , where Δ k CN ( 0 , ε I ) and ε controls the estimation NMSE. Throughout the simulations, we use κ = 10 3 , η = 10 3 , α min = 0.2 , α max = 0.9 , tolerance ϵ = 10 4 , and a maximum of T = 100 iterations. All experiments are implemented in Matlab R2024b on a Windows 11 (64-bit) workstation with an Intel i7-12700H CPU at 3.20 GHz, 16 GB RAM, and RTX Graphics.

4.2. Robust Low-Complexity (RLC-WMMSE) Performance Under Correlated Channels

This subsection presents simulation results that evaluate the performance of the proposed RLC-WMMSE algorithm under per-antenna power constraints and correlated-channel conditions. The method is compared with several baseline schemes: the LC-WMMSE algorithm [25], the classical WMMSE algorithm [23], the R-WMMSE approach [20], and two non-iterative precoding techniques—zero-forcing (ZF) [31] and block diagonalization (BD) [32]. The closed-form ZF and BD methods exploit low-dimensional channel properties (e.g., BD uses null-space projection to suppress interference) and exhibit low computational complexity, specifically O ( K 3 ) for ZF and O ( K M 2 ) for BD [31,32]. Although these non-iterative schemes are computationally efficient and practical for massive MU-MIMO, they generally yield suboptimal weighted sum-rate performance. To ensure a fair comparison, each trial initializes the precoder as P ( 0 ) CN ( 0 , 1 ) , which is then scaled to satisfy the power constraints and kept identical for all methods. To investigate robustness in realistic propagation environments, we first consider on spatially correlated channels at the BS. For user k, the true downlink channel is generated according to
H ^ k = R BS 1 / 2 W k , W k CN ( 0 , I M × N ) ,
where R BS C M × M is the BS correlation matrix. We adopt an exponential model [ R BS ] m , n = r | m n | with 0 r < 1 and use r = 0.7 in the correlated-channel experiments. The i.i.d. Rayleigh case is recovered by setting R BS = I M . Apart from the choice of R BS , the simulation protocol (SNR grid, ( M , N , K ) , initialization, tolerances, and power normalization) is identical for both correlated and i.i.d. channels.
Figure 2 shows the WSR versus SNR for correlated MU-MIMO channels with perfect CSI. The LC-WMMSE algorithm closely matches the WSR of the classical WMMSE algorithm over the whole SNR range, while both clearly outperform the R-WMMSE and the closed-form ZF and BD precoders.
Figure 3 and Figure 4 illustrate the convergence behavior of the different PAPC schemes for two system dimensions, ( M , K , N ) = ( 64 , 12 , 2 ) and ( 128 , 16 , 4 ) , at SNR = 10 dB , 0 dB . For the SPC case, WMMSE, LC-WMMSE, and R-WMMSE all exhibit fast, monotone convergence and attain almost identical final WSR, with R-WMMSE and LC-WMMSE reaching the steady state in noticeably fewer iterations than classical WMMSE. Under PAPCs, the proposed robust LC-WMMSE converges much faster than the WMMSE-PAPCs baseline while achieving nearly the same steady-state WSR, confirming that the low-complexity Woodbury/dual design preserves good convergence properties even in the presence of per-antenna constraints.
Now, we compare our proposed RLC-WMMSE algorithm with the LC-WMMSE algorithm, the classical WMMSE and R-WMMSE schemes in terms of average CPU time to convergence, and quantify the additional overhead introduced by the PAPC variants (robust LC-WMMSE and WMMSE-PAPCs). Figure 5 shows the runtime versus the number of users K for a system with M = 128 BS antennas, N = 2 receive antennas per user, and SNR = 10 dB. As K increases, all algorithms exhibit higher computational cost, but with markedly different slopes. The proposed RLC-WMMSE algorithm is consistently faster than the classical WMMSE baseline, showing a much flatter growth in runtime as K grows, which is consistent with its reduced per-iteration complexity. For the largest tested value K = 40 , LC-WMMSE is roughly 40 % faster than WMMSE in terms of CPU time, whereas R-WMMSE, thanks to its randomized/sketched updates, achieves runtimes that are almost an order of magnitude smaller than those of WMMSE. The robust LC-WMMSE under PAPCs remains significantly cheaper than the exact WMMSE-PAPCs solver, confirming the benefit of the proposed dual-based low-complexity design for PAPCs.
Figure 6 shows the runtime as a function of the number of BS antennas M for K = 16 users, N = 4 receive antennas per user, and SNR = 10 dB . The computational cost of all methods increases with M, but the scaling behavior differs. The classical WMMSE and WMMSE-PAPCs curves grow rapidly, reflecting the O ( M 3 ) matrix inversions in each iteration. In contrast, LC-WMMSE and robust LC-WMMSE grow much more slowly with M, and for M 10 3 , the LC-WMMSE achieves about a 1.7 × speedup over WMMSE while remaining substantially faster than WMMSE-PAPCs. R-WMMSE exhibits the lowest runtime across all antenna dimensions, in line with its near-linear complexity in M. Overall, these results confirm the complexity analysis: LC-WMMSE provides a substantial reduction in computational cost compared with the classical WMMSE algorithm, and the robust LC-WMMSE inherits this favorable scaling even in the presence of PAPCs.
Figure 7 compares the weighted sum-rate versus SNR for the considered precoders with correlated channels ( M = 128 , K = 16 , N = 4 , and imperfect CSI). As expected, all schemes benefit from increasing SNR, but their relative gaps remain almost constant. The benchmark WMMSE-PAPCs and its normalized variant achieve the highest WSR, and the proposed RLC-WMMSE closely tracks both curves across the entire SNR range. At SNR = 30 dB, for example, the WSR loss of RLC-WMMSE with respect to WMMSE-PAPCs is below 0.3 % , and below 0.2 % relative to the normalized WMMSE-PAPCs baseline. R-WMMSE delivers slightly lower WSR due to the sketching approximation, whereas LC-WMMSE (SPC), ZF, and BD exhibit larger gaps, highlighting the importance of both PAPC enforcement and robust design under CSI mismatch.
Figure 8 reports the corresponding average runtimes per channel realization versus SNR. The runtimes of all algorithms are essentially insensitive to SNR and are dominated by their per-iteration matrix operations. R-WMMSE is the fastest method, but this comes at the cost of a noticeable WSR loss. LC-WMMSE reduces the runtime by about 20– 25 % compared to classical WMMSE, confirming the effectiveness of the Woodbury reformulation. The proposed RLC-WMMSE incurs only a moderate overhead relative to LC-WMMSE to enforce PAPCs and handle imperfect CSI, yet it still runs about 10– 15 % faster than the full WMMSE-PAPCs solver. Overall, these results demonstrate that RLC-WMMSE attains WSR performance essentially identical to the best PAPC baselines while offering more favorable runtime scaling than classical WMMSE and WMMSE-PAPCs in the large-array regime.

4.3. Robustness to Channel Estimation Errors

In this subsection we investigate the robustness of the proposed robust LC-WMMSE precoder to imperfect CSI at the BS. For each user k, the estimated downlink channel is modeled as H ^ k = H k + E k , where H k C M × N is the true channel matrix and E k collects the estimation errors whose entries are i.i.d. circularly symmetric complex Gaussian. The quality of the CSI is parameterized by the normalized mean-squared error (NMSE)
ϵ E H ^ k H k F 2 E H k F 2 ,
so that ϵ = 0 corresponds to perfect CSI and larger values of ϵ represent increasingly severe estimation errors. For each fixed ϵ , we generate ( H k , H ^ k ) according to the above model and compare the following transceiver designs:
  • WMMSE (oracle, H ): The classical WMMSE algorithm with a sum-power constraint, which has perfect knowledge of H k and serves as an upper bound.
  • WMMSE (mismatch, H ^ H ): The classical WMMSE algorithm that designs the precoder using the imperfect CSI H ^ k , while the achieved weighted sum-rate is always evaluated on the true channels H k .
  • Robust LC-WMMSE (PAPCs): The proposed low-complexity LC-WMMSE algorithm with per-antenna power constraints, which updates all variables using only H ^ k and is also evaluated on the true channels H k .
Let R or ( ϵ ) , R mis ( ϵ ) , and R RLC ( ϵ ) denote the average weighted sum-rate (WSR) of the oracle WMMSE, mismatched WMMSE, and robust LC-WMMSE, respectively, for a given value of ϵ . The relative WSR loss with respect to the oracle design is defined as
Δ mis ( ϵ ) = R or ( ϵ ) R mis ( ϵ ) R or ( ϵ ) × 100 % , Δ RLC ( ϵ ) = R or ( ϵ ) R RLC ( ϵ ) R or ( ϵ ) × 100 % .
Figure 9 shows the WSR versus ϵ at SNR = 10 dB. As expected, the oracle WMMSE curve is almost insensitive to ϵ , since it always uses the true CSI. In contrast, the mismatched WMMSE suffers a significant performance degradation as ϵ increases. The proposed robust LC-WMMSE (PAPCs) consistently tracks the oracle curve more closely, thereby mitigating the loss induced by imperfect CSI. The relative losses Δ mis ( ϵ ) and Δ RLC ( ϵ ) are shown in Figure 10; these results confirm that the proposed design reduces the WSR loss compared to the mismatched WMMSE baseline over the entire range of channel-estimation errors considered.
To verify that robustness does not come at the expense of violating the per-antenna power constraints, we also monitor the average maximum PAPC violation
v ¯ ( ϵ ) E max 1 m M p m 2 2 ρ m + ,
where p m denotes the mth row of the final precoder and ( x ) + max { x , 0 } . As illustrated in Figure 11, the proposed robust LC-WMMSE scheme satisfies the PAPCs up to numerical precision for all tested values of ϵ .

4.4. Performance Under i.i.d. Rayleigh

To provide a baseline reference, we first consider an uncorrelated i.i.d. Rayleigh fading scenario with imperfect CSI at the BS. The true downlink channels are drawn as H k CN ( 0 , I M ) , while the BS only has access to the estimates H ^ k = H k + E k , where E k CN ( 0 , ϵ I M ) models the estimation error. In this subsection, we fix the NMSE parameter to ϵ = 10 3 (approximately 30 dB channel-estimation NMSE), and set the transmit-side correlation coefficient to r tx = 0 . Unless otherwise stated, we use M = 128 BS antennas and K = 16 users, each with N = 4 receive antennas (and streams), and a total BS sum power ρ = 10 W under the SPC case. For PAPC-based schemes, the per-antenna budgets are chosen as ρ m = ρ / M so that m = 1 M ρ m = ρ . All results are averaged over a large number of independent Monte Carlo channel realizations.
Figure 12 shows the weighted sum-rate versus SNR for the i.i.d. Rayleigh case with imperfect CSI ( ϵ = 10 3 , r tx = 0 ). The proposed LC-WMMSE and robust LC-WMMSE (PAPCs) closely track the classical WMMSE and WMMSE-PAPCs benchmarks, respectively, across the entire SNR range.
Figure 13 reports the corresponding average runtime per channel realization, highlighting the significant complexity reduction of LC-WMMSE and robust LC-WMMSE compared with the WMMSE algorithm and WMMSE-PAPCs algorithm counterparts.

5. Conclusions

Weighted sum-rate maximization in downlink massive MU-MIMO with per-antenna power constraints (PAPCs) and imperfect CSI is computationally demanding, particularly for classical WMMSE algorithms whose cost scales cubically with the number of base-station antennas. This paper proposed a robust low-complexity WMMSE (RLC-WMMSE) precoding framework that preserves the favorable WSR performance of classical WMMSE while explicitly enforcing PAPCs under stochastic CSI mismatch. The approach combines a Woodbury-based low-complexity transmit update that operates on an ( N K ) × ( N K ) SPD system, a hybrid switching rule with adaptive damping that blends classical and low-complexity updates, and a lightweight diagonal dual regularization with row-wise safety projection to satisfy per-antenna power limits.
A qualitative complexity analysis shows that the proposed RLC-WMMSE scheme becomes more efficient than classical WMMSE once the number of BS antennas significantly exceeds the total number of data streams, which is typical in massive MU-MIMO deployments. Extensive simulations over i.i.d. and correlated channels, and for various CSI mismatch levels, demonstrate that RLC-WMMSE attains WSR performance very close to a WMMSE-PAPCs benchmark, while maintaining negligible average PAPC violations and offering clearly favorable runtime scaling as the array size grows. The implementation and signaling discussion further indicates that robustness and PAPC feasibility can be handled entirely at the BS, with only mild feedback requirements on the user side.
Overall, the proposed RLC-WMMSE method provides a practical, feasibility-aware, and computationally efficient precoding option for large-array base stations with per-antenna power budgets and imperfect CSI. Future work will consider extensions to multi-cell coordination, hybrid analog–digital architectures, and more refined stochastic error models that capture channel aging and hardware impairments.

Author Contributions

Conceptualization, V.S. and H.D.; methodology, V.S., Z.G. and H.D.; software, V.S. and Z.G.; validation, V.S. and H.D.; formal analysis, V.S. and Z.G.; investigation, V.S. and Z.G.; resources, V.S. and Z.G.; data curation, V.S.; writing—original draft preparation, V.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We are grateful to the High Performance Computing Center of Central South University for assistance with the computations.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wang, M.; Gao, F.; Jin, S.; Lin, H. An Overview of Enhanced Massive MIMO with Array Signal Processing Techniques. IEEE J. Sel. Top. Signal Process. 2019, 13, 886–901. [Google Scholar] [CrossRef]
  2. Marzetta, T.L.; Larsson, E.G.; Yang, H.; Ngo, H.Q. Fundamentals of Massive MIMO; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
  3. Pereira de Figueiredo, F.A. An Overview of Massive MIMO for 5G and 6G. IEEE Lat. Am. Trans. 2022, 20, 931–940. [Google Scholar] [CrossRef]
  4. Zhang, J.; Björnson, E.; Matthaiou, M.; Ng, D.W.K.; Yang, H.; Love, D.J. Prospective Multiple Antenna Technologies for Beyond 5G. IEEE J. Sel. Areas Commun. 2020, 38, 1637–1660. [Google Scholar] [CrossRef]
  5. Li, Q.C.; Niu, H.; Papathanassiou, A.T.; Wu, G. 5G Network Capacity: Key Elements and Technologies. IEEE Veh. Technol. Mag. 2014, 9, 71–78. [Google Scholar] [CrossRef]
  6. Peng, M.; Sun, Y.; Li, X.; Mao, Z.; Wang, C. Recent Advances in Cloud Radio Access Networks: System Architectures, Key Techniques, and Open Issues. IEEE Commun. Surv. Tutor. 2016, 18, 2282–2308. [Google Scholar] [CrossRef]
  7. Sohrabi, F.; Nuzman, C.; Du, J.; Yang, H.; Viswanathan, H. Energy-Efficient Flat Precoding for MIMO Systems. IEEE Trans. Signal Process. 2025, 73, 795–810. [Google Scholar] [CrossRef]
  8. Choi, H.; Swindlehurst, A.L.; Choi, J. WMMSE-Based Rate Maximization for RIS-Assisted MU-MIMO Systems. IEEE Trans. Commun. 2024, 72, 5194–5208. [Google Scholar] [CrossRef]
  9. Albreem, M.A.; Juntti, M.; Shahabuddin, S. Massive MIMO Detection Techniques: A Survey. IEEE Commun. Surv. Tutor. 2019, 21, 3109–3132. [Google Scholar] [CrossRef]
  10. Xu, Y.; Larsson, E.G.; Jorswieck, E.A.; Li, X.; Jin, S.; Chang, T.H. Distributed Signal Processing for Extremely Large-Scale Antenna Array Systems: State-of-the-Art and Future Directions. IEEE J. Sel. Top. Signal Process. 2025, 19, 304–330. [Google Scholar] [CrossRef]
  11. Shi, J.; Wang, W.; Yi, X.; Gao, X.; Li, G.Y. Deep Learning-Based Robust Precoding for Massive MIMO. IEEE Trans. Commun. 2021, 69, 7429–7443. [Google Scholar] [CrossRef]
  12. Lu, A.A.; Gao, X.; Zhong, W.; Xiao, C.; Meng, X. Robust Transmission for Massive MIMO Downlink with Imperfect CSI. IEEE Trans. Commun. 2019, 67, 5362–5376. [Google Scholar] [CrossRef]
  13. Shi, J.; Lu, A.A.; Zhong, W.; Gao, X.; Li, G.Y. Robust WMMSE Precoder with Deep Learning Design for Massive MIMO. IEEE Trans. Commun. 2023, 71, 3963–3976. [Google Scholar] [CrossRef]
  14. Yu, W.; Lan, T. Transmitter Optimization for the Multi-Antenna Downlink with Per-Antenna Power Constraints. IEEE Trans. Signal Process. 2007, 55, 2646–2660. [Google Scholar] [CrossRef]
  15. Lee, S.R.; Kim, J.S.; Moon, S.H.; Kong, H.B.; Lee, I. Zero-Forcing Beamforming in Multiuser MISO Downlink Systems Under Per-Antenna Power Constraint and Equal-Rate Metric. IEEE Trans. Wirel. Commun. 2013, 12, 228–236. [Google Scholar] [CrossRef]
  16. Li, B.; Wu, C.Z.; Dam, H.H.; Cantoni, A.; Teo, K.L. A Parallel Low Complexity Zero-Forcing Beamformer Design for Multiuser MIMO Systems Via a Regularized Dual Decomposition Method. IEEE Trans. Signal Process. 2015, 63, 4179–4190. [Google Scholar] [CrossRef]
  17. Jang, J.; Jeon, S.W.; Chae, H.; Cha, H.S.; Kim, D.K. Low Complexity Zeroforcing Precoder Design Under Per-Antenna Power Constraints. IEEE Commun. Lett. 2015, 19, 1556–1559. [Google Scholar] [CrossRef]
  18. Pham, T.M.; Farrell, R.; Dooley, J.; Dutkiewicz, E.; Nguyen, D.N.; Tran, L.N. Efficient Zero-Forcing Precoder Design for Weighted Sum-Rate Maximization With Per-Antenna Power Constraint. IEEE Trans. Veh. Technol. 2018, 67, 3640–3645. [Google Scholar] [CrossRef]
  19. Chen, C.W.; Tsai, W.C.; Wong, S.S.; Teng, C.F.; Wu, A.Y. WMMSE-Based Alternating Optimization for Low-Complexity Multi-IRS MIMO Communication. IEEE Trans. Veh. Technol. 2022, 71, 11234–11239. [Google Scholar] [CrossRef]
  20. Zhao, X.; Lu, S.; Shi, Q.; Luo, Z.Q. Rethinking WMMSE: Can Its Complexity Scale Linearly with the Number of BS Antennas? IEEE Trans. Signal Process. 2023, 71, 433–446. [Google Scholar] [CrossRef]
  21. Liu, Y.F.; Dai, Y.H.; Luo, Z.Q. Coordinated Beamforming for MISO Interference Channel: Complexity Analysis and Efficient Algorithms. IEEE Trans. Signal Process. 2011, 59, 1142–1157. [Google Scholar] [CrossRef]
  22. Luo, Z.Q.; Zhang, S. Dynamic Spectrum Management: Complexity and Duality. IEEE J. Sel. Top. Signal Process. 2008, 2, 57–73. [Google Scholar] [CrossRef]
  23. Shi, Q.; Razaviyayn, M.; Luo, Z.Q.; He, C. An Iteratively Weighted MMSE Approach to Distributed Sum-Utility Maximization for a MIMO Interfering Broadcast Channel. IEEE Trans. Signal Process. 2011, 59, 4331–4340. [Google Scholar] [CrossRef]
  24. Bertsekas, D.P. Nonlinear Programming. J. Oper. Res. Soc. 1997, 48, 334. [Google Scholar] [CrossRef]
  25. Sen, V.; Deng, H.; Xu, X.; Shen, M. A Hybrid Low-Complexity WMMSE Precoder with Adaptive Damping for Massive Multi-User Multiple-Input Multiple- Output Systems. Sensors 2025, 25, 6827. [Google Scholar] [CrossRef] [PubMed]
  26. Björnson, E.; Sanguinetti, L.; Wymeersch, H.; Hoydis, J.; Marzetta, T.L. Massive MIMO is a reality—What is next?: Five promising research directions for antenna arrays. Digit. Signal Process. 2019, 94, 3–20. [Google Scholar] [CrossRef]
  27. Adhikary, A.; Nam, J.; Ahn, J.Y.; Caire, G. Joint Spatial Division and Multiplexing—The Large-Scale Array Regime. IEEE Trans. Inf. Theory 2013, 59, 6441–6463. [Google Scholar] [CrossRef]
  28. Shi, Q.; Xu, W.; Wu, J.; Song, E.; Wang, Y. Secure Beamforming for MIMO Broadcasting with Wireless Information and Power Transfer. IEEE Trans. Wirel. Commun. 2015, 14, 2841–2853. [Google Scholar] [CrossRef]
  29. Budhu, J.; Grbic, A. Fast and Accurate Optimization of Metasurfaces with Gradient Descent and the Woodbury Matrix Identity. IEEE Trans. Antennas Propag. 2023, 71, 7679–7683. [Google Scholar] [CrossRef]
  30. Dahrouj, H.; Yu, W. Coordinated beamforming for the multicell multi-antenna wireless system. IEEE Trans. Wirel. Commun. 2010, 9, 1748–1759. [Google Scholar] [CrossRef]
  31. Parfait, T.; Kuang, Y.; Jerry, K. Performance analysis and comparison of ZF and MRT based downlink massive MIMO systems. In Proceedings of the 2014 Sixth International Conference on Ubiquitous and Future Networks (ICUFN), Shanghai, China, 8–11 July 2014; pp. 383–388. [Google Scholar] [CrossRef]
  32. Spencer, Q.; Swindlehurst, A.; Haardt, M. Zero-forcing methods for downlink spatial multiplexing in multiuser MIMO channels. IEEE Trans. Signal Process. 2004, 52, 461–471. [Google Scholar] [CrossRef]
Figure 1. Hybrid-impact ablation under PAPCs and imperfect CSI: WSR versus iteration at SNR = 20 dB with correlated channels ( r tx = 0.7 ), ϵ = 10 3 , averaged over 100 trials.
Figure 1. Hybrid-impact ablation under PAPCs and imperfect CSI: WSR versus iteration at SNR = 20 dB with correlated channels ( r tx = 0.7 ), ϵ = 10 3 , averaged over 100 trials.
Sensors 26 00159 g001
Figure 2. WSR of MU-MIMO systems with perfect CSI ( M = 128 , K = 16 , N = 4 ).
Figure 2. WSR of MU-MIMO systems with perfect CSI ( M = 128 , K = 16 , N = 4 ).
Sensors 26 00159 g002
Figure 3. Convergence comparison between the proposed RLC-WMMSE algorithm and the WMMSE algorithms with PAPCs under M = 64 , K = 12 , N = 2 , and SNR = 10 dB .
Figure 3. Convergence comparison between the proposed RLC-WMMSE algorithm and the WMMSE algorithms with PAPCs under M = 64 , K = 12 , N = 2 , and SNR = 10 dB .
Sensors 26 00159 g003
Figure 4. Convergence comparison between the proposed RLC-WMMSE algorithm and the WMMSE algorithms with PAPCs under M = 128 , K = 16 , N = 4 , 0 dB ), and SNR = 0 dB .
Figure 4. Convergence comparison between the proposed RLC-WMMSE algorithm and the WMMSE algorithms with PAPCs under M = 128 , K = 16 , N = 4 , 0 dB ), and SNR = 0 dB .
Sensors 26 00159 g004
Figure 5. Average CPU convergence time versus number of users K ( M = 128 , N = 2 , SNR = 10 dB).
Figure 5. Average CPU convergence time versus number of users K ( M = 128 , N = 2 , SNR = 10 dB).
Sensors 26 00159 g005
Figure 6. Average CPU convergence time versus number of BS antennas M ( K = 16 , N = 4 , 10 dB ).
Figure 6. Average CPU convergence time versus number of BS antennas M ( K = 16 , N = 4 , 10 dB ).
Sensors 26 00159 g006
Figure 7. WSR performance with different SNRs ( M = 128 , K = 16 , N = 4 ).
Figure 7. WSR performance with different SNRs ( M = 128 , K = 16 , N = 4 ).
Sensors 26 00159 g007
Figure 8. Average runtime between different SNRs.
Figure 8. Average runtime between different SNRs.
Sensors 26 00159 g008
Figure 9. WSR versus channel estimation NMSE parameter ϵ at M = 128 , K = 16 , N = 4 , SNR = 10 dB. The oracle WMMSE (perfect CSI), mismatched WMMSE (imperfect CSI), and robust LC−WMMSE with PAPCs are compared.
Figure 9. WSR versus channel estimation NMSE parameter ϵ at M = 128 , K = 16 , N = 4 , SNR = 10 dB. The oracle WMMSE (perfect CSI), mismatched WMMSE (imperfect CSI), and robust LC−WMMSE with PAPCs are compared.
Sensors 26 00159 g009
Figure 10. Relative WSR loss with respect to the oracle WMMSE baseline versus channel estimation NMSE parameter ϵ at M = 128 , K = 16 , N = 4 , SNR = 10 dB.
Figure 10. Relative WSR loss with respect to the oracle WMMSE baseline versus channel estimation NMSE parameter ϵ at M = 128 , K = 16 , N = 4 , SNR = 10 dB.
Sensors 26 00159 g010
Figure 11. Average maximum PAPC violation versus channel estimation NMSE parameter ϵ for the proposed robust LC−WMMSE precoder at M = 128 , K = 16 , N = 4 , SNR = 10 dB.
Figure 11. Average maximum PAPC violation versus channel estimation NMSE parameter ϵ for the proposed robust LC−WMMSE precoder at M = 128 , K = 16 , N = 4 , SNR = 10 dB.
Sensors 26 00159 g011
Figure 12. Sum-rate versus SNR for the i.i.d. Rayleigh scenario ( M = 128 , K = 16 , N = 4 , ϵ = 10 3 , r tx = 0 ).
Figure 12. Sum-rate versus SNR for the i.i.d. Rayleigh scenario ( M = 128 , K = 16 , N = 4 , ϵ = 10 3 , r tx = 0 ).
Sensors 26 00159 g012
Figure 13. Average runtime between different SNRs under the i.i.d. Rayleigh scenario ( M = 128 , K = 16 , N = 4 ).
Figure 13. Average runtime between different SNRs under the i.i.d. Rayleigh scenario ( M = 128 , K = 16 , N = 4 ).
Sensors 26 00159 g013
Table 1. Comparison of WMMSE variants. SPD: symmetric positive definite.
Table 1. Comparison of WMMSE variants. SPD: symmetric positive definite.
R-WMMSELC-WMMSERLC-WMMSE (Proposed)
PrincipleRandomized sketchingStructure exploitation (Woodbury + diag ( W k ) )Robust structure exploitation (Woodbury + diag ( W k ) + dual PAPCs)
Update sizeCompressed (sketched domain) ( N K ) × ( N K ) SPD solve ( N K ) × ( N K ) SPD solve with dual variables
CSI/constraintsPerfect CSI, SPCPerfect CSI, SPCImperfect CSI (stochastic mismatch), PAPCs
Dominant costSketch products + small SPD solveBuild A , B + ( N K ) × ( N K ) SPD solveLC cost + short dual loop (J inner steps)
Error sourceSketching bias/varianceNeglect of off-diagonals in W k Diagonal surrogate + inexact dual PAPC enforcement
Table 2. Summary of notation. Bold denotes matrices/vectors; ( · ) H is the Hermitian transpose and · F the Frobenius norm.
Table 2. Summary of notation. Bold denotes matrices/vectors; ( · ) H is the Hermitian transpose and · F the Frobenius norm.
NotationMeaning
MNumber of BS transmit antennas
NNumber of receive antennas per user
KNumber of users
d k Number of streams for user k
D k = 1 K d k Total number of streams
H k C M × N True downlink channel from BS to user k
H ^ k C M × N Estimated channel for user k (imperfect CSI)
Δ k Channel estimation error for user k with H k = H ^ k + Δ k
ε Estimation NMSE parameter in the stochastic CSI model
P k C M × d k Precoder for user k
P = [ P 1 , , P K ] C M × D Stacked precoder (all users)
s k C d k Data symbol vector for user k
x C M Transmit signal at the BS
y k C N Received signal at user k
n k C N AWGN at user k
σ 2 Noise power (per receive antenna)
P max Total transmit power budget under SPC
ρ = [ ρ 1 , , ρ M ] T Per-antenna power budgets (PAPCs)
P PAPC Feasible precoder set satisfying the per-antenna power constraints
U k C N × d k MMSE receive filter for user k
W k C d k × d k Weight matrix for user k
E k C d k × d k MSE matrix for user k
D k = diag ( diag ( W k ) ) Diagonal weight surrogate used in LC/RLC branch
S x = j = 1 K P j P j H C M × M BS transmit covariance matrix
H = [ H 1 , , H K ] C M × ( N K ) Stacked true channel
H ^ = [ H ^ 1 , , H ^ K ] C M × ( N K ) Stacked estimated channel
S = blkdiag ( U 1 D 1 U 1 H , , U K D K U K H ) C N K × N K Block-diagonal weight matrix (LC/RLC branch)
B LC = [ H 1 U 1 D 1 , , H K U K D K ] C M × D RHS factor for LC/RLC Woodbury update (true channel)
B class = [ H 1 U 1 W 1 , , H K U K W K ] C M × D RHS factor for classical WMMSE update
G = H H H C ( N K ) × ( N K ) Stacked Gram matrix
A ( t ) , B ( t ) Quadratic WMMSE transmit matrices at iteration t
λ ( t ) R + M Dual variables for PAPCs at iteration t
A λ ( t ) = σ 2 I M + diag ( λ ( t ) ) PAPC-regularized diagonal matrix in the classical branch
I M , I N , I d k Identity matrices of sizes M, N, and  d k
diag ( · ) , blkdiag ( · ) , tr ( · ) Standard matrix operators (diagonal, block-diagonal, trace)
P WMMSE ( t ) Classical WMMSE precoder at iteration t
P LC ( t ) LC-WMMSE (Woodbury) precoder at iteration t
P RLC ( t ) Robust LC-WMMSE (with PAPCs and imperfect CSI) precoder at iteration t
ω ( t ) Hybrid switching factor at iteration t
α ( t ) Adaptive damping factor at iteration t
WSR ( t ) Weighted sum-rate at iteration t (bps/Hz)
JNumber of dual inner steps
Table 3. Hybrid-impact ablation summary at SNR = 20 dB, r t x = 0.7 , ϵ = 10 3 (100 trials).
Table 3. Hybrid-impact ablation summary at SNR = 20 dB, r t x = 0.7 , ϵ = 10 3 (100 trials).
VariantFinal WSR (bps/Hz) t 99 (Iterations)
RLC Hybrid351.2512
RLC LC-only ( ω = 0 )350.9515
RLC WMMSE-only ( ω = 1 )351.2912
Table 4. Per-iteration computational complexity.
Table 4. Per-iteration computational complexity.
OperationClassical WMMSELC-WMMSE (SPC)RLC-WMMSE (PAPC)
Precoder solve O ( M 3 ) O M ( N K ) 2 + O ( N K ) 3 same as LC + dual + O J M ( N K )
Per-user N × N factorizations O ( K N 3 ) O ( K N 3 ) O ( K N 3 )
Gram products ( H H H , H H B ) O M ( N K ) 2 O M ( N K ) 2 O M ( N K ) 2
Hybrid switch ω ( t ) O ( K N 2 ) O ( K N 2 )
Adaptive damping α ( t ) O ( 1 ) O ( 1 )
Total (dominant) O ( M 3 + K N 3 ) O M ( N K ) 2 + ( N K ) 3 + K N 3 O M ( N K ) 2 + ( N K ) 3 + K N 3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guo, Z.; Sen, V.; Deng, H. Robust Low-Complexity WMMSE Precoding Under Imperfect CSI with Per-Antenna Power Constraints. Sensors 2026, 26, 159. https://doi.org/10.3390/s26010159

AMA Style

Guo Z, Sen V, Deng H. Robust Low-Complexity WMMSE Precoding Under Imperfect CSI with Per-Antenna Power Constraints. Sensors. 2026; 26(1):159. https://doi.org/10.3390/s26010159

Chicago/Turabian Style

Guo, Zijiao, Vaskar Sen, and Honggui Deng. 2026. "Robust Low-Complexity WMMSE Precoding Under Imperfect CSI with Per-Antenna Power Constraints" Sensors 26, no. 1: 159. https://doi.org/10.3390/s26010159

APA Style

Guo, Z., Sen, V., & Deng, H. (2026). Robust Low-Complexity WMMSE Precoding Under Imperfect CSI with Per-Antenna Power Constraints. Sensors, 26(1), 159. https://doi.org/10.3390/s26010159

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop