1. Introduction
With the sixth-generation mobile communication era, demands for larger capacity and higher rates are pursued [
1]. Millimeter-wave (mmWave)and even terahertz frequency bands will be adopted. Simultaneously, the small wavelength characteristic of mmWave is recognized as an inherent advantage for deploying massive MIMO systems [
2,
3]. However, mmWave is characterized by severe path loss and poor penetration. This deficiency is compensated in mmWave massive MIMO systems through beamforming to focus energy [
4]. Although superior performance is offered by traditional fully digital beamforming, an independent radio frequency (RF) chain is required for each antenna, leading to hardware complexity and power consumption that grow exponentially with the number of antennas. Hybrid (HBF) beamforming technology, achieved through the decomposition of beamforming into joint analog-digital domain optimization, has been recognized as the critical solution for balancing spectral efficiency and hardware implementation costs in multi-user dense communication systems [
5].
HBF is recognized for significantly reducing system power consumption and hardware complexity while high-bandwidth, high-data-rate, low-latency communication services are delivered [
6,
7], with its spectral efficiency approaching the performance upper bound of fully digital beamforming. However, the optimization problem is rendered inherently non-convex due to the high coupling between digital and analog beamforming domains, combined with the constant modulus constraints (CMC) of analog beamforming and transmit power constraints [
8]. A principal solution approach is implemented by leveraging the sparse nature of mm Wave MIMO channels through alternating optimization of the transmit precoder and receive combiner matrices; this optimization is approached by first identifying a locally optimal precoder solution, subsequently solving the updated matrix equations for the corresponding optimal combiner, and progressively iterating to optimize all four transceiver matrices [
9,
10]. In [
11], the base station (BS) precoding matrix and per-user combining matrix are designed through gradual minimization of the mean square error (MSE) to minimize the bit error rate (BER) in downlink multi-user transmissions. In [
12], an iterative algorithm employing alternating minimization (AltMin) and manifold optimization is proposed to solve the non-convex weighted minimum mean square error (WMMSE) problem with established convergence guarantees. Nevertheless, conventional approaches are characterized by requiring numerous iterations leading to slow convergence, being prone to suboptimal solutions, and critically not accounting for multidimensional information such as intra-cell multi-user interference management—constituting the primary challenge for future.
To overcome the limitations of matrix-based methods in capturing the inherent multidimensional structure of channels, tensor decomposition theory provides a powerful mathematical framework [
13,
14]. As a higher-order generalization of matrices, tensors offer a natural and efficient representation for multi-domain data encountered in mmWave massive MIMO systems, inherently revealing correlations among spatial, frequency, and user domains. Among various tensor decomposition models, the Tucker decomposition is particularly suitable for this task [
15]. It enables the direct processing of high-dimensional data without structural information loss caused by matricization. Unlike the canonical polyadic (CP) decomposition [
16], which requires identical column ranks across all factor matrices, the Tucker model allows different modes (e.g., antennas, subcarriers, users) to have distinct ranks. This flexibility is essential for accurately characterizing the varying degrees of freedom in each domain within hybrid beamforming problems. By performing a truncated-rank Tucker decomposition on the constructed channel tensor, a new approach is provided for designing an analog precoder and combiner under CMC. Meanwhile, the resulting frequency-domain and user correlation matrices can be used to design digital precoder and combiner.
mmWave channels (30–300 GHz) are characterized by substantial bandwidth advantages, which has driven the exploration of orthogonal frequency division multiplexing (OFDM) systems. In such systems, wide bandwidth channels are decomposed into multiple mutually orthogonal narrowband subchannels. This decomposition motivates the application of OFDM’s frequency-flat subchannels to mmWave massive MIMO systems, where the multidimensional structure of multiuser OFDM can be effectively addressed through tensor decomposition techniques. Prior implementations of tensor decomposition have been applied to channel estimation and hybrid precoding design in beamforming systems. Specifically, in [
17], phase rotation is incorporated into factor matrices of tensor-based orthogonal matching pursuit (OMP) algorithms to mitigate grid constraint-induced energy leakage; in [
18], high-rank four-dimensional canonical polyadic decomposition with Vandermonde constraints on three factor matrices is introduced for efficient subpath parameter estimation in sub-6G channel modeling; while in [
19], subcarrier channel matrices are represented as a third-order tensor to fully exploit channel information across all subcarriers via tensor decomposition. Although spectral efficiency approaching fully digital beamforming is achieved, these methods are characterized by critical dependency on RF chain count.
In HBF technology, designing is required for the analog precoding matrix and its combiner, digital precoding and its combiner, followed by joint optimization of both. Simultaneously, OFDM systems are equipped with numerous subcarriers, and mmWave massive MIMO systems are deployed with hundreds to thousands of antennas to serve more users. Addressing this problem necessitates tensor decomposition tools for processing high-dimensional data [
20,
21]. For hybrid beamforming in multi-user OFDM systems, employing tensor decomposition naturally characterizes multi-user interference and directly captures the coupling relationships of multidimensional parameters (spatial domain, temporal domain, frequency domain, user dimension, etc.) [
22], enabling the joint optimization of inter-user beams through decomposition to enhance spectral efficiency.
Proposed in this paper is an HBF design method based on joint optimization of Tucker Decomposition of a Fourth-Order Tensor and the RBD algorithm for mmWave massive MIMO multi-user OFDM systems. Through hybrid tensor decomposition and RBD interference management, noise is effectively suppressed, spectral efficiency approaching fully digital performance is achieved, and an effective solution is provided for mmWave massive MIMO multi-user OFDM systems by the algorithm.
The main contributions of this article are concluded as follows.
A unified fourth-order tensor channel modeling framework is established to overcome the dimensionality separation limitation of conventional matrix-based methods. This model cohesively integrates the spatial, frequency-domain, and user dimensions. A novel analog beamforming design scheme is subsequently proposed, wherein the analog precoder and combiner are derived from the factor matrices of a truncated-rank Tucker decomposition. The inherent non-convexity arising from the constant modulus constraints is effectively resolved using an Alternating Direction Method of Multipliers (ADMM)-based projection technique.
A digital beamforming scheme is developed by utilizing the outcomes of the tensor decomposition. The subcarrier and user factor matrices, obtained through the Alternating Least Squares (ALS) algorithm, are systematically integrated with the Regularized Block Diagonalization (RBD) technique and water-filling power allocation strategy. This integration enables precise interference management and effective noise suppression, thereby significantly enhancing the overall spectral efficiency.
Simulation experiments validate the impact of different parameters on the spectral efficiency performance of the HBF design in this paper. Simultaneously, simulation experiments verify that the proposed algorithm can still achieve satisfactory performance under low-resolution phase shifters.
The remainder of this paper is organized as follows:
Section 2 presents the channel model, system framework, and methodological approach;
Section 3 elaborates the HBF system design, computational complexity analysis, and numerical validation of spectral efficiency performance;
Section 4 concludes the study and outlines future research directions.
3. HBF Design
This paper presents a HBF framework based on joint Tucker decomposition of fourth-order tensors and RBD. Compared with conventional HBF designs, the proposed approach integrates channel state information, user-specific characteristics, and frequency-domain resources through a unified tensor model. Frequency-domain and user-domain factor matrices are utilized for digital beamformer optimization to achieve precise interference suppression. The detailed designs of analog and digital beamforming architectures are elaborated in subsequent sections.
3.1. Analog Precoder and Combiner
Through Tucker decomposition of a fourth-order tensor, the design of analog precoder and combiner is transformed into phase extraction and quantization of the transmitter-side factor matrices
. This method significantly reduces computational complexity while ensuring hardware feasibility. Moreover, the decomposed factor matrices of the frequency domain and the user domain provide the optimization space for subsequent digital precoder. First, analog precoder and combiner are designed via truncated-rank Tucker decomposition of a fourth-order tensor. First, the channel tensor
is built, with core tensor
. The four factor matrices obtained from its truncated-rank decomposition are analog precoder and combiner, subcarrier correlation matrix, and user correlation matrix, respectively. Tucker decomposition of a fourth-order tensor achieves interference suppression through multi-dimensional joint optimization without requiring per-stream alternating optimization. The channel model is then expressed as follows:
where
is the subcarrier factor matrix,
is the user factor matrix. The problem of Tucker decomposition of a fourth-order tensor for the channel is transformed into the solving process of analog precoder and combiner. Under the assumption of perfect channel state information (CSI), Although channel estimation for mmWave massive MIMO systems is challenging, several high-precision low-complexity methods have been proposed [
31,
32]. Let the channel tensor be denoted as
. Our objective is to minimize
. By optimizing the design variables
, subject to hardware constraints during the design of the analog precoder and combiner.
The CMC on the analog precoder and combiner render Equation (
10) a non-convex optimization problem. To address this, we deploy the ADMM, which effectively handles the non-convexity induced by CMC. Compared to conventional per-stream optimization methods, this approach significantly reduces computational complexity. Within the ADMM framework, we employ alternating least squares (ALS) to iteratively optimize factor matrices while updating the core tensor as follows:
where † denotes pseudoinverse. By performing SVD decomposition on the unfolding of the tensor, the factor matrices are updated. The tensor
is unfolded along mode-3 to obtain matrix
, defined as.
where
denotes the mode-3 unfolding operation of the tensor, which unfolds the third dimension (subcarrier dimension) into matrix rows and other dimensions (user, antenna) into columns. By performing SVD decomposition and taking the top M left singular vectors:
The tensor
is unfolded along mode-4 to obtain matrix
, defined as
By performing SVD decomposition and taking the top K left singular vectors:
Theauxiliary variables
and
and dual variables
UF and
UW are introduced, decomposing the constraint problem of constant modulus into solvable sub-problems. The objective function is:
where
,
, and the solution is obtained as follows:
To satisfy the CMC, phase extraction and normalization are performed on
The dual variable is updated to rigorously enforce the constant modulus constraint:
Collaborative optimization of analog precoder and combiners in mmWave massive MIMO systems is achieved by this algorithm through joint Tucker decomposition of a fourth-order tensor channel model using ALS and ADMM algorithms. Frequency-domain and factor matrices are yielded through the decomposition to participate in subsequent digital precoder and combiner design. After each iteration, an economy-size QR decomposition is executed, which fundamentally resolves beam pattern distortion caused by constant modulus constraints, thereby ensuring array gain. The algorithm flow of analog beamforming is presented in Algorithm 1.
Algorithm 1 Design of analog beamforming based on Joint ADMM-ALS decomposition |
- 1:
Randomly initialization: - 2:
Zero matrix: , Set - 3:
fortodo - 4:
ALS: Core tensor update: - 5:
- 6:
- 7:
- 8:
ADMM: Analog matrices solution: - 9:
- 10:
- 11:
- 12:
- 13:
Update , , according to (19) - 14:
if , then - 15:
break loop - 16:
end if - 17:
Update , , according to (17) - 18:
end for - 19:
Output:
|
3.2. Digital Precoder and Combiner
In digital precoder and combiner design, consideration must be given to the multi-user interference (MUI) problem. Traditional SVD algorithms cannot be applied for direct handling of multi-user interference, while the BD algorithm requires orthogonal user channels—a condition difficult to satisfy in practical scenarios. The RCD algorithm is characterized by slow convergence and susceptibility to local optima. Therefore, the RBD algorithm [
33] is adopted, where a regularization term is introduced based on the BD framework, enabling further suppression of MUI in the digital domain while noise amplification is avoided and spectral efficiency is significantly improved.
During the design of digital precoders and combiners in this work, the frequency-domain correlation matrix
and user correlation matrix
, derived from Tucker decomposition of a fourth-order tensor, are fully exploited. Precise suppression of dominant interferers is prioritized while avoiding excessive suppression of non-dominant interference components. Building upon analog pre-beamforming, multi-user interference is effectively canceled, inter-user interference is mitigated, signal-to-noise ratio is enhanced, and near-optimal power allocation is realized. For the k-th user at the m-th subcarrier, the post-combining equivalent channel is constructed as follows:
Construct the Interference Matri,
where
,
denotes the subcarrier weighting matrix and
represents the user weighting matrix. The user factor
enables effective quantification of interference intensity. Compared with the equal-weighting scheme, this approach achieves precise suppression of dominant interference sources. Subsequently, a primary digital precoder is designed to construct an interference null space, enabling
to nearly eliminate multiuser interference completely. Compared with the BD algorithm, this approach preserves more degrees of freedom. To derive the null-space projection matrix, SVD is performed on
, to extract the right singular vectors.
Then, when designing the digital precoder, to enhance interference suppression, frequency-domain and user factors are introduced.
can enhance independent user signals,
can enhance high-quality subcarrier weights, then SVD decomposition is performed.
The water-filling algorithm is introduced to design the secondary digital precoder [
34]. According to singular values of the equivalent baseband channel (channel quality indicators), limited power is intelligently allocated to high-quality subchannels to suppress noise.
Then the complete digital precoder is as follows:
The digital combiner similarly features a secondary structure. By performing SVD decomposition on the weighted equivalent channel, left singular vectors are extracted.
In the digital domain, balanced control of interference and noise is achieved through regularized projection and optimal power allocation. Frequency-domain and user-domain factor matrices are introduced to enhance high-quality subcarriers and precisely suppress interference, significantly improving spectral efficiency in dense user scenarios.
3.3. Complexity Analysis
We proceed to analyze the computational complexity of the proposed HBF technique based on fourth-order Tucker decomposition. For the analog precoding stage, the complexity is
. For the digital precoding stage, the complexity is
. The overall algorithm complexity is
. Analog precoding complexity is higher in our algorithm due to the higher-order Tucker decomposition and subcarrier scaling in wideband scenarios. Digital precoding complexity is similar to hybrid Tucker2 and hybrid SOTS. The hybrid Tucker2 fits small antenna arrays. Hybrid SOTS fits medium-scale arrays. Despite marginally higher overall complexity, our algorithm provides superior interference suppression. It is ideal for massive MU-MIMO with abundant subcarriers and users when computational resources allow. The Computational Complexity Comparison are shown in
Table 1.
3.4. Simulation Results and Comparisons
In this section, simulation results are presented to evaluate the performance of our proposed algorithm and compare it with prior algorithms. Under the assumption of perfect CSI availability, SNR for all users, and identical numbers of scattering clusters, propagation paths, and antenna array orientations, each experiment yields results averaged over 500 Monte Carlo channel realizations. Key simulation parameters are summarized in
Table 2.
Experimental comparisons are conducted between the proposed algorithm and the hybrid Tucker2 [
27] and hybrid SOTS [
28] methods. While practical OFDM systems typically employ 128 or more subcarriers, we configure the number of subcarriers as M = 32 due to computational constraints, particularly those imposed by the hybrid SOTS implementation. This parameter configuration does not compromise the validity of the experimental findings.
In the first experiment, the spectral efficiency performance of the system is evaluated under different SNR is evaluated. Assuming a mm Wavemassive MIMO system with
,
antennas and
subcarriers, number of RF chains
, number of users
, and number of data streams
. In
Figure 2, Compared with hybrid Tucker2 and hybrid SOTS under identical simulation conditions, the FD-RBD scheme—characterized by the absence of analog beamforming—omits frequency- and user-domain factor matrices in its design. This fully-digital beamforming approach, derived by removing the analog beamforming component from our proposed algorithm, requires dedicated RF chains per antenna element, resulting in substantially higher hardware and computational complexity than HBF architectures. While FD-RBD serves as the performance upper bound benchmark, the core objective of hybrid solutions is to approach this bound with controlled complexity. Our proposed algorithm demonstrates superior spectral efficiency over competing methods, achieving performance levels approaching those of the fully-digital RBD benchmark. Spectral Efficiency Performance Comparison (SNR = 5 dB) are shown in
Table 3.
At 5 dB SNR, the proposed algorithm achieves a spectral efficiency of 76.61 ± 0.22 bits/s/Hz ( confidence interval). This represents gains of 3.38 bits/s/Hz and 6.28 bits/s/Hz over hybrid Tucker2 and hybrid SOTS, respectively. With the narrowest confidence interval width (0.22 bits/s/Hz) observed in the comparative study, the proposed scheme demonstrates enhanced robustness against channel stochastic variations while approaching the theoretical limit of fully-digital beamforming (82.96 bits/s/Hz). The performance superiority over both hybrid Tucker2 and hybrid SOTS is statistically significant.
The second experiment evaluated the relationship between system spectral efficiency and transmitting and receiving antennas, setting parameters
,
, user number
, and data stream number
. In
Figure 3, at 5 dB SNR,
, the proposed algorithm achieves spectral efficiency gains of 20.45 bits/s/Hz and 23.09 bits/s/Hz over hybrid SOTS and hybrid Tucker2, respectively. These results demonstrate consistent superiority of our algorithm in spectral efficiency across the evaluated SNR range.
We then tested the impact of receive antennas on spectral efficiency, setting the parameter
with all other parameters consistent with the second experiment. In
Figure 4, conducting tests at SNR = −15 dB and SNR = 5 dB. At −15 dB with
, the spectral efficiency is 3.52 bits/s/Hz higher than the hybrid SOTS algorithm and 4.71 bits/s/Hz higher than the hybrid Tucker2 algorithm. The results indicate that our algorithm exhibits superior spectral efficiency compared to other algorithms.
In both Experiment 2 and 3, the spectral efficiency of HBF algorithms exhibits strong sensitivity to the number of transmit antennas but weak dependence on receive antennas. This occurs because: Spatial degrees of freedom are dominated by the transmit array, where increasing transmit antennas enhances interference suppression and precoding gain through high-precision beam energy focusing, yielding logarithmic growth in spectral efficiency; Receive antennas primarily provide diversity gain, yet under fixed data-stream constraints, surplus antennas remain underutilized; The sparsity of mmWave multipath channels makes transmit beamforming critical, as additional transmit antennas precisely match angle-domain paths while extra receive antennas offer diminishing returns in path capture capability. Consequently, HBF spectral efficiency demonstrates higher sensitivity to transmit antenna scaling.
Then spectral efficiency performance under different data streams is simulated in
Figure 5. Parameters are set as
.
. At −15 dB SNR with
, the proposed algorithm achieves spectral efficiency improvements of 7.78 bits/s/Hz over hybrid SOTS and 7.02 bits/s/Hz over hybrid Tucker2. At 5 dB SNR with
, it maintains superior gains of 6.86 bits/s/Hz and 5.01 bits/s/Hz against these benchmarks respectively. These results demonstrate consistent performance dominance across tested conditions. Moreover, with increasing data streams, our algorithm not only outperforms competitors but also approaches fully-digital beamforming performance, attributed to enhanced utilization of RF chains and strengthened interference suppression that eliminates multi-user interference. Consequently, the proposed architecture proves particularly effective in high-data-stream scenarios.
Below, spectral efficiency performance under different users is evaluated in
Figure 6. Parameters are set as,
,
,
. At −15 dB SNR with
users, the proposed algorithm achieves spectral efficiency gains of 26.57 bits/s/Hz over hybrid SOTS and 17.02 bits/s/Hz over hybrid Tucker2. At 5 dB SNR with
, it demonstrates superior improvements of 43.69 bits/s/Hz and 22.13 bits/s/Hz against these benchmarks respectively. This significant outperformance in dense-user scenarios confirms the algorithm’s advantage over existing HBF designs, increased user count enables precise suppression of dominant interferers through frequency- and user-domain factor matrices while avoiding over-suppression of weak interference sources, thereby establishing particular efficacy for hybrid beamforming in dense-user communication systems.
Finite phase-shifter resolution causes phase discretization, leading to beam pattern gain degradation. With ideal continuous-phase shifters, spectral efficiency remains uncompromised. Thus while higher resolution improves spectral efficiency, practical hardware constraints impose fundamental limits. Simulations validate phase-shifter resolution impact on our algorithm’s spectral efficiency under Parameter Set
,
SNR = 5 dB. In
Figure 7, our method achieves
of the ideal spectral efficiency at 6-bit resolution, whereas hybrid Tucker2 and hybrid SOTS require 5-bit and 7-bit resolutions respectively for comparable performance. This demonstrates marginally higher resolution sensitivity than hybrid Tucker2 but lower than hybrid SOTS, enabling spectral efficiency requirements to be met with moderate-resolution hardware.