Next Article in Journal
An Accelerated Maximum Flow Algorithm with Prediction Enhancement in Dynamic LEO Networks
Previous Article in Journal
Query by Example: Semantic Traffic Scene Retrieval Using LLM-Based Scene Graph Representation
Previous Article in Special Issue
High-Performance Series-Fed Array Multiple-Input Multiple-Output Antenna for Millimeter-Wave 5G Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cross-Layer Stream Allocation of mMIMO-OFDM Hybrid Beamforming Video Communications

by
You-Ting Chen
1,
Shu-Ming Tseng
1,*,
Yung-Fang Chen
2 and
Chao Fang
3
1
Department of Electronic Engineering, National Taipei University of Technology, Taipei 106, Taiwan
2
Department of Communication Engineering, National Central University, Taoyuan 320, Taiwan
3
Faculty of Information Technology, Beijing University of Technology, Beijing 100021, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(8), 2554; https://doi.org/10.3390/s25082554
Submission received: 13 February 2025 / Revised: 10 April 2025 / Accepted: 15 April 2025 / Published: 17 April 2025

Abstract

:
This paper proposes a source encoding rate control and cross-layer data stream allocation scheme for uplink millimeter-wave (mmWave) multi-user massive MIMO (MU-mMIMO) orthogonal frequency division multiplexing (OFDM) hybrid beamforming video communication systems. Unlike most previous studies that focus on the downlink scenario, our proposed scheme optimizes the uplink transmission while also addressing the limitation of prior works that only consider single-data-stream users. A key distinction of our approach is the integration of cross-layer resource allocation, which jointly considers both the physical layer channel state information (CSI) and the application layer video rate-distortion (RD) function. While traditional methods optimize for spectral efficiency (SE), our proposed method directly maximizes the peak signal-to-noise ratio (PSNR) to enhance video quality, aligning with the growing demand for high-quality video communication. We introduce a novel iterative cross-layer dynamic data stream allocation scheme, where the initial allocation is based on conventional physical-layer data stream allocation, followed by iterative refinement. Through multiple iterations, users with lower PSNR can dynamically contend for data streams, leading to a more balanced and optimized resource allocation. Our approach is a general framework that can incorporate any existing physical-layer data stream allocation as an initialization step before iteration. Simulation results demonstrate that the proposed cross-layer scheme outperforms three conventional physical-layer schemes by 0.4 to 1.14 dB in PSNR for 4–6 users, at the cost of a 1.8 to 2.3× increase in computational complexity (requiring 3.6–5.8 iterations).

1. Introduction

In response to the increasing demand for higher data transmission rates, the 5G system can utilize the mmWave frequency band to increase available bandwidth [1]. However, the mmWave frequency band poses significant challenges in terms of propagation path loss, susceptibility to blockages, and high penetration loss, making it less reliable for stable communication [2]. To address the propagation path loss issues in the mmWave frequency band, the use of a massive MIMO architecture with multiple antennas has been considered an effective solution.
Beamforming technology is a critical technology in the 5G specification and can be divided into digital beamforming, analog beamforming, and hybrid beamforming (HBF). In terms of bit error rate (BER) performance, digital beamforming offers the best performance. Nevertheless, the hardware cost can be substantially high due to the need to connect each antenna to a RF chain. Moreover, fully digital beamforming requires extensive power consumption, making it impractical for large-scale deployments. Analog beamforming exhibits the poorest performance among the three techniques. It typically relies on hardware phase shifters to adjust the phase and achieve energy concentration in specific directions. However, analog beamforming has lower beam control accuracy and limited adaptability due to its reliance on hardware-based phase adjustments. As a result, neither digital beamforming nor analog beamforming alone is optimal for 5G mmWave applications. Hence, the hybrid beamforming architecture is a good compromise between system performance and hardware cost. It combines high-dimensional phase shifters with low-dimensional digital processors [3]. Hybrid beamforming is particularly advantageous in multi-user (MU) scenarios, where balancing multiple data streams efficiently is crucial. It is notable that the hybrid beamforming structure was already adopted by the current 5G New Radio (NR) systems [4].
In [3,5], the authors studied single-user (SU) and multi-user (MU) mmWave MIMO narrowband single-carrier systems. However, these methods lack the flexibility to adapt to wideband mmWave channels, making them unsuitable for real-world 5G communication systems that require efficient spectrum utilization over broad frequency ranges. In wideband environments, there is a frequency-selective fading issue, which is typically addressed using orthogonal frequency division multiplexing (OFDM) technology [6,7].
In [8], the authors explored the design of HBF for SU-MIMO-OFDM systems. But considering a single user in practical environments is unrealistic, as most 5G applications involve multiple users sharing the network simultaneously. In [9], the authors investigated the HBF design of MU-MIMO-OFDM systems. However, the mobile stations (MS) only employ analog beamforming, which means the base station (BS) can only transmit only one data stream to the MS. This limitation reduces transmission efficiency and prevents the system from leveraging the full potential of multi-user MIMO (MU-MIMO).
When a user has multiple RF chains, the BS has the flexibility to allocate multiple data streams to the user dynamically, taking into account the channel conditions. This dynamic allocation can significantly improve the system performance [10]. The beamforming design in [10] adopted a fully digital beamforming architecture, which means it cannot be directly applied to hybrid beamforming architectures due to hardware constraints and cost limitations. The concept of data stream allocation in hybrid beamforming design has been addressed in related studies such as [11,12,13,14]. However, it is important to note that these studies focus on single-carrier systems. Thus, the applicability of their findings to OFDM-based mmWave systems remains an open question.
In addition to physical layer resource allocation, cross-layer resource allocation recently becoming popular. It is a resource allocation strategy considering the mutual influence between the physical layer channel state information and the application layer video rate distortion (RD) function. Unlike traditional resource allocation approaches that treat the physical and application layers separately, cross-layer optimization provides a holistic approach to maximizing video quality while maintaining efficient spectrum utilization. Different videos have different RD functions, so cross-layer allocation has an inherent advantage in that we allocate more resources to the user whose video has most to gain in video distortion reduction [15]. This strategy is particularly effective in scenarios with dynamic network conditions, where user priorities and network congestion levels fluctuate over time. There have been numerous studies indicating that cross-layer resource allocation can enhance video quality [15,16,17], providing a better simulation of real-world usage scenarios where internet video consumes most internet traffic.
Some interesting real-world usage scenarios include Wireless Multimedia Sensor Networks (WMSNs) [18], where distributed camera and microphone networks generate large volumes of multimedia content, and Internet of Video Things (IoVT) [19], where intelligent transportation systems utilize real-time road monitoring and vehicle-to-vehicle video sharing to enhance driving safety and efficiency. Given the increasing reliance on video-based applications in modern networks, optimizing hybrid beamforming and data stream allocation for cross-layer video communication is an urgent and relevant research challenge.

2. Related Works

In [4], the authors considered a downlink MU-MISO system with BS with massive antennas and multiple single-antenna users where a hybrid precoding and simultaneous wireless information and power transfer (SWIPT) are adopted. However, the single-antenna user can support only one data stream and downlink case is considered. In comparison, the proposed scheme considers the multi-antenna user which can support multiple data streams and consider the uplink case.
In [20], data streams are allocated in multi-carrier OFDM systems. Its objective function was sum data rate maximization, in which the downlink case is considered, and only the physical layer metrics are considered. In comparison, the proposed scheme considers the uplink case and the cross-layer performance metric PSNR for video quality.
In [21], multiple data streams are allocated in MU-MIMO-OFDM systems. Its objective function was power consumption minimization, in which the downlink case is considered, and only the physical layer metrics are considered. In comparison, the proposed scheme considers the uplink case and the cross-layer performance metric PSNR for video quality.
Most of the abovementioned studies focused on downlink transmission. The uplink transmission was considered in [22], whereby each MS has the ability to dynamically allocate a varying number of transmission data streams, taking into account the CSI at the physical layer. The objective function is set to maximize the spectral efficiency. The aforementioned prior works, however, considered the physical layer only. In comparison, we propose source encoding rate control and cross-layer data stream allocation considering the cross-layer performance metric PSNR for video quality.
The rationale behind this shift from a physical layer performance metric (such as information rate) to cross-layer performance metric (such as video quality PSNR) and its implications on the network’s overall performance is explained as follows. Cross-layer resource management considers the mutual influence between the physical layer channel state information and the application layer video rate distortion (RD) function. Users’ videos would not have the same RD functions (e.g., fast moving video needs many more bits for the same video distortion), so cross-layer allocation has an inherent advantage in that we can allocate more resources to the user whose video has most to gain in video distortion reduction/PSNR increase [15,16,17]. Considering an average PSNR as the objective function, we can obtain a better video quality and better user experience in video communications, which represents the majority of today’s data traffic [23], including the Internet of Video Things [19]. In summary, for video communications, the increase in the received data rate does not necessarily increase the received video quality (e.g., PSNR) because video quality depends on the source encoding rate control and video content [16,24,25,26,27].
The contributions of this paper are as follows:
  • We propose a novel iterative cross-layer data stream allocation scheme with the objective function being to maximize the average peak signal-to-noise ratio (PSNR). Building upon the physical layer data stream allocation from [14,20,22] as the initialization before iterations, the proposed iterative approach allows the user with the lowest video quality to gain an additional data stream from users with more than one data stream, and improves the average PSNR. The iterative process is repeated until there is no further enhancement. In comparison, the authors of [14,20,22] allocated a data stream with a physical layer, the goal being to maximize the information rate, while there is no iterative process in the proposed cross-layer approach.
  • We jointly adapted the video source encoder rate and cross-layer dynamic data stream allocation with objective of the video content-dependent performance metric PSNR. In comparison, the authors of [14,20,22] did not have source encoding rate control and allocated data streams with a physical layer objective independent of video content, such as information rate. The rationale behind this shift is as follows: For video communications, the increase in the received data rate does not necessarily increase the received video quality (e.g., PSNR), because the video quality depends on the source encoding rate control and video content [16,24,25,26,27].
  • The proposed scheme is a general approach in the sense that it can use any physical layer data stream allocation schemes, including those in [14,20,22]. The simulation results show that the proposed cross-layer scheme outperforms the physical layer scheme in [14,20,22] by 1.14 dB, 0.65 dB, and 1.1 dB for 4 users, respectively, as shown in the PSNR performance
  • We analyze theoretical computational complexity of the proposed cross-layer schemes in big O notation in Section 5. Compared to [22], the computational complexity of the proposed cross-layer scheme is 1.8~2.3 times that of the physical scheme [22] when the number of iterations, L, is 3.6~5.8, as shown in Section 5.

3. System Model

3.1. mmWave MU Massive MIMO-OFDM System

Figure 1 illustrates the structure of a hybrid beamforming system designed for mmWave MU-Massive MIMO OFDM. The BS is equipped with N B S antennas and N R F B S RF chains, serving U MSs with N M S antennas and N R F M S chains through K subcarriers. In addition, at the BS, the total number of received data streams N S is limited by the number of RF chains of the BS, i.e., N S N R F B S . The system has the capability to allocate varying numbers of transmission data streams to each MS based on their respective CSI.

3.2. Channel Model

As in [22], we adopt a cluster channel model with N c l clusters and N r a y propagation paths in each cluster, which includes time delays, based on the framework presented in [28] for simulation. According to [29,30], the frequency-domain channel model for the BS and the u-th MS on the k-th subcarrier in an OFDM system can be described as follows:
H u k = 1 P L u c = 0 N c l 1 l = 1 N r a y α u , c , l a r ϕ u , c , l r a t H ϕ u , c , l t e j 2 π c ( k / K ) ,
The path loss is represented by P L u , while α u , c , l ~ C N ( 0 , N M S N B S N c l N r a y ) , ϕ u , c , l r , and ϕ u , c , l t denote the complex path gain, angle of arrival (AOA), and angle of departure (AOD), respectively, for the l-th propagation path in the c-th cluster. The antenna array response vector (AARV) for the receiver and transmitter is denoted as a r   (∙) and a t   (∙), respectively.

3.3. Notation

The notation of hybrid beamforming symbols are described in Table 1.

4. Proposed Scheme: Cross-Layer Resource Allocation

First, we introduce the physical layer baseline scheme in [22], and then our proposed cross-layer scheme, which uses the building blocks of the analog beamformer and digital beamformer in [22] and adds outer iterations allowing lower-PSNR users to contend for data streams with higher-PSNR users to maximize the average PSNR. The proposed cross-layer data stream allocation scheme is a general approach in the sense that it can use any physical layer data stream allocation scheme, not just that of [22], as the starting point before iterations. In Section 6 (Figure 2), we also provide performance comparisons of the cross-layer data stream allocation schemes we are building upon [14,20]. For illustration purposes, we use [22].

4.1. Baseline Scheme: Physical Layer Allocation [22]

In [22], a hybrid precoder and combiner is designed for the uplink mmWave MU-Massive-MIMO OFDM system. The objective is to maximize the overall SE while satisfying the constant amplitude constraints and total transmitted power constraint for each MS. To increase the system throughput, multiple users can be served at each subcarrier simultaneously [31], assuming perfect CSI availability. The optimization problem can thus be formulated as follows:
max F RF , u u = 1 U , F BB , u k k = 1 K u = 1 U , W RF , W BB k k = 1 K , n s , u u = 1 U , P u [ k ] k = 1 K u = 1 U 1 K k = 1 K u = 1 U i = 1 n s , u log 2 1 + S I N R u , i k
subject to
F R F , u ( m , n ) = 1 N M S , u , m , n ,
W R F m , n = 1 N B S , m , n ,  
k = 1 K P u k F 2 = P 0 , u ,
F R F , u f B B , u , i k 2 = 1 , u , i , k .
Constraints (2a) and (2b) are imposed to ensure constant amplitude on the analog precoder at the MS and the analog combiner at the BS, respectively. Constraint (2c) is imposed to ensure that the total transmit power of the MS is P 0 . To satisfy constraint (2c), the values of f B B , u , i k are normalized in order to meet constraint (2d).

4.2. Propose Scheme: Cross-Layer Allocation

Our proposed framework is tailored for video transmission, which takes into account not only the theoretical communication capacity but also the characteristics of the video content and channel propagation conditions. This methodology extends beyond relying solely on physical layer CSI and integrates rate-distortion considerations from the application layer.
The mean square error (MSE) distortion of the u-th user can be expressed as
M S E u = a u + b u R u + c u  
The PSNR of the u-th user can be expressed as
P S N R u = 10 log 10 255 × 255 M S E u
The proposed scheme employs a content-based source encoder rate control similar to that in [15,16,17]. H.264 source encoding with baseline profile is used. The frames inside one Group of Pictures (GOP) are encoded by H.264 source encoding rate control 80, 100, …, 600 kbps. The rate R u (bits/s) in (3) is the target encoding bit rate for the H.264 source encoding rate control.
Our objective is to maximize the average PSNR. P S N R u is the performance metric used to evaluate the video quality of user u , which is inversely proportional to the video MSE distortion M S E u . In cross-layer resource allocation, maximizing PSNR directly translates to better video quality, making it a more relevant metric than conventional physical-layer objectives such as spectral efficiency. The proposed iterative stream reallocation scheme optimizes PSNR by dynamically adjusting data stream allocation based on the rate-distortion characteristics of each user’s video. Unlike static or purely physical-layer-based allocations, the iterative process allows lower-PSNR users to contend for additional data streams, improving their video quality while maintaining a balance among all users.
The cross-layer resource allocation problem can be expressed as
max F RF , u u = 1 U , F BB , u k k = 1 K u = 1 U , W RF , W BB k k = 1 K , n s , u u = 1 U , P u k k = 1 K u = 1 U 1 U u = 1 U P S N R u
where the constraints are the same as those in (2a–d).
The reasons for employing source encoding rate control and the video content-dependent performance metric PSNR is as follows: For video communications, the increase in the received data rate does not necessarily increase the received video quality (e.g., PSNR) because the video quality depends on the source encoding rate control and video content [16,24,25,26,27].
The proposed cross-layer hybrid beamforming scheme uses the building blocks of the analog beamformer Tensor Unfolding Matrix Decomposition (TUMD) and digital beamformer Uplink Coordinated Block Diagonalization (UPCBD) in [22] and adds outer iterations allowing lower-PSNR users to contend for data streams with higher-PSNR users to maximize the average PSNR.
The pseudo-code of the analog beamforming TUMD algorithm proposed in [22] is described in Algorithm 1 below. The algorithm is inspired by tensor-unfolding, which is a mathematical operation that converts a tensor into a matrix, aiming to simplify calculations. The CSI of MIMO-OFDM is represented by a three-dimensional matrix with dimensions ( N B S * N M S * K ). By applying tensor unfolding and eigenvalue decomposition (EVD) mathematical operations, the algorithm computes the analog precoder F R F , u and analog combiner W R F . Please refer to the paper [22] for a detailed description of the algorithm and its mathematical formulation.
Algorithm 1: Analog beamforming design TUMD in [22].
Input:  H u k ,   u = 1 ,   2 ,   ,   U ,   k = 1 ,   2 ,   ,   K ,   N R F M S ,   N R F B S
1: for  u = 1   t o   U  do
2: H u ( l ) = H u [ 1 ] T H u [ 2 ] T H u [ K ] T T C ( K N B S ) × N M S represents the longitudinal tensor-unfolding of a three-dimensional matrix.
3: Do   EVD .   H u ( l ) H H u ( l ) = V ¯ u D ¯ u V ¯ u H .
               Matrix superscript H means Hermitian transpose.
D ¯ u is a diagonal matrix. Diagonal entries are the eigenvalues
ρ ¯ u ( 1 ) ρ ¯ u ( N M S ) .
V ¯ u refers to the eigenvector associated with ρ ¯ u ( n ) .
4: F R F , u = 1 N M S e j V ¯ u : , 1 : N R F M S
5:    Definition H e q , u k = H u k F R F , u
H e q , u ( h ) = [ H e q , u 1   H e q , u K   ]   C N B S × ( K N R F M S ) represents the horizontal tensor-unfolding of a three-dimensional matrix.
6: Do   EVD .   H e q , u ( h ) H e q , u ( h ) H = U ¯ e q , u D ¯ e q , u U ¯ e q , u H
D ¯ e q , u is a diagonal matrix. Diagonal entries are the eigenvalues ρ ¯ e q , u ( 1 ) ρ ¯ e q , u ( N B S ) .
U ¯ e q , u refers to the eigenvector associated with ρ ¯ e q , u ( n )
7: end for
8: Assign one RF chain to all served MS.
N R F , u = 1 ,   u = 1 ,   2 ,   ,   U
9: repeat
10: u * = argmax u C   ρ ¯ e q , u ( N R F , u + 1 ) .
C = u | N R F , u < N R F M S , u U ,   U = 1 ,   2 ,   ,   U .
11: N R F , u * = N R F , u * + 1 .
12: until  u = 1 U N R F , u = N R F B S or C =
13: n s , u = N R F , u ,   u = 1 ,   2 ,   ,   U .
14: for  u = 1   t o   U  do
15: W R F , u = 1 N B S e j U ¯ e q , u : , 1 : N R F , u
16: end for
17: W R F = W R F , 1 W R F , 2 W R F , U
Output: F R F , u , n s , u ,   u = 1 ,   2 ,   ,   U and W R F
The pseudo-code of the digital beamforming algorithm UPCBD in [22] is described in Algorithm 2 below. The design concept of this algorithm is to eliminate the inter-user interference and inter-stream interference by utilizing block diagonalization. It is primarily designed based on the channel state information H u k , analog precoder F R F , u , and analog combiner W R F . It mainly involves performing singular value decomposition (SVD). Please refer to paper [22] for a detailed procedure.
Algorithm 2: Digital beamforming design UPCBD in [22].
Input:  n s , u , F R F , u ,   u = 1 ,   2 ,   ,   U , W R F ,
H u k ,   u = 1 ,   2 ,   ,   U   , k = 1 ,   2 ,   ,   K
1: for  k = 1   t o   K  do
2:       for  u = 1   t o   U  do
3:            H ¯ u k = W R F H H u   [ k ] F R F , u
4:           Do SVD.
H ¯ u k = U ¯ u k Σ ¯ u k V ¯ u [ k ] H
5:           Definition F ^ u k = V ¯ u k : , 1 : n s , u  
H ̿ u [ k ] = H ¯ u k F ^ u k
6:       end for
7:       for  u = 1   t o   U  do
8:              H ˇ u k = H ̿ 1 [ k ] , , H ̿ u 1 [ k ] , H ̿ u + 1 [ k ] , , H ̿ U [ k ]
9:             Do SVD.
H ˇ u k = U ˇ u 1 k U ˇ u 2 k Σ ˇ u k V ˇ u k H
10:             Do SVD.
U ˇ u 2 H k H ̿ u k = U ¨ u k Σ ¨ u k V ¨ u k H
11:             Define the digital precoder in its unnormalized form.
F ~ B B , u k = F ^ u k V ¨ u 1 k =
f ~ B B , u , 1 k , , f ~ B B , u , n s , u k
12:             for  i = 1   t o   n s , u  do
13:                    f B B , u , i k = f ~ B B , u , i k F R F , u f ~ B B , u , i k
14:             end for
15:              F B B , u k = [ f B B , u , 1 k , , f B B , u , n s , u k ]
16:              W B B , u k = U ˇ u 2 k U ¨ u 1 k
17:       end for
18: end for
Output:  F B B , u k ,   W B B ,   u k ,   u = 1 ,   2 ,   ,   U ,   k = 1 ,   2 ,   ,   K
The proposed cross-layer data stream allocation is as follows: Initially, we conduct the baseline physical layer scheme [22], Algorithms 1 and 2. Then, we calculate the PSNR for all users and identify the user with the lowest PSNR. We then create a set C, which includes users with more than one data stream. Next, we proceed with the data stream re-allocation. We attempt to reduce one data stream for the users in set C and increase one data stream for the user with the lowest PSNR. Since the number of data streams for each user has changed, we need to rerun the analog beamforming algorithm and digital beamforming algorithm, and then recalculate the PSNR. The design of the analog combiner in step 15 of Algorithm 1 (analog beamforming) depends on the number of data streams for each user. Therefore, we only need to re-compute the analog beamforming algorithm from steps 14 to 17 in Algorithm 1. If the average PSNR after exchanging the data streams is higher than before, we proceed with the data stream exchange; otherwise, we maintain the initial data stream allocation. Assuming that the average PSNR after exchanging the data streams is higher than before, we will once again identify the user with the lowest PSNR. Repeat the above steps until the termination condition is met, which is when the average PSNR is not higher than before.
The pseudo-code of our proposed cross-layer data stream allocation algorithm is shown in Algorithm 3.
Algorithm 3: Proposed cross-layer data stream allocation.
Our proposed cross-layer data stream allocation framework is indeed designed with modularity and flexibility in mind, allowing it to be integrated with various physical-layer data stream allocation (beamforming) techniques.
Steps 1–4 constitute a replaceable function block for analog/digital beamforming and initial data stream allocation. This function block is agnostic to the specific beamforming algorithm used and can be readily substituted with other advanced hybrid beamforming methods such as IGLRAM [14], ACMD [20], and TUMP [22]. Here, we use the Algorithm 1 analog beamformer TUMD [22] and Algorithm 2 digital beamformer UPCBD [22] as an example.
1: Analog Beamforming Selection: Compute F R F , u , n s , u , W R F using an analog beamforming algorithm (e.g., Algorithm 1 from TUMP [22]).
2: Digital Beamforming Calculation: Compute F B B , u k ,   W B B , u k   f o r   u = 1 ,   2 ,   ,   U and k = 1 ,   2 ,   ,   K using a digital beamforming algorithm (e.g., Algorithm 2 from TUMP [22]).
3: Power Allocation: Apply water-filling power allocation to optimize power distribution.
4: Calculate SE
The following steps represent the newly proposed cross-layer data stream allocation in this paper.
5: Calculate average PSNR
6: Index = min( P S N R u ,   u = 1 ,   2 ,   ,   U )
7: repeat
8:       for  u = 1   t o   U  do
9:             if       N s , u > 1
10:                         add u to Set C
11:             end if
12:       end for
13:       for  u = 1   t o   U  do
14:               if  C 1 , u ,   u = 1 ,   2 ,   ,   U ~= 0
15:                          N s , I n d e x N s , I n d e x + 1
16:                          N s , u N s , u − 1
17:                         Re-execute Analog/Digital Beamforming (e.g., Algorithms 1 and 2 from TUMP [22]))
18:               end if
19:               if new average PSNR > previous average PSNR
20:                         data stream change
21:                         Index = min( P S N R u , u = 1 , 2 , , U )
22:               else
23:                          N s , I n d e x N s , I n d e x − 1
24:                          N s , u N s , u + 1
25:               end if
26:       end for
27: until new average PSNR <= previous average PSNR

5. Computational Complexity

According to the complexity analysis in [22], the computational complexity of analog beamforming mainly arises from lines 3, 5, and 6 of Algorithm 1. These three lines are executed U times according to the number of users U, resulting in a total complexity of adjusted PSNR. Scheme C has a latency of 2, and Scheme B has latency of 6, so Scheme C outperforms Scheme B in terms of latency.
O U 2 K N B S N M S N R F M S + m i n N B S 2 K N R F M S , N B S K 2 N R F M S 2
According to the complexity analysis in [22], the computational complexity of digital beamforming primarily arises from lines 3, 5, 10, 11, and 16 of Algorithm 2. These five lines are executed U times according to the number of users U and K times according to the number of subcarriers K, resulting in a total complexity of
O U K N B S N R F M S N R F B S + N M S
In our proposed data stream allocation (Algorithm 3), when the data streams are re-allocated, it is necessary to re-compute the analog beamforming algorithm from line 14 to line 17 in Algorithm 1, as well as the complete digital beamforming (Algorithm 2). The analog beamforming (Algorithm 1) only needs to execute from line 14 to line 17 because the design of the analog combiner is dependent on the allocation of user RF chains. Hence, it is sufficient to execute line 14 to line 17 concerning the design of the analog combiner. We assume a variable L to count the number of times the digital beamforming algorithm is re-executed from scratch in the cross-layer data stream allocation algorithm. Since the computational complexity of analog beamforming mainly arises from lines 3, 5, and 6 of Algorithm 1, the computational complexity of analog beamforming can be considered as negligible or close to 0. The computational complexity of digital beamforming increases with the number of iterations of the cross-layer data stream allocation algorithm. The computational complexity of digital beamforming becomes
O   ( U K N B S N R F M S N R F B S + N M S ) × ( 1 + L ) .
According to the above explanation, the complexity of the cross-layer data stream allocation algorithm we proposed can be summarized as O U 2 K N B S N M S N R F M S + m i n N B S 2 K N R F M S , N B S K 2 N R F M S 2 + O   ( U K N B S N R F M S N R F B S + N M S ) × ( 1 + L ) . Table 2 presents the complexity analysis expressions for both the PHY layer and cross-layer scenarios.

6. Simulation Results

This section presents our experimental results. Our simulations were conducted on a computer environment with an 8-core CPU: Intel® Core™ i7-9700 (Intel, Santa Clara, CA, USA) and 48GB of memory.
For video communications, we use a similar setup to that in [15,16,17]. We utilize a sequence of CIF videos with a total duration of 50 s, each containing 30 frames per second. We applied compression using the baseline profile of the H.264/AVC reference software, specifically Version JM 11.0. Our Group of Pictures (GOP) configuration consisted of 15 frames, structured as I-P-P-P, and the frames within each GOP were encoded using H.264 source encoding rate control.
To assess various bitrates, we encoded each GOP at discrete rates between 80 and 600 kbps. These operational points were then used to fit the RD function in (3) through a nonlinear regression approach [15,16,17].
For our resource management, we randomly assigned different starting points of the same video to different users [15,16,17]. The resource allocation decision was made for each GOP, ensuring that the cross-layer resource allocation problem in (5) is optimized.
The hybrid beamforming architecture and parameters of our MU-MIMO OFDM system are based on the environment configuration in [22], as shown in Table 3, where the system being analyzed operates with a total channel bandwidth of 50 MHz and utilizes an OFDM system with 16 subcarriers. Additionally, for all schemes, the optimal water-filling power allocation method is employed for each user (MS).
We conducted experiments with 4, 5, and 6 users, respectively. The BS is equipped with eight RF chains, indicating that the BS can receive a maximum of eight data streams. Each MS has three RF chains, meaning that each MS can receive a maximum of three data streams.
The comparisons of simulation results for PSNR performance for the proposed schemes and baseline schemes [14,20,22] are shown in Figure 2. We have utilized three baseline physical schemes with the same digital beamformer (UPCBD) as that in [22], and three analog beamforming algorithms, namely TUMD [22], Average Channel Matrix Decomposition (ACMD) [20], and Iterative Generalized Low Rank Approximation of Matrices (IGLRAM) [14]. For each baseline PHY layer scheme, we have utilized corresponding proposed cross-layer schemes by substituting the analog beamformer TUMD (Algorithm 1) with ACMD [20] or IGLRAM [14] in Algorithm 3.
The results in Figure 2 represent the outcomes of 10,000 experimental trials. For scenarios with user counts of 4, 5, and 6, the PSNR improvements were as follows: in the TUMD algorithm, an increase of 1.1 to 0.85 dB; in the ACMD algorithm, an enhancement of 0.65 to 0.41 dB; and in the IGLRAM algorithm, a gain of 1.14 to 0.96 dB. It demonstrates the applicability of our proposed cross-layer data stream allocation algorithm to other analog beamforming algorithms as well, not just TUMD in [22].
From the results in Figure 2, it can be observed that as the number of users increases, the improvement in average PSNR becomes less significant. It can be explained as follows: In a scenario with four users and the condition that each user has at least one data stream, there is a higher probability of a situation where a user with one data stream competes with users having three data streams to acquire additional data streams. In a scenario with six users, the probability of a user with one data stream competing against users with three data streams to acquire additional data streams is relatively low. Therefore, the improvement in PSNR for four users is higher compared to that for five users and five users.
The parameters from Table 3 are substituted into the complexity Big O notation in Table 2, and the results are presented in Table 4. In the complexity Big O notation of our proposed cross-layer data stream allocation, the time complexity of analog beamforming is the same as the baseline approach [22], while the time complexity of digital beamforming is 1 + L times that of the baseline approach [22]. L represents the average number of times the digital beamforming is re-executed in the simulation of the cross-layer data stream allocation algorithm. The hybrid beamforming architecture and parameters of the MU-MIMO-OFDM system were set according to the configuration described in Table 3 of the referenced paper [22]. According to our simulation results, L is approximately 5.8 times for four users, 5.2 times for five users, and 3.6 times for six users. Based on the simulation results, it can be observed that L decreases as the number of users increases. The reason is that the number of L is determined by the set C created in line 10 of Algorithm 3. The set C represents the collection of users with more than one data stream. We require each user to have a minimum of one data stream and a maximum of three data streams. Therefore, in the case of four users, set C can only have three or two users; and in the case of six users, set C can only have two or one users. Hence, as the number of users increases, L decreases.
Table 5 presents the results of the simulated elapsed times for the proposed cross-layer data stream allocation algorithm compared to the baseline [22] physical-layer data stream allocation algorithm. From the results in Table 4 and Table 5, it can be observed that the outcomes are roughly consistent, with cross-layer elapsed time being approximately twice that of the PHY layer elapsed. If the number of users increases to six, the elapsed time ratio decrease to approximately 1.6.

7. Conclusions

We propose a novel and modular cross-layer data stream allocation framework for uplink mmWave MU-Massive MIMO OFDM hybrid beamforming video communication systems. Unlike conventional approaches that focus solely on physical-layer CSI, our method integrates cross-layer optimization [32,33] by dynamically reallocating data streams from users with more than one stream to the user with the lowest PSNR. This iterative process, driven by both CSI and application-layer PSNR, aims to maximize the average video quality across all users. To ensure flexibility, the first four steps of our revised Algorithm 3 are formulated as a functional block responsible for analog/digital beamforming and initial stream allocation. This block is independent of any specific physical-layer method and can be replaced with various hybrid beamforming algorithms such as IGLRAM [14], ACMD [20], and TUMP [22]. Our simulation results demonstrate that the proposed framework yields significant PSNR improvements across diverse user scenarios (as shown in Table 4) while maintaining a reasonable computational cost of approximately 1.8–2.3× that of conventional methods (Table 5). Furthermore, the modular nature of our design allows for future integration with other physical-layer schemes, including alternative hybrid beamforming and detection techniques, which could serve as additional baselines for comparison.

8. Future Directions

A promising future direction is the integration of deep learning-based cross-layer dynamic data stream allocation to further reduce computational complexity. Since the proposed scheme is iterative, the number of iterations can become large when the number of RF chains and users increases. A deep learning-based approach could learn optimal allocation strategies from data, reducing the need for extensive real-time computations while maintaining a high PSNR performance. However, to effectively integrate deep learning into our existing cross-layer framework, further investigations are required to address challenges such as model training, real-time adaptability, and deployment in practical systems. Ensuring that deep learning models generalize well across different network conditions and maintaining low-latency inference are key aspects to be explored in future research.

Author Contributions

Methodology, S.-M.T. and Y.-F.C.; Software, Y.-T.C.; Resources, S.-M.T. and C.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Science and Technology Council, Taiwan, under Grant NSTC 112-2221-E-027-079-MY2, and NTUT-BJUT Joint Program, NTUT-BJUT-113-01.

Data Availability Statement

The data are available upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yong, S.K.; Chong, C.-C. An overview of multigigabit wireless through millimeter wave technology: Potentials and technical challenges. EURASIP J. Wirel. Commun. Netw. 2007, 2007, 78907. [Google Scholar] [CrossRef]
  2. Rappaport, T.S.; Sun, S.; Mayzus, R.; Zhao, H.; Azar, Y.; Wang, K.; Wong, G.N.; Shulz, J.K.; Samimi, M.; Gutierrez, F. Millimeter Wave Mobile Communications for 5G Cellular: It Will Work! IEEE Access 2013, 1, 335–349. [Google Scholar] [CrossRef]
  3. El Ayach, O.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R.W. Spatially sparse precoding in millimeter wave MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513. [Google Scholar] [CrossRef]
  4. Kim, S.H.; Jin, H. Hybrid Beamforming Based SWIPT System Satisfying Individual Rate and Energy Constraints. IEEE Access 2024, 12, 5617–5629. [Google Scholar] [CrossRef]
  5. Wu, X.; Liu, D.; Yin, F. Hybrid beamforming for multi-user massive MIMO systems. IEEE Trans. Commun. 2018, 66, 3879–3891. [Google Scholar] [CrossRef]
  6. Cheng, N.-H.; Huang, K.-C.; Chen, Y.-F.; Tseng, S.-M. Maximum likelihood-based adaptive iteration algorithm design for joint CFO and channel estimation in MIMO-OFDM systems. EURASIP J. Adv. Signal Process. 2021, 2021, 6. [Google Scholar] [CrossRef]
  7. Tseng, S.-M.; Chen, G.-Y.; Chan, H.-C. Cross-layer resource management for downlink BF-NOMA-OFDMA video transmission systems and supervised/unsupervised learning based approach. IEEE Trans. Veh. Technol. 2022, 71, 10744–10753. [Google Scholar] [CrossRef]
  8. Ma, M.; Nguyen, N.T.; Juntti, M. Closed-form hybrid beamforming solution for spectral efficiency upper bound maximization in mmWave MIMO-OFDM systems. In Proceedings of the 2021 IEEE 94th Vehicular Technology Conference (VTC2021-Fall), Norman, OK, USA, 27–30 September 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–5. [Google Scholar]
  9. Yuan, H.; An, J.; Yang, N.; Yang, K.; Duong, T.Q. Low complexity hybrid precoding for multiuser millimeter wave systems over frequency selective channels. IEEE Trans. Veh. Technol. 2019, 68, 983–987. [Google Scholar] [CrossRef]
  10. Chen, R.; Shen, Z.; Andrews, J.G.; Heath, R.W. Multimode transmission for multiuser MIMO systems with block diagonalization. IEEE Trans. Signal Process. 2008, 56, 3294–3302. [Google Scholar] [CrossRef]
  11. Li, Z.; Han, S.; Molisch, A.F. Optimizing channel-statistics-based analog beamforming for millimeter-wave multi-user massive MIMO downlink. IEEE Trans. Wirel. Commun. 2017, 16, 4288–4303. [Google Scholar] [CrossRef]
  12. Kwon, G.; Park, H. Limited feedback hybrid beamforming for multimode transmission in wideband millimeter wave channel. IEEE Trans. Wirel. Commun. 2020, 19, 4008–402220. [Google Scholar] [CrossRef]
  13. Wang, S.; He, M.; Ruby, R.; Zhang, Y. SVM-based optimization on the number of data streams for massive MIMO systems. IEEE Syst. J. 2022, 17, 83–86. [Google Scholar] [CrossRef]
  14. Song, N.; Yang, T.; Sun, H. Overlapped subarray based hybrid beamforming for millimeter wave multiuser massive MIMO. IEEE Signal Process. Lett. 2017, 24, 550–554. [Google Scholar] [CrossRef]
  15. Wang, D.; Toni, L.; Cosman, P.C.; Milstein, L.B. Uplink resource management for multiuser OFDM video transmission systems: Analysis and algorithm design. IEEE Trans. Commun. 2013, 61, 2060–2073. [Google Scholar] [CrossRef]
  16. Li, F.; Wang, T.; Cosman, P.C. Joint rate adaptation and resource management for real-time H.265/HEVC video transmission over uplink OFDMA systems. Multimed. Tools Appl. 2019, 78, 26807–26831. [Google Scholar] [CrossRef]
  17. Tseng, S.-M.; Chen, W.-Y. Cross-layer codebook allocation for uplink SCMA and PDNOMA-SCMA video transmission systems and a deep learning-based approach. IEEE Syst. J. 2022, 17, 294–305. [Google Scholar] [CrossRef]
  18. Kader, M.E.E.D.A.E.; Youssif, A.A.A.; Ghalwash, A.Z. Energy aware and adaptive cross-layer scheme for video transmission over wireless sensor networks. IEEE Sensors J. 2016, 16, 7792–7802. [Google Scholar] [CrossRef]
  19. Zhang, X.; Wei, X.; Zhou, L.; Qian, Y. Social-content-aware scalable video streaming in internet of video things. IEEE Internet Things J. 2022, 9, 830–843. [Google Scholar] [CrossRef]
  20. Zhang, D.; Wang, Y.; Li, X.; Xiang, W. Hybrid beamforming for downlink multiuser millimetre wave MIMO-OFDM systems. IET Commun. 2019, 13, 1557–1564. [Google Scholar] [CrossRef]
  21. Chen, B.-Y.; Chen, Y.-F.; Tseng, S.-M. Hybrid Beamforming and Data Stream Allocation Algorithms for Power Minimization in Multi-User Massive MIMO-OFDM Systems. IEEE Access 2022, 10, 101898–101912. [Google Scholar] [CrossRef]
  22. Tseng, H.-H.; Chen, Y.-F.; Tseng, S.-M. Hybrid Beamforming and Resource Allocation Designs for mmWave Multi-User Massive MIMO-OFDM Systems on Uplink. IEEE Access 2023, 11, 133070–133085. [Google Scholar] [CrossRef]
  23. Cisco Visual Networking Index: Forecast and Trends, 2017–2022. Available online: https://get.drivenets.com/hubfs/1211_BUSINESS_SERVICES_CKN_PDF.pdf (accessed on 9 April 2025).
  24. Li, F.; Shuang, F.; Liu, Z.; Qian, X. A cost-constrained video quality satisfaction study on mobile devices. IEEE Trans. Multimedia 2018, 20, 1154–1168. [Google Scholar] [CrossRef]
  25. Lin, W.-Y.; Chang, T.-H.; Tseng, S.-M. Deep learning based cross-layer power allocation for downlink cell-free massive multiple-input–multiple-output video communication systems. Symmetry 2023, 15, 1968. [Google Scholar] [CrossRef]
  26. Tseng, S.-M.; Wu, J.-J.; Fang, C. Machine Learning Assisted Cross-Layer Joint Optimal Subcarrier and Power Allocation for Device-to-Device Video Transmission. IEEE Access 2024, 12, 93568–93579. [Google Scholar] [CrossRef]
  27. Tseng, S.-M.; Chen, R.-Y.; Chen, Y.-F.; Fang, C. Joint Cross Layer Radio Downlink Beamforming and RIS Configuration via Deep Reinforcement Learning in RIS-Aided MISO Video Communications. IEEE Trans. Cogn. Commun. Netw. 2025. early access. [Google Scholar] [CrossRef]
  28. Hemadeh, I.A.; Satyanarayana, K.; El-Hajjar, M.; Hanzo, L. Millimeter-wave communications: Physical channel models, design considerations, antenna constructions, and link-budget. IEEE Commun. Surveys Tuts. 2018, 20, 870–913. [Google Scholar] [CrossRef]
  29. Yu, X.; Shen, J.-C.; Zhang, J.; Letaief, K.B. Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems. IEEE J. Sel. Top. Signal Process. 2016, 10, 485–500. [Google Scholar] [CrossRef]
  30. Sohrabi, F.; Yu, W. Hybrid analog and digital beamforming for mmWave OFDM large-scale antenna arrays. IEEE J. Sel. Areas Commun. 2017, 35, 1432–1443. [Google Scholar] [CrossRef]
  31. Pi, Z.; Khan, F. An introduction to millimeter-wave mobile broadband systems. IEEE Commun. Mag. 2011, 49, 101–107. [Google Scholar] [CrossRef]
  32. Stuhlmuller, K.; Farber, N.; Link, M.; Girod, B. Analysis of video transmission over lossy channels. IEEE J. Sel. Areas Commun. 2000, 18, 1012–1032. [Google Scholar] [CrossRef]
  33. Tseng, S.-M.; Wen, S.-T.; Fang, C.; Norouzi, M. Cross Layer Power Allocation by Graph Neural Networks in Heterogeneous D2D Video Communications. IEEE Access 2025, 13, 44484–44496. [Google Scholar] [CrossRef]
Figure 1. Uplink mm-Wave MU massive MIMO-OFDM system with hybrid beamforming structure.
Figure 1. Uplink mm-Wave MU massive MIMO-OFDM system with hybrid beamforming structure.
Sensors 25 02554 g001
Figure 2. PSNR performance comparisons of the proposed schemes and baseline schemes [14,20,22].
Figure 2. PSNR performance comparisons of the proposed schemes and baseline schemes [14,20,22].
Sensors 25 02554 g002aSensors 25 02554 g002b
Table 1. Hybrid beamforming symbol description.
Table 1. Hybrid beamforming symbol description.
SymbolQuantity
U number of users.
K number of subcarriers in OFDM.
N B S number of antennas in a BS.
N R F B S number of RF chains in a BS.
N s number of data streams that the BS can receive.
N M S number of antennas configured on the MS.
N R F M S number of RF chains configured on MS.
n s , u s-th data stream transmitted by the u-th MS.
s u k transmitted data symbol vector of the u-th MS at the k-th subcarrier.
P u k transmit power of the u-th MS at the k-th subcarrier.
F B B , u k digital precoder of the u-th MS at the k-th subcarrier.
F R F , u analog precoder at user u.
W R F analog combiner.
W B B , u k the digital combiner of the u-th MS at the k-th subcarrier.
y ^ k final processed signal at the k-th subcarrier.
Table 2. Complexity comparison.
Table 2. Complexity comparison.
PHY Layer O U 2 K N B S N M S N R F M S + m i n N B S 2 K N R F M S , N B S K 2 N R F M S 2 + O U K N B S N R F M S N R F B S + N M S
Cross-Layer O U 2 K N B S N M S N R F M S + m i n N B S 2 K N R F M S , N B S K 2 N R F M S 2 + O   ( U K N B S N R F M S N R F B S + N M S ) × ( 1 + L )
Table 3. Hybrid beamforming setup parameters.
Table 3. Hybrid beamforming setup parameters.
SymbolQuantity
U4, 5, 6
K16
N B S 128
N R F B S 8
N s 8
N M S 16
N R F , u 1, 2, 3
n s , u 1, 2, 3
BW50 MHz
Table 4. The actual computational complexity of the proposed schemes and baseline schemes.
Table 4. The actual computational complexity of the proposed schemes and baseline schemes.
UserLCross-Layer Data Stream Allocation
Computational Complexity Big O
(A)
[22] Physical Layer
Computational Complexity Big O
(B)
Times
(A)/(B)
45.85,976,883.22,555,9042.3
55.27,028,7363,194,8802.2
63.67,018,905.63,833,8561.8
Table 5. Elapsed times.
Table 5. Elapsed times.
UserCross-Layer Data Stream Allocation Elapsed Time(s) (C)[22] Physical Layer
Elapsed Time(s) (D)
Elapse Times Ratio (C)/(D)
4 1439.7 s 689.9 s 2.1
5 1690.4 s 847.4 s 2
6 1644.8 s 1006.9 s 1.6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, Y.-T.; Tseng, S.-M.; Chen, Y.-F.; Fang, C. Cross-Layer Stream Allocation of mMIMO-OFDM Hybrid Beamforming Video Communications. Sensors 2025, 25, 2554. https://doi.org/10.3390/s25082554

AMA Style

Chen Y-T, Tseng S-M, Chen Y-F, Fang C. Cross-Layer Stream Allocation of mMIMO-OFDM Hybrid Beamforming Video Communications. Sensors. 2025; 25(8):2554. https://doi.org/10.3390/s25082554

Chicago/Turabian Style

Chen, You-Ting, Shu-Ming Tseng, Yung-Fang Chen, and Chao Fang. 2025. "Cross-Layer Stream Allocation of mMIMO-OFDM Hybrid Beamforming Video Communications" Sensors 25, no. 8: 2554. https://doi.org/10.3390/s25082554

APA Style

Chen, Y.-T., Tseng, S.-M., Chen, Y.-F., & Fang, C. (2025). Cross-Layer Stream Allocation of mMIMO-OFDM Hybrid Beamforming Video Communications. Sensors, 25(8), 2554. https://doi.org/10.3390/s25082554

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop