1. Introduction
The advancement of 6G technology has brought Integrated Sensing and Communication (ISAC) to the forefront of Vehicle-to-Everything (V2X) research [
1,
2,
3]. ISAC-enabled V2X systems facilitate comprehensive data exchange between vehicles and their environment, including infrastructure, pedestrians, and networks [
4,
5,
6]. To optimize urban road management efficiency, Simultaneous Localization and Mapping (SLAM) technology is increasingly being integrated with ISAC [
7]. While traditional SLAM implementations primarily rely on vision and LiDAR sensors, recent breakthroughs have shown that wireless channel propagation characteristics can be used to construct highly accurate 2D environmental maps [
8,
9]. This synergy establishes a novel co-design paradigm for ISAC-SLAM integration in V2X. Recent research demonstrate that an ISAC base station (BS) can effectively function as a roadside unit (RSU) [
2,
10,
11]. These advanced ISAC-RSUs enhance environmental sensing and establish real-time SLAM capabilities through low-latency V2X links, compensating for onboard sensor limitations. Smart city implementations have validated this approach, with [
12] demonstrating that MIMO radar-equipped RSUs can utilize echo signals for channel estimation and adaptive beamforming. This enables ISAC-RSU as the core infrastructure for future 6G vehicular networks. In dense urban environments, buildings and other obstacles can significantly impede signal propagation. Relay communication effectively extends coverage and ensures continuous connectivity between vehicles and infrastructure by mitigating the impact of physical obstructions. In high-density traffic scenarios, relay nodes facilitate multi-path data transmission, enhancing communication reliability and stability through redundancy and improved link diversity [
2]. By distributing the communication load across multiple nodes, relay systems increase overall network capacity, which is crucial for supporting the concurrent communication demands of numerous connected vehicles and devices [
13]. Ref. [
10] has shown that RSUs, as critical components of next-generation urban communication infrastructures, possess significant capabilities in directional relay-assisted transmission. Specifically, these ISAC-RSUs can simultaneously sense urban road conditions and function as relay base stations, transmitting directional communication signals to user equipment (UE) [
14]. Furthermore, Ref. [
15] introduced a hybrid reconfigurable intelligent surface-assisted downlink ISAC system that offers a new reference approach for integrating RSUs and reconfigurable intelligent surfaces to enhance UE communication rates. Refs. [
16,
17] propose deep learning (DL)-based predictive beamforming for ISAC in vehicular networks and demonstrate DL’s potential for next-generation vehicular ISAC systems.
However, the simultaneous delivery of high-quality communication and precise detection becomes particularly challenging in complex multi-user, multi-target scenarios. Balancing communication-centric and sensing-centric objectives remains a critical issue in ISAC systems due to competing demands for spectral resources, beamforming design, and signal processing strategies [
18]. Furthermore, RSUs face substantial signal interference that adversely affects performance in both communication and sensing domain. In communication, downlink signals from adjacent RSUs generate multiple access interference (MAI) with UEs, while spatiotemporal resource reuse further aggravates multi-user interference (MUI) [
19]. In sensing, the overlap of RSU echo signals in the time–frequency domain can degrade the signal-to-noise ratio (SNR). These dual-domain interference problems necessitate advanced beamforming solutions for ISAC-RSU systems. Additionally, urban multi-path effects and building occlusions constrain their detection capabilities, as sensing beams reflect off buildings, creating clutter that interferes with the ISAC-RSU receiver. These challenges require the development of enhanced beamforming designs for ISAC-RSU systems [
20,
21,
22,
23].
Contemporary studies demonstrate that beamforming design plays a crucial role in establishing the Pareto frontier, which quantifies the fundamental trade-off between sensing and communication performance in ISAC systems. ISAC beamforming aims to achieve optimal performance trade-offs while suppressing mutual interference between sensing and communication signals [
24]. In the context of ISAC beamforming methods, Refs. [
16,
17] address dynamic channel variations caused by high mobility, leveraging neural networks to forecast beamforming vectors and optimize ISAC performance. However, the internal weights of its network are difficult to map to physical-layer parameters (such as beam pointing angle and null notch depth), limiting their effectiveness in fault diagnosis. In contrast, conventional methods offer closed-form solutions derived from analytical algorithms based on channel state information, thereby facilitating optimization and parameter adjustment. Ref. [
25] investigated a weighted minimum mean square error (WMMSE) ISAC solution based on mutual information (MI) to transform a non-convex problem into a convex one, utilizing MI as a metric to assess both sensing and communication performance. Furthermore, cell-free ISAC MIMO systems represent a significant advancement in the simultaneous provision of communication services to users and the detection of targets [
26,
27,
28,
29]. Specifically, Ref. [
26] proposed a joint beamforming design for cell-free ISAC MIMO systems, where distributed MIMO access points collaboratively facilitate communication for user equipment (UEs) while simultaneously sensing targets. This study employs a max–min fairness formulation and derives the optimal structure of beamforming vectors to suppress both MAI and MUI, thereby improving the communication SINR [
27]. Additionally, in low-altitude urban environments, objects exhibiting a significant radar cross-section (RCS), including structures and metallic advertisements, can generate scattered clutter patches that interfere with the receiver’s performance. In order to address the issue of environmental clutter, Ref. [
29] proposed a constant-modulus waveform design method for ISAC systems by incorporating cyclic optimization, Dinkelbach’s transform, and the ADMM algorithm. This method effectively addresses the non-convex optimization problem while optimizing both radar detection performance and communication service quality, but it suffers from high computational complexity due to excessive matrix operations and exhibits relatively slow convergence rates.
In response to these issues, we propose a dual-base-station ISAC system with a transmitting RSU and a receiving RSU, suitable for road surveillance and directional relay communication in urban areas. The BS transmits downlink signals to the ISAC-RSU, providing high-quality services. We propose a hierarchical optimization solution for ISAC beamforming, specifically designed for ISAC-RSU systems operating in complex environments. The main contributions of this paper are as follows:
We construct a hierarchical and collaborative ISAC beamforming optimization framework, and it has the potential to overcome the limitations imposed by the direct coupling of sensing and communication objectives in existing single-stage optimization approaches. The leader layer prioritizes ensuring communication performance for multiple UEs by adopting a max–min fairness strategy to optimize the SINR. The communication beamforming vector is derived using the bisection method in conjunction with second-order cone programming, thereby guaranteeing that the communication links satisfy both fairness and reliability requirements across UEs. Under the constraint of fixed communication beamforming parameters from the follower layer, the problem of maximizing the sensing SCNR is formulated as a semi-definite programming (SDP) problem, enabling collaborative design of the sensing beam through convex optimization techniques. Therefore, it achieves a coherent synergy by prioritizing communication performance to precisely enhance sensing performance, in contrast to the performance trade-offs inherent in traditional approaches. Finally, we utilize the sensing and communication (S&C) MI rate as a metric to evaluate whether the beamforming solution can effectively balance sensing and communication performance.
Notations: Bold letters denote vectors and matrices. Italicized letters denote variables. and denote the transpose operation and the Hermitian operation, respectively. denotes a complex matrix. Re[a] denotes the real part of a scalar a. denotes the trace of a matrix A. denotes the expectation operation. and denote the l2-norm of a and the Frobenius norms of A. . represents a circularly symmetric complex Gaussian random variable with zero mean and variance .
4. Simulation Results
In this section, to verify the effectiveness of ISAC-OCS, we evaluate the sensing and communication performance associated with the previously discussed ISAC beamforming solutions. We consider a scenario in which the transmit RSU simultaneously delivers
data streams to UEs and transmits
sensing signal to the target, while the receive RSU captures
clutter patches. Both the transmit RSU and BS are each equipped with
transmit antennas, and the receive RSU is additionally equipped with
receive antennas. The parameters for the communication model include a specified communication noise variance
. For the sensing model, we consider parameters such as sensing noise variance
and sensing channel gain variance
. Additionally, we set the transmit power of the ISAC transmitter at
P = 0 dBW, along with a defined Lagrangian parameter
and noise variance
. In our analysis, we randomize the positions of the transmit RSU, receive RSU, UEs, and target, resulting in randomized azimuth angles for these entities. Under this setup, we conduct a comparative evaluation of the MMO beamforming method [
26] and the WMMSE-ISAC method [
25] in relation to the ISAC-OCS. Notably, ISAC-OCS demonstrates a significant advancement in methodological design compared to MMO and WMMSE-ISAC. MMO focuses solely on communication beamforming design and lacks an integrated perceptual SCNR optimization component. Although WMMSE-ISAC combines perceptual mutual information and communication mutual information through weighted summation into a unified objective for joint optimization, this approach is susceptible to substantial performance degradation in one of the functionalities due to inappropriate weight selection. All simulation experiments presented in this section were conducted in a standardized computational environment to ensure the reliability and reproducibility of the results. For the software framework, MATLAB R2024a was employed as the core platform. In terms of hardware configuration, the experiments were run on a workstation equipped with an Intel® Core™ i9-14900HX processor.
To evaluate the angular resolution of various beamforming techniques, we conduct a comparative analysis of the beam patterns generated by three distinct methods, with the communication power ratio set to 0.5. As illustrated in
Figure 2, the proposed multi-beam architecture simultaneously incorporates both communication-directed and sensing-oriented beams, necessitating joint evaluation using two critical metrics, (i) the absolute peak intensity of beam lobes, and (ii) inter-peak isolation, characterized by null depth in concave regions.
The MMO method adopts a communication-centric beamforming strategy, resulting in a beam pattern with lower peaks in the sensing beams (approximately 2.3 dB lower than ISAC-OCS) and higher peaks in the communication beams (up to 6.2 dB higher than ISAC-OCS) compared to the other approaches. However, its shallow inter-beam null depths fundamentally limit angular discrimination capability due to insufficient suppression of sidelobe interference. In contrast, the ISAC-OCS solution enhances the peak gains of sensing beams (peak gain > 0.9 dB relative to WMMSE-ISAC) at the expense of communication beam attenuation (peak reduction ≈ 1.8 dB relative to WMMSE-ISAC). Notably, it achieves significantly deeper null depths (≥12 dB) between adjacent beams compared to both MMO and WMMSE-ISAC, owing to stronger interference suppression. By analyzing the communication beam peak characteristics of this solution, we conclude—as previously discussed—that it does not follow a conventional communication-centric beamforming strategy. Instead, it enables joint optimization of communication and sensing performance. In summary, although the ISAC-OCS yields relatively lower communication peaks, it exhibits more pronounced concave drops between adjacent lobes, thereby enhancing beamforming angular resolution.
After clutter suppression is applied to the received echo signals, target angle estimation is performed.
Figure 3 shows the normalized spatial spectrum obtained via the MUSIC algorithm for three optimized beamforming methods. As a high-resolution subspace method, the MUSIC algorithm theoretically achieves superior resolution compared to conventional beamforming techniques, exceeding the limitations imposed by the Rayleigh criterion. Given that the experiment focuses on angle estimation for a single target, the utilization of four receiving antenna sensors suffices. The findings demonstrate that three optimized methods effectively detect the target angle; however, the beam spectrum peak generated by the ISAC-OCS design is notably the sharpest, signifying enhanced spatial resolution and more effective sidelobe suppression. The collateral valve inhibition achieved by ISAC-OCS exceeds that of MMO by more than 1.6 dB and outperforms WMMSE-ISAC by over 1.2 dB. Furthermore, the angle estimation accuracy associated with ISAC-OCS surpasses that of the other two methods, thereby corroborating its superior sensing performance.
The Root Mean Square Error (RMSE) obtained from DOA estimation serves as a crucial metric for evaluating the perceptual performance of beamforming design. The experimental framework employs rigorous Monte Carlo validation to ensure a comprehensive performance assessment.
Figure 4 illustrates a comparison of the RMSE in angle estimation across the three optimized beamforming methods’ varying noise power levels. As noise power diminishes to lower values, the RMSE values for all three methods converge, indicating minimal differences in estimation performance under low-noise conditions. Nevertheless, throughout the entire noise power spectrum, the RMSE corresponding to ISAC-OCS remains consistently lower than other beamforming methods, evidencing its ability to sustain high estimation accuracy even in complex noise environments. This result further substantiates the reliability of the ISAC-OCS beamforming design approach with respect to sensing performance.
Beyond angular resolution, the joint performance of communication and sensing is evaluated via SMI and CMI. With the sensing power ratio ranging from 0.01 to 0.99, the SMI rate at SNR = 0 dB and SNR = 5 dB is illustrated in
Figure 5. This investigation evaluates the characteristics of SMI across three distinct methodologies under varying conditions of sensing power allocation and signal quality.
The systematic comparison demonstrates that the ISAC-OCS technique consistently outperforms both WMMSE-ISAC and MMO methods in terms of sensing performance, with this advantage remaining evident across various channel quality conditions. All methods examined exhibit fundamentally similar behavioral patterns, where enhanced signal conditions yield proportional improvements in SMI rates at fixed sensing power configurations. When the sensing power is 0.2, the SMI rate of ISAC-OCS is about 0.65 dB higher than that of MMO and approximately 0.1 dB higher than that of WMMSE-ISAC. The overall findings indicate that although all approaches experience improvements in enhanced signal environments, the ISAC-OCS method emerges as the most robust solution for improving sensing performance.
For communication performance,
Figure 6 considers two scenarios in which the transmitting SNR is established at 0 dB and 5 dB, respectively. The figure illustrates the fluctuation of the CMI rate in relation to the communication power ratio, utilizing three distinct beamforming optimization solutions across different SNR levels. For each curve, 200 Monte Carlo experiments are performed. In
Figure 6, all methodologies exhibit fundamentally similar behavioral patterns, characterized by an enhancement in CMI rates as the allocation of communication power increases across the entire operational range. Notably, the MMO and ISAC-OCS techniques, which share identical theoretical frameworks for deriving beamforming vectors, demonstrate almost indistinguishable communication performance results (the CMI rate of ISAC-OCS is marginally higher than that of MMO, with a difference of approximately 0.08 dB). Both methodologies consistently outperform the WMMSE-ISAC approach across all assessed SNR conditions, thereby confirming their superior efficacy in maintaining communication quality.
Figure 7 examines the CMI rate of the three optimized beamforming methods as a function of user distance under varying transmission SNRs. The experimental results reveal that, irrespective of transmission SNR fluctuations, the CMI rates for ISAC-OCS and MMO outperform those of WMMSE-ISAC. Although ISAC-OCS and MMO exhibit comparable overall trends, a detailed analysis indicates that ISAC-OCS achieves marginally higher rates (the CMI rate of ISAC-OCS is marginally higher than that of MMO, with a difference of approximately 0.09 dB), highlighting its advantage in communication performance.
Similarly,
Figure 8 evaluates the CMI rate as the number of users varies under different transmission SNR conditions, yielding results consistent with those in
Figure 6 and
Figure 7: the CMI rate for ISAC-OCS consistently exceeds that of MMO and is substantially superior to WMMSE-ISAC (the CMI rate of ISAC-OCS is marginally higher than that of MMO, with a difference of approximately 0.07 dB). These findings collectively affirm the comprehensive performance benefits of the ISAC-OCS beamforming strategy within integrated communication and sensing systems. The experimental results substantiate that the proposed methodology achieves communication performance comparable to that of MMO while substantially outperforming WMMSE-ISAC. This finding highlights the necessity for further investigation into the comparative sensing performance characteristics of these competing schemes, as the communication-sensing trade-off is a critical design consideration in integrated systems.
Furthermore, it is observed that the CMI rate decreases as the number of UEs increases. This is because more UEs means more communication interference. On the one hand, more MUI degrades the performance of communication. On the other hand, the more communication to sensing interference leads to the decrease in SMI rate. In
Figure 9, we compare the ISAC-OCS hierarchical optimization method with the deep learning-based method [
31], observing the differences in the CMI rate and SMI rate between the two methods.
As the number of UEs increases, the rate of all four schemes exhibits a decreasing trend. The SMI (DL-based ISAC) consistently achieves a higher rate than SMI (ISAC-OCS) across all UE numbers. As the number of UEs increases from 2 to 6, the CMI (ISAC-OCS) achieves a higher rate than CMI (DL-based ISAC). The advantage of DL-based ISAC may stem from its advanced deep learning-driven resource allocation and information processing mechanisms, which enable better adaptability to the growing number of UEs. However, the DL-based ISAC method is highly reliant on high-quality training data. Performance degradation occurs significantly when there is a discrepancy between the actual operational scenario and the training dataset. In contrast, ISAC-OCS does not require an offline training phase. Although its SMI rate is lower than that of DL-based approaches, it operates without dependence on specialized computing hardware such as GPUs. Moreover, unlike DL-based ISAC methods, which incur substantial computational costs during model training, ISAC-OCS achieves efficient resource utilization. Consequently, ISAC-OCS demonstrates superior adaptability in complex and dynamic environments, particularly with respect to computational efficiency and practical deployment.
While ISAC-OCS demonstrates communication performance comparable to MMO, it has been specifically optimized for sensing capabilities, leading to enhanced sensing performance relative to the MMO method and superior overall performance compared to WMMSE-ISAC, as supported by
Figure 3,
Figure 4 and
Figure 5. To further explore the joint communication and sensing performance across different methods, we analyze the S&C rate in relation to power allocation and SNR.
In
Figure 10, we investigate S&C rate variations as functions of communication power ratios for different schemes under transmit SNR values of 0 dB and 5 dB, respectively. Each performance curve is obtained by averaging over 200 independent Monte Carlo trials to ensure statistical robustness. As anticipated, the S&C rate curves corresponding to higher SNRs consistently surpass those obtained at lower SNRs, indicating the significant influence of signal quality on overall system performance. Among the evaluated methods, ISAC-OCS exhibits marginally higher S&C rates than both WMMSE-ISAC and MMO across the entire range of communication power ratios (when the communication power is approximately 0.5, the S&C MI rate of ISAC-OCS is about 0.41 dB higher than that of MMO and approximately 0.29 dB higher than that of WMMSE-ISAC).
Furthermore, it is observed that the S&C rates achieved at intermediate power allocation settings ( and ) are generally lower than those obtained under extreme optimization biases ( or ). This indicates that the overall performance at these intermediate points fails to match the specialized effectiveness achieved in purely sensing-centric or communication-centric configurations. However, allocating power according to balanced ratios ( and ) is beneficial in scenarios that require the simultaneous fulfillment of both sensing and communication needs, providing a trade-off solution under practical system constraints. In contrast, the adoption of extreme power allocation ( or ) is only appropriate for scenarios dominated by either communication or sensing demands and may result in significant performance degradation if applied indiscriminately in integrated operational environments.
In
Figure 11, the communication power ratios are set at 0.5 and 0.25, respectively. The figure illustrates the variation in the S&C MI rate curves as functions of SNR for the different examined methods. Evidently, the S&C MI rates achieved by the ISAC-OCS scheme consistently surpass those attained by the baseline approaches across the entire SNR range (when the noise power is approximately 1, the S&C MI rate of ISAC-OCS is about 0.52 dB higher than that of MMO and approximately 0.28 dB higher than that of WMMSE-ISAC). This enhanced performance highlights the efficacy of ISAC-OCS in simultaneously optimizing sensing and communication objectives.
At the end of the preceding section, we analyze the asymptotic complexity of the three methods. The ISAC-OCS method is distinguished by its combination of the bisection method and convex optimization, which results in a notably higher complexity order compared to MMO.
Figure 12 illustrates the runtime curves of the three methods as a function of the number of antennas.
As illustrated in
Figure 12, MMO exhibits the shortest runtime across various specification arrays, while ISAC-OCS exhibits a longer runtime compared to MMO. Notably, WMMSE-ISAC incurs a greater computational time than ISAC-OCS as the number of array elements increases (
and
). Therefore, MMO and WMMSE-ISAC exhibit the lowest and highest levels of complexity compared to the others, aligning with the previous analysis.
In summary, the CMI rate, SMI rate, and S&C MI rate provided by ISAC-OCS are among the fastest compared to the other methods. Although the computational complexity of ISAC-OCS is slightly higher compared to the MMO, it achieves a superior balance between communication and sensing performance while maintaining a high S&C MI rate. Consequently, the moderate increase in computational complexity is justifiable for practical implementation in real-world ISAC-RSU systems.