1. Introduction
Sixth-generation mobile networks are expected to support many emerging applications, including Internet of Things (IoT) networks, vehicle-to-everything (V2X) services, and unmanned aerial vehicle (UAV) networks. These applications require high-quality wireless connectivity and reliable sensing capability. To meet the growing demand for communication and sensing, multiple-input multiple-output joint communication and sensing (MIMO-JCAS) systems have attracted increasing interest. In MIMO-JCAS systems, the wireless infrastructure is expected to deliver high-rate data streams and provide transmit-side spatial sensing capability, which is commonly characterized by the quality of the generated sensing beam pattern or the accuracy of the approximated transmit covariance [
1,
2].
However, because the hardware cost and power consumption of fully digital precoding become prohibitive in large-scale antenna arrays, hybrid beamforming (HBF) has been widely studied in MIMO-JCAS systems [
3,
4]. In HBF architectures, the overall precoder is decomposed into a low-dimensional digital precoder and an analog precoder implemented by phase shifters, thereby achieving an attractive tradeoff between beamforming gain and hardware efficiency. Nevertheless, HBF design for MIMO-JCAS systems remains challenging. A direct difficulty comes from the element-wise constant-modulus constraint imposed by the analog phase-shifter network, which leads to a nonconvex feasible set. It should be noted that such constant-modulus or unit-modulus constraints are not unique to analog precoding, and similar constraints also arise in other wireless scenarios, such as reconfigurable intelligent surface (RIS)-empowered networks, where the reflecting phase shifts are typically subject to unit-modulus constraints. The constraints can be handled by various optimization-based and AI-based methods [
5]. Therefore, the main difficulty in the considered MIMO-JCAS HBF problem does not stem from the constant-modulus constraint alone, but from the strong coupling between the digital and analog precoders, and the additional need to coordinate communication-rate maximization and sensing-performance optimization within the same hybrid beamforming design [
6].
To address the hybrid beamforming design problem in MIMO-JCAS systems, various model-based optimization approaches have been developed. Among them, alternating optimization (AO)-based methods are widely adopted because the original nonconvex problem can be decomposed into tractable subproblems. For example, in dual-function radar-communication (DFRC) hybrid beamforming design, the optimization problem is decomposed into three subproblems and is solved iteratively by AO, so that a favorable tradeoff between communication performance and radar beampattern matching is achieved [
7]. More realistic DFRC hybrid architectures are considered in [
8], where fully connected and partially connected structures are studied. For these two architectures, MADMM and RPM-TR are developed, respectively, and good communication rates as well as desirable radar beampattern performance are achieved. The corresponding design is further extended to wideband orthogonal frequency division multiplexing (OFDM)-based DFRC systems in [
9], where the transmit and receive hybrid beamformers are jointly optimized in the presence of frequency selectivity. Although AO-based approaches are capable of producing feasible solutions, they are often sensitive to hyperparameters and are likely to converge to poor local optima, which limits their performance in complex scenarios [
10,
11,
12].
To reduce online computational cost and latency, data-driven deep learning (DL) methods are also introduced into JCAS and HBF design. In JCAS systems, autoencoder-based end-to-end frameworks are studied, where waveforms or transceiver mappings for communication and sensing tasks are directly learned [
13,
14]. In HBF design, data-driven neural architectures are also used to learn mappings from channel-related observations to hybrid precoders, thereby improving real-time capability and reducing explicit optimization effort during inference [
15,
16]. These studies show that complex nonlinear relationships in wireless systems can be effectively learned by DL. However, purely data-driven approaches usually require large training datasets, and their model interpretability is generally weaker than that of model-based optimization methods. These limitations become more significant when physically meaningful constraints and iterative decision logic need to be preserved.
Deep unfolding is an effective framework for alleviating the limitations of AO-based optimization and purely data-driven DL methods. By transforming an iterative optimization procedure into a neural network with a finite number of layers, deep unfolding preserves algorithmic interpretability while enabling key hyperparameters to be learned from data [
17,
18,
19,
20]. In hybrid beamforming design, alternating optimization can be unfolded into a trainable architecture, where the analog precoder is updated through a lightweight unfolded network and the digital precoder is obtained by a closed-form step, so that computational complexity is reduced while good beamforming performance is maintained [
21]. Beyond hybrid beamforming, projected-gradient-type deep unfolding has also been investigated in other wireless optimization tasks. A mixture-of-experts-augmented deep unfolding framework is developed for activity detection in IRS-aided massive access systems, where the unfolded projected-gradient updates are used to improve detection robustness under mixed channel fading conditions [
22]. More closely related to MIMO-JCAS systems, a modified projected gradient ascent (PGA) procedure can be unfolded, where repeated analog updates and a weighted sensing-gradient term are introduced, so that convergence is improved and favorable communication-sensing performance is achieved [
23]. These studies indicate that deep unfolding provides a practical balance among interpretability, convergence efficiency, and online complexity for wireless optimization problems, while its use in MIMO-JCAS HBF design still requires careful treatment of analog-digital precoder coupling, constant-modulus constraints, and communication-sensing tradeoffs.
Despite these advances, an important limitation remains. In most existing deep-unfolded designs, the hyperparameters learned during training remain fixed during inference. As a result, the update strategy cannot adapt to the current optimization state, even though the relative importance of communication-rate improvement and sensing-error reduction may vary substantially across iterations. In addition, the two precoder updates are affected by different structural difficulties. For the analog precoder, Euclidean-space updates followed by projection may not align well with the geometry induced by the constant-modulus constraint, which can reduce update efficiency. For the digital precoder, the relative magnitudes of the communication-rate gradient and the sensing-error gradient may vary dynamically during the iterative process, so a fixed balancing factor may not always provide a desirable update direction. These issues motivate the development of a more adaptive unfolded HBF design for MIMO-JCAS systems.
To address the above issues, this paper proposes a state-driven adaptive deep-unfolded PGA algorithm for hybrid beamforming in MIMO-JCAS systems. The main contributions of this paper are summarized as follows:
A state-driven adaptive hyperparameter control mechanism is developed for unfolded HBF in MIMO-JCAS systems to better handle the dynamic communication-sensing tradeoff during iterative hybrid precoder optimization. Specifically, a lightweight deep neural network (DNN) is introduced to generate scaling factors for the analog step size, the digital step size, and the gradient-balancing factor according to the current iteration state. The state vector is constructed from the current joint objective, communication rate, sensing error, gradient norms, hyperparameter statistics, iteration index, and objective variation, so that the controller can capture the evolving optimization status of the coupled analog-digital precoder updates. In this way, the unfolded network is enabled to adapt its update behavior to different optimization stages, which improves the efficiency and robustness of hybrid beamforming optimization compared with using static hyperparameters.
A manifold-based update framework is developed for the analog precoder. Instead of directly performing Euclidean updates followed by projection, the analog-precoder gradient is projected onto the tangent space associated with the complex circle manifold, and the update is then carried out along a direction that is more consistent with the geometry of the unit-modulus constraint set. Specifically, the standard Euclidean gradient is first computed. To prevent the update from violating hardware constraints, this gradient is projected onto the local tangent space of the current precoder state. This crucial step strips away the invalid gradient components that attempt to alter signal amplitudes, isolating only the valid components that modify phases. The analog precoder is then updated along this geometry-aware tangent direction. Finally, an element-wise normalization is applied as a retraction step to precisely pull the intermediate state back onto the feasible circular boundary. As a result, the analog-precoder update becomes better matched to the feasible set, which improves the update efficiency of shallow unfolded models.
A learnable and adaptive gradient-balancing strategy is incorporated into the digital precoder update. To address the dynamic imbalance between the communication-rate gradient and the sensing-error gradient during joint optimization, the fixed balancing factor used in conventional methods is extended to a trainable parameter and is further adjusted by the state-driven controller. This design allows the digital precoder to be updated with a direction that is better suited to the current rate–sensing tradeoff, thereby improving joint optimization effectiveness.
The remainder of this paper is organized as follows.
Section 2 introduces the system and signal model and formulates the hybrid beamforming optimization problem.
Section 3 presents the deep-unfolded PGA algorithm based on Riemannian manifold optimization and discusses the gradient imbalance between the communication and sensing terms in the digital precoder update to introduce the learnable gradient balancing factor. Based on
Section 3, a state-driven adaptive mechanism and the corresponding adaptive deep-unfolded PGA model are developed in
Section 4.
Section 5 reports the numerical results and provides the corresponding analysis. Finally,
Section 4 concludes the paper.
Throughout this paper, bold uppercase letters denote matrices, bold lowercase letters denote vectors, and scalar quantities are represented by regular letters. The operators , , and denote the transpose, complex conjugate, and Hermitian transpose, respectively. The sets of n-dimensional complex-valued (real-valued) vectors and complex-valued (real-valued) matrices are denoted by () and (), respectively. The symbols ⊗ and ⊙ denote the Kronecker product and the Hadamard product, respectively. The notations , , and represent the modulus of a scalar, the Euclidean norm, and the Frobenius norm, respectively. In addition, and denote vectorization and the trace operation, respectively. The notation denotes a complex Gaussian distribution with mean and variance , while denotes the uniform distribution over the interval . The notation denotes the real part of a complex-valued quantity.
Figure 1 illustrates the system architecture of the considered MIMO-JCAS hybrid beamforming system.
4. State-Driven Adaptive Deep-Unfolded Network
We improve the overall network performance through the learnable ; however, the learned is still static and associated with the layer index, which leads to poor model robustness. The step-size parameters are in a similar situation. This motivates us to introduce an iteration-state-driven hyperparameter control mechanism to adjust the parameters according to the current iteration state, thereby enhancing the optimization efficiency, model robustness, and final joint performance of shallow unfolded networks.
4.1. Definition of the State Vector
To enable dynamic adjustment of the step sizes
and
and the gradient balancing factor
while preserving the interpretability of deep unfolding, we define a lightweight control neural network. Its role is to take the current iteration-state vector as input and output the scaling factors for the step sizes and the gradient balancing factor. The state vector is defined to characterize the state in the iteration, which can be written as:
Here,
,
, and
denote the joint objective, the communication rate, and the sensing error at the
i-th outer iteration. These features provide a direct evaluation of the current optimization stage and explicitly quantify the real-time tradeoff status between the communication rate and the sensing error. Moreover,
and
denote the Frobenius norms of the gradients of the communication rate and the sensing error with respect to the analog precoder, respectively, while
and
denote the corresponding Frobenius norms with respect to the digital precoder. The gradient norms serve as the fundamental driving forces of the updates. Including them is crucial because they allow the controller to perceive which objective is dominating the descent direction at any given moment, providing necessary numerical foundations for adaptively generating the gradient balancing factor
.
and
denote the step size and gradient balancing factor of the
i-th outer iteration. We further introduce the statistic
to characterize the overall hyperparameter level in the current layer, which is defined as:
where
is the average size of the
i-th inner step.
By feeding the current base scale of the hyperparameters (, , ) into the controller, the network gains awareness of its own parameter space. This self-awareness is essential to strictly prevent the generated scaling factors from causing step-size explosion or vanishing gradients.
In addition, the normalized index is introduced to characterize the depth of the current iteration layer, which enables the network to distinguish between early-stage exploration and late-stage exploitation.
To capture the changing trend of the joint objective,
is introduced to describe the improvement achieved during optimization and to provide additional dynamic information, which is defined as:
where
denotes the change in the joint objective at the
i-th outer iteration, which informs the controller whether the optimization is improving steadily or plateauing, allowing it to adaptively decay or adjust the step sizes to guarantee stable convergence.
4.2. Adaptive Network Outputs and Update Procedure
In summary, the constructed state vector includes the joint objective, the communication rate, the sensing error, the gradient information of the communication and sensing terms with respect to the precoders, the hyperparameter statistics, the normalized layer index, and the variation trend of the joint objective. These features provide a comprehensive description of the optimization state during iteration. Based on the state vector, the lightweight control neural network outputs the scaling factors to dynamically adjust the hyperparameters at the current iteration.
The hyperparameter scaling factors are denoted by
,
, and
, corresponding to the scaling of the inner-loop step size, the outer-loop step size, and the gradient balancing factor at the
i-th outer iteration, respectively. With these definitions, the update procedure is rewritten as:
The analog precoder is then updated according to Equations (8) and (9), and:
followed by Equation (11).
The digital precoder is updated as:
followed by the normalization operation in Equation (13).
4.3. State-Driven Adaptive Deep-Unfolded Network Architecture
Consider an I-layer deep-unfolded network based on the modified PGA procedure, which aims to output a feasible hybrid precoder with improved communication and sensing performance by maximizing the joint objective . The alternating iterative procedure of the original PGA algorithm is mapped into a deep-unfolded network architecture, where the outer loop and inner loop correspond to the updates of the digital precoder and the analog precoder , respectively. Building on this framework, a manifold-based update mechanism is introduced into the inner loop to better accommodate the unit-modulus constraint, while a learnable gradient balancing factor is incorporated into the outer loop. Based on the current iteration state, a lightweight control neural network generates the scaling factors. Specifically, the network output is passed through a sigmoid function and then linearly mapped to predefined positive intervals, so that the hyperparameters can remain bounded and numerically stable during inference. The overall procedure is summarized in Algorithm 1.
Figure 3 illustrates the architecture of the proposed model, including the manifold-based analog update, the Euclidean digital update, and the lightweight control neural network. Different colored arrows and boxes are used to distinguish different update flows and functional modules in the unfolded architecture.
| Algorithm 1 State-Driven Adaptive Deep-Unfolded projected gradient ascent |
- Require:
, , w, , the trained base hyperparameters , and the trained control network. - Ensure:
and . - 1:
Initialization: Generate , and set . - 2:
for to do - 3:
Obtain , , and . - 4:
Obtain from Equation (22), and obtain from Equation (23). - 5:
Form the state vector based on Equation (21). - 6:
Generate , , and from via the lightweight control network. - 7:
Obtain from Equation (24b), and obtain from Equation (24c). - 8:
Set . - 9:
for to do - 10:
Obtain from Equation (24a). - 11:
Obtain the gradients and at -
based on Equations (14) and (16). - 12:
Obtain from Equation (8), and obtain from Equation (9). - 13:
Obtain based on Equations (25) and (11). - 14:
end for - 15:
Set . - 16:
Obtain the gradients and at -
based on Equations (15) and (17). - 17:
Obtain based on Equation (26) using and . - 18:
Apply the normalization in Equation (13). - 19:
Set . - 20:
end for - 21:
return and as the solution to and .
|
4.4. Model Training
Based on the above architecture, the loss function is defined as:
The proposed network is trained in an end-to-end unsupervised manner using PyTorch (version 2.5.1). The simulation setup is specified as
,
, and
, with
and
. To allow a fair comparison among different methods on a unified dataset, the same channel-data setting as in [
23] is adopted. The unfolded architecture uses
outer iterations and
inner iterations. The learnable base step sizes are initialized to
. The gradient-balancing parameter is initialized as
. In implementation, this is realized by setting
and absorbing the factor
into the radar-gradient weight. The state-driven controller is implemented as a two-hidden-layer MLP with dimensions 12–32–32–3, where ReLU is used in the hidden layers and the outputs are mapped to
. Adam is adopted for optimization, where the learning rates for the base step sizes, the gradient-balancing parameter, and the controller are set to
,
, and
, respectively.
5. Simulation Results
5.1. Computational Complexity Analysis
The computational complexity of the proposed method is mainly determined by the gradient evaluations in the analog and digital precoder updates. Let
and
denote the complexity of one analog and one digital precoder update, respectively. For the analog precoder update, the computation of
has complexity
, while the computation of
involves the covariance mismatch term
and has dominant complexity
. The manifold projection and retraction are element-wise operations with complexity
, which is negligible compared with the gradient computations. Therefore, the complexity of one analog precoder update is given by
For the digital precoder update, the computation of
has complexity
, while the computation of
is dominated by the covariance-related matrix operations with complexity
. The transmit-power normalization requires
operations and is not dominant. Therefore, the complexity of one digital precoder update is
Since the analog precoder is updated
J times and the digital precoder is updated once in each unfolded outer layer, the overall computational complexity before approximation is
where
accounts for the additional manifold projection and retraction operations, and
denotes the forward-pass complexity of the lightweight state-driven controller.
For typical HBF transceivers, it is generally true that the number of transmit antennas is much larger than the number of RF chains and users, i.e.,
[
23].
Under this commonly used HBF setting, the covariance-related terms dominate the complexity, and the above expression can be approximated as
Therefore, compared with the conventional unfolded PGA algorithm, the proposed method preserves the same dominant complexity order. The additional manifold projection/retraction and state-driven controller introduce only
and
overheads, respectively, which are negligible compared with the dominant gradient computations.
5.2. Communication and Sensing Performance
We next evaluate the performance of the proposed improved deep-unfolded PGA algorithm in terms of communication and sensing performance. For comparison, the following methods are selected as baseline algorithms.
In experiments, the tradeoff parameter is set to , and the signal-to-noise ratio is set to . The corresponding numerical results are shown in the figures.
For clarity, the abbreviations used in the legends of the following performance figures are defined as follows.
UPGA-J10 denotes the conventional unfolded projected gradient ascent algorithm with
inner iterations [
23].
UPGA-J20 denotes the conventional unfolded projected gradient ascent algorithm with
inner iterations [
23].
UPGA-J10-eta denotes the unfolded PGA algorithm with inner iterations and the proposed learnable gradient-balancing factor, but without the manifold-based analog update and without the state-driven controller.
UPGA-J10-Manifold denotes the unfolded PGA algorithm with inner iterations and the proposed manifold-based analog update, but without the learnable gradient-balancing factor and without the state-driven controller.
UPGA-J10-Adaptive denotes the complete proposed method, which includes the manifold-based analog update, the learnable gradient-balancing factor, and the state-driven adaptive controller.
MADMM-RCG denotes the MADMM-RCG baseline algorithm adapted from [
8].
Fully digital ZF denotes the fully digital zero-forcing beamforming baseline without the hybrid analog–digital hardware constraint.
Figure 4,
Figure 5 and
Figure 6 jointly illustrate the iterative evolution of the compared methods in terms of communication rate, sensing error, and the joint objective
, respectively. These three metrics should be interpreted together. This is because the proposed method targets joint communication–sensing optimization rather than communication-rate maximization alone. It can be observed that the proposed state-driven adaptive deep-unfolded model maintains a competitive communication rate while achieving the lowest sensing error during the iterative process. As a result, it obtains the highest joint objective, which indicates a more favorable communication–sensing tradeoff.
The comparison among the unfolded variants further verifies the contribution of each proposed component. Compared with the conventional deep-unfolded PGA model with , introducing a trainable gradient-balancing factor improves the balance between the communication-rate gradient and the sensing-error gradient in the digital update. Incorporating the manifold-based analog update further makes the analog update more consistent with the geometry of the constant-modulus constraint.The simulation results have proved the effectiveness of these two mechanisms. The complete state-driven adaptive deep-unfolded model combines these two mechanisms and further adjusts the effective step sizes and gradient-balancing factor according to the current optimization state. Therefore, it reaches a higher joint objective and a stable high-performance region faster than the other unfolded baselines under the same number of outer iterations.
Table 1 provides a quantitative summary of the final performance and runtime of different methods. The reported values are obtained by evaluating each test sample individually and then averaging the results over the whole test set. The proposed state-driven adaptive deep-unfolded model achieves the lowest final sensing error and the highest final joint objective among all compared methods. Compared with UPGA-J10, it reduces the final sensing error by approximately 57.5% and improves the final joint objective by approximately 154.9%. Compared with UPGA-J20, it further reduces the sensing error by approximately 23.5% and improves the final joint objective by approximately 13.3%, while reducing the runtime by approximately 32.0%. This indicates that increasing the number of inner iterations alone is less efficient than the proposed state-driven adaptive design.
The proposed model also outperforms MADMM-RCG in terms of both final communication rate and final joint objective. Although the proposed model requires a slightly longer runtime than some lightweight unfolded baselines, its runtime remains moderate and is significantly lower than that of UPGA-J20. Therefore, the proposed model achieves a better performance–runtime tradeoff rather than relying on excessive computational cost.
5.3. Empirical Convergence Analysis
To further evaluate the empirical convergence behavior of the proposed method, we record the number of unfolded layers required to satisfy a practical convergence criterion. Based on the previously defined absolute variation of the joint objective, we further define the relative variation as
where
is a small positive constant introduced to avoid numerical instability.
The convergence layer is defined as the first unfolded layer at which the relative variation of the joint objective remains below a predefined threshold for p consecutive unfolded layers. In this paper, we set . This criterion is used to characterize practical numerical stability rather than to claim theoretical convergence to a stationary point.
In
Table 2, “Avg. convergence layer” denotes the average number of layers required for convergence, while “Std.” denotes the standard deviation of the required convergence layers.
As shown in
Table 2, the proposed method achieves the highest convergence ratio among all compared iterative algorithms. All test samples satisfy the practical convergence criterion, while UPGA-J10, UPGA-J20, and MADMM-RCG achieve convergence ratios of
,
, and
, respectively. Compared with UPGA-J20, the proposed method reduces the average convergence layer from
to
while improving the convergence ratio from
to
. Although MADMM-RCG has a smaller average convergence iteration, its lower convergence ratio and larger standard deviation indicate less stable convergence behavior across test samples. These results show that the proposed method achieves more reliable empirical convergence than the baseline iterative algorithms.
Table 3 reports the empirical convergence behavior of the proposed method under different convergence thresholds over 100 independent test samples. Under the
criterion, all test samples reach practical convergence, with an average convergence layer of 62.21 and a standard deviation of 17.66. When a looser threshold of
is adopted, the proposed method reaches practical convergence within 10.53 layers on average, indicating rapid early-stage stabilization of the unfolded inference process. As the threshold becomes stricter, the required convergence layer generally increases, which is expected because a smaller threshold imposes a more stringent stability requirement. Even under the stringent
criterion, all test samples still satisfy the practical convergence condition within the maximum unfolded depth. For the very strict
criterion,
of the test samples reach practical convergence, with an average convergence layer of 105.81. These results demonstrate that the proposed method exhibits stable empirical convergence behavior across different convergence thresholds.
5.4. Effect of the Number of Sensing Targets on the Joint Objective
Figure 7 shows the joint objective comparison under different numbers of sensing targets, where
L varies from 1 to 5. This experiment evaluates the robustness of different methods under varying sensing-scene complexities.
The proposed method consistently achieves the highest joint objective across all target settings. The increasement of L makes the communication-sensing tradeoff harder to optimize. The advantage of the proposed method comes from its state-driven adaptive control mechanism. The state features allow the controller to adjust the analog step size, digital step size, and gradient-balancing factor according to the current optimization condition. In particular, the adaptive gradient-balancing factor changes the relative contribution of the rate-gradient and sensing-gradient terms, thereby adjusting the effective update direction when the sensing task becomes more complex.
This demonstrates that the proposed method is not tailored to a single sensing setting, but has better robustness to variations in sensing-scene complexity.
5.5. Joint Objective Under Different SNR Conditions
Figure 8 compares the joint objective
achieved by different methods under different SNR conditions. As the SNR increases, the joint objective first increases and then decreases after reaching its peak around
. In the low- and medium-SNR regions, the performance gaps among different methods are relatively small. This is because the system performance is still largely noise-limited in these regions, so the benefit brought by different precoder update strategies is not fully reflected in the joint objective.
After around , the joint objective decreases because the sensing mismatch term increases faster than the communication-rate gain. Under the fixed tradeoff weight w, this reduces the overall value of .
Compared with the baseline methods, the proposed state-driven adaptive deep-unfolded model shows a slower degradation of the joint objective in the high-SNR region. This indicates that the proposed model provides a more stable communication–sensing tradeoff under varying SNR conditions. Such robustness benefits from the state-driven controller, which adaptively adjusts the effective step sizes and the gradient-balancing factor according to the current optimization state.
Remark on active jamming: The SNR-based evaluation mainly reflects passive noise-level variations. In practical MIMO-JCAS systems, active jamming and adversarial interference may also degrade communication and sensing performance. A CGAN-based fusion CNN framework is used for few-shot jamming signal classification [
26]. The classified jamming type can be regarded as additional state information for describing the external interference environment. Therefore, in future jamming-aware extensions, jamming classifier outputs, together with interference-plus-noise power or received covariance features, can be incorporated into the state vector of the proposed controller to support interference-aware update control.
5.6. Evolution of the Scaling Factor
This paper further examines the gradient variations under different signal-to-noise ratio (SNR) conditions. It is observed that, as the SNR increases, the scale of variation in
gradually becomes larger. To compensate for this gradient variation, the evolution of
in the iterations should exhibit a trend opposite to that of
, which is particularly evident under high-SNR conditions. By jointly considering
Figure 2 and
Figure 9, it can be seen that the proposed method is able to adaptively correct the imbalance between the communication gradient and the sensing gradient on the digital side, thereby improving the robustness and joint optimization performance of the model under different SNR conditions.
5.7. Effect of the Tradeoff Parameter w
Figure 10,
Figure 11 and
Figure 12 jointly evaluate the performance of different methods under different tradeoff weights
w. As
w increases, the optimization places more emphasis on sensing-error reduction. As a result, the sensing error generally decreases, while the communication rate decreases accordingly. This confirms that
w controls the operating point between communication and sensing performance.
Figure 12 further shows that the proposed state-driven adaptive deep-unfolded model consistently achieves the highest joint objective across different values of
w. This indicates good generalization to different communication–sensing preferences, rather than effectiveness only under one fixed tradeoff setting. Such generalization benefits from the state-driven controller. Different values of
w change the relative importance of communication-rate maximization and sensing-error reduction, which requires different update behaviors during optimization. By using the current optimization state to adjust the step sizes and gradient-balancing factor, the proposed model can better match these changing communication–sensing preferences and maintain a favorable joint objective across different values of
w.
5.8. Performance Under Different Channel Models
To further evaluate whether the proposed state-driven unfolded optimizer is overly specialized to the original channel distribution, we additionally test its performance under different channel models. In addition to the channel used in the main simulations, Rayleigh fading and Rician fading channels are further considered.
For the Rayleigh fading channel, each channel coefficient is independently generated from a circularly symmetric complex Gaussian distribution, i.e.,
. For the Rician fading channel, both a deterministic LoS component and a scattered NLoS component are considered, and the channel is modeled as
where
and
. The Rician factor is set to
dB in the simulations.
To ensure a fair comparison across different channel models, all generated channel datasets are normalized to the same average per-element channel power.
Figure 13 compares the performance of different methods under the Rayleigh fading channel and the Rician fading channel. These two channel models have different statistical characteristics from the original channel. It can be seen that the proposed method consistently achieves the highest final joint objective in both channel models. The performance advantage mainly comes from the state-driven adaptive update mechanism, which dynamically adjusts the update behavior according to the current optimization state, rather than relying on fixed step sizes or a fixed communication-sensing balancing strategy. The results demonstrate that the proposed method is not limited to the original channel and can remain effective under channel models with different statistical characteristics.
5.9. Performance Under the Non-Uniform Linear Array (NULA) Configuration
Table 4 presents the quantitative results under the NULA configuration. Compared with the default ULA setting in
Table 1, the proposed model maintains almost the same joint performance when the antenna arrangement is changed. Specifically, the final joint objective of the proposed model is
under the ULA setting and
under the NULA setting, showing only a slight decrease of about
. Meanwhile, the proposed model still achieves the lowest sensing error under the NULA configuration.
Although UPGA-J10 and UPGA-J10- achieve higher communication rates under the NULA configuration, their sensing errors are much larger, resulting in significantly lower joint objectives. MADMM-RCG obtains a relatively low sensing error, but its communication rate and final joint objective are still inferior to those of the proposed model. Overall, the proposed model achieves the best joint objective under both the ULA and NULA configurations. This indicates that the proposed state-driven adaptive framework is not overly specialized to the original ULA setting and can maintain strong adaptability under the tested non-uniform antenna arrangement.