Dynamic Resource Allocation in Full-Duplex Integrated Sensing and Communication: A Multi-Objective Memetic Grey Wolf Optimizer Approach

Feng, Xu; Wang, Jianquan; Sun, Lei; Zhang, Chaoyi; Wang, Teng

doi:10.3390/electronics14193763


by Xu Feng ^1,2, Jianquan Wang ^2,3, Lei Sun ^2,3, Chaoyi Zhang ^1,2,* and Teng Wang ^1,2

¹ Beijing Engineering Research Center of Industrial Spectrum Imaging, University of Science and Technology Beijing, Beijing 100083, China

² School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China

³ Institute of Industrial Internet, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China

* Author to whom correspondence should be addressed.

Electronics 2025, 14(19), 3763; https://doi.org/10.3390/electronics14193763

Submission received: 9 August 2025 / Revised: 12 September 2025 / Accepted: 17 September 2025 / Published: 23 September 2025

(This article belongs to the Special Issue Integrated Sensing and Communications for 6G)


Abstract

To meet the dual demands of 6G cellular networks for high spectral efficiency and environmental sensing, this paper proposes a full-duplex (FD) integrated sensing and communication (ISAC) dynamic resource allocation framework. At the heart of the framework lies a dynamic frame structure that can self-adapt the time-domain resource ratio between sensing and communication, designed to flexibly handle complex traffic demands. In FD mode, however, the trade-off between communication and sensing performance, exacerbated by severe self-interference (SI), morphs into a non-convex, NP-hard multi-objective optimization problem (MOP). To tackle this, we propose an Adaptive Hybrid Memetic Multi-Objective Grey Wolf Optimizer (AM-MOGWO). Finally, simulations were conducted on a high-fidelity platform that integrates 3GPP-standardized channels, which was further extended to a challenging multi-cell interference scenario to validate the algorithm’s robustness. AM-MOGWO was systematically benchmarked against standard Grey Wolf Optimizer (GWO), random search (RS), and the genetic algorithm (GA). Simulation results demonstrate that in both the single-cell and the more complex multi-cell environments, the proposed algorithm excels in locating the Pareto-optimal solution set, where its solution set significantly outperforms the baseline methods. Its hypervolume (HV) metric surpasses the second-best approach by more than 93%. This result quantitatively demonstrates the algorithm’s superiority in finding a high-quality set of trade-off solutions, confirming the framework’s high efficiency in complex interference environments.

Keywords:

ISAC; GWO; 6G; MOP

1. Introduction

As the 6G vision crystallizes, wireless networks are evolving from single-purpose communication services toward multifunctional convergence [1]. Across domains such as connected vehicles, smart factories, smart cities, and the metaverse, the demand for both reliable sensing and high-efficiency communication is surging [2,3]. Yet because communications and radar rely on separate hardware and platforms, they trigger severe contention for spectrum resources [4]. ISAC has emerged to reconcile soaring resource demands with holistic network capabilities; through their co-design, it promises to spark a new generation of intelligent applications [5].

For ISAC, much of the current research centers on partitioning communication and sensing functions by separating them in time or space. Reference [6] proposes a three-stage Time Division Duplexing (TDD) frame structure, while [7,8] subsequently design a dynamically adjustable frame to meet the joint communication-and-sensing demands in vehicular networks. Alternatively, by combining spatial and temporal division, configuring a single antenna array can also achieve switching between communication and sensing [9]. Although such designs avoid mutual interference [10], they also prevent ISAC systems from fully exploiting their spectral-efficiency potential. To meet 6G’s stringent demands on spectrum utilization, the FD mode [3] has now become the focal point of ISAC research. Unlike the interference-avoidance approaches mentioned earlier, FD mode allows a base station to transmit and receive simultaneously on the same time–frequency resources, maximizing resource utilization [11,12]. For instance, [13] repurposes downlink resources by having the BS serve as both radar transceiver and communication transmitter, whereas [14] reuses uplink resources for the same purpose. At the same time, FD operation introduces a thornier physical constraint—strong SI [11]—which turns what was once a straightforward resource-allocation task into a far more intricate problem. Furthermore, much of the existing research simplifies the network model to an idealized single-cell scenario, thereby overlooking the significant impact of inter-cell interference that is prevalent in practical deployments [15].

In ISAC-centric systems, resource scheduling is itself a pivotal lever for reconciling the fundamental tension between sensing accuracy and communication performance [16], and research into algorithms for this allocation problem continues to advance. Early efforts mostly leaned on classical mathematical-optimization tools: both [17,18] employed semidefinite relaxation (SDR) to optimize joint communication-and-radar beamforming under different application scenarios. Thereafter, a more mainstream approach has been the weighted-sum method, which linearly fuses multiple objectives into a single utility function. Reference [19], for instance, investigates joint beamforming design algorithms aimed at maximizing the weighted sum of sensing and communication performance, while [20] builds on non-orthogonal multiple access (NOMA) to formulate a beamforming problem that maximizes the weighted sum of communication throughput and effective sensing power. Yet this approach lacks flexibility, relies heavily on manually set weights, and cannot capture the complete Pareto frontier. To overcome this limitation, researchers have turned to multi-objective evolutionary algorithms (MOEAs) [21]; however, when applied to ISAC-specific problems, these methods often suffer from slow convergence and a tendency to fall into local optima. Recently, deep reinforcement learning (DRL) has also been brought into ISAC research: ref. [22] devises a hybrid DRL agent to maximize the system’s cooperative rate. Yet emerging DRL approaches still suffer from high training costs and unstable policy convergence.

Motivated by the issues above, this paper formulates an FD-mode ISAC system that explicitly accounts for self-interference and incorporates state-of-the-art self-interference cancelation (SIC) techniques [12]. To tackle the resulting NP-hard MOP [23], we propose AM-MOGWO. By integrating Lévy flights and an adaptive hybrid search, AM-MOGWO delivers strong global exploration while avoiding the high training costs of DRL-based approaches. The main contributions of this paper are summarized as follows:

The proposed AM-MOGWO utilizes a problem-driven fusion of diverse strategies—including Lévy Flight, an adaptive hybrid search, and Memetic Computing—enabling it to exhibit superior global exploration capabilities and efficient convergence performance when solving complex NP-hard problems such as ISAC resource allocation. To ensure a rigorous and effective evaluation of the proposed optimization algorithms, this paper first establishes a high-fidelity system model for an ISAC network.
This paper constructs a comprehensive ISAC system model that simulates a realistic operational environment by incorporating key physical factors—including 3GPP-standardized channels, the Doppler effect, environmental clutter, and bidirectional self-interference—thereby situating the resource allocation problem within a challenging and practical scenario for validation.
The proposed algorithm’s performance is comprehensively evaluated in both a foundational single-cell environment and a more challenging, realistic multi-cell interference scenario. In both settings, the superiority of AM-MOGWO over baseline methods is systematically demonstrated through visual Pareto front dominance and quantitative metrics, yielding critical insights for the practical design and deployment of ISAC systems.

The remainder of this paper is organized as follows. Section 2 details the ISAC system model we have constructed, including the scenario description, channel model, and the multi-cell interference extension. Section 3 elaborates on the core mechanisms and pseudocode of the proposed AM-MOGWO algorithm. Section 4 presents and discusses the simulation results, including the performance comparison of the algorithms in both single-cell and multi-cell scenarios. Section 5 provides a summary of the main work in this study. Finally, Section 6 outlines future research directions. The overall framework of the proposed AM-MOGWO algorithm is illustrated in Figure 1.

2. Materials and Methods

This section establishes the system model for a multiuser FD ISAC network. The model is constructed to reflect a challenging and realistic operational environment by explicitly incorporating key physical phenomena: bidirectional SI resulting from FD operation, environmental clutter, and a composite channel structure that combines 3GPP-compliant, large-scale path loss with small-scale Rician fading [24]. Based on this comprehensive system model, we formulate the resource allocation challenge as an MOP. The primary goal is to jointly maximize two conflicting objectives (the aggregate downlink communication rate and the radar sensing mutual information) while adhering to the quality-of-service (QoS) requirements of all users [25].

2.1. Scenario Description

2.1.1. A Foundational Single-Cell Scenario

The operational environment for the ISAC system under investigation is a single-cell downlink scenario, as conceptually illustrated in Figure 2. In this framework, a central base station (BS) performs two concurrent tasks using a unified signal. For communication, the BS transmits downlink data to a set of mobile users. Simultaneously, for sensing, it leverages the same signal to detect and track a separate, non-communicating target, such as a vehicle. As the figure shows, the BS processes the echo signal reflected from the radar target to extract sensing information. For our performance evaluation, we define the specific parameters of this environment. The BS is situated at the center of a 50-m-radius cell at a height of 25 m. It serves ten mobile user equipments (UEs), which are randomly located within an annulus of 10 m to 50 m from the BS and move at speeds uniformly distributed between 30 and 60 km/h. In a realistic sensing environment, performance is primarily impeded by environmental clutter. We model this by distributing twenty static scatterers throughout the cell, each with a random radar cross-section (RCS) [26] in the range of [0.01, 0.1] m². These scatterers generate unwanted echoes, representing the main source of radar interference.

2.1.2. A More Challenging Multi-Cell Interference Scenario

To more accurately evaluate the performance and robustness of the proposed AM-MOGWO algorithm in practical network deployments, we construct a more challenging multi-cell interference scenario. This scenario adopts the classic hexagonal network topology, which is standard in cellular network research. The model comprises J = 7 cells: a central cell (indexed j = 0) and a full first tier of six interfering cells (indexed j ∈ {1, …, 6}). Based on our simulation parameters, the inter-site distance is set to 100 m.

Under this layout, the signal reception environment for a UE in the central cell becomes significantly more complex. In addition to the desired signal from the serving base station and the intra-cell self-interference introduced by the full-duplex mode, the UE will also continuously receive downlink signals from all six neighboring interfering base stations. This aggregate inter-cell interference is a key bottleneck affecting the performance of modern cellular networks. It makes the resource allocation problem in ISAC systems trickier and places higher demands on the optimization algorithm’s ability to find globally optimal solutions under strong interference.
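As a minimal sketch of the layout described above, the seven site positions can be generated as follows. Only J = 7 and the 100 m inter-site distance come from the text; the function name `hex_first_tier`, the placement of the central BS at the origin, and the orientation of the first tier (first neighbor on the positive x-axis) are illustrative assumptions.

```python
import math


def hex_first_tier(isd: float = 100.0) -> list[tuple[float, float]]:
    """Return (x, y) positions in metres of a central site plus its first tier.

    Index 0 is the central cell (j = 0); indices 1..6 are the interferers.
    Orientation is an assumption: neighbour 1 lies on the positive x-axis.
    """
    sites = [(0.0, 0.0)]
    for j in range(6):
        angle = math.radians(60.0 * j)  # neighbours are spaced 60 degrees apart
        sites.append((isd * math.cos(angle), isd * math.sin(angle)))
    return sites


if __name__ == "__main__":
    for j, (x, y) in enumerate(hex_first_tier(100.0)):
        print(f"cell {j}: x = {x:7.2f} m, y = {y:7.2f} m")
```

In a hexagonal grid every first-tier site is one inter-site distance from the center and from its two adjacent neighbors, which gives a quick sanity check on the generated coordinates.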

2.2. Dynamic TDM Frame

Inheriting and evolving the flexible OFDM-based numerology of 5G New Radio (NR) [27], which supports multiple sub-carrier spacings (SCSs) to enable diverse service requirements, provides a robust foundation for integrating the advanced functionalities envisioned for 6G.

Building upon this principle, this work proposes a dynamic Time-Division Multiplexing (TDM) frame structure designed to facilitate adaptable resource allocation for ISAC [8]. As depicted in Figure 3, each transmission frame is temporally partitioned into two distinct, contiguous phases: an initial Sensing Phase comprising N_{s} OFDM symbols, where the BS executes radar-centric tasks, followed by a Communication Phase of N_{c} symbols dedicated to multi-user downlink data transmission.

The cardinal advantage of this architecture lies in its intrinsic flexibility and adaptability. By dynamically tuning the ratio of N_{s} to N_{c}, the frame structure endows the system with the capability to manage the fundamental sensing-communication performance trade-off [1] in response to real-time service demands, shifting focus from high-precision sensing to high-throughput communication as needed. This adjustable time-domain partitioning thus constitutes a key degree of freedom (DoF) for the resource allocation optimization problem investigated herein.
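To make this degree of freedom concrete, the following sketch enumerates candidate frame partitions and the fraction of time each gives to sensing. The function name `frame_splits`, the fixed frame length `n_total`, and the example bounds are assumptions made for illustration; the text does not state these values here.

```python
def frame_splits(n_total: int, n_s_min: int, n_s_max: int) -> list[tuple[int, int, float]]:
    """List (N_s, N_c, sensing time fraction) for every admissible split.

    n_total is the assumed number of OFDM symbols per frame, so N_c = n_total - N_s.
    """
    if not 0 <= n_s_min <= n_s_max <= n_total:
        raise ValueError("bounds must satisfy 0 <= n_s_min <= n_s_max <= n_total")
    return [(n_s, n_total - n_s, n_s / n_total) for n_s in range(n_s_min, n_s_max + 1)]


if __name__ == "__main__":
    # Hypothetical 14-symbol frame with between 2 and 6 sensing symbols.
    for n_s, n_c, frac in frame_splits(14, 2, 6):
        print(f"N_s = {n_s:2d}, N_c = {n_c:2d}, sensing share = {frac:.3f}")
```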

2.3. Channel Model

The performance evaluation of resource allocation algorithms is critically dependent on the fidelity of the underlying channel model. For analytical tractability, many existing studies on ISAC resort to simplified channel assumptions, such as pure Line-of-Sight (LOS) propagation or otherwise idealized fading models. Such simplifications, however, risk overlooking the intricate propagation characteristics of real-world environments, potentially leading to overly optimistic conclusions regarding algorithmic performance. In stark contrast, to ensure that the robustness and effectiveness of our proposed algorithm are rigorously validated, this work establishes a high-fidelity, composite channel model. By integrating both large-scale fading effects, which govern the average path loss, and small-scale fading phenomena, which capture rapid multipath-induced fluctuations, we create a challenging yet realistic simulation environment that mirrors practical urban deployment scenarios.

2.3.1. Large-Scale Fading

This work adopts the 3GPP TR 38.901 path-loss model for the Urban Macro (UMa) scenario to ensure a realistic characterization of signal propagation. A defining feature of this model, which distinguishes it from simpler deterministic approaches, is its use of a probabilistic mechanism to determine whether a given user link is in an LOS or Non-Line-of-Sight (NLOS) state.

The model assigns distinct roles to the two-dimensional distance and the three-dimensional distance. The two-dimensional distance serves as the criterion for determining the link state. In urban environments, the existence of a direct path depends more on the layout of ground-level obstacles such as buildings. Therefore, the horizontal distance between the user and the base station is a more effective indicator for assessing the likelihood of LOS, while the three-dimensional distance is used to compute the path loss, because signal energy attenuates with the actual distance it travels through space, i.e., the three-dimensional slant range.

The LOS probability for each user link, denoted as P_{LOS}, is a function of the two-dimensional (2D) horizontal distance d_{2D}, given by

P_{LOS}(d_{2D}) = \begin{cases} 1, & \text{if } d_{2D} \leq 18\ \text{m} \\ \frac{18}{d_{2D}} + \left(1 - \frac{18}{d_{2D}}\right) e^{-\frac{d_{2D}}{63}}, & \text{if } d_{2D} > 18\ \text{m} \end{cases}

(1)

The path loss (PL, in dB) for each link is then calculated by first probabilistically assigning an LOS or NLOS state and subsequently applying the corresponding formula.

The path loss for the LOS case, {PL}_{LOS}, is formulated as

{PL}_{LOS} = 28.0 + 22 \log_{10}(d_{3D}) + 20 \log_{10}(f_{c})

(2)

The path loss for the NLOS case, {PL}_{NLOS}, is formulated as

{PL}_{NLOS} = 13.54 + 39.08 \log_{10}(d_{3D}) + 20 \log_{10}(f_{c}) - 0.6 (h_{ue} - 1.5)

(3)

The complete path loss model described in (1)–(3) is adopted from the 3GPP TR 38.901 standard for the UMa scenario [24], where d_{3D} denotes the 3D slant range (m), f_{c} is the carrier frequency (GHz), and h_{ue} is the antenna height of the user equipment (m). To ensure the model remains physically consistent, the final NLOS path loss value is taken as the maximum of the calculated LOS and NLOS values.
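Equations (1)–(3) can be sketched in a few lines of Python. This is an illustrative sketch rather than the authors' simulator: the function names and the random-draw interface are assumptions, and the example UE height of 1.5 m and carrier frequency of 3.5 GHz are placeholders not taken from the text (the 25 m BS height is).

```python
import math
import random


def p_los(d_2d: float) -> float:
    """LOS probability of Eq. (1); d_2d is the horizontal distance in metres."""
    if d_2d <= 18.0:
        return 1.0
    ratio = 18.0 / d_2d
    return ratio + (1.0 - ratio) * math.exp(-d_2d / 63.0)


def pl_los(d_3d: float, f_c_ghz: float) -> float:
    """LOS path loss in dB, Eq. (2)."""
    return 28.0 + 22.0 * math.log10(d_3d) + 20.0 * math.log10(f_c_ghz)


def pl_nlos(d_3d: float, f_c_ghz: float, h_ue: float) -> float:
    """NLOS path loss in dB: Eq. (3), floored by the LOS value as the text states."""
    raw = (13.54 + 39.08 * math.log10(d_3d)
           + 20.0 * math.log10(f_c_ghz) - 0.6 * (h_ue - 1.5))
    return max(pl_los(d_3d, f_c_ghz), raw)


def sample_path_loss(d_2d, h_bs, h_ue, f_c_ghz, rng):
    """Draw the link state from d_2d, then compute the loss from the slant range."""
    d_3d = math.hypot(d_2d, h_bs - h_ue)
    is_los = rng.random() < p_los(d_2d)
    loss_db = pl_los(d_3d, f_c_ghz) if is_los else pl_nlos(d_3d, f_c_ghz, h_ue)
    return loss_db, is_los


if __name__ == "__main__":
    rng = random.Random(0)
    # 25 m BS height is from the scenario; 1.5 m UE height and 3.5 GHz are placeholders.
    print(sample_path_loss(d_2d=40.0, h_bs=25.0, h_ue=1.5, f_c_ghz=3.5, rng=rng))
```

Note how the two distances play the separate roles described above: `d_2d` decides the link state, while `d_3d` enters the loss formulas.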

2.3.2. Small-Scale Fading

The small-scale fading effects, which arise from multipath interference, are modeled using a Rician distribution [28]. This choice is motivated by the operational context of ISAC systems, which frequently utilize higher-frequency bands such as millimeter-wave (mmWave). Propagation in such bands is characterized by a high probability of a stable LOS path, a feature that the Rician model accurately represents.

The Rician fading model explicitly decomposes the channel coefficient into two constituent parts: a deterministic LOS component and a random, scattered NLOS component. To form the complete channel gain, these small-scale effects are scaled by the large-scale path loss discussed previously. The resulting composite channel gain, h, is formulated as follows [28]:

h = \sqrt{G_{L}} \left( \sqrt{\frac{K}{K + 1}} h_{los} + \sqrt{\frac{1}{K + 1}} h_{nlos} \right)

(4)

where G_{L} is the linear-scale, large-scale power gain, derived from the path loss in dB ({PL}_{dB}) as G_{L} = 10^{-{PL}_{dB}/10}. The term K denotes the Rician K-factor, which is the power ratio of the deterministic LOS component to the scattered components; for the special case where K = 0, the channel reduces to Rayleigh fading. The terms h_{los} and h_{nlos} represent the deterministic LOS and the random scattered components, respectively, with h_{nlos} being modeled as a circularly symmetric complex Gaussian (CSCG) random variable, i.e., h_{nlos} \sim \mathcal{CN}(0, 1) [29].

This synthesis of large-scale path loss and small-scale fading effects results in a high-fidelity channel model. This model serves as a realistic and robust foundation for evaluating our algorithm’s performance under practical propagation conditions.
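A minimal sketch of drawing one composite channel coefficient according to Eq. (4) is given below. The function name `rician_gain`, the unit-modulus LOS term with a caller-supplied phase, and the use of a linear (not dB) K-factor are assumptions made for illustration.

```python
import cmath
import math
import random


def rician_gain(pl_db: float, k_factor: float, rng: random.Random,
                los_phase: float = 0.0) -> complex:
    """Sample the composite channel coefficient h of Eq. (4).

    pl_db     : large-scale path loss in dB
    k_factor  : Rician K-factor on a linear scale (K = 0 gives Rayleigh fading)
    los_phase : phase of the deterministic LOS term (assumed unit modulus)
    """
    g_l = 10.0 ** (-pl_db / 10.0)             # linear large-scale power gain
    h_los = cmath.exp(1j * los_phase)         # deterministic component
    sigma = math.sqrt(0.5)                    # CN(0, 1): variance 1/2 per real dimension
    h_nlos = complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
    small_scale = (math.sqrt(k_factor / (k_factor + 1.0)) * h_los
                   + math.sqrt(1.0 / (k_factor + 1.0)) * h_nlos)
    return math.sqrt(g_l) * small_scale


if __name__ == "__main__":
    rng = random.Random(42)
    h = rician_gain(pl_db=90.0, k_factor=5.0, rng=rng)
    print(f"|h|^2 = {abs(h) ** 2:.3e}")
```

Because the two small-scale terms are weighted by K/(K+1) and 1/(K+1), the average of |h|² equals G_{L} for any K, so the fading redistributes power around the path-loss mean without changing it.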

2.4. System Performance Evaluation Indicators

To evaluate both functions of the proposed ISAC system and to establish the objective functions for the subsequent optimization problem, this section derives and defines the core performance indicators for sensing and communication in turn.

2.4.1. Radar Metric

The sensing task is to reduce the prior uncertainty of the target state by measuring the target echo signals. Based on information theory [30], Mutual Information (MI) can directly quantify this reduction in uncertainty, and is therefore widely used as an effective metric for measuring the estimation performance of radar systems [31].

For a channel that can be approximated as Gaussian, the MI is a function of the available bandwidth and the Signal-to-Clutter-plus-Interference-plus-Noise Ratio (SCNR). Therefore, this work adopts the radar mutual information rate (in bit/s) as the sensing performance metric, which is calculated as follows [30]:

I^{r a d} = B_{s e n s} \cdot \log_{2} (1 + SCNR)

(5)

where B_{sens} is the bandwidth utilized for sensing, and the SCNR is the ratio of the desired signal power to the aggregate power of all impairments, including environmental clutter, residual self-interference, and thermal noise.

2.4.2. Communication Metric

For the communication system, this paper adopts the Aggregate Downlink Rate of all downlink users as the key performance indicator. Its theoretical basis is the classic Shannon–Hartley theorem [32], which specifies the maximum rate at which information can be transmitted without error over a channel with a given bandwidth and signal-to-noise ratio. Therefore, the communication performance is defined as follows [7]:

C_{com} = \sum B \log_{2} (1 + SINR)

(6)

where B represents the system bandwidth and SINR represents the Signal-to-Interference-plus-Noise Ratio. In this ISAC system, the calculation of SINR will comprehensively consider the effects of Inter-Carrier Interference (ICI) caused by user mobility, residual SI of the system, and thermal noise.

2.5. System Modeling

The core task of this section is to establish precise mathematical performance models for the two conflicting functions of communication and sensing based on the aforementioned system model, and ultimately to construct a multi-objective optimization problem. The basis for the modeling is the dynamic time-division multiplexing (TDM) frame structure operating in FD mode, which was defined in Section 2.2. Our goal is to express the system’s sensing performance (radar MI) and communication performance (downlink aggregate rate) as functions of resource allocation variables, which mainly include symbol allocation in the time domain (N_{s}, N_{c}) and the allocation scheme in the power domain.

2.5.1. Modeling in the Single-Cell Scenario

We first model the sensing performance. Within the sensing phase composed of N_{s} symbols, the BS transmits sensing signals and processes radar echoes. The total signal y_{rad} processed by its receiver can be decomposed into four parts: the desired target echo s_{target}, clutter from the environment s_{clutter}, residual self-interference s_{SI} introduced by FD operation, and additive white Gaussian noise n:

y_{rad} = s_{target} + s_{clutter} + s_{SI} + n

(7)

To analyze this signal, we derive the power of its respective components. For a single target at a distance d_{t} with an RCS of \sigma_{t}, according to the monostatic radar equation, its echo power P_{echo, n} can be expressed as [28]

P_{echo, n} = \frac{P_{s, n} G_{bs}^{2} \lambda_{c}^{2} \sigma_{t}}{(4 \pi)^{3} d_{t}^{4} L}

(8)

where P_{s, n} is the total sensing transmit power, G_{bs} is the antenna gain, \lambda_{c} is the carrier wavelength, and L is the system loss.

Similarly, the total clutter power P_{clutter, n} from N_{cl} non-target scatterers in the environment is [28]

P_{clutter, n} = \sum_{j = 1}^{N_{cl}} \frac{P_{s, n} G_{bs}^{2} \lambda_{c}^{2} \sigma_{cl, j}}{(4 \pi)^{3} d_{cl, j}^{4} L}

(9)

To construct a high-fidelity simulation environment, our model explicitly considers the impact of environmental clutter, which is a primary source of interference in practical radar sensing. The parameters for the clutter scatterers are detailed in Section 4.

In FD operation mode, the residual self-interference P_{c \to s, n} refers to the interference leaked from the communication transmit link during the sensing slot. Its power is proportional to the total communication transmit power P_{c} and is uniformly distributed over all N_{sc} subcarriers:

P_{c \to s, n} = \frac{\eta_{c \to s} P_{c}}{N_{sc}}

(10)

where \eta_{c \to s} is the residual self-interference coefficient from communication to sensing. The thermal noise power of the receiver, P_{noise, s}, is

P_{noise, s} = N_{0} \Delta f F_{n}

(11)

where N_{0} is the noise power spectral density, \Delta f is the subcarrier bandwidth, and F_{n} is the noise figure. Thus, the SCNR of the radar on subcarrier n, denoted {SCNR}_{n}, can be expressed as

{SCNR}_{n} = \frac{P_{echo, n}}{P_{clutter, n} + P_{c \to s, n} + P_{noise, s}}

(12)

Based on this SCNR, we adopt the radar MI as the final sensing performance metric. For a Gaussian channel, and considering that the sensing task only occupies a time proportion of \frac{N_{s}}{N_{s} + N_{c}}, the total radar mutual information of the system, I^{rad}, can be obtained by accumulating the mutual information of all subcarriers:

I^{rad} = \frac{N_{s}}{N_{s} + N_{c}} \sum_{n = 1}^{N_{sc}} \Delta f \log_{2} (1 + {SCNR}_{n})

(13)

where \Delta f is the bandwidth of a single subcarrier. In the simulations of this paper, we make the common assumption that the sensing power is uniformly distributed over all N_{sc} subcarriers. Under this condition, the SCNR is identical on every subcarrier and is denoted simply as SCNR, so the above equation simplifies to

I^{rad} = \frac{N_{s}}{N_{s} + N_{c}} \cdot B \cdot \log_{2} (1 + SCNR)

(14)

where B = N_{sc} \Delta f is the total system bandwidth.
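The sensing chain of Eqs. (8)–(14) can be sketched as below, under the uniform-power assumption stated above. All function names, argument names, and the example numbers are illustrative assumptions; antenna gain and noise figure are taken as linear values, and targets and scatterers are passed as (RCS, distance) pairs.

```python
import math


def radar_return_power(p_tx, g_bs, wavelength, rcs, distance, loss=1.0):
    """Monostatic radar equation: Eq. (8), and each summand of Eq. (9)."""
    return (p_tx * g_bs ** 2 * wavelength ** 2 * rcs) / (
        (4.0 * math.pi) ** 3 * distance ** 4 * loss)


def thermal_noise(n0, delta_f, noise_figure):
    """Per-subcarrier noise power, Eq. (11); noise_figure is linear."""
    return n0 * delta_f * noise_figure


def radar_scnr(p_s_n, p_c, n_sc, g_bs, wavelength, target, clutter,
               eta_c2s, noise_power, loss=1.0):
    """Per-subcarrier SCNR, Eq. (12).

    target  : (rcs, distance) of the wanted target
    clutter : list of (rcs, distance) pairs for the static scatterers
    """
    p_echo = radar_return_power(p_s_n, g_bs, wavelength, *target, loss)
    p_clutter = sum(radar_return_power(p_s_n, g_bs, wavelength, rcs, d, loss)
                    for rcs, d in clutter)
    p_si = eta_c2s * p_c / n_sc                 # residual self-interference, Eq. (10)
    return p_echo / (p_clutter + p_si + noise_power)


def radar_mutual_information(n_s, n_c, bandwidth, scnr):
    """Time-scaled radar MI in bit/s, Eq. (14)."""
    return n_s / (n_s + n_c) * bandwidth * math.log2(1.0 + scnr)


if __name__ == "__main__":
    noise = thermal_noise(n0=4e-21, delta_f=30e3, noise_figure=10 ** 0.7)
    scnr = radar_scnr(p_s_n=0.01, p_c=10.0, n_sc=1024, g_bs=100.0, wavelength=0.0857,
                      target=(1.0, 40.0), clutter=[(0.05, 30.0), (0.02, 45.0)],
                      eta_c2s=1e-11, noise_power=noise)
    print(radar_mutual_information(n_s=4, n_c=10, bandwidth=1024 * 30e3, scnr=scnr))
```

Because the clutter terms scale with the same sensing power as the echo, raising the sensing power alone cannot push the SCNR past the echo-to-clutter ratio; the time split between N_{s} and N_{c} remains the other lever on I^{rad}.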

In the considered cellular system, a BS simultaneously serves a set of K uplink users, indexed by k \in U = \{1, 2, \dots, K\}, and a set of L downlink users, indexed by l \in L = \{1, 2, \dots, L\}.

During the communication phase, which consists of N_{c} symbols, the signal y_{c, l, n} received by downlink user l \in L on its allocated subcarrier n \in L_{l} is modeled as a linear superposition of the desired signal, cross-phase SI, ICI, and AWGN:

y_{c, l, n} = h_{l, n} \sqrt{G_{bs} G_{ue} p_{c, l, n}} s_{l, n} + i_{s \to c, n} + i_{ici, l, n} + z_{c, n}

(15)

where s_{l, n} is the data symbol with unit energy (i.e., E[|s_{l, n}|^{2}] = 1), p_{c, l, n} is the transmit power allocated to user l on this subcarrier, h_{l, n} is the channel gain, and G_{bs}, G_{ue} are the antenna gains of the BS and the user, respectively. The average power of each signal component is then derived to construct the SINR expression, starting with the desired signal power, P_{sig, l, n}, which is given by the second-order moment of the first term in (15):

P_{sig, l, n} = E \left[ \left| h_{l, n} \sqrt{G_{bs} G_{ue} p_{c, l, n}} s_{l, n} \right|^{2} \right] = G_{bs} G_{ue} |h_{l, n}|^{2} p_{c, l, n}

(16)

Next, the interference term, i_{s \to c, n}, is identified as the cross-slot self-interference arising from the sensing hardware leakage during the communication reception period, with its per-subcarrier power given by

P_{s \to c, n} = \eta_{s \to c} \frac{P_{s}}{N_{sc}}

(17)

where P_{s} is the total sensing transmit power.

The ICI power, P_{ici, l, n}, stems from the Doppler effect induced by user mobility, which disrupts the orthogonality among OFDM subcarriers. Its value is approximated to be proportional to the desired signal power, P_{sig, l, n}, on the same subcarrier:

P_{ici, l, n} = \kappa_{ici} (f_{d, l}) \cdot P_{sig, l, n}

(18)

where the dimensionless ICI coefficient, \kappa_{ici} (f_{d, l}), is a function of the user’s Doppler shift (f_{d, l}) and the OFDM symbol duration (T_{sym}), and can be approximated as [29]

\kappa_{ici} (f_{d, l}) \approx \frac{1}{12} (\pi f_{d, l} T_{sym})^{2}

(19)

The average thermal noise power, P_{noise, c}, over a single subcarrier bandwidth, \Delta f, is given by

P_{noise, c} = N_{0} \Delta f F_{n}

(20)

where N_{0}, \Delta f, and F_{n} are the noise power spectral density, the subcarrier bandwidth, and the linear value of the receiver’s noise figure, respectively.

The SINR for user l on subcarrier n, based on the preceding derivations, is defined as the ratio of the desired signal power to the total power of all interference and noise components:

{SINR}_{l, n} = \frac{P_{sig, l, n}}{P_{s \to c, n} + P_{ici, l, n} + P_{noise, c}}

(21)

Grounded in Shannon’s channel capacity theory, the aggregate downlink communication rate, R_{total}, is the sum of the rates of all downlink users, scaled by the time proportion, \frac{N_{c}}{N_{s} + N_{c}}, allocated to the communication task:

R_{total} = \frac{N_{c}}{N_{s} + N_{c}} \sum_{l \in L} \sum_{n \in L_{l}} \Delta f \log_{2} (1 + {SINR}_{l, n})

(22)
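The communication chain of Eqs. (16)–(22) can be sketched as follows. The function names and the dictionary layout (user index mapped to a list of per-subcarrier SINRs) are assumptions chosen for illustration, and all powers are linear values.

```python
import math


def signal_power(g_bs, g_ue, h, p_tx):
    """Desired signal power, Eq. (16); h may be a complex channel coefficient."""
    return g_bs * g_ue * abs(h) ** 2 * p_tx


def ici_coefficient(f_d, t_sym):
    """Dimensionless ICI coefficient, Eq. (19)."""
    return (math.pi * f_d * t_sym) ** 2 / 12.0


def sinr_single_cell(p_sig, p_si, kappa_ici, p_noise):
    """Eq. (21), with the ICI power modelled as kappa_ici * p_sig, Eq. (18)."""
    return p_sig / (p_si + kappa_ici * p_sig + p_noise)


def aggregate_rate(n_s, n_c, delta_f, sinr_per_user):
    """Eq. (22): sinr_per_user maps each user to the SINRs on its subcarriers."""
    time_share = n_c / (n_s + n_c)
    return time_share * sum(delta_f * math.log2(1.0 + s)
                            for sinrs in sinr_per_user.values()
                            for s in sinrs)


if __name__ == "__main__":
    kappa = ici_coefficient(f_d=200.0, t_sym=35.7e-6)   # placeholder Doppler and symbol time
    p_sig = signal_power(g_bs=100.0, g_ue=1.0, h=3e-5 + 1e-5j, p_tx=0.01)
    s = sinr_single_cell(p_sig, p_si=1e-15, kappa_ici=kappa, p_noise=6e-16)
    print(aggregate_rate(n_s=4, n_c=10, delta_f=30e3, sinr_per_user={0: [s, s], 1: [s]}))
```

Since the ICI power grows with the desired signal power, the SINR saturates at 1/\kappa_{ici} as the transmit power increases, so extra communication power yields diminishing returns for fast-moving users.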

The performance metrics, R_{total} and I^{rad}, derived in this section, form a solid basis for the subsequent formulation of the multi-objective optimization problem.

2.5.2. Modeling Extension for the Multi-Cell Scenario

While the model in Section 2.5.1 establishes a performance baseline in an ideal single-cell scenario, a more rigorous evaluation must account for interference from neighboring cells, a critical factor in dense network deployments. Therefore, this section extends the aforementioned model to a more challenging multi-cell interference scenario.

In this setting, a user in the central cell is subject not only to intra-cell self-interference but also to significant interference from the downlink transmissions of all neighboring base stations. Therefore, the SINR for user l on subcarrier n is formulated as

{SINR}_{l, n} = \frac{P_{sig, l, n}}{P_{s \to c, n} + P_{noise, c} + P_{inter-cell, l, n}}

(23)

where P_{sig, l, n}, P_{s \to c, n}, and P_{noise, c} are the powers of the desired signal, intra-cell residual self-interference, and thermal noise, respectively. The inter-cell interference, P_{inter-cell, l, n}, is the aggregate power from all interfering base stations, given by

P_{inter-cell, l, n} = \sum_{j = 1}^{J - 1} G_{bs, j} G_{ue, l} |h_{j, l, n}|^{2} p_{c, j, n}

(24)

In this formula, h_{j, l, n} is the channel gain from the j-th interferer to user l on subcarrier n, with G_{bs, j} and G_{ue, l} being the respective antenna gains. For a worst-case interference model, all interfering BSs transmit at maximum power (P_{max}), distributed evenly across the N_{sc} total subcarriers. Thus, the transmit power per subcarrier is p_{c, j, n} = P_{max} / N_{sc}.
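Under the worst-case assumption just described, Eqs. (23) and (24) reduce to a short computation. The sketch below is illustrative only: the function names and the list-based interface (one antenna gain and one channel coefficient per interfering BS) are assumptions.

```python
def inter_cell_interference(g_bs_list, g_ue, h_list, p_max, n_sc):
    """Aggregate inter-cell interference on one subcarrier, Eq. (24).

    g_bs_list : antenna gain of each interfering BS (length J - 1)
    h_list    : channel coefficient from each interfering BS to the user
    Every interferer is assumed to transmit p_max / n_sc on the subcarrier.
    """
    if len(g_bs_list) != len(h_list):
        raise ValueError("one channel coefficient is needed per interfering BS")
    p_sub = p_max / n_sc
    return sum(g_bs * g_ue * abs(h) ** 2 * p_sub
               for g_bs, h in zip(g_bs_list, h_list))


def sinr_multi_cell(p_sig, p_si, p_noise, p_inter):
    """Multi-cell SINR exactly as written in Eq. (23)."""
    return p_sig / (p_si + p_noise + p_inter)


if __name__ == "__main__":
    # Six first-tier interferers with placeholder gains and channel coefficients.
    p_inter = inter_cell_interference([100.0] * 6, 1.0, [1e-6 + 0j] * 6,
                                      p_max=20.0, n_sc=1024)
    print(sinr_multi_cell(p_sig=1e-10, p_si=1e-15, p_noise=6e-16, p_inter=p_inter))
```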

2.6. Multi-Objective Problem Formulation

Building upon the detailed mathematical descriptions of the ISAC system model and performance metrics from the preceding sections, this section formulates an MOP. The objective of this MOP is to find the Pareto-optimal trade-off between the dual communication and sensing functions by strategically allocating time- and power-domain resources, subject to the system’s physical limitations and QoS requirements.

Following the performance metric derivations, the MOP is formulated to simultaneously maximize the communication performance, i.e., the aggregate rate

R_{t o t a l}

, and the sensing performance, i.e., the radar mutual information

I_{r a d}

. This is achieved by optimizing a vector of resource allocation variables,

x

, which encompasses the sensing duration

N_{s}

and power allocation schemes, leading to the following formulation:

\begin{array}{l} \max_{x} {f_{1} (x) = R_{total} (x), f_{2} (x) = I_{r a d} (x)} \\ s . t . C 1 : P_{C} + P_{S} \leq P_{m a x} \\ C 2 : R_{t o t a l} \geq R_{m i n} \\ C 3 : N_{s} \in {N_{s, m i n}, \dots, N_{s, m a x}}, N_{s} \in Z \\ C 4 : P_{C} \geq 0, P_{S} \geq 0 \end{array}

(25)

The formulated optimization problem is governed by several key constraints. The maximum transmit power constraint (C1) reflects the physical limitation of the BS’s hardware, stipulating that the combined instantaneous power for communication ($P_{C}$) and sensing ($P_{S}$) cannot exceed the power amplifier’s maximum rating, $P_{max}$. In addition to this hardware limit, the QoS constraint (C2) guarantees a minimum user experience by requiring the aggregate data rate to remain at or above a threshold, $R_{min}$. Structurally, the integer symbol constraint (C3) dictates that the number of sensing symbols, $N_{s}$, must be an integer, as an OFDM symbol represents the fundamental discrete unit of time-domain resources. To ensure that neither function is deprived of resources, this value is also bounded within a feasible operational range, i.e., $N_{s,min} \leq N_{s} \leq N_{s,max}$. Finally, the non-negative power constraint (C4) reflects the physical fact that transmit power cannot be negative.
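To make constraints C1 to C4 concrete, the sketch below checks a candidate allocation and, as one possible way to handle the mixed-integer search space inside a metaheuristic, repairs a raw continuous candidate. The repair rule (round and clip the symbol count, clip negative powers, rescale powers that exceed the limit) is an assumption made for illustration and is not taken from the paper; the function names and the numeric ranges in the usage lines are likewise hypothetical.

```python
def is_feasible(n_s, p_c, p_s, r_total, p_max, r_min, n_s_min, n_s_max):
    """Return True when a candidate satisfies constraints C1 to C4."""
    c1 = p_c + p_s <= p_max                                   # C1: power budget
    c2 = r_total >= r_min                                     # C2: QoS rate floor
    c3 = isinstance(n_s, int) and n_s_min <= n_s <= n_s_max   # C3: integer range
    c4 = p_c >= 0.0 and p_s >= 0.0                            # C4: non-negativity
    return c1 and c2 and c3 and c4


def repair(n_s, p_c, p_s, p_max, n_s_min, n_s_max):
    """Project a raw (continuous) candidate back onto C1, C3 and C4.

    Assumed repair rule, not taken from the paper: round and clip N_s,
    clip negative powers to zero, then rescale powers if they exceed P_max.
    C2 depends on the rate model and is therefore not repaired here.
    """
    n_s = min(max(int(round(n_s)), n_s_min), n_s_max)
    p_c, p_s = max(p_c, 0.0), max(p_s, 0.0)
    total = p_c + p_s
    if total > p_max:
        scale = p_max / total
        p_c, p_s = p_c * scale, p_s * scale
    return n_s, p_c, p_s


# Hypothetical raw candidate produced by a continuous position update.
n_s, p_c, p_s = repair(3.7, 30.0, 20.0, p_max=40.0, n_s_min=1, n_s_max=13)
print(n_s, p_c, p_s)
```

A penalty term added to the fitness is a common alternative to repair; which of the two the authors use is not stated in the text.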

3. A Grey Wolf Optimizer for Dynamic Resource Allocation

The joint resource optimization for the ISAC system under investigation is a Mixed-Integer Non-Linear Programming (MINLP) problem. It is NP-hard due to the non-convexity of the objective functions and the coupled nature of the constraints, rendering traditional gradient-based or convex optimization methods either inapplicable or unable to guarantee global optimality. Consequently, employing metaheuristic algorithms, such as the AM-MOGWO proposed herein, provides an effective pathway to finding a high-quality approximation of the Pareto-optimal solution set at an acceptable computational cost.

This section presents a detailed exposition of the AM-MOGWO framework, which is specifically designed for the unique characteristics of the ISAC problem. We begin with the fundamental mathematical principles of the standard GWO. Subsequently, we elaborate on the architecture of our proposed AM-MOGWO, detailing its multi-objective handling mechanism, the solution encoding scheme, and a series of strategies designed to enhance its performance.

3.1. Standard GWO

GWO is a metaheuristic algorithm inspired by the social hierarchy and hunting behavior of grey wolves [33], wherein the search process is guided by the three best wolves (the leaders), which steer the entire population of search agents toward the most promising regions of the search space.

To emulate the social hierarchy of the wolf pack, each iteration of the algorithm involves sorting the population based on fitness values, where the top three solutions are designated as the alpha ($X_{\alpha}$), beta ($X_{\beta}$), and delta ($X_{\delta}$) wolves. These three leaders are considered to be the closest approximations to the prey (i.e., the optimal solution), while the remaining individuals, termed omega (ω) wolves, update their positions under the collective guidance of this leading trio.

The search process of the GWO emulates the pack’s hunting behaviors—such as encircling, chasing, and attacking the prey—wherein the position of each omega (ω) wolf is updated based on the collective guidance of the alpha (α), beta (β), and delta (δ) leaders. This process begins with the calculation of the distance vectors between the omega wolf and this leading trio [33]:

\begin{cases} \vec{D}_{\alpha} = | \vec{C}_{1} \cdot \vec{X}_{\alpha} - \vec{X} | \\ \vec{D}_{\beta} = | \vec{C}_{2} \cdot \vec{X}_{\beta} - \vec{X} | \\ \vec{D}_{\delta} = | \vec{C}_{3} \cdot \vec{X}_{\delta} - \vec{X} | \end{cases}

(26)

Using these distance vectors, the potential next positions towards the three leader wolves are then calculated as

\begin{cases} \vec{X}_{1} = \vec{X}_{\alpha} - \vec{A}_{1} \cdot \vec{D}_{\alpha} \\ \vec{X}_{2} = \vec{X}_{\beta} - \vec{A}_{2} \cdot \vec{D}_{\beta} \\ \vec{X}_{3} = \vec{X}_{\delta} - \vec{A}_{3} \cdot \vec{D}_{\delta} \end{cases}

(27)

The position of the omega (ω) wolf for the next iteration, $\vec{X}(t + 1)$, is then determined by averaging the three aforementioned potential positions:

\vec{X}(t + 1) = \frac{\vec{X}_{1} + \vec{X}_{2} + \vec{X}_{3}}{3}

(28)

where t indicates the current iteration, $\vec{X}(t)$ is the position vector of the omega wolf, and the coefficient vectors $\vec{A}$ and $\vec{C}$ (drawn independently for each leader) are calculated via $\vec{A} = 2 \vec{a} \cdot \vec{r}_{1} - \vec{a}$ and $\vec{C} = 2 \cdot \vec{r}_{2}$, with $\vec{r}_{1}$ and $\vec{r}_{2}$ being random vectors with elements drawn uniformly from [0, 1]. The components of the parameter $\vec{a}$ are linearly decreased from 2 to 0 over the course of the iterations to balance the algorithm’s global exploration and local exploitation capabilities.
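The standard GWO update described above can be sketched in a few lines of plain Python. This is a minimal illustration of the textbook update for a single omega wolf, with fresh random coefficients for each leader and each dimension; the function names are hypothetical and the code is not the authors' implementation.

```python
import random


def a_schedule(t, t_max):
    """Linearly decrease the control parameter a from 2 to 0."""
    return 2.0 * (1.0 - t / t_max)


def gwo_update(x, x_alpha, x_beta, x_delta, a, rng=random):
    """Return the next position of one omega wolf (standard GWO).

    For every dimension, a leader-guided candidate is computed for each of
    the three leaders and the three candidates are averaged.
    """
    new_x = []
    for j, xj in enumerate(x):
        candidates = []
        for leader in (x_alpha, x_beta, x_delta):
            coeff_a = 2.0 * a * rng.random() - a      # A = 2 a r1 - a
            coeff_c = 2.0 * rng.random()              # C = 2 r2
            dist = abs(coeff_c * leader[j] - xj)      # D = |C X_leader - X|
            candidates.append(leader[j] - coeff_a * dist)
        new_x.append(sum(candidates) / 3.0)
    return new_x


random.seed(1)
wolf = gwo_update([0.0, 0.0], [1.0, 2.0], [3.0, 4.0], [5.0, 6.0],
                  a=a_schedule(10, 100))
print(wolf)
```

When a reaches 0 the coefficient A vanishes, so the wolf moves exactly to the mean of the three leaders; larger values of a allow |A| > 1, which pushes wolves away from the leaders and supports exploration.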

3.2. AM-MOGWO

The standard GWO is primarily designed for single-objective optimization and is prone to premature convergence to local optima when tackling complex, multi-modal problems such as the ISAC resource allocation task formulated herein. To address these limitations and efficiently solve the established multi-objective problem, we propose AM-MOGWO. While this algorithm incorporates a Pareto-dominance-based mechanism to handle multiple objectives, its core innovation lies in an adaptive hybrid search framework constructed by fusing multiple search strategies.

The proposed AM-MOGWO is a Pareto-dominance-based algorithm that employs a bounded external archive to store the non-dominated solutions discovered during the search process; this set of archived solutions constitutes the final output. For the purpose of guiding the population search and determining the alpha, beta, and delta leaders, the total utility function, W, as modeled in the preceding section, is adopted as a scalar fitness function. It must be emphasized, however, that this fitness function serves only as an internal guidance mechanism for the algorithm and does not replace the Pareto-based selection of the final solutions.

Each solution to the optimization problem is encoded as a D-dimensional position vector, $\vec{X} = [N_{s}, P_{c}, P_{s}, \dots]$, where each element corresponds to a resource allocation variable, with the sensing duration, $N_{s}$, being constrained as an integer.

To address the inherent challenge of balancing global exploration with local exploitation in standard GWO and to enhance its search efficiency within complex optimization spaces, the proposed AM-MOGWO integrates a series of performance-boosting strategies.

First, the Lévy Flight mechanism is incorporated to enhance the algorithm’s global exploration capability and effectively avert premature convergence [34]. This mechanism, a random walk model that emulates the foraging behavior of various organisms, is characterized by step lengths drawn from a heavy-tailed distribution, which enables a combination of fine-grained local searches and occasional long-distance jumps. The position update for a search agent executing a Lévy Flight is consequently formulated as [34]:

\vec{X} (t + 1) = \vec{X} (t) + α \oplus L (λ)

(29)

where $\alpha > 0$ is a step-size control factor (not to be confused with the α wolf) and the operator $\oplus$ denotes the entry-wise product (i.e., Hadamard product). The components of the Lévy step-length vector, $L(\lambda)$, are drawn from a heavy-tailed distribution whose density for large step lengths $s$ behaves as

L(s) \sim \frac{\lambda \cdot \Gamma(\lambda) \cdot \sin(\pi \lambda / 2)}{\pi} \cdot \frac{1}{s^{1 + \lambda}}

(30)

where $\Gamma(\cdot)$ is the Gamma function. This mechanism is designed to probabilistically guide a subset of the population towards large-scale exploration.
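The text does not say how the Lévy steps are sampled. A widely used choice is Mantegna's algorithm, which is swapped in here purely as an assumption; the exponent of 1.5, the step size of 0.01, and all function names are likewise illustrative.

```python
import math
import random


def mantegna_sigma(lam):
    """Scale of the numerator Gaussian in Mantegna's algorithm (0 < lam < 2)."""
    num = math.gamma(1.0 + lam) * math.sin(math.pi * lam / 2.0)
    den = math.gamma((1.0 + lam) / 2.0) * lam * 2.0 ** ((lam - 1.0) / 2.0)
    return (num / den) ** (1.0 / lam)


def levy_step(lam, rng=random):
    """Draw one heavy-tailed step: u / |v|^(1/lam) with Gaussian u and v."""
    u = rng.gauss(0.0, mantegna_sigma(lam))
    v = rng.gauss(0.0, 1.0)
    while v == 0.0:                      # guard against division by zero
        v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1.0 / lam)


def levy_flight(x, step_size, lam=1.5, rng=random):
    """Perturb every coordinate of x by an independent scaled Levy step."""
    return [xj + step_size * levy_step(lam, rng) for xj in x]


random.seed(0)
print(levy_flight([0.5, 0.5, 0.5], step_size=0.01))
```

Most draws produce small moves, while occasional very large steps provide the long-distance jumps mentioned above; in practice the result would still need to be clipped to the search bounds.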

Second, the Opposition-Based Learning (OBL) strategy is incorporated to maintain population diversity within the search space [35]. This strategy is founded on the principle that a solution’s opposite may offer a better approximation to the optimum than the solution itself. Thus, for a given solution, $\vec{X} = [x_{1}, \dots, x_{D}]$, in the D-dimensional space, its opposite counterpart, $\vec{X}' = [x_{1}', \dots, x_{D}']$, is defined as [35]

x_{j}' = lb_{j} + ub_{j} - x_{j}, \quad j = 1, \dots, D

(31)

where $lb_{j}$ and $ub_{j}$ represent the lower and upper bounds of the search space in the j-th dimension, respectively. The OBL strategy is applied during both the population initialization and random perturbation phases to prevent premature population aggregation.
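Equation (31) and its use during initialization can be illustrated as follows. The selection rule (keep the fittest half of the combined random and opposite populations) is the usual OBL initialization and is assumed here; the toy fitness function and the names are hypothetical.

```python
import random


def opposite(x, lb, ub):
    """Opposite point of x within the box [lb, ub], per Equation (31)."""
    return [l + u - xj for xj, l, u in zip(x, lb, ub)]


def obl_initialize(n, lb, ub, fitness, rng=random):
    """Draw n random points, add their opposites, keep the n fittest.

    `fitness` is any scalar function to be maximized.
    """
    randoms = [[rng.uniform(l, u) for l, u in zip(lb, ub)] for _ in range(n)]
    pool = randoms + [opposite(x, lb, ub) for x in randoms]
    pool.sort(key=fitness, reverse=True)
    return pool[:n]


lb, ub = [0.0, 0.0], [10.0, 10.0]
print(opposite([1.0, 8.0], lb, ub))

random.seed(2)
# Toy fitness with its peak at (3, 3), used only to exercise the routine.
population = obl_initialize(5, lb, ub, lambda x: -sum((v - 3.0) ** 2 for v in x))
print(population)
```

The opposite of a point always stays inside the same bounds, so no additional clipping is required at this stage.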

Furthermore, to intensify the local exploitation capability around elite solutions, we integrate principles from Chaotic Search [36] and Memetic Computing [37]. This strategy leverages the ergodic and stochastic properties of chaotic maps, such as the Logistic map, to conduct a fine-grained local search in the vicinity of a wolf’s position. The iterative formula for the Logistic map is given by [38]

z_{k + 1} = μ \cdot z_{k} \cdot (1 - z_{k}), z_{k} \in (0,1)

(32)

where $\mu$ is the control parameter. The resulting chaotic sequence is used to generate a set of neighboring candidate solutions around a wolf’s position, thereby accelerating convergence to the optimum.
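One way to turn the Logistic map into a local search is sketched below. The mapping from a chaotic value to a displacement, the search radius of 5% of each variable's range, the choice of 4 for the control parameter (the fully chaotic regime of the map), and the seed value 0.7 are all assumptions made for illustration.

```python
def logistic_sequence(z0, mu, length):
    """Iterate z <- mu * z * (1 - z) and return the generated values."""
    seq, z = [], z0
    for _ in range(length):
        z = mu * z * (1.0 - z)
        seq.append(z)
    return seq


def chaotic_neighbors(x, lb, ub, count=5, radius=0.05, z0=0.7, mu=4.0):
    """Generate `count` neighbours of x driven by a Logistic-map sequence.

    Each chaotic value z in (0, 1) is mapped to a signed offset in
    [-radius, +radius] times the variable range, then clipped to the bounds.
    z0 should avoid the map's fixed and pre-periodic points (0, 0.25, 0.5,
    0.75, 1), otherwise the sequence degenerates.
    """
    zs = logistic_sequence(z0, mu, count * len(x))
    neighbors = []
    for i in range(count):
        candidate = []
        for j, xj in enumerate(x):
            z = zs[i * len(x) + j]
            step = radius * (ub[j] - lb[j]) * (2.0 * z - 1.0)
            candidate.append(min(max(xj + step, lb[j]), ub[j]))
        neighbors.append(candidate)
    return neighbors


elite = [4.0, 12.5]
for neighbor in chaotic_neighbors(elite, lb=[1.0, 0.0], ub=[13.0, 40.0]):
    print(neighbor)
```

In a memetic step, each neighbour would be evaluated and offered to the archive, so that a neighbour replaces the elite solution only if it is not dominated.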

The fusion of the aforementioned enhancement strategies with the standard GWO hunting behavior results in a novel adaptive hybrid update framework. Within this framework, the final position update for an omega (ω) wolf is no longer determined by a simple average. Instead, an adaptive strategy selection mechanism is introduced, which intelligently switches among three distinct behaviors—standard hunting, global exploration (Lévy Flight), and local exploitation (Chaotic Search)—based on the current iteration phase and search performance. The core logic of this improved update can thus be summarized as

\vec{X}(t + 1) = \begin{cases} \vec{X}_{\alpha} - \vec{A}_{1} \cdot \vec{D}_{\alpha} + \alpha \oplus L(\lambda), & \text{if } p < p_{m} \\ \frac{\vec{X}_{1} + \vec{X}_{2} + \vec{X}_{3}}{3} + z_{k}(\vec{X}_{\alpha}), & \text{if } p \geq p_{m} \end{cases}

(33)

where $p$ is a random number drawn from $[0, 1]$, while $p_{m}$ is an adaptive switching probability that can be adjusted based on the iteration count or population diversity. The first branch of the update rule represents an exploratory hunting behavior integrated with Lévy Flight. In contrast, the second branch enacts fine-grained exploitation, biasing the search towards the α wolf’s position while perturbing it with the chaotic sequence $z_{k}$. The intermediate position vectors, $\vec{X}_{1}$, $\vec{X}_{2}$, and $\vec{X}_{3}$, are computed as in the standard GWO. Through this hybrid update mechanism, AM-MOGWO maintains an efficient and dynamic balance between global exploration and local exploitation throughout the entire optimization process. The detailed implementation of this adaptive mechanism within the complete AM-MOGWO framework is formally outlined in Algorithm 1.
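The two-branch rule in Equation (33) can be sketched as below. Several details are not specified in the text and are filled in as labeled assumptions: the linear schedule for the switching probability, the interpretation of the chaotic term as a small signed offset derived from the current Logistic-map value, and the step-size and radius constants. The Lévy sampler is passed in as a callable so that the sketch stays self-contained.

```python
import random


def switching_probability(t, t_max, p_start=0.9, p_end=0.1):
    """Hypothetical linear schedule for p_m: explore early, exploit late."""
    return p_start + (p_end - p_start) * t / t_max


def hybrid_update(x, leaders, a, p_m, levy_step, z_k,
                  step_size=0.01, radius=0.05, rng=random):
    """One position update following the two branches of Equation (33).

    leaders   : (x_alpha, x_beta, x_delta)
    levy_step : callable returning one Levy-distributed step (assumed given)
    z_k       : current Logistic-map value in (0, 1)
    """
    explore = rng.random() < p_m          # p < p_m selects the first branch
    new_x = []
    for j, xj in enumerate(x):
        guided = []
        for leader in leaders:            # alpha, beta, delta in that order
            coeff_a = 2.0 * a * rng.random() - a
            coeff_c = 2.0 * rng.random()
            guided.append(leader[j] - coeff_a * abs(coeff_c * leader[j] - xj))
        if explore:
            # Alpha-guided move plus a scaled Levy step (global exploration).
            new_x.append(guided[0] + step_size * levy_step())
        else:
            # Standard GWO average plus an assumed chaotic offset (exploitation).
            new_x.append(sum(guided) / 3.0 + radius * (2.0 * z_k - 1.0))
    return new_x


random.seed(3)
leaders = ([1.0, 2.0], [3.0, 4.0], [5.0, 6.0])
p_m = switching_probability(t=20, t_max=100)
print(hybrid_update([0.0, 0.0], leaders, a=1.6, p_m=p_m,
                    levy_step=lambda: random.gauss(0.0, 1.0), z_k=0.37))
```

The Gaussian lambda in the usage line is only a stand-in for a real Lévy sampler; the positions returned would still need boundary enforcement, as in step 10 of Algorithm 1.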

Algorithm 1 The Proposed AM-MOGWO Framework

Input: N (population size), T_{max} (maximum iterations), lb, ub (search bounds)
Output: A (the Pareto front)
1: Initialize population \vec{X}(0) with OBL; archive A \leftarrow \emptyset; t \leftarrow 0
2: Evaluate objectives and fitness for \vec{X}(0)
3: A \leftarrow UpdateArchive(A, \vec{X}(0))
4: repeat
5:     (\vec{X}_{\alpha}, \vec{X}_{\beta}, \vec{X}_{\delta}) \leftarrow SelectLeaders(\vec{X}(t), W)
6:     for i = 1 to N do
7:         // Adaptive hybrid position update rule
8:         \vec{X}_{i}(t + 1) = \begin{cases} \vec{X}_{\alpha} - \vec{A}_{1} \cdot \vec{D}_{\alpha} + \alpha \oplus L(\lambda), & \text{if } p < p_{m} \\ \frac{\vec{X}_{1} + \vec{X}_{2} + \vec{X}_{3}}{3} + z_{k}(\vec{X}_{\alpha}), & \text{if } p \geq p_{m} \end{cases}
9:     end for
10:    Enforce boundary constraints on the new population \vec{X}(t + 1)
11:    Evaluate objectives and fitness for \vec{X}(t + 1)
12:    A \leftarrow UpdateArchive(A, \vec{X}(t + 1))
13:    // Memetic step
14:    \vec{X}_{elite}' \leftarrow ChaoticLocalSearch(SelectElite(A))
15:    A \leftarrow UpdateArchive(A, \vec{X}_{elite}')
16:    Update control parameters a and p_{m}; t \leftarrow t + 1
17: until t \geq T_{max}
18: return A
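The archive update used in steps 3, 12, and 15 of Algorithm 1 relies on Pareto dominance. A minimal sketch for maximization objectives is given below. How the bounded archive is truncated when it overflows is not described in the text, so the rule used here (repeatedly drop the entry with the closest neighbour in objective space) is an assumption, as are the function names and the default capacity.

```python
import math


def dominates(f_a, f_b):
    """True if objective vector f_a Pareto-dominates f_b (both maximized)."""
    return (all(a >= b for a, b in zip(f_a, f_b))
            and any(a > b for a, b in zip(f_a, f_b)))


def update_archive(archive, candidates, capacity=100):
    """Merge candidates into the archive and keep only non-dominated entries.

    Each entry is a (position, objectives) pair. Overflow handling is an
    assumed crowding rule, not the paper's stated mechanism.
    """
    pool = list(archive) + list(candidates)
    front = []
    for i, (x_i, f_i) in enumerate(pool):
        dominated = any(dominates(f_j, f_i)
                        for j, (_, f_j) in enumerate(pool) if j != i)
        duplicate = any(tuple(f_i) == tuple(f_k) for _, f_k in front)
        if not dominated and not duplicate:
            front.append((x_i, f_i))
    while len(front) > max(capacity, 1):
        # Drop the entry whose nearest neighbour in objective space is closest.
        crowding = [min(math.dist(f_i, f_k)
                        for k, (_, f_k) in enumerate(front) if k != i)
                    for i, (_, f_i) in enumerate(front)]
        front.pop(crowding.index(min(crowding)))
    return front


# Objectives are (rate, radar MI) in arbitrary illustrative units.
candidates = [([1], (1.0, 5.0)), ([2], (2.0, 4.0)),
              ([3], (1.5, 3.0)), ([4], (0.5, 0.5))]
archive = update_archive([], candidates)
print([f for _, f in archive])
```

In the usage example the last two candidates are dominated by (2.0, 4.0) and are discarded, while the first two are mutually non-dominated and both remain in the archive.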

4. Results

This section presents a comprehensive performance evaluation of the proposed AM-MOGWO algorithm, conducted through a two-stage simulation framework to assess its effectiveness and robustness.

In Section 4.1, we analyze the algorithm’s performance in a foundational single-cell scenario. In this setting, we investigate the impact of key system parameters, such as the self-interference coefficient, on the Pareto-optimal frontier and benchmark AM-MOGWO against several baseline methods using HV and Inverted Generational Distance (IGD) metrics. In Section 4.2, we validate the algorithm’s robustness and practical applicability in a more challenging multi-cell interference scenario. Here, we focus on demonstrating the superiority of AM-MOGWO over the baseline algorithms in a realistic, interference-limited environment. The key simulation parameters are listed in Table 1.

4.1. Performance Analysis in the Single-Cell Scenario

Figure 4 illustrates the impact of SI on system performance: the system remains highly robust under low-to-moderate SI levels. The communication success rate stays at or above 95% until the residual SI coefficient reaches a critical threshold of approximately $1 \times 10^{-2}$. However, once this threshold is crossed, system performance deteriorates sharply, with the communication success rate plummeting toward zero in a cliff-like fashion. This demonstrates a pronounced threshold effect, rather than a gradual, linear decline.

To further elucidate this phenomenon, Figure 5 presents the CDFs of the communication rate under both high and low self-interference levels. The figure shows that high self-interference (orange line) causes the entire rate distribution to shift markedly toward the lower-rate region. Under low-interference conditions, the median communication rate (at a CDF of 0.5) is approximately 1.6 Gbps, whereas it drops to about 0.8 Gbps under high interference. Despite the performance degradation, the system still satisfies the 1.0 Gbps QoS target with a probability of approximately 25% under high-interference conditions.
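The quantities read from a CDF plot in this paragraph (the median rate and the probability of meeting a rate target) can be computed from per-realization rate samples as sketched below. The sample values are synthetic and chosen only to exercise the functions; they are not the paper's data.

```python
import statistics


def empirical_cdf(samples, x):
    """Fraction of samples that are less than or equal to x."""
    return sum(1 for s in samples if s <= x) / len(samples)


def qos_satisfaction(samples, r_min):
    """Fraction of samples that meet or exceed the rate target r_min."""
    return sum(1 for s in samples if s >= r_min) / len(samples)


# Synthetic communication rates in Gbps (illustrative only).
rates_gbps = [0.5, 0.6, 0.7, 0.8, 0.9, 1.0, 1.1, 1.2]
median_rate = statistics.median(rates_gbps)
p_ok = qos_satisfaction(rates_gbps, r_min=1.0)
print(f"median = {median_rate:.2f} Gbps, P(rate >= 1.0 Gbps) = {p_ok:.3f}")
```

The median is the rate at which the empirical CDF crosses 0.5, and the QoS satisfaction probability is one minus the CDF evaluated just below the target.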

A deeper insight is that, even under ideal self-interference suppression (the leftmost region of the x-axis in Figure 4), the inherent stochastic fading of the channel itself constitutes another fundamental bottleneck. Therefore, to achieve the ultra-high-reliability communications envisioned for 6G, it is imperative to integrate advanced interference-cancelation techniques with more robust channel-enhancement technologies.

To evaluate the proposed algorithm’s performance, we compare it against the standard GWO, the GA, and RS. Qualitative analysis in Figure 6 shows that the proposed AM-MOGWO algorithm successfully produces a broad and continuous Pareto front, whereas all benchmark algorithms converge to a single point that is dominated by the AM-MOGWO front. This visually demonstrates AM-MOGWO’s ability to thoroughly explore the entire multi-objective solution space rather than prematurely converging to a single local optimum.

To quantitatively assess the overall quality of the solution sets, we adopt the HV metric, for which higher values indicate better overall performance. Figure 7 presents the HV distributions of the four algorithms over 30 independent runs. In terms of performance, the median HV of AM-MOGWO is around 1.18, whereas all benchmark algorithms fall within the 0.2–0.25 range. These gaps confirm that the Pareto front discovered by AM-MOGWO is of substantially higher quality. In terms of stability, AM-MOGWO’s boxplot is noticeably more compact, indicating that the algorithm exhibits highly consistent performance and strong robustness across multiple runs. Although the other algorithms also converge reliably, their HV values remain very low, reflecting the inherent limitation of single-point solutions under a multi-objective evaluation framework. Thus, the HV-metric comparison further corroborates the strong performance and high stability of the AM-MOGWO algorithm.
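For two maximization objectives, the hypervolume is the area jointly dominated by the solution set with respect to a reference point. The sketch below uses a simple sweep over points sorted by the first objective; the reference point and the example fronts are hypothetical and unrelated to the values reported in Figure 7.

```python
def hypervolume_2d(points, ref):
    """Hypervolume of a 2-D point set for maximization objectives.

    points : iterable of (f1, f2) pairs
    ref    : reference point that every counted point must strictly exceed
    """
    kept = sorted((p for p in points if p[0] > ref[0] and p[1] > ref[1]),
                  key=lambda p: p[0], reverse=True)
    hv, best_f2 = 0.0, ref[1]
    for f1, f2 in kept:
        if f2 > best_f2:                       # dominated points add nothing
            hv += (f1 - ref[0]) * (f2 - best_f2)
            best_f2 = f2
    return hv


spread_front = [(3.0, 1.0), (2.0, 2.0), (1.0, 3.0)]
single_point = [(2.0, 2.0)]
print(hypervolume_2d(spread_front, ref=(0.0, 0.0)))
print(hypervolume_2d(single_point, ref=(0.0, 0.0)))
```

The spread front covers a larger area than the single point even though both contain the solution (2.0, 2.0), which mirrors the observation that single-point solutions score poorly under HV.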

We also computed the IGD metric to gauge how closely the obtained solution set approximates the true Pareto-optimal front; lower IGD values indicate higher-quality solution sets that lie closer to the optimal frontier. Figure 8 compares the IGD-value distributions of the algorithms. The median IGD value of AM-MOGWO is approximately 0.3, whereas the IGD values for the other three algorithms all exceed 0.8. This result indicates that the solution set identified by AM-MOGWO is typically the closest to the true Pareto front, demonstrating the best convergence among all methods. The consistently high and stable IGD values of standard GWO, GA, and RS illustrate that their single-point solutions remain persistently distant from the full Pareto-optimal front. Although AM-MOGWO achieves the lowest median IGD, its distribution spans a wider range, reflecting fluctuations in the shape of the obtained front across runs. Nevertheless, given its markedly lower IGD values (about 0.3 versus more than 0.8), AM-MOGWO still delivers the best overall performance in approximating the true Pareto-optimal front.
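The IGD computation itself is short: for every point of a reference front, take the distance to the nearest obtained solution, then average. The sketch below assumes Euclidean distance in (already normalized) objective space and uses a hypothetical reference front, since the text does not state how the reference front was constructed.

```python
import math


def igd(reference_front, solutions):
    """Inverted generational distance (lower is better).

    Averages, over the reference points, the Euclidean distance to the
    closest member of the obtained solution set.
    """
    return sum(min(math.dist(r, s) for s in solutions)
               for r in reference_front) / len(reference_front)


reference = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]   # hypothetical reference front
spread_set = [(0.0, 0.9), (0.5, 0.4), (0.9, 0.0)]
single_point = [(0.5, 0.4)]
print(igd(reference, spread_set))
print(igd(reference, single_point))
```

A single point can sit close to one part of the reference front yet remain far from the rest, which is why single-point solutions yield high IGD values even when they are individually good.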

4.2. Robustness Validation in the Multi-Cell Interference Scenario

Figure 9 plots the generated Pareto front against the average performance points of the three baseline algorithms. It is clear from the figure that the Pareto front formed by AM-MOGWO completely dominates the average solutions of all baseline algorithms in the objective space. For a precise quantitative comparison, we analyzed the specific performance data of the baseline algorithms: the best-performing baseline was the GA, with its average performance point located at a communication rate of 0.231 Gbps and a radar mutual information of 55.5 Mbps. However, at the same communication rate of 0.231 Gbps, the Pareto solutions offered by the AM-MOGWO algorithm can achieve a radar performance of over 150 Mbps, representing a performance gain of more than 170%. In contrast, the performance points of the other two baseline algorithms, standard GWO and RS (approximately 49.7 Mbps and 49.5 Mbps, respectively), are dominated by AM-MOGWO by an even larger margin. More critically, the AM-MOGWO algorithm successfully delineates a clear performance trade-off boundary, providing decision-makers with a series of Pareto-optimal solutions to choose from to adapt to different task priorities. This stands in stark contrast to the baseline algorithms, which only converge to a single, suboptimal point, and highlights AM-MOGWO’s fundamental advantage in solving multi-objective problems.

For a more precise quantitative analysis, Figure 10 presents the average performance of each algorithm on the HV metric. The proposed AM-MOGWO algorithm achieved the best average value of 7.931 × 10¹⁷, an improvement of approximately 0.35% compared to the second-best RS algorithm (7.903 × 10¹⁷), indicating a small advantage in the coverage and breadth of the solution set.

Figure 11 presents the average performance of each algorithm on the IGD metric. The advantage of AM-MOGWO is more pronounced on the IGD metric, which evaluates algorithm convergence. Its average IGD value was 0.511790, an improvement of approximately 3.7% compared to the second-best GA (0.531435). This result indicates that AM-MOGWO guides the search process closer to the true Pareto front than the baselines do.
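The relative improvements quoted for the multi-cell HV and IGD results can be reproduced from the reported averages. In the HV ratio the common factor of 10¹⁷ cancels, and the IGD gain is measured as a reduction relative to the GA baseline, since lower IGD is better:

```latex
\frac{7.931 - 7.903}{7.903} \approx 0.0035 \;\; (0.35\%),
\qquad
\frac{0.531435 - 0.511790}{0.531435} \approx 0.037 \;\; (3.7\%)
```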

5. Conclusions

This paper investigates the dynamic resource allocation problem for FD ISAC systems in 6G networks. We analyze the trade-off between communication throughput and sensing accuracy, which, under severe SI, becomes a complex non-convex MOP. To solve this, we propose an efficient and robust intelligent optimization algorithm, AM-MOGWO.

The proposed AM-MOGWO enhances the standard GWO by integrating OBL and Lévy flight strategies. This approach improves the initial population’s quality and diversity and strengthens the algorithm’s global search capability. The algorithm also incorporates chaotic search and memetic computing to intensify local refinement around elite solutions. An adaptive switching mechanism integrates these strategies, ensuring a dynamic balance between global exploration and local exploitation.

We validated the algorithm on a 3GPP-compliant, high-fidelity simulation platform against benchmarks, including standard GWO, GA, and RS. The results highlight AM-MOGWO’s superior performance. In the single-cell scenario, AM-MOGWO achieved a median HV of approximately 1.18, about 0.93 above the runner-up’s 0.25 (roughly 4.7 times as large). Its median IGD was approximately 0.3, well below the benchmarks, all of which scored above 0.8. These findings confirm the algorithm’s excellent convergence and diversity in approximating the Pareto front.

6. Future Works and Outlook

This study provides a robust theoretical framework and an effective optimization algorithm for the FD-ISAC resource allocation problem. Future work can further enhance the framework’s practical applicability in several key directions.

A primary research direction is the evaluation of the proposed AM-MOGWO algorithm’s hardware feasibility and computational complexity. Real-time ISAC applications, such as vehicular networks, require low computational cost for practical deployment. Therefore, future studies should port the algorithm to specific hardware platforms like FPGAs or embedded GPUs. This would enable a quantitative analysis of its runtime, power consumption, and resource utilization in real physical systems.

Another important direction is extending the framework to more complex dynamic scenarios to comprehensively verify its universality. This includes high-speed mobility environments, such as high-speed rail or drone communications, to study the impact of fast time-varying channels and large Doppler shifts. It also involves exploring the algorithm’s scheduling strategies and scalability in dense user environments to meet the demands of massive device access.

The multi-cell interference model established in this study serves as a foundation for future network-level ISAC research. Based on this model, advanced technologies like Coordinated Multi-Point (CoMP) or Inter-Cell Interference Coordination (ICIC) can be further investigated. The aim is to proactively manage inter-cell interference, break through performance bottlenecks, and improve the overall efficiency of the entire cellular network.

Author Contributions

Conceptualization, X.F. and C.Z.; methodology, X.F. and C.Z.; software, X.F.; validation, X.F. and T.W.; formal analysis, X.F.; investigation, X.F.; data curation, X.F. and T.W.; writing—original draft preparation, X.F.; writing—review and editing, L.S., C.Z. and J.W.; visualization, X.F. and T.W.; supervision, L.S., C.Z. and J.W.; project administration, L.S., C.Z. and J.W.; funding acquisition, L.S., C.Z. and J.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Joint Research Fund for Beijing Natural Science Foundation and Haidian Original Innovation under Grant L232001, the Shanxi major science and technology programs project under Grant No. 202301020101001, the Research Topic of the Chinese Ethnic Community Research Institute (Research Base of the State Ethnic Affairs Commission) for 2025 under Grant ZLLL22, the GuangDong Basic and Applied Basic Research Foundation under Grants 2024A1515011866 and 2024A1515011480, the Central Guidance on Local Science and Technology Development Fund of ShanXi Province under Grants YDZJSX20231D005, YDZJSX2022B019 and YDZJSX20231B017, the National Natural Science Foundation of China under Grant 62002026, the University of Science and Technology Beijing Young Faculty International Exchange and Development Program under Grant QNXM20230016, and the Beijing Science and Technology Plan under Grant Z231100005923025.

Data Availability Statement

The data and the code of this study are available from the first author upon request.

Acknowledgments

The authors would like to acknowledge the support from editors and comments from all the reviewers.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

FD	Full-duplex
ISAC	Integrated sensing and communication
SI	Self-interference
MOP	Multi-objective optimization problem
AM-MOGWO	Adaptive Hybrid Memetic Multi-Objective Grey Wolf Optimizer
GWO	Grey wolf optimizer
RS	Random search
GA	Genetic algorithm
HV	Hypervolume
TDD	Time division duplexing
SDR	Semidefinite relaxation
NOMA	Non-orthogonal multiple access
MOEAs	Multi-objective evolutionary algorithms
DRL	Deep reinforcement learning
SIC	Self-interference cancelation
QoS	Quality-of-service
BS	Base station
UEs	User equipments
RCS	Radar cross-section
NR	5G new radio
SCS	Sub-carrier spacing
TDM	Time-division multiplexing
DoF	Degree of freedom
UMa	Urban macro
LOS	Line-of-sight
NLOS	Non-line-of-sight
2D	Two-dimensional
PL	Path loss
3D	Three-dimensional
mmWave	Millimeter-wave
CSCG	Circularly symmetric complex Gaussian
SCNR	Signal-to-clutter-plus-interference-plus-noise ratio
ICI	Inter-carrier interference
MI	Mutual information
MINLP	Mixed-integer non-linear programming
OBL	Opposition-based learning
IGD	Inverted generational distance
CDF	Cumulative distribution function
CoMP	Coordinated multi-point
ICIC	Inter-cell interference coordination

References

1. Liu, F.; Cui, Y.; Masouros, C.; Xu, J.; Han, T.X.; Eldar, Y.C. Integrated Sensing and Communications: Toward Dual-Functional Wireless Networks for 6G and Beyond. IEEE J. Sel. Areas Commun. 2022, 40, 1728–1767.
2. Chiriyath, A.R.; Paul, B.; Bliss, D.W.; Rock, S.M. Radar-Communications Convergence: Coexistence, Cooperation, and Co-Design. IEEE Trans. Cogn. Commun. Netw. 2017, 3, 1–12.
3. Zhang, J.A.; Liu, F.; Eldar, Y.C.; Li, G.Y.; Guo, Y.J.; Hanzo, L. Enabling Joint Communication and Radar Sensing in Mobile Networks—A Survey. IEEE Commun. Surv. Tutor. 2022, 24, 306–345.
4. Roh, W.; Seol, J.-Y.; Park, J.; Lee, B.; Lee, J.; Kim, Y.; Cho, J.; Cheun, K.; Aryanfar, F. Millimeter-Wave Beamforming as an Enabling Technology for 5G Cellular Communications: Theoretical Feasibility and Prototype Results. IEEE Commun. Mag. 2014, 52, 106–113.
5. Xiao, Z.; Liu, R.; He, Z.-Q.; Zhu, Y.; Schober, R.; Kumar, P.V. On the Performance-Cost Tradeoff in Integrated Sensing and Communication Systems. IEEE Trans. Wirel. Commun. 2023, 22, 9170–9184.
6. Liu, F.; Masouros, C.; Petropulu, A.P.; Griffiths, H.; Hanzo, L. Joint Radar and Communication Design: Applications, State-of-the-Art, and the Road Ahead. IEEE Trans. Commun. 2020, 68, 3834–3862.
7. Zhang, Q.; Wang, X.; Li, Z.; Wei, Z. Design and Performance Evaluation of Joint Sensing and Communication Integrated System for 5G mmWave Enabled CAVs. IEEE J. Sel. Top. Signal Process. 2021, 15, 1500–1514.
8. Zhang, Q.; Sun, H.; Gao, X.; Wang, X.; Feng, Z. Time-Division ISAC Enabled Connected Automated Vehicles Cooperation Algorithm Design and Performance Evaluation. IEEE J. Sel. Areas Commun. 2022, 40, 2206–2218.
9. Zhang, J.A.; Cantoni, A.; Huang, X.; Guo, Y.J.; Heath, R.W. Framework for an Innovative Perceptive Mobile Network Using Joint Communication and Sensing. In Proceedings of the IEEE 85th Vehicular Technology Conference (VTC2017-Spring), Sydney, Australia, 4–7 June 2017; pp. 1–5.
10. Han, L.; Wu, K. Joint Wireless Communication and Radar Sensing Systems—State of the Art and Future Prospects. IET Microw. Antennas Propag. 2013, 11, 876–885.
11. Sabharwal, A.; Schniter, P.; Guo, D.; Bliss, D.W.; Rangarajan, S.; Wichman, R. In-Band Full-Duplex Wireless: Challenges and Opportunities. IEEE J. Sel. Areas Commun. 2014, 32, 1637–1652.
12. Kolodziej, K.E.; Perry, B.T.; Herd, J.S. In-Band Full-Duplex Technology: Techniques and Systems Survey. IEEE Trans. Microw. Theory Techn. 2019, 67, 3025–3041.
13. Liu, F.; Liu, Y.-F.; Li, A.; Masouros, C.; Eldar, Y.C. Cramér–Rao Bound Optimization for Joint Radar-Communication Beamforming. IEEE Trans. Signal Process. 2022, 70, 240–253.
14. Wang, X.; Fei, Z.; Zhang, J.A.; Huang, J. Sensing-Assisted Secure Uplink Communications with Full-Duplex Base Station. IEEE Commun. Lett. 2022, 26, 249–253.
15. Babu, N.; Masouros, C.; Papadias, C.B.; Eldar, Y.C. Precoding for Multi-Cell ISAC: From Coordinated Beamforming to Coordinated Multipoint and Bi-Static Sensing. IEEE Trans. Wirel. Commun. 2024, 23, 14637–14651.
16. Rahman, M.L.; Zhang, J.-A.; Chen, X.; Saaifan, K.K.; Kouzayha, N.; Alkhateeb, A. Integrated Sensing and Communication: A Review. IEEE Open J. Commun. Soc. 2023, 4, 2381–2443.
17. Hua, H.; Xu, J.; Han, T.X. Rate-Splitting Multiple Access for ISAC: A Novel Framework for Transmit Beamforming and Subcarrier Allocation. IEEE Trans. Veh. Technol. 2023, 72, 10588–10603.
18. Chen, J.; Wu, K.; Niu, J.; Li, Y.; Xu, P.; Zhang, J.A. Spectral and Energy Efficient Waveform Design for RIS-Assisted ISAC. IEEE Trans. Commun. 2025, 73, 158–172.
19. Dou, C.; Huang, N.; Wu, Y.; Qian, L.; Quek, T.Q.S. Sensing-Efficient NOMA-Aided Integrated Sensing and Communication: A Joint Sensing Scheduling and Beamforming Optimization. IEEE Trans. Veh. Technol. 2023, 72, 13591–13603.
20. Wang, Z.; Liu, Y.; Mu, X.; Ding, Z.; Dobre, O.A. NOMA Empowered Integrated Sensing and Communication. IEEE Commun. Lett. 2022, 26, 677–681.
21. Liu, S.; Hu, J.; Tie, Z.; Shi, J. Joint Subcarrier and Power Allocation for OTFS Based Integrated Sensing and Communication System. In Proceedings of the 2023 International Conference on Ubiquitous Communication (Ucom), Xi’an, China, 15–17 December 2023; pp. 350–355.
22. Gong, J.; Gao, D.; Tian, R.; Liu, X.; Peng, M. Frame Structure Design and Task Scheduling in 5G NR for ISAC Systems: A Deep Reinforcement Learning Approach. IEEE Trans. Veh. Technol. 2025. Early Access.
23. Deb, K. Multi-Objective Optimization Using Evolutionary Algorithms; John Wiley & Sons: Chichester, UK, 2001.
24. 3GPP. Study on Channel Model for Frequencies from 0.5 to 100 GHz; Technical Report TR 38.901, V16.1.0; 3rd Generation Partnership Project (3GPP): Valbonne, France, 2020.
25. Vassaki, S.; Poulakis, M.I.; Panagopoulos, A.D.; Constantinou, P. Power Allocation in Cognitive Satellite Terrestrial Networks with QoS Constraints. IEEE Commun. Lett. 2013, 17, 1344–1347.
26. Huang, E.; DeLude, C.; Romberg, J.; Mukhopadhyay, S.; Swaminathan, M. Anisotropic Scatterer Models for Representing RCS of Complex Objects. In Proceedings of the 2021 IEEE Radar Conference (RadarConf21), Atlanta, GA, USA, 7–21 May 2021; pp. 1–6.
27. 3GPP. NR; Physical Channels and Modulation; Technical Specification TS 38.211, V16.7.0; 3rd Generation Partnership Project (3GPP): Valbonne, France, 2021.
28. Rappaport, T.S. Wireless Communications: Principles and Practice, 2nd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2002; pp. 175–190.
29. Proakis, J.G.; Salehi, M. Digital Communications, 5th ed.; McGraw-Hill: New York, NY, USA, 2008.
30. Bell, M.R. Information Theory and Radar: Mutual Information and the Design and Analysis of Radar Waveforms and Systems. Ph.D. Thesis, California Institute of Technology, Pasadena, CA, USA, 1988.
31. Tang, B.; Li, J. Spectrally Constrained MIMO Radar Waveform Design Based on Mutual Information. IEEE Trans. Signal Process. 2019, 67, 821–834.
32. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423.
33. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61.
34. Yang, X.-S. Nature-Inspired Metaheuristic Algorithms, 2nd ed.; Luniver Press: Frome, UK, 2010.
35. Tizhoosh, H.R. Opposition-Based Learning: A New Scheme for Machine Intelligence. In Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’05), Vienna, Austria, 28–30 November 2005; pp. 695–701.
36. Li, B.; Jiang, W.S. A new class of chaotic optimization algorithms. Control Decis. 1997, 12, 537–541.
37. Moscato, P. On Evolution, Search, Optimization, Genetic Algorithms and Martial Arts: Towards Memetic Algorithms; Technical Report C3P Report 826; California Institute of Technology: Pasadena, CA, USA, 1989.
38. May, R.M. Simple mathematical models with very complicated dynamics. Nature 1976, 261, 459–467.

Figure 1. Framework of the proposed AM-MOGWO algorithm. The main loop alternates between an Exploration Phase, which combines the Grey Wolf Optimizer (GWO) with Lévy Flights, and an Exploitation Phase, which uses GWO with a Chaotic Search.

Figure 2. System Model of ISAC.

Figure 3. Illustration of the TDM-based ISAC frame structure. The blue squares represent sensing resource elements, and the orange squares represent communication resource elements.

Figure 4. Analysis of the threshold effect of residual SI on communication performance.

Figure 5. CDF of communication rate under different SI levels.

Figure 6. Comparison of average algorithm performance against the consolidated Pareto Front.

Figure 7. Box plot comparison of the HV indicator.

Figure 8. Box plot comparison of the IGD indicator.

Figure 9. Algorithm Performance Comparison in a Multi-Cell Scenario.

Figure 10. Comparison of HV Performance.

Figure 11. Comparison of Average IGD Performance.

Table 1. Key simulation parameters of ISAC system.

Parameter	Value
Network Layout	Hexagonal Grid
Number of Cells	7
Inter-Site Distance	100 m
Carrier Frequency $f_{c}$	28 GHz [4]
System Bandwidth $B$	144 MHz [27]
Number of Downlink Users $L$	10 [19]
Channel Model	3GPP UMa and Rician [24]
Rician K-factor $K$	0.1
Max BS Transmit Power $p_{\max}$	40 W (46 dBm) [24]
Antenna Gains (BS/UE) $G_{bs}$/$G_{ue}$	25 dBi/5 dBi [24]
Thermal Noise Density $N_{0}$	−174 dBm/Hz [28]
User RCS $\sigma_{t}$	0.5 m² [26]
Residual Self-Interference Coeff. $\eta$	0.01 [11]
Symbols per Frame $N$	140 [27]
Min. Communication Rate Constraint $R_{\min}$	0.4 Gbps
Population Size $N_{p}$	80
Max Iterations $T_{\max}$	200
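
As a quick consistency check on the Table 1 entries, the following minimal Python sketch converts the maximum transmit power from watts to dBm and integrates the thermal noise density over the system bandwidth. The function and variable names are illustrative assumptions and are not taken from the paper's simulation platform. The noise figure is also an assumption of this sketch: it treats the noise as flat across the band and ignores any receiver noise figure, which the table does not specify.

```python
import math


def watts_to_dbm(p_watts: float) -> float:
    """Convert a power in watts to dBm (decibels relative to 1 mW)."""
    return 10.0 * math.log10(p_watts * 1e3)


def noise_power_dbm(n0_dbm_per_hz: float, bandwidth_hz: float) -> float:
    """Total thermal noise power in dBm over a flat band.

    Assumption: noise density is constant over the band and no receiver
    noise figure is added.
    """
    return n0_dbm_per_hz + 10.0 * math.log10(bandwidth_hz)


# Values copied from Table 1.
P_MAX_W = 40.0            # Max BS transmit power
N0_DBM_PER_HZ = -174.0    # Thermal noise density
BANDWIDTH_HZ = 144e6      # System bandwidth

p_max_dbm = watts_to_dbm(P_MAX_W)
noise_dbm = noise_power_dbm(N0_DBM_PER_HZ, BANDWIDTH_HZ)

print(f"p_max = {p_max_dbm:.2f} dBm")
print(f"noise power over B = {noise_dbm:.2f} dBm")
```

Under these assumptions, 40 W corresponds to roughly 46 dBm, matching the table, and the noise power over 144 MHz is roughly −92.4 dBm.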

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
