Multi-UAV-Assisted ISAC System: Joint User Association, Trajectory Design, and Resource Allocation

Wang, Jinwei; Xu, Renhui; Peng, Laixian; Wei, Xianglin

doi:10.3390/e27090967

Open Access Article


¹ The College of Communications Engineering, Army Engineering University of PLA, Nanjing 210007, China

² The 63rd Research Institute, National University of Defense Technology, Nanjing 210007, China

* Author to whom correspondence should be addressed.

Entropy 2025, 27(9), 967; https://doi.org/10.3390/e27090967

Submission received: 25 July 2025 / Revised: 15 September 2025 / Accepted: 16 September 2025 / Published: 17 September 2025

(This article belongs to the Section Information Theory, Probability and Statistics)


Abstract

Unmanned aerial vehicle (UAV)-assisted integrated sensing and communication (ISAC) systems have developed rapidly in the sixth generation (6G) era. However, factors such as the mobility of ground users and malicious jamming pose significant challenges to systems’ performance and reliability. Against this backdrop, this paper designs a multi-UAV-assisted ISAC system model under malicious jamming environments. Under the constraint of sensing accuracy, the total communication rate of the system is maximized through joint optimization of user association, UAV trajectory, and transmit power. The problem is then decomposed into three subproblems, which are solved using the improved auction algorithm (IAA), dream optimization algorithm (DOA), and rapidly-exploring random trees-based optimizer algorithm (RRTOA). The global optimal solution is approached through the alternating optimization-based predictive scheduling algorithm (AOPSA). Meanwhile, this paper also introduces a long short-term memory (LSTM) network to predict users’ dynamic positions, addressing the impact of user mobility and enhancing the system’s real-time performance. Simulation results show that compared with the baseline scheme, the proposed algorithm achieves a 188% improvement in communication rate, which verifies its effectiveness and superiority.

Keywords:

unmanned aerial vehicle; integrated sensing and communication system; malicious jamming; alternating optimization; long short-term memory; user mobility

1. Introduction

With the deep integration of wireless communication technology and intelligent sensing, integrated sensing and communication (ISAC) has become one of the core technologies for sixth generation (6G) communication systems [1,2]. By sharing resources such as spectrum and hardware, ISAC can achieve collaborative optimization of communication and sensing, significantly improving spectrum efficiency and system performance [3]. In recent years, unmanned aerial vehicles (UAVs) have demonstrated extensive potential in ISAC systems due to their flexible deployment, wide coverage, and line-of-sight (LoS) communication links [4]. Integrating ISAC modules onto UAVs can not only enhance resource utilization but also provide communication and sensing services for ground users simultaneously [5,6].

In a UAV-assisted ISAC system, UAV trajectory design, user scheduling, and resource allocation are crucial factors affecting system performance. Currently, numerous studies focus on utilizing a single UAV to provide ISAC services for users [7,8,9,10,11,12]. For instance, Liu et al. [7] proposed a joint optimization problem of user scheduling, transmit power, and UAV trajectory, aiming to maximize energy efficiency and minimize radar mutual information (MI) under the premise of satisfying user sensing fairness. Deng et al. [8] studied the UAV-assisted integrated periodic sensing and communication (IPSAC) mechanism. Under the premise of satisfying sensing frequency and beam pattern gain requirement, the authors jointly optimized UAV trajectory, user association, target sensing selection, and transmit beamforming to maximize system communication rate. Zhou et al. [9] proposed a joint optimization problem of UAV CPU frequency, UAV radar sensing power, user transmit power, and UAV trajectory, aiming to minimize the total energy consumption of the UAV and users. Liu et al. [10] jointly optimized UAV trajectory, task scheduling, and power allocation. Under the constraint of communication rate, they maximized the radar estimation rate. Zhu et al. [11] proposed a reconfigurable intelligent surface (RIS)-assisted ISAC system operating in the millimeter-wave band, which configures the radio propagation environment by altering the phase of radio signals to enhance communication transmission rates. Zhu et al. [12] investigated a simultaneous transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)-assisted ISAC dual secure communication system. By jointly designing the base station’s transmit beamforming and receive filter, along with the STAR-RIS’s transmission and reflection coefficients, the system maximizes the long-term average secrecy rate of users while ensuring the echo signal-to-noise ratio threshold and user rate constraints.

However, constrained by size and power, a single UAV has limited service capability and coverage range, making it difficult to handle scenarios where the geographical locations of communication users or targets are widely distributed. This has driven in-depth research on multi-UAV-assisted ISAC systems [13,14,15,16]. For example, Zhang et al. [13] investigated the resource allocation problem in multi-UAV-assisted ISAC systems. By jointly optimizing the UAV trajectory, user association, and beamforming design, the sum of weighted bit rates of all ground users was maximized while ensuring the sensing signal gain of targets. Qin et al. [14] proposed a joint optimization problem of user association, UAV trajectory planning, and power allocation, aiming to maximize the minimum weighted spectral efficiency among UAVs. Then, the authors introduced centralized and distributed schemes for solving the optimization problem. Wang et al. [15] considered a scenario where multiple UAVs collaboratively detect targets and transmit the collected data to a central UAV. Regarding the communication and sensing scheduling problem, the authors formulated it into two optimization problems, with the objectives of maximizing the average utility function (MAUF) and minimizing the completion time (MCT), respectively. Liu et al. [16] introduced radar MI to measure the sensing performance of ISAC from the perspective of information entropy. To achieve fair communication, the authors maximized the minimum communication rate of each Internet of Things (IoT) node through joint optimization of node scheduling, transmit power, and the three-dimensional (3D) trajectory of UAVs.

Although the aforementioned studies have made significant progress in UAV-assisted ISAC systems, most of them assume that users are in a stationary state and do not fully consider the impact of user mobility on system performance. In fact, some work has begun to focus on ISAC systems in dynamic scenarios. For example, Al-habob et al. [17] investigated a downlink multi-user ISAC system with multiple mobile eavesdropping UAVs, aiming to maximize user secrecy rates and ensure sensing quality by designing predictive beamforming schemes. However, this work did not consider user mobility and employed only a single base station to provide ISAC services. Pan et al. [18] explored a multi-UAV cooperative ISAC dynamic system, which maximizes the positioning accuracy of moving targets while ensuring communication service quality by jointly optimizing trajectories, base station association, spectrum, and power resources. Nevertheless, these studies still do not address the realistic threat of malicious jamming.

Malicious jamming is an active attack that aims to weaken and disrupt legitimate communication through various means (e.g., signal interference and electromagnetic attacks), thereby degrading communication performance. Jammers use advanced electronic jamming equipment to emit noise signals that are similar to, or within the same frequency band as, the communication signals. This severely reduces the signal-to-noise ratio at the receiver and leads to degraded or even interrupted communication. In ISAC systems, malicious jamming not only disrupts communication links but also indirectly affects sensing accuracy and exacerbates power allocation conflicts.

Currently, limited work has focused on UAV-assisted ISAC systems in malicious jamming environments. Liu et al. [19] studied a UAV-enabled ISAC system under malicious jamming attacks, which maximizes the minimum transmission rate of users by optimizing transmit beamforming while meeting sensing constraints. However, this work only considered a simple scenario with a single fixed UAV deployment, lacking an analysis of multi-UAV dynamic trajectory optimization and user mobility. Mei et al. [20] explored the minimization of age of information (AoI) in scenarios with malicious jamming UAVs, reducing AoI by jointly designing the trajectories and power of legitimate UAVs, but similarly without considering multi-UAV cooperation and user mobility. Therefore, under multiple constraints such as mobility, sensing requirements, communication guarantees, and malicious jamming, achieving efficient, robust, and dynamic trajectory planning and resource allocation for UAVs has become a critical problem that urgently needs to be addressed.

Unlike existing studies, the innovation of this paper lies in being the first to thoroughly investigate the joint optimization of user association, UAV trajectory design, and power allocation in multi-UAV-assisted ISAC systems under the complex scenario of coexisting dynamic users and malicious jamming attacks. This paper designs a multi-UAV-assisted ISAC system in a malicious jamming environment, where multiple UAVs provide downlink communication services while sensing environmental information around ground users and dynamically optimizing their trajectories to mitigate the impact of malicious jamming. Additionally, we introduce a long short-term memory (LSTM) network to predict user positions, thereby addressing the challenges posed by mobility. The main contributions of this paper are summarized as follows:

We formulate a joint optimization problem over user association, UAV trajectory, and transmit power that maximizes the total communication rate of all users under a sensing accuracy constraint. Because the problem is non-convex, we propose an alternating optimization-based predictive scheduling algorithm (AOPSA) to solve it.
We decompose the optimization problem into three subproblems, which are solved using the improved auction algorithm (IAA), dream optimization algorithm (DOA), and rapidly-exploring random trees-based optimizer algorithm (RRTOA).
An LSTM network is introduced to capture the temporal characteristics of user positions, and it is combined with AOPSA to enhance the system’s real-time performance and optimization efficiency.
A series of simulations are conducted to evaluate the performance of AOPSA. Simulation results demonstrate that LSTM can effectively predict user positions. Moreover, AOPSA outperforms four benchmarks with diverse parameter settings.

The remainder of this paper is organized as follows. Section 2 describes the multi-UAV-assisted ISAC system model under malicious jamming environments. In Section 3, we propose the joint optimization problem of user association, UAV trajectory, and UAV transmit power. Section 4 provides a detailed introduction to the proposed optimization algorithms. Section 5 presents the simulation results of the proposed algorithms and discusses these results. Finally, Section 6 concludes the paper.

2. System Model

We consider a multi-UAV-assisted ISAC system under malicious jamming environments, as shown in Figure 1. The system model includes the following: a central UAV, denoted as ${UAV}_{0}$; $M$ edge UAVs equipped with communication and sensing modules, denoted as ${UAV}_{m}$, $m \in \mathcal{M} = \{1, 2, \dots, M\}$; $K$ ground mobile users, denoted as ${UE}_{k}$, $k \in \mathcal{K} = \{1, 2, \dots, K\}$; and a static malicious jammer. In this system, ${UAV}_{0}$ provides global management and control, while the edge UAVs perform ISAC services for the users by sensing the environmental information around the users and sending the sensing information back to them [16]. The jammer operates in a fixed-frequency mode, continuously emitting jamming signals within the primary communication frequency band (e.g., the ISAC shared band) to disrupt the communication link between edge UAVs and users. Such jammers are typically stationary (e.g., ground-based fixed jamming stations) and transmit at a constant power, generating strong interference via high-gain antennas. Due to the distance-dependent attenuation of wireless signal propagation, the jamming intensity experienced by a user decreases as its distance from the jammer increases. Based on feedback regarding the jamming intensity received by ground users, the UAV can perform radar detection scans over suspected jammer locations to estimate the jammer's approximate position [21,22].

Assume ${UAV}_{0}$ hovers directly above the center of the area at a fixed altitude of $H_{0}$. Edge UAVs fly at a fixed altitude $H$, and the total flying time $T_{total}$ is divided into $T$ time slots of length $δ_{t} = T_{total} / T$. The slot length $δ_{t}$ should be sufficiently small to ensure that the sensing and communication channel state information (CSI) remains nearly constant within a slot. We consider a 3D Cartesian coordinate system. At each time slot $t \in \mathcal{T} = \{1, 2, \dots, T\}$, the coordinates of ${UAV}_{m}$ can be expressed as $q_{m} (t) = {[x_{m} (t), y_{m} (t), H]}^{T}$, the coordinates of ${UE}_{k}$ can be expressed as $q_{k} (t) = {[x_{k} (t), y_{k} (t), 0]}^{T}$, and the jammer's coordinates can be expressed as $q_{J} = {[x_{J}, y_{J}, 0]}^{T}$. The motion constraints of ${UAV}_{m}$ can be expressed as

$∥ q_{m} (t + 1) - q_{m} (t) ∥ \leq v_{max} δ_{t}, \forall t$

(1)

$∥ q_{m} (t) - q_{n} (t) ∥ \geq D_{min}, \forall t, m \neq n$

(2)

where $∥ \cdot ∥$ denotes the $L_{2}$ norm, $v_{max}$ is the maximum flying speed of edge UAVs, and $D_{min}$ represents the minimum collision avoidance distance between edge UAVs.
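As a minimal sketch, constraints (1) and (2) can be checked for a candidate trajectory as follows. The function name `trajectory_is_feasible`, the nested-list layout `traj[m][t]`, and the small numerical tolerance are illustrative assumptions, not part of the system model.

```python
import math


def dist(a, b):
    """Euclidean (L2) distance between two coordinate tuples."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def trajectory_is_feasible(traj, v_max, delta_t, d_min, tol=1e-9):
    """Check constraints (1) and (2).

    traj[m][t] is the (x, y, H) position of edge UAV m in slot t
    (hypothetical layout chosen for this sketch).
    """
    num_uavs = len(traj)
    num_slots = len(traj[0])
    # Constraint (1): displacement between consecutive slots is bounded
    # by the maximum speed times the slot length.
    for m in range(num_uavs):
        for t in range(num_slots - 1):
            if dist(traj[m][t + 1], traj[m][t]) > v_max * delta_t + tol:
                return False
    # Constraint (2): every pair of edge UAVs keeps the minimum
    # collision avoidance distance in every slot.
    for t in range(num_slots):
        for m in range(num_uavs):
            for n in range(m + 1, num_uavs):
                if dist(traj[m][t], traj[n][t]) < d_min - tol:
                    return False
    return True
```

Both checks are plain pairwise comparisons, so the cost of one feasibility test grows linearly with the number of slots and quadratically with the number of edge UAVs.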

To prevent mutual interference between radar signals and communication signals, the UAVs employ the time-division multiplexing (TDM) technique for radar detection and communication [16]. As depicted in Figure 2, each time slot is divided into two subtime slots by the allocation weight parameter ℓ. One subtime slot is allocated for the sensing task, while the other subtime slot is designated for the communication task.

We assume that the air-to-ground communication channel mainly experiences Nakagami-m fading. Therefore, the communication channel gain from ${UAV}_{m}$ to ${UE}_{k}$ at time slot $t$ can be expressed as [16,23]

$h_{m k}^{com} (t) = \partial_{m k} (t) \frac{G_{s} G_{r} λ^{2}}{{(4 π)}^{2} d_{m k}^{2} (t)} = \frac{\partial_{m k} (t) β_{com}}{d_{m k}^{2} (t)}$

(3)

where $\partial_{m k} (t)$ represents the small-scale fading coefficient following the Nakagami-m distribution, $G_{s}$ denotes the antenna gain of the transmitter of ${UAV}_{m}$, $G_{r}$ denotes the antenna gain of the receiver of ${UE}_{k}$, $d_{m k} (t) = ∥ q_{m} (t) - q_{k} (t) ∥$ represents the distance between ${UAV}_{m}$ and ${UE}_{k}$, $λ$ is the signal wavelength, and $β_{com} = (G_{s} G_{r} λ^{2}) / {(4 π)}^{2}$. Considering the transmit and return links of the radar signal, the channel power gain of the radar detection link between ${UAV}_{m}$ and ${UE}_{k}$ can be expressed as

$h_{m k}^{rad} (t) = \frac{G_{s} G_{e} λ^{2} σ}{{(4 π)}^{3} d_{m k}^{4} (t)} = \frac{β_{rad}}{d_{m k}^{4} (t)}$

(4)

where $G_{e}$ denotes the antenna gain of the radar receiver of ${UAV}_{m}$, $σ$ represents the radar cross section (RCS) of the target, and $β_{rad} = (G_{s} G_{e} λ^{2} σ) / {(4 π)}^{3}$. Considering that the ground-to-ground communication channel mainly experiences Rayleigh fading, the jamming channel gain from the malicious jammer to ${UE}_{k}$ can be expressed as [24]

$h_{J k}^{jam} (t) = \frac{G_{J} G_{r} λ^{2}}{{(4 π)}^{2} d_{J k}^{2} (t)} ψ = \frac{β_{jam}}{d_{J k}^{2} (t)} ψ$

(5)

where $G_{J}$ denotes the antenna gain of the transmitter of the jammer, $d_{J k} (t) = ∥ q_{J} - q_{k} (t) ∥$ represents the distance between the jammer and ${UE}_{k}$, $ψ$ is a random variable following the exponential distribution with unit mean, and $β_{jam} = (G_{J} G_{r} λ^{2}) / {(4 π)}^{2}$.
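The three gains in Equations (3)-(5) can be sketched in a few lines of Python. All function names are hypothetical. The sketch also assumes that the small-scale coefficient in Equation (3) acts as a unit-mean power gain, drawn as the square of a Nakagami-m amplitude (a Gamma variable with shape $m$ and scale $1/m$); the text does not state whether the coefficient is an amplitude or a power term.

```python
import math
import random


def beta_com(g_s, g_r, wavelength):
    """Constant of Eq. (3); the same form gives beta_jam with G_J in place of G_s."""
    return g_s * g_r * wavelength ** 2 / (4 * math.pi) ** 2


def beta_rad(g_s, g_e, wavelength, rcs):
    """Constant of Eq. (4), including the radar cross section."""
    return g_s * g_e * wavelength ** 2 * rcs / (4 * math.pi) ** 3


def com_gain(fading, b_com, d):
    """Eq. (3): communication gain decays with the squared distance."""
    return fading * b_com / d ** 2


def rad_gain(b_rad, d):
    """Eq. (4): two-way radar gain decays with the fourth power of distance."""
    return b_rad / d ** 4


def jam_gain(b_jam, d, psi):
    """Eq. (5): jamming gain with Rayleigh power fading psi ~ Exp(1)."""
    return b_jam * psi / d ** 2


def nakagami_power_sample(m_shape, rng):
    """Assumed fading model: squared Nakagami-m amplitude with unit mean power."""
    return rng.gammavariate(m_shape, 1.0 / m_shape)


def rayleigh_power_sample(rng):
    """Unit-mean exponential variable, as stated for psi in Eq. (5)."""
    return rng.expovariate(1.0)
```

The different path-loss exponents are the practical point here: doubling the UAV-user distance cuts the communication gain by a factor of 4 but the radar gain by a factor of 16, so the sensing constraint tends to restrict UAV placement more tightly than the communication link does.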

In this work, the UAVs communicate with users via orthogonal frequency division multiple access (OFDMA). At time slot $t$, the signal-to-interference-plus-noise ratio (SINR) of the communication link between ${UAV}_{m}$ and ${UE}_{k}$ can be expressed as

$γ_{m k}^{com} (t) = \frac{p_{m} (t) h_{m k}^{com} (t)}{\sum_{l = 1, l \neq m}^{M} p_{l} (t) h_{l k}^{com} (t) + σ_{noise}^{2} + p_{J} h_{J k}^{jam} (t)}$

(6)

where $p_{m} (t)$ denotes the transmit power of ${UAV}_{m}$ and satisfies $0 \leq p_{m} (t) \leq p_{max}$, $p_{max}$ is the maximum transmit power of edge UAVs, $σ_{noise}^{2}$ is the additive white Gaussian noise (AWGN) power, and $p_{J}$ represents the transmit power of the jammer. The SINR of the sensing link between ${UAV}_{m}$ and ${UE}_{k}$ can be expressed as

$γ_{m k}^{rad} (t) = \frac{p_{m} (t) h_{m k}^{rad} (t)}{\sum_{l = 1, l \neq m}^{M} p_{l} (t) h_{l k}^{rad} (t) + σ_{noise}^{2}}$

(7)

To ensure the sensing accuracy of edge UAVs, the SINR of the sensing link needs to satisfy the constraint $γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}$ [25], where $γ_{t h}^{rad}$ represents the minimum SINR for accurate radar sensing.

We introduce a binary integer variable $α_{m k} (t)$ to represent user association. If $α_{m k} (t) = 1$, ${UAV}_{m}$ and ${UE}_{k}$ are associated at time slot $t$; otherwise, $α_{m k} (t) = 0$. Each user can associate with at most one UAV in each time slot. These conditions can be expressed as

$α_{m k} (t) = \begin{cases} 1, & \text{if } {UAV}_{m} \text{ and } {UE}_{k} \text{ are associated} \\ 0, & \text{otherwise} \end{cases}$

(8)

$\sum_{m = 1}^{M} α_{m k} (t) \leq 1, \forall k, t$

(9)

At time slot $t$, the communication rate from ${UAV}_{m}$ to ${UE}_{k}$ can be expressed as

$R_{m k}^{com} (t) = {\log}_{2} (1 + γ_{m k}^{com} (t))$

(10)

Therefore, the total communication rate of ${UE}_{k}$ can be expressed as

$R_{k}^{com} = \sum_{t = 1}^{T} \sum_{m = 1}^{M} α_{m k} (t) (1 - ℓ) R_{m k}^{com} (t)$

(11)

where $1 - ℓ$ is the share of each time slot allocated to the communication task. The total communication rate of all users can be expressed as

$R_{sum}^{com} = \sum_{k = 1}^{K} R_{k}^{com}$

(12)
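To make the chain from Equation (6) to Equation (12) concrete, the following sketch evaluates the communication SINR and the resulting sum rate for a single time slot. The function names, the list-of-lists layout `h_com[m][k]`, and the parameter name `ell` (the slot allocation weight) are assumptions made for illustration only.

```python
import math


def com_sinr(m, k, p, h_com, h_jam, p_jam, noise):
    """Eq. (6): SINR of the link from UAV m to user k in one slot.

    p[l]        transmit power of UAV l
    h_com[l][k] communication gain from UAV l to user k
    h_jam[k]    jamming gain from the jammer to user k
    """
    interference = sum(p[l] * h_com[l][k] for l in range(len(p)) if l != m)
    return p[m] * h_com[m][k] / (interference + noise + p_jam * h_jam[k])


def sum_rate_one_slot(alpha, p, h_com, h_jam, p_jam, noise, ell):
    """Inner sums of Eqs. (11)-(12) for one slot.

    alpha[m][k] is the binary association variable; (1 - ell) is the
    share of the slot spent on communication.
    """
    total = 0.0
    for m in range(len(p)):
        for k in range(len(h_jam)):
            if alpha[m][k]:
                gamma = com_sinr(m, k, p, h_com, h_jam, p_jam, noise)
                total += (1.0 - ell) * math.log2(1.0 + gamma)  # Eq. (10)
    return total
```

Summing the returned value over all slots gives $R_{sum}^{com}$. Note that the jamming term enters only the denominator of Equation (6), so moving a UAV closer to a jammed user is the main lever the trajectory has against it.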

3. Problem Formulation

Our goal is to maximize the total communication rate of all users through the joint optimization of user association, UAV trajectory, and UAV transmit power. The variables of user association, UAV trajectory, and transmit power are defined as $α = \{α_{m k} (t), \forall m, k, t\}$, $q = \{q_{m} (t), \forall m, t\}$, and $p = \{p_{m} (t), \forall m, t\}$. The joint optimization problem can be formulated as follows

$\begin{array}{lll}
P0: & \max_{α, q, p} \; R_{sum}^{com} & (13) \\
s.t. & \sum_{m = 1}^{M} α_{m k} (t) \leq 1, \forall k, t & (13a) \\
& α_{m k} (t) \in \{0, 1\}, \forall m, k, t & (13b) \\
& γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}, \forall m, k, t & (13c) \\
& 0 \leq p_{m} (t) \leq p_{max}, \forall m, t & (13d) \\
& x_{min} \leq x_{m} (t) \leq x_{max}, \forall m, t & (13e) \\
& y_{min} \leq y_{m} (t) \leq y_{max}, \forall m, t & (13f) \\
& ∥ q_{m} (t + 1) - q_{m} (t) ∥ \leq v_{max} δ_{t}, \forall t & (13g) \\
& ∥ q_{m} (t) - q_{n} (t) ∥ \geq D_{min}, \forall t, m \neq n & (13h) \\
& p_{m} (t) \cdot δ_{t} + κ \cdot ∥ q_{m} (t + 1) - q_{m} (t) ∥ \leq E_{max} / T, \forall m \in \mathcal{M}, t \in \mathcal{T} & (13i)
\end{array}$

where (13a) and (13b) are user association constraints, (13c) is the sensing accuracy constraint, (13d) is the UAV transmit power constraint, (13e) and (13f) are the UAV horizontal position constraints, (13g) and (13h) are the UAV motion trajectory constraints, and (13i) is the UAV energy consumption constraint, in which $κ$ is the flight energy consumption coefficient per unit distance and $E_{max}$ is the maximum battery capacity of the UAV.

4. Algorithm Design

Problem $P0$ is a typical non-convex mixed integer nonlinear programming (MINLP) problem, which is challenging to solve directly. On the one hand, the user association optimization variable $α$ is a binary integer. On the other hand, even with fixed user association, the problem remains non-convex with respect to the UAV trajectory $q$ and transmit power $p$. Therefore, to solve problem $P0$, we decompose the original optimization problem into three subproblems: user association optimization, UAV trajectory optimization, and UAV transmit power optimization. We then solve these three subproblems through alternating iterations to obtain the solution of the original problem. Before introducing the algorithms, we first present the specific mechanism of LSTM for predicting user positions.

4.1. LSTM Predicting User Positions

We assume that ground users follow a random walk model and that their current position information can be obtained through a global positioning system. The position information of $K$ users over the past $S$ time slots can be represented as

$I_{history} = \begin{bmatrix} q_{1} (t_{1}) & q_{2} (t_{1}) & \dots & q_{K} (t_{1}) \\ q_{1} (t_{2}) & q_{2} (t_{2}) & \dots & q_{K} (t_{2}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ q_{1} (t_{S}) & q_{2} (t_{S}) & \dots & q_{K} (t_{S}) \end{bmatrix}$

(14)

Based on $I_{history}$, we use an LSTM network to predict future user positions. The LSTM network is a type of recurrent neural network suited to processing and predicting sequential data or time series. Figure 3 illustrates the composition and functions of the LSTM network unit.

The first part is generally referred to as the forget gate, which determines whether information from the previous time step should be forgotten; the second part is the input gate, through which new information enters the network; the third part is the output gate, which passes information to the next unit. The LSTM network can capture the relevant features of a time series, such as the temporal correlation between a user's position information and service changes within a certain period of time, and thereby predict the user's position at the next moment. The dataset is generated by simulating the random walk trajectories of $K$ users within the task area. In each time slot, users move in a random direction with inertia. The initial direction of each user is uniformly distributed within $[0, 2 π]$, and the turning angle per step is constrained to $\pm π / 6$. Boundary reflection is implemented to ensure that users remain within the task area.
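A minimal sketch of such a trajectory generator is given below. The text does not specify the step length, the distribution of the turning angle inside its bounds, or the initial position, so this sketch assumes a fixed step per slot, a turning angle drawn uniformly from $[-π/6, π/6]$, and a uniformly distributed starting point. The names `reflect` and `simulate_user` are hypothetical.

```python
import math
import random


def reflect(value, low, high):
    """Mirror a coordinate back into [low, high] (requires low < high).

    Returns the reflected value and whether the direction was flipped.
    """
    flipped = False
    while value < low or value > high:
        value = 2 * low - value if value < low else 2 * high - value
        flipped = not flipped
    return value, flipped


def simulate_user(num_slots, step, bounds, rng):
    """Random walk with inertia for one user; returns a list of (x, y)."""
    x_min, x_max, y_min, y_max = bounds
    x = rng.uniform(x_min, x_max)              # assumed initial position
    y = rng.uniform(y_min, y_max)
    heading = rng.uniform(0.0, 2.0 * math.pi)  # initial direction in [0, 2*pi]
    track = [(x, y)]
    for _ in range(num_slots - 1):
        # Inertia: the heading changes by at most pi/6 per step.
        heading += rng.uniform(-math.pi / 6, math.pi / 6)
        x, flip_x = reflect(x + step * math.cos(heading), x_min, x_max)
        y, flip_y = reflect(y + step * math.sin(heading), y_min, y_max)
        # Boundary reflection also mirrors the heading.
        if flip_x:
            heading = math.pi - heading
        if flip_y:
            heading = -heading
        track.append((x, y))
    return track
```

Calling `simulate_user` once per user and stacking the last $S$ positions row by row yields a matrix with the layout of $I_{history}$ in Equation (14).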

We set the time interval $T_{s}$ as the cycle for adjusting the optimization scheme. The flowchart of the alternating optimization scheme based on user position prediction is shown in Figure 4.

In this figure, $R_{max}$ represents the objective function value obtained using the optimal optimization scheme for time slot $t + T_{s}$, $R (t + T_{s})$ represents the objective function value obtained at time slot $t + T_{s}$ while still using the optimal optimization scheme for time slot $t$, and $θ$ is the adjustment threshold, for example 95%.
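One plausible reading of this threshold test is sketched below. The comparison rule itself (keep the current scheme while it still achieves at least a fraction $θ$ of $R_{max}$) is an assumption inferred from the definitions above, since the flowchart in Figure 4 is not reproduced in the text. The function name is hypothetical.

```python
def keep_current_scheme(rate_old_scheme, rate_max, theta=0.95):
    """Assumed decision rule for the adjustment cycle T_s.

    rate_old_scheme: R(t + T_s), the objective at t + T_s when the scheme
                     optimized for slot t is kept unchanged
    rate_max:        R_max, the objective of the scheme optimized for
                     the predicted positions at t + T_s
    theta:           adjustment threshold (95% in the example above)
    """
    return rate_old_scheme >= theta * rate_max
```

Under this reading, the scheme is re-optimized only when the predicted loss exceeds $1 - θ$ of the achievable rate, which avoids recomputing trajectories and power for small user movements.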

For the LSTM prediction model, the input size is $I_{history}$. Let $ξ$ denote the output size and $h$ denote the number of units in the hidden layer of the network. According to the connection relationships between neurons, the computational complexity of LSTM can be expressed as $O (4 h I_{history} + 4 h^{2} + 3 h + h ξ)$ [26]. We train the LSTM prediction model offline and then use the trained model when solving the optimization problem to ensure real-time performance.

4.2. Determine the Initial Positions of UAVs with Kmeans++

Based on the positions of all users at the initial time slot, we use the Kmeans++ algorithm [27] to determine the initial positions of the UAVs, as shown in Algorithm 1.

Algorithm 1 Kmeans++ Algorithm

Input: Users’ initial positions
Output: UAVs’ initial positions

1: Randomly select a user as the first center $c_{1}$;
2: Calculate the shortest distance $d_{k}$ from ${UE}_{k}$ to the selected center set $C$;
3: Select a new user as the next center with probability $d_{k}^{2} / \sum_{k = 1}^{K} d_{k}^{2}$;
4: Repeat steps 2–3 until $C = \{c_{1}, c_{2}, \dots, c_{M}\}$;
5: Assign each user to the nearest center, forming $M$ clusters;
6: Calculate the average position of users in each cluster as the new center;
7: Repeat steps 5–6 until the cluster centers are stable.
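The steps of Algorithm 1 can be sketched in plain Python as follows. This is an illustrative implementation under stated assumptions, not the authors' code: users are 2D tuples, there are at least $M$ distinct user positions, an empty cluster keeps its previous center, and the iteration cap `max_iter` is added as a safeguard. All function names are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two 2D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def kmeans_pp_seed(users, num_uavs, rng):
    """Steps 1-4: pick initial centers with probability proportional to d_k^2."""
    centers = [rng.choice(users)]
    while len(centers) < num_uavs:
        weights = [min(sq_dist(u, c) for c in centers) for u in users]
        centers.append(rng.choices(users, weights=weights, k=1)[0])
    return centers


def initial_uav_positions(users, num_uavs, rng, max_iter=100):
    """Steps 5-7: iterate assignment and averaging until centers are stable."""
    centers = kmeans_pp_seed(users, num_uavs, rng)
    for _ in range(max_iter):
        clusters = [[] for _ in centers]
        for u in users:
            nearest = min(range(len(centers)), key=lambda j: sq_dist(u, centers[j]))
            clusters[nearest].append(u)
        new_centers = []
        for j, members in enumerate(clusters):
            if members:
                new_centers.append((sum(p[0] for p in members) / len(members),
                                    sum(p[1] for p in members) / len(members)))
            else:
                new_centers.append(centers[j])  # assumption: keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return centers
```

The returned centers are horizontal coordinates only; each edge UAV would then be placed at its center at the fixed altitude $H$.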

4.3. User Association Optimization

For given $q$ and $p$, the user association optimization subproblem can be formulated as

$\begin{array}{lll}
P1: & \max_{α} \; R_{sum}^{com} & (15) \\
s.t. & \sum_{m = 1}^{M} α_{m k} (t) \leq 1, \forall k, t & (15a) \\
& α_{m k} (t) \in \{0, 1\}, \forall m, k, t & (15b) \\
& γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}, \forall m, k, t & (15c)
\end{array}$

The traditional auction algorithm is difficult to adapt to many-to-many association requirements. Therefore, we propose an IAA based on a dynamic pricing mechanism to solve $P1$, maximizing user rates while ensuring sensing service quality. The process of solving $P1$ with IAA is described as follows:

We define the price of ${UAV}_{m}$ as $ψ_{m}^{price}$, initialize the prices of all UAVs as ${[ψ_{1}^{price}, ψ_{2}^{price}, \dots, ψ_{M}^{price}]}^{T} \leftarrow 0$, and initialize the set of unassigned users as $U \leftarrow \{1, 2, \dots, K\}$.

For each unassigned ${UE}_{k} \in U$:

Select the set of valid UAVs that satisfy the sensing constraint

$M_{k} = \{m \mid γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}\}$

(16)

Calculate the net benefit that each ${UAV}_{m} \in M_{k}$ offers to ${UE}_{k}$

$b_{m k} = R_{m k}^{com} (t) - ψ_{m}^{price}$

(17)

Select the UAV with the maximum net benefit

$m^{*} = \arg \max_{m \in M_{k}} b_{m k}$

(18)

Update the price of ${UAV}_{m^{*}}$, with the update mechanism as follows

$b_{m k}^{(2)} = \max_{m \in M_{k}, m \neq m^{*}} b_{m k}$

(19)

$Δ p_{m^{*}} = (b_{m^{*} k} - b_{m k}^{(2)}) + ε_{basic} \cdot (1 + \frac{b_{m^{*} k} - b_{m k}^{(2)}}{b_{m^{*} k} + ε_{0}})$

(20)

where $b_{m k}^{(2)}$ is the second-best net benefit, $Δ p_{m^{*}}$ is the price increment added to $ψ_{m^{*}}^{price}$, $ε_{basic}$ is the basic step size, and $ε_{0}$ is a small constant that prevents division by zero.

The detailed process of solving $P1$ with IAA is shown in Algorithm 2. The computational complexity of IAA is determined mainly by the number of UAVs and users: each of the $K$ users is processed once and evaluates at most $M$ candidate UAVs, so the total computational complexity of IAA is $O (K M)$. The algorithm coordinates resource competition among multiple UAVs through the dynamic price increment $Δ p_{m^{*}}$. Its distributed characteristics and low computational complexity make it well suited to real-time resource scheduling in large-scale multi-UAV networks.

Algorithm 2 Improved Auction Algorithm (IAA)

Input: $γ_{m k}^{rad} (t)$, $R_{m k}^{com} (t)$, $γ_{t h}^{rad}$, $ε_{basic}$, $q$, $p$
Output: $α_{best}$
Initialize: $U \leftarrow \{1, 2, \dots, K\}$, ${[ψ_{1}^{price}, ψ_{2}^{price}, \dots, ψ_{M}^{price}]}^{T} \leftarrow 0$

1: while $U \neq ∅$ do
2: Randomly select ${UE}_{k} \in U$ and filter $M_{k}$ based on Equation (16);
3: if $M_{k} = ∅$ then
4: Update $U = U ∖ \{k\}$;
5: else
6: Select UAV $m^{*}$ based on Equations (17) and (18);
7: Update $α_{m^{*} k} = 1$, $U = U ∖ \{k\}$;
8: Update the price of UAV $m^{*}$ based on Equations (19) and (20);
9: end if
10: end while
11: return $α_{best}$.
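Algorithm 2 can be sketched for a single time slot as shown below. The function name, the argument layout (`rad_sinr[m][k]` and `rate[m][k]` precomputed from the given $q$ and $p$), and the default value of `eps_0` are illustrative assumptions. The text does not say how the price changes when a user has only one valid UAV, in which case the maximum in Equation (19) runs over an empty set; this sketch then raises the price by the basic step size only, which is also an assumption.

```python
import random


def improved_auction(rad_sinr, rate, gamma_th, eps_basic, eps_0=1e-9, rng=None):
    """One-slot sketch of the improved auction algorithm (IAA).

    Returns the association matrix alpha[m][k] and the final UAV prices.
    """
    rng = rng or random.Random()
    num_uavs, num_users = len(rate), len(rate[0])
    price = [0.0] * num_uavs
    alpha = [[0] * num_users for _ in range(num_uavs)]
    unassigned = list(range(num_users))
    while unassigned:
        # Line 2: pick a random unassigned user and remove it from U.
        k = unassigned.pop(rng.randrange(len(unassigned)))
        # Eq. (16): UAVs that satisfy the sensing constraint for user k.
        valid = [m for m in range(num_uavs) if rad_sinr[m][k] >= gamma_th]
        if not valid:
            continue  # lines 3-4: user k stays unserved in this slot
        # Eq. (17): net benefit of each valid UAV at current prices.
        benefit = {m: rate[m][k] - price[m] for m in valid}
        # Eq. (18): UAV with the maximum net benefit.
        best = max(valid, key=lambda m: benefit[m])
        alpha[best][k] = 1
        others = [benefit[m] for m in valid if m != best]
        if others:
            gap = benefit[best] - max(others)  # uses Eq. (19)
            # Eq. (20): dynamic price increment.
            price[best] += gap + eps_basic * (1.0 + gap / (benefit[best] + eps_0))
        else:
            price[best] += eps_basic  # assumption: no competitor, basic step only
    return alpha, price
```

Because users are drawn in random order and prices rise after each assignment, the resulting association can depend on the order of selection. Each user is visited once and scans at most $M$ UAVs, which matches the $O (K M)$ complexity stated above.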

4.4. UAV Trajectory Optimization

For given $α$ and $p$, the UAV trajectory optimization subproblem can be formulated as

$\begin{array}{lll}
P2: & \max_{q} \; R_{sum}^{com} & (21) \\
s.t. & γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}, \forall m, k, t & (21a) \\
& x_{min} \leq x_{m} (t) \leq x_{max}, \forall m, t & (21b) \\
& y_{min} \leq y_{m} (t) \leq y_{max}, \forall m, t & (21c) \\
& ∥ q_{m} (t + 1) - q_{m} (t) ∥ \leq v_{max} δ_{t}, \forall t & (21d) \\
& ∥ q_{m} (t) - q_{n} (t) ∥ \geq D_{min}, \forall t, m \neq n & (21e) \\
& p_{m} (t) \cdot δ_{t} + κ \cdot ∥ q_{m} (t + 1) - q_{m} (t) ∥ \leq E_{max} / T, \forall m \in \mathcal{M}, t \in \mathcal{T} & (21f)
\end{array}$

We use the DOA [28] to solve $P2$. DOA is a novel meta-heuristic algorithm inspired by human dream behavior, which simulates the processes of memory retention, forgetting, and self-organization in dreams. By integrating memory, forgetting, and supplementary strategies, it balances global exploration and local exploitation. The memory strategy serves as the foundation of the algorithm, resetting individuals to the previous optimal values within the group or population. It is followed by the forgetting and supplementary strategy, which facilitates a gradual shift from exploration to exploitation. In the exploration stage, global optimization is enhanced through grouping, while in the exploitation stage, the entire population collaborates to improve local optimization. Reference [28] reports that DOA outperforms algorithms such as particle swarm optimization (PSO) and successive convex approximation (SCA) in terms of convergence, stability, and overall performance. The process of solving $P2$ with DOA is described as follows.

4.4.1. Optimization Algorithm Assumptions

Based on the characteristics of human dreams and knowledge of optimization algorithms, we summarize the following four hypotheses:

The quality of dreams can be evaluated using fitness values.

The initiation of dreams is closely related to the foundation of existing memories.

Humans partially forget existing memories and supplement the forgotten parts with logically self-organized information.

Memory ability varies among individuals or groups and exhibits a certain degree of randomness.

These hypotheses help us better understand the principles of the algorithm. The workflow, the exploration phase, and the various strategies in the exploitation phase of the algorithm are concrete embodiments of these four hypotheses.

4.4.2. Initialization

Based on the optimal UAV positions obtained from the last alternating optimization, $q^{last} = [x_{1}^{last}, y_{1}^{last}, x_{2}^{last}, y_{2}^{last}, \dots, x_{M}^{last}, y_{M}^{last}]$, we initialize the population $Q$ and ensure that each individual in $Q$ satisfies constraints (21a)–(21f). Population $Q$ can be expressed as

$Q = \begin{bmatrix} q_{1} \\ q_{2} \\ ⋮ \\ q_{N_{1}} \end{bmatrix} = \begin{bmatrix} x_{1, 1} & y_{1, 1} & x_{2, 1} & y_{2, 1} & \dots & x_{M, 1} & y_{M, 1} \\ x_{1, 2} & y_{1, 2} & x_{2, 2} & y_{2, 2} & \dots & x_{M, 2} & y_{M, 2} \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ x_{1, N_{1}} & y_{1, N_{1}} & x_{2, N_{1}} & y_{2, N_{1}} & \dots & x_{M, N_{1}} & y_{M, N_{1}} \end{bmatrix}$

(22)

where $N_{1}$ is the number of individuals and $q_{i}$ represents the $i$th individual in $Q$. Each individual represents the horizontal coordinates of all UAVs, i.e., $(x_{m, i}, y_{m, i})$ represents the horizontal coordinates of the $m$th UAV in the $i$th individual. The population is initialized as follows

$x_{m, i} = x_{m}^{last} + r \times (x_{max} - x_{min})$

(23)

$y_{m, i} = y_{m}^{last} + r \times (y_{max} - y_{min})$

(24)

where $r$ is a random number between 0 and 1.

4.4.3. Fitness Function

Transforming constraints (21a)–(21f) into penalty function terms, the fitness function can be expressed as

F_{fitness} (q_{i}) = R_{sum}^{com} - ϖ_{1} \sum_{v \in V_{q}} Φ_{v}

(25)

where $Φ_{v}$ is the constraint violation indicator function, $ϖ_{1}$ is the penalty coefficient, and $V_{q}$ is the set of constraints (21a)–(21f).
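A penalty-based fitness of the form in Equation (25) can be sketched as below. The function name and the convention that the indicator equals 1 for a violated constraint and 0 otherwise are assumptions made for illustration.

```python
def penalized_fitness(rate_sum, violations, penalty_coeff):
    """Eq. (25)-style fitness: objective minus penalty times violation count.

    rate_sum: value of the total communication rate for this individual.
    violations: iterable of booleans, True where a constraint is violated
                (assumed meaning of the indicator function).
    penalty_coeff: the penalty coefficient.
    """
    n_violated = sum(1 for violated in violations if violated)
    return rate_sum - penalty_coeff * n_violated
```

With a sufficiently large penalty coefficient, any infeasible individual scores below every feasible one, which is the usual reason for choosing this form.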

4.4.4. Exploration Phase $(0 < u \leq u_{d})$

We divide the population into five groups according to differences in memory ability. The number of forgetting dimensions $f_{F}$ for each group can be expressed as

f_{F} = ϑ_{randi} \left( \left\lceil \frac{D_{dim}}{8 F} \right\rceil, \max \left\{ 2, \left\lceil \frac{D_{dim}}{3 F} \right\rceil \right\} \right)

(26)

where $F = 1, 2, 3, 4, 5$ represents the group number, $ϑ_{randi} (a, b)$ represents a random integer selected from a to b, and $D_{dim} = 2 M$ represents the number of dimensions.
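As a small worked illustration of Equation (26), the sketch below computes the sampling range and draws the number of forgetting dimensions for a group. The function names are hypothetical, and treating the random integer as inclusive at both ends is an assumption.

```python
import math
import random


def forgetting_bounds(group, d_dim):
    """Lower and upper limits of the random integer in Eq. (26)."""
    low = math.ceil(d_dim / (8 * group))
    high = max(2, math.ceil(d_dim / (3 * group)))
    return low, high


def forgetting_dims(group, d_dim, rng=None):
    """Draw f_F for group F (assumed inclusive on both ends)."""
    rng = rng or random.Random()
    low, high = forgetting_bounds(group, d_dim)
    return rng.randint(low, high)
```

For $M = 5$ UAVs ($D_{dim} = 10$), group 1 draws from 2 to 4 dimensions while group 5 draws from 1 to 2, so higher-numbered groups forget fewer dimensions per iteration.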

Memory Strategy: Each iteration is regarded as a dream behavior. Before each iteration, individuals in each group reset their positions to the best individual from the previous iteration within the group. The update formula can be expressed as

q_{i}^{u + 1} = q_{{best}_{F}}^{u}

(27)

where

q_{i}^{u + 1}

represents the ith individual at iteration

u + 1

, and

q_{{best}_{F}}^{u}

represents the best individual in group F at iteration u.

Forgetting and supplementary strategy: Individuals in each group randomly forget information in certain dimensions while dreaming. The forgetting and supplementary strategy follows the memory strategy, allowing individuals to forget and self-supplement position information in the forgotten dimensions. The update formula can be expressed as

x_{m, i}^{u + 1} = x_{m, {best}_{F}}^{u} + \frac{1}{2} (x_{m}^{last} + r \times (x_{max} - x_{min})) \times \left( \cos \left( π \times \frac{u + u_{max} - u_{d}}{u_{max}} \right) + 1 \right)

(28)

y_{m, i}^{u + 1} = y_{m, {best}_{F}}^{u} + \frac{1}{2} (y_{m}^{last} + r \times (y_{max} - y_{min})) \times \left( \cos \left( π \times \frac{u + u_{max} - u_{d}}{u_{max}} \right) + 1 \right)

(29)

where $x_{m, i}^{u + 1}$ and $y_{m, i}^{u + 1}$ represent the x and y coordinates of the $m$th UAV of the $i$th individual at iteration $u + 1$, respectively. $x_{m, {best}_{F}}^{u}$ and $y_{m, {best}_{F}}^{u}$ represent the x and y coordinates of the $m$th UAV of the best individual in group F at iteration u, respectively. $u_{max}$ is the maximum number of iterations, $u_{d}$ is the maximum number of iterations in the exploration phase, and $u_{d} = 0.9 u_{max}$.
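The exploration-phase update can be sketched as follows, under stated assumptions. Following lines 8 and 9 of Algorithm 3, a "forgotten dimension" is interpreted here as a UAV index $m$ whose two coordinates are both updated; that reading, the function names, and the independent draws of $r$ for x and y are assumptions. Constraint repair is omitted.

```python
import math
import random


def cosine_weight(u, u_max, u_d):
    """Factor 0.5 * (cos(pi * (u + u_max - u_d) / u_max) + 1) from Eqs. (28)-(29)."""
    return 0.5 * (math.cos(math.pi * (u + u_max - u_d) / u_max) + 1.0)


def forget_and_supplement(q_best, q_last, forgotten, x_bounds, y_bounds,
                          u, u_max, u_d, rng=None):
    """Memory strategy (Eq. 27) followed by the update of Eqs. (28)-(29).

    q_best: group-best individual [x_1, y_1, ..., x_M, y_M].
    q_last: list of (x, y) tuples from the last alternating optimization.
    forgotten: UAV indices (0-based) whose coordinates are forgotten.
    """
    rng = rng or random.Random()
    x_min, x_max = x_bounds
    y_min, y_max = y_bounds
    w = cosine_weight(u, u_max, u_d)
    q_new = list(q_best)  # Eq. (27): reset to the group best
    for m in forgotten:
        x_last, y_last = q_last[m]
        q_new[2 * m] = q_best[2 * m] + (x_last + rng.random() * (x_max - x_min)) * w
        q_new[2 * m + 1] = q_best[2 * m + 1] + (y_last + rng.random() * (y_max - y_min)) * w
    return q_new
```

The cosine factor is close to 1 at the start of the exploration phase and reaches 0 at $u = u_{d}$, so the supplemented term shrinks as exploration ends.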

4.4.5. Exploitation Phase $(u_{d} < u \leq u_{max})$

Memory strategy: This phase does not perform grouping. Before each dream, all individuals reset their positions to the best individual from the previous iteration in the entire population. The update formula can be expressed as

q_{i}^{u + 1} = q_{best}^{u}

(30)

where $q_{best}^{u}$ represents the best individual in the entire population at iteration u. The number of forgetting dimensions is the same for all individuals in the population, expressed as

f = ϑ_{randi} \left( 2, \max \left\{ 2, \left\lceil \frac{D_{dim}}{3} \right\rceil \right\} \right)

(31)

Forgetting and supplementary strategy: The position update formula can be expressed as

x_{m, i}^{u + 1} = x_{m, best}^{u} + \frac{1}{2} (x_{m}^{last} + r \times (x_{max} - x_{min})) \times \left( \cos \left( π \times \frac{u}{u_{max}} \right) + 1 \right)

(32)

y_{m, i}^{u + 1} = y_{m, best}^{u} + \frac{1}{2} (y_{m}^{last} + r \times (y_{max} - y_{min})) \times \left( \cos \left( π \times \frac{u}{u_{max}} \right) + 1 \right)

(33)

where $x_{m, best}^{u}$ and $y_{m, best}^{u}$ represent the x and y coordinates of the $m$th UAV of the best individual in the entire population at iteration u, respectively.

The detailed process of solving $P2$ with DOA is shown in Algorithm 3. The computational complexity of DOA is dominated by the exploration phase and the exploitation phase. The computational complexity of the exploration phase can be expressed as $O (u_{d} \times N_{1} \times D_{dim})$, and that of the exploitation phase as $O ((u_{max} - u_{d}) \times N_{1} \times D_{dim})$. Therefore, the total computational complexity of DOA is $O (u_{max} \times N_{1} \times D_{dim})$.

Algorithm 3 Dream Optimization Algorithm (DOA)

Input: $γ_{m k}^{rad} (t)$, $R_{m k}^{com} (t)$, $γ_{t h}^{rad}$, $N_{1}$, $u_{max}$, $α$, $p$

Output: $q_{best}$, $F_{fitness} (q_{best})$

Initialize: $u \leftarrow 1$; generate initial population $Q$ based on Equations (22)–(24)

1: while $0 < u \leq u_{d}$ do
2:   for $F = 1 : 5$ do
3:     Update $q_{{best}_{F}}$ and $F_{fitness} (q_{{best}_{F}})$;
4:     Update $f_{F}$ based on Equation (26);
5:     Update $q_{i}^{u + 1}$ based on Equation (27);
6:     for each individual $i \in F$ do
7:       Randomly select $f_{F}$ dimensions to forget;
8:       for each forgotten dimension m do
9:         Update $x_{m, i}^{u + 1}$ and $y_{m, i}^{u + 1}$ based on Equations (28) and (29);
10:        Check and repair constraints (21a)–(21f) for $x_{m, i}^{u + 1}$ and $y_{m, i}^{u + 1}$;
11:      end for
12:    end for
13:  end for
14:  Update $q_{best}$ and $F_{fitness} (q_{best})$ based on Equation (25);
15:  Update $u = u + 1$;
16: end while
17: while $u_{d} < u \leq u_{max}$ do
18:  Update $q_{i}^{u + 1}$ based on Equation (30);
19:  Update f based on Equation (31);
20:  for each individual i do
21:    Randomly select f dimensions to forget;
22:    for each forgotten dimension m do
23:      Update $x_{m, i}^{u + 1}$ and $y_{m, i}^{u + 1}$ based on Equations (32) and (33);
24:      Check and repair constraints (21a)–(21f) for $x_{m, i}^{u + 1}$ and $y_{m, i}^{u + 1}$;
25:    end for
26:  end for
27:  Update $q_{best}$ and $F_{fitness} (q_{best})$ based on Equation (25);
28:  Update $u = u + 1$;
29: end while
30: return $q_{best}$ and $F_{fitness} (q_{best})$.

4.5. UAV Transmit Power Optimization

For given $α$ and $q$, the UAV transmit power optimization subproblem can be expressed as

\begin{aligned} P3: \quad \max_{p} \quad & R_{sum}^{com} && \text{(34)} \\ s.t. \quad & γ_{m k}^{rad} (t) \geq γ_{t h}^{rad}, \; \forall m, k, t && \text{(34a)} \\ & 0 \leq p_{m} (t) \leq p_{max}, \; \forall m, t && \text{(34b)} \\ & p_{m} (t) \cdot δ_{t} + κ \cdot \| q_{m} (t + 1) - q_{m} (t) \| \leq E_{max} / T, \; \forall m \in M, t \in T && \text{(34c)} \end{aligned}

We use RRTOA [29] to solve $P3$. RRTOA is a meta-heuristic algorithm inspired by the search mechanism of rapidly-exploring random trees (RRT) in robot path planning. The RRT algorithm explores the search space by incrementally expanding a tree structure and adapts well to complex constraints and high-dimensional planning problems. However, traditional RRT algorithms suffer from slow convergence and poor path quality. In particular, the tree expansion becomes sparse in regions near the goal, which reduces computational efficiency. To address these limitations, RRTOA combines an adaptive step size wandering strategy, an absolute difference-based adaptive step size strategy, and a boundary-based adaptive step size strategy, which together search the solution space while guiding the population toward high-quality solutions. Reference [29] reports that RRTOA achieves competitive results against similar meta-heuristic algorithms across various problems. The process of solving $P3$ with RRTOA is described as follows.

4.5.1. Initialization

Based on the optimal UAV power allocation scheme from the last alternating optimization, $p^{last} = [p_{1}^{last}, p_{2}^{last}, \dots, p_{M}^{last}]$, initialize population $P$ and ensure that individuals in the population satisfy constraints (34a)–(34c). Population $P$ can be expressed as

P = \begin{bmatrix} p_{1} \\ p_{2} \\ \vdots \\ p_{N_{2}} \end{bmatrix} = \begin{bmatrix} p_{1, 1} & p_{2, 1} & \dots & p_{M, 1} \\ p_{1, 2} & p_{2, 2} & \dots & p_{M, 2} \\ \vdots & \vdots & \ddots & \vdots \\ p_{1, N_{2}} & p_{2, N_{2}} & \dots & p_{M, N_{2}} \end{bmatrix}

(35)

where $N_{2}$ is the number of individuals and each individual $p_{j}$ represents a UAV power allocation scheme. The population is initialized as follows

p_{m, j} = p_{m}^{last} + r_{1} \times (p_{max} - p_{min})

(36)

where $p_{m, j}$ represents the power of the $m$th UAV in the $j$th individual, and $r_{1}$ is a random number between 0 and 1.

4.5.2. Fitness Function

Transforming constraints (34a)–(34c) into penalty function terms, the fitness function is expressed as

F_{fitness} (p_{j}) = R_{sum}^{com} - ϖ_{2} \sum_{v \in V_{p}} Φ_{v}

(37)

where $Φ_{v}$ is the constraint violation indicator function, $ϖ_{2}$ is the penalty coefficient, and $V_{p}$ is the set of constraints (34a)–(34c).

4.5.3. Adaptive Step Size Wandering Strategy

To address the low exploration efficiency caused by a fixed step size, RRTOA models the random sampling mechanism as an adaptive step size wandering strategy to enhance the algorithm’s search capability. Combined with global random initialization, this strategy improves the effectiveness of the global search and helps keep the algorithm from falling into local optima. We define monotonic functions $A_{1}$ and $A_{2}$ that vary with the iteration number $u^{'}$ to control the dynamic adjustment of the step size, which can be expressed as

A_{1} = \frac{\ln (u_{max}^{'} - u^{'})}{\ln (u_{max}^{'})}, \quad A_{2} = \sqrt[3]{u^{'} / u_{max}^{'}}

(38)

where $u_{max}^{'}$ is the maximum number of iterations, and $u^{'}$ is the current iteration number.
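The two schedules in Equation (38) can be evaluated as in the sketch below; the function name is hypothetical. Note that $\ln (u_{max}^{'} - u^{'})$ is undefined at $u^{'} = u_{max}^{'}$. How the final iteration is handled is not stated, so this sketch simply rejects that input rather than guessing.

```python
import math


def step_schedules(u, u_max):
    """Return (A_1, A_2) from Eq. (38) for iteration u, with 1 <= u < u_max."""
    if not 1 <= u < u_max:
        # ln(u_max - u) is undefined at u = u_max; behaviour there is not specified.
        raise ValueError("u must satisfy 1 <= u < u_max")
    a1 = math.log(u_max - u) / math.log(u_max)
    a2 = (u / u_max) ** (1.0 / 3.0)
    return a1, a2
```

$A_{1}$ decreases from just below 1 toward 0 while $A_{2}$ increases toward 1, so the wandering strategy triggered by $r_{1} < A_{1}$ becomes less likely over time and the strategies gated by $A_{2}$ become more likely.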

When $r_{1} < A_{1}$, the update formula is as follows

p_{m, j}^{u^{'} + 1} = p_{m, j}^{u^{'}} + R_{1}

(39)

R_{1} = (r_{1} - \frac{A_{1}}{2}) \times A_{1} \times \frac{(p_{max} - p_{min})}{\hat{λ}}

(40)

where $p_{m, j}^{u^{'} + 1}$ represents the power of the $m$th UAV in the $j$th individual at iteration $u^{'} + 1$, $p_{m, j}^{u^{'}}$ represents the power of the $m$th UAV in the $j$th individual at iteration $u^{'}$, $R_{1}$ is the adaptive step size of this strategy, and $\hat{λ} = 10$ is the step size penalty factor.
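A minimal sketch of the wandering update in Equations (39) and (40) follows. The function name is hypothetical, and it assumes that the same draw $r_{1}$ is used in both the trigger condition and the step size, as the shared symbol suggests.

```python
def wandering_step(p, r1, a1, p_min, p_max, lam=10.0):
    """Apply Eqs. (39)-(40) to one power value.

    Returns p unchanged when the trigger r1 < A_1 is not met.
    lam is the step size penalty factor (10 in the text).
    """
    if r1 >= a1:
        return p
    step = (r1 - a1 / 2.0) * a1 * (p_max - p_min) / lam  # R_1 in Eq. (40)
    return p + step
```

For example, with $p = 1.0$, $r_{1} = 0.2$, $A_{1} = 0.8$, and a power range of 2, the step is $(0.2 - 0.4) \times 0.8 \times 2 / 10 = -0.032$, so the step can be negative or positive depending on whether $r_{1}$ falls below or above $A_{1} / 2$.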

4.5.4. Absolute Difference-Based Adaptive Step Size Strategy

RRTOA introduces an absolute difference-based adaptive step size strategy. This strategy dynamically adjusts the step size by calculating the absolute difference between the current particle position and the current best position, achieving a dynamic balance between large-scale exploration and local fine-tuning of the search space. Consequently, it enhances the algorithm’s global optimization capability and efficiency.

When $r_{2} < (A_{2} / 10)$, the update formula is as follows

p_{m, j}^{u^{'} + 1} = p_{m, best}^{u^{'}} + R_{2}

(41)

R_{2} = a_{1} | p_{m, best}^{u^{'}} - p_{m, j}^{u^{'}} |

(42)

where $p_{m, best}^{u^{'}}$ represents the power of the $m$th UAV in the best individual at iteration $u^{'}$, $R_{2}$ is the adaptive step size of this strategy, $r_{2}$ is a random number between 0 and 1, and $a_{1}$ is the adaptive step size adjustment coefficient of this strategy, which can be expressed as

a_{1} = 5 \times (r_{2} - A_{2} / 20) \times \cos (2 π r_{2}) \times e^{b}

(43)

where $e^{b}$ is the random disturbance factor, with $b = e^{\cos (π - π / u^{'})}$.
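Equations (41)–(43) can be sketched as a single function, shown below under stated assumptions: the function and argument names are hypothetical, and `sched_a2` stands for the schedule value $A_{2}$ to keep it distinct from the coefficient $a_{1}$.

```python
import math


def abs_diff_step(p, p_best, r2, sched_a2, u):
    """Apply Eqs. (41)-(43) to one power value at iteration u (u >= 1).

    Returns p unchanged when the trigger r2 < A_2 / 10 is not met.
    """
    if r2 >= sched_a2 / 10.0:
        return p
    b = math.exp(math.cos(math.pi - math.pi / u))
    coef = 5.0 * (r2 - sched_a2 / 20.0) * math.cos(2.0 * math.pi * r2) * math.exp(b)
    return p_best + coef * abs(p_best - p)  # Eq. (41) with R_2 from Eq. (42)
```

The step scales with the distance between the individual and the current best, so an individual that already coincides with the best solution is left at that point.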

4.5.5. Boundary-Based Adaptive Step Size Strategy

To achieve more precise optimization, a boundary-based adaptive step size strategy is introduced. The core of this strategy is exploration with very small random steps, which enables RRTOA agents to search the surrounding area in fine detail. This ensures localized searching around the current best solution while helping to avoid premature convergence to local optima.

When $r_{3} < (A_{2} / 50)$, the update formula is as follows

p_{m, j}^{u^{'} + 1} = p_{m, best}^{u^{'}} + R_{3}

(44)

R_{3} = (p_{max} - p_{min}) \times \cos (10 π u^{'} / u_{max}^{'}) \times a_{2}

(45)

where $R_{3}$ is the adaptive step size of this strategy, $r_{3}$ is a random number between 0 and 1, and $a_{2}$ is the adaptive step size adjustment coefficient of this strategy, which can be expressed as

a_{2} = r_{3} \times (r_{3} - A_{2} / 100) \times A_{1} \times (1 - u^{'} / u_{max}^{'})

(46)
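The boundary-based update in Equations (44)–(46) can be sketched as follows. The names are hypothetical; `sched_a1` and `sched_a2` denote the schedule values $A_{1}$ and $A_{2}$, kept separate from the coefficient $a_{2}$.

```python
import math


def boundary_step(p, p_best, r3, sched_a1, sched_a2, u, u_max, p_min, p_max):
    """Apply Eqs. (44)-(46) to one power value.

    Returns p unchanged when the trigger r3 < A_2 / 50 is not met.
    """
    if r3 >= sched_a2 / 50.0:
        return p
    coef = r3 * (r3 - sched_a2 / 100.0) * sched_a1 * (1.0 - u / u_max)  # a_2, Eq. (46)
    step = (p_max - p_min) * math.cos(10.0 * math.pi * u / u_max) * coef  # R_3, Eq. (45)
    return p_best + step
```

Because the trigger requires $r_{3} < A_{2} / 50 \leq 0.02$, the coefficient $a_{2}$ is at most on the order of $10^{-4}$ in magnitude, which is consistent with the very small steps described in the text.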

The detailed process of solving $P3$ with RRTOA is shown in Algorithm 4. The computational complexity of RRTOA is mainly affected by population size, number of iterations, and problem dimension. Therefore, the total computational complexity of RRTOA is $O (u_{max}^{'} \times N_{2} \times M)$.

Algorithm 4 RRT-Based Optimizer Algorithm (RRTOA)

Input: $γ_{m k}^{rad} (t)$, $R_{m k}^{com} (t)$, $γ_{t h}^{rad}$, $N_{2}$, $u_{max}^{'}$, $α$, $q$

Output: $p_{best}$, $F_{fitness} (p_{best})$

Initialize: $u^{'} \leftarrow 1$; generate initial population $P$ based on Equations (35) and (36)

1: while $u^{'} \leq u_{max}^{'}$ do
2:   if $r_{1} < A_{1}$ then
3:     Update $p_{m, j}^{u^{'} + 1}$ based on Equations (39) and (40);
4:   end if
5:   if $r_{2} < (A_{2} / 10)$ then
6:     Update $p_{m, j}^{u^{'} + 1}$ based on Equations (41)–(43);
7:   end if
8:   if $r_{3} < (A_{2} / 50)$ then
9:     Update $p_{m, j}^{u^{'} + 1}$ based on Equations (44)–(46);
10:  end if
11:  Update $p_{best}$ and $F_{fitness} (p_{best})$ based on Equation (37);
12:  Update $u^{'} = u^{'} + 1$;
13: end while
14: return $p_{best}$ and $F_{fitness} (p_{best})$.

4.6. Alternating Optimization-Based Predictive Scheduling Algorithm

We define the relevant variables in AOPSA: ${\hat{μ}}_{max}$ is the maximum number of iterations, $ε$ is the convergence threshold, and $R_{sum}^{com} (\hat{μ})$ represents the objective function value at iteration $\hat{μ}$. The detailed process of solving $P0$ with AOPSA is shown in Algorithm 5. The computational complexity of AOPSA is $O ({\hat{μ}}_{max} \times (u_{max} \times N_{1} \times D_{dim} + u_{max}^{'} \times N_{2} \times M + K M))$.

Algorithm 5 Alternating Optimization-based Predictive Scheduling Algorithm (AOPSA)

Input: M, K, ${\hat{μ}}_{max}$, $ε$, $γ_{m k}^{rad} (t)$, $R_{m k}^{com} (t)$, $γ_{t h}^{rad}$, $N_{1}$, $N_{2}$

Output: $α$, $q$, $p$, $R_{sum}^{com}$

Initialize: UAV initial positions, $p \leftarrow p_{max}$, $α \leftarrow 0$, $R_{sum}^{com} \leftarrow 0$

1: for $\hat{μ} = 1$ : ${\hat{μ}}_{max}$ do
2:   Optimize user association $α$ using Algorithm 2;
3:   Optimize UAV trajectory $q$ using Algorithm 3;
4:   Optimize UAV transmit power $p$ using Algorithm 4;
5:   Calculate $R_{sum}^{com} (\hat{μ})$;
6:   if $| R_{sum}^{com} (\hat{μ}) - R_{sum}^{com} | < ε$ then
7:     break;
8:   else
9:     Update $R_{sum}^{com} \leftarrow R_{sum}^{com} (\hat{μ})$;
10:  end if
11: end for
12: return $α$, $q$, $p$, $R_{sum}^{com}$.
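The control flow of Algorithm 5 can be sketched as a generic alternating loop. This is an illustrative skeleton only: the three optimizers and the rate function are caller-supplied callables standing in for Algorithms 2–4 and the objective, and the dictionary-based `state` and the returned rate history are assumptions of this sketch.

```python
def alternating_optimization(optimize_assoc, optimize_traj, optimize_power,
                             rate_fn, state, max_iters, eps):
    """Alternate over three blocks until the rate changes by less than eps.

    state: dict holding the current 'alpha', 'q', and 'p'.
    Returns the final state and the list of rates, one per iteration.
    """
    prev_rate = 0.0  # matches the initialization of the total rate to 0
    history = []
    for _ in range(max_iters):
        state["alpha"] = optimize_assoc(state)  # user association block
        state["q"] = optimize_traj(state)       # UAV trajectory block
        state["p"] = optimize_power(state)      # transmit power block
        rate = rate_fn(state)
        history.append(rate)
        if abs(rate - prev_rate) < eps:
            break
        prev_rate = rate
    return state, history
```

Each block is optimized with the other two held fixed, so the loop stops either when successive rates differ by less than the threshold or when the iteration budget is exhausted.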

5. Simulation Results

This section evaluates the performance of AOPSA through simulation. We adopt the millimeter-wave frequency band (28 GHz), which has abundant spectrum resources and can meet the system’s requirements for high-speed data transmission and high-precision sensing. The initial positions of ground users are randomly distributed within a 2D area of 800 m × 800 m. The main simulation parameters are shown in Table 1; their values mainly follow [16,25].

To demonstrate the effectiveness of AOPSA, we compare AOPSA with three other schemes as well as the joint optimization scheme in [30]. The details are as follows:

Scheme 1: UAVs are uniformly distributed throughout the area and remain stationary at altitude H. Only user association optimization and UAV power optimization are performed.

Scheme 2: UAV power is fixed at $p_{max}$. Only user association optimization and UAV trajectory optimization are performed.

Scheme 3: UAVs are uniformly distributed throughout the area and remain stationary at altitude H. UAV power is fixed at $p_{max}$, and only user association optimization is performed.

Reference [30]: Spectral clustering, a coalition game, and SCA are adopted to sequentially solve the three subproblems of UAV position optimization, user association optimization, and transmit power optimization.

We compare the computational complexity of AOPSA and the reference [30] algorithm in Table 2. The complexity of AOPSA was analyzed in Section 4 and is not repeated here. The reference [30] algorithm employs a coalition game, spectral clustering, and SCA to sequentially solve three subproblems: user association optimization, UAV position optimization, and transmit power optimization. In the user association optimization subproblem, the coalition game must traverse all possible user–UAV combinations in the worst case, resulting in an exponential computational complexity of $O (M^{K})$ for solving subproblem $P1$. The UAV position optimization subproblem involves three components: similarity matrix construction $O (K^{2})$, eigendecomposition $O (K^{3})$, and initial clustering $O (K M)$. For the power optimization subproblem, the computational complexity reaches $O (M^{3.5})$. Because the complexity of AOPSA is polynomial in $M$ and $K$ whereas the coalition game is exponential in $K$ in the worst case, AOPSA has a lower worst-case computational complexity and is better suited to large-scale scenarios.

Figure 5 compares the LSTM-predicted values with the actual values for a single user position. To present the user position at each time slot more intuitively, we convert the user coordinates into the distance between the user and the jammer. As shown in Figure 5, the predicted and actual curves fit well (coefficient of determination $R^{2} = 0.92$). This indicates that the LSTM model can effectively capture the temporal features of user position changes and achieve relatively accurate predictions.
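For readers who want to reproduce this kind of fit metric, the sketch below computes a coefficient of determination from two equal-length series. It uses the standard definition $R^{2} = 1 - SS_{res} / SS_{tot}$; whether the reported value was computed with exactly this variant is an assumption, and the function name is hypothetical.

```python
def r_squared(actual, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot.

    actual, predicted: equal-length sequences of numbers
    (for example, user-to-jammer distances per time slot).
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("series must be non-empty and of equal length")
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot
```

A perfect prediction gives 1, and predicting the mean of the actual series for every slot gives 0.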

Before comparing the performance of different schemes, we validate the convergence behavior of AOPSA. As illustrated in Figure 6, we compare the total communication rate of AOPSA (per single time slot) with that of the algorithm in reference [30] as a function of the iteration number. The results show that AOPSA rapidly increases the communication rate to 53 bps/Hz within the first four iterations and maintains stable performance thereafter. Compared with reference [30], the proposed algorithm achieves an improvement of approximately 23.2%, demonstrating favorable convergence behavior and effectiveness.

Figure 7 compares the total communication rate of the five optimization schemes over the entire flight time. As the number of time slots increases, the total communication rates of all schemes show an upward trend. The best performance is achieved by AOPSA, which reaches approximately 3750 bps/Hz at the 60th time slot. Reference [30] reaches 2650 bps/Hz, while baseline scheme 3 reaches only around 1300 bps/Hz. Compared with [30], AOPSA achieves a 41.5% performance improvement, and compared with baseline scheme 3, the improvement reaches 188%. This result demonstrates AOPSA’s effectiveness in solving the joint optimization problem of user association, UAV trajectory optimization, and power allocation.
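The two percentages follow directly from the rates read off Figure 7, as the short check below shows. The helper name is hypothetical; the input values are the approximate figures quoted in the text.

```python
def relative_gain(new, baseline):
    """Relative improvement of new over baseline, in percent."""
    return (new - baseline) / baseline * 100.0


# Approximate rates at the 60th time slot, in bps/Hz, as quoted in the text.
gain_vs_ref30 = relative_gain(3750, 2650)    # about 41.5 %
gain_vs_scheme3 = relative_gain(3750, 1300)  # about 188 %
```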

Figure 8 compares the total communication rate of the five optimization schemes under different numbers of users. As the number of users increases, the total communication rates of all schemes show an upward trend. AOPSA consistently maintains the best performance, indicating better scalability. This suggests that AOPSA has stronger resource optimization and scheduling capabilities in multi-user scenarios and can cope with changes in the number of users in practical deployments.

Figure 9 compares AOPSA with the reference [30] algorithm in terms of the total communication rate over all time slots under different $γ_{t h}^{rad}$. Under the same $γ_{t h}^{rad}$ constraint, the total communication rate of AOPSA is significantly higher than that of the reference [30] algorithm. As the $γ_{t h}^{rad}$ value increases, the total communication rates of both algorithms exhibit a decreasing trend. This result indicates that a higher $γ_{t h}^{rad}$ value requires more resources to meet the sensing performance requirements, thereby leading to a degradation in communication performance. Nevertheless, AOPSA still demonstrates superior performance, achieving an effective balance between communication and sensing.

Figure 10 compares the total communication rate of AOPSA across time slots under different levels of position prediction noise. The blue curve represents performance under ideal prediction conditions, with the communication rate increasing from approximately 1100 bps/Hz to 3300 bps/Hz, the highest of the three cases. In practical scenarios with position prediction noise, system performance decreases to varying degrees: the red curve (position noise ±5%) shows communication rates ranging from 1000 bps/Hz to 3000 bps/Hz, and the yellow curve (position noise ±10%) exhibits the lowest performance, with rates increasing from 900 bps/Hz to 2600 bps/Hz. This demonstrates that position prediction accuracy significantly affects system communication performance. As position noise increases, the gap between system performance and the ideal-prediction case widens, which underlines the importance of adopting robust prediction algorithms in practical applications.

As shown in Figure 11, the total communication rate of AOPSA and the algorithm in reference [30] is compared across all time slots under different $p_{J}$ values. The results demonstrate that as $p_{J}$ increases, the performance of both algorithms declines. Nevertheless, AOPSA consistently outperforms the reference [30] algorithm. This indicates that the jamming power $p_{J}$ directly affects the SINR of the communication link. Specifically, an increase in $p_{J}$ intensifies the jamming imposed by the jammer on the user, leading to a reduction in the communication SINR and a consequent significant decrease in the communication rate.

Figure 12 presents a comparison of the total communication rate between AOPSA and the algorithm in reference [30] across all time slots under different $p_{max}$ values. As shown in the figure, the performance of both algorithms improves as $p_{max}$ increases, with AOPSA consistently outperforming the reference [30] algorithm. This result indicates that a higher $p_{max}$ value expands the feasible region for optimizing the UAV transmit power, allowing the algorithm to adjust the power allocation more flexibly within the extended constraint, thereby achieving significant performance enhancement.

6. Conclusions

This paper focuses on the multi-UAV-assisted ISAC system under malicious jamming environments, studying the joint optimization problem of user association, UAV trajectory planning, and transmit power allocation. To address the impact of user mobility and enhance system real-time performance, this paper proposes a user position prediction model based on LSTM to achieve accurate prediction of user positions in future time slots. To reduce the impact of malicious jamming and maximize the total communication rate of users, this paper adopts IAA, DOA, and RRTOA to optimize user association, UAV trajectory, and UAV transmit power, respectively. On this basis, the AOPSA is proposed to achieve joint optimization of the three variables. Simulation results demonstrate that the proposed algorithms can significantly improve the system’s total communication rate under malicious jamming environments, providing reliable technical support for practical deployment.

However, this paper primarily focuses on the analysis and discussion of stationary jammers. In practical scenarios, jammers are often mobile and possess a certain level of intelligence, enabling them to dynamically adjust their jamming methods and positions based on the countermeasures adopted by the communication system. This significantly enhances jamming efficiency and concealment. Such dynamic and intelligent jamming behaviors pose more severe challenges to existing anti-jamming schemes. Therefore, in future work, we will further investigate the underlying mechanisms of dynamic intelligent jamming scenarios and, on this basis, develop corresponding intelligent anti-jamming and collaborative communication strategies to improve the robustness of the system in such dynamic and intelligent jamming environments.

Author Contributions

Conceptualization, J.W., X.W., and R.X.; methodology, J.W. and X.W.; software, L.P. and R.X.; validation, J.W.; formal analysis, J.W.; investigation, J.W.; resources, J.W.; data curation, J.W.; writing—original draft preparation, J.W.; writing—review and editing, J.W. and X.W.; visualization, J.W.; supervision, L.P. and R.X.; project administration, J.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

UAV	Unmanned aerial vehicle
ISAC	Integrated sensing and communication
6G	Sixth generation
LoS	Line-of-sight
MI	Mutual information
RIS	Reconfigurable intelligent surface
IoT	Internet of Things
3D	Three-dimensional
AoI	Age of information
CSI	Channel state information
TDM	Time-division multiplexing
RCS	Radar cross section
OFDMA	Orthogonal frequency division multiple access
SINR	Signal-to-interference-plus-noise ratio
AWGN	Additive white Gaussian noise
MINLP	Mixed integer nonlinear programming
LSTM	Long short-term memory
IAA	Improved auction algorithm
PSO	Particle swarm optimization
SCA	Successive convex approximation
DOA	Dream optimization algorithm
RRTOA	Rapidly-exploring random trees-based optimizer algorithm

References

1. Wei, Z.; Qu, H.; Wang, Y.; Yuan, X.; Wu, H.; Du, Y. Integrated Sensing and Communication Signals Toward 5G-A and 6G: A Survey. IEEE Internet Things J. 2023, 10, 11068–11092.
2. Liu, F.; Cui, Y.; Masouros, C.; Xu, J.; Han, T.; Eldar, Y. Integrated Sensing and Communications: Toward Dual-Functional Wireless Networks for 6G and Beyond. IEEE J. Sel. Areas Commun. 2022, 40, 1728–1767.
3. Zhang, J.A.; Rahman, M.d.L.; Wu, K.; Huang, X.; Guo, Y.J.; Chen, S.; Yuan, J. Enabling Joint Communication and Radar Sensing in Mobile Networks—A Survey. IEEE Commun. Surv. Tutorials. 2022, 24, 306–345.
4. Meng, K.; Wu, Q.; Xu, J.; Chen, W.; Feng, Z.; Schober, R.; Swindlehurst, A.L. UAV-Enabled Integrated Sensing and Communication: Opportunities and Challenges. IEEE Wirel. Commun. 2024, 31, 97–104.
5. Mu, J.; Zhang, R.; Cui, Y.; Gao, N.; Jing, X. UAV Meets Integrated Sensing and Communication: Challenges and Future Directions. IEEE Commun. Mag. 2023, 61, 62–67.
6. Meng, K.; He, X.; Wu, Q.; Li, D. Multi-UAV Collaborative Sensing and Communication: Joint Task Allocation and Power Optimization. IEEE Trans. Wirel. Commun. 2023, 22, 4232–4246.
7. Liu, Y.; Liu, S.; Liu, X.; Liu, Z.; Durrani, T.S. Sensing Fairness-Based Energy Efficiency Optimization for UAV Enabled Integrated Sensing and Communication. IEEE Wirel. Commun. Lett. 2023, 12, 1702–1706.
8. Meng, K.; Wu, Q.; Ma, S.; Chen, W.; Wang, K.; Li, J. Throughput Maximization for UAV-Enabled Integrated Periodic Sensing and Communication. IEEE Trans. Wirel. Commun. 2023, 22, 671–687.
9. Zhou, Y.; Liu, X.; Zhai, X.; Zhu, Q.; Durrani, T.S. UAV-Enabled Integrated Sensing, Computing, and Communication for Internet of Things: Joint Resource Allocation and Trajectory Design. IEEE Internet Things J. 2024, 11, 12717–12727.
10. Liu, Z.; Liu, X.; Liu, Y.; Leung, V.C.M.; Durrani, T.S. UAV Assisted Integrated Sensing and Communications for Internet of Things: 3D Trajectory Optimization and Resource Allocation. IEEE Trans. Wirel. Commun. 2024, 23, 8654–8667.
11. Zhu, Z.; Li, Z.; Chu, Z.; Guan, Y.; Wu, Q.; Xiao, P.; Renzo, M.D.; Lee, I. Intelligent Reflecting Surface Assisted mmWave Integrated Sensing and Communication Systems. IEEE Internet Things J. 2024, 11, 29427–29437.
12. Zhu, Z.; Gong, M.; Sun, G.; Liu, P.; Mi, D. AI-enabled STAR-RIS aided MISO ISAC Secure Communications. Tsinghua Sci. Technol. 2025, 30, 998–1011.
13. Zhang, R.; Zhang, Y.; Tang, R.; Zhao, H.; Xiao, Q.; Wang, C. A Joint UAV Trajectory, User Association, and Beamforming Design Strategy for Multi-UAV-Assisted ISAC Systems. IEEE Internet Things J. 2024, 11, 29360–29374.
14. Qin, Y.; Zhang, Z.; Li, X.; Huangfu, W.; Zhang, H. Deep Reinforcement Learning Based Resource Allocation and Trajectory Planning in Integrated Sensing and Communications UAV Network. IEEE Trans. Wirel. Commun. 2023, 22, 8158–8169.
15. Wang, Q.; Chai, R.; Sun, R.; Pu, R.; Chen, Q. ISAC-Enabled Multi-UAV Cooperative Perception and Trajectory Optimization. IEEE Internet Things J. 2024, 11, 40982–40995.
16. Liu, X.; Liu, Y.; Liu, Z.; Durrani, T.S. Fair Integrated Sensing and Communication for Multi-UAV-Enabled Internet of Things: Joint 3-D Trajectory and Resource Optimization. IEEE Internet Things J. 2024, 11, 29546–29556.
17. Al-habob, A.A.; Dobre, O.A.; Jing, Y. Predictive Beamforming Approach for Secure Integrated Sensing and Communication with Multiple Aerial Eavesdroppers. IEEE Trans. Commun. 2025; early access.
18. Pan, Y.; Li, R.; Da, X.; Hu, H.; Zhang, M.; Zhai, D.; Cumanan, K.; Dobre, O.A. Cooperative Trajectory Planning and Resource Allocation for UAV-Enabled Integrated Sensing and Communication Systems. IEEE Trans. Veh. Technol. 2024, 73, 6502–6516.
19. Liu, Y.; Zhang, B.; Hu, J.; Guo, D. Beamforming Optimization for UAV-Enable ISAC Systems under Malicious Jamming Attack. In Proceedings of the 2024 IEEE 24th International Conference on Communication Technology (ICCT), Chengdu, China, 18–20 October 2024; pp. 1229–1233.
20. Mei, H.; Zhang, H.; Zhou, X.; Wang, J. AoI Minimization for Air–Ground Integrated Sensing and Communication Networks with Jamming Attack. IEEE Trans. Veh. Technol. 2025, 74, 12776–12790.
21. Jing, X.; Liu, F.; Masouros, C.; Zeng, Y. ISAC From the Sky: UAV Trajectory Design for Joint Communication and Target Localization. IEEE Trans. Wirel. Commun. 2024, 23, 12857–12872.
22. Gu, S.; Luo, C.; Luo, Y.; Ma, X. Jointly Optimize Throughput and Localization Accuracy: UAV Trajectory Design for Multiuser Integrated Communication and Sensing. IEEE Internet Things J. 2024, 11, 39497–39511.
23. Jiang, X.; Sheng, M.; Zhao, N.; Liu, J.; Niyato, D.; Yu, F.R. Outage Analysis of UAV-Aided Networks With Underlaid Ambient Backscatter Communications. IEEE Trans. Wirel. Commun. 2023, 22, 7492–7505.
24. Huang, Z.; Sheng, Z.; Nasir, A.A.; Yu, H. Energy Efficiency Maximization for UAV-Assisted Full-Duplex Communication in the Presence of Multiple Malicious Jammers. IEEE Syst. J. 2024, 18, 1257–1268.
25. Liu, Y.; Liu, X.; Liu, Z.; Yu, Y.; Jia, M.; Na, Z.; Durrani, T.S. Secure Rate Maximization for ISAC-UAV Assisted Communication Amidst Multiple Eavesdroppers. IEEE Trans. Veh. Technol. 2024, 73, 15843–15847.
26. Song, T.; Liu, C.; Wu, R.; Jin, Y.; Jiang, D. A Hierarchical Scheme for Remaining Useful Life Prediction With Long Short-Term Memory Networks. Neurocomputing 2022, 487, 22–33.
27. Mi, J.; Wen, X.; Sun, C.; Lu, Z.; Jing, W. Energy-efficient and Low Package Loss Clustering in UAV-assisted WSN using Kmeans++ and Fuzzy Logic. In Proceedings of the 2019 IEEE/CIC International Conference on Communications Workshops (ICCC Workshops), Changchun, China, 11–13 August 2019; pp. 210–215.
28. Lang, Y.; Gao, Y. Dream Optimization Algorithm (DOA): A Novel Metaheuristic Optimization Algorithm Inspired By Human Dreams and its Applications to Real-world Engineering Problems. Comput. Methods Appl. Mech. Eng. 2025, 436, 117718.
29. Lai, G.; Li, T.; Shi, B. RRT-Based Optimizer: A Novel Metaheuristic Algorithm Based on Rapidly-Exploring Random Trees Algorithm. IEEE Access. 2025, 13, 42744–42776.
30. Wang, X.; Fei, Z.; Zhang, J.A.; Huang, J.; Yuan, J. Constrained Utility Maximization in Dual-Functional Radar-Communication Multi-UAV Networks. IEEE Trans. Commun. 2021, 69, 2660–2672.

Figure 1. Multi-UAV-assisted ISAC system model under malicious jamming environments.

Figure 2. Time slot division.

Figure 3. LSTM prediction model.

Figure 4. Flowchart of alternating optimization schemes based on user position prediction.

Figure 5. Comparison of LSTM-predicted values and actual values for a single user position.

Figure 6. Convergence analysis [30].

Figure 7. Comparison of total communication rate for five schemes ($M = 5$, $K = 20$) [30].

Figure 8. Comparison of total communication rate for five schemes under different numbers of users ($M = 5$) [30].

Figure 9. Comparison of the total communication rate between AOPSA and reference [30] under different values of $γ_{th}^{rad}$ ($M = 5$, $K = 20$).

Figure 10. Comparison of total communication rate of AOPSA under different position noise ($M = 5$, $K = 20$).

Figure 11. Comparison of the total communication rate between AOPSA and reference [30] under different values of $p_{J}$ ($M = 5$, $K = 20$).

Figure 12. Comparison of the total communication rate between AOPSA and reference [30] under different values of $p_{max}$ ($M = 5$, $K = 20$).

Table 1. Main simulation parameters.

Parameter	Symbol	Value
UAV transmit antenna gain	$G_{s}$	40 dBi
UAV receive antenna gain	$G_{e}$	30 dBi
Jammer transmit antenna gain	$G_{J}$	20 dBi
User receive antenna gain	$G_{r}$	5 dBi
Number of time slots	T	60
Length of time slot	$δ_{t}$	0.5 s
RCS of target	$σ$	1 m$^{2}$
UAV collision avoidance distance	$D_{min}$	30 m
UAV maximum speed	$v_{max}$	30 m/s
UAV flying altitude	H	100 m
Noise power	$σ_{noise}^{2}$	−110 dBm
UAV maximum transmit power	$p_{max}$	35 dBm
Jammer transmit power	$p_{J}$	20 dBm
Minimum SINR for radar sensing	$γ_{t h}^{rad}$	−10 dB
Convergence threshold	$ε$	0.001
Maximum number of iterations	${\hat{μ}}_{max}$	15
Allocation weight parameter	ℓ	0.5
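
The dB-scaled entries of Table 1 usually have to be converted to linear units before they enter a rate or SINR expression. The following is a minimal sketch of that conversion, together with two quantities that follow directly from the table (total flight time $T δ_{t}$ and the largest per-slot displacement $v_{max} δ_{t}$). The helper names `dbm_to_watt` and `db_to_linear` and the dictionary keys are illustrative assumptions, not identifiers from the paper.

```python
import math


def dbm_to_watt(p_dbm: float) -> float:
    """Convert a power level from dBm to watts."""
    return 10 ** ((p_dbm - 30.0) / 10.0)


def db_to_linear(x_db: float) -> float:
    """Convert a ratio from dB to linear scale."""
    return 10 ** (x_db / 10.0)


# Values copied from Table 1.
T = 60                # number of time slots
DELTA_T = 0.5         # slot length in seconds
V_MAX = 30.0          # UAV maximum speed in m/s
P_MAX_DBM = 35.0      # UAV maximum transmit power
P_J_DBM = 20.0        # jammer transmit power
NOISE_DBM = -110.0    # noise power
GAMMA_TH_DB = -10.0   # minimum radar-sensing SINR

# Hypothetical container for the derived quantities.
derived = {
    "mission_duration_s": T * DELTA_T,
    "max_step_m": V_MAX * DELTA_T,
    "p_max_w": dbm_to_watt(P_MAX_DBM),
    "p_j_w": dbm_to_watt(P_J_DBM),
    "noise_w": dbm_to_watt(NOISE_DBM),
    "gamma_th_linear": db_to_linear(GAMMA_TH_DB),
}

for name, value in derived.items():
    print(f"{name}: {value:.4g}")
```

With the tabulated values, the flight lasts 30 s, a UAV can move at most 15 m per slot, the jammer radiates 0.1 W, and the UAV power budget is roughly 3.16 W.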

Table 2. Comparison of computational complexity.

Problem	AOPSA	Reference [30]
Subproblem $P1$	$O(K M)$	$O(μ_{1} \times K \times M^{K})$
Subproblem $P2$	$O(u_{\max} \times N_{1} \times D_{dim})$	$O(K^{3} + K^{2} + K M)$
Subproblem $P3$	$O(u_{\max}^{'} \times N_{2} \times M)$	$O(μ_{2} \times M^{3.5})$
Problem $P0$	$O({\hat{μ}}_{\max} (u_{\max} \times N_{1} \times D_{dim} + u_{\max}^{'} \times N_{2} \times M + K M))$	$O(η_{\max} (μ_{1} \times K \times M^{K} + K^{3} + μ_{2} \times M^{3.5}))$
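
To give a feel for the gap in the user-association row of Table 2, the sketch below evaluates the two order-of-growth expressions, $K M$ and $μ_{1} \times K \times M^{K}$, for a few user counts with $M = 5$. These are asymptotic orders, not measured operation counts, so the numbers only indicate scaling. The function names are hypothetical, and $μ_{1} = 1$ is an assumed placeholder because its value is not given in the table.

```python
def aopsa_p1_order(K: int, M: int) -> int:
    """Order of growth of subproblem P1 under AOPSA: K * M."""
    return K * M


def ref30_p1_order(K: int, M: int, mu_1: int = 1) -> int:
    """Order of growth of subproblem P1 in reference [30]: mu_1 * K * M**K.

    mu_1 defaults to 1 purely as a placeholder assumption.
    """
    return mu_1 * K * M ** K


M = 5  # number of UAVs used in the simulations
for K in (5, 10, 20):
    linear_term = aopsa_p1_order(K, M)
    exponential_term = ref30_p1_order(K, M)
    print(f"K={K:2d}: K*M = {linear_term:4d}, K*M^K = {exponential_term:.3e}")
```

The linear term grows from 25 to 100 over this range, while the exponential term grows by many orders of magnitude, which is the scaling difference the first row of the table expresses.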

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

