Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments

Tong, Hanyang; Song, Ziyang; Zhu, Zhenyan; Sun, Jinlong

doi:10.3390/drones10050376

Open AccessArticle

Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments

School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China

^*

Author to whom correspondence should be addressed.

Drones 2026, 10(5), 376; https://doi.org/10.3390/drones10050376

Submission received: 1 April 2026 / Revised: 7 May 2026 / Accepted: 11 May 2026 / Published: 14 May 2026

(This article belongs to the Section Innovative Urban Mobility)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

A digital twin-driven framework solves bottlenecks of Unmanned Aerial Vehicle (UAV) swarm urban logistics involving positioning uncertainty and non-line-of-sight (NLoS) blockage.
The adaptive continuous optimization outperforms conventional static planning, balancing communication reliability and delivery efficiency.

What are the implications of the main findings?

This work provides a practical solution for dual-role UAV swarms in dense urban logistics and communications.
It offers a new scheme for UAV dynamic optimization in uncertain and obstacle-dense environments.

Abstract

Unmanned aerial vehicles (UAVs) serve as both communication relays and aerial couriers in modern urban logistics networks. Conventional trajectory optimization methods assume perfect localization and isotropic free-space tracking signal propagation, which limits their effectiveness in urban canyons. To address the positional uncertainty and signal blockage from buildings, we propose a digital twin-driven framework for continuous trajectory and resource optimization in UAV swarms. We model an urban environment containing random high-rise structures, applying a non-line-of-sight (NLoS) uncertainty to reflect realistic communication degradation. The digital twin (DT) architecture utilizes a dual-layer spatial representation that captures a dynamically decaying positional uncertainty radius of the recipient. We define a strict visual localization boundary that initiates deterministic target tracking with a state transition mechanism. To manage the complexity of swarm routing, we apply Density-Based Spatial Clustering of Applications with Noise (DBSCAN), assigning one UAV courier and one logistics transfer station to each cluster. The system executes a continuous re-optimization loop using an adaptive multi-objective Genetic Algorithm. This framework jointly minimizes cumulative outage probability and total flight time while enforcing a signal-to-noise ratio threshold and throughput constraints. This continuous adaptation mechanism mitigates NLoS blockage risks, supporting reliable communication and efficient delivery in Global Navigation Satellite System (GNSS)-degraded and obstacle-dense urban environments.

Keywords:

UAV relay; digital twin; trajectory optimization; logistics transfer station deployment; positional uncertainty

1. Introduction

Unmanned aerial vehicles (UAVs) have established themselves as indispensable assets within the evolving architecture of next-generation wireless networks and intelligent logistics systems. In the context of Sixth Generation (6G) and Beyond 5G (B5G) networks, the deployment of UAV swarms as mobile base stations, communication relays, and aerial couriers offers unprecedented flexibility, rapid deployment capabilities, and the capacity to operate in environments where traditional terrestrial infrastructure is heavily degraded, economically unviable, or entirely non-existent. The inherent agility of multi-rotor UAV platforms allows them to establish high-probability line-of-sight (LoS) links with ground-level package recipients, theoretically yielding significant enhancements in spectral efficiency, coverage probability, and overall system throughput.

Currently, numerous studies are dedicated to researching trajectory optimization for UAV in various scenarios and their corresponding communication systems. A method to jointly optimize the flight trajectory and transmission power of UAVs to minimize the decreasing trend of effective coverage is proposed in [1]. A hierarchical reinforcement learning-based trajectory planning algorithm for UAV formation is proposed to plan the trajectories of UAV formation in complex and unknown environments in [2]. An optimal deployment scheme for UAVs as relay stations is studied in [3,4]. Moreover, the strategic deployment of multiple UAVs combined with advanced user pairing techniques is proven to significantly enhance overall network capacity and efficiency in complex scenarios [5]. To maximize the flexibility of UAV and plan its trajectory, a considerable number of studies have focused on the Traveling Salesman Problem (TSP) and its variants [6]. However, high computational complexity and enormous scale remain persistent limitations of traditional TSP formulations. To address this challenge, many studies apply swarm intelligence (SI) to optimize UAV trajectory [7,8]. Recent advancements focus on adapting to dynamic physical environments; for instance, online trajectory optimization based on outage probability knowledge maps is proposed to maintain connectivity and minimize energy consumption [9]. An evolutionary algorithm is proposed to build an innovative multi-UAV system in [10]. The application of various SI algorithms for complicated tasks by UAV swarms is discussed in [11,12].

Furthermore, recent literature has proposed several advanced variants of Genetic Algorithms (GAs) to address complex multi-objective optimization challenges across diverse domains. These notable advancements include collaborative multi-objective approaches designed for efficient task offloading in Mobile edge computing (MEC)-assisted vehicular networks [13], hybrid algorithms combining particle swarm optimization and GAs with niching technology for edge server placement [14], and length-adaptive non-dominated sorting GAs developed for high-dimensional feature selection [15]. While these studies provide significant improvements to the internal structures of GAs, our work further investigates the integration of the algorithm with real-time 3D physical-layer data from digital twin (DT) to address highly dynamic urban environments.

As the UAV industry continues to evolve, researchers note that idealized environmental assumptions often fail to account for the dynamic factors present in real-world applications. Consequently, various studies incorporate digital twin technology into research on UAV relay systems [16,17]. A novel architecture named Heterogeneous UAV Digital Twin Network (HU-DTN) is proposed in [18]. A digital twin assisted path-planning framework for a UAV swarm with two phases: global path planning and real time trajectory planning is proposed in [19]. Furthermore, DT technology is increasingly integrated with advanced learning algorithms to jointly optimize UAV trajectories, task offloading, and caching in intelligent transportation systems [20]. DT technology also proves highly effective in ensuring communication continuity and managing dynamic mobility; for instance, a novel DT-assisted handover authentication scheme is proposed to drastically reduce signaling overhead and connection interruptions in 5G and beyond networks [21]. Some studies achieve interactive software bridging between two open-source digital twin platforms such that the same scene is evaluated with high fidelity across NVIDIA’s Sionna and Aerial Omniverse Digital Twin [22]. In addition to these, a digital twin (DT)-based framework for autonomous drone navigation in Global Positioning System(GPS)-denied indoor environments is presented in [23].

Despite these profound theoretical advantages, the practical deployment of UAV swarms in dense urban operations is severely bottlenecked by legacy trajectory planning paradigms that rely on idealized environmental assumptions. Traditional frameworks universally formulate the UAV routing challenge as a variant of the TSP or Vehicle Routing Problem (VRP), utilizing static optimization methodologies to compute a fixed trajectory prior to deployment. These legacy models operate on two flawed assumptions: perfect a priori knowledge of the package recipients’ coordinates and isotropic, unobstructed electromagnetic signal propagation. In realistic urban environments, these assumptions systematically fail. First, GPS signals are subject to severe multi-path fading, shadowing, and interference within urban canyons, resulting in significant positional uncertainty for ground-level package recipients. When an aerial courier executes a statically optimized trajectory based on erroneous coordinate data, it expends critical battery reserves and mission time executing localized search patterns upon arrival, fundamentally degrading the efficiency of the logistics delivery network. Second, the physical topology of modern metropolitan centers—characterized by dense clusters of high-rise structures—introduces severe building blockages. A communication link intersecting a concrete and steel structure experiences an abrupt channel state transition from an LoS state to a non-line-of-sight (NLoS) state, incurring excess attenuation penalties that drop the signal-to-noise ratio (SNR) below operational thresholds, thereby inducing communication outages and violating quality-of-service mandates.

To bridge the gap between design-time planning and runtime physical execution, this research proposes a comprehensive optimization framework driven by a high-fidelity DT architecture and an explicit 3D stochastic Boolean blockage model. The digital twin serves as the cognitive computational core of the swarm, maintaining a dual-layer representation of positional information that explicitly models the uncertainty gap between the true location of the package recipient and sensor measurements. By continuously feeding updated spatial probabilities into a hybrid Genetic Algorithm (GA), the digital twin enables real-time, continuous trajectory re-optimization. Furthermore, this framework mathematically models the physical urban environment, incorporating an exhaustive 3D building blockage topology. The simulation environment explicitly enforces a severe attenuation penalty for NLoS propagation caused by high-rise intersections, counterbalanced by a minimal 1 dB penalty for LoS links. To achieve precision in logistics delivery, the architecture integrates a strictly defined visual localization boundary, requiring complex quadratic geometric intersection algorithms to trigger a navigation state transition from stochastic uncertainty estimation to deterministic visual tracking.

The ultimate objective is the joint minimization of cumulative outage probability and total flight time, subject to a rigid minimum throughput requirement, ensuring highly reliable communication relaying and efficient physical parcel delivery in profoundly constrained urban airspaces. Based on the above considerations, the principal contributions are summarized as follows:

At the architectural level, we propose a digital twin system that models recipient positional uncertainty using a dual-layer spatial representation. Initially, UAV navigation relies exclusively on an uncertainty layer, which tracks stochastic positions within a decaying error circle. Parallel to this is the ground truth layer, defining the exact physical destination and its visual localization boundary. Crucially, the true coordinates remain strictly unobservable to the system until the UAV physically breaches the visual boundary. This design accurately captures the realistic transition from GPS-degraded stochastic searching to deterministic visual tracking.
At the operational level, we implement a continuous trajectory re-optimization strategy facilitated by the digital twin framework. Rather than relying on a static, pre-computed flight path, the aerial courier’s route is continuously re-optimized every few seconds during the active delivery mission. As the digital twin updates detected target positions, a Genetic Algorithm constantly adapts the trajectory, uncovering more efficient flight corridors and actively mitigating NLoS building blockages.
At the decomposition level, we integrate Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to resolve the combinatorial complexities of large-scale drone logistics. This approach geographically partitions the scattered package recipients into highly cohesive delivery zones, strategically determining the optimal location for one central logistics transfer station and assigning exactly one dedicated UAV courier per cluster, ensuring real-time computational viability.
At the algorithmic level, we design a multi-objective adaptive weighting mechanism for the hybrid Genetic Algorithm. The system autonomously extracts regional characteristics—specifically recipient scale and spatial density—to generate cluster-specific weights. This dynamically balances the dual mandates of minimizing cumulative communication outage probability and reducing total physical delivery time, while strictly enforcing a minimum throughput constraint.

The remainder of this paper is organized as follows. Section 2 establishes the system model, including the DT framework, positional uncertainty, and 3D blockage. Section 3 details station deployment and trajectory optimization using the adaptive GA. Section 4 presents numerical results and discussions, followed by conclusions and future work in Section 5.

2. System Model

2.1. Digital Twin Framework and Position Uncertainty Modeling

We consider a UAV swarm-aided logistics and communication system. The system comprises K UAVs (numbered from 1 to K) serving P user nodes, with each UAV playing dual roles as both a delivery agent and a communication relay. In this scenario, the UAV must carry the package from the transfer station, traverse all user nodes, and ultimately return to the transfer station. Rather than assuming perfect knowledge of user locations and isotropic free-space channels, we incorporate the inherent location errors of positioning systems and the severe physical obstacles present in urban environments.

To accurately reflect a modern metropolitan core, the simulation bounds define a constrained urban grid. Within this boundary, the environment generates exactly

N_{b} = 20

distinct 3D high-rise building structures. The maximum structural heights for the m-th building, denoted as

h_{m}

(

1 \leq m \leq 20

), are distributed uniformly within a range spanning from 50 m to 200 m. These buildings are represented as solid rectangular prisms and act as definitive electromagnetic obstacles. We denote the set of user nodes as

CN = {1, 2, \dots, P}

. For each user node

p \in CN

, rather than treating its position as a fixed, precisely known coordinate, we model it through a dual-layer digital twin framework that captures positioning uncertainty [24].

Each user node p has a true position at coordinates

({\tilde{x}}_{p}, {\tilde{y}}_{p})

, representing the actual physical location of the package recipient on the ground. This serves as the center of both the error circle and the visual localization circle. Around each true position, we define an error circle with radius

r_{e} (p, t)

. This represents the GPS-degraded positioning uncertainty range. When we “detect” a user’s position, we observe a coordinate somewhere within this error circle. Mathematically, the detected position

({\hat{x}}_{p} (t), {\hat{y}}_{p} (t))

at any detection instant satisfies:

{({\tilde{x}}_{p} - {\hat{x}}_{p} (t))}^{2} + {({\tilde{y}}_{p} - {\hat{y}}_{p} (t))}^{2} \leq r_{e} {(p, t)}^{2}

(1)

Furthermore, we define a strict visual localization circle, centered at the true position

({\tilde{x}}_{p}, {\tilde{y}}_{p})

. This represents the operational boundary within which the UAV’s onboard optical sensors can pinpoint the exact user location. Once a UAV breaches this visual circle, it purges the uncertainty error and engages deterministic visual tracking to navigate directly to

({\tilde{x}}_{p}, {\tilde{y}}_{p})

. Before entering this circle, the UAV operates strictly based on the fluctuating detected positions, as illustrated in Figure 1.

The visual boundary

r_{v}

is determined by UAV’s altitude and the camera’s effective field of view (FOV) required for reliable target detection (e.g.,

r_{v} = 38

m at

H = 200

m). To handle visual detection failures caused by poor lighting or physical obstacles, the system uses the minimum error

r_{m i n}

as a fail-safe boundary. If the camera fails to detect the target, the UAV switches to DT-guided navigation to safely reach the

r_{m i n}

zone. This ensures mission success with a minimal increase in flight time.

To facilitate real-time monitoring and decision-making, we maintain digital twin representations. For user node p, the digital twin state at time t is given by:

D T_{p} (t) = {({\tilde{x}}_{p}, {\tilde{y}}_{p}), r_{e} (p, t), r_{v}, ({\hat{x}}_{p} (t), {\hat{y}}_{p} (t)), v i s i t e d (t)}

(2)

where

v i s i t e d (t)

is a boolean flag indicating whether the package has been delivered.

For the analysis of the UAV, we divide the flight process into N time slots. The horizontal coordinate of the k-th UAV at the n-th time slot is

(x_{U_{k}} [n], y_{U_{k}} [n])

for

0 \leq n \leq N

. UAV operates at a predefined, constant cruising altitude H, making the path optimization essentially a 2D routing problem under 3D blockage geometry. To prevent physical collisions, we introduce a strict vertical safety margin

Δ h

, ensuring

H = max (h_{m}) + Δ h

. The UAV’s digital twin modeling is represented as:

D T_{U_{k}} (t) = {(x_{U_{k}} [n], y_{U_{k}} [n], H), V, T_{t o t a l}, τ}

(3)

where V is the cruising velocity,

T_{t o t a l}

represents the total flight time from departure to return, and

τ

records the time elapsed since the last trajectory optimization.

Considering realistic update frequencies and processing delays, we incorporate a Zero-Order Hold (ZOH) fallback mechanism [25] to handle NLoS-induced packet loss. During data dropouts, the positional uncertainty decay

r_{e} (p, t)

temporarily pauses, and conservatively navigates using the last successfully received state.

2.2. UAV Flight and Time Constraints

To focus on the complex coupling of 3D environmental blockages, positional uncertainty, and communication reliability, this study abstracts specific physical logistics constraints such as varying parcel weights and volumes. We assume that UAV possesses sufficient payload capacity for a single cluster deployment. Furthermore, UAV energy and power limitations are implicitly enforced through the strict maximum flight time constraint.

UAVs operate at a constant altitude H with an average velocity V. The continuous re-optimization strategy implies that the trajectory of UAV is not predetermined; rather, it emerges from a sequence of optimization decisions made every

τ

seconds. For each cluster

C_{j}

, a dedicated UAV must complete its delivery and data collection mission within a maximum allowed flight time

T_{c o n}

. The UAV’s flight distance between consecutive optimization instances is:

d_{f l y} (t, t + τ) = V \cdot min (τ, t_{e a r l y})

(4)

where

t_{e a r l y}

accounts for early arrival at the visual localization circle. The cumulative flight distance for the entire mission is:

D_{t o t a l} = \sum_{t = 0}^{T_{t o t a l}} d_{f l y} (t, t + τ)

(5)

Importantly, the continuous re-optimization approach provides natural robustness to time constraints: at each optimization step, the remaining time budget (

T_{c o n}

−

T_{t o t a l}

) is explicitly considered when evaluating trajectory fitness, ensuring UAV does not commit to paths it cannot complete. This operational mechanism embeds temporal safety directly into the fitness landscape of the algorithm.

2.3. Two-Jump Communication Model and Throughput Analysis

Communication processes within each cluster involve two jumps: the first jump transmits signals from user nodes to the regional UAV, and the second jump transmits signals from the UAV to the transfer station. We stipulate that only one logistics transfer station is planned per clustered region. Let the location of the transfer station within the jth cluster be denoted as

(x_{t_{j}}, y_{t_{j}})

, where

1 \leq j \leq J

. Thus, we define the set of transfer stations as

T \overset{Δ}{=} \{(x_{t_{1}}, y_{t_{1}}), (x_{t_{2}}, y_{t_{2}}) \dots (x_{t_{N_{t}}}, y_{t_{N_{t}}})\}

.

A critical addition to this model is the strict enforcement of the 3D stochastic Boolean building blockage. When the 3D spatial vector connecting the transmitter and receiver intersects the physical volume of any of the 20 building prisms, the channel transitions from a LoS state to a NLoS state [26,27,28].

To rigorously model the channel, we consider both the large-scale distance-dependent path loss (subject to 3D blockage) and the small-scale Rayleigh fading. The complex channel coefficient for the first jump is modeled as

h_{A - U} [p, k, n] = \sqrt{A C_{A - U} [p, k, n]} \cdot g_{A - U} [p, k, n]

, where

g_{A - U} [p, k, n] \sim CN (0, 1)

is the small-scale fading coefficient. This study employs the Rayleigh fading model instead of Rician fading to represent the worst-case scenario of severe signal scattering during building blockages. Optimizing the system under this strict condition guarantees equal or better reliability in actual urban environments. According to [29], the SNR for the first jump is expressed as:

\begin{matrix} {SNR}_{A - U} [p, k, n] = \frac{P_{A} [p] \cdot {| h_{A - U} [p, k, n] |}^{2}}{δ^{2}} \\ = γ_{A} [p] \cdot A C_{A - U} [p, k, n] \cdot {| g_{A - U} [p, k, n] |}^{2} \end{matrix}

(6)

Here,

P_{A} [p]

represents the transmission power from user node

A_{p}

, and we define

γ_{A} [p] = P_{A} [p] / δ^{2}

.

The term

A C_{A - U} [p, k, n]

denotes the deterministic large-scale attenuation coefficient between the detected user node position

({\hat{x}}_{p}, {\hat{y}}_{p})

and the UAV at time slot n, expressed as:

A C_{A - U} [p, k, n] = \frac{β_{e f f}}{{(x_{U_{k}} [n] - {\hat{x}}_{p})}^{2} + {(y_{U_{k}} [n] - {\hat{y}}_{p})}^{2} + H^{2}}

(7)

Crucially, the reference channel gain

β_{e f f}

dynamically shifts based on the 3D blockage intersection testing. Under pure LoS conditions, the channel incurs a negligible excess attenuation penalty. However, if the link is NLoS (obstructed by a building),

β_{e f f}

is scaled down by a severe excess loss penalty, fundamentally degrading the SNR.

β_{e f f}

dynamically shifts based on 3D building intersections. The excess penetration loss parameters are set based on the 3GPP TR 38.901 Urban Micro standard, setting a small loss for LoS paths and a severe 35 dB penetration penalty for NLoS blockages.

Similarly, the channel coefficient for the second jump is

h_{U - T} [p, k, j, n] = \sqrt{A C_{U - T} [p, k, j, n]} \cdot g_{U - T} [p, k, j, n]

, with

g_{U - T} \sim CN (0, 1)

. The corresponding SNR is:

\begin{matrix} {SNR}_{U - T} [p, k, j, n] = \frac{P_{U} [k] \cdot {| h_{U - T} [p, k, j, n] |}^{2}}{δ^{2}} \\ = γ_{U} [k] \cdot A C_{U - T} [p, k, j, n] \cdot {| g_{U - T} [p, k, j, n] |}^{2} \end{matrix}

(8)

where

γ_{U} [k] = P_{U} [k] / δ^{2}

, and the attenuation coefficient is:

A C_{U - T} [p, k, j, n] = \frac{β_{e f f}}{{(x_{U_{k}} [n] - x_{t_{j}})}^{2} + {(y_{U_{k}} [n] - y_{t_{j}})}^{2} + H^{2}}

(9)

In this logistics scenario, UAVs must parse and process data packets to acquire necessary scheduling information; thus, the Decode-and-Forward (DF) protocol is adopted. Under the half-duplex DF protocol, the equivalent end-to-end SNR is bottlenecked by the weaker link [30]:

{SNR}_{t o t a l} [p, k, j, n] = min ({SNR}_{A - U} [p, k, n], {SNR}_{U - T} [p, k, j, n])

(10)

According to [31], the throughput in the

n^{t h}

timeslot within the

j^{t h}

cluster region can be expressed as:

R [p, j, n] = \sum_{k} \frac{1}{2} {log}_{2} (1 + {SNR}_{t o t a l} [p, k, j, n])

(11)

Therefore, the total throughput within this area can be calculated as:

R_{t o t a l} [p, j] = \sum_{n} R [p, j, n]

(12)

where the summation extends over all time slots during which the UAV serves user node p.

2.4. Outage Probability Analysis

To achieve stable and reliable communication within the heavily obstructed urban logistics system, the system’s outage probability must be rigorously analyzed and minimized.

An outage is defined as the event where the end-to-end

S N R_{t o t a l}

falls below a predefined threshold

η

. In a DF relay system, a successful end-to-end transmission mandates both the A-U and U-T links to independently operate above this threshold. Given that the small-scale fading components

g_{A - U}

and

g_{U - T}

follow a standard complex Gaussian distribution, their respective channel power gains

| g_{A - U} |^{2}

and

| g_{U - T} |^{2}

strictly follow an exponential distribution

Exp (1)

with a unit mean.

Consequently, within the jth region and the nth time slot, the system outage probability can be rigorously derived by applying the cumulative distribution function (CDF) of the exponential distribution [32]:

\begin{matrix} P_{o u t a g e} [n, j] = 1 - P ({SNR}_{A - U} [p, k, n] \geq η) \\ \cdot P ({SNR}_{U - T} [p, k, j, n] \geq η) \\ = 1 - e^{(- \frac{η}{γ_{A} \cdot A C_{A - U} [p, k, n]})} \cdot e^{(- \frac{η}{γ_{U} \cdot A C_{U - T} [p, k, j, n]})} \\ = 1 - e^{[- η (\frac{1}{γ_{A} \cdot A C_{A - U} [p, k, n]} + \frac{1}{γ_{U} \cdot A C_{U - T} [p, k, j, n]})]} \end{matrix}

(13)

In our 3D urban model, this formulation explicitly demonstrates how the 3D Boolean building blockage logic governs communication reliability. When a communication ray intersects a high-rise structure, the severe NLoS penalty drastically reduces

β_{e f f}

within the attenuation coefficients (

A C_{A - U}

and

A C_{U - T}

). This reduction mathematically amplifies the negative exponent in the equation above, causing the power outage probability

P_{o u t a g e}

to continuously increase.

The total outage cost accumulated over the entire flight mission is thus computed as

P_{o u t} = \sum_{n} P_{o u t a g e} [n, j]

(14)

3. Optimization of Transfer Station Locations and Traversal Trajectories

To achieve maximal system utility, we aim to satisfy the rigid minimum throughput requirements and minimize the cumulative outage probability and the total flight time under severe 3D environmental blockage. In recent high-tier studies, balancing the trade-off between communication reliability (e.g., outage probability, signaling delay) and UAV flight efficiency is widely formulated as a critical multi-objective optimization challenge [33,34]. The problem requires the joint optimization of transfer station deployment

T_{j}

, real-time uncertainty transition, and continuous trajectory planning

U (t)

.

While recent studies successfully address similar joint deployment and trajectory planning challenges using bilevel optimization approaches [35], our framework must further account for continuous spatial uncertainties and dynamic 3D NLoS blockages.

3.1. Blockage-Aware Station Deployment and Clustering

Directly optimizing the routing sequence across P broadly scattered user nodes incurs an intractable combinatorial complexity of

O (P!)

. To decouple this massive state space, we employ DBSCAN algorithm. Recent study highlights that density-based clustering efficiently groups users based on spatial distribution and network connectivity, significantly improving UAV deployment efficiency in complex environments [36]. Utilizing an empirically calibrated search radius

ϵ

and a minimum neighbor threshold

m i n P t s

, the algorithm partitions the nodes into highly cohesive delivery zones

C = {C_{1}, C_{2}, \dots, C_{J}}

.

Unlike K-means, which requires a predefined number of clusters and often places transfer stations in suboptimal empty areas, DBSCAN groups nodes based on actual spatial density. Furthermore, we do not employ hierarchical clustering due to its high computational complexity, which hinders real-time routing. Capacitated clustering is also unsuitable, as it prioritizes capacity constraints over spatial proximity, potentially forcing distant nodes together and degrading communication links. DBSCAN resolves these issues by adapting to irregular user distributions and identifying isolated nodes as noise. This density-based approach significantly reduces the total flight range and outage probability, thereby improving overall communication reliability.

By assigning one UAV to each isolated cluster, the swarm routing is divided into independent tasks. The transfer stations are sparsely distributed to ensure large spatial separation between different clusters. This spatial isolation, combined with safety buffers between operational zones and altitude redundancy in flight paths, naturally prevents inter-UAV collisions. Consequently, the framework effectively eliminates the risk of swarm conflicts and removes the need for complex cooperative scheduling.

For a given cluster

C_{j}

, the preliminary spatial centroid is defined as

T_{j}^{(0)} = ({\bar{x}}_{C_{j}}, {\bar{y}}_{C_{j}})

. The average intra-cluster spatial distance

{\bar{d}}_{j}

is formalized as:

{\bar{d}}_{j} = \frac{1}{| C_{j} |} \sum_{p \in C_{j}} \sqrt{{({\hat{x}}_{p} - x_{t_{j}})}^{2} + {({\hat{y}}_{p} - y_{t_{j}})}^{2}}

(15)

To integrate physical obstacles, let the 3D building matrix be defined as

B = {B_{1}, \dots, B_{N_{b}}}

, where the geometrical constraints of the m-th building are given by its Axis-Aligned Bounding Box (AABB) coordinates and height:

B_{m} = {x_{m}, y_{m}, w_{m}, l_{m}, h_{m}}

. To prevent deployment within signal shadows or structural volumes, a 2D spatial safety margin function is formulated as:

D_{m a r g i n} (T_{j}, B_{m}) = \sqrt{{(x_{t_{j}} - x_{m})}^{2} + {(y_{t_{j}} - y_{m})}^{2}}

(16)

We introduce a binary constraint indicator

σ_{j} \in {0, 1}

to signify the spatial viability of the transfer station

T_{j}

:

σ_{j} = \prod_{m = 1}^{N_{b}} Θ (D_{m a r g i n} (T_{j}, B_{m}) \geq d_{m a r g i n})

(17)

where

Θ (\cdot)

is the symbolic indicator function. If

σ_{j} = 0

,

T_{j}

is iteratively repelled toward the topological median of the open spatial grid until the condition is satisfied.

3.2. Digital Twin State and Uncertainty Transition

The Digital Twin state mitigates positional randomness by continuously narrowing the spatial error circle

r_{e} (p, t)

associated with the detected coordinates

({\hat{x}}_{p}, {\hat{y}}_{p})

. The decay function of the error radius is explicitly given by:

r_{e} (p, t) = max (r_{m i n}, r_{e_{0}} \cdot e^{- λ \cdot n_{d} (p, t)})

(18)

where

r_{e_{0}}

reflects the maximum initial Global Navigation Satellite System (GNSS) multipath drift typical in urban canyons, and

r_{m i n}

is the irreducible minimum residual error bounded by hardware GPS limits and UAV hovering jitter. The variable

n_{d} (p, t)

denotes the number of successfully synchronized DT observations, and

λ

controls the information gain rate of the spatial calibration.

The transition from stochastic probability to deterministic visual tracking is triggered strictly by a continuous geometric evaluation. Let UAV flight direction vector over interval

τ

be

d = U (t) - U (t - τ)

, and the relative vector from the estimated node center

C_{p}

to the UAV’s position be

f = U (t - τ) - C_{p}

. The geometric intersection with the visual boundary

r_{v}

is solved via the quadratic discriminant:

Δ (t) = 4 {(f \cdot d)}^{2} - 4 (d \cdot d) (f \cdot f - r_{v}^{2})

(19)

The visual state transition indicator

S_{v i s u a l} (t) \in {0, 1}

is formulated as:

S_{v i s u a l} (t) = Θ (Δ (t) \geq 0 \land t_{r o o t} \in [0, 1])

(20)

When

S_{v i s u a l} (t) = 1

, the deterministic tracking mode is triggered, and the uncertainty layer of the corresponding node is cleared in the digital twin, forcing UAV’s heading vector to point directly toward the true target coordinates

({\tilde{x}}_{p}, {\tilde{y}}_{p})

.

However, in bad weather with low visibility, visual sensors may fail to trigger the state transition (

S_{v i s u a l} (t) = 1

) and the geometric discriminant

Δ (t) < 0

persists. To handle this, an autonomous fallback is integrated. UAV maintains the stochastic navigation mode, and the digital twin still shrinks the error circle to a minimum of

r_{m i n}

meters. This allows UAV to safely execute a degraded delivery within a

r_{m i n}

-meter range of the target. Therefore, it successfully prevents system deadlock and mission failure without visual input.

3.3. 3D Boolean Blockage and Attenuation Formulation

We denote the 3D spatial ray connecting

U (t)

and node

N_{p} = ({\hat{x}}_{p}, {\hat{y}}_{p}, 0)

as

L_{U, N}

. For any building

B_{m}

, let

g (L_{U, N}, B_{m})

represent the z-coordinate of the intersection point between the ray and the 2D footprint of the m-th building. The cumulative blockage state across all

N_{b}

buildings is:

I_{b l o c k} (U (t), N_{p}) = max_{m \in B} {Θ (L_{U, N} \cap B_{m}^{2 D} \neq Ø) \cdot Θ (g (L_{U, N}, B_{m}) \leq h_{m})}

(21)

where

B_{m}^{2 D}

represents the 2D projected footprint of the m-th building on the ground plane. This binary state completely dictates the physical channel attenuation penalty

η_{l o s s}

(in decibels) applied to the system:

η_{l o s s} (t) = η_{L o S} \cdot (1 - I_{b l o c k} (U (t), N_{p})) + η_{N L o S} \cdot I_{b l o c k} (U (t), N_{p})

(22)

Modeling buildings as AABB prisms is a simplified approach that ignores edge diffraction and signal reflections. Although ray-tracing provides accurate channel estimations, it is mainly used for offline modeling. Since our research focuses on real-time trajectory re-planning, our system adds a safety distance (

d_{m a r g i n}

) to keep UAV away from building edges. Using bounding boxes with safety margins is a common method in UAV routing to ensure collision avoidance and reduce computational complexity [37,38]. Furthermore, we include small-scale fading in the channel model to account for random reflection errors.

3.4. Formulation of Throughput Constraints and Penalty Metric

In the proposed optimization framework, maintaining high-capacity data synchronization is an active, strict constraint imposed upon the continuous spatial decision variable

U (t)

. The instantaneous effective

S N R_{e f f} (U (t))

over the UAV-to-station jump is formulated as a highly non-linear function dependent on the UAV’s position and the blockage indicator:

S N R_{e f f} (U (t)) = \frac{γ_{U} \cdot β_{0}}{({‖ U (t) - T_{j} ‖}_{2}^{2} + H^{2}) \cdot 10^{\frac{η_{l o s s} (U (t))}{10}}}

(23)

Because the 3D Boolean blockage indicator introduces extreme discontinuity into the SNR landscape, solving this constrained problem via traditional gradient-based interior-point methods is mathematically intractable. To facilitate resolution via the Genetic Algorithm, we relax this hard constraint into a differentiable throughput penalty metric

F_{R} (U (t))

:

F_{R} (U (t)) = \frac{1}{{min}_{p \in C_{j}} (\sum_{t \in T_{p}} R_{i n s t} (U (t)))}

(24)

3.5. Environment-Aware Adaptive Weighting Mechanism

To dynamically adapt the fitness function to the varying topological characteristics of different urban clusters, we design an environment-aware multi-objective weighting mechanism. For any cluster

C_{j}

, utilizing the user scale

N_{j} = | C_{j} |

and the average intra-cluster spatial distance

{\bar{d}}_{j}

defined in (16), the structural density factor is formulated as

{\bar{ρ}}_{j} = 1 / {\bar{d}}_{j}

. The normalized regional features are extracted as:

\begin{matrix} {\hat{N}}_{j} = \frac{N_{j}}{\sum_{k = 1}^{J} N_{k}}, {\hat{ρ}}_{j} = \frac{{\bar{ρ}}_{j}}{\sum_{k = 1}^{J} {\bar{ρ}}_{k}} \end{matrix}

(25)

To balance the conflicting objectives of outage minimization (prioritized in highly dense areas), throughput maximization, and flight time efficiency (prioritized in sparse, low-user areas), the initial multi-objective weights are mapped linearly:

\begin{matrix} ω_{10} [j] = α \cdot {\hat{N}}_{j} + (1 - α) \cdot {\hat{ρ}}_{j} \end{matrix}

(26)

\begin{matrix} ω_{20} [j] = β \cdot {\hat{N}}_{j} + ψ_{0} \end{matrix}

(27)

\begin{matrix} ω_{30} [j] = γ \cdot (1 - {\hat{N}}_{j}) + (1 - γ) \cdot (1 - {\hat{ρ}}_{j}) \end{matrix}

(28)

where

α, β, γ \in [0, 1]

are empirical balancing coefficients, and

ψ_{0}

is a constant ensuring a baseline throughput weight for small-scale clusters. The ultimate normalized weight vector

ω_{j} = [ω_{1} [j], ω_{2} [j], ω_{3} [j]]

governing the optimization trajectory is given by:

\begin{matrix} ω_{i} [j] = \frac{ω_{i 0} [j]}{\sum_{k = 1}^{3} ω_{k 0} [j]}, \forall i \in {1, 2, 3} \end{matrix}

(29)

3.6. Joint Optimization Problem Formulation

Synthesizing the mathematical components established above, the path planning and station deployment challenge fundamentally relies on bridging the physical 3D environment with the virtual Digital Twin state. We formally cast this as a continuous multi-objective Mixed-Integer Nonlinear Programming (MINLP) problem. The overall joint optimization problem

(P_{1})

is formulated as:

\begin{matrix} (P_{1}) : min_{U (t), T_{j}} \sum_{j = 1}^{J} \{\begin{matrix} ω_{1} [j] \cdot P_{o u t} (U (t), T_{j}) + ω_{2} [j] \cdot F_{R} (U (t)) + ω_{3} [j] \cdot (\frac{T_{t o t a l} (U (t))}{T_{c o n}}) \end{matrix}\} \\ s . t . C 1 : σ_{j} (T_{j}, B) = 1, \forall j \in {1, \dots, J} \\ C 2 : \sum_{t \in T} S_{v i s u a l} (p, t) \geq 1, \forall p \in C_{j} \\ C 3 : I_{b l o c k} (U (t), N_{p}) \in {0, 1}, \forall t \in T_{t o t a l} \\ C 4 : ‖ U (t) - U (t - τ) ‖_{2} \leq V τ, \forall t \in T_{t o t a l} \\ C 5 : H = 200 \geq max_{m \in B} (h_{m}) \\ C 6 : T_{t o t a l} (U (t)) \leq T_{c o n} \end{matrix}

(30)

The feasibility of

(P_{1})

is bounded by constraints (C1)–(C6), which mathematically enforce the 2D station safety margin, visual localization triggers, 3D NLoS penalty bounds, and fundamental UAV physical limits (i.e., UAV mobility constraints, safe altitude, and battery endurance), respectively.

Furthermore, to ensure the operational feasibility of these constraints under dynamic conditions, the system incorporates specific handling protocols. In practical scenarios, to mitigate potential violations of the flight time constraint (C6) caused by excessive detours, we implement an emergency return protocol. If the remaining time

T_{c o n} - T_{t o t a l}

falls below the estimated return time plus a safety margin, UAV immediately abandons unvisited nodes and returns to the transfer station. These nodes are then rescheduled for the next deployment cycle to guarantee UAV safety.

By constructing

(P_{1})

in this highly coupled manner, adaptive GA is mathematically compelled to leverage DT’s real-time error reductions while organically shaping the trajectory

U (t)

around the explicitly defined 3D NLoS penalty walls to find the LoS-optimal flight corridors.

To effectively solve the formulated MINLP problem

(P_{1})

, we propose a dynamic, iterative execution architecture, as illustrated in Figure 2.

Rather than functioning as a static, one-time path planner, the proposed flowchart highlights the receding-horizon execution logic driven by the digital twin. The theoretical parameters derived in Section 2—specifically the positional uncertainty

r_{e} (p, t)

and the 3D NLoS penalty

η_{l o s s}

—are not evaluated a priori. Instead, they act as dynamic triggers synchronized within a

τ

-second feedback loop. By embedding the boolean environment evaluation directly into the fitness assessment phase of multi-objective GA, the system effectively decouples the complex constraints of

(P_{1})

. This continuous feedback mechanism ensures that UAV can instantaneously adapt its sequence and waypoints to bypass unforeseen building blockages, successfully translating the theoretical MINLP into executable, low-latency flight commands.

To further validate the computational feasibility of these real-time commands, we evaluate the complexity of the proposed optimization framework. The computational complexity per optimization cycle is estimated at

O (G_{m a x} \cdot P_{s i z e} \cdot N_{j} \cdot N_{b})

, where

G_{m a x}

and

P_{s i z e}

denote the maximum iterations and population size of the GA, while

N_{j}

and

N_{b}

represent the number of nodes and buildings within a cluster, respectively. Through the application of DBSCAN spatial clustering, which partitions the global network into localized zones, the computational overhead is largely kept within a manageable range, effectively mitigating the impact of combinatorial complexity typical of traditional routing. This ensures that the receding-horizon logic maintains real-time performance even as urban complexity increases.

4. Numerical Results and Discussion

We consider a two-dimensional region with an area of

1200 \times 1200 m^{2}

where

P = 48

user nodes are randomly distributed. Each user node has a transmission power of

P_{A} [p] = 1000 mW

, UAV relay power is

P_{U} [k] = 1000 mW

, the noise power spectral density is

- 169 dBm / Hz

, the outage probability threshold is

η = 0.5

, the minimum throughput requirement is

5.3 bps / Hz

, and the maximum allowable flight time is

T_{c o n} = 800 s

.

For the digital twin component, we set the initial error circle radius

r_{e_{0}} = 15 m

and the minimum error circle radius

r_{m i n} = 5 m

. The visual positioning circle radius is

r_{v} = 38 m

. The position detection interval and trajectory optimization interval during flight are set to

τ = 10 s

. The position error is assumed to follow a uniform distribution within the error circle.

The DBSCAN clustering parameters are set with a neighborhood radius of

ϵ = 140 m

and minimum points

m i n P t s = 3

. The Genetic Algorithm is configured with a population size of 50, a maximum of 500 iterations, a crossover rate of 0.8, and a mutation rate of 0.1.

Regarding the configuration of high-rise obstructions, we define the drone’s flight altitude as

H = 200 m

, while buildings are randomly generated within this space, with heights uniformly distributed within the range

h_{m} \in [50, 200) m

.

To ensure a fair comparison, all baseline methods are evaluated under identical experimental configurations. Specifically, all schemes share the same random seeds for 3D building generation, user node distribution, and initial positional errors. Furthermore, the energy, communication, and kinematic parameters remain strictly consistent across all evaluations.

It is worth noting that the presented numerical results are average values derived from extensive Monte Carlo simulations across random 3D urban topologies, ensuring statistical robustness. Furthermore, given that most existing DT-assisted path planning studies focus on distance minimization, we abstracted this core logic into the evaluated Distance-centric scheme to conduct a fair comparison. This confirms that relying solely on DT state updates is insufficient; the system must couple DT tracking with physical-layer NLoS avoidance.

All numerical simulations were executed on a workstation equipped with an [Intel Core i7-12700H CPU and 16 GB RAM] using MATLAB (R2024a).

Figure 3 illustrates the system performance under varying DBSCAN clustering radii (

ϵ

). We select

ϵ = 140

m as the optimal threshold, because excessively large radii indiscriminately merge geographically isolated nodes, which substantially increases both flight detours and severe NLoS outage risks.

Figure 4 verifies the computational efficiency of the proposed hybrid GA. The total objective cost consistently converges within 15 generations with minimal standard deviation, ensuring extremely low calculation latency for real-time trajectory re-optimization. In addition, empirical measurements were conducted to evaluate the real-time processing capability. Experimental observations suggest that even as the intra-cluster node count

N_{j}

increases from 10 to a stress-test scale of 50, the average processing time per GA iteration increases by approximately 36.48 ms. Therefore, this level of efficiency supports the feasibility of completing the re-optimization within the

τ = 10

s interval.

Figure 5 illustrates the average outage probability versus the initial error radius

(r_{e})

for three path planning schemes, evaluated over multiple Monte Carlo simulations in 3D urban environments. The Proposed Scheme consistently achieves the lowest and most stable outage probability. By integrating digital twin continuous learning with communication-aware trajectory optimization, UAV dynamically reduces spatial uncertainty and proactively circumvents severe NLoS blockages. In contrast, the baseline schemes suffer from significant performance degradation. Baseline 2 (Pure Geo-GA), which minimizes only geometric distance, exhibits the highest outage probability. It frequently executes direct spatial crossings, forcing the UAV into building shadows with severe penetration losses. Meanwhile, Baseline 1 (No DT Updates) experiences high performance fluctuation; lacking dynamic error reduction, the UAV navigates under persistent positional uncertainty, leading to redundant detours that increase exposure to hazardous NLoS conditions.

These results demonstrate that combining DT-driven uncertainty reduction with physical-layer-aware planning is essential for robust UAV networks.

Figure 6 illustrates the system’s robustness against DT packet loss under different update frequencies. As shown, a high-frequency update provides only limited performance gains over the baseline. Moreover, it doubles the signaling overhead and may not provide enough computational time for the GA to fully converge. Conversely, a delayed update leads to severe performance fluctuations. These results indicate that

τ = 10

s serves as a relatively better operational trade-off for the system. Furthermore, even under severe packet loss, the ZOH fallback mechanism prevents system deadlock. Notably, the system exhibits a non-monotonic trend: at high loss rates (30–

40 %

), prolonged ZOH state freezing limits dynamic maneuvering, causing a straighter flight. This paradoxically reduces costs compared to the over-correction peak at

20 %

, but inherently sacrifices active NLoS avoidance.

Figure 7 demonstrates the normalized performance impact—specifically regarding total outage cost and total flight time—as the visual locating radius (

r_{v}

) varies. The data is smoothed and scaled using Min-Max normalization. Overall, both the outage cost and the flight time exhibit a downward trend as the visual radius expands. This confirms the early interception effect of the onboard visual sensors. A broader visual range enables the continuous boundary intersection mechanism to capture the true target coordinates earlier, proactively truncating the redundant detours caused by initial location errors.

Locally, the two performance metrics demonstrate a highly synchronized positive correlation, reflecting the non-convex geometric characteristics of 3D dense urban environments. Slight variations in the visual radius alter the equivalent capture boundaries of the target nodes, occasionally triggering the genetic algorithm to converge on a different local optimal topological sequence. Extended detour trajectories inherently carry a higher probabilistic risk of intersecting with building-induced NLoS shadows. Therefore, the flight distance and the communication outage cost are tightly bound.

Figure 8 presents a comprehensive evaluation of UAV flight trajectories under three distinct path planning strategies. The top row (a–c) provides 3D spatial visualizations, where grey blocks represent high-rise obstacles and dashed circles indicate the spatial uncertainty regions of ground target nodes. The colored solid lines here trace the actual UAV flight trajectories. The bottom row (d–f) complements this with top-down 2D projections overlaid on instantaneous communication outage probability heatmaps. In these heatmaps, lower-outage areas (dark blue) represent robust LoS communication corridors, whereas higher-outage areas (red and yellow) indicate severe signal degradation. The green stars denote the deployed locations of the logistics transfer stations.

By analyzing the baselines, the contrasting routing behaviors become evident. When applying the Distance-centric scheme, shown in (c) and (f), the UAV strictly prioritizes the shortest geometric path without channel awareness. Consequently, the trajectory forms direct line segments that frequently intersect the physical building footprints, flying directly into the red NLoS shadow zones and suffering from severe link disconnections. Conversely, the Outage-centric strategy, depicted in (a) and (d), strictly prioritizes physical-layer channel quality over flight efficiency. To maintain LoS links, it confines the UAV entirely within the safe blue corridors, resulting in highly meandering, fragmented obstacle-avoidance maneuvers that substantially penalize kinematic efficiency and increase total energy consumption.

The Proposed balanced scheme, illustrated in (b) and (e), effectively resolves this trade-off by combining DT-driven spatial uncertainty reduction with a communication-aware optimization objective. The 3D visualization (b) shows the UAV intelligently navigating through the gaps between buildings with calculated trajectory deviations. Furthermore, the 2D heatmap (e) highlights the algorithmic intelligence of this approach: the DT-assisted trajectory dynamically skims the boundaries of the shadow zones (cyan and light blue regions). It proactively avoids critical red blockages to prevent severe outages, yet accepts minor, tolerable channel fluctuations to maintain a highly efficient and relatively direct overall flight path.

Furthermore, the proposed framework is designed to actively reduce the negative impacts of poor or variable communication qualities. The system model clearly includes small-scale Rayleigh fading and heavy NLoS building blockage penalties. In situations with unstable links, the environment-aware weighting mechanism automatically changes the optimization priorities. Specifically, by increasing the weight of communication reliability (

ω_{1}

) in high-density areas, the UAV actively avoids fading zones. Combined with the re-optimization cycle, this continuous update method ensures that the system can maintain the required performance metrics even in difficult urban communication environments.

5. Conclusions

In this paper, we addressed recipient positional uncertainty and 3D building blockages in low-altitude UAV logistics networks by proposing a (DT)-driven trajectory and resource optimization framework.

The proposed DT architecture utilizes a dual-layer spatial representation with a dynamically decaying uncertainty radius, enabling seamless transitions to deterministic visual target tracking. To manage large-scale swarm complexity, we integrate DBSCAN for strategic spatial clustering and transfer station deployment. Furthermore, an adaptive multi-objective Genetic Algorithm is implemented for continuous trajectory re-optimization, dynamically balancing cumulative outage probability and flight time under strict throughput constraints.

Extensive simulations in dense 3D urban environments confirm that our approach proactively circumvents severe NLoS blockages. By effectively addressing the trade-off between communication reliability and kinematic efficiency, this DT-assisted, physical-layer-aware paradigm provides a highly viable solution for robust UAV delivery networks. The source code and implementation details for the proposed balanced scheme are provided in the Supplementary Materials.

In future work, we will focus on three key directions. First, we plan to validate the proposed framework through hardware-in-the-loop (HIL) simulations or small-scale flight tests to evaluate performance under real-world channel and physical positioning errors. Second, we will explore deep reinforcement learning to enhance UAV decision-making in complex urban areas. Finally, we aim to investigate the cooperation between UAVs and ground vehicles to create an integrated air–ground delivery network.

Supplementary Materials

The code repository is openly available in Github at https://github.com/zuiweng-tong/UAV-Trajectory-Optimization-and-Ablation-experiment (accessed on 31 March 2026).

Author Contributions

Conceptualization, H.T. and J.S.; methodology, H.T. and J.S.; software, H.T., Z.S. and Z.Z.; validation, H.T. and Z.S.; formal analysis, H.T. and Z.Z.; writing—original draft preparation, H.T.; writing—review and editing, J.S.; supervision, J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China, grant number 23KJA510004, and the Open Project of Shaanxi Key Laboratory of Information Communication Network and Security, grant number ICNS202507.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

AABB	Axis-Aligned Bounding Box
B5G	Beyond 5G
CDF	Cumulative distribution function
CSI	Channel State Information
DBSCAN	Density-Based Spatial Clustering of Applications with Noise
DF	Decode-and-Forward
DT	Digital twin
GA	Genetic Algorithm
GNSS	Global Navigation Satellite System
GPS	Global Positioning System
LoS	Line-of-sight
MINLP	Mixed-Integer Nonlinear Programming
NLoS	Non-line-of-sight
SNR	Signal-to-noise ratio
TSP	Traveling Salesman Problem
UAV	Unmanned aerial vehicle
VRP	Vehicle Routing Problem
6G	Sixth Generation

References

Yang, L.; Guo, D.; Liu, Y.; Feng, L. Joint trajectory and power optimization for UAV-assisted communication networks. In Proceedings of the 2024 10th International Conference on Computer and Communications (ICCC), Chengdu, China, 13–16 December 2024. [Google Scholar]
He, H.; Yuan, W.; Chen, S.; Yang, J. Two-Timescale Trajectory Planning for UAV Formation Serving Hotspot in Unknown Environments with Complex Obstacles. IEEE Trans. Cogn. Commun. Netw. 2026, 12, 4261–4276. [Google Scholar] [CrossRef]
Dabiri, M.T.; Hasna, M. A Novel MRR-UAV-Based Relay with Optical Network Coding: A Comparative Study with Optical IRS and Conventional UAV Relaying. IEEE J. Sel. Areas Commun. 2025, 43, 1607–1620. [Google Scholar] [CrossRef]
Yin, D.; Yang, X.; Yu, H.; Chen, S.; Wang, C. An Air-to-Ground Relay Communication Planning Method for UAVs Swarm Applications. IEEE Trans. Intell. Veh. 2023, 8, 2983–2997. [Google Scholar] [CrossRef]
Wang, J.; Liu, M.; Sun, J.; Gui, G.; Gacanin, H. Multiple Unmanned-Aerial-Vehicles Deployment and User Pairing for Nonorthogonal Multiple Access Schemes. IEEE Internet Things J. 2021, 8, 1883–1895. [Google Scholar] [CrossRef]
Li, M.; Liu, X.; Wang, H. Completion Time Minimization Considering GNs’ Energy for UAV-Assisted Data Collection. IEEE Wirel. Commun. Lett. 2023, 12, 2128–2132. [Google Scholar] [CrossRef]
Wang, X.; Ma, T.; Zhang, L. Rendezvous Trajectory Planning for Air-Launched UAV Swarms Using Wind Energy. IEEE Access 2024, 12, 168531–168546. [Google Scholar] [CrossRef]
Gong, H.; Huang, B.; Jia, B.; Hao, L.; Shi, Z. Jointly Optimizing the Energy and Time for Multi-UAV 3-D Coverage of Terrestrial Regions. IEEE Trans. Mob. Comput. 2025, 24, 10312–10329. [Google Scholar] [CrossRef]
Zhao, H.; Hao, Q.; Huang, H.; Gui, G.; Ohtsuki, T.; Sari, H.; Adachi, F. Online Trajectory Optimization for Energy-Efficient Cellular-Connected UAVs with Map Reconstruction. IEEE Trans. Veh. Technol. 2024, 73, 3445–3456. [Google Scholar] [CrossRef]
Huang, Y.; Wang, H.; Bai, X.; Cai, X.; Yu, H.; Ren, Y. Biomimetic Multi-UAV Swarm Exploration with U2U Communications Under Resource Constraints. IEEE Trans. Veh. Technol. 2025, 74, 9750–9766. [Google Scholar] [CrossRef]
Zhang, S.; Li, J.; Liu, C.; Fu, L.; Bai, Z.; Li, J. SIGMA: An Agent-Based Modeling UAV Swarm Simulator for Swarm Intelligence Algorithms. IEEE Trans. Autom. Sci. Eng. 2025, 22, 19694–19708. [Google Scholar] [CrossRef]
Hu, W.; Ma, X. Optimization Algorithm of UAVs Task Assignment and Path Planning Based on Dynamic Cluster Particle Swarm Optimization. IEEE Trans. Intell. Transp. Syst. 2025, 26, 18157–18169. [Google Scholar] [CrossRef]
Chen, S.; Li, W.; Sun, J.; Pace, P.; He, L.; Fortino, G. An Efficient Collaborative Task Offloading Approach Based on Multi-Objective Algorithm in MEC-Assisted Vehicular Networks. IEEE Trans. Veh. Technol. 2025, 74, 11249–11263. [Google Scholar] [CrossRef]
Wang, B.; Zhang, Z.; Song, Y.; Chen, M.; Liu, D. nPGSAO: A Hybrid Particle Swarm Optimization and Genetic Algorithm with Niching Technology for Edge Server Placement. IEEE Internet Things J. 2025, 12, 19370–19383. [Google Scholar] [CrossRef]
Gong, Y.; Zhou, J.; Wu, Q.; Zhou, M.; Wen, J. A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection. IEEE/CAA J. Autom. Sin. 2023, 10, 1834–1844. [Google Scholar] [CrossRef]
Wang, C.; Han, Y.; Zhang, L.; Jia, Z.; Zhang, H.; Hong, C.S.; Han, Z. Computing Power in the Sky: Digital Twin-Assisted Collaborative Computing with Multi-UAV Networks. IEEE Trans. Veh. Technol. 2025, 74, 14466–14482. [Google Scholar] [CrossRef]
Wang, B.; Sun, Y.; Dobre, O.A.; Nguyen, L.D.; Duong, T.Q. Digital Twin-Enabled Task-Driven UAV Communications Under Uncertainty. IEEE Trans. Veh. Technol. 2025, 74, 8454–8459. [Google Scholar] [CrossRef]
Luo, L.; Tang, F.; Chen, H.; Zhang, S.; Chen, X.; Zhao, M. Building Digital Twin Networks for Heterogeneous UAV Clusters: A Low-Latency Agent Gateway Selection Method. IEEE Trans. Cogn. Commun. Netw. 2026, 12, 1411–1419. [Google Scholar] [CrossRef]
Li, Z.; Lei, L.; Shen, G.; Cao, P.; Liu, X. Digital Twin-Assisted Path Planning for AAV Swarm Based on Improved Polar Lights Optimization. IEEE Internet Things J. 2026, 13, 2268–2284. [Google Scholar] [CrossRef]
Du, J.; Wang, J.; Li, S.; Liu, L.; Chu, X.; Chen, X.; Dong, M. Intelligent optimizations for UAV, digital twin, and ISCC enabled intelligent transportation systems. IEEE Trans. Intell. Transp. Syst. 2025, 26, 23069–23083. [Google Scholar] [CrossRef]
Li, G.; Luan, T.H.; Lai, C.; Zheng, J.; Lu, R. DTHA: A Digital Twin-Assisted Handover Authentication Scheme for 5G and Beyond. IEEE Trans. Dependable Secur. Comput. 2025, 22, 6422–6440. [Google Scholar] [CrossRef]
Belgiovine, M.; Dick, C.; Chowdhury, K. Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations. IEEE Trans. Mob. Comput. 2026, 25, 3920–3935. [Google Scholar] [CrossRef]
Kumar, M.J.; Singh, S.; Saneen, A.; Thomas, D. Digital Twin for Drone Indoor Autonomous Navigation. IEEE Sens. Lett. 2026, 10, 6001104. [Google Scholar] [CrossRef]
Yang, L.; Cao, C.; Zhao, Q.; Wu, S.; Fan, A. A Dual-Layer Mixture of Expert Model-Based Human-Like Strategy for Autonomous Driving Velocity Control. IEEE Trans. Intell. Transp. Syst. 2025, 26, 20628–20641. [Google Scholar] [CrossRef]
Schenato, L.; Sinopoli, B.; Franceschetti, M.; Poolla, K.; Sastry, S.S. Foundations of control and estimation over lossy networks. Proc. IEEE 2007, 95, 163–187. [Google Scholar] [CrossRef]
Al-Hourani, A.; Kandeepan, S.; Jamalipour, A. Modeling air-to-ground path loss for low altitude platforms in urban environments. In Proceedings of the 2014 IEEE Global Communications Conference, Austin, TX, USA, 8–12 December 2014. [Google Scholar]
Pan, M.; Aryendu, I.; Wang, Y. Bayesian Cooperative LOS/NLOS Classification with Domain Insights and Model Refinement for UAV Communication. IEEE Trans. Veh. Technol. 2025. [Google Scholar] [CrossRef]
Xie, W.; Sun, G.; Liu, B.; Li, J.; Wang, J.; Du, H.; Niyato, D.; Kim, D.I. Joint Optimization of UAV-Carried IRS for Urban Low Altitude mmWave Communications with Deep Reinforcement Learning. IEEE Trans. Mob. Comput. 2026, 25, 1381–1397. [Google Scholar] [CrossRef]
Sun, J.; Zhang, H.; Wang, X.; Yang, M.; Zhang, J.; Li, H.; Gong, C. Leveraging UAV-RIS Reflects to Improve the Security Performance of Wireless Network Systems. IEEE Netw. Lett. 2023, 5, 81–85. [Google Scholar] [CrossRef]
Laneman, J.N.; Tse, D.N.C.; Wornell, G.W. Cooperative diversity in wireless networks: Efficient protocols and outage behavior. IEEE Trans. Inf. Theory 2004, 50, 3062–3080. [Google Scholar] [CrossRef]
Jiang, X.; Wu, Z.; Yin, Z.; Yang, W.; Yang, Z. Trajectory and Communication Design for UAV-Relayed Wireless Networks. IEEE Wirel. Commun. Lett. 2019, 8, 1600–1603. [Google Scholar] [CrossRef]
Hua, M.; Wang, Y.; Zhang, Z.; Li, C.; Huang, Y.; Yang, L. Outage probability minimization for low-altitude UAV-enabled full-duplex mobile relaying systems. China Commun. 2018, 15, 9–24. [Google Scholar] [CrossRef]
Li, J.; Sun, G.; Duan, L.; Wu, Q. Multi-Objective Optimization for UAV Swarm-Assisted IoT with Virtual Antenna Arrays. IEEE Trans. Mob. Comput. 2024, 23, 4890–4907. [Google Scholar] [CrossRef]
Deng, Y.; Zhang, S.; Meer, I.A.; Ozger, M.; Cavdar, C. Joint Trajectory and Handover Management for UAVs Co-Existing with Terrestrial Users: A Multi-Agent DRL Approach. IEEE Trans. Cogn. Commun. Netw. 2026, 12, 1195–1210. [Google Scholar] [CrossRef]
Han, S.; Zhu, K.; Zhou, M.; Liu, X. Joint deployment optimization and flight trajectory planning for UAV assisted IoT data collection: A bilevel optimization approach. IEEE Trans. Intell. Transp. Syst. 2022, 23, 21492–21504. [Google Scholar] [CrossRef]
Wu, F.; Wang, Z.; Cao, J.; Peng, S.; Xu, Y.; Gao, Y.; Wu, Q.; Yang, D. Radio Map-Based Delivery Sequence Design and Trajectory Optimization in UAV Cargo Delivery Systems. IEEE Trans. Mach. Learn. Commun. Netw. 2026, 4, 17–32. [Google Scholar] [CrossRef]
Zhou, B.; Gao, F.; Wang, L.; Liu, C.; Shen, S. Robust and Efficient Quadrotor Trajectory Generation for Fast Autonomous Flight. IEEE Robot. Autom. Lett. 2019, 4, 3529–3536. [Google Scholar] [CrossRef]
Gao, F.; Wang, L.; Zhou, B.; Zhou, X.; Pan, J.; Shen, S. Teach-Repeat-Replan: A Complete and Robust System for Aggressive Flight in Complex Environments. IEEE Trans. Robot. 2020, 36, 1526–1545. [Google Scholar] [CrossRef]

Figure 1. Geometric schema of Unmanned aerial vehicles (UAVs)’s state navigation state transition: shifting from stochastic navigation under position uncertainty (

r_{e}

) to deterministic tracking upon breaching the visual localization boundary (

r_{v}

).

Figure 1. Geometric schema of Unmanned aerial vehicles (UAVs)’s state navigation state transition: shifting from stochastic navigation under position uncertainty (

r_{e}

) to deterministic tracking upon breaching the visual localization boundary (

r_{v}

).

Figure 2. Flowchart of the proposed digital twin-driven joint optimization framework based on DBSCAN and adaptive multi-objective GA.

Figure 3. Impact of the DBSCAN clustering radius (

ϵ

) on total flight time and cumulative outage cost.

Figure 3. Impact of the DBSCAN clustering radius (

ϵ

) on total flight time and cumulative outage cost.

Figure 4. Convergence trend of the proposed multi-objective Genetic Algorithm.

Figure 5. Average outage probability comparison among different path planning schemes under varying initial error radii (

r_{e}

).

Figure 5. Average outage probability comparison among different path planning schemes under varying initial error radii (

r_{e}

).

Figure 6. System robustness and degradation behavior under varying DT packet loss rates and update frequencies

(τ)

.

Figure 6. System robustness and degradation behavior under varying DT packet loss rates and update frequencies

(τ)

.

Figure 7. Normalized performance impact of the visual locating radius (

r_{v}

) on the total outage cost and total flight time.

Figure 7. Normalized performance impact of the visual locating radius (

r_{v}

) on the total outage cost and total flight time.

Figure 8. Visualizations of UAV flight trajectories under varying path planning strategies. Top row: 3D visualizations in a dense urban environment for (a) outage-centric scheme; (b) proposed balanced scheme; and (c) distance-centric scheme. Bottom row: 2D spatial heatmaps of the instantaneous communication outage probability overlaid with trajectories for (d) outage-centric scheme; (e) proposed balanced scheme; and (f) distance-centric scheme.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tong, H.; Song, Z.; Zhu, Z.; Sun, J. Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments. Drones 2026, 10, 376. https://doi.org/10.3390/drones10050376

AMA Style

Tong H, Song Z, Zhu Z, Sun J. Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments. Drones. 2026; 10(5):376. https://doi.org/10.3390/drones10050376

Chicago/Turabian Style

Tong, Hanyang, Ziyang Song, Zhenyan Zhu, and Jinlong Sun. 2026. "Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments" Drones 10, no. 5: 376. https://doi.org/10.3390/drones10050376

APA Style

Tong, H., Song, Z., Zhu, Z., & Sun, J. (2026). Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments. Drones, 10(5), 376. https://doi.org/10.3390/drones10050376

Article Menu

Digital Twin-Driven Trajectory and Resource Optimization for UAV Swarms in Low-Altitude Urban Logistics and Communication Environments

Highlights

Abstract

1. Introduction

2. System Model

2.1. Digital Twin Framework and Position Uncertainty Modeling

2.2. UAV Flight and Time Constraints

2.3. Two-Jump Communication Model and Throughput Analysis

2.4. Outage Probability Analysis

3. Optimization of Transfer Station Locations and Traversal Trajectories

3.1. Blockage-Aware Station Deployment and Clustering

3.2. Digital Twin State and Uncertainty Transition

3.3. 3D Boolean Blockage and Attenuation Formulation

3.4. Formulation of Throughput Constraints and Penalty Metric

3.5. Environment-Aware Adaptive Weighting Mechanism

3.6. Joint Optimization Problem Formulation

4. Numerical Results and Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI