Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs

Wang, Xindi; Zhang, Jiansong; Ma, Zhenyu; Cao, Chuanshuo; Liu, Hao

doi:10.3390/aerospace13010069

by Xindi Wang^1,2, Jiansong Zhang^1,2, Zhenyu Ma^1,2,*, Chuanshuo Cao^1,2 and Hao Liu^3,4,*

¹ National Key Laboratory of Land and Air Based Information Perception and Control, Xi’an 710065, China

² Xi’an Modern Control Technology Research Institute, Xi’an 710065, China

³ Institute of Artificial Intelligence, Beihang University, Beijing 100191, China

⁴ Zhongguancun Laboratory, Beijing 100190, China

* Authors to whom correspondence should be addressed.

Aerospace 2026, 13(1), 69; https://doi.org/10.3390/aerospace13010069

Submission received: 5 September 2025 / Revised: 23 December 2025 / Accepted: 6 January 2026 / Published: 8 January 2026

Abstract

This paper addresses the Multi-Target Reconnaissance (MTR) problem for heterogeneous Fixed-Wing Unmanned Aerial Vehicles (FW-UAVs), focusing on synchronized and time-optimal mission execution under stringent constraints. A two-stage coordinated assignment and planning framework is proposed. First, a time-balanced clustering algorithm is designed to minimize the overall mission duration while balancing individual UAV workloads by jointly employing a target reallocation strategy and an improved Genetic Algorithm (GA). Subsequently, an online trajectory planning method based on differential flatness is developed, integrating a robust replanning and flight-time synchronization strategy to ensure coordinated execution. Simulation results demonstrate that the proposed approach enhances time optimality and temporal coordination in complex scenarios.

Keywords:

fixed-wing unmanned aerial vehicles; multi-target reconnaissance; multi-target assignment; trajectory planning; time synchronization

1. Introduction

Multi-Target Reconnaissance (MTR) is a critical mission in both military and civilian domains, with applications ranging from battlefield target engagement to disaster-site data acquisition [1,2]. Traditionally, MTR relied on a single Unmanned Aerial Vehicle (UAV) performing sequential reconnaissance of multiple targets [3,4]. However, a single UAV proves inefficient, particularly in time-sensitive scenarios. To enhance task efficiency, the deployment of multiple Fixed-Wing Unmanned Aerial Vehicles (FW-UAVs) has gained increasing traction. Teams of FW-UAVs offer distinct advantages over a single UAV, owing to their superior cooperative sensing and reconnaissance capabilities [5,6,7].

Nevertheless, in practical emergency MTR scenarios, FW-UAVs are typically heterogeneous, with different cruise speeds [8]. Furthermore, to maximize the operational capacity of each UAV while ensuring mission continuity, it is imperative that all subtasks reach saturation and are completed synchronously. MTR inherently involves multiple interrelated processes, including Multi-Target Assignment (MTA), path planning, and robust control. Consequently, two critical challenges emerge during the above processes:

Considering the varying optimal cruising speeds of heterogeneous UAVs, how can a set of targets be optimally assigned to ensure each FW-UAV completes its MTR within an equal and minimized mission duration?
Subsequent to MTA, how can kinematically feasible trajectories be generated for each FW-UAV under dynamic environmental constraints, while simultaneously guaranteeing temporal synchronization among FW-UAVs throughout the whole process?

MTA constitutes the initial phase in tackling the MTR problem. By judiciously distributing targets among specific FW-UAVs, the inherently complex multi-UAV planning task can be decomposed into simpler, independent single-UAV problems. A diverse array of algorithms has been proposed for MTA, broadly categorized into traditional heuristic methods and metaheuristic optimization approaches. Traditional methods include Genetic Algorithms (GAs) [9], Particle Swarm Optimization (PSO) [10], Ant Colony Optimization (ACO) [11], Self-Organizing Maps (SOMs), K-means, and so on [12,13]. For example, a decentralized GA-based strategy for cooperative search, where each agent independently optimizes its task, was introduced in [14]. Choi proposed a two-stage GA framework enabling individual agents to optimize local task sequences in [15]. Furthermore, a modified PSO algorithm was presented in [16] to address uncertain time constraints in rescue missions. To surmount the limitations in modeling complexity in conventional methods, Li et al. developed a deep reinforcement learning model specifically tailored for simultaneous MTA in [17]. However, a limitation of most aforementioned studies is their oversight of the inherent heterogeneity among UAVs.

The aforementioned approaches commonly fail to fully consider the diverse operational characteristics of heterogeneous UAVs. To address this issue, several algorithms have been explored. For example, the K-means method, employed in [18], tends to produce clusters for multi-robot MTA. Similarly, the SOMs method, initially developed for high-dimensional data visualization [19], was adapted in [20] for MTA and path planning through a neural mapping approach. However, these methods primarily focus on basic optimization goals and remain inadequate for complex missions that require precise temporal coordination and optimal performance under UAV heterogeneity.

Following MTA, it becomes imperative to synchronously plan kinematically feasible and safe trajectories for each UAV. A diverse range of path planning algorithms has been developed, broadly categorized into geometry-based, sampling-based, optimization-based, and intelligent approaches [21]. Geometry-based approaches, including Dijkstra, A*, D*, and D* Lite, are widely employed for path planning in static environments. The A* algorithm was successfully applied in [22] to address path planning under wind disturbances. Optimization-based methods, such as GA, have been extensively utilized to fulfill specific mission requirements [23,24]. Ref. [25] proposed a Generative AI (GAN) algorithm combined with traditional RRT and BFOA to predict UAV paths in a dynamic environment. Nevertheless, prior works predominantly concentrate on single-UAV path planning and neglect crucial aspects of time synchronization. Furthermore, most existing path planning approaches, though effective for single-UAV missions or static environments, remain inadequate for coordinated multi-UAV operations that demand synchronization, compliance with kinematic constraints, and obstacle avoidance in dynamic and uncertain settings.

To facilitate synchronized execution, various Trajectory Planning (TP) methods have been developed. A representative example is the cooperative penetration strategy proposed by Luo et al. [26], which utilizes a deep deterministic policy gradient (DDPG) algorithm to enable multiple UAVs to penetrate defenses. Liu et al. introduced a spatiotemporal-refined voting mechanism for PSO to address the cooperative path planning problem in [27]. Their objective function accounted for obstacles, threat regions, and arrival time constraints. However, the iterative nature of such methods poses significant challenges for large-scale real-time deployment, particularly when maintaining precise temporal coordination across heterogeneous UAVs.

Alternatively, Differential Flatness (DF) theory [28] has been applied to reduce the state dimensionality of TP problems, while consensus-based approaches adjust replanning intervals to achieve synchronization. For example, ref. [29] introduced consensus variables to align replanning intervals across UAVs, but without explicit mechanisms to match them to the global mission timeline. Although DF offers computational efficiency and consensus algorithms enhance coordination, most DF-based methods assume fully known environments. Many consensus approaches neglect heterogeneous kinematic constraints and the real-time requirement for dynamic replanning. Developing a framework that integrates the above advantages with online replanning under dynamic constraints and environmental uncertainty remains an important open challenge.

In this paper, we address the MTR problem for heterogeneous FW-UAVs by proposing a synchronized, coordinated, and online framework that integrates MTA and TP for temporally coordinated reconnaissance missions. A global optimization problem is formulated to minimize the overall mission duration and control effort while balancing individual UAV flight times, and is decomposed into two coupled sub-problems. The first sub-problem focuses on time-balanced task allocation, where K-means-based spatial initialization is combined with an iterative target reallocation procedure to equalize flight times, followed by an improved GA to optimize the intra-cluster visiting sequence. The second sub-problem addresses flight-time-consistent trajectory planning by leveraging the DF property of fixed-wing UAV dynamics, enabling the real-time generation of kinematically feasible trajectories with online replanning for collision avoidance, explicit flight-time synchronization, and robustness to dynamic and unforeseen environmental changes.

Compared with existing studies, the main contributions of this work are summarized as follows.

A new practical time-balanced clustering algorithm is proposed for heterogeneous FW-UAVs. This method minimizes the overall mission duration and balances individual UAV flight durations by strategically reallocating targets and optimizing the intra-cluster visiting sequence. By decoupling temporal coordination from route optimization, the proposed approach achieves significantly improved computational efficiency for time-coordinated MTA problems, which is further validated through theoretical time-complexity analysis and extensive numerical simulations.
A practical replanning flight-time synchronization mechanism is proposed, which adaptively adjusts the replanning duration for each UAV. Inspired by consensus-based coordination principles, this mechanism enables the synchronization of flight times, and a rigorous convergence proof is provided to guarantee persistent synchronization.
An online trajectory planning algorithm is developed using the DF property of FW-UAVs. This planner operates under stringent kinematic constraints, ensures collision avoidance in unknown and dynamic environments, and rigorously respects terminal time constraints. The applicability of DF to fixed-wing UAVs is explicitly derived and discussed, extending its use beyond conventional rotary-wing platforms.

The remainder of this paper is organized as follows. Section 2 outlines the preliminaries, including FW-UAV dynamics, graph theory, and the problem formulation. Section 3 describes the time-balanced clustering algorithm, and Section 4 details the DF-based trajectory planner with the flight-time synchronization mechanism. Section 5 presents the simulation results, and Section 6 concludes the paper.

2. Preliminaries

2.1. FW-UAV Model

In this paper, the term UAVs specifically refers to FW-UAVs. Let $p_{i}^{e} \in \mathbb{R}^{3 \times 1}$ represent the position vector of UAV i in the Earth-fixed coordinate frame, and let $Θ_{i} = [ϕ_{i}, θ_{i}, ψ_{i}]^{T} \in \mathbb{R}^{3 \times 1}$ denote its Euler angles. The rotation matrix $R_{b, i}^{e} \in SO(3)$, mapping vectors from the body-fixed frame to the Earth-fixed frame, is defined accordingly. $R_{a, i}^{b}$ denotes the rotation matrix from the air-relative frame to the body-fixed frame. The coordinate frames are illustrated in Figure 1. $α_{i}$ is the angle of attack and $β_{i}$ is the sideslip angle. The dynamics of UAV i are described by the following [29,30]:

\begin{matrix} {\dot{p}}_{i}^{e} & = v_{a, i}^{e}, \\ m_{i} {\dot{v}}_{a, i}^{e} & = R_{b, i}^{e} [R_{a, i}^{b} (T h_{i} + D_{i} + L_{i})] + G_{i}, \\ {\dot{R}}_{b, i}^{e} & = R_{b, i}^{e} {⌊Ω_{i}⌋}_{\times}, \\ Ω_{i} & = Ω_{c, i}, \end{matrix}

(1)

where $m_{i}$ is the mass of UAV i, and $G_{i}$ denotes the gravitational force vector, with $G_{i} = - m_{i} g e_{2}$. $e_{i}$ is the unit vector in the i-direction in the Earth-fixed coordinate frame, and g denotes the gravitational acceleration. $Ω_{i} = [ω_{x, i}, ω_{y, i}, ω_{z, i}]^{T}$, with $ω_{μ, i}, μ = x, y, z$ representing the angular velocities, and $⌊Ω_{i}⌋_{\times}$ denotes the skew-symmetric matrix of $Ω_{i}$ as

{⌊Ω_{i}⌋}_{\times} = [\begin{matrix} 0 & - ω_{z, i} & ω_{y, i} \\ ω_{z, i} & 0 & - ω_{x, i} \\ - ω_{y, i} & ω_{x, i} & 0 \end{matrix}] .

(2)
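As a quick numerical check of the skew-symmetric operator in Equation (2), the following sketch builds the matrix from an angular-velocity vector and confirms that multiplying it by a vector reproduces the cross product. The function names `skew`, `mat_vec`, and `cross` and the sample values are illustrative assumptions, not part of the original paper.

```python
def skew(omega):
    """Return the 3x3 skew-symmetric matrix of a 3-vector, as in Eq. (2)."""
    wx, wy, wz = omega
    return [
        [0.0, -wz, wy],
        [wz, 0.0, -wx],
        [-wy, wx, 0.0],
    ]


def mat_vec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def cross(a, b):
    """Cross product of two 3-vectors."""
    return [
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    ]


# Hypothetical sample values, chosen only for illustration.
omega = [0.1, -0.2, 0.3]
v = [1.0, 2.0, 3.0]
hat_times_v = mat_vec(skew(omega), v)
omega_cross_v = cross(omega, v)
```

The two results agree component by component, which is the property that makes the operator convenient in the attitude kinematics of Equation (1).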

$L_{i}$ is the lift force of UAV i along the $y_{a}$ air axis and $D_{i}$ is the drag force along the negative $x_{a}$ air axis.

Assumption 1.

1. The wind speed is below 3 m/s, so the sideslip angle $β_{i}$ can be neglected;
2. The aerodynamic forces generated by control-surface deflections $δ_{i}$ are negligible due to the imposed limits on control-surface ranges.

Under Assumption 1, the aerodynamic forces can be expressed as

\begin{matrix} L_{i} & \approx \frac{1}{2} ρ V_{a, i}^{2} S_{i} C_{L, i} (α_{i}) a_{2}, \\ D_{i} & \approx - \frac{1}{2} ρ V_{a, i}^{2} S_{i} C_{D, i} (α_{i}) a_{1}, \end{matrix}

(3)

where $ρ$ is the air density, $V_{a, i} = \| v_{a, i} \|$ is the airspeed of UAV i, and $S_{i}$ is the wing reference area. $C_{L, i}$ and $C_{D, i}$ are the lift and drag aerodynamic coefficients, respectively. Let $Ω_{c, i} = [ω_{c, x, i}, ω_{c, y, i}, ω_{c, z, i}]^{T}$ be the commanded angular velocities and $Th_{i} = Te_{i} \cos(α_{i}) a_{1}$ be the commanded thrust along the $x_{a}$ air axis, where the attack angle is small. $a_{i}$ is the unit vector in the i-direction in the air-relative coordinate frame. In this paper, the control input for UAV i is denoted by $U_{i}$:

U_{i} = \{ω_{c, x, i}, ω_{c, y, i}, ω_{c, z, i}, T e_{i}\} .

(4)

To simplify notation, the position vector $p_{i}^{e}$ is subsequently written as $p_{i}$. The main symbols used in this article are shown in Table 1.

2.2. Graph Theory

Consider a formation of N FW-UAVs, indexed from $i = 1$ to N. The interactions among them are modeled by a directed graph $G = (V, E, W)$, where $V = \{v_{1}, \dots, v_{N}\}$ is the set of nodes, and $E \subset V \times V$ is the set of directed edges. $W = [w_{i, j}] \in \mathbb{R}^{N \times N}$ indicates the weighted adjacency matrix with $w_{i, j} > 0$ if and only if $(v_{i}, v_{j}) \in E$. For node $v_{i}$, the weighted in-degree is defined as $μ_{i} = \sum_{j = 1}^{N} w_{i, j}$, where $N_{i} = \{j | (v_{i}, v_{j}) \in E\}$ is the set of neighbors of node $v_{i}$. Define the Laplacian matrix as $L = D - W$, where $D = \mathrm{diag}\{μ_{i}\} \in \mathbb{R}^{N \times N}$. A directed path from node $v_{i}$ to $v_{j}$ is a sequence of directed edges connecting them. If a node has a directed path to every other node in the graph through a subset of the edges of $G$, then $G$ is said to have a spanning tree and the node is called the root.
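The definitions above can be illustrated with a minimal sketch that builds the Laplacian $L = D - W$ from a weighted adjacency matrix and tests whether some node reaches every other node. The helper names and the three-node example are assumptions made for illustration; the edge convention follows the text, with $w_{i, j} > 0$ read as an edge from $v_{i}$ to $v_{j}$.

```python
def laplacian(weights):
    """Return L = D - W, where D holds the row sums of W on its diagonal."""
    n = len(weights)
    degrees = [sum(row) for row in weights]
    return [
        [(degrees[i] if i == j else 0.0) - weights[i][j] for j in range(n)]
        for i in range(n)
    ]


def reachable_from(weights, root):
    """Nodes reachable from root by following edges with positive weight."""
    seen = {root}
    stack = [root]
    while stack:
        i = stack.pop()
        for j, w in enumerate(weights[i]):
            if w > 0 and j not in seen:
                seen.add(j)
                stack.append(j)
    return seen


def has_spanning_tree(weights):
    """True if at least one node has a directed path to every other node."""
    n = len(weights)
    return any(len(reachable_from(weights, r)) == n for r in range(n))


# Hypothetical chain 0 -> 1 -> 2 with weights 1 and 2.
W_chain = [[0, 1, 0], [0, 0, 2], [0, 0, 0]]
L_chain = laplacian(W_chain)
chain_has_tree = has_spanning_tree(W_chain)

# Node 2 is isolated here, so no root exists.
W_broken = [[0, 1, 0], [0, 0, 0], [0, 0, 0]]
broken_has_tree = has_spanning_tree(W_broken)
```

Every row of the resulting Laplacian sums to zero, which follows directly from the definition of $D$.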

2.3. Problem Formulation

In MTR, a swarm of $N \in \mathbb{N}$ heterogeneous UAVs is considered, where each UAV i has a distinct optimal cruising speed $V_{cruise, i}$. Let the target set be $A = \{j | j = 1, \dots, M\}$, with the corresponding position set $P_{target} = \{p_{target, j} \in \mathbb{R}^{3 \times 1} | j \in A\}$. For each UAV i, the subset of assigned targets is denoted by $A_{i} = \{k | k \in A\}$, with its associated ordered position set represented by $wPt_{i} \subseteq P_{target}$. Both $A_{i}$ and $wPt_{i}$ are treated as ordered sequences. In addition, Q no-fly zones are randomly distributed over the mission area $\mathcal{M}$, with their positions represented by the set $P_{obs}$. Each UAV is required to visit all assigned targets in its respective set $A_{i}$ and return to its initial departure point.

Given the randomly distributed target set A and a set of heterogeneous FW-UAVs with differing optimal cruising speeds $V_{cruise, i}$, it is necessary to (1) assign each UAV i an ordered target subset $A_{i}$ such that all UAVs complete their missions in an equal and minimized mission duration T; and (2) in response to environmental uncertainty and limited sensing range, plan a trajectory for each UAV online to minimize control effort, while ensuring compliance with the common flight time $T_{i} = T$, avoidance of the Q no-fly zones, and kinematic constraints.

This problem is formally modeled as the following constrained optimization problem:

\begin{aligned} & \min_{A_{i}, wPt_{i}} \max T_{i} + \min_{P_{i}} \sum_{i = 1}^{N} \int_{0}^{T_{i}} (p_{i}^{(3)}(t))^{2} dt, \\ & \text{s.t.} \\ & S1_{1} : \bigcup_{i = 1}^{N} A_{i} = A, \ \bigcap_{i = 1}^{N} A_{i} = \emptyset, \\ & S1_{2} : \max(T_{i}) - \min(T_{j}) < ε_{1}, \\ & S2_{1} : wPt_{i} \subseteq P_{i}, \\ & S2_{2} : \| T_{i} - T \| \leq ε_{T}, \\ & S2_{3} : \| p_{i}(t) - p_{obs, q} \| > R_{obs, q} + R_{UAV}, \ \forall q \in Q, \\ & S2_{4} : \| p_{i}(t) - p_{j}(t) \| > 2 R_{UAV}, \ i, j \in N, \ j \neq i, \\ & S2_{5} : V_{a, i}(t) \in [V_{a, i, \min}, V_{a, i, \max}], \\ & S2_{6} : \| n_{i, μ} \| \leq n_{i, μ_{max}}, \ \| n_{i, z} \| \leq v_{i}^{2} / (g R_{turn}), \\ & S2_{7} : ϕ_{i}(t) \in [- ϕ_{i, \max}, ϕ_{i, \max}], \ θ_{i}(t) \in [- θ_{i, \max}, θ_{i, \max}] . \end{aligned}

(5)

where $μ \in \{x, y, z\}$. $S1$ represents the synchronized coordination constraints among UAVs, and $S2$ includes flight time, obstacle avoidance, and dynamic feasibility constraints for individual UAV i.

For constraints in $S2$, $p_{i}(t) \in P_{i}$ represents the position of UAV i at time t. In the individual trajectory planning process, each UAV must traverse its assigned targets in sequence order, adhering to $S2_{1}$. To satisfy $S2_{2}$, the actual flight time $T_{i}$ must converge to the desired mission duration $T \in \bigcap_{i = 1}^{N} [T_{i, min}, T_{i, max}]$, where $T_{i, min}$ and $T_{i, max}$ represent the permitted bounds on the duration for UAV i. When flying in unknown environments, such as battlefield scenarios, UAVs must dynamically avoid time-varying obstacles and maintain safe distances from other UAVs in real time. Specifically, $S2_{3}$ ensures that UAVs maintain a safe distance from obstacles, where $p_{obs, q}, q \in Q$ is the center position of the q-th obstacle, $R_{obs, q}$ is the required safety radius, and $R_{UAV}$ is the safe radius of the UAV. Concurrently, $S2_{4}$ ensures that UAVs do not conflict with one another during the simultaneous execution of their tasks. The velocity constraint $S2_{5}$ guarantees that UAVs operate within their feasible speed range. The UAV’s overloads in all directions, denoted by $n_{i, μ} = a_{i, μ}(t) / m_{i} g, μ \in \{x, y, z\}$, are constrained by structural strength and turning radius limitations, specifically $S2_{6}$, where $n_{i, μ_{max}}$ denotes the maximum allowable overload in direction $μ$, and $R_{turn}$ is the minimum turning radius. Furthermore, each UAV must satisfy specific attitude constraints $S2_{7}$, where $ϕ_{i}$ and $θ_{i}$ represent the roll angle and pitch angle of UAV i, respectively. To constrain the lateral acceleration, $ϕ_{i}$ needs to satisfy $| ϕ_{i} | \leq 45^{\circ}$.
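To make the separation constraints $S2_{3}$ and $S2_{4}$ concrete, the sketch below checks them at a single time instant for sampled UAV positions. It is only an illustration under stated assumptions: the function names, the data layout (obstacles as pairs of center and radius), and the numbers are hypothetical, and a real planner would enforce these conditions along the whole trajectory rather than at isolated samples.

```python
import math


def clear_of_obstacles(position, obstacles, r_uav):
    """S2_3: distance to every obstacle center exceeds R_obs,q + R_UAV."""
    return all(
        math.dist(position, center) > r_obs + r_uav
        for center, r_obs in obstacles
    )


def clear_of_other_uavs(positions, r_uav):
    """S2_4: every pair of UAVs is more than 2 * R_UAV apart."""
    return all(
        math.dist(positions[i], positions[j]) > 2.0 * r_uav
        for i in range(len(positions))
        for j in range(i + 1, len(positions))
    )


# Hypothetical snapshot: two UAVs and one no-fly zone (center, radius).
r_uav = 5.0
obstacles = [((20.0, 30.0, 100.0), 10.0)]
positions = [(0.0, 0.0, 100.0), (50.0, 0.0, 100.0)]

obstacle_ok = all(clear_of_obstacles(p, obstacles, r_uav) for p in positions)
separation_ok = clear_of_other_uavs(positions, r_uav)

# A position inside the inflated no-fly zone violates S2_3.
violating = clear_of_obstacles((20.0, 20.0, 100.0), obstacles, r_uav)
```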

To address this constrained optimization problem, a divide-and-conquer strategy is adopted. The original problem is decomposed into two sub-problems: a time-balanced clustering-based MTA problem and a DF-based TP problem. The overall solution framework is illustrated in Figure 2.

3. Time-Balanced Clustering Algorithm for MTA

This section presents a time-balanced clustering algorithm designed for heterogeneous UAVs. This algorithm addresses the MTA problem under synchronized task execution requirements, formulated as follows:

Sub-problem 1:

\begin{aligned} & \min_{A_{i}, wPt_{i}} \max T_{i}, \\ & \text{s.t.} \\ & S1_{1} : \bigcup_{i = 1}^{N} A_{i} = A, \ \bigcap_{i = 1}^{N} A_{i} = \emptyset, \\ & S1_{2} : \max(T_{i}) - \min(T_{j}) < ε_{1} . \end{aligned}

(6)

This sub-problem aims to minimize the maximum flight time among all UAVs by optimizing the assigned target sets $A_{i}$ and their visiting sequences $wPt_{i}$, subject to two constraints. $S1_{1}$ ensures that the target sets assigned to different UAVs are mutually exclusive and that all targets are allocated. $S1_{2}$ enforces approximate equality of flight times across all UAVs.

To expedite the initial target allocation process, a standard K-means clustering algorithm is employed. Based on the spatial distribution of the target positions $p_{target} \in P_{target}$ and the number N of UAVs, an initial division of targets into N clusters is performed. Then, for each cluster, an improved GA is utilized to optimize the visiting order among its targets, forming an ordered set $A_{i}$ and its corresponding ordered position set $wPt_{i}$. For notational simplicity, define $wPt = \{wPt_{1}, \dots, wPt_{N}\}$.

In practical scenarios, the varying number of targets M and the random distribution of targets $P_{target}$, coupled with the differences in UAV optimal cruising speeds $V_{cruise, i} \in V_{cruise}$, result in significant discrepancies in individual UAV flight time $T_{i}$ without coordination. $T_{i}$ can be approximated as

T_{i} = γ_{turn, i} \sum_{j \in A_{i}} (\| p_{target, j + 1} - p_{target, j} \| / V_{cruise, i}),

(7)

where $γ_{turn, i}$ is a scaling coefficient derived from experimental data, which corrects flight-time estimation bias according to each UAV’s minimum turning radius.
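The estimate in Equation (7) can be sketched in a few lines. The function name `estimate_flight_time`, the convention that the caller passes the full ordered route (including the departure point at both ends when a return leg is wanted), and the numeric values are assumptions for illustration only.

```python
import math


def estimate_flight_time(waypoints, cruise_speed, gamma_turn=1.0):
    """Eq. (7): scaled sum of straight-line leg lengths over cruise speed."""
    path_length = sum(
        math.dist(a, b) for a, b in zip(waypoints, waypoints[1:])
    )
    return gamma_turn * path_length / cruise_speed


# Hypothetical closed route with legs of 500 m, 400 m and 300 m.
route = [
    (0.0, 0.0, 0.0),
    (300.0, 400.0, 0.0),
    (300.0, 0.0, 0.0),
    (0.0, 0.0, 0.0),
]

# Two heterogeneous UAVs flying the same route at different cruise speeds.
t_fast = estimate_flight_time(route, cruise_speed=20.0, gamma_turn=1.1)
t_slow = estimate_flight_time(route, cruise_speed=15.0, gamma_turn=1.1)
```

With these assumed values the faster UAV needs roughly 66 s and the slower one roughly 88 s for the same route, which is the kind of imbalance the reallocation strategy is meant to remove.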

3.1. Cluster-Based Target Reallocation Strategy

To address the aforementioned limitations, a target reallocation strategy based on dynamic cluster adjustments is proposed in Algorithm 1.

Algorithm 1 Time-Balanced Clustering Algorithm for MTA

Input: Number of UAVs N, optimal cruising speeds $V_{cruise}$, and target positions $P_{target}$.

Output: Adjusted target clusters $A_{i}$, waypoint sets $wPt_{i}$, and minimized mission duration T.

1. Initialization

    $wPt \leftarrow \text{K\_MEANS}(P_{target}, N)$
    For $i = 1, \dots, N$ do
        $wPt_{i} \leftarrow \text{GA}(wPt_{i})$
    End For
    $k_{2} = 1$

2. Iterative Refinement

    While $\max(T_{j}) - \min(T_{i}) > threshold$ and $k_{2} < L_{2}$ do
        $k_{1} = 1$

        2.1. Calculate the center position $O_{A, i}$ and flight time $T_{i}$ for each cluster:
            $O_{A, i} = \sum_{m = 1}^{K_{i}} wPt_{i}(m) / K_{i}$
            $T_{i} \leftarrow γ_{turn, i} \sum_{j \in A_{i}} (\| p_{target, j + 1} - p_{target, j} \| / V_{cruise, i}), i = 1, \dots, N$
            $T \leftarrow \text{TIME\_MEAN}(T_{1}, \dots, T_{N}, N)$

        2.2. Sort. Sort UAVs in descending order of flight time $T_{i}$.

        2.3. Iterative Refinement
            For each UAV $i = 1, \dots, N - 1$ in the sorted order do
                While $\| T_{i} - T \| > threshold$ and $k_{1} < L_{1}$ do
                    If $T_{i} < T$ then
                        $(p_{tmp}, j) \leftarrow \text{FIND\_POINT\_j}(wPt_{i}, O_{A, i}, P_{target})$
                        $wPt_{j} \leftarrow \text{REMOVE\_POINT}(wPt_{j}, p_{tmp})$
                        $wPt_{i} \leftarrow \text{ADD\_POINT}(wPt_{i}, p_{tmp})$
                    Else
                        $p_{tmp} \leftarrow \text{FIND\_POINT\_i}(wPt_{i}, O_{A, i})$
                        $wPt_{i} \leftarrow \text{REMOVE\_POINT}(wPt_{i}, p_{tmp})$
                        $j \leftarrow \text{FIND\_CLUSTER}(i, p_{tmp}, O_{A})$
                        $wPt_{j} \leftarrow \text{ADD\_POINT}(wPt_{j}, p_{tmp})$
                    End If
                    Calculate $T_{i}, i = 1, \dots, N$ and update T
                    $k_{1} \leftarrow k_{1} + 1$
                End While
            End For

        2.4. Optimize intra-cluster visiting order
            For $i = 1, \dots, N$ do
                $wPt_{i} \leftarrow \text{GA}(wPt_{i})$
            End For

        $k_{2} \leftarrow k_{2} + 1$
        Pruning (discard previously visited states)
    End While

Algorithm 1 is executed to perform adaptive adjustments of the clusters. In Step 1, the initial target clusters $A_{i}$ are obtained. Step 2 is the core part of this algorithm, where the visiting order within each set $A_{i}$ and $wPt_{i}$ is optimized.

In Step 2.1, the mission duration $T = \sum_{i = 1}^{N} T_{i} / N$ is calculated. In Step 2.2, to ensure adjustment of target clusters in each iteration, the cluster adjustment order is determined. In Step 2.3, to minimize the impact of target point selection on the existing target clusters, $p_{tmp}$ is defined as the target point that is closest to the center point $O_{A, i}$ of the set $A_{i}$ and that belongs to another cluster $j, j \neq i$, i.e.,

\text{FIND\_POINT\_j}(wPt, O_{A, i}, P_{target}) : p_{tmp} = \underset{p \in wPt_{j}, j > i}{\arg\min} (\| p - O_{A, i} \|) .

(8)

Conversely, $\text{FIND\_POINT\_i}(wPt_{i}, O_{A, i})$ is defined as finding the point $p_{tmp}$ within set $A_{i}$ that is furthest from $O_{A, i}$:

\text{FIND\_POINT\_i}(wPt_{i}, O_{A, i}) : p_{tmp} = \underset{p \in wPt_{i}}{\arg\max} (\| p - O_{A, i} \|) .

(9)

Furthermore, $\text{FIND\_CLUSTER}(i, p_{tmp}, O_{A})$ is defined as finding the unadjusted cluster j such that the distance between its center point $O_{A, j} (j > i)$ and $p_{tmp}$ is minimized.

When $p_{tmp}$ is added to the ordered set $wPt_{i}$, the point $p_{mid, j} = \underset{p \in wPt_{i}}{\arg\min} (\| p - p_{tmp} \|)$ closest to $p_{tmp}$ and its corresponding subsequent point $p_{mid, j + 1}$ are identified within $wPt_{i}$ in $\text{ADD\_POINT}(wPt_{i}, p_{tmp})$. Subsequently, $p_{tmp}$ is inserted between these two points to ensure that target cluster $A_{i}$ and $wPt_{i}$ remain ordered.
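The point-selection helpers described by Equations (8) and (9) and the insertion rule of ADD_POINT can be sketched as follows. This is a simplified illustration, not the authors' implementation: clusters are plain Python lists of coordinate tuples, the search in `find_point_j` covers all later clusters in the given order and assumes at least one of them is non-empty, and the example coordinates are hypothetical.

```python
import math


def centroid(points):
    """Center O_A,i of a cluster: the mean of its points."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def find_point_i(cluster, center):
    """Eq. (9): the point of the cluster furthest from its own center."""
    return max(cluster, key=lambda p: math.dist(p, center))


def find_point_j(clusters, i, center_i):
    """Eq. (8): the point in a later cluster (j > i) closest to center_i."""
    candidates = [
        (math.dist(p, center_i), j, p)
        for j in range(i + 1, len(clusters))
        for p in clusters[j]
    ]
    _, j, point = min(candidates)
    return point, j


def add_point(route, point):
    """Insert point directly after the route element closest to it."""
    nearest = min(range(len(route)), key=lambda k: math.dist(route[k], point))
    return route[: nearest + 1] + [point] + route[nearest + 1 :]


# Hypothetical clusters for two UAVs.
clusters = [
    [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0), (0.0, 12.0, 0.0)],
    [(30.0, 0.0, 0.0), (40.0, 0.0, 0.0), (14.0, 2.0, 0.0)],
]
center_0 = centroid(clusters[0])
furthest = find_point_i(clusters[0], center_0)
p_tmp, source_cluster = find_point_j(clusters, 0, center_0)
new_route = add_point(clusters[0], p_tmp)
```

In this example the point borrowed from the second cluster is placed between its nearest neighbor in the first route and that neighbor's successor, so the existing visiting order is otherwise left unchanged.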

Step 2.4 ensures that the flight time $T_{i}$ for each target cluster is minimized. This algorithm incorporates a 2-opt local search operator, which eliminates path crossings and reduces redundant segments within the generated routes. The 2-opt local search operator is applied to the worst 50% of individuals. The core of the improved GA comprises three main components: a selection operator, a crossover operator, and a mutation operator. In the selection phase, a portion of highly fit individuals is chosen using a roulette wheel strategy; another portion is randomly generated to maintain diversity; and the remaining individuals are constructed based on the visiting order derived from Algorithm 1. In the crossover operator, parent individuals are crossed using a random slicing method, which inherently avoids conflict detection. In the mutation operator, a randomly selected segment of an offspring’s sequence is reordered to achieve rapid exploration of the solution space. The main parameters of Algorithm 1 are listed in Table 2. Threshold $τ_{time}$ depends on the spatial distribution of UAVs and the scale of the environment. A practical selection strategy is as follows: assign an initially small value (typically below 50); then, execute Algorithm 1, and check whether the algorithm terminates because the maximum iteration limits $L_{1}$ or $L_{2}$ are reached. If this occurs, $τ_{time}$ should be further reduced.
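For readers unfamiliar with the 2-opt operator mentioned above, the following is a minimal generic sketch of it on a closed tour. It is the textbook form of the operator, not the paper's GA: the departure point is kept fixed at index 0, segments are reversed whenever that strictly shortens the tour, and the function names and sample route are assumptions.

```python
import math


def tour_length(route):
    """Length of the closed tour that returns to the first point."""
    return sum(
        math.dist(route[k], route[(k + 1) % len(route)])
        for k in range(len(route))
    )


def two_opt(route):
    """Reverse segments while doing so strictly shortens the closed tour."""
    best = list(route)
    improved = True
    while improved:
        improved = False
        for a in range(1, len(best) - 1):
            for b in range(a + 1, len(best)):
                # Reverse the segment best[a..b]; index 0 stays in place.
                candidate = best[:a] + best[a : b + 1][::-1] + best[b + 1 :]
                if tour_length(candidate) < tour_length(best) - 1e-12:
                    best = candidate
                    improved = True
    return best


# Hypothetical route over the corners of a square, with a crossing.
crossed = [(0.0, 0.0), (10.0, 10.0), (10.0, 0.0), (0.0, 10.0)]
untangled = two_opt(crossed)
length_before = tour_length(crossed)
length_after = tour_length(untangled)
```

Because a candidate is accepted only when the tour becomes strictly shorter, the loop terminates, and in this example the crossing is removed so the tour follows the square's perimeter.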

Algorithm 1 is heuristic in nature but admits a well-defined convergence behavior. Specifically, the MTA in Algorithm 1 has a finite search space since both the number of UAVs N and the number of targets M are finite. The algorithm is designed to accept a new assignment state only if it yields a strict reduction in the mission-duration objective while satisfying the feasibility constraints. Step 2.3 preserves constraint satisfaction and monotonically reduces flight-time imbalance, while Step 2.4 ensures a non-increasing objective value for the visiting sequence optimization [31], consistent with established convergence properties of evolutionary algorithms. To prevent cycling in the finite state space, a pruning mechanism is introduced to discard previously visited states. With explicit termination conditions, Algorithm 1 is therefore guaranteed to converge to a locally optimal solution within a finite number of iterations, which provides a practical improvement guarantee for the MTA problem.

Table 2. Main parameters of Algorithm 1.

Simulation Parameter	Value
Iteration in K_MEANS	$25 \times \min (M, 10 \times N)$ [32]
Threshold	$τ_{time}$
$L_{2}$	N
$L_{1}$	$10 \times L_{2}$
Max Generations in GA	35 × $(10 - k_{2})$
Population Size	80
Roulette Wheel Probability in Selection Operator	$0.81$
Random Probability in Selection Operator	$0.09$
Unchanged Probability in Selection Operator	$0.1$
Crossover Operator Rate	$0.65$
Mutation Rate	$0.25$

3.2. Time Complexity Analysis

Assuming that the average number of targets in each cluster is $M / N$, the overall time complexity of Step 2.3 is $O(L_{1} (N - 1) M / N)$, where $L_{1}$ is the number of iterations. Considering that Step 2 needs to iterate $L_{2}$ times, and the GA’s sub-generation iterates $L_{3}$ times, the total time complexity of Algorithm 1 is

\begin{aligned} O(L_{2} \{[L_{1} (N - 1) M / N] + N Z L_{3} (M / N)^{2}\}) & \approx O(L_{1} L_{2} M + L_{2} L_{3} Z M^{2} / N) \\ & \propto O(\max(M, \frac{M^{2}}{N})), \end{aligned}

(10)

where Z is the number of sub-generations. In contrast, if this problem were solved as a Multiple Traveling Salesman Problem (MTSP), the time complexity would be $O(M!)$.

Algorithm 1 balances the flight times $T_{i}$ among heterogeneous UAVs, and T is selected as the mission duration for all UAVs in the subsequent planning phase.

4. Flight-Time-Consistent Algorithm Based on DF for TP

In actual flight, various constraints, including kinematic limitations and environmental factors, must be strictly satisfied. In this section, these challenges are addressed through a flight-time-consistent TP algorithm that generates executable trajectories $P_{i}$ satisfying the mission duration T for all UAVs $i \in N$.

Sub-problem 2: This sub-problem aims to minimize the integral of $(p_{i}^{(3)}(t))^{2}$ for all UAVs, subject to waypoint visit constraints, strict mission duration consistency T, obstacle avoidance, inter-UAV collision avoidance, velocity limits, overload constraints, and attitude limitations. Note that if $S2_{7}$ in (5) were transformed into a hard constraint, the closed-form solution would be prohibitively complex. Therefore, for practical implementation, these constraints are incorporated as soft constraints using penalty functions $J_{e, i}$ within the optimization framework. Mathematically, the sub-problem $\min_{P_{i}} \sum_{i = 1}^{N} \int_{t = 0}^{T_{i}} (p_{i}^{(3)}(t))^{2} dt$ subject to $S2$ in (5) is reformulated as

\min \sum_{i = 1}^{N} [\sum_{μ \in \{x_{i}, y_{i}, z_{i}\}} (J_{μ, i}) + J_{e, i}] = \min \sum_{i = 1}^{N} [\sum_{μ \in \{x_{i}, y_{i}, z_{i}\}} (\int_{t = 0}^{T_{i}} (p_{i, μ}^{(3)}(t))^{2} dt) + \int_{t = 0}^{T_{i}} J_{e, i}^{2} dt]

s.t.

\begin{aligned} & S2_{1} : wPt_{i} \subseteq P_{i}, \\ & S2_{2} : \| T_{i} - T \| \leq ε_{T}, \\ & S2_{3} : \| p_{i}(t) - p_{obs, q} \| > R_{obs, q} + R_{UAV}, \ \forall q \in Q, \\ & S2_{4} : \| p_{i}(t) - p_{j}(t) \| > 2 R_{UAV}, \ i, j \in N, \ j \neq i, \\ & S2_{5} : V_{a, i}(t) \in [V_{a, i, \min}, V_{a, i, \max}], \\ & S2_{6} : \| n_{i, μ} \| \leq n_{i, μ_{max}}, \ \| n_{i, z} \| \leq v_{i}^{2} / (g R_{turn}) . \end{aligned}

(11)

The algorithm is structured into two main steps:

Step 1: Initial Trajectory Generation: Initially, neglecting the obstacle avoidance constraints $S2_{3} \land S2_{4}$, an initial reference trajectory $P_{i}^{0}$ is generated for each UAV. This trajectory passes through all assigned waypoints and adheres to the desired total mission duration, thereby providing a preliminary solution to sub-problem 2 by leveraging the properties of flat outputs. However, in real-world flight operations, constraints $S2_{3} \land S2_{4}$ cannot be disregarded. Furthermore, due to unforeseen obstacles and strict motion constraints, a UAV’s actual position may deviate from its desired reference, leading to uneven remaining flight times among UAVs, which in turn violates constraints $S2_{2} \land S2_{5} \land S2_{6}$.

Step 2: Flight-Time-Consistent Replanning: A flight-time-consistent replanning feedback framework is proposed based on the initial reference trajectory $P_{i}^{0}$. This framework continuously performs spatiotemporal replanning over a replanning interval $Lo_{i}$ through remaining flight time feedback to satisfy all constraints. By adaptively adjusting the local flight time $T_{d, i}$ of $Lo_{i}$, all UAVs are guided to fly in accordance with T, further satisfying constraint $S2_{2}$. Simultaneously, within $Lo_{i}$, a safe corridor satisfying environmental constraints $S2_{3} \land S2_{4}$ is rapidly generated. Subsequently, optimal local trajectories satisfying $S2_{5} \land S2_{6}$ are obtained within this safe corridor through DF optimization.

4.1. Flight-Time-Consistent Replanning Strategy

Given the limited sensing range of UAVs and the imperative to maintain precise flight-time consistency, a dynamic replanning strategy is proposed. The starting point of the replanning interval

L o_{i}

is the UAV’s current position

p_{i} (t)

, and the end point is the next waypoint position

p_{i, n e x t P o i n t} \in w P t_{i}

.

It is assumed that all waypoints

w P t_{i}

are reachable and that their surrounding areas are free of obstacles, i.e.,

| | w P t_{i} (m) - P_{o b s} (q) | | > R_{o b s, q} + R_{U A V}, \forall m, q

. To satisfy constraint

S 2_{2}

, define the desired local flight time for UAV i as

T_{d, i}

and the replanning time interval as

τ_{p l a n n i n g}

. To ensure flight-time consistency among all UAVs, a variable

ε_{i}

for UAV i is defined as follows:

ε_{i} [k] = \frac{t_{i, n e x t P o i n t} - (T_{l, i} [k] - τ_{c o n t r o l})}{T},

(12)

where

τ_{c o n t r o l}

is the single-step control time interval,

T_{l, i}

is the actual local flight time of current replanning interval

L o_{i}

,

t_{i, n e x t P o i n t}

is the desired reference time to reach the next waypoint obtained from

P_{i}^{0}

, and

ε_{i} \in [0, 1]

. Due to physical motion constraints,

T_{l, i} \neq T_{d, i}

. Define the error variable between

T_{d, i} [k]

and

T_{l, i} [k]

as

δ_{i} [k] = T_{d, i} [k] - T_{l, i} [k]

, which characterizes the flight process of UAV i. Based on

ε_{i}

,

T_{d, i}

is designed as

\begin{matrix} T_{d, i} [k] = \frac{Γ (L o_{i})}{V_{c r u i s e, i}} [\sum_{j = 1}^{N} a_{i, j} w_{i, j} (ε_{i} [k] - ε_{j} [k]) + b_{i, N + 1} w_{i, N + 1} (ε_{i} [k] - ε [k]) + 1], ε = t / T, \end{matrix}

(13)

where

ε

is a global absolute time reference,

ε \in [0, 1]

,

a_{i, j}, b_{i, j}

are positive coordination coefficients, and

w_{i, N + 1}

is the adjacency value from UAV i to

ε

in the communication graph; here,

w_{i, N + 1} = 1, \forall i

.

Γ (\cdot)

is the estimated path length for

L o_{i}

. After specifying

L o_{i}

, the A* algorithm is adopted to generate a sequence of sub-waypoints

w P t_{i, l o c a l}

within

L o_{i}

, then

\begin{matrix} Γ (w P t_{i, l o c a l}) = \sum_{q = 1}^{N_{l, i} - 1} | | w P t_{i, l o c a l} (q) - w P t_{i, l o c a l} (q + 1) | |, \end{matrix}

(14)

where

N_{l, i}

is the number of waypoints in

L o_{i}

.
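As a rough numerical illustration of Equations (13) and (14), the following minimal sketch computes the local path length and the desired local flight time for a lagging UAV and for one of its neighbors. The function names, the symmetric chain adjacency matrix, the values of the consistency variables, and the waypoint coordinates are all hypothetical choices made for this example and are not taken from the paper's implementation.

```python
import numpy as np

def path_length(sub_waypoints):
    """Equation (14): length of the polyline through the local sub-waypoints."""
    pts = np.asarray(sub_waypoints, dtype=float)
    return float(np.linalg.norm(np.diff(pts, axis=0), axis=1).sum())

def desired_local_time(i, eps, eps_ref, gamma_len, v_cruise, W, a=1.0, b=1.0):
    """Equation (13) for UAV i, taking w_{i,N+1} = 1 as stated in the text."""
    neighbours = sum(a * W[i][j] * (eps[i] - eps[j])
                     for j in range(len(eps)) if j != i)
    reference = b * (eps[i] - eps_ref)
    return gamma_len / v_cruise * (neighbours + reference + 1.0)

# Hypothetical scenario: three UAVs on a chain graph, UAV 0 lags the reference.
W = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
eps = [0.40, 0.50, 0.50]
gamma_len = path_length([(0, 0), (300, 0), (300, 400)])  # 300 m + 400 m
t_lagging = desired_local_time(0, eps, 0.50, gamma_len, 50.0, W)
t_neighbour = desired_local_time(1, eps, 0.50, gamma_len, 50.0, W)
print(t_lagging, t_neighbour)
```

With these assumed numbers the nominal time is 700 / 50 = 14 s; the lagging UAV is assigned a shorter local flight time (about 11.2 s) and its neighbor a longer one (about 15.4 s), which matches the qualitative behavior described for the coordination law.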

Based on the definitions above, a flight-time-consistent replanning strategy is proposed, as detailed in Algorithm 2. To handle delays arising from acceleration limits or obstacle avoidance, as well as restrictions imposed by the turning constraints, a verification module check() is incorporated into Algorithm 2 to examine whether the critical dynamic constraints are satisfied. If either the velocity constraint or the

S 2_{7}

constraint is violated, the framework automatically triggers Algorithm 1 to reallocate the targets and restore feasibility. Each UAV i obtains

ε_{j}

of neighboring UAVs and

ε

, then continuously adjusts its desired local flight time

T_{d, i}

for

L o_{i}

to ensure that all UAVs maintain temporal consistency. Specifically, consider a scenario where UAV i encounters an unforeseen obstacle at the k-th planning step; this encounter will cause an increase in

Γ (w P t_{i, l o c a l})

, leading to an increase in

T_{d, i} [k]

, which will further cause

ε_{i} [k]

to decrease, indicating that the UAV’s time progression is lagging. When updating at step

k + 1

, the consistency algorithm will adjust

T_{d, i} [k + 1]

to decrease, implicitly instructing UAV i to speed up during the next replanning step. Concurrently, neighboring UAVs will adaptively increase their

T_{d, j} [k + 1]

to accommodate the lagging UAV i, achieving flight-time consistency. Thus, constraint

S 2_{2}

is satisfied.

Algorithm 2 Flight-Time-Consistent Algorithm

Input: current time t, environmental map M, communication graph matrix L, and so on.

1. Initialization (satisfies S 2_{1} \land S 2_{5} \land S 2_{6} while neglecting the obstacle avoidance constraints S 2_{3} \land S 2_{4})
    ε = ε_{i} = 0, i = 1, 2, \dots, N; k = 1
    (P_{i}^{0}, T_{i}) \leftarrow GLOBAL_PLANNING (M, w P t_{i}, T_{i})

2. Replanning strategy (satisfies all constraints S 2_{1} \land S 2_{2} \land S 2_{3} \land S 2_{4} \land S 2_{5} \land S 2_{6})
    While p_{i} \neq w P t_{i} (e n d) do
        w P t_{i, l o c a l} \leftarrow A^{*} (M, p_{i}, p_{i, n e x t P o i n t})
        Calculate Γ (w P t_{i, l o c a l})
        Calculate T_{d, i} [k]
        (P_{i}^{k}, T_{l, i} [k]) \leftarrow LOCAL_PLANNING (M, w P t_{i, l o c a l}, T_{d, i} [k])
        While t < t [k] + τ_{p l a n n i n g} do
            U_{i} \leftarrow CONTROL (P_{i}^{k})
            (X_{i}, M) \leftarrow UAVSYSTEM (U_{i})
        End While
        Update ε, ε_{i} and p_{i, n e x t P o i n t}
        k = k + 1
        check()
    End While

Theorem 1.

Considering sub-problem 2 (11) with the initial trajectory obtained from Step 1, and incorporating the consistency variable (12) together with the desired flight time (13), the flight-time coordination framework can be established through the flight-time-consistent algorithm in Algorithm 2. Within this framework, the constraint

S 2_{2}

in (11) is guaranteed, and the overall solution of sub-problem 2 can be ultimately achieved under Assumption 2 and appropriate selection of design parameters.

Assumption 2.

1. In Equations (12) and (13), $δ [k]$ is uniformly bounded, i.e., $∥ δ [k] ∥ \leq Δ < \infty$ . This assumption is ensured by the local A*-based trajectory planner: in our scenarios, A* can always find a feasible path within finite time [33].
2. In Equation (15), $H_{1, i}$ and $H_{2, i}$ are both bounded, i.e., $0 < \underline{H}_{1} \leq λ_{\min} (H_{1} [k]) \leq λ_{\max} (H_{1} [k]) \leq \bar{H}_{1}$ and $0 < \underline{H}_{2} \leq λ_{\min} (H_{2} [k]) \leq λ_{\max} (H_{2} [k]) \leq \bar{H}_{2}, \forall k$ . By the definitions of $H_{1, i}$ and $H_{2, i}$ , these bounds hold naturally.
3. The communication topology L contains at least one spanning tree.

Proof.

Substituting Equation (13) into Equation (12) with $T_{l, i} [k] = T_{d, i} [k] - δ_{i} [k]$ , we have

\begin{matrix} ε_{i} [k + 1] = - H_{1, i} [k] [\sum_{j = 1}^{N} a_{i, j} w_{i, j} (ε_{i} [k] - ε_{j} [k]) + b_{i, N + 1} w_{i, N + 1} (ε_{i} [k] - ε [k])] - H_{1, i} [k] + H_{2, i} [k], \end{matrix}

(15)

where

H_{2, i} = (t_{i, n e x t P o i n t} + δ_{i} [k] + τ_{c o n t r o l}) / T

,

H_{1, i} = Γ (w P t_{i, l o c a l}) / (V_{c r u i s e, i} T)

.

Under the Assumption 2, rewrite Equation (15) in matrix form:

ε [k + 1] = - H_{1} [k] \bar{L} ε [k] - H_{1} [k] + H_{2} [k],

(16)

where

\bar{L} = [l_{i, j}] \in R^{(N + 1) \times (N + 1)}

is the augmented Laplacian matrix that incorporates the absolute time reference

ε

. Define

a_{i, N + 1} ≜ b_{i, N + 1}

and

w_{N + 1, j} = 0, \forall j

. Then,

l_{i, j} = - a_{i, j} w_{i, j}

,

l_{i, i} = \sum_{i \neq j} a_{i, j} w_{i, j}

,

l_{N + 1, j} = 0

,

\forall i \in {1, \dots, N}, \forall j \in {1, \dots, N + 1}

. Referring to Proposition 1 in [29], a transformation matrix

I \in R^{(N - 1) \times N}

for the complementary consensus subspace is introduced such that

ϑ [k] = I ε [k] .

(17)

I satisfies

I 1_{n} = 0

. This implies that

ϑ = 0

is equivalent to

ε \in span {1_{n}}

, meaning that the UAVs achieve flight-time coordination at step k. Choosing

I^{T} I = I_{n \times n} - \frac{1}{n} 1_{n} 1_{n}^{T}

, then,

I^{T} \bar{L} I = \bar{L}

can be further derived based on

\bar{L} 1_{n} = 0

. Furthermore, the error system is

\begin{matrix} ϑ [0] & = 0, \\ ϑ [k + 1] & = - I H_{1} [k] \bar{L} I^{T} ϑ [k] - I (H_{1} [k] - H_{2} [k]) . \end{matrix}

(18)

Equation (18) can be written as the discrete-time system

\begin{matrix} ϑ [k + 1] = Φ [k] ϑ [k] + d [k], k \in Z_{\geq 0}, \end{matrix}

(19)

where

Φ [k] = - I H_{1} [k] \bar{L} I^{T}, d [k] = - I (H_{1} [k] - H_{2} [k])

. Based on Assumption 2, we have

\begin{matrix} ∥ Φ [k] ∥ \leq {\bar{H}}_{1} λ_{\max} (\bar{L}) = Λ . \end{matrix}

(20)

Choose the parameter

γ

and adjust

\bar{L}

such that

Λ < γ < 1

. Then there exists a constant

C > 0

such that

\begin{matrix} ∥ Φ [k - 1] \dots Φ [0] ∥ \leq C γ^{k} . \end{matrix}

(21)

The iteration solution of Equation (19) is

\begin{matrix} ϑ [k] = Φ [k - 1] \dots Φ [0] ϑ [0] + \sum_{j = 0}^{k - 1} Φ [k - 1] \dots Φ [j + 1] d [j] . \end{matrix}

(22)

Since

\sum_{ℓ = 0}^{k - 1} γ^{ℓ} \leq \sum_{ℓ = 0}^{\infty} γ^{ℓ} = \frac{1}{1 - γ}

, we obtain

∥ ϑ [k] ∥ \leq C γ^{k} ∥ ϑ [0] ∥ + \frac{C}{1 - γ} \sup_{0 \leq t \leq k} ∥ d [t] ∥ .

Hence

ϑ [k]

converges exponentially to a bounded neighborhood related to

\sup ∥ d [t] ∥

.

If time variations are considered, the synchronization law (18) becomes a linear time-varying system, which remains uniformly asymptotically stable if

Γ (w P t_{i, l o c a l})

satisfies certain conditions. Similarly, when the effect of communication delays is taken into account, (15) introduces a time delay into the system. Convergence can still be guaranteed if the maximum delay does not exceed a specified threshold. □
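The bound derived in the proof can be checked numerically on a toy instance of Equation (19). The sketch below is only an illustration under stated assumptions: the matrices Φ[k] are random matrices rescaled to spectral norm γ (they stand in for, and are not computed from, the actual Laplacian-based expression), the disturbance d[k] is a random vector of fixed norm, and a nonzero initial error is used so that the exponential decay is visible. Under these assumptions the bound holds with C = 1.

```python
import numpy as np

rng = np.random.default_rng(0)
n, gamma, d_max, steps = 3, 0.6, 0.05, 60

theta = np.array([1.0, -0.5, 0.8])   # arbitrary nonzero start (assumption)
theta0_norm = np.linalg.norm(theta)
norms = []
for k in range(steps):
    A = rng.standard_normal((n, n))
    Phi = gamma * A / np.linalg.norm(A, 2)       # spectral norm equal to gamma < 1
    d = rng.uniform(-1.0, 1.0, n)
    d *= d_max / max(np.linalg.norm(d), 1e-12)   # disturbance with norm d_max
    theta = Phi @ theta + d                      # Equation (19)
    norms.append(np.linalg.norm(theta))

# Bound with C = 1: gamma^k * ||theta[0]|| + sup ||d|| / (1 - gamma)
bounds = [gamma ** (k + 1) * theta0_norm + d_max / (1.0 - gamma)
          for k in range(steps)]
print(max(n_k - b_k for n_k, b_k in zip(norms, bounds)) <= 0.0)
```

The error norm never exceeds the bound and settles inside the neighborhood of radius sup‖d‖ / (1 − γ), which is the behavior the theorem claims for the coordination error.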

4.2. DF-Based Planning Algorithm for TP

In this section, the DF-based planning method implements GLOBAL_PLANNING and LOCAL_PLANNING mentioned in Algorithm 2.

A nonlinear system

\dot{X} = f (X, U)

is said to be differentially flat if there exists a vector

Y = f_{1} (X, \dots, X^{(m_{1})}, \dots, U, \dots, U^{(m_{2})})

that satisfies the following properties:

Y is differentiable;
The system states X and control inputs U can be expressed as functions of Y and its finite-order derivatives $X = g_{1} (Y, \dots, Y^{(n_{1})})$ and $U = g_{2} (Y, \dots, Y^{(n_{2})}) .$

Such systems are termed flat systems, and Y is the differentially flat output. Through DF, the original system can be mapped to a lower-dimensional, linear flat space. By planning the flat outputs, algebraic solutions for X and U can be directly obtained.

Theorem 2.

Considering (1), a DF output for FW-UAVs is selected as

Y_{i} = \{p_{i}, α_{i}\}

. From this flat output, all other state variables

X_{i} = \{p_{i}, v_{a, i}^{e}, V_{a, i}, Θ_{i}, Ω_{i}\}

and control inputs

U_{i} = \{Ω_{c, i}, T e_{i}\}

can be uniquely derived. The applicability of this theorem is restricted to the case of translational-flatness parametrization [34,35].

Proof.

First, based on the definition of

{\dot{p}}_{i} = v_{a, i}^{e}

,

V_{a, i} = | | v_{a, i} | |

, it can be readily shown that

p_{i}, v_{a, i}^{e}, V_{a, i}

can be characterized by

Y_{i}

as follows:

V_{a, i} = g_{1} (p_{i}) .

(23)

Then, it can be derived that

R_{a, i}^{b} = g_{2} (α_{i}) .

(24)

Furthermore, the transformation matrix

R_{a, i}^{e}

can be expressed as

R_{a, i}^{e} = [x_{a, i}, y_{a, i}, z_{a, i}],

(25)

where

x_{a, i}

is the projection of the

x_{a}

axis of the air-relative frame onto the inertial frame. Given that the velocity vector is aligned with the

x_{a}

axis of the air-relative frame, we can obtain

x_{a, i}^{e} = \frac{v_{a, i}^{e}}{| | v_{a, i}^{e} | |} .

(26)

Based on (1), we can obtain

m_{i} {\dot{v}}_{a, i}^{e} + m_{i} g e_{2} = (T e_{i} \cos (α_{i}) - D_{i}) x_{a, i}^{e} + L_{i} y_{a, i}^{e} .

(27)

Dot multiplying both sides by

x_{a, i}^{e}

yields

{x_{a, i}^{e}}^{T} (m_{i} {\dot{v}}_{a, i}^{e} + m_{i} g e_{2}) = T e_{i} \cos (α_{i}) - D_{i} .

(28)

Substituting (28) into (27), we get

L_{i} y_{a, i}^{e} = m_{i} {\dot{v}}_{a, i}^{e} + m_{i} g e_{2} - {x_{a, i}^{e}}^{T} (m_{i} {\dot{v}}_{a, i}^{e} + m_{i} g e_{2}) x_{a, i}^{e} .

(29)

Considering

| | y_{a, i}^{e} | | = 1

, then

y_{a, i}^{e} = \frac{{\dot{v}}_{a, i}^{e} + g e_{2} - {x_{a, i}^{e}}^{T} ({\dot{v}}_{a, i}^{e} + g e_{2}) x_{a, i}^{e}}{| | {\dot{v}}_{a, i}^{e} + g e_{2} - {x_{a, i}^{e}}^{T} ({\dot{v}}_{a, i}^{e} + g e_{2}) x_{a, i}^{e} | |} .

(30)

By the orthogonality of the coordinate axes, we obtain

z_{a, i}^{e} = x_{a, i}^{e} \times y_{a, i}^{e} .

(31)

Thus

Θ_{i} = g_{3} (R_{b, i}^{e}) = g_{3} (R_{a, i}^{e} {R_{a, i}^{b}}^{T}) = g_{4} (Y_{i}) .

(32)

Considering

{⌊Ω_{i}⌋}_{\times} = {R_{b, i}^{e}}^{T} {\dot{R}}_{b, i}^{e}

, it is derived that

Ω_{i} = g_{5} (Y_{i}) .

(33)

Thus,

Ω_{c, i} = Ω_{i} = g_{5} (Y_{i})

.

According to (28),

T e_{i}

can be expressed as

T e_{i} = \frac{{x_{a, i}^{e}}^{T} (m_{i} {\dot{v}}_{a, i}^{e} + m_{i} g e_{2}) + D_{i}}{\cos (α_{i})} .

(34)

Considering

D_{i} = g_{6} (Y_{i})

, then

T e_{i} = g_{7} (Y_{i}) .

(35)

Therefore, all system states

X_{i}

and control inputs

U_{i}

are explicitly expressed as functions of

Y_{i}

and its finite-order time derivatives. □
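The frame construction in Equations (26), (30), and (31) and the thrust expression (34) can be sketched numerically as follows. This is a minimal illustration rather than the authors' implementation: the vertical unit vector e_2 is assumed to be the y-axis (consistent with the altitude being denoted y_i), and the function names, mass, drag, and flight states are hypothetical.

```python
import numpy as np

G = 9.81
E2 = np.array([0.0, 1.0, 0.0])  # assumed vertical unit vector (y is altitude)

def air_frame_axes(v, v_dot):
    """Columns are x_a, y_a, z_a expressed in the inertial frame."""
    v = np.asarray(v, dtype=float)
    v_dot = np.asarray(v_dot, dtype=float)
    x_a = v / np.linalg.norm(v)                      # Equation (26)
    f = v_dot + G * E2                               # specific force
    lift_dir = f - (x_a @ f) * x_a                   # numerator of Equation (30)
    y_a = lift_dir / np.linalg.norm(lift_dir)
    z_a = np.cross(x_a, y_a)                         # Equation (31)
    return np.column_stack((x_a, y_a, z_a))

def thrust(v, v_dot, mass, drag, alpha):
    """Equation (34): thrust recovered from the flat output and its derivatives."""
    v = np.asarray(v, dtype=float)
    v_dot = np.asarray(v_dot, dtype=float)
    x_a = v / np.linalg.norm(v)
    return (x_a @ (mass * v_dot + mass * G * E2) + drag) / np.cos(alpha)

# Steady level flight along x, then a level turn with lateral acceleration.
R_level = air_frame_axes([50.0, 0.0, 0.0], [0.0, 0.0, 0.0])
R_turn = air_frame_axes([50.0, 0.0, 0.0], [0.0, 0.0, 5.0])
te_level = thrust([50.0, 0.0, 0.0], [0.0, 0.0, 0.0], mass=10.0, drag=20.0, alpha=0.0)
```

In steady level flight the recovered frame coincides with the inertial axes and the thrust equals the drag; in the turn, the lift axis tilts toward the lateral acceleration while the three axes remain orthonormal.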

During the planning process, constraints

S 2

defined in the state space can be mapped to constraints

Y_{i}

in the flat output space. Subsequently, all desired

X_{i}

and

U_{i}

can be inversely mapped from the planned flat outputs. However, if the real-time state of the UAV is available, the control inputs

U_{i}

can be derived by introducing a feedback controller to achieve rapid tracking of the desired state variables, as shown in Figure 3. This feedback control eliminates the need for explicit inverse mapping calculations from flat outputs to desired controls, thereby significantly improving control efficiency and responsiveness.

When applying the DF-based approach, the TP problem is reformulated as an optimization problem, in which the following aspects must be taken into account: trajectory representation, flight time allocation, cost function definition, and constraint definition.

A. Trajectory representation

For

Y_{i}

, Bézier curves are employed for fitting. Further specific details on Bézier curves can be found in [36]. To simplify, the UAV’s cruising altitude

y_{i} (t) = H_{c}

is assumed to be fixed. The angle of attack

α_{i} (t)

can then be determined based on the lift–gravity balance equation:

α_{i} (t) = C_{L, i}^{- 1} (\frac{2 m_{i} g}{ρ V_{a, i}^{2} S_{i}}),

(36)

where

C_{L, i}^{- 1} (\cdot)

denotes the inverse mapping from the lift coefficient to

α_{i} (t)

. Consequently, only the horizontal positions

x_{i}, z_{i}

are planned. The Bézier curve parameterization for a dimension

μ = x, z

of one segment is given by the following:

\begin{matrix} x_{i} (t) = \{\begin{matrix} B_{1} (t) = s_{B_{1}} \sum_{h = 0}^{N_{p}} c_{B_{1}}^{h} b_{N_{p}}^{h} (\frac{t - T_{i, 1}}{s_{B_{1}}}), & T_{i, 1} \leq t \leq T_{i, 2}, \\ ⋮ \\ B_{N_{l, i} - 1} (t) = s_{B_{N_{l, i} - 1}} \sum_{h = 0}^{N_{p}} c_{B_{N_{l, i} - 1}}^{h} b_{N_{p}}^{h} (\frac{t - T_{i, N_{l, i} - 1}}{s_{B_{N_{l, i} - 1}}}), & T_{i, N_{l, i} - 1} \leq t \leq T_{i, N_{l, i}} . \end{matrix} \end{matrix}

(37)

where

b_{N_{p}}^{h} (t_{B}) = C (N_{p}, h) {t_{B}}^{h} {(1 - t_{B})}^{N_{p} - h}, C (N_{p}, h) = \binom{N_{p}}{h}, t_{B} \in [0, 1]

are the Bernstein basis polynomials,

N_{p}

is the degree of the curve, and

C (\cdot)

is the binomial coefficient. For the first piecewise segment

B_{1} (t)

from the beginning point to the next waypoint,

c_{B_{1}}^{h}

denotes the h-th control point for this specific curve segment. The parameters

s_{d}, d = B_{1}, \dots, B_{N_{l, i} - 1}

are scaling factors used to scale the time interval of each segment, where

N_{l, i}

is the total number of waypoints

w P t_{i}

or

w P t_{i, l o c a l}

in the segment.
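To make the parameterization in (37) concrete, the following minimal sketch evaluates a two-segment piecewise Bézier curve in one dimension. It assumes that the scaling factor of each segment equals the segment duration, which is how the normalized argument in (37) stays within [0, 1]; the knot times and control points are hypothetical values chosen so that the two segments join continuously.

```python
from math import comb
import numpy as np

def bernstein(n, h, tb):
    """Bernstein basis polynomial of degree n, index h, on tb in [0, 1]."""
    return comb(n, h) * tb**h * (1.0 - tb) ** (n - h)

def eval_piecewise_bezier(t, knots, ctrl, scales):
    """Evaluate (37) at time t.

    knots  : segment boundary times T_{i,1}, ..., T_{i,N}
    ctrl   : ctrl[d] holds the control points of segment d
    scales : scales[d] is the scaling factor s_d (assumed: segment duration)
    """
    d = int(np.clip(np.searchsorted(knots, t, side="right") - 1, 0, len(ctrl) - 1))
    tb = (t - knots[d]) / scales[d]
    n = len(ctrl[d]) - 1
    return scales[d] * sum(ctrl[d][h] * bernstein(n, h, tb) for h in range(n + 1))

# Hypothetical two-segment cubic curve.
knots = [0.0, 2.0, 5.0]
scales = [2.0, 3.0]
ctrl = [[0.0, 0.5, 1.0, 1.5], [1.0, 1.2, 1.5, 2.0]]
x_mid = eval_piecewise_bezier(1.0, knots, ctrl, scales)
x_join = eval_piecewise_bezier(2.0, knots, ctrl, scales)
x_end = eval_piecewise_bezier(5.0, knots, ctrl, scales)
```

Because the control points are multiplied by the scaling factor, the physical position at a segment boundary is the scaled end control point, which is why continuity across segments involves both the control points and the scaling factors.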

B. Flight Time Allocation

The desired arrival time at each waypoint along the Bézier trajectory is allocated in proportion to segment length:

T_{i, d} = T_{i, d - 1} + T_{i} \frac{| | p_{i, d + 1} - p_{i, d} | |}{\sum_{q = 1}^{N_{l, i} - 1} | | p_{i, q + 1} - p_{i, q} | |} .

(38)
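A length-proportional allocation in the spirit of (38) can be sketched in a few lines. The cumulative form used here assigns to each waypoint the time at which the preceding segments have been flown; this indexing convention, the function name, and the waypoint coordinates are assumptions made for the example.

```python
import numpy as np

def allocate_times(waypoints, total_time, t_start=0.0):
    """Arrival time at each waypoint, proportional to cumulative segment length."""
    pts = np.asarray(waypoints, dtype=float)
    seg = np.linalg.norm(np.diff(pts, axis=0), axis=1)   # segment lengths
    return t_start + np.concatenate(([0.0], np.cumsum(total_time * seg / seg.sum())))

# Hypothetical closed route with segment lengths 300 m, 400 m, and 500 m.
times = allocate_times([(0, 0), (300, 0), (300, 400), (0, 0)], total_time=120.0)
print(times)
```

With a total duration of 120 s and a 1200 m route, the three segments receive 30 s, 40 s, and 50 s respectively, so the final arrival time equals the prescribed duration.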

C. Cost Function Definition

Based on the aforementioned definitions, for a single UAV, the cost function to be optimized is reformulated to minimize the motion jerk and the additional cost:

\begin{matrix} \min J_{i} = \min (\sum_{μ \in \{x_{i}, z_{i}\}} (J_{μ, i}) + J_{e, i}) = \min (\sum_{μ \in \{x_{i}, z_{i}\}} \int_{t = 0}^{T_{i}} {({p_{i, μ}}^{(3)} (t))}^{2} d t + \int_{t = 0}^{T_{i}} j_{e, i}^{2} d t), \end{matrix}

(39)

where

J_{μ, i}

represents the minimization of the UAV’s motion jerk. Taking the

x_{i}

-direction as an example, the cost function can be expressed as

J_{x_{i}, i} = {c_{x, i}}^{T} Q_{x, i} c_{x, i}

, where

c_{x, i} = \{c_{B_{1}}^{0}, \dots, c_{B_{N_{l, i} - 1}}^{N_{p}}\}

is the vector of all control points in the

x_{i}

-direction, which are the variables to be optimized.

Q_{x, i}

is the positive semi-definite Hessian matrix of the cost function.
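One way to obtain such a Hessian for a single Bézier segment is sketched below. It is an illustration under simplifying assumptions, not the paper's construction: the segment is defined on the unit interval (the scaling factors s_d are omitted), the third derivative is expressed through third-order differences of the control points, and the Gram matrix of the Bernstein basis is taken from the standard closed-form integral of products of Bernstein polynomials.

```python
from math import comb
import numpy as np

def jerk_hessian(n):
    """Q such that c^T Q c equals the integral over [0, 1] of the squared
    third derivative of a degree-n Bezier curve with control points c."""
    # Third-order forward differences map n+1 control points to n-2 jerk control points.
    D = np.eye(n + 1)
    for _ in range(3):
        D = np.diff(D, axis=0)
    D = D * (n * (n - 1) * (n - 2))
    # Gram matrix of the degree-(n-3) Bernstein basis on [0, 1].
    m = n - 3
    gram = np.array([[comb(m, i) * comb(m, j) / ((2 * m + 1) * comb(2 * m, i + j))
                      for j in range(m + 1)] for i in range(m + 1)])
    return D.T @ gram @ D

# Check on p(t) = t^3, whose jerk is the constant 6, so the cost is 36.
Q5 = jerk_hessian(5)
c_cubic = np.array([0.0, 0.0, 0.0, 0.1, 0.4, 1.0])   # t^3 written in degree 5
cost_cubic = float(c_cubic @ Q5 @ c_cubic)
```

The resulting matrix is symmetric and positive semi-definite; its null space contains the control-point vectors of all polynomials of degree two or less, since their jerk vanishes.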

For the symmetric attitude soft constraint

J_{e, i}

, a barrier function

Ψ (x, x_{\max}, x_{s m o o t h})

is defined to enforce it as a soft constraint:

Ψ (x, x_{\max}, x_{s m o o t h}) = K_{s i g} \frac{S i g (x, x_{\max}, x_{s m o o t h}) | | x | |}{ε_{s i g}},

(40)

where

ε_{s i g}

is a very small positive number to prevent a logarithm of zero or negative values, and

K_{s i g}

is a scaling factor.

S i g (x, x_{\max}, x_{s m o o t h}) = 1 / (1 + \exp (10 (x_{\max} - | | x | |) / x_{s m o o t h}))

.

Ψ (x, x_{\max}, x_{s m o o t h})

has the following properties:

$Ψ (x, x_{\max}, x_{s m o o t h}) \geq 0$ ;
When $| | x | | \leq x_{\max} - x_{s m o o t h}$ , $Ψ \to 0$ ;
When $| | x | | > x_{\max}$ , $Ψ \geq K_{s i g} (1 - ε) | | x | | / ε_{s i g}$ , which reaches a maximum value.

By minimizing this barrier function

Ψ \to 0

, it can be guaranteed that

| | x | | \leq x_{\max} - x_{s m o o t h}

. Thus, constraint

S 2_{7}

is transformed into

j_{e, i} = Ψ (ϕ_{i} (t), ϕ_{i, m a x}, ϕ_{i, s m o o t h}) + Ψ (θ_{i} (t), θ_{i, m a x}, θ_{i, s m o o t h}) .

(41)
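The behavior of the barrier function can be inspected with a direct transcription of (40), the sigmoid definition, and (41). The numerical values of K_sig, ε_sig, and the smoothing widths below are hypothetical, since the text does not state them; only the 30° and 45° attitude limits are taken from the simulation settings.

```python
import numpy as np

def sig(x, x_max, x_smooth):
    """Smooth switch: close to 0 well inside the limit, close to 1 beyond it."""
    return 1.0 / (1.0 + np.exp(10.0 * (x_max - np.abs(x)) / x_smooth))

def barrier(x, x_max, x_smooth, k_sig=1.0, eps_sig=1e-3):
    """Equation (40); k_sig and eps_sig are assumed values."""
    return k_sig * sig(x, x_max, x_smooth) * np.abs(x) / eps_sig

def attitude_penalty(phi, theta, phi_max, phi_smooth, theta_max, theta_smooth):
    """Equation (41): sum of the roll and pitch barrier terms."""
    return barrier(phi, phi_max, phi_smooth) + barrier(theta, theta_max, theta_smooth)

phi_max, theta_max = np.radians(30.0), np.radians(45.0)
smooth = np.radians(5.0)                                   # assumed smoothing width
inside = attitude_penalty(np.radians(10.0), np.radians(10.0),
                          phi_max, smooth, theta_max, smooth)
outside = attitude_penalty(np.radians(35.0), np.radians(10.0),
                           phi_max, smooth, theta_max, smooth)
print(inside, outside)
```

Well inside the limits the penalty is numerically negligible, while a roll angle beyond its limit produces a penalty several orders of magnitude larger, which is what drives the optimizer back into the admissible attitude range.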

D. Constraint Definition

Waypoint Constraints: For both the initial reference trajectory

P_{i}^{0}

and subsequent local replanning trajectories

P_{i}^{k}

,

S 2_{1}

must be satisfied. In the initial global planning phase,

P_{i}^{0}

is required to pass through all assigned waypoints

w P t_{i}

and satisfy initial and terminal position, velocity, and acceleration constraints. In the online replanning phase,

P_{i}^{k}

only needs to satisfy the start and end point constraints of the current replanning segment, as intermediate waypoints are dynamically handled by the safe flight corridors. Considering the properties of Bézier curves, these waypoint constraints can be transformed into constraints on the curve control points

c_{d}^{h}, d = B_{1}, \dots, B_{N_{l, i} - 1}, h \in \{0, N_{p}\}

and their derivatives. Taking the start point constraint of the first curve segment for

x_{i}

-direction as an example, we have

\begin{matrix} c_{B_{1}}^{0} s_{B_{1}} & = x_{i, s t a r t}, p_{i, s t a r t} = w P t_{i} (1), \\ c_{B_{1}}^{0, (1)} & = {\dot{x}}_{i, s t a r t}, \\ c_{B_{1}}^{0, (2)} {s_{B_{1}}}^{- 1} & = {\ddot{x}}_{i, s t a r t}, \end{matrix}

(42)

where

c_{B_{1}}^{h, (l)}, h = 0, \dots N_{p}, l = 0, \dots 2

represents the l-th order derivative of the h-th control point of this curve segment,

c_{B_{1}}^{h, (l)} = \frac{N_{p}!}{(N_{p} - l)!} (c_{B_{1}}^{h + 1, (l - 1)} - c_{B_{1}}^{h, (l - 1)})

.

Continuity Constraints: To ensure smooth and continuous UAV motion, adjacent Bézier curve segments must be joined with continuity up to the second derivative. Taking the first two curve segments for

x_{i}

-direction as an example, this constraint can be expressed as equality constraints on control points

c_{B_{1}}^{N_{p}}

,

c_{B_{2}}^{0}

and their derivatives, i.e.,

c_{B_{1}}^{N_{p}, (l)} s_{B_{1}}^{1 - l} = c_{B_{2}}^{0, (l)} s_{B_{2}}^{1 - l}, l = 0, 1, 2 .

(43)

Safe Flight Corridors Constraint: During the online replanning phase, to ensure obstacle avoidance constraints

S 2_{3} \land S 2_{4}

, safe flight corridor constraints are introduced. Taking the

x_{i}

-direction as an example, considering the convex hull property of Bézier curves, ensuring that all control points

c_{d}^{h}, d = B_{1}, \dots, B_{N_{l, i} - 1}, h = 0, \dots, N_{p}

of a curve segment lie within a defined convex hull guarantees that the entire trajectory segment also remains within that hull. This property also applies to the derivatives of the curve.

First, the local replanning area is discretized into a grid. Then, an A* algorithm is utilized to generate a sequence of central path points

w P t_{i, l o c a l}

that satisfy the obstacle avoidance constraints

S 2_{3} \land S 2_{4}

. Subsequently, each sub-waypoint is expanded to form a rectangular safe corridor

Ξ_{d} = \{p \mid \forall p \in Ξ_{d}, p \text{ satisfies } S 2_{3} \land S 2_{4}\}

, ultimately forming the total safe corridor for the local planning interval

Ξ = \cup Ξ_{d}, Ξ_{d} \cap Ξ_{d + 1} \neq ⌀

. For a certain curve segment d within the interval, it is only necessary to apply constraints to all control points of the curve, ensuring that they satisfy

x_{d, \min} \leq c_{d}^{h} s_{d} \leq x_{d, \max},

(44)

where

x_{d, \min}, x_{d, \max}

are the lower and upper bounds of the safe corridor for the d-th curve segment.
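The convex hull argument behind (44) can be verified numerically: if every scaled control point lies inside the corridor bounds, so does every sampled point of the curve. The sketch below uses hypothetical control points and bounds in one dimension. Note that the test on control points is sufficient but not necessary, so a curve may remain inside a corridor even when some control point lies outside it.

```python
from math import comb
import numpy as np

def bezier_points(ctrl, scale, samples=200):
    """Sample the scaled Bezier segment at evenly spaced parameters in [0, 1]."""
    n = len(ctrl) - 1
    tb = np.linspace(0.0, 1.0, samples)
    basis = np.array([comb(n, h) * tb**h * (1.0 - tb) ** (n - h) for h in range(n + 1)])
    return scale * (np.asarray(ctrl, dtype=float) @ basis)

def inside_corridor(ctrl, scale, x_min, x_max):
    """Constraint (44): all scaled control points lie within the corridor bounds."""
    scaled = scale * np.asarray(ctrl, dtype=float)
    return bool(np.all((scaled >= x_min) & (scaled <= x_max)))

ctrl, scale = [10.0, 40.0, 25.0, 60.0, 50.0], 2.0   # hypothetical segment
ok_wide = inside_corridor(ctrl, scale, 0.0, 130.0)
ok_narrow = inside_corridor(ctrl, scale, 0.0, 100.0)
curve = bezier_points(ctrl, scale)
```

For the wide corridor the check passes and the sampled curve stays within the range spanned by the scaled control points; for the narrow corridor the check fails because one control point exceeds the upper bound.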

Kinematic Constraints: Since the derivatives of control points also possess the convex hull property, velocity and acceleration constraints can be directly imposed on the control point derivatives. Considering constraints

S 2_{5} \land S 2_{6}

, taking a curve segment for the

x_{i}

-direction as an example, we have

\begin{matrix} - v_{i, \max} & \leq c_{d}^{h, (1)} \leq v_{i, \max}, \\ - a_{i, \max} m_{i} g & \leq c_{d}^{h, (2)} {s_{d}}^{- 1} \leq a_{i, \max} m_{i} g . \end{matrix}

(45)
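The following sketch illustrates why bounding the derivative control points bounds the velocity and acceleration along the whole segment. It uses the standard Bézier derivative rule (differences of adjacent control points multiplied by the degree), with the scaling factor entering only the acceleration, which matches the powers of s_d appearing in (42) and (45). The control points and the scaling factor are hypothetical.

```python
from math import comb
import numpy as np

def velocity_control_points(ctrl):
    """First-derivative control points; the scaling factor cancels for velocity."""
    n = len(ctrl) - 1
    return n * np.diff(np.asarray(ctrl, dtype=float))

def acceleration_control_points(ctrl, scale):
    """Second-derivative control points, divided by the scaling factor."""
    n = len(ctrl) - 1
    return n * (n - 1) * np.diff(np.asarray(ctrl, dtype=float), n=2) / scale

def bezier(ctrl, tb):
    """Evaluate a Bezier curve with control points ctrl at parameters tb."""
    n = len(ctrl) - 1
    return sum(ctrl[h] * comb(n, h) * tb**h * (1.0 - tb) ** (n - h) for h in range(n + 1))

ctrl = [0.0, 0.0, 0.0, 2 / 7, 5 / 7, 1.0, 1.0, 1.0]   # hypothetical degree-7 segment
scale = 10.0                                           # assumed segment duration in s
v_ctrl = velocity_control_points(ctrl)
a_ctrl = acceleration_control_points(ctrl, scale)
tb = np.linspace(0.0, 1.0, 201)
v_curve = bezier(v_ctrl, tb)
a_curve = bezier(a_ctrl, tb)
```

Here the largest velocity control point is 3 while the sampled velocity peaks at 1.875, so constraining the control points is conservative: it guarantees the limit at the price of some unused margin.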

For

μ = z_{i}

, turning constraints also need to be considered.

In summary, the TP problem can be formulated as a constrained optimization problem. The waypoint visit and continuity constraints are transformed into linear equality constraints on the control points. Safe corridor constraints and kinematic limits are represented as linear inequality constraints on control points and their derivatives. Finally, attitude constraints are incorporated as soft constraints through barrier functions. Thus, this planning problem can be formally described as

\begin{matrix} \min {c_{i}}^{T} Q_{i} c_{i} + J_{e, i} \\ \text{s.t.} \\ A_{e q, i} c_{i} = b_{e q, i}, \\ A_{i e, i} c_{i} \leq b_{i e, i}, \end{matrix}

(46)

where

c_{i} = [c_{B_{1}, x}^{0}, \dots, c_{B_{1}, x}^{N_{p}}, \dots, c_{B_{N_{l, i} - 1}, x}^{0}, \dots, c_{B_{N_{l, i} - 1}, x}^{N_{p}}, c_{B_{1}, z}^{0}, \dots, c_{B_{N_{l, i} - 1}, z}^{N_{p}}]

. When

J_{e, i}

is not considered, this problem reduces to a convex quadratic programming problem, which can be efficiently solved using a QP solver to obtain an initial feasible trajectory. Subsequently,

J_{e, i}

is considered for further optimization to obtain the final trajectory.
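A stripped-down instance of (46) can be solved in closed form to show the structure of the convex QP stage. The sketch below keeps only the equality constraints (start and end position, velocity, and acceleration) for a single degree-7 segment on the unit interval, and solves the resulting KKT system with NumPy instead of a QP solver. The inequality constraints and the soft term J_{e,i} are deliberately omitted, so this is an assumption-laden illustration and not the paper's MOSEK-based pipeline.

```python
from math import comb
import numpy as np

def jerk_hessian(n):
    """Q with c^T Q c = integral of squared jerk on [0, 1] (unit scaling assumed)."""
    D = np.eye(n + 1)
    for _ in range(3):
        D = np.diff(D, axis=0)
    D = D * (n * (n - 1) * (n - 2))
    m = n - 3
    gram = np.array([[comb(m, i) * comb(m, j) / ((2 * m + 1) * comb(2 * m, i + j))
                      for j in range(m + 1)] for i in range(m + 1)])
    return D.T @ gram @ D

def boundary_rows(n):
    """Rows of A_eq: position, velocity, acceleration at both segment ends."""
    A = np.zeros((6, n + 1))
    A[0, 0] = 1.0
    A[1, :2] = n * np.array([-1.0, 1.0])
    A[2, :3] = n * (n - 1) * np.array([1.0, -2.0, 1.0])
    A[3, -1] = 1.0
    A[4, -2:] = n * np.array([-1.0, 1.0])
    A[5, -3:] = n * (n - 1) * np.array([1.0, -2.0, 1.0])
    return A

n = 7
Q, A = jerk_hessian(n), boundary_rows(n)
b = np.array([0.0, 0.0, 0.0, 1.0, 0.0, 0.0])      # rest-to-rest move from 0 to 1
K = np.block([[2.0 * Q, A.T], [A, np.zeros((6, 6))]])
rhs = np.concatenate((np.zeros(n + 1), b))
c = np.linalg.solve(K, rhs)[: n + 1]               # optimal control points
cost = float(c @ Q @ c)
```

For this rest-to-rest case the solution reproduces the classical minimum-jerk quintic, whose jerk cost on the unit interval is 720. Adding corridor and kinematic bounds turns the KKT system into a genuine inequality-constrained QP, which is where a dedicated solver becomes necessary.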

5. Simulation and Analysis

The performance of the algorithm is evaluated and validated in this section. All algorithms are executed on the same hardware platform: Intel i5-9300H CPU, 32 GB RAM, and NVIDIA GeForce GTX 1650 GPU. Due to the fixed cruising altitude assumption for UAVs, the simulation environment is simplified to a two-dimensional scenario. Target positions

P_{t a r g e t}

and no-fly zone positions

P_{o b s}

are randomly generated within a 2.5 km × 2.5 km area. All UAVs are assumed to depart from the origin

(0, 0)

m, and return to

(0, 0)

m after completing the MTR mission. A precise aerodynamic force model and a 6-DOF kinematic model are constructed, while attitude control is provided by a simulated PX4 controller. Additionally, the UAV’s detectable range is set to 200 m. The MOSEK optimizer is employed to solve the convex QP problems arising from the trajectory generation. The main parameters of the UAVs and the environment are listed in Table 3.

5.1. Time-Balanced Clustering Simulation

The simulation results of Algorithm 1 are shown in Figures 4–6. Figure 4 illustrates the convergence of flight times during the optimization process: the green-shaded region indicates the execution of the target reallocation, while the red-shaded region denotes the application of the intra-cluster optimization. As the optimization progresses, the individual UAV flight times gradually converge, with the final flight times of the four UAVs being

T_{1} = 180 s

,

T_{2} = 185 s

,

T_{3} = 185 s, T_{4} = 186 s

. The average flight time decreases from 218 s to 184 s. Figure 5 shows the target allocation and visiting order. Each target position is represented by a solid dot, where lighter colors indicate earlier reconnaissance and darker colors indicate later reconnaissance within a UAV’s sequence. The different colored lines represent the distinct routes planned for each UAV. Specifically, Figure 5a shows the initial target routes

w P t

, where targets belonging to each UAV are spatially grouped with minimal differences in the number of assigned targets. Figure 5b illustrates the final adjusted target allocation and optimized target routes

w P t

. Notably, since

U A V_{1}

has the highest optimal cruising speed, its optimized route is consequently the longest, followed by

U A V_{2}

, while

U A V_{1}

has the shortest route, demonstrating the algorithm’s ability to balance mission durations despite heterogeneity.

Figure 6a shows the scalability of Algorithm 1 under various configurations

(M, N)

, presenting runtime, time variance, and constraint-violation ratios (percentage of optimizations completed within the threshold). The computational efficiency of the time-balanced clustering is further evaluated against representative MTA algorithms ([29,37]), as illustrated in Figure 6b with

M = 5 N

.

5.2. Flight-Time-Consistent Planning Simulation

In this section, the simulation results of Algorithm 2 are shown in Figures 7–9. The allowed speeds for each UAV are

V_{a, i, \min} = 0.5 V_{c r u i s e, i}

and

V_{a, i, \max} = 2 V_{c r u i s e, i}

. The desired mission duration for all UAVs is selected as

T = 185

s. All UAVs are subject to specific kinematic constraints:

n_{i, x_m a x} = 2

,

ϕ_{i, m a x} = 30^{\circ}, θ_{i, m a x} = 45^{\circ}

,

R_{t u r n} = 100 m

. The replanning time interval

τ_{p l a n n i n g} = 5 s

, the control time interval

τ_{c o n t r o l} = 0.05 s

, the coordination coefficients

a_{i, j} = b_{i, j} = 1

, and the adjacency value

w_{i, i + 1} = 1, i = 1, 2, 3

. The initial and terminal velocities for all UAVs are set to

V_{a, i} (0) = V_{a, i} (T) = 50 m / s, \forall i

. During fixed-wing flight, the Bank-to-Turn coordinated turning method is employed to generate

Ω_{c, i}

, and attitude control is achieved through PX4 inner-loop attitude control. To achieve inner–outer loop decoupling and reduce system complexity, the inner-loop bandwidth is designed to be ten times larger than that of the outer loop.

Figure 7 illustrates both the global and local planning processes for the UAVs. In Figure 7a, the colored solid curves depict the optimized initial reference trajectory

P_{i}^{0}

which initially disregards no-fly zones, with lighter colors indicating lower speeds. Blue circular regions represent the no-fly zones, and for clarity, only those impacting the trajectories are shown. The dashed curves represent the actual flight trajectories obtained after local online replanning by each UAV. Figure 7b provides a magnified local view, demonstrating that when a global reference trajectory conflicts with a no-fly zone, a safe local trajectory can be planned through the dynamically generated safe flight corridor, enabling the UAV to circumvent the obstacle and satisfy the safety constraints.

Figure 8a,b collectively demonstrate the efficacy of the proposed constraint enforcement framework. Figure 8a presents the velocity profiles of all UAVs. The straight horizontal lines indicate the upper and lower velocity constraint bounds. The dashed curves in the green-shaded region on the left represent the UAV velocities planned without explicit velocity constraints, while the solid curves on the right show the UAV velocities when constraints are actively enforced. Figure 8b displays the overload curve for UAV1. The dashed curve in the green-shaded region on the left indicates the axial overload of UAV1 when overload constraints are not considered, whereas the solid curve on the right represents the UAV’s overload when these constraints are actively applied.

Figure 9 illustrates the temporal evolution of the consistency variable

ε_{i}

and the global absolute time reference

ε

for all UAVs. The green-shaded region on the left shows the variable changes when the flight-time-consistent strategy is not introduced, highlighting significant deviations. In contrast, the right side demonstrates the changes after introducing the proposed algorithm. This comparison clearly shows that all UAVs can strictly adhere to the absolute time reference for real-time planning, thereby successfully achieving precise flight time coordination.

6. Conclusions

This article investigated the synchronous and coordinated MTA and TP problem for heterogeneous FW-UAVs under dynamic environments and kinematic constraints. A time-balanced clustering-based assignment algorithm that minimizes the overall mission completion time while simultaneously ensuring equitable flight-time balance among heterogeneous UAVs is developed. Furthermore, a robust DF-based online planning algorithm, which guarantees the generation of kinematically feasible, collision-free, and time-synchronized trajectories, is proposed. The proposed framework decomposes the original large-scale and tightly coupled optimization problem into two sub-problems, thereby enabling scalable and real-time implementation. Rigorous theoretical analysis and extensive simulations demonstrated the feasibility and effectiveness of the proposed approach in achieving synchronized MTR missions.

The hierarchical MTA–TP framework proposed in this paper is compatible with the onboard computational architecture of most existing UAV platforms. In addition, this framework can be further integrated with AI-driven autonomous systems. For example, generative AI algorithms [38] could be incorporated into the upper-layer decision-making module to provide high-quality initial cluster partitions, thereby reducing computational time. Such integration represents a promising direction for future development.

Author Contributions

Conceptualization, X.W. and J.Z.; methodology, X.W. and Z.M.; software, X.W.; validation, X.W.; formal analysis, J.Z.; investigation, J.Z.; resources, J.Z.; data curation, C.C.; writing—original draft preparation, X.W.; writing—review and editing, X.W.; visualization, X.W.; supervision, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions of this study are presented in this article, and further inquiries can be directed towards the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Lu, Q.; Qiu, Y.; Guan, C.; Wang, H.; Zhu, M.; Xu, B.; Li, W.; Fan, Z. Coordinated multi-UAV reconnaissance scheme for multiple targets. Appl. Sci. 2023, 13, 10920.
2. Hu, L.; Xi, B.; Yi, G.; Zhao, H.; Zhong, J. A multiple heterogeneous UAVs reconnaissance mission planning and re-planning algorithm. J. Syst. Eng. Electron. 2022, 33, 1190–1207.
3. Pang, Q.; Hu, Y.; Li, W.; Zhao, Y.; Zhu, L. Research on multi-UAV cooperative reconnaissance mission planning methods: An overview. Telecommun. Eng. 2019, 59, 741–748.
4. Mahmud, I.; Cho, Y. Detection avoidance and priority-aware target tracking for UAV group reconnaissance operations. J. Intell. Robot. Syst. 2018, 92, 381–392.
5. Jia, G.W.; Wang, J.F. Research review of UAV swarm mission planning method. J. Syst. Eng. Electron. 2021, 43, 99–111.
6. Xue, W.; Qi, J.; Shao, G.; Xiao, Z.; Zhang, Y.; Zhong, P. Low-rank approximation and multiple sparse constraint modeling for infrared low-flying fixed-wing UAV detection. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2021, 14, 4150–4166.
7. Kovacik, L.; Novak, A.; Kazda, A.; Lusiak, T. Automatic Commercial Aircraft Formation Flight. In Proceedings of the NTAD 2019, Praha, Czech Republic, 19–20 September 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 106–109.
8. Mosterman, P.J.; Sanabria, D.E.; Bilgin, E.; Zhang, K.; Zander, J. A heterogeneous fleet of vehicles for automated humanitarian missions. Comput. Sci. Eng. 2014, 16, 90–95.
9. Bänziger, T.; Kunz, A.; Wegener, K. Optimizing human–robot task allocation using a simulation tool based on standardized work descriptions. J. Intell. Manuf. 2020, 31, 1635–1648.
10. Liu, X.-F.; Zhang, J.; Wang, J. Cooperative particle swarm optimization with a bilevel resource allocation mechanism for large-scale dynamic optimization. IEEE Trans. Cybern. 2023, 53, 1000–1011.
11. Pendharkar, P.C. An ant colony optimization heuristic for constrained task allocation problem. J. Comput. 2015, 7, 37–47.
12. Lu, W.; Liu, H.; Ren, Z.; Gao, Q.; Liu, D.; Wang, X.; Guo, M. Task assignment with minimum cost for multi-UAV system via reinforcement learning. In Proceedings of the 2023 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1654–1658.
13. Kucera, J.; Novak Sedlackova, A.; Voumik, L.C. Synthetic Data and Image Processing Tools, Immersive 3D and Digital Contact Tracing Technologies, and Cognitive Artificial Intelligence and Spatial Computing Algorithms in the Metaverse Interactive Environment. Rev. Contemp. Philos. 2023, 21, 85–101.
14. Patel, R.; Rudnick-Cohen, E.; Azarm, S.; Otte, M.; Xu, H.; Herrmann, J.W. Decentralized task allocation in multi-agent systems using a decentralized genetic algorithm. In Proceedings of the International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 3770–3776.
15. Choi, H.J.; Kim, Y.D.; Kim, H.J. Genetic algorithm based decentralized task assignment for multiple unmanned aerial vehicles in dynamic environments. Int. J. Aeronaut. Space Sci. 2011, 12, 163–174.
16. Geng, N.; Chen, Z.; Nguyen, Q.A.; Gong, D. Particle swarm optimization algorithm for the optimization of rescue task allocation with uncertain time constraints. Complex Intell. Syst. 2021, 7, 873–890.
17. Li, S.; He, X.; Xu, X.; Zhao, T.; Song, C.; Li, J. Weapon-target assignment strategy in joint combat decision-making based on multi-head deep reinforcement learning. IEEE Access 2023, 11, 113740–113751.
18. Elango, M.; Nachiappan, S.; Tiwari, M.K. Balancing task allocation in multi-robot systems using k-means clustering and auction based mechanisms. Expert Syst. Appl. 2011, 38, 6486–6491.
19. Li, X. Dynamic multiobjective optimization for thrust allocation in ship application. Ocean Eng. 2020, 218, 108187.
20. Zhu, D.; Zhou, B.; Yang, S.X. A novel algorithm of multi-AUVs task assignment and path planning based on biologically inspired neural network map. IEEE Trans. Intell. Veh. 2021, 6, 333–342.
21. Puente-Castro, A.; Rivero, D.; Pazos, A.; Fernandez-Blanco, E. A review of artificial intelligence applied to path planning in UAV swarms. Neural Comput. Appl. 2022, 34, 153–170.
22. Li, M.; Zhang, H. AUV 3D path planning based on A* algorithm. In Proceedings of the Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 11–16.
23. Yan, Z.; Pan, X.; Yang, Z.; Yue, L. Formation control of leader-following multi-UUVs with uncertain factors and time-varying delays. IEEE Access 2019, 7, 118792–118805.
24. Wang, X.; Liu, H.; Gao, Q. Data-driven decision making and near-optimal path planning for multiagent system in games. IEEE J. Miniaturization Air Space Syst. 2023, 4, 320–328.
25. Sujin, B.; Kshirsagar, P.R.; Tak, T.K.; Sonekar, S.V. Enhancing UAV Path Planning in Dynamic Environments with RRT, BFOA, and Generative AI-based Predictive Models. In Proceedings of the 5th International Conference on Pervasive Computing and Social Networking (ICPCSN), Salem, India, 14–16 May 2025; IEEE: Piscataway, NJ, USA, 2025; pp. 752–758.
26. Luo, Y.; Song, J.; Zhao, K.; Liu, Y. UAV-cooperative penetration dynamic-tracking interceptor method based on DDPG. Appl. Sci. 2022, 12, 1618.
27. Liu, Y.; Zhang, X.; Zhang, Y.; Guan, X. Collision free 4D path planning for multiple UAVs based on spatial refined voting mechanism and PSO approach. Chin. J. Aeronaut. 2019, 32, 1504–1519.
28. Lu, G.; Cai, Y.; Chen, N.; Kong, F.; Ren, Y.; Zhang, F. Trajectory generation and tracking control for aggressive tail-sitter flights. Int. J. Robot. Res. 2024, 43, 241–280.
29. Liu, T.; He, X.; Niu, Y.; Li, J.; Li, Z. TICOP: Time-critical coordinated planning for fixed-wing UAVs in unknown unstructured environments. IEEE Robot. Autom. Lett. 2024, 9, 9629–9636.
30. Hauser, J.; Hindman, R. Aggressive flight maneuvers. In Proceedings of the 36th IEEE Conference Decision Control (ICDC), San Diego, CA, USA, 12 December 1997; pp. 4186–4191.
31. Rudolph, G. Convergence Properties of Evolutionary Algorithms; Verlag Dr. Kovač: Hamburg, Germany, 1997.
32. Amorim, R.C.; Makarenkov, V. On k-means iterations and Gaussian clusters. Neurocomputing 2023, 553, 126547.
33. Hart, P.E.; Nilsson, N.J.; Raphael, B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107.
34. Elango, P.; Mohan, R. Trajectory optimisation of six degree of freedom aircraft using differential flatness. Aeronaut. J. 2018, 122, 1788–1810.
35. Liu, T.; Li, J.; Zou, F.; Wang, B.; Niu, Y. Differential flatness-based trajectory planning and tracking for fixed-wing aircraft in clustered environments. In Proceedings of the 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 3627–3631.
36. Vinayak, A.; Zakaria, M.A.; Baarath, K.; Majeed, A.P.P.A. A novel Bézier curve control point search algorithm for autonomous navigation using N-order polynomial search with boundary conditions. In Proceedings of the IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 19–22 September 2021; pp. 3884–3889.
37. Wang, Y.; Li, H.; Shen, Q. A hierarchical multi-task and multi-agent assignment approach: Learning DQN strategy from execution. IEEE Trans. Autom. Sci. Eng. 2025, 22, 14712–14722.
38. Sun, G.; Xie, W.; Niyato, D.; Du, H.; Kang, J.; Wu, J.; Sun, S.; Zhang, P. Generative AI for Advanced UAV Networking. IEEE Netw. 2025, 39, 244–253.

Figure 1. Coordinate frames for a fixed-wing UAV.

Figure 2. Overall solution framework for synchronized coordinated assignment and planning.

Figure 3. Framework of the online trajectory planning and control architecture.

Figure 4. Evolution of individual UAV flight time during the time-balanced clustering optimization. The average flight time decreased from 218 s to 184 s, approximately 15.6%.

Figure 5. Comparison of target allocation and flight routes: (a) Initial clusters. The targets assigned to each UAV are grouped spatially, with minimal differences between clusters. (b) Final optimized time-balanced clusters and routes. Algorithm 1 balances mission durations despite the heterogeneity of the UAVs.

Figure 6. Comparison of runtime of Algorithm 1: (a) Comparison of runtime and constraint-violation ratio for different $(M, N)$. Runtime satisfies $O(\max(M, M^{2}/N))$ and constraint-violation ratios are less than 20%. (b) Comparison of runtime between Algorithm 1 and other MTA methods. Algorithm 1 achieves a higher computational efficiency.

Figure 7. Global and local trajectory planning for UAVs. The differential-flatness framework enables the UAV to circumvent the obstacle and satisfy safety constraints: (a) Overall mission view. (b) Magnified view of obstacle avoidance.

Figure 8. Kinematic constraints for UAVs. Kinematic constraints can be satisfied under the proposed constraint enforcement framework: (a) Velocity profiles of heterogeneous UAVs with and without kinematic constraints. The speeds are constrained within the allowable ranges of the heterogeneous UAVs. (b) Overload profile of UAV1 with and without kinematic constraints. The overloads are constrained within the allowable ranges of the heterogeneous UAVs.

Figure 9. Evolution of consistency variables $\varepsilon_{i}$ and absolute time reference $\varepsilon$. All UAVs can strictly adhere to the absolute time reference for real-time planning and $\max(\varepsilon_{i}) - \min(\varepsilon_{i}) < 1\%$.

Table 1. Symbol and Meaning.

Symbol	Meaning
$N$	Number of UAVs
$M$	Number of reconnaissance targets
$Q$	Number of no-fly zones
$P_{target}$	Set of target positions
$A$	Target index set
$wPt_{i}$	Ordered target position set of UAV $i$
$A_{i}$	Ordered target index set of UAV $i$
$P_{obs}$	Set of no-fly zone positions
$K_{i}$	Number of targets of UAV $i$
$O_{A, i}$	Center of the target cluster of UAV $i$
$V_{cruise}$	Set of optimal cruising speeds for UAVs
$P_{i}$	Post-optimization trajectory of UAV $i$
$p_{i}$	Current position of UAV $i$
$Lo_{i}$	Local replanning region of UAV $i$
$wPt_{i, local}$	Ordered local target set in the replanning region of UAV $i$
$N_{l, i}$	Number of waypoints in the replanning region of UAV $i$
$N_{p}$	Number of Bézier curve segments

Table 3. Main parameters of the UAVs and environment.

Simulation Parameter	Value
Number of UAVs $N$	4
Number of targets $M$	40
Number of no-fly zones $Q$	8
Optimal cruise speed $V_{cruise}$	$\{80, 60, 50, 30\}$ m/s
Bézier curve degree $N_{p}$	8
Safe radius of UAV $R_{UAV}$	10 m
Radius of static no-fly zones $R_{obs, q}$	$[10, 100]$ m
Smooth parameters $\phi_{i, smooth}$ and $\theta_{i, smooth}$	$5^{\circ}$
A* grid length	5 m
Corridor width	30 m
Detection range	200 m
Forward field of view	$\pm 30^{\circ}$
Dual feasibility tolerance in MOSEK	$10^{-4}$
Primal feasibility tolerance in MOSEK	$10^{-4}$
Infeasibility tolerance in MOSEK	$10^{-4}$
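
As a rough reading of the cruise speeds in Table 3, the short Python sketch below computes the share of the total route length that each UAV would fly if all four UAVs finished at the same time. It rests on a strong simplifying assumption that is not taken from the paper: every UAV flies at its constant optimal cruise speed, so that $L_{i} = v_{i} T$ and $L_{i} / \sum_{j} L_{j} = v_{i} / \sum_{j} v_{j}$. Turns, obstacle detours, and any speed adjustments used for synchronization are ignored, and the function name `balanced_length_shares` is hypothetical.

```python
def balanced_length_shares(cruise_speeds):
    """Fraction of the total route length per UAV under equal flight time,
    assuming each UAV flies at its constant cruise speed (L_i = v_i * T)."""
    total = sum(cruise_speeds)
    return [v / total for v in cruise_speeds]


V_CRUISE = [80.0, 60.0, 50.0, 30.0]  # m/s, optimal cruise speeds from Table 3

shares = balanced_length_shares(V_CRUISE)
for i, (v, s) in enumerate(zip(V_CRUISE, shares), start=1):
    print(f"UAV{i}: {v:.0f} m/s -> {s:.1%} of the total route length")
```

Under this idealization the fastest UAV would cover 80/30, or roughly 2.7 times, the distance of the slowest one, which is why a purely spatial partition of the targets is generally not time-balanced for a heterogeneous fleet.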

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, X.; Zhang, J.; Ma, Z.; Cao, C.; Liu, H. Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs. Aerospace 2026, 13, 69. https://doi.org/10.3390/aerospace13010069
