Next Article in Journal
Numerical Investigation of Frequency Acceleration Effect on Aerodynamic Characteristics of 2-DoF Flapping Wing in Hovering
Previous Article in Journal
Performance Evaluation of UAV-Coordinated Multi-Scenario Disaster Relief Operations Based on ResNet and Attention Mechanism
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs

1
National Key Laboratory of Land and Air Based Information Perception and Control, Xi’an 710065, China
2
Xi’an Modern Control Technology Research Institute, Xi’an 710065, China
3
Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
4
Zhongguancun Laboratory, Beijing 100190, China
*
Authors to whom correspondence should be addressed.
Aerospace 2026, 13(1), 69; https://doi.org/10.3390/aerospace13010069
Submission received: 5 September 2025 / Revised: 23 December 2025 / Accepted: 6 January 2026 / Published: 8 January 2026

Abstract

This paper addresses the Multi-Target Reconnaissance (MTR) problem for heterogeneous Fixed-Wing Unmanned Aerial Vehicles (FW-UAVs), focusing on synchronized and time-optimal mission execution under stringent constraints. A two-stage coordinated assignment and planning framework is proposed. First, a time-balanced clustering algorithm is designed to minimize the overall mission duration while balancing individual UAV workloads by jointly employing a target reallocation strategy and an improved Genetic Algorithm (GA). Subsequently, an online trajectory planning method based on differential flatness is developed, integrating a robust replanning and flight-time synchronization strategy to ensure coordinated execution. Simulation results unequivocally demonstrate that the proposed approach enhances time optimality and temporal coordination in complex scenarios.

1. Introduction

Multi-Target Reconnaissance (MTR) is a critical mission in both military and civilian domains, with applications ranging from battlefield target engagement to disaster-site data acquisition [1,2]. Generally, MTR efforts relied on single Unmanned Aerial Vehicle (UAV) performing sequential reconnaissance of multiple targets [3,4]. However, the single UAV proves inefficient, particularly in time-sensitive scenarios. To enhance task efficiency, the deployment of Fixed-Wing Unmanned Aerial Vehicles (FW-UAVs) has gained increasing traction. FW-UAVs offer distinct advantages over single UAVs, due to their superior cooperative sensing and reconnaissance capabilities [5,6,7].
Nevertheless, in practical emergency MTR scenarios, FW-UAVs are typically heterogeneous, with different cruise speeds [8]. Furthermore, to maximize the operational capacity of each UAV while ensuring mission continuity, it is imperative that all subtasks reach saturation and are completed synchronously. MTR inherently involves multiple interrelated processes, including Multi-Target Assignment (MTA), path planning, and robust control. Consequently, two critical challenges emerge during the above processes:
  • Considering the varying optimal cruising speeds of heterogeneous UAVs, how can a set of targets be optimally assigned to ensure each FW-UAV completes its MTR within an equal and minimized mission duration?
  • Subsequent to MTA, how can kinematically feasible trajectories be generated for each FW-UAV under dynamic environmental constraints, while simultaneously guaranteeing temporal synchronization among FW-UAVs throughout the whole process?
MTA constitutes the initial phase in tackling the MTR problem. By judiciously distributing targets among specific FW-UAVs, the inherently complex multi-UAV planning task can be decomposed into simpler, independent single-UAV problems. A diverse array of algorithms has been proposed for MTA, broadly categorized into traditional heuristic methods and metaheuristic optimization approaches. Traditional methods include Genetic Algorithms (GAs) [9], Particle Swarm Optimization (PSO) [10], Ant Colony Optimization (ACO) [11], Self-Organizing Maps (SOMs), K-means, and so on [12,13]. For example, a decentralized GA-based strategy for cooperative search, where each agent independently optimizes its task, was introduced in [14]. Choi proposed a two-stage GA framework enabling individual agents to optimize local task sequences in [15]. Furthermore, a modified PSO algorithm was presented in [16] to address uncertain time constraints in rescue missions. To surmount the limitations in modeling complexity in conventional methods, Li et al. developed a deep reinforcement learning model specifically tailored for simultaneous MTA in [17]. However, a limitation of most aforementioned studies is their oversight of the inherent heterogeneity among UAVs.
The aforementioned approaches commonly fail to fully consider the diverse operational characteristics of heterogeneous UAVs. To address this issue, several algorithms have been explored. For example, the K-means method, employed in [18], tends to produce clusters for multi-robot MTA. Similarly, the SOMs method, initially developed for high-dimensional data visualization [19], was adapted in [20] for MTA and path planning through a neural mapping approach. However, these methods primarily focus on basic optimization goals and remain inadequate for complex missions that require precise temporal coordination and optimal performance under UAV heterogeneity.
Following MTA, it becomes imperative to synchronously plan kinematically feasible and safe trajectories for each UAV. A diverse range of path planning algorithms has been developed, broadly categorized into geometry-based, sampling-based, optimization-based, and intelligent approaches [21]. Geometry-based approaches, including Dijkstra, A, D, and D* Lite, are widely employed for path planning in static environments. The A* algorithm was successfully applied in [22] to address path planning under wind disturbances. Optimization-based methods, such as GA, have been extensively utilized to fulfill specific mission requirements [23,24]. Ref. [25] proposed a Generative AI (GAN) algorithm combined with traditional RRT and BFOA to predict UAV paths in a dynamic environment. Nevertheless, prior works predominantly concentrate on single-UAV path planning and neglect crucial aspects of time synchronization. Furthermore, most existing path planning approaches, though effective for single-UAV missions or static environments, remain inadequate for coordinated multi-UAV operations that demand synchronization, compliance with kinematic constraints, and obstacle avoidance in dynamic and uncertain settings.
To facilitate synchronized execution, various Trajectory Planning (TP) methods have been developed. A representative example is the cooperative penetration strategy proposed by Luo et al. [26], which utilizes a deep deterministic policy gradient (DDPG) algorithm to enable multiple UAVs to penetrate defenses. Liu et al. introduced a spatiotemporal-refined voting mechanism for PSO to address the cooperative path planning problem in [27]. Their objective function accounted for obstacles, threat regions, and arrival time constraints. However, the iterative nature of such methods poses significant challenges for large-scale real-time deployment, particularly when maintaining precise temporal coordination across heterogeneous UAVs.
Alternatively, Differential Flatness (DF) theory [28] has been applied to reduce the state dimensionality of TP problems, while consensus-based approaches adjust replanning intervals to achieve synchronization. For example, ref. [29] introduced consensus variables to align replanning intervals across UAVs, but without explicit mechanisms to match them to the global mission timeline. Although DF offers computational efficiency and consensus algorithms enhance coordination, most DF-based methods assume fully known environments. Many consensus approaches neglect heterogeneous kinematic constraints and the real-time requirement for dynamic replanning. Developing a framework that integrates the above advantages with online replanning under dynamic constraints and environmental uncertainty remains an important open challenge.
In this paper, we address the MTR problem for heterogeneous FW-UAVs by proposing a synchronized, coordinated, and online framework that integrates MTA and TP for temporally coordinated reconnaissance missions. A global optimization problem is formulated to minimize the overall mission duration and control effort while balancing individual UAV flight times, and is decomposed into two coupled sub-problems. The first sub-problem focuses on time-balanced task allocation, where K-means-based spatial initialization is combined with an iterative target reallocation procedure to equalize flight times, followed by an improved GA to optimize the intra-cluster visiting sequence. The second sub-problem addresses flight-time-consistent trajectory planning by leveraging the DF property of fixed-wing UAV dynamics, enabling the real-time generation of kinematically feasible trajectories with online replanning for collision avoidance, explicit flight-time synchronization, and robustness to dynamic and unforeseen environmental changes.
Compared with existing studies, the main contributions of this work are summarized as follows.
  • A new practical time-balanced clustering algorithm is proposed for heterogeneous FW-UAVs. This method minimizes the overall mission duration and balances individual UAV flight durations by strategically reallocating targets and optimizing the intra-cluster visiting sequence. By decoupling temporal coordination from route optimization, the proposed approach achieves significantly improved computational efficiency for time-coordinated MTA problems, which is further validated through theoretical time-complexity analysis and extensive numerical simulations.
  • A practical replanning flight-time synchronization mechanism is proposed, which adaptively adjusts the replanning duration for each UAV. Inspired by consensus-based coordination principles, this mechanism enables the synchronization of flight times, and a rigorous convergence proof is provided to guarantee persistent synchronization.
  • An online trajectory planning algorithm is developed using the DF property of FW-UAVs. This planner operates under stringent kinematic constraints, ensures collision avoidance in unknown and dynamic environments, and rigorously respects terminal time constraints. The applicability of DF to fixed-wing UAVs is explicitly derived and discussed, extending its use beyond conventional rotary-wing platforms.
The remainder of this paper is organized as follows. Section 2 outlines the preliminaries, including FW-UAV dynamics, graph theory, and the problem formulation. Section 3 describes the time-balanced clustering algorithm, and Section 4 details the DF-based trajectory planner with the flight-time synchronization mechanism. Section 5 presents the simulation results, and Section 6 concludes the paper.

2. Preliminaries

2.1. FW-UAV Model

In this paper, the term UAVs specifically refers to FW-UAVs. Let p i e R 3 × 1 represent the position vector of UAV i in the Earth-fixed coordinate frame, and let Θ i = [ ϕ i , θ i , ψ i ] T R 3 × 1 denote its Euler angles. The rotation matrix R b , i e S O 3 , mapping vectors from the body-fixed frame to the Earth-fixed frame, is defined accordingly. R a , i b denotes the rotation matrix from the air-relative frame to the body-fixed frame. The coordinate frames are illustrated in Figure 1.
α i is the angle of attack and β i is the sideslip angle. The dynamics of UAV i are described by the following  [29,30]:
p ˙ i e = v a , i e , m i v ˙ a , i e = R b , i e R a , i b T h i + D i + L i + G i , R ˙ b , i e = R b , i e Ω i × , Ω i = Ω c , i ,
where m i is the mass of UAV i, and G i denotes the gravitational force vector, with G i = m i g e 2 . e i is the unit vector in the i-direction in Earth-fixed coordinate frame, and g denotes the gravitational constant. Ω i = ω x , i , ω y , i , ω z , i T , with ω μ , i , μ = x , y , z representing the angular velocities, and Ω i × denotes the skew-symmetric matrix of Ω i  as
Ω i × = 0 ω z , i ω y , i ω z , i 0 ω x , i ω y , i ω x , i 0 .
L i is the lift force of UAV i along the y a air axis and D i is the drag force along the negative x a air axis.
Assumption 1. 
1. 
The wind speed is below 3 m/s, so sideslip β i can be neglected;
2. 
The aerodynamic forces generated by control-surface deflections δ i are negligible due to the imposed limits on control-surface ranges.
Under Assumption 1, the aerodynamic forces can be expressed as
L i 1 2 ρ V a , i 2 S i C L , i α i a 2 , D i 1 2 ρ V a , i 2 S i C D , i α i a 1 ,
where ρ is the air density, V a , i = | | v a , i | | is the airspeed of UAV i, and S i is the wing reference area. C L , i and C D , i are the lift and drag aerodynamic coefficients, respectively. Let Ω c , i = ω c , x , i , ω c , y , i , ω c , z , i T be the commanded angular velocities and T h i = T e i cos α i a 1 be the commanded thrust along the x a air axis, where the attack angle is small. a i is the unit vector in the i-direction in the air-relative coordinate frame. In this paper, the control input for UAV i is denoted by U i :
U i = ω c , x , i , ω c , y , i , ω c , z , i , T e i .
To simplify notation, the position vector p i e is subsequently written as p i . The main symbols used in this article are shown in Table 1.

2.2. Graph Theory

Consider a formation of N FW-UAVs, indexed from i = 1 to N. The interactions among them are modeled by a directed graph G = ( V , E , W ) , where V = { v 1 , , v N } is the set of nodes, and E V × V is the set of directed edges. W = [ w i , j ] R N × N indicates the weighted adjacency matrix with w i , j > 0 , if and only if ( v i , v j ) E . For node v i , the weighted in-degree is defined as μ i = j = 1 N w i , j , where N i = { j | ( v i , v j ) E } is the set of neighbors of node v i . Define the Laplacian matrix as L = D W , where D = d i a g { μ i } R N × N . A directed path from node v i to v j is a sequence of directed edges connecting them. If a node has a directed path to every other node in the graph through a subset of edges G, G is said to have a spanning tree and the node is called the root.

2.3. Problem Formulation

In MTR, a swarm of N N heterogeneous UAVs is considered, where each UAV i has a distinct optimal cruising speed V c r u i s e , i . Let the target set be A = j | j = 1 , , M , with their corresponding position set being P t a r g e t = p t a r g e t , j R 3 × 1 | j A . For each UAV i, a subset of assigned targets is denoted by A i = { k | k A } , with its associated ordered position set represented by w P t i P t a r g e t . Both A i and w P t i are treated as ordered sequences. Simultaneously, there are Q no-fly zones randomly distributed over the mission area M , with their positions represented by the set P o b s . Each UAV is required to visit all assigned targets in its respective set A i and return to its initial departure point.
Given the randomly distributed target set A and a set of heterogeneous FW-UAVs with differing optimal cruising speeds V c r u i s e , i , it is necessary to (1) assign each UAV i an ordered target subset A i such that all UAVs complete their missions in equal and minimized mission durations T; and (2) in response to environmental uncertainty and limited sensing range, plan a trajectory for each UAV online to minimize control effort, while ensuring compliance with the same flight time T i = T , obstacle avoidance Q, and kinematic constraints.
This problem is formally modeled as the following constrained optimization problem:
min A i , w P t i max T i + min P i i = 1 N 0 T i ( p i ( 3 ) ( t ) ) 2 d t , s . t . S 1 1 : i = 1 N A i = A , i = 1 N A i = , S 1 2 : max ( T i ) min ( T j ) < ε 1 , S 2 1 : w P t i P i , S 2 2 : T i T ε T , S 2 3 : p i ( t ) p o b s , q > R o b s , q + R U A V , q Q , S 2 4 : p i ( t ) p j ( t ) > 2 R U A V , i , j N , j i , S 2 5 : V a , i ( t ) [ V a , i , min , V a , i , max ] , S 2 6 : n i , μ n i , μ _ m a x , n i , z v i 2 / ( g R t u r n ) , S 2 7 : ϕ i ( t ) [ ϕ i , max , ϕ i , max ] , θ i ( t ) [ θ i , max , θ i , max ] .
where μ { x , y , z } . S 1 represents the synchronized coordination constraints among UAVs, and S 2 includes flight time, obstacle avoidance, and dynamic feasibility constraints for individual UAV i.
For constraints in S 2 , p i t P i represents the position of UAV i at time t. In the individual trajectory planning process, each UAV must traverse its assigned targets in sequence order, adhering to S 2 1 . To satisfy S 2 2 , the actual flight time T i must converge to the desired mission duration T i = 1 N T i , m i n , T i , m a x , where T i , m i n , T i , m a x represent the permitted bounded durations for UAV i. When flying in unknown environments, such as battlefield scenarios, UAVs must dynamically avoid time-varying obstacles and maintain safe distances from other UAVs in real time. Specifically, S 2 3 ensures that UAVs maintain a safe distance from obstacles, where p o b s , q , q Q is the center position of the q-th obstacle, R o b s , q is the required safety radius, and R U A V is the safe radius of the UAV. Concurrently, S 2 4 ensures that UAVs do not conflict with one another during the simultaneous execution of their tasks. The velocity constraint S 2 5 guarantees that UAVs operate within their feasible speed range. The UAV’s overloads in all directions, denoted by n i , μ = a i , μ t / m i g , μ x , y , z , are constrained by structural strength and turning radius limitations, specifically S 2 6 , where n i , μ _ m a x denotes the maximum allowable overload in direction μ , and R t u r n is the minimum turning radius. Furthermore, each UAV must satisfy specific attitude constraints S 2 7 , where ϕ i and θ i represent the roll angle and pitch angle of UAV i, respectively. To constrain the lateral acceleration, ϕ i needs to satisfy | ϕ i | 45 .
To address this constrained optimization problem, a divide-and-conquer strategy is adopted. The original problem is decomposed into two sub-problems: a time-balanced clustering-based MTA problem and a DF-based TP problem. The overall solution framework is illustrated in Figure 2.

3. Time-Balanced Clustering Algorithm for MTA

This section presents a time-balanced clustering algorithm designed for heterogeneous UAVs. This algorithm addresses the MTA problem under synchronized task execution requirements, formulated as follows:
Sub-problem 1:
min A i , w P t i max T i , s . t . S 1 1 : i = 1 N A i = A , i = 1 N A i = , S 1 2 : max T i min T j < ε 1 .
This sub-problem aims to minimize the maximum flight time among all UAVs by optimizing the assigned target sets A i and their visiting sequences w p t i , subject to two constraints. S 1 1 ensures that the target sets assigned to different UAVs are mutually exclusive and that all targets are allocated. S 1 2 enforces approximate equality of flight times across all UAVs.
To expedite the initial target allocation process, a standard K-means clustering algorithm is employed. Based on the spatial distribution of the target positions p t a r g e t P t a r g e t and the number N of UAVs, an initial division of targets into N clusters is performed. Then, an improved GA is utilized to optimize the visiting order among the targets within that cluster, forming an ordered set A i and its corresponding ordered position set w P t i . For notational simplicity, define w P t = w P t 1 , , w P t N .
In practical scenarios, the varying number of targets M and the random distribution of targets P t a r g e t , coupled with the differences in UAV optimal cruising speeds V c r u i s e , i V c r u i s e , result in significant discrepancies in individual UAV flight time T i without coordination. T i can be approximated as
T i = γ t u r n , i j A i | | p t a r g e t , j + 1 p t a r g e t , j | | / V c r u i s e , i .
where γ t u r n , i is a scaling coefficient derived from experimental data, which corrects flight-time estimation bias according to each UAV’s minimum turning radius.

3.1. Cluster-Based Target Reallocation Strategy

To address the aforementioned limitations, a target reallocation strategy based on dynamic cluster adjustments is proposed in Algorithm 1.
Algorithm 1 Time-Balanced Clustering Algorithm for MTA
Input: Number of UAVs N, optimal cruising speeds V c r u i s e , and target positions P t a r g e t .
Output: Adjusted target clusters A i , waypoint sets w P t i , and minimized mission duration T.
          1. Initialization
          w P t K _ M E A N S P t a r g e t , N
          For  i = 1 , , N   do
               w P t i G A w P t i
          End For
          k 2 = 1
          2. Iterative Refinement
          While  max T j min T i > t h r e s h o l d and k 2 < L 2  do
               k 1 = 1
              2.1. Calculate the center position O A , i and flight time T i for each cluster:
               O A , i = i = 1 K i w P t i / K i
               T i γ t u r n , i j A i | | p t a r g e t , j + 1 p t a r g e t , j | | / V c r u i s e , i , i = 1 , , N
               T T I M E _ M E A N T 1 , , T N , N
              2.2. Sort. Sort UAVs in descending order of flight time T i .
              2.3. Iterative Refinement
              For each UAV i = 1 , , N 1 in the sorted order do
                  While  | | T i T | | > t h r e s h o l d and k 1 < L 1  do
                      If  T i < T  do
                           p t m p , j F I N D _ P O I N T _ j w P t i , O A , i , P t a r g e t
                           w P t j R E M O V E _ P O I N T w P t j , p t m p
                           w P t i A D D _ P O I N T w P t i , p t m p
                      Else
                           p t m p F I N D _ P O I N T _ i w P t i , O A , i
                           w P t i R E M O V E _ P O I N T w P t i , p t m p
                           j F I N D _ C L U S T E R i , p t m p , O A
                           w P t j A D D _ P O I N T w P t j , p t m p
                          calculate T i , i = 1 , , N and update T
                           k 1 k 1 + 1
                      End If
                 End While
              End For
              2.4. Optimize inter-order
              For  i = 1 , , N  do
                   w P t i G A w P t i
              End For
               k 2 k 2 + 1
              pruning
          End While
Algorithm 1 is executed to perform adaptive adjustments of the clusters. In Step 1, the initial target clusters A i are obtained. Step 2 is the core part of this algorithm, where the visiting order within each set A i and w p t i is optimized.
In Step 2.1, the mission duration T = i = 1 N T i / N is calculated. In Step 2.2, to ensure adjustment of target clusters in each iteration, the cluster adjustment order is determined. In Step 2.3, to minimize the impact of target point selection on the existing target clusters, p t m p is defined as the target point that is closest to the center point O A , i of the set A i , whose corresponding cluster is j , j i , i.e.,:
FIND _ POINT _ j w P t , O A , i , P t a r g e t : p t m p = arg min p w P t j , j > i | | p O A , i | | .
Conversely, FIND _ POINT _ i w P t i , O A , i is defined as finding the point p t m p within set A i that is furthest from O A , i :
FIND _ POINT _ i w P t i , O A , i : p t m p = arg max p w P t i | | p O A , i | | .
Furthermore, FIND _ CLUSTER p t m p , O A is defined as finding the unadjusted cluster j such that the distance between its center point O A , j j > i and p t m p is minimized.
When p t m p is added to the ordered set w P t i , the point p m i d , j = arg min p w P t i | | p p t m p | | closest to p t m p and its corresponding subsequent point p m i d , j + 1 are identified within w P t i in ADD _ POINT w P t i , p t m p . Subsequently, p t m p is inserted between these two points to ensure that target cluster A i and w P t i remain ordered.
Step 2.4 ensures that the flight time T i for each target cluster is minimized. This algorithm incorporates a 2-opt local search operator, which eliminates path crossings and reduces redundant segments within the generated routes. The 2-opt local search operator is applied to the worst 50% of individuals. The core of the improved GA comprises three main components: a selection operator, a crossover operator, and a mutation operator. In the selection phase, a portion of highly fit individuals is chosen using a roulette wheel strategy; another portion is randomly generated to maintain diversity; and the remaining individuals are constructed based on the visiting order derived from Algorithm 1. In the crossover operator, parent individuals are crossed using a random slicing method, which inherently avoids conflict detection. In the mutation operator, a randomly selected segment of an offspring’s sequence is reordered to achieve rapid exploration of the solution space. The main parameters of Algorithm 1 are constructed in Table 2. Threshold τ time depends on the spatial distribution of UAVs and the scale of the environment. A practical selection strategy is as follows: assign an initially small value (typically below 50); then, execute Algorithm 1, and check whether the algorithm terminates because the maximum iteration limits L 1 or L 2 are reached. If this occurs, τ time should be further reduced.
Algorithm 1 is heuristic in nature but admits a well-defined convergence behavior. Specifically, the MTA in Algorithm 1 has a finite search space since both the number of UAVs N and the number of targets M are finite. The algorithm is designed to accept a new assignment state only if it yields a strict reduction in the mission-duration objective while satisfying the feasibility constraints. Step 2.3 preserves constraint satisfaction and monotonically reduces flight-time imbalance, while Step 2.4 ensures a non-increasing objective value for the visiting sequence optimization [31], consistent with established convergence properties of evolutionary algorithms. To prevent cycling in the finite state space, a pruning mechanism is introduced to discard previously visited states. With explicit termination conditions, Algorithm 1 is therefore guaranteed to converge to a locally optimal solution within a finite number of iterations, which provides a practical improvement guarantee for the MTA problem.
Table 2. Main parameters of Algorithm 1.
Table 2. Main parameters of Algorithm 1.
Simulation ParameterValue
Iteration in K_MEANS 25 × min ( M , 10 × N ) [32]
Threshold τ time
L 2 N
L 1 10 × L 2
Max Generations in GA35 × ( 10 k 2 )
Population Size80
Roulette Wheel Probability in Selection Operator 0.81
Random Probability in Selection Operator 0.09
Unchanged Probability in Selection Operator 0.1
Crossover Operator Rate 0.65
Mutation Rate 0.25

3.2. Time Complexity Analysis

Assuming that the average number of targets in each cluster is M / N , the overall time complexity of Step 2.3 is O ( L 1 ( N 1 ) M / N ) , where L 1 is the number of iterations. Considering that Step 2 needs to iterate L 2 times, and the GA’s sub-generation iterates L 3 times, the total time complexity of Algorithm 1 is
O L 2 L 1 N 1 M / N + N Z L 3 ( M / N ) 2 O L 1 L 2 M + L 2 L 3 Z M 2 / N O m a x M , M 2 N ,
where Z is the number of sub-generations. In contrast, if this problem were solved as a MTSP, the time complexity would be O ( M ! ) .
Algorithm 1 achieves flight time T i balance among heterogeneous UAVs, and T is selected as the mission duration for all subsequent UAVs in the subsequent planning phases.

4. Flight-Time-Consistent Algorithm Based on DF for TP

In actual flight, various constraints, including kinematic limitations and environmental factors, must be strictly satisfied. In this section, these challenges are addressed through a flight-time-consistent TP algorithm by generating executable trajectories P i that ensure satisfy mission duration T for all UAVs i N .
Sub-problem 2: This sub-problem aims to minimize the integral of ( p i 3 t ) 2 for all UAVs, subject to waypoint visit constraints, strict mission duration consistency T, obstacle avoidance, inter-UAV collision avoidance, velocity limits, overload constraints, and attitude limitations. Note that if S 2 7 in (5) was transformed into a hard constraint, the closed-form solution would be prohibitively complex. Therefore, for practical implementation, these constraints are incorporated as soft constraints using penalty functions J e , i within the optimization framework. Mathematically, the sub-problem min P i i = 1 N t = 0 T i ( p i 3 t ) 2 d t subject to S 2 in (5) is reformulated as
min i = 1 N μ x i , y i , z i J μ , i + J e , i = min i = 1 N μ x i , y i , z i t = 0 T i ( p i , μ 3 t ) 2 d t + t = 0 T i J e , i 2 d t
s.t.
S 2 1 : w P t i P i , S 2 2 : | | T i T | | ε T , S 2 3 : | | p i t p o b s , q | | > R o b s , q + R U A V , q Q , S 2 4 : | | p i ( t ) p j ( t ) | | > 2 R U A V , i , j N , j i , S 2 5 : V a , i t V a , i , m i n , V a , i , m a x , S 2 6 : | | n i , μ | | n i , μ _ m a x , | | n i , z | | v i 2 / g R t u r n .
The algorithm is structured into two main steps:
Step 1: Initial Trajectory Generation: Initially, neglecting the obstacle avoidance constraints S 2 3 S 2 4 , an initial reference trajectory P i 0 is generated for each UAV. This trajectory passes through all assigned waypoints and adheres to the desired total mission duration, thereby providing a preliminary solution to sub-problem 2 by leveraging the properties of flat outputs. However, in real-world flight operations, constraints S 2 3 S 2 4 cannot be disregarded. Furthermore, due to unforeseen obstacles and strict motion constraints, a UAV’s actual position deviate from its desired reference, leading to uneven remaining flight times among UAVs, which in turn violates constraints S 2 2 S 2 5 S 2 6 .
Step 2: Flight-Time-Consistent Replanning: A flight-time-consistent replanning feedback framework is proposed based on the initial reference trajectory P i 0 . This framework can continuously perform spatiotemporal replanning over a replanning interval L o i through remaining flight time feedback to satisfy all constraints. By adaptively adjusting the local flight time T d , i of L o i , all UAVs are guided to fly in accordance with T, further satisfying constraint S 2 2 . Simultaneously, within L o i , a safe corridor satisfying environmental constraints S 2 3 S 2 4 is rapidly generated. Subsequently, optimal local trajectories satisfying S 2 5 S 2 6 are obtained within this safe corridor through DF optimization.

4.1. Flight-Time-Consistent Replanning Strategy

Given the limited sensing range of UAVs and the imperative to maintain precise flight-time consistency, a dynamic replanning strategy is proposed. The starting point of the replanning interval L o i is the UAV’s current position p i t , and the end point is the next waypoint position p i , n e x t P o i n t w P t i .
Assuming that all waypoints w P t i are reachable, and their surrounding areas are guaranteed to be free of obstacles, i.e., | | w P t i m P o b s q | | > R o b s , q + R U A V , m , q . To satisfy constraint S 2 2 , define the desired local flight time for the UAV i as T d , i and the replanning time interval as τ p l a n n i n g . To ensure flight-time consistency among all UAVs, a variable ε i for UAV i is defined as follows:
ε i k = t i , n e x t P o i n t T l , i k τ c o n t r o l T ,
where τ c o n t r o l is the single-step control time interval, T l , i is the actual local flight time of current replanning interval L o i , t i , n e x t P o i n t is the desired reference time to reach the next waypoint obtained from P i 0 , and ε i 0 , 1 . Due to physical motion constraints, T l , i T d , i . Define the error variable between T d , i k and T l , i k as δ i k = T d , i k T l , i k , which characterizes flight process for the UAV i. Based on ε i , T d , i is designed as
T d , i k = Γ L o i V c r u i s e , i a i , j w i , j ε i k ε j k + b i , N + 1 w i , N + 1 ε i k ε k + 1 , ε = t / T ,
where ε is a global absolute time reference, ε 0 , 1 , a i , j , b i , j are positive coordination coefficients, and w i , N + 1 is the adjacency value from UAV i to ε in the communication graph, here w i , N + 1 = 1 , i . Γ · is the estimated path length for L o i . After specifying the L o i , the A* algorithm is adopted to generate a sequence of sub-waypoints w P t i , l o c a l within L o i , then
Γ w P t i , l o c a l = q = 1 N l , i 1 | | w P t i , l o c a l q w P t i , l o c a l q + 1 | | ,
where N l , i is the number of waypoints in L o i .
Based on the definition above, a flight-time-consistent replanning strategy is proposed, as detailed in Algorithm 2. To handle delays arising from acceleration limits or obstacle avoidance, as well as turn restrictions induced by turning constraints, a verification module check() is incorporated into Algorithm 2, which systematically examines the satisfaction of critical dynamic constraints. If either the velocity constraint or the S 2 7 constraint is violated, the framework automatically triggers Algorithm 1 to reallocate the targets and restore feasibility. Each UAV i obtains ε j of neighboring UAVs and ε , then continuously adjusts its desired local flight time T d , i for L o i to ensure that all UAVs maintain temporal consistency. Specifically, consider a scenario where UAV i encounters an unforeseen obstacle at the k-th planning step; this encounter will cause an increase in Γ w P t i , l o c a l , leading to an increase in T d , i k , which will further cause ε i k to decrease, indicating that the UAV’s time progression is lagging. When updating at step k + 1 , the consistency algorithm will adjust T d , i k + 1 to decrease, implicitly instructing UAV i to speed up during the next replanning step. Concurrently, neighboring UAVs will adaptively increase their T d , j k + 1 to accommodate the lagging UAV i, achieving flight-time consistency. Thus, constraint S 2 2 is satisfied.
Algorithm 2 Flight-Time-Consistent Algorithm
Input: current time t, environmental map M , communication graph matrix L, and so on.
          1. Initialization
          To satisfy constraints S 2 1 S 2 5 S 2 6 when neglecting the obstacle avoidance
          constraints S 2 3 S 2 4 .
           ε = ε i = 0 , i = 1 , 2 , , N , k = 1
          P i 0 , T i G L O B A L _ P L A N N I N G M , w P t i , T i
          2. Replanning strategy
          To satisfy all constraints S 2 1 S 2 2 S 2 3 S 2 4 S 2 5 S 2 6 .
          While  p i w P t i e n d   do
               w P t i , l o c a l A * M , p i , p i , n e x t P o i n t
              Calculate Γ w P t i , l o c a l
              Calculate T d , i k
              P i k , T l , i k L O C A L _ P L A N N I N G M , w P t i , l o c a l , T d , i k
              While  t < t k + τ p l a n n i n g  do
                  U i C O N T R O L P i k X i , M U A V S Y S T E M U i
              End While
              Update ε , ε i and p i , n e x t P o i n t .
               k = k + 1
              check()
          End While
Theorem 1.
Considering sub-problem 2 (11) with the initial trajectory obtained from Step 1, and incorporating the consistency variable (12) together with the desired flight time (13), the flight-time coordination framework can be established through the flight-time-consistent algorithm in Algorithm 2. Within this framework, the constraint S 2 2 in (11) is guaranteed, and the overall solution of sub-problem 2 can be ultimately achieved under Assumption 2 and appropriate selection of design parameters.
Assumption 2.
1. 
In Equations (12) and (13), δ [ k ] Δ < is uniformly bounded. This assumption is ensured by the local A*-based trajectory planner. In our scenarios, A* can always find a feasible path within finite time [33].
2. 
In Equation (15), H 1 , i and H 2 , i are both bounded as 0 < H ̲ 1 λ min ( H 1 [ k ] ) λ max ( H 1 [ k ] ) H ¯ 1 , 0 < H ̲ 2 λ min ( H 2 [ k ] ) λ max ( H 2 [ k ] ) H ¯ 2 , k . Due to the definitions of H 1 , i and H 2 , i , this bound holds naturally.
3. 
The communication topology L contains at least one spanning tree.
Proof. 
Considering Equation (12), we have
ε i k + 1 = H 1 , i k a i , j w i , j ε i k ε j k + b i , N + 1 w i , N + 1 ε i k ε k H 1 , i k + H 2 , i k ,
where H 2 , i = t i , n e x t P o i n t + δ i k + τ c o n t r o l / T , H 1 , i = Γ w P t i , l o c a l / V c r u i s e , i T .
Under the Assumption 2, rewrite Equation (15) in matrix form:
ε k + 1 = H 1 k L ¯ ε k H 1 k + H 2 k ,
where L ¯ = l i , j R N + 1 × N + 1 is the augmented Laplacian matrix that incorporates the absolute time reference ε . Define a i , N + 1 b i , N + 1 and w N + 1 , j = 0 , j . Then, l i , j = a i , j w i , j , l i , i = i j a i , j w i , j , l N + 1 , j = 0 , i { 1 , , N } , j { 1 , , N + 1 } . Referring to Proposition 1 in [29], a transformation matrix I R N 1 × N for the complementary consensus subspace is introduced such that
ϑ k = I ε k .
I satisfies I 1 n = 0 . This implies that ϑ = 0 is equivalent to ε span { 1 n } , meaning the UAV achieves flight time coordination at step k. Choosing I T I = I n × n 1 n 1 n 1 n T , then, I T L ¯ I = L ¯ can be further derived based on L ¯ 1 n = 0 . Furthermore, the error system is
ϑ 0 = 0 , ϑ k + 1 = I H 1 k L ¯ I T ϑ k I H 1 k + H 2 k .
Equation (18) can be written as the discrete-time system as
ϑ [ k + 1 ] = Φ [ k ] ϑ [ k ] + d [ k ] , k Z 0 ,
where Φ [ k ] = I H 1 [ k ] L ¯ I T , d [ k ] = I H 1 [ k ] + H 2 [ k ] . Based on Assumption 2, we have
Φ [ k ] H ¯ 1 λ max ( L ¯ ) = Λ .
By choosing parameters γ and adjusting L ¯ such that Λ < γ < 1 . Therefore, there exists a constant C > 0 such that
Φ [ k 1 ] Φ [ 0 ] C γ k .
The iteration solution of Equation (19) is
ϑ [ k ] = Φ [ k 1 ] Φ [ 0 ] ϑ [ 0 ] + j = 0 k 1 Φ [ k 1 ] Φ [ j + 1 ] d [ j ] .
Since = 0 k 1 γ = 0 γ = 1 1 γ , we obtain ϑ [ k ] C γ k ϑ [ 0 ] + C 1 γ sup 0 t k d [ t ] . Hence ϑ [ k ] converges exponentially to a bounded neighborhood related to sup d [ t ] .
If the case of time variations is considered, the synchronization law (18) becomes a linear system of time variations. The system remains uniformly asymptotically stable if Γ ( w P t i , l o c a l ) satisfies certain conditions. Similarly, when the effect of communication delays is taken into account, (15) introduces a time delay into the system. Convergence can still be guaranteed if the maximum delay does not exceed a specified threshold. □

4.2. DF Based Planning Algorithm for TP

In this section, the DF-based planning method implements GLOBAL_PLANNING and LOCAL_PLANNING mentioned in Algorithm 2.
A nonlinear system X ˙ = f X , U is said to be differentially flat if there exists a vector Y = f 1 X , , X m 1 , , U , , U m 2 that satisfies the following properties:
  • Y is differentiable;
  • The system states X and control inputs U can be expressed as functions of Y and its finite-order derivatives X = g 1 Y , , Y n 1 and U = g 2 Y , , Y n 2 .
Such systems are termed flat systems, and Y is the differential flat output. Through DF, the original system can be mapped to a lower-dimensional, linear flat space. By planning the flat outputs, algebraic solutions for X and U can be directly obtained.
Theorem 2.
Considering (1), a DF output for FW-UAVs is selected as Y i = p i , α i . From this flat output, all other state variables X i = p i , v a , i e , V a , i , Θ i , Ω i and control inputs U i = Ω c , i , T e i can be uniquely derived. The applicability of this theorem is restricted to the case of translational-flatness parametrization [34,35].
Proof. 
First, based on the definition of p ˙ i = v a , i e , V a , i = | | v a , i | | , it can be readily shown that p i , v a , i e , V a , i can be characterized by Y i as follows:
V a , i = g 1 p i .
Then, it can be derived that
R a , i b = g 2 α i .
Furthermore, the transformation matrix R a , i e can be further expressed as
R a , i e = x a , i , y a , i , z a , i ,
where x a , i is the projection of the x a axis of air coordinates in the inertial system. Given that the velocity vector is aligned with the x a axis of the air-relative frame, we can obtain
x a , i e = v a , i e | | v a , i e | | .
Based on (1), we can obtain
m i v ˙ a , i e + m i g e 2 = T e i cos α i D i x a , i e + L i y a , i e .
Dot multiplying both sides by x a , i e yields
x a , i e T m i v ˙ a , i e + m i g e 2 = T e i cos α i D i .
Substituting the above equation into (1), we get
L i y a , i e = m i v ˙ a , i e + m i g e 2 x a , i e T m i v ˙ a , i e + m i g e 2 x a , i e .
Considering | | y a , i e | | = 1 , then
y a , i e = v ˙ a , i e + g e 2 x a , i e T v ˙ a , i e + g e 2 x a , i e | | v ˙ a , i e + g e 2 x a , i e T v ˙ a , i e + g e 2 x a , i e | | .
According to the orthogonality of coordinate, we can obtain
z a , i e = x a , i e × y a , i e .
Thus
Θ i = g 3 R b , i e = g 3 R a , i e R a , i b T = g 4 Y i .
Considering Ω i × = R b , i e T R ˙ b , i e , it is derived that
Ω i = g 5 Y i .
Thus, Ω c , i = Ω i = g 5 Y i .
According to (28), T e i can be expressed as
T e i = x a , i e T m i v ˙ a , i e + m i g e 2 + D i cos α i .
Considering D i = g 6 Y i , then
T e i = g 7 Y i .
Therefore, all system states X i and control inputs U i are explicitly expressed as functions of Y i and its finite-order time derivatives. □
During the planning process, constraints S 2 defined in the state space can be mapped to constraints Y i in the flat output space. Subsequently, all desired X i and U i can be inversely mapped from the planned flat outputs. However, if the real-time state of the UAV is available, the control inputs U i can be derived by introducing a feedback controller to achieve rapid tracking of the desired state variables, as shown in Figure 3. This feedback control eliminates the need for explicit inverse mapping calculations from flat outputs to desired controls, thereby significantly improving control efficiency and responsiveness.
When applying the DF-based approach, the TP problem is reformulated as an optimization problem, in which the following aspects must be taken into account: trajectory representation, flight time allocation, cost function definition, and constraint definition.
A. Trajectory representation
For Y i , Bézier curves are employed for fitting. Further specific details on Bézier curves can be found in [36]. To simplify, the UAV’s cruising altitude y i t = H c is assumed to be fixed. The angle of attack α i t can then be determined based on the lift–gravity balance equation:
α i t = C L , i 1 2 m i g ρ V a , i 2 S i ,
where C L , i 1 · denotes the inverse mapping from the lift coefficient to α i t . Consequently, only the horizontal positions x i , z i are planned. The Bézier curve parameterization for a dimension μ = x , z of one segment is given by the following:
x i = B 1 ( t ) = s B 1 h = 0 N p c B 1 h b N p h t T i , 1 s B 1 , T i , 1 t T i , 2 , B N l , i 1 ( t ) = s B N l , i 1 h = 0 N p c B N l , i 1 h b N p h t T i , N l , i 1 s B N l , i 1 , T i , N l , i 1 t T i , N l , i .
where b N p h t B = C N p h t B h 1 t B N p h , t B 0 , 1 are the Bernstein basis polynomials, N p is the degree of the curve, and C · is the binomial coefficient. For the first piecewise segment B 1 ( t ) from the beginning point to the next waypoint, c B 1 h denotes the h-th control point for this specific curve segment. The parameters s d , d = B 1 , , B N l , i 1 are scaling factors used to scale the time interval of each segment, where N l , i is the total number of waypoints w P t i or w P t i , l o c a l in the segment.
B. Flight Time Allocation
The Bézier desired flight time for each waypoint within the trajectory is allocated using the following formula:
T i , d = T i , d 1 + T i | | p i , d + 1 p i , d | | / d = 1 , , N l , i 1 | | p i , d + 1 p i , d | | .
C. Cost Function Definition
Based on the aforementioned definitions, for a single UAV, the cost function to be optimized is reformed to minimize the motion jerk and additional cost:
min J i = min μ x i , z i J μ , i + J e , i = min μ x i , z i t = 0 T i p i , μ 3 t 2 d t + t = 0 T i j e , i 2 d t ,
where J μ , i represents the minimization of the UAV’s motion jerk. Taking the x i -direction as an example, the cost function can be expressed as J x i , i = c x , i T Q x , i c x , i , where c i = c B 1 0 , , c B N l , i 1 N p is the vector of all control points in the x i -direction, which are the variables to be optimized. Q x , i is the positive semi-definite Hessian matrix of the cost function.
For the symmetric attitude soft constraint J e , i , a barrier function Ψ x , x max , x s m o o t h is defined to enforce it as a soft constraint:
Ψ x , x max , x s m o o t h = K s i g S i g x , x max , x s m o o t h | | x | | ε s i g ,
where ε s i g is a very small positive number to prevent a logarithm of zero or negative values, and K s i g is a scaling factor. S i g x , x max , x s m o o t h = 1 / 1 + exp 10 x max | | x | | / x s m o o t h . Ψ x , x max . x s m o o t h has the following properties:
  • Ψ x , x max , x s m o o t h 0 ;
  • As | | x | | x max x s m o o t h , Ψ 0 ;
  • When | | x | | > x max , Ψ K s i g ( 1 ε ) | | x | | / ε s i g , which reaches a maximum value.
By minimizing this barrier function Ψ 0 , it can be guaranteed that | | x | | x max x s m o o t h . Thus, constraint S 2 7 is transformed into
j e , i = Ψ ϕ i t , ϕ i , m a x , ϕ i , s m o o t h + Ψ θ i t , θ i , m a x θ i , s m o o t h .
D. Constraint Definition
Waypoint Constraints: For both the initial reference trajectory P i 0 and subsequent local replanning trajectories P i k , S 2 1 must be satisfied. In the initial global planning phase, P i 0 is required to pass through all assigned waypoints w P t i and satisfy initial and terminal position, velocity, and acceleration constraints. In the online replanning phase, P i k only needs to satisfy the start and end point constraints of the current replanning segment, as intermediate waypoints are dynamically handled by the safe flight corridors. Considering the properties of Bézier curves, these waypoint constraints can be transformed into constraints on the curve control points c d h , d = B 1 , , B N l , i 1 , h = 0 / N p and their derivatives. Taking the start point constraint of the first curve segment for x i -direction as an example, we have
c B 1 0 s B 1 = x i , s t a r t , p i , s t a r t = w P t i 1 , c B 1 0 , 1 = x ˙ i , s t a r t , c B 1 0 , 2 s B 1 1 = x ¨ i , s t a r t ,
where c B 1 h , l , h = 0 , N p , l = 0 , 2 represents the l-th order derivative of the h-th control point of this curve segment, c B 1 h , l = N p ! N p l ! c B 1 h + 1 , l 1 c B 1 h , l 1 .
Continuity Constraints: To ensure smooth and continuous UAV motion, the connections between adjacent Bézier curve segments must be multi-order continuity connections. Taking the first two curve segments for x i -direction as an example, this constraint can be expressed as equality constraints on control points c B 1 N p , c B 2 0 and their derivatives, i.e.,
c B 1 N p , l s B 1 1 l = c B 2 0 , l s B 2 1 l , l = 0 , 1 , 2 .
Safe Flight Corridors Constraint: During the online replanning phase, to ensure obstacle avoidance constraints S 2 3 S 2 4 , safe flight corridor constraints are introduced. Taking the x i -direction as an example, considering the convex hull property of Bézier curves, ensuring that all control points c d h , d = B 1 , , B N l , i 1 , h = 0 , , N p of a curve segment lie within a defined convex hull guarantees that the entire trajectory segment also remains within that hull. This property also applies to the derivatives of the curve.
First, the local replanning area is discretized into a grid. Then, an A* algorithm is utilized to generate a sequence of central path points w P t i , l o c a l that satisfy the obstacle avoidance constraints S 2 3 S 2 4 . Subsequently, each sub-waypoint is expanded to form a rectangular safe corridor Ξ d = p | p Ξ d , p s a t i s f i e s S 2 3 S 2 4 , ultimately forming the total safe corridor for the local planning interval Ξ = Ξ d , Ξ d Ξ d + 1 . For a certain curve segment d within the interval, it is only necessary to apply constraints to all control points of the curve, ensuring that they satisfy
x d , min c d h s d x d , max ,
where x d , min , x d , max are the lower and upper bounds of the safe corridor for the d-th curve segment.
Kinematic Constraints: Since the derivatives of control points also possess the convex hull property, velocity and acceleration constraints can be directly imposed on the control point derivatives. Considering constraints S 2 5 S 2 6 , taking a curve segment for the x i -direction as an example, we have
v i , m a x c d h , 1 v i , m a x , a i , m a x m i g c d h , 2 s d 1 a i , m a x m i g .
For μ = z i , turning constraints also need to be considered.
In summary, the TP problem can be formulated as a constrained optimization problem. The waypoint visit and continuity constraints are transformed into linear equality constraints on the control points. Safe corridor constraints and kinematic limits are represented as linear inequality constraints on control points and their derivatives. Finally, attitude constraints are incorporated as soft constraints through barrier functions. Thus, this planning problem can be formally described as
min c i T Q i c i + J e , i s . t . A e q , i c i = b e q , i , A i e , i c i b i e , i ,
where c i = c B 1 , x 0 , , c B 1 , x N p , , c B N l , i 1 , x 0 , , c B N l , i 1 , x N p , c B 1 , z 0 , , c B N l , i 1 , z N p . When J e , i is not considered, this problem reduces to a convex quadratic programming problem, which can be efficiently solved using a QP solver to obtain an initial feasible trajectory. Subsequently, J e , i is considered for further optimization to obtain the final trajectory.

5. Simulation and Analysis

The performance of the algorithm is evaluated and validated in this section. All algorithms are executed on the same hardware platform: Intel i5-9300H CPU, 32 GB RAM, and NVIDIA GeForce GTX 1650 GPU. Due to the fixed cruising altitude assumption for UAVs, the simulation environment is simplified to a two-dimensional scenario. Target positions P t a r g e t and no-fly zone positions P o b s are randomly generated within a 2.5 km × 2.5 km area. All UAVs are assumed to depart from the origin 0 , 0 m, and return to 0 , 0 m after completing the MTR mission. A precise aerodynamic force model and 6-DOF kinematic motion are constructed, while attitude control is provided by a simulated PX4 controller. Additionally, the UAV’s detectable range is set to 200 m. The MOSEK optimizer is employed to solve the convex QP problems arising from the trajectory generation. The main parameters for the UAVs and environment are constructed in Table 3.

5.1. Time-Balanced Clustering Simulation

The simulation result of Algorithm 1 is shown in Figure 4, Figure 5 and Figure 6. Figure 4 illustrates the convergence of flight times during the optimization process: the green-shaded region indicates the execution of the target reallocation, while the red-shaded region denotes the application of the intra-cluster optimization. As the optimization progresses, the individual UAV flight times gradually converge, with the flight times of the four UAVs being T 1 = 180 s , T 2 = 185 s , T 3 = 185 s , T 4 = 186 s . The average flight time decreases from 218 s to 184 s. Figure 5 shows the target allocation and visiting order. Each target position is represented by a solid dot, where lighter colors indicate earlier reconnaissance and darker colors indicate later reconnaissance within a UAV’s sequence. The different colored lines represent the distinct routes planned for each UAV. Specifically, Figure 5a shows the initial target routes w P t , where targets belonging to each UAV are spatially grouped with minimal differences in the number of assigned targets. Figure 5b illustrates the final adjusted target allocation and optimized target routes w P t . Notably, since U A V 1 has the highest optimal cruising speed, its optimized route is consequently the longest, followed by U A V 2 , while U A V 1 has the shortest route, demonstrating the algorithm’s ability to balance mission durations despite heterogeneity.
Figure 6a shows the scalability of Algorithm 1 under various configurations ( M , N ) , presenting runtime, time variance, and constraint-violation ratios (percentage of optimizations completed within the threshold). The computational efficiency of the time-balanced clustering is further evaluated against representative MTA algorithms ([29,37]), as illustrated in Figure 6b with M = 5 N .

5.2. Flight-Time-Consistent Planning Simulation

In this section, the simulation result of Algorithm 2 is shown in Figure 7, Figure 8 and Figure 9. The allowed speeds for each UAV are V a , i , m i n = 0.5 × V c r u i s e , i and V a , i , max = 2 × V c r u i s e , i . The desired mission duration for all UAVs is selected as T = 185 s. All UAVs are subject to specific kinematic constraints: n i , x _ m a x = 2 , ϕ i , m a x = 30 ,   θ i , m a x = 45 , R t u r n = 100 m . The replanning time interval τ p l a n n i n g = 5 s , the control time interval τ c o n t r o l = 0.05 s , the coordination coefficients a i , j = b i , j = 1 , and the adjacency value w i , i + 1 = 1 , i = 1 , 2 , 3 . The initial and terminal velocities for all UAVs are set to V a , i 0 = V a , i T = 50 m / s , i . During fixed-wing flight, the Bank-to-Turn coordinated turning method is employed to generate Ω c , i , and attitude control is achieved through PX4 inner-loop attitude control. To achieve inner–outer loop decoupling and reduce system complexity, the inner-loop bandwidth is designed to be ten times larger than that of the outer loop.
Figure 7 illustrates both the global and local planning processes for the UAVs. In Figure 7a, the colored solid curves depict the optimized initial reference trajectory P i 0 that initially disregard no-fly zones, with lighter colors indicating lower speeds. Blue circular regions represent the no-fly zones, and for clarity, only those impacting the trajectories are shown. The dashed curves represent the actual flight trajectories obtained after local online replanning by each UAV. Figure 7b provides a magnified local view, clearly demonstrating that when a global reference trajectory conflicts with a no-fly zone, a safe local trajectory can be planned through the dynamically generated safe flight corridor, enabling the UAV to circumvent the obstacle and satisfy safety constraints.
Figure 8a,b collectively demonstrate the efficacy of the proposed constraint enforcement framework. Figure 8a presents the velocity profiles of all UAVs. The straight horizontal lines indicate the upper and lower velocity constraint bounds. The dashed curves in the green-shaded region on the left represent the UAV velocities planned without explicit velocity constraints, while the solid curves on the right show the UAV velocities when constraints are actively enforced. Figure 8b displays the overload curve for UAV1. The dashed curve in the green-shaded region on the left indicates the axial overload of UAV1 when overload constraints are not considered, whereas the solid curve on the right represents the UAV’s overload when these constraints are actively applied.
Figure 9 illustrates the temporal evolution of the consistency variable ε i and the global absolute time reference ε for all UAVs. The green-shaded region on the left shows the variable changes when the flight-time-consistent strategy is not introduced, highlighting significant deviations. In contrast, the right side demonstrates the changes after introducing the proposed algorithm. This comparison clearly shows that all UAVs can strictly adhere to the absolute time reference for real-time planning, thereby successfully achieving precise flight time coordination.

6. Conclusions

This article investigated the synchronous and coordinated MTA and TP problem for heterogeneous FW-UAVs under dynamic environments and kinematic constraints. A time-balanced clustering-based assignment algorithm that minimizes the overall mission completion time while simultaneously ensuring equitable flight-time balance among heterogeneous UAVs is developed. Furthermore, a robust DF-based online planning algorithm, which guarantees the generation of kinematically feasible, collision-free, and time-synchronized trajectories, is proposed. The proposed framework decomposes the original large-scale and tightly coupled optimization problem into two sub-problems, thereby enabling scalable and real-time implementation. Rigorous theoretical analysis and extensive simulations demonstrated the feasibility and effectiveness of the proposed approach in achieving synchronized MTR missions.
The hierarchical MTA–TP framework proposed in this paper is compatible with the onboard computational architecture of most existing UAV platforms. In addition, this framework can be further integrated with AI-driven autonomous systems. For example, generative AI algorithms [38] could be incorporated into the upper-layer decision-making module to provide high-quality initial cluster partitions, thereby reducing computational time. Such integration represents a promising direction for future development.

Author Contributions

Conceptualization, X.W. and J.Z.; methodology, X.W. and Z.M.; software, X.W.; validation, X.W.; formal analysis, J.Z.; investigation, J.Z.; resources, J.Z.; data curation, C.C.; writing—original draft preparation, X.W.; writing—review and editing, X.W.; visualization, X.W.; supervision, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions of this study are presented in this article, and further inquiries can be directed towards the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Lu, Q.; Qiu, Y.; Guan, C.; Wang, H.; Zhu, M.; Xu, B.; Li, W.; Fan, Z. Coordinated multi-UAV reconnaissance scheme for multiple targets. Appl. Sci. 2023, 13, 10920. [Google Scholar] [CrossRef]
  2. Hu, L.; Xi, B.; Yi, G.; Zhao, H.; Zhong, J. A multiple heterogeneous UAVs reconnaissance mission planning and re-planning algorithm. J. Syst. Eng. Electron. 2022, 33, 1190–1207. [Google Scholar]
  3. Pang, Q.; Hu, Y.; Li, W.; Zhao, Y.; Zhu, L. Research on multi-UAV cooperative reconnaissance mission planning methods: An overview. Telecommun. Eng. 2019, 59, 741–748. [Google Scholar]
  4. Mahmud, I.; Cho, Y. Detection avoidance and priority-aware target tracking for UAV group reconnaissance operations. J. Intell. Robot. Syst. 2018, 92, 381–392. [Google Scholar] [CrossRef]
  5. Jia, G.W.; Wang, J.F. Research review of UAV swarm mission planning method. J. Syst. Eng. Electron. 2021, 43, 99–111. [Google Scholar]
  6. Xue, W.; Qi, J.; Shao, G.; Xiao, Z.; Zhang, Y.; Zhong, P. Low-rank approximation and multiple sparse constraint modeling for infrared low-flying fixed-wing UAV detection. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2021, 14, 4150–4166. [Google Scholar] [CrossRef]
  7. Kovacik, L.; Novak, A.; Kazda, A.; Lusiak, T. Automatic Commercial Aircraft Formation Flight. In Proceedings of the NTAD 2019, Praha, Czech Republic, 19–20 September 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 106–109. [Google Scholar]
  8. Mosterman, P.J.; Sanabria, D.E.; Bilgin, E.; Zhang, K.; Zander, J. A heterogeneous fleet of vehicles for automated humanitarian missions. Comput. Sci. Eng. 2014, 16, 90–95. [Google Scholar] [CrossRef]
  9. Bänziger, T.; Kunz, A.; Wegener, K. Optimizing human–robot task allocation using a simulation tool based on standardized work descriptions. J. Intell. Manuf. 2020, 31, 1635–1648. [Google Scholar] [CrossRef]
  10. Liu, X.-F.; Zhang, J.; Wang, J. Cooperative particle swarm optimization with a bilevel resource allocation mechanism for large-scale dynamic optimization. IEEE Trans. Cybern. 2023, 53, 1000–1011. [Google Scholar] [CrossRef]
  11. Pendharkar, P.C. An ant colony optimization heuristic for constrained task allocation problem. J. Comput. 2015, 7, 37–47. [Google Scholar] [CrossRef]
  12. Lu, W.; Liu, H.; Ren, Z.; Gao, Q.; Liu, D.; Wang, X.; Guo, M. Task assignment with minimum cost for multi-UAV system via reinforcement learning. In Proceedings of the 2023 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1654–1658. [Google Scholar]
  13. Kucera, J.; Novak Sedlackova, A.; Voumik, L.C. Synthetic Data and Image Processing Tools, Immersive 3D and Digital Contact Tracing Technologies, and Cognitive Artificial Intelligence and Spatial Computing Algorithms in the Metaverse Interactive Environment. Rev. Contemp. Philos. 2023, 21, 85–101. [Google Scholar] [CrossRef]
  14. Patel, R.; Rudnick-Cohen, E.; Azarm, S.; Otte, M.; Xu, H.; Herrmann, J.W. Decentralized task allocation in multi-agent systems using a decentralized genetic algorithm. In Proceedings of the International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 3770–3776. [Google Scholar]
  15. Choi, H.J.; Kim, Y.D.; Kim, H.J. Genetic algorithm based decentralized task assignment for multiple unmanned aerial vehicles in dynamic environments. Int. J. Aeronaut. Space Sci. 2011, 12, 163–174. [Google Scholar] [CrossRef]
  16. Geng, N.; Chen, Z.; Nguyen, Q.A.; Gong, D. Particle swarm optimization algorithm for the optimization of rescue task allocation with uncertain time constraints. Complex Intell. Syst. 2021, 7, 873–890. [Google Scholar] [CrossRef]
  17. Li, S.; He, X.; Xu, X.; Zhao, T.; Song, C.; Li, J. Weapon-target assignment strategy in joint combat decision-making based on multi-head deep reinforcement learning. IEEE Access 2023, 11, 113740–113751. [Google Scholar] [CrossRef]
  18. Elango, M.; Nachiappan, S.; Tiwari, M.K. Balancing task allocation in multi-robot systems using k-means clustering and auction based mechanisms. Expert Syst. Appl. 2011, 38, 6486–6491. [Google Scholar] [CrossRef]
  19. Li, X. Dynamic multiobjective optimization for thrust allocation in ship application. Ocean Eng. 2020, 218, 108187. [Google Scholar] [CrossRef]
  20. Zhu, D.; Zhou, B.; Yang, S.X. A novel algorithm of multi-AUVs task assignment and path planning based on biologically inspired neural network map. IEEE Trans. Intell. Veh. 2021, 6, 333–342. [Google Scholar] [CrossRef]
  21. Puente-Castro, A.; Rivero, D.; Pazos, A.; Fernandez-Blanco, E. A review of artificial intelligence applied to path planning in UAV swarms. Neural Comput. Appl. 2022, 34, 153–170. [Google Scholar] [CrossRef]
  22. Li, M.; Zhang, H. AUV 3D path planning based on A* algorithm. In Proceedings of the Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 11–16. [Google Scholar]
  23. Yan, Z.; Pan, X.; Yang, Z.; Yue, L. Formation control of leader-following multi-UUVs with uncertain factors and time-varying delays. IEEE Access 2019, 7, 118792–118805. [Google Scholar] [CrossRef]
  24. Wang, X.; Liu, H.; Gao, Q. Data-driven decision making and near-optimal path planning for multiagent system in games. IEEE J. Miniaturization Air Space Syst. 2023, 4, 320–328. [Google Scholar] [CrossRef]
  25. Sujin, B.; Kshirsagar, P.R.; Tak, T.K.; Sonekar, S.V. Enhancing UAV Path Planning in Dynamic Environments with RRT, BFOA, and Generative AI-based Predictive Models. In Proceedings of the 5th International Conference on Pervasive Computing and Social Networking (ICPCSN), Salem, India, 14–16 May 2025; IEEE: Piscataway, NJ, USA, 2025; pp. 752–758. [Google Scholar]
  26. Luo, Y.; Song, J.; Zhao, K.; Liu, Y. UAV-cooperative penetration dynamic-tracking interceptor method based on DDPG. Appl. Sci. 2022, 12, 1618. [Google Scholar] [CrossRef]
  27. Liu, Y.; Zhang, X.; Zhang, Y.; Guan, X. Collision free 4D path planning for multiple UAVs based on spatial refined voting mechanism and PSO approach. Chin. J. Aeronaut. 2019, 32, 1504–1519. [Google Scholar] [CrossRef]
  28. Lu, G.; Cai, Y.; Chen, N.; Kong, F.; Ren, Y.; Zhang, F. Trajectory generation and tracking control for aggressive tail-sitter flights. Int. J. Robot. Res. 2024, 43, 241–280. [Google Scholar] [CrossRef]
  29. Liu, T.; He, X.; Niu, Y.; Li, J.; Li, Z. TICOP: Time-critical coordinated planning for fixed-wing UAVs in unknown unstructured environments. IEEE Robot. Autom. Lett. 2024, 9, 9629–9636. [Google Scholar] [CrossRef]
  30. Hauser, J.; Hindman, R. Aggressive flight maneuvers. In Proceedings of the 36th IEEE Conference Decision Control (ICDC), San Diego, CA, USA, 12 December 1997; pp. 4186–4191. [Google Scholar]
  31. Rudolph, G. Convergence Properties of Evolutionary Algorithms; Verlag Dr. Kovač: Hamburg, Germany, 1997. [Google Scholar]
  32. Amorim, R.C.; Makarenkov, V. On k-means iterations and Gaussian clusters. Neurocomputing 2023, 553, 126547. [Google Scholar] [CrossRef]
  33. Hart, P.E.; Nilsson, N.J.; Raphael, B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
  34. Elango, P.; Mohan, R. Trajectory optimisation of six degree of freedom aircraft using differential flatness. Aeronaut. J. 2018, 122, 1788–1810. [Google Scholar] [CrossRef]
  35. Liu, T.; Li, J.; Zou, F.; Wang, B.; Niu, Y. Differential flatness-based trajectory planning and tracking for fixed-wing aircraft in clustered environments. In Proceedings of the 42nd Chinese Control Conference (CCC), Tianjin, China, 24–26 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 3627–3631. [Google Scholar]
  36. Vinayak, A.; Zakaria, M.A.; Baarath, K.; Majeed, A.P.P.A. A novel Bézier curve control point search algorithm for autonomous navigation using N-order polynomial search with boundary conditions. In Proceedings of the IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 19–22 September 2021; pp. 3884–3889. [Google Scholar]
  37. Wang, Y.; Li, H.; Shen, Q. A hierarchical multi-task and multi-agent assignment approach: Learning DQN strategy from execution. IEEE Trans. Autom. Sci. Eng. 2025, 22, 14712–14722. [Google Scholar] [CrossRef]
  38. Sun, G.; Xie, W.; Niyato, D.; Du, H.; Kang, J.; Wu, J.; Sun, S.; Zhang, P. Generative AI for Advanced UAV Networking. IEEE Netw. 2025, 39, 244–253. [Google Scholar] [CrossRef]
Figure 1. Coordinate frames for a fixed-wing UAV.
Figure 1. Coordinate frames for a fixed-wing UAV.
Aerospace 13 00069 g001
Figure 2. Overall solution framework for synchronized coordinated assignment and planning.
Figure 2. Overall solution framework for synchronized coordinated assignment and planning.
Aerospace 13 00069 g002
Figure 3. Framework of the online trajectory planning and control architecture.
Figure 3. Framework of the online trajectory planning and control architecture.
Aerospace 13 00069 g003
Figure 4. Evolution of individual UAV flight time during the time-balanced clustering optimization. The average flight time decreased from 218 s to 184 s, approximately 15.6 % .
Figure 4. Evolution of individual UAV flight time during the time-balanced clustering optimization. The average flight time decreased from 218 s to 184 s, approximately 15.6 % .
Aerospace 13 00069 g004
Figure 5. Comparison of target allocation and flight routes: (a) Initial clusters. Targets belong to each UAV are spatially grouped with minimal differences. (b) Final optimized time-balanced clusters and routes. Algorithm 1 can balance mission durations despite heterogeneity.
Figure 5. Comparison of target allocation and flight routes: (a) Initial clusters. Targets belong to each UAV are spatially grouped with minimal differences. (b) Final optimized time-balanced clusters and routes. Algorithm 1 can balance mission durations despite heterogeneity.
Aerospace 13 00069 g005
Figure 6. Comparison of runtime of Algorithm 1: (a) Comparison of runtime and constraint-violation ratio for different ( M , N ) . Runtime satisfies O m a x M , M 2 / N and constraint-violation ratios are less than 20% . (b) Comparison of runtime between Algorithm 1 and other MTA methods. Algorithm 1 achieves a higher computational efficiency.
Figure 6. Comparison of runtime of Algorithm 1: (a) Comparison of runtime and constraint-violation ratio for different ( M , N ) . Runtime satisfies O m a x M , M 2 / N and constraint-violation ratios are less than 20% . (b) Comparison of runtime between Algorithm 1 and other MTA methods. Algorithm 1 achieves a higher computational efficiency.
Aerospace 13 00069 g006
Figure 7. Global and local trajectory planning for UAVs. The differential-flatness framework enables the UAV to circumvent the obstacle and satisfy safety constraints: (a) Overall mission view. (b) Magnified view of obstacle avoidance.
Figure 7. Global and local trajectory planning for UAVs. The differential-flatness framework enables the UAV to circumvent the obstacle and satisfy safety constraints: (a) Overall mission view. (b) Magnified view of obstacle avoidance.
Aerospace 13 00069 g007
Figure 8. Kinematic constraints for UAVs. Kinematic constraints can be satisfied under the proposed constraint enforcement framework: (a) Velocity profiles of heterogeneous UAVs with and without kinematic constraints. The speeds are constrained within the allowable ranges of the heterogeneous UAVs. (b) Overload profile of UAV1 with and without kinematic constraints. The overloads are constrained within the allowable ranges of the heterogeneous UAVs.
Figure 8. Kinematic constraints for UAVs. Kinematic constraints can be satisfied under the proposed constraint enforcement framework: (a) Velocity profiles of heterogeneous UAVs with and without kinematic constraints. The speeds are constrained within the allowable ranges of the heterogeneous UAVs. (b) Overload profile of UAV1 with and without kinematic constraints. The overloads are constrained within the allowable ranges of the heterogeneous UAVs.
Aerospace 13 00069 g008
Figure 9. Evolution of consistency variables ε i and absolute time reference ε . All UAVs can strictly adhere to the absolute time reference for real-time planning and m a x ( ε i ) m i n ( ε i ) < 1% .
Figure 9. Evolution of consistency variables ε i and absolute time reference ε . All UAVs can strictly adhere to the absolute time reference for real-time planning and m a x ( ε i ) m i n ( ε i ) < 1% .
Aerospace 13 00069 g009
Table 1. Symbol and Meaning.
Table 1. Symbol and Meaning.
SymbolMeaning
NNumber of UAVs
MNumber of reconnaissance targets
QNumber of no-fly zones
P t a r g e t Set of target positions
ATarget index set
w P t i Ordered target position set of UAV i
A i Ordered target index set of UAV i
P o b s Set of no-fly zone positions
K i Number of targets of UAV i
O A , i Center of the target cluster of UAV i
V c r u i s e Set of optimal cruising speeds for UAVs
P i Post-optimization trajectory of UAV i
p i Current position of UAV i
L o i Local replanning region of UAV i
w P t i , l o c a l Ordered local target set in the replanning region of UAV i
N l , i Number of waypoints in the replanning region of UAV i
N p Number of Bézier curve segments
Table 3. Main parameters of the UAVs and environment.
Table 3. Main parameters of the UAVs and environment.
Simulation ParameterValue
Number of UAVs N4
Number of targets M40
Number of no-fly zones Q8
Optimal cruise speed V c r u i s e 80 , 60 , 50 , 30 m/s
Bézier curve degree N p 8
Safe radius of UAV R U A V 10 m
Radius of static no-fly zones R o b s , q 10 , 100 m
Smooth parameters ϕ i , s m o o t h and θ i , s m o o t h 5
A* grid length5 m
Corridor width30 m
Detection range200 m
Forward field of view ± 30
Dual feasibility tolerance in MOSEK 10 4
Primal feasibility tolerance in MOSEK 10 4
Infeasibility tolerance in MOSEK 10 4
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, X.; Zhang, J.; Ma, Z.; Cao, C.; Liu, H. Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs. Aerospace 2026, 13, 69. https://doi.org/10.3390/aerospace13010069

AMA Style

Wang X, Zhang J, Ma Z, Cao C, Liu H. Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs. Aerospace. 2026; 13(1):69. https://doi.org/10.3390/aerospace13010069

Chicago/Turabian Style

Wang, Xindi, Jiansong Zhang, Zhenyu Ma, Chuanshuo Cao, and Hao Liu. 2026. "Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs" Aerospace 13, no. 1: 69. https://doi.org/10.3390/aerospace13010069

APA Style

Wang, X., Zhang, J., Ma, Z., Cao, C., & Liu, H. (2026). Online Synchronous Coordinated Assignment and Planning for Heterogeneous Fixed-Wing UAVs. Aerospace, 13(1), 69. https://doi.org/10.3390/aerospace13010069

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop