Next Article in Journal
A Factors–Responses–Consequences Framework for Assessing Wildlife Impacts of Uncrewed Aerial Systems: A Systematic Review
Previous Article in Journal
Comparative Evaluation of Segmentation-Based and Pose-Assisted Head Temperature Estimation from UAS Thermal Imagery Under Controlled Conditions
Previous Article in Special Issue
Method for Controlling the Movement of an AUV Follower Based on Visual Information About the Position of the AUV Leader Using Reinforcement Learning Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cooperative Online 3D Path Planning for Fixed-Wing UAVs

1
School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
2
School of Astronautics, Northwestern Polytechnical University, Xi’an 710072, China
3
Jinan Aerospace Information Laboratory of Shandong Province, Qilu Aerospace Information Research Institute, Jinan 250100, China
*
Author to whom correspondence should be addressed.
Drones 2026, 10(4), 297; https://doi.org/10.3390/drones10040297
Submission received: 3 March 2026 / Revised: 9 April 2026 / Accepted: 14 April 2026 / Published: 17 April 2026
(This article belongs to the Special Issue Intelligent Cooperative Technologies of UAV Swarm Systems)

Highlights

What are the main findings?
  • The proposed Cooperative-3D-Quick-Dubins-RRT* algorithm integrates an offline Dubins motion-primitive database specifically tailored for RRT* and four acceleration strategies, enabling real-time kinodynamically feasible planning for fixed-wing UAVs in complex 3D environments.
  • Experimental results show an 86.04% reduction in runtime and 22.63% shorter path lengths compared to conventional Dubins-RRT*, while achieving collision-free trajectories in cluttered terrains and no-fly zones. Hardware-in-the-loop simulations further validate the strong engineering applicability of the proposed method under real onboard execution conditions.
What is the implication of the main finding?
  • This study demonstrates that offline primitive and online search can be effectively combined to overcome the computational bottleneck in UAV path planning, enabling online replanning in dynamic scenarios.
  • The successful deployment in complex 3D environments with restricted zones provides a scalable solution for multi-UAV cooperative missions, paving the way for autonomous swarm navigation in real-world urban or mountainous terrains.

Abstract

Addressing high dynamics, stringent non-holonomic constraints, and limited onboard computation in cooperative online trajectory planning for multiple fixed-wing UAVs in complex 3D obstacle environments, this paper proposes a Cooperative-3D-Quick-Dubins-RRT*. First, an offline motion-primitive database is engineered to align with RRT* mechanics: an unconstrained expansion mode facilitates rapid space exploration, while a constrained rewiring mode ensures kinodynamic continuity. This architecture, synergized with four targeted acceleration strategies (dimensionality reduction, elliptical sampling, tree pruning, and pre-discretized collision checking), significantly accelerates convergence. Second, a Dubins-detour-based time-coordination mechanism is designed to map cooperative timing constraints into controllable path-length adjustments, and the feasible adjustment range is analyzed to ensure realizability. Finally, simulations and hardware-in-the-loop experiments across a variety of representative scenarios are conducted for validation. The results show that, compared with the classical Dubins-RRT*, the proposed method achieves clear advantages in planning time and path length, demonstrating its suitability for online cooperative obstacle-avoidance planning of multiple UAVs.

1. Introduction

With increasing demands for UAVs in complex missions such as infrastructure inspection and precision agriculture, swarm cooperation has become a key trend in UAV system development. Compared to rotary-wing platforms, fixed-wing UAVs offer significant advantages in endurance, speed, and payload capacity, making them particularly well-suited for large-scale, long-range operations. However, a primary challenge in such applications is the development of an efficient online trajectory planning framework capable of supporting cooperative behaviors.
Unlike 2D scenarios, 3D path planning is considered NP-hard due to the additional vertical axis and the exponential expansion of the configuration space [1,2]. This complexity stems from the significantly larger set of candidate points and segments required to ensure safety in 3D compared to 2D environments [3,4]. Consequently, achieving computationally efficient and reliable trajectory generation in 3D spaces remains a fundamental challenge, particularly for fixed-wing UAVs subject to non-holonomic constraints.
In response, researchers have proposed a wide range of path planning algorithms, which can be broadly categorized as follows.
From an optimization perspective, heuristic techniques have been explored, such as using Genetic Algorithms (GAs) for planning on building surfaces [5] or combining GA with Voronoi diagrams [6] to lower computational costs. Some studies even project 3D space onto 2D manifolds to simplify path generation [7]. Some studies have incorporated the concept of receding horizon control or local optimization, updating trajectories only within a finite time window [8,9]. However, these efficiency-oriented strategies often simplify or overlook the inherent non-holonomic constraints of fixed-wing UAVs, potentially leading to dynamically infeasible commands. Other heuristic approaches, including plant growth mechanisms [10] and hydrodynamic-based fluid algorithms [11,12], have also emerged. Although they significantly enhance solution speed, such methods are typically restricted to regular geometric obstacles and struggles with complex, irregular 3D terrains.
Sampling-based approaches, such as RRT [13] and PRM [14], have demonstrated strong performance in high-dimensional environments due to their probabilistic exploration capabilities. Specifically, Yang and Sukkarieh [15] enhanced 3D RRT efficiency by developing a path pruning algorithm to remove redundant waypoints. To further enhance path quality, reference [16] introduces the Rapidly exploring Random Tree Star (RRT*) algorithm. By randomly distributing sample nodes in the configuration space to capture environmental topological features, RRT* represents connectivity through an evolving tree structure. Unlike the basic RRT, RRT* incorporates a parent selection and rewiring mechanism based on a cost function, which iteratively optimizes the connections of intermediate nodes to ensure the asymptotic optimality of the resultant path [17].
However, classical sampling algorithms often neglect non-holonomic constraints, yielding piecewise linear paths with sharp turns that are difficult for fixed-wing UAVs to track. To enhance flyability, researchers have integrated smoothing curves into sampling frameworks. Reference [18] utilizes B-splines within the RRT framework to improve path smoothness. Reference [19] employs Dubins curves to smooth piecewise linear tracks; however, such “plan-then-smooth” strategies inevitably cause the trajectory to deviate from the original collision-free path, introducing new safety risks. Furthermore, References [20,21] utilize polynomial trajectories to connect waypoints via differential flatness theory. Nevertheless, these methods suffer from soaring computational complexity as the scenario scale increases due to frequent nonlinear programming or spline interpolation, making them unsuitable for real-time obstacle avoidance.
To explicitly account for kinodynamic constraints during planning, Dubins-curve-based approaches have been introduced within kinodynamic frameworks. Existing methods either incorporate Dubins paths as heuristics to guide sampling [22,23] or construct precomputed roadmaps [24]. However, these approaches still rely heavily on repeated online analytical solutions and dense collision checking, which results in significant computational overhead, particularly in complex 3D environments.
Crucially, even if efficient and dynamically feasible trajectory planning can be achieved for a single UAV, extending these methods to multi-UAV cooperative scenarios introduces additional layers of complexity. Multi-UAV cooperative trajectory planning requires not only obstacle avoidance but also explicit handling of inter-agent collision avoidance and coordination constraints. Conflict-based search (CBS) addresses spatiotemporal conflicts through decoupled planning and conflict resolution [25], while optimization-based methods formulate multi-objective coordination problems considering path length, time, and threat exposure [26]. Other approaches focus on decentralized collision avoidance and network-based coordination mechanisms to improve real-time responsiveness [27], or integrate task allocation with trajectory planning for mission-level optimization [28].
Despite these advances, two key challenges remain unresolved. First, kinodynamic RRT*-based planners suffer from prohibitive computational latency caused by repeated online analytical solving during tree expansion, making real-time replanning in complex 3D environments difficult to achieve. Second, for fixed-wing UAVs operating at near-constant cruise speeds, temporal synchronization among multiple agents is difficult to achieve through speed control alone, and path-length modulation remains an insufficiently explored mechanism for cooperative time coordination.
To address both challenges, this paper proposes a distributed cooperative trajectory planning framework for fixed-wing UAV swarms. Unlike existing approaches that rely on repeated online solving of kinodynamic models or post-processing-based trajectory smoothing, the proposed framework fundamentally shifts the computational burden from online analytical computation to offline-constructed motion primitives and efficient index-based matching.
To tackle the computational intractability of direct 3D kinodynamic planning, the framework adopts a hierarchical structure that decomposes the problem into a 2D path planning stage followed by a 3D altitude refinement stage. Within the 2D planning stage, a dual-mode motion primitive database, combined with dimensionality-reduced sampling, an ellipsoidal heuristic domain, a pruning mechanism, and localized collision detection, collectively reduces planning latency while ensuring kinodynamic feasibility. In the 3D refinement stage, gradient-descent-based altitude optimization smooths the vertical profile while preserving the coupling between vertical increments and horizontal arc-lengths to maintain flyability. Furthermore, in contrast to conventional time coordination methods that rely on speed adjustment or scheduling strategies, a Dubins-detour-based mechanism is introduced to achieve path-length-driven temporal synchronization among multiple UAVs.
The synergistic design of these components enables simultaneous improvements in planning efficiency, kinodynamic feasibility, and multi-agent temporal coordination in complex cluttered 3D environments.
The main contributions are summarized as follows:
(1)
Modeling: A fixed-wing UAV kinodynamic model and a Digital Elevation Model (DEM) for realistic terrain are established. A constrained optimization model is developed, incorporating non-holonomic constraints, no-fly zones (NFZs), altitude limits, and spatiotemporal coordination requirements.
(2)
Method: A multi-UAV cooperative online trajectory planning architecture is proposed, featuring a dual-mode offline motion-primitive database (inspired by [29]) tailored to the expansion and rewiring processes of RRT*. Through the integration of sampling space dimensionality reduction, elliptical heuristic sampling, tree pruning, and pre-discretized collision checking, the computational load is shifted from intensive online numerical solving to efficient index matching. Additionally, a time-coordination scheme leveraging Dubins detours is presented to ensure synchronization by determining feasible path-length adjustment intervals.
(3)
Validation: A series of comparative experiments are conducted, including multi-UAV cooperative planning simulations, parameter sensitivity analysis, comparisons with conventional Dubins-RRT*, and hardware-in-the-loop (HIL) simulations.

2. Problem Description

2.1. Scenario Description

As illustrated in Figure 1, a typical cooperative mission for UAV swarms unfolds within a highly non-structured and uncertain environment. In the initial phase, a heterogeneous swarm departs from dispersed start points, navigating through intricate 3D corridors to bypass complex terrain elevations and pre-defined no-fly zones (NFZs). During transit, the system must continuously respond to real-time command updates from the ground station—such as target re-assignment—and react to emergent threats like dynamic radar detection zones.
The core challenge of such missions lies in spatiotemporal synchronization: each individual UAV must not only perform autonomous local obstacle avoidance but also adaptively modulate its flight velocity or path length. This ensures the entire swarm can negotiate cluttered spaces and achieve precise simultaneous arrival at the target within a pre-defined time window. Such mission-driven, high-frequency trajectory replanning necessitates an algorithm capable of providing near-instantaneous decision feedback while strictly adhering to non-holonomic constraints and resolving the strong coupling between spatial maneuvers and temporal coordination.
Consequently, the cooperative online trajectory planning problem is defined as searching for collision-free paths within a 3D space cluttered with terrain elevations, static and dynamic obstacles, and no-fly zones. This must simultaneously satisfy multi-fold constraints: safety (e.g., minimum clearance), kinodynamic (e.g., minimum turning radius, maximum climbing angle), and mission requirements (e.g., time coordination, formation keeping). The primary objective is to achieve reliable single-agent planning, consistent multi-agent coordination, and smooth task transitions under these constraints, thereby enhancing the swarm’s adaptability in complex dynamic scenarios.

2.2. Problem Formulation

2.2.1. Fixed-Wing UAV Motion Model

The motion of fixed-wing UAVs in 3D space is highly nonlinear. Since trajectory planning focuses on path generation rather than low-level tracking control, a simplified 3-degree-of-freedom (3-DOF) bank-to-turn (BTT) model is adopted. The UAV dynamics with control inputs of overload and roll angle are formulated as
d x d t = v cos θ cos ψ d y d t = v cos θ sin ψ d z d t = v sin θ d v d t = g N x sin θ d θ d t = g v N z cos φ cos θ d ψ d t = g N z sin φ v cos θ
where x , y , z denotes the UAV position; v is the velocity; θ and ψ represent the flight path angle and heading angle, respectively; N x and N z represent the tangential and normal overloads, respectively; and φ represents the roll angle.
The proposed method adopts a 3-DOF reduced-order UAV model, which simplifies high-order aerodynamic transients while preserving the essential non-holonomic constraints, including minimum turning radius and heading continuity. These constraints are the primary determinants of feasible trajectory generation for fixed-wing UAVs in three-dimensional environments. Such a modeling approach is commonly employed in real-time motion planning, where computational efficiency is critical. Within a hierarchical flight control architecture, the trajectory planning layer focuses on generating kinodynamically feasible reference paths, whereas low-level controllers handle tracking errors and compensate for dynamic disturbances. While higher-fidelity models may improve accuracy under aggressive maneuvers or highly dynamic conditions, they also introduce substantial computational overhead that is incompatible with real-time replanning requirements. Therefore, the adopted model provides an effective balance between computational tractability and motion feasibility for online path planning tasks in complex environments.

2.2.2. 3D Terrain Model

The Digital Elevation Model (DEM), a fundamental dataset in Geographic Information Systems (GISs), is widely used in mapping, terrain analysis, engineering, and military applications. Given the raster format’s efficiency in querying elevations, this paper adopts it to represent 3D terrain, as shown in Figure 2. A raster DEM comprises elevation grids and georeferencing data, the latter containing the geographic extent and spatial resolution. Ground elevation at any point can be obtained via interpolation methods, such as bilinear or nearest-neighbor interpolation.
Since the DEM is defined in the geodetic (LLA) coordinate system, while trajectory planning is computed in the local North-East-Down (NED) frame, frequent coordinate transformations are required for terrain collision detection, increasing computational overhead. In practice, the mission scale of UAVs is much smaller than that of ballistic missiles or long-range rockets. To simplify the transformation, this paper approximates the OXY plane of the NED frame as the tangent plane at the initial planning point a, effectively treating the local ellipsoid as a flat surface and neglecting the curvature effect on horizontal distances, as illustrated in Figure 3.
Let point b be the point whose elevation is to be calculated. The arc length a b is approximated by the length of segment a b , and Δ D denotes the horizontal distance error:
Δ D = a b a b = R tan β β
where R denotes the Earth’s radius and β denotes the central angle of a b . With R = 6371   km , when D = 100   km , Δ D is merely 8.2111 m, which is negligible compared to D. Given the coordinates of point b in the NED frame, its geodetic coordinates can be rapidly approximated as follows:
l o n b = l o n a + x b / R cos l a t a × 180 / π l a t b = l a t a + y b / R × 180 / π
where l n g a , l a t a denotes the geodetic coordinates of the initial planning point. After obtaining the geodetic coordinates of point b, the local elevation h b is calculated via bilinear interpolation:
l n g 1 = l n g b l n g o / Δ l n g Δ l n g l n g 2 = l n g b l n g o / Δ l n g Δ l n g l a t 1 = l a t b l a t 0 / Δ l a t Δ l a t l a t 2 = l a t b l a t 0 / Δ l a t Δ l a t h 1 = l n g 2 l n g b Δ l n g h l n g 1 , l a t 1 + l n g b l n g 1 Δ l n g h l n g 2 , l a t 1 h 2 = l n g 2 l n g b Δ l n g h l n g 1 , l a t 2 + l n g b l n g 1 Δ l n g h l n g 2 , l a t 2 h b = l a t 2 l a t b Δ l a t h 1 + l a t b l a t 1 Δ l a t h 2
where Δ l n g and Δ l a t are the spatial resolutions of the DEM in the longitude and latitude directions, respectively; l n g 0 , l a t 0 is the coordinate of the lower-left vertex of the DEM grid; and h l n g 1 , l a t 1 , h l n g 1 , l a t 2 , h l n g 2 , l a t 1 and h l n g 2 , l a t 2 are the elevations of the lower-left, upper-left, lower-right, and upper-right neighboring vertices, respectively. l n g b , l a t b denotes the geodetic coordinates of the initial planning point.

2.2.3. Constraints

(1)
Non-holonomic Constraint
Non-holonomic constraints restrict an aircraft’s motion in specific directions; for instance, fixed-wing UAVs cannot move vertically or slide sideways. Some studies [9,10] treat the UAV as a point mass capable of moving in any direction, imposing constraints only on the maximum turning and pitch angles. This approach essentially searches for paths in geometric space and then approximates kinodynamic feasibility via posterior angular constraints. The planning result heavily depends on the search step size: a small step size leads to long computation times, making online application difficult, while a large step size fails to satisfy velocity direction and non-holonomic constraints at the start and goal, resulting in poor path smoothness.
This paper explicitly considers non-holonomic constraints by searching in a high-dimensional state space and utilizing Dubins motion primitives to satisfy the kinodynamic differential equations of the fixed-wing UAV. The aircraft’s state is described in the path coordinate system, and every path point must satisfy Equation (1).
Furthermore, to ensure flight feasibility and safety, the fixed-wing UAV’s state must satisfy the following constraints:
θ min θ k i θ max φ k i φ max R k i R min ,   R min = V 2 / ( g tan φ max )
where θ k i , φ k i and R k i denote the flight path angle, roll angle, and turning radius of the i-th waypoint for UAV k, respectively. θ min and θ max are the minimum and maximum flight path angles. Under the zero angle-of-attack assumption, the flight path angle constraint is equivalent to the pitch angle constraint, typically ranging from −30° to 30°. φ max is the maximum allowable roll angle, typically ranging from 30° to 45°. R min denotes the minimum turning radius. This paper employs Dubins curves satisfying the minimum turning radius constraint as motion primitives, explicitly guaranteeing this constraint during sampling and planning without the need for posterior validation.
(2)
No-Fly Zone Constraints
To ensure flight safety and safeguard designated airspace, UAVs must avoid various no-fly zones (NFZs) during missions. These typically include airport airspace, military restricted areas, densely populated zones, nature reserves, and areas surrounding critical infrastructure. According to regulations such as the Civil Unmanned Aircraft System Air Traffic Management Measures, NFZs are established to prevent potential threats to civil aviation, public safety, and sensitive facilities. Since UAV flight is prohibited throughout the volume of an NFZ, it is modeled as a three-dimensional obstacle extending from the ground to the UAV’s maximum flight altitude.
Based on geometric characteristics, NFZs are classified into cylindrical and polygonal types.
Cylindrical NFZs typically represent circular areas centered on a specific point, such as airport runways, communication towers, or power facilities. Let the center of the cylindrical NFZ be x center , y center and the protection radius be r n f z . The NFZ is defined as
N F Z circle = x , y , z | x x center 2 + y y center 2 r n f z , 0 z h max
where h max denotes the UAV’s maximum flight altitude.
Polygonal NFZs are suitable for representing irregular areas, such as urban built-up zones, industrial parks, and nature reserves. Let a polygonal NFZ be defined by n vertices q 1 , q 2 , , q n ordered counterclockwise, where the j-th vertex has coordinates q j = x j , y j . The NFZ can be represented as
N F Z polygon = x , y , z | x , y Q , 0 z h max
where Q denotes the polygonal region bounded by the vertex sequence q 1 , q 2 , , q n .
Considering all NFZs collectively, let m denote the total number of NFZs in the planning space. Any waypoint p 1 , p 2 , p N along the UAV trajectory must satisfy
p i k = 1 m N F Z k , i 1 , 2 , , N
This constraint ensures the UAV remains within authorized airspace throughout the flight, guaranteeing both flight safety and regulatory compliance.
(3)
Flight Altitude Constraints
In low-altitude environments, UAVs must maintain a sufficient safety margin above the ground to account for terrain variations, turbulence, and positioning errors. For missions requiring proximity to the surface, such as emergency response or geological surveying, an appropriate Above-Ground-Level (AGL) altitude ensures both safety and sensor effectiveness. Therefore, the height of any waypoint in the trajectory x k must exceed the minimum clearance altitude h r e l :
h k i h terrain + h r e l , k = 1 , 2 , , N , i = 1 , 2 , , M
where h k i denotes the altitude of the i-th waypoint for UAV k, and h terrain represents the terrain elevation at that point.
Conversely, while higher altitudes improve obstacle clearance, fixed-wing UAVs are subject to aerodynamic and propulsion limitations that define a service ceiling. Hence, the altitude of any waypoint must not exceed this maximum operational limit h max :
h k i h max , k = 1 , 2 , , N , i = 1 , 2 , , M
(4)
Terminal State Constraints
In cooperative missions, UAVs must typically satisfy specific terminal state constraints upon reaching the target area to facilitate precise operations or formation assembly. For UAV k, the terminal position and attitude must satisfy
p k t e r = x k goal , y k goal , z k goal θ k t e r = θ k goal , ψ k t e r = ψ k goal , φ k t e r = φ k goal
where p k goal denotes the desired terminal position, and θ k goal , ψ k goal , and φ k goal represent the desired terminal flight path angle, heading angle, and roll angle, respectively.
(5)
Time Coordination Constraints
Multi-UAV cooperative missions often require strict temporal coordination. For scenarios such as simultaneous multi-point rescue, time-window delivery, or synchronized monitoring, UAVs must arrive at the target area at specified times or with specific time intervals. Let t f k denote the actual arrival time of the k-th UAV; the time coordination constraint is formulated as
k N 1 t f k + 1 t f k t e x p k = 0 , k = 1 , 2 , , N
where t e x p k represents the desired time difference between UAV k and UAV k + 1. For simultaneous arrival missions, t e x p k is set to 0.

2.2.4. Objective Function

Let T = x k t , k = 1 , 2 , , N denote the set of all UAV trajectories, where x k t represents the trajectory of the k-th UAV. Each waypoint consists of position p, heading angle ψ , and flight-path angle θ . To minimize trajectory length and maximize smoothness, the cost function is defined as
min   J T = k N L x k L x k = ω 1 C o s t Length + ω 2 C o s t Smooth C o s t Length = k N i = 1 M k 1 p k , i + 1 p k , i C o s t Smooth = k N i = 1 M k 1 θ k , i + 1 θ k , i + ψ k , i + 1 ψ k , i
where ω 1 and ω 2 are weighting coefficients. C o s t Length denotes the trajectory length cost, and C o s t Smooth denotes the trajectory smoothness cost. M k is the number of waypoints for the k-th UAV, and p k , i + 1 p k , i represents the Euclidean distance between consecutive waypoints.

3. Methodology

3.1. Overall Framework of Cooperative Trajectory Planning

The problem formulated in Section 2 imposes three competing demands: kinodynamic feasibility, real-time replanning, and multi-agent consistency. These demands collectively motivate the proposed hierarchical RRT*-based framework illustrated in Figure 4. Dubins curves serve as motion primitives to explicitly satisfy Equation (5) throughout the search process. The framework consists of three components: single-UAV trajectory planning, desired flight time consensus, and Dubins detour path adjustment. Time coordination is achieved via an ad hoc data link, and the theoretical convergence guarantee of the consensus process is detailed in Section 3.4.2.
To address the computational intractability inherent in direct 3D planning while preserving solution quality and real-time performance, a hierarchical planning framework is proposed—decomposing the problem into a 2D topological path planning stage followed by a 3D trajectory refinement stage. Notably, in conventional single-UAV trajectory planning, dynamic interactions with neighboring UAVs are often neglected, which can lead to spatial conflicts, especially in dense swarm operations where proximity and maneuver coupling are critical. The proposed hierarchical structure addresses this limitation by separating the planning process into two sequential stages: the 2D path planning phase focuses on obstacle-aware routing and waypoint sequencing in the horizontal plane, while the 3D refinement stage optimizes the altitude profile subject to kinodynamic feasibility constraints, and leverages temporal phasing to resolve potential inter-agent conflicts. Importantly, kinodynamic coupling between horizontal and vertical motion is preserved throughout—the altitude optimization explicitly accounts for the coupling between vertical increments and horizontal arc-lengths to ensure flyability. By exploiting the vertical dimension as an additional degree of freedom for conflict resolution, the framework enhances the robustness of multi-UAV cooperation under dynamic constraints without compromising flight smoothness or mission timeliness.
Within this architecture, each UAV first independently generates a kinodynamically feasible trajectory using the Quick-Dubins-RRT* algorithm. Flight time information is then exchanged via the ad hoc network to achieve consensus on the group’s desired flight time under strongly connected topology. Subsequently, each UAV adjusts its path length based on the consensus time. If environmental disturbances cause time deviations, the consensus process is re-triggered to enable dynamic adaptive coordination. This framework operates without a central node, constituting a distributed cooperative planning structure.

3.2. Quick-Dubins-RRT* Algorithm Based on Offline Dubins Motion Primitives

Within the aforementioned cooperative framework, individual UAVs must rapidly generate feasible trajectories in complex 3D environments that satisfy the non-holonomic kinodynamic constraints of fixed-wing dynamics. To this end, this paper proposes the Quick-Dubins-RRT* algorithm, which leverages a precomputed Dubins motion primitive database. By pre-generating motion primitives that satisfy minimum turning radius and heading continuity constraints, and embedding them into the RRT* expansion process, the algorithm ensures kinodynamic feasibility during tree generation. Furthermore, the state space is modeled to accommodate 3D flight characteristics, achieving a sequentially integrated combination of 2D Dubins curves and longitudinal motion, where the horizontal path is planned first and the vertical profile is subsequently optimized subject to kinodynamic coupling constraints.

3.2.1. Motion-Primitives-RRT*

The Motion-Primitives-RRT* was proposed in [29] to address kinodynamic path planning problems for complex nonlinear systems. Inspired by graph-search methods for state space discretization, it solves Two-Point Boundary Value Problems (TPBVPs) using a precomputed motion primitive database, achieving a favorable balance between real-time computational efficiency and solution quality, along with strong generality. The algorithm follows the classic RRT* framework, with the specific procedure outlined in Algorithm 1:
Algorithm 1. Motion-Primitives-RRT* Algorithm
Input: initial state x 0 , terminal state x f , motion primitives database, i t e r max
Output: trajectory x t
V x 0 ; E Ø
for   i = 1   to   i t e r max   do
      x rand Sample ( )
      x nearest Nearest V , x rand
      x new , p Steer x nearest , x rand , DataBase
      if   CollisionFree p   then
            N near Near V , x new , DataBase
            x parent ChooseBestParent V , N near , DataBase
            V V x new ; E E p
            E Rewire E , N near , DataBase
      endif
endfor
x t = FindWayBack V , E
During the RRT* tree expansion, the algorithm first generates random sampling within the state space. Unlike the classic RRT, which samples from a continuous state space, Motion-Primitives-RRT* samples from a discretized state space based on the resolution of the motion primitive database. The algorithm then finds the nearest node x nearest in the tree to the sample x rand and queries the database to compute the optimal path p and the corresponding new node x new . If p is collision-free, the algorithm identifies neighboring nodes N near within the database range, selects the optimal parent from them, and adds the new node x new and path p to the tree. Subsequently, it rewires the neighbors N near : if connecting a neighbor to the new node x new yields a lower cost, the edge set E is updated.

3.2.2. Dubins Motion Primitive Database

The motion primitive database is built upon the 3D Dubins path model to ensure kinodynamic feasibility. To achieve high-throughput online indexing while minimizing onboard storage overhead, a two-stage optimization strategy is implemented.
First, considering that a fixed-wing UAV in 3D space is characterized by five state variables x , y , z , θ , ψ , a direct offline sampling of this 5D space would trigger the “curse of dimensionality”, leading to prohibitive computational costs and excessive memory requirements. To mitigate this, the longitudinal motion is decoupled from the horizontal components during database construction. The database exclusively stores 2D horizontal Dubins paths, while the longitudinal motion is approximated using a bilinear interpolant model [19,24] during path extraction.
Building upon this dimensionality reduction, the primitive types are specifically pruned. Although conventional Dubins paths consist of six types (RLR, LRL, RSL, LSR, RSR, LSL), long-range fixed-wing missions rarely necessitate the “tight-turn” characteristics of RLR and LRL. By excluding these complex types, the database size is further compressed, and unnecessary branching evaluations during the tree expansion process are eliminated.
The resulting database is constructed offline in MATLAB and stored in a structured binary format. Each entry in the database matrix corresponds to a set of primitives sharing the same terminal grid position, and is represented by a structure containing the initial heading, terminal heading, the discretized 2D Dubins path, and its associated cost.
To demonstrate the time efficiency of the database, Table 1 compares the average computation time for 1000 path queries using three methods: database retrieval, 3D Dubins calculation, and GPOSII. The tests were conducted in Matlab 2022b on an AMD R5 5600U CPU. The results show that database retrieval is an order of magnitude faster than real-time 3D Dubins calculation. In contrast, GPOSII takes over 1 s per calculation and may fail to find a solution, making it unsuitable for online path planning.

3.3. Search Efficiency Enhancement Strategies

To improve online planning efficiency and stability, a set of coordinated acceleration and structural optimization strategies is developed. Rather than being independent heuristic components, these strategies are jointly designed to reformulate the 3D kinodynamic planning process into a more structured and computationally efficient pipeline. Specifically, the sampling space dimensionality reduction (Section 3.3.1) and the ellipsoidal predictive domain (Section 3.3.2) act as complementary spatial filtering mechanisms, effectively constraining the search space and concentrating sampling in high-value regions. This focused exploration facilitates faster convergence toward the goal. In parallel, the dual-mode motion primitive database, cost-constrained pruning (Section 3.3.3), and efficient collision checking (Section 3.3.4) provide computational and structural support for rapid node expansion within the refined space. Through this coordinated design, the proposed strategies form an integrated framework that improves search efficiency while maintaining solution quality.

3.3.1. Sampling Space Dimensionality Reduction

The primary cause of slow convergence in Kinodynamic RRT* is the high dimensionality of the sampling space [30], as the algorithm samples not only positions in Euclidean space but also other state variables such as heading and flight-path angles.
To overcome this, the proposed method leverages the dual-mode primitive framework to implement a multi-layered acceleration strategy. First, during database construction, the longitudinal motion is separated from the horizontal Dubins path and handled independently via a decoupled interpolation model, which significantly reduces the dimensionality of the offline storage space and eases the online sampling burden. Second, by specifically pruning the primitive types (excluding RLR and LRL types unnecessary for long-range missions), the database becomes more compact, enhancing retrieval speed.
To further compress the database size and enhance search efficiency without sacrificing terminal precision, a dual-mode strategy is introduced to regulate the interaction between the RRT* tree and the database: During the expansion phase, the algorithm prioritizes spatial permeability and reachability. By invoking a free-terminal sub-database (Free Mode), the solver ignores specific heading constraints at the sampled nodes. This allows the search tree to permeate through complex environments using the most direct and computationally efficient primitives, significantly accelerating the discovery of an initial feasible solution. Subsequently, during the rewiring phase, the algorithm switches to a fixed-terminal sub-database (Fixed Mode). This ensures that during local topological optimization, newly generated connections strictly satisfy heading continuity and non-holonomic constraints. This dual-mode mechanism ensures that the final trajectory is kinodynamically executable while significantly reducing the dependency on large-scale offline storage, making the online search process both lightweight and agile.
As illustrated in Figure 5, the orientation derived from this optimized primitive aligns more naturally with the local path geometry than a random assignment. This mechanism effectively collapses the active search space from 5D to 3D to bypass the curse of dimensionality, while ensuring that discarded angular constraints are progressively re-satisfied during the Fixed Mode rewiring to maintain global kinodynamic consistency.

3.3.2. Elliptical Prediction Domain

Informed-RRT*, proposed by Jonathan et al. [31], improves upon the traditional RRT* by accelerating convergence through sampling restriction. Instead of sampling the entire space, it limits sampling to a region likely to contain superior paths. After finding the first feasible path, the sampling space is constrained within an ellipse (or ellipsoid in 3D) focused on the start and goal points.
To quantify the sampling efficiency, we define the ellipsoidal heuristic subspace X s u b X free   as the set of all points satisfying the admissible cost constraint. Its Lebesgue measure, denoted by μ X s u b , represents the volume of the viable sampling region. In a d -dimensional Euclidean space (where d = 3 in this study), the volume of this ellipsoidal hyper-body is given by
μ X s u b = v d a b d 1
where v d is the volume of a unit d -ball, while a and b represent the semi-major axis and the identical semi-minor axes, respectively. According to the geometric properties of a prolate hyperspheroid, these axes are determined by the current best path cost c b e s t and the theoretical minimum cost c min (the Euclidean distance between the start and goal):
a = c b e s t 2 ,   b = c b e s t 2 c min 2 2
By substituting these geometric relations into the volume formula, the functional relationship between the sampling measure and the current solution cost can be derived as
μ X s u b = v d 2 d c b e s t c b e s t 2 c min 2 d 1 2
It is evident from this equation that as the algorithm iterates and c b e s t approaches c min , the term ( c b e s t 2 c min 2 ) tends to zero, leading to a rapid contraction of the sampling domain. This reflects the progressive focusing of the sampling distribution from the global configuration space toward a narrow region surrounding the optimal solution.
In the limit,
lim c b e s t c min μ X s u b μ X f r e e = 0
where X f r e e denotes the entire obstacle-free configuration space. This contraction increases the probability of generating samples that can further improve the current solution, thereby empirically accelerating the convergence of the algorithm.
In this work, the semi-axis lengths of the ellipsoid are calculated as follows:
a = 0.5 l b = a 2 0.5 x s x g 2 r = 0.5 i = 1 n 1 h i + 1 h i 2 + i = 2 n 1 h i + 1 2 h i + h i 1 2 h s h g 2
where l is the path length, h is the height of the i-th waypoint, and h s and h g are the heights of the start and goal points, respectively. The sampling procedure incorporating the elliptical prediction domain is outlined in Algorithm 2:
Algorithm 2. Informed-Sample
Input: start point x s , end point x g , the lengths of the three axes a , b , r
Output: sample point x rand
if   a <   then
      x ball SampleUnitBall
      L diag a , b , r
      C RotationMatrix x s , x g
      x center x s + x g / 2
      x rand CL x ball + x center MapRange
else
      x rand Sample MapRange
endif
x rand Rasterize x rand
When sampling within the elliptical domain, a point x ball is first sampled from a unit hypersphere. A rotation matrix L is then computed based on the direction from the start to the goal. Finally, x ball is linearly transformed to obtain the sample point. Since the state space is discrete, the sampled point is gridded according to the motion primitive database resolution before output.

3.3.3. Pruning Strategy

Pruning the RRT* tree accelerates neighborhood searches and reduces redundant parent reselection and rewiring operations, thus decreasing runtime. During parent reselection and rewiring, nodes often exhibit a “ray-shaped clustering” phenomenon, as shown in Figure 6, where numerous nodes share the same parent and have similar headings, differing only slightly in path length. These ray-like paths, originating from a common parent, exhibit high overlap. Consequently, during rewiring, it is difficult to find superior alternatives, resulting in significant computational waste in the later stages of iteration.
Based on the above analysis, the proposed pruning strategy is defined as follows: After expanding a new node x new , a pruning neighborhood is established centered at x new , defined by an ellipsoid with principal axes 2 δ s , δ s , and δ s , where δ s is the pruning radius coefficient. A larger δ s removes more nodes but may degrade path quality if set excessively. For each neighboring node x near N near within this pruning region, a pruning test is conducted. Let h denote the heuristic cost to the goal x g , approximated by the Euclidean distance. If the cost from the start to x new plus the heuristic cost of x new is less than that of x near , x new is considered superior. A flag indicates node status; if set to false, the node is pruned and excluded from subsequent neighborhood computations.

3.3.4. Improved Collision Detection

Collision detection significantly impacts the computational speed of obstacle-avoidance path planning. Conventional methods involve traversing every waypoint and checking all regions, resulting in numerous redundant operations. To reduce the number of checks, this paper proposes a strategy where, upon expanding a new node, the distances from this node to all no-fly and threat zones are calculated. Obstacles within the range of the motion primitive database are marked as potential risks. During collision checks for a path from a node to its successor, only the marked obstacles are verified. Since the number of potential obstacles is typically much smaller than the total in the map, this strategy effectively improves detection efficiency. For distance computation, threat zones are modeled as hemispheres, and polygonal no-fly zones are approximated as cylinders centered at their centroid with a radius equal to the maximum distance from the centroid to any vertex.

3.3.5. Computational Complexity and Efficiency Analysis

To provide a theoretical foundation for the observed efficiency gains, this section analyzes the computational complexity of the proposed Quick-Dubins-RRT* framework from both algorithmic and system perspectives.
The total computational cost per iteration of the proposed Quick-Dubins-RRT* can be decomposed into three main components: nearest-neighbor search, steering (trajectory generation), and collision validation:
T i t e r T nearest + T steer + T collision
where T nearest   represents the nearest-neighbor identification. T steer   represents the trajectory generation. T collision refers to the obstacle-free validation.
The Steer procedure in conventional kinodynamic RRT* planners is typically the most computationally intensive component, requiring the online solution of a Two-Point Boundary Value Problem (TPBVP). For conventional Dubins-RRT*, the need to evaluate six candidate types and discretize the resulting curves online leads to a computational cost T solve that scales with the complexity of the vehicle’s differential constraints. In this work, we achieve a fundamental acceleration by shifting this burden to a precomputed framework. To ensure the database is both kinodynamically complete and physically feasible for onboard systems, a two-stage manifold compression and indexing strategy is implemented:
First, a canonical state-space transformation is applied: Given any start pose and sampled goal pose, we first apply a coordinate translation and rotation to normalize the relative transformation into a canonical frame. This reduces the mapping complexity from a prohibitive global state space O X 2 Y 2 Φ 2 to a localized relative manifold O X Y Φ , effectively collapsing the dimensionality of the required storage.
Furthermore, a Symmetry-based Manifold Compression is implemented. Leveraging the geometric symmetry of fixed-wing UAV flight dynamics, the database only stores motion primitives for the first quadrant of the horizontal plane. Any query in the other three quadrants is mapped to this primary manifold through a sequence of reflection and rotation operators I :
x , y , ψ s , ψ g = I x r e l , y r e l , ψ s , ψ g
where x , y , ψ s , ψ g is the transformed query key used to index the database, ensuring that the retrieved path geometry is consistent with the original relative pose after an inverse symmetric transformation. x r e l , y r e l denotes the relative displacement of the goal node with respect to the start node in the horizontal plane. ψ s and ψ g represent the initial heading of the start node and the terminal heading of the goal node, respectively.
This strategy effectively compresses the physical storage requirement to approximately 25% of the original manifold size. Since the mapping I consists solely of elementary arithmetic and logic operations, it introduces negligible computational overhead, thereby maintaining the O 1 complexity of the steering process.
The computational complexity of the collision validation procedure, denoted as T collision , is synthesized from two decoupled stochastic and physical processes: spatial search reduction and discretized evaluation. In a standard implementation, every waypoint on a candidate path must be checked against the global obstacle set O with cardinality M .
The proposed method employs a spatial decoupling strategy that restricts the search to an active obstacle subset O local O , where O local = m M . This subset is dynamically determined by the bounding box of the current motion primitive. By defining σ as the local obstacle density and V p as the spatial volume swept by the primitive, the number of obstacles m to be considered is
m σ V p
Beyond the reduced search space, the actual computational effort is determined by the point-wise evaluation logic along the trajectory. As the path is discretized into a sequence of waypoints, the number of evaluation points is intrinsically proportional to the path length L path . By introducing the adaptive step parameter K , the algorithm performs only L path / K discrete checks. Consequently, the expected complexity of the validation procedure is formulated as
T collision = O m L path   K
where m represents the count of obstacles in the localized mission space, significantly lower than the global count M used in conventional Dubins-RRT*.
From a theoretical perspective, the asymptotic computational complexity of RRT* is primarily governed by nearest-neighbor and neighborhood queries, which can be performed in O log n time using efficient spatial data structures. Therefore, the overall complexity of constructing a tree with n nodes is O n log n , as established in [16]. In the proposed framework, the dominant per-iteration cost is effectively shifted away from steering and collision validation toward nearest-neighbor operations. Specifically, T steer is reduced to near-constant time via database retrieval, and T collision is significantly reduced through spatial filtering and adaptive discretization, which are either bounded or efficiently approximated. Consequently, the per-iteration complexity is dominated by T nearest , which remains O log n , and the overall asymptotic complexity of the algorithm preserves the O n log n structure of RRT*.
Based on the aforementioned theoretical derivations, the comparison of computational characteristics among different RRT*-based planners is summarized in Table 2.
As shown in Table 2, while all methods share a similar asymptotic complexity dominated by nearest-neighbor search, their practical performance differs significantly due to the computational cost of steering and collision validation. In standard RRT*, the steering operation is computationally inexpensive, but the method does not explicitly account for kinodynamic constraints. In contrast, conventional Dubins-RRT* incorporates non-holonomic constraints at the cost of expensive online TPBVP solving and dense collision checking along curved trajectories. The proposed Quick-Dubins-RRT* addresses these limitations by replacing the online steering process with efficient database retrieval and restricting collision checking to a localized obstacle subset with adaptive discretization. As a result, the dominant computational burden is effectively shifted to nearest-neighbor queries, while the costs associated with trajectory generation and collision validation are significantly reduced. This coordinated design leads to a substantial improvement in practical efficiency, even though the asymptotic complexity remains unchanged.
It is worth noting that the near-constant-time steering complexity discussed above primarily applies to the tree expansion phase, where rapid sampling is critical. In the subsequent altitude optimization stage (Section 3.4.1), limited online motion-primitive reconstruction is performed. However, this process is confined to a sparse set of nodes along the final solution path, and therefore does not affect the overall computational efficiency or asymptotic complexity of the proposed method.

3.4. Altitude Optimization and Temporal Coordination Mechanism

After obtaining a kinodynamically feasible path, a path optimization and time coordination mechanism is designed to enhance path executability and satisfy temporal consistency constraints in multi-fixed-wing UAV cooperative missions. First, gradient-based optimization is employed to smooth the path altitude, improving flight stability. Second, a Dubins-detour-based time adjustment strategy is introduced. By controlling the detour radius and number of loops without compromising kinodynamic feasibility, this strategy enables flexible regulation of flight time.

3.4.1. Altitude Optimization Based on Gradient Descent

Sampled 3D paths may exhibit excessive altitude fluctuations, so gradient descent is employed to optimize the path altitude. The core idea is to adjust the altitude of each waypoint along the gradient direction, defined by the sum of a smoothing gradient f c and a terrain repulsion gradient f r :
Δ z i = α f c + f r
where α is the learning rate.
The smoothing gradient is defined as
f c = k c z i 1 z i d i 1 + z i + 1 z i d i
where z i is the altitude of waypoint i, d i is the horizontal distance between waypoint i and i + 1, and k c is the smoothing coefficient.
To ensure the minimum ground clearance, the terrain repulsion gradient is defined as
f r = k r z i h t i d safe , z i < h t i + d safe 0 , z i h t i + d safe
where h t i is the local terrain elevation at waypoint i and d safe is the safety distance. Multiple iterations effectively reduce unnecessary altitude variations, as shown in Figure 7.
Although the altitude optimization in Equations (23)–(25) is performed on the z-axis, the kinodynamic feasibility regarding the maximum flight path angle θ max is strictly maintained through a primitive-constrained update mechanism.
During each gradient descent iteration, to ensure a stable and incremental deformation of the trajectory, a conservative gradient step size α ( α = 0.1 ) is selected, which prevents the optimization from overshooting the feasible kinodynamic manifold in a single iteration. And the candidate altitude z i ( k + 1 ) = z i ( k ) + α J is not directly committed. Instead, it is passed to the 3D Dubins generator as a boundary condition. As defined in our motion primitive logic, the generator explicitly evaluates the coupling between the vertical increment Δ z and the horizontal arc-length L 2 D . The update is accepted if and only if
| Δ z | L 2 D tan θ max
If the gradient step α attempts to push the waypoint beyond this kinodynamic manifold, the segment is flagged as infeasible and the update is discarded. This ensures that the smoothing process converges to a profile that is inherently flyable for fixed-wing UAVs, effectively preventing excessive climb or dive rates.

3.4.2. Dubins Detour Strategy for Temporal Coordination

Common temporal coordination constraints in UAV swarm missions include simultaneous arrival and sequential arrival. The prerequisite for achieving these constraints is a consistent time reference across the distributed nodes. Fundamentally, this distributed coordination is a consensus-seeking process over a networked multi-agent system.
According to the foundational framework established by Olfati-Saber et al. [32], the estimated time of arrival (ETA) τ k is expected to converge to a synchronized value if the communication graph G is connected. The existence of a simple zero eigenvalue of the graph Laplacian L provides a sufficient condition for consensus in continuous-time settings.
To account for practical communication constraints such as discrete-time updates, switching topologies, and intermittent packet loss, we further refer to the results of Ren and Beard [33]. Their analysis indicates that consensus can still be achieved if the union of the interaction graphs over a bounded time interval [ t , t + T ] contains a spanning tree. Under the adopted broadcast protocol, where each UAV periodically transmits its state at a fixed frequency, the time-accumulated communication graph is expected to satisfy this condition in practice, thereby supporting the convergence of the distributed time coordination process.
Based on this theoretically stable consensus, the specific coordination parameters are calculated as follows:
Let v k denote the flight speed of UAV k, l e n k the length of its initial path, and Δ l e n k the detour length introduced by the Dubins maneuver. For simultaneous arrival, Δ l e n k is calculated as
Δ l e n k = t max v k l e n k , k = 1 , 2 , , N
where t max is the estimated flight time of the slowest UAV in the group.
For sequential arrival, the arrival time of the first UAV, t first , is determined first. Each UAV then calculates its estimated time based on t first . Assuming the arrival sequence is pre-defined and the time interval for UAV k relative to the first UAV is t d k , t first is calculated as
t a = min t j + t d k l e n k / v k , k = 1 , 2 , , N t f i r s t = t j , t d i f min 0 t j t d i f min , t d i f min < 0
where t j denotes the flight time of the UAV designated to arrive first, and t d i f min represents the minimum difference between the expected and estimated arrival times when setting t j as t first . Once t first is determined, Δ l e n k is calculated as
Δ l e n k = t f i r s t + t d k / v k l e n k
After obtaining the required detour length Δ l e n k , the path is adjusted via lateral or holding maneuvers. The adjustment first modifies the straight segments of the 2D Dubins path to tune the range, followed by kinodynamic extension to 3D space. When UAV k performs a detour on the i-th Dubins segment, the adjustment length Δ L of the straight segment is calculated as
Δ L = l e n i + Δ l e n k 2 z E z S 2 l e n i 2 z E z S 2
where l e n i is the length of the i-th 3D Dubins path. Based on the value of Δ L , the detour strategy is determined.
This paper employs two strategies for the straight segments: holding detours and lateral maneuvers.
Holding detours are suitable for cases where Δ L 2 π r min , with no upper limit on path length adjustment. The number of holding loops and turning radius are calculated as
n = Δ L / 2 π r min
R = Δ L / 2 π n
Lateral maneuvers are employed for smaller Δ L . To analyze the adjustable range of path length for lateral maneuvers, a coordinate system is established as shown in Figure 8, where the straight segment of the Dubins path coincides with the OX axis and its midpoint is the origin. Let R be the turning radius during the detour, 2a the length of the straight segment, and B 0 , b the coordinates of the detour point.
The lateral maneuver path is symmetric about the OY axis; hence, only the right half is analyzed. Let s 1 and s 2 be the endpoints of the straight segment of the right half. With the construction lines shown in Figure 8, the length L 1 of segment s 1 s 2 is
L 1 = a 2 + b 2 R 2 4 R 2 = a 2 + b 2 4 b R
The length L 2 of the arc B s 1 is
L 2 = R arctan 2 R L 1 + arctan b 2 R a
Thus, the total detour length is L total = 2 L 1 + 4 L 2 .
To ensure L 1 is real, a 2 + b 2 4 b R > 0 . Let f b = a 2 + b 2 4 b R ; f b reaches its minimum a 2 4 R 2 at b = 2 R . Therefore, when a > 2 R , lateral maneuvers can achieve detours of any length. When a < 2 R , f b has two zeros:
b 0 = 2 R ± 4 R 2 a 2
In this case, b is only valid within the interval 0 , 2 R 4 R 2 a 2 . When b = 2 R 4 R 2 a 2 , L 1 = 0 , and the detour path consists of three circular arcs with a total length of
L totalmax = 4 R π 2 arctan 4 R 2 a 2 a
Hence, for a < 2 R , the adjustable path length range is 0 , L totalmax 2 a . Since the detour length depends monotonically on both b and R, fixing one parameter allows us to solve for the other via binary search to satisfy Δ L = L total 2 a . For long detours, the UAV must perform this maneuver over multiple Dubins path segments.
To complement the geometric construction of the detour strategies described above, we further provide a qualitative analysis of the convergence behavior of the temporal coordination process.
Define the path length mismatch for UAV k at iteration t as
e k ( t ) = L k t L target
where L k t denotes the cumulative path length after detour adjustments, and L target is the synchronized reference path length determined by the coordination strategy.
In the proposed adjustment framework, the total required compensation is iteratively distributed across multiple Dubins segments. The detour length increment Δ L k i generated on each Dubins segment (denoted as Δ L in the previous derivations) is interpreted here as an incremental adjustment in an iterative error reduction process. For each segment, a feasible detour Δ L k i leads to the update:
e k t + 1 = e k Δ L k i
Since the detour length is a monotonic function of the geometric parameters (as established in the preceding derivations), each feasible adjustment contributes to reducing the path length mismatch. Although feasible detours may not exist for every segment, due to environmental constraints, the iterative allocation mechanism ensures that the error is progressively reduced over the trajectory. This provides a practical convergence indication for the temporal coordination process, linking the geometric feasibility of the detour strategies with the overall synchronization objective.
To ensure that the insertion of detour maneuvers does not compromise flight safety, a Safety-Embedded Verification mechanism is integrated into the coordination process.
For each candidate detour generated by adjusting geometric parameters, the algorithm explicitly performs a collision validation before acceptance. Specifically, let P detour   denote the spatial footprint of the candidate maneuver. The maneuver is accepted if and only if it satisfies
P detour   O local = Ø
where O local represents the union of all local environmental constraints, including no-fly zones (NFZs) and terrain obstacles derived from the Digital Elevation Model (DEM).
This verification is executed in real time for every detour candidate, ensuring that any inserted holding or lateral maneuver is explicitly checked against the environment before being incorporated into the trajectory. If a candidate detour is found to be infeasible due to collision risk, it is immediately discarded, and the algorithm attempts to allocate the remaining path length compensation to subsequent segments with more favorable spatial conditions. In cases where no feasible adjustment can be found, the original trajectory is preserved locally. As a result, the proposed strategy guarantees that temporal coordination is achieved without introducing unsafe trajectory segments. Collision avoidance is strictly prioritized over time synchronization, ensuring that the expanded footprint of any detour maneuver does not intersect with prohibited regions.

3.5. Algorithmic Procedure and Computational Complexity Analysis

Integrating the single-agent rapid planning algorithm, efficiency enhancement strategies, and temporal coordination mechanism, the complete execution flow of the Cooperative-Quick-Dubins-RRT* cooperative 3D path planning algorithm is outlined in Algorithm 3. The algorithm sequentially undergoes three phases: online rapid planning, altitude smoothing optimization, and Dubins-detour-based temporal coordination.
In the online rapid planning phase, each UAV independently executes the Quick-Dubins-RRT* iteration enhanced with four acceleration strategies. During sampling, besides the conventional goal-biasing strategy [34], a goal-acceleration strategy is added to Algorithm 2: if the newly generated node x new from the previous iteration is non-empty and satisfies h x new < h parent x new , the algorithm directly targets the goal with x new as the nearest neighbor, enabling continuous tree extension toward the goal in obstacle-free directions; otherwise, Algorithm 2 is invoked for resampling within the ellipsoidal domain. During tree expansion, instead of directly connecting x new to its nearest neighbor, the optimal parent is selected from the neighbor set N near . A lookup table from DataBaseVF is used if the sample target is not the goal; otherwise, a 3D Dubins path is computed in real-time. Rewiring utilizes table lookup from DataBase to ensure global kinematic continuity. Rewiring and pruning are executed sequentially within the same neighborhood loop. Upon meeting the termination condition, FindWayBack is called to backtrack and generate the node list.
In the altitude smoothing optimization phase, PathHSmooth is invoked with nodelist as input. Gradient descent iterations are applied to the altitude of each waypoint: the smoothing gradient f c equalizes climb rates across adjacent segments, while the terrain repulsion gradient f r ensures the minimum ground clearance d safe . Each candidate altitude update is validated against the kinodynamic feasibility constraint | Δ z | L 2 D tan θ max before being committed, ensuring the smoothed profile remains flyable for fixed-wing UAVs. The output is the smoothed coordinate matrix pathXYH.
In the temporal coordination phase, UAVs broadcast their estimated flight times via an ad hoc data link. After reaching consensus on the expected arrival time, each UAV calculates its required path extension, allocated proportionally based on the length of each Dubins straight segment. If the required extension per segment satisfies Δ L 2 π r min , a holding detour is applied with no upper limit on adjustment range; otherwise, a lateral maneuver is employed for smaller adjustments. AdjUAVpathLength is then called to insert the detour segments, outputting the final path PathLLH.
Algorithm 3. Cooperative-3D-Quick-Dubins-RRT*
Input: Initial state x 0 , terminal state x f , maximum iterations i t e r max , extension step size step, pruning neighborhood parameter δ s , motion primitive database without terminal constraints DataBaseVF, motion primitive database with terminal constraints DataBase
Output: 3D path x ( · )
V x 0 ; E Ø ; a
for   i = 1   to   i t e r max   do
    if   isEmpty x new   then
          x rand x f
          x nearest x new
    e l s e
          x rand Sample ( x s , x g , a , b , r )
          x nearest Nearest V , x rand
    e n d i f
    i f   x rand x nearest > range DataBaseVF   t h e n
          x new x rand + s t e p × x nearest x rand / x nearest x rand
    e l s e
          x new x rand
    e n d i f
    i f   isFeasible x new   t h e n
          x parent , p ChooseBestParent V , N near , DataBaseVF
          if CollisionFree p   then
              V V x new ; E E p
              E Rewire E , N near , DataBase
              E Prune x new , N near , δ s
              if isEqual x new , x f | |   FindBetterPath   then
                   a , b , r Getabc x ·
              e n d i f
          e n d i f
    e n d i f
e n d f o r
nodelist = FindWayNode V , E
PathXYH PathHSmooth nodelist
PathLLH AdjUAVpathLength PathXYH
x ( · ) = PathLLH

4. Simulation and Experimental Results

4.1. Multi-UAV Cooperative Path Planning Simulation

4.1.1. Scenarios and Parameter Settings

The proposed path planning algorithm is simulated based on a cooperative delivery mission. Multiple UAVs depart from their respective distribution stations, traverse a complex terrain area, and converge at a designated assembly point. The flight environment includes mountainous terrain, polygonal no-fly zones (e.g., nature reserves, airport clear zones), and threat zones (e.g., electromagnetic interference areas near communication base stations). The delivery assembly point is fixed, and docking positions are preplanned.
The constant-speed assumption is adopted throughout the simulations to reflect the typical cruise phase of long-range fixed-wing UAV operations, where maintaining an optimal airspeed is essential for endurance efficiency. It is worth noting that the proposed framework can be extended to variable-speed scenarios by augmenting the motion primitive database with additional velocity layers, which will be investigated in future work. Across the four experimental subsections, different speed values are selected to match the specific requirements of each evaluation context: the main cooperative planning simulation in Section 4.1 adopts 140 m/s to reflect the realistic flight envelope of long-range fixed-wing UAVs; the parameter sensitivity analysis in Section 4.2 uses 15 m/s to provide finer spatial resolution in the small-scale 2D test environment; and both the comparative experiment in Section 4.3 and the HIL validation in Section 4.4 adopt 30 m/s to match the flight envelope of the onboard flight controller used in the hardware platform. Since the core mechanisms of the proposed algorithm—database indexing, dimensionality-reduced sampling, ellipsoidal heuristic domain, pruning, and localized collision detection—operate on the kinematic state space defined by heading angles and turning radius constraints rather than on absolute speed values, the observed performance improvements are expected to generalize across different speed settings.
UAV performance parameters are listed in Table 3. Initial states for path planning are given in Table 4, desired terminal states in Table 5, obstacle parameters in Table 6, and Cooperative-Quick-Dubins-RRT* algorithm parameters in Table 7.

4.1.2. Results and Analysis

Figure 9 presents the 3D spatial trajectories of the six UAVs in the cooperative path planning scenario, while Figure 10 illustrates the corresponding flight altitude profiles. As shown in Figure 9, all six UAVs successfully generate feasible paths that avoid no-fly and threat zones. The overall trajectories remain smooth after adjustment via Dubins detours. Figure 10 demonstrates that the altitude profiles adapt smoothly to the terrain variations while strictly maintaining the minimum ground clearance, with no abrupt height changes. This validates the effectiveness of the gradient-descent-based altitude optimization.
Figure 11 presents the 2D path planning results before and after detour adjustment for the six UAVs. Figure 11a shows the original paths generated by individual planning. All paths successfully avoid no-fly and threat zones while satisfying kinodynamic constraints. However, due to dispersed starting positions and varying terrain, the path lengths exhibit significant differences. Figure 11b illustrates the results after Dubins detour adjustment. By inserting holding or lateral maneuver segments into the Dubins straight-line paths, each UAV compensates its range. Consequently, the path length error among UAVs is reduced to within 0.05 km (as shown in Table 8), which is within the acceptable range. The adjusted paths maintain excellent overall smoothness, validating the effectiveness of the proposed temporal coordination mechanism.

4.1.3. Simulation and Analysis in Dynamic Scenarios

To evaluate the algorithm’s adaptability to dynamic environments, four dynamic simulation scenarios are designed based on the configuration in Section 4.1.1. In each scenario, UAVs have completed initial cooperative planning and flown for a period before dynamic events are triggered. Upon triggering, UAVs restart local planning and cooperative adjustment from neighboring nodes near their current positions.
(1)
Scenario 1 (Delayed Coordination): After independently completing shortest-path planning, all six UAVs trigger the coordination mechanism simultaneously at 100 s to adjust the remaining segments via Dubins detours.
(2)
Scenario 2 (Team Scale Reconfiguration): At 200 s, UAV 1 exits the formation (e.g., due to signal loss), while UAV 7 and UAV 8 join from new start points, triggering a team-scale expansion.
(3)
Scenario 3 (Target Reallocation): At 200 s, UAV 1’s delivery target is reassigned to a new location, and UAV 2 and UAV 6 swap targets. The three affected UAVs trigger local replanning, while the others maintain their original segments. The fleet then performs coordinated detours.
(4)
Scenario 4 (Global Map Switching): At 200 s, the layout of no-fly and threat zones undergoes a global update. The algorithm automatically detects collisions between remaining paths and the new map. UAVs with conflicts trigger online replanning, while conflict-free UAVs maintain their paths. The fleet then performs unified coordination.
Figure 12 presents the simulation results for the four dynamic scenarios. The dashed lines indicate historical paths before the dynamic events, while the solid lines represent the final trajectories after replanning and coordination. The white circles mark the neighboring nodes, which serve as the new planning start points.
For Scenario 1, as shown in Figure 12a, after delaying coordination initiation by 100 s, all UAVs successfully converge their arrival times through Dubins detours. This demonstrates the flexibility of the coordination mechanism, which is insensitive to initiation timing.
For Scenario 2, in Figure 12b, after UAV 7 and UAV 8 join the formation, the remaining five UAVs reconfigure their paths from their respective neighboring nodes. The updated formation of seven UAVs successfully completes coordinated detours under a new time baseline, generating kinodynamically feasible and collision-free trajectories. This validates the framework’s adaptability to dynamic team expansion.
For Scenario 3, illustrated in Figure 12c, after UAVs 1, 2, and 6 complete local replanning from their neighboring nodes, they merge back into the unified coordination process. The new trajectories exhibit smooth transitions and remain collision-free, verifying the compatibility between local replanning and global coordination under target reallocation.
For Scenario 4, as depicted in Figure 12d, upon global obstacle switching, the algorithm automatically identifies affected UAVs and triggers online replanning. The replanned paths successfully avoid the new no-fly and threat zones, while unaffected UAVs maintain their original segments to participate directly in coordination. This validates the algorithm’s real-time responsiveness to sudden environmental changes.

4.2. Parameter Sensitivity Analysis

The discretization resolution, sampling range, and neighborhood pruning range of the motion primitive database significantly impact the performance of the Quick-Dubins-RRT* algorithm. This section investigates the effects of these parameters on solution quality and computational efficiency.
The 2D test scenario is illustrated in Figure 13a. The UAV speed is set to 15 m/s with a maximum bank angle of 30°. The start point is (50, 70) with an initial heading of 0°, and the goal point is (480, 260) with a goal heading of −90°. All results are averaged over 100 runs.

4.2.1. Effect of Motion Primitive Resolution

With a fixed sampling range of 100 m and a pruning range of 0 m, three databases with different resolutions are tested, featuring grid resolutions of 5 m, 10 m, and 20 m, and heading angle discretizations of 15°, 22.5°, and 45°, respectively. The path length and runtime versus iteration curves are shown in Figure 14.
It can be observed that lower database resolution leads to a faster increase in runtime as iterations grow. This is because lower grid resolution results in a smaller discrete state space, causing extensive repeated sampling and sampling failures after a certain number of iterations, thereby increasing the sampling overhead. The average path lengths for different resolutions are similar at the same iteration count; however, higher-resolution databases yield slightly better paths as iterations progress. Therefore, increasing the discretization resolution of the motion primitive database is consistently beneficial for enhancing both solution quality and computational efficiency.

4.2.2. Effect of Motion Primitive Sampling Range

The sampling range of the database determines the maximum extension step and node neighborhood size. With a fixed heading discretization of 15°, a grid resolution of 5 m, and zero pruning range, four databases with sampling ranges of 50 m, 75 m, 100 m, and 150 m are tested. Figure 15 plots the path length and runtime against iterations. Larger sampling ranges result in longer runtimes for the same number of iterations. However, significant time differences only emerge at high iteration counts or with excessively large ranges. Path length is significantly affected: larger ranges enable faster convergence to the optimal path. This is because larger extension steps accelerate tree growth toward the goal, while larger neighborhoods allow more rewiring opportunities, improving path quality. Although larger ranges slightly increase computation time, they greatly enhance path quality.
Note that the sampling range should not be increased blindly. Excessively large ranges drastically increase rewiring time and database storage requirements. Therefore, the sampling range must be selected by comprehensively considering runtime, convergence speed, and memory usage, choosing an appropriate value that keeps runtime acceptable.

4.2.3. Effect of Pruning Neighborhood Size

Although pruning the RRT accelerates computation, excessive pruning degrades solution quality. With a fixed sampling range of 100 m, heading discretization of 15°, and grid resolution of 5 m, tests are conducted with pruning parameters δ s of 0 m, 10 m, 20 m, and 30 m. Figure 16 shows the path length and runtime versus iterations. Larger pruning ranges result in shorter runtimes and slower runtime growth as iterations increase. However, the final converged path length gradually increases with the pruning range. Small pruning ranges can accelerate computation without significantly affecting path quality, whereas large ranges noticeably degrade it. Setting the pruning range to 1–2 times the grid resolution achieves a good balance between computational speed and path quality.

4.3. Comparison with Conventional Dubins-RRT*

The proposed Quick-Dubins-RRT* is a systematic enhancement of the conventional Dubins-RRT* at the search strategy level, as both share the same kinodynamic modeling framework (Dubins curves) and tree expansion mechanism. This choice of baseline directly isolates and quantifies the marginal contribution of the proposed acceleration strategies under strictly controlled variables. Therefore, this section compares the Quick-Dubins-RRT* algorithm with the traditional Dubins-RRT* in a dense 2D obstacle environment (Figure 13). The UAV speed is set to 30 m/s with a maximum bank angle of 30°. The start point is (1000, 400) with an initial heading of 90°, and the goal point is (4760, 4200) with a goal heading of 0°. All results are averaged over 100 runs.
Figure 17 compares the path length and runtime versus iterations for both algorithms. Quick-Dubins-RRT* demonstrates significant improvements in both solution quality and computational efficiency. After 1000 iterations, the average runtime is reduced by 86.04%, and the path length is shortened by 8.928%, indicating that Quick-Dubins-RRT* is highly suitable for online path planning. Furthermore, as shown in Figure 18, the average time to find the first feasible path is reduced by 97.34%, with a path length reduction of 22.63%. Quick-Dubins-RRT* also exhibits a much lower variance in computation time, demonstrating superior solution stability.

4.4. Hardware-in-the-Loop Simulation Validation

Due to the inherent constraints of real-world flight testing for fixed-wing UAVs, such as limited testing conditions, high operational costs, and safety considerations, it is challenging to directly conduct large-scale outdoor experiments for swarm trajectory planning validation. To address this issue, a hardware-in-the-loop (HIL) simulation framework is adopted to provide a controllable yet realistic evaluation environment. By integrating an onboard flight controller with a high-fidelity flight dynamics simulation module, the system is capable of reproducing key characteristics of real-world UAV flight, including motion dynamics, control responses, and environmental interactions. This enables the experimental setup to approximate real flight conditions as closely as possible within a laboratory environment.
To further verify the engineering feasibility and real-time execution capabilities of the proposed path planning algorithm, we have established a hardware-in-the-loop (HIL) simulation system that integrates virtual and real components. This integrated simulation framework comprises several key modules: swarm mission planning software, a hardware-in-the-loop system, a virtual simulation system, a hybrid real-virtual mobile ad hoc network (MANET) communication system, and a dynamic scenario visualization system. These modules perform real-time, online data exchange to collectively constitute the integrated hardware-in-the-loop simulation platform.
The hardware-in-the-loop simulation procedure is illustrated in Figure 19.
The functional modules of the proposed system are described as follows:
(1)
The swarm mission planning software provides functionalities including UAV scenario configuration, human–machine interaction for command generation, construction of swarm-level command sets, loading of preplanned waypoints, and visualization of two-dimensional operational trajectories of the UAV swarm.
(2)
The hardware-in-the-loop (HIL) simulation system is constructed based on key components such as an intelligent mission planner, a flight control module, an ad hoc communication datalink, and flight dynamics simulation software. This subsystem enables real-time hardware-based simulation of UAV mission planning and autonomous decision-making processes, serving as a core architecture for the integrated virtual–physical simulation framework.
(3)
The virtual simulation system is developed based on digital simulation techniques, incorporating a fixed-wing UAV six-degree-of-freedom (6-DOF) model as well as trajectory tracking and control models. It enables online validation of UAV behaviors in a purely simulated environment and operates in conjunction with the HIL subsystem to support real-time hybrid simulation of swarm operations.
(4)
The hybrid ad hoc communication simulation system is established using both physical ad hoc datalink hardware and a corresponding simulated communication module. By designing appropriate communication strategies, this system forms a unified communication network for the hybrid simulation framework, ensuring reliable data exchange between the virtual and HIL subsystems during cooperative UAV simulations.
(5)
The flight dynamics simulation software provides the flight control module with simulated environmental inputs, including atmospheric conditions and sensor disturbances. This enables realistic emulation of UAV flight processes within a laboratory environment, effectively approximating real-world operating conditions.
(6)
The ground control station primarily features swarm visualization for trajectory display, a graphical interface for flight attitude, and the reception of cluster simulation data. It enables 2D visualized simulation monitoring and telemetry of UAV trajectories and attitude data.
(7)
The dynamic visualization and simulation system supports performance evaluation of swarm mission execution, synchronization of simulation scenarios, and three-dimensional visualization of UAV trajectories. It facilitates comprehensive assessment of swarm operational effectiveness and enables intuitive visualization of coordinated UAV behaviors within the hybrid simulation environment.
The HIL test scenario is configured in accordance with the settings described in Section 4.1.1. During the simulation, the mission platform first transmits no-fly zone and threat zone information to the trajectory planner, followed by task assignment for formation assembly at designated locations. The resulting UAV trajectories, as shown in Figure 20, demonstrate a line-abreast formation executing the cooperative delivery mission.
To evaluate the trajectory tracking performance of the proposed planning framework under realistic onboard flight control constraints, HIL validation experiments are conducted.
A total of six UAV nodes are deployed in the hybrid simulation environment, including three physical nodes and three virtual nodes. The onboard flight controller of the physical UAVs operates within a velocity range of [25, 40] m/s, with a position error not exceeding 5 m, a heading error within 1.5°, and an attitude error within 0.5°. The control system provides a data update frequency of no less than 200 Hz.
In terms of communication, the system adopts a broadcast mechanism based on the User Datagram Protocol (UDP), integrated with serial interfaces for multi-source data interaction. This design supports periodic state broadcasting and synchronous updates among multiple UAV nodes, meeting the requirements of low-latency communication and distributed coordination. The maximum inter-UAV communication range is approximately 10 km.
To ensure compatibility with onboard flight constraints, the experimental scenario is configured according to the flight envelope of the UAV. Specifically, the cruising speed is set to 30 m/s, and the maximum bank angle is limited to 30°. In addition, the mission area is scaled to approximately one-third of the original scenario while preserving the obstacle topology and constraint structure, thereby maintaining both hardware feasibility and task representativeness.
Under this setting, the flight controller receives the planned waypoints and executes trajectory tracking in a closed loop. Figure 21 presents the time histories of the UAV state variables output by the onboard controller, including longitude, latitude, altitude, airspeed, roll angle, pitch angle, and yaw angle.
As shown in Figure 21, the onboard flight controller responds to the planned waypoint sequence throughout the mission, confirming that the trajectories generated by the proposed Quick-Dubins-RRT* algorithm are executable by a real flight control system. It should be noted that, prior to uploading to the flight controller, the planned path undergoes a Douglas–Peucker (DP)-based re-discretization process to ensure stable tracking performance. This intermediate step introduces two observable effects: first, the altitude profile exhibits a stepwise variation pattern, as each discretized waypoint transition triggers a new altitude command rather than a continuously smooth climb or descent; second, the realized arrival times of the six UAVs (1405.71 s, 1411.32 s, 1413.36 s, 1403.73 s, 1406.67 s, and 1407.96 s, respectively) deviate slightly from the pure algorithmic simulation results, since the fidelity of the DP discretization directly affects the conformity of the actual flight trajectory to the originally planned path. Nevertheless, all six UAVs complete the mission within a narrow time window with a maximum inter-arrival deviation of approximately 9.63 s, demonstrating that the proposed cooperative planning framework retains satisfactory temporal coordination capability under real onboard execution conditions, and demonstrating its strong engineering applicability.
However, it should be noted that the HIL framework cannot fully capture all uncertainties present in real-world flight environments, such as complex atmospheric disturbances, unmodeled aerodynamic effects, and sensor noise variability.
Nevertheless, as an intermediate validation stage between pure simulation and real-world deployment, the HIL system provides a reliable and controllable platform for evaluating trajectory tracking performance and multi-UAV coordination under realistic system constraints. Therefore, the experimental results can be considered strong evidence of the practical feasibility of the proposed method.

5. Conclusions

In this paper, a cooperative online 3D path planning framework, centered on the Cooperative-3D-Quick-Dubins-RRT* algorithm, is proposed to address the real-time kinodynamic requirements of fixed-wing UAV swarms in complex environments. By integrating an offline dual-mode motion primitive database with four acceleration strategies—sampling space dimensionality reduction, elliptical prediction domain, pruning, and improved collision detection—the algorithm achieves a significant leap in computational efficiency.
The effectiveness and online planning capability of the proposed algorithm are validated in a baseline scenario involving six UAVs for cooperative delivery, as well as in four dynamic scenarios: delayed coordination, mission scale reconstruction, target reallocation, and global map switching. Parameter sensitivity analysis demonstrates that appropriate increases in discretization accuracy and sampling range, combined with a pruning range set to 1–2 times the grid resolution, yield optimal algorithmic performance. Comparative experiments against the conventional Dubins-RRT* algorithm show that the proposed method reduces average runtime by 86.04% and shortens the first feasible path length by 22.63%, confirming its suitability for real-time online trajectory planning. Hardware-in-the-loop simulations further validate the engineering applicability of the proposed method: all six UAVs successfully completed the cooperative mission within a maximum inter-arrival deviation of 9.63 s under real onboard flight control constraints, demonstrating reliable temporal coordination capability in realistic execution conditions.
Although the proposed framework assumes a constant-speed cruise flight model, future work will investigate trajectory planning for variable-speed flight by incorporating tangential overload constraints and applying differential flatness theory to optimize the UAV velocity profile, thereby further enhancing the algorithm’s applicability to a broader range of operational scenarios.

Author Contributions

Conceptualization, Y.N. and D.Z.; methodology, Y.N., C.L., X.Z. and D.Z.; software and validation, X.Z. and C.L.; writing—original draft preparation, X.Z. and C.L.; writing—review and editing, Y.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The result data supporting the findings of this study are available at https://github.com/foolchicken/Collaborative-online-3D-path-planning-for-fixed-wing-UAVs/tree/main (accessed on 17 November 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
UAVUnmanned Aerial Vehicle
RRTRapidly exploring Random Tree
PRMProbabilistic Road Map
PSOParticle Swarm Optimization
TPBVPTwo-Point Boundary Value Problem
ACOAnt Colony Optimization
SCASLSine Cosine optimization Algorithm with Self-learning strategy and Levy flight
DEMDigital Elevation Model
GISsGeographic Information Systems
HILHardware-in-the-Loop
OSGOpen Scene Graph
UDPUser Datagram Protocol

References

  1. Duan, H.B.; Qiao, P.X. Pigeon-inspired optimization: A new swarm intelligence optimizer for air robot path planning. Int. J. Intell. Comput. Cybern. 2014, 7, 24–37. [Google Scholar] [CrossRef]
  2. Zeng, Z.; Sammut, K.; Lian, L.; He, F.; Lammas, A.; Tang, Y. A comparison of optimization techniques for AUV path planning in environments with ocean currents. Robot. Auton. Syst. 2016, 82, 61–72. [Google Scholar] [CrossRef]
  3. Hota, S.; Ghose, D. Optimal path planning for an aerial vehicle in 3D space. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010; pp. 4902–4907. [Google Scholar]
  4. Qi, Z.; Shao, Z.; Ping, Y.S.; Hiot, L.M.; Leong, Y.K. An Improved Heuristic Algorithm for UAV Path Planning in 3D Environment. In Proceedings of the 2nd International Conference on Intelligent Human-Machine Systems and Cybernetics, Nanjing, China, 26–28 August 2010; pp. 258–261. [Google Scholar]
  5. Uras, A. Path planning on a cuboid using genetic algorithms. Inf. Sci. 2008, 178, 3275–3287. [Google Scholar] [CrossRef]
  6. Miller, B.; Stepanyan, K.; Miller, A.; Andreev, M. 3D path planning in a threat environment. In Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, Orlando, FL, USA, 12–15 December 2011; pp. 6864–6869. [Google Scholar]
  7. Pehlivanoglu, Y.V. A new vibrational genetic algorithm enhanced with a Voronoi diagram for path planning of autonomous UAV. Aerosp. Sci. Technol. 2012, 16, 47–55. [Google Scholar] [CrossRef]
  8. Yu, J.; Wu, X.; Jiang, A.; Yong, E. Research on UAV path planning method based on the multi-precision planning windows. Syst. Eng. Electron. 2024, 46, 1767–1776. [Google Scholar]
  9. Ren, M.; Huo, X. Real time path planning method for unmanned aerial vehicles based on asynchronous dual precision rolling window. Sci. China Inf. Sci. 2010, 40, 561–568. [Google Scholar]
  10. Zhou, Y.; Su, Y.; Xie, A.; Kong, L. A newly bio-inspired path planning algorithm for autonomous obstacle avoidance of UAV. Chin. J. Aeronaut. 2021, 34, 199–209. [Google Scholar] [CrossRef]
  11. Wu, J.; Wang, H.; Wang, Y.; Liu, Y. UAV Reactive Interfered Fluid Path Planning. Acta Autom. Sin. 2023, 49, 272–287. [Google Scholar] [CrossRef]
  12. Yao, P.; Wang, H.; Su, Z. UAV feasible path planning based on disturbed fluid and trajectory propagation. Chin. J. Aeronaut. 2015, 28, 1163–1177. [Google Scholar] [CrossRef]
  13. LaValle, S.M. Rapidly-Exploring Random Trees: A New Tool for Path Planning; Technical Report; Iowa State University: Ames, IA, USA, 1998. [Google Scholar]
  14. Kavraki, L.E.; Svestka, P.; Latombe, J.C.; Overmars, M.H. Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces. IEEE Trans. Robot. Autom. 1996, 12, 566–580. [Google Scholar] [CrossRef]
  15. Yang, K.; Sukkarieh, S. 3D smooth path planning for a UAV in cluttered natural environments. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Nice, France, 22–26 September 2008; pp. 794–800. [Google Scholar]
  16. Karaman, S.; Frazzoli, E. Incremental sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
  17. Cui, R.; Li, Y.; Yan, W. Mutual information-based multi-AUV path planning for scalar field sampling using multidimensional RRT*. IEEE Trans. Syst. Man Cybern. Syst. 2017, 46, 993–1004. [Google Scholar] [CrossRef]
  18. Du, M.; Mei, T.; Liang, H.; Chen, J.; Huang, R.; Zhao, P. Drivers’ visual behavior-guided RRT motion planner for autonomous on-road driving. Sensors 2016, 16, 102. [Google Scholar] [CrossRef]
  19. Schopferer, S.; Benders, S. Minimum-Risk Path Planning for Long-Range and Low-Altitude Flights of Autonomous Unmanned Aircraft. In Proceedings of the AIAA Scitech 2020 Forum, Orlando, FL, USA, 6–10 January 2020. [Google Scholar]
  20. Liu, T.; He, X.; Niu, Y.; Li, J.; Li, Z. TICOP: Time-Critical Coordinated Planning for Fixed-Wing UAVs in Unknown Unstructured Environments. IEEE Robot. Autom. Lett. 2024, 9, 9629–9636. [Google Scholar] [CrossRef]
  21. Tal, E.; Ryou, G.; Karaman, S. Aerobatic Trajectory Generation for a VTOL Fixed-Wing Aircraft Using Differential Flatness. IEEE Trans. Robot. 2023, 39, 4805–4819. [Google Scholar] [CrossRef]
  22. Cui, P.; Yan, W.S.; Guo, X.X. Path Planning for Robot with Pose Constraints Using Dubins-RRT*. In Proceedings of the 2018 3rd IEEE International Conference on Advanced Robotics and Mechatronics (ICARM), Shenzhen, China, 18–20 July 2018; pp. 560–565. [Google Scholar]
  23. Pharpatara, P.; Herisse, B.; Bestaoui, Y. 3-D Trajectory Planning of Aerial Vehicles Using RRT*. IEEE Trans. Control Syst. Technol. 2017, 25, 1116–1123. [Google Scholar] [CrossRef]
  24. Niendorf, M.; Schmitt, F.; Adolf, F. Multi-query Path Planning for an Unmanned Fixed-Wing Aircraft. In Proceedings of the AIAA Guidance, Navigation, and Control (GNC) Conference, Boston, MA, USA, 19–22 August 2013. [Google Scholar]
  25. Liu, X.; Su, Y.; Wu, Y.; Guo, Y. Multi-Conflict-Based Optimal Algorithm for Multi-UAV Cooperative Path Planning. Drones 2023, 7, 217. [Google Scholar] [CrossRef]
  26. Bai, H.; Fan, T.; Niu, Y.; Cui, Z. Multi-UAV Cooperative Trajectory Planning Based on Many-Objective Evolutionary Algorithm. Complex Syst. Model. Simul. 2022, 2, 130–141. [Google Scholar] [CrossRef]
  27. Huang, Y.; Tang, J.; Lao, S. Cooperative Multi-UAV Collision Avoidance Based on a Complex Network. Appl. Sci. 2019, 9, 3943. [Google Scholar] [CrossRef]
  28. Wang, K.; Song, M.; Li, M. Cooperative Multi-UAV Conflict Avoidance Planning in a Complex Urban Environment. Sustainability 2021, 13, 6807. [Google Scholar] [CrossRef]
  29. Sakcak, B.; Bascetta, L.; Ferretti, G.; Prandini, M. Sampling-based optimal kinodynamic planning with motion primitives. Auton. Robot. 2019, 43, 1715–1732. [Google Scholar] [CrossRef]
  30. Zheng, D.; Tsiotras, P. Accelerating Kinodynamic RRT* Through Dimensionality Reduction. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 3674–3680. [Google Scholar]
  31. Gammell, J.D.; Srinivasa, S.S.; Barfoot, T.D. Informed RRT*: Optimal sampling-based path planning focused via direct sampling of an admissible ellipsoidal heuristic. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, Chicago, IL, USA, 14–18 September 2014; pp. 2997–3004. [Google Scholar]
  32. Olfati-Saber, R.; Fax, J.A.; Murray, R.M. Consensus and cooperation in networked multi-agent systems. Proc. IEEE 2007, 95, 215–233. [Google Scholar] [CrossRef]
  33. Ren, W.; Beard, R.W. Consensus seeking in multiagent systems under dynamically changing interaction topologies. IEEE Trans. Autom. Control 2005, 50, 655–661. [Google Scholar] [CrossRef]
  34. Feng, H.; Hu, Q.; Zhao, Z.; Feng, X. Smooth path planning under maximum curvature constraints for autonomous underwater vehicles based on rapidly-exploring random tree star with B-spline curves. Eng. Appl. Artif. Intell. 2024, 133, 108583. [Google Scholar] [CrossRef]
Figure 1. Scenario of cooperative online trajectory planning for fixed-wing UAVs.
Figure 1. Scenario of cooperative online trajectory planning for fixed-wing UAVs.
Drones 10 00297 g001
Figure 2. Raster DEM.
Figure 2. Raster DEM.
Drones 10 00297 g002
Figure 3. Approximation of the ellipsoid by a tangent plane.
Figure 3. Approximation of the ellipsoid by a tangent plane.
Drones 10 00297 g003
Figure 4. Framework of cooperative obstacle avoidance trajectory planning. Blue arrows indicate the internal control loop and data processing flow within a single UAV, while green arrows represent the information exchange via the Self-organizing Network Data Link.
Figure 4. Framework of cooperative obstacle avoidance trajectory planning. Blue arrows indicate the internal control loop and data processing flow within a single UAV, while green arrows represent the information exchange via the Self-organizing Network Data Link.
Drones 10 00297 g004
Figure 5. Comparison of different sampling strategies. (a) Sampling the full state space. (b) Sampling the reduced state space. The red dot indicates the target point, while the green dots indicate the intermediate waypoints.
Figure 5. Comparison of different sampling strategies. (a) Sampling the full state space. (b) Sampling the reduced state space. The red dot indicates the target point, while the green dots indicate the intermediate waypoints.
Drones 10 00297 g005
Figure 6. Ray-shaped aggregation phenomenon in the RRT, where the gray shaded areas represent obstacles.
Figure 6. Ray-shaped aggregation phenomenon in the RRT, where the gray shaded areas represent obstacles.
Drones 10 00297 g006
Figure 7. Illustration of altitude smoothing results.
Figure 7. Illustration of altitude smoothing results.
Drones 10 00297 g007
Figure 8. Illustration of the lateral maneuver path.
Figure 8. Illustration of the lateral maneuver path.
Drones 10 00297 g008
Figure 9. Spatial path graph.
Figure 9. Spatial path graph.
Drones 10 00297 g009
Figure 10. Altitude profiles of UAV trajectories. (a) UAV 1. (b) UAV 2. (c) UAV 3. (d) UAV 4. (e) UAV 5. (f) UAV 6.
Figure 10. Altitude profiles of UAV trajectories. (a) UAV 1. (b) UAV 2. (c) UAV 3. (d) UAV 4. (e) UAV 5. (f) UAV 6.
Drones 10 00297 g010
Figure 11. The result of multi-UAV path planning. (a) Original trajectory. (b) Adjusted trajectory, where the red graphics indicate no-fly zones.
Figure 11. The result of multi-UAV path planning. (a) Original trajectory. (b) Adjusted trajectory, where the red graphics indicate no-fly zones.
Drones 10 00297 g011
Figure 12. Online planning results in dynamic scenes. (a) Delayed cooperation. (b) Mission scaling and reconstruction. (c) Target reassignment and exchange. (d) Global map switching.
Figure 12. Online planning results in dynamic scenes. (a) Delayed cooperation. (b) Mission scaling and reconstruction. (c) Target reassignment and exchange. (d) Global map switching.
Drones 10 00297 g012aDrones 10 00297 g012b
Figure 13. Testing scenario. (a) Scenario 1. (b) Scenario 2.
Figure 13. Testing scenario. (a) Scenario 1. (b) Scenario 2.
Drones 10 00297 g013
Figure 14. Performance comparison of different resolution databases. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Figure 14. Performance comparison of different resolution databases. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Drones 10 00297 g014
Figure 15. Performance comparison of databases with different sampling ranges. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Figure 15. Performance comparison of databases with different sampling ranges. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Drones 10 00297 g015
Figure 16. Performance comparison of different prune ranges. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Figure 16. Performance comparison of different prune ranges. (a) The variation in running time with the number of iterations. (b) The variation in path length with the number of iterations.
Drones 10 00297 g016
Figure 17. The variation in path length and running time with the number of iterations. (a) Running time comparison; (b) Path length comparison.
Figure 17. The variation in path length and running time with the number of iterations. (a) Running time comparison; (b) Path length comparison.
Drones 10 00297 g017
Figure 18. The calculation time and path length for the first solution. (a) Running time comparison; (b) Path length comparison.
Figure 18. The calculation time and path length for the first solution. (a) Running time comparison; (b) Path length comparison.
Drones 10 00297 g018
Figure 19. Hardware-in-the-loop simulation procedure. Ad-hoc denotes a self-organizing, infrastructure-free wireless network.
Figure 19. Hardware-in-the-loop simulation procedure. Ad-hoc denotes a self-organizing, infrastructure-free wireless network.
Drones 10 00297 g019
Figure 20. Results of the hardware-in-the-loop (HIL) simulation. The red volumetric areas represent the no-fly zones. The blue lines depict the flight trajectories of the UAVs. The green lines between the UAVs represent the communication links, while the mint-green areas in front of the UAVs indicate the sensor scanning regions. (a) UAV trajectories at mission commencement. (b) UAV trajectories at mid-mission. (c) UAV trajectories upon mission completion. (d) Full-scenario simulation visualization of the cooperative mission.
Figure 20. Results of the hardware-in-the-loop (HIL) simulation. The red volumetric areas represent the no-fly zones. The blue lines depict the flight trajectories of the UAVs. The green lines between the UAVs represent the communication links, while the mint-green areas in front of the UAVs indicate the sensor scanning regions. (a) UAV trajectories at mission commencement. (b) UAV trajectories at mid-mission. (c) UAV trajectories upon mission completion. (d) Full-scenario simulation visualization of the cooperative mission.
Drones 10 00297 g020aDrones 10 00297 g020b
Figure 21. Output curves of the onboard flight controller in the hardware-in-the-loop (HIL) simulation. (a) UAV 1. UAV2. UAV3. (b) UAV4. UAV5. UAV 6.
Figure 21. Output curves of the onboard flight controller in the hardware-in-the-loop (HIL) simulation. (a) UAV 1. UAV2. UAV3. (b) UAV4. UAV5. UAV 6.
Drones 10 00297 g021
Table 1. Comparison of computational time for different trajectory generation methods.
Table 1. Comparison of computational time for different trajectory generation methods.
Calculation MethodMotion Primitive Database3D DubinsGPOPSII
Average time (ms)0.02080.299133268.431
Minimum time (ms)0.01200.23551870.385
Maximum time (ms)0.0298116.667914,432.608
Table 2. Comparison of computational characteristics among different RRT*-based planners.
Table 2. Comparison of computational characteristics among different RRT*-based planners.
ProcedureStandard RRT* [16]Conventional Dubins-RRT*Proposed Quick-Dubins-RRT*
Search
(nearest-neighbor query)
O log n O log n O log n
(dominant cost)
SteerSimple straight-line interpolation (low cost)Online TPBVP solving (computationally expensive)Database retrieval (near-constant time)
SampleUniform random samplingUniform random samplingReduced sampling space (guided + constrained)
Obstacle FreeGlobal collision checking (high cost)Discretized Dubins curve checkingLocalized subset + adaptive discretization
StorageMinimalMinimalPrecomputed motion primitive database (compressed)
Table 3. Performance parameters of the fixed-wing UAVs.
Table 3. Performance parameters of the fixed-wing UAVs.
Parameters of UAVNumeric Value
Maximum flight path angle30°
Maximum roll angle30°
Maximum fly height7000 m
Cruising speed140 m/s
Minimum height above ground500 m
Table 4. Initial states of the UAVs.
Table 4. Initial states of the UAVs.
Number of UAVPositionHeading AngleFlight Path Angle
1(100.452348°, 27.954586°, 4500.0 m)
2(100.352348°, 27.914586°, 4300.0 m)
3(100.272580°, 27.836248°, 5000.0 m)
4(100.178415°, 27.740273°, 4000.0 m)
5(100.102122°, 27.666560°, 5000.0 m)
6(100.022122°, 27.596560°, 4500.0 m)
Table 5. Terminal states of the UAVs.
Table 5. Terminal states of the UAVs.
Number of UAVPositionHeading AngleFlight Path Angle
1(100.938226°, 27.145075°, 3000)
2(100.934318°, 27.135888°, 3200)
3(100.931072°, 27.126711°, 3150)
4(100.927853°, 27.118753°, 3400)
5(100.925587°, 27.100253°, 3500)
6(100.922434°, 27.093504°, 3550)
Table 6. Parameters of obstacles in the mission environment.
Table 6. Parameters of obstacles in the mission environment.
Number of ObstaclesPositionType of Obstacles
1Vertex (100.4486°, 27.6211°),
(100.4382°, 27.5478°), (100.3477°, 27.5460°),
(100.2613°, 27.6175°), (100.3265°, 27.7379°)
no-fly zone
2Vertex
(100.5982°, 27.2263°), (100.5675°, 27.1858°),
(100.4996°, 27.2040°), (100.5433°, 27.2408°)
Vertex
Threat zone
3Center position (100.6332°, 27.5967°), range 4500 mno-fly zone
4Center position (100.6860°, 27.3718°), range 6000 mno-fly zone
5Center position (100.7896°, 27.7116°), range 9000 mno-fly zone
6Center position (100.4559°, 27.3847°), range 8600 mThreat zone
7Center position (100.1983°, 27.2877°), range 10,000 mThreat zone
8Center position (100.7742°, 27.1877°), range 2500 mThreat zone
Table 7. Parameter settings of the Quick-Dubins-RRT algorithm.
Table 7. Parameter settings of the Quick-Dubins-RRT algorithm.
ParametersNumeric Value
Database sample range20,000 m
Course angle resolution15°
Grid resolution1000 m
Prune range δ s 1500 m
Maximum expend step10,000 m
Table 8. UAV trajectory lengths before and after time coordination adjustment.
Table 8. UAV trajectory lengths before and after time coordination adjustment.
Number of UAVPath Length Before AdjustmentPath Length After Adjustment
1103.2012 km108.63 km
2105.8713 km108.65 km
3103.1137 km108.63 km
4108.6725 km108.67 km
5103.2006 km108.64 km
6105.4413 km108.65 km
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nie, Y.; Zhang, X.; Li, C.; Zhang, D. Cooperative Online 3D Path Planning for Fixed-Wing UAVs. Drones 2026, 10, 297. https://doi.org/10.3390/drones10040297

AMA Style

Nie Y, Zhang X, Li C, Zhang D. Cooperative Online 3D Path Planning for Fixed-Wing UAVs. Drones. 2026; 10(4):297. https://doi.org/10.3390/drones10040297

Chicago/Turabian Style

Nie, Yonggang, Xinyue Zhang, Chaoyue Li, and Dong Zhang. 2026. "Cooperative Online 3D Path Planning for Fixed-Wing UAVs" Drones 10, no. 4: 297. https://doi.org/10.3390/drones10040297

APA Style

Nie, Y., Zhang, X., Li, C., & Zhang, D. (2026). Cooperative Online 3D Path Planning for Fixed-Wing UAVs. Drones, 10(4), 297. https://doi.org/10.3390/drones10040297

Article Metrics

Back to TopTop