RG-HDP-VD: A Physics-Aware Cooperative Trajectory Planning Framework for Heterogeneous Multi-UAVs
Highlights
- A physics-aware cooperative planning framework (RG-HDP-VD) was developed and validated in real-world flights, integrating mass-augmented energy topology, regret-guided arbitration, and velocity decomposition.
- The framework demonstrates superior scalability in saturated airspace, maintaining a 95% success rate where baselines fail, while reducing average planning time by ~45% and lowering total system energy by 6.7%.
- Physics-consistent right-of-way allocation mitigates energy-inefficient congestion by prioritizing high-penalty platforms, providing a highly scalable alternative to conventional methods that prevents deadlocks.
- Mapping rigid time-window constraints into length-feasible regions via velocity envelopes offers a robust way to maintain spatiotemporal feasibility and prevent timing failures without relying on terminal loitering.
Abstract
1. Introduction
- Insufficient modeling of physical heterogeneity and payload changes can trigger energy imbalance and deadlocks. Heavy-lift and light UAVs exhibit strong asymmetry in hovering power. In bottlenecks, distance-based arbitration underestimates the waiting penalty of heavy-load platforms, forcing them to idle where hovering dominates expenditure. This may trigger energy-inefficient congestion. Consistent with the need for realistic cost modeling highlighted in recent surveys [5,6], this gap calls for a physically consistent, regret-guided arbitration mechanism.
- Rigid coupling between time windows and fixed-speed assumptions leads to rapid feasible-set shrinkage and cascading failures. SAT missions with tight windows often suffer from solution-space contraction. Fixed-speed assumptions rigidly couple arrival time to path length, meaning any detour directly erodes time margins. While spacetime planning variants (e.g., spacetime RRT* [26]) and learning-based conflict resolution [23,24,27] attempt to handle these constraints, they often lack the explicit velocity elasticity needed for robust rhythm modulation. To prevent single-agent infeasibility from propagating through the fleet, time feasibility should be transformed into distributed velocity envelopes along flight segments, structurally mitigating the contraction of the feasible set.
- Regret-Guided dynamic right-of-way arbitration. A physically grounded fairness mechanism that explicitly captures energy asymmetry across heterogeneous platforms to prevent energy-inefficient congestion in bottleneck airspace.
- Velocity-decomposition-based elastic spatiotemporal planning. By introducing velocity envelopes, rigid fixed-speed constraints are decoupled into an elastic path-length feasible set, expanding feasibility under tight time windows in non-convex terrain.
- A physics-aware layered cooperative planning framework (RG-HDP-VD). The framework integrates mass-augmented energy topology, dynamic arbitration, and executable 4D trajectory generation with adaptive smoothing (B-spline or PCHIP) and continuous collision checking.
2. Problem Formulation and System Modeling
2.1. System Modeling and Performance Constraints
2.1.1. Dynamic Mass Model
2.1.2. Mass-Augmented Energy Cost Model
- Asymmetric Motion Power: distinguishes between climbing and descending efficiency, penalizing aggressive maneuvering under heavy-load states;
- Nonlinear Hovering Power: , where is the hovering power coefficient, implying that the cost for heavy-load platforms to wait at bottlenecks is significantly higher, providing a physical basis for subsequent right-of-way arbitration.
2.1.3. Kinematic Feasibility Under a Velocity Envelope
2.2. Spatiotemporal Cooperative Constraints
2.2.1. Continuous Inter-Agent Safety Constraint
2.2.2. Cooperative Arrival Time Window Constraint
2.3. Cooperative Path Planning Problem Definition
- Energy Efficiency (): A normalized energy term based on Equation (2), guiding the planner to generate energy-saving paths consistent with heterogeneous energy efficiency characteristics;
- Synchronization Accuracy (): Quantifies the deviation relative to the target arrival time:where is the final arrival time of the trajectory, and is a normalization scale (can be taken as or uniformly to avoid division by zero). is given by mission rank and arrival interval (e.g., ). Under strict constraints, this term should converge within the allowable time window error range;
- Threat Exposure (): Conditional Value-at-Risk (CVaR) is employed to assess spatial safety. Let be the threat potential field value at position , and define the random variable as the risk distribution along trajectory (e.g., constituted by time-domain sampling of ). Then, -CVaR [29] is defined as the expected value of the worst high-risk segments:Here, is an auxiliary scalar (VaR-like threshold) introduced in the standard CVaR reformulation; the infimum over yields , where . This metric enhances robustness against non-deterministic disturbances by heavily penalizing the highest-risk segments of the path rather than the average risk [30].
3. The RG-HDP-VD Physics-Aware Cooperative Planning Framework
3.1. System Architecture and Problem Decomposition
3.2. Mass-Augmented Energy Topology
3.3. Regret-Guided Arbitration and Time Anchoring
3.3.1. Global Cooperative Time Anchoring
3.3.2. Regret-Guided Dynamic Right-of-Way Arbitration
3.3.3. HDP Decentralized Priority Coordination and Occupancy Closed-Loop
3.4. Elastic Execution Based on Velocity Decomposition
3.4.1. Principle of Velocity Decomposition
3.4.2. Implementation of VD-TSRRT* Planning Algorithm
- Physical Pre-check: Utilizing the topological prior of the baseline path from L1, if its length satisfies the intersection condition between the physically reachable time domain and the target window, sampling is skipped, and the path is output directly:
- 2.
- Pruning & Adaptive Bias: A maximum physical length upper bound is introduced, including a time tolerance and a fixed margin . For a tree node , if its total length estimate , it is judged as “inevitably late” and forcibly pruned:Simultaneously, if the current optimal path is too short and leads to an “inevitably early” status (i.e., ), the algorithm automatically decreases the goal bias. This forces the random tree to grow more circuitously into the surrounding free space so that it can enter the feasible length interval.
- 3.
- Feasibility-Aware Cost and Asymptotic Optimality: A step-type cost function is constructed, treating the time feasibility intersection as a hard threshold:
| Algorithm 1. VD-TSRRT*(σ_base, t_low, t_high, v_min, v_max) |
| 1. L_max ← v_max · (t_high + ε_t) + ΔL 2. ) then 3. return σ_base 4. end if 5. Initialize tree T with x_start; optionally insert a prefix of σ_base into T 6. best_node ← null; best_feas ← +∞; best_gap ← +∞ 7. for iter = 1 … maxIter do 8. x_new ← SampleAndExtend(T) 9. if g(x_new) + h(x_new) > L_max then continue end if (pruning; Equation (23)) 10. Rewire-Parent(T, x_new); Rewire-Child(T, x_new) 11. if NearGoal(x_new) then 12. L_curr ← PathLength(T, x_new) 13. ) then (feasible; Equation (19)) 14. if L_curr < best_feas then best_feas ← L_curr; best_node ← x_new end if 15. else 16. gap ← TimeGap(L_curr, t_low, t_high, v_min, v_max, ε_t) (Equation (23)) 17. if gap < best_gap then best_gap ← gap; best_node ← x_new end if 18. end if 19. end if 20. if best_node ≠ null and L_est(best_node)/v_min < t_low − ε_t then decrease goalBias end if 21. end for 22. return BacktrackPath(T, best_node) |
3.4.3. Fallback Strategy and Sequential Avoidance
3.5. Trajectory Realization and Continuous Verification
4. Experimental Evaluation and Analysis
4.1. Experimental Setup
4.1.1. Simulation Environment Deployment and Heterogeneous Physical Models
4.1.2. Mission Scenario Design
4.1.3. Experimental Design and Evaluation Metrics
4.2. Baseline Comparison
4.3. Core Mechanism Ablation and Mechanism Analysis
4.3.1. RG Mechanism Ablation: Energy Efficiency Gains and Waiting Suppression Mechanism
4.3.2. VD Mechanism Ablation: Spatiotemporal Feasibility and Synchronization Accuracy Gains of Velocity Decomposition
- Terminal Airspace Saturation: Free space near the target area is extremely limited, and terrain undulations restrict available loitering radii. Once the Baseline attempts to loiter at a bottleneck, the detained airframe quickly transforms into a “long-duration dynamic obstacle,” causing the continuous collision detection and inter-agent separation constraints in the terminal airspace to fail simultaneously;
- Occupancy Cascade: The loitering strategy under fixed speed possesses extremely high spatial exclusivity. The loitering behavior of high-priority UAVs generates long-duration dynamic occlusion in the Spatiotemporal Occupancy Map (-Map), compressing the feasible region for low-priority UAVs and forcing them to seek more distant loitering points, leading to an exponential increase in energy consumption and risk costs;
- Cost Divergence: To avoid channels occupied by loitering aircraft, subsequent UAVs are forced to execute large-scale detours. This detouring induced by “passive loitering” causes surges in path length and Risk Cost, making the underlying sampling planner unable to converge to a solution satisfying cost constraints within limited iterations.
4.4. Parameter Sensitivity Analysis
4.4.1. Experiment A: Synchronization Pressure ()
4.4.2. Experiment B: Energy-Fairness Anchor ()
4.4.3. Discussion of Secondary Parameters
5. Real-World Flight Demonstration
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| UAV | Unmanned Aerial Vehicle |
| HMUAS | Heterogeneous Multi-Unmanned Aerial Systems |
| MUCPP | Multi-UAV Cooperative Path Planning |
| SAT | Simultaneous Arrival Task |
| RG-HDP-VD | Regret-Guided Heuristic Decentralized Prioritized Planning with Velocity Decomposition |
| RG | Regret-Guided (right-of-way arbitration) |
| HDP | Heuristic Decentralized Prioritized Planning |
| VD | Velocity Decomposition |
| A* | A-star Search |
| RRT* | Rapidly exploring Random Tree Star |
| VD-TSRRT* | Velocity-Decomposition Time–Space RRT* |
| CDF | Cumulative Distribution Function |
| CVaR | Conditional Value-at-Risk |
| APF | Artificial Potential Field |
| ORCA | Optimal Reciprocal Collision Avoidance |
| CBS | Conflict-Based Search |
| EECBS | Enhanced Edge-Weighted Conflict-Based Search |
| RDP | Ramer–Douglas–Peucker (path simplification) |
| PCHIP | Piecewise Cubic Hermite Interpolating Polynomial |
Appendix A
| Symbol | Meaning |
|---|---|
| is the number of UAVs | |
| Environment constraints set | |
| Terrain obstacles (hard constraints) | |
| Threat regions (risk exposure) | |
| for timed trajectory) | |
| Common mission timeline length | |
| Global cooperative time anchor | |
| Safety radius (inflation for body size and uncertainty) | |
| (Minkowski sum) | |
| Minimum safe separation threshold | |
| Global spatiotemporal occupancy/constraint map used in HDP updates | |
| (payload drop causes piecewise change) | |
| (motion + hover) | |
| Motion power and hover power terms | |
| Global weighted objective for multi-UAV planning | |
| Normalized energy/synchronization/risk components | |
| Weights for the three objective components | |
| Tail-risk metric penalizing high-risk exposure segments | |
| (RG-based prioritization) | |
| Energy costs of “wait/hover” vs. “detour” actions in RG arbitration | |
| Geometric path length used in VD feasibility check | |
References
- Chung, S.-J.; Paranjape, A.A.; Dames, P.; Shen, S.; Kumar, V. A Survey on Aerial Swarm Robotics. IEEE Trans. Robot. 2018, 34, 837–855. [Google Scholar] [CrossRef]
- Mellinger, D.; Kumar, V. Minimum Snap Trajectory Generation and Control for Quadrotors. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, 9–13 May 2011; pp. 2520–2525. [Google Scholar] [CrossRef]
- Richter, C.; Bry, A.; Roy, N. Polynomial Trajectory Planning for Aggressive Quadrotor Flight in Dense Indoor Environments. In Robotics Research; Springer: Cham, Switzerland, 2016; pp. 649–666. [Google Scholar] [CrossRef]
- Kumar, P.; Pal, K.; Govil, M.C. Comprehensive Review of Path Planning Techniques for Unmanned Aerial Vehicles (UAVs). ACM Comput. Surv. 2025, 58, 1–44. [Google Scholar] [CrossRef]
- Rahman, M.; Sarkar, N.I.; Lutui, R. A Survey on Multi-UAV Path Planning: Classification, Algorithms, Open Research Problems, and Future Directions. Drones 2025, 9, 263. [Google Scholar] [CrossRef]
- Wang, L.; Huang, W.; Li, H.; Li, W.; Chen, J.; Wu, W. A Review of Collaborative Trajectory Planning for Multiple Unmanned Aerial Vehicles. Processes 2024, 12, 1272. [Google Scholar] [CrossRef]
- Babel, L. Coordinated Target Assignment and UAV Path Planning with Timing Constraints. J. Intell. Robot. Syst. 2019, 94, 857–869. [Google Scholar] [CrossRef]
- Yan, F.; Zhu, X.; Zhou, Z.; Chu, J. A Hierarchical Mission Planning Method for Simultaneous Arrival of Multi-UAV Coalition. Appl. Sci. 2019, 9, 1986. [Google Scholar] [CrossRef]
- Andreychuk, A.; Yakovlev, K.; Surynek, P.; Atzmon, D.; Stern, R. Multi-Agent Pathfinding with Continuous Time. Artif. Intell. 2022, 305, 103662. [Google Scholar] [CrossRef]
- Phillips, M.; Likhachev, M. SIPP: Safe Interval Path Planning for Dynamic Environments. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, 9–13 May 2011; pp. 5628–5635. [Google Scholar] [CrossRef]
- Ait Saadi, A.; Bhuyan, B.P.; Ramdane-Cherif, A. Power Consumption Model for Unmanned Aerial Vehicles Using Recurrent Neural Network Techniques. Aerosp. Sci. Technol. 2025, 157, 109819. [Google Scholar] [CrossRef]
- van den Berg, J.; Guy, S.J.; Lin, M.-C.; Manocha, D. Reciprocal n-Body Collision Avoidance. In Robotics Research; Springer: Berlin, Germany, 2011; pp. 3–19. [Google Scholar] [CrossRef]
- Khatib, O. Real-Time Obstacle Avoidance for Manipulators and Mobile Robots. In Proceedings of the 1985 IEEE International Conference on Robotics and Automation (ICRA), St. Louis, MO, USA, 25–28 March 1985; pp. 500–505. [Google Scholar] [CrossRef]
- Sharon, G.; Stern, R.; Felner, A.; Sturtevant, N.R. Conflict-Based Search for Optimal Multi-Agent Path Finding. Artif. Intell. 2015, 219, 40–66. [Google Scholar] [CrossRef]
- Li, J.; Ruml, W.; Koenig, S. EECBS: A Bounded-Suboptimal Search for Multi-Agent Path Finding. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Virtual Event, 2–9 February 2021; pp. 12353–12362. [Google Scholar] [CrossRef]
- Liu, X.; Su, Y.; Wu, Y.; Guo, Y. Multi-Conflict-Based Optimal Algorithm for Multi-UAV Cooperative Path Planning. Drones 2023, 7, 217. [Google Scholar] [CrossRef]
- Semiz, F.; Polat, F. Incremental Multi-Agent Path Finding. Future Gener. Comput. Syst. 2021, 116, 220–233. [Google Scholar] [CrossRef]
- Silver, D. Cooperative Pathfinding. In Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), Marina del Rey, CA, USA, 1–5 June 2005; pp. 117–122. [Google Scholar] [CrossRef]
- Velagapudi, P.; Sycara, K.P.; Scerri, P. Decentralized Prioritized Planning in Large Multirobot Teams. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Taipei, Taiwan, 18–22 October 2010; pp. 4603–4609. [Google Scholar] [CrossRef]
- Guo, Y.; Liu, X.; Jiang, W.; Zhang, W. Collision-Free 4D Dynamic Path Planning for Multiple UAVs Based on Dynamic Priority RRT* and Artificial Potential Field. Drones 2023, 7, 180. [Google Scholar] [CrossRef]
- Ma, H.; Harabor, D.; Stuckey, P.J.; Li, J.; Koenig, S. Searching with Consistent Prioritization for Multi-Agent Path Finding. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, USA, 27 January–1 February 2019; pp. 7643–7650. [Google Scholar] [CrossRef]
- Chagas, F.S.; Ruseno, N.; Bechina, A.A.A. Artificial Intelligence Approaches for UAV Deconfliction: A Comparative Review and Framework Proposal. Automation 2025, 6, 54. [Google Scholar] [CrossRef]
- Zhang, M.; Yan, C.; Dai, W.; Xiang, X.; Low, K.H. Tactical Conflict Resolution in Urban Airspace for Unmanned Aerial Vehicles Operations Using Attention-Based Deep Reinforcement Learning. Green Energy Intell. Transp. 2023, 2, 100107. [Google Scholar] [CrossRef]
- Kong, X.; Zhou, Y.; Li, Z.; Wang, S. Multi-UAV Simultaneous Target Assignment and Path Planning Based on Deep Reinforcement Learning in Dynamic Multiple Obstacles Environments. Front. Neurorobotics 2024, 17, 1302898. [Google Scholar] [CrossRef]
- Yan, H.; Chen, Y.; Yang, S.-H. New Energy Consumption Model for Rotary-Wing UAV Propulsion. IEEE Wirel. Commun. Lett. 2021, 10, 2009–2012. [Google Scholar] [CrossRef]
- Burzyński, W.; Stecz, W. Trajectory Planning with Multiplatform Spacetime RRT*. Appl. Intell. 2024, 54, 9524–9541. [Google Scholar] [CrossRef]
- Guo, Y.; Liu, X.; Jiang, W.; Zhang, W. HDP-TSRRT*: A Time–Space Cooperative Path Planning Algorithm for Multiple UAVs. Drones 2023, 7, 170. [Google Scholar] [CrossRef]
- Rockafellar, R.T.; Uryasev, S. Optimization of Conditional Value-at-Risk. J. Risk 2000, 2, 21–42. [Google Scholar] [CrossRef]
- Hakobyan, A.; Kim, G.C.; Yang, I. Risk-Aware Motion Planning and Control Using CVaR-Constrained Optimization. IEEE Robot. Autom. Lett. 2019, 4, 3924–3931. [Google Scholar] [CrossRef]
- Karaman, S.; Frazzoli, E. Sampling-Based Algorithms for Optimal Motion Planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
- Douglas, D.H.; Peucker, T.K. Algorithms for the Reduction of the Number of Points Required to Represent a Digitized Line or Its Caricature. Can. Cartogr. 1973, 10, 112–122. [Google Scholar] [CrossRef]
- Zhou, B.; Gao, F.; Wang, L.; Liu, C.; Shen, S. Robust and Efficient Quadrotor Trajectory Generation for Fast Autonomous Flight. IEEE Robot. Autom. Lett. 2019, 4, 3529–3536. [Google Scholar] [CrossRef]
- Fritsch, F.N.; Carlson, R.E. Monotone Piecewise Cubic Interpolation. SIAM J. Numer. Anal. 1980, 17, 238–246. [Google Scholar] [CrossRef]
- Redon, S.; Kheddar, A.; Coquillart, S. Fast Continuous Collision Detection between Rigid Bodies. Comput. Graph. Forum 2002, 21, 279–287. [Google Scholar] [CrossRef]
- Pan, J.; Chitta, S.; Manocha, D. FCL: A General Purpose Library for Collision and Proximity Queries. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation (ICRA), St. Paul, MN, USA, 14–18 May 2012; pp. 3859–3866. [Google Scholar] [CrossRef]
- Fournier, A.; Fussell, D.; Carpenter, L. Computer Rendering of Stochastic Models. Commun. ACM 1982, 25, 371–384. [Google Scholar] [CrossRef]
















| Parameter | Symbol | Heavy-Lift UAVs | Light UAVs | Unit |
|---|---|---|---|---|
| Empty/Payload Mass | 20.0/10.0 | 2.0/0.5 | kg | |
| Max Flight Speed | 12 | 15 | m/s | |
| Hovering Power Coeff. | 280 | 80 | ||
| Safety Collision Radius | 5.0 | 3.0 | m | |
| Elastic Velocity Envelope | 7–12 | 7–15 | m/s |
| Method | Success Rate (%) | (s) | 108J) | |
|---|---|---|---|---|
| 4 | OURS | 100 | 0.8 | 0.56 |
| ECBS | 90 | 4.2 | 0.63 | |
| ORCA | 55 | 11.5 | 0.69 | |
| 8 | OURS | 100 | 1.12 | 1.1 |
| ECBS | 55 | 8.3 | 1.32 | |
| ORCA | 15 | 21.5 | 1.46 | |
| 12 | OURS | 100 | 2.8 | 1.7 |
| ECBS | 30 | 13.6 | 2.17 | |
| ORCA | 5 | 27.4 | 2.36 | |
| 16 | OURS | 95 | 4.1 | 2.31 |
| ECBS | 0 | 18.2 | 3.03 | |
| ORCA | 0 | 34.8 | 3.21 |
| Success Rate | 108J) | (s) | ||
|---|---|---|---|---|
| 0.2 | 0 | 0% | 1.02 | 32.51 |
| 0.2 | 20% | 1.04 | 16.25 | |
| 0.5 | 100% | 1.07 | 5.52 | |
| 1 | 100% | 1.10 | 1.12 | |
| 2 | 100% | 1.18 | 0.32 | |
| 0.5 | 0 | 0% | 0.98 | 33.33 |
| 0.2 | 20% | 1.03 | 18.98 | |
| 0.5 | 80% | 1.06 | 10.38 | |
| 1 | 100% | 1.09 | 5.54 | |
| 2 | 100% | 1.15 | 2.37 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Han, D.; Hua, Z.; Zhu, X.; Luo, L.; Jiang, H.; Wang, L. RG-HDP-VD: A Physics-Aware Cooperative Trajectory Planning Framework for Heterogeneous Multi-UAVs. Drones 2026, 10, 192. https://doi.org/10.3390/drones10030192
Han D, Hua Z, Zhu X, Luo L, Jiang H, Wang L. RG-HDP-VD: A Physics-Aware Cooperative Trajectory Planning Framework for Heterogeneous Multi-UAVs. Drones. 2026; 10(3):192. https://doi.org/10.3390/drones10030192
Chicago/Turabian StyleHan, Dan, Zhaoyuan Hua, Xinyu Zhu, Liang Luo, Hao Jiang, and Lifang Wang. 2026. "RG-HDP-VD: A Physics-Aware Cooperative Trajectory Planning Framework for Heterogeneous Multi-UAVs" Drones 10, no. 3: 192. https://doi.org/10.3390/drones10030192
APA StyleHan, D., Hua, Z., Zhu, X., Luo, L., Jiang, H., & Wang, L. (2026). RG-HDP-VD: A Physics-Aware Cooperative Trajectory Planning Framework for Heterogeneous Multi-UAVs. Drones, 10(3), 192. https://doi.org/10.3390/drones10030192

