Article

PAIR: A Hybrid A* with PPO Path Planner for Multi-UAV Navigation in 2-D Dynamic Urban MEC Environments

by
Bahaa Hussein Taher
1,2,*,
Juan Luo
1,
Ying Qiao
1 and
Hussein Ridha Sayegh
1
1
College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
2
College of Computer Science and Information Technology, University of Sumer, Al Rifaee 64005, Iraq
*
Author to whom correspondence should be addressed.
Drones 2026, 10(1), 58; https://doi.org/10.3390/drones10010058
Submission received: 6 December 2025 / Revised: 26 December 2025 / Accepted: 7 January 2026 / Published: 13 January 2026
(This article belongs to the Special Issue Path Planning, Trajectory Tracking and Guidance for UAVs: 3rd Edition)

Highlights

What are the main findings?
  • PAIR, which combines A* for global routing with PPO for local adjustments, reaches 100% success in nine urban test cases and beats A*, D* Lite, CBS-D*, and PSO in overall path quality, energy use, and travel time.
  • In tests with 5 to 30 moving obstacles, PAIR keeps paths shorter and replanning times lower than the others, showing its reliability for busy city drone flights.
What is the implication of the main finding?
  • Combining a fast discrete backbone with lightweight continuous RL refinements has potential for energy-efficient and low-latency navigation in cluttered MEC-inspired low-altitude airspace.
  • The PAIR framework and released 2-D MEC-inspired benchmark provide a reproducible testbed for future research on dynamic multi-UAV path planning, including extensions to full 3-D kinematics, robustness to perception uncertainty, the joint optimization of trajectories, and computation offloading in swarm-scale deployments.

Abstract

Emerging multi-unmanned aerial vehicle (multi-UAV) applications in smart cities must navigate cluttered airspace while meeting tight mobile edge computing (MEC) deadlines. Classical grid planners, including A-star (A*), D-star Lite (D* Lite), and conflict-based search with D-star Lite (CBS-D*), and metaheuristics such as particle swarm optimization (PSO), either replan too slowly in dynamic scenes or waste energy on long detours. This paper presents PPO-adjusted incremental refinement (PAIR), a decentralized hybrid planner that couples an A* global backbone with a continuous PPO refinement module for multi-UAV navigation on two-dimensional (2-D) urban grids. A* produces feasible waypoint routes, while a shared risk-aware PPO policy applies local offsets from a compact state encoding. MEC tasks are allocated by a separate heterogeneous scheduler; PPO optimizes geometric objectives (path length, risk, and a normalized propulsion-energy surrogate). Across nine benchmark scenarios with static and Markovian dynamic obstacles, PAIR achieves 100% mission success (matching the strongest baselines) while delivering the best energy surrogate (104.9 normalized units) and shortest mean travel time (207.8 s) on a reproducible 100 × 100 grid at fixed UAV speed. Relative to the strongest non-learning baseline (PSO), PAIR reduces energy by about 4% and travel time by about 3%, and yields roughly 10–20% gains over the remaining planners. An obstacle-density sweep with 5–30 moving obstacles further shows that PAIR maintains shorter paths and the lowest cumulative replanning time, supporting real-time multi-UAV navigation in dynamic urban MEC environments.

1. Introduction

Unmanned aerial vehicles (UAVs) are increasingly deployed for critical missions, from disaster response to precision agriculture, but their onboard processors cannot keep pace with the high-resolution LiDAR, video, and sensor streams required for real-time inference [1,2,3]. Deep neural networks (DNNs) deliver high-fidelity perception yet exceed UAV size, weight, and power budgets when executed locally. This has motivated several mobile edge computing (MEC) schemes that offload DNN layers to ground or edge servers [4,5,6]. At the same time, multi-UAV operation adds a second layer of complexity, requiring continuous deconfliction under dynamic obstacles and inter-agent interactions. These safety checks can severely limit real-time replanning on embedded hardware, making heavy iterative optimization impractical.
In dynamic MEC operations, path planning must account for both aerial navigation and real-time computational offloading, even as the physical and wireless environments fluctuate. For example, intermittent wireless connectivity may interrupt DNN layer uploads or downloads mid-flight [7,8], while variable network load can cause MEC task deadlines to shift unpredictably [9]. At the same time, unplanned no-fly zones [10], such as temporary airspace restrictions, or urgent edge-computing requests can force UAVs to abandon their original routes or reassign their roles during a mission [11]. These coupled challenges require planners to find safe paths around both static and moving obstacles while ensuring that each offloaded computational task meets its deadline. Thus, they must adapt continuously to link failures, deadline changes, and emergent airspace constraints in real time.
Given a 2-D urban grid G = (V, E) with static obstacles, time-varying dynamic obstacles, and N UAVs with start–goal pairs {(s_i, g_i)}_{i=1}^{N}, our goal is to generate collision-free trajectories that (i) respect no-fly regions and a minimum separation distance d_safe from dynamic obstacles and other UAVs, and (ii) minimize geometric mission cost (path length and a propulsion-energy surrogate) while maintaining real-time replanning capability. The airspace is modeled as a fixed-altitude 2-D projection for controlled, reproducible evaluation. A heterogeneous task-assignment model provides a MEC-inspired operating context by pre-assigning tasks to UAVs under CPU/memory/battery/deadline constraints; however, in the current implementation, the learning agent optimizes navigation-only signals (distance, aggregated risk, and energy surrogate), and MEC decisions are not included in the PPO reward.
Classical graph-based planners (A* (A-star), D* Lite (D-star Lite), and CBS-D* (conflict-based search with D-star Lite)) guarantee static collision-free routes but suffer from replanning latency and deadlocks under dynamic obstacles [12,13]. Metaheuristics such as Particle Swarm Optimization (PSO) continuously adapt but require careful parameter tuning and may converge slowly in cluttered urban layouts [14,15,16]. Hybrid reinforcement learning (RL) planners refine coarse waypoints through local, risk-aware corrections yet lack unified, mission-level evaluations across dynamic scenarios [17].
To address the joint challenges of airspace dynamics and MEC offloading, we propose PAIR, a decentralized hybrid framework integrating a fast A* global backbone with a continuous PPO-based refinement module.
Our key contributions are as follows:
  • MEC-inspired formulation: We model multi-UAV navigation under a MEC-inspired operating context where heterogeneous task profiles and deadlines motivate time-sensitive replanning. In our simulation experiments, task assignment is computed offline by a separate model and remains constant during navigation experiments; the PAIR learning component optimizes navigation-only signals.
  • Hybrid PAIR architecture: We propose a two-stage scheme where A* computes global routes and a PPO agent applies real-time trajectory corrections based on local risk information. In addition, a heterogeneous MEC task-assignment model (Section 3.5) computes resource-feasible task-to-UAV assignments offline subject to CPU/memory/battery and deadline constraints; these assignments are made constant during the navigation experiments.
  • Unified yet transparent evaluation: We compare PAIR against classical planners (A*, D* Lite, and CBS-D*) and PSO on nine 2-D urban grid scenarios with static/dynamic obstacles. PAIR achieves 100% success, 104.9 normalized energy units, and 207.8 s travel time, outperforming baselines in energy and latency. A ‘unified score’ summarizes path quality, with separate reporting of success, energy, and time to reveal trade-offs.
  • Reproducible 2-D benchmark and density sweep: We release a reproducible 2-D fixed-altitude benchmark with procedurally generated maps and Markovian dynamic obstacles, validated for coverage, connectivity, and clustering, together with a single-map obstacle-density sweep from 5 to 30 dynamic obstacles. The dataset is intended to be an open baseline for future multi-UAV navigation work in dynamic MEC-inspired environments, while full 3-D flight dynamics and hardware-in-the-loop validation are explicitly left to future work.
The remainder of this paper is organized as follows. Section 1 introduces the problem context and our key contributions. Section 2 reviews related work on discrete, metaheuristic, and RL-based UAV path planning. Section 3 presents our system model, problem formulation, and evaluation metrics. Section 4 details the proposed PAIR framework. Section 5 describes the simulation setup. Section 6 reports the comparative results and analysis. Section 7 discusses the observed trade-offs and practical considerations. Finally, Section 8 concludes the paper and outlines directions for future work.

2. Related Work

2.1. Collaborative DNN Partitioning and Offloading

Collaborative DNN inference hinges on dividing a network across devices to leverage the power of distributed computing. Early MEC frameworks proposed joint model partitioning and offloading between edge servers and mobile devices, targeting latency reduction and resource efficiency [6,18,19]. Fine-grained model splitting with asynchronous actor–critic methods further reduces inference delay under dynamic workloads [20,21]. Multi-UAV strategies distribute early network layers across aerial agents to accelerate inference but often overlook UAV mobility and link variability, causing suboptimal and unbalanced workloads [22,23]. However, none of these works simultaneously addresses dynamic obstacle avoidance in combination with DNN offloading. This gap is one of the key motivating factors for the integrated view of our PAIR framework.

2.2. Discrete and Incremental Path Planners

Graph-based planners remain foundational for UAV trajectories. A* finds optimal paths on static grids but requires complete replanning when obstacles shift [24], causing latency spikes in dynamic environments [12]. D* Lite, on the other hand, incrementally adjusts its routes to avoid moving obstacles but degrades under heavy dynamicity [13]. To address this drawback, Conflict-Based Search with D* Lite (CBS-D*) combines the incremental repair capabilities of D* Lite with a high-level conflict resolution mechanism to guarantee collision-free trajectories for individual agents. However, when scaled to multiple UAVs and concurrent paths, CBS-D* suffers from substantial replanning overhead: every moving obstacle or inter-agent conflict triggers both low-level D* Lite repairs and high-level conflict-tree updates. This becomes problematic in rapidly changing environments; as the number of agents or the obstacle update rate increases, latency spikes and overly conservative detours degrade both responsiveness and path optimality [17]. To overcome these replanning bottlenecks, some researchers have turned to continuous and population-based search methods.

2.3. Metaheuristic Search

Swarm-intelligence algorithms such as PSO encode waypoints as particles updated by individual and global bests; this enables continuous adaptation to non-convex, time-varying cost surfaces without grid constraints. PSO can navigate complex obstacle fields but usually requires careful tuning of its inertia and acceleration coefficients and may converge slowly or become trapped in dynamic scenarios [14,15]. While PSO can escape local minima better than pure grid methods, its lack of learned experience motivates hybrid RL planners that adapt to situational data [17,25].

2.4. RL-Driven Hybrid Planners

Hybrid planners integrate coarse discrete paths with RL-based local corrections. For example, several RL planners leverage deep Q-learning to refine A* or D* Lite trajectories, balancing distance, collision risk, and energy costs for smoother, safer routes [25,26]. These approaches adapt to moving obstacles but lack comprehensive real-time evaluations under unified mission-level criteria [27], and they usually focus on preliminary proof-of-concept scenarios or isolated metrics. Against this backdrop, this study provides a unified mission-level evaluation across realistic maps and planner classes.

2.5. Joint Path Planning and Task Offloading

In multi-UAV MEC networks, trajectory design and computation offloading are fundamentally intertwined. The chosen flight paths determine the instantaneous channel quality and transmission delays, while the spatial–temporal distribution of service requests dictates optimal waypoints and loitering patterns. Lou et al. propose a deep reinforcement learning framework that jointly learns UAV waypoints and offloading policies by embedding the deadline, energy, and security constraints directly into the RL reward function [28,29]. This cross-layer approach simultaneously optimizes UAV speeds, waypoint selection, and edge–cloud partitioning decisions to minimize end-to-end latency and energy consumption while maintaining resilient service in dynamic 6G smart-city environments. However, although these works ensure inter-UAV collision avoidance through safe-distance constraints, they assume an obstacle-free airspace and do not model more nuanced environmental obstacles (such as buildings and no-fly zones) [30]. As a result, real-time obstacle detection and avoidance remain unaddressed. This leaves a critical gap for deployments in cluttered or rapidly changing urban environments. Compared with RL-Planner [17] and recent joint trajectory/offloading DRL schemes [28,29], PAIR targets a complementary regime. RL-Planner focuses on single-UAV MEC scenarios and employs deep Q-learning to refine discrete paths but does not model multi-agent interactions or dense dynamic obstacles. Joint DRL works such as [28,29] co-optimize UAV trajectories and computation offloading decisions yet typically assume obstacle-free airspace and represent interference only through aggregate channel models.
In contrast, PAIR combines an A* global backbone with continuous PPO-based corrections in a multi-UAV setting with explicit static and dynamic obstacles, performs a unified mission-level evaluation (success rate, energy, time, and path-quality score), and integrates a heterogeneous task-assignment model to account for MEC resource heterogeneity, even though the current PPO agent is trained solely on navigation-related rewards.
To fill this gap, in Section 4 we introduce PAIR, a hybrid A* with PPO framework that combines dynamic obstacle avoidance with a MEC-aware task-assignment layer in a unified system architecture; in the present implementation, the PPO agent optimizes geometric navigation (path length, local risk, and energy), while offloading decisions and deadline satisfaction are governed by the task-assignment model rather than the RL reward.
Several recent works have explored intelligent path planning in dynamic environments using deep reinforcement learning and hybrid methods. Yang et al. propose a DRL-based planner for dynamic scenes that models the environment as an MDP and uses an improved D3QN to generate collision-free paths under moving obstacles [31]. Wu et al. combine an enhanced Informed-RRT* sampler with a dynamic window approach for local obstacle avoidance, improving path quality and real-time performance in cluttered maps [32]. Off-policy DRL schemes such as RPL-TD3 have also been applied to UAV trajectory planning with recurrent feature extractors for temporal dynamics [33]. In addition, Wang et al. introduce a reinforcement-learning-driven continuous ant colony optimizer for multi-UAV path planning in complex terrain [34], while recent surveys in Drones systematically classify multi-UAV path-planning algorithms and highlight open problems in dynamic, cluttered environments [35,36].
Compared with these studies, PAIR targets a complementary design space. It integrates a lightweight A* backbone with a shared PPO policy for continuous local refinement, focuses explicitly on multi-UAV interaction with static and Markovian dynamic obstacles on a controlled 2-D benchmark, and reports unified mission-level metrics (success rate, normalized path quality, energy, and time) across nine scenarios and an obstacle-density sweep. Rather than competing with specific DRL architectures, PAIR is intended to be a reusable hybrid baseline and open benchmark for dynamic multi-UAV navigation in MEC-inspired urban environments. Table 1 summarizes a qualitative comparison of the planners considered in this study, highlighting their main strengths and limitations.

3. System Model and Problem Statement

3.1. Environment Model

We consider a fleet of N homogeneous UAVs acting as mobile edge servers over a smart-city region discretized into a 2-D grid G = (V, E). Each grid vertex v ∈ V represents a feasible waypoint, and each edge (u, v) ∈ E denotes a straight-line flight corridor. UAV i must navigate from a fixed start s_i to a goal g_i while the following hold:
  • Dynamic Obstacles: Mobile obstacles (such as cranes, other flying objects, and temporary no-fly zones) appear unpredictably.
  • Airspace Constraints: Altitude bounds [h_min, h_max] and no-fly zones NFZ ⊂ V prohibit certain waypoints.
  • Inter-UAV Separation: A safe distance d_safe must be maintained between any two UAVs.
In this study, UAV motion is restricted to a single horizontal plane at a representative flight altitude, so the 3-D airspace is approximated by a 2-D grid projection. Altitude bounds and no-fly zones are therefore modeled as planar regions over G; extending PAIR to full 3-D kinematics is left for future work. For clarity, Table 2 lists the key symbols and variables used throughout the paper.
We use i ∈ {1, …, N} to index UAVs, j to index waypoints along a planned path, and t to index discrete simulator time steps. The start and goal locations for UAV i are denoted by s_i and g_i. Path lengths L_{A,p} are non-negative by definition, with L_{A,p} = 0 used to encode failures (collision or deadlock). The grid environment and safety constraints follow standard multi-robot planning formulations [13,17], while the MEC context and task constraints are consistent with common UAV-assisted MEC models [9,28,29].

3.2. Objectives

Define the multi-objective cost as
J = α E_total + (1 − α) T_total,  α ∈ [0, 1],
where
E_total = Σ_{i=1}^{N} Σ_{j=1}^{|P_i|−1} e(p_{i,j}, p_{i,j+1}),
T_total = Σ_{i=1}^{N} Σ_{j=1}^{|P_i|−1} τ(p_{i,j}, p_{i,j+1}),
and P_i = [p_{i,1}, …, p_{i,|P_i|}] is the discrete waypoint sequence for UAV i.
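As a concrete reading of the cost above, the sketch below evaluates J for a set of waypoint paths. The per-edge energy e(·) and time τ(·) models here are illustrative placeholders (distance-proportional at fixed speed, with assumed `speed` and `energy_per_unit` parameters); the paper leaves these functions abstract.

```python
import math

def multi_objective_cost(paths, alpha=0.5, speed=1.0, energy_per_unit=1.0):
    """Weighted mission cost J = alpha * E_total + (1 - alpha) * T_total.

    `paths` is a list of waypoint sequences [(x, y), ...], one per UAV.
    Energy and time per edge are modeled as proportional to Euclidean
    edge length (an assumption for illustration only).
    """
    e_total = t_total = 0.0
    for path in paths:
        for (x1, y1), (x2, y2) in zip(path, path[1:]):
            d = math.hypot(x2 - x1, y2 - y1)
            e_total += energy_per_unit * d   # e(p_{i,j}, p_{i,j+1})
            t_total += d / speed             # tau(p_{i,j}, p_{i,j+1})
    return alpha * e_total + (1 - alpha) * t_total
```

With alpha = 1 the cost reduces to pure energy; with alpha = 0 it reduces to pure travel time, mirroring the trade-off knob in the formulation.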

3.3. Decision Variables

  • Waypoints: P_i ⊂ V, the ordered list of grid points for UAV i.
  • Local Corrections: Δ_{i,j} ∈ R², offsets applied at waypoint p_{i,j} to avoid imminent collisions in the horizontal plane.

3.4. Constraints

1. Boundary Conditions:
p_{i,1} = s_i,  p_{i,|P_i|} = g_i,  ∀i
2. No-Fly Zones:
p_{i,j} ∉ NFZ,  ∀i, j
3. Dynamic Obstacle Avoidance: At time t, denoting the location of obstacle o by o(t),
‖p_{i,j}(t) + Δ_{i,j} − o(t)‖ ≥ d_safe,  ∀i, j, o
4. Inter-UAV Safety:
‖(p_{i,j}(t) + Δ_{i,j}) − (p_{k,ℓ}(t) + Δ_{k,ℓ})‖ ≥ d_safe,  ∀i ≠ k, ∀j, ℓ
5. Replanning Latency:
Latency_plan ≤ T_plan,max

3.5. Task Generation Model

We consider a set of heterogeneous service requests (tasks) T = {1, …, M} generated by the application layer. Each task j ∈ T is described by its CPU workload w_j (core-seconds), memory requirement m_j (MB), and deadline d_j (s) from the issue time. For realism we draw
w_j ~ U(50, 500),  m_j ~ U(100, 1000),  d_j ~ U(10, 60),
which roughly reflects lightweight to moderate IoT/MEC analytics tasks. Each UAV i has heterogeneous resources (C_i^cpu, C_i^mem, B_i^bat) in CPU units, memory units, and remaining battery capacity. We introduce a binary assignment variable
x_{i,j} = 1 if task j is assigned to UAV i, and 0 otherwise,
and impose standard capacity and deadline constraints:
Σ_{i=1}^{N} x_{i,j} = 1,  ∀j ∈ T,
Σ_{j=1}^{M} w_j x_{i,j} ≤ C_i^cpu,  ∀i,
Σ_{j=1}^{M} m_j x_{i,j} ≤ C_i^mem,  ∀i,
Σ_{j=1}^{M} e_j x_{i,j} ≤ B_i^bat,  ∀i,
τ_{i,j} x_{i,j} ≤ d_j,  ∀i, j,
where e_j is the estimated energy to execute task j and τ_{i,j} is the predicted execution time of j on UAV i.
In this work, the heterogeneous assignment model in (10)–(14) is used purely as a fixed MEC-inspired context: tasks are pre-assigned offline based on their resource profiles, and these assignments remain constant during path-planning experiments. The PPO agent is trained and evaluated only on navigation-related signals (distance, local risk, and a propulsion-energy surrogate), and offloading decisions do not appear in the RL reward. Consequently, we treat MEC as a background configuration that shapes which UAV serves which task, while the scientific contribution of PAIR lies in the hybrid path-planning algorithm itself.
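For intuition, a minimal first-fit scheduler respecting the capacity and deadline constraints above might look as follows. This is a sketch, not the paper's actual heterogeneous scheduler: the energy model e_j, the execution-time model τ_{i,j}, and all field names (`w`, `m`, `d`, `cpu_rate`, etc.) are hypothetical stand-ins.

```python
def greedy_assign(tasks, uavs):
    """First-fit assignment of tasks to UAVs under capacity/deadline
    constraints (illustrative sketch with assumed e_j and tau_ij models).

    tasks: list of dicts with w (CPU core-s), m (MB), d (deadline s).
    uavs:  list of dicts with cpu, mem, bat budgets and a cpu_rate.
    Returns the binary matrix x[i][j], or None if some task cannot fit.
    """
    x = [[0] * len(tasks) for _ in uavs]
    load = [{"cpu": 0.0, "mem": 0.0, "bat": 0.0} for _ in uavs]
    for j, t in enumerate(tasks):
        placed = False
        for i, u in enumerate(uavs):
            e_j = 0.1 * t["w"]             # assumed energy model
            tau = t["w"] / u["cpu_rate"]   # assumed execution-time model
            if (load[i]["cpu"] + t["w"] <= u["cpu"]
                    and load[i]["mem"] + t["m"] <= u["mem"]
                    and load[i]["bat"] + e_j <= u["bat"]
                    and tau <= t["d"]):
                x[i][j] = 1
                load[i]["cpu"] += t["w"]
                load[i]["mem"] += t["m"]
                load[i]["bat"] += e_j
                placed = True
                break
        if not placed:
            return None
    return x
```

Because assignments are computed offline and frozen during navigation experiments, even a simple feasibility-preserving heuristic like this suffices to provide the MEC-inspired context.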

3.6. Algorithmic Framework

We employ a two-stage hybrid scheme:
1.
Global Planner: A fast discrete backbone (such as A*) computes an initial waypoint sequence P i for each UAV i.
2.
Local Continuous Refinement: A PPO-based agent monitors P i for potential collisions or tight turns and applies continuous corrections Δ i , j as needed.

3.7. Performance Metrics

We evaluate each planner using the following:
  • Normalized Path Length (NPL): (L_actual − L_opt) / L_opt.
  • Energy Consumption (EC): Total propulsion energy E total .
  • Mission Success Rate (MSR): Fraction of UAVs reaching goals without collision.
  • Optimality Score (OS): Distance from the theoretical optimum.
  • Computation Overhead: Average replanning time per collision event.
  • Density-Sweep Scalability: How the key metrics (path length, steps to goal, and total planning time) vary as the number of dynamic obstacles increases from 5 to 30, allowing us to assess each planner’s performance under rising environmental complexity.

4. Proposed Framework

4.1. Global Planner Backbone

For each UAV, the initial path P i is generated on the grid graph G = ( V , E ) via a discrete planner, with four options to ensure robustness in dynamic settings:
Algorithms 1–3 summarize PAIR and the baseline planners.
  • A*: A classic heuristic search minimizing path length under static obstacle assumptions [12].
  • D* Lite: Incrementally repairs paths when the environment changes, avoiding full replanning [13].
  • CBS–D* Lite (Algorithm 2): Multi-agent conflict-based search over D* Lite to guarantee collision-free global paths under inter-UAV constraints [13].
  • PSO-Based Waypoint Search (Algorithm 3): Uses Particle Swarm Optimization to optimize a sequence of continuous waypoints balancing path length and obstacle clearance [14].
Algorithm 1 PAIR: decentralized PPO path planning with swarm coordination.
Require: UAV swarm size N, start/goal pairs {s_i, g_i}_{i=1}^{N}, grid G = (V, E), comms range d_comms
Ensure: Collision-free trajectories {P_i}_{i=1}^{N}
1: for all UAV i = 1, …, N in parallel do
2:   P_i ← A*(s_i, g_i)                        % global backbone
3:   N_i ← getNeighbors(i, d_comms)            % local swarm adjacency
4: end for
5: while any UAV not at goal do
6:   for all UAV i = 1, …, N in parallel do
7:     x_i ← currentPosition(i), v_i ← currentVelocity(i)
8:     p_next ← nextWaypoint(P_i)
9:     R_i ← updateRiskMap(i, N_i)             % risk field from obstacles + neighbors
10:    if isHighRisk(p_next, R_i) then
11:      s_i ← buildState(x_i, v_i, g_i, R_i)  % 6-D encoding from (15)
12:      Δ_i ← PPOAgent.correct(s_i)           % continuous offset from PPO
13:      smoothPath(P_i, Δ_i)                  % ensure feasibility
14:    else
15:      Δ_i ← 0
16:    end if
17:    executeStep(i, p_next + Δ_i)
18:    broadcast(i, x_i + Δ_i)                 % share updated pose for risk map
19:  end for
20: end while
Algorithm 2 CBS–D* Lite: integrated multi-agent path planning.
Require: start–goal pairs {(s_i, g_i)}_{i=1}^{N}, environment E, max iterations I, safe distance d_safe
Ensure: conflict-free trajectories {P_i}_{i=1}^{N}
1: Stage 1: Initial Planning
2: for all agent i = 1, …, N in parallel do
3:   P_i ← DStarLitePlan(s_i, g_i, E)
4: end for
5: Stage 2: Conflict Resolution
6: iteration ← 0
7: while iteration < I do
8:   detect conflict (i, j, t) in {P_k} under d_safe
9:   if no conflict then
10:    return {P_i}
11:  end if
12:  P_j ← rollback_and_replan(P_j, t)
13:  iteration ← iteration + 1
14: end while
15: return {P_i}
Algorithm 3 PSO: Particle Swarm Optimization path planning.
Require: start s, goal g, environment E, swarm size n, waypoints m, weights w, c_1, c_2, max speed v_max, iterations T
Ensure: collision-free, smoothed path P
1: Initialization:
2: generate {p_i}_{i=1}^{n} with m intermediate waypoints
3: initialize {v_i}_{i=1}^{n} ← 0
4: for i = 1 to n do
5:   evaluate f_i ← fitness(p_i); store personal best
6: end for
7: determine global best g_best
8: Main Loop:
9: for t = 1 to T do
10:  for i = 1 to n do
11:    update v_i; clamp to ±v_max
12:    p_i ← p_i + v_i; clamp to environment
13:    evaluate f_i; update personal/global bests
14:  end for
15: end for
16: Path Construction:
17: P_raw ← [s, g_best, g]
18: smooth P_raw to P
19: return P
We implement A*, D* Lite, CBS-D* Lite, and PSO as (i) direct baselines in our comparative study and (ii) optional backbone variants for the same simulation interface. In PAIR, A* is used as the default backbone to generate an initial discrete route, while PPO performs on-demand local refinements; the other three planners are evaluated as standalone planners (without PPO refinement) to quantify the benefit of the hybrid design under identical maps and safety checks.
Among these options, PAIR adopts A* as its default global backbone. The admissible Euclidean heuristic ensures shortest paths on static grids, the algorithm is lightweight enough for per-step replanning on embedded platforms, and the resulting paths provide a stable, interpretable reference that the learning-based refinement module can safely adjust. For the continuous correction stage we employ Proximal Policy Optimization (PPO), which directly outputs bounded continuous offsets and uses a clipped on-policy objective that is widely regarded as numerically stable for continuous-control tasks, avoiding the sensitivity of off-policy value-based methods in non-stationary multi-UAV environments.
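A minimal sketch of such an A* backbone on a 4-connected occupancy grid with the admissible Euclidean heuristic is given below, assuming unit edge costs; the paper's implementation may use a different neighborhood, cost model, or tie-breaking rule.

```python
import heapq
import math

def astar(grid, start, goal):
    """Minimal A* on a 4-connected occupancy grid (illustrative sketch).

    grid[r][c] == 1 marks a static obstacle; cells are (row, col) tuples.
    Uses the admissible Euclidean heuristic and unit step costs.
    Returns a waypoint list from start to goal, or None if unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    h = lambda p: math.hypot(p[0] - goal[0], p[1] - goal[1])
    open_heap = [(h(start), 0.0, start)]
    came, g = {start: None}, {start: 0.0}
    while open_heap:
        _, gc, cur = heapq.heappop(open_heap)
        if cur == goal:
            path = []
            while cur is not None:          # reconstruct via parent links
                path.append(cur)
                cur = came[cur]
            return path[::-1]
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (cur[0] + dr, cur[1] + dc)
            if not (0 <= nxt[0] < rows and 0 <= nxt[1] < cols):
                continue
            if grid[nxt[0]][nxt[1]] == 1:   # skip obstacle cells
                continue
            ng = gc + 1.0
            if ng < g.get(nxt, float("inf")):
                g[nxt], came[nxt] = ng, cur
                heapq.heappush(open_heap, (ng + h(nxt), ng, nxt))
    return None
```

Because the heuristic never overestimates the true cost on a unit grid, the returned route is shortest on static maps, which is exactly the stable reference the PPO refinement stage then perturbs locally.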

4.2. Local Continuous Refinement

To handle unpredictable, fast-moving obstacles and fine-tune energy-efficient trajectories, we introduce the PPO-Adjusted Incremental Refinement (PAIR) module. PAIR formulates on-demand local waypoint correction as a Markov Decision Process (MDP) [10]:
State 
We use a compact 6-D state encoding that matches our implementation and we keep the observation size fixed across swarm sizes:
s_t = [Δx_t, Δy_t, v̂_t^x, v̂_t^y, R̂_t(x_t), ĥ_t] ∈ R^6,
where x_t = (x_t, y_t) is the current 2-D position and g = (g_x, g_y) is the goal. Specifically, (i) Δx_t = (g_x − x_t)/L_max and Δy_t = (g_y − y_t)/L_max are normalized goal-direction offsets with L_max = √2 · grid_size (the map diagonal); (ii) v̂_t^x = v_t^x/v_max and v̂_t^y = v_t^y/v_max are normalized velocity components; (iii) R̂_t(x_t) ∈ [0, 1] is the normalized local risk scalar at the current position; and (iv) ĥ_t = h_t/L_max is the normalized Euclidean heuristic-to-go distance with h_t = ‖g − x_t‖₂.
Action 
a_t = [δ_x, δ_y],
is a bounded continuous offset in the horizontal plane, applied to refine the next discrete waypoint before execution. In our experiments the agent outputs only 2-D corrections; extending the action space to include altitude for full 3-D navigation is left to future work.
Reward 
r_t = −w_d · d(x_t, x_{t+Δt}) − w_r · R_t(x_t) − w_e · e(a_t) + r_goal,
where d(·) is the Euclidean distance term toward the next discrete waypoint, R_t(·) penalizes local risk exposure, e(a_t) = ‖a_t‖₂² is an instantaneous energy surrogate, and r_goal is a sparse terminal bonus on reaching the final goal.
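The state encoding and shaped reward can be sketched in a few lines. The weight values, goal bonus, and parameter defaults below are illustrative assumptions, not the trained configuration.

```python
import math

def build_state(pos, vel, goal, risk_norm, grid_size=100, v_max=1.0):
    """Compact 6-D PPO observation: normalized goal offset, normalized
    velocity, normalized local risk, and normalized distance-to-go."""
    l_max = math.sqrt(2) * grid_size                   # map diagonal
    dx = (goal[0] - pos[0]) / l_max
    dy = (goal[1] - pos[1]) / l_max
    vx, vy = vel[0] / v_max, vel[1] / v_max
    h = math.hypot(goal[0] - pos[0], goal[1] - pos[1]) / l_max
    return [dx, dy, vx, vy, risk_norm, h]

def step_reward(dist_to_wp, risk, action, reached_goal,
                w_d=1.0, w_r=1.0, w_e=0.1, goal_bonus=100.0):
    """Shaped reward: penalize distance to the next waypoint, local risk,
    and an ||a||^2 energy surrogate; sparse terminal bonus at the goal.
    All weights are placeholder values for illustration."""
    energy = action[0] ** 2 + action[1] ** 2
    r = -w_d * dist_to_wp - w_r * risk - w_e * energy
    return r + (goal_bonus if reached_goal else 0.0)
```

Keeping the observation at a fixed six dimensions, regardless of swarm size, is what lets a single shared policy be executed by any number of UAVs.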
At each simulator time step t, we rebuild a time-varying obstacle set (for UAV i)
O_t^(i) = O_static ∪ O_dyn(t) ∪ {x_k(t)}_{k ∈ N_i},
where O_static are fixed obstacles, O_dyn(t) are Markovian dynamic obstacles at time t (Section 5.2), and N_i is the neighbor set within communication range (used to enforce inter-UAV separation). The instantaneous risk field is then updated as
R_t(x) = Σ_{o ∈ O_t^(i)} max(0, (d_safe + r) − ‖x − o‖₂),
and the scalar observation used in Equation (15) is normalized by
R̂_t(x) = clip((R_t(x) − R_min) / (R_max − R_min), 0, 1),
where (R_min, R_max) are empirical bounds collected from training rollouts. Communication is simulated as a local broadcast of each UAV pose once per control step (i.e., every Δt), so neighbor influence enters R_t(·) at the same update rate.
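The hinge-style risk field and its min–max normalization reduce to a few lines; the `d_safe` and obstacle-radius `r` defaults below are placeholder values, not the experiment settings.

```python
import math

def risk_field(pos, obstacles, d_safe=2.0, r=0.5):
    """Instantaneous risk at `pos`: summed hinge penalties for every
    obstacle (static, dynamic, or neighbor UAV) closer than d_safe + r."""
    return sum(max(0.0, (d_safe + r) - math.hypot(pos[0] - ox, pos[1] - oy))
               for ox, oy in obstacles)

def normalize_risk(risk, r_min, r_max):
    """Min-max normalization with clipping to [0, 1]; (r_min, r_max) are
    empirical bounds collected from training rollouts."""
    z = (risk - r_min) / (r_max - r_min)
    return min(1.0, max(0.0, z))
```

Obstacles farther than d_safe + r contribute exactly zero, so the scalar observation reacts only to nearby threats, which keeps the state compact.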

4.2.1. PPO Training Details

We train the policy π θ ( a t s t ) using Proximal Policy Optimization (PPO) with the following settings:
  • Network architecture: Actor and critic are two-layer MLPs with 128 ReLU-activated units per layer.
  • Hyperparameters:
    Discount factor γ = 0.99 , GAE λ = 0.95 .
    PPO clip ϵ = 0.2 .
    Learning rates: actor 3 × 10^−4, critic 1 × 10^−3.
    Batch size: 2048, epochs per update: 50.
  • Training: Converges stably within 500,000 environment steps across varied urban scenarios.
All PPO hyperparameters are reported in Appendix A (Table A1). We use a standard PPO configuration [37,38] and keep it fixed across all benchmarks; beyond basic stability/convergence checks, we do not perform an exhaustive hyperparameter search, and a full sensitivity study is left to future work.
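For reference, the clipped surrogate at the heart of PPO (with ε = 0.2 as in the settings above) can be written as a per-sample loss. This is a standalone numeric sketch of the standard PPO objective, not the paper's full actor–critic implementation.

```python
def ppo_clip_loss(ratio, advantage, eps=0.2):
    """Per-sample PPO clipped surrogate loss (negated objective):
    L = -min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A).

    `ratio` is the probability ratio pi_theta(a|s) / pi_theta_old(a|s)
    and `advantage` is the GAE estimate for that sample.
    """
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return -min(ratio * advantage, clipped * advantage)
```

Clipping removes the incentive to push the ratio far from 1 in a single update, which is the property that makes PPO stable for the non-stationary multi-UAV refinement task.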

4.2.2. PAIR: Hybrid Discrete–Continuous Path Planning (Algorithm)

From a multi-agent learning perspective, PAIR adopts a simple parameter-sharing scheme rather than a full centralized-critic MAPPO formulation. A single PPO policy is trained and then executed independently by all UAVs; each agent observes only its own kinematic state and the locally aggregated risk scalar R ^ t ( x t ) described in (15). Neighboring UAVs influence the decision-making process only through this shared risk field and the occasional broadcast of updated poses, but their individual actions are neither coordinated through a joint action space nor optimized via a global value function.
During our empirical experiments, we observe that our independent, risk-aware PPO design is numerically stable for swarm sizes up to N = 9 in our benchmark scenarios, with no oscillatory behaviors or emergent deadlocks beyond those already handled by our collision checks. We use the term “decentralized” to denote decentralized execution with local observations and a shared policy. A formal convergence analysis of the resulting multi-agent dynamics is beyond the scope of this paper and is left for future work.

4.3. Unified Evaluation Metrics

To compare discrete, metaheuristic, and hybrid RL planners on a common scale, we report four primary metrics for each algorithm A: (i) a normalized path-quality score (“unified score”), (ii) mission success rate, (iii) average propulsion energy, and (iv) average travel time. The unified score is designed to be interpreted as a simple grade in [ 0 , 100 ] : on each benchmark scenario the shortest successful path receives 100 points, the longest successful path receives 0 points, and failed planners also receive 0. The overall unified score is then the average of these grades across the benchmark set. Success rate, energy, and time are reported separately so that no trade-offs are hidden.
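The scoring rule described above can be sketched as follows, with failures encoded as zero-length paths per the convention defined formally below.

```python
def unified_scores(lengths_by_algo):
    """Per-path scores and their average ("unified score") per algorithm.

    lengths_by_algo: {algo: [L_1, ..., L_P]}, with 0 encoding failure.
    On each path, the shortest successful length scores 100, the longest
    scores 0, and failures score 0. Returns {algo: mean score in [0, 100]}.
    """
    algos = list(lengths_by_algo)
    num_paths = len(next(iter(lengths_by_algo.values())))
    totals = {a: 0.0 for a in algos}
    for p in range(num_paths):
        ok = [lengths_by_algo[a][p] for a in algos if lengths_by_algo[a][p] > 0]
        lo, hi = (min(ok), max(ok)) if ok else (0.0, 0.0)
        for a in algos:
            L = lengths_by_algo[a][p]
            if L <= 0:
                score = 0.0          # failure (collision or deadlock)
            elif hi == lo:
                score = 100.0        # all successful paths tie
            else:
                score = 100.0 * (hi - L) / (hi - lo)
            totals[a] += score
    return {a: totals[a] / num_paths for a in algos}
```

Because the normalization is per-path, an algorithm that fails on some maps cannot be compensated by very short paths elsewhere; success rate, energy, and time are still reported separately.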
1. Normalized path-quality score (unified score). Let $P$ denote the total number of benchmark trajectories, and let $L_{A,p}$ be the realized path length (in grid units) produced by algorithm $A$ on path $p$. We treat $L_{A,p} = 0$ as a failure (collision or deadlock). For each path $p$, we consider only the successful planners and define
$$L_{\min,p} = \min_{B:\, L_{B,p} > 0} L_{B,p}, \qquad L_{\max,p} = \max_{B:\, L_{B,p} > 0} L_{B,p}.$$
The per-path score $S_{A,p}$ assigns 100 points to the shortest successful path (or to all successful planners if all successful paths have the same length) and scales down to 0 for failures or relatively longer paths:
$$S_{A,p} = \begin{cases} 100\,\dfrac{L_{\max,p} - L_{A,p}}{L_{\max,p} - L_{\min,p}}, & L_{A,p} > 0 \text{ and } L_{\max,p} > L_{\min,p},\\[4pt] 100, & L_{A,p} > 0 \text{ and } L_{\max,p} = L_{\min,p},\\ 0, & L_{A,p} = 0. \end{cases}$$
Because $L_{A,p}$ denotes a realized geometric path length, it satisfies $L_{A,p} \ge 0$ by construction. In our implementation, failures (collision or deadlock) are encoded as $L_{A,p} = 0$ and are assigned score 0 in Equation (22); the case $L_{A,p} < 0$ therefore cannot occur.
The overall “unified score” of algorithm $A$ is then the simple average over all benchmark paths
$$\bar{S}_A = \frac{1}{P} \sum_{p=1}^{P} S_{A,p},$$
which by construction lies in $[0, 100]$ and summarizes relative path quality across the benchmark set. Importantly, this length-based normalization is used only to aggregate path quality; the success rate, average energy, and average travel time are reported as separate metrics (Table 3) so that energy–latency trade-offs are not hidden by the unified score.
2. Mission success rate. For completeness we also report the mission success rate of algorithm $A$, defined as
$$R_A = \frac{1}{P} \sum_{p=1}^{P} \mathbf{1}\{L_{A,p} > 0\},$$
i.e., the fraction of paths completed without collision or deadlock.
3. Average energy consumption. For each trajectory we compute a propulsion-energy surrogate $E_{A,p}$ as a convex function of path length and assign a large penalty to failures:
$$E_{A,p} = \begin{cases} \alpha L_{A,p} + \beta L_{A,p}^{2}, & L_{A,p} > 0,\\ E_{\text{fail}}, & L_{A,p} = 0, \end{cases}$$
with fixed $\alpha > 0$, $\beta > 0$, and $E_{\text{fail}}$. We report the average $\bar{E}_A = \frac{1}{P} \sum_{p=1}^{P} E_{A,p}$. This should be interpreted as a normalized propulsion-cost surrogate for relative comparison between planners under the same model, not as a calibrated energy estimate for a specific airframe.
4. Average travel time. Assuming a constant UAV speed $v$ (in grid units/s), the travel time for path $p$ is
$$T_{A,p} = \begin{cases} L_{A,p}/v, & L_{A,p} > 0,\\ 0, & L_{A,p} = 0, \end{cases}$$
and we report $\bar{T}_A = \frac{1}{P} \sum_{p=1}^{P} T_{A,p}$ as the average travel time.
5. Path optimality score (per-path visualization). For some figures we additionally use a per-path optimality score
$$O_{A,p} = \begin{cases} 100\,\dfrac{L_{\min,p}}{L_{A,p}}, & L_{A,p} > 0,\\ 0, & L_{A,p} = 0, \end{cases}$$
which directly measures how close a successful path is to the best successful path on that scenario. These values are used in per-path heatmaps but are not reported as a single scalar in Table 3.
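All four primary metrics can be computed from a single matrix of realized path lengths. The NumPy sketch below is illustrative: `alpha` and `beta` are placeholder surrogate coefficients, while `e_fail = 1000` and `v = 2` grid-units/s follow the values stated in the paper.

```python
import numpy as np

def unified_metrics(lengths, alpha=1.0, beta=0.01, e_fail=1000.0, v=2.0):
    """Compute unified score, success rate, average energy, and average
    travel time from a (num_algorithms, num_paths) array of realized path
    lengths, where 0 encodes failure (collision or deadlock)."""
    L = np.asarray(lengths, dtype=float)
    ok = L > 0  # success mask, one row per algorithm

    # Per-path min/max over the successful planners only.
    lmin = np.where(ok, L, np.inf).min(axis=0)
    lmax = np.where(ok, L, -np.inf).max(axis=0)
    span = np.where(lmax > lmin, lmax - lmin, 1.0)

    # Per-path score, Eq. (22): 100 for the shortest successful path,
    # 0 for the longest or for failures; 100 to all planners on a tie.
    rel = np.where(lmax > lmin, (lmax - L) / span, 1.0)
    score = np.where(ok, 100.0 * rel, 0.0)

    energy = np.where(ok, alpha * L + beta * L**2, e_fail)  # Eq. (25)
    time = np.where(ok, L / v, 0.0)
    return {
        "unified_score": score.mean(axis=1),      # S_bar_A, Eq. (23)
        "success_rate": 100.0 * ok.mean(axis=1),  # in percent
        "avg_energy": energy.mean(axis=1),
        "avg_time": time.mean(axis=1),
    }
```

For example, with two algorithms over two paths where the first algorithm fails on path 2, the failure contributes 0 score points and the full `e_fail` penalty to that algorithm's averages, exactly as in the definitions above.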

5. Simulation Setup

We evaluate all planners on nine procedurally generated urban maps (Paths 1–9) with a constant swarm size of N = 9 UAVs for all compared algorithms, ensuring an identical multi-agent scale across the baselines and PAIR (Urban Multi-UAV Path Planning Simulation Dataset, https://doi.org/10.6084/m9.figshare.30787730). We do not sweep swarm size in this paper; all results use N = 9, and scalability to larger N is left as future work.

5.1. Map Generation and Validation

Static obstacles are placed via Poisson-disc sampling on a 1 × 1 normalized grid (100 × 100 cells) with minimum spacing r min = 0.1 . Each map is accepted only if it meets the following:
  • Coverage: obstacle density in [15%, 40%];
  • Connectivity: BFS confirms at least one collision-free path for every start–goal pair;
  • Clustering: Hopkins statistic greater than 0.75 for realistic spatial patterns.
These procedurally generated maps form a 2-D fixed-altitude benchmark rather than a photorealistic 3-D environment. Our goal is to provide a controlled, reproducible testbed where obstacle density, spatial clustering, and dynamic updates can be systematically swept, not to emulate a specific real city. To facilitate independent verification and reuse, we release all nine maps and the corresponding generator as an open “Urban Multi-UAV Path Planning Simulation Dataset” (see https://doi.org/10.6084/m9.figshare.30787730).
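A minimal version of this generate-and-validate loop, assuming simple dart-throwing Poisson-disc sampling and omitting the Hopkins clustering test, could look as follows; function names, the obstacle-count target, and the rasterization radius are ours, not taken from the released generator.

```python
import random
from collections import deque

def poisson_disc_obstacles(n_target=60, r_min=0.1, max_tries=10_000, seed=0):
    """Dart-throwing Poisson-disc sampling on the unit square: accept a
    candidate only if it is at least r_min from every accepted point."""
    rng = random.Random(seed)
    pts, tries = [], 0
    while len(pts) < n_target and tries < max_tries:
        tries += 1
        c = (rng.random(), rng.random())
        if all((c[0] - p[0])**2 + (c[1] - p[1])**2 >= r_min**2 for p in pts):
            pts.append(c)
    return pts

def rasterize(pts, cells=100, radius=0.05):
    """Mark 100x100 grid cells within `radius` of any obstacle centre as blocked."""
    grid = [[False] * cells for _ in range(cells)]
    for i in range(cells):
        for j in range(cells):
            x, y = (i + 0.5) / cells, (j + 0.5) / cells
            if any((x - px)**2 + (y - py)**2 < radius**2 for px, py in pts):
                grid[i][j] = True
    return grid

def bfs_reachable(grid, start, goal):
    """4-connected BFS used for the connectivity check."""
    n = len(grid)
    if grid[start[0]][start[1]] or grid[goal[0]][goal[1]]:
        return False
    seen, q = {start}, deque([start])
    while q:
        i, j = q.popleft()
        if (i, j) == goal:
            return True
        for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ni, nj = i + di, j + dj
            if 0 <= ni < n and 0 <= nj < n and not grid[ni][nj] and (ni, nj) not in seen:
                seen.add((ni, nj))
                q.append((ni, nj))
    return False

def map_is_valid(grid, pairs):
    """Accept the map only if obstacle density lies in [15%, 40%] and every
    start-goal pair is BFS-connected (Hopkins clustering check omitted here)."""
    n = len(grid)
    density = sum(map(sum, grid)) / (n * n)
    return 0.15 <= density <= 0.40 and all(bfs_reachable(grid, s, g) for s, g in pairs)
```

Maps failing either check are discarded and regenerated, which is what makes the benchmark statistics controllable.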

5.2. Dynamic Obstacle Generation

Each map includes N dyn mobile obstacles (radius r = 0.05 ):
1. Initial Placement: Sampled uniformly in $\mathcal{V} \setminus (\mathrm{NFZ} \cup \{\text{UAV start positions}\})$, with minimum distance $d_{\text{safe}} + r$ from all static obstacles and UAVs.
2. Mobility Model: A three-state Markov chain, Move [10], Pause, Detour, with parameters:
$$\begin{aligned} \text{Move}:\ & v \sim U(0, v_o),\quad \theta(t) = \theta(t-1) + \Delta\theta,\quad \Delta\theta \sim U\!\left(-\tfrac{\pi}{4}, \tfrac{\pi}{4}\right),\quad T_{\text{move}} \sim \mathrm{Exp}(\lambda_m).\\ \text{Pause}:\ & v = 0,\quad T_{\text{pause}} \sim \mathrm{Exp}(\lambda_p).\\ \text{Detour}:\ & v \sim U(0, v_o),\quad \theta \leftarrow \theta \pm \tfrac{\pi}{2},\quad T_{\text{det}} \sim \mathrm{Exp}(\lambda_d). \end{aligned}$$
Transitions occur after each timer expiry or if a Move would intersect a static obstacle (triggering Detour).
3. Boundary Reflection: Any trajectory crossing a map edge is reflected back into the domain.
4. Parameters: $r = 0.05$, $v_o = 0.02$ grid-units/step; no new obstacles spawn after $t = 0$.
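The mobility model above can be sketched as a per-step simulation loop. In this illustrative version, the exponential rate parameters `lam_m`, `lam_p`, `lam_d` are placeholders, since the paper does not report their values; the state set, heading jitter, 90-degree detour turns, and boundary reflection follow the description above.

```python
import math
import random

def simulate_obstacle(steps=200, v_o=0.02, lam_m=0.1, lam_p=0.2, lam_d=0.5, seed=0):
    """One dynamic obstacle under the three-state Markov mobility model
    (Move / Pause / Detour). Trajectories crossing a map edge are
    reflected back into the unit square [0, 1]^2."""
    rng = random.Random(seed)
    x, y = rng.random(), rng.random()
    theta = rng.uniform(-math.pi, math.pi)
    state, timer = "move", rng.expovariate(lam_m)
    traj = []
    for _ in range(steps):
        if state == "move":
            theta += rng.uniform(-math.pi / 4, math.pi / 4)  # heading jitter
            v = rng.uniform(0.0, v_o)
        elif state == "pause":
            v = 0.0
        else:  # detour: keep the +-90 degree heading applied at state entry
            v = rng.uniform(0.0, v_o)
        x += v * math.cos(theta)
        y += v * math.sin(theta)
        # Boundary reflection back into the unit square.
        if x < 0.0:
            x, theta = -x, math.pi - theta
        elif x > 1.0:
            x, theta = 2.0 - x, math.pi - theta
        if y < 0.0:
            y, theta = -y, -theta
        elif y > 1.0:
            y, theta = 2.0 - y, -theta
        traj.append((x, y))
        timer -= 1.0
        if timer <= 0.0:  # exponential timer expired: draw the next state
            state = rng.choice(["move", "pause", "detour"])
            timer = rng.expovariate({"move": lam_m, "pause": lam_p, "detour": lam_d}[state])
            if state == "detour":
                theta += rng.choice([-1, 1]) * math.pi / 2
    return traj
```

Because the per-step displacement is bounded by `v_o`, one reflection per axis suffices to keep every sampled position inside the map.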

5.3. Variable Obstacle Density Sweep

To further probe replanning performance, we run each planner on a single canonical start–goal scenario, sweeping the number of dynamic obstacles over { 5 , 10 , 15 , 20 , 25 , 30 } (animated demonstrations for each obstacle density are available on GitHub: GHRABAT/Impact-of-Obstacle-Density, https://github.com/GHRABAT/Impact-of-Obstacle-Density/tree/v1.0.0 (accessed on 6 January 2026)). This sweep acts as a controlled single-map stress test that isolates the effect of obstacle density on path length, steps to goal, and cumulative planning time, rather than a full statistical study over many urban layouts.
The obstacle-density sweep is conducted on a single canonical map (not all nine maps) in order to isolate the effect of obstacle count. Each plotted point in Section 6.4 is the mean over S = 3 random seeds controlling the dynamic-obstacle trajectories, with shaded bands indicating ±1 standard deviation. The non-monotonic behavior (especially for PSO) therefore reflects stochastic convergence on that fixed map rather than a population-wide statistic over all scenarios, and the sweep should be interpreted as a controlled single-map stress test. Extending it to multiple maps will be pursued in future work.

5.4. UAV Kinematics and Metrics

Each UAV flies at v uav = 2 grid-units/s, so a path of length L takes T = L / v uav seconds. We record mission success rate, energy consumption, travel time, and unified score (Section 4).

5.5. Benchmark Variation Grid

The nine scenarios span three representative static obstacle densities (15%, 25%, and 40%), with both clustered and dispersed spatial patterns and both static-only and mixed static+dynamic obstacle configurations. Instead of exhaustively enumerating all combinations, we select nine representative cases that cover easy, moderate, and highly cluttered layouts under static and dynamic conditions, thereby testing planner robustness across realistic extremes while keeping the benchmark size manageable for reproducible experiments.

5.6. Planner Configurations

For PAIR, we fix the inter-UAV communication radius used to form the neighbor set to d comms = 0.2 grid units (that is, 20 cells under the 100 × 100 discretization), and we keep this value constant across all planners. We compare five planners with these parameter settings:
  • A*: 4-connected grid, Euclidean heuristic, no replanning on collision.
  • D* Lite: Same heuristic, incremental replanning up to once per time step.
  • CBS-D* Lite: Conflict-Based Search over D* Lite with up to 10 banned cells and a 2-waypoint rollback window.
  • PSO: 30 particles optimizing 8 intermediate waypoints, inertia w = 0.7 , c 1 = c 2 = 1.4 , 200 iterations per replanning.
  • PAIR: Hybrid A* + PPO with
    Network: Two-layer MLPs (128 ReLU units each).
    Hyperparameters: $\gamma = 0.99$, GAE $\lambda = 0.95$, clip $\epsilon = 0.2$, actor LR $3 \times 10^{-4}$, critic LR $1 \times 10^{-3}$, batch size 2048, 50 epochs/update, 500k total steps.
    Reward weights:  ( w d , w r , w e ) = ( 1.0 , 0.5 , 0.1 ) .

5.6.1. Statistical Analysis

For each planner we evaluate nine benchmark paths (Paths 1–9 in our scenario set). Unless otherwise stated, the scalar metrics in Table 3 are reported as mean ± standard deviation across these nine scenarios. For each path and algorithm we record a binary success indicator (1 if the UAV reaches the goal without collision, 0 otherwise); the success rates in Table 3 are the mean of these indicators (in %), and the associated standard deviations quantify how success varies across maps. Propulsion energy is computed per path using the surrogate in Equation (25), including the fixed penalty E fail = 1000 (normalized units) for failed trajectories, and we report the mean ± standard deviation over all nine paths. To better capture typical mission duration, the travel times are averaged only over successful trajectories (non-collided flights); failures are excluded from the time statistics but remain penalized through the energy and unified-score metrics.
The unified scores in Table 3 correspond to the length-based scores in Equations (22) and (23), computed per path by normalizing path length over the set of successful planners, assigning a score of 0 to failed planners, and then averaging these values across the nine benchmark paths.

5.6.2. Baseline Scope

We compare PAIR against classical discrete planners (A*, D* Lite, and CBS-D* Lite) and a representative swarm-based metaheuristic (PSO), which together cover the main non-learning baselines used in UAV path planning. Incorporating additional end-to-end deep RL baselines, such as RL-Planner [17], under our nine-map benchmark and real-time constraints is left to future work.

5.7. Reward Shaping and Collision-Check Implementation

5.7.1. Reward Shaping

At each PPO correction step we emit a dense, potential-based reward:
$$r_t = -\,w_d\, d(x_t, x_{t+\Delta t}) \,-\, w_r\, R_t(x_t) \,-\, w_e\, e(a_t) \,+\, r_{\text{goal}},$$
where
  • $d(x_t, x_{t+\Delta t})$ is the Euclidean distance toward the next discrete waypoint, so reducing it rewards progress along the backbone.
  • $R_t(x) = \sum_{o \in \mathcal{O}} \max\!\left\{0,\ (d_{\text{safe}} + r) - \lVert x - o(t) \rVert\right\}$ penalizes proximity to every obstacle.
  • $e(a_t) \propto \lVert a_t \rVert^2$ is the instantaneous energy cost of offset $a_t$.
  • $r_{\text{goal}} = +10$ is a sparse bonus awarded upon reaching the final goal.
The weights ( w d , w r , w e ) = ( 1.0 , 0.5 , 0.1 ) are chosen empirically to balance path efficiency, obstacle clearance, and energy use, and are kept fixed across all experiments. A more systematic numerical ablation of these coefficients is left to future work.
The same risk field $R_t(x)$ used in the reward is also queried at the UAV position to form the observation component $\hat{R}_t(x_t)$ in (15). All dynamic obstacles and other UAVs are inserted into the obstacle set $\mathcal{O}$ when building $R_t(\cdot)$, so nearby agents influence the state only through this aggregated risk scalar rather than via an explicit neighbor list. This keeps the observation dimension fixed while still allowing the policy to react to local congestion. In the implementation reported here, the PPO agent is trained solely on these navigation-related signals (distance, aggregated risk, and a convex surrogate for propulsion energy). MEC task assignment and deadline satisfaction are handled independently by the heterogeneous task-to-UAV model in Section 3.5 and are not explicitly encoded in the RL reward. Incorporating deadline- or latency-aware terms (for example, a normalized penalty $w\,(t_{\text{rem}}/T_{\max})$ when the predicted completion time approaches a task’s deadline) is a natural extension and is left to future work.
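A direct transcription of this shaped reward might look as follows. The sign convention (penalties entered negatively) is our reading of the shaped reward, and the safety-margin constants `D_SAFE` and `R_OBS` are illustrative values, not quoted from the released code; only the weights (1.0, 0.5, 0.1) and the +10 goal bonus come from the paper.

```python
import numpy as np

W_D, W_R, W_E = 1.0, 0.5, 0.1   # (w_d, w_r, w_e) from the paper
D_SAFE, R_OBS = 0.05, 0.05      # illustrative safety margin and obstacle radius
GOAL_BONUS = 10.0               # sparse r_goal bonus

def risk_field(x, obstacles):
    """Aggregated risk R_t(x): summed hinge penalty for obstacles closer
    than d_safe + r (dynamic obstacles and other UAVs both enter this set)."""
    x = np.asarray(x, dtype=float)
    total = 0.0
    for o in obstacles:
        gap = (D_SAFE + R_OBS) - np.linalg.norm(x - np.asarray(o, dtype=float))
        total += max(0.0, gap)
    return total

def step_reward(x, next_waypoint, action_offset, obstacles, reached_goal=False):
    """Dense shaped reward: distance-to-waypoint penalty, risk penalty,
    quadratic energy cost of the PPO offset, plus a sparse goal bonus."""
    d = np.linalg.norm(np.asarray(next_waypoint, float) - np.asarray(x, float))
    e = float(np.dot(action_offset, action_offset))  # e(a_t) ~ ||a_t||^2
    r = -W_D * d - W_R * risk_field(x, obstacles) - W_E * e
    return r + (GOAL_BONUS if reached_goal else 0.0)
```

Because the risk term is a hinge on clearance, it is exactly zero whenever every obstacle is farther than `D_SAFE + R_OBS`, so the policy is only penalized inside the safety shell.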

5.7.2. Collision Detection

At each simulator step, we have the following:
  • UAV vs. obstacle:
$$\left\lVert p_i(t) + \Delta_{i,j} - o_k(t) \right\rVert < d_{\text{safe}} + r$$
  • Inter-UAV:
$$\left\lVert \left(p_i(t) + \Delta_{i,j}\right) - \left(p_\ell(t) + \Delta_{\ell,j}\right) \right\rVert < 2\, d_{\text{safe}}$$
  • Segment sampling: To catch collisions mid-edge, we sample five equally spaced points along each discrete step and repeat the same checks.
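The segment-sampling check can be implemented by linearly interpolating five equally spaced points along each discrete step and applying the clearance test at each one. The helper below covers the UAV-vs-obstacle case, with the paper's $d_{\text{safe}} + r$ threshold form and illustrative default values.

```python
import numpy as np

def segment_collision(p_start, p_end, obstacles, d_safe=0.05, r=0.05, samples=5):
    """Sample `samples` equally spaced points along one discrete step and
    test each against the clearance threshold d_safe + r, catching
    mid-edge collisions that endpoint-only checks would miss."""
    p_start = np.asarray(p_start, dtype=float)
    p_end = np.asarray(p_end, dtype=float)
    for t in np.linspace(0.0, 1.0, samples):
        q = (1.0 - t) * p_start + t * p_end     # interpolated position
        for o in obstacles:
            if np.linalg.norm(q - np.asarray(o, dtype=float)) < d_safe + r:
                return True
    return False
```

The inter-UAV case is analogous, comparing two interpolated agent positions against the $2\,d_{\text{safe}}$ threshold instead.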

5.8. Implementation Details

All simulations (environment, baselines, and PAIR) were implemented in Python 3.12.4 using NumPy 2.4.0, SciPy 1.16.3, Matplotlib 3.10.8, and PyTorch 2.9.1. As indicated in Table A1, the runtime experiments (including PPO training, evaluation, and online PPO inference during replanning) are executed on an NVIDIA Jetson Xavier NX embedded platform (6-core ARM CPU, 8 GB RAM, 384-core Volta GPU; software stack: NVIDIA JetPack 5.1.4), chosen to emulate onboard UAV compute. The trained PPO policy is used for online inference with an average forward-pass latency of ≈5 ms per PPO call (Table A1), which supports the 10 Hz replanning loop used in our benchmark. For post-processing and figure generation, we additionally use an HP ZBook 15v G5 workstation running Windows 10 Pro (Build 19045) with an Intel Core i7-8750H CPU, 16 GB RAM, and an NVIDIA Quadro P600 (4 GB), using Python 3.12.4 (Anaconda Distribution 2025.12-1, Spyder 6.1.2) to parse logs and render plots. For reproducibility, random seeds are fixed per scenario when reporting mean ± std metrics, and the obstacle-density sweep averages results over S = 3 random seeds as stated in Section 5.

6. Results and Comparative Analysis

6.1. Success-Rate Analysis

Table 3 and Figure 1 report each planner’s success rate over the nine benchmark maps. Classical A* succeeds on only one of nine maps (11.11%), failing in both dynamic and highly cluttered static scenarios due to its inability to replan. D* Lite improves this to 44.44% by incrementally repairing collisions, yet still breaks down in highly dynamic layouts where frequent updates are required. CBS-D*, PSO, and our PAIR framework all achieve perfect reliability (100%), demonstrating that conflict-based search, metaheuristic optimization, and the proposed hybrid planner can all attain mission-complete performance under the evaluated settings.

6.2. Energy Consumption and Travel Time

Figure 2 depicts the average propulsion-energy surrogate consumed by each planner (in normalized units). Purely discrete methods exhibit the highest energy demands: A* at 899.66 units and D* Lite at 594.01 units, while CBS-D* reduces this to 126.45 units thanks to occasional replanning. Note, however, that the large values for A* and D* Lite are inflated by the failure penalty $E_{\text{fail}} = 1000$ applied to collided or deadlocked trajectories in Equation (25); these numbers should therefore be interpreted as penalty-inclusive surrogate costs for relative comparison, not as physical energy measurements.
In contrast, hybrid planners achieve dramatic savings: PSO consumes 109.44 units and PAIR further lowers this to 104.91 units, a 4.1% reduction relative to PSO. Across the nine benchmark paths, the associated standard deviations are moderate (for instance, PAIR achieves 104.91 ± 44.36 units compared to 109.44 ± 47.26 units for PSO and 126.45 ± 54.67 units for CBS-D*), so the ranking in Table 3 is robust to path-to-path variability.
Figure 3 shows the corresponding average travel times. According to Table 3, A* and D* Lite achieve mean travel times of 201.4 s and 179.4 s, respectively, but only on the very small subset of paths where they succeed (11.1% and 44.4% success). Among the 100%-reliable planners, CBS-D* has the longest mean travel time (236.8 s), whereas PSO averages 214.3 s and PAIR trims this to 207.8 s, 6.5 s faster than PSO (a 3.0% improvement) and the shortest travel time under full mission success.
These results demonstrate that integrating continuous PPO-based corrections into a discrete backbone (PAIR) both maintains perfect reliability and substantially lowers energy and time compared to purely discrete or metaheuristic approaches.
In Figure 4, A* exhibits multiple deadlocks (path 9) and at least one collision (paths 1 and 2) when confronted with dynamic obstacles. These trajectories are not solely determined by geometric proximity but follow the task-assignment model in Section 3.5, which assigns each request to the UAV whose CPU, memory, battery, and deadline constraints are best matched. Consequently, some paths cross (for example, paths 1 and 2 end up on a collision course), and in a few cases a more distant UAV is dispatched instead of the nearest one (e.g., UAV 7 is closer to the task in path 9 but is not selected).
D* Lite (Figure 5) leverages incremental replanning to escape some static deadlocks, but still registers collisions in dynamic encounters (paths 1 and 2). Replanning latency causes hesitation near high-risk zones, visible as abrupt, zig-zag path segments in those areas.
CBS-D* Lite (Figure 6) resolves inter-agent conflicts more robustly than D* Lite, yet its conflict-tree updates induce pronounced detours (as seen in paths 5 and 6) and occasional collisions when dynamic obstacles intrude mid-segment.
PSO (Figure 7) generates smoother, continuous trajectories successfully avoiding both static and dynamic obstacles. However, its global waypoint optimization sometimes yields unnecessarily long loops as can be seen in path 1 just before it converges; this reflects trade-offs between exploration and path length.
In contrast, PAIR (Figure 8) combines the fast backbone of A* with PPO-based local corrections. All nine paths seamlessly navigate around moving obstacles, maintain safe separation, and incur minimal detours. The continuous refinements tighten trajectories around static obstacles while reacting instantly to dynamic intrusions. This demonstrates the superior adaptability and efficiency of PAIR in both static and dynamic settings.

6.3. Unified Score Comparison

Figure 9 and Table 3 report the unified score, the average normalized path-quality score $\bar{S}_A$ defined in (23). This scalar index summarizes how often each planner attains short paths relative to the best-performing algorithm on each scenario, while the success rate, energy consumption, and travel time are presented explicitly in the same table and figures. In terms of the normalized unified score, PAIR attains 84.56 ± 34.01 versus 73.17 ± 38.13 for PSO, with A*, D* Lite, and CBS-D* remaining well below these values (Table 3). This margin reflects the balanced improvements of PAIR across all dimensions: it maintains 100% success while simultaneously reducing both energy consumption and travel time relative to purely discrete (A*, D* Lite, CBS-D*) and metaheuristic (PSO) planners.

6.4. Impact of Obstacle Density

To quantify each planner’s scalability under rising dynamic complexity, we consider the single-scenario density sweep introduced in the “Variable Obstacle Density Sweep” subsection of Section 5, where each planner is evaluated at obstacle counts { 5 , 10 , 15 , 20 , 25 , 30 } . Because this sweep is performed on a single canonical map, it should be interpreted as a controlled stress test of relative trends rather than a full statistical characterization over an ensemble of urban layouts; replicating the sweep across multiple randomly generated maps and 3-D environments is an important direction for future work. Additional qualitative trajectory visualizations for this obstacle-density sweep (GIFs) are provided in the Supplementary Material.
Figure 10 plots the resulting path lengths. The PAIR curve increases only modestly, from about 113 to 147 grid units, demonstrating robust backbone routing with minimal detours. In contrast, CBS-D* remains flat around 156–162 units, and PSO exhibits an unusual fluctuation (114→172→145→157), which is understandable given its stochastic waypoint sampling.
Figure 11 shows the total steps to goal. PAIR grows smoothly from 82 to 199 steps, indicating predictable replanning overhead. CBS-D* increases monotonically (158→205), as each new obstacle simply adds another conflict/rollback event. PSO, however, oscillates dramatically from 182 at 10 obstacles, down to 100 at 15, and back up to 132 at 30 obstacles, revealing an unreliable convergence under clutter.
Most critically, Figure 12 reports the total cumulative replanning time per episode. For PAIR this quantity remains the lowest among all planners across the entire sweep, increasing from about $1.9 \times 10^{3}$ ms (≈1.9 s) at 5 obstacles to a worst case of roughly $1.9 \times 10^{5}$ ms (≈190 s) at 25 obstacles. At 30 obstacles, the episode reaches the goal in fewer steps on that map, so the PAIR cumulative replanning time decreases to about $2.3 \times 10^{4}$ ms (≈23 s) rather than continuing to grow with obstacle count. By comparison, CBS-D* requires up to $4.4 \times 10^{5}$ ms (≈440 s) of planning time, and PSO exceeds $3.4 \times 10^{6}$ ms (≈3400 s, nearly an hour) at 30 obstacles, making it impractical for real-time use.
Taken together, these density-sweep curves confirm that PAIR is the only planner whose latency and path optimality both scale gracefully as obstacle density increases, making it uniquely suited for real-time UAV navigation in highly dynamic environments.

7. Discussion

Our comparative analysis highlights a fundamental trade-off between responsiveness and optimality. Purely discrete planners (A*, D* Lite, CBS-D*) guarantee optimal or near-optimal grid-based routes on static maps but suffer from replanning latency and deadlocks once obstacles move (Figure 4, Figure 5 and Figure 6). The PSO metaheuristic adapts continuously yet can over-correct and produce suboptimal detours in dense layouts (Figure 7). The PAIR hybrid approach combines a fast A* backbone for coarse routing with PPO-based local corrections that preserve near-optimal static performance while reacting quickly to dynamic intrusions, yielding lower propulsion energy and shorter travel times than all baselines under the same benchmark scenarios and hyperparameter settings.
Unlike PSO, which quickly becomes infeasible for replanning (multi-minute cumulative planning times), and CBS-D* (tens to hundreds of seconds), PAIR consistently has the smallest cumulative replanning time across all obstacle densities. In our experiments its total replanning time increases from about 2 s at 5 obstacles to at most 190 s at 25 obstacles, while CBS-D* and PSO reach roughly 440 s and over 3400 s, respectively, at high densities.
The monotonic, predictable growth of PAIR in steps to goal (Figure 11) contrasts with the erratic behavior of PSO, underscoring its reliability across densities. The non-monotonic dip in the PAIR cumulative replanning time at 30 obstacles arises because the episode on that map terminates earlier than at 25 obstacles; since we accumulate planning time only up to goal completion, shorter episodes naturally yield smaller totals even at higher obstacle counts. These results confirm that PAIR not only delivers superior path quality and energy efficiency but also scales gracefully in high-density dynamic environments, preserving both real-time responsiveness and path optimality.
PAIR incurs modest overhead: discrete A* searches complete in <1 ms per step, and PPO inference runs in ≈5 ms on an embedded GPU, supporting a 10 Hz replanning rate. By contrast, CBS-D* conflict-tree updates can exceed 50 ms, and the PSO 200-iteration replanning loops are even slower.
Inter-UAV safety checks scale as $O(N^2)$, but the decentralized PPO corrections of PAIR fold neighbor information into each agent’s state through the shared risk field, avoiding global coordination bottlenecks. This makes PAIR naturally extensible to larger swarms.
In our experiments, the PAIR PPO agent is trained purely on navigation performance (path length, local risk, and a propulsion-energy surrogate), while MEC task assignment and deadline satisfaction are governed by the heterogeneous assignment model of Section 3.5. The present evaluation therefore focuses on geometric path quality under dynamic obstacles, with offloading treated as a separate upper-layer decision mechanism rather than as part of the RL objective; we view PAIR as a path-planning contribution evaluated in a MEC-inspired environment, not as an end-to-end solution for joint computation offloading and trajectory control.
A natural extension is to incorporate deadline- or latency-aware terms directly into the PPO reward (for example, a normalized penalty $w\,(t_{\text{rem}}/T_{\max})$ when the predicted completion time approaches a task’s deadline), or to co-train navigation and offloading policies in a unified multi-objective RL formulation. We leave such MEC co-optimization to future work; the present study deliberately isolates the navigation component while demonstrating the compatibility of PAIR with 6G edge-computing scenarios.
All experiments in this paper are conducted on a 2-D fixed-altitude grid abstraction with procedurally generated static obstacles and stylized Markovian dynamic obstacles, under a simplified MEC model that abstracts away physical-layer, queueing, and perception uncertainties. Consequently, our results should be interpreted as an algorithmic evaluation of navigation strategies in a MEC-inspired setting, not as an end-to-end hardware validation on specific UAV platforms or sensor pipelines. This design choice allows us to stress-test PAIR and all baselines under tightly controlled obstacle densities and map statistics, but it does not capture the full 3-D flight kinematics or real-world sensing artifacts. Validating PAIR on photorealistic 3-D simulators and standardized or real multi-UAV flight datasets is therefore an important direction for future work.
While the paper is motivated by UAV-assisted MEC, our present evaluation focuses on geometric navigation (path length, collision risk, and a propulsion-energy surrogate). The task-assignment model in Section 3.5 provides a heterogeneous MEC context but is static during flight, and MEC constraints (latency/deadlines) are not factored into the PPO reward. A more convincing end-to-end MEC study would generate dynamic task arrivals during simulated flights and couple the predicted completion latency to the navigation policy (for instance, via a deadline-aware penalty when the predicted service time approaches a task deadline). We leave this integrated co-optimization and dynamic task scenario generation to future work.

8. Conclusions and Future Work

We have presented PAIR, a multi-agent hybrid A*–PPO path-planning framework for multi-UAV deployments in dynamic MEC-inspired urban environments modeled as 2-D fixed-altitude grids with procedurally generated static obstacles and stylized Markovian dynamic obstacles. The framework combines a fast discrete A* backbone with an on-demand PPO-based continuous refinement module and achieves 100% mission success across nine algorithmically generated benchmark scenarios, while reducing the average propulsion energy to 104.9 normalized units and average travel time to 207.8 s. Under our unified evaluation, which jointly considers success rate, energy, time, and normalized path quality, PAIR consistently outperforms classical discrete planners (A*, D* Lite, CBS-D*) and a PSO metaheuristic baseline, attaining a top unified score of 84.56 without sacrificing reliability. The released 2-D benchmark and simulator are intended to be a reusable baseline for future work on dynamic multi-UAV navigation in MEC-inspired urban environments and as a stepping stone toward full 3-D and hardware-in-the-loop evaluations.
Looking ahead, several directions can further enhance the applicability and performance of PAIR:
  • Full 3-D Navigation: Extend the state and action spaces to include altitude variations, enabling obstacle avoidance in true three-dimensional urban canyons.
  • Robustness to Perception Uncertainty: Integrate noisy sensor models and adversarial perturbations into training, or adopt uncertainty-aware RL techniques to maintain performance under imperfect state estimation.
  • End-to-End Joint Learning: Co-train the discrete backbone heuristic and PPO policy within a unified curriculum, allowing global and local planners to co-adapt.
  • Swarm-Scale Coordination: Investigate federated or hierarchical PPO schemes to support densely populated UAV fleets, sharing policy improvements without centralized data exchange.
  • MEC Co-Optimization: Incorporate live network latency and computation offloading deadlines into the reward function, achieving truly integrated flight-computational planning for 6G-enabled UAV networks.
  • Formal safety guarantees: While PAIR ensures a high probability of safety, deterministic guarantees require integration with formal methods such as control barrier certificates, which we leave for future work.
  • 3-D and real-data validation: Port PAIR to full 3-D kinematics and evaluate it on standardized or real multi-UAV datasets (such as LiDAR/trajectory logs), complementing the 2-D synthetic benchmark reported here.
By pursuing these extensions, future work will bring PAIR closer to deployment in real-world UAV swarms.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/drones10010058/s1. The supplementary files include additional visual demonstrations (GIFs) and supporting experimental materials for this study.

Author Contributions

Conceptualization, B.H.T. and J.L.; methodology, B.H.T. and J.L.; software, B.H.T.; validation, B.H.T., J.L., Y.Q. and H.R.S.; formal analysis, B.H.T. and J.L.; investigation, B.H.T.; resources, J.L. and Y.Q.; data curation, B.H.T.; writing, original draft preparation, B.H.T.; writing, review and editing, J.L., Y.Q. and H.R.S.; visualization, B.H.T.; supervision, J.L. and Y.Q.; project administration, J.L. and Y.Q.; funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 62372163 and the Science and Technology Innovation Program of Hunan Province under Grant 2024RC1033.

Data Availability Statement

The data that support the findings of this study are openly available in the “Urban Multi-UAV Path Planning Simulation Dataset (2-D Dynamic Urban MEC Scenarios)” repository on Figshare, at https://doi.org/10.6084/m9.figshare.30787730. The dataset contains the nine benchmark urban scenarios used in our experiments (static and dynamic obstacle fields, UAV start–goal pairs, and dynamic obstacle trajectories) in CSV/JSON format. Any additional information related to the data, or scripts for loading and basic visualization, is available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to thank the College of Computer Science and Electronic Engineering at Hunan University and the College of Computer Science and Information Technology at the University of Sumer for their administrative and technical support, as well as the computing resources used in this work.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. PPO Hyperparameters and Training Configuration

Table A1 reports the complete PPO training configuration used throughout the paper. We adopt standard PPO settings commonly used for continuous-control navigation ( γ = 0.99 , GAE λ = 0.95 , clipping ϵ = 0.2 ), and we keep them fixed across all benchmarks for reproducibility and to avoid scenario-specific overfitting. We do not claim exhaustive hyperparameter optimality; beyond basic sanity checks to ensure stable learning and convergence, a full sensitivity/ablation study is left to future work. Unless otherwise stated, the “Hardware” entry in Table A1 refers to the platform used for the runtime experiments reported in the paper (training/evaluation and online PPO inference), while a separate workstation is used only for offline plotting.
Table A1. PPO hyperparameters and training configuration.
Parameter | Value
Network architecture | 2-layer MLP, 128 ReLU units per layer
Optimizer | Adam
Actor learning rate | 3 × 10⁻⁴
Critic learning rate | 1 × 10⁻³
Discount factor (γ) | 0.99
GAE parameter (λ_GAE) | 0.95
PPO clipping parameter (ε) | 0.20
Batch size | 2048
Epochs per update | 50
Total environment steps | 500,000
Reward weights (w_d, w_r, w_e) | (1.0, 0.5, 0.1)
Training scenarios | 3 static + 3 dynamic maps
Hardware | Embedded GPU (Jetson Xavier NX)
Inference latency | ≈5 ms per PPO call
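For concreteness, the two PPO ingredients fixed in Table A1 can be sketched as follows: Generalized Advantage Estimation with γ = 0.99 and λ = 0.95, and the clipped surrogate objective with ε = 0.20. This is a minimal illustrative sketch, not the paper’s implementation; function names are our own.

```python
import numpy as np

GAMMA, LAM, EPS = 0.99, 0.95, 0.20  # values from Table A1

def gae_advantages(rewards, values, last_value, gamma=GAMMA, lam=LAM):
    """Generalized Advantage Estimation over one rollout (backward recursion)."""
    values = np.append(values, last_value)  # bootstrap with V(s_T)
    adv = np.zeros(len(rewards))
    gae = 0.0
    for t in reversed(range(len(rewards))):
        delta = rewards[t] + gamma * values[t + 1] - values[t]  # TD residual
        gae = delta + gamma * lam * gae
        adv[t] = gae
    return adv

def clipped_surrogate(ratio, adv, eps=EPS):
    """PPO-Clip objective: mean of min(r*A, clip(r, 1-eps, 1+eps)*A)."""
    return float(np.minimum(ratio * adv,
                            np.clip(ratio, 1 - eps, 1 + eps) * adv).mean())
```

With ratio r = 2 and advantage A = 1, the clipped term caps the objective at 1 + ε = 1.2, which is what bounds the size of each policy update.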

Appendix B. Baseline Hyperparameters and Tuning Protocol

For transparency and to ensure a fair comparison, we keep all baseline configurations fixed across the nine benchmark maps and obstacle-density sweeps. Table A2 summarizes the key hyperparameters for PSO and CBS-D* Lite. The PSO settings follow the standard recommendations for swarm-based path planning and are chosen to balance the solution quality against the per-step replanning budget, while the CBS-D* Lite parameters bound the size of the conflict tree and rollback window so that planning latency remains comparable to the PAIR 10 Hz target. Rather than exhaustively tuning each baseline, we adopt these simple, literature-inspired configurations and use them consistently in all experiments, so that the performance gains reported in Table 3 primarily reflect architectural differences rather than aggressive hyperparameter search.
We set the PSO inertia weight to w = 0.7 as a standard, literature-consistent mid-range choice that balances exploration (large inertia) against convergence (small inertia) in waypoint-based PSO planning. Following the canonical PSO formulation [16], we keep (w, c1, c2) constant across all maps to avoid per-scenario tuning advantages and to keep replanning budgets comparable.
CBS-D* Lite resolves inter-agent conflicts by expanding a conflict tree and triggering low-level repairs. In highly dynamic environments, frequent obstacle updates can repeatedly invalidate repaired segments, which increases conflict-tree churn and induces conservative detours. This mechanism explains why CBS-D* Lite remains reliable in our tests but often exhibits longer travel times and higher replanning overhead than PAIR and PSO (Table 3).
Table A2. Key hyperparameters for PSO and CBS-D* Lite baselines.
Planner | Setting
PSO | Swarm size N_p = 30 particles
PSO | Intermediate waypoints per path N_wp = 8
PSO | Inertia weight w = 0.7
PSO | Cognitive/social gains c1 = c2 = 1.4
PSO | Iterations per replanning = 200
CBS-D* Lite | Maximum banned cells per agent = 10
CBS-D* Lite | Rollback window = 2 waypoints
CBS-D* Lite | Inter-agent safety radius d_safe (as in Section 5.7)
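The canonical PSO update [16] with the Table A2 settings (w = 0.7, c1 = c2 = 1.4) can be sketched as below. The waypoint encoding (a swarm of N_wp-waypoint vectors) follows the table, but the function and variable names are illustrative and not taken from the paper’s code.

```python
import numpy as np

W, C1, C2 = 0.7, 1.4, 1.4  # inertia and cognitive/social gains from Table A2

def pso_step(pos, vel, pbest, gbest, rng):
    """One canonical PSO velocity/position update for a swarm of waypoint sets.

    pos, vel: (N_p, N_wp, 2) arrays of particle positions and velocities.
    pbest:    per-particle best positions; gbest: swarm best (broadcastable).
    """
    r1 = rng.random(pos.shape)  # i.i.d. uniform [0, 1) perturbations
    r2 = rng.random(pos.shape)
    vel = W * vel + C1 * r1 * (pbest - pos) + C2 * r2 * (gbest - pos)
    return pos + vel, vel
```

When a particle already sits at both its personal and the global best, the attraction terms vanish and the velocity simply decays by the inertia factor w = 0.7, which is the convergence behaviour the mid-range inertia choice is meant to provide.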

References

  1. Raja, G.; Manoharan, A.; Siljak, H. Ugen: UAV and GAN aided ensemble network for post disaster survivor detection through ORAN. IEEE Trans. Veh. Technol. 2024, 73, 9296–9305. [Google Scholar] [CrossRef]
  2. Wang, T.; Huang, X.; Wu, Y.; Qian, L.; Lin, B.; Su, Z. UAV swarm assisted two tier hierarchical federated learning. IEEE Trans. Netw. Sci. Eng. 2023, 11, 943–956. [Google Scholar] [CrossRef]
  3. He, J.; Wang, J.; Wang, N.; Guo, S.; Zhu, L.; Niyato, D.; Xiang, T. Preventing non-intrusive load monitoring privacy invasion: A precise adversarial attack scheme for networked smart meters. arXiv 2024, arXiv:2412.16893. [Google Scholar] [CrossRef]
  4. Tian, B.; Wang, L.; Xu, L.; Pan, W.; Wu, H.; Li, L.; Han, Z. UAV assisted wireless cooperative communication and coded caching: A multiagent two timescale DRL approach. IEEE Trans. Mob. Comput. 2023, 23, 4389–4404. [Google Scholar] [CrossRef]
  5. Qu, Y.; Sun, H.; Dong, C.; Kang, J.; Dai, H.; Wu, Q.; Guo, S. Elastic collaborative edge intelligence for UAV swarm: Architecture, challenges, and opportunities. IEEE Commun. Mag. 2023, 62, 62–68. [Google Scholar] [CrossRef]
  6. Wang, Y.; Sheng, M.; Wang, X.; Wang, L.; Li, J. Mobile edge computing: Partial computation offloading using dynamic voltage scaling. IEEE Trans. Commun. 2016, 64, 4268–4282. [Google Scholar] [CrossRef]
  7. Long, Y.; Zhao, S.; Gong, S.; Gu, B.; Niyato, D.; Shen, X. AoI aware sensing scheduling and trajectory optimization for multi UAV assisted wireless backscatter networks. IEEE Trans. Veh. Technol. 2024, 73, 15440–15455. [Google Scholar] [CrossRef]
  8. Mao, Y.; You, C.; Zhang, J.; Huang, K.; Letaief, K.B. A survey on mobile edge computing: The communication perspective. IEEE Commun. Surv. Tutor. 2017, 19, 2322–2358. [Google Scholar] [CrossRef]
  9. Zhan, C.; Hu, H.; Liu, Z.; Wang, Z.; Mao, S. Multi UAV enabled mobile edge computing for time constrained IoT applications. IEEE Internet Things J. 2021, 8, 15553–15567. [Google Scholar] [CrossRef]
  10. Trotti, F.; Farinelli, A.; Muradore, R. A Markov decision process approach for decentralized UAV formation path planning. In Proceedings of the 2024 European Control Conference (ECC), Stockholm, Sweden, 25–28 June 2024; pp. 436–441. [Google Scholar]
  11. Dai, X.; Xiao, Z.; Jiang, H.; Lui, J.C.S. UAV assisted task offloading in vehicular edge computing networks. IEEE Trans. Mob. Comput. 2024, 23, 2520–2534. [Google Scholar] [CrossRef]
  12. Chang, H.; Chen, Y.; Zhang, B.; Doermann, D. Multi UAV mobile edge computing and path planning platform based on reinforcement learning. IEEE Trans. Emerg. Top. Comput. Intell. 2021, 6, 489–498. [Google Scholar] [CrossRef]
  13. Jin, J.; Zhang, Y.; Zhou, Z.; Jin, M.; Yang, X.; Hu, F. Conflict based search with D* Lite algorithm for robot path planning in unknown dynamic environments. Comput. Electr. Eng. 2023, 105, 108473. [Google Scholar] [CrossRef]
  14. Li, Y.; Wu, R.; Gan, L.; He, P. Development of an effective relay communication technique for multi UAV wireless network. IEEE Access 2024, 12, 74087–74095. [Google Scholar] [CrossRef]
  15. Wang, Y.; Zhu, J.; Huang, H.; Xiao, F. Bi objective ant colony optimization for trajectory planning and task offloading in UAV assisted MEC systems. IEEE Trans. Mob. Comput. 2024, 23, 12360–12377. [Google Scholar] [CrossRef]
  16. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95-International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar]
  17. Ejaz, M.; Gui, J.; Asim, M.; Affendi, M.A.E.; Fung, C.; Latif, A.A.A. RL Planner: Reinforcement learning enabled efficient path planning in multi UAV MEC systems. IEEE Trans. Netw. Serv. Manag. 2024, 21, 3317–3329. [Google Scholar] [CrossRef]
  18. Yuan, H.; Wang, M.; Bi, J.; Shi, S.; Yang, J.; Zhang, J.; Zhou, M.; Buyya, R. Cost efficient task offloading in mobile edge computing with layered unmanned aerial vehicles. IEEE Internet Things J. 2024, 11, 30496–30509. [Google Scholar] [CrossRef]
  19. Mach, P.; Becvar, Z. Mobile edge computing: A survey on architecture and computation offloading. IEEE Commun. Surv. Tutor. 2017, 19, 1628–1656. [Google Scholar] [CrossRef]
  20. Qin, P.; Fu, Y.; Xie, Y.; Wu, K.; Zhang, X.; Zhao, X. Multi agent learning based optimal task offloading and UAV trajectory planning for AGIN power IoT. IEEE Trans. Commun. 2023, 71, 4005–4017. [Google Scholar] [CrossRef]
  21. Liang, K.; Wang, Y.; Li, Z.; Zheng, G.; Wong, K.K.; Chae, C.B. Digital twin assisted deep reinforcement learning for computation offloading in UAV systems. IEEE Trans. Veh. Technol. 2025, 74, 8466–8471. [Google Scholar] [CrossRef]
  22. Cherif, B.; Ghazzai, H.; Alsharoa, A.; Besbes, H.; Massoud, Y. Aerial LiDAR based 3D object detection and tracking for traffic monitoring. In Proceedings of the 2023 IEEE International Symposium on Circuits and Systems (ISCAS), Monterey, CA, USA, 21–25 May 2023; pp. 1–5. [Google Scholar]
  23. Li, C.; Gan, Y.; Zhang, Y.; Luo, Y. A cooperative computation offloading strategy with on demand deployment of multi UAVs in UAV aided mobile edge computing. IEEE Trans. Netw. Serv. Manag. 2023, 21, 2095–2110. [Google Scholar] [CrossRef]
  24. Wu, X.; Lei, Y.; Tong, X.; Zhang, Y.; Li, H.; Qiu, C.; Guo, C.; Sun, Y.; Lai, G. A non rigid hierarchical discrete grid structure and its application to UAVs conflict detection and path planning. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 5393–5411. [Google Scholar] [CrossRef]
  25. Zhong, L.; Zhao, J.; Luo, H.; Hou, Z. Hybrid path planning and following of a quadrotor UAV based on deep reinforcement learning. In Proceedings of the 2024 36th Chinese Control and Decision Conference (CCDC), Xi’an, China, 25–27 May 2024; pp. 1858–1863. [Google Scholar]
  26. Jayaweera, N.; Rajatheva, N.; Latva-aho, M. Autonomous driving without a burden: View from outside with elevated LiDAR. In Proceedings of the 2019 IEEE 89th Vehicular Technology Conference (VTC2019-Spring), Kuala Lumpur, Malaysia, 28 April–1 May 2019; pp. 1–7. [Google Scholar]
  27. Li, F.; Luo, J.; Sun, P.; Teng, S. Energy efficient UAV based data collection 3D trajectory optimization with wireless power transfer for forest monitoring. IEEE Internet Things J. 2025, 12, 24071–24082. [Google Scholar] [CrossRef]
  28. Luo, Z.; Zhang, J.; Wei, J.; Zhou, L.; Cao, K.; Zhao, H. Trajectory design and task scheduling for multi UAV aided mobile edge computing networks. In Proceedings of the 2025 IEEE Wireless Communications and Networking Conference (WCNC), Milan, Italy, 24–27 March 2025; pp. 1–6. [Google Scholar]
  29. Gao, Y.; Tao, J.; Xu, Y.; Wang, Z.; Gao, Y.; Wang, M. Improving user QoE via joint trajectory and resource optimization in multi UAV assisted MEC. IEEE Trans. Services Comput. 2025, 18, 1472–1486. [Google Scholar] [CrossRef]
  30. Yin, L.; Luo, J.; Qiu, C.; Wang, C.; Qiao, Y. Joint task offloading and resource allocation for hybrid vehicle edge computing systems. IEEE Trans. Intell. Transp. Syst. 2024, 25, 10355–10368. [Google Scholar] [CrossRef]
  31. Yang, A.; Huan, J.; Wang, Q.; Yu, H.; Gao, S. ST-D3QN: Advancing UAV path planning with an enhanced deep reinforcement learning framework in ultra-low altitudes. IEEE Access 2025, 13, 65285–65300. [Google Scholar] [CrossRef]
  32. Wu, T.; Zhang, Z.; Jing, F.; Gao, M. A dynamic path planning method for UAVs based on improved informed-RRT* fused dynamic windows. Drones 2024, 8, 539. [Google Scholar] [CrossRef]
  33. Xie, J.; Huang, W.; Miao, J.; Li, J.; Cao, S. Off-policy deep reinforcement learning for path planning of stratospheric airship. Drones 2025, 9, 650. [Google Scholar] [CrossRef]
  34. Wang, Y.; Liu, J.; Qian, Y.; Yi, W. Path planning for multi-UAV in a complex environment based on reinforcement-learning-driven continuous ant colony optimization. Drones 2025, 9, 638. [Google Scholar] [CrossRef]
  35. Rahman, M.; Sarkar, N.I.; Lutui, R. A survey on multi-UAV path planning: Classification, algorithms, open research problems, and future directions. Drones 2025, 9, 263. [Google Scholar] [CrossRef]
  36. Meng, W.; Zhang, X.; Zhou, L.; Guo, H.; Hu, X. Advances in UAV path planning: A comprehensive review of methods, challenges, and future directions. Drones 2025, 9, 376. [Google Scholar] [CrossRef]
  37. Andrychowicz, M.; Raichuk, A.; Stańczyk, P.; Orsini, M.; Girgin, S.; Marinier, R.; Hussenot, L.; Geist, M.; Pietquin, O.; Michalski, M.; et al. What matters for on-policy deep actor-critic methods? A large-scale study. In Proceedings of the International Conference on Learning Representations (ICLR), Virtual Event, Austria, 3–7 May 2021. [Google Scholar]
  38. Kunda, N.S.S.; Kc, P.; Pandey, M.; Kumaar, A.A.N. Reward design and hyperparameter tuning for generalizable deep reinforcement learning agents in autonomous racing. Sci. Rep. 2025, 15, 43940. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Success rates across the nine benchmark paths (unit: %).
Figure 2. Average energy consumption of pathfinding algorithms (normalized units).
Figure 3. Travel time per path for each algorithm (unit: seconds). The annotation “fail” indicates that the planner did not successfully reach the goal on that specific path.
Figure 4. Path planning visualization (A*).
Figure 5. Path planning visualization (D* Lite).
Figure 6. Path planning visualization (CBS-D* Lite).
Figure 7. Path planning visualization (PSO).
Figure 8. Path planning visualization (PAIR).
Figure 9. Average unified scores of pathfinding algorithms.
Figure 10. Path length vs. number of dynamic obstacles (unit: grid units).
Figure 11. Steps to goal vs. number of dynamic obstacles.
Figure 12. Total replanning time vs. number of dynamic obstacles (unit: milliseconds).
Table 1. Qualitative comparison of planners used in this study. A* denotes A-star; D* Lite denotes D-star Lite; CBS-D* Lite denotes Conflict-Based Search with D* Lite.
Planner | Strengths | Limitations | Ref.
A* | Optimal on static grids with admissible heuristic | No incremental repair; fails under dynamic obstacles | [12,24]
D* Lite | Incremental replanning in changing environments | Can degrade under frequent dynamic updates | [13]
CBS-D* Lite | Conflict-aware multi-agent planning | Conflict-tree overhead; conservative detours under high dynamics | [13]
PSO | Continuous waypoint optimization; smooth paths | Stochastic convergence; high replanning time when iterated | [14,16]
PAIR (A*+PPO) | Fast backbone + learned local corrections; lowest replanning time in sweep | Current PPO reward is navigation-only; MEC coupling is future work | [17]
Table 2. Nomenclature.
Symbol | Definition
P_i | A*-computed waypoint sequence for UAV i
p_{i,j} | jth waypoint in sequence P_i
Δ_{i,j} | Continuous offset applied at waypoint p_{i,j}
d_safe | Minimum safe separation distance
N_FZ | Set of no-fly-zone vertices
π_θ | PPO policy network with parameters θ
a_t | Continuous action (offset) from π_θ at time t
w_d, w_r, w_e | Reward weights for distance, risk, and energy
T | Set of generated tasks, {1, …, M}
w_j | CPU workload of task j (core-seconds)
m_j | Memory footprint of task j (MB)
d_j | Deadline of task j (s) from issuance
x_{i,j} | 1 if task j is assigned to UAV i, else 0
C_i^cpu | CPU capacity of UAV i (core-seconds)
C_i^mem | Memory capacity of UAV i (MB)
B_i^bat | Remaining battery capacity of UAV i (energy units)
τ_{i,j} | Predicted execution time of task j on UAV i (s)
e_j | Estimated energy to execute task j (energy units)
N | Number of UAVs (swarm size)
s_i, g_i | Start and goal positions of UAV i on the grid
G = (V, E) | Grid graph with vertices V and edges E
E_total | Total propulsion-energy surrogate over all UAV paths
T_total | Total travel time aggregated over all UAV paths
L_{A,p} | Path length produced by algorithm A on scenario/path p (grid units)
Δt | Simulator time step (s) used for replanning/broadcasting
Table 3. Algorithm performance on the nine benchmark paths. Values are mean ± standard deviation across the nine scenarios; the travel time is averaged over successful trajectories only.
Algorithm | Success Rate (%) | Avg. Energy (Normalized Units) | Avg. Time (s) | Unified Score
A* | 11.1 ± 33.3 | 899.66 ± 301.03 | 201.4 | 6.62 ± 19.87
D* Lite | 44.4 ± 52.7 | 594.01 ± 482.27 | 179.4 ± 74.4 | 33.62 ± 40.44
CBS-D* | 100.0 ± 0.0 | 126.45 ± 54.67 | 236.8 ± 78.4 | 16.88 ± 35.59
PSO | 100.0 ± 0.0 | 109.44 ± 47.26 | 214.3 ± 69.3 | 73.17 ± 38.13
PAIR | 100.0 ± 0.0 | 104.91 ± 44.36 | 207.8 ± 67.8 | 84.56 ± 34.01
A* succeeds on only one of the nine benchmark paths; the reported time is that single successful trajectory (the standard deviation is therefore not meaningful and is omitted).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
