Next Article in Journal
EMAFG-RTDETR: An Improved RTDETR Algorithm for UAV-Based Concrete Defect Detection
Previous Article in Journal
Robust Extended Object Tracking Based on Variational Bayesian for Unmanned Aerial Vehicles Under Unknown Outliers
Previous Article in Special Issue
A Flexible Combinatorial Auction Algorithm (FCAA) for Multi-Task Collaborative Scheduling of Heterogeneous UAVs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning

1
Aviation Engineering School, Air Force Engineering University, Xi’an 710038, China
2
School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
3
93420 Forces, Shijiazhuang 050011, China
*
Author to whom correspondence should be addressed.
Submission received: 11 November 2025 / Revised: 18 December 2025 / Accepted: 19 December 2025 / Published: 23 December 2025

Abstract

The unmanned aerial vehicle (UAV) pursuit–evasion game is the fundamental framework for promoting autonomous decision-making and collaborative control of multi-UAV systems. Faced with the limitations of current deep reinforcement learning methods in terms of transferability and generalization for scalable multi-fixed-wing UAV pursuit–evasion game scenarios, this paper proposes a hierarchical collaborative pursuit–evasion game framework based on target allocation and hierarchical reinforcement learning. The framework comprises three layers: target allocation layer, maneuver decision-making layer, and flight control layer. The target allocation layer employs a dynamic target assignment method based on a dynamic value adjustment mechanism, decomposing the multi-vs.-multi pursuit–evasion game into several one-vs.-one confrontations. The maneuver decision-making layer utilizes a maneuver decision-making method based on trajectory prediction and hierarchical reinforcement learning to generate adversarial maneuver commands. The flight control layer adopts a stable gradient-assisted reinforcement learning flight controller to ensure stable UAV flight. Comparisons with other algorithms across 3V3, 6V6, 9V9, and 12V12 scenarios demonstrate that the proposed method achieves high win rates in diverse game scales. The comparison results also demonstrate the advantages of the framework proposed in this paper in terms of training efficiency and large-scale scalability.
Keywords: scalability; target allocation; hierarchical reinforcement learning; pursuit–evasion game; stability-assisted gradient; trajectory prediction scalability; target allocation; hierarchical reinforcement learning; pursuit–evasion game; stability-assisted gradient; trajectory prediction

Share and Cite

MDPI and ACS Style

Tan, M.; Sun, H.; Ding, D.; Zhou, H.; Liu, Y. Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning. Drones 2026, 10, 5. https://doi.org/10.3390/drones10010005

AMA Style

Tan M, Sun H, Ding D, Zhou H, Liu Y. Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning. Drones. 2026; 10(1):5. https://doi.org/10.3390/drones10010005

Chicago/Turabian Style

Tan, Mulai, Haocheng Sun, Dali Ding, Huan Zhou, and Yongli Liu. 2026. "Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning" Drones 10, no. 1: 5. https://doi.org/10.3390/drones10010005

APA Style

Tan, M., Sun, H., Ding, D., Zhou, H., & Liu, Y. (2026). Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning. Drones, 10(1), 5. https://doi.org/10.3390/drones10010005

Article Metrics

Back to TopTop