Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning
Highlights
- A hierarchical collaborative game framework that combines hierarchical reinforcement learning with dynamic target allocation is proposed for autonomous pursuit–evasion.
- The hierarchical reinforcement learning method, based on trajectory prediction and stable auxiliary gradients, improves the win rate and generates smooth flight paths.
- Simulations of pursuit–evasion games at several scales demonstrate the framework's advantages in training time and large-scale scalability.
- The hierarchical reinforcement learning method based on trajectory prediction and stable auxiliary gradients offers a feasible route to deployment on real UAVs.
Abstract
1. Introduction
2. Problem Modeling
2.1. Game Model of Pursuit–Evasion with Multi-Fixed-Wing UAV
2.2. Fixed-Wing UAV Model
2.3. Markov Decision Process
2.4. PPO Algorithm
3. Hierarchical Maneuvering Decision-Making Algorithm
3.1. Reinforcement Learning Aircraft Controller with Stable Auxiliary Gradients
3.1.1. Observation Space and Action Space
3.1.2. Reward Function
- where Δh is the difference between the expected altitude and the current altitude, and k_h is the scale factor for the altitude error.
- where Δψ is the difference between the expected heading and the current heading, and k_ψ is the scale factor for the heading error.
- where Δv is the difference between the expected velocity and the current velocity, and k_v is the scale factor for the velocity error.
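The three shaping terms above can be sketched in code. Note this is a minimal illustration, not the authors' exact formulation: the exponential form of each term, the scale-factor values, and all function names are assumptions chosen only to show how per-channel tracking errors combine into one control reward.

```python
import math

def tracking_reward(delta, scale):
    """Assumed exponential form: reward in (0, 1], equal to 1 at zero error."""
    return math.exp(-scale * abs(delta))

def control_reward(dh, dpsi, dv, k_h=0.01, k_psi=1.0, k_v=0.1):
    # One term per controlled channel: altitude, heading, velocity.
    r_h = tracking_reward(dh, k_h)        # altitude error term (Δh, k_h)
    r_psi = tracking_reward(dpsi, k_psi)  # heading error term (Δψ, k_ψ)
    r_v = tracking_reward(dv, k_v)        # velocity error term (Δv, k_v)
    # Multiplying keeps the total in (0, 1] and forces all three errors small.
    return r_h * r_psi * r_v
```

With zero error on every channel the reward is exactly 1, and it decays smoothly as any single tracking error grows.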
3.1.3. Stabilizing the Auxiliary Gradient
| Algorithm 1: SAG-PPO |  |
|---|---|
| 1 | Initialize the policy network, the value network, and the auxiliary network; initialize the Lagrange multiplier λ = 0.1, the hyperparameters, and the upper bound on smoothness |
| 2 | **for** step = 1 **to** Max-steps **do** |
| 3 | step the environment |
| 4 | collect (s, a, r, s′, κ, done) and store it in the buffer |
| 5 | **if** the buffer is full **then** |
| 6 | calculate the critic loss |
| 7 | calculate its gradient and update the critic parameters |
| 8 | calculate the main loss |
| 9 | calculate the loss of the auxiliary network |
| 10 | combine them to obtain the total loss |
| 11 | calculate the gradient and update the actor parameters |
| 12 | calculate the gradient and update the auxiliary-network parameters |
| 13 | update the Lagrange multiplier |
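Lines 10 and 13 of Algorithm 1 can be sketched as a standard Lagrangian combination plus dual ascent. The exact loss terms are not reproduced here, so every name (`main_loss`, `aux_loss`, `kappa`, `bound`, `lr_dual`) and the update form are illustrative assumptions, not the paper's definitions.

```python
def sag_total_loss(main_loss, aux_loss, lam):
    # Line 10 (assumed form): total loss = main PPO loss + λ · auxiliary loss.
    return main_loss + lam * aux_loss

def dual_ascent(lam, kappa, bound, lr_dual=0.01):
    # Line 13 (assumed form): raise λ when the smoothness measure κ exceeds
    # its upper bound, relax it otherwise, and keep λ non-negative.
    return max(0.0, lam + lr_dual * (kappa - bound))
```

Under this sketch, λ grows only while the smoothness constraint is violated, so the auxiliary penalty fades once the policy's actions are smooth enough.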
3.2. Maneuver Decision-Making Layer Based on Trajectory Prediction and Hierarchical Reinforcement Learning
3.2.1. Trajectory Prediction
3.2.2. Observation Space and Action Space
3.2.3. Reward Shaping
3.2.4. Hierarchical Reinforcement Learning Algorithm
| Algorithm 2: Hierarchical PPO |  |
|---|---|
| 1 | Load the low-level model and set it to eval mode; initialize the actor and critic parameters and the hyperparameters |
| 2 | **for** step = 1 **to** Max-steps **do** |
| 3 | get an action from the actor |
| 4 | pass the action to the low-level model |
| 5 | output the low-level action to the environment |
| 6 | step the environment |
| 7 | collect (s, a, r, s′, κ, done) and store it in the buffer |
| 8 | **if** the buffer is full **then** |
| 9 | calculate the critic loss |
| 10 | calculate its gradient and update the critic parameters |
| 11 | calculate the actor loss |
| 12 | calculate its gradient and update the actor parameters |
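Lines 3–5 of Algorithm 2 describe the hierarchical rollout: the high-level actor selects a macro-action, and the frozen low-level controller translates it into control commands. The sketch below illustrates only that data flow; the class, the control-surface keys, and the goal format (Δh, Δψ, Δv) are hypothetical placeholders for the pretrained model.

```python
class FrozenLowLevel:
    """Stand-in for the pretrained low-level controller held in eval mode."""
    def act(self, state, goal):
        dh, dpsi, dv = goal
        # A real controller would map (state, goal) to stick/throttle commands;
        # here we just echo the goal to show the interface.
        return {"elevator": dh, "aileron": dpsi, "throttle": dv}

def hierarchical_step(high_policy, low_level, state):
    goal = high_policy(state)             # line 3: get macro-action from actor
    control = low_level.act(state, goal)  # line 4: pass it to the low level
    return control                        # line 5: control sent to the env
```

Because the low-level model is frozen, only the high-level actor and critic receive gradient updates in lines 9–12.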
3.3. Dynamic Target Allocation Algorithm Based on Dynamic Value Adjustment
3.3.1. Situation Assessment
3.3.2. Dynamic Target Allocation Method
4. Simulation Results
4.1. Simulation Comparison of Flight Control Layer
4.2. Simulation Comparison of Maneuvering Decision-Making Layer
4.2.1. Trajectory Prediction Simulation
4.2.2. The Training Results of Hierarchical Reinforcement Learning
4.3. Target Allocation Simulation
4.4. Overall Algorithm Comparison
4.5. Analysis of the Impact of Termination Conditions
4.6. Failed Case Analysis
4.7. Ablation Experiment
5. Conclusions
Author Contributions
Funding
Data Availability Statement
DURC Statement
Conflicts of Interest
| Variable Type | Variable Symbol | Meaning |
|---|---|---|
| Basic state variables | h | Height |
|  | v_x | Velocity component in the x direction |
|  | v_y | Velocity component in the y direction |
|  | v_z | Velocity component in the z direction |
|  | v | Magnitude of velocity |
|  | Δv | Change in velocity |
|  | q | Pitch angular velocity |
| Relative geometric state variables | s_t | Current state of the opponent UAV |
|  | s_{t+Δt} | Predicted state of the opponent UAV one prediction interval ahead |
|  | s_{t+2Δt} | Predicted state of the opponent UAV two prediction intervals ahead |
| Action Variables | Scope |
|---|---|
| Δh (km) | {−0.2, −0.1, 0, 0.1, 0.2} |
| Δψ (radian) | {−π/6, −π/12, 0, π/12, π/6} |
| Δv (Mach) | {−0.2, −0.1, 0, 0.1, 0.2} |
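The discrete macro-action table above can be enumerated directly: one value is chosen per channel, so the joint action set has 5 × 5 × 5 = 125 combinations. The variable names below are illustrative; only the numerical values come from the table.

```python
import itertools
import math

DELTA_H = [-0.2, -0.1, 0.0, 0.1, 0.2]                               # altitude change, km
DELTA_PSI = [-math.pi/6, -math.pi/12, 0.0, math.pi/12, math.pi/6]   # heading change, rad
DELTA_V = [-0.2, -0.1, 0.0, 0.1, 0.2]                               # velocity change, Mach

# Joint discrete action space: every (Δh, Δψ, Δv) combination.
ACTIONS = list(itertools.product(DELTA_H, DELTA_PSI, DELTA_V))
```

A discrete policy head then only needs to output an index in [0, 125), which keeps the high-level action space small regardless of the number of UAVs.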
| Name | Value |
|---|---|
| n_rollout_threads | 32 |
| use_valuenorm | True |
| hidden_sizes | [128, 128] |
| activation_func | relu |
| gain | 0.01 |
| use_recurrent_policy | True |
| recurrent_n | 1 |
| learn_rate | 0.0005 |
| Scale | Initial Conditions | Similar Conditions | Simple Conditions | Strict Conditions |
|---|---|---|---|---|
| 3V3 | 63.8% | 60.8% | 53.4% | 25.4% |
| 6V6 | 76.5% | 73.5% | 48.3% | 32.5% |
| 9V9 | 84.7% | 79.4% | 55.6% | 38.5% |
| 12V12 | 88.0% | 85.6% | 59.1% | 41.3% |
| Method | 3V3 Win Rate | 3V3 Avg. OSSM | 6V6 Win Rate | 6V6 Avg. OSSM | 9V9 Win Rate | 9V9 Avg. OSSM | 12V12 Win Rate | 12V12 Avg. OSSM |
|---|---|---|---|---|---|---|---|---|
| Ours (baseline) | 63.8% | 0.32 | 76.5% | 0.35 | 84.7% | 0.38 | 88.0% | 0.34 |
| Ours w/o DTA | 56.3% | 0.33 | 50.1% | 0.36 | 42.6% | 0.39 | 31.5% | 0.32 |
| Ours w/o TP | 52.3% | 0.31 | 68.5% | 0.36 | 73.5% | 0.35 | 76.7% | 0.31 |
| Ours w/o SAG | 60.6% | 0.85 | 72.5% | 0.83 | 80.3% | 0.81 | 83.5% | 0.86 |
| Ours w/o HL | 51.6% | 1.23 | 59.5% | 1.18 | 64.6% | 1.33 | 71.5% | 1.21 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tan, M.; Sun, H.; Ding, D.; Zhou, H.; Liu, Y. Scalable Pursuit–Evasion Game for Multi-Fixed-Wing UAV Based on Dynamic Target Assignment and Hierarchical Reinforcement Learning. Drones 2026, 10, 5. https://doi.org/10.3390/drones10010005