Next Article in Journal
MMER-LMF: Multi-Modal Emotion Recognition in Lightweight Modality Fusion
Previous Article in Journal
Non-Invasive Voltage Measurement Device Based on MEMS Electric Field Sensor and Applications
Previous Article in Special Issue
Knowledge-Enhanced Deep Reinforcement Learning for Multi-Agent Game
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Distributed Pursuit–Evasion Game Decision-Making Based on Multi-Agent Deep Reinforcement Learning

by
Yanghui Lin
1,
Han Gao
1,2,* and
Yuanqing Xia
1
1
School of Automation, Beijing Institute of Technology, Beijing 100081, China
2
Advanced Technology Research Institute, Beijing Institute of Technology, Beijing 100081, China
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(11), 2141; https://doi.org/10.3390/electronics14112141
Submission received: 14 April 2025 / Revised: 17 May 2025 / Accepted: 22 May 2025 / Published: 24 May 2025
(This article belongs to the Special Issue Advanced Control Strategies and Applications of Multi-Agent Systems)

Abstract

Pursuit–evasion games are a fundamental framework for advancing autonomous decision-making and cooperative control in multi-UAV systems. However, the application of reinforcement learning to pursuit–evasion games involving fixed-wing UAVs remains challenging due to constraints, such as minimum velocity, limited turning radius, and high-dimensional continuous action spaces. To address these issues, this paper proposes a method that integrates automatic curriculum learning with multi-agent proximal policy optimization. A self-play mechanism is introduced to simultaneously train both pursuers and evaders, enabling dynamic and adaptive encirclement strategies. In addition, a reward structure specifically tailored for the encirclement task was designed to guide the pursuers in gradually achieving the encirclement of the evader while ensuring their own safety. To further improve training efficiency and convergence, this paper develops a subgame curriculum learning framework that progressively exposes agents to increasingly complex scenarios, facilitating experience accumulation and skill transfer. The simulation results demonstrate that the proposed approach improves learning efficiency and cooperative pursuit performance under realistic fixed-wing UAV dynamics. This work provides a practical and scalable solution for multiple fixed-wing UAV pursuit–evasion missions in complex environments.
Keywords: deep reinforcement learning; curriculum learning; pursuit–evasion game; unmanned aerial vehicle deep reinforcement learning; curriculum learning; pursuit–evasion game; unmanned aerial vehicle

Share and Cite

MDPI and ACS Style

Lin, Y.; Gao, H.; Xia, Y. Distributed Pursuit–Evasion Game Decision-Making Based on Multi-Agent Deep Reinforcement Learning. Electronics 2025, 14, 2141. https://doi.org/10.3390/electronics14112141

AMA Style

Lin Y, Gao H, Xia Y. Distributed Pursuit–Evasion Game Decision-Making Based on Multi-Agent Deep Reinforcement Learning. Electronics. 2025; 14(11):2141. https://doi.org/10.3390/electronics14112141

Chicago/Turabian Style

Lin, Yanghui, Han Gao, and Yuanqing Xia. 2025. "Distributed Pursuit–Evasion Game Decision-Making Based on Multi-Agent Deep Reinforcement Learning" Electronics 14, no. 11: 2141. https://doi.org/10.3390/electronics14112141

APA Style

Lin, Y., Gao, H., & Xia, Y. (2025). Distributed Pursuit–Evasion Game Decision-Making Based on Multi-Agent Deep Reinforcement Learning. Electronics, 14(11), 2141. https://doi.org/10.3390/electronics14112141

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop