Previous Article in Journal
Comparative Assessment of the Efficacy of Drone Spraying and Gun Spraying for Nano-Urea Application in a Maize Crop
Previous Article in Special Issue
Reinforcement Learning Stabilization for Quadrotor UAVs via Lipschitz-Constrained Policy Regularization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Safe UAV Control Against Wind Disturbances via Demonstration-Guided Reinforcement Learning

Department of Green Energy and Information Technology, National Taitung University, Taitung 95092, Taiwan
*
Authors to whom correspondence should be addressed.
Drones 2026, 10(1), 2; https://doi.org/10.3390/drones10010002 (registering DOI)
Submission received: 17 October 2025 / Revised: 5 December 2025 / Accepted: 17 December 2025 / Published: 19 December 2025

Abstract

Unmanned Aerial Vehicle (UAV) operating in complex environments require guaranteed safety mechanisms while maintaining high performance. This study addresses the challenge of ensuring strict flight safety during policy execution by implementing a Control Barrier Function (CBF) as a real-time action filter, thereby providing a rigorous, formal guarantee. The methodology integrates the primary Proximal Policy Optimization (PPO) algorithm with a Demonstration-Guided Reinforcement Learning (DGRL), which leverages Proportional–Integral–Derivative (PID) expert trajectories to significantly accelerate learning convergence and enhance sample efficiency. Comprehensive results confirm the efficacy of the hybrid architecture, demonstrating a significant reduction in constraint violations and proving the framework’s ability to substantially accelerate training compared to PPO. In conclusion, the proposed methodology successfully unifies formal safety guarantees with efficient, adaptive reinforcement learning, making it highly suitable for safety-critical autonomous systems.
Keywords: unmanned aerial vehicle; control barrier function; proximal policy optimization; demonstration-guided reinforcement learning unmanned aerial vehicle; control barrier function; proximal policy optimization; demonstration-guided reinforcement learning

Share and Cite

MDPI and ACS Style

Huang, Y.-H.; Liu, E.-J.; Wu, B.-C.; Ning, Y.-J. Safe UAV Control Against Wind Disturbances via Demonstration-Guided Reinforcement Learning. Drones 2026, 10, 2. https://doi.org/10.3390/drones10010002

AMA Style

Huang Y-H, Liu E-J, Wu B-C, Ning Y-J. Safe UAV Control Against Wind Disturbances via Demonstration-Guided Reinforcement Learning. Drones. 2026; 10(1):2. https://doi.org/10.3390/drones10010002

Chicago/Turabian Style

Huang, Yan-Hao, En-Jui Liu, Bo-Cing Wu, and Yong-Jie Ning. 2026. "Safe UAV Control Against Wind Disturbances via Demonstration-Guided Reinforcement Learning" Drones 10, no. 1: 2. https://doi.org/10.3390/drones10010002

APA Style

Huang, Y.-H., Liu, E.-J., Wu, B.-C., & Ning, Y.-J. (2026). Safe UAV Control Against Wind Disturbances via Demonstration-Guided Reinforcement Learning. Drones, 10(1), 2. https://doi.org/10.3390/drones10010002

Article Metrics

Back to TopTop