This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios
by
Guoquan Xiao
Guoquan Xiao 1
,
Ruijie Rao
Ruijie Rao 1,
Yuanming Chen
Yuanming Chen 2
and
Xiaobin Hong
Xiaobin Hong 1,*
1
School of Mechanic and Automotive Engineering, South China University of Technology, Guangzhou 510641, China
2
School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510641, China
*
Author to whom correspondence should be addressed.
Drones 2026, 10(6), 468; https://doi.org/10.3390/drones10060468 (registering DOI)
Submission received: 7 May 2026
/
Revised: 16 June 2026
/
Accepted: 16 June 2026
/
Published: 18 June 2026
Abstract
Autonomous navigation of unmanned surface vehicles (USVs) in complex obstacle scenarios remains challenging due to redundant perception inputs, unstable value estimation, and inefficient policy convergence. To address these problems, this paper proposes LDA-D3QN, an improved deep reinforcement learning method for USV autonomous navigation. The proposed method constructs a compact navigation state representation by combining target-related information with local obstacle features, allowing the agent to retain key decision-making information while reducing unnecessary environmental redundancy. Based on this representation, an enhanced value-learning framework is developed to improve the stability of navigation decisions in cluttered environments. Moreover, a reward-guided and staged training strategy is introduced to help the agent gradually adapt to increasingly complex navigation tasks. The proposed method was evaluated on a Unity–ROS–MATLAB integrated simulation platform. Experimental results show that LDA-D3QN achieves superior overall navigation performance compared with several representative reinforcement learning algorithms. Specifically, the proposed method achieves a final training success rate of 91.4%, outperforming PPO (82.3%), Dueling DQN (78.5%), Double DQN (79.8%), and Rainbow DQN (86.5%). Additional tests in complex multi-obstacle and multi-target scenarios further demonstrate that the learned policy can generate safe, stable, and effective navigation behaviors. Preliminary validation using real-USV sensor data also confirms the feasibility of the LiDAR and GPS data processing procedures, providing a basis for future closed-loop autonomous navigation experiments and multi-sensor fusion deployment.
Share and Cite
MDPI and ACS Style
Xiao, G.; Rao, R.; Chen, Y.; Hong, X.
LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios. Drones 2026, 10, 468.
https://doi.org/10.3390/drones10060468
AMA Style
Xiao G, Rao R, Chen Y, Hong X.
LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios. Drones. 2026; 10(6):468.
https://doi.org/10.3390/drones10060468
Chicago/Turabian Style
Xiao, Guoquan, Ruijie Rao, Yuanming Chen, and Xiaobin Hong.
2026. "LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios" Drones 10, no. 6: 468.
https://doi.org/10.3390/drones10060468
APA Style
Xiao, G., Rao, R., Chen, Y., & Hong, X.
(2026). LDA-D3QN-Based Autonomous Navigation for Unmanned Surface Vehicles in Complex Obstacle Scenarios. Drones, 10(6), 468.
https://doi.org/10.3390/drones10060468
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.