Next Article in Journal
Performance Analysis of Cooperative NOMA Networks with Imperfect CSI over Nakagami-m Fading Channels
Next Article in Special Issue
A Generic Design of Driver Drowsiness and Stress Recognition Using MOGA Optimized Deep MKL-SVM
Previous Article in Journal
Magnetoelastic Humidity Sensors with TiO2 Nanotube Sensing Layers
Previous Article in Special Issue
Simultaneous Optimization of Vehicle Arrival Time and Signal Timings within a Connected Vehicle Environment
 
 
Article

An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning

School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(2), 426; https://doi.org/10.3390/s20020426
Received: 27 November 2019 / Revised: 31 December 2019 / Accepted: 2 January 2020 / Published: 11 January 2020
Deep reinforcement learning (DRL) has excellent performance in continuous control problems and it is widely used in path planning and other fields. An autonomous path planning model based on DRL is proposed to realize the intelligent path planning of unmanned ships in the unknown environment. The model utilizes the deep deterministic policy gradient (DDPG) algorithm, through the continuous interaction with the environment and the use of historical experience data; the agent learns the optimal action strategy in a simulation environment. The navigation rules and the ship’s encounter situation are transformed into a navigation restricted area, so as to achieve the purpose of planned path safety in order to ensure the validity and accuracy of the model. Ship data provided by ship automatic identification system (AIS) are used to train this path planning model. Subsequently, the improved DRL is obtained by combining DDPG with the artificial potential field. Finally, the path planning model is integrated into the electronic chart platform for experiments. Through the establishment of comparative experiments, the results show that the improved model can achieve autonomous path planning, and it has good convergence speed and stability. View Full-Text
Keywords: unmanned ships; deep reinforcement learning; DDPG; autonomous path planning; end-to-end; collision avoidance unmanned ships; deep reinforcement learning; DDPG; autonomous path planning; end-to-end; collision avoidance
Show Figures

Figure 1

MDPI and ACS Style

Guo, S.; Zhang, X.; Zheng, Y.; Du, Y. An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning. Sensors 2020, 20, 426. https://doi.org/10.3390/s20020426

AMA Style

Guo S, Zhang X, Zheng Y, Du Y. An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning. Sensors. 2020; 20(2):426. https://doi.org/10.3390/s20020426

Chicago/Turabian Style

Guo, Siyu, Xiuguo Zhang, Yisong Zheng, and Yiquan Du. 2020. "An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning" Sensors 20, no. 2: 426. https://doi.org/10.3390/s20020426

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop