Skip Content
You are currently on the new version of our website. Access the old version .

26 Results Found

  • Article
  • Open Access
5 Citations
3,711 Views
16 Pages

In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional techniques for reinforcement learning simultaneously hel...

  • Article
  • Open Access
83 Citations
9,966 Views
22 Pages

19 October 2020

Since path planning for multi-arm manipulators is a complicated high-dimensional problem, effective and fast path generation is not easy for the arbitrarily given start and goal locations of the end effector. Especially, when it comes to deep reinfor...

  • Article
  • Open Access
5 Citations
2,068 Views
26 Pages

22 November 2023

This paper explores the use of deep reinforcement learning in solving the multi-agent aircraft traffic planning (individual paths) and collision avoidance problem for a multiple UAS, such as that for a cargo drone network. Specifically, the Deep Q-Ne...

  • Article
  • Open Access
87 Citations
9,045 Views
15 Pages

13 January 2020

In order to enhance performance of robot systems in the manufacturing industry, it is essential to develop motion and task planning algorithms. Especially, it is important for the motion plan to be generated automatically in order to deal with variou...

  • Article
  • Open Access
4 Citations
2,418 Views
17 Pages

26 September 2022

The manipulation of complex robotics, which is in general high-dimensional continuous control without an accurate dynamic model, summons studies and applications of reinforcement learning (RL) algorithms. Typically, RL learns with the objective of ma...

  • Article
  • Open Access
1 Citations
3,106 Views
15 Pages

18 December 2022

Although broad reinforcement learning (BRL) provides a more intelligent autonomous decision-making method for the collision avoidance problem of unmanned surface vehicles (USVs), the algorithm still has the problem of over-estimation and has difficul...

  • Article
  • Open Access
1 Citations
2,121 Views
19 Pages

Extended Maximum Actor–Critic Framework Based on Policy Gradient Reinforcement for System Optimization

  • Jung-Hyun Kim,
  • Yong-Hoon Choi,
  • You-Rak Choi,
  • Jae-Hyeok Jeong and
  • Min-Suk Kim

11 February 2025

Recently, significant research efforts have been directed toward leveraging Artificial Intelligence for sensor data processing and system control. In particular, it is essential to determine the optimal path and trajectory by calculating sensor data...

  • Article
  • Open Access
6 Citations
3,537 Views
17 Pages

6 June 2024

To address the local minimum issue commonly encountered in active collision avoidance using artificial potential field (APF), this paper presents a novel algorithm that integrates APF with deep reinforcement learning (DRL) for robotic arms. Firstly,...

  • Article
  • Open Access
1 Citations
3,225 Views
26 Pages

24 August 2025

The energy management strategy of Hybrid Electric Vehicles (HEVs) plays a key role in improving fuel economy and reducing battery energy consumption. This paper proposes a Deep Reinforcement Learning-based energy management strategy optimized by the...

  • Article
  • Open Access
11 Citations
5,192 Views
18 Pages

Episodic Self-Imitation Learning with Hindsight

  • Tianhong Dai,
  • Hengyan Liu and
  • Anil Anthony Bharath

21 October 2020

Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm, which sam...

  • Article
  • Open Access
1 Citations
2,480 Views
21 Pages

31 March 2025

Recently, autonomous flight has emerged as a key technology in the aerospace and defense sectors; however, traditional code-based autonomous flight systems face limitations in complex environments. Although reinforcement learning offers an alternativ...

  • Article
  • Open Access
29 Citations
5,870 Views
19 Pages

14 March 2021

In the workspace of robot manipulators in practice, it is common that there are both static and periodic moving obstacles. Existing results in the literature have been focusing mainly on the static obstacles. This paper is concerned with multi-arm ma...

  • Article
  • Open Access
16 Citations
4,651 Views
18 Pages

7 December 2022

In this paper, we propose a deep deterministic policy gradient (DDPG)-based path-planning method for mobile robots by applying the hindsight experience replay (HER) technique to overcome the performance degradation resulting from sparse reward proble...

  • Article
  • Open Access
18 Citations
5,143 Views
15 Pages

A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots

  • Tinglong Zhao,
  • Ming Wang,
  • Qianchuan Zhao,
  • Xuehan Zheng and
  • He Gao

The path planning problem has gained more attention due to the gradual popularization of mobile robots. The utilization of reinforcement learning techniques facilitates the ability of mobile robots to successfully navigate through an environment cont...

  • Article
  • Open Access
20 Citations
5,189 Views
20 Pages

29 September 2022

This paper presents a deep reinforcement learning-based path planning algorithm for the multi-arm robot manipulator when there are both fixed and moving obstacles in the workspace. Considering the problem properties such as high dimensionality and co...

  • Article
  • Open Access
3 Citations
3,229 Views
15 Pages

15 December 2022

Sharing prior knowledge across multiple robotic manipulation tasks is a challenging research topic. Although the state-of-the-art deep reinforcement learning (DRL) algorithms have shown immense success in single robotic tasks, it is still challenging...

  • Article
  • Open Access
2 Citations
5,315 Views
15 Pages

1 November 2019

In terms of deep reinforcement learning (RL), exploration is highly significant in achieving better generalization. In benchmark studies, ε-greedy random actions have been used to encourage exploration and prevent over-fitting, thereby improv...

  • Article
  • Open Access
38 Citations
9,851 Views
16 Pages

A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning

  • Luyu Liu,
  • Qianyuan Liu,
  • Yong Song,
  • Bao Pang,
  • Xianfeng Yuan and
  • Qingyang Xu

18 February 2021

Collaborative control of a dual-arm robot refers to collision avoidance and working together to accomplish a task. To prevent the collision of two arms, the control strategy of a robot arm needs to avoid competition and to cooperate with the other on...

  • Article
  • Open Access
22 Citations
5,899 Views
24 Pages

30 June 2022

In this paper, the application of the policy gradient Reinforcement Learning-based (RL) method for obstacle avoidance is proposed. This method was successfully used to control the movements of a robot using trial-and-error interactions with its envir...

  • Article
  • Open Access
1 Citations
1,254 Views
23 Pages

22 April 2025

In addressing the optimal motion planning issue for multi-arm rock drilling robots, this paper introduces a high-precision motion planning method based on Multi-Strategy Sampling RRT* (MSS-RRT*). A dual Jacobi iterative inverse solution method, coupl...

  • Feature Paper
  • Article
  • Open Access
2 Citations
3,560 Views
17 Pages

Event-Triggered Hierarchical Planner for Autonomous Navigation in Unknown Environment

  • Changhao Chen,
  • Bifeng Song,
  • Qiang Fu,
  • Dong Xue and
  • Lei He

27 November 2023

End-to-end deep neural network (DNN)-based motion planners have shown great potential in high-speed autonomous UAV flight. Yet, most existing methods only employ a single high-capacity DNN, which typically lacks generalization ability and suffers fro...

  • Article
  • Open Access
17 Citations
4,041 Views
29 Pages

23 June 2024

Aerial robots (drones) offer critical advantages in missions where human participation is impeded due to hazardous conditions. Among these, search and rescue missions in disaster-stricken areas are particularly challenging due to the dynamic and unpr...

  • Article
  • Open Access
5 Citations
2,528 Views
19 Pages

Biped Robots Control in Gusty Environments with Adaptive Exploration Based DDPG

  • Yilin Zhang,
  • Huimin Sun,
  • Honglin Sun,
  • Yuan Huang and
  • Kenji Hashimoto

As technology rapidly evolves, the application of bipedal robots in various environments has widely expanded. These robots, compared to their wheeled counterparts, exhibit a greater degree of freedom and a higher complexity in control, making the cha...

  • Article
  • Open Access
11 Citations
3,057 Views
32 Pages

End-to-End AUV Local Motion Planning Method Based on Deep Reinforcement Learning

  • Xi Lyu,
  • Yushan Sun,
  • Lifeng Wang,
  • Jiehui Tan and
  • Liwen Zhang

14 September 2023

This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm...

  • Article
  • Open Access
1 Citations
2,019 Views
19 Pages

Stable and Efficient Reinforcement Learning Method for Avoidance Driving of Unmanned Vehicles

  • Sun-Ho Jang,
  • Woo-Jin Ahn,
  • Yu-Jin Kim,
  • Hyung-Gil Hong,
  • Dong-Sung Pae and
  • Myo-Taeg Lim

6 September 2023

Reinforcement learning (RL) has demonstrated considerable potential in solving challenges across various domains, notably in autonomous driving. Nevertheless, implementing RL in autonomous driving comes with its own set of difficulties, such as the o...

  • Article
  • Open Access
2 Citations
2,583 Views
23 Pages

Mars Exploration: Research on Goal-Driven Hierarchical DQN Autonomous Scene Exploration Algorithm

  • Zhiguo Zhou,
  • Ying Chen,
  • Jiabao Yu,
  • Bowen Zu,
  • Qian Wang,
  • Xuehua Zhou and
  • Junwei Duan

22 August 2024

In the non-deterministic, large-scale navigation environment under the Mars exploration mission, there is a large space for action and many environmental states. Traditional reinforcement learning algorithms that can only obtain rewards at target poi...