Next Article in Journal
Hamilton–Jacobi Wave Theory in Manifestly-Covariant Classical and Quantum Gravity
Previous Article in Journal
Corn Classification System based on Computer Vision
Open AccessArticle

Supervised Reinforcement Learning via Value Function

Space Engineering University, 81 Road, Huairou District, Beijing 101400, China
*
Author to whom correspondence should be addressed.
Symmetry 2019, 11(4), 590; https://doi.org/10.3390/sym11040590
Received: 21 March 2019 / Revised: 15 April 2019 / Accepted: 22 April 2019 / Published: 24 April 2019
Using expert samples to improve the performance of reinforcement learning (RL) algorithms has become one of the focuses of research nowadays. However, in different application scenarios, it is hard to guarantee both the quantity and quality of expert samples, which prohibits the practical application and performance of such algorithms. In this paper, a novel RL decision optimization method is proposed. The proposed method is capable of reducing the dependence on expert samples via incorporating the decision-making evaluation mechanism. By introducing supervised learning (SL), our method optimizes the decision making of the RL algorithm by using demonstrations or expert samples. Experiments are conducted in Pendulum and Puckworld scenarios to test the proposed method, and we use representative algorithms such as deep Q-network (DQN) and Double DQN (DDQN) as benchmarks. The results demonstrate that the method adopted in this paper can effectively improve the decision-making performance of agents even when the expert samples are not available. View Full-Text
Keywords: artificial intelligence; reinforcement learning; supervised learning; DQN; DDQN; expert samples; demonstration artificial intelligence; reinforcement learning; supervised learning; DQN; DDQN; expert samples; demonstration
Show Figures

Figure 1

MDPI and ACS Style

Pan, Y.; Zhang, J.; Yuan, C.; Yang, H. Supervised Reinforcement Learning via Value Function. Symmetry 2019, 11, 590.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop