Body Calibration: Automatic Inter-Task Mapping between Multi-Legged Robots with Different Embodiments in Transfer Reinforcement Learning
Abstract
:1. Introduction
2. Theories, Related Work, and Approach
2.1. Reinforcement Learning
2.2. Transfer Learning
2.3. Heterogeneity in Robots
2.4. Mappings Leveraging with Ontology
2.5. Learning of Inter-Task Mapping
2.6. Body Representation in Human Brain
2.7. Approach
3. Proposed Method: Body Calibration
3.1. Number of Executed Actions
3.2. Body Vector
3.3. Foot Vector
3.4. Body Diagram
3.5. Mapping between Body Diagrams
4. Experiments
4.1. Experimental Setup
4.2. Conditions
4.3. Results of Learning Simulation
4.4. Results for Body Calibration
4.5. Results of Transfer to Robots
4.6. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; The MIT Press: Cambridge, MA, USA, 1998. [Google Scholar]
- Kober, J.; Bagnell, J.A.; Peters, J. Reinforcement learning in robotics: A survey. Int. J. Robot. Res. 2013, 32, 1238–1274. [Google Scholar] [CrossRef] [Green Version]
- Taylor, M.E. Transfer in Reinforcement Learning Domains; Ser. Studies in Computational Intelligence; Springer: Berlin/Heidelberg, Germany, 2009; p. 216. [Google Scholar]
- Lazaric, A. Reinforcement Learning—State of the art. In Transfer in Reinforcement Learning: A Framework and A Survey; Springer: Berlin/Heidelberg, Germany, 2012; Volume 12, pp. 143–173. [Google Scholar]
- Kono, H.; Murata, Y.; Kamimura, A.; Tomita, K.; Suzuki, T. Transfer Learning Method Using Ontology for Heterogeneous Multi-agent Reinforcement Learning. Int. J. Adv. Comput. Sci. Appl. 2014, 5, 15–164. [Google Scholar] [CrossRef] [Green Version]
- Taylor, M.E.; Kuhlmann, G.; Stone, P. Autonomous transfer for reinforcement learning. In Proceedings of the Seventh International Joint Conference on Autonomous Agents and Multiagent Systems, Estoril, Portugal, 12–16 May 2008; pp. 283–290. [Google Scholar]
- Fachantidis, A.; Partalas, I.; Taylor, M.E.; Vlahavas, I. Transfer learning with probabilistic mapping selection. Adapt. Behav. 2015, 23, 3–19. [Google Scholar] [CrossRef] [Green Version]
- Cheng, Q.; Wang, X.; Shen, L. An Autonomous Inter-task Mapping Learning Method via Artificial Network for Transfer Learning. In Proceedings of the IEEE International Conference on Robotics and Biomimetics, Macau, China, 5–8 December 2017; pp. 768–773. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjel, A.K.; Ostrovski, G.; et al. Human-Level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
- Hou, Y.; Ong, Y.; Feng, L.; Zurada, J.M. An Evolutionary Transfer Reinforcement Learning Framework for Multiagent Systems. IEEE Trans. Evol. Comput. 2017, 21, 601–615. [Google Scholar] [CrossRef]
- Peng, X.B.; Coumans, E.; Zhang, T.; Lee, T.-W.E.; Tan, J.; Levine, S. Learning Agile Robotic Locomotion Skills by Imitating Animals. In Proceedings of the Robotics: Science and Systems, Corvalis, OR, USA, 12–16 July 2020. [Google Scholar]
- Kono, H.; Katayama, R.; Takakuwa, Y.; Wen, W.; Suzuki, T. Activation and Spreading Sequence for Spreading Activation Policy Selection Method in Transfer Reinforcement Learning. Int. J. Adv. Comput. Sci. Appl. 2019, 10, 7–16. [Google Scholar] [CrossRef]
- Ota, J. Understanding Brain Plasticity on Body Representations to Promote Their Adaptive Functions; 2018 Annual Report; Kaken: Tokyo, Japan, 2018. [Google Scholar]
- Wen, W.; Katsutoshi, M.; Shunsuke, H.; Qi, A.; Hiroshi, Y.; Yusuke, T.; Atsushi, Y.; Hajime, A. Goal-Directed Movement Enhances Body Representation Updating. Front. Hum. Neurosci. 2016, 10, 1–10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Watkins, C.J.C.H.; Dayan, P. Q-Learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar] [CrossRef]
- Peng, X.B.; Andrychowicz, M.; Zaremba, W.; Abbeel, P. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation, Brisbane, Australia, 21–25 May 2018; pp. 3803–3810. [Google Scholar]
- Baar, J.V.; Sullivan, A.; Cordorel, R.; Jha, D.; Romeres, D.; Nikovski, D. Sim–to–Real Transfer Learning using Robustified Controllers in Robotic Tasks involving Complex Dynamics. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; pp. 6001–6007. [Google Scholar]
- Hwasser, M.; Kragic, D.; Antonova, R. Variational Auto-Regularized Alignment for Sim–to–Real Control. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 2732–2738. [Google Scholar]
- Webots: Open Source Robot Simulator. Available online: https://cyberbotics.com/ (accessed on 30 January 2022).
- ROBOTIS: Dynamixel XL-320. Available online: https://www.robotis.us/dynamixel-xl-320/ (accessed on 30 January 2022).
Parameters | Values |
---|---|
Number of trials | 5 |
Maximum number of episodes | 100,000 |
Learning rate | 0.1 |
Discount rate | 0.99 |
Reward at reaching goal r | 10 |
Reward per step | −0.05 |
Reward for body contact with ground | −0.1 |
Temperature of Boltzmann selection T | 0.1 |
Action No. , Motor IDs , and Number of Executed Actions | Body Vector | Foot Vector |
---|---|---|
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() |
Action No. , Motor IDs , and Number of Executed Actions | Body Vector | Foot Vector |
---|---|---|
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() |
Action No. , Motor IDs , and Number of Executed Actions | Body Vector | Foot Vector |
---|---|---|
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() | ||
() |
Action Number of Virtual Agent | Mapping to Robot 1 by Proposed Method | Mapping to Robot 1 by Hand Coding | Mapping to Robot 2 by Proposed Method | Mapping to Robot 2 by Hand Coding |
---|---|---|---|---|
0 | 0 | 0 | 0 | 0 |
1 | 1 | 1 | 1 | 1 |
2 | 2 | 2 | 7 | 2 |
3 | 3 | 3 | 6 | 3 |
4 | 7 | 4 | 4 | N/M 4 |
5 | 6 | 5 | 5 | N/M 4 |
6 | 5 | 6 | N/M 4 | N/M 4 |
7 | 4 | 7 | N/M 4 | N/M 4 |
8 | 11 | 8 | N/M 4 | 4 |
9 | 10 | 9 | N/M 4 | 5 |
10 | 9 | 10 | 3 | 6 |
11 | 8 | 11 | 2 | 7 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ikeda, S.; Kono, H.; Watanabe, K.; Suzuki, H. Body Calibration: Automatic Inter-Task Mapping between Multi-Legged Robots with Different Embodiments in Transfer Reinforcement Learning. Actuators 2022, 11, 140. https://doi.org/10.3390/act11050140
Ikeda S, Kono H, Watanabe K, Suzuki H. Body Calibration: Automatic Inter-Task Mapping between Multi-Legged Robots with Different Embodiments in Transfer Reinforcement Learning. Actuators. 2022; 11(5):140. https://doi.org/10.3390/act11050140
Chicago/Turabian StyleIkeda, Satoru, Hitoshi Kono, Kaori Watanabe, and Hidekazu Suzuki. 2022. "Body Calibration: Automatic Inter-Task Mapping between Multi-Legged Robots with Different Embodiments in Transfer Reinforcement Learning" Actuators 11, no. 5: 140. https://doi.org/10.3390/act11050140
APA StyleIkeda, S., Kono, H., Watanabe, K., & Suzuki, H. (2022). Body Calibration: Automatic Inter-Task Mapping between Multi-Legged Robots with Different Embodiments in Transfer Reinforcement Learning. Actuators, 11(5), 140. https://doi.org/10.3390/act11050140