Path Searching of Robot Manipulator Using Reinforcement Learning-Reduction of Searched Configuration Space Using SOM and Multistage Learning--Reference-Cited by-同舟云学术

Path Searching of Robot Manipulator Using Reinforcement Learning-Reduction of Searched Configuration Space Using SOM and Multistage Learning-

Published:2010-08-20 Issue:4 Volume:22 Page:532-541
ISSN:1883-8049
Container-title:Journal of Robotics and Mechatronics
language:en
Short-container-title:J. Robot. Mechatron.

Author:

Aoyagi Seiji, ,Hiraoka Kenji

Abstract

Reinforcement learning is applicable to a robot manipulator required to search for a path adaptable to an unknown environment. Searching for an optimal path in configuration space (C-space), i.e., joint angle space, however, takes much convergence time and memory resources. We propose two ways to overcome this problem. One is restructuring C-space by using Self-Organizing Maps (SOM). Another is doing reinforcement learning at multistage, stage 1 of which searches a path in C-space without considering obstacles, so does stage 2 with considering them near path 1, reducing searched space and convergence time. We propose further reducing searched space by adjusting the path in stage 2 to that in stage 1 through dynamic programming (DP) matching.

Publisher

Fuji Technology Press Ltd.

Subject

Electrical and Electronic Engineering,General Computer Science

Reference17 articles.

1. R. S. Sutton and A. G. Barto (translated by S. Mikami and M. Minagawa), “Reinforcement Learning,” Morikita Publishing Co., Ltd., Tokyo, 2000.

2. E. W. Dijkstra, “A Note on Two Problems in Connexion with Graphs,” Numerische Mathematik, Vol.1, No.1, pp. 269-271, 1959.

3. E. Hart, N. J. Nilsson, and B. Raphael, “A Formal Basis for the Heuristic Determination of Minimum Cost Paths,” IEEE Transactions on Systems Science and Cybernetics, Vol.4, No.2, pp. 100-108, 1968.

4. J. C. Latombe, “Robot Motion Planning,” Kluwer Academic Publisher, Norwell, MA, 1991.

5. J. Kuffner and S. M. Lavalle, “RRT-Connect: An Efficient Approach to Single-Query Path Planning,” in Proc. IEEE Int. Conf. Robotics and Automation (ICRA 2000), pp. 995-1001, 2000.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Self-Generation of Reward by Moderate-Based Index for Senor Inputsvspace;Journal of Robotics and Mechatronics;2015-02-20

2. Self-Generation of Reward Based on Sensory Irritation Resulted from Interaction Between a Human and Arobot;Journal of Japan Society for Fuzzy Theory and Intelligent Informatics;2015

3. 10.2493/jjspe.77.280;Journal of the Japan Society for Precision Engineering;2011