Author:
Aoyagi Seiji, ,Hiraoka Kenji
Abstract
Reinforcement learning is applicable to a robot manipulator required to search for a path adaptable to an unknown environment. Searching for an optimal path in configuration space (C-space), i.e., joint angle space, however, takes much convergence time and memory resources. We propose two ways to overcome this problem. One is restructuring C-space by using Self-Organizing Maps (SOM). Another is doing reinforcement learning at multistage, stage 1 of which searches a path in C-space without considering obstacles, so does stage 2 with considering them near path 1, reducing searched space and convergence time. We propose further reducing searched space by adjusting the path in stage 2 to that in stage 1 through dynamic programming (DP) matching.
Publisher
Fuji Technology Press Ltd.
Subject
Electrical and Electronic Engineering,General Computer Science
Reference17 articles.
1. R. S. Sutton and A. G. Barto (translated by S. Mikami and M. Minagawa), “Reinforcement Learning,” Morikita Publishing Co., Ltd., Tokyo, 2000.
2. E. W. Dijkstra, “A Note on Two Problems in Connexion with Graphs,” Numerische Mathematik, Vol.1, No.1, pp. 269-271, 1959.
3. E. Hart, N. J. Nilsson, and B. Raphael, “A Formal Basis for the Heuristic Determination of Minimum Cost Paths,” IEEE Transactions on Systems Science and Cybernetics, Vol.4, No.2, pp. 100-108, 1968.
4. J. C. Latombe, “Robot Motion Planning,” Kluwer Academic Publisher, Norwell, MA, 1991.
5. J. Kuffner and S. M. Lavalle, “RRT-Connect: An Efficient Approach to Single-Query Path Planning,” in Proc. IEEE Int. Conf. Robotics and Automation (ICRA 2000), pp. 995-1001, 2000.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献