Proposal of PSwithEFP and its Evaluation in Multi-Agent Reinforcement Learning

Published: 2017-09-20 Volume: 21 Issue: 5 Pages: 930-938
ISSN: 1883-8014
Container-title: Journal of Advanced Computational Intelligence and Intelligent Informatics
Language: en
Short-container-title: JACIII

Author:

Miyazaki Kazuteru, Furukawa Koudai, Kobayashi Hiroaki

Abstract

When multiple agents learn a task simultaneously in an environment, the learning results often become unstable. This is known as the concurrent learning problem, and several methods have been proposed to resolve it. In this paper, we propose a new method that incorporates expected failure probability (EFP) into the action selection strategy to give agents a kind of mutual adaptability. The effectiveness of the proposed method is confirmed using the Keepaway task.

Publisher

Fuji Technology Press Ltd.

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Human-Computer Interaction

References (30 articles)

1. R. S. Sutton and A. G. Barto, “Reinforcement Learning: An Introduction,” A Bradford Book, MIT Press, 1998.

2. S. Arai and N. Tanaka, “Experimental Analysis of Reward Design for Continuing Task in Multiagent Domains – RoboCup Soccer Keepaway –,” Trans. of the Japanese Society for Artificial Intelligence, Vol.21, No.6, pp. 537-546, 2006 (in Japanese).

3. S. Kuroda, K. Miyazaki, and H. Kobayashi, “Introduction of Fixed Mode States into Online Reinforcement Learning with Penalty and Reward and Its Application to Waist Trajectory Generation of Biped Robot,” J. Adv. Comput. Intell. Intell. Inform., Vol.16, No.6, pp. 758-768, 2013.

4. T. Matsui, T. Goto, and K. Izumi, “Acquiring a Government Bond Trading Strategy Using Reinforcement Learning,” J. Adv. Comput. Intell. Intell. Inform., Vol.13, No.6, pp. 691-696, 2009.

5. K. Merrick and M. L. Maher, “Motivated Reinforcement Learning for Adaptive Characters in Open-Ended Simulation Games,” Proc. of the Int. Conf. on Advances in Computer Entertainment Technology, pp. 127-134, 2007.

Cited by 3 articles.

1. Proposal and Evaluation of Detour Path Suppression Method in PS Reinforcement Learning;SICE Journal of Control, Measurement, and System Integration;2019-09-01

2. On Stable Profit Sharing Reinforcement Learning with Expected Failure Probability;Biologically Inspired Cognitive Architectures 2018;2018-08-24

3. Proposal and Evaluation of Reward Sharing Method Based on Safety Level;SICE Journal of Control, Measurement, and System Integration;2018-05-01