Proposal and Evaluation of Detour Path Suppression Method in PS Reinforcement Learning-Reference-Cited by-同舟云学术

Proposal and Evaluation of Detour Path Suppression Method in PS Reinforcement Learning

Published:2019-09-01 Issue:5 Volume:12 Page:190-198
ISSN:1882-4889
Container-title:SICE Journal of Control, Measurement, and System Integration
language:en
Short-container-title:SICE Journal of Control, Measurement, and System Integration

Author:

Shiraishi Daisuke¹,Miyazaki Kazuteru²,Kobayashi Hiroaki¹

Affiliation:

1. Meiji University

2. National Institution for Academic Degrees and Quality Enhancement of Higher Education

Publisher

Informa UK Limited

Link

https://www.tandfonline.com/doi/pdf/10.9746/jcmsi.12.190

Reference29 articles.

1. [1] R.S. Sutton and A.G. Barto: Reinforcement learning: An introduction, A Bradford Book, MIT Press, 1998.

2. [2] C.J.C.H. Watkins and P. Dayan: Technical note: Q-learning, Machine Learning, Vol. 8, No. 3-4, pp. 55-68, 1992.

3. [3] G.A. Rummery and M. Niranjan: On-line Q-learning using connectionist systems, Technical Report CUED/F-INFENG/, TR-166, 1994.

4. [4] J.J. Grefenstette: Credit assignment in rule discovery systems based on genetic algorithms, Machine Learning, Vol. 3, No. 2-3, pp. 225-245, 1988.

5. [5] K. Miyazaki, M. Yamamura, and S. Kobayashi: A theory of profit sharing in reinforcement learning, Transactions of the Japanese Society for Artificial Intelligence, Vol. 9, No. 4, pp. 580-587, 1994 (in Japanese).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reward Value-Based Goal Selection for Agents’ Cooperative Route Learning Without Communication in Reward and Goal Dynamism;SN Computer Science;2020-05