Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment-Reference-Cited by-同舟云学术

Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment

Published:2012 Issue: Volume: Page:270-280
ISSN:0302-9743
Container-title:Intelligent Information and Database Systems
language:
Short-container-title:

Author:

Miyazaki Kazuteru,Itou Masaki,Kobayashi Hiroaki

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-28487-8_28

Reference15 articles.

1. Abbeel, P., Ng, A.Y.: Exploration and apprenticeship learning in reinforcement learning. In: Proc. of the 22nd International Conference on Machine Learning, pp. 1–8 (2005)

2. Arai, S., Tanaka, N.: Experimental Analysis of Reward Design for Continuing Task in Multiagent Domains – RoboCup Soccer Keepaway. Transactions of the Japanese Society for Artificial Intelligence 21(6), 537–546 (2006) (in Japanese)

3. Kimura, H., Kobayashi, S.: An analysis of actor/critic algorithm using eligibility traces: reinforcement learning with imperfect value function. In: Proc. of the 15th Int. Conf. on Machine Learning, pp. 278–286 (1998)

4. Hong, T., Wu, C.: An Improved Weighted Clustering Algorithm for Determination of Application Nodes in Heterogeneous Sensor Networks. J. of Information Hiding and Multimedia Signal Processing. 2(2), 173–184 (2011)

5. Kuroda, S., Miyazaki, K., Kobayashi, H.: Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot. In: European Workshop on Reinforcement Learning 9 (2011)

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Eligibility traces in an autonomous soccer robot with obstacle avoidance and navigation policy;Applied Soft Computing;2024-10