Update Method of Cost Function to Learn Robust Policy Parameters

Author:

Fujiwara Daigo1,Yamamoto Kosuke1,Ariizumi Ryo2,Hayakawa Tomohiro1,Matsuno Fumitoshi1

Affiliation:

1. Graduate School of Engineering, Kyoto University

2. Graduate School of Engineering, Nagoya University

Publisher

The Institute of Systems, Control and Information Engineers

Reference14 articles.

1. [1] L. P. Kaelbling, M. L. Littman and A. W. Moore: Reinforcement learning: A survey; Journal of Artificial Intelligence Reseach, Vol. 4, No. 1 (1996)

2. [2] J. Kober, J. A. Bagnell and J. Peters: Reinforcement learning in robotics: A survey; International Journal of Robotics Research, Vol. 32, Vo. 11, pp. 1238–1274 (2013)

3. [3] E. A. Theodorou, J. Buchli and S. Schaal: A generalized path integral control approach to reinforcement learning; Journal of Machine Learning Research, Vol. 11, pp. 3137–3181 (2010)

4. [4] J. Kober and J. Peters: Policy search for motor primitives in robotics; Machine Learning, Vol. 84, Nos. 1– 2, pp. 171–203 (2011)

5. [5] A. Ijspeert, J. Nakanishi, S. Schaal, S. Becker, S. Thrun and K. Obermayer: Learning attractor landscapes for learning motor primitives; Advances in Neural Information Processing Systems, pp. 1547–1554 (2002)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3