Estimating Passive Dynamics Distributions and State Costs in Linearly Solvable Markov Decision Processes during Z Learning Execution

Published: 2014-01-01, Volume: 7, Issue: 1, Pages: 48-54
ISSN:1882-4889
Container-title:SICE Journal of Control, Measurement, and System Integration
language:en
Short-container-title:SICE Journal of Control, Measurement, and System Integration

Author:

Burdelis Mauricio¹, Ikeda Kazushi¹

Affiliation:

1. Graduate School of Information Science, Nara Institute of Science and Technology (NAIST)

Publisher

Informa UK Limited

Link

https://www.tandfonline.com/doi/pdf/10.9746/jcmsi.7.48

References: 17 articles.

1. R.S. Sutton and A.G. Barto: Reinforcement learning: An introduction, MIT Press, 1998.

2. M.A.P. Burdelis and K. Ikeda: Temporal difference approach in linearly solvable Markov decision problems, Proc. Artificial Life and Robotics, GS12-3, 2011.

3. T. Kollar and N. Roy: Trajectory optimization using reinforcement learning for map exploration, The International Journal of Robotics Research, Vol. 27, pp. 175-196, 2008.

4. J. Buchli, F. Stulp, E. Theodorou, and S. Schaal: Learning variable impedance control, The International Journal of Robotics Research, Vol. 30, pp. 820-833, 2011.

5. J. Nie and S. Haykin: A dynamic channel assignment policy through Q-learning, IEEE Transactions on Neural Networks, Vol. 10, No. 6, pp. 1443-1455, 1999.

Cited by 2 articles.

1. Robustness of linearly solvable Markov games employing inaccurate dynamics model;Artificial Life and Robotics;2017-10-31

2. An active inference approach to on-line agent monitoring in safety–critical systems;Advanced Engineering Informatics;2015-10