Characterizing reinforcement learning methods through parameterized learning problems-Reference-Cited by-同舟云学术

Characterizing reinforcement learning methods through parameterized learning problems

Published:2011-06-03 Issue:1-2 Volume:84 Page:205-247
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Kalyanakrishnan Shivaram,Stone Peter

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-011-5251-x.pdf

Reference128 articles.

1. Albus, J. S. (1981). Brains, behavior and robotics. New York: McGraw-Hill.

2. Åström, K. J. (1965). Optimal control of Markov processes with incomplete state information. Journal of Mathematical Analysis and Applications, 10, 174–205.

3. Baird, L., & Moore, A. (1999). Gradient descent for general reinforcement learning. In M. J. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Advances in neural information processing systems 11 (NIPS 1998) (pp. 968–974). Cambridge: MIT Press.

4. Bakker, B., Zhumatiy, V., Gruener, G., & Schmidhuber, J. (2003). A robot that reinforcement-learns to identify and memorize important previous observations. In Proceedings of the 2003 IEEE/RSJ international conference on intelligent robots and systems (IROS 2003) (pp. 430–435). New York: IEEE Press.

5. Banko, M., & Brill, E. (2001). Scaling to very very large corpora for natural language disambiguation. In Proceedings of 39th annual meeting of the association for computational linguistics (ACL 2001) (pp. 26–33). Association for Computational Linguistics.

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Parameterized deep reinforcement learning-enabled maintenance decision-support and life-cycle risk assessment for highway bridge portfolios;Structural Safety;2022-07

2. An Architecture for Bidirectional Learning Games;International Journal of Game-Based Learning;2022-01

3. Efficient Batch-Mode Reinforcement Learning Using Extreme Learning Machines;IEEE Transactions on Systems, Man, and Cybernetics: Systems;2021-06

4. Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules;Frontiers in Neurorobotics;2018-09-27

5. Residual Sarsa algorithm with function approximation;Cluster Computing;2017-11-10