Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Published: 2009-07-17 Issue: 1 Volume: 21 Pages: 1-35
ISSN: 1387-2532
Container-title: Autonomous Agents and Multi-Agent Systems
Language: en
Short-container-title: Auton Agent Multi-Agent Syst

Author:

Whiteson Shimon, Taylor Matthew E., Stone Peter

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

http://link.springer.com/content/pdf/10.1007/s10458-009-9100-2.pdf

References: 82 articles.

1. Albus, J. S. (1981). Brains, behavior, and robotics. Peterborough, NH: Byte Books.

2. Anderson, C. W. (1986). Learning and problem solving with multilayer connectionist systems. Ph.D. thesis, University of Massachusetts, Amherst, MA.

3. Baird, L., & Moore, A. (1999). Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems (Vol. 11). Cambridge, MA: MIT Press.

4. Bakker, B. (2002). Reinforcement learning with long short-term memory. In Advances in Neural Information Processing Systems (Vol. 14, pp. 1475–1482).

5. Barto, A., & Duff, M. (1994). Monte Carlo matrix inversion and reinforcement learning. In Advances in Neural Information Processing Systems (Vol. 6, pp. 687–694).

Cited by 16 articles.

1. Evolutionary Reinforcement Learning: A Survey;Intelligent Computing;2023-01

2. Steering approaches to Pareto-optimal multiobjective reinforcement learning;Neurocomputing;2017-11

3. Neuroevolution in Games: State of the Art and Open Challenges;IEEE Transactions on Computational Intelligence and AI in Games;2017-03

4. Coevolutionary CMA-ES for Knowledge-Free Learning of Game Position Evaluation;IEEE Transactions on Computational Intelligence and AI in Games;2016-12

5. Discovering Rubik's Cube Subgroups using Coevolutionary GP;Proceedings of the Genetic and Evolutionary Computation Conference 2016;2016-07-20