Reinforcement learning with replacing eligibility traces-Reference-Cited by-同舟云学术

Reinforcement learning with replacing eligibility traces

Published:1996 Issue:1-3 Volume:22 Page:123-158
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Singh Satinder P.,Sutton Richard S.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/BF00114726.pdf

Reference34 articles.

1. Albus, J. S., (1981).Brain, Behavior, and Robotics, chapter 6, pages 139?179, Byte Books.

2. Baase, S., (1988).Computer Algorithms: Introduction to design and analysis. Reading, MA: Addison-Wesley.

3. Barnard, E., (1993). Temporal-difference methods and Markov models.IEEE Transactions on Systems, Man. and Cybernetics,23(2), 357?365.

4. Barto, A. G. & Duff, M., (1994). Monte Carlo matrix inversion and reinforcement learning. InAdvances in Neural Information Processing Systems 6, pages 687?694, San Mateo, CA, Morgan Kaufrnann.

5. Barto, A. G., Sutton, R. S., & Anderson, C. W., (1983). Neuronlike elements that can solve difficult learning control problems.IEEE Trans. on Systems, Man, and Cybernetics,13, 835?846.

Cited by 248 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey on load frequency control using reinforcement learning-based data-driven controller;Applied Soft Computing;2024-11

2. Eligibility traces in an autonomous soccer robot with obstacle avoidance and navigation policy;Applied Soft Computing;2024-10

3. Cognitive graphs: Representational substrates for planning.;Decision;2024-08-29

4. Controlling optical-cavity locking using reinforcement learning;Machine Learning: Science and Technology;2024-07-24

5. Reinforcement Learning-Based Adaptive Stateless Routing for Ambient Backscatter Wireless Sensor Networks;IEEE Transactions on Communications;2024-07