Residual Algorithms: Reinforcement Learning with Function Approximation-Reference-Cited by-同舟云学术

Residual Algorithms: Reinforcement Learning with Function Approximation

Published:1995 Issue: Volume: Page:30-37
ISSN:
Container-title:Machine Learning Proceedings 1995
language:
Short-container-title:

Author:

Baird Leemon

Publisher

Elsevier

Reference11 articles.

1. Baird, L. C. (1995). Advantage Learning. To be published as a U.S. Air Force technical report by the Department of Computer Science, U.S. Air Force Academy.

2. Dynamic Programming: Deterministic and Stochastic Models;Bertsekas,1987

3. Bradtke, S. J (1993). Reinforcement learning applied to linear quadratic regulation. Proceedings of the Fifth Conference on Neural Information Processing Systems (pp. 295–302). Morgan Kaufmann.

4. Learning representations by back-propagating errors;Rumelhart;Nature,1986

5. Learning to predict by the methods of temporal differences;Sutton;Machine Learning,1988

Cited by 187 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Decentralized Adaptive temporal-difference learning over time-varying networks and its finite-time analysis;Neurocomputing;2024-11

2. High-Probability Sample Complexities for Policy Evaluation With Linear Function Approximation;IEEE Transactions on Information Theory;2024-08

3. A parallelized environmental-sensing and multi-tasks model for intelligent marine structure control in ocean waves coupling deep reinforcement learning and computational fluid dynamics;Physics of Fluids;2024-08-01

4. Multi-agent Gradient-Based Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning;International Journal of Computational Intelligence Systems;2024-06-24

5. The impact of data distribution on Q-learning with function approximation;Machine Learning;2024-06-07