Restricted gradient-descent algorithm for value-function approximation in reinforcement learning-Reference-Cited by-同舟云学术

Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Published:2008-03 Issue:4-5 Volume:172 Page:454-482
ISSN:0004-3702
Container-title:Artificial Intelligence
language:en
Short-container-title:Artificial Intelligence

Author:

da Motta Salles Barreto André,Anderson Charles W.

Publisher

Elsevier BV

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics

Reference91 articles.

1. C.W. Anderson, Learning and problem solving with multilayer connectionist systems, PhD thesis, Computer and Information Science, University of Massachusetts, 1986

2. Learning to control an inverted pendulum using neural networks;Anderson;IEEE Control Systems Magazine,1989

3. C.W. Anderson, Q-learning with hidden-unit restarting, in: Advances in Neural Information Processing Systems, 1993, pp. 81–88

4. L.C. Baird, Residual algorithms: Reinforcement learning with function approximation, in: International Conference on Machine Learning, 1995, pp. 30–37

5. Monte Carlo matrix inversion and reinforcement learning;Barto,1994

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-AGV Dynamic Scheduling in an Automated Container Terminal: A Deep Reinforcement Learning Approach;Mathematics;2022-12-02

2. Differential radial basis function network for sequence modelling;Expert Systems with Applications;2022-03

3. Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation;Sensors;2022-02-11

4. AKF-SR: Adaptive Kalman filtering-based successor representation;Neurocomputing;2022-01

5. Efficient Batch-Mode Reinforcement Learning Using Extreme Learning Machines;IEEE Transactions on Systems, Man, and Cybernetics: Systems;2021-06