Technical Note-Reference-Cited by-同舟云学术

Technical Note

Published:1992 Issue: Volume: Page:55-68
ISSN:
Container-title:Reinforcement Learning
language:
Short-container-title:

Author:

Watkins Christopher J. C. H.,Dayan Peter

Publisher

Springer US

Link

http://link.springer.com/content/pdf/10.1007/978-1-4615-3618-5_4.pdf

Reference14 articles.

1. Barto, A.G., Bradtke, S.J. & Singh, S.P. (1991). Real-time learning and control using asynchronous dynamic programming. (COINS technical report 91–57). Amherst: University of Massachusetts.

2. Barto, A.G. & Singh, S.P. (1990). On the computational economics of reinforcement learning. In D.S. Touretzky, J. Elman, T.J. Sejnowski & G.E. Hinton, (Eds.), Proceedings of the 1990 Connectionist Models Summer School. San Mateo, CA: Morgan Kaufmann.

3. Bellman, R.E. & Dreyfus, S.E. (1962). Applied dynamic programming. RAND Corporation.

4. Proceedings of the 1991 International Joint Conference on Artificial Intelligence;D Chapman,1991

5. Kushner, H. & Clark, D. (1978). Stochastic approximation methods for constrained and unconstrained systems. Berlin, Germany: Springer-Verlag.

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Proposal of a Course-Classification Support System Using Deep Learning and its Evaluation When Combined with Reinforcement Learning;Journal of Advanced Computational Intelligence and Intelligent Informatics;2024-03-20

2. DQN based Anti-blocking Routing Algorithm for IRS-assisted MANET;2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall);2023-10-10

3. Dynamic graph combinatorial optimization with multi-attention deep reinforcement learning;Proceedings of the 30th International Conference on Advances in Geographic Information Systems;2022-11

4. Solving Reward-Collecting Problems with UAVs: A Comparison of Online Optimization and Q-Learning;Journal of Intelligent & Robotic Systems;2022-02

5. Proposal and Evaluation of Deep Profit Sharing Method in a Mixed Reward and Penalty Environment;Studies in Computational Intelligence;2022