Neural Fitted Q Iteration – First Experiences with a Data Efficient Neural Reinforcement Learning Method

Published:2005 Page:317-328
ISSN:0302-9743
Container-title:Machine Learning: ECML 2005

Author:

Riedmiller, Martin

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/11564096_32.pdf

References: 9 articles.

1. Boyan, J., Moore, A.: Generalization in reinforcement learning: Safely approximating the value function. In: Advances in Neural Information Processing Systems 7. Morgan Kaufmann, San Francisco (1995)

2. Ernst, D., Wehenkel, L., Geurts, P.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503–556 (2005)

3. Gordon, G.J.: Stable function approximation in dynamic programming. In: Prieditis, A., Russell, S. (eds.) Proceedings of the ICML, San Francisco, CA (1995)

4. Lin, L.-J.: Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning 8, 293–321 (1992)

5. Lagoudakis, M., Parr, R.: Least-squares policy iteration. Journal of Machine Learning Research 4, 1107–1149 (2003)

Cited by 398 articles.

1. Reinforcement learning for electric vehicle charging scheduling: A systematic review;Transportation Research Part E: Logistics and Transportation Review;2024-10

2. Learning Adaptive Control of a UUV Using a Bio-Inspired Experience Replay Mechanism;International Journal of Advanced Research in Science, Communication and Technology;2024-09-05

3. M³Rec: A Context-Aware Offline Meta-Level Model-Based Reinforcement Learning Approach for Cold-Start Recommendation;ACM Transactions on Information Systems;2024-08-19

4. Verifying the Generalization of Deep Learning to Out-of-Distribution Domains;Journal of Automated Reasoning;2024-08-03

5. Multi-objective cooperative transportation for reconfigurable robot using isomorphic mapping multi-agent reinforcement learning;Mechatronics;2024-08