Batch Reinforcement Learning

Published: 2012 Pages: 45-73
ISSN: 1867-4534
Container-title: Adaptation, Learning, and Optimization

Author:

Lange Sascha, Gabel Thomas, Riedmiller Martin

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-27645-3_2

References: 43 articles.

1. Antos, A., Munos, R., Szepesvari, C.: Fitted Q-iteration in continuous action-space MDPs. In: Advances in Neural Information Processing Systems, vol. 20, pp. 9–16 (2008)

2. Baird, L.: Residual algorithms: Reinforcement learning with function approximation. In: Proc. of the Twelfth International Conference on Machine Learning, pp. 30–37 (1995)

3. Bernstein, D., Givan, R., Immerman, N., Zilberstein, S.: The Complexity of Decentralized Control of Markov Decision Processes. Mathematics of Operations Research 27(4), 819–840 (2002)

4. Bertsekas, D., Tsitsiklis, J.: Neuro-dynamic programming. Athena Scientific, Belmont (1996)

5. Bonarini, A., Caccia, C., Lazaric, A., Restelli, M.: Batch reinforcement learning for controlling a mobile wheeled pendulum robot. In: IFIP AI, pp. 151–160 (2008)

Cited by 94 articles.

1. Dual Behavior Regularized Offline Deterministic Actor–Critic;IEEE Transactions on Systems, Man, and Cybernetics: Systems;2024-08

2. Leveraging machine learning for efficient EV integration as mobile battery energy storage systems: Exploring strategic frameworks and incentives;Journal of Energy Storage;2024-07

3. Conservative In-Distribution Q-Learning for Offline Reinforcement Learning;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. Data-Driven Reinforcement Learning for Optimal Motor Control in Washing Machines;2024 IEEE Conference on Artificial Intelligence (CAI);2024-06-25

5. Offline Reinforcement Learning With Behavior Value Regularization;IEEE Transactions on Cybernetics;2024-06