Probabilistic Policy Reuse for inter-task transfer learning-Reference-Cited by-同舟云学术

Probabilistic Policy Reuse for inter-task transfer learning

Published:2010-07 Issue:7 Volume:58 Page:866-871
ISSN:0921-8890
Container-title:Robotics and Autonomous Systems
language:en
Short-container-title:Robotics and Autonomous Systems

Author:

Fernández Fernando,García Javier,Veloso Manuela

Publisher

Elsevier BV

Subject

Computer Science Applications,General Mathematics,Software,Control and Systems Engineering

Reference19 articles.

1. Reinforcement learning: a survey;Kaelbling;Journal of Artificial Intelligence Research,1996

2. C. Watkins, Learning from delayed rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.

3. Practical issues in temporal difference learning;Tesauro;Machine Learning,1992

4. P. Stone, R.S. Sutton, G. Kuhlmann, Reinforcement learning for RoboCup-soccer Keepaway, Adaptive Behavior 13 (3).

5. M.E. Taylor, P. Stone, Y. Liu, Value functions for RL-based behavior transfer: a comparative study, in: Proceedings of the Twentieth National Conference on Artificial Intelligence, 2005.

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Context-aware composition of agent policies by Markov decision process entity embeddings and agent ensembles;Semantic Web;2024-01-09

2. Efficient Bayesian Policy Reuse With a Scalable Observation Model in Deep Reinforcement Learning;IEEE Transactions on Neural Networks and Learning Systems;2023

3. Robust Optimal Well Control using an Adaptive Multigrid Reinforcement Learning Framework;Mathematical Geosciences;2022-11-04

4. A taxonomy for similarity metrics between Markov decision processes;Machine Learning;2022-10-14

5. Transfer und Reinforcement Learning in der Produktionssteuerung;Zeitschrift für wirtschaftlichen Fabrikbetrieb;2022-09-01