Active deep Q-learning with demonstration

Published: 2019-11-08 Issue: 9-10 Volume: 109 Page: 1699-1725
ISSN: 0885-6125
Container-title: Machine Learning
Language: en
Short-container-title: Mach Learn

Author:

Chen Si-An, Tangkaratt Voot, Lin Hsuan-Tien, Sugiyama Masashi

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-019-05849-4.pdf

References: 35 articles.

1. Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, 13(5), 834–846.

2. Brockman, G., Cheung, V., Pettersson, L., Schneider, J., Schulman, J., Tang, J., & Zaremba, W. (2016). OpenAI Gym. CoRR abs/1606.01540. arXiv:1606.01540.

3. Brys, T., Harutyunyan, A., Suay, H. B., Chernova, S., Taylor, M. E., & Nowé, A. (2015). Reinforcement learning from demonstration through shaping. In IJCAI, AAAI Press, pp. 3352–3358.

4. Dagan, I., & Engelson, S. P. (1995). Committee-based sampling for training probabilistic classifiers. In Machine learning, proceedings of the twelfth international conference on machine learning, Tahoe City, California, USA, July 9–12, 1995, pp. 150–157, https://doi.org/10.1016/b978-1-55860-377-6.50027-x.

5. Fortunato, M., Azar, M. G., Piot, B., Menick, J., Osband, I., Graves, A., Mnih, V., Munos, R., Hassabis, D., Pietquin, O., Blundell, C., & Legg, S. (2017). Noisy networks for exploration. CoRR abs/1706.10295. arXiv:1706.10295.

Cited by 18 articles.

1. Local instance-based transfer learning for reinforcement learning;Engineering Applications of Artificial Intelligence;2024-07

2. Improved senescent cell segmentation on bright‐field microscopy images exploiting representation level contrastive learning;International Journal of Imaging Systems and Technology;2024-03

3. A double Actor-Critic learning system embedding improved Monte Carlo tree search;Neural Computing and Applications;2024-02-23

4. Learning and Repair of Deep Reinforcement Learning Policies from Fuzz-Testing Data;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-02-06

5. An introduction to artificial intelligence in machine vision for postharvest detection of disorders in horticultural products;Postharvest Biology and Technology;2023-12