1. Human-level control through deep reinforcement learning;Mnih;Nature,2015
2. Mastering the game of Go with deep neural networks and tree search;Silver;Nature,2016
3. End-to-end training of deep visuomotor policies;Levine;J. Mach. Learn. Res.,2016
4. The arcade learning environment: an evaluation platform for general agents;Bellemare;J. Artif. Intell. Res.,2013
5. Y. Tassa, Y. Doron, A. Muldal, T. Erez, Y. Li, D. de L. Casas, D. Budden, A. Abdolmaleki, J. Merel, A. Lefrancq, T. Lillicrap, M. Riedmiller, Deepmind control suite, ArXiv Preprint ArXiv:1801.00690 (2018).