1. 40. G. Brockman , V. Cheung , L. Pettersson , J. Schneider , J. Schulman , J. Tang and W. Zaremba , “Openai gym,” arXiv preprint arXiv:1606.01540 (2016).
2. Language Understanding for Text-based Games using Deep Reinforcement Learning
3. 46. M. Abadi , A. Agarwal , P. Barham , E. Brevdo , Z. Chen , C. Citro , G. S. Corrado , A. Davis , J. Dean , M. Devin , et al., “Tensorflow: Large-scale machine learning on heterogeneous distributed systems,” arXiv preprint arXiv:1603.04467 (2016).
4. 36. T. Schaul , J. Quan , I. Antonoglou and D. Silver , “Prioritized experience replay,” arXiv preprint arXiv:1511.05952 (2015).
5. Outdoor autonomous landing on a moving platform for quadrotors using an omnidirectional camera