1. Baram, N., Anschel, O., Mannor, S.: Model-based Adversarial Imitation Learning (2016). http://arxiv.org/abs/1612.02179
2. Dulac-Arnold, G., et al.: Deep Reinforcement Learning in Large Discrete Action Spaces (2016). http://arxiv.org/abs/1512.07679
3. Hausknecht, M., Stone, P.: Deep Recurrent Q-Learning for Partially Observable MDPs (2017). http://arxiv.org/abs/1507.06527
4. Brockman, G., et al.: OpenAI Gym (2016). http://arxiv.org/abs/1606.01540
5. Denil, M., Agrawal, P., Kulkarni, T.D., Erez, T., Battaglia, P., de Freitas, N.: Learning to Perform Physics Experiments via Deep Reinforcement Learning (2017). http://arxiv.org/abs/1611.01843