1. Continuous control with deep reinforcement learning;lillicrap;ArXiv Preprint,2015
2. Emergence of grounded compositional language in multi-agent populations;mordatch;ArXiv Preprint,2017
3. Self-organizing maps for storage and transfer of knowledge in reinforcement learning
4. Policy gradient methods for reinforcement learning with function approximation;sutton;Advances in Neural Information Processing Systems 12,2000
5. Proximal policy optimization algorithms;schulman;CoRR,2017