1. Deep learning, reinforcement learning, and world models
2. Sutton RS, Barto AG. Reinforcement learning: an introduction. Cambridge (MA): MIT Press; 2018.
3. Lillicrap TP, Hunt JJ, Pritzel A, et al. Continuous control with deep reinforcement learning. Preprint arXiv:1509.02971. 2015.
4. Fujimoto S, Hoof H, Meger D. Addressing function approximation error in actor-critic methods. International Conference on Machine Learning; 2018; p. 1582–1591.
5. Schulman J, Wolski F, Dhariwal P, et al. Proximal policy optimization algorithms. Preprint arXiv:1707.06347. 2017.