1. Deep reinforcement learning: A state-of-the-art walkthrough;Lazaridis;Journal of Artificial Intelligence Research,2020
2. Pilco: A model-based and data-efficient approach to policy search;Deisenroth,2011
3. Reinforcement learning: An introduction;Sutton,2018
4. To recognize shapes, first learn to generate images;Hinton;Progress in Brain Research,2007
5. Weighted importance sampling for off-policy learning with linear function approximation;Mahmood;Advances in Neural Information Processing Systems,2014