1. Improving language understanding by generative pre-training;Radford,2018
2. P. Henderson, R. Islam, P. Bachman, J. Pineau, D. Precup, D. Meger, Deep reinforcement learning that matters, in: Proceedings of the AAAI Conference on Artificial Intelligence, no. 1, 2018.
3. Simple random search of static linear policies is competitive for reinforcement learning;Mania;Adv. Neural Inf. Process. Syst.,2018
4. Trust region policy optimization;Schulman,2015
5. Proximal policy optimization algorithms;Schulman,2017