1. Apprenticeship learning via inverse reinforcement learning
2. Adrià Puigdomènech Badia , Bilal Piot , Steven Kapturowski , Pablo Sprechmann , Alex Vitvitskyi , Zhaohan Daniel Guo , and Charles Blundell . 2020 . Agent57: Outperforming the atari human benchmark . In International Conference on Machine Learning. PMLR, 507–517 . Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, and Charles Blundell. 2020. Agent57: Outperforming the atari human benchmark. In International Conference on Machine Learning. PMLR, 507–517.
3. Lukas Biewald. 2020. Experiment Tracking with Weights and Biases. https://www.wandb.com/ Software available from wandb.com. Lukas Biewald. 2020. Experiment Tracking with Weights and Biases. https://www.wandb.com/ Software available from wandb.com.
4. Mariusz Bojarski Davide Del Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016). Mariusz Bojarski Davide Del Testa Daniel Dworakowski Bernhard Firner Beat Flepp Prasoon Goyal Lawrence D Jackel Mathew Monfort Urs Muller Jiakai Zhang 2016. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316(2016).
5. On the utility of learning about humans for human-ai coordination;Carroll Micah;Advances in Neural Information Processing Systems,2019