1. Learning robust rewards with adversarial inverse reinforcement learning;fu;ICLR,2018
2. Domain Randomization and Generative Models for Robotic Grasping
3. Brax - a differentiable physics engine for large scale rigid body simulation;freeman;NeurIPS Datasets and Benchmarks,2021
4. Proximal policy optimization algorithms;schulman;ArXiv Preprint,2017
5. Mastering Atari with discrete world models;hafner;ICLR,2021