1. Solving Rubik’s Cube with a robot hand;Akkaya,2019
2. Learning to walk via deep reinforcement learning;Haarnoja;Robotics: Science and Systems,2019
3. Human-level control through deep reinforcement learning
4. COMBO: Conservative offline model-based policy optimization;Yu;Advances in Neural Information Processing Systems,2021
5. Deep reinforcement learning in a handful of trials using probabilistic dynamics models;Chua;Advances in Neural Information Processing Systems,2018