1. Learning to understand goal specifications by modelling reward;Bahdanau,2019
2. Ask your humans: Using human instructions to improve generalization in reinforcement learning;Chen,2021
3. Implicit quantile networks for distributional reinforcement learning;Dabney,2018
4. Coordinated behavior of cooperative agents using deep reinforcement learning;Diallo;Neurocom-puting,2020
5. One-shot imitation learning;Duan,2017