1. Diversity is all you need: Learning skills without a reward function;Eysenbach;Proc. of International Conference on Learning Representations,2019
2. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor;Haarnoja;Proc. of International Conference on Machine Learning,2018
3. Variational intrinsic control;Gregor;Proc. of International Conference on Learning Representations,2017
4. Categorical reparameterization with Gumbel-Softmax;Jang;Proc. of International Conference on Learning Representations,2017
5. ProMP: Proximal meta-policy search;Rothfuss;Proc. of International Conference on Learning Representations,2019