1. The Option-Critic Architecture
2. Generative adversarial nets;goodfellow;Proc Adv Neural Inf Process Syst,2014
3. Near-optimal representation learning for hierarchical reinforcement learning;nachum;arXiv:1810.01257,2018
4. Efficient off-policy meta-reinforcement learning via probabilistic context variables;rakelly;Proc Int Conf Mach Learn,2019
5. Diversity is all you need: Learning skills without a reward function;eysenbach;arXiv:1802.06070,2018