1. Meta learning shared hierarchies;frans;Proc Int Conf Learn Representations,0
2. Actor-attention-critic for multi-agent reinforcement learning;iqbal;Proc Int Conf Mach Learn,0
3. Zero-shot task generalization with multi-task deep reinforcement learning;oh;Proc 34th Int Conf Mach Learn,0
4. Using reward machines for high-level task specification and decomposition in reinforcement learning;icarte;Proc Int Conf Mach Learn,0
5. The option-critic architecture;bacon;Proc 31st AAAI Conf Artif Intell,0