1. A Markovian Decision Process
2. Imitation learning via off-policy distribution matching;Kostrikov;International Conference on Learning Representations, 2019
3. InfoGAIL: Interpretable imitation learning from visual demonstrations;Li;Advances in Neural Information Processing Systems, 2017
4. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor;Haarnoja;Proceedings of the 35th International Conference on Machine Learning, 2018
5. Stand-alone self-attention in vision models;Ramachandran;arXiv preprint, 2019