1. Convergent actor critic by humans;macglashan;International Conference on Intelligent Robots and Systems,2016
2. Active attention-modified policy shaping: Socially interactive agents track;faulkner;Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems ser AAMAS ‘19,2019
3. Reward-rational (implicit) choice: A unifying formalism for reward learning;jeon;Advances in Neural Information Processing Systems 33 Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020,2020