1. Kareem Amin, Nan Jiang, and Satinder Singh. 2017. Repeated inverse reinforcement learning. Advances in Neural Information Processing Systems, Vol. 30 (2017).
2. Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, and Dan Mané. 2016. Concrete problems in AI safety. arXiv preprint arXiv:1606.06565 (2016).
3. Learning from Richer Human Guidance
4. Erik Båvenstrand and Jakob Berggren. 2019. Performance evaluation of imitation learning algorithms with human experts.
5. Kush Bhatia, Ashwin Pananjady, Peter Bartlett, Anca Dragan, and Martin J. Wainwright. 2020. Preference learning along multiple criteria: A game-theoretic perspective. Advances in Neural Information Processing Systems, Vol. 33 (2020), 7413–7424.