1. Apprenticeship learning via inverse reinforcement learning
2. Brenna D Argall, Sonia Chernova, Manuela Veloso, and Brett Browning. 2009. A survey of robot learning from demonstration. Robotics and autonomous systems, Vol. 57, 5 (2009), 469--483.
3. Andrea Bajcsy, Dylan P Losey, Marcia K O'Malley, and Anca D Dragan. 2017. Learning robot objectives from physical human interaction. In Conference on Robot Learning. PMLR, 217--226.
4. Richard Bellman. 1957. A Markovian decision process. Journal of mathematics and mechanics (1957).
5. Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences