1. DQN-TAMER: Human-in-the-loop reinforcement learning with intractable feedback;Arakawa,2018
2. Multi-robot path planning method using reinforcement learning;Bae;Applied Sciences,2019
3. Learning polite behavior with situation models;Barraquand,2008
4. Assessment and learning: contradictory or complementary?;Boud,1995
5. Reinforcement learning from demonstration through shaping;Brys,2015