Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback-Reference-Cited by-同舟云学术

Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback

Published:2023-01-16 Issue:23 Volume:35 Page:16821-16839
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Celemin Carlos^ORCID,Kober Jens

Abstract

AbstractIn order to deploy robots that could be adapted by non-expert users, interactive imitation learning (IIL) methods must be flexible regarding the interaction preferences of the teacher and avoid assumptions of perfect teachers (oracles), while considering they make mistakes influenced by diverse human factors. In this work, we propose an IIL method that improves the human–robot interaction for non-expert and imperfect teachers in two directions. First, uncertainty estimation is included to endow the agents with a lack of knowledge awareness (epistemic uncertainty) and demonstration ambiguity awareness (aleatoric uncertainty), such that the robot can request human input when it is deemed more necessary. Second, the proposed method enables the teachers to train with the flexibility of using corrective demonstrations, evaluative reinforcements, and implicit positive feedback. The experimental results show an improvement in learning convergence with respect to other learning methods when the agent learns from highly ambiguous teachers. Additionally, in a user study, it was found that the components of the proposed method improve the teaching experience and the data efficiency of the learning process.

Funder

European Research Council

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-022-08118-z.pdf

Reference48 articles.

1. Sutton RS, Barto AG (2018) Reinforcement learning: an introduction. MIT Press, Cambridge

2. Kober J, Bagnell JA, Peters J (2013) Reinforcement learning in robotics: a survey. Int J Robot Res 32(11):1238–1274

3. Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A (2017) Mastering the game of Go without human knowledge. Nature 550(7676):354–359

4. Arulkumaran K, Deisenroth MP, Brundage M, Bharath AA (2017) Deep reinforcement learning: a brief survey. IEEE Signal Process Mag 34(6):26–38

5. Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Modeling Variation in Human Feedback with User Inputs: An Exploratory Methodology;Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11

2. A Closer Look at Reward Decomposition for High-Level Robotic Explanations;2023 IEEE International Conference on Development and Learning (ICDL);2023-11-09

3. Advanced Power Converters and Learning in Diverse Robotic Innovation: A Review;Energies;2023-10-19

4. Chat with the Environment: Interactive Multimodal Perception Using Large Language Models;2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2023-10-01

5. EValueAction: a proposal for policy evaluation in simulation to support interactive imitation learning;2023 IEEE 21st International Conference on Industrial Informatics (INDIN);2023-07-18