Affiliation:
1. Cognitive Robotics, Delft University of Technology, 2628 CD Delft, The Netherlands
2. Honda Research Institute Europe, 63073 Offenbach am Main, Germany
Abstract
Humans often demonstrate diverse behaviors due to their personal preferences, for instance, related to their individual execution style or personal margin for safety. In this paper, we consider the problem of integrating both path and velocity preferences into trajectory planning for robotic manipulators. We first learn reward functions that represent the user path and velocity preferences from kinesthetic demonstration. We then optimize the trajectory in two steps, first the path and then the velocity, to produce trajectories that adhere to both task requirements and user preferences. We design a set of parameterized features that capture the fundamental preferences in a pick-and-place type of object transportation task, both in the shape and timing of the motion. We demonstrate that our method is capable of generalizing such preferences to new scenarios. We implement our algorithm on a Franka Emika 7-DoF robot arm and validate the functionality and flexibility of our approach in a user study. The results show that non-expert users are able to teach the robot their preferences with just a few iterations of feedback.
Funder
European Research Council
European Space Agency
Honda Research Institute Europe
Subject
Artificial Intelligence,Control and Optimization,Mechanical Engineering
Reference32 articles.
1. Learning preferences for manipulation tasks from online coactive feedback;Jain;Int. J. Robot. Res.,2015
2. Duchaine, V., and Gosselin, C.M. (2007, January 22–24). General model of human–robot cooperation using a novel velocity based variable impedance control. Proceedings of the Second Joint EuroHaptics Conference and Symp. on Haptic Interfaces for Virtual Environment and Teleoperator Systems, Tsukuba, Japan.
3. Teaching robots to cooperate with humans in dynamic manipulation tasks based on multi-modal human-in-the-loop approach;Peternel;Auton. Robot.,2014
4. Bajcsy, A., Losey, D.P., O’Malley, M.K., and Dragan, A.D. (2017, January 13–15). Learning robot objectives from physical human interaction. Proceedings of the Conference on Robot Learning, Mountain View, CA, USA.
5. Learning the correct robot trajectory in real-time from physical human interactions;Losey;ACM Trans. Hum.-Robot Interact.,2019
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献