Hierarchical Task-Parameterized Learning from Demonstration for Collaborative Object Movement-Reference-Cited by-同舟云学术

Hierarchical Task-Parameterized Learning from Demonstration for Collaborative Object Movement

Published:2019-12-02 Issue: Volume:2019 Page:1-25
ISSN:1176-2322
Container-title:Applied Bionics and Biomechanics
language:en
Short-container-title:Applied Bionics and Biomechanics

Author:

Hu Siyao¹^ORCID,Kuchenbecker Katherine J.¹²^ORCID

Affiliation:

1. Department of Mechanical Engineering and Applied Mechanics and GRASP Laboratory, University of Pennsylvania, Philadelphia 19104, USA

2. Haptic Intelligence Department, Max Planck Institute for Intelligent Systems, 70569 Stuttgart, Germany

Abstract

Learning from demonstration (LfD) enables a robot to emulate natural human movement instead of merely executing preprogrammed behaviors. This article presents a hierarchical LfD structure of task-parameterized models for object movement tasks, which are ubiquitous in everyday life and could benefit from robotic support. Our approach uses the task-parameterized Gaussian mixture model (TP-GMM) algorithm to encode sets of demonstrations in separate models that each correspond to a different task situation. The robot then maximizes its expected performance in a new situation by either selecting a good existing model or requesting new demonstrations. Compared to a standard implementation that encodes all demonstrations together for all test situations, the proposed approach offers four advantages. First, a simply defined distance function can be used to estimate test performance by calculating the similarity between a test situation and the existing models. Second, the proposed approach can improve generalization, e.g., better satisfying the demonstrated task constraints and speeding up task execution. Third, because the hierarchical structure encodes each demonstrated situation individually, a wider range of task situations can be modeled in the same framework without deteriorating performance. Last, adding or removing demonstrations incurs low computational load, and thus, the robot’s skill library can be built incrementally. We first instantiate the proposed approach in a simulated task to validate these advantages. We then show that the advantages transfer to real hardware for a task where naive participants collaborated with a Willow Garage PR2 robot to move a handheld object. For most tested scenarios, our hierarchical method achieved significantly better task performance and subjective ratings than both a passive model with only gravity compensation and a single TP-GMM encoding all demonstrations.

Funder

National Science Foundation

Publisher

Hindawi Limited

Subject

Biomedical Engineering,Bioengineering,Medicine (miscellaneous),Biotechnology

Link

http://downloads.hindawi.com/journals/abb/2019/9765383.pdf

Reference19 articles.

1. A survey of robot learning from demonstration

2. Hidden semi-Markov models

3. Learning Controllers for Reactive and Proactive Behaviors in Human–Robot Collaboration