Inverse KKT: Learning cost functions of manipulation tasks from demonstrations-Reference-Cited by-同舟云学术

Inverse KKT: Learning cost functions of manipulation tasks from demonstrations

Published:2017-12 Issue:13-14 Volume:36 Page:1474-1488
ISSN:0278-3649
Container-title:The International Journal of Robotics Research
language:en
Short-container-title:The International Journal of Robotics Research

Author:

Englert Peter¹,Vien Ngo Anh²,Toussaint Marc¹

Affiliation:

1. Machine Learning & Robotics Lab, Universität Stuttgart, Germany

2. School of EEECS, Queen’s University Belfast, UK

Abstract

Inverse optimal control (IOC) assumes that demonstrations are the solution to an optimal control problem with unknown underlying costs, and extracts parameters of these underlying costs. We propose the framework of inverse Karush–Kuhn–Tucker (KKT), which assumes that the demonstrations fulfill the KKT conditions of an unknown underlying constrained optimization problem, and extracts parameters of this underlying problem. Using this we can exploit the latter to extract the relevant task spaces and parameters of a cost function for skills that involve contacts. For a typical linear parameterization of cost functions this reduces to a quadratic program, ensuring guaranteed and very efficient convergence, but we can deal also with arbitrary non-linear parameterizations of cost functions. We also present a non-parametric variant of inverse KKT that represents the cost function as a functional in reproducing kernel Hilbert spaces. The aim of our approach is to push learning from demonstration to more complex manipulation scenarios that include the interaction with objects and therefore the realization of contacts/constraints within the motion. We demonstrate the approach on manipulation tasks such as sliding a box, closing a drawer and opening a door.

Publisher

SAGE Publications

Subject

Applied Mathematics,Artificial Intelligence,Electrical and Electronic Engineering,Mechanical Engineering,Modeling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/0278364917745980

Reference40 articles.

1. Autonomous Helicopter Aerobatics through Apprenticeship Learning

2. Apprenticeship learning via inverse reinforcement learning

3. Imitating human reaching motions using physically inspired optimization principles

4. A survey of robot learning from demonstration

Cited by 53 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control;IEEE Transactions on Cybernetics;2024-09

2. Learning human actions from complex manipulation tasks and their transfer to robots in the circular factory;at - Automatisierungstechnik;2024-09-01

3. Modelling flight trajectories with multi-modal generative adversarial imitation learning;Applied Intelligence;2024-06

4. Inverse Optimal Control with System Output;2024 IEEE 7th International Conference on Industrial Cyber-Physical Systems (ICPS);2024-05-12

5. GREEN PATH: an expert system for space planning and design by the generation of human trajectories;Multimedia Tools and Applications;2024-02-15