Learning variable impedance control-Reference-Cited by-同舟云学术

Learning variable impedance control

Published:2011-04-01 Issue:7 Volume:30 Page:820-833
ISSN:0278-3649
Container-title:The International Journal of Robotics Research
language:en
Short-container-title:The International Journal of Robotics Research

Author:

Buchli Jonas¹,Stulp Freek²,Theodorou Evangelos²,Schaal Stefan²

Affiliation:

1. Computational Learning and Motor Control Lab, University of Southern California, Los Angeles, USA, Department of Advanced Robotics, Italian Institute of Technology, Genova, Italy

2. Computational Learning and Motor Control Lab, University of Southern California, Los Angeles, USA

Abstract

One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not trivial to derive variable impedance controllers for practical high degree-of-freedom (DOF) robotic tasks. In this contribution, we accomplish such variable impedance control with the reinforcement learning (RL) algorithm PI2 (P olicyI mprovement withP ath I ntegrals). PI2 is a model-free, sampling-based learning method derived from first principles of stochastic optimal control. The PI2 algorithm requires no tuning of algorithmic parameters besides the exploration noise. The designer can thus fully focus on the cost function design to specify the task. From the viewpoint of robotics, a particular useful property of PI2 is that it can scale to problems of many DOFs, so that reinforcement learning on real robotic systems becomes feasible. We sketch the PI2 algorithm and its theoretical properties, and how it is applied to gain scheduling for variable impedance control. We evaluate our approach by presenting results on several simulated and real robots. We consider tasks involving accurate tracking through via points, and manipulation tasks requiring physical contact with the environment. In these tasks, the optimal strategy requires both tuning of a reference trajectory and the impedance of the end-effector. The results show that we can use path integral based reinforcement learning not only for planning but also to derive variable gain feedback controllers in realistic scenarios. Thus, the power of variable impedance control is made available to a wide variety of robotic systems and practical applications.

Publisher

SAGE Publications

Subject

Applied Mathematics,Artificial Intelligence,Electrical and Electronic Engineering,Mechanical Engineering,Modeling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/0278364911402527

Reference42 articles.

1. Stability and motor adaptation in human arm movements

2. Monte Carlo and quasi-Monte Carlo methods

Cited by 201 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Adaptive Control Method and Learning Strategy for Ultrasound-Guided Puncture Robot;Electronics;2024-01-31

2. Cartesian Stiffness Shaping of Compliant Robots—Incremental Learning and Optimization Based on Sequential Quadratic Programming;Actuators;2024-01-13

3. FORCE BASED IMPEDANCE CONTROL OF 5-BAR PARALLEL ROBOT MANIPULATOR;Mühendislik Bilimleri ve Tasarım Dergisi;2023-12-30

4. Robotic abrasive belt grinding of complex curved blades based on a novel force control architecture integrating smooth trajectories;Journal of Manufacturing Processes;2023-12

5. Model-based variable impedance learning control for robotic manipulation;Robotics and Autonomous Systems;2023-12