Model-free reinforcement learning for robust locomotion using demonstrations from trajectory optimization-Reference-Cited by-同舟云学术

Model-free reinforcement learning for robust locomotion using demonstrations from trajectory optimization

Published:2022-08-31 Issue: Volume:9 Page:
ISSN:2296-9144
Container-title:Frontiers in Robotics and AI
language:
Short-container-title:Front. Robot. AI

Author:

Bogdanovic Miroslav,Khadiv Majid,Righetti Ludovic

Abstract

We present a general, two-stage reinforcement learning approach to create robust policies that can be deployed on real robots without any additional training using a single demonstration generated by trajectory optimization. The demonstration is used in the first stage as a starting point to facilitate initial exploration. In the second stage, the relevant task reward is optimized directly and a policy robust to environment uncertainties is computed. We demonstrate and examine in detail the performance and robustness of our approach on highly dynamic hopping and bounding tasks on a quadruped robot.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Computer Science Applications

Reference33 articles.

1. Learning variable impedance control for contact sensitive tasks;Bogdanovic;IEEE Robot. Autom. Lett.,2020

2. Multicontact locomotion of legged robots;Carpentier;IEEE Trans. Robot.,2018

3. Pybullet, a python module for physics simulation for games, robotics and machine learning CoumansE. BaiY.

4. Robust trajectory optimization over uncertain terrain with stochastic complementarity;Drnach;IEEE Robot. Autom. Lett.,2021

5. Reinforcement learning of single legged locomotion;Fankhauser,2013

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient Reinforcement Learning for 3D Jumping Monopods;Sensors;2024-08-01

2. Two-Stage Learning of Highly Dynamic Motions with Rigid and Articulated Soft Quadrupeds;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

3. DTC: Deep Tracking Control;Science Robotics;2024-01-17

4. Imitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion;IEEE Robotics and Automation Letters;2023-11

5. Adaptive Locomotion Learning for Quadruped Robots by Combining DRL with a Cosine Oscillator Based Rhythm Controller;Applied Sciences;2023-10-07