Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics-Reference-Cited by-同舟云学术

Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics

Published:2022-10-07 Issue:10 Volume:13 Page:1688
ISSN:2072-666X
Container-title:Micromachines
language:en
Short-container-title:Micromachines

Author:

Wang Zhicheng^ORCID,Wei Wandi^ORCID,Xie Anhuan^ORCID,Zhang Yifeng^ORCID,Wu Jun^ORCID,Zhu Qiuguo^ORCID

Abstract

Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through either model-based controller or reinforcement learning. It is proven that robust controllers can be obtained through model-based methods and learning-based policies have advantages in generalization. This paper proposed a hybrid framework of locomotion controller that combines deep reinforcement learning and simple heuristic policy and assigns them to different activation phases, which provides guidance for adaptive training without producing conflicts between heuristic knowledge and learned policies. The training in simulation follows a step-by-step stochastic curriculum to guarantee success. Domain randomization during training and assistive extra feedback loops on real robot are also adopted to smooth the transition to the real world. Comparison experiments are carried out on both simulated and real Wukong-IV humanoid robots, and the proposed hybrid approach matches the canonical end-to-end approaches with higher rate of success, faster converging speed, and 60% less tracking error in velocity tracking tasks.

Funder

the National Key R&D Program of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Mechanical Engineering,Control and Systems Engineering

Link

https://www.mdpi.com/2072-666X/13/10/1688/pdf

Reference37 articles.

1. Cyborg and Bionic Systems: Signposting the Future

2. Legged Robots That Balance

3. Virtual actuator control;Pratt;Proceedings of the 1996 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS),1996

4. Virtual model control of a bipedal walking robot;Pratt;Proceedings of the 1997 IEEE International Conference on Robotics and Automation (ICRA),1997

5. On the Dynamic Stability of Biped Locomotion

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Bionic Jumping of Humanoid Robot via Online Centroid Trajectory Optimization and High Dynamic Motion Controller;Journal of Bionic Engineering;2024-09-04

2. Deep reinforcement learning-based pitch attitude control of a beaver-like underwater robot;Ocean Engineering;2024-09

3. Design and Analysis of a Novel Omnidirectional Step-climbing Robot;Sensors and Materials;2024-06-18

4. Realization of a Human-like Gait for a Bipedal Robot Based on Gait Analysis;Machines;2024-01-25

5. Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties;Journal of Bionic Engineering;2024-01-18