Abstract
Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through either model-based controller or reinforcement learning. It is proven that robust controllers can be obtained through model-based methods and learning-based policies have advantages in generalization. This paper proposed a hybrid framework of locomotion controller that combines deep reinforcement learning and simple heuristic policy and assigns them to different activation phases, which provides guidance for adaptive training without producing conflicts between heuristic knowledge and learned policies. The training in simulation follows a step-by-step stochastic curriculum to guarantee success. Domain randomization during training and assistive extra feedback loops on real robot are also adopted to smooth the transition to the real world. Comparison experiments are carried out on both simulated and real Wukong-IV humanoid robots, and the proposed hybrid approach matches the canonical end-to-end approaches with higher rate of success, faster converging speed, and 60% less tracking error in velocity tracking tasks.
Funder
the National Key R&D Program of China
Subject
Electrical and Electronic Engineering,Mechanical Engineering,Control and Systems Engineering
Reference37 articles.
1. Cyborg and Bionic Systems: Signposting the Future
2. Legged Robots That Balance
3. Virtual actuator control;Pratt;Proceedings of the 1996 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS),1996
4. Virtual model control of a bipedal walking robot;Pratt;Proceedings of the 1997 IEEE International Conference on Robotics and Automation (ICRA),1997
5. On the Dynamic Stability of Biped Locomotion
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献