Learning and Reusing Quadruped Robot Movement Skills from Biological Dogs for Higher-Level Tasks
Author:
Wan Qifeng1, Luo Aocheng1, Meng Yan1, Zhang Chong2, Chi Wanchao2, Zhang Shenghao2, Liu Yuzhen2, Zhu Qiuguo3, Kong Shihan1, Yu Junzhi14ORCID
Affiliation:
1. State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China 2. Tencent Robotics X, Shenzhen 518057, China 3. Institute of Cyber-Systems and Control, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China 4. Science and Technology on Integrated Information System Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
Abstract
In the field of quadruped robots, the most classic motion control algorithm is based on model prediction control (MPC). However, this method poses challenges as it necessitates the precise construction of the robot’s dynamics model, making it difficult to achieve agile movements similar to those of a biological dog. Due to these limitations, researchers are increasingly turning to model-free learning methods, which significantly reduce the difficulty of modeling and engineering debugging and simultaneously reduce real-time optimization computational burden. Inspired by the growth process of humans and animals, from learning to walk to fluent movements, this article proposes a hierarchical reinforcement learning framework for the motion controller to learn some higher-level tasks. First, some basic motion skills can be learned from motion data captured from a biological dog. Then, with these learned basic motion skills as a foundation, the quadruped robot can focus on learning higher-level tasks without starting from low-level kinematics, which saves redundant training time. By utilizing domain randomization techniques during the training process, the trained policy function can be directly transferred to a physical robot without modification, and the resulting controller can perform more biomimetic movements. By implementing the method proposed in this article, the agility and adaptability of the quadruped robot can be maximally utilized to achieve efficient operations in complex terrains.
Funder
Beijing Natural Science Foundation CIE-Tencent Robotics X Rhino-Bird Focused Research Program National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference31 articles.
1. Kang, R., Meng, F., Chen, X., Yu, Z., Fan, X., Ming, A., and Huang, Q. (2020). Structural design and crawling pattern generator of a planar quadruped robot for high-payload locomotion. Sensors, 20. 2. Development of a penguin-inspired swimming robot with air lubrication system;Pan;IEEE Trans. Ind. Electron.,2022 3. On the biomimetic design of agile-robot legs;Garcia;Sensors,2011 4. Zhang, X., Yi, H., Liu, J., Li, Q., and Luo, X. (2021). A bio-inspired compliance planning and implementation method for hydraulically actuated quadruped robots with consideration of ground stiffness. Sensors, 21. 5. Bouman, A., Ginting, M.F., Alatur, N., Palieri, M., Fan, D.D., Touma, T., Pailevanian, T., Kim, S., Otsu, K., and Burdick, J. (2020, January 24–30). Autonomous spot: Long-range autonomous exploration of extreme environments with legged locomotion. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.
|
|