Learning and Reusing Quadruped Robot Movement Skills from Biological Dogs for Higher-Level Tasks-Reference-Cited by-同舟云学术

Learning and Reusing Quadruped Robot Movement Skills from Biological Dogs for Higher-Level Tasks

Published:2023-12-20 Issue:1 Volume:24 Page:28
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Wan Qifeng¹,Luo Aocheng¹,Meng Yan¹,Zhang Chong²,Chi Wanchao²,Zhang Shenghao²,Liu Yuzhen²,Zhu Qiuguo³,Kong Shihan¹,Yu Junzhi¹⁴^ORCID

Affiliation:

1. State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China

2. Tencent Robotics X, Shenzhen 518057, China

3. Institute of Cyber-Systems and Control, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China

4. Science and Technology on Integrated Information System Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China

Abstract

In the field of quadruped robots, the most classic motion control algorithm is based on model prediction control (MPC). However, this method poses challenges as it necessitates the precise construction of the robot’s dynamics model, making it difficult to achieve agile movements similar to those of a biological dog. Due to these limitations, researchers are increasingly turning to model-free learning methods, which significantly reduce the difficulty of modeling and engineering debugging and simultaneously reduce real-time optimization computational burden. Inspired by the growth process of humans and animals, from learning to walk to fluent movements, this article proposes a hierarchical reinforcement learning framework for the motion controller to learn some higher-level tasks. First, some basic motion skills can be learned from motion data captured from a biological dog. Then, with these learned basic motion skills as a foundation, the quadruped robot can focus on learning higher-level tasks without starting from low-level kinematics, which saves redundant training time. By utilizing domain randomization techniques during the training process, the trained policy function can be directly transferred to a physical robot without modification, and the resulting controller can perform more biomimetic movements. By implementing the method proposed in this article, the agility and adaptability of the quadruped robot can be maximally utilized to achieve efficient operations in complex terrains.

Funder

Beijing Natural Science Foundation

CIE-Tencent Robotics X Rhino-Bird Focused Research Program

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/24/1/28/pdf

Reference31 articles.

1. Kang, R., Meng, F., Chen, X., Yu, Z., Fan, X., Ming, A., and Huang, Q. (2020). Structural design and crawling pattern generator of a planar quadruped robot for high-payload locomotion. Sensors, 20.

2. Development of a penguin-inspired swimming robot with air lubrication system;Pan;IEEE Trans. Ind. Electron.,2022

3. On the biomimetic design of agile-robot legs;Garcia;Sensors,2011

4. Zhang, X., Yi, H., Liu, J., Li, Q., and Luo, X. (2021). A bio-inspired compliance planning and implementation method for hydraulically actuated quadruped robots with consideration of ground stiffness. Sensors, 21.

5. Bouman, A., Ginting, M.F., Alatur, N., Palieri, M., Fan, D.D., Touma, T., Pailevanian, T., Kim, S., Otsu, K., and Burdick, J. (2020, January 24–30). Autonomous spot: Long-range autonomous exploration of extreme environments with legged locomotion. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.