Adaptive Locomotion Learning for Quadruped Robots by Combining DRL with a Cosine Oscillator Based Rhythm Controller

Published:2023-10-07 Issue:19 Volume:13 Page:11045
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Zhang Xiaoping¹²,Wu Yitong¹,Wang Huijiang²,Iida Fumiya²,Wang Li¹

Affiliation:

1. School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China

2. Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, UK

Abstract

Animals have evolved to adapt to complex and uncertain environments, acquiring locomotion skills for diverse surroundings. To endow robots with animal-like locomotion ability, we propose a learning algorithm for quadruped robots based on deep reinforcement learning (DRL) and a rhythm controller built on cosine oscillators. Two cosine oscillators are used per leg, one at the hip joint and one at the knee joint, so that eight oscillators in total form the controller that generates the robot’s locomotion rhythm. The coupling between the cosine oscillators is realized through phase differences, which keeps the otherwise complex coupling relationships between different joints simple and easy to implement. DRL is used to learn the controller parameters. In the reward function design, we address terrain adaptation using proprioceptive information rather than complex camera-based vision processing: a state estimator is introduced to estimate the robot’s posture, which in turn allows the foot-end coordinates to be used. Experiments are carried out in CoppeliaSim under flat, uphill and downhill conditions. The results show that the robot successfully walks in all three conditions and that, with the designed reward function, its pitch, yaw and roll angles remain very small, meaning the robot is relatively stable during walking. The robot is then transferred to a new scene; the results show that, although the environment was not previously encountered, the robot can still complete the task, which demonstrates the effectiveness and robustness of the proposed method.
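
As a rough illustration of the rhythm controller described in the abstract, the sketch below drives eight cosine oscillators (one hip and one knee per leg) and couples them only through fixed phase offsets. It is a minimal sketch under stated assumptions, not the paper's formulation: the leg names, the trot-style phase offsets, the amplitudes, the frequency and the hip-to-knee offset are all hypothetical values chosen for illustration. In the paper's setting, parameters of this kind would be the quantities that DRL learns.

```python
import math

# Hypothetical leg labels and trot-style phase offsets (fractions of one cycle).
# Diagonal legs share a phase; these numbers are illustrative assumptions only.
LEG_PHASE = {"FL": 0.0, "RR": 0.0, "FR": 0.5, "RL": 0.5}


def joint_targets(t, freq_hz=1.0, hip_amp=0.3, knee_amp=0.4, knee_offset=0.25):
    """Return {(leg, joint): angle in rad} for the eight oscillators at time t.

    All default parameter values are assumptions, not values from the paper.
    """
    targets = {}
    for leg, leg_phase in LEG_PHASE.items():
        # Shared phase of this leg; inter-leg coupling is just the offset.
        base = 2.0 * math.pi * (freq_hz * t + leg_phase)
        targets[(leg, "hip")] = hip_amp * math.cos(base)
        # The knee runs at the same frequency, shifted by a fixed phase offset.
        targets[(leg, "knee")] = knee_amp * math.cos(
            base + 2.0 * math.pi * knee_offset
        )
    return targets


if __name__ == "__main__":
    # Print the hip targets over half a cycle to see the diagonal pairing.
    for step in range(6):
        t = step * 0.1
        hips = {leg: round(joint_targets(t)[(leg, "hip")], 3) for leg in LEG_PHASE}
        print(f"t={t:.1f}s", hips)
```

Because every oscillator shares one frequency and differs only by a phase term, changing gait in this sketch amounts to changing the entries of `LEG_PHASE`, which is the simplification the abstract attributes to phase-difference coupling.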

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/19/11045/pdf

Reference: 34 articles.

1. Quadruped robot control through model predictive control with PD compensator;Chang;Int. J. Control. Autom. Syst.,2021

2. Gait optimization of a quadruped robot using evolutionary computation;Kim;J. Bionic Eng.,2021

3. Sakakibara, Y., Kan, K., Hosoda, Y., Hattori, M., and Fujie, M. (1990, January 3–6). Foot trajectory for a quadruped walking machine. Proceedings of the IEEE International Workshop on Intelligent Robots and Systems, Towards a New Frontier of Applications, Ibaraki, Japan.

4. Sun, L., Meng, M.Q.H., Chen, W., Liang, H., and Mei, T. (2007, January 3–7). Design of quadruped robot based neural network. Proceedings of the Advances in Neural Networks—ISNN 2007: 4th International Symposium on Neural Networks, ISNN 2007, Nanjing, China. Proceedings, Part I 4.

5. Li, X., Zhang, X., Niu, J., and Li, C. (2022, January 7–10). A stable walking strategy of quadruped robot based on ZMP in trotting gait. Proceedings of the 2022 IEEE International Conference on Mechatronics and Automation (ICMA), Guangxi, China.

Cited by 1 article.

1. A Velocity Tracking Method for Quadruped Robot with Rhythm Controller;2024 IEEE 13th Data Driven Control and Learning Systems Conference (DDCLS);2024-05-17