Learning Advanced Locomotion for Quadrupedal Robots: A Distributed Multi-Agent Reinforcement Learning Framework with Riemannian Motion Policies-Reference-Cited by-同舟云学术

Learning Advanced Locomotion for Quadrupedal Robots: A Distributed Multi-Agent Reinforcement Learning Framework with Riemannian Motion Policies

Published:2024-05-28 Issue:6 Volume:13 Page:86
ISSN:2218-6581
Container-title:Robotics
language:en
Short-container-title:Robotics

Author:

Wang Yuliu¹²^ORCID,Sagawa Ryusuke¹²^ORCID,Yoshiyasu Yusuke²^ORCID

Affiliation:

1. Intelligent and Mechanical Interaction System, University of Tsukuba, Tsukuba 305-8577, Ibaraki, Japan

2. Computer Vision Research Team, Artificial Intelligence Research Center, The National Institute of Advanced Industrial Science and Technology, 1-1-1 Umezono, Tsukuba 305-8560, Ibaraki, Japan

Abstract

Recent advancements in quadrupedal robotics have explored the motor potential of these machines beyond simple walking, enabling highly dynamic skills such as jumping, backflips, and even bipedal locomotion. While reinforcement learning has demonstrated excellent performance in this domain, it often relies on complex reward function tuning and prolonged training times, and the interpretability is not satisfactory. Riemannian motion policies, a reactive control method, excel in handling highly dynamic systems but are generally limited to fully actuated systems, making their application to underactuated quadrupedal robots challenging. To address these limitations, we propose a novel framework that treats each leg of a quadrupedal robot as an intelligent agent and employs multi-agent reinforcement learning to coordinate the motion of all four legs. This decomposition satisfies the conditions for utilizing Riemannian motion policies and eliminates the need for complex reward functions, simplifying the learning process for high-level motion modalities. Our simulation experiments demonstrate that the proposed method enables quadrupedal robots to learn stable locomotion using three, two, or even a single leg, offering advantages in training speed, success rate, and stability compared to traditional approaches, and better interpretability. This research explores the possibility of developing more efficient and adaptable control policies for quadrupedal robots.

Funder

Japan Science and Technology Agency Support for Pioneering Research Initiated by the Next Generation

Japan Society for the Promotion of Science

New Energy and Industrial Technology Development Organization

Publisher

MDPI AG

Link

https://www.mdpi.com/2218-6581/13/6/86/pdf

Reference29 articles.

1. Bjelonic, M., Grandia, R., Harley, O., Galliard, C., Zimmermann, S., and Hutter, M. (October, January 27). Whole-body MPC and online gait sequence generation for wheeled-legged robots. Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic.

2. Zero Moment Point Estimation Based on Resonant Frequencies of Wheel Joint for Wheel-Legged Mobile Robot;Nagano;IEEJ J. Ind. Appl.,2022

3. Smith, L., Kew, J.C., Li, T., Luu, L., Peng, X.B., Ha, S., Tan, J., and Levine, S. (2023). Learning and adapting agile locomotion skills by transferring experience. arXiv.

4. Reinforcement learning-based stable jump control method for asteroid-exploration quadruped robots;Qi;Aerosp. Sci. Technol.,2023

5. Tang, Z., Kim, D., and Ha, S. (2021). Proceedings of the 3rd International Conference on Robot Intelligence Technology and Applications, Springer.