Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning-Reference-Cited by-同舟云学术

Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning

Published:2020-08-10 Issue:16 Volume:20 Page:4468
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Xi Ao^ORCID,Chen Chao

Abstract

In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform is considered as the external disturbance to the robot. The platform had two degrees of freedom in rotation, pitch and roll. The state space comprised the position of center of pressure, and joint angles and joint velocities of two legs. The action space consisted of the joint angles of ankles, knees, and hips. By adding the inverse kinematics techniques, the dimension of action space was significantly reduced. Then, a model-based system estimator was employed during the offline training procedure to estimate the dynamics model of the system by using novel hierarchical Gaussian processes, and to provide initial control inputs, after which the reduced action space of each joint was obtained by minimizing the cost of reaching the desired stable state. Finally, a model-free optimizer based on DQN (λ) was introduced to fine tune the initial control inputs, where the optimal control inputs were obtained for each joint at any state. The proposed reinforcement learning not only successfully avoided the distribution mismatch problem, but also improved the sample efficiency. Simulation results showed that the proposed hybrid reinforcement learning mechanism enabled the NAO robot to balance on an oscillating platform with different frequencies and magnitudes. Both control performance and robustness were guaranteed during the experiments.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/16/4468/pdf

Reference42 articles.

1. The Mechanical System Design Handbook Modeling, Measurement, and Control;Nwokah,2001

2. Bipedal Robots: Modeling, Design and Walking Synthesis;Chevallereau,2008

3. Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks

4. ZERO-MOMENT POINT — THIRTY FIVE YEARS OF ITS LIFE

5. Omnidirectional Walking Using ZMP and Preview Control for the NAO Humanoid Robot;Strom,2009

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hierarchical Stabilization and Tracking Control of a Flexible-Joint Bipedal Robot Based on Anti-Windup and Adaptive Approximation Control;Journal of Robotics;2024-03-13

2. Compliant gait control method based on CVSLIP-FF model for biped robot walking over uneven terrain;ISA Transactions;2024-03

3. Walking Stability of Biped Robot Based on Machine Learning Algorithm;Lecture Notes in Mechanical Engineering;2023

4. Decoupled Multi-Loop Robust Control for a Walk-Assistance Robot Employing a Two-Wheeled Inverted Pendulum;Machines;2021-09-22

5. A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot;Applied Sciences;2021-02-10