Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks

Published: 2019-02-01  Issue: 3  Volume: 9  Page: 502
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Gil Cristyan, Calvo Hiram, Sossa Humberto

Abstract

Programming robots to perform different activities requires calculating sequences of joint values while taking into account many factors, such as stability and efficiency, at the same time. For walking in particular, state-of-the-art techniques for approximating these sequences are based on reinforcement learning (RL). In this work, we propose a multi-level system in which the same RL method is used first to learn the configurations of robot joints (poses) that allow the robot to stand with stability; then, in the second level, we find the sequence of poses that lets it reach the furthest distance in the shortest time while avoiding falls and keeping a straight path. To evaluate this, we measure the time it takes the robot to travel a certain distance. To our knowledge, this is the first work to focus on both the speed and the precision of the trajectory at the same time. We implement our model in a simulated environment using Q-learning. We compare our model with the built-in walking modes of a NAO robot: it improves on the normal-speed mode and enhances robustness relative to the fast-speed mode. The proposed model can be extended to other tasks and is independent of a particular robot model.
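
For readers unfamiliar with Q-learning, the sketch below shows the tabular update rule on a toy problem. It is not the authors' multi-level system: the one-dimensional corridor, the per-step time penalty, the function names (`q_learning`, `greedy_policy`), and all hyperparameters are assumptions chosen only to illustrate how a "reach the goal in the shortest time" objective can be learned.

```python
import random


def q_learning(n_states=6, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy 1-D corridor (hypothetical stand-in task).

    States are positions 0..n_states-1. Action 0 steps back, action 1 steps
    forward. Every step costs -1 (a time penalty), and reaching the last
    state ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy selection; ties are broken in favor of action 1.
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = -1.0
            # The terminal state contributes no future value.
            future = 0.0 if next_state == goal else max(q[next_state])
            # Standard Q-learning update.
            q[state][action] += alpha * (
                reward + gamma * future - q[state][action]
            )
            state = next_state
    return q


def greedy_policy(q):
    """Return the greedy action for every non-terminal state."""
    return [0 if back > forward else 1 for back, forward in q[:-1]]


if __name__ == "__main__":
    table = q_learning()
    print(greedy_policy(table))
```

Because each step is penalized, the learned greedy policy favors the shortest route to the goal, which loosely mirrors the time-to-distance criterion described in the abstract. A real gait learner would need a far richer state and action space (joint poses) and a stability term in the reward.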

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

http://www.mdpi.com/2076-3417/9/3/502/pdf

References: 34 articles.

Cited by 31 articles.

1. A global bibliometric and visualized analysis of gait analysis and artificial intelligence research from 1992 to 2022;Frontiers in Robotics and AI;2023-11-17

2. A Multiobjective Collaborative Deep Reinforcement Learning Algorithm for Jumping Optimization of Bipedal Robot;Advanced Intelligent Systems;2023-11-04

3. Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-06-01

4. Designing a Biped Robot's Gait using Reinforcement Learning's -Actor Critic Method;2023 International Conference on Inventive Computation Technologies (ICICT);2023-04-26

5. Fault-Tolerant Control for Robotic Systems Using a Wavelet Type-2 Fuzzy Brain Emotional Learning Controller and a TOPSIS-Based Self-organizing Algorithm;International Journal of Fuzzy Systems;2023-04-18