Learning to balance an NAO robot using reinforcement learning with symbolic inverse kinematic

Published: 2016-04-29 Issue: 11 Volume: 39 Page: 1735-1748
ISSN: 0142-3312
Container-title: Transactions of the Institute of Measurement and Control
Language: en

Author:

Tutsoy Onder¹, Erol Barkana Duygun², Colak Sule¹

Affiliation:

1. Electrical and Electronic Engineering Department, Adana Science and Technology University, Turkey

2. Electrical and Electronic Engineering Department, Yeditepe University, İstanbul, Turkey

Abstract

An autonomous humanoid robot (HR) with learning and control algorithms is able to balance itself during sitting down, standing up, walking and running operations, as humans do. In this study, reinforcement learning (RL) with a complete symbolic inverse kinematic (IK) solution is developed to balance the full lower body of a three-dimensional (3D) NAO HR, which has 12 degrees of freedom. The IK solution converts the lower body trajectories, which are learned by RL, into reference positions for the joints of the NAO robot. This reduces the dimensionality of the learning and control problems, since the IK integrated with the RL eliminates the need to use the whole HR state. The IK solution in 3D space takes into account not only the legs but also the full lower body; hence, it is possible to incorporate the effect of the foot and hip lengths on the IK solution. The accuracy and capability of following real joint states are evaluated in the simulation environment. MapleSim is used to model the full lower body, and the developed RL is combined with this model by utilizing Modelica and Maple software properties. The results of the simulation show that the value function is maximized, the temporal difference error is reduced to zero, the lower body is stabilized at the upright position, and the convergence speed of the RL is improved with the use of the symbolic IK solution.
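As a rough illustration of how a closed-form IK solution can turn a Cartesian target into joint reference positions, the sketch below solves a planar two-link leg (hip pitch and knee pitch only). It is not the paper's 12-degree-of-freedom symbolic solution: the function names, the link lengths of 0.10 m, the angle conventions and the choice of knee branch are all assumptions made for this example.

```python
import math

# Hypothetical link lengths in metres (illustrative values, not taken from the paper).
THIGH = 0.10
SHANK = 0.10


def planar_leg_ik(x, z, thigh=THIGH, shank=SHANK):
    """Closed-form IK for a planar two-link leg (assumed model).

    The hip is at the origin, x points forward and z points up, so a
    hanging leg has z < 0. Angles are measured from the downward
    vertical, positive toward +x. Returns (hip, knee) in radians.
    """
    u, v = -z, x  # rotate the frame so that "straight down" is the u axis
    d2 = u * u + v * v
    d = math.sqrt(d2)
    tol = 1e-12
    if d > thigh + shank + tol or d < abs(thigh - shank) - tol:
        raise ValueError("target is outside the reachable workspace")
    # Law of cosines gives the knee angle; clamp against rounding error.
    cos_knee = (d2 - thigh ** 2 - shank ** 2) / (2.0 * thigh * shank)
    cos_knee = max(-1.0, min(1.0, cos_knee))
    knee = math.acos(cos_knee)  # one of the two mirror-image branches
    hip = math.atan2(v, u) - math.atan2(shank * math.sin(knee),
                                        thigh + shank * math.cos(knee))
    return hip, knee


def planar_leg_fk(hip, knee, thigh=THIGH, shank=SHANK):
    """Forward kinematics of the same model; returns the ankle (x, z)."""
    x = thigh * math.sin(hip) + shank * math.sin(hip + knee)
    z = -(thigh * math.cos(hip) + shank * math.cos(hip + knee))
    return x, z


if __name__ == "__main__":
    hip, knee = planar_leg_ik(0.05, -0.15)
    print("joint references (rad):", hip, knee)
    print("ankle position check:", planar_leg_fk(hip, knee))
```

In a setup like the one the abstract describes, a learner would only need to output the low-dimensional target `(x, z)`, and a mapping of this kind would supply the joint references; the forward-kinematics function is included only to check the round trip.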

Publisher

SAGE Publications

Subject

Instrumentation

Link

http://journals.sagepub.com/doi/pdf/10.1177/0142331216645176

Reference: 51 articles.

1. Evaluation of on-line analytic and numeric inverse kinematics approaches driven by partial vision input

2. Control of constrained systems of controllability index two

Cited by 45 articles.

1. Enhanced Euler–Lagrange Formulation for Analyzing Human Gait With Moving Base Reference;Journal of Mechanisms and Robotics;2024-06-24

2. Analysis of Instantaneous Kinematic Properties Regarding the Shape of Robotic Mechanisms;Journal of Mechanisms and Robotics;2024-05-21

3. Reinforcement Learning of Bipedal Walking Using a Simple Reference Motion;Applied Sciences;2024-02-22

4. Variable Pivot Gait Based a Novel Dynamics Correction Method for Human Lower Limbs Model;Journal of Biomechanical Engineering;2024-02-09

5. Distributed impedance control of coordinated dissimilar upper-limb exoskeleton arms;Control Engineering Practice;2024-01