Deep reinforcement learning-based attitude motion control for humanoid robots with stability constraints-Reference-Cited by-同舟云学术

Deep reinforcement learning-based attitude motion control for humanoid robots with stability constraints

Published:2020-04-13 Issue:3 Volume:47 Page:335-347
ISSN:0143-991X
Container-title:Industrial Robot: the international journal of robotics research and application
language:en
Short-container-title:IR

Author:

Shi Qun,Ying Wangda,Lv Lei,Xie Jiajun

Abstract

Purpose This paper aims to present an intelligent motion attitude control algorithm, which is used to solve the poor precision problems of motion-manipulation control and the problems of motion balance of humanoid robots. Aiming at the problems of a few physical training samples and low efficiency, this paper proposes an offline pre-training of the attitude controller using the identification model as a priori knowledge of online training in the real physical environment. Design/methodology/approach The deep reinforcement learning (DRL) of continuous motion and continuous state space is applied to motion attitude control of humanoid robots and the robot motion intelligent attitude controller is constructed. Combined with the stability analysis of the training process and control process, the stability constraints of the training process and control process are established and the correctness of the constraints is demonstrated in the experiment. Findings Comparing with the proportion integration differentiation (PID) controller, PID + MPC controller and MPC + DOB controller in the humanoid robots environment transition walking experiment, the standard deviation of the tracking error of robots’ upper body pitch attitude trajectory under the control of the intelligent attitude controller is reduced by 60.37 per cent, 44.17 per cent and 26.58 per cent. Originality/value Using an intelligent motion attitude control algorithm to deal with the strong coupling nonlinear problem in biped robots walking can simplify the control process. The offline pre-training of the attitude controller using the identification model as a priori knowledge of online training in the real physical environment makes up the problems of a few physical training samples and low efficiency. The result of using the theory described in this paper shows the performance of the motion-manipulation control precision and motion balance of humanoid robots and provides some inspiration for the application of using DRL in biped robots walking attitude control.

Publisher

Emerald

Subject

Industrial and Manufacturing Engineering,Computer Science Applications,Control and Systems Engineering

Reference37 articles.

1. A comparison of four methods for non-linear data modeling;Chemometrics and Intelligent Laboratory Systems,1994

2. Study of PID control algorithm and intelligent PID controller,2017

3. Longitudinal model identification and velocity control of an autonomous car;IEEE Transactions on Intelligent Transportation Systems,2014

4. A theoretically grounded application of dropout in recurrent neural networks;Advances in Neural Information Processing Systems,2016

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Q-learning approach to the continuous control problem of robot inverted pendulum balancing;Intelligent Systems with Applications;2024-03

2. Robot kinematics analysis and trajectory planning based on artificial potential field method;Applied Mathematics and Nonlinear Sciences;2024-01-01

3. Design and Control of a Lower Limb Rehabilitation Robot Based on Human Motion Intention Recognition with Multi-Source Sensor Information;Machines;2022-11-28

4. Learning to Move an Object by the Humanoid Robots by Using Deep Reinforcement Learning;Intelligent Environments 2021;2021-06-14

5. Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning;Sensors;2020-08-10