Reinforcement Q-Learning Control With Reward Shaping Function for Swing Phase Control in a Semi-active Prosthetic Knee

Published:2020-11-26 Volume:14
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
Short-container-title:Front. Neurorobot.

Author:

Hutabarat Yonatan,Ekkachai Kittipong,Hayashibe Mitsuhiro,Kongprawechnon Waree

Abstract

In this study, we investigated a reinforcement learning (RL) control algorithm for a semi-active prosthetic knee. Model-free reinforcement Q-learning control with a reward shaping function was proposed as the voltage controller of a magnetorheological damper-based prosthetic knee. The reward function was designed as a function of a performance index that accounts for the trajectory of the subject-specific knee angle. We compared our proposed reward function with a conventional single reward function under the same random initialization of the Q-matrix. We trained this control algorithm to adapt to several walking speed datasets under one control policy and subsequently compared its performance with that of other control algorithms. The results showed that our proposed reward function performed better than the conventional single reward function in terms of the normalized root mean squared error and also showed a trend toward faster convergence. Furthermore, our control strategy converged within our desired performance index and could adapt to several walking speeds. Our proposed control structure also performed better overall than user-adaptive control, and at some walking speeds it performed better than the neural network predictive control from existing studies.
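
To make the contrast between a single reward and a shaped reward concrete, here is a minimal Python sketch of a tabular Q-learning update driven by either kind of reward. It is an illustration only, not the authors' implementation: the function names, the target error, the `scale` parameter, and the shapes of both reward functions are assumptions, since the abstract does not give the actual reward function, the state and action discretization, or the learning parameters.

```python
import numpy as np

def single_reward(nrmse, target=0.01):
    # Assumed form of a "single" reward: 1 only when the tracking error
    # meets the target, 0 otherwise. Not taken from the paper.
    return 1.0 if nrmse <= target else 0.0

def shaped_reward(nrmse, target=0.01, scale=0.1):
    # Hypothetical shaped reward: 1 at or below the target, otherwise a
    # penalty that grows linearly with the excess error, clipped at -1.
    if nrmse <= target:
        return 1.0
    return -min((nrmse - target) / scale, 1.0)

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    # Standard tabular Q-learning update, applied in place:
    # Q[s, a] += alpha * (r + gamma * max_a' Q[s_next, a'] - Q[s, a])
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q

# Toy usage: 3 hypothetical states, 2 hypothetical damper-voltage actions.
Q = np.zeros((3, 2))
q_update(Q, s=0, a=1, r=shaped_reward(0.06), s_next=1)
```

With the single reward, every step that misses the target returns the same value, so the update carries no information about how far the trajectory was from the desired knee angle. A graded reward gives the update a signal on every step, which is one plausible reason for the faster convergence trend the abstract reports.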

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Biomedical Engineering

References: 21 articles.

1. Motor synergy development in high-performing deep reinforcement learning algorithms;Chai;IEEE Robot. Autom. Lett.,2020

2. Swing phase control of semi-active prosthetic knee using neural network predictive control with particle swarm optimization;Ekkachai;IEEE Trans. Neural Syst. Rehabil. Eng.,2016

3. A novel approach to model magneto-rheological dampers using EHM with a feed-forward neural network;Ekkachai;Science Asia,2012

4. Force control of a magnetorheological damper using an elementary hysteresis model-based feedforward neural network;Ekkachai;Smart Mater. Struct.,2013

5. Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control;Fernandez-Gauna;Inform. Sci.,2013

Cited by 10 articles.

1. An Optimized Control Strategy Based on Multidimensional Feature Operation Pattern;IEEE Transactions on Control Systems Technology;2024-07

2. Implementation of PID controller and enhanced red deer algorithm in optimal path planning of substation inspection robots;Journal of Field Robotics;2024-04-05

3. Magnetorheological fluid in prostheses: A state-of-the-art review;Journal of Intelligent Material Systems and Structures;2024-02-06

4. Immune deep reinforcement learning-based path planning for mobile robot in unknown environment;Applied Soft Computing;2023-09

5. Reinforcement learning based variable damping control of wearable robotic limbs for maintaining astronaut pose during extravehicular activity;Frontiers in Neurorobotics;2023-02-15