Reinforcement Learning for Control of Human Locomotion in Simulation

Published: 2023-12-20

Author:

Dashkovets Andrii, Laschowski Brokoslaw

Abstract

Control of robotic leg prostheses and exoskeletons is an open challenge. Computer modeling and simulation can be used to study the dynamics and control of human walking and to extract principles that can be programmed into robotic legs so that they behave similarly to biological legs. In this study, we present the development of an efficient two-layer Q-learning algorithm, with k-d trees, that operates over continuous action spaces, together with a reward model that estimates the degree of muscle activation similarity between the agent and human state-to-action pairs and state-to-action sequences. We used a human musculoskeletal model acting in a high-dimensional, physics-based simulation environment to train and evaluate our algorithm to simulate biomimetic walking. We used imitation learning and artificial biomechanics data to accelerate training via expert demonstrations, and used experimental human data to compare and validate our predictive simulations, achieving 79% accuracy. Also, when compared to the previous state of the art, which used deep deterministic policy gradient, our algorithm was significantly more efficient, with lower computational and memory storage requirements (i.e., requiring 7 times less RAM and 87 times less CPU compute), which can benefit real-time embedded computing. Overall, our new two-layer Q-learning algorithm using sequential data for continuous imitation of human locomotion serves as a first step towards the development of bioinspired controllers for robotic prosthetic legs and exoskeletons. Future work will focus on improving the prediction accuracy compared to experimental data and expanding our simulations to other locomotor activities.
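The abstract names three ingredients: k-d trees, tabular Q-learning, and a reward based on muscle activation similarity. The following is a minimal sketch of how such pieces could fit together, not the authors' implementation. It does not reproduce the two-layer structure or the sequence-based reward, whose details the abstract does not give. All names (`build_kdtree`, `nearest`, `similarity_reward`, `q_update`), the toy 2-D states, the activation vectors, and the reward formula (one minus the mean absolute activation difference) are assumptions chosen for illustration.

```python
def build_kdtree(points, depth=0):
    """Build a k-d tree from (index, vector) pairs as nested tuples."""
    if not points:
        return None
    axis = depth % len(points[0][1])          # cycle through dimensions
    points = sorted(points, key=lambda p: p[1][axis])
    mid = len(points) // 2                    # median point becomes the node
    return (points[mid], axis,
            build_kdtree(points[:mid], depth + 1),
            build_kdtree(points[mid + 1:], depth + 1))


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(node, query, best=None):
    """Return the stored (index, vector) pair closest to `query`."""
    if node is None:
        return best
    point, axis, left, right = node
    if best is None or sq_dist(point[1], query) < sq_dist(best[1], query):
        best = point
    diff = query[axis] - point[1][axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, query, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if diff ** 2 < sq_dist(best[1], query):
        best = nearest(far, query, best)
    return best


def similarity_reward(agent_act, human_act):
    """Assumed reward: 1 minus the mean absolute activation difference."""
    diffs = [abs(a - h) for a, h in zip(agent_act, human_act)]
    return 1.0 - sum(diffs) / len(diffs)


def q_update(q, s, a, r, s_next, n_actions, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update on a dict keyed by (state, action)."""
    best_next = max(q.get((s_next, b), 0.0) for b in range(n_actions))
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * best_next - old)
    return q[(s, a)]


# Hypothetical stored states (2-D for illustration) and candidate
# muscle-activation vectors with values in [0, 1].
states = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
actions = [(0.2, 0.8), (0.9, 0.1)]
state_tree = build_kdtree(list(enumerate(states)))

# Snap continuous observations to the nearest stored state index.
s, _ = nearest(state_tree, (0.9, 0.2))
s_next, _ = nearest(state_tree, (0.1, 0.9))

human = (0.25, 0.75)                      # hypothetical demonstrated activation
r = similarity_reward(actions[0], human)  # close to 1 when activations match
q = {}
q_update(q, s, 0, r, s_next, len(actions))
```

The k-d tree is what lets a table-based method handle continuous vectors in this sketch: each query is mapped to its nearest stored entry in roughly logarithmic time on balanced data, so the Q-table stays a plain dictionary rather than a neural network. That is one plausible reading of why the abstract reports lower RAM and CPU requirements than a deep deterministic policy gradient baseline.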

Publisher

Cold Spring Harbor Laboratory

References: 35 articles.

1. A review of current state-of-the-art control methods for lower-limb powered prostheses; Annual Reviews in Control, 2023

2. Control strategies for active lower extremity prosthetics and orthotics: a review; Journal of NeuroEngineering and Rehabilitation, 2015

3. Simulating ideal assistive devices to reduce the metabolic cost of walking with heavy loads

4. Simulating Ideal Assistive Devices to Reduce the Metabolic Cost of Running

5. Design of patient-specific gait modifications for knee osteoarthritis rehabilitation

Cited by 1 article.

1. StairNet: visual recognition of stairs for human–robot locomotion; BioMedical Engineering OnLine; 2024-02-15