Towards Autonomous Intra-cortical Brain Machine Interfaces: Applying Bandit Algorithms for Online Reinforcement Learning

Published: 2020-01-09

Author:

Shaikh Shoeb, So Rosa, Sibindi Tafadzwa, Libedinsky Camilo, Basu Arindam

Abstract

This paper presents the application of Banditron, an online reinforcement learning (RL) algorithm, in a discrete-state intra-cortical Brain Machine Interface (iBMI) setting. We analyzed two datasets from non-human primates (NHPs), NHP A and NHP B, each performing a 4-option discrete control task over a total of 8 days. Results show average improvements of approximately 15% and 6% in NHP A and 15% and 21% in NHP B over the state-of-the-art algorithms Hebbian Reinforcement Learning (HRL) and Attention Gated Reinforcement Learning (AGREL), respectively. Apart from yielding superior decoding performance, Banditron is also the most computationally friendly of the three, as it requires two orders of magnitude fewer multiply-and-accumulate operations than HRL and AGREL. Furthermore, Banditron provides average improvements of at least 40% and 15% in NHPs A and B, respectively, over the commonly used supervised methods LDA and SVM across test days. These results pave the way towards an alternative paradigm of temporally robust, hardware-friendly, reinforcement learning based iBMIs.
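
For readers unfamiliar with Banditron, the following is a minimal sketch of a single update step as the algorithm is commonly described (a linear multiclass learner that receives only right/wrong feedback on the action it takes). The abstract gives no implementation details, so everything here is an assumption for illustration: the function name `banditron_step`, the list-of-lists weight layout, the exploration rate `gamma`, and the feature vector `x` standing in for neural features. It is not the authors' code.

```python
import random


def banditron_step(W, x, true_label, gamma, rng):
    """One Banditron update (illustrative sketch, not the paper's code).

    W is a k-by-d list of lists (one weight row per option), x is a
    length-d feature list. true_label is used only to compute the binary
    feedback "was the sampled action correct?".
    """
    k = len(W)
    # Greedy prediction: option with the largest linear score.
    scores = [sum(w_j * x_j for w_j, x_j in zip(row, x)) for row in W]
    y_hat = max(range(k), key=lambda r: scores[r])
    # Exploration: keep y_hat with probability (1 - gamma) and spread the
    # remaining gamma uniformly over all k options.
    probs = [gamma / k + (1.0 - gamma if r == y_hat else 0.0) for r in range(k)]
    y_tilde = rng.choices(range(k), weights=probs)[0]
    correct = (y_tilde == true_label)  # the only feedback the learner sees
    # Update: if the sampled option was correct, reinforce its row with an
    # importance weight 1 / P(y_tilde); always subtract x from the greedy row.
    for j, x_j in enumerate(x):
        if correct:
            W[y_tilde][j] += x_j / probs[y_tilde]
        W[y_hat][j] -= x_j
    return y_tilde, correct
```

For a 4-option task like the one in the abstract, `W` would have four rows. Each step costs one matrix-vector product plus updates to at most two rows, which gives a rough intuition for why the method is cheap, though the abstract's operation counts are not derived here.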

Publisher

Cold Spring Harbor Laboratory

Cited by 2 articles.

1. Intelligent Intracortical Brain-Machine Interfaces;Handbook of Biochips;2022

2. Intelligent Intracortical Brain-Machine Interfaces;Handbook of Biochips;2020-12-11