Lightweight Reinforcement Algorithms for autonomous, scalable intra-cortical Brain Machine Interfaces-Reference-Cited by-同舟云学术

Lightweight Reinforcement Algorithms for autonomous, scalable intra-cortical Brain Machine Interfaces

Published:2020-12-09 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Shaikh Shoeb^ORCID,So Rosa,Sibindi Tafadzwa,Libedinsky Camilo,Basu Arindam

Abstract

AbstractIntra-cortical Brain Machine Interfaces (iBMIs) with wireless capability could scale the number of recording channels by integrating an intention decoder to reduce data rates. However, the need for frequent retraining due to neural signal non-stationarity is a big impediment. This paper presents an alternate paradigm of online reinforcement learning (RL) with a binary evaluative feedback in iBMIs to tackle this issue. This paradigm eliminates time-consuming calibration procedures. Instead, it relies on updating the model on a sequential sample-by-sample basis based on an instantaneous evaluative binary feedback signal. However, batch updates of weight in popular deep networks is very resource consuming and incompatible with constraints of an implant. In this work, using offline open-loop analysis on pre-recorded data, we show application of a simple RL algorithm - Banditron -in discrete-state iBMIs and compare it against previously reported state of the art RL algorithms – Hebbian RL, Attention gated RL, deep Q-learning. Owing to its simplistic single-layer architecture, Banditron is found to yield at least two orders of magnitude of reduction in power dissipation compared to state of the art RL algorithms. At the same time, post-hoc analysis performed on four pre-recorded experimental datasets procured from the motor cortex of two non-human primates performing joystick-based movement-related tasks indicate Banditron performing significantly better than state of the art RL algorithms by at least 5%, 10%, 7% and 7% in experiments 1, 2, 3 and 4 respectively. Furthermore, we propose a non-linear variant of Banditron, Banditron-RP, which gives an average improvement of 6%, 2% in decoding accuracy in experiments 2,4 respectively with only a moderate increase in power consumption.

Publisher

Cold Spring Harbor Laboratory

Reference50 articles.

1. Prevalence and causes of paralysis-united states, 2013;American journal of public health,2016

2. C. Pandarinath , P. Nuyujukian , et al., “High performance communication by people with paralysis using an intracortical brain-computer interface,” eLife, p. e18554, 2017.

3. P. Nuyujukian , J. A. Sanabria , et al., “Cortical control of a tablet computer by people with paralysis,” in PloS one, 2018.

4. J. D. Simeral , S.-P. Kim , et al., “Neural control of cursor trajectory and click by a human with tetraplegia 1000 days after implant of an intracortical microelectrode array,” Journal of Neural Engineering, vol. 8, no. 2, p. 025027.

5. Opposite Effects of mGluR1a and mGluR5 Activation on Nucleus Accumbens Medium Spiny Neuron Dendritic Spine Density