Learning fast while changing slow in spiking neural networks-Reference-Cited by-同舟云学术

Learning fast while changing slow in spiking neural networks

Published:2024-07-10 Issue:3 Volume:4 Page:034002
ISSN:2634-4386
Container-title:Neuromorphic Computing and Engineering
language:
Short-container-title:Neuromorph. Comput. Eng.

Author:

Capone Cristiano^ORCID,Muratore Paolo^ORCID

Abstract

Abstract Reinforcement learning (RL) faces substantial challenges when applied to real-life problems, primarily stemming from the scarcity of available data due to limited interactions with the environment. This limitation is exacerbated by the fact that RL often demands a considerable volume of data for effective learning. The complexity escalates further when implementing RL in recurrent spiking networks, where inherent noise introduced by spikes adds a layer of difficulty. Life-long learning machines must inherently resolve the plasticity-stability paradox. Striking a balance between acquiring new knowledge and maintaining stability is crucial for artificial agents. To address this challenge, we draw inspiration from machine learning technology and introduce a biologically plausible implementation of proximal policy optimization, referred to as lf-cs (learning fast changing slow). Our approach results in two notable advancements: firstly, the capacity to assimilate new information into a new policy without requiring alterations to the current policy; and secondly, the capability to replay experiences without experiencing policy divergence. Furthermore, when contrasted with other experience replay techniques, our method demonstrates the added advantage of being computationally efficient in an online setting. We demonstrate that the proposed methodology enhances the efficiency of learning, showcasing its potential impact on neuromorphic and real-world applications.

Funder

EBRAINS-Italy IR00011 PNRR Project

Publisher

IOP Publishing

Link

https://iopscience.iop.org/article/10.1088/2634-4386/ad5c96/pdf

Reference31 articles.

1. Human-level control through deep reinforcement learning;Mnih;Nature,2015

2. Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to atari breakout game;Patel;Neural Netw.,2019

3. Deep reinforcement learning with population-coded spiking neural network for continuous control;Tang,2021

4. Toward robust and scalable deep spiking reinforcement learning;Akl;Front. Neurorobot.,2023

5. The remarkable robustness of surrogate gradient learning for instilling complex function in spiking neural networks;Zenke;Neural Comput.,2021