Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule-Reference-Cited by-同舟云学术

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

Published:2007-08 Issue:8 Volume:19 Page:2245-2279
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Baras Dorit,Meir Ron¹

Affiliation:

1. Department of Electrical Engineering, Technion, Haifa 32000, Israel

Abstract

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is influenced by an environmental signal, termed a reward, that directs the changes in appropriate directions. We apply a recently introduced policy learning algorithm from machine learning to networks of spiking neurons and derive a spike-time-dependent plasticity rule that ensures convergence to a local optimum of the expected average reward. The approach is applicable to a broad class of neuronal models, including the Hodgkin-Huxley model. We demonstrate the effectiveness of the derived rule in several toy problems. Finally, through statistical analysis, we show that the synaptic plasticity rule established is closely related to the widely used BCM rule, for which good biological evidence exists.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.2007.19.8.2245

Reference11 articles.

1. Infinite-Horizon Policy-Gradient Estimation

2. Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex

3. Relating STDP to BCM

4. OnActor-Critic Algorithms

5. Spike-Timing-Dependent Hebbian Plasticity as Temporal Difference Learning

Cited by 40 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Astrocytes enhance plasticity response during reversal learning;Communications Biology;2024-07-12

2. Deciphering the astrocytic contribution to learning and relearning;2024-01-12

3. Adaptive control of synaptic plasticity integrates micro- and macroscopic network function;Neuropsychopharmacology;2022-08-29

4. Second-order information bottleneck based spiking neural networks for sEMG recognition;Information Sciences;2022-03

5. A Hebbian Approach to Non-Spatial Prelinguistic Reasoning;Brain Sciences;2022-02-17