Reinforcement Learning With Modulated Spike Timing–Dependent Synaptic Plasticity-Reference-Cited by-同舟云学术

Reinforcement Learning With Modulated Spike Timing–Dependent Synaptic Plasticity

Published:2007-12 Issue:6 Volume:98 Page:3648-3665
ISSN:0022-3077
Container-title:Journal of Neurophysiology
language:en
Short-container-title:Journal of Neurophysiology

Author:

Farries Michael A.,Fairhall Adrienne L.

Abstract

Spike timing–dependent synaptic plasticity (STDP) has emerged as the preferred framework linking patterns of pre- and postsynaptic activity to changes in synaptic strength. Although synaptic plasticity is widely believed to be a major component of learning, it is unclear how STDP itself could serve as a mechanism for general purpose learning. On the other hand, algorithms for reinforcement learning work on a wide variety of problems, but lack an experimentally established neural implementation. Here, we combine these paradigms in a novel model in which a modified version of STDP achieves reinforcement learning. We build this model in stages, identifying a minimal set of conditions needed to make it work. Using a performance-modulated modification of STDP in a two-layer feedforward network, we can train output neurons to generate arbitrarily selected spike trains or population responses. Furthermore, a given network can learn distinct responses to several different input patterns. We also describe in detail how this model might be implemented biologically. Thus our model offers a novel and biologically plausible implementation of reinforcement learning that is capable of training a neural population to produce a very wide range of possible mappings between synaptic input and spiking output.

Publisher

American Physiological Society

Subject

Physiology,General Neuroscience

Link

https://www.physiology.org/doi/pdf/10.1152/jn.00364.2007

Reference86 articles.

1. Functional Significance of Long-Term Potentiation for Sequence Learning and Prediction

2. Specific long-lasting potentiation of synaptic transmission in hippocampal slices

3. Independent Coding of Movement Direction and Reward Prediction by Single Pallidal Neurons

4. Synaptic plasticity in a cerebellum-like structure depends on temporal order

5. Two Coincidence Detectors for Spike Timing-Dependent Plasticity in Somatosensory Cortex

Cited by 98 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. From Information to Knowledge: A Role for Knowledge Networks in Decision Making and Action Selection;Information;2024-08-15

2. Trustworthy Artificial Intelligence Methods for Users’ Physical and Environmental Security: A Comprehensive Review;Applied Sciences;2023-11-06

3. Weight versus Node Perturbation Learning in Temporally Extended Tasks: Weight Perturbation Often Performs Similarly or Better;Physical Review X;2023-04-11

4. Sleep prevents catastrophic forgetting in spiking neural networks by forming a joint synaptic weight representation;PLOS Computational Biology;2022-11-18

5. Training spiking neuronal networks to perform motor control using reinforcement and evolutionary learning;Frontiers in Computational Neuroscience;2022-09-30