Dopamine transients delivered in learning contexts do not act as model-free prediction errors-Reference-Cited by-同舟云学术

Dopamine transients delivered in learning contexts do not act as model-free prediction errors

Published:2019-03-12 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Sharpe Melissa J.,Batchelor Hannah M.,Mueller Lauren E.,Chang Chun Yun,Maes Etienne J.P.,Niv Yael,Schoenbaum Geoffrey

Abstract

AbstractDopamine neurons fire transiently in response to unexpected rewards. These neural correlates are proposed to signal the reward prediction error described in model-free reinforcement learning algorithms. This error term represents the unpredicted or ‘excess’ value of the rewarding event. In model-free reinforcement learning, this value is then stored as part of the learned value of any antecedent cues, contexts or events, making them intrinsically valuable, independent of the specific rewarding event that caused the prediction error. In support of equivalence between dopamine transients and this model-free error term, proponents cite causal optogenetic studies showing that artificially induced dopamine transients cause lasting changes in behavior. Yet none of these studies directly demonstrate the presence of cached value under conditions appropriate for associative learning. To address this gap in our knowledge, we conducted three studies where we optogenetically activated dopamine neurons while rats were learning associative relationships, both with and without reward. In each experiment, the antecedent cues failed to acquired value and instead entered into value-independent associative relationships with the other cues or rewards. These results show that dopamine transients, constrained within appropriate learning situations, support valueless associative learning.

Publisher

Cold Spring Harbor Laboratory

Reference37 articles.

1. Importance of unpredictability for reward responses in primate dopamine neurons

2. A Neural Substrate of Prediction and Reward

3. Toward a modern theory of adaptive networks: Expectation and prediction.

4. Sutton, R.S. & Barto, A.G. Reinforcement learning: An introduction (MIT press Cambridge, 1998).

5. Dopamine reward prediction-error signalling: a two-component response

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Disruptions in effort-based decision-making following acute optogenetic stimulation of ventral tegmental area dopamine cells;Learning & Memory;2021-03-15

2. Disruptions in effort-based decision-making following acute optogenetic stimulation of ventral tegmental area dopamine cells;2020-12-09

3. Lost in Translation? On the Need for Convergence in Animal and Human Studies on the Role of Dopamine in Diet-Induced Obesity;Current Addiction Reports;2019-08-08

4. Behavioural and computational evidence for memory consolidation biased by reward-prediction errors;2019-07-26