On the normative advantages of dopamine and striatal opponency for learning and choice-Reference-Cited by-同舟云学术

On the normative advantages of dopamine and striatal opponency for learning and choice

Published:2023-03-22 Issue: Volume:12 Page:
ISSN:2050-084X
Container-title:eLife
language:en
Short-container-title:

Author:

Jaskir Alana¹^ORCID,Frank Michael J¹^ORCID

Affiliation:

1. Department of Cognitive, Linguistic and Psychological Sciences, Carney Institute for Brain Science, Brown University

Abstract

The basal ganglia (BG) contribute to reinforcement learning (RL) and decision-making, but unlike artificial RL agents, it relies on complex circuitry and dynamic dopamine modulation of opponent striatal pathways to do so. We develop the OpAL* model to assess the normative advantages of this circuitry. In OpAL*, learning induces opponent pathways to differentially emphasize the history of positive or negative outcomes for each action. Dynamic DA modulation then amplifies the pathway most tuned for the task environment. This efficient coding mechanism avoids a vexing explore–exploit tradeoff that plagues traditional RL models in sparse reward environments. OpAL* exhibits robust advantages over alternative models, particularly in environments with sparse reward and large action spaces. These advantages depend on opponent and nonlinear Hebbian plasticity mechanisms previously thought to be pathological. Finally, OpAL* captures risky choice patterns arising from DA and environmental manipulations across species, suggesting that they result from a normative biological mechanism.

Funder

National Institute of Mental Health

National Institutes of Health

Publisher

eLife Sciences Publications, Ltd

Subject

General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience

Link

https://cdn.elifesciences.org/articles/85107/elife-85107-v2.pdf

Reference96 articles.

1. Prefrontal cortex-driven dopamine signals in the striatum show unique spatial and pharmacological properties;Adrover;The Journal of Neuroscience,2020

2. A neurobiological theory of automaticity in perceptual categorization;Ashby;Psychological Review,2007