Differential reinforcement encoding along the hippocampal long axis helps resolve the explore–exploit dilemma

Published:2020-10-26 Issue:1 Volume:11 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Dombrovski Alexandre Y., Luna Beatriz, Hallquist Michael N.

Abstract

When making decisions, should one exploit known good options or explore potentially better alternatives? Exploration of spatially unstructured options depends on the neocortex, striatum, and amygdala. In natural environments, however, better options often cluster together, forming structured value distributions. The hippocampus binds reward information into allocentric cognitive maps to support navigation and foraging in such spaces. Here we report that human posterior hippocampus (PH) invigorates exploration while anterior hippocampus (AH) supports the transition to exploitation on a reinforcement learning task with a spatially structured reward function. These dynamics depend on differential reinforcement representations in the PH and AH. Whereas local reward prediction error signals are early and phasic in the PH tail, global value maximum signals are delayed and sustained in the AH body. AH compresses reinforcement information across episodes, updating the location and prominence of the value maximum and displaying goal cell-like ramping activity when navigating toward it.

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry

Link

https://www.nature.com/articles/s41467-020-18864-0.pdf

Reference: 99 articles.

1. Sutton, R. S. & Barto, A. G. Reinforcement Learning: an Introduction (MIT Press, 1998).

2. Badre, D., Doll, B. B., Long, N. M. & Frank, M. J. Rostrolateral prefrontal cortex and individual differences in uncertainty-driven exploration. Neuron 73, 595–607 (2012).

3. Beharelle, A. R., Polanía, R., Hare, T. A. & Ruff, C. C. Transcranial stimulation over frontopolar cortex elucidates the choice attributes and neural mechanisms used to resolve exploration–exploitation trade-offs. J. Neurosci. 35, 14544–14556 (2015).

4. Blanchard, T. C. & Gershman, S. J. Pure correlates of exploration and exploitation in the human brain. Cogn. Affect. Behav. Neurosci. 18, 117–126 (2018).

5. Daw, N. D., O’Doherty, J. P., Dayan, P., Seymour, B. & Dolan, R. J. Cortical substrates for exploratory decisions in humans. Nature 441, 876–879 (2006).

Cited by 9 articles.

1. Exploration-Exploitation and Suicidal Behavior in Borderline Personality Disorder and Depression;JAMA Psychiatry;2024-07-10

2. Reward-based option competition in human dorsal stream and transition from stochastic exploration to exploitation in continuous space;Science Advances;2024-02-23

3. Effects of childhood maltreatment and major depressive disorder on functional connectivity in hippocampal subregions;Brain Imaging and Behavior;2024-02-07

4. Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies;Neuropsychologia;2024-01

5. Reinforcement-based option competition in human dorsal stream during exploration/exploitation of a continuous space;2023-05-23