Cooperative update of beliefs and state-transition functions in human reinforcement learning

Published: 2019-11-27 Issue: 1 Volume: 9
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Higashi Hiroshi, Minami Tetsuto, Nakauchi Shigeki

Abstract

It is widely known that reinforcement learning systems in the brain contribute to learning via interactions with the environment. These systems are capable of solving multidimensional problems, in which some dimensions are relevant to a reward, while others are not. To solve these problems, computational models use Bayesian learning, a strategy supported by behavioral and neural evidence in humans. Bayesian learning takes into account beliefs, which represent a learner’s confidence in a particular dimension being relevant to the reward. Beliefs are given as a posterior probability of the state-transition (reward) function that maps the optimal actions to the states in each dimension. However, when it comes to implementing this learning strategy, the order in which beliefs and state-transition functions update remains unclear. The present study investigates this update order using a trial-by-trial analysis of human behavior and electroencephalography signals during a task in which learners have to identify the reward-relevant dimension. Our behavioral and neural results reveal a cooperative update: within 300 ms after the outcome feedback, the state-transition functions are updated, followed by the beliefs for each dimension.
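
The update order described in the abstract can be illustrated with a toy sketch. This is not the authors' model: it assumes a simple Beta-Bernoulli reward function per dimension, and the class name `DimensionLearner`, its methods, and the task layout (one discrete feature per dimension, a binary reward) are all hypothetical choices made for illustration. The only element taken from the abstract is the ordering: the per-dimension functions are updated first, and the beliefs are then re-weighted.

```python
class DimensionLearner:
    """Toy Bayesian learner over candidate reward-relevant dimensions.

    Illustrative assumption, not the model from the paper.
    """

    def __init__(self, n_dims, n_features, n_actions):
        # Beta(1, 1) pseudo-counts [rewarded, unrewarded] for every
        # (dimension, feature, action) triple. Each dimension's table
        # stands in for its state-transition (reward) function.
        self.counts = [
            [[[1.0, 1.0] for _ in range(n_actions)] for _ in range(n_features)]
            for _ in range(n_dims)
        ]
        # Uniform prior belief that each dimension is the relevant one.
        self.beliefs = [1.0 / n_dims] * n_dims

    def reward_prob(self, dim, feature, action):
        """Estimated reward probability under one dimension's function."""
        wins, losses = self.counts[dim][feature][action]
        return wins / (wins + losses)

    def update(self, stimulus, action, rewarded):
        """Process one trial; `stimulus` holds one feature index per dimension."""
        # Step 1: update every dimension's function with the outcome.
        for dim, feature in enumerate(stimulus):
            self.counts[dim][feature][action][0 if rewarded else 1] += 1.0

        # Step 2: re-weight the beliefs by how well each (already updated)
        # function explains the outcome, then normalise to a posterior.
        weighted = []
        for dim, feature in enumerate(stimulus):
            p = self.reward_prob(dim, feature, action)
            weighted.append(self.beliefs[dim] * (p if rewarded else 1.0 - p))
        total = sum(weighted)  # never zero thanks to the pseudo-counts
        self.beliefs = [w / total for w in weighted]
```

In a simulated task where reward depends only on the feature in dimension 0, repeated calls to `update` shift `beliefs` toward that dimension, because its function predicts outcomes consistently while the other dimension's predictions stay near chance. Swapping the two steps inside `update` gives the alternative ordering that the study contrasts with this one.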

Funder

MEXT | Japan Society for the Promotion of Science

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

http://www.nature.com/articles/s41598-019-53600-9.pdf

References: 62 articles.

1. Niv, Y. et al. Reinforcement learning in multidimensional environments relies on attention mechanisms. Journal of Neuroscience 35, 8145–8157, https://doi.org/10.1523/JNEUROSCI.2978-14.2015 (2015).

2. Badre, D., Kayser, A. S. & D’Esposito, M. Frontal cortex and the discovery of abstract action rules. Neuron 66, 315–326, https://doi.org/10.1016/j.neuron.2010.03.025 (2010).

3. Badre, D. & Frank, M. J. Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: Evidence from fMRI. Cerebral Cortex 22, 527–536, https://doi.org/10.1093/cercor/bhr117 (2012).

4. Frank, M. J. & Badre, D. Mechanisms of hierarchical reinforcement learning in corticostriatal circuits 1: Computational analysis. Cerebral Cortex 22, 509–526, https://doi.org/10.1093/cercor/bhr114 (2012).

5. Yoshida, W. & Ishii, S. Model-based reinforcement learning: a computational model and an fMRI study. Neurocomputing 63, 253–269, https://doi.org/10.1016/j.neucom.2004.04.012 (2005).

Cited by 2 articles.

1. Dimension-wise Sequential Update for Learning a Multidimensional Environment in Humans;Journal of Cognitive Neuroscience;2023

2. Troubled past: A critical psychometric assessment of the self-report Survey of Autobiographical Memory (SAM);Behavior Research Methods;2021-06-22