Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems-Reference-Cited by-同舟云学术

Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems

Published:2017-07-21 Issue:9 Volume:28 Page:1321-1333
ISSN:0956-7976
Container-title:Psychological Science
language:en
Short-container-title:Psychol Sci

Author:

Kool Wouter¹,Gershman Samuel J.¹²,Cushman Fiery A.¹

Affiliation:

1. Department of Psychology, Harvard University

2. Center for Brain Science, Harvard University

Abstract

Human behavior is sometimes determined by habit and other times by goal-directed planning. Modern reinforcement-learning theories formalize this distinction as a competition between a computationally cheap but inaccurate model-free system that gives rise to habits and a computationally expensive but accurate model-based system that implements planning. It is unclear, however, how people choose to allocate control between these systems. Here, we propose that arbitration occurs by comparing each system’s task-specific costs and benefits. To investigate this proposal, we conducted two experiments showing that people increase model-based control when it achieves greater accuracy than model-free control, and especially when the rewards of accurate performance are amplified. In contrast, they are insensitive to reward amplification when model-based and model-free control yield equivalent accuracy. This suggests that humans adaptively balance habitual and planned action through on-line cost-benefit analysis.

Publisher

SAGE Publications

Subject

General Psychology

Link

http://journals.sagepub.com/doi/pdf/10.1177/0956797617708288

Reference34 articles.

1. Simple Plans or Sophisticated Habits? State, Transition and Learning Interactions in the Two-Step Task

2. Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function

3. Motivation and Cognitive Control: From Behavior to Neural Mechanism

4. Perceived difficulty, energization, and the magnitude of goal valence

5. Model-Based Influences on Humans' Choices and Striatal Prediction Errors

Cited by 170 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The role of distal landmarks and individual differences in acquiring spatial representations that support flexible and automatic wayfinding;Journal of Environmental Psychology;2024-09

2. Model-free decision-making underlies motor errors in rapid sequential movements under threat;Communications Psychology;2024-08-27

3. The preference for surprise in reinforcement learning underlies the differences in developmental changes in risk preference between autistic and neurotypical youth;2024-08-23

4. The consequences of AI training on human decision-making;Proceedings of the National Academy of Sciences;2024-08-06

5. Learning and memory processes in behavioural addiction: A systematic review;Neuroscience & Biobehavioral Reviews;2024-08