Intrinsic rewards explain context-sensitive valuation in reinforcement learning-Reference-Cited by-同舟云学术

Intrinsic rewards explain context-sensitive valuation in reinforcement learning

Published:2023-07-17 Issue:7 Volume:21 Page:e3002201
ISSN:1545-7885
Container-title:PLOS Biology
language:en
Short-container-title:PLoS Biol

Author:

Molinaro Gaia^ORCID,Collins Anne G. E.^ORCID

Abstract

When observing the outcome of a choice, people are sensitive to the choice’s context, such that the experienced value of an option depends on the alternatives: getting $1 when the possibilities were 0 or 1 feels much better than when the possibilities were 1 or 10. Context-sensitive valuation has been documented within reinforcement learning (RL) tasks, in which values are learned from experience through trial and error. Range adaptation, wherein options are rescaled according to the range of values yielded by available options, has been proposed to account for this phenomenon. However, we propose that other mechanisms—reflecting a different theoretical viewpoint—may also explain this phenomenon. Specifically, we theorize that internally defined goals play a crucial role in shaping the subjective value attributed to any given option. Motivated by this theory, we develop a new “intrinsically enhanced” RL model, which combines extrinsically provided rewards with internally generated signals of goal achievement as a teaching signal. Across 7 different studies (including previously published data sets as well as a novel, preregistered experiment with replication and control studies), we show that the intrinsically enhanced model can explain context-sensitive valuation as well as, or better than, range adaptation. Our findings indicate a more prominent role of intrinsic, goal-dependent rewards than previously recognized within formal models of human RL. By integrating internally generated signals of reward, standard RL theories should better account for human behavior, including context-sensitive valuation and beyond.

Funder

University of California Berkeley

Foundation for the National Institutes of Health

National Science Foundation

Publisher

Public Library of Science (PLoS)

Subject

General Agricultural and Biological Sciences,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Neuroscience

Reference62 articles.

1. BOLD Subjective Value Signals Exhibit Robust Range Adaptation;KM Cox;J Neurosci,2014

2. Medial orbitofrontal cortex codes relative rather than absolute value of financial rewards in humans;R Elliott;Eur J Neurosci,2008

3. Efficient coding and the neural representation of value;K Louie;Ann N Y Acad Sci,2012

4. Activity in human reward-sensitive brain areas is strongly context dependent;S Nieuwenhuis;Neuroimage,2005

5. Value normalization in decision making: theory and evidence;A Rangel;Curr Opin Neurobiol,2012

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Neural Correlates of Ambiguity and Risk in Human Decision-Making under an Active Inference Framework;2024-08-23

2. The Neural Correlates of Ambiguity and Risk in Human Decision-Making under an Active Inference Framework;2024-08-23

3. Fundamental processes in sensorimotor learning: Reasoning, refinement, and retrieval;eLife;2024-08-01

4. The computational structure of consummatory anhedonia;Trends in Cognitive Sciences;2024-06

5. Frequent winners explain apparent skewness preferences in experience-based decisions;Proceedings of the National Academy of Sciences;2024-03-15