Asymmetric and adaptive reward coding via normalized reinforcement learning

Published:2022-07-21 Issue:7 Volume:18 Page:e1010350
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

Louie Kenway

Abstract

Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making.
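The asymmetry described in the abstract can be illustrated with a minimal sketch. The record does not give the model's equations, so the saturating form r^n / (sigma^n + r^n) and the names `normalized_value`, `prediction_error`, `asymmetry`, `sigma`, and `n` below are assumptions chosen for illustration, not the author's implementation. The sketch compares the prediction errors produced by equal-sized reward deviations above and below an expected reward.

```python
def normalized_value(reward, sigma=1.0, n=1.0):
    """Assumed saturating transform: r^n / (sigma^n + r^n)."""
    r_n = reward ** n
    return r_n / (sigma ** n + r_n)


def prediction_error(reward, expected_reward, sigma=1.0, n=1.0):
    """Prediction error measured in normalized value units (illustrative)."""
    return (normalized_value(reward, sigma, n)
            - normalized_value(expected_reward, sigma, n))


def asymmetry(expected_reward, delta, sigma=1.0, n=1.0):
    """Ratio |negative PE| / positive PE for equal reward deviations +/- delta."""
    pos = prediction_error(expected_reward + delta, expected_reward, sigma, n)
    neg = prediction_error(expected_reward - delta, expected_reward, sigma, n)
    return abs(neg) / pos


if __name__ == "__main__":
    # Concave regime: losses relative to expectation produce larger errors.
    print(asymmetry(1.0, 0.5, sigma=1.0, n=1.0))
    # Convex regime (reward well below sigma, steep exponent): gains dominate.
    print(asymmetry(1.0, 0.5, sigma=2.0, n=4.0))
```

Under these assumed parameters the ratio is above 1 in the first case and below 1 in the second, which is one way to see how the normalization parameters could tune the direction and size of the asymmetry without separate learning rates for positive and negative errors.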

Publisher

Public Library of Science (PLoS)

Subject

Computational Theory and Mathematics; Cellular and Molecular Neuroscience; Genetics; Molecular Biology; Ecology; Modeling and Simulation; Ecology, Evolution, Behavior and Systematics

References: 55 articles.

1. Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective.;MM Botvinick;Cognition,2009

2. Goals and habits in the brain;RJ Dolan;Neuron,2013

3. Human-level control through deep reinforcement learning;V Mnih;Nature,2015

4. Mastering the game of Go without human knowledge;D Silver;Nature,2017

Cited by 7 articles.

1. Dynamics Learning Rate Bias in Pigeons: Insights from Reinforcement Learning and Neural Correlates;Animals;2024-02-01

2. Craving money? Evidence from the laboratory and the field;Science Advances;2024-01-12

3. Distributional reinforcement learning in prefrontal cortex;Nature Neuroscience;2024-01-10

4. An opponent striatal circuit for distributional reinforcement learning;2024-01-03

5. Multi-timescale reinforcement learning in the brain;2023-11-14