Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework-Reference-Cited by-同舟云学术

Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework

Published:2017-01-03 Issue:1 Volume:68 Page:101-128
ISSN:0066-4308
Container-title:Annual Review of Psychology
language:en
Short-container-title:Annu. Rev. Psychol.

Author:

Gershman Samuel J.¹,Daw Nathaniel D.²

Affiliation:

1. Department of Psychology and Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138;

2. Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, New Jersey 08544

Abstract

We review the psychology and neuroscience of reinforcement learning (RL), which has experienced significant progress in the past two decades, enabled by the comprehensive experimental study of simple learning and decision-making tasks. However, one challenge in the study of RL is computational: The simplicity of these tasks ignores important aspects of reinforcement learning in the real world: (a) State spaces are high-dimensional, continuous, and partially observable; this implies that (b) data are relatively sparse and, indeed, precisely the same situation may never be encountered twice; furthermore, (c) rewards depend on the long-term consequences of actions in ways that violate the classical assumptions that make RL tractable. A seemingly distinct challenge is that, cognitively, theories of RL have largely involved procedural and semantic memory, the way in which knowledge about action values or world models extracted gradually from many experiences can drive choice. This focus on semantic memory leaves out many aspects of memory, such as episodic memory, related to the traces of individual events. We suggest that these two challenges are related. The computational challenge can be dealt with, in part, by endowing RL systems with episodic memory, allowing them to (a) efficiently approximate value functions over complex state spaces, (b) learn with very little data, and (c) bridge long-term dependencies between actions and rewards. We review the computational theory underlying this proposal and the empirical evidence to support it. Our proposal suggests that the ubiquitous and diverse roles of memory in RL may function as part of an integrated learning system.

Publisher

Annual Reviews

Subject

General Psychology

Link

https://www.annualreviews.org/doi/pdf/10.1146/annurev-psych-122414-033625

Cited by 271 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Predictions about reward outcomes in rhesus monkeys.;Behavioral Neuroscience;2024-02

2. Predictable navigation through spontaneous brain states with cognitive-map-like representations;Progress in Neurobiology;2024-02

3. Suppressing memory associations impacts decision-making preference: Evidence from the think/no-think paradigm;Consciousness and Cognition;2024-02

4. Reinforcement-Learning-Informed Queries Guide Behavioral Change;Clinical Psychological Science;2024-01-24

5. Goal-directed learning in adolescence: neurocognitive development and contextual influences;Nature Reviews Neuroscience;2024-01-23