Active Inference, epistemic value, and vicarious trial and error-Reference-Cited by-同舟云学术

Active Inference, epistemic value, and vicarious trial and error

Published:2016-06-17 Issue:7 Volume:23 Page:322-338
ISSN:1549-5485
Container-title:Learning & Memory
language:en
Short-container-title:Learn. Mem.

Author:

Pezzulo Giovanni^ORCID,Cartoni Emilio,Rigoli Francesco,Pio-Lopez Léo,Friston Karl

Abstract

Balancing habitual and deliberate forms of choice entails a comparison of their respective merits—the former being faster but inflexible, and the latter slower but more versatile. Here, we show that arbitration between these two forms of control can be derived from first principles within an Active Inference scheme. We illustrate our arguments with simulations that reproduce rodent spatial decisions in T-mazes. In this context, deliberation has been associated with vicarious trial and error (VTE) behavior (i.e., the fact that rodents sometimes stop at decision points as if deliberating between choice alternatives), whose neurophysiological correlates are “forward sweeps” of hippocampal place cells in the arms of the maze under consideration. Crucially, forward sweeps arise early in learning and disappear shortly after, marking a transition from deliberative to habitual choice. Our simulations show that this transition emerges as the optimal solution to the trade-off between policies that maximize reward or extrinsic value (habitual policies) and those that also consider the epistemic value of exploratory behavior (deliberative or epistemic policies)—the latter requiring VTE and the retrieval of episodic information via forward sweeps. We thus offer a novel perspective on the optimality principles that engender forward sweeps and VTE, and on their role on deliberate choice.

Funder

Wellcome Trust

European Community

Goal-Leaders

HFSP

Publisher

Cold Spring Harbor Laboratory

Subject

Cellular and Molecular Neuroscience,Cognitive Neuroscience,Neuropsychology and Physiological Psychology

Reference87 articles.

1. Attias H . 2003. Planning by Probabilistic Inference. In: Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics. Key West, FL.

2. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates

3. Beal MJ . 2003. Variational algorithms for approximate Bayesian inference. University of London, London, UK.

4. Effects of pharmacological manipulations of NMDA-receptors on deliberation in the Multiple-T task

5. Planning as inference

Cited by 44 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Space is a latent sequence: A theory of the hippocampus;Science Advances;2024-08-02

2. Active Data Selection and Information Seeking;Algorithms;2024-03-12

3. Exploration versus exploitation decisions in the human brain: A systematic review of functional neuroimaging and neuropsychological studies.;Neuropsychologia;2024-01

4. To deliberate, remember; to anticipate, forget: Cognitive deliberation profiles underpinning active forgetting-dependent everyday-like memory performance in young and aged mice;2023-04-30

5. Evidence for entropy maximisation in human free choice behaviour;Cognition;2023-03