Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales-Reference-Cited by-同舟云学术

Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales

Published:2019-11-20 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Marković Dimitrije^ORCID,Goschke Thomas,Kiebel Stefan J.^ORCID

Abstract

AbstractCognitive control is typically understood as a set of mechanisms which enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine beheavior or making choices beneficial only at a short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive, the specific question of how uncertain consequences at different time scales are integrated into adaptive decisions remains poorly understood. Here, we propose that precisely the problem of integrating experience and forming elaborate plans over multiple time scales is a key component for better understanding how human agents solve cognitive control dilemmas such as the exploration-exploitation dilemma. In support of this conjecture, we present a computational model of probabilistic inference over hidden states and actions, which are represented as a hierarchy of time scales. Simulations of goal-reaching agents instantiating the model in an uncertain and dynamic task environment show how the exploration-exploitation dilemma may be solved by inferring meta-control states which adapt behaviour to changing contexts.

Publisher

Cold Spring Harbor Laboratory

Reference71 articles.

1. A Primer on Foraging and the Explore/Exploit Trade-Off for Psychiatry Research;Neuropsychopharmacology,2017

2. Constructing Temporal Abstractions Autonomously in Reinforcement Learning;Ai Magazine,2018

3. Frontal Cortex and the Hierarchical Control of Behavior

4. Pure correlates of exploration and exploitation in the human brain;Cognitive Affective & Behavioral Neuroscience,2018

5. Model-based hierarchical reinforcement learning and human action control;PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES,2014

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transfer of learned cognitive flexibility to novel stimuli and task sets;2021-07-23

2. Forward planning driven by context-dependent conflict processing in anterior cingulate cortex;2021-07-22

3. The exploration-exploitation trade-off in a foraging task is affected by mood-related arousal and valence;Cognitive, Affective, & Behavioral Neuroscience;2021-06