Human complex exploration strategies are enriched by noradrenaline-modulated heuristics-Reference-Cited by-同舟云学术

Human complex exploration strategies are enriched by noradrenaline-modulated heuristics

Published:2021-01-04 Issue: Volume:10 Page:
ISSN:2050-084X
Container-title:eLife
language:en
Short-container-title:

Author:

Dubois Magda¹²^ORCID,Habicht Johanna¹²,Michely Jochen¹²³,Moran Rani¹²^ORCID,Dolan Ray J¹²^ORCID,Hauser Tobias U¹²^ORCID

Affiliation:

1. Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom

2. Wellcome Trust Centre for Neuroimaging, University College London, London, United Kingdom

3. Department of Psychiatry and Psychotherapy, Charité – Universitätsmedizin Berlin, Berlin, Germany

Abstract

An exploration-exploitation trade-off, the arbitration between sampling a lesser-known against a known rich option, is thought to be solved using computationally demanding exploration algorithms. Given known limitations in human cognitive resources, we hypothesised the presence of additional cheaper strategies. We examined for such heuristics in choice behaviour where we show this involves a value-free random exploration, that ignores all prior knowledge, and a novelty exploration that targets novel options alone. In a double-blind, placebo-controlled drug study, assessing contributions of dopamine (400 mg amisulpride) and noradrenaline (40 mg propranolol), we show that value-free random exploration is attenuated under the influence of propranolol, but not under amisulpride. Our findings demonstrate that humans deploy distinct computationally cheap exploration strategies and that value-free random exploration is under noradrenergic control.

Funder

Max-Planck-Gesellschaft

Wellcome Trust

Jacobs Foundation

Medical Research Foundation

Brain and Behavior Research Foundation

European Research Council

Publisher

eLife Sciences Publications, Ltd

Subject

General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience

Link

https://cdn.elifesciences.org/articles/59907/elife-59907-v2.pdf

Reference102 articles.

1. Analysis of Thompson sampling for the multi-armed bandit problem;Agrawal;Journal of Machine Learning Research : JMLR,2012

2. An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance;Aston-Jones;Annual Review of Neuroscience,2005

3. Using confidence bounds for exploitation-exploration trade-offs;Auer;Journal of Machine Learning Research : JMLR,2003

4. Bishop CM. 2006. in Information Science and Statistics.

5. Anxiety, depression, and decision making: a computational perspective;Bishop;Annual Review of Neuroscience,2018

Cited by 38 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Toward a computational role for locus coeruleus/norepinephrine arousal systems;Current Opinion in Behavioral Sciences;2024-10

2. Surprising sounds influence risky decision making;Nature Communications;2024-09-13

3. Testing the convergent validity, domain generality, and temporal stability of selected measures of people’s tendency to explore;Nature Communications;2024-09-04

4. Dopamine reveals adaptive learning of actions representation;2024-07-29

5. Exploration–Exploitation Mechanisms in Recurrent Neural Networks and Human Learners in Restless Bandit Problems;Computational Brain & Behavior;2024-05-24