Learning at variable attentional load requires cooperation between working memory, meta-learning and attention-augmented reinforcement learning-Reference-Cited by-同舟云学术

Learning at variable attentional load requires cooperation between working memory, meta-learning and attention-augmented reinforcement learning

Published:2020-09-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Womelsdorf Thilo^ORCID,Watson Marcus R.,Tiesinga Paul^ORCID

Abstract

AbstractFlexible learning of changing reward contingencies can be realized with different strategies. A fast learning strategy involves using working memory of recently rewarded objects to guide choices. A slower learning strategy uses prediction errors to gradually update value expectations to improve choices. How the fast and slow strategies work together in scenarios with real-world stimulus complexity is not well known. Here, we disentangle their relative contributions in rhesus monkeys while they learned the relevance of object features at variable attentional load. We found that learning behavior across six subjects is consistently best predicted with a model combining (i) fast working memory (ii) slower reinforcement learning from differently weighted positive and negative prediction errors, as well as (iii) selective suppression of non-chosen feature values and (iv) a meta-learning mechanism adjusting exploration rates based on a memory trace of recent errors. These mechanisms cooperate differently at low and high attentional loads. While working memory was essential for efficient learning at lower attentional loads, enhanced weighting of negative prediction errors and meta-learning were essential for efficient learning at higher attentional loads. Together, these findings pinpoint a canonical set of learning mechanisms and demonstrate how they cooperate when subjects flexibly adjust to environments with variable real-world attentional demands.Significance statementLearning which visual features are relevant for achieving our goals is challenging in real-world scenarios with multiple distracting features and feature dimensions. It is known that in such scenarios learning benefits significantly from attentional prioritization. Here we show that beyond attention, flexible learning uses a working memory system, a separate learning gain for avoiding negative outcomes, and a meta-learning process that adaptively increases exploration rates whenever errors accumulate. These subcomponent processes of cognitive flexibility depend on distinct learning signals that operate at varying timescales, including the most recent reward outcome (for working memory), memories of recent outcomes (for adjusting exploration), and reward prediction errors (for attention augmented reinforcement learning). These results illustrate the specific mechanisms that cooperate during cognitive flexibility.

Publisher

Cold Spring Harbor Laboratory

Reference68 articles.

1. Hierarchical Error Representation: A Computational Model of Anterior Cingulate and Dorsolateral Prefrontal Cortex

2. Averbeck BB (2017) Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning. IEEE Symposium Series on Computational Intelligence (SSCI):1–5.

3. Rostrolateral Prefrontal Cortex and Individual Differences in Uncertainty-Driven Exploration

4. Attentional Selection Can Be Predicted by Reinforcement Learning of Task-relevant Stimulus Features Weighted by Value-independent Stickiness;J Cogn Neurosci,2016

5. The Meaning of Behavior: Discriminating Reflex and Volition in the Brain

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. How Working Memory and Reinforcement Learning Are Intertwined: A Cognitive, Neural, and Computational Perspective;Journal of Cognitive Neuroscience;2021-12-23

2. A Kiosk Station for the Assessment of Multiple Cognitive Domains and Cognitive Enrichment of Monkeys;Frontiers in Behavioral Neuroscience;2021-08-26

3. Gains and Losses affect Learning Differentially at Low and High Attentional Load;2020-09-02