Abstract
AbstractReal-world choice options have many features or attributes, whereas the reward outcome from those options only depends on a few features/attributes. It is currently unclear how humans learn and make decisions when multiple features and conjunctions of features are predictive of reward, and moreover, how selective attention contributes to these processes. Here, we examined human behavior during a three-dimensional learning task in which reward outcomes for different stimuli could be predicted based on a combination of an informative feature and conjunction. Using multiple approaches, we found that choice behavior and reward probabilities estimated by participants were best described by a model that learned the predictive values of both the informative feature and the informative conjunction. Moreover, attention was controlled by the difference in these values in a cooperative manner such that attention depended on the integrated feature and conjunction values, and the resulting attention weights modulated learning by increasing the learning rate on attended features and conjunctions. However, there was little effect of attention on decision making. These results suggest that in high-dimensional environments, humans direct their attention not only to selectively process reward-predictive attributes, but also to find parsimonious representations of the reward contingencies to achieve more efficient learning.
Publisher
Cold Spring Harbor Laboratory
Reference50 articles.
1. Acerbi, L. , & Ma, W. J . (2017). Practical Bayesian optimization for model fitting with bayesian adaptive direct search. Advances in Neural Information Processing Systems, 30.
2. Neural Mechanisms of Credit Assignment in a Multicue Environment
3. Value-driven attentional capture
4. Hippocampal pattern separation supports reinforcement learning;Nature Communications,2019
5. Learning the value of information in an uncertain world
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献