Abstract
AbstractIn reinforcement-learning studies, the environment is typically object-based; that is, objects are predictive of a reward. Recently, studies also adopted rule-based environments in which stimulus dimensions are predictive of a reward. In the current study, we investigated how people learned (1) in an object-based environment, (2) following a switch to a rule-based environment, (3) following a switch to a different rule-based environment, and (4) following a switch back to an object-based environment. To do so, we administered a reinforcement-learning task comprising of four blocks with consecutively an object-based environment, a rule-based environment, another rule-based environment, and an object-based environment. Computational-modeling results suggest that people (1) initially adopt rule-based learning despite its suboptimal nature in an object-based environment, (2) learn rules after a switch to a rule-based environment, (3) experience interference from previously-learned rules following a switch to a different rule-based environment, and (4) learn objects after a final switch to an object-based environment. These results imply people have a hard time adjusting to switches between object-based and rule-based environments, although they do learn to do so.
Funder
Dutch National Science Foundation
Publisher
Springer Science and Business Media LLC
Subject
Developmental and Educational Psychology,Neuropsychology and Physiological Psychology
Reference28 articles.
1. Balcarras, M., & Womelsdorf, T. (2016). A flexible mechanism of rule selection enables rapid feature-based reinforcement learning. Frontiers in Neuroscience, 10, 125. https://doi.org/10.3389/fnins.2016.00125
2. Ballard, I., Miller, E. M., Piantadosi, S. T., Goodman, N. D., & Mcclure, S. M. (2018). Beyond reward prediction errors: Human striatum updates rule values during learning. Cerebral Cortex, 28, 3965–3975. https://doi.org/10.1093/cercor/bhx259
3. Best, C. A., Yim, H., & Sloutsky, V. M. (2013). The cost of selective attention in category learning: Developmental differences between adults and infants. Journal of Experimental Child Psychology, 116(2), 105–119. https://doi.org/10.1016/j.jecp.2013.05.002
4. Bröder, A., & Schiffer, S. (2006). Adaptive flexibility and maladaptive routines in selecting fast and frugal decision strategies. Journal of Experimental Psychology: Learning Memory and Cognition, 32(4), 904–918.
5. Choung, O. H., Lee, S. W., & Jeong, Y. (2017). Exploring feature dimensions to learn a new policy in an uninformed reinforcement learning task. Scientific Reports, 7(1), 1–12. https://doi.org/10.1038/s41598-017-17687-2