Examinations of Biases by Model Misspecification and Parameter Reliability of Reinforcement Learning Models

Published:2023-06-21 Issue:4 Volume:6 Page:651-670
ISSN:2522-0861
Container-title:Computational Brain & Behavior
language:en
Short-container-title:Comput Brain Behav

Author:

Toyama Asako, Katahira Kentaro, Kunisato Yoshihiko

Abstract

Reinforcement learning models have the potential to clarify meaningful individual differences in the decision-making process. This study focused on two aspects regarding the nature of a reinforcement learning model and its parameters: the problems of model misspecification and reliability. Online participants, N = 453, completed self-report measures and a probabilistic learning task twice 1.5 months apart, and data from the task were fitted using several reinforcement learning models. To address the problem of model misspecification, we compared the models with and without the influence of choice history, or perseveration. Results showed that the lack of a perseveration term in the model led to a decrease in learning rates for win and loss outcomes, with slightly different influences depending on outcome volatility, and increases in inverse temperature. We also conducted simulations to examine the mechanism of the observed biases and revealed that failure to incorporate perseveration directly affected the estimation bias in the learning rate and indirectly affected that in inverse temperature. Furthermore, in both model fittings and model simulations, the lack of perseveration caused win-stay probability underestimation and loss-shift probability overestimation. We also assessed the parameter reliability. Test–retest reliabilities were poor (learning rates) to moderate (inverse temperature and perseveration magnitude). A learning effect was noted in the inverse temperature and perseveration magnitude parameters, showing an increment of the estimates in the second session. We discuss possible misinterpretations of results and limitations considering the estimation biases and parameter reliability.
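The quantities named in the abstract (separate learning rates for win and loss outcomes, inverse temperature, perseveration magnitude, and win-stay / loss-shift probabilities) can be illustrated with a small simulation. This is only a minimal sketch under stated assumptions, not the authors' model or task: it assumes a two-option task with fixed reward probabilities, outcomes coded 1 (win) and 0 (loss), a softmax choice rule, and a decaying choice-trace form of perseveration. All function and parameter names (`simulate`, `alpha_win`, `alpha_loss`, `beta`, `phi`, `tau`) are hypothetical.

```python
import math
import random


def softmax_choice_prob(q, c, beta, phi):
    """Probability of choosing option 0 from values q and choice traces c.

    beta is the inverse temperature; phi is the perseveration magnitude.
    """
    v0 = beta * q[0] + phi * c[0]
    v1 = beta * q[1] + phi * c[1]
    return 1.0 / (1.0 + math.exp(-(v0 - v1)))


def simulate(n_trials=5000, alpha_win=0.4, alpha_loss=0.2, beta=3.0,
             phi=1.0, tau=0.5, p_reward=(0.7, 0.3), seed=0):
    """Simulate a Q-learning agent with a perseveration term (assumed form)."""
    rng = random.Random(seed)
    q = [0.0, 0.0]  # action values
    c = [0.0, 0.0]  # choice traces (recent choice history)
    choices, rewards = [], []
    for _ in range(n_trials):
        p0 = softmax_choice_prob(q, c, beta, phi)
        a = 0 if rng.random() < p0 else 1
        r = 1 if rng.random() < p_reward[a] else 0
        # Asymmetric learning rates: one for wins, one for losses.
        alpha = alpha_win if r == 1 else alpha_loss
        q[a] += alpha * (r - q[a])
        # Chosen option's trace moves toward 1, the other decays toward 0.
        for k in range(2):
            target = 1.0 if k == a else 0.0
            c[k] += tau * (target - c[k])
        choices.append(a)
        rewards.append(r)
    return choices, rewards


def win_stay_lose_shift(choices, rewards):
    """Return (P(stay | previous win), P(shift | previous loss))."""
    ws_n = ws_d = ls_n = ls_d = 0
    for t in range(1, len(choices)):
        stayed = choices[t] == choices[t - 1]
        if rewards[t - 1] == 1:
            ws_d += 1
            ws_n += int(stayed)
        else:
            ls_d += 1
            ls_n += int(not stayed)
    return ws_n / ws_d, ls_n / ls_d


if __name__ == "__main__":
    for phi in (0.0, 3.0):
        ws, ls = win_stay_lose_shift(*simulate(phi=phi))
        print(f"phi={phi}: win-stay={ws:.3f}, loss-shift={ls:.3f}")
```

In this sketch, raising `phi` makes the simulated agent repeat its previous choice more often regardless of outcome, which raises win-stay and lowers loss-shift. That is the kind of behavioral signature a model without a perseveration term cannot attribute to choice history and must instead absorb into its other parameters.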

Funder

Japan Society for the Promotion of Science

Publisher

Springer Science and Business Media LLC

Subject

Developmental and Educational Psychology, Neuropsychology and Physiological Psychology

Link

https://link.springer.com/content/pdf/10.1007/s42113-023-00175-4.pdf

References: 54 articles.

1. Akaishi, R., Umeda, K., Nagase, A., & Sakai, K. (2014). Autonomous mechanism of internal choice estimate underlies decision inertia. Neuron, 81(1), 195–206. https://doi.org/10.1016/j.neuron.2013.10.018

2. Ballard, I. C., & McClure, S. M. (2019). Joint modeling of reaction times and choice improves parameter identifiability in reinforcement learning models. Journal of Neuroscience Methods, 317, 37–44. https://doi.org/10.1016/j.jneumeth.2019.01.006

3. Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01

4. Behrens, T. E., Woolrich, M. W., Walton, M. E., & Rushworth, M. F. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10(9), 1214–1221. https://doi.org/10.1038/nn1954

5. Bishop, C. M. (2006). Pattern recognition and machine learning. Springer.

Cited by 3 articles.

1. Computational phenotyping of aberrant belief updating in individuals with schizotypal traits and schizophrenia;Biological Psychiatry;2024-08

2. Does the reliability of computational models truly improve with hierarchical modeling? Some recommendations and considerations for the assessment of model parameter reliability;Psychonomic Bulletin & Review;2024-05-08

3. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts;PLOS Computational Biology;2024-03-29