Better Than Maximum Likelihood Estimation of Model-based and Model-free Learning Styles

Published: 2023-07-27

Author:

Yazdani Sadjad¹, Vahabie Abdol-Hossein¹, Nadjar-Araabi Babak¹, Ahmadabadi Majid Nili¹

Affiliation:

1. University of Tehran

Abstract

Various decision-making systems work together to shape human behavior. The goal-directed and habitual systems are the two most important of these, and reinforcement learning (RL) studies them through model-based (MB) and model-free (MF) learning styles, respectively. Human behavior resembles a combination of these two decision-making paradigms, which an RL framework represents as a weighted sum of the action values of the two styles. The weighting parameter is usually estimated by maximum likelihood (ML) or maximum a posteriori (MAP) estimation. In this study, we employ RL agents that combine MB and MF decision-making to perform the well-known Daw two-stage task. ML and MAP methods yield unreliable estimates of the weighting parameter, often with a large bias toward extreme values. We propose k-nearest neighbor estimation as a nonparametric alternative that reduces the estimation error, using a set of 20 features extracted from the behavior of the RL agent. In simulated experiments, the proposed method reduces both the bias and the variance of the estimation error. We also analyze human behavioral data from previous studies, where the proposed method predicts indices such as age, gender, IQ, gaze dwell time, and psychiatric disorder indices that the traditional method misses. In brief, the proposed method increases the reliability of the estimated parameters and enhances the applicability of reinforcement learning paradigms in clinical trials.
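The two ideas in the abstract, mixing MB and MF action values with a weight and recovering that weight with a k-nearest neighbor estimate, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The mixing convention `w * Q_MB + (1 - w) * Q_MF`, the softmax choice rule with inverse temperature `beta`, the Euclidean distance, the plain averaging over neighbors, and all function names are assumptions made for illustration; the abstract specifies only a weighted sum, a k-nearest neighbor estimate, and 20 behavioral features (the toy vectors in the usage example have two).

```python
import math


def hybrid_values(q_mb, q_mf, w):
    """Weighted sum of model-based and model-free action values.

    Assumed convention: w = 1 is purely model-based, w = 0 purely model-free.
    """
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    return [w * mb + (1.0 - w) * mf for mb, mf in zip(q_mb, q_mf)]


def softmax(values, beta=1.0):
    """Choice probabilities from action values (assumed choice rule)."""
    m = max(values)  # subtract the maximum for numerical stability
    exps = [math.exp(beta * (v - m)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def knn_estimate(features, train_features, train_weights, k=3):
    """Estimate w as the mean weight of the k nearest simulated agents.

    train_features holds hypothetical behavioral feature vectors of simulated
    agents whose true weights (train_weights) are known.
    """
    by_distance = sorted(
        (math.dist(features, f), w)
        for f, w in zip(train_features, train_weights)
    )
    nearest = by_distance[:k]
    return sum(w for _, w in nearest) / len(nearest)
```

A hypothetical usage with toy numbers: mix the two value vectors, turn them into choice probabilities, and estimate the weight of a new agent from its two nearest simulated neighbors.

```python
q = hybrid_values([1.0, 0.0], [0.0, 1.0], w=0.7)
probs = softmax(q, beta=2.0)
w_hat = knn_estimate(
    [0.0, 0.0],
    [[0.0, 0.1], [0.1, 0.0], [5.0, 5.0]],
    [0.2, 0.4, 0.9],
    k=2,
)
```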

Publisher

Research Square Platform LLC

References (40 articles)

1. Challenges and promises for translating computational tools into clinical practice;Ahn WY;Current Opinion in Behavioral Sciences,2016

2. Interactions among working memory, reinforcement learning, and effort in value-based choice: A new paradigm and selective deficits in schizophrenia;Collins AGE;Biological Psychiatry,2017

3. Reduced model-based decision-making in schizophrenia;Culbreth AJ;Journal of Abnormal Psychology,2016

4. Of goals and habits;Daw ND;Proceedings of the National Academy of Sciences,2015, 112(45), 13749–13750. https://doi.org/10.1073/pnas.1518488112

5. Model-based influences on humans’ choices and striatal prediction errors;Daw ND;Neuron,2011