Harnessing the flexibility of neural networks to predict dynamic theoretical parameters underlying human choice behavior-Reference-Cited by-同舟云学术

Harnessing the flexibility of neural networks to predict dynamic theoretical parameters underlying human choice behavior

Published:2024-01-04 Issue:1 Volume:20 Page:e1011678
ISSN:1553-7358
Container-title:PLOS Computational Biology
language:en
Short-container-title:PLoS Comput Biol

Author:

Ger Yoav^ORCID,Nachmani Eliya,Wolf Lior,Shahar Nitzan^ORCID

Abstract

Reinforcement learning (RL) models are used extensively to study human behavior. These rely on normative models of behavior and stress interpretability over predictive capabilities. More recently, neural network models have emerged as a descriptive modeling paradigm that is capable of high predictive power yet with limited interpretability. Here, we seek to augment the expressiveness of theoretical RL models with the high flexibility and predictive power of neural networks. We introduce a novel framework, which we term theoretical-RNN (t-RNN), whereby a recurrent neural network is trained to predict trial-by-trial behavior and to infer theoretical RL parameters using artificial data of RL agents performing a two-armed bandit task. In three studies, we then examined the use of our approach to dynamically predict unseen behavior along with time-varying theoretical RL parameters. We first validate our approach using synthetic data with known RL parameters. Next, as a proof-of-concept, we applied our framework to two independent datasets of humans performing the same task. In the first dataset, we describe differences in theoretical RL parameters dynamic among clinical psychiatric vs. healthy controls. In the second dataset, we show that the exploration strategies of humans varied dynamically in response to task phase and difficulty. For all analyses, we found better performance in the prediction of actions for t-RNN compared to the stationary maximum-likelihood RL method. We discuss the use of neural networks to facilitate the estimation of latent RL parameters underlying choice behavior.

Funder

Israel Science Foundation

Tel Aviv University Center for AI and Data Science

the Israeli Science Foundation

Publisher

Public Library of Science (PLoS)

Reference49 articles.

1. Trial-by-trial data analysis using computational models;ND Daw;Decision making, affect, and learning: Attention and performance XXIII,2011

2. Ten simple rules for the computational modeling of behavioral data;RC Wilson;Elife,2019

3. The interpretation of computational model parameters depends on the context;MK Eckstein;Elife,2022

4. A neural substrate of prediction and reward;W Schultz;Science,1997

5. Model-based influences on humans’ choices and striatal prediction errors;ND Daw;Neuron,2011

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploration–Exploitation Mechanisms in Recurrent Neural Networks and Human Learners in Restless Bandit Problems;Computational Brain & Behavior;2024-05-24

2. Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts;PLOS Computational Biology;2024-03-29