Mice exhibit stochastic and efficient action switching during probabilistic decision making-Reference-Cited by-同舟云学术

Mice exhibit stochastic and efficient action switching during probabilistic decision making

Published:2021-05-14 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Beron Celia C.^ORCID,Neufeld Shay Q.^ORCID,Linderman Scott W.^ORCID,Sabatini Bernardo L.^ORCID

Abstract

AbstractIn probabilistic and nonstationary environments, individuals must use internal and external cues to flexibly make decisions that lead to desirable outcomes. To gain insight into the process by which animals choose between actions, we trained mice in a task with time-varying reward probabilities. In our implementation of such a “two-armed bandit” task, thirsty mice use information about recent action and action-outcome histories to choose between two ports that deliver water probabilistically. Here, we comprehensively modeled choice behavior in this task, including the trial-to-trial changes in port selection – i.e. action switching behavior. We find that mouse behavior is, at times, deterministic and, at others, apparently stochastic. The behavior deviates from that of a theoretically optimal agent performing Bayesian inference in a Hidden Markov Model (HMM). We formulate a set of models based on logistic regression, reinforcement learning, and ‘sticky’ Bayesian inference that we demonstrate are mathematically equivalent and that accurately describe mouse behavior. The switching behavior of mice in the task is captured in each model by a stochastic action policy, a history-dependent representation of action value, and a tendency to repeat actions despite incoming evidence. The models parsimoniously capture behavior across different environmental conditionals by varying the ‘stickiness’ parameter, and, like the mice, they achieve nearly maximal reward rates. These results indicate that mouse behavior reaches near-maximal performance with reduced action switching and can be described by a set of equivalent models with a small number of relatively fixed parameters.SignificanceTo obtain rewards in changing and uncertain environments, animals must adapt their behavior. We found that mouse choice and trial-to-trial switching behavior in a dynamic and probabilistic two-choice task could be modeled by equivalent theoretical, algorithmic, and descriptive models. These models capture components of evidence accumulation, choice history bias, and stochasticity in mouse behavior. Furthermore, they reveal that mice adapt their behavior in different environmental contexts by modulating their level of ‘stickiness’ to their previous choice. Despite deviating from the behavior of a theoretically ideal observer, the empirical models achieve comparable levels of near-maximal reward. These results make predictions to guide interrogation of the neural mechanisms underlying flexible decision-making strategies.

Publisher

Cold Spring Harbor Laboratory

Reference51 articles.

1. Psychological flexibility as a fundamental aspect of health

2. Demystifying cognitive flexibility: Implications for clinical and developmental neuroscience

3. Cognitive flexibility in neurological disorders: Cognitive components and event-related potentials;Neuroscience & Biobehavioral Reviews,2017

4. Representation of Action-Specific Reward Values in the Striatum

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mixture of Learning Strategies Underlies Rodent Behavior in Dynamic Foraging;2022-03-17

2. Impulsivity Relates to Multi-Trial Choice Strategy in Probabilistic Reversal Learning;Frontiers in Psychiatry;2022-03-14

3. Mice alternate between discrete strategies during perceptual decision-making;Nature Neuroscience;2022-02