Mice exhibit stochastic and efficient action switching during probabilistic decision making.

Beron, Celia C; Neufeld, Shay Q; Linderman, Scott W; Sabatini, Bernardo L

Beron, Celia C; Neufeld, Shay Q; Linderman, Scott W; Sabatini, Bernardo L.

Afiliação

Beron CC; Department of Neurobiology, Harvard Medical School, Boston, MA 02115.
Neufeld SQ; HHMI, Harvard Medical School, Boston, MA 02115.
Linderman SW; Department of Neurobiology, Harvard Medical School, Boston, MA 02115.
Sabatini BL; HHMI, Harvard Medical School, Boston, MA 02115.

Proc Natl Acad Sci U S A ; 119(15): e2113961119, 2022 04 12.

Article em En | MEDLINE | ID: mdl-35385355

ABSTRACT

ABSTRACT

In probabilistic and nonstationary environments, individuals must use internal and external cues to flexibly make decisions that lead to desirable outcomes. To gain insight into the process by which animals choose between actions, we trained mice in a task with time-varying reward probabilities. In our implementation of such a two-armed bandit task, thirsty mice use information about recent action and actionoutcome histories to choose between two ports that deliver water probabilistically. Here we comprehensively modeled choice behavior in this task, including the trial-to-trial changes in port selection, i.e., action switching behavior. We find that mouse behavior is, at times, deterministic and, at others, apparently stochastic. The behavior deviates from that of a theoretically optimal agent performing Bayesian inference in a hidden Markov model (HMM). We formulate a set of models based on logistic regression, reinforcement learning, and sticky Bayesian inference that we demonstrate are mathematically equivalent and that accurately describe mouse behavior. The switching behavior of mice in the task is captured in each model by a stochastic action policy, a history-dependent representation of action value, and a tendency to repeat actions despite incoming evidence. The models parsimoniously capture behavior across different environmental conditionals by varying the stickiness parameter, and like the mice, they achieve nearly maximal reward rates. These results indicate that mouse behavior reaches near-maximal performance with reduced action switching and can be described by a set of equivalent models with a small number of relatively fixed parameters.

Assuntos

Comportamento de Escolha; Tomada de Decisões; Camundongos; Animais; Camundongos/psicologia; Recompensa; Incerteza

Palavras-chave

Bayesian inference; decision making; exploreexploit; perseveration; stochastic choice

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Comportamento de Escolha / Tomada de Decisões / Camundongos Tipo de estudo: Prognostic_studies Limite: Animals Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google