Simple Plans or Sophisticated Habits? State, Transition and Learning Interactions in the Two-Step Task.

Akam, Thomas; Costa, Rui; Dayan, Peter

Akam, Thomas; Costa, Rui; Dayan, Peter.

Afiliação

Akam T; Champalimaud Neuroscience Program, Champalimaud Centre for the Unknown, Lisbon, Portugal.
Costa R; Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.
Dayan P; Champalimaud Neuroscience Program, Champalimaud Centre for the Unknown, Lisbon, Portugal.

PLoS Comput Biol ; 11(12): e1004648, 2015 Dec.

Article em En | MEDLINE | ID: mdl-26657806

ABSTRACT

ABSTRACT

The recently developed 'two-step' behavioural task promises to differentiate model-based from model-free reinforcement learning, while generating neurophysiologically-friendly decision datasets with parametric variation of decision variables. These desirable features have prompted its widespread adoption. Here, we analyse the interactions between a range of different strategies and the structure of transitions and outcomes in order to examine constraints on what can be learned from behavioural performance. The task involves a trade-off between the need for stochasticity, to allow strategies to be discriminated, and a need for determinism, so that it is worth subjects' investment of effort to exploit the contingencies optimally. We show through simulation that under certain conditions model-free strategies can masquerade as being model-based. We first show that seemingly innocuous modifications to the task structure can induce correlations between action values at the start of the trial and the subsequent trial events in such a way that analysis based on comparing successive trials can lead to erroneous conclusions. We confirm the power of a suggested correction to the analysis that can alleviate this problem. We then consider model-free reinforcement learning strategies that exploit correlations between where rewards are obtained and which actions have high expected value. These generate behaviour that appears model-based under these, and also more sophisticated, analyses. Exploiting the full potential of the two-step task as a tool for behavioural neuroscience requires an understanding of these issues.

Assuntos

Comportamento de Escolha/fisiologia; Hábitos; Modelos Neurológicos; Modelos Estatísticos; Reforço Psicológico; Reversão de Aprendizagem/fisiologia; Análise e Desempenho de Tarefas; Simulação por Computador; Humanos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Reforço Psicológico / Reversão de Aprendizagem / Análise e Desempenho de Tarefas / Comportamento de Escolha / Modelos Estatísticos / Hábitos / Modelos Neurológicos Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: PLoS Comput Biol Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2015 Tipo de documento: Article País de afiliação: Portugal

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google