Habits, action sequences and reinforcement learning.
Eur J Neurosci; 35(7): 1036-51, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22487034
It is now widely accepted that instrumental actions can be either goal-directed or habitual: whereas the former are rapidly acquired and regulated by their outcomes, the latter are reflexive, elicited by antecedent stimuli rather than by their consequences. Model-based reinforcement learning (RL) provides an elegant description of goal-directed action. Through exposure to states, actions and rewards, the agent rapidly constructs a model of the world and can choose an appropriate action in response to quite abstract changes in environmental and evaluative demands. This model is powerful but has difficulty explaining the development of habitual actions. To account for habits, theorists have argued that a second action controller is required, called model-free RL, which does not form a model of the world but instead caches action values within states, allowing a state to select an action on the basis of its reward history rather than its consequences. Nevertheless, there are persistent problems with important predictions of this model; most notably, model-free RL fails to correctly predict the insensitivity of habitual actions to changes in the action-reward contingency. Here, we suggest that introducing model-free RL into instrumental conditioning is unnecessary, and demonstrate that reconceptualizing habits as action sequences allows model-based RL to be applied to both goal-directed and habitual actions in a manner consistent with what real animals do. This approach has significant implications for the way habits are currently investigated and generates new experimental predictions.
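To make the abstract's distinction concrete, the sketch below contrasts a cached, model-free update with a model-based one-step lookahead in tabular form. It is a minimal illustration of the general RL framework the abstract describes, not the authors' model: the actions, parameters, and function names are hypothetical, and the paper's own proposal (habits as action sequences under model-based RL) is not implemented here.

from collections import defaultdict

# Illustrative tabular sketch of the two controllers the abstract contrasts.
# Actions, parameters, and the value function are hypothetical assumptions.

ALPHA, GAMMA = 0.1, 0.9
ACTIONS = ["press_lever", "pull_chain"]

# --- Model-free controller: caches one value per (state, action) pair. ---
Q = defaultdict(float)

def model_free_update(state, action, reward, next_state):
    """One-step Q-learning: the cached value is shaped by reward history,
    with no representation of the action's consequences."""
    best_next = max(Q[(next_state, a)] for a in ACTIONS)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])

def model_free_choice(state):
    """Select the action with the highest cached value in this state."""
    return max(ACTIONS, key=lambda a: Q[(state, a)])

# --- Model-based controller: learns a world model, then plans over it. ---
T = defaultdict(lambda: defaultdict(int))  # (state, action) -> next-state counts
R = defaultdict(float)                     # (state, action) -> mean reward
N = defaultdict(int)                       # (state, action) -> visit count

def model_update(state, action, reward, next_state):
    """Update transition counts and the running mean reward from experience."""
    T[(state, action)][next_state] += 1
    N[(state, action)] += 1
    R[(state, action)] += (reward - R[(state, action)]) / N[(state, action)]

def model_based_choice(state, value):
    """One-step lookahead: evaluate each action by its predicted consequences
    under the learned model, using a caller-supplied state-value function."""
    def q(a):
        counts = T[(state, a)]
        total = sum(counts.values()) or 1
        expected_next = sum(c / total * value(s2) for s2, c in counts.items())
        return R[(state, a)] + GAMMA * expected_next
    return max(ACTIONS, key=q)

The key contrast is that model_free_choice consults only cached values accumulated from past rewards, whereas model_based_choice re-evaluates each action's consequences through the learned transition model, so a change in outcome value or in the action-reward contingency can shift its choice immediately.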
Database: MEDLINE
Main subject: Reinforcement, Psychology / Goals / Habits / Learning
Study type: Clinical_trials / Prognostic_studies
Limits: Animals / Humans
Language: En
Publication year: 2012
Document type: Article