Your browser doesn't support javascript.
loading
Does phasic dopamine release cause policy updates?
Carter, Francis; Cossette, Marie-Pierre; Trujillo-Pisanty, Ivan; Pallikaras, Vasilios; Breton, Yannick-André; Conover, Kent; Caplan, Jill; Solis, Pavel; Voisard, Jacques; Yaksich, Alexandra; Shizgal, Peter.
Afiliação
  • Carter F; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Cossette MP; Montreal Institute for Learning Algorithms, Université de Montréal, Montreal, Quebec, Canada.
  • Trujillo-Pisanty I; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Pallikaras V; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Breton YA; Department of Psychology, Langara College, Vancouver, British Columbia, Canada.
  • Conover K; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Caplan J; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Solis P; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Voisard J; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Yaksich A; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
  • Shizgal P; Department of Psychology, Concordia University, Montreal, Quebec, Canada.
Eur J Neurosci ; 59(6): 1260-1277, 2024 Mar.
Article em En | MEDLINE | ID: mdl-38039083
ABSTRACT
Phasic dopamine activity is believed to both encode reward-prediction errors (RPEs) and to cause the adaptations that these errors engender. If so, a rat working for optogenetic stimulation of dopamine neurons will repeatedly update its policy and/or action values, thus iteratively increasing its work rate. Here, we challenge this view by demonstrating stable, non-maximal work rates in the face of repeated optogenetic stimulation of midbrain dopamine neurons. Furthermore, we show that rats learn to discriminate between world states distinguished only by their history of dopamine activation. Comparison of these results to reinforcement learning simulations suggests that the induced dopamine transients acted more as rewards than RPEs. However, pursuit of dopaminergic stimulation drifted upwards over a time scale of days and weeks, despite its stability within trials. To reconcile the results with prior findings, we consider multiple roles for dopamine signalling.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Dopamina / Aprendizagem Limite: Animals Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Dopamina / Aprendizagem Limite: Animals Idioma: En Ano de publicação: 2024 Tipo de documento: Article