The role of state uncertainty in the dynamics of dopamine.
Curr Biol
; 32(5): 1077-1087.e9, 2022 03 14.
Article
em En
| MEDLINE
| ID: mdl-35114098
Reinforcement learning models of the basal ganglia map the phasic dopamine signal to reward prediction errors (RPEs). Conventional models assert that, when a stimulus predicts a reward with fixed delay, dopamine activity during the delay should converge to baseline through learning. However, recent studies have found that dopamine ramps up before reward in certain conditions even after learning, thus challenging the conventional models. In this work, we show that sensory feedback causes an unbiased learner to produce RPE ramps. Our model predicts that when feedback gradually decreases during a trial, dopamine activity should resemble a "bump," whose ramp-up phase should, furthermore, be greater than that of conditions where the feedback stays high. We trained mice on a virtual navigation task with varying brightness, and both predictions were empirically observed. In sum, our theoretical and experimental results reconcile the seemingly conflicting data on dopamine behaviors under the RPE hypothesis.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Recompensa
/
Dopamina
Tipo de estudo:
Prognostic_studies
Limite:
Animals
Idioma:
En
Revista:
Curr Biol
Assunto da revista:
BIOLOGIA
Ano de publicação:
2022
Tipo de documento:
Article