Your browser doesn't support javascript.
loading
A robotic model of hippocampal reverse replay for reinforcement learning.
Whelan, Matthew T; Jimenez-Rodriguez, Alejandro; Prescott, Tony J; Vasilaki, Eleni.
Afiliação
  • Whelan MT; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.
  • Jimenez-Rodriguez A; Sheffield Robotics, Sheffield, United Kingdom.
  • Prescott TJ; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.
  • Vasilaki E; Sheffield Robotics, Sheffield, United Kingdom.
Bioinspir Biomim ; 18(1)2022 12 02.
Article em En | MEDLINE | ID: mdl-36327454
ABSTRACT
Hippocampal reverse replay, a phenomenon in which recently active hippocampal cells reactivate in the reverse order, is thought to contribute to learning, particularly reinforcement learning (RL), in animals. Here, we present a novel computational model which exploits reverse replay to improve stability and performance on a homing task. The model takes inspiration from the hippocampal-striatal network, and learning occurs via a three-factor RL rule. To augment this model with hippocampal reverse replay, we derived a policy gradient learning rule that associates place-cell activity with responses in cells representing actions and a supervised learning rule of the same form, interpreting the replay activity as a 'target' frequency. We evaluated the model using a simulated robot spatial navigation task inspired by the Morris water maze. Results suggest that reverse replay can improve performance stability over multiple trials. Our model exploits reverse reply as an additional source for propagating information about desirable synaptic changes, reducing the requirements for long-time scales in eligibility traces combined with low learning rates. We conclude that reverse replay can positively contribute to RL, although less stable learning is possible in its absence. Analogously, we postulate that reverse replay may enhance RL in the mammalian hippocampal-striatal system rather than provide its core mechanism.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Robótica / Procedimentos Cirúrgicos Robóticos / Navegação Espacial Tipo de estudo: Prognostic_studies Limite: Animals Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Robótica / Procedimentos Cirúrgicos Robóticos / Navegação Espacial Tipo de estudo: Prognostic_studies Limite: Animals Idioma: En Ano de publicação: 2022 Tipo de documento: Article