Search | VHL Regional Portal

A probabilistic successor representation for context-dependent learning.

Geerts, Jesse P; Gershman, Samuel J; Burgess, Neil; Stachenfeld, Kimberly L.

Psychol Rev ; 131(2): 578-597, 2024 Mar.

Article in English | MEDLINE | ID: mdl-37166847

ABSTRACT

Two of the main impediments to learning complex tasks are that relationships between different stimuli, including rewards, can be uncertain and context-dependent. Reinforcement learning (RL) provides a framework for learning, by predicting total future reward directly (model-free RL), or via predictions of future states (model-based RL). Within this framework, "successor representation" (SR) predicts total future occupancy of all states. A recent theoretical proposal suggests that the hippocampus encodes the SR in order to facilitate prediction of future reward. However, this proposal does not take into account how learning should adapt under uncertainty and switches of context. Here, we introduce a theory of learning SRs using prediction errors which includes optimally balancing uncertainty in new observations versus existing knowledge. We then generalize that approach to a multicontext setting, allowing the model to learn and maintain multiple task-specific SRs and infer which one to use at any moment based on the accuracy of its predictions. Thus, the context used for predictions can be determined by both the contents of the states themselves and the distribution of transitions between them. This probabilistic SR model captures animal behavior in tasks which require contextual memory and generalization, and unifies previous SR theory with hippocampal-dependent contextual decision-making. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

Subject(s)

Learning; Reinforcement, Psychology; Animals; Humans; Reward; Uncertainty; Generalization, Psychological

Rapid learning of predictive maps with STDP and theta phase precession.

George, Tom M; de Cothi, William; Stachenfeld, Kimberly L; Barry, Caswell.

Elife ; 12, 2023 03 16.

Article in English | MEDLINE | ID: mdl-36927826

ABSTRACT

The predictive map hypothesis is a promising candidate principle for hippocampal function. A favoured formalisation of this hypothesis, called the successor representation, proposes that each place cell encodes the expected state occupancy of its target location in the near future. This predictive framework is supported by behavioural as well as electrophysiological evidence and has desirable consequences for both the generalisability and efficiency of reinforcement learning algorithms. However, it is unclear how the successor representation might be learnt in the brain. Error-driven temporal difference learning, commonly used to learn successor representations in artificial agents, is not known to be implemented in hippocampal networks. Instead, we demonstrate that spike-timing dependent plasticity (STDP), a form of Hebbian learning, acting on temporally compressed trajectories known as 'theta sweeps', is sufficient to rapidly learn a close approximation to the successor representation. The model is biologically plausible - it uses spiking neurons modulated by theta-band oscillations, diffuse and overlapping place cell-like state representations, and experimentally matched parameters. We show how this model maps onto known aspects of hippocampal circuitry and explains substantial variance in the temporal difference successor matrix, consequently giving rise to place cells that demonstrate experimentally observed successor representation-related phenomena including backwards expansion on a 1D track and elongation near walls in 2D. Finally, our model provides insight into the observed topographical ordering of place field sizes along the dorsal-ventral axis by showing this is necessary to prevent the detrimental mixing of larger place fields, which encode longer timescale successor representations, with more fine-grained predictions of spatial location.

Subject(s)

Hippocampus; Neurons; Neurons/physiology; Hippocampus/physiology; Reinforcement, Psychology; Behavior Therapy; Algorithms; Theta Rhythm/physiology; Models, Neurological; Action Potentials/physiology
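
The abstract above refers to error-driven temporal difference (TD) learning as the usual way artificial agents learn a successor representation. The following is a minimal sketch of that baseline only, not of the STDP model the paper proposes. The tabular setting, the function name `td_successor_representation`, the circular eight-state track with a fixed rightward policy, and the values of `gamma` and `alpha` are all illustrative assumptions, not taken from the source.

```python
import numpy as np

def td_successor_representation(transitions, n_states, gamma=0.9, alpha=0.1):
    """Tabular TD(0) estimate of the successor representation (SR).

    `transitions` is an iterable of (state, next_state) index pairs.
    Row s of the result approximates the expected discounted future
    occupancy of every state when starting from s.
    """
    M = np.eye(n_states)  # initial guess: each state predicts only itself
    for s, s_next in transitions:
        one_hot = np.zeros(n_states)
        one_hot[s] = 1.0
        # TD error: current occupancy plus discounted prediction from the
        # next state, minus the current prediction for this state.
        M[s] += alpha * (one_hot + gamma * M[s_next] - M[s])
    return M

# Hypothetical toy environment: a circular track walked rightward forever.
n_states, gamma = 8, 0.9
transitions = [(t % n_states, (t + 1) % n_states) for t in range(40_000)]
M_td = td_successor_representation(transitions, n_states, gamma=gamma)

# Closed-form SR for the same policy, M = (I - gamma * T)^-1, for comparison.
T = np.roll(np.eye(n_states), 1, axis=1)  # T[i, i+1] = 1 (wrapping around)
M_exact = np.linalg.inv(np.eye(n_states) - gamma * T)
```

Under this directed policy each row of `M_td` weights upcoming states more heavily than states just passed, so each column (the toy analogue of a place field) extends backwards along the direction of travel. That is a simplified version of the asymmetry the abstract describes for a 1D track.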

Compositional Sequence Generation in the Entorhinal-Hippocampal System.

McNamee, Daniel C; Stachenfeld, Kimberly L; Botvinick, Matthew M; Gershman, Samuel J.

Entropy (Basel) ; 24(12), 2022 Dec 08.

Article in English | MEDLINE | ID: mdl-36554196

ABSTRACT

Neurons in the medial entorhinal cortex exhibit multiple, periodically organized, firing fields which collectively appear to form an internal representation of space. Neuroimaging data suggest that this grid coding is also present in other cortical areas such as the prefrontal cortex, indicating that it may be a general principle of neural functionality in the brain. In a recent analysis through the lens of dynamical systems theory, we showed how grid coding can lead to the generation of a diversity of empirically observed sequential reactivations of hippocampal place cells corresponding to traversals of cognitive maps. Here, we extend this sequence generation model by describing how the synthesis of multiple dynamical systems can support compositional cognitive computations. To empirically validate the model, we simulate two experiments demonstrating compositionality in space or in time during sequence generation. Finally, we describe several neural network architectures supporting various types of compositionality based on grid coding and highlight connections to recent work in machine learning leveraging analogous techniques.

Flexible modulation of sequence generation in the entorhinal-hippocampal system.

McNamee, Daniel C; Stachenfeld, Kimberly L; Botvinick, Matthew M; Gershman, Samuel J.

Nat Neurosci ; 24(6): 851-862, 2021 06.

Article in English | MEDLINE | ID: mdl-33846626

ABSTRACT

Exploration, consolidation and planning depend on the generation of sequential state representations. However, these algorithms require disparate forms of sampling dynamics for optimal performance. We theorize how the brain should adapt internally generated sequences for particular cognitive functions and propose a neural mechanism by which this may be accomplished within the entorhinal-hippocampal circuit. Specifically, we demonstrate that the systematic modulation along the medial entorhinal cortex dorsoventral axis of grid population input into the hippocampus facilitates a flexible generative process that can interpolate between qualitatively distinct regimes of sequential hippocampal reactivations. By relating the emergent hippocampal activity patterns drawn from our model to empirical data, we explain and reconcile a diversity of recently observed, but apparently unrelated, phenomena such as generative cycling, diffusive hippocampal reactivations and jumping trajectory events.

Subject(s)

Entorhinal Cortex/physiology; Hippocampus/physiology; Nerve Net/physiology; Neural Networks, Computer; Animals; Humans

A general model of hippocampal and dorsal striatal learning and decision making.

Geerts, Jesse P; Chersi, Fabian; Stachenfeld, Kimberly L; Burgess, Neil.

Proc Natl Acad Sci U S A ; 117(49): 31427-31437, 2020 12 08.

Article in English | MEDLINE | ID: mdl-33229541

ABSTRACT

Humans and other animals use multiple strategies for making decisions. Reinforcement-learning theory distinguishes between stimulus-response (model-free; MF) learning and deliberative (model-based; MB) planning. The spatial-navigation literature presents a parallel dichotomy between navigation strategies. In "response learning," associated with the dorsolateral striatum (DLS), decisions are anchored to an egocentric reference frame. In "place learning," associated with the hippocampus, decisions are anchored to an allocentric reference frame. Emerging evidence suggests that the contribution of hippocampus to place learning may also underlie its contribution to MB learning by representing relational structure in a cognitive map. Here, we introduce a computational model in which hippocampus subserves place and MB learning by learning a "successor representation" of relational structure between states; DLS implements model-free response learning by learning associations between actions and egocentric representations of landmarks; and action values from either system are weighted by the reliability of its predictions. We show that this model reproduces a range of seemingly disparate behavioral findings in spatial and nonspatial decision tasks and explains the effects of lesions to DLS and hippocampus on these tasks. Furthermore, modeling place cells as driven by boundaries explains the observation that, unlike navigation guided by landmarks, navigation guided by boundaries is robust to "blocking" by prior state-reward associations due to learned associations between place cells. Our model, originally shaped by detailed constraints in the spatial literature, successfully characterizes the hippocampal-striatal system as a general system for decision making via adaptive combination of stimulus-response learning and the use of a cognitive map.

Subject(s)

Corpus Striatum/physiology; Decision Making; Hippocampus/physiology; Learning; Models, Neurological; Adaptation, Physiological; Computer Simulation; Maze Learning; Spatial Memory; Task Performance and Analysis

What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior.

Behrens, Timothy E J; Muller, Timothy H; Whittington, James C R; Mark, Shirley; Baram, Alon B; Stachenfeld, Kimberly L; Kurth-Nelson, Zeb.

Neuron ; 100(2): 490-509, 2018 10 24.

Article in English | MEDLINE | ID: mdl-30359611

ABSTRACT

It is proposed that a cognitive map encoding the relationships between entities in the world supports flexible behavior, but the majority of the neural evidence for such a system comes from studies of spatial navigation. Recent work describing neuronal parallels between spatial and non-spatial behaviors has rekindled the notion of a systematic organization of knowledge across multiple domains. We review experimental evidence and theoretical frameworks that point to principles unifying these apparently disparate functions. These principles describe how to learn and use abstract, generalizable knowledge and suggest that map-like representations observed in a spatial context may be an instance of general coding mechanisms capable of organizing knowledge of all kinds. We highlight how artificial agents endowed with such principles exhibit flexible behavior and learn map-like representations observed in the brain. Finally, we speculate on how these principles may offer insight into the extreme generalizations, abstractions, and inferences that characterize human cognition.

Subject(s)

Brain/physiology; Mental Processes/physiology; Models, Neurological; Humans

Author Correction: The hippocampus as a predictive map.

Stachenfeld, Kimberly L; Botvinick, Matthew M; Gershman, Samuel J.

Nat Neurosci ; 21(6): 895, 2018 Jun.

Article in English | MEDLINE | ID: mdl-29695823

ABSTRACT

In the version of this article initially published, equation (7) read.

The hippocampus as a predictive map.

Stachenfeld, Kimberly L; Botvinick, Matthew M; Gershman, Samuel J.

Nat Neurosci ; 20(11): 1643-1653, 2017 Nov.

Article in English | MEDLINE | ID: mdl-28967910

ABSTRACT

A cognitive map has long been the dominant metaphor for hippocampal function, embracing the idea that place cells encode a geometric representation of space. However, evidence for predictive coding, reward sensitivity and policy dependence in place cells suggests that the representation is not purely spatial. We approach this puzzle from a reinforcement learning perspective: what kind of spatial representation is most useful for maximizing future reward? We show that the answer takes the form of a predictive representation. This representation captures many aspects of place cell responses that fall outside the traditional view of a cognitive map. Furthermore, we argue that entorhinal grid cells encode a low-dimensionality basis set for the predictive representation, useful for suppressing noise in predictions and extracting multiscale structure for hierarchical planning.

Subject(s)

Brain Mapping/methods; Hippocampus/physiology; Learning/physiology; Markov Chains; Psychomotor Performance/physiology; Reinforcement, Psychology; Animals; Humans; Mice
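
The abstract above describes a predictive representation that is useful for maximizing future reward, together with a low-dimensional basis set for it. As a hedged illustration, the sketch below uses the standard tabular form of the successor representation, M = (I - γT)^-1, from which state values follow as V = M R. The circular random-walk environment, the reward placement, the discount `gamma`, and all variable names are assumptions made for this example and are not taken from the record.

```python
import numpy as np

n_states, gamma = 20, 0.95

# Hypothetical environment: unbiased random walk on a circular track.
T = np.zeros((n_states, n_states))
for s in range(n_states):
    T[s, (s - 1) % n_states] = 0.5
    T[s, (s + 1) % n_states] = 0.5

# Successor representation: expected discounted future state occupancy.
M = np.linalg.inv(np.eye(n_states) - gamma * T)

# With the SR in hand, value is a single matrix-vector product.
reward = np.zeros(n_states)
reward[5] = 1.0  # illustrative reward location
V = M @ reward

# T is symmetric here, so M is too, and eigh returns real eigenvectors
# (sorted by ascending eigenvalue). On a ring they are periodic modes.
eigvals, eigvecs = np.linalg.eigh(M)

def low_rank_sr(k):
    """Rebuild M from its k largest-eigenvalue components."""
    U, lam = eigvecs[:, -k:], eigvals[-k:]
    return (U * lam) @ U.T

errors = [np.linalg.norm(M - low_rank_sr(k)) for k in (1, 5, 10, n_states)]
```

In this toy setting the reconstruction error in `errors` shrinks as more eigenvectors are kept, which is one way to picture a low-dimensional basis capturing the smooth, large-scale structure of the predictive map first.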

Noradrenergic control of error perseveration in medial prefrontal cortex.

Caetano, Marcelo S; Jin, Lu E; Harenberg, Linda; Stachenfeld, Kimberly L; Arnsten, Amy F T; Laubach, Mark.

Front Integr Neurosci ; 6: 125, 2012.

Article in English | MEDLINE | ID: mdl-23293590

ABSTRACT

The medial prefrontal cortex (mPFC) plays a key role in behavioral variability, action monitoring, and inhibitory control. The functional role of mPFC may change over the lifespan due to a number of aging-related issues, including dendritic regression, increased cAMP signaling, and reductions in the efficacy of neuromodulators to influence mPFC processing. A key neurotransmitter in mPFC is norepinephrine. Previous studies have reported aging-related changes in the sensitivity of mPFC-dependent tasks to noradrenergic agonist drugs, such as guanfacine. Here, we assessed the effects of yohimbine, an alpha-2 noradrenergic antagonist, in cohorts of younger and older rats in a classic test of spatial working memory (using a T-maze). Older rats (23-29 mo.) were impaired by a lower dose of yohimbine compared to younger animals (5-10 mo.). To determine if the drug acts on alpha-2 noradrenergic receptors in mPFC and if its effects are specific to memory-guided performance, we made infusions of yohimbine into mPFC of a cohort of young rats (6 mo.) using an operant delayed response task. The task involved testing rats in blocks of trials with memory- and stimulus-guided performance. Yohimbine selectively impaired memory-guided performance and was associated with error perseveration. Infusions of muscimol (a GABA-A agonist) at the same sites also selectively impaired memory-guided performance, but did not lead to error perseveration. Based on these results, we propose several potential interpretations for the role for the noradrenergic system in the performance of delayed response tasks, including the encoding of previous response locations, task rules (i.e., using a win-stay strategy instead of a win-shift strategy), and performance monitoring (e.g., prospective encoding of outcomes).
