Time-scale invariant contingency yields one-shot reinforcement learning despite extremely long delays to reinforcement.

Gallistel, Charles R; Shahan, Timothy A

Gallistel, Charles R; Shahan, Timothy A.

Afiliação

Gallistel CR; Department of Psychology & Rutgers Center for Cognitive Sciences, Rutgers The State University of New Jersey, Piscataway, NJ 08854-8020.
Shahan TA; Department of Psychology, Utah State University, Logan, UT 84322-2810.

Proc Natl Acad Sci U S A ; 121(30): e2405451121, 2024 Jul 23.

Article em En | MEDLINE | ID: mdl-39008663

ABSTRACT

ABSTRACT

Reinforcement learning inspires much theorizing in neuroscience, cognitive science, machine learning, and AI. A central question concerns the conditions that produce the perception of a contingency between an action and reinforcement-the assignment-of-credit problem. Contemporary models of associative and reinforcement learning do not leverage the temporal metrics (measured intervals). Our information-theoretic approach formalizes contingency by time-scale invariant temporal mutual information. It predicts that learning may proceed rapidly even with extremely long action-reinforcer delays. We show that rats can learn an action after a single reinforcement, even with a 16-min delay between the action and reinforcement (15-fold longer than any delay previously shown to support such learning). By leveraging metric temporal information, our solution obviates the need for windows of associability, exponentially decaying eligibility traces, microstimuli, or distributions over Bayesian belief states. Its three equations have no free parameters; they predict one-shot learning without iterative simulation.

Assuntos

Reforço Psicológico; Animais; Ratos; Aprendizagem/fisiologia; Fatores de Tempo; Teorema de Bayes

Palavras-chave

assignment-of-credit; instrumental learning; mutual information; operant conditioning

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Reforço Psicológico Limite: Animals Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2024 Tipo de documento: Article País de publicação: Estados Unidos

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google