Learning and forgetting using reinforced Bayesian change detection.

Moens, Vincent; Zénon, Alexandre

Moens, Vincent; Zénon, Alexandre.

Afiliación

Moens V; CoAction Lab, Institue of Neuroscience, Université Catholique de Louvain, Bruxelles, Belgium.
Zénon A; CoAction Lab, Institue of Neuroscience, Université Catholique de Louvain, Bruxelles, Belgium.

PLoS Comput Biol ; 15(4): e1006713, 2019 04.

Article en En | MEDLINE | ID: mdl-30995214

RESUMEN

Agents living in volatile environments must be able to detect changes in contingencies while refraining to adapt to unexpected events that are caused by noise. In Reinforcement Learning (RL) frameworks, this requires learning rates that adapt to past reliability of the model. The observation that behavioural flexibility in animals tends to decrease following prolonged training in stable environment provides experimental evidence for such adaptive learning rates. However, in classical RL models, learning rate is either fixed or scheduled and can thus not adapt dynamically to environmental changes. Here, we propose a new Bayesian learning model, using variational inference, that achieves adaptive change detection by the use of Stabilized Forgetting, updating its current belief based on a mixture of fixed, initial priors and previous posterior beliefs. The weight given to these two sources is optimized alongside the other parameters, allowing the model to adapt dynamically to changes in environmental volatility and to unexpected observations. This approach is used to implement the "critic" of an actor-critic RL model, while the actor samples the resulting value distributions to choose which action to undertake. We show that our model can emulate different adaptation strategies to contingency changes, depending on its prior assumptions of environmental stability, and that model parameters can be fit to real data with high accuracy. The model also exhibits trade-offs between flexibility and computational costs that mirror those observed in real data. Overall, the proposed method provides a general framework to study learning flexibility and decision making in RL contexts.

Asunto(s)

Teorema de Bayes; Aprendizaje; Adaptación Psicológica; Algoritmos; Animales; Conducta Animal; Biología Computacional; Simulación por Computador; Toma de Decisiones; Técnicas de Apoyo para la Decisión; Humanos; Modelos Psicológicos; Refuerzo en Psicología; Recompensa

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Teorema de Bayes / Aprendizaje Tipo de estudio: Diagnostic_studies / Prognostic_studies Límite: Animals / Humans Idioma: En Revista: PLoS Comput Biol Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2019 Tipo del documento: Article País de afiliación: Bélgica Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google