Learning in deep neural networks and brains with similarity-weighted interleaved learning.

Saxena, Rajat; Shobe, Justin L; McNaughton, Bruce L

Affiliation

Saxena R; Department of Neurobiology and Behavior, University of California, Irvine, CA 92697.
Shobe JL; Department of Neurobiology and Behavior, University of California, Irvine, CA 92697.
McNaughton BL; Department of Neurobiology and Behavior, University of California, Irvine, CA 92697.

Proc Natl Acad Sci U S A; 119(27): e2115229119, 2022 Jul 05.

Article in English | MEDLINE | ID: mdl-35759669

ABSTRACT

Understanding how the brain learns throughout a lifetime remains a long-standing challenge. In artificial neural networks (ANNs), incorporating novel information too rapidly results in catastrophic interference, i.e., abrupt loss of previously acquired knowledge. Complementary Learning Systems Theory (CLST) suggests that new memories can be gradually integrated into the neocortex by interleaving new memories with existing knowledge. This approach, however, has been assumed to require interleaving all existing knowledge every time something new is learned, which is implausible because it is time-consuming and requires a large amount of data. We show that deep, nonlinear ANNs can learn new information by interleaving only a subset of old items that share substantial representational similarity with the new information. By using such similarity-weighted interleaved learning (SWIL), ANNs can learn new information rapidly with a similar accuracy level and minimal interference, while using a much smaller number of old items presented per epoch (fast and data-efficient). SWIL is shown to work with various standard classification datasets (Fashion-MNIST, CIFAR10, and CIFAR100), deep neural network architectures, and in sequential learning frameworks. We show that data efficiency and speedup in learning new items are increased roughly proportionally to the number of nonoverlapping classes stored in the network, which implies an enormous possible speedup in human brains, which encode a high number of separate categories. Finally, we propose a theoretical model of how SWIL might be implemented in the brain.
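The core idea in the abstract, interleaving only old items that resemble the new information, can be illustrated with a minimal sketch. The abstract does not specify the similarity measure or the sampling rule, so everything here is an assumption made for illustration: similarity is taken as the cosine between class-mean feature vectors, old classes are sampled in proportion to that similarity, and the function names (`class_weights`, `sample_old_items`) are hypothetical rather than the authors' code.

```python
import math
import random


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return sum(a * b for a, b in zip(u, v)) / norm if norm > 0 else 0.0


def class_weights(new_feats, old_feats_by_class):
    """Map each old class to a sampling weight; the weights sum to 1.

    Assumption: similarity = cosine between the mean feature vector of the
    new class and that of each old class, with negative values clipped to 0.
    """
    new_mean = mean_vector(new_feats)
    sims = {
        c: max(0.0, cosine(new_mean, mean_vector(feats)))
        for c, feats in old_feats_by_class.items()
    }
    total = sum(sims.values())
    if total == 0:
        # No old class is similar: fall back to uniform interleaving.
        return {c: 1.0 / len(sims) for c in sims}
    return {c: s / total for c, s in sims.items()}


def sample_old_items(old_feats_by_class, weights, n_items, rng):
    """Draw (class, item) pairs for one epoch of interleaving.

    Classes are chosen in proportion to their weight; items within a
    class are chosen uniformly. Zero-weight classes are never drawn.
    """
    classes = [c for c in weights if weights[c] > 0]
    picked = rng.choices(classes, weights=[weights[c] for c in classes], k=n_items)
    return [(c, rng.choice(old_feats_by_class[c])) for c in picked]
```

In a training loop, the pairs returned by `sample_old_items` would be mixed with the new-class examples in each epoch. Because dissimilar classes receive little or no weight, the number of old items per epoch can be kept small, which is the data-efficiency argument the abstract makes.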

Subjects

Learning; Neocortex; Neural Networks, Computer; Humans; Models, Neurological; Neocortex/physiology; Systems Theory

Keywords

complementary learning systems; learning; memory; memory consolidation; neural networks

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Neural Networks, Computer / Neocortex / Learning | Limit: Humans | Language: En | Journal: Proc Natl Acad Sci U S A | Publication year: 2022 | Document type: Article