Results 1 - 3 of 3
1.
Proc Natl Acad Sci U S A; 121(27): e2311805121, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38913896

ABSTRACT

Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We review recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes that generate the desired neural representational geometries and generalization. This includes the role of weight-norm regularization, network architecture, and hyperparameters. We explore recent advances in this theory and its ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
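The theory sketched above treats manifold dimension and radius as order parameters for generalization. A minimal numpy sketch of those two quantities (the Gaussian point cloud and all sizes are hypothetical stand-ins, not the paper's data):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 200 population responses of 500 neurons to exemplars of
# one object, i.e. a point cloud ("manifold") in neural firing-rate space.
X = rng.normal(size=(200, 500))

deltas = X - X.mean(axis=0)   # responses relative to the manifold center

# Eigenvalues of the covariance describe how variance spreads across directions.
lam = np.clip(np.linalg.eigvalsh(np.cov(deltas, rowvar=False)), 0.0, None)

# Participation-ratio dimension: (sum of eigenvalues)^2 / sum of squared eigenvalues.
D = lam.sum() ** 2 / (lam ** 2).sum()

# Manifold radius: RMS distance of responses from the manifold center.
R = np.sqrt((deltas ** 2).sum(axis=1).mean())

print(f"participation-ratio dimension ~ {D:.1f}, radius ~ {R:.2f}")
```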


Subjects
Brain; Neural Networks, Computer; Brain/physiology; Humans; Animals; Artificial Intelligence; Models, Neurological; Generalization, Psychological/physiology; Cognition/physiology
2.
Proc Natl Acad Sci U S A; 119(43): e2200800119, 2022 Oct 25.
Article in English | MEDLINE | ID: mdl-36251997

ABSTRACT

Understanding the neural basis of the remarkable human cognitive capacity to learn novel concepts from just one or a few sensory experiences constitutes a fundamental problem. We propose a simple, biologically plausible, mathematically tractable, and computationally powerful neural mechanism for few-shot learning of naturalistic concepts. We posit that the concepts that can be learned from few examples are defined by tightly circumscribed manifolds in the neural firing-rate space of higher-order sensory areas. We further posit that a single plastic downstream readout neuron learns to discriminate new concepts from few examples using a simple plasticity rule. We demonstrate the computational power of our proposal by showing that it achieves high few-shot learning accuracy on natural visual concepts using both macaque inferotemporal cortex representations and deep neural network (DNN) models of these representations, and can even learn novel visual concepts specified only through linguistic descriptors. Moreover, we develop a mathematical theory of few-shot learning that links neurophysiology to predictions about behavioral outcomes by delineating several fundamental, measurable geometric properties of neural representations; these properties accurately predict the few-shot learning performance of naturalistic concepts across all our numerical simulations. This theory reveals, for instance, that high-dimensional manifolds enhance the ability to learn new concepts from few examples. Intriguingly, we observe striking mismatches between the geometry of manifolds in the primate visual pathway and in trained DNNs. We discuss testable predictions of our theory for psychophysics and neurophysiological experiments.
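As described, the readout mechanism is consistent with prototype learning: average the few training examples per concept and classify with a single linear neuron whose weights are the difference of the class prototypes. A minimal sketch under that assumption (sizes, noise level, and all names are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
n_neurons, m_shots = 500, 5   # hypothetical population size and shot count

# Hypothetical firing-rate representations of two novel concepts, a and b:
# noisy samples around each concept's manifold center.
mu_a, mu_b = rng.normal(size=(2, n_neurons))
train_a = mu_a + 0.5 * rng.normal(size=(m_shots, n_neurons))
train_b = mu_b + 0.5 * rng.normal(size=(m_shots, n_neurons))

# "Plasticity rule": the readout weight vector points from prototype b to
# prototype a, with a bias placing the boundary halfway between them.
proto_a, proto_b = train_a.mean(axis=0), train_b.mean(axis=0)
w = proto_a - proto_b
bias = -w @ (proto_a + proto_b) / 2

def classify(x):
    # Single readout neuron: the sign of its drive decides the concept.
    return "a" if w @ x + bias > 0 else "b"

test = mu_a + 0.5 * rng.normal(size=n_neurons)
print(classify(test))   # expected: "a" on most draws
```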


Subjects
Concept Formation; Neural Networks, Computer; Animals; Humans; Learning/physiology; Macaca; Plastics; Primates; Visual Pathways/physiology
3.
Neuron; 111(1): 121-137.e13, 2023 Jan 04.
Article in English | MEDLINE | ID: mdl-36306779

ABSTRACT

The discovery of entorhinal grid cells has generated considerable interest in how and why hexagonal firing fields might emerge in a generic manner from neural circuits, and what their computational significance might be. Here, we forge a link between the problem of path integration and the existence of hexagonal grids by demonstrating that such grids arise in neural networks trained to path integrate under simple, biologically plausible constraints. Moreover, we develop a unifying theory for why hexagonal grids are ubiquitous in path-integrator circuits. Such trained networks also yield powerful mechanistic hypotheses, exhibiting realistic levels of biological variability not captured by hand-designed models. We furthermore develop methods to analyze the connectome and activity maps of our networks to elucidate the fundamental mechanisms underlying path integration. These methods provide a road map from connectomic and physiological measurements to conceptual understanding, in a manner that could generalize to other settings.
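A minimal sketch of the path-integration task setup the abstract describes: velocity inputs along a simulated trajectory, with place-cell-like activations as training targets (trajectory statistics, tuning width, and sizes are hypothetical assumptions, not the paper's values):

```python
import numpy as np

rng = np.random.default_rng(2)
T, n_place, box = 1000, 128, 1.0   # hypothetical task sizes: steps, place cells, box width

# Random-walk trajectory folded back into the box (a bouncing path).
raw = np.cumsum(0.02 * rng.normal(size=(T, 2)), axis=0) + box / 2
pos = box - np.abs((raw % (2 * box)) - box)

# Velocity inputs derived from the folded path, so inputs match targets.
vel = np.diff(pos, axis=0, prepend=pos[:1])

# Place-cell targets: Gaussian tuning curves at random centers.
centers = rng.uniform(0, box, size=(n_place, 2))
d2 = ((pos[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
targets = np.exp(-d2 / (2 * 0.05 ** 2))

# Training a recurrent network to map `vel` to `targets` (e.g. by gradient
# descent) is the setting in which hexagonal grid fields are reported to emerge.
print(vel.shape, targets.shape)   # (1000, 2) (1000, 128)
```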


Subjects
Grid Cells; Grid Cells/physiology; Entorhinal Cortex/physiology; Models, Neurological; Neural Networks, Computer; Computer Systems