Learning low-dimensional generalizable natural features from retina using a U-net.

Wang, Siwei; Hoshal, Benjamin; de Laittre, Elizabeth A; Marre, Olivier; Berry, Michael J; Palmer, Stephanie E

Wang, Siwei; Hoshal, Benjamin; de Laittre, Elizabeth A; Marre, Olivier; Berry, Michael J; Palmer, Stephanie E.

Afiliação

Wang S; Department of Organismal Biology and Anatomy, University of Chicago.
Hoshal B; Department of Organismal Biology and Anatomy, University of Chicago.
de Laittre EA; Committee on Computational Neuroscience, University of Chicago.
Marre O; Sorbonne Université, INSERM, CNRS, Institut de la Vision.
Berry MJ; Princeton Neuroscience Institute, Princeton University.
Palmer SE; Department of Organismal Biology and Anatomy, University of Chicago.

Adv Neural Inf Process Syst ; 35: 11355-11368, 2022 Dec.

Article em En | MEDLINE | ID: mdl-37362058

RESUMO

Much of sensory neuroscience focuses on presenting stimuli that are chosen by the experimenter because they are parametric and easy to sample and are thought to be behaviorally relevant to the organism. However, it is not generally known what these relevant features are in complex, natural scenes. This work focuses on using the retinal encoding of natural movies to determine the presumably behaviorally-relevant features that the brain represents. It is prohibitive to parameterize a natural movie and its respective retinal encoding fully. We use time within a natural movie as a proxy for the whole suite of features evolving across the scene. We then use a task-agnostic deep architecture, an encoder-decoder, to model the retinal encoding process and characterize its representation of "time in the natural scene" in a compressed latent space. In our end-to-end training, an encoder learns a compressed latent representation from a large population of salamander retinal ganglion cells responding to natural movies, while a decoder samples from this compressed latent space to generate the appropriate future movie frame. By comparing latent representations of retinal activity from three movies, we find that the retina has a generalizable encoding for time in the natural scene: the precise, low-dimensional representation of time learned from one movie can be used to represent time in a different movie, with up to 17 ms resolution. We then show that static textures and velocity features of a natural movie are synergistic. The retina simultaneously encodes both to establishes a generalizable, low-dimensional representation of time in the natural scene.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2022 Tipo de documento: Article