Recurrent Connections in the Primate Ventral Visual Stream Mediate a Trade-Off Between Task Performance and Network Size During Core Object Recognition.

Nayebi, Aran; Sagastuy-Brena, Javier; Bear, Daniel M; Kar, Kohitij; Kubilius, Jonas; Ganguli, Surya; Sussillo, David; DiCarlo, James J; Yamins, Daniel L K

Affiliation

Nayebi A; Stanford University, Stanford, CA 94305, U.S.A. anayebi@stanford.edu.
Sagastuy-Brena J; Stanford University, Stanford, CA 94305, U.S.A. jvrsgsty@stanford.edu.
Bear DM; Stanford University, Stanford, CA 94305, U.S.A. dbear@stanford.edu.
Kar K; MIT, Cambridge, MA 02139, U.S.A. kohitij@mit.edu.
Kubilius J; MIT, Cambridge, MA 02139, U.S.A.; KU Leuven, Leuven 3000, Belgium qbilius@gmail.com.
Ganguli S; Stanford University, Stanford, CA 94305, U.S.A. sganguli@stanford.edu.
Sussillo D; Stanford University, Stanford, CA 94305, U.S.A. sussillo@stanford.edu.
DiCarlo JJ; MIT, Cambridge, MA 02139, U.S.A. dicarlo@mit.edu.
Yamins DLK; Stanford University, Stanford, CA 94305, U.S.A.

Neural Comput; 34(8): 1652-1675, 2022 07 14.

Article in En | MEDLINE | ID: mdl-35798321

ABSTRACT

The ventral visual stream enables humans and nonhuman primates to effortlessly recognize objects across a multitude of viewing conditions, yet the computational role of its abundant feedback connections is unclear. Prior studies have augmented feedforward convolutional neural networks (CNNs) with recurrent connections to study their role in visual processing; however, these recurrent networks are often optimized directly on neural data, or the comparative metrics used are undefined for standard feedforward networks that lack these connections. In this work, we develop task-optimized convolutional recurrent (ConvRNN) network models that more accurately mimic the timing and gross neuroanatomy of the ventral pathway. Properly chosen intermediate-depth ConvRNN circuit architectures, which incorporate mechanisms of feedforward bypassing and recurrent gating, can achieve high performance on a core recognition task, comparable to that of much deeper feedforward networks. We then develop methods that allow us to compare both CNNs and ConvRNNs to fine-grained measurements of primate categorization behavior and neural response trajectories across thousands of stimuli. We find that high-performing ConvRNNs provide a better match to these data than feedforward networks of any depth, predicting the precise timings at which each stimulus is behaviorally decoded from neural activation patterns. Moreover, these ConvRNN circuits consistently produce quantitatively accurate predictions of neural dynamics from V4 and IT across the entire stimulus presentation. In fact, we find that the highest-performing ConvRNNs, which best match neural and behavioral data, also achieve a strong Pareto trade-off between task performance and overall network size. Taken together, our results suggest that the functional purpose of recurrence in the ventral pathway is to fit a high-performing network in cortex, attaining computational power through temporal rather than spatial complexity.
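
The Pareto trade-off between task performance and network size mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' analysis code: the function name `pareto_front`, the model names, and all parameter counts and accuracies below are hypothetical values invented for illustration. A model is on the Pareto front if no other model is both at least as small and at least as accurate, and strictly better on one of the two.

```python
from typing import List, Tuple

# Hypothetical (name, size in millions of parameters, task accuracy) triples.
# These numbers are invented for illustration; they are NOT results from the paper.
Model = Tuple[str, float, float]


def pareto_front(models: List[Model]) -> List[Model]:
    """Return the models that are not dominated by any other model.

    A model is dominated if some other model is at least as small AND at
    least as accurate, and strictly better on at least one of the two.
    """
    front = []
    for name, size, acc in models:
        dominated = any(
            (s <= size and a >= acc) and (s < size or a > acc)
            for _, s, a in models
        )
        if not dominated:
            front.append((name, size, acc))
    # Sort by size so the front reads from the smallest to the largest network.
    return sorted(front, key=lambda m: m[1])


models = [
    ("shallow_ff", 5.0, 0.60),      # small feedforward net, low accuracy
    ("deep_ff", 60.0, 0.76),        # large feedforward net, high accuracy
    ("very_deep_ff", 100.0, 0.75),  # larger still, but no more accurate
    ("conv_rnn", 15.0, 0.73),       # intermediate-depth recurrent net
]
front = pareto_front(models)
for name, size, acc in front:
    print(f"{name}: {size:.0f}M parameters, accuracy {acc:.2f}")
```

In this toy example the recurrent model stays on the front because it reaches accuracy close to the deep feedforward model at a fraction of the size, which is the kind of trade-off the abstract describes.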

Subjects

Task Performance and Analysis; Visual Perception; Animals; Humans; Macaca mulatta/physiology; Neural Networks, Computer; Pattern Recognition, Visual/physiology; Recognition, Psychology/physiology; Visual Pathways/physiology; Visual Perception/physiology

Full text: 1 | Database: MEDLINE | Main subject: Task Performance and Analysis / Visual Perception | Type of study: Prognostic_studies | Limit: Animals / Humans | Language: En | Year of publication: 2022 | Document type: Article
