Convolutional neural network-based encoding and decoding of visual object recognition in space and time.
Seeliger, K; Fritsche, M; Güçlü, U; Schoenmakers, S; Schoffelen, J-M; Bosch, S E; van Gerven, M A J.
Affiliation
  • Seeliger K; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands. Electronic address: kseeliger@posteo.jp.
  • Fritsche M; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
  • Güçlü U; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
  • Schoenmakers S; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
  • Schoffelen JM; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
  • Bosch SE; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
  • van Gerven MAJ; Radboud University, Donders Institute for Brain, Cognition and Behaviour, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands.
Neuroimage; 180(Pt A): 253-266, 2018 Oct 15.
Article in English | MEDLINE | ID: mdl-28723578
Representations learned by deep convolutional neural networks (CNNs) for object recognition are a widely investigated model of the processing hierarchy in the human visual system. Using functional magnetic resonance imaging, CNN representations of visual stimuli have previously been shown to correspond to processing stages in the ventral and dorsal streams of the visual system. Whether this correspondence between models and brain signals also holds for activity acquired at high temporal resolution has been explored less exhaustively. Here, we addressed this question by combining CNN-based encoding models with magnetoencephalography (MEG). Human participants passively viewed 1,000 images of objects while MEG signals were acquired. We modelled their high temporal resolution source-reconstructed cortical activity with CNNs, and observed a feed-forward sweep across the visual hierarchy between 75 and 200 ms after stimulus onset. This spatiotemporal cascade was captured by the network layer representations, where the increasingly abstract stimulus representation in the hierarchical network model was reflected in different parts of the visual cortex, following the visual ventral stream. We further validated the accuracy of our encoding model by decoding stimulus identity in a left-out validation set of viewed objects, achieving state-of-the-art decoding accuracy.
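The encoding approach summarised in the abstract, predicting brain responses from CNN layer activations and validating on a left-out set, is commonly implemented as a regularised linear regression from features to measured signals. The sketch below illustrates that general idea with simulated data; the feature dimensions, ridge penalty, and train/validation split are illustrative assumptions, not the paper's actual pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: 1000 images, CNN layer activations as features,
# and a simulated source-level MEG response at one cortical location
# and time point. All sizes and the noise level are assumptions.
n_images, n_features = 1000, 64
X = rng.standard_normal((n_images, n_features))      # CNN features per image
true_w = rng.standard_normal(n_features)
y = X @ true_w + 0.1 * rng.standard_normal(n_images)  # simulated MEG signal

# Train on most images, hold out a validation set (as in the study design).
X_tr, X_val = X[:900], X[900:]
y_tr, y_val = y[:900], y[900:]

# Ridge-regularised linear encoding model, closed form:
#   w = (X'X + lambda * I)^{-1} X'y
lam = 1.0
w = np.linalg.solve(X_tr.T @ X_tr + lam * np.eye(n_features), X_tr.T @ y_tr)

# Encoding accuracy: correlation between predicted and measured responses
# on the left-out images.
pred = X_val @ w
r = float(np.corrcoef(pred, y_val)[0, 1])
print(round(r, 3))
```

In the full spatiotemporal setting described in the abstract, one such model would be fit per source location and time point, which is how a feed-forward sweep across the visual hierarchy can be read off from where and when each network layer predicts activity best.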
Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Pattern Recognition, Visual / Visual Cortex / Neural Networks, Computer Limits: Adult / Female / Humans / Male Language: En Publication year: 2018 Document type: Article