A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes.

Svanera, Michele; Morgan, Andrew T; Petro, Lucy S; Muckli, Lars

Svanera, Michele; Morgan, Andrew T; Petro, Lucy S; Muckli, Lars.

Affiliation

Svanera M; Centre for Cognitive Neuroimaging, Institute of Neuroscience and Psychology, University of Glasgow, UK.
Morgan AT; michele.svanera@glasgow.ac.uk.
Petro LS; Centre for Cognitive Neuroimaging, Institute of Neuroscience and Psychology, University of Glasgow, UK.
Muckli L; andrew.morgan@glasgow.ac.uk.

J Vis ; 21(7): 5, 2021 07 06.

Article in En | MEDLINE | ID: mdl-34259828

ABSTRACT

The promise of artificial intelligence in understanding biological vision relies on the comparison of computational models with brain data with the goal of capturing functional principles of visual information processing. Convolutional neural networks (CNN) have successfully matched the transformations in hierarchical processing occurring along the brain's feedforward visual pathway, extending into ventral temporal cortex. However, we are still to learn if CNNs can successfully describe feedback processes in early visual cortex. Here, we investigated similarities between human early visual cortex and a CNN with encoder/decoder architecture, trained with self-supervised learning to fill occlusions and reconstruct an unseen image. Using representational similarity analysis (RSA), we compared 3T functional magnetic resonance imaging (fMRI) data from a nonstimulated patch of early visual cortex in human participants viewing partially occluded images, with the different CNN layer activations from the same images. Results show that our self-supervised image-completion network outperforms a classical object-recognition supervised network (VGG16) in terms of similarity to fMRI data. This work provides additional evidence that optimal models of the visual system might come from less feedforward architectures trained with less supervision. We also find that CNN decoder pathway activations are more similar to brain processing compared to encoder activations, suggesting an integration of mid- and low/middle-level features in early visual cortex. Challenging an artificial intelligence model to learn natural image representations via self-supervised learning and comparing them with brain data can help us to constrain our understanding of information processing, such as neuronal predictive coding.

Subject(s)

Magnetic Resonance Imaging; Visual Cortex; Artificial Intelligence; Humans; Neural Networks, Computer; Visual Cortex/diagnostic imaging; Visual Perception

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Visual Cortex / Magnetic Resonance Imaging Type of study: Prognostic_studies Limits: Humans Language: En Journal: J Vis Journal subject: OFTALMOLOGIA Year: 2021 Document type: Article Country of publication: United States

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google