Meaning maps and saliency models based on deep convolutional neural networks are insensitive to image meaning when predicting human fixations.

Pedziwiatr, Marek A; Kümmerer, Matthias; Wallis, Thomas S A; Bethge, Matthias; Teufel, Christoph

Pedziwiatr, Marek A; Kümmerer, Matthias; Wallis, Thomas S A; Bethge, Matthias; Teufel, Christoph.

Afiliação

Pedziwiatr MA; Cardiff University, Cardiff University Brain Research Imaging Centre (CUBRIC), School of Psychology, Cardiff, United Kingdom. Electronic address: marek.pedziwi@gmail.com.
Kümmerer M; University of Tübingen, Tübingen, Germany.
Wallis TSA; University of Tübingen, Tübingen, Germany.
Bethge M; University of Tübingen, Tübingen, Germany.
Teufel C; Cardiff University, Cardiff University Brain Research Imaging Centre (CUBRIC), School of Psychology, Cardiff, United Kingdom.

Cognition ; 206: 104465, 2021 01.

Article em En | MEDLINE | ID: mdl-33096374

ABSTRACT

ABSTRACT

Eye movements are vital for human vision, and it is therefore important to understand how observers decide where to look. Meaning maps (MMs), a technique to capture the distribution of semantic information across an image, have recently been proposed to support the hypothesis that meaning rather than image features guides human gaze. MMs have the potential to be an important tool far beyond eye-movements research. Here, we examine central assumptions underlying MMs. First, we compared the performance of MMs in predicting fixations to saliency models, showing that DeepGaze II - a deep neural network trained to predict fixations based on high-level features rather than meaning - outperforms MMs. Second, we show that whereas human observers respond to changes in meaning induced by manipulating object-context relationships, MMs and DeepGaze II do not. Together, these findings challenge central assumptions underlying the use of MMs to measure the distribution of meaning in images.

Assuntos

Movimentos Oculares; Redes Neurais de Computação; Humanos; Semântica

Palavras-chave

Deep neural networks; Eye movements; Meaning maps; Natural scenes; Saliency

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Movimentos Oculares Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Cognition Ano de publicação: 2021 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google