Pesquisa | BVS Doenças Infecciosas e Parasitárias

1.

Disentangled deep generative models reveal coding principles of the human face processing network.

Soulos, Paul; Isik, Leyla.

PLoS Comput Biol ; 20(2): e1011887, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38408105

RESUMO

Despite decades of research, much is still unknown about the computations carried out in the human face processing network. Recently, deep networks have been proposed as a computational account of human visual processing, but while they provide a good match to neural data throughout visual cortex, they lack interpretability. We introduce a method for interpreting brain activity using a new class of deep generative models, disentangled representation learning models, which learn a low-dimensional latent space that "disentangles" different semantically meaningful dimensions of faces, such as rotation, lighting, or hairstyle, in an unsupervised manner by enforcing statistical independence between dimensions. We find that the majority of our model's learned latent dimensions are interpretable by human raters. Further, these latent dimensions serve as a good encoding model for human fMRI data. We next investigate the representation of different latent dimensions across face-selective voxels. We find that low- and high-level face features are represented in posterior and anterior face-selective regions, respectively, corroborating prior models of human face recognition. Interestingly, though, we find identity-relevant and irrelevant face features across the face processing network. Finally, we provide new insight into the few "entangled" (uninterpretable) dimensions in our model by showing that they match responses in the ventral stream and carry information about facial identity. Disentangled face encoding models provide an exciting alternative to standard "black box" deep learning approaches for modeling and interpreting human brain data.

Assuntos

Reconhecimento Facial , Córtex Visual , Humanos , Reconhecimento Facial/fisiologia , Encéfalo/fisiologia , Córtex Visual/fisiologia , Mapeamento Encefálico , Imageamento por Ressonância Magnética/métodos

2.

Rapid Processing of Observed Touch through Social Perceptual Brain Regions: An EEG-fMRI Fusion Study.

Lee Masson, Haemy; Isik, Leyla.

J Neurosci ; 43(45): 7700-7711, 2023 11 08.

Artigo em Inglês | MEDLINE | ID: mdl-37871963

RESUMO

Seeing social touch triggers a strong social-affective response that involves multiple brain networks, including visual, social perceptual, and somatosensory systems. Previous studies have identified the specific functional role of each system, but little is known about the speed and directionality of the information flow. Is this information extracted via the social perceptual system or from simulation from somatosensory cortex? To address this, we examined the spatiotemporal neural processing of observed touch. Twenty-one human participants (seven males) watched 500-ms video clips showing social and nonsocial touch during electroencephalogram (EEG) recording. Visual and social-affective features were rapidly extracted in the brain, beginning at 90 and 150 ms after video onset, respectively. Combining the EEG data with functional magnetic resonance imaging (fMRI) data from our prior study with the same stimuli reveals that neural information first arises in early visual cortex (EVC), then in the temporoparietal junction and posterior superior temporal sulcus (TPJ/pSTS), and finally in the somatosensory cortex. EVC and TPJ/pSTS uniquely explain EEG neural patterns, while somatosensory cortex does not contribute to EEG patterns alone, suggesting that social-affective information may flow from TPJ/pSTS to somatosensory cortex. Together, these findings show that social touch is processed quickly, within the timeframe of feedforward visual processes, and that the social-affective meaning of touch is first extracted by a social perceptual pathway. Such rapid processing of social touch may be vital to its effective use during social interaction.SIGNIFICANCE STATEMENT Seeing physical contact between people evokes a strong social-emotional response. Previous research has identified the brain systems responsible for this response, but little is known about how quickly and in what direction the information flows. We demonstrated that the brain processes the social-emotional meaning of observed touch quickly, starting as early as 150 ms after the stimulus onset. By combining electroencephalogram (EEG) data with functional magnetic resonance imaging (fMRI) data, we show for the first time that the social-affective meaning of touch is first extracted by a social perceptual pathway and followed by the later involvement of somatosensory simulation. This rapid processing of touch through the social perceptual route may play a pivotal role in effective usage of touch in social communication and interaction.

Assuntos

Percepção do Tato , Tato , Humanos , Masculino , Afeto/fisiologia , Encéfalo/fisiologia , Mapeamento Encefálico/métodos , Eletroencefalografia , Imageamento por Ressonância Magnética , Córtex Somatossensorial/diagnóstico por imagem , Córtex Somatossensorial/fisiologia , Tato/fisiologia , Percepção do Tato/fisiologia , Feminino

3.

Functional selectivity for social interaction perception in the human superior temporal sulcus during natural viewing.

Lee Masson, Haemy; Isik, Leyla.

Neuroimage ; 245: 118741, 2021 12 15.

Artigo em Inglês | MEDLINE | ID: mdl-34800663

RESUMO

Recognizing others' social interactions is a crucial human ability. Using simple stimuli, previous studies have shown that social interactions are selectively processed in the superior temporal sulcus (STS), but prior work with movies has suggested that social interactions are processed in the medial prefrontal cortex (mPFC), part of the theory of mind network. It remains unknown to what extent social interaction selectivity is observed in real world stimuli when controlling for other covarying perceptual and social information, such as faces, voices, and theory of mind. The current study utilizes a functional magnetic resonance imaging (fMRI) movie paradigm and advanced machine learning methods to uncover the brain mechanisms uniquely underlying naturalistic social interaction perception. We analyzed two publicly available fMRI datasets, collected while both male and female human participants (n = 17 and 18) watched two different commercial movies in the MRI scanner. By performing voxel-wise encoding and variance partitioning analyses, we found that broad social-affective features predict neural responses in social brain regions, including the STS and mPFC. However, only the STS showed robust and unique selectivity specifically to social interactions, independent from other covarying features. This selectivity was observed across two separate fMRI datasets. These findings suggest that naturalistic social interaction perception recruits dedicated neural circuity in the STS, separate from the theory of mind network, and is a critical dimension of human social understanding.

Assuntos

Mapeamento Encefálico/métodos , Aprendizado de Máquina , Imageamento por Ressonância Magnética , Interação Social , Lobo Temporal/diagnóstico por imagem , Lobo Temporal/fisiologia , Teoria da Mente , Adulto , Conjuntos de Dados como Assunto , Feminino , Humanos , Processamento de Imagem Assistida por Computador , Masculino , Filmes Cinematográficos

4.

The speed of human social interaction perception.

Isik, Leyla; Mynick, Anna; Pantazis, Dimitrios; Kanwisher, Nancy.

Neuroimage ; 215: 116844, 2020 07 15.

Artigo em Inglês | MEDLINE | ID: mdl-32302763

RESUMO

The ability to perceive others' social interactions, here defined as the directed contingent actions between two or more people, is a fundamental part of human experience that develops early in infancy and is shared with other primates. However, the neural computations underlying this ability remain largely unknown. Is social interaction recognition a rapid feedforward process or a slower post-perceptual inference? Here we used magnetoencephalography (MEG) decoding to address this question. Subjects in the MEG viewed snapshots of visually matched real-world scenes containing a pair of people who were either engaged in a social interaction or acting independently. The presence versus absence of a social interaction could be read out from subjects' MEG data spontaneously, even while subjects performed an orthogonal task. This readout generalized across different people and scenes, revealing abstract representations of social interactions in the human brain. These representations, however, did not come online until quite late, at 300 âms after image onset, well after feedforward visual processes. In a second experiment, we found that social interaction readout still occurred at this same late latency even when subjects performed an explicit task detecting social interactions. We further showed that MEG responses distinguished between different types of social interactions (mutual gaze vs joint attention) even later, around 500 âms after image onset. Taken together, these results suggest that the human brain spontaneously extracts information about others' social interactions, but does so slowly, likely relying on iterative top-down computations.

Assuntos

Encéfalo/fisiologia , Magnetoencefalografia/métodos , Tempo de Reação/fisiologia , Interação Social , Percepção Social/psicologia , Percepção Visual/fisiologia , Adolescente , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estimulação Luminosa/métodos , Adulto Jovem

5.

Perceiving social interactions in the posterior superior temporal sulcus.

Isik, Leyla; Koldewyn, Kami; Beeler, David; Kanwisher, Nancy.

Proc Natl Acad Sci U S A ; 114(43): E9145-E9152, 2017 10 24.

Artigo em Inglês | MEDLINE | ID: mdl-29073111

RESUMO

Primates are highly attuned not just to social characteristics of individual agents, but also to social interactions between multiple agents. Here we report a neural correlate of the representation of social interactions in the human brain. Specifically, we observe a strong univariate response in the posterior superior temporal sulcus (pSTS) to stimuli depicting social interactions between two agents, compared with (i) pairs of agents not interacting with each other, (ii) physical interactions between inanimate objects, and (iii) individual animate agents pursuing goals and interacting with inanimate objects. We further show that this region contains information about the nature of the social interaction-specifically, whether one agent is helping or hindering the other. This sensitivity to social interactions is strongest in a specific subregion of the pSTS but extends to a lesser extent into nearby regions previously implicated in theory of mind and dynamic face perception. This sensitivity to the presence and nature of social interactions is not easily explainable in terms of low-level visual features, attention, or the animacy, actions, or goals of individual agents. This region may underlie our ability to understand the structure of our social world and navigate within it.

Assuntos

Relações Interpessoais , Lobo Temporal/diagnóstico por imagem , Lobo Temporal/fisiologia , Adulto , Atenção/fisiologia , Encéfalo/diagnóstico por imagem , Encéfalo/fisiologia , Feminino , Humanos , Imageamento por Ressonância Magnética , Masculino , Experimentação Humana não Terapêutica , Estimulação Luminosa

6.

General Transformations of Object Representations in Human Visual Cortex.

Ward, Emily J; Isik, Leyla; Chun, Marvin M.

J Neurosci ; 38(40): 8526-8537, 2018 10 03.

Artigo em Inglês | MEDLINE | ID: mdl-30126975

RESUMO

The brain actively represents incoming information, but these representations are only useful to the extent that they flexibly reflect changes in the environment. How does the brain transform representations across changes, such as in size or viewing angle? We conducted a fMRI experiment and a magnetoencephalography experiment in humans (both sexes) in which participants viewed objects before and after affine viewpoint changes (rotation, translation, enlargement). We used a novel approach, representational transformation analysis, to derive transformation functions that linked the distributed patterns of brain activity evoked by an object before and after an affine change. Crucially, transformations derived from one object could predict a postchange representation for novel objects. These results provide evidence of general operations in the brain that are distinct from neural representations evoked by particular objects and scenes.SIGNIFICANCE STATEMENT The dominant focus in cognitive neuroscience has been on how the brain represents information, but these representations are only useful to the extent that they flexibly reflect changes in the environment. How does the brain transform representations, such as linking two states of an object, for example, before and after an object undergoes a physical change? We used a novel method to derive transformations between the brain activity evoked by an object before and after an affine viewpoint change. We show that transformations derived from one object undergoing a change generalized to a novel object undergoing the same change. This result shows that there are general perceptual operations that transform object representations from one state to another.

Assuntos

Reconhecimento Visual de Modelos/fisiologia , Córtex Visual/fisiologia , Mapeamento Encefálico , Feminino , Humanos , Imageamento por Ressonância Magnética , Magnetoencefalografia , Masculino , Estimulação Luminosa/métodos

7.

What is changing when: Decoding visual information in movies from human intracranial recordings.

Isik, Leyla; Singer, Jedediah; Madsen, Joseph R; Kanwisher, Nancy; Kreiman, Gabriel.

Neuroimage ; 180(Pt A): 147-159, 2018 10 15.

Artigo em Inglês | MEDLINE | ID: mdl-28823828

RESUMO

The majority of visual recognition studies have focused on the neural responses to repeated presentations of static stimuli with abrupt and well-defined onset and offset times. In contrast, natural vision involves unique renderings of visual inputs that are continuously changing without explicitly defined temporal transitions. Here we considered commercial movies as a coarse proxy to natural vision. We recorded intracranial field potential signals from 1,284 electrodes implanted in 15 patients with epilepsy while the subjects passively viewed commercial movies. We could rapidly detect large changes in the visual inputs within approximately 100 ms of their occurrence, using exclusively field potential signals from ventral visual cortical areas including the inferior temporal gyrus and inferior occipital gyrus. Furthermore, we could decode the content of those visual changes even in a single movie presentation, generalizing across the wide range of transformations present in a movie. These results present a methodological framework for studying cognition during dynamic and natural vision.

Assuntos

Córtex Visual/fisiologia , Percepção Visual/fisiologia , Adolescente , Adulto , Mapeamento Encefálico/métodos , Criança , Pré-Escolar , Epilepsia Resistente a Medicamentos/terapia , Terapia por Estimulação Elétrica , Eletrodos Implantados , Potenciais Evocados Visuais/fisiologia , Feminino , Humanos , Masculino , Filmes Cinematográficos , Estimulação Luminosa , Processamento de Sinais Assistido por Computador , Adulto Jovem

8.

A fast, invariant representation for human action in the visual system.

Isik, Leyla; Tacchetti, Andrea; Poggio, Tomaso.

J Neurophysiol ; 119(2): 631-640, 2018 02 01.

Artigo em Inglês | MEDLINE | ID: mdl-29118198

RESUMO

Humans can effortlessly recognize others' actions in the presence of complex transformations, such as changes in viewpoint. Several studies have located the regions in the brain involved in invariant action recognition; however, the underlying neural computations remain poorly understood. We use magnetoencephalography decoding and a data set of well-controlled, naturalistic videos of five actions (run, walk, jump, eat, drink) performed by different actors at different viewpoints to study the computational steps used to recognize actions across complex transformations. In particular, we ask when the brain discriminates between different actions, and when it does so in a manner that is invariant to changes in 3D viewpoint. We measure the latency difference between invariant and noninvariant action decoding when subjects view full videos as well as form-depleted and motion-depleted stimuli. We were unable to detect a difference in decoding latency or temporal profile between invariant and noninvariant action recognition in full videos. However, when either form or motion information is removed from the stimulus set, we observe a decrease and delay in invariant action decoding. Our results suggest that the brain recognizes actions and builds invariance to complex transformations at the same time and that both form and motion information are crucial for fast, invariant action recognition. NEW & NOTEWORTHY The human brain can quickly recognize actions despite transformations that change their visual appearance. We use neural timing data to uncover the computations underlying this ability. We find that within 200 ms action can be read out of magnetoencephalography data and that this representation is invariant to changes in viewpoint. We find form and motion are needed for this fast action decoding, suggesting that the brain quickly integrates complex spatiotemporal features to form invariant action representations.

Assuntos

Encéfalo/fisiologia , Percepção de Movimento , Reconhecimento Visual de Modelos , Adulto , Feminino , Humanos , Masculino , Movimento , Tempo de Reação

9.

Invariant recognition drives neural representations of action sequences.

Tacchetti, Andrea; Isik, Leyla; Poggio, Tomaso.

PLoS Comput Biol ; 13(12): e1005859, 2017 12.

Artigo em Inglês | MEDLINE | ID: mdl-29253864

RESUMO

Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences.

Assuntos

Reconhecimento Psicológico/fisiologia , Percepção Visual/fisiologia , Biologia Computacional , Sinais (Psicologia) , Discriminação Psicológica/fisiologia , Humanos , Magnetoencefalografia , Modelos Neurológicos , Redes Neurais de Computação , Estimulação Luminosa , Córtex Visual/fisiologia

10.

The dynamics of invariant object recognition in the human visual system.

Isik, Leyla; Meyers, Ethan M; Leibo, Joel Z; Poggio, Tomaso.

J Neurophysiol ; 111(1): 91-102, 2014 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-24089402

RESUMO

The human visual system can rapidly recognize objects despite transformations that alter their appearance. The precise timing of when the brain computes neural representations that are invariant to particular transformations, however, has not been mapped in humans. Here we employ magnetoencephalography decoding analysis to measure the dynamics of size- and position-invariant visual information development in the ventral visual stream. With this method we can read out the identity of objects beginning as early as 60 ms. Size- and position-invariant visual information appear around 125 ms and 150 ms, respectively, and both develop in stages, with invariance to smaller transformations arising before invariance to larger transformations. Additionally, the magnetoencephalography sensor activity localizes to neural sources that are in the most posterior occipital regions at the early decoding times and then move temporally as invariant information develops. These results provide previously unknown latencies for key stages of human-invariant object recognition, as well as new and compelling evidence for a feed-forward hierarchical model of invariant object recognition where invariance increases at each successive visual area along the ventral stream.

Assuntos

Reconhecimento Visual de Modelos , Tempo de Reação , Córtex Visual/fisiologia , Adolescente , Adulto , Potenciais Evocados Visuais , Feminino , Humanos , Masculino

11.

A shared neural code for perceiving and remembering social interactions in the human superior temporal sulcus.

Lee Masson, Haemy; Chen, Janice; Isik, Leyla.

Neuropsychologia ; 196: 108823, 2024 04 15.

Artigo em Inglês | MEDLINE | ID: mdl-38346576

RESUMO

Recognizing and remembering social information is a crucial cognitive skill. Neural patterns in the superior temporal sulcus (STS) support our ability to perceive others' social interactions. However, despite the prominence of social interactions in memory, the neural basis of remembering social interactions is still unknown. To fill this gap, we investigated the brain mechanisms underlying memory of others' social interactions during free spoken recall of a naturalistic movie. By applying machine learning-based fMRI encoding analyses to densely labeled movie and recall data we found that a subset of the STS activity evoked by viewing social interactions predicted neural responses in not only held-out movie data, but also during memory recall. These results provide the first evidence that activity in the STS is reinstated in response to specific social content and that its reactivation underlies our ability to remember others' interactions. These findings further suggest that the STS contains representations of social interactions that are not only perceptually driven, but also more abstract or conceptual in nature.

Assuntos

Interação Social , Lobo Temporal , Humanos , Lobo Temporal/diagnóstico por imagem , Lobo Temporal/fisiologia , Encéfalo/fisiologia , Memória/fisiologia , Mapeamento Encefálico , Imageamento por Ressonância Magnética

12.

Multidimensional neural representations of social features during movie viewing.

Lee Masson, Haemy; Chang, Lucy; Isik, Leyla.

Soc Cogn Affect Neurosci ; 19(1)2024 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-38722755

RESUMO

The social world is dynamic and contextually embedded. Yet, most studies utilize simple stimuli that do not capture the complexity of everyday social episodes. To address this, we implemented a movie viewing paradigm and investigated how everyday social episodes are processed in the brain. Participants watched one of two movies during an MRI scan. Neural patterns from brain regions involved in social perception, mentalization, action observation and sensory processing were extracted. Representational similarity analysis results revealed that several labeled social features (including social interaction, mentalization, the actions of others, characters talking about themselves, talking about others and talking about objects) were represented in the superior temporal gyrus (STG) and middle temporal gyrus (MTG). The mentalization feature was also represented throughout the theory of mind network, and characters talking about others engaged the temporoparietal junction (TPJ), suggesting that listeners may spontaneously infer the mental state of those being talked about. In contrast, we did not observe the action representations in the frontoparietal regions of the action observation network. The current findings indicate that STG and MTG serve as key regions for social processing, and that listening to characters talk about others elicits spontaneous mental state inference in TPJ during natural movie viewing.

Assuntos

Mapeamento Encefálico , Encéfalo , Imageamento por Ressonância Magnética , Filmes Cinematográficos , Percepção Social , Teoria da Mente , Humanos , Feminino , Masculino , Imageamento por Ressonância Magnética/métodos , Adulto Jovem , Encéfalo/fisiologia , Encéfalo/diagnóstico por imagem , Adulto , Teoria da Mente/fisiologia , Mentalização/fisiologia , Estimulação Luminosa/métodos

13.

How does the primate brain combine generative and discriminative computations in vision?

Peters, Benjamin; DiCarlo, James J; Gureckis, Todd; Haefner, Ralf; Isik, Leyla; Tenenbaum, Joshua; Konkle, Talia; Naselaris, Thomas; Stachenfeld, Kimberly; Tavares, Zenna; Tsao, Doris; Yildirim, Ilker; Kriegeskorte, Nikolaus.

ArXiv ; 2024 Jan 11.

Artigo em Inglês | MEDLINE | ID: mdl-38259351

RESUMO

Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remove irrelevant variation and represent behaviorally relevant information in a format suitable for downstream functions of cognition and behavioral control. In this conception, vision is driven by the sensory data, and perception is direct because the processing proceeds from the data to the latent variables of interest. The notion of "inference" in this conception is that of the engineering literature on neural networks, where feedforward convolutional neural networks processing images are said to perform inference. The alternative conception is that of vision as an inference process in Helmholtz's sense, where the sensory evidence is evaluated in the context of a generative model of the causal processes that give rise to it. In this conception, vision inverts a generative model through an interrogation of the sensory evidence in a process often thought to involve top-down predictions of sensory data to evaluate the likelihood of alternative hypotheses. The authors include scientists rooted in roughly equal numbers in each of the conceptions and motivated to overcome what might be a false dichotomy between them and engage the other perspective in the realm of theory and experiment. The primate brain employs an unknown algorithm that may combine the advantages of both conceptions. We explain and clarify the terminology, review the key empirical evidence, and propose an empirical research program that transcends the dichotomy and sets the stage for revealing the mysterious hybrid algorithm of primate vision.

14.

Seeing social interactions.

McMahon, Emalie; Isik, Leyla.

Trends Cogn Sci ; 27(12): 1165-1179, 2023 12.

Artigo em Inglês | MEDLINE | ID: mdl-37805385

RESUMO

Seeing the interactions between other people is a critical part of our everyday visual experience, but recognizing the social interactions of others is often considered outside the scope of vision and grouped with higher-level social cognition like theory of mind. Recent work, however, has revealed that recognition of social interactions is efficient and automatic, is well modeled by bottom-up computational algorithms, and occurs in visually-selective regions of the brain. We review recent evidence from these three methodologies (behavioral, computational, and neural) that converge to suggest the core of social interaction perception is visual. We propose a computational framework for how this process is carried out in the brain and offer directions for future interdisciplinary investigations of social perception.

Assuntos

Interação Social , Percepção Social , Humanos , Encéfalo , Cognição

15.

Relational visual representations underlie human social interaction recognition.

Malik, Manasi; Isik, Leyla.

Nat Commun ; 14(1): 7317, 2023 11 11.

Artigo em Inglês | MEDLINE | ID: mdl-37951960

RESUMO

Humans effortlessly recognize social interactions from visual input. Attempts to model this ability have typically relied on generative inverse planning models, which make predictions by inverting a generative model of agents' interactions based on their inferred goals, suggesting humans use a similar process of mental inference to recognize interactions. However, growing behavioral and neuroscience evidence suggests that recognizing social interactions is a visual process, separate from complex mental state inference. Yet despite their success in other domains, visual neural network models have been unable to reproduce human-like interaction recognition. We hypothesize that humans rely on relational visual information in particular, and develop a relational, graph neural network model, SocialGNN. Unlike prior models, SocialGNN accurately predicts human interaction judgments across both animated and natural videos. These results suggest that humans can make complex social interaction judgments without an explicit model of the social and physical world, and that structured, relational visual representations are key to this behavior.

Assuntos

Reconhecimento Psicológico , Interação Social , Humanos , Julgamento , Redes Neurais de Computação

16.

Hierarchical organization of social action features along the lateral visual pathway.

McMahon, Emalie; Bonner, Michael F; Isik, Leyla.

Curr Biol ; 33(23): 5035-5047.e8, 2023 12 04.

Artigo em Inglês | MEDLINE | ID: mdl-37918399

RESUMO

Recent theoretical work has argued that in addition to the classical ventral (what) and dorsal (where/how) visual streams, there is a third visual stream on the lateral surface of the brain specialized for processing social information. Like visual representations in the ventral and dorsal streams, representations in the lateral stream are thought to be hierarchically organized. However, no prior studies have comprehensively investigated the organization of naturalistic, social visual content in the lateral stream. To address this question, we curated a naturalistic stimulus set of 250 3-s videos of two people engaged in everyday actions. Each clip was richly annotated for its low-level visual features, mid-level scene and object properties, visual social primitives (including the distance between people and the extent to which they were facing), and high-level information about social interactions and affective content. Using a condition-rich fMRI experiment and a within-subject encoding model approach, we found that low-level visual features are represented in early visual cortex (EVC) and middle temporal (MT) area, mid-level visual social features in extrastriate body area (EBA) and lateral occipital complex (LOC), and high-level social interaction information along the superior temporal sulcus (STS). Communicative interactions, in particular, explained unique variance in regions of the STS after accounting for variance explained by all other labeled features. Taken together, these results provide support for representation of increasingly abstract social visual content-consistent with hierarchical organization-along the lateral visual stream and suggest that recognizing communicative actions may be a key computational goal of the lateral visual pathway.

Assuntos

Córtex Visual , Humanos , Vias Visuais , Reconhecimento Visual de Modelos , Lobo Temporal , Encéfalo , Imageamento por Ressonância Magnética/métodos , Mapeamento Encefálico/métodos , Estimulação Luminosa/métodos

17.

A data-driven investigation of human action representations.

Dima, Diana C; Hebart, Martin N; Isik, Leyla.

Sci Rep ; 13(1): 5171, 2023 03 30.

Artigo em Inglês | MEDLINE | ID: mdl-36997625

RESUMO

Understanding actions performed by others requires us to integrate different types of information about people, scenes, objects, and their interactions. What organizing dimensions does the mind use to make sense of this complex action space? To address this question, we collected intuitive similarity judgments across two large-scale sets of naturalistic videos depicting everyday actions. We used cross-validated sparse non-negative matrix factorization to identify the structure underlying action similarity judgments. A low-dimensional representation, consisting of nine to ten dimensions, was sufficient to accurately reconstruct human similarity judgments. The dimensions were robust to stimulus set perturbations and reproducible in a separate odd-one-out experiment. Human labels mapped these dimensions onto semantic axes relating to food, work, and home life; social axes relating to people and emotions; and one visual axis related to scene setting. While highly interpretable, these dimensions did not share a clear one-to-one correspondence with prior hypotheses of action-relevant dimensions. Together, our results reveal a low-dimensional set of robust and interpretable dimensions that organize intuitive action similarity judgments and highlight the importance of data-driven investigations of behavioral representations.

Assuntos

Reconhecimento Visual de Modelos , Semântica , Humanos , Julgamento , Emoções , Atividades Humanas

18.

Social-affective features drive human representations of observed actions.

Dima, Diana C; Tomita, Tyler M; Honey, Christopher J; Isik, Leyla.

Elife ; 112022 05 24.

Artigo em Inglês | MEDLINE | ID: mdl-35608254

RESUMO

Humans observe actions performed by others in many different visual and social settings. What features do we extract and attend when we view such complex scenes, and how are they processed in the brain? To answer these questions, we curated two large-scale sets of naturalistic videos of everyday actions and estimated their perceived similarity in two behavioral experiments. We normed and quantified a large range of visual, action-related, and social-affective features across the stimulus sets. Using a cross-validated variance partitioning analysis, we found that social-affective features predicted similarity judgments better than, and independently of, visual and action features in both behavioral experiments. Next, we conducted an electroencephalography experiment, which revealed a sustained correlation between neural responses to videos and their behavioral similarity. Visual, action, and social-affective features predicted neural patterns at early, intermediate, and late stages, respectively, during this behaviorally relevant time window. Together, these findings show that social-affective features are important for perceiving naturalistic actions and are extracted at the final stage of a temporal gradient in the brain.

Assuntos

Mapeamento Encefálico , Encéfalo , Encéfalo/fisiologia , Eletroencefalografia , Humanos , Julgamento/fisiologia , Estimulação Luminosa , Percepção Visual/fisiologia

19.

The neurodevelopmental origins of seeing social interactions.

McMahon, Emalie; Isik, Leyla.

Trends Cogn Sci ; 28(3): 195-196, 2024 03.

Artigo em Inglês | MEDLINE | ID: mdl-38296745

Assuntos

Interação Social , Humanos

20.

Abstract social interaction representations along the lateral pathway.

McMahon, Emalie; Isik, Leyla.

Trends Cogn Sci ; 28(5): 392-393, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38632007

Assuntos

Interação Social , Humanos , Vias Neurais/fisiologia , Encéfalo/fisiologia

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA