Pesquisa | Biblioteca Virtual em Saúde

1.

Stacked regressions and structured variance partitioning for interpretable brain maps.

Lin, Ruogu; Naselaris, Thomas; Kay, Kendrick; Wehbe, Leila.

Neuroimage ; 298: 120772, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39117095

RESUMO

Relating brain activity associated with a complex stimulus to different properties of that stimulus is a powerful approach for constructing functional brain maps. However, when stimuli are naturalistic, their properties are often correlated (e.g., visual and semantic features of natural images, or different layers of a convolutional neural network that are used as features of images). Correlated properties can act as confounders for each other and complicate the interpretability of brain maps, and can impact the robustness of statistical estimators. Here, we present an approach for brain mapping based on two proposed methods: stacking different encoding models and structured variance partitioning. Our stacking algorithm combines encoding models that each uses as input a feature space that describes a different stimulus attribute. The algorithm learns to predict the activity of a voxel as a linear combination of the outputs of different encoding models. We show that the resulting combined model can predict held-out brain activity better or at least as well as the individual encoding models. Further, the weights of the linear combination are readily interpretable; they show the importance of each feature space for predicting a voxel. We then build on our stacking models to introduce structured variance partitioning, a new type of variance partitioning that takes into account the known relationships between features. Our approach constrains the size of the hypothesis space and allows us to ask targeted questions about the similarity between feature spaces and brain regions even in the presence of correlations between the feature spaces. We validate our approach in simulation, showcase its brain mapping potential on fMRI data, and release a Python package. Our methods can be useful for researchers interested in aligning brain activity with different layers of a neural network, or with other types of correlated feature spaces.

Assuntos

Algoritmos , Mapeamento Encefálico , Encéfalo , Imageamento por Ressonância Magnética , Humanos , Mapeamento Encefálico/métodos , Imageamento por Ressonância Magnética/métodos , Encéfalo/fisiologia , Encéfalo/diagnóstico por imagem , Processamento de Imagem Assistida por Computador/métodos

2.

Temporal asymmetry of neural representations predicts memory decisions.

Ye, Zhifang; Zhao, Yufei; Allen, Emily J; Naselaris, Thomas; Kay, Kendrick; Hutchinson, J Benjamin; Kuhl, Brice A.

bioRxiv ; 2024 Jul 19.

Artigo em Inglês | MEDLINE | ID: mdl-39071351

RESUMO

A stimulus can be familiar for multiple reasons. It might have been recently encountered, or is similar to recent experience, or is similar to 'typical' experience. Understanding how the brain translates these sources of similarity into memory decisions is a fundamental, but challenging goal. Here, using fMRI, we computed neural similarity between a current stimulus and events from different temporal windows in the past and future (from seconds to days). We show that trial-by-trial memory decisions (is this stimulus 'old'?) were predicted by the difference in similarity to past vs. future events (temporal asymmetry). This relationship was (i) evident in lateral parietal and occipitotemporal cortices, (ii) strongest when considering events from the recent past (minutes ago), and (iii) most pronounced when veridical (true) memories were weak. These findings suggest a new perspective in which the brain supports memory decisions by comparing what actually occurred to what is likely to occur.

3.

Disentangling signal and noise in neural responses through generative modeling.

Kay, Kendrick; Prince, Jacob S; Gebhart, Thomas; Tuckute, Greta; Zhou, Jingyang; Naselaris, Thomas; Schutt, Heiko.

bioRxiv ; 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-38712051

RESUMO

Measurements of neural responses to identically repeated experimental events often exhibit large amounts of variability. This noise is distinct from signal , operationally defined as the average expected response across repeated trials for each given event. Accurately distinguishing signal from noise is important, as each is a target that is worthy of study (many believe noise reflects important aspects of brain function) and it is important not to confuse one for the other. Here, we describe a principled modeling approach in which response measurements are explicitly modeled as the sum of samples from multivariate signal and noise distributions. In our proposed method-termed Generative Modeling of Signal and Noise (GSN)-the signal distribution is estimated by subtracting the estimated noise distribution from the estimated data distribution. Importantly, GSN improves estimates of the signal distribution, but does not provide improved estimates of responses to individual events. We validate GSN using ground-truth simulations and show that it compares favorably with related methods. We also demonstrate the application of GSN to empirical fMRI data to illustrate a simple consequence of GSN: by disentangling signal and noise components in neural responses, GSN denoises principal components analysis and improves estimates of dimensionality. We end by discussing other situations that may benefit from GSN's characterization of signal and noise, such as estimation of noise ceilings for computational models of neural activity. A code toolbox for GSN is provided with both MATLAB and Python implementations.

4.

How does the primate brain combine generative and discriminative computations in vision?

Peters, Benjamin; DiCarlo, James J; Gureckis, Todd; Haefner, Ralf; Isik, Leyla; Tenenbaum, Joshua; Konkle, Talia; Naselaris, Thomas; Stachenfeld, Kimberly; Tavares, Zenna; Tsao, Doris; Yildirim, Ilker; Kriegeskorte, Nikolaus.

ArXiv ; 2024 Jan 11.

Artigo em Inglês | MEDLINE | ID: mdl-38259351

RESUMO

Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remove irrelevant variation and represent behaviorally relevant information in a format suitable for downstream functions of cognition and behavioral control. In this conception, vision is driven by the sensory data, and perception is direct because the processing proceeds from the data to the latent variables of interest. The notion of "inference" in this conception is that of the engineering literature on neural networks, where feedforward convolutional neural networks processing images are said to perform inference. The alternative conception is that of vision as an inference process in Helmholtz's sense, where the sensory evidence is evaluated in the context of a generative model of the causal processes that give rise to it. In this conception, vision inverts a generative model through an interrogation of the sensory evidence in a process often thought to involve top-down predictions of sensory data to evaluate the likelihood of alternative hypotheses. The authors include scientists rooted in roughly equal numbers in each of the conceptions and motivated to overcome what might be a false dichotomy between them and engage the other perspective in the realm of theory and experiment. The primate brain employs an unknown algorithm that may combine the advantages of both conceptions. We explain and clarify the terminology, review the key empirical evidence, and propose an empirical research program that transcends the dichotomy and sets the stage for revealing the mysterious hybrid algorithm of primate vision.

5.

Re-expression of CA1 and entorhinal activity patterns preserves temporal context memory at long timescales.

Zou, Futing; Wanjia, Guo; Allen, Emily J; Wu, Yihan; Charest, Ian; Naselaris, Thomas; Kay, Kendrick; Kuhl, Brice A; Hutchinson, J Benjamin; DuBrow, Sarah.

Nat Commun ; 14(1): 4350, 2023 07 19.

Artigo em Inglês | MEDLINE | ID: mdl-37468489

RESUMO

Converging, cross-species evidence indicates that memory for time is supported by hippocampal area CA1 and entorhinal cortex. However, limited evidence characterizes how these regions preserve temporal memories over long timescales (e.g., months). At long timescales, memoranda may be encountered in multiple temporal contexts, potentially creating interference. Here, using 7T fMRI, we measured CA1 and entorhinal activity patterns as human participants viewed thousands of natural scene images distributed, and repeated, across many months. We show that memory for an image's original temporal context was predicted by the degree to which CA1/entorhinal activity patterns from the first encounter with an image were re-expressed during re-encounters occurring minutes to months later. Critically, temporal memory signals were dissociable from predictors of recognition confidence, which were carried by distinct medial temporal lobe expressions. These findings suggest that CA1 and entorhinal cortex preserve temporal memories across long timescales by coding for and reinstating temporal context information.

Assuntos

Córtex Entorrinal , Hipocampo , Humanos , Lobo Temporal/diagnóstico por imagem , Imageamento por Ressonância Magnética , Reconhecimento Psicológico

6.

Second Sight: Using brain-optimized encoding models to align image distributions with human brain activity.

Kneeland, Reese; Ojeda, Jordyn; St-Yves, Ghislain; Naselaris, Thomas.

ArXiv ; 2023 Jun 01.

Artigo em Inglês | MEDLINE | ID: mdl-37396609

RESUMO

Two recent developments have accelerated progress in image reconstruction from human brain activity: large datasets that offer samples of brain activity in response to many thousands of natural scenes, and the open-sourcing of powerful stochastic image-generators that accept both low- and high-level guidance. Most work in this space has focused on obtaining point estimates of the target image, with the ultimate goal of approximating literal pixel-wise reconstructions of target images from the brain activity patterns they evoke. This emphasis belies the fact that there is always a family of images that are equally compatible with any evoked brain activity pattern, and the fact that many image-generators are inherently stochastic and do not by themselves offer a method for selecting the single best reconstruction from among the samples they generate. We introduce a novel reconstruction procedure (Second Sight) that iteratively refines an image distribution to explicitly maximize the alignment between the predictions of a voxel-wise encoding model and the brain activity patterns evoked by any target image. We use an ensemble of brain-optimized deep neural networks trained on the Natural Scenes Dataset (NSD) as our encoding model, and a latent diffusion model as our image generator. At each iteration, we generate a small library of images and select those that best approximate the measured brain activity when passed through our encoding model. We extract semantic and structural guidance from the selected images, used for generating the next library. We show that this process converges on a distribution of high-quality reconstructions by refining both semantic content and low-level image details across iterations. Images sampled from these converged image distributions are competitive with state-of-the-art reconstruction algorithms. Interestingly, the time-to-convergence varies systematically across visual cortex, with earlier visual areas generally taking longer and converging on narrower image distributions, relative to higher-level brain areas. Second Sight thus offers a succinct and novel method for exploring the diversity of representations across visual brain areas.

7.

Brain-optimized deep neural network models of human visual areas learn non-hierarchical representations.

St-Yves, Ghislain; Allen, Emily J; Wu, Yihan; Kay, Kendrick; Naselaris, Thomas.

Nat Commun ; 14(1): 3329, 2023 06 07.

Artigo em Inglês | MEDLINE | ID: mdl-37286563

RESUMO

Deep neural networks (DNNs) optimized for visual tasks learn representations that align layer depth with the hierarchy of visual areas in the primate brain. One interpretation of this finding is that hierarchical representations are necessary to accurately predict brain activity in the primate visual system. To test this interpretation, we optimized DNNs to directly predict brain activity measured with fMRI in human visual areas V1-V4. We trained a single-branch DNN to predict activity in all four visual areas jointly, and a multi-branch DNN to predict each visual area independently. Although it was possible for the multi-branch DNN to learn hierarchical representations, only the single-branch DNN did so. This result shows that hierarchical representations are not necessary to accurately predict human brain activity in V1-V4, and that DNNs that encode brain-like visual representations may differ widely in their architecture, ranging from strict serial hierarchies to multiple independent branches.

Assuntos

Encéfalo , Redes Neurais de Computação , Animais , Humanos , Encéfalo/diagnóstico por imagem , Aprendizagem , Imageamento por Ressonância Magnética , Primatas

8.

Stacked regressions and structured variance partitioning for interpretable brain maps.

Lin, Ruogu; Naselaris, Thomas; Kay, Kendrick; Wehbe, Leila.

bioRxiv ; 2023 Apr 24.

Artigo em Inglês | MEDLINE | ID: mdl-37163111

RESUMO

Relating brain activity associated with a complex stimulus to different properties of that stimulus is a powerful approach for constructing functional brain maps. However, when stimuli are naturalistic, their properties are often correlated (e.g., visual and semantic features of natural images, or different layers of a convolutional neural network that are used as features of images). Correlated properties can act as confounders for each other and complicate the interpretability of brain maps, and can impact the robustness of statistical estimators. Here, we present an approach for brain mapping based on two proposed methods: stacking different encoding models and structured variance partitioning. Our stacking algorithm combines encoding models that each use as input a feature space that describes a different stimulus attribute. The algorithm learns to predict the activity of a voxel as a linear combination of the outputs of different encoding models. We show that the resulting combined model can predict held-out brain activity better or at least as well as the individual encoding models. Further, the weights of the linear combination are readily interpretable; they show the importance of each feature space for predicting a voxel. We then build on our stacking models to introduce structured variance partitioning, a new type of variance partitioning that takes into account the known relationships between features. Our approach constrains the size of the hypothesis space and allows us to ask targeted questions about the similarity between feature spaces and brain regions even in the presence of correlations between the feature spaces. We validate our approach in simulation, showcase its brain mapping potential on fMRI data, and release a Python package. Our methods can be useful for researchers interested in aligning brain activity with different layers of a neural network, or with other types of correlated feature spaces.

9.

Reconstructing seen images from human brain activity via guided stochastic search.

Kneeland, Reese; Ojeda, Jordyn; St-Yves, Ghislain; Naselaris, Thomas.

ArXiv ; 2023 May 02.

Artigo em Inglês | MEDLINE | ID: mdl-37205268

RESUMO

Visual reconstruction algorithms are an interpretive tool that map brain activity to pixels. Past reconstruction algorithms employed brute-force search through a massive library to select candidate images that, when passed through an encoding model, accurately predict brain activity. Here, we use conditional generative diffusion models to extend and improve this search-based strategy. We decode a semantic descriptor from human brain activity (7T fMRI) in voxels across most of visual cortex, then use a diffusion model to sample a small library of images conditioned on this descriptor. We pass each sample through an encoding model, select the images that best predict brain activity, and then use these images to seed another library. We show that this process converges on high-quality reconstructions by refining low-level image details while preserving semantic content across iterations. Interestingly, the time-to-convergence differs systematically across visual cortex, suggesting a succinct new way to measure the diversity of representations across visual brain areas.

10.

Color-biased regions in the ventral visual pathway are food selective.

Pennock, Ian M L; Racey, Chris; Allen, Emily J; Wu, Yihan; Naselaris, Thomas; Kay, Kendrick N; Franklin, Anna; Bosten, Jenny M.

Curr Biol ; 33(1): 134-146.e4, 2023 01 09.

Artigo em Inglês | MEDLINE | ID: mdl-36574774

RESUMO

Color-biased regions have been found between face- and place-selective areas in the ventral visual pathway. To investigate the function of the color-biased regions in a pathway responsible for object recognition, we analyzed the natural scenes dataset (NSD), a large 7T fMRI dataset from 8 participants who each viewed up to 30,000 trials of images of colored natural scenes over more than 30 scanning sessions. In a whole-brain analysis, we correlated the average color saturation of the images with voxel responses, revealing color-biased regions that diverge into two streams, beginning in V4 and extending medially and laterally relative to the fusiform face area in both hemispheres. We drew regions of interest (ROIs) for the two streams and found that the images for each ROI that evoked the largest responses had certain characteristics: they contained food, circular objects, warmer hues, and had higher color saturation. Further analyses showed that food images were the strongest predictor of activity in these regions, implying the existence of medial and lateral ventral food streams (VFSs). We found that color also contributed independently to voxel responses, suggesting that the medial and lateral VFSs use both color and form to represent food. Our findings illustrate how high-resolution datasets such as the NSD can be used to disentangle the multifaceted contributions of many visual features to the neural representations of natural scenes.

Assuntos

Vias Visuais , Percepção Visual , Humanos , Vias Visuais/fisiologia , Percepção Visual/fisiologia , Encéfalo/fisiologia , Mapeamento Encefálico , Imageamento por Ressonância Magnética , Reconhecimento Visual de Modelos/fisiologia , Estimulação Luminosa

11.

Brain-optimized inference improves reconstructions of fMRI brain activity.

Kneeland, Reese; Ojeda, Jordyn; St-Yves, Ghislain; Naselaris, Thomas.

ArXiv ; 2023 Dec 12.

Artigo em Inglês | MEDLINE | ID: mdl-38168454

RESUMO

The release of large datasets and developments in AI have led to dramatic improvements in decoding methods that reconstruct seen images from human brain activity. We evaluate the prospect of further improving recent decoding methods by optimizing for consistency between reconstructions and brain activity during inference. We sample seed reconstructions from a base decoding method, then iteratively refine these reconstructions using a brain-optimized encoding model that maps images to brain activity. At each iteration, we sample a small library of images from an image distribution (a diffusion model) conditioned on a seed reconstruction from the previous iteration. We select those that best approximate the measured brain activity when passed through our encoding model, and use these images for structural guidance during the generation of the small library in the next iteration. We reduce the stochasticity of the image distribution at each iteration, and stop when a criterion on the "width" of the image distribution is met. We show that when this process is applied to recent decoding methods, it outperforms the base decoding method as measured by human raters, a variety of image feature metrics, and alignment to brain activity. These results demonstrate that reconstruction quality can be significantly improved by explicitly aligning decoding distributions to brain activity distributions, even when the seed reconstruction is output from a state-of-the-art decoding algorithm. Interestingly, the rate of refinement varies systematically across visual cortex, with earlier visual areas generally converging more slowly and preferring narrower image distributions, relative to higher-level brain areas. Brain-optimized inference thus offers a succinct and novel method for improving reconstructions and exploring the diversity of representations across visual brain areas.

12.

Modulation of Spectral Representation and Connectivity Patterns in Response to Visual Narrative in the Human Brain.

Sabra, Zahraa; Alawieh, Ali; Bonilha, Leonardo; Naselaris, Thomas; AuYong, Nicholas.

Front Hum Neurosci ; 16: 886938, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-36277048

RESUMO

The regional brain networks and the underlying neurophysiological mechanisms subserving the cognition of visual narrative in humans have largely been studied with non-invasive brain recording. In this study, we specifically investigated how regional and cross-regional cortical activities support visual narrative interpretation using intracranial stereotactic electroencephalograms recordings from thirteen human subjects (6 females, and 7 males). Widely distributed recording sites across the brain were sampled while subjects were explicitly instructed to observe images from fables presented in "sequential" order, and a set of images drawn from multiple fables presented in "scrambled" order. Broadband activity mainly within the frontal and temporal lobes were found to encode if a presented image is part of a visual narrative (sequential) or random image set (scrambled). Moreover, the temporal lobe exhibits strong activation in response to visual narratives while the frontal lobe is more engaged when contextually novel stimuli are presented. We also investigated the dynamics of interregional interactions between visual narratives and contextually novel series of images. Interestingly, the interregional connectivity is also altered between sequential and scrambled sequences. Together, these results suggest that both changes in regional neuronal activity and cross-regional interactions subserve visual narrative and contextual novelty processing.

13.

Multiple traces and altered signal-to-noise in systems consolidation: Evidence from the 7T fMRI Natural Scenes Dataset.

Vanasse, Thomas J; Boly, Melanie; Allen, Emily J; Wu, Yihan; Naselaris, Thomas; Kay, Kendrick; Cirelli, Chiara; Tononi, Giulio.

Proc Natl Acad Sci U S A ; 119(44): e2123426119, 2022 11.

Artigo em Inglês | MEDLINE | ID: mdl-36279446

RESUMO

The brain mechanisms of memory consolidation remain elusive. Here, we examine blood-oxygen-level-dependent (BOLD) correlates of image recognition through the scope of multiple influential systems consolidation theories. We utilize the longitudinal Natural Scenes Dataset, a 7-Tesla functional magnetic resonance imaging human study in which â¼135,000 trials of image recognition were conducted over the span of a year among eight subjects. We find that early- and late-stage image recognition associates with both medial temporal lobe (MTL) and visual cortex when evaluating regional activations and a multivariate classifier. Supporting multiple-trace theory (MTT), parts of the MTL activation time course show remarkable fit to a 20-y-old MTT time-dynamical model predicting early trace intensity increases and slight subsequent interference (R2 > 0.90). These findings contrast a simplistic, yet common, view that memory traces are transferred from MTL to cortex. Next, we test the hypothesis that the MTL trace signature of memory consolidation should also reflect synaptic "desaturation," as evidenced by an increased signal-to-noise ratio. We find that the magnitude of relative BOLD enhancement among surviving memories is positively linked to the rate of removal (i.e., forgetting) of competing traces. Moreover, an image-feature and time interaction of MTL and visual cortex functional connectivity suggests that consolidation mechanisms improve the specificity of a distributed trace. These neurobiological effects do not replicate on a shorter timescale (within a session), implicating a prolonged, offline process. While recognition can potentially involve cognitive processes outside of memory retrieval (e.g., re-encoding), our work largely favors MTT and desaturation as perhaps complementary consolidative memory mechanisms.

Assuntos

Mapeamento Encefálico , Imageamento por Ressonância Magnética , Humanos , Testes Neuropsicológicos , Lobo Temporal/fisiologia , Oxigênio

14.

A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence.

Allen, Emily J; St-Yves, Ghislain; Wu, Yihan; Breedlove, Jesse L; Prince, Jacob S; Dowdle, Logan T; Nau, Matthias; Caron, Brad; Pestilli, Franco; Charest, Ian; Hutchinson, J Benjamin; Naselaris, Thomas; Kay, Kendrick.

Nat Neurosci ; 25(1): 116-126, 2022 01.

Artigo em Inglês | MEDLINE | ID: mdl-34916659

RESUMO

Extensive sampling of neural activity during rich cognitive phenomena is critical for robust understanding of brain function. Here we present the Natural Scenes Dataset (NSD), in which high-resolution functional magnetic resonance imaging responses to tens of thousands of richly annotated natural scenes were measured while participants performed a continuous recognition task. To optimize data quality, we developed and applied novel estimation and denoising techniques. Simple visual inspections of the NSD data reveal clear representational transformations along the ventral visual pathway. Further exemplifying the inferential power of the dataset, we used NSD to build and train deep neural network models that predict brain activity more accurately than state-of-the-art models from computer vision. NSD also includes substantial resting-state and diffusion data, enabling network neuroscience perspectives to constrain and enhance models of perception and memory. Given its unprecedented scale, quality and breadth, NSD opens new avenues of inquiry in cognitive neuroscience and artificial intelligence.

Assuntos

Neurociência Cognitiva , Imageamento por Ressonância Magnética , Inteligência Artificial , Encéfalo/diagnóstico por imagem , Encéfalo/fisiologia , Mapeamento Encefálico/métodos , Humanos , Imageamento por Ressonância Magnética/métodos , Redes Neurais de Computação , Reconhecimento Psicológico

15.

NeuroGen: Activation optimized image synthesis for discovery neuroscience.

Gu, Zijin; Jamison, Keith Wakefield; Khosla, Meenakshi; Allen, Emily J; Wu, Yihan; St-Yves, Ghislain; Naselaris, Thomas; Kay, Kendrick; Sabuncu, Mert R; Kuceyeski, Amy.

Neuroimage ; 247: 118812, 2022 02 15.

Artigo em Inglês | MEDLINE | ID: mdl-34936922

RESUMO

Functional MRI (fMRI) is a powerful technique that has allowed us to characterize visual cortex responses to stimuli, yet such experiments are by nature constructed based on a priori hypotheses, limited to the set of images presented to the individual while they are in the scanner, are subject to noise in the observed brain responses, and may vary widely across individuals. In this work, we propose a novel computational strategy, which we call NeuroGen, to overcome these limitations and develop a powerful tool for human vision neuroscience discovery. NeuroGen combines an fMRI-trained neural encoding model of human vision with a deep generative network to synthesize images predicted to achieve a target pattern of macro-scale brain activation. We demonstrate that the reduction of noise that the encoding model provides, coupled with the generative network's ability to produce images of high fidelity, results in a robust discovery architecture for visual neuroscience. By using only a small number of synthetic images created by NeuroGen, we demonstrate that we can detect and amplify differences in regional and individual human brain response patterns to visual stimuli. We then verify that these discoveries are reflected in the several thousand observed image responses measured with fMRI. We further demonstrate that NeuroGen can create synthetic images predicted to achieve regional response patterns not achievable by the best-matching natural images. The NeuroGen framework extends the utility of brain encoding models and opens up a new avenue for exploring, and possibly precisely controlling, the human visual system.

Assuntos

Aprendizado Profundo , Processamento de Imagem Assistida por Computador/métodos , Imageamento por Ressonância Magnética/métodos , Córtex Visual/diagnóstico por imagem , Córtex Visual/fisiologia , Conjuntos de Dados como Assunto , Humanos , Aumento da Imagem/métodos

16.

Voxel-to-voxel predictive models reveal unexpected structure in unexplained variance.

Mell, Maggie Mae; St-Yves, Ghislain; Naselaris, Thomas.

Neuroimage ; 238: 118266, 2021 09.

Artigo em Inglês | MEDLINE | ID: mdl-34129949

RESUMO

Encoding models based on deep convolutional neural networks (DCNN) predict BOLD responses to natural scenes in the human visual system more accurately than many other currently available models. However, DCNN-based encoding models fail to predict a significant amount of variance in the activity of most voxels in all visual areas. This failure could reflect limitations in the data (e.g., a noise ceiling), or could reflect limitations of the DCNN as a model of computation in the brain. Understanding the source and structure of the unexplained variance could therefore provide helpful clues for improving models of brain computation. Here, we characterize the structure of the variance that DCNN-based encoding models cannot explain. Using a publicly available dataset of BOLD responses to natural scenes, we determined if the source of unexplained variance was shared across voxels, individual brains, retinotopic locations, and hierarchically distant visual brain areas. We answered these questions using voxel-to-voxel (vox2vox) models that predict activity in a target voxel given activity in a population of source voxels. We found that simple linear vox2vox models increased within-subject prediction accuracy over DCNN-based models for any pair of source/target visual areas, clearly demonstrating that the source of unexplained variance is widely shared within and across visual brain areas. However, vox2vox models were not more accurate than DCNN-based encoding models when source and target voxels came from different brains, demonstrating that the source of unexplained variance was not shared across brains. Importantly, control analyses demonstrated that the source of unexplained variance was not encoded in the mean activity of source voxels, or the activity of voxels in white matter. Interestingly, the weights of vox2vox models revealed preferential connection of target voxel activity to source voxels with adjacent receptive fields, even when source and target voxels were in different functional brain areas. Finally, we found that the prediction accuracy of the vox2vox models decayed with hierarchical distance between the source and target voxels but showed detailed patterns of dependence on hierarchical relationships that we did not observe in DCNNs. Given these results, we argue that the structured variance unexplained by DCNN-based encoding models is unlikely to be entirely caused by non-neural artifacts (e.g., spatially correlated measurement noise) or a failure of DCNNs to approximate the features encoded in brain activity; rather, our results point to a need for brain models that provide both mechanistic and computational explanations for structured ongoing activity in the brain. Keywords: fMRI, encoding models, deep neural networks, functional connectivity.

Assuntos

Mapeamento Encefálico/métodos , Encéfalo/diagnóstico por imagem , Modelos Neurológicos , Humanos , Processamento de Imagem Assistida por Computador/métodos , Imageamento por Ressonância Magnética

17.

Generative Feedback Explains Distinct Brain Activity Codes for Seen and Mental Images.

Breedlove, Jesse L; St-Yves, Ghislain; Olman, Cheryl A; Naselaris, Thomas.

Curr Biol ; 30(12): 2211-2224.e6, 2020 06 22.

Artigo em Inglês | MEDLINE | ID: mdl-32359428

RESUMO

The relationship between mental imagery and vision is a long-standing problem in neuroscience. Currently, it is not known whether differences between the activity evoked during vision and reinstated during imagery reflect different codes for seen and mental images. To address this problem, we modeled mental imagery in the human brain as feedback in a hierarchical generative network. Such networks synthesize images by feeding abstract representations from higher to lower levels of the network hierarchy. When higher processing levels are less sensitive to stimulus variation than lower processing levels, as in the human brain, activity in low-level visual areas should encode variation in mental images with less precision than seen images. To test this prediction, we conducted an fMRI experiment in which subjects imagined and then viewed hundreds of spatially varying naturalistic stimuli. To analyze these data, we developed imagery-encoding models. These models accurately predicted brain responses to imagined stimuli and enabled accurate decoding of their position and content. They also allowed us to compare, for every voxel, tuning to seen and imagined spatial frequencies, as well as the location and size of receptive fields in visual and imagined space. We confirmed our prediction, showing that, in low-level visual areas, imagined spatial frequencies in individual voxels are reduced relative to seen spatial frequencies and that receptive fields in imagined space are larger than in visual space. These findings reveal distinct codes for seen and mental images and link mental imagery to the computational abilities of generative networks.

Assuntos

Retroalimentação Psicológica , Imaginação/fisiologia , Córtex Visual/fisiologia , Percepção Visual/fisiologia , Adulto , Feminino , Humanos , Imageamento por Ressonância Magnética , Modelos Psicológicos

18.

Spectral Encoding of Seen and Attended Object Categories in the Human Brain.

Sabra, Zahraa; Bonilha, Leonardo; Naselaris, Thomas.

J Neurosci ; 40(2): 327-342, 2020 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-31694964

RESUMO

Local field potentials (LFPs) encode visual information via variations in power at many frequencies. These variations are complex and depend on stimulus and cognitive state in ways that have yet to be fully characterized. Specifically, the frequencies (or combinations of frequencies) that most robustly encode specific types of visual information are not fully known. To address this knowledge gap, we used intracranial EEG to record LFPs at 858 widely distributed recording sites as human subjects (six males, five females) indicated whether briefly presented natural scenes depicted one of three attended object categories. Principal component analysis applied to power spectra of the LFPs near stimulus onset revealed a broadband component (1-100 Hz) and two narrowband components (1-8 and 8-30 Hz, respectively) that encoded information about both seen and attended categories. Interestingly, we found that seen and attended categories were not encoded with the same fidelity by these distinct spectral components. Model-based tuning and decoding analyses revealed that power variations along the broadband component were most sharply tuned and offered more accurate decoding for seen than for attended categories. Power along the narrowband delta-theta (1-8 Hz) component robustly decoded information about both seen and attended categories, while the alpha-beta (8-30 Hz) component was specialized for attention. We conclude that, when viewing natural scenes, information about the seen category is encoded via broadband and sub-gamma (<30 Hz) power variations, while the attended category is most robustly encoded in the sub-gamma range. More generally, these results suggest that power variation along different spectral components can encode qualitatively different kinds of visual information.SIGNIFICANCE STATEMENT In this article, we characterize how changes in visual stimuli depicting specific objects (cars, faces, and buildings) and changes in attention to those objects affect the frequency content of local field potentials in the human brain. In contrast to many previous studies that have investigated encoding by variations in power at high (>30 Hz) frequencies, we find that the most important variation patterns are broadband (i.e., distributed across multiple frequencies) and narrowband, but in lower frequencies (<30 Hz). Interestingly, we find that seen and attended categories are not encoded with the same fidelity by these distinct spectral encoding patterns, suggesting that power at different frequencies can encode qualitatively different kinds of information.

Assuntos

Atenção/fisiologia , Encéfalo/fisiologia , Simulação por Computador , Modelos Neurológicos , Percepção Visual/fisiologia , Adolescente , Adulto , Eletroencefalografia , Potenciais Evocados Visuais/fisiologia , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estimulação Luminosa , Adulto Jovem

19.

Tracking the Development of Functional Connectomes for Face Processing.

Joseph, Jane E; Vanderweyen, Davy; Swearingen, Joshua; Vaughan, Brandon K; Novo, Derek; Zhu, Xun; Gebregziabher, Mulugeta; Bonilha, Leonardo; Bhatt, Ramesh; Naselaris, Thomas; Dean, Brian.

Brain Connect ; 9(2): 231-239, 2019 03.

Artigo em Inglês | MEDLINE | ID: mdl-30489152

RESUMO

Face processing capacities become more specialized and advanced during development, but neural underpinnings of these processes are not fully understood. The present study applied graph theory-based network analysis to task-negative (resting blocks) and task-positive (viewing faces) functional magnetic resonance imaging data in children (5-17 years) and adults (18-42 years) to test the hypothesis that the development of a specialized network for face processing is driven by task-positive processing (face viewing) more than by task-negative processing (visual fixation) and by both progressive and regressive changes in network properties. Predictive modeling was used to predict age from node-based network properties derived from task-positive and task-negative states in a whole-brain network (WBN) and a canonical face network (FN). The best-fitting model indicated that FN maturation was marked by both progressive and regressive changes in information diffusion (eigenvector centrality) in the task-positive state, with regressive changes outweighing progressive changes. Hence, FN maturation was characterized by reductions in information diffusion potentially reflecting the development of more specialized modules. In contrast, WBN maturation was marked by a balance of progressive and regressive changes in hub-connectivity (betweenness centrality) in the task-negative state. These findings suggest that the development of specialized networks like the FN depends on dynamic developmental changes associated with domain-specific information (e.g., face processing), but maturation of the brain as a whole can be predicted from task-free states.

Assuntos

Conectoma/métodos , Reconhecimento Facial/fisiologia , Adolescente , Adulto , Encéfalo/patologia , Encéfalo/fisiologia , Criança , Pré-Escolar , Simulação por Computador , Feminino , Humanos , Imageamento por Ressonância Magnética/métodos , Masculino , Rede Nervosa/diagnóstico por imagem , Rede Nervosa/fisiologia , Vias Neurais/patologia , Descanso

20.

Cognitive Computational Neuroscience: A New Conference for an Emerging Discipline.

Naselaris, Thomas; Bassett, Danielle S; Fletcher, Alyson K; Kording, Konrad; Kriegeskorte, Nikolaus; Nienborg, Hendrikje; Poldrack, Russell A; Shohamy, Daphna; Kay, Kendrick.

Trends Cogn Sci ; 22(5): 365-367, 2018 05.

Artigo em Inglês | MEDLINE | ID: mdl-29500078

RESUMO

Understanding the computational principles that underlie complex behavior is a central goal in cognitive science, artificial intelligence, and neuroscience. In an attempt to unify these disconnected communities, we created a new conference called Cognitive Computational Neuroscience (CCN). The inaugural meeting revealed considerable enthusiasm but significant obstacles remain.

Assuntos

Neurociência Cognitiva , Biologia Computacional , Congressos como Assunto , Inteligência Artificial , Humanos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA