Búsqueda | BVS CLAP/SMR-OPS/OMS

1.

Neural tracking measures of speech intelligibility: Manipulating intelligibility while keeping acoustics unchanged.

Karunathilake, I M Dushyanthi; Kulasingham, Joshua P; Simon, Jonathan Z.

Proc Natl Acad Sci U S A ; 120(49): e2309166120, 2023 Dec 05.

Artículo en Inglés | MEDLINE | ID: mdl-38032934

RESUMEN

Neural speech tracking has advanced our understanding of how our brains rapidly map an acoustic speech signal onto linguistic representations and ultimately meaning. It remains unclear, however, how speech intelligibility is related to the corresponding neural responses. Many studies addressing this question vary the level of intelligibility by manipulating the acoustic waveform, but this makes it difficult to cleanly disentangle the effects of intelligibility from underlying acoustical confounds. Here, using magnetoencephalography recordings, we study neural measures of speech intelligibility by manipulating intelligibility while keeping the acoustics strictly unchanged. Acoustically identical degraded speech stimuli (three-band noise-vocoded, ~20 s duration) are presented twice, but the second presentation is preceded by the original (nondegraded) version of the speech. This intermediate priming, which generates a "pop-out" percept, substantially improves the intelligibility of the second degraded speech passage. We investigate how intelligibility and acoustical structure affect acoustic and linguistic neural representations using multivariate temporal response functions (mTRFs). As expected, behavioral results confirm that perceived speech clarity is improved by priming. mTRFs analysis reveals that auditory (speech envelope and envelope onset) neural representations are not affected by priming but only by the acoustics of the stimuli (bottom-up driven). Critically, our findings suggest that segmentation of sounds into words emerges with better speech intelligibility, and most strongly at the later (~400 ms latency) word processing stage, in prefrontal cortex, in line with engagement of top-down mechanisms associated with priming. Taken together, our results show that word representations may provide some objective measures of speech comprehension.

Asunto(s)

Inteligibilidad del Habla , Percepción del Habla , Inteligibilidad del Habla/fisiología , Estimulación Acústica/métodos , Habla/fisiología , Ruido , Acústica , Magnetoencefalografía/métodos , Percepción del Habla/fisiología

2.

Effects of aging on cortical representations of continuous speech.

Karunathilake, I M Dushyanthi; Dunlap, Jason L; Perera, Janani; Presacco, Alessandro; Decruy, Lien; Anderson, Samira; Kuchinsky, Stefanie E; Simon, Jonathan Z.

J Neurophysiol ; 129(6): 1359-1377, 2023 06 01.

Artículo en Inglés | MEDLINE | ID: mdl-37096924

RESUMEN

Understanding speech in a noisy environment is crucial in day-to-day interactions and yet becomes more challenging with age, even for healthy aging. Age-related changes in the neural mechanisms that enable speech-in-noise listening have been investigated previously; however, the extent to which age affects the timing and fidelity of encoding of target and interfering speech streams is not well understood. Using magnetoencephalography (MEG), we investigated how continuous speech is represented in auditory cortex in the presence of interfering speech in younger and older adults. Cortical representations were obtained from neural responses that time-locked to the speech envelopes with speech envelope reconstruction and temporal response functions (TRFs). TRFs showed three prominent peaks corresponding to auditory cortical processing stages: early (â¼50 ms), middle (â¼100 ms), and late (â¼200 ms). Older adults showed exaggerated speech envelope representations compared with younger adults. Temporal analysis revealed both that the age-related exaggeration starts as early as â¼50 ms and that older adults needed a substantially longer integration time window to achieve their better reconstruction of the speech envelope. As expected, with increased speech masking envelope reconstruction for the attended talker decreased and all three TRF peaks were delayed, with aging contributing additionally to the reduction. Interestingly, for older adults the late peak was delayed, suggesting that this late peak may receive contributions from multiple sources. Together these results suggest that there are several mechanisms at play compensating for age-related temporal processing deficits at several stages but which are not able to fully reestablish unimpaired speech perception.NEW & NOTEWORTHY We observed age-related changes in cortical temporal processing of continuous speech that may be related to older adults' difficulty in understanding speech in noise. These changes occur in both timing and strength of the speech representations at different cortical processing stages and depend on both noise condition and selective attention. Critically, their dependence on noise condition changes dramatically among the early, middle, and late cortical processing stages, underscoring how aging differentially affects these stages.

Asunto(s)

Percepción del Habla , Habla , Habla/fisiología , Percepción Auditiva , Ruido , Percepción del Habla/fisiología , Estimulación Acústica/métodos

3.

Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers.

Brodbeck, Christian; Jiao, Alex; Hong, L Elliot; Simon, Jonathan Z.

PLoS Biol ; 18(10): e3000883, 2020 10.

Artículo en Inglés | MEDLINE | ID: mdl-33091003

RESUMEN

Humans are remarkably skilled at listening to one speaker out of an acoustic mixture of several speech sources. Two speakers are easily segregated, even without binaural cues, but the neural mechanisms underlying this ability are not well understood. One possibility is that early cortical processing performs a spectrotemporal decomposition of the acoustic mixture, allowing the attended speech to be reconstructed via optimally weighted recombinations that discount spectrotemporal regions where sources heavily overlap. Using human magnetoencephalography (MEG) responses to a 2-talker mixture, we show evidence for an alternative possibility, in which early, active segregation occurs even for strongly spectrotemporally overlapping regions. Early (approximately 70-millisecond) responses to nonoverlapping spectrotemporal features are seen for both talkers. When competing talkers' spectrotemporal features mask each other, the individual representations persist, but they occur with an approximately 20-millisecond delay. This suggests that the auditory cortex recovers acoustic features that are masked in the mixture, even if they occurred in the ignored speech. The existence of such noise-robust cortical representations, of features present in attended as well as ignored speech, suggests an active cortical stream segregation process, which could explain a range of behavioral effects of ignored background speech.

Asunto(s)

Corteza Auditiva/fisiología , Habla/fisiología , Estimulación Acústica , Acústica , Adulto , Atención/fisiología , Femenino , Humanos , Magnetoencefalografía , Masculino , Persona de Mediana Edad , Modelos Biológicos , Factores de Tiempo , Adulto Joven

4.

Poststroke acute dysexecutive syndrome, a disorder resulting from minor stroke due to disruption of network dynamics.

Marsh, Elisabeth B; Brodbeck, Christian; Llinas, Rafael H; Mallick, Dania; Kulasingham, Joshua P; Simon, Jonathan Z; Llinás, Rodolfo R.

Proc Natl Acad Sci U S A ; 117(52): 33578-33585, 2020 12 29.

Artículo en Inglés | MEDLINE | ID: mdl-33318200

RESUMEN

Stroke patients with small central nervous system infarcts often demonstrate an acute dysexecutive syndrome characterized by difficulty with attention, concentration, and processing speed, independent of lesion size or location. We use magnetoencephalography (MEG) to show that disruption of network dynamics may be responsible. Nine patients with recent minor strokes and eight age-similar controls underwent cognitive screening using the Montreal cognitive assessment (MoCA) and MEG to evaluate differences in cerebral activation patterns. During MEG, subjects participated in a visual picture-word matching task. Task complexity was increased as testing progressed. Cluster-based permutation tests determined differences in activation patterns within the visual cortex, fusiform gyrus, and lateral temporal lobe. At visit 1, MoCA scores were significantly lower for patients than controls (median [interquartile range] = 26.0 [4] versus 29.5 [3], P = 0.005), and patient reaction times were increased. The amplitude of activation was significantly lower after infarct and demonstrated a pattern of temporal dispersion independent of stroke location. Differences were prominent in the fusiform gyrus and lateral temporal lobe. The pattern suggests that distributed network dysfunction may be responsible. Additionally, controls were able to modulate their cerebral activity based on task difficulty. In contrast, stroke patients exhibited the same low-amplitude response to all stimuli. Group differences remained, to a lesser degree, 6 mo later; while MoCA scores and reaction times improved for patients. This study suggests that function is a globally distributed property beyond area-specific functionality and illustrates the need for longer-term follow-up studies to determine whether abnormal activation patterns ultimately resolve or another mechanism underlies continued recovery.

Asunto(s)

Red Nerviosa/fisiopatología , Accidente Cerebrovascular/fisiopatología , Enfermedad Aguda , Adolescente , Adulto , Anciano , Conducta , Mapeo Encefálico , Femenino , Humanos , Imagen por Resonancia Magnética , Magnetoencefalografía , Masculino , Persona de Mediana Edad , Red Nerviosa/diagnóstico por imagen , Accidente Cerebrovascular/diagnóstico por imagen , Síndrome , Análisis y Desempeño de Tareas , Factores de Tiempo , Adulto Joven

5.

Neural Markers of Speech Comprehension: Measuring EEG Tracking of Linguistic Speech Representations, Controlling the Speech Acoustics.

Gillis, Marlies; Vanthornhout, Jonas; Simon, Jonathan Z; Francart, Tom; Brodbeck, Christian.

J Neurosci ; 41(50): 10316-10329, 2021 12 15.

Artículo en Inglés | MEDLINE | ID: mdl-34732519

RESUMEN

When listening to speech, our brain responses time lock to acoustic events in the stimulus. Recent studies have also reported that cortical responses track linguistic representations of speech. However, tracking of these representations is often described without controlling for acoustic properties. Therefore, the response to these linguistic representations might reflect unaccounted acoustic processing rather than language processing. Here, we evaluated the potential of several recently proposed linguistic representations as neural markers of speech comprehension. To do so, we investigated EEG responses to audiobook speech of 29 participants (22 females). We examined whether these representations contribute unique information over and beyond acoustic neural tracking and each other. Indeed, not all of these linguistic representations were significantly tracked after controlling for acoustic properties. However, phoneme surprisal, cohort entropy, word surprisal, and word frequency were all significantly tracked over and beyond acoustic properties. We also tested the generality of the associated responses by training on one story and testing on another. In general, the linguistic representations are tracked similarly across different stories spoken by different readers. These results suggests that these representations characterize the processing of the linguistic content of speech.SIGNIFICANCE STATEMENT For clinical applications, it would be desirable to develop a neural marker of speech comprehension derived from neural responses to continuous speech. Such a measure would allow for behavior-free evaluation of speech understanding; this would open doors toward better quantification of speech understanding in populations from whom obtaining behavioral measures may be difficult, such as young children or people with cognitive impairments, to allow better targeted interventions and better fitting of hearing devices.

Asunto(s)

Comprensión/fisiología , Lingüística , Acústica del Lenguaje , Percepción del Habla/fisiología , Electroencefalografía/métodos , Femenino , Humanos , Masculino , Procesamiento de Señales Asistido por Computador

6.

Cortical Processing of Arithmetic and Simple Sentences in an Auditory Attention Task.

Kulasingham, Joshua P; Joshi, Neha H; Rezaeizadeh, Mohsen; Simon, Jonathan Z.

J Neurosci ; 41(38): 8023-8039, 2021 09 22.

Artículo en Inglés | MEDLINE | ID: mdl-34400518

RESUMEN

Cortical processing of arithmetic and of language rely on both shared and task-specific neural mechanisms, which should also be dissociable from the particular sensory modality used to probe them. Here, spoken arithmetical and non-mathematical statements were employed to investigate neural processing of arithmetic, compared with general language processing, in an attention-modulated cocktail party paradigm. Magnetoencephalography (MEG) data were recorded from 22 human subjects listening to audio mixtures of spoken sentences and arithmetic equations while selectively attending to one of the two speech streams. Short sentences and simple equations were presented diotically at fixed and distinct word/symbol and sentence/equation rates. Critically, this allowed neural responses to acoustics, words, and symbols to be dissociated from responses to sentences and equations. Indeed, the simultaneous neural processing of the acoustics of words and symbols was observed in auditory cortex for both streams. Neural responses to sentences and equations, however, were predominantly to the attended stream, originating primarily from left temporal, and parietal areas, respectively. Additionally, these neural responses were correlated with behavioral performance in a deviant detection task. Source-localized temporal response functions (TRFs) revealed distinct cortical dynamics of responses to sentences in left temporal areas and equations in bilateral temporal, parietal, and motor areas. Finally, the target of attention could be decoded from MEG responses, especially in left superior parietal areas. In short, the neural responses to arithmetic and language are especially well segregated during the cocktail party paradigm, and the correlation with behavior suggests that they may be linked to successful comprehension or calculation.SIGNIFICANCE STATEMENT Neural processing of arithmetic relies on dedicated, modality independent cortical networks that are distinct from those underlying language processing. Using a simultaneous cocktail party listening paradigm, we found that these separate networks segregate naturally when listeners selectively attend to one type over the other. Neural responses in the left temporal lobe were observed for both spoken sentences and equations, but the latter additionally showed bilateral parietal activity consistent with arithmetic processing. Critically, these responses were modulated by selective attention and correlated with task behavior, consistent with reflecting high-level processing for speech comprehension or correct calculations. The response dynamics show task-related differences that were used to reliably decode the attentional target of sentences or equations.

Asunto(s)

Atención/fisiología , Percepción Auditiva/fisiología , Corteza Cerebral/fisiología , Solución de Problemas/fisiología , Comprensión/fisiología , Femenino , Humanos , Magnetoencefalografía , Masculino , Matemática , Percepción del Habla/fisiología , Adulto Joven

7.

NLGC: Network localized Granger causality with application to MEG directional functional connectivity analysis.

Soleimani, Behrad; Das, Proloy; Dushyanthi Karunathilake, I M; Kuchinsky, Stefanie E; Simon, Jonathan Z; Babadi, Behtash.

Neuroimage ; 260: 119496, 2022 10 15.

Artículo en Inglés | MEDLINE | ID: mdl-35870697

RESUMEN

Identifying the directed connectivity that underlie networked activity between different cortical areas is critical for understanding the neural mechanisms behind sensory processing. Granger causality (GC) is widely used for this purpose in functional magnetic resonance imaging analysis, but there the temporal resolution is low, making it difficult to capture the millisecond-scale interactions underlying sensory processing. Magnetoencephalography (MEG) has millisecond resolution, but only provides low-dimensional sensor-level linear mixtures of neural sources, which makes GC inference challenging. Conventional methods proceed in two stages: First, cortical sources are estimated from MEG using a source localization technique, followed by GC inference among the estimated sources. However, the spatiotemporal biases in estimating sources propagate into the subsequent GC analysis stage, may result in both false alarms and missing true GC links. Here, we introduce the Network Localized Granger Causality (NLGC) inference paradigm, which models the source dynamics as latent sparse multivariate autoregressive processes and estimates their parameters directly from the MEG measurements, integrated with source localization, and employs the resulting parameter estimates to produce a precise statistical characterization of the detected GC links. We offer several theoretical and algorithmic innovations within NLGC and further examine its utility via comprehensive simulations and application to MEG data from an auditory task involving tone processing from both younger and older participants. Our simulation studies reveal that NLGC is markedly robust with respect to model mismatch, network size, and low signal-to-noise ratio, whereas the conventional two-stage methods result in high false alarms and mis-detections. We also demonstrate the advantages of NLGC in revealing the cortical network-level characterization of neural activity during tone processing and resting state by delineating task- and age-related connectivity changes.

Asunto(s)

Imagen por Resonancia Magnética , Magnetoencefalografía , Algoritmos , Encéfalo/diagnóstico por imagen , Simulación por Computador , Humanos , Imagen por Resonancia Magnética/métodos , Magnetoencefalografía/métodos

8.

Dynamic estimation of auditory temporal response functions via state-space models with Gaussian mixture process noise.

Miran, Sina; Presacco, Alessandro; Simon, Jonathan Z; Fu, Michael C; Marcus, Steven I; Babadi, Behtash.

PLoS Comput Biol ; 16(8): e1008172, 2020 08.

Artículo en Inglés | MEDLINE | ID: mdl-32813712

RESUMEN

Estimating the latent dynamics underlying biological processes is a central problem in computational biology. State-space models with Gaussian statistics are widely used for estimation of such latent dynamics and have been successfully utilized in the analysis of biological data. Gaussian statistics, however, fail to capture several key features of the dynamics of biological processes (e.g., brain dynamics) such as abrupt state changes and exogenous processes that affect the states in a structured fashion. Although Gaussian mixture process noise models have been considered as an alternative to capture such effects, data-driven inference of their parameters is not well-established in the literature. The objective of this paper is to develop efficient algorithms for inferring the parameters of a general class of Gaussian mixture process noise models from noisy and limited observations, and to utilize them in extracting the neural dynamics that underlie auditory processing from magnetoencephalography (MEG) data in a cocktail party setting. We develop an algorithm based on Expectation-Maximization to estimate the process noise parameters from state-space observations. We apply our algorithm to simulated and experimentally-recorded MEG data from auditory experiments in the cocktail party paradigm to estimate the underlying dynamic Temporal Response Functions (TRFs). Our simulation results show that the richer representation of the process noise as a Gaussian mixture significantly improves state estimation and capturing the heterogeneity of the TRF dynamics. Application to MEG data reveals improvements over existing TRF estimation techniques, and provides a reliable alternative to current approaches for probing neural dynamics in a cocktail party scenario, as well as attention decoding in emerging applications such as smart hearing aids. Our proposed methodology provides a framework for efficient inference of Gaussian mixture process noise models, with application to a wide range of biological data with underlying heterogeneous and latent dynamics.

Asunto(s)

Vías Auditivas/fisiología , Algoritmos , Humanos , Magnetoencefalografía/métodos , Modelos Neurológicos

9.

Neuro-current response functions: A unified approach to MEG source analysis under the continuous stimuli paradigm.

Das, Proloy; Brodbeck, Christian; Simon, Jonathan Z; Babadi, Behtash.

Neuroimage ; 211: 116528, 2020 05 01.

Artículo en Inglés | MEDLINE | ID: mdl-31945510

RESUMEN

Characterizing the neural dynamics underlying sensory processing is one of the central areas of investigation in systems and cognitive neuroscience. Neuroimaging techniques such as magnetoencephalography (MEG) and Electroencephalography (EEG) have provided significant insights into the neural processing of continuous stimuli, such as speech, thanks to their high temporal resolution. Existing work in the context of auditory processing suggests that certain features of speech, such as the acoustic envelope, can be used as reliable linear predictors of the neural response manifested in M/EEG. The corresponding linear filters are referred to as temporal response functions (TRFs). While the functional roles of specific components of the TRF are well-studied and linked to behavioral attributes such as attention, the cortical origins of the underlying neural processes are not as well understood. In this work, we address this issue by estimating a linear filter representation of cortical sources directly from neuroimaging data in the context of continuous speech processing. To this end, we introduce Neuro-Current Response Functions (NCRFs), a set of linear filters, spatially distributed throughout the cortex, that predict the cortical currents giving rise to the observed ongoing MEG (or EEG) data in response to continuous speech. NCRF estimation is cast within a Bayesian framework, which allows unification of the TRF and source estimation problems, and also facilitates the incorporation of prior information on the structural properties of the NCRFs. To generalize this analysis to M/EEG recordings which lack individual structural magnetic resonance (MR) scans, NCRFs are extended to free-orientation dipoles and a novel regularizing scheme is put forward to lessen reliance on fine-tuned coordinate co-registration. We present a fast estimation algorithm, which we refer to as the Champ-Lasso algorithm, by leveraging recent advances in optimization, and demonstrate its utility through application to simulated and experimentally recorded MEG data under auditory experiments. Our simulation studies reveal significant improvements over existing methods that typically operate in a two-stage fashion, in terms of spatial resolution, response function reconstruction, and recovering dipole orientations. The analysis of experimentally-recorded MEG data without MR scans corroborates existing findings, but also delineates the distinct cortical distribution of the underlying neural processes at high spatiotemporal resolution. In summary, we provide a principled modeling and estimation paradigm for MEG source analysis tailored to extracting the cortical origin of electrophysiological responses to continuous stimuli.

Asunto(s)

Corteza Cerebral/fisiología , Electroencefalografía/métodos , Neuroimagen Funcional/métodos , Magnetoencefalografía/métodos , Percepción del Habla/fisiología , Adulto , Algoritmos , Teorema de Bayes , Electroencefalografía/normas , Femenino , Neuroimagen Funcional/normas , Humanos , Magnetoencefalografía/normas , Masculino , Adulto Joven

10.

High gamma cortical processing of continuous speech in younger and older listeners.

Kulasingham, Joshua P; Brodbeck, Christian; Presacco, Alessandro; Kuchinsky, Stefanie E; Anderson, Samira; Simon, Jonathan Z.

Neuroimage ; 222: 117291, 2020 11 15.

Artículo en Inglés | MEDLINE | ID: mdl-32835821

RESUMEN

Neural processing along the ascending auditory pathway is often associated with a progressive reduction in characteristic processing rates. For instance, the well-known frequency-following response (FFR) of the auditory midbrain, as measured with electroencephalography (EEG), is dominated by frequencies from â¼100â¯Hz to several hundred Hz, phase-locking to the acoustic stimulus at those frequencies. In contrast, cortical responses, whether measured by EEG or magnetoencephalography (MEG), are typically characterized by frequencies of a few Hz to a few tens of Hz, time-locking to acoustic envelope features. In this study we investigated a crossover case, cortically generated responses time-locked to continuous speech features at FFR-like rates. Using MEG, we analyzed responses in the high gamma range of 70-200â¯Hz to continuous speech using neural source-localized reverse correlation and the corresponding temporal response functions (TRFs). Continuous speech stimuli were presented to 40 subjects (17 younger, 23 older adults) with clinically normal hearing and their MEG responses were analyzed in the 70-200â¯Hz band. Consistent with the relative insensitivity of MEG to many subcortical structures, the spatiotemporal profile of these response components indicated a cortical origin with â¼40â¯ms peak latency and a right hemisphere bias. TRF analysis was performed using two separate aspects of the speech stimuli: a) the 70-200â¯Hz carrier of the speech, and b) the 70-200â¯Hz temporal modulations in the spectral envelope of the speech stimulus. The response was dominantly driven by the envelope modulation, with a much weaker contribution from the carrier. Age-related differences were also analyzed to investigate a reversal previously seen along the ascending auditory pathway, whereby older listeners show weaker midbrain FFR responses than younger listeners, but, paradoxically, have stronger cortical low frequency responses. In contrast to both these earlier results, this study did not find clear age-related differences in high gamma cortical responses to continuous speech. Cortical responses at FFR-like frequencies shared some properties with midbrain responses at the same frequencies and with cortical responses at much lower frequencies.

Asunto(s)

Envejecimiento/fisiología , Vías Auditivas/fisiología , Percepción Auditiva/fisiología , Percepción del Habla/fisiología , Adolescente , Adulto , Anciano , Corteza Auditiva/fisiología , Electroencefalografía/métodos , Potenciales Evocados Auditivos/fisiología , Femenino , Humanos , Magnetoencefalografía/métodos , Masculino , Persona de Mediana Edad , Habla , Adulto Joven

11.

Exaggerated cortical representation of speech in older listeners: mutual information analysis.

Zan, Peng; Presacco, Alessandro; Anderson, Samira; Simon, Jonathan Z.

J Neurophysiol ; 124(4): 1152-1164, 2020 10 01.

Artículo en Inglés | MEDLINE | ID: mdl-32877288

RESUMEN

Aging is associated with an exaggerated representation of the speech envelope in auditory cortex. The relationship between this age-related exaggerated response and a listener's ability to understand speech in noise remains an open question. Here, information-theory-based analysis methods are applied to magnetoencephalography recordings of human listeners, investigating their cortical responses to continuous speech, using the novel nonlinear measure of phase-locked mutual information between the speech stimuli and cortical responses. The cortex of older listeners shows an exaggerated level of mutual information, compared with younger listeners, for both attended and unattended speakers. The mutual information peaks for several distinct latencies: early (â¼50 ms), middle (â¼100 ms), and late (â¼200 ms). For the late component, the neural enhancement of attended over unattended speech is affected by stimulus signal-to-noise ratio, but the direction of this dependency is reversed by aging. Critically, in older listeners and for the same late component, greater cortical exaggeration is correlated with decreased behavioral inhibitory control. This negative correlation also carries over to speech intelligibility in noise, where greater cortical exaggeration in older listeners is correlated with worse speech intelligibility scores. Finally, an age-related lateralization difference is also seen for the â¼100 ms latency peaks, where older listeners show a bilateral response compared with younger listeners' right lateralization. Thus, this information-theory-based analysis provides new, and less coarse-grained, results regarding age-related change in auditory cortical speech processing, and its correlation with cognitive measures, compared with related linear measures.NEW & NOTEWORTHY Cortical representations of natural speech are investigated using a novel nonlinear approach based on mutual information. Cortical responses, phase-locked to the speech envelope, show an exaggerated level of mutual information associated with aging, appearing at several distinct latencies (â¼50, â¼100, and â¼200 ms). Critically, for older listeners only, the â¼200 ms latency response components are correlated with specific behavioral measures, including behavioral inhibition and speech comprehension.

Asunto(s)

Envejecimiento/fisiología , Percepción del Habla , Adolescente , Adulto , Anciano , Potenciales Evocados Auditivos , Femenino , Humanos , Teoría de la Información , Magnetoencefalografía , Masculino , Persona de Mediana Edad , Tiempo de Reacción , Corteza Sensoriomotora/crecimiento & desarrollo , Corteza Sensoriomotora/fisiología

12.

Mutual information analysis of neural representations of speech in noise in the aging midbrain.

Zan, Peng; Presacco, Alessandro; Anderson, Samira; Simon, Jonathan Z.

J Neurophysiol ; 122(6): 2372-2387, 2019 12 01.

Artículo en Inglés | MEDLINE | ID: mdl-31596649

RESUMEN

Younger adults with normal hearing can typically understand speech in the presence of a competing speaker without much effort, but this ability to understand speech in challenging conditions deteriorates with age. Older adults, even with clinically normal hearing, often have problems understanding speech in noise. Earlier auditory studies using the frequency-following response (FFR), primarily believed to be generated by the midbrain, demonstrated age-related neural deficits when analyzed with traditional measures. Here we use a mutual information paradigm to analyze the FFR to speech (masked by a competing speech signal) by estimating the amount of stimulus information contained in the FFR. Our results show, first, a broadband informational loss associated with aging for both FFR amplitude and phase. Second, this age-related loss of information is more severe in higher-frequency FFR bands (several hundred hertz). Third, the mutual information between the FFR and the stimulus decreases as noise level increases for both age groups. Fourth, older adults benefit neurally, i.e., show a reduction in loss of information, when the speech masker is changed from meaningful (talker speaking a language that they can comprehend, such as English) to meaningless (talker speaking a language that they cannot comprehend, such as Dutch). This benefit is not seen in younger listeners, which suggests that age-related informational loss may be more severe when the speech masker is meaningful than when it is meaningless. In summary, as a method, mutual information analysis can unveil new results that traditional measures may not have enough statistical power to assess.NEW & NOTEWORTHY Older adults, even with clinically normal hearing, often have problems understanding speech in noise. Auditory studies using the frequency-following response (FFR) have demonstrated age-related neural deficits with traditional methods. Here we use a mutual information paradigm to analyze the FFR to speech masked by competing speech. Results confirm those from traditional analysis but additionally show that older adults benefit neurally when the masker changes from a language that they comprehend to a language they cannot.

Asunto(s)

Envejecimiento/fisiología , Corteza Cerebral/fisiología , Electroencefalografía , Entropía , Teoría de la Información , Mesencéfalo/fisiología , Enmascaramiento Perceptual/fisiología , Percepción del Habla/fisiología , Adolescente , Adulto , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Ruido , Adulto Joven

13.

Cortical Representations of Speech in a Multitalker Auditory Scene.

Puvvada, Krishna C; Simon, Jonathan Z.

J Neurosci ; 37(38): 9189-9196, 2017 09 20.

Artículo en Inglés | MEDLINE | ID: mdl-28821680

RESUMEN

The ability to parse a complex auditory scene into perceptual objects is facilitated by a hierarchical auditory system. Successive stages in the hierarchy transform an auditory scene of multiple overlapping sources, from peripheral tonotopically based representations in the auditory nerve, into perceptually distinct auditory-object-based representations in the auditory cortex. Here, using magnetoencephalography recordings from men and women, we investigate how a complex acoustic scene consisting of multiple speech sources is represented in distinct hierarchical stages of the auditory cortex. Using systems-theoretic methods of stimulus reconstruction, we show that the primary-like areas in the auditory cortex contain dominantly spectrotemporal-based representations of the entire auditory scene. Here, both attended and ignored speech streams are represented with almost equal fidelity, and a global representation of the full auditory scene with all its streams is a better candidate neural representation than that of individual streams being represented separately. We also show that higher-order auditory cortical areas, by contrast, represent the attended stream separately and with significantly higher fidelity than unattended streams. Furthermore, the unattended background streams are more faithfully represented as a single unsegregated background object rather than as separated objects. Together, these findings demonstrate the progression of the representations and processing of a complex acoustic scene up through the hierarchy of the human auditory cortex.SIGNIFICANCE STATEMENT Using magnetoencephalography recordings from human listeners in a simulated cocktail party environment, we investigate how a complex acoustic scene consisting of multiple speech sources is represented in separate hierarchical stages of the auditory cortex. We show that the primary-like areas in the auditory cortex use a dominantly spectrotemporal-based representation of the entire auditory scene, with both attended and unattended speech streams represented with almost equal fidelity. We also show that higher-order auditory cortical areas, by contrast, represent an attended speech stream separately from, and with significantly higher fidelity than, unattended speech streams. Furthermore, the unattended background streams are represented as a single undivided background object rather than as distinct background objects.

Asunto(s)

Corteza Auditiva/fisiología , Red Nerviosa/fisiología , Patrones de Reconocimiento Fisiológico/fisiología , Percepción del Habla/fisiología , Habla/fisiología , Estimulación Acústica/métodos , Señales (Psicología) , Femenino , Humanos , Masculino , Adulto Joven

14.

Neural source dynamics of brain responses to continuous stimuli: Speech processing from acoustics to comprehension.

Brodbeck, Christian; Presacco, Alessandro; Simon, Jonathan Z.

Neuroimage ; 172: 162-174, 2018 05 15.

Artículo en Inglés | MEDLINE | ID: mdl-29366698

RESUMEN

Human experience often involves continuous sensory information that unfolds over time. This is true in particular for speech comprehension, where continuous acoustic signals are processed over seconds or even minutes. We show that brain responses to such continuous stimuli can be investigated in detail, for magnetoencephalography (MEG) data, by combining linear kernel estimation with minimum norm source localization. Previous research has shown that the requirement to average data over many trials can be overcome by modeling the brain response as a linear convolution of the stimulus and a kernel, or response function, and estimating a kernel that predicts the response from the stimulus. However, such analysis has been typically restricted to sensor space. Here we demonstrate that this analysis can also be performed in neural source space. We first computed distributed minimum norm current source estimates for continuous MEG recordings, and then computed response functions for the current estimate at each source element, using the boosting algorithm with cross-validation. Permutation tests can then assess the significance of individual predictor variables, as well as features of the corresponding spatio-temporal response functions. We demonstrate the viability of this technique by computing spatio-temporal response functions for speech stimuli, using predictor variables reflecting acoustic, lexical and semantic processing. Results indicate that processes related to comprehension of continuous speech can be differentiated anatomically as well as temporally: acoustic information engaged auditory cortex at short latencies, followed by responses over the central sulcus and inferior frontal gyrus, possibly related to somatosensory/motor cortex involvement in speech perception; lexical frequency was associated with a left-lateralized response in auditory cortex and subsequent bilateral frontal activity; and semantic composition was associated with bilateral temporal and frontal brain activity. We conclude that this technique can be used to study the neural processing of continuous stimuli in time and anatomical space with the millisecond temporal resolution of MEG. This suggests new avenues for analyzing neural processing of naturalistic stimuli, without the necessity of averaging over artificially short or truncated stimuli.

Asunto(s)

Encéfalo/fisiología , Comprensión/fisiología , Magnetoencefalografía/métodos , Procesamiento de Señales Asistido por Computador , Percepción del Habla/fisiología , Acústica , Adolescente , Adulto , Mapeo Encefálico/métodos , Femenino , Humanos , Masculino , Adulto Joven

15.

Over-representation of speech in older adults originates from early response in higher order auditory cortex.

Brodbeck, Christian; Presacco, Alessandro; Anderson, Samira; Simon, Jonathan Z.

Acta Acust United Acust ; 104(5): 774-777, 2018.

Artículo en Inglés | MEDLINE | ID: mdl-30686956

RESUMEN

Previous research has found that, paradoxically, while older adults have more difficulty comprehending speech in challenging circumstances than younger adults, their brain responses track the envelope of the acoustic signal more robustly. Here we investigate this puzzle by using magnetoencephalography (MEG) source localization to determine the anatomical origin of this difference. Our results indicate that this robust tracking in older adults does not arise merely from having the same responses as younger adults but with larger amplitudes; instead, they recruit additional regions, inferior to core auditory cortex, with a short latency of ~30 ms relative to the acoustic signal.

16.

Overlapping communities reveal rich structure in large-scale brain networks during rest and task conditions.

Najafi, Mahshid; McMenamin, Brenton W; Simon, Jonathan Z; Pessoa, Luiz.

Neuroimage ; 135: 92-106, 2016 07 15.

Artículo en Inglés | MEDLINE | ID: mdl-27129758

RESUMEN

Large-scale analysis of functional MRI data has revealed that brain regions can be grouped into stable "networks" or communities. In many instances, the communities are characterized as relatively disjoint. Although recent work indicates that brain regions may participate in multiple communities (for example, hub regions), the extent of community overlap is poorly understood. To address these issues, here we investigated large-scale brain networks based on "rest" and task human functional MRI data by employing a mixed-membership Bayesian model that allows each brain region to belong to all communities simultaneously with varying membership strengths. The approach allowed us to 1) compare the structure of disjoint and overlapping communities; 2) determine the relationship between functional diversity (how diverse is a region's functional activation repertoire) and membership diversity (how diverse is a region's affiliation to communities); 3) characterize overlapping community structure; 4) characterize the degree of non-modularity in brain networks; 5) study the distribution of "bridges", including bottleneck and hub bridges. Our findings revealed the existence of dense community overlap that was not limited to "special" hubs. Furthermore, the findings revealed important differences between community organization during rest and during specific task states. Overall, we suggest that dense overlapping communities are well suited to capture the flexible and task dependent mapping between brain regions and their functions.

Asunto(s)

Mapeo Encefálico/métodos , Encéfalo/fisiología , Cognición/fisiología , Modelos Neurológicos , Red Nerviosa/fisiología , Descanso/fisiología , Análisis y Desempeño de Tareas , Teorema de Bayes , Simulación por Computador , Humanos , Imagen por Resonancia Magnética/métodos

17.

Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling.

Akram, Sahar; Presacco, Alessandro; Simon, Jonathan Z; Shamma, Shihab A; Babadi, Behtash.

Neuroimage ; 124(Pt A): 906-917, 2016 Jan 01.

Artículo en Inglés | MEDLINE | ID: mdl-26436490

RESUMEN

The underlying mechanism of how the human brain solves the cocktail party problem is largely unknown. Recent neuroimaging studies, however, suggest salient temporal correlations between the auditory neural response and the attended auditory object. Using magnetoencephalography (MEG) recordings of the neural responses of human subjects, we propose a decoding approach for tracking the attentional state while subjects are selectively listening to one of the two speech streams embedded in a competing-speaker environment. We develop a biophysically-inspired state-space model to account for the modulation of the neural response with respect to the attentional state of the listener. The constructed decoder is based on a maximum a posteriori (MAP) estimate of the state parameters via the Expectation Maximization (EM) algorithm. Using only the envelope of the two speech streams as covariates, the proposed decoder enables us to track the attentional state of the listener with a temporal resolution of the order of seconds, together with statistical confidence intervals. We evaluate the performance of the proposed model using numerical simulations and experimentally measured evoked MEG responses from the human brain. Our analysis reveals considerable performance gains provided by the state-space model in terms of temporal resolution, computational complexity and decoding accuracy.

Asunto(s)

Atención/fisiología , Percepción Sonora/fisiología , Percepción del Habla/fisiología , Estimulación Acústica , Adulto , Algoritmos , Percepción Auditiva/fisiología , Ambiente , Femenino , Humanos , Magnetoencefalografía , Masculino , Modelos Neurológicos , Adulto Joven

18.

Effect of informational content of noise on speech representation in the aging midbrain and cortex.

Presacco, Alessandro; Simon, Jonathan Z; Anderson, Samira.

J Neurophysiol ; 116(5): 2356-2367, 2016 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-27605531

RESUMEN

The ability to understand speech is significantly degraded by aging, particularly in noisy environments. One way that older adults cope with this hearing difficulty is through the use of contextual cues. Several behavioral studies have shown that older adults are better at following a conversation when the target speech signal has high contextual content or when the background distractor is not meaningful. Specifically, older adults gain significant benefit in focusing on and understanding speech if the background is spoken by a talker in a language that is not comprehensible to them (i.e., a foreign language). To understand better the neural mechanisms underlying this benefit in older adults, we investigated aging effects on midbrain and cortical encoding of speech when in the presence of a single competing talker speaking in a language that is meaningful or meaningless to the listener (i.e., English vs. Dutch). Our results suggest that neural processing is strongly affected by the informational content of noise. Specifically, older listeners' cortical responses to the attended speech signal are less deteriorated when the competing speech signal is an incomprehensible language rather than when it is their native language. Conversely, temporal processing in the midbrain is affected by different backgrounds only during rapid changes in speech and only in younger listeners. Additionally, we found that cognitive decline is associated with an increase in cortical envelope tracking, suggesting an age-related over (or inefficient) use of cognitive resources that may explain their difficulty in processing speech targets while trying to ignore interfering noise.

Asunto(s)

Envejecimiento/fisiología , Corteza Auditiva/fisiología , Comprensión/fisiología , Mesencéfalo/fisiología , Ruido , Percepción del Habla/fisiología , Adolescente , Adulto , Anciano , Electroencefalografía/tendencias , Femenino , Humanos , Magnetoencefalografía/tendencias , Masculino , Persona de Mediana Edad , Ruido/efectos adversos , Habla/fisiología , Adulto Joven

19.

Evidence of degraded representation of speech in noise, in the aging midbrain and cortex.

Presacco, Alessandro; Simon, Jonathan Z; Anderson, Samira.

J Neurophysiol ; 116(5): 2346-2355, 2016 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-27535374

RESUMEN

Humans have a remarkable ability to track and understand speech in unfavorable conditions, such as in background noise, but speech understanding in noise does deteriorate with age. Results from several studies have shown that in younger adults, low-frequency auditory cortical activity reliably synchronizes to the speech envelope, even when the background noise is considerably louder than the speech signal. However, cortical speech processing may be limited by age-related decreases in the precision of neural synchronization in the midbrain. To understand better the neural mechanisms contributing to impaired speech perception in older adults, we investigated how aging affects midbrain and cortical encoding of speech when presented in quiet and in the presence of a single-competing talker. Our results suggest that central auditory temporal processing deficits in older adults manifest in both the midbrain and in the cortex. Specifically, midbrain frequency following responses to a speech syllable are more degraded in noise in older adults than in younger adults. This suggests a failure of the midbrain auditory mechanisms needed to compensate for the presence of a competing talker. Similarly, in cortical responses, older adults show larger reductions than younger adults in their ability to encode the speech envelope when a competing talker is added. Interestingly, older adults showed an exaggerated cortical representation of speech in both quiet and noise conditions, suggesting a possible imbalance between inhibitory and excitatory processes, or diminished network connectivity that may impair their ability to encode speech efficiently.

Asunto(s)

Envejecimiento/fisiología , Corteza Auditiva/fisiología , Mesencéfalo/fisiología , Ruido , Percepción del Habla/fisiología , Estimulación Acústica/métodos , Adolescente , Adulto , Anciano , Electroencefalografía/tendencias , Femenino , Humanos , Magnetoencefalografía/tendencias , Masculino , Persona de Mediana Edad , Ruido/efectos adversos , Habla/fisiología , Adulto Joven

20.

Encoding of natural sounds by variance of the cortical local field potential.

Ding, Nai; Simon, Jonathan Z; Shamma, Shihab A; David, Stephen V.

J Neurophysiol ; 115(5): 2389-98, 2016 06 01.

Artículo en Inglés | MEDLINE | ID: mdl-26912594

RESUMEN

Neural encoding of sensory stimuli is typically studied by averaging neural signals across repetitions of the same stimulus. However, recent work has suggested that the variance of neural activity across repeated trials can also depend on sensory inputs. Here we characterize how intertrial variance of the local field potential (LFP) in primary auditory cortex of awake ferrets is affected by continuous natural sound stimuli. We find that natural sounds often suppress the intertrial variance of low-frequency LFP (<16 Hz). However, the amount of the variance reduction is not significantly correlated with the amplitude of the mean response at the same recording site. Moreover, the variance changes occur with longer latency than the mean response. Although the dynamics of the mean response and intertrial variance differ, spectro-temporal receptive field analysis reveals that changes in LFP variance have frequency tuning similar to multiunit activity at the same recording site, suggesting a local origin for changes in LFP variance. In summary, the spectral tuning of LFP intertrial variance and the absence of a correlation with the amplitude of the mean evoked LFP suggest substantial heterogeneity in the interaction between spontaneous and stimulus-driven activity across local neural populations in auditory cortex.

Asunto(s)

Corteza Auditiva/fisiología , Percepción Auditiva , Potenciales Evocados Auditivos , Animales , Corteza Auditiva/citología , Hurones , Neuronas/fisiología , Tiempo de Reacción , Sonido

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA