Pesquisa | Portal Regional da BVS

1.

Neural encoding of linguistic speech cues is unaffected by cognitive decline, but decreases with increasing hearing impairment.

Bolt, Elena; Giroud, Nathalie.

Sci Rep ; 14(1): 19105, 2024 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-39154048

RESUMO

The multivariate temporal response function (mTRF) is an effective tool for investigating the neural encoding of acoustic and complex linguistic features in natural continuous speech. In this study, we investigated how neural representations of speech features derived from natural stimuli are related to early signs of cognitive decline in older adults, taking into account the effects of hearing. Participants without ( n = 25 ) and with ( n = 19 ) early signs of cognitive decline listened to an audiobook while their electroencephalography responses were recorded. Using the mTRF framework, we modeled the relationship between speech input and neural response via different acoustic, segmented and linguistic encoding models and examined the response functions in terms of encoding accuracy, signal power, peak amplitudes and latencies. Our results showed no significant effect of cognitive decline or hearing ability on the neural encoding of acoustic and linguistic speech features. However, we found a significant interaction between hearing ability and the word-level segmentation model, suggesting that hearing impairment specifically affects encoding accuracy for this model, while other features were not affected by hearing ability. These results suggest that while speech processing markers remain unaffected by cognitive decline and hearing loss per se, neural encoding of word-level segmented speech features in older adults is affected by hearing loss but not by cognitive decline. This study emphasises the effectiveness of mTRF analysis in studying the neural encoding of speech and argues for an extension of research to investigate its clinical impact on hearing loss and cognition.

Assuntos

Disfunção Cognitiva , Eletroencefalografia , Perda Auditiva , Percepção da Fala , Humanos , Masculino , Feminino , Idoso , Disfunção Cognitiva/fisiopatologia , Perda Auditiva/fisiopatologia , Percepção da Fala/fisiologia , Fala/fisiologia , Pessoa de Meia-Idade , Sinais (Psicologia) , Linguística , Estimulação Acústica , Idoso de 80 Anos ou mais

2.

Dissociating prosodic from syntactic delta activity during natural speech comprehension.

Chalas, Nikos; Meyer, Lars; Lo, Chia-Wen; Park, Hyojin; Kluger, Daniel S; Abbasi, Omid; Kayser, Christoph; Nitsch, Robert; Gross, Joachim.

Curr Biol ; 34(15): 3537-3549.e5, 2024 Aug 05.

Artigo em Inglês | MEDLINE | ID: mdl-39047734

RESUMO

Decoding human speech requires the brain to segment the incoming acoustic signal into meaningful linguistic units, ranging from syllables and words to phrases. Integrating these linguistic constituents into a coherent percept sets the root of compositional meaning and hence understanding. One important cue for segmentation in natural speech is prosodic cues, such as pauses, but their interplay with higher-level linguistic processing is still unknown. Here, we dissociate the neural tracking of prosodic pauses from the segmentation of multi-word chunks using magnetoencephalography (MEG). We find that manipulating the regularity of pauses disrupts slow speech-brain tracking bilaterally in auditory areas (below 2 Hz) and in turn increases left-lateralized coherence of higher-frequency auditory activity at speech onsets (around 25-45 Hz). Critically, we also find that multi-word chunks-defined as short, coherent bundles of inter-word dependencies-are processed through the rhythmic fluctuations of low-frequency activity (below 2 Hz) bilaterally and independently of prosodic cues. Importantly, low-frequency alignment at chunk onsets increases the accuracy of an encoding model in bilateral auditory and frontal areas while controlling for the effect of acoustics. Our findings provide novel insights into the neural basis of speech perception, demonstrating that both acoustic features (prosodic cues) and abstract linguistic processing at the multi-word timescale are underpinned independently by low-frequency electrophysiological brain activity in the delta frequency range.

Assuntos

Compreensão , Magnetoencefalografia , Percepção da Fala , Humanos , Percepção da Fala/fisiologia , Compreensão/fisiologia , Masculino , Feminino , Adulto , Adulto Jovem , Fala/fisiologia , Ritmo Delta/fisiologia , Encéfalo/fisiologia , Linguística

3.

Longitudinal genome-wide association study reveals early QTL that predict biomass accumulation under cold stress in sorghum.

Agnew, Erica; Ziegler, Greg; Lee, Scott; Lizárraga, César; Fahlgren, Noah; Baxter, Ivan; Mockler, Todd C; Shakoor, Nadia.

Front Plant Sci ; 15: 1278802, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38807776

RESUMO

Introduction: Sorghum bicolor is a promising cellulosic feedstock crop for bioenergy due to its high biomass yields. However, early growth phases of sorghum are sensitive to cold stress, limiting its planting in temperate environments. Cold adaptability is crucial for cultivating bioenergy and grain sorghum at higher latitudes and elevations, or for extending the growing season. Identifying genes and alleles that enhance biomass accumulation under early cold stress can lead to improved sorghum varieties through breeding or genetic engineering. Methods: We conducted image-based phenotyping on 369 accessions from the sorghum Bioenergy Association Panel (BAP) in a controlled environment with early cold treatment. The BAP includes diverse accessions with dense genotyping and varied racial, geographical, and phenotypic backgrounds. Daily, non-destructive imaging allowed temporal analysis of growth-related traits and water use efficiency (WUE). A genome-wide association study (GWAS) was performed to identify genomic intervals and genes associated with cold stress response. Results: The GWAS identified transient quantitative trait loci (QTL) strongly associated with growth-related traits, enabling an exploration of the genetic basis of cold stress response at different developmental stages. This analysis of daily growth traits, rather than endpoint traits, revealed early transient QTL predictive of final phenotypes. The study identified both known and novel candidate genes associated with growth-related traits and temporal responses to cold stress. Discussion: The identified QTL and candidate genes contribute to understanding the genetic mechanisms underlying sorghum's response to cold stress. These findings can inform breeding and genetic engineering strategies to develop sorghum varieties with improved biomass yields and resilience to cold, facilitating earlier planting, extended growing seasons, and cultivation at higher latitudes and elevations.

4.

Extending Subcortical EEG Responses to Continuous Speech to the Sound-Field.

Bachmann, Florine L; Kulasingham, Joshua P; Eskelund, Kasper; Enqvist, Martin; Alickovic, Emina; Innes-Brown, Hamish.

Trends Hear ; 28: 23312165241246596, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38738341

RESUMO

The auditory brainstem response (ABR) is a valuable clinical tool for objective hearing assessment, which is conventionally detected by averaging neural responses to thousands of short stimuli. Progressing beyond these unnatural stimuli, brainstem responses to continuous speech presented via earphones have been recently detected using linear temporal response functions (TRFs). Here, we extend earlier studies by measuring subcortical responses to continuous speech presented in the sound-field, and assess the amount of data needed to estimate brainstem TRFs. Electroencephalography (EEG) was recorded from 24 normal hearing participants while they listened to clicks and stories presented via earphones and loudspeakers. Subcortical TRFs were computed after accounting for non-linear processing in the auditory periphery by either stimulus rectification or an auditory nerve model. Our results demonstrated that subcortical responses to continuous speech could be reliably measured in the sound-field. TRFs estimated using auditory nerve models outperformed simple rectification, and 16âminutes of data was sufficient for the TRFs of all participants to show clear wave V peaks for both earphones and sound-field stimuli. Subcortical TRFs to continuous speech were highly consistent in both earphone and sound-field conditions, and with click ABRs. However, sound-field TRFs required slightly more data (16âminutes) to achieve clear wave V peaks compared to earphone TRFs (12âminutes), possibly due to effects of room acoustics. By investigating subcortical responses to sound-field speech stimuli, this study lays the groundwork for bringing objective hearing assessment closer to real-life conditions, which may lead to improved hearing evaluations and smart hearing technologies.

Assuntos

Estimulação Acústica , Eletroencefalografia , Potenciais Evocados Auditivos do Tronco Encefálico , Percepção da Fala , Humanos , Potenciais Evocados Auditivos do Tronco Encefálico/fisiologia , Masculino , Feminino , Percepção da Fala/fisiologia , Estimulação Acústica/métodos , Adulto , Adulto Jovem , Limiar Auditivo/fisiologia , Fatores de Tempo , Nervo Coclear/fisiologia , Voluntários Saudáveis

5.

Effects and prediction of cognitive load on encoding model of brain response to auditory and linguistic stimuli in educational multimedia.

Asaadi, Amir Hosein; Amiri, S Hamid; Bosaghzadeh, Alireza; Ebrahimpour, Reza.

Sci Rep ; 14(1): 9133, 2024 04 21.

Artigo em Inglês | MEDLINE | ID: mdl-38644370

RESUMO

Multimedia is extensively used for educational purposes. However, certain types of multimedia lack proper design, which could impose a cognitive load on the user. Therefore, it is essential to predict cognitive load and understand how it impairs brain functioning. Participants watched a version of educational multimedia that applied Mayer's principles, followed by a version that did not. Meanwhile, their electroencephalography (EEG) was recorded. Subsequently, they participated in a post-test and completed a self-reported cognitive load questionnaire. The audio envelope and word frequency were extracted from the multimedia, and the temporal response functions (TRFs) were obtained using a linear encoding model. We observed that the behavioral data are different between the two groups and the TRFs of the two multimedia versions were different. We saw changes in the amplitude and latencies of both early and late components. In addition, correlations were found between behavioral data and the amplitude and latencies of TRF components. Cognitive load decreased participants' attention to the multimedia, and semantic processing of words also occurred with a delay and smaller amplitude. Hence, encoding models provide insights into the temporal and spatial mapping of the cognitive load activity, which could help us detect and reduce cognitive load in potential environments such as educational multimedia or simulators for different purposes.

Assuntos

Encéfalo , Cognição , Eletroencefalografia , Multimídia , Humanos , Cognição/fisiologia , Masculino , Feminino , Encéfalo/fisiologia , Adulto Jovem , Adulto , Estimulação Acústica , Linguística , Atenção/fisiologia

6.

Assessing Speech Audibility via Syllabic-Rate Neural Responses in Adults and Children With and Without Hearing Loss.

Pendyala, Varsha; Sethares, William; Easwar, Vijayalakshmi.

Trends Hear ; 28: 23312165241227815, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38545698

RESUMO

An objective method for assessing speech audibility is essential to evaluate hearing aid benefit in children who are unable to participate in hearing tests. With consonant-vowel syllables, brainstem-dominant responses elicited at the voice fundamental frequency have proven successful for assessing audibility. This study aimed to harness the neural activity elicited by the slow envelope of the same repetitive consonant-vowel syllables to assess audibility. In adults and children with normal hearing and children with hearing loss wearing hearing aids, neural activity elicited by the stimulus /su∫i/ or /sa∫i/ presented at 55-75âdB SPL was analyzed using the temporal response function approach. No-stimulus runs or very low stimulus level (15âdB SPL) were used to simulate inaudible conditions in adults and children with normal hearing. Both groups of children demonstrated higher response amplitudes relative to adults. Detectability (sensitivity; true positive rate) ranged between 80.1 and 100%, and did not vary by group or stimulus level but varied by stimulus, with /sa∫i/ achieving 100% detectability at 65âdB SPL. The average minimum time needed to detect a response ranged between 3.7 and 6.4âmin across stimuli and listener groups, with the shortest times recorded for stimulus /sa∫i/ and in children with hearing loss. Specificity was >94.9%. Responses to the slow envelope of non-meaningful consonant-vowel syllables can be used to ascertain audible vs. inaudible speech with sufficient accuracy within clinically feasible test times. Such responses can increase the clinical usefulness of existing objective approaches to evaluate hearing aid benefit.

Assuntos

Surdez , Auxiliares de Audição , Perda Auditiva Neurossensorial , Perda Auditiva , Percepção da Fala , Adulto , Criança , Humanos , Fala , Percepção da Fala/fisiologia , Perda Auditiva/diagnóstico , Perda Auditiva Neurossensorial/reabilitação

7.

Cortical linear encoding and decoding of sounds: Similarities and differences between naturalistic speech and music listening.

Simon, Adèle; Bech, Søren; Loquet, Gérard; Østergaard, Jan.

Eur J Neurosci ; 59(8): 2059-2074, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38303522

RESUMO

Linear models are becoming increasingly popular to investigate brain activity in response to continuous and naturalistic stimuli. In the context of auditory perception, these predictive models can be 'encoding', when stimulus features are used to reconstruct brain activity, or 'decoding' when neural features are used to reconstruct the audio stimuli. These linear models are a central component of some brain-computer interfaces that can be integrated into hearing assistive devices (e.g., hearing aids). Such advanced neurotechnologies have been widely investigated when listening to speech stimuli but rarely when listening to music. Recent attempts at neural tracking of music show that the reconstruction performances are reduced compared with speech decoding. The present study investigates the performance of stimuli reconstruction and electroencephalogram prediction (decoding and encoding models) based on the cortical entrainment of temporal variations of the audio stimuli for both music and speech listening. Three hypotheses that may explain differences between speech and music stimuli reconstruction were tested to assess the importance of the speech-specific acoustic and linguistic factors. While the results obtained with encoding models suggest different underlying cortical processing between speech and music listening, no differences were found in terms of reconstruction of the stimuli or the cortical data. The results suggest that envelope-based linear modelling can be used to study both speech and music listening, despite the differences in the underlying cortical mechanisms.

Assuntos

Música , Percepção da Fala , Percepção Auditiva/fisiologia , Fala , Percepção da Fala/fisiologia , Eletroencefalografia , Estimulação Acústica

8.

Estimating and approaching the maximum information rate of noninvasive visual brain-computer interface.

Shi, Nanlin; Miao, Yining; Huang, Changxing; Li, Xiang; Song, Yonghao; Chen, Xiaogang; Wang, Yijun; Gao, Xiaorong.

Neuroimage ; 289: 120548, 2024 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-38382863

RESUMO

An essential priority of visual brain-computer interfaces (BCIs) is to enhance the information transfer rate (ITR) to achieve high-speed communication. Despite notable progress, noninvasive visual BCIs have encountered a plateau in ITRs, leaving it uncertain whether higher ITRs are achievable. In this study, we used information theory to study the characteristics and capacity of the visual-evoked channel, which leads us to investigate whether and how we can decode higher information rates in a visual BCI system. Using information theory, we estimate the upper and lower bounds of the information rate with the white noise (WN) stimulus. Consequently, we found out that the information rate is determined by the signal-to-noise ratio (SNR) in the frequency domain, which reflects the spectrum resources of the channel. Based on this discovery, we propose a broadband WN BCI by implementing stimuli on a broader frequency band than the steady-state visual evoked potentials (SSVEPs)-based BCI. Through validation, the broadband BCI outperforms the SSVEP BCI by an impressive 7 bps, setting a record of 50 bps. The integration of information theory and the decoding analysis presented in this study offers valuable insights applicable to general sensory-evoked BCIs, providing a potential direction of next-generation human-machine interaction systems.

Assuntos

Interfaces Cérebro-Computador , Humanos , Potenciais Evocados Visuais , Eletroencefalografia , Razão Sinal-Ruído , Comunicação , Estimulação Luminosa , Algoritmos

9.

Neural encoding of musical expectations in a non-human primate.

Bianco, Roberta; Zuk, Nathaniel J; Bigand, Félix; Quarta, Eros; Grasso, Stefano; Arnese, Flavia; Ravignani, Andrea; Battaglia-Mayer, Alexandra; Novembre, Giacomo.

Curr Biol ; 34(2): 444-450.e5, 2024 01 22.

Artigo em Inglês | MEDLINE | ID: mdl-38176416

RESUMO

The appreciation of music is a universal trait of humankind.1,2,3 Evidence supporting this notion includes the ubiquity of music across cultures4,5,6,7 and the natural predisposition toward music that humans display early in development.8,9,10 Are we musical animals because of species-specific predispositions? This question cannot be answered by relying on cross-cultural or developmental studies alone, as these cannot rule out enculturation.11 Instead, it calls for cross-species experiments testing whether homologous neural mechanisms underlying music perception are present in non-human primates. We present music to two rhesus monkeys, reared without musical exposure, while recording electroencephalography (EEG) and pupillometry. Monkeys exhibit higher engagement and neural encoding of expectations based on the previously seeded musical context when passively listening to real music as opposed to shuffled controls. We then compare human and monkey neural responses to the same stimuli and find a species-dependent contribution of two fundamental musical features-pitch and timing12-in generating expectations: while timing- and pitch-based expectations13 are similarly weighted in humans, monkeys rely on timing rather than pitch. Together, these results shed light on the phylogeny of music perception. They highlight monkeys' capacity for processing temporal structures beyond plain acoustic processing, and they identify a species-dependent contribution of time- and pitch-related features to the neural encoding of musical expectations.

Assuntos

Música , Animais , Percepção da Altura Sonora/fisiologia , Motivação , Eletroencefalografia/métodos , Primatas , Estimulação Acústica , Percepção Auditiva/fisiologia

10.

Reliability and generalizability of neural speech tracking in younger and older adults.

Panela, Ryan A; Copelli, Francesca; Herrmann, Björn.

Neurobiol Aging ; 134: 165-180, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38103477

RESUMO

Neural tracking of spoken speech is considered a potential clinical biomarker for speech-processing difficulties, but the reliability of neural speech tracking is unclear. Here, younger and older adults listened to stories in two sessions while electroencephalography was recorded to investigate the reliability and generalizability of neural speech tracking. Speech tracking amplitude was larger for older than younger adults, consistent with an age-related loss of inhibition. The reliability of neural speech tracking was moderate (ICC â¼0.5-0.75) and tended to be higher for older adults. However, reliability was lower for speech tracking than for neural responses to noise bursts (ICC >0.8), which we used as a benchmark for maximum reliability. Neural speech tracking generalized moderately across different stories (ICC â¼0.5-0.6), which appeared greatest for audiobook-like stories spoken by the same person. Hence, a variety of stories could possibly be used for clinical assessments. Overall, the current data are important for developing a biomarker of speech processing but suggest that further work is needed to increase the reliability to meet clinical standards.

Assuntos

Percepção da Fala , Fala , Humanos , Idoso , Fala/fisiologia , Envelhecimento/fisiologia , Percepção da Fala/fisiologia , Reprodutibilidade dos Testes , Eletroencefalografia , Biomarcadores

11.

Different patterns of foreground and background processing contribute to texture segregation in humans: an electrophysiological study.

Zhang, Baoqiang; Hu, Saisai; Zhang, Tingkang; Hai, Min; Wang, Yongchun; Li, Ya; Wang, Yonghui.

PeerJ ; 11: e16139, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37810782

RESUMO

Background: Figure-ground segregation is a necessary process for accurate visual recognition. Previous neurophysiological and human brain imaging studies have suggested that foreground-background segregation relies on both enhanced foreground representation and suppressed background representation. However, in humans, it is not known when and how foreground and background processing play a role in texture segregation. Methods: To answer this question, it is crucial to extract and dissociate the neural signals elicited by the foreground and background of a figure texture with high temporal resolution. Here, we combined an electroencephalogram (EEG) recording and a temporal response function (TRF) approach to specifically track the neural responses to the foreground and background of a figure texture from the overall EEG recordings in the luminance-tracking TRF. A uniform texture was included as a neutral condition. The texture segregation visual evoked potential (tsVEP) was calculated by subtracting the uniform TRF from the foreground and background TRFs, respectively, to index the specific segregation activity. Results: We found that the foreground and background of a figure texture were processed differently during texture segregation. In the posterior region of the brain, we found a negative component for the foreground tsVEP in the early stage of foreground-background segregation, and two negative components for the background tsVEP in the early and late stages. In the anterior region, we found a positive component for the foreground tsVEP in the late stage, and two positive components for the background tsVEP in the early and late stages of texture processing. Discussion: In this study we investigated the temporal profile of foreground and background processing during texture segregation in human participants at a high time resolution. The results demonstrated that the foreground and background jointly contribute to figure-ground segregation in both the early and late phases of texture processing. Our findings provide novel evidence for the neural correlates of foreground-background modulation during figure-ground segregation in humans.

Assuntos

Potenciais Evocados Visuais , Reconhecimento Visual de Modelos , Humanos , Reconhecimento Visual de Modelos/fisiologia , Visão Ocular , Eletroencefalografia/métodos , Encéfalo

12.

EEG-based speaker-listener neural coupling reflects speech-selective attentional mechanisms beyond the speech stimulus.

Li, Jiawei; Hong, Bo; Nolte, Guido; Engel, Andreas K; Zhang, Dan.

Cereb Cortex ; 33(22): 11080-11091, 2023 11 04.

Artigo em Inglês | MEDLINE | ID: mdl-37814353

RESUMO

When we pay attention to someone, do we focus only on the sound they make, the word they use, or do we form a mental space shared with the speaker we want to pay attention to? Some would argue that the human language is no other than a simple signal, but others claim that human beings understand each other because they form a shared mental ground between the speaker and the listener. Our study aimed to explore the neural mechanisms of speech-selective attention by investigating the electroencephalogram-based neural coupling between the speaker and the listener in a cocktail party paradigm. The temporal response function method was employed to reveal how the listener was coupled to the speaker at the neural level. The results showed that the neural coupling between the listener and the attended speaker peaked 5 s before speech onset at the delta band over the left frontal region, and was correlated with speech comprehension performance. In contrast, the attentional processing of speech acoustics and semantics occurred primarily at a later stage after speech onset and was not significantly correlated with comprehension performance. These findings suggest a predictive mechanism to achieve speaker-listener neural coupling for successful speech comprehension.

Assuntos

Percepção da Fala , Fala , Humanos , Fala/fisiologia , Percepção da Fala/fisiologia , Eletroencefalografia , Idioma , Acústica da Fala

13.

Quantifying decision-making in dynamic, continuously evolving environments.

Ruesseler, Maria; Weber, Lilian Aline; Marshall, Tom Rhys; O'Reilly, Jill; Hunt, Laurence Tudor.

Elife ; 122023 10 26.

Artigo em Inglês | MEDLINE | ID: mdl-37883173

RESUMO

During perceptual decision-making tasks, centroparietal electroencephalographic (EEG) potentials report an evidence accumulation-to-bound process that is time locked to trial onset. However, decisions in real-world environments are rarely confined to discrete trials; they instead unfold continuously, with accumulation of time-varying evidence being recency-weighted towards its immediate past. The neural mechanisms supporting recency-weighted continuous decision-making remain unclear. Here, we use a novel continuous task design to study how the centroparietal positivity (CPP) adapts to different environments that place different constraints on evidence accumulation. We show that adaptations in evidence weighting to these different environments are reflected in changes in the CPP. The CPP becomes more sensitive to fluctuations in sensory evidence when large shifts in evidence are less frequent, and the potential is primarily sensitive to fluctuations in decision-relevant (not decision-irrelevant) sensory input. A complementary triphasic component over occipito-parietal cortex encodes the sum of recently accumulated sensory evidence, and its magnitude covaries with parameters describing how different individuals integrate sensory evidence over time. A computational model based on leaky evidence accumulation suggests that these findings can be accounted for by a shift in decision threshold between different environments, which is also reflected in the magnitude of pre-decision EEG activity. Our findings reveal how adaptations in EEG responses reflect flexibility in evidence accumulation to the statistics of dynamic sensory environments.

Assuntos

Tomada de Decisões , Eletroencefalografia , Humanos , Tomada de Decisões/fisiologia , Lobo Parietal/fisiologia , Tempo de Reação/fisiologia

14.

Leading and following: Noise differently affects semantic and acoustic processing during naturalistic speech comprehension.

Zhang, Xinmiao; Li, Jiawei; Li, Zhuoran; Hong, Bo; Diao, Tongxiang; Ma, Xin; Nolte, Guido; Engel, Andreas K; Zhang, Dan.

Neuroimage ; 282: 120404, 2023 11 15.

Artigo em Inglês | MEDLINE | ID: mdl-37806465

RESUMO

Despite the distortion of speech signals caused by unavoidable noise in daily life, our ability to comprehend speech in noisy environments is relatively stable. However, the neural mechanisms underlying reliable speech-in-noise comprehension remain to be elucidated. The present study investigated the neural tracking of acoustic and semantic speech information during noisy naturalistic speech comprehension. Participants listened to narrative audio recordings mixed with spectrally matched stationary noise at three signal-to-ratio (SNR) levels (no noise, 3 dB, -3 dB), and 60-channel electroencephalography (EEG) signals were recorded. A temporal response function (TRF) method was employed to derive event-related-like responses to the continuous speech stream at both the acoustic and the semantic levels. Whereas the amplitude envelope of the naturalistic speech was taken as the acoustic feature, word entropy and word surprisal were extracted via the natural language processing method as two semantic features. Theta-band frontocentral TRF responses to the acoustic feature were observed at around 400 ms following speech fluctuation onset over all three SNR levels, and the response latencies were more delayed with increasing noise. Delta-band frontal TRF responses to the semantic feature of word entropy were observed at around 200 to 600 ms leading to speech fluctuation onset over all three SNR levels. The response latencies became more leading with increasing noise and decreasing speech comprehension and intelligibility. While the following responses to speech acoustics were consistent with previous studies, our study revealed the robustness of leading responses to speech semantics, which suggests a possible predictive mechanism at the semantic level for maintaining reliable speech comprehension in noisy environments.

Assuntos

Compreensão , Percepção da Fala , Humanos , Compreensão/fisiologia , Semântica , Fala/fisiologia , Percepção da Fala/fisiologia , Eletroencefalografia , Acústica , Estimulação Acústica

15.

Visuospatial attention revamps cortical processing of sound amid audiovisual uncertainty.

Cervantes Constantino, Francisco; Sánchez-Costa, Thaiz; Cipriani, Germán A; Carboni, Alejandra.

Psychophysiology ; 60(10): e14329, 2023 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-37166096

RESUMO

Selective attentional biases arising from one sensory modality manifest in others. The effects of visuospatial attention, important in visual object perception, are unclear in the auditory domain during audiovisual (AV) scene processing. We investigate temporal and spatial factors that underlie such transfer neurally. Auditory encoding of random tone pips in AV scenes was addressed via a temporal response function model (TRF) of participants' electroencephalogram (N = 30). The spatially uninformative pips were associated with spatially distributed visual contrast reversals ("flips"), through asynchronous probabilistic AV temporal onset distributions. Participants deployed visuospatial selection on these AV stimuli to perform a task. A late (~300 ms) cross-modal influence over the neural representation of pips was found in the original and a replication study (N = 21). Transfer depended on selected visual input being (i) presented during or shortly after a related sound, in relatively limited temporal distributions (<165 ms); (ii) positioned across limited (1:4) visual foreground to background ratios. Neural encoding of auditory input, as a function of visual input, was largest at visual foreground quadrant sectors and lowest at locations opposite to the target. The results indicate that ongoing neural representations of sounds incorporate visuospatial attributes for auditory stream segregation, as cross-modal transfer conveys information that specifies the identity of multisensory signals. A potential mechanism is by enhancing or recalibrating the tuning properties of the auditory populations that represent them as objects. The results account for the dynamic evolution under visual attention of multisensory integration, specifying critical latencies at which relevant cortical networks operate.

16.

Delta-Band Neural Responses to Individual Words Are Modulated by Sentence Processing.

Slaats, Sophie; Weissbart, Hugo; Schoffelen, Jan-Mathijs; Meyer, Antje S; Martin, Andrea E.

J Neurosci ; 43(26): 4867-4883, 2023 06 28.

Artigo em Inglês | MEDLINE | ID: mdl-37221093

RESUMO

To understand language, we need to recognize words and combine them into phrases and sentences. During this process, responses to the words themselves are changed. In a step toward understanding how the brain builds sentence structure, the present study concerns the neural readout of this adaptation. We ask whether low-frequency neural readouts associated with words change as a function of being in a sentence. To this end, we analyzed an MEG dataset by Schoffelen et al. (2019) of 102 human participants (51 women) listening to sentences and word lists, the latter lacking any syntactic structure and combinatorial meaning. Using temporal response functions and a cumulative model-fitting approach, we disentangled delta- and theta-band responses to lexical information (word frequency), from responses to sensory and distributional variables. The results suggest that delta-band responses to words are affected by sentence context in time and space, over and above entropy and surprisal. In both conditions, the word frequency response spanned left temporal and posterior frontal areas; however, the response appeared later in word lists than in sentences. In addition, sentence context determined whether inferior frontal areas were responsive to lexical information. In the theta band, the amplitude was larger in the word list condition â¼100 milliseconds in right frontal areas. We conclude that low-frequency responses to words are changed by sentential context. The results of this study show how the neural representation of words is affected by structural context and as such provide insight into how the brain instantiates compositionality in language.SIGNIFICANCE STATEMENT Human language is unprecedented in its combinatorial capacity: we are capable of producing and understanding sentences we have never heard before. Although the mechanisms underlying this capacity have been described in formal linguistics and cognitive science, how they are implemented in the brain remains to a large extent unknown. A large body of earlier work from the cognitive neuroscientific literature implies a role for delta-band neural activity in the representation of linguistic structure and meaning. In this work, we combine these insights and techniques with findings from psycholinguistics to show that meaning is more than the sum of its parts; the delta-band MEG signal differentially reflects lexical information inside and outside sentence structures.

Assuntos

Encéfalo , Idioma , Humanos , Feminino , Encéfalo/fisiologia , Linguística , Psicolinguística , Mapeamento Encefálico , Semântica

17.

Auditory Word Comprehension Is Less Incremental in Isolated Words.

Gaston, Phoebe; Brodbeck, Christian; Phillips, Colin; Lau, Ellen.

Neurobiol Lang (Camb) ; 4(1): 29-52, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37229141

RESUMO

Partial speech input is often understood to trigger rapid and automatic activation of successively higher-level representations of words, from sound to meaning. Here we show evidence from magnetoencephalography that this type of incremental processing is limited when words are heard in isolation as compared to continuous speech. This suggests a less unified and automatic word recognition process than is often assumed. We present evidence from isolated words that neural effects of phoneme probability, quantified by phoneme surprisal, are significantly stronger than (statistically null) effects of phoneme-by-phoneme lexical uncertainty, quantified by cohort entropy. In contrast, we find robust effects of both cohort entropy and phoneme surprisal during perception of connected speech, with a significant interaction between the contexts. This dissociation rules out models of word recognition in which phoneme surprisal and cohort entropy are common indicators of a uniform process, even though these closely related information-theoretic measures both arise from the probability distribution of wordforms consistent with the input. We propose that phoneme surprisal effects reflect automatic access of a lower level of representation of the auditory input (e.g., wordforms) while the occurrence of cohort entropy effects is task sensitive, driven by a competition process or a higher-level representation that is engaged late (or not at all) during the processing of single words.

18.

Incorporating models of subcortical processing improves the ability to predict EEG responses to natural speech.

Lindboom, Elsa; Nidiffer, Aaron; Carney, Laurel H; Lalor, Edmund C.

Hear Res ; 433: 108767, 2023 06.

Artigo em Inglês | MEDLINE | ID: mdl-37060895

RESUMO

The goal of describing how the human brain responds to complex acoustic stimuli has driven auditory neuroscience research for decades. Often, a systems-based approach has been taken, in which neurophysiological responses are modeled based on features of the presented stimulus. This includes a wealth of work modeling electroencephalogram (EEG) responses to complex acoustic stimuli such as speech. Examples of the acoustic features used in such modeling include the amplitude envelope and spectrogram of speech. These models implicitly assume a direct mapping from stimulus representation to cortical activity. However, in reality, the representation of sound is transformed as it passes through early stages of the auditory pathway, such that inputs to the cortex are fundamentally different from the raw audio signal that was presented. Thus, it could be valuable to account for the transformations taking place in lower-order auditory areas, such as the auditory nerve, cochlear nucleus, and inferior colliculus (IC) when predicting cortical responses to complex sounds. Specifically, because IC responses are more similar to cortical inputs than acoustic features derived directly from the audio signal, we hypothesized that linear mappings (temporal response functions; TRFs) fit to the outputs of an IC model would better predict EEG responses to speech stimuli. To this end, we modeled responses to the acoustic stimuli as they passed through the auditory nerve, cochlear nucleus, and inferior colliculus before fitting a TRF to the output of the modeled IC responses. Results showed that using model-IC responses in traditional systems analyzes resulted in better predictions of EEG activity than using the envelope or spectrogram of a speech stimulus. Further, it was revealed that model-IC derived TRFs predict different aspects of the EEG than acoustic-feature TRFs, and combining both types of TRF models provides a more accurate prediction of the EEG response.

Assuntos

Córtex Auditivo , Colículos Inferiores , Humanos , Fala/fisiologia , Vias Auditivas/fisiologia , Eletroencefalografia , Córtex Auditivo/fisiologia , Colículos Inferiores/fisiologia , Estimulação Acústica/métodos , Percepção Auditiva/fisiologia

19.

Speech intelligibility changes the temporal evolution of neural speech tracking.

Chen, Ya-Ping; Schmidt, Fabian; Keitel, Anne; Rösch, Sebastian; Hauswald, Anne; Weisz, Nathan.

Neuroimage ; 268: 119894, 2023 03.

Artigo em Inglês | MEDLINE | ID: mdl-36693596

RESUMO

Listening to speech with poor signal quality is challenging. Neural speech tracking of degraded speech has been used to advance the understanding of how brain processes and speech intelligibility are interrelated. However, the temporal dynamics of neural speech tracking and their relation to speech intelligibility are not clear. In the present MEG study, we exploited temporal response functions (TRFs), which has been used to describe the time course of speech tracking on a gradient from intelligible to unintelligible degraded speech. In addition, we used inter-related facets of neural speech tracking (e.g., speech envelope reconstruction, speech-brain coherence, and components of broadband coherence spectra) to endorse our findings in TRFs. Our TRF analysis yielded marked temporally differential effects of vocoding: â¼50-110 ms (M50TRF), â¼175-230 ms (M200TRF), and â¼315-380 ms (M350TRF). Reduction of intelligibility went along with large increases of early peak responses M50TRF, but strongly reduced responses in M200TRF. In the late responses M350TRF, the maximum response occurred for degraded speech that was still comprehensible then declined with reduced intelligibility. Furthermore, we related the TRF components to our other neural "tracking" measures and found that M50TRF and M200TRF play a differential role in the shifting center frequency of the broadband coherence spectra. Overall, our study highlights the importance of time-resolved computation of neural speech tracking and decomposition of coherence spectra and provides a better understanding of degraded speech processing.

Assuntos

Inteligibilidade da Fala , Percepção da Fala , Humanos , Inteligibilidade da Fala/fisiologia , Percepção da Fala/fisiologia , Encéfalo/fisiologia , Percepção Auditiva , Cognição , Estimulação Acústica

20.

Detecting cortical responses to continuous running speech using EEG data from only one channel.

Aljarboa, Ghadah S; Bell, Steve L; Simpson, David M.

Int J Audiol ; 62(3): 199-208, 2023 03.

Artigo em Inglês | MEDLINE | ID: mdl-35152811

RESUMO

OBJECTIVE: To explore the detection of cortical responses to continuous speech using a single EEG channel. Particularly, to compare detection rates and times using a cross-correlation approach and parameters extracted from the temporal response function (TRF). DESIGN: EEG from 32-channels were recorded whilst presenting 25-min continuous English speech. Detection parameters were cross-correlation between speech and EEG (XCOR), peak value and power of the TRF filter (TRF-peak and TRF-power), and correlation between predicted TRF and true EEG (TRF-COR). A bootstrap analysis was used to determine response statistical significance. Different electrode configurations were compared: Using single channels Cz or Fz, or selecting channels with the highest correlation value. STUDY SAMPLE: Seventeen native English-speaking subjects with mild-to-moderate hearing loss. RESULTS: Significant cortical responses were detected from all subjects at Fz channel with XCOR and TRF-COR. Lower detection time was seen for XCOR (mean = 4.8 min) over TRF parameters (best TRF-COR, mean = 6.4 min), with significant time differences from XCOR to TRF-peak and TRF-power. Analysing multiple EEG channels and testing channels with the highest correlation between envelope and EEG reduced detection sensitivity compared to Fz alone. CONCLUSIONS: Cortical responses to continuous speech can be detected from a single channel with recording times that may be suitable for clinical application.

Assuntos

Perda Auditiva , Percepção da Fala , Humanos , Eletroencefalografia , Fala , Percepção da Fala/fisiologia

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA