Speech fine structure contains critical temporal cues to support speech segmentation.

Teng, Xiangbin; Cogan, Gregory B; Poeppel, David

Affiliation

Teng X; Department of Neuroscience, Max-Planck-Institute for Empirical Aesthetics, Frankfurt, 60322, Germany. Electronic address: xiangbin.teng@gmail.com.
Cogan GB; Department of Neurosurgery, Duke University, Durham, NC, USA, 27710.
Poeppel D; Department of Neuroscience, Max-Planck-Institute for Empirical Aesthetics, Frankfurt, 60322, Germany; Department of Psychology, New York University, New York, NY, USA, 10003.

Neuroimage; 202: 116152, 2019 Nov 15.

Article in English | MEDLINE | ID: mdl-31484039

ABSTRACT

Segmenting the continuous speech stream into units for further perceptual and linguistic analyses is fundamental to speech recognition. The speech amplitude envelope (SE) has long been considered a fundamental temporal cue for segmenting speech. Does the temporal fine structure (TFS), a significant part of speech signals often considered to contain primarily spectral information, contribute to speech segmentation? Using magnetoencephalography, we show that the TFS entrains cortical responses between 3 and 6 Hz and demonstrate, using mutual information analysis, that (i) the temporal information in the TFS can be reconstructed from a measure of frame-to-frame spectral change and correlates with the SE and (ii) that spectral resolution is key to the extraction of such temporal information. Furthermore, we show behavioural evidence that, when the SE is temporally distorted, the TFS provides cues for speech segmentation and aids speech recognition significantly. Our findings show that it is insufficient to investigate solely the SE to understand temporal speech segmentation, as the SE and the TFS derived from a band-filtering method convey comparable, if not inseparable, temporal information. We argue for a more synthetic view of speech segmentation - the auditory system groups speech signals coherently in both temporal and spectral domains.

Subjects

Cues; Speech Acoustics; Speech Intelligibility/physiology; Speech Perception/physiology; Adult; Female; Humans; Information Theory; Magnetoencephalography; Male; Recognition, Psychology; Signal Processing, Computer-Assisted; Time Factors; Young Adult

Keywords

Cortical entrainment; Spectral correlation; Spectro-temporal; Speech segmentation

Full text: 1 | Database: MEDLINE | Main subject: Speech Acoustics / Speech Intelligibility / Speech Perception / Cues | Language: English | Publication year: 2019 | Document type: Article
