Decoding speech sounds from neurophysiological data: Practical considerations and theoretical implications.

Sarrett, McCall E; Toscano, Joseph C

Affiliation

Sarrett ME; Department of Psychological and Brain Sciences, Villanova University, Villanova, Pennsylvania, USA.
Toscano JC; Psychology Department, Gonzaga University, Spokane, Washington, USA.

Psychophysiology ; 61(4): e14475, 2024 Apr.

Article in English | MEDLINE | ID: mdl-37947235

ABSTRACT

Machine learning techniques have proven to be useful tools in cognitive neuroscience. However, their use with scalp-recorded electroencephalography (EEG) is relatively limited. To address this, we present three analyses using data from a previous study that examined event-related potential (ERP) responses to a wide range of naturally produced speech sounds. First, we explore which features of the EEG signal maximize machine learning accuracy for a voicing distinction, using a support vector machine (SVM). We manipulate three dimensions of the EEG signal as input to the SVM: number of trials averaged, number of time points averaged, and polynomial fit. We discuss the trade-offs in using different feature sets and offer some recommendations for researchers using machine learning. Next, we use SVMs to classify specific pairs of phonemes, finding that we can detect differences in the EEG signal that are not otherwise detectable using conventional ERP analyses. Finally, we characterize the timecourse of phonetic feature decoding across three phonological dimensions (voicing, manner of articulation, and place of articulation), and find that voicing and manner are decodable from neural activity, whereas place of articulation is not. This set of analyses addresses practical considerations in applying machine learning to EEG, particularly for speech studies, and sheds light on current issues regarding the nature of perceptual representations of speech.
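The three input dimensions named in the abstract (number of trials averaged, number of time points averaged, and polynomial fit) can be illustrated with a minimal NumPy sketch. This is not the authors' pipeline: the function names, the array layout `(n_trials, n_channels, n_times)`, the non-overlapping grouping, and the reading of "polynomial fit" as per-channel polynomial coefficients over time are all assumptions made for illustration, and the data are synthetic.

```python
import numpy as np
from numpy.polynomial import polynomial as P


def average_trials(epochs, n_avg):
    """Average non-overlapping groups of n_avg trials (hypothetical helper).

    epochs has shape (n_trials, n_channels, n_times); leftover trials
    that do not fill a complete group are dropped.
    """
    n_groups = epochs.shape[0] // n_avg
    trimmed = epochs[: n_groups * n_avg]
    grouped = trimmed.reshape(n_groups, n_avg, *epochs.shape[1:])
    return grouped.mean(axis=1)


def average_time_bins(epochs, bin_size):
    """Average consecutive, non-overlapping bins of bin_size time points."""
    n_bins = epochs.shape[-1] // bin_size
    trimmed = epochs[..., : n_bins * bin_size]
    binned = trimmed.reshape(*epochs.shape[:-1], n_bins, bin_size)
    return binned.mean(axis=-1)


def polynomial_features(epochs, degree):
    """Fit a polynomial over time per trial and channel (assumed reading).

    Returns a (n_trials, n_channels * (degree + 1)) feature matrix of
    coefficients, ordered from lowest to highest power within each channel.
    """
    n_trials, n_channels, n_times = epochs.shape
    t = np.linspace(-1.0, 1.0, n_times)          # rescaled time axis
    flat = epochs.reshape(-1, n_times).T          # (n_times, trials * channels)
    coefs = P.polyfit(t, flat, degree)            # (degree + 1, trials * channels)
    return coefs.T.reshape(n_trials, n_channels * (degree + 1))


# Synthetic stand-in for epoched EEG: 40 trials, 4 channels, 100 samples.
rng = np.random.default_rng(0)
epochs = rng.normal(size=(40, 4, 100))

averaged = average_trials(epochs, n_avg=8)        # fewer, less noisy trials
binned = average_time_bins(averaged, bin_size=10) # coarser time resolution
features = polynomial_features(averaged, degree=2)
```

Each row of `features` (or of `binned` flattened across channels and time bins) could then serve as one observation for a classifier such as scikit-learn's `SVC`; that step is omitted here. The sketch makes the trade-off visible: averaging more trials reduces noise per observation but leaves fewer observations to train on.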

Subjects

Phonetics; Speech Perception; Humans; Speech Perception/physiology; Speech/physiology; Evoked Potentials; Electroencephalography/methods

Keywords

Analysis/Statistical Methods; Auditory Processes; EEG; ERPs; Language/Speech; Machine Learning

Full text: 1 | Database: MEDLINE | Main subject: Speech Perception / Phonetics | Limit: Humans | Language: English | Journal: Psychophysiology | Year of publication: 2024 | Document type: Article | Country of affiliation: United States
