A machine learning perspective on the emotional content of Parkinsonian speech.

Sechidis, Konstantinos; Fusaroli, Riccardo; Orozco-Arroyave, Juan Rafael; Wolf, Detlef; Zhang, Yan-Ping

Sechidis, Konstantinos; Fusaroli, Riccardo; Orozco-Arroyave, Juan Rafael; Wolf, Detlef; Zhang, Yan-Ping.

Afiliação

Sechidis K; Roche Pharmaceutical Research & Early Development Informatics, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Basel, 4070, Switzerland. Electronic address: kostas.sechidis@novartis.com.
Fusaroli R; School of Communication and Culture & the Interacting Minds Centre, Aarhus University, Aarhus, Denmark. Electronic address: fusaroli@cas.au.dk.
Orozco-Arroyave JR; Faculty of Engineering, Universidad de Antioquia UdeA, Medellín, 1226, Colombia; Pattern Recognition Laboratory, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, 91054, Germany. Electronic address: rafael.orozco@udea.edu.co.
Wolf D; Roche Pharmaceutical Research & Early Development Informatics, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Basel, 4070, Switzerland. Electronic address: detlef.wolf@roche.com.
Zhang YP; Roche Pharmaceutical Research & Early Development Informatics, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Basel, 4070, Switzerland. Electronic address: yan-ping.zhang_schaerer@roche.com.

Artif Intell Med ; 115: 102061, 2021 05.

Article em En | MEDLINE | ID: mdl-34001321

RESUMO

Patients with Parkinson's disease (PD) have distinctive voice patterns, often perceived as expressing sad emotion. While this characteristic of Parkinsonian speech has been supported through the perspective of listeners, where both PD and healthy control (HC) subjects repeat the same speaking tasks, it has never been explored through a machine learning modelling approach. Our work provides an objective evaluation of this characteristic of the PD speech, by building a transfer learning system to assess how the PD pathology affects the sadness perception. To do so we introduce a Mixture-of-Experts (MoE) architecture for speech emotion recognition designed to be transferable across datasets. Firstly, by relying on publicly available emotional speech corpora, we train the MoE model and then we use it to quantify perceived sadness in never seen before PD and matched HC speech recordings. To build our models (experts), we extracted spectral features of the voicing parts of speech and we trained a gradient boosting decision trees model in each corpus to predict happiness vs. sadness. MoE predictions are created by weighting each expert's prediction according to the distance between the new sample and the expert-specific training samples. The MoE approach systematically infers more negative emotional characteristics in PD speech than in HC. Crucially, these judgments are related to the disease severity and the severity of speech impairment in the PD patients: the more impairment, the more likely the speech is to be judged as sad. Our findings pave the way towards a better understanding of the characteristics of PD speech and show how publicly available datasets can be used to train models that provide interesting insights on clinical data.

Assuntos

Doença de Parkinson; Fala; Emoções; Felicidade; Humanos; Aprendizado de Máquina

Palavras-chave

Machine learning; Mixture-of-experts; Parkinson's disease; Speech emotion recognition

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doença de Parkinson / Fala Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doença de Parkinson / Fala Idioma: En Ano de publicação: 2021 Tipo de documento: Article