Your browser doesn't support javascript.
loading
Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network.
Worasawate, Denchai; Asawaponwiput, Warisara; Yoshimura, Natsue; Intarapanich, Apichart; Surangsrirat, Decho.
Afiliação
  • Worasawate D; Department of Electrical Engineering, Faculty of Engineering, Kasetsart University, Bangkok, Thailand.
  • Asawaponwiput W; Department of Electrical Engineering, Faculty of Engineering, Kasetsart University, Bangkok, Thailand.
  • Yoshimura N; Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, Japan.
  • Intarapanich A; Educational Technology Team, National Electronics and Computer Technology Center, Pathum Thani, Thailand.
  • Surangsrirat D; Assistive Technology and Medical Devices Research Center, National Science and Technology Development Agency, Pathum Thani, Thailand.
Technol Health Care ; 31(2): 705-718, 2023.
Article em En | MEDLINE | ID: mdl-36155539
ABSTRACT

BACKGROUND:

Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment.

OBJECTIVE:

Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used.

METHODS:

A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples.

RESULTS:

Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 ± 0.1%, 98.6 ± 0.2%, and 99.3 ± 0.1%, respectively.

CONCLUSIONS:

We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doença de Parkinson / Voz / Doenças Neurodegenerativas Limite: Humans Idioma: En Revista: Technol Health Care Assunto da revista: ENGENHARIA BIOMEDICA / SERVICOS DE SAUDE Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Tailândia

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Doença de Parkinson / Voz / Doenças Neurodegenerativas Limite: Humans Idioma: En Revista: Technol Health Care Assunto da revista: ENGENHARIA BIOMEDICA / SERVICOS DE SAUDE Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Tailândia