The Effect of Noise on Deep Learning for Classification of Pathological Voice.

Hasebe, Koki; Fujimura, Shintaro; Kojima, Tsuyoshi; Tamura, Keiichi; Kawai, Yoshitaka; Kishimoto, Yo; Omori, Koichi

Hasebe, Koki; Fujimura, Shintaro; Kojima, Tsuyoshi; Tamura, Keiichi; Kawai, Yoshitaka; Kishimoto, Yo; Omori, Koichi.

Afiliación

Hasebe K; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Fujimura S; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Kojima T; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Tamura K; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Kawai Y; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Kishimoto Y; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Omori K; Department of Otolaryngology, Head and Neck Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan.

Laryngoscope ; 134(8): 3537-3541, 2024 Aug.

Article en En | MEDLINE | ID: mdl-38280184

ABSTRACT

ABSTRACT

OBJECTIVE:

This study aimed to evaluate the significance of background noise in machine learning models assessing the GRBAS scale for voice disorders.

METHODS:

A dataset of 1406 voice samples was collected from retrospective data, and a 5-layer 1D convolutional neural network (CNN) model was constructed using TensorFlow. The dataset was divided into training, validation, and test data. Gaussian noise was added to test samples at various intensities to assess the model's noise resilience. The model's performance was evaluated using accuracy, F1 score, and quadratic weighted Cohen's kappa score.

RESULTS:

The model's performance on the GRBAS scale generally declined with increasing noise intensities. For the G scale, accuracy dropped from 70.9% (original) to 8.5% (at the highest noise), F1 score from 69.2% to 1.3%, and Cohen's kappa from 0.679 to 0.0. Similar declines were observed for the remaining RBAS components.

CONCLUSION:

The model's performance was affected by background noise, with substantial decreases in evaluation metrics as noise levels intensified. Future research should explore noise-tolerant techniques, such as data augmentation, to improve the model's noise resilience in real-world settings. LEVEL OF EVIDENCE This study evaluates a machine learning model using a single dataset without comparative controls. Given its non-comparative design and specific focus, it aligns with Level 4 evidence (Case-series) under the 2011 OCEBM guidelines Laryngoscope, 1343537-3541, 2024.

Asunto(s)

Aprendizaje Profundo; Ruido; Trastornos de la Voz; Humanos; Estudios Retrospectivos; Trastornos de la Voz/diagnóstico; Trastornos de la Voz/fisiopatología; Trastornos de la Voz/etiología; Calidad de la Voz/fisiología; Masculino; Femenino; Redes Neurales de la Computación

Palabras clave

1DCNN; GRBAS scale; machine learning; noise resilience; voice disorders

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Trastornos de la Voz / Aprendizaje Profundo / Ruido Tipo de estudio: Prognostic_studies Límite: Female / Humans / Male Idioma: En Revista: Laryngoscope Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2024 Tipo del documento: Article País de afiliación: Japón

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google