Multidirectional regression (MDR)-based features for automatic voice disorder detection.

Muhammad, Ghulam; Mesallam, Tamer A; Malki, Khalid H; Farahat, Mohamed; Mahmood, Awais; Alsulaiman, Mansour

Muhammad, Ghulam; Mesallam, Tamer A; Malki, Khalid H; Farahat, Mohamed; Mahmood, Awais; Alsulaiman, Mansour.

Afiliación

Muhammad G; Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia. ghulam@ksu.edu.sa

J Voice ; 26(6): 817.e19-27, 2012 Nov.

Article en En | MEDLINE | ID: mdl-23177748

ABSTRACT

ABSTRACT

BACKGROUND AND

OBJECTIVE:

Objective assessment of voice pathology has a growing interest nowadays. Automatic speech/speaker recognition (ASR) systems are commonly deployed in voice pathology detection. The aim of this work was to develop a novel feature extraction method for ASR that incorporates distributions of voiced and unvoiced parts, and voice onset and offset characteristics in a time-frequency domain to detect voice pathology. MATERIALS AND

METHODS:

The speech samples of 70 dysphonic patients with six different types of voice disorders and 50 normal subjects were analyzed. The Arabic spoken digits (1-10) were taken as an input. The proposed feature extraction method was embedded into the ASR system with Gaussian mixture model (GMM) classifier to detect voice disorder.

RESULTS:

Accuracy of 97.48% was obtained in text independent (all digits' training) case, and over 99% accuracy was obtained in text dependent (separate digit's training) case. The proposed method outperformed the conventional Mel frequency cepstral coefficient (MFCC) features.

CONCLUSION:

The results of this study revealed that incorporating voice onset and offset information leads to efficient automatic voice disordered detection.

Asunto(s)

Acústica; Modelos Estadísticos; Procesamiento de Señales Asistido por Computador; Acústica del Lenguaje; Medición de la Producción del Habla; Trastornos de la Voz/diagnóstico; Calidad de la Voz; Adolescente; Adulto; Algoritmos; Automatización; Estudios de Casos y Controles; Femenino; Humanos; Modelos Lineales; Masculino; Persona de Mediana Edad; Reconocimiento de Normas Patrones Automatizadas; Valor Predictivo de las Pruebas; Espectrografía del Sonido; Factores de Tiempo; Trastornos de la Voz/fisiopatología; Adulto Joven

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Acústica del Lenguaje / Medición de la Producción del Habla / Calidad de la Voz / Acústica / Procesamiento de Señales Asistido por Computador / Trastornos de la Voz / Modelos Estadísticos Tipo de estudio: Diagnostic_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Límite: Adolescent / Adult / Female / Humans / Male / Middle aged Idioma: En Revista: J Voice Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2012 Tipo del documento: Article País de afiliación: Arabia Saudita

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google