Automatic intelligibility assessment of pathologic speech over the telephone.
Logoped Phoniatr Vocol
; 36(4): 175-81, 2011 Dec.
Article
in En
| MEDLINE
| ID: mdl-21875389
ABSTRACT
Objective assessment of intelligibility on the telephone is desirable for voice and speech assessment and rehabilitation. A total of 82 patients after partial laryngectomy read a standardized text which was synchronously recorded by a headset and via telephone. Five experienced raters assessed intelligibility perceptually on a five-point scale. Objective evaluation was performed by support vector regression on the word accuracy (WA) and word correctness (WR) of a speech recognition system, and a set of prosodic features. WA and WR alone exhibited correlations to human evaluation between |r| = 0.57 and |r| = 0.75. The correlation was r = 0.79 for headset and r = 0.86 for telephone recordings when prosodic features and WR were combined. The best feature subset was optimal for both signal qualities. It consists of WR, the average duration of the silent pauses before a word, the standard deviation of the fundamental frequency on the entire sample, the standard deviation of jitter, and the ratio of the durations of the voiced sections and the entire recording.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Speech Acoustics
/
Speech Intelligibility
/
Speech Perception
/
Telephone
/
Voice Quality
/
Signal Processing, Computer-Assisted
/
Speech Recognition Software
/
Laryngectomy
Type of study:
Health_economic_evaluation
Limits:
Adult
/
Aged
/
Aged80
/
Female
/
Humans
/
Male
/
Middle aged
Country/Region as subject:
Europa
Language:
En
Journal:
Logoped Phoniatr Vocol
Journal subject:
PATOLOGIA DA FALA E LINGUAGEM
Year:
2011
Document type:
Article
Affiliation country:
Germany