The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.
Vojtech, Jennifer M; Noordzij, Jacob P; Cler, Gabriel J; Stepp, Cara E.
Affiliation
  • Vojtech JM; Department of Biomedical Engineering, Boston University, MA.
  • Noordzij JP; Department of Speech, Language, and Hearing Sciences, Boston University, MA.
  • Cler GJ; Department of Biomedical Engineering, Boston University, MA.
  • Stepp CE; Department of Speech, Language, and Hearing Sciences, Boston University, MA.
Am J Speech Lang Pathol ; 28(2S): 875-886, 2019 07 15.
Article in English | MEDLINE | ID: mdl-31306599
Purpose: This study investigated how modulating fundamental frequency (f0) and speech rate differentially impact the naturalness, intelligibility, and communication efficiency of synthetic speech.
Method: Sixteen sentences of varying prosodic content were developed via a speech synthesizer. The f0 contour and speech rate of these sentences were altered to produce 4 stimulus sets: (a) normal rate with a fixed f0 level, (b) slow rate with a fixed f0 level, (c) normal rate with prosodically natural f0 variation, and (d) normal rate with prosodically unnatural f0 variation. Sixteen listeners provided orthographic transcriptions and judgments of naturalness for these stimuli.
Results: Sentences with f0 variation were rated as more natural than those with a fixed f0 level. Conversely, sentences with a fixed f0 level demonstrated higher intelligibility than those with f0 variation. Speech rate did not affect the intelligibility of stimuli with a fixed f0 level. Communication efficiency was highest for sentences produced at a normal rate and a fixed f0 level.
Conclusions: Sentence-level f0 variation increased naturalness ratings of synthesized speech, whether the variation was prosodically natural or not. However, these f0 variations reduced intelligibility. There is evidence of a trade-off in naturalness and intelligibility of synthesized speech, which may impact future speech synthesis designs.
Supplemental Material: https://doi.org/10.23641/asha.8847833
Subject(s)

Full text: 1
Collection: 01-international
Database: MEDLINE
Main subject: Speech Intelligibility / Voice Quality / Computer Simulation
Limits: Adolescent / Adult / Female / Humans / Male
Language: En
Journal: Am J Speech Lang Pathol
Journal subject: SPEECH-LANGUAGE PATHOLOGY
Year: 2019
Document type: Article
Country of publication: United States
