Listening like a speech-training app: Expert and non-expert listeners' goodness ratings of children's speech.

Strömbergsson, Sofia; Fröjdh, Molly; Pettersson, Magdalena; Grósz, Tamás; Getman, Yaroslav; Kurimo, Mikko

Affiliation

Strömbergsson S; Division of Speech and Language Pathology, Department of Clinical Sciences, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden.
Fröjdh M; Division of Speech and Language Pathology, Department of Clinical Sciences, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden.
Pettersson M; Division of Speech and Language Pathology, Department of Clinical Sciences, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden.
Grósz T; Department of Information and Communications Engineering, Aalto University, Espoo, Finland.
Getman Y; Department of Information and Communications Engineering, Aalto University, Espoo, Finland.
Kurimo M; Department of Information and Communications Engineering, Aalto University, Espoo, Finland.

Clin Linguist Phon ; : 1-22, 2024 Jun 09.

Article in English | MEDLINE | ID: mdl-38853471

ABSTRACT

Speech training apps are being developed that provide automatic feedback concerning children's production of known target words, as a score on a 1-5 scale. However, this 'goodness' scale is still poorly understood. We investigated listeners' ratings of 'how many stars the app should provide as feedback' on children's utterances, and whether listener agreement is affected by clinical experience and/or access to anchor stimuli. In addition, we explored the association between goodness ratings and clinical measures of speech accuracy: the Percentage of Consonants Correct (PCC) and the Percentage of Phonemes Correct (PPC). Twenty speech-language pathologists and 20 non-expert listeners participated; half of the listeners in each group had access to anchor stimuli. The listeners rated 120 words, collected from children with and without speech sound disorder. Concerning reliability, intra-rater agreement was generally high, whereas inter-rater agreement was moderate. Access to anchor stimuli was associated with higher agreement, but only for non-expert listeners. Concerning the association between goodness ratings and the PCC/PPC, correlations were moderate for both listener groups, under both conditions. The results indicate that the task of rating goodness is difficult, regardless of clinical experience, and that access to anchor stimuli is insufficient for achieving reliable ratings. This raises concerns regarding the 1-5 rating scale as the means of feedback in speech training apps. More specific listener instructions, particularly regarding the intended context for the app, are suggested for the collection of human ratings underlying the development of speech training apps. Until then, alternative means of feedback should be preferred.
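
For orientation, the two accuracy measures named in the abstract are conventionally defined as the share of target consonants (PCC) or of all target phonemes (PPC) that the child produces correctly, expressed as a percentage. The sketch below is a simplified illustration only, not the authors' scoring procedure: it assumes the target and produced transcriptions are already aligned segment by segment, and the function name `percent_correct` and the consonant inventory `CONSONANTS` are hypothetical.

```python
def percent_correct(target, produced, units=None):
    """Percentage of target segments that were produced correctly.

    target, produced: aligned lists of phoneme symbols of equal length
                      (assumption: alignment was done beforehand;
                      None in `produced` marks an omitted segment).
    units: optional set of symbols to score, e.g. consonants for PCC;
           None scores every target segment (PPC).
    """
    if len(target) != len(produced):
        raise ValueError("target and produced must be aligned to equal length")
    scored = [(t, p) for t, p in zip(target, produced)
              if units is None or t in units]
    if not scored:
        raise ValueError("no scorable segments in target")
    correct = sum(1 for t, p in scored if t == p)
    return 100.0 * correct / len(scored)


# Hypothetical, illustrative consonant inventory (not from the article).
CONSONANTS = set("ptkbdgfsvhmnlrj")

# Toy example: target /k a t/ produced as [t a t].
target = ["k", "a", "t"]
produced = ["t", "a", "t"]

pcc = percent_correct(target, produced, CONSONANTS)  # 1 of 2 consonants correct
ppc = percent_correct(target, produced)              # 2 of 3 phonemes correct
```

In this toy case one of the two target consonants is correct, so the PCC is 50, while two of the three target phonemes are correct, so the PPC is about 66.7.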

Keywords

Speech accuracy; automatic assessment; perceptual assessment; speech sound disorder

Full text: 1 Collections: 01-internacional Database: MEDLINE Language: En Year of publication: 2024 Document type: Article
