Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.
JASA Express Lett
; 4(2)2024 02 01.
Article
em En
| MEDLINE
| ID: mdl-38299984
ABSTRACT
The effects of different acoustic representations and normalizations were compared for classifiers predicting perception of children's rhotic versus derhotic /ɹ/. Formant and Mel frequency cepstral coefficient (MFCC) representations for 350 speakers were z-standardized, either relative to values in the same utterance or age-and-sex data for typical /ɹ/. Statistical modeling indicated age-and-sex normalization significantly increased classifier performances. Clinically interpretable formants performed similarly to MFCCs and were endorsed for deep neural network engineering, achieving mean test-participant-specific F1-score = 0.81 after personalization and replication (σx = 0.10, med = 0.83, n = 48). Shapley additive explanations analysis indicated the third formant most influenced fully rhotic predictions.
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Transtorno Fonológico
Tipo de estudo:
Diagnostic_studies
/
Prognostic_studies
/
Risk_factors_studies
Limite:
Child
/
Humans
Idioma:
En
Ano de publicação:
2024
Tipo de documento:
Article