Your browser doesn't support javascript.
loading
Automated Measures of Syntactic Complexity in Natural Speech Production: Older and Younger Adults as a Case Study.
Agmon, Galit; Pradhan, Sameer; Ash, Sharon; Nevler, Naomi; Liberman, Mark; Grossman, Murray; Cho, Sunghye.
Afiliação
  • Agmon G; Frontotemporal Degeneration Center, Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia.
  • Pradhan S; Linguistic Data Consortium, University of Pennsylvania, Philadelphia.
  • Ash S; Frontotemporal Degeneration Center, Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia.
  • Nevler N; Frontotemporal Degeneration Center, Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia.
  • Liberman M; Linguistic Data Consortium, University of Pennsylvania, Philadelphia.
  • Grossman M; Frontotemporal Degeneration Center, Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia.
  • Cho S; Linguistic Data Consortium, University of Pennsylvania, Philadelphia.
J Speech Lang Hear Res ; 67(2): 545-561, 2024 Feb 12.
Article em En | MEDLINE | ID: mdl-38215342
ABSTRACT

PURPOSE:

Multiple methods have been suggested for quantifying syntactic complexity in speech. We compared eight automated syntactic complexity metrics to determine which best captured verified syntactic differences between old and young adults.

METHOD:

We used natural speech samples produced in a picture description task by younger (n = 76, ages 18-22 years) and older (n = 36, ages 53-89 years) healthy participants, manually transcribed and segmented into sentences. We manually verified that older participants produced fewer complex structures. We developed a metric of syntactic complexity using automatically extracted syntactic structures as features in a multidimensional metric. We compared our metric to seven other metrics Yngve score, Frazier score, Frazier-Roark score, developmental level, syntactic frequency, mean dependency distance, and sentence length. We examined the success of each metric in identifying the age group using logistic regression models. We repeated the analysis with automatic transcription and segmentation using an automatic speech recognition (ASR) system.

RESULTS:

Our multidimensional metric was successful in predicting age group (area under the curve [AUC] = 0.87), and it performed better than the other metrics. High AUCs were also achieved by the Yngve score (0.84) and sentence length (0.84). However, in a fully automated pipeline with ASR, the performance of these two metrics dropped (to 0.73 and 0.46, respectively), while the performance of the multidimensional metric remained relatively high (0.81).

CONCLUSIONS:

Syntactic complexity in spontaneous speech can be quantified by directly assessing syntactic structures and considering them in a multivariable manner. It can be derived automatically, saving considerable time and effort compared to manually analyzing large-scale corpora, while maintaining high face validity and robustness. SUPPLEMENTAL

MATERIAL:

https//doi.org/10.23641/asha.24964179.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fala / Percepção da Fala Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fala / Percepção da Fala Idioma: En Ano de publicação: 2024 Tipo de documento: Article