Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 7.351
Filtrar
1.
Am J Speech Lang Pathol ; 33(4): 1930-1951, 2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-38838243

RESUMO

PURPOSE: This study investigated the effects of the SPEAK OUT! & LOUD Crowd therapy program on speaking rate, percent pause time, intelligibility, naturalness, and communicative participation in individuals with Parkinson's disease (PD). METHOD: Six adults with PD completed 12 individual SPEAK OUT! sessions across four consecutive weeks followed by group-based LOUD Crowd sessions for five consecutive weeks. Most therapy sessions were conducted via telehealth, with two participants completing the SPEAK OUT! portion in person. Speech samples were recorded at six time points: three baseline time points prior to SPEAK OUT!, two post-SPEAK OUT! time points, and one post-LOUD Crowd time point. Acoustic measures of speaking rate and percent pause time and listener ratings of speech intelligibility and naturalness were obtained for each time point. Participant self-ratings of communicative participation were also collected at pre- and posttreatment time points. RESULTS: Results showed significant improvement in communicative participation scores at a group level following completion of the SPEAK OUT! & LOUD Crowd treatment program. Two participants showed a significant decrease in speaking rate and increase in percent pause time following treatment. Changes in intelligibility and naturalness were not statistically significant. CONCLUSIONS: These findings provide preliminary support for the effectiveness of the SPEAK OUT! & LOUD Crowd treatment program in improving communicative participation for people with mild-to-moderate hypokinetic dysarthria secondary to PD. This study is also the first to demonstrate positive effects of this treatment program for people receiving the therapy via telehealth.


Assuntos
Doença de Parkinson , Inteligibilidade da Fala , Medida da Produção da Fala , Fonoterapia , Humanos , Doença de Parkinson/complicações , Doença de Parkinson/terapia , Masculino , Feminino , Idoso , Pessoa de Meia-Idade , Fonoterapia/métodos , Disartria/etiologia , Disartria/terapia , Disartria/reabilitação , Resultado do Tratamento , Acústica da Fala , Fatores de Tempo , Qualidade da Voz , Telemedicina
2.
J Acoust Soc Am ; 155(6): 3848-3860, 2024 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-38884524

RESUMO

The ability to accurately classify accents and assess accentedness in non-native speakers are challenging tasks due primarily to the complexity and diversity of accent and dialect variations. In this study, embeddings from advanced pretrained language identification (LID) and speaker identification (SID) models are leveraged to improve the accuracy of accent classification and non-native accentedness assessment. Findings demonstrate that employing pretrained LID and SID models effectively encodes accent/dialect information in speech. Furthermore, the LID and SID encoded accent information complement an end-to-end (E2E) accent identification (AID) model trained from scratch. By incorporating all three embeddings, the proposed multi-embedding AID system achieves superior accuracy in AID. Next, leveraging automatic speech recognition (ASR) and AID models is investigated to explore accentedness estimation. The ASR model is an E2E connectionist temporal classification model trained exclusively with American English (en-US) utterances. The ASR error rate and en-US output of the AID model are leveraged as objective accentedness scores. Evaluation results demonstrate a strong correlation between scores estimated by the two models. Additionally, a robust correlation between objective accentedness scores and subjective scores based on human perception is demonstrated, providing evidence for the reliability and validity of using AID-based and ASR-based systems for accentedness assessment in non-native speech. Such advanced systems would benefit accent assessment in language learning as well as speech and speaker assessment for intelligibility, quality, and speaker diarization and speech recognition advancements.


Assuntos
Percepção da Fala , Interface para o Reconhecimento da Fala , Humanos , Percepção da Fala/fisiologia , Acústica da Fala , Fonética , Idioma , Medida da Produção da Fala/métodos , Feminino , Masculino
3.
J Acoust Soc Am ; 155(6): 3877-3888, 2024 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-38888391

RESUMO

The quality of speech input influences the efficiency of L1 and L2 acquisition. This study examined modifications in infant-directed speech (IDS) and foreigner-directed speech (FDS) in Standard Mandarin-a tonal language-and explored how IDS and FDS features were manifested in disyllabic words and a longer discourse. The study aimed to determine which characteristics of IDS and FDS were enhanced in comparison with adult-directed speech (ADS), and how IDS and FDS differed when measured in a common set of acoustic parameters. For words, it was found that tone-bearing vowel duration, mean and range of fundamental frequency (F0), and the lexical tone contours were enhanced in IDS and FDS relative to ADS, except for the dipping Tone 3 that exhibited an unexpected lowering in FDS, but no modification in IDS when compared with ADS. For the discourse, different aspects of temporal and F0 enhancements were emphasized in IDS and FDS: the mean F0 was higher in IDS whereas the total discourse duration was greater in FDS. These findings add to the growing literature on L1 and L2 speech input characteristics and their role in language acquisition.


Assuntos
Acústica da Fala , Humanos , Feminino , Masculino , Lactente , Adulto , Fonética , Medida da Produção da Fala/métodos , Adulto Jovem , Multilinguismo , Qualidade da Voz , Acústica , Idioma , Fatores de Tempo , Percepção da Fala
4.
Lang Speech Hear Serv Sch ; 55(3): 838-852, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38748925

RESUMO

PURPOSE: Comprehensive spoken language assessment should include the evaluation of language use in naturalistic contexts. Discourse elicitation and analysis provides the opportunity for such an evaluation to occur. In this article, our overall aim was to describe adolescents' language performance on four elicitation tasks and determine if there are task-related differences across the elicitation tasks. METHOD: Forty-four typically developing adolescents with ages ranging from 12;2 to 17;11 (years;months; M = 15;2; 21 boys and 23 girls) participated in the study. They completed four spoken discourse tasks: (a) story generation using a wordless picture book, (b) fable retell, (c) six personal narratives in response to emotion-based prompts, and (d) monologic response to two stories that contained a moral dilemma. Responses were transcribed and analyzed for four language performance measures tapping into language productivity, syntactic complexity, lexical diversity, and verbal facility. RESULTS: Despite individual variability in performance, mean scores were close to median scores for most measures, suggesting a symmetrical distribution. As expected, all four language performance measures were significantly different across the four elicitation tasks. The personal narrative task elicited the longest samples, with the highest verbal fluency. In contrast, both lexical diversity and syntactic complexity were the strongest in response to the fable retell and the moral dilemma tasks. CONCLUSIONS: This investigation provides speech-language pathologists with an overview of how task-related factors may impact adolescent language performance. These findings may be used to support their clinical decision-making processes in choosing a suitable discourse task when conducting a comprehensive spoken language assessment. Three hypothetical case examples are used to illustrate the decision-making process. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25761768.


Assuntos
Testes de Linguagem , Humanos , Adolescente , Masculino , Feminino , Criança , Narração , Idioma , Medida da Produção da Fala/métodos
5.
Lang Speech Hear Serv Sch ; 55(3): 870-883, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38758707

RESUMO

PURPOSE: Transcription of conjoined independent clauses within language samples varies across professionals. Some transcribe these clauses as two separate utterances, whereas others conjoin them within a single utterance. As an inquiry into equitable practice, we examined rates of conjoined independent clauses produced by children and the impact of separating these clauses within utterances on measures of mean length of utterance (MLU) by a child's English dialect, clinical status, and age. METHOD: The data were archival and included 246 language samples from children classified by their dialect (African American English or Southern White English) and clinical status (developmental language disorder [DLD] or typically developing [TD]), with those in the TD group further classified by their age (4 years [TD4] or 6 years [TD6]). RESULTS: Rates of conjoined independent clauses and the impact of these clauses on MLU varied by clinical status (DLD < TD) and age (TD4 < TD6), but not by dialect. Correlations between the rate of conjoined clauses, MLU, and language test scores were also similar across the two dialects. CONCLUSIONS: Transcription decisions regarding conjoined independent clauses within language samples lead to equitable measurement outcomes across dialects of English. Nevertheless, transcribing conjoined independent clauses as two separate utterances reduces one's ability to detect syntactic differences between children with and without DLD and document syntactic growth as children age. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25822675.


Assuntos
Linguagem Infantil , Transtornos do Desenvolvimento da Linguagem , Humanos , Criança , Pré-Escolar , Masculino , Feminino , Idioma , Medida da Produção da Fala/métodos , Testes de Linguagem , Negro ou Afro-Americano
6.
J Acoust Soc Am ; 155(5): 3206-3212, 2024 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-38738937

RESUMO

Modern humans and chimpanzees share a common ancestor on the phylogenetic tree, yet chimpanzees do not spontaneously produce speech or speech sounds. The lab exercise presented in this paper was developed for undergraduate students in a course entitled "What's Special About Human Speech?" The exercise is based on acoustic analyses of the words "cup" and "papa" as spoken by Viki, a home-raised, speech-trained chimpanzee, as well as the words spoken by a human. The analyses allow students to relate differences in articulation and vocal abilities between Viki and humans to the known anatomical differences in their vocal systems. Anatomical and articulation differences between humans and Viki include (1) potential tongue movements, (2) presence or absence of laryngeal air sacs, (3) presence or absence of vocal membranes, and (4) exhalation vs inhalation during production.


Assuntos
Pan troglodytes , Acústica da Fala , Fala , Humanos , Animais , Pan troglodytes/fisiologia , Fala/fisiologia , Língua/fisiologia , Língua/anatomia & histologia , Vocalização Animal/fisiologia , Especificidade da Espécie , Medida da Produção da Fala , Laringe/fisiologia , Laringe/anatomia & histologia , Fonética
7.
Am J Speech Lang Pathol ; 33(4): 1952-1964, 2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-38809826

RESUMO

PURPOSE: The current study compared temporal and spectral acoustic contrast between vowel segments produced by speakers with dysarthria across three speech tasks-interactive, solo habitual, and solo clear. METHOD: Nine speakers with dysarthria secondary to amyotrophic lateral sclerosis participated in the study. Each speaker was paired with a typical interlocutor over videoconferencing software. The speakers produced the vowels /i, ɪ, ɛ, æ/ in /h/-vowel-/d/ words. For the solo tasks, speakers read the stimuli aloud in both their habitual and clear speaking styles. For the interactive task, speakers produced a target stimulus for their interlocutor to select among the four possibilities. We measured the duration difference between long and short vowels, as well as the F1/F2 Euclidean distance between adjacent vowels, and also determined how well the vowels could be classified based on their acoustic characteristics. RESULTS: Temporal contrast between long and short vowels was higher in the interactive task than in both solo tasks. Spectral distance between adjacent vowel pairs was also higher for some pairs in the interactive task than the habitual speech task. Finally, vowel classification accuracy was highest in the interactive task. CONCLUSIONS: Overall, we found evidence that individuals with dysarthria produced vowels with greater acoustic contrast in structured interactions than they did in solo tasks. Furthermore, the speech adjustments they made to the vowel segments differed from those observed in solo speech.


Assuntos
Esclerose Lateral Amiotrófica , Disartria , Fonética , Acústica da Fala , Medida da Produção da Fala , Humanos , Disartria/etiologia , Disartria/fisiopatologia , Disartria/diagnóstico , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Esclerose Lateral Amiotrófica/complicações , Esclerose Lateral Amiotrófica/fisiopatologia , Inteligibilidade da Fala , Qualidade da Voz , Dados Preliminares , Espectrografia do Som , Fatores de Tempo , Idoso de 80 Anos ou mais , Acústica
8.
Codas ; 36(4): e20230047, 2024.
Artigo em Português, Inglês | MEDLINE | ID: mdl-38808777

RESUMO

PURPOSE: To compare the acoustic measurements of Cepstral Peak Prominence Smoothed (CPPS) and Acoustic Voice Quality Index (AVQI) of children with normal and altered voices, to relationship with auditory-perceptual judgment (APJ) and to establish cut-off points. METHODS: Vocal recordings of the sustained vowel and number counting tasks of 185 children were selected from a database and submitted to acoustic analysis with extraction of CPPS and AVQI measurements, and to APJ. The APJ was performed individually for each task, classified as normal or altered, and for the tasks together defining whether the child would pass or fail in a situation of vocal screening. RESULTS: Children with altered APJ and who failed the screening had lower CPPS values and higher AVQI values, than those with normal APJ and who passed the screening. The APJ of the sustained vowel task was related to CPPS and AVQI, and APJ of the number counting task was related only to AVQI and CPPS numbers. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the vowel CPPS, 7.62 for the CPPS numbers and 2.01 for the AVQI. CONCLUSION: Children with altered voices, have higher AVQI values and lower CPPS values, when detected in children with voices within the normal range. The acoustic measurements were related to the auditory perceptual judgment of vocal quality in the sustained vowel task, however, the number counting task was related only to the AVQI and CPPS. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the CPPS vowel, 7.62 for the CPPS numbers and 2.01 for the AVQI. The three measures were similar in identifying voices without deviation and dysphonic voices.


OBJETIVO: Comparar as medidas acústicas de Cepstral Peak Prominence Smoothed (CPPS) e Acoustic Voice Quality Index (AVQI) de crianças com vozes normais e alteradas, relacionar com o julgamento perceptivo-auditivo (JPA) da voz e estabelecer pontos de corte. MÉTODO: Gravações vocais das tarefas de vogal sustentada e contagem de números de 185 crianças foram selecionadas em um banco de dados e submetidas a análise acústica com extração das medidas de CPPS e AVQI, e ao JPA. O JPA foi realizado individualmente para cada tarefa e as amostras foram classificadas posteriormente como normal ou alterada, e para as tarefas em conjunto definindo-se se a criança passaria ou falharia em uma situação de triagem vocal. RESULTADOS: Crianças com JPA alterado e que falharam na triagem apresentaram valores menores de CPPS e maiores de AVQI, do que as com JPA normal e que passaram na triagem. O JPA da tarefa de vogal sustentada se relacionou ao CPPS e AVQI, e da tarefa de contagem de números relacionou-se apenas ao AVQI e CPPS números. Os pontos de corte que diferenciam crianças com e sem desvio vocal são 14,07 para o CPPS vogal, 7,62 para o CPPS números e 2,01 para o AVQI. CONCLUSÃO: Crianças com JPA alterado apresentaram maiores valores de AVQI e menores valores de CPPs. O JPA da tarefa de vogal previu todas as medidas acústicas, porém, de contagem previu apenas as medidas extraídas dela. As três medidas foram semelhantes na identificação de vozes sem desvio e vozes disfônicas.


Assuntos
Acústica da Fala , Qualidade da Voz , Humanos , Qualidade da Voz/fisiologia , Criança , Feminino , Masculino , Percepção Auditiva/fisiologia , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/fisiopatologia , Adolescente , Estudos de Casos e Controles , Medida da Produção da Fala , Julgamento
9.
J Speech Lang Hear Res ; 67(6): 1712-1730, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38749007

RESUMO

PURPOSE: The goal of this study was to assess various recording methods, including combinations of high- versus low-cost microphones, recording interfaces, and smartphones in terms of their ability to produce commonly used time- and spectral-based voice measurements. METHOD: Twenty-four vowel samples representing a diversity of voice quality deviations and severities from a wide age range of male and female speakers were played via a head-and-thorax model and recorded using a high-cost, research standard GRAS 40AF (GRAS Sound & Vibration) microphone and amplification system. Additional recordings were made using various combinations of headset microphones (AKG C555 L [AKG Acoustics GmbH], Shure SM35-XLR [Shure Incorporated], AVID AE-36 [AVID Products, Inc.]) and audio interfaces (Focusrite Scarlett 2i2 [Focusrite Audio Engineering Ltd.] and PC, Focusrite and smartphone, smartphone via a TRRS adapter), as well as smartphones direct (Apple iPhone 13 Pro, Google Pixel 6) using their built-in microphones. The effect of background noise from four different room conditions was also evaluated. Vowel samples were analyzed for measures of fundamental frequency, perturbation, cepstral peak prominence, and spectral tilt (low vs. high spectral ratio). RESULTS: Results show that a wide variety of recording methods, including smartphones with and without a low-cost headset microphone, can effectively track the wide range of acoustic characteristics in a diverse set of typical and disordered voice samples. Although significant differences in acoustic measures of voice may be observed, the presence of extremely strong correlations (rs > .90) with the recording standard implies a strong linear relationship between the results of different methods that may be used to predict and adjust any observed differences in measurement results. CONCLUSION: Because handheld smartphone distance and positioning may be highly variable when used in actual clinical recording situations, smartphone + a low-cost headset microphone is recommended as an affordable recording method that controls mouth-to-microphone distance and positioning and allows both hands to be available for manipulation of the smartphone device.


Assuntos
Smartphone , Acústica da Fala , Humanos , Feminino , Masculino , Adulto , Adulto Jovem , Medida da Produção da Fala/instrumentação , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Qualidade da Voz , Pessoa de Meia-Idade , Adolescente
10.
J Speech Lang Hear Res ; 67(6): 1660-1681, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38758676

RESUMO

PURPOSE: Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD: In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS: A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS: The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.


Assuntos
Disfonia , Acústica da Fala , Humanos , Feminino , Adulto , Disfonia/fisiopatologia , Disfonia/terapia , Estudos Retrospectivos , Adulto Jovem , Pessoa de Meia-Idade , Adolescente , Qualidade da Voz/fisiologia , Eletrodiagnóstico/métodos , Glote/fisiopatologia , Fonação/fisiologia , Prega Vocal/fisiopatologia , Treinamento da Voz , Medida da Produção da Fala/métodos
11.
J Acoust Soc Am ; 155(5): 3071-3089, 2024 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-38717213

RESUMO

This study investigated how 40 Chinese learners of English as a foreign language (EFL learners) differed from 40 native English speakers in the production of four English tense-lax contrasts, /i-ɪ/, /u-ʊ/, /ɑ-ʌ/, and /æ-ε/, by examining the acoustic measurements of duration, the first three formant frequencies, and the slope of the first formant movement (F1 slope). The dynamic formant trajectory was modeled using discrete cosine transform coefficients to demonstrate the time-varying properties of formant trajectories. A discriminant analysis was employed to illustrate the extent to which Chinese EFL learners relied on different acoustic parameters. This study found that: (1) Chinese EFL learners overemphasized durational differences and weakened spectral differences for the /i-ɪ/, /u-ʊ/, and /ɑ-ʌ/ pairs, although they maintained sufficient spectral differences for /æ-ε/. In contrast, native English speakers predominantly used spectral differences across all four pairs; (2) in non-low tense-lax contrasts, unlike native English speakers, Chinese EFL learners failed to exhibit different F1 slope values, indicating a non-nativelike tongue-root placement during the articulatory process. The findings underscore the contribution of dynamic spectral patterns to the differentiation between English tense and lax vowels, and reveal the influence of precise articulatory gestures on the realization of the tense-lax contrast.


Assuntos
Multilinguismo , Fonética , Acústica da Fala , Humanos , Masculino , Feminino , Adulto Jovem , Medida da Produção da Fala , Adulto , Idioma , Acústica , Aprendizagem , Qualidade da Voz , Espectrografia do Som , População do Leste Asiático
12.
J Acoust Soc Am ; 155(5): 3521-3536, 2024 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-38809098

RESUMO

This electromagnetic articulography study explores the kinematic profile of Intonational Phrase boundaries in Seoul Korean. Recent findings suggest that the scope of phrase-final lengthening is conditioned by word- and/or phrase-level prominence. However, evidence comes mainly from head-prominence languages, which conflate positions of word prosody with positions of phrasal prominence. Here, we examine phrase-final lengthening in Seoul Korean, an edge-prominence language with no word prosody, with respect to focus location as an index of phrase-level prominence and Accentual Phrase (AP) length as an index of word demarcation. Results show that phrase-final lengthening extends over the phrase-final syllable. The effect is greater the further away that focus occurs. It also interacts with the domains of AP and prosodic word: lengthening is greater in smaller APs, whereas shortening is observed in the initial gesture of the phrase-final word. Additional analyses of kinematic displacement and peak velocity revealed that Korean phrase-final gestures bear the kinematic profile of IP boundaries concurrently to what is typically considered prominence marking. Based on these results, a gestural coordination account is proposed, in which boundary-related events interact systematically with phrase-level prominence as well as lower prosodic levels, and how this proposal relates to the findings in head-prominence languages is discussed.


Assuntos
Fonética , Acústica da Fala , Humanos , Masculino , Feminino , Adulto Jovem , Fenômenos Biomecânicos , Adulto , Idioma , Gestos , Medida da Produção da Fala , República da Coreia , Qualidade da Voz , Fatores de Tempo
13.
J Commun Disord ; 109: 106428, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38744198

RESUMO

PURPOSE: This study examines whether there are differences in the speech of speakers with dysarthria, speakers with apraxia and healthy speakers in spectral acoustic measures during production of the central-peninsular Spanish alveolar sibilant fricative /s/. METHOD: To this end, production of the sibilant was analyzed in 20 subjects with dysarthria, 8 with apraxia of speech and 28 healthy speakers. Participants produced 12 sV(C) words. The variables compared across groups were the fricative's spectral amplitude difference (AmpD) and spectral moments in the temporal midpoint of fricative execution. RESULTS: The results indicate that individuals with dysarthria can be distinguished from healthy speakers in terms of the spectral characteristics AmpD, standard deviation (SD), center of gravity (CoG) and skewness, the last two in context with unrounded vowel, while no differences in kurtosis were detected. Participants with AoS group differ significantly from healthy speaker group in AmpD, SD and CoG and Kurtosis, the first one followed unrounded vowel and the latter two followed by rounded vowels. In addition, speakers with apraxia of speech group returned significant differences with respect to speakers with dysarthria group in AmpD, CoG and skewness. CONCLUSIONS: The differences found between the groups in the measures studied as a function of the type of vowel context could provide insights into the distinctive manifestations of motor speech disorders, contributing to the differential diagnosis between apraxia and dysarthria in motor control processes.


Assuntos
Apraxias , Disartria , Acústica da Fala , Humanos , Disartria/fisiopatologia , Disartria/etiologia , Apraxias/fisiopatologia , Masculino , Feminino , Pessoa de Meia-Idade , Adulto , Idoso , Fonética , Medida da Produção da Fala
14.
J Acoust Soc Am ; 155(4): 2836-2848, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38682915

RESUMO

This paper evaluates an innovative framework for spoken dialect density prediction on children's and adults' African American English. A speaker's dialect density is defined as the frequency with which dialect-specific language characteristics occur in their speech. Rather than treating the presence or absence of a target dialect in a user's speech as a binary decision, instead, a classifier is trained to predict the level of dialect density to provide a higher degree of specificity in downstream tasks. For this, self-supervised learning representations from HuBERT, handcrafted grammar-based features extracted from ASR transcripts, prosodic features, and other feature sets are experimented with as the input to an XGBoost classifier. Then, the classifier is trained to assign dialect density labels to short recorded utterances. High dialect density level classification accuracy is achieved for child and adult speech and demonstrated robust performance across age and regional varieties of dialect. Additionally, this work is used as a basis for analyzing which acoustic and grammatical cues affect machine perception of dialect.


Assuntos
Negro ou Afro-Americano , Acústica da Fala , Humanos , Adulto , Criança , Masculino , Feminino , Medida da Produção da Fala/métodos , Idioma , Pré-Escolar , Adulto Jovem , Percepção da Fala , Adolescente , Fonética , Linguagem Infantil
15.
J Speech Lang Hear Res ; 67(6): 1682-1711, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38662942

RESUMO

PURPOSE: Pitch variations (tone productions) have been reported as a measure to differentiate Cantonese-speaking children with and without childhood apraxia of speech (CAS). This study aims to examine fundamental frequency (F0) changes within syllables and the effects of syllable structure, lexical status, and syllable positions on F0 in Cantonese-speaking preschool children with and without CAS. METHOD: Six children with CAS, six children with non-CAS speech sound disorder plus language disorder (S&LD), 22 children with speech sound disorder only (SSD), and 63 children with typical speech-language development (TD) performed the tone sequencing task (TST). Growth curve analysis was employed to analyze and compare the F0 values within syllables with three Cantonese tones (high level, high rising, and low falling). The analysis considered the effects of syllable structure (vowel and consonant-vowel), lexical status (word and nonword), and syllable position (initial, medial, and final) on F0, as well as comparisons within and between groups. RESULTS: Within each group, the effects of syllable structure and position on F0 values were found with different patterns. Between-group comparisons showed that the CAS group had reduced F0 contrasts. The CAS group could be differentiated from the control groups based on interactions of F0 with syllable structure and position, but not lexical status. The dissimilarity of F0 values detected between the CAS and SSD/TD groups was more prominent than that observed between the CAS and S&LD groups. CONCLUSIONS: This study demonstrated that Cantonese-speaking children with CAS had difficulty in varying F0 within syllables as compared to those without CAS, suggesting pitch variation difficulty and language-specific impairment profiles in CAS. Future investigations of objective measures for identifying Cantonese speakers with CAS and cross-linguistic investigations using growth curve analysis and the TST are suggested.


Assuntos
Apraxias , Fonética , Humanos , Pré-Escolar , Apraxias/diagnóstico , Masculino , Feminino , Acústica da Fala , Transtorno Fonológico/diagnóstico , Medida da Produção da Fala/métodos , Idioma , Fala/fisiologia
16.
J Fluency Disord ; 80: 106058, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38636390

RESUMO

PURPOSE: To document the trajectory of early childhood stuttering longitudinally for 14. years with a consideration on the features of overt and covert stuttering related to recovery status. METHOD: Thirty-eight participants were observed longitudinally at three different time points: early childhood (Occasion 1), middle childhood (Occasion 2), and late adolescence (Occasion 3). Data collection involved speech samples and reports of stuttering experiences. Recovery on Occasion 3 was estimated through analysis of speech samples, parent and expert judgments, and self- judgement. Two categories of persistence were used: persistent-subjective (no observable stuttering) and persistent-objective (observable stuttering). RESULTS: The recovery rate was 65.6%. The majority of the participants showed minimal disfluent speech with 88% showing less than 1% syllables stuttered and 97% showing less than 3% syllables stuttered in the collected speech samples. All participants classified as persistent reported covert symptoms of stuttering. No relapses in recovery were observed between Occasion 2 and Occasion 3. Late recovery was only observed for those classified as persistent-subjective on Occasion 2. About 64% of the participants showing observable stuttering (persistent-objective) on Occasion 2 showed no observable stuttering (persistent-subjective) on Occasion 3. CONCLUSIONS: Children continue to recover from early childhood stuttering as they age.The inclusion of self-reports adds to the understanding of recovery especially concerning the covert stuttering behaviours. The presence of overt symptoms of stuttering in the speech samples of children aged 7 to 13 years seems to be associated with the likelihood of late recovery of stuttering.


Assuntos
Gagueira , Humanos , Gagueira/fisiopatologia , Masculino , Feminino , Criança , Adolescente , Estudos Prospectivos , Seguimentos , Estudos Longitudinais , Pré-Escolar , Medida da Produção da Fala
17.
J Speech Lang Hear Res ; 67(6): 1643-1659, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38683058

RESUMO

PURPOSE: The aim of this study was to determine (a) diagnostic accuracy of acoustic measures of glottal stop production (GSP; intensity differences, slopes, complete voicing cessation) to distinguish between unilateral vocal fold paresis/paralysis (UVFP) patients and controls; (b) if acoustic measures of GSP significantly correlated with an acoustic measure of voice disorder severity, acoustic voice quality index (AVQI); and (c) if acoustic measures from another type of voicing cessation, voiceless consonant production, also significantly differed between groups. METHOD: Ninety-seven patients with unilateral paresis/paralysis and 35 controls with normal laryngostroboscopic signs produced two sets of five repeated [i] and four repeated [isi]. Tokens were randomized by type between groups and analyzed blinded using a customized Praat program that computed intensity differences and slopes between vowel maxima and glottal stop minima for inter-[i] tokens and vowel maxima and voiceless consonant minima for intra-[isi] tokens. The number of voicing cessations for inter-[i] tokens was obtained. RESULTS: Onset and offset intensity differences and number of voicing cessations from inter-[i] tokens had the greatest areas under the curve (.854, .856, and .835, respectively). Correlation coefficients were significant (p < .01) between AVQI and all GSP acoustic measures with weak/medium effect sizes. No significant differences were found between controls and participants with UVFP for acoustic measures from intra-[isi]. CONCLUSIONS: Acoustic GSP measures demonstrated good diagnostic accuracy and some relationship to severity of voice disorder. No significant differences in acoustic measures for medial voiceless fricative consonants between controls and participants with UVFP suggested that voicing cessation for voiceless fricatives differs from voicing cessation for GSP.


Assuntos
Glote , Acústica da Fala , Paralisia das Pregas Vocais , Qualidade da Voz , Humanos , Paralisia das Pregas Vocais/fisiopatologia , Paralisia das Pregas Vocais/diagnóstico , Masculino , Feminino , Pessoa de Meia-Idade , Adulto , Estudos Retrospectivos , Glote/fisiopatologia , Qualidade da Voz/fisiologia , Idoso , Medida da Produção da Fala/métodos , Adulto Jovem , Índice de Gravidade de Doença , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/fisiopatologia
18.
J Speech Lang Hear Res ; 67(5): 1370-1384, 2024 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-38619435

RESUMO

OBJECTIVES: The study aimed to investigate the predictive potential of language environment and vocal development status measures obtained through integrated analysis of Language ENvironment Analysis (LENA) recordings during the prelinguistic stage for subsequent speech and language development in Korean-acquiring children. Specifically, this study explored whether measures from both LENA-automated analysis and human coding at 6-8 months and 12-14 months of age predict vocabulary and phonological development at 18-20 months. METHOD: One-day home recordings from 20 children were collected using a LENA recorder at 6-8 months, 12-14 months, and 18-20 months. Both LENA-automated measures and measures from human coding were obtained from recordings at 6-8 months and 12-14 months. The number of different words, consonant inventory, and utterance structure inventory were identified from recordings of 18-20 months. Correlation and multiple regression analyses were performed to investigate whether measures related to early language environment and child vocalization at 6-8 months and 12-14 months were predictive of vocabulary and phonological measures at 18-20 months. RESULTS: The results showed that the two main LENA-automated measures, conversational turn count (CTC) and child vocalization count, were positively correlated with all vocabulary and phonological measures at 18-20 months. Multiple regression analysis revealed that CTC during the prelinguistic stages was the most significant predictor of a number of different words, consonant inventory, and utterance structure inventory at 18-20 months. Also, adult word count in LENA-automated measures, child-directed speech ratio, and canonical babbling ratio measured by human coding significantly predicted some vocabulary and phonological measures at 18-20 months. CONCLUSION: This study highlights the multifaceted nature of language acquisition and collectively emphasizes the value of considering both quantitative and qualitative aspects of language input to understand early language development in children.


Assuntos
Linguagem Infantil , Desenvolvimento da Linguagem , Fala , Vocabulário , Humanos , Masculino , Feminino , Lactente , Fala/fisiologia , Fonética , Medida da Produção da Fala/métodos
19.
Codas ; 36(3): e20230175, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38629682

RESUMO

PURPOSE: To assess the influence of the listener experience, measurement scales and the type of speech task on the auditory-perceptual evaluation of the overall severity (OS) of voice deviation and the predominant type of voice (rough, breathy or strain). METHODS: 22 listeners, divided into four groups participated in the study: speech-language pathologist specialized in voice (SLP-V), SLP non specialized in voice (SLP-NV), graduate students with auditory-perceptual analysis training (GS-T), and graduate students without auditory-perceptual analysis training (GS-U). The subjects rated the OS of voice deviation and the predominant type of voice of 44 voices by visual analog scale (VAS) and the numerical scale (score "G" from GRBAS), corresponding to six speech tasks such as sustained vowel /a/ and /ɛ/, sentences, number counting, running speech, and all five previous tasks together. RESULTS: Sentences obtained the best interrater reliability in each group, using both VAS and GRBAS. SLP-NV group demonstrated the best interrater reliability in OS judgment in different speech tasks using VAS or GRBAS. Sustained vowel (/a/ and /ɛ/) and running speech obtained the best interrater reliability among the groups of listeners in judging the predominant vocal quality. GS-T group got the best result of interrater reliability in judging the predominant vocal quality. CONCLUSION: The time of experience in the auditory-perceptual judgment of the voice, the type of training to which they were submitted, and the type of speech task influence the reliability of the auditory-perceptual evaluation of vocal quality.


Assuntos
Disfonia , Percepção da Fala , Humanos , Fala , Reprodutibilidade dos Testes , Medida da Produção da Fala , Variações Dependentes do Observador , Qualidade da Voz , Acústica da Fala
20.
Am J Speech Lang Pathol ; 33(4): 1811-1830, 2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-38625101

RESUMO

PURPOSE: Adults with aphasia gesture more than adults without aphasia. However, less is known about the role of gesture in different discourse contexts for individuals with different types of aphasia. In this study, we asked whether patterns of speech and gesture production of individuals with aphasia vary by aphasia and discourse type and also differ from the speech and gestures produced by adults without aphasia. METHOD: We compared the amount, diversity, and complexity of speech and gesture production in adults with anomic or Broca's aphasia and adults with no aphasia (n = 20/group) in their first- versus third-person narratives. RESULTS: Adults with Broca's aphasia showed the lowest performance in their amount, diversity, and complexity of speech production, followed by adults with anomic aphasia and adults without aphasia. This pattern was reversed for gesture production. Speech and gesture production also varied by discourse context. Adults with either type of aphasia used a lower amount of and less diverse speech in third-person than in first-person narratives; this pattern was also reversed for gesture production. CONCLUSIONS: Overall, our results provide evidence for a compensatory role of gesture in aphasia communication. Adults with Broca's aphasia, who showed the greatest speech production difficulties, also relied most on gesture, and this pattern was particularly pronounced in the third-person narrative context.


Assuntos
Afasia de Broca , Gestos , Humanos , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Afasia de Broca/diagnóstico , Fala , Adulto , Anomia/diagnóstico , Anomia/psicologia , Afasia/psicologia , Medida da Produção da Fala , Narração , Comunicação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...