Results 1 - 20 of 1,194
1.
J Acoust Soc Am ; 155(6): 3848-3860, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38884524

ABSTRACT

Accurately classifying accents and assessing accentedness in non-native speakers are challenging tasks, primarily because of the complexity and diversity of accent and dialect variations. In this study, embeddings from advanced pretrained language identification (LID) and speaker identification (SID) models are leveraged to improve the accuracy of accent classification and non-native accentedness assessment. Findings demonstrate that pretrained LID and SID models effectively encode accent/dialect information in speech. Furthermore, the accent information encoded by the LID and SID models complements an end-to-end (E2E) accent identification (AID) model trained from scratch. By incorporating all three embeddings, the proposed multi-embedding AID system achieves superior accuracy in AID. Next, the use of automatic speech recognition (ASR) and AID models for accentedness estimation is investigated. The ASR model is an E2E connectionist temporal classification model trained exclusively with American English (en-US) utterances. The ASR error rate and en-US output of the AID model are leveraged as objective accentedness scores. Evaluation results demonstrate a strong correlation between scores estimated by the two models. Additionally, a robust correlation between objective accentedness scores and subjective scores based on human perception is demonstrated, providing evidence for the reliability and validity of using AID-based and ASR-based systems for accentedness assessment in non-native speech. Such advanced systems would benefit accent assessment in language learning, speech and speaker assessment for intelligibility and quality, and advances in speaker diarization and speech recognition.
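
As an illustration of how an ASR error rate can serve as an objective accentedness score, the sketch below computes a word error rate against a reference transcript. The abstract does not state which error metric the study used, so word error rate, the function name, and the sample sentences are assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: a higher error rate from an en-US-only recognizer is read
    as stronger accentedness. This is a sketch, not the study's pipeline.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcript pair: one substitution and one deletion.
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

Because the score is normalized by reference length, utterances of different lengths can be compared on the same scale.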


Subjects
Speech Perception , Speech Recognition Software , Humans , Speech Perception/physiology , Speech Acoustics , Phonetics , Language , Speech Production Measurement/methods , Female , Male
2.
J Acoust Soc Am ; 155(6): 3877-3888, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38888391

ABSTRACT

The quality of speech input influences the efficiency of L1 and L2 acquisition. This study examined modifications in infant-directed speech (IDS) and foreigner-directed speech (FDS) in Standard Mandarin (a tonal language) and explored how IDS and FDS features were manifested in disyllabic words and a longer discourse. The study aimed to determine which characteristics of IDS and FDS were enhanced in comparison with adult-directed speech (ADS), and how IDS and FDS differed when measured in a common set of acoustic parameters. For words, it was found that tone-bearing vowel duration, mean and range of fundamental frequency (F0), and the lexical tone contours were enhanced in IDS and FDS relative to ADS, except for the dipping Tone 3, which exhibited an unexpected lowering in FDS but no modification in IDS when compared with ADS. For the discourse, different aspects of temporal and F0 enhancements were emphasized in IDS and FDS: the mean F0 was higher in IDS whereas the total discourse duration was greater in FDS. These findings add to the growing literature on L1 and L2 speech input characteristics and their role in language acquisition.


Subjects
Speech Acoustics , Humans , Female , Male , Infant , Adult , Phonetics , Speech Production Measurement/methods , Young Adult , Multilingualism , Voice Quality , Acoustics , Language , Time Factors , Speech Perception
3.
Lang Speech Hear Serv Sch ; 55(3): 838-852, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38748925

ABSTRACT

PURPOSE: Comprehensive spoken language assessment should include the evaluation of language use in naturalistic contexts. Discourse elicitation and analysis provides the opportunity for such an evaluation to occur. In this article, our overall aim was to describe adolescents' language performance on four elicitation tasks and determine if there are task-related differences across the elicitation tasks. METHOD: Forty-four typically developing adolescents with ages ranging from 12;2 to 17;11 (years;months; M = 15;2; 21 boys and 23 girls) participated in the study. They completed four spoken discourse tasks: (a) story generation using a wordless picture book, (b) fable retell, (c) six personal narratives in response to emotion-based prompts, and (d) monologic response to two stories that contained a moral dilemma. Responses were transcribed and analyzed for four language performance measures tapping into language productivity, syntactic complexity, lexical diversity, and verbal facility. RESULTS: Despite individual variability in performance, mean scores were close to median scores for most measures, suggesting a symmetrical distribution. As expected, all four language performance measures were significantly different across the four elicitation tasks. The personal narrative task elicited the longest samples, with the highest verbal fluency. In contrast, both lexical diversity and syntactic complexity were the strongest in response to the fable retell and the moral dilemma tasks. CONCLUSIONS: This investigation provides speech-language pathologists with an overview of how task-related factors may impact adolescent language performance. These findings may be used to support their clinical decision-making processes in choosing a suitable discourse task when conducting a comprehensive spoken language assessment. Three hypothetical case examples are used to illustrate the decision-making process. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25761768.


Subjects
Language Tests , Humans , Adolescent , Male , Female , Child , Narration , Language , Speech Production Measurement/methods
4.
Lang Speech Hear Serv Sch ; 55(3): 870-883, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38758707

ABSTRACT

PURPOSE: Transcription of conjoined independent clauses within language samples varies across professionals. Some transcribe these clauses as two separate utterances, whereas others conjoin them within a single utterance. As an inquiry into equitable practice, we examined rates of conjoined independent clauses produced by children and the impact of separating these clauses within utterances on measures of mean length of utterance (MLU) by a child's English dialect, clinical status, and age. METHOD: The data were archival and included 246 language samples from children classified by their dialect (African American English or Southern White English) and clinical status (developmental language disorder [DLD] or typically developing [TD]), with those in the TD group further classified by their age (4 years [TD4] or 6 years [TD6]). RESULTS: Rates of conjoined independent clauses and the impact of these clauses on MLU varied by clinical status (DLD < TD) and age (TD4 < TD6), but not by dialect. Correlations between the rate of conjoined clauses, MLU, and language test scores were also similar across the two dialects. CONCLUSIONS: Transcription decisions regarding conjoined independent clauses within language samples lead to equitable measurement outcomes across dialects of English. Nevertheless, transcribing conjoined independent clauses as two separate utterances reduces one's ability to detect syntactic differences between children with and without DLD and document syntactic growth as children age. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25822675.
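
To make the transcription choice concrete, the sketch below computes mean length of utterance for the same hypothetical sample transcribed two ways. It counts words rather than morphemes, which is a simplifying assumption; the abstract does not say which unit the study used, and the sample utterances are invented for illustration.

```python
def mean_length_of_utterance(utterances):
    """Average number of words per utterance (a word-based stand-in for MLU)."""
    lengths = [len(u.split()) for u in utterances if u.strip()]
    if not lengths:
        raise ValueError("no non-empty utterances")
    return sum(lengths) / len(lengths)


# Hypothetical sample: the second utterance conjoins two independent clauses.
joined = ["the dog ran", "he fell down and she helped him"]
# The same sample with the conjoined clauses transcribed as separate utterances.
separate = ["the dog ran", "he fell down", "and she helped him"]

mlu_joined = mean_length_of_utterance(joined)      # (3 + 7) / 2
mlu_separate = mean_length_of_utterance(separate)  # (3 + 3 + 4) / 3
```

Splitting the conjoined clauses lowers the average, which is consistent with the abstract's point that the transcription decision changes what MLU can detect.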


Subjects
Child Language , Language Development Disorders , Humans , Child , Child, Preschool , Male , Female , Language , Speech Production Measurement/methods , Language Tests , Black or African American
5.
J Speech Lang Hear Res ; 67(6): 1712-1730, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38749007

ABSTRACT

PURPOSE: The goal of this study was to assess various recording methods, including combinations of high- versus low-cost microphones, recording interfaces, and smartphones in terms of their ability to produce commonly used time- and spectral-based voice measurements. METHOD: Twenty-four vowel samples representing a diversity of voice quality deviations and severities from a wide age range of male and female speakers were played via a head-and-thorax model and recorded using a high-cost, research standard GRAS 40AF (GRAS Sound & Vibration) microphone and amplification system. Additional recordings were made using various combinations of headset microphones (AKG C555 L [AKG Acoustics GmbH], Shure SM35-XLR [Shure Incorporated], AVID AE-36 [AVID Products, Inc.]) and audio interfaces (Focusrite Scarlett 2i2 [Focusrite Audio Engineering Ltd.] and PC, Focusrite and smartphone, smartphone via a TRRS adapter), as well as smartphones direct (Apple iPhone 13 Pro, Google Pixel 6) using their built-in microphones. The effect of background noise from four different room conditions was also evaluated. Vowel samples were analyzed for measures of fundamental frequency, perturbation, cepstral peak prominence, and spectral tilt (low vs. high spectral ratio). RESULTS: Results show that a wide variety of recording methods, including smartphones with and without a low-cost headset microphone, can effectively track the wide range of acoustic characteristics in a diverse set of typical and disordered voice samples. Although significant differences in acoustic measures of voice may be observed, the presence of extremely strong correlations (rs > .90) with the recording standard implies a strong linear relationship between the results of different methods that may be used to predict and adjust any observed differences in measurement results. CONCLUSION: Because handheld smartphone distance and positioning may be highly variable when used in actual clinical recording situations, smartphone + a low-cost headset microphone is recommended as an affordable recording method that controls mouth-to-microphone distance and positioning and allows both hands to be available for manipulation of the smartphone device.


Subjects
Smartphone , Speech Acoustics , Humans , Female , Male , Adult , Young Adult , Speech Production Measurement/instrumentation , Speech Production Measurement/methods , Reproducibility of Results , Voice Quality , Middle Aged , Adolescent
6.
J Speech Lang Hear Res ; 67(6): 1660-1681, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38758676

ABSTRACT

PURPOSE: Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD: In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS: A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS: The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.


Subjects
Dysphonia , Speech Acoustics , Humans , Female , Adult , Dysphonia/physiopathology , Dysphonia/therapy , Retrospective Studies , Young Adult , Middle Aged , Adolescent , Voice Quality/physiology , Electrodiagnosis/methods , Glottis/physiopathology , Phonation/physiology , Vocal Cords/physiopathology , Voice Training , Speech Production Measurement/methods
7.
J Acoust Soc Am ; 155(4): 2836-2848, 2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38682915

ABSTRACT

This paper evaluates an innovative framework for spoken dialect density prediction on children's and adults' African American English. A speaker's dialect density is defined as the frequency with which dialect-specific language characteristics occur in their speech. Rather than treating the presence or absence of a target dialect in a user's speech as a binary decision, a classifier is trained to predict the level of dialect density, providing a higher degree of specificity in downstream tasks. To this end, self-supervised learning representations from HuBERT, handcrafted grammar-based features extracted from ASR transcripts, prosodic features, and other feature sets are evaluated as inputs to an XGBoost classifier. The classifier is then trained to assign dialect density labels to short recorded utterances. High dialect density level classification accuracy is achieved for child and adult speech, and performance is robust across age and regional varieties of dialect. Additionally, this work is used as a basis for analyzing which acoustic and grammatical cues affect machine perception of dialect.
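
The definition of dialect density as a frequency can be sketched as a simple ratio followed by a coarse label. Normalizing by word count, the three-level labeling, and the threshold values are all assumptions made for this illustration; the abstract does not specify the normalization unit or the label boundaries the study used.

```python
def dialect_density(feature_count: int, word_count: int) -> float:
    """Dialect-specific features per word (assumed normalization)."""
    if word_count <= 0:
        raise ValueError("word_count must be positive")
    if feature_count < 0:
        raise ValueError("feature_count must be non-negative")
    return feature_count / word_count


def density_level(density: float, thresholds=(0.05, 0.15)) -> str:
    """Map a density to a coarse label; the thresholds are illustrative only."""
    low, high = thresholds
    if density < low:
        return "low"
    if density < high:
        return "medium"
    return "high"


# Hypothetical utterance: 3 tagged dialect features in 30 words.
level = density_level(dialect_density(3, 30))
```

Predicting a level like this, instead of a yes/no dialect decision, is the kind of graded target the abstract describes.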


Subjects
Black or African American , Speech Acoustics , Humans , Adult , Child , Male , Female , Speech Production Measurement/methods , Language , Child, Preschool , Young Adult , Speech Perception , Adolescent , Phonetics , Child Language
8.
J Speech Lang Hear Res ; 67(6): 1682-1711, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38662942

ABSTRACT

PURPOSE: Pitch variations (tone productions) have been reported as a measure to differentiate Cantonese-speaking children with and without childhood apraxia of speech (CAS). This study aims to examine fundamental frequency (F0) changes within syllables and the effects of syllable structure, lexical status, and syllable positions on F0 in Cantonese-speaking preschool children with and without CAS. METHOD: Six children with CAS, six children with non-CAS speech sound disorder plus language disorder (S&LD), 22 children with speech sound disorder only (SSD), and 63 children with typical speech-language development (TD) performed the tone sequencing task (TST). Growth curve analysis was employed to analyze and compare the F0 values within syllables with three Cantonese tones (high level, high rising, and low falling). The analysis considered the effects of syllable structure (vowel and consonant-vowel), lexical status (word and nonword), and syllable position (initial, medial, and final) on F0, as well as comparisons within and between groups. RESULTS: Within each group, the effects of syllable structure and position on F0 values were found with different patterns. Between-group comparisons showed that the CAS group had reduced F0 contrasts. The CAS group could be differentiated from the control groups based on interactions of F0 with syllable structure and position, but not lexical status. The dissimilarity of F0 values detected between the CAS and SSD/TD groups was more prominent than that observed between the CAS and S&LD groups. CONCLUSIONS: This study demonstrated that Cantonese-speaking children with CAS had difficulty in varying F0 within syllables as compared to those without CAS, suggesting pitch variation difficulty and language-specific impairment profiles in CAS. Future investigations of objective measures for identifying Cantonese speakers with CAS and cross-linguistic investigations using growth curve analysis and the TST are suggested.


Subjects
Apraxias , Phonetics , Humans , Child, Preschool , Apraxias/diagnosis , Male , Female , Speech Acoustics , Speech Sound Disorder/diagnosis , Speech Production Measurement/methods , Language , Speech/physiology
9.
J Speech Lang Hear Res ; 67(6): 1643-1659, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38683058

ABSTRACT

PURPOSE: The aim of this study was to determine (a) diagnostic accuracy of acoustic measures of glottal stop production (GSP; intensity differences, slopes, complete voicing cessation) to distinguish between unilateral vocal fold paresis/paralysis (UVFP) patients and controls; (b) if acoustic measures of GSP significantly correlated with an acoustic measure of voice disorder severity, acoustic voice quality index (AVQI); and (c) if acoustic measures from another type of voicing cessation, voiceless consonant production, also significantly differed between groups. METHOD: Ninety-seven patients with unilateral paresis/paralysis and 35 controls with normal laryngostroboscopic signs produced two sets of five repeated [i] and four repeated [isi]. Tokens were randomized by type between groups and analyzed blinded using a customized Praat program that computed intensity differences and slopes between vowel maxima and glottal stop minima for inter-[i] tokens and vowel maxima and voiceless consonant minima for intra-[isi] tokens. The number of voicing cessations for inter-[i] tokens was obtained. RESULTS: Onset and offset intensity differences and number of voicing cessations from inter-[i] tokens had the greatest areas under the curve (.854, .856, and .835, respectively). Correlation coefficients were significant (p < .01) between AVQI and all GSP acoustic measures with weak/medium effect sizes. No significant differences were found between controls and participants with UVFP for acoustic measures from intra-[isi]. CONCLUSIONS: Acoustic GSP measures demonstrated good diagnostic accuracy and some relationship to severity of voice disorder. No significant differences in acoustic measures for medial voiceless fricative consonants between controls and participants with UVFP suggested that voicing cessation for voiceless fricatives differs from voicing cessation for GSP.
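
The area under the ROC curve reported above can be read as the probability that a randomly chosen patient scores higher on a measure than a randomly chosen control. The sketch below computes it that way, by pairwise comparison. The function name and the sample values are hypothetical, and it assumes higher scores indicate the patient group; for a measure where patients score lower, swap the two arguments.

```python
def roc_auc(patient_scores, control_scores):
    """AUC as the share of patient/control pairs ranked correctly.

    Ties count as half a correct ranking. Equivalent to the
    Mann-Whitney U statistic divided by the number of pairs.
    """
    if not patient_scores or not control_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for p in patient_scores:
        for c in control_scores:
            if p > c:
                wins += 1.0
            elif p == c:
                wins += 0.5
    return wins / (len(patient_scores) * len(control_scores))


# Hypothetical scores: one patient ties with one control.
auc = roc_auc([2, 3], [1, 3])
```

An AUC of 0.5 means the measure does not separate the groups at all, and 1.0 means every patient outranks every control.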


Subjects
Glottis , Speech Acoustics , Vocal Cord Paralysis , Voice Quality , Humans , Vocal Cord Paralysis/physiopathology , Vocal Cord Paralysis/diagnosis , Male , Female , Middle Aged , Adult , Retrospective Studies , Glottis/physiopathology , Voice Quality/physiology , Aged , Speech Production Measurement/methods , Young Adult , Severity of Illness Index , Voice Disorders/diagnosis , Voice Disorders/physiopathology
10.
J Speech Lang Hear Res ; 67(5): 1370-1384, 2024 May 07.
Article in English | MEDLINE | ID: mdl-38619435

ABSTRACT

OBJECTIVES: The study aimed to investigate the predictive potential of language environment and vocal development status measures obtained through integrated analysis of Language ENvironment Analysis (LENA) recordings during the prelinguistic stage for subsequent speech and language development in Korean-acquiring children. Specifically, this study explored whether measures from both LENA-automated analysis and human coding at 6-8 months and 12-14 months of age predict vocabulary and phonological development at 18-20 months. METHOD: One-day home recordings from 20 children were collected using a LENA recorder at 6-8 months, 12-14 months, and 18-20 months. Both LENA-automated measures and measures from human coding were obtained from recordings at 6-8 months and 12-14 months. The number of different words, consonant inventory, and utterance structure inventory were identified from recordings at 18-20 months. Correlation and multiple regression analyses were performed to investigate whether measures related to early language environment and child vocalization at 6-8 months and 12-14 months were predictive of vocabulary and phonological measures at 18-20 months. RESULTS: The results showed that the two main LENA-automated measures, conversational turn count (CTC) and child vocalization count, were positively correlated with all vocabulary and phonological measures at 18-20 months. Multiple regression analysis revealed that CTC during the prelinguistic stages was the most significant predictor of the number of different words, consonant inventory, and utterance structure inventory at 18-20 months. Also, adult word count in LENA-automated measures, child-directed speech ratio, and canonical babbling ratio measured by human coding significantly predicted some vocabulary and phonological measures at 18-20 months. CONCLUSION: This study highlights the multifaceted nature of language acquisition and collectively emphasizes the value of considering both quantitative and qualitative aspects of language input to understand early language development in children.


Subjects
Child Language , Language Development , Speech , Vocabulary , Humans , Male , Female , Infant , Speech/physiology , Phonetics , Speech Production Measurement/methods
11.
J Speech Lang Hear Res ; 67(2): 384-399, 2024 Feb 12.
Article in English | MEDLINE | ID: mdl-38289853

ABSTRACT

PURPOSE: The purpose of this study was to quantify sentence-level articulatory kinematics in individuals treated for oral squamous cell carcinoma (ITOC) compared to control speakers while also assessing the effect of treatment site (jaw vs. tongue). Furthermore, this study aimed to assess the relation between articulatory-kinematic measures and self-reported speech problems. METHOD: Articulatory-kinematic data from the tongue tip, tongue back, and jaw were collected using electromagnetic articulography in nine Dutch ITOC and eight control speakers. To quantify articulatory kinematics, the two-dimensional articulatory working space (AWS; in mm²), one-dimensional anteroposterior range of motion (AP-ROM; in mm), and superior-inferior range of motion (SI-ROM; in mm) were calculated and examined. Self-reported speech problems were assessed with the Speech Handicap Index (SHI). RESULTS: Compared to a sex-matched control group, ITOC showed significantly smaller AWS, AP-ROM, and SI-ROM for both the tongue tip and tongue back sensor, but no significant differences were observed for the jaw sensor. This pattern was found for both individuals treated for tongue and jaw tumors. Moderate nonsignificant correlations were found between the SHI and the AWS of the tongue back and jaw sensors. CONCLUSIONS: Despite large individual variation, ITOC showed reduced one- and two-dimensional tongue, but not jaw, movements compared to control speakers, and treatment for tongue and jaw tumors resulted in smaller tongue movements. A larger sample size is needed to establish a more generalizable connection between the AWS and the SHI. Further research should explore how these kinematic changes in ITOC are related to acoustic and perceptual measures of speech.
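
The three kinematic measures can be sketched from a list of two-dimensional sensor positions. The abstract does not say how the articulatory working space was computed, so using the area of the convex hull is an assumption, as are the axis convention (x as anteroposterior, y as superior-inferior), the function names, and the sample coordinates.

```python
def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        # Positive when the turn o -> a -> b is counter-clockwise.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def working_space_area(points):
    """Shoelace area of the convex hull, in squared input units (e.g. mm²)."""
    hull = convex_hull(points)
    n = len(hull)
    if n < 3:
        return 0.0
    twice_area = sum(
        hull[i][0] * hull[(i + 1) % n][1] - hull[(i + 1) % n][0] * hull[i][1]
        for i in range(n)
    )
    return abs(twice_area) / 2.0


def range_of_motion(values):
    """One-dimensional range: maximum minus minimum position."""
    return max(values) - min(values)


# Hypothetical tongue-tip positions in mm: (anteroposterior, superior-inferior).
positions = [(0, 0), (4, 0), (4, 3), (0, 3), (2, 1)]
aws = working_space_area(positions)
ap_rom = range_of_motion([x for x, _ in positions])
si_rom = range_of_motion([y for _, y in positions])
```

The interior point (2, 1) does not change any of the three values, which shows that all of them depend only on the extremes of the movement.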


Subjects
Carcinoma, Squamous Cell , Jaw Neoplasms , Mouth Neoplasms , Humans , Speech Intelligibility , Speech Production Measurement/methods , Mouth Neoplasms/surgery , Speech Acoustics , Speech , Tongue/surgery , Biomechanical Phenomena , Electromagnetic Phenomena , Jaw
12.
Clin Linguist Phon ; 38(2): 97-115, 2024 Mar.
Article in English | MEDLINE | ID: mdl-36592050

ABSTRACT

This study examined the possibility of using acoustic parameters, i.e., Acoustic Voice Quality Index (AVQI) and Maximum Phonation Time (MPT), for predicting the degree of lung involvement in COVID-19 patients. This cross-sectional case-control study was conducted on the voice samples collected from 163 healthy individuals and 181 patients with COVID-19. Each participant produced a sustained vowel /a/ and a phonetically balanced Persian text containing 36 syllables. AVQI and MPT were measured using Praat scripts. Each patient underwent a non-enhanced chest computed tomographic scan, and the Total Opacity (TO) score was rated to assess the degree of lung involvement. The results revealed significant differences between patients with COVID-19 and healthy individuals in terms of AVQI and MPT. A significant difference was also observed between male and female participants in AVQI and MPT. The results from the receiver operating characteristic curve analysis and area under the curve indicated that MPT (0.909) had higher diagnostic accuracy than AVQI (0.771). A significant relationship was observed between AVQI and TO scores. In the case of MPT, however, no such relationship was observed. The findings indicated that MPT was a better classifier in differentiating patients from healthy individuals, in comparison with AVQI. The results also showed that AVQI can be used as a predictor of the degree of patients' and recovered individuals' lung involvement. A formula is suggested for calculating the degree of lung involvement using AVQI.


Subjects
COVID-19 , Dysphonia , Humans , Male , Female , Dysphonia/diagnosis , Speech Acoustics , Case-Control Studies , Feasibility Studies , Cross-Sectional Studies , Reproducibility of Results , Severity of Illness Index , Acoustics , Tomography , Speech Production Measurement/methods
13.
Int J Speech Lang Pathol ; 26(2): 267-277, 2024 Apr.
Article in English | MEDLINE | ID: mdl-37272348

ABSTRACT

PURPOSE: The primary objective of this study was to determine if speech and pause measures obtained using a passage reading task and timing measures from a monosyllabic diadochokinesis (DDK) task differ across speakers of Canadian French diagnosed with amyotrophic lateral sclerosis (ALS) presenting with and without bulbar symptoms, and healthy controls. The secondary objective was to determine if these measures can reflect the severity of bulbar symptoms. METHOD: A total of 29 Canadian French speakers with ALS (classified as bulbar symptomatic [n = 14] or pre-symptomatic [n = 15]) and 17 age-matched healthy controls completed a passage reading task and a monosyllabic DDK task (/pa/ and /ta/), for up to three follow-up visits. Measures of speaking rate, total duration, speech duration, and pause events were extracted from the passage reading recordings using a semi-automated speech and pause analysis procedure. Manual analysis of DDK recordings provided measures of DDK rate and variability. RESULT: Group comparisons revealed significant differences (p < .05) between the symptomatic group and the pre-symptomatic and control groups for all passage measures and DDK rates. Only the DDK rate in /ta/ differentiated the pre-symptomatic and control groups. Repeated measures correlations revealed moderate correlations (r_rm > 0.40; p < 0.05) between passage measures of total duration, speaking rate, speech duration, and number of pauses, and ALSFRS-R total and bulbar scores, as well as between DDK rate and ALSFRS-R total score. CONCLUSION: Speech and pause measures in passage and timing measures in monosyllabic DDK tasks might be suitable for monitoring bulbar functional symptoms in French speakers with ALS, but more work is required to identify which measures are sensitive to the earliest stages of the disease.
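
A DDK rate and a variability measure can be sketched from manually marked syllable onset times. The abstract does not define either measure, so expressing rate as syllable cycles per second and variability as the coefficient of variation of the inter-onset intervals are assumptions, as are the function name and the onset times.

```python
from statistics import mean, stdev


def ddk_measures(onsets_s):
    """Return (rate, variability) from syllable onset times in seconds.

    rate: inter-onset intervals per second (assumed definition of DDK rate).
    variability: coefficient of variation of those intervals (assumed).
    """
    if len(onsets_s) < 3:
        raise ValueError("need at least three onsets")
    intervals = [later - earlier for earlier, later in zip(onsets_s, onsets_s[1:])]
    if any(interval <= 0 for interval in intervals):
        raise ValueError("onsets must be strictly increasing")
    rate = len(intervals) / (onsets_s[-1] - onsets_s[0])
    variability = stdev(intervals) / mean(intervals)
    return rate, variability


# Hypothetical /pa/ repetitions: perfectly regular, then uneven timing.
regular_rate, regular_cv = ddk_measures([0.0, 0.25, 0.5, 0.75, 1.0])
uneven_rate, uneven_cv = ddk_measures([0.0, 0.2, 0.5, 0.7, 1.0])
```

Both hypothetical sequences have the same rate, so the variability term is what separates regular from uneven repetition in this example.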


Subjects
Amyotrophic Lateral Sclerosis , Speech , Humans , Amyotrophic Lateral Sclerosis/complications , Canada , Speech Production Measurement/methods , Language
14.
Am J Speech Lang Pathol ; 33(1): 393-405, 2024 Jan 03.
Article in English | MEDLINE | ID: mdl-38060689

ABSTRACT

PURPOSE: Variability in auditory-perceptual ratings of voice limits their utility, with the poorest reliability often noted for vocal strain. The purpose of this study was to determine whether an experimental method, called visual sort and rate (VSR), promoted stronger rater reliability than visual analog scale (VAS), for ratings of strain in two clinical populations: adductor laryngeal dystonia (ADLD) and vocal hyperfunction (VH). METHOD: Connected speech samples from speakers with ADLD and VH as well as age- and sex-matched controls were selected from a database. Fifteen inexperienced listeners rated strain for two speaker sets (25 ADLD speakers and five controls; 25 VH speakers and five controls) across four rating blocks: VAS-ADLD, VSR-ADLD, VAS-VH, and VSR-VH. For the VAS task, listeners rated each speaker for strain using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of severities in each set. Listeners sorted and ranked samples for strain within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Intrarater reliability (Pearson's r) and interrater variability (mean of the squared differences between a listener's ratings and group mean ratings) were compared across rating methods and populations using two repeated-measures analyses of variance. RESULTS: Intrarater reliability of strain was significantly stronger when listeners used VSR compared to VAS; listeners also showed significantly better intrarater reliability in ADLD than VH. Listeners demonstrated significantly less interrater variability (better reliability) when using VSR compared to VAS. No significant effect of population or interactions was found between listeners for measures of interrater variability. CONCLUSIONS: VSR increases intrarater reliability for ratings of vocal strain in speakers with VH and ADLD. VSR decreases variability of auditory-perceptual judgments of strain between inexperienced listeners in these clinical populations. Future research should determine whether benefits of VSR extend to voice clinicians and/or clinical settings.
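
The two reliability measures named in the method can be sketched directly from their definitions: Pearson's r between a listener's first and repeated ratings, and the mean of the squared differences between a listener's ratings and the group mean ratings. The function names and the ratings (on a 0-100 mm scale) are hypothetical.

```python
from statistics import mean


def pearson_r(first, second):
    """Pearson correlation between two equally long rating lists."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two lists of equal length with at least two ratings")
    mean_first, mean_second = mean(first), mean(second)
    covariance = sum((a - mean_first) * (b - mean_second) for a, b in zip(first, second))
    spread_first = sum((a - mean_first) ** 2 for a in first)
    spread_second = sum((b - mean_second) ** 2 for b in second)
    if spread_first == 0 or spread_second == 0:
        raise ValueError("ratings must vary for r to be defined")
    return covariance / (spread_first * spread_second) ** 0.5


def interrater_variability(listener_ratings, group_mean_ratings):
    """Mean squared difference between one listener and the group means."""
    if len(listener_ratings) != len(group_mean_ratings) or not listener_ratings:
        raise ValueError("need two non-empty lists of equal length")
    return mean([(l - g) ** 2 for l, g in zip(listener_ratings, group_mean_ratings)])


# Hypothetical ratings in mm for three speakers.
intrarater = pearson_r([10, 20, 30], [12, 22, 32])
variability = interrater_variability([10, 50, 90], [20, 50, 80])
```

Note the opposite directions of the two measures: a higher r means better intrarater reliability, while a lower variability value means closer agreement with the group.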


Subjects
Dysphonia , Speech Perception , Voice , Humans , Voice Quality , Judgment , Reproducibility of Results , Speech Production Measurement/methods
15.
Laryngoscope ; 134(5): 2272-2276, 2024 May.
Article in English | MEDLINE | ID: mdl-37942827

ABSTRACT

INTRODUCTION: Auditory-perceptual evaluations of dysphonia, though essential for comprehensive voice evaluation, are subject to listener bias. Knowledge of an underlying voice disorder can influence auditory-perceptual ratings. Accented speech results in increased listener effort and delays in word identification. Yet, little is known about the impact of foreign language accents on auditory-perceptual ratings for dysphonic speakers. The purpose of this work was to determine the impact of a foreign language accent on experts' auditory-perceptual ratings of dysphonic speakers. METHODS: Twelve voice-specializing SLPs who spoke with a General American English (GAE) accent rated vocal percepts of 28 speakers with a foreign language accent and 28 with a GAE accent, all of whom had been diagnosed with a voice disorder. Speaker groups were matched based on sex, age, and mean smoothed cepstral peak prominence. Four linear mixed-effects models assessed the impact of a foreign language accent on expert auditory-perceptual ratings of the overall severity of dysphonia, roughness, breathiness, and strain. RESULTS: The twelve raters demonstrated good inter- and intra-rater reliability (ICC[3, k] = .89; mean ICC = .89). The linear mixed-effects models revealed no significant impact of foreign language accent on ratings of overall severity of dysphonia, roughness, breathiness, or strain. CONCLUSION: Despite the possibility of increased listener effort and bias, foreign language accent incongruence had no effect on expert listeners' auditory-perceptual evaluations for dysphonic speakers. Findings support the use of auditory-perceptual evaluations for voice disorders across sociolinguistically diverse populations. LEVEL OF EVIDENCE: 3 Laryngoscope, 134:2272-2276, 2024.


Subjects
Dysphonia , Speech Perception , Humans , Dysphonia/diagnosis , Reproducibility of Results , Language , Speech , Speech Acoustics , Speech Production Measurement/methods
16.
J Speech Lang Hear Res ; 67(1): 116-125, 2024 Jan 08.
Article in English | MEDLINE | ID: mdl-37992413

ABSTRACT

PURPOSE: This study compared the interrater reliability of adult naïve listeners' perceptual assessments of different speech variables in children with a cleft palate with or without a cleft lip (CP ± L). In addition, the study investigated whether the listeners were able to perceive differences in these speech variables before and after speech therapy for cleft palate speech disorders. METHOD: Thirty-four speech samples of 14 children with a CP ± L (14 samples collected immediately before 10 hr of speech intervention, 14 samples collected immediately after speech intervention, and six randomly selected samples that were duplicated to assess intrarater reliability) were perceptually assessed by 26 adult naïve listeners. The listening panel consisted of nine men and 17 women (age range: 18-51 years). The speech variables included speech understandability, speech acceptability, hypernasality, hyponasality, nasal airflow, and articulation, which were assessed on a visual analog scale. Furthermore, the need for speech therapy was assessed. RESULTS: Good to very good interrater reliability was observed for the naïve listeners' ratings of all speech variables. A significant effect of time (pre- versus postintervention) was found for the speech variables "speech understandability," "speech acceptability," "nasal airflow," and "articulation." This time effect indicates an improvement of these variables postintervention. According to the naïve listeners, children were less in need of additional speech therapy after the 10-hr intervention period than before it. CONCLUSIONS: Adult naïve listeners perceptually identified an improvement in different speech variables after 10 hr of cleft palate speech therapy. These findings confirm previous assessments of expert speech-language pathologists and suggest that speech improvements after cleft palate speech therapy can also be perceived by communication partners outside the therapy room. Perceptual ratings of naïve listeners can, thus, be used to add life-situation significance to the assessments of experts. Future research could include both expert raters and caregivers or relatives of children with a CP ± L in listening panels, as previous knowledge of craniofacial anomalies may lead to different results.


Subjects
Cleft Lip , Cleft Palate , Voice Disorders , Male , Adult , Child , Humans , Female , Adolescent , Young Adult , Middle Aged , Cleft Palate/complications , Cleft Palate/therapy , Speech , Reproducibility of Results , Speech Production Measurement/methods , Speech Disorders/etiology , Speech Disorders/therapy , Cleft Lip/complications , Cleft Lip/therapy
17.
Vestn Otorinolaringol ; 88(5): 23-26, 2023.
Article in Russian | MEDLINE | ID: mdl-37970766

ABSTRACT

To evaluate the effectiveness of treatment in patients with functional dysphonia, the Cepstral Peak Prominence (CPP) test was used. Twenty dysphonic women aged 18 to 47 years were under observation. The control group consisted of 20 healthy women of similar age. Patients underwent 5-7 sessions of electrostimulation of the laryngeal muscles and phonopedic treatment, after which a complete restoration of the voice was noted. The Praat program was used, installed on a Hewlett-Packard 630 laptop (Pentium B960, 2.2 GHz), together with a SHURE SM94 condenser microphone. In the control group, the results were as follows: M=7.49 (SD=1.26) dB. In the main group before treatment: M=5.00 (SD=1.07) dB; after treatment: M=7.95 (SD=1.34) dB. Differences in CPP values in the main group before and after treatment (5.00 dB and 7.95 dB, respectively) were significant at p<0.0001. Differences in CPP values between the main group before treatment (5.00 dB) and the control group (7.49 dB) were significant at p<0.0001. Differences in CPP values between the main group after treatment (7.95 dB) and the control group (7.49 dB) were not significant (p>0.05). The study showed high sensitivity of the method. The CPP values after treatment were higher than those before treatment and did not differ from the control values. It is concluded that CPP is a highly sensitive method for evaluating the degree of periodicity of an acoustic signal and can be used to evaluate the effectiveness of treatment in patients with functional dysphonia.


Subjects
Dysphonia , Voice , Humans , Female , Dysphonia/diagnosis , Dysphonia/therapy , Speech Acoustics , Speech Production Measurement/methods , Acoustics
18.
J Fluency Disord ; 78: 106016, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37852018

ABSTRACT

PURPOSE: Previous work shows that linguistic features (e.g., word length, word frequency) impact the predictability of stuttering events. Most of this work has been conducted using reading tasks. Our study examined how linguistic features impact the predictability of stuttering events during spontaneous speech. METHODS: The data were sourced from the FluencyBank database and consisted of interviews with 35 adult stutterers (27,009 words). Three logistic regression mixed models were fit as the primary analyses: one model with four features (i.e., initial phoneme, grammatical function, word length, and word position within a sentence), a second model with six features (i.e., the features from the previous model plus word frequency and neighborhood density), and a third model with nine features (i.e., the features from the previous model plus bigram frequency, word concreteness, and typical age of word acquisition). We compared our models using the Area Under the Curve statistic. RESULTS: The four-feature model revealed that initial phoneme, grammatical function, and word length were predictive of stuttering events. The six-feature model revealed that initial phoneme, word length, word frequency, and neighborhood density were predictive of stuttering events. The nine-feature model was not more predictive than the six-feature model. CONCLUSION: Linguistic features that were previously found to be predictive of stuttering during reading were predictive of stuttering during spontaneous speech. The results indicate the influence of linguistic processes on the predictability of stuttering events such that words associated with increased planning demands (e.g., longer words, low frequency words) were more likely to be stuttered.


Subjects
Speech , Stuttering , Adult , Humans , Stuttering/diagnosis , Speech Production Measurement/methods , Linguistics/methods , Language
19.
Codas ; 35(6): e20220209, 2023.
Article in Portuguese, English | MEDLINE | ID: mdl-37820100

ABSTRACT

PURPOSE: To identify a set of requirements for the development of an auditory-perceptual training (APT) simulator based on the experience of professors who provide APT. METHODS: This is a cross-sectional, descriptive study with a quantitative approach. Twenty-two professors answered an online questionnaire containing 31 items related to APT, covering the professional profile, conditions for APT in undergraduate and postgraduate courses in Speech Therapy, APT structure, and evaluation of the APT effect. RESULTS: APT procedures performed in Brazil were found to vary. The main requirements indicated by the respondents for APT involve the use of synthesized voices in the initial stages, followed by human voices later; the use of speech tasks with sustained vowels and connected speech; the inclusion of complementary information such as the gender, age, and profession of the speaker and the spectrography of the vocal signal; training with a minimum duration of six hours; the evaluation of the training effect by comparing intra- and inter-judge agreement before and after training; the addition of the parameters of general degree of vocal deviation, roughness, breathiness, and strain; the use of validated continuous and numerical scales; and offering the training from the second year of the undergraduate program onward. CONCLUSION: Although the experts' responses varied, a minimum set of requirements for performing APT with new judges was identified.




Subjects
Dysphonia , Speech Perception , Humans , Speech Acoustics , Judgment , Cross-Sectional Studies , Voice Quality , Speech Production Measurement/methods , Reproducibility of Results , Observer Variation
20.
Codas ; 35(5): e20220009, 2023.
Article in Portuguese, English | MEDLINE | ID: mdl-37792751

ABSTRACT

PURPOSE: To describe the fluency profile in terms of the typology of disfluencies, speech rate, and frequency of disruptions in spontaneous speech, reading, and retelling; and to compare the fluency profile of adults who stutter across spontaneous speech, reading, and text retelling. METHODS: This is a cross-sectional comparative study with a sample of 15 adults who stutter, of both sexes, with higher education or the equivalent of completed elementary school II (ensino fundamental II). Samples were collected in spontaneous speech, reading, and text retelling tasks through video calls made individually with the participants. The first 200 syllables produced in each task were transcribed and analyzed according to the Fluency Profile Assessment Protocol (FPAP). The study compared the frequency of common and stuttering disfluencies and the speech rate across the tasks. The Kruskal-Wallis test was used together with Duncan's multiple comparisons test to compare the medians and verify possible differences between the tasks, with a significance level of 5%. RESULTS: The reading task showed fewer common disfluencies and a lower percentage of speech discontinuity than the spontaneous speech and retelling tasks. No statistically significant differences in stuttering disfluencies were found among the three tasks. CONCLUSION: This study showed differences in the occurrence of common disfluencies (hesitations, interjections, and revisions) and in the percentage of speech discontinuity during oral reading by adults who stutter compared with spontaneous speech and text retelling.




Subjects
Speech , Stuttering , Male , Female , Adult , Humans , Reading , Cross-Sectional Studies , Speech Production Measurement/methods