Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 1.186
Filtrar
1.
J Acoust Soc Am ; 155(4): 2836-2848, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38682915

RESUMO

This paper evaluates an innovative framework for spoken dialect density prediction on children's and adults' African American English. A speaker's dialect density is defined as the frequency with which dialect-specific language characteristics occur in their speech. Rather than treating the presence or absence of a target dialect in a user's speech as a binary decision, instead, a classifier is trained to predict the level of dialect density to provide a higher degree of specificity in downstream tasks. For this, self-supervised learning representations from HuBERT, handcrafted grammar-based features extracted from ASR transcripts, prosodic features, and other feature sets are experimented with as the input to an XGBoost classifier. Then, the classifier is trained to assign dialect density labels to short recorded utterances. High dialect density level classification accuracy is achieved for child and adult speech and demonstrated robust performance across age and regional varieties of dialect. Additionally, this work is used as a basis for analyzing which acoustic and grammatical cues affect machine perception of dialect.


Assuntos
Negro ou Afro-Americano , Acústica da Fala , Humanos , Adulto , Criança , Masculino , Feminino , Medida da Produção da Fala/métodos , Idioma , Pré-Escolar , Adulto Jovem , Percepção da Fala , Adolescente , Fonética , Linguagem Infantil
2.
J Speech Lang Hear Res ; 67(5): 1370-1384, 2024 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-38619435

RESUMO

OBJECTIVES: The study aimed to investigate the predictive potential of language environment and vocal development status measures obtained through integrated analysis of Language ENvironment Analysis (LENA) recordings during the prelinguistic stage for subsequent speech and language development in Korean-acquiring children. Specifically, this study explored whether measures from both LENA-automated analysis and human coding at 6-8 months and 12-14 months of age predict vocabulary and phonological development at 18-20 months. METHOD: One-day home recordings from 20 children were collected using a LENA recorder at 6-8 months, 12-14 months, and 18-20 months. Both LENA-automated measures and measures from human coding were obtained from recordings at 6-8 months and 12-14 months. The number of different words, consonant inventory, and utterance structure inventory were identified from recordings of 18-20 months. Correlation and multiple regression analyses were performed to investigate whether measures related to early language environment and child vocalization at 6-8 months and 12-14 months were predictive of vocabulary and phonological measures at 18-20 months. RESULTS: The results showed that the two main LENA-automated measures, conversational turn count (CTC) and child vocalization count, were positively correlated with all vocabulary and phonological measures at 18-20 months. Multiple regression analysis revealed that CTC during the prelinguistic stages was the most significant predictor of a number of different words, consonant inventory, and utterance structure inventory at 18-20 months. Also, adult word count in LENA-automated measures, child-directed speech ratio, and canonical babbling ratio measured by human coding significantly predicted some vocabulary and phonological measures at 18-20 months. CONCLUSION: This study highlights the multifaceted nature of language acquisition and collectively emphasizes the value of considering both quantitative and qualitative aspects of language input to understand early language development in children.


Assuntos
Linguagem Infantil , Desenvolvimento da Linguagem , Fala , Vocabulário , Humanos , Masculino , Feminino , Lactente , Fala/fisiologia , Fonética , Medida da Produção da Fala/métodos
3.
J Speech Lang Hear Res ; 67(2): 384-399, 2024 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-38289853

RESUMO

PURPOSE: The purpose of this study was to quantify sentence-level articulatory kinematics in individuals treated for oral squamous cell carcinoma (ITOC) compared to control speakers while also assessing the effect of treatment site (jaw vs. tongue). Furthermore, this study aimed to assess the relation between articulatory-kinematic measures and self-reported speech problems. METHOD: Articulatory-kinematic data from the tongue tip, tongue back, and jaw were collected using electromagnetic articulography in nine Dutch ITOC and eight control speakers. To quantify articulatory kinematics, the two-dimensional articulatory working space (AWS; in mm2), one-dimensional anteroposterior range of motion (AP-ROM; in mm), and superior-inferior range of motion (SI-ROM in mm) were calculated and examined. Self-reported speech problems were assessed with the Speech Handicap Index (SHI). RESULTS: Compared to a sex-matched control group, ITOC showed significantly smaller AWS, AP-ROM, and SI-ROM for both the tongue tip and tongue back sensor, but no significant differences were observed for the jaw sensor. This pattern was found for both individuals treated for tongue and jaw tumors. Moderate nonsignificant correlations were found between the SHI and the AWS of the tongue back and jaw sensors. CONCLUSIONS: Despite large individual variation, ITOC showed reduced one- and two-dimensional tongue, but not jaw, movements compared to control speakers and treatment for tongue and jaw tumors resulted in smaller tongue movements. A larger sample size is needed to establish a more generalizable connection between the AWS and the SHI. Further research should explore how these kinematic changes in ITOC are related to acoustic and perceptual measures of speech.


Assuntos
Carcinoma de Células Escamosas , Neoplasias Maxilomandibulares , Neoplasias Bucais , Humanos , Inteligibilidade da Fala , Medida da Produção da Fala/métodos , Neoplasias Bucais/cirurgia , Acústica da Fala , Fala , Língua/cirurgia , Fenômenos Biomecânicos , Fenômenos Eletromagnéticos , Arcada Osseodentária
4.
Clin Linguist Phon ; 38(2): 97-115, 2024 03.
Artigo em Inglês | MEDLINE | ID: mdl-36592050

RESUMO

To study the possibility of using acoustic parameters, i.e., Acoustic Voice Quality Index (AVQI) and Maximum Phonation Time (MPT) for predicting the degree of lung involvement in COVID-19 patients. This cross-sectional case-control study was conducted on the voice samples collected from 163 healthy individuals and 181 patients with COVID-19. Each participant produced a sustained vowel/a/, and a phonetically balanced Persian text containing 36 syllables. AVQI and MPT were measured using Praat scripts. Each patient underwent a non-enhanced chest computed tomographic scan and the Total Opacity score was rated to assess the degree of lung involvement. The results revealed significant differences between patients with COVID-19 and healthy individuals in terms of AVQI and MPT. A significant difference was also observed between male and female participants in AVQI and MPT. The results from the receiver operating characteristic curve analysis and area under the curve indicated that MPT (0.909) had higher diagnostic accuracy than AVQI (0.771). A significant relationship was observed between AVQI and TO scores. In the case of MPT, however, no such relationship was observed. The findings indicated that MPT was a better classifier in differentiating patients from healthy individuals, in comparison with AVQI. The results also showed that AVQI can be used as a predictor of the degree of patients' and recovered individuals' lung involvement. A formula is suggested for calculating the degree of lung involvement using AVQI.


Assuntos
COVID-19 , Disfonia , Humanos , Masculino , Feminino , Disfonia/diagnóstico , Acústica da Fala , Estudos de Casos e Controles , Estudos de Viabilidade , Estudos Transversais , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Acústica , Tomografia , Medida da Produção da Fala/métodos
5.
Laryngoscope ; 134(5): 2272-2276, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-37942827

RESUMO

INTRODUCTION: Auditory-perceptual evaluations of dysphonia, though essential for comprehensive voice evaluation, are subject to listener bias. Knowledge of an underlying voice disorder can influence auditory-perceptual ratings. Accented speech results in increased listener effort and delays in word identification. Yet, little is known about the impact of foreign language accents on auditory-perceptual ratings for dysphonic speakers. The purpose of this work was to determine the impact of a foreign language accent on experts' auditory-perceptual ratings of dysphonic speakers. METHODS: Twelve voice-specializing SLPs who spoke with a General American English (GAE) accent rated vocal percepts of 28 speakers with a foreign language accent and 28 with a GAE accent, all of whom had been diagnosed with a voice disorder. Speaker groups were matched based on sex, age, and mean smoothed cepstral peak prominence. Four linear mixed-effects models assessed the impact of a foreign language accent on expert auditory-perceptual ratings of the overall severity of dysphonia, roughness, breathiness, and strain. RESULTS: The twelve raters demonstrated good inter- and intra-rater reliability (ICC[3, k] = .89; mean ICC = .89). The linear mixed-effects models revealed no significant impact of foreign language accent on ratings of overall severity of dysphonia, roughness, breathiness, or strain. CONCLUSION: Despite the possibility of increased listener effort and bias, foreign language accent incongruence had no effect on expert listeners' auditory-perceptual evaluations for dysphonic speakers. Findings support the use of auditory-perceptual evaluations for voice disorders across sociolinguistically diverse populations. LEVEL OF EVIDENCE: 3 Laryngoscope, 134:2272-2276, 2024.


Assuntos
Disfonia , Percepção da Fala , Humanos , Disfonia/diagnóstico , Reprodutibilidade dos Testes , Idioma , Fala , Acústica da Fala , Medida da Produção da Fala/métodos
6.
Am J Speech Lang Pathol ; 33(1): 393-405, 2024 Jan 03.
Artigo em Inglês | MEDLINE | ID: mdl-38060689

RESUMO

PURPOSE: Variability in auditory-perceptual ratings of voice limits their utility, with the poorest reliability often noted for vocal strain. The purpose of this study was to determine whether an experimental method, called visual sort and rate (VSR), promoted stronger rater reliability than visual analog scale (VAS), for ratings of strain in two clinical populations: adductor laryngeal dystonia (ADLD) and vocal hyperfunction (VH). METHOD: Connected speech samples from speakers with ADLD and VH as well as age- and sex-matched controls were selected from a database. Fifteen inexperienced listeners rated strain for two speaker sets (25 ADLD speakers and five controls; 25 VH speakers and five controls) across four rating blocks: VAS-ADLD, VSR-ADLD, VAS-VH, and VSR-VH. For the VAS task, listeners rated each speaker for strain using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of severities in each set. Listeners sorted and ranked samples for strain within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Intrarater reliability (Pearson's r) and interrater variability (mean of the squared differences between a listener's ratings and group mean ratings) were compared across rating methods and populations using two repeated-measures analyses of variance. RESULTS: Intrarater reliability of strain was significantly stronger when listeners used VSR compared to VAS; listeners also showed significantly better intrarater reliability in ADLD than VH. Listeners demonstrated significantly less interrater variability (better reliability) when using VSR compared to VAS. No significant effect of population or interactions was found between listeners for measures of interrater variability. CONCLUSIONS: VSR increases intrarater reliability for ratings of vocal strain in speakers with VH and ADLD. VSR decreases variability of auditory-perceptual judgments of strain between inexperienced listeners in these clinical populations. Future research should determine whether benefits of VSR extend to voice clinicians and/or clinical settings.


Assuntos
Disfonia , Percepção da Fala , Voz , Humanos , Qualidade da Voz , Julgamento , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos
7.
Int J Speech Lang Pathol ; 26(2): 267-277, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37272348

RESUMO

PURPOSE: The primary objective of this study was to determine if speech and pause measures obtained using a passage reading task and timing measures from a monosyllabic diadochokinesis (DDK) task differ across speakers of Canadian French diagnosed with amyotrophic lateral sclerosis (ALS) presenting with and without bulbar symptoms, and healthy controls. The secondary objective was to determine if these measures can reflect the severity of bulbar symptoms. METHOD: A total of 29 Canadian French speakers with ALS (classified as bulbar symptomatic [n = 14] or pre-symptomatic [n = 15]) and 17 age-matched healthy controls completed a passage reading task and a monosyllabic DDK task (/pa/ and /ta/), for up to three follow-up visits. Measures of speaking rate, total duration, speech duration, and pause events were extracted from the passage reading recordings using a semi-automated speech and pause analysis procedure. Manual analysis of DDK recordings provided measures of DDK rate and variability. RESULT: Group comparisons revealed significant differences (p = < .05) between the symptomatic group and the pre-symptomatic and control groups for all passage measures and DDK rates. Only the DDK rate in /ta/ differentiated the pre-symptomatic and control groups. Repeated measures correlations revealed moderate correlations (rrm = > 0.40; p = < 0.05) between passage measures of total duration, speaking rate, speech duration, and number of pauses, and ALSFRS-R total and bulbar scores, as well as between DDK rate and ALSFRS-R total score. CONCLUSION: Speech and pause measures in passage and timing measures in monosyllabic DDK tasks might be suitable for monitoring bulbar functional symptoms in French speakers with ALS, but more work is required to identify which measures are sensitive to the earliest stages of the disease.


Assuntos
Esclerose Lateral Amiotrófica , Fala , Humanos , Esclerose Lateral Amiotrófica/complicações , Canadá , Medida da Produção da Fala/métodos , Idioma
8.
J Speech Lang Hear Res ; 67(1): 116-125, 2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-37992413

RESUMO

PURPOSE: This study compared the interrater reliability of adult naïve listeners' perceptual assessments of different speech variables in children with a cleft palate with or without a cleft lip (CP ± L). In addition, the study investigated whether the listeners were able to perceive differences in these speech variables before and after speech therapy for cleft palate speech disorders. METHOD: Thirty-four speech samples of 14 children with a CP ± L (14 samples collected immediately before 10 hr of speech intervention, 14 samples collected immediately after speech intervention, and six randomly selected samples that were duplicated to assess intrarater reliability) were perceptually assessed by 26 adult naïve listeners. The listening panel consisted of nine men and 17 women (age range: 18-51 years). The speech variables included speech understandability, speech acceptability, hypernasality, hyponasality, nasal airflow, and articulation, which were assessed on a visual analog scale. Furthermore, the need for speech therapy was assessed. RESULTS: Good to very good interrater reliability was observed for the naïve listeners' ratings of all speech variables. A significant time effect was found for the pre- and postevolution of the speech variables "speech understandability," "speech acceptability," "nasal airflow," and "articulation." This time effect indicates an improvement of these variables postintervention. According to the naïve listeners, children were less in need of additional speech therapy after the 10-hr intervention period compared to assessments before this intervention period. CONCLUSIONS: Adult naïve listeners perceptually identified an improvement in different speech variables after 10 hr of cleft palate speech therapy. These findings confirm previous assessments of expert speech-language pathologists and suggest that speech improvements after cleft palate speech therapy can also be perceived by communication partners outside the therapy room. Perceptual ratings of naïve listeners can, thus, be used to add life-situation significance to the assessments of experts. Future research could include both expert raters and caregivers or relatives of children with a CP ± L in listening panels, as previous knowledge on craniofacial anomalies may lead to different results.


Assuntos
Fenda Labial , Fissura Palatina , Distúrbios da Voz , Masculino , Adulto , Criança , Humanos , Feminino , Adolescente , Adulto Jovem , Pessoa de Meia-Idade , Fissura Palatina/complicações , Fissura Palatina/terapia , Fala , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos , Distúrbios da Fala/etiologia , Distúrbios da Fala/terapia , Fenda Labial/complicações , Fenda Labial/terapia
9.
Vestn Otorinolaringol ; 88(5): 23-26, 2023.
Artigo em Russo | MEDLINE | ID: mdl-37970766

RESUMO

In order to evaluate the effectiveness of the treatment in patients with functional dysphonia, the Cepstral Peak Prominence (CPP) test was used. Twenty dysphonic women aged from 18 to 47 years were under observation. The control group consisted of 20 healthy women of close age. Patients underwent 5-7 sessions electrostimulation of laryngeal muscles and phonopedic treatment, after which a complete restoration of the voice was noted. The Praat clinical program was used, installed on a Hewlett-Packard 630 laptop (Pentium B960, 2.2 GHz). A SHURE SM94 condenser microphone was used as well. In the control group, the results were as follows: M=7.49 (SD=1.26) dB. In the main group before treatment: M=5.00 (SD=1.07) dB, after treatment: M=7.95 (SD=1.34) dB. Differences in KT values in the main group before and after treatment (5.00 dB and 7.95 dB, respectively) were significant at p<0.0001. Differences in KT values in the main group before treatment (5.00 dB) and in the control group (7.49 dB) were significant at p<0.0001. Differences in KT values in the main group after treatment (7.95 dB) and in the control group (7.49 dB) were not significant at p>0.05. The study showed high sensitivity of the method. The CPP data after treatment were higher than those before treatment and did not differ from the control ones. It is concluded that CPP is a highly sensitive method for evaluating the degree of periodicity of an acoustic signal and can be used to evaluate the effectiveness of treatment in patients with functional dysphonia.


Assuntos
Disfonia , Voz , Humanos , Feminino , Disfonia/diagnóstico , Disfonia/terapia , Acústica da Fala , Medida da Produção da Fala/métodos , Acústica
10.
J Fluency Disord ; 78: 106016, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37852018

RESUMO

PURPOSE: Previous work shows that linguistic features (e.g., word length, word frequency) impact the predictability of stuttering events. Most of this work has been conducted using reading tasks. Our study examined how linguistic features impact the predictability of stuttering events during spontaneous speech. METHODS: The data were sourced from the FluencyBank database and consisted of interviews with 35 adult stutterers (27,009 words). Three logistic regression mixed models were fit as the primary analyses: one model with four features (i.e., initial phoneme, grammatical function, word length, and word position within a sentence), a second model with six features (i.e., the features from the previous model plus word frequency and neighborhood density), and a third model with nine features (i.e., the features from the previous model plus bigram frequency, word concreteness, and typical age of word acquisition). We compared our models using the Area Under the Curve statistic. RESULTS: The four-feature model revealed that initial phoneme, grammatical function, and word length were predictive of stuttering events. The six-feature model revealed that initial phoneme, word length, word frequency, and neighborhood density were predictive of stuttering events. The nine-feature model was not more predictive than the six-feature model. CONCLUSION: Linguistic features that were previously found to be predictive of stuttering during reading were predictive of stuttering during spontaneous speech. The results indicate the influence of linguistic processes on the predictability of stuttering events such that words associated with increased planning demands (e.g., longer words, low frequency words) were more likely to be stuttered.


Assuntos
Fala , Gagueira , Adulto , Humanos , Gagueira/diagnóstico , Medida da Produção da Fala/métodos , Linguística/métodos , Idioma
11.
J Speech Lang Hear Res ; 66(12): 4849-4859, 2023 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-37902504

RESUMO

PURPOSE: Most people with dysphonia present with voices that vary along more than one voice quality (VQ) dimension. This study sought to examine the effect of covariance between breathy and rough VQ in natural voices. METHOD: A two-dimensional matrix of 16 /a/ vowels was selected such that two VQ dimensions (breathiness and roughness) were sampled on a 4-point severity scale (none, mild, moderate, and severe). Ten listeners evaluated 480 stimuli (16 stimuli × 10 repetitions × 3 blocks) on one-dimensional magnitude estimation (1DME) tasks and a novel two-dimensional magnitude estimation (2DME) task that allowed for simultaneous measurement of breathiness and roughness. RESULTS: Data indicated high intra- and interrater reliabilities for both breathiness and roughness in the 2DME and 1DME tasks. Correlation analyses revealed a strong correlation between 2DME and 1DME judgments for breathiness and roughness (r > .95). There was also a minimal correlation between breathy and rough VQ in the 2DME task (r < .10). CONCLUSIONS: Covarying roughness or breathiness had less impact on the perception of the other VQ in natural dysphonic voices in 2DME compared to 1DME. An understanding and quantification of the perceptual interactions among the dimensions will aid in the refinement of computational models and in the establishment of the validity of clinical scales for VQ perception.


Assuntos
Disfonia , Percepção da Fala , Humanos , Qualidade da Voz , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Disfonia/diagnóstico , Julgamento , Acústica da Fala
12.
Codas ; 35(5): e20220009, 2023.
Artigo em Português, Inglês | MEDLINE | ID: mdl-37792751

RESUMO

PURPOSE: to describe the profile of fluency concerning the typology of disfluencies, speed, and frequency of disruptions in spontaneous speech, reading, and retelling; to compare the fluency profile in adults who stutter in spontaneous speech, reading, and retelling of text. METHODS: The present work is a cross-sectional comparative study with a sample composed of 15 adults who stutter of both sexes, with higher education or equivalent to complete elementary school II. Samples were collected in the tasks of spontaneous speech, reading, and text retelling through video calls made individually with the participants. The first 200 syllables expressed in each task were transcribed and analyzed according to the Fluency Profile Assessment Protocol (FPAP). The study compared the frequency of common and stuttering disfluencies and the speed in the different tasks surveyed. The Kruskal & Wallis test was used together with Duncan's multiple comparisons test to compare the medians and verify possible differences between the tasks researched with a significance level of 5%. RESULTS: The reading task presented a lower number of common disfluencies and a percentage of speech discontinuity about spontaneous speech and retelling tasks. No statistically significant differences were found between stuttering disfluencies in the three tasks surveyed. CONCLUSION: This study showed that there are differences in the occurrence of common disfluencies - hesitations, interjections, and revisions - and in the percentage of speech discontinuity during an oral reading of adults who stutter concerning spontaneous speech and text retelling.


OBJETIVO: descrever o perfil da fluência em relação à tipologia das disfluências, velocidade e frequência de rupturas na fala espontânea, na leitura e no reconto; comparar o perfil da fluência em adultos que gaguejam na fala espontânea, na leitura e no reconto de texto. MÉTODO: O trabalho é um estudo transversal comparativo com amostra composta por 15 adultos que gaguejam de ambos os sexos, com formação superior ou equivalente ao ensino fundamental II completo. Foram coletadas amostras nas tarefas de fala espontânea, leitura e reconto de texto por meio de video chamadas realizadas individualmente. As 200 primeiras sílabas expressas de cada tarefa foram transcritas e analisadas segundo o Protocolo de Avaliação do Perfil da Fluência (PAPF). O estudo comparou a frequência das disfluências comuns e gagas e a velocidade nas tarefas pesquisadas. Adotou-se o teste de Kruskal & Wallis em conjunto com o de comparações múltiplas de Duncan para comparar as medianas e verificar possíveis diferenças entre as tarefas pesquisadas com nível de significância de 5%. RESULTADOS: A tarefa de leitura apresentou menor número de disfluências comuns e percentual de descontinuidade de fala em relação às tarefas de fala espontânea e reconto. Não foram encontradas diferenças estatisticamente significantes entre as disfluências gagas nas três tarefas pesquisadas. CONCLUSÃO: Este trabalho mostrou que existem diferenças na ocorrência das disfluências comuns - hesitações, interjeições e revisões - e no percentual de descontinuidade de fala durante a leitura oral de adultos que gaguejam em relação à fala espontânea e ao reconto de texto.


Assuntos
Fala , Gagueira , Masculino , Feminino , Adulto , Humanos , Leitura , Estudos Transversais , Medida da Produção da Fala/métodos
13.
Codas ; 35(6): e20220209, 2023.
Artigo em Português, Inglês | MEDLINE | ID: mdl-37820100

RESUMO

PURPOSE: to identify a set of requirements for the development of an auditory-perceptual training simulator (APT) based on the experience of professors who provide APT. METHODS: This is a cross-sectional, descriptive study with a quantitative approach. Twenty-two professors answered an online questionnaire containing 31 items related to APT, involving items about the professional profile, conditions for APT in undergraduate and postgraduate courses in Speech Therapy, APT structure, and evaluation of the APT effect. RESULT: it was observed that there is a variation in APT procedures performed in Brazil. The main requirements indicated by the respondents for the APT involve the use of synthesized voices in the initial moments, followed by human voices later; the use of speech tasks with sustained vowels and connected speech; the insertion of complementary information such as gender, age, the profession of the speaker and the spectrography of the vocal signal; training with a minimum time of six hours; the evaluation of the training effect by comparing intra- and inter-judge agreement before and after training; the addition of the parameters of general degree of vocal deviation, roughness, breathiness, and strain; the use of validated continuous and numerical scales; and offering it from the second year of the undergraduate program. CONCLUSION: although there is variability in the response of experts, a minimum set of requirements indicated for performing APT with new judges was identified.


OBJETIVO: identificar um conjunto de requisitos para o desenvolvimento de um simulador de treinamento perceptivo-auditivo (TPA) a partir da experiência de docentes que realizam o TPA. MÉTODO: Trata-se de um estudo transversal, descritivo, com abordagem quantitativa. Vinte e dois docentes responderam um questionário online contendo 31 itens relacionados ao TPA, envolvendo itens sobre o perfil profissional, condições para o TPA nos cursos de graduação e pós-graduação em Fonoaudiologia, estrutura do TPA, avaliação do efeito do TPA. RESULTADO: observou-se que existe variação nos procedimentos de TPA realizados no Brasil. Os principais requisitos indicados pelos respondentes para o TPA envolvem o uso de vozes sintetizadas nos momentos iniciais, seguindo para vozes humanas posteriormente; a utilização de tarefas de fala com vogais sustentadas e fala encadeada; a inserção de informações complementares tais como o gênero, idade, profissão do falante e a espectrografia do sinal vocal; treinamento com tempo mínimo de seis horas; a avaliação do efeito do treinamento pela comparação da concordância intra e inter-juizes pré e pós treinamento; a adição dos parâmetros de grau geral de desvio vocal, rugosidade, soprosidade e tensão; a utilização de escalas contínuas e numéricas validadas; e ser realizado a partir do segundo ano de graduação. CONCLUSÃO: embora haja uma variabilidade da resposta dos especialistas, foi identificado um conjunto mínimo de requisitos indicados para a realização de TPA com novos juízes.


Assuntos
Disfonia , Percepção da Fala , Humanos , Acústica da Fala , Julgamento , Estudos Transversais , Qualidade da Voz , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Variações Dependentes do Observador
14.
Am J Speech Lang Pathol ; 32(5S): 2493-2511, 2023 10 17.
Artigo em Inglês | MEDLINE | ID: mdl-37656150

RESUMO

PURPOSE: The purpose of this investigation was to compare the effects of two specific treatment protocols for acquired apraxia of speech (AOS): Sound Production Treatment (SPT) and Metrical Pacing Therapy (MPT), and to examine changes in communicative participation. METHOD: Four speakers with chronic AOS and aphasia were each administered SPT and MPT in a replicated crossover design (ABACA/ACABA) with nonconcurrent multiple baselines across participants and behaviors. Treatment outcomes were compared with respect to whole word correctness (WWC) for treated and untreated multisyllabic word targets. Speech intelligibility was assessed using the Chapel Hill Multilingual Intelligibility Test, and communicative participation was measured using the Communicative Participation Item Bank at baseline, washout, and follow-up phases. RESULTS: Three of the four participants experienced statistically significant improvements in WWC with SPT, and three of the four participants with MPT. Based on a priori criteria, three participants demonstrated relatively greater benefit from SPT and one participant demonstrated relatively greater benefit from MPT. There were measurable improvements in intelligibility following SPT for three of the four participants. Only one participant in this investigation reported a significant change in communicative participation, and only following MPT. CONCLUSIONS: This study demonstrated that individuals in the chronic stages of AOS can benefit from both SPT and MPT, corroborating prior research on articulatory kinematic and rate and/or rhythm control treatment approaches. It contributes a comparison of two protocols for AOS with respect to whole word targets, intelligibility, and individual self-report of communicative participation changes. More participants showed a relative advantage of SPT over MPT. One individual reported communicative participation improvement after MPT. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.23971929.


Assuntos
Afasia , Apraxias , Humanos , Fala , Projetos de Pesquisa , Fonoterapia/métodos , Apraxias/diagnóstico , Apraxias/terapia , Afasia/terapia , Inteligibilidade da Fala , Medida da Produção da Fala/métodos
15.
J Speech Lang Hear Res ; 66(7): 2164-2183, 2023 07 12.
Artigo em Inglês | MEDLINE | ID: mdl-37267440

RESUMO

PURPOSE: This study investigates the hypothesis that younger speakers and speakers with more severe speech sound disorders are more likely to use simpler (undifferentiated) tongue gestures due to difficulties with, or immaturity of, lingual motor control. METHOD: The hypothesis is tested using cross-sectional secondary data analysis of synchronous audio and high-speed ultrasound recordings from children with idiopathic speech sound disorders (n = 30, aged 5;0-12;11 [years;months]) and typically developing children (n = 29, aged 5;8-12;10), producing /a/, /t/, /ɹ/, /l/, /s/, and /ʃ/ in an intervocalic /aCa/ environment. Tongue shape complexity is measured using NINFL (Number of INFLections) and modified curvature index (MCI) from splines fitted to ultrasound images at the point of maximal lingual gesture. Age, perceived accuracy, and consonant are used as predictors. RESULTS: The results suggest that as age increases, children with speech sound disorders have lower MCI compared to typically developing children. Increase in age also led to decrease of MCI for the typically developing group. In the group of children with speech sound disorders, perceptually incorrect /ɹ/ productions have lower MCI than correct productions, relative to /a/. CONCLUSIONS: There is some evidence of systematic tongue shape complexity differences between typically developing children and children with speech sound disorders when accounting for increase in age. Among children with speech sound disorders, increase in age and perceptually incorrect consonant realizations are associated with decreasing tongue shape complexity.


Assuntos
Transtorno Fonológico , Humanos , Criança , Transtorno Fonológico/diagnóstico por imagem , Estudos Transversais , Língua/diagnóstico por imagem , Ultrassonografia/métodos , Gestos , Fala , Fonética , Medida da Produção da Fala/métodos
16.
Sensors (Basel) ; 23(11)2023 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-37299922

RESUMO

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.


Assuntos
Disfonia , Voz , Humanos , Disfonia/diagnóstico , Acústica da Fala , Qualidade da Voz , Medida da Produção da Fala/métodos
17.
Am J Speech Lang Pathol ; 32(4): 1565-1577, 2023 07 10.
Artigo em Inglês | MEDLINE | ID: mdl-37257202

RESUMO

PURPOSE: The purpose of this study was to determine normative values for cepstral peak prominence measures across the life span as a function of sex using clinically relevant stimuli (/ɑ/, /i/, and two sentences of The Rainbow Passage) and two commonly used software types: Praat (Version 6.0.50) and Analysis of Dysphonia in Speech and Voice (ADSV). METHOD: One hundred fifty speakers (75 males, 75 females; evenly distributed into three age groups) without voice disorders aged 18-91 years were recorded via headset microphone in a sound-treated booth. Cepstral measures were analyzed using common analysis methods in Praat and ADSV by sex, stimuli, and software type. Kruskal-Wallis tests and post hoc Mood's Median tests for significant factors were performed on cepstral measures to assess the effects of age group, sex, stimuli, and software type. RESULTS: The results revealed statistically significant effects of sex, stimuli, and software type on cepstral measures, but no statistical effect of age group on cepstral values. Females had lower average cepstral values compared to males. Across stimuli, the highest average cepstral measure was found for sustained /ɑ/, followed by sustained /i/, and then of the two sentences of The Rainbow Passage. Average cepstral measures in Praat were higher than those from ADSV. CONCLUSIONS: The current work did not find a statistical effect of age group on cepstral values; thus, normative cepstral values were reported by sex, stimuli, and software type. Future work should examine the applicability of these normative values for discriminating speakers with and without voice disorders.


Assuntos
Disfonia , Fala , Masculino , Feminino , Humanos , Acústica da Fala , Longevidade , Qualidade da Voz , Software , Medida da Produção da Fala/métodos
18.
Am J Speech Lang Pathol ; 32(4S): 1923-1937, 2023 08 17.
Artigo em Inglês | MEDLINE | ID: mdl-37105919

RESUMO

PURPOSE: Few studies have reported on the vowel space area (VSA) in both acoustic and kinematic domains. This study examined acoustic and kinematic VSAs for speakers with and without dysarthria and evaluated effects of normalization on acoustic and kinematic VSAs and the relationship between these measures. METHOD: Vowel data from 12 speakers with and without dysarthria, presenting with a range of speech abilities, were examined. The speakers included four speakers with Parkinson's disease (PD), four speakers with brain injury (BI), and four neurotypical (NT) speakers. Speech acoustic and kinematic data were acquired simultaneously using electromagnetic articulography during a passage reading task. Raw and normalized VSAs calculated from corner vowels /i/, /æ/, /ɑ/, and /u/ were evaluated. Normalization was achieved through z score transformations to the acoustic and kinematic data. The effect of normalization on variability within and across groups was evaluated. Regression analysis was used across speakers to assess the association between acoustic and kinematic VSAs for both raw and normalized data. RESULTS: When evaluating the speakers as three different groups (i.e., PD, BI, and NT), normalization reduced the standard deviations within each group and changed the relative differences in average magnitude between groups. Regression analysis revealed a significant relationship between normalized, but not raw, acoustic and kinematic VSAs, after the exclusion of an outlier speaker. CONCLUSIONS: Normalization reduces the variability across speakers, within groups, and changes average magnitudes affecting speaker group comparisons. Normalization also influences the correlation between acoustic and kinematic measures. Further investigation of the impact of normalization techniques upon acoustic and kinematic measures is warranted. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.22669747.


Assuntos
Doença de Parkinson , Inteligibilidade da Fala , Humanos , Medida da Produção da Fala/métodos , Acústica da Fala , Disartria/diagnóstico , Disartria/etiologia , Fenômenos Biomecânicos , Acústica , Doença de Parkinson/complicações , Fonética
19.
J Speech Lang Hear Res ; 66(4): 1280-1290, 2023 04 12.
Artigo em Inglês | MEDLINE | ID: mdl-37014996

RESUMO

PURPOSE: Extemporaneous speech tasks provide an ecologically valid sample to examine speech acoustics, but differing methodologies exist in the literature for segmentation. Therefore, the purpose of this study was to examine the utility and reliability of a segmentation approach for extemporaneous speech specified by systemic functional linguistics (SFL) and its potential research and clinical applications. METHOD: Ten speakers without communication disorders served as participants in this study, and they responded to self-selected extemporaneous speaking prompts. Two expert analysts and one clinician analyst utilized a segmentation procedure specified by SFL to segment the extemporaneous speech samples into clauses and clause complexes. Intra- and interrater reliability were calculated for each analyst and pair of analysts. Acoustic measures of duration, speech rate, and intercomplex pause durations were calculated for each clause complex. RESULTS: Analyses for both intra- and interrater reliability revealed high percent agreement that was significantly greater than chance for expert and clinician analysts and between each pair of analysts (p < .001). Acoustic analyses revealed expected variation in number and duration of spoken syllables of clause complexes between and within speakers. CONCLUSIONS: The segmentation approach for extemporaneous speech specified by SFL is a reliable method for trained analysts that is informed by lexico-grammar and allows for acoustic measurement of speech production. It is also a reliable method for clinician analysts for speakers without communication disorders, and future work will investigate its utility for speakers with motor speech disorders. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.22357138.


Assuntos
Linguística , Fala , Humanos , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos , Acústica da Fala
20.
Int J Lang Commun Disord ; 58(4): 1251-1267, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36861494

RESUMO

BACKGROUND: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. AIMS: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually. METHODS & PROCEDURES: Over two time periods, 50 graduate students viewed videos of four persons who stutter (PWS) and counted the number of stuttered syllables and total number of syllables uttered, and rated speech naturalness. Students were randomly assigned to one of two groups: the simultaneous group, in which all measures were gathered during one viewing; and the individual group, in which one measure was gathered per viewing. Relative and absolute intra- and inter-rater reliability values were calculated for each measure. OUTCOMES & RESULTS: The following results were notable: better intra-rater relative reliability for the number of stuttered syllables for the individual group (intraclass correlation coefficient (ICC) = 0.839) compared with the simultaneous group (ICC = 0.350), smaller intra-rater standard error of measurement (SEM) (i.e., better absolute reliability) for the number of stuttered syllables for the individual group (7.40) versus the simultaneous group (15.67), and better inter-rater absolute reliability for the total number of syllables for the individual group (88.29) compared with the simultaneous group (125.05). Absolute reliability was unacceptable for all measures across both groups. CONCLUSIONS & IMPLICATIONS: These findings show that judges are likely to be more reliable when identifying stuttered syllables in isolation than when simultaneously collecting them with total syllables spoken and naturalness data. Results are discussed in terms of narrowing the reliability gap between data collection methods for stuttered syllables, improving overall reliability of stuttering measurements, and a procedural change when implementing widely used stuttering assessment protocols. WHAT THIS PAPER ADDS: What is already known on the subject The reliability of stuttering judgments has been found to be unacceptable across a number of studies, including those examining the reliability of the most popular stuttering assessment tool, the Stuttering Severity Instrument (4th edition). The SSI-4, and other assessment applications, involve collecting multiple measures simultaneously. It has been suggested, but not examined, that collecting measures simultaneously, which occurs in the most popular stuttering assessment protocols, may result in substantially inferior reliability when compared to collecting measures individually. What this paper adds to existing knowledge The present study has multiple novel findings. First, relative and absolute intra-rater reliability were substantially better when stuttered syllables data were collected individually compared to when the same data were collected simultaneously with total number of syllables and speech naturalness data. Second, inter-rater absolute reliability for total number of syllables was also substantially better when collected individually. Third, intra-rater and inter-rater reliability were similar when speech naturalness ratings were given individually compared to when they were given while simultaneously counting stuttered and fluent syllables. What are the potential or actual clinical implications of this work? Clinicians can be more reliable when identifying stuttered syllables individually compared to when they judge stuttering along with other clinical measures of stuttering. In addition, when clinicians and researchers use current popular protocols for assessing stuttering that recommend simultaneous data collection, including the SSI-4, they should instead consider collecting stuttering event counts individually. This procedural change will lead to more reliable data and stronger clinical decision making.


Assuntos
Gagueira , Humanos , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Fala , Medida da Produção da Fala/métodos , Gagueira/diagnóstico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA