ABSTRACT
Speech perception studies typically rely on trained research assistants to score orthographic listener transcripts for words correctly identified. While the accuracy of the human scoring protocol has been validated with strong intra- and inter-rater reliability, the process of hand-scoring the transcripts is time-consuming and resource-intensive. Here, an open-source computer-based tool for automated scoring of listener transcripts (Autoscore) is built and validated on three different human-scored data sets. Results show that Autoscore is not only highly accurate, achieving approximately 99% accuracy, but also extremely efficient. Thus, Autoscore affords a practical research tool, with clinical application, for scoring listener intelligibility of speech.
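As a rough illustration of the scoring task described above, percent words correct can be sketched as the share of target words that also appear in the listener's transcript. This is not Autoscore's implementation or rule set; the function name, the lowercase/punctuation normalization, and the order-insensitive matching are all assumptions made for the example.

```python
import re
from collections import Counter


def percent_words_correct(target: str, response: str) -> float:
    """Return the percentage of target words present in the response.

    Hypothetical helper: matching is exact, case-insensitive, and ignores
    word order; each response word can match at most one target word.
    """
    def tokenize(text: str) -> list:
        # Keep letters and apostrophes only, so "WORLD!" becomes "world".
        return re.findall(r"[a-z']+", text.lower())

    target_words = tokenize(target)
    if not target_words:
        return 0.0

    available = Counter(tokenize(response))
    correct = 0
    for word in target_words:
        if available[word] > 0:
            available[word] -= 1  # consume the matched response word
            correct += 1
    return 100.0 * correct / len(target_words)
```

A real scoring tool would also need rules for spelling variants, plurals, and tense, which this sketch deliberately leaves out.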
Subjects
Auditory Perceptual Disorders/diagnosis , Language Disorders/diagnosis , Software/standards , Speech Perception , Humans , Neuropsychological Tests/standards , Reproducibility of Results
ABSTRACT
The effect of background noise on intelligibility of disordered speech was assessed. Speech-shaped noise was mixed with neurologically healthy (control) and disordered (dysarthric) speech at a series of signal-to-noise ratios. In addition, bandpass filtered control and dysarthric speech conditions were assessed to determine the effect of noise on both naturally and artificially degraded speech. While significant effects of both the amount of noise and the type of speech were revealed, no interaction between the two factors was observed, in either the broadband or filtered testing conditions. Thus, it appears that there is no multiplicative effect of the presence of background noise on intelligibility of disordered speech relative to control speech. That is, the decrease in intelligibility due to increasing levels of noise is similar for both types of speech, and both types of testing conditions, and the function for dysarthric speech is simply shifted downward due to the inherent source degradations of the speech itself. Last, large-scale online crowdsourcing via Amazon Mechanical Turk was utilized to collect data for the current study. Findings and implications for this data and data collection approach are discussed.
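Mixing speech with noise at a chosen signal-to-noise ratio can be sketched as follows. This is a minimal illustration, not the procedure used in the study: the function name is hypothetical, the signals are plain lists of samples of equal length, and SNR is taken as the ratio of mean power over the whole signal.

```python
import math


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`.

    Returns (mixture, scaled_noise). Assumes equal-length sample lists.
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")

    p_speech = sum(x * x for x in speech) / len(speech)
    p_noise = sum(x * x for x in noise) / len(noise)
    if p_noise == 0:
        raise ValueError("noise must have non-zero power")

    # SNR_dB = 10 * log10(P_speech / (gain^2 * P_noise)), solved for gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    scaled_noise = [gain * n for n in noise]
    mixture = [s + n for s, n in zip(speech, scaled_noise)]
    return mixture, scaled_noise
```

Calling the function once per target SNR with the same speech and noise yields the kind of graded series of listening conditions the abstract refers to.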
Subjects
Dysarthria/physiopathology , Noise/adverse effects , Perceptual Masking , Speech Acoustics , Speech Intelligibility , Speech Perception , Acoustic Stimulation , Adolescent , Adult , Aged , Audiometry, Speech , Case-Control Studies , Crowdsourcing , Dysarthria/diagnosis , Female , Humans , Male , Middle Aged , Young Adult
ABSTRACT
A positive relationship between rhythm perception and improved understanding of a naturally dysrhythmic speech signal, ataxic dysarthria, has been previously reported [Borrie, Lansford, and Barrett. (2017). J. Speech Lang. Hear. Res. 60, 3110-3117]. The current follow-on investigation suggests that this relationship depends on the nature of the dysrhythmia. When the corrupted rhythm cues are relatively predictable, affording some learnable acoustic regularity, the relationship is replicated. However, this relationship is nonexistent, along with any intelligibility improvements, when the corrupted rhythm cues are unpredictable. Findings highlight a key role for rhythm perception and distributional regularities in adaptation to dysrhythmic speech.
Subjects
Acoustic Stimulation/methods , Dysarthria/physiopathology , Learning/physiology , Speech Intelligibility/physiology , Speech Perception/physiology , Adult , Dysarthria/diagnosis , Female , Humans , Male , Middle Aged , Young Adult
ABSTRACT
There is substantial individual variability in understanding speech in adverse listening conditions. This study examined whether a relationship exists between processing speech in noise (environmental degradation) and dysarthric speech (source degradation), with regard to intelligibility performance and the use of metrical stress to segment the degraded speech signals. Ninety native speakers of American English transcribed speech in noise and dysarthric speech. For each type of listening adversity, transcriptions were analyzed for proportion of words correct and lexical segmentation errors indicative of stress cue utilization. Consistent with the hypotheses, intelligibility performance for speech in noise was correlated with intelligibility performance for dysarthric speech, suggesting similar cognitive-perceptual processing mechanisms may support both. The segmentation results also support this postulation. While stress-based segmentation was stronger for speech in noise relative to dysarthric speech, listeners utilized metrical stress to parse both types of listening adversity. In addition, reliance on stress cues for parsing speech in noise was correlated with reliance on stress cues for parsing dysarthric speech. Taken together, the findings demonstrate a preference to deploy the same cognitive-perceptual strategy in conditions where metrical stress offers a route to segmenting degraded speech.
Subjects
Dysarthria/psychology , Noise/adverse effects , Perceptual Masking , Speech Acoustics , Speech Perception , Voice Quality , Acoustic Stimulation , Adult , Aged , Audiometry, Speech , Cognition , Comprehension , Cues , Dysarthria/diagnosis , Dysarthria/physiopathology , Female , Humans , Male , Middle Aged , Speech Intelligibility , Young Adult
ABSTRACT
OBJECTIVE: The aim of this exploratory study was to describe audiologist communication behaviours during appointments for hearing device monitoring and management before and after participation in counselling skills training. DESIGN: The study used a longitudinal design with three assessment points over 6 months. STUDY SAMPLE: The sample included 10 audiologists and audiology graduate students interacting in a professional setting with their clients. RESULTS: Audiologists reported improvement in their counselling skills from pre-training to follow-up, which was consistent with objective findings that audiologist relative speaking time decreased from pre-training to post-training as well as from pre-training to follow-up. Observer-rated scores of participants' counselling skills, however, yielded no significant differences across time. CONCLUSIONS: Some improvement was noted in audiologists' counselling behaviour following a 1-day communication skills workshop and continued learning support. It is evident, however, that further training, such as increased training and performance feedback, is needed to maintain and enhance audiologist progress in the various aspects of counselling.
Subjects
Attitude of Health Personnel , Audiologists/psychology , Auditory Perception , Communication , Counseling , Hearing Aids , Hearing Disorders/therapy , Hearing , Inservice Training/methods , Persons With Hearing Impairments/rehabilitation , Appointments and Schedules , Clinical Competence , Health Knowledge, Attitudes, Practice , Hearing Disorders/diagnosis , Hearing Disorders/physiopathology , Hearing Disorders/psychology , Humans , Longitudinal Studies , Persons With Hearing Impairments/psychology , Professional-Patient Relations , Time Factors
ABSTRACT
Speech perception abilities vary substantially across listeners, particularly in adverse conditions including those stemming from environmental degradation (e.g., noise) or from talker-related challenges (e.g., nonnative or disordered speech). This study examined adult listeners' recognition of words in phrases produced by six talkers representing three speech varieties: a nonnative accent (Spanish-accented English), a regional dialect (Irish English), and a disordered variety (ataxic dysarthria). Semantically anomalous phrases from these talkers were presented in a transcription task and intelligibility scores, percent words correct, were compared across the three speech varieties. Three cognitive-linguistic areas (receptive vocabulary, cognitive flexibility, and inhibitory control of attention) were assessed as possible predictors of individual word recognition performance. Intelligibility scores for the Spanish accent were significantly correlated with scores for the Irish English and ataxic dysarthria. Scores for the Irish English and dysarthric speech, in contrast, were not correlated. Furthermore, receptive vocabulary was the only cognitive-linguistic assessment that significantly predicted intelligibility scores. These results suggest that, rather than a global skill of perceiving speech that deviates from native dialect norms, listeners may possess specific abilities to overcome particular types of acoustic-phonetic deviation. Furthermore, vocabulary size offers performance benefits for intelligibility of speech that deviates from one's typical dialect norms.
Subjects
Speech , Humans , Individuality , Noise , Phonetics , Speech Perception
ABSTRACT
This study investigated the influence of visual speech information on perceptual processing of neurologically degraded speech. Fifty listeners identified spastic dysarthric speech under both audio (A) and audiovisual (AV) conditions. Condition comparisons revealed that the addition of visual speech information enhanced processing of the neurologically degraded input in terms of (a) acuity (percent phonemes correct) of vowels and consonants and (b) recognition (percent words correct) of predictive and nonpredictive phrases. Listeners exploited stress-based segmentation strategies more readily in AV conditions, suggesting that the perceptual benefit associated with adding visual speech information to the auditory signal (the AV advantage) has both segmental and suprasegmental origins. Results also revealed that the magnitude of the AV advantage can be predicted, to some degree, by the extent to which an individual utilizes syllabic stress cues to inform word recognition in AV conditions. Findings inform the development of a listener-specific model of speech perception that applies to processing of dysarthric speech in everyday communication contexts.
Subjects
Dysarthria/physiopathology , Speech Acoustics , Speech Intelligibility , Speech Perception , Visual Perception , Voice Quality , Acoustic Stimulation , Adult , Audiometry, Speech , Audiovisual Aids , Cognition , Cues , Dysarthria/diagnosis , Female , Humans , Male , Phonetics , Photic Stimulation , Recognition, Psychology , Young Adult
ABSTRACT
PURPOSE: According to the interpersonal synergy model of spoken dialogue, interlocutors modify their communicative behaviors to meet the contextual demands of a given conversation. Although a growing body of research supports this postulation for linguistic behaviors (e.g., semantics, syntax), little is understood about how this model applies to speech behaviors (e.g., speech rate, pitch). The purpose of this study is to test the hypothesis that interlocutors adjust their speech behaviors across different conversational tasks with different conversational goals. METHOD: In this study, 28 participants each engaged in two different types of conversations (i.e., relational and informational) with two partners (i.e., Partner 1 and Partner 2), yielding a total of 112 conversations. We compared six acoustic measures of participant speech behavior across conversational task and partner. RESULTS: Linear mixed-effects models demonstrated significant differences between speech feature measures in informational and relational conversations. Furthermore, these findings were generally robust across conversations with different partners. CONCLUSIONS: Results suggest that contextual demands influence speech behaviors. These findings provide empirical support for the interpersonal synergy model and highlight important considerations for assessing speech behaviors in individuals with communication disorders.
Subjects
Interpersonal Relations , Speech , Humans , Male , Female , Young Adult , Adult , Speech Acoustics , Verbal Behavior , Communication
ABSTRACT
Lexical alignment, a communication phenomenon where conversational partners adapt their word choices to become more similar, plays an important role in the development of language and social communication skills. While this has been studied extensively in the conversations of preschool-aged children and their parents in Western, Educated, Industrialized, Rich, and Democratic (WEIRD) communities, research in other pediatric populations is sparse. This study makes significant expansions on the existing literature by focusing on alignment in naturalistic conversations of school-aged children from a non-WEIRD population across multiple conversational tasks and with different types of adult partners. Typically developing children aged 5 to 8 years (n = 45) engaged in four semi-structured conversations that differed by task (problem-solving vs. play-based) and by partner (parent vs. university student), resulting in a corpus of 180 conversations. Lexical alignment scores were calculated and compared to sham conversations, representing alignment occurring at the level of chance. Both children and adults coordinated their conversational utterances by re-using or aligning each other's word choices. This alignment behavior persisted across conversational tasks and partners, although the degree of alignment was moderated by the conversational context. These findings suggest that lexical alignment is a robust phenomenon in conversations between school-age children and adults. Furthermore, this study extends lexical alignment findings to a non-WEIRD culture, suggesting that alignment may be a coordination strategy employed by adults and children across diverse linguistic and cultural backgrounds.
Subjects
Communication , Social Skills , Adult , Child, Preschool , Humans , Child , Child Language , Language , Parents
ABSTRACT
This investigation examined perceptual learning of dysarthric speech. Forty listeners were randomly assigned to one of two identification training tasks, aimed at highlighting either the linguistic (word identification task) or indexical (speaker identification task) properties of the neurologically degraded signal. Twenty additional listeners served as a control group, passively exposed to the training stimuli. Immediately following exposure to dysarthric speech, all three listener groups completed an identical phrase transcription task. Analysis of listener transcripts revealed remarkably similar intelligibility improvements for listeners trained to attend to either the linguistic or the indexical properties of the signal. Perceptual learning effects were also evaluated with regard to underlying error patterns indicative of segmental and suprasegmental processing. The findings of this study suggest that elements within both the linguistic and indexical properties of the dysarthric signal are learnable and interact to promote improved processing of this type and severity of speech degradation. Thus, the current study extends support for the development of a model of perceptual processing in which the learning of indexical properties is encoded and retained in conjunction with linguistic properties of the signal.
Subjects
Discrimination Learning , Dysarthria/physiopathology , Phonetics , Recognition, Psychology , Speech Acoustics , Speech Intelligibility , Speech Perception , Voice Quality , Acoustic Stimulation , Adult , Analysis of Variance , Attention , Audiometry, Pure-Tone , Audiometry, Speech , Auditory Threshold , Chi-Square Distribution , Cues , Female , Humans , Learning , Male , Models, Psychological , Severity of Illness Index , Young Adult
ABSTRACT
PURPOSE: Although recruitment of cognitive-linguistic resources to support dysarthric speech perception and adaptation is presumed by theoretical accounts of effortful listening and supported by cross-disciplinary empirical findings, prospective relationships have received limited attention in the disordered speech literature. This study aimed to examine the predictive relationships between cognitive-linguistic parameters and intelligibility outcomes associated with familiarization with dysarthric speech in young adult listeners. METHOD: A cohort of 156 listener participants between the ages of 18 and 50 years completed a three-phase perceptual training protocol (pretest, training, and posttest) with one of three speakers with dysarthria. Additionally, listeners completed the National Institutes of Health Toolbox Cognition Battery to obtain measures of the following cognitive-linguistic constructs: working memory, inhibitory control of attention, cognitive flexibility, processing speed, and vocabulary knowledge. RESULTS: Elastic net regression models revealed that select cognitive-linguistic measures and their two-way interactions predicted both initial intelligibility and intelligibility improvement of dysarthric speech. While some consistency across models was shown, unique constellations of select cognitive factors and their interactions predicted initial intelligibility and intelligibility improvement of the three different speakers with dysarthria. CONCLUSIONS: Current findings extend empirical support for theoretical models of speech perception in adverse listening conditions to dysarthric speech signals. Although predictive relationships were complex, vocabulary knowledge, working memory, and cognitive flexibility often emerged as important variables across the models.
Subjects
Speech Intelligibility , Speech Perception , Humans , Young Adult , Adolescent , Adult , Middle Aged , Dysarthria/psychology , Prospective Studies , Cognition
ABSTRACT
PURPOSE: People with dysarthria have been rated as less confident and less likable and are often assumed by listeners to have reduced cognitive abilities relative to neurotypical speakers. This study explores whether educational information about dysarthria can shift these attitudes in a group of speakers with hypokinetic dysarthria secondary to Parkinson's disease. METHOD: One hundred seventeen listeners were recruited via Amazon Mechanical Turk to transcribe sentences and rate the confidence, intelligence, and likability of eight speakers with mild hypokinetic dysarthria. Listeners were assigned to one of four conditions. In one condition, listeners were provided with no educational information prior to exposure to speakers with dysarthria (n = 29). In another condition, listeners were given educational statements from the American Speech-Language-Hearing Association website (n = 29). In a third condition, listeners were given additional information stating that dysarthria does not indicate reduced intelligence or understanding (n = 30). Finally, in a fourth condition, listeners only heard samples from neurotypical, age-matched adults (n = 29). RESULTS: Results revealed statistically significant effects of educational statements on ratings of speakers' confidence, intelligence, and likability. However, educational statements did not affect listeners' transcription accuracy. CONCLUSIONS: This study presents preliminary evidence that educational material can positively influence listener impressions of speakers with hypokinetic dysarthria, especially when it is explicitly stated that the disorder does not affect intelligence or understanding. This initial examination provides preliminary support for educational awareness campaigns and self-disclosure of communicative difficulties in people with mild dysarthria.
Subjects
Parkinson Disease , Speech Perception , Adult , Humans , Speech Intelligibility , Dysarthria/etiology , Dysarthria/complications , Parkinson Disease/complications , Parkinson Disease/psychology , Attitude , Cognition
ABSTRACT
PURPOSE: The ability to understand speech under adverse listening conditions is highly variable across listeners. Despite this, studies have found that listeners with normal hearing display consistency in their ability to perceive speech across different types of degraded speech, suggesting that, for at least these listeners, global skills may be involved in navigating the ambiguity in speech signals. However, there are substantial differences in the perceptual challenges faced by listeners with normal and impaired hearing. This study examines whether listeners with sensorineural hearing loss demonstrate the same type of consistency as normal-hearing listeners when processing neurotypical (i.e., control) speech that has been degraded by external noise and speech that is neurologically degraded such as dysarthria. METHOD: Listeners with normal hearing (n = 31) and listeners with sensorineural hearing loss (n = 36) completed an intelligibility task with neurotypical speech in noise and with dysarthric speech in quiet. RESULTS: Findings were consistent with previous work demonstrating a relationship between the ability to perceive neurotypical speech in noise and dysarthric speech for listeners with normal hearing, albeit at a higher intelligibility level than previously observed. This relationship was also observed for listeners with hearing loss, although listeners with more severe hearing losses performed better with dysarthric speech than with neurotypical speech in noise. CONCLUSIONS: This study demonstrated a high level of consistency in intelligibility performance for listeners across two different types of degraded speech, even when those listeners were further challenged by the presence of sensorineural hearing loss. Clinical implications for both listeners with hearing loss and their communication partners with dysarthria are discussed.
Subjects
Deafness , Hearing Loss, Sensorineural , Speech Perception , Humans , Dysarthria/etiology , Noise , Speech Intelligibility
ABSTRACT
PURPOSE: Background noise reduces speech intelligibility. Time-frequency (T-F) masking is an established signal processing technique that improves intelligibility of neurotypical speech in background noise. Here, we investigated a novel application of T-F masking, assessing its potential to improve intelligibility of neurologically degraded speech in background noise. METHOD: Listener participants (N = 422) completed an intelligibility task either in the laboratory or online, listening to and transcribing audio recordings of neurotypical (control) and neurologically degraded (dysarthria) speech under three different processing types: speech in quiet (quiet), speech mixed with cafeteria noise (noise), and speech mixed with cafeteria noise and then subsequently processed by an ideal quantized mask (IQM) to remove the noise. RESULTS: We observed significant reductions in intelligibility of dysarthric speech, even at highly favorable signal-to-noise ratios (+11 to +23 dB) that did not impact neurotypical speech. We also observed significant intelligibility improvements from speech in noise to IQM-processed speech for both control and dysarthric speech across a wide range of noise levels. Furthermore, the overall benefit of IQM processing for dysarthric speech was comparable with that of the control speech in background noise, as was the intelligibility data collected in the laboratory versus online. CONCLUSIONS: This study demonstrates proof of concept, validating the application of T-F masks to a neurologically degraded speech signal. Given that intelligibility challenges greatly impact communication, and thus the lives of people with dysarthria and their communication partners, the development of clinical tools to enhance intelligibility in this clinical population is critical.
Subjects
Dysarthria , Speech Perception , Humans , Dysarthria/etiology , Dysarthria/therapy , Speech Intelligibility , Auditory Perception , Cognition , Laboratories , Perceptual Masking
ABSTRACT
PURPOSE: Defined as the similarity of speech behaviors between interlocutors, speech entrainment plays an important role in successful adult conversations. According to theoretical models of entrainment and research on motoric, cognitive, and social developmental milestones, the ability to entrain should develop throughout adolescence. However, little is known about the specific developmental trajectory or the role of speech entrainment in conversational outcomes of this age group. The purpose of this study is to characterize speech entrainment patterns in the conversations of neurotypical early adolescents. METHOD: This study utilized a corpus of 96 task-based conversations between adolescents aged 9 to 14 years and a comparison corpus of 32 task-based conversations between adults. For each conversational turn, two speech entrainment scores were calculated for 429 acoustic features across rhythmic, articulatory, and phonatory dimensions. Predictive modeling was used to evaluate the degree of entrainment and relationship between entrainment and two metrics of conversational success. RESULTS: Speech entrainment increased throughout early adolescence but did not reach the level exhibited in conversations between adults. Additionally, speech entrainment was predictive of both conversational quality and conversational efficiency. Furthermore, models that included all acoustic features and both entrainment types performed better than models that only included individual acoustic feature sets or one type of entrainment. CONCLUSIONS: Our findings show that speech entrainment skills are largely developed during early adolescence with continued development possibly occurring across later adolescence. Additionally, results highlight the role of speech entrainment in successful conversation in this population, suggesting the import of continued exploration of this phenomenon in both neurotypical and neurodivergent adolescents. We also provide evidence of the value of using holistic measures that capture the multidimensionality of speech entrainment and provide a validated methodology for investigating entrainment across multiple acoustic features and entrainment types.
Subjects
Communication , Speech , Adult , Humans , Adolescent , Child , Phonation , Speech Production Measurement , Acoustics
ABSTRACT
PURPOSE: Communication atypicalities are considered promising markers of a broad range of clinical conditions. However, little is known about the mechanisms and confounders underlying them. Medications might have a crucial, relatively unknown role both as potential confounders and as a source of insight into the mechanisms at work. The integration of regulatory documents with disproportionality analyses provides a more comprehensive picture to account for in future investigations of communication-related markers. The aim of this study was to identify a list of drugs potentially associated with communicative atypicalities within psychotic and affective disorders. METHOD: We developed a query using the Medical Dictionary for Regulatory Activities to search for communicative atypicalities within the FDA Adverse Event Reporting System (updated June 2021). A Bonferroni-corrected disproportionality analysis (reporting odds ratio) was separately performed on spontaneous reports involving psychotic, affective, and non-neuropsychiatric disorders, to account for the confounding role of different underlying conditions. Drug-adverse event associations not already reported in the Side Effect Resource database of labeled adverse drug reactions (unexpected) were subjected to further robustness analyses to account for expected biases. RESULTS: A list of 291 expected and 91 unexpected potential confounding medications was identified, including drugs that may irritate (inhalants) or desiccate (anticholinergics) the larynx, impair speech motor control (antipsychotics), or induce nodules (acitretin) or necrosis (vascular endothelial growth factor receptor inhibitors) on vocal cords; sedatives and stimulants; neurotoxic agents (anti-infectives); and agents acting on neurotransmitter pathways (dopamine agonists). CONCLUSIONS: We provide a list of medications to account for in future studies of communication-related markers in affective and psychotic disorders. The current test case illustrates rigorous procedures for digital phenotyping, and the methodological tools implemented for large-scale disproportionality analyses can be considered a road map for investigations of communication-related markers in other clinical populations. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.23721345.
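The reporting odds ratio named in the METHOD can be sketched from a 2x2 table of spontaneous reports. The abstract does not say how the Bonferroni correction was applied; widening the confidence interval by dividing alpha by the number of tests is one common choice and is an assumption here, as are the function name and the cell labels.

```python
import math
from statistics import NormalDist


def reporting_odds_ratio(a, b, c, d, alpha=0.05, n_tests=1):
    """Return (ror, lower, upper) for a 2x2 table of report counts.

    a: reports with the drug and the event
    b: reports with the drug, without the event
    c: reports with other drugs and the event
    d: reports with other drugs, without the event
    n_tests: number of comparisons used for the Bonferroni adjustment
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")

    ror = (a * d) / (b * c)
    # Standard error of log(ROR), from the usual odds-ratio approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    # Two-sided critical value at the Bonferroni-adjusted level.
    z = NormalDist().inv_cdf(1 - alpha / (2 * n_tests))
    lower = math.exp(math.log(ror) - z * se)
    upper = math.exp(math.log(ror) + z * se)
    return ror, lower, upper
```

Under this convention, a drug-event pair is usually treated as a disproportionality signal when the lower bound of the interval exceeds 1.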
Subjects
Drug-Related Side Effects and Adverse Reactions , Pharmacovigilance , United States , Humans , Adverse Drug Reaction Reporting Systems , Vascular Endothelial Growth Factor A , United States Food and Drug Administration , Drug-Related Side Effects and Adverse Reactions/epidemiology , Databases, Factual , Mood Disorders , Communication
ABSTRACT
PURPOSE: As evidenced by perceptual learning studies involving adult listeners and speakers with dysarthria, adaptation to dysarthric speech is driven by signal predictability (speaker property) and a flexible speech perception system (listener property). Here, we extend adaptation investigations to adolescent populations and examine whether adult and adolescent listeners can learn to better understand an adolescent speaker with dysarthria. METHOD: Classified by developmental stage, adult (n = 42) and adolescent (n = 40) listeners completed a three-phase perceptual learning protocol (pretest, familiarization, and posttest). During pretest and posttest, all listeners transcribed speech produced by a 13-year-old adolescent with spastic dysarthria associated with cerebral palsy. During familiarization, half of the adult and adolescent listeners engaged in structured familiarization (audio and lexical feedback) with the speech of the adolescent speaker with dysarthria; and the other half, with the speech of a neurotypical adolescent speaker (control). RESULTS: Intelligibility scores increased from pretest to posttest for all listeners. However, listeners who received dysarthria familiarization achieved greater intelligibility improvements than those who received control familiarization. Furthermore, there was a significant effect of developmental stage, where the adults achieved greater intelligibility improvements relative to the adolescents. CONCLUSIONS: This study provides the first tranche of evidence that adolescent dysarthric speech is learnable, a finding that holds even for adolescent listeners whose speech perception systems are not yet fully developed. Given the formative role that social interactions play during adolescence, these findings of improved intelligibility afford important clinical implications.
Subjects
Speech Intelligibility , Speech Perception , Adult , Humans , Adolescent , Dysarthria/etiology , Learning , Cognition
ABSTRACT
Differences in perceptual strategies for lexical segmentation of moderate hypokinetic dysarthric speech, apparently related to the conditions of the familiarization procedure, have been previously reported [Borrie et al., Language and Cognitive Processes (2012)]. The current follow-up investigation examined whether this difference was also observed when familiarization stimuli highlighted syllabic strength contrast cues. Forty listeners completed an identical transcription task following familiarization with dysarthric phrases presented under either passive or explicit learning conditions. Lexical boundary error patterns revealed that syllabic strength cues were exploited in both familiarization conditions. Comparisons with data previously reported afford further insight into perceptual learning of dysarthric speech.
Subjects
Cues , Dysarthria/physiopathology , Recognition, Psychology , Speech Acoustics , Speech Intelligibility , Speech Perception , Acoustic Stimulation , Adult , Analysis of Variance , Audiometry, Speech , Humans , Learning , Photic Stimulation , Reading , Young Adult
ABSTRACT
Conversational entrainment, also known as alignment, accommodation, convergence, and coordination, is broadly defined as similarity of communicative behavior between interlocutors. Within current literature, specific terminology, definitions, and measurement approaches are wide-ranging and highly variable. As new ways of measuring and quantifying entrainment are developed and research in this area continues to expand, consistent terminology and a means of organizing entrainment research are critical, affording cohesion and assimilation of knowledge. While systems for categorizing entrainment do exist, these efforts are not entirely comprehensive in that specific measurement approaches often used within entrainment literature cannot be categorized under existing frameworks. The purpose of this review article is twofold: First, we propose an expanded version of an earlier framework which allows for the categorization of all measures of entrainment of speech behaviors and includes refinements, additions, and explanations aimed at improving its clarity and accessibility. Second, we present an extensive literature review, demonstrating how current literature fits into the given framework. We conclude with a discussion of how the proposed entrainment framework presented herein can be used to unify efforts in entrainment research.
ABSTRACT
PURPOSE: Acoustic-prosodic entrainment, defined as the tendency for individuals to modify their speech behaviors to more closely align with the behaviors of their conversation partner, plays an important role in successful interaction. From a mechanistic perspective, acoustic-prosodic entrainment is, by its very nature, a rhythmic activity. Accordingly, it is highly plausible that an individual's rhythm perception abilities play a role in their ability to successfully entrain. Here, we examine the impact of rhythm perception in speaking rate entrainment and subsequent conversational quality. METHOD: A round-robin paradigm was used to collect 90 dialogues from neurotypical adults. Additional assessments determined participants' rhythm perception abilities, social competence, and partner familiarity (i.e., whether the conversation partners knew each other prior to the interaction). Mediation analysis was used to examine the relationships between rhythm perception scores, speaking rate entrainment (using a measure of static local synchrony), and a measure of conversational success (i.e., conversational quality) based on third-party listener observations. Findings were compared to the same analysis with three additional predictive factors: participant gender, partner familiarity, and social competence. RESULTS: Results revealed a relationship between rhythm perception and speaking rate entrainment. In unfamiliar conversation partners, there was a relationship between speaking rate entrainment and conversational quality. The relationships between entrainment and each of the three additional factors (i.e., gender, partner familiarity, and social competence) were nonsignificant. CONCLUSIONS: In unfamiliar conversation partners, better rhythm perception abilities were indicative of increased conversational quality mediated by higher levels of speaking rate entrainment. These results support theoretical postulations specifying rhythm perception abilities as a component of acoustic-prosodic entrainment, which, in turn, facilitates conversational success. Knowledge of this relationship contributes to the development of a causal framework for considering a mechanism by which rhythm perception deficits in clinical populations may impact conversational success.