Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 56
Filtrar
1.
J Voice ; 2024 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-39306498

RESUMO

BACKGROUND: As part of a larger goal to create best practices for voice data collection to fuel voice artificial intelligence (AI) research, the objective of this study was to investigate the ability of readily available iOS and Android tablets with and without low-cost headset microphones to produce recordings and subsequent acoustic measures of voice comparable to "research quality" instrumentation. METHODS: Recordings of 24 sustained vowel samples representing a wide range of typical and disordered voices were played via a head-and-torso model and recorded using a research quality standard microphone/preamplifier/audio interface. Acoustic measurements from the standard were compared with two popular tablets using their built-in microphones and with low-cost headset microphones at different distances from the mouth. RESULTS: Voice measurements obtained via tablets + headset microphones close to the mouth (2.5 and 5 cm) strongly correlated (r's > 0.90) with the research standard and resulted in no significant differences for measures of vocal frequency and perturbation. In contrast, voice measurements obtained using the tablets' built-in microphones at typical reading distances (30 and 45 cm) tended to show substantial variability in measurement, greater mean differences in voice measurements, and relatively poorer correlations vs the standard. CONCLUSION: Findings from this study support preliminary recommendations from the Bridge2AI-Voice Consortium recommending the use of smartphones paired with low-cost headset microphones as adequate methods of recording for large-scale voice data collection from a variety of clinical and nonclinical settings. Compared with recording using a tablet direct, a headset microphone controls for recording distance and reduces the effects of background noise, resulting in decreased variability in recording quality. DATA AVAILABILITY: Data supporting the results reported in this article may be obtained upon request from the contact author.

2.
J Speech Lang Hear Res ; 67(10): 3612-3630, 2024 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-39259876

RESUMO

OBJECTIVE: Differentiating adductor laryngeal dystonia (ADLD) and primary muscle tension dysphonia (pMTD) can be challenging. Unlike pMTD, ADLD is described as "task-dependent" with voiced phonemes purportedly provoking greater sign expression than voiceless phonemes. We evaluated the ability of two automated acoustic measures, the Cepstral Spectral Index of Dysphonia (CSID) and creak, to detect task dependency and to discriminate ADLD and pMTD. METHOD: CSID, % creak, and listener ratings of dysphonia severity were obtained from audio recordings of patients with ADLD (n = 29) or pMTD (n = 33) reading two sentences loaded with either voiced or voiceless phonemes. RESULTS: Group × Sentence Type interaction effects confirmed that both "normalized" CSID and % creak detected task-dependent sign expression in ADLD (i.e., worse symptoms on the voiced- vs. voiceless-loaded sentence). However, a stepwise binary logistic regression analysis with group (ADLD vs. pMTD) as the dependent variable and % creak and normalized CSID variables (voiced, voiceless, and voiced vs. voiceless difference) as covariates revealed that the normalized CSID voiceless-laden sentence z score was the only significant predictor of group membership. Estimates of diagnostic precision from the normalized CSID voiceless sentence z scores were superior to % creak or listener ratings. Finally, the CSID possessed the strongest correlations with listener severity ratings regardless of group or sentence type. CONCLUSIONS: Although both normalized CSID and % creak detected task-dependent performance as a distinguishing feature of ADLD, a CSID profile wherein (a) the voiceless sentence z score was less severe than the voiced sentence and (b) the normalized voiceless sentence z score was within approximately 2 SDs (or less) of typical expectations provided the best estimates of diagnostic precision. Automated acoustic measures such as the CSID and creak provide useful information to objectively discriminate ADLD and pMTD.


Assuntos
Disfonia , Acústica da Fala , Humanos , Disfonia/diagnóstico , Disfonia/fisiopatologia , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Adulto , Diagnóstico Diferencial , Doenças da Laringe/diagnóstico , Doenças da Laringe/fisiopatologia , Índice de Gravidade de Doença , Distonia/diagnóstico , Distonia/fisiopatologia
3.
J Speech Lang Hear Res ; 67(6): 1712-1730, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38749007

RESUMO

PURPOSE: The goal of this study was to assess various recording methods, including combinations of high- versus low-cost microphones, recording interfaces, and smartphones in terms of their ability to produce commonly used time- and spectral-based voice measurements. METHOD: Twenty-four vowel samples representing a diversity of voice quality deviations and severities from a wide age range of male and female speakers were played via a head-and-thorax model and recorded using a high-cost, research standard GRAS 40AF (GRAS Sound & Vibration) microphone and amplification system. Additional recordings were made using various combinations of headset microphones (AKG C555 L [AKG Acoustics GmbH], Shure SM35-XLR [Shure Incorporated], AVID AE-36 [AVID Products, Inc.]) and audio interfaces (Focusrite Scarlett 2i2 [Focusrite Audio Engineering Ltd.] and PC, Focusrite and smartphone, smartphone via a TRRS adapter), as well as smartphones direct (Apple iPhone 13 Pro, Google Pixel 6) using their built-in microphones. The effect of background noise from four different room conditions was also evaluated. Vowel samples were analyzed for measures of fundamental frequency, perturbation, cepstral peak prominence, and spectral tilt (low vs. high spectral ratio). RESULTS: Results show that a wide variety of recording methods, including smartphones with and without a low-cost headset microphone, can effectively track the wide range of acoustic characteristics in a diverse set of typical and disordered voice samples. Although significant differences in acoustic measures of voice may be observed, the presence of extremely strong correlations (rs > .90) with the recording standard implies a strong linear relationship between the results of different methods that may be used to predict and adjust any observed differences in measurement results. CONCLUSION: Because handheld smartphone distance and positioning may be highly variable when used in actual clinical recording situations, smartphone + a low-cost headset microphone is recommended as an affordable recording method that controls mouth-to-microphone distance and positioning and allows both hands to be available for manipulation of the smartphone device.


Assuntos
Smartphone , Acústica da Fala , Humanos , Feminino , Masculino , Adulto , Adulto Jovem , Medida da Produção da Fala/instrumentação , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Qualidade da Voz , Pessoa de Meia-Idade , Adolescente
4.
J Voice ; 2023 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-37739864

RESUMO

OBJECTIVES: The purpose was to examine the influence of auditory vs visual vs combined audiovisual input on perception and production of one's own voice, using immersive virtual reality technology. METHODS: Thirty-one vocally healthy men and women were investigated under 18 sensory input conditions, using immersive virtual reality technology. Conditions included two auditory rooms with varying reverberation times, two visual rooms with varying volumes, and the combination of audiovisual conditions. All conditions were repeated with and without background noise. Speech tasks included counting, sustained vowel phonation, an all-voiced sentence from the Consensus Auditory-Perceptual Evaluation of Voice, and the first sentence from the Rainbow Passage, randomly ordered. Perception outcome measures were participants' self-reported perceptions of their vocal loudness, vocal effort, and vocal comfort in speech. Production outcome measures were sound pressure level (SPL) and spectral moments (spectral mean and standard deviation in Hz, skewness, and kurtosis). Statistical analyses used self-reported vocal effort, vocal loudness, and vocal comfort in percent (0 = "not at all," 100 = extremely), SPL in dB, and spectral moments in Hz. The reference level was a baseline audiovisual deprivation condition. RESULTS: Results suggested (i) increased self-perceived vocal loudness and effort, and decreased comfort, with increasing room volume, speaker-to-listener distance, audiovisual input, and background noise, and (ii) increased SPL and fluctuations in spectral moments across conditions. CONCLUSIONS: Not only auditory, but also visual and audiovisual input influenced voice perception and production in ways that have not been previously documented. Findings contribute to the basic science understanding the role of visual, audiovisual and auditory input in voice perception and production, and also to models of voice training and therapy. The findings also set the foundation for the use of virtual reality in voice and speech training, as a potentially power solution to the generalization problem.

5.
J Voice ; 2023 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-37019804

RESUMO

PURPOSE: The purpose of this study was to assess the relationship and comparability of cepstral and spectral measures of voice obtained from a high-cost "flat" microphone and precision sound level meter (SLM) vs. high-end and entry level models of commonly and currently used smartphones (iPhone i12 and iSE; Samsung s21 and s9 smartphones). Device comparisons were also conducted in different settings (sound-treated booth vs. typical "quiet" office room) and at different mouth-to-microphone distances (15 and 30 cm). METHODS: The SLM and smartphone devices were used to record a series of speech and vowel samples from a prerecorded diverse set of 24 speakers representing a wide range of sex, age, fundamental frequency (F0), and voice quality types. Recordings were analyzed for the following measures: smoothed cepstral peak prominence (CPP in dB); the low vs high spectral ratio (L/H Ratio in dB); and the Cepstral Spectral Index of Dysphonia (CSID). RESULTS: A strong device effect was observed for L/H Ratio (dB) in both vowel and sentence contexts and for CSID in the sentence context. In contrast, device had a weak effect on CPP (dB), regardless of context. Recording distance was observed to have a small-to-moderate effect on measures of CPP and CSID but had a negligible effect on L/H Ratio. With the exception of L/H Ratio in the vowel context, setting was observed to have a strong effect on all three measures. While these aforementioned effects resulted in significant differences between measures obtained with SLM vs. smartphone devices, the intercorrelations of the measurements were extremely strong (r's > 0.90), indicating that all devices were able to capture the range of voice characteristics represented in the voice sample corpus. Regression modeling showed that acoustic measurements obtained from smartphone recordings could be successfully converted to comparable measurements obtained by a "gold standard" (precision SLM recordings conducted in a sound-treated booth at 15 cm) with small degrees of error. CONCLUSIONS: These findings indicate that a variety of commonly available modern smartphones can be used to collect high quality voice recordings usable for informative acoustic analysis. While device, setting, and distance can have significant effects on acoustic measurements, these effects are predictable and can be accounted for using regression modeling.

6.
J Voice ; 2023 Mar 10.
Artigo em Inglês | MEDLINE | ID: mdl-36907680

RESUMO

OBJECTIVES: The primary aim was to compare two methods for perceptual evaluation of voice - paired comparison (PC) and visual analog scale (VAS) ratings. Secondary aims were to assess the correspondence between two dimensions of voice- overall severity of voice quality and resonant voice, and to investigate the influence of rater experience on perceptual rating scores and rating confidence scores. STUDY DESIGN: Experimental design. METHODS: Voice samples from six children (pre and post therapy) were rated by 15 Speech-Language Pathologists specialized in voice. Raters completed four tasks corresponding to the two rating methods and voice qualities: PC-severity, PC-resonance, VAS-severity, and VAS-resonance. For PC tasks, raters chose the better of two voice samples (better voice quality or better resonance, depending on the task) and indicated the degree of confidence in each choice. Rating and confidence score were combined to produce a number on a 1-10 scale (PC-confidence adjusted). VAS ratings involved rating voices on a scale for degree of severity and resonance, respectively. RESULTS: PC-confidence adjusted and VAS ratings were moderately correlated for overall severity and also vocal resonance. VAS ratings were normally distributed and had greater rater consistency than PC-confidence adjusted ratings. VAS scores reliably predicted binary PC choices (choice of voice sample only). Overall severity and vocal resonance were weakly correlated and rater experience was not linearly related to rating scores or confidence. CONCLUSIONS: Results suggest that the VAS rating method holds advantages over PC, including normally distributed ratings, superior consistency of ratings, and the ability to provide more finely grained detail regarding the auditory perception of voice. Overall severity and vocal resonance were not redundant in the current data set, suggesting that resonant voice and overall severity are not isomorphic. Finally, the number of years of clinical experience was not linearly related to perceptual ratings or rating confidence.

7.
J Voice ; 2023 Jan 30.
Artigo em Inglês | MEDLINE | ID: mdl-36725408

RESUMO

OBJECTIVES: The advantages of cepstral measurements in the evaluation of dysphonia have been noted in previous studies. However, there is an unclarity regarding the results of cepstral analyzes effect in determining the severity of dysphonia. The aims of this study were to determine the cut-off values of cepstral peak prominence, cepstral peak prominence standard deviation, low frequency/ high frequency ratio, low frequency/high frequency ratio standard deviation, and cepstral spectral index of dysphonia for predicting the voice severity within a Turkish speaking population, as well as to confirm the discriminative power of these cut-off values. MATERIALS METHODS: One hundred ninety-five individuals with voice disorders and an equal number of age and gender-matched individuals without voice disorders were included. Included subjects had visited the Hacettepe University Hospitals Speech and Language Therapy Department for voice evaluation between January 2017 and September 2021. The voice recordings from all participants included the six CAPE-V/Turkish sentences and sustained vowel /a/. Three raters provided auditory perceptual ratings of the voice samples using the GRBAS scale (grade) and overall severity for the CAPE-V/Turkish. Participants were categorized into normal and mild, moderate, and severely dysphonic groups based on the auditory perceptual evaluation. Analysis of Dysphonia in Speech and Voice (ADSV) software was used for cepstral spectral acoustic analysis. RESULTS: In the sustained vowel context, the area under the curve (ROC) for the CSID value was >0.8, except for mild vs. moderate dysphonia groups. In connected speech contexts, the ROC of the CPP value was also >0.8, except for normal vs. mild dysphonia groups. The cut-off values of CPP and CSID demonstrated high sensitivity and specificity for predicting voice severities. CONCLUSION: The cut-off values for the parameters that predicted voice severities showed a significant degree of discriminative power for categorizing voice severities among Turkish-speaking people.

8.
J Biomech Eng ; 144(11)2022 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-35579176

RESUMO

A vortex whistle produces a fundamental frequency proportional to the inlet flowrate. Recent investigations using vortex whistles have focused on the use of this relationship to quantify aspects of respiratory function. Despite promising results, there is a lack of understanding of the physical mechanisms underlying vortex whistle function. This paper begins with a principled study of the aero-acoustic properties of the vortex whistle. First, a high-fidelity computational fluid dynamics (CFD) simulation was developed to predict the unsteady flow field induced by the vortex whistle when the expiratory flow is applied. A computational aero-acoustic analysis (CAA) was applied to predict the acoustic response of the vortex whistle and to capture the frequency and level of the signature spectral peaks. The CFD is validated against prior experimental data on the vortex whistle. The CFD was used to: (a) determine the source of the vortex whistle harmonics and (b) investigate the effect of an outlet tube terminator, proposed by Awan and Awan (2020, "Use of a Vortex Whistle for Measures of Respiratory Capacity," J. Voice). The CFD and CAA indicated that the harmonics are generated by the cylindrical cavity of the vortex whistle, and the outlet terminator increases harmonic signal-to-noise ratio by increasing the pressure fluctuation within the cylindrical cavity. These results support the addition of the outlet tube terminator and provide insight into future design modifications that will enhance the reliability of the vortex whistle analyses and enable additional measures of respiratory capacity.


Assuntos
Hidrodinâmica , Vocalização Animal , Acústica , Animais , Simulação por Computador , Reprodutibilidade dos Testes , Vocalização Animal/fisiologia
9.
Am J Speech Lang Pathol ; 31(2): 959-973, 2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35050724

RESUMO

PURPOSE: The purpose of this study was to establish the frequency response of a selection of low-cost headset microphones that could be given to subjects for remote voice recordings and to examine the effect of microphone type and frequency response on key acoustic measures related to voice quality obtained from speech and vowel samples. METHOD: The frequency responses of three low-cost headset microphones were evaluated using pink noise generated via a head-and-torso model. Each of the headset microphones was then used to record a series of speech and vowel samples prerecorded from 24 speakers who represented a diversity of sex, age, fundamental frequency (F o), and voice quality types. Recordings were later analyzed for the following measures: smoothed cepstral peak prominence (CPP; dB), low versus high spectral ratio (L/H ratio; dB), CPP F o (Hz), and cepstral spectral index of dysphonia (CSID). RESULTS: The frequency response of the microphones under test was observed to have nonsignificant effects on measures of the CPP and CPP F o, significant effects on the CSID in speech contexts, and strong and significant effects on the measure of spectral tilt (L/H ratio). However, the correlations between the various headset microphones and a reference precision microphone were excellent (rs > .90). CONCLUSIONS: The headset microphones under test all showed the capability to track a wide range of diversity in the voice signal. Though the use of higher quality microphones that have demonstrated specifications is recommended for typical research and clinical purposes, low-cost electret microphones may be used to provide valid measures of voice, specifically when the same microphone and signal chain is used for the evaluation of pre- versus posttreatment change or intergroup comparisons.


Assuntos
Disfonia , Voz , Disfonia/diagnóstico , Humanos , Acústica da Fala , Medida da Produção da Fala , Voz/fisiologia , Qualidade da Voz
10.
J Voice ; 36(5): 630-636, 2022 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-32917456

RESUMO

OBJECTIVES: A vortex whistle produces a tone which has a frequency proportional to the inlet air flow rate. The objectives of this study were to replicate previous studies demonstrating the use of a vortex whistle as an accurate flow meter, and to assess the degree of relationship between measures of vital capacity (VC) obtained using low-cost methods (vortex whistle and hand-held spirometry) vs. pneumotach-based spirometry. METHODOLOGY: A vortex whistle was designed using 3D modeling software and manufactured using a 3D printer with non-toxic, biodegradable polylactic acid (PLA). The digitized vortex whistle tone was analyzed using custom software to produce a frequency vs. time contour. As the frequency is proportional to flow, the integral of this curve corresponds to the overall volume by a linear relationship. The accuracy of vortex whistle volume estimates were assessed using (1) controlled flow rates from a consistent volume calibration syringe, and (2) with 66 subjects between the ages of 18-30 yrs. in comparison to hand-held spirometry and two pneumotach systems. RESULTS: Observations from the calibration syringe experiment confirmed that the vortex whistle and software are able to effectively track the flow rate, with a correlation coefficient between the average flow and the average frequency of r2 = 0.9965. Results from the human VC testing showed that measures obtained using both vortex whistles and hand-held spirometry correlated very strongly (r > 0.94) with computerized pneumotach systems, and the strength of correlations obtained via vortex whistles vs. hand-held spirometry were highly comparable. DISCUSSION & CONCLUSIONS: When coupled with the analysis software described herein, valid and reliable frequency/flow curves and volume estimates may be obtained using a vortex whistle that are highly comparable to the hand-held spirometer. The use of the vortex whistle has the potential to bring measures of basic respiratory function to clinicians and patients alike at a fraction of the cost of currently used spirometric methods.


Assuntos
Respiração , Sistema Respiratório , Adolescente , Adulto , Humanos , Espirometria , Volume de Ventilação Pulmonar , Capacidade Vital , Adulto Jovem
11.
J Voice ; 2021 Nov 27.
Artigo em Inglês | MEDLINE | ID: mdl-34848103

RESUMO

OBJECTIVE: A limitation of traditional semi-occluded vocal tract exercises (SOVTE) is the single sustained vowel task that precludes co-articulated singing. This study investigated immediate effects of a variably occluded face mask (VOFM) on vocal effort, acoustic, and aerodynamic measures in sung low and high pitches of healthy singers. DESIGN: Single-group, pre-post intervention study. METHODS: The outlet ports of disposable anesthesia facemasks were fitted with plastic caps with two drilled openings sizes (9.6 mm, 6.4 mm). Twenty-three singers with no voice complaints provided baseline vocal effort, acoustic, and aerodynamic measures in high and low pitches. Participants trained in four conditions: two VOFM sizes (9.6 mm, 6.4 mm) in combination with the 20th and 80th percentile of the singer's pitch range. Participants were trained on three phonatory tasks: repeated consonant/vowel syllables, sung sentence, and sustained vowel. Vocal effort before and after training was compared using a visual-analog scale, while standardized mean differences captured acoustic and aerodynamic changes before and after training. RESULTS: Participants reported decreased vocal effort after VOFM training at all occlusion and pitch combinations. On average, consistent beneficial effect sizes were found in cepstral peak prominence (CPP) and cepstral spectral index of dysphonia (CSID) for all 4 occlusion-pitch combinations, and vocal intensity and mean estimated subglottal pressure increased for all 4 occlusion-pitch training combinations. Changes in mean phonatory airflow and resistance were less consistent. DISCUSSION: There was an immediate effect of decreased vocal effort in singing after VOFM training. Acoustic and aerodynamic effects were variable and modest. Future studies should explore changes in these outcomes after VOFM in singing voice therapy.

12.
J Voice ; 2021 Nov 05.
Artigo em Inglês | MEDLINE | ID: mdl-34750034

RESUMO

INTRODUCTION: Voice disorders are a common communication disorder in children. Behavioral voice therapy is recommended by both Otolaryngologists and Speech-Language Pathologists as a first-line approach for treatment of benign vocal fold lesions that affect children in large numbers. However, the role of cognitive mechanisms critical to voice therapy have not yet been explored. OBJECTIVE: This proof-of-concept study aims to provide preliminary data on the potential relation between cognitive abilities and behavioral results of voice therapy for children with benign phonotraumatic vocal fold lesions. METHODS: Six children (4;05 -9;02 years) diagnosed with vocal fold nodules completed a battery of cognitive tests from the NEuroPSYchological (NEPSY-II) Assessment and a standard course of "Adventures in Voice" therapy. Recordings pre and post intervention were analyzed acoustically using the Cepstral Spectral Index of Dysphonia (CSID) and perceptually using visual analog scales (VAS) for severity and resonance separately. Raw and age-corrected scaled scores from the NEPSY-II were then examined for their possible relation to voice outcomes. RESULTS: Multiple cognitive functions correlated with voice outcomes. Raw score measures for Design Fluency, Inhibition, Fingertip Tapping, and Narrative Memory correlated favorably with all voice outcome measures. Age correlated with all NEPSY-II raw scores and perceptual voice outcome measures. Scaled scores for Auditory Attention, Design Fluency, Fingertip Tapping (Dominant hand), and Narrative Memory correlated with all voice outcome measures. CONCLUSION: Results suggest that there is merit to further investigation of the relation between cognitive skills and their development and voice treatment outcomes in children with benign phonotraumatic lesions. Future studies with larger samples will build on present findings.

13.
J Voice ; 34(1): 9-19, 2020 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-30391019

RESUMO

OBJECTIVES: The objective of this study was to investigate the ability of a two-stage method of cepstral peak identification to effectively discriminate rough vs breathy vs typical voice in sustained vowel productions. It was hypothesized that a dual-stage search for cepstral peak prominences (CPP's) above and below specified quefrency/F0 cutoffs would result in a CPP difference that would be characteristic of the rough, diplophonic voice type. METHODOLOGY: Central one-second portions of sustained vowel /a/ productions were obtained from 90 subjects (rough, breathy, and normophonic voices). All voice samples were analyzed using a a two-stage cepstral analysis process in which a CPPHigh-Low difference value was obtained by identifying cepstral peaks above and below a lower limit for expected F0 (150 Hz for females and 90 Hz for males), called CPPHigh and CPPLow respectively. RESULTS: The CPPHigh-Low difference value was observed to be a highly significant predictor, with negative values for this parameter characteristic of a dominant subharmonic in the voice signal and the perception of diplophonic, rough voice. Correct classification of rough vs nonrough voice samples was 82.2% (sensitivity 0.80 and specificity 0.833). In the consideration of three group classification (breathy vs. normophonic vs. rough), models incorporating two predictors (the CPP obtained from a single search through a 60 to 300 Hz frequency range (CPPDefault) and the CPPHigh-Low difference value) correctly classified 78.88% of the voice samples. CONCLUSIONS: Rough, diplophonic voices were consistently observed to have a subharmonic peak that was greater in amplitude than the cepstral peak obtained within the region of the expected F0, resulting in a negative value for the CPPHigh-Low difference. The two-stage cepstral analysis process described herein is visually intuitive from the graphical display of a cepstrum and is a simple extended calculation derived from cepstral analysis procedures that have been recommended as essential in the acoustic description of vocal quality.


Assuntos
Acústica , Processamento de Sinais Assistido por Computador , Acústica da Fala , Medida da Produção da Fala , Qualidade da Voz , Adulto , Idoso , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Reconhecimento Automatizado de Padrão , Espectrografia do Som , Adulto Jovem
14.
Folia Phoniatr Logop ; 71(5-6): 275-285, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31117110

RESUMO

PURPOSE: This article examines cepstral/spectral analyses of sustained /α/ vowels produced by speakers with hypokinetic dysarthria secondary to idiopathic Parkinson's disease (PD) before and after Lee Silverman Voice Treatment (LSVT®LOUD) and the relationship of these measures with overall voice intensity. METHODOLOGY: Nine speakers with PD were examined in a pre-/post-treatment design, with multiple daily audio recordings before and after treatment. Sustained vowels were analyzed for cepstral peak prominence (CPP), CPP standard deviation (CPP SD), low/high spectral ratio (L/H SR), and Cepstral/Spectral Index of Dysphonia (CSID) using the KAYPENTAX computer software. RESULTS: CPP and CPP SD increased significantly and CSID decreased significantly from pre- to post-treatment recordings, with strong effect sizes. Increased CPP indicates increased dominance of harmonics in the spectrum following LSVT. After restricting the frequency cutoff to the region just above the first formant and second formant and below the third formant, L/H SR was observed to decrease significantly following treatment. Correlation analyses demonstrated that CPP was more strongly associated with CSID before treatment than after. CONCLUSION: In addition to increased vocal intensity following LSVT, speakers with PD exhibited both improved harmonic structure and voice quality as reflected by cepstral/spectral analysis, indicating that there was improved harmonic structure and reduced dysphonia following treatment.


Assuntos
Disartria/terapia , Doença de Parkinson/terapia , Fonação , Espectrografia do Som , Distúrbios da Voz/terapia , Qualidade da Voz , Treinamento da Voz , Idoso , Idoso de 80 Anos ou mais , Disartria/diagnóstico , Feminino , Seguimentos , Humanos , Masculino , Pessoa de Meia-Idade , Doença de Parkinson/diagnóstico , Acústica da Fala , Distúrbios da Voz/diagnóstico
15.
J Voice ; 33(5): 809.e1-809.e10, 2019 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-30082107

RESUMO

OBJECTIVE: This study reports on experiments designed to examine the effects of a variably occluded face mask (VOFM) on (a) estimated subglottal pressure (Psub) in normophonic participants and (b) aerodynamic and acoustic characteristics of voice in dysphonic participants. DESIGN: A prospective design was used for experiment 1, and a prospective, randomized cohort design was used for experiment 2. METHODS: The outlet ports of disposable anesthesia face masks were fitted with plastic caps with variable diameter drilled openings (9.6, 6.4, 3.2, and 1.6 mm) to create a series of mask openings. In experiment 1, Psub was measured in normophonic participants using the face mask during syllable repetitions in unoccluded and variable diameter opening conditions. In experiment 2, aerodynamic and acoustic measures were obtained in a group of dysphonic speakers before and after syllable and speech repetition tasks using the VOFM. RESULTS: In experiment 1, mean PSub was observed to decrease while using the VOFM in all occlusion conditions versus nonocclusion, with a significant reduction in Psub observed between the baseline and the 6.4 mm condition. In experiment 2, standardized mean differences showed that many dysphonic participants produced reduced Psub, increased airflow, and improved acoustic measures after the use of the VOFM in at least one occlusion condition. CONCLUSIONS: Beneficial changes in both aerodynamic and acoustic characteristics of voice may be obtained in dysphonic speakers using a VOFM. By moving the place of occlusion outside of the oral cavity, therapeutic stimuli options may be extended beyond vowel and humming elicitations to syllable and speech contexts and assist with generalization of voice therapy targets to conversational speech.


Assuntos
Disfonia/terapia , Máscaras , Fonação , Acústica da Fala , Qualidade da Voz , Treinamento da Voz , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Disfonia/diagnóstico , Disfonia/fisiopatologia , Desenho de Equipamento , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Prospectivos , Recuperação de Função Fisiológica , Fatores de Tempo , Resultado do Tratamento , Adulto Jovem
16.
Am J Speech Lang Pathol ; 27(3): 887-905, 2018 08 06.
Artigo em Inglês | MEDLINE | ID: mdl-29955816

RESUMO

Purpose: The aim of this study was to recommend protocols for instrumental assessment of voice production in the areas of laryngeal endoscopic imaging, acoustic analyses, and aerodynamic procedures, which will (a) improve the evidence for voice assessment measures, (b) enable valid comparisons of assessment results within and across clients and facilities, and (c) facilitate the evaluation of treatment efficacy. Method: Existing evidence was combined with expert consensus in areas with a lack of evidence. In addition, a survey of clinicians and a peer review of an initial version of the protocol via VoiceServe and the American Speech-Language-Hearing Association's Special Interest Group 3 (Voice and Voice Disorders) Community were used to create the recommendations for the final protocols. Results: The protocols include recommendations regarding technical specifications for data acquisition, voice and speech tasks, analysis methods, and reporting of results for instrumental evaluation of voice production in the areas of laryngeal endoscopic imaging, acoustics, and aerodynamics. Conclusion: The recommended protocols for instrumental assessment of voice using laryngeal endoscopic imaging, acoustic, and aerodynamic methods will enable clinicians and researchers to collect a uniform set of valid and reliable measures that can be compared across assessments, clients, and facilities.


Assuntos
Acústica da Fala , Patologia da Fala e Linguagem/normas , Prega Vocal/fisiopatologia , Distúrbios da Voz/diagnóstico , Qualidade da Voz , Acústica , Fenômenos Biomecânicos , Consenso , Humanos , Laringoscopia/normas , Variações Dependentes do Observador , Valor Preditivo dos Testes , Reprodutibilidade dos Testes , Medida da Produção da Fala/normas , Patologia da Fala e Linguagem/métodos , Estroboscopia/normas , Distúrbios da Voz/fisiopatologia , Distúrbios da Voz/terapia
17.
J Voice ; 32(2): 169-176, 2018 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-28688672

RESUMO

OBJECTIVE: The objective, instrumental acoustic measure of Cepstral Spectral Index of Dysphonia (CSID) correlates with audio-perceptual measures, is sensitive in detecting voice abnormalities, and tracks change following treatment. The goals of the current study were to (1) test the agreement between CSID versus auditory perceptual measures of pre- versus posttreatment voice change, and (2) investigate whether change in scores is based on voice disorder or phonemic structure of sentence stimuli. METHODS: Forty patients with benign voice disorders produced sentences and a sustained /a/ vowel from the Consensus Auditory Perceptual Evaluation of Voice protocol before and after treatment. CSID was calculated, and overall audio-perceptual voice severity was judged by 7 blinded, trained raters using a 100-mm visual analog scale. Differences between CSID and audio-perceptual measures of voice change across voice disorder and stimuli, and correlation between change in CSID and perceptual rating scores were assessed pre- and postintervention. RESULTS: Across all subjects, there were significant correlations between CSID and perceptual ratings change scores (P < 0.001), and no significant differences in pre- and posttreatment change. Disorder-specific analyses indicated that all tested sentence/vowel contexts are effective measures for pre- versus posttreatment change in atrophy and paralysis cases. Acoustic versus perceptual measures of voice change were significantly correlated in lesion cases for the sentence "How hard did he hit him" and with all sentences combined. There were no significant findings observed for muscle tension dysphonia. CONCLUSION: CSID provides an accurate objective correlate to auditory-perceptual posttreatment change in overall voice severity ratings. Implications for outcomes testing and disorder-specific findings are discussed.


Assuntos
Acústica , Acústica da Fala , Percepção da Fala , Medida da Produção da Fala/métodos , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/terapia , Qualidade da Voz , Humanos , Julgamento , Valor Preditivo dos Testes , Estudos Retrospectivos , Índice de Gravidade de Doença , Resultado do Tratamento , Distúrbios da Voz/fisiopatologia , Distúrbios da Voz/psicologia
19.
J Voice ; 31(2): 243.e9-243.e19, 2017 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-27501923

RESUMO

BACKGROUND: Friedreich Ataxia (FRDA) is the most common hereditary ataxia, with dysarthria as one of its key clinical signs. OBJECTIVE: To describe the voice profile of individuals with FRDA to inform outcome marker development and goals of speech therapy. METHODS: Thirty-six individuals with FRDA and 30 age-matched controls provided sustained vowel and connected speech samples. Speech and voice samples were analyzed acoustically using the Analysis of Dysphonia in Speech and Voice program and perceptually using the Consensus Auditory-Perceptual Evaluation of Voice form. Correlations between dysphonia and overall dysarthria severity, demographic, clinical, and genetic information were explored. RESULTS: Individuals with FRDA presented with mild dysphonia characterized by hoarseness (combined roughness and breathiness), increased strain, and altered pitch variability (increased in vowel productions; slightly decreased on reading samples). Acoustically, individuals with FRDA had significantly higher scores on the Cepstral Spectral Index of Dysphonia during vowel production. A combination of perceptual and acoustic measures of dysphonia used in this study was quite effective in categorizing the FRDA versus control participants, with >80% overall accuracy. CONCLUSIONS: Although dysphonia severity in FRDA did not correlate significantly with overall disease severity, speaking rate and syllabic duration significantly correlated with age at disease onset and disease duration, and also have an effect on listener perception of dysphonia. The relationship between dysphonia and dysarthria in FRDA suggests that reducing overall dysphonia severity via therapeutic techniques that improve phonatory stability and increase speaking rate is a viable target for speech therapy.


Assuntos
Disfonia/terapia , Ataxia de Friedreich/complicações , Fonação , Acústica da Fala , Fonoterapia/métodos , Qualidade da Voz , Treinamento da Voz , Acústica , Adulto , Estudos de Casos e Controles , Disfonia/diagnóstico , Disfonia/etiologia , Disfonia/fisiopatologia , Feminino , Ataxia de Friedreich/diagnóstico , Ataxia de Friedreich/genética , Ataxia de Friedreich/fisiopatologia , Predisposição Genética para Doença , Humanos , Julgamento , Masculino , Pessoa de Meia-Idade , Variações Dependentes do Observador , Fenótipo , Valor Preditivo dos Testes , Recuperação de Função Fisiológica , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Processamento de Sinais Assistido por Computador , Espectrografia do Som , Inteligibilidade da Fala , Percepção da Fala , Medida da Produção da Fala , Fatores de Tempo , Resultado do Tratamento , Adulto Jovem
20.
J Voice ; 31(2): 218-228, 2017 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-27241579

RESUMO

OBJECTIVES: Intermittent dysphonia within an utterance is common, but presents difficulty for both perceptual and objective voice evaluation. This study examined the ability of measures from the within-sample cepstral peak prominence (CPP) distribution to differentiate normal voices from intermittently and consistently dysphonic voices. STUDY DESIGN: Exploratory design. METHODS: Sixty samples of the sentence "We were away a year ago" were classified as normal, intermittently dysphonic, or consistently dysphonic by four judges. Measures of CPP within each sample were obtained, and further analysis with examined CPP distribution variability and patterns of CPP outliers. RESULTS: Whereas the mean CPP was the strongest single discriminator among the three voice types, the normal and intermittent dysphonia groups were not significantly different on CPP distribution skewness and measures of CPP distribution outliers. Both the normal and intermittently dysphonic voices differed significantly from the consistently dysphonic samples on these variables. A combination of measures of the CPP distribution was effective for a linear prediction of percent dysphonia duration for the speech samples (r = 0.825; rho = 0.81). The CPP standard deviation significantly improved the use of the mean CPP in discriminant function analyses and also the classification of the intermittently dysphonic voices. CONCLUSIONS: Auditory-perceptual judgment of dysphonic segments and the typically robust acoustic measurement of mean CPP are both ineffective for classifying intermittently dysphonic voices. However, dysphonia duration may be effectively predicted via measures of the CPP distribution, and acoustic classification of dysphonic voice types via cepstral methods may be improved with an analysis of the CPP distribution across an utterance.


Assuntos
Acústica , Disfonia/diagnóstico , Acústica da Fala , Percepção da Fala , Medida da Produção da Fala , Qualidade da Voz , Análise de Variância , Estudos de Casos e Controles , Análise Discriminante , Disfonia/fisiopatologia , Disfonia/psicologia , Feminino , Humanos , Julgamento , Modelos Lineares , Masculino , Variações Dependentes do Observador , Valor Preditivo dos Testes , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Processamento de Sinais Assistido por Computador , Fatores de Tempo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA