Pesquisa | Biblioteca Virtual em Saúde

1.

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.

Ghasemzadeh, Hamzeh; Hillman, Robert E; Mehta, Daryush D.

J Speech Lang Hear Res ; 67(3): 753-781, 2024 Mar 11.

Artigo em Inglês | MEDLINE | ID: mdl-38386017

RESUMO

PURPOSE: Many studies using machine learning (ML) in speech, language, and hearing sciences rely upon cross-validations with single data splitting. This study's first purpose is to provide quantitative evidence that would incentivize researchers to instead use the more robust data splitting method of nested k-fold cross-validation. The second purpose is to present methods and MATLAB code to perform power analysis for ML-based analysis during the design of a study. METHOD: First, the significant impact of different cross-validations on ML outcomes was demonstrated using real-world clinical data. Then, Monte Carlo simulations were used to quantify the interactions among the employed cross-validation method, the discriminative power of features, the dimensionality of the feature space, the dimensionality of the model, and the sample size. Four different cross-validation methods (single holdout, 10-fold, train-validation-test, and nested 10-fold) were compared based on the statistical power and confidence of the resulting ML models. Distributions of the null and alternative hypotheses were used to determine the minimum required sample size for obtaining a statistically significant outcome (5% significance) with 80% power. Statistical confidence of the model was defined as the probability of correct features being selected for inclusion in the final model. RESULTS: ML models generated based on the single holdout method had very low statistical power and confidence, leading to overestimation of classification accuracy. Conversely, the nested 10-fold cross-validation method resulted in the highest statistical confidence and power while also providing an unbiased estimate of accuracy. The required sample size using the single holdout method could be 50% higher than what would be needed if nested k-fold cross-validation were used. Statistical confidence in the model based on nested k-fold cross-validation was as much as four times higher than the confidence obtained with the single holdout-based model. A computational model, MATLAB code, and lookup tables are provided to assist researchers with estimating the minimum sample size needed during study design. CONCLUSION: The adoption of nested k-fold cross-validation is critical for unbiased and robust ML studies in the speech, language, and hearing sciences. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25237045.

Assuntos

Aprendizado de Máquina , Fala , Humanos , Tamanho da Amostra , Idioma , Audição

2.

Detecting Mild Phonotrauma in Daily Life.

Van Stan, Jarrad H; Burns, James; Hron, Tiffiny; Zeitels, Steven; Panuganti, Bharat A; Purnell, Phillip R; Mehta, Daryush D; Hillman, Robert E; Ghasemzadeh, Hamzeh.

Laryngoscope ; 133(11): 3094-3099, 2023 11.

Artigo em Inglês | MEDLINE | ID: mdl-37194664

RESUMO

OBJECTIVE: The aim of this study was to gain quantitative insights into the role of daily voice use associated with mild phonotrauma via the Daily Phonotrauma Index (DPI), a measure derived from neck-surface acceleration magnitude (NSAM) and difference between the first two harmonic magnitudes (H1 - H2). METHODS: An ambulatory voice monitor recorded weeklong voice use for 151 female patients with phonotraumatic vocal hyperfunction (PVH) and 181 female vocally healthy controls. Three laryngologists rated phonotrauma severity from each patient's laryngoscopy. Mixed generalized linear models evaluated the accuracy, sensitivity, and specificity of the original DPI trained on all patients versus a mild DPI version trained on only patients rated with mild phonotrauma. Individual contribution of NSAM and H1 - H2 to each DPI model was also evaluated. RESULTS: Reliability across the laryngologists' phonotrauma ratings was moderate (Fleiss κ = 0.41). There were 70, 69, and 12 patients with mild, moderate, and severe phonotrauma, respectively. The mild DPI, compared to the original DPI, correctly classified more patients with mild phonotrauma (Cohen's d = 0.9) and less controls (d = -0.9) and did not change in overall accuracy. H1 - H2 contributed less to mild phonotrauma classification than NSAM for mild DPI. CONCLUSIONS: Compared with the original DPI, the mild DPI exhibited higher sensitivity to mild phonotrauma and lower specificity to controls, but the same overall classification accuracy. These results support the mild DPI as a promising detector of early phonotrauma and that NSAM may be associated with early phonotrauma, and H1 - H2 may be a biomarker associated with vocal fold vibration in the presence of lesions. LEVEL OF EVIDENCE: Level 4, case-control study Laryngoscope, 133:3094-3099, 2023.

Assuntos

Distúrbios da Voz , Voz , Humanos , Feminino , Estudos de Casos e Controles , Reprodutibilidade dos Testes , Prega Vocal/patologia

3.

Acoustic Voice and Speech Biomarkers of Treatment Status during Hospitalization for Acute Decompensated Heart Failure.

Murton, Olivia M; Dec, G William; Hillman, Robert E; Majmudar, Maulik D; Steiner, Johannes; Guttag, John V; Mehta, Daryush D.

Appl Sci (Basel) ; 13(3)2023 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-37064434

RESUMO

This study investigates acoustic voice and speech features as biomarkers for acute decompensated heart failure (ADHF), a serious escalation of heart failure symptoms including breathlessness and fatigue. ADHF-related systemic fluid accumulation in the lungs and laryngeal tissues is hypothesized to affect phonation and respiration for speech. A set of daily spoken recordings from 52 patients undergoing inpatient ADHF treatment was analyzed to identify voice and speech biomarkers for ADHF and to examine the trajectory of biomarkers during treatment. Results indicated that speakers produce more stable phonation, a more creaky voice, faster speech rates, and longer phrases after ADHF treatment compared to their pre-treatment voices. This project builds on work to develop a method of monitoring ADHF using speech biomarkers and presents a more detailed understanding of relevant voice and speech features.

4.

Acoustic-Aerodynamic Voice Outcome Ratios Identify Changes in Vocal Function Following Vocal Fold Medialization for Unilateral Vocal Fold Paralysis.

Bullock, Latané; Toles, Laura E; Hillman, Robert E; Mehta, Daryush D.

J Voice ; 2023 Apr 15.

Artigo em Inglês | MEDLINE | ID: mdl-37068982

RESUMO

PURPOSE: This study aimed to determine whether ratio-based measures that combine acoustic (output) and aerodynamic (input) parameters detect postoperative change in vocal function following vocal fold medialization for unilateral vocal fold paralysis. METHOD: Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 149 patients who underwent vocal fold medialization for unilateral vocal fold paralysis. A 2 × 2 repeated-measures analysis of variance was conducted for each of four acoustic-aerodynamic ratios-traditional vocal efficiency (VE), sound pressure level to aerodynamic power (SPL/AP), SPL to average airflow (SPL/AFLOW), and SPL to subglottal pressure (SPL/Ps)-to investigate the main effects and interaction of treatment stage and loudness level (comfortable and loud). RESULTS: The patient group showed significant postoperative improvements in self reports of vocal function (voice-related quality of life) and clinical auditory-perceptual judgments of dysphonia (consensus auditory-perceptual evaluation of voice). Main effects for both treatment stage and loudness level were statistically significant for all measures except SPL/Ps. There were interaction effects for VE and SPL/AP, suggesting that magnitude of the treatment effect differs based on loudness. SPL/AFLOW had medium-to-large effect sizes in both loudness conditions. There were postoperative changes in SPL/Ps that were dependent on the magnitude of the reduction in AFLOW; as expected, SPL/Ps increased postoperatively in a subgroup that had large postoperative reductions in AFLOW at the comfortable loudness level. CONCLUSIONS: Acoustic-aerodynamic ratios can aid in tracking changes in vocal function following vocal fold medialization. SPL/AFLOW exhibited the largest effect size, which is expected since a reduction in abnormally high AFLOW typically accompanies the increased modulation of glottal air flow associated with successful vocal fold medialization. Future study is needed to model physiological changes in acoustic-aerodynamic voice outcome ratios across different types of voice disorders.

5.

Effect of nodule size and stiffness on phonation threshold and collision pressures in a synthetic hemilaryngeal vocal fold model.

Motie-Shirazi, Mohsen; Zañartu, Matías; Peterson, Sean D; Mehta, Daryush D; Hillman, Robert E; Erath, Byron D.

J Acoust Soc Am ; 153(1): 654, 2023 01.

Artigo em Inglês | MEDLINE | ID: mdl-36732229

RESUMO

Synthetic vocal fold (VF) replicas were used to explore the role of nodule size and stiffness on kinematic, aerodynamic, and acoustic measures of voiced speech production. Emphasis was placed on determining how changes in collision pressure may contribute to the development of phonotrauma. This was performed by adding spherical beads with different sizes and moduli of elasticity at the middle of the medial surface of synthetic silicone VF models, representing nodules of varying size and stiffness. The VF models were incorporated into a hemilaryngeal flow facility. For each case, self-sustained oscillations were investigated at the phonation threshold pressure. It was found that increasing the nodule diameter increased the open quotient, phonation threshold pressure, and phonation threshold flow rate. However, these values did not change considerably as a function of the modulus of elasticity of the nodule. Nevertheless, the ratio of collision pressure to subglottal pressure increased significantly for both increasing nodule size and stiffness. This suggests that over time, both growth in size and fibrosis of nodules will lead to an increasing cycle of compensatory vocal hyperfunction that accelerates phonotrauma.

Assuntos

Vibração , Prega Vocal , Pressão , Fonação , Elasticidade

6.

Quantifying the Occupational Voice Use of Teachers.

Eastman, Sarah E; Ghasemzadeh, Hamzeh; Van Stan, Jarrad; Hillman, Robert E; Mehta, Daryush D.

Perspect ASHA Spec Interest Groups ; 8(6): 1363-1379, 2023 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-38312372

RESUMO

Purpose: The teaching profession is a high-voice use occupation at elevated risk for developing voice disorders. Continued research on teachers' vocal demands is necessary to advocate for and establish vocal health programs. This study quantified ambulatory vocal dose measures for teachers during both on- and off-work periods, comparing their occupational voice use to that in other studies that have reported percent phonation ranging from 17% to 30%. Method: Participants included 26 full-time, female school teachers between 23 and 55 years of age across multiple grades and subjects, including individuals with and without a voice disorder. Ambulatory voice data were collected from weeklong voice monitoring that recorded phonatory activity through anterior neck-surface vibration. Three vocal dose measures-time, cycle, and distance doses-were computed for each participant for three time periods: on-work weekdays, off-work weekdays, and off-work weekend days. Results: The teachers' average percent phonation was 16.2% on-work weekdays, 8.4% off-work weekdays, and 8.0% off-work weekend days. No statistically significant differences for vocal dose measures were found between off-work weekdays and weekend days. Overall, all vocal dose measures were approximately 2 times higher during work relative to off-work time periods. Conclusions: This study provides values for vocal dose measures for school teachers using ambulatory voice-monitoring technology. The vocal demands of this particular teacher sample and voice activity detection algorithm are potential factors contributing to percent phonation values on the lower end of the range reported in the literature. Future work is needed to continue to understand occupational voice use and its associated risks related to voice health, with the ultimate goal of preventing and managing voice disorders in individuals engaged in high-risk occupations.

7.

Kalman Filter Implementation of Subglottal Impedance-Based Inverse Filtering to Estimate Glottal Airflow during Phonation.

Cortés, Juan P; Alzamendi, Gabriel A; Weinstein, Alejandro J; Yuz, Juan I; Espinoza, Víctor M; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

Appl Sci (Basel) ; 12(1)2022 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-36313121

RESUMO

Subglottal Impedance-Based Inverse Filtering (IBIF) allows for the continuous, non-invasive estimation of glottal airflow from a surface accelerometer placed over the anterior neck skin below the larynx. It has been shown to be advantageous for the ambulatory monitoring of vocal function, specifically in the use of high-order statistics to understand long-term vocal behavior. However, during long-term ambulatory recordings over several days, conditions may drift from the laboratory environment where the IBIF parameters were initially estimated due to sensor positioning, skin attachment, or temperature, among other factors. Observation uncertainties and model mismatch may result in significant deviations in the glottal airflow estimates; unfortunately, they are very difficult to quantify in ambulatory conditions due to a lack of a reference signal. To address this issue, we propose a Kalman filter implementation of the IBIF filter, which allows for both estimating the model uncertainty and adapting the airflow estimates to correct for signal deviations. One-way analysis of variance (ANOVA) results from laboratory experiments using the Rainbow Passage indicate an improvement using the modified Kalman filter on amplitude-based measures for phonotraumatic vocal hyperfunction (PVH) subjects compared to the standard IBIF; the latter showing a statistically difference (p-value = 0.02, F = 4.1) with respect to a reference glottal volume velocity signal estimated from a single notch filter used here as ground-truth in this work. In contrast, maximum flow declination rates from subjects with vocal phonotrauma exhibit a small but statistically difference between the ground-truth signal and the modified Kalman filter when using one-way ANOVA (p-value = 0.04, F = 3.3). Other measures did not have significant differences with either the modified Kalman filter or IBIF compared to ground-truth, with the exception of H1-H2, whose performance deteriorates for both methods. Overall, both methods (modified Kalman filter and IBIF) show similar glottal airflow measures, with the advantage of the modified Kalman filter to improve amplitude estimation. Moreover, Kalman filter deviations from the IBIF output airflow might suggest a better representation of some fine details in the ground-truth glottal airflow signal. Other applications may take more advantage from the adaptation offered by the modified Kalman filter implementation.

8.

Collision Pressure and Dissipated Power Dose in a Self-Oscillating Silicone Vocal Fold Model With a Posterior Glottal Opening.

Motie-Shirazi, Mohsen; Zañartu, Matías; Peterson, Sean D; Mehta, Daryush D; Hillman, Robert E; Erath, Byron D.

J Speech Lang Hear Res ; 65(8): 2829-2845, 2022 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-35914018

RESUMO

PURPOSE: The goal of this study was to experimentally evaluate how compensating for the adverse acoustic effects of a posterior glottal opening (PGO) by increasing subglottal pressure and changing supraglottal compression, as have been associated with vocal hyperfunction, influences the risk of vocal fold (VF) trauma. METHOD: A self-oscillating synthetic silicone model of the VFs with an airflow bypass that modeled a PGO was investigated in a hemilaryngeal flow facility. The influence of compensatory mechanisms on collision pressure and dissipated collision power was investigated for different PGO areas and supraglottal compression. Compensatory behaviors were mimicked by increasing the subglottal pressure to achieve a target sound pressure level (SPL). RESULTS: Increasing the subglottal pressure to compensate for decreased SPL due to a PGO produced higher values for both collision pressure and dissipated collision power. Whereas a 10-mm2 PGO area produced a 12% increase in the peak collision pressure, the dissipated collision power increased by 122%, mainly due to an increase in the magnitude of the collision velocity. This suggests that the value of peak collision pressure may not fully capture the mechanisms by which phonotrauma occurs. It was also found that an optimal value of supraglottal compression exists that maximizes the radiated SPL, indicating the potential utility of supraglottal compression as a compensatory mechanism. CONCLUSIONS: Larger PGO areas are expected to increase the risk of phonotrauma due to the concomitant increase in dissipated collision power associated with maintaining SPL. Furthermore, the risk of VF damage may not be fully characterized by only the peak collision pressure.

Assuntos

Doenças da Laringe , Laringe , Glote , Humanos , Fonação , Silicones , Prega Vocal

9.

Clinical Utility of the Ratio of Sound Pressure Level to Subglottal Pressure in Patients Surgically Treated for Phonotraumatic Vocal Fold Lesions.

Toles, Laura E; Seidman, Ariana Y; Hillman, Robert E; Mehta, Daryush D.

J Speech Lang Hear Res ; 65(8): 2778-2788, 2022 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-35914023

RESUMO

PURPOSE: This study aimed to determine whether a simplified, and potentially more stable, acoustic-aerodynamic voice outcome ratio (ratio of sound pressure level [SPL] to subglottal pressure) is comparable to a traditional vocal efficiency measure (ratio of acoustic power to the product of average subglottal pressure and average phonatory airflow) in terms of the ability to detect change in vocal function following surgical removal of bilateral phonotraumatic lesions. METHOD: Pre- and postoperative acoustic and aerodynamic measures were analyzed retrospectively from 75 female patients who underwent surgical removal of bilateral phonotraumatic lesions. A 2 × 2 repeated-measures analysis of variance was conducted for each of three acoustic-aerodynamic voice outcome ratios-traditional vocal efficiency, an SPL-based ratio with both airflow and subglottal pressure, and a simplified SPL-based ratio with subglottal pressure only-to investigate the main effects of treatment stage (pre- and postsurgery), loudness condition (comfortable and loud), and their interaction. Post hoc paired samples t tests were conducted for statistically significant interactions. The within-subject variability of the measures was assessed using the coefficient of variation. RESULTS: Although exhibiting an expected main effect of loudness (higher values in the loud condition), the traditional vocal efficiency ratio did not exhibit a main effect of treatment. For both SPL-based ratios, there were significant main effects of treatment stage (higher values postoperatively) and loudness condition (lower values in the loud condition). Within-subject, postoperative changes in the two SPL-based ratios moderately correlated with one another. The simplified ratio of SPL to subglottal pressure (without airflow) exhibited the least within-subject variability relative to the other two acoustic-aerodynamic ratios. CONCLUSIONS: These findings indicate that SPL-based acoustic-aerodynamic voice outcome ratios increase significantly following the surgical removal of phonotraumatic vocal fold lesions. The simplified ratio of SPL to subglottal pressure exhibits the least variability and can be easily obtained without requiring the measurement of airflow.

Assuntos

Acústica da Fala , Prega Vocal , Feminino , Humanos , Fonação , Estudos Retrospectivos , Som , Prega Vocal/cirurgia

10.

Lombard Effect in Individuals With Nonphonotraumatic Vocal Hyperfunction: Impact on Acoustic, Aerodynamic, and Vocal Fold Vibratory Parameters.

Castro, Christian; Prado, Pavel; Espinoza, Víctor M; Testart, Alba; Marfull, Daphne; Manriquez, Rodrigo; Stepp, Cara E; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

J Speech Lang Hear Res ; 65(8): 2881-2895, 2022 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-35930680

RESUMO

PURPOSE: This exploratory study aims to investigate variations in voice production in the presence of background noise (Lombard effect) in individuals with nonphonotraumatic vocal hyperfunction (NPVH) and individuals with typical voices using acoustic, aerodynamic, and vocal fold vibratory measures of phonatory function. METHOD: Nineteen participants with NPVH and 19 participants with typical voices produced simple vocal tasks in three sequential background conditions: baseline (in quiet), Lombard (in noise), and recovery (5 min after removing the noise). The Lombard condition consisted of speech-shaped noise at 80 dB SPL through audiometric headphones. Acoustic measures from a microphone, glottal aerodynamic parameters estimated from the oral airflow measured with a circumferentially vented pneumotachograph mask, and vocal fold vibratory parameters from high-speed videoendoscopy were analyzed. RESULTS: During the Lombard condition, both groups exhibited a decrease in open quotient and increases in sound pressure level, peak-to-peak glottal airflow, maximum flow declination rate, and subglottal pressure. During the recovery condition, the acoustic and aerodynamic measures of individuals with typical voices returned to those of the baseline condition; however, recovery measures for individuals with NPVH did not return to baseline values. CONCLUSIONS: As expected, individuals with NPVH and participants with typical voices exhibited a Lombard effect in the presence of elevated background noise levels. During the recovery condition, individuals with NPVH did not return to their baseline state, pointing to a persistence of the Lombard effect after noise removal. This behavior could be related to disruptions in laryngeal motor control and may play a role in the etiology of NPVH. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.20415600.

Assuntos

Prega Vocal , Voz , Acústica , Glote , Humanos , Fonação

11.

Daily Phonotrauma Index: An Objective Indicator of Large Differences in Self-Reported Vocal Status in the Daily Life of Females With Phonotraumatic Vocal Hyperfunction.

Nudelman, Charles J; Ortiz, Andrew J; Fox, Annie B; Mehta, Daryush D; Hillman, Robert E; Van Stan, Jarrad H.

Am J Speech Lang Pathol ; 31(3): 1412-1423, 2022 05 10.

Artigo em Inglês | MEDLINE | ID: mdl-35394805

RESUMO

PURPOSE: The purpose of this study is to evaluate if the Daily Phonotrauma Index (DPI) can quantitatively discriminate large differences in overall vocal status in the daily life of patients with phonotraumatic vocal hyperfunction (PVH). METHOD: For 1-4 weeks, 23 females with PVH wore an ambulatory voice monitor and answered three vocal status questions (i.e., difficulty producing soft, high-pitched phonation; discomfort; and fatigue) at the beginning, at 5-hr intervals, and the end of each day. DPI values were obtained for each patient's time periods of worst and best self-rated vocal status, and data for the group were analyzed for significant changes using a linear mixed-effects regression model. RESULTS: The DPI was significantly lower during periods self-rated as "best vocal status" compared to during periods self-rated as "worst vocal status" (mean difference in DPI = 0.53) with a medium-to-large effect size (Cohen's d = -0.68). CONCLUSIONS: In a group of patients with phonotraumatic lesions, the DPI indicated lower potential for phonotrauma during time periods of better vocal status compared to time periods of worse vocal status. Assuming that a large portion of variance in vocal status for patients with PVH is associated with the extent to which voicing is phonotraumatic, these results support the validity of obtaining estimates of DPI for much shorter time periods (i.e., an estimate every 2 min of voicing) than previous studies (i.e., a single estimate for the entire day or week). Future work can investigate the DPI's use for in-clinic assessment/treatment and ambulatory biofeedback and can gain further insights into phonatory mechanisms that underlie DPI via comparisons with other physiologically relevant measures and computational vocal fold modeling.

Assuntos

Distúrbios da Voz , Voz , Feminino , Humanos , Masculino , Fonação , Autorrelato , Prega Vocal , Distúrbios da Voz/diagnóstico

12.

Clinical Cutoff Scores for Acoustic Indices of Vocal Hyperfunction That Combine Relative Fundamental Frequency and Cepstral Peak Prominence.

Kapsner-Smith, Mara R; Díaz-Cádiz, Manuel E; Vojtech, Jennifer M; Buckley, Daniel P; Mehta, Daryush D; Hillman, Robert E; Tracy, Lauren F; Noordzij, J Pieter; Eadie, Tanya L; Stepp, Cara E.

J Speech Lang Hear Res ; 65(4): 1349-1369, 2022 04 04.

Artigo em Inglês | MEDLINE | ID: mdl-35263546

RESUMO

PURPOSE: This study examined the discriminative ability of acoustic indices of vocal hyperfunction combining smoothed cepstral peak prominence (CPPS) and relative fundamental frequency (RFF). METHOD: Demographic, CPPS, and RFF parameters were entered into logistic regression models trained on two 1:1 case-control groups: individuals with and without nonphonotraumatic vocal hyperfunction (NPVH; n = 360) and phonotraumatic vocal hyperfunction (PVH; n = 240). Equations from the final models were used to predict group membership in two independent test sets (n = 100 each). RESULTS: Both CPPS and RFF parameters significantly improved model fits for NPVH and PVH after accounting for demographics. CPPS explained unique variance beyond RFF in both models. RFF explained unique variance beyond CPPS in the PVH model. Final models included CPPS and RFF offset parameters for both NPVH and PVH; RFF onset parameters were significant only in the PVH model. Area under the receiver operating characteristic curve analysis for the independent test sets revealed acceptable classification for NPVH (72%) and good classification for PVH (86%). CONCLUSIONS: A combination of CPPS and RFF parameters showed better discriminative ability than either measure alone for PVH. Clinical cutoff scores for acoustic indices of vocal hyperfunction are proposed for assessment and screening purposes.

Assuntos

Distúrbios da Voz , Voz , Acústica , Humanos , Curva ROC , Acústica da Fala , Distúrbios da Voz/diagnóstico

13.

Framework for Indirect Spatial Calibration of the Horizontal Plane of Endoscopic Laryngeal Images.

Ghasemzadeh, Hamzeh; Deliyski, Dimitar D; Hillman, Robert E; Mehta, Daryush D.

J Voice ; 2022 Jan 02.

Artigo em Inglês | MEDLINE | ID: mdl-34986994

RESUMO

OBJECTIVE: Calibrated horizontal-plane measurements from laryngeal images could contribute significantly to refining evidence-based practice and developing patient-specific models and precision-medicine approaches. Laser-projection endoscopes can address the need for direct calibrated measures; however, these systems are not widely available. This study presents the framework for an alternative indirect horizontal-plane calibration approach. METHOD: A spatial attribute of a common object, a distinct characteristic that is maintained across images, may be used as a scale for the normalization of other spatial measurements. The outcome of this indirect approach could be used for absolute measurements (eg, in units of mm) or relative measurements (eg, percent change), depending on the information that is available from the common attribute. The required conditions of a common attribute for achieving a valid calibration outcome were studied. Three conditions were derived: registration accuracy of the common attribute, size consistency of the common attribute, and similarity in the vertical distance between the region of interest (ROI) (eg, vocal fold) and the common attribute. Any common attribute satisfying these three conditions was called proper and would result in a valid indirect calibration outcome. Three tests were presented for evaluating the properness of a common attribute. A data-driven statistical method was presented that can evaluate the registration accuracy of a common attribute. The second test used variation in calibrated lengths of a common attribute under different phonatory configurations for evaluating the size consistency condition. Finally, the effect of differences between vertical distances of the ROI and the common attribute was mathematically tested and quantified. The application of the proposed framework for indirect calibration was demonstrated using a pre existing dataset with a vocal fold as the ROI and four different common attributes (vocal fold length, vocal fold width, blood vessel on the vocal fold, and blood vessel on nearby tissue). RESULTS: The proposed registration-accuracy test was able to detect and eliminate instances of common attributes with low accuracies. The analysis suggested that among the studied four common attributes, the vocal fold length had the highest (ie, best) registration accuracy; however, the vocal fold length exhibited the lowest (ie, worst) size consistency. The analysis also suggested that, among the studied attributes, the vocal fold width offered the best trade-off among the three conditions and, hence, was a proper common attribute for calibrating spatial aspects of the vocal folds (length, displacement of edges, velocity, etc). CONCLUSION: Indirect calibration is a feasible alternative for calibration of laryngeal endoscopic images, given a proper common attribute is selected. Future work is needed to systematically evaluate the effects of various phonatory conditions on the characteristics of common attributes.

14.

Automated Relative Fundamental Frequency Algorithms for Use With Neck-Surface Accelerometer Signals.

Groll, Matti D; Vojtech, Jennifer M; Hablani, Surbhi; Mehta, Daryush D; Buckley, Daniel P; Noordzij, J Pieter; Stepp, Cara E.

J Voice ; 36(2): 156-169, 2022 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-32653267

RESUMO

OBJECTIVE: Relative fundamental frequency (RFF) has been suggested as a potential acoustic measure of vocal effort. However, current clinical standards for RFF measures require time-consuming manual markings. Previous semi-automated algorithms have been developed to calculate RFF from microphone signals. The current study aimed to develop fully automated algorithms to calculate RFF from neck-surface accelerometer signals for ecological momentary assessment and ambulatory monitoring of voice. METHODS: Training a set of 2646 /vowel-fricative-vowel/ utterances from 317 unique speakers, with and without voice disorders, was used to develop automated algorithms to calculate RFF values from neck-surface accelerometer signals. The algorithms first rejected utterances with poor vowel-to-noise ratios, then identified fricative locations, then used signal features to determine voicing boundary cycles, and finally calculated corresponding RFF values. These automated RFF values were compared to the clinical gold-standard of manual RFF calculated from simultaneously collected microphone signals in a novel test set of 639 utterances from 77 unique speakers. RESULTS: Automated accelerometer-based RFF values resulted in an average mean bias error (MBE) across all cycles of 0.027 ST, with an MBE of 0.152 ST and -0.252 ST in the offset and onset cycles closest to the fricative, respectively. CONCLUSION: All MBE values were smaller than the expected changes in RFF values following successful voice therapy, suggesting that the current algorithms could be used for ecological momentary assessment and ambulatory monitoring via neck-surface accelerometer signals.

Assuntos

Distúrbios da Voz , Voz , Acelerometria , Algoritmos , Humanos , Acústica da Fala , Distúrbios da Voz/diagnóstico , Qualidade da Voz

15.

Ambulatory Voice Biofeedback: Acquisition and Retention of Modified Daily Voice Use in Patients With Phonotraumatic Vocal Hyperfunction.

Van Stan, Jarrad H; Ortiz, Andrew J; Sternad, Dagmar; Mehta, Daryush D; Huo, Chuanbing; Hillman, Robert E.

Am J Speech Lang Pathol ; 31(1): 409-418, 2022 01 18.

Artigo em Inglês | MEDLINE | ID: mdl-34843372

RESUMO

PURPOSE: Voice ambulatory biofeedback (VAB) has potential to improve carryover of therapeutic voice use into daily life. Previous work in vocally healthy participants demonstrated that motor learning inspired variations to VAB produced expected differences in acquisition and retention of modified daily voice use. This proof-of-concept study was designed to evaluate whether these VAB variations have the same desired effects on acquisition and retention in patients with phonotraumatic vocal hyperfunction (PVH). METHOD: Seventeen female patients with PVH wore an ambulatory voice monitor for 6 days: three baseline days, one biofeedback day, one short-term retention day, and one long-term retention day. Short- and long-term retention were 1- and 7-days postbiofeedback, respectively. Patients were block-randomized to receive one of three types of VAB: 100%, 25%, and Summary. Performance was measured in terms of adherence time below a subject-specific vocal intensity threshold. RESULTS: All three types of VAB produced a biofeedback effect with 13 out of 17 patients displaying an increase in adherence time compared to baseline days. Additionally, multiple patients from each VAB group increased their adherence time during short- and/or long-term retention monitoring compared to baseline. CONCLUSIONS: These findings show that VAB can be associated with acquisition and retention of desired voice use in patients with PVH. Specifically, all three feedback types improved multiple patients' performance and retention for up to 1 week after biofeedback removal. Future work can investigate the impact of incorporating VAB into voice therapy.

Assuntos

Distúrbios da Voz , Voz , Biorretroalimentação Psicológica , Feminino , Humanos , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/terapia

16.

Amount and Characteristics of Speaking and Singing Voice Use in Vocally Healthy Female College Student Singers During a Typical Week.

Toles, Laura E; Ortiz, Andrew J; Marks, Katherine L; Mehta, Daryush D; Van Stan, Jarrad H; Hillman, Robert E.

J Voice ; 36(2): 203-211, 2022 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-32451254

RESUMO

OBJECTIVES: Singers, college students, and females are groups known to be at an elevated risk of developing functional/hyperfunctional voice disorders; therefore, female college students majoring in vocal performance may be at an even higher risk. To mitigate this risk, it would be helpful to know the "safe limits" for voice use that would help maintain vocal health in this vulnerable group, but there is a paucity of high-quality objective information upon which to base such limits. This study employed weeklong ambulatory voice monitoring in a large group of vocally healthy female college student singers to begin providing the types of objective data that could be used to help develop improved vocal health guidelines. METHODS: Participants included 64 vocally healthy females currently enrolled in a vocal performance or similar program at a college or university. An ambulatory voice monitor recorded neck-surface acceleration throughout a typical week. A singing classifier was applied to the data to separate singing from speech. Weeklong vocal dose measures and distributional characteristics for standard voice measures were computed separately for singing and speech, and for both types of phonation combined. RESULTS: Participants spent 6.2% of the total monitoring time speaking and 2.1% singing (with total phonation time being 8.4%). Singing had a higher fo mode, more pitch variability, higher average sound pressure level (SPL), negatively skewed SPL distributions, lower average CPP, and higher H1-H2 values than speaking. CONCLUSIONS: These results provide a basis for beginning to establish vocal health guidelines for female students enrolled in college-level vocal performance programs and for future studies of the types of voice disorders that are common in this group. Results also demonstrate the potential value that ambulatory voice monitoring may have in helping to objectively identify vocal behaviors that could contribute to voice problems in this population.

Assuntos

Canto , Distúrbios da Voz , Feminino , Humanos , Fonação , Estudantes , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/prevenção & controle , Qualidade da Voz

17.

Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders.

Cortés, Juan P; Lin, Jon Z; Marks, Katherine L; Espinoza, Víctor M; Ibarra, Emiro J; Zañartu, Matías; Hillman, Robert E; Mehta, Daryush D.

Appl Sci (Basel) ; 12(21)2022 Nov 01.

Artigo em Inglês | MEDLINE | ID: mdl-36777332

RESUMO

The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.

18.

Relationships Among Personality, Daily Speaking Voice Use, and Phonotrauma in Adult Female Singers.

Toles, Laura E; Roy, Nelson; Sogg, Stephanie; Marks, Katherine L; Ortiz, Andrew J; Fox, Annie B; Mehta, Daryush D; Hillman, Robert E.

J Speech Lang Hear Res ; 64(12): 4580-4598, 2021 12 13.

Artigo em Inglês | MEDLINE | ID: mdl-34731578

RESUMO

PURPOSE: This study sought to determine whether personality traits related to extraversion and impulsivity are more strongly associated with singers with nodules compared to vocally healthy singers and to understand the relationship between personality and the types of daily speaking voice use. METHOD: Weeklong ambulatory voice recordings and personality inventories were obtained for 47 female singers with nodules and 47 vocally healthy female singers. Paired t tests investigated trait differences between groups. Relationships between traits and weeklong speaking voice measures (vocal dose, sound pressure level [SPL], neck surface acceleration magnitude [NSAM], fundamental frequency, cepstral peak prominence [CPP], and the ratio of the first two harmonic magnitudes [H 1 -H 2]) were examined using pairwise Pearson r coefficients. Multiple regressions were performed to estimate voice parameters that correlated with two or more traits. RESULTS: Singers with nodules scored higher on the Social Potency scale (reflecting a tendency toward social dominance) and lower on the Control scale (reflecting impulsivity) compared to the vocally healthy singers. In vocally healthy singers, vocal dose measures were positively correlated with a combination of Wellbeing (i.e., happiness) and Social Potency, mean SPL was positively correlated with Wellbeing, SPL variability was positively correlated with Social Potency and negatively with Harm Avoidance, and CPP mean was positively correlated with Wellbeing. Singers with nodules had a negative correlation between NSAM skewness and Social Potency. Both groups had negative correlations between H 1 -H 2 mean and Social Potency and Social Closeness. CONCLUSIONS: Singers with nodules are more socially dominant and impulsive than vocally healthy singers. Personality traits are related to daily speaking voice use, particularly in vocally healthy singers. Individuals with higher levels of traits related to happiness and social dominance and lower Harm Avoidance tended to speak more, with higher laryngeal forces, with more SPL variability, and with more pressed glottal closure, which could increase risk of phonotrauma.

Assuntos

Laringe , Canto , Distúrbios da Voz , Voz , Adulto , Feminino , Humanos , Personalidade , Distúrbios da Voz/etiologia

19.

Psychometric Analysis of an Ecological Vocal Effort Scale in Individuals With and Without Vocal Hyperfunction During Activities of Daily Living.

Marks, Katherine L; Verdi, Alessandra; Toles, Laura E; Stipancic, Kaila L; Ortiz, Andrew J; Hillman, Robert E; Mehta, Daryush D.

Am J Speech Lang Pathol ; 30(6): 2589-2604, 2021 11 04.

Artigo em Inglês | MEDLINE | ID: mdl-34665647

RESUMO

Objective The purpose of this study was to examine the psychometric properties of an ecological vocal effort scale linked to a voicing task. Method Thirty-eight patients with nodules, 18 patients with muscle tension dysphonia, and 45 vocally healthy control individuals participated in a week of ambulatory voice monitoring. A global vocal status question was asked hourly throughout the day. Participants produced a vowel-consonant-vowel syllable string and rated the vocal effort needed to produce the task on a visual analog scale. Test-retest reliability was calculated for a subset using the intraclass correlation coefficient, ICC(A, 1). Construct validity was assessed by (a) comparing the weeklong vocal effort ratings between the patient and control groups and (b) comparing weeklong vocal effort ratings before and after voice rehabilitation in a subset of 25 patients. Cohen's d, the standard error of measurement (SEM), and the minimal detectable change (MDC) assessed sensitivity. The minimal clinically important difference (MCID) assessed responsiveness. Results Test-retest reliability was excellent, ICC(A, 1) = .96. Weeklong mean effort was statistically higher in the patients than in controls (d = 1.62) and lower after voice rehabilitation (d = 1.75), supporting construct validity and sensitivity. SEM was 4.14, MDC was 11.47, and MCID was 9.74. Since the MCID was within the error of the measure, we must rely upon the MDC to detect real changes in ecological vocal effort. Conclusion The ecological vocal effort scale offers a reliable, valid, and sensitive method of monitoring vocal effort changes during the daily life of individuals with and without vocal hyperfunction.

Assuntos

Disfonia , Voz , Atividades Cotidianas , Disfonia/diagnóstico , Humanos , Psicometria , Reprodutibilidade dos Testes

20.

Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.

Ibarra, Emiro J; Parra, Jesús A; Alzamendi, Gabriel A; Cortés, Juan P; Espinoza, Víctor M; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

Front Physiol ; 12: 732244, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-34539451

RESUMO

The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA