|

1.

Spectral features related to the auditory perception of twang-like voices.

Saldías O'Hrens, Marcelo; Castro, Christian; Espinoza, Víctor M; Stoney, Justin; Quezada, Camilo; Laukkanen, Anne-Maria.

Logoped Phoniatr Vocol ; : 1-18, 2024 Apr 24.

Article En | MEDLINE | ID: mdl-38656176

BACKGROUND: To the best of our knowledge, studies on the relationship between spectral energy distribution and the degree of perceived twang-like voices are still sparse. Through an auditory-perceptual test we aimed to explore the spectral features that may relate with the auditory-perception of twang-like voices. METHODS: Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived on seventy-six audio samples. The stimuli consisted of twenty voices recorded from eight CCM singers who sustained the vowel [a:] in different pitches, with and without a twang-like voice. Also, forty filtered and sixteen synthesized-manipulated stimuli were included. RESULTS AND CONCLUSIONS: Based on the intra-rater reliability scores, four judges were identified as suitable to be included in the analyses. Results showed that the frequency of F1 and F2 correlated strongly with the auditory-perception of twang-like voices (0.90 and 0.74, respectively), whereas F3 showed a moderate negative correlation (-0.52). The frequency difference between F1 and F3 showed a strong negative correlation (-0.82). The mean energy between 1-2 kHz and 2-3 kHz correlated moderately (0.51 and 0.42, respectively). The frequency of F4 and F5, and the energy above 3 kHz showed weak correlations. Since the spectral changes under 2 kHz have been associated with the jaw, lips, and tongue adjustments (i.e. vowel articulation) and a higher vertical laryngeal position might affect the frequency of all formants (including F1 and F2), our results suggest that vowel articulation and the laryngeal height may be relevant when performing twang-like voices.

2.

Kalman Filter Implementation of Subglottal Impedance-Based Inverse Filtering to Estimate Glottal Airflow during Phonation.

Cortés, Juan P; Alzamendi, Gabriel A; Weinstein, Alejandro J; Yuz, Juan I; Espinoza, Víctor M; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

Appl Sci (Basel) ; 12(1)2022 Jan.

Article En | MEDLINE | ID: mdl-36313121

Subglottal Impedance-Based Inverse Filtering (IBIF) allows for the continuous, non-invasive estimation of glottal airflow from a surface accelerometer placed over the anterior neck skin below the larynx. It has been shown to be advantageous for the ambulatory monitoring of vocal function, specifically in the use of high-order statistics to understand long-term vocal behavior. However, during long-term ambulatory recordings over several days, conditions may drift from the laboratory environment where the IBIF parameters were initially estimated due to sensor positioning, skin attachment, or temperature, among other factors. Observation uncertainties and model mismatch may result in significant deviations in the glottal airflow estimates; unfortunately, they are very difficult to quantify in ambulatory conditions due to a lack of a reference signal. To address this issue, we propose a Kalman filter implementation of the IBIF filter, which allows for both estimating the model uncertainty and adapting the airflow estimates to correct for signal deviations. One-way analysis of variance (ANOVA) results from laboratory experiments using the Rainbow Passage indicate an improvement using the modified Kalman filter on amplitude-based measures for phonotraumatic vocal hyperfunction (PVH) subjects compared to the standard IBIF; the latter showing a statistically difference (p-value = 0.02, F = 4.1) with respect to a reference glottal volume velocity signal estimated from a single notch filter used here as ground-truth in this work. In contrast, maximum flow declination rates from subjects with vocal phonotrauma exhibit a small but statistically difference between the ground-truth signal and the modified Kalman filter when using one-way ANOVA (p-value = 0.04, F = 3.3). Other measures did not have significant differences with either the modified Kalman filter or IBIF compared to ground-truth, with the exception of H1-H2, whose performance deteriorates for both methods. Overall, both methods (modified Kalman filter and IBIF) show similar glottal airflow measures, with the advantage of the modified Kalman filter to improve amplitude estimation. Moreover, Kalman filter deviations from the IBIF output airflow might suggest a better representation of some fine details in the ground-truth glottal airflow signal. Other applications may take more advantage from the adaptation offered by the modified Kalman filter implementation.

3.

Lombard Effect in Individuals With Nonphonotraumatic Vocal Hyperfunction: Impact on Acoustic, Aerodynamic, and Vocal Fold Vibratory Parameters.

Castro, Christian; Prado, Pavel; Espinoza, Víctor M; Testart, Alba; Marfull, Daphne; Manriquez, Rodrigo; Stepp, Cara E; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

J Speech Lang Hear Res ; 65(8): 2881-2895, 2022 08 17.

Article En | MEDLINE | ID: mdl-35930680

PURPOSE: This exploratory study aims to investigate variations in voice production in the presence of background noise (Lombard effect) in individuals with nonphonotraumatic vocal hyperfunction (NPVH) and individuals with typical voices using acoustic, aerodynamic, and vocal fold vibratory measures of phonatory function. METHOD: Nineteen participants with NPVH and 19 participants with typical voices produced simple vocal tasks in three sequential background conditions: baseline (in quiet), Lombard (in noise), and recovery (5 min after removing the noise). The Lombard condition consisted of speech-shaped noise at 80 dB SPL through audiometric headphones. Acoustic measures from a microphone, glottal aerodynamic parameters estimated from the oral airflow measured with a circumferentially vented pneumotachograph mask, and vocal fold vibratory parameters from high-speed videoendoscopy were analyzed. RESULTS: During the Lombard condition, both groups exhibited a decrease in open quotient and increases in sound pressure level, peak-to-peak glottal airflow, maximum flow declination rate, and subglottal pressure. During the recovery condition, the acoustic and aerodynamic measures of individuals with typical voices returned to those of the baseline condition; however, recovery measures for individuals with NPVH did not return to baseline values. CONCLUSIONS: As expected, individuals with NPVH and participants with typical voices exhibited a Lombard effect in the presence of elevated background noise levels. During the recovery condition, individuals with NPVH did not return to their baseline state, pointing to a persistence of the Lombard effect after noise removal. This behavior could be related to disruptions in laryngeal motor control and may play a role in the etiology of NPVH. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.20415600.

Vocal Cords , Voice , Acoustics , Glottis , Humans , Phonation

4.

Ambulatory Monitoring of Subglottal Pressure Estimated from Neck-Surface Vibration in Individuals with and without Voice Disorders.

Cortés, Juan P; Lin, Jon Z; Marks, Katherine L; Espinoza, Víctor M; Ibarra, Emiro J; Zañartu, Matías; Hillman, Robert E; Mehta, Daryush D.

Appl Sci (Basel) ; 12(21)2022 Nov 01.

Article En | MEDLINE | ID: mdl-36777332

The aerodynamic voice assessment of subglottal air pressure can discriminate between speakers with typical voices from patients with voice disorders, with further evidence validating subglottal pressure as a clinical outcome measure. Although estimating subglottal pressure during phonation is an important component of a standard voice assessment, current methods for estimating subglottal pressure rely on non-natural speech tasks in a clinical or laboratory setting. This study reports on the validation of a method for subglottal pressure estimation in individuals with and without voice disorders that can be translated to connected speech to enable the monitoring of vocal function and behavior in real-world settings. During a laboratory calibration session, a participant-specific multiple regression model was derived to estimate subglottal pressure from a neck-surface vibration signal that can be recorded during natural speech production. The model was derived for vocally typical individuals and patients diagnosed with phonotraumatic vocal fold lesions, primary muscle tension dysphonia, and unilateral vocal fold paralysis. Estimates of subglottal pressure using the developed method exhibited significantly lower error than alternative methods in the literature, with average errors ranging from 1.13 to 2.08 cm H2O for the participant groups. The model was then applied during activities of daily living, thus yielding ambulatory estimates of subglottal pressure for the first time in these populations. Results point to the feasibility and potential of real-time monitoring of subglottal pressure during an individual's daily life for the prevention, assessment, and treatment of voice disorders.

5.

Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.

Ibarra, Emiro J; Parra, Jesús A; Alzamendi, Gabriel A; Cortés, Juan P; Espinoza, Víctor M; Mehta, Daryush D; Hillman, Robert E; Zañartu, Matías.

Front Physiol ; 12: 732244, 2021.

Article En | MEDLINE | ID: mdl-34539451

The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.

6.

Glottal Aerodynamics Estimated From Neck-Surface Vibration in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction.

Espinoza, Víctor M; Mehta, Daryush D; Van Stan, Jarrad H; Hillman, Robert E; Zañartu, Matías.

J Speech Lang Hear Res ; 63(9): 2861-2869, 2020 09 15.

Article En | MEDLINE | ID: mdl-32755502

Purpose The purpose of this study was to determine whether estimates of glottal aerodynamic measures based on neck-surface vibration are comparable to those previously obtained using oral airflow and air pressure signals (Espinoza et al., 2017) in terms of discriminating patients with phonotraumatic and nonphonotraumatic vocal hyperfunction (PVH and NPVH) from vocally healthy controls. Method Consecutive /pae/ syllables at comfortable and loud level were produced by 16 women with PVH (organic vocal fold lesions), 16 women with NPVH (primary muscle tension dysphonia), and 32 vocally healthy women who were each matched to a patient according to age and occupation. Subglottal impedance-based inverse filtering of the anterior neck-surface accelerometer (ACC) signal yielded estimates of peak-to-peak glottal airflow, open quotient, and maximum flow declination rate. Average subglottal pressure and microphone-based sound pressure level (SPL) were also estimated from the ACC signal using subject-specific linear regression models. The ACC-based measures of glottal aerodynamics were normalized for SPL and statistically compared between each patient and matched-control group. Results Patients with PVH and NPVH exhibited lower SPL-normalized glottal aerodynamics values than their respective control subjects (p values ranging from < .01 to .07) with very large effect sizes (1.04-2.16), regardless of loudness condition or measurement method (i.e., ACC-based values maintained discriminatory power). Conclusions The results of this study demonstrate that ACC-based estimates of most glottal aerodynamic measures are comparable to those previously obtained from oral airflow and air pressure (Espinoza et al., 2017) in terms of differentiating between hyperfunctional (PVH and NPVH) and normal vocal function. ACC-based estimates of glottal aerodynamic measures may be used to assess vocal function during continuous speech and enables this assessment of daily voice use during ambulatory monitoring to provide better insight into the pathophysiological mechanisms associated with vocal hyperfunction.

Dysphonia , Voice , Air Pressure , Female , Glottis , Humans , Phonation , Vibration , Vocal Cords

7.

Improved subglottal pressure estimation from neck-surface vibration in healthy speakers producing non-modal phonation.

Lin, Jon Z; Espinoza, Víctor M; Marks, Katherine L; Zañartu, Matías; Mehta, Daryush D.

IEEE J Sel Top Signal Process ; 14(2): 449-460, 2020 Feb.

Article En | MEDLINE | ID: mdl-34079612

Subglottal air pressure plays a major role in voice production and is a primary factor in controlling voice onset, offset, sound pressure level, glottal airflow, vocal fold collision pressures, and variations in fundamental frequency. Previous work has shown promise for the estimation of subglottal pressure from an unobtrusive miniature accelerometer sensor attached to the anterior base of the neck during typical modal voice production across multiple pitch and vowel contexts. This study expands on that work to incorporate additional accelerometer-based measures of vocal function to compensate for non-modal phonation characteristics and achieve an improved estimation of subglottal pressure. Subjects with normal voices repeated /p/-vowel syllable strings from loud-to-soft levels in multiple vowel contexts (/É/, /i/, and /u/), pitch conditions (comfortable, lower than comfortable, higher than comfortable), and voice quality types (modal, breathy, strained, and rough). Subject-specific, stepwise regression models were constructed using root-mean-square (RMS) values of the accelerometer signal alone (baseline condition) and in combination with cepstral peak prominence, fundamental frequency, and glottal airflow measures derived using subglottal impedance-based inverse filtering. Five-fold cross-validation assessed the robustness of model performance using the root-mean-square error metric for each regression model. Each cross-validation fold exhibited up to a 25% decrease in prediction error when the model incorporated multidimensional aspects of the accelerometer signal compared with RMS-only models. Improved estimation of subglottal pressure for non-modal phonation was thus achievable, lending to future studies of subglottal pressure estimation in patients with voice disorders and in ambulatory voice recordings.

8.

The difference between first and second harmonic amplitudes correlates between glottal airflow and neck-surface accelerometer signals during phonation.

Mehta, Daryush D; Espinoza, Víctor M; Van Stan, Jarrad H; Zañartu, Matías; Hillman, Robert E.

J Acoust Soc Am ; 145(5): EL386, 2019 05.

Article En | MEDLINE | ID: mdl-31153299

Miniature high-bandwidth accelerometers on the anterior neck surface are used in laboratory and ambulatory settings to obtain vocal function measures. This study compared the widely applied L1-L2 measure (historically, H1-H2)-the difference between the log-magnitude of the first and second harmonics-computed from the glottal airflow waveform with L1-L2 derived from the raw neck-surface acceleration signal in 79 vocally healthy female speakers. Results showed a significant correlation (r = 0.72) between L1-L2 values estimated from both airflow and accelerometer signals, suggesting that raw accelerometer-based estimates of L1-L2 may be interpreted as reflecting glottal physiological parameters and voice quality attributes during phonation.

Phonation/physiology , Voice Quality/physiology , Voice/physiology , Accelerometry/methods , Female , Glottis/physiology , Humans , Respiratory Physiological Phenomena , Second Harmonic Generation Microscopy/methods , Speech Acoustics

9.

Estimating Vocal Fold Contact Pressure from Raw Laryngeal High-Speed Videoendoscopy Using a Hertz Contact Model.

Díaz-Cádiz, Manuel E; Peterson, Sean D; Galindo, Gabriel E; Espinoza, Víctor M; Motie-Shirazi, Mohsen; Erath, Byron D; Zañartu, Matías.

Appl Sci (Basel) ; 9(11)2019 Jun 01.

Article En | MEDLINE | ID: mdl-34267956

The development of trauma-induced lesions of the vocal folds (VFs) has been linked to a high collision pressure on the VF surface. However, there are no direct methods for the clinical assessment of VF collision, thus limiting the objective assessment of these disorders. In this study, we develop a video processing technique to directly quantify the mechanical impact of the VFs using solely laryngeal kinematic data. The technique is based on an edge tracking framework that estimates the kinematic sequence of each VF edge with a Kalman filter approach and a Hertzian impact model to predict the contact force during the collision. The proposed formulation overcomes several limitations of prior efforts since it uses a more relevant VF contact geometry, it does not require calibrated physical dimensions, it is normalized by the tissue properties, and it applies a correction factor for using a superior view only. The proposed approach is validated against numerical models, silicone vocal fold models, and prior studies. A case study with high-speed videoendoscopy recordings provides initial insights between the sound pressure level and contact pressure. Thus, the proposed method has a high potential in clinical practice and could also be adapted to operate with laryngeal stroboscopic systems.

10.

Ambulatory assessment of phonotraumatic vocal hyperfunction using glottal airflow measures estimated from neck-surface acceleration.

Cortés, Juan P; Espinoza, Víctor M; Ghassemi, Marzyeh; Mehta, Daryush D; Van Stan, Jarrad H; Hillman, Robert E; Guttag, John V; Zañartu, Matías.

PLoS One ; 13(12): e0209017, 2018.

Article En | MEDLINE | ID: mdl-30571719

Phonotraumatic vocal hyperfunction (PVH) is associated with chronic misuse and/or abuse of voice that can result in lesions such as vocal fold nodules. The clinical aerodynamic assessment of vocal function has been recently shown to differentiate between patients with PVH and healthy controls to provide meaningful insight into pathophysiological mechanisms associated with these disorders. However, all current clinical assessment of PVH is incomplete because of its inability to objectively identify the type and extent of detrimental phonatory function that is associated with PVH during daily voice use. The current study sought to address this issue by incorporating, for the first time in a comprehensive ambulatory assessment, glottal airflow parameters estimated from a neck-mounted accelerometer and recorded to a smartphone-based voice monitor. We tested this approach on 48 patients with vocal fold nodules and 48 matched healthy-control subjects who each wore the voice monitor for a week. Seven glottal airflow features were estimated every 50 ms using an impedance-based inverse filtering scheme, and seven high-order summary statistics of each feature were computed every 5 minutes over voiced segments. Based on a univariate hypothesis testing, eight glottal airflow summary statistics were found to be statistically different between patient and healthy-control groups. L1-regularized logistic regression for a supervised classification task yielded a mean (standard deviation) area under the ROC curve of 0.82 (0.25) and an accuracy of 0.83 (0.14). These results outperform the state-of-the-art classification for the same classification task and provide a new avenue to improve the assessment and treatment of hyperfunctional voice disorders.

Glottis/physiopathology , Point-of-Care Testing , Voice Disorders/diagnosis , Voice Disorders/physiopathology , Accelerometry , Adult , Air Movements , Diagnosis, Computer-Assisted , Female , Humans , Middle Aged , Smartphone , Vocal Cords/physiopathology , Voice , Voice Disorders/etiology , Young Adult

11.

Glottal Aerodynamic Measures in Women With Phonotraumatic and Nonphonotraumatic Vocal Hyperfunction.

Espinoza, Víctor M; Zañartu, Matías; Van Stan, Jarrad H; Mehta, Daryush D; Hillman, Robert E.

J Speech Lang Hear Res ; 60(8): 2159-2169, 2017 08 16.

Article En | MEDLINE | ID: mdl-28785762

Purpose: The purpose of this study was to determine the validity of preliminary reports showing that glottal aerodynamic measures can identify pathophysiological phonatory mechanisms for phonotraumatic and nonphonotraumatic vocal hyperfunction, which are each distinctly different from normal vocal function. Method: Glottal aerodynamic measures (estimates of subglottal air pressure, peak-to-peak airflow, maximum flow declination rate, and open quotient) were obtained noninvasively using a pneumotachograph mask with an intraoral pressure catheter in 16 women with organic vocal fold lesions, 16 women with muscle tension dysphonia, and 2 associated matched control groups with normal voices. Subjects produced /pae/ syllable strings from which glottal airflow was estimated using inverse filtering during /ae/ vowels, and subglottal pressure was estimated during /p/ closures. All measures were normalized for sound pressure level (SPL) and statistically tested for differences between patient and control groups. Results: All SPL-normalized measures were significantly lower in the phonotraumatic group as compared with measures in its control group. For the nonphonotraumatic group, only SPL-normalized subglottal pressure and open quotient were significantly lower than measures in its control group. Conclusions: Results of this study confirm previous hypotheses and preliminary results indicating that SPL-normalized estimates of glottal aerodynamic measures can be used to describe the different pathophysiological phonatory mechanisms associated with phonotraumatic and nonphonotraumatic vocal hyperfunction.

Air Pressure , Dysphonia/physiopathology , Glottis/physiopathology , Laryngeal Diseases/physiopathology , Phonation/physiology , Adult , Algorithms , Female , Humans , Signal Processing, Computer-Assisted , User-Computer Interface

12.

Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update.

Mehta, Daryush D; Van Stan, Jarrad H; Zañartu, Matías; Ghassemi, Marzyeh; Guttag, John V; Espinoza, Víctor M; Cortés, Juan P; Cheyne, Harold A; Hillman, Robert E.

Front Bioeng Biotechnol ; 3: 155, 2015.

Article En | MEDLINE | ID: mdl-26528472

Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual's activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders.