RESUMEN
Many voice disorders are linked to imbalanced muscle activity and known to exhibit asymmetric vocal fold vibration. However, the relation between imbalanced muscle activation and asymmetric vocal fold vibration is not well understood. This study introduces an asymmetric triangular body-cover model of the vocal folds, controlled by the activation of bilateral intrinsic laryngeal muscles, to investigate the effects of muscle imbalance on vocal fold oscillation. Various scenarios were considered, encompassing imbalance in individual muscles and muscle pairs, as well as accounting for asymmetry in lumped element parameters. Measurements of amplitude and phase asymmetries were employed to match the oscillatory behavior of two pathological cases: unilateral paralysis and muscle tension dysphonia. The resulting simulations exhibit muscle imbalance consistent with expectations in the composition of these voice disorders, yielding asymmetries exceeding 30% for paralysis and below 5% for dysphonia. This underscores the relevance of muscle imbalance in representing phonatory scenarios and its potential for characterizing asymmetry in vocal fold vibration.
Asunto(s)
Músculos Laríngeos , Fonación , Vibración , Pliegues Vocales , Pliegues Vocales/fisiología , Pliegues Vocales/fisiopatología , Humanos , Músculos Laríngeos/fisiología , Músculos Laríngeos/fisiopatología , Simulación por Computador , Disfonía/fisiopatología , Parálisis de los Pliegues Vocales/fisiopatología , Modelos Biológicos , Fenómenos BiomecánicosRESUMEN
Mathematical models that accurately simulate the physiological systems of the human body serve as cornerstone instruments for advancing medical science and facilitating innovative clinical interventions. One application is the modeling of the subglottal tract and neck skin properties for its use in the ambulatory assessment of vocal function, by enabling non-invasive monitoring of glottal airflow via a neck surface accelerometer. For the technique to be effective, the development of an accurate building block model for the subglottal tract is required. Such a model is expected to utilize glottal volume velocity as the input parameter and yield neck skin acceleration as the corresponding output. In contrast to preceding efforts that employed frequency-domain methods, the present paper leverages system identification techniques to derive a parsimonious continuous-time model of the subglottal tract using time-domain data samples. Additionally, an examination of the model order is conducted through the application of various information criteria. Once a low-order model is successfully fitted, an inverse filter based on a Kalman smoother is utilized for the estimation of glottal volume velocity and related aerodynamic metrics, thereby constituting the most efficient execution of these estimates thus far. Anticipated reductions in computational time and complexity due to the lower order of the subglottal model hold particular relevance for real-time monitoring. Simultaneously, the methodology proves efficient in generating a spectrum of aerodynamic features essential for ambulatory vocal function assessment.
RESUMEN
Many voice disorders are linked to imbalanced muscle activity and known to exhibit asymmetric vocal fold vibration. However, the relation between imbalanced muscle activation and asymmetric vocal fold vibration is not well understood. This study introduces an asymmetric triangular body-cover model of the vocal folds, controlled by the activation of intrinsic laryngeal muscles, to investigate the effects of muscle imbalance on vocal fold oscillation. Various scenarios were considered, encompassing imbalance in individual muscles and muscle pairs, as well as accounting for asymmetry in lumped element parameters. The results highlight the antagonistic effect between the thyroarytenoid and cricothyroid muscles on the elastic and mass components of the vocal folds, as well as the impact on the vocal process from the imbalance in the lateral cricoarytenoid and interarytenoid adductor muscles. Measurements of amplitude and phase asymmetry were employed to emulate the oscillatory behavior of two pathological cases: unilateral paralysis and muscle tension dysphonia. The resulting simulations exhibit muscle imbalance consistent with expectations in the composition of these voice disorders, yielding asymmetries exceeding 30% for paralysis and below 5% for dysphonia. This underscores the versatility of muscle imbalance in representing phonatory scenarios and its potential for characterizing asymmetry in vocal fold vibration.
RESUMEN
The use of non-invasive skin accelerometers placed over the extrathoracic trachea has been proposed in the literature for measuring vocal function. Glottal airflow is estimated using inverse filtering or Bayesian techniques based on a subglottal impedance-based model when utilizing these sensors. However, deviations in glottal airflow estimates can arise due to sensor positioning and model mismatch, and addressing them requires a significant computational load. In this paper, we utilize system identification techniques to obtain a low order state-space representation of the subglottal impedance-based model. We then employ the resulting low order model in a Kalman smoother to estimate the glottal airflow. Our proposed approach reduces the model order by 94% and requires only 1.5% of the computing time compared to previous Bayesian methods in the literature, while achieving slightly better accuracy when correcting for glottal airflow deviations. Additionally, our Kalman smoother approach provides a measure of uncertainty in the airflow estimate, which is valuable when measurements are taken under different conditions. With its comparable accuracy in signal estimation and reduced computational load, the proposed approach has the potential for real-time estimation of glottal airflow and its associated uncertainty in wearable voice ambulatory monitors using neck-surface acceleration.
RESUMEN
End-to-end deep learning models have shown promising results for the automatic screening of Parkinson's disease by voice and speech. However, these models often suffer degradation in their performance when applied to scenarios involving multiple corpora. In addition, they also show corpus-dependent clusterings. These facts indicate a lack of generalisation or the presence of certain shortcuts in the decision, and also suggest the need for developing new corpus-independent models. In this respect, this work explores the use of domain adversarial training as a viable strategy to develop models that retain their discriminative capacity to detect Parkinson's disease across diverse datasets. The paper presents three deep learning architectures and their domain adversarial counterparts. The models were evaluated with sustained vowels and diadochokinetic recordings extracted from four corpora with different demographics, dialects or languages, and recording conditions. The results showed that the space distribution of the embedding features extracted by the domain adversarial networks exhibits a higher intra-class cohesion. This behaviour is supported by a decrease in the variability and inter-domain divergence computed within each class. The findings suggest that domain adversarial networks are able to learn the common characteristics present in Parkinsonian voice and speech, which are supposed to be corpus, and consequently, language independent. Overall, this effort provides evidence that domain adaptation techniques refine the existing end-to-end deep learning approaches for Parkinson's disease detection from voice and speech, achieving more generalizable models.
RESUMEN
The neurocomputational model 'Directions into Velocities of Articulators' (DIVA) was developed to account for various aspects of normal and disordered speech production and acquisition. The neural substrates of DIVA were established through functional magnetic resonance imaging (fMRI), providing physiological validation of the model. This study introduces DIVA_EEG an extension of DIVA that utilizes electroencephalography (EEG) to leverage the high temporal resolution and broad availability of EEG over fMRI. For the development of DIVA_EEG, EEG-like signals were derived from original equations describing the activity of the different DIVA maps. Synthetic EEG associated with the utterance of syllables was generated when both unperturbed and perturbed auditory feedback (first formant perturbations) were simulated. The cortical activation maps derived from synthetic EEG closely resembled those of the original DIVA model. To validate DIVA_EEG, the EEG of individuals with typical voices (N = 30) was acquired during an altered auditory feedback paradigm. The resulting empirical brain activity maps significantly overlapped with those predicted by DIVA_EEG. In conjunction with other recent model extensions, DIVA_EEG lays the foundations for constructing a complete neurocomputational framework to tackle vocal and speech disorders, which can guide model-driven personalized interventions.
RESUMEN
Subglottal Impedance-Based Inverse Filtering (IBIF) allows for the continuous, non-invasive estimation of glottal airflow from a surface accelerometer placed over the anterior neck skin below the larynx. It has been shown to be advantageous for the ambulatory monitoring of vocal function, specifically in the use of high-order statistics to understand long-term vocal behavior. However, during long-term ambulatory recordings over several days, conditions may drift from the laboratory environment where the IBIF parameters were initially estimated due to sensor positioning, skin attachment, or temperature, among other factors. Observation uncertainties and model mismatch may result in significant deviations in the glottal airflow estimates; unfortunately, they are very difficult to quantify in ambulatory conditions due to a lack of a reference signal. To address this issue, we propose a Kalman filter implementation of the IBIF filter, which allows for both estimating the model uncertainty and adapting the airflow estimates to correct for signal deviations. One-way analysis of variance (ANOVA) results from laboratory experiments using the Rainbow Passage indicate an improvement using the modified Kalman filter on amplitude-based measures for phonotraumatic vocal hyperfunction (PVH) subjects compared to the standard IBIF; the latter showing a statistically difference (p-value = 0.02, F = 4.1) with respect to a reference glottal volume velocity signal estimated from a single notch filter used here as ground-truth in this work. In contrast, maximum flow declination rates from subjects with vocal phonotrauma exhibit a small but statistically difference between the ground-truth signal and the modified Kalman filter when using one-way ANOVA (p-value = 0.04, F = 3.3). Other measures did not have significant differences with either the modified Kalman filter or IBIF compared to ground-truth, with the exception of H1-H2, whose performance deteriorates for both methods. Overall, both methods (modified Kalman filter and IBIF) show similar glottal airflow measures, with the advantage of the modified Kalman filter to improve amplitude estimation. Moreover, Kalman filter deviations from the IBIF output airflow might suggest a better representation of some fine details in the ground-truth glottal airflow signal. Other applications may take more advantage from the adaptation offered by the modified Kalman filter implementation.
RESUMEN
PURPOSE: This exploratory study aims to investigate variations in voice production in the presence of background noise (Lombard effect) in individuals with nonphonotraumatic vocal hyperfunction (NPVH) and individuals with typical voices using acoustic, aerodynamic, and vocal fold vibratory measures of phonatory function. METHOD: Nineteen participants with NPVH and 19 participants with typical voices produced simple vocal tasks in three sequential background conditions: baseline (in quiet), Lombard (in noise), and recovery (5 min after removing the noise). The Lombard condition consisted of speech-shaped noise at 80 dB SPL through audiometric headphones. Acoustic measures from a microphone, glottal aerodynamic parameters estimated from the oral airflow measured with a circumferentially vented pneumotachograph mask, and vocal fold vibratory parameters from high-speed videoendoscopy were analyzed. RESULTS: During the Lombard condition, both groups exhibited a decrease in open quotient and increases in sound pressure level, peak-to-peak glottal airflow, maximum flow declination rate, and subglottal pressure. During the recovery condition, the acoustic and aerodynamic measures of individuals with typical voices returned to those of the baseline condition; however, recovery measures for individuals with NPVH did not return to baseline values. CONCLUSIONS: As expected, individuals with NPVH and participants with typical voices exhibited a Lombard effect in the presence of elevated background noise levels. During the recovery condition, individuals with NPVH did not return to their baseline state, pointing to a persistence of the Lombard effect after noise removal. This behavior could be related to disruptions in laryngeal motor control and may play a role in the etiology of NPVH. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.20415600.
Asunto(s)
Pliegues Vocales , Voz , Acústica , Glotis , Humanos , FonaciónRESUMEN
Poor laryngeal muscle coordination that results in abnormal glottal posturing is believed to be a primary etiologic factor in common voice disorders such as non-phonotraumatic vocal hyperfunction. Abnormal activity of antagonistic laryngeal muscles is hypothesized to play a key role in the alteration of normal vocal fold biomechanics that results in the dysphonia associated with such disorders. Current low-order models of the vocal folds are unsatisfactory to test this hypothesis since they do not capture the co-contraction of antagonist laryngeal muscle pairs. To address this limitation, a self-sustained triangular body-cover model with full intrinsic muscle control is introduced. The proposed scheme shows good agreement with prior studies using finite element models, excised larynges, and clinical studies in sustained and time-varying vocal gestures. Simulations of vocal fold posturing obtained with distinct antagonistic muscle activation yield clear differences in kinematic, aerodynamic, and acoustic measures. The proposed tool is deemed sufficiently accurate and flexible for future comprehensive investigations of non-phonotraumatic vocal hyperfunction and other laryngeal motor control disorders.
Asunto(s)
Disfonía , Voz , Glotis , Humanos , Músculos Laríngeos/fisiología , Pliegues Vocales/fisiología , Voz/fisiologíaRESUMEN
The ambulatory assessment of vocal function can be significantly enhanced by having access to physiologically based features that describe underlying pathophysiological mechanisms in individuals with voice disorders. This type of enhancement can improve methods for the prevention, diagnosis, and treatment of behaviorally based voice disorders. Unfortunately, the direct measurement of important vocal features such as subglottal pressure, vocal fold collision pressure, and laryngeal muscle activation is impractical in laboratory and ambulatory settings. In this study, we introduce a method to estimate these features during phonation from a neck-surface vibration signal through a framework that integrates a physiologically relevant model of voice production and machine learning tools. The signal from a neck-surface accelerometer is first processed using subglottal impedance-based inverse filtering to yield an estimate of the unsteady glottal airflow. Seven aerodynamic and acoustic features are extracted from the neck surface accelerometer and an optional microphone signal. A neural network architecture is selected to provide a mapping between the seven input features and subglottal pressure, vocal fold collision pressure, and cricothyroid and thyroarytenoid muscle activation. This non-linear mapping is trained solely with 13,000 Monte Carlo simulations of a voice production model that utilizes a symmetric triangular body-cover model of the vocal folds. The performance of the method was compared against laboratory data from synchronous recordings of oral airflow, intraoral pressure, microphone, and neck-surface vibration in 79 vocally healthy female participants uttering consecutive /pæ/ syllable strings at comfortable, loud, and soft levels. The mean absolute error and root-mean-square error for estimating the mean subglottal pressure were 191 Pa (1.95 cm H2O) and 243 Pa (2.48 cm H2O), respectively, which are comparable with previous studies but with the key advantage of not requiring subject-specific training and yielding more output measures. The validation of vocal fold collision pressure and laryngeal muscle activation was performed with synthetic values as reference. These initial results provide valuable insight for further vocal fold model refinement and constitute a proof of concept that the proposed machine learning method is a feasible option for providing physiologically relevant measures for laboratory and ambulatory assessment of vocal function.
RESUMEN
Lumped elements models of vocal folds are relevant research tools that can enhance the understanding of the pathophysiology of many voice disorders. In this paper, we use the port-Hamiltonian framework to obtain an energy-based model for the fluid-structure interactions between the vocal folds and the airflow in the glottis. The vocal fold behavior is represented by a three-mass model and the airflow is described as a fluid with irrotational flow. The proposed approach allows to go beyond the usual quasi-steady one-dimensional flow assumption in lumped mass models. The simulation results show that the proposed energy-based model successfully reproduces the oscillations of the vocal folds, including the collision phenomena, and it is useful to analyze the energy exchange between the airflow and the vocal folds.
RESUMEN
This study introduces the in vivo application of a Bayesian framework to estimate subglottal pressure, laryngeal muscle activation, and vocal fold contact pressure from calibrated transnasal high-speed videoendoscopy and oral airflow data. A subject-specific, lumped-element vocal fold model is estimated using an extended Kalman filter and two observation models involving glottal area and glottal airflow. Model-based inferences using data from a vocally healthy male individual are compared with empirical estimates of subglottal pressure and reference values for muscle activation and contact pressure in the literature, thus providing baseline error metrics for future clinical investigations.
Asunto(s)
Fonación , Voz , Teorema de Bayes , Glotis , Humanos , Masculino , Vibración , Pliegues VocalesRESUMEN
Removal of artifacts induced by muscle activity is crucial for analysis of the electroencephalogram (EEG), and continues to be a challenge in experiments where the subject may speak, change facial expressions, or move. Ensemble empirical mode decomposition with canonical correlation analysis (EEMD-CCA) has been proven to be an efficient method for denoising of EEG contaminated with muscle artifacts. EEMD-CCA, likewise the majority of algorithms, does not incorporate any statistical information of the artifact, namely, electromyogram (EMG) recorded over the muscles actively contaminating the EEG. In this paper, we propose to extend EEMD-CCA in order to include an EMG array as information to aid the removal of artifacts, assessing the performance gain achieved when the number of EMG channels grow. By filtering adaptively (recursive least squares, EMG array as reference) each component resulting from CCA, we aim to ameliorate the distortion of brain signals induced by artifacts and denoising methods. We simulated several noise scenarios based on a linear contamination model, between real and synthetic EEG and EMG signals, and varied the number of EMG channels available to the filter. Our results exhibit a substantial improvement in the performance as the number of EMG electrodes increase from 2 to 16. Further increasing the number of EMG channels up to 128 did not have a significant impact on the performance. We conclude by recommending the use of EMG electrodes to filter components, as it is a computationally inexpensive enhancement that impacts significantly on performance using only a few electrodes.
RESUMEN
The amplitude of auditory steady-state responses (ASSRs) generated in the brainstem of rats exponentially decreases over the sequential averaging of EEG epochs. This behavior is partially due to the adaptation of the ASSR induced by the continuous and monotonous stimulation. In this study, we analyzed the potential clinical relevance of the ASSR adaptation. ASSR were elicited in eight anesthetized adult rats by 8-kHz tones, modulated in amplitude at 115 Hz. We called independent epochs to those EEG epochs acquired with sufficiently long inter-stimulus interval, so the ASSR contained in any given epoch is not affected by the previous stimulation. We tested whether the detection of ASSRs is improved when the response is computed by averaging independent EEG epochs, containing only unadapted auditory responses. The improvements in the ASSR detection obtained with standard, weighted and sorted averaging were compared. In the absence of artifacts, when the ASSR was elicited by continuous acoustic stimulation, the computation of the ASSR amplitude relied upon the averaging method. While the adaptive behavior of the ASSR was still evident after the weighting of epochs, the sorted averaging resulted in under-estimations of the ASSR amplitude. In the absence of artifacts, the ASSR amplitudes computed by averaging independent epochs did not depend on the averaging procedure. Averaging independent epochs resulted in higher ASSR amplitudes and halved the number of EEG epochs needed to be acquired to achieve the maximum detection rate of the ASSR. Acquisition protocols based on averaging independent EEG epochs, in combination with appropriate averaging methods for artifact reduction might contribute to develop more accurate hearing assessments based on ASSRs.
Asunto(s)
Adaptación Fisiológica , Tronco Encefálico/fisiología , Electroencefalografía/métodos , Potenciales Evocados Auditivos del Tronco Encefálico/fisiología , Pruebas Auditivas/métodos , Estimulación Acústica , Animales , Artefactos , Umbral Auditivo/fisiología , Femenino , Masculino , Modelos Animales , Ratas , Ratas WistarRESUMEN
Phonotraumatic vocal hyperfunction (PVH) is associated with chronic misuse and/or abuse of voice that can result in lesions such as vocal fold nodules. The clinical aerodynamic assessment of vocal function has been recently shown to differentiate between patients with PVH and healthy controls to provide meaningful insight into pathophysiological mechanisms associated with these disorders. However, all current clinical assessment of PVH is incomplete because of its inability to objectively identify the type and extent of detrimental phonatory function that is associated with PVH during daily voice use. The current study sought to address this issue by incorporating, for the first time in a comprehensive ambulatory assessment, glottal airflow parameters estimated from a neck-mounted accelerometer and recorded to a smartphone-based voice monitor. We tested this approach on 48 patients with vocal fold nodules and 48 matched healthy-control subjects who each wore the voice monitor for a week. Seven glottal airflow features were estimated every 50 ms using an impedance-based inverse filtering scheme, and seven high-order summary statistics of each feature were computed every 5 minutes over voiced segments. Based on a univariate hypothesis testing, eight glottal airflow summary statistics were found to be statistically different between patient and healthy-control groups. L1-regularized logistic regression for a supervised classification task yielded a mean (standard deviation) area under the ROC curve of 0.82 (0.25) and an accuracy of 0.83 (0.14). These results outperform the state-of-the-art classification for the same classification task and provide a new avenue to improve the assessment and treatment of hyperfunctional voice disorders.
Asunto(s)
Glotis/fisiopatología , Pruebas en el Punto de Atención , Trastornos de la Voz/diagnóstico , Trastornos de la Voz/fisiopatología , Acelerometría , Adulto , Movimientos del Aire , Diagnóstico por Computador , Femenino , Humanos , Persona de Mediana Edad , Teléfono Inteligente , Pliegues Vocales/fisiopatología , Voz , Trastornos de la Voz/etiología , Adulto JovenRESUMEN
Purpose: The purpose of this study was to determine the validity of preliminary reports showing that glottal aerodynamic measures can identify pathophysiological phonatory mechanisms for phonotraumatic and nonphonotraumatic vocal hyperfunction, which are each distinctly different from normal vocal function. Method: Glottal aerodynamic measures (estimates of subglottal air pressure, peak-to-peak airflow, maximum flow declination rate, and open quotient) were obtained noninvasively using a pneumotachograph mask with an intraoral pressure catheter in 16 women with organic vocal fold lesions, 16 women with muscle tension dysphonia, and 2 associated matched control groups with normal voices. Subjects produced /pae/ syllable strings from which glottal airflow was estimated using inverse filtering during /ae/ vowels, and subglottal pressure was estimated during /p/ closures. All measures were normalized for sound pressure level (SPL) and statistically tested for differences between patient and control groups. Results: All SPL-normalized measures were significantly lower in the phonotraumatic group as compared with measures in its control group. For the nonphonotraumatic group, only SPL-normalized subglottal pressure and open quotient were significantly lower than measures in its control group. Conclusions: Results of this study confirm previous hypotheses and preliminary results indicating that SPL-normalized estimates of glottal aerodynamic measures can be used to describe the different pathophysiological phonatory mechanisms associated with phonotraumatic and nonphonotraumatic vocal hyperfunction.
Asunto(s)
Presión del Aire , Disfonía/fisiopatología , Glotis/fisiopatología , Enfermedades de la Laringe/fisiopatología , Fonación/fisiología , Adulto , Algoritmos , Femenino , Humanos , Procesamiento de Señales Asistido por Computador , Interfaz Usuario-ComputadorRESUMEN
Purpose: Our goal was to test prevailing assumptions about the underlying biomechanical and aeroacoustic mechanisms associated with phonotraumatic lesions of the vocal folds using a numerical lumped-element model of voice production. Method: A numerical model with a triangular glottis, posterior glottal opening, and arytenoid posturing is proposed. Normal voice is altered by introducing various prephonatory configurations. Potential compensatory mechanisms (increased subglottal pressure, muscle activation, and supraglottal constriction) are adjusted to restore an acoustic target output through a control loop that mimics a simplified version of auditory feedback. Results: The degree of incomplete glottal closure in both the membranous and posterior portions of the folds consistently leads to a reduction in sound pressure level, fundamental frequency, harmonic richness, and harmonics-to-noise ratio. The compensatory mechanisms lead to significantly increased vocal-fold collision forces, maximum flow-declination rate, and amplitude of unsteady flow, without significantly altering the acoustic output. Conclusion: Modeling provided potentially important insights into the pathophysiology of phonotraumatic vocal hyperfunction by demonstrating that compensatory mechanisms can counteract deterioration in the voice acoustic signal due to incomplete glottal closure, but this also leads to high vocal-fold collision forces (reflected in aerodynamic measures), which significantly increases the risk of developing phonotrauma.
Asunto(s)
Simulación por Computador , Glotis/patología , Glotis/fisiopatología , Modelos Biológicos , Trastornos de la Voz/patología , Trastornos de la Voz/fisiopatología , Acústica , Algoritmos , Percepción Auditiva , Retroalimentación Sensorial , Humanos , Músculos Laríngeos/fisiopatología , Masculino , Voz/fisiologíaRESUMEN
Despite the frequent observation of a persistent opening in the posterior cartilaginous glottis in normal and pathological phonation, its influence on the self-sustained oscillations of the vocal folds is not well understood. The effects of a posterior gap on the vocal fold tissue dynamics and resulting acoustics were numerically investigated using a specially designed flow solver and a reduced-order model of human phonation. The inclusion of posterior gap areas of 0.03-0.1 cm(2) reduced the energy transfer from the fluid to the vocal folds by more than 42%-80% and the radiated sound pressure level by 6-14 dB, respectively. The model was used to simulate vocal hyperfucntion, i.e., patterns of vocal misuse/abuse associated with many of the most common voice disorders. In this first approximation, vocal hyperfunction was modeled by introducing a compensatory increase in lung air pressure to regain the vocal loudness level that was produced prior to introducing a large glottal gap. This resulted in a significant increase in maximum flow declination rate and amplitude of unsteady flow, thereby mimicking clinical studies. The amplitude of unsteady flow was found to be linearly correlated with collision forces, thus being an indicative measure of vocal hyperfunction.
Asunto(s)
Simulación por Computador , Glotis/fisiopatología , Fonación/fisiología , Pliegues Vocales/fisiopatología , Trastornos de la Voz/fisiopatología , Presión del Aire , Humanos , Modelos Lineales , Modelos Teóricos , Ventilación Pulmonar/fisiología , Espectrografía del Sonido , Acústica del Lenguaje , Estadística como Asunto , Calidad de la Voz/fisiologíaRESUMEN
A model-based inverse filtering scheme is proposed for an accurate, non-invasive estimation of the aerodynamic source of voiced sounds at the glottis. The approach, referred to as subglottal impedance-based inverse filtering (IBIF), takes as input the signal from a lightweight accelerometer placed on the skin over the extrathoracic trachea and yields estimates of glottal airflow and its time derivative, offering important advantages over traditional methods that deal with the supraglottal vocal tract. The proposed scheme is based on mechano-acoustic impedance representations from a physiologically-based transmission line model and a lumped skin surface representation. A subject-specific calibration protocol is used to account for individual adjustments of subglottal impedance parameters and mechanical properties of the skin. Preliminary results for sustained vowels with various voice qualities show that the subglottal IBIF scheme yields comparable estimates with respect to current aerodynamics-based methods of clinical vocal assessment. A mean absolute error of less than 10% was observed for two glottal airflow measures -maximum flow declination rate and amplitude of the modulation component- that have been associated with the pathophysiology of some common voice disorders caused by faulty and/or abusive patterns of vocal behavior (i.e., vocal hyperfunction). The proposed method further advances the ambulatory assessment of vocal function based on the neck acceleration signal, that previously have been limited to the estimation of phonation duration, loudness, and pitch. Subglottal IBIF is also suitable for other ambulatory applications in speech communication, in which further evaluation is underway.