Your browser doesn't support javascript.
loading
Temporal and Spectral Models as Correlates to Auditory-Perceptual Judgments of Overall Severity and Listener Comfort in Tracheoesophageal Voice.
Doyle, Philip C; Ghasemzadeh, Hamzeh; Searl, Jeff.
Affiliation
  • Doyle PC; Otolaryngology Head and Neck Surgery, Division of Laryngology, School of Medicine Stanford University, Stanford University, Stanford, CA 94305, USA.
  • Ghasemzadeh H; Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA 02114, USA.
  • Searl J; Department of Surgery Harvard Medical School, Boston, MA 02114, USA.
Appl Sci (Basel) ; 14(1)2024 Jan 01.
Article de En | MEDLINE | ID: mdl-38699704
ABSTRACT

INTRODUCTION:

This study pursued two

objectives:

(1) to determine the potential association between listener (n = 51) judgments of 20 male tracheoesophageal speaker samples for two auditory-perceptual dimensions of voice, overall severity (OS) and listener comfort (LC); and (2) to assess the temporal and spectral acoustic correlates for these auditory-perceptual dimensions.

METHODOLOGY:

Three separate correlation analyses were performed to evaluate the association between OS and LC. First, scores of OS and LC from all listeners were pooled together, and then the correlation between OS and LC was computed. Second, scores of OS and LC were averaged over all listeners to derive a single estimate of OS and LC for each TE speaker sample; the correlation between the average OS and LC was then computed. Third, listener-to-listener variability in the association between OS and LC was evaluated by computing the correlation between OS and LC scores from each listener across all TE samples. Finally, two stepwise multiple regression models were created to relate the average LC score to spectral and temporal variation in the acoustic signal.

RESULTS:

While the pooled OS and LC scores had a moderate positive correlation (r = 0.66, p < 0.00001), the averaged OS and LC exhibited a near perfect positive correlation (r = 0.99, p < 0.00001). The significant differences between the pooled and averaged scores were explained by significant listener-to-listener variability in the association between OS and LC. OS and LC scores from 5 listeners had non-significant correlations, 10 had moderate correlations (r < 0.7), 35 listeners had high correlations (0.7 < r < 0.9), and 1 listener had a very high correlation (r < 0.9 < 1). Finally, the acoustic models created based on the spectral and temporal variations in the signal were able to account for 87.7% and 61.8% of variation in the average LC score.

CONCLUSIONS:

The strong correlations between OS and LC suggest that LC may, in fact, provide a more comprehensive auditory-perceptual surrogate for the voice quality of TE speakers. Although OS and LC are distinct conceptual dimensions, LC appears to have the advantage of assessing the social impact and potential communication disability that may exist in interactions between TE speakers and listeners.
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Langue: En Journal: Appl Sci (Basel) Année: 2024 Type de document: Article Pays d'affiliation: États-Unis d'Amérique Pays de publication: Suisse

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Langue: En Journal: Appl Sci (Basel) Année: 2024 Type de document: Article Pays d'affiliation: États-Unis d'Amérique Pays de publication: Suisse