Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 33
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
J Acoust Soc Am ; 155(2): 1264-1271, 2024 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-38345424

RESUMEN

The problem of characterizing voice quality has long caused debate and frustration. The richness of the available descriptive vocabulary is overwhelming, but the density and complexity of the information voices convey lead some to conclude that language can never adequately specify what we hear. Others argue that terminology lacks an empirical basis, so that language-based scales are inadequate a priori. Efforts to provide meaningful instrumental characterizations have also had limited success. Such measures may capture sound patterns but cannot at present explain what characteristics, intentions, or identity listeners attribute to the speaker based on those patterns. However, some terms continually reappear across studies. These terms align with acoustic dimensions accounting for variance across speakers and languages and correlate with size and arousal across species. This suggests that labels for quality rest on a bedrock of biology: We have evolved to perceive voices in terms of size/arousal, and these factors structure both voice acoustics and descriptive language. Such linkages could help integrate studies of signals and their meaning, producing a truly interdisciplinary approach to the study of voice.


Asunto(s)
Percepción del Habla , Acústica del Lenguaje , Calidad de la Voz , Sonido , Audición
2.
J Acoust Soc Am ; 151(5): 3462, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35649890

RESUMEN

This study replicates and extends the recent findings of Lee, Keating, and Kreiman [J. Acoust. Soc. Am. 146(3), 1568-1579 (2019)] on acoustic voice variation in read speech, which showed remarkably similar acoustic voice spaces for groups of female and male talkers and the individual talkers within these groups. Principal component analysis was applied to acoustic indices of voice quality measured from phone conversations for 99/100 of the same talkers studied previously. The acoustic voice spaces derived from spontaneous speech are highly similar to those based on read speech, except that unlike read speech, variability in fundamental frequency accounted for significant acoustic variability. Implications of these findings for prototype models of speaker recognition and discrimination are considered.


Asunto(s)
Habla , Voz , Acústica , Femenino , Humanos , Masculino , Acústica del Lenguaje , Calidad de la Voz
3.
J Acoust Soc Am ; 151(2): 1393, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-35232083

RESUMEN

This study compares human speaker discrimination performance for read speech versus casual conversations and explores differences between unfamiliar voices that are "easy" versus "hard" to "tell together" versus "tell apart." Thirty listeners were asked whether pairs of short style-matched or -mismatched, text-independent utterances represented the same or different speakers. Listeners performed better when stimuli were style-matched, particularly in read speech-read speech trials (equal error rate, EER, of 6.96% versus 15.12% in conversation-conversation trials). In contrast, the EER was 20.68% for the style-mismatched condition. When styles were matched, listeners' confidence was higher when speakers were the same versus different; however, style variation caused decreases in listeners' confidence for the "same speaker" trials, suggesting a higher dependency of this task on within-speaker variability. The speakers who were "easy" or "hard" to "tell together" were not the same as those who were "easy" or "hard" to "tell apart." Analysis of speaker acoustic spaces suggested that the difference observed in human approaches to "same speaker" and "different speaker" tasks depends primarily on listeners' different perceptual strategies when dealing with within- versus between-speaker acoustic variability.


Asunto(s)
Percepción del Habla , Voz , Acústica , Humanos , Habla
4.
J Acoust Soc Am ; 149(1): 457, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33514179

RESUMEN

No agreed-upon method currently exists for objective measurement of perceived voice quality. This paper describes validation of a psychoacoustic model designed to fill this gap. This model includes parameters to characterize the harmonic and inharmonic voice sources, vocal tract transfer function, fundamental frequency, and amplitude of the voice, which together serve to completely quantify the integral sound of a target voice sample. In experiment 1, 200 voices with and without diagnosed vocal pathology were fit with the model using analysis-by-synthesis. The resulting synthetic voice samples were not distinguishable from the original voice tokens, suggesting that the model has all the parameters it needs to fully quantify voice quality. In experiment 2 parameters that model the harmonic voice source were removed one by one, and the voice tokens were re-synthesized with the reduced model. In every case the lower-dimensional models provided worse perceptual matches to the quality of the natural tokens than did the original set, indicating that the psychoacoustic model cannot be reduced in dimensionality without loss of fit to the data. Results confirm that this model can be validly applied to quantify voice quality in clinical and research applications.


Asunto(s)
Psicoacústica , Trastornos de la Voz , Voz , Femenino , Humanos , Masculino , Habla , Acústica del Lenguaje , Calidad de la Voz
5.
J Acoust Soc Am ; 146(3): 1568, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31590565

RESUMEN

Little is known about the nature or extent of everyday variability in voice quality. This paper describes a series of principal component analyses to explore within- and between-talker acoustic variation and the extent to which they conform to expectations derived from current models of voice perception. Based on studies of faces and cognitive models of speaker recognition, the authors hypothesized that a few measures would be important across speakers, but that much of within-speaker variability would be idiosyncratic. Analyses used multiple sentence productions from 50 female and 50 male speakers of English, recorded over three days. Twenty-six acoustic variables from a psychoacoustic model of voice quality were measured every 5 ms on vowels and approximants. Across speakers the balance between higher harmonic amplitudes and inharmonic energy in the voice accounted for the most variance (females = 20%, males = 22%). Formant frequencies and their variability accounted for an additional 12% of variance across speakers. Remaining variance appeared largely idiosyncratic, suggesting that the speaker-specific voice space is different for different people. Results further showed that voice spaces for individuals and for the population of talkers have very similar acoustic structures. Implications for prototype models of voice perception and recognition are discussed.


Asunto(s)
Variación Biológica Individual , Variación Biológica Poblacional , Acústica del Lenguaje , Voz/fisiología , Adulto , Femenino , Humanos , Masculino , Fonética , Psicoacústica
6.
J Acoust Soc Am ; 144(1): 375, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-30075658

RESUMEN

Little is known about human and machine speaker discrimination ability when utterances are very short and the speaking style is variable. This study compares text-independent speaker discrimination ability of humans and machines based on utterances shorter than 2 s in two different speaking styles (read sentences and speech directed towards pets, characterized by exaggerated prosody). Recordings of 50 female speakers drawn from the UCLA Speaker Variability Database were used as stimuli. Performance of 65 human listeners was compared to i-vector-based automatic speaker verification systems using mel-frequency cepstral coefficients, voice quality features, which were inspired by a psychoacoustic model of voice perception, or their combination by score-level fusion. Humans always outperformed machines, except in the case of style-mismatched pairs from perceptually-marked speakers. Speaker representations by humans and machines were compared using multi-dimensional scaling (MDS). Canonical correlation analysis showed a weak correlation between machine and human MDS spaces. Multiple regression showed that means of voice quality features could represent the most important human MDS dimension well, but not the dimensions from machines. These results suggest that speaker representations by humans and machines are different, and machine performance might be improved by better understanding how different acoustic features relate to perceived speaker identity.


Asunto(s)
Acústica del Lenguaje , Percepción del Habla/fisiología , Habla/fisiología , Voz/fisiología , Adolescente , Adulto , Comprensión/fisiología , Femenino , Humanos , Lenguaje , Masculino , Calidad de la Voz , Adulto Joven
7.
J Acoust Soc Am ; 139(3): 1404-10, 2016 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-27036277

RESUMEN

A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1-H2), the second and fourth harmonics (H2-H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency (H4-2 kHz), and the harmonic nearest 2 kHz and that nearest 5 kHz (2 kHz-5 kHz). As a step toward model validation, experiments were conducted to establish the acoustic and perceptual independence of these parameters. In experiment 1, the model was fit to a large number of voice sources. Results showed that parameters are predictable from one another, but that these relationships are due to overall spectral roll-off. Two additional experiments addressed the perceptual independence of the source parameters. Listener sensitivity to H1-H2, H2-H4, and H4-2 kHz did not change as a function of the slope of an adjacent component, suggesting that sensitivity to these components is robust. Listener sensitivity to changes in spectral slope from 2 kHz to 5 kHz depended on complex interactions between spectral slope, spectral noise levels, and H4-2 kHz. It is concluded that the four parameters represent non-redundant acoustic and perceptual aspects of voice quality.


Asunto(s)
Acústica , Modelos Teóricos , Acústica del Lenguaje , Calidad de la Voz , Adulto , Femenino , Humanos , Masculino , Psicoacústica , Espectrografía del Sonido , Medición de la Producción del Habla , Adulto Joven
8.
Acta Acust United Acust ; 102(2): 209-213, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-27134616

RESUMEN

Experiments using animal and human larynx models are often conducted without a vocal tract. While it is often assumed that the absence of a vocal tract has only small effects on vocal fold vibration, it is not actually known how sound production and quality are affected. In this study, the validity of using data obtained in the absence of a vocal tract for voice perception studies was investigated. Using a two-layer self-oscillating physical model, three series of voice stimuli were created: one produced with conditions of left-right symmetric vocal fold stiffness, and two with left-right asymmetries in vocal fold body stiffness. Each series included a set of stimuli created with a physical vocal tract, and a second set created without a physical vocal tract. Stimuli were re-synthesized to equalize the mean F0 for each series and normalized for amplitude. Listeners were asked to evaluate the three series in a sort-and-rate task. Multidimensional scaling analysis was applied to examine the perceptual interaction between the voice source and the vocal tract resonances. The results showed that the presence or absence of a vocal tract can significantly affect perception of voice quality changes due to parametric changes in vocal fold properties, except when the parametric changes in vocal fold properties produced an abrupt shift in vocal fold vibratory pattern resulting in a salient quality change.

9.
J Acoust Soc Am ; 138(1): 1-10, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-26233000

RESUMEN

Models of the voice source differ in their fits to natural voices, but it is unclear which differences in fit are perceptually salient. This study examined the relationship between the fit of five voice source models to 40 natural voices, and the degree of perceptual match among stimuli synthesized with each of the modeled sources. Listeners completed a visual sort-and-rate task to compare versions of each voice created with the different source models, and the results were analyzed using multidimensional scaling. Neither fits to pulse shapes nor fits to landmark points on the pulses predicted observed differences in quality. Further, the source models fit the opening phase of the glottal pulses better than they fit the closing phase, but at the same time similarity in quality was better predicted by the timing and amplitude of the negative peak of the flow derivative (part of the closing phase) than by the timing and/or amplitude of peak glottal opening. Results indicate that simply knowing how (or how well) a particular source model fits or does not fit a target source pulse in the time domain provides little insight into what aspects of the voice source are important to listeners.


Asunto(s)
Percepción Auditiva/fisiología , Calidad de la Voz/fisiología , Estimulación Acústica , Adolescente , Adulto , Glotis/fisiología , Humanos , Persona de Mediana Edad , Modelos Biológicos , Localización de Sonidos/fisiología , Espectrografía del Sonido , Adulto Joven
10.
J Acoust Soc Am ; 136(5): 2798-806, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25373979

RESUMEN

The influence of epilaryngeal area on glottal flow and the acoustic signal has been described [Titze, J. Acoust. Soc. Am. 123, 2733-2749 (2008)], but it is not known how (or whether) changes in epilaryngeal area influence perceived voice quality. This study examined these relationships in a kinematic vocal tract model. Epilaryngeal constrictions and expansions were simulated at the levels of the aryepiglottic folds and the ventricular folds in the context of four glottal configurations representing normal vibration to severe vocal fold paralysis, for the three corner vowels /a/, /i/, and /u/. Minimum and maximum glottal flow, maximum flow declination rate, spectral slope, cepstral peak prominence, and the harmonics-to-noise ratio were measured, and listeners completed a perceptual sort-and-rate task for all samples. Epilaryngeal constriction and expansion caused salient differences in voice quality. The location of constriction was also perceivable. Vowels simulated with aryepiglottic constriction demonstrated lower maximum airflow and less noise than the other epilaryngeal shapes, and listeners consistently perceived them as distinct from other stimuli. Acoustic differences decreased with increasing severity of simulated paralysis. Results of epilaryngeal constriction and expansion were similar for /a/ and /i/, and produced slightly different patterns for /u/.


Asunto(s)
Laringe/anatomía & histología , Fonación/fisiología , Acústica del Lenguaje , Percepción del Habla , Adulto , Antropometría , Fenómenos Biomecánicos , Equipos de Comunicación para Personas con Discapacidad , Simulación por Computador , Glotis/fisiología , Glotis/ultraestructura , Humanos , Laringe/patología , Periodicidad , Fonética , Vibración , Parálisis de los Pliegues Vocales/fisiopatología , Parálisis de los Pliegues Vocales/psicología , Pliegues Vocales , Calidad de la Voz
11.
J Acoust Soc Am ; 133(2): 1078-89, 2013 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-23363123

RESUMEN

This study investigates the importance of source spectrum slopes in the perception of phonation by White Hmong listeners. In White Hmong, nonmodal phonation (breathy or creaky voice) accompanies certain lexical tones, but its importance in tonal contrasts is unclear. In this study, native listeners participated in two perceptual tasks, in which they were asked to identify the word they heard. In the first task, participants heard natural stimuli with manipulated F0 and duration (phonation unchanged). Results indicate that phonation is important in identifying the breathy tone, but not the creaky tone. Thus, breathiness can be viewed as contrastive in White Hmong. Next, to understand which parts of the source spectrum listeners use to perceive contrastive breathy phonation, source spectrum slopes were manipulated in the second task to create stimuli ranging from modal to breathy sounding, with F0 held constant. Results indicate that changes in H1-H2 (difference in amplitude between the first and second harmonics) and H2-H4 (difference in amplitude between the second and fourth harmonics) are independently important for distinguishing breathy from modal phonation, consistent with the view that the percept of breathiness is influenced by a steep drop in harmonic energy in the lower frequencies.


Asunto(s)
Fonación , Fonética , Percepción de la Altura Tonal , Reconocimiento en Psicología , Calidad de la Voz , Estimulación Acústica , Adulto , Audiometría del Habla , Señales (Psicología) , Femenino , Humanos , Modelos Logísticos , Masculino , Persona de Mediana Edad , Psicoacústica , Espectrografía del Sonido , Factores de Tiempo
12.
J Acoust Soc Am ; 133(1): 453-62, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23297917

RESUMEN

At present, it is not well understood how changes in vocal fold biomechanics correspond to changes in voice quality. Understanding such cross-domain links from physiology to acoustics to perception in the "speech chain" is of both theoretical and clinical importance. This study investigates links between changes in body layer stiffness, which is regulated primarily by the thyroarytenoid muscle, and the consequent changes in acoustics and voice quality under left-right symmetric and asymmetric stiffness conditions. Voice samples were generated using three series of two-layer physical vocal fold models, which differed only in body stiffness. Differences in perceived voice quality in each series were then measured in a "sort and rate" listening experiment. The results showed that increasing body stiffness better maintained vocal fold adductory position, thereby exciting more high-order harmonics, differences that listeners readily perceived. Changes to the degree of left-right stiffness mismatch and the resulting left-right vibratory asymmetry did not produce perceptually significant differences in quality unless the stiffness mismatch was large enough to cause a change in vibratory mode. This suggests that a vibration pattern with left-right asymmetry does not necessarily result in a salient deviation in voice quality, and thus may not always be of clinical significance.


Asunto(s)
Acústica , Modelos Anatómicos , Fonación , Acústica del Lenguaje , Percepción del Habla , Pliegues Vocales/fisiología , Calidad de la Voz , Fenómenos Biomecánicos , Elasticidad , Femenino , Humanos , Modelos Lineales , Masculino , Presión , Procesamiento de Señales Asistido por Computador , Espectrografía del Sonido , Vibración , Pliegues Vocales/anatomía & histología
13.
J Acoust Soc Am ; 133(3): 1656-66, 2013 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-23464035

RESUMEN

Because voice signals result from vocal fold vibration, perceptually meaningful vibratory measures should quantify those aspects of vibration that correspond to differences in voice quality. In this study, glottal area waveforms were extracted from high-speed videoendoscopy of the vocal folds. Principal component analysis was applied to these waveforms to investigate the factors that vary with voice quality. Results showed that the first principal component derived from tokens without glottal gaps was significantly (p < 0.01) associated with the open quotient (OQ). The alternating-current (AC) measure had a significant effect (p < 0.01) on the first principal component among tokens exhibiting glottal gaps. A measure AC/OQ, defined as the ratio of AC to OQ, was proposed to combine both amplitude and temporal characteristics of the glottal area waveform for both complete and incomplete glottal closures. Analyses of "glide" phonations in which quality varied continuously from breathy to pressed showed that the AC/OQ measure was able to characterize the corresponding continuum of glottal area waveform variation, regardless of the presence or absence of glottal gaps.


Asunto(s)
Glotis/anatomía & histología , Glotis/fisiología , Fonación , Acústica del Lenguaje , Calidad de la Voz , Fenómenos Biomecánicos , Femenino , Humanos , Laringoscopía , Modelos Lineales , Masculino , Análisis de Componente Principal , Factores de Tiempo , Vibración , Grabación en Video , Pliegues Vocales/anatomía & histología , Pliegues Vocales/fisiología
14.
J Acoust Soc Am ; 131(1): 492-500, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22280610

RESUMEN

Although the amount of inharmonic energy (noise) present in a human voice is an important determinant of vocal quality, little is known about the perceptual interaction between harmonic and inharmonic aspects of the voice source. This paper reports three experiments investigating this issue. Results indicate that perception of the harmonic slope and of noise levels are both influenced by complex interactions between the spectral shape and relative levels of harmonic and noise energy in the voice source. Just-noticeable differences (JNDs) for the noise-to-harmonics ratio (NHR) varied significantly with the NHR and harmonic spectral slope, but NHR had no effect on JNDs for NHR when harmonic slopes were steepest, and harmonic slope had no effect when NHRs were highest. Perception of changes in the harmonic source slope depended on NHR and on the harmonic source slope: JNDs increased when spectra rolled off steeply, with this effect in turn depending on NHR. Finally, all effects were modulated by the shape of the noise spectrum. It thus appears that, beyond masking, understanding perception of individual parameters requires knowledge of the acoustic context in which they function, consistent with the view that voices are integral patterns that resist decomposition.


Asunto(s)
Ruido , Calidad de la Voz/fisiología , Estimulación Acústica , Análisis de Varianza , Percepción Auditiva/fisiología , Simulación por Computador , Umbral Diferencial/fisiología , Femenino , Humanos , Masculino , Fonética , Proyectos Piloto , Relación Señal-Ruido , Espectrografía del Sonido
15.
J Acoust Soc Am ; 132(4): 2625-32, 2012 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-23039455

RESUMEN

Increases in open quotient are widely assumed to cause changes in the amplitude of the first harmonic relative to the second (H1*-H2*), which in turn correspond to increases in perceived vocal breathiness. Empirical support for these assumptions is rather limited, and reported relationships among these three descriptive levels have been variable. This study examined the empirical relationship among H1*-H2*, the glottal open quotient (OQ), and glottal area waveform skewness, measured synchronously from audio recordings and high-speed video images of the larynges of six phonetically knowledgeable, vocally healthy speakers who varied fundamental frequency and voice qualities quasi-orthogonally. Across speakers and voice qualities, OQ, the asymmetry coefficient, and fundamental frequency accounted for an average of 74% of the variance in H1*-H2*. However, analyses of individual speakers showed large differences in the strategies used to produce the same intended voice qualities. Thus, H1*-H2* can be predicted with good overall accuracy, but its relationship to phonatory characteristics appears to be speaker dependent.


Asunto(s)
Glotis/fisiología , Fonación , Fonética , Acústica del Lenguaje , Calidad de la Voz , Fenómenos Biomecánicos , Femenino , Glotis/anatomía & histología , Humanos , Laringoscopía , Modelos Lineales , Masculino , Medición de la Producción del Habla , Factores de Tiempo , Grabación en Video
16.
Laryngoscope ; 132(1): 130-134, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34216152

RESUMEN

INTRODUCTION: Vibratory asymmetry and neuromuscular compensation are often seen in laryngeal neuromuscular pathology. However, the ramifications of these findings on voice quality are unclear. This study investigated the effects of varying levels of vibratory asymmetry and neuromuscular compensation on cepstral peak prominence (CPP), an analog of voice quality. STUDY DESIGN: In vivo canine phonation model. METHODS: Varying degrees of vocal fold vibratory asymmetry were achieved by stimulating one recurrent laryngeal nerve (RLN) over 11 levels from threshold to maximal muscle activation. For each of these levels, phonation was induced at systematically varied combinations of neuromuscular compensation: three levels each of contralateral RLN stimulation (80%, 90%, and 100% of maximal), superior laryngeal nerve (SLN) activation (0%, 50%, and 100% of maximal), and airflow levels (500, 700, and 900 mL/s). Vocal fold symmetry was determined by assessing the opening phase of the vibratory cycle in high-speed video recordings. Voice quality was estimated acoustically by calculating CPP for each voice sample. RESULTS: Eight hundred twenty-two phonatory conditions with varying degrees of vibratory asymmetry were evaluated. CPP was highest at vibratory symmetry. Increasing levels of asymmetry resulted in significant decreases in CPP. CPP increased significantly with increasing contralateral RLN activation. CPP was significantly higher at 50% SLN activation than 0% or 100% SLN activation. CONCLUSION: Voice quality, as approximated by CPP, is best at vibratory symmetry and deteriorates with increasing degrees of asymmetry. Voice quality may be improved with neuromuscular compensation by increased adduction of the contralateral vocal fold or increased vocal fold tension at mid-levels of SLN activation. LEVEL OF EVIDENCE: NA, Basic Science Laryngoscope, 132:130-134, 2022.


Asunto(s)
Músculos Laríngeos/anatomía & histología , Nervios Laríngeos/anatomía & histología , Laringe/anatomía & histología , Calidad de la Voz/fisiología , Animales , Perros , Músculos Laríngeos/fisiología , Nervios Laríngeos/fisiología , Laringe/fisiología , Masculino , Vibración
17.
Laryngoscope ; 131(12): 2740-2746, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34106487

RESUMEN

OBJECTIVES: Laryngeal vibratory asymmetry occurring with paresis may result in a perceptually normal or abnormal voice. The present study aims to determine the relationships between the degree of vibratory asymmetry, acoustic measures, and perception of sound stimuli. STUDY DESIGN: Animal Model of Voice Production, Perceptual Analysis of Voice. METHODS: In an in vivo canine model of phonation, symmetric and asymmetric laryngeal vibration were obtained via graded unilateral recurrent laryngeal nerve (RLN) stimulation simulating near paralysis to full activation. Phonation was performed at various contralateral RLN and bilateral superior laryngeal nerve stimulation levels. Naïve listeners rated the perceptual quality of 182 unique phonatory samples using a visual sort-and-rate task. Cepstral peak prominence (CPP) was calculated for each phonatory condition. The relationships among vibratory symmetry, CPP, and perceptual ratings were evaluated. RESULTS: A significant relationship emerged between RLN stimulation and perceptual rating, such that sound samples from low RLN levels were preferred to those from high RLN levels. When symmetric vibration was achieved at mid-RLN stimulation, listeners preferred samples from symmetric vibration over those from asymmetric vibration. However, when symmetry was achieved at high RLN levels, a strained voice quality resulted that listeners dispreferred over asymmetric conditions at lower RLN levels. CPP did not have a linear relationship with perceptual ratings. CONCLUSIONS: Laryngeal vibratory asymmetry produces variable perceptual differences in phonatory sound quality. Though CPP has been correlated with dysphonia in previous research, its complex relationship with quality limits its usefulness as clinical marker of voice quality perception. LEVEL OF EVIDENCE: NA, basic science Laryngoscope, 131:2740-2746, 2021.


Asunto(s)
Disfonía/fisiopatología , Nervios Laríngeos/fisiopatología , Parálisis de los Pliegues Vocales/complicaciones , Pliegues Vocales/fisiopatología , Calidad de la Voz/fisiología , Acústica , Animales , Modelos Animales de Enfermedad , Perros , Disfonía/diagnóstico , Estimulación Eléctrica , Femenino , Humanos , Masculino , Fonación/fisiología , Vibración , Parálisis de los Pliegues Vocales/fisiopatología , Pliegues Vocales/inervación
18.
J Acoust Soc Am ; 128(4): 2085-9, 2010 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-20968379

RESUMEN

Little is known about the perceptual importance of changes in the shape of the source spectrum, although many measures have been proposed and correlations with different vocal qualities (breathiness, roughness, nasality, strain...) have frequently been reported. This study investigated just-noticeable differences in the relative amplitudes of the first two harmonics (H1-H2) for speakers of Mandarin and English. Listeners heard pairs of vowels that differed only in the amplitude of the first harmonic and judged whether or not the voice tokens were identical in voice quality. Across voices and listeners, just-noticeable-differences averaged 3.18 dB. This value is small relative to the range of values across voices, indicating that H1-H2 is a perceptually valid acoustic measure of vocal quality. For both groups of listeners, differences in the amplitude of the first harmonic were easier to detect when the source spectral slope was steeply falling so that F0 dominated the spectrum. Mandarin speakers were significantly more sensitive (by about 1 dB) to differences in first harmonic amplitudes than were English speakers. Two explanations for these results are possible: Mandarin speakers may have learned to hear changes in harmonic amplitudes due to changes in voice quality that are correlated with the tones of Mandarin; or Mandarin speakers' experience with tonal contrasts may increase their sensitivity to small differences in the amplitude of F0 (which is also the first harmonic).


Asunto(s)
Vías Auditivas/fisiología , Fonética , Percepción de la Altura Tonal , Detección de Señal Psicológica , Acústica del Lenguaje , Voz , Estimulación Acústica , Adulto , Audiometría , Femenino , Humanos , Masculino , Persona de Mediana Edad , Multilingüismo , Enmascaramiento Perceptual , Espectrografía del Sonido , Calidad de la Voz , Adulto Joven
20.
Behav Res Methods ; 42(4): 1030-41, 2010 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-21139170

RESUMEN

Voice quality is an important perceptual cue in many disciplines, but knowledge of its nature is limited by a poor understanding of the relevant psychoacoustics. This article (aimed at researchers studying voice, speech, and vocal behavior) describes the UCLA voice synthesizer, software for voice analysis and synthesis designed to test hypotheses about the relationship between acoustic parameters and voice quality perception. The synthesizer provides experimenters with a useful tool for creating and modeling voice signals. In particular, it offers an integrated approach to voice analysis and synthesis and allows easy, precise, spectral-domain manipulations of the harmonic voice source. The synthesizer operates in near real time, using a parsimonious set of acoustic parameters for the voice source and vocal tract that a user can modify to accurately copy the quality of most normal and pathological voices. The software, user's manual, and audio files may be downloaded from http://brm.psychonomic-journals.org/content/supplemental. Future updates may be downloaded from www.surgery.medsch.ucla.edu/glottalaffairs/.


Asunto(s)
Espectrografía del Sonido , Software de Reconocimiento del Habla , Calidad de la Voz , Señales (Psicología) , Habla
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA