Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
J Acoust Soc Am ; 152(6): 3548, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36586864

RESUMO

A well-known concept of singing voice pedagogy is "formant tuning," where the lowest two vocal tract resonances ( fR1, fR2) are systematically tuned to harmonics of the laryngeal voice source to maximize the level of radiated sound. A comprehensive evaluation of this resonance tuning concept is still needed. Here, the effect of fR1, fR2 variation was systematically evaluated in silico across the entire fundamental frequency range of classical singing for three voice source characteristics with spectral slopes of -6, -12, and -18 dB/octave. Respective vocal tract transfer functions were generated with a previously introduced low-dimensional computational model, and resultant radiated sound levels were expressed in dB(A). Two distinct strategies for optimized sound output emerged for low vs high voices. At low pitches, spectral slope was the predominant factor for sound level increase, and resonance tuning only had a marginal effect. In contrast, resonance tuning strategies became more prevalent and voice source strength played an increasingly marginal role as fundamental frequency increased to the upper limits of the soprano range. This suggests that different voice classes (e.g., low male vs high female) likely have fundamentally different strategies for optimizing sound output, which has fundamental implications for pedagogical practice.


Assuntos
Canto , Voz , Masculino , Feminino , Humanos , Simulação por Computador , Som , Vibração
2.
J Acoust Soc Am ; 149(6): 4565, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-34241428

RESUMO

In recent studies, it has been assumed that vocal tract formants (Fn) and the voice source could interact. However, there are only few studies analyzing this assumption in vivo. Here, the vowel transition /i/-/a/-/u/-/i/ of 12 professional classical singers (6 females, 6 males) when phonating on the pitch D4 [fundamental frequency (ƒo) ca. 294 Hz] were analyzed using transnasal high speed videoendoscopy (20.000 fps), electroglottography (EGG), and audio recordings. Fn data were calculated using a cepstral method. Source-filter interaction candidates (SFICs) were determined by (a) algorithmic detection of major intersections of Fn/nƒo and (b) perceptual assessment of the EGG signal. Although the open quotient showed some increase for the /i-a/ and /u-i/ transitions, there were no clear effects at the expected Fn/nƒo intersections. In contrast, ƒo adjustments and changes in the phonovibrogram occurred at perceptually derived SFICs, suggesting level-two interactions. In some cases, these were constituted by intersections between higher nƒo and Fn. The presented data partially corroborates that vowel transitions may result in level-two interactions also in professional singers. However, the lack of systematically detectable effects suggests either the absence of a strong interaction or existence of confounding factors, which may potentially counterbalance the level-two-interactions.


Assuntos
Canto , Voz , Feminino , Humanos , Masculino , Ocupações , Fonação , Qualidade da Voz
3.
J Exp Biol ; 221(Pt 16)2018 08 23.
Artigo em Inglês | MEDLINE | ID: mdl-29880637

RESUMO

The complex and elaborate vocalizations uttered by many of the 10,000 extant bird species are considered a major driver in their evolutionary success, warranting study of the underlying mechanisms of vocal production. Additionally, birdsong has developed into a highly productive model system for vocal imitation learning and motor control, where, in contrast to humans, we have experimental access to the entire neuromechanical control loop. In human voice production, complex laryngeal geometry, vocal fold tissue properties, airflow and laryngeal musculature all interact to ultimately control vocal fold kinematics. Quantifying vocal fold kinematics is thus critical to understanding neuromechanical control of voiced sound production, but in vivo imaging of vocal fold kinematics in birds is experimentally challenging. Here, we adapted and tested electroglottography (EGG) as a novel tool for examining vocal fold kinematics in the avian vocal organ, the syrinx. We furthermore imaged and quantified syringeal kinematics in the pigeon (Columba livia) syrinx with unprecedented detail. Our results show that EGG signals predict (1) the relative amount of contact between the avian equivalent of vocal folds and (2) essential parameters describing vibratory kinematics, such as fundamental frequency, and timing of syringeal opening and closing events. As such, EGG provides novel opportunities for measuring syringeal vibratory kinematic parameters in vivo Furthermore, the opportunity for imaging syringeal vibratory kinematics from multiple planar views (horizontal and coronal) simultaneously promotes birds as an excellent model system for studying kinematics and control of voiced sound production in general, including in humans and other mammals.


Assuntos
Columbidae/fisiologia , Técnicas In Vitro/veterinária , Vocalização Animal/fisiologia , Animais , Fenômenos Biomecânicos , Fenômenos Eletrofisiológicos , Técnicas In Vitro/métodos , Masculino , Gravação em Vídeo , Prega Vocal/fisiologia
4.
J Exp Biol ; 221(Pt 12)2018 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-29615529

RESUMO

Although the call repertoire and its communicative function are relatively well explored in Japanese macaques (Macaca fuscata), little empirical data are available on the physics and the physiology of this species' vocal production mechanism. Here, a 6 year old female Japanese macaque was trained to phonate under an operant conditioning paradigm. The resulting 'coo' calls and spontaneously uttered 'growl' and 'chirp' calls were recorded with sound pressure level (SPL) calibrated microphones and electroglottography (EGG), a non-invasive method for assessing the dynamics of phonation. A total of 448 calls were recorded, complemented by ex vivo recordings on an excised Japanese macaque larynx. In this novel multidimensional investigative paradigm, in vivo and ex vivo data were matched via comparable EGG waveforms. Subsequent analysis suggests that the vocal range (range of fundamental frequency and SPL) of the macaque was comparable to that of a 7-10 year old human, with the exception of low intensity chirps, the production of which may be facilitated by the species' vocal membranes. In coo calls, redundant control of fundamental frequency in relation to SPL was also comparable to that in humans. EGG data revealed that growls, coos and chirps were produced by distinct laryngeal vibratory mechanisms. EGG further suggested changes in the degree of vocal fold adduction in vivo, resulting in spectral variation within the emitted coo calls, ranging from 'breathy' (including aerodynamic noise components) to 'non-breathy'. This is again analogous to humans, corroborating the notion that phonation in humans and non-human primates is based on universal physical and physiological principles.


Assuntos
Macaca/fisiologia , Fonação/fisiologia , Vocalização Animal/fisiologia , Animais , Feminino , Laringe , Pressão , Espectrografia do Som
5.
J Acoust Soc Am ; 137(1): 407-18, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25618070

RESUMO

Accurate measurement of the softest sound levels of phonation presents technical and methodological challenges. This study aimed at (1) reliably obtaining normative data on sustained softest sound levels for the vowel [a:] at comfortable pitch; (2) comparing the results for different frequency and time weighting methods; and (3) refining the Union of European Phoniatricians' recommendation on allowed background noise levels for scientific and equipment manufacturers' purposes. Eighty healthy untrained participants (40 females, 40 males) were investigated in quiet rooms using a head-mounted microphone and a sound level meter at 30 cm distance. The one-second-equivalent sound levels were more stable and more representative for evaluating the softest sustained phonations than the fast-time-weighted levels. At 30 cm, these levels were in the range of 48-61 dB(C)/41-53 dB(A) for females and 49 - 64 dB(C)/35-53 dB(A) for males (5% to 95% quantile range). These ranges may serve as reference data in evaluating vocal normality. In order to reach a signal-to-noise ratio of at least 10 dB for more than 95% of the normal population, the background noise should be below 25 dB(A) and 38 dB(C), respectively, for the softest phonation measurements at 30 cm distance. For the A-weighting, this is 15 dB lower than the previously recommended value.


Assuntos
Fonação/fisiologia , Acústica/instrumentação , Adolescente , Adulto , Calibragem , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Ruído , Valores de Referência , Processamento de Sinais Assistido por Computador , Razão Sinal-Ruído , Fatores de Tempo , Transdutores , Qualidade da Voz , Adulto Jovem
6.
J Exp Biol ; 217(Pt 6): 955-63, 2014 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-24622896

RESUMO

Previous research has suggested that the peaks in the first derivative (dEGG) of the electroglottographic (EGG) signal are good approximate indicators of the events of glottal opening and closing. These findings were based on high-speed video (HSV) recordings with frame rates 10 times lower than the sampling frequencies of the corresponding EGG data. The present study attempts to corroborate these previous findings, utilizing super-HSV recordings. The HSV and EGG recordings (sampled at 27 and 44 kHz, respectively) of an excised canine larynx phonation were synchronized by an external TTL signal to within 0.037 ms. Data were analyzed by means of glottovibrograms, digital kymograms, the glottal area waveform and the vocal fold contact length (VFCL), a new parameter representing the time-varying degree of 'zippering' closure along the anterior-posterior (A-P) glottal axis. The temporal offsets between glottal events (depicted in the HSV recordings) and dEGG peaks in the opening and closing phase of glottal vibration ranged from 0.02 to 0.61 ms, amounting to 0.24-10.88% of the respective glottal cycle durations. All dEGG double peaks coincided with vibratory A-P phase differences. In two out of the three analyzed video sequences, peaks in the first derivative of the VFCL coincided with dEGG peaks, again co-occurring with A-P phase differences. The findings suggest that dEGG peaks do not always coincide with the events of glottal closure and initial opening. Vocal fold contacting and de-contacting do not occur at infinitesimally small instants of time, but extend over a certain interval, particularly under the influence of A-P phase differences.


Assuntos
Cães/fisiologia , Glote/fisiologia , Fonação , Gravação em Vídeo , Animais , Feminino , Espectrografia do Som , Vibração , Prega Vocal/fisiologia
7.
Sci Rep ; 14(1): 13132, 2024 06 07.
Artigo em Inglês | MEDLINE | ID: mdl-38849382

RESUMO

Voice production of humans and most mammals is governed by the MyoElastic-AeroDynamic (MEAD) principle, where an air stream is modulated by self-sustained vocal fold oscillation to generate audible air pressure fluctuations. An alternative mechanism is found in ultrasonic vocalizations of rodents, which are established by an aeroacoustic (AA) phenomenon without vibration of laryngeal tissue. Previously, some authors argued that high-pitched human vocalization is also produced by the AA principle. Here, we investigate the so-called "whistle register" voice production in nine professional female operatic sopranos singing a scale from C6 (≈ 1047 Hz) to G6 (≈ 1568 Hz). Super-high-speed videolaryngoscopy revealed vocal fold collision in all participants, with closed quotients from 30 to 73%. Computational modeling showed that the biomechanical requirements to produce such high-pitched voice would be an increased contraction of the cricothyroid muscle, vocal fold strain of about 50%, and high subglottal pressure. Our data suggest that high-pitched operatic soprano singing uses the MEAD mechanism. Consequently, the commonly used term "whistle register" does not reflect the physical principle of a whistle with regard to voice generation in high pitched classical singing.


Assuntos
Canto , Prega Vocal , Humanos , Feminino , Canto/fisiologia , Fenômenos Biomecânicos , Prega Vocal/fisiologia , Adulto , Som , Voz/fisiologia , Fonação/fisiologia
8.
J Exp Biol ; 216(Pt 21): 4054-64, 2013 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-24133151

RESUMO

Elephants' low-frequency vocalizations are produced by flow-induced self-sustaining oscillations of laryngeal tissue. To date, little is known in detail about the vibratory phenomena in the elephant larynx. Here, we provide a first descriptive report of the complex oscillatory features found in the excised larynx of a 25 year old female African elephant (Loxodonta africana), the largest animal sound generator ever studied experimentally. Sound production was documented with high-speed video, acoustic measurements, air flow and sound pressure level recordings. The anatomy of the larynx was studied with computed tomography (CT) and dissections. Elephant CT vocal anatomy data were further compared with the anatomy of an adult human male. We observed numerous unusual phenomena, not typically reported in human vocal fold vibrations. Phase delays along both the inferior-superior and anterior-posterior (A-P) dimension were commonly observed, as well as transverse travelling wave patterns along the A-P dimension, previously not documented in the literature. Acoustic energy was mainly created during the instant of glottal opening. The vestibular folds, when adducted, participated in tissue vibration, effectively increasing the generated sound pressure level by 12 dB. The complexity of the observed phenomena is partly attributed to the distinct laryngeal anatomy of the elephant larynx, which is not simply a large-scale version of its human counterpart. Travelling waves may be facilitated by low fundamental frequencies and increased vocal fold tension. A travelling wave model is proposed, to account for three types of phenomena: A-P travelling waves, 'conventional' standing wave patterns, and irregular vocal fold vibration.


Assuntos
Elefantes/anatomia & histologia , Elefantes/fisiologia , Laringe/anatomia & histologia , Laringe/fisiologia , Acústica , Movimentos do Ar , Comunicação Animal , Animais , Feminino , Humanos , Masculino , Modelos Biológicos , Pressão , Som , Tomografia Computadorizada por Raios X , Vibração , Gravação de Videoteipe
9.
J Acoust Soc Am ; 133(2): 1055-64, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23363121

RESUMO

Recently, endoscopic high-speed laryngoscopy has been established for commercial use as a state-of-the-art technique to examine vocal fold kinematics. Since modern cameras provide sampling rates of several thousand frames per second, a high volume of data has to be considered for visual and objective analysis. A method for visualizing endoscopic high speed videos in three-dimensional cycle-based graphs combining and extending the approaches of phonovibrograms and electroglottographic wavegrams is presented. To build a phonovibrographic wavegram, individual cycles of a phonovibrogram are segmented, normalized in cycle duration, and concatenated over time. For analyzing purposes, the emerging three-dimensional scalar field is visualized with different rendering techniques providing information of different aspects of vocal fold kinematics. The phonovibrographic wavegram incorporates information about the glottal closure type, size, and location of the amplitudes, symmetry, periodicity, and phase information. The potential of the approach to visualize the characteristics of vocal fold vibration in a compact and intuitive way is demonstrated within two healthy and three pathologic subjects. The phonovibrographic wavegram allows a comprehensive analysis of vocal fold kinematics and reveals information that remains hidden with other visualization techniques.


Assuntos
Interpretação de Imagem Assistida por Computador , Doenças da Laringe/fisiopatologia , Laringoscopia , Fonação , Gravação em Vídeo , Prega Vocal/fisiopatologia , Adulto , Fenômenos Biomecânicos , Estudos de Casos e Controles , Disfonia/fisiopatologia , Feminino , Humanos , Doenças da Laringe/diagnóstico , Masculino , Pessoa de Meia-Idade , Pólipos/fisiopatologia , Valor Preditivo dos Testes , Fatores de Tempo , Vibração , Paralisia das Pregas Vocais/fisiopatologia , Prega Vocal/patologia
10.
Science ; 379(6635): 881-882, 2023 03 03.
Artigo em Inglês | MEDLINE | ID: mdl-36862789

RESUMO

Toothed whales evolved a third way of making sounds similar to that of land mammals and birds.


Assuntos
Cetáceos , Som , Vocalização Animal , Animais , Cetáceos/fisiologia , Oceanos e Mares
11.
J Voice ; 2023 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-37080890

RESUMO

Various authors have argued that belting is to be produced by "speech-like" sounds, with the first and second supraglottic vocal tract resonances (fR1 and fR2) at frequencies of the vowels determined by the lyrics to be sung. Acoustically, the hallmark of belting has been identified as a dominant second harmonic, possibly enhanced by first resonance tuning (fR1≈2fo). It is not clear how both these concepts - (a) phonating with "speech-like," unmodified vowels; and (b) producing a belting sound with a dominant second harmonic, typically enhanced by fR1 - can be upheld when singing across a singer's entire musical pitch range. For instance, anecdotal reports from pedagogues suggest that vowels with a low fR1, such as [i] or [u], might have to be modified considerably (by raising fR1) in order to phonate at higher pitches. These issues were systematically addressed in silico with respect to treble singing, using a linear source-filter voice production model. The dominant harmonic of the radiated spectrum was assessed in 12987 simulations, covering a parameter space of 37 fundamental frequencies (fo) across the musical pitch range from C3 to C6; 27 voice source spectral slope settings from -4 to -30 dB/octave; computed for 13 different IPA vowels. The results suggest that, for most unmodified vowels, the stereotypical belting sound characteristics with a dominant second harmonic can only be produced over a pitch range of about a musical fifth, centered at fo≈0.5fR1. In the [ɔ] and [ɑ] vowels, that range is extended to an octave, supported by a low second resonance. Data aggregation - considering the relative prevalence of vowels in American English - suggests that, historically, belting with fR1≈2fo was derived from speech, and that songs with an extended musical pitch range likely demand considerable vowel modification. We thus argue that - on acoustical grounds - the pedagogical commandment for belting with unmodified, "speech-like" vowels can not always be fulfilled.

12.
J Voice ; 2023 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-37080891

RESUMO

OBJECTIVES/HYPOTHESIS: Vibrato is a core aesthetic element in singing. It varies considerably by both genre and era. Though studied extensively in Western classical singing over the years, there is a dearth of studies on vibrato in contemporary commercial music. In addressing this research gap, the objective of this study was to find and investigate common crossover song material from the opera, operetta, and Schlager singing styles from the historical early 20th to the contemporary 21st century epochs. STUDY DESIGN/METHODS: A total of 51 commercial recordings of two songs, "Es muss was Wunderbares sein" by Ralph Benatzky, and "Die ganze Welt ist himmelblau" by Robert Stolz, from "The White Horse Inn" ("Im weißen Rößl") were collected from opera, operetta, and Schlager singers. Each sample was annotated using Praat and analyzed in a custom Matlab- and Python-based algorithmic approach of singing voice separation and sine wave fitting novel to vibrato research. RESULTS: With respect to vibrato rate and extent, the three most notable findings were that (1) fo and vibrato were inherently connected; (2) Schlager, as a historical aesthetic category, has unique vibrato characteristics, with higher overall rate and lower overall extent; and (3) fo and vibrato extent varied over time based on the historical or contemporary recording year for each genre. CONCLUSIONS: Though these results should be interpreted with caution due to the limited sample size, conducting such acoustical analysis is relevant for voice pedagogy. This study sheds light on the complexity of vocal vibrato production physiology and acoustics while providing insight into various aesthetic choices when performing music of different genres and stylistic time periods. In the age of crossover singing training and commercially available recordings, this investigation reveals important distinctions regarding vocal vibrato across genres and eras that bear beneficial implications for singers and teachers of singing.

13.
J Voice ; 2023 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-36737267

RESUMO

Voice is a major means of communication for humans, non-human mammals and many other vertebrates like birds and anurans. The physical and physiological principles of voice production are described by two theories: the MyoElastic-AeroDynamic (MEAD) theory and the Source-Filter Theory (SFT). While MEAD employs a multiphysics approach to understand the motor control and dynamics of self-sustained vibration of vocal folds or analogous tissues, SFT predominantly uses acoustics to understand spectral changes of the source via linear propagation through the vocal tract. Because the two theories focus on different aspects of voice production, they are often applied distinctly in specific areas of science and engineering. Here, we argue that the MEAD and the SFT are linked integral aspects of a holistic theory of voice production, describing a dynamically coupled system. The aim of this manuscript is to provide a comprehensive review of both the MEAD and the source-filter theory with its nonlinear extension, the latter of which suggests a number of conceptual similarities to sound production in brass instruments. We discuss the application of both theories to voice production of humans as well as of animals. An appraisal of voice production in the light of non-linear dynamics supports the notion that voice production can best be described with a systems view, considering coupled systems rather than isolated contributions of individual sub-systems.

14.
J Voice ; 2023 Jan 13.
Artigo em Inglês | MEDLINE | ID: mdl-36642590

RESUMO

The thorax (TH), the thoracic diaphragm (TD), and the abdominal wall (AW) are three sub-systems of the respiratory apparatus whose displacement motion has been well studied with the use of magnetic resonance imaging (MRI). Another sub-system, which has however received less research attention with respect to breathing, is the pelvic floor (PF). In particular, there is no study that has investigated the displacement of all four sub-systems simultaneously. Addressing this issue, it was the purpose of this feasibility study to establish a data acquisition paradigm for time-synchronous quantitative analysis of dynamic MRI data from these four major contributors to respiration and phonation (TH, TD, AW, and PF). Three healthy females were asked to breathe in and out forcefully while being recorded in a 1.5-Tesla whole body MR-scanner. Spanning a sequence of 15.12 seconds, 40 MRI data frames were acquired. Each data frame contained two slices, simultaneously documenting the mid-sagittal (TH, TD, PF) and transversal (AW) planes. The displacement motion of the four anatomical structures of interest was documented using kymographic analysis, resulting in time-varying calibrated structure displacement data. After computing the fundamental frequency of the cyclical breathing motion, the phase offsets of the TH, PF, and AW with respect to the TD were computed. Data analysis revealed three fundamentally different displacement patterns. Total structure displacement was in the range of 0.94 cm (TH) to 4.27 cm (TD). Phase delays of up to 90∘ (i.e., a quarter of a breathing cycle) between different structures were found. Motion offsets in the range of -28.30∘ to 14.90∘ were computed for the PF with respect to the TD. The diversity of results in only three investigated participants suggests a variety of possible breathing strategies, warranting further research.

15.
Curr Biol ; 33(21): 4727-4732.e4, 2023 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-37794583

RESUMO

Most mammals produce vocal sounds according to the myoelastic-aerodynamic (MEAD) principle, through self-sustaining oscillation of laryngeal tissues.1,2 In contrast, cats have long been believed to produce their low-frequency purr vocalizations through a radically different mechanism involving active muscle contractions (AMC), where neurally driven electromyographic burst patterns (typically at 20-30 Hz) cause the intrinsic laryngeal muscles to actively modulate the respiratory airflow. Direct empirical evidence for this AMC mechanism is sparse.3 Here, the fundamental frequency (fo) ranges of eight domestic cats (Felis silvestris catus) were investigated in an excised larynx setup, to test the prediction of the AMC hypothesis that vibration should be impossible without neuromuscular activity, and thus unattainable in excised larynx setups, which are based on MEAD principles. Surprisingly, all eight excised larynges produced self-sustained oscillations at typical cat purring rates. Histological analysis of cat larynges revealed the presence of connective tissue masses, up to 4 mm in diameter, embedded in the vocal fold.4 This vocal fold specialization appears to allow the unusually low fo values observed in purring. While our data do not fully reject the AMC hypothesis for purring, they show that cat larynges can easily produce sounds in the purr regime with fundamental frequencies of 25 to 30 Hz without neural input or muscular contraction. This strongly suggests that the physical and physiological basis of cat purring involves the same MEAD-based mechanisms as other cat vocalizations (e.g., meows) and most other vertebrate vocalizations but is potentially augmented by AMC.


Assuntos
Laringe , Animais , Gatos , Laringe/fisiologia , Prega Vocal/fisiologia , Vocalização Animal/fisiologia , Som , Vibração , Músculos Laríngeos , Mamíferos
16.
J Voice ; 36(5): 735.e19-735.e25, 2022 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-33243666

RESUMO

The social distancing measures implemented world-wide in the wake of the novel Coronavirus (COVID-19) crisis have forced voice pedagogues to alter their teaching habits, likely shifting from customary in-person teaching to virtual teaching. An online survey, distributed world-wide in April/May 2020, investigated how singing voice pedagogues were impacted by the COVID-19 crisis. The collected responses from 387 survey participants suggest that, overall, voice teachers were only moderately satisfied with having to teach virtually, indicating that virtual voice teaching is not a sufficient replacement for in-person teaching. The participants indicated that during virtual teaching the singing voice can be assessed relatively well through features which provide both acoustic and visual clues. In contrast, depending on utilized technology, it may be harder to judge those aspects of the singing voice that are solely defined acoustically, such as dynamic range and spectral composition. This may be explained by limitations imposed by "out of the box" technology for online communication, which is typically optimized for speech instead of singing. This calls for better information on technological solutions for virtual voice teaching.


Assuntos
COVID-19 , Canto , Hábitos , Humanos , Ensino , Qualidade da Voz , Treinamento da Voz
17.
Science ; 377(6607): 760-763, 2022 08 12.
Artigo em Inglês | MEDLINE | ID: mdl-35951711

RESUMO

Human speech production obeys the same acoustic principles as vocal production in other animals but has distinctive features: A stable vocal source is filtered by rapidly changing formant frequencies. To understand speech evolution, we examined a wide range of primates, combining observations of phonation with mathematical modeling. We found that source stability relies upon simplifications in laryngeal anatomy, specifically the loss of air sacs and vocal membranes. We conclude that the evolutionary loss of vocal membranes allows human speech to mostly avoid the spontaneous nonlinear phenomena and acoustic chaos common in other primate vocalizations. This loss allows our larynx to produce stable, harmonic-rich phonation, ideally highlighting formant changes that convey most phonetic information. Paradoxically, the increased complexity of human spoken language thus followed simplification of our laryngeal anatomy.


Assuntos
Evolução Biológica , Laringe , Fonação , Primatas , Fala , Prega Vocal , Animais , Humanos , Laringe/anatomia & histologia , Fonética , Acústica da Fala , Prega Vocal/anatomia & histologia
18.
J Acoust Soc Am ; 129(4): 2253-62, 2011 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-21476680

RESUMO

While vocal fold adduction is an important parameter in speech, relatively little has been known on the adjustment of the vocal fold adduction in singing. This study investigates the possibility of separate adjustments of cartilaginous and membranous vocal fold adduction in singing. Six female and seven male subjects, singers and non-singers, were asked to imitate an instructor in producing four phonation types: "aBducted falsetto" (FaB), "aDducted falsetto" (FaD), "aBducted Chest" (CaB), and "aDducted Chest" (CaD). The phonations were evaluated using videostroboscopy, videokymography (VKG), electroglottography (EGG), and audio recordings. All the subjects showed less posterior (cartilaginous) vocal fold adduction in phonation types FaB and CaB than in FaD and CaD, and less membranous vocal fold adduction (smaller closed quotient) in FaB and FaD than in CaB and CaD. The findings indicate that the exercises enabled the singers to separately manipulate (a) cartilaginous adduction and (b) membranous medialization of the glottis though vocal fold bulging. Membranous adduction (monitored via videokymographic closed quotient) was influenced by both membranous medialization and cartilaginous adduction. Individual control over these types of vocal fold adjustments allows singers to create different vocal timbres.


Assuntos
Cartilagens Laríngeas/fisiologia , Música , Prega Vocal/fisiologia , Qualidade da Voz/fisiologia , Feminino , Glote/anatomia & histologia , Glote/fisiologia , Humanos , Cartilagens Laríngeas/anatomia & histologia , Masculino , Membranas/anatomia & histologia , Membranas/fisiologia , Fonação/fisiologia , Prega Vocal/anatomia & histologia
19.
J Voice ; 35(3): 365-375, 2021 May.
Artigo em Inglês | MEDLINE | ID: mdl-32165022

RESUMO

Subharmonics are an important class of voice signals, relevant for speech, pathological voice, singing, and animal bioacoustics. They arise from special cases of amplitude (AM) or frequency modulation (FM) of the time-domain signal. Surprisingly, to date there is only one open source subharmonics detector available to the scientific community: Sun's subharmonic-to-harmonic ratio (SHR). Here, this algorithm was subjected to a formal evaluation with two data sets of synthesized and empirical speech samples. Both data sets consisted of electroglottographic (EGG) signals, ie, a physiological correlate of vocal fold oscillation that bypasses vocal tract acoustics. Data Set I contained 2560 synthesized EGG signals with varying degrees of AM and FM, fundamental frequency (fo), periodicity, and signal-to-noise ratio (SNR). Data Set II was made up of 25 EGG samples extracted from the CMU Arctic speech data base. For a "ground truth" of subharmonicity, these samples were manually annotated by a group of five external experts. Analysis of the synthesized data suggested that the SHR metric is relatively robust as long as the subharmonic modulation extent is below 0.35 and 0.7 for the FM and AM scenarios, respectively. In the CMU Arctic speech data samples, the SHR analysis reached a maximum sensitivity of about 87% at a specificity of over 90%, but only for adaptive algorithm parameter settings. In contrast, the algorithm's default parameter settings could only successfully classify about 9% of all subharmonic instances. The SHR is a useful metric for assessing the degree of subharmonics contained in voice signals, but only at adaptive parameter settings. In particular, the frequency ceiling should be set to five times the highest fo, and the frame length to at least five times the largest fundamental period of the analyzed signal. For subharmonic classification a threshold of SHR  ≥  0.01 is recommended.


Assuntos
Algoritmos , Canto , Prega Vocal , Voz , Fonação , Acústica da Fala , Prega Vocal/diagnóstico por imagem
20.
J Voice ; 2021 Nov 08.
Artigo em Inglês | MEDLINE | ID: mdl-34763995

RESUMO

The choice of terms to describe and assess the singing voice is an essential part of vocal pedagogy. However, previous work suggested that singing terminology used in academia may be somewhat ambiguous. To address this issue, the authors a) compiled a comprehensive inventory of singing voice assessment terms used by English-speaking academic institutions worldwide and b) with the help of 22 highly experienced singing voice teachers, grouped the most prevalent terms based on their conceptual similarity. Only about a fifth of all targeted institutions provided materials and information online. Overall, a total of 292 different terms were found in the 64 available sources. This surprisingly large number of terms could be reduced by approximately 61% through lexical grouping. In the resulting data set, only 24 of the 114 terms occurred in at least 20% of the online sources, suggesting a rather low current density of information as well as little to no systematic and coordinated use of terms across institutions. The singing voice expert's similarity rating of the 24 most prevalent terms revealed a non-uniform distribution, suggesting that only some of these terms can be used interchangeably. Overall, these findings hint at the underlying complexity of voice assessment on a descriptive and qualitative level, highlighting the need for further research in this area.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa