RESUMO
While language is expressed in multiple modalities, including sign, writing, or whistles, speech is arguably the most common. The human vocal tract is capable of producing the bewildering diversity of the 7000 or so currently spoken languages, but relatively little is known about its genetic bases, especially in what concerns normal variation. Here, we capitalize on five cohorts totaling 632 Dutch twins with structural magnetic resonance imaging (MRI) data. Two raters placed clearly defined (semi)landmarks on each MRI scan, from which we derived 146 measures capturing the dimensions and shape of various vocal tract structures, but also aspects of the head and face. We used Genetic Covariance Structure Modeling to estimate the additive genetic, common environmental or non-additive genetic, and unique environmental components, while controlling for various confounds and for any systematic differences between the two raters. We found high heritability, h2, for aspects of the skull and face, the mandible, the anteroposterior (horizontal) dimension of the vocal tract, and the position of the hyoid bone. These findings extend the existing literature, and open new perspectives for understanding the complex interplay between genetics, environment, and culture that shape our vocal tracts, and which may help explain cross-linguistic differences in phonetics and phonology.
Assuntos
Idioma , Imageamento por Ressonância Magnética , Humanos , Fala , Fonética , Estudos de CoortesRESUMO
Lateral vocalization is a cross-linguistically common phenomenon where a lateral is realized as a glide, such as [w, j], or a vowel [u, i]. In this paper, we focus on the articulatory triggers that could cause lateral vocalization. We examined Brazilian Portuguese, a language known for the process of lateral vocalization in coda position. We examined the lateral in onset and coda position in four vocalic environments and compared the dynamic tongue contours and contours at the point of maximum constriction in each environment. We also performed biomechanical simulations of lateral articulation and the vocalized lateral. The results indicate increased tongue body retraction in coda position, which is accompanied by tongue body raising. Simulations further revealed that vocalized laterals mainly recruit intrinsic lingual muscles along with the styloglossus. Taken together, the data suggest that vocalization is a result of positional phonetic effects including lenition and additional retraction in the coda position.
Assuntos
Idioma , Fonética , Brasil , Músculos Faciais , Língua/fisiologiaRESUMO
The voices of heroes and villains in cartoons contribute to their uniqueness and helps shape how we perceive them. However, not much research has looked at the acoustic properties of character voices and the possible contributions these have to cartoon character archetypes. We present a quantitative examination of how voice quality distinguishes between characters based on their alignment as either protagonists or antagonists, performing Principal Component Analysis (PCA) on the Long-term Average Spectra (LTAS) of concatenated passages of the speech of various characters obtained from four different animated cartoons. We then assessed if the categories of "protagonists" and "antagonists" (determined via an a priori classification) could be distinguished using a classification algorithm, and if so, what acoustic characteristics could help distinguish the two categories. The overall results support the idea that protagonists and antagonists can be distinguished by their voice qualities. Support Vector Machine (SVM) analysis yielded an average classification accuracy of 96% across the cartoons. Visualisation of the spectral traits constituting the difference did not yield consistent results but reveals a low-versus-high frequency energy dominance pattern segregating antagonists and protagonists. Future studies can look into how other variables might be confounded with voice quality in distinguishing between these categories.
Assuntos
Qualidade da Voz , Voz , Acústica , Humanos , Fala , Acústica da Fala , Treinamento da VozRESUMO
The standard Beijing variety of Mandarin has a clear alveolar-retroflex contrast for phonemes featuring voiceless sibilant frication (i.e., /s/, /Ê/, /Ês/, /ÊÊ/, /ÊsÊ°/, /ÊÊÊ°/). However, some studies show that varieties in the 'outer circle', such in Taiwan, have a reduced contrast for these speech sounds via a process known as 'deretroflexion'. The variety of Mandarin spoken in Singapore is also considered as 'outer circle', as it exhibits influences from Min Nan varieties. We investigated how bilinguals of Singapore Mandarin and English perceive and produce speech tokens in minimal pairs differing only in the alveolar/retroflex place of articulation. In all, 50 participants took part in two tasks. In Task 1, participants performed a lexical identification task for minimal pairs differing only the alveolar/retroflex place of articulation, as spoken by native speakers of two varieties: Beijing Mandarin and Singapore Mandarin. No difference in comprehension of the words was observed between the two varieties indicating that both varieties contain sufficient acoustic information for discrimination. In Task 2, participants read aloud from the list of minimal pairs while their voices were recorded. Acoustic analysis revealed that the phonemes do indeed differ acoustically in terms of center of gravity of the frication and in an alternative measure: long-term averaged spectra. The magnitude of this difference appears to be smaller than previously reported differences for the Beijing variety. These findings show that although some deretroflexion is evident in the speech of bilinguals of the Singaporean variety of Mandarin, it does not translate to ambiguity in the speech signal.
RESUMO
The genetic basis of the human vocal system is largely unknown, as are the sequence variants that give rise to individual differences in voice and speech. Here, we couple data on diversity in the sequence of the genome with voice and vowel acoustics in speech recordings from 12,901 Icelanders. We show how voice pitch and vowel acoustics vary across the life span and correlate with anthropometric, physiological, and cognitive traits. We found that voice pitch and vowel acoustics have a heritable component and discovered correlated common variants in ABCC9 that associate with voice pitch. The ABCC9 variants also associate with adrenal gene expression and cardiovascular traits. By showing that voice and vowel acoustics are influenced by genetics, we have taken important steps toward understanding the genetics and evolution of the human vocal system.
Assuntos
Acústica da Fala , Voz , Humanos , Fala/fisiologia , AcústicaRESUMO
Performers use various laryngeal settings to create voices for characters and personas they portray. Although some research demonstrates the sociophonetic associations of laryngeal voice quality, few studies have documented or examined the role of harsh voice quality, particularly with vibration of the epilaryngeal structures (growling). This article qualitatively examines phonetic properties of vocal performances in a corpus of popular American media and evaluates the association of voice qualities in these performances with representations of social identity and stereotype. In several cases, contrasting laryngeal states create sociophonetic contrast, and harsh voice quality is paired with the portrayal of racial stereotypes of black people. These cases indicate exaggerated emotional states and are associated with yelling/shouting modes of expression. Overall, however, the functioning of harsh voice quality as it occurs in the data is broader and may involve aggressive posturing, comedic inversion of aggressiveness, vocal pathology, and vocal homage.
Assuntos
Negro ou Afro-Americano , Laringe/fisiologia , Meios de Comunicação de Massa , Fonação/fisiologia , Prega Vocal/fisiologia , Qualidade da Voz/fisiologia , Humanos , Estados UnidosRESUMO
While speech and language do not fossilize, they still leave traces that can be extracted and interpreted. Here, we suggest that the shape of the hard structures of the vocal tract may also allow inferences about the speech of long-gone humans. These build on recent experimental and modelling studies, showing that there is extensive variation between individuals in the precise shape of the vocal tract, and that this variation affects speech and language. In particular, we show that detailed anatomical information concerning two components of the vocal tract (the lower jaw and the hard palate) can be extracted and digitized from the osteological remains of three historical populations from The Netherlands, and can be used to conduct three-dimensional biomechanical simulations of vowel production. We could recover the signatures of inter-individual variation between these vowels, in acoustics and articulation. While 'proof-of-concept', this study suggests that older and less well-preserved remains could be used to draw inferences about historic and prehistoric languages. Moreover, it forces us to clarify the meaning and use of the uniformitarian principle in linguistics, and to consider the wider context of language use, including the anatomy, physiology and cognition of the speakers. This article is part of the theme issue 'Reconstructing prehistoric languages'.
Assuntos
Evolução Cultural , Idioma , Fala/fisiologia , Adulto , Feminino , História do Século XV , História do Século XVI , História do Século XIX , História Medieval , Humanos , Masculino , Pessoa de Meia-Idade , Países Baixos , Medida da Produção da Fala , Adulto JovemRESUMO
Six aryepiglottic trills with varied laryngeal parameters were recorded using high-speed laryngoscopy to investigate the nature of the oscillatory behavior of the upper margin of the epilaryngeal tube. Image analysis techniques were applied to extract data about the patterns of aryepiglottic fold oscillation, with a focus on the oscillatory frequencies of the folds. The acoustic impact of aryepiglottic trilling is also considered, along with possible interactions between the aryepiglottic vibration and vocal fold vibration during the voiced trill. Overall, aryepiglottic trilling is deemed to be correctly labeled as a trill in phonetic terms, while also acting as a means to alter the quality of voicing to be auditorily harsh. In terms of its characterization, aryepiglottic vibration is considerably irregular, but it shows indications of contributing quasi-harmonic excitation of the vocal tract, particularly noticeable under conditions of glottal voicelessness. Aryepiglottic vibrations appear to be largely independent of glottal vibration in terms of oscillatory frequency but can be increased in frequency by increasing overall laryngeal constriction. There is evidence that aryepiglottic vibration induces an alternating vocal fold vibration pattern. It is concluded that aryepiglottic trilling, like ventricular phonation, should be regarded as a complex, if highly irregular, sound source.
Assuntos
Epiglote/fisiologia , Músculos Laríngeos/fisiologia , Laringe/fisiologia , Fonação/fisiologia , Voz/fisiologia , Humanos , Laringoscopia , Vibração , Gravação de VideoteipeRESUMO
Linguistic diversity is affected by multiple factors, but it is usually assumed that variation in the anatomy of our speech organs plays no explanatory role. Here we use realistic computer models of the human speech organs to test whether inter-individual and inter-group variation in the shape of the hard palate (the bony roof of the mouth) affects acoustics of speech sounds. Based on 107 midsagittal MRI scans of the hard palate of human participants, we modelled with high accuracy the articulation of a set of five cross-linguistically representative vowels by agents learning to produce speech sounds. We found that different hard palate shapes result in subtle differences in the acoustics and articulatory strategies of the produced vowels, and that these individual-level speech idiosyncrasies are amplified by the repeated transmission of language across generations. Therefore, we suggest that, besides culture and environment, quantitative biological variation can be amplified, also influencing language.
Assuntos
Palato Duro/fisiologia , Fala/fisiologia , Adolescente , Adulto , Variação Anatômica , Simulação por Computador , Feminino , Humanos , Imageamento por Ressonância Magnética , Masculino , Pessoa de Meia-Idade , Modelos Anatômicos , Palato Duro/anatomia & histologia , Palato Duro/diagnóstico por imagem , Fonética , Acústica da Fala , Adulto JovemRESUMO
People vary at most levels, from the molecular to the cognitive, and the shape of the hard palate (the bony roof of the mouth) is no exception. The patterns of variation in the hard palate are important for the forensic sciences and (palaeo)anthropology, and might also play a role in speech production, both in pathological cases and normal variation. Here we describe a method based on Bézier curves, whose main aim is to generate possible shapes of the hard palate in humans for use in computer simulations of speech production and language evolution. Moreover, our method can also capture existing patterns of variation using few and easy-to-interpret parameters, and fits actual data obtained from MRI traces very well with as little as two or three free parameters. When compared to the widely-used Principal Component Analysis (PCA), our method fits actual data slightly worse for the same number of degrees of freedom. However, it is much better at generating new shapes without requiring a calibration sample, its parameters have clearer interpretations, and their ranges are grounded in geometrical considerations.
Assuntos
Modelos Anatômicos , Palato Duro/anatomia & histologia , Simulação por Computador , Humanos , Imageamento por Ressonância Magnética , Palato Duro/diagnóstico por imagem , Análise de Componente PrincipalRESUMO
Purpose: Recent proposals suggest that (a) the high dimensionality of speech motor control may be reduced via modular neuromuscular organization that takes advantage of intrinsic biomechanical regions of stability and (b) computational modeling provides a means to study whether and how such modularization works. In this study, the focus is on the larynx, a structure that is fundamental to speech production because of its role in phonation and numerous articulatory functions. Method: A 3-dimensional model of the larynx was created using the ArtiSynth platform (http://www.artisynth.org). This model was used to simulate laryngeal articulatory states, including inspiration, glottal fricative, modal prephonation, plain glottal stop, vocal-ventricular stop, and aryepiglotto-epiglottal stop and fricative. Results: Speech-relevant laryngeal biomechanics is rich with "quantal" or highly stable regions within muscle activation space. Conclusions: Quantal laryngeal biomechanics complement a modular view of speech control and have implications for the articulatory-biomechanical grounding of numerous phonetic and phonological phenomena.
Assuntos
Laringe/fisiologia , Fala/fisiologia , Adulto , Fenômenos Biomecânicos , Simulação por Computador , Humanos , Laringe/anatomia & histologia , Masculino , Modelos Biológicos , RespiraçãoRESUMO
PURPOSE Physiological and phonetic studies suggest that, at moderate levels of epilaryngeal stricture, the ventricular folds impinge upon the vocal folds and influence their dynamical behavior, which is thought to be responsible for constricted laryngeal sounds. In this work, the authors examine this hypothesis through biomechanical modeling. METHOD The dynamical response of a low-dimensional, lumped-element model of the vocal folds under the influence of vocal-ventricular fold coupling was evaluated. The model was assessed for F0 and cover-mass phase difference. Case studies of simulations of different constricted phonation types and of glottal stop illustrate various additional aspects of model performance. RESULTS Simulated vocal-ventricular fold coupling lowers F0 and perturbs the mucosal wave. It also appears to reinforce irregular patterns of oscillation, and it can enhance laryngeal closure in glottal stop production. CONCLUSION The effects of simulated vocal-ventricular fold coupling are consistent with sounds, such as creaky voice, harsh voice, and glottal stop, that have been observed to involve epilaryngeal stricture and apparent contact between the vocal folds and ventricular folds. This supports the view that vocal-ventricular fold coupling is important in the vibratory dynamics of such sounds and, furthermore, suggests that these sounds may intrinsically require epilaryngeal stricture.