Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 11 de 11
Filter
1.
Science ; 164(3884): 1185-7, 1969 Jun 06.
Article in English | MEDLINE | ID: mdl-4976883

ABSTRACT

The vowel repertoire of a rhesus monkey (Macaca mulatta) was explored by means of a computer program that calculated formant frequencies from the area function of the animal's supralaryngeal vocal tract, which was systematically varied within the limits imposed by anatomical constraints. The resulting vowels were compared with those of humans and with recorded vocalizations of nonhuman primates. The computer model indicates that the acoustic "vowel space" of a rhesus monkey is quite restricted compared to that of the human. This limitation results from the lack of a pharyngeal region that can change its cross-sectional area. These animals thus lack the output mechanism necessary for production of human speech. Man's speech output mechanism is apparently species-specific.


Subject(s)
Haplorhini , Phonetics , Speech , Vocalization, Animal , Animals , Computers , Humans , Models, Biological , Pharynx/anatomy & histology , Species Specificity , Speech, Alaryngeal
2.
J Speech Hear Res ; 18(4): 686-706, 1975 Dec.
Article in English | MEDLINE | ID: mdl-1207100

ABSTRACT

The voice onset time (VOT) and the duration of the burst of frication noise at the release of a plosive consonant were measured from spectrograms of word-initial consonant clusters. Mean data from three speakers reading English words in a sentence frame indicated that the VOT changed as a function of the place of articulation of the plosive and as a function of the identity of the following vowel or sonorant consonant. Burst durations varied in a similar way such that the remaining interval of aspiration in /p, t, k/ was nearly the same duration in comparable phonetic environments. The VOT was longer before sonorants and high vowels than before mid- and low vowels. Aspiration was also seen in an /s/-sonorant cluster. To explain these regularities, production strategies and perceptual cues to a voicing decision for English plosives are considered. Variations in VOT are explained in terms of articulatory mechanisms, perceptual constraints, and phonological rules. Some VOT data obtained from a connected discourse were also analyzed and organized into a set of rules for predicting voice onset time in any sentence context.


Subject(s)
Speech , Acoustics , Cues , Humans , Male , Phonetics , Psychoacoustics , Speech/physiology , Time Factors , Voice
3.
J Acoust Soc Am ; 82(3): 737-93, 1987 Sep.
Article in English | MEDLINE | ID: mdl-2958525

ABSTRACT

The automatic conversion of English text to synthetic speech is presently being performed, remarkably well, by a number of laboratory systems and commercial devices. Progress in this area has been made possible by advances in linguistic theory, acoustic-phonetic characterization of English sound patterns, perceptual psychology, mathematical modeling of speech production, structured programming, and computer hardware design. This review traces the early work on the development of speech synthesizers, discovery of minimal acoustic cues for phonetic contrasts, evolution of phonemic rule programs, incorporation of prosodic rules, and formulation of techniques for text analysis. Examples of rules are used liberally to illustrate the state of the art. Many of the examples are taken from Klattalk, a text-to-speech system developed by the author. A number of scientific problems are identified that prevent current systems from achieving the goal of completely human-sounding speech. While the emphasis is on rule programs that drive a format synthesizer, alternatives such as articulatory synthesis and waveform concatenation are also reviewed. An extensive bibliography has been assembled to show both the breadth of synthesis activity and the wealth of phenomena covered by rules in the best of these programs. A recording of selected examples of the historical development of synthetic speech, enclosed as a 33 1/3-rpm record, is described in the Appendix.


Subject(s)
Communication Aids for Disabled , Phonetics , Self-Help Devices , Speech , Humans , Language , Linguistics
4.
J Acoust Soc Am ; 87(2): 820-57, 1990 Feb.
Article in English | MEDLINE | ID: mdl-2137837

ABSTRACT

Voice quality variations include a set of voicing sound source modifications ranging from laryngealized to normal to breathy phonation. Analysis of reiterant imitations of two sentences by ten female and six male talkers has shown that the potential acoustic cues to this type of voice quality variation include: (1) increases to the relative amplitude of the fundamental frequency component as open quotient increases; (2) increases to the amount of aspiration noise that replaces higher frequency harmonics as the arytenoids become more separated; (3) increases to lower formant bandwidths; and (4) introduction of extra pole zeros in the vocal-tract transfer function associated with tracheal coupling. Perceptual validation of the relative importance of these cues for signaling a breathy voice quality has been accomplished using a new voicing source model for synthesis of more natural male and female voices. The new formant synthesizer, KLSYN88, is fully documented here. Results of the perception study indicate that, contrary to previous research which emphasizes the importance of increased amplitude of the fundamental component, aspiration noise is perceptually most important. Without its presence, increases to the fundamental component may induce the sensation of nasality in a high-pitched voice. Further results of the acoustic analysis include the observations that: (1) over the course of a sentence, the acoustic manifestations of breathiness vary considerably--tending to increase for unstressed syllables, in utterance-final syllables, and at the margins of voiceless consonants; (2) on average, females are more breathy than males, but there are very large differences between subjects within each gender; (3) many utterances appear to end in a "breathy-laryngealized" type of vibration; and (4) diplophonic irregularities in the timing of glottal periods occur frequently, especially at the end of an utterance. Diplophonia and other deviations from perfect periodicity may be important aspects of naturalness in synthesis.


Subject(s)
Communication Aids for Disabled , Self-Help Devices , Speech Perception/physiology , Voice Quality/physiology , Voice/physiology , Adolescent , Adult , Female , Humans , Male , Middle Aged , Sex Factors
7.
J Acoust Soc Am ; 54(4): 1102-4, 1973 Oct.
Article in English | MEDLINE | ID: mdl-4757455

Subject(s)
Phonetics , Voice , Humans , Time Factors
9.
J Acoust Soc Am ; 55(4): 822-32, 1974 Apr.
Article in English | MEDLINE | ID: mdl-4833078
11.
J Acoust Soc Am ; 40(1): 54-61, 1966 Jul.
Article in English | MEDLINE | ID: mdl-5941767
SELECTION OF CITATIONS
SEARCH DETAIL