Búsqueda | Biblioteca Virtual en Salud

1.

Correcting the record: Phonetic potential of primate vocal tracts and the legacy of Philip Lieberman (1934-2022).

Ekström, Axel G.

Am J Primatol ; 86(8): e23637, 2024 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-38741274

RESUMEN

The phonetic potential of nonhuman primate vocal tracts has been the subject of considerable contention in recent literature. Here, the work of Philip Lieberman (1934-2022) is considered at length, and two research papers-both purported challenges to Lieberman's theoretical work-and a review of Lieberman's scientific legacy are critically examined. I argue that various aspects of Lieberman's research have been consistently misinterpreted in the literature. A paper by Fitch et al. overestimates the would-be "speech-ready" capacities of a rhesus macaque, and the data presented nonetheless supports Lieberman's principal position-that nonhuman primates cannot articulate the full extent of human speech sounds. The suggestion that no vocal anatomical evolution was necessary for the evolution of human speech (as spoken by all normally developing humans) is not supported by phonetic or anatomical data. The second challenge, by Boë et al., attributes vowel-like qualities of baboon calls to articulatory capacities based on audio data; I argue that such "protovocalic" properties likely result from disparate articulatory maneuvers compared to human speakers. A review of Lieberman's scientific legacy by Boë et al. ascribes a view of speech evolution (which the authors term "laryngeal descent theory") to Lieberman, which contradicts his writings. The present article documents a pattern of incorrect interpretations of Lieberman's theoretical work in recent literature. Finally, the apparent trend of vowel-like formant dispersions in great ape vocalization literature is discussed with regard to Lieberman's theoretical work. The review concludes that the "Lieberman account" of primate vocal tract phonetic capacities remains supported by research: the ready articulation of fully human speech reflects species-unique anatomy.

Asunto(s)

Fonética , Primates , Vocalización Animal , Animales , Primates/fisiología , Primates/anatomía & histología , Humanos , Historia del Siglo XX , Habla/fisiología , Evolución Biológica

2.

Neural correlates of lexical-tone and vowel-quality processing in 6- and 9-month-old German-learning infants and adults.

Götz, Antonia; Männel, Claudia; Schwarzer, Gudrun; Krasotkina, Anna; Höhle, Barbara.

J Child Lang ; : 1-23, 2024 Apr 29.

Artículo en Inglés | MEDLINE | ID: mdl-38682697

RESUMEN

We examined the neurophysiological underpinnings of lexical-tone and vowel-quality perception in learners of a non-tonal language. We tested 25 6- and 25 9-month-old German-learning infants, as well as 24 German adults and expected developmental differences for the two linguistic properties, as they are both carried by vowels, but have a different status in German. In adults, both lexical-tone and vowel-quality contrasts elicited mismatch negativities, with a stronger response to the vowel-quality contrast. Six-month-olds showed positive mismatch responses for lexical-tone and vowel-quality contrasts, with an emerging negative mismatch response for vowel-quality only. The negative mismatch responses became more pronounced for the vowel-quality contrast at 9 months, while the lexical-tone contrast elicited mainly positive mismatch responses. Our data reveal differential developmental changes in the processing of vowel properties that differ in their lexical relevance in the ambient language.

3.

The Lombard effect in children with cochlear implants: suprasegmental aspects.

Okalidou, Areti; Peng, Z Ellen; Banioti, Aggeliki; Fourakis, Marios; Kyriafinis, Georgios.

Clin Linguist Phon ; : 1-21, 2024 Apr 28.

Artículo en Inglés | MEDLINE | ID: mdl-38679889

RESUMEN

Children with cochlear implants (CI) communicate in noisy environments, such as in classrooms, where multiple talkers and reverberation are present. Speakers compensate for noise via the 'Lombard effect'. The present study examined the Lombard effect on the intensity and duration of stressed vowels in the speech of children with Cochlear Implants (CIs) as compared to children with Normal Hearing (NH), focusing on the effects of speech-shaped noise (SSN) and speech-shaped noise with reverberation (SSN+Reverberation). The sample consisted of 7 children with CIs and 7 children with NH, aged 7-12 years. Regarding intensity, a) children with CIs produced stressed vowels with an overall greater intensity across acoustic conditions as compared to NH peers, b) both groups increased their stressed vowel intensity for all vowels from Quiet to both noise conditions, and c) children with NH further increased their intensity when reverberation was added to SSN, esp. for the vowel/u/. Regarding duration, longer stressed vowels were produced by children with CIs as compared to NH in Quiet and SSN conditions but the effect was retained only for the vowels/i/,/o/and/u/when reverberation was added to noise. The SSN+Reverberation condition induced systematic lengthening in stressed vowels for children with NH. Furthermore, although greater intensity and duration ratios of stressed/unstressed syllables were observed for children with NH as compared to CIs in Quiet condition, they diminished with noise. The differences observed across groups have implications for speaking in classroom noise.

4.

Cross-cultural evaluation of learning and memory using a consonant-vowel-consonant trigram list.

Ampofo, Prince; Katschke, Jessica L; Kadey, Kylie R; Dixon, Bradley J; Halter, Colt M; Moll, Allison C; Gattuso, Maria; Morganti, Francesca; Woodard, John L.

J Int Neuropsychol Soc ; 29(10): 922-932, 2023 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-37989558

RESUMEN

OBJECTIVE: Word list-learning tasks are commonly used to evaluate auditory-verbal learning and memory. However, different frequencies of word usage, subtle meaning nuances, unique word phonology, and different preexisting associations among words make translation across languages difficult. We administered lists of consonant-vowel-consonant (CVC) nonword trigrams to independent American and Italian young adult samples. We evaluated whether an auditory list-learning task using CVC nonword trigrams instead of words could be applied cross-culturally to evaluate similar learning and associative memory processes. PARTICIPANTS AND METHODS: Seventy-five native English-speaking (USA) and 104 native Italian-speaking (Italy) university students were administered 15-item lists of CVC trigrams using the Rey Auditory Verbal Learning Test paradigm with five study-test trials, an interference trial, and short- and long-term delayed recall. Bayesian t tests and mixed-design ANOVAs contrasted the primary learning indexes across the two samples and biological sex. RESULTS: Performance was comparable between nationalities on all primary memory indices except the interference trial (List B), where the Italian group recalled approximately one item more than the American sample. For both nationalities, recall increased across the five learning trials and declined significantly on the postinterference trial, demonstrating susceptibility to retroactive interference. No effects of sex, age, vocabulary, or depressive symptoms were observed. CONCLUSIONS: Using lists of unfamiliar nonword CVC trigrams, Italian and American younger adults showed a similar performance pattern across immediate and delayed recall trials. Whereas word list-learning performance is typically affected by cultural, demographic, mood, and cognitive factors, this trigram list-learning task does not show such effects, demonstrating its utility for cross-cultural memory assessment.

Asunto(s)

Comparación Transcultural , Aprendizaje , Adulto Joven , Humanos , Teorema de Bayes , Memoria , Aprendizaje Verbal , Recuerdo Mental

5.

Musical training alters neural processing of tones and vowels in classic Chinese poems.

Zhang, Zhenghua; Zhang, Hang; Sommer, Werner; Yang, Xiaohong; Wei, Zhen; Li, Weijun.

Brain Cogn ; 166: 105952, 2023 03.

Artículo en Inglés | MEDLINE | ID: mdl-36641937

RESUMEN

Long-term rigorous musical training promotes various aspects of spoken language processing. However, it is unclear whether musical training provides an advantage in recognizing segmental and suprasegmental information of spoken language. We used vowel and tone violations in spoken unfamiliar seven-character quatrains and a rhyming judgment task to investigate the effects of musical training on tone and vowel processing by recording ERPs. Compared with non-musicians, musicians were more accurate and responded faster to incorrect than correct tones. Musicians showed larger P2 components in their ERPs than non-musicians during both tone and vowel processing, revealing increased focused attention on sounds. Both groups showed enhanced N400 and LPC for incorrect vowels (vs. correct vowels) but non-musicians showed an additional P2 effect for vowel violations. Moreover, both groups showed enhanced LPC for incorrect tones (vs. correct tones) but only non-musicians showed an additional N400 effect for tone violations. These results indicate that vowel/tone processing is less effortful for musicians (vs. non-musicians). Our study suggests that long-term musical training facilitates speech tone and vowel processing in a tonal language environment by increasing the attentional focus on speech and reducing demands for detecting incorrect vowels and integration costs for tone changes.

Asunto(s)

Música , Percepción del Habla , Femenino , Humanos , Masculino , Estimulación Acústica , Electroencefalografía , Potenciales Evocados , Lenguaje , Poesía como Asunto

6.

Double trouble: Using spellings of different lengths to represent vowel length in English.

Altmiller, Ruth; Treiman, Rebecca; Kessler, Brett.

J Exp Child Psychol ; 231: 105649, 2023 07.

Artículo en Inglés | MEDLINE | ID: mdl-36871325

RESUMEN

Much previous research on spelling and reading development has focused on single-syllable words. Here we examined disyllables, asking how learners of English mark the distinction between short and long first-syllable vowels by use of vowel digraphs and double-consonant digraphs. In a behavioral study, we asked participants in Grade 2 (n = 32, mean age â¼8 years), Grade 4 (n = 33, mean age â¼10 years), Grade 6 (n = 32, mean age â¼12 years), and university (n = 32; mean age â¼20 years) to spell nonwords with short and long first-syllable vowels. We found an increase across grade levels in use of vowel digraphs to represent long vowels, and we also found increasing use of double-consonant digraphs after short vowels. Participants generally avoided using both a vowel digraph and a following consonant digraph. In a vocabulary analysis, we examined use of vowel and double-consonant digraphs in the words to which readers of different grade levels are exposed. Children used vowel digraphs less often than anticipated on the basis of the vocabulary statistics, but university students used them at similar rates. For double-consonant digraphs after short vowels, rates of digraph use were lower in the behavioral data than in the vocabulary data even for university students. These results point to the difficulty of spelling a phoneme with multiple letters when those letters simultaneously spell another sound in a word. We discuss the results in terms of the roles of statistical learning and explicit instruction in the development of spelling.

Asunto(s)

Lenguaje , Fonética , Niño , Humanos , Adulto Joven , Adulto , Vocabulario , Aprendizaje , Lectura

7.

Improving non-native duration contrast with dichotic training in dyslexic and non-dyslexic individuals.

Bouhon, Margot; Ferreira, Claire; Bahuon, Sandy; Tillmann, Barbara; Bedoin, Nathalie.

Dyslexia ; 29(2): 151-158, 2023 May.

Artículo en Inglés | MEDLINE | ID: mdl-36840422

RESUMEN

Perceiving and producing English phonemic vowel length contrasts is challenging for non-native speakers. According to multi-time resolution models, endogenous slow/fast rhythms contribute, respectively, in the right/left hemispheres, to long/short acoustic cue processing. This study introduced a perceptual training method implementing dichotic stimulation to improve /i:/-/Éª/ processing by promoting hemispheric complementarity. Twenty non-dyslexic and 20 dyslexic French adults received 1 hr-training over 3 days. Productions were evaluated with pre-/post-tests. Training enhanced vowel duration contrast in word production by /i:/ lengthening and /Éª/ shortening in both groups. Adults with dyslexia compensated fewer /i:/ lengthening by /Éª/ shortening than did non-dyslexic adults. Transfer from perceptual training to production seems possible for foreign-language learning even in dyslexic adults. The extent to which dichotic presentation contributed to training effectiveness cannot be evaluated here, but the triggering of lengthening and shortening mechanisms suggests that lateralized complementary skills have been enhanced by dichotic stimulation.

Asunto(s)

Dislexia , Adulto , Humanos , Lenguaje , Aprendizaje

8.

Merger in Eivissan Catalan: an acoustic analysis of the vowel systems of young native speakers.

Hamann, Silke; Torres-Tamarit, Francesc.

Phonetica ; 80(1-2): 43-78, 2023 02 23.

Artículo en Inglés | MEDLINE | ID: mdl-37319340

RESUMEN

The vowel system of Catalan has been the focus of many studies, though work on the varieties spoken on the island of Eivissa (Ibiza) are scarce, with a single mention of the possible merger of the mid back vowels /o, É/ (Torres Torres, Marià. 1983. Aspectes del vocalisme tònic eivissenc. Eivissa 14. 22-23). The present article provides the first acoustic analysis of the vowel inventory of 25 young native speakers of Eivissan Catalan, with a focus on the realisations of stressed /É, É/, and the back mid vowels /o, É/. We employed Pillai scores (Hay, Jennifer, Paul Warren & Katie Drager. 2006. Factors influencing speech perception in the context of a merger-in-progress. Journal of Phonetics 34. 458-484) to compare the possibly merged pairs /É, É/ and /o, É/ to the fully-contrasting neighbouring pairs /e, É/ and /o, u/. Our results show that all participants had considerable overlap of stressed /É/ and /É/, and all but one had considerable overlap of the back mid vowels, while the fully contrastive pairs (/e, É/ and /o, u/) showed almost no overlap.

Asunto(s)

Acústica del Lenguaje , Percepción del Habla , Humanos , Lenguaje , Acústica , Fonética

9.

Vowel-internal cues to vowel quality and prominence in speech perception.

Steffman, Jeremy.

Phonetica ; 80(5): 329-356, 2023 10 26.

Artículo en Inglés | MEDLINE | ID: mdl-37650429

RESUMEN

This study examines how variation in F0 and intensity impacts the perception of American English vowels. Both properties vary intrinsically as a function of vowel features in the speech production literature, raising the question of the perceptual impact of each. In addition to considering listeners' interpretation of either cue as an intrinsic property of the vowel, the possible prominence-marking function of each is considered. Two patterns of prominence strengthening in vowels, sonority expansion and hyperarticulation, are tested in light of recent findings that contextual prominence impacts vowel perception in line with these effects (i.e. a prominent vowel is expected by listeners to be realized as if it had undergone prominence strengthening). Across four vowel contrasts with different height and frontness features, listeners categorized phonetic continua with variation in formants, F0 and intensity. Results show that variation in level F0 height is interpreted as an intrinsic cue by listeners. Higher F0 cues a higher vowel, following intrinsic F0 effects in the production literature. In comparison, intensity is interpreted as a prominence-lending cue, for which effect directionality is dependent on vowel height. Higher intensity high vowels undergo perceptual re-calibration in line with (acoustic) hyperarticulation, whereas higher intensity non-high vowels undergo perceptual re-calibration in line with sonority expansion.

Asunto(s)

Señales (Psicología) , Percepción del Habla , Humanos , Lenguaje , Habla , Fonética , Acústica del Lenguaje

10.

Vowel and consonant quantity in two Swiss German dialects and their corresponding varieties of Standard German: effects of region, age, and tempo.

Zebe, Franka.

Phonetica ; 80(3-4): 185-223, 2023 06 27.

Artículo en Inglés | MEDLINE | ID: mdl-37418310

RESUMEN

The diglossic situation in German-speaking Switzerland entails that both an Alemannic dialect and a Swiss standard variety of German are spoken. One phonological property of both Alemannic and Swiss Standard German (SSG) is contrastive quantity not only in vowels but also in consonants, namely lenis and fortis. This study aims to compare vowel and plosive closure durations as well as articulation rate (AR) between Alemannic and SSG in the varieties spoken in a rural area of the canton of Lucerne (LU) and an urban area of the canton of Zurich (ZH). In addition to the segment durations, an additional measure of vowel-to-vowel + consonant duration (V/(V + C)) ratios is calculated in order to account for possible compensation between vowel and closure durations. Stimuli consisted of words containing different vowel-consonant (VC) combinations. The main differences found are longer segment durations in Alemannic compared to SSG, three phonetic vowel categories in Alemannic that differ between LU and ZH, three stable V/(V + C) ratio categories, and three phonetic consonant categories lenis, fortis, and extrafortis in both Alemannic and SSG. Most importantly, younger ZH speakers produced overall shorter closure durations, calling into question a possible reduction of consonant categories due to a contact to German Standard German (GSG).

Asunto(s)

Lenguaje , Fonética , Humanos , Suiza , Factores de Tiempo , Medicago sativa

11.

Individual differences in attention control and the processing of phonological contrasts in a second language.

Mora, Joan C; Darcy, Isabelle.

Phonetica ; 80(3-4): 153-184, 2023 06 27.

Artículo en Inglés | MEDLINE | ID: mdl-37341707

RESUMEN

This study investigated attention control in L2 phonological processing from a cognitive individual differences perspective, to determine its role in predicting phonological acquisition in adult L2 learning. Participants were 21 L1-Spanish learners of English, and 19 L1-English learners of Spanish. Attention control was measured through a novel speech-based attention-switching task. Phonological processing was assessed through a speeded ABX categorization task (perception) and a delayed sentence repetition task (production). Correlational analyses indicated that learners with more efficient attention switching skill and faster speed in correctly identifying the target phonetic features in the speech dimension under focus could perceptually discriminate L2 vowels at higher processing speed, but not at higher accuracy rates. Thus, attentional flexibility provided a processing advantage for difficult L2 contrasts but did not predict the extent to which precise representations for the target L2 vowels had been established. However, attention control was related to L2 learners' ability to distinguish the contrasting L2 vowels in production. In addition, L2 learners' accuracy in perceptually distinguishing between two contrasting vowels was significantly related to how much of a quality distinction between them they could make in production.

Asunto(s)

Multilingüismo , Percepción del Habla , Adulto , Humanos , Individualidad , Lenguaje , Fonética , Atención

12.

Parents tune their vowels to the emergence of children's words.

Odijk, Lotte; Gillis, Steven.

J Child Lang ; 50(5): 1184-1203, 2023 09.

Artículo en Inglés | MEDLINE | ID: mdl-35758136

RESUMEN

The aim of this study was to investigate the acoustic vowel space area in infant directed speech (IDS). The research question is whether the vowel space is expanded or remains constant in IDS. A corpus of spontaneous interactions of 9 dyads followed monthly from the age of 6 to 24 months was analyzed. The occurrences in the parents' speech of each word that the children eventually acquired were extracted. The surface of the vowel triangle and the convex hull of all vowels were computed. The main result is that the development of the vowel space in IDS follows an inverted U-shaped curve: the vowel space starts relatively small, gradually increases as the child's first word use approaches, and decreases again afterwards. These findings show that parents adapt their articulation to the evolving linguistic abilities of their child, and this adaptation can be detected at the level of individual lexical items.

Asunto(s)

Desarrollo del Lenguaje , Percepción del Habla , Lactante , Humanos , Niño , Preescolar , Fonética , Lenguaje Infantil , Habla , Padres , Acústica del Lenguaje

13.

A practical guide to calculating vocal tract length and scale-invariant formant patterns.

Anikin, Andrey; Barreda, Santiago; Reby, David.

Behav Res Methods ; 2023 Dec 29.

Artículo en Inglés | MEDLINE | ID: mdl-38158551

RESUMEN

Formants (vocal tract resonances) are increasingly analyzed not only by phoneticians in speech but also by behavioral scientists studying diverse phenomena such as acoustic size exaggeration and articulatory abilities of non-human animals. This often involves estimating vocal tract length acoustically and producing scale-invariant representations of formant patterns. We present a theoretical framework and practical tools for carrying out this work, including open-source software solutions included in R packages soundgen and phonTools. Automatic formant measurement with linear predictive coding is error-prone, but formant_app provides an integrated environment for formant annotation and correction with visual and auditory feedback. Once measured, formants can be normalized using a single recording (intrinsic methods) or multiple recordings from the same individual (extrinsic methods). Intrinsic speaker normalization can be as simple as taking formant ratios and calculating the geometric mean as a measure of overall scale. The regression method implemented in the function estimateVTL calculates the apparent vocal tract length assuming a single-tube model, while its residuals provide a scale-invariant vowel space based on how far each formant deviates from equal spacing (the schwa function). Extrinsic speaker normalization provides more accurate estimates of speaker- and vowel-specific scale factors by pooling information across recordings with simple averaging or mixed models, which we illustrate with example datasets and R code. The take-home messages are to record several calls or vowels per individual, measure at least three or four formants, check formant measurements manually, treat uncertain values as missing, and use the statistical tools best suited to each modeling context.

14.

Production of Mandarin consonant aspiration and monophthongs in children with Autism Spectrum Disorder.

Feng, Yan; Chen, Fei; Ma, Junzhou; Wang, Lan; Peng, Gang.

Clin Linguist Phon ; 37(10): 899-918, 2023 Oct 03.

Artículo en Inglés | MEDLINE | ID: mdl-35848409

RESUMEN

Impaired speech sound production adds difficulties to social communication in children with Autism Spectrum Disorder (ASD), while a limited attempt has been made to figure out the speech sound production among Mandarin-speaking children with ASD. The current study conducted both auditory-perceptual scoring and quantitative acoustic analysis of speech sound imitated by 27 Mandarin-speaking children with ASD (3.33-7.00 years) and 30 chronological-age-matched typically developing (TD) children. Auditory-perceptual scoring showed significantly lower scores for aspirated/unaspirated consonants and monophthongs in children with ASD. Moreover, the correlation between the developmental age of language and production accuracy in children with ASD emphasised the importance of language assessment. The quantitative acoustic analysis further indicated that the ASD group produced a much shorter voice onset time for aspirated consonants and showed a reduced vowel space than the TD group. Early interventions focusing on these production patterns should be introduced to improve the speech sound production in Mandarin-speaking children with ASD.

15.

Speech production in Mandarin-speaking children with cochlear implants: a systematic review.

Li, Jiaying; Mayr, Robert; Zhao, Fei.

Int J Audiol ; 61(9): 711-719, 2022 09.

Artículo en Inglés | MEDLINE | ID: mdl-34620034

RESUMEN

OBJECTIVE: This study aimed to systematically review and critically appraise the literature describing the phonetic characteristics and accuracy of the consonants, vowels and tones produced by Mandarin-speaking children with cochlear implants (CIs). DESIGN: The protocol in this review was designed in conformity with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. EBSCOhost, PubMed, Scopus, PsycINFO, ProQuest Central databases were searched for relevant articles which met the inclusion criteria. STUDY SAMPLE: A total of 18 journal papers were included in this review. RESULTS: The results revealed that Mandarin-speaking children with CIs perform consistently more poorly in their production of consonants, in particular on fricatives, have a smaller and less well-defined vowel space, and exhibit greater difficulties in tone realisation, notably T2 and T3, when compared to their normal-hearing (NH) peers. The results from acoustic and accuracy analyses are negatively correlated with CI implantation age, but largely positively correlated with hearing age. CONCLUSIONS: Findings of this review highlight the factors that influence consonant, vowel and tone production in Mandarin-speaking children with CIs, thereby providing critical information for clinicians and researchers working with this population.

Asunto(s)

Implantación Coclear , Implantes Cocleares , Sordera , Percepción del Habla , Niño , Implantación Coclear/métodos , Sordera/cirugía , Humanos , Fonética , Habla

16.

Acoustic characteristics of fricatives, amplitude of formants and clarity of speech produced without and with a medical mask.

Nguyen, Duy Duong; Chacon, Antonia; Payten, Christopher; Black, Rebecca; Sheth, Meet; McCabe, Patricia; Novakovic, Daniel; Madill, Catherine.

Int J Lang Commun Disord ; 57(2): 366-380, 2022 03.

Artículo en Inglés | MEDLINE | ID: mdl-35166414

RESUMEN

BACKGROUND: Previous research has found that high-frequency energy of speech signals decreased while wearing face masks. However, no study has examined the specific spectral characteristics of fricative consonants and vowels and the perception of clarity of speech in mask wearing. AIMS: To investigate acoustic-phonetic characteristics of fricative consonants and vowels and auditory perceptual rating of clarity of speech produced with and without wearing a face mask. METHODS & PROCEDURES: A total of 16 healthcare workers read the Rainbow Passage using modal phonation in three conditions: without a face mask, with a standard surgical mask and with a KN95 mask (China GB2626-2006, a medical respirator with higher barrier level than the standard surgical mask). Speech samples were acoustically analysed for root mean square (RMS) amplitude (ARMS ) and spectral moments of four fricatives /f/, /s/, /Ê/ and /z/; and amplitude of the first three formants (A1, A2 and A3) measured from the reading passage and extracted vowels. Auditory perception of speech clarity was performed. Data were compared across mask and non-mask conditions using linear mixed models. OUTCOMES & RESULTS: The ARMS of all included fricatives was significantly lower in surgical mask and KN95 mask compared with non-mask condition. Centre of gravity of /f/ decreased in both surgical and KN95 mask while other spectral moments did not show systematic significant linear trends across mask conditions. None of the formant amplitude measures was statistically different across conditions. Speech clarity was significantly poorer in both surgical and KN95 mask conditions. CONCLUSIONS & IMPLICATIONS: Speech produced while wearing either a surgical mask or KN95 mask was associated with decreased fricative amplitude and poorer speech clarity. WHAT THIS PAPER ADDS: What is already known on the subject Previous studies have shown that the overall spectral levels in high frequency ranges and intelligibility are decreased for speech produced with a face mask. It is unclear how different types of the speech signals that is, fricatives and vowels are presented in speech produced with wearing either a medical surgical or KN95 mask. It is also unclear whether ratings of speech clarity are similar for speech produced with these face masks. What this paper adds to existing knowledge Speech data collected using a real-world, clinical and non-laboratory-controlled settings showed differences in the amplitude of fricatives and speech clarity ratings between non-mask and mask-wearing conditions. Formant amplitude did not show significant differences in mask-wearing conditions compared with non-mask. What are the potential or actual clinical implications of this work? Wearing a surgical mask or a KN95 mask had different effects on consonants and vowels. It appeared from the findings in this study that these masks only affected fricative consonants and did not affect vowel production. The poorer speech clarity in these mask-wearing conditions has important implications for speech perception in communication between clinical staff and between medical officers and patients in clinics, and between people in everyday situations. The impact of these masks on speech perception may be more pronounced in people with hearing impairment and communication disorders. In voice evaluation and/or therapy sessions, the effects of wearing a medical mask can occur bidirectionally for both the clinician and the patient. The patient may find it more challenging to understand the speech conveyed by the clinician while the clinician may not perceptually assess patient's speech and voice accurately. Given the significant correlation between clarity ratings and fricative amplitude, improving fricative signals would be useful to improve speech clarity while wearing these medical face masks.

Asunto(s)

Percepción del Habla , Habla , Acústica , Humanos , Fonética , Acústica del Lenguaje , Trastornos del Habla

17.

Training the pronunciation of L2 vowels under different conditions: the use of non-lexical materials and masking noise.

Mora, Joan C; Ortega, Mireia; Mora-Plaza, Ingrid; Aliaga-García, Cristina.

Phonetica ; 79(1): 1-43, 2022 04 15.

Artículo en Inglés | MEDLINE | ID: mdl-35427446

RESUMEN

The current study extends traditional perceptual high-variability phonetic training (HVPT) in a foreign language learning context by implementing a comprehensive training paradigm that combines perception (discrimination and identification) and production (immediate repetition) training tasks and by exploring two potentially enhancing training conditions: the use of non-lexical training stimuli and the presence of masking noise during production training. We assessed training effects on L1-Spanish/Catalan bilingual EFL learners' production of a difficult English vowel contrast (/æ/-/Ê/). The participants (N = 62) were randomly assigned to either non-lexical (N = 24) or lexical (N = 24) training and were further subdivided into two groups, one trained in noise (N = 12) and one in silence (N = 12). An untrained control group (N = 14) was also tested. Training gains, measured through spectral distance scores (Euclidean distances) with respect to native speakers' productions of /æ/ and /Ê/, were assessed through delayed word and sentence repetition tasks. The results showed an advantage of non-lexical training over lexical training, detrimental effects of noise for participants trained with nonwords, but not for those trained with words, and less accurate production of vowels elicited in isolated words than in words embedded in sentences, where training gains were only observable for participants trained with nonwords.

Asunto(s)

Multilingüismo , Percepción del Habla , Humanos , Lenguaje , Ruido , Fonética

18.

How is vowel production in Italian affected by geminate consonants and stress patterns?

Colombo, Lucia; Infanti, Michela; Arciuli, Joanne.

J Child Lang ; : 1-19, 2022 Nov 03.

Artículo en Inglés | MEDLINE | ID: mdl-36325972

RESUMEN

Italian vowels have a shorter duration before a geminate than before a singleton consonant, but a longer duration in syllables carrying stress. We asked whether children can produce the differentiation in vowel duration in singleton/geminate contexts reported for adults and whether their production changes depending on position of primary stress. Italian children (three-to-six-year-olds) and adults performed a nonword repetition. Each nonword appeared in four contexts, with the stressed/unstressed vowel preceding/following the singleton/geminate: /pa'paso/, /pap'paso/, 'papaso/, /'pappaso/. Acoustic analyses on the duration of the vowel preceding (V1) and following (V2) the medial consonant showed a type of consonant by age group interaction: the difference in vowel duration between children and adults was greater for geminate than singleton contexts, and was greater when the vowel carried stress. When V1 carried stress, its duration was shorter in the geminate than in the singleton in adults and older children, not in younger children.

19.

Effect of FFP2/3 Masks on Voice Range Profile Measurement and Voice Acoustics in Routine Voice Diagnostics.

Ho, Guan-Yuh; Kansy, Ines Kristina; Klavacs, Katharina Anna; Leonhard, Matthias; Schneider-Stickler, Berit.

Folia Phoniatr Logop ; 74(5): 335-344, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-35344948

RESUMEN

INTRODUCTION: Voice diagnostics including voice range profile (VRP) measurement and acoustic voice analysis is essential in laryngology and phoniatrics. Due to COVID-19 pandemic, wearing of 2 or 3 filtering face piece (FFP2/3) masks is recommended when high-risk aerosol-generating procedures like singing and speaking are being performed. Goal of this study was to compare VRP parameters when performed without and with FFP2/3 masks. Further, formant analysis for sustained vowels, singer's formant, and analysis of reading standard text samples were performed without/with FFP2/3 masks. METHODS: Twenty subjects (6 males and 14 females) were enrolled in this study with an average age of 36 ± 16 years (mean ± SD). Fourteen patients were rated as euphonic/not hoarse and 6 patients as mildly hoarse. All subjects underwent the VRP measurements, vowel, and text recordings without/with FFP2/3 mask using the software DiVAS by XION medical (Berlin, Germany). Voice range of singing voice, equivalent of voice extension measure (eVEM), fundamental frequency (F0), sound pressure level (SPL) of soft speaking and shouting were calculated and analyzed. Maximum phonation time (MPT) and jitter-% were included for Dysphonia Severity Index (DSI) measurement. Analyses of singer's formant were performed. Spectral analyses of sustained vowels /a:/, /i:/, and /u:/ (first = F1 and second = F2 formants), intensity of long-term average spectrum, and alpha-ratio were calculated using the freeware praat. RESULTS: For all subjects, the mean values of routine voice parameters without/with mask were analyzed: no significant differences were found in results of singing voice range, eVEM, SPL, and frequency of soft speaking/shouting, except significantly lower mean SPL of shouting with FFP2/3 mask, in particular that of the female subjects (p = 0.002). Results of MPT, jitter, and DSI without/with FFP2/3 mask showed no significant differences. Further mean values analyzed without/with mask were ratio singer's formant/loud singing, with lower ratio with FFP2/3 mask (p = 0.001), and F1 and F2 of /a:/, /i:/, /u:/, with no significant differences of the results, with the exception of F2 of /i:/ with lower value with FFP2/3 mask (p = 0.005). With the exceptions mentioned, the t test revealed no significant differences for each of the routine parameters tested in the recordings without and with wearing a FFP2/3 mask. CONCLUSION: It can be concluded that VRP measurements including DSI performed with FFP2/3 masks provide reliable data in clinical routine with respect to voice condition/constitution. Spectral analyses of sustained vowel, text, and singer's formant will be affected by wearing FFP2/3 masks.

Asunto(s)

Acústica , Máscaras , Voz , Adulto , COVID-19 , Prueba de COVID-19 , Femenino , Humanos , Masculino , Persona de Mediana Edad , Pandemias , Fonación , Acústica del Lenguaje , Adulto Joven

20.

Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures.

Costantini, Giovanni; Dr, Valerio Cesarini; Robotti, Carlo; Benazzo, Marco; Pietrantonio, Filomena; Di Girolamo, Stefano; Pisani, Antonio; Canzi, Pietro; Mauramati, Simone; Bertino, Giulia; Cassaniti, Irene; Baldanti, Fausto; Saggio, Giovanni.

Knowl Based Syst ; 253: 109539, 2022 Oct 11.

Artículo en Inglés | MEDLINE | ID: mdl-35915642

RESUMEN

Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects versus positive or recovered subjects. The scope is to identify potential discriminating features, highlight mid and short-term effects of COVID on the voice and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using Random Forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-Spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: Adaboost and CNN differentiated COVID-19 positive from negative with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility to identify COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis.

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

Detalles de la búsqueda