Search | BVS Search Portal

1.

Prelingually Deaf Children With Cochlear Implants Show Better Perception of Voice Cues and Speech in Competing Speech Than Postlingually Deaf Adults With Cochlear Implants.

Nagels, Leanne; Gaudrain, Etienne; Vickers, Deborah; Hendriks, Petra; Baskent, Deniz.

Ear Hear ; 45(4): 952-968, 2024.

Article in English | MEDLINE | ID: mdl-38616318

ABSTRACT

OBJECTIVES: Postlingually deaf adults with cochlear implants (CIs) have difficulties with perceiving differences in speakers' voice characteristics and benefit little from voice differences for the perception of speech in competing speech. However, not much is known yet about the perception and use of voice characteristics in prelingually deaf implanted children with CIs. Unlike CI adults, most CI children became deaf during the acquisition of language. Extensive neuroplastic changes during childhood could make CI children better at using the available acoustic cues than CI adults, or the lack of exposure to a normal acoustic speech signal could make it more difficult for them to learn which acoustic cues they should attend to. This study aimed to examine to what degree CI children can perceive voice cues and benefit from voice differences for perceiving speech in competing speech, comparing their abilities to those of normal-hearing (NH) children and CI adults. DESIGN: CI children's voice cue discrimination (experiment 1), voice gender categorization (experiment 2), and benefit from target-masker voice differences for perceiving speech in competing speech (experiment 3) were examined in three experiments. The main focus was on the perception of mean fundamental frequency (F0) and vocal-tract length (VTL), the primary acoustic cues related to speakers' anatomy and perceived voice characteristics, such as voice gender. RESULTS: CI children's F0 and VTL discrimination thresholds indicated lower sensitivity to differences compared with their NH-age-equivalent peers, but their mean discrimination thresholds of 5.92 semitones (st) for F0 and 4.10 st for VTL indicated higher sensitivity than postlingually deaf CI adults with mean thresholds of 9.19 st for F0 and 7.19 st for VTL. Furthermore, CI children's perceptual weighting of F0 and VTL cues for voice gender categorization closely resembled that of their NH-age-equivalent peers, in contrast with CI adults. 
Finally, CI children had more difficulties in perceiving speech in competing speech than their NH-age-equivalent peers, but they performed better than CI adults. Unlike CI adults, CI children showed a benefit from target-masker voice differences in F0 and VTL, similar to NH children. CONCLUSION: Although CI children's F0 and VTL voice discrimination scores were overall lower than those of NH children, their weighting of F0 and VTL cues for voice gender categorization and their benefit from target-masker differences in F0 and VTL resembled that of NH children. Together, these results suggest that prelingually deaf implanted CI children can effectively utilize spectrotemporally degraded F0 and VTL cues for voice and speech perception, generally outperforming postlingually deaf CI adults in comparable tasks. These findings underscore the presence of F0 and VTL cues in the CI signal to a certain degree and suggest other factors contributing to the perception challenges faced by CI adults.

Subjects

Cochlear Implantation , Cochlear Implants , Cues , Deafness , Speech Perception , Humans , Deafness/rehabilitation , Male , Female , Child , Adult , Young Adult , Adolescent , Voice/physiology , Case-Control Studies , Child, Preschool , Middle Aged

2.

Perception of voice cues in school-age children with hearing aids.

Babaoglu, Gizem; Rachman, Laura; Ertürk, Pinar; Özkisi Yazgan, Basak; Sennaroglu, Gonca; Gaudrain, Etienne; Baskent, Deniz.

J Acoust Soc Am ; 155(1): 722-741, 2024 01 01.

Article in English | MEDLINE | ID: mdl-38284822

ABSTRACT

The just-noticeable differences (JNDs) of the voice cues of voice pitch (F0) and vocal-tract length (VTL) were measured in school-aged children with bilateral hearing aids and children and adults with normal hearing. The JNDs were larger for hearing-aided than normal-hearing children up to the age of 12 for F0 and into adulthood for all ages for VTL. Age was a significant factor for both groups for F0 JNDs, but only for the hearing-aided group for VTL JNDs. Age of maturation was later for F0 than VTL. Individual JNDs of the two groups largely overlapped for F0, but little for VTL. Hearing thresholds (unaided or aided, 500-4000 Hz, overlapping with mid-range speech frequencies) did not correlate with the JNDs. However, extended low-frequency hearing thresholds (unaided, 125-250 Hz, overlapping with voice F0 ranges) correlated with the F0 JNDs. Hence, age and hearing status differentially interact with F0 and VTL perception, and VTL perception seems challenging for hearing-aided children. On the other hand, even children with profound hearing loss could do the task, indicating a hearing aid benefit for voice perception. Given the significant age effect and that for F0 the hearing-aided children seem to be catching up with age-typical development, voice cue perception may continue developing in hearing-aided children.

Subjects

Hearing Aids , Voice , Adult , Child , Humans , Cues , Speech , Differential Threshold

3.

Relationship between irregularities in spontaneous otoacoustic emissions suppression and psychophysical tuning curves.

Engler, Sina; Gaudrain, Etienne; de Kleine, Emile; van Dijk, Pim.

J Acoust Soc Am ; 151(2): 1055, 2022 02.

Article in English | MEDLINE | ID: mdl-35232113

ABSTRACT

The suppression of spontaneous otoacoustic emissions (SOAEs) allows the objective evaluation of cochlear frequency selectivity by determining the suppression tuning curve (STC). Interestingly, some STCs have additional sidelobes at the high frequency flank, which are thought to result from interaction between the probe tone and the cochlear standing wave corresponding to the SOAE being suppressed. Sidelobes are often in regions of other neighboring SOAEs but can also occur in the absence of any other SOAE. The aim of this study was to compare STCs and psychoacoustic tuning curves (PTCs). Therefore, STCs and PTCs were measured in: (1) subjects in which the STC had a sidelobe, and (2) subjects without STC sidelobes. Additionally, PTCs were measured in subjects without SOAEs. Across participant groups, the quality factor Q10dB of the PTCs was similar, independently from whether SOAEs were present or absent. Thus, the presence of an SOAE does not provide enhanced frequency selectivity at the emission frequency. Moreover, both PTC and STC show irregularities, but these are not related in a straightforward way. This suggests that different mechanisms cause these irregularities.

Subjects

Cochlea , Otoacoustic Emissions, Spontaneous , Acoustic Stimulation , Humans , Psychoacoustics

4.

Vocal and semantic cues for the segregation of long concurrent speech stimuli in diotic and dichotic listening-The Long-SWoRD test.

Huet, Moïra-Phoebé; Micheyl, Christophe; Gaudrain, Etienne; Parizet, Etienne.

J Acoust Soc Am ; 151(3): 1557, 2022 03.

Article in English | MEDLINE | ID: mdl-35364949

ABSTRACT

It is not always easy to follow a conversation in a noisy environment. To distinguish between two speakers, a listener must mobilize many perceptual and cognitive processes to maintain attention on a target voice and avoid shifting attention to the background noise. The development of an intelligibility task with long stimuli-the Long-SWoRD test-is introduced. This protocol allows participants to fully benefit from the cognitive resources, such as semantic knowledge, to separate two talkers in a realistic listening environment. Moreover, this task also provides the experimenters with a means to infer fluctuations in auditory selective attention. Two experiments document the performance of normal-hearing listeners in situations where the perceptual separability of the competing voices ranges from easy to hard using a combination of voice and binaural cues. The results show a strong effect of voice differences when the voices are presented diotically. In addition, analyzing the influence of the semantic context on the pattern of responses indicates that the semantic information induces a response bias in situations where the competing voices are distinguishable and indistinguishable from one another.

Subjects

Speech Perception , Speech , Cues , Humans , Perceptual Masking , Semantics , Speech Perception/physiology

5.

Effect of Spectral Contrast Enhancement on Speech-on-Speech Intelligibility and Voice Cue Sensitivity in Cochlear Implant Users.

El Boghdady, Nawal; Langner, Florian; Gaudrain, Etienne; Baskent, Deniz; Nogueira, Waldo.

Ear Hear ; 42(2): 271-289, 2021.

Article in English | MEDLINE | ID: mdl-32925307

ABSTRACT

OBJECTIVES: Speech intelligibility in the presence of a competing talker (speech-on-speech; SoS) presents more difficulties for cochlear implant (CI) users compared with normal-hearing listeners. A recent study implied that these difficulties may be related to CI users' low sensitivity to two fundamental voice cues, namely, the fundamental frequency (F0) and the vocal tract length (VTL) of the speaker. Because of the limited spectral resolution in the implant, important spectral cues carrying F0 and VTL information are expected to be distorted. This study aims to address two questions: (1) whether spectral contrast enhancement (SCE), previously shown to enhance CI users' speech intelligibility in the presence of steady state background noise, could also improve CI users' SoS intelligibility, and (2) whether such improvements in SoS from SCE processing are due to enhancements in CI users' sensitivity to F0 and VTL differences between the competing talkers. DESIGN: The effect of SCE on SoS intelligibility and comprehension was measured in two separate tasks in a sample of 14 CI users with Cochlear devices. In the first task, the CI users were asked to repeat the sentence spoken by the target speaker in the presence of a single competing talker. The competing talker was the same target speaker whose F0 and VTL were parametrically manipulated to obtain the different experimental conditions. SoS intelligibility, in terms of the percentage of correctly repeated words from the target sentence, was assessed using the standard advanced combination encoder (ACE) strategy and SCE for each voice condition. In the second task, SoS comprehension accuracy and response times were measured using the same experimental setup as in the first task, but with a different corpus. In the final task, CI users' sensitivity to F0 and VTL differences were measured for the ACE and SCE strategies. 
The benefit in F0 and VTL discrimination from SCE processing was evaluated with respect to the improvement in SoS perception from SCE. RESULTS: While SCE demonstrated the potential of improving SoS intelligibility in CI users, this effect appeared to stem from SCE improving the overall signal to noise ratio in SoS rather than improving the sensitivity to the underlying F0 and VTL differences. A second key finding of this study was that, contrary to what has been observed in a previous study for childlike voice manipulations, F0 and VTL manipulations of a reference female speaker (target speaker) toward male-like voices provided a small but significant release from masking for the CI users tested. CONCLUSIONS: The present findings, together with those previously reported in the literature, indicate that SCE could serve as a possible background-noise-reduction strategy in commercial CI speech processors that could enhance speech intelligibility especially in the presence of background talkers that have longer VTLs compared with the target speaker.

Subjects

Cochlear Implantation , Cochlear Implants , Speech Perception , Cues , Female , Humans , Male , Speech Intelligibility

6.

The effects of lexical content, acoustic and linguistic variability, and vocoding on voice cue perception.

Koelewijn, Thomas; Gaudrain, Etienne; Tamati, Terrin; Baskent, Deniz.

J Acoust Soc Am ; 150(3): 1620, 2021 09.

Article in English | MEDLINE | ID: mdl-34598602

ABSTRACT

Perceptual differences in voice cues, such as fundamental frequency (F0) and vocal tract length (VTL), can facilitate speech understanding in challenging conditions. Yet, we hypothesized that in the presence of spectrotemporal signal degradations, as imposed by cochlear implants (CIs) and vocoders, acoustic cues that overlap for voice perception and phonemic categorization could be mistaken for one another, leading to a strong interaction between linguistic and indexical (talker-specific) content. Fifteen normal-hearing participants performed an odd-one-out adaptive task measuring just-noticeable differences (JNDs) in F0 and VTL. Items used were words (lexical content) or time-reversed words (no lexical content). The use of lexical content was either promoted (by using variable items across comparison intervals) or not (fixed item). Finally, stimuli were presented without or with vocoding. Results showed that JNDs for both F0 and VTL were significantly smaller (better) for non-vocoded compared with vocoded speech and for fixed compared with variable items. Lexical content (forward vs reversed) affected VTL JNDs in the variable item condition, but F0 JNDs only in the non-vocoded, fixed condition. In conclusion, lexical content had a positive top-down effect on VTL perception when acoustic and linguistic variability was present but not on F0 perception. Lexical advantage persisted in the most degraded conditions and vocoding even enhanced the effect of item variability, suggesting that linguistic content could support compensation for poor voice perception in CI users.

Subjects

Cochlear Implants , Speech Perception , Acoustic Stimulation , Acoustics , Cues , Humans , Linguistics

7.

School-age children benefit from voice gender cue differences for the perception of speech in competing speech.

Nagels, Leanne; Gaudrain, Etienne; Vickers, Deborah; Hendriks, Petra; Baskent, Deniz.

J Acoust Soc Am ; 149(5): 3328, 2021 05.

Article in English | MEDLINE | ID: mdl-34241121

ABSTRACT

Differences in speakers' voice characteristics, such as mean fundamental frequency (F0) and vocal-tract length (VTL), that primarily define speakers' so-called perceived voice gender facilitate the perception of speech in competing speech. Perceiving speech in competing speech is particularly challenging for children, which may relate to their lower sensitivity to differences in voice characteristics than adults. This study investigated the development of the benefit from F0 and VTL differences in school-age children (4-12 years) for separating two competing speakers while tasked with comprehending one of them and also the relationship between this benefit and their corresponding voice discrimination thresholds. Children benefited from differences in F0, VTL, or both cues at all ages tested. This benefit proportionally remained the same across age, although overall accuracy continued to differ from that of adults. Additionally, children's benefit from F0 and VTL differences and their overall accuracy were not related to their discrimination thresholds. Hence, although children's voice discrimination thresholds and speech in competing speech perception abilities develop throughout the school-age years, children already show a benefit from voice gender cue differences early on. Factors other than children's discrimination thresholds seem to relate more closely to their developing speech in competing speech perception abilities.

Subjects

Speech Perception , Voice , Adult , Child , Child, Preschool , Cues , Humans , Schools , Speech , Speech Acoustics

8.

Does good perception of vocal characteristics relate to better speech-on-speech intelligibility for cochlear implant users?

El Boghdady, Nawal; Gaudrain, Etienne; Baskent, Deniz.

J Acoust Soc Am ; 145(1): 417, 2019 01.

Article in English | MEDLINE | ID: mdl-30710943

ABSTRACT

Differences in voice pitch (F0) and vocal tract length (VTL) improve intelligibility of speech masked by a background talker (speech-on-speech; SoS) for normal-hearing (NH) listeners. Cochlear implant (CI) users, who are less sensitive to these two voice cues compared to NH listeners, experience difficulties in SoS perception. Three research questions were addressed: (1) whether increasing the F0 and VTL difference (ΔF0; ΔVTL) between two competing talkers benefits CI users in SoS intelligibility and comprehension, (2) whether this benefit is related to their F0 and VTL sensitivity, and (3) whether their overall SoS intelligibility and comprehension are related to their F0 and VTL sensitivity. Results showed: (1) CI users did not benefit in SoS perception from increasing ΔF0 and ΔVTL; increasing ΔVTL had a slightly detrimental effect on SoS intelligibility and comprehension. Results also showed: (2) the effect from increasing ΔF0 on SoS intelligibility was correlated with F0 sensitivity, while the effect from increasing ΔVTL on SoS comprehension was correlated with VTL sensitivity. Finally, (3) the sensitivity to both F0 and VTL, and not only one of them, was found to be correlated with overall SoS performance, elucidating important aspects of voice perception that should be optimized through future coding strategies.

Subjects

Cochlear Implants/standards , Speech Acoustics , Speech Intelligibility , Speech Perception , Adult , Aged , Cochlear Implants/adverse effects , Female , Humans , Male , Middle Aged , Signal-To-Noise Ratio , Voice

9.

Discrimination of Voice Pitch and Vocal-Tract Length in Cochlear Implant Users.

Gaudrain, Etienne; Baskent, Deniz.

Ear Hear ; 39(2): 226-237, 2018.

Article in English | MEDLINE | ID: mdl-28799983

ABSTRACT

OBJECTIVES: When listening to two competing speakers, normal-hearing (NH) listeners can take advantage of voice differences between the speakers. Users of cochlear implants (CIs) have difficulty in perceiving speech on speech. Previous literature has indicated sensitivity to voice pitch (related to fundamental frequency, F0) to be poor among implant users, while sensitivity to vocal-tract length (VTL; related to the height of the speaker and formant frequencies), the other principal voice characteristic, has not been directly investigated in CIs. A few recent studies evaluated F0 and VTL perception indirectly, through voice gender categorization, which relies on perception of both voice cues. These studies revealed that, contrary to prior literature, CI users seem to rely exclusively on F0 while not utilizing VTL to perform this task. The objective of the present study was to directly and systematically assess raw sensitivity to F0 and VTL differences in CI users to define the extent of the deficit in voice perception. DESIGN: The just-noticeable differences (JNDs) for F0 and VTL were measured in 11 CI listeners using triplets of consonant-vowel syllables in an adaptive three-alternative forced choice method. RESULTS: The results showed that while NH listeners had average JNDs of 1.95 and 1.73 semitones (st) for F0 and VTL, respectively, CI listeners showed JNDs of 9.19 and 7.19 st. These JNDs correspond to differences of 70% in F0 and 52% in VTL. For comparison to the natural range of voices in the population, the F0 JND in CIs remains smaller than the typical male-female F0 difference. However, the average VTL JND in CIs is about twice as large as the typical male-female VTL difference. CONCLUSIONS: These findings, thus, directly confirm that CI listeners do not seem to have sufficient access to VTL cues, likely as a result of limited spectral resolution, and, hence, that CI listeners' voice perception deficit goes beyond poor perception of F0. 
These results provide a potential common explanation not only for a number of deficits observed in CI listeners, such as voice identification and gender categorization, but also for competing speech perception.
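
The semitone-to-percent figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below assumes the standard equal-tempered definition of a semitone (a frequency ratio of 2^(1/12)); the function name `semitones_to_percent` is illustrative and does not come from the paper. Under that assumption, 9.19 st corresponds to roughly a 70% difference, and 7.19 st to roughly 51.5%, close to the 52% reported (the small gap presumably reflects rounding of the published threshold).

```python
def semitones_to_percent(st: float) -> float:
    """Percent change in a frequency-like quantity for a difference of `st` semitones.

    Assumes the equal-tempered definition: one semitone is a ratio of 2**(1/12).
    """
    ratio = 2.0 ** (st / 12.0)
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    # Mean CI-listener JNDs quoted in the abstract above.
    for cue, jnd_st in [("F0", 9.19), ("VTL", 7.19)]:
        pct = semitones_to_percent(jnd_st)
        print(f"{cue}: {jnd_st} st corresponds to about {pct:.1f}%")
```

The same conversion applies to the NH listeners' JNDs (1.95 and 1.73 st) if a percent scale is easier to compare against natural voice differences.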

Subjects

Cochlear Implants , Pitch Discrimination , Speech Perception , Aged , Deafness/rehabilitation , Female , Humans , Male , Middle Aged , Speech Acoustics

10.

Effect of frequency mismatch and band partitioning on vocal tract length perception in vocoder simulations of cochlear implant processing.

El Boghdady, Nawal; Baskent, Deniz; Gaudrain, Etienne.

J Acoust Soc Am ; 143(6): 3505, 2018 06.

Article in English | MEDLINE | ID: mdl-29960490

ABSTRACT

The vocal tract length (VTL) of a speaker is an important voice cue that aids speech intelligibility in multi-talker situations. However, cochlear implant (CI) users demonstrate poor VTL sensitivity. This may be partially caused by the mismatch between frequencies received by the implant and those corresponding to places of stimulation along the cochlea. This mismatch can distort formant spacing, where VTL cues are encoded. In this study, the effects of frequency mismatch and band partitioning on VTL sensitivity were investigated in normal hearing listeners with vocoder simulations of CI processing. The hypotheses were that VTL sensitivity may be reduced by increased frequency mismatch and insufficient spectral resolution in how the frequency range is partitioned, specifically where formants lie. Moreover, optimal band partitioning might mitigate the detrimental effects of frequency mismatch on VTL sensitivity. Results showed that VTL sensitivity decreased with increased frequency mismatch and reduced spectral resolution near the low frequencies of the band partitioning map. Band partitioning was independent of mismatch, indicating that if a given partitioning is suboptimal, a better partitioning might improve VTL sensitivity despite the degree of mismatch. These findings suggest that customizing the frequency partitioning map may enhance VTL perception in individual CI users.

Subjects

Cochlear Implantation/instrumentation , Cochlear Implants , Larynx/physiology , Pitch Perception , Speech Acoustics , Speech Intelligibility , Speech Perception , Voice Quality , Acoustic Stimulation , Adult , Auditory Threshold , Cues , Electric Stimulation , Humans , Larynx/anatomy & histology , Young Adult

11.

The discrimination of voice cues in simulations of bimodal electro-acoustic cochlear-implant hearing.

Baskent, Deniz; Luckmann, Annika; Ceha, Jessy; Gaudrain, Etienne; Tamati, Terrin N.

J Acoust Soc Am ; 143(4): EL292, 2018 04.

Article in English | MEDLINE | ID: mdl-29716273

ABSTRACT

In discriminating speakers' voices, normal-hearing individuals effectively use two vocal characteristics, vocal pitch (related to fundamental frequency, F0) and vocal-tract length (VTL, related to speaker size). Typical cochlear-implant users show poor perception of these cues. However, in implant users with low-frequency residual acoustic hearing, this bimodal electro-acoustic stimulation may provide additional voice-related cues, such as low-numbered harmonics and formants, which could improve F0/VTL perception. In acoustic noise-vocoder simulations, where added low-pass filtered speech simulated residual hearing, a strong bimodal benefit was observed for F0 perception. No bimodal benefit was observed for VTL, which seems to mainly rely on vocoder spectral resolution.

Subjects

Acoustic Stimulation/methods , Cochlear Implantation/methods , Cues , Deafness/rehabilitation , Electric Stimulation/methods , Pitch Discrimination/physiology , Voice , Adult , Female , Humans , Male , Speech Acoustics , Young Adult

12.

Musician effect on perception of spectro-temporally degraded speech, vocal emotion, and music in young adolescents.

Baskent, Deniz; Fuller, Christina D; Galvin, John J; Schepel, Like; Gaudrain, Etienne; Free, Rolien H.

J Acoust Soc Am ; 143(5): EL311, 2018 05.

Article in English | MEDLINE | ID: mdl-29857757

ABSTRACT

In adult normal-hearing musicians, perception of music, vocal emotion, and speech in noise has been previously shown to be better than non-musicians, sometimes even with spectro-temporally degraded stimuli. In this study, melodic contour identification, vocal emotion identification, and speech understanding in noise were measured in young adolescent normal-hearing musicians and non-musicians listening to unprocessed or degraded signals. Different from adults, there was no musician effect for vocal emotion identification or speech in noise. Melodic contour identification with degraded signals was significantly better in musicians, suggesting potential benefits from music training for young cochlear-implant users, who experience similar spectro-temporal signal degradations.

Subjects

Acoustic Stimulation/methods , Emotions/physiology , Music/psychology , Pitch Perception/physiology , Speech Perception/physiology , Voice/physiology , Adolescent , Auditory Perception/physiology , Child , Female , Humans , Male , Time Factors

13.

Effect of F0 contours on top-down repair of interrupted speech.

Clarke, Jeanne; Kazanoglu, Deniz; Baskent, Deniz; Gaudrain, Etienne.

J Acoust Soc Am ; 142(1): EL7, 2017 07.

Article in English | MEDLINE | ID: mdl-28764445

ABSTRACT

Top-down repair of interrupted speech can be influenced by bottom-up acoustic cues such as voice pitch (F0). This study aims to investigate the role of the dynamic information of pitch, i.e., F0 contours, in top-down repair of speech. Intelligibility of sentences interrupted with silence or noise was measured in five F0 contour conditions (inverted, flat, original, exaggerated with a factor of 1.5 and 1.75). The main hypothesis was that manipulating F0 contours would impair linking successive segments of interrupted speech and thus negatively affect top-down repair. Intelligibility of interrupted speech was impaired only by misleading dynamic information (inverted F0 contours). The top-down repair of interrupted speech was not affected by any F0 contours manipulation.

14.

Musician advantage for speech-on-speech perception.

Baskent, Deniz; Gaudrain, Etienne.

J Acoust Soc Am ; 139(3): EL51-6, 2016 Mar.

Article in English | MEDLINE | ID: mdl-27036287

ABSTRACT

Evidence for transfer of musical training to better perception of speech in noise has been mixed. Unlike speech-in-noise, speech-on-speech perception utilizes many of the skills that musical training improves, such as better pitch perception and stream segregation, as well as use of higher-level auditory cognitive functions, such as attention. Indeed, despite the few non-musicians who performed as well as musicians, on a group level, there was a strong musician benefit for speech perception in a speech masker. This benefit does not seem to result from better voice processing and could instead be related to better stream segregation or enhanced cognitive functions.

Subjects

Music , Noise/adverse effects , Perceptual Masking , Speech Intelligibility , Speech Perception , Acoustic Stimulation , Adult , Attention , Audiometry, Speech , Cognition , Cues , Female , Humans , Male , Pitch Perception , Young Adult

15.

Pitch and spectral resolution: A systematic comparison of bottom-up cues for top-down repair of degraded speech.

Clarke, Jeanne; Baskent, Deniz; Gaudrain, Etienne.

J Acoust Soc Am ; 139(1): 395-405, 2016 Jan.

Article in English | MEDLINE | ID: mdl-26827034

ABSTRACT

The brain is capable of restoring missing parts of speech, a top-down repair mechanism that enhances speech understanding in noisy environments. This enhancement can be quantified using the phonemic restoration paradigm, i.e., the improvement in intelligibility when silent interruptions of interrupted speech are filled with noise. Benefit from top-down repair of speech differs between cochlear implant (CI) users and normal-hearing (NH) listeners. This difference could be due to poorer spectral resolution and/or weaker pitch cues inherent to CI transmitted speech. In CIs, those two degradations cannot be teased apart because spectral degradation leads to weaker pitch representation. A vocoding method was developed to evaluate independently the roles of pitch and spectral resolution for restoration in NH individuals. Sentences were resynthesized with different spectral resolutions and with either retaining the original pitch cues or discarding them all. The addition of pitch significantly improved restoration only at six-bands spectral resolution. However, overall intelligibility of interrupted speech was improved both with the addition of pitch and with the increase in spectral resolution. This improvement may be due to better discrimination of speech segments from the filler noise, better grouping of speech segments together, and/or better bottom-up cues available in the speech segments.

Subjects

Pitch Discrimination/physiology , Speech Perception/physiology , Acoustic Stimulation , Adult , Analysis of Variance , Cochlear Implants , Cues , Female , Humans , Male , Noise , Perceptual Masking/physiology , Sound Spectrography , Speech Acoustics , Speech Intelligibility/physiology , Young Adult

16.

Factors limiting vocal-tract length discrimination in cochlear implant simulations.

Gaudrain, Etienne; Baskent, Deniz.

J Acoust Soc Am ; 137(3): 1298-308, 2015 Mar.

Article in English | MEDLINE | ID: mdl-25786943

ABSTRACT

Perception of voice characteristics allows normal hearing listeners to identify the gender of a speaker, and to better segregate speakers from each other in cocktail party situations. This benefit is largely driven by the perception of two vocal characteristics of the speaker: The fundamental frequency (F0) and the vocal-tract length (VTL). Previous studies have suggested that cochlear implant (CI) users have difficulties in perceiving these cues. The aim of the present study was to investigate possible causes for limited sensitivity to VTL differences in CI users. Different acoustic simulations of CI stimulation were implemented to characterize the role of spectral resolution on VTL, both in terms of number of channels and amount of channel interaction. The results indicate that with 12 channels, channel interaction caused by current spread is likely to prevent CI users from perceiving VTL differences typically found between male and female speakers.

Subjects

Acoustic Stimulation/methods , Cochlear Implants , Computer Simulation , Discrimination, Psychological , Speech Acoustics , Speech Perception , Voice Quality , Acoustics , Adult , Age Factors , Audiometry, Speech , Auditory Threshold , Cues , Female , Humans , Male , Middle Aged , Phonetics , Prosthesis Design , Sex Factors , Young Adult

17.

A neural mechanism for recognizing speech spoken by different speakers.

Kreitewolf, Jens; Gaudrain, Etienne; von Kriegstein, Katharina.

Neuroimage ; 91: 375-85, 2014 May 01.

Article in English | MEDLINE | ID: mdl-24434677

ABSTRACT

Understanding speech from different speakers is a sophisticated process, particularly because the same acoustic parameters convey important information about both the speech message and the person speaking. How the human brain accomplishes speech recognition under such conditions is unknown. One view is that speaker information is discarded at early processing stages and not used for understanding the speech message. An alternative view is that speaker information is exploited to improve speech recognition. Consistent with the latter view, previous research identified functional interactions between the left- and the right-hemispheric superior temporal sulcus/gyrus, which process speech- and speaker-specific vocal tract parameters, respectively. Vocal tract parameters are one of the two major acoustic features that determine both speaker identity and speech message (phonemes). Here, using functional magnetic resonance imaging (fMRI), we show that a similar interaction exists for glottal fold parameters between the left and right Heschl's gyri. Glottal fold parameters are the other main acoustic feature that determines speaker identity and speech message (linguistic prosody). The findings suggest that interactions between left- and right-hemispheric areas are specific to the processing of different acoustic features of speech and speaker, and that they represent a general neural mechanism when understanding speech from different speakers.

Subjects

Brain/physiology , Psychological Recognition/physiology , Speech/physiology , Adult , Female , Functional Laterality/physiology , Glottis/anatomy & histology , Glottis/physiology , Humans , Computer-Assisted Image Processing , Individuality , Magnetic Resonance Imaging , Male , Oxygen/blood , Psycholinguistics , Vocal Cords/anatomy & histology , Vocal Cords/physiology , Young Adult

18.

Evaluating speech-in-speech perception via a humanoid robot.

Meyer, Luke; Araiza-Illan, Gloria; Rachman, Laura; Gaudrain, Etienne; Baskent, Deniz.

Front Neurosci ; 18: 1293120, 2024.

Article in English | MEDLINE | ID: mdl-38406584

ABSTRACT

Introduction: Underlying mechanisms of speech perception masked by background speakers, a common daily listening condition, are often investigated using various and lengthy psychophysical tests. The presence of a social agent, such as an interactive humanoid NAO robot, may help maintain engagement and attention. However, such robots potentially have limited sound quality or processing speed. Methods: As a first step toward the use of NAO in psychophysical testing of speech-in-speech perception, we compared normal-hearing young adults' performance when using the standard computer interface to that when using a NAO robot to introduce the test and present all corresponding stimuli. Target sentences were presented with colour and number keywords in the presence of competing masker speech at varying target-to-masker ratios. Sentences were produced by the same speaker, but voice differences between the target and masker were introduced using speech synthesis methods. To assess test performance, speech intelligibility and data collection duration were compared between the computer and NAO setups. Human-robot interaction was assessed using the Negative Attitude Toward Robot Scale (NARS) and quantification of behavioural cues (backchannels). Results: Speech intelligibility results showed functional similarity between the computer and NAO setups. Data collection durations were longer when using NAO. NARS results showed participants had a relatively positive attitude toward "situations of interactions" with robots prior to the experiment, but otherwise showed neutral attitudes toward the "social influence" of and "emotions in interaction" with robots. The presence of more positive backchannels when using NAO suggests higher engagement with the robot in comparison to the computer. Discussion: Overall, the study presents the potential of the NAO for presenting speech materials and collecting psychophysical measurements for speech-in-speech perception.

19.

Neural Adaptation at Stimulus Onset and Speed of Neural Processing as Critical Contributors to Speech Comprehension Independent of Hearing Threshold or Age.

Schirmer, Jakob; Wolpert, Stephan; Dapper, Konrad; Rühle, Moritz; Wertz, Jakob; Wouters, Marjoleen; Eldh, Therese; Bader, Katharina; Singer, Wibke; Gaudrain, Etienne; Baskent, Deniz; Verhulst, Sarah; Braun, Christoph; Rüttiger, Lukas; Munk, Matthias H J; Dalhoff, Ernst; Knipper, Marlies.

J Clin Med ; 13(9), 2024 May 06.

Article in English | MEDLINE | ID: mdl-38731254

ABSTRACT

Background: It is assumed that speech comprehension deficits in background noise are caused by age-related or acquired hearing loss. Methods: We examined young, middle-aged, and older individuals with and without hearing threshold loss using pure-tone (PT) audiometry, short-pulsed distortion-product otoacoustic emissions (pDPOAEs), auditory brainstem responses (ABRs), auditory steady-state responses (ASSRs), speech comprehension (OLSA), and syllable discrimination in quiet and noise. Results: A noticeable decline of hearing sensitivity in extended high-frequency regions and its influence on low-frequency-induced ABRs was striking. When testing for differences in OLSA thresholds normalized for PT thresholds (PTTs), marked differences in speech comprehension ability exist not only in noise, but also in quiet, and they exist throughout the whole age range investigated. Listeners with poor speech comprehension in quiet exhibited a relatively lower pDPOAE and, thus, cochlear amplifier performance independent of PTT, smaller and delayed ABRs, and lower performance in vowel-phoneme discrimination below phase-locking limits (/o/-/u/). When OLSA was tested in noise, listeners with poor speech comprehension independent of PTT had larger pDPOAEs and, thus, cochlear amplifier performance, larger ASSR amplitudes, and higher uncomfortable loudness levels, all linked with lower performance of vowel-phoneme discrimination above the phase-locking limit (/i/-/y/). Conclusions: This study indicates that listening in noise in humans has a sizable disadvantage in envelope coding when basilar-membrane compression is compromised. Clearly, and in contrast to previous assumptions, both good and poor speech comprehension can exist independently of differences in PTTs and age, a phenomenon that urgently requires improved techniques to diagnose sound processing at stimulus onset in the clinical routine.

20.

Similarities in the neural signature for the processing of behaviorally categorized and uncategorized speech sounds.

Signoret, Carine; Gaudrain, Etienne; Perrin, Fabien.

Eur J Neurosci ; 37(5): 777-85, 2013 Mar.

Article in English | MEDLINE | ID: mdl-23281939

ABSTRACT

Recent human behavioral studies have shown semantic and/or lexical processing for stimuli presented below the auditory perception threshold. Here, we investigated electroencephalographic responses to words, pseudo-words and complex sounds, in conditions where phonological and lexical categorizations were behaviorally successful (categorized stimuli) or unsuccessful (uncategorized stimuli). Data showed a greater decrease in low-beta power at left-hemisphere temporal electrodes for categorized non-lexical sounds (complex sounds and pseudo-words) than for categorized lexical sounds (words), consistent with the signature of a failure in lexical access. Similar differences between lexical and non-lexical sounds were observed for uncategorized stimuli, although these stimuli did not yield evoked potentials or theta activity. The results of the present study suggest that behaviorally uncategorized stimuli were processed at the lexical level, and provide evidence of the neural bases of the results observed in previous behavioral studies investigating auditory perception in the absence of stimulus awareness.

Subjects

Beta Rhythm , Sound , Speech Perception/physiology , Speech/physiology , Theta Rhythm , Adult , Auditory Perception , Cerebral Cortex/physiology , Evoked Potentials , Female , Functional Laterality , Humans , Male , Phonetics
