Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
1.
Dev Sci ; 27(1): e13428, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37381667

RESUMEN

The prevalent "core phonological deficit" model of dyslexia proposes that the reading and spelling difficulties characterizing affected children stem from prior developmental difficulties in processing speech sound structure, for example, perceiving and identifying syllable stress patterns, syllables, rhymes and phonemes. Yet spoken word production appears normal. This suggests an unexpected disconnect between speech input and speech output processes. Here we investigated the output side of this disconnect from a speech rhythm perspective by measuring the speech amplitude envelope (AE) of multisyllabic spoken phrases. The speech AE contains crucial information regarding stress patterns, speech rate, tonal contrasts and intonational information. We created a novel computerized speech copying task in which participants copied aloud familiar spoken targets like "Aladdin." Seventy-five children with and without dyslexia were tested, some of whom were also receiving an oral intervention designed to enhance multi-syllabic processing. Similarity of the child's productions to the target AE was computed using correlation and mutual information metrics. Similarity of pitch contour, another acoustic cue to speech rhythm, was used for control analyses. Children with dyslexia were significantly worse at producing the multi-syllabic targets as indexed by both similarity metrics for computing the AE. However, children with dyslexia were not different from control children in producing pitch contours. Accordingly, the spoken production of multisyllabic phrases by children with dyslexia is atypical regarding the AE. Children with dyslexia may not appear to listeners to exhibit speech production difficulties because their pitch contours are intact. RESEARCH HIGHLIGHTS: Speech production of syllable stress patterns is atypical in children with dyslexia. Children with dyslexia are significantly worse at producing the amplitude envelope of multi-syllabic targets compared to both age-matched and reading-level-matched control children. No group differences were found for pitch contour production between children with dyslexia and age-matched control children. It may be difficult to detect speech output problems in dyslexia as pitch contours are relatively accurate.


Asunto(s)
Dislexia , Percepción del Habla , Niño , Humanos , Habla , Lectura , Fonética
2.
Int J Audiol ; : 1-7, 2023 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-37750302

RESUMEN

OBJECTIVE: To examine whether the responsiveness of young children to simple sounds was associated with entertainment screen time (EST), opportunities for social interaction, and social and communication skills. DESIGN: Parents completed a questionnaire covering, for years one and two, the number of times the child met with other children; the number of words the child spoke; and the daily amount of EST. Social, attention and communication skills were assessed. STUDY SAMPLE: Participants were 118 children, aged 15 to 46 months. They were initially assessed behaviourally using simple sounds. Children who responded to such sounds were denoted the Responsive group. Children who did not were assessed using familiar songs and denoted the Unresponsive group. RESULTS: The two groups did not differ significantly in mean age or the number of opportunities to meet other children. The Unresponsive group had significantly fewer words than the Responsive group at 12 and 24 months and had significantly higher EST than the Responsive group for years 1 and 2. The Unresponsive group showed lower social, attention and communication skills than the Responsive group. CONCLUSIONS: High EST was associated with poorer auditory and social skills. Hence, it may be wise to limit the EST of young children.

3.
J Neurosci ; 41(23): 5093-5101, 2021 06 09.
Artículo en Inglés | MEDLINE | ID: mdl-33926996

RESUMEN

Understanding speech in background noise is a difficult task. The tracking of speech rhythms such as the rate of syllables and words by cortical activity has emerged as a key neural mechanism for speech-in-noise comprehension. In particular, recent investigations have used transcranial alternating current stimulation (tACS) with the envelope of a speech signal to influence the cortical speech tracking, demonstrating that this type of stimulation modulates comprehension and therefore providing evidence of a functional role of the cortical tracking in speech processing. Cortical activity has been found to track the rhythms of a background speaker as well, but the functional significance of this neural response remains unclear. Here we use a speech-comprehension task with a target speaker in the presence of a distractor voice to show that tACS with the speech envelope of the target voice as well as tACS with the envelope of the distractor speaker both modulate the comprehension of the target speech. Because the envelope of the distractor speech does not carry information about the target speech stream, the modulation of speech comprehension through tACS with this envelope provides evidence that the cortical tracking of the background speaker affects the comprehension of the foreground speech signal. The phase dependency of the resulting modulation of speech comprehension is, however, opposite to that obtained from tACS with the envelope of the target speech signal. This suggests that the cortical tracking of the ignored speech stream and that of the attended speech stream may compete for neural resources.SIGNIFICANCE STATEMENT Loud environments such as busy pubs or restaurants can make conversation difficult. However, they also allow us to eavesdrop into other conversations that occur in the background. In particular, we often notice when somebody else mentions our name, even if we have not been listening to that person. However, the neural mechanisms by which background speech is processed remain poorly understood. Here we use transcranial alternating current stimulation, a technique through which neural activity in the cerebral cortex can be influenced, to show that cortical responses to rhythms in the distractor speech modulate the comprehension of the target speaker. Our results provide evidence that the cortical tracking of background speech rhythms plays a functional role in speech processing.


Asunto(s)
Corteza Cerebral/fisiología , Comprensión/fisiología , Percepción del Habla/fisiología , Adolescente , Adulto , Femenino , Humanos , Masculino , Ruido , Estimulación Transcraneal de Corriente Directa , Adulto Joven
4.
Neuroimage ; 210: 116557, 2020 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-31968233

RESUMEN

Auditory cortical activity entrains to speech rhythms and has been proposed as a mechanism for online speech processing. In particular, neural activity in the theta frequency band (4-8 â€‹Hz) tracks the onset of syllables which may aid the parsing of a speech stream. Similarly, cortical activity in the delta band (1-4 â€‹Hz) entrains to the onset of words in natural speech and has been found to encode both syntactic as well as semantic information. Such neural entrainment to speech rhythms is not merely an epiphenomenon of other neural processes, but plays a functional role in speech processing: modulating the neural entrainment through transcranial alternating current stimulation influences the speech-related neural activity and modulates the comprehension of degraded speech. However, the distinct functional contributions of the delta- and of the theta-band entrainment to the modulation of speech comprehension have not yet been investigated. Here we use transcranial alternating current stimulation with waveforms derived from the speech envelope and filtered in the delta and theta frequency bands to alter cortical entrainment in both bands separately. We find that transcranial alternating current stimulation in the theta band but not in the delta band impacts speech comprehension. Moreover, we find that transcranial alternating current stimulation with the theta-band portion of the speech envelope can improve speech-in-noise comprehension beyond sham stimulation. Our results show a distinct contribution of the theta- but not of the delta-band stimulation to the modulation of speech comprehension. In addition, our findings open up a potential avenue of enhancing the comprehension of speech in noise.


Asunto(s)
Corteza Cerebral/fisiología , Comprensión/fisiología , Ritmo Delta/fisiología , Percepción del Habla/fisiología , Ritmo Teta/fisiología , Estimulación Transcraneal de Corriente Directa , Adulto , Femenino , Humanos , Masculino , Ruido , Adulto Joven
5.
J Acoust Soc Am ; 145(3): 1493, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-31067946

RESUMEN

The effects on speech intelligibility and sound quality of two noise-reduction algorithms were compared: a deep recurrent neural network (RNN) and spectral subtraction (SS). The RNN was trained using sentences spoken by a large number of talkers with a variety of accents, presented in babble. Different talkers were used for testing. Participants with mild-to-moderate hearing loss were tested. Stimuli were given frequency-dependent linear amplification to compensate for the individual hearing losses. A paired-comparison procedure was used to compare all possible combinations of three conditions. The conditions were: speech in babble with no processing (NP) or processed using the RNN or SS. In each trial, the same sentence was played twice using two different conditions. The participants indicated which one was better and by how much in terms of speech intelligibility and (in separate blocks) sound quality. Processing using the RNN was significantly preferred over NP and over SS processing for both subjective intelligibility and sound quality, although the magnitude of the preferences was small. SS processing was not significantly preferred over NP for either subjective intelligibility or sound quality. Objective computational measures of speech intelligibility predicted better intelligibility for RNN than for SS or NP.


Asunto(s)
Inteligibilidad del Habla , Software de Reconocimiento del Habla/normas , Anciano , Femenino , Audífonos/normas , Humanos , Masculino , Persona de Mediana Edad , Redes Neurales de la Computación , Percepción del Habla
6.
J Acoust Soc Am ; 146(1): 705, 2019 07.
Artículo en Inglés | MEDLINE | ID: mdl-31370586

RESUMEN

Speech-in-noise perception is a major problem for users of cochlear implants (CIs), especially with non-stationary background noise. Noise-reduction algorithms have produced benefits but relied on a priori information about the target speaker and/or background noise. A recurrent neural network (RNN) algorithm was developed for enhancing speech in non-stationary noise and its benefits were evaluated for speech perception, using both objective measures and experiments with CI simulations and CI users. The RNN was trained using speech from many talkers mixed with multi-talker or traffic noise recordings. Its performance was evaluated using speech from an unseen talker mixed with different noise recordings of the same class, either babble or traffic noise. Objective measures indicated benefits of using a recurrent over a feed-forward architecture, and predicted better speech intelligibility with than without the processing. The experimental results showed significantly improved intelligibility of speech in babble noise but not in traffic noise. CI subjects rated the processed stimuli as significantly better in terms of speech distortions, noise intrusiveness, and overall quality than unprocessed stimuli for both babble and traffic noise. These results extend previous findings for CI users to mostly unseen acoustic conditions with non-stationary noise.

7.
Int J Audiol ; 57(8): 624-631, 2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-29764254

RESUMEN

OBJECTIVE: The objective was to evaluate and select appropriate parameters for a multi-channel transient reduction (MCTR) algorithm for detecting and attenuating transient sounds in speech. DESIGN: In each trial, the same sentence was played twice. A transient sound was presented in both sentences, but its level varied across the two depending on whether or not it had been processed by the MCTR and on the "strength" of the processing. The participant indicated their preference for which one was better and by how much in terms of the balance between the annoyance produced by the transient and the audibility of the transient (they were told that the transient should still be audible). STUDY SAMPLE: Twenty English-speaking participants were tested, 10 with normal hearing and 10 with mild-to-moderate hearing-impairment. Frequency-dependent linear amplification was provided for the latter. RESULTS: The results for both participant groups indicated that sounds processed using the MCTR were preferred over the unprocessed sounds. For the hearing-impaired participants, the medium and strong settings of the MCTR were preferred over the weak setting. CONCLUSIONS: The medium and strong settings of the MCTR reduced the annoyance produced by the transients while maintaining their audibility.


Asunto(s)
Algoritmos , Corrección de Deficiencia Auditiva/instrumentación , Audífonos , Pérdida Auditiva/rehabilitación , Ruido/prevención & control , Personas con Deficiencia Auditiva/rehabilitación , Procesamiento de Señales Asistido por Computador , Percepción del Habla , Estimulación Acústica , Audiometría del Habla , Umbral Auditivo , Audición , Pérdida Auditiva/diagnóstico , Pérdida Auditiva/fisiopatología , Pérdida Auditiva/psicología , Humanos , Genio Irritable , Ruido/efectos adversos , Prioridad del Paciente , Enmascaramiento Perceptual , Personas con Deficiencia Auditiva/psicología
8.
Audiol Res ; 14(2): 264-279, 2024 Feb 26.
Artículo en Inglés | MEDLINE | ID: mdl-38525685

RESUMEN

BACKGROUND: The Chear open-set performance test (COPT), which uses a carrier phrase followed by a monosyllabic test word, is intended for clinical assessment of speech recognition, evaluation of hearing-device performance, and the fine-tuning of hearing devices for speakers of British English. This paper assesses practice effects, test-retest reliability, and the variability across lists of the COPT. METHOD: In experiment 1, 16 normal-hearing participants were tested using an initial version of the COPT, at three speech-to-noise ratios (SNRs). Experiment 2 used revised COPT lists, with items swapped between lists to reduce differences in difficulty across lists. In experiment 3, test-retest repeatability was assessed for stimuli presented in quiet, using 15 participants with sensorineural hearing loss. RESULTS: After administration of a single practice list, no practice effects were evident. The critical difference between scores for two lists was about 2 words (out of 15) or 5 phonemes (out of 50). The mean estimated SNR required for 74% words correct was -0.56 dB, with a standard deviation across lists of 0.16 dB. For the participants with hearing loss tested in quiet, the critical difference between scores for two lists was about 3 words (out of 15) or 6 phonemes (out of 50).

9.
Clin Neurophysiol ; 160: 47-55, 2024 04.
Artículo en Inglés | MEDLINE | ID: mdl-38387402

RESUMEN

OBJECTIVE: Previous studies have reported atypical delta phase in children with dyslexia, and that delta phase modulates the amplitude of the beta-band response via delta-beta phase-amplitude coupling (PAC). Accordingly, the atypical delta-band effects in children with dyslexia may imply related atypical beta-band effects, particularly regarding delta-beta PAC. Our primary objective was to explore beta-band oscillations in children with and without dyslexia, to explore potentially atypical effects in the beta band in dyslexic children. METHODS: We collected EEG data during a rhythmic speech paradigm from 51 children (21 control; 30 dyslexia). We then assessed beta-band phase entrainment, beta-band angular velocity, beta-band power responses and delta-beta PAC. RESULTS: We found significant beta-band phase entrainment for control children but not for dyslexic children. Furthermore, children with dyslexia exhibited significantly faster beta-band angular velocity and significantly greater beta-band power. Delta-beta PAC was comparable in both groups. CONCLUSION: Atypical beta-band effects were observed in children with dyslexia. However, delta-beta PAC was comparable in both dyslexic and control children. SIGNIFICANCE: These findings offer further insights into the neurophysiological basis of atypical rhythmic speech processing by children with dyslexia, suggesting the involvement of a wide range of frequency bands.


Asunto(s)
Dislexia , Percepción del Habla , Niño , Humanos , Habla/fisiología , Dislexia/diagnóstico , Percepción del Habla/fisiología
10.
J Neurosci Methods ; 403: 110036, 2024 03.
Artículo en Inglés | MEDLINE | ID: mdl-38128783

RESUMEN

BACKGROUND: Computational models that successfully decode neural activity into speech are increasing in the adult literature, with convolutional neural networks (CNNs), backward linear models, and mutual information (MI) models all being applied to neural data in relation to speech input. This is not the case in the infant literature. NEW METHOD: Three different computational models, two novel for infants, were applied to decode low-frequency speech envelope information. Previously-employed backward linear models were compared to novel CNN and MI-based models. Fifty infants provided EEG recordings when aged 4, 7, and 11 months, while listening passively to natural speech (sung or chanted nursery rhymes) presented by video with a female singer. RESULTS: Each model computed speech information for these nursery rhymes in two different low-frequency bands, delta and theta, thought to provide different types of linguistic information. All three models demonstrated significant levels of performance for delta-band neural activity from 4 months of age, with two of three models also showing significant performance for theta-band activity. All models also demonstrated higher accuracy for the delta-band neural responses. None of the models showed developmental (age-related) effects. COMPARISONS WITH EXISTING METHODS: The data demonstrate that the choice of algorithm used to decode speech envelope information from neural activity in the infant brain determines the developmental conclusions that can be drawn. CONCLUSIONS: The modelling shows that better understanding of the strengths and weaknesses of each modelling approach is fundamental to improving our understanding of how the human brain builds a language system.


Asunto(s)
Percepción del Habla , Habla , Adulto , Humanos , Femenino , Lactante , Habla/fisiología , Electroencefalografía , Modelos Lineales , Encéfalo , Redes Neurales de la Computación , Percepción del Habla/fisiología
11.
Neuroimage Clin ; 35: 103054, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35642984

RESUMEN

According to the sensory-neural Temporal Sampling theory of developmental dyslexia, neural sampling of auditory information at slow rates (<10 Hz, related to speech rhythm) is atypical in dyslexic individuals, particularly in the delta band (0.5-4 Hz). Here we examine the underlying neural mechanisms related to atypical sampling using a simple repetitive speech paradigm. Fifty-one children (21 control children [15M, 6F] and 30 children with dyslexia [16M, 14F]) aged 9 years with or without developmental dyslexia watched and listened as a 'talking head' repeated the syllable "ba" every 500 ms, while EEG was recorded. Occasionally a syllable was "out of time", with a temporal delay calibrated individually and adaptively for each child so that it was detected around 79.4% of the time by a button press. Phase consistency in the delta (rate of stimulus delivery), theta (speech-related) and alpha (control) bands was evaluated for each child and each group. Significant phase consistency was found for both groups in the delta and theta bands, demonstrating neural entrainment, but not the alpha band. However, the children with dyslexia showed a different preferred phase and significantly reduced phase consistency compared to control children, in the delta band only. Analysis of pre- and post-stimulus angular velocity of group preferred phases revealed that the children in the dyslexic group showed an atypical response in the delta band only. The delta-band pre-stimulus angular velocity (-130 ms to 0 ms) for the dyslexic group appeared to be significantly faster compared to the control group. It is concluded that neural responding to simple beat-based stimuli may provide a unique neural marker of developmental dyslexia. The automatic nature of this neural response may enable new tools for diagnosis, as well as opening new avenues for remediation.


Asunto(s)
Dislexia , Percepción del Habla , Estimulación Acústica , Percepción Auditiva , Niño , Humanos , Lenguaje , Habla/fisiología , Percepción del Habla/fisiología
12.
Brain Lang ; 235: 105198, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36343509

RESUMEN

The amplitude envelope of speech carries crucial low-frequency acoustic information that assists linguistic decoding. The sensory-neural Temporal Sampling (TS) theory of developmental dyslexia proposes atypical encoding of speech envelope information < 10 Hz, leading to atypical phonological representations. Here a backward linear TRF model and story listening were employed to estimate the speech information encoded in the electroencephalogram in the canonical delta, theta and alpha bands by 9-year-old children with and without dyslexia. TRF decoding accuracy provided an estimate of how faithfully the children's brains encoded low-frequency envelope information. Between-group analyses showed that the children with dyslexia exhibited impaired reconstruction of speech information in the delta band. However, when the quality of speech encoding for each child was estimated using child-by-child decoding models, then the dyslexic children did not differ from controls. This suggests that children with dyslexia encode neither "noisy" nor "normal" representations of the speech signal, but different representations.


Asunto(s)
Dislexia , Percepción del Habla , Humanos , Niño , Habla , Dislexia/diagnóstico , Ruido , Electroencefalografía
13.
Trends Hear ; 25: 23312165211041475, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34606381

RESUMEN

A deep recurrent neural network (RNN) for reducing transient sounds was developed and its effects on subjective speech intelligibility and listening comfort were investigated. The RNN was trained using sentences spoken with different accents and corrupted by transient sounds, using the clean speech as the target. It was tested using sentences spoken by unseen talkers and corrupted by unseen transient sounds. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective speech intelligibility and listening comfort for two relative levels of the transients. The conditions were: no processing (NP); processing using the RNN; and processing using a multi-channel transient reduction method (MCTR). Ten participants with normal hearing and ten with mild-to-moderate hearing loss participated. For the latter, frequency-dependent linear amplification was applied to all stimuli to compensate for individual audibility losses. For the normal-hearing participants, processing using the RNN was significantly preferred over that for NP for subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for subjective intelligibility, and processing using the MCTR was significantly preferred over that for NP for comfort for the higher transient level only. For the hearing-impaired participants, processing using the RNN was significantly preferred over that for NP for both subjective intelligibility and comfort, processing using the RNN was significantly preferred over that for MCTR for comfort, and processing using the MCTR was significantly preferred over that for NP for comfort.


Asunto(s)
Audífonos , Pérdida Auditiva Sensorineural , Percepción del Habla , Humanos , Redes Neurales de la Computación , Ruido/efectos adversos , Inteligibilidad del Habla
14.
Brain Lang ; 220: 104968, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34111684

RESUMEN

Currently there are no reliable means of identifying infants at-risk for later language disorders. Infant neural responses to rhythmic stimuli may offer a solution, as neural tracking of rhythm is atypical in children with developmental language disorders. However, infant brain recordings are noisy. As a first step to developing accurate neural biomarkers, we investigate whether infant brain responses to rhythmic stimuli can be classified reliably using EEG from 95 eight-week-old infants listening to natural stimuli (repeated syllables or drumbeats). Both Convolutional Neural Network (CNN) and Support Vector Machine (SVM) approaches were employed. Applied to one infant at a time, the CNN discriminated syllables from drumbeats with a mean AUC of 0.87, against two levels of noise. The SVM classified with AUC 0.95 and 0.86 respectively, showing reduced performance as noise increased. Our proof-of-concept modelling opens the way to the development of clinical biomarkers for language disorders related to rhythmic entrainment.


Asunto(s)
Aprendizaje Automático , Habla , Niño , Electroencefalografía , Humanos , Lactante , Redes Neurales de la Computación , Máquina de Vectores de Soporte
15.
Front Hum Neurosci ; 14: 187, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32547377

RESUMEN

Transcranial alternating current stimulation with the speech envelope can modulate the comprehension of speech in noise. The modulation stems from the theta- but not the delta-band portion of the speech envelope, and likely reflects the entrainment of neural activity in the theta frequency band, which may aid the parsing of the speech stream. The influence of the current stimulation on speech comprehension can vary with the time delay between the current waveform and the audio signal. While this effect has been investigated for current stimulation based on the entire speech envelope, it has not yet been measured when the current waveform follows the theta-band portion of the speech envelope. Here, we show that transcranial current stimulation with the speech envelope filtered in the theta frequency band improves speech comprehension as compared to a sham stimulus. The improvement occurs when there is no time delay between the current and the speech stimulus, as well as when the temporal delay is comparatively short, 90 ms. In contrast, longer delays, as well as negative delays, do not impact speech-in-noise comprehension. Moreover, we find that the improvement of speech comprehension at no or small delays of the current stimulation is consistent across participants. Our findings suggest that cortical entrainment to speech is most influenced through current stimulation that follows the speech envelope with at most a small delay. They also open a path to enhancing the perception of speech in noise, an issue that is particularly important for people with hearing impairment.

16.
Trends Hear ; 22: 2331216518770964, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-29708061

RESUMEN

Despite great advances in hearing-aid technology, users still experience problems with noise in windy environments. The potential benefits of using a deep recurrent neural network (RNN) for reducing wind noise were assessed. The RNN was trained using recordings of the output of the two microphones of a behind-the-ear hearing aid in response to male and female speech at various azimuths in the presence of noise produced by wind from various azimuths with a velocity of 3 m/s, using the "clean" speech as a reference. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective intelligibility and for sound quality or comfort. The conditions were unprocessed noisy speech, noisy speech processed using the RNN, and noisy speech that was high-pass filtered (which also reduced wind noise). Eighteen native English-speaking participants were tested, nine with normal hearing and nine with mild-to-moderate hearing impairment. Frequency-dependent linear amplification was provided for the latter. Processing using the RNN was significantly preferred over no processing by both subject groups for both subjective intelligibility and sound quality, although the magnitude of the preferences was small. High-pass filtering (HPF) was not significantly preferred over no processing. Although RNN was significantly preferred over HPF only for sound quality for the hearing-impaired participants, for the results as a whole, there was a preference for RNN over HPF. Overall, the results suggest that reduction of wind noise using an RNN is possible and might have beneficial effects when used in hearing aids.


Asunto(s)
Audífonos , Pérdida Auditiva/rehabilitación , Redes Neurales de la Computación , Ruido/prevención & control , Acústica del Lenguaje , Inteligibilidad del Habla , Viento , Acústica , Umbral Auditivo , Femenino , Pérdida Auditiva/diagnóstico , Humanos , Masculino , Distribución Aleatoria , Percepción del Habla , Adulto Joven
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA