Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 33
1.
Percept Mot Skills ; 131(2): 417-431, 2024 Apr.
Article En | MEDLINE | ID: mdl-38153030

In this study, we explore the feasibility and performance of detecting scalp-recorded frequency-following responses (FFRs) with a specialized machine learning (ML) model. By leveraging the strengths of feature extraction of the source separation non-negative matrix factorization (SSNMF) algorithm and its adeptness in handling limited training data, we adapted the SSNMF algorithm into a specialized ML model with a hybrid architecture to enhance FFR detection amidst background noise. We recruited 40 adults with normal hearing and evoked their scalp recorded FFRs using the English vowel/i/with a rising pitch contour. The model was trained on FFR-present and FFR-absent conditions, and its performance was evaluated using sensitivity, specificity, efficiency, false-positive rate, and false-negative rate metrics. This study revealed that the specialized SSNMF model achieved heightened sensitivity, specificity, and efficiency in detecting FFRs as the number of recording sweeps increased. Sensitivity exceeded 80% at 500 sweeps and maintained over 89% from 1000 sweeps onwards. Similarly, specificity and efficiency also improved rapidly with increasing sweeps. The progressively enhanced sensitivity, specificity, and efficiency of this specialized ML model underscore its practicality and potential for broader applications. These findings have immediate implications for FFR research and clinical use, while paving the way for further advancements in the assessment of auditory processing.


Speech Perception , Adult , Humans , Speech Perception/physiology , Auditory Perception , Machine Learning , Acoustic Stimulation , Electroencephalography
2.
Percept Mot Skills ; 130(5): 1834-1851, 2023 Oct.
Article En | MEDLINE | ID: mdl-37534595

Source-Separation Non-Negative Matrix Factorization (SSNMF) is a mathematical algorithm recently developed to extract scalp-recorded frequency-following responses (FFRs) from noise. Despite its initial success, the effects of silent intervals on algorithm performance remain undetermined. Our purpose in this study was to determine the effects of silent intervals on the extraction of FFRs, which are electrophysiological responses that are commonly used to evaluate auditory processing and neuroplasticity in the human brain. We used an English vowel /i/ with a rising frequency contour to evoke FFRs in 23 normal-hearing adults. The stimulus had a duration of 150 ms, while the silent interval between the onset of one stimulus and the offset of the next one was also 150 ms. We computed FFR Enhancement and Noise Residue to estimate algorithm performance, while silent intervals were either included (i.e., the WithSI condition) or excluded (i.e., the WithoutSI condition) in our analysis. The FFR Enhancements and Noise Residues obtained in the WithoutSI condition were significantly better (p < .05) than those obtained in the WithSI condition. On average, the exclusion of silent intervals produced a 11.78% increment in FFR Enhancement and a 20.69% decrement in Noise Residue. These results not only quantify the effects of silent intervals on the extraction of human FFRs, but also provide recommendations for designing and improving the SSNMF algorithm in future research.

3.
Int J Audiol ; 62(7): 688-698, 2023 07.
Article En | MEDLINE | ID: mdl-35522832

OBJECTIVE: One challenge in extracting the scalp-recorded frequency-following response (FFR) is related to its inherently small amplitude, which means that the response cannot be identified with confidence when only a relatively small number of recording sweeps are included in the averaging procedure. DESIGN: This study examined how the non-negative matrix factorisation (NMF) algorithm with a source separation constraint could be applied to improve the efficiency of FFR recordings. Conventional FFRs elicited by an English vowel/i/with a rising frequency contour were collected. Study sample: Fifteen normal-hearing adults and 15 normal-hearing neonates were recruited. RESULTS: The improvements of FFR recordings, defined as the correlation coefficient and root-mean-square differences across a sweep series of amplitude spectrograms before and after the application of the source separation NMF (SSNMF) algorithm, were characterised through an exponential curve fitting model. Statistical analysis of variance indicated that the SSNMF algorithm was able to enhance the FFRs recorded in both groups of participants. CONCLUSIONS: Such improvements enabled FFR extractions in a relatively small number of recording sweeps, and opened a new window to better understand how speech sounds are processed in the human brain.


Hearing Tests , Speech Perception , Infant, Newborn , Humans , Adult , Language , Algorithms , Brain Stem/physiology , Hearing , Speech Perception/physiology
4.
Semin Hear ; 43(3): 251-274, 2022 Aug.
Article En | MEDLINE | ID: mdl-36313046

The frequency-following response (FFR) provides enriched information on how acoustic stimuli are processed in the human brain. Based on recent studies, machine learning techniques have demonstrated great utility in modeling human FFRs. This tutorial focuses on the fundamental principles, algorithmic designs, and custom implementations of several supervised models (linear regression, logistic regression, k -nearest neighbors, support vector machines) and an unsupervised model ( k -means clustering). Other useful machine learning tools (Markov chains, dimensionality reduction, principal components analysis, nonnegative matrix factorization, and neural networks) are discussed as well. Each model's applicability and its pros and cons are explained. The choice of a suitable model is highly dependent on the research question, FFR recordings, target variables, extracted features, and their data types. To promote understanding, an example project implemented in Python is provided, which demonstrates practical usage of several of the discussed models on a sample dataset of six FFR features and a target response label.

5.
Percept Mot Skills ; 128(6): 2527-2543, 2021 Dec.
Article En | MEDLINE | ID: mdl-34586922

Previous research has shown the novelty of lexical-tone chimeras (artificially constructed speech sounds created by combining normal speech sounds of a given language) to native speakers of the language from which the chimera components were drawn. However, the source of such novelty remains unclear. Our goal in this study was to separate the effects of chimeric tonal novelty in Mandarin speech from the effects of auditory signal manipulations. We recruited 20 native speakers of Mandarin and constructed two sets of lexical-tone chimeras by interchanging the envelopes and fine structures of both a falling/yi4/and a rising/yi2/Mandarin tone through 1, 2, 3, 4, 6, 8, 16, 32, and 64 auditory filter banks. We conducted pitch-perception ability tasks via a two-alternative, forced-choice paradigm to produce behavioral (versus physiological) pitch perception data. We also obtained electroencephalographic measurements through the scalp-recorded frequency-following response (FFR). Analyses of variances and post hoc Greenhouse-Geisser procedures revealed that the differences observed in the participants' reaction times and FFR measurements were attributable primarily to chimeric novelty rather than signal manipulation effects. These findings can be useful in assessing neuroplasticity and developing speech-processing strategies.


Phonetics , Speech Perception , Chimera , Electroencephalography , Humans , Pitch Perception
6.
Percept Mot Skills ; 128(1): 48-58, 2021 Feb.
Article En | MEDLINE | ID: mdl-32962538

In this study, we sought to evaluate the efficiencies of multiple machine learning algorithms in detecting neonates' Frequency Following Responses (FFRs). We recorded continuous brainwaves from 43 American neonates in response to a pre-recorded monosyllable/i/with a rising frequency contour. Recordings were classified into response and no response categories. Six response features were extracted from each recording and served as predictors in FFR identification. Twenty-three supervised machine learning algorithms with mean efficiency values of 86.0%, 94.4%, 97.2%, and 97.5% when 1, 10, 100, and 1000 random iterations were implemented, respectively. These high efficiency values obtained from the neonatal FFRs demonstrate that machine learning algorithms can help assess pitch processing in neonates and can be applied to auditory screening and intervention services for neonates at risk for disorders associated with decreased pitch processing.


Machine Learning , Humans , Infant, Newborn , United States
7.
Ear Hear ; 41(4): 918-934, 2020.
Article En | MEDLINE | ID: mdl-31688319

OBJECTIVES: This study aimed to compare the effects of increasing the interphase gap (IPG) on the neural response of the electrically stimulated cochlear nerve (CN) between children with CN deficiency (CND) and children with normal-sized CNs. DESIGN: Study participants included 30 children with CND and 30 children with normal-sized CNs. All subjects were implanted with a Cochlear Nucleus device with the internal electrode array 24RE[CA] in the test ear. The stimulus was a charge-balanced, cathodic leading, biphasic pulse with a pulse-phase duration of 50 µsec. For each subject, the electrically evoked compound action potential (eCAP) input/output (I/O) function was measured for 6 IPGs (i.e., 7, 14, 21, 28, 35, and 42 µsec) at 3 electrode locations across the electrode array. For each subject and each testing electrode, the highest stimulation used to measure the eCAP I/O function was the maximum comfortable level measured with an IPG of 42 µsec. Dependent variables (DVs) were the maximum eCAP amplitude, the eCAP threshold, and the slope of the eCAP I/O function estimated using both linear and sigmoidal regression functions. For each DV, the size of the IPG effect was defined as the proportional change relative to the result measured for the 7 µsec IPG at the basal electrode location. Generalized linear mixed effect models with subject group, electrode location, and IPG duration as the fixed effects and subject as the random effect were used to compare these DVs and the size of the IPG effect on these DVs. RESULTS: Children with CND showed smaller maximum eCAP amplitudes, higher eCAP thresholds, and smaller slopes of eCAP I/O function estimated using either linear or sigmoidal regression function than children with normal-sized CNs. Increasing the IPG duration resulted in larger maximum eCAP amplitudes, lower eCAP thresholds and larger slopes of eCAP I/O function estimated using sigmoidal regression function at all three electrode locations in both study groups. Compared with children with normal-sized CNs, children with CND showed larger IPG effects on both the maximum eCAP amplitude and the slope of the eCAP I/O function estimated using either linear or sigmoidal regression function, and a smaller IPG effect on the eCAP threshold than those measured in children with normal-sized CNs. CONCLUSIONS: Increasing the IPG increases responsiveness of the electrically stimulated CN in both children with CND and children with normal-sized CNs. The maximum eCAP amplitude and the slope of the eCAP I/O function measured in human listeners with poorer CN survival are more sensitive to changes in the IPG. In contrast, the eCAP threshold in listeners with poorer CN survival is less sensitive to increases in the IPG. Further studies are warranted to identify the best parameters of eCAP results for predicting CN survival before this eCAP testing paradigm can be used as a clinical tool for evaluating neural health for individual cochlear implant patients.


Cochlear Implantation , Cochlear Implants , Action Potentials , Auditory Threshold , Child , Cochlear Nerve , Electric Stimulation , Evoked Potentials, Auditory , Female , Humans , Interphase , Male
8.
J Am Acad Audiol ; 29(2): 125-134, 2018 02.
Article En | MEDLINE | ID: mdl-29401060

BACKGROUND: The scalp-recorded frequency-following response (FFR) has been widely accepted in assessing the brain's processing of speech stimuli for people who speak tonal and nontonal languages. Characteristics of scalp-recorded FFRs with increasing number of sweeps have been delineated through the use of an exponential curve-fitting model in Chinese adults; however, characteristics of speech processing for people who speak a nontonal language remain unclear. PURPOSE: This study had two specific aims. The first was to examine the characteristics of speech processing in neonates and adults who speak a nontonal language, to evaluate the goodness of fit of an exponential model on neonatal and adult FFRs, and to determine the differences, if any, between the two groups of participants. The second aim was to assess effective recording parameters for American neonates and adults. RESEARCH DESIGN: This investigation employed a prospective between-subject study design. STUDY SAMPLE: A total of 12 American neonates (1-3 days old) and 12 American adults (24.1 ± 2.5 yr old) were recruited. Each neonate passed an automated hearing screening at birth and all adult participants had normal hearing and were native English speakers. DATA COLLECTION AND ANALYSIS: The English vowel /i/ with a rising pitch contour (117-166 Hz) was used to elicit the FFR. A total of 8,000 accepted sweeps were recorded from each participant. Three objective indices (Frequency Error, Tracking Accuracy, and Pitch Strength) were computed to estimate the frequency-tracking acuity and neural phase-locking magnitude when progressively more sweeps were included in the averaged waveform. For each objective index, the FFR trends were fit to an exponential curve-fitting model that included estimates of asymptotic amplitude, noise amplitude, and a time constant. RESULTS: Significant differences were observed between groups for Frequency Error, Tracking Accuracy, and Pitch Strength of the FFR trends. The adult participants had significantly smaller Frequency Error (p < 0.001), better Tracking Accuracy (p = 0.001), and larger Pitch Strength (p = 0.003) values than the neonate participants. The adult participants also demonstrated a faster rate of improvement (i.e., a smaller time constant) in all three objective indices compared to the neonate participants. The smaller time constants observed in adults indicate that a larger number of sweeps will be needed to adequately assess the FFR for neonates. Furthermore, the exponential curve-fitting model provided a good fit to the FFR trends with increasing number of sweeps for American neonates (mean r2 = 0.89) and adults (mean r2 = 0.96). CONCLUSIONS: Significant differences were noted between the neonatal and adult participants for Frequency Error, Tracking Accuracy, and Pitch Strength. These differences have important clinical implications in determining when to stop a recording and the number of sweeps needed to adequately assess the frequency-encoding acuity and neural phase-locking magnitude in neonates and adults. These findings lay an important foundation for establishing a normative database for American neonates and adults, and may prove to be useful in the development of diagnostic and therapeutic paradigms for neonates and adults who speak a nontonal language.


Computer Simulation , Language , Pitch Perception/physiology , Speech Perception/physiology , Adult , Female , Humans , Infant, Newborn , Male , Prospective Studies , Reference Values , United States , Young Adult
9.
J Acoust Soc Am ; 142(3): EL270, 2017 09.
Article En | MEDLINE | ID: mdl-28964068

Processing speaker-specific information is an important task in daily communication. This study examined how fundamental frequency (F0) cues were encoded at the subcortical level, as reflected by scalp-recorded frequency-following responses, and their relationship with the listener's ability in processing speech stimuli produced by multiple speakers. By using Mandarin tones with distinctive F0 contours, the results indicated that subcortical frequency-coding errors were significantly correlated with the listener's speaker-variability intolerance for both percent correct and reaction time measures. These findings lay a foundation to help improve the understanding of how speaker information is processed in individuals with normal and impaired auditory systems.


Perceptual Masking , Speech Acoustics , Speech Perception/physiology , Acoustic Stimulation , Adult , Cues , Electroencephalography , Female , Healthy Volunteers , Humans , Male , Nervous System Physiological Phenomena , Reaction Time , Sound Spectrography , Statistics, Nonparametric , Young Adult
10.
J Acoust Soc Am ; 139(6): EL190, 2016 06.
Article En | MEDLINE | ID: mdl-27369171

Voice pitch carries important information for speech understanding. This study examines the neural representation of voice pitch at the subcortical level, as reflected by the scalp-recorded frequency-following responses from ten American and ten Chinese newborns. By utilizing a set of four distinctive Mandarin pitch contours that mimic the English vowel /yi/, the results indicate that the rising and dipping pitch contours produce significantly better tracking accuracy and larger response amplitudes than the falling pitch contour. This finding suggests a hierarchy of potential stimuli when testing neonates who are born in a tonal or non-tonal linguistic environment.


Brain/physiology , Evoked Potentials, Auditory , Phonetics , Pitch Perception , Speech Acoustics , Speech Perception , Voice Quality , Acoustic Stimulation , Acoustics , Age Factors , Audiometry, Speech , Brain Mapping/methods , Child Development , Cues , Electroencephalography , Female , Humans , Infant, Newborn , Male , Ohio , Sound Spectrography , Taiwan
11.
Percept Mot Skills ; 122(1): 123-35, 2016 Feb.
Article En | MEDLINE | ID: mdl-27420311

This study investigated the development of subcortical pitch processing, as reflected by the scalp-recorded frequency-following response, during early infancy. Thirteen Chinese infants who were born and raised in Mandarin-speaking households were recruited to partake in this study. Through a prospective-longitudinal study design, infants were tested twice: at 1-3 days after birth and at three months of age. A set of four contrastive Mandarin pitch contours were used to elicit frequency-following responses. Frequency Error and Pitch Strength were derived to represent the accuracy and magnitude of the elicited responses. Paired-samples t tests were conducted and demonstrated a significant decrease in Frequency Error and a significant increase in Pitch Strength at three months of age compared to 1-3 days after birth. Results indicated the developmental trajectory of subcortical pitch processing during the first three months of life.


Brain Waves/physiology , Brain/physiology , Child Development/physiology , Pitch Perception/physiology , Female , Humans , Infant , Infant, Newborn , Male , Prospective Studies
12.
Int J Audiol ; 55(1): 53-63, 2016.
Article En | MEDLINE | ID: mdl-26305289

OBJECTIVE: Previous research has shown the usefulness of utilizing auditory chimeras in assessing a listener's perception of the envelope and fine structure for an acoustic stimulus. However, research comparing and contrasting behavioral and electrophysiological responses to this stimulus type is scarce. DESIGN: Two sets of chimeric stimuli were constructed by interchanging the envelopes and fine-structures of the rising/yi(2)/and falling/yi(4)/Mandarin pitch contours that were filtered through 1, 2, 4, 8, 16, 32, and 64 frequency banks. Behavioral pitch-perception tasks were administered through a two-alternative, forced-choice paradigm. Electrophysiological responses were measured through scalp-recorded frequency-following responses (FFRs) to the lexical-tone chimeras. STUDY SAMPLE: Twenty American and twenty Chinese adults were recruited. RESULTS: A two-way analysis of variance showed significance (p < 0.05) within and across the filter bank and language background factors for the behavioral measurements, while the frequency-following response demonstrated a significance only across the filter banks. CONCLUSIONS: Perceptual importance of envelope cues increases starting from 16 filter banks, while the FFR accuracy and magnitude decreases with increasing number of filter banks. These results can be useful in assessing experience-dependent neuroplasticity and in designing speech processing strategies for cochlear-implant users who speak tonal or non-tonal languages around the globe.


Acoustic Stimulation/methods , Chimera , Cues , Evoked Potentials, Auditory/physiology , Pitch Perception/physiology , Adult , Analysis of Variance , China , Electroencephalography , Female , Healthy Volunteers , Humans , Language , Male , Phonetics , United States , Young Adult
13.
Percept Mot Skills ; 118(3): 681-90, 2014 Jun.
Article En | MEDLINE | ID: mdl-25068740

Although scalp-recorded frequency-following response (FFR) to voice pitch has shown great potential to examine pitch processing mechanisms in human participants and animals, few reports have addressed the test-retest reliability of such a response in an animal model. The purpose of this study was to investigate the feasibility and reliability of recording such a response in an animal model and to evaluate the extent to which the response could be separated from background noise. A Chinese monosyllable with a rising pitch was used to elicit the FFR to voice pitch in four guinea pigs. Four objective measures (Root-Mean-Square, Amplitude, Tracking Accuracy, Frequency Error, and Slope Error) were computed from recorded brain waves and were used to examine the phase-locking magnitude and test-retest reliability of the response. Results demonstrated that the animal model produced FFR trends that were repeatable, reliable, and significantly different from responses to the background noise.


Brain/physiology , Guinea Pigs/physiology , Neuropsychological Tests/standards , Pitch Perception/physiology , Animals , Electroencephalography , Feasibility Studies , Female , Male , Pilot Projects , Reproducibility of Results
14.
Percept Mot Skills ; 116(2): 456-65, 2013 Apr.
Article En | MEDLINE | ID: mdl-24032322

To date, observations of the scalp-recorded frequency-following response (FFR) to voice pitch have depended on subjective interpretation of the experimenter. The purpose of this study was to develop and evaluate an automated procedure for detecting the presence of a response. Twenty American (9 boys, 1-3 days) and 20 Chinese (10 boys, 1-3 days) neonates were recruited. A Chinese monosyllable that mimicked the English vowel /i/ with a rising pitch (117-166 Hz) was used as the stimulus. Three objective indices (Frequency Error, Tracking Accuracy, and Pitch Strength) were computed from the recorded brain waves and the test results were compared with human judgments to calculate the sensitivity and specificity values. Results demonstrated that the automated procedure produced sensitivity values between 53-90% and specificity values between 80-100%, and could be used to assess the presence of an FFR for neonates who were born in a tonal or non-tonal language environment.


Brain/physiology , Neuropsychological Tests/standards , Pitch Perception/physiology , Speech Perception/physiology , China , Cross-Cultural Comparison , Electroencephalography , Female , Humans , Infant, Newborn , Male , Psycholinguistics/instrumentation , Psycholinguistics/methods , Psychometrics/instrumentation , Sensitivity and Specificity , United States
15.
Audiol Neurootol ; 17(2): 112-20, 2012.
Article En | MEDLINE | ID: mdl-21865723

OBJECTIVES: Repetitive transcranial magnetic stimulation (rTMS), a noninvasive method for altering cortical excitability, is becoming a therapeutic strategy in auditory research institutions worldwide. Application of inhibiting rTMS on these overactive cortical regions can result in effective tinnitus suppression. The aim of this study is to investigate the efficacy of theta-burst rTMS in patients with chronic tinnitus. STUDY DESIGN: Parallel randomized control study. SETTING: Tertiary referral center. PATIENTS: We enrolled 2 female and 20 male patients in this study. The evaluative tools included tinnitus frequency- and loudness-matching, tinnitus questionnaires (TQ), and the Tinnitus Handicap Inventory (THI). METHODS: The orthogonal projection of the auditory cortex on the scalp was focalized. A figure-eight coil was placed on the surface of the skull over the targeted region with the intensity setting at 80% of the resting motor threshold. We delivered 900 pulses of theta-burst rTMS daily for 10 business days. MAIN OUTCOME MEASURES: Nine of twelve patients (75%) in the active-stimulation group reported tinnitus suppression following treatment with rTMS. The treatment led to reductions of 8.58 and 8.33 in the mean TQ global and THI scores, respectively. Tinnitus loudness also decreased significantly after delivering rTMS. RESULTS: Descriptive analysis of the TQs revealed that patients experienced significant improvements in emotional distress levels and somatic symptoms. CONCLUSIONS: Our preliminary results demonstrate that theta-burst rTMS treatments offer a method of modulating tinnitus. Patients could benefit from emotional improvements, even more than auditory perceptive relief. Further studies are needed to establish a standard protocol and to clarify nervous propagation along the auditory and psychological projection following treatment with rTMS.


Auditory Cortex , Tinnitus/therapy , Transcranial Magnetic Stimulation/methods , Adult , Aged , Chronic Disease , Female , Humans , Male , Middle Aged , Surveys and Questionnaires , Treatment Outcome
16.
Percept Mot Skills ; 113(1): 67-86, 2011 Aug.
Article En | MEDLINE | ID: mdl-21987910

When the fundamental frequency (f0) is removed from a complex stimulus, the pitch of the f0 is still perceived by the listener. Through the use of the scalp-recorded frequency-following response, this study examined the relative contributions of thef0 and its harmonics in pitch processing by systematically manipulating the speech stimulus to remove component frequencies. 12 American and 12 Chinese adults were recruited. There were statistically significant effects of pitch strength and frequency error for the experimental-condition factor. There were significantly larger responses to the harmonics-only conditions than those obtained in the f0-only and control conditions. No statistically significant difference was observed between the two groups of participants. These findings indicate that neural responses associated with individual harmonics dominate the pitch processing in the human brainstem, irrespective of whether the listener's native language is nontonal or tonal.


Cross-Cultural Comparison , Language , Pitch Perception , Sound Spectrography , Speech Acoustics , Speech Perception , Acoustic Stimulation , Adult , Cues , Female , Humans , Male , Phonetics , Young Adult
17.
Int J Audiol ; 50(9): 582-93, 2011 Sep.
Article En | MEDLINE | ID: mdl-21722020

OBJECTIVE: Recent studies have shown that the frequency-following response (FFR) to voice pitch can be a useful method to evaluate the signal-processing mechanisms and neural plasticity in the human brainstem. The purpose of this study was to examine the quantitative properties of the FFR trends with an exponential curve-fitting model. DESIGN: FFR trends obtained with increasing number of sweeps (up to 8000 sweeps) at three stimulus intensities (30, 45, and 60 dB nHL) were fit to an exponential model that consisted of estimates of background noise amplitude, asymptotic response amplitude, and a time constant. Five objective indices (Frequency Error, Slope Error, Tracking Accuracy, Pitch Strength and RMS Ratio) were used to represent different perspectives of pitch processing in the human brainstem. STUDY SAMPLE: Twenty-three native speakers (16 males; age = 24.7 ± 2.1 years) of Mandarin Chinese were recruited. RESULTS: The results demonstrated that the exponential model provided a good fit (r(2) = 0.89 ± 0.10) to the FFR trends with increasing number of sweeps for the five objective indices. CONCLUSIONS: The exponential model, combined with the five objective indices, can be used for difficult-to-test patients and may prove to be useful as an assessment and diagnostic method in both clinical and basic research efforts.


Evoked Potentials, Auditory, Brain Stem/physiology , Models, Neurological , Neuronal Plasticity/physiology , Pitch Perception/physiology , Speech Perception/physiology , Voice , Adult , Algorithms , Asian People , Brain Stem/physiology , Female , Humans , Language , Male , Signal-To-Noise Ratio , Speech Acoustics , Young Adult
18.
Ear Hear ; 32(6): 699-707, 2011.
Article En | MEDLINE | ID: mdl-21543983

OBJECTIVES: Cross-language studies, as reflected by the scalp-recorded frequency-following response (FFR) to voice pitch, have shown the influence of dominant linguistic environments on the encoding of voice pitch at the brainstem level in normal-hearing adults. Research questions that remained unanswered included the characteristics of the FFR to voice pitch in neonates during their immediate postnatal period and the relative contributions of the biological capacities present at birth versus the influence of the listener's postnatal linguistic experience. The purpose of this study was to investigate the characteristics of FFR to voice pitch in neonates during their first few days of life and to examine the relative contributions of the "biological capacity" versus "linguistic experience" influences on pitch processing in the human brainstem. DESIGN: Twelve American neonates (five males, 1-3 days old) and 12 Chinese neonates (seven males, 1-3 days old) were recruited to examine the characteristics of the FFRs during their immediate postnatal days of life. Twelve American adults (three males; age: mean ± SD = 24.6 ± 3.0 yr) and 12 Chinese adults (six males; age: mean ± SD = 25.3 ± 2.6 yr) were also recruited to determine the relative contributions of biological and linguistic influences. A Chinese monosyllable that mimics the English vowel /i/ with a rising pitch (117-166 Hz) was used to elicit the FFR to voice pitch in all participants. RESULTS: Two-way analysis of variance (i.e., the language [English versus Chinese] and age [neonate versus adult] factors) showed a significant difference in Pitch Strength for language (p = 0.035, F = 4.716). A post hoc Tukey-Kramer analysis further demonstrated that Chinese adults had significantly larger Pitch Strength values than Chinese neonates (p = 0.024). This finding, coupled with the fact that American neonates and American adults had comparable Pitch Strength values, supported the linguistic experience model. On the other hand, Pitch Strength obtained from the American neonates, American adults, and Chinese neonates were not significantly different from each other, supporting the biological capacity model. CONCLUSIONS: This study demonstrated an early maturation of voice-pitch processing in neonates starting from 1 to 3 days after birth and a significant effect of linguistic experience on the neural processing of voice pitch at the brainstem level. These findings provide a significant conceptual advancement and a basis for further examination of developmental maturation of subcortical representation of speech features, such as pitch, timing, and harmonics. These findings can also be used to help identify neonates at risk for delays in voice-pitch perception and provide new directions for preventive and therapeutic interventions for patients with central auditory processing deficits, hearing loss, and other types of communication disorders.


Brain Stem/physiology , Language , Phonetics , Pitch Perception/physiology , Speech Perception/physiology , Voice/physiology , Adult , Age Factors , Brain Stem/growth & development , Child Language , China/epidemiology , Female , Humans , Infant, Newborn , Language Development Disorders/ethnology , Language Development Disorders/physiopathology , Male , Psycholinguistics , Risk Assessment/methods , Risk Factors , United States/epidemiology , Young Adult
19.
J Acoust Soc Am ; 129(1): EL21-6, 2011 Jan.
Article En | MEDLINE | ID: mdl-21302977

Speech communication usually occurs in the presence of background noise. This study examined noise tolerance in the brainstem's processing of voice pitch, as reflected by the scalp-recorded frequency-following response (FFR) from 12 normal-hearing adults. By systematically manipulating signal-to-noise ratio (SNR) across three different stimulus intensities, the results indicated that Frequency Error, Slope Error, and Tracking Accuracy remained relatively stable until SNR was degraded to 0 dB or lower (i.e., a turning point). This turning point not only provided physiological evidence supporting pitch tolerance of noise but also allowed recommendation of a minimal SNR when evaluating pitch processing in difficult-to-test patients.


Brain Stem/physiology , Noise/adverse effects , Perceptual Masking , Pitch Discrimination , Signal Detection, Psychological , Speech Acoustics , Speech Perception , Voice Quality , Acoustic Stimulation , Adult , Audiometry , Auditory Threshold , Electroencephalography , Evoked Potentials, Auditory, Brain Stem , Female , Humans , Male , Sound Spectrography , Young Adult
20.
Int J Audiol ; 50(1): 14-26, 2011 Jan.
Article En | MEDLINE | ID: mdl-21047294

OBJECTIVE: Voice pitch carries important cues for speech perception in humans. Recent studies have shown the feasibility of recording the frequency-following response (FFR) to voice pitch in normal-hearing listeners. The presence of such a response, however, has been dependent on subjective interpretation of experimenters. The purpose of this study was to develop and test an automated procedure including a control-experimental protocol and response-threshold criteria suitable for extracting FFRs to voice pitch, and compare the results to human judgments. DESIGN: A set of four Mandarin tones (Tone 1 flat; Tone 2 rising; Tone 3 dipping; and Tone 4 falling) were prepared to reflect the four contrastive pitch contours. Two distinctive algorithms, short-term autocorrelation in the time domain and narrow-band spectrogram in the frequency domain, were used to estimate the Frequency Error, Slope Error, Tracking Accuracy, Pitch Strength and Pitch-Noise Ratio of the recordings from individual listeners as well as the power and false-positive rates of each algorithm. STUDY SAMPLE: Eleven native speakers (five males; age: mean ± SD = 31.4 ± 4.7 years) of Mandarin Chinese were recruited. RESULTS: The results demonstrated that both algorithms were suitable for extracting FFRs and the objective measures showed comparable results to human judgments. CONCLUSIONS: The automated procedure used in this study, including the use of the control-experimental protocol and response thresholds used for each of the five objective indices, can be used for difficult-to-test patients and may prove to be useful as an assessment and diagnostic method in both clinical and basic research efforts.


Algorithms , Evoked Potentials, Auditory, Brain Stem , Pitch Perception , Signal Processing, Computer-Assisted , Speech Acoustics , Speech Perception , Voice , Acoustic Stimulation , Adult , Audiometry, Speech , Auditory Threshold , Cues , Female , Humans , Language , Male , Phonetics , Sound Spectrography , Time Factors
...