Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 1.538
Filter
1.
Noise Health ; 26(121): 220-225, 2024.
Article in English | MEDLINE | ID: mdl-38904826

ABSTRACT

AIMS: Digital noise reduction (DNR) minimizes the effect of noise on speech signals by continuously monitoring frequency bands in the presence of noise. In the present study, we explored the effect of DNR technology on speech intelligibility in individuals using hearing aids (HAs) and investigated implications for daily use. METHODS AND MATERIAL: Eighteen participants with bilateral moderate sensorineural hearing loss (aged 16-45 years) were included. Bilateral receiver-in-the-ear HAs were fitted in the participants. The adaptive and nonadaptive (with a signal-to-noise ratio (SNR) of +5 and -5 dB, respectively) Turkish matrix sentence test (TURMatrix) in noise and free-field hearing assessments, including hearing thresholds with hearing aids, speech recognition thresholds (SRT), and speech discrimination scores, were conducted in two different conditions: HA in the DNR-on and DNR-off conditions. RESULTS: No significant difference was observed between free-field hearing assessments with the HA in the DNR-off and DNR-on conditions (P > 0.05). Furthermore, the adaptive and nonadaptive TURMatrix revealed significant differences between the scores under the DNR-on and DNR-off conditions (P < 0.05). Nevertheless, under the DNR-on condition, there was no correlation between free-field hearing assessments with HA and TURMatrix results (P > 0.05). However, a significant correlation was observed between SRT scores with HA and TURMatrix scores (adaptive and nonadaptive, +5 and -5 dB SNR, respectively) under the DNR-off condition (P < 0.05). CONCLUSION: Our study findings suggest that DNR can improve speech intelligibility in noisy environments. Therefore, DNR can enhance an individual's auditory comfort by improving their capacity to grasp speech in background noise.


Subject(s)
Hearing Aids , Hearing Loss, Sensorineural , Noise , Speech Intelligibility , Humans , Adult , Male , Middle Aged , Hearing Loss, Sensorineural/rehabilitation , Female , Young Adult , Adolescent , Signal-To-Noise Ratio , Auditory Threshold , Speech Perception , Speech Reception Threshold Test
2.
Codas ; 36(3): e20230091, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38836822

ABSTRACT

PURPOSE: To propose an instrument for assessing speech recognition in the presence of competing noise. To define its application strategy for use in clinical practice. To obtain evidence of criterion validity and present reference values. METHODS: The study was conducted in three stages: Organization of the material comprising the Word-with-Noise Test (Stage 1); Definition of the instrument's application strategy (Stage 2); Investigation of criterion validity and definition of reference values for the test (Stage 3) through the evaluation of 50 normal-hearing adult subjects and 12 subjects with hearing loss. RESULTS: The Word-with-Noise Test consists of lists of monosyllabic and disyllabic words and speech spectrum noise (Stage 1). The application strategy for the test was defined as the determination of the Speech Recognition Threshold with a fixed noise level at 55 dBHL (Stage 2). Regarding criterion validity, the instrument demonstrated adequate ability to distinguish between normal-hearing subjects and subjects with hearing loss (Stage 3). Reference values for the test were established as cut-off points expressed in terms of signal-to-noise ratio: 1.47 dB for the monosyllabic stimulus and -2.02 dB for the disyllabic stimulus. Conclusion: The Word-with-Noise Test proved to be quick to administer and interpret, making it a useful tool in audiological clinical practice. Furthermore, it showed satisfactory evidence of criterion validity, with established reference values.


OBJETIVO: Propor um instrumento para a avaliação do reconhecimento de fala na presença de ruído competitivo. Definir sua estratégia de aplicação, para ser aplicado na rotina clínica. Obter evidências de validade de critério e apresentar seus valores de referência. MÉTODO: Estudo realizado em três etapas: Organização do material que compôs o Teste de Palavras no Ruído (Etapa 1); Definição da estratégia de aplicação do instrumento (Etapa 2); Investigação da validade de critério e definição dos valores de referência para o teste (Etapa 3), por meio da avaliação de 50 sujeitos adultos normo-ouvintes e 12 sujeitos com perda auditiva. RESULTADOS: O Teste de Palavras no Ruído é composto por listas de vocábulos mono e dissilábicos e um ruído com espectro de fala (Etapa 1). Foi definida como estratégia de aplicação do teste, a realização do Limiar de Reconhecimento de Fala com ruído fixo em 55 dBNA (Etapa 2). Quanto à validade de critério, o instrumento apresentou adequada capacidade de distinção entre os sujeitos normo-ouvintes e os sujeitos com perda auditiva (Etapa 3). Foram definidos como valores de referência para o teste, os pontos de corte expressos em relação sinal/ruído de 1,47 dB para o estímulo monossilábico e de -2,02 dB para o dissilábico. CONCLUSÃO: O Teste de Palavras no Ruído demonstrou ser rápido e de fácil aplicação e interpretação dos resultados, podendo ser uma ferramenta útil a ser utilizada na rotina clínica audiológica. Além disso, apresentou evidências satisfatórias de validade de critério, com valores de referência estabelecidos.


Subject(s)
Noise , Humans , Reference Values , Adult , Female , Male , Young Adult , Reproducibility of Results , Middle Aged , Speech Perception/physiology , Signal-To-Noise Ratio , Auditory Threshold/physiology , Case-Control Studies , Hearing Loss/diagnosis , Hearing Loss/physiopathology , Speech Reception Threshold Test/methods , Speech Reception Threshold Test/standards , Aged , Adolescent
3.
Technol Health Care ; 32(S1): 197-206, 2024.
Article in English | MEDLINE | ID: mdl-38759049

ABSTRACT

BACKGROUND: The speech reception threshold (SRT), synonymous with the speech recognition threshold, denotes the minimum hearing level required for an individual to discern 50% of presented speech material. This threshold is measured independently in each ear with a repetitive up-down adjustment of stimulus level starting from the initial SRT value derived from pure tone thresholds (PTTs), measured via pure-tone audiometry (PTA). However, repetitive adjustments in the test contributes to increased fatigue for both patients and audiologists, compromising the reliability of the hearing tests. OBJECTIVE: Determining the first (initial) sound level closer to the finally determined SRT value, is important to reduce the number of repetitions. The existing method to determine the initial sound level is to average the PTTs called pure tone average (PTAv). METHODS: We propose a novel method using a machine learning approach to estimate a more optimal initial sound level for the SRT test. Specifically, a convolutional neural network with 1-dimensional filters (1D CNN) was implemented to predict a superior initial level than the conventional methods. RESULTS: Our approach produced a reduction of 37.92% in the difference between the initial stimulus level and the final SRT value. CONCLUSIONS: This outcome substantiates that our approach can reduce the repetitions for finding the final SRT, and, as the result, the hearing test time can be reduced.


Subject(s)
Audiometry, Pure-Tone , Speech Reception Threshold Test , Humans , Speech Reception Threshold Test/methods , Audiometry, Pure-Tone/methods , Adult , Male , Female , Machine Learning , Reproducibility of Results , Auditory Threshold/physiology , Neural Networks, Computer , Speech Perception/physiology
4.
Hear Res ; 448: 109031, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38761554

ABSTRACT

In recent studies, psychophysiological measures have been used as markers of listening effort, but there is limited research on the effect of hearing loss on such measures. The aim of the current study was to investigate the effect of hearing acuity on physiological responses and subjective measures acquired during different levels of listening demand, and to investigate the relationship between these measures. A total of 125 participants (37 males and 88 females, age range 37-72 years, pure-tone average hearing thresholds at the best ear between -5.0 to 68.8 dB HL and asymmetry between ears between 0.0 and 87.5 dB) completed a listening task. A speech reception threshold (SRT) test was used with target sentences spoken by a female voice masked by male speech. Listening demand was manipulated using three levels of intelligibility: 20 % correct speech recognition, 50 %, and 80 % (IL20 %/IL50 %/IL80 %, respectively). During the task, peak pupil dilation (PPD), heart rate (HR), pre-ejection period (PEP), respiratory sinus arrhythmia (RSA), and skin conductance level (SCL) were measured. For each condition, subjective ratings of effort, performance, difficulty, and tendency to give up were also collected. Linear mixed effects models tested the effect of intelligibility level, hearing acuity, hearing asymmetry, and tinnitus complaints on the physiological reactivity (compared to baseline) and subjective measures. PPD and PEP reactivity showed a non-monotonic relationship with intelligibility level, but no such effects were found for HR, RSA, or SCL reactivity. Participants with worse hearing acuity had lower PPD at all intelligibility levels and showed lower PEP baseline levels. Additionally, PPD and SCL reactivity were lower for participants who reported suffering from tinnitus complaints. For IL80 %, but not IL50 % or IL20 %, participants with worse hearing acuity rated their listening effort to be relatively high compared to participants with better hearing. The reactivity of the different physiological measures were not or only weakly correlated with each other. Together, the results suggest that hearing acuity may be associated with altered sympathetic nervous system (re)activity. Research using psychophysiological measures as markers of listening effort to study the effect of hearing acuity on such measures are best served by the use of the PPD and PEP.


Subject(s)
Auditory Threshold , Hearing , Heart Rate , Speech Intelligibility , Speech Perception , Speech Reception Threshold Test , Humans , Male , Female , Middle Aged , Adult , Aged , Audiometry, Pure-Tone , Acoustic Stimulation , Perceptual Masking , Galvanic Skin Response , Pupil/physiology , Persons With Hearing Impairments/psychology
5.
J Acoust Soc Am ; 155(5): 2934-2947, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38717201

ABSTRACT

Spatial separation and fundamental frequency (F0) separation are effective cues for improving the intelligibility of target speech in multi-talker scenarios. Previous studies predominantly focused on spatial configurations within the frontal hemifield, overlooking the ipsilateral side and the entire median plane, where localization confusion often occurs. This study investigated the impact of spatial and F0 separation on intelligibility under the above-mentioned underexplored spatial configurations. The speech reception thresholds were measured through three experiments for scenarios involving two to four talkers, either in the ipsilateral horizontal plane or in the entire median plane, utilizing monotonized speech with varying F0s as stimuli. The results revealed that spatial separation in symmetrical positions (front-back symmetry in the ipsilateral horizontal plane or front-back, up-down symmetry in the median plane) contributes positively to intelligibility. Both target direction and relative target-masker separation influence the masking release attributed to spatial separation. As the number of talkers exceeds two, the masking release from spatial separation diminishes. Nevertheless, F0 separation remains as a remarkably effective cue and could even facilitate spatial separation in improving intelligibility. Further analysis indicated that current intelligibility models encounter difficulties in accurately predicting intelligibility in scenarios explored in this study.


Subject(s)
Cues , Perceptual Masking , Sound Localization , Speech Intelligibility , Speech Perception , Humans , Female , Male , Young Adult , Adult , Speech Perception/physiology , Acoustic Stimulation , Auditory Threshold , Speech Acoustics , Speech Reception Threshold Test , Noise
6.
Trends Hear ; 28: 23312165241253653, 2024.
Article in English | MEDLINE | ID: mdl-38715401

ABSTRACT

This study aimed to preliminarily investigate the associations between performance on the integrated Digit-in-Noise Test (iDIN) and performance on measures of general cognition and working memory (WM). The study recruited 81 older adult hearing aid users between 60 and 95 years of age with bilateral moderate to severe hearing loss. The Chinese version of the Montreal Cognitive Assessment Basic (MoCA-BC) was used to screen older adults for mild cognitive impairment. Speech reception thresholds (SRTs) were measured using 2- to 5-digit sequences of the Mandarin iDIN. The differences in SRT between five-digit and two-digit sequences (SRT5-2), and between five-digit and three-digit sequences (SRT5-3), were used as indicators of memory performance. The results were compared to those from the Digit Span Test and Corsi Blocks Tapping Test, which evaluate WM and attention capacity. SRT5-2 and SRT5-3 demonstrated significant correlations with the three cognitive function tests (rs ranging from -.705 to -.528). Furthermore, SRT5-2 and SRT5-3 were significantly higher in participants who failed the MoCA-BC screening compared to those who passed. The findings show associations between performance on the iDIN and performance on memory tests. However, further validation and exploration are needed to fully establish its effectiveness and efficacy.


Subject(s)
Cognition , Cognitive Dysfunction , Hearing Aids , Memory, Short-Term , Humans , Aged , Female , Male , Middle Aged , Aged, 80 and over , Memory, Short-Term/physiology , Cognitive Dysfunction/diagnosis , Noise/adverse effects , Speech Perception/physiology , Speech Reception Threshold Test , Age Factors , Persons With Hearing Impairments/psychology , Persons With Hearing Impairments/rehabilitation , Hearing Loss/rehabilitation , Hearing Loss/diagnosis , Hearing Loss/psychology , Mental Status and Dementia Tests , Memory , Acoustic Stimulation , Predictive Value of Tests , Correction of Hearing Impairment/instrumentation , Auditory Threshold
7.
Int J Pediatr Otorhinolaryngol ; 180: 111928, 2024 May.
Article in English | MEDLINE | ID: mdl-38593717

ABSTRACT

OBJECTIVES: Communicating in noisy settings can be difficult due to interference and environmental noise, which can impact intelligibility for those with hearing impairments and those with normal hearing threshold. Speech intelligibility is commonly assessed in audiology through speech audiometry in quiet environments. Nevertheless, this test may not effectively assess hearing challenges in noisy environments, as total silence is rare in daily activities. A recently patented method, known as the SRT50 FAST, has been developed for conducting speech audiometry in noise. This new method enables the acceleration and simplification of free field speech audiometry tests involving competition noise. This study aims to establish normative scores and standardize the SRT50 FAST method as a test for evaluating speech perception in noise in pediatric patients. METHODS: The study included 30 participants with normal hearing, consisting of 11 females and 19 males, ranging in age from 6 to 11 years. A series of speech audiometry tests were conducted to determine the speech reception threshold 50% (SRT50) in competing conditions. This included testing both the fast mode (SRT50 FAST) currently being studied and the traditional method (SRT50 CLASSIC). The SRT50, or Signal to Noise Ratio (SNR) at which 50% of speech recognition occurred, was investigated for both methods. RESULTS: The mean SRT50 FAST test score was -2.69 (SD = 3.15). The dataset exhibited a normal distribution with values ranging from 3.60 to -8.60. Since the scores are expressed in SRT, higher scores indicate poorer performance. We have established a threshold of 3.60 as the upper limit of the normal range, therefore, patients with scores above this threshold are considered to have abnormal results. CONCLUSIONS: This study aimed to establish normative data for the evaluation of free field speech in noise recognition using the SRT50 FAST method in the pediatric population. This method accurately investigates the necessary signal-to-noise ratio for achieving 50% recognition scores with bisyllabic words in a quick manner. The ultimate objective is to employ this test to identify the optimal configuration of hearing rehabilitation devices, particularly for pediatric patients with hearing aids and/or cochlear implants. Additionally, it can be used to assess pediatric patients with unilateral hearing loss.


Subject(s)
Noise , Speech Perception , Humans , Male , Female , Child , Speech Perception/physiology , Reference Values , Speech Reception Threshold Test , Auditory Threshold/physiology , Audiometry, Speech/methods , Signal-To-Noise Ratio
8.
BMC Res Notes ; 17(1): 107, 2024 Apr 17.
Article in English | MEDLINE | ID: mdl-38632663

ABSTRACT

OBJECTIVE: Early detection and effective management of hearing loss constitute the key to improving the quality of life of individuals with hearing loss. However, in standardized pure tone audiometry, it is sometimes difficult for elderly patients to understand and follow all instructions. Audiologists also require time, expertise, and patience to ensure that an elderly can identify the faintest levels of stimuli during a hearing test. Therefore, this study aimed to devise and validate a formula to predict the pure tone threshold at each frequency across 0.5-4 kHz (PTTs) using speech reception threshold. METHODS: The 1226 audiograms of hearing-impaired individuals aged 60-90 years were reviewed. The random sample function randomly assigned 613 participants to the training and testing sets each. A linear model was created to predict the PTT value at each frequency based on variables significant at all frequencies across 0.5-4 kHz. The adjusted-R2 value was considered to indicate the performance of the predictive model. Pearson's correlation coefficient was used to describe the relationship between the actual and predicted PTT at 0.5, 1, 2, and 4 kHz among the testing set to measure the performance of the proposed model. RESULTS: The predictive model was devised using variables based on the speech recognition threshold (SRT) after adjusting with age in the training set. The overall prediction accuracy demonstrated a higher adjusted-R2 ranging from 0.74 to 0.89 at frequencies of 0.5, 1, and 2 kHz, whereas a low percentage of explained variance was observed at 4 kHz (adjusted-R2 = 0.41). This predictive model can serve as an adjunctive clinical tool for guiding determination of the PTTs. Moreover, the predicted PTTs can be applied in the hearing aid programming software to set appropriate hearing aid gain using standard prescriptive formulas.


Subject(s)
Hearing Loss , Speech Perception , Aged , Humans , Hearing , Quality of Life , Speech , Speech Reception Threshold Test , Middle Aged , Aged, 80 and over
9.
HNO ; 72(7): 504-514, 2024 Jul.
Article in German | MEDLINE | ID: mdl-38536465

ABSTRACT

BACKGROUND: Binaural hearing enables better speech comprehension in noisy environments and is necessary for acoustic spatial orientation. This study investigates speech discrimination in noise with separated signal sources and measures sound localization. The aim was to study characteristics and reproducibility of two selected measurement techniques which seem to be suitable for description of the aforementioned aspects of binaural hearing. MATERIALS AND METHODS: Speech reception thresholds (SRT) in noise and test-retest reliability were collected from 55 normal-hearing adults for a spatial setup of loudspeakers with angles of ±â€¯45° and ±â€¯90° using the Oldenburg sentence test. The investigations of sound localization were conducted in a semicircle and fullcircle setup (7 and 12 equidistant loudspeakers). RESULTS: SRT (S-45N45: -14.1 dB SNR; S45N-45: -16.4 dB SNR; S0N90: -13.1 dB SNR; S0N-90: -13.4 dB SNR) and test-retest reliability (4 to 6 dB SNR) were collected for speech intelligibility in noise with separated signals. The procedural learning effect for this setup could only be mitigated with 120 training sentences. Significantly smaller SRT values, resulting in better speech discrimination, were found for the test situation of the right compared to the left ear. RMS values could be gathered for sound localization in the semicircle (1,9°) as well as in the fullcircle setup (11,1°). Better results were obtained in the retest of the fullcircle setup. CONCLUSION: When using the Oldenburg sentence test in noise with spatially separated signals, it is mandatory to perform a training session of 120 sentences in order to minimize the procedural learning effect. Ear-specific SRT values for speech discrimination in noise with separated signal sources are required, which is probably due to the right-ear advantage. A training is recommended for sound localization in the fullcircle setup.


Subject(s)
Noise , Sound Localization , Speech Perception , Humans , Sound Localization/physiology , Reproducibility of Results , Female , Adult , Male , Speech Perception/physiology , Young Adult , Sensitivity and Specificity , Speech Reception Threshold Test/methods , Acoustic Stimulation/methods , Speech Discrimination Tests/methods
10.
Trends Hear ; 28: 23312165241229057, 2024.
Article in English | MEDLINE | ID: mdl-38483979

ABSTRACT

A practical speech audiometry tool is the digits-in-noise (DIN) test for hearing screening of populations of varying ages and hearing status. The test is usually conducted by a human supervisor (e.g., clinician), who scores the responses spoken by the listener, or online, where software scores the responses entered by the listener. The test has 24-digit triplets presented in an adaptive staircase procedure, resulting in a speech reception threshold (SRT). We propose an alternative automated DIN test setup that can evaluate spoken responses whilst conducted without a human supervisor, using the open-source automatic speech recognition toolkit, Kaldi-NL. Thirty self-reported normal-hearing Dutch adults (19-64 years) completed one DIN + Kaldi-NL test. Their spoken responses were recorded and used for evaluating the transcript of decoded responses by Kaldi-NL. Study 1 evaluated the Kaldi-NL performance through its word error rate (WER), percentage of summed decoding errors regarding only digits found in the transcript compared to the total number of digits present in the spoken responses. Average WER across participants was 5.0% (range 0-48%, SD = 8.8%), with average decoding errors in three triplets per participant. Study 2 analyzed the effect that triplets with decoding errors from Kaldi-NL had on the DIN test output (SRT), using bootstrapping simulations. Previous research indicated 0.70 dB as the typical within-subject SRT variability for normal-hearing adults. Study 2 showed that up to four triplets with decoding errors produce SRT variations within this range, suggesting that our proposed setup could be feasible for clinical applications.


Subject(s)
Speech Perception , Adult , Humans , Speech Reception Threshold Test , Audiometry, Speech , Noise , Hearing Tests
11.
Ear Hear ; 45(4): 860-877, 2024.
Article in English | MEDLINE | ID: mdl-38334698

ABSTRACT

OBJECTIVES: The Children's English and Spanish Speech Recognition (ChEgSS) test is a computer-based tool for assessing closed-set word recognition in English and in Spanish, with a masker that is either speech-shaped noise or competing speech. The present study was conducted to (1) characterize the psychometric properties of the ChEgSS test, (2) evaluate feasibility and reliability for a large cohort of Spanish/English bilingual children with normal hearing, and (3) establish normative data. DESIGN: Three experiments were conducted to evaluate speech perception in children (4-17 years) and adults (19-40 years) with normal hearing using the ChEgSS test. In Experiment 1, data were collected from Spanish/English bilingual and English monolingual adults at multiple, fixed signal-to-noise ratios. Psychometric functions were fitted to the word-level data to characterize variability across target words in each language and in each masker condition. In Experiment 2, Spanish/English bilingual adults were tested using an adaptive tracking procedure to evaluate the influence of different target-word normalization approaches on the reliability of estimates of masked-speech recognition thresholds corresponding to 70.7% correct word recognition and to determine the optimal number of reversals needed to obtain reliable estimates. In Experiment 3, Spanish/English bilingual and English monolingual children completed speech perception testing using the ChEgSS test to (1) characterize feasibility across age and language group, (2) evaluate test-retest reliability, and (3) establish normative data. RESULTS: Experiments 1 and 2 yielded data that are essential for stimulus normalization, optimizing threshold estimation procedures, and interpreting threshold data across test language and masker type. Findings obtained from Spanish/English bilingual and English monolingual children with normal hearing in Experiment 3 support feasibility and demonstrate reliability for use with children as young as 4 years of age. Equivalent results for testing in English and Spanish were observed for Spanish/English bilingual children, contingent on adequate proficiency in the target language. Regression-based threshold norms were established for Spanish/English bilingual and English monolingual children between 4 and 17 years of age. CONCLUSIONS: The present findings indicate the ChEgSS test is appropriate for testing a wide age range of children with normal hearing in either Spanish, English, or both languages. The ChEgSS test is currently being evaluated in a large cohort of patients with hearing loss at pediatric audiology clinics across the United States. Results will be compared with normative data established in the present study and with established clinical measures used to evaluate English- and Spanish-speaking children. Questionnaire data from parents and clinician feedback will be used to further improve test procedures.


Subject(s)
Feasibility Studies , Multilingualism , Psychometrics , Speech Perception , Humans , Child , Adolescent , Child, Preschool , Male , Female , Adult , Young Adult , Reproducibility of Results , Reference Values , Speech Reception Threshold Test/methods , Perceptual Masking , Language
12.
J Acoust Soc Am ; 155(2): 867-878, 2024 02 01.
Article in English | MEDLINE | ID: mdl-38310604

ABSTRACT

Noise-induced hearing loss interacts with age, sex, and listening conditions to affect individuals' perception of ecologically relevant stimuli like speech. The present experiments assessed the impact of age and sex on vocalization detection by noise-exposed mice trained to detect a downsweep or complex ultrasonic vocalization in quiet or in the presence of a noise background. Daily thresholds before and following intense noise exposure were collected longitudinally and compared across several factors. All mice, regardless of age, sex, listening condition, or stimulus type showed their poorest behavioral sensitivity immediately after the noise exposure. There were varying degrees of recovery over time and across factors. Old-aged mice had greater threshold shifts and less recovery compared to middle-aged mice. Mice had larger threshold shifts and less recovery for downsweeps than for complex vocalizations. Female mice were more sensitive, had smaller post-noise shifts, and had better recovery than males. Thresholds in noise were higher and less variable than thresholds in quiet, but there were comparable shifts and recovery. In mice, as in humans, the perception of ecologically relevant stimuli suffers after an intense noise exposure, and results differ from simple tone detection findings.


Subject(s)
Hearing Loss, Noise-Induced , Speech Perception , Humans , Middle Aged , Male , Female , Animals , Mice , Vocalization, Animal , Noise/adverse effects , Hearing Loss, Noise-Induced/etiology , Speech Reception Threshold Test , Auditory Threshold
13.
J Acoust Soc Am ; 155(2): 1353-1367, 2024 02 01.
Article in English | MEDLINE | ID: mdl-38364043

ABSTRACT

A personalization framework to adapt compact models to test time environments and improve their speech enhancement (SE) performance in noisy and reverberant conditions is proposed. The use-cases are when the end-user device encounters only one or a few speakers and noise types that tend to reoccur in the specific acoustic environment. Hence, a small personalized model that is sufficient to handle this focused subset of the original universal SE problem is postulated. The study addresses a major data shortage issue: although the goal is to learn from a specific user's speech signals and the test time environment, the target clean speech is unavailable for model training due to privacy-related concerns and technical difficulty of recording noise and reverberation-free voice signals. The proposed zero-shot personalization method uses no clean speech target. Instead, it employs the knowledge distillation framework, where the more advanced denoising results from an overly large teacher work as pseudo targets to train a small student model. Evaluation on various test time conditions suggests that the proposed personalization approach can significantly enhance the compact student model's test time performance. Personalized models outperform larger non-personalized baseline models, demonstrating that personalization achieves model compression with no loss in dereverberation and denoising performance.


Subject(s)
Speech Perception , Speech , Humans , Noise/adverse effects , Speech Reception Threshold Test , Acoustics
14.
Int J Audiol ; 63(4): 269-274, 2024 Apr.
Article in English | MEDLINE | ID: mdl-36847757

ABSTRACT

OBJECTIVE: Appropriate speech-in noise assessment is challenging in multilingual populations. This study aimed to assess whether first preferred language affected performance on an English Digits-in-noise (DIN) test in the local Asian multilingual population, controlling for hearing threshold, age, sex, English fluency and educational status. A secondary aim was to determine the association between DIN test scores and hearing thresholds. DESIGN: English digit-triplets in noise testing and pure-tone audiometry were conducted. Multiple regression analysis was performed with DIN scores and hearing thresholds as dependent variables. Correlation analysis was performed between DIN-SRT and hearing thresholds. STUDY SAMPLE: 165 subjects from the Singapore Longitudinal Ageing Study, a population-based longitudinal study of community-dwellers over 55 years of age. RESULTS: Mean DIN speech reception threshold (DIN-SRT) was -5.7 dB SNR (SD 3.6; range 6.7 to -11.2). Better ear pure tone average and English fluency were significantly associated with DIN-SRT. CONCLUSIONS: DIN performance was independent of first preferred language in a multilingual ageing Singaporean population after adjusting for age, gender and education. Those with poorer English fluency had a significantly lower DIN-SRT score. The DIN test has the potential to provide a quick, uniform method of testing speech in noise in this multilingual population.


Subject(s)
Multilingualism , Speech Perception , Adult , Humans , Longitudinal Studies , Noise/adverse effects , Audiometry, Pure-Tone , Language , Speech Reception Threshold Test
15.
Ear Hear ; 45(2): 451-464, 2024.
Article in English | MEDLINE | ID: mdl-38062570

ABSTRACT

OBJECTIVES: Motivated by the growing need for hearing screening in China, the present study has two objectives. First, to develop and validate a new test, called the Chinese Zodiac-in-noise (ZIN) test, for large-scale hearing screening in China. Second, to conduct a large-scale remote hearing screening in China, using the ZIN test developed. DESIGN: The ZIN test was developed following a similar procedure as the digits-in-noise test but emphasizes the importance of consonant recognition by employing the 12 zodiac animals in traditional Chinese culture as speech materials. It measures the speech reception threshold (SRT) using triplets of Chinese zodiac animals in speech-shaped noise with an adaptive procedure. RESULTS: Normative data of the test were obtained in a group of 140 normal-hearing listeners, and the performance of the test was validated by comparisons with pure-tone audiometry in 116 listeners with various hearing abilities. The ZIN test has a reference SRT of -11.0 ± 1.6 dB in normal-hearing listeners with a test-retest variability of 1.7 dB and can be completed in 3 minutes. The ZIN SRT is highly correlated with the better-ear pure-tone threshold ( r = 0.82). With a cutoff value of -7.7 dB, the ZIN test has a sensitivity of 0.85 and a specificity of 0.94 for detecting a hearing loss of 25 dB HL or more at the better ear.A large-scale remote hearing screening involving 30,552 participants was performed using the ZIN test. The large-scale study found a hearing loss proportion of 21.0% across the study sample, with a high proportion of 57.1% in the elderly study sample aged over 60 years. Age and gender were also observed to have associations with hearing loss, with older individuals and males being more likely to have hearing loss. CONCLUSIONS: The Chinese ZIN test is a valid and efficient solution for large-scale hearing screening in China. Its remote applications may improve access to hearing screening and enhance public awareness of hearing health.


Subject(s)
Deafness , Hearing Loss , Speech Perception , Aged , Male , Humans , Middle Aged , Speech , Noise , Hearing Loss/diagnosis , Audiometry, Pure-Tone/methods , Auditory Threshold , Hearing , Speech Reception Threshold Test/methods
16.
Ear Hear ; 45(3): 572-582, 2024.
Article in English | MEDLINE | ID: mdl-37990396

ABSTRACT

OBJECTIVES: The study aimed to develop and validate the Mandarin digit-in-noise (DIN) test using four digit (i.e., two-, three-, four-, and five-digit) sequences. Test-retest reliability and criterion validity were evaluated. How the number of digits affected the results was examined. The research might lead to more informed choice of DIN tests for populations with specific cognitive needs such as memory impairment. DESIGN: The International Collegium of Rehabilitative Audiology guideline for developing the DIN was adapted to create test materials. The test-retest reliability and psychometric function of each digit sequence were determined among young normal-hearing adults. The criterion validity of each digit sequence was determined by comparing the measured performance of older adult hearing aid users with that obtained from two other well-established sentence-in-noise tests: the Mandarin hearing-in-noise test and the Mandarin Chinese matrix test. The relation between the speech reception thresholds (SRTs) of each digit sequence of the DIN test and working memory capacity measured using the digit span test and the reading span test were explored among older adult hearing aid users. Together, the study sample consisted of 54 young normal-hearing adults and 56 older adult hearing aid users. RESULTS: The slopes associated with the two-, three-, four-, and five-digit DIN test were 16.58, 18.79, 20.42, and 21.09 %/dB, respectively, and the mean SRTs were -11.11, -10.99, -10.56, and -10.02 dB SNR, respectively. Test-retest SRTs did not differ by more than 0.74 dB across all digit sequences, suggesting good test-retest reliability. Spearman rank-order correlation coefficients between SRTs obtained using the DIN across the four digit (i.e., two-, three-, four-, and five-digit) sequences and the two sentence-in-noise tests were uniformly high ( rs = 0.9) across all participants, when data from all participants were considered. Results from the digit span test and reading span test correlated significantly with the results of the five-digit sequences ( rs = -0.37 and -0.42, respectively) but not with the results of the two-, three-, and four-digit sequences among older hearing aid users. CONCLUSIONS: While the three-digit sequence was found to be appropriate for clinical use for assessment of auditory perception, the two-digit sequence could be used for hearing screening. The five-digit sequence could be difficult for older hearing aid users, and with its SRT related to working memory capacity, its use in the evaluation of speech perception should be investigated further. The Mandarin DIN test was found to be reliable, and the findings are in line with SRTs obtained using standardized sentence tests, suggesting good criterion validity.


Subject(s)
Hearing Aids , Speech Perception , Humans , Aged , Reproducibility of Results , Hearing Tests/methods , Noise , Language , Speech Reception Threshold Test
17.
Eur Arch Otorhinolaryngol ; 281(5): 2341-2351, 2024 May.
Article in English | MEDLINE | ID: mdl-38110748

ABSTRACT

PURPOSE: The Oldenburg Sentence Test (OLSA) is a German matrix test designed to determine speech recognition thresholds (SRT). It is widely used for hearing-aids and cochlear implant fitting, but an age-adjusted standard is still lacking. In addition, knowing that the ability to concentrate is an important factor in OLSA performance, we hypothesized that OLSA performance would depend on the time of day it was administered. The aim of this study was to propose an age standardization for the OLSA and to determine its diurnal performance. METHODS: The Gutenberg Health Study is an ongoing population-based study and designed as a single-centre observational, prospective cohort study. Participants were interviewed about common otologic symptoms and tested with pure-tone audiometry and OLSA. Two groups-subjects with and without hearing loss-were established. The OLSA was performed in two runs. The SRT was evaluated for each participant. Results were characterized by age in 5-year cohorts, gender and speech recognition threshold (SRT). A time stamp with an hourly interval was also implemented. RESULTS: The mean OLSA SRT was - 6.9 ± 1.0 dB (group 1 male) and - 7.1 ± 0.8 dB (group 1 female) showing an inverse relationship with age in the whole cohort, whereas a linear increase was observed in those without hearing loss. OLSA-SRT values increased more in males than in females with increasing age. No statistical significance was found for the diurnal performance. CONCLUSIONS: A study with 2900 evaluable Oldenburg Sentence Tests is a novelty and representative for the population of Mainz and its surroundings. We postulate an age- and gender-standardized scale for the evaluation of the OLSA. In fact, with an intergroup standard deviation (of about 1.5 dB) compared to the age dependence of 0.7 dB/10 years, this age normalization should be considered as clinically relevant.


Subject(s)
Cochlear Implants , Deafness , Hearing Aids , Hearing Loss , Speech Perception , Female , Humans , Male , Hearing Loss/diagnosis , Prospective Studies , Speech Intelligibility , Speech Reception Threshold Test/methods
18.
Zhonghua Er Bi Yan Hou Tou Jing Wai Ke Za Zhi ; 58(10): 980-985, 2023 Oct 07.
Article in Chinese | MEDLINE | ID: mdl-37840163

ABSTRACT

Objective: To explore the safety and reliability of retrosigmoid approach BONEBRIDGE implantation in patients with auricle reconstruction using skin expansion flap. Methods: A retrospective analysis was conducted on 43 congenital aural atresia cases (43 ears) who underwent BONEBRIDGE implantation from September 2019 to January 2023 in Beijing Tongren Hospital. 30 males and 13 females were included in this work. The implantation age was 9-36 years old (median age=10 y/o). All cases underwent auricle reconstruction surgery using the posterior ear flap expansion method, with 36 cases using the single expanded postauricular flap method and 7 cases using two-flap method. BONEBRIDGE implant surgery was performed during the third stage of auricle reconstruction or after all stages. The hearing improvements were evaluated by comparing the changes in pure tone hearing threshold and speech recognition rate of patients before and after BONEBRIDGE implantation. Routine follow-up was conducted to observe the hearing results and complications. SPSS 14.0 software was applied for data statistical analysis. Results: All 43 patients healed well and had no surgical complications when discharge. The average bone conduction hearing threshold after surgery was (8.2±6.6) dBHL, and there was no statistically significant difference compared to the preoperative [(8.1±5.7) dBHL] (P=0.95). After surgery, the threshold of hearing assistance with power on was significantly lower than that without hearing assistance [(32.8±4.6) dBHL vs (60.5±5.5) dBHL], and the difference was statistically significant (P<0.001). The speech recognition rate of monosyllable words, disyllabic words and short sentences in quiet environment increased to 72%, 84%, and 98% respectively. The differences were statistically significant (P<0.001). The speech recognition rate of monosyllabic words, disyllabic words, and short sentences in noise environment was significantly increased by 70%, 80%, and 92% respectively (P<0.001). After a follow-up of 4 to 47 months (median=24 months), the hearing results were stable and the aesthetic outcomes were satisfying. One patient had delayed hematoma around coil of the implant. After aspiration and compressed dressing for one week, hematoma was not recurrent. Conclusion: For patients after auricle reconstruction using expanded postauricular flap, the preference of retrosigmoid approach is a good choice in terms of safety and reliability of operation, as well as aesthetic appearance.


Subject(s)
Congenital Abnormalities , Ear Auricle , Hearing Aids , Otologic Surgical Procedures , Plastic Surgery Procedures , Surgical Flaps , Adolescent , Adult , Child , Female , Humans , Male , Young Adult , Audiometry, Pure-Tone , Bone Conduction , Congenital Abnormalities/surgery , Ear Auricle/abnormalities , Ear Auricle/surgery , Hearing Loss, Conductive/diagnosis , Hearing Loss, Conductive/etiology , Hearing Loss, Conductive/surgery , Otologic Surgical Procedures/methods , Plastic Surgery Procedures/methods , Prosthesis Implantation/methods , Reproducibility of Results , Retrospective Studies , Skin , Speech Reception Threshold Test , Tissue Expansion , Treatment Outcome
19.
Hear Res ; 434: 108785, 2023 07.
Article in English | MEDLINE | ID: mdl-37172414

ABSTRACT

Behavioral tests are currently the gold standard in measuring speech intelligibility. However, these tests can be difficult to administer in young children due to factors such as motivation, linguistic knowledge and cognitive skills. It has been shown that measures of neural envelope tracking can be used to predict speech intelligibility and overcome these issues. However, its potential as an objective measure for speech intelligibility in noise remains to be investigated in preschool children. Here, we evaluated neural envelope tracking as a function of signal-to-noise ratio (SNR) in 14 5-year-old children. We examined EEG responses to natural, continuous speech presented at different SNRs ranging from -8 (very difficult) to 8 dB SNR (very easy). As expected delta band (0.5-4 Hz) tracking increased with increasing stimulus SNR. However, this increase was not strictly monotonic as neural tracking reached a plateau between 0 and 4 dB SNR, similarly to the behavioral speech intelligibility outcomes. These findings indicate that neural tracking in the delta band remains stable, as long as the acoustical degradation of the speech signal does not reflect significant changes in speech intelligibility. Theta band tracking (4-8 Hz), on the other hand, was found to be drastically reduced and more easily affected by noise in children, making it less reliable as a measure of speech intelligibility. By contrast, neural envelope tracking in the delta band was directly associated with behavioral measures of speech intelligibility. This suggests that neural envelope tracking in the delta band is a valuable tool for evaluating speech-in-noise intelligibility in preschoolers, highlighting its potential as an objective measure of speech in difficult-to-test populations.


Subject(s)
Speech Intelligibility , Speech Perception , Child, Preschool , Humans , Speech Perception/physiology , Noise/adverse effects , Signal-To-Noise Ratio , Speech Reception Threshold Test
20.
Noise Health ; 25(117): 104-112, 2023.
Article in English | MEDLINE | ID: mdl-37203127

ABSTRACT

Objective: The goal is to implement the developed speech material in a hearing test to assess auditory fitness for duty (AFFD), specifically in areas where the intelligibility of spoken commands is essential. Design: In study 1, a speech corpus with equal intelligibility was constructed using constant stimuli to test each target word's psychometric functions. Study 2 used an adaptive interleaving procedure to maximize equalized terms. Study 3 used Monte Carlo simulations to determine speech test accuracy. Study sample: Study 1 (n = 24) and study 2 (n = 20) were completed by civilians with normal hearing. Study 3 ran 10,000 simulations per condition across various conditions varying in slopes and speech recognition thresholds (SRTs). Results: Studies 1 and 2 produced three 8-word wordlists. The mean, standard deviation in dB SNR is -13.1 1.2 for wordlist 1, -13.7 1.6 for wordlist 2, and -13.7 1.3 for wordlist 3, with word SRTs within 3.4 dB SNR. Study 3 revealed that a 6 dB SNR range is appropriate for equally understandable speech using a closed-set adaptive technique. Conclusion: The developed speech corpus may be used in an AFFD measure. Concerning the homogeneity of the speech in noise test material, care should be taken when generalizing and using ranges and standard deviations from multiple tests.


Subject(s)
Speech Intelligibility , Speech Perception , Auditory Threshold , Hearing Tests , Reproducibility of Results , Signal-To-Noise Ratio , Speech Reception Threshold Test/methods , Humans
SELECTION OF CITATIONS
SEARCH DETAIL
...