Results 1 - 20 of 8,346
1.
Sci Rep ; 14(1): 12787, 2024 06 04.
Article in English | MEDLINE | ID: mdl-38834775

ABSTRACT

Cochlear implant (CI) users experience difficulties controlling their vocalizations compared to normal hearing peers. However, less is known about their voice quality. The primary aim of the present study was to determine if CI users' voice quality would be categorized as dysphonic by the Acoustic Voice Quality Index (AVQI) and smoothed cepstral peak prominence (CPPS). A secondary aim was to determine if vocal quality is further impacted when using bilateral implants compared to using only one implant. The final aim was to determine how residual hearing impacts voice quality. Twenty-seven CI users participated in the present study and were recorded while sustaining a vowel and while reading a standardized passage. These recordings were analyzed to calculate the AVQI and CPPS. The results indicate that CI users' voice quality was detrimentally affected by using their CI, rising to the level of a dysphonic voice. Specifically, when using their CI, mean AVQI scores were 4.0 and mean CPPS values were 11.4 dB, which indicates dysphonia. There were no significant differences in voice quality when comparing participants with bilateral implants to those with one implant. Finally, for participants with residual hearing, as hearing thresholds worsened, the likelihood of a dysphonic voice decreased.


Subject(s)
Cochlear Implants , Voice Quality , Humans , Male , Female , Middle Aged , Aged , Adult , Dysphonia/physiopathology , Speech Acoustics , Cochlear Implantation
2.
Codas ; 36(3): e20230023, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38836821

ABSTRACT

PURPOSE: To cross-culturally adapt the Iranian Voice Quality of Life Profile (IVQLP) into Brazilian Portuguese (BP). METHODS: The cross-cultural adaptation process was performed in five stages: translation of the IVQLP into BP by three native BP experts fluent in American English; preparation of a consensus version; back-translation by a native American English expert fluent in BP; analysis by a committee of five experts and preparation of the final version of the instrument in BP, which was named IVQLP-Br; and pre-testing. The IVQLP-Br aims to assess the impacts of the voice more comprehensively, encompassing various areas of an individual's life. It has 43 items and a five-level response key. For the pre-test, the alternative "not applicable" was added as a response option. Thirty-six adults with self-reported risk of dysphonia participated in the pre-test. RESULTS: In the translation stage, ten items were modified, and during the back-translation, 15 items required adjustments. No questions required reformulation after the application of the IVQLP-Br in the target population, because the option "not applicable" appeared in only 12 responses, without statistical significance. CONCLUSION: The version of the IVQLP translated into BP, named the IVQLP-Br, exhibited cross-cultural equivalence and can be administered for a more detailed analysis of the impact of the voice in different domains of an individual's life. After validation, the IVQLP-Br will be able to contribute both to clinical practice and to research with BP speakers.


OBJECTIVE: To translate and cross-culturally adapt the Iranian Voice Quality of Life Profile (IVQLP) into Brazilian Portuguese (BP). METHOD: The cross-cultural adaptation process was carried out in five stages: translation of the IVQLP into BP by three specialists who were native speakers of BP and fluent in American English; preparation of a consensus version; back-translation by a specialist who was a native speaker of American English and fluent in BP; analysis by a committee of five specialists and preparation of the final version of the instrument in BP, named IVQLP-Br; and pre-testing. The IVQLP-Br aims to assess the impact of the voice more comprehensively, covering several domains of individuals' lives; it has 43 items and a five-point response key. For the pre-test, the alternative "not applicable" was added as a response option. Thirty-six adults with self-reported risk of dysphonia took part in the pre-test. RESULTS: In the translation stage, 10 items were modified, and in the back-translation, 15 items required adjustments. No item needed to be reformulated after application in the target population, since the option "not applicable" appeared in twelve responses, but without statistical significance. CONCLUSION: The version of the IVQLP translated into BP, named IVQLP-Br, showed cross-cultural equivalence and can be used for a more detailed analysis of the impact of the voice on the different domains of individuals' lives. After validation, the IVQLP-Br may contribute both to clinical practice and to research with BP speakers.


Subject(s)
Cross-Cultural Comparison , Quality of Life , Translations , Voice Quality , Humans , Brazil , Female , Adult , Male , Surveys and Questionnaires , Middle Aged , Iran , Dysphonia/physiopathology , Dysphonia/diagnosis , Reproducibility of Results , Young Adult , Language
3.
J Speech Lang Hear Res ; 67(6): 1660-1681, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38758676

ABSTRACT

PURPOSE: Literature suggests a dependency of the acoustic metrics, smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR), on human voice loudness and fundamental frequency (F0). Even though this has been explained with different oscillatory patterns of the vocal folds, so far, it has not been specifically investigated. In the present work, the influence of three elicitation levels, calibrated sound pressure level (SPL), F0 and vowel on the electroglottographic (EGG) and time-differentiated EGG (dEGG) metrics hybrid open quotient (OQ), dEGG OQ and peak dEGG, as well as on the acoustic metrics CPPS and HNR, was examined, and their suitability for voice assessment was evaluated. METHOD: In a retrospective study, 29 women with a mean age of 25 years (± 8.9, range: 18-53) diagnosed with structural vocal fold pathologies were examined before and after voice therapy or phonosurgery. Both acoustic and EGG signals were recorded simultaneously during the phonation of the sustained vowels /ɑ/, /i/, and /u/ at three elicited levels of loudness (soft/comfortable/loud) and unconstrained F0 conditions. RESULTS: A linear mixed-model analysis showed a significant effect of elicitation effort levels on peak dEGG, HNR, and CPPS (all p < .01). Calibrated SPL significantly influenced HNR and CPPS (both p < .01). Furthermore, F0 had a significant effect on peak dEGG and CPPS (p < .0001). All metrics showed significant changes with regard to vowel (all p < .05). However, the treatment had no effect on the examined metrics, regardless of the treatment type (surgery vs. voice therapy). CONCLUSIONS: The value of the investigated metrics for voice assessment purposes when sampled without sufficient control of SPL and F0 is limited, in that they are significantly influenced by the phonatory context, be it speech or elicited sustained vowels. 
Future studies should explore the diagnostic value of new data collation approaches such as voice mapping, which take SPL and F0 effects into account.


Subject(s)
Dysphonia , Speech Acoustics , Humans , Female , Adult , Dysphonia/physiopathology , Dysphonia/therapy , Retrospective Studies , Young Adult , Middle Aged , Adolescent , Voice Quality/physiology , Electrodiagnosis/methods , Glottis/physiopathology , Phonation/physiology , Vocal Cords/physiopathology , Voice Training , Speech Production Measurement/methods
4.
Codas ; 36(4): e20230148, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38775526

ABSTRACT

PURPOSE: To evaluate the immediate effect of the inspiratory exercise with a booster and a respiratory exerciser on the voice of women without vocal complaints. METHODS: Twenty-five women with no vocal complaints, between 18 and 34 years old, with a score of 1 on the Vocal Disorder Screening Index (ITDV), participated. Data collection was performed before and after the inspiratory exercise and consisted of recording the sustained vowel /a/, connected speech, and maximum phonatory times (MPT) of vowels, fricative phonemes, and number counting. In the auditory-perceptual judgment (APJ), the Vocal Deviation Scale (VDS) was used to verify the general degree of vocal deviation. Acoustic evaluation was performed using the PRAAT software, and the following parameters were extracted: fundamental frequency (f0), jitter, shimmer, harmonics-to-noise ratio (HNR), Cepstral Peak Prominence Smoothed (CPPS), Acoustic Voice Quality Index (AVQI), and Acoustic Breathiness Index (ABI). For the aerodynamic measures, the time of each emission was extracted in the Audacity program. Data were statistically analyzed using the Statistica for Windows software, and normality was tested using the Shapiro-Wilk test. To compare the results, Student's t test and the Wilcoxon test were applied, adopting a significance level of 5%. RESULTS: There were no significant differences in the results of the APJ or the acoustic measures between the pre- and post-inspiratory-exercise moments. As for the aerodynamic measures, a significant increase was observed in the MPT of /s/ (p=0.008). CONCLUSION: There was no change in vocal quality after the inspiratory exercise with a booster and a respiratory exerciser, but an increase in the MPT of the phoneme /s/ was observed after the exercise.


OBJECTIVE: To evaluate the immediate effect of inspiratory exercise with a booster and a respiratory exerciser on the voice of women without vocal complaints. METHOD: Twenty-five women without vocal complaints, aged between 18 and 34 years, with a score of 1 on the Vocal Disorder Screening Index (ITDV), took part. Data were collected before and after the inspiratory exercise and consisted of recordings of the sustained vowel /a/, connected speech, and maximum phonatory times (MPT) of vowels, fricative phonemes, and number counting. In the auditory-perceptual judgment (APJ), the Vocal Deviation Scale (VDS) was used to determine the overall degree of vocal deviation. Acoustic evaluation was carried out in the PRAAT software, and the following parameters were extracted: fundamental frequency (f0), jitter, shimmer, harmonics-to-noise ratio (HNR), Cepstral Peak Prominence Smoothed (CPPS), Acoustic Voice Quality Index (AVQI), and Acoustic Breathiness Index (ABI). For the aerodynamic measures, emission time was extracted in the Audacity program. To compare the results, the parametric Student's t test for dependent samples was used for variables with a normal distribution and the Wilcoxon test for variables with a non-normal distribution. RESULTS: There were no differences in the APJ results or the acoustic measures between the pre- and post-inspiratory-exercise moments. As for the aerodynamic measures, a significant increase was observed in the MPT of /s/ (p=0.008). CONCLUSION: There was no change in vocal quality after the inspiratory exercise with a booster and a respiratory exerciser, but an increase in the MPT of the phoneme /s/ was observed after the exercise.


Subject(s)
Breathing Exercises , Voice Quality , Humans , Female , Adult , Young Adult , Adolescent , Breathing Exercises/methods , Speech Acoustics , Voice Disorders/physiopathology , Voice Disorders/diagnosis , Phonation/physiology
5.
J Speech Lang Hear Res ; 67(6): 1712-1730, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38749007

ABSTRACT

PURPOSE: The goal of this study was to assess various recording methods, including combinations of high- versus low-cost microphones, recording interfaces, and smartphones in terms of their ability to produce commonly used time- and spectral-based voice measurements. METHOD: Twenty-four vowel samples representing a diversity of voice quality deviations and severities from a wide age range of male and female speakers were played via a head-and-thorax model and recorded using a high-cost, research standard GRAS 40AF (GRAS Sound & Vibration) microphone and amplification system. Additional recordings were made using various combinations of headset microphones (AKG C555 L [AKG Acoustics GmbH], Shure SM35-XLR [Shure Incorporated], AVID AE-36 [AVID Products, Inc.]) and audio interfaces (Focusrite Scarlett 2i2 [Focusrite Audio Engineering Ltd.] and PC, Focusrite and smartphone, smartphone via a TRRS adapter), as well as smartphones direct (Apple iPhone 13 Pro, Google Pixel 6) using their built-in microphones. The effect of background noise from four different room conditions was also evaluated. Vowel samples were analyzed for measures of fundamental frequency, perturbation, cepstral peak prominence, and spectral tilt (low vs. high spectral ratio). RESULTS: Results show that a wide variety of recording methods, including smartphones with and without a low-cost headset microphone, can effectively track the wide range of acoustic characteristics in a diverse set of typical and disordered voice samples. Although significant differences in acoustic measures of voice may be observed, the presence of extremely strong correlations (rs > .90) with the recording standard implies a strong linear relationship between the results of different methods that may be used to predict and adjust any observed differences in measurement results. 
CONCLUSION: Because handheld smartphone distance and positioning may be highly variable when used in actual clinical recording situations, smartphone + a low-cost headset microphone is recommended as an affordable recording method that controls mouth-to-microphone distance and positioning and allows both hands to be available for manipulation of the smartphone device.
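The abstract above notes that a strong linear relationship with the recording standard may be used to predict and adjust differences between recording methods. A minimal sketch of that idea follows, assuming an ordinary least-squares line is an acceptable adjustment model; the function names are hypothetical, and the abstract does not say which fitting procedure the authors used.

```python
import numpy as np

def fit_linear_adjustment(device_values, reference_values):
    """Fit reference ~ slope * device + intercept by least squares.

    Both inputs are paired measurements of the same voice samples
    (e.g. one acoustic measure from a low-cost setup and from the
    reference microphone). Names here are illustrative assumptions.
    """
    slope, intercept = np.polyfit(device_values, reference_values, deg=1)
    return float(slope), float(intercept)

def adjust(device_values, slope, intercept):
    """Map new device measurements onto the reference scale."""
    return slope * np.asarray(device_values, dtype=float) + intercept
```

The fitted line is only as trustworthy as the correlation behind it, so such an adjustment would need to be estimated separately for each measure and recording setup.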


Subject(s)
Smartphone , Speech Acoustics , Humans , Female , Male , Adult , Young Adult , Speech Production Measurement/instrumentation , Speech Production Measurement/methods , Reproducibility of Results , Voice Quality , Middle Aged , Adolescent
6.
Codas ; 36(4): e20230047, 2024.
Article in Portuguese, English | MEDLINE | ID: mdl-38808777

ABSTRACT

PURPOSE: To compare the acoustic measurements of Cepstral Peak Prominence Smoothed (CPPS) and Acoustic Voice Quality Index (AVQI) of children with normal and altered voices, to relate them to auditory-perceptual judgment (APJ), and to establish cut-off points. METHODS: Vocal recordings of the sustained vowel and number counting tasks of 185 children were selected from a database and submitted to acoustic analysis with extraction of CPPS and AVQI measurements, and to APJ. The APJ was performed individually for each task, classified as normal or altered, and for the tasks together, defining whether the child would pass or fail in a vocal screening situation. RESULTS: Children with altered APJ who failed the screening had lower CPPS values and higher AVQI values than those with normal APJ who passed the screening. The APJ of the sustained vowel task was related to CPPS and AVQI, and the APJ of the number counting task was related only to AVQI and CPPS numbers. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the vowel CPPS, 7.62 for the CPPS numbers, and 2.01 for the AVQI. CONCLUSION: Children with altered voices have higher AVQI values and lower CPPS values than children with voices within the normal range. The acoustic measurements were related to the auditory-perceptual judgment of vocal quality in the sustained vowel task; however, the number counting task was related only to the AVQI and CPPS numbers. The three measures were similar in identifying voices without deviation and dysphonic voices.


OBJECTIVE: To compare the acoustic measures Cepstral Peak Prominence Smoothed (CPPS) and Acoustic Voice Quality Index (AVQI) of children with normal and altered voices, to relate them to the auditory-perceptual judgment (APJ) of the voice, and to establish cut-off points. METHOD: Vocal recordings of the sustained vowel and number counting tasks of 185 children were selected from a database and submitted to acoustic analysis with extraction of the CPPS and AVQI measures, and to APJ. The APJ was performed individually for each task, with the samples subsequently classified as normal or altered, and for the tasks together, defining whether the child would pass or fail in a vocal screening situation. RESULTS: Children with altered APJ who failed the screening had lower CPPS values and higher AVQI values than those with normal APJ who passed the screening. The APJ of the sustained vowel task was related to CPPS and AVQI, and that of the number counting task was related only to AVQI and CPPS numbers. The cut-off points that differentiate children with and without vocal deviation are 14.07 for the vowel CPPS, 7.62 for the CPPS numbers, and 2.01 for the AVQI. CONCLUSION: Children with altered APJ had higher AVQI values and lower CPPS values. The APJ of the vowel task predicted all the acoustic measures, whereas that of the counting task predicted only the measures extracted from it. The three measures were similar in identifying voices without deviation and dysphonic voices.
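The cut-off points reported in this abstract can be read as simple threshold rules. The sketch below is one illustrative reading: it assumes that values below the CPPS cut-offs and above the AVQI cut-off count as deviant (consistent with altered voices showing lower CPPS and higher AVQI), and that values exactly at a cut-off count as normal. Neither the boundary handling nor the function name comes from the study.

```python
# Cut-off points quoted in the abstract for children's voices.
CPPS_VOWEL_CUTOFF = 14.07
CPPS_NUMBERS_CUTOFF = 7.62
AVQI_CUTOFF = 2.01

def flag_vocal_deviation(cpps_vowel, cpps_numbers, avqi):
    """Return a per-measure flag: True means the value falls on the
    'vocal deviation' side of its cut-off (assumed strict inequality)."""
    return {
        "cpps_vowel": cpps_vowel < CPPS_VOWEL_CUTOFF,        # lower CPPS = deviant
        "cpps_numbers": cpps_numbers < CPPS_NUMBERS_CUTOFF,  # lower CPPS = deviant
        "avqi": avqi > AVQI_CUTOFF,                          # higher AVQI = deviant
    }
```

The flags are kept separate rather than merged into one pass/fail result, because the abstract does not state how the three measures should be combined for screening.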


Subject(s)
Speech Acoustics , Voice Quality , Humans , Voice Quality/physiology , Child , Female , Male , Auditory Perception/physiology , Voice Disorders/diagnosis , Voice Disorders/physiopathology , Adolescent , Case-Control Studies , Speech Production Measurement , Judgment
7.
J Acoust Soc Am ; 155(5): 3521-3536, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38809098

ABSTRACT

This electromagnetic articulography study explores the kinematic profile of Intonational Phrase boundaries in Seoul Korean. Recent findings suggest that the scope of phrase-final lengthening is conditioned by word- and/or phrase-level prominence. However, evidence comes mainly from head-prominence languages, which conflate positions of word prosody with positions of phrasal prominence. Here, we examine phrase-final lengthening in Seoul Korean, an edge-prominence language with no word prosody, with respect to focus location as an index of phrase-level prominence and Accentual Phrase (AP) length as an index of word demarcation. Results show that phrase-final lengthening extends over the phrase-final syllable. The effect is greater the further away that focus occurs. It also interacts with the domains of AP and prosodic word: lengthening is greater in smaller APs, whereas shortening is observed in the initial gesture of the phrase-final word. Additional analyses of kinematic displacement and peak velocity revealed that Korean phrase-final gestures bear the kinematic profile of IP boundaries concurrently to what is typically considered prominence marking. Based on these results, a gestural coordination account is proposed, in which boundary-related events interact systematically with phrase-level prominence as well as lower prosodic levels, and how this proposal relates to the findings in head-prominence languages is discussed.


Subject(s)
Phonetics , Speech Acoustics , Humans , Male , Female , Young Adult , Biomechanical Phenomena , Adult , Language , Gestures , Speech Production Measurement , Republic of Korea , Voice Quality , Time Factors
8.
Rev Assoc Med Bras (1992) ; 70(4): e20231146, 2024.
Article in English | MEDLINE | ID: mdl-38716939

ABSTRACT

OBJECTIVE: Therapy and vocal rehabilitation in laryngeal cancer impact patients' quality of life. The objective of this study was to evaluate the evolution of the quality of life of patients with laryngeal cancer who underwent total laryngectomy and used an electrolarynx. METHODS: This is an observational study with a cross-sectional design and a quantitative approach. It was conducted between April 2022 and January 2023 in a Brazilian cancer hospital. For data collection, the University of Washington quality of life questionnaire, validated for patients with head and neck cancer, was applied in two phases: from 7 days after total laryngectomy and, subsequently, from 70 days after surgery, after use of the electronic larynx for at least 60 days. The inclusion criterion was patients undergoing total laryngectomy who were included on the Aldenora Bello Cancer Hospital's election list to receive the electronic larynx. Patients who did not sign the informed consent form were not included. RESULTS: The sample consisted of 31 patients, of whom approximately 84% were men and approximately 93% were aged 50 years or older. When comparing the phases, the item speech showed the greatest progress, while chewing showed the least. Only the items recreation, swallowing, taste, and saliva did not show statistical significance. The score for the general quality of life questions increased. CONCLUSION: The electronic larynx is a viable and useful method of voice rehabilitation. Our data suggest that the use of the electrolarynx as a postlaryngectomy method of verbal communication has positive effects on patients' quality of life.


Subject(s)
Laryngeal Neoplasms , Laryngectomy , Larynx, Artificial , Quality of Life , Humans , Laryngectomy/rehabilitation , Laryngectomy/psychology , Male , Middle Aged , Cross-Sectional Studies , Female , Laryngeal Neoplasms/surgery , Laryngeal Neoplasms/psychology , Aged , Surveys and Questionnaires , Voice Quality , Adult , Treatment Outcome
9.
Front Public Health ; 12: 1256152, 2024.
Article in English | MEDLINE | ID: mdl-38813421

ABSTRACT

Background: The domination of the Contemporary Commercial Music (CCM) industry in music markets has led to a significant increase in the number of CCM performers. Performing in a wide variety of singing styles involves exposing CCM singers to specific risk factors potentially leading to voice problems. This, in turn, necessitates the consideration of this particular group of voice users in the Occupational Health framework. The aim of the present research was threefold. First, it sought to profile the group of Polish CCM singers. Second, it was designed to explore the prevalence of self-reported voice problems and voice quality in this population, in both speech and singing. Third, it aimed to explore the relationships between voice problems and lifetime singing involvement, occupational voice use, smoking, alcohol consumption, vocal training, and microphone use, as potential voice risk factors. Materials and methods: The study was conducted in Poland from January 2020 to April 2023. An online survey included socio-demographic information, singing involvement characteristics, and singers' voice self-assessment. The prevalence of voice problems was assessed by the Polish versions of the Vocal Tract Discomfort Scale (VTDS) and the Singing Voice Handicap Index (SVHI). Also, a self-reported dysphonia symptoms protocol was applied. The perceived overall voice quality was assessed by a Visual Analogue Scale (VAS) of 100 mm. Results: 412 singers, 310 women and 102 men, completed the survey. Nearly half of the studied population declared lifetime singing experience over 10 years with an average daily singing time of 1 or 2 h. 283 participants received vocal training. For 11.4% of respondents, singing was the primary income source, and 42% defined their career goals as voice-related. The median scores of the VTDS were 11.00 (0-44) and 12.00 (0-40) for the Frequency and Severity subscales, respectively. 
The median SVHI score of 33 (0-139) was significantly higher than the normative values determined in a systematic review and meta-analysis (2018). Strong positive correlations were observed between SVHI and both VTDS subscales: Frequency (r = 0.632, p < 0.001) and Severity (r = 0.611, p < 0.001). The relationships between most of the other variables studied were weak or negligible. Conclusion: The examined CCM singers exhibited substantial diversity with regard to musical genre preferences, aspirations pertaining to singing endeavors, career affiliations, and source of income. Singing voice assessment revealed a greater degree of voice problems in the examined cohort than so far reported in the literature, based on the SVHI and VTDS.


Subject(s)
Music , Singing , Voice Disorders , Voice Quality , Humans , Poland , Male , Female , Adult , Cross-Sectional Studies , Middle Aged , Voice Disorders/epidemiology , Self-Assessment , Surveys and Questionnaires , Prevalence , Risk Factors , Young Adult , Speech
10.
Sci Rep ; 14(1): 12407, 2024 05 30.
Article in English | MEDLINE | ID: mdl-38811832

ABSTRACT

Many lecturers develop voice problems, such as hoarseness. Nevertheless, research on how voice quality influences listeners' perception, comprehension, and retention of spoken language is limited to a small number of audio-only experiments. We aimed to address this gap by using audio-visual virtual reality (VR) to investigate the impact of a lecturer's hoarseness on university students' heard text recall, listening effort, and listening impression. Fifty participants were immersed in a virtual seminar room, where they engaged in a Dual-Task Paradigm. They listened to narratives presented by a virtual female professor, who spoke in either a typical or hoarse voice. Simultaneously, participants performed a secondary task. Results revealed significantly prolonged secondary-task response times with the hoarse voice compared to the typical voice, indicating increased listening effort. Subjectively, participants rated the hoarse voice as more annoying, effortful to listen to, and impeding for their cognitive performance. No effect of voice quality was found on heard text recall, suggesting that, while hoarseness may compromise certain aspects of spoken language processing, this might not necessarily result in reduced information retention. In summary, our findings underscore the importance of promoting vocal health among lecturers, which may contribute to enhanced listening conditions in learning spaces.


Subject(s)
Speech Perception , Virtual Reality , Voice Quality , Humans , Female , Male , Adult , Young Adult , Speech Perception/physiology , Memory/physiology , Auditory Perception/physiology , Hoarseness/etiology , Voice/physiology
11.
J Acoust Soc Am ; 155(5): 2990-3004, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38717206

ABSTRACT

Speakers can place prosodic prominence at any location within a sentence, generating focus prosody for listeners to perceive new information. This study aimed to investigate age-related changes in the bottom-up processing of focus perception in Jianghuai Mandarin by clarifying the perceptual cues and the auditory processing abilities involved in the identification of focus locations. Young, middle-aged, and older speakers of Jianghuai Mandarin completed a focus identification task and an auditory perception task. The results showed that increasing age led to a decrease in listeners' accuracy rate in identifying focus locations, with all participants performing the worst when dynamic pitch cues were inaccessible. Auditory processing abilities did not predict focus perception performance in young and middle-aged listeners but accounted significantly for the variance in older adults' performance. These findings suggest that age-related deterioration in focus perception can be largely attributed to declining auditory processing of perceptual cues. Poor ability to extract frequency modulation cues may be the most important underlying psychoacoustic factor for older adults' difficulties in perceiving focus prosody in Jianghuai Mandarin. The results contribute to our understanding of the bottom-up mechanisms involved in linguistic prosody processing in aging adults, particularly in tonal languages.


Subject(s)
Aging , Cues , Speech Perception , Humans , Middle Aged , Aged , Male , Female , Aging/psychology , Aging/physiology , Young Adult , Adult , Speech Perception/physiology , Age Factors , Speech Acoustics , Acoustic Stimulation , Pitch Perception , Language , Voice Quality , Psychoacoustics , Audiometry, Speech
12.
J Acoust Soc Am ; 155(5): 3090-3100, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38717212

ABSTRACT

The perceived level of femininity and masculinity is a prominent property by which a speaker's voice is indexed, and a vocal expression incongruent with the speaker's gender identity can greatly contribute to gender dysphoria. Our understanding of the acoustic cues to the levels of masculinity and femininity perceived by listeners in voices is not well developed, and an increased understanding of them would benefit communication of therapy goals and evaluation in gender-affirming voice training. We developed a voice bank with 132 voices with a range of levels of femininity and masculinity expressed in the voice, as rated by 121 listeners in independent, individually randomized perceptual evaluations. Acoustic models were developed from measures identified as markers of femininity or masculinity in the literature using penalized regression and tenfold cross-validation procedures. The 223 most important acoustic cues explained 89% and 87% of the variance in the perceived level of femininity and masculinity in the evaluation set, respectively. The median fo was confirmed to provide the primary cue, but other acoustic properties must be considered in accurate models of femininity and masculinity perception. The developed models are proposed to afford communication and evaluation of gender-affirming voice training goals and improve voice synthesis efforts.


Subject(s)
Cues , Speech Acoustics , Speech Perception , Voice Quality , Humans , Female , Male , Adult , Young Adult , Masculinity , Middle Aged , Femininity , Adolescent , Gender Identity , Acoustics
13.
J Acoust Soc Am ; 155(5): 3071-3089, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38717213

ABSTRACT

This study investigated how 40 Chinese learners of English as a foreign language (EFL learners) differed from 40 native English speakers in the production of four English tense-lax contrasts, /i-ɪ/, /u-ʊ/, /ɑ-ʌ/, and /æ-ε/, by examining the acoustic measurements of duration, the first three formant frequencies, and the slope of the first formant movement (F1 slope). The dynamic formant trajectory was modeled using discrete cosine transform coefficients to demonstrate the time-varying properties of formant trajectories. A discriminant analysis was employed to illustrate the extent to which Chinese EFL learners relied on different acoustic parameters. This study found that: (1) Chinese EFL learners overemphasized durational differences and weakened spectral differences for the /i-ɪ/, /u-ʊ/, and /ɑ-ʌ/ pairs, although they maintained sufficient spectral differences for /æ-ε/. In contrast, native English speakers predominantly used spectral differences across all four pairs; (2) in non-low tense-lax contrasts, unlike native English speakers, Chinese EFL learners failed to exhibit different F1 slope values, indicating a non-nativelike tongue-root placement during the articulatory process. The findings underscore the contribution of dynamic spectral patterns to the differentiation between English tense and lax vowels, and reveal the influence of precise articulatory gestures on the realization of the tense-lax contrast.
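The abstract above describes modeling dynamic formant trajectories with discrete cosine transform (DCT) coefficients. As a rough illustration of what such coefficients capture, the sketch below computes the first few coefficients of an unnormalized DCT-II for a sampled formant track. The function name, the number of coefficients, and the scaling convention are assumptions; the abstract does not specify the study's own settings.

```python
import numpy as np

def dct_coefficients(trajectory, n_coeffs=3):
    """First n_coeffs coefficients of an unnormalized DCT-II.

    X_k = sum_n x_n * cos(pi / N * (n + 0.5) * k), for k = 0..n_coeffs-1.
    With this scaling, X_0 is N times the trajectory mean, X_1 reflects
    the overall slope, and X_2 reflects the curvature.
    """
    x = np.asarray(trajectory, dtype=float)
    n = x.size
    k = np.arange(n_coeffs)[:, None]   # coefficient index (column vector)
    idx = np.arange(n)[None, :]        # sample index (row vector)
    basis = np.cos(np.pi / n * (idx + 0.5) * k)
    return basis @ x
```

Under this convention a flat F1 track yields zero for every coefficient above the zeroth, while a steadily rising track yields a negative first coefficient, so the low-order coefficients summarize the time-varying shape in a few numbers.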


Subject(s)
Multilingualism , Phonetics , Speech Acoustics , Humans , Male , Female , Young Adult , Speech Production Measurement , Adult , Language , Acoustics , Learning , Voice Quality , Sound Spectrography , East Asian People
14.
Article in Chinese | MEDLINE | ID: mdl-38561259

ABSTRACT

Objective: To investigate the clinical characteristics of vocal fold epidermoid cysts coexisting with sulcus vocalis and the voice outcomes after laryngeal microsurgery. Methods: The clinical data of 115 patients with vocal fold epidermoid cysts coexisting with sulcus vocalis at Shandong Provincial ENT Hospital were retrospectively analyzed. The patients comprised 49 males and 66 females, aged 17-70 years, with a duration of hoarseness ranging from 6 months to 30 years. All patients underwent surgery through a suspension laryngoscope and microscope under general anesthesia. Ninety-four patients were treated with microflap excision of the sulcus vocalis, cyst wall, and contents. The 21 patients who also had mucosal bridges underwent mucosal bridge resection (2 cases) or mucosal bridge reconstruction (19 cases). Videolaryngoscopy, subjective voice evaluation (GRBAS), objective voice evaluation, and the Voice Handicap Index (VHI) were performed before and after surgery. All patients underwent histopathologic examination and follow-up after the procedure. The preoperative acoustic parameters of patients with vocal fold epidermoid cysts coexisting with sulcus vocalis were compared with those of patients with vocal fold mucus retention cysts and simple vocal fold epidermoid cysts by independent samples t-test. Preoperative and postoperative parameters were compared by paired t-test. Results: Videolaryngostroboscopy showed a significant reduction or lack of mucosal waves in all 115 cases. In addition, vascular changes including dilation, tortuosity, increased branching, and abrupt changes of direction were seen over the cystic area. Cysts and/or sulcus vocalis were detected by preoperative laryngoscopy in 81 patients and by intraoperative microscopy in the remaining 34 patients. The intraoperative microscopic examination revealed a focal pouch-like deficit plunging into the vocal ligament or muscle. Sulcus vocalis lay on the deep surface of the mucosal bridges, and 89 cysts were lined with caseous content. Histopathology demonstrated a cystic cavity structure lined with squamous epithelium, with caseous keratin desquamation inside the cystic cavity. Four of the 115 patients were lost to follow-up and excluded from the analysis of voice outcomes after surgery. One month after surgery, there was no significant improvement in mucosal wave or voice quality in all but 14 patients. Except for the fundamental frequency and noise-to-harmonic ratio (NHR), all other voice parameters [G, R, B, A, VHI-10, jitter, shimmer, maximum phonatory time (MPT)] showed a significant improvement 3 months after surgery (t=15.82, 20.82, 17.61, 7.30, 38.88, 7.84, 5.88, -6.26, respectively, P<0.05). Mucosal waves and voice quality then gradually improved and became stable by 6 months after surgery. At that point the subjective and objective voice parameters (G, R, B, A, VHI-10, jitter, shimmer, NHR, MPT), except for the fundamental frequency, were all significantly improved (t=23.47, 25.79, 18.37, 9.84, 54.45, 10.68, 8.07, 3.24, -9.08, respectively, P<0.05). In addition, 2 patients showed no significant improvement after the operation. Stable function with no complications was observed in 111 patients during the 12-month follow-up period (up to 3 years in 34 patients). Conclusion: Ruptured vocal fold epidermoid cysts can result in sulcus vocalis and mucosal bridges. Characteristic changes on preoperative videolaryngoscopy are effective diagnostic indicators. Complete excision of the cyst wall and repair of the lamina propria can lead to satisfactory long-term effects.


Subject(s)
Epidermal Cyst , Laryngeal Diseases , Male , Female , Humans , Adolescent , Young Adult , Adult , Middle Aged , Aged , Vocal Cords/pathology , Epidermal Cyst/complications , Epidermal Cyst/surgery , Epidermal Cyst/pathology , Retrospective Studies , Laryngeal Diseases/surgery , Laryngeal Diseases/pathology , Voice Quality , Treatment Outcome
15.
J Acoust Soc Am ; 155(4): 2659-2669, 2024 Apr 01.
Article in English | MEDLINE | ID: mdl-38634661

ABSTRACT

Within the realm of voice classification, singers can be sub-categorized by the weight of their repertoire, the so-called "singer's Fach." However, the opposite pole terms "lyric" and "dramatic" singing are not yet well defined by their acoustic and articulatory characteristics. Nine professional singers of different Fach were asked to sing a diatonic scale on the vowel /a/, first in a way they considered lyric and then in a way they considered dramatic. Images were recorded using real-time magnetic resonance imaging (MRI) at 25 frames/s, and the audio signal was recorded via an optical microphone system. The analysis covered sound pressure level (SPL), vibrato amplitude and frequency, resonance frequencies, and articulatory settings of the vocal tract. It revealed three primary differences between dramatic and lyric singing: dramatic singing was associated with greater SPL, greater vibrato amplitude and frequency, and lower resonance frequencies. The higher SPL is an indication of voice source changes, and the lower resonance frequencies are probably caused by the lower larynx position. However, all these strategies showed considerable individual variability. The singers' Fach might contribute to perceptual differences even for the same singer with regard to the respective repertoire.


Subject(s)
Music , Singing , Voice Quality , Acoustics
16.
Eur Rev Med Pharmacol Sci ; 28(7): 2701-2709, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38639510

ABSTRACT

OBJECTIVE: Vocal cord paralysis (VCP) is a serious complication of thyroidectomy; however, its management remains unclear. The present study evaluated the voice parameters of patients who underwent surgery using Intraoperative Neurophysiologic Monitoring (IONM). PATIENTS AND METHODS: A total of 52 patients (41 females and 11 males) who underwent total thyroidectomy were evaluated using objective and subjective voice analysis before and after surgery. Acoustic parameters, such as Fundamental Frequency (F0), Shimmer, Jitter, and Noise-to-Harmonic ratio (NHR), and aerodynamic parameters, including S/Z ratio and maximum phonation time (MPT), were analyzed. Subjective findings, including the VHI-10 (Voice Handicap Index) and V-RQOL (Voice-Related Quality of Life), were also analyzed. The relationship between voice parameters and IONM values was investigated. RESULTS: The objective analysis (acoustic and aerodynamic parameters) showed no difference (p>0.05). However, the subjective analysis, which involved the VHI-10 and V-RQOL measures, revealed a significant difference before and after the operation (p<0.05). The Spearman correlation analysis showed that the NHR postoperative 1st-month parameter negatively correlated (rho=-0.317, p<0.059), while the F0 postoperative 6th-month parameter positively correlated (rho=0.347) with the amplitude difference before and after dissection (Right R2-R1 difference) for the right RLN measured in IONM. CONCLUSIONS: Patients who are planning to undergo thyroidectomy should have their voice assessed in both the preoperative and postoperative periods. IONM could improve voice quality outcomes.


Subject(s)
Vocal Cord Paralysis , Voice Disorders , Male , Female , Humans , Voice Quality , Thyroidectomy/adverse effects , Quality of Life , Acoustics , Vocal Cord Paralysis/diagnosis , Vocal Cord Paralysis/etiology , Voice Disorders/diagnosis , Voice Disorders/etiology
17.
Sci Rep ; 14(1): 8977, 2024 04 18.
Article in English | MEDLINE | ID: mdl-38637516

ABSTRACT

Why do we prefer some singers to others? We investigated how much singing voice preferences can be traced back to objective features of the stimuli. To do so, we asked participants to rate short excerpts of singing performances in terms of how much they liked them as well as in terms of 10 perceptual attributes (e.g., pitch accuracy, tempo, breathiness). We modeled liking ratings based on these perceptual ratings, as well as based on acoustic features and low-level features derived from Music Information Retrieval (MIR). Mean liking ratings for each stimulus were highly correlated between Experiment 1 (online, US-based participants) and Experiment 2 (in the lab, German participants), suggesting a role for attributes of the stimuli in grounding average preferences. We show that acoustic and MIR features barely explain any variance in liking ratings; in contrast, perceptual features of the voices achieved around 43% of prediction. Inter-rater agreement in liking and perceptual ratings was low, indicating substantial (and unsurprising) individual differences in participants' preferences and perception of the stimuli. Our results indicate that singing voice preferences are not grounded in acoustic attributes of the voices per se, but in how these features are perceptually interpreted by listeners.


Subject(s)
Music , Singing , Voice , Humans , Voice Quality , Acoustics
18.
Sci Rep ; 14(1): 9297, 2024 04 23.
Article in English | MEDLINE | ID: mdl-38654036

ABSTRACT

Voice change is often the first sign of laryngeal cancer, leading to diagnosis through hospital laryngoscopy. Screening for laryngeal cancer solely based on voice could enhance early detection. However, identifying voice indicators specific to laryngeal cancer is challenging, especially when differentiating it from other laryngeal ailments. This study presents an artificial intelligence model designed to distinguish between healthy voices, laryngeal cancer voices, and voices affected by other laryngeal conditions. We gathered voice samples from individuals with laryngeal cancer, vocal cord paralysis, and benign mucosal diseases, and from healthy participants. Comprehensive testing was conducted to determine the best mel-frequency cepstral coefficient conversion and machine learning techniques, and the results were analyzed in depth. In our tests, distinguishing laryngeal diseases from healthy voices achieved an accuracy of 0.85-0.97. In multiclass classification, however, accuracy ranged from 0.75 to 0.83. These findings highlight the challenges of artificial intelligence-driven voice-based diagnosis due to overlaps with benign conditions but also underscore its potential.


Subject(s)
Artificial Intelligence , Laryngeal Diseases , Stroboscopy , Vocal Cords , Voice Quality , Adult , Aged , Humans , Male , Middle Aged , Case-Control Studies , Health , Laryngeal Diseases/classification , Laryngeal Diseases/diagnosis , Laryngeal Diseases/physiopathology , Laryngeal Neoplasms/diagnosis , Neural Networks, Computer , Squamous Cell Carcinoma of Head and Neck , Support Vector Machine , Vocal Cord Paralysis/diagnosis , Vocal Cords/pathology , Vocal Cords/physiopathology , Voice Disorders/classification , Voice Disorders/diagnosis , Voice Disorders/physiopathology
19.
Eur Arch Otorhinolaryngol ; 281(6): 3197-3205, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38568297

ABSTRACT

PURPOSE: The aim of this study was to calculate the percentage of Automatic Speaking Valve (ASV) use in a large cohort of laryngectomized patients with voice prosthesis (VP) and to analyze the main reasons for non-use. Subsequently, a specific rehabilitation training was proposed. METHODS: One hundred ten laryngectomized patients with VP were enrolled in the first phase of the study (census). Among them, 57 patients were included in the second phase (intervention), in which a training based on moving phonatory exercises was proposed. Structured questionnaires were used before and after training to investigate the ASV use rate (days/week and hours/day; reasons impeding ASV use), average adhesive lifetime during ASV use, hands-free speech duration, and skin irritation. Patients also expressed their degree of satisfaction on a visual analog scale (VAS) from 0 to 100. RESULTS: In the census phase, the percentage of ASV use (every day, without problems) was 17.27% (19/110 patients). The main causes of disuse were excessive fatigue and poor durability of the adhesives. The comparison of pre- vs. post-training results showed a statistically significant increase (p < 0.05) in all the investigated parameters. Patients reported a good level of treatment compliance (average frequency of performing exercises equal to 4.2 ± 2.5 days/week for 1.4 ± 1.01 h/day) and high degrees of satisfaction. After treatment, the percentage of ASV use increased by 43 percentage points, reaching a rate of 60% (66/110 patients). CONCLUSION: A specific and targeted approach that simulates the phonatory and breathing difficulties of everyday life can increase the ASV usage rate.
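The usage rates quoted in this abstract can be re-derived from the patient counts it reports. The short Python sketch below does only that arithmetic; the variable and function names are illustrative and not taken from the study.

```python
# Counts as reported in the abstract (census cohort of 110 patients).
COHORT = 110        # laryngectomized patients with a voice prosthesis
USERS_BEFORE = 19   # everyday ASV users before training
USERS_AFTER = 66    # ASV users reported after training


def usage_rate(users: int, total: int) -> float:
    """Return the usage rate as a percentage of the cohort."""
    return 100 * users / total


rate_before = usage_rate(USERS_BEFORE, COHORT)
rate_after = usage_rate(USERS_AFTER, COHORT)
# Difference between two percentages, i.e. percentage points.
gain_points = rate_after - rate_before

print(f"before training: {rate_before:.2f}%")
print(f"after training:  {rate_after:.2f}%")
print(f"gain:            {gain_points:.2f} percentage points")
```

Computed this way, the reported rise of 43 is the difference between the two rates (about 42.7 percentage points), not a relative increase.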


Subject(s)
Laryngectomy , Larynx, Artificial , Humans , Laryngectomy/rehabilitation , Laryngectomy/adverse effects , Male , Female , Middle Aged , Aged , Adult , Surveys and Questionnaires , Aged, 80 and over , Speech, Alaryngeal , Voice Quality , Prosthesis Design
20.
Langenbecks Arch Surg ; 409(1): 138, 2024 Apr 27.
Article in English | MEDLINE | ID: mdl-38676783

ABSTRACT

PURPOSE: Treating an infiltration of the recurrent laryngeal nerve (RLN) by thyroid carcinoma remains a subject of ongoing debate. Therefore, this study aims to provide a novel strategy for intraoperative phonosurgical management of an RLN infiltrated by thyroid carcinoma. METHODS: Forty-two patients with thyroid carcinoma infiltrating the RLN were recruited for this study and divided into three groups. Group A comprised six individuals with medullary thyroid cancer who underwent RLN resection and arytenoid adduction. Group B consisted of 29 differentiated thyroid cancer (DTC) patients who underwent RLN resection and ansa cervicalis (ACN)-to-RLN anastomosis. Group C included seven patients whose RLN was preserved. RESULTS: The videostroboscopic analysis and voice assessment collectively indicated substantial improvements in voice quality for patients in Groups A and B one year post-surgery. Additionally, the shaving technique maintained a normal or near-normal voice in Group C one year post-surgery. CONCLUSION: The new intraoperative phonosurgical strategy is as follows: Resection of the affected RLN and arytenoid adduction are required in cases of medullary or anaplastic carcinoma, regardless of preoperative RLN function. If the RLN is found during surgery to be infiltrated by well-differentiated thyroid cancer (WDTC) and is preoperatively paralyzed, we recommend resection of the involved RLN and immediate ACN-to-RLN anastomosis. If the vocal folds exhibit normal mobility preoperatively, the MACIS scoring system is used for patient risk stratification. When the MACIS score is > 6.99, resection of the involved RLN and immediate ACN-to-RLN anastomosis are performed. RLN preservation is limited to patients with MACIS scores ≤ 6.99.
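The decision rule stated in the conclusion can be laid out as a small function to make the branching explicit. This is only an illustrative sketch of the rule as the abstract words it; the names `Histology`, `rln_strategy`, and `MACIS_CUTOFF` are hypothetical and do not come from the paper, and how the MACIS score itself is computed is outside the sketch.

```python
from enum import Enum
from typing import Optional


class Histology(Enum):
    MEDULLARY = "medullary"
    ANAPLASTIC = "anaplastic"
    WELL_DIFFERENTIATED = "well-differentiated"


MACIS_CUTOFF = 6.99  # threshold quoted in the abstract

RESECT_ADDUCT = "RLN resection + arytenoid adduction"
RESECT_ANASTOMOSE = "RLN resection + ACN-to-RLN anastomosis"
PRESERVE = "RLN preservation"


def rln_strategy(histology: Histology,
                 rln_paralyzed_preop: bool,
                 macis_score: Optional[float] = None) -> str:
    """Map the abstract's stated criteria to one of three strategies."""
    # Medullary or anaplastic: same choice regardless of RLN function.
    if histology in (Histology.MEDULLARY, Histology.ANAPLASTIC):
        return RESECT_ADDUCT
    # Well-differentiated with a preoperatively paralyzed RLN.
    if rln_paralyzed_preop:
        return RESECT_ANASTOMOSE
    # Normal preoperative mobility: stratify by MACIS score.
    if macis_score is None:
        raise ValueError("MACIS score is needed when vocal fold mobility is normal")
    return RESECT_ANASTOMOSE if macis_score > MACIS_CUTOFF else PRESERVE


print(rln_strategy(Histology.WELL_DIFFERENTIATED, False, macis_score=7.2))
```

The function returns early for the two cases where the abstract does not consult the MACIS score, so the score is only required on the branch that actually uses it.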


Subject(s)
Recurrent Laryngeal Nerve , Thyroid Neoplasms , Thyroidectomy , Humans , Thyroid Neoplasms/surgery , Thyroid Neoplasms/pathology , Male , Female , Middle Aged , Adult , Recurrent Laryngeal Nerve/surgery , Thyroidectomy/methods , Vocal Cord Paralysis/etiology , Vocal Cord Paralysis/surgery , Aged , Voice Quality , Neoplasm Invasiveness/pathology , Treatment Outcome