Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 8.543
1.
J Acoust Soc Am ; 155(5): 2990-3004, 2024 May 01.
Article En | MEDLINE | ID: mdl-38717206

Speakers can place their prosodic prominence on any locations within a sentence, generating focus prosody for listeners to perceive new information. This study aimed to investigate age-related changes in the bottom-up processing of focus perception in Jianghuai Mandarin by clarifying the perceptual cues and the auditory processing abilities involved in the identification of focus locations. Young, middle-aged, and older speakers of Jianghuai Mandarin completed a focus identification task and an auditory perception task. The results showed that increasing age led to a decrease in listeners' accuracy rate in identifying focus locations, with all participants performing the worst when dynamic pitch cues were inaccessible. Auditory processing abilities did not predict focus perception performance in young and middle-aged listeners but accounted significantly for the variance in older adults' performance. These findings suggest that age-related deteriorations in focus perception can be largely attributed to declined auditory processing of perceptual cues. Poor ability to extract frequency modulation cues may be the most important underlying psychoacoustic factor for older adults' difficulties in perceiving focus prosody in Jianghuai Mandarin. The results contribute to our understanding of the bottom-up mechanisms involved in linguistic prosody processing in aging adults, particularly in tonal languages.


Aging , Cues , Speech Perception , Humans , Middle Aged , Aged , Male , Female , Aging/psychology , Aging/physiology , Young Adult , Adult , Speech Perception/physiology , Age Factors , Speech Acoustics , Acoustic Stimulation , Pitch Perception , Language , Voice Quality , Psychoacoustics , Audiometry, Speech
2.
J Acoust Soc Am ; 155(5): 3090-3100, 2024 May 01.
Article En | MEDLINE | ID: mdl-38717212

The perceived level of femininity and masculinity is a prominent property by which a speaker's voice is indexed, and a vocal expression incongruent with the speaker's gender identity can greatly contribute to gender dysphoria. Our understanding of the acoustic cues to the levels of masculinity and femininity perceived by listeners in voices is not well developed, and an increased understanding of them would benefit communication of therapy goals and evaluation in gender-affirming voice training. We developed a voice bank with 132 voices with a range of levels of femininity and masculinity expressed in the voice, as rated by 121 listeners in independent, individually randomized perceptual evaluations. Acoustic models were developed from measures identified as markers of femininity or masculinity in the literature using penalized regression and tenfold cross-validation procedures. The 223 most important acoustic cues explained 89% and 87% of the variance in the perceived level of femininity and masculinity in the evaluation set, respectively. The median fo was confirmed to provide the primary cue, but other acoustic properties must be considered in accurate models of femininity and masculinity perception. The developed models are proposed to afford communication and evaluation of gender-affirming voice training goals and improve voice synthesis efforts.


Cues , Speech Acoustics , Speech Perception , Voice Quality , Humans , Female , Male , Adult , Young Adult , Masculinity , Middle Aged , Femininity , Adolescent , Gender Identity , Acoustics
3.
J Acoust Soc Am ; 155(5): 3071-3089, 2024 May 01.
Article En | MEDLINE | ID: mdl-38717213

This study investigated how 40 Chinese learners of English as a foreign language (EFL learners) differed from 40 native English speakers in the production of four English tense-lax contrasts, /i-ɪ/, /u-ʊ/, /ɑ-ʌ/, and /æ-ε/, by examining the acoustic measurements of duration, the first three formant frequencies, and the slope of the first formant movement (F1 slope). The dynamic formant trajectory was modeled using discrete cosine transform coefficients to demonstrate the time-varying properties of formant trajectories. A discriminant analysis was employed to illustrate the extent to which Chinese EFL learners relied on different acoustic parameters. This study found that: (1) Chinese EFL learners overemphasized durational differences and weakened spectral differences for the /i-ɪ/, /u-ʊ/, and /ɑ-ʌ/ pairs, although they maintained sufficient spectral differences for /æ-ε/. In contrast, native English speakers predominantly used spectral differences across all four pairs; (2) in non-low tense-lax contrasts, unlike native English speakers, Chinese EFL learners failed to exhibit different F1 slope values, indicating a non-nativelike tongue-root placement during the articulatory process. The findings underscore the contribution of dynamic spectral patterns to the differentiation between English tense and lax vowels, and reveal the influence of precise articulatory gestures on the realization of the tense-lax contrast.


Multilingualism , Phonetics , Speech Acoustics , Humans , Male , Female , Young Adult , Speech Production Measurement , Adult , Language , Acoustics , Learning , Voice Quality , Sound Spectrography , East Asian People
4.
Rev Assoc Med Bras (1992) ; 70(4): e20231146, 2024.
Article En | MEDLINE | ID: mdl-38716939

OBJECTIVE: Therapy and vocal rehabilitation in laryngeal cancer impact patients' quality of life. The objective of this study was to evaluate the evolution of the quality of life of patients with laryngeal cancer submitted to total laryngectomy and using electrolarynx. METHODS: This is an observational study with a cross-sectional design and a quantitative approach. It was conducted between April 2022 and January 2023 in a Brazilian cancer hospital. For data collection, a quality of life questionnaire, validated for patients with head and neck cancer at the University of Washington, was applied in two phases: from 7 days after total laryngectomy and, subsequently, from 70 days after surgery using electronic larynx for at least 60 days. The inclusion criteria were patients undergoing total laryngectomy included on the Aldenora Bello Cancer Hospital's election list to receive the electronic larynx. Patients who did not sign the informed consent form were not included. RESULTS: The sample consisted of 31 patients, of which approximately 84% were men and approximately 93% at the age of 50 years or older. When comparing the phases, it is possible to observe that the item speech had the greatest progress, while chewing had the least. Only the item recreation, swallowing, taste, and saliva did not show any statistical significance. The score for the general quality of life questions increased. CONCLUSION: Electronic larynx is a viable and useful method of voice rehabilitation. Our data suggest that the use of the electrolarynx as a postlaryngectomy method of verbal communication is responsible for positive effects on patients' quality of life.


Laryngeal Neoplasms , Laryngectomy , Larynx, Artificial , Quality of Life , Humans , Laryngectomy/rehabilitation , Laryngectomy/psychology , Male , Middle Aged , Cross-Sectional Studies , Female , Laryngeal Neoplasms/surgery , Laryngeal Neoplasms/psychology , Aged , Surveys and Questionnaires , Voice Quality , Adult , Treatment Outcome
5.
Codas ; 36(4): e20230148, 2024.
Article Pt, En | MEDLINE | ID: mdl-38775526

PURPOSE: To evaluate the immediate effect of the inspiratory exercise with a booster and a respiratory exerciser on the voice of women without vocal complaints. METHODS: 25 women with no vocal complaints, between 18 and 34 years old, with a score of 1 on the Vocal Disorder Screening Index (ITDV) participated. Data collection was performed before and after performing the inspiratory exercise and consisted of recording the sustained vowel /a/, connected speech and maximum phonatory times (MPT) of vowels, fricative phonemes and counting numbers. In the auditory-perceptual judgment, the Vocal Deviation Scale (VSD) was used to verify the general degree of vocal deviation. Acoustic evaluation was performed using the PRAAT software and the parameters fundamental frequency (f0), jitter, shimmer, harmonium-to-noise ratio (HNR), Cepstral Peak Prominence Smoothed (CPPS), Acoustic Voice Quality Index (AVQI) and Acoustic Breathiness Index (ABI). To measure the aerodynamic measurements, the time of each emission was extracted in the Audacity program. Data were statistically analyzed using the Statistica for Windows software and normality was tested using the Shapiro-Wilk test. To compare the results, Student's and Wilcoxon's t tests were applied, adopting a significance level of 5%. RESULTS: There were no significant differences between the results of the JPA and the acoustic measures, in the pre and post inspiratory exercise moments. As for the aerodynamic measures, it was possible to observe a significant increase in the value of the TMF /s/ (p=0.008). CONCLUSION: There was no change in vocal quality after the inspiratory exercise with stimulator and respiratory exerciser, but an increase in the MPT of the phoneme /s/ was observed after the exercise.


OBJETIVO: Avaliar o efeito imediato do exercício inspiratório com incentivador e exercitador respiratório na voz de mulheres sem queixas vocais. MÉTODO: Participaram 25 mulheres sem queixas vocais, entre 18 e 34 anos, com pontuação 1 no Índice de Triagem para Distúrbio Vocal (ITDV). A coleta de dados foi realizada nos momentos antes e após realização de exercício inspiratório e consistiu na gravação de vogal sustentada /a/, fala encadeada e tempos máximos fonatórios (TMF) de vogais, fonemas fricativos e contagem de números. No julgamento perceptivo-auditivo foi utilizada a Escala de Desvio Vocal (EDV) para verificar o grau geral do desvio vocal. Avaliação acústica foi feita no software PRAAT e foram extraídos os parâmetros frequência fundamental (f0), jitter, shimmer, proporção harmônico -ruído (HNR), Cepstral Peak Prominence Smoothed (CPPS), Acoustic Voice Quality Index (AVQI) e Acoustic Breathiness Index (ABI). Para mensuração das medidas aerodinâmicas, o tempo de emissão foi extraído no programa Audacity. Para comparar os resultados utilizou-se o teste paramétrico t de Student para amostras dependentes na análise das variáveis com distribuição normal e o teste de Wilcoxon para variáveis com distribuição não normal. RESULTADOS: Não houve diferenças entre os resultados do JPA e das medidas acústicas, nos momentos pré e pós exercício inspiratório. Quanto às medidas aerodinâmicas foi possível observar aumento significativo no valor do TMF /s/ (p=0,008). CONCLUSÃO: Não houve modificação na qualidade vocal após o exercício inspiratório com incentivador e exercitador respiratório, porém foi observado aumento do TMF do fonema /s/ após a realização do exercício.


Breathing Exercises , Voice Quality , Humans , Female , Adult , Young Adult , Adolescent , Breathing Exercises/methods , Speech Acoustics , Voice Disorders/physiopathology , Voice Disorders/diagnosis , Phonation/physiology
6.
Article Zh | MEDLINE | ID: mdl-38561259

Objective: To investigate the clinical characteristics and voice outcomes after laryngeal microsurgery for vocal fold epidermoid cysts coexisting with sulcus vocalis. Methods: The clinical data of 115 vocal fold epidermoid cysts coexisting with sulcus vocalis patients in Shandong provincial ENT hospital, were retrospectively analyzed, including 49 males and 66 females, aged 17-70 years old, and the duration of hoarseness ranged from 6 months to 30 years. All patients underwent surgery through suspension laryngoscope and microscope under general anestgesia. Ninety-four patients were treated with microflap excision of sulcus vocalis, cyst wall, and contents.And 21 patients that occulted with mucosal bridges were applied mucosal bridges resection (2 cases) and mucosal bridges reconstruction (19 cases) respectively. Videolaryngoscopy, subjective voice evaluation (GRBAS), objective voice evaluation, and Voice Handicap Index(VHI) were performed before and after surgery. All patients underwent histopathologic examination and follow-up after the procedure. The preoperative acoustic parameters of patients with vocal fold epidermoid cysts coexisting with sulcus vocalis were compared with those of vocal fold mucus retention cysts and simple vocal fold epidermoid cysts by independent samples t-test. The patients were compared by paired t-test for preoperative and postoperative parameters. Results: Significant reduction or lack of mucosal waves were shown via videolaryngostroboscopy in all 115 cases.In addition, vascular changes including dilation, tortuousness, increased branches, and abrupt direction change were shown on the cystic area. Eighty-one patients were detected cysts and/or sulcus vocalis by preoperative laryngoscopy, and intraoperative microscopic findings in the remaining 34 patients. The intraoperative microscopic examination revealed a focal pouch-like deficit plunging into the vocal ligament or muscle. The deep surface of the mucosal bridges was sulcus vocalis, and that in 89 cysts was lined with caseous content. Histopathology demonstrated a cystic cavity structure lined with squamous epithelium and caseous keratin desquamation inside the cystic cavity. Four of 115 patients were lost at follow-up and excluded from the analysis of voice outcomes after surgery. There was no significant mucosal wave and the voice quality in all but 14 patients 1month after surgery. Except for the fundamental frequency and noise harmonic ratio, all other voice parameters[ G, R, B, A, VHI-10, jitter, shimmer, maximum phonatory time (MPT) ]showed a significant improvement 3 months after surgery(t=15.82, 20.82, 17.61, 7.30, 38.88, 7.84, 5.88, -6.26, respectively, P<0.05). Then mucosal waves and the voice quality were gradually improved and became steady in 6 months after surgery. The subjective and objective voice parameters[G, R, B, A, VHI-10, jitter, shimmer, noise to harmonic ratio(NHR), MPT], except for the fundamental frequency, were all significantly improved(t=23.47, 25.79, 18.37, 9.84, 54.45, 10.68, 8.07, 3.24, -9.08, respectively, P<0.05). In addition, there were 2 patients with no significant improvement after the operation. Steady function with no complications was observed during the 12 months (up to 3 years in 34 patients) follow-up period in 111 patients. Conclusion: Ruptured vocal fold epidermoid cysts can result in sulcus vocalis and mucosal bridges. Characteristics changes in preoperative videolaryngoscopy are effective diagnostic tools. The complete excision of the cyst wall and repair of the lamina propria can lead to satisfactory long-term effects.


Epidermal Cyst , Laryngeal Diseases , Male , Female , Humans , Adolescent , Young Adult , Adult , Middle Aged , Aged , Vocal Cords/pathology , Epidermal Cyst/complications , Epidermal Cyst/surgery , Epidermal Cyst/pathology , Retrospective Studies , Laryngeal Diseases/surgery , Laryngeal Diseases/pathology , Voice Quality , Treatment Outcome
7.
Noise Health ; 26(120): 1-7, 2024.
Article En | MEDLINE | ID: mdl-38570303

OBJECTIVE: Functional dysphonia can impair the language expression ability and adversely affect the career development of some patients. Therefore, an active exploration of effective treatment options is imperative. This study investigated the effect of Akson therapy on acoustic parameters in patients with functional dysphonia. MATERIALS AND METHODS: In this retrospective analysis, 79 patients with functional dysphonia who received conventional voice correction training from June 2020 to June 2021 were included in the reference group (RG). Our hospital has implemented Akson therapy since July 2021. Correspondingly, 72 patients with functional dysphonia who underwent Akson therapy from July 2021 to July 2022 were enrolled in the observation group (OG). The acoustic parameters such as fundamental frequency (F0), jitter, shimmer, and normalized noise energy (NNE); the aerodynamic parameters including maximum phonation time (MPT), mean airflow rate (MFR), and Voice Handicap Index-10 (VHI-10) score; and the Grade, Roughness, Breathiness, Asthenia, and Strain scale (GRBAS) score were measured before and after treatment and compared between the two groups. RESULTS: The F0, jitter, shimmer, NNE, MPT, and MFR values as well as the VHI-10 score and the grade (G), roughness (R), and breathiness (B) scores on the GRBAS did not significantly differ between the two groups before treatment (P > 0.05). However, significantly lower F0, jitter, shimmer, NNE, and MFR values and higher MPT levels were found in the OG compared to the RG after treatment (P < 0.001). Furthermore, the VHI-10 score and the G, R, and B scores were significantly lower in the OG than in the RG after treatment (P < 0.001), whereas the asthenia (A) and strain (S) scores remained at 0 before and after treatment. CONCLUSION: Akson therapy can improve the acoustic parameters of patients with functional dysphonia to a certain extent, indicating its potential application value.


Dysphonia , Humans , Dysphonia/therapy , Retrospective Studies , Asthenia , Voice Quality , Acoustics
8.
Eur Rev Med Pharmacol Sci ; 28(7): 2701-2709, 2024 Apr.
Article En | MEDLINE | ID: mdl-38639510

OBJECTIVE: Vocal cord paralysis (VCP) is a serious complication in thyroidectomy operations; however, its management remains unclear. The present study evaluated the voice parameters of patients who underwent surgery using Intraoperative Neurophysiologic Monitoring (IONM). PATIENTS AND METHODS: A total of 52 patients (41 females and 11 males) who underwent a total thyroidectomy operation were evaluated using objective and subjective voice analysis examinations before and after surgery. Acoustic parameters, such as Fundamental Frequency (F0), Shimmer, Jitter, Noise-to-Harmonic ratio (NHR), and aerodynamic parameters, including S/Z ratio and maximum phonation time (MPT), were analyzed. Objective findings, including the VHI-10 (Voice Handicap Index) and V-RQOL (Voice-Related Quality of Life), were also analyzed. The relationship between voice parameters and IONM values was investigated. RESULTS: The objective analysis (acoustic and aerodynamic parameters) showed no difference (p>0.05). However, the subjective analysis, which involved the VHI-10 and V-RQOL measures, revealed a significant difference before and after the operation (p<0.05). The Spearman correlation analysis showed that the NHR postoperative 1st-month parameter negatively correlated (rho=-0.317, p<0.059), while the F0 postoperative 6th-month parameter positively correlated (rho=0.347) with the amplitude difference before and after dissection (Right R2-R1 difference) for the right RLN measured in IONM. CONCLUSIONS: Patients who are planning to undergo a thyroidectomy procedure should undergo voice assessment during both the preoperative and postoperative periods. IONM could improve voice quality outcomes.


Vocal Cord Paralysis , Voice Disorders , Male , Female , Humans , Voice Quality , Thyroidectomy/adverse effects , Quality of Life , Acoustics , Vocal Cord Paralysis/diagnosis , Vocal Cord Paralysis/etiology , Voice Disorders/diagnosis , Voice Disorders/etiology
9.
Int J Pediatr Otorhinolaryngol ; 180: 111962, 2024 May.
Article En | MEDLINE | ID: mdl-38657429

PURPOSE: In this prospective study, we aimed to investigate the difference in voice acoustic parameters between girls with idiopathic central precocious puberty (ICPP) and those who developed normally during prepuberty. MATERIALS AND METHODS: Our study recruited 54 girls diagnosed with ICPP and randomly sampled 51 healthy prepubertal girls as the control. Tanner stages, circulating hormone levels and bone ages of the girls with ICPP and the age and body mass index (BMI) of all participants were recorded. Acoustic analyses were performed using PRAAT computer-based voice analysis software and the mean pitch (F0), jitter, shimmer, noise-to harmonic-ratio (NHR) and harmonic-to-noise ratio (HNR) values were compared in the patient and control groups. RESULTS: The two groups did not significantly differ in age or BMI. In the evaluation of the F0 and jitter values, we were found to be lower in the control group than in the patient group. However, we did not find a statistical significance. The mean shimmer values of the patient group were significantly higher than those of the control group. In addition, a statistically significant difference was noted for the mean HNR and NHR values (P < 0.001). A moderate negative correlation was found between shimmer and hormone levels in the patient group. CONCLUSIONS: Voice acoustic parameters one of the defining features of girls with ICPP. Voice changes in acoustic parameters could reflect hormonal changes during puberty. Clinicians should suspect ICPP when there is a change in the voice.


Puberty, Precocious , Humans , Puberty, Precocious/blood , Female , Child , Prospective Studies , Voice Quality/physiology , Speech Acoustics , Case-Control Studies , Voice/physiology , Body Mass Index
10.
Codas ; 36(3): e20230175, 2024.
Article En | MEDLINE | ID: mdl-38629682

PURPOSE: To assess the influence of the listener experience, measurement scales and the type of speech task on the auditory-perceptual evaluation of the overall severity (OS) of voice deviation and the predominant type of voice (rough, breathy or strain). METHODS: 22 listeners, divided into four groups participated in the study: speech-language pathologist specialized in voice (SLP-V), SLP non specialized in voice (SLP-NV), graduate students with auditory-perceptual analysis training (GS-T), and graduate students without auditory-perceptual analysis training (GS-U). The subjects rated the OS of voice deviation and the predominant type of voice of 44 voices by visual analog scale (VAS) and the numerical scale (score "G" from GRBAS), corresponding to six speech tasks such as sustained vowel /a/ and /ɛ/, sentences, number counting, running speech, and all five previous tasks together. RESULTS: Sentences obtained the best interrater reliability in each group, using both VAS and GRBAS. SLP-NV group demonstrated the best interrater reliability in OS judgment in different speech tasks using VAS or GRBAS. Sustained vowel (/a/ and /ɛ/) and running speech obtained the best interrater reliability among the groups of listeners in judging the predominant vocal quality. GS-T group got the best result of interrater reliability in judging the predominant vocal quality. CONCLUSION: The time of experience in the auditory-perceptual judgment of the voice, the type of training to which they were submitted, and the type of speech task influence the reliability of the auditory-perceptual evaluation of vocal quality.


Dysphonia , Speech Perception , Humans , Speech , Reproducibility of Results , Speech Production Measurement , Observer Variation , Voice Quality , Speech Acoustics
11.
Sci Rep ; 14(1): 9297, 2024 04 23.
Article En | MEDLINE | ID: mdl-38654036

Voice change is often the first sign of laryngeal cancer, leading to diagnosis through hospital laryngoscopy. Screening for laryngeal cancer solely based on voice could enhance early detection. However, identifying voice indicators specific to laryngeal cancer is challenging, especially when differentiating it from other laryngeal ailments. This study presents an artificial intelligence model designed to distinguish between healthy voices, laryngeal cancer voices, and those of the other laryngeal conditions. We gathered voice samples of individuals with laryngeal cancer, vocal cord paralysis, benign mucosal diseases, and healthy participants. Comprehensive testing was conducted to determine the best mel-frequency cepstral coefficient conversion and machine learning techniques, with results analyzed in-depth. In our tests, laryngeal diseases distinguishing from healthy voices achieved an accuracy of 0.85-0.97. However, when multiclass classification, accuracy ranged from 0.75 to 0.83. These findings highlight the challenges of artificial intelligence-driven voice-based diagnosis due to overlaps with benign conditions but also underscore its potential.


Artificial Intelligence , Laryngeal Diseases , Stroboscopy , Vocal Cords , Voice Quality , Adult , Aged , Humans , Male , Middle Aged , Case-Control Studies , Health , Laryngeal Diseases/classification , Laryngeal Diseases/diagnosis , Laryngeal Diseases/physiopathology , Laryngeal Neoplasms/diagnosis , Neural Networks, Computer , Squamous Cell Carcinoma of Head and Neck , Support Vector Machine , Vocal Cord Paralysis/diagnosis , Vocal Cords/pathology , Vocal Cords/physiopathology , Voice Disorders/classification , Voice Disorders/diagnosis , Voice Disorders/physiopathology
12.
Langenbecks Arch Surg ; 409(1): 138, 2024 Apr 27.
Article En | MEDLINE | ID: mdl-38676783

PURPOSE: Treating an infiltration of the recurrent laryngeal nerve (RLN) by thyroid carcinoma remains a subject of ongoing debate. Therefore, this study aims to provide a novel strategy for intraoperative phenosurgical management of RLN infiltrated by thyroid carcinoma. METHODS: Forty-two patients with thyroid carcinoma infiltrating the RLN were recruited for this study and divided into three groups. Group A comprised six individuals with medullary thyroid cancer who underwent RLN resection and arytenoid adduction. Group B consisted of 29 differentiated thyroid cancer (DTC)patients who underwent RLN resection and ansa cervicalis (ACN)-to-RLN anastomosis. Group C included seven patients whose RLN was preserved. RESULTS: The videostroboscopic analysis and voice assessment collectively indicated substantial improvements in voice quality for patients in Groups A and B one year post-surgery. Additionally, the shaving technique maintained a normal or near-normal voice in Group C one year post-surgery. CONCLUSION: The new intraoperative phonosurgical strategy is as follows: Resection of the affected RLN and arytenoid adduction is required in cases of medullary or anaplastic carcinoma, regardless of preoperative RLN function. Suppose RLN is found infiltrated by well-differentiated thyroid cancer (WDTC) during surgery, and the RLN is preoperatively paralyzed, we recommend performing resection the involved RLN and ACN-to-RLN anastomosis immediately during surgery. If vocal folds exhibit normal mobility preoperatively, the MACIS scoring system is used to assess patient risk stratification. When the MACIS score > 6.99, resection of the involved RLN and immediate ACN-to-RLN anastomosis were performed. RLN preservation was limited to patients with MACIS scores ≤ 6.99.


Recurrent Laryngeal Nerve , Thyroid Neoplasms , Thyroidectomy , Humans , Thyroid Neoplasms/surgery , Thyroid Neoplasms/pathology , Male , Female , Middle Aged , Adult , Recurrent Laryngeal Nerve/surgery , Thyroidectomy/methods , Vocal Cord Paralysis/etiology , Vocal Cord Paralysis/surgery , Aged , Voice Quality , Neoplasm Invasiveness/pathology , Treatment Outcome
13.
J Acoust Soc Am ; 155(4): 2659-2669, 2024 Apr 01.
Article En | MEDLINE | ID: mdl-38634661

Within the realm of voice classification, singers could be sub-categorized by the weight of their repertoire, the so-called "singer's Fach." However, the opposite pole terms "lyric" and "dramatic" singing are not yet well defined by their acoustic and articulatory characteristics. Nine professional singers of different singers' Fach were asked to sing a diatonic scale on the vowel /a/, first in what the singers considered as lyric and second in what they considered as dramatic. Image recording was performed using real time magnetic resonance imaging (MRI) with 25 frames/s, and the audio signal was recorded via an optical microphone system. Analysis was performed with regard to sound pressure level (SPL), vibrato amplitude, and frequency and resonance frequencies as well as articulatory settings of the vocal tract. The analysis revealed three primary differences between dramatic and lyric singing: Dramatic singing was associated with greater SPL and greater vibrato amplitude and frequency as well as lower resonance frequencies. The higher SPL is an indication of voice source changes, and the lower resonance frequencies are probably caused by the lower larynx position. However, all these strategies showed a considerable individual variability. The singers' Fach might contribute to perceptual differences even for the same singer with regard to the respective repertoire.


Music , Singing , Voice Quality , Acoustics
14.
Sci Rep ; 14(1): 8977, 2024 04 18.
Article En | MEDLINE | ID: mdl-38637516

Why do we prefer some singers to others? We investigated how much singing voice preferences can be traced back to objective features of the stimuli. To do so, we asked participants to rate short excerpts of singing performances in terms of how much they liked them as well as in terms of 10 perceptual attributes (e.g.: pitch accuracy, tempo, breathiness). We modeled liking ratings based on these perceptual ratings, as well as based on acoustic features and low-level features derived from Music Information Retrieval (MIR). Mean liking ratings for each stimulus were highly correlated between Experiments 1 (online, US-based participants) and 2 (in the lab, German participants), suggesting a role for attributes of the stimuli in grounding average preferences. We show that acoustic and MIR features barely explain any variance in liking ratings; in contrast, perceptual features of the voices achieved around 43% of prediction. Inter-rater agreement in liking and perceptual ratings was low, indicating substantial (and unsurprising) individual differences in participants' preferences and perception of the stimuli. Our results indicate that singing voice preferences are not grounded in acoustic attributes of the voices per se, but in how these features are perceptually interpreted by listeners.


Music , Singing , Voice , Humans , Voice Quality , Acoustics
15.
Eur Arch Otorhinolaryngol ; 281(6): 3197-3205, 2024 Jun.
Article En | MEDLINE | ID: mdl-38568297

PURPOSE: Aim of this study was to calculate the percentage of the Automatic Speaking Valve (ASV) use in a large cohort of laryngectomized patients with voice prosthesis (VP) and to analyze the main reasons for non-use. Subsequently, a specific rehabilitation training was proposed. METHODS: One hundred-ten laryngectomized patients with VP were enrolled in the first phase of the study (census). Among them, 57 patients were included in the second phase (intervention), in which a training based on moving phonatory exercises was proposed. Structured questionnaires were used before and after training in order to investigate ASV use rate (days/week and hours/day; reasons for impeding the ASV use), average adhesive life-time during ASV use; hands-free speech duration; skin irritation. Patients also expressed their degree of on a VAS scale from 0 to 100. RESULTS: In the census phase the percentage of use of ASV (everyday, without problems) was equal to 17.27% (19/110 patients). The main causes of disuse concerned excessive fatigue and poor durability of the adhesives. The analysis of the results pre vs. post-training showed a statistically significant increase (p < 0.05) in all the investigated parameters. Patients reported a good level of treatment compliance (average frequency of performing exercises equal to 4.2 ± 2.5 days/week for 1.4 ± 1.01 h/day) and high degrees of satisfaction. After treatment, the percentage of use of AVS increased by 43% reaching a rate of 60% (66/110 patients). CONCLUSION: A specific and targeted approach that simulate the phonatory and breathing difficulties of everyday life can increase the ASV usage rate.


Laryngectomy , Larynx, Artificial , Humans , Laryngectomy/rehabilitation , Laryngectomy/adverse effects , Male , Female , Middle Aged , Aged , Adult , Surveys and Questionnaires , Aged, 80 and over , Speech, Alaryngeal , Voice Quality , Prosthesis Design
16.
JAMA ; 331(15): 1259-1261, 2024 04 16.
Article En | MEDLINE | ID: mdl-38517420

In this Medical News article, Edward Chang, MD, chair of the department of neurological surgery at the University of California, San Francisco Weill Institute for Neurosciences joins JAMA Editor in Chief Kirsten Bibbins-Domingo, PhD, MD, MAS, to discuss the potential for AI to revolutionize communication for those unable to speak due to aphasia.


Aphasia , Artificial Intelligence , Avatar , Speech , Voice , Humans , Speech/physiology , Voice/physiology , Voice Quality , Aphasia/etiology , Aphasia/therapy , Equipment and Supplies
17.
J Speech Lang Hear Res ; 67(4): 1090-1106, 2024 Apr 08.
Article En | MEDLINE | ID: mdl-38498664

PURPOSE: This study examined speech changes induced by deep-brain stimulation (DBS) in speakers with Parkinson's disease (PD) using a set of auditory-perceptual and acoustic measures. METHOD: Speech recordings from nine speakers with PD and DBS were compared between DBS-On and DBS-Off conditions using auditory-perceptual and acoustic analyses. Auditory-perceptual ratings included voice quality, articulation precision, prosody, speech intelligibility, and listening effort obtained from 44 listeners. Acoustic measures were made for voicing proportion, second formant frequency slope, vowel dispersion, articulation rate, and range of fundamental frequency and intensity. RESULTS: No significant changes were found between DBS-On and DBS-Off for the five perceptual ratings. Four of six acoustic measures revealed significant differences between the two conditions. While articulation rate and acoustic vowel dispersion increased, voicing proportion and intensity range decreased from the DBS-Off to DBS-On condition. However, a visual examination of the data indicated that the statistical significance was mostly driven by a small number of participants, while the majority did not show a consistent pattern of such changes. CONCLUSIONS: Our data, in general, indicate no-to-minimal changes in speech production ensued from DBS stimulation. The findings are discussed with a focus on large interspeaker variability in PD in terms of their speech characteristics and the potential effects of DBS on speech.


Deep Brain Stimulation , Parkinson Disease , Humans , Acoustics , Speech Intelligibility/physiology , Voice Quality , Parkinson Disease/complications , Parkinson Disease/therapy , Brain , Speech Acoustics
18.
J Music Ther ; 61(2): 132-167, 2024 May 16.
Article En | MEDLINE | ID: mdl-38438312

Individuals with Parkinson's disease (PD) experience speech and voice-related symptoms that diminish communication and quality of life. Semi-occluded vocal tract (SOVT) exercises are targeted interventions that, when combined with the positive psychosocial benefits of therapeutic group singing (TGS), may affect outcomes. The purpose of this study was to explore the effectiveness of SOVT exercises, specifically straw phonation combined with TGS, to improve voice quality and mood for individuals with PD. We used a true experimental pretest-posttest between-subjects design (i.e., randomized controlled trial) facilitated by a board-certified music therapist. All participants (N = 27) were randomly assigned to one of three groups (a) straw phonation combined with TGS (SP + TGS, n = 10), (b) TGS (n = 10), and (c) speaking-only control group (n = 7). Participants completed voice recordings for acoustic measures and the Visual Analogue Mood Scale for mood analysis before and after a 30-min intervention. The results demonstrated significant improvement in voice quality evidenced by decreasing Acoustic Voice Quality Index scores following a single session for both SP + TGS and TGS intervention groups when compared to the control. Happiness scores improved in the experimental groups when compared to control. Although not statistically significant, participants in the experimental groups (SP + TGS, TGS) demonstrated better mean mood scores on happiness, anxiety, and angry when compared to control, indicating a positive psychological response to the singing interventions. Overall, this study indicated the effectiveness of SP + TGS and TGS as promising therapeutic interventions for voice quality and mood in individuals with PD.


Music Therapy , Parkinson Disease , Singing , Voice Quality , Humans , Parkinson Disease/psychology , Parkinson Disease/therapy , Parkinson Disease/complications , Parkinson Disease/rehabilitation , Male , Female , Aged , Middle Aged , Music Therapy/methods , Voice Training , Quality of Life , Treatment Outcome , Affect , Phonation , Voice Disorders/etiology , Voice Disorders/psychology , Voice Disorders/therapy , Voice Disorders/rehabilitation
19.
J Speech Lang Hear Res ; 67(4): 1072-1089, 2024 Apr 08.
Article En | MEDLINE | ID: mdl-38527275

PURPOSE: This study aimed to develop a valid and reliable bilingual version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) for the auditory-perceptual evaluation of voice in Catalan and Spanish speakers. METHOD: The development of this CAPE-V adaptation included Delphi methodology with 20 voice and speech experts reaching consensus on the optimal adapted terminology of the perceptual vocal attributes, considering also input from the original instrument authors. The adaptation and validation of vocal tasks followed a sequential validation procedure, with input from phoneticians and speech-language pathologists. Following pilot testing with a large sample of speech-language pathology students, a refined adapted version was empirically tested for validity and reliability. Concurrent validity was assessed by comparing the adapted CAPE-V with the reference Grade, Roughness, Breathiness, Asthenia, Strain scale. Construct validity was assessed through convergent and discriminant validity analysis. Intrarater and interrater reliability were assessed via intraclass correlation coefficient calculations. User experience was evaluated through a questionnaire. Scale properties were validated using a confusion matrix, and cutoff values were calculated to achieve the optimal balance between sensitivity and specificity. RESULTS: Through a formalized consensus process, optimal Catalan/Spanish terminology was determined for the perceptual attributes of voice present in the CAPE-V. An adapted protocol of tasks was obtained that preserves the objectives of the original instrument and the relevance of the phonetic criteria in the target languages. The results demonstrated concurrent validity, construct validity, and intrarater reliability. Interrater reliability was found to depend on the extent to which evaluators shared their internal standards. The raters identified CAPE-V as an effective and preferred instrument. CONCLUSION: An adapted, validated version of the CAPE-V is made available to clinical professionals for the evaluation of voice in Catalan and Spanish speakers.


Dysphonia , Humans , Cross-Cultural Comparison , Consensus , Reproducibility of Results , Voice Quality , Observer Variation
20.
Am J Otolaryngol ; 45(3): 104255, 2024.
Article En | MEDLINE | ID: mdl-38471418

PURPOSE: The main aim of this study was to investigate the clinical efficacy of speech therapy, delivered via tele-practice to patients with dysphonia. A secondary aim was to verify whether a telerehabilitation-only protocol could have a clinical efficacy similar to a combined telerehabilitation and in-person approach. METHODS: Thirty-two consecutive patients undergoing telerehabilitation for dysphonia were retrospectively considered. Patients were divided into two groups: those who received combined in-person and telerehabilitation treatment, and those who underwent telerehabilitation only. RESULTS: Overall, patients included in this study showed a significant improvement in their VHI-10 scores after treatment (p < 0.001). Such an improvement was also significant in both combined therapy and telerehabilitation only groups (p = 0.019, and p = 0.002, respectively). A significant reduction in general degree of dysphonia (G), roughness (R), breathiness (B) and strain (S) scores (p < 0.001, p = 0.012, p < 0.001, and p < 0.001, respectively) was noticed over the whole sample after treatment. The same parameters showed a significant improvement also in the combined therapy group, while in the telerehabilitation only group, only G, B and S scores significantly improved. Mean phonation time, Jitter and Shimmer values significantly improved in the overall sample as well as in the combined therapy group. A significantly more favorable spectrographic class relative to the vowel /a/ was found after treatment in the whole sample, as well as in both combined therapy and telerehabilitation only groups (p < 0.001, p = 0.002, p = 0.004, respectively). CONCLUSION: This study's results seem to support telerehabilitation as a potentially effective tool to administer speech therapy in dysphonic patients, both as a single modality and in combination with traditional in-person sessions. To better characterize the clinical results of telerehabilitation in dysphonia treatment, large-scale prospective investigations are mandatory.


Dysphonia , Telerehabilitation , Voice Training , Humans , Dysphonia/rehabilitation , Dysphonia/therapy , Female , Male , Telerehabilitation/methods , Retrospective Studies , Middle Aged , Treatment Outcome , Adult , Speech Therapy/methods , Aged , Voice Quality
...