Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 8.513
Filtrar
1.
Codas ; 36(3): e20230175, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38629682

RESUMO

PURPOSE: To assess the influence of the listener experience, measurement scales and the type of speech task on the auditory-perceptual evaluation of the overall severity (OS) of voice deviation and the predominant type of voice (rough, breathy or strain). METHODS: 22 listeners, divided into four groups participated in the study: speech-language pathologist specialized in voice (SLP-V), SLP non specialized in voice (SLP-NV), graduate students with auditory-perceptual analysis training (GS-T), and graduate students without auditory-perceptual analysis training (GS-U). The subjects rated the OS of voice deviation and the predominant type of voice of 44 voices by visual analog scale (VAS) and the numerical scale (score "G" from GRBAS), corresponding to six speech tasks such as sustained vowel /a/ and /ɛ/, sentences, number counting, running speech, and all five previous tasks together. RESULTS: Sentences obtained the best interrater reliability in each group, using both VAS and GRBAS. SLP-NV group demonstrated the best interrater reliability in OS judgment in different speech tasks using VAS or GRBAS. Sustained vowel (/a/ and /ɛ/) and running speech obtained the best interrater reliability among the groups of listeners in judging the predominant vocal quality. GS-T group got the best result of interrater reliability in judging the predominant vocal quality. CONCLUSION: The time of experience in the auditory-perceptual judgment of the voice, the type of training to which they were submitted, and the type of speech task influence the reliability of the auditory-perceptual evaluation of vocal quality.


Assuntos
Disfonia , Percepção da Fala , Humanos , Fala , Reprodutibilidade dos Testes , Medida da Produção da Fala , Variações Dependentes do Observador , Qualidade da Voz , Acústica da Fala
2.
Noise Health ; 26(120): 1-7, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38570303

RESUMO

OBJECTIVE: Functional dysphonia can impair the language expression ability and adversely affect the career development of some patients. Therefore, an active exploration of effective treatment options is imperative. This study investigated the effect of Akson therapy on acoustic parameters in patients with functional dysphonia. MATERIALS AND METHODS: In this retrospective analysis, 79 patients with functional dysphonia who received conventional voice correction training from June 2020 to June 2021 were included in the reference group (RG). Our hospital has implemented Akson therapy since July 2021. Correspondingly, 72 patients with functional dysphonia who underwent Akson therapy from July 2021 to July 2022 were enrolled in the observation group (OG). The acoustic parameters such as fundamental frequency (F0), jitter, shimmer, and normalized noise energy (NNE); the aerodynamic parameters including maximum phonation time (MPT), mean airflow rate (MFR), and Voice Handicap Index-10 (VHI-10) score; and the Grade, Roughness, Breathiness, Asthenia, and Strain scale (GRBAS) score were measured before and after treatment and compared between the two groups. RESULTS: The F0, jitter, shimmer, NNE, MPT, and MFR values as well as the VHI-10 score and the grade (G), roughness (R), and breathiness (B) scores on the GRBAS did not significantly differ between the two groups before treatment (P > 0.05). However, significantly lower F0, jitter, shimmer, NNE, and MFR values and higher MPT levels were found in the OG compared to the RG after treatment (P < 0.001). Furthermore, the VHI-10 score and the G, R, and B scores were significantly lower in the OG than in the RG after treatment (P < 0.001), whereas the asthenia (A) and strain (S) scores remained at 0 before and after treatment. CONCLUSION: Akson therapy can improve the acoustic parameters of patients with functional dysphonia to a certain extent, indicating its potential application value.


Assuntos
Disfonia , Humanos , Disfonia/terapia , Estudos Retrospectivos , Astenia , Qualidade da Voz , Acústica
3.
Artigo em Chinês | MEDLINE | ID: mdl-38561259

RESUMO

Objective: To investigate the clinical characteristics and voice outcomes after laryngeal microsurgery for vocal fold epidermoid cysts coexisting with sulcus vocalis. Methods: The clinical data of 115 vocal fold epidermoid cysts coexisting with sulcus vocalis patients in Shandong provincial ENT hospital, were retrospectively analyzed, including 49 males and 66 females, aged 17-70 years old, and the duration of hoarseness ranged from 6 months to 30 years. All patients underwent surgery through suspension laryngoscope and microscope under general anestgesia. Ninety-four patients were treated with microflap excision of sulcus vocalis, cyst wall, and contents.And 21 patients that occulted with mucosal bridges were applied mucosal bridges resection (2 cases) and mucosal bridges reconstruction (19 cases) respectively. Videolaryngoscopy, subjective voice evaluation (GRBAS), objective voice evaluation, and Voice Handicap Index(VHI) were performed before and after surgery. All patients underwent histopathologic examination and follow-up after the procedure. The preoperative acoustic parameters of patients with vocal fold epidermoid cysts coexisting with sulcus vocalis were compared with those of vocal fold mucus retention cysts and simple vocal fold epidermoid cysts by independent samples t-test. The patients were compared by paired t-test for preoperative and postoperative parameters. Results: Significant reduction or lack of mucosal waves were shown via videolaryngostroboscopy in all 115 cases.In addition, vascular changes including dilation, tortuousness, increased branches, and abrupt direction change were shown on the cystic area. Eighty-one patients were detected cysts and/or sulcus vocalis by preoperative laryngoscopy, and intraoperative microscopic findings in the remaining 34 patients. The intraoperative microscopic examination revealed a focal pouch-like deficit plunging into the vocal ligament or muscle. The deep surface of the mucosal bridges was sulcus vocalis, and that in 89 cysts was lined with caseous content. Histopathology demonstrated a cystic cavity structure lined with squamous epithelium and caseous keratin desquamation inside the cystic cavity. Four of 115 patients were lost at follow-up and excluded from the analysis of voice outcomes after surgery. There was no significant mucosal wave and the voice quality in all but 14 patients 1month after surgery. Except for the fundamental frequency and noise harmonic ratio, all other voice parameters[ G, R, B, A, VHI-10, jitter, shimmer, maximum phonatory time (MPT) ]showed a significant improvement 3 months after surgery(t=15.82, 20.82, 17.61, 7.30, 38.88, 7.84, 5.88, -6.26, respectively, P<0.05). Then mucosal waves and the voice quality were gradually improved and became steady in 6 months after surgery. The subjective and objective voice parameters[G, R, B, A, VHI-10, jitter, shimmer, noise to harmonic ratio(NHR), MPT], except for the fundamental frequency, were all significantly improved(t=23.47, 25.79, 18.37, 9.84, 54.45, 10.68, 8.07, 3.24, -9.08, respectively, P<0.05). In addition, there were 2 patients with no significant improvement after the operation. Steady function with no complications was observed during the 12 months (up to 3 years in 34 patients) follow-up period in 111 patients. Conclusion: Ruptured vocal fold epidermoid cysts can result in sulcus vocalis and mucosal bridges. Characteristics changes in preoperative videolaryngoscopy are effective diagnostic tools. The complete excision of the cyst wall and repair of the lamina propria can lead to satisfactory long-term effects.


Assuntos
Cisto Epidérmico , Doenças da Laringe , Masculino , Feminino , Humanos , Adolescente , Adulto Jovem , Adulto , Pessoa de Meia-Idade , Idoso , Prega Vocal/patologia , Cisto Epidérmico/complicações , Cisto Epidérmico/cirurgia , Cisto Epidérmico/patologia , Estudos Retrospectivos , Doenças da Laringe/cirurgia , Doenças da Laringe/patologia , Qualidade da Voz , Resultado do Tratamento
4.
Sci Rep ; 14(1): 8977, 2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38637516

RESUMO

Why do we prefer some singers to others? We investigated how much singing voice preferences can be traced back to objective features of the stimuli. To do so, we asked participants to rate short excerpts of singing performances in terms of how much they liked them as well as in terms of 10 perceptual attributes (e.g.: pitch accuracy, tempo, breathiness). We modeled liking ratings based on these perceptual ratings, as well as based on acoustic features and low-level features derived from Music Information Retrieval (MIR). Mean liking ratings for each stimulus were highly correlated between Experiments 1 (online, US-based participants) and 2 (in the lab, German participants), suggesting a role for attributes of the stimuli in grounding average preferences. We show that acoustic and MIR features barely explain any variance in liking ratings; in contrast, perceptual features of the voices achieved around 43% of prediction. Inter-rater agreement in liking and perceptual ratings was low, indicating substantial (and unsurprising) individual differences in participants' preferences and perception of the stimuli. Our results indicate that singing voice preferences are not grounded in acoustic attributes of the voices per se, but in how these features are perceptually interpreted by listeners.


Assuntos
Música , Canto , Voz , Humanos , Qualidade da Voz , Acústica
5.
Eur Rev Med Pharmacol Sci ; 28(7): 2701-2709, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38639510

RESUMO

OBJECTIVE: Vocal cord paralysis (VCP) is a serious complication in thyroidectomy operations; however, its management remains unclear. The present study evaluated the voice parameters of patients who underwent surgery using Intraoperative Neurophysiologic Monitoring (IONM). PATIENTS AND METHODS: A total of 52 patients (41 females and 11 males) who underwent a total thyroidectomy operation were evaluated using objective and subjective voice analysis examinations before and after surgery. Acoustic parameters, such as Fundamental Frequency (F0), Shimmer, Jitter, Noise-to-Harmonic ratio (NHR), and aerodynamic parameters, including S/Z ratio and maximum phonation time (MPT), were analyzed. Objective findings, including the VHI-10 (Voice Handicap Index) and V-RQOL (Voice-Related Quality of Life), were also analyzed. The relationship between voice parameters and IONM values was investigated. RESULTS: The objective analysis (acoustic and aerodynamic parameters) showed no difference (p>0.05). However, the subjective analysis, which involved the VHI-10 and V-RQOL measures, revealed a significant difference before and after the operation (p<0.05). The Spearman correlation analysis showed that the NHR postoperative 1st-month parameter negatively correlated (rho=-0.317, p<0.059), while the F0 postoperative 6th-month parameter positively correlated (rho=0.347) with the amplitude difference before and after dissection (Right R2-R1 difference) for the right RLN measured in IONM. CONCLUSIONS: Patients who are planning to undergo a thyroidectomy procedure should undergo voice assessment during both the preoperative and postoperative periods. IONM could improve voice quality outcomes.


Assuntos
Paralisia das Pregas Vocais , Distúrbios da Voz , Masculino , Feminino , Humanos , Qualidade da Voz , Tireoidectomia/efeitos adversos , Qualidade de Vida , Acústica , Paralisia das Pregas Vocais/diagnóstico , Paralisia das Pregas Vocais/etiologia , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/etiologia
6.
J Acoust Soc Am ; 155(4): 2659-2669, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38634661

RESUMO

Within the realm of voice classification, singers could be sub-categorized by the weight of their repertoire, the so-called "singer's Fach." However, the opposite pole terms "lyric" and "dramatic" singing are not yet well defined by their acoustic and articulatory characteristics. Nine professional singers of different singers' Fach were asked to sing a diatonic scale on the vowel /a/, first in what the singers considered as lyric and second in what they considered as dramatic. Image recording was performed using real time magnetic resonance imaging (MRI) with 25 frames/s, and the audio signal was recorded via an optical microphone system. Analysis was performed with regard to sound pressure level (SPL), vibrato amplitude, and frequency and resonance frequencies as well as articulatory settings of the vocal tract. The analysis revealed three primary differences between dramatic and lyric singing: Dramatic singing was associated with greater SPL and greater vibrato amplitude and frequency as well as lower resonance frequencies. The higher SPL is an indication of voice source changes, and the lower resonance frequencies are probably caused by the lower larynx position. However, all these strategies showed a considerable individual variability. The singers' Fach might contribute to perceptual differences even for the same singer with regard to the respective repertoire.


Assuntos
Música , Canto , Qualidade da Voz , Acústica
7.
JAMA ; 331(15): 1259-1261, 2024 04 16.
Artigo em Inglês | MEDLINE | ID: mdl-38517420

RESUMO

In this Medical News article, Edward Chang, MD, chair of the department of neurological surgery at the University of California, San Francisco Weill Institute for Neurosciences joins JAMA Editor in Chief Kirsten Bibbins-Domingo, PhD, MD, MAS, to discuss the potential for AI to revolutionize communication for those unable to speak due to aphasia.


Assuntos
Afasia , Inteligência Artificial , 60453 , Fala , Voz , Humanos , Fala/fisiologia , Voz/fisiologia , Qualidade da Voz , Afasia/etiologia , Afasia/terapia , Equipamentos e Provisões
8.
J Speech Lang Hear Res ; 67(4): 1090-1106, 2024 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-38498664

RESUMO

PURPOSE: This study examined speech changes induced by deep-brain stimulation (DBS) in speakers with Parkinson's disease (PD) using a set of auditory-perceptual and acoustic measures. METHOD: Speech recordings from nine speakers with PD and DBS were compared between DBS-On and DBS-Off conditions using auditory-perceptual and acoustic analyses. Auditory-perceptual ratings included voice quality, articulation precision, prosody, speech intelligibility, and listening effort obtained from 44 listeners. Acoustic measures were made for voicing proportion, second formant frequency slope, vowel dispersion, articulation rate, and range of fundamental frequency and intensity. RESULTS: No significant changes were found between DBS-On and DBS-Off for the five perceptual ratings. Four of six acoustic measures revealed significant differences between the two conditions. While articulation rate and acoustic vowel dispersion increased, voicing proportion and intensity range decreased from the DBS-Off to DBS-On condition. However, a visual examination of the data indicated that the statistical significance was mostly driven by a small number of participants, while the majority did not show a consistent pattern of such changes. CONCLUSIONS: Our data, in general, indicate no-to-minimal changes in speech production ensued from DBS stimulation. The findings are discussed with a focus on large interspeaker variability in PD in terms of their speech characteristics and the potential effects of DBS on speech.


Assuntos
Estimulação Encefálica Profunda , Doença de Parkinson , Humanos , Acústica , Inteligibilidade da Fala/fisiologia , Qualidade da Voz , Doença de Parkinson/complicações , Doença de Parkinson/terapia , Encéfalo , Acústica da Fala
9.
J Speech Lang Hear Res ; 67(4): 1072-1089, 2024 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-38527275

RESUMO

PURPOSE: This study aimed to develop a valid and reliable bilingual version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) for the auditory-perceptual evaluation of voice in Catalan and Spanish speakers. METHOD: The development of this CAPE-V adaptation included Delphi methodology with 20 voice and speech experts reaching consensus on the optimal adapted terminology of the perceptual vocal attributes, considering also input from the original instrument authors. The adaptation and validation of vocal tasks followed a sequential validation procedure, with input from phoneticians and speech-language pathologists. Following pilot testing with a large sample of speech-language pathology students, a refined adapted version was empirically tested for validity and reliability. Concurrent validity was assessed by comparing the adapted CAPE-V with the reference Grade, Roughness, Breathiness, Asthenia, Strain scale. Construct validity was assessed through convergent and discriminant validity analysis. Intrarater and interrater reliability were assessed via intraclass correlation coefficient calculations. User experience was evaluated through a questionnaire. Scale properties were validated using a confusion matrix, and cutoff values were calculated to achieve the optimal balance between sensitivity and specificity. RESULTS: Through a formalized consensus process, optimal Catalan/Spanish terminology was determined for the perceptual attributes of voice present in the CAPE-V. An adapted protocol of tasks was obtained that preserves the objectives of the original instrument and the relevance of the phonetic criteria in the target languages. The results demonstrated concurrent validity, construct validity, and intrarater reliability. Interrater reliability was found to depend on the extent to which evaluators shared their internal standards. The raters identified CAPE-V as an effective and preferred instrument. CONCLUSION: An adapted, validated version of the CAPE-V is made available to clinical professionals for the evaluation of voice in Catalan and Spanish speakers.


Assuntos
Disfonia , Humanos , Comparação Transcultural , Consenso , Reprodutibilidade dos Testes , Qualidade da Voz , Variações Dependentes do Observador
10.
Semin Speech Lang ; 45(2): 137-151, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38417816

RESUMO

Abductor laryngeal dystonia (ABLD) is a rare neurological voice disorder which results in sporadic opening of the vocal folds during speech. Etiology is unknown, and to date there is no identified effective behavioral treatment for it. It is hypothesized that LSVT LOUD®, which was developed to treat dysphonia secondary to Parkinson's disease, may have application to speakers with ABLD to improve outcomes beyond that with botulinum neurotoxin (BoNT) treatment alone. The participant received one injection of BoNT in each vocal fold 2 to 3 months prior to initiating intensive voice therapy via teletherapy. Objective measures of vocal loudness (dB sound pressure level), maximum phonation time, and high/low pitch frequency (Hz) were recorded in all treatment sessions and follow-up sessions. Over the course of treatment, the participant showed steady gains in phonation time, volume, pitch range, and vocal quality with a substantial reduction in aphonic voice breaks by the end of the treatment program. Perceptual symptoms of ABLD were nearly undetectable by the participant and the clinicians up to 12 months posttreatment, with no additional BoNT injections. The results suggest that LSVT LOUD® following BoNT was effective, with long-lasting improvement in vocal function, for this speaker with ABLD.


Assuntos
Toxinas Botulínicas , Disfonia , Distonia , Humanos , Disfonia/tratamento farmacológico , Disfonia/etiologia , Distonia/tratamento farmacológico , Distonia/etiologia , Qualidade da Voz , Fonação , Resultado do Tratamento
11.
J Speech Lang Hear Res ; 67(3): 740-752, 2024 Mar 11.
Artigo em Inglês | MEDLINE | ID: mdl-38315579

RESUMO

PURPOSE: This study set out to investigate whether individuals with dysphonia, as determined by either self-assessment or clinician-based auditory-perceptual judgment, exhibited differences in perilaryngeal muscle activities using surface electromyography (sEMG) during various phonatory tasks. Additionally, the study aimed to assess the effectiveness of sEMG in identifying dysphonic cases. METHOD: A total of 77 adults (44 women, 33 men, Mage = 30.4 years) participated in this study, with dysphonic cases identified separately using either a 10-item Voice Handicap Index (VHI-10) or clinician-based auditory-perceptual voice quality (APVQ) evaluation. sEMG activities were measured from the areas of suprahyoid and sternocleidomastoid muscles during prolonged vowel /i/ phonations at different pitch and loudness levels. Normalized root-mean-square value against the maximal voluntary contraction (RMS %MVC) of the sEMG signals was obtained for each phonation and compared between subject groups and across phonatory tasks. Additionally, binary logistic regression analysis was performed to determine how the sEMG measures could predict the VHI-10-based or APVQ-based dysphonic cases. RESULTS: Participants who scored above the criteria on either the VHI-10 (n = 29) or APVQ judgment (n = 17) exhibited significantly higher RMS %MVC in the right suprahyoid muscles compared to the corresponding control groups. Although the RMS %MVC value from the right suprahyoid muscles alone was not a significant predictor of self-evaluated dysphonic cases, a combination of the RMS %MVC values from both the right and left suprahyoid muscles significantly predicted APVQ-based dysphonic cases with a 69.66% fair level. CONCLUSIONS: This study found that individuals with dysphonia, as determined by either self-assessment or APVQ judgment, displayed more imbalanced suprahyoid muscle activities in voice production compared to nondysphonic groups. The combination of the sEMG measures from both left and right suprahyoid muscles showed potential as a predictor of dysphonia with a fair level of confidence. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.25112804.


Assuntos
Disfonia , Adulto , Masculino , Humanos , Feminino , Disfonia/diagnóstico , Músculos do Pescoço , Fonação , Qualidade da Voz , Eletromiografia
12.
Int J Occup Med Environ Health ; 37(1): 84-97, 2024 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-38375631

RESUMO

OBJECTIVES: Emotions and stress affect voice production. There are only a few reports in the literature on how changes in the autonomic nervous system affect voice production. The aim of this study was to examine emotions and measure stress reactions during a voice examination procedure, particularly changes in the muscles surrounding the larynx. MATERIAL AND METHODS: The study material included 50 healthy volunteers (26 voice workers - opera singers, 24 control subjects), all without vocal complaints. All subjects had good voice quality in a perceptual assessment. The research procedure consisted of 4 parts: an ear, nose, and throat (ENT)­phoniatric examination, surface electromyography, recording physiological indicators (heart rate and skin resistance) using a wearable wristband, and a psychological profile based on questionnaires. RESULTS: The results of the study demonstrated that there was a relationship between positive and negative emotions and stress reactions related to the voice examination procedure, as well as to the tone of the vocal tract muscles. There were significant correlations between measures describing the intensity of experienced emotions and vocal tract muscle maximum amplitude of the cricothyroid (CT) and sternocleidomastoid (SCM) muscles during phonation and non-phonation tasks. Subjects experiencing eustress (favorable stress response) had increased amplitude of submandibular and CT at rest and phonation. Subjects with high levels of negative emotions, revealed positive correlations with SCMmax during the glissando. The perception of positive and negative emotions caused different responses not only in the vocal tract but also in the vegetative system. Correlations were found between emotions and physiological parameters, most markedly in heart rate variability. A higher incidence of extreme emotions was observed in the professional group. CONCLUSIONS: The activity of the vocal tract muscles depends on the type and intensity of the emotions and stress reactions. The perception of positive and negative emotions causes different responses in the vegetative system and the vocal tract. Int J Occup Med Environ Health. 2024;37(1):84-97.


Assuntos
Canto , Humanos , Fonação/fisiologia , Qualidade da Voz/fisiologia , Eletromiografia , Eletrofisiologia
13.
Eur Arch Otorhinolaryngol ; 281(5): 2707-2716, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38319369

RESUMO

PURPOSE: This cross-sectional study aimed to investigate the potential of voice analysis as a prescreening tool for type II diabetes mellitus (T2DM) by examining the differences in voice recordings between non-diabetic and T2DM participants. METHODS: 60 participants diagnosed as non-diabetic (n = 30) or T2DM (n = 30) were recruited on the basis of specific inclusion and exclusion criteria in Iran between February 2020 and September 2023. Participants were matched according to their year of birth and then placed into six age categories. Using the WhatsApp application, participants recorded the translated versions of speech elicitation tasks. Seven acoustic features [fundamental frequency, jitter, shimmer, harmonic-to-noise ratio (HNR), cepstral peak prominence (CPP), voice onset time (VOT), and formant (F1-F2)] were extracted from each recording and analyzed using Praat software. Data was analyzed with Kolmogorov-Smirnov, two-way ANOVA, post hoc Tukey, binary logistic regression, and student t tests. RESULTS: The comparison between groups showed significant differences in fundamental frequency, jitter, shimmer, CPP, and HNR (p < 0.05), while there were no significant differences in formant and VOT (p > 0.05). Binary logistic regression showed that shimmer was the most significant predictor of the disease group. There was also a significant difference between diabetes status and age, in the case of CPP. CONCLUSIONS: Participants with type II diabetes exhibited significant vocal variations compared to non-diabetic controls.


Assuntos
Diabetes Mellitus Tipo 2 , Voz , Humanos , Qualidade da Voz , Acústica da Fala , Diabetes Mellitus Tipo 2/complicações , Estudos Transversais , Medida da Produção da Fala , Acústica
14.
J Acoust Soc Am ; 155(2): 1264-1271, 2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38345424

RESUMO

The problem of characterizing voice quality has long caused debate and frustration. The richness of the available descriptive vocabulary is overwhelming, but the density and complexity of the information voices convey lead some to conclude that language can never adequately specify what we hear. Others argue that terminology lacks an empirical basis, so that language-based scales are inadequate a priori. Efforts to provide meaningful instrumental characterizations have also had limited success. Such measures may capture sound patterns but cannot at present explain what characteristics, intentions, or identity listeners attribute to the speaker based on those patterns. However, some terms continually reappear across studies. These terms align with acoustic dimensions accounting for variance across speakers and languages and correlate with size and arousal across species. This suggests that labels for quality rest on a bedrock of biology: We have evolved to perceive voices in terms of size/arousal, and these factors structure both voice acoustics and descriptive language. Such linkages could help integrate studies of signals and their meaning, producing a truly interdisciplinary approach to the study of voice.


Assuntos
Percepção da Fala , Acústica da Fala , Qualidade da Voz , Som , Audição
15.
J Acoust Soc Am ; 155(2): 1422-1436, 2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38364044

RESUMO

Auditory attribution of speaker gender has historically been assumed to operate within a binary framework. The prevalence of gender diversity and its associated sociophonetic variability motivates an examination of how listeners perceptually represent these diverse voices. Utterances from 30 transgender (1 agender individual, 15 non-binary individuals, 7 transgender men, and 7 transgender women) and 30 cisgender (15 men and 15 women) speakers were used in an auditory free classification paradigm, in which cisgender listeners classified the speakers on perceived general similarity and gender identity. Multidimensional scaling of listeners' classifications revealed two-dimensional solutions as the best fit for general similarity classifications. The first dimension was interpreted as masculinity/femininity, where listeners organized speakers from high to low fundamental frequency and first formant frequency. The second was interpreted as gender prototypicality, where listeners separated speakers with fundamental frequency and first formant frequency at upper and lower extreme values from more intermediate values. Listeners' classifications for gender identity collapsed into a one-dimensional space interpreted as masculinity/femininity. Results suggest that listeners engage in fine-grained analysis of speaker gender that cannot be adequately captured by a gender dichotomy. Further, varying terminology used in instructions may bias listeners' gender judgements.


Assuntos
Minorias Sexuais e de Gênero , Percepção da Fala , Humanos , Masculino , Feminino , Qualidade da Voz , Acústica da Fala , Masculinidade
16.
Acta Otorhinolaryngol Ital ; 44(1): 27-35, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38420719

RESUMO

Objective: The aim of this study was to compare the efficacy of voice therapy combined with standard anti-reflux therapy in reducing symptoms and signs of laryngopharyngeal reflux (LPR). Methods: A randomised clinical trial was conducted. Fifty-two patients with LPR diagnosed by 24 h multichannel intraluminal impedance-pH monitoring were randomly allocated in two groups: medical treatment (MT) and medical plus voice therapy (VT). Clinical symptoms and laryngeal signs were assessed at baseline and after 3 months of treatment with the Reflux Symptom Index (RSI), Reflux Finding Score (RFS), Voice Handicap Index (VHI) and GRBAS scales. Results: Groups had similar scores at baseline. At 3-month follow-up, a significant decrease in RSI and RFS total scores were found in both groups although it appeared to be more robust in the VT group. G and R scores of the GRBAS scale significantly improved after treatment in both groups, with better results in the VT group. The VHI total score at 3 months improved more in the VT group (VHI delta 9.54) than in the MT group (VHI delta 5.38) (p < 0.001). Conclusions: The addition of voice therapy to medications and diet appears to be more effective in improving treatment outcomes in subjects with LPR. Voice therapy warrants consideration in addition to medication and diet when treating patients with LPR.


Assuntos
Refluxo Laringofaríngeo , Voz , Humanos , Refluxo Laringofaríngeo/diagnóstico , Refluxo Laringofaríngeo/tratamento farmacológico , Projetos Piloto , Inibidores da Bomba de Prótons/uso terapêutico , Qualidade da Voz
17.
PLoS Comput Biol ; 20(2): e1011849, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38315733

RESUMO

Sleep deprivation has an ever-increasing impact on individuals and societies. Yet, to date, there is no quick and objective test for sleep deprivation. Here, we used automated acoustic analyses of the voice to detect sleep deprivation. Building on current machine-learning approaches, we focused on interpretability by introducing two novel ideas: the use of a fully generic auditory representation as input feature space, combined with an interpretation technique based on reverse correlation. The auditory representation consisted of a spectro-temporal modulation analysis derived from neurophysiology. The interpretation method aimed to reveal the regions of the auditory representation that supported the classifiers' decisions. Results showed that generic auditory features could be used to detect sleep deprivation successfully, with an accuracy comparable to state-of-the-art speech features. Furthermore, the interpretation revealed two distinct effects of sleep deprivation on the voice: changes in slow temporal modulations related to prosody and changes in spectral features related to voice quality. Importantly, the relative balance of the two effects varied widely across individuals, even though the amount of sleep deprivation was controlled, thus confirming the need to characterize sleep deprivation at the individual level. Moreover, while the prosody factor correlated with subjective sleepiness reports, the voice quality factor did not, consistent with the presence of both explicit and implicit consequences of sleep deprivation. Overall, the findings show that individual effects of sleep deprivation may be observed in vocal biomarkers. Future investigations correlating such markers with objective physiological measures of sleep deprivation could enable "sleep stethoscopes" for the cost-effective diagnosis of the individual effects of sleep deprivation.


Assuntos
Privação do Sono , Voz , Humanos , Sono , Qualidade da Voz , Vigília
18.
Eur Arch Otorhinolaryngol ; 281(5): 2499-2505, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38365991

RESUMO

PURPOSE: Arytenoid adduction as an addition to medialisation thyroplasty is highly advocated by some surgeons in selected cases but deemed less necessary by others in patients with unilateral vocal fold paralysis. This study aims to evaluate the additional benefits on voice outcome of arytenoid adduction in patients with unilateral vocal fold paralysis undergoing medialisation thyroplasty using intra-operative voice measurements. DESIGN/METHODS: A prospective study was conducted. Voice audio recordings were obtained at 4 moments; 1. direct prior to the start of surgery, 2. during surgery after medialisation thyroplasty, 3. during surgery after medialisation and arytenoid adduction, 3 months postoperative. At these same timepoints patients rated their own voice on a numeric rating scale between 0 and 10. The blinded recordings were rated by consensus in a team of experienced listeners, using the Grade of the GRBAS scale. Furthermore, the Voice Handicap Index was administered before and at 3 months after surgery. RESULTS: Ten patients who underwent medialisation and arytenoid adduction at our tertiary referral hospital between 2021 and 2022, were included. One patient was excluded after surgery. The intraoperative measurements showed a Grade score of 1.4 preoperatively, improving to 1.2 after medialisation, 1.2 after medialisation and arytenoid adduction, and further improving to 0.4 at 3 months postoperative, which was a not statistically significant improvement (p = 0.2). The intraoperative subjective numeric rating scale showed a statistically significant improvement from 3.9 preoperatively, to 6.1 after medialisation, 7.1 after medialisation and arytenoid adduction and a 7.6 at 3 months postoperative (p = 0.001). The Voice Handicap Index total score showed a statistically significant improvement from 71 points before surgery to 13 at 3 months after surgery (p = 0.008). CONCLUSIONS: Our study using intraoperative voice measurements indicate that the addition of arytenoid adduction to medialisation thyroplasty is a benefit in selected patients although more studies are needed due to the many limitations inherent to this field of investigation.


Assuntos
Laringoplastia , Paralisia das Pregas Vocais , Voz , Humanos , Estudos Prospectivos , Qualidade da Voz , Paralisia das Pregas Vocais/cirurgia , Cartilagem Aritenoide/cirurgia , Resultado do Tratamento
19.
J Acoust Soc Am ; 155(1): 18-28, 2024 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-38169520

RESUMO

In an earlier study, we analyzed how audio signals obtained from three professional opera singers varied when they sang one octave wide eight-tone scales in ten different emotional colors. The results showed systematic variations in voice source and long-term-average spectrum (LTAS) parameters associated with major emotion "families". For two of the singers, subglottal pressure (PSub) also was recorded, thus allowing analysis of an additional main physiological voice control parameter, glottal resistance (defined as the ratio between PSub and glottal flow), and related to glottal adduction. In the present study, we analyze voice source and LTAS parameters derived from the audio signal and their correlation with Psub and glottal resistance. The measured parameters showed a systematic relationship with the four emotion families observed in our previous study. They also varied systematically with values of the ten emotions along the valence, power, and arousal dimensions; valence showed a significant correlation with the ratio between acoustic voice source energy and subglottal pressure, while Power varied significantly with sound level and two measures related to the spectral dominance of the lowest spectrum partial. the fundamental.


Assuntos
Canto , Voz , Humanos , Qualidade da Voz , Voz/fisiologia , Acústica , Glote/fisiologia
20.
J Acoust Soc Am ; 155(1): 381-395, 2024 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-38240668

RESUMO

Auditory perceptual evaluation is considered the gold standard for assessing voice quality, but its reliability is limited due to inter-rater variability and coarse rating scales. This study investigates a continuous, objective approach to evaluate hoarseness severity combining machine learning (ML) and sustained phonation. For this purpose, 635 acoustic recordings of the sustained vowel /a/ and subjective ratings based on the roughness, breathiness, and hoarseness scale were collected from 595 subjects. A total of 50 temporal, spectral, and cepstral features were extracted from each recording and used to identify suitable ML algorithms. Using variance and correlation analysis followed by backward elimination, a subset of relevant features was selected. Recordings were classified into two levels of hoarseness, H<2 and H≥2, yielding a continuous probability score y∈[0,1]. An accuracy of 0.867 and a correlation of 0.805 between the model's predictions and subjective ratings was obtained using only five acoustic features and logistic regression (LR). Further examination of recordings pre- and post-treatment revealed high qualitative agreement with the change in subjectively determined hoarseness levels. Quantitatively, a moderate correlation of 0.567 was obtained. This quantitative approach to hoarseness severity estimation shows promising results and potential for improving the assessment of voice quality.


Assuntos
Disfonia , Rouquidão , Humanos , Rouquidão/diagnóstico , Reprodutibilidade dos Testes , Qualidade da Voz , Fonação , Acústica , Acústica da Fala , Medida da Produção da Fala
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...