RESUMO
BACKGROUND: Control of pelvic floor muscles (PFM) is emphasized as important to obtain functional breath support in opera singing, but there is not much research that proves PFM function as part of breath support in classical singing. Transperineal ultrasound is a reliable method for quantification of PFM contraction in urogynecology. Our aim was to establish if transperineal ultrasound can be used for observation of movement of the PFM during singing and to quantify pelvic floor contraction. METHODS: Cross sectional study of 10 professional opera singers examined with transperineal ultrasound in the supine position at rest and contraction, and standing at rest and during singing. Levator hiatal area was measured in a 3D rendered volume. Levator hiatal anteroposterior (AP) diameter and bladder neck distance from symphysis were measured in 2D images. RESULTS: The AP diameter was shortened from supine rest to contraction (15 mm), standing (6 mm) and singing (9 mm), all p < 0.01. The bladder neck had a non-significant descent of 3 mm during singing. The mean proportional change in AP diameter from rest to contraction was 24.2% (moderate to strong contraction) and from rest to singing was 15% (weak to moderate contraction). CONCLUSIONS: Transperineal ultrasound can be used to examine the PFM during singing. The classically trained singers had good voluntary PFM contraction and moderate contraction during singing. AP diameter was significantly shortened from supine to upright position, with further shortening during singing, confirming that female opera singers contracted their pelvic floor during singing.
Assuntos
Diafragma da Pelve , Canto , Feminino , Humanos , Diafragma da Pelve/diagnóstico por imagem , Projetos Piloto , Estudos Transversais , UltrassonografiaRESUMO
BACKGROUND: The Acoustic Voice Quality Index (AVQI) is a correlate of dysphonia. It has been found to differentiate between dysphonic and normophonic speakers and to indicate the effects of voice therapy. This study investigates how the AVQI reacts towards creak and strain, which are common in normophonic speakers. METHODS: The material was obtained from an earlier study on 104 Finnish female university students (mean age 24.3 years, SD 6.3 years) with no known pathology of voice or hearing and a perceptually normal voice (G = 0 in GRBAS), who were recorded while reading aloud a standard text and sustaining the vowel [a:]. Perceptual analysis for the amount of creak and strain was carried out by 2 expert listeners. In this study, the AVQI v03.01 was analyzed and correlated with perceptual evaluations. Samples with low and high amounts of creak and strain were compared with t tests. RESULTS: On average, the AVQI was below the threshold value of dysphonia in the Finnish population. The AVQI (ρ = 0.35, p = 0.000) and its subparameters, smoothed cepstral peak prominence (CPPS; ρ = -0.35, p = 0.000) and harmonics-to-noise ratio (HNR; ρ = -0.30, p = 0.002) showed low but significant correlations with creak. Strain had low but significant correlations with spectral Slope (ρ = 0.38, p = 0.000) and Tilt (ρ = -0.40, p = 0.009). The AVQI was lower (better) in samples that were evaluated as having a high amount of strain, but the difference was not significant. Only CPPS differentiated significantly between low and high amounts of creak. CONCLUSION: The AVQI does not seem to differentiate between high and low amounts of creak and strain in normophonic speakers.
Assuntos
Disfonia , Qualidade da Voz , Acústica , Adulto , Disfonia/diagnóstico , Feminino , Finlândia , Humanos , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Acústica da Fala , Medida da Produção da Fala , Adulto JovemRESUMO
Classical singers use nasal consonants as "resonance exercises," and experimental results have shown that singers may use some velopharyngeal opening (VPO), most often in [a:] and more seldom in [i:] or [u:]. In particular, male singers have been found to increase VPO as pitch rises toward register change frequencies (passaggi). Laryngoscopic findings have shown that some VPO stabilizes vocal fold vibration; the effect is related to positive reactance. This study investigates the effects of VPO on vocal tract (VT) reactance over the range of fundamental frequencies (f0) used in singing using a computerized tomography-based finite element model of the VT and nose of a female. According to the results, by raising the lowest VT resonances, the VPO increased the VT reactance in the frequency ranges 207-359 Hz for [i:], 265-411 Hz for [u:], and 500-611 Hz for [a:], depending on the VPO size (full or half VPO). These frequency ranges are close to the first and second passaggio of a female singer. The change may have an especially practical stabilizing effect for [a:], which is otherwise characterized by very large changes in VT reactance and negative reactance at the second passaggio.
Assuntos
Canto , Qualidade da Voz , Feminino , Análise de Elementos Finitos , Humanos , Masculino , Vibração , Prega Vocal/diagnóstico por imagemRESUMO
BACKGROUND: The use of a creaky voice in speech has become more common. Laryngeal findings relate creaky voice with strong adductive tension in the glottis, and thus it may be expected to be related to an increased risk of vocal fatigue. METHODS: The present study investigated the relation of creaky voice use and vocal symptoms in 104 Finnish female university students (mean age 24.3 years, SD 6.3 years). They had no known pathology of voice or hearing. The participants were recorded while reading aloud a text of approximately 40 s in duration. They also filled in a questionnaire consisting of the Voice Handicap Index (VHI, translation in Finnish), and half of them also answered questions about the frequency and severity of symptoms of vocal fatigue. The samples were perceptually analysed for the amount of creakiness and strain by a speech therapist and a voice trainer. RESULTS: The interrater reliability of the listeners was acceptable (Pearson's χ2 = 100.159, p = 0.000 for creak; χ2 = 69.199, p = 0.000 for strain). Neither creakiness nor strain correlated with vocal symptoms. Participants with a low and a high amount of creakiness or strain did not differ from each other in terms of vocal symptoms or VHI scores. Symptoms' total score correlated with VHI total score and total scores of VHI's physical subscale. Creakiness and strain correlated positively with each other (r = 0.40, p = 0.000). CONCLUSION: No significant relations were found between creakiness or strain and vocal symptoms in this sample of university students.
Assuntos
Distúrbios da Voz , Voz , Adulto , Feminino , Humanos , Reprodutibilidade dos Testes , Medida da Produção da Fala , Distúrbios da Voz/epidemiologia , Distúrbios da Voz/etiologia , Qualidade da Voz , Adulto JovemRESUMO
Nasal cavities are known to introduce antiresonances (dips) in the sound spectrum reducing the acoustic power of the voice. In this study, a three-dimensional (3D) finite element (FE) model of the vocal tract (VT) of one female subject was created for vowels [a:] and [i:] without and with a detailed model of nasal cavities based on CT (Computer Tomography) images. The 3D FE models were then used for analyzing the resonances, antiresonances and the acoustic pressure response spectra of the VT. The computed results were compared with the measurements of a VT model for the vowel [a:], obtained from the FE model by 3D printing. The nasality affects mainly the lowest formant frequency and decreases its peak level. The results confirm the main effect of nasalization, i.e., that sound pressure level decreases in the frequency region of the formants F1-F2 and emphasizes the frequency region of the formants F3-F5 around the singer's formant cluster. Additionally, many internal local resonances in the nasal and paranasal cavities were found in the 3D FE model. Their effect on the acoustic output was found to be minimal, but accelerometer measurements on the walls of the 3D-printed model suggested they could contribute to structure vibrations.
Assuntos
Qualidade da Voz , Voz , Simulação por Computador , Feminino , Humanos , Cavidade Nasal/diagnóstico por imagem , Fonética , Acústica da FalaRESUMO
AIMS: This study investigated whether speaking a foreign language affects the subjective notions of vocal fatigue, and whether acoustic measurements reveal a higher vocal loading. METHODS: The speech samples of 20 native Finnish-speaking and 23 native English-speaking subjects were recorded in Finnish and in English. From the speech samples, fundamental frequency, equivalent sound level, total duration of voiced speech, speech rate, alpha ratio and L1-L0 level difference were analyzed. Vocal doses were calculated. RESULTS: According to subjective notions, the voice gets tired more quickly when speaking a foreign language. The mean fundamental frequency increased but the speech rate and total duration of voiced speech decreased significantly when speaking a foreign language. Thus, the vocal doses decreased. CONCLUSIONS: The subjective sensations of increased vocal fatigue may be due to increased mental stress rather than to higher vocal loading. However, a trend that speaking a foreign language may involve more loading was found in L1-L0 level difference and in the doses normalized to time dose. Longer speech samples should be studied. Voice quality-based indicators of vocal loading are worth testing in addition to the measures based on the amount of voicing in speech.
Assuntos
Multilinguismo , Espectrografia do Som , Acústica da Fala , Medida da Produção da Fala , Distúrbios da Voz/diagnóstico , Adulto , Idoso , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Distúrbios da Voz/psicologia , Qualidade da Voz , Adulto JovemRESUMO
This study aimed to find the optimal geometrical configuration of the vocal tract (VT) to increase the total acoustic energy output of human voice in the frequency interval 2-3.5 kHz "singer's formant cluster," (SFC) for vowels [a:] and [i:] considering epilaryngeal changes and the velopharyngeal opening (VPO). The study applied 3D volume models of the vocal and nasal tract based on computer tomography images of a female speaker. The epilaryngeal narrowing (EN) increased the total sound pressure level (SPL) and SPL of the SFC by diminishing the frequency difference between acoustic resonances F3 and F4 for [a:] and between F2 and F3 for [i:]. The effect reached its maximum at the low pharynx/epilarynx cross-sectional area ratio 11.4:1 for [a:] and 25:1 for [i:]. The acoustic results obtained with the model optimization are in good agreement with the results of an internationally recognized operatic alto singer. With the EN and the VPO, the VT input reactance was positive over the entire fo singing range (ca 75-1500 Hz). The VPO increased the strength of the SFC and diminished the SPL of F1 for both vowels, but with EN, the SPL decrease was compensated. The effect of EN is not linear and depends on the vowel. Both the EN and the VPO alone and together can support (singing) voice production.
RESUMO
OBJECTIVES: Increased prevalence of social creak particularly among female speakers has been reported in several studies. The study of social creak has been previously conducted by combining perceptual evaluation of speech with conventional acoustical parameters such as the harmonic-to-noise ratio and cepstral peak prominence. In the current study, machine learning (ML) was used to automatically distinguish speech of low amount of social creak from speech of high amount of social creak. METHODS: The amount of creak in continuous speech samples produced in Finnish by 90 female speakers was first perceptually assessed by two voice specialists. Based on their assessments, the speech samples were divided into two categories (low vs high amount of creak). Using the speech signals and their creak labels, seven different ML models were trained. Three spectral representations were used as feature for each model. RESULTS: The results show that the best performance (accuracy of 71.1%) was obtained by the following two systems: an Adaboost classifier using the mel-spectrogram feature and a decision tree classifier using the mel-frequency cepstral coefficient feature. CONCLUSIONS: The study of social creak is becoming increasingly popular in sociolinguistic and vocological research. The conventional human perceptual assessment of the amount of creak is laborious and therefore ML technology could be used to assist researchers studying social creak. The classification systems reported in this study could be considered as baselines in future ML-based studies on social creak.
RESUMO
OBJECTIVES: The purpose of this pilot study was to examine voice quality changes in individuals with early-stage Parkinson's disease (PD) utilizing the Acoustic Voice Quality Index (AVQI) and Acoustic Breathiness Index (ABI) over approximately a 1-year period. STUDY DESIGN: Follow-up study. METHODS: Baseline and follow-up data were gathered from the PDSTUlong speech corpus. The data for both time points included: speaker background information, sustained vowels, reading samples, and measures of PD severity (Hoehn and Yahr scores and Unified Parkinson's Disease Rating Scale III scores [UPDRS-III]). All speakers (N = 12) were native Finnish speakers. AVQIv03.01 and ABI analysis were completed in VOXplot v2.0.1. Changes in AVQI and ABI scores between baseline and follow-up were examined via causal analysis. Further, AVQI and ABI were analyzed in relation to measures of PD severity. RESULTS: Baseline mean AVQI score was 1.79 (range 0.14-4.83, SD=1.60), whereas follow-up mean AVQI score was 2.25 (range 0.55-4.53, SD=1.36). Baseline mean ABI score, in turn, was 2.92 (range 1-27 - 5.31, SD=1.57), whereas follow-up mean ABI score was 3.42 (range 1.40-5.40, SD=1.38). A significant difference was found between baseline and follow-up measures for both AVQI (Z = -2.002, P = 0.045) and ABI (Z = -2.197, P = 0.028). A significant difference in smoothed cepstral peak prominence (Z = -2.118, P = 0.034) and harmonics-to-noise ratio (Z = -1.961, P = 0.050) was also found between the two measurement periods. Change in AVQI and ABI were not correlated with the change in measures of PD severity. CONCLUSION: Over approximately 1-year, a statistical change was observed in AVQI and ABI scores, even in such a small dataset. The specific qualities of breathiness and hoarseness showed the most significant progression. Changes in voice quality were more prominent in ABI analysis.
RESUMO
OBJECTIVES: This study aimed to estimate vocal loading in loud phonation of a vowel and two widely used semiocclusion voice exercises (SOVTEs). Impact stress (IS) was estimated from glottal closing speed, inertial forces from the second derivative of glottal opening and closing. STUDY DESIGN: Experimental study in vivo. METHODS: A vocally healthy male sustained the [o:] vowel with habitual loudness and loudly: (1) without a tube, (2) into a silicone "Lax Vox" type tube (35 cm in length, 10 mm in diameter) outer end submerged 10 cm in water, and (3) into a straw (length 12.6 cm, diameter 2.5 mm) the outer end in air. He tried to use equal effort in all loud samples. High-speed video-laryngo-endoscopy was performed with a rigid scope. Oral air pressure (Poral) was registered in a mouthpiece through which the endoscope was inserted into the larynx and to which the tubes were attached air-tightly. RESULTS: Compared with vowel phonation at habitual loudness, mean of maximal glottal width (max GW) increased by 44.1% for loud tube phonation and decreased by 1.8% for loud straw phonation, and mean absolute value of minimum GW time derivative dmin increased by 57.1% for tube and by 29.5% for straw suggesting faster glottal closing. Compared with loud vowel phonation, max GW increased by 22.6% for loud tube phonation, while it decreased by 16.6% for loud straw phonation. For the tube, dmindecreased by 7.6% and for the straw by 23.8%. Maximal acceleration (ACC) and deceleration (DC) values were larger for the tube and smaller for the straw than the values for both vowel phonations. CONCLUSIONS: IS, deduced from dmin, increased in loud SOVTEs compared to vowel phonation at a conversational loudness, but remained lower in loud SOVTEs than in loud vowel phonation, particularly with a narrow straw, which also reduced inertial forces, as suggested by the reduced ACC and DC.
RESUMO
OBJECTIVES: The purpose of this study was to analyze the acoustic voice quality index (AVQI) in relation to perceptual analysis and disease stage in speakers with Parkinson's disease (PD). STUDY DESIGN: Cross-sectional study. METHODS: The following data were gathered from the Parkinson's Disease Speech corpus of Tampere University (PDSTU): prolonged vowels and reading samples from native Finnish speakers with PD (n = 34), speaker demographic information, and Hoehn and Yahr scale scores. AVQIv03.01 analysis was completed with Praat. Expert raters utilized the GRBASI scale (GRBASI: grade, roughness, breathiness, asthenia, strain, instability) to assess voice quality. AVQI scores were analyzed in relation to GRBASI ratings and disease stage, as measured by the Hoehn and Yahr scale, utilizing correlation analysis. RESULTS: AVQI scores ranged from -0.11 to 4.46 (mean = 1.64, SD = 1.39), GRBASI average total scores ranged from 0-1.83 (mean = 0.66, SD = 0.50), and Hoehn and Yahr scale scores ranged from 1- 4. A statistically significant correlation between AVQI scores and GRBASI ratings was found (rs = 0.508, P = 0.01). Several significant correlations were also found between the parameters of AVQI and scores for G, R, B and A from GRBASI. Hoehn and Yahr scores correlated significantly with GRBASI total score (rs =0.437, P = 0.01) but not with AVQI. CONCLUSION: A significant correlation was found between GRBASI ratings and Hoehn and Yahr scores, but not between AVQI and Hoehn and Yahr scores. DATA AVAILABILITY: Due to the nature of this research, participants of this study did not agree for their data to be shared publicly. Data may be made available on a case-by-case basis. Data will be available after the active phase of the study in 2028.
RESUMO
BACKGROUND: To the best of our knowledge, studies on the relationship between spectral energy distribution and the degree of perceived twang-like voices are still sparse. Through an auditory-perceptual test we aimed to explore the spectral features that may relate with the auditory-perception of twang-like voices. METHODS: Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived on seventy-six audio samples. The stimuli consisted of twenty voices recorded from eight CCM singers who sustained the vowel [a:] in different pitches, with and without a twang-like voice. Also, forty filtered and sixteen synthesized-manipulated stimuli were included. RESULTS AND CONCLUSIONS: Based on the intra-rater reliability scores, four judges were identified as suitable to be included in the analyses. Results showed that the frequency of F1 and F2 correlated strongly with the auditory-perception of twang-like voices (0.90 and 0.74, respectively), whereas F3 showed a moderate negative correlation (-0.52). The frequency difference between F1 and F3 showed a strong negative correlation (-0.82). The mean energy between 1-2 kHz and 2-3 kHz correlated moderately (0.51 and 0.42, respectively). The frequency of F4 and F5, and the energy above 3 kHz showed weak correlations. Since the spectral changes under 2 kHz have been associated with the jaw, lips, and tongue adjustments (i.e. vowel articulation) and a higher vertical laryngeal position might affect the frequency of all formants (including F1 and F2), our results suggest that vowel articulation and the laryngeal height may be relevant when performing twang-like voices.
RESUMO
OBJECTIVES: Sound pressure and exhaled flow have been identified as important factors associated with higher particle emissions. The aim of this study was to assess how different vocalizations affect the particle generation independently from other factors. DESIGN: Experimental study. METHODS: Thirty-three experienced singers repeated two different sentences in normal loudness and whispering. The first sentence consisted mainly of consonants like /k/ and /t/ as well as open vowels, while the second sentence also included the /s/ sound and contained primarily closed vowels. The particle emission was measured using condensation particle counter (CPC, 3775 TSI Inc.) and aerodynamic particle sizer (APS, 3321 TSI Inc.). The CPC measured particle number concentration for particles larger than 4 nm and mainly reflects the number of particles smaller than 0.5 µm since these particles dominate total number concentration. The APS measured particle size distribution and number concentration in the size range of 0.5-10 µm and data were divided into >1 µm and <1 µm particle size ranges. Generalized linear mixed-effects models were constructed to assess the factors affecting particle generation. RESULTS: Whispering produced more particles than speaking and sentence 1 produced more particles than sentence 2 while speaking. Sound pressure level had effect on particle production independently from vocalization. The effect of exhaled airflow was not statistically significant. CONCLUSIONS: Based on our results the type of vocalization has a significant effect on particle production independently from other factors such as sound pressure level.
RESUMO
All-pole modeling is a widely used formant estimation method, but its performance is known to deteriorate for high-pitched voices. In order to address this problem, several all-pole modeling methods robust to fundamental frequency have been proposed. This study compares five such previously known methods and introduces a technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes temporally weighted linear prediction (LP) in which the square of the prediction error is multiplied by a given parametric weighting function. The weighting downgrades the contribution of the main excitation of the vocal tract in optimizing the filter coefficients. Consequently, the resulting all-pole model is affected more by the characteristics of the vocal tract leading to less biased formant estimates. By using synthetic vowels created with a physical modeling approach, the results showed that WLP-AME yields improved formant frequencies for high-pitched sounds in comparison to the previously known methods (e.g., relative error in the first formant of the vowel [a] decreased from 11% to 3% when conventional LP was replaced with WLP-AME). Experiments conducted on natural vowels indicate that the formants detected by WLP-AME changed in a more regular manner between repetitions of different pitch than those computed by conventional LP.
Assuntos
Glote/fisiologia , Modelos Lineares , Fonação , Fonética , Percepção da Altura Sonora , Acústica da Fala , Qualidade da Voz , Adulto , Algoritmos , Fenômenos Biomecânicos , Pré-Escolar , Simulação por Computador , Feminino , Glote/anatomia & histologia , Humanos , Masculino , Análise Numérica Assistida por Computador , Reconhecimento Automatizado de Padrão , Pressão , Processamento de Sinais Assistido por Computador , Espectrografia do Som , Medida da Produção da Fala , Fatores de Tempo , Prega Vocal/fisiologiaRESUMO
OBJECTIVE: This study investigated whether different electroglottographic (EGG) parameters may be used to estimate voice quality and loading related to it, and whether voice quality is related to self-estimation of voice problems and laryngeal status. The effect of sample type was also considered. METHODS: EGG and acoustic signals of a sustained vowel [a:] (90 dB6 cm) and text reading during noise exposure were recorded from 93 female kindergarten teachers. Analyses were made from the sustained vowel and vowel [a:] from a stressed word. Contact quotient (CQ) was calculated. Maximum velocity of increase in contact area inferred from derivative (MDEGG) was examined. Fundamental frequency was calculated from the EGG signal. From the acoustic signal sound pressure level (SPL) was computed and pressedness in voice quality was evaluated perceptually. Self-evaluation was made with Voice Activity and Participation Profile (VAPP). Indirect laryngoscopy was also performed. RESULTS: Pressedness in voice correlated with EGG results only for the sustained vowel, and better with MDEGG than CQ values. VAPP and laryngeal evaluation did not correlate with the acoustic or EGG parameters or with perception. CONCLUSIONS: MDEGG is worth testing as an indicator of impact stress. Sustained vowel at a controlled SPL is more suitable for voice quality evaluation than a long stressed vowel from connected speech.
Assuntos
Eletrodiagnóstico/métodos , Glote/fisiologia , Fonação/fisiologia , Acústica da Fala , Percepção da Fala , Prega Vocal/fisiologia , Adulto , Eletrodiagnóstico/instrumentação , Feminino , Finlândia , Humanos , Laringoscopia , Pessoa de Meia-Idade , Movimento (Física) , Doenças Profissionais/etiologia , Doenças Profissionais/psicologia , Fonética , Pressão , Valores de Referência , Autoimagem , Autorrelato , Inquéritos e Questionários , Ensino , Distúrbios da Voz/etiologia , Distúrbios da Voz/psicologiaRESUMO
OBJECTIVES: This study investigated whether individual coaching can improve role portrayal and emotional expression in opera secco recitatives. STUDY DESIGN: An experimental comparative study. METHODS: Eighteen students (mean age 24 years) from Norway and Sweden majoring in opera singing volunteered as test persons. As singing tasks each test person studied a short opera secco recitative in their Voice Fach during 14 days. All recitatives were picked up from renowned Italian operas from the Classical period. The texts were translated in the singers' mother tongue but performed in Italian, a foreign language to all test persons Thirteen test persons were enrolled in a Test group, and they were coached individually for 1h in interpretation of their singing tasks. During the intervention the test persons combined speaking the translated text in their native dialects with singing the recitative in Italian. The test persons were asked to distinguish various emotions and motivation in the singing task, and rehearsed how to express the singing task equally distinctly in Italian as in their native dialects. The singing task was recorded a capella immediately before and after the intervention. The test persons in the Control group recorded their singing tasks pre and post 1 h self-directed rehearsing on their singing tasks, without any coaching.. The recordings of all test persons were presented in randomized order for ten expert listeners to rate the expression of emotions and role portrayal. The samples were also analysed acoustically. RESULTS: Post coaching samples were rated significantly better in expression of various emotions and motivation compared to pre coaching samples. The Control group singers showed no significant changes in perceptual variables. Pausing increased in both groups after rehearsing. CONCLUSIONS: A short individual coaching in performance of an opera recitative seemed to stimulate young opera singers` emotional and motivational expression.
RESUMO
Breathiness (perception of turbulence noise in the voice) is one of the major components of hoarseness in dysphonic voices. This study aims to validate a multiparameter analysis tool, the Acoustic Breathiness Index (ABI), for quantification of breathiness in the speaking voice, including both sustained vowels and continuous speech. One hundred and eight speakers with dysphonia (28 M, 80 F, mean age 50, SD 15.4 years) and 87 non-dysphonic controls (18 M, 69 F, mean age 42, SD 14 years) volunteered as participants. They read a standard text and sustained vowel /a:/. Acoustic recordings were made using a head-mounted microphone. Acoustic samples were evaluated perceptually by nine voice experts of different backgrounds (speech therapists, vocologists and laryngologists). Breathiness (B) from the GRBAS scale was rated. Headphones were used in the perceptual analysis. The dysphonic and non-dysphonic speakers differed significantly from each other in the auditory perceptual evaluation of breathiness. A significant difference was also found for ABI, which had a mean value of 2.26 (SD 1.15) for non-dysphonic and 3.07 (SD 1.75) for dysphonic speakers. ABI correlated strongly with B (rs = 0.823, p = 0.01). ABI's power to distinguish the groups was high (88.6%). The highest sensitivity and specificity of ABI (80%) was obtained at threshold value 2.68. ABI is a valid tool for differentiating breathiness in non-dysphonic and dysphonic speakers of Finnish.
RESUMO
OBJECTIVES: This study examines the effects of including acoustic research-based elements of the vocal expression of emotions in the singing lessons of acting students during a seven-week teaching period. This information may be useful in improving the training of interpretation in singing. STUDY DESIGN: Experimental comparative study. METHODS: Six acting students participated in seven weeks of extra training concerning voice quality in the expression of emotions in singing. Song samples were recorded before and after the training. A control group of six acting students were recorded twice within a seven-week period, during which they participated in ordinary training. All participants sang on the vowel [a:] and on a longer phrase expressing anger, sadness, joy, tenderness, and neutral states. The vowel and phrase samples were evaluated by 34 listeners for the perceived emotion. Additionally, the vowel samples were analyzed for formant frequencies (F1-F4), sound pressure level (SPL), spectral structure (Alpha ratio = SPL 1500-5000 Hz - SPL 50-1500 Hz), harmonic-to-noise ratio (HNR), and perturbation (jitter, shimmer). RESULTS: The number of correctly perceived expressions improved in the test group's vowel samples, while no significant change was observed in the control group. The overall recognition was higher for the phrases than for the vowel samples. Of the acoustic parameters, F1 and SPL significantly differentiated emotions in both groups, and HNR specifically differentiated emotions in the test group. The Alpha ratio was found to statistically significantly differentiate emotion expression after training. CONCLUSIONS: The expression of emotion in the singing voice improved after seven weeks of voice quality training. The F1, SPL, Alpha ratio, and HNR differentiated emotional expression. The variation in acoustic parameters became wider after training. Similar changes were not observed after seven weeks of ordinary voice training.
Assuntos
Canto , Voz , Humanos , Acústica , Estudantes , EmoçõesRESUMO
BACKGROUND AND OBJECTIVE: Vocal fatigue is an important complaint that may indicate a voice disorder or a risk thereof. There is a need for a reliable tool to detect and quantify vocal fatigue and distinguish dysphonic and vocally healthy speakers. The Vocal Fatigue Index (VFI) questionnaire has been found valid and reliable among speakers of different languages. This study aims to validate it for speakers of Finnish. STUDY DESIGN: Experimental comparative study. METHODS: The VFI questionnaire was translated from English to Finnish according to the WHO recommendations. Next, it was subjected to the validation procedure. In total, 160 Finnish speakers volunteered to participate in the study. Hundred-and-eight were voice patients (83 males, 25 females) and 52 were vocally healthy controls (37 females, 15 males). As a comparison, the Voice Handicap Index (VHI) questionnaire was completed and voice samples were recorded to enable Acoustic Voice Quality Index (AVQI03.01FIN) analysis. RESULTS: Results from the first and second completions of the VFI(F) questionnaire correlated strongly (Spearman's rho 0.901, P = 0.01). Answers to the individual questions the VFI(F) also correlated strongly, showing high internal consistency. Factor 1 (Tiredness of voice and avoidance of voice use) of the VFI correlated strongly with the VHI, and the two other factors (Physical discomfort associated with voicing and Improvement of symptoms) correlated moderately with the VHI. Factor one of the VFI(F) correlated moderately with AVQI03.01FIN and its sub-parameters, CPPS, HNR, and shimmer. The VFI(F) showed good construct validity, differentiating voice patients and controls at cut-off 13.5, with sensitivity of 0.963 and specificity of 0.885. Discriminatory power was strong for all factors: F1 AROC = 0.985, F2 AROC = 0.864, and F3 AROC = 0.821. CONCLUSION: The VFI(F) correlates with the VHI and with AVQI01.01FIN and it is a valid and reliable tool for detecting vocal fatigue in Finnish speakers.
RESUMO
PURPOSE: Coughing is related to voice problems, since it involves firm glottal closure, fast glottal opening, and high subglottic pressure and flow rate. In this study, the glottal area variation and movements of laryngeal structures during coughing and phonation are compared. METHODS: High-speed laryngoscopy recordings were made of a normophonic male participant with a healthy larynx producing a neutral vowel and coughing. Oral air pressure was registered in a mouthpiece, through which an endoscope was inserted into the pharynx. Electroglottography, acoustic, and pressure signals were recorded simultaneously. The glottal width variation at the membranous and cartilaginous parts of the glottis was derived from the high-speed images, and the strong vibration of the false vocal folds was also registered. RESULTS: In coughing, compared to ordinary vowel phonation in nearly the same sound pressure level (93-94 dB6cm), the glottal width was 25% larger at the middle of the vocal folds, the maximum glottal opening velocity was 39% higher, and the maximum glottal width declination rate during glottal closing was up to three times higher. The maximum acceleration was 40% higher, and the maximum deceleration was 47% higher. Fundamental frequency f0 was the highest (ca. 400 Hz) at the beginning of the first phase of a typical coughing process. During the last part of the coughing process, f0 decreased from ca. 250 Hz to ca. 85 Hz at the phonation offset. CONCLUSIONS: The remarkable increase in maximum glottal width declination rate implies much higher vocal fold loading in coughing compared to phonation.