RESUMO
Importance: Insomnia symptoms affect an estimated 30% to 50% of the 4 million US breast cancer survivors. Previous studies have shown the effectiveness of cognitive behavioral therapy for insomnia (CBT-I), but high insomnia prevalence suggests continued opportunities for delivery via new modalities. Objective: To determine the efficacy of a CBT-I-informed, voice-activated, internet-delivered program for improving insomnia symptoms among breast cancer survivors. Design, Setting, and Participants: In this randomized clinical trial, breast cancer survivors with insomnia (Insomnia Severity Index [ISI] score >7) were recruited from advocacy and survivorship groups and an oncology clinic. Eligible patients were females aged 18 years or older who had completed curative treatment more than 3 months before enrollment and had not undergone other behavioral sleep treatments in the prior year. Individuals were assessed for eligibility and randomized between March 2022 and October 2023, with data collection completed by December 2023. Intervention: Participants were randomized 1:1 to a smart speaker with a voice-interactive CBT-I program or educational control for 6 weeks. Main Outcomes and Measures: Linear mixed models and Cohen d estimates were used to evaluate the primary outcome of changes in ISI scores and secondary outcomes of sleep quality, wake after sleep onset, sleep onset latency, total sleep time, and sleep efficiency. Results: Of 76 women enrolled (38 each in the intervention and control groups), 70 (92.1%) completed the study. Mean (SD) age was 61.2 (9.3) years; 49 (64.5%) were married or partnered, and participants were a mean (SD) of 9.6 (6.8) years from diagnosis. From baseline to follow-up, ISI scores changed by a mean (SD) of -8.4 (4.7) points in the intervention group compared with -2.6 (3.5) in the control group (P < .001) (Cohen d, 1.41; 95% CI, 0.87-1.94). Sleep diary data showed statistically significant improvements in the intervention group compared with the control group for sleep quality (0.56; 95% CI, 0.39-0.74), wake after sleep onset (9.54 minutes; 95% CI, 1.93-17.10 minutes), sleep onset latency (8.32 minutes; 95% CI, 1.91-14.70 minutes), and sleep efficiency (-0.04%; 95% CI, -0.07% to -0.01%) but not for total sleep time (0.01 hours; 95% CI, -0.27 to 0.29 hours). Conclusions and Relevance: This randomized clinical trial of an in-home, voice-activated CBT-I program among breast cancer survivors found that the intervention improved insomnia symptoms. Future studies may explore how this program can be taken to scale and integrated into ambulatory care. Trial Registration: ClinicalTrials.gov Identifier: NCT05233800.
Assuntos
Neoplasias da Mama , Terapia Cognitivo-Comportamental , Distúrbios do Início e da Manutenção do Sono , Humanos , Feminino , Distúrbios do Início e da Manutenção do Sono/terapia , Terapia Cognitivo-Comportamental/métodos , Pessoa de Meia-Idade , Neoplasias da Mama/complicações , Idoso , Sobreviventes de Câncer/psicologia , Resultado do Tratamento , Adulto , VozRESUMO
Introdução: a relação entre voz e trabalho é objeto de estudo constante. Ainda não há investigação sobre a relação de monotonia e autonomia com queixas vocais. Objetivo: investigar a relação entre a monotonia e a autonomia no ambiente de trabalho com o surgimento de queixas vocais entre professores. Método: estudo exploratório, qualitativo e descritivo, realizado a partir de grupo focal considerando o ineditismo da temática do estudo. Dez professores triados em estudo anterior com suspeita de distúrbio de voz pelo Índice de Triagem de Distúrbio de Voz, que indicaram percepção de monotonia e falta de autonomia no ambiente de trabalho por meio do instrumento Condições de Produção Vocal de Professores foram convidados a participar. Sete professores aceitaram e foram conduzidos dois grupos focais. Perguntas disparadoras sobre monotonia e autonomia no ambiente de trabalho foram feitas. Após análise de conteúdo, foram criadas quatro categorias principais e subcategorias de análise. Resultados: os participantes debateram questões relacionadas à quebra de expectativas sobre o trabalho, frustrações, rotina e desafios diários. Considerações sobre a voz estavam relacionadas ao uso repetitivo e por longos períodos e ambiente com acústica desfavorável. Queixas como rouquidão e baixa projeção vocal foram citadas. Conclusão: monotonia no ambiente de trabalho foi percebida como algo repetitivo e as relações com o surgimento de queixas vocais podem estar relacionadas a situações de uso da voz de forma intensa e constante. A falta de autonomia parece ocasionar a monotonia e, consequentemente, desmotivação, frustração com a carreira e adoecimento, dentre eles, o distúrbio de voz. (AU)
Introduction: the relationship between voice and work is the subject of constant study. There is still no investigation into the relationship between monotony and autonomy and vocal complaints. Objective:to investigate the relationship between monotony and autonomy in the workplace with the emergence of vocal complaints among teachers. Method: exploratory, qualitative and descriptive study, carried out through a focus group considering the novelty of the study theme. Ten teachers screened in a previous study with suspected voice disorders using the Voice Disorder Screening Index, who indicated a perception of monotony and lack of autonomy in the work environment using the Teacher Vocal Production Conditions instrument were invited to participate. Seven teachers accepted and two focus groups were conducted. Triggering questions about monotony and autonomy in the workplace were asked. After content analysis, four main categories and subcategories of analysis were created. Results: participants discussed issues related to broken expectations about work, frustrations, routine and daily challenges. Considerations about the voice were related to repetitive use for long periods and an environment with unfavorable acoustics. Complaints such as hoarseness and low vocal projection were cited. Conclusion: monotony in the work environment was perceived as something repetitive and the relationship with the emergence of vocal complaints may be related to situations of intense and constant use of the voice. The lack of autonomy seems to cause monotony and, consequently, demotivation, frustration with one's career and illness, including voice disorders. (AU)
Introducción: la relación entre voz y trabajo es objeto de constante estudio. Todavía no se ha investigado la relación entre monotonía, autonomía y quejas vocales. Objetivo: investigar la relación entre monotonía y autonomía en el lugar de trabajo con la aparición de quejas vocales entre docentes. Método: estudio exploratorio, cualitativo y descriptivo, realizado a través de un grupo focal considerando la novedad del tema de estudio. Se invitó a participar a diez docentes evaluados en un estudio previo con sospecha de trastornos de la voz mediante el Voice Disorder Screening Index, que indicaron una percepción de monotonía y falta de autonomía en el ambiente de trabajo utilizando el instrumento Teacher Vocal Production Conditions. Siete profesores aceptaron y se realizaron dos grupos focales. Se formularon preguntas desencadenantes sobre la monotonía y la autonomía en el lugar de trabajo. Luego del análisis de contenido, se crearon cuatro categorías y subcategorías principales de análisis. Resultados:los participantes discutieron cuestiones relacionadas con expectativas rotas sobre el trabajo, frustraciones, rutina y desafíos diarios. Las consideraciones sobre la voz estuvieron relacionadas con el uso repetitivo por períodos prolongados y un ambiente con acústica desfavorable. Se citaron quejas como ronquera y baja proyección vocal. Conclusión: la monotonía en el ambiente laboral fue percibida como algo repetitivo y la relación con la aparición de quejas vocales puede estar relacionada con situaciones de uso intenso y constante de la voz. La falta de autonomía parece provocar monotonía y, en consecuencia, desmotivación, frustración con la propia carrera y enfermedades, incluidos trastornos de la voz. (AU)
Assuntos
Humanos , Masculino , Feminino , Adulto , Pessoa de Meia-Idade , Distúrbios da Voz/etiologia , Professores Escolares , Condições de Trabalho , Voz , Saúde Mental , Saúde Ocupacional , Grupos Focais , Pesquisa QualitativaRESUMO
Introdução: considera-se importante que fonoaudiólogos apresentem suas vozes como modelo ao realizar uma intervenção fonoaudiológica. Objetivo: conhecer a autoavaliação da voz e sintomas vocais de um grupo de acadêmicos de fonoaudiologia relacionando os achados ao diagrama de desvio fonatório. Método: estudo do tipo analítico, observacional, com 88 estudantes de Fonoaudiologia de uma mesma faculdade, 82 mulheres e seis homens, média de idade de 21,9 anos, sem diagnóstico de disfonia, autorreferidos saudáveis. Foram registrados e comparados dados relativos à autoavaliação da voz e de sintomas vocais, utilizando-se a Escala de Sintomas Vocais. Numa segunda etapa os estudantes foram convidados a realizar uma análise acústica de suas vozes e os que aceitaram (63,6%) procederam com a coleta das amostras de voz, programa VoxMetria® CTS. Para tratamento dos dados foram utilizados Teste T student e Matriz de Correlações construída com os resultados do Teste T- student (nível de confiança de 95%, alpha 5%). Resultados: a Escala de Sintomas Vocais revelou 44,31% dos participantes com escores brutos igual ou superior a 16 pontos, indicando risco vocal, com maior comprometimento do domínio físico. Alunos do último ano obtiveram escores mais elevados, com predomínio de secreção e pigarro na garganta. Houve correlação positiva entre fumar (7,95%) e aumento da nota final. A análise acústica revelou 40% das vozes com diagrama de desvio fonatório fora do quadrante de vozes normais, irregularidade da voz, jitter e shimmer alterados. Conclusão: a combinação dos dois instrumentos utilizados para conhecimento de risco de disfonia em estudantes de Fonoaudiologia mostra-se relevante e reforça a importância de programas de prevenção de saúde vocal também em futuros fonoaudiólogos. (AU)
Introduction: speech therapists must present their voices as a model for a speech therapy intervention. Objective: to understand the voice self-assessment and vocal symptoms of a group of speech therapy students, relating the findings to the phonatory deviation diagram. Method: an analytical observational study was conducted with 88 speech therapy students from the same college, consisting of 82 women and 6 men, averaging 21.9 years old, who reported no diagnosis of dysphonia, and self-reported as healthy. Data relating to voice self-assessment and vocal symptoms were recorded and compared, using the Vocal Symptoms Scale (VoiSS). In the second stage, students were invited to perform an acoustic analysis of their voices and those who accepted (63.6%) proceeded with the collection of voice samples, using the VoxMetria® CTS program. To process the data, the T-student Test and Correlation Matrix constructed with the results of the T-student Test (confidence level of 95%, alpha 5%) were used. Results: The Vocal Symptoms Scale (student T-test) revealed 44.31% of participants with raw scores equal to or greater than 16 points, indicating vocal risk and greater impairment of the physical domain. Final year students obtained higher scores, with a predominance of secretion and throat clearing. There was a positive correlation between smoking (7.95%) and an increase in the final grade. The acoustic analysis revealed 40% of the voices with a phonatory deviation diagram outside the quadrant of normal voices, voice irregularity, altered jitter, and shimmer. Conclusion: The combination of the two instruments used to understand the risk of dysphonia in speech therapy students is relevant and reinforces the importance of vocal health prevention programs for future speech therapists. (AU)
Introducción: los fonoaudiologos deben presentar su voz como modelo para realizar una intervención logopédica. Objetivo: comprender la autoevaluación vocal y los síntomas vocales de un grupo de estudiantes de fonoaudiología, relacionando los hallazgos con el diagrama de desviación fonatoria. Método: se realizó un estudio observacional analítico, observacional, con 88 estudiantes de fonoaudiología de la misma facultad, conformados por 82 mujeres y 6 hombres, com edad promedio de 21,9 años, quienes no refirieron diagnóstico de disfonia y se autorefiriron como sanos. Los datos relacionados con la autoevaluación de la voz y los síntomas vocales se registraron y compararon mediante la Escala de Síntomas Vocales. En la segunda etapa, los estudiantes fueron invitados a realizar un análisis acústico de sus voces y los que aceptaron (63,6%) procedieron a la recolección de muestras de voz, utilizando el programa VoxMetria® CTS. Para procesar los datos se utilizó la Prueba T de Student y la Matriz de Correlación, construida con los resultados de la Prueba T de Student (nivel de confianza del 95%, alfa 5%). Resultados: La Escala de Síntomas Vocales (prueba T de Student) reveló puntuaciones brutas iguales o superiores a 16 puntos (44,31%), lo que indica riesgo vocal y mayor afectación del dominio físico. Los estudiantes de último año obtuvieron puntuaciones más altas, con predominio de secreción y carraspeo. Hubo correlación positiva entre fumar (7,95%) y aumento en la nota final. El análisis acústico reveló voces presentando diagrama de desviación fonatoria fuera del cuadrante de normaliadad (40%), irregularidad de la voz, jitter y shimmer alterados. Conclusión: La combinación de los dos instrumentos utilizados para comprender el riesgo de disfonía en estudiantes de fonoaudiologia es relevante y refuerza la importancia de los programas de prevención de la salud vocal para futuros fonoaudiologos. (AU)
Assuntos
Humanos , Masculino , Feminino , Adulto , Estudantes de Ciências da Saúde , Voz , Fonoaudiologia , Autoteste , Qualidade da Voz , Distúrbios da Voz , Estudos Prospectivos , DisfoniaRESUMO
Introdução: o presente estudo visa mapear e avaliar a produção registrada sobre Fonoaudiologia Empresarial, a fim de identificar as temáticas mais pesquisadas, bem como as temáticas pouco exploradas em dissertações e teses na área. Objetivo: analisar a produção científica brasileira defendida entre 2002-2022, considerando nível de produção, ano, rede de ensino, instituição de ensino superior (localização geográfica), tipo de pesquisa, descritor registrado (primeiro), local, temática, total da amostra pesquisada e áreas de conhecimento. Método: revisão realizada na Biblioteca Digital Brasileira de Teses e Dissertações, em 05 de maio de 2023, considerando os termos "Fonoaudiologia" e "Empresa", pesquisados no período 2002-2022, segundo as variáveis anteriormente descritas, analisados de forma descritiva. Resultados:dentre 30 fontes registradas, 24-80,0% são dissertações, sendo 2007 o ano mais produtivo (6-20,0%). A Região Sudeste liderou a pesquisa (20-66,7%), representada pela PUC-SP (10-33,3%) e o destaque foi de pesquisas do tipo observacional (22-73,3%), sendo Empresas os locais mais pesquisados (20-66,7%) e o descritor "saúde do trabalhador" o mais utilizado (03-10,0%). A área de conhecimento (CNPq) que mais pesquisou foi Ciências da Saúde (25-83,3%) por meio da subárea Fonoaudiologia (20-66,7%%), sendo a Audiologia a temática mais pesquisada (16-53,3%). Conclusão: foram encontrados 16,53,3% registros na área de Audiologia e as pesquisas realizadas na área de Voz (7-23,3%) abordam os temas relacionados a qualidade vocal, comunicação e expressividade, no entanto, não abordam liderança. Tal dado sugere esforços em pesquisas científicas e atuação profissional, já que a Fonoaudiologia tem como objeto de estudo e atuação, a comunicação humana.(AU)
Introduction: this study aims to explore the Speech-Therapy's literature and its contribution to identify the most researched and few explored themes in dissertations and theses in the area. Objective:to analyze the Brazilian scientific production submitted between 2002 and 2022, considering production level, publication year, institution of defense, geographical location, research methodology, the first descriptor, research location, the thematic focus, total sample size and knowledge areas. Method: the review analysis was conducted using data obtained from the Brazilian Digital Library of Theses and Dissertations on May 5, 2023, using the terms: "Speech-Therapy" and "Company" to retrieve theses and dissertations from 2002 to 2022 according to the variables described above. Data were analyzed descriptively. Results: among the 30 entries retrieved, 24-80,0% were dissertations, most of which defended in 2007 (6-20,0%). The majority of the studies were from the Southeast region (20- 66.7%), represented by Pontifícia Universidade Católica de São Paulo: PUC-SP (10-33.3%) and the highlight was observational researches (22-73.3%) and the majority of the research was conducted at business companies (20-66,7%). In addition, "worker's health" was the most used descriptor (3-10,0%). The knowledge area (CNPQ) that produced the most studies was Health Sciences (25-83,3%) through the subarea of Speech-Language-Pathology (20-66,7%%), with Audiology being the most researched theme (16-53,3%). Conclusion: Audiology was the area with the highest number of studies found 16,53,3%. Research conducted in the Voice field (7-23,3%) addresses topics related to vocal quality, communication and expressiveness, however, they do not address leadership. The findings suggest a need for future research. Further studies can build upon insights to advance knowledge and promote evidence-based practice in the field of business companies, considering that Speech-Therapy has as its object of study and activity human communication. (AU)
Introducción: este estudio tiene como objetivo mapear y evaluar la producción grabada sobre Fonoaudiología Empresarial, con el fin de identificar los temas más investigados, así como los temas poco explorados en disertaciones y tesis en el área. Objetivo: analizar la producción científica brasileña defendida entre 2002-2022, considerando nivel de producción, año, red educativa, institución de educación superior (ubicación geográfica), tipo de investigación, descriptor registrado (primero), ubicación, tema, muestra total investigada y áreas. del conocimiento. Método: revisión realizada en la Biblioteca Digital Brasileña de Tesis y Disertaciones, el 5 de mayo de 2023, considerando los términos "Fonoaudiología" y "Empresa", investigados en el período 2002-2022, según las variables previamente descritas, analizadas en una manera descriptiva. Resultados: entre 30 fuentes registradas, 24-80,0% son disertaciones, siendo 2007 el año más productivo (6-20,0%). La Región Sudeste lideró la investigación (20-66,7%), representada por la PUC-SP (10-33,3%) y destaque para la investigación observacional (22-73,3%), siendo las Empresas las localidades más investigadas (20-66,7%) y el descriptor "salud del trabajador" el más utilizado (03-10,0%). El área del conocimiento (CNPq) más investigada fue Ciencias de la Salud (25-83,3%) a través de la subárea Fonoaudiología (20-66,7%), siendo la Audiología el tema más investigado (16-53,3%). Conclusión: Se encontraron 16,53,3% registros en el área de Audiología y las investigaciones realizadas en el área de Voz (7-23,3%) abordan temas relacionados con la calidad vocal, la comunicación y la expresividad, sin embargo, no abordan el liderazgo. Estos datos sugieren esfuerzos en la investigación científica y en el desempeño profesional, ya que la Fonoaudiología tiene como objeto de estudio y actividad la comunicación humana. (AU)
Assuntos
Organizações , Dissertações Acadêmicas como Assunto , Fonoaudiologia , Fala , Voz , Brasil , Bibliometria , Comunicação , LiderançaRESUMO
Many research articles have explored the impact of surgical interventions on voice and speech evaluations, but advances are limited by the lack of publicly accessible datasets. To address this, a comprehensive corpus of 107 Spanish Castilian speakers was recorded, including control speakers and patients who underwent upper airway surgeries such as Tonsillectomy, Functional Endoscopic Sinus Surgery, and Septoplasty. The dataset contains 3,800 audio files, averaging 35.51 ± 5.91 recordings per patient. This resource enables systematic investigation of the effects of upper respiratory tract surgery on voice and speech. Previous studies using this corpus have shown no relevant changes in key acoustic parameters for sustained vowel phonation, consistent with initial hypotheses. However, the analysis of speech recordings, particularly nasalised segments, remains open for further research. Additionally, this dataset facilitates the study of the impact of upper airway surgery on speaker recognition and identification methods, and testing of anti-spoofing methodologies for improved robustness.
Assuntos
Fala , Voz , Humanos , Período Pós-Operatório , Tonsilectomia , Masculino , Feminino , Período Pré-Operatório , AdultoRESUMO
Introdução: A voz é um indicador de estados emocionais, influenciada por fatores como o tônus vagal, a respiração e a variabilidade da frequência cardíaca. O estudo explora esses fatores e a relação com a regulação emocional e a prática meditativa como técnica de autorregulação. Objetivo: Investigar a diferença nas características vocais e na variação da frequência cardíaca em meditadores experientes (EM) e novatos (NM) antes e depois de uma prática meditativa e em não praticantes de meditação grupo controle (CG), antes e depois de um teste controle. Métodos: Estudo quase-fatorial 3 x 2. Três grupos foram avaliados (meditadores experientes EM; meditadores novatos NM; e grupo controle CG, não praticantes de meditação) em dois momentos da manipulação experimental antes e depois de uma sessão meditativa para praticantes de meditação, e antes e depois de uma tarefa de busca de palavras para o grupo controle. A frequência fundamental, jitter, shimmer, relação harmônico-ruído e o primeiro (F1), o segundo (F2) e terceiro (F3) formantes da vogal [a]; a variação da frequência cardíaca (SDNN, RMSSD, LF/HF, SD1 and SD2); estado de ansiedade e autopercepção vocal, foram investigados, antes e após a intervenção. Resultados: O grupo EM alcançou ótimo relaxamento do trato vocal. Os grupos NM e CG apresentaram mudanças em F1. Prática meditativa, de longa duração, está associado com grande diferença em F3, SDNN e SD2 na variação da frequência cardíaca. Conclusão: Os resultados sugerem que prática meditativa influencia a expressão vocal e reação emocional, e que a experiência em prática meditativa favorece esta relação. (AU)
Introduction: The voice is an indicator of emotional states, influenced by factors such as vagal tone, breathing and heart rate variability. This study explores these factors and their relationship with emotional regulation and meditative practice as a self-regulation technique. Purpose: To investigate the difference in vocal characteristics and heart rate variability in experienced (EM) and novice (NM) meditators before and after a meditation practice and in non-meditators - control group (CG), before and after a control test. Methods: 3 x 2 quasi-factorial study. Three groups were evaluated (experienced meditators EM; novice meditators NM; and control group CG, non-meditators) at two points in the experimental manipulation - before and after a meditation session for meditators, and before and after a word search task for the control group. The fundamental frequency, jitter, shimmer, harmonic-to-noise ratio and the first (F1), second (F2) and third (F3) formants of the vowel [a]; heart rate variation (SDNN, RMSSD, LF/HF, SD1 and SD2); anxiety state and vocal self-perception, were investigated, before and after the intervention. Results: The EM group achieved optimal vocal tract relaxation. The NM and CG groups showed changes in F1. Long-term meditative practice was associated with a large difference in F3, SDNN and SD2 in heart rate variation. Conclusion: The results suggest that meditation practice influences vocal expression and emotional reaction, and that experience in meditation practice favors this relationship. (AU)
Introducción: La voz es un indicador de los estados emocionales, influida por factores como el tono vagal, la respiración y la variabilidad de la frecuencia cardiaca. Este estudio explora estos factores y su relación con la regulación emocional y la práctica de la meditación. Objetivo: Investigar la diferencia en las características vocales y variabilidad de la frecuencia cardiaca en meditadores experimentados (EM) y novatos (NM) antes y después de una práctica de meditación y en no meditadores - grupo control (GC), antes y después de una prueba control. Métodos: Estudio cuasi-factorial 3 x 2. Se evaluaron tres grupos (meditadores experimentados EM; meditadores novatos NM; y grupo control CG, no meditadores) en dos momentos - antes y después de una sesión de meditación para los meditadores, y antes y después de una tarea de búsqueda de palabras para el grupo control. Se investigaron la frecuencia fundamental, jitter, shimmer, relación armónico-ruido y los formantes primero (F1), segundo (F2) y tercero (F3) de la vocal [a]; la variación de la frecuencia cardiaca (SDNN, RMSSD, LF/HF, SD1 y SD2); el estado de ansiedad y autopercepción vocal, antes y después de la intervención. Resultados: El grupo EM consiguió una relajación óptima del tracto vocal. Los grupos NM y CG mostraron cambios en F1. La práctica de meditación a largo plazo se asocia con una gran diferencia en F3, SDNN y SD2 en la variación de la frecuencia cardiaca. Conclusión: Los resultados sugieren que la práctica de meditación influye en la expresión vocal y reacción emocional. (AU)
Assuntos
Humanos , Masculino , Feminino , Adulto , Voz , Meditação , Regulação Emocional , Estudos Controlados Antes e Depois , Reconhecimento de Voz/fisiologiaRESUMO
The technology of robot-assisted prostate seed implantation has developed rapidly. However, during the process, there are some problems to be solved, such as non-intuitive visualization effects and complicated robot control. To improve the intelligence and visualization of the operation process, a voice control technology of prostate seed implantation robot in augmented reality environment was proposed. Initially, the MRI image of the prostate was denoised and segmented. The three-dimensional model of prostate and its surrounding tissues was reconstructed by surface rendering technology. Combined with holographic application program, the augmented reality system of prostate seed implantation was built. An improved singular value decomposition three-dimensional registration algorithm based on iterative closest point was proposed, and the results of three-dimensional registration experiments verified that the algorithm could effectively improve the three-dimensional registration accuracy. A fusion algorithm based on spectral subtraction and BP neural network was proposed. The experimental results showed that the average delay of the fusion algorithm was 1.314 s, and the overall response time of the integrated system was 1.5 s. The fusion algorithm could effectively improve the reliability of the voice control system, and the integrated system could meet the responsiveness requirements of prostate seed implantation.
Assuntos
Algoritmos , Realidade Aumentada , Imageamento por Ressonância Magnética , Redes Neurais de Computação , Próstata , Neoplasias da Próstata , Robótica , Humanos , Masculino , Robótica/instrumentação , Imageamento por Ressonância Magnética/métodos , Neoplasias da Próstata/diagnóstico por imagem , Próstata/diagnóstico por imagem , Imageamento Tridimensional , Voz , Procedimentos Cirúrgicos Robóticos/instrumentação , Procedimentos Cirúrgicos Robóticos/métodos , Holografia/métodos , Holografia/instrumentação , Braquiterapia/instrumentação , Reprodutibilidade dos TestesRESUMO
OBJECTIVE: The vocal biomarkers market was worth $1.9B in 2021 and is projected to exceed $5.1B by 2028, for a compound annual growth rate of 15.15%. The investment growth demonstrates a blossoming interest in voice and artificial intelligence (AI) as it relates to human health. The objective of this study was to map the current landscape of start-ups utilizing voice as a biomarker in health-tech. DATA SOURCES: A comprehensive search for start-ups was conducted using Google, LinkedIn, Twitter, and Facebook. A review of the research was performed using company website, PubMed, and Google Scholar. REVIEW METHODS: A 3-pronged approach was taken to thoroughly map the landscape. First, an internet search was conducted to identify current start-ups focusing on products relating to voice as a biomarker of health. Second, Crunchbase was utilized to collect financial and organizational information. Third, a review of the literature was conducted to analyze publications associated with the identified start-ups. RESULTS: A total of 27 start-up start-ups with a focus in the utilization of AI for developing biomarkers of health from the human voice were identified. Twenty-four of these start-ups garnered $178,808,039 in investments. The 27 start-ups published 194 publications combined, 128 (66%) of which were peer reviewed. CONCLUSION: There is growing enthusiasm surrounding voice as a biomarker in health-tech. Academic drive may complement commercialization to best achieve progress in this arena. More research is needed to accurately capture the entirety of the field, including larger industry players, academic institutions, and non-English content.
Assuntos
Biomarcadores , Voz , Humanos , Voz/fisiologia , Inteligência ArtificialRESUMO
OBJECTIVE: This study investigated whether artificial intelligence (AI) models combining voice signals, demographics, and structured medical records can detect glottic neoplasm from benign voice disorders. METHODS: We used a primary dataset containing 2-3 s of vowel "ah", demographics, and 26 items of structured medical records (e.g., symptoms, comorbidity, smoking and alcohol consumption, vocal demand) from 60 patients with pathology-proved glottic neoplasm (i.e., squamous cell carcinoma, carcinoma in situ, and dysplasia) and 1940 patients with benign voice disorders. The validation dataset comprised data from 23 patients with glottic neoplasm and 1331 patients with benign disorders. The AI model combined convolutional neural networks, gated recurrent units, and attention layers. We used 10-fold cross-validation (training-validation-testing: 8-1-1) and preserved the percentage between neoplasm and benign disorders in each fold. RESULTS: Results from the AI model using voice signals reached an area under the ROC curve (AUC) value of 0.631, and additional demographics increased this to 0.807. The highest AUC of 0.878 was achieved when combining voice, demographics, and medical records (sensitivity: 0.783, specificity: 0.816, accuracy: 0.815). External validation yielded an AUC value of 0.785 (voice plus demographics; sensitivity: 0.739, specificity: 0.745, accuracy: 0.745). Subanalysis showed that AI had higher sensitivity but lower specificity than human assessment (p < 0.01). The accuracy of AI detection with additional medical records was comparable with human assessment (82% vs. 83%, p = 0.78). CONCLUSIONS: Voice signal alone was insufficient for AI differentiation between glottic neoplasm and benign voice disorders, but additional demographics and medical records notably improved AI performance and approximated the prediction accuracy of humans. LEVEL OF EVIDENCE: NA Laryngoscope, 134:4585-4592, 2024.
Assuntos
Inteligência Artificial , Glote , Neoplasias Laríngeas , Distúrbios da Voz , Humanos , Neoplasias Laríngeas/diagnóstico , Glote/fisiopatologia , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/fisiopatologia , Demografia , Adulto , Carcinoma de Células Escamosas/diagnóstico , Curva ROC , Voz/fisiologia , Diagnóstico Diferencial , Redes Neurais de ComputaçãoRESUMO
Voice disorders resulting from various pathological vocal fold conditions or postoperative recovery of laryngeal cancer surgeries, are common causes of dysphonia. Here, we present a self-powered wearable sensing-actuation system based on soft magnetoelasticity that enables assisted speaking without relying on the vocal folds. It holds a lightweighted mass of approximately 7.2 g, skin-alike modulus of 7.83 × 105 Pa, stability against skin perspiration, and a maximum stretchability of 164%. The wearable sensing component can effectively capture extrinsic laryngeal muscle movement and convert them into high-fidelity and analyzable electrical signals, which can be translated into speech signals with the assistance of machine learning algorithms with an accuracy of 94.68%. Then, with the wearable actuation component, the speech could be expressed as voice signals while circumventing vocal fold vibration. We expect this approach could facilitate the restoration of normal voice function and significantly enhance the quality of life for patients with dysfunctional vocal folds.
Assuntos
Distúrbios da Voz , Voz , Dispositivos Eletrônicos Vestíveis , Humanos , Prega Vocal/fisiologia , Qualidade de Vida , Voz/fisiologiaRESUMO
In recent years, there has been a notable rise in the number of patients afflicted with laryngeal diseases, including cancer, trauma, and other ailments leading to voice loss. Currently, the market is witnessing a pressing demand for medical and healthcare products designed to assist individuals with voice defects, prompting the invention of the artificial throat (AT). This user-friendly device eliminates the need for complex procedures like phonation reconstruction surgery. Therefore, in this review, we will initially give a careful introduction to the intelligent AT, which can act not only as a sound sensor but also as a thin-film sound emitter. Then, the sensing principle to detect sound will be discussed carefully, including capacitive, piezoelectric, electromagnetic, and piezoresistive components employed in the realm of sound sensing. Following this, the development of thermoacoustic theory and different materials made of sound emitters will also be analyzed. After that, various algorithms utilized by the intelligent AT for speech pattern recognition will be reviewed, including some classical algorithms and neural network algorithms. Finally, the outlook, challenge, and conclusion of the intelligent AT will be stated. The intelligent AT presents clear advantages for patients with voice impairments, demonstrating significant social values.
Assuntos
Faringe , Voz , Humanos , Som , Algoritmos , Redes Neurais de ComputaçãoRESUMO
PURPOSE: This cross-sectional study aimed to investigate the potential of voice analysis as a prescreening tool for type II diabetes mellitus (T2DM) by examining the differences in voice recordings between non-diabetic and T2DM participants. METHODS: 60 participants diagnosed as non-diabetic (n = 30) or T2DM (n = 30) were recruited on the basis of specific inclusion and exclusion criteria in Iran between February 2020 and September 2023. Participants were matched according to their year of birth and then placed into six age categories. Using the WhatsApp application, participants recorded the translated versions of speech elicitation tasks. Seven acoustic features [fundamental frequency, jitter, shimmer, harmonic-to-noise ratio (HNR), cepstral peak prominence (CPP), voice onset time (VOT), and formant (F1-F2)] were extracted from each recording and analyzed using Praat software. Data was analyzed with Kolmogorov-Smirnov, two-way ANOVA, post hoc Tukey, binary logistic regression, and student t tests. RESULTS: The comparison between groups showed significant differences in fundamental frequency, jitter, shimmer, CPP, and HNR (p < 0.05), while there were no significant differences in formant and VOT (p > 0.05). Binary logistic regression showed that shimmer was the most significant predictor of the disease group. There was also a significant difference between diabetes status and age, in the case of CPP. CONCLUSIONS: Participants with type II diabetes exhibited significant vocal variations compared to non-diabetic controls.
Assuntos
Diabetes Mellitus Tipo 2 , Voz , Humanos , Qualidade da Voz , Acústica da Fala , Diabetes Mellitus Tipo 2/complicações , Estudos Transversais , Medida da Produção da Fala , AcústicaRESUMO
PURPOSE: Arytenoid adduction as an addition to medialisation thyroplasty is highly advocated by some surgeons in selected cases but deemed less necessary by others in patients with unilateral vocal fold paralysis. This study aims to evaluate the additional benefits on voice outcome of arytenoid adduction in patients with unilateral vocal fold paralysis undergoing medialisation thyroplasty using intra-operative voice measurements. DESIGN/METHODS: A prospective study was conducted. Voice audio recordings were obtained at 4 moments; 1. direct prior to the start of surgery, 2. during surgery after medialisation thyroplasty, 3. during surgery after medialisation and arytenoid adduction, 3 months postoperative. At these same timepoints patients rated their own voice on a numeric rating scale between 0 and 10. The blinded recordings were rated by consensus in a team of experienced listeners, using the Grade of the GRBAS scale. Furthermore, the Voice Handicap Index was administered before and at 3 months after surgery. RESULTS: Ten patients who underwent medialisation and arytenoid adduction at our tertiary referral hospital between 2021 and 2022, were included. One patient was excluded after surgery. The intraoperative measurements showed a Grade score of 1.4 preoperatively, improving to 1.2 after medialisation, 1.2 after medialisation and arytenoid adduction, and further improving to 0.4 at 3 months postoperative, which was a not statistically significant improvement (p = 0.2). The intraoperative subjective numeric rating scale showed a statistically significant improvement from 3.9 preoperatively, to 6.1 after medialisation, 7.1 after medialisation and arytenoid adduction and a 7.6 at 3 months postoperative (p = 0.001). The Voice Handicap Index total score showed a statistically significant improvement from 71 points before surgery to 13 at 3 months after surgery (p = 0.008). CONCLUSIONS: Our study using intraoperative voice measurements indicate that the addition of arytenoid adduction to medialisation thyroplasty is a benefit in selected patients although more studies are needed due to the many limitations inherent to this field of investigation.
Assuntos
Laringoplastia , Paralisia das Pregas Vocais , Voz , Humanos , Estudos Prospectivos , Qualidade da Voz , Paralisia das Pregas Vocais/cirurgia , Cartilagem Aritenoide/cirurgia , Resultado do TratamentoRESUMO
OBJECTIVE: This study aimed to investigate the impact of the implant's vertical location during Type 1 Thyroplasty (T1T) on acoustics and glottal aerodynamics using excised canine larynx model, providing insights into the optimal technique for treating unilateral vocal fold paralysis (UVFP). METHODS: Measurements were conducted in six excised canine larynges using Silastic implants. Two implant locations, glottal and infraglottal, were tested for each larynx at low and high subglottal pressure levels. Acoustic and intraglottal flow velocity field measurements were taken to assess vocal efficiency (VE), cepstral peak prominence (CPP), and the development of intraglottal vortices. RESULTS: The results indicated that the implant's vertical location significantly influenced vocal efficiency (p = 0.045), with the infraglottal implant generally yielding higher VE values. The effect on CPP was not statistically significant (p = 0.234). Intraglottal velocity field measurements demonstrated larger glottal divergence angles and stronger vortices with the infraglottal implant. CONCLUSION: The findings suggest that medializing the paralyzed fold at the infraglottal level rather than the glottal level can lead to improved vocal efficiency. The observed larger divergence angles and stronger intraglottal vortices with infraglottal medialization may enhance voice outcomes in UVFP patients. These findings have important implications for optimizing T1T procedures and improving voice quality in individuals with UVFP. Further research is warranted to validate these results in clinical settings.
Assuntos
Laringoplastia , Laringe , Paralisia das Pregas Vocais , Voz , Humanos , Animais , Cães , Laringe/cirurgia , Glote/cirurgia , Paralisia das Pregas Vocais/cirurgia , Acústica , Prega Vocal/cirurgiaRESUMO
This Viewpoint discusses the need to create standards for audiomics to identify unique audio biomarkers of health and diseasenow possible because of more efficient voice data analysis available through the use of artificial intelligence (AI)and to improve patient care.
Assuntos
Inteligência Artificial , Biomarcadores , Voz , HumanosRESUMO
OBJECTIVE: This study evaluated the swallowing and voice function of laryngeal cancer patients after Supracricoid Partial Laryngectomy(SCPL), and its influence on quality of life to provide a reference for the selection of surgical methods for laryngeal cancer patients. METHODS: Twenty-one patients who received SCPL between April 2015 and November 2021 were included. Each patient's swallowing function and quality of life were assessed through fiberoptic endoscopic examination of swallowing (FEES) and the M.D. Anderson Dysphagia Inventory (MDADI). Fundamental, jitter, shimmer, maximum phonation time (MPT), and voice handicap index-10 (VHI-10) were performed to assess voice function and voice-related quality of life. RESULTS: The results of the FEES of the 21 patients were as follows: the rates of pharyngeal residue after swallowing solid, semiliquid, and liquid food were 0%, 28.57%, and 38.09%, respectively; the rates of laryngeal infiltration after swallowing solid, semiliquid, and liquid food were 0%, 28.57%, and 4.76%, respectively; and aspiration did not occur in any of the patients. In the evaluation of swallowing quality of life, the mean total MDADI score was 92.6 ± 6.32. The voice function evaluation showed that the mean F0, jitter, shimmer, and MPT values were 156.01 ± 120.87 (HZ), 11.57 ± 6.21 (%), 35.37 ± 14.16 (%) and 7.85 ± 6.08 (s), respectively. The mean total VHI-10 score was 7.14 ± 4.84. CONCLUSION: SCPL provides patients with satisfactory swallowing and voice function. The patients in this study were satisfied with their quality of life in terms of swallowing and voice. SCPL can be used as a surgical method to preserve laryngeal function in patients with laryngeal cancer.
Assuntos
Neoplasias Laríngeas , Voz , Humanos , Laringectomia/efeitos adversos , Laringectomia/métodos , Deglutição , Neoplasias Laríngeas/cirurgia , Qualidade de VidaRESUMO
BACKGROUND: Embodied conversational agents (ECAs) are computer-generated animated humanlike characters that interact with users through verbal and nonverbal behavioral cues. They are increasingly used in a range of fields, including health care. OBJECTIVE: This scoping review aims to identify the current practice in the development and evaluation of ECAs for chronic diseases. METHODS: We applied a methodological framework in this review. A total of 6 databases (ie, PubMed, Embase, CINAHL, ACM Digital Library, IEEE Xplore Digital Library, and Web of Science) were searched using a combination of terms related to ECAs and health in October 2023. Two independent reviewers selected the studies and extracted the data. This review followed the PRISMA-ScR (Preferred Reporting Items of Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) statement. RESULTS: The literature search found 6332 papers, of which 36 (0.57%) met the inclusion criteria. Among the 36 studies, 27 (75%) originated from the United States, and 28 (78%) were published from 2020 onward. The reported ECAs covered a wide range of chronic diseases, with a focus on cancers, atrial fibrillation, and type 2 diabetes, primarily to promote screening and self-management. Most ECAs were depicted as middle-aged women based on screenshots and communicated with users through voice and nonverbal behavior. The most frequently reported evaluation outcomes were acceptability and effectiveness. CONCLUSIONS: This scoping review provides valuable insights for technology developers and health care professionals regarding the development and implementation of ECAs. It emphasizes the importance of technological advances in the embodiment, personalized strategy, and communication modality and requires in-depth knowledge of user preferences regarding appearance, animation, and intervention content. Future studies should incorporate measures of cost, efficiency, and productivity to provide a comprehensive evaluation of the benefits of using ECAs in health care.
Assuntos
Fibrilação Atrial , Diabetes Mellitus Tipo 2 , Voz , Pessoa de Meia-Idade , Humanos , Feminino , Comunicação , Doença CrônicaRESUMO
OBJECTIVES: Medialization thyroplasty (MT) using various implants has been employed as a corrective procedure for unilateral vocal fold paralysis (UVFP). A newly developed APrevent® vocal implant system (VOIS) offers an innovative solution with a finely adjustable design. This study aimed to investigate the long-term functional voice outcomes and benefits of postoperative adjustments in patients receiving MT using the VOIS-implant. METHODS: This is a prospective case series study at single tertiary medical center. Fourteen adult patients diagnosed with UVFP received MT with the VOIS implant and were followed up for more than 1 year. Implant adjustment procedure by injecting 0.9% physiological saline solution was performed both during and after the surgery to optimize glottal closure and voice quality. Objective voice outcomes and acoustic parameters were assessed preoperatively and postoperatively at various timepoints. RESULTS: Thirteen patients (93%) received intraoperative balloon adjustment, ranging from 0.05to 0.12 ml. Four patients underwent adjustments postoperatively and exhibited a positive trend towards immediately improving acoustic voice quality. Our long-term results demonstrated a notable improvement after the surgery in voice quality, with significant decreases in VHI-30 and improvements in perceptual parameters of GRBAS scale, acoustic measures such as jitter and signal-to-noise ratio (p < 0.001) and cepstral peak prominence smoothed in sustained vowel and short sentences. The voice outcomes remained stable more than 1 year follow-up. CONCLUSIONS: Overall, MT with VOIS implantation provides a favorable long-term outcomes and stability in voice quality for patients with UVFP and also an effective tool for postoperative adjustment without major revision surgeries.
Assuntos
Laringoplastia , Paralisia das Pregas Vocais , Voz , Adulto , Humanos , Laringoplastia/métodos , Prega Vocal/cirurgia , Paralisia das Pregas Vocais/cirurgia , Qualidade da Voz , Resultado do TratamentoRESUMO
OBJECTIVE: Voice feminizing surgery is frequently needed for transgender female patients. Among several surgical options, Wendler glottoplasty (WG) and laser reduction glottoplasty (LRG) are two endoscopic procedures. However, because a single procedure may not produce sufficient benefit, the two surgeries may sometimes be sequentially performed. This study was carried out to present the voice results of such sequential surgeries. METHODS: This is an individual retrospective cohort study, performed at a tertiary referral center, that is a university hospital. 18 transgender patients were treated with WG initially and then underwent LRG; 17 had LRG first then WG. All 35 cases were performed during a 15-year period and followed for at least 1 year postoperatively. Voice Handicap Index (VHI-30), transsexual voice questionnaire (TVQ), and acoustic analysis with /a/ and running speech were obtained pre- and postoperatively. RESULTS: VHI and TVQ improved significantly postoperatively (p < 0.05). Their preoperative, first, and second postoperative mean sF0 were 146, 175, and 215 Hz, respectively; these differences were statistically significant (p < 0.001). Their postoperative mean jitter percent, shimmer percent, noise to harmonic ratio (NHR), cepstral peak prominence (CPP), and cepstral spectral index of dysphonia (CSID) worsened significantly compared to preop values (p < 0.05); however, mean postoperative acoustic results were still within normal limits. Patients' self-ratings of their postsurgery voices revealed all feminine, leading to a patient gratification score of 100%. CONCLUSION: If transgender female patients are unsatisfied with their voice after WG or LRG, the addition of the alternative procedure may significantly feminize their voice. Sequential WG and LRG is a successful surgical option for voice feminization. LEVEL OF EVIDENCE: 4 Laryngoscope, 134:1133-1138, 2024.
Assuntos
Qualidade da Voz , Voz , Masculino , Humanos , Feminino , Feminização/cirurgia , Estudos Retrospectivos , Acústica da Fala , Resultado do Tratamento , LasersRESUMO
OBJECTIVES: Vocal process granulomas (VPGs) are benign laryngeal lesions that may manifest as ulcerated regions of the vocal fold or nodular polypoid lesions. Gold standard treatments for idiopathic VPG are yet to be established at this time. This study evaluated clinical decision-making and outcomes in the treatment of VPG patients based on experiences of academic laryngologists across the United States. METHODS: A 21-question survey was developed to evaluate each respondent's specific VPG patient population, clinical decision-making in treating VPG, and corresponding treatment outcomes. The survey was distributed to 168 laryngologists at academic institutions across the United States. Data were analyzed through the Qualtrics platform. RESULTS: A total of 106 responses were analyzed, with a completion rate of 63.1%. Etiology of VPG was most commonly attributed to phonotrauma (96.2%) and reflux (71.8%). Primary first-line treatment was most commonly antireflux medications (92%). Other common first line treatments included voice therapy (58.8%) and inhaled steroids (42.5%). With these treatments, the majority of laryngologists report that recurrence is uncommon (68.4%). Dysphonia was cited as the most frequent long-term sequelae at 27.8%. CONCLUSIONS: VPG treatment strategies continue to be controversial across the United States with many treatments described in the literature with variable application in the practice of academic laryngologists today. Based on survey results, antireflux medications and voice therapy may be the most widely used and most effective treatment options. Establishment of gold standard therapy for VPG as well as further research into recurrent or persistent VPG despite antireflux and voice therapy should be explored. LEVEL OF EVIDENCE: 5 Laryngoscope, 134:795-802, 2024.