Pesquisa | BVS - MINISTÉRIO DA SAÚDE

Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.

Low, Daniel M; Rao, Vishwanatha; Randolph, Gregory; Song, Phillip C; Ghosh, Satrajit S.

PLOS Digit Health ; 3(5): e0000516, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38814939

RESUMO

Detecting voice disorders from voice recordings could allow for frequent, remote, and low-cost screening before costly clinical visits and a more invasive laryngoscopy examination. Our goals were to detect unilateral vocal fold paralysis (UVFP) from voice recordings using machine learning, to identify which acoustic variables were important for prediction to increase trust, and to determine model performance relative to clinician performance. Patients with confirmed UVFP through endoscopic examination (N = 77) and controls with normal voices matched for age and sex (N = 77) were included. Voice samples were elicited by reading the Rainbow Passage and sustaining phonation of the vowel "a". Four machine learning models of differing complexity were used. SHapley Additive exPlanations (SHAP) was used to identify important features. The highest median bootstrapped ROC AUC score was 0.87 and beat clinician's performance (range: 0.74-0.81) based on the recordings. Recording durations were different between UVFP recordings and controls due to how that data was originally processed when storing, which we can show can classify both groups. And counterintuitively, many UVFP recordings had higher intensity than controls, when UVFP patients tend to have weaker voices, revealing a dataset-specific bias which we mitigate in an additional analysis. We demonstrate that recording biases in audio duration and intensity created dataset-specific differences between patients and controls, which models used to improve classification. Furthermore, clinician's ratings provide further evidence that patients were over-projecting their voices and being recorded at a higher amplitude signal than controls. Interestingly, after matching audio duration and removing variables associated with intensity in order to mitigate the biases, the models were able to achieve a similar high performance. We provide a set of recommendations to avoid bias when building and evaluating machine learning models for screening in laryngology.

Identifying bias in models that detect vocal fold paralysis from audio recordings using explainable machine learning and clinician ratings.

Low, Daniel M; Rao, Vishwanatha; Randolph, Gregory; Song, Phillip C; Ghosh, Satrajit S.

medRxiv ; 2024 Mar 20.

Artigo em Inglês | MEDLINE | ID: mdl-33501466

RESUMO

Introduction: Detecting voice disorders from voice recordings could allow for frequent, remote, and low-cost screening before costly clinical visits and a more invasive laryngoscopy examination. Our goals were to detect unilateral vocal fold paralysis (UVFP) from voice recordings using machine learning, to identify which acoustic variables were important for prediction to increase trust, and to determine model performance relative to clinician performance. Methods: Patients with confirmed UVFP through endoscopic examination (N=77) and controls with normal voices matched for age and sex (N=77) were included. Voice samples were elicited by reading the Rainbow Passage and sustaining phonation of the vowel "a". Four machine learning models of differing complexity were used. SHapley Additive explanations (SHAP) was used to identify important features. Results: The highest median bootstrapped ROC AUC score was 0.87 and beat clinician's performance (range: 0.74 - 0.81) based on the recordings. Recording durations were different between UVFP recordings and controls due to how that data was originally processed when storing, which we can show can classify both groups. And counterintuitively, many UVFP recordings had higher intensity than controls, when UVFP patients tend to have weaker voices, revealing a dataset-specific bias which we mitigate in an additional analysis. Conclusion: We demonstrate that recording biases in audio duration and intensity created dataset-specific differences between patients and controls, which models used to improve classification. Furthermore, clinician's ratings provide further evidence that patients were over-projecting their voices and being recorded at a higher amplitude signal than controls. Interestingly, after matching audio duration and removing variables associated with intensity in order to mitigate the biases, the models were able to achieve a similar high performance. We provide a set of recommendations to avoid bias when building and evaluating machine learning models for screening in laryngology.

Identifying Medications Underlying Communication Atypicalities in Psychotic and Affective Disorders: A Pharmacovigilance Study Within the FDA Adverse Event Reporting System.

Fusaroli, Michele; Simonsen, Arndis; Borrie, Stephanie A; Low, Daniel M; Parola, Alberto; Raschi, Emanuel; Poluzzi, Elisabetta; Fusaroli, Riccardo.

J Speech Lang Hear Res ; 66(9): 3242-3259, 2023 09 13.

Artigo em Inglês | MEDLINE | ID: mdl-37524118

RESUMO

PURPOSE: Communication atypicalities are considered promising markers of a broad range of clinical conditions. However, little is known about the mechanisms and confounders underlying them. Medications might have a crucial, relatively unknown role both as potential confounders and offering an insight on the mechanisms at work. The integration of regulatory documents with disproportionality analyses provides a more comprehensive picture to account for in future investigations of communication-related markers. The aim of this study was to identify a list of drugs potentially associated with communicative atypicalities within psychotic and affective disorders. METHOD: We developed a query using the Medical Dictionary for Regulatory Activities to search for communicative atypicalities within the FDA Adverse Event Reporting System (updated June 2021). A Bonferroni-corrected disproportionality analysis (reporting odds ratio) was separately performed on spontaneous reports involving psychotic, affective, and non-neuropsychiatric disorders, to account for the confounding role of different underlying conditions. Drug-adverse event associations not already reported in the Side Effect Resource database of labeled adverse drug reactions (unexpected) were subjected to further robustness analyses to account for expected biases. RESULTS: A list of 291 expected and 91 unexpected potential confounding medications was identified, including drugs that may irritate (inhalants) or desiccate (anticholinergics) the larynx, impair speech motor control (antipsychotics), or induce nodules (acitretin) or necrosis (vascular endothelial growth factor receptor inhibitors) on vocal cords; sedatives and stimulants; neurotoxic agents (anti-infectives); and agents acting on neurotransmitter pathways (dopamine agonists). CONCLUSIONS: We provide a list of medications to account for in future studies of communication-related markers in affective and psychotic disorders. The current test case illustrates rigorous procedures for digital phenotyping, and the methodological tools implemented for large-scale disproportionality analyses can be considered a road map for investigations of communication-related markers in other clinical populations. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.23721345.

Assuntos

Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Farmacovigilância , Estados Unidos , Humanos , Sistemas de Notificação de Reações Adversas a Medicamentos , Fator A de Crescimento do Endotélio Vascular , United States Food and Drug Administration , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/epidemiologia , Bases de Dados Factuais , Transtornos do Humor , Comunicação

Dissociating COVID-19 from other respiratory infections based on acoustic, motor coordination, and phonemic patterns.

Talkar, Tanya; Low, Daniel M; Simpkin, Andrew J; Ghosh, Satrajit; O'Keeffe, Derek T; Quatieri, Thomas F.

Sci Rep ; 13(1): 1567, 2023 01 28.

Artigo em Inglês | MEDLINE | ID: mdl-36709368

RESUMO

In the face of the global pandemic caused by the disease COVID-19, researchers have increasingly turned to simple measures to detect and monitor the presence of the disease in individuals at home. We sought to determine if measures of neuromotor coordination, derived from acoustic time series, as well as phoneme-based and standard acoustic features extracted from recordings of simple speech tasks could aid in detecting the presence of COVID-19. We further hypothesized that these features would aid in characterizing the effect of COVID-19 on speech production systems. A protocol, consisting of a variety of speech tasks, was administered to 12 individuals with COVID-19 and 15 individuals with other viral infections at University Hospital Galway. From these recordings, we extracted a set of acoustic time series representative of speech production subsystems, as well as their univariate statistics. The time series were further utilized to derive correlation-based features, a proxy for speech production motor coordination. We additionally extracted phoneme-based features. These features were used to create machine learning models to distinguish between the COVID-19 positive and other viral infection groups, with respiratory- and laryngeal-based features resulting in the highest performance. Coordination-based features derived from harmonic-to-noise ratio time series from read speech discriminated between the two groups with an area under the ROC curve (AUC) of 0.94. A longitudinal case study of two subjects, one from each group, revealed differences in laryngeal based acoustic features, consistent with observed physiological differences between the two groups. The results from this analysis highlight the promise of using nonintrusive sensing through simple speech recordings for early warning and tracking of COVID-19.

Assuntos

COVID-19 , Humanos , COVID-19/diagnóstico , Fala/fisiologia , Acústica , Ruído , Medida da Produção da Fala/métodos

Deep neural networks reveal topic-level representations of sentences in medial prefrontal cortex, lateral anterior temporal lobe, precuneus, and angular gyrus.

Acunzo, David J; Low, Daniel M; Fairhall, Scott L.

Neuroimage ; 251: 119005, 2022 05 01.

Artigo em Inglês | MEDLINE | ID: mdl-35176493

RESUMO

When reading a sentence, individual words can be combined to create more complex meaning. In this study, we sought to uncover brain regions that reflect the representation of the meaning of sentences at the topic level, as opposed to the meaning of their individual constituent words when considered irrespective of their context. Using fMRI, we recorded the neural activity of participants while reading sentences. We constructed a topic-level sentence representations using the final layer of a convolutional neural network (CNN) trained to classify Wikipedia sentences into broad semantic categories. This model was contrasted with word-level sentence representations constructed using the average of the word embeddings constituting the sentence. Using representational similarity analysis, we found that the medial prefrontal cortex, lateral anterior temporal lobe, precuneus, and angular gyrus more strongly represent sentence topic-level, compared to word-level, meaning, uncovering the important role of these semantic system regions in the representation of topic-level meaning. Results were comparable when sentence meaning was modelled with a multilayer perceptron that was not sensitive to word order within a sentence, suggesting that the learning objective, in the terms of the topic being modelled, is the critical factor in capturing these neural representational spaces.

Assuntos

Idioma , Lobo Temporal , Mapeamento Encefálico , Humanos , Imageamento por Ressonância Magnética , Redes Neurais de Computação , Lobo Parietal , Córtex Pré-Frontal/diagnóstico por imagem , Semântica , Lobo Temporal/diagnóstico por imagem

It's quality and quantity: the effect of the amount of comments on online suicidal posts.

Low, Daniel M; Zuromski, Kelly L; Kessler, Daniel; Ghosh, Satrajit S; Nock, Matthew; Dempsey, Walter.

Proc Conf Empir Methods Nat Lang Process ; 2021: 95-103, 2021 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-35224567

RESUMO

Every day, individuals post suicide notes on social media asking for support, resources, and reasons to live. Some posts receive few comments while others receive many. While prior studies have analyzed whether specific responses are more or less helpful, it is not clear if the quantity of comments received is beneficial in reducing symptoms or in keeping the user engaged with the platform and hence with life. In the present study, we create a large dataset of users' first r/SuicideWatch (SW) posts from Reddit (N=21,274), collect the comments as well as the user's subsequent posts (N=1,615,699) to determine whether they post in SW again in the future. We use propensity score stratification, a causal inference method for observational data, and estimate whether the amount of comments -as a measure of social support- increases or decreases the likelihood of posting again on SW. One hypothesis is that receiving more comments may decrease the likelihood of the user posting in SW in the future, either by reducing symptoms or because comments from untrained peers may be harmful. On the contrary, we find that receiving more comments increases the likelihood a user will post in SW again. We discuss how receiving more comments is helpful, not by permanently relieving symptoms since users make another SW post and their second posts have similar mentions of suicidal ideation, but rather by reinforcing users to seek support and remain engaged with the platform. Furthermore, since receiving only 1 comment -the most common case- decreases the likelihood of posting again by 14% on average depending on the time window, it is important to develop systems that encourage more commenting.

Natural Language Processing Reveals Vulnerable Mental Health Support Groups and Heightened Health Anxiety on Reddit During COVID-19: Observational Study.

Low, Daniel M; Rumker, Laurie; Talkar, Tanya; Torous, John; Cecchi, Guillermo; Ghosh, Satrajit S.

J Med Internet Res ; 22(10): e22635, 2020 10 12.

Artigo em Inglês | MEDLINE | ID: mdl-32936777

RESUMO

BACKGROUND: The COVID-19 pandemic is impacting mental health, but it is not clear how people with different types of mental health problems were differentially impacted as the initial wave of cases hit. OBJECTIVE: The aim of this study is to leverage natural language processing (NLP) with the goal of characterizing changes in 15 of the world's largest mental health support groups (eg, r/schizophrenia, r/SuicideWatch, r/Depression) found on the website Reddit, along with 11 non-mental health groups (eg, r/PersonalFinance, r/conspiracy) during the initial stage of the pandemic. METHODS: We created and released the Reddit Mental Health Dataset including posts from 826,961 unique users from 2018 to 2020. Using regression, we analyzed trends from 90 text-derived features such as sentiment analysis, personal pronouns, and semantic categories. Using supervised machine learning, we classified posts into their respective support groups and interpreted important features to understand how different problems manifest in language. We applied unsupervised methods such as topic modeling and unsupervised clustering to uncover concerns throughout Reddit before and during the pandemic. RESULTS: We found that the r/HealthAnxiety forum showed spikes in posts about COVID-19 early on in January, approximately 2 months before other support groups started posting about the pandemic. There were many features that significantly increased during COVID-19 for specific groups including the categories "economic stress," "isolation," and "home," while others such as "motion" significantly decreased. We found that support groups related to attention-deficit/hyperactivity disorder, eating disorders, and anxiety showed the most negative semantic change during the pandemic out of all mental health groups. Health anxiety emerged as a general theme across Reddit through independent supervised and unsupervised machine learning analyses. For instance, we provide evidence that the concerns of a diverse set of individuals are converging in this unique moment of history; we discovered that the more users posted about COVID-19, the more linguistically similar (less distant) the mental health support groups became to r/HealthAnxiety (ρ=-0.96, P<.001). Using unsupervised clustering, we found the suicidality and loneliness clusters more than doubled in the number of posts during the pandemic. Specifically, the support groups for borderline personality disorder and posttraumatic stress disorder became significantly associated with the suicidality cluster. Furthermore, clusters surrounding self-harm and entertainment emerged. CONCLUSIONS: By using a broad set of NLP techniques and analyzing a baseline of prepandemic posts, we uncovered patterns of how specific mental health problems manifest in language, identified at-risk users, and revealed the distribution of concerns across Reddit, which could help provide better resources to its millions of users. We then demonstrated that textual analysis is sensitive to uncover mental health complaints as they appear in real time, identifying vulnerable groups and alarming themes during COVID-19, and thus may have utility during the ongoing pandemic and other world-changing events such as elections and protests.

Assuntos

Ansiedade/diagnóstico , Ansiedade/epidemiologia , Infecções por Coronavirus/epidemiologia , Saúde Mental/estatística & dados numéricos , Processamento de Linguagem Natural , Pneumonia Viral/epidemiologia , Grupos de Autoajuda/estatística & dados numéricos , Mídias Sociais/estatística & dados numéricos , Adolescente , Adulto , Ansiedade/psicologia , Betacoronavirus , Transtorno da Personalidade Borderline/epidemiologia , Transtorno da Personalidade Borderline/psicologia , COVID-19 , Feminino , Saúde Global , Humanos , Masculino , Pessoa de Meia-Idade , Pandemias , SARS-CoV-2 , Transtornos de Estresse Pós-Traumáticos/epidemiologia , Transtornos de Estresse Pós-Traumáticos/psicologia , Ideação Suicida , Adulto Jovem

Automated assessment of psychiatric disorders using speech: A systematic review.

Low, Daniel M; Bentley, Kate H; Ghosh, Satrajit S.

Laryngoscope Investig Otolaryngol ; 5(1): 96-116, 2020 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-32128436

RESUMO

OBJECTIVE: There are many barriers to accessing mental health assessments including cost and stigma. Even when individuals receive professional care, assessments are intermittent and may be limited partly due to the episodic nature of psychiatric symptoms. Therefore, machine-learning technology using speech samples obtained in the clinic or remotely could one day be a biomarker to improve diagnosis and treatment. To date, reviews have only focused on using acoustic features from speech to detect depression and schizophrenia. Here, we present the first systematic review of studies using speech for automated assessments across a broader range of psychiatric disorders. METHODS: We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines. We included studies from the last 10 years using speech to identify the presence or severity of disorders within the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). For each study, we describe sample size, clinical evaluation method, speech-eliciting tasks, machine learning methodology, performance, and other relevant findings. RESULTS: 1395 studies were screened of which 127 studies met the inclusion criteria. The majority of studies were on depression, schizophrenia, and bipolar disorder, and the remaining on post-traumatic stress disorder, anxiety disorders, and eating disorders. 63% of studies built machine learning predictive models, and the remaining 37% performed null-hypothesis testing only. We provide an online database with our search results and synthesize how acoustic features appear in each disorder. CONCLUSION: Speech processing technology could aid mental health assessments, but there are many obstacles to overcome, especially the need for comprehensive transdiagnostic and longitudinal studies. Given the diverse types of data sets, feature extraction, computational methodologies, and evaluation criteria, we provide guidelines for both acquiring data and building machine learning models with a focus on testing hypotheses, open science, reproducibility, and generalizability. LEVEL OF EVIDENCE: 3a.

The right hemisphere's contribution to discourse processing: A study in temporal lobe epilepsy.

Lomlomdjian, Carolina; Múnera, Claudia P; Low, Daniel M; Terpiluk, Verónica; Solís, Patricia; Abusamra, Valeria; Kochen, Silvia.

Brain Lang ; 171: 31-41, 2017 08.

Artigo em Inglês | MEDLINE | ID: mdl-28478355

RESUMO

OBJECTIVE: Discourse skills - in which the right hemisphere has an important role - enables verbal communication by selecting contextually relevant information and integrating it coherently to infer the correct meaning. However, language research in epilepsy has focused on single word analysis related mainly to left hemisphere processing. The purpose of this study was to investigate discourse abilities in patients with right lateralized medial temporal lobe epilepsy (RTLE) by comparing their performance to that of patients with left temporal lobe epilepsy (LTLE). METHODS: 74 pharmacoresistant temporal lobe epilepsy (TLE) patients were evaluated: 34 with RTLE and 40 with LTLE. Subjects underwent a battery of tests that measure comprehension and production of conversational and narrative discourse. Disease related variables and general neuropsychological data were evaluated. RESULTS: The RTLE group presented deficits in interictal conversational and narrative discourse, with a disintegrated speech, lack of categorization and misinterpretation of social meaning. LTLE group, on the other hand, showed a tendency to lower performance in logical-temporal sequencing. SIGNIFICANCE: RTLE patients showed discourse deficits which have been described in right hemisphere damaged patients due to other etiologies. Medial and anterior temporal lobe structures appear to link semantic, world knowledge, and social cognition associated areas to construct a contextually related coherent meaning.

Assuntos

Epilepsia do Lobo Temporal/fisiopatologia , Epilepsia do Lobo Temporal/psicologia , Lateralidade Funcional/fisiologia , Fala , Adolescente , Adulto , Idade de Início , Feminino , Humanos , Imageamento por Ressonância Magnética , Masculino , Semântica , Lobo Temporal/patologia , Lobo Temporal/fisiopatologia

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA